BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (279 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P77414 Putative colanic acid biosynthesis glycosyl tran... 588 e-167 UniRef50_A8FXL4 Glycosyltransferase involved in cell wall biogen... 140 6e-32 UniRef50_C8QGU2 Glycosyl transferase family 2 n=1 Tax=Pantoea sp... 138 2e-31 UniRef50_A6TBF1 Putative glucuronosyltransferase n=1 Tax=Klebsie... 135 2e-30 UniRef50_A7M6Z6 Glycosyl transferase family 2 n=1 Tax=Klebsiella... 133 5e-30 UniRef50_Q9X4C3 Glycosyltransferase WcaN n=2 Tax=Enterobacteriac... 133 7e-30 UniRef50_C6X6S7 Glycosyl transferase family 2 n=2 Tax=Methylophi... 131 2e-29 UniRef50_D0XH48 Putative uncharacterized protein n=1 Tax=Vibrio ... 126 9e-28 UniRef50_C9Y383 Putative uncharacterized protein n=1 Tax=Cronoba... 119 2e-25 UniRef50_Q1ZRU0 Glycosyltransferase involved in cell wall biogen... 119 2e-25 UniRef50_C1DM76 Glycosyl transferase, family 2 protein n=1 Tax=A... 119 2e-25 UniRef50_C5A2Y9 Glycosyltransferase, family 2 n=1 Tax=Thermococc... 116 1e-24 UniRef50_B2JT44 Glycosyl transferase family 2 n=1 Tax=Burkholder... 115 2e-24 UniRef50_C9K187 Putative glycosyltransferase n=1 Tax=Klebsiella ... 113 7e-24 UniRef50_D2ETY9 Glycosyl transferase, family 2 protein n=1 Tax=B... 110 6e-23 UniRef50_B0C0T9 Glycosyl transferase, group 2 family protein n=1... 106 9e-22 UniRef50_Q8U2R3 Glycosyl transferase n=2 Tax=Pyrococcus RepID=Q8... 105 2e-21 UniRef50_C2SSX0 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 105 2e-21 UniRef50_Q2SBM9 Glycosyltransferase involved in cell wall biogen... 103 7e-21 UniRef50_A0B7S5 Glycosyl transferase, family 2 n=1 Tax=Methanosa... 102 1e-20 UniRef50_D2Q6R3 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 102 2e-20 UniRef50_B0JRA6 Glycosyl transferase family 2 n=1 Tax=Microcysti... 94 4e-18 UniRef50_C3WFL5 Glycosyl transferase n=4 Tax=Bacteria RepID=C3WF... 94 6e-18 UniRef50_C4XII1 Putative glycosyltransferase n=1 Tax=Desulfovibr... 93 1e-17 UniRef50_A6H2F4 Glycosyl transferase, group 2 family protein n=2... 93 1e-17 UniRef50_Q74D09 Glycosyl transferase, group 2 family protein n=1... 92 1e-17 UniRef50_Q8PZ42 Glycosyltransferase n=1 Tax=Methanosarcina mazei... 92 2e-17 UniRef50_Q15RB6 Glycosyl transferase, family 2 n=1 Tax=Pseudoalt... 91 3e-17 UniRef50_A8H2F1 Glycosyl transferase family 2 n=1 Tax=Shewanella... 91 5e-17 UniRef50_Q2NH10 Predicted glycosyltransferase n=3 Tax=cellular o... 91 5e-17 UniRef50_C6X6R1 Glycosyl transferase family 2 n=1 Tax=Methylovor... 91 6e-17 UniRef50_Q5WBN3 Glycosyltransferase n=1 Tax=Bacillus clausii KSM... 91 6e-17 UniRef50_C6J2S1 Glycosyltransferase n=1 Tax=Paenibacillus sp. or... 89 1e-16 UniRef50_D2LU70 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 89 1e-16 UniRef50_A8ZWI7 Glycosyl transferase family 2 n=2 Tax=Desulfobac... 89 1e-16 UniRef50_B7R2A7 Glycosyltransferase n=1 Tax=Thermococcus sp. AM4... 89 2e-16 UniRef50_B0UDC4 Glycosyl transferase family 2 n=1 Tax=Methylobac... 89 2e-16 UniRef50_A4J8Z3 Glycosyl transferase, family 2 n=1 Tax=Desulfoto... 89 2e-16 UniRef50_A6ELC1 TuaG n=3 Tax=Bacteroidetes RepID=A6ELC1_9BACT 89 2e-16 UniRef50_O58167 Putative uncharacterized protein PH0430 n=1 Tax=... 87 5e-16 UniRef50_Q314K2 Glycosyltransferases involved in cell wall bioge... 87 5e-16 UniRef50_B6I8A2 Putative glycosyltransferase n=1 Tax=Escherichia... 87 6e-16 UniRef50_UPI000197ADF7 hypothetical protein BACCOPRO_00943 n=1 T... 87 6e-16 UniRef50_UPI0001B4A4FB putative teichuronic acid biosynthesis gl... 87 6e-16 UniRef50_Q6QW83 Putative glycosyl transferase n=1 Tax=Azospirill... 87 8e-16 UniRef50_A0M6I2 TuaG-like glycosyl transferase n=2 Tax=Flavobact... 87 9e-16 UniRef50_B5YBY2 Glycosyltransferase n=2 Tax=Dictyoglomus thermop... 86 1e-15 UniRef50_C5RMY5 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 86 1e-15 UniRef50_Q9K6L6 Glycosyltransferase n=1 Tax=Bacillus halodurans ... 86 1e-15 UniRef50_Q11C09 Glycosyl transferase, family 2 n=1 Tax=Chelativo... 86 1e-15 UniRef50_A6QB23 Glycosyl transferase n=1 Tax=Sulfurovum sp. NBC3... 86 2e-15 UniRef50_C0CGY4 Putative uncharacterized protein n=1 Tax=Blautia... 86 2e-15 UniRef50_B8D172 Glycosyl transferase family 2 n=4 Tax=Bacteria R... 86 2e-15 UniRef50_B9M559 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 86 2e-15 UniRef50_C4F932 Putative uncharacterized protein n=1 Tax=Collins... 86 2e-15 UniRef50_C0ECJ2 Putative uncharacterized protein n=1 Tax=Clostri... 86 2e-15 UniRef50_C0ED54 Putative uncharacterized protein n=1 Tax=Clostri... 85 2e-15 UniRef50_C6CSD6 Glycosyl transferase family 2 n=1 Tax=Paenibacil... 85 2e-15 UniRef50_A0YSS8 Putative uncharacterized protein n=1 Tax=Lyngbya... 85 3e-15 UniRef50_A8RSJ6 Putative uncharacterized protein n=1 Tax=Clostri... 85 3e-15 UniRef50_C2Y456 Teichuronic acid biosynthesis glycosyl transfera... 84 4e-15 UniRef50_B6YUK0 Glycosyltransferase n=3 Tax=Thermococcaceae RepI... 84 4e-15 UniRef50_A8UI46 Putative uncharacterized protein n=1 Tax=Flavoba... 84 4e-15 UniRef50_D0D9G6 Glycosyl transferase family 2 n=1 Tax=Citreicell... 84 5e-15 UniRef50_Q72XI4 Glycosyl transferase, group 2 family protein n=2... 84 5e-15 UniRef50_C6XYA6 Glycosyl transferase family 2 n=1 Tax=Pedobacter... 84 6e-15 UniRef50_B4VMI0 Glycosyl transferase, group 2 family protein n=2... 84 6e-15 UniRef50_C7TKV7 Glycosyl transferase, group 2 n=5 Tax=Lactobacil... 84 7e-15 UniRef50_A9AAL8 Glycosyl transferase family 2 n=2 Tax=Methanococ... 84 7e-15 UniRef50_Q3A3M0 Glycosyltransferase n=1 Tax=Pelobacter carbinoli... 84 8e-15 UniRef50_A0YST1 Putative uncharacterized protein n=1 Tax=Lyngbya... 83 8e-15 UniRef50_B5WT88 Glycosyl transferase family 2 n=1 Tax=Burkholder... 83 8e-15 UniRef50_C6BYR5 Glycosyl transferase family 2 n=5 Tax=Desulfovib... 83 8e-15 UniRef50_A5ZV71 Putative uncharacterized protein n=2 Tax=Firmicu... 83 9e-15 UniRef50_C6VXV3 Glycosyl transferase family 2 n=1 Tax=Dyadobacte... 83 1e-14 UniRef50_Q8YSL1 Alr3073 protein n=2 Tax=Nostocaceae RepID=Q8YSL1... 83 1e-14 UniRef50_A6LGW8 Glycosyltransferase family 2 n=3 Tax=Bacteroidal... 83 1e-14 UniRef50_B5WL01 Glycosyl transferase family 2 n=2 Tax=Burkholder... 82 1e-14 UniRef50_C6EQF5 Putative uncharacterized protein n=2 Tax=Campylo... 82 1e-14 UniRef50_A4A6G7 Glycosyltransferase n=1 Tax=Congregibacter litor... 82 2e-14 UniRef50_C1DGR7 Glycosyl transferase, family 2 protein n=1 Tax=A... 82 2e-14 UniRef50_A6ETY5 Glycosyltransferase n=1 Tax=unidentified eubacte... 82 2e-14 UniRef50_Q0ACF0 Glycosyl transferase, family 2 n=4 Tax=Ectothior... 82 2e-14 UniRef50_B7AK29 Putative uncharacterized protein n=2 Tax=Bactero... 82 2e-14 UniRef50_Q0VR09 Glycosyl transferase, group 2 family protein, pu... 82 2e-14 UniRef50_B4ECE8 Glycosyltransferase n=3 Tax=Burkholderia cepacia... 82 3e-14 UniRef50_C6IEP2 UDP-galactose phosphate transferase n=2 Tax=Bact... 82 3e-14 UniRef50_A8TT37 Glycosyl transferase, group 2 family protein n=2... 81 3e-14 UniRef50_Q3J960 Glycosyl transferase, family 2 n=2 Tax=Nitrosoco... 81 4e-14 UniRef50_Q67PP6 Glycosyl transferase n=1 Tax=Symbiobacterium the... 81 4e-14 UniRef50_Q314K1 Glycosyltransferases involved in cell wall bioge... 81 4e-14 UniRef50_D0TBE0 Glycosyltransferase family 2 n=2 Tax=Bacteroides... 81 4e-14 UniRef50_C7GBF7 Glycosyltransferase n=1 Tax=Roseburia intestinal... 81 4e-14 UniRef50_A9KVT6 Glycosyl transferase family 2 n=1 Tax=Shewanella... 81 4e-14 UniRef50_A0KUZ3 Glycosyl transferase, family 2 n=7 Tax=Gammaprot... 81 4e-14 UniRef50_UPI0001B495DA glycosyltransferase n=1 Tax=Bacteroides s... 81 4e-14 UniRef50_Q2CBS3 Truncated glycosyltransferase n=1 Tax=Oceanicola... 81 4e-14 UniRef50_A0KM86 Glycosyl transferase, group 2 family protein n=2... 81 5e-14 UniRef50_Q1N7I9 Predicted glycosyltransferase n=1 Tax=Sphingomon... 81 5e-14 UniRef50_Q72QG5 Glycosyltransferase n=2 Tax=Leptospira interroga... 81 5e-14 UniRef50_B0SAI4 Glycosyltransferase n=2 Tax=Leptospira biflexa s... 80 5e-14 UniRef50_B9BSY2 Glycosyl transferase, family 2 n=4 Tax=Proteobac... 80 6e-14 UniRef50_B3E7E3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 80 6e-14 UniRef50_A3VRJ5 Glycosyltransferase n=1 Tax=Parvularcula bermude... 80 6e-14 UniRef50_B8GFE2 Glycosyl transferase family 2 n=1 Tax=Methanosph... 80 6e-14 UniRef50_Q062P8 Putative teichoic acid/polysaccharide glycosyl t... 80 6e-14 UniRef50_Q4A117 Putative glycosyltransferase n=1 Tax=Staphylococ... 80 7e-14 UniRef50_B0TYV9 Putative teichuronic acid biosynthesis glycosyl ... 80 7e-14 UniRef50_A3I1Y0 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1... 80 7e-14 UniRef50_B2GLA0 Putative glycosyltransferase n=1 Tax=Kocuria rhi... 80 7e-14 UniRef50_B1FXK1 Glycosyl transferase family 2 n=2 Tax=Burkholder... 80 8e-14 UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea l... 80 8e-14 UniRef50_A3DF58 Glycosyl transferase, family 2 n=2 Tax=Clostridi... 80 8e-14 UniRef50_B8F9T0 Glycosyl transferase family 2 n=1 Tax=Desulfatib... 80 8e-14 UniRef50_A6BHD3 Putative uncharacterized protein n=1 Tax=Dorea l... 80 8e-14 UniRef50_Q2SD88 Glycosyltransferase involved in cell wall biogen... 80 8e-14 UniRef50_B4WMJ9 Glycosyl transferase, group 2 family protein n=1... 80 9e-14 UniRef50_Q8KI14 Similar to Glycosyl transferase n=1 Tax=Pseudomo... 80 1e-13 UniRef50_Q9XBL5 Beta-D-1,6 Glucosyl transferase n=2 Tax=Gluconac... 80 1e-13 UniRef50_B7MWU8 Glycosyltransferase n=1 Tax=Escherichia coli ED1... 80 1e-13 UniRef50_C5ZW04 Putative glycosyltransferase n=1 Tax=Helicobacte... 80 1e-13 UniRef50_Q64WD3 Glycosyltransferase n=1 Tax=Bacteroides fragilis... 79 1e-13 UniRef50_Q3K8V0 Putative glycosyl transferase n=1 Tax=Pseudomona... 79 1e-13 UniRef50_Q11KM8 Glycosyl transferase, family 2 n=1 Tax=Chelativo... 79 1e-13 UniRef50_D2L8E2 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 79 1e-13 UniRef50_Q1Q4P0 Similar to glycosyltransferase family 2 n=1 Tax=... 79 1e-13 UniRef50_A9GHY3 Glycosyl transferase, family 2 n=4 Tax=Rhodobact... 79 2e-13 UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidob... 79 2e-13 UniRef50_B4RT76 Glycosyl transferase n=1 Tax=Alteromonas macleod... 79 2e-13 UniRef50_B5W037 Glycosyl transferase family 2 n=2 Tax=Arthrospir... 79 2e-13 UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=... 79 2e-13 UniRef50_B4AD21 Glycosyltransferase n=17 Tax=Enterococcus faecal... 79 2e-13 UniRef50_A7B4C1 Putative uncharacterized protein n=1 Tax=Ruminoc... 79 2e-13 UniRef50_A3XGI2 Putative uncharacterized protein n=1 Tax=Leeuwen... 79 2e-13 UniRef50_C9C3N2 Glycosyl transferase n=1 Tax=Enterococcus faeciu... 79 2e-13 UniRef50_C3AEG0 Glycosyl transferase, family 2 n=2 Tax=Bacillus ... 79 2e-13 UniRef50_Q8YSM1 Alr3063 protein n=7 Tax=Cyanobacteria RepID=Q8YS... 79 2e-13 UniRef50_C6E5U5 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 79 2e-13 UniRef50_C0ZCI8 Putative glycosyl transferase n=1 Tax=Brevibacil... 79 2e-13 UniRef50_A7V5D9 Putative uncharacterized protein n=1 Tax=Bactero... 79 2e-13 UniRef50_D0S7H0 Glycosyltransferase family 2 n=1 Tax=Acinetobact... 79 2e-13 UniRef50_B5L3X5 WfeA n=2 Tax=Enterobacteriaceae RepID=B5L3X5_ECOLX 78 3e-13 UniRef50_C0CGZ7 Putative uncharacterized protein n=1 Tax=Blautia... 78 3e-13 UniRef50_B7BDW0 Putative uncharacterized protein n=1 Tax=Parabac... 78 3e-13 UniRef50_C0QK40 Putative glycosyl transferase (Group 2 family pr... 78 3e-13 UniRef50_B4VWC8 Glycosyl transferase, group 2 family protein, pu... 78 3e-13 UniRef50_Q8DGK0 Tlr2317 protein n=2 Tax=Thermosynechococcus elon... 78 3e-13 UniRef50_A6H0Q4 Glycosyl transferase, group 2 family protein n=1... 78 3e-13 UniRef50_A8GFU6 Glycosyl transferase family 2 n=1 Tax=Serratia p... 78 4e-13 UniRef50_Q3A667 Glycosyltransferase n=1 Tax=Pelobacter carbinoli... 78 4e-13 UniRef50_Q1PUV4 Putative uncharacterized protein n=1 Tax=Candida... 78 4e-13 UniRef50_B7KAG8 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 78 4e-13 UniRef50_B1ZW34 Glycosyl transferase family 2 n=1 Tax=Opitutus t... 78 4e-13 UniRef50_C3RR62 Glycosyl transferase n=3 Tax=Bacteria RepID=C3RR... 78 4e-13 UniRef50_C0FV27 Putative uncharacterized protein n=1 Tax=Rosebur... 77 4e-13 UniRef50_Q1ZNV1 Glucosyltransferase protein n=1 Tax=Photobacteri... 77 5e-13 UniRef50_A9EDG2 Putative uncharacterized protein n=1 Tax=Kordia ... 77 5e-13 UniRef50_UPI00017449A6 glycosyl transferase n=1 Tax=Verrucomicro... 77 5e-13 UniRef50_Q8YYD5 All0915 protein n=6 Tax=Cyanobacteria RepID=Q8YY... 77 5e-13 UniRef50_Q2YCB6 Glycosyl transferase, family 2 n=1 Tax=Nitrososp... 77 5e-13 UniRef50_Q21E22 B-glycosyltransferase-like protein n=2 Tax=Alter... 77 5e-13 UniRef50_C0H3N9 CpsF n=2 Tax=Pantoea RepID=C0H3N9_ERWST 77 5e-13 UniRef50_B5L3F2 WfgD n=3 Tax=Enterobacteriaceae RepID=B5L3F2_ECOLX 77 5e-13 UniRef50_D0MF25 Glycosyl transferase family 2 n=1 Tax=Rhodotherm... 77 6e-13 UniRef50_B4WF86 Glycosyl transferase, group 2 family protein n=1... 77 6e-13 UniRef50_B9M0H9 Glycosyl transferase family 2 n=3 Tax=Proteobact... 77 6e-13 UniRef50_C6MKP2 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 77 6e-13 UniRef50_B0CGB5 Glycosyl transferase, family 2, putative n=1 Tax... 77 6e-13 UniRef50_C6X8A5 Glycosyl transferase family 2 n=1 Tax=Methylovor... 77 6e-13 UniRef50_B8HRB2 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 77 6e-13 UniRef50_C7XEA2 Teichuronic acid biosynthesis glycosyltransferas... 77 7e-13 UniRef50_C1SJ24 Glycosyl transferase n=1 Tax=Denitrovibrio aceti... 77 7e-13 UniRef50_Q11L33 Glycosyl transferase, family 2 n=1 Tax=Chelativo... 77 7e-13 UniRef50_C1F2T6 Glycosyl transferase, group 2 family n=1 Tax=Aci... 77 7e-13 UniRef50_A4EK02 Glycosyl transferase, family 2 n=1 Tax=Roseobact... 77 8e-13 UniRef50_D1PXL6 Putative uncharacterized protein n=1 Tax=Prevote... 77 8e-13 UniRef50_C0YGQ4 Possible glycosyl transferase family protein n=1... 77 8e-13 UniRef50_Q47GL6 Glycosyl transferase, family 2 n=1 Tax=Dechlorom... 77 9e-13 UniRef50_C2P8I0 Family 2 glycosyltransferase n=10 Tax=Bacteria R... 77 9e-13 UniRef50_B1Y722 Glycosyl transferase family 2 n=1 Tax=Leptothrix... 76 1e-12 UniRef50_B1C0P6 Putative uncharacterized protein n=1 Tax=Clostri... 76 1e-12 UniRef50_Q6MBF6 Putative glycosyltransferase n=1 Tax=Candidatus ... 76 1e-12 UniRef50_B6JCX3 Glycosyl transferase, family 2 n=1 Tax=Oligotrop... 76 1e-12 UniRef50_B3PF93 Glycosyl transferase, putative, gt2I n=1 Tax=Cel... 76 1e-12 UniRef50_B8HRA4 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 76 1e-12 UniRef50_A4BIW6 Glycosyltransferase n=1 Tax=Reinekea blandensis ... 76 1e-12 UniRef50_D0QYM3 AcbB n=1 Tax=Avibacterium paragallinarum RepID=D... 76 1e-12 UniRef50_A4G480 Putative Glycosyl transferase, family 2 n=1 Tax=... 76 1e-12 UniRef50_B7JW84 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 76 1e-12 UniRef50_B0S8S5 Glycosyltransferase n=2 Tax=Leptospira biflexa s... 76 1e-12 UniRef50_A4Y8H1 Glycosyl transferase, family 2 n=1 Tax=Shewanell... 76 1e-12 UniRef50_A8RE12 Putative uncharacterized protein n=1 Tax=Eubacte... 76 1e-12 UniRef50_D2RDR5 Glycosyl transferase family 2 n=1 Tax=Archaeoglo... 76 2e-12 UniRef50_Q0BQ52 Glycosyltransferase n=1 Tax=Granulibacter bethes... 76 2e-12 UniRef50_A8U9G2 Glycosyltransferase n=1 Tax=Carnobacterium sp. A... 76 2e-12 UniRef50_D0DJH6 Beta(1,3)galactosyltransferase EpsH n=1 Tax=Lact... 76 2e-12 UniRef50_Q3M3J9 Glycosyl transferase, family 2 n=2 Tax=Nostocace... 76 2e-12 UniRef50_Q8L0V4 N-acetylgalactosaminyl-proteoglycan 3-beta-glucu... 75 2e-12 UniRef50_B4S4G2 Glycosyl transferase family 2 n=1 Tax=Prosthecoc... 75 2e-12 UniRef50_A3W9Y7 Glucosyltransferase n=1 Tax=Erythrobacter sp. NA... 75 2e-12 UniRef50_Q8EMF8 Glycosyltransferase n=1 Tax=Oceanobacillus iheye... 75 2e-12 UniRef50_B1I846 Glycosyl transferase, family 2 n=25 Tax=Streptoc... 75 2e-12 UniRef50_A5FN38 Candidate beta-glycosyltransferase; Glycosyltran... 75 2e-12 UniRef50_Q46CV3 Putative uncharacterized protein n=2 Tax=cellula... 75 2e-12 UniRef50_B4U6C1 Glycosyl transferase family 2 n=1 Tax=Hydrogenob... 75 2e-12 UniRef50_C1DRS5 Glycosyl transferase, family 2 n=1 Tax=Azotobact... 75 2e-12 UniRef50_Q9K6R9 BH3658 protein n=5 Tax=Bacillales RepID=Q9K6R9_B... 75 3e-12 UniRef50_Q0HKL3 Glycosyl transferase, family 2 n=1 Tax=Shewanell... 75 3e-12 UniRef50_B4CZV6 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 75 3e-12 UniRef50_A3YA00 Glycosyltransferase n=1 Tax=Marinomonas sp. MED1... 75 3e-12 UniRef50_Q39T63 Glycosyl transferase, family 2 n=2 Tax=Proteobac... 75 3e-12 UniRef50_C3QGL0 Family 2 glycosyltransferase n=3 Tax=Bacteroides... 75 3e-12 UniRef50_C1QAA3 Glycosyl transferase n=1 Tax=Brachyspira murdoch... 75 3e-12 UniRef50_D2R3I2 Glycosyl transferase family 2 n=1 Tax=Pirellula ... 75 3e-12 UniRef50_A3W1Y5 Glycosyltransferase n=1 Tax=Roseovarius sp. 217 ... 75 3e-12 UniRef50_Q7U8H3 Putative glycosyltransferase family 2 protein n=... 75 3e-12 UniRef50_B6BQY7 Glycosyltransferase n=1 Tax=Candidatus Pelagibac... 75 3e-12 UniRef50_D2RIR4 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 75 3e-12 UniRef50_Q5V6E2 Glycosyl transferase n=1 Tax=Haloarcula marismor... 75 3e-12 UniRef50_C6IGZ0 Glycosyl transferase n=1 Tax=Bacteroides sp. 1_1... 75 4e-12 UniRef50_D2QQL1 Glycosyl transferase family 2 n=1 Tax=Spirosoma ... 75 4e-12 UniRef50_A4J4C0 Glycosyl transferase, family 2 n=1 Tax=Desulfoto... 74 4e-12 UniRef50_A5GQE9 Glycosyltransferase n=1 Tax=Synechococcus sp. RC... 74 4e-12 UniRef50_Q6AK17 Related to glycosyl transferase n=1 Tax=Desulfot... 74 4e-12 UniRef50_B7JHV6 Glycosyl transferase, group 2 family protein n=5... 74 4e-12 UniRef50_A1U141 Glycosyl transferase, family 2 n=1 Tax=Marinobac... 74 4e-12 UniRef50_C4KBW6 Glycosyl transferase family 2 n=1 Tax=Thauera sp... 74 4e-12 UniRef50_A1W3G7 Glycosyl transferase, family 2 n=1 Tax=Acidovora... 74 4e-12 UniRef50_C1QAA4 Predicted glycosyltransferase n=1 Tax=Brachyspir... 74 4e-12 UniRef50_A6M2M4 Glycosyl transferase, family 2 n=1 Tax=Clostridi... 74 4e-12 UniRef50_Q116A2 Glycosyl transferase, group 1 n=1 Tax=Trichodesm... 74 5e-12 UniRef50_C0B9A3 Putative uncharacterized protein n=1 Tax=Coproco... 74 5e-12 UniRef50_Q8PXS5 Glycosyltransferase n=1 Tax=Methanosarcina mazei... 74 5e-12 UniRef50_B2UPJ3 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 74 5e-12 UniRef50_D1W2F4 Glycosyltransferase, group 2 family protein n=2 ... 74 5e-12 UniRef50_B3QQK3 Glycosyl transferase family 2 n=1 Tax=Chlorobacu... 74 5e-12 UniRef50_Q46FX2 Putative uncharacterized protein n=1 Tax=Methano... 74 5e-12 UniRef50_D1RR43 Glycosyl transferase family 2 n=1 Tax=Serratia o... 74 5e-12 UniRef50_D1BX31 Glycosyl transferase family 2 n=1 Tax=Xylanimona... 74 6e-12 UniRef50_C6J2R9 Predicted protein n=1 Tax=Paenibacillus sp. oral... 74 6e-12 UniRef50_B9KX37 Putative uncharacterized protein n=1 Tax=Rhodoba... 74 6e-12 UniRef50_D1BX28 Glycosyl transferase family 2 n=1 Tax=Xylanimona... 74 6e-12 UniRef50_B2VFP3 Glycosyl transferase n=1 Tax=Erwinia tasmaniensi... 74 6e-12 UniRef50_A4SE50 Glycosyl transferase, family 2 n=2 Tax=Chlorobiu... 74 7e-12 UniRef50_Q0C2C3 Glycosyl transferase, group 2 family protein n=1... 74 8e-12 UniRef50_A0YLN7 Glycosyl transferase, group 2 family protein n=3... 74 8e-12 UniRef50_Q98NV3 Mlr9727 protein n=1 Tax=Mesorhizobium loti RepID... 74 8e-12 UniRef50_Q8NTE4 Glycosyltransferases involved in cell wall bioge... 74 8e-12 UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 74 8e-12 UniRef50_C3RM81 Glycosyl transferase n=1 Tax=Mollicutes bacteriu... 74 8e-12 >UniRef50_P77414 Putative colanic acid biosynthesis glycosyl transferase wcaA n=98 Tax=Enterobacteriaceae RepID=WCAA_ECOLI Length = 279 Score = 588 bits (1516), Expect = e-167, Method: Compositional matrix adjust. Identities = 279/279 (100%), Positives = 279/279 (100%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP Sbjct: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY Sbjct: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ Sbjct: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK Sbjct: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR Sbjct: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 >UniRef50_A8FXL4 Glycosyltransferase involved in cell wall biogenesis-like protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FXL4_SHESH Length = 317 Score = 140 bits (352), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 98/277 (35%), Positives = 142/277 (51%), Gaps = 23/277 (8%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + + P++S+YMPT NR+ L RA++SVL Q Y N+E+IIVDD S A ++P Sbjct: 29 LDHGPMVSVYMPTHNREHLLKRAVESVLAQTYQNFELIIVDDGSRDGSPDYLQSLASSEP 88 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ + I GACA RN AI A+G+YITG+DDDDE+ PNRL +Q L ++ Y Sbjct: 89 RVRFFCQPIAQGACAARNIAIKEAKGKYITGLDDDDEFLPNRL------EQLLSSYDEKY 142 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPY----SRRLFYKRNIIGNQVFTWAWRFKEC-LFDTE 175 A +VCQG + + + S+ L Y N NQV T R + F+ E Sbjct: 143 A--FVCQGTFWHYGSHKKALDANAMIINLSQMLDY--NYSTNQVLTETKRLQSIGGFNPE 198 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF---YRKHK 232 A QDYD + R++++YG ++ A+ I+H H +I S GY F Y Sbjct: 199 FPACQDYDTWTRLILKYGAAKRIAGASYIIHQGHEGPRIISKSNMQRGYKKFSEEYGSLM 258 Query: 233 DKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNG 269 K +R ++KY + R ++ L SVR G Sbjct: 259 SKGNRINQKYLALV-----SSRKRFKLLDFFQSVRYG 290 >UniRef50_C8QGU2 Glycosyl transferase family 2 n=1 Tax=Pantoea sp. At-9b RepID=C8QGU2_9ENTR Length = 286 Score = 138 bits (347), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 87/265 (32%), Positives = 134/265 (50%), Gaps = 6/265 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+++Y+PT NR L RAI SV Q Y N E+II DD S+ ++ D RI Y Sbjct: 9 PLVTVYIPTHNRSSLVKRAIASVQNQTYRNLEIIICDDGSSDDTEVVIAKMKEQDSRIIY 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + ND GAC RN+ I +A+GE+ITG+DDDD + R++ F+ Q+L FL AN Sbjct: 69 LKNDRPMGACHARNRCIAIAKGEFITGLDDDDYFLNTRIANFVTASQKL-NKPFLCANMI 127 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKAAQDYD 183 + + S + + +N +GNQVF K FD A QDYD Sbjct: 128 FKNRNIEKKWKSYV----GDITLNMMGFKNWVGNQVFIRTSMMKNIGGFDINFPAWQDYD 183 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++ RM+++YG +++++A+ +++I+ +IT+ K + GY F HK R K Sbjct: 184 LWYRMIMQYGSCYRIDDASYVVNIDDDRPRITTGSKAWQGYQKFITTHKPSLSRQHLKSL 243 Query: 244 LFTLYQIRNKRMTWRTLLTLLSVRN 268 F R + ++ S+ N Sbjct: 244 YFQDKLNRKDEINFKEFFHYPSIDN 268 >UniRef50_A6TBF1 Putative glucuronosyltransferase n=1 Tax=Klebsiella pneumoniae subsp. pneumoniae MGH 78578 RepID=A6TBF1_KLEP7 Length = 291 Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 88/279 (31%), Positives = 141/279 (50%), Gaps = 19/279 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALN 58 M PL+S+Y+PT NR +L A+KSV+ Q Y N E+II +D S + + + + Sbjct: 1 MNKKPLVSVYIPTHNRAELLSNAVKSVINQTYKNIEIIICNDGSGDNTDDVVNKLQKEYK 60 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D Y+ ND GAC RN+ I A G Y+TG+DDDD + PNR+ + L+ Sbjct: 61 DIPFVYLKNDSPMGACFSRNRCIAAASGYYLTGLDDDDYFLPNRI-------ESLIKVCN 113 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSP---YSRRLFYKRNIIGNQVFTWAWRFKECL-FDT 174 D VC ++ L + S + Y+ N++GNQ+ T F++ FDT Sbjct: 114 ERKIDLVCSNLIFKDGNKLKRGKNNSGVITSIDMGYE-NLVGNQLLTRLSYFEDVGGFDT 172 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 A QDYD++ R+++++G K EAT ++ + + +I++S K GY F KH + Sbjct: 173 NFPAWQDYDLWYRIIMKFGPCIKTNEATYVMDVENDRKRISTSSKAHIGYRSFIEKHSET 232 Query: 235 FDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 K RNK++ +TLL+ +G+++A Sbjct: 233 LSEEMKSSLFIRDLINRNKKIPLKTLLS-----SGRKVA 266 >UniRef50_A7M6Z6 Glycosyl transferase family 2 n=1 Tax=Klebsiella pneumoniae RepID=A7M6Z6_KLEPN Length = 281 Score = 133 bits (335), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 83/223 (37%), Positives = 120/223 (53%), Gaps = 11/223 (4%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N P +SIY+ T+NR + RAI SV QDY NWE++I DD ST L D R+ Sbjct: 4 NGPKVSIYISTFNRLEKLKRAIDSVFAQDYLNWELLICDDASTDGTFEFSSELELRDERV 63 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL--- 119 Y N+ N GAC RN I A+G++ITG+DDDDE+TPNRLSVFL + ++FL Sbjct: 64 RYFRNESNKGACETRNLGIFNAKGKFITGLDDDDEFTPNRLSVFLKNWDD--RYSFLCCN 121 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKA 178 + N Y E Y + Y+ + N NQ+FT R ++ FD +K Sbjct: 122 FKNKYPTGKEEYYYKIKK---ERCIYNYKDMLFENEASNQIFTLTERLQQIHGFDKTVKR 178 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINH--GEMQITSSPK 219 QD+D +LR+ YG+ + T I++ +H E++++ + K Sbjct: 179 LQDWDTWLRLSFRYGDFVRYNVCTYIMNHDHQPSEIRVSQNEK 221 >UniRef50_Q9X4C3 Glycosyltransferase WcaN n=2 Tax=Enterobacteriaceae RepID=Q9X4C3_ECOLX Length = 287 Score = 133 bits (334), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 86/234 (36%), Positives = 133/234 (56%), Gaps = 18/234 (7%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST-SWEQL-QQYVTALNDPRITY 64 +SIY+PT NR ++ RA+ S+L Q Y N+++++ +D S+ S++Q+ ++Y + +D Y Sbjct: 5 VSIYIPTHNRPKMLARALDSLLTQTYKNFQVLVCNDGSSESYDQIIKEYRSKFSD--FIY 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N GAC RN+ I +A GEYITG+DDDDE+ P+RL+ F+ L +A Y Sbjct: 63 IENQSPMGACCSRNKLINIADGEYITGLDDDDEFLPSRLTDFI-DSPNLDKYA------Y 115 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRL----FYKRNIIGNQVFTWAWRFKEC-LFDTELKAA 179 + G + S L+ + R + RNI+GNQVFT KE FD L A Sbjct: 116 ISAGHI--TKTSSGLFKQKIDERVITLESLLSRNIVGNQVFTKTVFLKESGGFDEHLPAW 173 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 QDYD ++ + + G +++E+ +I+H E +I++S K GY F KHKD Sbjct: 174 QDYDAWINLTKKIGNGYRIEKYNYQWNIDHEEGRISNSSKAEIGYNLFIEKHKD 227 >UniRef50_C6X6S7 Glycosyl transferase family 2 n=2 Tax=Methylophilaceae RepID=C6X6S7_METSD Length = 282 Score = 131 bits (330), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 76/204 (37%), Positives = 118/204 (57%), Gaps = 9/204 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+Y PT NR L I+AI SVL Q + + EMI+++D ST A D R+ ++ Sbjct: 2 LVSVYTPTKNRLDLLIKAIDSVLGQTHQDIEMIVINDGSTDGTAAYLEERAQKDHRLRFV 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL----VTHAFLYA 121 + I+ GA RN AI +A+GE+ITG+DDDD + PNR+ F+A+ Q L + A LY+ Sbjct: 62 NKTISEGAPKARNMAITMAKGEFITGLDDDDAFQPNRVEAFIAYWQLLKNLGLQPACLYS 121 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKAAQ 180 D + + + A + + K + R + N IGNQ+F F E LFD +L A Q Sbjct: 122 QDIIVKNGI----ADVGTHKKGSITYRDMIEFNYIGNQIFAPKQYFVEAGLFDEQLPAWQ 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQI 204 D D+F+R++ ++G + ++ TQ+ Sbjct: 178 DMDMFIRVLKKFGTAYLLDIPTQL 201 >UniRef50_D0XH48 Putative uncharacterized protein n=1 Tax=Vibrio harveyi 1DA3 RepID=D0XH48_VIBHA Length = 278 Score = 126 bits (316), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 84/271 (30%), Positives = 142/271 (52%), Gaps = 13/271 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR-IT 63 L++IY+ T NR QL RAI+S Q Y N E+I+VDD ST + Q + + + Sbjct: 2 KLVTIYIVTHNRSQLLKRAIESAYNQTYKNIEVIVVDDAST--DDTQDIIKQMQATYPLK 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN- 122 Y ND GAC RN A+ A GE ITG+DDDD + +R+ + + T++ + AN Sbjct: 60 YFRNDEGRGACYSRNVALNAASGELITGMDDDDYFGASRVDDLVDAYED--TYSLVCANI 117 Query: 123 -DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKAAQ 180 + E++ + + + + +N++GNQ+ T + K FD + A Q Sbjct: 118 IEVTTDKEIHRKFG----FEQGEFELNNLMYQNLVGNQMLTSVAKLKAIGGFDEAMPAFQ 173 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS-PKKFSGYFHFYRKHKDKFDRAS 239 DYD ++R+V +G+ +K+++ L+ H +I+SS +K G+ F+ KH++K + A Sbjct: 174 DYDTWVRLVDRFGKGYKIDKYNYYLNTEHAGERISSSNDRKTRGFELFFDKHQNKMNAAH 233 Query: 240 KKYQLFTLYQIRNKRMTWRTLLTLLSVRNGK 270 KK Y++ + + T++ L+S N K Sbjct: 234 KKSMEIMRYKVTGEMLPLLTMVRLISRDNYK 264 >UniRef50_C9Y383 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y383_CROTZ Length = 293 Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 80/263 (30%), Positives = 133/263 (50%), Gaps = 30/263 (11%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPRI 62 N LIS+ + T+NR++L RAI+SV+ Q YS E+I++DDCS E+ Q + + + + Sbjct: 13 NELISVIITTFNREELLERAIRSVIAQTYSAVELIVIDDCSN--EKTAQLIEKMRPECEV 70 Query: 63 TYIH-----NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 ++H N+ NSG+ RN+ L+ G ++TG+DDDD + P RLS A ++ +A Sbjct: 71 RFVHFIYERNEKNSGSNFSRNRGYALSHGVFVTGLDDDDYFLPERLSKLAARYEE--RYA 128 Query: 118 FLYANDYVCQGEVYSQPASLPLYPK--SPYSRRLFYK------RNIIGNQVFTWAWRFKE 169 F V PA L + S +R+ N++GNQV T + Sbjct: 129 F-----------VTDSPARLDKKQRTNSQGNRKRIITLQDVLCENVVGNQVLTTREKMLG 177 Query: 170 C-LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY 228 F E+K QD D +++++++YG K +T ++ H +IT KK++ Y Y Sbjct: 178 VGGFSPEIKQQQDRDAWIKLILQYGPGVKYSFSTAMVDAEHSSDRITKKIKKYTSYRKLY 237 Query: 229 RKHKDKFDRASKKYQLFTLYQIR 251 K+++ ++ LF L R Sbjct: 238 FKYREHMSETTRSNNLFQLMSFR 260 >UniRef50_Q1ZRU0 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Photobacterium angustum S14 RepID=Q1ZRU0_PHOAS Length = 275 Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 76/223 (34%), Positives = 118/223 (52%), Gaps = 21/223 (9%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPRITY 64 +S+Y+ T NR +L RA+ SV +Q YSN E+II DD S +WE Y+ + ND + Sbjct: 5 VSVYIATHNRIELLKRALNSVRKQTYSNIEIIICDDGSRDGTWE----YLKSCNDSNLII 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA----HKQQLVTHAFLY 120 + N GAC RN+ I A G ITG+DDDD + R+ F+ +K++ +FLY Sbjct: 61 LRNSEPLGACVSRNRCINAATGRLITGLDDDDYFYLKRIERFIEKYNLYKEK--NFSFLY 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRNIIGNQVFTWAWRFKECLFDTELKA 178 G++ + K+P + K+N +GNQVF + K LFD L A Sbjct: 119 -------GDLIINEQNNKFIRKAPKTVTFKDIVKKNHVGNQVFIEKDKIKSELFDESLPA 171 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 QDYD+++R++ +YG V+ + IL I+H +I+ ++ Sbjct: 172 MQDYDLWIRLIKKYGNAIGVQNSGYILDISHPHERISKKSERL 214 >UniRef50_C1DM76 Glycosyl transferase, family 2 protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DM76_AZOVD Length = 292 Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 83/259 (32%), Positives = 140/259 (54%), Gaps = 12/259 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALN 58 M+ PL+S+Y+PT NR + RA++SVL Q Y+N E+++ DD ST ++E++ + A + Sbjct: 1 MRPAPLVSVYIPTRNRLEKLERALRSVLGQTYANHEILVCDDASTDGTFERISRL--ARS 58 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 + +I Y+ N GAC+ RN I A+GE+ITG+DDDDE+TP+RL L +AF Sbjct: 59 ERKIRYLRNPAPRGACSARNLGIFAARGEFITGLDDDDEFTPDRLETLLDAWDD--GYAF 116 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELK 177 + +N + Q A PL ++ R+L K N+ NQV T R ++ F ++ Sbjct: 117 VCSN-FWSQRPGQRLRAYYPLQAQTFTLRQLLLK-NLATNQVLTRTGRLQDIGGFREGVR 174 Query: 178 AAQDYDIFLRMVVEYGEPW-KVEEATQILHINH-GEMQ-ITSSPKKFSGYFHFYRKHKDK 234 QD+D +LR+ YG + + ++H +H E Q ++ S +++D Sbjct: 175 RLQDWDTWLRLCSGYGGRFHRARTPLYVMHHDHEAEAQRVSRSIALDVALEELCERNRDL 234 Query: 235 FDRASKKYQLFTLYQIRNK 253 +D S++ L ++R + Sbjct: 235 YDARSRRLLLSHARELRGE 253 >UniRef50_C5A2Y9 Glycosyltransferase, family 2 n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A2Y9_THEGJ Length = 304 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 85/240 (35%), Positives = 126/240 (52%), Gaps = 24/240 (10%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT+NR L RAI SVL Q + ++E+I+VDD S + + V ++ND RI Y Sbjct: 4 PTVSVIIPTYNRANLLRRAITSVLNQTFRDFELIVVDDASP--DNTPEVVESINDGRIRY 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV----FLAHKQQL-VTHAFL 119 + NSG RN I A+G++I +DDDDEW P+RL F +Q + V + Sbjct: 62 VRLKKNSGGPVARNTGIKKARGKFIALLDDDDEWLPHRLETQVRKFETLEQNIGVVYGGF 121 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-----LFDT 174 Y YV Q + LP Y + Y + L + N IG+ +EC LFD Sbjct: 122 Y---YVSQQDGRILGKRLPAYKGNVYDKLL--RENFIGSPTLLIR---RECFKRAGLFDP 173 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF-SGYFHFYRKHKD 233 L ++QD+D++LR+ Y + V+E ++ HG QI+ + KK+ G F RKH D Sbjct: 174 NLSSSQDWDMWLRIARHYKFDY-VDEIVAKYYV-HGR-QISFNMKKYIPGRERFIRKHLD 230 >UniRef50_B2JT44 Glycosyl transferase family 2 n=1 Tax=Burkholderia phymatum STM815 RepID=B2JT44_BURP8 Length = 316 Score = 115 bits (287), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 69/217 (31%), Positives = 112/217 (51%), Gaps = 11/217 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPRIT 63 PL+S+Y+PT NR L RA+ SVL+Q Y E+++ DD ST + ++Y+ L + Sbjct: 9 PLVSVYIPTKNRLPLLQRAVASVLQQTYPRIELVVADDGST--DGTREYLDELAASGKCV 66 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + ++ GACA RN AI + GE++TG+DDDD + P R+ F+ ++L TH D Sbjct: 67 AVFLPVSEGACAARNAAISMTTGEFLTGLDDDDYFLPQRIECFVQRWRELSTHG---RTD 123 Query: 124 YVC----QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 ++ + L+ ++ R +G QVF+ RF LFD ++ Sbjct: 124 HIAGLFDSSRWIGRAMERTLFDRACAHPRDLAAACAVGTQVFSIRERFIAAGLFDRSMQV 183 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 QD+D++LRM YGE + T ++ H +I+ Sbjct: 184 WQDWDLWLRMAYRYGEFVGIRACTYVIDATHQHERIS 220 >UniRef50_C9K187 Putative glycosyltransferase n=1 Tax=Klebsiella pneumoniae RepID=C9K187_KLEPN Length = 290 Score = 113 bits (282), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 71/219 (32%), Positives = 111/219 (50%), Gaps = 6/219 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 L+++Y+ T NR +L R + S+ Q + E+I+VDD S L D RI Y Sbjct: 2 ALVTVYITTCNRLELLKRCLNSIKEQTIRDIEIIVVDDNSNDDTALFMKSECEKDTRIKY 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND- 123 + NDIN GAC RN+AI +A EYITG DDDD + +R+ F+ + +L FLY + Sbjct: 62 LRNDINRGACYSRNKAISVASSEYITGCDDDDYFEKDRIKSFVENSDKLEQFVFLYTDSL 121 Query: 124 -YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDY 182 +G A + + K+ S N +GNQVFT ++ LFD L A QD+ Sbjct: 122 WLTSRG---VNKAGINKFAKAIVSSEDLLYFNYVGNQVFTKTSILQKYLFDVNLPAWQDF 178 Query: 183 DIFLRMVVEYG-EPWKVEEATQILHINHGEMQITSSPKK 220 + + R++ + ++ I I+H +I+S ++ Sbjct: 179 ECWFRILRGTNKKALRINSYNYIQDISHPFSRISSGKQE 217 >UniRef50_D2ETY9 Glycosyl transferase, family 2 protein n=1 Tax=Bacteroides sp. D20 RepID=D2ETY9_9BACE Length = 270 Score = 110 bits (274), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 76/238 (31%), Positives = 119/238 (50%), Gaps = 15/238 (6%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDP 60 N PL+S+ + T NR+ IRA+ SV+ Q Y N E+I+VDD S+ + +++ Q + ND Sbjct: 2 NEPLVSVIVTTHNREDNVIRAVNSVINQSYRNIEIIVVDDHSSDDTAKKIGQLIEK-NDN 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 Y D N GACA RN I A G YI G+DDDDE+ P R+ +A V AF+ Sbjct: 61 LFYYALPDPNRGACAARNYGISQAHGMYIAGLDDDDEFVPERIEKMMAAMSDEV--AFVC 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-----FDTE 175 + +C E + + +S N++GNQV KE + FD E Sbjct: 119 SAGVICD-EKNRKKIVRYFFKDRCFSLYDLLWCNVVGNQVLVK----KELILQANGFDVE 173 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 + QD+D++ R++ + + + QI+H++ IT + G + FY K+ + Sbjct: 174 MPCNQDWDMWTRLLKVKPKAIYLTDILQIIHVDKEAPSITKLANRKLGLYKFYCKNGE 231 >UniRef50_B0C0T9 Glycosyl transferase, group 2 family protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C0T9_ACAM1 Length = 337 Score = 106 bits (264), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 86/249 (34%), Positives = 123/249 (49%), Gaps = 35/249 (14%) Query: 1 MKNN-PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 MKN+ PL+S+ +PT+NR L I AI+SVL Q ++E+I+VDD ST + QQ V ++D Sbjct: 1 MKNSKPLVSVIIPTFNRANLLIPAIQSVLNQTIEDFELIVVDDAST--DDTQQKVAGIDD 58 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ YI + N G C RN + AQG+YI +D DDEW PN+L L F Sbjct: 59 SRLRYILQEKNLGECGTRNTGLGAAQGQYIAFLDSDDEWLPNKLEKQLG--------VFR 110 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRR----------LFYKRNIIGNQVFTWAWRFKE 169 A D V G VYS + K R+ L YK N+IG + +E Sbjct: 111 SAPDEV--GVVYSWLQVINDQGKVVRMRKPNIHGDVKDYLIYK-NLIGTP--STVMIKQE 165 Query: 170 CL-----FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF--S 222 C+ FDT L+ D+D++L++ Y ++V L+ +H E S+ Sbjct: 166 CIEPGLQFDTNLRCCGDWDMWLQISKNY--RFEVIPEPLTLYRDHDEDARGSTNHAMVTE 223 Query: 223 GYFHFYRKH 231 G+ F KH Sbjct: 224 GHLVFLDKH 232 >UniRef50_Q8U2R3 Glycosyl transferase n=2 Tax=Pyrococcus RepID=Q8U2R3_PYRFU Length = 298 Score = 105 bits (261), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 78/236 (33%), Positives = 121/236 (51%), Gaps = 18/236 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT NR +L RAI SVL Q + ++E+I+VDD S+ + + ++ D RI Y Sbjct: 4 PTVSVIIPTHNRAKLLKRAIISVLNQTFEDFEIIVVDDASS--DNTPNIIESIKDSRIRY 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-----VTHAFL 119 I + NSGA RN I A+G++I +DDDDEW P +L V + +L V + Sbjct: 62 IRLEKNSGAPTARNIGIKKARGKFIALLDDDDEWLPRKLEVQVKKFNELEKKFGVVYGGF 121 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKEC-LFDTELK 177 Y YV Q LP + + Y L K N IG+ V FK+ LFD +L Sbjct: 122 Y---YVSQQNERIIGKRLPRFRGNVYGELL--KENFIGSPTVLIRRECFKKAGLFDPKLP 176 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 ++QD+D+++R+ Y + V + +++ ++ + K G F KHKD Sbjct: 177 SSQDWDMWVRIARYYYFDYIVAK----YYVHGHQISFNINEKYIPGREIFISKHKD 228 >UniRef50_C2SSX0 Glycosyl transferase family 2 n=1 Tax=Bacillus cereus BDRD-ST196 RepID=C2SSX0_BACCE Length = 300 Score = 105 bits (261), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 66/192 (34%), Positives = 101/192 (52%), Gaps = 8/192 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ LIS+ +PT+NR +L R I SVL Q Y+N+E+IIVDD ST + + V +D Sbjct: 1 MEMEVLISVVVPTYNRSELIKRTIDSVLAQTYTNFELIIVDDAST--DNTEDIVNEYHDD 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI +I + NS RN I ++G+YI +D DDEW PN+L L + + Sbjct: 59 RIKFIKLNENSKGTKPRNMGIKESKGDYIALLDSDDEWLPNKLESQLNFLRAFNDDNMVC 118 Query: 121 ANDYVCQG---EVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTEL 176 D + +G VYS + L+ S +F +N + + ++ + K+ LF+ L Sbjct: 119 FTDLILKGTKKTVYSN--NRDLFENEDISEYIFLGKNWVQTSTYMFSSKLGKQTLFNPTL 176 Query: 177 KAAQDYDIFLRM 188 K QD+D LR+ Sbjct: 177 KKHQDWDFCLRL 188 >UniRef50_Q2SBM9 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SBM9_HAHCH Length = 295 Score = 103 bits (257), Expect = 7e-21, Method: Compositional matrix adjust. Identities = 81/274 (29%), Positives = 135/274 (49%), Gaps = 15/274 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPRIT 63 P + I + T NR +L RA++S + Q Y + E+IIVDD S+ ++ Y+ L + ++ Sbjct: 3 PKVCIVITTKNRLELLKRALRSAINQTYPSIEVIIVDDGSS--DETPYYLKQLEREGKLV 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + N + GAC RN A ++ + IT +DDDD +T +R+ ++LVT A+ Sbjct: 61 AVINQKSLGACKSRNHATKISNAKLITYLDDDDFFTADRI-------EKLVT-AYNETYS 112 Query: 124 YVCQGEVYSQPASLPLY--PKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 +VC + +Y P + + +N G QVFT + K FD L A Q Sbjct: 113 FVCDSHGIYSKGKIRIYSSPSKLITLQDIKIQNYSGIQVFTEREKIIKIGGFDESLDAWQ 172 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+ LR++ +G +++E + I+ +I++S K + GY F +KH + Sbjct: 173 DYDLKLRLIETFGPGFRLEGCSYIVDQESELTRISTSSKAYQGYLQFMKKHHNSLSYYEA 232 Query: 241 KYQ-LFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 Q + LY + K + + S+ N KRLA Sbjct: 233 NCQSINDLYNRKVKIGIIKVIQHTKSISNLKRLA 266 >UniRef50_A0B7S5 Glycosyl transferase, family 2 n=1 Tax=Methanosaeta thermophila PT RepID=A0B7S5_METTP Length = 302 Score = 102 bits (254), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 69/198 (34%), Positives = 104/198 (52%), Gaps = 23/198 (11%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +PT+NR + RAI+SVL Q Y ++E+I+VDD ST + ++ V + +D RI YI Sbjct: 4 VSVILPTYNRAHVVGRAIRSVLEQTYEDFELIVVDDGST--DSTEEVVRSFDDRRIRYIR 61 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL----SVFLAHKQQ---LVTHAFL 119 + N G A RN I +A+GEYI D DDEW P +L VF Q + T +L Sbjct: 62 HKQNKGRSAARNTGIKIAKGEYIAFQDSDDEWLPEKLEEQMEVFKTSSPQVGVVYTGFYL 121 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-----FDT 174 + +D ++Y A++ + Y L K N +G A ECL FD Sbjct: 122 FYDD----KKIYIPSANVKTKDGNIYDELL--KGNFVGTPA---AVVRAECLKKVGMFDE 172 Query: 175 ELKAAQDYDIFLRMVVEY 192 L +D+++F+R+ EY Sbjct: 173 SLHCLEDWELFIRISKEY 190 >UniRef50_D2Q6R3 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q6R3_9BIFI Length = 280 Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 76/250 (30%), Positives = 117/250 (46%), Gaps = 24/250 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT+NR +IKSVL Q + ++E+I+VDD ST + + V A D RI Y Sbjct: 2 PKVSVVIPTFNRADTIGDSIKSVLEQTFKDFEVIVVDDGST--DGTESVVAAFGDSRIKY 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I D N+GACA RN I A GEYI D DD W P +L ++QL + + Sbjct: 60 IMQD-NAGACAARNNGIRHANGEYIAFQDSDDYWMPRKL------ERQLQNISLHDSKID 112 Query: 125 VCQGEVYSQPASLPLYPKSP--------YSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTE 175 +C+ A LP P +S RN + Q + F++ LFD + Sbjct: 113 ICK----MHSAGLPKDEYEPCRQAIEANFSYEYLLGRNFVSTQMILARHELFEDNLFDVK 168 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 + QD+D+ LR++ +Y + +E +I+ + + K Y +K+ D+ Sbjct: 169 IPRFQDWDLMLRLLRDYKLSY-TDEVLVERYISADSIS-QNVHKALEAYSLIEKKYSDEL 226 Query: 236 DRASKKYQLF 245 Y LF Sbjct: 227 SSHRDSYSLF 236 >UniRef50_B0JRA6 Glycosyl transferase family 2 n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JRA6_MICAN Length = 308 Score = 94.4 bits (233), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 65/204 (31%), Positives = 99/204 (48%), Gaps = 31/204 (15%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M PL+SI +P +N + +AI+S Q +N E+I++DDCST + + NDP Sbjct: 1 MNAYPLVSIIIPAYNTESYIAKAIQSAQEQTLTNIEIIVIDDCST--DNTLAIAQSFNDP 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI IHN N G + RN+AI LA+G ++ +D DD + PNRL L Q+ T A L Sbjct: 59 RIIIIHNSENLGVASTRNKAIKLAKGSWLALLDSDDWYEPNRLETLLEVAQK--TQADLV 116 Query: 121 ANDYVC---------------QGEVYSQPASLPLYPKSPYSRRLFYKRNI-------IGN 158 ++D G+ + P L L P S +++ F +++ I N Sbjct: 117 SDDVYLIYDQAPSPWTTVIKMSGQTLAHP--LVLEPISFITKQKFESKSLFVTMTKPIIN 174 Query: 159 QVFTWAWRFKECLFDTELKAAQDY 182 + F A + +D L+ QDY Sbjct: 175 RQFLLAHNLE---YDPTLRQGQDY 195 >UniRef50_C3WFL5 Glycosyl transferase n=4 Tax=Bacteria RepID=C3WFL5_FUSMR Length = 257 Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 70/233 (30%), Positives = 116/233 (49%), Gaps = 22/233 (9%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRITY 64 ISI P +N ++ I SV++Q Y+NWEM+IVDDCS+ S +++YV D RI + Sbjct: 6 ISIITPLYNGEKYIEETILSVIKQTYNNWEMLIVDDCSSDNSPNIVKKYVE--QDKRIKH 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I + NSGA RN+AI +A+GEYI +D DD W +L ++Q+ F+ N+Y Sbjct: 64 IRLEKNSGAAIARNKAIEIAEGEYIAFLDSDDLWKKEKL------EKQI---NFMKENNY 114 Query: 125 VCQGEVYSQPAS-------LPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK 177 Y + L PK P + R + N IG ++ + ++ L+ Sbjct: 115 AISFTEYEEIDEDGRKLNILIRVPKKPVTYRSYLLTNPIGCLTGMYSVKKLGKVYMPNLR 174 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 QD +L+ +++ + + ++E I IN + + F +++ YRK Sbjct: 175 KRQDTGFWLK-ILKLDKAYSLKENLAIYRINSSSLSFKKTD-LFKYHWYLYRK 225 >UniRef50_C4XII1 Putative glycosyltransferase n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XII1_DESMR Length = 308 Score = 92.8 bits (229), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 57/199 (28%), Positives = 95/199 (47%), Gaps = 26/199 (13%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+++ +PT +R +L RA+ S L Q ++ E++IVDD S + + +DPRI Sbjct: 11 GPLVTVIIPTRDRAELLPRAVASALGQTWTALEILIVDDGSV--DDTAGVLAGFDDPRIR 68 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA------HKQQLVTHA 117 + G A RN+A+ A+G+Y+ +D DDEW P + LA H Sbjct: 69 VLRRHTPGGVSAARNEAVRAARGDYLALLDSDDEWLPKKTERQLAYMAATGHAVSQTQEI 128 Query: 118 FLYANDYVCQGEVYSQP------ASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL 171 ++ V ++ +P ASL + SP + ++R +I F Sbjct: 129 WMRGGKRVNPTHIHKKPDGWFFEASLGMCLVSPST--TIFRRGVIEGAGF---------- 176 Query: 172 FDTELKAAQDYDIFLRMVV 190 FD L A +DYD++LR+++ Sbjct: 177 FDESLPACEDYDLWLRIML 195 >UniRef50_A6H2F4 Glycosyl transferase, group 2 family protein n=2 Tax=Flavobacteriales RepID=A6H2F4_FLAPJ Length = 257 Score = 92.8 bits (229), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 71/230 (30%), Positives = 111/230 (48%), Gaps = 10/230 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPR 61 N L+SI PT+N +Q AIKSV Q YS+WE+IIVDDCS + +Q ++ D R Sbjct: 2 NELVSIITPTYNSEQFIAEAIKSVQNQSYSHWEIIIVDDCSKDKTVNIIQNFID--EDHR 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLY 120 I I D NSGA RN AI A+G YI +D DD W P++LS + Q+ + F + Sbjct: 60 IYLIQLDKNSGAGVARNNAINNAKGRYIAFLDSDDLWKPDKLSKQIKFMQRHNIPFTFSF 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + +G++ ++ P K Y + F N IGN + + + + ++ Q Sbjct: 120 YDCINEKGDLLNKRIEAP--KKLSYYQLFFC--NFIGNLTGIYDSNYFGKMAISSIRKRQ 175 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 D+ ++L ++ + V E+ + I + S + YRK Sbjct: 176 DWIVWLTILKKIKTAKPVPESLALYRIRENSIS-ASKVSLLKDNYAIYRK 224 >UniRef50_Q74D09 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74D09_GEOSL Length = 310 Score = 92.4 bits (228), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 69/249 (27%), Positives = 117/249 (46%), Gaps = 35/249 (14%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 +NPL+S+ +P+ NR L I + SV+ Q Y N E+I+VDD S L + R+ Sbjct: 4 HNPLVSVVIPSKNRPDLVIATLVSVIDQTYDNIEIIVVDDGSDM--PLAPLLKDRFSDRV 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL-AHKQQLVTHAFLYA 121 + ND + G RN+ A G+YIT +DDDD W P +L + + A + V +Y Sbjct: 62 LCLRNDQSLGGAVARNRGAQSAHGDYITFLDDDDLWLPKKLEMQVDAFSKHGVDIGVVYC 121 Query: 122 N-DYVCQGEVYSQP--------------ASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR 166 D++C+ E+ + + P +P R+ ++ +++G Sbjct: 122 GFDFLCKEEIVPRQNKFHNDYDLSIAVLSGCPFGSPTPLIRKHYF--DMVGG-------- 171 Query: 167 FKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH 226 FD EL + QD+D+++R+ G + V+E+ + + HG+ T+ KK G Sbjct: 172 -----FDRELPSCQDWDLWIRLSKVCGF-YPVKESLALYRV-HGDQISTNLRKKIDGRKM 224 Query: 227 FYRKHKDKF 235 KH D+ Sbjct: 225 VLAKHYDEI 233 >UniRef50_Q8PZ42 Glycosyltransferase n=1 Tax=Methanosarcina mazei RepID=Q8PZ42_METMA Length = 307 Score = 92.0 bits (227), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 47/99 (47%), Positives = 63/99 (63%), Gaps = 2/99 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT+NR L RAIKSVL Q Y + E+IIVDD ST + ++ V D R+ Y Sbjct: 6 PKVSVIIPTYNRAHLIPRAIKSVLNQTYLDIEIIIVDDSST--DNTEEIVKDFKDRRLKY 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 I ++IN GA A RN I ++G+YI D DDEW N+L Sbjct: 64 IRHNINKGASAARNTGIRESRGKYIAFQDSDDEWFSNKL 102 >UniRef50_Q15RB6 Glycosyl transferase, family 2 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15RB6_PSEA6 Length = 402 Score = 91.3 bits (225), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 63/192 (32%), Positives = 93/192 (48%), Gaps = 10/192 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS--WEQLQQYVTALNDPRIT 63 L SI +P +NR + +A++SVL Q N+E++IVDD ST E ++Q V + +P++ Sbjct: 3 LFSIIIPVFNRTKSLKKALESVLSQSIQNYEVLIVDDGSTPSIAEGIEQLVLSFQEPKLK 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH-KQQLVTHAFLYAN 122 + + IN A RN I A G+YI +D DD W PN+L L + + + FL + Sbjct: 63 LLRHAINKNGAAARNTGINAASGKYICFLDSDDFWLPNKLERALEYINTKTCSDTFLIHH 122 Query: 123 DYVCQGEVYSQPASLPLYPKS---PYSRRLFYKRNIIGNQVFTWA---WRFKECLFDTEL 176 Y E SLP K+ + F N+ G Q T K CLFD Sbjct: 123 QY-SNSENGKLTESLPKAAKASSESVAHYSFVTNNVGGIQSSTICVPTMVAKNCLFDERF 181 Query: 177 KAAQDYDIFLRM 188 QD+D L++ Sbjct: 182 SGHQDWDFALQV 193 >UniRef50_A8H2F1 Glycosyl transferase family 2 n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H2F1_SHEPA Length = 255 Score = 90.9 bits (224), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 46/120 (38%), Positives = 72/120 (60%), Gaps = 7/120 (5%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDP 60 N+PLISI MP +N ++ +I SV+ Q YSNWE+++VDDCST + + ++ + D Sbjct: 5 NSPLISIVMPLFNVEKYVEESIDSVVNQSYSNWELLLVDDCSTDDTCKIIESKFST--DS 62 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV---FLAHKQQLVTHA 117 RI HN++NSGA RN + + G YI +D DD W N+L + F+ + + ++H Sbjct: 63 RIKLFHNEMNSGAGVSRNVGLANSSGIYIAFLDSDDLWCDNKLKLQVSFMLNNKAAISHT 122 >UniRef50_Q2NH10 Predicted glycosyltransferase n=3 Tax=cellular organisms RepID=Q2NH10_METST Length = 776 Score = 90.9 bits (224), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 65/239 (27%), Positives = 113/239 (47%), Gaps = 9/239 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+SI MP +NR+ L AIKSVL Q Y N+E+II+DD S+ ++ + + + D RI Sbjct: 264 PLVSIIMPVYNREGLITNAIKSVLNQTYKNFELIIIDDGSS--DKTCENILKIKDKRIKL 321 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N N G A RN+ + +++G+YI +D D++W +S + + + LY Y Sbjct: 322 IRNKENLGISASRNKGLNISKGKYIMYLDSDNDWDNRYISAMVGAFRVNKDASALYCGQY 381 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVF--TWAWRFKECLFDTELKAAQDY 182 V + +L + ++ L RN I + T K FD L D+ Sbjct: 382 VFE----ESKTNLKYVRFASLNKGLLSNRNYIDLNAYAHTRDVYLKYGGFDESLVKCVDW 437 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 D+ L+ + + + V + + + ITS+P+ + K + + ++ K+ Sbjct: 438 DLILKYST-FAKTYSVPVILSNYYFDLADNTITSNPELSYTIKQVHEKQEKRLEKIGKR 495 >UniRef50_C6X6R1 Glycosyl transferase family 2 n=1 Tax=Methylovorus sp. SIP3-4 RepID=C6X6R1_METSD Length = 298 Score = 90.5 bits (223), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 62/196 (31%), Positives = 98/196 (50%), Gaps = 15/196 (7%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALND 59 ++ LIS+ +PT+NR ++ +AI SVL Q + N E+I+VDD ST S+++LQ ++ ++ Sbjct: 5 QDQTLISVVIPTFNRFEVTCKAIDSVLEQTHQNVEVIVVDDVSTDGSFDRLQALYSSKSN 64 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA-- 117 ++ I N +N+GA RN +A+GE+I +D DD W +L + L Q L A Sbjct: 65 -KVRLIQNPVNAGAATGRNNGADVAKGEFIAFLDSDDIWLKEKLELQLKDYQALTKQANN 123 Query: 118 -FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD--- 173 LY+ + Y + PL +F I G + T W + +FD Sbjct: 124 IILYSPSEFQSPQGYRINPTRPLETNEKVEEYMF----IHGQAIQTSGWFMSKKIFDQVK 179 Query: 174 --TELKAAQDYDIFLR 187 L QDYD +R Sbjct: 180 FTNNLLRHQDYDFVIR 195 >UniRef50_Q5WBN3 Glycosyltransferase n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WBN3_BACSK Length = 271 Score = 90.5 bits (223), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 74/264 (28%), Positives = 113/264 (42%), Gaps = 37/264 (14%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 IS+ +PT+NR+++ +I SVL+Q E+I+VDD ST + Q + ++ + I Sbjct: 4 ISVVIPTYNREKVIKESIDSVLKQTLQPIEIIVVDDFST--DNTLQILKKYDNGTLKVIK 61 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 N GA RN I+ A+G+ I D DD W N+L QL + +C Sbjct: 62 NQFKKGANGARNTGILFAEGDLIAFQDSDDTWDENKL--------QLQLEKLTSEDADIC 113 Query: 127 QGEVYSQPASLP---LYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAAQDY 182 + + LP L + Y R RN I Q + K+ +FD ELK QD+ Sbjct: 114 FCSINTGDRKLPYRTLASEEIY--RQLRIRNFISTQSILIKTAVAKKIMFDEELKRYQDW 171 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEM-------QITSSPKKFSGYFHFYRKHKDKF 235 D LR E+ +I+H H + IT FH +KHK+ Sbjct: 172 DFVLR----------ASESFKIVHCPHALVDIRIQTDSITKKVNNMEALFHLAKKHKELL 221 Query: 236 DRA----SKKYQLFTLYQIRNKRM 255 + + KY+L R K++ Sbjct: 222 EEKPINPALKYKLLMYSSYREKKI 245 >UniRef50_C6J2S1 Glycosyltransferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2S1_9BACL Length = 309 Score = 89.4 bits (220), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 71/251 (28%), Positives = 117/251 (46%), Gaps = 22/251 (8%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 +NP ISI MP +NRQ+ +I +L Q Y+N+E+IIVDD ST + + + D RI Sbjct: 4 HNPKISIIMPVYNRQETIKNSIFCILNQSYTNFELIIVDDAST--DNTSAIILSFEDERI 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL----SVFLAHKQQLVTHAF 118 +I + N GA A RN I ++G++I+ D DD W P++L + ++ +F Sbjct: 62 KHIVLNKNVGAAAARNIGIRQSKGDWISFQDSDDYWEPDKLRKQVELINDRSPSIIFSSF 121 Query: 119 LY----ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL--F 172 + + +Y+ +G Y PK N I F E + F Sbjct: 122 IRYKYGSEEYIPRGRKYVN------LPKEGNLHNQLLLGNFIATPTVLIPKTFLEKVGGF 175 Query: 173 DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS-PKKFSGYFHFYRKH 231 + ++ QD++++LR+ Y W E ++ +++ E I+S K GY + H Sbjct: 176 NEQMPRFQDWELWLRLSAYYPFIWIDEP---LVRVHYTENSISSDMSKMIRGYELIWNLH 232 Query: 232 KDKFDRASKKY 242 K F A +Y Sbjct: 233 KKMFVDAGPEY 243 >UniRef50_D2LU70 Glycosyl transferase family 2 n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LU70_BACS4 Length = 266 Score = 89.4 bits (220), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 63/202 (31%), Positives = 99/202 (49%), Gaps = 7/202 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALND 59 K+ PLISI PT+N + I+SVL Q +WEMIIVDDCST + ++ YV D Sbjct: 12 KSMPLISIITPTYNSENYIADTIESVLAQTLDDWEMIIVDDCSTDNTTSVIRSYVE--KD 69 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ I + NSGA RN AI + G+YI +D DD+W P +L L + Q+ A Sbjct: 70 ERVKLIQLEKNSGAAIARNTAISHSVGKYIAFLDGDDQWLPEKLEKQLQYMQE--KGASF 127 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 Y + + ++ P++ +L K N+IG + ++++ Sbjct: 128 SFTKYKTMDQKGEETGTIVDAPETVNYHQLL-KHNVIGCLTVMLDVEKVGQVKMVDIRSR 186 Query: 180 QDYDIFLRMVVEYGEPWKVEEA 201 QDY ++L + + E ++E Sbjct: 187 QDYVLWLELCKKGFEAHGIQEV 208 >UniRef50_A8ZWI7 Glycosyl transferase family 2 n=2 Tax=Desulfobacteraceae RepID=A8ZWI7_DESOH Length = 289 Score = 89.0 bits (219), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 65/225 (28%), Positives = 111/225 (49%), Gaps = 27/225 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ +PT+NR + A+ SVL QDY E+IIVDD S+ + ++ L + R+T Sbjct: 7 PLVSVVIPTYNRGPMVTEAVASVLAQDYPAIEIIIVDDGSS--DDTPGRLSPLKN-RVTI 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS---VFLAHKQQLVTHAFLYA 121 I + N G A RN + A GEYI +D DD W P ++S F A + + Sbjct: 64 ITQE-NRGVSAARNAGVAHAGGEYIAFLDSDDRWLPEKISTQTAFFASRPDAL------- 115 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRR--LFYKRNIIGNQVFTWAWRFKECL------FD 173 +CQ E PL+P++ + +R + ++ ++ V A ++ FD Sbjct: 116 ---ICQTEETWIKNGKPLFPRARHKKRSGMIFEPSLELCLVSPSAVMMRKDFFLDVGGFD 172 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 L A +DYD++LR+ + + P + E ++ + Q+++ P Sbjct: 173 ESLPACEDYDLWLRISMVH--PVYLIETPLVIKQGGHDGQLSAMP 215 >UniRef50_B7R2A7 Glycosyltransferase n=1 Tax=Thermococcus sp. AM4 RepID=B7R2A7_9EURY Length = 307 Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 64/202 (31%), Positives = 95/202 (47%), Gaps = 23/202 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P ISI +PT+NR + R I+SVL Q + ++E+II+DD ST + + D RI Y Sbjct: 4 PKISIILPTYNRGYIIGRTIRSVLDQTFKDFELIIIDDGSTDNTKNIVEIYKEKDNRIRY 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV-----------FLAHKQQL 113 I + N GA A RN I ++GEYI ID D W P +L + F+ + + + Sbjct: 64 IRHKRNLGANAARNTGIKKSRGEYIAFIDSDTIWKPQKLEIQIKIADNTKENFVIYSRTI 123 Query: 114 --VTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL 171 T + + V QG+ Q L + K+N+I N L Sbjct: 124 RKYTKKTVIVPEDVNQGKKEGQLLDKILVLNFIDMSSILVKKNMIKNS----------GL 173 Query: 172 FDTELKAAQDYDIFLRMVVEYG 193 FD +L QD+DI +R+ +Y Sbjct: 174 FDEKLPRLQDWDILIRLAEKYN 195 >UniRef50_B0UDC4 Glycosyl transferase family 2 n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UDC4_METS4 Length = 324 Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 61/206 (29%), Positives = 94/206 (45%), Gaps = 15/206 (7%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ MP +N RA+ SVL Q ++ E+I+VDD ST + A D R+ + Sbjct: 9 VSVIMPAFNAAATIERALASVLAQSHAALEVIVVDDGSTDATRTLVGSAAARDRRVRLLA 68 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 +D N G A RN A+ A GE++ +D DD W P RL LA F D + Sbjct: 69 SDRNGGPAAARNTALSAATGEWLALLDADDAWRPERLERLLAVAGDRAEVVF----DNLL 124 Query: 127 QGEVYSQPASLPLYPKSPYSRRL--FYKRNIIGNQ--------VFTWAWRFKECL-FDTE 175 + +S PL+P+ P + + + G++ +F A + L FD Sbjct: 125 GLDPHSGAEVGPLFPRLPDAIGVSEMAAAAVPGSRFDYGYLKPIFRRALVARAGLRFDEG 184 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEA 201 L+ ++D FL ++VE G EA Sbjct: 185 LRTSEDLLFFLALLVEGGSARTTSEA 210 >UniRef50_A4J8Z3 Glycosyl transferase, family 2 n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J8Z3_DESRM Length = 297 Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 69/247 (27%), Positives = 120/247 (48%), Gaps = 17/247 (6%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS-WEQLQQYVTALNDPRI 62 N L+S+ +PT+NR++ IR+I+SVL Q Y E+ ++DD ST E++ Q ND ++ Sbjct: 2 NELVSVVIPTYNREKTIIRSIESVLNQTYKAIEVCVIDDASTDKTEKIVQEKYGDND-KV 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ----LVTHAF 118 Y NSGAC RN + +A G+YI +D DD++ P ++ + + +Q + + Sbjct: 61 IYYRLAKNSGACVARNTGVSIAHGKYIAFLDSDDKFVPQKIELQMLKLKQCEADICVSNY 120 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 Y + + VY++P K Y L + +I + F+ FD + Sbjct: 121 KYTDRNKNEKIVYAEPKE----GKQLYD-SLLWCNSITTGTILGKRECFEHVQFDAAMPR 175 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF---HFYRKHKDKF 235 QD+D+ LR+ Y + VE+A ++ + ITSS F Y K++ F Sbjct: 176 YQDWDLALRLSQNYRFCY-VEKA--LMEQEFQTVSITSSTGHKKTLFALDRLYEKNRLGF 232 Query: 236 DRASKKY 242 + ++ + Sbjct: 233 EECNEAF 239 >UniRef50_A6ELC1 TuaG n=3 Tax=Bacteroidetes RepID=A6ELC1_9BACT Length = 269 Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 51/114 (44%), Positives = 64/114 (56%), Gaps = 4/114 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--N 58 M + PL+SI P +N + I+SVL Q YSNWE++IVDD ST + Q V N Sbjct: 12 MVSKPLVSIITPLYNAAPFIEKTIQSVLDQTYSNWELLIVDDAST--DNGLQIVRDFFGN 69 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ 112 DPRI + N IN GA RNQA A G+YI +D DD W N+L L Q+ Sbjct: 70 DPRIKFYPNTINKGAAHCRNQATSEATGDYIAFLDSDDLWHKNKLEKQLKFMQK 123 >UniRef50_O58167 Putative uncharacterized protein PH0430 n=1 Tax=Pyrococcus horikoshii RepID=O58167_PYRHO Length = 334 Score = 87.4 bits (215), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 68/202 (33%), Positives = 104/202 (51%), Gaps = 26/202 (12%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPRIT 63 L+S+ +PT+NR ++ RAI+SVL Q Y+N E+IIVDD S + E ++++ + D RI Sbjct: 5 LVSVVLPTYNRAKVLPRAIESVLNQTYTNIELIIVDDGSRDNTKEIIREFQS--QDERIV 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y+ N N GA A RN IM + GE+I +D DD W P +L +Q + ++ L + Sbjct: 63 YLRNKRNLGANAARNIGIMHSTGEFIAFMDSDDMWLPWKL-----ERQIKIMYSSLNSYP 117 Query: 124 YVCQG--EVYSQPAS---LPLYP-KSP--YSRRLFYKRNIIGNQVFTWA---------WR 166 V G +Y + L YP KSP + N++ N F R Sbjct: 118 IVYSGFIRIYKTDKNKVILSYYPQKSPPVSADSTSIHLNLLTNANFISTPTVLIPALYIR 177 Query: 167 FKECLFDTELKAAQDYDIFLRM 188 + LFD ++ QD++ FLR+ Sbjct: 178 QNKLLFDEKMPRLQDWEFFLRL 199 >UniRef50_Q314K2 Glycosyltransferases involved in cell wall biogenesis-like n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314K2_DESDG Length = 321 Score = 87.4 bits (215), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 50/193 (25%), Positives = 92/193 (47%), Gaps = 4/193 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + P +S+ MP + + A++SVL QD++++E+++VDD S ++ + +D Sbjct: 17 MSDRPAVSVIMPVYKTEAYLAEAVRSVLEQDFTDFELLVVDDGSPG--DVRGILAGFDDA 74 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y+H++ N G RN+ I + G Y+ +D DD W P++LS+ +A + +Y Sbjct: 75 RIRYLHHE-NRGPGYTRNRGIRESTGRYVAFLDSDDAWMPHKLSIQVAALDAAPQYDVVY 133 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTE-LKAA 179 + G + P+ L+ I + F + E L+ + Sbjct: 134 SQRETMDGHGRTVAGYTPVLHSGDVLDELYVDSFICMSSAMVRRGVFAVSGYIAEDLRIS 193 Query: 180 QDYDIFLRMVVEY 192 QDYD +LR+ + Sbjct: 194 QDYDFWLRVACHH 206 >UniRef50_B6I8A2 Putative glycosyltransferase n=1 Tax=Escherichia coli SE11 RepID=B6I8A2_ECOSE Length = 255 Score = 87.0 bits (214), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 39/100 (39%), Positives = 62/100 (62%), Gaps = 2/100 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NPLIS+ MP +N + +++ SVL Q Y N+E+II DD ST ++ ++ + +D RI Sbjct: 3 NPLISVIMPAYNAENTIRKSVLSVLNQSYKNFELIICDDGST--DETKKIILTFSDARIK 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 +I+N GA RN AI + GEY++ +D DD W +++ Sbjct: 61 FINNKYEKGAAGARNSAIDICSGEYVSFLDSDDIWASDKI 100 >UniRef50_UPI000197ADF7 hypothetical protein BACCOPRO_00943 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197ADF7 Length = 309 Score = 87.0 bits (214), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 66/199 (33%), Positives = 98/199 (49%), Gaps = 14/199 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE-QLQQYVTA---LNDP 60 PL+S+ +PT+ R Q RAI SVL Q Y N E+I+VDD + E QL + T ++ Sbjct: 8 PLVSVIIPTYARSQYICRAIDSVLNQTYKNVEVIVVDDNGENTENQLATFQTLKSYIDKE 67 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH-AFL 119 +ITYI + N A RN I A+GEYI +DDDDE+ P ++ + QL A + Sbjct: 68 QITYITHKTNRNGSAARNTGIFNAKGEYICLLDDDDEFFPEKVEKQVQVLNQLDDSWAGV 127 Query: 120 YANDYVCQGEVYSQPASLP--LYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL----FD 173 + N ++ L + P++ Y LF + G + R CL FD Sbjct: 128 FCNSINRTITLHGVKEKLNKIVLPENMYEEFLFCR---AGFGSSSLMLRKSVCLEINGFD 184 Query: 174 TELKAAQDYDIFLRMVVEY 192 T K QD++ R++ +Y Sbjct: 185 TSFKRHQDWEFLTRILRKY 203 >UniRef50_UPI0001B4A4FB putative teichuronic acid biosynthesis glycosyl transferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A4FB Length = 256 Score = 87.0 bits (214), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 46/115 (40%), Positives = 63/115 (54%), Gaps = 3/115 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI P + + I+SVL+Q Y+NWEMIIVDDCS V DPRI I Sbjct: 5 LVSIITPMYKGAEFVGDTIESVLKQTYTNWEMIIVDDCSPDDGAGINVVKRYADPRIKLI 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS---VFLAHKQQLVTHA 117 + IN G+ RN A+ AQG+YI +D DD W L +FL++K ++ + Sbjct: 65 ESKINKGSSGARNIALKEAQGQYIAFLDSDDMWPEEYLESQILFLSNKDVVIAYG 119 >UniRef50_Q6QW83 Putative glycosyl transferase n=1 Tax=Azospirillum brasilense RepID=Q6QW83_AZOBR Length = 296 Score = 86.7 bits (213), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 42/101 (41%), Positives = 59/101 (58%), Gaps = 2/101 (1%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+S+ +P +NR RAI+SV QD S+WE+++VDD ST + + D RI Sbjct: 9 EEPLVSVVIPAFNRAHTLRRAIESVRMQDVSSWELLVVDDGST--DGSVDIPESFGDERI 66 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 I + +N GA A RN + A+G YI +D DDEW P +L Sbjct: 67 RLIRHAVNRGAAAARNTGVAAARGCYIAFLDSDDEWLPGKL 107 >UniRef50_A0M6I2 TuaG-like glycosyl transferase n=2 Tax=Flavobacteriaceae RepID=A0M6I2_GRAFK Length = 261 Score = 86.7 bits (213), Expect = 9e-16, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 103/208 (49%), Gaps = 5/208 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST-SWEQLQQYVTALND 59 M++ L+S+ MP +N + +I+SV+RQ Y NWE++I+DD S+ + +Q+ Q ++ D Sbjct: 1 MEDRGLVSVIMPAYNSEVFITESIQSVIRQTYPNWELLIIDDASSDTTKQIVQKFSS-ED 59 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI + N N+G RN+ I A G +I +D DD+W P +L L + A Sbjct: 60 ERIKLLKNSTNAGTHHSRNKGIKAASGNFIAFLDADDQWKPQKLEKQLKVLSKPNVAACF 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 + + + + Q L P Y + L K N +GN + ++ ++ Sbjct: 120 SSYELINEKGASLQKKIQAL-PVLDYDKLL--KANYVGNLTGIYNVSILGKVYCPDIPKR 176 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHI 207 QD+ ++L+++ E G + E+ + I Sbjct: 177 QDWALWLKVIEEGGPIEGIAESLAVYRI 204 >UniRef50_B5YBY2 Glycosyltransferase n=2 Tax=Dictyoglomus thermophilum H-6-12 RepID=B5YBY2_DICT6 Length = 361 Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 42/97 (43%), Positives = 65/97 (67%), Gaps = 4/97 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ--LQQYVTALNDPRI 62 P +SI +PT+NR++L RAI+SV RQ Y NWE++IVDD S+ + +++Y++ D RI Sbjct: 3 PEVSIIIPTYNREKLLPRAIESVRRQTYENWELLIVDDRSSDNTESLVKKYLSI--DNRI 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWT 99 Y+ N+ G RN I+ ++G+YI +D DDEW+ Sbjct: 61 RYLKNERKKGPAGARNFGILNSKGKYIAFLDSDDEWS 97 >UniRef50_C5RMY5 Glycosyl transferase family 2 n=1 Tax=Clostridium cellulovorans 743B RepID=C5RMY5_CLOCL Length = 358 Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 45/115 (39%), Positives = 65/115 (56%), Gaps = 2/115 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + PL+SI +PT+NR++L +AI SVL Q Y WE+I+VDD ST + + D Sbjct: 1 MNDRPLVSIIIPTYNREKLISKAIDSVLNQTYDKWELIVVDDKSTDNTKKIVMDYSKKDD 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL--SVFLAHKQQL 113 RI Y+ N+ G A RN I G +I +D DDEW N + S++ K+ + Sbjct: 61 RIKYLLNNGKKGPAAARNLGINNCNGNFIAFLDSDDEWFNNHIEESIYFLEKEDV 115 >UniRef50_Q9K6L6 Glycosyltransferase n=1 Tax=Bacillus halodurans RepID=Q9K6L6_BACHD Length = 303 Score = 85.9 bits (211), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 60/187 (32%), Positives = 89/187 (47%), Gaps = 8/187 (4%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ--LQQYVTALNDPRITY 64 +SI +PT NR +L RA++S L Q Y N E+I+V D ST + +Y D R+ + Sbjct: 3 VSIIIPTHNRAKLLKRALESTLNQTYKNIEVIVVSDGSTDNTDIVMDEYK---RDSRVNF 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I G RN I A+GE+I +DDDDEW P++L + + Q +Y Sbjct: 60 ISYHPAKGGNYARNTGIKNAKGEFIAFLDDDDEWMPDKLELQIKEFNQNANVGLVYTGVE 119 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRFKEC-LFDTELKAAQD 181 + L K+ + N IG + V EC +FD +LKA QD Sbjct: 120 IIYNFNKRNIKYYSLPKKTGNLSKEILVANCIGTTSSVMVRKNLITECGMFDEKLKARQD 179 Query: 182 YDIFLRM 188 YD+++R+ Sbjct: 180 YDLWIRV 186 >UniRef50_Q11C09 Glycosyl transferase, family 2 n=1 Tax=Chelativorans sp. BNC1 RepID=Q11C09_MESSB Length = 256 Score = 85.9 bits (211), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 66/197 (33%), Positives = 96/197 (48%), Gaps = 17/197 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--DPRI 62 PL+S+ MP N + AI SV Q Y WE+I++DDCS ++ Q V ++ +PRI Sbjct: 6 PLVSVVMPARNAAPFIMDAISSVCAQTYLAWELIVIDDCSD--DETAQIVEEVSKAEPRI 63 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS---VFLAHKQQLVTHAFL 119 + N GA A RN+A+ LAQG YI +D DD W P +L F+ +T Sbjct: 64 RQLTNYRRLGAAASRNRALDLAQGSYIAFLDSDDFWHPMKLDRQLQFIKASGTAITFG-- 121 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTW-AWRFKECLFDTELKA 178 DY+ + QP P+S R+ + N IGN + R ++ F E Sbjct: 122 ---DYI-RMRPDGQPIGSVRAPESTDYHRML-RSNFIGNLTAVYKKSRLQDLRF--ENVG 174 Query: 179 AQDYDIFLRMVVEYGEP 195 A+DY +L+ + GEP Sbjct: 175 AEDYLFWLQALQRLGEP 191 >UniRef50_A6QB23 Glycosyl transferase n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QB23_SULNB Length = 1335 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 64/209 (30%), Positives = 104/209 (49%), Gaps = 21/209 (10%) Query: 5 PLISIYMPTWNRQQLAIR-AIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL SI MP +N ++ AI+SV Q Y WE+ I DD STS E+ + Y+ ++ +I Sbjct: 789 PLFSIVMPVYNVVPKWLKLAIESVENQWYGRWELCIADDASTS-EETKAYLRKIDHHKIK 847 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 N G C N+A+ LA+GEYI +D+DDE TP+ L L D Sbjct: 848 IRFLKKNLGICGASNEALKLAKGEYIALMDNDDELTPDALYEILKAINTKAAELIYSDED 907 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD------TELK 177 + + +++P P +S +F +N + + V K+ L D L+ Sbjct: 908 KIEEDGTFAEPHFKP-----DFSPDMFLSQNYLSHLVV-----IKKELVDRVGGWEAGLE 957 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILH 206 +QDYD++L+ V+E+ E K+ +++L+ Sbjct: 958 GSQDYDLYLK-VLEHTE--KISHISKVLY 983 Score = 70.9 bits (172), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 56/201 (27%), Positives = 94/201 (46%), Gaps = 23/201 (11%) Query: 5 PLISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE---QLQQYVTALNDP 60 PLISI +PT+N +++ ++SVL Q Y NWE+ I DD ST E L+ Y T P Sbjct: 89 PLISIIVPTYNTKKRYLTEMLESVLSQTYGNWELCIADDASTDRETIDTLEYYRT--KHP 146 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 + ++ N N A+ +A G+Y+ +D DD +PN L + LY Sbjct: 147 AVKVVYRKKNGHISEASNTALSIALGDYVAFLDHDDTLSPNALYEMAKKLNEDRKLKILY 206 Query: 121 AN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN------IIGNQVFTWAWRFKECLFD 173 ++ D + + P + KS ++ +F+ +N II ++ F++ Sbjct: 207 SDEDKIDENSNRYMP-----HFKSGWNPDMFFSQNYITHLLIIKKEIIDKVGGFRKG--- 258 Query: 174 TELKAAQDYDIFLRMVVEYGE 194 + +QDYD+ LR + G+ Sbjct: 259 --YEGSQDYDLVLRCLDHIGK 277 Score = 43.5 bits (101), Expect = 0.007, Method: Compositional matrix adjust. Identities = 29/96 (30%), Positives = 54/96 (56%), Gaps = 5/96 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQY--VTALNDPR 61 PL+S+ +PT + + + I+S+L++ Y N+E++IVD+ ST + L+ + + R Sbjct: 347 PLVSLVVPTRDSYNILHKCIESILQKTLYENYEILIVDNESTDPKTLRYFEILKKHEHIR 406 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE 97 I H+ N A+ N + A+GE I +++D E Sbjct: 407 ILEYHHPFN--YSAINNYGVQYARGEIIGLLNNDVE 440 Score = 41.6 bits (96), Expect = 0.033, Method: Compositional matrix adjust. Identities = 26/100 (26%), Positives = 54/100 (54%), Gaps = 3/100 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYV-TALN 58 +K PL+SI +P ++ +L I+S+L++ Y N+E+I +++ S WE ++ Sbjct: 1038 LKEEPLVSIIIPFKDKPELLKTCIESILKKSSYQNYEIIGINNRSKEWETFKEMKRLEKR 1097 Query: 59 DPRITYIHNDINSGACAVRNQAI-MLAQGEYITGIDDDDE 97 D R+ + + + N A+ A+G++I +++D E Sbjct: 1098 DSRVRFCEYNDTFNYSKINNFAVSSCAKGKHIVLMNNDIE 1137 >UniRef50_C0CGY4 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CGY4_9FIRM Length = 255 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 43/97 (44%), Positives = 55/97 (56%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +SI P+WN ++ + IKSV Q Y NWEMIIVDDCST A DPRI I Sbjct: 5 VSIITPSWNSEKYIVDTIKSVQNQIYKNWEMIIVDDCSTDHTVDIIEAIAKEDPRIKLIK 64 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 +N GA RN ++ A G +I +D DD W P++L Sbjct: 65 QPVNGGAAKARNISLNEATGRFIAYLDADDIWKPDKL 101 >UniRef50_B8D172 Glycosyl transferase family 2 n=4 Tax=Bacteria RepID=B8D172_HALOH Length = 250 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 60/214 (28%), Positives = 101/214 (47%), Gaps = 18/214 (8%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW--EQLQQYVTALNDPRIT 63 L+S+ P +N +Q + IKSVL Q Y+NWEMI+VDDCST + +++Y+ + RI Sbjct: 5 LVSVITPLYNSEQFIEKTIKSVLNQTYANWEMIVVDDCSTDSGPDIVKEYLKK--NSRIK 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I NSGA RN+ I +++G YI +D DD W ++L K+Q+ F+ N+ Sbjct: 63 LIKLKKNSGAAVARNKGIKISKGRYIAFLDSDDLWHKDKL------KKQV---EFMKNNN 113 Query: 124 YVCQGEVYSQPASLP-----LYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 V Y + + P + K N IG + ++ ++ + Sbjct: 114 VVLSYTAYRKIDEFGNLRGIIRPPKKINYNQLLKTNSIGCLTAMYDTKYVGKVYMPVIDR 173 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEM 212 QDY ++L+++ + + V E N + Sbjct: 174 RQDYALWLKILKKGITAYGVNEVLAYYRTNKNSL 207 >UniRef50_B9M559 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. FRC-32 RepID=B9M559_GEOSF Length = 281 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 42/99 (42%), Positives = 64/99 (64%), Gaps = 3/99 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +PT+NR +L AI SVL Q Y N+E++I+DD ST + + V P+I YI+ Sbjct: 4 VSVIIPTYNRSELLKCAITSVLDQAYDNYELLIIDDGST--DDTKSIVNGFGSPKIRYIY 61 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV 105 NSG + RN I L++GEY++ +D DD + PN+LS+ Sbjct: 62 QS-NSGRSSARNYGIKLSKGEYVSFLDSDDIFLPNKLSL 99 >UniRef50_C4F932 Putative uncharacterized protein n=1 Tax=Collinsella intestinalis DSM 13280 RepID=C4F932_9ACTN Length = 974 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 64/192 (33%), Positives = 98/192 (51%), Gaps = 20/192 (10%) Query: 8 SIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 SI MPT+N + AI SVL Q YSNWE+ IVDDCSTS E L++Y+ +LN + Sbjct: 52 SILMPTFNVDVRWVSMAIDSVLAQSYSNWELCIVDDCSTSAE-LKEYLLSLNTDNVRVRL 110 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL-SVFLAHKQQLVTHAFLYANDYV 125 D N G N A +A G+Y+ +D+DD TPN L +FL +Y+++ + Sbjct: 111 LDTNLGISGATNCAADMATGDYLVLLDNDDMLTPNALFELFLRATSD--NPDIMYSDNDI 168 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDT------ELKAA 179 + L + K +S L + +G+ + F++ +FD E + Sbjct: 169 ----IDENGNRLSVLFKPDWSPDLMLSQMYVGHLL-----AFRKNIFDAVGGFRNEFNGS 219 Query: 180 QDYDIFLRMVVE 191 QDYD+FLR+ ++ Sbjct: 220 QDYDLFLRLSLQ 231 >UniRef50_C0ECJ2 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0ECJ2_9CLOT Length = 658 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 77/289 (26%), Positives = 120/289 (41%), Gaps = 55/289 (19%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPR 61 +P +S+ + WN + R I+S+L Q Y N E+I DD ST ++ LQ++ A D R Sbjct: 2 DPKVSVLIACWNSEHTIQRCIESILSQTYLNLEIIACDDGSTDGTFGLLQEF--AQRDSR 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV---FLAHKQQLV-THA 117 + + + +N GA + RN + A+G YI D DD +P+R+ FL H + A Sbjct: 60 VIPLRSPVNRGAASARNLCLASARGAYIAIQDADDYSSPDRIKKEVDFLEHHPEFAFVSA 119 Query: 118 FLYANDYVCQGEVYSQPASLPLY------PKSPYSRRLFYKRNIIGNQVFTWAWRFKECL 171 + D V + YS P S+PL P ++ LF + + + AW Sbjct: 120 GMVRFDEVGEWSWYSPPRSVPLRWDFLSGPPFAHAVTLFRRTALQAVGGYRVAW------ 173 Query: 172 FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 E + QDYD+F R+ G+ +FY + Sbjct: 174 ---ETRRGQDYDLFFRLCAA-----------------------GYRGANLPGFLYFYLQD 207 Query: 232 KDKFDRASKKYQ----LFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 ++ F R +Y+ + L R RM R L L K LA G+ Sbjct: 208 RNAFSRKQYRYRVAESVIRLKGYRQNRMLLRGLPYLF-----KPLAAGL 251 >UniRef50_C0ED54 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0ED54_9CLOT Length = 347 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 44/107 (41%), Positives = 62/107 (57%), Gaps = 1/107 (0%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M++N ISI MP +N + +AI SVL Q Y N+E+I++DD ST A D Sbjct: 1 MESNKFISIIMPVYNSEHYLKQAIDSVLSQTYQNFELILLDDGSTDASGSMCDAAAQQDQ 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL 107 RI+ +H N G C RN I LA+G+Y+ +D+DD PN LS+ + Sbjct: 61 RISVVHKQ-NEGICRTRNLGIQLAKGDYLYFMDNDDTIDPNALSLLM 106 >UniRef50_C6CSD6 Glycosyl transferase family 2 n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CSD6_PAESJ Length = 304 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 71/244 (29%), Positives = 118/244 (48%), Gaps = 24/244 (9%) Query: 7 ISIYMPTWNRQQLAI-RAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR---I 62 IS + T+ R + RA+ S++ Q YSN E+I+V+D + ++ LN+ I Sbjct: 4 ISCIVTTYKRPIPVLQRAVASIVNQTYSNLELIVVNDAPQILKLVEDIRIMLNEFTNIPI 63 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH---AFL 119 YI N GAC RN I A GEY+ +DDDDEW P++L + L+T A + Sbjct: 64 KYIVQHQNMGACQARNTGIEEASGEYVAFLDDDDEWLPDKLE----KQYALITKERVALV 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-------F 172 Y + Y + E L + + ++R ++ N V + ++ F Sbjct: 120 YCSHYEIRQEGTRTLVIEDLAREGIHQDE--FERLLLANFVGSTSYPLLRLSAVKTVGGF 177 Query: 173 DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS-PKKFSGYFHFYRKH 231 DT LKA+QD+D++LR+ +Y + E +++ H E I+S+ K G+ K+ Sbjct: 178 DTNLKASQDHDLWLRIARKYPIVYCNE---PLVNYYHSEESISSNMENKLQGFNFLLNKY 234 Query: 232 KDKF 235 KD++ Sbjct: 235 KDEY 238 >UniRef50_A0YSS8 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YSS8_9CYAN Length = 321 Score = 85.1 bits (209), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 63/195 (32%), Positives = 93/195 (47%), Gaps = 15/195 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 P ISI +P +N I SVL+Q +S++E+IIV+D ST + E + QY D R+ Sbjct: 2 PQISIIIPVYNGATTIQATIDSVLKQTFSDFELIIVNDGSTDSTLEIISQY----RDRRL 57 Query: 63 ---TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 +Y H +GA A RN+ + A GEYI +D DD WTP++L L Q A Sbjct: 58 QVLSYPH----AGASATRNRGLKQASGEYIAFLDADDLWTPDKLEAQLNALQTHPKAAVA 113 Query: 120 YA-NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELK 177 Y+ D++ + + Q PY+ L Y G+ + FD L Sbjct: 114 YSWTDFLDEAGSWRQAGRHTTVNGEPYAAMLLYNFIESGSNPLIRRDALDDVGGFDESLS 173 Query: 178 AAQDYDIFLRMVVEY 192 QD+D++LR+ Y Sbjct: 174 GGQDWDLYLRLAARY 188 >UniRef50_A8RSJ6 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RSJ6_9CLOT Length = 307 Score = 84.7 bits (208), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 61/227 (26%), Positives = 115/227 (50%), Gaps = 18/227 (7%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTS---WEQLQQYVTALNDP 60 PL++ + T+ R+ ++ RA+ S++ Q Y N E+ +V+DC T E L+ + + + Sbjct: 7 PLVTTVITTYKRKPEIVKRALDSIVGQTYDNIEIFVVNDCPTDKKLIEDLKNMIASSSGD 66 Query: 61 R-ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R + YI + N GAC RN A+ A+G+Y +DDDDEW P ++ + + ++ Sbjct: 67 RTVHYIVVEKNGGACRARNIALEKAKGKYFACLDDDDEWLPKKIELQVRALEEHPDAVIA 126 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYK---RNIIGNQVFTWAWRF----KECLF 172 Y N + ++ + + K L+Y+ +N IG+ F +R + F Sbjct: 127 YCNAVIRYADIGKETVR---FTKKQNEGNLYYELIGKNNIGSCSFPM-FRVDAIKRAGGF 182 Query: 173 DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 D + A QD+D++LR++ E + V E + + G+ +I++ P+ Sbjct: 183 DANMLALQDWDLYLRILKENKAVY-VHEPVAVYYFYAGQ-RISAHPE 227 >UniRef50_C2Y456 Teichuronic acid biosynthesis glycosyl transferase n=2 Tax=Firmicutes RepID=C2Y456_BACCE Length = 253 Score = 84.3 bits (207), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 47/109 (43%), Positives = 65/109 (59%), Gaps = 4/109 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL--QQYVTALNDPRIT 63 L+SI P +N ++ I+SV+ QDY+NWE+IIVDD S +L +QY+ + + RI Sbjct: 6 LVSIITPVYNTEKYLRNTIESVINQDYANWELIIVDDYSKDRSRLIIEQYMES--EKRIK 63 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ 112 I D N G RN AI A G+YI +D DD W PN+LS+ L Q+ Sbjct: 64 LIALDENRGVTFARNTAIKAANGKYIAFLDSDDLWHPNKLSIQLGFMQK 112 >UniRef50_B6YUK0 Glycosyltransferase n=3 Tax=Thermococcaceae RepID=B6YUK0_THEON Length = 294 Score = 84.3 bits (207), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 61/195 (31%), Positives = 97/195 (49%), Gaps = 23/195 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +SI +PT+NR +L RAI+SVL Q + ++E++++D ++ ++ V + D R+ Y Sbjct: 3 PAVSIVIPTYNRNKLLTRAIESVLSQGFDDFEVLVIDGARSN--STRELVRSYGDGRLRY 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLY-AN 122 I G RN + A+GEYI +DDDD+W P +L + + + L T+ +Y A Sbjct: 61 IPQR-GKGIADARNLGVKKARGEYIAFLDDDDQWLPEKLGLQMELFKSLPSTYGLIYTAF 119 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSR----RLFYKRNIIGNQVFTWAWRFKEC-----LFD 173 +Y Y + L K P +R R K NI G +EC LF Sbjct: 120 NY------YYLERNKILGTKKPKARGNVYRHLLKDNITGTSTIMVR---RECFKRAGLFR 170 Query: 174 TELKAAQDYDIFLRM 188 +D+D++LRM Sbjct: 171 MSFPTCEDWDMWLRM 185 >UniRef50_A8UI46 Putative uncharacterized protein n=1 Tax=Flavobacteriales bacterium ALC-1 RepID=A8UI46_9FLAO Length = 313 Score = 84.3 bits (207), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 61/200 (30%), Positives = 100/200 (50%), Gaps = 15/200 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P S+ +P +N+ + IKS L Q + ++E++I++D ST + V N +I Sbjct: 2 PFFSVIIPLYNKAEYISDCIKSALSQTFEDYEIVIINDGST--DNSISVVEKFNSQKIQL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 H D N+GA RN LA EYI +D DD W PN L+ A L +A L+AN+Y Sbjct: 60 FHQD-NAGASIARNNGANLATAEYIAFLDADDVWKPNHLACIKAS-IDLYPNAGLFANNY 117 Query: 125 -VCQGEVYSQPASL--PLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE------CLFDTE 175 + + + PA L + K P F+K ++ V+T A K+ +F+ Sbjct: 118 FIKHNQNHIAPAKLNFNVIDKQPLVLDNFFKTSLYDTVVWTSAAAIKKQKFIDYGMFNPL 177 Query: 176 LKAAQDYDIFLRMVVEYGEP 195 ++QD D+++R+ ++ EP Sbjct: 178 YLSSQDLDLWIRIALK--EP 195 >UniRef50_D0D9G6 Glycosyl transferase family 2 n=1 Tax=Citreicella sp. SE45 RepID=D0D9G6_9RHOB Length = 319 Score = 84.0 bits (206), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 60/204 (29%), Positives = 98/204 (48%), Gaps = 36/204 (17%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 P +S+ +P NR L RAI SVL Q Y + E+I VDD ST + + L +Y +DPR Sbjct: 26 PTVSVVLPVHNRSSLVTRAIDSVLGQTYRDIELIAVDDASTDDTPDTLARYS---HDPRF 82 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL------AHKQQLVTH 116 + N N GA RN I LA+G Y+ D DD W P +L L A + + Sbjct: 83 RTVRNAQNLGAAGTRNVGIGLARGRYLAFQDSDDRWFPEKLETQLKVLQASADCRACYSG 142 Query: 117 AFLYANDY-----------VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW 165 A Y+ ++ +G++++ +L + P +P + L +R+++ Sbjct: 143 AVYYSKEHSYYIPREGAHGAPKGDIFAD--TLMINPTTPQT--LLIERSLL--------- 189 Query: 166 RFKECLFDTELKAAQDYDIFLRMV 189 + +FD L+ +D+D+ LR+ Sbjct: 190 -TETGVFDDTLRINEDWDLALRLA 212 >UniRef50_Q72XI4 Glycosyl transferase, group 2 family protein n=2 Tax=Firmicutes RepID=Q72XI4_BACC1 Length = 251 Score = 84.0 bits (206), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 40/98 (40%), Positives = 56/98 (57%), Gaps = 2/98 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI P +N ++ IKSVL Q + NWEM I DDCS + + ++ D RI Y Sbjct: 10 LVSIITPVYNSEEYIGDTIKSVLSQTHENWEMFIADDCSQ--DSTAEVISQFKDERIKYF 67 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 D N+GA RN+A+ A G +I +D DD W P++L Sbjct: 68 KLDENAGAAVARNKALEKANGNFIAFLDADDMWKPDKL 105 >UniRef50_C6XYA6 Glycosyl transferase family 2 n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XYA6_PEDHD Length = 338 Score = 83.6 bits (205), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 56/203 (27%), Positives = 96/203 (47%), Gaps = 8/203 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P I+++M +N +IKS+L Q ++++E++IV+D ST ++ Q V ND RI Sbjct: 3 PKITVFMAAYNAANYINESIKSILSQTFTDFELLIVNDGST--DETVQLVLEFNDSRIRL 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 +HND N G RN A+ A+GEY+ +D DD PNRL + Q+ +A + Sbjct: 61 VHNDKNRGLVYTRNVALKEARGEYLAILDSDDIALPNRLELQYRFLQENPGYALCGGHGK 120 Query: 125 VC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDY 182 V G++ + L + + + V KE + A+DY Sbjct: 121 VIDKDGKLLDDNRLIVPTGNEHIKMTLLFINTYVNSTVMYKTTVLKELHGYKDFAPAEDY 180 Query: 183 DIFLRMVVEYGEPWKVEEATQIL 205 ++F+R+ E + V+ ++L Sbjct: 181 ELFIRI----AEKYPVDNLDEVL 199 >UniRef50_B4VMI0 Glycosyl transferase, group 2 family protein n=2 Tax=Oscillatoriales RepID=B4VMI0_9CYAN Length = 325 Score = 83.6 bits (205), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 57/192 (29%), Positives = 103/192 (53%), Gaps = 11/192 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 LIS+ +P +N ++ + I+SV+ Q +SN+E+I++DD S + V +++D R+ + Sbjct: 3 LISVVIPVYNGEKTIRQTIESVINQTFSNFELIVIDDGSQ--DSTLDIVRSISDSRLK-V 59 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS----VFLAHKQQLVTHAFLYA 121 + N+G RN+ I LA G+YI+ ID DD WT N+L L + Q V ++ Sbjct: 60 FSFPNAGLATSRNRGIQLASGDYISFIDADDLWTGNKLEAQLKALLDNPQAAVAFSW--- 116 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKAAQ 180 D + + +S+ + + Y++ L G+ + + FK+ F+ L AA+ Sbjct: 117 TDCIDESGQFSRRGNYTTVSGNVYAQLLLTDFIENGSNLLIRSEAFKKVGYFNESLPAAE 176 Query: 181 DYDIFLRMVVEY 192 D+D++LR+ V Y Sbjct: 177 DWDMWLRLAVNY 188 >UniRef50_C7TKV7 Glycosyl transferase, group 2 n=5 Tax=Lactobacillus RepID=C7TKV7_LACRL Length = 251 Score = 83.6 bits (205), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 41/114 (35%), Positives = 66/114 (57%), Gaps = 4/114 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALN 58 M N P++S+ MP +N ++SVL Q + + E++IVDD S+ ++E +++ + Sbjct: 1 MGNTPVVSVVMPAYNAATYLPNTVQSVLEQSFKDLELLIVDDASSDHTFEVVEK--LSAT 58 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ 112 D RITYI D N G RN + ++GEY+ +D DD W P++L + L QQ Sbjct: 59 DSRITYIKLDENQGVANARNVGVQHSRGEYVAFLDSDDIWLPDKLKIQLTFMQQ 112 >UniRef50_A9AAL8 Glycosyl transferase family 2 n=2 Tax=Methanococcus maripaludis RepID=A9AAL8_METM6 Length = 280 Score = 83.6 bits (205), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 58/214 (27%), Positives = 101/214 (47%), Gaps = 21/214 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALND 59 +NNPL+S+ MP +N ++ AI+S+L Q Y ++E II+DDCST SW +Q Y A D Sbjct: 5 ENNPLVSVIMPNYNNEKYLAEAIESILNQTYEHFEFIIIDDCSTDNSWHIIQNY--AKKD 62 Query: 60 PRITYIHNDINSGACAVRNQA--IMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 RI N++N RN+ +M +YI D DD P RL + ++ V + Sbjct: 63 NRIKMFRNEVNLKIVKTRNKGFELMSPHSKYIAIFDSDDVSMPERLEKQINFSEKNVDYG 122 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL------ 171 + ++ + + + + Y K + K + NQ + ++ + Sbjct: 123 LIGSHTLII-----DENSKIIGYRKYETDLKKIIKNMVKKNQFAQPSVMIRKSVLDDVGY 177 Query: 172 FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQIL 205 +D + QDYD++ R+ + +K+E + L Sbjct: 178 YDEKYTRCQDYDLWFRI----AKSFKIENLDEFL 207 >UniRef50_Q3A3M0 Glycosyltransferase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A3M0_PELCD Length = 249 Score = 83.6 bits (205), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 61/198 (30%), Positives = 99/198 (50%), Gaps = 21/198 (10%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRITY 64 +SI P N + IKS+ Q ++NWE I+VDD ST +W+ +Q+Y ++ +PR Sbjct: 4 VSIITPCHNSAPFVLETIKSISEQTFTNWEHILVDDASTDDTWQIIQKYCSS--EPRAKC 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV---FLAHKQQLVT-HAFLY 120 NSGA RN AI AQG YI ID DD W P +L F+ +K ++ ++ Sbjct: 62 FRLTKNSGAALARNMAIEAAQGRYIAFIDSDDLWMPEKLEKQIEFMRYKNAYLSFSSYEK 121 Query: 121 ANDY-VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 N+Y V G + P + Y+ L K +IG + ++ ++ +++ Sbjct: 122 INEYGVRSGRMVDVPNEV------NYNEIL--KGCVIGCLTAMYDSKYLGKVYMPDIRRG 173 Query: 180 QDYDIFLRMV----VEYG 193 QDY ++LR++ + YG Sbjct: 174 QDYGLWLRILRRGHIAYG 191 >UniRef50_A0YST1 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YST1_9CYAN Length = 297 Score = 83.2 bits (204), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 45/131 (34%), Positives = 79/131 (60%), Gaps = 6/131 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+IS+ +P +N ++ + I+SVL Q + +WE+I+VDD ST + ++ V + D RI + Sbjct: 2 PIISVIIPAYNAEKTIKKTIQSVLNQTFLDWELIVVDDGST--DSTKEVVAEIKDNRI-F 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-ND 123 + + N+G A RN+ + + GE++ ID DD WTP++L + L Q+ A Y+ D Sbjct: 59 LFSFPNAGVSAARNRGVNKSSGEFLAFIDADDLWTPDKLELQLKALQEHPKAAVAYSWTD 118 Query: 124 YVCQGE--VYS 132 Y+ + + +YS Sbjct: 119 YIDESDQIIYS 129 >UniRef50_B5WT88 Glycosyl transferase family 2 n=1 Tax=Burkholderia sp. H160 RepID=B5WT88_9BURK Length = 305 Score = 83.2 bits (204), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 56/193 (29%), Positives = 93/193 (48%), Gaps = 23/193 (11%) Query: 11 MPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHNDIN 70 M +N + AIKSV RQ ++WE+I+VDD ST A NDPRI + + N Sbjct: 1 MANFNGEHWIGEAIKSVQRQSVTDWELIVVDDASTDASTEIVRAAAANDPRIVLLTSSSN 60 Query: 71 SGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVCQGEV 130 +G RN+A+ A+G +IT +D DD + RL LA + T + A+D + E Sbjct: 61 AGPSVARNRALACARGRWITILDSDDLFADGRLESLLARARSDGTG--IVADDLLIMDES 118 Query: 131 YS------------QPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 S + K+P+ L Y + +I ++ + +D +++ Sbjct: 119 GSLTGYSLLGLRTVRTVDAVALVKAPH---LGYLQPMIKTELLA------DLRYDERVRS 169 Query: 179 AQDYDIFLRMVVE 191 A+D+D+ LR++V+ Sbjct: 170 AEDFDLLLRVLVK 182 >UniRef50_C6BYR5 Glycosyl transferase family 2 n=5 Tax=Desulfovibrionales RepID=C6BYR5_DESAD Length = 292 Score = 83.2 bits (204), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 61/236 (25%), Positives = 112/236 (47%), Gaps = 27/236 (11%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N P +S+ +PT+NR RA++SVL Q ++++E ++VDD ST + + + +D R+ Sbjct: 6 NEPKVSVIIPTYNRADRVCRAVESVLAQKFTDFECLVVDDGST--DDTAERLAEFDDSRL 63 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + + N G A RN I A G++I +D DDEW P +L + ++ Sbjct: 64 KILRQE-NKGVSAARNFGIAAAIGDFIALLDSDDEWVPEKLLKQIPFMEEAGFEISQTDE 122 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI----IGNQVFTWAWRFKECL--FDTEL 176 ++ +G+ +Q K +F+ R++ + ++ +F + + FD ++ Sbjct: 123 IWIRKGKRVNQ------CKKHEKPEGMFFDRSLEMCMVSPSCVIFSRKFWDEIGPFDEDM 176 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS---GYFHFYR 229 A +DYD+++R ++Y V + L I HG P + S G YR Sbjct: 177 PACEDYDLWIRAGLKY----PVGLLRERLTIKHG-----GRPDQLSNSVGCLDLYR 223 >UniRef50_A5ZV71 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=A5ZV71_9FIRM Length = 250 Score = 83.2 bits (204), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 56/155 (36%), Positives = 77/155 (49%), Gaps = 21/155 (13%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 L+S+ PT+N + I+SV Q Y WEMIIVDDCST + E + +Y+ D RI Sbjct: 5 LVSVITPTYNCAKFIGETIESVQAQTYQQWEMIIVDDCSTDNTKEIVDKYIK--EDSRIK 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y + NSGA R +A+ LA GEY+ +D DD W +L K+QL AF+ +D Sbjct: 63 YFCLENNSGAAVARTKAMELANGEYMAFLDSDDIWPEEKL------KKQL---AFMKKHD 113 Query: 124 YVCQGEVYSQPAS--------LPLYPKSPYSRRLF 150 Y Q + PK+ Y+R L Sbjct: 114 VAFSCTAYEQIDENGKVLNKIIKTVPKADYNRVLL 148 >UniRef50_C6VXV3 Glycosyl transferase family 2 n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VXV3_DYAFD Length = 302 Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 45/127 (35%), Positives = 67/127 (52%), Gaps = 4/127 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PLISI T+NR + +I SVL Q + NWE+I++ DC T + + V DPRIT Sbjct: 3 GPLISIITATYNRSNILAYSIHSVLNQTFRNWELIVIGDCCT--DDTAEVVAGFRDPRIT 60 Query: 64 YIHNDINSGA-CAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 +I+ + N G N+ + A+GEYI ++ DD W P+ L F + Q + A Sbjct: 61 FINLEKNFGEQSGPNNEGLRRARGEYIAFLNHDDLWFPDHLE-FSLRRLQSTGADLVLAA 119 Query: 123 DYVCQGE 129 ++ GE Sbjct: 120 GFIDHGE 126 >UniRef50_Q8YSL1 Alr3073 protein n=2 Tax=Nostocaceae RepID=Q8YSL1_ANASP Length = 324 Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 64/207 (30%), Positives = 113/207 (54%), Gaps = 14/207 (6%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ +P +N ++ + I SVL Q +S+ E+I+++D ST ++ + + ++D R+ Sbjct: 2 PKISVIIPAYNAERTILETINSVLNQTFSDLEIIVINDGST--DRTVEVLQNVDDARLK- 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA----HKQQLVTHAFLY 120 +++ NS A RN I A G++I+ +D DD WTP++L + L+ H + V +++ Y Sbjct: 59 VYSYENSRASGARNHGISHAVGDFISFLDADDLWTPDKLELQLSALNNHPEAGVAYSWTY 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-NQVFTWAWRFKECLFDTELKAA 179 D +GE+ +P PLY + Y+ L G N + A FDT L++ Sbjct: 119 TID--DKGELL-KPFE-PLYEGNVYTDLLLANFLTNGSNPLIRKAAIASIGEFDTTLRSG 174 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILH 206 +D+D +LR+ Y P+ V + QIL+ Sbjct: 175 EDWDYWLRLA--YKWPFVVVKQHQILY 199 >UniRef50_A6LGW8 Glycosyltransferase family 2 n=3 Tax=Bacteroidales RepID=A6LGW8_PARD8 Length = 259 Score = 82.8 bits (203), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 44/124 (35%), Positives = 71/124 (57%), Gaps = 8/124 (6%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 L+SI MP++N + + I S++ Q Y+NWE++I DDCST + E +++Y A D RI Sbjct: 7 LVSIIMPSYNSSKYIAKTIDSIVSQFYTNWELLITDDCSTDNTCEIIKEY--AAYDQRIK 64 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV----FLAHKQQLVTHAFL 119 + N GA RN++I A+G YI D DD W P +L V + ++ ++ ++L Sbjct: 65 LFVMEENKGAGVARNKSIEEAKGRYIAFCDSDDRWKPEKLEVQLRFMVENRVEICYSSYL 124 Query: 120 YAND 123 N+ Sbjct: 125 KCNE 128 >UniRef50_B5WL01 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B5WL01_9BURK Length = 731 Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 68/198 (34%), Positives = 94/198 (47%), Gaps = 19/198 (9%) Query: 5 PLISIYMPTWNRQQLAIR-AIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTALNDP 60 PLISI MPT+N +R AI SV+ Q Y NWE I DDCST+ E L YV DP Sbjct: 191 PLISIVMPTYNTPAKWLRMAIDSVIDQVYENWEFCIADDCSTNPEVKEVLDSYVA--KDP 248 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL----SVFLAHKQQLVTH 116 RI + N N A+ LA GE++ +D DDE P L + AH V + Sbjct: 249 RIKVAYRTTNGHISNSSNTALELAVGEFVGLLDHDDELHPLALYCVAELINAHPDATVIY 308 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTE 175 + D + S P Y K ++ LF +N+I + V+ + + F T Sbjct: 309 S---DEDKISIDGERSDP-----YFKCDFNYDLFLSQNMISHFGVYRTSIMKEVGGFRTG 360 Query: 176 LKAAQDYDIFLRMVVEYG 193 + +QDYD+ LR++ G Sbjct: 361 FEGSQDYDLALRVIDRAG 378 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 30/127 (23%), Positives = 58/127 (45%), Gaps = 6/127 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P + I +PT + L + ++SV R+ Y N+ + I+D+ S E + + D RI Sbjct: 451 PSVEIIIPTRDSAGLVEQCVESVRRKSSYPNFRITIIDNGSVKQETHELFARLQEDERIK 510 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-----WTPNRLSVFLAHKQQLVTHAF 118 + +D A+ N+ + + +++ +++D E W +SV L V Sbjct: 511 VVRDDSPFNYSALNNRVALASTADFVCLMNNDIEVINADWLEEMVSVALQVNVGAVGAKL 570 Query: 119 LYANDYV 125 LY +D + Sbjct: 571 LYPDDTI 577 >UniRef50_C6EQF5 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni RepID=C6EQF5_CAMJE Length = 666 Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 40/101 (39%), Positives = 64/101 (63%), Gaps = 4/101 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPRI 62 PLIS+ MPT+NR++ AI+S+L Q ++++E II+DDCS +++ +++Y D RI Sbjct: 7 PLISVIMPTFNREKYIKFAIESILNQTFTSFEFIIIDDCSNDATYDIIREYAKI--DQRI 64 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 + N N G N+ + L++G+YI +DDDD P RL Sbjct: 65 IVLRNKANKGIVFSLNKGLQLSRGKYIARMDDDDISMPTRL 105 >UniRef50_A4A6G7 Glycosyltransferase n=1 Tax=Congregibacter litoralis KT71 RepID=A4A6G7_9GAMM Length = 291 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 60/199 (30%), Positives = 93/199 (46%), Gaps = 29/199 (14%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ +P +NR +A+ SV+ QDY ++E+I+VDD ST + + V AL P + Y Sbjct: 9 PLVSVVIPCYNRADRIAQAVHSVVDQDYPSFEIIVVDDGST--DNTEAVVAALEIPTLRY 66 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N GA + RN I AQGEYI D DD W+ ++L+ + +Q++TH N Sbjct: 67 LRLQENRGANSARNVGIREAQGEYIAFQDSDDLWSASKLTTQI---KQMLTH-----NAK 118 Query: 125 VCQGEVYSQPASLPL-YPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC------------- 170 VC + PK+ Y + + + C Sbjct: 119 VCFCAFNRNDRGVKTRVPKASY----HVQHGCVDRHAELLRGSYISCQTLIARKETLLSV 174 Query: 171 -LFDTELKAAQDYDIFLRM 188 LFD LK QD+++ LR+ Sbjct: 175 GLFDESLKRLQDWELCLRL 193 >UniRef50_C1DGR7 Glycosyl transferase, family 2 protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGR7_AZOVD Length = 340 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 57/200 (28%), Positives = 92/200 (46%), Gaps = 23/200 (11%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALN 58 M P +++++P NRQ+ A+ S+L Q + ++E+++VDD ST + E L Y Sbjct: 1 MSRTPRVTVFIPVHNRQRYIATAVDSILAQTFGDFELLVVDDGSTDATLEVLSHY----R 56 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 DPR+ N N G RN+ + LA+GEYI +D DD+ P RL+ + L H Sbjct: 57 DPRLRVECNPRNLGLPGTRNRGLELARGEYIALLDSDDKAWPERLT---RQVEVLDRHPE 113 Query: 119 LYANDYVC-----QGEVYSQPASLPLYPKSPYSRRLFY----KRNIIGNQVFTWAWRFKE 169 L C +G + PL P + FY R I+G +R+ E Sbjct: 114 LVQIGSACDFMDAEGRRLDRVRRRPLAPDEVDASLAFYCALTNRTIMGRTAILREYRYSE 173 Query: 170 CLFDTELKAAQDYDIFLRMV 189 + + +DY++ R+ Sbjct: 174 -----DFPSCEDYELHQRLA 188 >UniRef50_A6ETY5 Glycosyltransferase n=1 Tax=unidentified eubacterium SCB49 RepID=A6ETY5_9BACT Length = 310 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 48/119 (40%), Positives = 67/119 (56%), Gaps = 7/119 (5%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +P +NR + R I SV +Q ++N E+I++DDCS QQ T+L DP I Y Sbjct: 3 VSVIIPYYNRPKKLARCIASVQKQSHTNLEIIVIDDCS------QQAPTSL-DPSIVYKK 55 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N G RN+ LA G++IT +D DD W P L LA + A +YAN YV Sbjct: 56 NETNLGPGLSRNEGKALATGQFITFLDCDDYWHPTFLEKALACFLEQKHIAMVYANAYV 114 >UniRef50_Q0ACF0 Glycosyl transferase, family 2 n=4 Tax=Ectothiorhodospiraceae RepID=Q0ACF0_ALHEH Length = 597 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 65/205 (31%), Positives = 97/205 (47%), Gaps = 27/205 (13%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ MP +N AI SVL QDY + E+I++DD S+ + V A D R+ Sbjct: 281 PLVSVIMPAFNAASYIEEAIDSVLAQDYPHKELIVIDDGSS--DDTVARVQAYGD-RVRL 337 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G+ RNQ + AQGEYI +D DD W P +L+ + + + +Y +D+ Sbjct: 338 L-TQANQGSAVARNQGLDAAQGEYIAFLDSDDVWLPGKLTAQVGYLEAHPDVGMIY-SDW 395 Query: 125 VC-----QGEVYSQPASL-PLYPKS---PYSRRL-------FYKRNIIGNQVFTWAWRFK 168 + Q + + P +L P P + P L Y R + G+ + T + Sbjct: 396 LPWKRDKQSKAFPPPEALAPATPDTGVPPEEIPLLTEGSGWLYNRLLFGSLLHTITVMAR 455 Query: 169 ECL------FDTELKAAQDYDIFLR 187 L FD ELK QDYD +LR Sbjct: 456 RELIEQVGRFDPELKRGQDYDYWLR 480 >UniRef50_B7AK29 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=B7AK29_9BACE Length = 252 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 49/117 (41%), Positives = 64/117 (54%), Gaps = 8/117 (6%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPRIT 63 L+SI P +N + I SVLRQ Y+NWEMI+VDD S S E + +YV D RI Sbjct: 5 LVSIVTPVYNGAKYVCETIDSVLRQTYANWEMIVVDDGSKDNSAEIINEYVR--RDKRIV 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA---HKQQLVTHA 117 + N G+ + RN I A+G+YI +D DD W P+ L LA K +V HA Sbjct: 63 LLRQ-ANGGSASARNNGIRNAKGQYIALLDADDLWEPDFLESQLALMKRKSAIVVHA 118 >UniRef50_Q0VR09 Glycosyl transferase, group 2 family protein, putative n=2 Tax=Gammaproteobacteria RepID=Q0VR09_ALCBS Length = 274 Score = 81.6 bits (200), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 67/236 (28%), Positives = 114/236 (48%), Gaps = 21/236 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--NDPRI 62 P +SI P +N + R I+SVL Q++ +WE+++VDD ST + Q+ A D RI Sbjct: 7 PEVSIITPCYNARHTLARTIESVLSQNFLSWELLLVDDKST--DDTDQHAQAFCDRDNRI 64 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ----LVTHAF 118 I NSGA RN+ I A+G YI ID DDEW ++LS+ ++ ++ L A+ Sbjct: 65 KLIRLKENSGAAVARNRGIASARGRYIAFIDADDEWHKDKLSIQISQMKEQGWPLSYTAY 124 Query: 119 LYAN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK 177 N + C V P + + R+L K N IG + +F EL+ Sbjct: 125 TRINPEGACINTVG--------VPPTIHYRQLL-KTNYIGCSTAVYDSEQLGKVFMPELR 175 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY-FHFYRKHK 232 QD+ ++L+++ + + + A ++ + +S+ + +GY + YR + Sbjct: 176 KRQDFGLWLKILKKTTSGYGINLALTRYCVHQDSL--SSNKRNTAGYNWRLYRDEE 229 >UniRef50_B4ECE8 Glycosyltransferase n=3 Tax=Burkholderia cepacia complex RepID=B4ECE8_BURCJ Length = 434 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 45/110 (40%), Positives = 65/110 (59%), Gaps = 6/110 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 P +SI + ++N + I+SV+ Q +WE+IIVDDCST SW +Q A D RI Sbjct: 12 PRVSIIVTSYNYARFIGETIRSVIDQTLDSWELIIVDDCSTDDSWNVIQ----AFQDRRI 67 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ 112 + D N GACA N A+ LA+GE+I +D DD + ++L+V +A Q Sbjct: 68 HAVRFDKNRGACAAYNTALSLARGEFIASLDSDDVFNEDKLAVQVAFLDQ 117 >UniRef50_C6IEP2 UDP-galactose phosphate transferase n=2 Tax=Bacteria RepID=C6IEP2_9BACE Length = 261 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 44/104 (42%), Positives = 55/104 (52%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+N L+SI PTW + I S+ Q Y NWE++I DDCST L A DP Sbjct: 1 MQNYGLVSIITPTWACAKFICETIYSIQAQTYQNWELLIQDDCSTDNTYLVVEPLANLDP 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 RI Y N+ SGA RN A+ A G +I +D DD W P +L Sbjct: 61 RIKYQCNEKRSGAAITRNNALKRANGRWIAFLDSDDLWLPEKLE 104 >UniRef50_A8TT37 Glycosyl transferase, group 2 family protein n=2 Tax=Bacteria RepID=A8TT37_9PROT Length = 1185 Score = 81.3 bits (199), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 67/216 (31%), Positives = 104/216 (48%), Gaps = 22/216 (10%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT--AL 57 MK P++S+ MPT+N ++ RAI+SV Q Y WE+ I DD ST E Q+ + A Sbjct: 634 MKAKPVLSVIMPTYNTDPEVLDRAIQSVRNQIYPRWELCIADDASTHGET-QEVIRRHAG 692 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 D RI + ++N A N A+ LA G+++ +D DDE T + L + + Sbjct: 693 EDDRIKTVFREVNGHISAASNSALDLATGDFVVLLDHDDELTSHALYMIASELNDHPNAD 752 Query: 118 FLYANDYVC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL--FD 173 +Y+++ GE Y QP Y K +S +F +N + N V E + F Sbjct: 753 VIYSDEDKLDDNGEAY-QP-----YFKPDWSPDMFLAQNYL-NHVSAHRRSLVEVVGRFR 805 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINH 209 + +QDYD+ LR++ + A +I HI H Sbjct: 806 EGFEGSQDYDLALRVI-------ERTTAERIRHIPH 834 Score = 57.8 bits (138), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 34/95 (35%), Positives = 56/95 (58%), Gaps = 3/95 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALN-DPRI 62 PL+S+ +PT N+ ++ I + DY NWE+I+VD+ S E L +Y+ AL DPRI Sbjct: 897 PLVSLIVPTRNKAEVLKLCIDGIREHNDYPNWELILVDNGSDEDESL-EYLEALGKDPRI 955 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE 97 T + +D + N+A +A+GE + +++D E Sbjct: 956 TVLRDDGPFNYSRLNNRAARMAKGEILGLLNNDIE 990 >UniRef50_Q3J960 Glycosyl transferase, family 2 n=2 Tax=Nitrosococcus oceani RepID=Q3J960_NITOC Length = 455 Score = 81.3 bits (199), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 42/106 (39%), Positives = 63/106 (59%), Gaps = 6/106 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 PL+S+ MP +N ++ A++SVL Q + E+I++DD S+ +W LQ + D R+ Sbjct: 2 PLVSVLMPAYNHEEYVSSAVESVLGQTHKKLELIVIDDASSDGTWAVLQSF----QDHRL 57 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA 108 +D N GA A N+A+ LAQGEYI +D DD + P RL L+ Sbjct: 58 RRYRHDRNQGAHATLNEAMALAQGEYIAILDSDDRYDPRRLERLLS 103 >UniRef50_Q67PP6 Glycosyl transferase n=1 Tax=Symbiobacterium thermophilum RepID=Q67PP6_SYMTH Length = 242 Score = 81.3 bits (199), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 58/199 (29%), Positives = 98/199 (49%), Gaps = 15/199 (7%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ--LQQYVTALN 58 M +NP +++ +P +N + A++S L Q YSN E+++VDD ST L++Y A+ Sbjct: 1 MPDNPKVTVVIPAYNAARYLAEAVESALNQTYSNLEVLVVDDGSTDRTPFLLERYGRAIR 60 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 R N G + N+ I A+GE+I + DD + P +L+ +A+ + A Sbjct: 61 TIR------KRNGGTPSALNEGIRQARGEWIAWLSADDAFLPEKLAKQMAYAEAHPECAL 114 Query: 119 LYANDYVC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR---FKECLFD 173 +Y N +V +G+ S AS ++ +L R + N T R + LFD Sbjct: 115 IYTNWFVVDGRGKTVSHLASPTFRSRADQVEKLL--RGCVINGSTTLVRRDVYLRAGLFD 172 Query: 174 TELKAAQDYDIFLRMVVEY 192 L A D+D++LR+ +Y Sbjct: 173 ESLPQAHDWDMWLRLARDY 191 >UniRef50_Q314K1 Glycosyltransferases involved in cell wall biogenesis-like n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314K1_DESDG Length = 312 Score = 81.3 bits (199), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 69/294 (23%), Positives = 125/294 (42%), Gaps = 51/294 (17%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M P +S+ +PT+N L + ++SV Q Y ++E+I+VDD ST + A + Sbjct: 1 MTTTPRVSVVIPTYNYGHLLEQCLESVFAQTYKDYEVILVDDGSTDD---TPGIAARYEG 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA------------ 108 RI Y + N G + RN+ I A+G I +D DD W P++L+V A Sbjct: 58 RIRYFRKE-NGGPSSARNEGIRQARGPLIAFLDSDDLWLPHKLAVQTAFADAHPEYGLVY 116 Query: 109 ------HKQQLVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT 162 +++ H++L+ Y C G + Y +F ++ + F+ Sbjct: 117 CDMQHVEHGRVIHHSYLHERGYTCTGSGWK-------YVDLLREGFVFTPTVLVRKECFS 169 Query: 163 WAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 A + D L+ QD D++LR+ Y P+ + ++ HG+ T++ + + Sbjct: 170 AAG-----MHDEGLRTCQDLDMWLRIAKLY--PFGFIDEPLVVRQYHGDNSTTNAERYHA 222 Query: 223 GYFH-FYRKHKDKFDRASKK--------------YQLFTLYQIRNKRMTWRTLL 261 + F R+ + D +K+ Y F+ R R +R++L Sbjct: 223 NHIRVFLRELESTADETAKQVLRGRLSEQYFALGYHYFSTGSRRQARRCFRSVL 276 >UniRef50_D0TBE0 Glycosyltransferase family 2 n=2 Tax=Bacteroides RepID=D0TBE0_9BACE Length = 250 Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 49/120 (40%), Positives = 68/120 (56%), Gaps = 6/120 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 LISI P +N + I+SVL Q Y WEM+IVDDCST S E +Q YV D RI Sbjct: 4 LISIITPCYNAVPFIAQTIESVLAQTYPYWEMLIVDDCSTDRSAEIIQTYVK--RDSRIK 61 Query: 64 YIHNDINSGACAV-RNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 Y D SG+ ++ RN + AQGEY+ +D DD W P +L +++ + F+Y++ Sbjct: 62 YFKTDHPSGSPSLPRNIGLKQAQGEYVAFLDSDDAWLPVKLEEQISYMETGCC-GFVYSD 120 >UniRef50_C7GBF7 Glycosyltransferase n=1 Tax=Roseburia intestinalis L1-82 RepID=C7GBF7_9FIRM Length = 274 Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 65/205 (31%), Positives = 101/205 (49%), Gaps = 22/205 (10%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK +SI +PT+NR +AI+SVL Q Y ++E+I+VDD ST ++ ++ VT+ D Sbjct: 1 MKQTEPVSIILPTYNRANQIGKAIESVLAQTYEDFELIVVDDGST--DETEKVVTSYKDC 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ------LV 114 RI Y N G RN + LA+ +Y+ D DD W P +L + + +V Sbjct: 59 RIRYHRMPENGGQSRARNCGMKLARYDYLAFEDSDDLWRPGKLKAQMEAMRNAGTNVGMV 118 Query: 115 THAFLYANDYVCQGEVYSQPASLPLYPKSP--YSRRLFYKRNIIGNQVFTWAWRFKECL- 171 H F Y D + +P KS Y++ L+ N++G + T + KEC+ Sbjct: 119 YHKFRY--DLGAGRGMVLPDEKIPYEKKSGDIYAQLLW--DNLVG--IPTMLLK-KECVE 171 Query: 172 ----FDTELKAAQDYDIFLRMVVEY 192 D L+ +DYD LR+ +Y Sbjct: 172 AVGGLDESLRCLEDYDFALRIAKKY 196 >UniRef50_A9KVT6 Glycosyl transferase family 2 n=1 Tax=Shewanella baltica OS195 RepID=A9KVT6_SHEB9 Length = 146 Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 43/107 (40%), Positives = 65/107 (60%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L SI MP++N + +IKS+ +QDY +WE+++ DD ST A +D RI Sbjct: 3 LASIIMPSYNSAATIVSSIKSIQQQDYLHWELLVTDDGSTDNTVELVRKLANDDSRIRID 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ 112 N +NSGA RNQ+I A+G+YI +D DD W PN+LS +++ ++ Sbjct: 63 VNPVNSGAGFSRNQSIKRAKGKYIAFLDADDLWLPNKLSSQVSYTER 109 >UniRef50_A0KUZ3 Glycosyl transferase, family 2 n=7 Tax=Gammaproteobacteria RepID=A0KUZ3_SHESA Length = 249 Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 43/110 (39%), Positives = 65/110 (59%), Gaps = 4/110 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALN 58 M+ PL+SI MP++N + +I+SV+ Q Y NWE++I DD S + + ++ Y Sbjct: 1 MQEQPLVSIIMPSYNSAKTIAESIESVISQTYQNWELLITDDVSVDNTRDIVRSYCA--K 58 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA 108 D RI D+NSGA A RN +I ++G YI +D DD W ++LS +A Sbjct: 59 DSRIKLFELDVNSGAGASRNNSIENSKGIYIAFLDSDDLWFSDKLSTQIA 108 >UniRef50_UPI0001B495DA glycosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B495DA Length = 332 Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 63/224 (28%), Positives = 104/224 (46%), Gaps = 18/224 (8%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRITY 64 ISI +P +N ++ + I+S++ Q+YS+WE+I+VDD ST S + +Y A D RI Sbjct: 9 ISIIVPVYNSEKHLCKMIESIIEQEYSHWELILVDDGSTDRSGDVCDEY--AQKDTRIKV 66 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND- 123 H N G RN+A+ G+Y+T ID DD P+ L F+ F YA+D Sbjct: 67 YHQK-NGGVSVARNKALDKISGDYVTFIDSDDWVEPSYLLRFM---------EFPYADDQ 116 Query: 124 YVCQGEV--YSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 V QG Y + + P R+ + I+G T + FD L+ +D Sbjct: 117 IVFQGYFMDYVGAVAKESTKEIPKQGRILHLGAILGKMFATPIIKLHNIRFDERLRLHED 176 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 + +F ++Y + ++++GE ++ + YF Sbjct: 177 H-VFYYECLKYARQFDFSSYAGYHYMHYGEQSLSRRLQSTDAYF 219 >UniRef50_Q2CBS3 Truncated glycosyltransferase n=1 Tax=Oceanicola granulosus HTCC2516 RepID=Q2CBS3_9RHOB Length = 571 Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 56/210 (26%), Positives = 96/210 (45%), Gaps = 23/210 (10%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+SI M +N + AI ++L Q Y +E+I+VDDCS + A D R+ Sbjct: 314 GPLVSIVMTAYNAEAHIGTAIGAILAQSYRRFELIVVDDCSGDATAERVREIAARDGRVR 373 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + +N G +N I A+G ++T D DD WT + L+ LA Q + +D Sbjct: 374 LMSTKVNGGTYLAKNAGIRTAKGAFVTLCDSDDMWTTDHLAAHLAAMQADAGRT-VSTSD 432 Query: 124 YV-----CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 ++ C+ ++ + + P S + RR + R +G LFD+ ++ Sbjct: 433 WLRLFDDCRVDINPTGTIVEVCPHSTFFRREVFDR--VG-------------LFDS-VRF 476 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHIN 208 D + R+ +E+G+P V ++L I Sbjct: 477 GADREFIARITLEWGQP-AVHHIPRVLTIG 505 >UniRef50_A0KM86 Glycosyl transferase, group 2 family protein n=2 Tax=Gammaproteobacteria RepID=A0KM86_AERHH Length = 268 Score = 80.9 bits (198), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 37/98 (37%), Positives = 56/98 (57%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ MP +N + +I SVL+Q Y+NWE+I++DDCS + D RI + Sbjct: 4 VSVIMPLYNSARFLAASIDSVLKQQYTNWELILIDDCSVDDCVVIAQAYCDRDSRIKLVR 63 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 NSGA RN+ I +A+G +I +D DD W P +L+ Sbjct: 64 LAENSGAAVARNRGIEIAEGRFIAFLDSDDLWLPEKLN 101 >UniRef50_Q1N7I9 Predicted glycosyltransferase n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1N7I9_9SPHN Length = 1454 Score = 80.9 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 61/194 (31%), Positives = 88/194 (45%), Gaps = 9/194 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 + +SI MP NR + AI SVL Q ++++E+II+DD ST + L + DPRI Sbjct: 1000 DTFVSIIMPVLNRAAIIGDAIASVLAQSHAHFELIIIDDGST--DDLPSALERYPDPRIL 1057 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N + A RN+ + A GE I +D D+ W P+ L+V + HA YA Sbjct: 1058 LISNARSERVSAARNRGLDRASGEIIAYLDSDNSWDPDYLAVMVNALADHRGHACAYAGQ 1117 Query: 124 YVCQGEVYSQPASLP---LYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL---FDTELK 177 + Q + A P++R +RN I +F R L FD L Sbjct: 1118 SIFQAVTSADGAGQEERRAIRLCPFNRSRLEERNYIDLNIFVHR-RHLVALHGGFDKSLH 1176 Query: 178 AAQDYDIFLRMVVE 191 D+D+ LR E Sbjct: 1177 RLVDWDLILRYTPE 1190 >UniRef50_Q72QG5 Glycosyltransferase n=2 Tax=Leptospira interrogans RepID=Q72QG5_LEPIC Length = 298 Score = 80.9 bits (198), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 58/197 (29%), Positives = 98/197 (49%), Gaps = 14/197 (7%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP--R 61 NPL+S+ +PT+N ++ SV+ Q Y NWE I++D+ S + + V++ + R Sbjct: 2 NPLVSVVIPTYNHSDFLKLSLASVINQTYLNWEAIVIDNHSN--DNTDEVVSSFGNSKVR 59 Query: 62 ITYIHNDINSGACAV-RNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 +T I N+ G +V RN I A+G++I+ +D DD W PN+L + Q+L + Sbjct: 60 LTKIKNN---GVISVSRNLGIKEAKGDWISFLDSDDLWFPNKLERVVFEIQKLENQIDVL 116 Query: 121 ANDYVCQGEVYSQPASLPLYP-KSPYSRRLFYKRNIIGNQVFTWAWRF---KECLFDT-- 174 ND + +L P + + R++ + N + T +F KE LF+ Sbjct: 117 CNDEYMVHLNQKKKITLNYGPFEDDFYRKMLFYGNRLSTSATTVRKKFLKEKELLFNENQ 176 Query: 175 ELKAAQDYDIFLRMVVE 191 E +DYD +LR+ E Sbjct: 177 EFVTVEDYDFWLRLAKE 193 >UniRef50_B0SAI4 Glycosyltransferase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SAI4_LEPBA Length = 254 Score = 80.5 bits (197), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 40/99 (40%), Positives = 53/99 (53%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ MP +N ++ S+L Q Y +WE++I+DDCS A D RI Sbjct: 2 PKVSVIMPAYNAASYLEESVNSLLSQSYKDWELLIIDDCSKDHTLDVAKNIAKKDKRIRV 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 I + NSG+ RNQ I LA GEYI +D DD W N L Sbjct: 62 IPKEKNSGSADTRNQGIELASGEYIAFLDADDLWESNFL 100 >UniRef50_B9BSY2 Glycosyl transferase, family 2 n=4 Tax=Proteobacteria RepID=B9BSY2_9BURK Length = 632 Score = 80.5 bits (197), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 64/211 (30%), Positives = 103/211 (48%), Gaps = 20/211 (9%) Query: 5 PLISIYMPTWNRQQLAIRA-IKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--DPR 61 PLISI +PT+N + +R I+SVL Q Y +WE+ I DD S+S E ++ + A D R Sbjct: 90 PLISIIVPTYNSDERFLREMIESVLAQVYPHWELCIADDASSS-ESVRAVLEAARARDDR 148 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I ++ +N N A+ +A GEYI +D DD P+ L + + + + LY+ Sbjct: 149 IKVVYRPVNGHISEASNSALEIATGEYIALLDHDDVIPPHALFMVVKYLNKHPEARMLYS 208 Query: 122 ND--YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKA 178 ++ GE S Y KS ++ +LF +N+ + V+ A F + Sbjct: 209 DEDKLSADGERTSP------YFKSDWNPQLFLAQNMFSHLGVYETALVRAAGGFRKGFEG 262 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINH 209 +QDYD+ LR V ++ ++HI H Sbjct: 263 SQDYDLALRCV-------EIAGHNAVVHIPH 286 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 27/94 (28%), Positives = 52/94 (55%), Gaps = 1/94 (1%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P SI +PT + + + I S+ + Y N+E+I+VD+ S + E L + + D R+ Sbjct: 350 PKTSIIIPTRDGLSILKQCIDSIFAKTIYPNYEIIVVDNGSVNPETLAYFDSLRTDSRVR 409 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE 97 + +D A+ N+A+ +A G+Y+ +++D E Sbjct: 410 ILRDDSPFNFSALNNRAVEIATGDYVCLLNNDIE 443 >UniRef50_B3E7E3 Glycosyl transferase family 2 n=1 Tax=Geobacter lovleyi SZ RepID=B3E7E3_GEOLS Length = 663 Score = 80.5 bits (197), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 59/191 (30%), Positives = 94/191 (49%), Gaps = 9/191 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +P +S+ MP +N ++ AI S+L Q + ++E+IIVDD ST + + + +D RI Sbjct: 3 SPRVSVLMPVYNGERFLREAIDSILGQSFHDFELIIVDDGST--DSSAACIASYHDTRII 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS---VFLAHKQQLVTHAFLY 120 + N+IN G RN+ I LA+GEYI +D DD T RL+ FL Q + A Sbjct: 61 SVKNEINQGIVVTRNRGIELARGEYIALMDCDDISTEERLAEQVAFLDTDQSVAAVAACI 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKR-NIIGN-QVFTWAWRFKECLFDTELKA 178 + V + + + +P R F R N + N + A ++ + A Sbjct: 121 --ECVDENGTFLSGWADDRSTVTPDQIRAFLPRANCVANSSLMIRAKILRQYRYRGGRDA 178 Query: 179 AQDYDIFLRMV 189 +DYD+ LR+V Sbjct: 179 VEDYDLLLRLV 189 >UniRef50_A3VRJ5 Glycosyltransferase n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VRJ5_9PROT Length = 307 Score = 80.5 bits (197), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 42/126 (33%), Positives = 67/126 (53%), Gaps = 14/126 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL----NDP 60 P++S+ +P + + + + +V +Q ++ WE IIVDDCS V+A+ +DP Sbjct: 10 PVVSVIIPAYKSADVLPQTLATVTQQTFTAWEAIIVDDCSP-----DDIVSAIKPFADDP 64 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI +H+ N GA A RN I A+G ++ +D DDEW P++L+ KQ A + Sbjct: 65 RIRLVHHAHNQGAAAARNTGIGEARGRFVAFLDADDEWHPDKLA-----KQVQAVEAAPH 119 Query: 121 ANDYVC 126 N C Sbjct: 120 PNRVFC 125 >UniRef50_B8GFE2 Glycosyl transferase family 2 n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GFE2_METPE Length = 310 Score = 80.5 bits (197), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 39/99 (39%), Positives = 59/99 (59%), Gaps = 2/99 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT+NR L AI+SVL Q Y + E+I+VDD S + ++ + + D R+ Y Sbjct: 6 PTVSVILPTYNRAHLLPTAIESVLNQTYRDLELIVVDDGSK--DSTEEVIRTITDERLRY 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 I + N GA RN I A+G +I D DD+W P++L Sbjct: 64 IRYEPNQGANHARNVGIQAARGPFIAFQDSDDDWFPDKL 102 >UniRef50_Q062P8 Putative teichoic acid/polysaccharide glycosyl transferase, family 2 n=1 Tax=Synechococcus sp. BL107 RepID=Q062P8_9SYNE Length = 269 Score = 80.5 bits (197), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 60/196 (30%), Positives = 90/196 (45%), Gaps = 15/196 (7%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M P +S +P NR A+ S L Q + +WE ++VDD S S + L V A D Sbjct: 1 MSKQPKVSFLIPAKNRPHELKAALGSCLAQSFEDWEALVVDDHSDSSD-LASVVNAFQDD 59 Query: 61 RITYIH-NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLV---TH 116 R+ + +D+N G + RNQAI +AQ + +D DD P+R + QL+ Sbjct: 60 RLRFHRLDDLNRGVSSARNQAIAMAQSDRFITLDSDDLNHPHRAARCF----QLLNPSNP 115 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL---FD 173 A +Y + ++P P P+S LF N I N + + E F Sbjct: 116 ALIYTRVRLFSS---ARPEGRPKPVLQPFSSALFEMVNFITNPGTAFTRKAFEAAGEGFR 172 Query: 174 TELKAAQDYDIFLRMV 189 +L A+DYD++LRM Sbjct: 173 PDLTMAEDYDLYLRMA 188 >UniRef50_Q4A117 Putative glycosyltransferase n=1 Tax=Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 RepID=Q4A117_STAS1 Length = 254 Score = 80.1 bits (196), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 63/207 (30%), Positives = 103/207 (49%), Gaps = 22/207 (10%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND--PR 61 N L+S+ MP +N + ++I SV+ Q Y NWE++I++D S + ++ T +D Sbjct: 3 NGLVSVIMPLYNNEDYIEKSILSVINQTYENWEILIINDKSV--DSSKEIATKYSDIYSN 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I+ IN+G RN I A+GEY+ +D DDEW P +L ++Q+ F+ Sbjct: 61 IKLINLKINNGVANARNIGINNARGEYMAFLDSDDEWLPQKL------EKQI---NFMNK 111 Query: 122 NDY--VCQ--GEVYSQPASLPLYPKSPYSRRLFYK---RNIIGNQVFTWAWRFKECLFDT 174 NDY C G++ S+ + LP K YS L Y +N IGN + Sbjct: 112 NDYNFTCTYYGKMDSENSVLPTVIKPKYS--LNYNQILKNNIGNSTAIINVKKLGKFTVP 169 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEA 201 +K DY ++L+++ + + +EE Sbjct: 170 LIKKRNDYALWLKVIKKAKKVHTLEEV 196 >UniRef50_B0TYV9 Putative teichuronic acid biosynthesis glycosyl transferase n=1 Tax=Francisella philomiragia subsp. philomiragia ATCC 25017 RepID=B0TYV9_FRAP2 Length = 247 Score = 80.1 bits (196), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 49/125 (39%), Positives = 68/125 (54%), Gaps = 7/125 (5%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPR 61 + L+SI P++N ++ I+SV+ Q Y NWEMIIVDDCST S + ++Y D R Sbjct: 2 DSLVSIITPSYNSEKYISNTIESVITQTYQNWEMIIVDDCSTDSSCDIAEKYTQK--DSR 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS---VFLAHKQQLVTHAF 118 I I + NSG RN+AI A+G YI+ +D DD W +L F+ VT + Sbjct: 60 IKLIKLNKNSGPAKARNRAIKEAKGRYISFLDSDDIWYHRKLEKQVYFMKENDLAVTCSS 119 Query: 119 LYAND 123 Y D Sbjct: 120 YYIVD 124 >UniRef50_A3I1Y0 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I1Y0_9SPHI Length = 255 Score = 80.1 bits (196), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 45/119 (37%), Positives = 66/119 (55%), Gaps = 5/119 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 L+SI P N + AI+S+L+Q Y+NWE+I++DD S+ S E L Y D RI Sbjct: 2 LVSIITPVHNSIEHVEEAIQSILKQTYTNWELILIDDKSSDSSLEVLHAYEE--KDSRIH 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + N+ N GA RN+ I ++G +I +D DD W P +L +A + HAF Y + Sbjct: 60 LLKNEKNKGAALTRNRGIEASKGRFIAFLDSDDLWEPTKLETQIAFMLE-KGHAFTYTS 117 >UniRef50_B2GLA0 Putative glycosyltransferase n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GLA0_KOCRD Length = 345 Score = 80.1 bits (196), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 69/223 (30%), Positives = 108/223 (48%), Gaps = 14/223 (6%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDP 60 PL+S+ + +N Q + +AI SVLRQ + + E+ +VDD ST + E L Y AL+ P Sbjct: 12 EQPLVSVVIAVYNGQDMVGQAISSVLRQTHPSIEVFVVDDGSTDHTPEVLAGY-EALDRP 70 Query: 61 -RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLV----- 114 R + N+G A RN+A+ L G+Y+T +D DDE P+ L L +QL Sbjct: 71 GRAVRVLRQENAGCGAARNRALELISGDYVTFLDADDELLPHHLETMLERYRQLQREPGR 130 Query: 115 THAFLYANDYVC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN--QVFTWAWRFKEC 170 +Y N + C G + + P R + + N IG+ +F A+ + Sbjct: 131 ERTVVYTNAWQCTPSGINAERMVNREAMPAPEDQRSVILEYN-IGHLFGLFPRAFFEEVG 189 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 FD E +D++++LR V +V+E T IL + MQ Sbjct: 190 TFDPEQVFVEDWELWLRAVYSGWRFDRVDEVTVILSWSGSSMQ 232 >UniRef50_B1FXK1 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B1FXK1_9BURK Length = 723 Score = 80.1 bits (196), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 63/195 (32%), Positives = 96/195 (49%), Gaps = 12/195 (6%) Query: 5 PLISIYMPTWNRQQLAIR-AIKSVLRQDYSNWEMIIVDDCSTSWE---QLQQYVTALNDP 60 P +SI MPT+N + +R A+ SV+ Q Y NWE+ I DDCST E L YV A Sbjct: 182 PKVSIVMPTYNTPEKWLRKALDSVVDQVYENWEICIADDCSTKPEVRMVLDSYV-AKYPG 240 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 ++ + N A N A+ LA GE++ +D DDE P L + + A +Y Sbjct: 241 QVKVAYRTTNGHISASSNTALELATGEFVGLLDHDDELHPLALYCVIEKLNEHPDAALIY 300 Query: 121 AN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKA 178 ++ D + + S P Y K ++ LF +N+I + V+ + F T L+ Sbjct: 301 SDEDKISEEGERSDP-----YFKCDFNYDLFLSQNMISHFGVYKTSVLRDIGGFRTGLEG 355 Query: 179 AQDYDIFLRMVVEYG 193 +QDYD+ LR++ G Sbjct: 356 SQDYDLALRVIDRVG 370 Score = 44.7 bits (104), Expect = 0.003, Method: Compositional matrix adjust. Identities = 30/127 (23%), Positives = 59/127 (46%), Gaps = 6/127 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P + I +PT + +L + ++SV ++ YSN+ + I+D+ S E + D RI Sbjct: 443 PSVEIIIPTRDSAELVEQCVESVRQKSTYSNYRITIIDNGSVKPETHDLFARLQADERIK 502 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-----WTPNRLSVFLAHKQQLVTHAF 118 I +D A+ N+ + + +++ +++D E W +SV + V Sbjct: 503 VIRDDSPFNYSAINNRVALASTADFVCLMNNDIEVINADWLEEMVSVAIQKNVGAVGAKL 562 Query: 119 LYANDYV 125 LY +D + Sbjct: 563 LYPDDTI 569 >UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH37_9FIRM Length = 323 Score = 80.1 bits (196), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 56/173 (32%), Positives = 88/173 (50%), Gaps = 16/173 (9%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 N +ISI +P + ++ + IKS++ Q YSN E+I+VDD S L A D RI Sbjct: 2 NTMISIIIPVYKVEKYLDKCIKSIVSQTYSNLEIILVDDGSPDKSGLICDKWATLDSRIR 61 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ---LVTHAF-- 118 +H N+GA A RN A+ LAQGE+I+ +D DD + N L++ ++ +V F Sbjct: 62 VVHQK-NAGAGAARNVALKLAQGEFISFVDSDDYLSVNMFKNLLSYFEEDIDIVECEFIS 120 Query: 119 ------LYANDYVCQGEVYSQPASLPLYPKSPYSRRL----FYKRNIIGNQVF 161 L+ ++ Q V+S ++ + Y R+L Y+R+ I N F Sbjct: 121 VEEDDVLFIDEKSVQARVFSSEEAMREHIADHYFRQLIWNKLYRRSCIKNVYF 173 >UniRef50_A3DF58 Glycosyl transferase, family 2 n=2 Tax=Clostridium thermocellum RepID=A3DF58_CLOTH Length = 310 Score = 80.1 bits (196), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 41/103 (39%), Positives = 62/103 (60%), Gaps = 4/103 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTA-LNDPR 61 L+S+ +PT+N Q RA+ SVL Q Y N E+IIVDD + E+ Q+ + LND R Sbjct: 3 LVSVIIPTYNGSQYISRALNSVLNQTYKNIEIIIVDDNGRNTEEQLKTQKIIEPFLNDKR 62 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 + Y+ ++ N A RN + G+YI +DDDDE+ PN+++ Sbjct: 63 VKYVIHEENKNGAAARNTGFRNSNGKYIAFLDDDDEYLPNKIA 105 >UniRef50_B8F9T0 Glycosyl transferase family 2 n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9T0_DESAA Length = 309 Score = 80.1 bits (196), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 79/286 (27%), Positives = 126/286 (44%), Gaps = 39/286 (13%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 +S+ +P N RA++SV Q Y++WE +++++ S + V ++ + RI + Sbjct: 2 AFVSVVIPVRNHPAELERALRSVEAQTYTDWECLVIENGSDDPSVNKAVVESMENARIRF 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I + A RN I A G+YI +D DDEW N L++ L K + +Y N Y Sbjct: 62 IDIGLVDNANYARNAGIEKANGDYIAFLDSDDEWLSNHLALSL-EKIRTTGALAVYGNFY 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRR--LFYKRNIIGN---QVFTWAWRFKECL---FDTEL 176 V G S+ A+ KSP R+ K I G Q ++ CL +D +L Sbjct: 121 VDNG--LSRVAT-----KSPSIRKDEAGDKYRIAGGGKAQTSSFVIEKNACLEVGWDNDL 173 Query: 177 KAAQDYDIFLRMVVEYGEPWK-VEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 + QD D F+RM + W V E +++ G + + + Y KH+ ++ Sbjct: 174 RRGQDVDFFIRMHASH--LWAHVHEPAVVVYWQQGAARNIDT----ASCLKMYTKHEKQW 227 Query: 236 DRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKR--LADGIRGR 279 D S+ L +LY L ++ G R L D IRGR Sbjct: 228 DTESRVTYLRSLY--------------LQGIQYGDREDLIDDIRGR 259 >UniRef50_A6BHD3 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BHD3_9FIRM Length = 410 Score = 80.1 bits (196), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 58/202 (28%), Positives = 93/202 (46%), Gaps = 15/202 (7%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NP ISI +P +N ++ AI+SVL Q Y+N+E+IIVDD ST + + A +D RI Sbjct: 2 NPKISIILPNYNNEKYLKEAIESVLNQTYTNFELIIVDDASTDGS--LKIIRAFDDKRIK 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS--VFLAHKQQLVTHAFLYA 121 I +++N N I A GEYI ID DD W +L + + F Sbjct: 60 VITSEVNRHVAYASNLGIKYASGEYIAKIDSDDIWENQKLEKQIEFMENHKEYGGCFTKV 119 Query: 122 N--DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--GNQVFTWAWRFKECLFDT--- 174 N D + L+ K+ + + R + GN + + ++C+F+ Sbjct: 120 NIIDEKSEDANIKYKVIFDLFEKAKNQSQKEWLRFFLSEGNCLCNSSSLIRKCIFEKIDG 179 Query: 175 ----ELKAAQDYDIFLRMVVEY 192 A+DY++++R V+ Y Sbjct: 180 FFNLAYVGAEDYELWVRTVIRY 201 >UniRef50_Q2SD88 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SD88_HAHCH Length = 343 Score = 80.1 bits (196), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 46/142 (32%), Positives = 77/142 (54%), Gaps = 9/142 (6%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 PL+S+ MP +NR++ +AI+SVL Q ++++E+I+VDD ST S E + + D R+ Sbjct: 13 PLVSVIMPVYNREKTVAQAIESVLAQTFADFELIVVDDGSTDRSAEIVNGFT---QDARV 69 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF---L 119 Y + + RN + LA+G+++ +D DD WTP +L +A +Q L Sbjct: 70 RYRLQENSGRPSLARNSGLKLARGQWVAFLDSDDRWTPAKLERQIALLEQCAAQGVSLDL 129 Query: 120 YANDY-VCQGEVYSQPASLPLY 140 +DY V + +V P+ Y Sbjct: 130 VISDYEVMENDVLKHPSFFKAY 151 >UniRef50_B4WMJ9 Glycosyl transferase, group 2 family protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WMJ9_9SYNE Length = 313 Score = 79.7 bits (195), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 58/195 (29%), Positives = 96/195 (49%), Gaps = 15/195 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL S+ +P +N Q ++SV +Q ++E+I++DD ST + + V+ +DPR+ Sbjct: 2 PLASVIVPVYNGAQTIHATLESVFQQTAQDFELIVIDDGST--DSTLEVVSKFDDPRLR- 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-ND 123 + + N+G RN+ + A G+YI+ ID DD WTP +L++ L + A Y+ + Sbjct: 59 VFSYCNAGVATSRNRGVDHASGKYISFIDADDLWTPKKLALQLEALESTPEAAVAYSWTN 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLF-----YKRNI-IGNQVFTWAWRFKECLFDTELK 177 Y+ Q + A + Y+ L N+ I +QVF + FD L Sbjct: 119 YIDQEGRFIDKAQRVDFSGDVYAELLLRDFLESASNVTIRHQVF-----LEFGGFDLSLT 173 Query: 178 AAQDYDIFLRMVVEY 192 A D+D FLR+ Y Sbjct: 174 GAADWDFFLRLAKHY 188 >UniRef50_Q8KI14 Similar to Glycosyl transferase n=1 Tax=Pseudomonas aeruginosa RepID=Q8KI14_PSEAE Length = 264 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 44/106 (41%), Positives = 61/106 (57%), Gaps = 1/106 (0%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N L+SI P++N ++L R I+SVL Q + NWEMIIVDDCS + A D RI Sbjct: 5 NEGLVSIITPSYNAEKLIGRTIQSVLDQTFDNWEMIIVDDCSKDSTRSVVAAYAEKDSRI 64 Query: 63 TYIHNDINSGA-CAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL 107 + + N+GA A RN + A G++I +D DD W P +L + L Sbjct: 65 RLVGLEKNNGAPAAPRNIGVQHASGDWIAFLDADDIWHPRKLELQL 110 >UniRef50_Q9XBL5 Beta-D-1,6 Glucosyl transferase n=2 Tax=Gluconacetobacter xylinus RepID=Q9XBL5_ACEXY Length = 321 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 64/253 (25%), Positives = 114/253 (45%), Gaps = 26/253 (10%) Query: 3 NNPL-ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW--EQLQQYVTALND 59 N P+ +S+ + T+N Q +RA+ S L Q ++ E+IIVDDCST + L + A + Sbjct: 2 NIPMSVSLVITTYNAQAFIMRAVSSALEQTHAPLEIIIVDDCSTDGTPDLLNRLARAHDS 61 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ + N G RN + A+G++I +D DD + P+RL V +A + V Sbjct: 62 IRV--MSTPYNGGPSVARNVGLGAARGDWIAFLDADDAFAPDRLEVLMA--RAAVGDCDA 117 Query: 120 YANDY----VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWA-----WRFKEC 170 A+D G+V + P++P R + N G + F W +R + Sbjct: 118 VADDLAYYDAMAGQVTGRAMGAGQVPRTPIGLRDYIAHNRSGGEGFDWGLLKPVFRREFM 177 Query: 171 L-----FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 L +D ++ +D+ + + ++ G + ++ AT + G S ++ SG Sbjct: 178 LAHAIRYDPVVRHGEDFLLMVAFLMAGGRFFLIDHATYLYTQRQG-----SVSRRASGMS 232 Query: 226 HFYRKHKDKFDRA 238 ++D D A Sbjct: 233 RTVIAYRDMHDTA 245 >UniRef50_B7MWU8 Glycosyltransferase n=1 Tax=Escherichia coli ED1a RepID=B7MWU8_ECO81 Length = 268 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 61/207 (29%), Positives = 100/207 (48%), Gaps = 21/207 (10%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + +SI MPT+N +I SV+ Q Y++W + I+DD ST + ++ D Sbjct: 7 MSSYETVSIIMPTFNAASYISYSIISVIEQSYNDWHLYIIDDGST--DNTAHEISNFTDL 64 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RITY+ INSG RN I +A+G++I +D DD W +LS+ + ++ Sbjct: 65 RITYVMLPINSGVANARNIGIKMAKGKFIAFLDSDDIWEKEKLSLQIPLLEK-------- 116 Query: 121 ANDYVCQGEVY--SQPASLPLYPKSP----YSRRLFYKRNIIGNQVFTWA-WRFKECLFD 173 D VC ++P + +P YSR L + N IGN + R +CL Sbjct: 117 GYDVVCSNYAIFKNEPKKIIGSRTAPQQITYSRLL--RGNCIGNLTGIYNRERLGKCL-- 172 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEE 200 E +DY ++L++V G + +++ Sbjct: 173 QEECGHEDYLMWLQLVKLSGSAYCIQK 199 >UniRef50_C5ZW04 Putative glycosyltransferase n=1 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZW04_9HELI Length = 327 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 41/99 (41%), Positives = 62/99 (62%), Gaps = 4/99 (4%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRITY 64 ISI MP +N ++ R+I SVL+Q+Y NWE+I VDDCS ++++LQ+YV+ D RI Sbjct: 4 ISILMPCYNSEEFLQRSINSVLQQEYENWELICVDDCSKDDTYKKLQEYVS--KDTRIKA 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 I +IN G A N ++ G++I + DDE + + L Sbjct: 62 IKREINGGCAASLNVSLEQCSGDFIFVLGHDDEISRDCL 100 >UniRef50_Q64WD3 Glycosyltransferase n=1 Tax=Bacteroides fragilis RepID=Q64WD3_BACFR Length = 323 Score = 79.3 bits (194), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 43/102 (42%), Positives = 59/102 (57%), Gaps = 4/102 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS----WEQLQQYVTALNDPR 61 L+S+ +PT+ R + RAI SVL Q YSN E+I+VDD S E +Q +DPR Sbjct: 5 LVSVIIPTYKRPNMLGRAIDSVLEQSYSNIEVIVVDDNSDGDKYRLETIQYMERYADDPR 64 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 + YI + IN A RN I + GEYI +DDDD + +R+ Sbjct: 65 VKYIKHKINQSGSAARNTGIQNSVGEYIAFLDDDDYFFKDRV 106 >UniRef50_Q3K8V0 Putative glycosyl transferase n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3K8V0_PSEPF Length = 1076 Score = 79.3 bits (194), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 75/244 (30%), Positives = 112/244 (45%), Gaps = 33/244 (13%) Query: 3 NNPLISIYMPTWNRQQLAIR-AIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALND 59 + P +++ +PT++R ++ +I SVL Q Y N+E+IIVDD S S LQ+Y+ D Sbjct: 20 SKPKVTVILPTYSRGHGPLQESIDSVLAQSYRNFELIIVDDGSRDGSAAVLQEYLK--KD 77 Query: 60 PRITYIHNDINSGACAVR-NQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 PRI NSG A+R NQA + A+G+YI DDD WT + L V + ++ A Sbjct: 78 PRIIVHSYHKNSGLPALRVNQAALRAKGKYIAYQFDDDMWTEHSLQVRVEQLEKCTRPAV 137 Query: 119 LYANDYV----CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDT 174 YAN V G + ++ P++ L N I N + LFD Sbjct: 138 AYANASVDIALADGSITTRKLG------GPFNYGLLMNGNYIANNTVM----HHKSLFDI 187 Query: 175 E--------LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH 226 ++ DYD++LR W V+E T + N + S K+ +F Sbjct: 188 AGMYDSHVMMRRYSDYDLWLRFAKHADFIW-VDEVTSHVRAN----LVDSLGKEIPLFFA 242 Query: 227 FYRK 230 YRK Sbjct: 243 RYRK 246 >UniRef50_Q11KM8 Glycosyl transferase, family 2 n=1 Tax=Chelativorans sp. BNC1 RepID=Q11KM8_MESSB Length = 354 Score = 79.3 bits (194), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 54/204 (26%), Positives = 93/204 (45%), Gaps = 21/204 (10%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALND 59 + PL+S+ M + + A+ SVL Q + E+++ DD S S + +Q Y++ D Sbjct: 7 ETGPLVSVIMANFRSAEYLAAALDSVLAQTIGDIEVLVSDDASPDNSTDIVQHYMS--RD 64 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL----VT 115 R+ I D N G A RN+A+ A+GE+I +D DD P RL + +A ++L + Sbjct: 65 ARVNLITTDENKGPAAARNRALEAARGEWIAIVDSDDIIHPQRLEILIAMAEELGADGIA 124 Query: 116 HAFLYANDYVCQGEVYSQPASL-PLYPKSPYSRRLFYKRNIIGNQVFTWAW--------R 166 L+ +D + A+ PL Y F + N GN + + + Sbjct: 125 DDLLFFSDQGAGPTLLGATAAPEPLEITPSY----FIRSNTSGNGLAPLGYVKPLFRRAK 180 Query: 167 FKECLFDTELKAAQDYDIFLRMVV 190 +D ++ +DYD LR ++ Sbjct: 181 LSGLSYDEAVRIGEDYDFLLRFLL 204 >UniRef50_D2L8E2 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L8E2_9DELT Length = 316 Score = 79.3 bits (194), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 51/191 (26%), Positives = 93/191 (48%), Gaps = 14/191 (7%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ +PT +R L +RA+ SVL Q ++ E+++VDD S + ++AL DPR+ + Sbjct: 43 LVSVVVPTRDRAALVVRAVASVLAQTHAALEVLVVDDGSV--DDTVARLSALGDPRLRVL 100 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + G A RN + A+G Y+ +D DDEW P ++ LA ++ ++ Sbjct: 101 SHAAGRGVSAARNTGLGAARGAYVALLDSDDEWLPAKIERQLAFMRERKLSVSQTQEIWM 160 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNII------GNQVFTWAWRFKECLFDTELKAA 179 G + P + P F+++ ++ + A+ + FD L A Sbjct: 161 RNGRRVN-PTAANRKPDG-----FFFEQALVRCVVSPSTSMGVRAYWEEMGGFDETLPAC 214 Query: 180 QDYDIFLRMVV 190 +DYD++LR ++ Sbjct: 215 EDYDLWLRTLL 225 >UniRef50_Q1Q4P0 Similar to glycosyltransferase family 2 n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q4P0_9BACT Length = 295 Score = 79.3 bits (194), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 63/202 (31%), Positives = 95/202 (47%), Gaps = 28/202 (13%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ MPT+N +I+SVL Q Y +E++++DD ST +L V +I Y Sbjct: 6 PLVSVIMPTYNCAVYLQESIESVLAQTYDAYEVVVIDDGSTDNTKL---VLKPYMEKIKY 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N G RN I A+GEY+ +D DD W P +L ++ F+ D Sbjct: 63 IDLGRNEGLPTARNLGIQSAKGEYVAFLDADDIWMPEKL--------EMSIDQFMKNPD- 113 Query: 125 VCQGEVYSQPASLPL------------YPKSPYSRRLFYKRN-IIGNQVFTWAWRF-KEC 170 G VYS+ ++ P +LF+++N II + V F K Sbjct: 114 --AGMVYSKHINIDSKSQFLEGKIRRRLPSGNIFTQLFFEQNFIICSSVVVRKEVFNKTA 171 Query: 171 LFDTELKAAQDYDIFLRMVVEY 192 LFD+ L QD+D++LR+ Y Sbjct: 172 LFDSGLVNCQDWDMWLRIAFYY 193 >UniRef50_A9GHY3 Glycosyl transferase, family 2 n=4 Tax=Rhodobacteraceae RepID=A9GHY3_9RHOB Length = 438 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 102/208 (49%), Gaps = 21/208 (10%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 P +S+ M + N AI++VL Q ++ E+IIVDD S SW ++ V A DPRI Sbjct: 6 PQVSVIMASRNAAAYLPLAIRAVLAQTLTDLELIIVDDASEDDSWAVIRAAVAA--DPRI 63 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + G RN A+ LA+GE++ D DD P RL + L + L A + A+ Sbjct: 64 RAQRRSHSGGPAIARNDALALARGEWVAICDSDDSQHPRRLELMLNAARDLA--ADVVAD 121 Query: 123 DYVC-QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR--------FKECL-- 171 D + G+ ++ A++ L +P + ++ ++++ + V K+C Sbjct: 122 DMILFSGQPMARGATI-LGRHAPKAAQMLNLKDLVLSGVDPEGISHIGYLKPLIKKCFLN 180 Query: 172 ---FDTELKAAQDYDIFLRMVVEYGEPW 196 +D L+ +D+D++LR+V+ + W Sbjct: 181 DLRYDPRLRIGEDHDLYLRLVLAGAKMW 208 >UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFR1_9BIFI Length = 327 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 41/96 (42%), Positives = 56/96 (58%), Gaps = 1/96 (1%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PLIS+ +P +N + + + S+LRQ Y+N E+I+VDD ST +L + DPRIT Sbjct: 7 PLISVVVPVYNVKPYVAKCLDSILRQTYTNIEIIVVDDGSTDGSELICDDYSNKDPRITV 66 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTP 100 IH N+G A RN I A GEY+ +D DD P Sbjct: 67 IHQR-NTGLAAARNTGIDAAHGEYLGFVDSDDFIEP 101 >UniRef50_B4RT76 Glycosyl transferase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RT76_ALTMD Length = 292 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 64/221 (28%), Positives = 99/221 (44%), Gaps = 26/221 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+IS+ +PT +RQ L +A++SV Q YSN E+I+VDD ST+ ++ A + Y Sbjct: 7 PVISVIVPTRDRQALFNKAVQSVFHQTYSNIELIVVDDHSTAPPVIED---APEHVEVKY 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 N+ SG RN + L+QGE+I +DDDD + N+L +QL + L A Sbjct: 64 HRNNKASGGAISRNTGVRLSQGEFICFLDDDDYYHENKLQTLF---EQLCADSSLSA--- 117 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNII----------GNQVFTWAWRFKECLFDT 174 V+ + L P ++ ++ II N F+E FD Sbjct: 118 -----VFGRIVKLS-EPARTMDKKYLDEQGIIKTPEAIRYLHTNSSLIRRHVFEEIRFDE 171 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 L QD + + ++ + V E + HG QIT Sbjct: 172 TLHKFQDTQLHIELICNKKCKY-VYEDVAYWNDKHGLAQIT 211 >UniRef50_B5W037 Glycosyl transferase family 2 n=2 Tax=Arthrospira RepID=B5W037_SPIMA Length = 330 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 55/192 (28%), Positives = 95/192 (49%), Gaps = 9/192 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P ISI +P +N + I SV+ Q + ++E+II++D ST ++ +++++DPR+ Sbjct: 2 PQISIIIPAYNADKTIKETIDSVINQTFKDFEIIIINDGST--DRTLDIISSIDDPRVQ- 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-ND 123 + + NSGA RN+ A +YI +D DD WT ++L + A + + Y+ + Sbjct: 59 VFSYPNSGASVSRNRGFSKASAQYIAFLDADDLWTSDKLELQYAALEADPEASVAYSWTN 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL---FDTELKAAQ 180 Y+ Q + P Y YS + +N + N R FD LK Q Sbjct: 119 YIDQYGEFLYPGFHQSYQGDVYS--ILLVKNFLENGSNPLITRHALSAIGGFDESLKGGQ 176 Query: 181 DYDIFLRMVVEY 192 D+D++LR+ +Y Sbjct: 177 DWDLYLRLAAQY 188 >UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=Q8YUP7_ANASP Length = 343 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 63/225 (28%), Positives = 108/225 (48%), Gaps = 26/225 (11%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRITY 64 +S+ +P +N + ++SVL Q Y + E+IIVDD ST S + +Q+ D RI Sbjct: 3 LSVIIPVYNSESSVAETLRSVLAQTYRDLEIIIVDDGSTDKSIDICKQF----QDERIRI 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH--KQQLVTHAFLYAN 122 IH N G RN I A+GEY+ +D DD W P +L+ L H + V +F ++ Sbjct: 59 IHQQ-NRGLAGARNTGIRQARGEYLAFVDSDDLWLPEKLAKHLEHFERSPEVGVSFSRSS 117 Query: 123 DYVCQGE---VYSQPASLPLYPKSPYSRR-LFYKRNIIGNQVFTWAWRFKECL------- 171 QG+ +Y P + P+ + R + +++ + +F+E L Sbjct: 118 LIDDQGKPLGIYQMPKLTDITPEYLFCRNPISNGSSVVIRRAVLDTIKFQENLYGEVEDF 177 Query: 172 -FDTELKAAQDYDIFLRMVVEYGEPWKVE---EATQILHINHGEM 212 FD + ++D + +LR+ ++ WK+E EA + +N G + Sbjct: 178 YFDDSFRQSEDIECWLRIALQ--TTWKIEGIPEALTLYRVNMGGL 220 >UniRef50_B4AD21 Glycosyltransferase n=17 Tax=Enterococcus faecalis RepID=B4AD21_ENTFA Length = 252 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 49/158 (31%), Positives = 79/158 (50%), Gaps = 13/158 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 L+SI MP +N + ++I+SVL Q Y NWE++++DD S D RI Sbjct: 4 ALVSIIMPMYNAGKFLSKSIESVLEQTYQNWELLLIDDGSKDDSIDIALAFMEKDSRIFL 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV---FLAHKQQLVTHAFLYA 121 + N+ N G RN+ I ++G+YI +D DD W PN+L V ++ K+ L F + Sbjct: 64 LKNEQNMGIAKTRNKGIEASKGQYIAFLDSDDLWLPNKLEVQIKWMEEKKLL----FTCS 119 Query: 122 NDYVC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIG 157 + +VC G + + + + P + + K N IG Sbjct: 120 SYFVCNENGNITHERN----FSEGPQTYQDLLKTNTIG 153 >UniRef50_A7B4C1 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B4C1_RUMGN Length = 355 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 46/145 (31%), Positives = 69/145 (47%), Gaps = 8/145 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+ S+ MPT+ ++ +AI+SV Q + NWE++++DDC+ A D RI Sbjct: 3 PVFSVVMPTYGVEKYIEKAIRSVQAQTFENWELLVIDDCTCDQSAEIACEIAKTDSRIRI 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 +H++IN G A RN I A GEYI +D DD P + A ++ L+ Sbjct: 63 VHHEINQGLSAARNTGIQEAVGEYIWFMDPDDWVEPELMQAVAASLEKNRAEMVLFG--- 119 Query: 125 VCQGEVYSQPASL----PLYPKSPY 145 Q E Y+ L P+ P Y Sbjct: 120 -LQEEYYAPDGHLQYSHPISPTEQY 143 >UniRef50_A3XGI2 Putative uncharacterized protein n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XGI2_9FLAO Length = 321 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 78/273 (28%), Positives = 122/273 (44%), Gaps = 37/273 (13%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ +PL+SI +PT+NR L + SVL Q Y+NWE I+VDD ST D Sbjct: 1 MQEHPLVSIIIPTYNRANLIGETLDSVLAQTYTNWECIVVDDGSTDTTDDVLAAYVEKDS 60 Query: 61 RITYIH--NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK-----QQL 113 R Y H N+ GA A RN L++GEY+ DDDD + L+HK ++L Sbjct: 61 RFQYHHRPNNRPKGANACRNYGFELSEGEYVNWFDDDD--------LMLSHKLHEGVKEL 112 Query: 114 VTHAFLYANDYVCQGEVYSQPASLP-------LYPKSPYSRRLFYKRNIIGNQVFTWAWR 166 F N +CQ E+YS + L ++P + + +K + W + Sbjct: 113 EKGLF---NFMICQSEMYSLKENRTKGLRCQKLVSQNPLNDYIQFKIFWLTGAPL-WKKK 168 Query: 167 F---KECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQIL---HINHGEMQITSSPKK 220 F + F+ +L +QDYD +R ++ GE +K + ++ H + I + K Sbjct: 169 FLIKNKIYFNEKLYQSQDYDYHIR-ILALGESYKTSDRIGVMIREHEKNMSNSIVDNHHK 227 Query: 221 FSG----YFHFYRKHKDKFDRASKKYQLFTLYQ 249 F F + + + +K+Y LYQ Sbjct: 228 FLSNVIVRFLVLKNYNHLISQETKRYLFTILYQ 260 >UniRef50_C9C3N2 Glycosyl transferase n=1 Tax=Enterococcus faecium 1,231,410 RepID=C9C3N2_ENTFC Length = 288 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 63/251 (25%), Positives = 117/251 (46%), Gaps = 30/251 (11%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-----QQYVTALNDP 60 ++S+ +PT+ R RAI SV +Q+++++E++I+DD E + + + + Sbjct: 2 MVSVVIPTFQRNDFLERAINSVFKQNFTDYEILIIDD--NVGENVYRINNRNLENKIQNS 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS---VFLAHKQQLVTHA 117 +I +I+N+ N G RN I AQG+YI +DDDD + P +L L K+ ++ ++ Sbjct: 60 KIRFIYNEKNLGGAEARNIGIKEAQGKYIAFLDDDDIFLPEKLKKMISVLEEKKGVMAYS 119 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW-RFKECLFDT-- 174 F+ +N VY+ A L+ +G+ T W ++ L + Sbjct: 120 FVKSNLGQEWKNVYNGNALFELFK--------------VGSIAATSQWIILRKALIEVGG 165 Query: 175 --ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 A QD + +++ E +EE + + H +I++S K G + + K+K Sbjct: 166 FDNTPAKQDSILTCKLLENGNEIICIEEFLSV-YFEHEFTRISTSGKTLIGEINLFNKYK 224 Query: 233 DKFDRASKKYQ 243 DR S + Q Sbjct: 225 SIKDRFSIQEQ 235 >UniRef50_C3AEG0 Glycosyl transferase, family 2 n=2 Tax=Bacillus cereus group RepID=C3AEG0_BACMY Length = 298 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 44/128 (34%), Positives = 74/128 (57%), Gaps = 8/128 (6%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 PL+SI +P++N +L A+KSV Q Y+N E+II+DD ST + E L+ N R Sbjct: 4 PLVSIIIPSYNPGRLIEFAVKSVENQTYTNKEIIIIDDGSTDGTLEWLEN-----NKERF 58 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 Y+ NSGA + RN+ + +A G++I +D DD W PN+L + ++ +F+++N Sbjct: 59 KYVTQK-NSGASSARNKGVNIATGDFIAFLDADDIWLPNKLEKQMGIFKEEQDISFVFSN 117 Query: 123 DYVCQGEV 130 + ++ Sbjct: 118 GLIIHDDI 125 >UniRef50_Q8YSM1 Alr3063 protein n=7 Tax=Cyanobacteria RepID=Q8YSM1_ANASP Length = 330 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 57/200 (28%), Positives = 101/200 (50%), Gaps = 13/200 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK+ LIS+ +P +N I+SV +Q +++WE+I+++D ST + + + ++ D Sbjct: 1 MKDMALISVIIPAYNAVLTIKETIESVQKQTFTDWEIIVINDGST--DGTPEIIQSIKDE 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ I N N G RN+ I+ A GE+I +D DD W ++L + L QQ Y Sbjct: 59 RLK-IFNYKNGGLPVARNRGILHASGEFIAFLDADDLWAVDKLEMQLKALQQHPEAGVAY 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYS-RRLFYKRNIIGNQVFTWAWRF--KECL-----F 172 + + C +V Q + P S YS Y+ ++ + + + + KE + F Sbjct: 118 S--WTCFMDVNEQGEPVAYLPSSQYSFTGNVYQNLLVSDFIHSGSNTLIRKEAINSVGEF 175 Query: 173 DTELKAAQDYDIFLRMVVEY 192 D LK+ +D+D +LR+ + Sbjct: 176 DPMLKSCEDWDYWLRLATHW 195 >UniRef50_C6E5U5 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M21 RepID=C6E5U5_GEOSM Length = 327 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 53/193 (27%), Positives = 93/193 (48%), Gaps = 19/193 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S +P +N+ Q A++S+L Q + ++E+++VDD ST +Q V + DPRI Sbjct: 6 PSVSWVVPVFNQAQYLAGALESMLAQTFEDFELVVVDDGST--DQSAAIVRSFKDPRIVL 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS---VFLAHKQQLV---THAF 118 + N N G N+ I A G+YI +D DD TP+RL+ F+ + +H Sbjct: 64 LENSCNQGVSRSLNRGIRAALGKYIARMDGDDLSTPDRLARQVAFMEDNPAIAVCGSHVE 123 Query: 119 LY-ANDYVCQGEVYSQPASLPLYPKSPYSR-RLFYKRNIIGNQVFTWAWRFKECLFDTEL 176 + A + + V S L P++ + +R+++ +D + Sbjct: 124 TFGAESRLVKRPVGSAAIKCFLLAGPPFTHPSVMLRRSVLEQH---------RLFYDEGM 174 Query: 177 KAAQDYDIFLRMV 189 AAQDYD++ R++ Sbjct: 175 GAAQDYDLWFRLL 187 >UniRef50_C0ZCI8 Putative glycosyl transferase n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZCI8_BREBN Length = 267 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 71/258 (27%), Positives = 123/258 (47%), Gaps = 21/258 (8%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPR 61 NP ++ +PT+NR + +AIKSVL Q +W+++I+DD ST + ++++Y + P Sbjct: 2 NPTFTVVIPTYNRAKFIRKAIKSVLNQTSKDWKLLIIDDASTDRTRSKIEKY---MYHPN 58 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA---HKQQLVTHAF 118 I Y + NSG V N+A+ + Y+ +D DD W P R L HK + T A Sbjct: 59 IMYHRMEQNSGIGKVMNKALSMVDTPYLVQLDSDD-WLPKRTLAVLKKYIHKSRKST-AL 116 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN--IIGNQVFTWAWRFKECLFDT-- 174 Y N + + S+ + K+ +++ F K N ++ + + + +DT Sbjct: 117 FYGNMNIWKVRHGKYTKSIKVKHKTFHNKYQFLKYNRWMVSPRCYRVEALHQVGGWDTSD 176 Query: 175 --ELKAAQDYDIFLRMVVEYGEPWKVEEATQILH-INHGEMQITSSPKKFSGYFHFYRKH 231 E + +D I LR++ E ++V QIL+ + Q+T S KF + + Sbjct: 177 RYEGRIMEDRRIILRLI----EKYRVRYINQILYNRTKHKGQLTDSKMKFRRNYLRKKTF 232 Query: 232 KDKFDRASKKYQLFTLYQ 249 K +R KKY+ Y+ Sbjct: 233 KYYLNRWGKKYKAIYGYK 250 >UniRef50_A7V5D9 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V5D9_BACUN Length = 295 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 50/116 (43%), Positives = 67/116 (57%), Gaps = 8/116 (6%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 ++SI +P +N+Q + R I+S+L Q Y NWE+IIVDD ST S E ++ Y L D RI Sbjct: 1 MLSIIIPLYNKQAVIARTIESILSQSYINWELIIVDDGSTDGSDEVVRLY---LGDQRIR 57 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS--VFLAHKQQLVTHA 117 YI N G + RN+ I A+GE+I ID DD + P L V LA K + A Sbjct: 58 YIRKP-NGGVSSARNRGIKEAKGEWICYIDADDYFLPEGLKTMVELAEKYNVKVAA 112 >UniRef50_D0S7H0 Glycosyltransferase family 2 n=1 Tax=Acinetobacter calcoaceticus RUH2202 RepID=D0S7H0_ACICA Length = 337 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 57/206 (27%), Positives = 102/206 (49%), Gaps = 22/206 (10%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ +P +N + AI SVL Q +++E+I+++D S ++ ++ + + +D RI Y+ Sbjct: 2 LVSVVLPAYNAELYLKEAIDSVLSQTCTDFELIVLNDGSV--DRTEEIILSYSDSRIVYV 59 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV---FLAHKQQLV---THAFL 119 N+ N G N+ I LA+G+YI +D DD P+R FL + V T A+ Sbjct: 60 KNEKNLGLIGTLNKGIDLAKGKYIARMDADDICYPDRFEKQINFLENNNDYVICGTAAYR 119 Query: 120 YANDYVCQGEVYSQPAS-----LPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDT 174 + ++ ++ P S L + SP+ RN I + K FD Sbjct: 120 FYSNNKSNNNIFRVPESDGLIRLKMLFNSPFIHPSIMMRNSIIKE--------KMLNFDI 171 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEE 200 + K A+DY F +++YG+ + ++E Sbjct: 172 QYKYAEDYK-FWSQLLKYGKAYNLQE 196 >UniRef50_B5L3X5 WfeA n=2 Tax=Enterobacteriaceae RepID=B5L3X5_ECOLX Length = 312 Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 67/235 (28%), Positives = 110/235 (46%), Gaps = 22/235 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND-P 60 K N +S+ +PT+ R RAI SV+ Q Y N E+I++DD + ++ + T + P Sbjct: 17 KMNFKVSVIIPTYGRPDNLKRAIDSVIEQTYKNIEIIVIDDNGINSKKGHETSTLVRHYP 76 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 I YI + NSG RN+ I A G+YIT +DDDD + P ++ ++QL F+ Sbjct: 77 HIIYIKLEKNSGGGMARNKGIERASGDYITFLDDDDYYYPEKI------QKQL---KFMI 127 Query: 121 ANDY---VCQGEVYSQPASLPLYPKSPYSRRLFY--KRNIIGNQVFTWAWRF-KECLFDT 174 N Y +C E+ + P S + YS + K +I FT KE L + Sbjct: 128 ENSYDISLCDMEI-ANPLSKKFKHRKKYSEANGFNLKDFLIAGVAFTPMIMVKKEVLLEV 186 Query: 175 ----ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 + QD+ + L+M+ + E +H + +I++SP+ G+ Sbjct: 187 NGFLDTPRFQDHTLMLKMLTVTENVGHLAEQL-FVHCSDYTARISNSPRSRKGFL 240 >UniRef50_C0CGZ7 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CGZ7_9FIRM Length = 280 Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 70/253 (27%), Positives = 110/253 (43%), Gaps = 41/253 (16%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP--- 60 N LIS+ +P++NR+ + I+SVL Q YSN E+I++DD ST + T ND Sbjct: 2 NELISVVIPSYNRKNTILSCIRSVLEQTYSNLEVIVIDDGSTDYT-----YTLFNDSFDD 56 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH-KQQLVTHAFL 119 R+ + + N GAC RN L+ G YI D DD W +L+ L++ K++ F Sbjct: 57 RVKFFRYEENRGACYARNLGAELSSGAYIAFQDSDDIWKKTKLAKELSYLKKENADMVFC 116 Query: 120 YAN--DYVCQGEVYSQPASLPLYPKSPY-SRRLFYKRNIIGNQVFTWAWRFK-----ECL 171 N D + + Y YP + S R + + N V T K + Sbjct: 117 GMNRKDPIHKTSKY--------YPDETFRSDRNILAQLLENNAVSTQTILIKREVINKIK 168 Query: 172 FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEM-------QITSSPKKFSGY 224 FD K QD+D L+ +I ++ G + I+++ K+ Y Sbjct: 169 FDVTFKRYQDWDFALQA---------ASSGIKIAYLAEGLVDSTIQGNSISANVKEGVAY 219 Query: 225 FHFYRKHKDKFDR 237 H KH D +++ Sbjct: 220 EHLIEKHMDLYNK 232 >UniRef50_B7BDW0 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BDW0_9PORP Length = 318 Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 49/127 (38%), Positives = 71/127 (55%), Gaps = 9/127 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPRI 62 P +SI +P + +Q I+SVL Q Y NWE+I+VDD S S + +Y A D RI Sbjct: 7 PRVSIIVPCYKVEQYLPTCIESVLHQSYDNWELILVDDGSPDKSGKICDEY--AKEDNRI 64 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDD----EWTPNRLSVFLAHKQQLVTHAF 118 IH N G A RN AI LA+GEYI+ +D DD ++ + +S+ L H+ +V + Sbjct: 65 KVIHK-TNGGVAAARNVAIDLAEGEYISFLDGDDFLHVDYVQDLISLALKHQAGIVQCNY 123 Query: 119 LYANDYV 125 + ND + Sbjct: 124 VRGNDRI 130 >UniRef50_C0QK40 Putative glycosyl transferase (Group 2 family protein) n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QK40_DESAH Length = 565 Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 42/104 (40%), Positives = 54/104 (51%), Gaps = 3/104 (2%) Query: 3 NNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQ--QYVTALND 59 + PLISI P +N L I ++S L Q Y WE+I+VDD S L + + D Sbjct: 97 DQPLISIVTPLYNTPSNLLIECVESALFQSYQQWELILVDDASNDRSTLNCLSRIARIKD 156 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 PRI + N N G C N+ + A GEYI +D DD TPN L Sbjct: 157 PRIKVLTNRKNRGTCQSTNRGVRRASGEYIAFLDHDDRITPNAL 200 >UniRef50_B4VWC8 Glycosyl transferase, group 2 family protein, putative n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VWC8_9CYAN Length = 212 Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 57/187 (30%), Positives = 92/187 (49%), Gaps = 30/187 (16%) Query: 11 MPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ--LQQYVTALNDPRITYIHND 68 MPT+ +Q RAI+SV Q Y NWE+I+V+D S+ + + Q+ L D RI YI+ D Sbjct: 1 MPTFMHEQFIARAIQSVQCQTYKNWELIVVNDGSSDATESIVSQF---LIDSRIHYIYQD 57 Query: 69 INSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVCQG 128 NSG+ RN+ + ++G+YI +D DD + N L V ++ + + Sbjct: 58 -NSGSSCARNKGLEHSKGDYIAYLDSDDIYLANHLEV---------------RHNIIKEH 101 Query: 129 EVYSQPASLPLYPKSPYSRRLFYKR-------NIIGNQVFTWAWRFKECLFDTELKAAQD 181 EV P++ P SR+++Y ++ V + LFD+ + +D Sbjct: 102 EV--DLVFGPVWDIKPSSRKIYYGELADTDTGCVLPLMVMHKRFCLSVGLFDSNIVFEED 159 Query: 182 YDIFLRM 188 D+FLRM Sbjct: 160 LDLFLRM 166 >UniRef50_Q8DGK0 Tlr2317 protein n=2 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DGK0_THEEB Length = 356 Score = 77.8 bits (190), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 46/108 (42%), Positives = 64/108 (59%), Gaps = 5/108 (4%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDP 60 N PL+S+ MP +N + AI+S+L Q + ++E II+DD ST S L++Y A+ D Sbjct: 12 NRPLLSVLMPVFNAEAYVGEAIESILGQSFRDFEFIIIDDGSTDNSLSVLKRY--AVQDA 69 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA 108 RI I + N G A N+ I LA+GE+I +D DD PNR V LA Sbjct: 70 RIRLISRE-NRGLVATLNEGIALARGEWIARMDADDVAMPNRFIVQLA 116 >UniRef50_A6H0Q4 Glycosyl transferase, group 2 family protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6H0Q4_FLAPJ Length = 308 Score = 77.8 bits (190), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 55/195 (28%), Positives = 97/195 (49%), Gaps = 12/195 (6%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P SI +P +N++ +KSVL Q + ++E+IIV+D ST + +Q V ND RI Y Sbjct: 2 PFFSIIIPLYNKENYIEYTLKSVLNQSFIDFEIIIVNDGST--DNSEQIVFQFNDKRIHY 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH-AFLYAND 123 H N G RN I A+ ++I +D DD W N L + + ++ T+ F A + Sbjct: 60 FHKK-NEGVSIARNFGIEQAKSDFICFLDADDFWYKNYLEIMQTYIEKFSTYKVFSSAIE 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF------KECLFDTELK 177 ++ PA + + + F++ + ++T + F K +FD +K Sbjct: 119 IETSKTIF--PAQYSIKKNADFEIINFFEASEKECAIWTSSSVFHKSIFKKTGVFDPNIK 176 Query: 178 AAQDYDIFLRMVVEY 192 A+D D+++R+ ++Y Sbjct: 177 IAEDTDLWIRIGLQY 191 >UniRef50_A8GFU6 Glycosyl transferase family 2 n=1 Tax=Serratia proteamaculans 568 RepID=A8GFU6_SERP5 Length = 300 Score = 77.8 bits (190), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 66/234 (28%), Positives = 104/234 (44%), Gaps = 32/234 (13%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS-------TSWEQLQQYVTALND 59 +S+ +PT+ R +L RAI SVL Q + E+I+VDD + E+L Y+ Sbjct: 5 VSVIIPTYGRSELLARAIDSVLAQTHRPLEIIVVDDNPQGDAHRLATRERLADYI---ER 61 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 I Y N G RN I+ ++GEYIT +DDDD + P +++ LA QQ L Sbjct: 62 GEIIYYPRRQNGGGAQARNSGILRSRGEYITFLDDDDYYHPQKIACQLAFMQQGNYDVSL 121 Query: 120 YANDYVCQGEVYSQP------ASLP--LYPKSPYSRRLFYKRNI-IGNQVFTWAWRFKEC 170 D + G++ ++ A L + Y+ + +R++ I + F R+++ Sbjct: 122 CDMDILKDGQISAEHYYRARCAGLEDFMLAGVAYTPMIMLRRSMAIAVRGFFNTPRYQDH 181 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 +F L AA G +H +H +ITSSPK GY Sbjct: 182 IFLYRLLAA-------------GANIGTLHERLAVHNDHDGERITSSPKGILGY 222 >UniRef50_Q3A667 Glycosyltransferase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A667_PELCD Length = 392 Score = 77.8 bits (190), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 55/188 (29%), Positives = 87/188 (46%), Gaps = 6/188 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 + SI MP WNR + A+ SVL Q + ++E++IVDD ST + L V A D RI + Sbjct: 1 MFSIIMPVWNRAEEVAAAVDSVLAQTFRDFELVIVDDGST--DDLAAVVEAYEDERIRF- 57 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL-AHKQQLVTHAFLYANDY 124 + + G C RN A+ A YI +D D+ W+P+ L++ A + H Y Sbjct: 58 YRRPHEGVCKARNFALSKACYPYIAYLDSDNRWSPDFLAIMRKALQDSPEAHPTAYCQAS 117 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL--FDTELKAAQDY 182 + + S L P+ + ++N I F A + FD L+ D+ Sbjct: 118 LFRRNKNSGDLQLRCLIGQPFGFKQLLRQNYIDLNAFVHARALLDSTGGFDETLRRLNDW 177 Query: 183 DIFLRMVV 190 D+ +R+V Sbjct: 178 DLIIRLVA 185 >UniRef50_Q1PUV4 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUV4_9BACT Length = 1678 Score = 77.8 bits (190), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 58/194 (29%), Positives = 94/194 (48%), Gaps = 9/194 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+SI +PT+NR A++S++ Q Y E I+V+D + L T + I Y Sbjct: 1310 PLVSIIIPTYNRPGTLKIALESIVAQTYKFIETIVVNDAGI--DVLNVIDTFRDRLSIKY 1367 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 ++IN A RN AI A G+YI +DDDD + P+ + ++ + T+ Y + Y Sbjct: 1368 FVHNINKDRSAARNTAIKHASGKYIAYLDDDDIFYPDHIETLVSFLETNNTYKVAYTDAY 1427 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKEC-----LFDTELKA 178 E + + PYS Y R + GN + T +K C +FD L A Sbjct: 1428 RAFQE-KEHGKYITKNKEIPYSFDFDYDRILTGNFIPTPCIMHYKSCIDVVGIFDETLGA 1486 Query: 179 AQDYDIFLRMVVEY 192 +D+D+++RM ++ Sbjct: 1487 HEDWDLWIRMSTKF 1500 Score = 47.0 bits (110), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 46/197 (23%), Positives = 84/197 (42%), Gaps = 16/197 (8%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDY---SNWEMIIVDDCSTSWEQ--LQQYVTALNDPR 61 +S + T+N ++ ++ + Q E+I++D CS EQ + ++ + D Sbjct: 1066 VSAIVSTYNSERFIRGCLEDLTNQTSYKKGALEIIVIDSCSPQNEQAIVDEFQKSYRD-- 1123 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I YI + A N+AI +A+GEYIT + DD + L + + +Y Sbjct: 1124 IVYIRTEKRETIYAAWNRAIKIAKGEYITNANTDDRHRKDGLEILANELDENPDVVLVYG 1183 Query: 122 NDYV--CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR----FKECLFDTE 175 N + + E + + ++ ++R K IG Q WR + FD Sbjct: 1184 NQIITETENETFEHHKPVGVFQWPDFNRNYLLKVCSIGPQPM---WRKLVHSEFGYFDDI 1240 Query: 176 LKAAQDYDIFLRMVVEY 192 + A DY+ +LR+ +Y Sbjct: 1241 FQVAGDYEFWLRISQKY 1257 >UniRef50_B7KAG8 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KAG8_CYAP7 Length = 344 Score = 77.8 bits (190), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 69/268 (25%), Positives = 121/268 (45%), Gaps = 44/268 (16%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRITYI 65 S+ +P +N ++ + I+SVL Q Y N+E++IV+D S + E QQ+ D R+ I Sbjct: 5 SVIIPAYNVEKYIAKTIQSVLEQTYQNFELLIVNDGSLDRTLEICQQF----TDSRLKII 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+G RN I +QGEYI +D DD W P +L + H ++ T ++ Sbjct: 61 SQK-NTGLSGARNTGIRHSQGEYIAFLDGDDLWLPEKLQKHIQHLEKSPTVGVSFSRSAF 119 Query: 126 C--QGEV---YSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-------------- 166 +G + Y P + P+ + N +GN R Sbjct: 120 IDERGNLLNSYQMPILKNITPE------ILLHGNPVGNGSALVVRREVLEAIKFQNQLSD 173 Query: 167 FKE-CLFDTELKAAQDYDIFLRMVVEYGEPWKVE---EATQILHINHGEM------QITS 216 F+E C FD + + A+D ++ LR+ +E W++E EA + +N + Q+ S Sbjct: 174 FQEYCYFDAQFRRAEDIELLLRIALETS--WQIEGIPEALTLYRVNSQGLSANWQEQLHS 231 Query: 217 SPKKFSGYFHFYRKHKDKFDRASKKYQL 244 K S + + +++++++ YQL Sbjct: 232 WEKVISKTKSYAPELVTRWEKSARAYQL 259 >UniRef50_B1ZW34 Glycosyl transferase family 2 n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZW34_OPITP Length = 303 Score = 77.8 bits (190), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 58/199 (29%), Positives = 93/199 (46%), Gaps = 19/199 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+SI +P N ++ +++S L Q Y NWE+I VDDCST A +P Sbjct: 1 MQQRPLVSIVLPVHNGERYLRESVRSCLDQTYQNWELIAVDDCSTDGTAAMLDSFAAREP 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL----SVFLAHKQQLVTH 116 R+ + N N A N ++GE +T DD+ + PN L S + H++ ++ Sbjct: 61 RMRVLRNPANLRLPASLNVGFAASRGEILTWTSDDNLYLPNALEEMVSALMEHREAMMV- 119 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPK-SPYSRRLFYKRNIIGNQVFTWAWRFKECL--FD 173 Y Q + A + YP SP S F N++ N F + + E + +D Sbjct: 120 -------YAAQDIIDESGAVVVRYPSGSPESLCHF---NVV-NACFAYRRKVLETVGGYD 168 Query: 174 TELKAAQDYDIFLRMVVEY 192 L+ +D+D +LR+ Y Sbjct: 169 PILELVEDWDYWLRIARRY 187 >UniRef50_C3RR62 Glycosyl transferase n=3 Tax=Bacteria RepID=C3RR62_9MOLU Length = 718 Score = 77.8 bits (190), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 60/197 (30%), Positives = 91/197 (46%), Gaps = 25/197 (12%) Query: 4 NPLISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPR 61 NPLISI +P +N ++ + S+L Q Y N+E+ IVDDCST+ E + N D R Sbjct: 185 NPLISILIPVYNVERKFLSECLDSILNQTYQNYEVCIVDDCSTNLETINTLKEYENKDTR 244 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL--SVFLAHKQQLVTHAFL 119 I IN N A+ +A+GE+I +D+DD PN L +V L +K + Sbjct: 245 IVVKTRLINGHISKASNDALEIARGEFICLVDNDDTLAPNALYENVALLNKHK------- 297 Query: 120 YANDYVCQGEVYSQPASLPLYPK--SPYSRRLFYKRNIIGNQVFTWAWRFKECL------ 171 D+ +YS L L + P+ + F ++G + L Sbjct: 298 -DADF-----IYSDEDKLDLRGERCEPHFKSDFAPDTLLGINYICHLAVLRTSLVREVGG 351 Query: 172 FDTELKAAQDYDIFLRM 188 F L+ QD+D+FLR+ Sbjct: 352 FTVGLEGVQDHDLFLRI 368 Score = 41.2 bits (95), Expect = 0.036, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 62/117 (52%), Gaps = 7/117 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQL---QQYVTA 56 + P +SI +PT + + + ++S+ + +YSN+E++IVD+ S E + ++Y Sbjct: 438 LDTEPSVSIIVPTRDFADVTEKCLESIYKLTNYSNFEVVIVDNRSEKQETMELFEKYQMR 497 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ 112 + R+ I D+ A+ N A+ + + + +++D E TPN L + +++ Q Sbjct: 498 YENFRV--IKADMEFNYSAINNLAVSTCKSDVLVLLNNDTEVLTPNWLKLMVSYAIQ 552 >UniRef50_C0FV27 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FV27_9FIRM Length = 1030 Score = 77.4 bits (189), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 60/191 (31%), Positives = 94/191 (49%), Gaps = 18/191 (9%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 L SI MP +N + + +AI+S+ +Q+Y NWE+ I DDCST E ++++++A+ + RI Sbjct: 92 LFSIVMPVYNVEIKWLDKAIESIEKQNYKNWEICIADDCSTKQE-VREHLSAMKNSRIKI 150 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G N A LA GEYI +D+DDE P+ L F ++ + D Sbjct: 151 KLLEKNQGISGATNAAAALASGEYILLMDNDDELAPSALHEFYQKIKKEGSEIIYSDMDI 210 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDT------ELKA 178 + P P +S LF + +G+ + FK+ LF+ E Sbjct: 211 IDAKGKTRDPLCKP-----DWSPDLFLSQMYLGHLI-----GFKKSLFEKVGGFRGEFNG 260 Query: 179 AQDYDIFLRMV 189 +QDYD+ LRM Sbjct: 261 SQDYDLLLRMT 271 >UniRef50_Q1ZNV1 Glucosyltransferase protein n=1 Tax=Photobacterium angustum S14 RepID=Q1ZNV1_PHOAS Length = 308 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 42/105 (40%), Positives = 58/105 (55%), Gaps = 3/105 (2%) Query: 4 NPL-ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 NP+ +S+ M WN Q +AI SVL+Q E+++VDD ST + + V+ DPR+ Sbjct: 3 NPIQVSVVMAAWNSAQFITQAIDSVLQQTDVELEVLVVDDAST--DNTVEIVSKHPDPRV 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL 107 I ++ N G A RN AQGE+I +D DD P RLS L Sbjct: 61 KVITSEKNGGPGAARNIGFQAAQGEWIAVVDSDDAMKPERLSTML 105 >UniRef50_A9EDG2 Putative uncharacterized protein n=1 Tax=Kordia algicida OT-1 RepID=A9EDG2_9FLAO Length = 313 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 58/209 (27%), Positives = 100/209 (47%), Gaps = 27/209 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL S+ +P +N+++ + ++SVL+Q + ++E++IV+D ST + V+ +D RI Sbjct: 2 PLFSVIIPLYNKEKYVQQTVQSVLQQTFVDFELLIVNDAST--DDSVAVVSQFSDERIRL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N N G A RN I A+G I +D DD W P L + A ++ DY Sbjct: 60 IENPKNLGLSATRNHGISKAKGTIIALLDADDLWLPTFLETIKSLYDSF-PEAVIFGTDY 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRL---------FYKRNIIGNQVFTWAWRFKE--C--- 170 V E Y+ L P+ S L F++ ++ + FK+ C Sbjct: 119 V---ETYTSQED--LVPQKNISASLQGTSFLVPDFFEASMFQPIFLQSSVAFKKDICTEK 173 Query: 171 -LFDTELKAAQDYDIFLRMVVEYGEPWKV 198 +FD+++ A+D D +++ YG ++V Sbjct: 174 EVFDSKITFAEDIDFYIK----YGSQYRV 198 >UniRef50_UPI00017449A6 glycosyl transferase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017449A6 Length = 239 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 59/240 (24%), Positives = 101/240 (42%), Gaps = 19/240 (7%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +PT N + A+KS+L Q E+I+VDD S + + D +I Y+ Sbjct: 10 VSVIVPTRNYGRFLADAVKSILGQSLPAMEVIVVDDASD--DNTVEVARGFGD-KIRYVR 66 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL----AHKQQLVTHAFLYAN 122 D+ GA A RN + LA GE + +D DD W P L + + AH + A + Sbjct: 67 LDVRGGAAAARNTGLKLAVGEVVCFLDSDDIWLPGALEILVGGLEAHPEAAGASARIVNV 126 Query: 123 DYVCQGEVYSQPASLPLYPKSPY-SRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 E + PASL + + L +RN F++ F E + + Sbjct: 127 PDSAMDEALACPASLVERAFQCWGAGGLLLRRN-----------SFEKVGFFDEGTSIAE 175 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 ++ + + G + + +L HG + + P GY H R+H ++ A+ + Sbjct: 176 LVAWVSLARDQGLEFATFQELVVLRRVHGANSVLTMPTLAPGYMHLIRQHMERQRAAASR 235 >UniRef50_Q8YYD5 All0915 protein n=6 Tax=Cyanobacteria RepID=Q8YYD5_ANASP Length = 301 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 66/254 (25%), Positives = 106/254 (41%), Gaps = 41/254 (16%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 +NPL+SI +PT NR L A++S L Q + E+I+VDD ST Q + + PR+ Sbjct: 2 SNPLLSIIIPTHNRPHLVTHAVQSALEQTMEDLEVIVVDDAST------QPIDLPSHPRL 55 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA- 121 I + G RN + A G ++ +DDDD P V L Q A + Sbjct: 56 RLIRLLQSHGGAGARNVGLEAALGRWVAFLDDDDRLLPQMAEVSLEALSQTSLPAPVAVI 115 Query: 122 ---NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF---KECL---- 171 + QG+V Q PL P+ + F+ I +Q + +E L Sbjct: 116 SGLEEVNAQGKVLGQRLPPPLRPRGVH----FFLEEIEPDQSYNTKQTLVVEREVLQKIG 171 Query: 172 -FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHI--------NHGEMQITSSPK-KF 221 +D E+++ ++FLR+ A I+ + NHG +++ +P + Sbjct: 172 GWDEEMRSRVHSELFLRL----------NPACSIIGLPIITYQLYNHGGQRVSRNPMLRQ 221 Query: 222 SGYFHFYRKHKDKF 235 + KHK F Sbjct: 222 ESFLRLVHKHKSLF 235 >UniRef50_Q2YCB6 Glycosyl transferase, family 2 n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2YCB6_NITMU Length = 313 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 59/210 (28%), Positives = 100/210 (47%), Gaps = 31/210 (14%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +SI +PT+N + + SVL Q +++ E+I+VDD ST ++ ++ VT+ P Sbjct: 4 SKPTVSIIIPTYNCEAYIAETLDSVLSQTFTDLELIVVDDGST--DRTREIVTSYGTP-- 59 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + + NSG CA RN I A G Y+ +D DD W P +L++ L +Q+ +H + Sbjct: 60 VRLLSQSNSGVCAARNYGIREAAGSYVCLMDHDDYWFPEKLALQL---EQMFSHPEV--- 113 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR---------------- 166 V V+ P ++PK P S + I + W + Sbjct: 114 GLVYSTFVWWHPDGEGVFPK-PESFNIEKMPGGIDEEFSGWIYHLLLLDCWILTSAALIR 172 Query: 167 ---FKEC-LFDTELKAAQDYDIFLRMVVEY 192 F +C +D L ++D+D++LR+ EY Sbjct: 173 AEVFDKCGAYDESLPYSEDWDLWLRISREY 202 >UniRef50_Q21E22 B-glycosyltransferase-like protein n=2 Tax=Alteromonadales RepID=Q21E22_SACD2 Length = 341 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 42/107 (39%), Positives = 57/107 (53%), Gaps = 3/107 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N P+ S+ MP +N + AI+SVL Q Y +E+I VDD T + V+ DPRI Sbjct: 6 NKPIFSVVMPMYNVAKYVATAIESVLAQTYGQFELICVDDGCT--DNTLDIVSTYKDPRI 63 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH 109 I N G A RN I A+G Y+ +D DD W P +L + L+H Sbjct: 64 RVIRQK-NMGLSAARNTGISHARGLYVALLDSDDYWAPTKLQMHLSH 109 >UniRef50_C0H3N9 CpsF n=2 Tax=Pantoea RepID=C0H3N9_ERWST Length = 252 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 37/97 (38%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 + I MP +N +Q +RA++SVL Q Y +W + +++D ST + L DPRIT + Sbjct: 10 VGIVMPMYNARQTVLRAVQSVLNQTYQDWHLYLINDKSTD-DSLAWVTEQCQDPRITILD 68 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 N +N GA RN + A+ E I +D DDEW ++L Sbjct: 69 NAVNMGAAETRNVGLRAAKEEIIAFLDSDDEWHSDKL 105 >UniRef50_B5L3F2 WfgD n=3 Tax=Enterobacteriaceae RepID=B5L3F2_ECOLX Length = 253 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 3/117 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI MP++N + +I SVL+Q Y+NWE+++ DD S+ + + V +D RI + Sbjct: 5 LVSIIMPSYNAEHTISASISSVLKQTYANWELLVCDDDSSDNTRFK--VLEFSDSRIKLL 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 N+ GA RN A+ A G +I +D DD W N+L + ++ + +F+Y N Sbjct: 63 TNEYAKGAAGARNTALKYASGRFIAFLDSDDIWIANKLEMQISMMLK-NNISFMYGN 118 >UniRef50_D0MF25 Glycosyl transferase family 2 n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MF25_RHOM4 Length = 320 Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 51/199 (25%), Positives = 94/199 (47%), Gaps = 16/199 (8%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P +S+ + +N ++ AI+SVL QDY E+I+VDD ST + ++ V +P + Sbjct: 4 ERPTVSVVIAAYNAERWIAGAIRSVLGQDYPVLEVIVVDDGST--DATREVVAPFQEP-V 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 Y+ + N+G+ A RN I LA+GE + +D DD W P +L+ +A Q + Y + Sbjct: 61 RYVFQE-NAGSAAARNHGIRLARGELVAFLDADDLWLPGKLAAQVACLQAHPDCGWCYTD 119 Query: 123 DYVCQGEVYSQPASLPLYPKSP---------YSRRLFYKRNIIGNQVFTWAWRFKECLFD 173 ++ S+ L P S + + ++ + F+E Sbjct: 120 AWLVDASTRSKKVRLSQLAAMPEGWVLEALLLSNFVPFSSALVRREALEAVGGFRE---G 176 Query: 174 TELKAAQDYDIFLRMVVEY 192 + + ++D+D++LR+ Y Sbjct: 177 PDRRISEDWDLWLRIAAHY 195 >UniRef50_B4WF86 Glycosyl transferase, group 2 family protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4WF86_9CAUL Length = 606 Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 11/193 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRA-IKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + PL+S+ +P + +RA + S+L Q Y N+E+I+VDD + D Sbjct: 79 LTARPLLSVVVPLYKTPYPLLRACLASLLDQAYPNFELILVDDAPAAPTGPLAEKLVNGD 138 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ I ++N G N + A+GE+I ID DDE P L F+ H +V H + Sbjct: 139 DRVRIIQPEVNGGISRATNLGVAAARGEFILFIDHDDELVPESLLSFVEH---IVRHPEV 195 Query: 120 ---YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTEL 176 Y++ C G +L + K +S IG+ + A + LFD+E Sbjct: 196 DAWYSDQVTCDG----AGKTLHHFFKPEWSPTYLLGVMYIGHLLAVRAEICAKTLFDSEF 251 Query: 177 KAAQDYDIFLRMV 189 QD++ LR+ Sbjct: 252 DGVQDFEFMLRVA 264 >UniRef50_B9M0H9 Glycosyl transferase family 2 n=3 Tax=Proteobacteria RepID=B9M0H9_GEOSF Length = 3011 Score = 77.0 bits (188), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 64/227 (28%), Positives = 105/227 (46%), Gaps = 45/227 (19%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L SI +P++N+ + S+L QDY +WE ++V+D ST A DPRI Sbjct: 496 LFSILVPSYNQAGFLPATLDSILAQDYPHWEALVVNDGSTDETAAVMARYAEKDPRIRTF 555 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA---FLYAN 122 H + N G + N+ + A+GE+I + DD + P++LS+ H V H F++ N Sbjct: 556 HKE-NGGVSSALNEGLKHARGEWICWLSSDDLFEPDKLSL---HAATAVKHPEVRFMHTN 611 Query: 123 DYVC------------QGEVYSQPASLPLYPKSPYSRRL--FYKRN-------IIGNQVF 161 ++ GE + P +R+L F++ N I QVF Sbjct: 612 YFLLYDQTGEKVSVERNGEELNDP-----------NRQLVNFFEANQVNGISIAIHRQVF 660 Query: 162 TWAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHIN 208 FD +L+ QD+D++LR+V Y +P ++E T + I+ Sbjct: 661 DQLGH-----FDEKLRNGQDFDMWLRIVSRY-QPLYLDERTVVTRIH 701 >UniRef50_C6MKP2 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M18 RepID=C6MKP2_9DELT Length = 305 Score = 77.0 bits (188), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 42/101 (41%), Positives = 55/101 (54%), Gaps = 6/101 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 P +S+ MP +N + AI S+ Q Y NWE++IVDD ST SWE YV + DPR+ Sbjct: 4 PKVSVVMPVYNTRPYLEEAIASIFAQTYRNWELVIVDDFSTDGSWE----YVCGIRDPRV 59 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 N+ N N+ I LA GEYI +D DD P R+ Sbjct: 60 RVARNERNMRNSYTLNRGIALATGEYIAKMDADDVSFPERI 100 >UniRef50_B0CGB5 Glycosyl transferase, family 2, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CGB5_ACAM1 Length = 327 Score = 77.0 bits (188), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 53/192 (27%), Positives = 95/192 (49%), Gaps = 22/192 (11%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ + T+N + ++SVL+Q +S+ E+IIVDD S+ + + ++T++ D R+ + H Sbjct: 4 VSVIITTFNSMRFFPETLESVLKQTFSDIEVIIVDDGSS--DGISDWITSVVDSRVVF-H 60 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS----VFLAHKQQLVTHAFLYAN 122 N G + RN I LA+GEYI +D DD W P +L V + + H +L Sbjct: 61 AQSNQGVSSARNTGIRLARGEYIAFLDGDDFWAPTKLEQQVRVLALEDEVGLVHTWLALM 120 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECL-----FDTEL 176 D + + L P S ++ + N + + + CL FD+ L Sbjct: 121 D---------EESRLTGRVMKPQSEGAIWEEIVESNMIACSSVMVRRSCLDVVGIFDSAL 171 Query: 177 KAAQDYDIFLRM 188 A+D+D+++R+ Sbjct: 172 IVAEDWDLWIRV 183 >UniRef50_C6X8A5 Glycosyl transferase family 2 n=1 Tax=Methylovorus sp. SIP3-4 RepID=C6X8A5_METSD Length = 305 Score = 77.0 bits (188), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 43/120 (35%), Positives = 65/120 (54%), Gaps = 7/120 (5%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS----TSWEQLQQYVTALND 59 N S+ +P +N R + SV +Q ++N+E+I++DD S T E L+QY T LN Sbjct: 8 NMKFSVVIPCYNASSTIARTLDSVKQQTFTNFEVIVIDDASNDLATLLEVLEQYRTELN- 66 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 ++ + ND N RN+ I A+GEYI +D DD W P RL + Q+L + F+ Sbjct: 67 --LSVLTNDANMNGAYSRNRGIREAKGEYIAFLDADDSWVPTRLEQAMEAIQRLPANQFV 124 >UniRef50_B8HRB2 Glycosyl transferase family 2 n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HRB2_CYAP4 Length = 373 Score = 77.0 bits (188), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 38/102 (37%), Positives = 57/102 (55%), Gaps = 1/102 (0%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N PL+S+ +P +N + + +KSVL Q Y+N E+++VDD S A +DPR Sbjct: 8 SNLPLVSVIIPAYNAEAFIQKTLKSVLTQTYTNIEVLVVDDGSQDRTAEIVQAIAKDDPR 67 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 + + N+G A RN+AI +GE++ ID DD W P L Sbjct: 68 VDLLQQS-NAGVAAARNKAIQATRGEFVAPIDADDIWYPQNL 108 >UniRef50_C7XEA2 Teichuronic acid biosynthesis glycosyltransferase TuaG n=1 Tax=Parabacteroides sp. D13 RepID=C7XEA2_9PORP Length = 255 Score = 77.0 bits (188), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 62/190 (32%), Positives = 83/190 (43%), Gaps = 28/190 (14%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--NDPRIT 63 L+SI P + I+SV Q YS WEMIIVDDCS V D RI Sbjct: 5 LVSIVTPMYKGAAFIGETIRSVQVQTYSKWEMIIVDDCSPDDGAGVAVVKKFMEKDKRIR 64 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTP----NRLSVFLAHKQQLVTHAFL 119 I + +N G+ RN A+ AQG YIT +D DD W+P ++L LVT ++ Sbjct: 65 LIVSPVNKGSSGARNIALHEAQGRYITFLDSDDIWSPIFLDSQLDFMKEKSAALVTGSYH 124 Query: 120 YANDYVCQGEVYSQPASLP-------LYPKSPYS-RRLFYKRNIIGNQVFTWAWRFKECL 171 N+ QG QP +P + +P S Y R ++G E Sbjct: 125 RINE---QGIQVLQPFIVPEKVSYWDILKSNPVSCLTTIYDRKVVG-----------EHY 170 Query: 172 FDTELKAAQD 181 F ELK+ +D Sbjct: 171 FKEELKSLRD 180 >UniRef50_C1SJ24 Glycosyl transferase n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJ24_9BACT Length = 255 Score = 77.0 bits (188), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 59/194 (30%), Positives = 99/194 (51%), Gaps = 21/194 (10%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPRITYI 65 S+ +P +NR Q AI+SVL Q + ++E+I+VDD S + L+ Y+ P I YI Sbjct: 4 SVIIPVFNRTQTLKTAIESVLAQSFRDFEIIVVDDGSDKSVSSALRPYM-----PMIRYI 58 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS----VFLAHKQQLV-THAFLY 120 + N G RN I A+G Y +D DD W P++LS A K ++ T+ F + Sbjct: 59 RIEKNMGVSYARNIGIREAKGLYTAFLDSDDIWLPDKLSAQHEALTATKLKVCHTNEFWF 118 Query: 121 AND-YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKA 178 D ++ QG + + + ++ K R+ +I V FK C +FD+ ++ Sbjct: 119 RKDRFINQGAKHKRHGGM-IFDKVLDFCRISPSSVVIHRDV------FKTCGVFDSNMRT 171 Query: 179 AQDYDIFLRMVVEY 192 +DYD++LR+ ++ Sbjct: 172 CEDYDLWLRICSKF 185 >UniRef50_Q11L33 Glycosyl transferase, family 2 n=1 Tax=Chelativorans sp. BNC1 RepID=Q11L33_MESSB Length = 609 Score = 77.0 bits (188), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 66/250 (26%), Positives = 114/250 (45%), Gaps = 31/250 (12%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--NDP 60 PL+S+ +P +N Q AI+S+ Q + N E+++VDDCST ++ VT L D Sbjct: 206 GRPLVSVIVPAYNAQYTIATAIRSLQEQTWRNIEILVVDDCST--DETVSIVTKLANKDS 263 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-----LVT 115 RI + + NSGA RN + A+G+Y+T D DD P +L + H ++ + Sbjct: 264 RIQVLRMERNSGAYCARNLGLKNARGKYVTCHDSDDWSHPEKLQIQAEHLERHTDRIVAN 323 Query: 116 HAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTE 175 +++LY D + + L L S L ++R + +++ W E Sbjct: 324 YSYLYFVDDDLRVRARRLGSYLRLNTSS-----LMFRREPVLSELGFW----------DE 368 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS------GYFHFYR 229 ++ A D + R+ +G + A + + + E +T+S K F G FYR Sbjct: 369 VRFAADSEFLERLYAVFGRISVISIAKPLSLVRYREDSLTAS-KAFGYHGFRMGARQFYR 427 Query: 230 KHKDKFDRAS 239 + +F R+ Sbjct: 428 ERYLRFHRSG 437 >UniRef50_C1F2T6 Glycosyl transferase, group 2 family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F2T6_ACIC5 Length = 333 Score = 76.6 bits (187), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 67/264 (25%), Positives = 110/264 (41%), Gaps = 27/264 (10%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+S+ +PT R A++S L Q Y+ E+++VDD + D R+ Sbjct: 9 ERPLVSVVLPTAGRSPWLGHAVRSALAQTYAGLEVVLVDDSREGQADQPWNLLWAGDERV 68 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + GA A R + A+GE+I +DDDDEW P +++ +Q HA A Sbjct: 69 RVLRTGGVGGATA-RQAGVEAARGEWIAFLDDDDEWLPEKIA-----RQMEAAHAARPAR 122 Query: 123 DYV---CQGEVYSQPASLPLYPK------SPYSRRLFYKRNIIGNQVFTWAWRF---KEC 170 +V C+ V + +S L+P+ P + LF +R F + Sbjct: 123 GHVVMACRQWVRTA-SSEYLFPRRVYDGVEPMANYLFCRRGFTQGGGFLQTSTLVAPRAL 181 Query: 171 LFDTELKAA----QDYDIFLRMVVEYGEPWKV-EEATQILHINHGEMQITSSPKKFSGYF 225 L +T +A QD+D LR+ +V EEA + G ++ P + Sbjct: 182 LLETPFRAGLRVHQDWDWLLRVARRGDVRVQVLEEALAVYRTEDGRETVSQRP-DWRASL 240 Query: 226 HFYRKHKDKFDRASKKYQLFTLYQ 249 + R+H+ + D + Y F Q Sbjct: 241 DWIREHRGRID--PRAYSWFVAVQ 262 >UniRef50_A4EK02 Glycosyl transferase, family 2 n=1 Tax=Roseobacter sp. CCS2 RepID=A4EK02_9RHOB Length = 348 Score = 76.6 bits (187), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 54/210 (25%), Positives = 93/210 (44%), Gaps = 32/210 (15%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ M +N + A+KSVL Q E+++ DD ST A DPR+ Sbjct: 16 PTVSVIMANYNAADHVLAALKSVLAQSIPEVEILLADDASTDASITIARQIAAADPRLII 75 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + +D N+G AVRN+A+ +A+GE++ +D DD P R L+ + L T A Y Sbjct: 76 LESDTNAGPAAVRNRALDVAKGEWVAIVDSDDIIHPQRFERMLSAARALQTEAIADDLTY 135 Query: 125 VCQGEV-----------YSQPASL---------PLYPKSPYSRRLFYKRNIIGNQVFTWA 164 + V + P +L P P+ Y + +F + G Sbjct: 136 LIDDTVHPNMTLLGDVRFVNPQTLTAQNFLSTTPRAPELGYLKPMFRADALNG------- 188 Query: 165 WRFKECLFDTELKAAQDYDIFLRMVVEYGE 194 R++E +++ +D D ++R ++ G+ Sbjct: 189 LRYRE-----DIRIGEDNDFYIRFLLNGGQ 213 >UniRef50_D1PXL6 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PXL6_9BACT Length = 278 Score = 76.6 bits (187), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 61/204 (29%), Positives = 95/204 (46%), Gaps = 25/204 (12%) Query: 1 MKNN-----PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV 54 M+NN PL+SI M T+N AI+S+L Q Y+NWE++I DD + + + + Sbjct: 1 MENNDKLHHPLVSIVMATFNESVNFITEAIESILHQTYNNWELLICDDSTNA-----KTI 55 Query: 55 TALN-----DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH 109 A+N D RI I G N+ + A+GE I +D DD RL +A Sbjct: 56 DAINQLANTDSRIKIIRKSERMGFVNALNEGLSNAKGELIARMDGDDIAIDTRLEKQVAF 115 Query: 110 KQQLVTHAFLYANDYVC--QGEVYSQPASLPLYPKSPYS-RRLFYKRNIIGNQVFTWAWR 166 + N Y+ QGEV ++ Y SP+ +R+F RN + + Sbjct: 116 AATHPNISLFGGNIYIINEQGEVLTERH----YETSPHKIQRMFMYRNPFAHPTIMFRRN 171 Query: 167 F--KECLFDTELKAAQDYDIFLRM 188 + L+DT+ K A+D + +LR+ Sbjct: 172 IIDRGFLYDTQFKKAEDVEFYLRL 195 >UniRef50_C0YGQ4 Possible glycosyl transferase family protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YGQ4_9FLAO Length = 267 Score = 76.6 bits (187), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 57/198 (28%), Positives = 95/198 (47%), Gaps = 17/198 (8%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +IS+ +P +N+++ I+ VL Q Y N+E+IIV+D S ++ + V ++D R+ + Sbjct: 1 MISVVIPLYNKEKYIKETIRKVLNQTYQNFEIIIVNDGSK--DKGPEIVNEIDDSRVK-L 57 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N N G + RN I +Q EYI +D DDEW PN L HK L+T A+ +V Sbjct: 58 FNKENGGVSSARNVGIEKSQFEYIAFLDADDEWLPNHLEEI--HK--LITKYKSSADVFV 113 Query: 126 CQ-GEVYSQPASLPLYPKSPYSR-------RLFYKRNIIGNQVFTWAWR--FKECLFDTE 175 YS+ + + ++ K+ +I + + K LFD Sbjct: 114 TNFARKYSEEKIVDNRKQDELKEGIVKDYFKVILKKGLIHTSCVCVSKKALLKAKLFDER 173 Query: 176 LKAAQDYDIFLRMVVEYG 193 + +D D++ R+ EYG Sbjct: 174 ISRGEDMDLWGRLAREYG 191 >UniRef50_Q47GL6 Glycosyl transferase, family 2 n=1 Tax=Dechloromonas aromatica RCB RepID=Q47GL6_DECAR Length = 309 Score = 76.6 bits (187), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 67/230 (29%), Positives = 105/230 (45%), Gaps = 13/230 (5%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +SI PT+N L RAI SVL+Q + +WE+++VD+ S + ++ V D RI Y Sbjct: 23 VSIVTPTFNCAPLLQRAIDSVLKQTHQDWELLVVDNHSN--DGTEELVRGYRDSRIRYFK 80 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL--VTHAFLYANDY 124 N A RN AI A+GE++ +D DD WTP +L V + +Q V + LYA Sbjct: 81 IQNNGVIAASRNLAISQAKGEWLAFLDADDWWTPEKLEVSVRSLRQGWDVVYHDLYAAGP 140 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVF----TWAWRFKECLFDTELKAAQ 180 +G + + S L + P L N + N + R D L A + Sbjct: 141 GSKGRLRRRVPSRAL--RQPVLDDLVNHGNALPNSSVVVRRSLIDRVGGLSEDPALIAME 198 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 D+D +LR+ +++ A I G T++P++ + RK Sbjct: 199 DFDCWLRVARISDRFLRIKGAHGYYWIGGGN---TTNPRRTISTLNELRK 245 >UniRef50_C2P8I0 Family 2 glycosyltransferase n=10 Tax=Bacteria RepID=C2P8I0_BACCE Length = 262 Score = 76.6 bits (187), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 42/109 (38%), Positives = 58/109 (53%), Gaps = 4/109 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALNDPRI 62 PL+SI P++N I+SV Q Y WEMII+DD S + E +++ + D RI Sbjct: 14 PLVSIVTPSYNASSFIKETIQSVQSQTYKKWEMIIIDDVSKDNTCELIKEEIK--KDDRI 71 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ 111 I + N GA RN I A+G+YI +D DD W P +L +A Q Sbjct: 72 RLIELEENGGAAIARNTGINCAEGKYIAFLDSDDLWLPEKLEKQVAFMQ 120 >UniRef50_B1Y722 Glycosyl transferase family 2 n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y722_LEPCP Length = 326 Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 57/202 (28%), Positives = 94/202 (46%), Gaps = 17/202 (8%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + P IS+ +PT+NR AI SVL+Q E+I+VDD ST ++ Q + A Sbjct: 1 MTSTPRISVVIPTYNRAFCLGLAIDSVLQQSLPAHEIIVVDDGST--DRTDQVMRAYAG- 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ---QLVTHA 117 R+ ++ N+G A RN + LAQG++I +D DDEW P+RL + A Q Q + H Sbjct: 58 RVRWVFQP-NAGVSAARNTGLGLAQGDWIAFLDSDDEWHPDRLKIQCADMQAHPQAIAHM 116 Query: 118 F--LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE------ 169 Q ++ + + + P R +++ + F AW Sbjct: 117 LDCTVEEKNGAQRSIFEVRGLVDEFRRQPLRPRPLC--DVLESAFFPSAWLIARQAIDAA 174 Query: 170 CLFDTELKAAQDYDIFLRMVVE 191 FD ++ +D D+ R+ ++ Sbjct: 175 GGFDVSMRVCEDTDLLSRIALQ 196 >UniRef50_B1C0P6 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C0P6_9FIRM Length = 319 Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 58/229 (25%), Positives = 111/229 (48%), Gaps = 14/229 (6%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N PLISI +P +N Q+ R + S+++Q Y N+E I+++D ST A D R Sbjct: 2 NTPLISIIVPVYNVQKHLRRCLDSIIQQTYENFECILINDGSTDGSGEICDEIAKQDKRF 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 IH N+G RN A+++A+G+Y++ +D DD P L+ L + T + + Sbjct: 62 IVIHQQ-NAGLSVARNNALLMAKGDYVSFVDSDDYILPQYLNSLL--NAIISTDSDISKC 118 Query: 123 DYVCQGEVYSQPAS--LPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTEL---- 176 DY +G++ + + + ++R + + +G+Q+ W + +K+ L++ + Sbjct: 119 DYF-RGKLTKDKDTYDVSVVNSKKFTREVLLDK--VGSQL--WQYMYKKSLWNDIVSPAG 173 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 + AQD I ++ + +++ +I+ + S KK G F Sbjct: 174 RYAQDMMILHKITNRAKKIAVIDQKLYFYYIDRNDSTSNDSKKKVKGAF 222 >UniRef50_Q6MBF6 Putative glycosyltransferase n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBF6_PARUW Length = 376 Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 59/199 (29%), Positives = 95/199 (47%), Gaps = 19/199 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M ++PL+SI +P +N+ A+ SVL+Q Y N+E++IVDD ST D Sbjct: 3 MHSSPLVSILIPVFNQVPFIFEALNSVLQQTYPNFEVLIVDDGSTDGTASICQEFCQKDS 62 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ------LV 114 R Y++ N G A RN+ I + GEY +D DD+ R++V L + + Sbjct: 63 RFQYLYQ-TNKGPSAARNRGIAASHGEYFCLLDGDDKMDFQRIAVQLKVLTENPTLDIVY 121 Query: 115 THAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL--- 171 T L ++ GE++ Q + P++ + LF RN+I A R ECL Sbjct: 122 TALLLIDSNGSTIGEMHGQEIN----PENFLATLLF--RNVIPGPSTIMAKR--ECLTSH 173 Query: 172 -FDTELKAAQDYDIFLRMV 189 + A+DY++ +R+ Sbjct: 174 PYHEHFVHAEDYELMIRLA 192 >UniRef50_B6JCX3 Glycosyl transferase, family 2 n=1 Tax=Oligotropha carboxidovorans OM5 RepID=B6JCX3_OLICO Length = 374 Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 12/152 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 P +S+ MP++N + AI S+L Q + + E+IIVDD S S + ++ +V D RI Sbjct: 7 PTVSVIMPSYNHARFIPSAITSILDQTFFDLELIIVDDGSADDSVDVIKTFV----DSRI 62 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + ++ N GA A N+A+ LA+G Y+ + DD W N+L V L+ + A ++++ Sbjct: 63 RTVFSETNQGAAAATNKAVSLARGTYVALCNSDDRWAKNKLDVQLSIFKNQPDLAAVFSD 122 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN 154 + SQ A PL P +P +F + N Sbjct: 123 ----VAWIDSQGA--PLDPSNPSFSNVFQQEN 148 >UniRef50_B3PF93 Glycosyl transferase, putative, gt2I n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PF93_CELJU Length = 304 Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 65/237 (27%), Positives = 110/237 (46%), Gaps = 11/237 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST-SWEQLQQYVTALND 59 M+ PL+S+ + T+N Q +A++S+L Q + N E+I+VDD ST EQ+ Q A + Sbjct: 10 MQTQPLVSVVIATYNMGQYLAQAVESILNQTWKNLEVIVVDDGSTDDTEQVMQAYQA--N 67 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA-HKQQLVTHAF 118 R+ Y+ IN G +N I +G+YI D DD W P +L V L V + Sbjct: 68 TRVYYLKT-INQGQPKAKNTGINNTKGDYIAFCDADDFWEPFKLEVQLPLFNNPAVGVVY 126 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 ++ Q Y +PA+ + ++ + G V A + +FD E + Sbjct: 127 SEVSNIDEQNRRYVRPANEVRHSGQVTNQLMIENFVPFGTSVIRRACIERNGVFDEEFRM 186 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 D+D++LR +++ E E T + + G+M + G + F + +KF Sbjct: 187 GIDWDLWLRYSLDW-EFAYTPERTYVYRVWSGQMS-----TNYRGRYDFAIRILNKF 237 >UniRef50_B8HRA4 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HRA4_CYAP4 Length = 398 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 41/101 (40%), Positives = 61/101 (60%), Gaps = 5/101 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT--ALNDPRI 62 PL+S+ +P +N + + + S+L Q YSN+E++++DD ST +Q + VT A DPRI Sbjct: 14 PLVSVILPVYNGEPFIEKTLNSILTQTYSNFEVLVIDDGST--DQTETVVTSFAQRDPRI 71 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 + N+G A RN I A+GE+I ID DD W P+ L Sbjct: 72 KLLKQ-ANAGVAAARNFGIQSAKGEFIAPIDADDLWFPDFL 111 >UniRef50_A4BIW6 Glycosyltransferase n=1 Tax=Reinekea blandensis MED297 RepID=A4BIW6_9GAMM Length = 254 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 4/103 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL--QQYVTALNDP 60 +N LI++ P +N + I SVL Q + N E I+VDDCS L Q+++ D Sbjct: 2 SNELITVITPVFNCEDFLEETIVSVLNQTHKNIEYILVDDCSVDNSSLIYQRFMEK--DR 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 R+ I NSGA RN+ I LA+GEYI +D DD+W P +L Sbjct: 60 RVKSIKLLQNSGAAIARNKGIELAKGEYIAFLDSDDKWEPTKL 102 >UniRef50_D0QYM3 AcbB n=1 Tax=Avibacterium paragallinarum RepID=D0QYM3_HAEGA Length = 490 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 55/195 (28%), Positives = 95/195 (48%), Gaps = 21/195 (10%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ M T+N Q+ ++ S+++Q+YSN E+I+VDD S+ + ++ P I Sbjct: 219 PLVSVLMTTFNSQEFVEASLLSLIKQNYSNKEIIVVDDNSSDDTCKIIHSLSVKYPFIRL 278 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN-- 122 I +N G ++ AI A+GE++T D DD PN+L + Q L+ L A Sbjct: 279 IQLPVNVGTYVAKSIAIKFARGEFVTCHDSDDWAHPNKL---IEQMQPLLNDPNLVATFS 335 Query: 123 ---DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 +G+ YS+ +P +P S ++R I+ +Q+ W W +K Sbjct: 336 KWFRVSKEGKAYSR-MIIPFIRLNPSSA--LFRRKIVESQIGLWDW----------VKTG 382 Query: 180 QDYDIFLRMVVEYGE 194 D + R+ + +GE Sbjct: 383 ADSEFNARIKLVFGE 397 >UniRef50_A4G480 Putative Glycosyl transferase, family 2 n=1 Tax=Herminiimonas arsenicoxydans RepID=A4G480_HERAR Length = 303 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 44/129 (34%), Positives = 71/129 (55%), Gaps = 12/129 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M ++P++++ +PT+N A++S+ Q +SNWE I+V++ S + V + +DP Sbjct: 1 MNDSPVVTVVIPTYNHAHFLREALQSLCAQSFSNWEAIVVNNYSE--DDTIAVVASFSDP 58 Query: 61 RITYIHNDINSGACAV-RNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI + N N+G A RN+ I LAQG+YI +D DD W P +L+ + +F Sbjct: 59 RIR-LENFRNNGVIASSRNRGIALAQGQYIAFLDSDDIWYPEKLTQCM--------QSFD 109 Query: 120 YANDYVCQG 128 D VC G Sbjct: 110 DNVDLVCHG 118 >UniRef50_B7JW84 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7JW84_CYAP8 Length = 1177 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 42/106 (39%), Positives = 61/106 (57%), Gaps = 2/106 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 +N PL+S+ +P +N Q AI SVL+Q + N+E+I+VDD ST + Q+ + LN P+ Sbjct: 555 QNLPLVSVIIPCYNYGQYLEEAIDSVLQQTFQNFEIIVVDDGSTD-SKTQEVLDNLNKPK 613 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL 107 T I N G RN+ I A+G+YI +D DD+ P L L Sbjct: 614 TTLIRQK-NQGVAIARNEGIFQAKGKYICCLDADDKLKPAYLEKCL 658 >UniRef50_B0S8S5 Glycosyltransferase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0S8S5_LEPBA Length = 328 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 63/206 (30%), Positives = 100/206 (48%), Gaps = 31/206 (15%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWE-----------QLQ 51 PL+SI +PT NR+ L RA+ SVL Q Y NWE+ I+DD ST +W Q+Q Sbjct: 7 PLVSIIIPTHNRRHLVERAVLSVLNQTYPNWELHIIDDGSTDDTWSYLLTQLMGWKRQIQ 66 Query: 52 QYVTALNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ 111 + L + IH + G A RN I + G+++ +D DDEW +L + Q Sbjct: 67 SFGRML---KSIQIHQTEHRGVSAARNFGIQKSLGDWVCFLDSDDEWFKEKLEKQI---Q 120 Query: 112 QLVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSR--RLFYKRNIIGNQVFTWAW-RFK 168 H LY + E++++ +L L PK Y + F + ++ V ++ K Sbjct: 121 YHTDHKELY---FSQTNEIWNKNGNL-LEPKGKYKKLSGHFLEESLALCMVTCSSFIAHK 176 Query: 169 ECL-----FDTELKAAQDYDIFLRMV 189 + L F E+K +DYD++ R++ Sbjct: 177 QTLENIGSFREEMKTCEDYDLWNRIL 202 >UniRef50_A4Y8H1 Glycosyl transferase, family 2 n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y8H1_SHEPC Length = 281 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 62/239 (25%), Positives = 108/239 (45%), Gaps = 28/239 (11%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR- 61 N+ ++S+ +PT NR RAI SVL Q S+ E+IIV+D + + A+ Sbjct: 5 NDIVLSVIIPTCNRNDTLPRAIYSVLGQHISDIEVIIVNDTDIP---IPDDILAIGQSNP 61 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-----LVTH 116 + + N GA RN + LA G+YIT +DDDD + P RL L H +Q + + Sbjct: 62 VFFFTNPGKHGAANTRNYGVSLANGKYITFLDDDDIYLPGRLDTMLLHMKQNKYIIISSG 121 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL----- 171 F+ +++ S+ L + +K+ GN + K+ Sbjct: 122 RFVECDNF----------KSIVLQENQIFG-EFNFKKIQFGNDIDIGFMMRKDAFINLGG 170 Query: 172 FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 FD L+A +D+D LR ++ G+ +K+ +++ G +++ + GY +K Sbjct: 171 FDITLRALEDWDFILR-ALKLGDGFKIHRFDYVVNRTEGAARVSLGEAQ--GYLELLKK 226 >UniRef50_A8RE12 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RE12_9FIRM Length = 311 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 63/201 (31%), Positives = 95/201 (47%), Gaps = 14/201 (6%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 IS+ + ++NR + +AI SV+ QDYSN E+I+VDDCS + L + D I YI Sbjct: 5 ISVVITSYNRYEYLTQAIHSVVMQDYSNKEIIVVDDCS---DDLNYDEFKMRDD-IIYIR 60 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPN-----RLSVFLAHKQQLVTHAFLYA 121 N+ NSGA R +A GEYI +DDDD +T + ++ FL + ++ Sbjct: 61 NEANSGASMSRKNGYRVATGEYIIFMDDDDYYTNSDFLKKAMNGFLNNTD--ISFVSGNV 118 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 N Y Q E LP Y L F + I F+ + K E+ Sbjct: 119 NVYDQQSEQVLPANKLPFYGNVDREEYLKGFQTKYIKPYSTFSTVFSMKVIDKIIEMPMF 178 Query: 180 QDYDIFLRMVVEYGEPWKVEE 200 D I+L ++ YG+P+ +E+ Sbjct: 179 NDSSIYLGALL-YGKPYFIED 198 >UniRef50_D2RDR5 Glycosyl transferase family 2 n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RDR5_ARCPR Length = 1164 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 68/209 (32%), Positives = 96/209 (45%), Gaps = 22/209 (10%) Query: 5 PLISIYMPTWNRQQLAIR-AIKSVLRQDYSNWEMIIVDDCSTS---WEQLQQYVTALNDP 60 P ISI MPTWN + +R AI+SVL Q Y NWE+ I D ST + L++Y A D Sbjct: 632 PKISIIMPTWNTDERWLRKAIESVLNQVYDNWELCIADGGSTKPHVRKILEEY--AKKDK 689 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI N G N+A+ LA GE++ +D DDE P L + + F+Y Sbjct: 690 RIKVKFLPKNLGIAGNSNEALKLATGEFVAFLDHDDELAPFALYEVVKLLNEKPDLDFIY 749 Query: 121 AN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN-IIGNQVFTWAWRFKECLFDTELKA 178 ++ D + + P + K YS +F N +I V + K F Sbjct: 750 SDEDKIDEKGRRRDP-----FFKPDYSPDMFLSCNYLIHITVIRKSLVDKVGGFRLGYDG 804 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHI 207 +QDYD+FLR+ +E +I HI Sbjct: 805 SQDYDLFLRV---------LEHTDKIAHI 824 Score = 57.8 bits (138), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 38/136 (27%), Positives = 68/136 (50%), Gaps = 6/136 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + NP +SI +PT ++ ++ R ++S+L + Y N+E++IVD+ S + + Y T + Sbjct: 885 INGNPKVSIIIPTKDKVEVLKRCVESILNKTTYQNYEIVIVDNNSQEEKTFEYYETIKDH 944 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ----LV 114 P+I + + A+ N A+ E+I +++D E T LS L H Q+ V Sbjct: 945 PKIRILEYNKPFNFSAINNYAVSKVDSEFILFLNNDTEVITSEWLSAMLEHAQRKEVGAV 1004 Query: 115 THAFLYANDYVCQGEV 130 LY N+ + V Sbjct: 1005 GAKLLYPNNTIQHAGV 1020 >UniRef50_Q0BQ52 Glycosyltransferase n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BQ52_GRABC Length = 854 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 71/268 (26%), Positives = 110/268 (41%), Gaps = 21/268 (7%) Query: 2 KNNPLISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTAL 57 K L+SI P + R A++SV+ Q Y NWE+I++DDCS LQQ+ A Sbjct: 285 KTAQLVSIICPVYKPRLSDFALAVESVIAQTYQNWELILIDDCSEEANLKNLLQQFAQA- 343 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 D RI ++ NSG N + A G +I +D DD + L + +A + Sbjct: 344 -DSRIKHLTTRKNSGISEASNLGLKAANGSWIAFLDHDDLLEASALDIMIAAAEATGAKL 402 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTEL 176 D + + PA P ++ RL + N I + V F+ + Sbjct: 403 LYSDEDKIDDSGFFRDPAFKP-----DWNYRLLLEVNYICHFVLVQRDVINSAGPFNKQF 457 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILH---INHGEM--QITSSPKKFSGYFHFYRKH 231 AQD+D+ LR + E EP +V +IL+ I G I + P +H Sbjct: 458 DGAQDHDMLLR-IAERLEPSEVFHVPEILYHWRITPGSTAGDIGAKPYAIEAGLQCVSRH 516 Query: 232 KDKFDRASKKYQLFTLYQIRNKRMTWRT 259 +R K + T Y + ++ W T Sbjct: 517 ---LERRGLKADVSTRYGMTLYKIDWET 541 >UniRef50_A8U9G2 Glycosyltransferase n=1 Tax=Carnobacterium sp. AT7 RepID=A8U9G2_9LACT Length = 312 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 10/128 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDD--CSTSWEQLQQYVTALNDP 60 N PLIS+ +PT++R RAI+SV++QDY N E+++V D + + E ++Q +++ Sbjct: 2 NEPLISVVIPTFDRPFFLKRAIESVIKQDYKNIEIVVVVDGFSARTGEYIEQ-AKKMSNI 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS--VFLAHKQQLVTHAF 118 I I + G RN + A GE I +DDDDEW ++LS + L +K QL Sbjct: 61 EINLIQTNTKVGGSEARNIGVKKANGELIGLLDDDDEWFSDKLSSQMDLINKNQLSID-- 118 Query: 119 LYANDYVC 126 +D++C Sbjct: 119 ---DDFLC 123 >UniRef50_D0DJH6 Beta(1,3)galactosyltransferase EpsH n=1 Tax=Lactobacillus crispatus MV-3A-US RepID=D0DJH6_9LACO Length = 272 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 57/217 (26%), Positives = 101/217 (46%), Gaps = 18/217 (8%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPR 61 NP IS+ M +N AIKS+L Q +SN+E II DD ST + + L++ A D R Sbjct: 2 NPEISVIMGVYNLSPRYKEAIKSILDQTFSNFEFIICDDKSTDNTLQVLKE--IAKKDNR 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I N N G A N + A+G+YI +DDDD +R + ++ ++F+ + Sbjct: 60 IKIISNKQNMGLGASLNHCLKFARGKYIARMDDDDFSNKDRFQIEFDFLEKHPEYSFVSS 119 Query: 122 NDYVCQG-------EVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDT 174 Y+ G E+ +P+ SP+ + + + + ++ C Sbjct: 120 AYYIDDGVNKYRLKEMEEKPSKGSFLWNSPF----LHPCTMFRTEALKRSNGYRVC---K 172 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGE 211 E A+DYD+++RM + + +++ + I+ + Sbjct: 173 ETMRAEDYDLYMRMYALGMKGYNIQKPLYVYFISQSQ 209 >UniRef50_Q3M3J9 Glycosyl transferase, family 2 n=2 Tax=Nostocaceae RepID=Q3M3J9_ANAVT Length = 337 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 39/100 (39%), Positives = 60/100 (60%), Gaps = 2/100 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ +P +N + +AI+SVL Q + N+E+I+VDD ST + + + D RI Sbjct: 10 PDISVIIPAYNTEAYIAKAIESVLFQTWRNFELIVVDDAST--DCTVDVIKSFTDQRIKL 67 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 N N+GA RN+A+ +AQG++I +D DD + P RL Sbjct: 68 FVNPKNTGASGARNRALQIAQGKWIAVLDADDWYAPERLE 107 >UniRef50_Q8L0V4 N-acetylgalactosaminyl-proteoglycan 3-beta-glucuronosyltransferase n=10 Tax=Bacteria RepID=CHS_ECOLX Length = 686 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 62/222 (27%), Positives = 106/222 (47%), Gaps = 27/222 (12%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALN 58 +K PL+SIY+P +N + +R ++S L Q ++ E+ I DD ST + LQ++ N Sbjct: 428 LKRVPLVSIYIPAYNCSKYIVRCVESALNQTITDLEVCICDDGSTDDTLRILQEHYA--N 485 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 PR+ +I N G + N A+ L +G YI +D DD P+ + + L ++ ++ A Sbjct: 486 HPRVRFISQK-NKGIGSASNTAVRLCRGFYIGQLDSDDFLEPDAVELCLDEFRKDLSLAC 544 Query: 119 LYANDYVC--QGEVYSQPASLPLYPKSPYSR-------RLFYKRNIIGNQVFTWAWRFKE 169 +Y + +G + S + P+Y + + R+F R AW E Sbjct: 545 VYTTNRNIDREGNLISNGYNWPIYSREKLTSAMICHHFRMFTAR----------AWNLTE 594 Query: 170 CLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGE 211 F+ + A DYD++L++ E G P+K + HGE Sbjct: 595 G-FNESISNAVDYDMYLKL-SEVG-PFKHINKICYNRVLHGE 633 >UniRef50_B4S4G2 Glycosyl transferase family 2 n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S4G2_PROA2 Length = 334 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 53/190 (27%), Positives = 91/190 (47%), Gaps = 7/190 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL+++ MP +N ++ A++S+L Q Y N+E +++DD ST ++ V + D RI Sbjct: 2 HPLVTVLMPVYNGEKHLADAVRSILDQTYRNFEFLVMDDGST--DKSLDIVKSFEDERII 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 +D NSG N+ I LA+G+YI +D DD P+RL L TH+ Sbjct: 60 VKRSDRNSGIAKTLNKGIALARGKYIARMDCDDISLPDRLR---RQADYLETHSDTGLIG 116 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRR--LFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 Q S+ + +P + L ++ + A ++ +D EL AQD Sbjct: 117 SGIQKIKKSRKQKVFTWPSKDTEIKLDLLFQSAFFHPTIMARASLLRKIGYDEELLYAQD 176 Query: 182 YDIFLRMVVE 191 Y ++ ++ E Sbjct: 177 YALWTKLAPE 186 >UniRef50_A3W9Y7 Glucosyltransferase n=1 Tax=Erythrobacter sp. NAP1 RepID=A3W9Y7_9SPHN Length = 328 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 53/206 (25%), Positives = 92/206 (44%), Gaps = 26/206 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ MP ++ ++ A++SVL Q ++++E+I VDD T + V +DPRI Sbjct: 4 PKISVVMPIYHVEKFVGEAVRSVLDQTFADFELICVDDGGT--DASMDIVRGFSDPRIRI 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G RN I A+GE++ +D DD W P +L++ H YA Sbjct: 62 VCQ-ANRGLAGARNTGIANARGEFVALLDSDDVWHPEKLALHYIHLMANREVGVSYAGS- 119 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYK----RNIIGNQVFTWAWR-------------- 166 + Q ++ P + R+ K RN +GN R Sbjct: 120 ----RMIDQNGNVLRVAMRPKTGRVTAKDIICRNPVGNGSAPVLRRSALDLAVFPHPEEP 175 Query: 167 FKECLFDTELKAAQDYDIFLRMVVEY 192 + C FD + ++D ++++R+ V++ Sbjct: 176 SRTCWFDESFRQSEDIEMWIRLAVKH 201 >UniRef50_Q8EMF8 Glycosyltransferase n=1 Tax=Oceanobacillus iheyensis RepID=Q8EMF8_OCEIH Length = 303 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 40/103 (38%), Positives = 62/103 (60%), Gaps = 4/103 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS-TSWEQLQQYVTALN---DP 60 L+SI +PT+ R RAI SVL Q Y+N E+I+VDD + + E+L+ L + Sbjct: 4 KLVSILIPTYQRPLTLSRAIDSVLNQTYTNIEVIVVDDNNPNTDERLETEGVMLGYKKNS 63 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 ++ YI ++ N A RN A ++GEYI +DDDDE+ P+++ Sbjct: 64 KVKYIKHEYNKNGSAARNTAFQNSKGEYIMLLDDDDEFLPSKV 106 >UniRef50_B1I846 Glycosyl transferase, family 2 n=25 Tax=Streptococcus pneumoniae RepID=B1I846_STRPI Length = 317 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 43/121 (35%), Positives = 70/121 (57%), Gaps = 4/121 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--DPRI 62 L+SI +PT NR + IRA+KS L Q Y N E+II+DD ++ + + + RI Sbjct: 8 ELVSIIIPTHNRYESLIRAVKSCLHQSYKNIEVIIIDDNYSNVNLRNKIIHQFGYTNHRI 67 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA--HKQQLVTHAFLY 120 I ++ + GA RN I ++G+YI+ +DDDDE+ P+R+ +A K ++ A +Y Sbjct: 68 KLILSNEDLGATNARNIGIKNSRGKYISFLDDDDEYMPDRILKLMACFKKSRMKNLALVY 127 Query: 121 A 121 + Sbjct: 128 S 128 >UniRef50_A5FN38 Candidate beta-glycosyltransferase; Glycosyltransferase family 2 n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FN38_FLAJ1 Length = 302 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 58/194 (29%), Positives = 94/194 (48%), Gaps = 16/194 (8%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 + SI +P +N+++ RAIK+VL Q+YSN+E+IIV+D ST + V + D R+ I Sbjct: 1 MFSIVVPLYNKEKSVSRAIKTVLEQNYSNFELIIVNDGST--DNSLDVVEKIKDERVKII 58 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N N G + RN + A ++I +D DD W PN L + +A Q +Y+ V Sbjct: 59 -NKKNGGVSSARNLGVENASTDWICFLDADDYWKPNHLEI-IAELQNKYQDGKIYST-LV 115 Query: 126 CQGEVY-------SQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR--FKECLFDTEL 176 C+ S P Y +S L K I + + + + FDT L Sbjct: 116 CENSEKGVQFIENSLPDDFEGYINDYFS--LASKGTIFHSSSVCVSKKALLEVGCFDTNL 173 Query: 177 KAAQDYDIFLRMVV 190 K +D D++ ++++ Sbjct: 174 KHGEDLDVWFKLMM 187 >UniRef50_Q46CV3 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=Q46CV3_METBF Length = 338 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 45/120 (37%), Positives = 64/120 (53%), Gaps = 4/120 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P ISI +P +N+ RA+ SVL Q + N+E+I+VDD ST + + V NDPRI Sbjct: 4 PEISIIIPLYNKSIHIARALNSVLNQTFYNFEVIVVDDGST--DGGTKIVMDFNDPRIRL 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N GA RN + A+ I +D DDEWT N + + L +++ A +Y Y Sbjct: 62 IQQK-NQGASIARNVGVQYAKANLIAFLDADDEWTQNYIEIILKLRKRY-PQAGMYTTAY 119 >UniRef50_B4U6C1 Glycosyl transferase family 2 n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U6C1_HYDS0 Length = 746 Score = 75.1 bits (183), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 67/214 (31%), Positives = 96/214 (44%), Gaps = 23/214 (10%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTA 56 K P ISI +P WN ++ I I+SVL Q YSNWE+ IVD S E L+ Y Sbjct: 211 FKYEPKISIVVPVWNTPKKFLIDMIESVLNQTYSNWELCIVDGNSKEKHVKETLEHY--T 268 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 L D RI + N G N+AI LA GEYI +D DD P L + + Sbjct: 269 LKDKRIKVKYLKENKGIAGNSNEAIALATGEYIAFLDHDDVLAPFALYEVVRAINENEDV 328 Query: 117 AFLYAN-DYVCQ-GEVYSQPASLPLY-PKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD 173 F+Y++ D + + G P P + P + S +++ ++ F+E Sbjct: 329 DFIYSDEDKITEDGLKRFDPFFKPDFSPDTLRSYNYITHLSVVKKELLNEVGWFREG--- 385 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHI 207 +QDYD+ LR E+A +I+HI Sbjct: 386 --YDGSQDYDLILRC---------TEKAKKIVHI 408 >UniRef50_C1DRS5 Glycosyl transferase, family 2 n=1 Tax=Azotobacter vinelandii DJ RepID=C1DRS5_AZOVD Length = 1182 Score = 75.1 bits (183), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 54/191 (28%), Positives = 94/191 (49%), Gaps = 16/191 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL SI +P +N + A+ S +R Y +WE+I+VDD S S E+ ++ + ++DP+I Sbjct: 655 PLFSIVVPAYNTSSELLDAVLSSVRAQWYPHWELILVDDASPS-EETRRALAEIDDPKIR 713 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPN---RLSVFLAHKQQLVTHAFLY 120 +H + N G N + AQGE+I +D DDE T + L++ + Q F+Y Sbjct: 714 VLHLESNKGISGATNVGLAAAQGEFIVFMDHDDELTVDCLYELALCINRDQP----DFIY 769 Query: 121 AN-DYVCQGEVYSQPASLPLY-PKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 ++ D + + Y+QP P + P + S + + + + + +E Sbjct: 770 SDEDKLTEEGEYTQPHFKPDWSPDTMMSTMFTCHVSCVRRSLLSKVGELR-----SEFDG 824 Query: 179 AQDYDIFLRMV 189 QD+D LR+V Sbjct: 825 CQDWDFILRVV 835 >UniRef50_Q9K6R9 BH3658 protein n=5 Tax=Bacillales RepID=Q9K6R9_BACHD Length = 848 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 68/252 (26%), Positives = 117/252 (46%), Gaps = 34/252 (13%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +++ +P +N + + +I+S+ +Q ++N E+ +VDDCST + A D R+ Y Sbjct: 226 PKVTVIIPAFNAESVIQTSIESMQKQTWTNLEIFVVDDCSTDETASIVHQYAEQDHRVHY 285 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNR-LSVFLAHKQQLVTHAFLYAND 123 + + NSGA RN A+ A G+++T I+D D+W+ +R + + + H L+ H + N Sbjct: 286 LKTETNSGAYVARNIALQHATGDFVT-INDADDWSHSRKIEIQVRH---LLKHPNIIGN- 340 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVF----TWAWRFKECLFDTE---- 175 +SQ K FY+R G +F ++ +R K + Sbjct: 341 -------FSQQVRATNDLK-------FYRRGKPGLYIFANMSSFMFRRKPAMDKLGYWDC 386 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINH--GEMQITSSPKKFSGYFHFYRKH-- 231 ++ A D + R+ + +GE VE T L GE + S F GYF R+ Sbjct: 387 VRFAGDSEFVKRIKLAFGEKAVVELPTAPLSFQRQSGESLTSHSAFGFPGYFMGARREYA 446 Query: 232 --KDKFDRASKK 241 D F R S++ Sbjct: 447 EAHDHFHRTSQE 458 >UniRef50_Q0HKL3 Glycosyl transferase, family 2 n=1 Tax=Shewanella sp. MR-4 RepID=Q0HKL3_SHESM Length = 255 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 43/146 (29%), Positives = 73/146 (50%), Gaps = 4/146 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI MP N + ++ SVL Q YSN+E+II +D ST + + ++ D R+ + Sbjct: 4 LVSIIMPAHNSSATIMDSVNSVLNQSYSNFELIICNDRST--DSTLNIIESIKDSRVVVV 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N GA RN + +A+G ++ +D DD W ++L +++ + + ++ DY Sbjct: 62 ENKTFPGASYARNTGLKIAKGRFVCFLDSDDIWHQSKLLTQISYMR--TSGSYFSYGDYS 119 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFY 151 E + P PKS +RL Y Sbjct: 120 TFREQINFPVGSFFTPKSVSFKRLLY 145 >UniRef50_B4CZV6 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZV6_9BACT Length = 306 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 54/203 (26%), Positives = 93/203 (45%), Gaps = 15/203 (7%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +P +N AI SVL Q + ++E+I+VDD ST + + DPR+ + Sbjct: 5 VSVVIPAYNYAHFLPEAIASVLAQTWEHFELIVVDDGST--DNTPEVCARYTDPRVRAVR 62 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 N+G A RN I A+ ++ +D DD W P L+ ++L F Sbjct: 63 Q-TNAGLSAARNTGIREARYPFVAFLDADDRWQPEFLATVFREFERL-GPTFAAVGTSCS 120 Query: 127 QGEVYSQPASLP---LYPKSPYSRRLFYKRN-IIGNQVFTWAWRFKEC-LFDTELKAAQD 181 + +P + P S + + F RN + + + F+EC FDTEL++++D Sbjct: 121 RMNAIGEPLAPPRQNFLHTSELTVKSFCLRNRPLSSSIVVRRAVFEECGYFDTELRSSED 180 Query: 182 YDIFLRMVVE------YGEPWKV 198 D+++R+ GEP + Sbjct: 181 RDMWIRITARGHRFFYLGEPLAI 203 >UniRef50_A3YA00 Glycosyltransferase n=1 Tax=Marinomonas sp. MED121 RepID=A3YA00_9GAMM Length = 247 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 40/121 (33%), Positives = 64/121 (52%), Gaps = 13/121 (10%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRITY 64 +SI MP +N ++ +I SVL Q ++N+E++I DD ST + + + +Y+ D R+ Sbjct: 4 VSIIMPAFNAEKTISESIDSVLAQTFTNFELVICDDSSTDKTRQLINEYLA--KDKRVRL 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N +GA RN I + G YI +D DD W P++L V H F+ +N Y Sbjct: 62 VSNLYANGAAGARNSCIFESSGRYIAFLDSDDIWLPDKLK---------VQHEFMKSNGY 112 Query: 125 V 125 Sbjct: 113 A 113 >UniRef50_Q39T63 Glycosyl transferase, family 2 n=2 Tax=Proteobacteria RepID=Q39T63_GEOMG Length = 285 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 46/136 (33%), Positives = 75/136 (55%), Gaps = 19/136 (13%) Query: 2 KNNPLISIYMPTWNRQQLAI-RAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 K+ PL+++ + T+NR +L + R+++SVL Q Y N E+I+V DC T + + V + DP Sbjct: 60 KDAPLVTVCIGTYNRGKLLVERSVRSVLEQSYGNIELIVVGDCCT--DDTPRLVQGITDP 117 Query: 61 RITYIH---------NDINSGACAVR---NQAIMLAQGEYITGIDDDDEWTPNRLSVFLA 108 R +I+ N ++ A N A+ A+G++IT +DDDDE+ P+R+ + Sbjct: 118 RFRFINLPERGNYPENPLHRWCVAGTVPFNYALSQARGDFITHLDDDDEFVPDRIEKLVK 177 Query: 109 ----HKQQLVTHAFLY 120 K LV H F Y Sbjct: 178 FIQREKADLVWHPFWY 193 >UniRef50_C3QGL0 Family 2 glycosyltransferase n=3 Tax=Bacteroides RepID=C3QGL0_9BACE Length = 252 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 39/100 (39%), Positives = 59/100 (59%), Gaps = 3/100 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQ--QYVTALNDPRITY 64 +SI MP +N + + +++ Q Y++WE+IIVDDCS + E ++ + V A++D RI Sbjct: 3 VSIVMPYYNAAKYIKETVGAIIAQTYTDWELIIVDDCSLAPETVEVLKAVEAMDD-RIRV 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 + +N GA A RN AI A G Y+ D DD W P +L Sbjct: 62 LRAKVNGGAGAARNIAIKEAYGRYLAFCDSDDWWYPTKLE 101 >UniRef50_C1QAA3 Glycosyl transferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAA3_9SPIR Length = 403 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 2/103 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 +K N SI +P +N ++ R I S++ Q ++ +E+IIVDDCS ++ + + D Sbjct: 2 IKINCFFSIIIPVYNTERYLKRCINSIINQKFTKFEIIIVDDCSNG--DCKEIINSYYDN 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 RI YI ++ N G + R + A+G+YIT ID DDE N L Sbjct: 60 RIIYIRHEQNKGTLSARKTGSIQAKGDYITYIDPDDELNLNAL 102 >UniRef50_D2R3I2 Glycosyl transferase family 2 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R3I2_9PLAN Length = 327 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 42/96 (43%), Positives = 60/96 (62%), Gaps = 2/96 (2%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHN 67 S+ PT+NR+ + RAI+SV Q S++E +IVDD ST + + YV +L DPRI +I Sbjct: 9 SVVTPTYNRRPVLERAIRSVQAQTCSDYEHLIVDDGST--DSTRDYVKSLRDPRIRFIEL 66 Query: 68 DINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 + GA RN I A+ +IT +D DDE+ P+RL Sbjct: 67 AEHRGANWARNVGIEAARSSWITFLDSDDEFLPHRL 102 >UniRef50_A3W1Y5 Glycosyltransferase n=1 Tax=Roseovarius sp. 217 RepID=A3W1Y5_9RHOB Length = 300 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 65/253 (25%), Positives = 110/253 (43%), Gaps = 36/253 (14%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL+S+ +P +NR+ A+ SVL QDY E+I VDD ST + + +++DPR+ Sbjct: 5 SPLVSVIIPCFNRETTVHEAVLSVLTQDYDPIELIAVDDNST--DGTIAVLESIDDPRLR 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 +HN + G RN I + +I D DD W P +L K+Q+ +L +D Sbjct: 63 ILHNSGDRGPSQARNHGIRSSTAPWIAFQDSDDIWLPGKL------KRQM---DYLQDSD 113 Query: 124 YVCQ--GEVYSQPASLPLYPKSPYSRR---------------LFYKRNIIGNQVFTWAWR 166 +V G + + +P++P R N I Q+ Sbjct: 114 FVAAYCGMLIKKDT----HPETPVKYRRPAPETSPLEGDILPSLALENYISTQMLVVRHD 169 Query: 167 FKECL--FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 + + FD L A D+++ LR V G +++ + ++ + SS K+ + Sbjct: 170 ILDQIGGFDEALPALVDWELMLR-VASLGPVAFIDDDLVVQRMSENSL-THSSKKRLAAQ 227 Query: 225 FHFYRKHKDKFDR 237 H +KH D R Sbjct: 228 EHILKKHYDLLGR 240 >UniRef50_Q7U8H3 Putative glycosyltransferase family 2 protein n=1 Tax=Synechococcus sp. WH 8102 RepID=Q7U8H3_SYNPX Length = 804 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 63/211 (29%), Positives = 100/211 (47%), Gaps = 18/211 (8%) Query: 2 KNNPLISIYMPTWNRQQLAIR-AIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTAL 57 ++ PLISI +PT+N +R I+SV RQ Y NWE+ I DD S++ L+ Y ++ Sbjct: 235 QDAPLISIIIPTYNTNSNHLRECIESVCRQSYPNWELCICDDSSSAVSVKTILRSYQSS- 293 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 DPR+ I + N C N A+ +A GEY+ +D DD N L ++A + Q A Sbjct: 294 -DPRVKLIFREKNGHICEASNDALRMATGEYVALLDHDDILADNAL-YWVARELQKKPQA 351 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTEL 176 L +D + ++ + K ++ L N I + V+ + F Sbjct: 352 NLIYSD---EDKINDDGMRACPHFKPAFNIDLLLSYNFISHLGVYRREILKQIGGFRVGF 408 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHI 207 + +QD+D+ LR V+E QI+HI Sbjct: 409 EGSQDHDLALRTVLE-------SSPDQIIHI 432 >UniRef50_B6BQY7 Glycosyltransferase n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQY7_9RICK Length = 296 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 59/197 (29%), Positives = 100/197 (50%), Gaps = 24/197 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+SI +PT+N +A++SVL Q Y+NWE I++D+ ST + + + ND RI Y Sbjct: 6 PLVSIIIPTFNHADYLNKALQSVLHQTYTNWEAIVIDNHST--DDTGKVIARFNDHRIKY 63 Query: 65 IHNDINSGACA-VRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA---HKQQLVTHAFLY 120 + N G A RN I A+G+++ +D DD WT ++L + ++ L+ H Sbjct: 64 LKIQ-NYGVIAKSRNIGIKSAKGDWVAFLDSDDSWTSDKLKTCIENINNQIDLIYHDLEV 122 Query: 121 AND--YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-------QVFTWAWRFKECL 171 +D +G++ ++ L K P L N+I N +V T +E Sbjct: 123 TSDRQIFFKGKI-NKSRQL----KKPILIDLLVNGNVISNSSVIVRKKVLTKIGLIEE-- 175 Query: 172 FDTELKAAQDYDIFLRM 188 + +L AA+DY+ +L++ Sbjct: 176 -NKDLIAAEDYNTWLKI 191 >UniRef50_D2RIR4 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIR4_ACIFE Length = 256 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 40/100 (40%), Positives = 58/100 (58%), Gaps = 4/100 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 L+S+ MPT+N + + +I SVL Q ++WE+ IVDDCST + E L+ Y+ P+I Sbjct: 2 LVSVVMPTYNCGKYILESIDSVLAQTITDWELQIVDDCSTDDTQEILKPYLEKY--PQIH 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 Y N G R AI A+G+Y+ +D DD W P +L Sbjct: 60 YYRLSQNGGPAVARTVAIERAEGKYVAFLDSDDLWLPEKL 99 >UniRef50_Q5V6E2 Glycosyl transferase n=1 Tax=Haloarcula marismortui RepID=Q5V6E2_HALMA Length = 318 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 34/101 (33%), Positives = 59/101 (58%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N P +S+ +PT++R ++ A+KSV Q Y + E+++VDD S + + ++ + Sbjct: 2 NEPPVSVVLPTYDRPEILRNAVKSVQEQTYRHIELVVVDDHSPTPAADTLSPLSFDELTV 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 I ++ N GA RN I + GEYI +DDDD+W P+++ Sbjct: 62 EIIRHEENRGANEARNTGIRESTGEYIAFLDDDDDWAPSKI 102 >UniRef50_C6IGZ0 Glycosyl transferase n=1 Tax=Bacteroides sp. 1_1_6 RepID=C6IGZ0_9BACE Length = 304 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 5/121 (4%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 K PL++I T NR +L R I+S+ Q Y N+E I+ D + + ++ V + NDP Sbjct: 3 KKLPLVTIITNTKNRARLISRCIESIQNQTYQNYEHIVADGGT---DNTKKIVESYNDPH 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLV-THAFLY 120 I YI + G A +A L++GE+IT +DDDDE+TP +L L Q L + F+Y Sbjct: 60 IIYI-SIPEGGPVAQTKEAFKLSKGEFITFLDDDDEYTPEKLEKQLDLIQSLSDDYGFIY 118 Query: 121 A 121 Sbjct: 119 G 119 >UniRef50_D2QQL1 Glycosyl transferase family 2 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQL1_9SPHI Length = 308 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 48/160 (30%), Positives = 79/160 (49%), Gaps = 24/160 (15%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P++++ M T+NR + A++SVLRQ +NWE++++ D T + + + + DPRI Sbjct: 6 HTPVVTVIMATYNRSNILHYAVQSVLRQTETNWELLVIGDHCT--DDTEAVMASFTDPRI 63 Query: 63 TYIHNDINSGA-CAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH------------ 109 +I+ + N G N + A+G YI ++ DD W P L+V LA Sbjct: 64 RFINLETNVGEQSGPNNTGLRQARGTYIAFLNHDDMWFPEHLAVCLAALRTQQADLVFTL 123 Query: 110 ------KQQLVTHAFLYANDYVCQGEVYSQPASLPLYPKS 143 +Q+V L AN Y VY PAS ++ +S Sbjct: 124 GAVMMPDEQIVLSGILPANRY---NPVYFVPASSWVFHRS 160 >UniRef50_A4J4C0 Glycosyl transferase, family 2 n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J4C0_DESRM Length = 373 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 3/101 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +IS+ MP +N + I+SVL Q N+E+II++D ST +Q + + + +DPRI Y Sbjct: 1 MISVIMPVYNGAEFLKETIESVLNQTEKNFELIIINDGST--DQSEDVILSFSDPRILYF 58 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVF 106 + NSGA A RN+ + A G+++ D DD PNR V Sbjct: 59 KQE-NSGAAAARNRGLEKACGDFVVIQDADDISLPNRFEVL 98 >UniRef50_A5GQE9 Glycosyltransferase n=1 Tax=Synechococcus sp. RCC307 RepID=A5GQE9_SYNR3 Length = 275 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 40/108 (37%), Positives = 59/108 (54%), Gaps = 4/108 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--ND 59 N PL+SI +P +N + +++SV Q +SN++ IIVDDCST +Q Q V + D Sbjct: 4 SNQPLVSIIIPCFNCASTVLESLQSVQSQTFSNFQCIIVDDCST--DQTQALVMSFIEGD 61 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL 107 R Y+ N G + RN A+ +YI +D DD W P L++ L Sbjct: 62 SRFRYLITQSNQGVSSARNLALDNCSSQYIAFLDSDDLWHPEFLTLAL 109 >UniRef50_Q6AK17 Related to glycosyl transferase n=1 Tax=Desulfotalea psychrophila RepID=Q6AK17_DESPS Length = 285 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 56/196 (28%), Positives = 89/196 (45%), Gaps = 19/196 (9%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW-EQLQQYVTALNDPRITYI 65 IS+ +PT+NR Q +RAI SVLRQ E++IVDD ST E L + A I YI Sbjct: 6 ISVVIPTYNRDQYLVRAIDSVLRQSLRVAEIVIVDDGSTDGSEALIRQKKAEQGVPIRYI 65 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL----SVFLAHKQQLVTHAFLYA 121 + N G A RN + A +YI +D DD W ++ A + ++H + Sbjct: 66 YQK-NQGPAAARNNGLQHASSDYIAYLDSDDHWHKKKIEKQYGALCAQPEMRISHTY--- 121 Query: 122 NDYVCQGEVYSQ-----PASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTEL 176 ++ +GE +Q P ++ + ++ +F+ LFD + Sbjct: 122 EKWLRRGEHLNQKKKHIPGHGDIFAHCLQLCAVGMSTVMLHRSIFSEVG-----LFDEGM 176 Query: 177 KAAQDYDIFLRMVVEY 192 + +DYD +LR+ Y Sbjct: 177 RCCEDYDFWLRVSCRY 192 >UniRef50_B7JHV6 Glycosyl transferase, group 2 family protein n=52 Tax=Bacillus RepID=B7JHV6_BACC0 Length = 266 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 45/120 (37%), Positives = 65/120 (54%), Gaps = 5/120 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRI 62 P +S+ P++N + I SV Q Y NWEMIIVDD ST S ++++ + D RI Sbjct: 12 PHVSVITPSYNSIRFIGETIVSVQNQSYENWEMIIVDDASTDESVTKIKEIIEG--DSRI 69 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + N GA RN AI A+G YI +D DD W P++L L +++ +F YA+ Sbjct: 70 RLVSLKENIGAAKARNIAIQEARGRYIAFLDSDDIWLPHKLKTQLLFMEEMNV-SFSYAS 128 >UniRef50_A1U141 Glycosyl transferase, family 2 n=1 Tax=Marinobacter aquaeolei VT8 RepID=A1U141_MARAV Length = 299 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 56/195 (28%), Positives = 91/195 (46%), Gaps = 7/195 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALN 58 M + PL+SI PT+NR A++SVL Q +E+I++DD ST + E +++Y + Sbjct: 1 MYSKPLVSIITPTFNRADYLPIAVESVLAQTMPEFELIVIDDGSTDNTSEVMERYTS--- 57 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D R+ Y + N G RN+ I A+G YI +D D+ W +LS L ++ Sbjct: 58 DSRVKYFKQE-NQGQSIARNRGIEAAKGGYICFLDSDNAWLETKLSKSLQAFKENPEADI 116 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELK 177 +Y + + + Q + + L + I N T F+E FD + Sbjct: 117 VYGDFILIDEDGKEQGVNRMKRYSGRITSHLIHDNFISMNTTMTRRRCFEEMGGFDNADR 176 Query: 178 AAQDYDIFLRMVVEY 192 A+DY ++LR Y Sbjct: 177 LAEDYGLWLRFSTRY 191 >UniRef50_C4KBW6 Glycosyl transferase family 2 n=1 Tax=Thauera sp. MZ1T RepID=C4KBW6_THASP Length = 300 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 61/212 (28%), Positives = 91/212 (42%), Gaps = 42/212 (19%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDP 60 P +S+ +P +N RA+ SVL QD++++E+I+VDD ST + E L+ Y L Sbjct: 5 TEPTVSVVIPAFNAAWCIRRAVDSVLAQDFTDFELIVVDDGSTDDTAEVLRGYGDVLR-- 62 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 + + N G RN I LA+G YI +D DD W P +L +A Q AF Sbjct: 63 ----LVSQPNGGMSRARNAGIRLARGRYIAFLDADDRWLPAKLGRQVALLDQRPELAFCA 118 Query: 121 AND------------YVCQG-------EVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVF 161 A + CQG EV++ A + S +RR +G Sbjct: 119 ATASLEDPDGKPVGIWACQGDGTASVAEVFAAHALIAGGASSVLARR--EPVQALGG--- 173 Query: 162 TWAWRFKECLFDTELKAAQDYDIFLRMVVEYG 193 FD L A+D D+++R+ G Sbjct: 174 ----------FDEALFGAEDTDLWIRLAACGG 195 >UniRef50_A1W3G7 Glycosyl transferase, family 2 n=1 Tax=Acidovorax sp. JS42 RepID=A1W3G7_ACISJ Length = 983 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 36/126 (28%), Positives = 65/126 (51%), Gaps = 3/126 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 L+++ +P +N + + A+ SV Q ++ WE+I++DD ST +W LQ Y DPRI Sbjct: 14 LVTVILPAYNHAKYVVEAMDSVRAQTFTRWELIVIDDGSTDDTWAVLQAYAAQHGDPRIR 73 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + N+G+ A N+ + +A+ Y+ ++ DD + P RL + Q F+ Sbjct: 74 LL-TQANAGSHATLNRGLAMARTPYLAILNSDDRYAPERLQRLVDTAQAAADEVFIVTGL 132 Query: 124 YVCQGE 129 + G+ Sbjct: 133 RLIDGD 138 Score = 63.2 bits (152), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 69/292 (23%), Positives = 121/292 (41%), Gaps = 29/292 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRA-IKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + PLISI MP + ++A + SV Q Y +WE+ I DD S+ + Q+ T + Sbjct: 445 LPRRPLISIVMPVHDTPPHFLQAAVDSVQGQWYGHWELCICDDASSRADTRQRLQTLQDQ 504 Query: 60 P-RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 +I + N A+ LAQGE++ +D DD P L F Sbjct: 505 SDKIKVTRRETAGHIVHATNDALALAQGEFVVFLDHDDVLAPQALLRLAQAINAPSAPDF 564 Query: 119 LYANDYVC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVF---TWAWRFKECLFD 173 +Y+++ QG LPL+ K +S L + +N +G+ + W R L Sbjct: 565 IYSDEDKLDEQGR-----RCLPLF-KPQWSPTLQWAQNYVGHIMCVRRAWLERLGGLLEG 618 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILH---INHGEMQITSSPKKFSGYF----- 225 ++ +QD+D+ LR+ + K+E ++L+ I+ + K ++ Sbjct: 619 SQ--GSQDHDLVLRLAAQGA---KIEHIPEVLYHWRIHAASTSASPQSKPYAHLAGRQAV 673 Query: 226 --HFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 H ++ ++FDR F +YQ R + ++ R+ L D Sbjct: 674 ARHLASRYGEQFDRVDDGDHAF-VYQPRFRVPAGTVASIIIPTRDKADLLDA 724 >UniRef50_C1QAA4 Predicted glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAA4_9SPIR Length = 581 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 65/215 (30%), Positives = 102/215 (47%), Gaps = 22/215 (10%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK+NPLISI +P +N + AI S L Q Y N E+I+VDD S + V + D Sbjct: 1 MKSNPLISIIIPVYNGENYIKDAINSALNQTYRNIEVIVVDDGSK--DDTANIVKSFRD- 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 +ITYI+ + N G + N AI A+G+YI+ + DD + PN++ + + + +Y Sbjct: 58 KITYIYKN-NGGVASALNYAIKEAKGDYISWLSHDDIYYPNKIEEEVNELENIDDKTVIY 116 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYS---RRL---FYK---RNIIGNQVFTWAWRFKEC- 170 + E ++ L ++ Y RRL FY +I G + F Sbjct: 117 S-----GFEFVNEKLELITVFENSYKTEYRRLNNNFYSILLSDIGGCTLLIPKEVFSNVG 171 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQIL 205 F+ L QDYD + RM + +KV+ ++L Sbjct: 172 FFNENLLCVQDYDFWFRM---FKHGYKVKYIPKVL 203 >UniRef50_A6M2M4 Glycosyl transferase, family 2 n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M2M4_CLOB8 Length = 1362 Score = 74.3 bits (181), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 68/237 (28%), Positives = 103/237 (43%), Gaps = 45/237 (18%) Query: 4 NPLISIYMPTWN--RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--- 58 NPLISI +P +N +QL + I SV+ Q Y NW++ +VDD ST W + V LN Sbjct: 699 NPLISIVVPVYNVLDEQL-VECIGSVINQTYDNWQLCLVDDAST-WNSV---VKVLNKYE 753 Query: 59 -DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 +P+I ++ N N I +A GE+I +D DD PN L + + Sbjct: 754 KNPKINIVYRKENGHISRATNDGISIANGEFIAFLDCDDVLAPNALYEVAKKLNEDSEYD 813 Query: 118 FLYAND--YVCQGEVYSQPASLPLYPKSPYSRRLF------YKRNIIGNQVFTWAWRFKE 169 F+Y+++ G+ P P + + ++ Y+++II R Sbjct: 814 FIYSDEDKLTADGKHRHSPFFKPDWSPDTFMSLMYTCHFSVYRKSIIDE---IDGLR--- 867 Query: 170 CLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHIN-------HGEMQITSSPK 219 TE AQDYD LR E+A +I HI+ E I S+P+ Sbjct: 868 ----TEFNGAQDYDFTLRF---------TEKAKKIGHIDKILYHWREREESIASNPE 911 Score = 42.4 bits (98), Expect = 0.016, Method: Compositional matrix adjust. Identities = 47/203 (23%), Positives = 86/203 (42%), Gaps = 19/203 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+SI +P+ + ++ I+SV + Y N+E+++VD+ S + E +Y Sbjct: 956 PLVSIIIPSKDNFKILNNCIESVKKYTSYYNYEIVVVDNGSNN-ENKNKYEDICKKYNCR 1014 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLVTHAF 118 Y + +N + N +A+GE+ ++DD +EW L +V Sbjct: 1015 YHYEKMNFNFSKMCNIGSRIAKGEFYLFLNDDIEIFQEEWLEIMLGQASLKHVGVVGAKL 1074 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRN--------IIGNQVFTWAWRFK 168 LY N + Q + P + + + R+ +Y RN + G + +F+ Sbjct: 1075 LYPNSNIIQHIGITNLKIGPSHSQIGFDDRIVYYYGRNRMEYNFIAVTGACLMIERHKFE 1134 Query: 169 ECL-FDTELKAA-QDYDIFLRMV 189 E FD L A D D+ ++V Sbjct: 1135 EISGFDENLSVAYNDVDLCFKLV 1157 >UniRef50_Q116A2 Glycosyl transferase, group 1 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116A2_TRIEI Length = 1991 Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 58/187 (31%), Positives = 86/187 (45%), Gaps = 9/187 (4%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT--ALNDPR 61 P IS+ MP ++ + + I SVL Q Y NWE I DDCS++ ++++ +T A D R Sbjct: 994 PKISVVMPVYDPPLKFLHQGISSVLNQVYQNWEFCIADDCSSN-PKIREILTERAKTDSR 1052 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I N A N A LA G+++ +D+DDE TP+ L + Q FLY+ Sbjct: 1053 IKLTFRSENGNISAATNSAAELATGDFLLFLDNDDELTPDALGEVALYISQNSEIDFLYS 1112 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 +D ++ ++ K YS L IG+ F K F + +Q Sbjct: 1113 DD----DKISTEGKRFDPQFKPEYSPELLLSYMYIGHLCVVRKEIFDKIGGFRIGFEGSQ 1168 Query: 181 DYDIFLR 187 DYD LR Sbjct: 1169 DYDFALR 1175 >UniRef50_C0B9A3 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0B9A3_9FIRM Length = 444 Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 37/99 (37%), Positives = 59/99 (59%), Gaps = 2/99 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 ++S+ M T+NR+ +AI SVL Q Y ++E +IVDD ST + ++ + + DPRI Y Sbjct: 1 MVSVLMCTYNRRFCLKQAIDSVLCQTYPDFEFVIVDDGST--DGTEELIKSYQDPRIRYF 58 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 + NS C NQ + QG+Y+ ++ DD W P++L Sbjct: 59 KLNRNSFYCYAANQGLYHCQGDYVAFMNSDDAWLPDKLE 97 >UniRef50_Q8PXS5 Glycosyltransferase n=1 Tax=Methanosarcina mazei RepID=Q8PXS5_METMA Length = 311 Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 58/199 (29%), Positives = 95/199 (47%), Gaps = 23/199 (11%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPRIT 63 LIS+ M +N ++ +I S+L Q Y NWE+II++D S+ + + + Q+ + DPRI Sbjct: 5 LISVVMSAYNSEKFISDSISSILDQTYENWELIIINDASSDNTLKIVNQF--SEKDPRIK 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH-----KQQLV---T 115 I N+ N G N I ++GE+I +D DD P+RL L H + LV Sbjct: 63 VIDNENNLGLTISLNIGINNSKGEFIARLDSDDFAEPSRLEKELDHLHAYPEAGLVGSGA 122 Query: 116 HAFLYANDYVCQGEVYSQP--ASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD 173 H + + + V SQP + L +P+ R + V ++ +F Sbjct: 123 HLINSSGNKIGSMNVMSQPYFVNRFLINLNPFIHSSIMVRKKALDNVGSYREKF------ 176 Query: 174 TELKAAQDYDIFLRMVVEY 192 + +QDYD+ LR+ +Y Sbjct: 177 ---RYSQDYDLILRLSDKY 192 >UniRef50_B2UPJ3 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPJ3_AKKM8 Length = 345 Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 47/143 (32%), Positives = 74/143 (51%), Gaps = 13/143 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P SI +P +N + ++SVL Q + ++E+IIVDD S+ ++ + + +DPRI Sbjct: 4 PFFSIIIPAYNLENYIAATLQSVLVQTFQDFEIIIVDDGSS--DETVSIIQSFHDPRIRL 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV----FLAHKQQLVTHAFLY 120 + + +N G RN + A G YI +D DD W P L + F H + L Y Sbjct: 62 V-SQVNGGVSRARNAGMKKAVGAYIAFLDGDDYWYPEHLELAADFFNRHPEILA-----Y 115 Query: 121 ANDYVCQGEVYSQPASLPLYPKS 143 AN Y+ + E+ + P P YP+S Sbjct: 116 ANRYM-RDELEAIPPRPPSYPES 137 >UniRef50_D1W2F4 Glycosyltransferase, group 2 family protein n=2 Tax=Prevotella RepID=D1W2F4_9BACT Length = 251 Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 3/104 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDP 60 N L+S+ +P +N + S+L+Q Y N+E++I DD ST S +L Q +T D Sbjct: 2 NTDLVSVIIPCYNSGIYIKECVDSILQQTYQNFEILITDDGSTDISTIKLLQQITQC-DK 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 RI D NSG A RN +I A+G YI D DD+W PN+L+ Sbjct: 61 RINVFFLDGNSGPAAARNNSIKHAKGRYIAFCDSDDKWLPNKLT 104 >UniRef50_B3QQK3 Glycosyl transferase family 2 n=1 Tax=Chlorobaculum parvum NCIB 8327 RepID=B3QQK3_CHLP8 Length = 505 Score = 73.9 bits (180), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 44/101 (43%), Positives = 59/101 (58%), Gaps = 4/101 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--NDPRI 62 P IS+ M +N + AI+SVL Q +SN+E+I+VDDCST + ++ V L +D RI Sbjct: 250 PKISVIMTAFNVENYIETAIRSVLAQSWSNFELIVVDDCST--DATRRIVKQLMEHDSRI 307 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL 103 I N N G RN+A LA GEY+T D DD P +L Sbjct: 308 KLIENQTNCGTYINRNKAYDLATGEYVTCHDSDDWAHPKKL 348 >UniRef50_Q46FX2 Putative uncharacterized protein n=1 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46FX2_METBF Length = 783 Score = 73.9 bits (180), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 59/209 (28%), Positives = 97/209 (46%), Gaps = 22/209 (10%) Query: 5 PLISIYMPTWNRQQLAI-RAIKSVLRQDYSNWEMIIVDDCSTS---WEQLQQYVTALNDP 60 P ISI MP +N ++ + +AI SV+ Q Y+NWE+ I DD ST L +Y + Sbjct: 261 PKISIIMPVYNVDEIWLEKAIDSVINQIYTNWELCIADDASTKPHIKSTLSKYSELDSRI 320 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 ++ Y+ N N G N+A++LA GE+I +D+DDE + + L + + +Y Sbjct: 321 KVKYLSN--NRGISGASNEALLLATGEFIGLLDNDDELSVDALYEVVKVLNKKSDIDLIY 378 Query: 121 AN-DYVCQGEVYSQPASLPLY-PKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 ++ D + P P + P +S +I + RF+ + Sbjct: 379 SDEDKIDMKGNRCNPFFKPGWSPDVLFSVNYVCHFAVIRKTIVDEVGRFR-----IGFEG 433 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHI 207 +QDYD+FLR+ E+ Q+ HI Sbjct: 434 SQDYDLFLRV---------AEKTNQVEHI 453 >UniRef50_D1RR43 Glycosyl transferase family 2 n=1 Tax=Serratia odorifera 4Rx13 RepID=D1RR43_SEROD Length = 316 Score = 73.9 bits (180), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 67/250 (26%), Positives = 106/250 (42%), Gaps = 43/250 (17%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS-------TSWEQLQQYVTALND 59 +S+ +PT+ R +L RAI SVL Q + E+I+VDD + E L Y+ + Sbjct: 21 VSVIIPTYGRSELLTRAIDSVLAQTHRALEIIVVDDNPQGDAHRLATRECLAGYI---DK 77 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 ++ Y N G RN I+ + GEYIT +DDDD + P +++ LA +Q L Sbjct: 78 GQVIYYPRRHNGGGALARNSGILRSSGEYITFLDDDDYYHPQKIAHQLAFMRQSDYDVSL 137 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 D + G++ + +Y+ G + F A + AA Sbjct: 138 CDMDILKNGQINADH---------------YYRARCDGLEDFMLAGVAYTPMIMMRRSAA 182 Query: 180 ------------QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF 227 QD+ IFL ++ G +H +H +ITSSPK GY Sbjct: 183 IAVRGFFNTPRYQDH-IFLYRLLAAGARIGTLHERLAVHNDHDGERITSSPKGIIGY--- 238 Query: 228 YRKHKDKFDR 237 ++K +F+R Sbjct: 239 --RNKMQFER 246 >UniRef50_D1BX31 Glycosyl transferase family 2 n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BX31_XYLCX Length = 264 Score = 73.9 bits (180), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 26/213 (12%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +++ MP +N ++ + +SVL Q +S E++IVDD S D R+ Sbjct: 7 SPPTVTVVMPAFNSERTLRDSARSVLDQSFSALELVIVDDSSADRTAAIATELGAGDARV 66 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 I N + G RN AI A+G Y+ D DD W P++L L Q T A L + Sbjct: 67 RLIRNPHSLGPAGARNAAISAARGRYVAFCDSDDLWLPSKLERQLEVATQ--TGAALVYS 124 Query: 123 DYVCQGEVYSQPAS------LPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTE- 175 Y +S PAS ++ + + RN++G + DTE Sbjct: 125 GYHRVDADFSGPASGFRPADRVVHVPTLLTHGALLHRNVVG---------CLTAMIDTEQ 175 Query: 176 --------LKAAQDYDIFLRMVVEYGEPWKVEE 200 + A+D+ ++LR++ E G ++E Sbjct: 176 TGPVSMPGIPGAEDWALWLRVLREGGTAAGIDE 208 >UniRef50_C6J2R9 Predicted protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2R9_9BACL Length = 698 Score = 73.9 bits (180), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 62/243 (25%), Positives = 113/243 (46%), Gaps = 30/243 (12%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDPR 61 N IS+ MPT+ +++ +I+S+L Q + N E+I+VDDCST + +++Y + D R Sbjct: 276 NRKISVIMPTYCAEKVIRTSIESILNQTWKNLELIVVDDCSTDNTVSIVKEYASI--DNR 333 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + I + NSG RN + +A GE++T D DD P +L + H L+ + + Sbjct: 334 VILIRAETNSGTYVARNLGLKIATGEFVTCQDADDWAHPQKLEKQILH---LIDNPSVLG 390 Query: 122 N----DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK 177 N + + + + + Y + +S +F + I+ + + + RF Sbjct: 391 NTSETSRITENLDFVRKGKVGCYISNNFSSLMFRRCEILSSVGYWDSVRF---------- 440 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF----HFYRKHKD 233 D + R+ +G+ V T IL ++ +++S +GYF FY K+ Sbjct: 441 -GADSEFIRRIRKVHGDKSIVNLKTCILSLS----RVSSGSLTDNGYFGFNGFFYGARKE 495 Query: 234 KFD 236 FD Sbjct: 496 YFD 498 >UniRef50_B9KX37 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KX37_RHOSK Length = 1042 Score = 73.9 bits (180), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 26/191 (13%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST---SWEQLQQYVTALNDPRITY 64 S+ MPT+NR RAI++VL Q + +E+IIVDD ST E +++ + +I Y Sbjct: 551 SVIMPTYNRGWCIGRAIRAVLAQTHRTFELIIVDDGSTDNTEQEVRRRFPDEIASGQIVY 610 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK------QQLVTHAF 118 I + N+G C RN +M A+G++I D D+E P LS+F AH+ + V F Sbjct: 611 IRLERNAGVCKARNIGMMRARGDWIAYADSDNEVRPYLLSMF-AHRILQDPDKDTVYGRF 669 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTW--AWRFKECLFDTEL 176 + N G+ P+ + N I VFT + K FD L Sbjct: 670 INVNSGAIIGK--------------PFESGDVLQGNFIDLGVFTHRRSLMGKLGCFDDLL 715 Query: 177 KAAQDYDIFLR 187 + D+D+ +R Sbjct: 716 RRLVDWDLIIR 726 >UniRef50_D1BX28 Glycosyl transferase family 2 n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BX28_XYLCX Length = 1038 Score = 73.9 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 41/105 (39%), Positives = 59/105 (56%), Gaps = 3/105 (2%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST-SWEQLQQYVTALNDP 60 + + L+S+ MP WNR+ AI SVL Q + WE+IIVDD ST S E++ + A DP Sbjct: 211 RGDVLVSVIMPAWNREDQISPAIDSVLAQTHEAWELIIVDDGSTDSTEEVVRAYMA-EDP 269 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV 105 RI ++ + G RN +A+GEY +D D+ WT L + Sbjct: 270 RIKFVSLS-HGGVARARNSGFGVAEGEYFAFLDTDNTWTSRHLEL 313 >UniRef50_B2VFP3 Glycosyl transferase n=1 Tax=Erwinia tasmaniensis RepID=B2VFP3_ERWT9 Length = 250 Score = 73.6 bits (179), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 44/146 (30%), Positives = 78/146 (53%), Gaps = 6/146 (4%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 + I +P +N + +RA++SV+ Q Y++W + +++D S+ + L+ D RIT I Sbjct: 8 VGIIIPMYNARATILRAVQSVIDQTYTDWHIYLINDRSSD-DSLELVQKNCPDARITIIS 66 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 N++NSGA A RN + A +I +D DD+W ++L + +A QL L +DY Sbjct: 67 NEVNSGAAATRNVGLAAASEPFIAFLDSDDQWDSDKLMLQVA---QLAAGDNLLISDYRY 123 Query: 127 Q--GEVYSQPASLPLYPKSPYSRRLF 150 Q + YS P ++ + ++ F Sbjct: 124 QSPSKSYSVSYGKPYLKQNGFMKKKF 149 >UniRef50_A4SE50 Glycosyl transferase, family 2 n=2 Tax=Chlorobium/Pelodictyon group RepID=A4SE50_PROVI Length = 213 Score = 73.6 bits (179), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 38/104 (36%), Positives = 65/104 (62%), Gaps = 5/104 (4%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST--SWEQLQQYVTALNDP 60 + P +S+ MPT+NR + AIKS+L Q +++WE++IVDD ST + + ++ Y+ +P Sbjct: 5 SKPCVSVIMPTFNRSRFIRTAIKSLLSQTFTDWELLIVDDGSTDSTMQIVEPYL--FENP 62 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS 104 R+ Y+ + N A RN + + G++IT +D DD + PN L+ Sbjct: 63 RVRYMKHS-NRKAPLSRNAGMQASFGKFITFLDSDDYYLPNHLA 105 >UniRef50_Q0C2C3 Glycosyl transferase, group 2 family protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C2C3_HYPNA Length = 325 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 37/107 (34%), Positives = 51/107 (47%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ MP + AI SV Q +WE+IIVDDCS A DPR+ Sbjct: 4 PKVSVVMPAYKAAGFIALAIASVRAQTLPDWELIIVDDCSPDDTAAVAMAAAGGDPRVKL 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ 111 + D N G RN A+G++I +D DD + P RL +A + Sbjct: 64 VRADQNGGVARARNLGNSHAKGDWIAVLDSDDAFEPARLETLVAAAE 110 >UniRef50_A0YLN7 Glycosyl transferase, group 2 family protein n=3 Tax=Oscillatoriales RepID=A0YLN7_9CYAN Length = 2105 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 58/189 (30%), Positives = 90/189 (47%), Gaps = 13/189 (6%) Query: 7 ISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT--ALNDPRIT 63 IS+ MP +N + +AI SV+ Q YSNWE+ I DDCS+ E + +++ A D RI Sbjct: 858 ISVVMPVYNPPIEFLEKAINSVINQVYSNWELCIADDCSSDPE-IAEFLNKFAEEDERIK 916 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N N A +LA GE+I +D DDE TPN L + + +Y +D Sbjct: 917 VIFRPENGNISRATNSAAVLATGEFILLLDHDDELTPNALGEIALYLVENPETDVMYTDD 976 Query: 124 --YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKAAQ 180 +G+ ++ P + K +S L IG+ + F++ + +Q Sbjct: 977 DKIDTEGKRFA-----PQF-KPDWSPELLLSYMYIGHALVLRRKLFEQVAGMRLGFEGSQ 1030 Query: 181 DYDIFLRMV 189 DYD+ LR+ Sbjct: 1031 DYDLALRIT 1039 Score = 44.3 bits (103), Expect = 0.004, Method: Compositional matrix adjust. Identities = 33/119 (27%), Positives = 61/119 (51%), Gaps = 8/119 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVD---DCSTSWEQLQQYVTA 56 + N P ++I +PT N+ L +KS+L + Y N+E++I+D D + E L A Sbjct: 1112 IDNGPSVTIIIPTKNQLNLLQACLKSLLEKTTYQNYEVMIIDNESDDPKTLEYLDWIALA 1171 Query: 57 LNDPRITYIHNDINSGA---CAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQ 111 +P+I+ + G A+ N+A+ Q +Y+ +++D E +P LS + + Q Sbjct: 1172 ETEPKISVLRVANPHGKFSFAAINNRAVEQTQTDYVLFLNNDTEIISPEWLSQMMGYIQ 1230 >UniRef50_Q98NV3 Mlr9727 protein n=1 Tax=Mesorhizobium loti RepID=Q98NV3_RHILO Length = 339 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 43/117 (36%), Positives = 58/117 (49%), Gaps = 5/117 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCST---SWEQLQQYVTALNDPRI 62 IS+ +PT+ R RAI SVL QD+SNWE+I+ DD +W L Y A DPRI Sbjct: 24 FISVAIPTFRRPDTIRRAIDSVLHQDFSNWELIVSDDEGPDGQTWAMLTDYARA--DPRI 81 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 + N G N ++ +G +I + DDD P L F QQ + AF+ Sbjct: 82 RIVENRRGRGQVENTNNVMLACRGSWIKLLHDDDWLAPKALGTFAQVAQQYPSAAFI 138 >UniRef50_Q8NTE4 Glycosyltransferases involved in cell wall biogenesis n=1 Tax=Corynebacterium glutamicum RepID=Q8NTE4_CORGL Length = 274 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 66/252 (26%), Positives = 118/252 (46%), Gaps = 24/252 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PLIS+ +PT + ++IKSV Q+Y NW++++V D + + Q+V RI Sbjct: 6 PLISVVIPTIAYDEYCSQSIKSVCEQNYENWQIVLVLD-GAPIKDVPQWVK--EHERIKI 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL----- 119 + I G N I + G+ I +D DD P+RLS ++ L H ++ Sbjct: 63 VEQKIRQGTPTSLNNGIKASDGQLIARLDSDDLAAPSRLS---KQEEFLRNHPYIICVAT 119 Query: 120 ---YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL--FDT 174 + N++ G+++ Q A L P S R++ +N I + + + E + + Sbjct: 120 KTKHINEH---GKIFGQSADL---PTSQDIRQILLVKNPIIHSSVMYRKQVVEQIGGYSL 173 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP-KKFSGYFHFYRKHKD 233 E+ +QDY++FLR+ + G ++E+ I+ G+ +SP KK++ R Sbjct: 174 EMTRSQDYELFLRL-SKIGAIGYLDESLSSYRIHGGQHSRKTSPFKKYTWIILKRRMELA 232 Query: 234 KFDRASKKYQLF 245 F + S Q+F Sbjct: 233 SFLKRSPVRQIF 244 >UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WM64_EGGLE Length = 333 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 41/132 (31%), Positives = 68/132 (51%), Gaps = 14/132 (10%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NPL+S+ +P +N + I+SV+ Q YSN E+++V+D ST A +D R+ Sbjct: 2 NPLVSVIIPVYNVESYVSECIESVIAQTYSNIEIVVVNDGSTDGSGFSCDQYACSDSRVV 61 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 +H + N G A RN I + +G+++ +D DD +P VF+ + + HA Sbjct: 62 VVHKE-NEGLSAARNAGIAVCRGDFVAFVDGDDFVSP----VFI----ETLMHAI----- 107 Query: 124 YVCQGEVYSQPA 135 VC E+ + P Sbjct: 108 EVCDCEIAAIPC 119 >UniRef50_C3RM81 Glycosyl transferase n=1 Tax=Mollicutes bacterium D7 RepID=C3RM81_9MOLU Length = 303 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 71/254 (27%), Positives = 114/254 (44%), Gaps = 32/254 (12%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-------- 57 L+S+ MPT+ R + +RAI SVL Q Y N E+++V+D E +Y L Sbjct: 7 LVSVIMPTYKRSEKLLRAIDSVLNQTYKNLELLLVND----NEPFDEYTELLKKRVEKYS 62 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 ND R I D + RN I A+G+YI +DDDD W N+L + + L Sbjct: 63 NDERFRLILQDKHINGAVARNVGIRQARGQYIAFLDDDDWWELNKLEEQVKELKSLDDSW 122 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRFKECLFDT- 174 + C+ +Y Q + + K+ R + ++I+ + V T + FDT Sbjct: 123 GAVS----CKFTLYDQNGN--VIGKTKKYRDGYIYKDILNLMSDVATGTLLMRHDYFDTT 176 Query: 175 -----ELKAAQDYDIFLRMVVEYGEPWKVEEATQILH-INHGEMQITSSPKKFSGYFH-F 227 L QD L+++V++ +K++E Q LH ++ + Q +K Y F Sbjct: 177 RYFDENLLRHQD----LQLLVDFTSKYKLKEVNQYLHCVDVSDTQNRPDAQKLIQYKKAF 232 Query: 228 YRKHKDKFDRASKK 241 +R K D S+K Sbjct: 233 FRSIKPVMDSLSEK 246 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77414 Putative colanic acid biosynthesis glycosyl tran... 424 e-117 UniRef50_C8QGU2 Glycosyl transferase family 2 n=1 Tax=Pantoea sp... 250 5e-65 UniRef50_C5A2Y9 Glycosyltransferase, family 2 n=1 Tax=Thermococc... 240 3e-62 UniRef50_A8FXL4 Glycosyltransferase involved in cell wall biogen... 240 6e-62 UniRef50_Q8U2R3 Glycosyl transferase n=2 Tax=Pyrococcus RepID=Q8... 236 5e-61 UniRef50_A6TBF1 Putative glucuronosyltransferase n=1 Tax=Klebsie... 234 2e-60 UniRef50_Q314K1 Glycosyltransferases involved in cell wall bioge... 232 8e-60 UniRef50_C6J2S1 Glycosyltransferase n=1 Tax=Paenibacillus sp. or... 228 2e-58 UniRef50_C2SSX0 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 226 9e-58 UniRef50_Q314K2 Glycosyltransferases involved in cell wall bioge... 225 2e-57 UniRef50_B0C0T9 Glycosyl transferase, group 2 family protein n=1... 223 6e-57 UniRef50_A0B7S5 Glycosyl transferase, family 2 n=1 Tax=Methanosa... 223 6e-57 UniRef50_B2JFA3 Glycosyl transferase family 2 n=10 Tax=Proteobac... 222 9e-57 UniRef50_D2LU70 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 222 1e-56 UniRef50_B9BSY2 Glycosyl transferase, family 2 n=4 Tax=Proteobac... 222 2e-56 UniRef50_D2Q6R3 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 222 2e-56 UniRef50_A0LST8 Glycosyl transferase, family 2 n=1 Tax=Acidother... 220 6e-56 UniRef50_C1DM76 Glycosyl transferase, family 2 protein n=1 Tax=A... 220 6e-56 UniRef50_B7K7D2 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 218 1e-55 UniRef50_A5W7E0 Glycosyl transferase, family 2 n=1 Tax=Pseudomon... 218 2e-55 UniRef50_B2T0J8 Glycosyl transferase family 2 n=2 Tax=Burkholder... 218 2e-55 UniRef50_Q67PP6 Glycosyl transferase n=1 Tax=Symbiobacterium the... 217 3e-55 UniRef50_A1U141 Glycosyl transferase, family 2 n=1 Tax=Marinobac... 217 4e-55 UniRef50_UPI0001C3906A glycosyl transferase family protein n=1 T... 215 1e-54 UniRef50_D0XH48 Putative uncharacterized protein n=1 Tax=Vibrio ... 215 1e-54 UniRef50_A0YSS8 Putative uncharacterized protein n=1 Tax=Lyngbya... 215 1e-54 UniRef50_B3PF93 Glycosyl transferase, putative, gt2I n=1 Tax=Cel... 215 1e-54 UniRef50_C6XS42 Glycosyl transferase family 2 n=1 Tax=Hirschia b... 215 2e-54 UniRef50_B8GFE2 Glycosyl transferase family 2 n=1 Tax=Methanosph... 215 2e-54 UniRef50_A0M6I2 TuaG-like glycosyl transferase n=2 Tax=Flavobact... 214 3e-54 UniRef50_Q1Q4P0 Similar to glycosyltransferase family 2 n=1 Tax=... 214 3e-54 UniRef50_Q8YSM1 Alr3063 protein n=7 Tax=Cyanobacteria RepID=Q8YS... 213 4e-54 UniRef50_Q8YSM2 Alr3062 protein n=6 Tax=Nostocaceae RepID=Q8YSM2... 213 4e-54 UniRef50_D2ETY9 Glycosyl transferase, family 2 protein n=1 Tax=B... 213 5e-54 UniRef50_B4AD21 Glycosyltransferase n=17 Tax=Enterococcus faecal... 213 5e-54 UniRef50_A7M6Z6 Glycosyl transferase family 2 n=1 Tax=Klebsiella... 213 5e-54 UniRef50_Q2YCB6 Glycosyl transferase, family 2 n=1 Tax=Nitrososp... 213 6e-54 UniRef50_Q9K6L6 Glycosyltransferase n=1 Tax=Bacillus halodurans ... 213 7e-54 UniRef50_Q8PZ42 Glycosyltransferase n=1 Tax=Methanosarcina mazei... 212 8e-54 UniRef50_B5WL01 Glycosyl transferase family 2 n=2 Tax=Burkholder... 212 1e-53 UniRef50_B8D172 Glycosyl transferase family 2 n=4 Tax=Bacteria R... 212 2e-53 UniRef50_B4VMI0 Glycosyl transferase, group 2 family protein n=2... 212 2e-53 UniRef50_B7R2A7 Glycosyltransferase n=1 Tax=Thermococcus sp. AM4... 211 2e-53 UniRef50_A2VSE1 Glycosyl transferase n=1 Tax=Burkholderia cenoce... 211 2e-53 UniRef50_Q2NH10 Predicted glycosyltransferase n=3 Tax=cellular o... 211 2e-53 UniRef50_B2J2I1 Glycosyl transferase, family 2 n=7 Tax=Cyanobact... 211 2e-53 UniRef50_B5W037 Glycosyl transferase family 2 n=2 Tax=Arthrospir... 211 3e-53 UniRef50_O58167 Putative uncharacterized protein PH0430 n=1 Tax=... 210 3e-53 UniRef50_Q0ACF0 Glycosyl transferase, family 2 n=4 Tax=Ectothior... 210 4e-53 UniRef50_Q6QW83 Putative glycosyl transferase n=1 Tax=Azospirill... 209 9e-53 UniRef50_C3WFL5 Glycosyl transferase n=4 Tax=Bacteria RepID=C3WF... 209 1e-52 UniRef50_C7QLJ5 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 209 1e-52 UniRef50_C3RR63 Glycosyl transferase n=2 Tax=Bacteria RepID=C3RR... 209 1e-52 UniRef50_B7KAG8 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 208 1e-52 UniRef50_A1W3G7 Glycosyl transferase, family 2 n=1 Tax=Acidovora... 208 1e-52 UniRef50_C6BYR5 Glycosyl transferase family 2 n=5 Tax=Desulfovib... 208 1e-52 UniRef50_Q21E22 B-glycosyltransferase-like protein n=2 Tax=Alter... 207 3e-52 UniRef50_UPI0001AF4953 glycosyl transferase, group 2 family prot... 207 3e-52 UniRef50_B1FXK1 Glycosyl transferase family 2 n=2 Tax=Burkholder... 207 3e-52 UniRef50_A0YLN7 Glycosyl transferase, group 2 family protein n=3... 207 3e-52 UniRef50_Q0VR09 Glycosyl transferase, group 2 family protein, pu... 207 4e-52 UniRef50_Q7U8H3 Putative glycosyltransferase family 2 protein n=... 207 4e-52 UniRef50_B4CZV6 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 207 5e-52 UniRef50_D2RDR5 Glycosyl transferase family 2 n=1 Tax=Archaeoglo... 206 6e-52 UniRef50_A3I1Y0 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1... 206 6e-52 UniRef50_C6X6S7 Glycosyl transferase family 2 n=2 Tax=Methylophi... 206 7e-52 UniRef50_A0YST1 Putative uncharacterized protein n=1 Tax=Lyngbya... 206 7e-52 UniRef50_A8TT37 Glycosyl transferase, group 2 family protein n=2... 206 7e-52 UniRef50_C7TKV7 Glycosyl transferase, group 2 n=5 Tax=Lactobacil... 206 8e-52 UniRef50_Q9X4C3 Glycosyltransferase WcaN n=2 Tax=Enterobacteriac... 206 9e-52 UniRef50_A6QB23 Glycosyl transferase n=1 Tax=Sulfurovum sp. NBC3... 205 1e-51 UniRef50_C3RR62 Glycosyl transferase n=3 Tax=Bacteria RepID=C3RR... 205 1e-51 UniRef50_A8ZWI7 Glycosyl transferase family 2 n=2 Tax=Desulfobac... 205 1e-51 UniRef50_A4J8Z3 Glycosyl transferase, family 2 n=1 Tax=Desulfoto... 205 1e-51 UniRef50_A6ELC1 TuaG n=3 Tax=Bacteroidetes RepID=A6ELC1_9BACT 205 1e-51 UniRef50_C2P8I0 Family 2 glycosyltransferase n=10 Tax=Bacteria R... 205 2e-51 UniRef50_B5L396 WfdV n=10 Tax=Proteobacteria RepID=B5L396_ECOLX 205 2e-51 UniRef50_B8HRA4 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 204 2e-51 UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=... 204 2e-51 UniRef50_B3E8X3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 204 3e-51 UniRef50_B1JX11 Glycosyl transferase family 2 n=1 Tax=Burkholder... 204 3e-51 UniRef50_B4U6C1 Glycosyl transferase family 2 n=1 Tax=Hydrogenob... 203 4e-51 UniRef50_Q3A3M0 Glycosyltransferase n=1 Tax=Pelobacter carbinoli... 203 5e-51 UniRef50_Q8YSL1 Alr3073 protein n=2 Tax=Nostocaceae RepID=Q8YSL1... 203 5e-51 UniRef50_D0MF25 Glycosyl transferase family 2 n=1 Tax=Rhodotherm... 203 6e-51 UniRef50_A6H2F4 Glycosyl transferase, group 2 family protein n=2... 203 6e-51 UniRef50_C4KBW6 Glycosyl transferase family 2 n=1 Tax=Thauera sp... 203 6e-51 UniRef50_Q116A2 Glycosyl transferase, group 1 n=1 Tax=Trichodesm... 203 7e-51 UniRef50_B3E7E3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 202 9e-51 UniRef50_B4S4G2 Glycosyl transferase family 2 n=1 Tax=Prosthecoc... 202 1e-50 UniRef50_B6ARX8 Putative glycosyl transferase, family 2 n=1 Tax=... 202 1e-50 UniRef50_Q2SD88 Glycosyltransferase involved in cell wall biogen... 202 2e-50 UniRef50_A6C910 Truncated O-antigen biosynthesis protein n=1 Tax... 202 2e-50 UniRef50_A6BHD3 Putative uncharacterized protein n=1 Tax=Dorea l... 201 2e-50 UniRef50_A4A6G7 Glycosyltransferase n=1 Tax=Congregibacter litor... 201 2e-50 UniRef50_C7GBF7 Glycosyltransferase n=1 Tax=Roseburia intestinal... 201 2e-50 UniRef50_Q72QG5 Glycosyltransferase n=2 Tax=Leptospira interroga... 201 3e-50 UniRef50_B8I318 Glycosyl transferase family 2 n=2 Tax=Clostridiu... 201 3e-50 UniRef50_A3W1Y5 Glycosyltransferase n=1 Tax=Roseovarius sp. 217 ... 200 3e-50 UniRef50_B0TYV9 Putative teichuronic acid biosynthesis glycosyl ... 200 3e-50 UniRef50_A3W9Y7 Glucosyltransferase n=1 Tax=Erythrobacter sp. NA... 200 5e-50 UniRef50_A8RSJ6 Putative uncharacterized protein n=1 Tax=Clostri... 199 7e-50 UniRef50_B4WMJ9 Glycosyl transferase, group 2 family protein n=1... 199 8e-50 UniRef50_B1ZW34 Glycosyl transferase family 2 n=1 Tax=Opitutus t... 199 8e-50 UniRef50_Q74D09 Glycosyl transferase, group 2 family protein n=1... 199 8e-50 UniRef50_Q04TI3 Glycosyltransferase n=8 Tax=Leptospira RepID=Q04... 199 9e-50 UniRef50_B9M0H9 Glycosyl transferase family 2 n=3 Tax=Proteobact... 199 9e-50 UniRef50_C0CGY4 Putative uncharacterized protein n=1 Tax=Blautia... 199 1e-49 UniRef50_D2RIR4 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 198 1e-49 UniRef50_A6Q2S9 Glycosyl transferase n=1 Tax=Nitratiruptor sp. S... 198 1e-49 UniRef50_A1AUB0 Glycosyl transferase, family 2 n=1 Tax=Pelobacte... 198 1e-49 UniRef50_A0LJ94 Glycosyl transferase, family 2 n=2 Tax=Deltaprot... 198 1e-49 UniRef50_B7DLY9 Glycosyl transferase family 2 n=1 Tax=Alicycloba... 198 2e-49 UniRef50_C4XII1 Putative glycosyltransferase n=1 Tax=Desulfovibr... 198 2e-49 UniRef50_A6M2M4 Glycosyl transferase, family 2 n=1 Tax=Clostridi... 198 2e-49 UniRef50_A0KM86 Glycosyl transferase, group 2 family protein n=2... 198 2e-49 UniRef50_Q15RB6 Glycosyl transferase, family 2 n=1 Tax=Pseudoalt... 198 2e-49 UniRef50_D1BX31 Glycosyl transferase family 2 n=1 Tax=Xylanimona... 198 3e-49 UniRef50_Q3A667 Glycosyltransferase n=1 Tax=Pelobacter carbinoli... 197 3e-49 UniRef50_Q0BQ52 Glycosyltransferase n=1 Tax=Granulibacter bethes... 197 3e-49 UniRef50_D0D9G6 Glycosyl transferase family 2 n=1 Tax=Citreicell... 196 6e-49 UniRef50_C7RJP4 Glycosyl transferase family 2 n=1 Tax=Candidatus... 196 8e-49 UniRef50_A5ZV71 Putative uncharacterized protein n=2 Tax=Firmicu... 196 8e-49 UniRef50_Q47GL6 Glycosyl transferase, family 2 n=1 Tax=Dechlorom... 196 9e-49 UniRef50_A4BPR7 Glycosyl transferase, group 2 family protein n=1... 196 1e-48 UniRef50_A1ZLY5 Glycosyl transferase domain protein n=1 Tax=Micr... 195 1e-48 UniRef50_Q052L4 Glycosyltransferase n=10 Tax=Leptospira RepID=Q0... 195 1e-48 UniRef50_A3XRJ7 TuaG n=1 Tax=Leeuwenhoekiella blandensis MED217 ... 195 1e-48 UniRef50_Q8KI14 Similar to Glycosyl transferase n=1 Tax=Pseudomo... 195 2e-48 UniRef50_Q1ZRU0 Glycosyltransferase involved in cell wall biogen... 195 2e-48 UniRef50_C1VEY2 Glycosyl transferase n=1 Tax=Halogeometricum bor... 194 2e-48 UniRef50_B2GLA0 Putative glycosyltransferase n=1 Tax=Kocuria rhi... 194 3e-48 UniRef50_B9M559 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 194 3e-48 UniRef50_C4V652 Family 2 glycosyltransferase n=1 Tax=Selenomonas... 193 4e-48 UniRef50_C9K187 Putative glycosyltransferase n=1 Tax=Klebsiella ... 193 4e-48 UniRef50_A5G2M9 Glycosyl transferase, family 2 n=1 Tax=Acidiphil... 193 4e-48 UniRef50_B0SAI4 Glycosyltransferase n=2 Tax=Leptospira biflexa s... 193 4e-48 UniRef50_B3QJY1 Glycosyl transferase family 2 n=1 Tax=Rhodopseud... 193 4e-48 UniRef50_Q46FX2 Putative uncharacterized protein n=1 Tax=Methano... 193 4e-48 UniRef50_B8HRB2 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 193 5e-48 UniRef50_B7JHV6 Glycosyl transferase, group 2 family protein n=5... 193 5e-48 UniRef50_A0KUZ3 Glycosyl transferase, family 2 n=7 Tax=Gammaprot... 193 5e-48 UniRef50_C0ECJ2 Putative uncharacterized protein n=1 Tax=Clostri... 193 5e-48 UniRef50_A9AAL8 Glycosyl transferase family 2 n=2 Tax=Methanococ... 193 5e-48 UniRef50_C5SH24 Glycosyl transferase family 2 n=1 Tax=Asticcacau... 193 6e-48 UniRef50_C4F932 Putative uncharacterized protein n=1 Tax=Collins... 193 6e-48 UniRef50_C2Y456 Teichuronic acid biosynthesis glycosyl transfera... 193 6e-48 UniRef50_B7MWU8 Glycosyltransferase n=1 Tax=Escherichia coli ED1... 193 7e-48 UniRef50_C1DRS5 Glycosyl transferase, family 2 n=1 Tax=Azotobact... 192 8e-48 UniRef50_B8HLM9 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 192 9e-48 UniRef50_B0S8S5 Glycosyltransferase n=2 Tax=Leptospira biflexa s... 192 9e-48 UniRef50_C6XYA6 Glycosyl transferase family 2 n=1 Tax=Pedobacter... 192 1e-47 UniRef50_Q5WBN3 Glycosyltransferase n=1 Tax=Bacillus clausii KSM... 192 1e-47 UniRef50_Q10VK0 Glycosyl transferase, family 2 n=3 Tax=Oscillato... 192 1e-47 UniRef50_Q5V6E2 Glycosyl transferase n=1 Tax=Haloarcula marismor... 192 1e-47 UniRef50_B6I8A2 Putative glycosyltransferase n=1 Tax=Escherichia... 192 1e-47 UniRef50_C7NBG2 Glycosyl transferase family 2 n=4 Tax=Fusobacter... 192 1e-47 UniRef50_A0YLN1 Glycosyl transferase, family 2 n=2 Tax=Bacteria ... 191 2e-47 UniRef50_B2JT44 Glycosyl transferase family 2 n=1 Tax=Burkholder... 191 2e-47 UniRef50_B0CGB5 Glycosyl transferase, family 2, putative n=1 Tax... 191 2e-47 UniRef50_D2L8E2 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 191 2e-47 UniRef50_Q72X17 Glycosyl transferase domain protein n=4 Tax=Baci... 191 2e-47 UniRef50_A0YK73 Glycosyl transferase n=5 Tax=Cyanobacteria RepID... 191 2e-47 UniRef50_B7JW84 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 191 2e-47 UniRef50_Q8DGK0 Tlr2317 protein n=2 Tax=Thermosynechococcus elon... 191 2e-47 UniRef50_C6CSD6 Glycosyl transferase family 2 n=1 Tax=Paenibacil... 191 2e-47 UniRef50_D1W2F4 Glycosyltransferase, group 2 family protein n=2 ... 191 3e-47 UniRef50_B4WF86 Glycosyl transferase, group 2 family protein n=1... 191 3e-47 UniRef50_A5G401 Glycosyl transferase, family 2 n=1 Tax=Geobacter... 191 3e-47 UniRef50_Q2WB46 Glycosyltransferase n=1 Tax=Magnetospirillum mag... 191 3e-47 UniRef50_B0JRA6 Glycosyl transferase family 2 n=1 Tax=Microcysti... 191 3e-47 UniRef50_Q2SBM9 Glycosyltransferase involved in cell wall biogen... 190 3e-47 UniRef50_A4G480 Putative Glycosyl transferase, family 2 n=1 Tax=... 190 4e-47 UniRef50_C0X9N7 Group 2 family glycosyl transferase n=24 Tax=Ent... 190 4e-47 UniRef50_C6XWT6 Glycosyl transferase family 2 n=1 Tax=Pedobacter... 190 4e-47 UniRef50_A6H0Q4 Glycosyl transferase, group 2 family protein n=1... 190 4e-47 UniRef50_B6YUK0 Glycosyltransferase n=3 Tax=Thermococcaceae RepI... 190 4e-47 UniRef50_C0CGZ7 Putative uncharacterized protein n=1 Tax=Blautia... 190 4e-47 UniRef50_C6QNL6 Glycosyl transferase family 2 n=1 Tax=Geobacillu... 190 6e-47 UniRef50_Q39U31 Glycosyl transferase, family 2 n=1 Tax=Geobacter... 190 6e-47 UniRef50_C9Y383 Putative uncharacterized protein n=1 Tax=Cronoba... 190 6e-47 UniRef50_D0L0W1 Glycosyl transferase family 2 n=1 Tax=Halothioba... 189 7e-47 UniRef50_Q6MBF6 Putative glycosyltransferase n=1 Tax=Candidatus ... 189 1e-46 UniRef50_C1SJ24 Glycosyl transferase n=1 Tax=Denitrovibrio aceti... 189 1e-46 UniRef50_A5ZX57 Putative uncharacterized protein n=1 Tax=Ruminoc... 189 1e-46 UniRef50_A4BIW6 Glycosyltransferase n=1 Tax=Reinekea blandensis ... 188 1e-46 UniRef50_A6LGW8 Glycosyltransferase family 2 n=3 Tax=Bacteroidal... 188 1e-46 UniRef50_UPI000197ADF7 hypothetical protein BACCOPRO_00943 n=1 T... 188 1e-46 UniRef50_Q11KM8 Glycosyl transferase, family 2 n=1 Tax=Chelativo... 188 1e-46 UniRef50_B7HZB4 Beta-1,3-N-acetylglucosaminyltransferase n=37 Ta... 188 1e-46 UniRef50_Q5FRX1 O-antigen biosynthesis protein RfbC n=1 Tax=Gluc... 188 2e-46 UniRef50_C0QK40 Putative glycosyl transferase (Group 2 family pr... 188 2e-46 UniRef50_B6BQY7 Glycosyltransferase n=1 Tax=Candidatus Pelagibac... 188 2e-46 UniRef50_Q1N7I9 Predicted glycosyltransferase n=1 Tax=Sphingomon... 187 3e-46 UniRef50_C5RQE2 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 187 3e-46 UniRef50_C1DGR7 Glycosyl transferase, family 2 protein n=1 Tax=A... 187 4e-46 UniRef50_D1YZY1 Putative glycosyltransferase n=1 Tax=Methanocell... 187 4e-46 UniRef50_C1QAA4 Predicted glycosyltransferase n=1 Tax=Brachyspir... 187 4e-46 UniRef50_B5JJR3 Glycosyl transferase, group 2 family protein n=1... 186 5e-46 UniRef50_Q2LWN5 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 186 5e-46 UniRef50_A3U8H8 Glycosyl transferase n=1 Tax=Croceibacter atlant... 186 7e-46 UniRef50_C0YGQ4 Possible glycosyl transferase family protein n=1... 186 7e-46 UniRef50_Q11L33 Glycosyl transferase, family 2 n=1 Tax=Chelativo... 185 1e-45 UniRef50_A6QAH7 Putative uncharacterized protein n=1 Tax=Sulfuro... 185 1e-45 UniRef50_A3VRJ5 Glycosyltransferase n=1 Tax=Parvularcula bermude... 185 1e-45 UniRef50_Q1PUV4 Putative uncharacterized protein n=1 Tax=Candida... 185 1e-45 UniRef50_C6MKP2 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 185 1e-45 UniRef50_B7K6R4 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 185 1e-45 UniRef50_Q3SLZ0 Glycosyltransferase n=1 Tax=Thiobacillus denitri... 185 1e-45 Sequences not found previously or not previously below threshold: UniRef50_B4VZW9 Methyltransferase domain family n=2 Tax=Cyanobac... 214 2e-54 UniRef50_B1ZZR1 Glycosyl transferase family 2 n=1 Tax=Opitutus t... 207 3e-52 UniRef50_C2H8B4 Family 2 glycosyltransferase n=13 Tax=Enterococc... 207 5e-52 UniRef50_D0HB89 Glycosyl transferase group 2 family protein n=2 ... 206 8e-52 UniRef50_Q7BG51 WsaE n=2 Tax=Bacillaceae RepID=Q7BG51_BACST 203 5e-51 UniRef50_C2DBA9 Family 2 glycosyl transferase n=22 Tax=Enterococ... 203 7e-51 UniRef50_B3PFZ6 Glycosyl transferase, putative, gt2E n=1 Tax=Cel... 202 2e-50 UniRef50_C6JDU8 Putative uncharacterized protein n=1 Tax=Ruminoc... 198 1e-49 UniRef50_UPI000174589B glycosyl transferase, family 2 n=1 Tax=Ve... 197 4e-49 UniRef50_Q093P1 Glycosyl transferase, group 2 family protein (Fr... 195 1e-48 UniRef50_Q094Y6 O-antigen biosynthesis protein RfbC n=1 Tax=Stig... 195 1e-48 UniRef50_D0L0V1 Glycosyl transferase family 2 n=2 Tax=Gammaprote... 194 2e-48 UniRef50_C6ATV7 Glycosyl transferase family 2 n=1 Tax=Rhizobium ... 194 4e-48 UniRef50_C7M1X8 Glycosyl transferase family 2 n=1 Tax=Acidimicro... 193 5e-48 UniRef50_D0IZR1 Glycosyl transferase, family 2 n=1 Tax=Comamonas... 192 1e-47 UniRef50_C6MPM7 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 191 2e-47 UniRef50_B8FY70 Glycosyl transferase family 2 n=2 Tax=Desulfitob... 191 2e-47 UniRef50_C6M2Q8 Glycosyltransferase n=2 Tax=Neisseria RepID=C6M2... 190 3e-47 UniRef50_B4AW59 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 190 4e-47 UniRef50_A3CRX6 Glycosyl transferase, family 2 n=1 Tax=Methanocu... 190 5e-47 UniRef50_B9M560 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 188 1e-46 UniRef50_B8IAC9 Glycosyl transferase family 2 n=1 Tax=Methylobac... 188 1e-46 UniRef50_A9EG08 Putative uncharacterized protein n=1 Tax=Oceanib... 188 2e-46 UniRef50_C6ARE2 Glycosyl transferase family 2 n=4 Tax=Rhizobium ... 188 2e-46 UniRef50_D1ALB8 Glycosyl transferase family 2 n=1 Tax=Sebaldella... 188 3e-46 UniRef50_O87159 WblC protein n=1 Tax=Vibrio cholerae RepID=O8715... 187 3e-46 UniRef50_Q74BU3 Glycosyl transferase, group 2 family protein n=1... 187 3e-46 UniRef50_A4EB36 Putative uncharacterized protein n=1 Tax=Collins... 187 4e-46 UniRef50_A1WXB8 Glycosyl transferase, family 2 n=1 Tax=Halorhodo... 187 4e-46 UniRef50_C8WIW4 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 187 4e-46 UniRef50_B9XD27 Glycosyl transferase family 2 n=1 Tax=bacterium ... 186 5e-46 UniRef50_Q5P2A3 Glycosyl transferase, family 2 n=1 Tax=Aromatole... 186 6e-46 UniRef50_Q12TY4 Glycosyl transferase, family 2 n=1 Tax=Methanoco... 186 7e-46 UniRef50_Q47DD2 Glycosyl transferase, family 2:Polysaccharide de... 186 9e-46 UniRef50_Q97H38 Glycosyltransferase n=1 Tax=Clostridium acetobut... 186 1e-45 UniRef50_B3PFA2 Glycosyl transferase, putative, gt2L n=1 Tax=Cel... 185 1e-45 UniRef50_A5UVV3 Glycosyl transferase, family 2 n=1 Tax=Roseiflex... 185 1e-45 UniRef50_A1ANW6 Glycosyl transferase, family 2 n=2 Tax=Desulfuro... 185 1e-45 UniRef50_B1WXJ1 Glycosyl transferase, family 2 n=2 Tax=Cyanothec... 185 1e-45 UniRef50_UPI0001788F1F glycosyl transferase family 2 n=1 Tax=Geo... 185 1e-45 UniRef50_B8IAF3 Glycosyl transferase family 2 n=1 Tax=Methylobac... 185 1e-45 >UniRef50_P77414 Putative colanic acid biosynthesis glycosyl transferase wcaA n=98 Tax=Enterobacteriaceae RepID=WCAA_ECOLI Length = 279 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. Identities = 279/279 (100%), Positives = 279/279 (100%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP Sbjct: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY Sbjct: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ Sbjct: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK Sbjct: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR Sbjct: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 >UniRef50_C8QGU2 Glycosyl transferase family 2 n=1 Tax=Pantoea sp. At-9b RepID=C8QGU2_9ENTR Length = 286 Score = 250 bits (638), Expect = 5e-65, Method: Composition-based stats. Identities = 88/270 (32%), Positives = 135/270 (50%), Gaps = 6/270 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+++Y+PT NR L RAI SV Q Y N E+II DD S+ ++ D RI Sbjct: 8 QPLVTVYIPTHNRSSLVKRAIASVQNQTYRNLEIIICDDGSSDDTEVVIAKMKEQDSRII 67 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y+ ND GAC RN+ I +A+GE+ITG+DDDD + R++ F+ Q+L FL AN Sbjct: 68 YLKNDRPMGACHARNRCIAIAKGEFITGLDDDDYFLNTRIANFVTASQKL-NKPFLCANM 126 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 + + S + + +N +GNQVF K FD A QDY Sbjct: 127 IFKNRNIEKKWKSYV----GDITLNMMGFKNWVGNQVFIRTSMMKNIGGFDINFPAWQDY 182 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D++ RM+++YG +++++A+ +++I+ +IT+ K + GY F HK R K Sbjct: 183 DLWYRMIMQYGSCYRIDDASYVVNIDDDRPRITTGSKAWQGYQKFITTHKPSLSRQHLKS 242 Query: 243 QLFTLYQIRNKRMTWRTLLTLLSVRNGKRL 272 F R + ++ S+ N L Sbjct: 243 LYFQDKLNRKDEINFKEFFHYPSIDNLLIL 272 >UniRef50_C5A2Y9 Glycosyltransferase, family 2 n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A2Y9_THEGJ Length = 304 Score = 240 bits (614), Expect = 3e-62, Method: Composition-based stats. Identities = 77/237 (32%), Positives = 119/237 (50%), Gaps = 10/237 (4%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +S+ +PT+NR L RAI SVL Q + ++E+I+VDD S + + V ++ND RI Sbjct: 2 SRPTVSVIIPTYNRANLLRRAITSVLNQTFRDFELIVVDDAS--PDNTPEVVESINDGRI 59 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLV-THAFLYA 121 Y+ NSG RN I A+G++I +DDDDEW P+RL + + L +Y Sbjct: 60 RYVRLKKNSGGPVARNTGIKKARGKFIALLDDDDEWLPHRLETQVRKFETLEQNIGVVYG 119 Query: 122 N-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQV-FTWAWRFKE-CLFDTELKA 178 YV Q + LP Y + Y + N IG+ FK LFD L + Sbjct: 120 GFYYVSQQDGRILGKRLPAYKGNVY--DKLLRENFIGSPTLLIRRECFKRAGLFDPNLSS 177 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QD+D++LR+ Y + V+E +++ ++ + K G F RKH D + Sbjct: 178 SQDWDMWLRIARHYKFDY-VDEIVAKYYVHGRQISF-NMKKYIPGRERFIRKHLDIW 232 >UniRef50_A8FXL4 Glycosyltransferase involved in cell wall biogenesis-like protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FXL4_SHESH Length = 317 Score = 240 bits (612), Expect = 6e-62, Method: Composition-based stats. Identities = 91/275 (33%), Positives = 134/275 (48%), Gaps = 19/275 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + + P++S+YMPT NR+ L RA++SVL Q Y N+E+IIVDD S A ++P Sbjct: 29 LDHGPMVSVYMPTHNREHLLKRAVESVLAQTYQNFELIIVDDGSRDGSPDYLQSLASSEP 88 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ + I GACA RN AI A+G+YITG+DDDDE+ PNRL L+ + Sbjct: 89 RVRFFCQPIAQGACAARNIAIKEAKGKYITGLDDDDEFLPNRLEQLLSSYDEKYA----- 143 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRNIIGNQVFTWAWRFK-ECLFDTELK 177 +VCQG + + + L N NQV T R + F+ E Sbjct: 144 ---FVCQGTFWHYGSHKKALDANAMIINLSQMLDYNYSTNQVLTETKRLQSIGGFNPEFP 200 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD- 236 A QDYD + R++++YG ++ A+ I+H H +I S GY F ++ Sbjct: 201 ACQDYDTWTRLILKYGAAKRIAGASYIIHQGHEGPRIISKSNMQRGYKKFSEEYGSLMSK 260 Query: 237 --RASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNG 269 R ++KY + R ++ L SVR G Sbjct: 261 GNRINQKYLALV-----SSRKRFKLLDFFQSVRYG 290 >UniRef50_Q8U2R3 Glycosyl transferase n=2 Tax=Pyrococcus RepID=Q8U2R3_PYRFU Length = 298 Score = 236 bits (603), Expect = 5e-61, Method: Composition-based stats. Identities = 76/235 (32%), Positives = 120/235 (51%), Gaps = 12/235 (5%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +S+ +PT NR +L RAI SVL Q + ++E+I+VDD S+ + + ++ D RI Sbjct: 2 SRPTVSVIIPTHNRAKLLKRAIISVLNQTFEDFEIIVVDDASS--DNTPNIIESIKDSRI 59 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLYA 121 YI + NSGA RN I A+G++I +DDDDEW P +L V + +L +Y Sbjct: 60 RYIRLEKNSGAPTARNIGIKKARGKFIALLDDDDEWLPRKLEVQVKKFNELEKKFGVVYG 119 Query: 122 N-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKE-CLFDTELKA 178 YV Q LP + + Y K N IG+ V FK+ LFD +L + Sbjct: 120 GFYYVSQQNERIIGKRLPRFRGNVYGE--LLKENFIGSPTVLIRRECFKKAGLFDPKLPS 177 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 +QD+D+++R+ Y + V + +++ ++ + K G F KHKD Sbjct: 178 SQDWDMWVRIARYYYFDYIVAK----YYVHGHQISFNINEKYIPGREIFISKHKD 228 >UniRef50_A6TBF1 Putative glucuronosyltransferase n=1 Tax=Klebsiella pneumoniae subsp. pneumoniae MGH 78578 RepID=A6TBF1_KLEP7 Length = 291 Score = 234 bits (598), Expect = 2e-60, Method: Composition-based stats. Identities = 81/265 (30%), Positives = 130/265 (49%), Gaps = 8/265 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS--TSWEQLQQYVTALN 58 M PL+S+Y+PT NR +L A+KSV+ Q Y N E+II +D S + + + + Sbjct: 1 MNKKPLVSVYIPTHNRAELLSNAVKSVINQTYKNIEIIICNDGSGDNTDDVVNKLQKEYK 60 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D Y+ ND GAC RN+ I A G Y+TG+DDDD + PNR+ + + Sbjct: 61 DIPFVYLKNDSPMGACFSRNRCIAAASGYYLTGLDDDDYFLPNRIESLIKVCNER-KIDL 119 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 + +N G + + + N++GNQ+ T F++ FDT Sbjct: 120 VCSNLIFKDGNKLKRGKN----NSGVITSIDMGYENLVGNQLLTRLSYFEDVGGFDTNFP 175 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 A QDYD++ R+++++G K EAT ++ + + +I++S K GY F KH + Sbjct: 176 AWQDYDLWYRIIMKFGPCIKTNEATYVMDVENDRKRISTSSKAHIGYRSFIEKHSETLSE 235 Query: 238 ASKKYQLFTLYQIRNKRMTWRTLLT 262 K RNK++ +TLL+ Sbjct: 236 EMKSSLFIRDLINRNKKIPLKTLLS 260 >UniRef50_Q314K1 Glycosyltransferases involved in cell wall biogenesis-like n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314K1_DESDG Length = 312 Score = 232 bits (593), Expect = 8e-60, Method: Composition-based stats. Identities = 69/292 (23%), Positives = 120/292 (41%), Gaps = 27/292 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M P +S+ +PT+N L + ++SV Q Y ++E+I+VDD ST + A + Sbjct: 1 MTTTPRVSVVIPTYNYGHLLEQCLESVFAQTYKDYEVILVDDGSTDDTPG---IAARYEG 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y + N G + RN+ I A+G I +D DD W P++L+V A + +Y Sbjct: 58 RIRYFRKE-NGGPSSARNEGIRQARGPLIAFLDSDDLWLPHKLAVQTAFADAHPEYGLVY 116 Query: 121 AN-DYVCQGEV----YSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDT 174 + +V G V Y L + + V F + D Sbjct: 117 CDMQHVEHGRVIHHSYLHERGYTCTGSGWKYVDLLREGFVFTPTVLVRKECFSAAGMHDE 176 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH-FYRKHKD 233 L+ QD D++LR+ Y P+ + ++ HG+ T++ + + + F R+ + Sbjct: 177 GLRTCQDLDMWLRIAKLY--PFGFIDEPLVVRQYHGDNSTTNAERYHANHIRVFLRELES 234 Query: 234 KFDRASKK--------------YQLFTLYQIRNKRMTWRTLLTLLSVRNGKR 271 D +K+ Y F+ R R +R++L + R Sbjct: 235 TADETAKQVLRGRLSEQYFALGYHYFSTGSRRQARRCFRSVLNYGMRADAVR 286 >UniRef50_C6J2S1 Glycosyltransferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2S1_9BACL Length = 309 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 67/256 (26%), Positives = 108/256 (42%), Gaps = 8/256 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 +NP ISI MP +NRQ+ +I +L Q Y+N+E+IIVDD ST + + + D R Sbjct: 3 NHNPKISIIMPVYNRQETIKNSIFCILNQSYTNFELIIVDDAST--DNTSAIILSFEDER 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I +I + N GA A RN I ++G++I+ D DD W P++L + + Sbjct: 61 IKHIVLNKNVGAAAARNIGIRQSKGDWISFQDSDDYWEPDKLRKQVELINDRSPSIIFSS 120 Query: 122 NDYVCQG--EVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVF--TWAWRFKECLFDTELK 177 G E + PK N I + K F+ ++ Sbjct: 121 FIRYKYGSEEYIPRGRKYVNLPKEGNLHNQLLLGNFIATPTVLIPKTFLEKVGGFNEQMP 180 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QD++++LR+ Y W ++E +H + + K GY + HK F Sbjct: 181 RFQDWELWLRLSAYYPFIW-IDEPLVRVHYTENSIS-SDMSKMIRGYELIWNLHKKMFVD 238 Query: 238 ASKKYQLFTLYQIRNK 253 A +Y L+ + Sbjct: 239 AGPEYAAQFLFSYGHN 254 >UniRef50_C2SSX0 Glycosyl transferase family 2 n=1 Tax=Bacillus cereus BDRD-ST196 RepID=C2SSX0_BACCE Length = 300 Score = 226 bits (576), Expect = 9e-58, Method: Composition-based stats. Identities = 68/221 (30%), Positives = 110/221 (49%), Gaps = 4/221 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ LIS+ +PT+NR +L R I SVL Q Y+N+E+IIVDD ST + + V +D Sbjct: 1 MEMEVLISVVVPTYNRSELIKRTIDSVLAQTYTNFELIIVDDAST--DNTEDIVNEYHDD 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI +I + NS RN I ++G+YI +D DDEW PN+L L + + Sbjct: 59 RIKFIKLNENSKGTKPRNMGIKESKGDYIALLDSDDEWLPNKLESQLNFLRAFNDDNMVC 118 Query: 121 ANDYVCQG-EVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 D + +G + + L+ S +F +N + + ++ + K+ LF+ LK Sbjct: 119 FTDLILKGTKKTVYSNNRDLFENEDISEYIFLGKNWVQTSTYMFSSKLGKQTLFNPTLKK 178 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 QD+D LR+ + E I + + E +I ++ K Sbjct: 179 HQDWDFCLRLKKNGAKFVNFPEHLAIYYSDDREGRIGNNLK 219 >UniRef50_Q314K2 Glycosyltransferases involved in cell wall biogenesis-like n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314K2_DESDG Length = 321 Score = 225 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 58/258 (22%), Positives = 114/258 (44%), Gaps = 8/258 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + P +S+ MP + + A++SVL QD++++E+++VDD S ++ + +D Sbjct: 17 MSDRPAVSVIMPVYKTEAYLAEAVRSVLEQDFTDFELLVVDDGSPG--DVRGILAGFDDA 74 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y+H++ N G RN+ I + G Y+ +D DD W P++LS+ +A + +Y Sbjct: 75 RIRYLHHE-NRGPGYTRNRGIRESTGRYVAFLDSDDAWMPHKLSIQVAALDAAPQYDVVY 133 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD-TELKAA 179 + G + P+ L+ I + F + +L+ + Sbjct: 134 SQRETMDGHGRTVAGYTPVLHSGDVLDELYVDSFICMSSAMVRRGVFAVSGYIAEDLRIS 193 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK---KFSGYFHFYRKHKDKFD 236 QDYD +LR+ + V+E ++ ++ + + Y F R+H + Sbjct: 194 QDYDFWLRVACHH-RFLAVDEPLVRYRVHGTQVSRQVEERVRVVWQIYERFNREHGHRVS 252 Query: 237 RASKKYQLFTLYQIRNKR 254 +++ + Y R R Sbjct: 253 PRARRRSRASFYNGRADR 270 >UniRef50_B0C0T9 Glycosyl transferase, group 2 family protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C0T9_ACAM1 Length = 337 Score = 223 bits (568), Expect = 6e-57, Method: Composition-based stats. Identities = 72/249 (28%), Positives = 116/249 (46%), Gaps = 11/249 (4%) Query: 1 MKN-NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 MKN PL+S+ +PT+NR L I AI+SVL Q ++E+I+VDD ST + QQ V ++D Sbjct: 1 MKNSKPLVSVIIPTFNRANLLIPAIQSVLNQTIEDFELIVVDDAST--DDTQQKVAGIDD 58 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT-HAF 118 R+ YI + N G C RN + AQG+YI +D DDEW PN+L L + Sbjct: 59 SRLRYILQEKNLGECGTRNTGLGAAQGQYIAFLDSDDEWLPNKLEKQLGVFRSAPDEVGV 118 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRFKECL-FDTE 175 +Y+ V + P + +N+IG + V + L FDT Sbjct: 119 VYSWLQVINDQGKVVRMRKPNIHGDV--KDYLIYKNLIGTPSTVMIKQECIEPGLQFDTN 176 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK-FSGYFHFYRKHKDK 234 L+ D+D++L++ Y + E + + + + +++ G+ F KH Sbjct: 177 LRCCGDWDMWLQISKNY-RFEVIPEPLTLYRDHDEDARGSTNHAMVTEGHLVFLDKHHQG 235 Query: 235 FDRASKKYQ 243 ++ Sbjct: 236 IKEFYRQPS 244 >UniRef50_A0B7S5 Glycosyl transferase, family 2 n=1 Tax=Methanosaeta thermophila PT RepID=A0B7S5_METTP Length = 302 Score = 223 bits (568), Expect = 6e-57, Method: Composition-based stats. Identities = 62/200 (31%), Positives = 98/200 (49%), Gaps = 6/200 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +PT+NR + RAI+SVL Q Y ++E+I+VDD ST + ++ V + +D RI YI Sbjct: 4 VSVILPTYNRAHVVGRAIRSVLEQTYEDFELIVVDDGST--DSTEEVVRSFDDRRIRYIR 61 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA-HKQQLVTHAFLYANDYV 125 + N G A RN I +A+GEYI D DDEW P +L + K +Y Y+ Sbjct: 62 HKQNKGRSAARNTGIKIAKGEYIAFQDSDDEWLPEKLEEQMEVFKTSSPQVGVVYTGFYL 121 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWA--WRFKECLFDTELKAAQDYD 183 + S + K K N +G K +FD L +D++ Sbjct: 122 FYDDKKIYIPSANVKTKDGNIYDELLKGNFVGTPAAVVRAECLKKVGMFDESLHCLEDWE 181 Query: 184 IFLRMVVEYGEPWKVEEATQ 203 +F+R+ EY + +++A Sbjct: 182 LFIRISKEYIFKY-IDKALV 200 >UniRef50_B2JFA3 Glycosyl transferase family 2 n=10 Tax=Proteobacteria RepID=B2JFA3_BURP8 Length = 632 Score = 222 bits (567), Expect = 9e-57, Method: Composition-based stats. Identities = 76/283 (26%), Positives = 115/283 (40%), Gaps = 16/283 (5%) Query: 1 MKNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTA 56 M P IS+ MPT+N AI SV Q Y +WE+ I DD STS E L+QY+ Sbjct: 78 MAEPPRISVVMPTYNADPVWLAEAIDSVRNQIYPHWELCIADDASTSPEVRPLLEQYIR- 136 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D RI + N A N A+ L +++ +D DD P+ L + Sbjct: 137 -EDSRIRVAFREKNGHISAASNSALELVTSDWVGLLDHDDLLAPHALYFVAKEIIRRPDA 195 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTE 175 LY++ + ++ Y K + LFY +N+I + V+ + F T Sbjct: 196 RLLYSD----EDKIDPMGRRHDPYFKCDMNIDLFYSQNMISHFGVYQKRLLDEIGGFRTG 251 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHIN-HGEMQITSSPKKFSGYFHFYRKHKDK 234 L+ +QD+D+ LR + G V + H H S K R + Sbjct: 252 LEGSQDHDVALRCLERIGAKSIVHIPRVLYHWRVHPSSTAASGDAKPYALIAGERALTEH 311 Query: 235 FDRASKKYQLFTL---YQIRNKRMTWRTLLTL-LSVRNGKRLA 273 F R + L Y++R + R L++L + RNG L Sbjct: 312 FARQGIDASVEGLPFGYRVRYRLPEHRPLVSLIIPTRNGVGLL 354 Score = 92.2 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 49/243 (20%), Positives = 92/243 (37%), Gaps = 22/243 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALND- 59 ++ PL+S+ +PT N L + I S+ + Y+ +E+I++D+ S + Y+ +L Sbjct: 336 EHRPLVSLIIPTRNGVGLLKQCIDSIQDKTTYAPYEIIVIDNGS-DEQATLDYLESLKSA 394 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLV 114 I I +D A+ N AI A GE + I++D D W +S+ L V Sbjct: 395 DNIRVIRDDRPFNYSALNNMAIGSANGELVGLINNDIEVITDTWLEEMVSLALQPGVGAV 454 Query: 115 THAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLF--YKRNII---------GNQVFTW 163 LY + + V + + + + R F + RN + + Sbjct: 455 GAKLLYPDGTIQHAGVVTGLGGVAGHAHRLFPRDAFGYFSRNAVISSFSAVTAACLIIRK 514 Query: 164 AWRFKECLFDTE--LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 + + + A D D LR+ A ++ H +P+K Sbjct: 515 SIYEQVGGLNEADLTVAFNDVDFCLRVRDAGYRNVWTPFA-ELYHHESATRGAEDNPEKV 573 Query: 222 SGY 224 + + Sbjct: 574 ARF 576 >UniRef50_D2LU70 Glycosyl transferase family 2 n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LU70_BACS4 Length = 266 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. Identities = 68/260 (26%), Positives = 115/260 (44%), Gaps = 5/260 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 K+ PLISI PT+N + I+SVL Q +WEMIIVDDCST D R Sbjct: 12 KSMPLISIITPTYNSENYIADTIESVLAQTLDDWEMIIVDDCSTDNTTSVIRSYVEKDER 71 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + I + NSGA RN AI + G+YI +D DD+W P +L L + Q+ A Sbjct: 72 VKLIQLEKNSGAAIARNTAISHSVGKYIAFLDGDDQWLPEKLEKQLQYMQE--KGASFSF 129 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 Y + + ++ P++ +L K N+IG + ++++ QD Sbjct: 130 TKYKTMDQKGEETGTIVDAPETVNYHQLL-KHNVIGCLTVMLDVEKVGQVKMVDIRSRQD 188 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 Y ++L + + E ++E ++ + ++ K + YR+ +D S Sbjct: 189 YVLWLELCKKGFEAHGIQEVLAKYRVSQHSLS-SNKVKMAKQNWKVYREIEDLSLLKSLW 247 Query: 242 YQLFTLYQIRNKRMTWRTLL 261 Y + + + + + +L Sbjct: 248 YFINYAF-YKLSKSLFNKIL 266 >UniRef50_B9BSY2 Glycosyl transferase, family 2 n=4 Tax=Proteobacteria RepID=B9BSY2_9BURK Length = 632 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. Identities = 67/278 (24%), Positives = 115/278 (41%), Gaps = 17/278 (6%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDPR 61 PLISI +PT+N ++ I+SVL Q Y +WE+ I DD S+S A D R Sbjct: 89 RPLISIIVPTYNSDERFLREMIESVLAQVYPHWELCIADDASSSESVRAVLEAARARDDR 148 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I ++ +N N A+ +A GEYI +D DD P+ L + + + + LY+ Sbjct: 149 IKVVYRPVNGHISEASNSALEIATGEYIALLDHDDVIPPHALFMVVKYLNKHPEARMLYS 208 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKAAQ 180 + + ++ + Y KS ++ +LF +N+ + V+ A F + +Q Sbjct: 209 D----EDKLSADGERTSPYFKSDWNPQLFLAQNMFSHLGVYETALVRAAGGFRKGFEGSQ 264 Query: 181 DYDIFLRM--VVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 DYD+ LR + + + I G + S K + R ++ DR Sbjct: 265 DYDLALRCVEIAGHNAVVHIPHVLYHWRILPGSTASSGSEKPY-ALLAAIRALEEHLDRT 323 Query: 239 SKKYQLFT----LYQIRNKRMTWRTLL---TLLSVRNG 269 + + +R K R ++ R+G Sbjct: 324 HTRAIVEHPCDQHSTLRVKYALPRPAPKTSIIIPTRDG 361 Score = 104 bits (261), Expect = 3e-21, Method: Composition-based stats. Identities = 55/264 (20%), Positives = 100/264 (37%), Gaps = 22/264 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P SI +PT + + + I S+ + Y N+E+I+VD+ S + E L + + D R+ Sbjct: 350 PKTSIIIPTRDGLSILKQCIDSIFAKTIYPNYEIIVVDNGSVNPETLAYFDSLRTDSRVR 409 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ----LVTHAF 118 + +D A+ N+A+ +A G+Y+ +++D + +P+ L + Q V A Sbjct: 410 ILRDDSPFNFSALNNRAVEIATGDYVCLLNNDIEIISPDWLDELVGIASQPGNGAVGAAL 469 Query: 119 LYANDYVCQGEVYSQPASLPLY-------PKSPYSRRLFYKRNII----GNQVFTWAWRF 167 Y N+ + G V + + + Y R +N+ V + A Sbjct: 470 WYPNNTLQHGGVVIGLGGVAGHMHTLLPRGQYGYFCRAVAMQNLSAVTAACLVVSKAIYR 529 Query: 168 KECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH 226 + + EL A D D LR+ A H E S Y Sbjct: 530 EVGGLNEELAVAFNDVDFCLRVREAGYRNVWTPYAELY----HHESATRGSDMDPDKYQR 585 Query: 227 FYRKHKDKFDRASKKYQLFTLYQI 250 F + + R + ++ Y Sbjct: 586 FVGEVRWMQSRWAGHFEFDPAYNP 609 >UniRef50_D2Q6R3 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q6R3_9BIFI Length = 280 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. Identities = 71/244 (29%), Positives = 113/244 (46%), Gaps = 8/244 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT+NR +IKSVL Q + ++E+I+VDD ST + V A D RI Y Sbjct: 2 PKVSVVIPTFNRADTIGDSIKSVLEQTFKDFEVIVVDDGSTDG--TESVVAAFGDSRIKY 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I D N+GACA RN I A GEYI D DD W P +L L + + + Sbjct: 60 IMQD-NAGACAARNNGIRHANGEYIAFQDSDDYWMPRKLERQLQNISLHDSKIDICKMHS 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAAQDYD 183 + +P + ++ +S RN + Q + F++ LFD ++ QD+D Sbjct: 119 AGLPKDEYEPCRQAI--EANFSYEYLLGRNFVSTQMILARHELFEDNLFDVKIPRFQDWD 176 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 + LR++ +Y + +E +I+ + + K Y +K+ D+ Y Sbjct: 177 LMLRLLRDYKLSY-TDEVLVERYISADSIS-QNVHKALEAYSLIEKKYSDELSSHRDSYS 234 Query: 244 LFTL 247 LF Sbjct: 235 LFMY 238 >UniRef50_A0LST8 Glycosyl transferase, family 2 n=1 Tax=Acidothermus cellulolyticus 11B RepID=A0LST8_ACIC1 Length = 734 Score = 220 bits (560), Expect = 6e-56, Method: Composition-based stats. Identities = 59/262 (22%), Positives = 106/262 (40%), Gaps = 11/262 (4%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 ++ P IS+ +P N ++ R I SVL Q Y W++ + DD S++ Q A Sbjct: 206 LRRTPTISVLVPVHNAAERWLRRCIDSVLAQSYPYWQLCLADDASSAPHVRQVLEEYAAR 265 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RIT I +N A N A+ AQG++I +D DDE P+ L A + Sbjct: 266 DARITAIFRPVNGHISAASNSALDAAQGDFIALLDHDDELAPDALLHVAALLNEHPDADM 325 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 +Y++ + ++ + K +S L + V+ + F + Sbjct: 326 IYSD----EDKINEDGVRQEPFFKPDWSPELLLSHMYTCHLGVYRTELVRRIGGFRLGFE 381 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 +QDYD+ LR+ + + + + G ++ K + Y +R ++ R Sbjct: 382 GSQDYDLVLRLTEQTDRIYHIPRVLYHWRRHPGSAAASADAKPY-AYAAAHRAIREALHR 440 Query: 238 ASKKYQLFTLYQIRNKRMTWRT 259 + T+ +I R +R Sbjct: 441 RGA---VATVEEITEARGNFRV 459 Score = 69.1 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 43/254 (16%), Positives = 78/254 (30%), Gaps = 23/254 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWE--QLQQYVTALN 58 N PLISI + N + I+++ + Y N E+I+V D + L + Sbjct: 464 SNQPLISIVVAAPNTGVIDPDCIRALFDRTAYPNIEVIVVHDGRNDEQLRDLLTFWHTQE 523 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQ 112 R + + N A G+ + + D D W L + Sbjct: 524 PKRFRVESLRTPARRSRLMNHGADCASGDLLVFLHGDVAVGTAD-WLDEMAGYALREEIG 582 Query: 113 LVTHAFLYANDYVCQ--------GEVYSQPASLPLYPKSPYSRRLFYKR--NIIGNQVFT 162 V L + + G + P + ++R L + + Sbjct: 583 AVGPVLLRPDGTIDHAGFVLGIAGGIGHPFRGFPAHHPGYFNRLLIPSNVAAVSSTCLMV 642 Query: 163 WAWRFK-ECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 F FD EL A D D LR+ + + E++ ++P++ Sbjct: 643 RCQAFHAAGGFDDELSGAIGDADFCLRLRQRGHRTVLLPYVRLCHYAPEPEVE-KTAPRQ 701 Query: 221 FSGYFHFYRKHKDK 234 S + + D Sbjct: 702 MSLLRRRWATYVDH 715 >UniRef50_C1DM76 Glycosyl transferase, family 2 protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DM76_AZOVD Length = 292 Score = 220 bits (560), Expect = 6e-56, Method: Composition-based stats. Identities = 84/276 (30%), Positives = 139/276 (50%), Gaps = 8/276 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+S+Y+PT NR + RA++SVL Q Y+N E+++ DD ST + A ++ Sbjct: 1 MRPAPLVSVYIPTRNRLEKLERALRSVLGQTYANHEILVCDDASTDGTFERISRLARSER 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 +I Y+ N GAC+ RN I A+GE+ITG+DDDDE+TP+RL L +AF+ Sbjct: 61 KIRYLRNPAPRGACSARNLGIFAARGEFITGLDDDDEFTPDRLETLLDAWDD--GYAFVC 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAA 179 +N + + + A PL ++ R+L K N+ NQV T R ++ F ++ Sbjct: 119 SNFWSQRPGQRLR-AYYPLQAQTFTLRQLLLK-NLATNQVLTRTGRLQDIGGFREGVRRL 176 Query: 180 QDYDIFLRMVVEY-GEPWKVEEATQILHINHGEM--QITSSPKKFSGYFHFYRKHKDKFD 236 QD+D +LR+ Y G + ++H +H +++ S +++D +D Sbjct: 177 QDWDTWLRLCSGYGGRFHRARTPLYVMHHDHEAEAQRVSRSIALDVALEELCERNRDLYD 236 Query: 237 RASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRL 272 S++ L ++R + L L RN L Sbjct: 237 ARSRRLLLSHARELRGEFSLGDLWLNTLHRRNSGPL 272 >UniRef50_B7K7D2 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7K7D2_CYAP7 Length = 1067 Score = 218 bits (557), Expect = 1e-55, Method: Composition-based stats. Identities = 68/286 (23%), Positives = 115/286 (40%), Gaps = 19/286 (6%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTA 56 +K PLIS+ MP +N + AI+SVL Q YSNWE+ I DD ST L +Y+ Sbjct: 531 LKYKPLISVIMPVYNPPENYLREAIESVLNQVYSNWELCIADDASTEPHVKLVLNEYLK- 589 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D RI I + N N A+ LA GE+I +D DD TP+ Sbjct: 590 -QDSRIKVIFRENNGHISKASNSALELATGEFIALLDHDDVLTPHAFYELALLLNSHPEA 648 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTE 175 +Y++ Q + Q + Y K +S F + ++ + + F Sbjct: 649 DMIYSD----QDYMDDQGQLINPYFKPDWSPDAFLANMYTCHLGLYRHSIVKEIGGFRVG 704 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 + +QDYD+ LR + + + + + I K + + + + Sbjct: 705 FEGSQDYDLVLRFTEKTERIFHLPDILYHWRTHAASTNINPQAKSY-AFIAARKALAEAL 763 Query: 236 DRASKKYQLFTL------YQIRNKRMTWRTLLTLLSVRN-GKRLAD 274 R + +F + Y IR + + +++ +N G+RL + Sbjct: 764 TRRKEPGAVFDIPNYPGCYTIRYQIKDPELVSIIITTKNLGERLDN 809 Score = 91.5 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 92/245 (37%), Gaps = 26/245 (10%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYV--TA 56 +K+ L+SI + T N + + S+ Q Y N+E+I++D+ ST + L+ Sbjct: 788 IKDPELVSIIITTKNLGERLDNCLTSIF-QKTTYQNYEVILIDNGSTEPQALETIEHWKN 846 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITG-------IDDDDEWTPNRLSVFLAH 109 + R+ Y ++ + + N A+ +QG+Y+ + D W + Sbjct: 847 RENFRLKYYGLELPFNSSKINNFAVDKSQGKYLLFLKNETKILTSD--WLEKMIEQAQRP 904 Query: 110 KQQLVTHAFLYANDYVCQGEVY-------SQPASLPLYPKSPYSRRLFYKRNII---GNQ 159 V LY ++ V V Y S Y RL Y N + G Sbjct: 905 SIGAVGALLLYPDNTVQHAGVICGLFGRVGHSHKHYQYGASGYFGRLVYVHNYMAVTGAC 964 Query: 160 VFTWAWRFKE-CLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS 217 + F+E F+ +L + D D+ L+++ + + + + + Sbjct: 965 LMCRREVFEEVGGFEEKLAVSYNDIDLCLKLIDKGYRNLCLPHVVLYHYESKTRGYDSRD 1024 Query: 218 PKKFS 222 +KF+ Sbjct: 1025 TEKFA 1029 >UniRef50_A5W7E0 Glycosyl transferase, family 2 n=1 Tax=Pseudomonas putida F1 RepID=A5W7E0_PSEP1 Length = 1509 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 69/282 (24%), Positives = 114/282 (40%), Gaps = 17/282 (6%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--N 58 K+ PLISI MP +N L AI SV Q Y NWE+ I DD ST E + Y+ L Sbjct: 968 KDQPLISIIMPVYNPPMDLLREAIDSVTAQLYKNWELCIADDASTDPEVI-AYLKGLTKQ 1026 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D R + + N N A+ LA G Y+ +D+DD + L + A Sbjct: 1027 DQRFKVVLREKNGHISKASNSALELASGRYVALMDNDDLLPAHALYWIARAVHENPDAAV 1086 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 +Y++ + ++ + Y K+ ++ LF N+I + V+ + F ++ Sbjct: 1087 IYSD----EDKIDVEGNRSAPYFKTDWNLYLFRSHNMISHLGVYRRDLVQRVGGFREGME 1142 Query: 178 AAQDYDIFLRMV--VEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QDYD+ LR V V + + ++ G ++ K + Sbjct: 1143 GSQDYDLALRCVEQVRPDQIVHIPHVLYHWRVHPGSTAMSLGEKPY-AQIAGQNALDQHL 1201 Query: 236 DRASKKYQL----FTLYQIRNKRMTWRTLLTL-LSVRNGKRL 272 R K F +Y++ + L++L + RN L Sbjct: 1202 QRTGIKGHAELLDFGMYRVHYDLPADQPLVSLIIPTRNAVGL 1243 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 55/255 (21%), Positives = 109/255 (42%), Gaps = 20/255 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ +PT N L + I+S+ + Y ++E+I+VD+ S E LQ + + Sbjct: 1227 DQPLVSLIIPTRNAVGLVRQCIESITSKTSYQHYEIILVDNGSDDPESLQYFESLKLQQN 1286 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ----LVTH 116 I + +D A+ N+A+ A+GE + I++D + +P+ LS ++ Q V Sbjct: 1287 IRVLRDDGPFNYSALNNRAVREARGELVGLINNDIEVISPDWLSEMVSLALQPGAGAVGA 1346 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPY---SRRLFYKRNIIGNQVFTWAWRF--KECL 171 Y ++ + G V P +L + F + ++I A ++ + Sbjct: 1347 RLWYPDNRLQHGGVIMGPLTLAGHAHKMLPRGHHGYFGRASLIQGMSAVTAACLIVRKSI 1406 Query: 172 FDT-------ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG 223 F+ ELK A D D L+++ + A + H + +P+K Sbjct: 1407 FEEVGGLNEVELKIAFNDVDFCLKVMAAGYQNIWTPNAD-LYHHESATRGLEDTPEKIER 1465 Query: 224 YFHFYRKHKDKFDRA 238 + R ++K+ + Sbjct: 1466 FKSEVRYVQEKWAKI 1480 >UniRef50_B2T0J8 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B2T0J8_BURPP Length = 723 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 64/285 (22%), Positives = 114/285 (40%), Gaps = 17/285 (5%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 + PLIS+ +PT+N ++L + SV Q Y +WE+ I DD ST A Sbjct: 177 LARKPLISVVVPTFNSDERLLREMVASVQAQIYPHWELCIADDASTQPHVKAVLEEVAAA 236 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + + N N A+ LA GEY+ +D DD P+ L + + Sbjct: 237 DSRIKVVFRETNGHISEASNSALALATGEYVALLDHDDLLPPHALYMVARYINLHPHGRM 296 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 Y++ + ++ ++ Y K ++ ++F +N+ + VF F + Sbjct: 297 FYSD----EDKLTTEGKRTTPYFKCDWNPQMFLTQNMFSHLGVFETKLVRDAGGFRKGFE 352 Query: 178 AAQDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QDYD+ LR V G + I G + S K + R +D Sbjct: 353 GSQDYDLALRCVELAGDDSVIHIPHVLYHWRIVPGSTAGSGSEKPY-ALVAAIRALEDHL 411 Query: 236 DRASKKYQLFTLYQ------IRNKRMTWRTLLT-LLSVRNGKRLA 273 +RA+ + + +R + + ++ ++ R+G L Sbjct: 412 ERANISATVEHPVESLGVLRVRYTLPSPQPKVSIIIPTRDGLALL 456 Score = 101 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 48/256 (18%), Positives = 96/256 (37%), Gaps = 25/256 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 P +SI +PT + L + + SV Y N+E+IIVD+ S E ++ + P Sbjct: 438 SPQPKVSIIIPTRDGLALLKQCVDSVFAYTLYQNFEIIIVDNGSVKPETMKYFDEVSQRP 497 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQ----QLVT 115 + + ++ A+ N A +A GE++ +++D + +P+ L+ ++ V Sbjct: 498 NVRVLRDESPFNFSALNNHAARVATGEFLCLLNNDIEVISPDWLNEMVSLANLPRAGAVG 557 Query: 116 HAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWA--------WRF 167 Y D + G V + + R F G V T Sbjct: 558 ACLWYPTDALQHGGVVLGLGGIAGHMHHMMKRGHF---GYFGRAVATQNLSAVTAACLVV 614 Query: 168 KECLFDT--ELK-----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 K+ ++D L+ A D D ++++ A ++ H P K Sbjct: 615 KKSVYDEVGGLEEDLAVAFNDVDFCMKLLKAGYRNIWTPYA-EMYHHESATRGSDLDPDK 673 Query: 221 FSGYFHFYRKHKDKFD 236 + + R+ + +++ Sbjct: 674 YERFVSEIRRMEARWE 689 >UniRef50_Q67PP6 Glycosyl transferase n=1 Tax=Symbiobacterium thermophilum RepID=Q67PP6_SYMTH Length = 242 Score = 217 bits (554), Expect = 3e-55, Method: Composition-based stats. Identities = 61/240 (25%), Positives = 106/240 (44%), Gaps = 14/240 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE--QLQQYVTALN 58 M +NP +++ +P +N + A++S L Q YSN E+++VDD ST L++Y A Sbjct: 1 MPDNPKVTVVIPAYNAARYLAEAVESALNQTYSNLEVLVVDDGSTDRTPFLLERYGRA-- 58 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 I I N G + N+ I A+GE+I + DD + P +L+ +A+ + A Sbjct: 59 ---IRTIR-KRNGGTPSALNEGIRQARGEWIAWLSADDAFLPEKLAKQMAYAEAHPECAL 114 Query: 119 LYANDYVCQ--GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTE 175 +Y N +V G+ S AS ++ +L I G+ + + LFD Sbjct: 115 IYTNWFVVDGRGKTVSHLASPTFRSRADQVEKLLRGCVINGSTTLVRRDVYLRAGLFDES 174 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 L A D+D++LR+ +Y V E + +++ P + K + Sbjct: 175 LPQAHDWDMWLRLARDY-RFGHVAEPLLRYRWH--GENLSARPDALAYNDRVLAKARAYL 231 >UniRef50_A1U141 Glycosyl transferase, family 2 n=1 Tax=Marinobacter aquaeolei VT8 RepID=A1U141_MARAV Length = 299 Score = 217 bits (552), Expect = 4e-55, Method: Composition-based stats. Identities = 64/280 (22%), Positives = 111/280 (39%), Gaps = 9/280 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + PL+SI PT+NR A++SVL Q +E+I++DD ST + +D Sbjct: 1 MYSKPLVSIITPTFNRADYLPIAVESVLAQTMPEFELIVIDDGSTDNTS-EVMERYTSDS 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ Y + N G RN+ I A+G YI +D D+ W +LS L ++ +Y Sbjct: 60 RVKYFKQE-NQGQSIARNRGIEAAKGGYICFLDSDNAWLETKLSKSLQAFKENPEADIVY 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAA 179 + + + Q + + L + I N T F+E FD + A Sbjct: 119 GDFILIDEDGKEQGVNRMKRYSGRITSHLIHDNFISMNTTMTRRRCFEEMGGFDNADRLA 178 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 +DY ++LR Y + + E + ++ + + R + F A Sbjct: 179 EDYGLWLRFSTRYQFLY-LPEILGYYRVMENQIS-SDKDSRLMANEQIIRTFLESFPEAL 236 Query: 240 KKYQLFTLYQ---IRNKRMTWRTLLTLLSV-RNGKRLADG 275 + + +R R +++ GK LA+ Sbjct: 237 SRGEARHGMSRFFVRKGRYELSQGRYRIALADLGKSLAND 276 >UniRef50_UPI0001C3906A glycosyl transferase family protein n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C3906A Length = 847 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 64/277 (23%), Positives = 110/277 (39%), Gaps = 19/277 (6%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTALND 59 PLIS+ MP +N ++ +AI+SV Q Y WE+ I DD S+ L++Y A D Sbjct: 181 KPLISVIMPVYNTPEKFLKKAIESVTNQVYPYWELCIADDNSSEPWIKPLLKEY--ARKD 238 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI N CA N A+ LA GEYI +D DD TP L + + Sbjct: 239 NRIKVAFRTENGHICAASNSALELATGEYIALLDHDDVITPEALYEVALVLNENPDADMI 298 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKA 178 Y++ + ++ + Y K + F R + + F + Sbjct: 299 YSD----EDKLDQYGQRVFPYFKPDWCPDSFLARMYTCHLGVYRRELVNQVGNFRIGYEG 354 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QDYD+ LR+ + + + + + + G + K + Y + + F R Sbjct: 355 SQDYDLVLRVTEKTNKIFHIPKILYSWRQHSGSTAYDPNSKLY-AYIAAEKALNESFARC 413 Query: 239 SKKYQLFTL-------YQIRNKRMTWRTLLTLLSVRN 268 +K ++ + Y +R + + ++ RN Sbjct: 414 QEKARIVSDHENFPGQYTVRYHIEEHKLVSIIIPTRN 450 Score = 95.7 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 52/290 (17%), Positives = 101/290 (34%), Gaps = 33/290 (11%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWE--QLQQYVTALNDPRI 62 L+SI +PT N ++S+ + Y N+E++++D+ S E + R Sbjct: 441 LVSIIIPTRNLGTTLNHCLQSIFEKSTYPNYEVVVIDNGSDEQETFDIINAWKQKEPQRF 500 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQLVTHA---- 117 + D+ + N + A+G+Y+ +++D E TP+ + + Q+ A Sbjct: 501 SCYEFDVPFNYPQINNYGVTKARGDYLLFLNNDTEVLTPDWIEAMVEQAQRSSIGAVGAL 560 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN----------IIGNQVFTWAWRF 167 LY +D + V + + Y N + G + F Sbjct: 561 LLYPDDTIQHAGVVLGIGGVAGHSHKGYQAHHLGYYNQLVMVNNYSAVTGACLMCRREVF 620 Query: 168 -KECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 + D EL+ A D D L++V + + H +P+K + Sbjct: 621 DQVGGLDEELEVAFNDVDFCLKLVSRGYRNIYLPHV-VLYHYESKSRGYEDTPEKEA--- 676 Query: 226 HFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 F R+ K R + + L+ + G RL++ Sbjct: 677 RFKRESKIMKQRWRD---------LLKNDPCYNPNLSRIRQDYGIRLSND 717 >UniRef50_D0XH48 Putative uncharacterized protein n=1 Tax=Vibrio harveyi 1DA3 RepID=D0XH48_VIBHA Length = 278 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 84/267 (31%), Positives = 138/267 (51%), Gaps = 7/267 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L++IY+ T NR QL RAI+S Q Y N E+I+VDD ST Q P + Y Sbjct: 3 LVTIYIVTHNRSQLLKRAIESAYNQTYKNIEVIVVDDASTDDTQDIIKQMQATYP-LKYF 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 ND GAC RN A+ A GE ITG+DDDD + +R+ + + T++ + AN + Sbjct: 62 RNDEGRGACYSRNVALNAASGELITGMDDDDYFGASRVDDLVDAYED--TYSLVCAN--I 117 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDYDI 184 + + + + + +N++GNQ+ T + K FD + A QDYD Sbjct: 118 IEVTTDKEIHRKFGFEQGEFELNNLMYQNLVGNQMLTSVAKLKAIGGFDEAMPAFQDYDT 177 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSS-PKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++R+V +G+ +K+++ L+ H +I+SS +K G+ F+ KH++K + A KK Sbjct: 178 WVRLVDRFGKGYKIDKYNYYLNTEHAGERISSSNDRKTRGFELFFDKHQNKMNAAHKKSM 237 Query: 244 LFTLYQIRNKRMTWRTLLTLLSVRNGK 270 Y++ + + T++ L+S N K Sbjct: 238 EIMRYKVTGEMLPLLTMVRLISRDNYK 264 >UniRef50_A0YSS8 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YSS8_9CYAN Length = 321 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 62/215 (28%), Positives = 96/215 (44%), Gaps = 10/215 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P ISI +P +N I SVL+Q +S++E+IIV+D ST + + ++ D R+ Sbjct: 2 PQISIIIPVYNGATTIQATIDSVLKQTFSDFELIIVNDGST--DSTLEIISQYRDRRLQV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-ND 123 + ++GA A RN+ + A GEYI +D DD WTP++L L Q A Y+ D Sbjct: 60 LSYP-HAGASATRNRGLKQASGEYIAFLDADDLWTPDKLEAQLNALQTHPKAAVAYSWTD 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRFKECLFDTELKAAQ 180 ++ + + Q PY+ L Y N I N + FD L Q Sbjct: 119 FLDEAGSWRQAGRHTTVNGEPYAAMLLY--NFIESGSNPLIRRDALDDVGGFDESLSGGQ 176 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 D+D++LR+ Y V I + + Sbjct: 177 DWDLYLRLAARY-LFVNVPAVQIFYRIRNNSISSN 210 >UniRef50_B3PF93 Glycosyl transferase, putative, gt2I n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PF93_CELJU Length = 304 Score = 215 bits (548), Expect = 1e-54, Method: Composition-based stats. Identities = 68/277 (24%), Positives = 121/277 (43%), Gaps = 17/277 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-D 59 M+ PL+S+ + T+N Q +A++S+L Q + N E+I+VDD ST + +Q + A + Sbjct: 10 MQTQPLVSVVIATYNMGQYLAQAVESILNQTWKNLEVIVVDDGST--DDTEQVMQAYQAN 67 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ Y+ IN G +N I +G+YI D DD W P +L V L + Sbjct: 68 TRVYYLK-TINQGQPKAKNTGINNTKGDYIAFCDADDFWEPFKLEVQLPLFN-NPAVGVV 125 Query: 120 YA--NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK 177 Y+ ++ Q Y +PA+ + ++ + G V A + +FD E + Sbjct: 126 YSEVSNIDEQNRRYVRPANEVRHSGQVTNQLMIENFVPFGTSVIRRACIERNGVFDEEFR 185 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK---KFSGYFHFYRKHKDK 234 D+D++LR +++ + E T + + G+M + F KHK + Sbjct: 186 MGIDWDLWLRYSLDWEFAY-TPERTYVYRVWSGQMSTNYRGRYDFAIRILNKFVTKHKPQ 244 Query: 235 FDRASKKYQLFTLYQ------IRNKRMTWRTLLTLLS 265 + + +Y RN++ W L +L Sbjct: 245 LNPRYVRKAWADMYISKGVVYARNEKRFWMPLQHILH 281 >UniRef50_C6XS42 Glycosyl transferase family 2 n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XS42_HIRBI Length = 753 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. Identities = 66/280 (23%), Positives = 116/280 (41%), Gaps = 12/280 (4%) Query: 1 MKNNPLISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALN 58 + P+ISI P +N +L AI+SVL Q YS+WE+ + +DCST E + A Sbjct: 209 LSQKPVISIITPVYNPAPELLRAAIESVLSQYYSHWELCLANDCSTDPEIARIIDEYAEQ 268 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI +H N A N A+ LA GE+I +D DDE + L + + Sbjct: 269 DSRIKRVHRTQNGHISAASNSALELASGEFIAFLDHDDELSATALYHVAEAINKNPSCRL 328 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 Y++ + ++ Y KS ++ L N+ + V+ A + ++ Sbjct: 329 FYSD----EDKIDLDGKRHDPYFKSDWNHDLLLSHNLFTHLSVYDKALIEEVGGLRSKYD 384 Query: 178 AAQDYDIFLRMV--VEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY--FHFYRKHKD 233 AQDYD+ LR + E + + G ++S K ++ H Sbjct: 385 GAQDYDLALRCSARLRADEICHIPFILYHWRVMPGSTALSSDEKPYAMLAGERALNDHLI 444 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRTLLT-LLSVRNGKRL 272 + +K + +++ T ++ ++ RN ++L Sbjct: 445 ALNIKAKAELIGIGFKVSYDIPTPAPQVSIIIPTRNSQKL 484 Score = 94.2 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 45/236 (19%), Positives = 86/236 (36%), Gaps = 20/236 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +SI +PT N Q+L + + S+ + Y ++E+I+VD+ S E ++ + +I Sbjct: 470 PQVSIIIPTRNSQKLVKQCVDSIYNKTSYPDFEIILVDNGSDDPEAIKYFQDLEAKNQIK 529 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLVTHAF 118 I + A+ N A+ + + +++D D W +S+ L V Sbjct: 530 LISDPRPFNYSALNNLAVAQSNAPVLCLLNNDIEVISDNWLEEMVSLVLQPNVGAVGAML 589 Query: 119 LYANDYVCQGEVYSQPASLPLYP-------KSPYSRRLFYKRNII----GNQVFTWAWRF 167 Y +D + V L + Y R ++++ V + Sbjct: 590 YYPDDTIQHAGVVMGLGGLAAHIHGGLERGTPGYVGRAALRQSLSAVTGACMVVSRDNYE 649 Query: 168 KECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 K D E A D D L++ A ++ H ++P+K Sbjct: 650 KVSGLDEENLAVAYNDIDFCLKLQAIGKRNIWTPHA-ELYHHESASRGYENTPEKL 704 >UniRef50_B8GFE2 Glycosyl transferase family 2 n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GFE2_METPE Length = 310 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. Identities = 60/238 (25%), Positives = 105/238 (44%), Gaps = 8/238 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +S+ +PT+NR L AI+SVL Q Y + E+I+VDD S + ++ + + D R Sbjct: 3 STTPTVSVILPTYNRAHLLPTAIESVLNQTYRDLELIVVDDGS--KDSTEEVIRTITDER 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLY 120 + YI + N GA RN I A+G +I D DD+W P++L + + + +Y Sbjct: 61 LRYIRYEPNQGANHARNVGIQAARGPFIAFQDSDDDWFPDKLEKNMKLFEMVGPEVGVVY 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSP-YSRRLFYKRNIIGNQ--VFTWAWRFKECLFDTELK 177 + + G + + N + Q V K L+ L Sbjct: 121 SGYWKHMGNNQKMYIPFTWVKQRDGDVHHELMRGNFVTTQAAVVRKECFEKSGLWMEGLP 180 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 A Q++++FLR+ +Y + ++E + + T++ + G KH D+F Sbjct: 181 AKQEWELFLRISKDYKFAY-IDEPLLNSNFTETGISNTNTS-VYRGMEMVLNKHYDEF 236 >UniRef50_B4VZW9 Methyltransferase domain family n=2 Tax=Cyanobacteria RepID=B4VZW9_9CYAN Length = 1265 Score = 214 bits (546), Expect = 2e-54, Method: Composition-based stats. Identities = 63/286 (22%), Positives = 115/286 (40%), Gaps = 19/286 (6%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTALND 59 P+IS+ MPT+N + AI+SV+ Q Y WE+ I DD ST +++Y A + Sbjct: 732 QPVISVIMPTFNTSEHFLREAIESVINQVYPYWELCIADDASTQPHVKNIVEEY--AAKE 789 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI + N N A+ +A GE+I +D DD TP+ L + + Sbjct: 790 TRIKVVIRTTNGHISNASNSALEMATGEFIALLDHDDLLTPDALYEVALLLNRHPEADMI 849 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKA 178 Y++ + ++ Y K + F R + ++ K F + Sbjct: 850 YSD----EDKIDETNKVSSPYFKPDWCPDSFLSRMYTCHLGIYRRELVEKIGGFRVGYEG 905 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QDYD+ LR+ + + + + I H + + K Y + D R Sbjct: 906 SQDYDLVLRITEKTKNIFHIPKILYHWRI-HPQSAASGVEAKPYAYKAGEKALIDALHRR 964 Query: 239 SKKYQLF------TLYQIRNKRMTWRTLLTLLSVRN-GKRLADGIR 277 + + LY++R K ++ + ++ R+ G L + ++ Sbjct: 965 GENGIISGLPGFPGLYRVRYKIEDYKRVSIIIPTRDLGNVLNNCLK 1010 Score = 95.7 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 84/237 (35%), Gaps = 24/237 (10%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQ--QYVTALNDPRIT 63 +SI +PT + + +KS+ + Y N+E+I++D+ ST + +Q Y + R + Sbjct: 992 VSIIIPTRDLGNVLNNCLKSIFEKTAYPNYEVIVIDNGSTENDTVQIIDYWKSKEHERFS 1051 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQLVTHA 117 +I + N A+ A+G+Y+ +++D D + L Sbjct: 1052 CYPLNIEFNFSKINNYAVAKAKGDYLLFLNNDIEVITQDWI--DALVEQAQRPSIGAVGG 1109 Query: 118 FLYANDYVCQ--------GEVYSQPASLPLYPKSPYSRRLFYKRNI---IGNQVFTWAWR 166 L D Q G V S Y+ +L N G + Sbjct: 1110 LLLYPDKSIQHAGVVLGIGGVASHGHKKYPSTSPGYAGQLITINNYSAITGACLMCRREV 1169 Query: 167 FK-ECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 ++ F EL A D D L+++ + + H + + S +K Sbjct: 1170 YEAVGGFAEELAIAYNDVDFCLKLLRQGYRNIYLPHVVLYHHESKSRGRDDDSSEKL 1226 >UniRef50_A0M6I2 TuaG-like glycosyl transferase n=2 Tax=Flavobacteriaceae RepID=A0M6I2_GRAFK Length = 261 Score = 214 bits (545), Expect = 3e-54, Method: Composition-based stats. Identities = 54/235 (22%), Positives = 100/235 (42%), Gaps = 3/235 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M++ L+S+ MP +N + +I+SV+RQ Y NWE++I+DD S+ + + D Sbjct: 1 MEDRGLVSVIMPAYNSEVFITESIQSVIRQTYPNWELLIIDDASSDTTKQIVQKFSSEDE 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N N+G RN+ I A G +I +D DD+W P +L L + A Sbjct: 61 RIKLLKNSTNAGTHHSRNKGIKAASGNFIAFLDADDQWKPQKLEKQLKVLSKPNVAACFS 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + + + Q L K N +GN + ++ ++ Q Sbjct: 121 SYELINEKGASLQKKIQAL---PVLDYDKLLKANYVGNLTGIYNVSILGKVYCPDIPKRQ 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 D+ ++L+++ E G + E+ + I + + ++ K F Sbjct: 178 DWALWLKVIEEGGPIEGIAESLAVYRIRKNSISTNKLEMLKYNFKVYHSVLKYGF 232 >UniRef50_Q1Q4P0 Similar to glycosyltransferase family 2 n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q4P0_9BACT Length = 295 Score = 214 bits (545), Expect = 3e-54, Method: Composition-based stats. Identities = 60/230 (26%), Positives = 100/230 (43%), Gaps = 9/230 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 PL+S+ MPT+N +I+SVL Q Y +E++++DD ST +L V + Sbjct: 3 NKKPLVSVIMPTYNCAVYLQESIESVLAQTYDAYEVVVIDDGSTDNTKL---VLKPYMEK 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I YI N G RN I A+GEY+ +D DD W P +L + + + +Y+ Sbjct: 60 IKYIDLGRNEGLPTARNLGIQSAKGEYVAFLDADDIWMPEKLEMSIDQFMKNPDAGMVYS 119 Query: 122 NDY-VCQGEVYSQPASLPLYPKSPYSRRLFYKRN-IIGNQVFTWAWRF-KECLFDTELKA 178 + + + P +LF+++N II + V F K LFD+ L Sbjct: 120 KHINIDSKSQFLEGKIRRRLPSGNIFTQLFFEQNFIICSSVVVRKEVFNKTALFDSGLVN 179 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY 228 QD+D++LR+ Y + ++ H ++ + Y Sbjct: 180 CQDWDMWLRIAFYY-KAIGIDVPLVKYR--HSAKSLSKNRNNVLKYQKVI 226 >UniRef50_Q8YSM1 Alr3063 protein n=7 Tax=Cyanobacteria RepID=Q8YSM1_ANASP Length = 330 Score = 213 bits (544), Expect = 4e-54, Method: Composition-based stats. Identities = 61/240 (25%), Positives = 104/240 (43%), Gaps = 14/240 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK+ LIS+ +P +N I+SV +Q +++WE+I+++D ST + + ++ D Sbjct: 1 MKDMALISVIIPAYNAVLTIKETIESVQKQTFTDWEIIVINDGSTDG--TPEIIQSIKDE 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ I N N G RN+ I+ A GE+I +D DD W ++L + L QQ Y Sbjct: 59 RLK-IFNYKNGGLPVARNRGILHASGEFIAFLDADDLWAVDKLEMQLKALQQHPEAGVAY 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--------GNQVFTWAWRFKECLF 172 + + C +V Q + P S YS +N++ N + F Sbjct: 118 S--WTCFMDVNEQGEPVAYLPSSQYSFTGNVYQNLLVSDFIHSGSNTLIRKEAINSVGEF 175 Query: 173 DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 D LK+ +D+D +LR+ + V E G M K + + ++ Sbjct: 176 DPMLKSCEDWDYWLRLATHWD-FIVVPEYQIFYRRTPGAMSSKVEVMKEACLIAMEKAYQ 234 >UniRef50_Q8YSM2 Alr3062 protein n=6 Tax=Nostocaceae RepID=Q8YSM2_ANASP Length = 321 Score = 213 bits (544), Expect = 4e-54, Method: Composition-based stats. Identities = 64/268 (23%), Positives = 115/268 (42%), Gaps = 19/268 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +P +N ++SVLRQ Y ++E+++V+D S+ + +++V+ + DPR+ Sbjct: 2 PKVSVVIPAYNAMPYLPETLESVLRQTYHDFEVVVVNDGSS--DNTEEWVSQILDPRLKL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN-D 123 I N G RN I+ A GEYI +D DD W P +L+ ++ + T +Y Sbjct: 60 ISQ-ANQGLAGARNTGIVNASGEYIAFLDADDIWEPTKLAKQVSVLDENPTVGLVYTWVA 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ---VFTWAWRFKECLFDTEL-KAA 179 Y+ + + + + + NI+ + K LFD L Sbjct: 119 YIDEQGKSTGKIFKNQVEGYVWPQ--LTEHNIVECGSVALVRRVCFEKMGLFDRNLGSYV 176 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH-FYRKHKDKFDRA 238 +D+D++LR+ Y V+EA + S+ K + H F + F A Sbjct: 177 EDWDMWLRIATSYDFKV-VKEALVYYR-----QRSNSASKNWEAMAHSFAIVIEKAFATA 230 Query: 239 SKKYQLF--TLYQIRNKRMTWRTLLTLL 264 S+ Q+ Y + W+ L + Sbjct: 231 SQDLQVLKNKSYGFTYLCLAWKPLQSFQ 258 >UniRef50_D2ETY9 Glycosyl transferase, family 2 protein n=1 Tax=Bacteroides sp. D20 RepID=D2ETY9_9BACE Length = 270 Score = 213 bits (543), Expect = 5e-54, Method: Composition-based stats. Identities = 73/268 (27%), Positives = 119/268 (44%), Gaps = 7/268 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N PL+S+ + T NR+ IRA+ SV+ Q Y N E+I+VDD S+ + + + Sbjct: 2 NEPLVSVIVTTHNREDNVIRAVNSVINQSYRNIEIIVVDDHSSDDTAKKIGQLIEKNDNL 61 Query: 63 TYIHND-INSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 Y N GACA RN I A G YI G+DDDDE+ P R+ +A V AF+ + Sbjct: 62 FYYALPDPNRGACAARNYGISQAHGMYIAGLDDDDEFVPERIEKMMAAMSDEV--AFVCS 119 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKAAQ 180 +C + + + +S N++GNQV + FD E+ Q Sbjct: 120 AGVICDEKNRKKIVRY-FFKDRCFSLYDLLWCNVVGNQVLVKKELILQANGFDVEMPCNQ 178 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+D++ R++ + + + QI+H++ IT + G + FY K+ + K Sbjct: 179 DWDMWTRLLKVKPKAIYLTDILQIIHVDKEAPSITKLANRKLGLYKFYCKNGENMTFWQK 238 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRN 268 F + R + + +R Sbjct: 239 --HFFRYREYRFGLRNFSFWMFFSCIRY 264 >UniRef50_B4AD21 Glycosyltransferase n=17 Tax=Enterococcus faecalis RepID=B4AD21_ENTFA Length = 252 Score = 213 bits (543), Expect = 5e-54, Method: Composition-based stats. Identities = 58/250 (23%), Positives = 103/250 (41%), Gaps = 6/250 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI MP +N + ++I+SVL Q Y NWE++++DD S D RI + Sbjct: 5 LVSIIMPMYNAGKFLSKSIESVLEQTYQNWELLLIDDGSKDDSIDIALAFMEKDSRIFLL 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N G RN+ I ++G+YI +D DD W PN+L V + ++ F ++ +V Sbjct: 65 KNEQNMGIAKTRNKGIEASKGQYIAFLDSDDLWLPNKLEVQIKWMEE-KKLLFTCSSYFV 123 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 C + + P + + K N IG + +LK +DY + Sbjct: 124 CNENGNITHER--NFSEGPQTYQDLLKTNTIGCLTVVVESNLLKRHLMPDLK-HEDYATW 180 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L ++ E + + E I ++ S + R++ ++F + L Sbjct: 181 LNILKEINTVYFINEKLAIYRKLTTSTS-SNKWNTISWVWKILRQN-EQFSVIKSSFYLM 238 Query: 246 TLYQIRNKRM 255 + Sbjct: 239 RFLFYTTFKY 248 >UniRef50_A7M6Z6 Glycosyl transferase family 2 n=1 Tax=Klebsiella pneumoniae RepID=A7M6Z6_KLEPN Length = 281 Score = 213 bits (543), Expect = 5e-54, Method: Composition-based stats. Identities = 83/259 (32%), Positives = 130/259 (50%), Gaps = 6/259 (2%) Query: 1 MKNN-PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M +N P +SIY+ T+NR + RAI SV QDY NWE++I DD ST L D Sbjct: 1 MNDNGPKVSIYISTFNRLEKLKRAIDSVFAQDYLNWELLICDDASTDGTFEFSSELELRD 60 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ Y N+ N GAC RN I A+G++ITG+DDDDE+TPNRLSVFL + ++FL Sbjct: 61 ERVRYFRNESNKGACETRNLGIFNAKGKFITGLDDDDEFTPNRLSVFLKNWDD--RYSFL 118 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKA 178 N + + Y+ + N NQ+FT R ++ FD +K Sbjct: 119 CCNFKNKYPTGKEEYYYKIKKERCIYNYKDMLFENEASNQIFTLTERLQQIHGFDKTVKR 178 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINH--GEMQITSSPKKFSGYFHFYRKHKDKFD 236 QD+D +LR+ YG+ + T I++ +H E++++ + K ++++ + Sbjct: 179 LQDWDTWLRLSFRYGDFVRYNVCTYIMNHDHQPSEIRVSQNEKIADSLLSLAKRNRGLYS 238 Query: 237 RASKKYQLFTLYQIRNKRM 255 +Y F + ++ Sbjct: 239 NDEYRYMEFIVNSMKGNMF 257 >UniRef50_Q2YCB6 Glycosyl transferase, family 2 n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2YCB6_NITMU Length = 313 Score = 213 bits (542), Expect = 6e-54, Method: Composition-based stats. Identities = 61/277 (22%), Positives = 114/277 (41%), Gaps = 20/277 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +SI +PT+N + + SVL Q +++ E+I+VDD ST ++ ++ VT+ P + Sbjct: 4 SKPTVSIIIPTYNCEAYIAETLDSVLSQTFTDLELIVVDDGST--DRTREIVTSYGTP-V 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + NSG CA RN I A G Y+ +D DD W P +L++ L +Y+ Sbjct: 61 RLLSQS-NSGVCAARNYGIREAAGSYVCLMDHDDYWFPEKLALQLEQMFSHPEVGLVYST 119 Query: 123 D--YVCQGE----------VYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC 170 + GE + P + L I+ + A F +C Sbjct: 120 FVWWHPDGEGVFPKPESFNIEKMPGGIDEEFSGWIYHLLLLDCWILTSAALIRAEVFDKC 179 Query: 171 L-FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 +D L ++D+D++LR+ EY K+ + + + + ++ R Sbjct: 180 GAYDESLPYSEDWDLWLRISREY-PVIKLNKGLTLYRQHPKQG--NRLTREIDYRTVLLR 236 Query: 230 KHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSV 266 +K+ S+ + + Q ++ L + Sbjct: 237 DAAEKWGLCSRDGRCISRQQFMQNIARYQMEFGLKHL 273 >UniRef50_Q9K6L6 Glycosyltransferase n=1 Tax=Bacillus halodurans RepID=Q9K6L6_BACHD Length = 303 Score = 213 bits (542), Expect = 7e-54, Method: Composition-based stats. Identities = 67/250 (26%), Positives = 109/250 (43%), Gaps = 6/250 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +SI +PT NR +L RA++S L Q Y N E+I+V D ST + D R+ +I Sbjct: 2 KVSIIIPTHNRAKLLKRALESTLNQTYKNIEVIVVSDGSTDNTDIVMDEYK-RDSRVNFI 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 G RN I A+GE+I +DDDDEW P++L + + Q +Y + Sbjct: 61 SYHPAKGGNYARNTGIKNAKGEFIAFLDDDDEWMPDKLELQIKEFNQNANVGLVYTGVEI 120 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRFKEC-LFDTELKAAQDY 182 L K+ + N IG + V EC +FD +LKA QDY Sbjct: 121 IYNFNKRNIKYYSLPKKTGNLSKEILVANCIGTTSSVMVRKNLITECGMFDEKLKARQDY 180 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY-RKHKDKFDRASKK 241 D+++R+ + V + + QI+ KK+ + K+ D + + S++ Sbjct: 181 DLWIRVCQK-TLVGVVNKPLVRYYNYTTNKQISDDIKKYESAIEYIDNKYVDLYSKVSEE 239 Query: 242 YQLFTLYQIR 251 + + + Sbjct: 240 IRRKHRHSMT 249 >UniRef50_Q8PZ42 Glycosyltransferase n=1 Tax=Methanosarcina mazei RepID=Q8PZ42_METMA Length = 307 Score = 212 bits (541), Expect = 8e-54, Method: Composition-based stats. Identities = 70/253 (27%), Positives = 112/253 (44%), Gaps = 8/253 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + P +S+ +PT+NR L RAIKSVL Q Y + E+IIVDD ST + ++ V D Sbjct: 2 LDQKPKVSVIIPTYNRAHLIPRAIKSVLNQTYLDIEIIIVDDSST--DNTEEIVKDFKDR 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFL-AHKQQLVTHAFL 119 R+ YI ++IN GA A RN I ++G+YI D DDEW N+L + A + Sbjct: 60 RLKYIRHNINKGASAARNTGIRESRGKYIAFQDSDDEWFSNKLEKQIEAFADAPPEVGIV 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECL-FDTELK 177 Y+ Y + S L K K N +G V F+ F+ L Sbjct: 120 YSGFYRIEANRKLYLPSDQLSLKEGNIHNELLKGNFVGTPTVLMKKECFRNQRYFNESLP 179 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 A +D+++++ + Y + +++ + + + + + H D F + Sbjct: 180 ALEDWELWIELSKYYTFRY-IKKPLLCSYSTPNSINLNEN-NMLKAHEIILLTHLDDFSK 237 Query: 238 ASKKYQLFTLYQI 250 +KK Y I Sbjct: 238 -NKKNLSEHYYDI 249 >UniRef50_B5WL01 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B5WL01_9BURK Length = 731 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. Identities = 68/285 (23%), Positives = 112/285 (39%), Gaps = 19/285 (6%) Query: 3 NNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--ND 59 PLISI MPT+N + AI SV+ Q Y NWE I DDCST+ E +++ + + D Sbjct: 189 QQPLISIVMPTYNTPAKWLRMAIDSVIDQVYENWEFCIADDCSTNPE-VKEVLDSYVAKD 247 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI + N N A+ LA GE++ +D DDE P L + Sbjct: 248 PRIKVAYRTTNGHISNSSNTALELAVGEFVGLLDHDDELHPLALYCVAELINAHPDATVI 307 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKA 178 Y++ + ++ Y K ++ LF +N+I + KE F T + Sbjct: 308 YSD----EDKISIDGERSDPYFKCDFNYDLFLSQNMISHFGVYRTSIMKEVGGFRTGFEG 363 Query: 179 AQDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGY-------FHFYR 229 +QDYD+ LR++ G + V A + K ++ H R Sbjct: 364 SQDYDLALRVIDRAGHHTVYHVPRALYHWRMIPESTAAGHEAKPYAHIAAMRALDEHLAR 423 Query: 230 KHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLAD 274 + + + F + ++ ++ R+ L + Sbjct: 424 NNINAHTEHAPGTDAFNKVVYDLPAVLP-SVEIIIPTRDSAGLVE 467 Score = 84.9 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 47/241 (19%), Positives = 91/241 (37%), Gaps = 21/241 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P + I +PT + L + ++SV R+ Y N+ + I+D+ S E + + D RI Sbjct: 451 PSVEIIIPTRDSAGLVEQCVESVRRKSSYPNFRITIIDNGSVKQETHELFARLQEDERIK 510 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQLVTHA 117 + +D A+ N+ + + +++ +++D D W +SV L V Sbjct: 511 VVRDDSPFNYSALNNRVALASTADFVCLMNNDIEVINAD-WLEEMVSVALQVNVGAVGAK 569 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPY---SRRLFYK---RNIIG-----NQVFTWAWR 166 LY +D + G V + + + F + RN++ + + Sbjct: 570 LLYPDDTIQHGGVVLGVGGIASHAHKHFPNTHPGYFARARLRNVMCAVTAACLLIRQSIY 629 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 + D +L A D D LR+ A ++ H +P+K S + Sbjct: 630 KEVGGLDEKLHVAYNDIDFCLRVRQAGYRNVWTPYA-ELYHHESASRGAEDTPEKISRFN 688 Query: 226 H 226 Sbjct: 689 Q 689 >UniRef50_B8D172 Glycosyl transferase family 2 n=4 Tax=Bacteria RepID=B8D172_HALOH Length = 250 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 58/252 (23%), Positives = 105/252 (41%), Gaps = 10/252 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS--WEQLQQYVTALNDPRIT 63 L+S+ P +N +Q + IKSVL Q Y+NWEMI+VDDCST + +++Y+ + RI Sbjct: 5 LVSVITPLYNSEQFIEKTIKSVLNQTYANWEMIVVDDCSTDSGPDIVKEYLK--KNSRIK 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I NSGA RN+ I +++G YI +D DD W ++L + + A Sbjct: 63 LIKLKKNSGAAVARNKGIKISKGRYIAFLDSDDLWHKDKLKKQVEFMKNNNVVLSYTAYR 122 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYD 183 + + P + K N IG + ++ ++ + QDY Sbjct: 123 KIDEFGNLRGIIRPP----KKINYNQLLKTNSIGCLTAMYDTKYVGKVYMPVIDRRQDYA 178 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++L+++ + + V E N + ++ + YR+ ++ Y Sbjct: 179 LWLKILKKGITAYGVNEVLAYYRTNKNSLS-SNKVVSAKYQWKIYRQIEN-LSIVKSLYY 236 Query: 244 LFTLYQIRNKRM 255 + + Sbjct: 237 FINYFINGLSKY 248 >UniRef50_B4VMI0 Glycosyl transferase, group 2 family protein n=2 Tax=Oscillatoriales RepID=B4VMI0_9CYAN Length = 325 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 59/212 (27%), Positives = 104/212 (49%), Gaps = 6/212 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 LIS+ +P +N ++ + I+SV+ Q +SN+E+I++DD S + V +++D R+ Sbjct: 3 LISVVIPVYNGEKTIRQTIESVINQTFSNFELIVIDDGS--QDSTLDIVRSISDSRLKVF 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-NDY 124 N+G RN+ I LA G+YI+ ID DD WT N+L L A ++ D Sbjct: 61 SFP-NAGLATSRNRGIQLASGDYISFIDADDLWTGNKLEAQLKALLDNPQAAVAFSWTDC 119 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKAAQDYD 183 + + +S+ + + Y++ L G+ + + FK+ F+ L AA+D+D Sbjct: 120 IDESGQFSRRGNYTTVSGNVYAQLLLTDFIENGSNLLIRSEAFKKVGYFNESLPAAEDWD 179 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 ++LR+ V Y E V ++ M Sbjct: 180 MWLRLAVNY-EFAVVPSPQVFYRLSTTSMSAN 210 >UniRef50_B7R2A7 Glycosyltransferase n=1 Tax=Thermococcus sp. AM4 RepID=B7R2A7_9EURY Length = 307 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 68/267 (25%), Positives = 114/267 (42%), Gaps = 6/267 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P ISI +PT+NR + R I+SVL Q + ++E+II+DD ST + + D RI Sbjct: 3 RPKISIILPTYNRGYIIGRTIRSVLDQTFKDFELIIIDDGSTDNTKNIVEIYKEKDNRIR 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 YI + N GA A RN I ++GEYI ID D W P +L + + + +Y+ Sbjct: 63 YIRHKRNLGANAARNTGIKKSRGEYIAFIDSDTIWKPQKLEIQIKIADNTKENFVIYSRT 122 Query: 124 Y-VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-NQVFTWAWRFK-ECLFDTELKAAQ 180 + P + K N I + + K LFD +L Q Sbjct: 123 IRKYTKKTVIVPEDVNQGKKEGQLLDKILVLNFIDMSSILVKKNMIKNSGLFDEKLPRLQ 182 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+DI +R+ +Y + + + I ++ + + K H +K+ F + K Sbjct: 183 DWDILIRLAEKYNFVY-IPQTLVISYVLSDSISV-DYCKLLKAEMHLLKKYYPLF-KQDK 239 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVR 267 + Y+I + + ++ +R Sbjct: 240 RAIAQRYYEIVLSSLKCKRMIQTNHLR 266 >UniRef50_A2VSE1 Glycosyl transferase n=1 Tax=Burkholderia cenocepacia PC184 RepID=A2VSE1_9BURK Length = 654 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 69/281 (24%), Positives = 108/281 (38%), Gaps = 14/281 (4%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALND 59 +N PLIS+ MP +N + AI+S+ Q Y +WE+ I DD ST A D Sbjct: 118 QNRPLISVVMPVYNPNPVWLVEAIESIRSQLYPHWELCIADDVSTDPAIRPLLERYASED 177 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI + N A N A+ L G ++ D DD + L + + Sbjct: 178 PRIKVAFREKNGHISAASNTALALVTGTWVALFDHDDLLPEHALYCVADAVDRNSAIRLI 237 Query: 120 YAN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 Y++ D + + V +P Y K ++ LF +N+ + VF A + F + Sbjct: 238 YSDEDKIDESGVRREP-----YFKCDWNADLFLSQNMFSHLGVFQKALLDEVGGFREGYE 292 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHIN-HGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QDYD+ LR + G + H H E + + K R D F Sbjct: 293 GSQDYDLALRCIERAGTAAIHHIPRVLYHWRVHAESTASGTDAKPYAVVAGERALGDHFV 352 Query: 237 RASKKYQLFT---LYQIRNKRMTWRTLLTL-LSVRNGKRLA 273 R + + Y+ L++L + RN L Sbjct: 353 RMGIRGSIEYAGNGYRAHYALPDPVPLVSLIIPARNECVLI 393 Score = 97.6 bits (242), Expect = 4e-19, Method: Composition-based stats. Identities = 51/258 (19%), Positives = 90/258 (34%), Gaps = 22/258 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+S+ +P N L R I+S++ + Y+N+E++I+D+ S L + D RI Sbjct: 378 PLVSLIIPARNECVLIQRCIESIVGKTRYANYEILILDNGSDDLATLDYLASLQADGRIR 437 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDD-DDEWTPNRLSVFLAHKQQ----LVTHAF 118 + N+ A N + A GE + +D + +P L+ + Q V Sbjct: 438 VLRNERPLSFSASCNAGVEAANGEIVGLLDSGAEVISPQWLAEMVGVALQPGVGAVGAKL 497 Query: 119 LYANDYVCQGEVYSQPASL--PLYPKSPY-SRRLFYKRNIIG--------NQVFTWAWRF 167 LY N V + L ++ P S + ++I + + Sbjct: 498 LYPNGAVQHAGIVLGLGGLAGHVHRNVPRASYGYGGRASLISAFSAVSAACMIVRKSVYR 557 Query: 168 KECLFDTELKAAQ--DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 F+ + A D D LR+ A ++ H + P+ Sbjct: 558 DVAGFNEKDLGAAFGDIDFCLRLRDAGYRNVWTPYA-ELYR--HEPQTRSDEPQPVEPAV 614 Query: 226 HFYRKHKDKFDRASKKYQ 243 + R RA Y Sbjct: 615 RYMRSRWGALLRADPFYS 632 >UniRef50_Q2NH10 Predicted glycosyltransferase n=3 Tax=cellular organisms RepID=Q2NH10_METST Length = 776 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 65/242 (26%), Positives = 114/242 (47%), Gaps = 9/242 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+SI MP +NR+ L AIKSVL Q Y N+E+II+DD S+ ++ + + + D R Sbjct: 261 EEKPLVSIIMPVYNREGLITNAIKSVLNQTYKNFELIIIDDGSS--DKTCENILKIKDKR 318 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I N N G A RN+ + +++G+YI +D D++W +S + + + LY Sbjct: 319 IKLIRNKENLGISASRNKGLNISKGKYIMYLDSDNDWDNRYISAMVGAFRVNKDASALYC 378 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVF--TWAWRFKECLFDTELKAA 179 YV + +L + ++ L RN I + T K FD L Sbjct: 379 GQYVFE----ESKTNLKYVRFASLNKGLLSNRNYIDLNAYAHTRDVYLKYGGFDESLVKC 434 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 D+D+ L+ + + + V + + + ITS+P+ + K + + ++ Sbjct: 435 VDWDLILKYST-FAKTYSVPVILSNYYFDLADNTITSNPELSYTIKQVHEKQEKRLEKIG 493 Query: 240 KK 241 K+ Sbjct: 494 KR 495 Score = 59.5 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 60/168 (35%), Gaps = 4/168 (2%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPRITYI 65 +S+ +P++ ++S+ + D E+++VD+ S+ ++ Y+ L + +I I Sbjct: 503 VSVIIPSYEALDDLKICLESLFKLDSKYVEVVVVDNNSSY--DVKNYLRKLKLENKIKLI 560 Query: 66 HNDINSGACAVRNQAIMLAQGEY-ITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 N N G NQAI +A+ + I +++D +T + L Sbjct: 561 LNHENYGFTYAVNQAIEIAKKDNDIVLLNNDAVFTEGAMEYLQEAAYTLENCGITVPQQV 620 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLF 172 + P+ L + N + E F Sbjct: 621 LPANTKTIDVHVPYANPQYMCDVNLSAHHKNVINLPLYHNGIYTEVSF 668 >UniRef50_B2J2I1 Glycosyl transferase, family 2 n=7 Tax=Cyanobacteria RepID=B2J2I1_NOSP7 Length = 358 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 58/236 (24%), Positives = 106/236 (44%), Gaps = 10/236 (4%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N LIS+ +P +N ++ + I+SVL Q + N+E+I+++D S + + + + D RI Sbjct: 7 NKTLISVIIPAYNSEKTIKKTIESVLNQTFHNFELIVINDGS--QDSTLEVIRKIPDSRI 64 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA- 121 N+G RN+ + A G++++ +D DD WTP++L L Q+ VT Y+ Sbjct: 65 QVFSYP-NAGGNVSRNRGLNHAVGKFVSFLDADDVWTPDKLQHQLKALQKNVTAKVAYSW 123 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRFKECLFDTELKA 178 DY+ + + + Y L N + N + FD L A Sbjct: 124 TDYIDEKGEFILSGKRVTANGNVYESLLL--NNFLENGSNPLICRKALITLGGFDESLGA 181 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 AQD+D++LR+ ++ V + I + ++ S R +K++ Sbjct: 182 AQDWDMWLRLAYKF-NFICVPSVQILYRITPNSVSCNLVRQEKSCLQVLERAYKER 236 >UniRef50_B5W037 Glycosyl transferase family 2 n=2 Tax=Arthrospira RepID=B5W037_SPIMA Length = 330 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 59/230 (25%), Positives = 108/230 (46%), Gaps = 11/230 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P ISI +P +N + I SV+ Q + ++E+II++D ST ++ +++++DPR+ Sbjct: 2 PQISIIIPAYNADKTIKETIDSVINQTFKDFEIIIINDGST--DRTLDIISSIDDPRVQV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-ND 123 NSGA RN+ A +YI +D DD WT ++L + A + + Y+ + Sbjct: 60 FSYP-NSGASVSRNRGFSKASAQYIAFLDADDLWTSDKLELQYAALEADPEASVAYSWTN 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRFKECLFDTELKAAQ 180 Y+ Q + P Y YS + +N + N + T FD LK Q Sbjct: 119 YIDQYGEFLYPGFHQSYQGDVYS--ILLVKNFLENGSNPLITRHALSAIGGFDESLKGGQ 176 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 D+D++LR+ +Y + + +A + I+ + ++ ++ S K Sbjct: 177 DWDLYLRLAAQY-KFVCISQAQILYRISEHSIS-SNIERQESSCLLVLNK 224 >UniRef50_O58167 Putative uncharacterized protein PH0430 n=1 Tax=Pyrococcus horikoshii RepID=O58167_PYRHO Length = 334 Score = 210 bits (536), Expect = 3e-53, Method: Composition-based stats. Identities = 65/258 (25%), Positives = 107/258 (41%), Gaps = 14/258 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ +PT+NR ++ RAI+SVL Q Y+N E+IIVDD S + D RI Y+ Sbjct: 5 LVSVVLPTYNRAKVLPRAIESVLNQTYTNIELIIVDDGSRDNTKEIIREFQSQDERIVYL 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N N GA A RN IM + GE+I +D DD W P +L + + + + ++ Sbjct: 65 RNKRNLGANAARNIGIMHSTGEFIAFMDSDDMWLPWKLERQIKIMYSSLNSYPIVYSGFI 124 Query: 126 CQGEVYSQPASLPLYPK---------SPYSRRLFYKRNIIGNQVFTWAWRF---KECLFD 173 + L YP+ + L N I + + LFD Sbjct: 125 RIYKTDKNKVILSYYPQKSPPVSADSTSIHLNLLTNANFISTPTVLIPALYIRQNKLLFD 184 Query: 174 TELKAAQDYDIFLRMVVEYGEPWK-VEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 ++ QD++ FLR+ ++ + E I ++ + + + K Y K+ Sbjct: 185 EKMPRLQDWEFFLRLSKHCNCNFRFIPEPLVIAYVQPDSISM-NHKKFIKALKILYTKYY 243 Query: 233 DKFDRASKKYQLFTLYQI 250 K + + L Sbjct: 244 SKVNNQEALSTFYFLMGY 261 >UniRef50_Q0ACF0 Glycosyl transferase, family 2 n=4 Tax=Ectothiorhodospiraceae RepID=Q0ACF0_ALHEH Length = 597 Score = 210 bits (535), Expect = 4e-53, Method: Composition-based stats. Identities = 65/235 (27%), Positives = 102/235 (43%), Gaps = 27/235 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ MP +N AI SVL QDY + E+I++DD S+ + V A D R+ Sbjct: 281 PLVSVIMPAFNAASYIEEAIDSVLAQDYPHKELIVIDDGSSDDTVAR--VQAYGD-RVRL 337 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G+ RNQ + AQGEYI +D DD W P +L+ + + + +Y++ Sbjct: 338 LTQ-ANQGSAVARNQGLDAAQGEYIAFLDSDDVWLPGKLTAQVGYLEAHPDVGMIYSDWL 396 Query: 125 VCQGEVYSQ------------------PASLPLYPKSP--YSRRLFYKRNIIGNQVFTWA 164 + + S+ P +PL + RL + + V Sbjct: 397 PWKRDKQSKAFPPPEALAPATPDTGVPPEEIPLLTEGSGWLYNRLLFGSLLHTITVMARR 456 Query: 165 WRFK-ECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 + FD ELK QDYD +LR + E +++ + + HG IT P Sbjct: 457 ELIEQVGRFDPELKRGQDYDYWLR-ASRHTEIHQLDRVFALYRL-HGSGCITQWP 509 >UniRef50_Q6QW83 Putative glycosyl transferase n=1 Tax=Azospirillum brasilense RepID=Q6QW83_AZOBR Length = 296 Score = 209 bits (532), Expect = 9e-53, Method: Composition-based stats. Identities = 55/215 (25%), Positives = 90/215 (41%), Gaps = 6/215 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+S+ +P +NR RAI+SV QD S+WE+++VDD ST + D Sbjct: 8 MEE-PLVSVVIPAFNRAHTLRRAIESVRMQDVSSWELLVVDDGSTDGSV--DIPESFGDE 64 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I + +N GA A RN + A+G YI +D DDEW P +L + + Sbjct: 65 RIRLIRHAVNRGAAAARNTGVAAARGCYIAFLDSDDEWLPGKLRAQIDALESGPDAPQAL 124 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII-GNQVFTWAWRFKECL-FDTELKA 178 ++ E + P+ + L + G+ + F D L Sbjct: 125 CTGFLLHREETGRRTKRWPKPEGTWFETLLDGCYVSPGSTLMVRRDCFAAVGPLDETLPR 184 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 +D+D LR + + + ++H+ Sbjct: 185 LEDWDWLLRCMERFAFD-CLPVLGAVVHVGRPARS 218 >UniRef50_C3WFL5 Glycosyl transferase n=4 Tax=Bacteria RepID=C3WFL5_FUSMR Length = 257 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. Identities = 66/234 (28%), Positives = 111/234 (47%), Gaps = 5/234 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ N ISI P +N ++ I SV++Q Y+NWEM+IVDDCS+ D Sbjct: 1 MEEN-KISIITPLYNGEKYIEETILSVIKQTYNNWEMLIVDDCSSDNSPNIVKKYVEQDK 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI +I + NSGA RN+AI +A+GEYI +D DD W +L + ++ + Sbjct: 60 RIKHIRLEKNSGAAIARNKAIEIAEGEYIAFLDSDDLWKKEKLEKQINFMKENNYA--IS 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 +Y E + L PK P + R + N IG ++ + ++ L+ Q Sbjct: 118 FTEYEEIDEDGRKLNILIRVPKKPVTYRSYLLTNPIGCLTGMYSVKKLGKVYMPNLRKRQ 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 D +L+++ + + + ++E I IN + + F +++ YRK + Sbjct: 178 DTGFWLKIL-KLDKAYSLKENLAIYRINSSSLSFKKTD-LFKYHWYLYRKIEKL 229 >UniRef50_C7QLJ5 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=C7QLJ5_CYAP0 Length = 1152 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. Identities = 64/287 (22%), Positives = 112/287 (39%), Gaps = 21/287 (7%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTA 56 ++ PLISI MP +N + AI SVL Q Y NW++ I DD ST+ E L Y A Sbjct: 616 LEYQPLISIIMPVFNPKIAYLKTAINSVLNQVYQNWQLCIADDASTNPQVYEILADY--A 673 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D RI + N N A +A GE+I +D DD TP+ L ++ Sbjct: 674 AEDTRIKVVFRQENGHIAEASNSAFEIAMGEFIALLDHDDVLTPHALYHVVSMLNDHADA 733 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTE 175 +Y++ + ++ Q + K + F + + V+ + + F Sbjct: 734 DMIYSD----EDKIDEQGYLSDPFFKPDWCPDSFLSKMYTCHLGVYRRSLVEQIGAFRVG 789 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 + +QDYD+ LR+ + + + + I+ K + + + Sbjct: 790 YEGSQDYDLVLRLTEKTDKIFHIPNVLYHWRIHAQSTSTNIDSKNY-AVITAKKALSEAI 848 Query: 236 DRASKKYQLFTL------YQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 +R + + + Y IR + T + ++ K L D + Sbjct: 849 ERRGEPGTVTDVPYCLGNYHIRYELKTDDLVSIIIPT---KDLGDTL 892 Score = 100 bits (250), Expect = 4e-20, Method: Composition-based stats. Identities = 51/249 (20%), Positives = 93/249 (37%), Gaps = 27/249 (10%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCST---SWEQLQQYVTALNDPR 61 L+SI +PT + + +KS+ Q Y N+E+I++D+ ST S E ++Q+ + Sbjct: 878 LVSIIIPTKDLGDTLNQCLKSIFEQSTYPNFEIILIDNGSTEERSLEVMKQWQEKEP-EK 936 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ-------- 112 + I + N A+ +QG+Y+ +++D + TP+ + + Q+ Sbjct: 937 LKVFPLKIPFNYSQINNFAVQHSQGKYLLFLNNDIEVITPDWIEALVEQAQRPSIGAVGA 996 Query: 113 -LVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-------QVFTWA 164 L+ + V G YS S +P S F + N I N + Sbjct: 997 LLLFPDDTIQHAGVIGGIFYSCGHSHKRFP--FRSPGYFNQLNTITNYSAVTGACLMCRR 1054 Query: 165 WRFKE-CLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 F+E FD L D D+ +M+ + + + H S Sbjct: 1055 DVFEEIGGFDETLAVNYNDIDLCFKMINKGYRNIYLPHV-VLYHYESKSRGYDSLNNFKK 1113 Query: 223 GYFHFYRKH 231 K+ Sbjct: 1114 ARLFCEGKY 1122 >UniRef50_C3RR63 Glycosyl transferase n=2 Tax=Bacteria RepID=C3RR63_9MOLU Length = 660 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. Identities = 64/242 (26%), Positives = 103/242 (42%), Gaps = 9/242 (3%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALND 59 P ISI MP +N + I+S+L+Q Y N+E+ I DDCST+ + ++ D Sbjct: 149 NYQPKISIVMPVYNVPGKYLSYCIESILKQTYQNFEICIADDCSTNIDTIETLKAYKNKD 208 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI I + N N A+ LA GE+I +D+DD P+ L+ + + F+ Sbjct: 209 NRIKVIFREENGHISRATNSALSLASGEFIGLMDNDDTLDPHALNEVVNILNEDKNIDFI 268 Query: 120 YAN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 Y + D + G S P + K +S Y N I + V K F + Sbjct: 269 YTDEDKIDMGGKRSDP-----HFKPDFSLDTLYGGNYICHFSVIRKNIIEKIGGFRVGYE 323 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 AQD+D+FLR+ E + + + + + G + K R +D F + Sbjct: 324 GAQDFDLFLRVSNETDKIYHIPKILYHWRMIPGSTAVGGDGAKNYAGEAGKRALEDYFKQ 383 Query: 238 AS 239 + Sbjct: 384 KA 385 >UniRef50_B7KAG8 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KAG8_CYAP7 Length = 344 Score = 208 bits (531), Expect = 1e-52, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 121/282 (42%), Gaps = 28/282 (9%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 S+ +P +N ++ + I+SVL Q Y N+E++IV+D S ++ + D R+ I Sbjct: 3 KFSVIIPAYNVEKYIAKTIQSVLEQTYQNFELLIVNDGS--LDRTLEICQQFTDSRLKII 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-NDY 124 N+G RN I +QGEYI +D DD W P +L + H ++ T ++ + + Sbjct: 61 SQ-KNTGLSGARNTGIRHSQGEYIAFLDGDDLWLPEKLQKHIQHLEKSPTVGVSFSRSAF 119 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ--VFTWAWRFKE------------- 169 + + +P+ + + N +GN + + Sbjct: 120 IDERGNLLNSYQMPILKN--ITPEILLHGNPVGNGSALVVRREVLEAIKFQNQLSDFQEY 177 Query: 170 CLFDTELKAAQDYDIFLRMVVEYG-EPWKVEEATQILHINHG------EMQITSSPKKFS 222 C FD + + A+D ++ LR+ +E + + EA + +N + Q+ S K S Sbjct: 178 CYFDAQFRRAEDIELLLRIALETSWQIEGIPEALTLYRVNSQGLSANWQEQLHSWEKVIS 237 Query: 223 GYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLL 264 + + +++++++ YQL L + ++ + L Sbjct: 238 KTKSYAPELVTRWEKSARAYQLQILARSTIRQKAGSVAVKLF 279 >UniRef50_A1W3G7 Glycosyl transferase, family 2 n=1 Tax=Acidovorax sp. JS42 RepID=A1W3G7_ACISJ Length = 983 Score = 208 bits (531), Expect = 1e-52, Method: Composition-based stats. Identities = 63/286 (22%), Positives = 110/286 (38%), Gaps = 17/286 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + PLISI MP + A+ SV Q Y +WE+ I DD S+ + +Q + L D Sbjct: 445 LPRRPLISIVMPVHDTPPHFLQAAVDSVQGQWYGHWELCICDDASSRAD-TRQRLQTLQD 503 Query: 60 --PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 +I + N A+ LAQGE++ +D DD P L Sbjct: 504 QSDKIKVTRRETAGHIVHATNDALALAQGEFVVFLDHDDVLAPQALLRLAQAINAPSAPD 563 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTEL 176 F+Y++ + ++ Q K +S L + +N +G+ AW + Sbjct: 564 FIYSD----EDKLDEQGRRCLPLFKPQWSPTLQWAQNYVGHIMCVRRAWLERLGGLLEGS 619 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY-------FHFYR 229 + +QD+D+ LR+ + + + E I+ + K ++ H Sbjct: 620 QGSQDHDLVLRLAAQGAKIEHIPEVLYHWRIHAASTSASPQSKPYAHLAGRQAVARHLAS 679 Query: 230 KHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 ++ ++FDR F YQ R + ++ R+ L D Sbjct: 680 RYGEQFDRVDDGDHAFV-YQPRFRVPAGTVASIIIPTRDKADLLDA 724 Score = 168 bits (427), Expect = 1e-40, Method: Composition-based stats. Identities = 56/258 (21%), Positives = 105/258 (40%), Gaps = 24/258 (9%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS--WEQLQQYVTALNDPRIT 63 L+++ +P +N + + A+ SV Q ++ WE+I++DD ST W LQ Y DPRI Sbjct: 14 LVTVILPAYNHAKYVVEAMDSVRAQTFTRWELIVIDDGSTDDTWAVLQAYAAQHGDPRIR 73 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + N+G+ A N+ + +A+ Y+ ++ DD + P RL + Q F+ Sbjct: 74 LLTQ-ANAGSHATLNRGLAMARTPYLAILNSDDRYAPERLQRLVDTAQAAADEVFIVTGL 132 Query: 124 YVCQGEVYSQPASLPLYPK----------------SPYSRRLFYKRNII--GNQVFTWAW 165 + G+ P S +P + L + + N + Sbjct: 133 RLIDGDGNFFPESYWWNAMYGDILQRWRAAQSSGVNPALQTLLWGNFTVSTSNFFMSRTL 192 Query: 166 RFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQIL-HINHGEMQITSSP-KKFSG 223 + F L+ D+D LR+ +E ++ +L + HG I + + Sbjct: 193 WQRLGPF-KRLRYVPDWDYALRVAMEQPRAFQFLHDQALLEYRLHGRNTILGGALRNHAE 251 Query: 224 YFHFYRKHKDKFDRASKK 241 +H R ++ K+ A + Sbjct: 252 AYHVLRTYQKKWAAAGQA 269 Score = 73.4 bits (179), Expect = 9e-12, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 73/199 (36%), Gaps = 19/199 (9%) Query: 8 SIYMPTWNRQQLAIRAIKSV-LRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 SI +PT ++ L I+S+ +E++++D+ ST E + D R+ I Sbjct: 710 SIIIPTRDKADLLDACIQSIHRHTTGIAYEILVLDNGSTEPETQACFQRLTQDARVRVIA 769 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQ----QLVTHAFLYA 121 DI + N A+G+ + +++D + TP+ L + + V LY Sbjct: 770 ADIPFNWSRLNNIGRSHARGQVLVFLNNDTEIITPDWLVRLVEYALLPDVATVGALLLYP 829 Query: 122 NDYVCQGEVYSQPASL--PLYPKSPYSRR------LFYKRNIIGN---QVFTWAWRFKE- 169 + + V ++ P RN++ N V RF Sbjct: 830 DRTIQHAGVVVGMGGWADHVFKGEPVQHYPTPFVSSVVPRNVLANTGACVAVATARFDAL 889 Query: 170 CLFDTELKAAQ-DYDIFLR 187 FD + D ++ +R Sbjct: 890 GGFDEAFEICGSDVELGIR 908 >UniRef50_C6BYR5 Glycosyl transferase family 2 n=5 Tax=Desulfovibrionales RepID=C6BYR5_DESAD Length = 292 Score = 208 bits (531), Expect = 1e-52, Method: Composition-based stats. Identities = 56/229 (24%), Positives = 102/229 (44%), Gaps = 11/229 (4%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N P +S+ +PT+NR RA++SVL Q ++++E ++VDD ST + + + +D R Sbjct: 5 NNEPKVSVIIPTYNRADRVCRAVESVLAQKFTDFECLVVDDGST--DDTAERLAEFDDSR 62 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + + + N G A RN I A G++I +D DDEW P +L + ++ Sbjct: 63 LKILRQE-NKGVSAARNFGIAAAIGDFIALLDSDDEWVPEKLLKQIPFMEE-AGFEISQT 120 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQV-FTWAWRFKECLFDTELKAAQ 180 ++ + P+ + R + + V F+ + + FD ++ A + Sbjct: 121 DEIWIRKGKRVNQCKKHEKPEGMFFDRSLEMCMVSPSCVIFSRKFWDEIGPFDEDMPACE 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 DYD+++R ++Y + E I H + S G YR Sbjct: 181 DYDLWIRAGLKY-PVGLLRERLTIKHGGRPDQLSNS-----VGCLDLYR 223 >UniRef50_Q21E22 B-glycosyltransferase-like protein n=2 Tax=Alteromonadales RepID=Q21E22_SACD2 Length = 341 Score = 207 bits (528), Expect = 3e-52, Method: Composition-based stats. Identities = 66/282 (23%), Positives = 104/282 (36%), Gaps = 30/282 (10%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N P+ S+ MP +N + AI+SVL Q Y +E+I VDD T + V+ DPRI Sbjct: 6 NKPIFSVVMPMYNVAKYVATAIESVLAQTYGQFELICVDDGCT--DNTLDIVSTYKDPRI 63 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 I N G A RN I A+G Y+ +D DD W P +L + L+H + Y+ Sbjct: 64 RVIRQ-KNMGLSAARNTGISHARGLYVALLDSDDYWAPTKLQMHLSHFRNNPKLGVSYSA 122 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR---------------F 167 E K+ +++ +F RN IGN + Sbjct: 123 SQFIDEEGSLLGIGQYPKLKNIHAQDIFC-RNPIGNGSAAVLRKSMLLQMGKHVYNNGEV 181 Query: 168 KECLFDTELKAAQDYDIFLRMVVEYGEPW-KVEEATQILHINHGEMQIT----------S 216 + FD L+ ++D + +LR + + + E +N + S Sbjct: 182 RLTYFDERLRQSEDVEFWLRAALTSDWQFMGIGEGLTFYRVNASGLSANLTNQYQAWRVS 241 Query: 217 SPKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWR 258 + F+ K+Y Q RN + Sbjct: 242 VKNNYHLNPAFFDTWYSMASAYQKRYLARRAVQSRNSFAALK 283 >UniRef50_UPI0001AF4953 glycosyl transferase, group 2 family protein n=2 Tax=Proteobacteria RepID=UPI0001AF4953 Length = 1609 Score = 207 bits (528), Expect = 3e-52, Method: Composition-based stats. Identities = 64/281 (22%), Positives = 119/281 (42%), Gaps = 17/281 (6%) Query: 3 NNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--D 59 P+ISI MP +N L A++SV Q Y WE+ + DD ST E + +Y+ +L+ D Sbjct: 1071 EGPVISIIMPVYNPPLDLLREAVESVRAQLYPQWELCLADDASTYPEVI-EYLKSLSALD 1129 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI + + N A N A+ +A GE++ +D+DD + L ++ + Sbjct: 1130 DRIKVVFREDNGHISAASNSALEIATGEFVALMDNDDLLPRHALYWIAKTIRENPDAGLI 1189 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKA 178 Y++ + ++ + Y KS ++ LF +N++ + E F T + Sbjct: 1190 YSD----EDKISTDGKRSSPYFKSDWNEFLFRSQNMVCHLGAYRRDLVNEVGQFRTGFEG 1245 Query: 179 AQDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 AQDYD+ LR V + + + I+ G + K ++ + + + Sbjct: 1246 AQDYDLALRCVEKLERNQIIHIPRVLYHWRIHAGSTAMAGDEKPYAA-LAGVKALDEHLE 1304 Query: 237 RAS----KKYQLFTLYQIRNKRMTWRTLLTL-LSVRNGKRL 272 R + ++Y++ L++L + RN L Sbjct: 1305 RKGGVGIAELSSSSMYRVHYALPASPPLVSLIIPTRNAHAL 1345 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 59/263 (22%), Positives = 104/263 (39%), Gaps = 26/263 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+S+ +PT N L + I S+ +Q Y+++E+I++D+ S E L+ + + I Sbjct: 1331 PLVSLIIPTRNAHALVKQCIDSI-KQLTTYTHYEIILIDNGSDEPESLEYFAQLDQEENI 1389 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHA---- 117 + ++ A+ N A+ +A GE I I++D + TP LS ++ Q A Sbjct: 1390 RVMRDEGPFNYSALNNAAVRIANGELIGLINNDIEVITPEWLSEMVSIALQPNVGAVGAR 1449 Query: 118 FLYANDYVCQGEVYSQ-----PASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRFKEC 170 Y +D + G V + S PK + F + +I + V K+ Sbjct: 1450 LWYPDDRLQHGGVITGLGGVAGHSHKYLPKG--APGYFCRAELIQEFSAVTAACLIIKKS 1507 Query: 171 LFD-------TELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 FD LK A D D L++ E G ++ H + +P+K Sbjct: 1508 TFDQVGGLEEEHLKIAFNDVDFCLKV-REAGYVNVWTPFAELYHHESATRGLEDTPQKQE 1566 Query: 223 GYFHFYRKHKDKFDRASKKYQLF 245 + K ++ Y Sbjct: 1567 RFAKEIEYIKSRWPDIQADYAYS 1589 >UniRef50_B1ZZR1 Glycosyl transferase family 2 n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZZR1_OPITP Length = 1275 Score = 207 bits (528), Expect = 3e-52, Method: Composition-based stats. Identities = 65/283 (22%), Positives = 113/283 (39%), Gaps = 15/283 (5%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALN 58 + PLIS+ MP +N ++ ++AI+SV Q Y NWE+ I DD S++ A Sbjct: 737 LAQRPLISVIMPVYNTPEKWLVKAIESVRAQIYPNWELCIADDASSASHVRPLLEDFARR 796 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + D N A N A+ LA+GE+ +D DDE + L + Sbjct: 797 DERIKLVFRDQNGHISAASNSALDLARGEFAALLDHDDELARDALYEVVQCLAAHPDADL 856 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 +Y++ + ++ Y K + LF +N + A ++ F + Sbjct: 857 IYSD----EDKIDEAGRRFDPYFKPDFLPDLFTAQNFTSHLSVYRASLIRQAGGFRIGYE 912 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY--RKHKDKF 235 +QD+D+ LR ++ E +++ ++L+ T+ Y + D F Sbjct: 913 GSQDWDLALR-AIDLTERSRIQHVPKVLYHWRAIPGSTALALGEKNYPMLAARKALADHF 971 Query: 236 DRASKKYQLFTLYQ--IRNKRM---TWRTLLTLLSVRNGKRLA 273 R + +L R KR + ++ RN L Sbjct: 972 ARRGETVELLRQVGDYWRVKRPVPANPPLVSLIIPTRNRADLL 1014 Score = 91.5 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 54/242 (22%), Positives = 89/242 (36%), Gaps = 23/242 (9%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N PL+S+ +PT NR L R + S+L + Y +E+I+VD+ S Y+ L Sbjct: 997 NPPLVSLIIPTRNRADLLSRCVGSILAKTSYPRFEIIVVDNGS-DEPGTLAYLDQLRSGG 1055 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHK---QQLVTHA 117 T + D A+ N A A+GE + +++D + P+ L ++H + A Sbjct: 1056 TTVLRYDEPFNFSAINNFAAAQAEGEILGLLNNDLEVINPDWLDEMVSHAVRPEIGCVGA 1115 Query: 118 FLYANDYVCQ--------GEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQVFTWAW 165 LY D Q G V + Y R +N + Sbjct: 1116 MLYYPDDTIQHAGVLLGLGGVANHAYYHAPRGTCHYFNRAHLLQNYSAVTAACLLIRRKV 1175 Query: 166 RFKECLFD-TELKAA-QDYDIFLRM-VVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 + + T+L A D D LR+ Y W + H +P+K + Sbjct: 1176 FAQVGGLNATDLAIAFNDVDFCLRVRAAGYLNLWT--PFAEFYHHESPSRGSEDTPEKLA 1233 Query: 223 GY 224 + Sbjct: 1234 RF 1235 >UniRef50_B1FXK1 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B1FXK1_9BURK Length = 723 Score = 207 bits (527), Expect = 3e-52, Method: Composition-based stats. Identities = 65/250 (26%), Positives = 105/250 (42%), Gaps = 13/250 (5%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTALND 59 P +SI MPT+N ++ +A+ SV+ Q Y NWE+ I DDCST E L YV Sbjct: 181 QPKVSIVMPTYNTPEKWLRKALDSVVDQVYENWEICIADDCSTKPEVRMVLDSYVAKYP- 239 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 ++ + N A N A+ LA GE++ +D DDE P L + + A + Sbjct: 240 GQVKVAYRTTNGHISASSNTALELATGEFVGLLDHDDELHPLALYCVIEKLNEHPDAALI 299 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKA 178 Y++ + ++ + Y K ++ LF +N+I + ++ F T L+ Sbjct: 300 YSD----EDKISEEGERSDPYFKCDFNYDLFLSQNMISHFGVYKTSVLRDIGGFRTGLEG 355 Query: 179 AQDYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QDYD+ LR++ G + V A I E + K + R D Sbjct: 356 SQDYDLALRVIDRVGHDVVYHVPRALYHWRII-PESTASGHEAKPYAHIAAMRAIDDHLK 414 Query: 237 RASKKYQLFT 246 R + Sbjct: 415 RNNIAAHTIH 424 Score = 86.1 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 92/240 (38%), Gaps = 23/240 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P + I +PT + +L + ++SV RQ YSN+ + I+D+ S E + D RI Sbjct: 443 PSVEIIIPTRDSAELVEQCVESV-RQKSTYSNYRITIIDNGSVKPETHDLFARLQADERI 501 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQLVTH 116 I +D A+ N+ + + +++ +++D D W +SV + V Sbjct: 502 KVIRDDSPFNYSAINNRVALASTADFVCLMNNDIEVINAD-WLEEMVSVAIQKNVGAVGA 560 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRNIIGNQV---------FTWAW 165 LY +D + G V + + + + ++ R + N + + Sbjct: 561 KLLYPDDTIQHGGVVLGVGGIASHAHKHFPNTMAGYFARARLRNAMSAVTAACLLIRQSI 620 Query: 166 RFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 + D EL A D D LR+ A ++ H +P+K S + Sbjct: 621 YKEVGGLDEELHVAYNDIDFCLRVRKAGYRNVWTPYA-ELYHHESATRGAEDTPEKISRF 679 >UniRef50_A0YLN7 Glycosyl transferase, group 2 family protein n=3 Tax=Oscillatoriales RepID=A0YLN7_9CYAN Length = 2105 Score = 207 bits (527), Expect = 3e-52, Method: Composition-based stats. Identities = 65/267 (24%), Positives = 105/267 (39%), Gaps = 11/267 (4%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPRIT 63 IS+ MP +N + +AI SV+ Q YSNWE+ I DDCS+ E + A D RI Sbjct: 857 KISVVMPVYNPPIEFLEKAINSVINQVYSNWELCIADDCSSDPEIAEFLNKFAEEDERIK 916 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N N A +LA GE+I +D DDE TPN L + + +Y +D Sbjct: 917 VIFRPENGNISRATNSAAVLATGEFILLLDHDDELTPNALGEIALYLVENPETDVMYTDD 976 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 ++ ++ K +S L IG+ + F+ + +QDY Sbjct: 977 ----DKIDTEGKRFAPQFKPDWSPELLLSYMYIGHALVLRRKLFEQVAGMRLGFEGSQDY 1032 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+ LR+ + G I+ + K + + ++ DR + Sbjct: 1033 DLALRITENTHHIAHLPLVLYHWRTAPGSTAISGAAKP-ASFEAGRLAVQEALDRREIES 1091 Query: 243 QLFTLYQIRNKRMTWRTLLTLLSVRNG 269 ++ YQ + + + NG Sbjct: 1092 KV---YQPDWAVKQSLGIFSHQFIDNG 1115 Score = 86.4 bits (213), Expect = 9e-16, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 105/285 (36%), Gaps = 29/285 (10%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSW---EQLQQYVTA 56 + N P ++I +PT N+ L +KS+L + Y N+E++I+D+ S E L A Sbjct: 1112 IDNGPSVTIIIPTKNQLNLLQACLKSLLEKTTYQNYEVMIIDNESDDPKTLEYLDWIALA 1171 Query: 57 LNDPRITYIHNDINSG---ACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ 112 +P+I+ + G A+ N+A+ Q +Y+ +++D + +P LS + + Q Sbjct: 1172 ETEPKISVLRVANPHGKFSFAAINNRAVEQTQTDYVLFLNNDTEIISPEWLSQMMGYIQF 1231 Query: 113 LVTHA----FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYK--------RNII---- 156 A ++ +D++ + S R ++ RN Sbjct: 1232 EKVGAVGARLIFPDDHIQHAGIIHGLHHKLAGHAFKLSHRDYFGYLAYSKVVRNYSAVTA 1291 Query: 157 GNQVFTWAWRFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQI 214 + + FD + A D D R+ + EA ++LH Sbjct: 1292 ACLLTPRQLFLELGGFDEQNFAVAYNDADYGYRLTEKNYRCVYCAEA-ELLHKEGTSRGF 1350 Query: 215 TSSPKKFSGYFHFYRKHKDKF--DRASKKYQLFTLYQIRNKRMTW 257 PK+ + + Y D + S + F + R Sbjct: 1351 KDDPKEVAHFRQKYADKIDPYYSHHLSLDNEWFHIQPRRFNIQNP 1395 >UniRef50_Q0VR09 Glycosyl transferase, group 2 family protein, putative n=2 Tax=Gammaproteobacteria RepID=Q0VR09_ALCBS Length = 274 Score = 207 bits (527), Expect = 4e-52, Method: Composition-based stats. Identities = 62/262 (23%), Positives = 109/262 (41%), Gaps = 6/262 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +SI P +N + R I+SVL Q++ +WE+++VDD ST D RI Sbjct: 7 PEVSIITPCYNARHTLARTIESVLSQNFLSWELLLVDDKSTDDTDQHAQAFCDRDNRIKL 66 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I NSGA RN+ I A+G YI ID DDEW ++LS+ ++ ++ A Sbjct: 67 IRLKENSGAAVARNRGIASARGRYIAFIDADDEWHKDKLSIQISQMKEQGWPLSYTAYTR 126 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDI 184 + +P R K N IG + +F EL+ QD+ + Sbjct: 127 INPEGACINTVGVP----PTIHYRQLLKTNYIGCSTAVYDSEQLGKVFMPELRKRQDFGL 182 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQL 244 +L+++ + + + A ++ ++S+ + +GY + +++ + Y Sbjct: 183 WLKILKKTTSGYGINLALTRYCVHQD--SLSSNKRNTAGYNWRLYRDEEQLNAIMASYYF 240 Query: 245 FTLYQIRNKRMTWRTLLTLLSV 266 R + L L + Sbjct: 241 ANYALRGVLRSRFPKLALFLGI 262 >UniRef50_Q7U8H3 Putative glycosyltransferase family 2 protein n=1 Tax=Synechococcus sp. WH 8102 RepID=Q7U8H3_SYNPX Length = 804 Score = 207 bits (526), Expect = 4e-52, Method: Composition-based stats. Identities = 60/257 (23%), Positives = 103/257 (40%), Gaps = 8/257 (3%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALND 59 ++ PLISI +PT+N I+SV RQ Y NWE+ I DD S++ +D Sbjct: 235 QDAPLISIIIPTYNTNSNHLRECIESVCRQSYPNWELCICDDSSSAVSVKTILRSYQSSD 294 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PR+ I + N C N A+ +A GEY+ +D DD N L Q+ + Sbjct: 295 PRVKLIFREKNGHICEASNDALRMATGEYVALLDHDDILADNALYWVARELQKKPQANLI 354 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKA 178 Y++ + ++ + K ++ L N I + K+ F + Sbjct: 355 YSD----EDKINDDGMRACPHFKPAFNIDLLLSYNFISHLGVYRREILKQIGGFRVGFEG 410 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP-KKFSGYFHFYRKHKDKFDR 237 +QD+D+ LR V+E + + H S+P K ++ + D Sbjct: 411 SQDHDLALRTVLESSPDQIIHIPRVLYHWRAHSESTASNPDSKDYTTESGHKAVQHFLDE 470 Query: 238 ASKKYQLFTLYQIRNKR 254 ++ + +I+ K Sbjct: 471 QHRRGGVRATARIKAKN 487 Score = 53.3 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 50/300 (16%), Positives = 107/300 (35%), Gaps = 27/300 (9%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTS--WEQLQQYVTALND 59 +P + + +PT ++ ++ A+ S++ + Y+N+ + +VD+ S + L + + ++ Sbjct: 498 KSPSVELIIPTRDQAEVLNLAVDSIITKTTYTNYTITVVDNQSEEVATKNLFKNLKRVHG 557 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNR-LSVFLAHKQQ----LV 114 +I I + A+ N A+ + + + +++D E ++ L ++H + V Sbjct: 558 EKINIIKYNKKFNYSAINNYAVRKSTADIVVLVNNDVEVISSKWLQEIVSHTSRPDVGCV 617 Query: 115 THAFLYANDYVCQGEVYSQPASLPLY-------PKSPYSRRLFYKRNIIGNQVFTWAWRF 167 Y+N + G V + + Y RL Y + + V Sbjct: 618 GAKLYYSNRTIQHGGVVIGIGQVAGHAHKYFPGDSPGYVDRLQYVQQM--TAVTAACLAI 675 Query: 168 KECLFDT-------ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILH--INHGEMQITSS 217 + +F+ +L A D D +R+ A H I+ G Sbjct: 676 RREIFNEVGGLNEQDLTIAFNDVDFCMRVHARGYRNIFTPYAELFHHESISRGTEDSPEK 735 Query: 218 PKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 ++F +F D LF + + + SV+ G L R Sbjct: 736 KERFKREINFMLNQYDIQSNGELPSDLFYNPNLTKIHEDFSINTSPESVKQGIELRSNFR 795 >UniRef50_C2H8B4 Family 2 glycosyltransferase n=13 Tax=Enterococcus RepID=C2H8B4_ENTFC Length = 712 Score = 207 bits (526), Expect = 5e-52, Method: Composition-based stats. Identities = 61/276 (22%), Positives = 105/276 (38%), Gaps = 15/276 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALNDPR 61 PLISI MP +N + + + I SVL Q Y +WE+ I DD ST + + D R Sbjct: 177 RPLISILMPVYNVEIKWLEKCIDSVLDQTYDHWELCISDDASTDPAIRKCLESYQAKDDR 236 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N N A+ +A+GE+I +D+DDE P L +Y+ Sbjct: 237 IKVVFRKENGHISLATNSALEIAEGEFIALLDNDDELPPFALYEVAKVLNVHPELDVIYS 296 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQ 180 + + ++ + + K+ +S N I + KE F + +Q Sbjct: 297 D----EDKIDADGNRFDPHFKADWSPDTLMGNNYISHLGVYRTSIVKELGGFRKGYEGSQ 352 Query: 181 DYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 DYD+ LR+ + + ++ G K + Y + D R Sbjct: 353 DYDLVLRVTEQIPADHIYHIDRVLYHWRTIPGSTASNGEAKSYI-YDSGVKALTDALSRR 411 Query: 239 SKKYQ-----LFTLYQIRNKRMTWRTLLTLLSVRNG 269 + K + Y+I + + ++ +NG Sbjct: 412 NIKGSVRPGRISGFYEIAYDVLQEDLVSVIIPTKNG 447 Score = 95.3 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 58/251 (23%), Positives = 99/251 (39%), Gaps = 23/251 (9%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSW---EQLQQYVTALNDPR 61 L+S+ +PT N + + S++ + YSN+E+II D+ ST E +Y L D R Sbjct: 437 LVSVIIPTKNGYEDLKTCVDSIIEKTSYSNYEIIIADNGSTDPKMQELFAEYKHQLKD-R 495 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQ----QLVTH 116 DI + N A A G+Y +++D E P+ ++V +++ Q V Sbjct: 496 FIVELIDIPFNYSRINNLAAEKANGKYFLFLNNDTEVIEPDWMTVMVSYAQFDRIGCVGA 555 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSR-------RLFYKRNII---GNQVFTWAWR 166 Y +D V + + + Y R RL N + + A Sbjct: 556 KLFYPDDTTQHAGVLLGIGGVAGHALNNYDRTHCGYFGRLVIDVNYLAVTAACMMVKAAD 615 Query: 167 FKEC-LFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 F FD L+ A D D+ L++ E G ++ H +P+K + Sbjct: 616 FNAVNGFDETLEVAFNDVDLCLKVY-ELGRYNVYAHQAELYHFESKSRGYEDTPEKQRRF 674 Query: 225 FHFYRKHKDKF 235 +K +DK+ Sbjct: 675 AGEIKKMQDKW 685 >UniRef50_B4CZV6 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZV6_9BACT Length = 306 Score = 207 bits (526), Expect = 5e-52, Method: Composition-based stats. Identities = 53/228 (23%), Positives = 99/228 (43%), Gaps = 8/228 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +P +N AI SVL Q + ++E+I+VDD ST + + DPR+ + Sbjct: 5 VSVVIPAYNYAHFLPEAIASVLAQTWEHFELIVVDDGST--DNTPEVCARYTDPRVRAVR 62 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT---HAFLYAND 123 N+G A RN I A+ ++ +D DD W P L+ ++L + Sbjct: 63 Q-TNAGLSAARNTGIREARYPFVAFLDADDRWQPEFLATVFREFERLGPTFAAVGTSCSR 121 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKAAQDY 182 GE + P L+ + + + + + F+EC FDTEL++++D Sbjct: 122 MNAIGEPLAPPRQNFLHTSELTVKSFCLRNRPLSSSIVVRRAVFEECGYFDTELRSSEDR 181 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 D+++R+ + + E I+ H + ++P+ R+ Sbjct: 182 DMWIRITARGHRFFYLGEPLAIIR-RHPQNMSKNAPRMKRNSGLVLRR 228 >UniRef50_D2RDR5 Glycosyl transferase family 2 n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RDR5_ARCPR Length = 1164 Score = 206 bits (525), Expect = 6e-52, Method: Composition-based stats. Identities = 66/223 (29%), Positives = 99/223 (44%), Gaps = 13/223 (5%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTAL 57 K P ISI MPTWN ++ +AI+SVL Q Y NWE+ I D ST + L++Y A Sbjct: 629 KYRPKISIIMPTWNTDERWLRKAIESVLNQVYDNWELCIADGGSTKPHVRKILEEY--AK 686 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 D RI N G N+A+ LA GE++ +D DDE P L + + Sbjct: 687 KDKRIKVKFLPKNLGIAGNSNEALKLATGEFVAFLDHDDELAPFALYEVVKLLNEKPDLD 746 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI-IGNQVFTWAWRFKECLFDTEL 176 F+Y++ + ++ + + K YS +F N I V + K F Sbjct: 747 FIYSD----EDKIDEKGRRRDPFFKPDYSPDMFLSCNYLIHITVIRKSLVDKVGGFRLGY 802 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 +QDYD+FLR++ + + + H E S P+ Sbjct: 803 DGSQDYDLFLRVLEHTDKIAHIPKIL--YHWRAIETSCASRPE 843 Score = 98.8 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 56/256 (21%), Positives = 102/256 (39%), Gaps = 22/256 (8%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALND 59 + NP +SI +PT ++ ++ R ++S+L + Y N+E++IVD+ S + + Y T + Sbjct: 885 INGNPKVSIIIPTKDKVEVLKRCVESILNKTTYQNYEIVIVDNNSQEEKTFEYYETIKDH 944 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ----LV 114 P+I + + A+ N A+ E+I +++D E T LS L H Q+ V Sbjct: 945 PKIRILEYNKPFNFSAINNYAVSKVDSEFILFLNNDTEVITSEWLSAMLEHAQRKEVGAV 1004 Query: 115 THAFLYANDYVCQGEVY-----SQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-- 167 LY N+ + V + A S + N++ N A Sbjct: 1005 GAKLLYPNNTIQHAGVILGLGVHRVAGHSHRHYPANSHGYVGRINVVQNLSAVTAACMLT 1064 Query: 168 ------KECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 + FD L A D D L++ + A + H +P+ Sbjct: 1065 KKSLFEEVGGFDEVNLPIAFNDVDYCLKLREKGYLIVYTPYA-VLYHYESLSRGYEDTPE 1123 Query: 220 KFSGYFHFYRKHKDKF 235 K + + R ++K+ Sbjct: 1124 KQARFLREVRYMREKW 1139 >UniRef50_A3I1Y0 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I1Y0_9SPHI Length = 255 Score = 206 bits (525), Expect = 6e-52, Method: Composition-based stats. Identities = 57/257 (22%), Positives = 104/257 (40%), Gaps = 8/257 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI P N + AI+S+L+Q Y+NWE+I++DD S+ + D RI + Sbjct: 2 LVSIITPVHNSIEHVEEAIQSILKQTYTNWELILIDDKSSDSSLEVLHAYEEKDSRIHLL 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N GA RN+ I ++G +I +D DD W P +L +A + HAF Y + Sbjct: 62 KNEKNKGAALTRNRGIEASKGRFIAFLDSDDLWEPTKLETQIAFMLE-KGHAFTYTSYLT 120 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + + L + K IG + ++ + QDY ++ Sbjct: 121 FRNDNEEVGV---LEVPEKVNHNDLLKTCSIGCLTAMYDTEVLGKVYMPIISKRQDYALW 177 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY-FHFYRKHKDKFDRASKKYQL 244 L+++ E + + + I+ + K +GY + YR+ ++ Y Sbjct: 178 LKILKEIPFAYGINTPLAKYRLR--SDSISGNKLKAAGYQWKVYREFEN-LSIFQASYYF 234 Query: 245 FTLYQIRNKRMTWRTLL 261 + + + Sbjct: 235 IHYSIFGVIKTYFNKMQ 251 >UniRef50_C6X6S7 Glycosyl transferase family 2 n=2 Tax=Methylophilaceae RepID=C6X6S7_METSD Length = 282 Score = 206 bits (525), Expect = 7e-52, Method: Composition-based stats. Identities = 85/260 (32%), Positives = 137/260 (52%), Gaps = 12/260 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+Y PT NR L I+AI SVL Q + + EMI+++D ST A D R+ ++ Sbjct: 2 LVSVYTPTKNRLDLLIKAIDSVLGQTHQDIEMIVINDGSTDGTAAYLEERAQKDHRLRFV 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT----HAFLYA 121 + I+ GA RN AI +A+GE+ITG+DDDD + PNR+ F+A+ Q L A LY+ Sbjct: 62 NKTISEGAPKARNMAITMAKGEFITGLDDDDAFQPNRVEAFIAYWQLLKNLGLQPACLYS 121 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 D + + + A + + K + R + N IGNQ+F F + LFD +L A Q Sbjct: 122 QDIIVKNGI----ADVGTHKKGSITYRDMIEFNYIGNQIFAPKQYFVEAGLFDEQLPAWQ 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT--SSPKKFSGYFHFYRKHKDKFDRA 238 D D+F+R++ ++G + ++ TQ+ + +I+ S+ K S KH + R Sbjct: 178 DMDMFIRVLKKFGTAYLLDIPTQLYDDSPRTDRISIKSAMKIRSACNALTSKHAEN-ARH 236 Query: 239 SKKYQLFTLYQIRNKRMTWR 258 +++ L + R T + Sbjct: 237 KQQFMLQMFSKFYGIRPTIK 256 >UniRef50_A0YST1 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YST1_9CYAN Length = 297 Score = 206 bits (525), Expect = 7e-52, Method: Composition-based stats. Identities = 54/231 (23%), Positives = 109/231 (47%), Gaps = 10/231 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+IS+ +P +N ++ + I+SVL Q + +WE+I+VDD ST + ++ V + D RI + Sbjct: 2 PIISVIIPAYNAEKTIKKTIQSVLNQTFLDWELIVVDDGST--DSTKEVVAEIKDNRI-F 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-ND 123 + + N+G A RN+ + + GE++ ID DD WTP++L + L Q+ A Y+ D Sbjct: 59 LFSFPNAGVSAARNRGVNKSSGEFLAFIDADDLWTPDKLELQLKALQEHPKAAVAYSWTD 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRFKECLFDTELKAAQ 180 Y+ + + + Y + R ++ N + K FD + + Sbjct: 119 YIDESDQIIYSGRHITLNGNIYEK--LLVRYVLENGSNFLIRKDAYLKVGGFDESIFGVE 176 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 D+D+ +R+ Y + + + + ++ + S ++ + + Sbjct: 177 DWDLSIRLSQLY-KFVAIPKTQVLYRMSPHSITSNFSKQELESLRVIEKAY 226 >UniRef50_A8TT37 Glycosyl transferase, group 2 family protein n=2 Tax=Bacteria RepID=A8TT37_9PROT Length = 1185 Score = 206 bits (525), Expect = 7e-52, Method: Composition-based stats. Identities = 60/270 (22%), Positives = 107/270 (39%), Gaps = 12/270 (4%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 MK P++S+ MPT+N ++ RAI+SV Q Y WE+ I DD ST E + A Sbjct: 634 MKAKPVLSVIMPTYNTDPEVLDRAIQSVRNQIYPRWELCIADDASTHGETQEVIRRHAGE 693 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + ++N A N A+ LA G+++ +D DDE T + L + + Sbjct: 694 DDRIKTVFREVNGHISAASNSALDLATGDFVVLLDHDDELTSHALYMIASELNDHPNADV 753 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELK 177 +Y++ + ++ + Y K +S +F +N + + + F + Sbjct: 754 IYSD----EDKLDDNGEAYQPYFKPDWSPDMFLAQNYLNHVSAHRRSLVEVVGRFREGFE 809 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS-PKKFSGYFHFYRKHKDKFD 236 +QDYD+ LR++ + H + + K R + + Sbjct: 810 GSQDYDLALRVIERTTAERIRHIPHVLYHWRAVAGSVAGADEAKDYALEAARRAITEHLE 869 Query: 237 RASKKYQLFTLYQIRNKRMTWRTLLTLLSV 266 R ++ R + R + L Sbjct: 870 RTGIDGEVV----PGADRFSHRVIYALPHT 895 Score = 92.6 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 54/254 (21%), Positives = 98/254 (38%), Gaps = 22/254 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+S+ +PT N+ ++ I + DY NWE+I+VD+ S E L+ DPRIT Sbjct: 897 PLVSLIVPTRNKAEVLKLCIDGIREHNDYPNWELILVDNGSDEDESLEYLEALGKDPRIT 956 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQLVTHA 117 + +D + N+A +A+GE + +++D D W + + V Sbjct: 957 VLRDDGPFNYSRLNNRAARMAKGEILGLLNNDIEPINRD-WLTEMVRQVVQPGVGAVGAK 1015 Query: 118 FLYANDYVCQGEVYSQPASLPLYPK---SPYSRRLFYKRNIIGNQVFTWAWRFK--ECLF 172 Y ND + G V + + R F + I+ N A + ++ Sbjct: 1016 LYYPNDTLQHGGVIVGLGGVAGHFDKRLPRNERGYFGRAGIVQNFTAVTAACMLMPKTVW 1075 Query: 173 D--TELK------AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 D L A D D+ L++ + G ++ H + P+K + Sbjct: 1076 DEVEGLDEHHLSVAFNDVDLCLKV-RKAGYRIVWSPYAELYHHESVSRGQDTEPEKLERF 1134 Query: 225 FHFYRKHKDKFDRA 238 R + +++ Sbjct: 1135 ASEVRTMQTRWNTT 1148 >UniRef50_C7TKV7 Glycosyl transferase, group 2 n=5 Tax=Lactobacillus RepID=C7TKV7_LACRL Length = 251 Score = 206 bits (524), Expect = 8e-52, Method: Composition-based stats. Identities = 61/250 (24%), Positives = 98/250 (39%), Gaps = 6/250 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M N P++S+ MP +N ++SVL Q + + E++IVDD S+ + D Sbjct: 1 MGNTPVVSVVMPAYNAATYLPNTVQSVLEQSFKDLELLIVDDASSDHTFEVVEKLSATDS 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RITYI D N G RN + ++GEY+ +D DD W P++L + L QQ F Y Sbjct: 61 RITYIKLDENQGVANARNVGVQHSRGEYVAFLDSDDIWLPDKLKIQLTFMQQH-QVDFSY 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + V K Y K N IG + + + E+ + Sbjct: 120 CSYEVVDDHGTKIGERKISETKLDYHE--MLKGNRIGLLTVMLSRKIAQKYPFPEI-NHE 176 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DY +L + + E + H + + + YR+H+ K Sbjct: 177 DYACWLSIARNGTVAYLASEQILAKYRKHQTSLSANKIQAAQWTWTIYRRHEHLG--LFK 234 Query: 241 KYQLFTLYQI 250 + F Y + Sbjct: 235 SFYYFLHYSL 244 >UniRef50_D0HB89 Glycosyl transferase group 2 family protein n=2 Tax=Vibrio RepID=D0HB89_VIBMI Length = 1073 Score = 206 bits (524), Expect = 8e-52, Method: Composition-based stats. Identities = 65/295 (22%), Positives = 116/295 (39%), Gaps = 21/295 (7%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTA 56 +KN P IS+ +P +N AI+SV Q Y WE+ I DD S + E LQ+Y + Sbjct: 532 LKNKPKISVLIPVYNAPVDYLREAIESVCTQLYPEWELCIADDASPNQEVRDLLQEY--S 589 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D RI + + N N A+ +A G Y+ +D DD + L Sbjct: 590 QKDIRIKVVEREQNGHISEATNSALDVATGAYVALMDHDDILPNDALYWVAETILANPDV 649 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTE 175 A +Y+++ + ++ KS ++ L +N I + KE F Sbjct: 650 ALIYSDEDKITADGQTR---YDPNFKSQWNPELLLSQNCISHLGVYRTDLAKEIGGFRKG 706 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHIN--HGEMQITSSPKKFSGY--FHFYRKH 231 + AQD+D LR + + + H G I K ++ + R+H Sbjct: 707 FEGAQDWDFALRFSEKVKPEQIIHIPRILYHWRAIEGSTAIDGDEKPYALFAGLKAVREH 766 Query: 232 KDKFDRASK--KYQLFTLYQIRNKRMTWRTLLT-LLSVRNGKRL----ADGIRGR 279 + ++ ++ +++ L++ ++ RNG+ + D I G+ Sbjct: 767 CKRMSINAEVVEHPERHYARVKYNIPKPMPLVSMIIPTRNGQEVLSVCIDSILGK 821 Score = 91.5 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 48/207 (23%), Positives = 85/207 (41%), Gaps = 21/207 (10%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQQYVTAL--N 58 K PL+S+ +PT N Q++ I S+L + Y N+E+IIVD+ S + Y+ L Sbjct: 793 KPMPLVSMIIPTRNGQEVLSVCIDSILGKTTYPNYEIIIVDNGS-DCPETLAYLDKLEQK 851 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHA 117 P + + ++ A+ N+A +A+GE + +++D + TP+ L+ + H Q A Sbjct: 852 HPNVVVMRDESPFNYSALNNKAAAIAKGEVLALVNNDVEVITPDWLTEMVGHVIQAQNGA 911 Query: 118 ----FLYANDYVCQGEVYSQPASLPLYPK------SPYSRRLFYKRNII----GNQVFTW 163 Y ++ + G V K Y+ R +N V Sbjct: 912 VGARLWYPDNTLQHGGVIFVGGVAGHAHKHLPKGMPGYACRAIVAQNYSAVTAACLVVRK 971 Query: 164 AWRFKECLFDTE--LKAAQDYDIFLRM 188 A + + A D D L++ Sbjct: 972 AVFEQVGGLNETDLTVAFNDIDFCLKV 998 >UniRef50_Q9X4C3 Glycosyltransferase WcaN n=2 Tax=Enterobacteriaceae RepID=Q9X4C3_ECOLX Length = 287 Score = 206 bits (524), Expect = 9e-52, Method: Composition-based stats. Identities = 85/265 (32%), Positives = 136/265 (51%), Gaps = 10/265 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS--WEQLQQYVTALNDPRITY 64 +SIY+PT NR ++ RA+ S+L Q Y N+++++ +D S+ + +++Y + +D Y Sbjct: 5 VSIYIPTHNRPKMLARALDSLLTQTYKNFQVLVCNDGSSESYDQIIKEYRSKFSD--FIY 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N GAC RN+ I +A GEYITG+DDDDE+ P+RL+ F+ L +A++ A Sbjct: 63 IENQSPMGACCSRNKLINIADGEYITGLDDDDEFLPSRLTDFIDSPN-LDKYAYISAGHI 121 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDYD 183 S + + RNI+GNQVFT KE FD L A QDYD Sbjct: 122 TKT----SSGLFKQKIDERVITLESLLSRNIVGNQVFTKTVFLKESGGFDEHLPAWQDYD 177 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++ + + G +++E+ +I+H E +I++S K GY F KHKD + + + Sbjct: 178 AWINLTKKIGNGYRIEKYNYQWNIDHEEGRISNSSKAEIGYNLFIEKHKDILSKDNLRNL 237 Query: 244 LFTLYQIRNKRMTWRTLLTLLSVRN 268 RN ++ L S+ Sbjct: 238 YIQDKINRNVDISLAELFGNFSISI 262 >UniRef50_A6QB23 Glycosyl transferase n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QB23_SULNB Length = 1335 Score = 205 bits (523), Expect = 1e-51, Method: Composition-based stats. Identities = 61/242 (25%), Positives = 103/242 (42%), Gaps = 10/242 (4%) Query: 5 PLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDPRI 62 PLISI +PT+N ++ ++SVL Q Y NWE+ I DD ST E + P + Sbjct: 89 PLISIIVPTYNTKKRYLTEMLESVLSQTYGNWELCIADDASTDRETIDTLEYYRTKHPAV 148 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 ++ N N A+ +A G+Y+ +D DD +PN L + LY++ Sbjct: 149 KVVYRKKNGHISEASNTALSIALGDYVAFLDHDDTLSPNALYEMAKKLNEDRKLKILYSD 208 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQD 181 + ++ + KS ++ +F+ +N I + + K F + +QD Sbjct: 209 ----EDKIDENSNRYMPHFKSGWNPDMFFSQNYITHLLIIKKEIIDKVGGFRKGYEGSQD 264 Query: 182 YDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 YD+ LR + G E +VE+ G S+ K + + R +D F R Sbjct: 265 YDLVLRCLDHIGKEEIGRVEKILYHWRAIKGSTAYGSNEKAY-AHDAGLRGLQDYFLRKD 323 Query: 240 KK 241 + Sbjct: 324 RS 325 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 63/279 (22%), Positives = 118/279 (42%), Gaps = 14/279 (5%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 PL SI MP +N + AI+SV Q Y WE+ I DD STS E+ + Y+ ++ Sbjct: 786 SKQPLFSIVMPVYNVVPKWLKLAIESVENQWYGRWELCIADDASTS-EETKAYLRKIDHH 844 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 +I N G C N+A+ LA+GEYI +D+DDE TP+ L L Sbjct: 845 KIKIRFLKKNLGICGASNEALKLAKGEYIALMDNDDELTPDALYEILKAINTKAAELIYS 904 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAA 179 D + + +++P + K +S +F +N + + V + ++ L+ + Sbjct: 905 DEDKIEEDGTFAEP-----HFKPDFSPDMFLSQNYLSHLVVIKKELVDRVGGWEAGLEGS 959 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QDYD++L+++ + + + G S K + + ++ R + Sbjct: 960 QDYDLYLKVLEHTEKISHISKVLYHWRKVPGSTAAEYSAKSY-AQEAGRKALENAMKRRA 1018 Query: 240 KKYQLFTL-----YQIRNKRMTWRTLLTLLSVRNGKRLA 273 K + Y+++ + + ++ ++ L Sbjct: 1019 IKADVKNGKYPGTYRVKYELKEEPLVSIIIPFKDKPELL 1057 Score = 92.2 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 58/269 (21%), Positives = 107/269 (39%), Gaps = 28/269 (10%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQ--QYVTALNDPR 61 PL+S+ +PT + + + I+S+L++ Y N+E++IVD+ ST + L+ + + R Sbjct: 347 PLVSLVVPTRDSYNILHKCIESILQKTLYENYEILIVDNESTDPKTLRYFEILKKHEHIR 406 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHK---QQLVTHA 117 I H+ N A+ N + A+GE I +++D + + LS + H + A Sbjct: 407 ILEYHHPFNY--SAINNYGVQYARGEIIGLLNNDVEIISSGWLSEMVQHAIRPEIGAVGA 464 Query: 118 FLYANDYVCQ------GEVYSQPASLPLYPKSP--YSRRLFYKRNII----GNQVFTWAW 165 LY +++ Q G S +P++ Y RL +N + + Sbjct: 465 KLYYDNHTIQHAGIVLGIGGVAGHSHKYFPQNHHGYFSRLKIIQNYSAVTAACLLMRKSV 524 Query: 166 RFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG 223 + + E A D D+ L++ + + H E S Sbjct: 525 YLEAGGLNEENLAVAFNDVDLCLKLQQKGYRNLWTPYSELY----HHESISRGSDDTPEK 580 Query: 224 YFHFYRKHKDKFDRASKKYQLFTLYQIRN 252 F R+ K ++ K+ L Y RN Sbjct: 581 TERFGREIKYMLEKWKKQ-LLKDPYYNRN 608 Score = 79.1 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 54/284 (19%), Positives = 107/284 (37%), Gaps = 26/284 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 +K PL+SI +P ++ +L I+S+L++ Y N+E+I +++ S WE ++ Sbjct: 1038 LKEEPLVSIIIPFKDKPELLKTCIESILKKSSYQNYEIIGINNRSKEWETFKEMKRLEKR 1097 Query: 59 DPRITYIHNDINSGACAVRNQAIML-AQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTH 116 D R+ + + + N A+ A+G++I +++D + T N + L Q+ Sbjct: 1098 DSRVRFCEYNDTFNYSKINNFAVSSCAKGKHIVLMNNDIEIITSNWIEEMLMFSQRDDVS 1157 Query: 117 AF---LYANDYVCQ--------GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW 165 A LY + Q G V Y RL +N+ + V Sbjct: 1158 AVGSKLYYPNGTIQHAGIVLGIGGVAGHAHKYFSKNVPGYFSRLHIVQNL--SSVTAALL 1215 Query: 166 RFKECLFDT--ELK------AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS 217 K+ ++D L A D D L++ + H + Sbjct: 1216 MVKKAIYDEVGGLDEVNLQVAFNDVDFCLKLQKNGYLNLFTPWV-EAYHHESKSRGEEDT 1274 Query: 218 PKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLL 261 P+K + K+K+ + K+ + + +R + + Sbjct: 1275 PEKQERFKKEVEFMKNKWSKILKEGDPYYNPNLTLEREDFSVIF 1318 >UniRef50_C3RR62 Glycosyl transferase n=3 Tax=Bacteria RepID=C3RR62_9MOLU Length = 718 Score = 205 bits (522), Expect = 1e-51, Method: Composition-based stats. Identities = 64/281 (22%), Positives = 117/281 (41%), Gaps = 13/281 (4%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALND 59 + NPLISI +P +N + + + S+L Q Y N+E+ IVDDCST+ E + D Sbjct: 183 EYNPLISILIPVYNVERKFLSECLDSILNQTYQNYEVCIVDDCSTNLETINTLKEYENKD 242 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI IN N A+ +A+GE+I +D+DD PN L +A + F+ Sbjct: 243 TRIVVKTRLINGHISKASNDALEIARGEFICLVDNDDTLAPNALYENVALLNKHKDADFI 302 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKA 178 Y++ + ++ + + KS ++ N I + +E F L+ Sbjct: 303 YSD----EDKLDLRGERCEPHFKSDFAPDTLLGINYICHLAVLRTSLVREVGGFTVGLEG 358 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QD+D+FLR+ + + + + + G ++ K + + R Sbjct: 359 VQDHDLFLRITEKTKNIYHIPKILYHWRMIEGSTSLSVDNKAY-AVKKGIETIESTLKRR 417 Query: 239 SKKYQL-----FTLYQIRNKRMTWRTLLTLLSVRNGKRLAD 274 K + T+Y I T ++ ++ R+ + + Sbjct: 418 GVKANVKSLGNSTVYGIEYVLDTEPSVSIIVPTRDFADVTE 458 Score = 76.0 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 49/260 (18%), Positives = 98/260 (37%), Gaps = 24/260 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 + P +SI +PT + + + ++S+ + YSN+E++IVD+ S E ++ + + Sbjct: 438 LDTEPSVSIIVPTRDFADVTEKCLESIYKLTNYSNFEVVIVDNRSEKQETMELFEKYQMR 497 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ----L 113 I D+ A+ N A+ + + + +++D E TPN L + +++ Q Sbjct: 498 YENFRVIKADMEFNYSAINNLAVSTCKSDVLVLLNNDTEVLTPNWLKLMVSYAIQKHIGA 557 Query: 114 VTHAFLYANDYVCQGEVYSQPAS---------LPLYPKSPYSRRLF---YKRNIIGNQVF 161 V LY + + G V + P + Y R Y Sbjct: 558 VGAKLLYPDMTIQHGGVLLGVGNAVAAHAFISHPRDDEGVYGRLKIPYNYSAVTAACLAV 617 Query: 162 TWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 + D LK A D D L+++ + + I + + ++S K Sbjct: 618 ERKKYIQVGGLDETLKVAYNDVDFNLKLLDAGYYNLFIPQVELIHYESKSRGLDSTSEK- 676 Query: 221 FSGYFHFYRKHKDKFDRASK 240 Y F ++ + +K Sbjct: 677 ---YKQFLAENNYMHKKWAK 693 >UniRef50_A8ZWI7 Glycosyl transferase family 2 n=2 Tax=Desulfobacteraceae RepID=A8ZWI7_DESOH Length = 289 Score = 205 bits (522), Expect = 1e-51, Method: Composition-based stats. Identities = 64/234 (27%), Positives = 107/234 (45%), Gaps = 21/234 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ +PT+NR + A+ SVL QDY E+IIVDD S+ + R+T Sbjct: 7 PLVSVVIPTYNRGPMVTEAVASVLAQDYPAIEIIIVDDGSSDDTPGRLSPLK---NRVTI 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I + N G A RN + A GEYI +D DD W P ++S A + Sbjct: 64 ITQE-NRGVSAARNAGVAHAGGEYIAFLDSDDRWLPEKISTQTAFFASRP-------DAL 115 Query: 125 VCQGEVYSQPASLPLYPKSPYSRR--LFYKRNIIGNQVFTWAWRFKE------CLFDTEL 176 +CQ E PL+P++ + +R + ++ ++ V A ++ FD L Sbjct: 116 ICQTEETWIKNGKPLFPRARHKKRSGMIFEPSLELCLVSPSAVMMRKDFFLDVGGFDESL 175 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 A +DYD++LR+ + + P + E ++ + Q+++ P + K Sbjct: 176 PACEDYDLWLRISMVH--PVYLIETPLVIKQGGHDGQLSAMPGLDRYRIYAICK 227 >UniRef50_A4J8Z3 Glycosyl transferase, family 2 n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J8Z3_DESRM Length = 297 Score = 205 bits (522), Expect = 1e-51, Method: Composition-based stats. Identities = 63/248 (25%), Positives = 113/248 (45%), Gaps = 7/248 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 N L+S+ +PT+NR++ IR+I+SVL Q Y E+ ++DD ST + ++ ++ Sbjct: 2 NELVSVVIPTYNREKTIIRSIESVLNQTYKAIEVCVIDDASTDKTEKIVQEKYGDNDKVI 61 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y NSGAC RN + +A G+YI +D DD++ P ++ + + +Q + Sbjct: 62 YYRLAKNSGACVARNTGVSIAHGKYIAFLDSDDKFVPQKIELQMLKLKQCEADICVSNYK 121 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYD 183 Y + + + P K Y L+ +I + F+ FD + QD+D Sbjct: 122 YTDRNKNEKIVYAEPKEGKQLYDSLLWC-NSITTGTILGKRECFEHVQFDAAMPRYQDWD 180 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSS---PKKFSGYFHFYRKHKDKFDRASK 240 + LR+ Y + VE+A + + ITSS K Y K++ F+ ++ Sbjct: 181 LALRLSQNYRFCY-VEKAL--MEQEFQTVSITSSTGHKKTLFALDRLYEKNRLGFEECNE 237 Query: 241 KYQLFTLY 248 + + Sbjct: 238 AFVQIHWF 245 >UniRef50_A6ELC1 TuaG n=3 Tax=Bacteroidetes RepID=A6ELC1_9BACT Length = 269 Score = 205 bits (522), Expect = 1e-51, Method: Composition-based stats. Identities = 67/232 (28%), Positives = 100/232 (43%), Gaps = 5/232 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + PL+SI P +N + I+SVL Q YSNWE++IVDD ST NDP Sbjct: 12 MVSKPLVSIITPLYNAAPFIEKTIQSVLDQTYSNWELLIVDDASTDNGLQIVRDFFGNDP 71 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N IN GA RNQA A G+YI +D DD W N+L L Q+ + Sbjct: 72 RIKFYPNTINKGAAHCRNQATSEATGDYIAFLDSDDLWHKNKLEKQLKFMQKH-DCDVSF 130 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + PY ++ +K N IGN + + ++ Q Sbjct: 131 TSYLQIDQNGNLLGRQVQAIKTLPYKKQ--HKNNYIGNLTGMYNANSLGKIIAPNIRKRQ 188 Query: 181 DYDIFLRMVVEYGEPW-KVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 D+ ++L + + +P ++E ++ M K F FYR + Sbjct: 189 DWAVWLEAIKKSKKPALGLQENLAFYRVHLDSMSANKW-KLIKYNFAFYRTY 239 >UniRef50_C2P8I0 Family 2 glycosyltransferase n=10 Tax=Bacteria RepID=C2P8I0_BACCE Length = 262 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 59/232 (25%), Positives = 93/232 (40%), Gaps = 6/232 (2%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+SI P++N I+SV Q Y WEMII+DD S D R Sbjct: 11 EFQPLVSIVTPSYNASSFIKETIQSVQSQTYKKWEMIIIDDVSKDNTCELIKEEIKKDDR 70 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I + N GA RN I A+G+YI +D DD W P +L +A Q AF + Sbjct: 71 IRLIELEENGGAAIARNTGINCAEGKYIAFLDSDDLWLPEKLEKQVAFMQNN-DLAFSFT 129 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + + + + + K Y+ K IIG + ++ QD Sbjct: 130 SYQIMDQDGKLTEKVVHVPEKINYNG--LLKNTIIGCLTVMLDIEKLGKVKMPNIRTRQD 187 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP-KKFSGYFHFYRKHK 232 +L+++ + + ++E E I+S K + YR+ + Sbjct: 188 TATWLKILKQGHYAYGLDEVLSKYR--KVENSISSKKFKMAKMNWKLYREIE 237 >UniRef50_B5L396 WfdV n=10 Tax=Proteobacteria RepID=B5L396_ECOLX Length = 267 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 60/226 (26%), Positives = 98/226 (43%), Gaps = 5/226 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ PT+N ++ I SVL Q Y+N EMIIVDDCS+ D R+ I Sbjct: 7 LVSVITPTYNSERTIRCTIDSVLSQSYNNIEMIIVDDCSSDNTVSICREYQKKDSRVVLI 66 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N GA RN AI A+G +I +D DD W ++ + +AF Y + Sbjct: 67 SNEKNYGAGMSRNIAISKAKGRFIAFLDSDDFWHKEKIKKQIDFMLSN-NYAFTYTSY-- 123 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 Q +S + P S + K N+IG + +F ++ QD ++ Sbjct: 124 -QKVSFSGKLLSQINPVSKVNYSELLKTNVIGCLTAVYDTSSLGKMFMPTIRKRQDMALW 182 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 L ++ + + + E + + ++ K + FYRK+ Sbjct: 183 LSILDKIDYAYCLNETLAYYRVGTDTLS-SNKLKIIFSQWAFYRKY 227 >UniRef50_B8HRA4 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HRA4_CYAP4 Length = 398 Score = 204 bits (520), Expect = 2e-51, Method: Composition-based stats. Identities = 56/257 (21%), Positives = 97/257 (37%), Gaps = 12/257 (4%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ +P +N + + + S+L Q YSN+E++++DD ST + A DPR Sbjct: 11 EELPLVSVILPVYNGEPFIEKTLNSILTQTYSNFEVLVIDDGSTDQTETVVTSFAQRDPR 70 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK-QQLVTHAFLY 120 I + N+G A RN I A+GE+I ID DD W P+ L + + +Y Sbjct: 71 IKLLKQ-ANAGVAAARNFGIQSAKGEFIAPIDADDLWFPDFLEKQVRCMLNSDSSVGLVY 129 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII-GNQVFTWAWRFK-ECLFDTELK- 177 A + L I + V + +D+ LK Sbjct: 130 AWLIKIDQNGSPTGDFCASRIEGDVYPTLLCHDFIANASCVLIRRSCLQCVGGYDSSLKA 189 Query: 178 ----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT--SSPKKFSGYFHFYRKH 231 +D+D++LR+ + + V E M S + + R+ Sbjct: 190 RNGQGCEDWDLYLRIAELH-QIRVVPEFLVGYRQRSDSMSCNYQSMAQSRELIWETIRQK 248 Query: 232 KDKFDRASKKYQLFTLY 248 + + + Y Sbjct: 249 YPNIPAIIYRLSMSSFY 265 >UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=Q8YUP7_ANASP Length = 343 Score = 204 bits (520), Expect = 2e-51, Method: Composition-based stats. Identities = 59/261 (22%), Positives = 106/261 (40%), Gaps = 27/261 (10%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHN 67 S+ +P +N + ++SVL Q Y + E+IIVDD ST D RI IH Sbjct: 4 SVIIPVYNSESSVAETLRSVLAQTYRDLEIIIVDDGSTDKSI--DICKQFQDERIRIIHQ 61 Query: 68 DINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVCQ 127 N G RN I A+GEY+ +D DD W P +L+ L H ++ ++ + Sbjct: 62 Q-NRGLAGARNTGIRQARGEYLAFVDSDDLWLPEKLAKHLEHFERSPEVGVSFSRSSLID 120 Query: 128 GEVYSQPASLPLYPK-SPYSRRLFYKRNIIGN--QVFTWAWRFKEC-------------L 171 + +P + PK + + + RN I N V Sbjct: 121 DQ--GKPLGIYQMPKLTDITPEYLFCRNPISNGSSVVIRRAVLDTIKFQENLYGEVEDFY 178 Query: 172 FDTELKAAQDYDIFLRMVVE-YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 FD + ++D + +LR+ ++ + + EA + +N G + + K++ + K Sbjct: 179 FDDSFRQSEDIECWLRIALQTTWKIEGIPEALTLYRVNMGGLS-ANVLKQYESWERILVK 237 Query: 231 ----HKDKFDRASKKYQLFTL 247 + + +R + + + L Sbjct: 238 TQAYNPEFIERWGNRAKAYQL 258 >UniRef50_B3E8X3 Glycosyl transferase family 2 n=1 Tax=Geobacter lovleyi SZ RepID=B3E8X3_GEOLS Length = 857 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 60/217 (27%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTA 56 + PLIS+ MP +N + AI SV+ Q Y WE+ IVDD ST E LQ+Y + Sbjct: 331 LSTQPLISVVMPVYNTPLRYLKEAIDSVIAQVYKTWELCIVDDASTDSEIKGLLQEYERS 390 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D RI + + N N LA GEYI +D DD T + L+ + Sbjct: 391 --DLRIKVVFRENNGNISTATNSGFELAAGEYIALLDHDDLLTWDALAEVVLAINADPDS 448 Query: 117 AFLYAN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDT 174 LY++ D + + S+P + K +S F + +G+ VF K D Sbjct: 449 DILYSDQDKIDEFGALSEP-----FFKPDWSPDYFCRVMYVGHLLVFRRNLLDKVGGCDP 503 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGE 211 QDY++ LR+ + + G Sbjct: 504 RFDKVQDYELMLRLSEVARKIHHIPRILYHWRTLEGS 540 Score = 88.4 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 46/226 (20%), Positives = 77/226 (34%), Gaps = 34/226 (15%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSW------EQLQQYVTAL 57 PL+SI +PT + Q R ++S+ R Y N+E+I++D+ +T + + Sbjct: 593 PLVSIIIPTKDAPQHIGRCLESIYTRSTYPNYEVIVIDNNTTDPVARHILDNTPAKLVKF 652 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQLVTH 116 D + Y N + A GEY+ +++D E TP + L +Q Sbjct: 653 ID-KFNY---------SRANNIGVQEAAGEYVILLNNDTEVITPRWIEFLLFGLEQKDVA 702 Query: 117 A----FLYANDYVCQGEVY-------SQPASLPLYPKSPYSRRLFYKR---NIIGNQVFT 162 A LY ++ V V Y+ L R + + Sbjct: 703 AVGSLLLYPDNTVQHAGVVLGCRGTADHVMRYFPADSDGYAGSLSCTREVSAVTAACMMC 762 Query: 163 WAWRFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILH 206 + E QD D LR+ V V +A + Sbjct: 763 RRADYLSSGSMIEFFGTHYQDVDFCLRLAVGGKRILFVPQAVLYHY 808 >UniRef50_B1JX11 Glycosyl transferase family 2 n=1 Tax=Burkholderia cenocepacia MC0-3 RepID=B1JX11_BURCC Length = 765 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 62/282 (21%), Positives = 116/282 (41%), Gaps = 18/282 (6%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALNDPR 61 PLIS+ +P +N + IR I+SV Q Y +WE+ +VDD S A D R Sbjct: 212 RPLISVLIPLYNTPEPFLIRCIESVREQLYDHWELCLVDDASPQPHVQRICERYAAQDSR 271 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y+ + N N A+ LA GE+ +D DDE + L + + + LY+ Sbjct: 272 IRYMRRETNGHIAEATNSALSLATGEFSALLDHDDELAAHALYMVVVELNKQPDLDMLYS 331 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN-IIGNQVFTWAWRFKECLFDTELKAAQ 180 + + ++ Q + KS ++ L +N ++ V+ + + F + +Q Sbjct: 332 D----EDKIDEQGKRYEPWFKSDWNYDLMLSQNAVVHLAVYRTSILREIGGFRSAFNGSQ 387 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINH---GEMQITSSPKKFSGYFHFYRKHKDKFDR 237 DYD+ LR + P ++ IL+ G + + ++ K + Y R ++ +R Sbjct: 388 DYDVTLRFSEQ-TTPERIRHIPFILYHWRAISGSVALATTEKLYP-YEAAERAIREHLER 445 Query: 238 ASKKYQLFT-----LYQIRNKRMTWR-TLLTLLSVRNGKRLA 273 + + YQ+ + ++ ++ L Sbjct: 446 TGRSATVKRQPHLGYYQVTWPVPAPEPKVAIIIPTKDKVELL 487 Score = 66.4 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 82/236 (34%), Gaps = 19/236 (8%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P ++I +PT ++ +L A+ S+L + Y N+E++IV++ S ++ + P++ Sbjct: 471 EPKVAIIIPTKDKVELLRVAVDSILEKTTYVNYEIVIVNNRSVEASTMEYFAQVQESPKV 530 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ----LVTHA 117 + D A+ N A+ + +++D + PN L + H + V Sbjct: 531 RLLDYDKPYSFAALNNWAVTQTDAPLLAFVNNDIEVIEPNWLREMVGHALRPEVGSVGAK 590 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLF-----------YKRNIIGNQVFTWAWR 166 LY N + V L +P F Y V Sbjct: 591 LLYPNGTIQHSGVVVGIGGLAGHPHVGEPGETFGYFGRAACTQRYSAVTAACVVMRREVF 650 Query: 167 FKECLFDT-EL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 + FD A D D+ +R+ A H + T+ ++ Sbjct: 651 LEVSGFDEVNFAVAFNDVDLGMRLGQAGYANVWTPRALLFHHESASLGLPTNEDRR 706 >UniRef50_B4U6C1 Glycosyl transferase family 2 n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U6C1_HYDS0 Length = 746 Score = 203 bits (518), Expect = 4e-51, Method: Composition-based stats. Identities = 70/278 (25%), Positives = 107/278 (38%), Gaps = 14/278 (5%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALND 59 K P ISI +P WN + I I+SVL Q YSNWE+ IVD S + L D Sbjct: 212 KYEPKISIVVPVWNTPKKFLIDMIESVLNQTYSNWELCIVDGNSKEKHVKETLEHYTLKD 271 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI + N G N+AI LA GEYI +D DD P L + + F+ Sbjct: 272 KRIKVKYLKENKGIAGNSNEAIALATGEYIAFLDHDDVLAPFALYEVVRAINENEDVDFI 331 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKA 178 Y+++ + + + K +S N I + E F Sbjct: 332 YSDEDKITEDGLKR---FDPFFKPDFSPDTLRSYNYITHLSVVKKELLNEVGWFREGYDG 388 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK-KFSGYFHFYRKHKDKFDR 237 +QDYD+ LR + + + + IN + + PK K Y + +D DR Sbjct: 389 SQDYDLILRCTEKAKKIVHIPKILYNWRIN--DNSVAQDPKNKMYAYDAAKKALQDHLDR 446 Query: 238 ASKKYQLFTL-----YQIRNKRMTWRTLLTLLSVRNGK 270 K ++ Y+I + ++ ++ K Sbjct: 447 VGLKGKVRDGVFLGSYKIDYDISYHHKVSIIIPNKDHK 484 Score = 73.4 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 52/249 (20%), Positives = 92/249 (36%), Gaps = 20/249 (8%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVT-ALNDPRIT 63 +SI +P + ++ + I S++ + Y N+E+IIV++ S + + Y I Sbjct: 473 KVSIIIPNKDHKEDLEKCITSIINKSTYKNYEIIIVENNSKEKKTFEYYKYLQNKYNNIV 532 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ----LVTHAF 118 + AV N A A G+ + +++D E N + L + Q+ V Sbjct: 533 LLEWKDKFNYSAVNNFASKYANGDILLFLNNDTEVINENWIEEMLMYAQRKDVGAVGAKL 592 Query: 119 LYANDYVCQGEVY-----SQPASLPLYPKSPYSR--RLFYKRN---IIGNQVFTWAWRFK 168 Y +D + G V S +P+ Y RL +N + G + F Sbjct: 593 YYPDDTIQHGGVILGIGGKVGHSHRFFPRVSYGNVGRLVVVQNLSAVTGACLMMRKDIFN 652 Query: 169 EC-LFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH 226 E FD A D DI L++ + A ++ H +P+K + Sbjct: 653 EVEGFDERYPLALSDIDICLKVREKGYLVVWTPYA-ELYHYESKSRGYEDTPEKQERFKK 711 Query: 227 FYRKHKDKF 235 K K+ Sbjct: 712 EIELFKKKW 720 >UniRef50_Q3A3M0 Glycosyltransferase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A3M0_PELCD Length = 249 Score = 203 bits (517), Expect = 5e-51, Method: Composition-based stats. Identities = 60/250 (24%), Positives = 102/250 (40%), Gaps = 4/250 (1%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +SI P N + IKS+ Q ++NWE I+VDD ST ++PR Sbjct: 3 KVSIITPCHNSAPFVLETIKSISEQTFTNWEHILVDDASTDDTWQIIQKYCSSEPRAKCF 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 NSGA RN AI AQG YI ID DD W P +L + + + + + + Sbjct: 63 RLTKNSGAALARNMAIEAAQGRYIAFIDSDDLWMPEKLEKQIEFMRYKNAYLSFSSYEKI 122 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + V S + + + K +IG + ++ ++ +++ QDY ++ Sbjct: 123 NEYGVRS---GRMVDVPNEVNYNEILKGCVIGCLTAMYDSKYLGKVYMPDIRRGQDYGLW 179 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 LR++ + + + G + +S KK + YR + S K F Sbjct: 180 LRILRRGHIAYGLNCPLALYRERSGSIS-SSKFKKAYSQWRIYRNIEKLSLICSLKNFFF 238 Query: 246 TLYQIRNKRM 255 Y+ K++ Sbjct: 239 YSYKGFKKKI 248 >UniRef50_Q7BG51 WsaE n=2 Tax=Bacillaceae RepID=Q7BG51_BACST Length = 1127 Score = 203 bits (517), Expect = 5e-51, Method: Composition-based stats. Identities = 52/275 (18%), Positives = 106/275 (38%), Gaps = 17/275 (6%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-- 57 +K P S+ +P +N + + + I SVL Q Y WE+ IVDD S S + ++ + Sbjct: 597 LKYKPKFSVILPVYNVEEKWLRKCIDSVLNQWYPYWELCIVDDNS-SKDYIKPVLEEYSN 655 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 D RI + N N A+ +A G++I +D DDE P L + Sbjct: 656 RDSRIKTVFRSNNGHISEASNTALEIATGDFIALLDHDDELAPEALYENAVLLNEHPDAD 715 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTEL 176 +Y++ + ++ K +S + IG+ ++ F Sbjct: 716 MIYSD----EDKITKDGKRHSPLFKPDWSPDTLRSQMYIGHLTVYRTNLVRQLGGFRKGF 771 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 + +QDYD+ LR+ + + + + + S K + + + + + Sbjct: 772 EGSQDYDLALRVAEKTNNIYHIPKILYSWREIETSTAVNPSSKPY-AHEAGLKALNEHLE 830 Query: 237 R-------ASKKYQLFTLYQIRNKRMTWRTLLTLL 264 R +++ + +Y +R L++++ Sbjct: 831 RVFGKGKAWAEETEYLFVYDVRYAIPEDYPLVSII 865 Score = 84.9 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 88/262 (33%), Gaps = 26/262 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 ++ PL+SI +PT + +L I+S+L + Y N+E++I+++ S E + + Sbjct: 857 EDYPLVSIIIPTKDNIELLSSCIQSILDKTTYPNYEILIMNNNSVMEETYSWFDKQKENS 916 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-----WTPNRLSVFLAHKQQLVT 115 +I I + N I A GE +++D W + L V Sbjct: 917 KIRIIDAMYEFNWSKLNNHGIREANGEVFVFLNNDTIVISEDWLQRLVEKALREDVGTVG 976 Query: 116 HAFLYANDYVCQGEV---------YSQPASLPLYPKSPYSRRLFYKRNII----GNQVFT 162 LY ++ + V + P++ SP+ + RN+ Sbjct: 977 GLLLYEDNTIQHAGVVIGMGGWADHVYKGMHPVHNTSPFISPVI-NRNVSASTGACLAIA 1035 Query: 163 WAWRFKECLFDTELKAAQ-DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 K F+ E D +I LR + + V + + H T Sbjct: 1036 KKVIEKIGGFNEEFIICGSDVEISLRAL----KMGYVNIYDPYVRLYH-LESKTRDSFIP 1090 Query: 222 SGYFHFYRKHKDKFDRASKKYQ 243 F K+ + Y Sbjct: 1091 ERDFELSAKYYSPYREIGDPYY 1112 >UniRef50_Q8YSL1 Alr3073 protein n=2 Tax=Nostocaceae RepID=Q8YSL1_ANASP Length = 324 Score = 203 bits (517), Expect = 5e-51, Method: Composition-based stats. Identities = 62/219 (28%), Positives = 105/219 (47%), Gaps = 10/219 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ +P +N ++ + I SVL Q +S+ E+I+++D ST ++ + + ++D R+ Sbjct: 2 PKISVIIPAYNAERTILETINSVLNQTFSDLEIIVINDGST--DRTVEVLQNVDDARLKV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + NS A RN I A G++I+ +D DD WTP++L + L+ Y+ Y Sbjct: 60 YSYE-NSRASGARNHGISHAVGDFISFLDADDLWTPDKLELQLSALNNHPEAGVAYSWTY 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-NQVFTWAWRFKECLFDTELKAAQDYD 183 + PLY + Y+ L G N + A FDT L++ +D+D Sbjct: 119 TIDDKGELLKPFEPLYEGNVYTDLLLANFLTNGSNPLIRKAAIASIGEFDTTLRSGEDWD 178 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 +LR+ Y P+ V + QIL+ +TS K Sbjct: 179 YWLRLA--YKWPFVVVKQHQILYRR----SVTSKSFKLQ 211 >UniRef50_D0MF25 Glycosyl transferase family 2 n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MF25_RHOM4 Length = 320 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. Identities = 52/212 (24%), Positives = 97/212 (45%), Gaps = 11/212 (5%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P +S+ + +N ++ AI+SVL QDY E+I+VDD ST + ++ V +P + Sbjct: 4 ERPTVSVVIAAYNAERWIAGAIRSVLGQDYPVLEVIVVDDGST--DATREVVAPFQEP-V 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 Y+ + N+G+ A RN I LA+GE + +D DD W P +L+ +A Q + Y + Sbjct: 61 RYVFQE-NAGSAAARNHGIRLARGELVAFLDADDLWLPGKLAAQVACLQAHPDCGWCYTD 119 Query: 123 DYVCQGEVYSQPASLPLY---PKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDT--EL 176 ++ S+ L P+ L + + + F + Sbjct: 120 AWLVDASTRSKKVRLSQLAAMPEGWVLEALLLSNFVPFSSALVRREALEAVGGFREGPDR 179 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHIN 208 + ++D+D++LR+ Y +V E ++ + Sbjct: 180 RISEDWDLWLRIAAHY-PVCRVAEPLVLIRDH 210 >UniRef50_A6H2F4 Glycosyl transferase, group 2 family protein n=2 Tax=Flavobacteriales RepID=A6H2F4_FLAPJ Length = 257 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. Identities = 74/258 (28%), Positives = 111/258 (43%), Gaps = 4/258 (1%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 N L+SI PT+N +Q AIKSV Q YS+WE+IIVDDCS D RI Sbjct: 2 NELVSIITPTYNSEQFIAEAIKSVQNQSYSHWEIIIVDDCSKDKTVNIIQNFIDEDHRIY 61 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I D NSGA RN AI A+G YI +D DD W P++LS + Q+ F ++ Sbjct: 62 LIQLDKNSGAGVARNNAINNAKGRYIAFLDSDDLWKPDKLSKQIKFMQRH-NIPFTFS-F 119 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYD 183 Y C E PK +LF+ N IGN + + + + ++ QD+ Sbjct: 120 YDCINEKGDLLNKRIEAPKKLSYYQLFF-CNFIGNLTGIYDSNYFGKMAISSIRKRQDWI 178 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++L ++ + V E+ + I + + + YRK S Sbjct: 179 VWLTILKKIKTAKPVPESLALYRIRENSISASKVS-LLKDNYAIYRKFHHLNIIVSLACM 237 Query: 244 LFTLYQIRNKRMTWRTLL 261 L L+ + + ++ Sbjct: 238 LGFLFTQLLVKPKFVKII 255 >UniRef50_C4KBW6 Glycosyl transferase family 2 n=1 Tax=Thauera sp. MZ1T RepID=C4KBW6_THASP Length = 300 Score = 203 bits (516), Expect = 6e-51, Method: Composition-based stats. Identities = 59/251 (23%), Positives = 101/251 (40%), Gaps = 10/251 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +S+ +P +N RA+ SVL QD++++E+I+VDD ST + + + D Sbjct: 4 TTEPTVSVVIPAFNAAWCIRRAVDSVLAQDFTDFELIVVDDGST--DDTAEVLRGYGDV- 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + + N G RN I LA+G YI +D DD W P +L +A Q AF A Sbjct: 61 LRLVSQP-NGGMSRARNAGIRLARGRYIAFLDADDRWLPAKLGRQVALLDQRPELAFCAA 119 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRFKE-CLFDTELK 177 + + + +F +I + V + FD L Sbjct: 120 TASLEDPDGKPVGIWACQGDGTASVAEVFAAHALIAGGASSVLARREPVQALGGFDEALF 179 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 A+D D+++R+ G + EA ++ G + + + +G RK++ Sbjct: 180 GAEDTDLWIRLAACGGFA-CIPEALVVVLKRPGSVS-RNRARMRAGALAMTRKNRHLLPA 237 Query: 238 ASKKYQLFTLY 248 + LY Sbjct: 238 DKQGAYWRALY 248 >UniRef50_Q116A2 Glycosyl transferase, group 1 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116A2_TRIEI Length = 1991 Score = 203 bits (516), Expect = 7e-51, Method: Composition-based stats. Identities = 65/283 (22%), Positives = 108/283 (38%), Gaps = 17/283 (6%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPRI 62 P IS+ MP ++ + + I SVL Q Y NWE I DDCS++ + + A D RI Sbjct: 994 PKISVVMPVYDPPLKFLHQGISSVLNQVYQNWEFCIADDCSSNPKIREILTERAKTDSRI 1053 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 N A N A LA G+++ +D+DDE TP+ L + Q FLY++ Sbjct: 1054 KLTFRSENGNISAATNSAAELATGDFLLFLDNDDELTPDALGEVALYISQNSEIDFLYSD 1113 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQD 181 D ++ ++ K YS L IG+ F K F + +QD Sbjct: 1114 D----DKISTEGKRFDPQFKPEYSPELLLSYMYIGHLCVVRKEIFDKIGGFRIGFEGSQD 1169 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 YD LR + + G I+ K + + ++ R Sbjct: 1170 YDFALRATEISRQVGHLPLVLYHWRTTPGSTAISGGEKP-KSFLAGQKALQESLIRRGVA 1228 Query: 242 YQLFTL-YQIRNKRMTWRT--------LLTLLSVRNGKRLADG 275 ++ + +R K ++ + ++ +N +L Sbjct: 1229 GNIYQPDWAVREKLGIFKPTFPDQGDSVTIIIPTKNQVKLLKA 1271 Score = 80.7 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 50/282 (17%), Positives = 101/282 (35%), Gaps = 31/282 (10%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-----DPR 61 ++I +PT N+ +L ++S+ + Y N+++ ++D+ S E L +Y+ L + Sbjct: 1256 VTIIIPTKNQVKLLKACVESLRKTTYQNYQIFVIDNESDEPETL-EYLAGLKCLYSGEDN 1314 Query: 62 ITYIHNDINSG---ACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ----L 113 I G A+ N+A+ + EYI +++D + +P LS + + + Sbjct: 1315 ILVFPMKNTDGKFNFAAINNRAVEQVKTEYILFLNNDTEIISPYWLSQMMGYAKIPGVGA 1374 Query: 114 VTHAFLYANDYVCQGEVYS--------QPASLPLYPKSPYSRRLFYKRNII----GNQVF 161 V +Y + + V L Y + F +N + Sbjct: 1375 VGAKLIYPDKRIQHAGVIHGLHHGLAGHAFKLLHSENRGYLSQAFVSKNYSAVTAACMLT 1434 Query: 162 TWAWRFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 + FD E A D D R++ +A +++H + +P+ Sbjct: 1435 PRKLFLELGGFDEENFAVAYNDADYGYRLLKSGYRSVYCADA-ELIHKEGTSRGYSDNPQ 1493 Query: 220 KFSGYFHFYRKHKDKF--DRASKKYQLFTLYQIRNKRMTWRT 259 + + Y Y + D F S + F + R Sbjct: 1494 EEANYRRKYSQMIDGFYSPHLSLESDCFQIQPRRYFIRNLEV 1535 >UniRef50_C2DBA9 Family 2 glycosyl transferase n=22 Tax=Enterococcus faecalis RepID=C2DBA9_ENTFA Length = 715 Score = 203 bits (516), Expect = 7e-51, Method: Composition-based stats. Identities = 60/280 (21%), Positives = 110/280 (39%), Gaps = 19/280 (6%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--DP 60 P ISI MP +N + + I S+L Q Y+NWE+ + DD ST +++ +T D Sbjct: 181 QPKISIAMPVYNVEEKWLRLCIDSILNQVYTNWELCMADDASTDP-NVKKILTEYQQLDE 239 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + + N N A+ +A GE++ +D+DDE N + + +Y Sbjct: 240 RIRVVFREQNGHISEATNSALAIATGEFVALLDNDDELAINAFYEVVKVLNENPELDLIY 299 Query: 121 AN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKA 178 ++ D + S PA K +S L N I + V+ + + F + Sbjct: 300 SDEDKIDMDGNRSDPA-----FKPDWSPDLLLGTNYISHLGVYRRSILEEIGGFRKGYEG 354 Query: 179 AQDYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QDYD+ LR + + + + + + K + + R +D Sbjct: 355 SQDYDLVLRFTEKTTKERIKHIPKVLYYWRMLPTSTAVDQGSKGY-AFEAGLRAVQDALV 413 Query: 237 RASKKYQLFT-----LYQIRNKRMTWRTLLTLLSVRNGKR 271 R LY + + + + ++ +NG + Sbjct: 414 RRGINGHATHGAANGLYDVYYDIKSDKLVSIIIPTKNGYK 453 Score = 84.9 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 53/228 (23%), Positives = 91/228 (39%), Gaps = 21/228 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALND 59 +K++ L+SI +PT N + R + S++ + Y N+E+I+ D+ ST + + Y Sbjct: 436 IKSDKLVSIIIPTKNGYKDVQRCVSSIIEKTTYQNYEIIMADNGSTDPKMHELYAEFEQQ 495 Query: 60 --PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ---- 112 R DI + N+A+ A GEY+ +++D E T N L++ ++ QQ Sbjct: 496 LPGRFFVESIDIPFNFSTINNRAVKKAHGEYLLFLNNDTEVITENWLTLMVSFAQQERIG 555 Query: 113 LVTHAFLYANDYVCQ-------GEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQVF 161 V LY N+ V G V + Y RL N + Sbjct: 556 CVGAKLLYPNNTVQHAGVILGLGGVAGHGHYGYPHGDLGYFGRLAINVNYSAVTAACLLM 615 Query: 162 TWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHIN 208 A F+ A D D+ L++ + + EA ++ H Sbjct: 616 KKADFDAVGGFEEAFTVAFNDVDLCLKVQALGRDNVWLHEA-ELYHFE 662 >UniRef50_B3E7E3 Glycosyl transferase family 2 n=1 Tax=Geobacter lovleyi SZ RepID=B3E7E3_GEOLS Length = 663 Score = 202 bits (515), Expect = 9e-51, Method: Composition-based stats. Identities = 72/273 (26%), Positives = 120/273 (43%), Gaps = 17/273 (6%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +P +S+ MP +N ++ AI S+L Q + ++E+IIVDD ST + + +D RI Sbjct: 3 SPRVSVLMPVYNGERFLREAIDSILGQSFHDFELIIVDDGSTDSSAAC--IASYHDTRII 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ-QLVTHAFLYAN 122 + N+IN G RN+ I LA+GEYI +D DD T RL+ +A A Sbjct: 61 SVKNEINQGIVVTRNRGIELARGEYIALMDCDDISTEERLAEQVAFLDTDQSVAAVAACI 120 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKR-NIIGN-QVFTWAWRFKECLFDTELKAAQ 180 + V + + + +P R F R N + N + A ++ + A + Sbjct: 121 ECVDENGTFLSGWADDRSTVTPDQIRAFLPRANCVANSSLMIRAKILRQYRYRGGRDAVE 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+ LR+V + + K+ I + +S+ + + R K+ F R + Sbjct: 181 DYDLLLRLVSDGLQVAKIPAVLLYYRIVNS--SFSSAGRSLPPEYRHLRA-KNYFLRHA- 236 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 L R + ++LL R+ RLA Sbjct: 237 ------LVNWRLNSFCLQVSVSLL--RDAARLA 261 >UniRef50_B4S4G2 Glycosyl transferase family 2 n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S4G2_PROA2 Length = 334 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 5/228 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+++ MP +N ++ A++S+L Q Y N+E +++DD ST V + D RI Sbjct: 3 PLVTVLMPVYNGEKHLADAVRSILDQTYRNFEFLVMDDGSTDKSL--DIVKSFEDERIIV 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 +D NSG N+ I LA+G+YI +D DD P+RL + + + + Sbjct: 61 KRSDRNSGIAKTLNKGIALARGKYIARMDCDDISLPDRLRRQADYLETHSDTGLIGSGIQ 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDI 184 + + + P + L ++ + A ++ +D EL AQDY + Sbjct: 121 KIKKSRKQKVFTWPS-KDTEIKLDLLFQSAFFHPTIMARASLLRKIGYDEELLYAQDYAL 179 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 + ++ E + EA + H E K+ + +++ Sbjct: 180 WTKLAPETAFA-NLPEALLLYR-THDEQVTKKKGKQQAAIARLVKENY 225 >UniRef50_B6ARX8 Putative glycosyl transferase, family 2 n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6ARX8_9BACT Length = 1023 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 58/243 (23%), Positives = 106/243 (43%), Gaps = 12/243 (4%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALNDPR 61 NPLIS+ +P +N + ++AI+SV Q Y +WE+ I D+ ST + + D R Sbjct: 466 NPLISVILPVYNTPKEWLVKAIESVRNQIYPHWELCISDNASTLPHIKMVLDEYSRIDSR 525 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I ++ + N A N A+ LA GE+I +D DDE + L + +Y+ Sbjct: 526 IHVVYRETNGHISANSNTALTLASGEFIALLDSDDELPEHALFWVAHEINRHPNVDLIYS 585 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKAAQ 180 + + ++ + Y K ++ L +N + V+ ++ K F + +Q Sbjct: 586 D----EDKISEEGIRFNPYFKPDWNPALMLSQNTFSHLGVYRYSLVKKVGGFREGFEGSQ 641 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILH----INHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 D+D+ LR E P ++ +IL+ I+ + +K + R ++ D Sbjct: 642 DHDLVLRCSEE-TTPERIRHIPRILYHWRAISGSTASASGVEEKPYAWTAGKRSIEEHLD 700 Query: 237 RAS 239 R Sbjct: 701 RLG 703 Score = 83.0 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 90/259 (34%), Gaps = 43/259 (16%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALN----- 58 P +SI MP+ + L I S+L + Y ++E+++V E Sbjct: 727 PKVSIIMPSACKLHLLKPCIDSLLSRTSYPDFEVLLV-----VNEIRYAVPEQAEFLNGI 781 Query: 59 --DPRITYIH---NDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSV--- 105 DPR+ + N + N AI ++G ++ ++DD D+W +L V Sbjct: 782 GADPRVKVLVYGDQPFNY--SKLNNWAIAQSKGSFLCLLNDDIEVITDDWL-EKLVVRTR 838 Query: 106 ---------FLAHKQQLVTHAFLYANDYVCQGEVYSQ-PASLPLYPKSPYSRRLFYKRNI 155 L + + HA + G + + P P Y + + + Sbjct: 839 ITGVGVVGPLLLYPNNRIQHAGVILGLGGVSGHQFVEMPKGYPGYFRRAENEQDL-SCVT 897 Query: 156 IGNQVFTWAWRFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQI 214 V ++ F+ +L A D D+ +R+ E ++ H + Sbjct: 898 AACMVLRREVFYEVGGFNEDLAVAFNDVDLCIRIRKAGWRILWTPEV-ELYHHESASIGK 956 Query: 215 TSSPKK---FSGYFHFYRK 230 SP++ F + RK Sbjct: 957 HDSPERAILFQAEVDYMRK 975 >UniRef50_B3PFZ6 Glycosyl transferase, putative, gt2E n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PFZ6_CELJU Length = 684 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 64/279 (22%), Positives = 110/279 (39%), Gaps = 14/279 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDPR 61 P IS +PT+N AI S+ Q Y NWE+ I DD ST E D R Sbjct: 134 QPCISFVLPTYNSSIPWLKEAIDSLKAQPYQNWELCIADDASTHREVRDFLAQQQSQDSR 193 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + + N N A +A G++I D DD P L L +Y+ Sbjct: 194 IKVVFREKNGHISESSNSAAEIASGDWIALFDHDDLLHPFALYWILQAINNNPDAQLIYS 253 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKAAQ 180 + + ++ Q Y K ++ LF +N + F++ F + +Q Sbjct: 254 D----EDKIDEQGNRHSPYFKPDWNYDLFLSQNCFSHLGLIRRSLFEKIHGFRKGYEGSQ 309 Query: 181 DYDIFLRM--VVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 D+D+ LR +VE + + + ++ ++ K ++ + +D R Sbjct: 310 DHDLILRAIELVEPTQIIHIPKVLYHWRVHADSTAKSTDSKPYAA-IAGEKAIQDHLLRI 368 Query: 239 SKKYQLF---TLYQIRNKRMTWRTLLTL-LSVRNGKRLA 273 + K Q+ Y+++ K L++L + RNG +L Sbjct: 369 NAKAQVSFEGYGYRVKYKLPDNPPLVSLIIPTRNGLQLI 407 Score = 98.4 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 54/261 (20%), Positives = 86/261 (32%), Gaps = 29/261 (11%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 N PL+S+ +PT N QL + I S+ Q Y N+E+I+VD+ S E L + T N P Sbjct: 390 NPPLVSLIIPTRNGLQLIRQCIDSIQ-QKTTYPNYEIIVVDNGSDDPEALTYFDTLKNTP 448 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL 119 R I ++ + N A+ + GE I I++D + P L+ + H L Sbjct: 449 RFKVIRDNRPFNYSQLNNLAVKHSDGEIIGLINNDVEVIRPEWLNEMVVH--ALRPGVGA 506 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-----------------NQVFT 162 + E + LF + G V Sbjct: 507 VGAKLLFPDERLQHGGVVLGIGGVANHAHLFIRGTHHGYFARASVTQQFSAVTAACLVIR 566 Query: 163 WAWRFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 A + D + A D D +R+ + A H E + Sbjct: 567 KAIYQEVDGLDEQNLAVAFNDVDFCIRVTNKGYRNIWTPYALLY----HHESATRGADIA 622 Query: 221 FSGYFHFYRKHKDKFDRASKK 241 F R+ + K+ Sbjct: 623 PEKRERFVREVEYMMSTWKKE 643 >UniRef50_Q2SD88 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SD88_HAHCH Length = 343 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 66/289 (22%), Positives = 122/289 (42%), Gaps = 40/289 (13%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDPRIT 63 PL+S+ MP +NR++ +AI+SVL Q ++++E+I+VDD ST + V D R+ Sbjct: 13 PLVSVIMPVYNREKTVAQAIESVLAQTFADFELIVVDDGSTDRS--AEIVNGFTQDARVR 70 Query: 64 YIHNDINSG-ACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL----VTHAF 118 Y NSG RN + LA+G+++ +D DD WTP +L +A +Q V+ Sbjct: 71 Y-RLQENSGRPSLARNSGLKLARGQWVAFLDSDDRWTPAKLERQIALLEQCAAQGVSLDL 129 Query: 119 LYANDYVCQGEVYSQPASLP--------------------LYPKSPYSRRLFYKRNIIGN 158 + ++ V + +V P+ Y + P+ + L+ Sbjct: 130 VISDYEVMENDVLKHPSFFKAYNVNQRLAAAVEHTFPDGWTYRQRPFLQALYGLGFAATQ 189 Query: 159 QVFTWAWRF-KECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS 217 V + FD+EL A+D D+++ + + + +++HG+ + Sbjct: 190 AVLVRRSLLERVGGFDSELVFAEDNDLWMTISEQG--RVGCSKGIAYTYVHHGDNITSVK 247 Query: 218 PKKFSG--------YFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWR 258 +F + RK + ++Y + L RN+ R Sbjct: 248 SDRFYTDTINVLFKHLATARKLGVPMEPLKERYANYYLSLCRNRLREQR 296 >UniRef50_A6C910 Truncated O-antigen biosynthesis protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C910_9PLAN Length = 903 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 65/285 (22%), Positives = 114/285 (40%), Gaps = 19/285 (6%) Query: 3 NNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTS---WEQLQQYVTALN 58 +PLISI +PT+N +++ I+SVL Q YSNWE+ I DD ST + + +Y + Sbjct: 373 KSPLISIILPTYNTKEKILRACIESVLAQTYSNWELCIADDASTKSRVRDVINEY--SKQ 430 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + N A L +G+YI+ +D DDE N L + + F Sbjct: 431 DSRIKSVFRTENGHISEAMISAAELMEGDYISFLDHDDELNKNALLFIVDAINRSPESEF 490 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELK 177 Y++ + + + K +S L +N IG+ + + + Sbjct: 491 FYSD----EDHINEHGKHQSPFFKPDWSPSLLCSQNYIGHFLCLSKSLYERVGGIRRGFD 546 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF------SGYFHFYR-K 230 AQDYD+ LR + + + + S K + + F K Sbjct: 547 GAQDYDLVLRAGDAAENVYHIPKVLYHWREHENSTSSNSECKPYAHDAGKAAVADFLNQK 606 Query: 231 HKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 + +F + + LFT Y + + + + ++ ++ L D Sbjct: 607 YGSRFIKVNDGEGLFT-YSPQFRFDSEHRVSIIIPTKDKIDLLDD 650 Score = 89.1 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 50/236 (21%), Positives = 93/236 (39%), Gaps = 27/236 (11%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +SI +PT ++ L I+S+ + + NWE+IIVD+ S + + T + D RI + Sbjct: 635 VSIIIPTKDKIDLLDDCIESIRNRSSHINWEIIIVDNRSEETASKEYFSTVVQDSRIKVV 694 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQ-LVTHAF 118 D+ + N A G+ +++D D +L+ + + LV Sbjct: 695 EADVEFNWSMINNIGAKAATGDVFVFLNNDTLVITPDWI--EKLASMASLPEVGLVGPQL 752 Query: 119 LYANDYVCQGEV---------YSQPASLPLYPKSPYSRRLFYKRNII----GNQVFTWAW 165 LY ++ + V + LP++ P+ + RN++ QV A Sbjct: 753 LYEDNTIQHAGVVVGMGGWADHVFKNQLPVHRSGPFVSPML-NRNVLAITGACQVIERAK 811 Query: 166 RFKECLFDTELKAAQ-DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 + FD + D DI +R + + A LH + + + PK+ Sbjct: 812 FEQLGGFDEQFIICGSDVDICIRAHQQGLQNVYC--ADAALHHLESKSRSSFIPKQ 865 >UniRef50_A6BHD3 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BHD3_9FIRM Length = 410 Score = 201 bits (512), Expect = 2e-50, Method: Composition-based stats. Identities = 57/237 (24%), Positives = 96/237 (40%), Gaps = 16/237 (6%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NP ISI +P +N ++ AI+SVL Q Y+N+E+IIVDD ST + + A +D RI Sbjct: 2 NPKISIILPNYNNEKYLKEAIESVLNQTYTNFELIIVDDASTDGS--LKIIRAFDDKRIK 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I +++N N I A GEYI ID DD W +L + + + + Sbjct: 60 VITSEVNRHVAYASNLGIKYASGEYIAKIDSDDIWENQKLEKQIEFMENHKEYGGCFTKV 119 Query: 124 YVCQGE----------VYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKE--C 170 + + ++ + + R + N + N F++ Sbjct: 120 NIIDEKSEDANIKYKVIFDLFEKAKNQSQKEWLRFFLSEGNCLCNSSSLIRKCIFEKIDG 179 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF 227 F+ A+DY++++R V+ Y + ++E S K Y Sbjct: 180 FFNLAYVGAEDYELWVRTVIRY-PIYVLDERLVRYRWEEDSENKISGLTKEKVYAAI 235 >UniRef50_A4A6G7 Glycosyltransferase n=1 Tax=Congregibacter litoralis KT71 RepID=A4A6G7_9GAMM Length = 291 Score = 201 bits (511), Expect = 2e-50, Method: Composition-based stats. Identities = 61/215 (28%), Positives = 95/215 (44%), Gaps = 6/215 (2%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ +P +NR +A+ SV+ QDY ++E+I+VDD ST + + V AL P Sbjct: 6 NHMPLVSVVIPCYNRADRIAQAVHSVVDQDYPSFEIIVVDDGST--DNTEAVVAALEIPT 63 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + Y+ N GA + RN I AQGEYI D DD W+ ++L+ + A Sbjct: 64 LRYLRLQENRGANSARNVGIREAQGEYIAFQDSDDLWSASKLTTQIKQMLTHNAKVCFCA 123 Query: 122 NDYVCQGEVYSQP-ASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW--RFKECLFDTELKA 178 + +G P AS + + + I Q LFD LK Sbjct: 124 FNRNDRGVKTRVPKASYHVQHGCVDRHAELLRGSYISCQTLIARKETLLSVGLFDESLKR 183 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 QD+++ LR+ + VEE I+ + Sbjct: 184 LQDWELCLRLAQD-NPILYVEEVLVEADISDDSVS 217 >UniRef50_C7GBF7 Glycosyltransferase n=1 Tax=Roseburia intestinalis L1-82 RepID=C7GBF7_9FIRM Length = 274 Score = 201 bits (511), Expect = 2e-50, Method: Composition-based stats. Identities = 59/204 (28%), Positives = 90/204 (44%), Gaps = 12/204 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK +SI +PT+NR +AI+SVL Q Y ++E+I+VDD ST ++ ++ VT+ D Sbjct: 1 MKQTEPVSIILPTYNRANQIGKAIESVLAQTYEDFELIVVDDGST--DETEKVVTSYKDC 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ------LV 114 RI Y N G RN + LA+ +Y+ D DD W P +L + + +V Sbjct: 59 RIRYHRMPENGGQSRARNCGMKLARYDYLAFEDSDDLWRPGKLKAQMEAMRNAGTNVGMV 118 Query: 115 THAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW--RFKECLF 172 H F Y D + +P KS N++G Sbjct: 119 YHKFRY--DLGAGRGMVLPDEKIPYEKKSGDIYAQLLWDNLVGIPTMLLKKECVEAVGGL 176 Query: 173 DTELKAAQDYDIFLRMVVEYGEPW 196 D L+ +DYD LR+ +Y + Sbjct: 177 DESLRCLEDYDFALRIAKKYQAIF 200 >UniRef50_Q72QG5 Glycosyltransferase n=2 Tax=Leptospira interrogans RepID=Q72QG5_LEPIC Length = 298 Score = 201 bits (511), Expect = 3e-50, Method: Composition-based stats. Identities = 60/243 (24%), Positives = 104/243 (42%), Gaps = 9/243 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NPL+S+ +PT+N ++ SV+ Q Y NWE I++D+ S + + V++ + ++ Sbjct: 2 NPLVSVVIPTYNHSDFLKLSLASVINQTYLNWEAIVIDNHSN--DNTDEVVSSFGNSKVR 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 N RN I A+G++I+ +D DD W PN+L + Q+L + ND Sbjct: 60 LTKIKNNGVISVSRNLGIKEAKGDWISFLDSDDLWFPNKLERVVFEIQKLENQIDVLCND 119 Query: 124 YVCQGEVYSQPASLPLYP-KSPYSRRLFYKRNIIGNQVFTWAWRF---KECLFDT--ELK 177 + +L P + + R++ + N + T +F KE LF+ E Sbjct: 120 EYMVHLNQKKKITLNYGPFEDDFYRKMLFYGNRLSTSATTVRKKFLKEKELLFNENQEFV 179 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 +DYD +LR+ E + E I H Q + + + + R H Sbjct: 180 TVEDYDFWLRLAKENARFLFIPEVLGEYTI-HNSNQSAALERHLNHLENLVRYHVFHIQE 238 Query: 238 ASK 240 K Sbjct: 239 FEK 241 >UniRef50_B8I318 Glycosyl transferase family 2 n=2 Tax=Clostridium RepID=B8I318_CLOCE Length = 280 Score = 201 bits (511), Expect = 3e-50, Method: Composition-based stats. Identities = 64/284 (22%), Positives = 119/284 (41%), Gaps = 17/284 (5%) Query: 1 MKNN---PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL 57 M NN P IS+ MP +N +A+ SVL Q ++++E+I VDD S+ ++ Sbjct: 1 MNNNIIFPKISVLMPVYNTSNYVSKALDSVLAQTFTDFEIIAVDDGSSDNSMEILKDYSI 60 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 D R+ N+ N G N A+ A + +D DD +RL + Q+ Sbjct: 61 KDSRVKVFQNEENRGVSYSLNHALRHASAPLVARMDSDDIMVKDRLEKQYTYMQKNPDCV 120 Query: 118 FLYAND-YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF---KECLFD 173 L + Y+ + + + PL +R F++ I + + + FD Sbjct: 121 VLGGQETYIDENDKITGNTKYPL--SDAEIKRKFFQFQPIADPTSMYNRLKIPPEVFYFD 178 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 L A+ D++ R+ +YG+ +E++ + G + T + F + RK Sbjct: 179 ESLTVAEGLDLYFRL-FKYGKFANLEDSIILYRQRQGSLFSTDIKRTFK-FISMVRK--- 233 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 RA K+Y + + + + + ++L +R +L D I+ Sbjct: 234 ---RAKKQYGIKAPFTAGIINFSQKIITSILPLRVAVKLNDIIK 274 >UniRef50_A3W1Y5 Glycosyltransferase n=1 Tax=Roseovarius sp. 217 RepID=A3W1Y5_9RHOB Length = 300 Score = 200 bits (510), Expect = 3e-50, Method: Composition-based stats. Identities = 62/241 (25%), Positives = 100/241 (41%), Gaps = 12/241 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL+S+ +P +NR+ A+ SVL QDY E+I VDD ST + +++DPR+ Sbjct: 5 SPLVSVIIPCFNRETTVHEAVLSVLTQDYDPIELIAVDDNSTDG--TIAVLESIDDPRLR 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 +HN + G RN I + +I D DD W P +L + + Q Y Sbjct: 63 ILHNSGDRGPSQARNHGIRSSTAPWIAFQDSDDIWLPGKLKRQMDYLQD-SDFVAAYCGM 121 Query: 124 YVCQGEVYSQPA--SLPLYPKSPYSRRLFYK---RNIIGNQ--VFTWAWRFKECLFDTEL 176 + + P P SP + N I Q V + FD L Sbjct: 122 LIKKDTHPETPVKYRRPAPETSPLEGDILPSLALENYISTQMLVVRHDILDQIGGFDEAL 181 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 A D+++ LR+ G +++ + ++ + SS K+ + H +KH D Sbjct: 182 PALVDWELMLRVA-SLGPVAFIDDDLVVQRMSENSLT-HSSKKRLAAQEHILKKHYDLLG 239 Query: 237 R 237 R Sbjct: 240 R 240 >UniRef50_B0TYV9 Putative teichuronic acid biosynthesis glycosyl transferase n=1 Tax=Francisella philomiragia subsp. philomiragia ATCC 25017 RepID=B0TYV9_FRAP2 Length = 247 Score = 200 bits (510), Expect = 3e-50, Method: Composition-based stats. Identities = 65/249 (26%), Positives = 107/249 (42%), Gaps = 6/249 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI P++N ++ I+SV+ Q Y NWEMIIVDDCST D RI I Sbjct: 4 LVSIITPSYNSEKYISNTIESVITQTYQNWEMIIVDDCSTDSSCDIAEKYTQKDSRIKLI 63 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + NSG RN+AI A+G YI+ +D DD W +L + ++ A ++ Y+ Sbjct: 64 KLNKNSGPAKARNRAIKEAKGRYISFLDSDDIWYHRKLEKQVYFMKEN-DLAVTCSSYYI 122 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + L + K N IGN + ++ ++ +DY ++ Sbjct: 123 VDAKGKKNKTRLV---REKILYSDMLKSNHIGNLTGIYDCIKLGKIYADDV-GHEDYTLW 178 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L++V + + E + + + ++ K S ++ YR D S Y + Sbjct: 179 LKVVQKSKTVNTISEPLAEYRVLNNSIS-SNKFKTLSWQWNIYRNILDLNVFRSSYYFVC 237 Query: 246 TLYQIRNKR 254 +Y KR Sbjct: 238 YVYYALKKR 246 >UniRef50_A3W9Y7 Glucosyltransferase n=1 Tax=Erythrobacter sp. NAP1 RepID=A3W9Y7_9SPHN Length = 328 Score = 200 bits (509), Expect = 5e-50, Method: Composition-based stats. Identities = 61/287 (21%), Positives = 117/287 (40%), Gaps = 27/287 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ MP ++ ++ A++SVL Q ++++E+I VDD T V +DPRI Sbjct: 4 PKISVVMPIYHVEKFVGEAVRSVLDQTFADFELICVDDGGTDASM--DIVRGFSDPRIRI 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G RN I A+GE++ +D DD W P +L++ H YA Sbjct: 62 VCQ-ANRGLAGARNTGIANARGEFVALLDSDDVWHPEKLALHYIHLMANREVGVSYAGSR 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR--------------FKEC 170 + ++ ++ + RN +GN R + C Sbjct: 121 MIDQNGNVLRVAMRPKTGRVTAKDIIC-RNPVGNGSAPVLRRSALDLAVFPHPEEPSRTC 179 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPW-KVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 FD + ++D ++++R+ V++ + ++ I G + + K++ + R Sbjct: 180 WFDESFRQSEDIEMWIRLAVKHEVVFAGIDGLLTDYRIIPGALS-ANVVKQYLSWTKMLR 238 Query: 230 KHK-------DKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNG 269 K + ++ ++ YQL L + + + L+S G Sbjct: 239 KLRGYAPEFVEQHGDTARAYQLRYLARRSVQLGNFELARDLMSKAVG 285 >UniRef50_A8RSJ6 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RSJ6_9CLOT Length = 307 Score = 199 bits (507), Expect = 7e-50, Method: Composition-based stats. Identities = 58/256 (22%), Positives = 109/256 (42%), Gaps = 9/256 (3%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQ----YVTA 56 PL++ + T+ R ++ RA+ S++ Q Y N E+ +V+DC T + ++ ++ Sbjct: 4 NKEPLVTTVITTYKRKPEIVKRALDSIVGQTYDNIEIFVVNDCPTDKKLIEDLKNMIASS 63 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D + YI + N GAC RN A+ A+G+Y +DDDDEW P ++ + + ++ Sbjct: 64 SGDRTVHYIVVEKNGGACRARNIALEKAKGKYFACLDDDDEWLPKKIELQVRALEEHPDA 123 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ---VFTWAWRFKECLFD 173 Y N + ++ + +N IG+ +F + FD Sbjct: 124 VIAYCNAVIRYADIGKETVRFTKKQNEGNLYYELIGKNNIGSCSFPMFRVDAIKRAGGFD 183 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 + A QD+D++LR++ E + V E + + G+ + Y ++K Sbjct: 184 ANMLALQDWDLYLRILKE-NKAVYVHEPVAVYYFYAGQRISAHPENRIIAYERLHQKLGK 242 Query: 234 KFDRASKKYQLFTLYQ 249 + K F L Sbjct: 243 DLENNRKSAAAFYLMG 258 >UniRef50_B4WMJ9 Glycosyl transferase, group 2 family protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WMJ9_9SYNE Length = 313 Score = 199 bits (507), Expect = 8e-50, Method: Composition-based stats. Identities = 58/213 (27%), Positives = 95/213 (44%), Gaps = 6/213 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL S+ +P +N Q ++SV +Q ++E+I++DD ST + + V+ +DPR+ Sbjct: 2 PLASVIVPVYNGAQTIHATLESVFQQTAQDFELIVIDDGST--DSTLEVVSKFDDPRLRV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-ND 123 + N+G RN+ + A G+YI+ ID DD WTP +L++ L + A Y+ + Sbjct: 60 F-SYCNAGVATSRNRGVDHASGKYISFIDADDLWTPKKLALQLEALESTPEAAVAYSWTN 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDY 182 Y+ Q + A + Y+ L + V F E FD L A D+ Sbjct: 119 YIDQEGRFIDKAQRVDFSGDVYAELLLRDFLESASNVTIRHQVFLEFGGFDLSLTGAADW 178 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 D FLR+ Y V + + M Sbjct: 179 DFFLRLAKHY-SFVAVPHLGVLYRLLSSSMSAN 210 >UniRef50_B1ZW34 Glycosyl transferase family 2 n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZW34_OPITP Length = 303 Score = 199 bits (507), Expect = 8e-50, Method: Composition-based stats. Identities = 66/301 (21%), Positives = 117/301 (38%), Gaps = 29/301 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+SI +P N ++ +++S L Q Y NWE+I VDDCST A +P Sbjct: 1 MQQRPLVSIVLPVHNGERYLRESVRSCLDQTYQNWELIAVDDCSTDGTAAMLDSFAAREP 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ + N N A N ++GE +T DD+ + PN L ++ + +Y Sbjct: 61 RMRVLRNPANLRLPASLNVGFAASRGEILTWTSDDNLYLPNALEEMVSALMEHREAMMVY 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 A + A + YP S N++ + +D L+ Sbjct: 121 AAQDIID----ESGAVVVRYPSG--SPESLCHFNVVNACFAYRRKVLETVGGYDPILELV 174 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS----------------- 222 +D+D +LR+ Y E ++E + N + + +++ Sbjct: 175 EDWDYWLRIARRY-EIIQLECCLYLYRANPASLSQQTGDRRWKVEERLLEMRLPELRWKW 233 Query: 223 --GYFH-FYRKHKDKFDRASKKYQLFTLYQ-IRNKRMTWRTLLTLLSVRNGKRLADGIRG 278 GY F R ++ R ++ + +R + R L+ + GKR+ +R Sbjct: 234 PAGYARGFIRLCHPRWQRGDRRTAWRHFGRALRYFPVALRHWRALIPLCLGKRVYYALRA 293 Query: 279 R 279 R Sbjct: 294 R 294 >UniRef50_Q74D09 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74D09_GEOSL Length = 310 Score = 199 bits (507), Expect = 8e-50, Method: Composition-based stats. Identities = 67/236 (28%), Positives = 108/236 (45%), Gaps = 9/236 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 +NPL+S+ +P+ NR L I + SV+ Q Y N E+I+VDD S +D R+ Sbjct: 4 HNPLVSVVIPSKNRPDLVIATLVSVIDQTYDNIEIIVVDDGS-DMPLAPLLKDRFSD-RV 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLYA 121 + ND + G RN+ A G+YIT +DDDD W P +L + + + V +Y Sbjct: 62 LCLRNDQSLGGAVARNRGAQSAHGDYITFLDDDDLWLPKKLEMQVDAFSKHGVDIGVVYC 121 Query: 122 N-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 D++C+ E+ P + S + F FD EL + Sbjct: 122 GFDFLCKEEIV--PRQNKFHNDYDLSIAVLSGCPFGSPTPLIRKHYFDMVGGFDRELPSC 179 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 QD+D+++R+ G + V+E+ + + HG+ T+ KK G KH D+ Sbjct: 180 QDWDLWIRLSKVCG-FYPVKESLALYRV-HGDQISTNLRKKIDGRKMVLAKHYDEI 233 >UniRef50_Q04TI3 Glycosyltransferase n=8 Tax=Leptospira RepID=Q04TI3_LEPBJ Length = 586 Score = 199 bits (507), Expect = 9e-50, Method: Composition-based stats. Identities = 62/281 (22%), Positives = 109/281 (38%), Gaps = 12/281 (4%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPR 61 PLISI +P +N ++ + ++SV Q Y NWE+I++DD S D R Sbjct: 36 EPLISILVPVYNTKRGYLEKMVRSVEMQTYKNWELILLDDASPDESPGNYLKERCETDSR 95 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y +D N G +A + GEY+ +D DD + N L + L Q+ L Sbjct: 96 IRYFRSDKNEGISLATRKAFECSAGEYVAFLDHDDRLSKNALLIVLEALQEEKNRPELLY 155 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELKAAQ 180 +D + Q ++ P L K +S N I + V ++ +Q Sbjct: 156 SDEIFQSKI---PGVFSLSTKPEFSPERLISYNYICHFVIVSKNLIYRMGGIRKGFDGSQ 212 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS----PKKFSGYFHFYRKHKDKFD 236 D++ LR ++ + ++ + +Y+ KD + Sbjct: 213 DHEFVLRACRHTDRIIRLPYFLYVWRLHGESFSKQRAQICEESSKKAILEYYKNKKDPVE 272 Query: 237 RASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 Y FT + IR R+ L++++ + L +G + Sbjct: 273 EIVPGYYPFTYHPIR--RLKPNKLISIVVLDIEAILGNGFQ 311 >UniRef50_B9M0H9 Glycosyl transferase family 2 n=3 Tax=Proteobacteria RepID=B9M0H9_GEOSF Length = 3011 Score = 199 bits (506), Expect = 9e-50, Method: Composition-based stats. Identities = 62/236 (26%), Positives = 106/236 (44%), Gaps = 16/236 (6%) Query: 1 MKNNP---------LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQ 51 M NP L SI +P++N+ + S+L QDY +WE ++V+D ST Sbjct: 482 MDRNPKEPAQDSGLLFSILVPSYNQAGFLPATLDSILAQDYPHWEALVVNDGSTDETAAV 541 Query: 52 QYVTALNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ 111 A DPRI H + N G + N+ + A+GE+I + DD + P++LS+ A Sbjct: 542 MARYAEKDPRIRTFHKE-NGGVSSALNEGLKHARGEWICWLSSDDLFEPDKLSLHAATAV 600 Query: 112 QLVTHAFLYANDYVC---QGEVYSQPASLPLYPKSPYSRRLFYKRNII-GNQVFTWAWRF 167 + F++ N ++ GE S + F++ N + G + F Sbjct: 601 KHPEVRFMHTNYFLLYDQTGEKVSVERNGEELNDPNRQLVNFFEANQVNGISIAIHRQVF 660 Query: 168 KECL-FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 + FD +L+ QD+D++LR+V Y +P ++E T + I+ + S Sbjct: 661 DQLGHFDEKLRNGQDFDMWLRIVSRY-QPLYLDERTVVTRIHPAQGTSLSVEAGIY 715 Score = 78.7 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 45/242 (18%), Positives = 82/242 (33%), Gaps = 17/242 (7%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHN 67 SI + T N+ + + S+ + E+I VD+ S + I N Sbjct: 2100 SIIILTLNQLEYTRECLSSIQKHTPEKHEIIFVDNGSKDGTVKWLRQEMKSHKNYRLIEN 2159 Query: 68 DINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL-YANDYVC 126 N G NQ + A+GEY+ +++D T L+ + N+ Sbjct: 2160 KKNLGFAKGCNQGMAAAKGEYLLLLNNDVVVTEGWLTGMRECLEASADGIVAPMTNNIAG 2219 Query: 127 QGEVYSQP-------ASLPLYPKSPYSRRLFYKRNIIG-NQVFTWAWRFKECLFDTELKA 178 ++ S P + +S + R + ++G +F + + + D Sbjct: 2220 PQQLLSVPYANMDEMQAFAAGFRSAHRHRRIPVKLVVGFCMLFRRSLADQVGMLDEAFAL 2279 Query: 179 A--QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +D D LR + + V +H +HG + F+ RK FD Sbjct: 2280 GNFEDDDFCLRASLAGLQS--VIAGDVFIH-HHGSRTFEGNRIDFNAAMAVNRK---VFD 2333 Query: 237 RA 238 R Sbjct: 2334 RK 2335 Score = 58.3 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 50/267 (18%), Positives = 88/267 (32%), Gaps = 41/267 (15%) Query: 9 IYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTA---LNDPRITYI 65 I T+NR+ + IK+++ + + +I+VD+ S E Q Y+ L D + Sbjct: 1622 IIFLTYNRRDYFEQTIKTLIERTRYPYRIIVVDNHS--REDFQGYLQKTAILYD---HLV 1676 Query: 66 HNDINSGACAVRNQAIMLAQGE-YITGIDDD-DEWTPN-----RLSVFLAHKQQLVTHAF 118 ND N + I LA + Y+ D D P+ L L ++ Sbjct: 1677 FNDDNY-FTVAFQRGIELANSDPYVV---SDPDILVPDLQGKCWLERMLDLHREYPEMGL 1732 Query: 119 LYANDYVCQGEVYSQPASLPL-YPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK 177 + + P + P P + Y I V TW K FD + Sbjct: 1733 I---------GLTLDPVNKPASLPDVILGEKTVYNDKITLANVGTWMQCIKRKYFDGKYT 1783 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 D++ ++ G+ E H+ + E + H K+ D+ Sbjct: 1784 T--DWEACEQVRRNGGKVGYAREFIA-YHLGYDEERDHPD--------HLVEKYHLFKDK 1832 Query: 238 ASKK-YQLFTLYQIRNKRMTWRTLLTL 263 Y ++T + RM + Sbjct: 1833 FGSDLYSMYTDSEEILARMGEKPAAYY 1859 Score = 52.2 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 14/92 (15%) Query: 7 ISIYMPTWNRQQLAIRAIKSV---LRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 IS+ M + RA+ SV + EMI+VD ST ++ + A Sbjct: 2617 ISLCMIAKDEAPHLARALASVKPAVS------EMIVVDTGST--DRTKDIARAFGAS--- 2665 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD 95 RN+++ A G ++ +D D Sbjct: 2666 VFDLPWPGSFALARNESLAKASGTWVLVMDAD 2697 >UniRef50_C0CGY4 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CGY4_9FIRM Length = 255 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 59/236 (25%), Positives = 103/236 (43%), Gaps = 8/236 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M++ +SI P+WN ++ + IKSV Q Y NWEMIIVDDCST A DP Sbjct: 1 MQDK--VSIITPSWNSEKYIVDTIKSVQNQIYKNWEMIIVDDCSTDHTVDIIEAIAKEDP 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I +N GA RN ++ A G +I +D DD W P++L ++ + +F Sbjct: 59 RIKLIKQPVNGGAAKARNISLNEATGRFIAYLDADDIWKPDKLEKQVSFMKSKP-CSFSC 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF--KECLFDTELKA 178 + V + + + P+ Y F N++ ++ L +++ Sbjct: 118 TSYEVVDDDGKPLNKQVHMLPEVDYVG--FLTNNLLQTVGIMVDIDIVDRKYLEMPDIRR 175 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 QD +L+++ + + ++E + ++ K G ++ YR + Sbjct: 176 RQDAATWLQILKAGYKCYGLDEILAEYRRAGNSLS-SNKIKAIKGVWNLYRNIEHL 230 >UniRef50_C6JDU8 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JDU8_9FIRM Length = 635 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 62/258 (24%), Positives = 106/258 (41%), Gaps = 9/258 (3%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--DP 60 +PLISI +P + + + I+S++ Q YSNWE+ IV+ S E++Q+ + + D Sbjct: 80 SPLISIAVPAYQTPVEFLRQMIESLIVQTYSNWELCIVN-ASPDNEEMQKVLAEYSAGDS 138 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ + + N G N+A +A+GE++ +D DD PN L + Q LY Sbjct: 139 RVRFCNLKENLGIAENTNRAFAMAKGEFVGLLDHDDLLAPNALYEIVKILQDHPQADALY 198 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKAA 179 ++ E+ + KS ++ L N I + V + K F E A Sbjct: 199 TDEDKVTTEL---DEHFQPHLKSDFNLDLLRSNNYICHFFVVRKSIVEKTGGFRKEFDGA 255 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QDYD R GE V E + + K + + R + +R Sbjct: 256 QDYDFIFRCTENAGEVLHVPEILYHWRTHKASTADNPASKMY-AFEAGKRAIEAHLERTG 314 Query: 240 KKYQLFTLYQIRNKRMTW 257 K ++ + R+ + Sbjct: 315 TKGEVSHTQDLGFYRVKY 332 Score = 86.8 bits (214), Expect = 6e-16, Method: Composition-based stats. Identities = 44/252 (17%), Positives = 95/252 (37%), Gaps = 24/252 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + PL+S+ +P + ++ ++ + + Y N+E+IIV++ ST+ E + Y + Sbjct: 335 QGKPLVSVIIPNKDEKETLQTCLEMLEKNTGYQNFEIIIVENNSTTDEIFRYYKELSGNR 394 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQLV 114 +I + A+ N A A+GEY+ +++D D W L V + V Sbjct: 395 KIHLLRWGKEFNYSAINNFAAAHAKGEYLLFLNNDVKSINPD-WLEEMLGVCQRPEVGGV 453 Query: 115 THAFLYAND-------YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQVFTW 163 +Y ++ + G + ++ Y + +++ + Sbjct: 454 GAKLIYPDNTIQHAGCVIGMGGIAGHMFVDMPADRTGYLHKASLLQDMSAVTAACLLMKK 513 Query: 164 AWRFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHIN---HGEMQITSSPK 219 + F EL A D D+ L++ + G + ++ H+ G + Sbjct: 514 EVFEQAGGFTEELAVAFNDVDLCLKV-RKNGYLIVYDPYAKLYHMESKTRGAEDSKEKVR 572 Query: 220 KFSGYFHFYRKH 231 +F + R H Sbjct: 573 RFQTEIEYMRCH 584 >UniRef50_D2RIR4 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIR4_ACIFE Length = 256 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 52/229 (22%), Positives = 94/229 (41%), Gaps = 2/229 (0%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ MPT+N + + +I SVL Q ++WE+ IVDDCST Q P+I Y Sbjct: 2 LVSVVMPTYNCGKYILESIDSVLAQTITDWELQIVDDCSTDDTQEILKPYLEKYPQIHYY 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N G R AI A+G+Y+ +D DD W P +L ++ + T + + Y Sbjct: 62 RLSQNGGPAVARTVAIERAEGKYVAFLDSDDLWLPEKLEKQISFMKN--TKSAFSCSAYA 119 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 C + + P+ + N IGN + +K D+ ++ Sbjct: 120 CMDDEGNSLHYALFPPEKTDYAKCIRLSNPIGNLTAIYDQEILGKFQVPPIKKRNDFALW 179 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 L+++ + ++E + ++ K+ ++ Y + Sbjct: 180 LQILKKTTFCAGIQEVLGVYRTGRKGSVSSNKLKQAKYHWQLYHDIEHH 228 >UniRef50_A6Q2S9 Glycosyl transferase n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q2S9_NITSB Length = 272 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 54/217 (24%), Positives = 91/217 (41%), Gaps = 13/217 (5%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE--QLQQYVTALNDPR 61 +PL S+ +P +NR RAI+SVL Q + ++E+I+VDD ST L++Y Sbjct: 7 SPLFSVIIPVYNRANFIARAIESVLHQTFRDFELIVVDDGSTDETPKVLERYP------- 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I + N G A RN+ I A+G+ I +D DDEW N L Q + Sbjct: 60 IQVIRQE-NKGVSAARNRGIKAAKGKIIALLDSDDEWKKNHLRTHADFFQSHPEYTIHQT 118 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 ++ + + + I + V F + LF + + + Sbjct: 119 DEIWIRNGKFLNKKKIHQKKSGYIFYDSLRLCLISPSAVAIKKELFNEVGLFREDFEVCE 178 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS 217 DY+++LR+ +Y P ++ E Q++ Sbjct: 179 DYELWLRITKKY--PVGFTPIQTVIKYGGHEDQLSQK 213 >UniRef50_A1AUB0 Glycosyl transferase, family 2 n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AUB0_PELPD Length = 1250 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 60/225 (26%), Positives = 100/225 (44%), Gaps = 8/225 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL+S+ M N + +AI+S+ Q + N+E II+DD ST + + DPR+ Sbjct: 626 SPLVSVVMSVHNGECYLRQAIQSIFCQSFLNYEFIIIDDAST--DATPDILLQFKDPRLA 683 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N N G A N I A+G+Y+ +D DD P+RL+ + + A + ++ Sbjct: 684 IIRNTGNLGLTASLNIGIRAARGKYVARMDADDISVPHRLARQVESLENNPEIAVVGSSY 743 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFK-ECLFDTELKAAQD 181 Y E + + + + +R ++N G+ V F+ +D AQD Sbjct: 744 YTMD-EHGAVTGIIDVLDRPELIQRELLRQNWFGHGSVMMRKSCFEIVGGYDERFIYAQD 802 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH 226 YD+FLR+ Y + V E + I++ K YF Sbjct: 803 YDLFLRLSERY-KLANVTEPLYCWRESPHG--ISARKKAQQDYFA 844 Score = 190 bits (483), Expect = 5e-47, Method: Composition-based stats. Identities = 59/237 (24%), Positives = 99/237 (41%), Gaps = 13/237 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND-PRIT 63 PL+S+ +PT NR + AI S+L Q ++E+I+V+D T ++ + D I Sbjct: 873 PLVSVIVPTHNRPDMLATAISSILAQTMQSFEIIVVNDAGTD---VRPVLERFGDRENIR 929 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 ++ ++ N G A RN I A+G+YI +DDDD + PN L + + + Y + Sbjct: 930 HLAHETNRGLAAARNTGINAARGKYIAYLDDDDTYYPNHLETLVGYLESN-DCRVAYTDA 988 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRL----FYKRNIIGNQVFTW--AWRFKECLFDTELK 177 C + + + + + YSR N I A +FD L Sbjct: 989 N-CAIQQKAANGFVTVKKEVVYSRDFDYDAILYVNFIPVLCIIHEKACLSLSGMFDEYLS 1047 Query: 178 AAQDYDIFLRMVVEYGEP-WKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 +D+D+++RM K + M +S+P F Y K+ D Sbjct: 1048 RHEDWDLWIRMSRHERFAHIKTVTCEYTYRPDGSSMTSSSAPLFFETYKAVCGKYDD 1104 >UniRef50_A0LJ94 Glycosyl transferase, family 2 n=2 Tax=Deltaproteobacteria RepID=A0LJ94_SYNFM Length = 294 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 58/221 (26%), Positives = 89/221 (40%), Gaps = 6/221 (2%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +PT+NR+ A+ SVL Q+ E+I+VDD S+ V RI + Sbjct: 9 VSVIVPTYNRRAFVKEAVASVLAQEGVFPEIIVVDDGSSDGTDRALEVFG---ERIRRLR 65 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 + GA A RN I +A GE++ +D DD W P++L L + Sbjct: 66 R-AHGGASAARNTGIRIAAGEWLAFLDSDDLWLPSKLRRQLDFLTAHPEFLVCQTEEIWL 124 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYDIF 185 + V P P+ RL + + + V F LFD L A +DYD++ Sbjct: 125 RNGVRINPRKYHRKPRGHCFERLLDRCLVSPSAVMIHRDVFDSVGLFDENLPACEDYDLW 184 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH 226 LR+ Y VEE + H + + P Sbjct: 185 LRIGCTYALGL-VEEQLVVKRGGHPDQLSATVPALDRYRIR 224 >UniRef50_B7DLY9 Glycosyl transferase family 2 n=1 Tax=Alicyclobacillus acidocaldarius LAA1 RepID=B7DLY9_9BACL Length = 896 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 63/262 (24%), Positives = 100/262 (38%), Gaps = 10/262 (3%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALND 59 K PLIS+ MP +N ++ A+ SVL Q Y WE++I DD S+ E ++ A Sbjct: 567 KERPLISVIMPVYNSKELWLNEAVSSVLNQPYPEWELVISDDNSSEQETIRALECIARRH 626 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI I N N G + A+ A+GE +T +D DD PN + L Sbjct: 627 ERIRVIFNSENRGISGNTSVALAQARGELVTFLDHDDVLAPNAFYEVVRAWNDC-HFDIL 685 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLF-DTELKA 178 Y+++ Y +P + K +S N I + ++ + + Sbjct: 686 YSDEDKIGEHGYEEP-----FFKPDFSPDYLLSVNYINHLTVYRRSLLEDVGYLRSGFDG 740 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 AQDYD+ LR + E G + K + + R KD R Sbjct: 741 AQDYDLLLRATEHAKRIVHIPEVLYHWRKVPGSTAESFDSKSY-AHEAGRRAIKDALMRR 799 Query: 239 SKKYQLFTLYQIRNKRMTWRTL 260 ++ + + R+ R L Sbjct: 800 GEEADVLDTGYPGHYRVNRRIL 821 Score = 46.4 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 3/39 (7%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIV 40 PL+SI + T ++ + I SV+ Q Y ++E+IIV Sbjct: 823 EPLVSIIVLTRDKADVLRTCISSVI-QKSTYRHYEIIIV 860 >UniRef50_C4XII1 Putative glycosyltransferase n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XII1_DESMR Length = 308 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 54/213 (25%), Positives = 97/213 (45%), Gaps = 6/213 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+++ +PT +R +L RA+ S L Q ++ E++IVDD S + + +DPRI Sbjct: 12 PLVTVIIPTRDRAELLPRAVASALGQTWTALEILIVDDGS--VDDTAGVLAGFDDPRIRV 69 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + G A RN+A+ A+G+Y+ +D DDEW P + LA+ HA + Sbjct: 70 LRRHTPGGVSAARNEAVRAARGDYLALLDSDDEWLPKKTERQLAYM-AATGHAVSQTQEI 128 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYD 183 +G P + P + + + + FD L A +DYD Sbjct: 129 WMRGGKRVNPTHIHKKPDGWFFEASLGMCLVSPSTTIFRRGVIEGAGFFDESLPACEDYD 188 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITS 216 ++LR+++ P + + Q++ Q+++ Sbjct: 189 LWLRIML--TRPIGLLDEYQVVRHGGRGDQLST 219 >UniRef50_A6M2M4 Glycosyl transferase, family 2 n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M2M4_CLOB8 Length = 1362 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 64/286 (22%), Positives = 117/286 (40%), Gaps = 14/286 (4%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + NPLISI +P +N + + I SV+ Q Y NW++ +VDD ST ++ + Sbjct: 696 LSFNPLISIVVPVYNVLDEQLVECIGSVINQTYDNWQLCLVDDASTWNSVVKVLNKYEKN 755 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 P+I ++ N N I +A GE+I +D DD PN L + + F+ Sbjct: 756 PKINIVYRKENGHISRATNDGISIANGEFIAFLDCDDVLAPNALYEVAKKLNEDSEYDFI 815 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKA 178 Y+++ + + + + K +S F + E TE Sbjct: 816 YSDEDKLTADGKHRHSP---FFKPDWSPDTFMSLMYTCHFSVYRKSIIDEIDGLRTEFNG 872 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK-HKDKFDR 237 AQDYD LR + + +++ H E I S+P+ ++ ++ +R Sbjct: 873 AQDYDFTLRFTEKAKKIGHIDKIL--YHWREREESIASNPEAKPYALQAVKRCKEEALER 930 Query: 238 ASKKYQLFTLYQIRNKRMTW----RTLLTLL--SVRNGKRLADGIR 277 K + + + R+ + + L++++ S N K L + I Sbjct: 931 RGLKGTVEYVNDMFQYRVKYTNDEKPLVSIIIPSKDNFKILNNCIE 976 Score = 88.8 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 52/256 (20%), Positives = 100/256 (39%), Gaps = 22/256 (8%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 PL+SI +P+ + ++ I+SV + Y N+E+++VD+ S E +Y Sbjct: 954 EKPLVSIIIPSKDNFKILNNCIESVKKYTSYYNYEIVVVDNGSN-NENKNKYEDICKKYN 1012 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQ----QLVTH 116 Y + +N + N +A+GE+ ++DD + + L + L +V Sbjct: 1013 CRYHYEKMNFNFSKMCNIGSRIAKGEFYLFLNDDIEIFQEEWLEIMLGQASLKHVGVVGA 1072 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFY-------KRNII---GNQVFTWAWR 166 LY N + Q + P + + + R+ Y + N I G + + Sbjct: 1073 KLLYPNSNIIQHIGITNLKIGPSHSQIGFDDRIVYYYGRNRMEYNFIAVTGACLMIERHK 1132 Query: 167 FKEC-LFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 F+E FD L A D D+ ++V + + V L+ +H + Sbjct: 1133 FEEISGFDENLSVAYNDVDLCFKLVEH--KYYNVVRNDVSLY-HHESISRGIDDINKEKM 1189 Query: 225 FHFYRKHKDKFDRASK 240 ++ K FD+ + Sbjct: 1190 TRLLKERKILFDKHKQ 1205 >UniRef50_A0KM86 Glycosyl transferase, group 2 family protein n=2 Tax=Gammaproteobacteria RepID=A0KM86_AERHH Length = 268 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 59/248 (23%), Positives = 99/248 (39%), Gaps = 8/248 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ MP +N + +I SVL+Q Y+NWE+I++DDCS + D RI + Sbjct: 3 KVSVIMPLYNSARFLAASIDSVLKQQYTNWELILIDDCSVDDCVVIAQAYCDRDSRIKLV 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 NSGA RN+ I +A+G +I +D DD W P +L+ + + AF Y Sbjct: 63 RLAENSGAAVARNRGIEIAEGRFIAFLDSDDLWLPEKLNTQIDFMLKN-KVAFSYTAYKK 121 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 L S S R K +IG + ++ +DY ++ Sbjct: 122 IDEY---GNDLFELGVPSSLSYRALLKACVIGCLTVIYDASQIGKVYMPLGTKREDYALW 178 Query: 186 LRMVVEYG-EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQL 244 L+++ E + + + I + + + R ++ S Y Sbjct: 179 LKILRERNVVAYGINKVLASYRIYPHQSS-SKKLNMAKENWKLLRNQEELCLIISLHY-- 235 Query: 245 FTLYQIRN 252 FT Y +R Sbjct: 236 FTQYALRG 243 >UniRef50_Q15RB6 Glycosyl transferase, family 2 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15RB6_PSEA6 Length = 402 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 72/281 (25%), Positives = 110/281 (39%), Gaps = 11/281 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS--WEQLQQYVTALNDPRIT 63 L SI +P +NR + +A++SVL Q N+E++IVDD ST E ++Q V + +P++ Sbjct: 3 LFSIIIPVFNRTKSLKKALESVLSQSIQNYEVLIVDDGSTPSIAEGIEQLVLSFQEPKLK 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + + IN A RN I A G+YI +D DD W PN+L L + + Sbjct: 63 LLRHAINKNGAAARNTGINAASGKYICFLDSDDFWLPNKLERALEYINTKTCSDTFLIHH 122 Query: 124 YVCQGEVYSQPASLPLYPK---SPYSRRLFYKRNIIG---NQVFTWAWRFKECLFDTELK 177 E SLP K + F N+ G + + K CLFD Sbjct: 123 QYSNSENGKLTESLPKAAKASSESVAHYSFVTNNVGGIQSSTICVPTMVAKNCLFDERFS 182 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QD+D L++ + + E I N + + FY + FD Sbjct: 183 GHQDWDFALQVGAMTQHFYFIPEPLTI--RNKDSDDSVADNLSWEYSLWFYVQRASYFDT 240 Query: 238 ASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRG 278 S Y F +R + L + + L R Sbjct: 241 QSALY-FFQRVVLRKAVFFLELMPVLKNKLMLRILLSSPRA 280 >UniRef50_D1BX31 Glycosyl transferase family 2 n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BX31_XYLCX Length = 264 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. Identities = 51/230 (22%), Positives = 95/230 (41%), Gaps = 9/230 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +++ MP +N ++ + +SVL Q +S E++IVDD S D R+ Sbjct: 9 PTVTVVMPAFNSERTLRDSARSVLDQSFSALELVIVDDSSADRTAAIATELGAGDARVRL 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N + G RN AI A+G Y+ D DD W P++L L Q A +Y+ + Sbjct: 69 IRNPHSLGPAGARNAAISAARGRYVAFCDSDDLWLPSKLERQLEVATQ-TGAALVYSGYH 127 Query: 125 VCQGEVYS-----QPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 + +PA ++ + + RN++G + + A Sbjct: 128 RVDADFSGPASGFRPADRVVHVPTLLTHGALLHRNVVGCLTAMIDTEQTGPVSMPGIPGA 187 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 +D+ ++LR++ E G ++E + + S +++ +R Sbjct: 188 EDWALWLRVLREGGTAAGIDEPLALYRTAQPG---SHSARRWRAVLAVWR 234 >UniRef50_Q3A667 Glycosyltransferase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A667_PELCD Length = 392 Score = 197 bits (502), Expect = 3e-49, Method: Composition-based stats. Identities = 62/230 (26%), Positives = 98/230 (42%), Gaps = 7/230 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 + SI MP WNR + A+ SVL Q + ++E++IVDD ST + L V A D RI + Sbjct: 1 MFSIIMPVWNRAEEVAAAVDSVLAQTFRDFELVIVDDGST--DDLAAVVEAYEDERIRFY 58 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT-HAFLYANDY 124 + G C RN A+ A YI +D D+ W+P+ L++ Q H Y Sbjct: 59 RRP-HEGVCKARNFALSKACYPYIAYLDSDNRWSPDFLAIMRKALQDSPEAHPTAYCQAS 117 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAAQDY 182 + + S L P+ + ++N I F A + FD L+ D+ Sbjct: 118 LFRRNKNSGDLQLRCLIGQPFGFKQLLRQNYIDLNAFVHARALLDSTGGFDETLRRLNDW 177 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 D+ +R+V P V + + IT+ K + +K+K Sbjct: 178 DLIIRLVA-LVRPVYVPQPLVEYYDCVAGNTITAREKLAPAMRYIRKKYK 226 >UniRef50_Q0BQ52 Glycosyltransferase n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BQ52_GRABC Length = 854 Score = 197 bits (502), Expect = 3e-49, Method: Composition-based stats. Identities = 67/263 (25%), Positives = 104/263 (39%), Gaps = 19/263 (7%) Query: 6 LISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTALNDPR 61 L+SI P + R A++SV+ Q Y NWE+I++DDCS LQQ+ A D R Sbjct: 289 LVSIICPVYKPRLSDFALAVESVIAQTYQNWELILIDDCSEEANLKNLLQQFAQA--DSR 346 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I ++ NSG N + A G +I +D DD + L + +A + Sbjct: 347 IKHLTTRKNSGISEASNLGLKAANGSWIAFLDHDDLLEASALDIMIAAAEATGAKLLYSD 406 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKAAQ 180 D + + PA K ++ RL + N I + V F+ + AQ Sbjct: 407 EDKIDDSGFFRDPA-----FKPDWNYRLLLEVNYICHFVLVQRDVINSAGPFNKQFDGAQ 461 Query: 181 DYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQ--ITSSPKKFSGYFHFYRKHKDKFD 236 D+D+ LR+ E + V E I G I + P +H + Sbjct: 462 DHDMLLRIAERLEPSEVFHVPEILYHWRITPGSTAGDIGAKPYAIEAGLQCVSRH---LE 518 Query: 237 RASKKYQLFTLYQIRNKRMTWRT 259 R K + T Y + ++ W T Sbjct: 519 RRGLKADVSTRYGMTLYKIDWET 541 Score = 62.6 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 50/261 (19%), Positives = 96/261 (36%), Gaps = 28/261 (10%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSW--EQLQQYVTALN 58 N+P +SI +P ++ + + IK+++ + Y+N+++++VD+ ST LQ YV A Sbjct: 542 NNSPSVSIVIPFKDQIDVTLNCIKNIIEKTHYTNYQIVLVDNWSTDPNMSLLQDYVAA-- 599 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHA 117 I I +I + N A +Y +++D T + L +A + Sbjct: 600 HSNIKIIRQEIPFNYSLLNNIACASYPADYYVFLNNDLFVLTSDWLYRLVAEAEVDPRVG 659 Query: 118 -----FLYANDYVCQGEVYSQPASLPLY-------PKSPYSRRLFYKRNI---IGNQVFT 162 F+Y N + G V + + + Y R + + + + Sbjct: 660 AVGGKFVYPNGTIQHGGVILGIGGVAGHVHTGLPGDEGGYGGRANFTQEMSAVTAAGMLV 719 Query: 163 WAWRFKECL-FDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 A F E FD A D D+ LR+ E H E Sbjct: 720 RAKAFHEIGCFDESKLAVAFNDIDLCLRLRAAGYTIIYTPEFFA----EHHESLSRGDDV 775 Query: 220 KFSGYFHFYRKHKDKFDRASK 240 + + F+++ + +R + Sbjct: 776 RPAQERRFFKETQIMLERWGE 796 >UniRef50_UPI000174589B glycosyl transferase, family 2 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174589B Length = 717 Score = 197 bits (501), Expect = 4e-49, Method: Composition-based stats. Identities = 63/284 (22%), Positives = 113/284 (39%), Gaps = 15/284 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 + + PLIS+ +P +N + R I++VL Q Y WE+ I DD S + + + A Sbjct: 151 LTDGPLISVLLPVYNTPLRWLKRVIETVLGQAYPKWELCIADDASPDPQVRKTIESYAAG 210 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 DPRI + N A N A+ LA G ++ +D DDE P+ L A Sbjct: 211 DPRIKVVFRPSNGHIVAATNSALELATGSFVALLDHDDELRPHALLEMAREIMGNPEAAL 270 Query: 119 LYAN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTEL 176 +Y++ D++ + V P Y K ++ L +N + + A ++ F Sbjct: 271 IYSDEDHLDEAGVRYAP-----YFKPDFNYDLLLSQNCVCHLGVYRADLVRQLGGFRAGT 325 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ----ITSSPKKFSGYFHFYRKHK 232 + AQD+D+ LR+ G + H E ++ S H Sbjct: 326 EGAQDWDLALRVFEAVGRDRIHHIPKILYHWRSIEGSTARGVSEKSYASSAGRKVVEDHL 385 Query: 233 DKFDRASKKYQLFT--LYQIRNKRMTWRTLLTLLSVRNGKRLAD 274 + ++A + + ++R + ++ RN + L + Sbjct: 386 SRTNQAVEGVEEVKQGHLRVRWSLPNPPPVAIIIPTRNFRHLLE 429 Score = 61.4 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 73/213 (34%), Gaps = 31/213 (14%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALN- 58 + N P ++I +PT N + L A++SVL + Y N+ ++IVD+ S E Y+ +L Sbjct: 409 LPNPPPVAIIIPTRNFRHLLEVAVESVLARTDYPNFRLVIVDNDSN-EESTLDYLASLQA 467 Query: 59 ----DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLA 108 D + + N+A+ + +++D D W +S + Sbjct: 468 QGKAD----VLRIPGPFNYSLLNNRAVAACTEPVVCLLNNDIEIHERD-WLREMVSQAVR 522 Query: 109 HKQQLVTHAFLYANDYVCQGEVYSQPASLPLYP-------KSPYSRRLFYKRNII----G 157 V Y + + V + + RL +N Sbjct: 523 PGVGAVGTKLYYPDGRIQHAGVILGMGGAAGHFLKGCDSSNESHGGRLHVCQNFSAVTAA 582 Query: 158 NQVFTWAWRFKECLFDT-ELKAA-QDYDIFLRM 188 V + D + K A D D L++ Sbjct: 583 CLVVERRKYLEVGGLDEGDFKVAFNDIDFCLKL 615 >UniRef50_D0D9G6 Glycosyl transferase family 2 n=1 Tax=Citreicella sp. SE45 RepID=D0D9G6_9RHOB Length = 319 Score = 196 bits (499), Expect = 6e-49, Method: Composition-based stats. Identities = 63/269 (23%), Positives = 104/269 (38%), Gaps = 8/269 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + P +S+ +P NR L RAI SVL Q Y + E+I VDD ST + +DP Sbjct: 22 LSEVPTVSVVLPVHNRSSLVTRAIDSVLGQTYRDIELIAVDDASTD-DTPDTLARYSHDP 80 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R + N N GA RN I LA+G Y+ D DD W P +L L Q Y Sbjct: 81 RFRTVRNAQNLGAAGTRNVGIGLARGRYLAFQDSDDRWFPEKLETQLKVLQASADCRACY 140 Query: 121 AND--YVCQGEVYSQPASLPLYPKSP-YSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK 177 + Y + Y PK ++ L + + + +FD L+ Sbjct: 141 SGAVYYSKEHSYYIPREGAHGAPKGDIFADTLMINPTTPQTLLIERSLLTETGVFDDTLR 200 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF-- 235 +D+D+ LR+ + V E +++ + + + + ++ + F Sbjct: 201 INEDWDLALRLARATPFAF-VPEPLVMIYRTPDSVS-SHMARDTAFRVRMLDQYAEDFAH 258 Query: 236 DRASKKYQLFTLYQIRNKRMTWRTLLTLL 264 + A++ Q + L R + Sbjct: 259 NTAARSRQNYVLGSQYLALGDHRNAMRYF 287 >UniRef50_C7RJP4 Glycosyl transferase family 2 n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RJP4_9PROT Length = 312 Score = 196 bits (498), Expect = 8e-49, Method: Composition-based stats. Identities = 61/279 (21%), Positives = 113/279 (40%), Gaps = 28/279 (10%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P++S+ +P +N + S+L Q+ + E+I+++D ST + +P + Sbjct: 2 PVVSVVIPAYNCALYIGETLASILAQEAAELEIIVINDGSTDETG--AIARSFGEP-VRV 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I NSG CA RN I A+GE+I +D DD W PN+L+ LA ++ Sbjct: 59 IDQP-NSGVCAARNHGIREARGEFIALVDHDDYWLPNKLANQLAAFADNPQVDVVFTAFV 117 Query: 125 ----VCQGEVYSQPASL-----PLYPKSPYS----RRLFYKRNIIGNQVFTWAWRFKE-C 170 +G + QPAS P S +S ++ ++ + + + Sbjct: 118 WWRQESEGGRFPQPASFAAQAAPQGIDSDFSGWIYHQMLLDSWVLTSTALARSAVVRASG 177 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 FD L ++D+D +LR+ + K+ EAT + + + + + Sbjct: 178 GFDETLPFSEDWDFWLRVA-RTSQFLKLREATTLYRQHPHQGSRVTRALDYRTRL----- 231 Query: 231 HKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNG 269 ++AS+++ L + +R L S G Sbjct: 232 ----LEQASRQWGLSSQDGRCVAPGVFRRQLAQYSASFG 266 >UniRef50_A5ZV71 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=A5ZV71_9FIRM Length = 250 Score = 196 bits (498), Expect = 8e-49, Method: Composition-based stats. Identities = 62/240 (25%), Positives = 99/240 (41%), Gaps = 7/240 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ PT+N + I+SV Q Y WEMIIVDDCST + D RI Y Sbjct: 5 LVSVITPTYNCAKFIGETIESVQAQTYQQWEMIIVDDCSTDNTKEIVDKYIKEDSRIKYF 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + NSGA R +A+ LA GEY+ +D DD W +L LA ++ AF Sbjct: 65 CLENNSGAAVARTKAMELANGEYMAFLDSDDIWPEEKLKKQLAFMKKH-DVAFSCTAYEQ 123 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD-TELKAAQDYDI 184 + PK+ Y+R L +GN + K F+ ++ D + Sbjct: 124 IDENGKVLNKIIKTVPKADYNRVLL--DCPVGNSTVMYN-VEKMGKFEVPNIRKRNDDAL 180 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQL 244 +L+M+ + + + I + ++ K ++ YR + A + + Sbjct: 181 WLQMLKKEKYIYGMRSVLMKYRIRQNSIS-SNKFKVIKYHWILYRDI-EHLSIARSVFHI 238 >UniRef50_Q47GL6 Glycosyl transferase, family 2 n=1 Tax=Dechloromonas aromatica RCB RepID=Q47GL6_DECAR Length = 309 Score = 196 bits (498), Expect = 9e-49, Method: Composition-based stats. Identities = 67/230 (29%), Positives = 105/230 (45%), Gaps = 13/230 (5%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +SI PT+N L RAI SVL+Q + +WE+++VD+ S ++ V D RI Y Sbjct: 23 VSIVTPTFNCAPLLQRAIDSVLKQTHQDWELLVVDNHSNDG--TEELVRGYRDSRIRYFK 80 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ--LVTHAFLYANDY 124 N A RN AI A+GE++ +D DD WTP +L V + +Q V + LYA Sbjct: 81 IQNNGVIAASRNLAISQAKGEWLAFLDADDWWTPEKLEVSVRSLRQGWDVVYHDLYAAGP 140 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ--VFTWAWRFKECLF--DTELKAAQ 180 +G + + S L + P L N + N V + + D L A + Sbjct: 141 GSKGRLRRRVPSRAL--RQPVLDDLVNHGNALPNSSVVVRRSLIDRVGGLSEDPALIAME 198 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 D+D +LR+ +++ A I G T++P++ + RK Sbjct: 199 DFDCWLRVARISDRFLRIKGAHGYYWIGGG---NTTNPRRTISTLNELRK 245 >UniRef50_A4BPR7 Glycosyl transferase, group 2 family protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BPR7_9GAMM Length = 348 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 55/252 (21%), Positives = 104/252 (41%), Gaps = 8/252 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +++ +P NR AI SVL Q ++++E++I+DD S+ V + DPR+ Sbjct: 5 STPTVTVLIPVHNRVGYIRDAIDSVLAQTFADFELLIIDDASSDGS--ADAVASYRDPRL 62 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL--Y 120 + N+ N G RN+ I A+GEY+ +D DD P RL+ +A + HA + + Sbjct: 63 RLLRNETNLGIPGSRNRGIDEARGEYLAFLDSDDRARPERLARQVAFLRDHPDHAAVGSW 122 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELKAA 179 G+ + + + + RLF R+ + N T + +K Sbjct: 123 IEWMSQNGKPLGRIKRKAVAAEQIAAERLF--RSCLENSAATARTAILRRYRHRERIKLG 180 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 DYD++ R+ ++ + + + G S + + + + Sbjct: 181 SDYDLWARIAADH-KLAALPHVLTYRRQHGGRSTHGRSEEIKAWRLQIFAEQLGALGLPY 239 Query: 240 KKYQLFTLYQIR 251 L Y++R Sbjct: 240 SADDLERHYELR 251 >UniRef50_A1ZLY5 Glycosyl transferase domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZLY5_9SPHI Length = 337 Score = 195 bits (497), Expect = 1e-48, Method: Composition-based stats. Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 7/216 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + PL+S+ MP +N ++++S+L Q Y+N+E II+DD ST + + D Sbjct: 1 MTDQPLVSVVMPVYNASPYLAQSMQSILNQTYANFEFIIIDDGSTDDS--LKVIEKYTDR 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N G NQ +A+G+YI +D DD R+ + + Sbjct: 59 RIQVYQHSQNQGVITALNQGFGMAKGKYIARMDADDYCEITRMEKQVHLMEAQPDTDLCG 118 Query: 121 AN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKA 178 Y+ Q LP+ ++ + L+ + +F ++ K L FD Sbjct: 119 TWVAYITQHS--KTITQLPVESEAIKAFLLWGNSIMHATTMFRRSFLLKHHLKFDQGFVH 176 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQI 214 A+DYD++ R + + E + N ++ + Sbjct: 177 AEDYDLWTRCLA-IAHFINIPEVLYYIRQNDQQVSV 211 >UniRef50_Q052L4 Glycosyltransferase n=10 Tax=Leptospira RepID=Q052L4_LEPBL Length = 281 Score = 195 bits (497), Expect = 1e-48, Method: Composition-based stats. Identities = 57/233 (24%), Positives = 98/233 (42%), Gaps = 8/233 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NPL+SI +P +N + ++S+LRQ Y WE+IIVDD S + N T Sbjct: 2 NPLVSIVIPCYNYGRYIHETVESILRQRYKYWEVIIVDDGSNDPFTISILEEYKNKSGFT 61 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + SG RN I AQGE+I +D DD + L ++ ++ + +Y Sbjct: 62 VLSIPR-SGPSTARNIGIDTAQGEFILPLDSDDMIHEDYLLEAISAYEKNPSLGIVY--- 117 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA-AQDY 182 C+ E + P+ + + I + VF + + F+ +K +DY Sbjct: 118 --CEAEFFGSIKGKWNLPEYRF-PNILLDNCIFVSAVFKKSDWKEVGGFNENMKNEWEDY 174 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 D +L ++ + G+ +++ + I H S Y Y HK+ + Sbjct: 175 DFWLSLIEKGGKVYRIPKVLFYYRIGHSSRSQRSLKHFLPLYMQLYENHKELY 227 >UniRef50_Q093P1 Glycosyl transferase, group 2 family protein (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093P1_STIAU Length = 1091 Score = 195 bits (496), Expect = 1e-48, Method: Composition-based stats. Identities = 61/275 (22%), Positives = 98/275 (35%), Gaps = 17/275 (6%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTALND 59 P ISI P +N ++ I+SV Q Y NWE+ +VD S E LQ + A+ D Sbjct: 274 RPRISILTPVYNTPPEVLRETIRSVQAQTYPNWELCLVDGNSPMPHVREILQNF--AVQD 331 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ D+N G N+A+ +A GE+I +D DD P L + H + Sbjct: 332 ERVCVRRLDLNLGISGNTNEALAMASGEFIALLDHDDTLAPFALYEIVKHLNAAPRTDMI 391 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKA 178 Y++ + V + K +S L G+ V+ A + F Sbjct: 392 YSD----EDRVDMDGTRHTFFFKPDWSPDLLQSFMYTGHLSVYRRALVEELGGFRPAYDL 447 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QDYD+ LR+ + + G + P +D R Sbjct: 448 SQDYDLALRVTERTQSIAHIPKVLYHWRTMAGSAAVGDKPH---ARITNIGALEDAMKRR 504 Query: 239 SKKYQLFT-LYQIRNKRMTWRTLLT--LLSVRNGK 270 ++ R K R ++ N + Sbjct: 505 GYDAEVIMDPMANRVKFKAPRGAFASIIIPSDNLQ 539 Score = 65.6 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 52/293 (17%), Positives = 101/293 (34%), Gaps = 47/293 (16%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTAL--NDPRITY 64 SI +P+ N Q + VL+ Y ++E+++V + +L V + ++PR+ Sbjct: 530 SIIIPSDN-LQHIHACVGDVLKTTAYPHYEVLVV-----THSRLIPDVQSRYAHEPRVRT 583 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ----LVTHAFL 119 + D N+ + A+G+++ ++DD P L L + QQ V+ + Sbjct: 584 VLYDEPFNFSLKCNRGVEQAKGDFVLFLNDDARPLDPEWLECMLGYFQQEGVGAVSPKLV 643 Query: 120 YANDYVCQGEVYSQ------------PASLPLYPKSPYSRRL--FYKRNIIGNQVFTWAW 165 Y+N+ + + S P + PLY S R I V Sbjct: 644 YSNNTLQHAGMISGVRNLVGTAFHTLPRNTPLYFNMAQSTRTTSLLSAACI---VMRKKI 700 Query: 166 RFKECLFD-TELKAAQ-DYDIFLRMVVEYGEPWKVEEATQILHINH----GEMQITSSPK 219 FD D D+ ++ E G T + H+ H E + + Sbjct: 701 FEAVGGFDAVNTPIMHSDVDLCFKI-REAGLRLVYTPFTTLEHVGHVSLGEEERKGNRHH 759 Query: 220 KFSGYFHFYRK------HKDKFDRASKKYQLF---TLYQIRNKRMTWRTLLTL 263 + ++ + F +++ F T Y+++ R Sbjct: 760 SQKSDIYVLKRWSEFCAYDPYFPDNMREFLFFDSPTYYKMQAGARCERKATRY 812 >UniRef50_A3XRJ7 TuaG n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XRJ7_9FLAO Length = 256 Score = 195 bits (496), Expect = 1e-48, Method: Composition-based stats. Identities = 56/241 (23%), Positives = 97/241 (40%), Gaps = 7/241 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+SI P++N ++ I SV +Q ++WE++I+DD S+ D RI Sbjct: 10 RPLVSIVTPSYNSEKFIAETIASVQKQTVTDWELLIIDDASSDDTVACVKKLREKDSRIH 69 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N G RN I A+G+Y+T +D DD W P L + ++ H+F++A+ Sbjct: 70 CIPLSENKGPAHARNLGIQKAKGKYLTFLDADDLWFPEFLERSITEAKK---HSFVFASY 126 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYD 183 + QP S K N I + +K QD Sbjct: 127 KRLDENL--QPYLSDFIVPDKVSYTDILKSNAISCLTAFINIEVLGKKYMPLIKKRQDMG 184 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++L+ + + + ++E I I + + + FYR+ ++ + Y Sbjct: 185 LWLQYLKKIDYAYGIQEPLAIYRIRRNSLS-RNKTGLIKYQWQFYRE-VEQLSVLASAYY 242 Query: 244 L 244 L Sbjct: 243 L 243 >UniRef50_Q094Y6 O-antigen biosynthesis protein RfbC n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094Y6_STIAU Length = 715 Score = 195 bits (496), Expect = 1e-48, Method: Composition-based stats. Identities = 51/244 (20%), Positives = 92/244 (37%), Gaps = 10/244 (4%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--N 58 P S+ +P + + I+SVL Q Y +W++ IVDD S ++ V A Sbjct: 194 TQGPRFSLAVPAYETPEPYLRACIESVLAQTYPDWQLCIVDDGS-RGSGVEHVVRAYARR 252 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 +PRI + N G N A+ A G+++ +D DD P L V H Sbjct: 253 EPRIVFERLARNVGIARATNAALAHATGDFVAFLDHDDVLAPRALEVVAQHMADAPEGEV 312 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 Y++ + ++ + K S L N I + + +E + Sbjct: 313 FYSD----EDKIDAHGVRHAPSFKPALSPDLLRSVNYICHFLVVRTRLLQEVGGIRPGFE 368 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 AQD+++ LR+V + E ++ G + + K + R ++ R Sbjct: 369 GAQDHELLLRLVERTQRFHHIPEVLYHWRVHAGSTS-SDASAKPAASEAGRRAVEEHLQR 427 Query: 238 ASKK 241 ++ Sbjct: 428 QAEA 431 >UniRef50_Q8KI14 Similar to Glycosyl transferase n=1 Tax=Pseudomonas aeruginosa RepID=Q8KI14_PSEAE Length = 264 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 62/233 (26%), Positives = 103/233 (44%), Gaps = 6/233 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N L+SI P++N ++L R I+SVL Q + NWEMIIVDDCS + A D RI Sbjct: 5 NEGLVSIITPSYNAEKLIGRTIQSVLDQTFDNWEMIIVDDCSKDSTRSVVAAYAEKDSRI 64 Query: 63 TYIHNDINSGACAV-RNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLY 120 + + N+GA A RN + A G++I +D DD W P +L + L + + Sbjct: 65 RLVGLEKNNGAPAAPRNIGVQHASGDWIAFLDADDIWHPRKLELQLEAIKSTRARFSCTQ 124 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK--A 178 D+ E+ + +S++ R I + V E F+ +++ A Sbjct: 125 MVDFFDDKEIVFESVKTVPQQIVDFSQQRIKGR-IPTSSVLLDKELIMEFPFNVDMRYKA 183 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 +DY +LR++ + K+ G++ S F +R++ Sbjct: 184 VEDYHCWLRILSSETKCLKLNAPLLYYRRVVGQIS-GSKKYMLERMFMLHREY 235 >UniRef50_Q1ZRU0 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Photobacterium angustum S14 RepID=Q1ZRU0_PHOAS Length = 275 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 82/271 (30%), Positives = 129/271 (47%), Gaps = 14/271 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+Y+ T NR +L RA+ SV +Q YSN E+II DD S + +Y+ + ND + + Sbjct: 4 KVSVYIATHNRIELLKRALNSVRKQTYSNIEIIICDDGS--RDGTWEYLKSCNDSNLIIL 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ--QLVTHAFLYAND 123 N GAC RN+ I A G ITG+DDDD + R+ F+ + +FLY + Sbjct: 62 RNSEPLGACVSRNRCINAATGRLITGLDDDDYFYLKRIERFIEKYNLYKEKNFSFLYGDL 121 Query: 124 YVC-QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDY 182 + Q + + A + K K+N +GNQVF + K LFD L A QDY Sbjct: 122 IINEQNNKFIRKAPKTVTFKD------IVKKNHVGNQVFIEKDKIKSELFDESLPAMQDY 175 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQIT-SSPKKFSGYFHFYRKHK--DKFDRAS 239 D+++R++ +YG V+ + IL I+H +I+ S + K+K F+ Sbjct: 176 DLWIRLIKKYGNAIGVQNSGYILDISHPHERISKKSERLIRACEVIIGKNKINKVFNEDV 235 Query: 240 KKYQLFTLYQIRNKRMTWRTLLTLLSVRNGK 270 L+ +K ++TL L + Sbjct: 236 FLLNLYAYGFETHKLTFFKTLNYLKHKSFYR 266 >UniRef50_C1VEY2 Glycosyl transferase n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1VEY2_9EURY Length = 332 Score = 194 bits (494), Expect = 2e-48, Method: Composition-based stats. Identities = 52/242 (21%), Positives = 104/242 (42%), Gaps = 9/242 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW-EQLQQYVTALNDPRITY 64 L+S+ +P + R + A++SV Q Y+ E+++VDDCS E + + R+ Sbjct: 9 LVSVVVPAYGRPEYLDAAVESVNDQTYAPIELVVVDDCSPDPIEPIVADAETDSLHRVHV 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT-HAFLYAND 123 + +++N GA A RN I ++GE++ +DDDD W P ++ + + F+Y Sbjct: 69 VRHEVNRGANAARNTGIEASRGEFVAFLDDDDYWRPRKVEKQVRAFENASDEVGFVYTGQ 128 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKR---NIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + +P + ++ + + + + L DT + Q Sbjct: 129 ENVADDGTTTNYRVPETRG--WVTHELFRGAPLCPFSSVMVRRSVIDEVGLLDTRFPSWQ 186 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHIN-HGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 D + +LR+ E V E + ++ HG++ K+ Y F KH+ + Sbjct: 187 DREWYLRVS-EVCAFETVTEPLAVRRVDSHGQISDNYEQKRDVSYPLFLEKHRPLARKYG 245 Query: 240 KK 241 ++ Sbjct: 246 RR 247 >UniRef50_D0L0V1 Glycosyl transferase family 2 n=2 Tax=Gammaproteobacteria RepID=D0L0V1_HALNC Length = 958 Score = 194 bits (494), Expect = 2e-48, Method: Composition-based stats. Identities = 61/289 (21%), Positives = 115/289 (39%), Gaps = 22/289 (7%) Query: 2 KNNPLI-SIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW---EQLQQYVTA 56 ++ P+I SI MP +N ++ I SV Q Y +WE+ I DD S + L +Y+ Sbjct: 420 QSTPVIISIVMPVYNTPEKYLRLCIDSVRAQSYPHWELCIADDKSPQPHVKKVLDEYIK- 478 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D RI ++ N N A+ LA GEY+ +D DD + L + Sbjct: 479 -KDKRIKVVYRPQNGHISKASNSALKLATGEYVALLDHDDALPEHALYFMAQAIAEHPEA 537 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTE 175 LY++ + ++ + KS ++ LFY +N + + + F T Sbjct: 538 QILYSD----EDKIDIHGQRSEPHFKSDWNPDLFYSQNYVSHLGVYKRELLQRINGFRTG 593 Query: 176 LKAAQDYDIFLRMV--VEYGEPWKVEEATQILHINHGEMQITSSPKKF------SGYFHF 227 ++ +QD D+ LR + V+ + + + G + S K + F Sbjct: 594 VEGSQDQDLLLRCLPHVKAEQIIHIPKILYHWRTLEGSTAMASGEKSYTTDAGIKALSDF 653 Query: 228 YRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLT-LLSVRNGKRLADG 275 + K+ + + + Y++ L++ L+ R+ K + + Sbjct: 654 FEKNGPAGIKIEQG-LVPNTYRVHWPIPNPAPLVSLLIPTRDRKTITEI 701 Score = 89.1 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 54/246 (21%), Positives = 93/246 (37%), Gaps = 24/246 (9%) Query: 5 PLISIYMPTWNRQQLAIR-AIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVT-ALNDPR 61 PL+S+ +PT +R + A++S+L + Y N+E+II+D+ S E L + D R Sbjct: 684 PLVSLLIPTRDR-KTITEIAVRSILDKTTYPNYEIIILDNGSEEPETLDWFAAIQQEDER 742 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ----LVTH 116 + + D A+ N A+G I I++D + +P+ L+ ++H + V Sbjct: 743 VKVLRYDHPFNYSAINNFGAQHAKGSLIGLINNDVEVISPDWLTEMVSHALREDIGCVGA 802 Query: 117 AFLYANDYVCQGEVYSQPASLPLYP-------KSPYSRRLFYKRNII----GNQVFTWAW 165 Y+ND + V + + Y RL +N + + Sbjct: 803 KLYYSNDTLQHAGVILGIGGVANHSHKNSKRDSPGYFARLIVAQNFSAVTAACLIIRKSV 862 Query: 166 RFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILH--INHGEMQITSSPKKF 221 + D LK A D D L++ A H I+ G +F Sbjct: 863 YDQVGGLDEVNLKVAFNDVDFCLKVREAGYRNLWTPYAELYHHESISRGTEDSPEKQARF 922 Query: 222 SGYFHF 227 G F Sbjct: 923 RGEVEF 928 >UniRef50_B2GLA0 Putative glycosyltransferase n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GLA0_KOCRD Length = 345 Score = 194 bits (494), Expect = 3e-48, Method: Composition-based stats. Identities = 70/248 (28%), Positives = 115/248 (46%), Gaps = 15/248 (6%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS--WEQLQQYVTALNDP 60 PL+S+ + +N Q + +AI SVLRQ + + E+ +VDD ST E L Y Sbjct: 12 EQPLVSVVIAVYNGQDMVGQAISSVLRQTHPSIEVFVVDDGSTDHTPEVLAGYEALDRPG 71 Query: 61 R-ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA-- 117 R + + + N+G A RN+A+ L G+Y+T +D DDE P+ L L +QL Sbjct: 72 RAVRVLRQE-NAGCGAARNRALELISGDYVTFLDADDELLPHHLETMLERYRQLQREPGR 130 Query: 118 ---FLYANDYVC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN--QVFTWAWRFKEC 170 +Y N + C G + + P R + + N IG+ +F A+ + Sbjct: 131 ERTVVYTNAWQCTPSGINAERMVNREAMPAPEDQRSVILEYN-IGHLFGLFPRAFFEEVG 189 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 FD E +D++++LR V +V+E T IL + MQ + + +G RK Sbjct: 190 TFDPEQVFVEDWELWLRAVYSGWRFDRVDEVTVILSWSGSSMQ-SQRDRMAAGEEMALRK 248 Query: 231 HKDKFDRA 238 +F++ Sbjct: 249 TLVRFEQE 256 >UniRef50_B9M559 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. FRC-32 RepID=B9M559_GEOSF Length = 281 Score = 194 bits (493), Expect = 3e-48, Method: Composition-based stats. Identities = 57/206 (27%), Positives = 99/206 (48%), Gaps = 7/206 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ +PT+NR +L AI SVL Q Y N+E++I+DD ST + + V P+I YI Sbjct: 3 KVSVIIPTYNRSELLKCAITSVLDQAYDNYELLIIDDGST--DDTKSIVNGFGSPKIRYI 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY- 124 + NSG + RN I L++GEY++ +D DD + PN+LS+ + + +YA Sbjct: 61 YQS-NSGRSSARNYGIKLSKGEYVSFLDSDDIFLPNKLSLQVQALDENPNCGLVYAYAKN 119 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF--KECLFDTELKAAQDY 182 V + + Y L+ K N I + + F+ + +D Sbjct: 120 VDENGRFLDFHFDGNLSGKIYPEMLYIKNNYITTPTVMVRAKVLAQIGGFNESMHMCEDL 179 Query: 183 DIFLRMVVEYGEPWKVEEATQILHIN 208 D++ R+ +Y E ++ ++ + I Sbjct: 180 DLWRRIARKY-EVKQICQSLAHVRIR 204 >UniRef50_C6ATV7 Glycosyl transferase family 2 n=1 Tax=Rhizobium leguminosarum bv. trifolii WSM1325 RepID=C6ATV7_RHILS Length = 1015 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 48/229 (20%), Positives = 87/229 (37%), Gaps = 1/229 (0%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P++SI P + ++S++ Q + +WE I+DD ST A D R Sbjct: 2 PMVSIITPAHGAETTISSTLESLIAQSHPDWECFIIDDGSTDQTAEIADRFAARDTRFRV 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + + SG A RN + A+G+++ +D DD P L L H + L L+ Sbjct: 62 LRQEQ-SGVSAARNAGLAQAKGDWVVFLDSDDALAPFHLETMLNHTRTLPQADILHCGWR 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDI 184 + + + +P++ Y I + + + F+ E+K +D+D+ Sbjct: 121 RLKNGAPWWQSHPAVKMDNPFAVAARYCPFAIHAALLRRSRLAEVGGFNPEMKMCEDWDL 180 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 + R+ E VE + + G + G R H Sbjct: 181 WQRLARAGAEFAPVEGLMADVSVEAGSLSSNRVKHLDFGLKVIRRGHHS 229 >UniRef50_C4V652 Family 2 glycosyltransferase n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V652_9FIRM Length = 253 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 57/249 (22%), Positives = 91/249 (36%), Gaps = 10/249 (4%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ MP +N + AI+SV Q Y WE+I+VDD S + L D RI I Sbjct: 11 VSVIMPVYNDTKYIREAIRSVSCQTYQFWELIVVDDGS-QQDIKMVLTPLLQDHRIHLIR 69 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 + N G RN I A G Y+ +D DD W P++L L ++ + C Sbjct: 70 LERNRGVANARNAGIQQATGRYVAFLDSDDVWLPDKLERQLQFMRKTKAAMSYTSYRRFC 129 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFL 186 Q A P+ S + + N IG L + +DY +L Sbjct: 130 HD---IQDAGKPVPIVSDVTYDKLLRGNCIGCLTVMIDRSQCASLSMP-MVHHEDYAAWL 185 Query: 187 RMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLFT 246 + + + + + G + K +R ++D + ++ Sbjct: 186 NLAKTGVRIYGMTDDLARYRVRGGSLS----ANKLKSLLWTWRVYRDSQQLTMLRSLIYM 241 Query: 247 -LYQIRNKR 254 Y IR R Sbjct: 242 VHYVIRGIR 250 >UniRef50_C9K187 Putative glycosyltransferase n=1 Tax=Klebsiella pneumoniae RepID=C9K187_KLEPN Length = 290 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 78/262 (29%), Positives = 124/262 (47%), Gaps = 4/262 (1%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+++Y+ T NR +L R + S+ Q + E+I+VDD S L D RI Y+ Sbjct: 3 LVTVYITTCNRLELLKRCLNSIKEQTIRDIEIIVVDDNSNDDTALFMKSECEKDTRIKYL 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 NDIN GAC RN+AI +A EYITG DDDD + +R+ F+ + +L FLY + Sbjct: 63 RNDINRGACYSRNKAISVASSEYITGCDDDDYFEKDRIKSFVENSDKLEQFVFLYTDSLW 122 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 ++ A + + K+ S N +GNQVFT ++ LFD L A QD++ + Sbjct: 123 LTSRGVNK-AGINKFAKAIVSSEDLLYFNYVGNQVFTKTSILQKYLFDVNLPAWQDFECW 181 Query: 186 LRMVVEY-GEPWKVEEATQILHINHGEMQITSSPKK--FSGYFHFYRKHKDKFDRASKKY 242 R++ + ++ I I+H +I+S ++ Y F K+ K Sbjct: 182 FRILRGTNKKALRINSYNYIQDISHPFSRISSGKQERIIKAYDLFCNKYNLKESERKILR 241 Query: 243 QLFTLYQIRNKRMTWRTLLTLL 264 F Y + T++ L+ Sbjct: 242 LHFFNYGFGKQYFINSTIIALI 263 >UniRef50_A5G2M9 Glycosyl transferase, family 2 n=1 Tax=Acidiphilium cryptum JF-5 RepID=A5G2M9_ACICJ Length = 880 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 67/277 (24%), Positives = 102/277 (36%), Gaps = 18/277 (6%) Query: 3 NNPLISIYMPTW--NRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALND 59 + PL+S+ P + + A+ SV Q Y NWE+I+VDD S A D Sbjct: 312 DAPLVSVICPVYRPDHGAFVT-AVDSVRAQTYRNWELILVDDGSGDARLTALMKRLADAD 370 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI N+G A N+AI A G++I D DD P L + Q L Sbjct: 371 PRIRVQIRKANAGIAAATNRAIEAATGDFIAFFDHDDVLEPCALDAMMRA-QAATGARLL 429 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKA 178 Y++ + ++ + A K ++ RL N I + A + D +L Sbjct: 430 YSD----EDKIDRRGAFSEPNFKPDFNYRLLLDLNYICHLAVAEAALVRAAGGLDPQLDG 485 Query: 179 AQDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 AQD+D+ LR+V G E V E I + S K Sbjct: 486 AQDHDLLLRLVERLGAHEIHHVPEILYHWRIT-AQSTAGSGAAKPKAALAGAAAVAAHLK 544 Query: 237 RASKKYQLFTLYQIRNKRMTWR-----TLLTLLSVRN 268 R + + R ++R + L+ R+ Sbjct: 545 RRGLAARTERRGGLTCYRTSFRMSEDPGVSILIPYRD 581 Score = 49.9 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 91/277 (32%), Gaps = 31/277 (11%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDY-SNWEMIIVDDCSTSWEQ--LQQYVTAL 57 M +P +SI +P + + ++++ + +E++++D+ S E L Sbjct: 567 MSEDPGVSILIPYRDHIGMTRACVEAIRDVTRGTRYEILLLDNWSQDEEAEGFAVEQGNL 626 Query: 58 NDPRITYIHNDINSGACAVR--NQAIMLAQGEYITGID-----DDDEWTPNRLSVFLA-- 108 D R+ I N R N+ + A+ ++ ++ D EW L+ LA Sbjct: 627 PDTRVIRIAEPFN----YSRINNRGVEAARHPFLLFMNNDVFVSDPEWLRTMLNEALADP 682 Query: 109 ----------HKQQLVTHAFLY-ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG 157 + + V HA + + PA P Y + + R Sbjct: 683 GVGAVGAKLLYPNETVQHAGVVLGVGGIADHSFRGLPADKPGYIANAIACRE-VSAVTAA 741 Query: 158 NQVFTWAWRFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 + FD + A D D+ +++ + G T H Sbjct: 742 CMLVRREAFAAAGGFDEDGLSVAFNDVDLCMKI-RQAGYRIIFSADTVCEHRESLSRGDD 800 Query: 216 SSPKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRN 252 K + + ++++D A K+ + + R Sbjct: 801 FDESKLARFMLENETMRERWDEALKRDPFYNPHFSRE 837 >UniRef50_B0SAI4 Glycosyltransferase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SAI4_LEPBA Length = 254 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 58/232 (25%), Positives = 101/232 (43%), Gaps = 5/232 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ MP +N ++ S+L Q Y +WE++I+DDCS A D RI Sbjct: 2 PKVSVIMPAYNAASYLEESVNSLLSQSYKDWELLIIDDCSKDHTLDVAKNIAKKDKRIRV 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I + NSG+ RNQ I LA GEYI +D DD W N L + ++ + +F +++ Sbjct: 62 IPKEKNSGSADTRNQGIELASGEYIAFLDADDLWESNFLDKMIPFMEENKS-SFSFSSYR 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW-RFKECLFDTELKAAQ-DY 182 + + + L + +S R N +G + + FD LK+ + DY Sbjct: 121 IIDENGF-EFCEPFLVEQKSFSYRNLLHYNRVGLLTAIYHIPTVGKKYFDPSLKSLRDDY 179 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 ++L ++ + + E + G ++ K +F R ++ Sbjct: 180 ALWLDILRDGRVAFANSEILARYRVRRGAAT-SNKKKVMIAHFKMLRNRENL 230 >UniRef50_B3QJY1 Glycosyl transferase family 2 n=1 Tax=Rhodopseudomonas palustris TIE-1 RepID=B3QJY1_RHOPT Length = 1032 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 58/237 (24%), Positives = 98/237 (41%), Gaps = 12/237 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P SI +PT+N+ Q + S+ Q +WE I+VDD ST A DPR Sbjct: 4 PTFSIVVPTYNQAQYLGACLDSIASQTDGDWEAIVVDDGSTDGTAALADDYAARDPRFRV 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 IH N G + N+ + A+G++I + DD + P +L + +Q F ++ Sbjct: 64 IHQP-NGGVASALNEGLRQARGQWIHWLSSDDLFDPRKLEINREQIRQHPDCKFFFSFFR 122 Query: 125 VCQ---GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELKAAQ 180 + + E+ PL + LF++ I G + FD L+ AQ Sbjct: 123 LLRESTQELTDHGLWGPLPDRESQIPTLFFRNYISGITICVERTAWQSVGSFDPSLRYAQ 182 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK-----KFSGYFHFYRKHK 232 DYD++LR++ + P + + + + NH P+ F +H+ Sbjct: 183 DYDMWLRLLARF--PARFIDQWTVTNRNHALQGSEVFPQACYYDTAKAAIQFLNQHR 237 >UniRef50_Q46FX2 Putative uncharacterized protein n=1 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46FX2_METBF Length = 783 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 57/259 (22%), Positives = 105/259 (40%), Gaps = 8/259 (3%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALNDPR 61 P ISI MP +N + +AI SV+ Q Y+NWE+ I DD ST + D R Sbjct: 260 KPKISIIMPVYNVDEIWLEKAIDSVINQIYTNWELCIADDASTKPHIKSTLSKYSELDSR 319 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N G N+A++LA GE+I +D+DDE + + L + + +Y+ Sbjct: 320 IKVKYLSNNRGISGASNEALLLATGEFIGLLDNDDELSVDALYEVVKVLNKKSDIDLIYS 379 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKAAQ 180 + + ++ + + K +S + + N + + V + F + +Q Sbjct: 380 D----EDKIDMKGNRCNPFFKPGWSPDVLFSVNYVCHFAVIRKTIVDEVGRFRIGFEGSQ 435 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+FLR+ + + + + + G + K + R +D R Sbjct: 436 DYDLFLRVAEKTNQVEHIPKILYHWRMIPGSTALNIFSK-DKAQINGIRSLQDYLYRKKI 494 Query: 241 KYQLFTLYQIRNKRMTWRT 259 + N ++ + Sbjct: 495 DGKAAATINRTNYQVDYEI 513 Score = 55.2 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 35/260 (13%), Positives = 82/260 (31%), Gaps = 43/260 (16%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDY-SNWEMIIVDDCSTSWEQ--LQQYVTAL 57 +K +PL+SI + ++ + I S+++ + +E++++ S E+ + + Sbjct: 513 IKGSPLVSIII-HFDDNVHVKKCINSIIKNTEETRYEILLM--VSKKNEKQLSINFDPSN 569 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV-----FLAHKQQ 112 P I + D + A+ N A + G+Y+ ++ R+ ++ + Sbjct: 570 KQPAIKVLTYDESWNVSAINNFAAEKSSGDYLLFMNS-------RIETTNRNWLISLLKN 622 Query: 113 LVTHAFLYANDYV---------------CQGEVY-----SQPASLPLYPKSPYSRRLFYK 152 + GE Y + + + +SR Sbjct: 623 AGREGIGCVCPKILNVDGTIKQAGIILGLNGEKYDVFSGNPENNWTNFGLDTWSRDYLAV 682 Query: 153 RNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQI---LHINH 209 + F+ FD D D+ LR+ + + + + + Sbjct: 683 SG--ECLMINKKTFFQAGGFDENKYYGNDIDLCLRVHKKGFRNLCIGTVSVYQLGIGLRK 740 Query: 210 GEMQITSSPKKFSGYFHFYR 229 E+ T + Y F Sbjct: 741 DEISPTDVQRILEPYRIFLE 760 >UniRef50_B8HRB2 Glycosyl transferase family 2 n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HRB2_CYAP4 Length = 373 Score = 193 bits (492), Expect = 5e-48, Method: Composition-based stats. Identities = 59/260 (22%), Positives = 100/260 (38%), Gaps = 14/260 (5%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N PL+S+ +P +N + + +KSVL Q Y+N E+++VDD S A +DPR Sbjct: 8 SNLPLVSVIIPAYNAEAFIQKTLKSVLTQTYTNIEVLVVDDGSQDRTAEIVQAIAKDDPR 67 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA-HKQQLVTHAFLY 120 + + N+G A RN+AI +GE++ ID DD W P L + + F+Y Sbjct: 68 VDLL-QQSNAGVAAARNKAIQATRGEFVAPIDADDIWYPQNLEKQVQCLLKSDSNVGFVY 126 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN---QVFTWAWRFKECLFDTELK 177 + + + + +L + IGN + + ++ EL+ Sbjct: 127 SWSLEIDEQGLLSGGFHAAWHQGEVYLKLLI-QFFIGNASATLIRRVCLEETNGYNCELR 185 Query: 178 -----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY--RK 230 +D+D++LR+ Y V E M S R Sbjct: 186 AMEAQGCEDWDLYLRLAERY-HANVVPEFLVGYRQLKNSMSRNLDLMARSQSLVLEASRN 244 Query: 231 HKDKFDRASKKYQLFTLYQI 250 H + A ++ Y Sbjct: 245 HHPELPTAVYRWSTSRFYIY 264 >UniRef50_B7JHV6 Glycosyl transferase, group 2 family protein n=52 Tax=Bacillus RepID=B7JHV6_BACC0 Length = 266 Score = 193 bits (492), Expect = 5e-48, Method: Composition-based stats. Identities = 60/233 (25%), Positives = 93/233 (39%), Gaps = 9/233 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +S+ P++N + I SV Q Y NWEMIIVDD ST + D R Sbjct: 9 NKAPHVSVITPSYNSIRFIGETIVSVQNQSYENWEMIIVDDASTDESVTKIKEIIEGDSR 68 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N GA RN AI A+G YI +D DD W P++L L +++ +F YA Sbjct: 69 IRLVSLKENIGAAKARNIAIQEARGRYIAFLDSDDIWLPHKLKTQLLFMEEM-NVSFSYA 127 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ- 180 + + + + Y IIG +E + E+ + Q Sbjct: 128 SYSLIDENGNELNRKVNVPKSVDYHC--LAGNTIIGCLTVIID---RERIPHIEMPSVQP 182 Query: 181 -DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 D ++L+++ E E +++ I + F Y+ R K Sbjct: 183 EDTALWLKLLHEGHEAKGIQQVLAKYRIVANSVSRNKIKAAFR-YWKLLRDQK 234 >UniRef50_C7M1X8 Glycosyl transferase family 2 n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M1X8_ACIFD Length = 1084 Score = 193 bits (492), Expect = 5e-48, Method: Composition-based stats. Identities = 57/229 (24%), Positives = 99/229 (43%), Gaps = 13/229 (5%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-- 58 ++ PLIS+ MP +N + + AI+SV Q Y +WE+ I DD ST E ++ +T Sbjct: 536 EHPPLISVLMPVYNTPIRHLVTAIESVRAQWYPHWELCIADDASTDPE-IRPILTRYQEA 594 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 DPRI D N G A N A+ LA G+++ +D DDE + L + + Sbjct: 595 DPRIKVAFRDENGGISANSNTALTLANGKFVAYLDADDEISEVALLHYAREIHEYPGVEL 654 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 L+ + + ++ Y K S L +N + + V+ + +E Sbjct: 655 LFCD----EDKITEDGDRSDPYFKPSLSPALLLGKNCVTHLGVYRTDTVRRLGGMRSEFD 710 Query: 178 AAQDYDIFLRMV----VEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 +QD+D+ LR + +++ ++ HG T K ++ Sbjct: 711 GSQDWDLALRFLPIVGIDFTRARRIPRLLYHWRRIHGSTATTLRSKSWA 759 >UniRef50_A0KUZ3 Glycosyl transferase, family 2 n=7 Tax=Gammaproteobacteria RepID=A0KUZ3_SHESA Length = 249 Score = 193 bits (491), Expect = 5e-48, Method: Composition-based stats. Identities = 56/229 (24%), Positives = 99/229 (43%), Gaps = 7/229 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+SI MP++N + +I+SV+ Q Y NWE++I DD S + D Sbjct: 1 MQEQPLVSIIMPSYNSAKTIAESIESVISQTYQNWELLITDDVSVDNTRDIVRSYCAKDS 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI D+NSGA A RN +I ++G YI +D DD W ++LS +A +L Y Sbjct: 61 RIKLFELDVNSGAGASRNNSIENSKGIYIAFLDSDDLWFSDKLSTQIAFM-KLNNVLLSY 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + + + P S + N+IG + + + ++ Q Sbjct: 120 S----AYQKFSREGDGGVIIPPSSVTYEELLTGNVIGCLTAIYNAKVLGKRYMPLIRKRQ 175 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 D ++L ++ + + + + ++ G + + FYR Sbjct: 176 DMGLWLNILEDIDMAYGISDVLAKYRVDTGMT--KNKLNILKWQWAFYR 222 >UniRef50_C0ECJ2 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0ECJ2_9CLOT Length = 658 Score = 193 bits (491), Expect = 5e-48, Method: Composition-based stats. Identities = 73/284 (25%), Positives = 109/284 (38%), Gaps = 47/284 (16%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ + WN + R I+S+L Q Y N E+I DD ST A D R+ Sbjct: 3 PKVSVLIACWNSEHTIQRCIESILSQTYLNLEIIACDDGSTDGTFGLLQEFAQRDSRVIP 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN-- 122 + + +N GA + RN + A+G YI D DD +P+R+ + + AF+ A Sbjct: 63 LRSPVNRGAASARNLCLASARGAYIAIQDADDYSSPDRIKKEVDFLEHHPEFAFVSAGMV 122 Query: 123 --DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQV--FTWAWRFKECLFDT--EL 176 D V + YS P S+PL R F + V F + E Sbjct: 123 RFDEVGEWSWYSPPRSVPL-------RWDFLSGPPFAHAVTLFRRTALQAVGGYRVAWET 175 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 + QDYD+F R+ + G+ +FY + ++ F Sbjct: 176 RRGQDYDLFFRLCAAGYRGANL-----------------------PGFLYFYLQDRNAFS 212 Query: 237 RASKKY----QLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 R +Y + L R RM R L L K LA G+ Sbjct: 213 RKQYRYRVAESVIRLKGYRQNRMLLRGLPYLF-----KPLAAGL 251 >UniRef50_A9AAL8 Glycosyl transferase family 2 n=2 Tax=Methanococcus maripaludis RepID=A9AAL8_METM6 Length = 280 Score = 193 bits (491), Expect = 5e-48, Method: Composition-based stats. Identities = 52/232 (22%), Positives = 92/232 (39%), Gaps = 4/232 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 +NNPL+S+ MP +N ++ AI+S+L Q Y ++E II+DDCST A D R Sbjct: 5 ENNPLVSVIMPNYNNEKYLAEAIESILNQTYEHFEFIIIDDCSTDNSWHIIQNYAKKDNR 64 Query: 62 ITYIHNDINSGACAVRNQAIMLAQ--GEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 I N++N RN+ L +YI D DD P RL + ++ V + + Sbjct: 65 IKMFRNEVNLKIVKTRNKGFELMSPHSKYIAIFDSDDVSMPERLEKQINFSEKNVDYGLI 124 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKA 178 ++ + + + K V + +D + Sbjct: 125 GSHTLIIDENSKIIGYRKYETDLKKIIKNMVKKNQFAQPSVMIRKSVLDDVGYYDEKYTR 184 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 QDYD++ R+ + + ++E I+ + + T + +K Sbjct: 185 CQDYDLWFRIAKSF-KIENLDEFLIKYRISKTQGKSTHLKESLDFTLDIQKK 235 >UniRef50_C5SH24 Glycosyl transferase family 2 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SH24_9CAUL Length = 302 Score = 193 bits (491), Expect = 6e-48, Method: Composition-based stats. Identities = 58/264 (21%), Positives = 109/264 (41%), Gaps = 8/264 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 ISI +PT+NR + SV Q YS++E I+VDD S + + V ++ DPR YI+ Sbjct: 4 ISIIIPTYNRSSRIKDTLLSVKNQTYSSFECIVVDDGSADASETRAVVESMGDPRFRYIY 63 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 N+GA RN I A G+++ +D DD + P++L + A +++ V Sbjct: 64 Q-ANAGASKARNTGIDSAAGKFVCFLDSDDVFLPHKLERQYNKISEYKN-ALIFSQLLVD 121 Query: 127 QG--EVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDI 184 +G + + +P P + + + + + + F+ L ++QD D Sbjct: 122 RGVEKKWIKPPRGPKSGERVDEYLMCTSGWMQTSTLMLNTNVARSVRFNEMLPSSQDTDF 181 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS----K 240 +R V P +EE +L+ + +++ K + S + Sbjct: 182 AIRCVNAGAVPVFIEEPLIVLNDVYDPSRVSKQTKYQPLLDWIENMRHTQISEKSYWGYR 241 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLL 264 +Q + NK + + + Sbjct: 242 GWQCSRVASYSNKGLAVKLFFSSF 265 >UniRef50_C4F932 Putative uncharacterized protein n=1 Tax=Collinsella intestinalis DSM 13280 RepID=C4F932_9ACTN Length = 974 Score = 193 bits (491), Expect = 6e-48, Method: Composition-based stats. Identities = 67/272 (24%), Positives = 113/272 (41%), Gaps = 16/272 (5%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 SI MPT+N + AI SVL Q YSNWE+ IVDDCSTS E L++Y+ +LN + Sbjct: 50 KFSILMPTFNVDVRWVSMAIDSVLAQSYSNWELCIVDDCSTSAE-LKEYLLSLNTDNVRV 108 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL-SVFLAHKQQLVTHAFLYAND 123 D N G N A +A G+Y+ +D+DD TPN L +FL +Y+++ Sbjct: 109 RLLDTNLGISGATNCAADMATGDYLVLLDNDDMLTPNALFELFLRATSDNP--DIMYSDN 166 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 + L + K +S L + +G+ + F F E +QDY Sbjct: 167 DIID----ENGNRLSVLFKPDWSPDLMLSQMYVGHLLAFRKNIFDAVGGFRNEFNGSQDY 222 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF------HFYRKHKDKFD 236 D+FLR+ ++ + + K ++ + + K Sbjct: 223 DLFLRLSLQSERISHISKILYSWRALPSSTAANPDSKPYAQFAGLNAVQSYLDSKYGKGA 282 Query: 237 RASKKYQLFTLYQIRNKRMTWRTLLTLLSVRN 268 ++ + +Y +R + ++ ++ Sbjct: 283 ASAFETDDLFVYDVRYNVPNKPLVSIVIPTKD 314 Score = 87.2 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 42/262 (16%), Positives = 91/262 (34%), Gaps = 26/262 (9%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTAL--ND 59 N PL+SI +PT + + AI S++ + Y N+E++I+D+ S Y + D Sbjct: 302 NKPLVSIVIPTKDHAEDLRIAIDSIVERSSYQNFEILILDNNSEEAS-THNYFDEILGKD 360 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-----WTPNRLSVFLAHKQQLV 114 R+ I+ + N + A+G+ +++D E W + + +V Sbjct: 361 DRVRVINASFPFNWSKLNNYGVRHAKGDVFVFLNNDTEVISEDWLDRLVEHAIREDVGVV 420 Query: 115 THAFLYANDYVCQGEV---------YSQPASLPLYPKSPYSRRLFYKR--NIIGNQVFTW 163 LY + + V + + P++ +P+ L + + G + Sbjct: 421 GGLLLYPDGSIQHAGVIIGMGGWADHVYKGAQPIHYGNPFISPLVTRNVSAVTGACMAIS 480 Query: 164 AWRFKE-CLFDTELKAAQ-DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 F + F+ + D ++ L + + + + H E + + Sbjct: 481 REHFDQLGGFNEDFIVCGSDVELCLNAMEHGMR----NVYSPYIRLKHYESKTRDAKDIP 536 Query: 222 SGYFHFYRKHKDKFDRASKKYQ 243 F F + + Sbjct: 537 EVDFRLSETMYRTFRSSGDPFY 558 >UniRef50_C2Y456 Teichuronic acid biosynthesis glycosyl transferase n=2 Tax=Firmicutes RepID=C2Y456_BACCE Length = 253 Score = 193 bits (490), Expect = 6e-48, Method: Composition-based stats. Identities = 67/236 (28%), Positives = 107/236 (45%), Gaps = 8/236 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M L+SI P +N ++ I+SV+ QDY+NWE+IIVDD S +L ++ Sbjct: 1 MYMRELVSIITPVYNTEKYLRNTIESVINQDYANWELIIVDDYSKDRSRLIIEQYMESEK 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I D N G RN AI A G+YI +D DD W PN+LS+ L Q+ +AF + Sbjct: 61 RIKLIALDENRGVTFARNTAIKAANGKYIAFLDSDDLWHPNKLSIQLGFMQKN-NYAFSF 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK--A 178 ++ Y E ++ + P RL N IG KE + D E+ Sbjct: 120 SS-YEWMNEDGTRFNKIIEAPSVVDYHRLLRGGNPIGCLTVVID---KEQVEDIEMPDLR 175 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 +DY +L ++ + + + + + ++ K ++ YRK++ Sbjct: 176 HEDYATWLSIMKSGVIAYGINQNLALYRKS-SNSLSSNKFKTIIWTWNVYRKNQKL 230 >UniRef50_B7MWU8 Glycosyltransferase n=1 Tax=Escherichia coli ED1a RepID=B7MWU8_ECO81 Length = 268 Score = 193 bits (490), Expect = 7e-48, Method: Composition-based stats. Identities = 63/244 (25%), Positives = 110/244 (45%), Gaps = 9/244 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + +SI MPT+N +I SV+ Q Y++W + I+DD ST + ++ D Sbjct: 7 MSSYETVSIIMPTFNAASYISYSIISVIEQSYNDWHLYIIDDGST--DNTAHEISNFTDL 64 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RITY+ INSG RN I +A+G++I +D DD W +LS+ + ++ + + Sbjct: 65 RITYVMLPINSGVANARNIGIKMAKGKFIAFLDSDDIWEKEKLSLQIPLLEK--GYDVVC 122 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 +N + + E S + YSR + N IGN + E + Sbjct: 123 SNYAIFKNEPKKIIGSRTAPQQITYSR--LLRGNCIGNLTGIYNRERLGKCLQEEC-GHE 179 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DY ++L++V G + +++ + + ++ K ++ YRK K K + S Sbjct: 180 DYLMWLQLVKLSGSAYCIQKNLARYRVTEHSVS-SNKLKAAFWQWNIYRK-KLKLNILSS 237 Query: 241 KYQL 244 Y Sbjct: 238 IYYW 241 >UniRef50_C1DRS5 Glycosyl transferase, family 2 n=1 Tax=Azotobacter vinelandii DJ RepID=C1DRS5_AZOVD Length = 1182 Score = 192 bits (489), Expect = 8e-48, Method: Composition-based stats. Identities = 66/279 (23%), Positives = 111/279 (39%), Gaps = 18/279 (6%) Query: 4 NPLISIYMPTWNRQQLAIRAI-KSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL SI +P +N + A+ SV Q Y +WE+I+VDD S S E+ ++ + ++DP+I Sbjct: 654 TPLFSIVVPAYNTSSELLDAVLSSVRAQWYPHWELILVDDASPS-EETRRALAEIDDPKI 712 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 +H + N G N + AQGE+I +D DDE T + L + F+Y++ Sbjct: 713 RVLHLESNKGISGATNVGLAAAQGEFIVFMDHDDELTVDCLYELALCINR-DQPDFIYSD 771 Query: 123 -DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 D + + Y+QP + K +S + K +E Q Sbjct: 772 EDKLTEEGEYTQP-----HFKPDWSPDTMMSTMFTCHVSCVRRSLLSKVGELRSEFDGCQ 826 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+D LR+V + + I + S K + R D +R Sbjct: 827 DWDFILRVVEHTNRISHIPKVLYHWRIIPASVASDISAKPYV-LEASRRVRLDALERRGL 885 Query: 241 KYQLFTLYQIRNK-------RMTWRTLLTLLSVRNGKRL 272 K + + Q+ + + + + S NG L Sbjct: 886 KGSIEPVAQVPGYFRVNYHLQGSPLISIIIPSRDNGSVL 924 Score = 95.3 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 56/254 (22%), Positives = 98/254 (38%), Gaps = 20/254 (7%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALND 59 ++ +PLISI +P+ + + R + S+ + Y N+E+II+D+ S Y+ L + Sbjct: 905 LQGSPLISIIIPSRDNGSVLRRCLDSIQEKSSYRNFEIIILDNGSVEAS-TVAYLKELQE 963 Query: 60 PRI-TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWT-PNRLSVFLAHKQ----QL 113 + I +D + N A GE + ++DD E + L + Q Sbjct: 964 KGVAQIIRHDAPFNFSELNNIGARTAGGELLLFLNDDTEVLCNDWLERMGGYAQLVHIGA 1023 Query: 114 VTHAFLYANDYVCQGEVYSQPASLPLY-------PKSPYSRRLFYKRNII---GNQVFTW 163 V LY + Q A+ P++ + Y R + N + G + Sbjct: 1024 VGAKLLYPDSSEIQHAGVLNLANGPVHAFLRHHSERPGYFMRNLLEYNWLAVTGACLMME 1083 Query: 164 AWRFKE-CLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 A++F E FD L A D ++ +R V + + T I H + P KF Sbjct: 1084 AYKFNELGGFDETLPVAYNDIELCIRAVEKGYYNVVCQSVTLIHHESVSRGLDHVDPVKF 1143 Query: 222 SGYFHFYRKHKDKF 235 + R+ D Sbjct: 1144 ARLQRELRRLYDMH 1157 >UniRef50_B8HLM9 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HLM9_CYAP4 Length = 380 Score = 192 bits (489), Expect = 9e-48, Method: Composition-based stats. Identities = 63/269 (23%), Positives = 105/269 (39%), Gaps = 13/269 (4%) Query: 1 MKNNP-LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M + P L+S+ +P +N + + + SVL Q Y N E+I+VDD S A D Sbjct: 1 MADEPALVSVIIPAYNAEAFIAQTLTSVLNQTYQNLEVIVVDDGSHDQTGEIIQKFAQKD 60 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PR+ ++ N G A RN AI + G+YI ID DD W P++L + Q T L Sbjct: 61 PRVRWL-QQANGGVAAARNLAIRASLGQYIAPIDADDIWFPHKLEKQVNCLSQAETRVGL 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN---QVFTWAWRFKECLFDTEL 176 V E Y + +N +G+ + + K ++ L Sbjct: 120 VYAWSVDIDEWGELLGGFHAYQFEHNVHHILAYKNFLGHASAPLIRRSCLDKVGGYNCRL 179 Query: 177 K-----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF--HFYR 229 + +D D++LR+ Y V+E G M + S R Sbjct: 180 RAQNAQGCEDLDLYLRIAEHYDFRV-VKEFLIGYRRVIGSMSCNYTAMARSYLLVQSEIR 238 Query: 230 KHKDKFDRASKKYQLFTLYQIRNKRMTWR 258 + + A+ ++ T Y ++ ++ Sbjct: 239 RTNAEIPDAAFRFFNSTFYLYLARQSSFN 267 >UniRef50_B0S8S5 Glycosyltransferase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0S8S5_LEPBA Length = 328 Score = 192 bits (489), Expect = 9e-48, Method: Composition-based stats. Identities = 55/240 (22%), Positives = 97/240 (40%), Gaps = 18/240 (7%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE------------- 48 + PL+SI +PT NR+ L RA+ SVL Q Y NWE+ I+DD ST Sbjct: 4 EEFPLVSIIIPTHNRRHLVERAVLSVLNQTYPNWELHIIDDGSTDDTWSYLLTQLMGWKR 63 Query: 49 QLQQYVTALNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA 108 Q+Q + L +I H + G A RN I + G+++ +D DDEW +L + Sbjct: 64 QIQSFGRMLKSIQI---HQTEHRGVSAARNFGIQKSLGDWVCFLDSDDEWFKEKLEKQIQ 120 Query: 109 HKQQLVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK 168 + F N+ + +P + + + + Sbjct: 121 YHTDHKELYFSQTNEIWNKNGNLLEPKGKYKKLSGHFLEESLALCMVTCSSFIAHKQTLE 180 Query: 169 ECL-FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF 227 F E+K +DYD++ R++ + + E + + H E Q+++ + + + Sbjct: 181 NIGSFREEMKTCEDYDLWNRILYQGHSIGLLAENLLVRYGGH-EDQLSNKFQAIERFRLY 239 >UniRef50_C6XYA6 Glycosyl transferase family 2 n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XYA6_PEDHD Length = 338 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 59/248 (23%), Positives = 106/248 (42%), Gaps = 5/248 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P I+++M +N +IKS+L Q ++++E++IV+D ST ++ Q V ND RI Sbjct: 3 PKITVFMAAYNAANYINESIKSILSQTFTDFELLIVNDGST--DETVQLVLEFNDSRIRL 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 +HND N G RN A+ A+GEY+ +D DD PNRL + Q+ +A + Sbjct: 61 VHNDKNRGLVYTRNVALKEARGEYLAILDSDDIALPNRLELQYRFLQENPGYALCGGHGK 120 Query: 125 VC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDY 182 V G++ + L + + + V KE + A+DY Sbjct: 121 VIDKDGKLLDDNRLIVPTGNEHIKMTLLFINTYVNSTVMYKTTVLKELHGYKDFAPAEDY 180 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 ++F+R+ +Y ++E + + + ++ + + +K Sbjct: 181 ELFIRIAEKY-PVDNLDEVLVKYRDHESNTSVLHADTSWTKLYEIKKIQLTNLQIQPEKR 239 Query: 243 QLFTLYQI 250 LY I Sbjct: 240 LTDALYSI 247 >UniRef50_Q5WBN3 Glycosyltransferase n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WBN3_BACSK Length = 271 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 68/254 (26%), Positives = 107/254 (42%), Gaps = 17/254 (6%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 IS+ +PT+NR+++ +I SVL+Q E+I+VDD ST + Q + ++ + I Sbjct: 4 ISVVIPTYNREKVIKESIDSVLKQTLQPIEIIVVDDFST--DNTLQILKKYDNGTLKVIK 61 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 N GA RN I+ A+G+ I D DD W N+L + L + + Sbjct: 62 NQFKKGANGARNTGILFAEGDLIAFQDSDDTWDENKLQLQLEKLTSEDADICFCSINTGD 121 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAAQDYDIF 185 + Y AS +Y + RN I Q + K+ +FD ELK QD+D Sbjct: 122 RKLPYRTLASEEIYRQ-------LRIRNFISTQSILIKTAVAKKIMFDEELKRYQDWDFV 174 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA----SKK 241 LR + ++ I IT FH +KHK+ + + K Sbjct: 175 LRASESFKIVHC---PHALVDIRIQTDSITKKVNNMEALFHLAKKHKELLEEKPINPALK 231 Query: 242 YQLFTLYQIRNKRM 255 Y+L R K++ Sbjct: 232 YKLLMYSSYREKKI 245 >UniRef50_D0IZR1 Glycosyl transferase, family 2 n=1 Tax=Comamonas testosteroni CNB-2 RepID=D0IZR1_COMTE Length = 773 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 64/279 (22%), Positives = 114/279 (40%), Gaps = 16/279 (5%) Query: 3 NNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALNDP 60 + P+ISI +PT+N + L I SVL Q Y W++ I DD ST E A D Sbjct: 228 SKPVISILLPTYNTPEVLLRECIDSVLGQSYPYWQLCIADDASTLPEVRAVIEEYAFADS 287 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N N A+ LA GE++ +D DD L + ++ +Y Sbjct: 288 RICFTQRKHNGHISECSNSALELASGEWVALLDHDDCLPEYALFEVVKALKEHPQAQIIY 347 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAA 179 ++ + ++ Q + K +S L + +N + + + +E F + + Sbjct: 348 SD----EDKLDLQGRRCDPFFKPDWSPDLLFSQNYMTHLLVYRHQLLREVGGFRKGYEGS 403 Query: 180 QDYDIFLRMVVEYGEPW--KVEEATQILHINHGEMQITSS--PKKFSGYFHFYRKHKDKF 235 QDYD+ LR + E + ++L+ Q T+ +K +D Sbjct: 404 QDYDLLLRCIASLPEATQNNILHIPKVLYHWRMTEQSTAMGHERKDYATPAALCALQDFM 463 Query: 236 DRASKKYQLFTL----YQIRNKRMTWRTLLTL-LSVRNG 269 DR ++ + Y+ R + + L++L + R+G Sbjct: 464 DRCHPGVRMEVVQPGIYRARWPQPSRLPLVSLIIPTRDG 502 Score = 80.3 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 53/244 (21%), Positives = 88/244 (36%), Gaps = 29/244 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSV-LRQDYSNWEMIIVDDCSTSWEQLQQYVTALN----- 58 PL+S+ +PT + + I+S+ + Y N+E+++VD+ ST Y L Sbjct: 491 PLVSLIIPTRDGLEELRTCIESIWEKTTYPNYEILVVDNQST-CSYTLDYFKELEADTRY 549 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHK---QQLV 114 D RI + D A+ N A+ A GE + I++D + P LS ++H Sbjct: 550 DGRIRVLSYDHPFNYSAINNFAVQHANGEVLGLINNDVEVIGPEWLSEMVSHAIRPNIGC 609 Query: 115 THAFLYANDYVCQ--------GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR 166 A LY D Q G V Y RL N+ V Sbjct: 610 VGAKLYYPDGTLQHAGVVLGIGGVAGHSHKYYSRSDGGYFGRLLTIHNVY--AVTGAVLL 667 Query: 167 FKECLFDT-------ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 ++ +FD L+ A D D +++ + + H + + S Sbjct: 668 VRKSVFDEVGRLDDFGLRVAFNDVDFCIKVQKAGYQNIFTPFSELYHHESKTRGRDDSPE 727 Query: 219 KKFS 222 K+ Sbjct: 728 KQVR 731 >UniRef50_Q10VK0 Glycosyl transferase, family 2 n=3 Tax=Oscillatoriales RepID=Q10VK0_TRIEI Length = 1035 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. Identities = 53/207 (25%), Positives = 102/207 (49%), Gaps = 7/207 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ + T+N + AI S+ Q Y+++E+I++DD ST + +Q + D +I Y Sbjct: 716 PRVSVIIATYNNAHYILEAIASIFNQTYTSYEIIVIDDGST--DNTRQVLEPYLD-KICY 772 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 ++ + N G RN + +AQGE+I+ +D DD + P++L+ +A + +++ Sbjct: 773 VYQE-NKGVSHARNLGLEIAQGEFISFLDADDFFLPDKLAKQVAVFDAHPSLGIVHSGWR 831 Query: 125 V--CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDY 182 + +GE S P+ + +K I + +F+ +W FDT +D Sbjct: 832 LVNKKGEKISDIELWHSSPELDLETWVVWKPVTI-SMMFSKSWIKSVGGFDTRWHHGEDI 890 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINH 209 D+ LR+ V+ E + + T +H Sbjct: 891 DLVLRLSVDGCEAMWLPKVTYCYRQHH 917 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 55/211 (26%), Positives = 93/211 (44%), Gaps = 10/211 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE--QLQQYVTALNDPR 61 NP +S+ +P +N + +AI+SVL Q Y+++E+I+++D ST LQ Y+ Sbjct: 2 NPKVSVIIPVYNCELYIAQAIESVLNQTYTDYEIIVINDGSTDNTHQVLQPYMKK----- 56 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y + + N G A RNQ I +A+GE I +D DD + +L +A + + Sbjct: 57 IRYFYQE-NKGLSATRNQGIKMAKGELIALLDADDLFLCYKLQEQVAIFDAQPNIGLVQS 115 Query: 122 NDYVC--QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 V +GE PK L +K +F W K F+ L+ Sbjct: 116 GWRVVNEKGEKIEDIEPWYKSPKLDLVSWLKWKATNPSAMMFRKEWLEKVNGFNENLRRL 175 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHG 210 +D+DI +R+ + + + + G Sbjct: 176 EDFDIVIRLALASCQATWFPKVAVCYRQHSG 206 Score = 108 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 65/176 (36%), Gaps = 6/176 (3%) Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y++ + N G RN I +AQGEYI +D DD + P +L+ + + T + Sbjct: 446 IKYLYQE-NQGPSIARNYGIEIAQGEYIAFLDADDFFLPEKLAEQVGCFAEDSTLTMVQT 504 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRR--LFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 + + P + + L ++ + +F W FD Sbjct: 505 GWRFVDEKGNTIKDVEPWENSTELNLENWLIWQATLPSAMMFKSKWLKVNGGFDKRYFGI 564 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +D +I LR+ + G+ + + + + + +K + K D F Sbjct: 565 EDLEIVLRLALTGGKATWLRKVCVCYRQRNSSVSGLKNREKIT---QELEKLLDDF 617 Score = 91.8 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 14/82 (17%) Query: 1 MKNN-PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE--QLQQYVTAL 57 M N P++S+ +PT+N ++AI+SV Q Y+ WE+II+DD ST L+ Y+ Sbjct: 320 MPNGLPVVSVVIPTFNNADYILKAIESVCNQTYTLWEIIIIDDGSTDNTYQVLESYLNTT 379 Query: 58 NDPRITYIHNDINSGACAVRNQ 79 + ++ N A RN+ Sbjct: 380 D-----FLQN------SAARNR 390 >UniRef50_Q5V6E2 Glycosyl transferase n=1 Tax=Haloarcula marismortui RepID=Q5V6E2_HALMA Length = 318 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. Identities = 62/238 (26%), Positives = 108/238 (45%), Gaps = 11/238 (4%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N P +S+ +PT++R ++ A+KSV Q Y + E+++VDD S + + ++ + Sbjct: 2 NEPPVSVVLPTYDRPEILRNAVKSVQEQTYRHIELVVVDDHSPTPAADTLSPLSFDELTV 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLA-HKQQLVTHAFLYA 121 I ++ N GA RN I + GEYI +DDDD+W P+++ +A ++ +Y Sbjct: 62 EIIRHEENRGANEARNTGIRESTGEYIAFLDDDDDWAPSKIEKQVAVFREASPEVGVVYT 121 Query: 122 -NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI--IGNQVFTWAWRFKECLFDTELKA 178 ++YV V + SL + + + R++ + L DT+ + Sbjct: 122 GSEYVYDDYVRTVVFSL----EGDVTEEMLRGRSLGEFTTLMVRRDVAMAAGLPDTDFPS 177 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS--PKKFSGYFHFYRKHKDK 234 QD D LR+ + V EA GE I+ + K+ Y F KH+D Sbjct: 178 WQDRDWLLRLSCH-CKFKTVPEALTTRRRKTGEDSISDNFEEKRDISYPLFIEKHRDL 234 >UniRef50_B6I8A2 Putative glycosyltransferase n=1 Tax=Escherichia coli SE11 RepID=B6I8A2_ECOSE Length = 255 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. Identities = 56/228 (24%), Positives = 99/228 (43%), Gaps = 8/228 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NPLIS+ MP +N + +++ SVL Q Y N+E+II DD ST ++ ++ + +D RI Sbjct: 3 NPLISVIMPAYNAENTIRKSVLSVLNQSYKNFELIICDDGST--DETKKIILTFSDARIK 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 +I+N GA RN AI + GEY++ +D DD W +++ L+ + F Y + Sbjct: 61 FINNKYEKGAAGARNSAIDICSGEYVSFLDSDDIWASDKIEKQLSFMIKN-DCDFSYGDY 119 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYD 183 + + + L L P L +I V F + K +DY Sbjct: 120 FTFKDD---DNTGLFLAPVKTSYEDLLKTCSIGCLTVMLHKKLLINQRFPSGYK--EDYQ 174 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 ++LR++ + A + + + ++ +H Sbjct: 175 LWLRILKDGIIAANYGSADSYYRLGKNTLSSNKIDELKKQWYVIRLQH 222 >UniRef50_C7NBG2 Glycosyl transferase family 2 n=4 Tax=Fusobacteriaceae RepID=C7NBG2_LEPBD Length = 255 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. Identities = 61/245 (24%), Positives = 106/245 (43%), Gaps = 7/245 (2%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +SI +P +N ++ + I+SVL Q Y NWEM+I++D ST + A D RI ++ Sbjct: 12 VSIIVPMYNAEKFIGKTIESVLAQTYQNWEMLIMNDVSTDNSLAIVSLYAKKDERIKIVN 71 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 + N G RN I LA G+YI +D DD W +L + ++ +A + +Y Sbjct: 72 TEKNVGVVKGRNFLIDLASGKYIAFLDADDYWHNEKLEKQIKFMKE--KNASISCTEYTR 129 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFL 186 E ++ + K S K N +G + + + EL+ +DY ++L Sbjct: 130 VKE--NEEKINDVIIKEEISYNDMLKNNYLGCLTVIYDAKKIGKRYFKELEKNEDYVLWL 187 Query: 187 RMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLFT 246 +V + + ++E + + ++ K + YRK + K F Sbjct: 188 EIVKDVNTIYGLKENLAYYRVLDNS-RSSNKVKTAKVRWEIYRKIEKL--SLLKSLYYFL 244 Query: 247 LYQIR 251 Y IR Sbjct: 245 HYAIR 249 >UniRef50_A0YLN1 Glycosyl transferase, family 2 n=2 Tax=Bacteria RepID=A0YLN1_9CYAN Length = 1237 Score = 191 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 57/263 (21%), Positives = 104/263 (39%), Gaps = 22/263 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +P +N+ +A+ SVLRQ Y ++E+IIV+D S+ +Q + + NDPRI Sbjct: 2 PRVSVVIPAYNQAAYISQAVDSVLRQTYQDFEIIIVNDGSS--DQTVEKILENNDPRIRL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH-AFLYAND 123 + N G N I AQGE I + DD + P +L + Q + A L Sbjct: 60 FSFEQNHGESIATNYGIQQAQGELIAILHSDDVFVPKKLEKQVLFLDQNPQYQAVLSYPQ 119 Query: 124 YVCQGEVYSQPASLPL--------YPKSPYSRRLFYKRNIIG-NQVFTWAWRFKECLF-D 173 + P++ L + + F K N + + + F D Sbjct: 120 LIDSQGENLPPSNSFLDQVFLQQNRTRFQWLNTFFSKDNCLCQTSSLIRKNCYSQIGFYD 179 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 + + D+D ++R ++Y E + + E ++ + +H Sbjct: 180 SRFRQIPDFDFWVRFCLKY-ELYILPEPLVNYRVHQSNISGIK--------PETIIRHNF 230 Query: 234 KFDRASKKYQLFTLYQIRNKRMT 256 + + K+Y +Y+ K Sbjct: 231 ELSQTLKRYFCQDVYENLLKIFP 253 Score = 138 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 57/273 (20%), Positives = 95/273 (34%), Gaps = 33/273 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ +PT+N + +KS Q Y + E+I+ DD S+ I + Sbjct: 361 PLVSLIIPTYNGETFLTETLKSAFSQTYPHLEIILSDDGSSDKTLEIAQSFQQKTS-IPF 419 Query: 65 -IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N G N +I AQG+YI + DD P+ + + +Q +++ Sbjct: 420 EIFPHPNYGLVENLNFSISQAQGKYIKFLFQDDLLEPHCIETLVNLAEQDSEIGLVFSAR 479 Query: 124 YVCQGEVYSQPA--------------SLPLYPKSPYSRRLFYKRNIIGNQ---------- 159 V E + + L N + Q Sbjct: 480 RVFIDENSQSNSLCQAALNGGKDLHQQWSHLKPIQLGQDLLSDANCLKGQLNKIGEPTTV 539 Query: 160 VFTWAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHIN--HGEMQITSS 217 + A K FD L D D++ R++ +Y + V + L I+ + S+ Sbjct: 540 LIPKAVFEKVGGFDENLHQLLDGDLWFRIMGDY-KIGFVNQTLSSLRIHTQQQTQKNISA 598 Query: 218 PKKFSGYFHFYRK---HKD-KFDRASKKYQLFT 246 + Y Y+K H D F A K +F Sbjct: 599 GQNLKDYQRLYQKMLFHPDYYFLTAEFKSHIFQ 631 >UniRef50_C6MPM7 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M18 RepID=C6MPM7_9DELT Length = 315 Score = 191 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 52/258 (20%), Positives = 88/258 (34%), Gaps = 9/258 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ--LQQYVTALNDPRIT 63 L S+ +P +N + +A+ SV Q +++E++I +D S ++ Y D + Sbjct: 17 LFSVVIPAYNSEPFIAKALDSVRAQTLADYEVVITNDGSKDGTVQAIEDYARRHPDFPVR 76 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 N G RN I A G +I +D DD W P++L Y ++ Sbjct: 77 LASQQ-NKGIGGARNNGIFRATGRFIAFLDADDYWHPSKLEKMARLLAGRPQVDVAYHDE 135 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF--KECLFDTELKA--A 179 G+ +P S + Y + L ++ N + K F L A Sbjct: 136 IEVAGDGTRRPLSYDEVREPAY-QDLLFRGNRLSTSATVVRRELAQKIGGFSENLDFNSA 194 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 +DY+ +LR+ + E H G + F+ H D Sbjct: 195 EDYEFWLRLARAGARFAHLGEVLGEYHRVEGSIT-QKIEYHHRNIFNVITHHLDLLRADG 253 Query: 240 KKYQLFTLYQIRNKRMTW 257 F R K+ Sbjct: 254 TCSAAFLDRMYRRKKAEH 271 >UniRef50_B2JT44 Glycosyl transferase family 2 n=1 Tax=Burkholderia phymatum STM815 RepID=B2JT44_BURP8 Length = 316 Score = 191 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 71/236 (30%), Positives = 113/236 (47%), Gaps = 7/236 (2%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP- 60 PL+S+Y+PT NR L RA+ SVL+Q Y E+++ DD ST ++Y+ L Sbjct: 6 NTYPLVSVYIPTKNRLPLLQRAVASVLQQTYPRIELVVADDGSTDG--TREYLDELAASG 63 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL- 119 + + ++ GACA RN AI + GE++TG+DDDD + P R+ F+ ++L TH Sbjct: 64 KCVAVFLPVSEGACAARNAAISMTTGEFLTGLDDDDYFLPQRIECFVQRWRELSTHGRTD 123 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 + + L+ ++ R +G QVF+ RF LFD ++ Sbjct: 124 HIAGLFDSSRWIGRAMERTLFDRACAHPRDLAAACAVGTQVFSIRERFIAAGLFDRSMQV 183 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF--SGYFHFYRKHK 232 QD+D++LRM YGE + T ++ H +I+ + F KH Sbjct: 184 WQDWDLWLRMAYRYGEFVGIRACTYVIDATHQHERISDRSEAALRRAAQLFAAKHY 239 >UniRef50_B0CGB5 Glycosyl transferase, family 2, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CGB5_ACAM1 Length = 327 Score = 191 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 52/254 (20%), Positives = 101/254 (39%), Gaps = 11/254 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ + T+N + ++SVL+Q +S+ E+IIVDD S+ + ++T++ D R+ Sbjct: 3 KVSVIITTFNSMRFFPETLESVLKQTFSDIEVIIVDDGSSDG--ISDWITSVVDSRV-VF 59 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 H N G + RN I LA+GEYI +D DD W P +L + ++ + Sbjct: 60 HAQSNQGVSSARNTGIRLARGEYIAFLDGDDFWAPTKLEQQVRVLALEDEVGLVHTWLAL 119 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYDI 184 E + + + I + V +FD+ L A+D+D+ Sbjct: 120 MDEESRLTGRVMKPQSEGAIWEEIVESNMIACSSVMVRRSCLDVVGIFDSALIVAEDWDL 179 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK-FDRASKKYQ 243 ++R+ +++ I+ +S K++ +R ++ F Y Sbjct: 180 WIRVAAV-CRISVIKDPLVQYRIHP-----SSKSKRYPEMVEDFRTIIERAFQSVPYDYL 233 Query: 244 LFTLYQIRNKRMTW 257 + Sbjct: 234 PIRNRSYGRVNLCI 247 >UniRef50_B8FY70 Glycosyl transferase family 2 n=2 Tax=Desulfitobacterium hafniense RepID=B8FY70_DESHD Length = 235 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 49/213 (23%), Positives = 86/213 (40%), Gaps = 8/213 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT+N + A++SVLRQ Y + E+I++DD ST + Q + RI Y Sbjct: 4 PKVSVVIPTYNHARYVTWAVESVLRQKYPSLEIIVIDDGST--DDTAQRLQPYQ-SRINY 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I+ N G N + A GEYI + DD + +L + + F Y + Sbjct: 61 IY-KSNGGTPNALNHGLQRATGEYICWLSADDMFLKGKLEKQVQLMEANPQVGFCYTSFI 119 Query: 125 -VCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--GNQVFTWAWRFKECLFDTELKAAQD 181 + + + A YP + I + + + FD L A D Sbjct: 120 VIDENGLKKYAADSEFYPARQELVVNLLRGCFINGSSVMMRRTALERVGYFDEGLPQAHD 179 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQI 214 Y+++ R + + + + E + M + Sbjct: 180 YELWFRFLRHFPAGY-IREPLIAYRWHGENMSL 211 >UniRef50_D2L8E2 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L8E2_9DELT Length = 316 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 51/202 (25%), Positives = 96/202 (47%), Gaps = 5/202 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ +PT +R L +RA+ SVL Q ++ E+++VDD S + ++AL DPR+ + Sbjct: 43 LVSVVVPTRDRAALVVRAVASVLAQTHAALEVLVVDDGS--VDDTVARLSALGDPRLRVL 100 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + G A RN + A+G Y+ +D DDEW P ++ LA ++ + + Sbjct: 101 SHAAGRGVSAARNTGLGAARGAYVALLDSDDEWLPAKIERQLAFMRER-KLSVSQTQEIW 159 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNII-GNQVFTWAWRFKECLFDTELKAAQDYDI 184 + P + P + + + + + A+ + FD L A +DYD+ Sbjct: 160 MRNGRRVNPTAANRKPDGFFFEQALVRCVVSPSTSMGVRAYWEEMGGFDETLPACEDYDL 219 Query: 185 FLRMVVEYGEPWKVEEATQILH 206 +LR ++ + ++E + H Sbjct: 220 WLRTLLCH-PIGLLDEYLAVRH 240 >UniRef50_Q72X17 Glycosyl transferase domain protein n=4 Tax=Bacillus cereus group RepID=Q72X17_BACC1 Length = 321 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 50/216 (23%), Positives = 98/216 (45%), Gaps = 8/216 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P I++ MP +N ++ ++ S+L Q ++++E++I++D ST + + + +D RI Sbjct: 2 PKITVLMPVYNGEKYLRESVDSILNQTFTDFELLIINDGSTDSSM--EILNSYSDSRIRI 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N++N N+ I LA GEYI +D DD P RL + L + ++ A Sbjct: 60 VTNEVNLRLIKTLNKGIDLATGEYIARMDCDDIADPKRLEIQLQYMEKHPDVAVCGTGVK 119 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAAQDY 182 V +P + + + L+ + +I V KE +D A+DY Sbjct: 120 VI--GQNRKPWQISGSSELIRNC-LYVRSCMIHPSVMMRTEFLKEHNIYYDLNYLHAEDY 176 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 ++F R+ +Y + +EE + + ++ Sbjct: 177 ELFQRLSEKY-KVINLEEPLLNYRWSETSVSSLNAS 211 >UniRef50_A0YK73 Glycosyl transferase n=5 Tax=Cyanobacteria RepID=A0YK73_9CYAN Length = 1161 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 57/274 (20%), Positives = 106/274 (38%), Gaps = 11/274 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +S+ +P +N + +A++SVL Q ++E+I+VDD ST + +D RI Sbjct: 7 TPQVSVIIPAYNGSRYISQAVESVLSQTDCHFEVIVVDDGSTDNTNT--ILQHYSD-RIR 63 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y N G A RN+ I A+ E I +D DD + P++L+ A Q+ + + Sbjct: 64 YFSQ-KNQGVAAARNRGIQEAKAEIIALLDQDDIFLPHKLAEQTACFQKFPEVGLVNSGW 122 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ--VFTWAWRFKECLFDTELKAAQD 181 + E Y+ P + + + I +F W K F++ D Sbjct: 123 RLINREGYNISEIEPWHNLPDLDLQTLIIHSPILPSAIMFRRTWWEKIGGFNSRFNGVDD 182 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 + ++ + + + T +H Q S K + F + Sbjct: 183 AEFVWQLAARGCQAIWLPKITVEYRQHH---QTVSHQKAVERAETVIVAQNNFFSQKGLS 239 Query: 242 YQLFTLYQI-RNKRMTWRTLLTLLSVRNGKRLAD 274 + L + R + +TW L + + +A+ Sbjct: 240 DAVIQLEKPARYEELTW-LAWHLYHTNHYQSMAE 272 Score = 188 bits (477), Expect = 2e-46, Method: Composition-based stats. Identities = 62/287 (21%), Positives = 118/287 (41%), Gaps = 16/287 (5%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N P +S+ +P +N + +A+KSVL Q Y+++E+I++DD ST + + ++ D Sbjct: 328 SNLPKVSVIIPAYNCENYIEQAVKSVLEQTYTDYEVIVIDDGST--DNTKNILSPYLDI- 384 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y++ N GA RN+ +AQGE + +D DD +TPN+L+ + +Q T + + Sbjct: 385 IQYVYQS-NQGAAKARNKGCQIAQGELLAFLDGDDFFTPNKLAEQVKVFEQDATIDLVQS 443 Query: 122 NDYVCQ--GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 V GE P+ + +K + K FD Sbjct: 444 GWMVVNKTGENLVDVLPWKNAPELNLETWVLHKCVRPSALMIRKICWKKVGGFDHSYPPT 503 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINH------GEMQITSSPKKFSGYFHF--YRKH 231 +D D LR+ + + V++ + GE I ++ + +F+ K+ Sbjct: 504 EDLDFVLRLALMGCKAVWVKQLHAGYRQHDCNLMSGGERVIKNTELLMNQFFNRRDLPKN 563 Query: 232 KDKFDRAS--KKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 + + +++ F R+ + + T S+ I Sbjct: 564 IQQLKQKEHYQRWVWFAWRMYRDSHLKFVTYCLKKSLDYTSDFKANI 610 >UniRef50_B7JW84 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7JW84_CYAP8 Length = 1177 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 61/259 (23%), Positives = 113/259 (43%), Gaps = 21/259 (8%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 +N PL+S+ +P +N Q AI SVL+Q + N+E+I+VDD ST + Q+ + LN P+ Sbjct: 555 QNLPLVSVIIPCYNYGQYLEEAIDSVLQQTFQNFEIIVVDDGSTD-SKTQEVLDNLNKPK 613 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 T I N G RN+ I A+G+YI +D DD+ P L L + Sbjct: 614 TTLIRQ-KNQGVAIARNEGIFQAKGKYICCLDADDKLKPAYLEKCLIKLETE-------- 664 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-NQVFTWAWRFKECLFDTELKAAQ 180 N +C + S ++ + + + N + + VF K +D ++ A + Sbjct: 665 NLDICYTWIQEFEESDLVWKTASFELSKLLEENCLEVSAVFRRDIWEKVGGYDPQM-AYE 723 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILH---INHGEMQITSSPKKFSGYFHFYRKHKDK--- 234 D+D+++ M + E + I+ ++ ++ + ++K + Sbjct: 724 DWDLWITMAKMGAIGDVIPEPLFLYRKHGISKHDLDFSNHEEIKQKIETKHQKLYQEPDR 783 Query: 235 ---FDRASKKYQLFTLYQI 250 ++A KY + Y+ Sbjct: 784 VRAIEQAKPKYGVQDGYKN 802 >UniRef50_Q8DGK0 Tlr2317 protein n=2 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DGK0_THEEB Length = 356 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 66/273 (24%), Positives = 113/273 (41%), Gaps = 10/273 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N PL+S+ MP +N + AI+S+L Q + ++E II+DD ST A+ D R Sbjct: 11 SNRPLLSVLMPVFNAEAYVGEAIESILGQSFRDFEFIIIDDGSTDNSLSVLKRYAVQDAR 70 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I + N G A N+ I LA+GE+I +D DD PNR V LA ++ + Sbjct: 71 IRLISRE-NRGLVATLNEGIALARGEWIARMDADDVAMPNRFIVQLAALERERVD---FC 126 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + PL ++ L + V F E +D + A+D Sbjct: 127 GGAIQCFGALRSVCFYPLTHEAC-EVHLLFGVPFGHPTVIGRHSAFAELGYDPDYPYAED 185 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH----KDKFDR 237 Y+++ R + V E ++ G++ ++ + ++H + D Sbjct: 186 YELWQRAWGRGYKFVNVPEVVLRYRVHAGQVSARKKAEQQNTADAVRKRHWRALFPRMDD 245 Query: 238 ASKKYQLFTLYQIRNKRMTWRT-LLTLLSVRNG 269 Y + L+ + + LL+L+S +G Sbjct: 246 KEIDYIIKMLHDGKGETSRLLPCLLSLVSHYSG 278 >UniRef50_C6CSD6 Glycosyl transferase family 2 n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CSD6_PAESJ Length = 304 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 68/243 (27%), Positives = 110/243 (45%), Gaps = 14/243 (5%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCS---TSWEQLQQYVTALNDPR 61 IS + T+ R + RA+ S++ Q YSN E+I+V+D E ++ + + Sbjct: 3 KISCIVTTYKRPIPVLQRAVASIVNQTYSNLELIVVNDAPQILKLVEDIRIMLNEFTNIP 62 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I YI N GAC RN I A GEY+ +DDDDEW P++L A + A +Y Sbjct: 63 IKYIVQHQNMGACQARNTGIEEASGEYVAFLDDDDEWLPDKLEKQYALITKE-RVALVYC 121 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRNIIGNQVF---TWAWRFKECLFDTEL 176 + Y + E L + + N +G+ + + FDT L Sbjct: 122 SHYEIRQEGTRTLVIEDLAREGIHQDEFERLLLANFVGSTSYPLLRLSAVKTVGGFDTNL 181 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS-PKKFSGYFHFYRKHKDKF 235 KA+QD+D++LR+ +Y + E + H E I+S+ K G+ K+KD++ Sbjct: 182 KASQDHDLWLRIARKYPIVYC-NEPLVNYY--HSEESISSNMENKLQGFNFLLNKYKDEY 238 Query: 236 DRA 238 + Sbjct: 239 LKK 241 >UniRef50_D1W2F4 Glycosyltransferase, group 2 family protein n=2 Tax=Prevotella RepID=D1W2F4_9BACT Length = 251 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 57/246 (23%), Positives = 100/246 (40%), Gaps = 7/246 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPR 61 N L+S+ +P +N + S+L+Q Y N+E++I DD ST ++ D R Sbjct: 2 NTDLVSVIIPCYNSGIYIKECVDSILQQTYQNFEILITDDGSTDISTIKLLQQITQCDKR 61 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I D NSG A RN +I A+G YI D DD+W PN+L+ + A +Y+ Sbjct: 62 INVFFLDGNSGPAAARNNSIKHAKGRYIAFCDSDDKWLPNKLTDQIPLF-DNKEVAIVYS 120 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + + + ++ + + K N IGN + + K + +D Sbjct: 121 DYEKMDAQGKRN--NRIVHAPQSLTYAKYLKGNTIGNLTGIYDTQ-KVGKVMQKNIHHED 177 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 Y ++L ++ T + I+ IT++ K + + +H + S Sbjct: 178 YVLWLDILRHGWTALNTNTVTAVYRIH--SNSITANKLKLCTWQWYIYRHHEHLGLISSM 235 Query: 242 YQLFTL 247 Y Sbjct: 236 YYYLHY 241 >UniRef50_B4WF86 Glycosyl transferase, group 2 family protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4WF86_9CAUL Length = 606 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 62/269 (23%), Positives = 101/269 (37%), Gaps = 7/269 (2%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + PL+S+ +P + L + S+L Q Y N+E+I+VDD + D Sbjct: 79 LTARPLLSVVVPLYKTPYPLLRACLASLLDQAYPNFELILVDDAPAAPTGPLAEKLVNGD 138 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ I ++N G N + A+GE+I ID DDE P L F+ H + Sbjct: 139 DRVRIIQPEVNGGISRATNLGVAAARGEFILFIDHDDELVPESLLSFVEHIVRHPEVDAW 198 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 Y++ C G +L + K +S IG+ + A + LFD+E Sbjct: 199 YSDQVTCDG----AGKTLHHFFKPEWSPTYLLGVMYIGHLLAVRAEICAKTLFDSEFDGV 254 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY--RKHKDKFDR 237 QD++ LR+ + V A G + S K KH + R Sbjct: 255 QDFEFMLRVAEQTQRVGHVPGALYKWRATEGSLASGSDEKTGIDALQRLAVEKHLLRQGR 314 Query: 238 ASKKYQLFTLYQIRNKRMTWRTLLTLLSV 266 + + RT+ +S+ Sbjct: 315 TWRAVSSNRHRHRVLLKPGPRTVEPRISI 343 Score = 89.5 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 79/221 (35%), Gaps = 22/221 (9%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P ISI +PT N+ ++ R + S+ Y ++E+I+VD+ +T + + A + + Sbjct: 338 EPRISIIIPTCNQGEMIERCLDSIFAMTDYPDFEVIVVDNRTTEP----RALKAFDRHPV 393 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-----WTPNRLSVFLAHKQQLVTHA 117 + D A N + + GE++ +++D E W + + F + V Sbjct: 394 KRVVFDQAFNYSAANNAGVTASNGEFLLFLNNDTEVLDTDWLSDLVMYFEDRQVGAVGPT 453 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRR-LFYKRNII----------GNQVFTWAWR 166 LY V V + + Y +++ + + Sbjct: 454 LLYPARTVQHAGVVVGARGTADHVMRHFHEDWDGYAGSLVAAREASGVTAACLMMRRSLF 513 Query: 167 FKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILH 206 F + K QD D+ L++ V + I + Sbjct: 514 DDIGGFSEDYAKHYQDVDLCLKIRERDLRILCVGQPRLIHY 554 >UniRef50_A5G401 Glycosyl transferase, family 2 n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5G401_GEOUR Length = 310 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 54/214 (25%), Positives = 95/214 (44%), Gaps = 8/214 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +IS+ +PT+N + AI SVL Q Y+++E++I+DD ST + + P + Y Sbjct: 1 MISVIIPTYNSDKYICEAIDSVLCQTYTDYEIVIIDDGSTDDT---KRIIEDRYPTVRYF 57 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + + N G + RN I AQG+ I +D DD W P++L ++ +Y Sbjct: 58 YVE-NKGVASARNYGISKAQGDLIAFLDADDRWLPDKLEKQAVLFHNCNELGLVFTENYF 116 Query: 126 CQGEVYSQPA--SLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 S+ + R +F I+ + V F F+ L A+D Sbjct: 117 FDEHGISKHRVNKRERLMRGNIVRNIFLNSYIVTSTVMVRRSVFDAVGFFEEGLVVAEDD 176 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITS 216 ++++R+ + YG + E + H+ G + TS Sbjct: 177 NMWMRIGMRYGVKL-LNEPLVLYHMTEGSLSRTS 209 >UniRef50_Q2WB46 Glycosyltransferase n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2WB46_MAGSA Length = 294 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 60/240 (25%), Positives = 96/240 (40%), Gaps = 18/240 (7%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +SI +PT+N+ + SVL Q ++WE ++V++ S + + V A DPRI Sbjct: 2 TPRVSIVIPTYNQADFLKECLDSVLAQSVADWECVVVNNFS--ADHTRDVVLAYGDPRIQ 59 Query: 64 YIHNDINSG-ACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ---QLVTHAFL 119 I + N G A RN I A+ E++ +D DD W P +L + LA LV+H Sbjct: 60 LI-DFANQGVIAASRNVGIRAAKAEWVAFLDSDDLWEPVKLELCLAAATDQVDLVSHPEH 118 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ--VFTWAWRFKECLF--DTE 175 + D V + PA + RL N + + + F D E Sbjct: 119 FLKDGVTGNVTEAAPAERCGF------ERLVLDGNCLSPSAVLVRRTLLERVGGFCEDRE 172 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 + A+D D++LR+ V+ + HGE + +H Sbjct: 173 VITAEDADLWLRLAALGARMTCVDRPLGFYRL-HGEQNSKGVARHMEASLTVLERHLPAL 231 >UniRef50_B0JRA6 Glycosyl transferase family 2 n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JRA6_MICAN Length = 308 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 124/294 (42%), Gaps = 30/294 (10%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M PL+SI +P +N + +AI+S Q +N E+I++DDCST + + NDP Sbjct: 1 MNAYPLVSIIIPAYNTESYIAKAIQSAQEQTLTNIEIIVIDDCST--DNTLAIAQSFNDP 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI IHN N G + RN+AI LA+G ++ +D DD + PNRL L Q+ T A L Sbjct: 59 RIIIIHNSENLGVASTRNKAIKLAKGSWLALLDSDDWYEPNRLETLLEVAQK--TQADLV 116 Query: 121 ANDY-------------VCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWA 164 ++D V + + L L P S +++ F +++ + Sbjct: 117 SDDVYLIYDQAPSPWTTVIKMSGQTLAHPLVLEPISFITKQKFESKSLFVTMTKPIINRQ 176 Query: 165 WRFKECL-FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG 223 + L +D L+ QDY +L +++ + G + + + Sbjct: 177 FLLAHNLEYDPTLRQGQDYWFYLDCLLKEAKFIFYPYPYYYQRTRPGSLTQQKTSLRLES 236 Query: 224 YFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 Y +K +K + + + R + L L +N ++ + ++ Sbjct: 237 YCRANQKFLEKIEVKNSRELT---------RALVKNLRHLQRFKNYYKVVEPLK 281 >UniRef50_C6M2Q8 Glycosyltransferase n=2 Tax=Neisseria RepID=C6M2Q8_NEISI Length = 1239 Score = 190 bits (484), Expect = 3e-47, Method: Composition-based stats. Identities = 57/253 (22%), Positives = 94/253 (37%), Gaps = 17/253 (6%) Query: 1 MKNN---PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL 57 M + P +S+ +P +N ++ + I+SVL Q + N+E+IIVDD S E L Sbjct: 1 MNHQAQSPTVSVMIPYYNCKEYIVETIQSVLSQSHQNFEIIIVDDGS-DPEHADYLREFL 59 Query: 58 ND-PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 D P I Y + N G A RN A LA G+Y +D DD P+ + + + Sbjct: 60 ADKPAIRYAVQN-NQGVAAARNHAARLAGGKYFLFLDSDDLILPDYIEKCVTVLENNPDC 118 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTE 175 +Y E + L P L + A F FD Sbjct: 119 KLVY-----PLAEYFDAQEGLWNLPDYDGLESLLMGNRFPSSVSMHRAEDFAALGGFDEN 173 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH-----FYRK 230 L +D+D ++R++ G ++ E + + ++ Y K Sbjct: 174 LTTHEDWDFWIRLLSNGGTVIRIPEVLFRYRKRRDGSSLINRLEQNPDLNREDWQKVYEK 233 Query: 231 HKDKFDRASKKYQ 243 ++ F + Y Sbjct: 234 NRALFMQYHLGYS 246 >UniRef50_Q2SBM9 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SBM9_HAHCH Length = 295 Score = 190 bits (484), Expect = 3e-47, Method: Composition-based stats. Identities = 76/272 (27%), Positives = 128/272 (47%), Gaps = 11/272 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDPRIT 63 P + I + T NR +L RA++S + Q Y + E+IIVDD S+ ++ Y+ L + ++ Sbjct: 3 PKVCIVITTKNRLELLKRALRSAINQTYPSIEVIIVDDGSS--DETPYYLKQLEREGKLV 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + N + GAC RN A ++ + IT +DDDD +T +R+ + + Y+ Sbjct: 61 AVINQKSLGACKSRNHATKISNAKLITYLDDDDFFTADRIEKLVTAYNE------TYSFV 114 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQDY 182 G + P + + +N G QVFT + K FD L A QDY Sbjct: 115 CDSHGIYSKGKIRIYSSPSKLITLQDIKIQNYSGIQVFTEREKIIKIGGFDESLDAWQDY 174 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+ LR++ +G +++E + I+ +I++S K + GY F +KH + Sbjct: 175 DLKLRLIETFGPGFRLEGCSYIVDQESELTRISTSSKAYQGYLQFMKKHHNSLSYYEANC 234 Query: 243 Q-LFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 Q + LY + K + + S+ N KRLA Sbjct: 235 QSINDLYNRKVKIGIIKVIQHTKSISNLKRLA 266 >UniRef50_B4AW59 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AW59_9CHRO Length = 1162 Score = 190 bits (484), Expect = 4e-47, Method: Composition-based stats. Identities = 52/215 (24%), Positives = 81/215 (37%), Gaps = 13/215 (6%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +P +N + IKS+ Q +WE II+DD S AL D R+ I Sbjct: 3 VSVIIPAYNAVETLEETIKSLQAQTLPHWEAIIIDDGSQDETAALATQLALEDNRLRVIT 62 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN---- 122 NSG A RN+ I A +++ +D DD P L + ++ Sbjct: 63 Q-ANSGVSAARNRGIEEAHFDWLLFLDADDWIAPEYLEKMTGALTTDPSLDIIHCGWTFV 121 Query: 123 --DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 D GE Y+ P + L+P YS II + + A FD L + Sbjct: 122 APDGTLMGEKYT-PDLIDLFPLLSYSSTF-----IIHSCIVRKALVIAAGGFDRSLAVCE 175 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 D+D + R+ V+E + + Sbjct: 176 DWDFWQRIARTGSRFGAVKEILAFYRMRPKSLSRN 210 >UniRef50_A4G480 Putative Glycosyl transferase, family 2 n=1 Tax=Herminiimonas arsenicoxydans RepID=A4G480_HERAR Length = 303 Score = 190 bits (484), Expect = 4e-47, Method: Composition-based stats. Identities = 56/257 (21%), Positives = 106/257 (41%), Gaps = 18/257 (7%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M ++P++++ +PT+N A++S+ Q +SNWE I+V++ S + V + +DP Sbjct: 1 MNDSPVVTVVIPTYNHAHFLREALQSLCAQSFSNWEAIVVNNYS--EDDTIAVVASFSDP 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N + RN+ I LAQG+YI +D DD W P +L+ + V Sbjct: 59 RIRLENFRNNGVIASSRNRGIALAQGQYIAFLDSDDIWYPEKLTQCMQSFDDNV------ 112 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPY---SRRLFYKRNIIGNQ--VFTWAWRFKECLF--D 173 D VC G + ++ L K N I V + F + Sbjct: 113 --DLVCHGLYWFGNEERNMFCGPAQRATFDALLDKGNCITPSATVVRKSILDLVDGFSQN 170 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 + ++DY ++L++ +EE I H Q +++ + + Sbjct: 171 PAVVTSEDYHLWLKLAKVGARMKFLEEILGGYRI-HSSNQSSAALRHLESVLQVVEEFFP 229 Query: 234 KFDRASKKYQLFTLYQI 250 ++ +Q+ ++ Sbjct: 230 AQSSSTISFQIRRRKRL 246 >UniRef50_C0X9N7 Group 2 family glycosyl transferase n=24 Tax=Enterococcus faecalis RepID=C0X9N7_ENTFA Length = 1047 Score = 190 bits (484), Expect = 4e-47, Method: Composition-based stats. Identities = 57/284 (20%), Positives = 111/284 (39%), Gaps = 14/284 (4%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-- 58 + P IS+ +P +N + + + S+ Q Y NWE+ + DD S S E ++ + Sbjct: 551 QFQPKISVAVPVYNVEEKWLAACVSSLQNQYYENWELCLADDASPS-EHIKPMLEKYKEL 609 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI I+ + N N A+ +A G++I +D+DDE P L + T F Sbjct: 610 DQRIKVIYREENGHISEATNSALSIATGDFIGFMDNDDELAPQALYEVVKALNTDPTIDF 669 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELK 177 LY + + ++ + KS ++ L N I + V K ++ Sbjct: 670 LYTD----EDKITENGRRFNAFYKSDWNPELILNHNYITHFVVVKRDLLEKVGGLNSAYN 725 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 AQDYD LR + + + + K + Y + + +R Sbjct: 726 GAQDYDFVLRATEQATKIKHIPGMMYHWRAIESSTALNPESKGY-AYVAGQKAVQAATER 784 Query: 238 ASKKYQLFTLYQIRNKRMTWR----TLLTLLSVRNGKRLADGIR 277 K Q+ + ++ + +++L+ + + ++ +R Sbjct: 785 RGLKAQVEIAEFYGSYKINYLYDHVPMVSLIITNDTENMSSYLR 828 Score = 44.1 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 40/93 (43%), Gaps = 22/93 (23%) Query: 5 PLISIYMP--TWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P++S+ + T N + +L + Y+N+E+++ + + + ND R Sbjct: 810 PMVSLIITNDTENMSSYLRQ----LLEKTAYTNYEILL-------PARFENQINIQND-R 857 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDD 94 + Y+ + G I A+GEY+ ++ Sbjct: 858 LRYVSTETRHGM-------IQAAKGEYVALLNA 883 >UniRef50_C6XWT6 Glycosyl transferase family 2 n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWT6_PEDHD Length = 301 Score = 190 bits (484), Expect = 4e-47, Method: Composition-based stats. Identities = 60/251 (23%), Positives = 103/251 (41%), Gaps = 12/251 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 +K L+S+ +P +N + +AI+SVL Q Y+++E+I+VDD S + ++ + Sbjct: 4 LKEEALVSVIIPCYNHGRYLSKAIESVLAQTYTHFEIIVVDDGS--GDNTKETAQNYKE- 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 + YI+ +N G A RN I + GEY+ +D DD + L + L Q AF+ Sbjct: 61 -VKYIYQ-MNQGLSAARNTGIDESTGEYLVFLDADDWLLKDALMINLNFIQASPQLAFVS 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN--QVFTWAWRFKECLFDTELKA 178 + + Y ++N I V W F +DT LK Sbjct: 119 GGFSFFFNKDQKMWDETSKVESNHYCH--LLQKNFIAMIATVMFRRWVFDSFRYDTTLKV 176 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +DYD++L++ + E ++ H S K + K R Sbjct: 177 CEDYDLYLKVARRHPVAHHTE--LIAVYFIHDSNVSKGSVKMLNTALQILDAQKSGL-RN 233 Query: 239 SKKYQLFTLYQ 249 +++ F L Q Sbjct: 234 AEERHWFNLGQ 244 >UniRef50_A6H0Q4 Glycosyl transferase, group 2 family protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6H0Q4_FLAPJ Length = 308 Score = 190 bits (483), Expect = 4e-47, Method: Composition-based stats. Identities = 56/234 (23%), Positives = 109/234 (46%), Gaps = 10/234 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P SI +P +N++ +KSVL Q + ++E+IIV+D ST +Q V ND RI Y Sbjct: 2 PFFSIIIPLYNKENYIEYTLKSVLNQSFIDFEIIIVNDGSTDNS--EQIVFQFNDKRIHY 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 H N G RN I A+ ++I +D DD W N L + + ++ T+ ++++ Sbjct: 60 FH-KKNEGVSIARNFGIEQAKSDFICFLDADDFWYKNYLEIMQTYIEKFSTYK-VFSSAI 117 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRN------IIGNQVFTWAWRFKECLFDTELKA 178 + PA + + + F++ + + VF + K +FD +K Sbjct: 118 EIETSKTIFPAQYSIKKNADFEIINFFEASEKECAIWTSSSVFHKSIFKKTGVFDPNIKI 177 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 A+D D+++R+ ++Y + + + ++ + ++ + S + + K K Sbjct: 178 AEDTDLWIRIGLQYPIVFIWKTIARYVYDEQSVSRASNYIFEESSFLKYAEKEK 231 >UniRef50_B6YUK0 Glycosyltransferase n=3 Tax=Thermococcaceae RepID=B6YUK0_THEON Length = 294 Score = 190 bits (483), Expect = 4e-47, Method: Composition-based stats. Identities = 62/246 (25%), Positives = 103/246 (41%), Gaps = 10/246 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +SI +PT+NR +L RAI+SVL Q + ++E++++D ++ V + D R+ Sbjct: 2 RPAVSIVIPTYNRNKLLTRAIESVLSQGFDDFEVLVIDGA--RSNSTRELVRSYGDGRLR 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLV-THAFLYAN 122 YI G RN + A+GEYI +DDDD+W P +L + + + L T+ +Y Sbjct: 60 YIPQRG-KGIADARNLGVKKARGEYIAFLDDDDQWLPEKLGLQMELFKSLPSTYGLIYTA 118 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQD 181 E + + R L + + FK LF +D Sbjct: 119 FNYYYLERNKILGTKKPKARGNVYRHLLKDNITGTSTIMVRRECFKRAGLFRMSFPTCED 178 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD-KFDRASK 240 +D++LRM ++E I+ G+ K +G + H D + D Sbjct: 179 WDMWLRMSRV-CLFEAIDEPLVNYSIHSGQFSF---AKYLAGRYRMIEAHGDIRHDPQVL 234 Query: 241 KYQLFT 246 Y L Sbjct: 235 SYHLLQ 240 >UniRef50_C0CGZ7 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CGZ7_9FIRM Length = 280 Score = 190 bits (483), Expect = 4e-47, Method: Composition-based stats. Identities = 60/262 (22%), Positives = 102/262 (38%), Gaps = 5/262 (1%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 N LIS+ +P++NR+ + I+SVL Q YSN E+I++DD ST + D R+ Sbjct: 2 NELISVVIPSYNRKNTILSCIRSVLEQTYSNLEVIVIDDGSTDYTYT--LFNDSFDDRVK 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + + N GAC RN L+ G YI D DD W +L+ L++ ++ + Sbjct: 60 FFRYEENRGACYARNLGAELSSGAYIAFQDSDDIWKKTKLAKELSYLKKENADMVFCGMN 119 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAAQDY 182 S+ + + N + Q + + FD K QD+ Sbjct: 120 RKDPIHKTSKYYPDETFRSDRNILAQLLENNAVSTQTILIKREVINKIKFDVTFKRYQDW 179 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D L+ + + E I I+++ K+ Y H KH D +++ + Sbjct: 180 DFALQAASSGIKIAYLAEGLVDSTIQ--GNSISANVKEGVAYEHLIEKHMDLYNKNPRAL 237 Query: 243 QLFTLYQIRNKRMTWRTLLTLL 264 + R + R Sbjct: 238 AAIYKKIGNSYRNSNRKKSRYF 259 >UniRef50_A3CRX6 Glycosyl transferase, family 2 n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CRX6_METMJ Length = 996 Score = 190 bits (483), Expect = 5e-47, Method: Composition-based stats. Identities = 54/275 (19%), Positives = 104/275 (37%), Gaps = 13/275 (4%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M P SI +P +N L +A++S+ Q Y +WE+I+VDD S S E ++ ++ +ND Sbjct: 465 MAYKPHFSIVVPVYNTPPDLLNKAVRSLQLQWYPHWELILVDDASPSKE-TKRCLSKIND 523 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 P I + N G N AI + G+Y+ +D DDE T + L + Sbjct: 524 PNIKVFTHPQNKGISGATNTAISHSTGDYVVLLDHDDELTEDCLFELALCINRDDPDYIY 583 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKA 178 D + + +++P + K +S + + +E ++ Sbjct: 584 SDEDKIDEKGYFTEP-----HYKPDWSPDTMMSTMYVCHVSCIKKSLLEEVGGLRSDYDG 638 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ--ITSSPKKFSGYFHFYR---KHKD 233 QD+D+ LR+V + + + I G I + K + Sbjct: 639 CQDWDLILRIVEKTDRISHIPKVLYHWRIIPGSTSADIGAKSYVLDASRRVREDALKRRG 698 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRN 268 Q+ +++ + + ++ R+ Sbjct: 699 LLGTVEPLEQVNGYFRVNYHLVDKPLISIIIPTRD 733 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 56/266 (21%), Positives = 100/266 (37%), Gaps = 21/266 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PLISI +PT + + R I+S+ ++ Y N+E+I++D+ S L+ +P Sbjct: 721 DKPLISIIIPTRDHGDVLRRCIESIFKKSTYKNFELIVLDNGSVDSATLEYLEEIKAEPN 780 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQ----QLVTH 116 I+ I + + N A+GE + ++DD E T + L + Q V Sbjct: 781 ISVIRHAEPFNYSELNNVGADFAKGEILLFLNDDTEVVTSDWLERMGGYAQLAHIGAVGA 840 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRNII--------GNQVFTWAWR 166 LY Q P + + L +Y RN+I G + + Sbjct: 841 KLLYPGGKKIQHAGVLNLQDGPGHAFLHQNADLPGYYLRNLIEYNWLAVTGACLMIERKK 900 Query: 167 FKEC-LFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 F+ FD A D D+ R+ + G V ++ +++H H + Sbjct: 901 FQSVNGFDVNYPIAYNDVDLCFRL-RDAGFYNLVSQSVRLIH--HESLTRRMDHADHDKT 957 Query: 225 FHFYRKHKDKFDRASKKYQLFTLYQI 250 + + + + YQ Y I Sbjct: 958 ERLKQDMRRLYQKHPFYYQYDPFYSI 983 >UniRef50_C6QNL6 Glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y4.1MC1 RepID=C6QNL6_9BACI Length = 390 Score = 190 bits (482), Expect = 6e-47, Method: Composition-based stats. Identities = 65/255 (25%), Positives = 106/255 (41%), Gaps = 12/255 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK P IS+ MP +N ++ AI SVL Q + +E I+V+D ST + P Sbjct: 1 MK-KPTISVIMPVYNGEKYLAEAIDSVLSQTFDGFEFIVVNDGSTDRTAKILEKYRIEYP 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 I + ++N G N + A EY+ + DD PNRL ++ Sbjct: 60 WIKIVTLEVNQGISKAINVGLAHANAEYVCFVAADDIQYPNRLEDSYREMVLKKLDILMF 119 Query: 121 ANDYVCQ-GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 D + G + +P + +KRN + + K FDT L+ + Sbjct: 120 EFDMIDDTGNPLGRSMRIPNFLTETNILLEEFKRNYFFSGACLIKFDPK-IQFDTNLRTS 178 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 +DYD FLRM++ + + ++ + ++ G + SS +S Y +K FD Sbjct: 179 EDYDWFLRMLMADKKVGFLRKSLLQVRVHKGSL---SSNHSYSYEATKYILNKYDFD--- 232 Query: 240 KKYQLFTLYQIRNKR 254 LF ++IR R Sbjct: 233 ---SLFHFFKIRGFR 244 >UniRef50_Q39U31 Glycosyl transferase, family 2 n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39U31_GEOMG Length = 303 Score = 190 bits (482), Expect = 6e-47, Method: Composition-based stats. Identities = 62/256 (24%), Positives = 102/256 (39%), Gaps = 15/256 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ +PT+N A++SVL+Q YSN+E+IIV+D ST N+ I YI Sbjct: 3 KVSVVLPTYNCGTYVGVAVESVLQQTYSNYELIIVNDGSTDETDQALAPYLCNNGHIRYI 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N G RN I A G+ I ID DD+W P +L + ++ ++ N Y Sbjct: 63 KQ-GNKGHAGARNTGIRAATGDVIAFIDSDDKWLPEKLGEQVQAMEEDPEVGLVHCNVYG 121 Query: 126 CQGEVYSQPASLPL-------YPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 Q L Y + F K I V + +FD + Sbjct: 122 FGENQEVQVRGPLLTQEQLQGYSGYIFDNLYFRKIIITTTTVMIRKSCIDDVGMFDENMT 181 Query: 178 --AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK- 234 ++D ++FLR++ +Y + V + + + G + Y K Sbjct: 182 RYGSEDRELFLRILWKYKARY-VNKPLAMYRNRSDSEGQNY-ERMIKGQEYVYEKITSLY 239 Query: 235 -FDRASKKYQLFTLYQ 249 ++SK+ + +YQ Sbjct: 240 GLPQSSKRAVMSKMYQ 255 >UniRef50_C9Y383 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y383_CROTZ Length = 293 Score = 190 bits (482), Expect = 6e-47, Method: Composition-based stats. Identities = 75/257 (29%), Positives = 123/257 (47%), Gaps = 14/257 (5%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP--- 60 N LIS+ + T+NR++L RAI+SV+ Q YS E+I++DDCS E+ Q + + Sbjct: 13 NELISVIITTFNREELLERAIRSVIAQTYSAVELIVIDDCSN--EKTAQLIEKMRPECEV 70 Query: 61 ---RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 Y N+ NSG+ RN+ L+ G ++TG+DDDD + P RLS A ++ Sbjct: 71 RFVHFIYERNEKNSGSNFSRNRGYALSHGVFVTGLDDDDYFLPERLSKLAARYEERYAFV 130 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTEL 176 + SQ K + + N++GNQV T + F E+ Sbjct: 131 TDSPARLDKKQRTNSQGNR-----KRIITLQDVLCENVVGNQVLTTREKMLGVGGFSPEI 185 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 K QD D +++++++YG K +T ++ H +IT KK++ Y Y K+++ Sbjct: 186 KQQQDRDAWIKLILQYGPGVKYSFSTAMVDAEHSSDRITKKIKKYTSYRKLYFKYREHMS 245 Query: 237 RASKKYQLFTLYQIRNK 253 ++ LF L R Sbjct: 246 ETTRSNNLFQLMSFRGF 262 >UniRef50_D0L0W1 Glycosyl transferase family 2 n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0W1_HALNC Length = 733 Score = 189 bits (481), Expect = 7e-47, Method: Composition-based stats. Identities = 59/242 (24%), Positives = 100/242 (41%), Gaps = 16/242 (6%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALNDPR 61 P+IS+ M TW + AI+SVL+Q Y W++ I DD S + ++ D R Sbjct: 196 QPVISVIMATWQSDLRWLDAAIQSVLQQSYPYWQLCIADDASEQKKLIELLNHYKKKDSR 255 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I N A +N ++LA G Y+T +D DD PN L QQ FLY+ Sbjct: 256 IRVTFRYENGHISAAQNSGLLLATGAYVTFLDHDDLLAPNALLAVAEALQQQPRPLFLYS 315 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQ 180 ++ G + + KS ++ L +N I + + + F ++ +Q Sbjct: 316 DEDKIDG----TGRRVNPHFKSDWNPDLLCAQNYITHLMVVERAMVESVGGFRVGVEGSQ 371 Query: 181 DYDIFLRMVVEYGEP--WKVEEATQILHINHGEMQITSSPKKFSG-------YFHFYRKH 231 D+D+ LR+ + + + I + + K ++ HF RK+ Sbjct: 372 DHDLALRVTEKLPPALIHHIPQVLYHWRITENSTALHADAKNYTSSAGVAAIEEHFKRKN 431 Query: 232 KD 233 +D Sbjct: 432 QD 433 Score = 68.3 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 46/232 (19%), Positives = 81/232 (34%), Gaps = 20/232 (8%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSW-EQLQQYVTALND 59 K PL+++ +PT +R + I S++ + Y +E+II+D+ S L Sbjct: 454 KPAPLVTLLIPTRDRLDMLKPCISSIIEKTSYKPYEIIILDNGSNEPQTHLFFAEIQAQH 513 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHK---QQLVT 115 I + D A+ N + AQG I I++D + L+ ++H + Sbjct: 514 EHIRVLSYDKPFNYSAINNFGVAHAQGTIIGLINNDIEVIQSGWLTELVSHAVRPEIGCV 573 Query: 116 HAFLYANDYVCQ--------GEVYSQPASLPLYPKSPYSRRLFYKRN---IIGNQVFTWA 164 A LY +D Q G V Y RL +N + G + Sbjct: 574 GAKLYFDDGSIQHAGVILGIGGVAGHAHKYFARDAHGYFSRLQLVQNLSAVTGACLLVRK 633 Query: 165 WRFK-ECLFDTE--LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 F+ + + A D D+ L++ A H + + Sbjct: 634 EVFEQVGGLEEDHLTVAFNDIDLCLKVREAGYRNLWTPHAELYHHESKSRGR 685 >UniRef50_Q6MBF6 Putative glycosyltransferase n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBF6_PARUW Length = 376 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 49/222 (22%), Positives = 87/222 (39%), Gaps = 4/222 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M ++PL+SI +P +N+ A+ SVL+Q Y N+E++IVDD ST D Sbjct: 3 MHSSPLVSILIPVFNQVPFIFEALNSVLQQTYPNFEVLIVDDGSTDGTASICQEFCQKDS 62 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R Y++ N G A RN+ I + GEY +D DD+ R++V L + T +Y Sbjct: 63 RFQYLYQ-TNKGPSAARNRGIAASHGEYFCLLDGDDKMDFQRIAVQLKVLTENPTLDIVY 121 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--GNQVFTWAWRFKECLFDTELKA 178 + + RN+I + + + Sbjct: 122 TALLLIDSNGSTIGEMHGQEINPENFLATLLFRNVIPGPSTIMAKRECLTSHPYHEHFVH 181 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 A+DY++ +R+ Y + ++ + + + Sbjct: 182 AEDYELMIRLAHFYRFQY-IDLPLTYYRRHLNNLSNHLKAHR 222 >UniRef50_C1SJ24 Glycosyl transferase n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJ24_9BACT Length = 255 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 51/224 (22%), Positives = 95/224 (42%), Gaps = 5/224 (2%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 S+ +P +NR Q AI+SVL Q + ++E+I+VDD S ++ P I YI Sbjct: 3 FSVIIPVFNRTQTLKTAIESVLAQSFRDFEIIVVDDGS---DKSVSSALRPYMPMIRYIR 59 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 + N G RN I A+G Y +D DD W P++LS + N++ Sbjct: 60 IEKNMGVSYARNIGIREAKGLYTAFLDSDDIWLPDKLSAQHEAL-TATKLKVCHTNEFWF 118 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKAAQDYDIF 185 + + + + ++ I + V FK C FD+ ++ +DYD++ Sbjct: 119 RKDRFINQGAKHKRHGGMIFDKVLDFCRISPSSVVIHRDVFKTCGVFDSNMRTCEDYDLW 178 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 LR+ ++ + E+ + + ++ + ++ R Sbjct: 179 LRICSKFEVCYLPEKHIIKRAVTNDQLSDSIKNIEYIRLVSLAR 222 >UniRef50_A5ZX57 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZX57_9FIRM Length = 808 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 58/264 (21%), Positives = 98/264 (37%), Gaps = 14/264 (5%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--N 58 + NPLISI +P + I SV Q Y+NW++ + D S ++++Y+ Sbjct: 274 EQNPLISIVIPLYCTPTPYLKELIDSVRAQSYTNWQLCLA-DGSPDQ-KVEEYIQKRYGK 331 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI Y H + N G N+AI +A GEY+ D DD P+ L + Sbjct: 332 DSRILYKHLEENGGISINTNKAIEMATGEYLMLSDHDDTLEPDALYEIVKAINDHQGPEI 391 Query: 119 LYANDYV--CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTE 175 +Y ++ GE Y +P + KS Y+ N I + + E Sbjct: 392 VYTDEDKLSMDGEFYFEP-----HFKSDYNLFRLRDNNYICHIFAVKKALVDQVGGLRQE 446 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QDYD LR + + + + + K + Y R ++ + Sbjct: 447 YDGSQDYDFILRCCEQAKQVIHIPRVLYHWRCHMNSVAANPESKTY-AYEAGCRAIQEHY 505 Query: 236 DRASKKYQLFTLYQIRNKRMTWRT 259 R + ++ R + Sbjct: 506 RRVGIEAEVEMTKHPGWYRSHVKI 529 Score = 69.5 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 46/268 (17%), Positives = 95/268 (35%), Gaps = 30/268 (11%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYV-TALN 58 ++ PL+SI +P + + + S+ + + N+E+++V++ S E + Y + Sbjct: 529 IQGEPLVSILIPNKDHIDDLEKCLSSIYEKSTWKNYEILVVENNSEKPETFEYYKNLSWR 588 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ----L 113 P+ + A+ N A A+G Y+ +++D + TP + L QQ + Sbjct: 589 YPKARVLTWKEGFNYAAINNFAAKDAKGSYLLFLNNDVEVITPGWIEEMLMICQQPDVAI 648 Query: 114 VTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQV------------- 160 V Y ++ + V + + S + G V Sbjct: 649 VGAKLYYPDNLIQHAGVVLGMGGIAGHIMCQASCED---KGYFGRAVNVQEISAVTAACM 705 Query: 161 FTWAWRFK-ECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 F+ FD E A D D+ ++ + A ++ H + +P Sbjct: 706 LMKVEDFEAVGGFDEEFVVAFNDIDLCMKERAAGKKVVFTPYA-ELYHYESKSRGMEDTP 764 Query: 219 KKFSGYF----HFYRKHKDKFDRASKKY 242 +K + HF K ++ + Y Sbjct: 765 EKQFRFEKETKHFEEKWGEQMSKGDPYY 792 >UniRef50_B9M560 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. FRC-32 RepID=B9M560_GEOSF Length = 320 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 53/274 (19%), Positives = 109/274 (39%), Gaps = 12/274 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+SI +P +N +AI S L Q Y E+++++D S ++ + D +I Sbjct: 2 TPLVSIIIPVYNGSNYLKQAIDSALNQTYKYIEILVINDGSNDGGLTEEIARSYGD-KIR 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y H N G N I LA G+YI+ + DD + P+++ ++ + + ++ Sbjct: 61 Y-HCKPNGGVATALNTGINLAAGDYISWLSHDDLYLPHKIERQISTIASIGGSDVISYSN 119 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRL-----FYKRNIIGNQVFTWAWRF-KECLFDTELK 177 Y + ++ L Y +L + +I G + F + F L+ Sbjct: 120 YETIDAKNNVLRTVRLETSEIYDCKLAFLLQLFISSIHGCSLLIPRKCFTEVGYFKESLR 179 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT----SSPKKFSGYFHFYRKHKD 233 QDYD++ R++ + + E ++ + + + Y D Sbjct: 180 TTQDYDLWFRLLQKGFAFTHLPEILIQSRFHNEQGTHSLYAVHLKEAEQLYLRAVNTFYD 239 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVR 267 F++ K + + +R +++ L S++ Sbjct: 240 DFEKLPIKKIVDLVIDLRERKLKQTANHVLKSIK 273 >UniRef50_A4BIW6 Glycosyltransferase n=1 Tax=Reinekea blandensis MED297 RepID=A4BIW6_9GAMM Length = 254 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 63/256 (24%), Positives = 102/256 (39%), Gaps = 5/256 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M N LI++ P +N + I SVL Q + N E I+VDDCS L D Sbjct: 1 MSNE-LITVITPVFNCEDFLEETIVSVLNQTHKNIEYILVDDCSVDNSSLIYQRFMEKDR 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFL 119 R+ I NSGA RN+ I LA+GEYI +D DD+W P +L L ++ + +F Sbjct: 60 RVKSIKLLQNSGAAIARNKGIELAKGEYIAFLDSDDKWEPTKLEEQLMFMKENKASFSFT 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 S+ S Y L YK+ +G + L+ Sbjct: 120 SYTSLNISNNSISKVDSKHQKIVFNYF-DLLYKKITLGCSTVMLKSNSIGNMQMPNLRTG 178 Query: 180 QDYDIFLRMVV-EYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QD+ +L ++ + E +++ I + + + ++ KK + R +++ Sbjct: 179 QDFAFWLLLLKNQIKEVHLLKKCLTIYTVRNDSIS-SNKIKKAIRQWSILRSNENLSTLK 237 Query: 239 SKKYQLFTLYQIRNKR 254 + L Y R Sbjct: 238 AILPFLSYAYHATIGR 253 >UniRef50_B8IAC9 Glycosyl transferase family 2 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IAC9_METNO Length = 450 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 52/220 (23%), Positives = 91/220 (41%), Gaps = 5/220 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M N L++I +P +N +AI+S L Q + + E+++VDD ST + + + D Sbjct: 1 MPNPNLVTIIIPVYNGGYFLSQAIESALAQTWPHVEVLVVDDGSTDDGETARTCRSFGD- 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI YI + N G + N+ I A+G YI+ + DD + P ++ + + ++ Sbjct: 60 RIRYIWKE-NGGVASALNRGIAGARGRYISWLSHDDLYDPRKVEIQMGALLAEPDTVIVF 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPY-SRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 + V +G+ S S + I G V F + FD L Sbjct: 119 GDYAVIRGDGSLLTEVRVGEGYSDRASLWSTLEGRINGCAVIVPRECFARHGTFDEGLPT 178 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 QDY+++ R + V ++ G+ TS Sbjct: 179 TQDYELWFRFAQHH-RFVHVPGYLVRHRVHEGQGSRTSRH 217 >UniRef50_A6LGW8 Glycosyltransferase family 2 n=3 Tax=Bacteroidales RepID=A6LGW8_PARD8 Length = 259 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 53/225 (23%), Positives = 93/225 (41%), Gaps = 5/225 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI MP++N + + I S++ Q Y+NWE++I DDCST A D RI Sbjct: 7 LVSIIMPSYNSSKYIAKTIDSIVSQFYTNWELLITDDCSTDNTCEIIKEYAAYDQRIKLF 66 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + N GA RN++I A+G YI D DD W P +L V L + Y++ Sbjct: 67 VMEENKGAGVARNKSIEEAKGRYIAFCDSDDRWKPEKLEVQLRFMVEN-RVEICYSSYLK 125 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 C + S S + + + IG + ++ L+ QD+ Sbjct: 126 CNEN---DKVFGIVIAPSKISYKKMTRNDYIGFLTCIYDTHQIGKIYMPTLRKRQDWAWK 182 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 + ++ + + +++ + G + + K Y++ Sbjct: 183 ILLMKKCPIAYGIKDTLAYYRVREGSLS-NNKMKLIRYNVTVYKQ 226 >UniRef50_UPI000197ADF7 hypothetical protein BACCOPRO_00943 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197ADF7 Length = 309 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 68/253 (26%), Positives = 113/253 (44%), Gaps = 9/253 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTALND 59 + PL+S+ +PT+ R Q RAI SVL Q Y N E+I+VDD + E Q + + D Sbjct: 6 STPLVSVIIPTYARSQYICRAIDSVLNQTYKNVEVIVVDDNGENTENQLATFQTLKSYID 65 Query: 60 -PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTH-- 116 +ITYI + N A RN I A+GEYI +DDDDE+ P ++ + QL Sbjct: 66 KEQITYITHKTNRNGSAARNTGIFNAKGEYICLLDDDDEFFPEKVEKQVQVLNQLDDSWA 125 Query: 117 -AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDT 174 F + + + + + P++ Y LF + + + + FDT Sbjct: 126 GVFCNSINRTITLHGVKEKLNKIVLPENMYEEFLFCRAGFGSSSLMLRKSVCLEINGFDT 185 Query: 175 ELKAAQDYDIFLRMVVEYG-EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 K QD++ R++ +Y + + A +I + S + H+ K KD Sbjct: 186 SFKRHQDWEFLTRILRKYKLKQVEPNGALLYYYIYPYNINRPSGKQVQIYREHYLDKFKD 245 Query: 234 KFDRASKKYQLFT 246 + +S K +++ Sbjct: 246 DINASSNKNKIYH 258 >UniRef50_Q11KM8 Glycosyl transferase, family 2 n=1 Tax=Chelativorans sp. BNC1 RepID=Q11KM8_MESSB Length = 354 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 54/259 (20%), Positives = 101/259 (38%), Gaps = 24/259 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ M + + A+ SVL Q + E+++ DD S D R Sbjct: 7 ETGPLVSVIMANFRSAEYLAAALDSVLAQTIGDIEVLVSDDASPDNSTDIVQHYMSRDAR 66 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + I D N G A RN+A+ A+GE+I +D DD P RL + +A ++L A Sbjct: 67 VNLITTDENKGPAAARNRALEAARGEWIAIVDSDDIIHPQRLEILIAMAEELGADGI--A 124 Query: 122 NDYVCQGEVYSQPASL---PLYPKSPYSRRLFYKRNIIGNQVFT--------WAWRFKEC 170 +D + + + P L + F + N GN + + Sbjct: 125 DDLLFFSDQGAGPTLLGATAAPEPLEITPSYFIRSNTSGNGLAPLGYVKPLFRRAKLSGL 184 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ-----------ITSSPK 219 +D ++ +DYD LR ++ + + + + + + IT+ K Sbjct: 185 SYDEAVRIGEDYDFLLRFLLTGARFYILPDPLYLYRRHSHSISHRLSESTVLAMITNQQK 244 Query: 220 KFSGYFHFYRKHKDKFDRA 238 + + F + +D ++ Sbjct: 245 LVATHGSFSPEMEDLLEKR 263 >UniRef50_B7HZB4 Beta-1,3-N-acetylglucosaminyltransferase n=37 Tax=Bacillus cereus group RepID=B7HZB4_BACC7 Length = 326 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 52/241 (21%), Positives = 90/241 (37%), Gaps = 11/241 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK N L+S+ +P +N ++ ++S+L Q Y N E++IVDD S A P Sbjct: 2 MKQNQLVSVVIPLYNAEKYIEETMQSILDQTYKNIEIVIVDDGSKDQSPSIVKNLAEKYP 61 Query: 61 -RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 ++ Y+H N G RN I A GEYI +D DD W P ++ + + A Sbjct: 62 GQVKYVHQ-KNQGVSVARNTGIENASGEYIAFLDSDDLWHPTKIEKQVKSIHKNNMDA-C 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF---KECLFDTEL 176 Y E + + K S+ + + + + F Sbjct: 120 YCGYMNFYEETGEKVEHTTNFIKGDMSKAFLTHQVVAQTSTWIFKRAIVMDHNIRFTPGC 179 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +D + +++ + V+E I + + K+ Y K + F+ Sbjct: 180 SWGEDLEFLFKLMSVTNVCY-VDEYLTYYRI----LSEGNLSSKYKDYELKTTKELEVFN 234 Query: 237 R 237 R Sbjct: 235 R 235 >UniRef50_Q5FRX1 O-antigen biosynthesis protein RfbC n=1 Tax=Gluconobacter oxydans RepID=Q5FRX1_GLUOX Length = 876 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 63/225 (28%), Positives = 94/225 (41%), Gaps = 11/225 (4%) Query: 4 NPLISIYMPTWNRQQL-AIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALNDPR 61 NPL+S+ P WN AI+SVL Q + NWE+I+VDDCS + E A D R Sbjct: 281 NPLVSVVCPVWNPDLTDFTAAIQSVLSQTWQNWELILVDDCSKNPELRAVIENFAKTDAR 340 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I + N G N + A+GE+I D DD + L + +Y+ Sbjct: 341 IKPIFQEKNGGISIATNAGLAAAKGEWIAFFDHDDLLADVAVEYMLREATR-SHADLIYS 399 Query: 122 N-DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 + D + Y +PA K+ ++ RL N + + V + + E A Sbjct: 400 DEDKIDASGYYREPA-----FKTDWNYRLLLGVNYVCHFVMIRQSALQNIGGLNKEYDGA 454 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 QD+D LR E+ K+ +IL+ T+S Y Sbjct: 455 QDHDFLLR-ASEHIPAEKIHHVPEILYHWRITANSTASDIGNKRY 498 Score = 48.7 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 40/258 (15%), Positives = 87/258 (33%), Gaps = 24/258 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 +P ++I +P + R + ++ + Y N+++I+V++ S + E P Sbjct: 536 NQSPSVTIIIPYKDEIATTARCLDAIQKYTRYPNYKVILVNNWSITKEAEAFAQAVDKIP 595 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWT-PNRLSVFLAH-----KQQLV 114 + + + N A +++ +++D N L + + +V Sbjct: 596 NVEILTIKEPFNYSRINNLAAKDETSDFLLFLNNDVFVEQENWLEILVNEALADPLVAIV 655 Query: 115 THAFLYANDYVCQ-------GEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQVFTW 163 F+Y N V G+V + Y+ R ++ + I + Sbjct: 656 GGKFVYPNQTVQHAGVLLGIGDVAGHAHVGIPRDEGGYAGRAYFPQEISAVTAAGMLIRR 715 Query: 164 AWRFKECLFDTE-LKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 FD E LK A D D+ L++ + + H E S + Sbjct: 716 PAFELVGGFDEEHLKVAFNDIDLCLKVRDAGYKVVWTPDFCA----EHHESLSRGSDDRP 771 Query: 222 SGYFHFYRKHKDKFDRAS 239 S F+ + + ++ Sbjct: 772 STERRFFHETQFMIEKWG 789 >UniRef50_C0QK40 Putative glycosyl transferase (Group 2 family protein) n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QK40_DESAH Length = 565 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 68/288 (23%), Positives = 105/288 (36%), Gaps = 19/288 (6%) Query: 3 NNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY--VTALND 59 + PLISI P +N L I ++S L Q Y WE+I+VDD S L + + D Sbjct: 97 DQPLISIVTPLYNTPSNLLIECVESALFQSYQQWELILVDDASNDRSTLNCLSRIARIKD 156 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI + N N G C N+ + A GEYI +D DD TPN L + Sbjct: 157 PRIKVLTNRKNRGTCQSTNRGVRRASGEYIAFLDHDDRITPNALFETARTISGHDRPDLI 216 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKA 178 Y++ + K +S N + + + F E + + + Sbjct: 217 YSDRDLIS----PSGKRHDRISKPAWSPETLLSGNYLFHLMVAKRSLFLELGGYRKDFEG 272 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT-SSPKKFSGYFHFYRKHKDKFDR 237 +QD D LR+ + + + E K + + +D DR Sbjct: 273 SQDLDFVLRLAEVTHRVVHIPKVL--YNFRQAETSCAYDMEAKSFIFEKGVKAVQDALDR 330 Query: 238 A---SKKYQLFTL----YQIR-NKRMTWRTLLTLLSVRNGKRLADGIR 277 K Y+L Y IR R+ T++ + L I+ Sbjct: 331 RGINGKAYELEHAWRGNYGIRLKNRLAHDTIIYNRHPNLAEFLNHRIK 378 >UniRef50_A9EG08 Putative uncharacterized protein n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EG08_9RHOB Length = 1026 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 52/213 (24%), Positives = 89/213 (41%), Gaps = 6/213 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +SI MP +N A+ S+ Q ++ WE+IIVDD S+ A D RI Sbjct: 9 PRVSIVMPAYNAAAFIDEALTSIAAQSFTAWEIIIVDDGSSDDTADIAERWAQQDKRIRV 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNR-LSVFLAHKQQLVTHAFLYAND 123 + N+GA A RN + A+ +I ID DD + + + K++ + F A D Sbjct: 69 LKQ-ANAGASAARNTGLEAARAAWIAFIDSDDWVHRDYLVKLMKPLKKKPLNAVFCIAVD 127 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI-IGNQVFTWAWRFKECLFDTELKAAQDY 182 V G S+ PK I + + +F + + +D +L +D+ Sbjct: 128 VVSDG---SRGKRWIPPPKDNLFPVFATDCPIAVHSGIFRRSVLQEIGGWDPDLGTCEDW 184 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 D+++R+ V E + + G + + Sbjct: 185 DLWMRLARTTNRIENVMEELAFIRLRTGSLSRS 217 >UniRef50_C6ARE2 Glycosyl transferase family 2 n=4 Tax=Rhizobium RepID=C6ARE2_RHILS Length = 390 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 56/238 (23%), Positives = 92/238 (38%), Gaps = 15/238 (6%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N PL+S+ +P +N + R ++S RQ Y N E+I+V+D ST AL D RI Sbjct: 16 NGPLVSVVIPAFNASRYIERTLRSAGRQTYRNLEIIVVNDGSTDDTARVVEQVALADSRI 75 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLYA 121 + N G A RN I A G ++ +D DD W ++ + +L A +Y Sbjct: 76 RLLSTP-NRGVAAARNTGIREAAGRFVAFLDADDLWHHTKIEKQVNALNRLTPQWAAVYV 134 Query: 122 NDYVCQ-GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ---VFTWAWRFKECLFDTEL- 176 Y+ + + + ++R L +K +GN + + FD+ Sbjct: 135 LHYIINSDDEILRSCRPDVARGYIFARHLTFK--YVGNGSALLVRRDVALEIGGFDSSYA 192 Query: 177 ----KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 +D D L++ Y + + E G M P G R+ Sbjct: 193 AAGIGGCEDLDFELKLAARY-QIEVIPERLVGYRQYPGNMSSNHLPMG-RGALEVIRR 248 >UniRef50_B6BQY7 Glycosyltransferase n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQY7_9RICK Length = 296 Score = 188 bits (477), Expect = 2e-46, Method: Composition-based stats. Identities = 62/277 (22%), Positives = 111/277 (40%), Gaps = 15/277 (5%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 K PL+SI +PT+N +A++SVL Q Y+NWE I++D+ ST + + ND R Sbjct: 3 KFKPLVSIIIPTFNHADYLNKALQSVLHQTYTNWEAIVIDNHSTDDTG--KVIARFNDHR 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y+ RN I A+G+++ +D DD WT ++L + + + +Y Sbjct: 61 IKYLKIQNYGVIAKSRNIGIKSAKGDWVAFLDSDDSWTSDKLKTCIENINNQI--DLIYH 118 Query: 122 NDYVCQGE--VYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF--KECLFDTE-- 175 + V + + K P L N+I N + K L + Sbjct: 119 DLEVTSDRQIFFKGKINKSRQLKKPILIDLLVNGNVISNSSVIVRKKVLTKIGLIEENKD 178 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP----KKFSGYFHFYRKH 231 L AA+DY+ +L++ + + + +I+H + + K + Sbjct: 179 LIAAEDYNTWLKISKFTDQFLYIPKKLGYYYIHHEGVSRRNMSMPTQKATQDFLCILNNK 238 Query: 232 KDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRN 268 + K A+ KY + + N + +R Sbjct: 239 QKKKLEANIKY-ISGRFNYLNFNYKQAKKDLVFVLRY 274 >UniRef50_D1ALB8 Glycosyl transferase family 2 n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1ALB8_SEBTE Length = 1334 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. Identities = 55/248 (22%), Positives = 95/248 (38%), Gaps = 14/248 (5%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--N 58 + PLIS+ MP +N +A+ SV+ Q Y NWE+ I DD S E++++ + Sbjct: 491 QYKPLISVLMPVYNTDIHFLKQALDSVIEQTYDNWELCIADDNSP-NEEVREILKEYENK 549 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT-HA 117 D RI I+ N N A+ L GE+ +D DD + L + + + Sbjct: 550 DSRIKVIYRKENGHISLASNSALELVTGEFTALMDHDDLIPKHALYMVVWEINRKKGLVD 609 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTEL 176 +Y ++ GE Y K ++ L +N + + K+ F Sbjct: 610 LIYTDEDKIDGE----NIRYDPYFKMEWNETLINSQNFVAHLGIYRTSILKKIGGFRKGF 665 Query: 177 KAAQDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFS--GYFHFYRKHK 232 + +QDYDI LR + E + I G ++ S + ++H Sbjct: 666 EGSQDYDILLRFLREISSDRISHIPHVLYHWRIFKGNHTFSTDNHNISDDSAYKALKEHY 725 Query: 233 DKFDRASK 240 + + Sbjct: 726 EILKEDVR 733 Score = 98.4 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 52/279 (18%), Positives = 110/279 (39%), Gaps = 34/279 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +S+ +PT +R ++ + + + Y ++E+IIVD+ S + L+ + D RI Sbjct: 755 PKVSLIIPTRDRVEILKNCVDGLQKNTDYDDFEVIIVDNDSKEKKTLEYFDKISADSRIK 814 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLVTHAF 118 + + + N A+ A+GEY+ +++D ++W +S F +V Sbjct: 815 ILKVEGEFNYSKLNNLAVKEAKGEYLVFMNNDLEIIKNDWLKEMISTFSEENVGIVGAKL 874 Query: 119 LYANDYV----CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN--------QVFTWAWR 166 Y+N+ + C VY + + S F + +I N + Sbjct: 875 YYSNNTIQHAGCVTGVYGVAGHIHKHL-PKNSPGYFGRLGLIHNVSAVTGACLAISKKIF 933 Query: 167 FKECLFDTE-LKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 + FD E LK + D D+ L++ + + E ++ H+ I+ + Sbjct: 934 EEVNGFDEEKLKVSYNDVDLCLKVRDKGYKIIFNPEV-ELYHL----ESISRGKDE---- 984 Query: 225 FHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTL 263 K+K + +R ++ + + + L+L Sbjct: 985 ----SKYKKELNRNERREMISRYGEKLKYDPYYSVNLSL 1019 >UniRef50_Q1N7I9 Predicted glycosyltransferase n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1N7I9_9SPHN Length = 1454 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 64/217 (29%), Positives = 93/217 (42%), Gaps = 8/217 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +SI MP NR + AI SVL Q ++++E+II+DD ST + L + DPRI I Sbjct: 1003 VSIIMPVLNRAAIIGDAIASVLAQSHAHFELIIIDDGST--DDLPSALERYPDPRILLIS 1060 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 N + A RN+ + A GE I +D D+ W P+ L+V + HA YA + Sbjct: 1061 NARSERVSAARNRGLDRASGEIIAYLDSDNSWDPDYLAVMVNALADHRGHACAYAGQSIF 1120 Query: 127 QGEVYSQPAS---LPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF--KECLFDTELKAAQD 181 Q + A P++R +RN I +F FD L D Sbjct: 1121 QAVTSADGAGQEERRAIRLCPFNRSRLEERNYIDLNIFVHRRHLVALHGGFDKSLHRLVD 1180 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 +D+ LR E P V + QIT++ Sbjct: 1181 WDLILRYTPE-SPPLMVPALLGTYRAGAADNQITATE 1216 >UniRef50_O87159 WblC protein n=1 Tax=Vibrio cholerae RepID=O87159_VIBCH Length = 340 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 49/231 (21%), Positives = 89/231 (38%), Gaps = 4/231 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + P IS+ M +N ++ +AI+S+L Q +S++E IIVDD ST D R Sbjct: 3 SDTPTISVIMSVYNGEKYLAQAIESILNQTFSDFEFIIVDDGSTDSSLSIIQAYMDKDDR 62 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I +N G N+AI +++ YI +D DD P RL LA + Sbjct: 63 IVLISR-VNKGLPYSLNEAISVSKANYIARMDADDISLPERLETQLAVMENNPDIGVCGT 121 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC--LFDTELKAA 179 Y+ + RL + I V + +++ + + Sbjct: 122 LAYLFRETPSKNKMMCHPEDHDSLIIRLLFSVCFIHPVVMIRKSVLDQLDYVYNENFRNS 181 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 QDY+++ R+ + + +++ + + F+ F + Sbjct: 182 QDYELWSRIAEK-TRFYTIQKPLLFYRDTPDGITSKVNHDGFNKRFPLVSQ 231 >UniRef50_C5RQE2 Glycosyl transferase family 2 n=1 Tax=Clostridium cellulovorans 743B RepID=C5RQE2_CLOCL Length = 802 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 63/272 (23%), Positives = 104/272 (38%), Gaps = 12/272 (4%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P ISI +PT+N + + I+SV Q Y+NWE+ I D S + E + DP+I Sbjct: 268 EPKISILVPTYNTPENFLVEMIESVNEQTYTNWELCIA-DASNNEETRKVLARYEGDPKI 326 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + D N G N+A L GEYI D DD PN L + + F+Y + Sbjct: 327 PIKYLDENKGISGNTNEAATLVTGEYIALFDHDDLLMPNALFEIVKVINKDREVDFIYTD 386 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQD 181 + + + K ++ F N I + F F E AQD Sbjct: 387 EDKTDE---TSTRRFDPHFKPDFAIDTFRSNNYICHFTTMKRELFDSVGGFRKEYDGAQD 443 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 +D+FLR + + + + ++ + K + +D DR K Sbjct: 444 FDLFLRTTEKAKKIVHIPKVVYHWRVHQNST-AGAGEAKLYAFEAGKNAIQDSLDRNGIK 502 Query: 242 -----YQLFTLYQIRNKRMTWRTLLTLLSVRN 268 + +Y IR + + + L+ ++ Sbjct: 503 GTVEMGKYLGIYNIRYELESTPLVSILIPTKD 534 Score = 94.9 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 60/268 (22%), Positives = 102/268 (38%), Gaps = 30/268 (11%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALND 59 +++ PL+SI +PT + + R IKS++ + Y N+E+I++++ ST E + Y T N Sbjct: 520 LESTPLVSILIPTKDHIEDLDRCIKSIIEKSTYKNYEIIVIENNSTESETFKYYDTLKNY 579 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQL 113 I + A+ N + A GE +++D D + L + Q+ Sbjct: 580 SNIKVVEWKDEFNYSAINNFGVKNASGEVFLLLNNDVEVINGDW-----IERMLQYAQRE 634 Query: 114 VTHA----FLYANDYVCQGEVYSQPASLPLYPKSPYSRR---LFYKRNIIGN-------Q 159 A Y ND + G V + + + R F + N + N Sbjct: 635 DVGAVGAKLYYPNDTIQHGGVIVGLGGIANHAHKHFHREAPGYFARLNFVQNFSAVTAAC 694 Query: 160 VFTWAWRFKEC-LFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILH--INHGEMQIT 215 + +++ D +LK A D D LR+ A H I+ G Sbjct: 695 LMVRRDVYEKLNGLDEDLKVAFNDVDFCLRIRALDKLIVWTPFAELYHHESISRGVEDTP 754 Query: 216 SSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++F G F K KF +A Y Sbjct: 755 EKIERFKGEIRFMEKRWGKFIKAGDPYY 782 >UniRef50_Q74BU3 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74BU3_GEOSL Length = 295 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 56/239 (23%), Positives = 101/239 (42%), Gaps = 10/239 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK P +S+ + +N + S+L Q Y ++E++I++D ST + L++ Sbjct: 1 MK-QPKVSVIVTCYNYATYLEGCLASILNQTYQDFELVIINDGSTDNTD-EVISRFLDND 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 ++ YI N N+G N I A GE I +D DD W P +L + + + +Y Sbjct: 59 KVRYI-NQKNTGQAIATNNGIAAAAGELIAFLDADDLWEPTKLEKQVRLFNR-DSIGVVY 116 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRR----LFYKRNIIGNQVFTWAWRFKE-CLFDTE 175 + E + L +P S R L ++ I + F + +F+ E Sbjct: 117 SRIRFMDAEGCTLDMQLEGKYYTPRSGRVTDALLFENFIPYSSTVVRKECFDKFGMFNPE 176 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 K D+D++LR+ EY E V+E I I H ++ + + + ++ Sbjct: 177 YKNGLDWDLWLRISREY-EFDFVDEYLLIYRIGHPGQLTSNVERSVRCADLIFDRFLEE 234 >UniRef50_C1DGR7 Glycosyl transferase, family 2 protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGR7_AZOVD Length = 340 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 46/192 (23%), Positives = 88/192 (45%), Gaps = 5/192 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M P +++++P NRQ+ A+ S+L Q + ++E+++VDD ST + + ++ DP Sbjct: 1 MSRTPRVTVFIPVHNRQRYIATAVDSILAQTFGDFELLVVDDGST--DATLEVLSHYRDP 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ N N G RN+ + LA+GEYI +D DD+ P RL+ + + + Sbjct: 59 RLRVECNPRNLGLPGTRNRGLELARGEYIALLDSDDKAWPERLTRQVEVLDRHPELVQIG 118 Query: 121 --ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 + +G + PL P + FY + + +E + + + Sbjct: 119 SACDFMDAEGRRLDRVRRRPLAPDEVDASLAFY-CALTNRTIMGRTAILREYRYSEDFPS 177 Query: 179 AQDYDIFLRMVV 190 +DY++ R+ Sbjct: 178 CEDYELHQRLAR 189 >UniRef50_A4EB36 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4EB36_9ACTN Length = 828 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 58/284 (20%), Positives = 104/284 (36%), Gaps = 18/284 (6%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+SI +P + + + SVL Q Y NWE++++ D S W+ + +D R+ Sbjct: 288 RPLVSIVVPCYKTDRVYLRELLDSVLAQSYDNWELLLM-DASPEWDAVADLAAGAHDERV 346 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA---FL 119 N G N I A G+YI +D DD P+ L ++A L Sbjct: 347 RRFGLPGNGGIVVNTNAGIEQAMGDYIAFLDHDDILEPDALFHYVAALNNAAEGERPQVL 406 Query: 120 YANDYVCQG-EVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTE-LK 177 + ++ + Q + QP K+ + L Y N + + + E + Sbjct: 407 FCDEDMFQKTGEWGQPV-----FKTKLNVDLLYSHNCVTHFLMVEKALIDRIGTSPEDVA 461 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEM---QITSSPKKFSGYFHFYRKHKDK 234 AQDYD+ LR + V ++ G S P ++H + Sbjct: 462 GAQDYDLTLRCLAAGARFEHVAHVLYHWRVHPGSTADGSADSKPYAIEAGRLALQRHFNA 521 Query: 235 FDRASKKYQLFT--LYQIRNKRMTWRTLLTL-LSVRNGKRLADG 275 + T +Y++R L+++ + ++ D Sbjct: 522 LGVHGTVEETETPFVYRMRYALPEPSPLVSIVVPTKDHVETLDA 565 Score = 84.1 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 104/274 (37%), Gaps = 46/274 (16%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSW------EQLQQY 53 + +PL+SI +PT + + + S+ Q Y+N+E+++V++ S + E L + Sbjct: 545 EPSPLVSIVVPTKDHVETLDACVMSI-AQKATYANYEIVLVENNSEAPETFAYYESLPER 603 Query: 54 VTALND----PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLA 108 V A ++ R+ Y + N + N + A+G+Y+ +++D E +P+ + Sbjct: 604 VAAASEGKGVARVVYWLGEFNY--SQIINFGVEHAKGDYLLLLNNDTEVISPDFTEEMMG 661 Query: 109 HKQQ----LVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRR---LFYKRNIIGN--- 158 + Q+ +V +A+ V + + +S + + GN Sbjct: 662 YLQRPDAGVVGAKLYFADHLVQHAGILVGVRGALAHANQDFSAKREGYLARAVRPGNFSA 721 Query: 159 -----QVFTWAWRFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEM 212 Q+ + ++ E D D LR+ A ++ H Sbjct: 722 VTGACQMVRRDVFEQVGGYNEEFAVGFNDADFCLRVWEAGYRTIFTPYA-ELYHYE---- 776 Query: 213 QITSSPKKFSGYFHFYRKHKDKFDRASKKYQLFT 246 TS ++ +++K R ++ LF Sbjct: 777 -FTSRGRE--------EANEEKLRRWKREQALFM 801 >UniRef50_D1YZY1 Putative glycosyltransferase n=1 Tax=Methanocella paludicola SANAE RepID=D1YZY1_METPS Length = 299 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 49/210 (23%), Positives = 83/210 (39%), Gaps = 2/210 (0%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +SI +P +N+Q+ A+ SV+ Q +WE IIV+D ST + D R Y Sbjct: 2 PTVSIIIPAYNQQEYIQSALSSVIDQTLKDWEAIIVNDGSTDNTDIIAQEFIKKDSRFNY 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I+ D N G RN + LA+G+Y+ +D DD L + + ++ V + Sbjct: 62 IYQD-NKGLSEARNTGLKLAKGKYVVFLDSDDILDTQMLELTTGYLEKHVNIDIVNGAWD 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKAAQDYD 183 + + Y L + + F C FD L + +D+D Sbjct: 121 QIDDNGKAISRRFGPIIRKNYLYDLMLTNLFPVHSLLIKKAVFDYCGFFDVALNSHEDWD 180 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 ++LR + + + I+ M Sbjct: 181 MWLRALSNGYKMGYLNHLIAHYRIHPQSMT 210 >UniRef50_C1QAA4 Predicted glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAA4_9SPIR Length = 581 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 60/261 (22%), Positives = 104/261 (39%), Gaps = 9/261 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK+NPLISI +P +N + AI S L Q Y N E+I+VDD S + V + D Sbjct: 1 MKSNPLISIIIPVYNGENYIKDAINSALNQTYRNIEVIVVDDGS--KDDTANIVKSFRD- 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 +ITYI+ N G + N AI A+G+YI+ + DD + PN++ + + + +Y Sbjct: 58 KITYIY-KNNGGVASALNYAIKEAKGDYISWLSHDDIYYPNKIEEEVNELENIDDKTVIY 116 Query: 121 ANDYVCQGEV----YSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTE 175 + ++ + + Y + + +I G + F F+ Sbjct: 117 SGFEFVNEKLELITVFENSYKTEYRRLNNNFYSILLSDIGGCTLLIPKEVFSNVGFFNEN 176 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 L QDYD + RM + + + I+ + + + + Sbjct: 177 LLCVQDYDFWFRMFKHGYKVKYIPKVLLQYRIHKKQDSNSKKDLLRNEGNKLWINMLKNI 236 Query: 236 DRASKKYQLFTLYQIRNKRMT 256 + + F Y + NK Sbjct: 237 NDDEYQSIFFNRYNVLNKIKN 257 Score = 41.4 bits (96), Expect = 0.038, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 76/230 (33%), Gaps = 32/230 (13%) Query: 29 RQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHNDINSGACAVRNQAIMLAQGEY 88 Q+Y+N E+I + T+ + + I N N + + G + Sbjct: 315 EQNYNNIEIIFI----TNNKLNIDFNIEYK------IFNPEKD------NNILDIIDGNF 358 Query: 89 ITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVCQ--GEVYSQPASLPLYPKSPYS 146 I I++++ ++L + + Y+N Y + ++Y + SL + Sbjct: 359 IQFINNNEILIKDKLHSQFNEFIEAPSIDISYSNYYFDKDNNKIYPKLDSLSFNEDKQFE 418 Query: 147 RRLFYKRNIIG---NQVFTWAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQ 203 +++ N I + F++ + DY+ +++ YG+ + + Sbjct: 419 DMIYFWNNPIYIPLCSMLIKRKVFEKI----NMTYNNDYNTIMQLSF-YGKMKFINKYFF 473 Query: 204 ILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNK 253 + + + + +K DR L Y R+ Sbjct: 474 YYTDDF------QNTNYYIKVQNEINNNKYIIDRFKDYILLDNFYLYRSN 517 >UniRef50_A1WXB8 Glycosyl transferase, family 2 n=1 Tax=Halorhodospira halophila SL1 RepID=A1WXB8_HALHL Length = 333 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 63/276 (22%), Positives = 106/276 (38%), Gaps = 24/276 (8%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + P +S+ +P N AI+SVL+Q E++++DD ST ++ V + P Sbjct: 1 MASTPSVSVIIPALNCAPFISAAIESVLKQGGDQLEVLVIDDGST--DETANIVRSYPSP 58 Query: 61 RITYIH-NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 + + SGA A RN + A G I +D DD W P +L++ + + A + Sbjct: 59 -VHLLRTQTPRSGAAAARNVGLRHAHGRLIAFLDGDDVWLPGKLALQTSTLENHPDVALV 117 Query: 120 YAND--YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-------------VFTWA 164 + E Y Q + + FY I + V Sbjct: 118 CTRGGRWEPDNESYEQAHQRFITADATSEPTDFYPAGWIYHMLLLRPAWVWTSTVVMRRE 177 Query: 165 WRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 K FD L+ QDY+ +LR E + +V + + + T+ P+ + Sbjct: 178 LVEKVGEFDESLRLGQDYEYWLRCSRE-TQILRVAKVLALYRQH--ANNSTNKPRNRNFE 234 Query: 225 FHFYRKHKDKFDRASKKYQLFTLYQI--RNKRMTWR 258 ++ D++ R Q QI R RM +R Sbjct: 235 LEVVQRALDRWGRHGPDGQAADKKQIKKRFARMHFR 270 >UniRef50_C8WIW4 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WIW4_EGGLE Length = 832 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 55/277 (19%), Positives = 109/277 (39%), Gaps = 11/277 (3%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P SI +P + + ++SV RQ Y WE+I+V+ S +L + + ++D R+ Sbjct: 304 PTFSIVVPLYRTPVEYFRSMLQSVQRQSYGGWELILVN-ASPDDGRLVEELENVSDARVR 362 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 ++ + N G N I +AQG+++ +D DD P+ L + +Y + Sbjct: 363 VVNLEENHGIAENTNAGIRVAQGDFVAFLDHDDVLAPDALFGYARAVCDDPLVDIVYCD- 421 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDY 182 + + S + K +S L +N I + + +E L D AQDY Sbjct: 422 ---EDRIDSVGVHHAPFFKPDFSPELLNAQNYITHFLAVRKSLIEEIGLLDATFDGAQDY 478 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY--FHFYRKHKDK--FDRA 238 D+ LR + ++ + S K ++G H + + Sbjct: 479 DLVLRATERSRSVAHIPRVLYHWRMHEASTSMNSDSKSYAGEAGRAALEAHCRRCGWSAK 538 Query: 239 SKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 ++ L Y++R++ + + L+ ++ L Sbjct: 539 VERTDLPFAYRVRHELVERPKVSILIPSKDKTSLLSA 575 Score = 86.8 bits (214), Expect = 8e-16, Method: Composition-based stats. Identities = 44/260 (16%), Positives = 93/260 (35%), Gaps = 20/260 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +SI +P+ ++ L ++S++ + Y N+E++++++ S E Y + Sbjct: 556 ERPKVSILIPSKDKTSLLSACVESIVEKTSYDNYEIVVIENNSVEPETFAYYEEVQRLGK 615 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQ----LVTH 116 + + N + G+Y+ +++D E TPN L L + Q +V Sbjct: 616 ARVVEWPDTFNFSKIMNFGVRQCDGDYVLLLNNDTEVITPNYLETMLGYFQAEGVGVVGA 675 Query: 117 AFLYANDYVCQGEVYSQPASLPLY-----PKSP--YSRRLFYKRNII----GNQVFTWAW 165 L+ +D V G V P + PK Y R +N+ Q+ + Sbjct: 676 KLLFPDDTVQHGGVVLGPYRSAGHLFASLPKDDLGYFCRAVLPQNLSAVTGACQLVPRSV 735 Query: 166 RFKECLFDTELK-AAQDYDIFLRMVVEYGEPWKVEEATQILHI--NHGEMQITSSPKKFS 222 + + + D D L++ +A + + G + + ++ Sbjct: 736 FEEVGGYTEAFEVGLNDVDFCLKVREAGYRVVWTPDALLYHYEFSSRGRDREGAQAERAE 795 Query: 223 GYFHFYRKHKDKFDRASKKY 242 R ++ A Y Sbjct: 796 REIALLRTRWPRYFEAGDPY 815 >UniRef50_B9XD27 Glycosyl transferase family 2 n=1 Tax=bacterium Ellin514 RepID=B9XD27_9BACT Length = 305 Score = 186 bits (474), Expect = 5e-46, Method: Composition-based stats. Identities = 58/274 (21%), Positives = 106/274 (38%), Gaps = 23/274 (8%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S +PT+NR + ++SVL Q + E+I+VDD ST + V D R+ I Sbjct: 2 LVSAIIPTYNRARTIGSTLESVLSQTWKELEVIVVDDGST--DDTAAVVAGYGD-RVRLI 58 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA-------- 117 N G RN I ++GE ++ +D DD W P + + + Q+ Sbjct: 59 RQ-RNQGPSVARNTGIKASKGEIVSFLDSDDSWLPEKTARQVNLMQRTAGFGVECCVCNA 117 Query: 118 -FLYANDYVCQGEVYSQPASLPLYPKSPYSR--RLFYKRNIIGNQVFTWAWRF--KECLF 172 +YA + + P P+ ++ ++ R + NQV + F Sbjct: 118 RMVYATQTINS---FDVADLRPESPEGVWTNPAQVLMDRFLFFNQVVAVRRELLERTGYF 174 Query: 173 DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 +L+ +DYD+ +R+ + + + E I E ++ + ++ Sbjct: 175 REDLRIMEDYDMAMRLSLSGPWGY-IAEPLVIW-QGGAENSLSQGVSQVDACLQTFQILN 232 Query: 233 DKFDRASKKYQL-FTLYQIRNKRMTWRTLLTLLS 265 D + L L R + + R + LS Sbjct: 233 DLKSSSQWGPLLPKHLLHRRMRVLKHRGVAFQLS 266 >UniRef50_B5JJR3 Glycosyl transferase, group 2 family protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJR3_9BACT Length = 1027 Score = 186 bits (474), Expect = 5e-46, Method: Composition-based stats. Identities = 58/238 (24%), Positives = 98/238 (41%), Gaps = 9/238 (3%) Query: 6 LISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQ-YVTALNDPRIT 63 ISI +P ++ ++ + I SVL Q Y+NWE+ I DDCS + + A +D RI Sbjct: 496 KISIILPVYDVEEIWLRKCIDSVLSQIYTNWELCIADDCSNKPHIKEVLHQYAQSDSRIK 555 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN- 122 N N A LA G Y+ +D DDE P+ L+ T +Y++ Sbjct: 556 ITFRKENGHISQASNSAASLATGTYLALLDHDDELAPHALAKVAESISTNPTAKLIYSDE 615 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQD 181 D + Q + P + KS ++ LF N+I + F++ F + AQD Sbjct: 616 DKIDQNGLRHGP-----HFKSDWNYDLFLGCNMISHLGVYRKDIFEQTGGFRGGYEGAQD 670 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 +D+ LR + + + H + E + + + +K + R + Sbjct: 671 WDLALRFIEIIPSSDIIHIPEILYHWRNIEGSTAHNIEHKNYAIAAQKKAIEAHLRRT 728 Score = 79.9 bits (196), Expect = 8e-14, Method: Composition-based stats. Identities = 45/248 (18%), Positives = 92/248 (37%), Gaps = 21/248 (8%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P +SI +PT +R L I S+L++ YSN++++I+D+ S + ++ + + I Sbjct: 752 QPTVSIIIPTKDRIDLLKPCIDSILKKTNYSNYKIVIIDNGSELEDTIRYLNSINSYENI 811 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLVTHA 117 + + + + N+AI E I +++D D W + ++ +V Sbjct: 812 SILKDSEEFNYSRLNNKAISATSSEIICLLNNDIEVTQDNWLNEMVRHTTRNEIGVVGAK 871 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRL---FYKRNIIG--------NQVFTWAWR 166 LY +++V G V + + R ++ +++ +F + Sbjct: 872 LLYPHNHVQHGGVIMGIGGVAAHAFKYLHRDDDGHIHRAHLVSGYSAVTGACMMFKKSLW 931 Query: 167 FKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 + FD LK + D D LR + A L+ E + K Sbjct: 932 SELGGFDEKNLKVSYSDIDFCLRAGKTGKKTILTPFAL--LYHKESETRGNPLSNKKDST 989 Query: 225 FHFYRKHK 232 + Sbjct: 990 QFTLEMNY 997 >UniRef50_Q2LWN5 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LWN5_SYNAS Length = 363 Score = 186 bits (474), Expect = 5e-46, Method: Composition-based stats. Identities = 49/219 (22%), Positives = 91/219 (41%), Gaps = 7/219 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + P+IS+ +P +N +I +L Q YSN+E++I+DD ST V++ DPR Sbjct: 23 NSAPMISVVLPVYNGASYLRESIDCILVQSYSNFELLIIDDGSTDDS--ASIVSSFTDPR 80 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + + N G A N+ I LA+G YI D DD P+RLS +A + Sbjct: 81 IRFYSQE-NKGLAATLNRGIALAKGAYIARQDQDDISLPDRLSKQVAFMETHPDCGMAGT 139 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLF---DTELKA 178 + + + ++ L + + + + F E + + + Sbjct: 140 WASILEEQKPTKRLHRHDADNLSLQFDLLFDNPFVHSSLMIRKTVFDEVGVYCTNPDRQP 199 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS 217 +DY+++ R+ ++ + E I M + + Sbjct: 200 PEDYEMWSRVARKF-RVANIPEVLHIYREVPQSMSRSGN 237 >UniRef50_Q5P2A3 Glycosyl transferase, family 2 n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A3_AZOSE Length = 314 Score = 186 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 57/270 (21%), Positives = 101/270 (37%), Gaps = 23/270 (8%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M L+S+ +P +N + AI SVL+Q+Y + E+I+VDD S V A P Sbjct: 1 MNIRNLVSVVIPAYNVGPHIVEAIDSVLQQNYPHIELIVVDDGSQDDT---ADVVAKRYP 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL- 119 + I N GA RN I A+GE+I +D DD W P +L + + V + + Sbjct: 58 QAILIR-KANGGAATARNAGIRAARGEFIAFLDADDIWLPGKLRAQVEYLGARVDVSMIC 116 Query: 120 --YANDYVCQGEVYSQ------------PASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW 165 +++ + V+ P ++ +L + V Sbjct: 117 TGFSHWTSDRNGVFPDLLTMIPDQSDVAPGAIDSELSGWIYHKLLLHNFVWTTTVMMRRS 176 Query: 166 RFKECLF-DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 F D + + QDY+ FLR E ++ + + G + + Sbjct: 177 LIDRIGFYDEKFRLGQDYEYFLR-ASRETEVHRLSRIYALYRHHPGSATARGNDYNYGAS 235 Query: 225 FHFYRKHKDKFDRASKKYQLFTLYQIRNKR 254 + +D + AS + + + R + Sbjct: 236 IML--RARDTWGLASPNGESISEREFRERV 263 >UniRef50_Q12TY4 Glycosyl transferase, family 2 n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12TY4_METBU Length = 274 Score = 186 bits (473), Expect = 7e-46, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 94/232 (40%), Gaps = 3/232 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 K + IS+ MP +N + +I+S+L Q Y+++E IIVDD ST + D R Sbjct: 4 KESVKISVVMPAYNAEAYLKDSIESILDQTYTDFEFIIVDDASTDNSHEIIEEYSKKDKR 63 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N+IN G R + ++G+YI D DD RL + ++ + Sbjct: 64 IVVLRNEINLGIAETRTKGTKYSKGKYIAVSDADDISILTRLEKQYNYLEEHKDCGVVGG 123 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSR-RLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAA 179 + + Y R RLF + F+ +D + A Sbjct: 124 FIELFDSDTSKIIGVRKYYEDDANLRKRLFLYCPVAQPVCMMRKEVFETLGYYDPKYPPA 183 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 +D D++ R+ +Y + ++E ++ I+++ K + +K+ Sbjct: 184 EDLDLWFRIGTKY-KFANIQEILLKYRVHKKSATISTTKKMEAMTLKIRKKY 234 >UniRef50_A3U8H8 Glycosyl transferase n=1 Tax=Croceibacter atlanticus HTCC2559 RepID=A3U8H8_9FLAO Length = 272 Score = 186 bits (473), Expect = 7e-46, Method: Composition-based stats. Identities = 64/258 (24%), Positives = 108/258 (41%), Gaps = 15/258 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +ISI P++N+ A+ SVL Q + +WE II+DD ST + ALND R Y Sbjct: 1 MISIITPSYNKGAFLEEALNSVLSQSFKDWECIIIDDGSTDNTREIGEQWALNDKRFKYY 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + N G + RN AI A G YI +D DD + LS + Q+ + + + Sbjct: 61 Y-KKNGGVSSARNYAITKASGTYILPLDADDNIHEDYLSKIVYAFQENPNLKLVSSR--I 117 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + LP Y +RL Y F + +D E+ + +D++ + Sbjct: 118 QKFGYAHDEYVLPEYS----YKRLLYSNCFSHCSAFKKIDWERIGGYDEEIPSLEDWEFW 173 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSS-----PKKFSGYFHFYRKHKDKFDRASK 240 +R++ E E K++E H +++ F Y Y+K+K +D Sbjct: 174 IRLLDEKSEVLKIQELLFNYR-KHKTNSLSNRFYSDPDFYFEMYDLVYKKNKILYDAYFP 232 Query: 241 KYQLFTLYQIRNKRMTWR 258 + +Q R + + + Sbjct: 233 R--PIIAFQHREELLAFN 248 >UniRef50_C0YGQ4 Possible glycosyl transferase family protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YGQ4_9FLAO Length = 267 Score = 186 bits (473), Expect = 7e-46, Method: Composition-based stats. Identities = 59/266 (22%), Positives = 111/266 (41%), Gaps = 11/266 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +IS+ +P +N+++ I+ VL Q Y N+E+IIV+D S ++ + V ++D R+ Sbjct: 1 MISVVIPLYNKEKYIKETIRKVLNQTYQNFEIIIVNDGS--KDKGPEIVNEIDDSRVKLF 58 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N N G + RN I +Q EYI +D DDEW PN L + + A ++ ++ Sbjct: 59 -NKENGGVSSARNVGIEKSQFEYIAFLDADDEWLPNHLEEIHKLITKYKSSADVFVTNFA 117 Query: 126 CQ--GEVYSQPASLPLYPKSP--YSRRLFYKRNIIGNQVFTWAWR--FKECLFDTELKAA 179 + E + ++ K+ +I + + K LFD + Sbjct: 118 RKYSEEKIVDNRKQDELKEGIVKDYFKVILKKGLIHTSCVCVSKKALLKAKLFDERISRG 177 Query: 180 QDYDIFLRMVVEYGEPW-KVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +D D++ R+ EYG + V + + + K + F ++ Sbjct: 178 EDMDLWGRLAREYGIAYSPVITELYLQEAENNSRGKSDITKSIVYHLDF-SNVTSSDEKK 236 Query: 239 SKKYQLFTLYQIRNKRMTWRTLLTLL 264 K L+ Y K + + + + L+ Sbjct: 237 YLKNLLYRKYASLLKNIDFASFIKLM 262 >UniRef50_Q47DD2 Glycosyl transferase, family 2:Polysaccharide deacetylase n=4 Tax=Proteobacteria RepID=Q47DD2_DECAR Length = 672 Score = 186 bits (472), Expect = 9e-46, Method: Composition-based stats. Identities = 52/223 (23%), Positives = 89/223 (39%), Gaps = 19/223 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+IS+ +P +N ++ ++SVL QD+ E+I+VDD S+ + P +T Sbjct: 360 PMISVVIPCYNAERWLGATLRSVLAQDWPRLEIIVVDDGSSDRS---AELVRTQFPGVTL 416 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + NSG RN I A+G++I +D DD W P +L Y + Sbjct: 417 V-QQKNSGVAVARNNGIANAKGDWIAFVDADDIWLPGKLHAQWQAFTLQPGGRMAYTAWH 475 Query: 125 V-------------CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-C 170 V S ASL P L + + V F E Sbjct: 476 VWPCTNPEPEAALLKDLAQRSTDASLWDGPTGWIYPDLLEDCCVWTSTVLMHRSLFDEIG 535 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 +FD +L+ +DYD++LR + ++ + + ++ + Sbjct: 536 IFDEKLRIGEDYDLWLR-ASQVTPIVRIPKPLALYRMHPDSIT 577 >UniRef50_Q97H38 Glycosyltransferase n=1 Tax=Clostridium acetobutylicum RepID=Q97H38_CLOAB Length = 333 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 53/281 (18%), Positives = 102/281 (36%), Gaps = 10/281 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +IS+ MP +N ++ +I+S+L+Q Y ++E IIV+D S A +D RI + Sbjct: 1 MISVIMPVYNCEKYLEESIESILKQTYRDFEFIIVNDGSNDKSIDIINKYANDDNRIVVV 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 D N G N+ I A+G Y+ +D DD P R + + + L Sbjct: 61 SRDNNMGMVYSLNEGIDRAKGSYVARMDADDIALPERFERQIEYLNKNKDVDILACKVEA 120 Query: 126 CQGEVYSQPASLPLYPKSPYS-----RRLFYKRNIIGN-QVFTWAWRFKE-CLFDTELKA 178 Q + + LF + I + V K ++ K Sbjct: 121 FGDVSREQKLEREHWYNVDLNNSESIESLFLENCYIAHPSVMMKMSVLKALGGYNLNYKR 180 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR- 237 +DY+++LR + + + +EE + + H + +I + FS + + Sbjct: 181 TEDYNLWLRAIAKGYKIAMLEEKLMKIRL-HNDSKIHRDAEGFSSIRDIIQSRLEYVKEK 239 Query: 238 -ASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 K + ++ + + + D + Sbjct: 240 LKLKDFSYVIWGASNGGKIAYEKIKEVFPNAKLNGYIDKFK 280 >UniRef50_Q11L33 Glycosyl transferase, family 2 n=1 Tax=Chelativorans sp. BNC1 RepID=Q11L33_MESSB Length = 609 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 61/242 (25%), Positives = 104/242 (42%), Gaps = 17/242 (7%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+S+ +P +N Q AI+S+ Q + N E+++VDDCST A D RI Sbjct: 207 RPLVSVIVPAYNAQYTIATAIRSLQEQTWRNIEILVVDDCSTDETVSIVTKLANKDSRIQ 266 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL-YAN 122 + + NSGA RN + A+G+Y+T D DD P +L + H ++ Y+ Sbjct: 267 VLRMERNSGAYCARNLGLKNARGKYVTCHDSDDWSHPEKLQIQAEHLERHTDRIVANYSY 326 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDY 182 Y ++ + L Y + S +F + ++ E F E++ A D Sbjct: 327 LYFVDDDLRVRARRLGSYLRLNTSSLMFRREPVLS-----------ELGFWDEVRFAADS 375 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS-----GYFHFYRKHKDKFDR 237 + R+ +G + A + + + E +T+S G FYR+ +F R Sbjct: 376 EFLERLYAVFGRISVISIAKPLSLVRYREDSLTASKAFGYHGFRMGARQFYRERYLRFHR 435 Query: 238 AS 239 + Sbjct: 436 SG 437 >UniRef50_A6QAH7 Putative uncharacterized protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QAH7_SULNB Length = 315 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 56/249 (22%), Positives = 101/249 (40%), Gaps = 12/249 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M PL+++ +P +N+++ RAI SV Q Y N E+++V+D ST + V ++ D Sbjct: 1 MHKTPLVTVIIPLYNKEKWVKRAIDSVQEQTYENLEILVVNDGSTDRSV--EVVQSIKDE 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + +N G + RN+ I A+G +I +D DD W + V + + A L Sbjct: 59 RIQVLE-KVNGGVSSARNRGIEEAKGGFIAFLDADDVWFQKHIEVLVEGVARYPDAAILA 117 Query: 121 ANDYVC----QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF---KECLFD 173 C + EV + Y + L I F K LF+ Sbjct: 118 NRLVYCFEGAEKEVGEHVHTSVNYDSFDFIAALGKGNFPIHIGSTLVRTSFLQEKHLLFN 177 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR-KHK 232 ++ A+D + LR YG + + + + + + K + ++ Sbjct: 178 ESMRLAEDVNFTLR-ASRYGMVILSDYTGLVYYQDDVQSAMKQKAKTAALVPLYFDGMQD 236 Query: 233 DKFDRASKK 241 + + + KK Sbjct: 237 ESWSESEKK 245 >UniRef50_A3VRJ5 Glycosyltransferase n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VRJ5_9PROT Length = 307 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 57/264 (21%), Positives = 111/264 (42%), Gaps = 9/264 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P++S+ +P + + + + +V +Q ++ WE IIVDDCS + + +DPRI Sbjct: 10 PVVSVIIPAYKSADVLPQTLATVTQQTFTAWEAIIVDDCSPD-DIVSAIKPFADDPRIRL 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA--- 121 +H+ N GA A RN I A+G ++ +D DDEW P++L+ + + ++ Sbjct: 69 VHHAHNQGAAAARNTGIGEARGRFVAFLDADDEWHPDKLAKQVQAVEAAPHPNRVFCVTR 128 Query: 122 -NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 V + + +PA L P +F + F + + +L+A + Sbjct: 129 TKVVVDEDKWIVKPARLKA-DSEPLDEFIFVRGGFCQTSSFFLSRDLAAQIRWKDLRAGE 187 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS- 239 D+ + G ++E I H + ++ S+ K F + + + Sbjct: 188 DHLFAIE-ATAAGSYLLIDELLTIYHDEYNPNRL-SNDKTLESGRAFMDVARPLISQKAL 245 Query: 240 KKYQLFTLYQIRNKRMTWRTLLTL 263 + Y+ L ++ R L T+ Sbjct: 246 RGYEARYLGPSLLRKEPVRGLWTI 269 >UniRef50_B3PFA2 Glycosyl transferase, putative, gt2L n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PFA2_CELJU Length = 306 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 56/243 (23%), Positives = 92/243 (37%), Gaps = 10/243 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ +P +N ++ A+ SVL Q N+E+IIVDD S + + ND R+ Sbjct: 4 KVSVIIPAYNAEKFLAFAVNSVLSQTMENFELIIVDDGSVDNTREIALKFSENDMRVKCY 63 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + N G RN A GE++ +D DD W N+L L Q V + Sbjct: 64 SIE-NRGRAGARNFGCRQADGEWLAFLDADDCWADNKLERQLESVQDEVGLIYTERTWVD 122 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDT--ELKAAQDY 182 G++ P+ +L I + V F FD K QDY Sbjct: 123 ENGDILENQPEKYELPQGYIYEKLIDGNYICTSSVILKRDLFLTVGGFDESPNYKNCQDY 182 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+++R+ + + E ++ ++ K F + R D+ + Y Sbjct: 183 DLWIRISP-LAKFVSLRETLCFYRLHDD-----NAHKNFYSRYIGLRSCMDRLREVGQNY 236 Query: 243 QLF 245 L Sbjct: 237 NLI 239 >UniRef50_Q1PUV4 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUV4_9BACT Length = 1678 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 66/258 (25%), Positives = 110/258 (42%), Gaps = 25/258 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR--I 62 PL+SI +PT+NR A++S++ Q Y E I+V+D + + D R I Sbjct: 1310 PLVSIIIPTYNRPGTLKIALESIVAQTYKFIETIVVNDAGID---VLNVIDTFRD-RLSI 1365 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 Y ++IN A RN AI A G+YI +DDDD + P+ + ++ + T+ Y + Sbjct: 1366 KYFVHNINKDRSAARNTAIKHASGKYIAYLDDDDIFYPDHIETLVSFLETNNTYKVAYTD 1425 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE------CLFDTEL 176 Y E + + PYS Y R + GN + T + +FD L Sbjct: 1426 AYRAFQEK-EHGKYITKNKEIPYSFDFDYDRILTGNFIPTPCIMHYKSCIDVVGIFDETL 1484 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY---------FHF 227 A +D+D+++RM ++ + + + T T K+ Y Sbjct: 1485 GAHEDWDLWIRMSTKF-QFAHIRKITCEYSWREDGSSTTFGRKEVMDYTRNVVLQRGIKI 1543 Query: 228 YRK--HKDKFDRASKKYQ 243 YR+ + F+RA +Q Sbjct: 1544 YREKQIRGLFNRAEASFQ 1561 Score = 133 bits (335), Expect = 7e-30, Method: Composition-based stats. Identities = 50/243 (20%), Positives = 93/243 (38%), Gaps = 11/243 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQD-YS--NWEMIIVDDCSTSWE--QLQQYVTALNDP 60 +S + T+N ++ ++ + Q Y E+I++D CS E + ++ + D Sbjct: 1065 KVSAIVSTYNSERFIRGCLEDLTNQTSYKKGALEIIVIDSCSPQNEQAIVDEFQKSYRD- 1123 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 I YI + A N+AI +A+GEYIT + DD + L + + +Y Sbjct: 1124 -IVYIRTEKRETIYAAWNRAIKIAKGEYITNANTDDRHRKDGLEILANELDENPDVVLVY 1182 Query: 121 ANDYV--CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 N + + E + + ++ ++R K IG Q E FD + Sbjct: 1183 GNQIITETENETFEHHKPVGVFQWPDFNRNYLLKVCSIGPQPMWRKLVHSEFGYFDDIFQ 1242 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 A DY+ +LR+ +Y + E + + + + K+K D Sbjct: 1243 VAGDYEFWLRISQKY-HFKHIPEYLGLYLRSPQSAEYRNQELTQQETLVIQEKYKKLKDI 1301 Query: 238 ASK 240 K Sbjct: 1302 QEK 1304 >UniRef50_C6MKP2 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M18 RepID=C6MKP2_9DELT Length = 305 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 56/238 (23%), Positives = 92/238 (38%), Gaps = 16/238 (6%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P +S+ MP +N + AI S+ Q Y NWE++IVDD ST +YV + DPR+ Sbjct: 2 QQPKVSVVMPVYNTRPYLEEAIASIFAQTYRNWELVIVDDFSTDGSW--EYVCGIRDPRV 59 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 N+ N N+ I LA GEYI +D DD P R+ + + ++ + Sbjct: 60 RVARNERNMRNSYTLNRGIALATGEYIAKMDADDVSFPERIERQMEYLREHPEVDAVGCG 119 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--------GNQVFTWAWRFKECLFDT 174 Y ++ ++P R + R + + F +D Sbjct: 120 LYRVDRDLKLITVNIPPAAHRDIIRFIAPGRKFVFGPSFPITDGCLVARRSWFLRWQYDP 179 Query: 175 ELKAAQDYDIFLRMVVEYGEPW-KVEEATQILH---INHGEMQITSSPKKFSGYFHFY 228 + AQD+D LR Y + ++E + + + T + G Y Sbjct: 180 AIPYAQDFDQNLR--SHYDSVFANLQEPLYLYRRVGVTSSWLSQTKAVWYKLGSLSLY 235 >UniRef50_A5UVV3 Glycosyl transferase, family 2 n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UVV3_ROSS1 Length = 331 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 50/220 (22%), Positives = 96/220 (43%), Gaps = 7/220 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S + T+N AI+SVL Q E++++DD ST + + V + Y++ Sbjct: 10 VSAVITTYNAAHFVGEAIESVLAQTRLPDEIVVIDDGST--DDTARVVARYASHGVRYVY 67 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 N G + RN+ I G+ + +D DD W P + + LA+ A + + +V Sbjct: 68 QQ-NGGPGSARNRGIRETSGDLVAFLDADDLWLPPKTEMQLAYVATHPDVALVSCDRWVW 126 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ---VFTWAWRFKECLFDTELKAAQDYD 183 + E + + +RR RNI+GN + FDT ++ A+++D Sbjct: 127 KIEKERRYLERFGPHYNLDARREVMVRNIVGNPSQVIARRDALIAAGGFDTTMRWAEEWD 186 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG 223 +++R+ G V + + + G + + ++ SG Sbjct: 187 LWVRIAAR-GSLGFVHQPLSVYRWHSGGLAHENMWRRLSG 225 >UniRef50_A1ANW6 Glycosyl transferase, family 2 n=2 Tax=Desulfuromonadales RepID=A1ANW6_PELPD Length = 261 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 62/267 (23%), Positives = 109/267 (40%), Gaps = 14/267 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M P +S+ +P +N+ Q AI SVL Q +S+ E+I+VDD S+ ++ + ALN Sbjct: 1 MMPMPEVSVIIPCYNQGQYLHEAIDSVLGQTFSDLEIIVVDDGSSDP-ATREVLDALNRT 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 + N G A RN I A G YI +D DD P+ + LA + +Y Sbjct: 60 ATRLLRR-KNGGLAAARNSGINAAHGRYILPLDCDDRIAPDYIRQALAAFESDPLCGIVY 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSR-RLFYKRNIIGNQVFTWAWRFKECLFDTEL-KA 178 C+ E + A + P+SR R+ I + ++ + + +D L + Sbjct: 119 -----CRAEKF--GAEQGTWRLPPFSRLRMGLGNVIFCSALYQKSDWQRVGGYDENLQRG 171 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK-KFSGYFHFYRKHKDKFD- 236 +D+D +L ++ + + M + P K + + KH D F Sbjct: 172 WEDWDFWLALLELGLTVHCLPYVGFHYRKSESSMASSMDPVLKANLHRQLMHKHPDFFGL 231 Query: 237 -RASKKYQLFTLYQIRNKRMTWRTLLT 262 + + L Y++ + R + Sbjct: 232 LSRTPQPLLEIYYRMAGSGIYRRIKRS 258 >UniRef50_B7K6R4 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7K6R4_CYAP7 Length = 374 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 47/219 (21%), Positives = 83/219 (37%), Gaps = 10/219 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ +P +N ++ R ++SVL Q Y N E++++DD S A D R+ Sbjct: 11 PLVSVIIPAYNAEKFIERTLRSVLSQTYQNIEVLVIDDGSQDKTAEIVQTIANQDQRVIL 70 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK-QQLVTHAFLYAND 123 + NSG A RN I A+GE+I ID DD W P + + + + +Y+ Sbjct: 71 LRQ-ANSGVAAARNLGIQNAKGEFIAPIDADDIWYPENIEKQVQCMLEGGTSVGLVYSWS 129 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI--IGNQVFTWAWRFKECLFDTELK---- 177 A + L + + + ++ + K Sbjct: 130 VDIDENDQLTGAFRVAEIEGDVYGTLVCHYFLGNASCALIRRTCLEEVSSYNCQFKTHNL 189 Query: 178 -AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 +D+++ LR+ Y + V + G M Sbjct: 190 QGCEDWELALRIAENY-QFKAVRDFLVGYRKLPGTMSSN 227 >UniRef50_Q3SLZ0 Glycosyltransferase n=1 Tax=Thiobacillus denitrificans ATCC 25259 RepID=Q3SLZ0_THIDA Length = 318 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 62/238 (26%), Positives = 95/238 (39%), Gaps = 12/238 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M PL+SI MP ++ ++ SVL Q +++WE++ VDD S + ++ A DP Sbjct: 1 MAAKPLVSIVMPCYDAAAHLPASVGSVLAQTFADWELVAVDDGS--RDATLAWLQAQRDP 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N G A RN + A+G ++ +D DD W P+ L + Q Y Sbjct: 59 RIRVLSQP-NGGVSAARNAGLAAARGRHVAFLDADDTWAPDFLDAMVNALQARPDAVLAY 117 Query: 121 AN-DYVCQGEVYSQPASLPLYPKSPYSRRLFYK-RNIIGNQVFTWAWRFKECLFDTELKA 178 V +G+ P P Y + LF R I + FD L Sbjct: 118 CGWQNVGRGDKAGAPFVPPDYENPDKAETLFAGCRWPIHAALVRRDAVTAADGFDPRLTN 177 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP-----KKFSGYFHFYRKH 231 A+DY ++LR+ + G +V H HG Q + P + H Sbjct: 178 AEDYALWLRVALS-GAIVRVPRVLAFYHF-HGAGQASDDPVRAALHHLDAQRAYLESH 233 >UniRef50_B1WXJ1 Glycosyl transferase, family 2 n=2 Tax=Cyanothece RepID=B1WXJ1_CYAA5 Length = 329 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. Identities = 47/239 (19%), Positives = 96/239 (40%), Gaps = 15/239 (6%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ +PT+NR A+ SVL Q Y+++E+II DD S + + V D RI YI Sbjct: 3 KVSVCIPTYNRSHYLTYAVNSVLNQTYTDFELIICDDGS--PDNTSEVVAGFEDRRIHYI 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 ++ N G ++G+Y DDDD TP L + + F+ N +V Sbjct: 61 RHEKNMGRSRNMRSGFEASKGDYFIKFDDDDALTPEFLEKTVNILDKNPQVDFVCTNHWV 120 Query: 126 CQGEVYS----QPASLPLYPKSPYSRRL--------FYKRNI-IGNQVFTWAWRFKECLF 172 + + + K + FY +++ +G+ +F ++ + Sbjct: 121 IDKDSKRVESATKDNAVRWGKDKLEEGIIHDLDWQTFYYQSLQVGSTLFRFSCLKEVDYM 180 Query: 173 DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 D + +D+D+ +R+ + + + + + G+ + + + + Sbjct: 181 DPKADGCEDFDLLVRLALIGKVSYFLPDYLMEYRFHGGQTSLGQNLHFLQAKLYCIENY 239 >UniRef50_UPI0001788F1F glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F1F Length = 1106 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. Identities = 57/273 (20%), Positives = 109/273 (39%), Gaps = 2/273 (0%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 PL+SI +P +N + I S + Q Y + E++IVDDCS + T N+ + Sbjct: 575 NEYPLVSIIIPVYNNVEYLASCIDSAINQTYRHIEVVIVDDCSPDPKVSSILNTYKNNKK 634 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + N+ N G +N + ++G I +D DD N + L + + ++F Sbjct: 635 VRLFKNEYNQGISKTQNICMAKSEGSIIAFLDCDDILELNAIEKCLEYWEPGTKYSFTNR 694 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 E + + L + + L + ++ + K +F++E AQD Sbjct: 695 IHINEDSEEIGRFSCDHLPKDNIFEDHLDVRMYASHFKMISRDVFLKVGVFNSEYDGAQD 754 Query: 182 YDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 YD+ LR+ Y V E I+ + I S+ K+ + + K + Sbjct: 755 YDMVLRVAFHYPNSAFVHVPEFLYKHRIHTNQKSIKSNDKQKNMSIVISDQAKLRRSIRD 814 Query: 240 KKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRL 272 + + I + +TL + S++N ++ Sbjct: 815 GVFHKLISFIIISFGKEDQTLQCIQSIKNTVKI 847 Score = 45.2 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 54/119 (45%), Gaps = 9/119 (7%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYS-NWEMIIVDDCSTSW--EQLQQYVTALNDPRI 62 LIS + ++ ++ ++ I+S+ E+I+ ++ S+ +++++ + P + Sbjct: 820 LISFIIISFGKEDQTLQCIQSIKN-TVKIPHEIILFENGSSESCVSFIKEHIEGI--PGV 876 Query: 63 TYIHNDINSGACAVRNQAIMLAQ--GEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 I+N N G R +A+ A YI+ +D+D E T L L ++ + Sbjct: 877 KVIYNSSNLGPAGGRKEAMKYATPNSYYIS-LDNDIEVTEGWLEELLVRAEESDDIGSV 934 >UniRef50_B8IAF3 Glycosyl transferase family 2 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IAF3_METNO Length = 847 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. Identities = 48/215 (22%), Positives = 86/215 (40%), Gaps = 6/215 (2%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ +P +N AI SVL Q + N E+++VDD ST + Q + D + Sbjct: 15 EEGPLVSVIIPVFNGGNWLAEAIDSVLAQTWQNLEVLVVDDGSTDGGETQAVAASYGD-Q 73 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I YIH N G + N I +GEY + + DD + +++ + + L A ++ Sbjct: 74 IRYIHKP-NGGVASALNVGIREMKGEYFSWLSHDDLYRSEKVAAQVEALRGLSRPAIAFS 132 Query: 122 NDYVCQ--GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELKA 178 + + G++ + S + + G + F+ FD L Sbjct: 133 DFVLIDRFGQLLHEVPVTNQIAPSDNPVWHVMEGMLNGCALLIPRICFEVAGTFDESLPT 192 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 QDY ++ RM + V + + Sbjct: 193 TQDYQLWFRMSRHF-RFVPVPGYLVQSRQHPDQGS 226 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77414 Putative colanic acid biosynthesis glycosyl tran... 318 1e-85 UniRef50_B7K7D2 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 290 3e-77 UniRef50_A5W7E0 Glycosyl transferase, family 2 n=1 Tax=Pseudomon... 287 3e-76 UniRef50_B4VZW9 Methyltransferase domain family n=2 Tax=Cyanobac... 286 4e-76 UniRef50_B9BSY2 Glycosyl transferase, family 2 n=4 Tax=Proteobac... 284 2e-75 UniRef50_UPI0001C3906A glycosyl transferase family protein n=1 T... 282 8e-75 UniRef50_A0LST8 Glycosyl transferase, family 2 n=1 Tax=Acidother... 282 1e-74 UniRef50_C2DBA9 Family 2 glycosyl transferase n=22 Tax=Enterococ... 280 3e-74 UniRef50_C7QLJ5 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 280 5e-74 UniRef50_B2T0J8 Glycosyl transferase family 2 n=2 Tax=Burkholder... 279 8e-74 UniRef50_B1ZZR1 Glycosyl transferase family 2 n=1 Tax=Opitutus t... 278 2e-73 UniRef50_B2JFA3 Glycosyl transferase family 2 n=10 Tax=Proteobac... 276 5e-73 UniRef50_C2H8B4 Family 2 glycosyltransferase n=13 Tax=Enterococc... 275 9e-73 UniRef50_D0HB89 Glycosyl transferase group 2 family protein n=2 ... 275 1e-72 UniRef50_B5WL01 Glycosyl transferase family 2 n=2 Tax=Burkholder... 275 2e-72 UniRef50_Q7BG51 WsaE n=2 Tax=Bacillaceae RepID=Q7BG51_BACST 274 2e-72 UniRef50_C6XS42 Glycosyl transferase family 2 n=1 Tax=Hirschia b... 274 3e-72 UniRef50_B4U6C1 Glycosyl transferase family 2 n=1 Tax=Hydrogenob... 273 4e-72 UniRef50_UPI000174589B glycosyl transferase, family 2 n=1 Tax=Ve... 272 1e-71 UniRef50_A2VSE1 Glycosyl transferase n=1 Tax=Burkholderia cenoce... 271 1e-71 UniRef50_B3PFZ6 Glycosyl transferase, putative, gt2E n=1 Tax=Cel... 271 3e-71 UniRef50_D2RDR5 Glycosyl transferase family 2 n=1 Tax=Archaeoglo... 270 3e-71 UniRef50_B1JX11 Glycosyl transferase family 2 n=1 Tax=Burkholder... 270 3e-71 UniRef50_UPI0001AF4953 glycosyl transferase, group 2 family prot... 268 2e-70 UniRef50_C6JDU8 Putative uncharacterized protein n=1 Tax=Ruminoc... 268 2e-70 UniRef50_A0YLN7 Glycosyl transferase, group 2 family protein n=3... 268 2e-70 UniRef50_B1FXK1 Glycosyl transferase family 2 n=2 Tax=Burkholder... 267 3e-70 UniRef50_A8TT37 Glycosyl transferase, group 2 family protein n=2... 267 4e-70 UniRef50_C3RR62 Glycosyl transferase n=3 Tax=Bacteria RepID=C3RR... 266 8e-70 UniRef50_Q094Y6 O-antigen biosynthesis protein RfbC n=1 Tax=Stig... 265 9e-70 UniRef50_B3E8X3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 265 1e-69 UniRef50_D2LU70 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 264 2e-69 UniRef50_C3RR63 Glycosyl transferase n=2 Tax=Bacteria RepID=C3RR... 263 5e-69 UniRef50_C5A2Y9 Glycosyltransferase, family 2 n=1 Tax=Thermococc... 263 7e-69 UniRef50_A6QB23 Glycosyl transferase n=1 Tax=Sulfurovum sp. NBC3... 262 1e-68 UniRef50_D0L0V1 Glycosyl transferase family 2 n=2 Tax=Gammaprote... 262 1e-68 UniRef50_Q7U8H3 Putative glycosyltransferase family 2 protein n=... 262 1e-68 UniRef50_Q116A2 Glycosyl transferase, group 1 n=1 Tax=Trichodesm... 261 2e-68 UniRef50_Q46FX2 Putative uncharacterized protein n=1 Tax=Methano... 260 5e-68 UniRef50_B7DLY9 Glycosyl transferase family 2 n=1 Tax=Alicycloba... 259 8e-68 UniRef50_D0IZR1 Glycosyl transferase, family 2 n=1 Tax=Comamonas... 258 1e-67 UniRef50_Q093P1 Glycosyl transferase, group 2 family protein (Fr... 258 2e-67 UniRef50_A1U141 Glycosyl transferase, family 2 n=1 Tax=Marinobac... 257 2e-67 UniRef50_Q314K1 Glycosyltransferases involved in cell wall bioge... 257 3e-67 UniRef50_B8D172 Glycosyl transferase family 2 n=4 Tax=Bacteria R... 257 3e-67 UniRef50_C5RQE2 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 257 4e-67 UniRef50_Q314K2 Glycosyltransferases involved in cell wall bioge... 256 6e-67 UniRef50_B6ARX8 Putative glycosyl transferase, family 2 n=1 Tax=... 256 6e-67 UniRef50_C6LFR6 Glycosyl transferase family protein n=3 Tax=Clos... 256 7e-67 UniRef50_B8HRA4 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 256 7e-67 UniRef50_C8WIW4 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 256 8e-67 UniRef50_A6M2M4 Glycosyl transferase, family 2 n=1 Tax=Clostridi... 255 9e-67 UniRef50_Q67PP6 Glycosyl transferase n=1 Tax=Symbiobacterium the... 255 1e-66 UniRef50_A6C910 Truncated O-antigen biosynthesis protein n=1 Tax... 255 1e-66 UniRef50_A5ZX57 Putative uncharacterized protein n=1 Tax=Ruminoc... 255 1e-66 UniRef50_C2P8I0 Family 2 glycosyltransferase n=10 Tax=Bacteria R... 254 3e-66 UniRef50_Q8U2R3 Glycosyl transferase n=2 Tax=Pyrococcus RepID=Q8... 254 3e-66 UniRef50_C6J2S1 Glycosyltransferase n=1 Tax=Paenibacillus sp. or... 253 3e-66 UniRef50_C0X9N7 Group 2 family glycosyl transferase n=24 Tax=Ent... 253 4e-66 UniRef50_A1W3G7 Glycosyl transferase, family 2 n=1 Tax=Acidovora... 253 7e-66 UniRef50_A0B7S5 Glycosyl transferase, family 2 n=1 Tax=Methanosa... 252 9e-66 UniRef50_B0C0T9 Glycosyl transferase, group 2 family protein n=1... 251 1e-65 UniRef50_A3CRX6 Glycosyl transferase, family 2 n=1 Tax=Methanocu... 251 2e-65 UniRef50_C0CNU3 Putative uncharacterized protein n=1 Tax=Blautia... 250 3e-65 UniRef50_Q8YSM1 Alr3063 protein n=7 Tax=Cyanobacteria RepID=Q8YS... 250 3e-65 UniRef50_Q2YCB6 Glycosyl transferase, family 2 n=1 Tax=Nitrososp... 250 4e-65 UniRef50_Q3A3M0 Glycosyltransferase n=1 Tax=Pelobacter carbinoli... 249 5e-65 UniRef50_A0YSS8 Putative uncharacterized protein n=1 Tax=Lyngbya... 249 7e-65 UniRef50_B5JJR3 Glycosyl transferase, group 2 family protein n=1... 249 8e-65 UniRef50_B4RT58 Putative glycosyltransferase family 2 protein n=... 249 9e-65 UniRef50_Q5FRX1 O-antigen biosynthesis protein RfbC n=1 Tax=Gluc... 248 1e-64 UniRef50_Q1Q4P0 Similar to glycosyltransferase family 2 n=1 Tax=... 248 1e-64 UniRef50_A7VZ85 Putative uncharacterized protein n=1 Tax=Clostri... 248 1e-64 UniRef50_C7G777 Glycosyl transferase, group 1 n=1 Tax=Roseburia ... 248 2e-64 UniRef50_B1ZJV9 Glycosyl transferase family 2 n=7 Tax=Alphaprote... 248 2e-64 UniRef50_A0M6I2 TuaG-like glycosyl transferase n=2 Tax=Flavobact... 248 2e-64 UniRef50_A1AUB0 Glycosyl transferase, family 2 n=1 Tax=Pelobacte... 248 2e-64 UniRef50_C1DRS5 Glycosyl transferase, family 2 n=1 Tax=Azotobact... 248 2e-64 UniRef50_Q8YSM2 Alr3062 protein n=6 Tax=Nostocaceae RepID=Q8YSM2... 248 2e-64 UniRef50_C0B9B1 Putative uncharacterized protein n=1 Tax=Coproco... 247 2e-64 UniRef50_Q8PGL8 Truncated O-antigen biosynthesis protein n=2 Tax... 247 3e-64 UniRef50_C6JDV1 Glycosyl transferase n=5 Tax=Clostridiales RepID... 246 7e-64 UniRef50_A3I1Y0 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1... 246 9e-64 UniRef50_B0NG36 Putative uncharacterized protein n=1 Tax=Clostri... 245 1e-63 UniRef50_A9H3G6 Glycosyl transferase family 2 n=1 Tax=Gluconacet... 245 1e-63 UniRef50_D0L0W1 Glycosyl transferase family 2 n=1 Tax=Halothioba... 244 2e-63 UniRef50_B0G3W1 Putative uncharacterized protein n=1 Tax=Dorea f... 244 2e-63 UniRef50_B4AD21 Glycosyltransferase n=17 Tax=Enterococcus faecal... 244 3e-63 UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=... 244 3e-63 UniRef50_A5G2M9 Glycosyl transferase, family 2 n=1 Tax=Acidiphil... 244 3e-63 UniRef50_C7TKV7 Glycosyl transferase, group 2 n=5 Tax=Lactobacil... 244 3e-63 UniRef50_B0TYV9 Putative teichuronic acid biosynthesis glycosyl ... 244 3e-63 UniRef50_B9JE99 Glycosyl transferase n=1 Tax=Agrobacterium radio... 244 3e-63 UniRef50_Q8PZ42 Glycosyltransferase n=1 Tax=Methanosarcina mazei... 243 5e-63 UniRef50_A6ELC1 TuaG n=3 Tax=Bacteroidetes RepID=A6ELC1_9BACT 243 5e-63 UniRef50_C4F932 Putative uncharacterized protein n=1 Tax=Collins... 243 5e-63 UniRef50_B7AUE9 Putative uncharacterized protein n=1 Tax=Bactero... 243 6e-63 UniRef50_B3PF93 Glycosyl transferase, putative, gt2I n=1 Tax=Cel... 243 8e-63 UniRef50_D2RIR4 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 242 1e-62 UniRef50_A0YST1 Putative uncharacterized protein n=1 Tax=Lyngbya... 242 1e-62 UniRef50_C3WFL5 Glycosyl transferase n=4 Tax=Bacteria RepID=C3WF... 242 1e-62 UniRef50_Q9K6L6 Glycosyltransferase n=1 Tax=Bacillus halodurans ... 241 2e-62 UniRef50_A9AAL8 Glycosyl transferase family 2 n=2 Tax=Methanococ... 241 2e-62 UniRef50_B5L396 WfdV n=10 Tax=Proteobacteria RepID=B5L396_ECOLX 241 2e-62 UniRef50_B1ZW34 Glycosyl transferase family 2 n=1 Tax=Opitutus t... 241 2e-62 UniRef50_A5ZV71 Putative uncharacterized protein n=2 Tax=Firmicu... 241 2e-62 UniRef50_O87159 WblC protein n=1 Tax=Vibrio cholerae RepID=O8715... 241 2e-62 UniRef50_B4S4G2 Glycosyl transferase family 2 n=1 Tax=Prosthecoc... 241 3e-62 UniRef50_B2J2I1 Glycosyl transferase, family 2 n=7 Tax=Cyanobact... 240 3e-62 UniRef50_Q60B60 Glycosyl transferase, group 2 family protein n=1... 240 3e-62 UniRef50_C0E8J2 Putative uncharacterized protein n=1 Tax=Clostri... 240 4e-62 UniRef50_A4EB36 Putative uncharacterized protein n=1 Tax=Collins... 240 4e-62 UniRef50_C0QK40 Putative glycosyl transferase (Group 2 family pr... 239 6e-62 UniRef50_A0KM86 Glycosyl transferase, group 2 family protein n=2... 239 6e-62 UniRef50_Q0ACF0 Glycosyl transferase, family 2 n=4 Tax=Ectothior... 239 6e-62 UniRef50_Q0VR09 Glycosyl transferase, group 2 family protein, pu... 239 9e-62 UniRef50_UPI0001BC32BF glycosyl transferase family 2 n=1 Tax=But... 239 9e-62 UniRef50_C0FUT5 Putative uncharacterized protein n=1 Tax=Rosebur... 239 9e-62 UniRef50_B8GFE2 Glycosyl transferase family 2 n=1 Tax=Methanosph... 238 1e-61 UniRef50_B7KAG8 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 238 1e-61 UniRef50_B4VMI0 Glycosyl transferase, group 2 family protein n=2... 238 2e-61 UniRef50_C2SSX0 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 238 2e-61 UniRef50_Q12TY4 Glycosyl transferase, family 2 n=1 Tax=Methanoco... 238 2e-61 UniRef50_C0CGY4 Putative uncharacterized protein n=1 Tax=Blautia... 238 2e-61 UniRef50_Q8DGK0 Tlr2317 protein n=2 Tax=Thermosynechococcus elon... 238 2e-61 UniRef50_B8I318 Glycosyl transferase family 2 n=2 Tax=Clostridiu... 238 2e-61 UniRef50_A6BHD3 Putative uncharacterized protein n=1 Tax=Dorea l... 238 2e-61 UniRef50_O58167 Putative uncharacterized protein PH0430 n=1 Tax=... 237 3e-61 UniRef50_D1ALB8 Glycosyl transferase family 2 n=1 Tax=Sebaldella... 237 3e-61 UniRef50_B8FY70 Glycosyl transferase family 2 n=2 Tax=Desulfitob... 237 3e-61 UniRef50_UPI0001744769 glycosyl transferase, group 1 n=2 Tax=Bac... 237 4e-61 UniRef50_A0K4U8 Glycosyl transferase, family 2 n=3 Tax=Burkholde... 237 4e-61 UniRef50_Q0BQ52 Glycosyltransferase n=1 Tax=Granulibacter bethes... 237 4e-61 UniRef50_C7HAW6 Glycosyl transferase, group 2 family n=2 Tax=Fae... 236 4e-61 UniRef50_A6H2F4 Glycosyl transferase, group 2 family protein n=2... 236 4e-61 UniRef50_B0CGB5 Glycosyl transferase, family 2, putative n=1 Tax... 236 5e-61 UniRef50_B5W037 Glycosyl transferase family 2 n=2 Tax=Arthrospir... 236 6e-61 UniRef50_Q2LWN5 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 236 7e-61 UniRef50_C7M1X8 Glycosyl transferase family 2 n=1 Tax=Acidimicro... 236 7e-61 UniRef50_B7R2A7 Glycosyltransferase n=1 Tax=Thermococcus sp. AM4... 236 7e-61 UniRef50_C8QGU2 Glycosyl transferase family 2 n=1 Tax=Pantoea sp... 236 9e-61 UniRef50_B8HLM9 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 236 9e-61 UniRef50_C6P727 Glycosyl transferase family 2 n=1 Tax=Sideroxyda... 235 1e-60 UniRef50_P55465 Uncharacterized protein y4gI n=1 Tax=Rhizobium s... 235 1e-60 UniRef50_B8HRB2 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 234 2e-60 UniRef50_A0YK73 Glycosyl transferase n=5 Tax=Cyanobacteria RepID... 234 2e-60 UniRef50_C6MPM7 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 234 2e-60 UniRef50_C0FV27 Putative uncharacterized protein n=1 Tax=Rosebur... 234 2e-60 UniRef50_B3E7E3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 234 3e-60 UniRef50_B4ECE8 Glycosyltransferase n=3 Tax=Burkholderia cepacia... 234 3e-60 UniRef50_D0MF25 Glycosyl transferase family 2 n=1 Tax=Rhodotherm... 234 4e-60 UniRef50_A4SDY7 Glycosyl transferase, family 2 n=1 Tax=Chlorobiu... 233 5e-60 UniRef50_A1ZLY5 Glycosyl transferase domain protein n=1 Tax=Micr... 233 5e-60 UniRef50_B2TQH5 WsaE n=1 Tax=Clostridium botulinum B str. Eklund... 233 6e-60 UniRef50_Q72X17 Glycosyl transferase domain protein n=4 Tax=Baci... 233 7e-60 UniRef50_C0W8E0 2 family glycosyl transferase n=1 Tax=Actinomyce... 232 9e-60 UniRef50_A3W9Y7 Glucosyltransferase n=1 Tax=Erythrobacter sp. NA... 232 9e-60 UniRef50_Q21E22 B-glycosyltransferase-like protein n=2 Tax=Alter... 232 9e-60 UniRef50_Q13AK4 Glycosyl transferase, family 2 n=5 Tax=Rhodopseu... 232 1e-59 UniRef50_Q97H39 Glycosyltransferase n=2 Tax=Clostridium acetobut... 232 1e-59 UniRef50_A0KUZ3 Glycosyl transferase, family 2 n=7 Tax=Gammaprot... 232 1e-59 UniRef50_B9JW01 Glycosyl transferase n=1 Tax=Agrobacterium vitis... 231 1e-59 UniRef50_C4Z1F8 Putative uncharacterized protein n=1 Tax=Eubacte... 231 2e-59 UniRef50_A2TW95 Glycosyltransferase n=1 Tax=Dokdonia donghaensis... 231 2e-59 UniRef50_A0YLN1 Glycosyl transferase, family 2 n=2 Tax=Bacteria ... 231 2e-59 UniRef50_C4KBW6 Glycosyl transferase family 2 n=1 Tax=Thauera sp... 231 2e-59 UniRef50_C4V652 Family 2 glycosyltransferase n=1 Tax=Selenomonas... 231 2e-59 UniRef50_D2Q6R3 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 231 2e-59 UniRef50_Q6MBF6 Putative glycosyltransferase n=1 Tax=Candidatus ... 231 2e-59 UniRef50_C6M2Q8 Glycosyltransferase n=2 Tax=Neisseria RepID=C6M2... 231 2e-59 UniRef50_A6Q2S9 Glycosyl transferase n=1 Tax=Nitratiruptor sp. S... 231 3e-59 UniRef50_C6ARE2 Glycosyl transferase family 2 n=4 Tax=Rhizobium ... 231 3e-59 UniRef50_A8ZWI7 Glycosyl transferase family 2 n=2 Tax=Desulfobac... 230 3e-59 UniRef50_D0D9G6 Glycosyl transferase family 2 n=1 Tax=Citreicell... 230 3e-59 UniRef50_B4WF86 Glycosyl transferase, group 2 family protein n=1... 230 4e-59 UniRef50_Q97H38 Glycosyltransferase n=1 Tax=Clostridium acetobut... 230 5e-59 UniRef50_B1QL95 Glycosyltransferase, group 2 family n=2 Tax=Clos... 230 5e-59 UniRef50_C6MPB4 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 229 5e-59 UniRef50_B4CZV6 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 229 6e-59 UniRef50_B4D2E1 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 229 6e-59 UniRef50_C7NBG2 Glycosyl transferase family 2 n=4 Tax=Fusobacter... 229 6e-59 UniRef50_B4AW59 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 229 6e-59 UniRef50_B4WMJ9 Glycosyl transferase, group 2 family protein n=1... 229 8e-59 UniRef50_C6BYR5 Glycosyl transferase family 2 n=5 Tax=Desulfovib... 228 1e-58 UniRef50_A8FXL4 Glycosyltransferase involved in cell wall biogen... 228 1e-58 UniRef50_B8J3S6 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 228 1e-58 UniRef50_D1YZY1 Putative glycosyltransferase n=1 Tax=Methanocell... 228 1e-58 UniRef50_B7JHV6 Glycosyl transferase, group 2 family protein n=5... 228 2e-58 UniRef50_C2Y456 Teichuronic acid biosynthesis glycosyl transfera... 228 3e-58 UniRef50_D2YGE4 Putative uncharacterized protein n=1 Tax=Vibrio ... 227 3e-58 UniRef50_B0G3Y0 Putative uncharacterized protein n=1 Tax=Dorea f... 227 4e-58 UniRef50_C7RJP4 Glycosyl transferase family 2 n=1 Tax=Candidatus... 227 4e-58 UniRef50_D1YZY2 Putative glycosyltransferase n=1 Tax=Methanocell... 227 4e-58 UniRef50_O34234 Sugar transferase n=4 Tax=Vibrio cholerae RepID=... 227 4e-58 UniRef50_A4BPR7 Glycosyl transferase, group 2 family protein n=1... 226 4e-58 UniRef50_Q2SD88 Glycosyltransferase involved in cell wall biogen... 226 5e-58 UniRef50_Q6QW83 Putative glycosyl transferase n=1 Tax=Azospirill... 226 5e-58 UniRef50_Q8YSL1 Alr3073 protein n=2 Tax=Nostocaceae RepID=Q8YSL1... 226 5e-58 UniRef50_B6WY68 Putative uncharacterized protein n=1 Tax=Desulfo... 226 5e-58 UniRef50_B1ZSZ7 Glycosyl transferase family 2 n=1 Tax=Opitutus t... 226 5e-58 UniRef50_C6E5U5 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 226 6e-58 UniRef50_B3PFA2 Glycosyl transferase, putative, gt2L n=1 Tax=Cel... 226 7e-58 UniRef50_Q26CE6 Glycosyl transferase n=1 Tax=Flavobacteria bacte... 226 7e-58 UniRef50_A3W1Y5 Glycosyltransferase n=1 Tax=Roseovarius sp. 217 ... 226 7e-58 UniRef50_B7K4C4 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 226 8e-58 UniRef50_D1W2F4 Glycosyltransferase, group 2 family protein n=2 ... 226 8e-58 UniRef50_A6TBF1 Putative glucuronosyltransferase n=1 Tax=Klebsie... 226 8e-58 UniRef50_A8FHL6 Glycosyltransferase n=5 Tax=Firmicutes RepID=A8F... 226 8e-58 UniRef50_Q10VK0 Glycosyl transferase, family 2 n=3 Tax=Oscillato... 226 1e-57 UniRef50_A6LGW8 Glycosyltransferase family 2 n=3 Tax=Bacteroidal... 226 1e-57 UniRef50_B8IAC9 Glycosyl transferase family 2 n=1 Tax=Methylobac... 225 1e-57 UniRef50_A1ANW6 Glycosyl transferase, family 2 n=2 Tax=Desulfuro... 225 1e-57 UniRef50_C6EQF5 Putative uncharacterized protein n=2 Tax=Campylo... 225 1e-57 UniRef50_Q8KI14 Similar to Glycosyl transferase n=1 Tax=Pseudomo... 225 1e-57 UniRef50_Q60B59 Glycosyl transferase, group 2 family protein n=1... 225 1e-57 UniRef50_Q052L4 Glycosyltransferase n=10 Tax=Leptospira RepID=Q0... 225 1e-57 UniRef50_B9M558 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 225 1e-57 UniRef50_B0JGT1 Glycosyl transferase n=2 Tax=Microcystis aerugin... 225 1e-57 UniRef50_Q39U31 Glycosyl transferase, family 2 n=1 Tax=Geobacter... 225 1e-57 UniRef50_B7K6R4 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 225 2e-57 UniRef50_A3XRJ7 TuaG n=1 Tax=Leeuwenhoekiella blandensis MED217 ... 225 2e-57 UniRef50_Q47DD2 Glycosyl transferase, family 2:Polysaccharide de... 224 2e-57 UniRef50_C6XYA6 Glycosyl transferase family 2 n=1 Tax=Pedobacter... 224 2e-57 UniRef50_A4BIW6 Glycosyltransferase n=1 Tax=Reinekea blandensis ... 224 2e-57 UniRef50_A4J8Z3 Glycosyl transferase, family 2 n=1 Tax=Desulfoto... 224 2e-57 UniRef50_B0NG34 Putative uncharacterized protein n=1 Tax=Clostri... 224 2e-57 UniRef50_C1DM76 Glycosyl transferase, family 2 protein n=1 Tax=A... 224 3e-57 UniRef50_Q74BU3 Glycosyl transferase, group 2 family protein n=1... 223 4e-57 UniRef50_D1BX31 Glycosyl transferase family 2 n=1 Tax=Xylanimona... 223 5e-57 UniRef50_Q74D09 Glycosyl transferase, group 2 family protein n=1... 223 5e-57 UniRef50_UPI0001788F1F glycosyl transferase family 2 n=1 Tax=Geo... 223 5e-57 UniRef50_Q8PXS5 Glycosyltransferase n=1 Tax=Methanosarcina mazei... 223 5e-57 UniRef50_B9M560 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 223 5e-57 UniRef50_C0ECJ2 Putative uncharacterized protein n=1 Tax=Clostri... 223 5e-57 UniRef50_A5GEM2 Glycosyl transferase, family 2 n=1 Tax=Geobacter... 223 5e-57 UniRef50_C6MKP2 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 223 6e-57 UniRef50_Q6XQ52 Putative glycosyltransferase n=1 Tax=Escherichia... 223 6e-57 UniRef50_Q7NIJ4 Gll2189 protein n=1 Tax=Gloeobacter violaceus Re... 223 6e-57 UniRef50_B8DIX4 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 223 6e-57 UniRef50_A1AUG5 Glycosyl transferase, family 2 n=1 Tax=Pelobacte... 223 6e-57 UniRef50_Q9LA88 Putative glycosly transferase (Fragment) n=2 Tax... 223 7e-57 UniRef50_C1VEY2 Glycosyl transferase n=1 Tax=Halogeometricum bor... 223 7e-57 UniRef50_B8HRA8 Glycosyl transferase family 2 n=1 Tax=Cyanothece... 223 8e-57 UniRef50_B7RVP1 Glycosyl transferase, group 2 family n=1 Tax=mar... 223 8e-57 UniRef50_A3W9Y6 Glucosyltransferase n=1 Tax=Erythrobacter sp. NA... 223 8e-57 UniRef50_A0LJ94 Glycosyl transferase, family 2 n=2 Tax=Deltaprot... 223 8e-57 UniRef50_C6E5K3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 222 9e-57 Sequences not found previously or not previously below threshold: UniRef50_B0PBK3 Putative uncharacterized protein n=1 Tax=Anaerot... 237 4e-61 UniRef50_Q0BPG4 Glycosyltransferase n=3 Tax=Acetobacteraceae Rep... 232 1e-59 UniRef50_B0G3X3 Putative uncharacterized protein n=1 Tax=Dorea f... 223 5e-57 >UniRef50_P77414 Putative colanic acid biosynthesis glycosyl transferase wcaA n=98 Tax=Enterobacteriaceae RepID=WCAA_ECOLI Length = 279 Score = 318 bits (816), Expect = 1e-85, Method: Composition-based stats. Identities = 279/279 (100%), Positives = 279/279 (100%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP Sbjct: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY Sbjct: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ Sbjct: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK Sbjct: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR Sbjct: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 >UniRef50_B7K7D2 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7K7D2_CYAP7 Length = 1067 Score = 290 bits (744), Expect = 3e-77, Method: Composition-based stats. Identities = 65/286 (22%), Positives = 110/286 (38%), Gaps = 15/286 (5%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALN 58 +K PLIS+ MP +N + AI+SVL Q YSNWE+ I DD ST Sbjct: 531 LKYKPLISVIMPVYNPPENYLREAIESVLNQVYSNWELCIADDASTEPHVKLVLNEYLKQ 590 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI I + N N A+ LA GE+I +D DD TP+ Sbjct: 591 DSRIKVIFRENNGHISKASNSALELATGEFIALLDHDDVLTPHAFYELALLLNSHPEADM 650 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELK 177 +Y++ Q + Y K +S F + ++ + + F + Sbjct: 651 IYSDQDYMDD----QGQLINPYFKPDWSPDAFLANMYTCHLGLYRHSIVKEIGGFRVGFE 706 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 +QDYD+ LR + + + + + I K + + + + R Sbjct: 707 GSQDYDLVLRFTEKTERIFHLPDILYHWRTHAASTNINPQAKSY-AFIAARKALAEALTR 765 Query: 238 ASKKYQLFTL------YQIRNKRMTWRTLLTLLSVRN-GKRLADGI 276 + +F + Y IR + + +++ +N G+RL + + Sbjct: 766 RKEPGAVFDIPNYPGCYTIRYQIKDPELVSIIITTKNLGERLDNCL 811 Score = 100 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 42/271 (15%), Positives = 93/271 (34%), Gaps = 30/271 (11%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYV--TA 56 +K+ L+SI + T N + + S+ Q Y N+E+I++D+ ST + L+ Sbjct: 788 IKDPELVSIIITTKNLGERLDNCLTSIF-QKTTYQNYEVILIDNGSTEPQALETIEHWKN 846 Query: 57 LNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDD-EWTPNRLSVFLAHKQQLVT 115 + R+ Y ++ + + N A+ +QG+Y+ + ++ T + L + Q+ + Sbjct: 847 RENFRLKYYGLELPFNSSKINNFAVDKSQGKYLLFLKNETKILTSDWLEKMIEQAQR-PS 905 Query: 116 HAFLYANDYVCQG------------EVYSQPASLPLYPKSPYSRRLFYKRNII----GNQ 159 + A Y S Y RL Y N + Sbjct: 906 IGAVGALLLYPDNTVQHAGVICGLFGRVGHSHKHYQYGASGYFGRLVYVHNYMAVTGACL 965 Query: 160 VFTWAWRFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEE-ATQILHINHGEMQITSS 217 + + F+ +L + D D+ L+++ + + Sbjct: 966 MCRREVFEEVGGFEEKLAVSYNDIDLCLKLIDKGYRNLCLPHVVLYHY----ESKTRGYD 1021 Query: 218 PKKFSGYFHFYRKHKDKFDRASKKYQLFTLY 248 + + ++ + +KY + Y Sbjct: 1022 SRDTEKFARLMCENM-YISQKWQKYINYDPY 1051 >UniRef50_A5W7E0 Glycosyl transferase, family 2 n=1 Tax=Pseudomonas putida F1 RepID=A5W7E0_PSEP1 Length = 1509 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 65/282 (23%), Positives = 111/282 (39%), Gaps = 15/282 (5%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALND 59 K+ PLISI MP +N L AI SV Q Y NWE+ I DD ST E + D Sbjct: 968 KDQPLISIIMPVYNPPMDLLREAIDSVTAQLYKNWELCIADDASTDPEVIAYLKGLTKQD 1027 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R + + N N A+ LA G Y+ +D+DD + L + A + Sbjct: 1028 QRFKVVLREKNGHISKASNSALELASGRYVALMDNDDLLPAHALYWIARAVHENPDAAVI 1087 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKA 178 Y++ + ++ + Y K+ ++ LF N+I + V+ + F ++ Sbjct: 1088 YSD----EDKIDVEGNRSAPYFKTDWNLYLFRSHNMISHLGVYRRDLVQRVGGFREGMEG 1143 Query: 179 AQDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QDYD+ LR V + + + ++ G ++ K + Sbjct: 1144 SQDYDLALRCVEQVRPDQIVHIPHVLYHWRVHPGSTAMSLGEKPY-AQIAGQNALDQHLQ 1202 Query: 237 RASKKYQL----FTLYQIRNKRMTWRTLLT-LLSVRNGKRLA 273 R K F +Y++ + L++ ++ RN L Sbjct: 1203 RTGIKGHAELLDFGMYRVHYDLPADQPLVSLIIPTRNAVGLV 1244 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 48/283 (16%), Positives = 110/283 (38%), Gaps = 25/283 (8%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ +PT N L + I+S+ + Y ++E+I+VD+ S E LQ + + Sbjct: 1227 DQPLVSLIIPTRNAVGLVRQCIESITSKTSYQHYEIILVDNGSDDPESLQYFESLKLQQN 1286 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLY 120 I + +D A+ N+A+ A+GE + I++D + +P+ LS ++ Q + Sbjct: 1287 IRVLRDDGPFNYSALNNRAVREARGELVGLINNDIEVISPDWLSEMVSLALQ-PGAGAVG 1345 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLF--YKRNIIG--------------NQVFTWA 164 A + + + + ++ ++ G + + Sbjct: 1346 ARLWYPDNRLQHGGVIMGPLTLAGHAHKMLPRGHHGYFGRASLIQGMSAVTAACLIVRKS 1405 Query: 165 WRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 + + ELK A D D L+++ + A + H + +P+K Sbjct: 1406 IFEEVGGLNEVELKIAFNDVDFCLKVMAAGYQNIWTPNAD-LYHHESATRGLEDTPEKIE 1464 Query: 223 GYFHFYRKHKDKFDR---ASKKYQLFTLYQIRNKRMTWRTLLT 262 + R ++K+ + Y +++ + + + + Sbjct: 1465 RFKSEVRYVQEKWAKIIANDPAYNPNLSFELEDFSLAFPPRIA 1507 >UniRef50_B4VZW9 Methyltransferase domain family n=2 Tax=Cyanobacteria RepID=B4VZW9_9CYAN Length = 1265 Score = 286 bits (734), Expect = 4e-76, Method: Composition-based stats. Identities = 61/284 (21%), Positives = 108/284 (38%), Gaps = 15/284 (5%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALNDPR 61 P+IS+ MPT+N + AI+SV+ Q Y WE+ I DD ST A + R Sbjct: 732 QPVISVIMPTFNTSEHFLREAIESVINQVYPYWELCIADDASTQPHVKNIVEEYAAKETR 791 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N N A+ +A GE+I +D DD TP+ L + +Y+ Sbjct: 792 IKVVIRTTNGHISNASNSALEMATGEFIALLDHDDLLTPDALYEVALLLNRHPEADMIYS 851 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELKAAQ 180 + + ++ Y K + F R + K F + +Q Sbjct: 852 D----EDKIDETNKVSSPYFKPDWCPDSFLSRMYTCHLGIYRRELVEKIGGFRVGYEGSQ 907 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+ LR+ + + + + I+ K + Y + D R + Sbjct: 908 DYDLVLRITEKTKNIFHIPKILYHWRIHPQSAASGVEAKPY-AYKAGEKALIDALHRRGE 966 Query: 241 KYQLF------TLYQIRNKRMTWRTLLTLLSVRN-GKRLADGIR 277 + LY++R K ++ + ++ R+ G L + ++ Sbjct: 967 NGIISGLPGFPGLYRVRYKIEDYKRVSIIIPTRDLGNVLNNCLK 1010 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 85/256 (33%), Gaps = 27/256 (10%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVT--ALNDPRIT 63 +SI +PT + + +KS+ + Y N+E+I++D+ ST + +Q + R + Sbjct: 992 VSIIIPTRDLGNVLNNCLKSIFEKTAYPNYEVIVIDNGSTENDTVQIIDYWKSKEHERFS 1051 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAH-------- 109 +I + N A+ A+G+Y+ +++D D + L Sbjct: 1052 CYPLNIEFNFSKINNYAVAKAKGDYLLFLNNDIEVITQDWI--DALVEQAQRPSIGAVGG 1109 Query: 110 KQQLVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI---IGNQVFTWAWR 166 + +A + G V S Y+ +L N G + Sbjct: 1110 LLLYPDKSIQHAGVVLGIGGVASHGHKKYPSTSPGYAGQLITINNYSAITGACLMCRREV 1169 Query: 167 FK-ECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 ++ F EL A D D L+++ + + +H Sbjct: 1170 YEAVGGFAEELAIAYNDVDFCLKLLRQGYRNIYLPHVVLY---HHESKSRGRDDDSSEKL 1226 Query: 225 FHFYRKHKDKFDRASK 240 R+ + + K Sbjct: 1227 VRLGREAEYMKKKWGK 1242 >UniRef50_B9BSY2 Glycosyl transferase, family 2 n=4 Tax=Proteobacteria RepID=B9BSY2_9BURK Length = 632 Score = 284 bits (728), Expect = 2e-75, Method: Composition-based stats. Identities = 64/283 (22%), Positives = 113/283 (39%), Gaps = 17/283 (6%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTA-LNDPR 61 PLISI +PT+N ++ I+SVL Q Y +WE+ I DD S+S A D R Sbjct: 89 RPLISIIVPTYNSDERFLREMIESVLAQVYPHWELCIADDASSSESVRAVLEAARARDDR 148 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I ++ +N N A+ +A GEYI +D DD P+ L + + + + LY+ Sbjct: 149 IKVVYRPVNGHISEASNSALEIATGEYIALLDHDDVIPPHALFMVVKYLNKHPEARMLYS 208 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQ 180 + + ++ + Y KS ++ +LF +N+ + + F + +Q Sbjct: 209 D----EDKLSADGERTSPYFKSDWNPQLFLAQNMFSHLGVYETALVRAAGGFRKGFEGSQ 264 Query: 181 DYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 DYD+ LR V G + I G + S K + R ++ DR Sbjct: 265 DYDLALRCVEIAGHNAVVHIPHVLYHWRILPGSTASSGSEKPY-ALLAAIRALEEHLDRT 323 Query: 239 SKKYQLFT------LYQIRNKRMTWRTL-LTLLSVRNGKRLAD 274 + + +++ ++ R+G + Sbjct: 324 HTRAIVEHPCDQHSTLRVKYALPRPAPKTSIIIPTRDGLSILK 366 Score = 107 bits (269), Expect = 4e-22, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 85/234 (36%), Gaps = 24/234 (10%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P SI +PT + + + I S+ + Y N+E+I+VD+ S + E L + + D R+ Sbjct: 350 PKTSIIIPTRDGLSILKQCIDSIFAKTIYPNYEIIVVDNGSVNPETLAYFDSLRTDSRVR 409 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + +D A+ N+A+ +A G+Y+ +++D + +P+ L + + + A Sbjct: 410 ILRDDSPFNFSALNNRAVEIATGDYVCLLNNDIEIISPDWLDELVG-IASQPGNGAVGAA 468 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI----------------IGNQVFTWAWR 166 + + + L + + L + V + A Sbjct: 469 LWYPNNTLQHGGVVIGLGGVAGHMHTLLPRGQYGYFCRAVAMQNLSAVTAACLVVSKAIY 528 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSSP 218 + + EL A D D LR+ A + + + Sbjct: 529 REVGGLNEELAVAFNDVDFCLRVREAGYRNVWTPYAELYH---HESATRGSDMD 579 >UniRef50_UPI0001C3906A glycosyl transferase family protein n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C3906A Length = 847 Score = 282 bits (723), Expect = 8e-75, Method: Composition-based stats. Identities = 64/285 (22%), Positives = 110/285 (38%), Gaps = 16/285 (5%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPR 61 PLIS+ MP +N ++ +AI+SV Q Y WE+ I DD S+ A D R Sbjct: 181 KPLISVIMPVYNTPEKFLKKAIESVTNQVYPYWELCIADDNSSEPWIKPLLKEYARKDNR 240 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I N CA N A+ LA GEYI +D DD TP L + +Y+ Sbjct: 241 IKVAFRTENGHICAASNSALELATGEYIALLDHDDVITPEALYEVALVLNENPDADMIYS 300 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 + + ++ + Y K + F R + + F + +Q Sbjct: 301 D----EDKLDQYGQRVFPYFKPDWCPDSFLARMYTCHLGVYRRELVNQVGNFRIGYEGSQ 356 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+ LR+ + + + + + + G + K + Y + + F R + Sbjct: 357 DYDLVLRVTEKTNKIFHIPKILYSWRQHSGSTAYDPNSKLY-AYIAAEKALNESFARCQE 415 Query: 241 KYQLFTL-------YQIRNKRMTWRTLLTLLSVRN-GKRLADGIR 277 K ++ + Y +R + + ++ RN G L ++ Sbjct: 416 KARIVSDHENFPGQYTVRYHIEEHKLVSIIIPTRNLGTTLNHCLQ 460 Score = 103 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 47/300 (15%), Positives = 96/300 (32%), Gaps = 31/300 (10%) Query: 6 LISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSW-EQLQQYVTALNDP-RI 62 L+SI +PT N ++S+ + Y N+E++++D+ S +P R Sbjct: 441 LVSIIIPTRNLGTTLNHCLQSIFEKSTYPNYEVVVIDNGSDEQETFDIINAWKQKEPQRF 500 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLV----THA 117 + D+ + N + A+G+Y+ +++D + TP+ + + Q+ Sbjct: 501 SCYEFDVPFNYPQINNYGVTKARGDYLLFLNNDTEVLTPDWIEAMVEQAQRSSIGAVGAL 560 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKS-------PYSRRLFYKRNII----GNQVFTWAWR 166 LY +D + V + + Y +L N + Sbjct: 561 LLYPDDTIQHAGVVLGIGGVAGHSHKGYQAHHLGYYNQLVMVNNYSAVTGACLMCRREVF 620 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEE-ATQIL----HINHG----EMQITS 216 + D EL+ A D D L++V + E + Sbjct: 621 DQVGGLDEELEVAFNDVDFCLKLVSRGYRNIYLPHVVLYHYESKSRGYEDTPEKEARFKR 680 Query: 217 SPKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 K + K+ ++ + + Y IR + + +S L I Sbjct: 681 ESKIMKQRWRDLLKNDPCYNPNLSR--IRQDYGIRLSNDENQVVAISMSEDREAFLGCAI 738 >UniRef50_A0LST8 Glycosyl transferase, family 2 n=1 Tax=Acidothermus cellulolyticus 11B RepID=A0LST8_ACIC1 Length = 734 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. Identities = 57/285 (20%), Positives = 107/285 (37%), Gaps = 15/285 (5%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALNDPR 61 P IS+ +P N ++ R I SVL Q Y W++ + DD S++ A D R Sbjct: 209 TPTISVLVPVHNAAERWLRRCIDSVLAQSYPYWQLCLADDASSAPHVRQVLEEYAARDAR 268 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 IT I +N A N A+ AQG++I +D DDE P+ L A + +Y+ Sbjct: 269 ITAIFRPVNGHISAASNSALDAAQGDFIALLDHDDELAPDALLHVAALLNEHPDADMIYS 328 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 + + ++ + K +S L + + F + +Q Sbjct: 329 D----EDKINEDGVRQEPFFKPDWSPELLLSHMYTCHLGVYRTELVRRIGGFRLGFEGSQ 384 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+ LR+ + + + + G ++ K + Y +R ++ R Sbjct: 385 DYDLVLRLTEQTDRIYHIPRVLYHWRRHPGSAAASADAKPY-AYAAAHRAIREALHRRGA 443 Query: 241 KYQLFTL------YQIRNKRMTWRTLLTLLSVRNGKRL-ADGIRG 278 + + +++ + +++ N + D IR Sbjct: 444 VATVEEITEARGNFRVHYAVSNQPLISIVVAAPNTGVIDPDCIRA 488 Score = 82.8 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 58/225 (25%), Gaps = 21/225 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVT--ALN 58 N PLISI + N + I+++ + Y N E+I+V D + Sbjct: 464 SNQPLISIVVAAPNTGVIDPDCIRALFDRTAYPNIEVIVVHDGRNDEQLRDLLTFWHTQE 523 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD---------DEWTPNRLSVFLAH 109 R + + N A G+ + + D DE L + Sbjct: 524 PKRFRVESLRTPARRSRLMNHGADCASGDLLVFLHGDVAVGTADWLDEMAGYALREEIGA 583 Query: 110 ---KQQLVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQVFT 162 +A + P Y RL N+ + Sbjct: 584 VGPVLLRPDGTIDHAGFVLGIAGGIGHPFRGFPAHHPGYFNRLLIPSNVAAVSSTCLMVR 643 Query: 163 WAWRFKECLFDTELKAAQ-DYDIFLRMVVEYGEPWKVEEA-TQIL 205 FD EL A D D LR+ + Sbjct: 644 CQAFHAAGGFDDELSGAIGDADFCLRLRQRGHRTVLLPYVRLCHY 688 >UniRef50_C2DBA9 Family 2 glycosyl transferase n=22 Tax=Enterococcus faecalis RepID=C2DBA9_ENTFA Length = 715 Score = 280 bits (718), Expect = 3e-74, Method: Composition-based stats. Identities = 55/278 (19%), Positives = 104/278 (37%), Gaps = 15/278 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPR 61 P ISI MP +N + + I S+L Q Y+NWE+ + DD ST + D R Sbjct: 181 QPKISIAMPVYNVEEKWLRLCIDSILNQVYTNWELCMADDASTDPNVKKILTEYQQLDER 240 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + + N N A+ +A GE++ +D+DDE N + + +Y+ Sbjct: 241 IRVVFREQNGHISEATNSALAIATGEFVALLDNDDELAINAFYEVVKVLNENPELDLIYS 300 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAAQ 180 + + ++ K +S L N I + V+ + + F + +Q Sbjct: 301 D----EDKIDMDGNRSDPAFKPDWSPDLLLGTNYISHLGVYRRSILEEIGGFRKGYEGSQ 356 Query: 181 DYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 DYD+ LR + + + + + + K + + R +D R Sbjct: 357 DYDLVLRFTEKTTKERIKHIPKVLYYWRMLPTSTAVDQGSKGY-AFEAGLRAVQDALVRR 415 Query: 239 SKKYQLFT-----LYQIRNKRMTWRTLLTLLSVRNGKR 271 LY + + + + ++ +NG + Sbjct: 416 GINGHATHGAANGLYDVYYDIKSDKLVSIIIPTKNGYK 453 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 82/220 (37%), Gaps = 21/220 (9%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALND--PRI 62 L+SI +PT N + R + S++ + Y N+E+I+ D+ ST + + Y R Sbjct: 441 LVSIIIPTKNGYKDVQRCVSSIIEKTTYQNYEIIMADNGSTDPKMHELYAEFEQQLPGRF 500 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQ--------- 112 DI + N+A+ A GEY+ +++D + T N L++ ++ QQ Sbjct: 501 FVESIDIPFNFSTINNRAVKKAHGEYLLFLNNDTEVITENWLTLMVSFAQQERIGCVGAK 560 Query: 113 --LVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI----IGNQVFTWAWR 166 + +A + G V + Y RL N + A Sbjct: 561 LLYPNNTVQHAGVILGLGGVAGHGHYGYPHGDLGYFGRLAINVNYSAVTAACLLMKKADF 620 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQI 204 F+ A D D+ L++ + + EA Sbjct: 621 DAVGGFEEAFTVAFNDVDLCLKVQALGRDNVWLHEAELYH 660 >UniRef50_C7QLJ5 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=C7QLJ5_CYAP0 Length = 1152 Score = 280 bits (716), Expect = 5e-74, Method: Composition-based stats. Identities = 59/287 (20%), Positives = 110/287 (38%), Gaps = 15/287 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALN 58 ++ PLISI MP +N + AI SVL Q Y NW++ I DD ST+ + A Sbjct: 616 LEYQPLISIIMPVFNPKIAYLKTAINSVLNQVYQNWQLCIADDASTNPQVYEILADYAAE 675 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + N N A +A GE+I +D DD TP+ L ++ Sbjct: 676 DTRIKVVFRQENGHIAEASNSAFEIAMGEFIALLDHDDVLTPHALYHVVSMLNDHADADM 735 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL-FDTELK 177 +Y++ + ++ Q + K + F + + ++ F + Sbjct: 736 IYSD----EDKIDEQGYLSDPFFKPDWCPDSFLSKMYTCHLGVYRRSLVEQIGAFRVGYE 791 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 +QDYD+ LR+ + + + + I+ K + + + +R Sbjct: 792 GSQDYDLVLRLTEKTDKIFHIPNVLYHWRIHAQSTSTNIDSKNY-AVITAKKALSEAIER 850 Query: 238 ASKKYQLFTL------YQIRNKRMTWRTLLTLLSVRN-GKRLADGIR 277 + + + Y IR + T + ++ ++ G L ++ Sbjct: 851 RGEPGTVTDVPYCLGNYHIRYELKTDDLVSIIIPTKDLGDTLNQCLK 897 Score = 113 bits (283), Expect = 8e-24, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 80/238 (33%), Gaps = 24/238 (10%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSW-EQLQQYVTALNDP-RI 62 L+SI +PT + + +KS+ Q Y N+E+I++D+ ST +P ++ Sbjct: 878 LVSIIIPTKDLGDTLNQCLKSIFEQSTYPNFEIILIDNGSTEERSLEVMKQWQEKEPEKL 937 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N A+ +QG+Y+ +++D + TP+ + + Q+ + + A Sbjct: 938 KVFPLKIPFNYSQINNFAVQHSQGKYLLFLNNDIEVITPDWIEALVEQAQR-PSIGAVGA 996 Query: 122 NDYVCQGEVYSQPA------------SLPLYPKSPYSRRLFYKRNII----GNQVFTWAW 165 + + Y +L N + Sbjct: 997 LLLFPDDTIQHAGVIGGIFYSCGHSHKRFPFRSPGYFNQLNTITNYSAVTGACLMCRRDV 1056 Query: 166 RFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEE-ATQIL-HINHGEMQITSSPKK 220 + FD L D D+ +M+ + + + G + + K Sbjct: 1057 FEEIGGFDETLAVNYNDIDLCFKMINKGYRNIYLPHVVLYHYESKSRGYDSLNNFKKA 1114 >UniRef50_B2T0J8 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B2T0J8_BURPP Length = 723 Score = 279 bits (714), Expect = 8e-74, Method: Composition-based stats. Identities = 64/283 (22%), Positives = 113/283 (39%), Gaps = 17/283 (6%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALNDPR 61 PLIS+ +PT+N ++L + SV Q Y +WE+ I DD ST A D R Sbjct: 180 KPLISVVVPTFNSDERLLREMVASVQAQIYPHWELCIADDASTQPHVKAVLEEVAAADSR 239 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + + N N A+ LA GEY+ +D DD P+ L + + Y+ Sbjct: 240 IKVVFRETNGHISEASNSALALATGEYVALLDHDDLLPPHALYMVARYINLHPHGRMFYS 299 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAAQ 180 + + ++ ++ Y K ++ ++F +N+ + VF F + +Q Sbjct: 300 D----EDKLTTEGKRTTPYFKCDWNPQMFLTQNMFSHLGVFETKLVRDAGGFRKGFEGSQ 355 Query: 181 DYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 DYD+ LR V G + I G + S K + R +D +RA Sbjct: 356 DYDLALRCVELAGDDSVIHIPHVLYHWRIVPGSTAGSGSEKPY-ALVAAIRALEDHLERA 414 Query: 239 SKKYQLFT------LYQIRNKRMTWRT-LLTLLSVRNGKRLAD 274 + + + ++R + + + ++ R+G L Sbjct: 415 NISATVEHPVESLGVLRVRYTLPSPQPKVSIIIPTRDGLALLK 457 Score = 112 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 96/253 (37%), Gaps = 23/253 (9%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P +SI +PT + L + + SV Y N+E+IIVD+ S E ++ + P + Sbjct: 440 QPKVSIIIPTRDGLALLKQCVDSVFAYTLYQNFEIIIVDNGSVKPETMKYFDEVSQRPNV 499 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYA 121 + ++ A+ N A +A GE++ +++D + +P+ L+ ++ L + A Sbjct: 500 RVLRDESPFNFSALNNHAARVATGEFLCLLNNDIEVISPDWLNEMVS-LANLPRAGAVGA 558 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI----------------IGNQVFTWAW 165 + + L L + + + + + V + Sbjct: 559 CLWYPTDALQHGGVVLGLGGIAGHMHHMMKRGHFGYFGRAVATQNLSAVTAACLVVKKSV 618 Query: 166 RFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITS-SPKKFSG 223 + + +L A D D ++++ + ++ + + + P K+ Sbjct: 619 YDEVGGLEEDLAVAFNDVDFCMKLLKAGYR--NIWTPYAEMYHHESATRGSDLDPDKYER 676 Query: 224 YFHFYRKHKDKFD 236 + R+ + +++ Sbjct: 677 FVSEIRRMEARWE 689 >UniRef50_B1ZZR1 Glycosyl transferase family 2 n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZZR1_OPITP Length = 1275 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. Identities = 62/284 (21%), Positives = 110/284 (38%), Gaps = 15/284 (5%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALN 58 + PLIS+ MP +N ++ ++AI+SV Q Y NWE+ I DD S++ A Sbjct: 737 LAQRPLISVIMPVYNTPEKWLVKAIESVRAQIYPNWELCIADDASSASHVRPLLEDFARR 796 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + D N A N A+ LA+GE+ +D DDE + L + Sbjct: 797 DERIKLVFRDQNGHISAASNSALDLARGEFAALLDHDDELARDALYEVVQCLAAHPDADL 856 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 +Y++ + ++ Y K + LF +N + A ++ F + Sbjct: 857 IYSD----EDKIDEAGRRFDPYFKPDFLPDLFTAQNFTSHLSVYRASLIRQAGGFRIGYE 912 Query: 178 AAQDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QD+D+ LR + V + G + K + + D F Sbjct: 913 GSQDWDLALRAIDLTERSRIQHVPKVLYHWRAIPGSTALALGEKNYP-MLAARKALADHF 971 Query: 236 DRASKKYQLFT----LYQIRNKRMTWRTLLT-LLSVRNGKRLAD 274 R + +L ++++ L++ ++ RN L Sbjct: 972 ARRGETVELLRQVGDYWRVKRPVPANPPLVSLIIPTRNRADLLS 1015 Score = 94.0 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 44/206 (21%), Positives = 75/206 (36%), Gaps = 22/206 (10%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N PL+S+ +PT NR L R + S+L + Y +E+I+VD+ S Y+ L Sbjct: 997 NPPLVSLIIPTRNRADLLSRCVGSILAKTSYPRFEIIVVDNGS-DEPGTLAYLDQLRSGG 1055 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLY 120 T + D A+ N A A+GE + +++D + P+ L ++H + + Sbjct: 1056 TTVLRYDEPFNFSAINNFAAAQAEGEILGLLNNDLEVINPDWLDEMVSHAVR-PEIGCVG 1114 Query: 121 ANDYVCQGEVYSQP------------ASLPLYPKSPYSRRLFYKRNI----IGNQVFTWA 164 A Y + Y R +N + Sbjct: 1115 AMLYYPDDTIQHAGVLLGLGGVANHAYYHAPRGTCHYFNRAHLLQNYSAVTAACLLIRRK 1174 Query: 165 WRFKECLFD-TELKAA-QDYDIFLRM 188 + + T+L A D D LR+ Sbjct: 1175 VFAQVGGLNATDLAIAFNDVDFCLRV 1200 >UniRef50_B2JFA3 Glycosyl transferase family 2 n=10 Tax=Proteobacteria RepID=B2JFA3_BURP8 Length = 632 Score = 276 bits (707), Expect = 5e-73, Method: Composition-based stats. Identities = 67/283 (23%), Positives = 109/283 (38%), Gaps = 14/283 (4%) Query: 1 MKNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALN 58 M P IS+ MPT+N AI SV Q Y +WE+ I DD STS E Sbjct: 78 MAEPPRISVVMPTYNADPVWLAEAIDSVRNQIYPHWELCIADDASTSPEVRPLLEQYIRE 137 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + N A N A+ L +++ +D DD P+ L + Sbjct: 138 DSRIRVAFREKNGHISAASNSALELVTSDWVGLLDHDDLLAPHALYFVAKEIIRRPDARL 197 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELK 177 LY++ + ++ Y K + LFY +N+I + V+ + F T L+ Sbjct: 198 LYSD----EDKIDPMGRRHDPYFKCDMNIDLFYSQNMISHFGVYQKRLLDEIGGFRTGLE 253 Query: 178 AAQDYDIFLRMVVE--YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QD+D+ LR + + ++ + K + R + F Sbjct: 254 GSQDHDVALRCLERIGAKSIVHIPRVLYHWRVHPSSTAASGDAKPY-ALIAGERALTEHF 312 Query: 236 DRASKKYQLFTL---YQIRNKRMTWRTLLT-LLSVRNGKRLAD 274 R + L Y++R + R L++ ++ RNG L Sbjct: 313 ARQGIDASVEGLPFGYRVRYRLPEHRPLVSLIIPTRNGVGLLK 355 Score = 93.6 bits (232), Expect = 7e-18, Method: Composition-based stats. Identities = 46/223 (20%), Positives = 83/223 (37%), Gaps = 31/223 (13%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALND- 59 ++ PL+S+ +PT N L + I S+ + Y+ +E+I++D+ S + Y+ +L Sbjct: 336 EHRPLVSLIIPTRNGVGLLKQCIDSIQDKTTYAPYEIIVIDNGS-DEQATLDYLESLKSA 394 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLV 114 I I +D A+ N AI A GE + I++D D W L ++ Q Sbjct: 395 DNIRVIRDDRPFNYSALNNMAIGSANGELVGLINNDIEVITDTW----LEEMVSLALQ-P 449 Query: 115 THAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLF-------YKRNII---------GN 158 + A G + L + ++ RLF + RN + Sbjct: 450 GVGAVGAKLLYPDGTIQHAGVVTGLGGVAGHAHRLFPRDAFGYFSRNAVISSFSAVTAAC 509 Query: 159 QVFTWAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVE 199 + + + + +L A D D LR+ Sbjct: 510 LIIRKSIYEQVGGLNEADLTVAFNDVDFCLRVRDAGYRNVWTP 552 >UniRef50_C2H8B4 Family 2 glycosyltransferase n=13 Tax=Enterococcus RepID=C2H8B4_ENTFC Length = 712 Score = 275 bits (705), Expect = 9e-73, Method: Composition-based stats. Identities = 61/276 (22%), Positives = 102/276 (36%), Gaps = 15/276 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALNDPR 61 PLISI MP +N + + + I SVL Q Y +WE+ I DD ST + D R Sbjct: 177 RPLISILMPVYNVEIKWLEKCIDSVLDQTYDHWELCISDDASTDPAIRKCLESYQAKDDR 236 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N N A+ +A+GE+I +D+DDE P L +Y+ Sbjct: 237 IKVVFRKENGHISLATNSALEIAEGEFIALLDNDDELPPFALYEVAKVLNVHPELDVIYS 296 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQ 180 ++ + + K+ +S N I + KE F + +Q Sbjct: 297 DEDKIDAD----GNRFDPHFKADWSPDTLMGNNYISHLGVYRTSIVKELGGFRKGYEGSQ 352 Query: 181 DYDIFLRMVVE--YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 DYD+ LR+ + + ++ G K + Y + D R Sbjct: 353 DYDLVLRVTEQIPADHIYHIDRVLYHWRTIPGSTASNGEAKSYI-YDSGVKALTDALSRR 411 Query: 239 SKKYQLF-----TLYQIRNKRMTWRTLLTLLSVRNG 269 + K + Y+I + + ++ +NG Sbjct: 412 NIKGSVRPGRISGFYEIAYDVLQEDLVSVIIPTKNG 447 Score = 90.1 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 53/257 (20%), Positives = 94/257 (36%), Gaps = 35/257 (13%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQ---QYVTALNDPR 61 L+S+ +PT N + + S++ + YSN+E+II D+ ST + + +Y L D Sbjct: 437 LVSVIIPTKNGYEDLKTCVDSIIEKTSYSNYEIIIADNGSTDPKMQELFAEYKHQLKDRF 496 Query: 62 IT-YIHNDINSGACAVR--NQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHA 117 I I N R N A A G+Y +++D + P+ ++V +++ Q Sbjct: 497 IVELIDIPFN----YSRINNLAAEKANGKYFLFLNNDTEVIEPDWMTVMVSYAQ-FDRIG 551 Query: 118 FLYANDYVCQGEVYSQPA------------SLPLYPKSPYSRRLFYKRNII---GNQVFT 162 + A + + Y RL N + + Sbjct: 552 CVGAKLFYPDDTTQHAGVLLGIGGVAGHALNNYDRTHCGYFGRLVIDVNYLAVTAACMMV 611 Query: 163 WAWRFK-ECLFDTELKAA-QDYDIFLRMVV--EYGEPWKVEEATQILHINHGEMQITSSP 218 A F FD L+ A D D+ L++ Y E + H +P Sbjct: 612 KAADFNAVNGFDETLEVAFNDVDLCLKVYELGRYNVYAHQAE---LYHFESKSRGYEDTP 668 Query: 219 KKFSGYFHFYRKHKDKF 235 +K + +K +DK+ Sbjct: 669 EKQRRFAGEIKKMQDKW 685 >UniRef50_D0HB89 Glycosyl transferase group 2 family protein n=2 Tax=Vibrio RepID=D0HB89_VIBMI Length = 1073 Score = 275 bits (704), Expect = 1e-72, Method: Composition-based stats. Identities = 57/286 (19%), Positives = 102/286 (35%), Gaps = 15/286 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALN 58 +KN P IS+ +P +N AI+SV Q Y WE+ I DD S + E + Sbjct: 532 LKNKPKISVLIPVYNAPVDYLREAIESVCTQLYPEWELCIADDASPNQEVRDLLQEYSQK 591 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + + N N A+ +A G Y+ +D DD + L A Sbjct: 592 DIRIKVVEREQNGHISEATNSALDVATGAYVALMDHDDILPNDALYWVAETILANPDVAL 651 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELK 177 +Y+++ + + KS ++ L +N I + + F + Sbjct: 652 IYSDEDKITADGQT---RYDPNFKSQWNPELLLSQNCISHLGVYRTDLAKEIGGFRKGFE 708 Query: 178 AAQDYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 AQD+D LR + + G I K + F + ++ Sbjct: 709 GAQDWDFALRFSEKVKPEQIIHIPRILYHWRAIEGSTAIDGDEKPY-ALFAGLKAVREHC 767 Query: 236 DRASKKYQL-----FTLYQIRNKRMTWRTLLT-LLSVRNGKRLADG 275 R S ++ +++ L++ ++ RNG+ + Sbjct: 768 KRMSINAEVVEHPERHYARVKYNIPKPMPLVSMIIPTRNGQEVLSV 813 Score = 97.4 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 47/224 (20%), Positives = 81/224 (36%), Gaps = 20/224 (8%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSW-EQLQQYVTALND 59 K PL+S+ +PT N Q++ I S+L + Y N+E+IIVD+ S Sbjct: 793 KPMPLVSMIIPTRNGQEVLSVCIDSILGKTTYPNYEIIIVDNGSDCPETLAYLDKLEQKH 852 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAF 118 P + + ++ A+ N+A +A+GE + +++D + TP+ L+ + H Q A Sbjct: 853 PNVVVMRDESPFNYSALNNKAAAIAKGEVLALVNNDVEVITPDWLTEMVGHVIQAQNGAV 912 Query: 119 ----------LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI----IGNQVFTWA 164 L + G V Y+ R +N V A Sbjct: 913 GARLWYPDNTLQHGGVIFVGGVAGHAHKHLPKGMPGYACRAIVAQNYSAVTAACLVVRKA 972 Query: 165 WRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQIL 205 + + +L A D D L++ A Sbjct: 973 VFEQVGGLNETDLTVAFNDIDFCLKVQEAGYFNVWTPYAELYHY 1016 >UniRef50_B5WL01 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B5WL01_9BURK Length = 731 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 66/284 (23%), Positives = 106/284 (37%), Gaps = 17/284 (5%) Query: 3 NNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDP 60 PLISI MPT+N + AI SV+ Q Y NWE I DDCST+ E + DP Sbjct: 189 QQPLISIVMPTYNTPAKWLRMAIDSVIDQVYENWEFCIADDCSTNPEVKEVLDSYVAKDP 248 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N N A+ LA GE++ +D DDE P L +Y Sbjct: 249 RIKVAYRTTNGHISNSSNTALELAVGEFVGLLDHDDELHPLALYCVAELINAHPDATVIY 308 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 ++ + ++ Y K ++ LF +N+I + K F T + + Sbjct: 309 SD----EDKISIDGERSDPYFKCDFNYDLFLSQNMISHFGVYRTSIMKEVGGFRTGFEGS 364 Query: 180 QDYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QDYD+ LR++ G + V A + K + + R + R Sbjct: 365 QDYDLALRVIDRAGHHTVYHVPRALYHWRMIPESTAAGHEAKPY-AHIAAMRALDEHLAR 423 Query: 238 ASKKYQLFT------LYQIRNKRMT-WRTLLTLLSVRNGKRLAD 274 + ++ ++ ++ R+ L + Sbjct: 424 NNINAHTEHAPGTDAFNKVVYDLPAVLPSVEIIIPTRDSAGLVE 467 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 72/231 (31%), Gaps = 27/231 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P + I +PT + L + ++SV + Y N+ + I+D+ S E + + D RI Sbjct: 451 PSVEIIIPTRDSAGLVEQCVESVRRKSSYPNFRITIIDNGSVKQETHELFARLQEDERIK 510 Query: 64 YIHNDINSGACAVRNQAIMLA-QGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYA 121 + +D N + LA +++ +++D + + L ++ Q V + A Sbjct: 511 VVRDDSPFNYS-ALNNRVALASTADFVCLMNNDIEVINADWLEEMVSVALQ-VNVGAVGA 568 Query: 122 NDYVCQGEVYSQ-------------PASLPLYPKSPYSRRLFYKRNI---IGNQVFTWAW 165 + P ++R + + Sbjct: 569 KLLYPDDTIQHGGVVLGVGGIASHAHKHFPNTHPGYFARARLRNVMCAVTAACLLIRQSI 628 Query: 166 RFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQI 214 + D +L A D D LR+ A +H Sbjct: 629 YKEVGGLDEKLHVAYNDIDFCLRVRQAGYRNVWTPYAELY----HHESASR 675 >UniRef50_Q7BG51 WsaE n=2 Tax=Bacillaceae RepID=Q7BG51_BACST Length = 1127 Score = 274 bits (702), Expect = 2e-72, Method: Composition-based stats. Identities = 51/286 (17%), Positives = 107/286 (37%), Gaps = 16/286 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCST-SWEQLQQYVTALN 58 +K P S+ +P +N + + + I SVL Q Y WE+ IVDD S+ + + + Sbjct: 597 LKYKPKFSVILPVYNVEEKWLRKCIDSVLNQWYPYWELCIVDDNSSKDYIKPVLEEYSNR 656 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + N N A+ +A G++I +D DDE P L + Sbjct: 657 DSRIKTVFRSNNGHISEASNTALEIATGDFIALLDHDDELAPEALYENAVLLNEHPDADM 716 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 +Y++ + ++ K +S + IG+ ++ F + Sbjct: 717 IYSD----EDKITKDGKRHSPLFKPDWSPDTLRSQMYIGHLTVYRTNLVRQLGGFRKGFE 772 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 +QDYD+ LR+ + + + + + S K + + + + +R Sbjct: 773 GSQDYDLALRVAEKTNNIYHIPKILYSWREIETSTAVNPSSKPY-AHEAGLKALNEHLER 831 Query: 238 -------ASKKYQLFTLYQIRNKRMT-WRTLLTLLSVRNGKRLADG 275 +++ + +Y +R + + ++ ++ L Sbjct: 832 VFGKGKAWAEETEYLFVYDVRYAIPEDYPLVSIIIPTKDNIELLSS 877 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 47/266 (17%), Positives = 99/266 (37%), Gaps = 29/266 (10%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 ++ PL+SI +PT + +L I+S+L + Y N+E++I+++ S E + + Sbjct: 857 EDYPLVSIIIPTKDNIELLSSCIQSILDKTTYPNYEILIMNNNSVMEETYSWFDKQKENS 916 Query: 61 RITYIH--NDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFL-AHKQQLVTH 116 +I I + N + N I A GE +++D + + L + ++ V Sbjct: 917 KIRIIDAMYEFNW--SKLNNHGIREANGEVFVFLNNDTIVISEDWLQRLVEKALREDVGT 974 Query: 117 A---FLYANDYVCQGEVYSQ---------PASLPLYPKSPYSRRLFYKRNIIG---NQVF 161 LY ++ + V P++ SP+ + + Sbjct: 975 VGGLLLYEDNTIQHAGVVIGMGGWADHVYKGMHPVHNTSPFISPVINRNVSASTGACLAI 1034 Query: 162 TWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 K F+ E D +I LR + + + L+ + + + P++ Sbjct: 1035 AKKVIEKIGGFNEEFIICGSDVEISLRALKMGY--VNIYDPYVRLYHLESKTRDSFIPER 1092 Query: 221 F----SGYFHFYRKHKDKFDRASKKY 242 + Y+ YR+ D + + Y Sbjct: 1093 DFELSAKYYSPYREIGDPYYNQNLSY 1118 >UniRef50_C6XS42 Glycosyl transferase family 2 n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XS42_HIRBI Length = 753 Score = 274 bits (701), Expect = 3e-72, Method: Composition-based stats. Identities = 65/282 (23%), Positives = 111/282 (39%), Gaps = 12/282 (4%) Query: 1 MKNNPLISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE-QLQQYVTALN 58 + P+ISI P +N +L AI+SVL Q YS+WE+ + +DCST E A Sbjct: 209 LSQKPVISIITPVYNPAPELLRAAIESVLSQYYSHWELCLANDCSTDPEIARIIDEYAEQ 268 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI +H N A N A+ LA GE+I +D DDE + L + + Sbjct: 269 DSRIKRVHRTQNGHISAASNSALELASGEFIAFLDHDDELSATALYHVAEAINKNPSCRL 328 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELK 177 Y++ + ++ Y KS ++ L N+ + A + ++ Sbjct: 329 FYSD----EDKIDLDGKRHDPYFKSDWNHDLLLSHNLFTHLSVYDKALIEEVGGLRSKYD 384 Query: 178 AAQDYDIFLRMVV--EYGEPWKVEEATQILHINHGEMQITSSPKKFSGY--FHFYRKHKD 233 AQDYD+ LR E + + G ++S K ++ H Sbjct: 385 GAQDYDLALRCSARLRADEICHIPFILYHWRVMPGSTALSSDEKPYAMLAGERALNDHLI 444 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRT-LLTLLSVRNGKRLAD 274 + +K + +++ T + ++ RN ++L Sbjct: 445 ALNIKAKAELIGIGFKVSYDIPTPAPQVSIIIPTRNSQKLVK 486 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 77/235 (32%), Gaps = 34/235 (14%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +SI +PT N Q+L + + S+ + Y ++E+I+VD+ S E ++ + +I Sbjct: 470 PQVSIIIPTRNSQKLVKQCVDSIYNKTSYPDFEIILVDNGSDDPEAIKYFQDLEAKNQIK 529 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLVTHAF 118 I + A+ N A+ + + +++D D W L ++ Q Sbjct: 530 LISDPRPFNYSALNNLAVAQSNAPVLCLLNNDIEVISDNW----LEEMVSLVLQ-PNVGA 584 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN--IIG--------------NQVFT 162 + A Y + + L + + + +G V + Sbjct: 585 VGAMLYYPDDTIQHAGVVMGLGGLAAHIHGGLERGTPGYVGRAALRQSLSAVTGACMVVS 644 Query: 163 WAWRFKECLFDTE--LKAAQDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQI 214 K D E A D D L++ A +H Sbjct: 645 RDNYEKVSGLDEENLAVAYNDIDFCLKLQAIGKRNIWTPHAELY----HHESASR 695 >UniRef50_B4U6C1 Glycosyl transferase family 2 n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U6C1_HYDS0 Length = 746 Score = 273 bits (699), Expect = 4e-72, Method: Composition-based stats. Identities = 67/278 (24%), Positives = 104/278 (37%), Gaps = 12/278 (4%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALND 59 K P ISI +P WN + I I+SVL Q YSNWE+ IVD S + L D Sbjct: 212 KYEPKISIVVPVWNTPKKFLIDMIESVLNQTYSNWELCIVDGNSKEKHVKETLEHYTLKD 271 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI + N G N+AI LA GEYI +D DD P L + + F+ Sbjct: 272 KRIKVKYLKENKGIAGNSNEAIALATGEYIAFLDHDDVLAPFALYEVVRAINENEDVDFI 331 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 Y+++ + + K +S N I + + F Sbjct: 332 YSDEDKITEDGLK---RFDPFFKPDFSPDTLRSYNYITHLSVVKKELLNEVGWFREGYDG 388 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QDYD+ LR + + + + IN + K + Y + +D DR Sbjct: 389 SQDYDLILRCTEKAKKIVHIPKILYNWRINDNSVAQDPKNKMY-AYDAAKKALQDHLDRV 447 Query: 239 SKKYQLFT-----LYQIRNKRMTWRTLLTLLSVRNGKR 271 K ++ Y+I + ++ ++ K Sbjct: 448 GLKGKVRDGVFLGSYKIDYDISYHHKVSIIIPNKDHKE 485 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 45/277 (16%), Positives = 87/277 (31%), Gaps = 24/277 (8%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVT-ALNDPRIT 63 +SI +P + ++ + I S++ Y N+E+IIV++ S + + Y I Sbjct: 473 KVSIIIPNKDHKEDLEKCITSIINKSTYKNYEIIIVENNSKEKKTFEYYKYLQNKYNNIV 532 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + AV N A A G+ + +++D + N + L + Q+ + A Sbjct: 533 LLEWKDKFNYSAVNNFASKYANGDILLFLNNDTEVINENWIEEMLMYAQR-KDVGAVGAK 591 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLF----------------YKRNIIGNQVFTWAWR 166 Y + L + K +S R F + Sbjct: 592 LYYPDDTIQHGGVILGIGGKVGHSHRFFPRVSYGNVGRLVVVQNLSAVTGACLMMRKDIF 651 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSSPKKFSGY 224 + FD A D DI L++ + A +P+K + Sbjct: 652 NEVEGFDERYPLALSDIDICLKVREKGYLVVWTPYAELYHY--ESKSRGYEDTPEKQERF 709 Query: 225 FHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLL 261 K K+ +K + + R + + Sbjct: 710 KKEIELFKKKWGHILEKGDPYYNPNLTLDREDFSIKI 746 >UniRef50_UPI000174589B glycosyl transferase, family 2 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174589B Length = 717 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 60/285 (21%), Positives = 109/285 (38%), Gaps = 15/285 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALN 58 + + PLIS+ +P +N + R I++VL Q Y WE+ I DD S + + A Sbjct: 151 LTDGPLISVLLPVYNTPLRWLKRVIETVLGQAYPKWELCIADDASPDPQVRKTIESYAAG 210 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 DPRI + N A N A+ LA G ++ +D DDE P+ L A Sbjct: 211 DPRIKVVFRPSNGHIVAATNSALELATGSFVALLDHDDELRPHALLEMAREIMGNPEAAL 270 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 +Y++ + + Y K ++ L +N + + A ++ F + Sbjct: 271 IYSD----EDHLDEAGVRYAPYFKPDFNYDLLLSQNCVCHLGVYRADLVRQLGGFRAGTE 326 Query: 178 AAQDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 AQD+D+ LR+ G + + G S K ++ + +D Sbjct: 327 GAQDWDLALRVFEAVGRDRIHHIPKILYHWRSIEGSTARGVSEKSYASS-AGRKVVEDHL 385 Query: 236 DRASKKYQ-----LFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 R ++ + ++R + ++ RN + L + Sbjct: 386 SRTNQAVEGVEEVKQGHLRVRWSLPNPPPVAIIIPTRNFRHLLEV 430 Score = 68.2 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 79/250 (31%), Gaps = 24/250 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALND 59 + N P ++I +PT N + L A++SVL + Y N+ ++IVD+ S L + Sbjct: 409 LPNPPPVAIIIPTRNFRHLLEVAVESVLARTDYPNFRLVIVDNDSNEESTLDYLASLQAQ 468 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTP-NRLSVFLAHKQQLVTHAF 118 + + + N+A+ + +++D E + L ++ + Sbjct: 469 GKADVLRIPGPFNYSLLNNRAVAACTEPVVCLLNNDIEIHERDWLREMVSQAVR-PGVGA 527 Query: 119 LYANDYVCQGEVYSQPASL-------PLYPKSPYSR-----RLFYKRNI----IGNQVFT 162 + Y G + L S RL +N V Sbjct: 528 VGTKLYYPDGRIQHAGVILGMGGAAGHFLKGCDSSNESHGGRLHVCQNFSAVTAACLVVE 587 Query: 163 WAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 + D + K A D D L++ A +H + + Sbjct: 588 RRKYLEVGGLDEGDFKVAFNDIDFCLKLDDAGYRNVYTPFAEMS---HHESASRGAEERT 644 Query: 221 FSGYFHFYRK 230 +G R+ Sbjct: 645 DAGKERATRE 654 >UniRef50_A2VSE1 Glycosyl transferase n=1 Tax=Burkholderia cenocepacia PC184 RepID=A2VSE1_9BURK Length = 654 Score = 271 bits (695), Expect = 1e-71, Method: Composition-based stats. Identities = 62/281 (22%), Positives = 104/281 (37%), Gaps = 14/281 (4%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALND 59 +N PLIS+ MP +N + AI+S+ Q Y +WE+ I DD ST A D Sbjct: 118 QNRPLISVVMPVYNPNPVWLVEAIESIRSQLYPHWELCIADDVSTDPAIRPLLERYASED 177 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI + N A N A+ L G ++ D DD + L + + Sbjct: 178 PRIKVAFREKNGHISAASNTALALVTGTWVALFDHDDLLPEHALYCVADAVDRNSAIRLI 237 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKA 178 Y++ + ++ Y K ++ LF +N+ + VF A + F + Sbjct: 238 YSD----EDKIDESGVRREPYFKCDWNADLFLSQNMFSHLGVFQKALLDEVGGFREGYEG 293 Query: 179 AQDYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QDYD+ LR + G + ++ + K + R D F Sbjct: 294 SQDYDLALRCIERAGTAAIHHIPRVLYHWRVHAESTASGTDAKPY-AVVAGERALGDHFV 352 Query: 237 RASKKYQLFTL---YQIRNKRMTWRTLLT-LLSVRNGKRLA 273 R + + Y+ L++ ++ RN L Sbjct: 353 RMGIRGSIEYAGNGYRAHYALPDPVPLVSLIIPARNECVLI 393 Score = 97.4 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 76/229 (33%), Gaps = 22/229 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+S+ +P N L R I+S++ + Y+N+E++I+D+ S L + D RI Sbjct: 378 PLVSLIIPARNECVLIQRCIESIVGKTRYANYEILILDNGSDDLATLDYLASLQADGRIR 437 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + N+ A N + A GE + +D + +P L+ + Q + A Sbjct: 438 VLRNERPLSFSASCNAGVEAANGEIVGLLDSGAEVISPQWLAEMVGVALQ-PGVGAVGAK 496 Query: 123 DYVCQGEVYSQP--------ASLPLYPKSPYSRRLFYKRNIIG--------NQVFTWAWR 166 G V A S + ++I + + Sbjct: 497 LLYPNGAVQHAGIVLGLGGLAGHVHRNVPRASYGYGGRASLISAFSAVSAACMIVRKSVY 556 Query: 167 FKECLFDTELKAAQ--DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 F+ + A D D LR+ A ++ Sbjct: 557 RDVAGFNEKDLGAAFGDIDFCLRLRDAGYRNVWTPYA-ELYRHEPQTRS 604 >UniRef50_B3PFZ6 Glycosyl transferase, putative, gt2E n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PFZ6_CELJU Length = 684 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. Identities = 62/279 (22%), Positives = 109/279 (39%), Gaps = 14/279 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDPR 61 P IS +PT+N AI S+ Q Y NWE+ I DD ST E D R Sbjct: 134 QPCISFVLPTYNSSIPWLKEAIDSLKAQPYQNWELCIADDASTHREVRDFLAQQQSQDSR 193 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + + N N A +A G++I D DD P L L +Y+ Sbjct: 194 IKVVFREKNGHISESSNSAAEIASGDWIALFDHDDLLHPFALYWILQAINNNPDAQLIYS 253 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKAAQ 180 + + ++ Q Y K ++ LF +N + F++ F + +Q Sbjct: 254 D----EDKIDEQGNRHSPYFKPDWNYDLFLSQNCFSHLGLIRRSLFEKIHGFRKGYEGSQ 309 Query: 181 DYDIFLRMVV--EYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 D+D+ LR + E + + + ++ ++ K ++ + +D R Sbjct: 310 DHDLILRAIELVEPTQIIHIPKVLYHWRVHADSTAKSTDSKPYAA-IAGEKAIQDHLLRI 368 Query: 239 SKKYQLF---TLYQIRNKRMTWRTLLT-LLSVRNGKRLA 273 + K Q+ Y+++ K L++ ++ RNG +L Sbjct: 369 NAKAQVSFEGYGYRVKYKLPDNPPLVSLIIPTRNGLQLI 407 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 57/291 (19%), Positives = 93/291 (31%), Gaps = 32/291 (10%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 N PL+S+ +PT N QL + I S+ Q Y N+E+I+VD+ S E L + T N P Sbjct: 390 NPPLVSLIIPTRNGLQLIRQCIDSIQ-QKTTYPNYEIIVVDNGSDDPEALTYFDTLKNTP 448 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRL-SVFLAHKQQLVTHAF 118 R I ++ + N A+ + GE I I++D + P L + + + Sbjct: 449 RFKVIRDNRPFNYSQLNNLAVKHSDGEIIGLINNDVEVIRPEWLNEMVVHALR--PGVGA 506 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-----------------NQVF 161 + A + E + LF + G V Sbjct: 507 VGA-KLLFPDERLQHGGVVLGIGGVANHAHLFIRGTHHGYFARASVTQQFSAVTAACLVI 565 Query: 162 TWAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 A + D L A D D +R+ + A +H + Sbjct: 566 RKAIYQEVDGLDEQNLAVAFNDVDFCIRVTNKGYRNIWTPYALLY---HHESATRGADIA 622 Query: 220 KFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGK 270 F R+ + K+ Y + + R GK Sbjct: 623 PEK-RERFVREVEYMMSTWKKELANDPFYNPNLN-LNFPDFSLHSPPRLGK 671 >UniRef50_D2RDR5 Glycosyl transferase family 2 n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RDR5_ARCPR Length = 1164 Score = 270 bits (692), Expect = 3e-71, Method: Composition-based stats. Identities = 65/281 (23%), Positives = 103/281 (36%), Gaps = 12/281 (4%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALND 59 K P ISI MPTWN ++ +AI+SVL Q Y NWE+ I D ST A D Sbjct: 629 KYRPKISIIMPTWNTDERWLRKAIESVLNQVYDNWELCIADGGSTKPHVRKILEEYAKKD 688 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI N G N+A+ LA GE++ +D DDE P L + + F+ Sbjct: 689 KRIKVKFLPKNLGIAGNSNEALKLATGEFVAFLDHDDELAPFALYEVVKLLNEKPDLDFI 748 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI-IGNQVFTWAWRFKECLFDTELKA 178 Y+++ + + K YS +F N I V + K F Sbjct: 749 YSDEDKIDEK----GRRRDPFFKPDYSPDMFLSCNYLIHITVIRKSLVDKVGGFRLGYDG 804 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QDYD+FLR++ + + + K ++ R Sbjct: 805 SQDYDLFLRVLEHTDKIAHIPKILYHWRAIETSCASRPEAKMYAYKAAKKALADAMKRRG 864 Query: 239 SK-----KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLAD 274 + Y+I+ K + ++ ++ + Sbjct: 865 IEIEGVYDGLWLGSYRIKYKINGNPKVSIIIPTKDKVEVLK 905 Score = 95.9 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 46/228 (20%), Positives = 85/228 (37%), Gaps = 24/228 (10%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALND 59 + NP +SI +PT ++ ++ R ++S+L + Y N+E++IVD+ S + + Y T + Sbjct: 885 INGNPKVSIIIPTKDKVEVLKRCVESILNKTTYQNYEIVIVDNNSQEEKTFEYYETIKDH 944 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAF 118 P+I + + A+ N A+ E+I +++D + T LS L H Q+ Sbjct: 945 PKIRILEYNKPFNFSAINNYAVSKVDSEFILFLNNDTEVITSEWLSAMLEHAQR-KEVGA 1003 Query: 119 LYANDYVCQGEVYSQP----------ASLPLYPKSPYSRRLFYKRNIIGNQ-------VF 161 + A + A S + N++ N + Sbjct: 1004 VGAKLLYPNNTIQHAGVILGLGVHRVAGHSHRHYPANSHGYVGRINVVQNLSAVTAACML 1063 Query: 162 TWAWRFK-ECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEE-ATQIL 205 T F+ FD L A D D L++ + Sbjct: 1064 TKKSLFEEVGGFDEVNLPIAFNDVDYCLKLREKGYLIVYTPYAVLYHY 1111 >UniRef50_B1JX11 Glycosyl transferase family 2 n=1 Tax=Burkholderia cenocepacia MC0-3 RepID=B1JX11_BURCC Length = 765 Score = 270 bits (692), Expect = 3e-71, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 111/286 (38%), Gaps = 16/286 (5%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALN 58 + PLIS+ +P +N + IR I+SV Q Y +WE+ +VDD S A Sbjct: 209 LAYRPLISVLIPLYNTPEPFLIRCIESVREQLYDHWELCLVDDASPQPHVQRICERYAAQ 268 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI Y+ + N N A+ LA GE+ +D DDE + L + + + Sbjct: 269 DSRIRYMRRETNGHIAEATNSALSLATGEFSALLDHDDELAAHALYMVVVELNKQPDLDM 328 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELK 177 LY++ + ++ Q + KS ++ L +N + + + F + Sbjct: 329 LYSD----EDKIDEQGKRYEPWFKSDWNYDLMLSQNAVVHLAVYRTSILREIGGFRSAFN 384 Query: 178 AAQDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QDYD+ LR + + G + + ++ K + Y R ++ Sbjct: 385 GSQDYDVTLRFSEQTTPERIRHIPFILYHWRAISGSVALATTEKLYP-YEAAERAIREHL 443 Query: 236 DRASKKYQLFTLYQIRNKRMTW------RTLLTLLSVRNGKRLADG 275 +R + + + ++TW + ++ ++ L Sbjct: 444 ERTGRSATVKRQPHLGYYQVTWPVPAPEPKVAIIIPTKDKVELLRV 489 Score = 73.2 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 90/283 (31%), Gaps = 29/283 (10%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P ++I +PT ++ +L A+ S+L + Y N+E++IV++ S ++ + P++ Sbjct: 471 EPKVAIIIPTKDKVELLRVAVDSILEKTTYVNYEIVIVNNRSVEASTMEYFAQVQESPKV 530 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL-- 119 + D A+ N A+ + +++D + PN L + H + + Sbjct: 531 RLLDYDKPYSFAALNNWAVTQTDAPLLAFVNNDIEVIEPNWLREMVGHALR-PEVGSVGA 589 Query: 120 ----------YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI----IGNQVFTWAW 165 ++ V G + P Y R + V Sbjct: 590 KLLYPNGTIQHSGVVVGIGGLAGHPHVGEPGETFGYFGRAACTQRYSAVTAACVVMRREV 649 Query: 166 RFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT-----SSP 218 + FD A D D+ +R+ A +H + Sbjct: 650 FLEVSGFDEVNFAVAFNDVDLGMRLGQAGYANVWTPRALLF---HHESASLGLPTNEDRR 706 Query: 219 KKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLL 261 ++F +R+ R Y + R + + Sbjct: 707 RQFLEECDNFRRIWADVIRNDPFYNPNLTISGGDFRPNFPPRV 749 >UniRef50_UPI0001AF4953 glycosyl transferase, group 2 family protein n=2 Tax=Proteobacteria RepID=UPI0001AF4953 Length = 1609 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 115/282 (40%), Gaps = 15/282 (5%) Query: 3 NNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQ-QYVTALNDP 60 P+ISI MP +N L A++SV Q Y WE+ + DD ST E ++ + D Sbjct: 1071 EGPVISIIMPVYNPPLDLLREAVESVRAQLYPQWELCLADDASTYPEVIEYLKSLSALDD 1130 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + + N A N A+ +A GE++ +D+DD + L ++ +Y Sbjct: 1131 RIKVVFREDNGHISAASNSALEIATGEFVALMDNDDLLPRHALYWIAKTIRENPDAGLIY 1190 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAA 179 ++ + ++ + Y KS ++ LF +N++ + + F T + A Sbjct: 1191 SD----EDKISTDGKRSSPYFKSDWNEFLFRSQNMVCHLGAYRRDLVNEVGQFRTGFEGA 1246 Query: 180 QDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QDYD+ LR V + + + I+ G + K ++ + + +R Sbjct: 1247 QDYDLALRCVEKLERNQIIHIPRVLYHWRIHAGSTAMAGDEKPYAA-LAGVKALDEHLER 1305 Query: 238 AS----KKYQLFTLYQIRNKRMTWRTLLT-LLSVRNGKRLAD 274 + ++Y++ L++ ++ RN L Sbjct: 1306 KGGVGIAELSSSSMYRVHYALPASPPLVSLIIPTRNAHALVK 1347 Score = 97.4 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 49/264 (18%), Positives = 95/264 (35%), Gaps = 28/264 (10%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+S+ +PT N L + I S+ +Q Y+++E+I++D+ S E L+ + + I Sbjct: 1331 PLVSLIIPTRNAHALVKQCIDSI-KQLTTYTHYEIILIDNGSDEPESLEYFAQLDQEENI 1389 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYA 121 + ++ A+ N A+ +A GE I I++D + TP LS ++ Q + A Sbjct: 1390 RVMRDEGPFNYSALNNAAVRIANGELIGLINNDIEVITPEWLSEMVSIALQ-PNVGAVGA 1448 Query: 122 NDYVCQGEVYSQP--------ASLPLYPKSPYSRRLFYKRNII--------GNQVFTWAW 165 + + A + F + +I + + Sbjct: 1449 RLWYPDDRLQHGGVITGLGGVAGHSHKYLPKGAPGYFCRAELIQEFSAVTAACLIIKKST 1508 Query: 166 RFKECLFDTE-LKAA-QDYDIFLRMVVEYGEPWKVE--EATQILHINHGEMQITSSPKKF 221 + + E LK A D D L++ E + H + +P+K Sbjct: 1509 FDQVGGLEEEHLKIAFNDVDFCLKVREAGYVNVWTPFAE---LYHHESATRGLEDTPQKQ 1565 Query: 222 SGYFHFYRKHKDKFDRASKKYQLF 245 + K ++ Y Sbjct: 1566 ERFAKEIEYIKSRWPDIQADYAYS 1589 >UniRef50_C6JDU8 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JDU8_9FIRM Length = 635 Score = 268 bits (685), Expect = 2e-70, Method: Composition-based stats. Identities = 63/279 (22%), Positives = 111/279 (39%), Gaps = 14/279 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY--VTALNDP 60 +PLISI +P + + + I+S++ Q YSNWE+ IV+ S E++Q+ + D Sbjct: 80 SPLISIAVPAYQTPVEFLRQMIESLIVQTYSNWELCIVN-ASPDNEEMQKVLAEYSAGDS 138 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ + + N G N+A +A+GE++ +D DD PN L + Q LY Sbjct: 139 RVRFCNLKENLGIAENTNRAFAMAKGEFVGLLDHDDLLAPNALYEIVKILQDHPQADALY 198 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELKAA 179 ++ E+ + KS ++ L N I + V + K F E A Sbjct: 199 TDEDKVTTELDEH---FQPHLKSDFNLDLLRSNNYICHFFVVRKSIVEKTGGFRKEFDGA 255 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QDYD R GE V E + + K + + R + +R Sbjct: 256 QDYDFIFRCTENAGEVLHVPEILYHWRTHKASTADNPASKMY-AFEAGKRAIEAHLERTG 314 Query: 240 KKYQLFT-----LYQIRNKRMTWRTLLTLLSVRNGKRLA 273 K ++ Y+++ + ++ ++ K Sbjct: 315 TKGEVSHTQDLGFYRVKYPVQGKPLVSVIIPNKDEKETL 353 Score = 86.7 bits (214), Expect = 8e-16, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 75/225 (33%), Gaps = 25/225 (11%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + PL+S+ +P + ++ ++ + + Y N+E+IIV++ ST+ E + Y + Sbjct: 335 QGKPLVSVIIPNKDEKETLQTCLEMLEKNTGYQNFEIIIVENNSTTDEIFRYYKELSGNR 394 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL 119 +I + A+ N A A+GEY+ +++D P+ L L Q+ + Sbjct: 395 KIHLLRWGKEFNYSAINNFAAAHAKGEYLLFLNNDVKSINPDWLEEMLGVCQR-PEVGGV 453 Query: 120 YANDYVCQGEVYSQP------------------ASLPLYPKSPYSRRLFYKRNIIGNQVF 161 A + K+ + + + Sbjct: 454 GAKLIYPDNTIQHAGCVIGMGGIAGHMFVDMPADRTGYLHKASLLQDM--SAVTAACLLM 511 Query: 162 TWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQI 204 + F EL A D D+ L++ A Sbjct: 512 KKEVFEQAGGFTEELAVAFNDVDLCLKVRKNGYLIVYDPYAKLYH 556 >UniRef50_A0YLN7 Glycosyl transferase, group 2 family protein n=3 Tax=Oscillatoriales RepID=A0YLN7_9CYAN Length = 2105 Score = 268 bits (685), Expect = 2e-70, Method: Composition-based stats. Identities = 62/282 (21%), Positives = 109/282 (38%), Gaps = 17/282 (6%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQ-QYVTALNDPRIT 63 IS+ MP +N + +AI SV+ Q YSNWE+ I DDCS+ E + A D RI Sbjct: 857 KISVVMPVYNPPIEFLEKAINSVINQVYSNWELCIADDCSSDPEIAEFLNKFAEEDERIK 916 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N N A +LA GE+I +D DDE TPN L + + +Y + Sbjct: 917 VIFRPENGNISRATNSAAVLATGEFILLLDHDDELTPNALGEIALYLVENPETDVMYTD- 975 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 ++ ++ K +S L IG+ + F+ + +QDY Sbjct: 976 ---DDKIDTEGKRFAPQFKPDWSPELLLSYMYIGHALVLRRKLFEQVAGMRLGFEGSQDY 1032 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+ LR+ + G I+ + K + + ++ DR + Sbjct: 1033 DLALRITENTHHIAHLPLVLYHWRTAPGSTAISGAAKP-ASFEAGRLAVQEALDRREIES 1091 Query: 243 QLFT-LYQIRNKRMTW--------RTLLTLLSVRNGKRLADG 275 +++ + ++ + ++ ++ +N L Sbjct: 1092 KVYQPDWAVKQSLGIFSHQFIDNGPSVTIIIPTKNQLNLLQA 1133 Score = 85.5 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 52/291 (17%), Positives = 96/291 (32%), Gaps = 37/291 (12%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSW---EQLQQYVTALN 58 N P ++I +PT N+ L +KS+L + Y N+E++I+D+ S E L A Sbjct: 1114 NGPSVTIIIPTKNQLNLLQACLKSLLEKTTYQNYEVMIIDNESDDPKTLEYLDWIALAET 1173 Query: 59 DPRITYIHNDINSG----ACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQL 113 +P+I+ + G A N+A+ Q +Y+ +++D + +P LS + + Q Sbjct: 1174 EPKISVLRVANPHGKFSFAAIN-NRAVEQTQTDYVLFLNNDTEIISPEWLSQMMGYIQ-F 1231 Query: 114 VTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG---------------- 157 + A + L+ K R+ G Sbjct: 1232 EKVGAVGARLIFPDDHIQHAGIIHGLHHKLAGHAFKLSHRDYFGYLAYSKVVRNYSAVTA 1291 Query: 158 -NQVFTWAWRFKECLFDT-EL-KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQI 214 + + FD A D D R+ + EA + + Sbjct: 1292 ACLLTPRQLFLELGGFDEQNFAVAYNDADYGYRLTEKNYRCVYCAEAELL---HKEGTSR 1348 Query: 215 --TSSPKKFSGYFHFY-RKHKDKFDRA-SKKYQLFTLYQIRNKRMTWRTLL 261 PK+ + + Y K + S + F + R + Sbjct: 1349 GFKDDPKEVAHFRQKYADKIDPYYSHHLSLDNEWFHIQPRRFNIQNPPVKI 1399 >UniRef50_B1FXK1 Glycosyl transferase family 2 n=2 Tax=Burkholderia RepID=B1FXK1_9BURK Length = 723 Score = 267 bits (684), Expect = 3e-70, Method: Composition-based stats. Identities = 63/284 (22%), Positives = 112/284 (39%), Gaps = 18/284 (6%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALNDP- 60 P +SI MPT+N ++ +A+ SV+ Q Y NWE+ I DDCST E P Sbjct: 181 QPKVSIVMPTYNTPEKWLRKALDSVVDQVYENWEICIADDCSTKPEVRMVLDSYVAKYPG 240 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 ++ + N A N A+ LA GE++ +D DDE P L + + A +Y Sbjct: 241 QVKVAYRTTNGHISASSNTALELATGEFVGLLDHDDELHPLALYCVIEKLNEHPDAALIY 300 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 ++ + ++ + Y K ++ LF +N+I + + F T L+ + Sbjct: 301 SD----EDKISEEGERSDPYFKCDFNYDLFLSQNMISHFGVYKTSVLRDIGGFRTGLEGS 356 Query: 180 QDYDIFLRMVVE--YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QDYD+ LR++ + + V A I K + + R D R Sbjct: 357 QDYDLALRVIDRVGHDVVYHVPRALYHWRIIPESTASGHEAKPY-AHIAAMRAIDDHLKR 415 Query: 238 ASKKYQLFT------LYQIRNKRMT-WRTLLTLLSVRNGKRLAD 274 + ++ + ++ ++ R+ L + Sbjct: 416 NNIAAHTIHAPGTHAFNKVVYELPEVLPSVEIIIPTRDSAELVE 459 Score = 83.6 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 78/220 (35%), Gaps = 21/220 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P + I +PT + +L + ++SV Q YSN+ + I+D+ S E + D RI Sbjct: 443 PSVEIIIPTRDSAELVEQCVESVR-QKSTYSNYRITIIDNGSVKPETHDLFARLQADERI 501 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLA-HKQQLVTHA--- 117 I +D A+ N+ + + +++ +++D + + L ++ Q+ V Sbjct: 502 KVIRDDSPFNYSAINNRVALASTADFVCLMNNDIEVINADWLEEMVSVAIQKNVGAVGAK 561 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRL-----------FYKRNIIGNQVFTWAWR 166 LY +D + G V + + + + + + Sbjct: 562 LLYPDDTIQHGGVVLGVGGIASHAHKHFPNTMAGYFARARLRNAMSAVTAACLLIRQSIY 621 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQI 204 + D EL A D D LR+ A Sbjct: 622 KEVGGLDEELHVAYNDIDFCLRVRKAGYRNVWTPYAELYH 661 >UniRef50_A8TT37 Glycosyl transferase, group 2 family protein n=2 Tax=Bacteria RepID=A8TT37_9PROT Length = 1185 Score = 267 bits (683), Expect = 4e-70, Method: Composition-based stats. Identities = 66/292 (22%), Positives = 114/292 (39%), Gaps = 20/292 (6%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 MK P++S+ MPT+N ++ RAI+SV Q Y WE+ I DD ST E + A Sbjct: 634 MKAKPVLSVIMPTYNTDPEVLDRAIQSVRNQIYPRWELCIADDASTHGETQEVIRRHAGE 693 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + ++N A N A+ LA G+++ +D DDE T + L + + Sbjct: 694 DDRIKTVFREVNGHISAASNSALDLATGDFVVLLDHDDELTSHALYMIASELNDHPNADV 753 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELK 177 +Y+++ + Y K +S +F +N + + + F + Sbjct: 754 IYSDEDKLDDN----GEAYQPYFKPDWSPDMFLAQNYLNHVSAHRRSLVEVVGRFREGFE 809 Query: 178 AAQDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QDYD+ LR++ + G + K + R + Sbjct: 810 GSQDYDLALRVIERTTAERIRHIPHVLYHWRAVAGSVAGADEAKDY-ALEAARRAITEHL 868 Query: 236 DRASKKYQL------FTLYQIRNKRMTWRTLLTLLSVRNGKRLA----DGIR 277 +R ++ F+ I T + ++ RN + DGIR Sbjct: 869 ERTGIDGEVVPGADRFSHRVIYALPHTLPLVSLIVPTRNKAEVLKLCIDGIR 920 Score = 95.1 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 48/257 (18%), Positives = 92/257 (35%), Gaps = 32/257 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+S+ +PT N+ ++ I + DY NWE+I+VD+ S E L+ DPRIT Sbjct: 897 PLVSLIVPTRNKAEVLKLCIDGIREHNDYPNWELILVDNGSDEDESLEYLEALGKDPRIT 956 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTP---NRLSVFLAHKQQLVTHAFLY 120 + +D + N+A +A+GE + +++D P + L+ + Q + Sbjct: 957 VLRDDGPFNYSRLNNRAARMAKGEILGLLNND--IEPINRDWLTEMVRQVVQ-PGVGAVG 1013 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKR------------NII----GNQVFTWA 164 A Y + + L + + + + N + Sbjct: 1014 AKLYYPNDTLQHGGVIVGLGGVAGHFDKRLPRNERGYFGRAGIVQNFTAVTAACMLMPKT 1073 Query: 165 WRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSS--PK 219 + D L A D D+ L++ A +H + P+ Sbjct: 1074 VWDEVEGLDEHHLSVAFNDVDLCLKVRKAGYRIVWSPYAELY----HHESVSRGQDTEPE 1129 Query: 220 KFSGYFHFYRKHKDKFD 236 K + R + +++ Sbjct: 1130 KLERFASEVRTMQTRWN 1146 >UniRef50_C3RR62 Glycosyl transferase n=3 Tax=Bacteria RepID=C3RR62_9MOLU Length = 718 Score = 266 bits (680), Expect = 8e-70, Method: Composition-based stats. Identities = 62/281 (22%), Positives = 114/281 (40%), Gaps = 13/281 (4%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTS-WEQLQQYVTALND 59 + NPLISI +P +N + + + S+L Q Y N+E+ IVDDCST+ D Sbjct: 183 EYNPLISILIPVYNVERKFLSECLDSILNQTYQNYEVCIVDDCSTNLETINTLKEYENKD 242 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI IN N A+ +A+GE+I +D+DD PN L +A + F+ Sbjct: 243 TRIVVKTRLINGHISKASNDALEIARGEFICLVDNDDTLAPNALYENVALLNKHKDADFI 302 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 Y++ + ++ + + KS ++ N I + + F L+ Sbjct: 303 YSD----EDKLDLRGERCEPHFKSDFAPDTLLGINYICHLAVLRTSLVREVGGFTVGLEG 358 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QD+D+FLR+ + + + + + G ++ K + + R Sbjct: 359 VQDHDLFLRITEKTKNIYHIPKILYHWRMIEGSTSLSVDNKAY-AVKKGIETIESTLKRR 417 Query: 239 SKKYQLFTL-----YQIRNKRMTWRTLLTLLSVRNGKRLAD 274 K + +L Y I T ++ ++ R+ + + Sbjct: 418 GVKANVKSLGNSTVYGIEYVLDTEPSVSIIVPTRDFADVTE 458 Score = 75.1 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 45/273 (16%), Positives = 93/273 (34%), Gaps = 27/273 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCS-TSWEQLQQYVTALN 58 + P +SI +PT + + + ++S+ + YSN+E++IVD+ S + Sbjct: 438 LDTEPSVSIIVPTRDFADVTEKCLESIYKLTNYSNFEVVIVDNRSEKQETMELFEKYQMR 497 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAH-KQQLVTH 116 I D+ A+ N A+ + + + +++D + TPN L + +++ Q+ + Sbjct: 498 YENFRVIKADMEFNYSAINNLAVSTCKSDVLVLLNNDTEVLTPNWLKLMVSYAIQKHIGA 557 Query: 117 A---FLYANDYVCQGEVYSQPAS--------LPLYPKSPYSRRLFYKRNI----IGNQVF 161 LY + + G V + RL N Sbjct: 558 VGAKLLYPDMTIQHGGVLLGVGNAVAAHAFISHPRDDEGVYGRLKIPYNYSAVTAACLAV 617 Query: 162 TWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSSPK 219 + D LK A D D L+++ + + + S+ + Sbjct: 618 ERKKYIQVGGLDETLKVAYNDVDFNLKLLDAGYYNLFIPQVELIHY--ESKSRGLDSTSE 675 Query: 220 KFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRN 252 K+ + + + KY Y +N Sbjct: 676 KYKQFL----AENNYMHKKWAKYIECDPYYNKN 704 >UniRef50_Q094Y6 O-antigen biosynthesis protein RfbC n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094Y6_STIAU Length = 715 Score = 265 bits (679), Expect = 9e-70, Method: Composition-based stats. Identities = 54/271 (19%), Positives = 99/271 (36%), Gaps = 12/271 (4%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALND 59 P S+ +P + + I+SVL Q Y +W++ IVDD S A + Sbjct: 194 TQGPRFSLAVPAYETPEPYLRACIESVLAQTYPDWQLCIVDDGSRGSGVEHVVRAYARRE 253 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI + N G N A+ A G+++ +D DD P L V H Sbjct: 254 PRIVFERLARNVGIARATNAALAHATGDFVAFLDHDDVLAPRALEVVAQHMADAPEGEVF 313 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 Y++ + ++ + K S L N I + + + + Sbjct: 314 YSD----EDKIDAHGVRHAPSFKPALSPDLLRSVNYICHFLVVRTRLLQEVGGIRPGFEG 369 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 AQD+++ LR+V + E ++ G + + K + R ++ R Sbjct: 370 AQDHELLLRLVERTQRFHHIPEVLYHWRVHAGSTS-SDASAKPAASEAGRRAVEEHLQRQ 428 Query: 239 SKKYQLFT----LYQIRNKRMTWRTLLTLLS 265 ++ + T +Y++R + + LL Sbjct: 429 AEAGTVETTEPGVYRVRYPLVEASRVSLLLH 459 >UniRef50_B3E8X3 Glycosyl transferase family 2 n=1 Tax=Geobacter lovleyi SZ RepID=B3E8X3_GEOLS Length = 857 Score = 265 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 59/278 (21%), Positives = 99/278 (35%), Gaps = 15/278 (5%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALN 58 + PLIS+ MP +N + AI SV+ Q Y WE+ IVDD ST E + Sbjct: 331 LSTQPLISVVMPVYNTPLRYLKEAIDSVIAQVYKTWELCIVDDASTDSEIKGLLQEYERS 390 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + + N N LA GEYI +D DD T + L+ + Sbjct: 391 DLRIKVVFRENNGNISTATNSGFELAAGEYIALLDHDDLLTWDALAEVVLAINADPDSDI 450 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELK 177 LY++ Q ++ A + K +S F + +G+ + K D Sbjct: 451 LYSD----QDKIDEFGALSEPFFKPDWSPDYFCRVMYVGHLLVFRRNLLDKVGGCDPRFD 506 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QDY++ LR+ + + G + K F+R Sbjct: 507 KVQDYELMLRLSEVARKIHHIPRILYHWRTLEGSVARDPHGKSDIDQLQV-EAVSAAFER 565 Query: 238 ASKKYQLFTLYQIRNK-------RMTWRTLLTLLSVRN 268 ++ ++ R+ W + ++ ++ Sbjct: 566 RDIAAKIQPHSVFSHRAVIVPADRVDWPLVSIIIPTKD 603 Score = 95.9 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 75/227 (33%), Gaps = 37/227 (16%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSW------EQLQQYVTAL 57 PL+SI +PT + Q R ++S+ R Y N+E+I++D+ +T + + Sbjct: 593 PLVSIIIPTKDAPQHIGRCLESIYTRSTYPNYEVIVIDNNTTDPVARHILDNTPAKLVKF 652 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTH 116 D + Y N + A GEY+ +++D + TP + L +Q Sbjct: 653 ID-KFNY---------SRANNIGVQEAAGEYVILLNNDTEVITPRWIEFLLFGLEQ-KDV 701 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLF------YKRNIIGN---------QVF 161 A + + V L + + R F Y ++ + Sbjct: 702 AAVGSLLLYPDNTVQHAGVVLGCRGTADHVMRYFPADSDGYAGSLSCTREVSAVTAACMM 761 Query: 162 TWAWRFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEE-ATQIL 205 + E QD D LR+ V V + Sbjct: 762 CRRADYLSSGSMIEFFGTHYQDVDFCLRLAVGGKRILFVPQAVLYHY 808 >UniRef50_D2LU70 Glycosyl transferase family 2 n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LU70_BACS4 Length = 266 Score = 264 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 65/260 (25%), Positives = 110/260 (42%), Gaps = 5/260 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 K+ PLISI PT+N + I+SVL Q +WEMIIVDDCST D R Sbjct: 12 KSMPLISIITPTYNSENYIADTIESVLAQTLDDWEMIIVDDCSTDNTTSVIRSYVEKDER 71 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + I + NSGA RN AI + G+YI +D DD+W P +L L + Q+ +F + Sbjct: 72 VKLIQLEKNSGAAIARNTAISHSVGKYIAFLDGDDQWLPEKLEKQLQYMQE-KGASFSFT 130 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + + + K N+IG + ++++ QD Sbjct: 131 KYKTMDQKGEETGTIVDA--PETVNYHQLLKHNVIGCLTVMLDVEKVGQVKMVDIRSRQD 188 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 Y ++L + + E ++E ++ + ++ K + YR+ +D S Sbjct: 189 YVLWLELCKKGFEAHGIQEVLAKYRVSQHSLS-SNKVKMAKQNWKVYREIEDLSLLKSLW 247 Query: 242 YQLFTLYQIRNKRMTWRTLL 261 Y + + + + + +L Sbjct: 248 YFINYAF-YKLSKSLFNKIL 266 >UniRef50_C3RR63 Glycosyl transferase n=2 Tax=Bacteria RepID=C3RR63_9MOLU Length = 660 Score = 263 bits (673), Expect = 5e-69, Method: Composition-based stats. Identities = 65/284 (22%), Positives = 109/284 (38%), Gaps = 12/284 (4%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTS-WEQLQQYVTALND 59 P ISI MP +N + I+S+L+Q Y N+E+ I DDCST+ D Sbjct: 149 NYQPKISIVMPVYNVPGKYLSYCIESILKQTYQNFEICIADDCSTNIDTIETLKAYKNKD 208 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI I + N N A+ LA GE+I +D+DD P+ L+ + + F+ Sbjct: 209 NRIKVIFREENGHISRATNSALSLASGEFIGLMDNDDTLDPHALNEVVNILNEDKNIDFI 268 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 Y + + ++ + K +S Y N I + K F + Sbjct: 269 YTD----EDKIDMGGKRSDPHFKPDFSLDTLYGGNYICHFSVIRKNIIEKIGGFRVGYEG 324 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 AQD+D+FLR+ E + + + + + G + K R +D F + Sbjct: 325 AQDFDLFLRVSNETDKIYHIPKILYHWRMIPGSTAVGGDGAKNYAGEAGKRALEDYFKQK 384 Query: 239 S----KKYQLFTLYQIRNKRMTWRTLLTLLSVR-NGKRLADGIR 277 + + T Y + + L+ N L + +R Sbjct: 385 AISVKIDNIISTQYFVEYIFENEPKVDILVVFEGNNTSLNNFLR 428 >UniRef50_C5A2Y9 Glycosyltransferase, family 2 n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A2Y9_THEGJ Length = 304 Score = 263 bits (672), Expect = 7e-69, Method: Composition-based stats. Identities = 70/236 (29%), Positives = 111/236 (47%), Gaps = 6/236 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +S+ +PT+NR L RAI SVL Q + ++E+I+VDD S + V ++ND RI Sbjct: 2 SRPTVSVIIPTYNRANLLRRAITSVLNQTFRDFELIVVDDASPDNTP--EVVESINDGRI 59 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-LVTHAFLYA 121 Y+ NSG RN I A+G++I +DDDDEW P+RL + + +Y Sbjct: 60 RYVRLKKNSGGPVARNTGIKKARGKFIALLDDDDEWLPHRLETQVRKFETLEQNIGVVYG 119 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQ 180 Y + K +L + I + FK LFD L ++Q Sbjct: 120 GFYYVSQQDGRILGKRLPAYKGNVYDKLLRENFIGSPTLLIRRECFKRAGLFDPNLSSSQ 179 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 D+D++LR+ Y + V+E +++ ++ + K G F RKH D + Sbjct: 180 DWDMWLRIARHY-KFDYVDEIVAKYYVHGRQISF-NMKKYIPGRERFIRKHLDIWK 233 >UniRef50_A6QB23 Glycosyl transferase n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QB23_SULNB Length = 1335 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 62/279 (22%), Positives = 112/279 (40%), Gaps = 15/279 (5%) Query: 5 PLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDPRI 62 PLISI +PT+N ++ ++SVL Q Y NWE+ I DD ST E + P + Sbjct: 89 PLISIIVPTYNTKKRYLTEMLESVLSQTYGNWELCIADDASTDRETIDTLEYYRTKHPAV 148 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 ++ N N A+ +A G+Y+ +D DD +PN L + LY++ Sbjct: 149 KVVYRKKNGHISEASNTALSIALGDYVAFLDHDDTLSPNALYEMAKKLNEDRKLKILYSD 208 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQD 181 + + KS ++ +F+ +N I + + K F + +QD Sbjct: 209 EDKIDENSN----RYMPHFKSGWNPDMFFSQNYITHLLIIKKEIIDKVGGFRKGYEGSQD 264 Query: 182 YDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 YD+ LR + G+ +VE+ G S+ K + + R +D F R Sbjct: 265 YDLVLRCLDHIGKEEIGRVEKILYHWRAIKGSTAYGSNEKAY-AHDAGLRGLQDYFLRKD 323 Query: 240 KKYQLF-----TLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 + + Y++ + + ++ R+ + Sbjct: 324 RSISVENGLLANTYKVVYPIVEMPLVSLVVPTRDSYNIL 362 Score = 253 bits (646), Expect = 7e-66, Method: Composition-based stats. Identities = 61/280 (21%), Positives = 117/280 (41%), Gaps = 14/280 (5%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 PL SI MP +N + AI+SV Q Y WE+ I DD ST E+ + Y+ ++ Sbjct: 786 SKQPLFSIVMPVYNVVPKWLKLAIESVENQWYGRWELCIADDAST-SEETKAYLRKIDHH 844 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 +I N G C N+A+ LA+GEYI +D+DDE TP+ L L +Y Sbjct: 845 KIKIRFLKKNLGICGASNEALKLAKGEYIALMDNDDELTPDALYEILKAINT-KAAELIY 903 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAA 179 +++ + + + K +S +F +N + + V + ++ L+ + Sbjct: 904 SDEDKIEED----GTFAEPHFKPDFSPDMFLSQNYLSHLVVIKKELVDRVGGWEAGLEGS 959 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QDYD++L+++ + + + G S K + + ++ R + Sbjct: 960 QDYDLYLKVLEHTEKISHISKVLYHWRKVPGSTAAEYSAKSY-AQEAGRKALENAMKRRA 1018 Query: 240 KKYQLFT-----LYQIRNKRMTWRTLLTLLSVRNGKRLAD 274 K + Y+++ + + ++ ++ L Sbjct: 1019 IKADVKNGKYPGTYRVKYELKEEPLVSIIIPFKDKPELLK 1058 Score = 97.1 bits (241), Expect = 5e-19, Method: Composition-based stats. Identities = 51/274 (18%), Positives = 97/274 (35%), Gaps = 29/274 (10%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 PL+S+ +PT + + + I+S+L++ Y N+E++IVD+ ST + L+ + Sbjct: 345 EMPLVSLVVPTRDSYNILHKCIESILQKTLYENYEILIVDNESTDPKTLRYFEILKKHEH 404 Query: 62 ITYI--HNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAF 118 I + H+ N A+ N + A+GE I +++D + + LS + H + Sbjct: 405 IRILEYHHPFNY--SAINNYGVQYARGEIIGLLNNDVEIISSGWLSEMVQHAIR-PEIGA 461 Query: 119 LYANDYVCQ------------GEVYSQPASLPLYPKSPYSRRLFYKRNI----IGNQVFT 162 + A Y G V Y RL +N + Sbjct: 462 VGAKLYYDNHTIQHAGIVLGIGGVAGHSHKYFPQNHHGYFSRLKIIQNYSAVTAACLLMR 521 Query: 163 WAWRFKECLFDTELKAA--QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK 220 + + + E A D D+ L++ + + +H + S Sbjct: 522 KSVYLEAGGLNEENLAVAFNDVDLCLKLQQKGYRNLWTPYSELY---HHESISRGSDD-T 577 Query: 221 FSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKR 254 F R+ K ++ K+ Y R Sbjct: 578 PEKTERFGREIKYMLEKWKKQLLKDPYYNRNLTR 611 Score = 75.1 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 46/285 (16%), Positives = 99/285 (34%), Gaps = 28/285 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTS-WEQLQQYVTALN 58 +K PL+SI +P ++ +L I+S+L + Y N+E+I +++ S + Sbjct: 1038 LKEEPLVSIIIPFKDKPELLKTCIESILKKSSYQNYEIIGINNRSKEWETFKEMKRLEKR 1097 Query: 59 DPRITYIHNDINSGACAVRNQAIML-AQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTH 116 D R+ + + + N A+ A+G++I +++D + T N + L Q Sbjct: 1098 DSRVRFCEYNDTFNYSKINNFAVSSCAKGKHIVLMNNDIEIITSNWIEEML-MFSQRDDV 1156 Query: 117 AFLYANDYVCQGEVYSQPA------------SLPLYPKSPYSRRLFYKRNIIG---NQVF 161 + + + Y G + Y RL +N+ + Sbjct: 1157 SAVGSKLYYPNGTIQHAGIVLGIGGVAGHAHKYFSKNVPGYFSRLHIVQNLSSVTAALLM 1216 Query: 162 TWAWRF-KECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT--S 216 + + D L+ A D D L++ +H Sbjct: 1217 VKKAIYDEVGGLDEVNLQVAFNDVDFCLKLQKNGYLNLFTPWVEAY---HHESKSRGEED 1273 Query: 217 SPKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLL 261 +P+K + K+K+ + K+ + + +R + + Sbjct: 1274 TPEKQERFKKEVEFMKNKWSKILKEGDPYYNPNLTLEREDFSVIF 1318 >UniRef50_D0L0V1 Glycosyl transferase family 2 n=2 Tax=Gammaproteobacteria RepID=D0L0V1_HALNC Length = 958 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 58/281 (20%), Positives = 106/281 (37%), Gaps = 17/281 (6%) Query: 7 ISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPRITY 64 ISI MP +N ++ I SV Q Y +WE+ I DD S + D RI Sbjct: 426 ISIVMPVYNTPEKYLRLCIDSVRAQSYPHWELCIADDKSPQPHVKKVLDEYIKKDKRIKV 485 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 ++ N N A+ LA GEY+ +D DD + L + LY++ Sbjct: 486 VYRPQNGHISKASNSALKLATGEYVALLDHDDALPEHALYFMAQAIAEHPEAQILYSD-- 543 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQDYD 183 + ++ + KS ++ LFY +N + + + F T ++ +QD D Sbjct: 544 --EDKIDIHGQRSEPHFKSDWNPDLFYSQNYVSHLGVYKRELLQRINGFRTGVEGSQDQD 601 Query: 184 IFLRMV--VEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 + LR + V+ + + + G + S K ++ + D F++ Sbjct: 602 LLLRCLPHVKAEQIIHIPKILYHWRTLEGSTAMASGEKSYT-TDAGIKALSDFFEKNGPA 660 Query: 242 YQLF------TLYQIRNKRMTWRTLLT-LLSVRNGKRLADG 275 Y++ L++ L+ R+ K + + Sbjct: 661 GIKIEQGLVPNTYRVHWPIPNPAPLVSLLIPTRDRKTITEI 701 Score = 93.6 bits (232), Expect = 6e-18, Method: Composition-based stats. Identities = 48/235 (20%), Positives = 87/235 (37%), Gaps = 27/235 (11%) Query: 2 KNNPLISIYMPTWNRQQLAIR-AIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVT-ALN 58 PL+S+ +PT +R + A++S+L + Y N+E+II+D+ S E L + Sbjct: 681 NPAPLVSLLIPTRDR-KTITEIAVRSILDKTTYPNYEIIILDNGSEEPETLDWFAAIQQE 739 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLV--- 114 D R+ + D A+ N A+G I I++D + +P+ L+ ++H + Sbjct: 740 DERVKVLRYDHPFNYSAINNFGAQHAKGSLIGLINNDVEVISPDWLTEMVSHALREDIGC 799 Query: 115 -THAFLYANDYVCQGEVYSQPASLPLYP-------KSPYSRRLFYKRNI----IGNQVFT 162 Y+ND + V + + Y RL +N + Sbjct: 800 VGAKLYYSNDTLQHAGVILGIGGVANHSHKNSKRDSPGYFARLIVAQNFSAVTAACLIIR 859 Query: 163 WAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQI 214 + + D LK A D D L++ A +H + Sbjct: 860 KSVYDQVGGLDEVNLKVAFNDVDFCLKVREAGYRNLWTPYAELY----HHESISR 910 >UniRef50_Q7U8H3 Putative glycosyltransferase family 2 protein n=1 Tax=Synechococcus sp. WH 8102 RepID=Q7U8H3_SYNPX Length = 804 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 58/287 (20%), Positives = 108/287 (37%), Gaps = 19/287 (6%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALND 59 ++ PLISI +PT+N I+SV RQ Y NWE+ I DD S++ +D Sbjct: 235 QDAPLISIIIPTYNTNSNHLRECIESVCRQSYPNWELCICDDSSSAVSVKTILRSYQSSD 294 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PR+ I + N C N A+ +A GEY+ +D DD N L Q+ + Sbjct: 295 PRVKLIFREKNGHICEASNDALRMATGEYVALLDHDDILADNALYWVARELQKKPQANLI 354 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKA 178 Y+++ + + K ++ L N I + K F + Sbjct: 355 YSDEDKINDD----GMRACPHFKPAFNIDLLLSYNFISHLGVYRREILKQIGGFRVGFEG 410 Query: 179 AQDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QD+D+ LR V+E + + + K ++ ++ + D Sbjct: 411 SQDHDLALRTVLESSPDQIIHIPRVLYHWRAHSESTASNPDSKDYT-TESGHKAVQHFLD 469 Query: 237 RASKKYQLFTLYQIRNKRM---------TWRTLLTLLSVRNGKRLAD 274 ++ + +I+ K ++ ++ R+ + + Sbjct: 470 EQHRRGGVRATARIKAKNRFTCQWHIPDKSPSVELIIPTRDQAEVLN 516 Score = 57.0 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 44/302 (14%), Positives = 99/302 (32%), Gaps = 31/302 (10%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQY--VTALND 59 +P + + +PT ++ ++ A+ S++ + Y+N+ + +VD+ S + + ++ Sbjct: 498 KSPSVELIIPTRDQAEVLNLAVDSIITKTTYTNYTITVVDNQSEEVATKNLFKNLKRVHG 557 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNR-LSVFLAHKQQLVTHAF 118 +I I + A+ N A+ + + + +++D E ++ L ++H Sbjct: 558 EKINIIKYNKKFNYSAINNYAVRKSTADIVVLVNNDVEVISSKWLQEIVSH-TSRPDVGC 616 Query: 119 L-----YANDYVCQGEVYSQP-------ASLPLYPKSPYSRRLFYKRNII----GNQVFT 162 + Y+N + G V Y RL Y + + Sbjct: 617 VGAKLYYSNRTIQHGGVVIGIGQVAGHAHKYFPGDSPGYVDRLQYVQQMTAVTAACLAIR 676 Query: 163 WAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITS---- 216 + + +L A D D +R+ A +H + + Sbjct: 677 REIFNEVGGLNEQDLTIAFNDVDFCMRVHARGYRNIFTPYAELF---HHESISRGTEDSP 733 Query: 217 -SPKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 ++F +F D LF + + + SV+ G L Sbjct: 734 EKKERFKREINFMLNQYDIQSNGELPSDLFYNPNLTKIHEDFSINTSPESVKQGIELRSN 793 Query: 276 IR 277 R Sbjct: 794 FR 795 >UniRef50_Q116A2 Glycosyl transferase, group 1 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116A2_TRIEI Length = 1991 Score = 261 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 64/283 (22%), Positives = 107/283 (37%), Gaps = 17/283 (6%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPRI 62 P IS+ MP ++ + + I SVL Q Y NWE I DDCS++ + + A D RI Sbjct: 994 PKISVVMPVYDPPLKFLHQGISSVLNQVYQNWEFCIADDCSSNPKIREILTERAKTDSRI 1053 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 N A N A LA G+++ +D+DDE TP+ L + Q FLY++ Sbjct: 1054 KLTFRSENGNISAATNSAAELATGDFLLFLDNDDELTPDALGEVALYISQNSEIDFLYSD 1113 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQD 181 ++ ++ K YS L IG+ F K F + +QD Sbjct: 1114 ----DDKISTEGKRFDPQFKPEYSPELLLSYMYIGHLCVVRKEIFDKIGGFRIGFEGSQD 1169 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 YD LR + + G I+ K + + ++ R Sbjct: 1170 YDFALRATEISRQVGHLPLVLYHWRTTPGSTAISGGEKP-KSFLAGQKALQESLIRRGVA 1228 Query: 242 YQLFT-LYQIRNKRMTWRT--------LLTLLSVRNGKRLADG 275 ++ + +R K ++ + ++ +N +L Sbjct: 1229 GNIYQPDWAVREKLGIFKPTFPDQGDSVTIIIPTKNQVKLLKA 1271 Score = 77.0 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 41/260 (15%), Positives = 87/260 (33%), Gaps = 38/260 (14%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-----DPR 61 ++I +PT N+ +L ++S+ + Y N+++ ++D+ S + +Y+ L + Sbjct: 1256 VTIIIPTKNQVKLLKACVESLRKTTYQNYQIFVIDNES-DEPETLEYLAGLKCLYSGEDN 1314 Query: 62 ITYIHNDINSG----ACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTH 116 I G A N+A+ + EYI +++D + +P LS + + ++ Sbjct: 1315 ILVFPMKNTDGKFNFAAIN-NRAVEQVKTEYILFLNNDTEIISPYWLSQMMGYA-KIPGV 1372 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPK-------------SPYSRRLFYKRNI----IGNQ 159 + A + L+ Y + F +N Sbjct: 1373 GAVGAKLIYPDKRIQHAGVIHGLHHGLAGHAFKLLHSENRGYLSQAFVSKNYSAVTAACM 1432 Query: 160 VFTWAWRFKECLFDTE--LKAAQDYDIFLRMVVEYGE-PWKVEEATQILHINHGEMQITS 216 + + FD E A D D R++ + + I+ Sbjct: 1433 LTPRKLFLELGGFDEENFAVAYNDADYGYRLLKSGYRSVYCADAEL----IHKEGTSRGY 1488 Query: 217 SPKKFSGYFHFYRKHKDKFD 236 S ++ RK+ D Sbjct: 1489 SDNPQE-EANYRRKYSQMID 1507 >UniRef50_Q46FX2 Putative uncharacterized protein n=1 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46FX2_METBF Length = 783 Score = 260 bits (664), Expect = 5e-68, Method: Composition-based stats. Identities = 59/281 (20%), Positives = 111/281 (39%), Gaps = 13/281 (4%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPR 61 P ISI MP +N + +AI SV+ Q Y+NWE+ I DD ST + D R Sbjct: 260 KPKISIIMPVYNVDEIWLEKAIDSVINQIYTNWELCIADDASTKPHIKSTLSKYSELDSR 319 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N G N+A++LA GE+I +D+DDE + + L + + +Y+ Sbjct: 320 IKVKYLSNNRGISGASNEALLLATGEFIGLLDNDDELSVDALYEVVKVLNKKSDIDLIYS 379 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWA-WRFKECLFDTELKAAQ 180 + + ++ + + K +S + + N + + + F + +Q Sbjct: 380 D----EDKIDMKGNRCNPFFKPGWSPDVLFSVNYVCHFAVIRKTIVDEVGRFRIGFEGSQ 435 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+FLR+ + + + + + G + K + R +D R Sbjct: 436 DYDLFLRVAEKTNQVEHIPKILYHWRMIPGSTALNIFSK-DKAQINGIRSLQDYLYRKKI 494 Query: 241 KYQL-----FTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 + T YQ+ + + ++ + + I Sbjct: 495 DGKAAATINRTNYQVDYEIKGSPLVSIIIHFDDNVHVKKCI 535 Score = 54.7 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 72/226 (31%), Gaps = 40/226 (17%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDY-SNWEMIIVDDCSTSWEQLQQYVT--AL 57 +K +PL+SI + ++ + I S+++ + +E++++ S E+ + Sbjct: 513 IKGSPLVSIII-HFDDNVHVKKCINSIIKNTEETRYEILLM--VSKKNEKQLSINFDPSN 569 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSV-----FLAHKQQ 112 P I + D + A+ N A + G+Y+ ++ R+ ++ + Sbjct: 570 KQPAIKVLTYDESWNVSAINNFAAEKSSGDYLLFMNS-------RIETTNRNWLISLLKN 622 Query: 113 --LVTHAFLYANDYVCQGEVYS-------QPASLPLYPKSP-----------YSRRLFYK 152 + G + ++ +P +SR Sbjct: 623 AGREGIGCVCPKILNVDGTIKQAGIILGLNGEKYDVFSGNPENNWTNFGLDTWSRDYLAV 682 Query: 153 RNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKV 198 + F+ FD D D+ LR+ + + Sbjct: 683 SG--ECLMINKKTFFQAGGFDENKYYGNDIDLCLRVHKKGFRNLCI 726 >UniRef50_B7DLY9 Glycosyl transferase family 2 n=1 Tax=Alicyclobacillus acidocaldarius LAA1 RepID=B7DLY9_9BACL Length = 896 Score = 259 bits (662), Expect = 8e-68, Method: Composition-based stats. Identities = 62/280 (22%), Positives = 106/280 (37%), Gaps = 15/280 (5%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALND 59 K PLIS+ MP +N ++ A+ SVL Q Y WE++I DD S+ E ++ A Sbjct: 567 KERPLISVIMPVYNSKELWLNEAVSSVLNQPYPEWELVISDDNSSEQETIRALECIARRH 626 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI I N N G + A+ A+GE +T +D DD PN + L Sbjct: 627 ERIRVIFNSENRGISGNTSVALAQARGELVTFLDHDDVLAPNAFYEVVRAWNDC-HFDIL 685 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLF-DTELKA 178 Y+++ Y +P + K +S N I + ++ + + Sbjct: 686 YSDEDKIGEHGYEEP-----FFKPDFSPDYLLSVNYINHLTVYRRSLLEDVGYLRSGFDG 740 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 AQDYD+ LR + E G + K + + R KD R Sbjct: 741 AQDYDLLLRATEHAKRIVHIPEVLYHWRKVPGSTAESFDSKSY-AHEAGRRAIKDALMRR 799 Query: 239 SKKYQLFT-----LYQIRNKRMTWRTLLTLLSVRNGKRLA 273 ++ + Y++ + + + ++ R+ + Sbjct: 800 GEEADVLDTGYPGHYRVNRRILGEPLVSIIVLTRDKADVL 839 Score = 44.3 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 3/39 (7%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIV 40 PL+SI + T ++ + I SV+ Q Y ++E+IIV Sbjct: 823 EPLVSIIVLTRDKADVLRTCISSVI-QKSTYRHYEIIIV 860 >UniRef50_D0IZR1 Glycosyl transferase, family 2 n=1 Tax=Comamonas testosteroni CNB-2 RepID=D0IZR1_COMTE Length = 773 Score = 258 bits (661), Expect = 1e-67, Method: Composition-based stats. Identities = 59/285 (20%), Positives = 109/285 (38%), Gaps = 21/285 (7%) Query: 1 MK---NNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVT 55 M + P+ISI +PT+N + L I SVL Q Y W++ I DD ST E Sbjct: 223 MNAWDSKPVISILLPTYNTPEVLLRECIDSVLGQSYPYWQLCIADDASTLPEVRAVIEEY 282 Query: 56 ALNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT 115 A D RI + N N A+ LA GE++ +D DD L + ++ Sbjct: 283 AFADSRICFTQRKHNGHISECSNSALELASGEWVALLDHDDCLPEYALFEVVKALKEHPQ 342 Query: 116 HAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDT 174 +Y++ + ++ Q + K +S L + +N + + + + F Sbjct: 343 AQIIYSD----EDKLDLQGRRCDPFFKPDWSPDLLFSQNYMTHLLVYRHQLLREVGGFRK 398 Query: 175 ELKAAQDYDIFLRMVVEY-----GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 + +QDYD+ LR + + + + + +K Sbjct: 399 GYEGSQDYDLLLRCIASLPEATQNNILHIPKVLYHWRMTEQSTAMGH-ERKDYATPAALC 457 Query: 230 KHKDKFDRASKKYQLFTL----YQIRNKRMTWRTLLT-LLSVRNG 269 +D DR ++ + Y+ R + + L++ ++ R+G Sbjct: 458 ALQDFMDRCHPGVRMEVVQPGIYRARWPQPSRLPLVSLIIPTRDG 502 Score = 83.2 bits (205), Expect = 9e-15, Method: Composition-based stats. Identities = 54/278 (19%), Positives = 91/278 (32%), Gaps = 33/278 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSV-LRQDYSNWEMIIVDDCSTSWEQLQQYVTALN----- 58 PL+S+ +PT + + I+S+ + Y N+E+++VD+ ST Y L Sbjct: 491 PLVSLIIPTRDGLEELRTCIESIWEKTTYPNYEILVVDNQST-CSYTLDYFKELEADTRY 549 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHA 117 D RI + D A+ N A+ A GE + I++D + P LS ++H + Sbjct: 550 DGRIRVLSYDHPFNYSAINNFAVQHANGEVLGLINNDVEVIGPEWLSEMVSHAIR-PNIG 608 Query: 118 FLYANDYVCQGE----------VYSQPASLPLYPKSP--YSRRLFYKRN---IIGNQVFT 162 + A Y G S Y +S Y RL N + G + Sbjct: 609 CVGAKLYYPDGTLQHAGVVLGIGGVAGHSHKYYSRSDGGYFGRLLTIHNVYAVTGAVLLV 668 Query: 163 WAWRF-KECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVE-EATQILHINHGEMQITSSP 218 F + D L+ A D D +++ + +H Sbjct: 669 RKSVFDEVGRLDDFGLRVAFNDVDFCIKVQKAGYQNIFTPFSELY----HHESKTRGRDD 724 Query: 219 KKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMT 256 F ++ R Q Y + Sbjct: 725 SPEK-QVRFAKERDLMIQRWGNLLQADPFYNPNLTLVN 761 >UniRef50_Q093P1 Glycosyl transferase, group 2 family protein (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093P1_STIAU Length = 1091 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 55/281 (19%), Positives = 95/281 (33%), Gaps = 13/281 (4%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALND 59 P ISI P +N ++ I+SV Q Y NWE+ +VD S A+ D Sbjct: 272 SYRPRISILTPVYNTPPEVLRETIRSVQAQTYPNWELCLVDGNSPMPHVREILQNFAVQD 331 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ D+N G N+A+ +A GE+I +D DD P L + H + Sbjct: 332 ERVCVRRLDLNLGISGNTNEALAMASGEFIALLDHDDTLAPFALYEIVKHLNAAPRTDMI 391 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKA 178 Y++ + V + K +S L G+ +E F Sbjct: 392 YSD----EDRVDMDGTRHTFFFKPDWSPDLLQSFMYTGHLSVYRRALVEELGGFRPAYDL 447 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QDYD+ LR+ + + G + P +D R Sbjct: 448 SQDYDLALRVTERTQSIAHIPKVLYHWRTMAGSAAVGDKPH---ARITNIGALEDAMKRR 504 Query: 239 SKKYQLF---TLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 ++ +++ K ++ N + + + Sbjct: 505 GYDAEVIMDPMANRVKFKAPRGAFASIIIPSDNLQHIHACV 545 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 42/284 (14%), Positives = 89/284 (31%), Gaps = 46/284 (16%) Query: 1 MKNNPLI--------SIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQ 51 M N SI +P+ N Q + VL+ Y ++E+++V T + Sbjct: 515 MANRVKFKAPRGAFASIIIPSDN-LQHIHACVGDVLKTTAYPHYEVLVV----THSRLIP 569 Query: 52 QYVTAL-NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDE--WTPNRLSVFLA 108 + ++PR+ + D N+ + A+G+++ + +DD P L L Sbjct: 570 DVQSRYAHEPRVRTVLYDEPFNFSLKCNRGVEQAKGDFVLFL-NDDARPLDPEWLECMLG 628 Query: 109 HKQQLV----THAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ----- 159 + QQ + +Y+N+ + + S +L R N+ + Sbjct: 629 YFQQEGVGAVSPKLVYSNNTLQHAGMISGVRNLVGTAFHTLPRNTPLYFNMAQSTRTTSL 688 Query: 160 ------VFTWAWRFKECLFD-TELKAAQ-DYDIFLRMVVEYGEPWKVEEATQILHINHGE 211 V FD D D+ ++ T H+ H Sbjct: 689 LSAACIVMRKKIFEAVGGFDAVNTPIMHSDVDLCFKIREAGLRLVYTPFTTLE-HVGHVS 747 Query: 212 M----QITSSPKKFSGYFHFYRK------HKDKFDRASKKYQLF 245 + + + + ++ + F +++ F Sbjct: 748 LGEEERKGNRHHSQKSDIYVLKRWSEFCAYDPYFPDNMREFLFF 791 >UniRef50_A1U141 Glycosyl transferase, family 2 n=1 Tax=Marinobacter aquaeolei VT8 RepID=A1U141_MARAV Length = 299 Score = 257 bits (658), Expect = 2e-67, Method: Composition-based stats. Identities = 64/280 (22%), Positives = 112/280 (40%), Gaps = 9/280 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + PL+SI PT+NR A++SVL Q +E+I++DD ST +D Sbjct: 1 MYSKPLVSIITPTFNRADYLPIAVESVLAQTMPEFELIVIDDGSTDNTSEVMERYT-SDS 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ Y + N G RN+ I A+G YI +D D+ W +LS L ++ +Y Sbjct: 60 RVKYFKQE-NQGQSIARNRGIEAAKGGYICFLDSDNAWLETKLSKSLQAFKENPEADIVY 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAA 179 + + + Q + + L + I N T F+E FD + A Sbjct: 119 GDFILIDEDGKEQGVNRMKRYSGRITSHLIHDNFISMNTTMTRRRCFEEMGGFDNADRLA 178 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 +DY ++LR Y + + E + ++ + + R + F A Sbjct: 179 EDYGLWLRFSTRYQFLY-LPEILGYYRVMENQIS-SDKDSRLMANEQIIRTFLESFPEAL 236 Query: 240 KKYQLFT---LYQIRNKRMTWRTLLTLLSVRN-GKRLADG 275 + + + +R R +++ + GK LA+ Sbjct: 237 SRGEARHGMSRFFVRKGRYELSQGRYRIALADLGKSLAND 276 >UniRef50_Q314K1 Glycosyltransferases involved in cell wall biogenesis-like n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314K1_DESDG Length = 312 Score = 257 bits (658), Expect = 3e-67, Method: Composition-based stats. Identities = 65/284 (22%), Positives = 113/284 (39%), Gaps = 27/284 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M P +S+ +PT+N L + ++SV Q Y ++E+I+VDD ST Sbjct: 1 MTTTPRVSVVIPTYNYGHLLEQCLESVFAQTYKDYEVILVDDGSTDDTPGIAARYE---G 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y N G + RN+ I A+G I +D DD W P++L+V A + +Y Sbjct: 58 RIRYFR-KENGGPSSARNEGIRQARGPLIAFLDSDDLWLPHKLAVQTAFADAHPEYGLVY 116 Query: 121 ANDYVCQGEVYSQPASLPLY----PKSPYSR-RLFYKRNIIGNQVFTWAWRF-KECLFDT 174 + + + L S + L + + V F + D Sbjct: 117 CDMQHVEHGRVIHHSYLHERGYTCTGSGWKYVDLLREGFVFTPTVLVRKECFSAAGMHDE 176 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH-FYRKHKD 233 L+ QD D++LR+ Y ++E + HG+ T++ + + + F R+ + Sbjct: 177 GLRTCQDLDMWLRIAKLY-PFGFIDEPLVV-RQYHGDNSTTNAERYHANHIRVFLRELES 234 Query: 234 KFDRASKK--------------YQLFTLYQIRNKRMTWRTLLTL 263 D +K+ Y F+ R R +R++L Sbjct: 235 TADETAKQVLRGRLSEQYFALGYHYFSTGSRRQARRCFRSVLNY 278 >UniRef50_B8D172 Glycosyl transferase family 2 n=4 Tax=Bacteria RepID=B8D172_HALOH Length = 250 Score = 257 bits (657), Expect = 3e-67, Method: Composition-based stats. Identities = 57/250 (22%), Positives = 98/250 (39%), Gaps = 6/250 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ P +N +Q + IKSVL Q Y+NWEMI+VDDCST + RI I Sbjct: 5 LVSVITPLYNSEQFIEKTIKSVLNQTYANWEMIVVDDCSTDSGPDIVKEYLKKNSRIKLI 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 NSGA RN+ I +++G YI +D DD W ++L + + Y Sbjct: 65 KLKKNSGAAVARNKGIKISKGRYIAFLDSDDLWHKDKLKKQVEFM-KNNNVVLSYTAYRK 123 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + P + K N IG + ++ ++ + QDY ++ Sbjct: 124 IDEFGNLRGIIRPPKK---INYNQLLKTNSIGCLTAMYDTKYVGKVYMPVIDRRQDYALW 180 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L+++ + + V E N + ++ + YR+ ++ Y Sbjct: 181 LKILKKGITAYGVNEVLAYYRTNKNSLS-SNKVVSAKYQWKIYRQIEN-LSIVKSLYYFI 238 Query: 246 TLYQIRNKRM 255 + + Sbjct: 239 NYFINGLSKY 248 >UniRef50_C5RQE2 Glycosyl transferase family 2 n=1 Tax=Clostridium cellulovorans 743B RepID=C5RQE2_CLOCL Length = 802 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 65/282 (23%), Positives = 107/282 (37%), Gaps = 13/282 (4%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P ISI +PT+N + + I+SV Q Y+NWE+ I D S + E + DP+I Sbjct: 268 EPKISILVPTYNTPENFLVEMIESVNEQTYTNWELCIA-DASNNEETRKVLARYEGDPKI 326 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + D N G N+A L GEYI D DD PN L + + F+Y + Sbjct: 327 PIKYLDENKGISGNTNEAATLVTGEYIALFDHDDLLMPNALFEIVKVINKDREVDFIYTD 386 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQD 181 + + + K ++ F N I + F F E AQD Sbjct: 387 EDKTDE---TSTRRFDPHFKPDFAIDTFRSNNYICHFTTMKRELFDSVGGFRKEYDGAQD 443 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 +D+FLR + + + + ++ K + + +D DR K Sbjct: 444 FDLFLRTTEKAKKIVHIPKVVYHWRVHQNSTAGAGEAKLY-AFEAGKNAIQDSLDRNGIK 502 Query: 242 YQLFT-----LYQIRNKRMTWRTLLTLLSVRNGK-RLADGIR 277 + +Y IR + + + L+ ++ L I+ Sbjct: 503 GTVEMGKYLGIYNIRYELESTPLVSILIPTKDHIEDLDRCIK 544 Score = 89.7 bits (222), Expect = 9e-17, Method: Composition-based stats. Identities = 55/272 (20%), Positives = 94/272 (34%), Gaps = 38/272 (13%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 +++ PL+SI +PT + + R IKS++ + Y N+E+I++++ ST E + Y T N Sbjct: 520 LESTPLVSILIPTKDHIEDLDRCIKSIIEKSTYKNYEIIVIENNSTESETFKYYDTLKNY 579 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQL 113 I + A+ N + A GE +++D D + L + Q+ Sbjct: 580 SNIKVVEWKDEFNYSAINNFGVKNASGEVFLLLNNDVEVINGDW-----IERMLQYAQRE 634 Query: 114 VTHAFLYANDYVCQGEVY------------SQPASLPLYPKSPYSRRLFYKRNI----IG 157 + A Y + + Y RL + +N Sbjct: 635 -DVGAVGAKLYYPNDTIQHGGVIVGLGGIANHAHKHFHREAPGYFARLNFVQNFSAVTAA 693 Query: 158 NQVFTWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT- 215 + K D +LK A D D LR+ A +H + Sbjct: 694 CLMVRRDVYEKLNGLDEDLKVAFNDVDFCLRIRALDKLIVWTPFAELY---HHESISRGV 750 Query: 216 ----SSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++F G F K KF +A Y Sbjct: 751 EDTPEKIERFKGEIRFMEKRWGKFIKAGDPYY 782 >UniRef50_Q314K2 Glycosyltransferases involved in cell wall biogenesis-like n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314K2_DESDG Length = 321 Score = 256 bits (655), Expect = 6e-67, Method: Composition-based stats. Identities = 58/258 (22%), Positives = 114/258 (44%), Gaps = 8/258 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + P +S+ MP + + A++SVL QD++++E+++VDD S ++ + +D Sbjct: 17 MSDRPAVSVIMPVYKTEAYLAEAVRSVLEQDFTDFELLVVDDGSPGD--VRGILAGFDDA 74 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y+H++ N G RN+ I + G Y+ +D DD W P++LS+ +A + +Y Sbjct: 75 RIRYLHHE-NRGPGYTRNRGIRESTGRYVAFLDSDDAWMPHKLSIQVAALDAAPQYDVVY 133 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD-TELKAA 179 + G + P+ L+ I + F + +L+ + Sbjct: 134 SQRETMDGHGRTVAGYTPVLHSGDVLDELYVDSFICMSSAMVRRGVFAVSGYIAEDLRIS 193 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK---KFSGYFHFYRKHKDKFD 236 QDYD +LR+ + V+E ++ ++ + + Y F R+H + Sbjct: 194 QDYDFWLRVACHH-RFLAVDEPLVRYRVHGTQVSRQVEERVRVVWQIYERFNREHGHRVS 252 Query: 237 RASKKYQLFTLYQIRNKR 254 +++ + Y R R Sbjct: 253 PRARRRSRASFYNGRADR 270 >UniRef50_B6ARX8 Putative glycosyl transferase, family 2 n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6ARX8_9BACT Length = 1023 Score = 256 bits (655), Expect = 6e-67, Method: Composition-based stats. Identities = 56/283 (19%), Positives = 104/283 (36%), Gaps = 15/283 (5%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL-QQYVTALNDPR 61 NPLIS+ +P +N + ++AI+SV Q Y +WE+ I D+ ST + D R Sbjct: 466 NPLISVILPVYNTPKEWLVKAIESVRNQIYPHWELCISDNASTLPHIKMVLDEYSRIDSR 525 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I ++ + N A N A+ LA GE+I +D DDE + L + +Y+ Sbjct: 526 IHVVYRETNGHISANSNTALTLASGEFIALLDSDDELPEHALFWVAHEINRHPNVDLIYS 585 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAAQ 180 + + ++ + Y K ++ L +N + V+ ++ K F + +Q Sbjct: 586 D----EDKISEEGIRFNPYFKPDWNPALMLSQNTFSHLGVYRYSLVKKVGGFREGFEGSQ 641 Query: 181 DYDIFLRMVVEYG--EPWKVEEATQILHINHGEM-QITSSPKKFSGYFHFYRKHKDKFDR 237 D+D+ LR E + G + +K + R ++ DR Sbjct: 642 DHDLVLRCSEETTPERIRHIPRILYHWRAISGSTASASGVEEKPYAWTAGKRSIEEHLDR 701 Query: 238 ASKKYQLF-----TLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 + + + ++ L Sbjct: 702 LGTPGVVTFAPGGHYRVSYLSDGYFPKVSIIMPSACKLHLLKP 744 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 79/252 (31%), Gaps = 44/252 (17%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALN----- 58 P +SI MP+ + L I S+L + Y ++E+++V E Sbjct: 727 PKVSIIMPSACKLHLLKPCIDSLLSRTSYPDFEVLLV-----VNEIRYAVPEQAEFLNGI 781 Query: 59 --DPRITYIH---NDINSGACAVRNQAIMLAQGEYITGIDDDDE--WTPNRLSVFLAHKQ 111 DPR+ + N + N AI ++G ++ + +DD T + L + + Sbjct: 782 GADPRVKVLVYGDQPFNY--SKLNNWAIAQSKGSFLCLL-NDDIEVITDDWLEKLV-VRT 837 Query: 112 QLVTHAFL-----YANDYVCQGEVY------------SQPASLPLYPKSPYSRRLFYKRN 154 ++ + Y N+ + V P P Y + + + Sbjct: 838 RITGVGVVGPLLLYPNNRIQHAGVILGLGGVSGHQFVEMPKGYPGYFRRAENEQDL-SCV 896 Query: 155 IIGNQVFTWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 V ++ F+ +L A D D+ +R+ E +H Sbjct: 897 TAACMVLRREVFYEVGGFNEDLAVAFNDVDLCIRIRKAGWRILWTPEVELY---HHESAS 953 Query: 214 ITSSPKKFSGYF 225 I Sbjct: 954 IGKHDSPERAIL 965 >UniRef50_C6LFR6 Glycosyl transferase family protein n=3 Tax=Clostridiales RepID=C6LFR6_9FIRM Length = 608 Score = 256 bits (655), Expect = 7e-67, Method: Composition-based stats. Identities = 53/276 (19%), Positives = 98/276 (35%), Gaps = 14/276 (5%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWE--QLQQYVTALN 58 K++ SI +P + + I SV +Q Y NWE+ IV+ S E + A Sbjct: 65 KHSCKFSIAVPAYETPETFLREMIASVQKQTYGNWELCIVN-ASPQNERMKQVLEEYAAA 123 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + N G N+A+ +A G+++ +D DD PN L + + Sbjct: 124 DERIRVKNLAENKGIAGNTNEALAMASGDFVCLLDHDDLLAPNALFEAARSLEADSSIDV 183 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELK 177 LY ++ + + + K ++ L N I + + FD + Sbjct: 184 LYTDEDKVETDGRTH---FKPNLKPDFNLDLLRSNNYICHFFMVRRALAERAGGFDGAYE 240 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 AQDYD LR + E + K + Y R + +R Sbjct: 241 GAQDYDFILRCTDMAKNIHHIPEILYHWRTHAASTADNPISKMY-AYEAGKRAIEAHLER 299 Query: 238 ASKKYQLFTLYQIRNKRMTW-----RTLLTLLSVRN 268 + ++ + R+ + + ++ ++ Sbjct: 300 RGQAAEVSLKKDLGFYRVKYPVQGKPLVSIIIPNKD 335 Score = 86.7 bits (214), Expect = 8e-16, Method: Composition-based stats. Identities = 52/291 (17%), Positives = 92/291 (31%), Gaps = 42/291 (14%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD--YSNWEMIIVDDCSTSWEQLQQYVTALND 59 + PL+SI +P + + I+S+ + Y N+E+IIV++ S+S E Y D Sbjct: 322 QGKPLVSIIIPNKDETEALRLCIESIKK-TVVYENYEIIIVENNSSSREIFAYYKELSED 380 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLS----VFLAHKQQLVT 115 RI + A+ N + A+GE++ +++D L L Q Sbjct: 381 ARIRIVRWKDAFNYSAINNYGVKYAKGEFLLFLNND----IEALETGWFEELLGNCQRPE 436 Query: 116 HAFLYANDYVCQGEVYSQPASL------------PLYPKSPYSRR----LFYKRNIIGNQ 159 A G + + +S Y + + Y Sbjct: 437 VGITGAKLLYPDGTIQHAGTVIGIGGIAGHMFVGMPAERSGYLHKASLQMDYSAVTAACM 496 Query: 160 VFTWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITS-- 216 + + + F+ L A D D+ LR ++ + Sbjct: 497 MMKRSLFERLGGFEERLSVAFNDVDLCLRANEAGFLVVYDP---YACLRHYESKSRGAED 553 Query: 217 ---SPKKFSGYFHFYRKHKDKFDRASKKYQLFTL------YQIRNKRMTWR 258 ++F F R +K +A Y L Y +R R R Sbjct: 554 SPEKVRRFQEEIEFMRTRWEKLLKAGDPYYNKNLSLSKWNYSLRADRKKGR 604 >UniRef50_B8HRA4 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HRA4_CYAP4 Length = 398 Score = 256 bits (654), Expect = 7e-67, Method: Composition-based stats. Identities = 56/257 (21%), Positives = 97/257 (37%), Gaps = 12/257 (4%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ +P +N + + + S+L Q YSN+E++++DD ST + A DPR Sbjct: 11 EELPLVSVILPVYNGEPFIEKTLNSILTQTYSNFEVLVIDDGSTDQTETVVTSFAQRDPR 70 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK-QQLVTHAFLY 120 I + N+G A RN I A+GE+I ID DD W P+ L + + +Y Sbjct: 71 IKLLKQ-ANAGVAAARNFGIQSAKGEFIAPIDADDLWFPDFLEKQVRCMLNSDSSVGLVY 129 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-NQVFTWAWRFK-ECLFDTELK- 177 A + L I + V + +D+ LK Sbjct: 130 AWLIKIDQNGSPTGDFCASRIEGDVYPTLLCHDFIANASCVLIRRSCLQCVGGYDSSLKA 189 Query: 178 ----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT--SSPKKFSGYFHFYRKH 231 +D+D++LR+ + + V E M S + + R+ Sbjct: 190 RNGQGCEDWDLYLRIAELH-QIRVVPEFLVGYRQRSDSMSCNYQSMAQSRELIWETIRQK 248 Query: 232 KDKFDRASKKYQLFTLY 248 + + + Y Sbjct: 249 YPNIPAIIYRLSMSSFY 265 >UniRef50_C8WIW4 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WIW4_EGGLE Length = 832 Score = 256 bits (654), Expect = 8e-67, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 107/277 (38%), Gaps = 11/277 (3%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P SI +P + + ++SV RQ Y WE+I+V+ S +L + + ++D R+ Sbjct: 304 PTFSIVVPLYRTPVEYFRSMLQSVQRQSYGGWELILVN-ASPDDGRLVEELENVSDARVR 362 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 ++ + N G N I +AQG+++ +D DD P+ L + +Y ++ Sbjct: 363 VVNLEENHGIAENTNAGIRVAQGDFVAFLDHDDVLAPDALFGYARAVCDDPLVDIVYCDE 422 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQDY 182 S + K +S L +N I + + + L D AQDY Sbjct: 423 DRID----SVGVHHAPFFKPDFSPELLNAQNYITHFLAVRKSLIEEIGLLDATFDGAQDY 478 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY--FHFYRKHKDK--FDRA 238 D+ LR + ++ + S K ++G H + + Sbjct: 479 DLVLRATERSRSVAHIPRVLYHWRMHEASTSMNSDSKSYAGEAGRAALEAHCRRCGWSAK 538 Query: 239 SKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 ++ L Y++R++ + + L+ ++ L Sbjct: 539 VERTDLPFAYRVRHELVERPKVSILIPSKDKTSLLSA 575 Score = 96.7 bits (240), Expect = 7e-19, Method: Composition-based stats. Identities = 42/260 (16%), Positives = 91/260 (35%), Gaps = 20/260 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +SI +P+ ++ L ++S++ + Y N+E++++++ S E Y + Sbjct: 556 ERPKVSILIPSKDKTSLLSACVESIVEKTSYDNYEIVVIENNSVEPETFAYYEEVQRLGK 615 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLV----TH 116 + + N + G+Y+ +++D + TPN L L + Q Sbjct: 616 ARVVEWPDTFNFSKIMNFGVRQCDGDYVLLLNNDTEVITPNYLETMLGYFQAEGVGVVGA 675 Query: 117 AFLYANDYVCQGEVYSQPASLPLY-----PKSP--YSRRLFYKRNII----GNQVFTWAW 165 L+ +D V G V P + PK Y R +N+ Q+ + Sbjct: 676 KLLFPDDTVQHGGVVLGPYRSAGHLFASLPKDDLGYFCRAVLPQNLSAVTGACQLVPRSV 735 Query: 166 RFKECLFDTELK-AAQDYDIFLRMVVEYGEPWKVEEATQILHI--NHGEMQITSSPKKFS 222 + + + D D L++ +A + + G + + ++ Sbjct: 736 FEEVGGYTEAFEVGLNDVDFCLKVREAGYRVVWTPDALLYHYEFSSRGRDREGAQAERAE 795 Query: 223 GYFHFYRKHKDKFDRASKKY 242 R ++ A Y Sbjct: 796 REIALLRTRWPRYFEAGDPY 815 >UniRef50_A6M2M4 Glycosyl transferase, family 2 n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M2M4_CLOB8 Length = 1362 Score = 255 bits (653), Expect = 9e-67, Method: Composition-based stats. Identities = 61/285 (21%), Positives = 110/285 (38%), Gaps = 12/285 (4%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + NPLISI +P +N + + I SV+ Q Y NW++ +VDD ST ++ + Sbjct: 696 LSFNPLISIVVPVYNVLDEQLVECIGSVINQTYDNWQLCLVDDASTWNSVVKVLNKYEKN 755 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 P+I ++ N N I +A GE+I +D DD PN L + + F+ Sbjct: 756 PKINIVYRKENGHISRATNDGISIANGEFIAFLDCDDVLAPNALYEVAKKLNEDSEYDFI 815 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC-LFDTELKA 178 Y+++ + + K +S F + E TE Sbjct: 816 YSDEDKLTADG---KHRHSPFFKPDWSPDTFMSLMYTCHFSVYRKSIIDEIDGLRTEFNG 872 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 AQDYD LR + + +++ + K + R ++ +R Sbjct: 873 AQDYDFTLRFTEKAKKIGHIDKILYHWREREESIASNPEAKPY-ALQAVKRCKEEALERR 931 Query: 239 SKKYQLFTLYQIRNKRMTW----RTLLTLL--SVRNGKRLADGIR 277 K + + + R+ + + L++++ S N K L + I Sbjct: 932 GLKGTVEYVNDMFQYRVKYTNDEKPLVSIIIPSKDNFKILNNCIE 976 Score = 92.8 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 51/297 (17%), Positives = 106/297 (35%), Gaps = 35/297 (11%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 PL+SI +P+ + ++ I+SV + Y N+E+++VD+ S + + + Sbjct: 954 EKPLVSIIIPSKDNFKILNNCIESVKKYTSYYNYEIVVVDNGSNNENKNKYEDICKKYN- 1012 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFL-AHKQQLVTHA-- 117 Y + +N + N +A+GE+ ++DD + + L + L + V Sbjct: 1013 CRYHYEKMNFNFSKMCNIGSRIAKGEFYLFLNDDIEIFQEEWLEIMLGQASLKHVGVVGA 1072 Query: 118 -FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFY-------KRNII----GNQVFTWAW 165 LY N + Q + P + + + R+ Y + N I + Sbjct: 1073 KLLYPNSNIIQHIGITNLKIGPSHSQIGFDDRIVYYYGRNRMEYNFIAVTGACLMIERHK 1132 Query: 166 RFKECLFDTELKAA-QDYDIFLRMVV-EYGEPWKVEEATQILHINHGEMQITSSPKKFSG 223 + FD L A D D+ ++V +Y + + + +H + Sbjct: 1133 FEEISGFDENLSVAYNDVDLCFKLVEHKYYNVVRNDVSLY----HHESISRGIDDINKEK 1188 Query: 224 YFHFYRKHKDKFDRASKKYQL-----------FTLYQIRNKRMTWRTLLTLLSVRNG 269 ++ K FD+ + Y I+ K + L + ++N Sbjct: 1189 MTRLLKERKILFDKHKQFKGWDPFYNTNLTECKVDYDIKYKPEFNKMLEINIEIKNK 1245 >UniRef50_Q67PP6 Glycosyl transferase n=1 Tax=Symbiobacterium thermophilum RepID=Q67PP6_SYMTH Length = 242 Score = 255 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 54/225 (24%), Positives = 93/225 (41%), Gaps = 8/225 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M +NP +++ +P +N + A++S L Q YSN E+++VDD ST Sbjct: 1 MPDNPKVTVVIPAYNAARYLAEAVESALNQTYSNLEVLVVDDGSTDRTPFLLERYGRA-- 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 I I N G + N+ I A+GE+I + DD + P +L+ +A+ + A +Y Sbjct: 59 -IRTIR-KRNGGTPSALNEGIRQARGEWIAWLSADDAFLPEKLAKQMAYAEAHPECALIY 116 Query: 121 ANDYVCQGEVYSQPASLPLYPKS--PYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELK 177 N +V G + +S +L I G+ + LFD L Sbjct: 117 TNWFVVDGRGKTVSHLASPTFRSRADQVEKLLRGCVINGSTTLVRRDVYLRAGLFDESLP 176 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 A D+D++LR+ +Y V E + + ++ Sbjct: 177 QAHDWDMWLRLARDY-RFGHVAEPLLRYRWHGENLSARPDALAYN 220 >UniRef50_A6C910 Truncated O-antigen biosynthesis protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C910_9PLAN Length = 903 Score = 255 bits (652), Expect = 1e-66, Method: Composition-based stats. Identities = 67/291 (23%), Positives = 113/291 (38%), Gaps = 19/291 (6%) Query: 3 NNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDP 60 +PLISI +PT+N +++ I+SVL Q YSNWE+ I DD ST + D Sbjct: 373 KSPLISIILPTYNTKEKILRACIESVLAQTYSNWELCIADDASTKSRVRDVINEYSKQDS 432 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N A L +G+YI+ +D DDE N L + + F Y Sbjct: 433 RIKSVFRTENGHISEAMISAAELMEGDYISFLDHDDELNKNALLFIVDAINRSPESEFFY 492 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 ++ + + + K +S L +N IG+ + ++ A Sbjct: 493 SD----EDHINEHGKHQSPFFKPDWSPSLLCSQNYIGHFLCLSKSLYERVGGIRRGFDGA 548 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF------SGYFHFYR-KHK 232 QDYD+ LR + + + + S K + + F K+ Sbjct: 549 QDYDLVLRAGDAAENVYHIPKVLYHWREHENSTSSNSECKPYAHDAGKAAVADFLNQKYG 608 Query: 233 DKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG----IRGR 279 +F + + LFT Y + + + + ++ ++ L D IR R Sbjct: 609 SRFIKVNDGEGLFT-YSPQFRFDSEHRVSIIIPTKDKIDLLDDCIESIRNR 658 Score = 90.5 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 80/231 (34%), Gaps = 26/231 (11%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +SI +PT ++ L I+S+ + NWE+IIVD+ S + + T + D RI + Sbjct: 635 VSIIIPTKDKIDLLDDCIESIRNRSSHINWEIIIVDNRSEETASKEYFSTVVQDSRIKVV 694 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQLVTHAFLYANDY 124 D+ + N A G+ +++D TP+ + LA L + Sbjct: 695 EADVEFNWSMINNIGAKAATGDVFVFLNNDTLVITPDWIEK-LASMASLPEVGLVGPQLL 753 Query: 125 VCQGEVYSQP--------------ASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRF 167 + LP++ P+ + + + QV A Sbjct: 754 YEDNTIQHAGVVVGMGGWADHVFKNQLPVHRSGPFVSPMLNRNVLAITGACQVIERAKFE 813 Query: 168 KECLFDTELKAA-QDYDIFLRMVVEY-GEPWKVEEATQILHINHGEMQITS 216 + FD + D DI +R + + + A + +S Sbjct: 814 QLGGFDEQFIICGSDVDICIRAHQQGLQNVYCADAAL----HHLESKSRSS 860 >UniRef50_A5ZX57 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZX57_9FIRM Length = 808 Score = 255 bits (652), Expect = 1e-66, Method: Composition-based stats. Identities = 54/278 (19%), Positives = 94/278 (33%), Gaps = 13/278 (4%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL-ND 59 + NPLISI +P + I SV Q Y+NW++ + D S + + D Sbjct: 274 EQNPLISIVIPLYCTPTPYLKELIDSVRAQSYTNWQLCLA-DGSPDQKVEEYIQKRYGKD 332 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI Y H + N G N+AI +A GEY+ D DD P+ L + + Sbjct: 333 SRILYKHLEENGGISINTNKAIEMATGEYLMLSDHDDTLEPDALYEIVKAINDHQGPEIV 392 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKA 178 Y ++ + + KS Y+ N I + E Sbjct: 393 YTDEDKLSMDGE---FYFEPHFKSDYNLFRLRDNNYICHIFAVKKALVDQVGGLRQEYDG 449 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QDYD LR + + + + + K + Y R ++ + R Sbjct: 450 SQDYDFILRCCEQAKQVIHIPRVLYHWRCHMNSVAANPESKTY-AYEAGCRAIQEHYRRV 508 Query: 239 SKKYQLFTLYQIRNKRMTWRT-----LLTLLSVRNGKR 271 + ++ R + + L+ ++ Sbjct: 509 GIEAEVEMTKHPGWYRSHVKIQGEPLVSILIPNKDHID 546 Score = 75.5 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 86/268 (32%), Gaps = 26/268 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQQYV-TALN 58 ++ PL+SI +P + + + S+ + + N+E+++V++ S E + Y + Sbjct: 529 IQGEPLVSILIPNKDHIDDLEKCLSSIYEKSTWKNYEILVVENNSEKPETFEYYKNLSWR 588 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHA 117 P+ + A+ N A A+G Y+ +++D + TP + L Q A Sbjct: 589 YPKARVLTWKEGFNYAAINNFAAKDAKGSYLLFLNNDVEVITPGWIEEML-MICQQPDVA 647 Query: 118 FLYANDYVCQGEVYSQPASL-----------------PLYPKSPYSRRLFYKRNIIGNQV 160 + A Y + L Y + + + Sbjct: 648 IVGAKLYYPDNLIQHAGVVLGMGGIAGHIMCQASCEDKGYFGRAVNVQEI-SAVTAACML 706 Query: 161 FTWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSSP 218 FD E A D D+ ++ + A + +P Sbjct: 707 MKVEDFEAVGGFDEEFVVAFNDIDLCMKERAAGKKVVFTPYAELYHY--ESKSRGMEDTP 764 Query: 219 KKFSGYFHFYRKHKDKFDRASKKYQLFT 246 +K + + ++K+ K + Sbjct: 765 EKQFRFEKETKHFEEKWGEQMSKGDPYY 792 >UniRef50_C2P8I0 Family 2 glycosyltransferase n=10 Tax=Bacteria RepID=C2P8I0_BACCE Length = 262 Score = 254 bits (649), Expect = 3e-66, Method: Composition-based stats. Identities = 57/256 (22%), Positives = 94/256 (36%), Gaps = 6/256 (2%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+SI P++N I+SV Q Y WEMII+DD S D R Sbjct: 11 EFQPLVSIVTPSYNASSFIKETIQSVQSQTYKKWEMIIIDDVSKDNTCELIKEEIKKDDR 70 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I + N GA RN I A+G+YI +D DD W P +L +A Q AF + Sbjct: 71 IRLIELEENGGAAIARNTGINCAEGKYIAFLDSDDLWLPEKLEKQVAFMQNN-DLAFSFT 129 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + + + ++ + K IIG + ++ QD Sbjct: 130 SYQIMDQDGKL--TEKVVHVPEKINYNGLLKNTIIGCLTVMLDIEKLGKVKMPNIRTRQD 187 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 +L+++ + + ++E + + K + YR+ + K Sbjct: 188 TATWLKILKQGHYAYGLDEVLSKYRKVENSIS-SKKFKMAKMNWKLYREIEGL--SIPKS 244 Query: 242 YQLFTLYQIRNKRMTW 257 F Y + + Sbjct: 245 AWCFMNYALNGAVKHF 260 >UniRef50_Q8U2R3 Glycosyl transferase n=2 Tax=Pyrococcus RepID=Q8U2R3_PYRFU Length = 298 Score = 254 bits (649), Expect = 3e-66, Method: Composition-based stats. Identities = 68/233 (29%), Positives = 110/233 (47%), Gaps = 8/233 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +S+ +PT NR +L RAI SVL Q + ++E+I+VDD S+ + ++ D RI Sbjct: 2 SRPTVSVIIPTHNRAKLLKRAIISVLNQTFEDFEIIVVDDASSDNTPNI--IESIKDSRI 59 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-LVTHAFLYA 121 YI + NSGA RN I A+G++I +DDDDEW P +L V + + +Y Sbjct: 60 RYIRLEKNSGAPTARNIGIKKARGKFIALLDDDDEWLPRKLEVQVKKFNELEKKFGVVYG 119 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQ 180 Y + + L + I V FK+ LFD +L ++Q Sbjct: 120 GFYYVSQQNERIIGKRLPRFRGNVYGELLKENFIGSPTVLIRRECFKKAGLFDPKLPSSQ 179 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 D+D+++R+ Y + V +++ ++ + K G F KHKD Sbjct: 180 DWDMWVRIARYYYFDYIV----AKYYVHGHQISFNINEKYIPGREIFISKHKD 228 >UniRef50_C6J2S1 Glycosyltransferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2S1_9BACL Length = 309 Score = 253 bits (648), Expect = 3e-66, Method: Composition-based stats. Identities = 65/257 (25%), Positives = 104/257 (40%), Gaps = 8/257 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 +NP ISI MP +NRQ+ +I +L Q Y+N+E+IIVDD ST + D R Sbjct: 3 NHNPKISIIMPVYNRQETIKNSIFCILNQSYTNFELIIVDDASTDNTSAIIL--SFEDER 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I +I + N GA A RN I ++G++I+ D DD W P++L + + Sbjct: 61 IKHIVLNKNVGAAAARNIGIRQSKGDWISFQDSDDYWEPDKLRKQVELINDRSPSIIFSS 120 Query: 122 --NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT--WAWRFKECLFDTELK 177 E + PK N I + K F+ ++ Sbjct: 121 FIRYKYGSEEYIPRGRKYVNLPKEGNLHNQLLLGNFIATPTVLIPKTFLEKVGGFNEQMP 180 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QD++++LR+ Y ++E +H + + K GY + HK F Sbjct: 181 RFQDWELWLRLSA-YYPFIWIDEPLVRVHYTENSIS-SDMSKMIRGYELIWNLHKKMFVD 238 Query: 238 ASKKYQLFTLYQIRNKR 254 A +Y L+ + Sbjct: 239 AGPEYAAQFLFSYGHNL 255 >UniRef50_C0X9N7 Group 2 family glycosyl transferase n=24 Tax=Enterococcus faecalis RepID=C0X9N7_ENTFA Length = 1047 Score = 253 bits (648), Expect = 4e-66, Method: Composition-based stats. Identities = 56/283 (19%), Positives = 107/283 (37%), Gaps = 12/283 (4%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-D 59 + P IS+ +P +N + + + S+ Q Y NWE+ + DD S S D Sbjct: 551 QFQPKISVAVPVYNVEEKWLAACVSSLQNQYYENWELCLADDASPSEHIKPMLEKYKELD 610 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI I+ + N N A+ +A G++I +D+DDE P L + T FL Sbjct: 611 QRIKVIYREENGHISEATNSALSIATGDFIGFMDNDDELAPQALYEVVKALNTDPTIDFL 670 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 Y + + ++ + KS ++ L N I + V K ++ Sbjct: 671 YTD----EDKITENGRRFNAFYKSDWNPELILNHNYITHFVVVKRDLLEKVGGLNSAYNG 726 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 AQDYD LR + + + + K + Y + + +R Sbjct: 727 AQDYDFVLRATEQATKIKHIPGMMYHWRAIESSTALNPESKGY-AYVAGQKAVQAATERR 785 Query: 239 SKKYQLFTLYQIRNKRMTWR----TLLTLLSVRNGKRLADGIR 277 K Q+ + ++ + +++L+ + + ++ +R Sbjct: 786 GLKAQVEIAEFYGSYKINYLYDHVPMVSLIITNDTENMSSYLR 828 Score = 43.1 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 39/94 (41%), Gaps = 20/94 (21%) Query: 3 NNPLISIYMP--TWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + P++S+ + T N + ++ + Y+N+E+++ + + + Sbjct: 808 HVPMVSLIITNDTENMSSYLRQLLE---KTAYTNYEILL--------PARFENQINIQND 856 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDD 94 R+ Y+ + G I A+GEY+ ++ Sbjct: 857 RLRYVSTETRHGM-------IQAAKGEYVALLNA 883 >UniRef50_A1W3G7 Glycosyl transferase, family 2 n=1 Tax=Acidovorax sp. JS42 RepID=A1W3G7_ACISJ Length = 983 Score = 253 bits (646), Expect = 7e-66, Method: Composition-based stats. Identities = 58/282 (20%), Positives = 106/282 (37%), Gaps = 15/282 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCST-SWEQLQQYVTALNDPR 61 PLISI MP + A+ SV Q Y +WE+ I DD S+ + + + + Sbjct: 448 RPLISIVMPVHDTPPHFLQAAVDSVQGQWYGHWELCICDDASSRADTRQRLQTLQDQSDK 507 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N A+ LAQGE++ +D DD P L F+Y+ Sbjct: 508 IKVTRRETAGHIVHATNDALALAQGEFVVFLDHDDVLAPQALLRLAQAINAPSAPDFIYS 567 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQ 180 + + ++ Q K +S L + +N +G+ + + + +Q Sbjct: 568 D----EDKLDEQGRRCLPLFKPQWSPTLQWAQNYVGHIMCVRRAWLERLGGLLEGSQGSQ 623 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY-------FHFYRKHKD 233 D+D+ LR+ + + + E I+ + K ++ H ++ + Sbjct: 624 DHDLVLRLAAQGAKIEHIPEVLYHWRIHAASTSASPQSKPYAHLAGRQAVARHLASRYGE 683 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 +FDR F +YQ R + ++ R+ L D Sbjct: 684 QFDRVDDGDHAF-VYQPRFRVPAGTVASIIIPTRDKADLLDA 724 Score = 186 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 51/258 (19%), Positives = 101/258 (39%), Gaps = 24/258 (9%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--DPRIT 63 L+++ +P +N + + A+ SV Q ++ WE+I++DD ST A DPRI Sbjct: 14 LVTVILPAYNHAKYVVEAMDSVRAQTFTRWELIVIDDGSTDDTWAVLQAYAAQHGDPRIR 73 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + N+G+ A N+ + +A+ Y+ ++ DD + P RL + Q F+ Sbjct: 74 LLTQ-ANAGSHATLNRGLAMARTPYLAILNSDDRYAPERLQRLVDTAQAAADEVFIVTGL 132 Query: 124 YVCQGEVYSQPASLPLY-------------PKSPYSRRL--FYKRNI---IGNQVFTWAW 165 + G+ P S S + L N N + Sbjct: 133 RLIDGDGNFFPESYWWNAMYGDILQRWRAAQSSGVNPALQTLLWGNFTVSTSNFFMSRTL 192 Query: 166 RFKECLFDTELKAAQDYDIFLRMVVEYGEPWKV--EEATQILHINHGEMQITSSPKKFSG 223 + F L+ D+D LR+ +E ++ ++A ++ + + + + Sbjct: 193 WQRLGPF-KRLRYVPDWDYALRVAMEQPRAFQFLHDQALLEYRLHGRNTILGGALRNHAE 251 Query: 224 YFHFYRKHKDKFDRASKK 241 +H R ++ K+ A + Sbjct: 252 AYHVLRTYQKKWAAAGQA 269 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 66/212 (31%), Gaps = 21/212 (9%) Query: 8 SIYMPTWNRQQLAIRAIKSV-LRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 SI +PT ++ L I+S+ +E++++D+ ST E + D R+ I Sbjct: 710 SIIIPTRDKADLLDACIQSIHRHTTGIAYEILVLDNGSTEPETQACFQRLTQDARVRVIA 769 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAH-----------KQQLV 114 DI + N A+G+ + +++D + TP+ L + + Sbjct: 770 ADIPFNWSRLNNIGRSHARGQVLVFLNNDTEIITPDWLVRLVEYALLPDVATVGALLLYP 829 Query: 115 T-----HAFLYANDYVCQGEVYSQPA-SLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK 168 + +P P S R N A Sbjct: 830 DRTIQHAGVVVGMGGWADHVFKGEPVQHYPTPFVSSVVPRNVLA-NTGACVAVATARFDA 888 Query: 169 ECLFDTELKAA-QDYDIFLRMVVEYGEPWKVE 199 FD + D ++ +R + + Sbjct: 889 LGGFDEAFEICGSDVELGIRAHKQGLLNVYLP 920 >UniRef50_A0B7S5 Glycosyl transferase, family 2 n=1 Tax=Methanosaeta thermophila PT RepID=A0B7S5_METTP Length = 302 Score = 252 bits (645), Expect = 9e-66, Method: Composition-based stats. Identities = 64/250 (25%), Positives = 108/250 (43%), Gaps = 6/250 (2%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +PT+NR + RAI+SVL Q Y ++E+I+VDD ST ++ V + +D RI YI Sbjct: 4 VSVILPTYNRAHVVGRAIRSVLEQTYEDFELIVVDDGSTDST--EEVVRSFDDRRIRYIR 61 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ-QLVTHAFLYANDYV 125 + N G A RN I +A+GEYI D DDEW P +L + + +Y Y+ Sbjct: 62 HKQNKGRSAARNTGIKIAKGEYIAFQDSDDEWLPEKLEEQMEVFKTSSPQVGVVYTGFYL 121 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ--VFTWAWRFKECLFDTELKAAQDYD 183 + S + K K N +G V K +FD L +D++ Sbjct: 122 FYDDKKIYIPSANVKTKDGNIYDELLKGNFVGTPAAVVRAECLKKVGMFDESLHCLEDWE 181 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 +F+R+ EY +++A + + + + + D+ Sbjct: 182 LFIRISKEY-IFKYIDKALVNAFRSPNSILLDPDACATAMMLIMEKHIYPLGDQRRLAEM 240 Query: 244 LFTLYQIRNK 253 +++ K Sbjct: 241 QYSIGNYLFK 250 >UniRef50_B0C0T9 Glycosyl transferase, group 2 family protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C0T9_ACAM1 Length = 337 Score = 251 bits (643), Expect = 1e-65, Method: Composition-based stats. Identities = 71/263 (26%), Positives = 115/263 (43%), Gaps = 11/263 (4%) Query: 1 MKN-NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 MKN PL+S+ +PT+NR L I AI+SVL Q ++E+I+VDD ST Q + V ++D Sbjct: 1 MKNSKPLVSVIIPTFNRANLLIPAIQSVLNQTIEDFELIVVDDASTDDTQQK--VAGIDD 58 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT-HAF 118 R+ YI + N G C RN + AQG+YI +D DDEW PN+L L + Sbjct: 59 SRLRYILQEKNLGECGTRNTGLGAAQGQYIAFLDSDDEWLPNKLEKQLGVFRSAPDEVGV 118 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRFKECL-FDTE 175 +Y+ V + P + +N+IG + V + L FDT Sbjct: 119 VYSWLQVINDQGKVVRMRKPNIHGDV--KDYLIYKNLIGTPSTVMIKQECIEPGLQFDTN 176 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF-SGYFHFYRKHKDK 234 L+ D+D++L++ Y + E + + + + +++ G+ F KH Sbjct: 177 LRCCGDWDMWLQISKNY-RFEVIPEPLTLYRDHDEDARGSTNHAMVTEGHLVFLDKHHQG 235 Query: 235 FDRASKKYQLFTLYQIRNKRMTW 257 ++ Sbjct: 236 IKEFYRQPSTLDNKHKSLYLFEI 258 >UniRef50_A3CRX6 Glycosyl transferase, family 2 n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CRX6_METMJ Length = 996 Score = 251 bits (642), Expect = 2e-65, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 102/280 (36%), Gaps = 13/280 (4%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M P SI +P +N L +A++S+ Q Y +WE+I+VDD S S E + +ND Sbjct: 465 MAYKPHFSIVVPVYNTPPDLLNKAVRSLQLQWYPHWELILVDDASPSKETKRCLSK-IND 523 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 P I + N G N AI + G+Y+ +D DDE T + L + ++ Sbjct: 524 PNIKVFTHPQNKGISGATNTAISHSTGDYVVLLDHDDELTEDCLFELALCINR-DDPDYI 582 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 Y+++ + Y + K +S + + + ++ Sbjct: 583 YSDEDKIDEKGYFT----EPHYKPDWSPDTMMSTMYVCHVSCIKKSLLEEVGGLRSDYDG 638 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF--SGYFHFYR---KHKD 233 QD+D+ LR+V + + + I G K + K + Sbjct: 639 CQDWDLILRIVEKTDRISHIPKVLYHWRIIPGSTSADIGAKSYVLDASRRVREDALKRRG 698 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 Q+ +++ + + ++ R+ + Sbjct: 699 LLGTVEPLEQVNGYFRVNYHLVDKPLISIIIPTRDHGDVL 738 Score = 111 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 51/270 (18%), Positives = 90/270 (33%), Gaps = 25/270 (9%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PLISI +PT + + R I+S+ + Y N+E+I++D+ S L+ +P Sbjct: 721 DKPLISIIIPTRDHGDVLRRCIESIFKKSTYKNFELIVLDNGSVDSATLEYLEEIKAEPN 780 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSV-----FLAHKQQLVT 115 I+ I + + N A+GE + ++DD + T + L LAH V Sbjct: 781 ISVIRHAEPFNYSELNNVGADFAKGEILLFLNDDTEVVTSDWLERMGGYAQLAHIGA-VG 839 Query: 116 HAFLYANDYVCQ-------GEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQVFTWA 164 LY Q + Y R + N + + Sbjct: 840 AKLLYPGGKKIQHAGVLNLQDGPGHAFLHQNADLPGYYLRNLIEYNWLAVTGACLMIERK 899 Query: 165 WRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILH-INHGEMQITSSPKKFS 222 FD A D D+ R+ + + +Q + I+H + Sbjct: 900 KFQSVNGFDVNYPIAYNDVDLCFRLRDAG--FYNL--VSQSVRLIHHESLTRRMDHADHD 955 Query: 223 GYFHFYRKHKDKFDRASKKYQLFTLYQIRN 252 + + + + YQ Y I Sbjct: 956 KTERLKQDMRRLYQKHPFYYQYDPFYSINL 985 >UniRef50_C0CNU3 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CNU3_9FIRM Length = 843 Score = 250 bits (640), Expect = 3e-65, Method: Composition-based stats. Identities = 56/282 (19%), Positives = 100/282 (35%), Gaps = 14/282 (4%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSW-EQLQQYVTALNDPR 61 P ISI +P +N ++ I SV+ Q Y NWE+ + D STS ++ R Sbjct: 296 EPKISIVIPLFNTPEKYLKELIDSVVAQSYGNWELCLA-DGSTSPKTGAYIKKHYNSEGR 354 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y + N G N AI + GE+I D DD PN L + + +Y Sbjct: 355 IVYRKIEENLGISGNTNFAISMGTGEFIMFCDHDDVVAPNALYEMVKVINEKPKTDIVYT 414 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 ++ + + K ++ N I + + + E AQ Sbjct: 415 DEDLINSDGTVH---SSPRFKPDFNFDFLRSINYICHIFLVRKSLIDRVGMLRKEFDGAQ 471 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD LR + V + + K++ R ++ + R Sbjct: 472 DYDFILRCCEQTEHIAHVPKVLYHWRAHDNSTAGNPESKQY-AVDAGKRALEEHYRRMGY 530 Query: 241 KY-----QLFTLYQ-IRNKRMTWRTLLTLLSVRNGKRLADGI 276 + +F +Y+ I + + + +L+ + + L + Sbjct: 531 EAVVENTGIFIVYRTIMKVQGNPKVSVIILNKDHREDLEKCV 572 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 87/266 (32%), Gaps = 24/266 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSW-EQLQQYVTALND 59 + NP +S+ + + ++ + + S+ + Y N+E+I+V++ S Sbjct: 550 QGNPKVSVIILNKDHREDLEKCVVSIEEKTDYPNYEIIVVENNSELPETFAFYEELQRRY 609 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAF 118 + + D A+ N A G+Y +++D + +P +S L + Q+ Sbjct: 610 SNVKVVTWDGPFNYSAINNYGAEYATGDYYLMLNNDIEVISPGWMSEMLGYCQRE-DVGI 668 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKR--NIIG--------------NQVFT 162 + A Y V + + + + F K G + Sbjct: 669 VGAKLYYSDDTVQHAGVVVGVGGFAGHVLTRFRKGETGYFGRLVTIQDTSAVTAACLMIK 728 Query: 163 WAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSSPKK 220 + FD E A D D+ L++ A +P+K Sbjct: 729 KSIYQLIGGFDEEFVVALNDIDLCLKVRALGQLVVFNPYAELYHY--ESKSRGFEDTPEK 786 Query: 221 FSGYFHFYRKHKDKFDRASKKYQLFT 246 + + ++ ++K+ K + Sbjct: 787 KARFKKEIKRFREKWGEILSKGDPYY 812 >UniRef50_Q8YSM1 Alr3063 protein n=7 Tax=Cyanobacteria RepID=Q8YSM1_ANASP Length = 330 Score = 250 bits (640), Expect = 3e-65, Method: Composition-based stats. Identities = 62/290 (21%), Positives = 114/290 (39%), Gaps = 22/290 (7%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK+ LIS+ +P +N I+SV +Q +++WE+I+++D ST + + ++ D Sbjct: 1 MKDMALISVIIPAYNAVLTIKETIESVQKQTFTDWEIIVINDGSTDGTP--EIIQSIKDE 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ I N N G RN+ I+ A GE+I +D DD W ++L + L QQ Y Sbjct: 59 RLK-IFNYKNGGLPVARNRGILHASGEFIAFLDADDLWAVDKLEMQLKALQQHPEAGVAY 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYS---RRLFYKRNIIG---NQVFTWAWRFKECLFDT 174 + +P + + ++ + + I N + FD Sbjct: 118 SWTCFMDVNEQGEPVAYLPSSQYSFTGNVYQNLLVSDFIHSGSNTLIRKEAINSVGEFDP 177 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 LK+ +D+D +LR+ + V E G M K + + ++ Sbjct: 178 MLKSCEDWDYWLRLATHWDFIV-VPEYQIFYRRTPGAMSSKVEVMKEACLIAMEKAYQ-- 234 Query: 235 FDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNG-------KRLADGIR 277 A + Q Y + + + +L + + + L IR Sbjct: 235 --AAPPELQYLKKYTMSSFHVYCSSL-YIQHRDDKFAVSQAQEHLLAAIR 281 >UniRef50_Q2YCB6 Glycosyl transferase, family 2 n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2YCB6_NITMU Length = 313 Score = 250 bits (639), Expect = 4e-65, Method: Composition-based stats. Identities = 56/277 (20%), Positives = 104/277 (37%), Gaps = 20/277 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +SI +PT+N + + SVL Q +++ E+I+VDD ST + + Sbjct: 4 SKPTVSIIIPTYNCEAYIAETLDSVLSQTFTDLELIVVDDGSTDRTREIVTSYG---TPV 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + NSG CA RN I A G Y+ +D DD W P +L++ L +Y+ Sbjct: 61 RLLSQS-NSGVCAARNYGIREAAGSYVCLMDHDDYWFPEKLALQLEQMFSHPEVGLVYST 119 Query: 123 DYVCQ------------GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC 170 + P + L I+ + A F +C Sbjct: 120 FVWWHPDGEGVFPKPESFNIEKMPGGIDEEFSGWIYHLLLLDCWILTSAALIRAEVFDKC 179 Query: 171 L-FDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 +D L ++D+D++LR+ EY K+ + + + + ++ R Sbjct: 180 GAYDESLPYSEDWDLWLRISREY-PVIKLNKGLTLYRQHPKQG--NRLTREIDYRTVLLR 236 Query: 230 KHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSV 266 +K+ S+ + + Q ++ L + Sbjct: 237 DAAEKWGLCSRDGRCISRQQFMQNIARYQMEFGLKHL 273 >UniRef50_Q3A3M0 Glycosyltransferase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A3M0_PELCD Length = 249 Score = 249 bits (638), Expect = 5e-65, Method: Composition-based stats. Identities = 56/250 (22%), Positives = 98/250 (39%), Gaps = 4/250 (1%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +SI P N + IKS+ Q ++NWE I+VDD ST ++PR Sbjct: 3 KVSIITPCHNSAPFVLETIKSISEQTFTNWEHILVDDASTDDTWQIIQKYCSSEPRAKCF 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 NSGA RN AI AQG YI ID DD W P +L + + +++ Sbjct: 63 RLTKNSGAALARNMAIEAAQGRYIAFIDSDDLWMPEKLEKQIEFMR-YKNAYLSFSSYEK 121 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + + + + K +IG + ++ ++ +++ QDY ++ Sbjct: 122 INEYGVRSGRMVDV--PNEVNYNEILKGCVIGCLTAMYDSKYLGKVYMPDIRRGQDYGLW 179 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 LR++ + + + G + + K + + YR + S K F Sbjct: 180 LRILRRGHIAYGLNCPLALYRERSGSISSSKFKKAY-SQWRIYRNIEKLSLICSLKNFFF 238 Query: 246 TLYQIRNKRM 255 Y+ K++ Sbjct: 239 YSYKGFKKKI 248 >UniRef50_A0YSS8 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YSS8_9CYAN Length = 321 Score = 249 bits (637), Expect = 7e-65, Method: Composition-based stats. Identities = 62/215 (28%), Positives = 95/215 (44%), Gaps = 10/215 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P ISI +P +N I SVL+Q +S++E+IIV+D ST + ++ D R+ Sbjct: 2 PQISIIIPVYNGATTIQATIDSVLKQTFSDFELIIVNDGSTDST--LEIISQYRDRRLQV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN-D 123 + ++GA A RN+ + A GEYI +D DD WTP++L L Q A Y+ D Sbjct: 60 LSYP-HAGASATRNRGLKQASGEYIAFLDADDLWTPDKLEAQLNALQTHPKAAVAYSWTD 118 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRFKECLFDTELKAAQ 180 ++ + + Q PY+ L Y N I N + FD L Q Sbjct: 119 FLDEAGSWRQAGRHTTVNGEPYAAMLLY--NFIESGSNPLIRRDALDDVGGFDESLSGGQ 176 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 D+D++LR+ Y V I + + Sbjct: 177 DWDLYLRLAARY-LFVNVPAVQIFYRIRNNSISSN 210 >UniRef50_B5JJR3 Glycosyl transferase, group 2 family protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJR3_9BACT Length = 1027 Score = 249 bits (637), Expect = 8e-65, Method: Composition-based stats. Identities = 59/280 (21%), Positives = 101/280 (36%), Gaps = 15/280 (5%) Query: 6 LISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALNDPRIT 63 ISI +P ++ ++ + I SVL Q Y+NWE+ I DDCS + A +D RI Sbjct: 496 KISIILPVYDVEEIWLRKCIDSVLSQIYTNWELCIADDCSNKPHIKEVLHQYAQSDSRIK 555 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 N N A LA G Y+ +D DDE P+ L+ T +Y+++ Sbjct: 556 ITFRKENGHISQASNSAASLATGTYLALLDHDDELAPHALAKVAESISTNPTAKLIYSDE 615 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDY 182 + KS ++ LF N+I + F++ F + AQD+ Sbjct: 616 DKIDQNGLRHG----PHFKSDWNYDLFLGCNMISHLGVYRKDIFEQTGGFRGGYEGAQDW 671 Query: 183 DIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+ LR + + E G K + + + R + Sbjct: 672 DLALRFIEIIPSSDIIHIPEILYHWRNIEGSTAHNIEHKNY-AIAAQKKAIEAHLRRTTT 730 Query: 241 KYQLFTLYQIRNKR-----MTWRTLLTLLSVRNGKRLADG 275 K + ++ K T+ ++ ++ L Sbjct: 731 KAIVKSVDGFDWKIEYEIPTPQPTVSIIIPTKDRIDLLKP 770 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 87/261 (33%), Gaps = 32/261 (12%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P +SI +PT +R L I S+L++ YSN++++I+D+ S + ++ + + I Sbjct: 752 QPTVSIIIPTKDRIDLLKPCIDSILKKTNYSNYKIVIIDNGSELEDTIRYLNSINSYENI 811 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-----DEWTPNRLSVFLAHKQQLVTHA 117 + + + + N+AI E I +++D D W L+ + H + Sbjct: 812 SILKDSEEFNYSRLNNKAISATSSEIICLLNNDIEVTQDNW----LNEMVRHTTRN-EIG 866 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLF----------------YKRNIIGNQVF 161 + A V + + + ++ + Y +F Sbjct: 867 VVGAKLLYPHNHVQHGGVIMGIGGVAAHAFKYLHRDDDGHIHRAHLVSGYSAVTGACMMF 926 Query: 162 TWAWRFKECLFDT-ELKAAQ-DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 + + FD LK + D D LR + A + + Sbjct: 927 KKSLWSELGGFDEKNLKVSYSDIDFCLRAGKTGKKTILTPFALLY---HKESETRGNPLS 983 Query: 220 KFSGYFHFYRKHKDKFDRASK 240 F + + + K Sbjct: 984 NKKDSTQFTLEMNYMYGKWRK 1004 >UniRef50_B4RT58 Putative glycosyltransferase family 2 protein n=2 Tax=Alteromonas macleodii RepID=B4RT58_ALTMD Length = 731 Score = 249 bits (636), Expect = 9e-65, Method: Composition-based stats. Identities = 57/284 (20%), Positives = 103/284 (36%), Gaps = 16/284 (5%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTS-WEQLQQYVTALND 59 ++ SI +PT+N I SVL+Q + NWE+ I DD ST+ A Sbjct: 196 QSEVKFSIILPTYNTDPIYLKECIDSVLQQTHKNWELCIADDASTNAETISTLKSYAQKH 255 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 I N N A+ + +Y+ +D DD + LS F L Sbjct: 256 ANIKLNLLSENGHISKASNAALSMVTSDYVLLLDHDDTLPAHTLSFFAKALTDNTNAKVL 315 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKA 178 Y + + +V Q + K ++ L +N I + V K F ++ Sbjct: 316 YGD----EDKVDEQGNRHQPHFKPDWNPDLLLSQNYICHPVVYKTSVLKEIGGFRVGVEG 371 Query: 179 AQDYDIFLRMVV--EYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QD+D+ LR ++ E + + +S K ++ KD D Sbjct: 372 SQDHDLLLRATAGLKHDEVVHLPFILYHWRVIENSTASNASAKSYT-TDAGIEAIKDFLD 430 Query: 237 RASKKYQLF-----TLYQIRNKRMTWRTLLTL-LSVRNGKRLAD 274 ++ + + Y++ + L++L + R+G + Sbjct: 431 QSGQNASVEKGKYPNTYKVNWALPDEQPLVSLVIPTRDGYDILK 474 Score = 84.0 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 77/234 (32%), Gaps = 27/234 (11%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCST-SWEQLQQYVTALNDP 60 PL+S+ +PT + + + ++S+ + Y N+E+I+VD+ +T Sbjct: 456 EQPLVSLVIPTRDGYDILKQCLESIYEKTSYKNFEIIVVDNQTTCDKTLGLFSEYNSTKA 515 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL 119 + D A+ N A+ AQGE + I++D + LS ++H + + Sbjct: 516 NFRVLKWDKPFNYSAINNFAVSQAQGEVVGLINNDIEVINEEWLSEMMSHALR-PEIGCV 574 Query: 120 YANDYVCQGEVYSQPA------------SLPLYPKSPYSRRLFYKRNI----IGNQVFTW 163 A Y + + Y RL +N+ + Sbjct: 575 GAKLYYPNDTIQHAGVILGIGGVAGHSHKYFHKSEPGYFTRLHLVQNMSAVTAACLLVRK 634 Query: 164 AWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVE-EATQILHINHGEMQI 214 + + + +L A D D L++ +H + Sbjct: 635 SVFEEVGGLNEQDLTVAFNDVDFCLKVHTAGYRNLFTPWAELY----HHESISR 684 >UniRef50_Q5FRX1 O-antigen biosynthesis protein RfbC n=1 Tax=Gluconobacter oxydans RepID=Q5FRX1_GLUOX Length = 876 Score = 248 bits (635), Expect = 1e-64, Method: Composition-based stats. Identities = 62/288 (21%), Positives = 105/288 (36%), Gaps = 20/288 (6%) Query: 4 NPLISIYMPTWNRQQL-AIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALNDPR 61 NPL+S+ P WN AI+SVL Q + NWE+I+VDDCS + E A D R Sbjct: 281 NPLVSVVCPVWNPDLTDFTAAIQSVLSQTWQNWELILVDDCSKNPELRAVIENFAKTDAR 340 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I + N G N + A+GE+I D DD + L + +Y+ Sbjct: 341 IKPIFQEKNGGISIATNAGLAAAKGEWIAFFDHDDLLADVAVEYMLREATR-SHADLIYS 399 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTW-AWRFKECLFDTELKAAQ 180 + + ++ + K+ ++ RL N + + V + + E AQ Sbjct: 400 D----EDKIDASGYYREPAFKTDWNYRLLLGVNYVCHFVMIRQSALQNIGGLNKEYDGAQ 455 Query: 181 DYDIFLRMVVE--YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 D+D LR + V E I + K + D R Sbjct: 456 DHDFLLRASEHIPAEKIHHVPEILYHWRITANST-ASDIGNKRYAIDAGIKAVSDHLVRR 514 Query: 239 SKKYQLFTLYQIRNKRMTW-----RTLLTLLSVRNG----KRLADGIR 277 + + + + + W ++ ++ ++ R D I+ Sbjct: 515 NLPATVDSQLGMTLYTVNWDFNQSPSVTIIIPYKDEIATTARCLDAIQ 562 Score = 54.7 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 37/270 (13%), Positives = 89/270 (32%), Gaps = 28/270 (10%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALND 59 +P ++I +P + + R + ++ + Y N+++I+V++ S + E Sbjct: 536 NQSPSVTIIIP-YKDEIATTARCLDAIQKYTRYPNYKVILVNNWSITKEAEAFAQAVDKI 594 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTP--NRLSVFLAHKQQLVTHA 117 P + + + N A +++ +++D + N L + + A Sbjct: 595 PNVEILTIKEPFNYSRINNLAAKDETSDFLLFLNND-VFVEQENWLEILVNEALADPLVA 653 Query: 118 FLYANDYVCQ------------GEVYSQPASLPLYPKSPYSRRLFYKRNI---IGNQVFT 162 + G+V + Y+ R ++ + I + Sbjct: 654 IVGGKFVYPNQTVQHAGVLLGIGDVAGHAHVGIPRDEGGYAGRAYFPQEISAVTAAGMLI 713 Query: 163 WAWRFK-ECLFDTE-LKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK 219 F+ FD E LK A D D+ L++ + + +H + S + Sbjct: 714 RRPAFELVGGFDEEHLKVAFNDIDLCLKVRDAGYKVVWTPDFCAE---HHESLSRGSDDR 770 Query: 220 KFSGYFHFYRKHKDKFDRASKKYQLFTLYQ 249 S F+ + + ++ + Y Sbjct: 771 P-STERRFFHETQFMIEKWGNTLRTDPFYN 799 >UniRef50_Q1Q4P0 Similar to glycosyltransferase family 2 n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q4P0_9BACT Length = 295 Score = 248 bits (635), Expect = 1e-64, Method: Composition-based stats. Identities = 63/275 (22%), Positives = 109/275 (39%), Gaps = 11/275 (4%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 PL+S+ MPT+N +I+SVL Q Y +E++++DD ST +L Sbjct: 3 NKKPLVSVIMPTYNCAVYLQESIESVLAQTYDAYEVVVIDDGSTDNTKLVLKPYMEK--- 59 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I YI N G RN I A+GEY+ +D DD W P +L + + + +Y+ Sbjct: 60 IKYIDLGRNEGLPTARNLGIQSAKGEYVAFLDADDIWMPEKLEMSIDQFMKNPDAGMVYS 119 Query: 122 NDYVCQGE-VYSQPASLPLYPKSPYSRRLFYKRNI-IGNQVFTWAWRF-KECLFDTELKA 178 + + + P +LF+++N I + V F K LFD+ L Sbjct: 120 KHINIDSKSQFLEGKIRRRLPSGNIFTQLFFEQNFIICSSVVVRKEVFNKTALFDSGLVN 179 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QD+D++LR+ Y + ++ + + + K F + Sbjct: 180 CQDWDMWLRIAF-YYKAIGIDVPLVKYRHSAKSLSKNRN-NVLKYQKVIIDKIYTMF-KD 236 Query: 239 SKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 S+ +Y+ R + R G ++ Sbjct: 237 SENGINEKMYKKRLAAHYAKIGRHY--TRMGDKVV 269 >UniRef50_A7VZ85 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VZ85_9CLOT Length = 903 Score = 248 bits (634), Expect = 1e-64, Method: Composition-based stats. Identities = 61/280 (21%), Positives = 108/280 (38%), Gaps = 16/280 (5%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWE---QLQQYVTAL 57 +N S+ +P +N +Q I+SV+ Q Y WE+ + D S + A Sbjct: 372 QNPVTFSVIVPVYNCPEQFLKEMIESVIGQTYPKWELCLA-DGSDQEHGYVETICRGYAQ 430 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 DPR+ Y N G A N AI +A GEYI+ D DD P+ L + ++ Sbjct: 431 KDPRVKYQKLSKNLGISANTNAAIEMAAGEYISLFDHDDLLHPSALFQVMKVIEEQ-HAD 489 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTEL 176 F+Y ++ +G + + K Y+ N I + F+ + F TE Sbjct: 490 FIYTDECSFEGTLE---NPTFAHFKPDYAPDTLRSNNYICHLSTFSRELMDQAGRFSTEH 546 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QDYD+ LR+ + + ++ G + K + R +D + Sbjct: 547 DGSQDYDMILRLTEKAKHIVHIPRVLYYWRVHAGSVASGVGVKPY-CILAAKRALQDHME 605 Query: 237 RASKKYQLF-----TLYQIRNKRMTWRTLLTLLSVRNGKR 271 R K ++ + Y+I + + L+ ++ Sbjct: 606 RVGLKGEVTDSKVPSTYKINYELDETPLISILIPNKDHID 645 Score = 100 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 80/225 (35%), Gaps = 21/225 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + PLISI +P + + I S+L Y N+E++IV++ ST + Y D Sbjct: 628 LDETPLISILIPNKDHIDDLKKCIDSILDYSTYKNFEIVIVENNSTELRTFEYYQELEKD 687 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAF 118 +I + A+ N G+Y+ +++D + TPN + L Q+ Sbjct: 688 SKIKVVTWKGKFNYSAINNYGAGFCSGKYLLLLNNDIEVITPNWIEEMLMFAQRE-DVGA 746 Query: 119 LYANDYVCQGEVYSQPASLPLY------------PKSPYSRRLFYKRNIIGNQ----VFT 162 + A Y G + L L Y+ R+ + +N+ + Sbjct: 747 VGARLYYPDGTLQHGGIILGLGGIAGHAQLGISREDPGYAGRVSFVQNLSACTAACLLVR 806 Query: 163 WAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQIL 205 + + D E + A D D LR+ + A Sbjct: 807 KSVFDEVDGLDEEFQVAFNDVDFCLRIRDKGYLIVYTPYAELYHY 851 >UniRef50_C7G777 Glycosyl transferase, group 1 n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G777_9FIRM Length = 646 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. Identities = 51/305 (16%), Positives = 94/305 (30%), Gaps = 38/305 (12%) Query: 2 KNNPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ----------L 50 + P ISI +P + + I+SV Q Y NW++ + D S + ++ Sbjct: 68 NDAPKISIVVPLFKTPETFLRAMIESVQAQTYGNWQLCLA-DGSGAGDEDADPKASLVQS 126 Query: 51 QQYVTALNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK 110 A D RI Y N G N AI LA G++I +D DD P+ L + Sbjct: 127 IANEYASADARIKYECLTENQGIAGNTNAAIALADGDWIAFMDHDDLLAPDALFEMVKMI 186 Query: 111 ----------------QQLVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN 154 + + LY ++ + + K ++ L N Sbjct: 187 RQGFHDEDGLAATVYREDGNDYEMLYTDEDKVDMDG---KTHFQPHLKPDFNIDLLRSNN 243 Query: 155 IIGNQVFTWAWRF-KECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 I + + + ++ AQDYD LR + G + + Sbjct: 244 YITHFLAVKRSLLDRVGGIRSDFDGAQDYDFILRCAEQAGAIGHIPRILYHWRCHKESTS 303 Query: 214 ITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF-----TLYQIRNKRMTWRTLLTLLSVRN 268 K++ R + R + Y++ + ++ ++ Sbjct: 304 ENPFSKQY-AVDAGKRAIGEHLKRLGVDAVVTPTKDMGFYEVEYPLTEQPLVSIIIPSKD 362 Query: 269 GKRLA 273 Sbjct: 363 EVETL 367 Score = 64.7 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 38/275 (13%), Positives = 83/275 (30%), Gaps = 44/275 (16%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCS---------------T 45 + PL+SI +P+ + + + I ++ + Y N+E+I+V++ S T Sbjct: 348 LTEQPLVSIIIPSKDEVETLRKCIAAIEKSSYGNYEVIVVENNSCEDTFRYYGDIAPQET 407 Query: 46 SWEQLQQYV-TALNDPRITY-IHNDINSGACAVR--NQAIMLAQGEYITGIDDD------ 95 + + + RI ++ + G + N + A+G Y +++D Sbjct: 408 TVDGTRCMEGKLAGGQRICVAVYME---GFNYSKLNNFGVKFAKGSYYLLMNNDIEMIGN 464 Query: 96 DEWTPN--------------RLSVFLAHKQQLVTHAFLYANDYVCQGEVYSQPASLPLYP 141 D +L Q + + ++ A Sbjct: 465 DWMKRMLGSCLREEVGIVGAKLFYQDHTIQHAGIVVGIGGSARGIGDNMFVGLAGDRSGY 524 Query: 142 KSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEE 200 S +L Y + + F+ +L A D D L++ Sbjct: 525 MHKASLQLDYSAVTAACLMVKREIYEQAGGFEEQLAVAFNDVDFCLKVRRLGKLVVYEPH 584 Query: 201 ATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 H SP+K + + ++ + Sbjct: 585 VQA-YHYESKSRGAEDSPEKTARFQREIEYMRNHW 618 >UniRef50_B1ZJV9 Glycosyl transferase family 2 n=7 Tax=Alphaproteobacteria RepID=B1ZJV9_METPB Length = 742 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. Identities = 59/284 (20%), Positives = 96/284 (33%), Gaps = 16/284 (5%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND-PR 61 P ISI MP +N + A+ S+ Q Y +WE+ + DD ST+ D PR Sbjct: 213 RPRISIVMPVYNTPKPYLEAALASIRAQGYPDWELCLCDDASTAPHIAPMLDALAADEPR 272 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + N N A+ LA G++IT ID DD P+ L F++ FLY+ Sbjct: 273 VRLHRRTQNGRIVGASNDALALATGDWITLIDHDDVIPPHALFAFVSALNADPQIDFLYS 332 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 + + ++ + + K +S + + F E + AQ Sbjct: 333 D----EDKITVEGERYEPFFKPDWSPETLEACMYTAHLALYRRDIVARIGAFRAECEGAQ 388 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS- 239 DYD LR V + G K + R +D+ R Sbjct: 389 DYDFALRYTEHVTRVHHVPQVLYHWRAIPGSTAQAMDNKGYV-VAAAIRALEDRARRTGG 447 Query: 240 ----KKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 + + +R + ++ +RGR Sbjct: 448 LDSVRPTAFAGSFHLRRPLTERPVVSIVIPT---AGRDSAVRGR 488 Score = 84.0 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 70/236 (29%), Gaps = 39/236 (16%) Query: 1 MKNNPLISIYMPTWNRQ--------QLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQ 51 + P++SI +PT R L + S+ + Y N E++ VD+ + Sbjct: 466 LTERPVVSIVIPTAGRDSAVRGRTVDLLAACLASIREKSSYDNIEIVAVDNG----DLRP 521 Query: 52 QYVTALNDPRI--TYIHNDINS-GACAVRNQAIMLAQGEYITGIDDDDE--WTPNRLSVF 106 AL R + D A N A+G+ + + +DD +P+ + Sbjct: 522 ATKEAL--ERFGARAVTWDQPVFNVAAKMNLGARAAKGDVLVFL-NDDIEVISPDWIEAM 578 Query: 107 LAHKQQLVTHAFLYANDYVCQGEVY-----------SQPASLPLYPKSPYSRRLFYKRNI 155 LA Q + + GE+ P + RN Sbjct: 579 LALLQ-IPGVGAVGPKLLFETGELQHVGVTVIDATPDHPRRSYPREDPGHFFSTAGNRNY 637 Query: 156 I---GNQVFTWAWRFKEC-LFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEA-TQIL 205 + G V F+ FD D D+ LR+ E Sbjct: 638 LAVTGACVMVRRADFEAIQGFDEGFAINYNDVDLCLRLRERGLRSVYCAEVELYHY 693 >UniRef50_A0M6I2 TuaG-like glycosyl transferase n=2 Tax=Flavobacteriaceae RepID=A0M6I2_GRAFK Length = 261 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 56/228 (24%), Positives = 103/228 (45%), Gaps = 4/228 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M++ L+S+ MP +N + +I+SV+RQ Y NWE++I+DD S+ + + D Sbjct: 1 MEDRGLVSVIMPAYNSEVFITESIQSVIRQTYPNWELLIIDDASSDTTKQIVQKFSSEDE 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N N+G RN+ I A G +I +D DD+W P +L L + A + Sbjct: 61 RIKLLKNSTNAGTHHSRNKGIKAASGNFIAFLDADDQWKPQKLEKQLKVLSK-PNVAACF 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 ++ + + S + P Y K N +GN + ++ ++ Q Sbjct: 120 SSYELINEKGASLQKKIQALPVLDY--DKLLKANYVGNLTGIYNVSILGKVYCPDIPKRQ 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY 228 D+ ++L+++ E G + E+ + I + T+ + F Y Sbjct: 178 DWALWLKVIEEGGPIEGIAESLAVYRIRKNSIS-TNKLEMLKYNFKVY 224 >UniRef50_A1AUB0 Glycosyl transferase, family 2 n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AUB0_PELPD Length = 1250 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 55/221 (24%), Positives = 94/221 (42%), Gaps = 6/221 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL+S+ M N + +AI+S+ Q + N+E II+DD ST + DPR+ Sbjct: 626 SPLVSVVMSVHNGECYLRQAIQSIFCQSFLNYEFIIIDDASTD--ATPDILLQFKDPRLA 683 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N N G A N I A+G+Y+ +D DD P+RL+ + + A + ++ Sbjct: 684 IIRNTGNLGLTASLNIGIRAARGKYVARMDADDISVPHRLARQVESLENNPEIAVVGSSY 743 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFK-ECLFDTELKAAQD 181 Y + + + +R ++N G+ V F+ +D AQD Sbjct: 744 YTMDEHGAVTGI-IDVLDRPELIQRELLRQNWFGHGSVMMRKSCFEIVGGYDERFIYAQD 802 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 YD+FLR+ Y + V E + + ++ Sbjct: 803 YDLFLRLSERY-KLANVTEPLYCWRESPHGISARKKAQQDY 842 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 55/242 (22%), Positives = 96/242 (39%), Gaps = 13/242 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND-PRIT 63 PL+S+ +PT NR + AI S+L Q ++E+I+V+D T ++ + D I Sbjct: 873 PLVSVIVPTHNRPDMLATAISSILAQTMQSFEIIVVNDAGTD---VRPVLERFGDRENIR 929 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 ++ ++ N G A RN I A+G+YI +DDDD + PN L + + + Y + Sbjct: 930 HLAHETNRGLAAARNTGINAARGKYIAYLDDDDTYYPNHLETLVGYLESN-DCRVAYTDA 988 Query: 124 YVCQGEVYSQP---ASLPLYPKSPYSRRLFYKRNIIGNQVFT--WAWRFKECLFDTELKA 178 + + + + N I A +FD L Sbjct: 989 NCAIQQKAANGFVTVKKEVVYSRDFDYDAILYVNFIPVLCIIHEKACLSLSGMFDEYLSR 1048 Query: 179 AQDYDIFLRMVVEYGEPWKVEEAT--QILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +D+D+++RM ++ T + M +S+P F Y K+ D Sbjct: 1049 HEDWDLWIRMSRH-ERFAHIKTVTCEYTYRPDGSSMTSSSAPLFFETYKAVCGKYDDFVK 1107 Query: 237 RA 238 Sbjct: 1108 EK 1109 >UniRef50_C1DRS5 Glycosyl transferase, family 2 n=1 Tax=Azotobacter vinelandii DJ RepID=C1DRS5_AZOVD Length = 1182 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 57/278 (20%), Positives = 105/278 (37%), Gaps = 15/278 (5%) Query: 4 NPLISIYMPTWNRQQLAIRAI-KSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL SI +P +N + A+ SV Q Y +WE+I+VDD S E+ ++ + ++DP+I Sbjct: 654 TPLFSIVVPAYNTSSELLDAVLSSVRAQWYPHWELILVDDASP-SEETRRALAEIDDPKI 712 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 +H + N G N + AQGE+I +D DDE T + L + F+Y++ Sbjct: 713 RVLHLESNKGISGATNVGLAAAQGEFIVFMDHDDELTVDCLYELALCINR-DQPDFIYSD 771 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQD 181 + ++ + + K +S + K +E QD Sbjct: 772 ----EDKLTEEGEYTQPHFKPDWSPDTMMSTMFTCHVSCVRRSLLSKVGELRSEFDGCQD 827 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 +D LR+V + + I + S K + R D +R K Sbjct: 828 WDFILRVVEHTNRISHIPKVLYHWRIIPASVASDISAKPYV-LEASRRVRLDALERRGLK 886 Query: 242 YQLF------TLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 + +++ + ++ R+ + Sbjct: 887 GSIEPVAQVPGYFRVNYHLQGSPLISIIIPSRDNGSVL 924 Score = 92.8 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 49/268 (18%), Positives = 94/268 (35%), Gaps = 23/268 (8%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 ++ +PLISI +P+ + + R + S+ + Y N+E+II+D+ S Y+ L + Sbjct: 905 LQGSPLISIIIPSRDNGSVLRRCLDSIQEKSSYRNFEIIILDNGSVEAS-TVAYLKELQE 963 Query: 60 PRI-TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSV-----FLAHKQQ 112 + I +D + N A GE + ++DD + + L L H Sbjct: 964 KGVAQIIRHDAPFNFSELNNIGARTAGGELLLFLNDDTEVLCNDWLERMGGYAQLVHIGA 1023 Query: 113 ------LVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTW 163 + + + + Y R + N + G + Sbjct: 1024 VGAKLLYPDSSEIQHAGVLNLANGPVHAFLRHHSERPGYFMRNLLEYNWLAVTGACLMME 1083 Query: 164 AWRFKE-CLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 A++F E FD L A D ++ +R V + + T I+H + Sbjct: 1084 AYKFNELGGFDETLPVAYNDIELCIRAVEKGYYNVVCQSVTL---IHHESVSRGLDHVDP 1140 Query: 222 SGYFHFYRKHKDKFDRASKKYQLFTLYQ 249 + R+ + +D +Q Y Sbjct: 1141 VKFARLQRELRRLYDMHPMFFQYDPFYN 1168 >UniRef50_Q8YSM2 Alr3062 protein n=6 Tax=Nostocaceae RepID=Q8YSM2_ANASP Length = 321 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 62/266 (23%), Positives = 113/266 (42%), Gaps = 15/266 (5%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +P +N ++SVLRQ Y ++E+++V+D S+ +++V+ + DPR+ Sbjct: 2 PKVSVVIPAYNAMPYLPETLESVLRQTYHDFEVVVVNDGSSDNT--EEWVSQILDPRLKL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N G RN I+ A GEYI +D DD W P +L+ ++ + T +Y Sbjct: 60 ISQ-ANQGLAGARNTGIVNASGEYIAFLDADDIWEPTKLAKQVSVLDENPTVGLVYTWVA 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--GNQVFTWAWRFKE-CLFDTEL-KAAQ 180 + S + + + NI+ G+ F++ LFD L + Sbjct: 119 YIDEQGKSTGKIFKNQVEGYVWPQ-LTEHNIVECGSVALVRRVCFEKMGLFDRNLGSYVE 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+D++LR+ Y V+EA + + + + F + F AS+ Sbjct: 178 DWDMWLRIATSYD-FKVVKEALVYYRQR--SNSASKNWEAMAHSFAIV--IEKAFATASQ 232 Query: 241 KYQL--FTLYQIRNKRMTWRTLLTLL 264 Q+ Y + W+ L + Sbjct: 233 DLQVLKNKSYGFTYLCLAWKPLQSFQ 258 >UniRef50_C0B9B1 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0B9B1_9FIRM Length = 818 Score = 247 bits (632), Expect = 2e-64, Method: Composition-based stats. Identities = 52/282 (18%), Positives = 98/282 (34%), Gaps = 13/282 (4%) Query: 5 PLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQL--QQYVTALNDPR 61 P ISI +P + ++ + SV Q Y NWE+ + D S + D R Sbjct: 279 PKISIVVPLYRTPERYLREMLDSVRGQSYQNWELCLS-DGSGEDSSIAGILEEYTKKDSR 337 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I N N A+ +A G+YI +D DD TP+ L +A + +Y Sbjct: 338 IRVKDNKKQLHISDNTNVALDMATGDYIAFMDHDDLLTPDALYECVAELNEYPDTELIYT 397 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQ 180 ++ + + KS ++ + N + V ++ + E AQ Sbjct: 398 DEDKITMDGEE---YFFPHFKSDFNPDMLCTTNYFCHLVVVKKELYQAAGKLNGEYDGAQ 454 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS--GYFHFYRKHKDKFDRA 238 DYD LR V + + + + G ++ K + R H + Sbjct: 455 DYDFVLRCVEKTDKIRHIPKILYHWRACEGSTAGSADNKSYIVDAGAKAVRAHYRRMGIE 514 Query: 239 SKKYQLFT--LYQIRNKRMTWRTLLTLLSVRNG-KRLADGIR 277 ++ +Y+ + + ++ ++ L +R Sbjct: 515 AEVIPTKYPGMYRTKYPVKQTPKISVIIPNKDHTDDLIKCLR 556 Score = 79.3 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 80/234 (34%), Gaps = 39/234 (16%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVT-ALND 59 K P IS+ +P + I+ ++S+ + Y N E++++++ S + + Y Sbjct: 533 KQTPKISVIIPNKDHTDDLIKCLRSIREKNTYENIEILVIENNSQKKKTFKDYRRIMHEY 592 Query: 60 PRITYIHN--------DINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAH- 109 P++ ++ +IN I A GEY+ +++D + + + L++ Sbjct: 593 PKVKVLYWKGEGFNYPEINQ-------YGIDHATGEYLLFLNNDTEMIGNDCIKEMLSYC 645 Query: 110 KQQLVTHAFLYANDYVCQGEVYSQPASLPLYPK--------SPYSRRLFYKRNIIG---- 157 ++ + A Y G + + L S F + ++ Sbjct: 646 MRE--DVGAVGARMYYEDGTLQHGGVIIGLGGVAGHAFLGIDGDSPGYFARAQVVHDLSA 703 Query: 158 ----NQVFTWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQIL 205 + + FD++ A D D+ L++ A Sbjct: 704 VTAACMMMKKRVYEEVGGFDSKFAVAFNDVDLCLKIRKAGYLIVYDPYAELIHY 757 >UniRef50_Q8PGL8 Truncated O-antigen biosynthesis protein n=2 Tax=Xanthomonas RepID=Q8PGL8_XANAC Length = 614 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 63/287 (21%), Positives = 108/287 (37%), Gaps = 19/287 (6%) Query: 6 LISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALNDPRIT 63 +IS+ +P N + AI+SV Q Y NWE+ I DD S P + Sbjct: 86 MISVIIPVCNTPELFLREAIESVRTQTYQNWEICIHDDASNEPHVGRILAEFKKRIPNLR 145 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 ++ SG A N A+ A+G ++T +D DD N L+ + + A +Y + Sbjct: 146 VTRSETRSGIAATTNAALASARGRWVTFLDHDDVLDRNALAAVVDCHIK-TGSAVVYTDH 204 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQV-FTWAWRFKECLFDTELKAAQDY 182 + + + Y K ++ LF + +G+ V F + L +E +QDY Sbjct: 205 DMLGEDGRLRN----PYFKPDWNYDLFLAQMYLGHLVSFEHDLIKRVGLTRSECDGSQDY 260 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+ LR V + V ++ G + K + + R + D Sbjct: 261 DLVLRCVAAGAKVGHVPRVLYHWRMHAGSTAANADSKPY-AHDSGKRAIQYHVDAVYPGA 319 Query: 243 QLFT-----LYQIRNKRMTW-RTLLTLLSVRNGKRLA----DGIRGR 279 Q+ Y +R ++ R+G L + I+GR Sbjct: 320 QVEDGPHLFCYDVRYPYAQHAPKASIIIPTRDGLDLLRPCIESIQGR 366 Score = 87.0 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 87/270 (32%), Gaps = 30/270 (11%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 ++ P SI +PT + L I+S+ + Y+N+E+I+VD+ S L+ N+P Sbjct: 338 QHAPKASIIIPTRDGLDLLRPCIESIQGRTDYTNFEIIVVDNGSKDPATLEWLDAMSNEP 397 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTP-NRLSVFLAHKQQLVTHAFL 119 + D+ A+ N A A+GE + +++D E + L + + + Sbjct: 398 WFRVLQADVPFNWSALNNLAAASAEGEVLVFLNNDTEVASADWLQRLVENALR-PDVGAC 456 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-----QVFTWAWRFKE----- 169 A G + + + ++ +F + I + Sbjct: 457 GALLLYADGSIQHAGVVVGM---GGWADHVFKGQQPIHHQHLFVSPILRREVLAVTGACV 513 Query: 170 ----------CLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 FD D ++ LR + V EA I+H Sbjct: 514 AISRANFDAIGGFDESFVVCGSDVELCLRAMANGKRTVYVPEAIL---IHHESKTRDPRV 570 Query: 219 KKFSGYFHFYRKHKDKFDRASKKYQLFTLY 248 S + + + + Y Y Sbjct: 571 IPESDFVRSAQAYSPYRESGDPFYSPNLDY 600 >UniRef50_C6JDV1 Glycosyl transferase n=5 Tax=Clostridiales RepID=C6JDV1_9FIRM Length = 814 Score = 246 bits (628), Expect = 7e-64, Method: Composition-based stats. Identities = 56/279 (20%), Positives = 100/279 (35%), Gaps = 16/279 (5%) Query: 4 NPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTA--LNDP 60 P IS +P + ++ R +S Q YSNWE+ D S + L + + D Sbjct: 279 RPKISFVVPLYKTPEKYLRRLTESFQEQTYSNWELCFS-DGSGAQSPLTELLKELTAKDN 337 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y+ ++ N AI +A G++I D DDE TPN L + + +Y Sbjct: 338 RIKYVSHEEPLQISENTNSAIEIATGDFIAFADHDDELTPNALFECVKAINEKPQTLVIY 397 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTELKAA 179 ++ + + + K Y+ L N I + V + K +E A Sbjct: 398 TDEDKMSMDGHK---FFQPHFKPDYNPDLLCTVNYICHLFVVSRKVIEKVGGLRSEFDGA 454 Query: 180 QDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QDYD LR V E + + + K + + R + ++R Sbjct: 455 QDYDFVLRCVEAVKDEEICHIPKILYHWRCHEDSTAENPESKLY-AFEAGRRAVQAHYER 513 Query: 238 ASKKYQLFTLYQIRNKRMTW-----RTLLTLLSVRNGKR 271 ++F + R + + ++ ++ Sbjct: 514 TGIHAEVFKGEYLGLYRTKFIRDHDPLISIIIPNKDHID 552 Score = 90.9 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 76/224 (33%), Gaps = 22/224 (9%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSV-LRQDYSNWEMIIVDDCSTSW-EQLQQYVTALNDP 60 ++PLISI +P + R ++S+ + Y N+E IIV++ ST +P Sbjct: 537 HDPLISIIIPNKDHIDDLKRCMESIEQKSTYKNYEYIIVENNSTDSATFEYYKKLEAENP 596 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL 119 ++ ++ D A+ N A+GEY+ +++D + P+ L L + + + Sbjct: 597 KVRMVYWDGVFNYSAINNYGASFAKGEYLLLLNNDTEIINPDCLEELLGYCMR-KDVGAV 655 Query: 120 YANDYVCQGEVYSQPASL------------PLYPKSPYSRRLFYKRNI----IGNQVFTW 163 A Y + + + Y R+ ++ + Sbjct: 656 GARLYYEDDTIQHAGVVIGFGGIAGHCFVQQKRGTTGYCHRIICAQDYSAVTAACMMVKK 715 Query: 164 AWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQIL 205 + EL A D D +++ A Sbjct: 716 SAFDAVGGLSEELAVAFNDIDFCMKLRKAGYLIVYNPYAELYHY 759 >UniRef50_A3I1Y0 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I1Y0_9SPHI Length = 255 Score = 246 bits (628), Expect = 9e-64, Method: Composition-based stats. Identities = 53/256 (20%), Positives = 99/256 (38%), Gaps = 6/256 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI P N + AI+S+L+Q Y+NWE+I++DD S+ + D RI + Sbjct: 2 LVSIITPVHNSIEHVEEAIQSILKQTYTNWELILIDDKSSDSSLEVLHAYEEKDSRIHLL 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N GA RN+ I ++G +I +D DD W P +L +A + HAF Y + Sbjct: 62 KNEKNKGAALTRNRGIEASKGRFIAFLDSDDLWEPTKLETQIAFMLE-KGHAFTYTSYLT 120 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + + + K IG + ++ + QDY ++ Sbjct: 121 FRNDNEEVGVLEVP---EKVNHNDLLKTCSIGCLTAMYDTEVLGKVYMPIISKRQDYALW 177 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L+++ E + + + + + K + YR+ ++ Y Sbjct: 178 LKILKEIPFAYGINTPLAKYRLRSDSIS-GNKLKAAGYQWKVYREFEN-LSIFQASYYFI 235 Query: 246 TLYQIRNKRMTWRTLL 261 + + + Sbjct: 236 HYSIFGVIKTYFNKMQ 251 >UniRef50_B0NG36 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NG36_EUBSP Length = 526 Score = 245 bits (626), Expect = 1e-63, Method: Composition-based stats. Identities = 56/286 (19%), Positives = 99/286 (34%), Gaps = 15/286 (5%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 P +SI +PT++ ++ I+SV RQ YSNWE+ I D S + + + D Sbjct: 57 SYRPKVSIIVPTYHTPEKFLREMIESVSRQTYSNWELCIA-DGSEDDKVANVVSSYVTDA 115 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + +IN G N AI ++ GEYI +D DD L + + +Y Sbjct: 116 RIKFKKLEINGGISDNTNAAIEMSTGEYIALLDHDDFLESVALFEIVKAINVNKS-EVIY 174 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 ++ + + K ++ L + N I + F E A Sbjct: 175 TDE---DKVSWEGSRFFDPHFKPDFNLELLRQNNYICHFFIVSRRILNIVKGFRKEFDGA 231 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QDYD R + V + K + Y + + R Sbjct: 232 QDYDFIFRCIESANSVEHVPIPLYHWRTHKDSTASNPESKAY-AYISGKKAIEAHLYRCQ 290 Query: 240 KKYQLFTLYQIRNKRMTWRT-------LLTLLSVRNGKRLADGIRG 278 +K + + ++ ++ + + K+L I+ Sbjct: 291 EKAVVKSTMHYGFYHTQYKLVAFDKVVIVVIKNNSKSKQLQKCIKA 336 >UniRef50_A9H3G6 Glycosyl transferase family 2 n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9H3G6_GLUDA Length = 988 Score = 245 bits (626), Expect = 1e-63, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 100/285 (35%), Gaps = 16/285 (5%) Query: 1 MKNNPLISIYMPTWNRQQLA-IRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + P SI +P +N R + SVL Q Y +WE+I+VDD S + + L D Sbjct: 458 LAKRPAFSIVVPLYNTPDDLFQRMVGSVLAQWYPHWELILVDDKSPQQSV-RDNASKLVD 516 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI I + N G N+ + A G+YI +D DDE T + L ++ Sbjct: 517 PRIRTILLESNMGISGATNRGLAEAGGDYIVFLDHDDELTDDCLFELAKCIDAE-DPDYV 575 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKA 178 Y++ + ++ + K +S + + +E Sbjct: 576 YSD----EDKIEPDGRFSQPFFKPDWSPDTLMSTMYTCHVSCVRRALLETVGDLRSEFDG 631 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QD+D LR+ V + I + + K + R +R Sbjct: 632 SQDWDFVLRVTEAAKRISHVPKVLYHWRIIPQSVASDLNAKPY-AVDAGRRARMAALERR 690 Query: 239 SKKYQLFT------LYQIRNKRMTWRTLLTLLSVR-NGKRLADGI 276 K + +++ + ++ + NG L + + Sbjct: 691 GLKGTIEAVPQLAGYFRVNYDVQGTPLVSIIIPTKNNGTVLKNCL 735 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 87/279 (31%), Gaps = 34/279 (12%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + PL+SI +PT N + + S+ Y N+E++++D+ ST + + +P Sbjct: 713 QGTPLVSIIIPTKNNGTVLKNCLDSIFGHSHYRNFEIVLLDNGSTDAATVNYLDSLHANP 772 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDD-EWTPNRLSVFLAHKQ-QLVTHA- 117 + I +D + N + A+G + ++DD +P+ + + Q V Sbjct: 773 NVRVIRHDAPFNYSEINNIGVGDAKGSLLLFLNDDTQVISPDWIGRMAGYAQLTHVGAVG 832 Query: 118 --FLYANDYVCQGEVYSQPASLPLY-------PKSPYSRRLFYKRNII----GNQVFTWA 164 LY + Q A P + Y R + + I + Sbjct: 833 AKLLYPDSRKIQHSGVLNLADGPNHAFWSADAYTPGYFARNLLEYDWIAVTGACLMIERT 892 Query: 165 WRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA---TQILHINHGEMQITSSPKK 220 FD A D D+ R+V + V + + K Sbjct: 893 KFDAVGGFDESFPIAYNDVDLCFRLVEHG--FYNVVCPGAELFHY----ESLSRGNDNKN 946 Query: 221 FSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRT 259 K + ++ Y + + + Sbjct: 947 -------KEKRRRLDQEKNRLYYKNPHFLMHDPFYNPNL 978 >UniRef50_D0L0W1 Glycosyl transferase family 2 n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0W1_HALNC Length = 733 Score = 244 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 62/284 (21%), Positives = 112/284 (39%), Gaps = 17/284 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV-TALNDPR 61 P+IS+ M TW + AI+SVL+Q Y W++ I DD S + ++ D R Sbjct: 196 QPVISVIMATWQSDLRWLDAAIQSVLQQSYPYWQLCIADDASEQKKLIELLNHYKKKDSR 255 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I N A +N ++LA G Y+T +D DD PN L QQ FLY+ Sbjct: 256 IRVTFRYENGHISAAQNSGLLLATGAYVTFLDHDDLLAPNALLAVAEALQQQPRPLFLYS 315 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQ 180 ++ G + + KS ++ L +N I + + + F ++ +Q Sbjct: 316 DEDKIDG----TGRRVNPHFKSDWNPDLLCAQNYITHLMVVERAMVESVGGFRVGVEGSQ 371 Query: 181 DYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 D+D+ LR+ + + + I + + K ++ ++ F R Sbjct: 372 DHDLALRVTEKLPPALIHHIPQVLYHWRITENSTALHADAKNYTSS-AGVAAIEEHFKRK 430 Query: 239 SKK------YQLFTLYQIRNKRMTWRTLLT-LLSVRNGKRLADG 275 ++ L Y++ + L+T L+ R+ + Sbjct: 431 NQDDVMVQPGLLPNTYRVLHPIPKPAPLVTLLIPTRDRLDMLKP 474 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 52/290 (17%), Positives = 92/290 (31%), Gaps = 37/290 (12%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSW-EQLQQYVTALNDPRI 62 PL+++ +PT +R + I S++ + Y +E+II+D+ S L I Sbjct: 457 PLVTLLIPTRDRLDMLKPCISSIIEKTSYKPYEIIILDNGSNEPQTHLFFAEIQAQHEHI 516 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYA 121 + D A+ N + AQG I I++D + L+ ++H + + A Sbjct: 517 RVLSYDKPFNYSAINNFGVAHAQGTIIGLINNDIEVIQSGWLTELVSHAVR-PEIGCVGA 575 Query: 122 NDYVCQGEVYSQPASL---------PLYPKSP---YSRRLFYKRN---IIGNQVFTWAWR 166 Y G + L Y Y RL +N + G + Sbjct: 576 KLYFDDGSIQHAGVILGIGGVAGHAHKYFARDAHGYFSRLQLVQNLSAVTGACLLVRKEV 635 Query: 167 FK-ECLFDTE-LKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSSPKKFS 222 F+ + + L A D D+ L++ A +H Sbjct: 636 FEQVGGLEEDHLTVAFNDIDLCLKVREAGYRNLWTPHAELY----HHESKSRGRDDAPEK 691 Query: 223 GYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRL 272 + +F R + + N + LTL RL Sbjct: 692 ---------QARFQR-EVAWMIARWGAQLNSDPYYNPNLTLQHEDFSIRL 731 >UniRef50_B0G3W1 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G3W1_9FIRM Length = 575 Score = 244 bits (624), Expect = 2e-63, Method: Composition-based stats. Identities = 58/268 (21%), Positives = 99/268 (36%), Gaps = 17/268 (6%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---------LQQY 53 PLISI +PT+ I SV +Q Y WE+ I D S + ++ + Sbjct: 78 QPLISIVVPTYQTPIPFLKDMIDSVRKQSYEKWELCIA-DGSLNGDENDTKVIRVREELN 136 Query: 54 VTALNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL 113 ++ D RI ++ + N G NQA+ LA GEYI D DD TP+ L + Sbjct: 137 RYSMEDKRIKVVYLEENQGIAENTNQALALATGEYIGLFDHDDMLTPDALYEIVKAINDY 196 Query: 114 VTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW-RFKECLF 172 + LY ++ + K YS L N I + + F Sbjct: 197 -DYDVLYTDEDKISENSH---DYKKPVFKPDYSPELLCANNYITHFFVAKKSIVDRLEGF 252 Query: 173 DTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 E +QDYD R V V + ++ G + + K + Y + + Sbjct: 253 RKEYDGSQDYDFIFRCVELAKNVGHVSKVLYHWRMHGGSVAGDPTSKMY-AYDAGKKAIQ 311 Query: 233 DKFDRASKKYQLFTLYQIRNKRMTWRTL 260 ++R + + + ++ ++ + Sbjct: 312 SHYERVGIQANVEHMERLGLYHTEYKMI 339 >UniRef50_B4AD21 Glycosyltransferase n=17 Tax=Enterococcus faecalis RepID=B4AD21_ENTFA Length = 252 Score = 244 bits (624), Expect = 3e-63, Method: Composition-based stats. Identities = 60/248 (24%), Positives = 104/248 (41%), Gaps = 5/248 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI MP +N + ++I+SVL Q Y NWE++++DD S D RI + Sbjct: 5 LVSIIMPMYNAGKFLSKSIESVLEQTYQNWELLLIDDGSKDDSIDIALAFMEKDSRIFLL 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N G RN+ I ++G+YI +D DD W PN+L V + ++ F ++ +V Sbjct: 65 KNEQNMGIAKTRNKGIEASKGQYIAFLDSDDLWLPNKLEVQIKWMEE-KKLLFTCSSYFV 123 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 C + + P + + K N IG + +LK +DY + Sbjct: 124 CNENGNITHER--NFSEGPQTYQDLLKTNTIGCLTVVVESNLLKRHLMPDLK-HEDYATW 180 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L ++ E + + E I ++ S + R+++ S Y + Sbjct: 181 LNILKEINTVYFINEKLAIYRKLTTSTS-SNKWNTISWVWKILRQNEQFSVIKSSFYLMR 239 Query: 246 TLYQIRNK 253 L+ K Sbjct: 240 FLFYTTFK 247 >UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=Q8YUP7_ANASP Length = 343 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 61/281 (21%), Positives = 110/281 (39%), Gaps = 28/281 (9%) Query: 8 SIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHN 67 S+ +P +N + ++SVL Q Y + E+IIVDD ST D RI IH Sbjct: 4 SVIIPVYNSESSVAETLRSVLAQTYRDLEIIIVDDGSTDKS--IDICKQFQDERIRIIHQ 61 Query: 68 DINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVCQ 127 N G RN I A+GEY+ +D DD W P +L+ L H ++ ++ + Sbjct: 62 -QNRGLAGARNTGIRQARGEYLAFVDSDDLWLPEKLAKHLEHFERSPEVGVSFSRSSLID 120 Query: 128 GEVYSQPASLPLYPKSPYSRRLFYKRNII--GNQVFTWAWRFKEC-------------LF 172 + + + + RN I G+ V F Sbjct: 121 DQGKPLGIYQMPKL-TDITPEYLFCRNPISNGSSVVIRRAVLDTIKFQENLYGEVEDFYF 179 Query: 173 DTELKAAQDYDIFLRMVVE-YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 D + ++D + +LR+ ++ + + EA + +N G + + K++ + K Sbjct: 180 DDSFRQSEDIECWLRIALQTTWKIEGIPEALTLYRVNMGGLSA-NVLKQYESWERILVKT 238 Query: 232 K-------DKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLS 265 + +++ +K YQL L + + + + LL Sbjct: 239 QAYNPEFIERWGNRAKAYQLRYLARRATRGRSPAMAVNLLH 279 >UniRef50_A5G2M9 Glycosyl transferase, family 2 n=1 Tax=Acidiphilium cryptum JF-5 RepID=A5G2M9_ACICJ Length = 880 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 66/284 (23%), Positives = 104/284 (36%), Gaps = 18/284 (6%) Query: 3 NNPLISIYMPTWNRQQL--AIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT-ALND 59 + PL+S+ P + R + A+ SV Q Y NWE+I+VDD S A D Sbjct: 312 DAPLVSVICPVY-RPDHGAFVTAVDSVRAQTYRNWELILVDDGSGDARLTALMKRLADAD 370 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI N+G A N+AI A G++I D DD P L + Q L Sbjct: 371 PRIRVQIRKANAGIAAATNRAIEAATGDFIAFFDHDDVLEPCALDAMMRA-QAATGARLL 429 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKA 178 Y++ + ++ + A K ++ RL N I + A + D +L Sbjct: 430 YSD----EDKIDRRGAFSEPNFKPDFNYRLLLDLNYICHLAVAEAALVRAAGGLDPQLDG 485 Query: 179 AQDYDIFLRMVVE--YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 AQD+D+ LR+V E V E I + + K + Sbjct: 486 AQDHDLLLRLVERLGAHEIHHVPEILYHWRITAQSTAGSGAAKPKAALAGAAAV-AAHLK 544 Query: 237 RASKKYQLFTLYQIRNKRMTWRT-----LLTLLSVRNGKRLADG 275 R + + R ++R + L+ R+ + Sbjct: 545 RRGLAARTERRGGLTCYRTSFRMSEDPGVSILIPYRDHIGMTRA 588 Score = 56.6 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 42/299 (14%), Positives = 92/299 (30%), Gaps = 40/299 (13%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDY-SNWEMIIVDDCSTSWEQL--QQYVTAL 57 M +P +SI +P + + ++++ + +E++++D+ S E L Sbjct: 567 MSEDPGVSILIPYRDHIGMTRACVEAIRDVTRGTRYEILLLDNWSQDEEAEGFAVEQGNL 626 Query: 58 NDPRITYIHNDINSGACAVR--NQAIMLAQGEYITGIDDD----DEWTPNRLSVFLAHKQ 111 D R+ I N R N+ + A+ ++ +++D D P L L Sbjct: 627 PDTRVIRIAEPFN----YSRINNRGVEAARHPFLLFMNNDVFVSD---PEWLRTMLNEAL 679 Query: 112 QLVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLF------YKRNIIGN------- 158 + A V L + + +S R Y N I Sbjct: 680 ADPGVGAVGAKLLYPNETVQHAGVVLGVGGIADHSFRGLPADKPGYIANAIACREVSAVT 739 Query: 159 ---QVFTWAWRFKECLFDTE-LKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ 213 + FD + L A D D+ +++ + + + Sbjct: 740 AACMLVRREAFAAAGGFDEDGLSVAFNDVDLCMKIRQAGYRIIFSADTVCE---HRESLS 796 Query: 214 ITS--SPKKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGK 270 K + + ++++D A K+ + + +R L + + Sbjct: 797 RGDDFDESKLARFMLENETMRERWDEALKRDPFYNP-HFSREGGVYRDLRVIDPADVRR 854 >UniRef50_C7TKV7 Glycosyl transferase, group 2 n=5 Tax=Lactobacillus RepID=C7TKV7_LACRL Length = 251 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 58/248 (23%), Positives = 94/248 (37%), Gaps = 7/248 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M N P++S+ MP +N ++SVL Q + + E++IVDD S+ + D Sbjct: 1 MGNTPVVSVVMPAYNAATYLPNTVQSVLEQSFKDLELLIVDDASSDHTFEVVEKLSATDS 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RITYI D N G RN + ++GEY+ +D DD W P++L + L QQ F Y Sbjct: 61 RITYIKLDENQGVANARNVGVQHSRGEYVAFLDSDDIWLPDKLKIQLTFMQQH-QVDFSY 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + V K Y K N IG + + + E+ + Sbjct: 120 CSYEVVDDHGTKIGERKISETKLDYHE--MLKGNRIGLLTVMLSRKIAQKYPFPEI-NHE 176 Query: 181 DYDIFLRMVVEYGEPWKVEE-ATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 DY +L + + E + + + + + YR+H + Sbjct: 177 DYACWLSIARNGTVAYLASEQILAKYRKHQTSLSA-NKIQAAQWTWTIYRRH-EHLGLFK 234 Query: 240 KKYQLFTL 247 Y Sbjct: 235 SFYYFLHY 242 >UniRef50_B0TYV9 Putative teichuronic acid biosynthesis glycosyl transferase n=1 Tax=Francisella philomiragia subsp. philomiragia ATCC 25017 RepID=B0TYV9_FRAP2 Length = 247 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 65/249 (26%), Positives = 107/249 (42%), Gaps = 6/249 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI P++N ++ I+SV+ Q Y NWEMIIVDDCST D RI I Sbjct: 4 LVSIITPSYNSEKYISNTIESVITQTYQNWEMIIVDDCSTDSSCDIAEKYTQKDSRIKLI 63 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + NSG RN+AI A+G YI+ +D DD W +L + ++ A ++ Y+ Sbjct: 64 KLNKNSGPAKARNRAIKEAKGRYISFLDSDDIWYHRKLEKQVYFMKEN-DLAVTCSSYYI 122 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 + L + K N IGN + ++ ++ +DY ++ Sbjct: 123 VDAKGKKNKTRLV---REKILYSDMLKSNHIGNLTGIYDCIKLGKIYADDV-GHEDYTLW 178 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L++V + + E + + + ++ K S ++ YR D S Y + Sbjct: 179 LKVVQKSKTVNTISEPLAEYRVLNNSIS-SNKFKTLSWQWNIYRNILDLNVFRSSYYFVC 237 Query: 246 TLYQIRNKR 254 +Y KR Sbjct: 238 YVYYALKKR 246 >UniRef50_B9JE99 Glycosyl transferase n=1 Tax=Agrobacterium radiobacter K84 RepID=B9JE99_AGRRK Length = 751 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 47/282 (16%), Positives = 107/282 (37%), Gaps = 17/282 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAI-KSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M P+ SI +P++N + + +SV Q Y +WE+I+VDD ST E +++ ++D Sbjct: 220 MSFRPIFSIIVPSYNTPLQLMEKLYESVSSQWYPDWELIVVDDASTRLEA-LEFLKGISD 278 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ-QLVTHAF 118 R+ + N G N + +A G+Y+ +D DDE T + L + + Sbjct: 279 SRVKLEWSSENQGIAGATNSGVKIATGDYVVLLDHDDELTHDCLYELALCVNRENPDY-- 336 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKECLFDTELK 177 +Y++ + ++ + + + K +S + + + ++ Sbjct: 337 IYSD----EDKIDLEGRASEPHYKPDWSPDTMMSTMYVCHVSCIRRSLLLDVGGLRSQYD 392 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QD+D+ LR+ + + + G + + + K + R D R Sbjct: 393 GCQDWDLVLRLTEITKKIAHIPKVLYHWRTLPGSVSVALTEKPYV-LEATRRVRADAIKR 451 Query: 238 ASKKYQLF------TLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 + + ++ + + ++ R+ + Sbjct: 452 RGLDASVEEIPGFPGYFGVKYEPRPGTLISIIIPTRDNCGVL 493 Score = 90.9 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 47/259 (18%), Positives = 75/259 (28%), Gaps = 32/259 (12%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQ---DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 LISI +PT + + R ++S+ Q Y N E+IIVD+ S + L I Sbjct: 479 LISIIIPTRDNCGVLQRCMESL--QYVNSYKNIEIIIVDNGSRDQKTLIYLQELSAFSNI 536 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQ---- 112 + I D + N A GE I ++DD D L H Sbjct: 537 SIIRWDYPFSFSQLCNIGAAKASGELILFLNDDTEVVEHDSLARMAGFAQLDHIGAVGAK 596 Query: 113 --LVTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQVFTWAWR 166 + + + Y R N + + + + Sbjct: 597 LLYPGSNLIQHCGLLNLSDGPGHAYLRQDAGGWGYFGRSILDYNWLAVTGACLMASRSKF 656 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT-------SSP 218 FD A D ++ R+ + A + +H Sbjct: 657 DAVGGFDETFPVAYNDVELCFRLREAGYYSVMCQRAVLL---HHESFSRGLDDLDPVKLK 713 Query: 219 KKFSGYFHFYRKHKDKFDR 237 + YR+H D F R Sbjct: 714 RLREEMGRLYRRHPDFFQR 732 >UniRef50_Q8PZ42 Glycosyltransferase n=1 Tax=Methanosarcina mazei RepID=Q8PZ42_METMA Length = 307 Score = 243 bits (621), Expect = 5e-63, Method: Composition-based stats. Identities = 69/254 (27%), Positives = 110/254 (43%), Gaps = 8/254 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + P +S+ +PT+NR L RAIKSVL Q Y + E+IIVDD ST ++ V D Sbjct: 2 LDQKPKVSVIIPTYNRAHLIPRAIKSVLNQTYLDIEIIIVDDSSTDNT--EEIVKDFKDR 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK-QQLVTHAFL 119 R+ YI ++IN GA A RN I ++G+YI D DDEW N+L + + Sbjct: 60 RLKYIRHNINKGASAARNTGIRESRGKYIAFQDSDDEWFSNKLEKQIEAFADAPPEVGIV 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKEC-LFDTELK 177 Y+ Y + S L K K N +G V F+ F+ L Sbjct: 120 YSGFYRIEANRKLYLPSDQLSLKEGNIHNELLKGNFVGTPTVLMKKECFRNQRYFNESLP 179 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 A +D+++++ + +Y +++ + + + + + H D F + Sbjct: 180 ALEDWELWIELS-KYYTFRYIKKPLLCSYSTPNSINLNEN-NMLKAHEIILLTHLDDFSK 237 Query: 238 ASKKYQLFTLYQIR 251 +KK Y I Sbjct: 238 -NKKNLSEHYYDIG 250 >UniRef50_A6ELC1 TuaG n=3 Tax=Bacteroidetes RepID=A6ELC1_9BACT Length = 269 Score = 243 bits (621), Expect = 5e-63, Method: Composition-based stats. Identities = 67/233 (28%), Positives = 100/233 (42%), Gaps = 5/233 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + PL+SI P +N + I+SVL Q YSNWE++IVDD ST NDP Sbjct: 12 MVSKPLVSIITPLYNAAPFIEKTIQSVLDQTYSNWELLIVDDASTDNGLQIVRDFFGNDP 71 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N IN GA RNQA A G+YI +D DD W N+L L Q+ + Sbjct: 72 RIKFYPNTINKGAAHCRNQATSEATGDYIAFLDSDDLWHKNKLEKQLKFMQKH-DCDVSF 130 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + PY ++ +K N IGN + + ++ Q Sbjct: 131 TSYLQIDQNGNLLGRQVQAIKTLPYKKQ--HKNNYIGNLTGMYNANSLGKIIAPNIRKRQ 188 Query: 181 DYDIFLRMVVEYGEPW-KVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 D+ ++L + + +P ++E ++ M K F FYR + Sbjct: 189 DWAVWLEAIKKSKKPALGLQENLAFYRVHLDSMSANKW-KLIKYNFAFYRTYL 240 >UniRef50_C4F932 Putative uncharacterized protein n=1 Tax=Collinsella intestinalis DSM 13280 RepID=C4F932_9ACTN Length = 974 Score = 243 bits (621), Expect = 5e-63, Method: Composition-based stats. Identities = 69/276 (25%), Positives = 109/276 (39%), Gaps = 18/276 (6%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 SI MPT+N + AI SVL Q YSNWE+ IVDDCSTS E L++Y+ +LN + Sbjct: 50 KFSILMPTFNVDVRWVSMAIDSVLAQSYSNWELCIVDDCSTSAE-LKEYLLSLNTDNVRV 108 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRL-SVFLAHKQQLVTHAFLYAND 123 D N G N A +A G+Y+ +D+DD TPN L +FL +Y+++ Sbjct: 109 RLLDTNLGISGATNCAADMATGDYLVLLDNDDMLTPNALFELFLRATSDNPD--IMYSDN 166 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 + L + K +S L + +G+ + F F E +QDY Sbjct: 167 DIIDEN----GNRLSVLFKPDWSPDLMLSQMYVGHLLAFRKNIFDAVGGFRNEFNGSQDY 222 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+FLR+ ++ + + K + F + D K Sbjct: 223 DLFLRLSLQSERISHISKILYSWRALPSSTAANPDSKPY-AQFAGLNAVQSYLDSKYGKG 281 Query: 243 QLFTL-------YQIRNKRMTWRTLLTLLSVRNGKR 271 Y +R + ++ ++ Sbjct: 282 AASAFETDDLFVYDVRYNVPNKPLVSIVIPTKDHAE 317 Score = 90.1 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 78/225 (34%), Gaps = 23/225 (10%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVTAL-NDP 60 N PL+SI +PT + + AI S++ Y N+E++I+D+ S + L D Sbjct: 302 NKPLVSIVIPTKDHAEDLRIAIDSIVERSSYQNFEILILDNNSEEASTHNYFDEILGKDD 361 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL 119 R+ I+ + N + A+G+ +++D + + + L + H + + Sbjct: 362 RVRVINASFPFNWSKLNNYGVRHAKGDVFVFLNNDTEVISEDWLDRLVEHAIRE-DVGVV 420 Query: 120 YANDYVCQGEVYSQPASL--------------PLYPKSPYSRRLFYKRN--IIGNQVFTW 163 G + + P++ +P+ L + + G + Sbjct: 421 GGLLLYPDGSIQHAGVIIGMGGWADHVYKGAQPIHYGNPFISPLVTRNVSAVTGACMAIS 480 Query: 164 AWRFKE-CLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQIL 205 F + F+ + D ++ L + + Sbjct: 481 REHFDQLGGFNEDFIVCGSDVELCLNAMEHGMRNVYSPYIRLKHY 525 >UniRef50_B7AUE9 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUE9_9BACE Length = 581 Score = 243 bits (620), Expect = 6e-63, Method: Composition-based stats. Identities = 53/269 (19%), Positives = 89/269 (33%), Gaps = 11/269 (4%) Query: 3 NNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDD-CSTSWEQLQQYVTALNDP 60 + P +SI +P +N Q I V Q YSNWE+ + D S S D Sbjct: 63 HEPKVSILIPMYNTPLQFFKELIDCVQAQTYSNWELCLADGTGSVSEAGEYAVQCMAKDS 122 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y D N G N+A+ +A GE+I +D DD TP+ L + + +Y Sbjct: 123 RIVYKMLDSNDGISENTNRALEMATGEFIALMDHDDVITPDALYHLVKAVNEDAMADSVY 182 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELKAA 179 +++ + + K ++ N I + T + F A Sbjct: 183 SDEDKMDMDGRKL---FEPHFKPDFNIDYLRSCNYICHMFMTRTSIAREVGGFRKPFDGA 239 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QD+D R + V + K + Y + + RA Sbjct: 240 QDFDFIFRCTEKSRHVAHVPRVIYHWRCHVNSTAAVPESKMY-AYHAGVKSINEHCQRAG 298 Query: 240 K----KYQLFTLYQIRNKRMTWRTLLTLL 264 + Y + + + ++ Sbjct: 299 IPMTGELGTSYGYYRDVPVLDRKPSVDII 327 >UniRef50_B3PF93 Glycosyl transferase, putative, gt2I n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PF93_CELJU Length = 304 Score = 243 bits (620), Expect = 8e-63, Method: Composition-based stats. Identities = 66/276 (23%), Positives = 113/276 (40%), Gaps = 15/276 (5%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+S+ + T+N Q +A++S+L Q + N E+I+VDD ST + N Sbjct: 10 MQTQPLVSVVIATYNMGQYLAQAVESILNQTWKNLEVIVVDDGSTDDTEQVMQAYQAN-T 68 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ Y+ IN G +N I +G+YI D DD W P +L V L +Y Sbjct: 69 RVYYLK-TINQGQPKAKNTGINNTKGDYIAFCDADDFWEPFKLEVQLPLFN-NPAVGVVY 126 Query: 121 ANDYVCQGEVYS--QPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA 178 + + +PA+ + ++ + G V A + +FD E + Sbjct: 127 SEVSNIDEQNRRYVRPANEVRHSGQVTNQLMIENFVPFGTSVIRRACIERNGVFDEEFRM 186 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK---KFSGYFHFYRKHKDKF 235 D+D++LR ++ E E T + + G+M + F KHK + Sbjct: 187 GIDWDLWLRYSLD-WEFAYTPERTYVYRVWSGQMSTNYRGRYDFAIRILNKFVTKHKPQL 245 Query: 236 DRASKKYQLFTLY------QIRNKRMTWRTLLTLLS 265 + + +Y RN++ W L +L Sbjct: 246 NPRYVRKAWADMYISKGVVYARNEKRFWMPLQHILH 281 >UniRef50_D2RIR4 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIR4_ACIFE Length = 256 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 2/229 (0%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ MPT+N + + +I SVL Q ++WE+ IVDDCST Q P+I Y Sbjct: 2 LVSVVMPTYNCGKYILESIDSVLAQTITDWELQIVDDCSTDDTQEILKPYLEKYPQIHYY 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N G R AI A+G+Y+ +D DD W P +L ++ + AF + Sbjct: 62 RLSQNGGPAVARTVAIERAEGKYVAFLDSDDLWLPEKLEKQISFM-KNTKSAFSCSAYAC 120 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 E S +L K+ Y+ + N IGN + +K D+ ++ Sbjct: 121 MDDEGNSLHYALFPPEKTDYA-KCIRLSNPIGNLTAIYDQEILGKFQVPPIKKRNDFALW 179 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 L+++ + ++E + ++ K+ ++ Y + Sbjct: 180 LQILKKTTFCAGIQEVLGVYRTGRKGSVSSNKLKQAKYHWQLYHDIEHH 228 >UniRef50_A0YST1 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YST1_9CYAN Length = 297 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. Identities = 51/229 (22%), Positives = 100/229 (43%), Gaps = 6/229 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+IS+ +P +N ++ + I+SVL Q + +WE+I+VDD ST ++ V + D RI Sbjct: 2 PIISVIIPAYNAEKTIKKTIQSVLNQTFLDWELIVVDDGSTDST--KEVVAEIKDNRIFL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 N+G A RN+ + + GE++ ID DD WTP++L + L Q+ A Y+ Sbjct: 60 FSFP-NAGVSAARNRGVNKSSGEFLAFIDADDLWTPDKLELQLKALQEHPKAAVAYSWTD 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNI--IGNQVFTWAWRFKECLFDTELKAAQDY 182 + + +L + + N + K FD + +D+ Sbjct: 119 YIDESDQIIYSGRHITLNGNIYEKLLVRYVLENGSNFLIRKDAYLKVGGFDESIFGVEDW 178 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 D+ +R+ Y + + + + ++ + S ++ + + Sbjct: 179 DLSIRLSQLY-KFVAIPKTQVLYRMSPHSITSNFSKQELESLRVIEKAY 226 >UniRef50_C3WFL5 Glycosyl transferase n=4 Tax=Bacteria RepID=C3WFL5_FUSMR Length = 257 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. Identities = 64/261 (24%), Positives = 116/261 (44%), Gaps = 6/261 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ N ISI P +N ++ I SV++Q Y+NWEM+IVDDCS+ D Sbjct: 1 MEEN-KISIITPLYNGEKYIEETILSVIKQTYNNWEMLIVDDCSSDNSPNIVKKYVEQDK 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI +I + NSGA RN+AI +A+GEYI +D DD W +L + ++ +A + Sbjct: 60 RIKHIRLEKNSGAAIARNKAIEIAEGEYIAFLDSDDLWKKEKLEKQINFMKEN-NYAISF 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + + K R N IG ++ + ++ L+ Q Sbjct: 119 TEYEEIDEDGRKLNILIRVPKKPVTYRSYLLT-NPIGCLTGMYSVKKLGKVYMPNLRKRQ 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D +L+++ + + + ++E I IN + + F +++ YRK +K Sbjct: 178 DTGFWLKIL-KLDKAYSLKENLAIYRINSSSLSFKKTD-LFKYHWYLYRKI-EKLTIVES 234 Query: 241 KYQLFTLYQIRNKRMTWRTLL 261 + + T + ++ + + Sbjct: 235 LFYILTTIITKIFKLKEKKIR 255 >UniRef50_Q9K6L6 Glycosyltransferase n=1 Tax=Bacillus halodurans RepID=Q9K6L6_BACHD Length = 303 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 65/250 (26%), Positives = 108/250 (43%), Gaps = 6/250 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +SI +PT NR +L RA++S L Q Y N E+I+V D ST + D R+ +I Sbjct: 2 KVSIIIPTHNRAKLLKRALESTLNQTYKNIEVIVVSDGSTDNTDIVMDEY-KRDSRVNFI 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 G RN I A+GE+I +DDDDEW P++L + + Q +Y + Sbjct: 61 SYHPAKGGNYARNTGIKNAKGEFIAFLDDDDEWMPDKLELQIKEFNQNANVGLVYTGVEI 120 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRF-KECLFDTELKAAQDY 182 L K+ + N IG + V + +FD +LKA QDY Sbjct: 121 IYNFNKRNIKYYSLPKKTGNLSKEILVANCIGTTSSVMVRKNLITECGMFDEKLKARQDY 180 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY-RKHKDKFDRASKK 241 D+++R+ + V + + QI+ KK+ + K+ D + + S++ Sbjct: 181 DLWIRVCQKT-LVGVVNKPLVRYYNYTTNKQISDDIKKYESAIEYIDNKYVDLYSKVSEE 239 Query: 242 YQLFTLYQIR 251 + + + Sbjct: 240 IRRKHRHSMT 249 >UniRef50_A9AAL8 Glycosyl transferase family 2 n=2 Tax=Methanococcus maripaludis RepID=A9AAL8_METM6 Length = 280 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 57/272 (20%), Positives = 102/272 (37%), Gaps = 4/272 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 +NNPL+S+ MP +N ++ AI+S+L Q Y ++E II+DDCST A D R Sbjct: 5 ENNPLVSVIMPNYNNEKYLAEAIESILNQTYEHFEFIIIDDCSTDNSWHIIQNYAKKDNR 64 Query: 62 ITYIHNDINSGACAVRNQAIMLAQ--GEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 I N++N RN+ L +YI D DD P RL + ++ V + + Sbjct: 65 IKMFRNEVNLKIVKTRNKGFELMSPHSKYIAIFDSDDVSMPERLEKQINFSEKNVDYGLI 124 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKA 178 ++ + + + K V +D + Sbjct: 125 GSHTLIIDENSKIIGYRKYETDLKKIIKNMVKKNQFAQPSVMIRKSVLDDVGYYDEKYTR 184 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QDYD++ R+ + + ++E I+ + + T + +K Sbjct: 185 CQDYDLWFRIAKSF-KIENLDEFLIKYRISKTQGKSTHLKESLDFTLDIQKKWLFDKKFF 243 Query: 239 SKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGK 270 S + + + + + +L L V N Sbjct: 244 SIFGVFYYIGEHLMYFLPSKVVLKLFKVLNYD 275 >UniRef50_B5L396 WfdV n=10 Tax=Proteobacteria RepID=B5L396_ECOLX Length = 267 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 62/259 (23%), Positives = 101/259 (38%), Gaps = 6/259 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ PT+N ++ I SVL Q Y+N EMIIVDDCS+ D R+ I Sbjct: 7 LVSVITPTYNSERTIRCTIDSVLSQSYNNIEMIIVDDCSSDNTVSICREYQKKDSRVVLI 66 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N GA RN AI A+G +I +D DD W ++ + +AF Y + Sbjct: 67 SNEKNYGAGMSRNIAISKAKGRFIAFLDSDDFWHKEKIKKQIDFMLSN-NYAFTYTSYQK 125 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 +S + P S + K N+IG + +F ++ QD ++ Sbjct: 126 V---SFSGKLLSQINPVSKVNYSELLKTNVIGCLTAVYDTSSLGKMFMPTIRKRQDMALW 182 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L ++ + + + E + + ++ K + FYRK+ F Y Sbjct: 183 LSILDKIDYAYCLNETLAYYRVGTDTLS-SNKLKIIFSQWAFYRKYLK-FGLFKSLYYFS 240 Query: 246 TLYQIRNKRMTWRTLLTLL 264 + L Sbjct: 241 HYAVNAVAKHHINPLCLFF 259 >UniRef50_B1ZW34 Glycosyl transferase family 2 n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZW34_OPITP Length = 303 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 62/301 (20%), Positives = 113/301 (37%), Gaps = 29/301 (9%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+SI +P N ++ +++S L Q Y NWE+I VDDCST A +P Sbjct: 1 MQQRPLVSIVLPVHNGERYLRESVRSCLDQTYQNWELIAVDDCSTDGTAAMLDSFAAREP 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ + N N A N ++GE +T DD+ + PN L ++ + +Y Sbjct: 61 RMRVLRNPANLRLPASLNVGFAASRGEILTWTSDDNLYLPNALEEMVSALMEHREAMMVY 120 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG-NQVFTWAWRFKECLFDTELKAA 179 A + P+S N++ + +D L+ Sbjct: 121 AAQDIIDESGAVVVRYPSGSPES------LCHFNVVNACFAYRRKVLETVGGYDPILELV 174 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH------------- 226 +D+D +LR+ Y E ++E + N + + +++ Sbjct: 175 EDWDYWLRIARRY-EIIQLECCLYLYRANPASLSQQTGDRRWKVEERLLEMRLPELRWKW 233 Query: 227 -------FYRKHKDKFDRASKKYQLFTLYQ-IRNKRMTWRTLLTLLSVRNGKRLADGIRG 278 F R ++ R ++ + +R + R L+ + GKR+ +R Sbjct: 234 PAGYARGFIRLCHPRWQRGDRRTAWRHFGRALRYFPVALRHWRALIPLCLGKRVYYALRA 293 Query: 279 R 279 R Sbjct: 294 R 294 >UniRef50_A5ZV71 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=A5ZV71_9FIRM Length = 250 Score = 241 bits (615), Expect = 2e-62, Method: Composition-based stats. Identities = 61/230 (26%), Positives = 96/230 (41%), Gaps = 6/230 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ PT+N + I+SV Q Y WEMIIVDDCST + D RI Y Sbjct: 5 LVSVITPTYNCAKFIGETIESVQAQTYQQWEMIIVDDCSTDNTKEIVDKYIKEDSRIKYF 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + NSGA R +A+ LA GEY+ +D DD W +L LA ++ AF Sbjct: 65 CLENNSGAAVARTKAMELANGEYMAFLDSDDIWPEEKLKKQLAFMKKH-DVAFSCTAYEQ 123 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD-TELKAAQDYDI 184 + PK+ Y+R L +GN + K F+ ++ D + Sbjct: 124 IDENGKVLNKIIKTVPKADYNRVLL--DCPVGNSTVMY-NVEKMGKFEVPNIRKRNDDAL 180 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 +L+M+ + + + I + ++ K ++ YR + Sbjct: 181 WLQMLKKEKYIYGMRSVLMKYRIRQNSIS-SNKFKVIKYHWILYRDIEHL 229 >UniRef50_O87159 WblC protein n=1 Tax=Vibrio cholerae RepID=O87159_VIBCH Length = 340 Score = 241 bits (615), Expect = 2e-62, Method: Composition-based stats. Identities = 51/258 (19%), Positives = 96/258 (37%), Gaps = 8/258 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + P IS+ M +N ++ +AI+S+L Q +S++E IIVDD ST D R Sbjct: 3 SDTPTISVIMSVYNGEKYLAQAIESILNQTFSDFEFIIVDDGSTDSSLSIIQAYMDKDDR 62 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I +N G N+AI +++ YI +D DD P RL LA + Sbjct: 63 IVLISR-VNKGLPYSLNEAISVSKANYIARMDADDISLPERLETQLAVMENNPDIGVCGT 121 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC--LFDTELKAA 179 Y+ + RL + I V + +++ + + Sbjct: 122 LAYLFRETPSKNKMMCHPEDHDSLIIRLLFSVCFIHPVVMIRKSVLDQLDYVYNENFRNS 181 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP----KKFSGYFHFYRKHKDKF 235 QDY+++ R+ + + +++ + + K+F K +K Sbjct: 182 QDYELWSRIAEKT-RFYTIQKPLLFYRDTPDGITSKVNHDGFNKRFPLVSQVQTKQLEKL 240 Query: 236 DRASKKYQLFTLYQIRNK 253 ++ +++ Sbjct: 241 GLKLNDEEMIMHFRLGLN 258 >UniRef50_B4S4G2 Glycosyl transferase family 2 n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S4G2_PROA2 Length = 334 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 50/228 (21%), Positives = 95/228 (41%), Gaps = 4/228 (1%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+++ MP +N ++ A++S+L Q Y N+E +++DD ST V + D RI Sbjct: 3 PLVTVLMPVYNGEKHLADAVRSILDQTYRNFEFLVMDDGSTDKS--LDIVKSFEDERIIV 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 +D NSG N+ I LA+G+YI +D DD P+RL + + + + Sbjct: 61 KRSDRNSGIAKTLNKGIALARGKYIARMDCDDISLPDRLRRQADYLETHSDTGLIGSGIQ 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDI 184 + Q + L ++ + A ++ +D EL AQDY + Sbjct: 121 KI-KKSRKQKVFTWPSKDTEIKLDLLFQSAFFHPTIMARASLLRKIGYDEELLYAQDYAL 179 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 + ++ E + EA + + ++ ++ + + Sbjct: 180 WTKLAPETA-FANLPEALLLYRTHDEQVTKKKGKQQAAIARLVKENYL 226 >UniRef50_B2J2I1 Glycosyl transferase, family 2 n=7 Tax=Cyanobacteria RepID=B2J2I1_NOSP7 Length = 358 Score = 240 bits (614), Expect = 3e-62, Method: Composition-based stats. Identities = 54/234 (23%), Positives = 100/234 (42%), Gaps = 6/234 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N LIS+ +P +N ++ + I+SVL Q + N+E+I+++D S + + + D RI Sbjct: 7 NKTLISVIIPAYNSEKTIKKTIESVLNQTFHNFELIVINDGSQDST--LEVIRKIPDSRI 64 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 N+G RN+ + A G++++ +D DD WTP++L L Q+ VT Y+ Sbjct: 65 QVFSYP-NAGGNVSRNRGLNHAVGKFVSFLDADDVWTPDKLQHQLKALQKNVTAKVAYSW 123 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI--IGNQVFTWAWRFKECLFDTELKAAQ 180 + + + L + N + FD L AAQ Sbjct: 124 TDYIDEKGEFILSGKRVTANGNVYESLLLNNFLENGSNPLICRKALITLGGFDESLGAAQ 183 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 D+D++LR+ ++ V + I + ++ S R +K++ Sbjct: 184 DWDMWLRLAYKFNFI-CVPSVQILYRITPNSVSCNLVRQEKSCLQVLERAYKER 236 >UniRef50_Q60B60 Glycosyl transferase, group 2 family protein n=1 Tax=Methylococcus capsulatus RepID=Q60B60_METCA Length = 367 Score = 240 bits (614), Expect = 3e-62, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 89/235 (37%), Gaps = 7/235 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P ++I MP +N ++ A++S+L Q + ++ ++I+DD S+ + DPR+ Sbjct: 28 HAPRVTILMPVYNGEKYLAAAMESILDQTFRDFILLIIDDGSSDSSLAIA--RSFGDPRV 85 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 N N G N+ + L Q E++ +D DD P+RL +A + Sbjct: 86 QVERNPKNLGLVKTLNRGLDLVQTEFVARMDCDDIALPDRLEKQIAFLDENPDIGMCGTA 145 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL--FDTELKAAQ 180 + + P L + + V + + + A+ Sbjct: 146 YELFHESLRQTI--RPPCRHEEIVYGLLDDNVFLHSSVIVRMEVLNRHGLRYREDYRLAE 203 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 DY+++ R+ Y + + + + T+ ++ + +H + F Sbjct: 204 DYELWARLA-RYTHIGNLPQVLVRYRSHPENVSNTNKGEQIATRDRVRLEHLESF 257 >UniRef50_C0E8J2 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0E8J2_9CLOT Length = 841 Score = 240 bits (614), Expect = 4e-62, Method: Composition-based stats. Identities = 61/281 (21%), Positives = 107/281 (38%), Gaps = 15/281 (5%) Query: 7 ISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVD--DCSTSWEQLQQYVTALNDPRIT 63 SI +P +N + I+SVL Q YS WE+ + D D S + A D R+ Sbjct: 313 FSILVPLYNTPEAYLHEMIQSVLDQTYSKWELCLADGSDAEHSEVERICLEYAGRDSRVR 372 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y + N G N + +AQGE+I+ D DD P+ L + + F+Y ++ Sbjct: 373 YKRLEQNYGISGNTNACMKMAQGEFISLFDHDDILHPSALFETMMAITEQ-GADFVYTDE 431 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELKAAQDY 182 V +G+ + ++ K +S N I + A K F +QDY Sbjct: 432 MVFEGQPDNVTL---IHFKPDFSIDTLRGHNYICHFSSFSAELARKVGYFSENHNGSQDY 488 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+ LR+ + + + + G + S K + + D +R + Sbjct: 489 DMVLRLCEQAKRIVHIPQVLYFWRSHPGSVASDVSVKPY-CMVSAKKALADHLERVGLRG 547 Query: 243 QLF-----TLYQIRNKRMTWRTLLTLLSVRNG-KRLADGIR 277 + + Y+I + + L+ ++ L IR Sbjct: 548 TVTDSTIPSTYKINYEIEGEPLVSILIPNKDYTADLDKCIR 588 Score = 92.1 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 74/228 (32%), Gaps = 26/228 (11%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 ++ PL+SI +P + + I+S+ YS +E+II+++ S E Y + + Sbjct: 564 IEGEPLVSILIPNKDYTADLDKCIRSIQDKSTYSRFELIIIENNSELPETFAYYDSIQKE 623 Query: 60 -PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLV--- 114 + + A+ N A GEYI +++D + TPN L L Q+ Sbjct: 624 FSNVKVVQWKGPFNFAAINNFGFSFATGEYILMLNNDVEVLTPNWLEEMLMFAQREDVGA 683 Query: 115 -THAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--------------NQ 159 Y +D + + + + + R I Sbjct: 684 VGAKLYYPDDTIQHAGIIVGIGNTAGHAHKGFPRTD---GGYIHRLTIAQDLSAVTGACL 740 Query: 160 VFTWAWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQIL 205 + + + D + A D D LR+ A Sbjct: 741 MTRRSVWEQLGGMDEDFVVAFNDVDFCLRIRQAGYLVVFTPYAELYHY 788 >UniRef50_A4EB36 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4EB36_9ACTN Length = 828 Score = 240 bits (613), Expect = 4e-62, Method: Composition-based stats. Identities = 54/283 (19%), Positives = 100/283 (35%), Gaps = 16/283 (5%) Query: 4 NPLISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+SI +P + + + SVL Q Y NWE++++ D S W+ + +D R+ Sbjct: 288 RPLVSIVVPCYKTDRVYLRELLDSVLAQSYDNWELLLM-DASPEWDAVADLAAGAHDERV 346 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT---HAFL 119 N G N I A G+YI +D DD P+ L ++A L Sbjct: 347 RRFGLPGNGGIVVNTNAGIEQAMGDYIAFLDHDDILEPDALFHYVAALNNAAEGERPQVL 406 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD-TELKA 178 + ++ + Q K+ + L Y N + + + ++ Sbjct: 407 FCDEDMFQ----KTGEWGQPVFKTKLNVDLLYSHNCVTHFLMVEKALIDRIGTSPEDVAG 462 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 AQDYD+ LR + V ++ G S+ K + F+ Sbjct: 463 AQDYDLTLRCLAAGARFEHVAHVLYHWRVHPGSTADGSADSKPYAIEAGRLALQRHFNAL 522 Query: 239 SKKYQLF-----TLYQIRNKRMTWRTLLTL-LSVRNGKRLADG 275 + +Y++R L+++ + ++ D Sbjct: 523 GVHGTVEETETPFVYRMRYALPEPSPLVSIVVPTKDHVETLDA 565 Score = 99.8 bits (248), Expect = 9e-20, Method: Composition-based stats. Identities = 38/274 (13%), Positives = 92/274 (33%), Gaps = 40/274 (14%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCS----------TSWEQ 49 + +PL+SI +PT + + + S+ Q Y+N+E+++V++ S + E+ Sbjct: 545 EPSPLVSIVVPTKDHVETLDACVMSI-AQKATYANYEIVLVENNSEAPETFAYYESLPER 603 Query: 50 LQQYVTALNDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLA 108 + R+ Y + N + N + A+G+Y+ +++D + +P+ + Sbjct: 604 VAAASEGKGVARVVYWLGEFNY--SQIINFGVEHAKGDYLLLLNNDTEVISPDFTEEMMG 661 Query: 109 HKQQLVTHAFL-----YANDYVCQGEVYSQPASLPLYPKSPYSRRL-------FYKRNII 156 + Q+ + +A+ V + + +S + N Sbjct: 662 YLQR-PDAGVVGAKLYFADHLVQHAGILVGVRGALAHANQDFSAKREGYLARAVRPGNFS 720 Query: 157 G----NQVFTWAWRFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEA-TQILHINHG 210 Q+ + ++ E D D LR+ A Sbjct: 721 AVTGACQMVRRDVFEQVGGYNEEFAVGFNDADFCLRVWEAGYRTIFTPYAELYHY----- 775 Query: 211 EMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQL 244 E + +++ + F + ++ L Sbjct: 776 EFTSRGREEANEEKLRRWKREQALFMQRWPEFFL 809 >UniRef50_C0QK40 Putative glycosyl transferase (Group 2 family protein) n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QK40_DESAH Length = 565 Score = 239 bits (612), Expect = 6e-62, Method: Composition-based stats. Identities = 65/285 (22%), Positives = 103/285 (36%), Gaps = 13/285 (4%) Query: 3 NNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY--VTALND 59 + PLISI P +N L I ++S L Q Y WE+I+VDD S L + + D Sbjct: 97 DQPLISIVTPLYNTPSNLLIECVESALFQSYQQWELILVDDASNDRSTLNCLSRIARIKD 156 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI + N N G C N+ + A GEYI +D DD TPN L + Sbjct: 157 PRIKVLTNRKNRGTCQSTNRGVRRASGEYIAFLDHDDRITPNALFETARTISGHDRPDLI 216 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 Y++ + K +S N + + + F + + + + Sbjct: 217 YSDRDLI----SPSGKRHDRISKPAWSPETLLSGNYLFHLMVAKRSLFLELGGYRKDFEG 272 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QD D LR+ + + K F + + +D DR Sbjct: 273 SQDLDFVLRLAEVTHRVVHIPKVLYNFRQAETSCAYDMEAKSFI-FEKGVKAVQDALDRR 331 Query: 239 ---SKKYQLFTLYQIRN-KRMTWRTLLTLLSVRNGKRLADGIRGR 279 K Y+L ++ R+ R + LA+ + R Sbjct: 332 GINGKAYELEHAWRGNYGIRLKNRLAHDTIIYNRHPNLAEFLNHR 376 >UniRef50_A0KM86 Glycosyl transferase, group 2 family protein n=2 Tax=Gammaproteobacteria RepID=A0KM86_AERHH Length = 268 Score = 239 bits (612), Expect = 6e-62, Method: Composition-based stats. Identities = 60/258 (23%), Positives = 101/258 (39%), Gaps = 8/258 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ MP +N + +I SVL+Q Y+NWE+I++DDCS + D RI + Sbjct: 3 KVSVIMPLYNSARFLAASIDSVLKQQYTNWELILIDDCSVDDCVVIAQAYCDRDSRIKLV 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 NSGA RN+ I +A+G +I +D DD W P +L+ + + AF Y Sbjct: 63 RLAENSGAAVARNRGIEIAEGRFIAFLDSDDLWLPEKLNTQIDFMLKNK-VAFSYTAYKK 121 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 L S S R K +IG + ++ +DY ++ Sbjct: 122 IDEYGNDL---FELGVPSSLSYRALLKACVIGCLTVIYDASQIGKVYMPLGTKREDYALW 178 Query: 186 LRMVVEYGEP-WKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQL 244 L+++ E + + + I + + + R ++ S Y Sbjct: 179 LKILRERNVVAYGINKVLASYRIYPHQ-SSSKKLNMAKENWKLLRNQEELCLIISLHY-- 235 Query: 245 FTLYQIRNKRMTWRTLLT 262 FT Y +R L+ Sbjct: 236 FTQYALRGVLRDKYPLIA 253 >UniRef50_Q0ACF0 Glycosyl transferase, family 2 n=4 Tax=Ectothiorhodospiraceae RepID=Q0ACF0_ALHEH Length = 597 Score = 239 bits (612), Expect = 6e-62, Method: Composition-based stats. Identities = 65/301 (21%), Positives = 113/301 (37%), Gaps = 34/301 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ MP +N AI SVL QDY + E+I++DD S+ + R+ Sbjct: 281 PLVSVIMPAFNAASYIEEAIDSVLAQDYPHKELIVIDDGSSDDTVARVQAYG---DRVRL 337 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G+ RNQ + AQGEYI +D DD W P +L+ + + + +Y++ Sbjct: 338 LTQ-ANQGSAVARNQGLDAAQGEYIAFLDSDDVWLPGKLTAQVGYLEAHPDVGMIYSDWL 396 Query: 125 VCQGEVYSQ------------------PASLPLYPK--SPYSRRLFYKRNIIGNQVFTWA 164 + + S+ P +PL + RL + + V Sbjct: 397 PWKRDKQSKAFPPPEALAPATPDTGVPPEEIPLLTEGSGWLYNRLLFGSLLHTITVMARR 456 Query: 165 WRFK-ECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG 223 + FD ELK QDYD +LR + E +++ + ++ G IT P + Sbjct: 457 ELIEQVGRFDPELKRGQDYDYWLR-ASRHTEIHQLDRVFALYRLH-GSGCITQWP-DINY 513 Query: 224 YFHFYRKHKDKF------DRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 K ++ S + + + + R + + +R Sbjct: 514 EKLVVEKALARWGLEGPTGERSDRKAVERRLAGTCFDFGYHHYWSGNPRRASRSFLEALR 573 Query: 278 G 278 Sbjct: 574 H 574 >UniRef50_Q0VR09 Glycosyl transferase, group 2 family protein, putative n=2 Tax=Gammaproteobacteria RepID=Q0VR09_ALCBS Length = 274 Score = 239 bits (610), Expect = 9e-62, Method: Composition-based stats. Identities = 61/262 (23%), Positives = 104/262 (39%), Gaps = 6/262 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +SI P +N + R I+SVL Q++ +WE+++VDD ST D RI Sbjct: 7 PEVSIITPCYNARHTLARTIESVLSQNFLSWELLLVDDKSTDDTDQHAQAFCDRDNRIKL 66 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I NSGA RN+ I A+G YI ID DDEW ++LS+ ++ ++ Y Sbjct: 67 IRLKENSGAAVARNRGIASARGRYIAFIDADDEWHKDKLSIQISQMKEQ-GWPLSYTAYT 125 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDI 184 E R K N IG + +F EL+ QD+ + Sbjct: 126 RINPEGACINTVGVP---PTIHYRQLLKTNYIGCSTAVYDSEQLGKVFMPELRKRQDFGL 182 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQL 244 +L+++ + + + A ++ + ++ + YR +++ + Y Sbjct: 183 WLKILKKTTSGYGINLALTRYCVHQDSLS-SNKRNTAGYNWRLYRD-EEQLNAIMASYYF 240 Query: 245 FTLYQIRNKRMTWRTLLTLLSV 266 R + L L + Sbjct: 241 ANYALRGVLRSRFPKLALFLGI 262 >UniRef50_UPI0001BC32BF glycosyl transferase family 2 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC32BF Length = 815 Score = 239 bits (610), Expect = 9e-62, Method: Composition-based stats. Identities = 57/282 (20%), Positives = 97/282 (34%), Gaps = 13/282 (4%) Query: 5 PLISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P SI +P + Q IKS+ Q YSNWE+ I D + + ++ Sbjct: 285 PKFSIIIPVYRPVPQYFTDMIKSIKHQTYSNWEICIADGGGEGHTVDKTLFGLVKGDKVR 344 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y+ N G N+A+ +A G+Y D DD + P+ L +Y+++ Sbjct: 345 YVAISENLGISGNTNEAMKMATGDYFVLGDHDDVFEPDALFECARAINSKEKPDMIYSDE 404 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELKAAQDY 182 + Y K ++ L N I + K F E AQDY Sbjct: 405 DKITEDG---KKHCEPYFKPDFNIDLLRSNNYICHLFVFSRELSEKVGYFRKEFDGAQDY 461 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D+ LR + + + I G K++ + R + F R K Sbjct: 462 DMILRCSEKAKCIKHIPKVLYSWRIYSGSTSANPESKRY-AFTAGKRAIDEHFKRMGIKA 520 Query: 243 QLFT------LYQIRNKRMTWRTLLTLLSVRNG-KRLADGIR 277 + +Y+ R + + L+ ++ L I+ Sbjct: 521 EAEMNETYLGIYRSRYEITGNPLISILIPNKDHTDDLDKCIK 562 Score = 88.2 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 80/221 (36%), Gaps = 19/221 (8%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND-PRI 62 NPLISI +P + + IKS+L+Q Y N+E+I++++ ST + Y D ++ Sbjct: 541 NPLISILIPNKDHTDDLDKCIKSILKQKYENYEIIVIENNSTEDATFEYYKKLEKDCNKV 600 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLV----THA 117 ++ + N A+G Y+ +++D + + L+ L++ ++ Sbjct: 601 KVVYYKDKFNYSKINNFGRKEAKGGYLLLLNNDTEMINDDCLNELLSYTERDDVGITGAR 660 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII-----------GNQVFTWAWR 166 LY ++ V V L + + + + Sbjct: 661 LLYEDNTVQHAGVVIGYNGLAGHTFVGNEPDDVGYHAYVIQARDYSAVTAACLMVKASLY 720 Query: 167 FKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEE-ATQIL 205 + F+ EL+ A D D L++ + Sbjct: 721 DEVGGFEEELEVAFNDVDFCLKVREKGYLVVYNPYAVLYHY 761 >UniRef50_C0FUT5 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FUT5_9FIRM Length = 519 Score = 239 bits (610), Expect = 9e-62, Method: Composition-based stats. Identities = 57/283 (20%), Positives = 103/283 (36%), Gaps = 25/283 (8%) Query: 2 KNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY----VTA 56 + PL+SI P ++ ++ I+SV+ Y NW++ +VD S E+ + Y Sbjct: 75 EKRPLVSIITPLYHTPEKFLRELIESVMASSYDNWQLCLVDATSVEEERKKIYQIVSEYQ 134 Query: 57 LNDPR------ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK 110 D R I Y N G N+A+ +A+G+YI +D DD TP+ L +A + Sbjct: 135 QKDNRDGGKQKILYEKLRENYGIAENTNRALFMAKGDYIAFLDHDDTITPDALYEMMAME 194 Query: 111 QQL----VTHAFLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR 166 Q++ Y+++ E + K Y+ L N I + + Sbjct: 195 QKMGIDSTRAVMFYSDEDKMNEEKT---CYYEPHFKPDYNLDLLCANNYITHFLMVSRVL 251 Query: 167 FK-ECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 + E AQDYD LR + + + +++ K + Sbjct: 252 LEEVGGIQGEFNGAQDYDFILRCTEKAENVVHIPKVLYHWRVHNASTSGGGGSKDY-ALQ 310 Query: 226 HFYRKHKDKFDRASKKYQ-----LFTLYQIRNKRMTWRTLLTL 263 R + R + F Y + ++ R + + Sbjct: 311 AGRRAIEQHLQRIGEYGTVQCTPYFGFYSVNYEQRNKRKITLI 353 >UniRef50_B8GFE2 Glycosyl transferase family 2 n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GFE2_METPE Length = 310 Score = 238 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 59/239 (24%), Positives = 103/239 (43%), Gaps = 8/239 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +S+ +PT+NR L AI+SVL Q Y + E+I+VDD S ++ + + D R Sbjct: 3 STTPTVSVILPTYNRAHLLPTAIESVLNQTYRDLELIVVDDGSKDST--EEVIRTITDER 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-LVTHAFLY 120 + YI + N GA RN I A+G +I D DD+W P++L + + +Y Sbjct: 61 LRYIRYEPNQGANHARNVGIQAARGPFIAFQDSDDDWFPDKLEKNMKLFEMVGPEVGVVY 120 Query: 121 ANDYVCQGEVYS--QPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 + + G P + L + F++ L+ L Sbjct: 121 SGYWKHMGNNQKMYIPFTWVKQRDGDVHHELMRGNFVTTQAAVVRKECFEKSGLWMEGLP 180 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 A Q++++FLR+ +Y + ++E + + T++ + G KH D+F Sbjct: 181 AKQEWELFLRISKDY-KFAYIDEPLLNSNFTETGISNTNTS-VYRGMEMVLNKHYDEFK 237 >UniRef50_B7KAG8 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KAG8_CYAP7 Length = 344 Score = 238 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 57/284 (20%), Positives = 113/284 (39%), Gaps = 28/284 (9%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 S+ +P +N ++ + I+SVL Q Y N+E++IV+D S D R+ I Sbjct: 3 KFSVIIPAYNVEKYIAKTIQSVLEQTYQNFELLIVNDGSLDRTLEICQ--QFTDSRLKII 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+G RN I +QGEYI +D DD W P +L + H ++ T ++ Sbjct: 61 SQ-KNTGLSGARNTGIRHSQGEYIAFLDGDDLWLPEKLQKHIQHLEKSPTVGVSFSRSAF 119 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN--QVFTWAWRFKEC------------- 170 + K+ + + N +GN + + Sbjct: 120 IDERGNLLNSYQMPILKN-ITPEILLHGNPVGNGSALVVRREVLEAIKFQNQLSDFQEYC 178 Query: 171 LFDTELKAAQDYDIFLRMVVEY-GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 FD + + A+D ++ LR+ +E + + EA + +N + + ++ + Sbjct: 179 YFDAQFRRAEDIELLLRIALETSWQIEGIPEALTLYRVNSQGLSA-NWQEQLHSWEKVIS 237 Query: 230 KHKD-------KFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSV 266 K K +++++++ YQL L + ++ + L + Sbjct: 238 KTKSYAPELVTRWEKSARAYQLQILARSTIRQKAGSVAVKLFNK 281 >UniRef50_B4VMI0 Glycosyl transferase, group 2 family protein n=2 Tax=Oscillatoriales RepID=B4VMI0_9CYAN Length = 325 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 57/227 (25%), Positives = 98/227 (43%), Gaps = 7/227 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 LIS+ +P +N ++ + I+SV+ Q +SN+E+I++DD S V +++D R+ Sbjct: 3 LISVVIPVYNGEKTIRQTIESVINQTFSNFELIVIDDGSQDST--LDIVRSISDSRLKVF 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+G RN+ I LA G+YI+ ID DD WT N+L L A ++ Sbjct: 61 SFP-NAGLATSRNRGIQLASGDYISFIDADDLWTGNKLEAQLKALLDNPQAAVAFSWTDC 119 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNI-IGNQVFTWAWRF-KECLFDTELKAAQDYD 183 +L I G+ + + F K F+ L AA+D+D Sbjct: 120 IDESGQFSRRGNYTTVSGNVYAQLLLTDFIENGSNLLIRSEAFKKVGYFNESLPAAEDWD 179 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 ++LR+ V Y E V ++ M + ++ + + Sbjct: 180 MWLRLAVNY-EFAVVPSPQVFYRLSTTSMSA-NVVRQEAACLQVIDQ 224 >UniRef50_C2SSX0 Glycosyl transferase family 2 n=1 Tax=Bacillus cereus BDRD-ST196 RepID=C2SSX0_BACCE Length = 300 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 72/282 (25%), Positives = 122/282 (43%), Gaps = 8/282 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ LIS+ +PT+NR +L R I SVL Q Y+N+E+IIVDD ST + +D Sbjct: 1 MEMEVLISVVVPTYNRSELIKRTIDSVLAQTYTNFELIIVDDASTDNTEDIVNEY--HDD 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-LVTHAFL 119 RI +I + NS RN I ++G+YI +D DDEW PN+L L + + Sbjct: 59 RIKFIKLNENSKGTKPRNMGIKESKGDYIALLDSDDEWLPNKLESQLNFLRAFNDDNMVC 118 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 + + + + + L+ S +F +N + + ++ + K+ LF+ LK Sbjct: 119 FTDLILKGTKKTVYSNNRDLFENEDISEYIFLGKNWVQTSTYMFSSKLGKQTLFNPTLKK 178 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QD+D LR+ + E I + + E +I ++ K + + Sbjct: 179 HQDWDFCLRLKKNGAKFVNFPEHLAIYYSDDREGRIGNNLK-YKQSLEWINSVSSDISNE 237 Query: 239 SKKYQLFTLYQ---IRNKRMTWRTLLTLLSVRNGKRLADGIR 277 K L + I N + + + L +NG + Sbjct: 238 VKHAFLVRVMTKPLIFNNQRSKALSIYLDGKKNGVIGLSALC 279 >UniRef50_Q12TY4 Glycosyl transferase, family 2 n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12TY4_METBU Length = 274 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 51/235 (21%), Positives = 93/235 (39%), Gaps = 5/235 (2%) Query: 1 MKNNP--LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN 58 M IS+ MP +N + +I+S+L Q Y+++E IIVDD ST + Sbjct: 1 MSEKESVKISVVMPAYNAEAYLKDSIESILDQTYTDFEFIIVDDASTDNSHEIIEEYSKK 60 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + N+IN G R + ++G+YI D DD RL + ++ Sbjct: 61 DKRIVVLRNEINLGIAETRTKGTKYSKGKYIAVSDADDISILTRLEKQYNYLEEHKDCGV 120 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSR-RLFYKRNIIGNQVFTWAWRFKE-CLFDTEL 176 + + + Y R RLF + F+ +D + Sbjct: 121 VGGFIELFDSDTSKIIGVRKYYEDDANLRKRLFLYCPVAQPVCMMRKEVFETLGYYDPKY 180 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 A+D D++ R+ +Y + ++E ++ I+++ K + +K+ Sbjct: 181 PPAEDLDLWFRIGTKY-KFANIQEILLKYRVHKKSATISTTKKMEAMTLKIRKKY 234 >UniRef50_C0CGY4 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CGY4_9FIRM Length = 255 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 61/261 (23%), Positives = 108/261 (41%), Gaps = 9/261 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M++ +SI P+WN ++ + IKSV Q Y NWEMIIVDDCST A DP Sbjct: 1 MQD--KVSIITPSWNSEKYIVDTIKSVQNQIYKNWEMIIVDDCSTDHTVDIIEAIAKEDP 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I +N GA RN ++ A G +I +D DD W P++L ++ + +F Sbjct: 59 RIKLIKQPVNGGAAKARNISLNEATGRFIAYLDADDIWKPDKLEKQVSFMKSKP-CSFSC 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT--WAWRFKECLFDTELKA 178 + V + + + P+ Y F N++ ++ L +++ Sbjct: 118 TSYEVVDDDGKPLNKQVHMLPEVDYVG--FLTNNLLQTVGIMVDIDIVDRKYLEMPDIRR 175 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QD +L+++ + + ++E + ++ K G ++ YR + Sbjct: 176 RQDAATWLQILKAGYKCYGLDEILAEYRRAGNSLS-SNKIKAIKGVWNLYRNI-EHLSLL 233 Query: 239 SKKYQLFTLYQIRNKRMTWRT 259 Y + + +R Sbjct: 234 FSCYCFIRYAVLAVWKRVYRK 254 >UniRef50_Q8DGK0 Tlr2317 protein n=2 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DGK0_THEEB Length = 356 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 64/269 (23%), Positives = 107/269 (39%), Gaps = 10/269 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N PL+S+ MP +N + AI+S+L Q + ++E II+DD ST A+ D R Sbjct: 11 SNRPLLSVLMPVFNAEAYVGEAIESILGQSFRDFEFIIIDDGSTDNSLSVLKRYAVQDAR 70 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I I + N G A N+ I LA+GE+I +D DD PNR V LA ++ F Sbjct: 71 IRLISRE-NRGLVATLNEGIALARGEWIARMDADDVAMPNRFIVQLAALERE-RVDFCGG 128 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + L + V F E +D + A+D Sbjct: 129 AIQCF---GALRSVCFYPLTHEACEVHLLFGVPFGHPTVIGRHSAFAELGYDPDYPYAED 185 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH----KDKFDR 237 Y+++ R + V E ++ G++ ++ + ++H + D Sbjct: 186 YELWQRAWGRGYKFVNVPEVVLRYRVHAGQVSARKKAEQQNTADAVRKRHWRALFPRMDD 245 Query: 238 ASKKYQLFTLYQIRNKRMTWRT-LLTLLS 265 Y + L+ + + LL+L+S Sbjct: 246 KEIDYIIKMLHDGKGETSRLLPCLLSLVS 274 >UniRef50_B8I318 Glycosyl transferase family 2 n=2 Tax=Clostridium RepID=B8I318_CLOCE Length = 280 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 61/282 (21%), Positives = 112/282 (39%), Gaps = 13/282 (4%) Query: 1 MKNN---PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL 57 M NN P IS+ MP +N +A+ SVL Q ++++E+I VDD S+ ++ Sbjct: 1 MNNNIIFPKISVLMPVYNTSNYVSKALDSVLAQTFTDFEIIAVDDGSSDNSMEILKDYSI 60 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 D R+ N+ N G N A+ A + +D DD +RL + Q+ Sbjct: 61 KDSRVKVFQNEENRGVSYSLNHALRHASAPLVARMDSDDIMVKDRLEKQYTYMQKNPDCV 120 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF--KECLFDTE 175 L + + + R+ F + I + + FD Sbjct: 121 VLGGQETYIDENDKITGNTKYPLSDAEIKRKFFQFQPIADPTSMYNRLKIPPEVFYFDES 180 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 L A+ D++ R+ +YG+ +E++ + G + T + F + RK Sbjct: 181 LTVAEGLDLYFRLF-KYGKFANLEDSIILYRQRQGSLFSTDIKRTFK-FISMVRK----- 233 Query: 236 DRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 RA K+Y + + + + + ++L +R +L D I+ Sbjct: 234 -RAKKQYGIKAPFTAGIINFSQKIITSILPLRVAVKLNDIIK 274 >UniRef50_A6BHD3 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BHD3_9FIRM Length = 410 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 57/237 (24%), Positives = 94/237 (39%), Gaps = 16/237 (6%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NP ISI +P +N ++ AI+SVL Q Y+N+E+IIVDD ST + A +D RI Sbjct: 2 NPKISIILPNYNNEKYLKEAIESVLNQTYTNFELIIVDDASTDGSLKI--IRAFDDKRIK 59 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I +++N N I A GEYI ID DD W +L + + + + Sbjct: 60 VITSEVNRHVAYASNLGIKYASGEYIAKIDSDDIWENQKLEKQIEFMENHKEYGGCFTKV 119 Query: 124 YVCQGEVYSQPASLPLYP----------KSPYSRRLFYKRNIIGN-QVFTWAWRFKE--C 170 + + + + + R + N + N F++ Sbjct: 120 NIIDEKSEDANIKYKVIFDLFEKAKNQSQKEWLRFFLSEGNCLCNSSSLIRKCIFEKIDG 179 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF 227 F+ A+DY++++R V+ Y + ++E S K Y Sbjct: 180 FFNLAYVGAEDYELWVRTVIRY-PIYVLDERLVRYRWEEDSENKISGLTKEKVYAAI 235 >UniRef50_O58167 Putative uncharacterized protein PH0430 n=1 Tax=Pyrococcus horikoshii RepID=O58167_PYRHO Length = 334 Score = 237 bits (606), Expect = 3e-61, Method: Composition-based stats. Identities = 67/258 (25%), Positives = 103/258 (39%), Gaps = 14/258 (5%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+S+ +PT+NR ++ RAI+SVL Q Y+N E+IIVDD S + D RI Y+ Sbjct: 5 LVSVVLPTYNRAKVLPRAIESVLNQTYTNIELIIVDDGSRDNTKEIIREFQSQDERIVYL 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK-QQLVTHAFLYANDY 124 N N GA A RN IM + GE+I +D DD W P +L + L ++ +Y+ Sbjct: 65 RNKRNLGANAARNIGIMHSTGEFIAFMDSDDMWLPWKLERQIKIMYSSLNSYPIVYSGFI 124 Query: 125 VCQG-EVYSQPASLPLYPKSPYSRR-------LFYKRNIIGNQVFTWAWRF---KECLFD 173 + S P S L N I + + LFD Sbjct: 125 RIYKTDKNKVILSYYPQKSPPVSADSTSIHLNLLTNANFISTPTVLIPALYIRQNKLLFD 184 Query: 174 TELKAAQDYDIFLRMVVEYG-EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK 232 ++ QD++ FLR+ + E I ++ + + K Y K+ Sbjct: 185 EKMPRLQDWEFFLRLSKHCNCNFRFIPEPLVIAYVQPDSISMNHK-KFIKALKILYTKYY 243 Query: 233 DKFDRASKKYQLFTLYQI 250 K + + L Sbjct: 244 SKVNNQEALSTFYFLMGY 261 >UniRef50_D1ALB8 Glycosyl transferase family 2 n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1ALB8_SEBTE Length = 1334 Score = 237 bits (606), Expect = 3e-61, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 104/286 (36%), Gaps = 18/286 (6%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALND 59 + PLIS+ MP +N +A+ SV+ Q Y NWE+ I DD S + E D Sbjct: 491 QYKPLISVLMPVYNTDIHFLKQALDSVIEQTYDNWELCIADDNSPNEEVREILKEYENKD 550 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVT-HAF 118 RI I+ N N A+ L GE+ +D DD + L + + + Sbjct: 551 SRIKVIYRKENGHISLASNSALELVTGEFTALMDHDDLIPKHALYMVVWEINRKKGLVDL 610 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 +Y ++ GE Y K ++ L +N + + K+ F + Sbjct: 611 IYTDEDKIDGENI----RYDPYFKMEWNETLINSQNFVAHLGIYRTSILKKIGGFRKGFE 666 Query: 178 AAQDYDIFLRMVVE--YGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +QDYDI LR + E + I G ++ S Y+ K+ + Sbjct: 667 GSQDYDILLRFLREISSDRISHIPHVLYHWRIFKGNHTFSTDNHNISD-DSAYKALKEHY 725 Query: 236 DRASKKYQLFTLYQI-------RNKRMTWRTLLTLLSVRNGKRLAD 274 + + ++ + R ++ + ++ R+ + Sbjct: 726 EILKEDVRILPVDNFPGCWKIKRQQKTILPKVSLIIPTRDRVEILK 771 Score = 106 bits (266), Expect = 7e-22, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 96/283 (33%), Gaps = 25/283 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +S+ +PT +R ++ + + + Y ++E+IIVD+ S + L+ + D RI Sbjct: 755 PKVSLIIPTRDRVEILKNCVDGLQKNTDYDDFEVIIVDNDSKEKKTLEYFDKISADSRIK 814 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + + + N A+ A+GEY+ +++D + + L ++ + + A Sbjct: 815 ILKVEGEFNYSKLNNLAVKEAKGEYLVFMNNDLEIIKNDWLKEMISTFSEE-NVGIVGAK 873 Query: 123 DYVCQGEVYSQP------------ASLPLYPKSPYSRRLFYKRNII----GNQVFTWAWR 166 Y + Y RL N+ + Sbjct: 874 LYYSNNTIQHAGCVTGVYGVAGHIHKHLPKNSPGYFGRLGLIHNVSAVTGACLAISKKIF 933 Query: 167 FKECLFDTE-LKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 + FD E LK + D D+ L++ + + E + + K+ Sbjct: 934 EEVNGFDEEKLKVSYNDVDLCLKVRDKGYKIIFNPEVELY---HLESISRGKDESKYKKE 990 Query: 225 FHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVR 267 + + ++ R +K + Y + + + Sbjct: 991 LN-RNERREMISRYGEKLKYDPYYSVNLSLENEDVTYSFSPRK 1032 >UniRef50_B8FY70 Glycosyl transferase family 2 n=2 Tax=Desulfitobacterium hafniense RepID=B8FY70_DESHD Length = 235 Score = 237 bits (605), Expect = 3e-61, Method: Composition-based stats. Identities = 51/236 (21%), Positives = 90/236 (38%), Gaps = 8/236 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +S+ +PT+N + A++SVLRQ Y + E+I++DD ST + RI Sbjct: 2 SMPKVSVVIPTYNHARYVTWAVESVLRQKYPSLEIIVIDDGSTDDTAQRLQPYQ---SRI 58 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 YI+ N G N + A GEYI + DD + +L + + F Y + Sbjct: 59 NYIY-KSNGGTPNALNHGLQRATGEYICWLSADDMFLKGKLEKQVQLMEANPQVGFCYTS 117 Query: 123 DYVCQGEVYSQPASLPLYP--KSPYSRRLFYKRNIIGNQVFTWA-WRFKECLFDTELKAA 179 V + A+ + + L I G+ V + FD L A Sbjct: 118 FIVIDENGLKKYAADSEFYPARQELVVNLLRGCFINGSSVMMRRTALERVGYFDEGLPQA 177 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 DY+++ R + + + E + M + + R+ ++ Sbjct: 178 HDYELWFRFLRHF-PAGYIREPLIAYRWHGENMSLHPDNECIRIVQQRARELFPEW 232 >UniRef50_UPI0001744769 glycosyl transferase, group 1 n=2 Tax=Bacteria RepID=UPI0001744769 Length = 1982 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 101/282 (35%), Gaps = 17/282 (6%) Query: 5 PLISIYMPTWNRQQL-AIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQ-QYVTALNDPRI 62 P IS+ MP +N +A++SV Q Y NW++ I DDCST + Q +D RI Sbjct: 788 PKISVVMPVYNTPLWALQKAVQSVTAQIYDNWQLCIADDCSTEPDIRQYLQALQASDSRI 847 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + + NSG A N A LA+GE++ +D DDE T + L+ Y++ Sbjct: 848 SVTFLEQNSGISAATNAAAALAEGEFLAFLDHDDELTIDALAEMALGLAAHPDWDMAYSD 907 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQD 181 D + K +S L + + + F+ F + AQD Sbjct: 908 DDLID----PAGRRHRPQFKPDWSPTLLLSFMYMSHLLMVRRSVFENLGGFRVGFEGAQD 963 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 YD LR + + + + G + K + + +R Sbjct: 964 YDFALRASEAVNQVGHIPKILYHWRVMPGSTATGGAAKP-ESFERGRLAVQQAVERRKIS 1022 Query: 242 YQLFT--------LYQIRNKRMTW-RTLLTLLSVRNGKRLAD 274 ++ + ++ RNG +L + Sbjct: 1023 AKVNHPDWALQAGCGIFSLHHPDHGPQVDIVIMNRNGGKLLE 1064 Score = 57.4 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 38/254 (14%), Positives = 83/254 (32%), Gaps = 23/254 (9%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P + I + N +L R ++S+ Y N+ + + D+ S L+ + +I Sbjct: 1046 HGPQVDIVIMNRNGGKLLERCLQSLQMTTYQNYVVTVADNTSNDPGTLEVLQKSG--AKI 1103 Query: 63 TYIHNDINSGACAV--RNQAIMLAQGEYITGIDDDDEWT-PNRLSVFL--AHKQQLVTHA 117 + +++ N+A+ + E++ +++D E P LS + A + + Sbjct: 1104 ITLPDEVPGRFSFSNLYNRALSQSTAEFVLLLNNDTEVKNPRWLSQMMGYALMEGVGAVG 1163 Query: 118 FL--YANDYVC---------QGEVYSQPASLPLYPKSPYSRRLFY---KRNIIGNQVFTW 163 + + V +G+V + P + + R + Sbjct: 1164 ACLLFPDHTVQHAGMLLGCHEGKVGHAFKNTPAHEHGYMALRAVTRESSGVTAACMLIRR 1223 Query: 164 AWRFKECLFDTEL--KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 F D + D D+ L++ + EA H P + Sbjct: 1224 DCYFSVGGQDEQRFAVGYNDADLCLKLGDKGLRSIYCGEAELFHHEGKSRGTGNDDPLEE 1283 Query: 222 SGYFHFYRKHKDKF 235 + R D+F Sbjct: 1284 AAIRAEVRHRPDRF 1297 >UniRef50_A0K4U8 Glycosyl transferase, family 2 n=3 Tax=Burkholderia cenocepacia RepID=A0K4U8_BURCH Length = 709 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 65/281 (23%), Positives = 103/281 (36%), Gaps = 15/281 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQ-QYVTALNDPR 61 P ISI MPT+N AI SV Q YSNWE+ I DDCST + + DPR Sbjct: 161 KPKISIVMPTFNPPLNYIREAIDSVRNQFYSNWELCIADDCSTDSDARSFLTSLSALDPR 220 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I D N A N A+ LA+G ++ +D DD + L + +Y+ Sbjct: 221 IKVHLRDSNGHISAASNSALALARGHFVALLDQDDLLPKHALYHVALEINRFPDVNVIYS 280 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 ++ Y KS ++ LF N+I + + F + +Q Sbjct: 281 DEDKIDN----AGNRSDPYFKSDWNPDLFLSHNMISHLGVYRTSLLREIGGFRIGYEGSQ 336 Query: 181 DYDIFLRMVV--EYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF--- 235 DYD+ LR V + + I+ K + Y R +D Sbjct: 337 DYDLALRCVRVSRPDQIRHISHVLYHWRIHDESTAANPHAKHY-AYDAALRAIQDFLFDK 395 Query: 236 -DRASKKYQLFTLYQIRNKRMTWRT-LLTLLSVRNGKRLAD 274 + + Y++ + + L+ R+G+ + Sbjct: 396 PGTRVQHGEFIGTYRVIYPIPETQPHVSLLIPTRDGRSILK 436 Score = 93.6 bits (232), Expect = 6e-18, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 92/269 (34%), Gaps = 26/269 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + P +S+ +PT + + + R IKS+L++ Y N+E+I++++ ST E + Sbjct: 417 ETQPHVSLLIPTRDGRSILKRCIKSILQKTSYRNFEIIVINNQSTCLETINYLKQISEYD 476 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL 119 ++ I D A+ N A A GE I ++DD + +P L+ ++H + + Sbjct: 477 QVRVIDYDAPFNYSAINNFAEKHASGEIIGLLNDDVEVVSPEWLTEMVSHALR-PEIGVV 535 Query: 120 -----YANDYVCQGEVYSQP-------ASLPLYPKSPYSRRLFYKRNI----IGNQVFTW 163 Y + ++ V L Y+ R +N + Sbjct: 536 GAKLLYPDGFIQHAGVVIGIGGFAGHAHRLHPGMHPGYAGRAALTQNFSAVTAACLIVRR 595 Query: 164 AWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEE-ATQILHINHGEMQITSSPKK 220 + + + +LK A D D LR +H S Sbjct: 596 SVYQELGGLNEQDLKVAFNDIDFCLRAGAAGYYVTWTPHAVLY----HHESYSRGSDQAS 651 Query: 221 FSGYFHFYRKHKDKFDRASKKYQLFTLYQ 249 F R+ DR + Y Sbjct: 652 TESRARFEREKNYMRDRWKTDSKPDRFYN 680 >UniRef50_B0PBK3 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0PBK3_9FIRM Length = 1073 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 55/281 (19%), Positives = 99/281 (35%), Gaps = 14/281 (4%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVD--DCSTSWEQLQQYVTALNDPRI 62 SI +P +N Q I+SVL Q Y NWE+ + D D + D R+ Sbjct: 544 KFSIVVPLYNTPLQFLHEMIRSVLDQTYGNWELCMADGSDAQHGNVEQACKKYCQKDSRV 603 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 Y+ D N G N + +A G YI D DD P L + + F+Y + Sbjct: 604 HYLKLDKNLGISGNTNACLEMATGNYIGLFDHDDLLHPAALFEVMWAICEQ-DADFIYTD 662 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAAQ 180 + + K Y+ N I + ++ F +E +Q Sbjct: 663 ENTFHDKPEDA---YCPAFKPDYAPDTLRSYNYICHFTVFSKKLLEKTGGGFRSECDGSQ 719 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQ--ITSSPKKFSGYFHFYRKHKDKFDRA 238 DYD+ LR+ + + + + + I++ P + +H + Sbjct: 720 DYDLVLRLTEKASHIVHIPKILYYWRSHSNSVASDISAKPYTVTAAKLALSEHLARIGLP 779 Query: 239 SK--KYQLFTLYQIRNKRMTWRTLLTLLSVRNGK-RLADGI 276 + L + Y+I+ + + + L+ ++ L I Sbjct: 780 GEVLDAVLPSTYKIQYEIIGKPLVSILIPNKDYADDLYKCI 820 Score = 103 bits (257), Expect = 8e-21, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 90/274 (32%), Gaps = 23/274 (8%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+SI +P + + I S+ + Y WE+II+++ ST + Y ND RI Sbjct: 800 KPLVSILIPNKDYADDLYKCITSICTKTTYPAWEIIIIENNSTEQKTFDYYEQVQNDNRI 859 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + AV N A+GE++ +++D + TP+ + + Q + Sbjct: 860 QVIQWENEFNYAAVNNWGAQYARGEHLLLLNNDVEVITPDWIE-QMLMFSQRRDVGAVGC 918 Query: 122 NDYVCQGE----------VYSQPASLPLYPKSPYSR--RLFYKRNI----IGNQVFTWAW 165 Y + + ++ Y R+ +N + Sbjct: 919 MLYYPDDTVQHAGVILGIGGVAGHAHKHFFRNDYGYMSRMTIAQNYSAVTAACMMIRRDV 978 Query: 166 RFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQILHINHGEMQITSSPKKFSG 223 + D + A D D+ +R+ A +PKK Sbjct: 979 WDEMQGLDEAFQVAFNDVDLCMRIRQAGYLIVWTPYAELYHY--ESKSRGYEDTPKKQKR 1036 Query: 224 YFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTW 257 + R+ + ++ + + + R + Sbjct: 1037 FEGEVRRFQSRWAKELADGDPYYNPNLTLDREDF 1070 >UniRef50_Q0BQ52 Glycosyltransferase n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BQ52_GRABC Length = 854 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 61/283 (21%), Positives = 109/283 (38%), Gaps = 17/283 (6%) Query: 6 LISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS-WEQLQQYVTALNDPRIT 63 L+SI P + R A++SV+ Q Y NWE+I++DDCS + A D RI Sbjct: 289 LVSIICPVYKPRLSDFALAVESVIAQTYQNWELILIDDCSEEANLKNLLQQFAQADSRIK 348 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 ++ NSG N + A G +I +D DD + L + +A + LY+++ Sbjct: 349 HLTTRKNSGISEASNLGLKAANGSWIAFLDHDDLLEASALDIMIAAAEA-TGAKLLYSDE 407 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 + K ++ RL + N I + V F+ + AQD+ Sbjct: 408 DKIDDSGF----FRDPAFKPDWNYRLLLEVNYICHFVLVQRDVINSAGPFNKQFDGAQDH 463 Query: 183 DIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+ LR+ E + V E I G K + + +R Sbjct: 464 DMLLRIAERLEPSEVFHVPEILYHWRITPGSTAGDIGAKPY-AIEAGLQCVSRHLERRGL 522 Query: 241 KYQLFTLYQIRNKRMTW-----RTLLTLLSVRNGKRL-ADGIR 277 K + T Y + ++ W ++ ++ ++ + + I+ Sbjct: 523 KADVSTRYGMTLYKIDWETNNSPSVSIVIPFKDQIDVTLNCIK 565 Score = 72.4 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 82/259 (31%), Gaps = 24/259 (9%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 N+P +SI +P ++ + + IK+++ + Y+N+++++VD+ ST Sbjct: 542 NNSPSVSIVIPFKDQIDVTLNCIKNIIEKTHYTNYQIVLVDNWSTDPNMSLLQDYVAAHS 601 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAFL 119 I I +I + N A +Y +++D T + L +A + + Sbjct: 602 NIKIIRQEIPFNYSLLNNIACASYPADYYVFLNNDLFVLTSDWLYRLVAEAEVDPRVGAV 661 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLF----------------YKRNIIGNQVFTW 163 G + L + + + + Sbjct: 662 GGKFVYPNGTIQHGGVILGIGGVAGHVHTGLPGDEGGYGGRANFTQEMSAVTAAGMLVRA 721 Query: 164 AWRFKECLFDTELKAA--QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 + FD A D D+ LR+ E +H + + Sbjct: 722 KAFHEIGCFDESKLAVAFNDIDLCLRLRAAGYTIIYTPEFFAE---HHESLSRGDDVRP- 777 Query: 222 SGYFHFYRKHKDKFDRASK 240 + F+++ + +R + Sbjct: 778 AQERRFFKETQIMLERWGE 796 >UniRef50_C7HAW6 Glycosyl transferase, group 2 family n=2 Tax=Faecalibacterium prausnitzii RepID=C7HAW6_9FIRM Length = 809 Score = 236 bits (604), Expect = 4e-61, Method: Composition-based stats. Identities = 55/276 (19%), Positives = 89/276 (32%), Gaps = 13/276 (4%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P ISI +P +N + SV+ Q Y NWE+ VD A D RI Sbjct: 271 PKISIVVPLYNTPLDFLDEMLDSVVNQTYKNWELCCVDAGKDEAVGQHVQARAKADARIR 330 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y + N N+ +A GEYI +D DD P L + F+Y ++ Sbjct: 331 YQKLEKNELIPGNTNKGFEMATGEYIALLDHDDLLHPCALWYAARAIAEQK-ADFVYTDE 389 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK--ECLFDTELKAAQD 181 +G+ + K + N I + + E +QD Sbjct: 390 ATFEGKPEHVVLY---HFKPDFMLDNLRSNNYICHLTVFSRALMERAGGGERMEYNGSQD 446 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF--SGYFHFYRKHKDKFDRAS 239 Y++FLR+ + + + A + G S K + + H + A Sbjct: 447 YELFLRLTEQAEKIVHIPHALYYWRSSPGSTAADISAKTYCIDAGIAALKAHYARCGIAV 506 Query: 240 KKYQLF----TLYQIRNKRMTWRTLLTLLSVRNGKR 271 L Y+ + L+ + R Sbjct: 507 DDVALIPGTPGYYKTDYTIEHPGRVSILIPTCDHIR 542 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 73/222 (32%), Gaps = 24/222 (10%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALND--PRIT 63 +SI +PT + + + S+ + Y ++E+I++++ S E + Y + + Sbjct: 531 VSILIPTCDHIRDLELCVDSIYARTTYPDFELILIENNSKQPETFRAYERMQKEHPDNLK 590 Query: 64 YIHNDINSGACAVR--NQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLV----TH 116 + + G N A GEY+ +++D + TPN L + + QQ Sbjct: 591 VVTWEG-KGFNYSALNNFGEQYATGEYLLLLNNDTEVITPNWLEEMVMYAQQKRVGCVGA 649 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPY---SRRLFYKRNII--------GNQVFTWAW 165 LY +D + + + + + S + N + + Sbjct: 650 KLLYPDDTIQHAGIGFGIGGVAGHLHKYFPAASDGYMGRLNYVQDVYGDTAACLLIRREI 709 Query: 166 RFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEA-TQIL 205 + D A D D +R+ A Sbjct: 710 YDEMNGLDESYAVAFNDVDFCVRVREAGYTNVFTPFAQLYHY 751 >UniRef50_A6H2F4 Glycosyl transferase, group 2 family protein n=2 Tax=Flavobacteriales RepID=A6H2F4_FLAPJ Length = 257 Score = 236 bits (604), Expect = 4e-61, Method: Composition-based stats. Identities = 69/261 (26%), Positives = 107/261 (40%), Gaps = 6/261 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M L+SI PT+N +Q AIKSV Q YS+WE+IIVDDCS D Sbjct: 1 MNE--LVSIITPTYNSEQFIAEAIKSVQNQSYSHWEIIIVDDCSKDKTVNIIQNFIDEDH 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I D NSGA RN AI A+G YI +D DD W P++LS + Q+ F + Sbjct: 59 RIYLIQLDKNSGAGVARNNAINNAKGRYIAFLDSDDLWKPDKLSKQIKFMQRH-NIPFTF 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + + K S + N IGN + + + + ++ Q Sbjct: 118 SFYDCINEKGDLLNKRIEAPKK--LSYYQLFFCNFIGNLTGIYDSNYFGKMAISSIRKRQ 175 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+ ++L ++ + V E+ + I + + + YRK S Sbjct: 176 DWIVWLTILKKIKTAKPVPESLALYRIRENSISASKVS-LLKDNYAIYRKFHHLNIIVSL 234 Query: 241 KYQLFTLYQIRNKRMTWRTLL 261 L L+ + + ++ Sbjct: 235 ACMLGFLFTQLLVKPKFVKII 255 >UniRef50_B0CGB5 Glycosyl transferase, family 2, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CGB5_ACAM1 Length = 327 Score = 236 bits (604), Expect = 5e-61, Method: Composition-based stats. Identities = 50/243 (20%), Positives = 97/243 (39%), Gaps = 11/243 (4%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ + T+N + ++SVL+Q +S+ E+IIVDD S+ + ++T++ D R+ Sbjct: 3 KVSVIITTFNSMRFFPETLESVLKQTFSDIEVIIVDDGSSDG--ISDWITSVVDSRV-VF 59 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 H N G + RN I LA+GEYI +D DD W P +L + ++ + Sbjct: 60 HAQSNQGVSSARNTGIRLARGEYIAFLDGDDFWAPTKLEQQVRVLALEDEVGLVHTWLAL 119 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYDI 184 E + + + I + V +FD+ L A+D+D+ Sbjct: 120 MDEESRLTGRVMKPQSEGAIWEEIVESNMIACSSVMVRRSCLDVVGIFDSALIVAEDWDL 179 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH-----KDKFDRAS 239 ++R+ +++ I+ P+ + + D + Sbjct: 180 WIRVAA-VCRISVIKDPLVQYRIHPSSKS-KRYPEMVEDFRTIIERAFQSVPYDYLPIRN 237 Query: 240 KKY 242 + Y Sbjct: 238 RSY 240 >UniRef50_B5W037 Glycosyl transferase family 2 n=2 Tax=Arthrospira RepID=B5W037_SPIMA Length = 330 Score = 236 bits (603), Expect = 6e-61, Method: Composition-based stats. Identities = 59/262 (22%), Positives = 107/262 (40%), Gaps = 12/262 (4%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P ISI +P +N + I SV+ Q + ++E+II++D ST +++++DPR+ Sbjct: 2 PQISIIIPAYNADKTIKETIDSVINQTFKDFEIIIINDGSTDRT--LDIISSIDDPRVQV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 NSGA RN+ A +YI +D DD WT ++L + A + + Y+ Sbjct: 60 FSYP-NSGASVSRNRGFSKASAQYIAFLDADDLWTSDKLELQYAALEADPEASVAYSWTN 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAWRFKECLFDTELKAAQD 181 + + +N + N + T FD LK QD Sbjct: 119 YIDQYGEFLYPGFHQSYQGDV-YSILLVKNFLENGSNPLITRHALSAIGGFDESLKGGQD 177 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK---HKDKFDRA 238 +D++LR+ +Y + + +A + I+ + ++ ++ S K + + Sbjct: 178 WDLYLRLAAQY-KFVCISQAQILYRISEHSIS-SNIERQESSCLLVLNKAFNNAPSSLKN 235 Query: 239 SKKYQLFTLYQIRNKRMTWRTL 260 KK L LY + L Sbjct: 236 LKKDSLSHLYSYLTFKSLESPL 257 >UniRef50_Q2LWN5 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LWN5_SYNAS Length = 363 Score = 236 bits (603), Expect = 7e-61, Method: Composition-based stats. Identities = 48/229 (20%), Positives = 93/229 (40%), Gaps = 7/229 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + P+IS+ +P +N +I +L Q YSN+E++I+DD ST V++ DPR Sbjct: 23 NSAPMISVVLPVYNGASYLRESIDCILVQSYSNFELLIIDDGSTDDSASI--VSSFTDPR 80 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + + N G A N+ I LA+G YI D DD P+RLS +A + Sbjct: 81 IRFYSQE-NKGLAATLNRGIALAKGAYIARQDQDDISLPDRLSKQVAFMETHPDCGMAGT 139 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLF--DTELKA 178 + + + ++ L + + + + F + ++ + + + Sbjct: 140 WASILEEQKPTKRLHRHDADNLSLQFDLLFDNPFVHSSLMIRKTVFDEVGVYCTNPDRQP 199 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF 227 +DY+++ R+ ++ + E I M + + Sbjct: 200 PEDYEMWSRVARKF-RVANIPEVLHIYREVPQSMSRSGNNPFLQHLLKI 247 >UniRef50_C7M1X8 Glycosyl transferase family 2 n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M1X8_ACIFD Length = 1084 Score = 236 bits (602), Expect = 7e-61, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 103/282 (36%), Gaps = 18/282 (6%) Query: 2 KNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALND 59 ++ PLIS+ MP +N + + AI+SV Q Y +WE+ I DD ST E D Sbjct: 536 EHPPLISVLMPVYNTPIRHLVTAIESVRAQWYPHWELCIADDASTDPEIRPILTRYQEAD 595 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 PRI D N G A N A+ LA G+++ +D DDE + L + + L Sbjct: 596 PRIKVAFRDENGGISANSNTALTLANGKFVAYLDADDEISEVALLHYAREIHEYPGVELL 655 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW-RFKECLFDTELKA 178 + + + ++ Y K S L +N + + + +E Sbjct: 656 FCD----EDKITEDGDRSDPYFKPSLSPALLLGKNCVTHLGVYRTDTVRRLGGMRSEFDG 711 Query: 179 AQDYDIFLRMV----VEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 +QD+D+ LR + +++ ++ HG T K + +D Sbjct: 712 SQDWDLALRFLPIVGIDFTRARRIPRLLYHWRRIHGSTATTLRSKSW-AVLAGRHAVQDY 770 Query: 235 FDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 D A + + +L + L + Sbjct: 771 LDTAVPGAKAEPI------PRASNLNRLVLPTPDPAPLVSIL 806 >UniRef50_B7R2A7 Glycosyltransferase n=1 Tax=Thermococcus sp. AM4 RepID=B7R2A7_9EURY Length = 307 Score = 236 bits (602), Expect = 7e-61, Method: Composition-based stats. Identities = 68/273 (24%), Positives = 115/273 (42%), Gaps = 6/273 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P ISI +PT+NR + R I+SVL Q + ++E+II+DD ST + + D RI Sbjct: 3 RPKISIILPTYNRGYIIGRTIRSVLDQTFKDFELIIIDDGSTDNTKNIVEIYKEKDNRIR 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 YI + N GA A RN I ++GEYI ID D W P +L + + + +Y+ Sbjct: 63 YIRHKRNLGANAARNTGIKKSRGEYIAFIDSDTIWKPQKLEIQIKIADNTKENFVIYSRT 122 Query: 124 Y-VCQGEVYSQPASLPLYPKSPYSRRLFYKRNII-GNQVFTWAWRFKE-CLFDTELKAAQ 180 + P + K N I + + K LFD +L Q Sbjct: 123 IRKYTKKTVIVPEDVNQGKKEGQLLDKILVLNFIDMSSILVKKNMIKNSGLFDEKLPRLQ 182 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+DI +R+ +Y + + + I ++ + + K H +K+ F + K Sbjct: 183 DWDILIRLAEKYNFVY-IPQTLVISYVLSDSISV-DYCKLLKAEMHLLKKYYPLF-KQDK 239 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 + Y+I + + ++ +R + Sbjct: 240 RAIAQRYYEIVLSSLKCKRMIQTNHLRGFIKYL 272 >UniRef50_C8QGU2 Glycosyl transferase family 2 n=1 Tax=Pantoea sp. At-9b RepID=C8QGU2_9ENTR Length = 286 Score = 236 bits (602), Expect = 9e-61, Method: Composition-based stats. Identities = 83/272 (30%), Positives = 133/272 (48%), Gaps = 6/272 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+++Y+PT NR L RAI SV Q Y N E+II DD S+ ++ D RI Sbjct: 8 QPLVTVYIPTHNRSSLVKRAIASVQNQTYRNLEIIICDDGSSDDTEVVIAKMKEQDSRII 67 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y+ ND GAC RN+ I +A+GE+ITG+DDDD + R++ F+ Q+L + Sbjct: 68 YLKNDRPMGACHARNRCIAIAKGEFITGLDDDDYFLNTRIANFVTASQKLNKPFL--CAN 125 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDY 182 + + + + + + +N +GNQVF K FD A QDY Sbjct: 126 MIFKNRNIEKKWKSYV---GDITLNMMGFKNWVGNQVFIRTSMMKNIGGFDINFPAWQDY 182 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D++ RM+++YG +++++A+ +++I+ +IT+ K + GY F HK R K Sbjct: 183 DLWYRMIMQYGSCYRIDDASYVVNIDDDRPRITTGSKAWQGYQKFITTHKPSLSRQHLKS 242 Query: 243 QLFTLYQIRNKRMTWRTLLTLLSVRNGKRLAD 274 F R + ++ S+ N L Sbjct: 243 LYFQDKLNRKDEINFKEFFHYPSIDNLLILTK 274 >UniRef50_B8HLM9 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HLM9_CYAP4 Length = 380 Score = 236 bits (602), Expect = 9e-61, Method: Composition-based stats. Identities = 60/270 (22%), Positives = 104/270 (38%), Gaps = 15/270 (5%) Query: 1 MKNNP-LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M + P L+S+ +P +N + + + SVL Q Y N E+I+VDD S A D Sbjct: 1 MADEPALVSVIIPAYNAEAFIAQTLTSVLNQTYQNLEVIVVDDGSHDQTGEIIQKFAQKD 60 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHK-QQLVTHAF 118 PR+ ++ N G A RN AI + G+YI ID DD W P++L + Q Sbjct: 61 PRVRWL-QQANGGVAAARNLAIRASLGQYIAPIDADDIWFPHKLEKQVNCLSQAETRVGL 119 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN---QVFTWAWRFKECLFDTE 175 +YA + + +N +G+ + + K ++ Sbjct: 120 VYAWSVDIDEWGELLGGFHAYQFE-HNVHHILAYKNFLGHASAPLIRRSCLDKVGGYNCR 178 Query: 176 LK-----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF--HFY 228 L+ +D D++LR+ Y V+E G M + S Sbjct: 179 LRAQNAQGCEDLDLYLRIAEHYD-FRVVKEFLIGYRRVIGSMSCNYTAMARSYLLVQSEI 237 Query: 229 RKHKDKFDRASKKYQLFTLYQIRNKRMTWR 258 R+ + A+ ++ T Y ++ ++ Sbjct: 238 RRTNAEIPDAAFRFFNSTFYLYLARQSSFN 267 >UniRef50_C6P727 Glycosyl transferase family 2 n=1 Tax=Sideroxydans lithotrophicus ES-1 RepID=C6P727_9PROT Length = 359 Score = 235 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 52/227 (22%), Positives = 87/227 (38%), Gaps = 4/227 (1%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+IS+ MP +N ++ AI S+L+Q +S++E II+DD ST D RI Sbjct: 14 PVISVAMPVYNGEKYLAEAIDSILKQTFSDFEFIIIDDGSTDNSLQVLKKYQERDTRIRL 73 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I + N N I LA+G++I +D DD P R L Q+ + Sbjct: 74 ISRE-NRNLATTLNDIIDLAKGQWIARMDQDDVALPQRFERQLRWLQE-TDADICGSWAR 131 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDI 184 + S L + V A K +D + +DYD+ Sbjct: 132 LF--GTADTRVLKHAQSDSAVKAELLFGSPFAHPSVMMKADLVKGLRYDKAWEKCEDYDL 189 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 + R V E + + ++ ++S K+ + R++ Sbjct: 190 WERAAQAGWNMTNVPEVLLLYRQHETQISTSTSNKQQALGQQIRRRY 236 >UniRef50_P55465 Uncharacterized protein y4gI n=1 Tax=Rhizobium sp. NGR234 RepID=Y4GI_RHISN Length = 909 Score = 235 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 66/294 (22%), Positives = 114/294 (38%), Gaps = 20/294 (6%) Query: 1 MKNNPLISIYMPTWNR-QQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALN 58 ++N PLIS+ +P +N L + I+SV Q Y+NWE+ + DDCST E A Sbjct: 362 LQNGPLISVVVPVYNPDPALLVEMIESVRAQSYANWELCLADDCSTDPEVGRVLRNYAAQ 421 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 DPR+ + + N N AI +A+G YI +D DD P+ L + + Sbjct: 422 DPRVRVVFREANGHMSQASNSAIEIARGAYIALLDHDDLLDPDALVLVVQVIDAHPDAKI 481 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELK 177 +Y + + ++ + K ++R L Y N I + A + F + Sbjct: 482 IYTD----EDKIVEGGTRCDAHFKPDWNRDLLYGINYISHLGVFDAALVREVGAFREGFE 537 Query: 178 AAQDYDIFLRMVVEYG--EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 AQDYD+ LR + + + + G ++ K + R ++ Sbjct: 538 GAQDYDMLLRCIERVQDRQIHHIAKVLYSWRATPGSAAASNRAKPY-ANEAGRRALEEHL 596 Query: 236 DRASKKYQLFTL------YQIRNKRMTWRTLLTLLSVRNGKRLA----DGIRGR 279 R + K L Y+ + ++ R+ + + I GR Sbjct: 597 ARTTGKSIPVVLGPIPFSYRALWPMEGTPLVSIIIPTRDHLNVLRATVESILGR 650 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 50/262 (19%), Positives = 92/262 (35%), Gaps = 29/262 (11%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTAL-N 58 M+ PL+SI +PT + + ++S+L + Y N+E+I+VD+ S + L+ + + Sbjct: 621 MEGTPLVSIIIPTRDHLNVLRATVESILGRTMYGNFELIVVDNGSVEADTLEWFGQIEGS 680 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHA 117 D R+ + + A+ N A+ ++GE + ++DD + P+ LS + Q Sbjct: 681 DRRVRVLRDARPFNYSALNNAAVAQSRGEIVALVNDDVEVIAPDWLSEMV-ALAQRPGVG 739 Query: 118 FLYANDYVCQGEVY------------SQPASLPLYPKSPYSRRLFYKRNI----IGNQVF 161 + A Y G + L + Y RL ++N V Sbjct: 740 CVGAKLYYPDGRIQHAGVVIGLGGVAGHGHLLYPGEHAGYFCRLKLRQNYSAVTAACLVI 799 Query: 162 TWAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVE-EATQILHINHGEMQITSSP 218 + EL A D D+ L++ +H Sbjct: 800 KREIFDAVGGLNESELTVAFSDIDLCLKVRAAGYNNVWTPWAELY----HHESASRGHED 855 Query: 219 KKFSGYFHFYRKHKDKFDRASK 240 +R+ D R K Sbjct: 856 TPEK--RARFRREVDYMKRRWK 875 >UniRef50_B8HRB2 Glycosyl transferase family 2 n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HRB2_CYAP4 Length = 373 Score = 234 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 14/261 (5%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N PL+S+ +P +N + + +KSVL Q Y+N E+++VDD S A +DPR Sbjct: 8 SNLPLVSVIIPAYNAEAFIQKTLKSVLTQTYTNIEVLVVDDGSQDRTAEIVQAIAKDDPR 67 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAH-KQQLVTHAFLY 120 + + N+G A RN+AI +GE++ ID DD W P L + + F+Y Sbjct: 68 VDLL-QQSNAGVAAARNKAIQATRGEFVAPIDADDIWYPQNLEKQVQCLLKSDSNVGFVY 126 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN---QVFTWAWRFKECLFDTELK 177 + + + + +L + IGN + + ++ EL+ Sbjct: 127 SWSLEIDEQGLLSGGFHAAWHQGEVYLKLLI-QFFIGNASATLIRRVCLEETNGYNCELR 185 Query: 178 -----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY--RK 230 +D+D++LR+ Y V E M S R Sbjct: 186 AMEAQGCEDWDLYLRLAERYH-ANVVPEFLVGYRQLKNSMSRNLDLMARSQSLVLEASRN 244 Query: 231 HKDKFDRASKKYQLFTLYQIR 251 H + A ++ Y Sbjct: 245 HHPELPTAVYRWSTSRFYIYL 265 >UniRef50_A0YK73 Glycosyl transferase n=5 Tax=Cyanobacteria RepID=A0YK73_9CYAN Length = 1161 Score = 234 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 51/275 (18%), Positives = 99/275 (36%), Gaps = 9/275 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +S+ +P +N + +A++SVL Q ++E+I+VDD ST + RI Sbjct: 7 TPQVSVIIPAYNGSRYISQAVESVLSQTDCHFEVIVVDDGSTDNTNTILQHYS---DRIR 63 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y N G A RN+ I A+ E I +D DD + P++L+ A Q+ + + Sbjct: 64 YFSQ-KNQGVAAARNRGIQEAKAEIIALLDQDDIFLPHKLAEQTACFQKFPEVGLVNSGW 122 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ--VFTWAWRFKECLFDTELKAAQD 181 + E Y+ P + + + I +F W K F++ D Sbjct: 123 RLINREGYNISEIEPWHNLPDLDLQTLIIHSPILPSAIMFRRTWWEKIGGFNSRFNGVDD 182 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 + ++ + + + T +H + K + F + Sbjct: 183 AEFVWQLAARGCQAIWLPKITVEYRQHHQTVSH---QKAVERAETVIVAQNNFFSQKGLS 239 Query: 242 YQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 + L + L + + +A+ + Sbjct: 240 DAVIQLEKPARYEELTWLAWHLYHTNHYQSMAEFL 274 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 56/272 (20%), Positives = 108/272 (39%), Gaps = 15/272 (5%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N P +S+ +P +N + +A+KSVL Q Y+++E+I++DD ST + Sbjct: 328 SNLPKVSVIIPAYNCENYIEQAVKSVLEQTYTDYEVIVIDDGSTDNTKNILSPYLDI--- 384 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y++ N GA RN+ +AQGE + +D DD +TPN+L+ + +Q T + + Sbjct: 385 IQYVYQS-NQGAAKARNKGCQIAQGELLAFLDGDDFFTPNKLAEQVKVFEQDATIDLVQS 443 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRR--LFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 V + LP + + +K + K FD Sbjct: 444 GWMVVNKTGENLVDVLPWKNAPELNLETWVLHKCVRPSALMIRKICWKKVGGFDHSYPPT 503 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD--- 236 +D D LR+ + + V++ + + ++ + + ++ D Sbjct: 504 EDLDFVLRLALMGCKAVWVKQLHAGYRQHDCNL-MSGGERVIKNTELLMNQFFNRRDLPK 562 Query: 237 -----RASKKYQLFTLYQIRNKRMTWRTLLTL 263 + + YQ + + R R + +T Sbjct: 563 NIQQLKQKEHYQRWVWFAWRMYRDSHLKFVTY 594 >UniRef50_C6MPM7 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M18 RepID=C6MPM7_9DELT Length = 315 Score = 234 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 52/257 (20%), Positives = 84/257 (32%), Gaps = 7/257 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN--DPRIT 63 L S+ +P +N + +A+ SV Q +++E++I +D S A D + Sbjct: 17 LFSVVIPAYNSEPFIAKALDSVRAQTLADYEVVITNDGSKDGTVQAIEDYARRHPDFPVR 76 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 N G RN I A G +I +D DD W P++L Y ++ Sbjct: 77 LASQ-QNKGIGGARNNGIFRATGRFIAFLDADDYWHPSKLEKMARLLAGRPQVDVAYHDE 135 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWR-FKECLFDTELKA--AQ 180 G+ +P S + Y LF + + K F L A+ Sbjct: 136 IEVAGDGTRRPLSYDEVREPAYQDLLFRGNRLSTSATVVRRELAQKIGGFSENLDFNSAE 195 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DY+ +LR+ + E H G + F+ H D Sbjct: 196 DYEFWLRLARAGARFAHLGEVLGEYHRVEGSITQ-KIEYHHRNIFNVITHHLDLLRADGT 254 Query: 241 KYQLFTLYQIRNKRMTW 257 F R K+ Sbjct: 255 CSAAFLDRMYRRKKAEH 271 >UniRef50_C0FV27 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FV27_9FIRM Length = 1030 Score = 234 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 66/279 (23%), Positives = 116/279 (41%), Gaps = 16/279 (5%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 L SI MP +N + + +AI+S+ +Q+Y NWE+ I DDCST E +++++A+ + RI Sbjct: 92 LFSIVMPVYNVEIKWLDKAIESIEKQNYKNWEICIADDCSTKQEV-REHLSAMKNSRIKI 150 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G N A LA GEYI +D+DDE P+ L F ++ +Y++ Sbjct: 151 KLLEKNQGISGATNAAAALASGEYILLMDNDDELAPSALHEFYQKIKKE-GSEIIYSDMD 209 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQDYD 183 + + + K +S LF + +G+ + F K F E +QDYD Sbjct: 210 IIDAK----GKTRDPLCKPDWSPDLFLSQMYLGHLIGFKKSLFEKVGGFRGEFNGSQDYD 265 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR------ 237 + LRM + V E K + ++ DR Sbjct: 266 LLLRMTEMTDKIGHVPEILYHWRDLPSSTAANPESKPY-AQTAGLNAIQEHLDRVYGKGA 324 Query: 238 -ASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 + + + +Y +R + ++ +++ L Sbjct: 325 ATANETENLFVYDVRYHMNEEPKVSIIIPIKDHADLLKA 363 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 43/272 (15%), Positives = 82/272 (30%), Gaps = 35/272 (12%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQ-DYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M P +SI +P + L AI S+ + Y N+E+II+++ S E+ Y+ + + Sbjct: 342 MNEEPKVSIIIPIKDHADLLKAAIDSIYAKTTYKNFEIIILNNNS-EREETFTYLKKVKE 400 Query: 60 PRITYIHNDINSGACAVR--NQAIMLAQGE-YITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 I D R N + A G+ Y+ +D + P L+ + + Sbjct: 401 EHDNVIVKDAAFEFNWSRLNNYGMKFATGDVYVCLNNDVEVIEPEWLTRLVEKAIR-KDV 459 Query: 117 AFLYANDYVCQGEVYSQPAS--------------LPLYPKSPYSRRLFYKRNII---GNQ 159 + + P + SP+ + + Sbjct: 460 GVVGGLLLYEDNTIQHAGVVIGMGGWADHVFKGMKPQHYGSPFVSPMVTRNVSAVTGACL 519 Query: 160 VFTWAWRFKECLFDTELKAA-QDYDIFLRMVVEY-GEPWKVEEATQILHINHGEMQITSS 217 + A K FD + D ++ LR + +S Sbjct: 520 AVSKATIEKIGGFDEKFIVCGSDIELALRANQHGLVNIYDPNVRLYHY----ESKSRDAS 575 Query: 218 PKKFSGY------FHFYRKHKDKFDRASKKYQ 243 + + YRK+ D + + Y Sbjct: 576 KIPQIDFDLSDQMYKTYRKNGDPYYNRNLDYY 607 >UniRef50_B3E7E3 Glycosyl transferase family 2 n=1 Tax=Geobacter lovleyi SZ RepID=B3E7E3_GEOLS Length = 663 Score = 234 bits (597), Expect = 3e-60, Method: Composition-based stats. Identities = 67/282 (23%), Positives = 115/282 (40%), Gaps = 14/282 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +P +S+ MP +N ++ AI S+L Q + ++E+IIVDD ST + + +D RI Sbjct: 3 SPRVSVLMPVYNGERFLREAIDSILGQSFHDFELIIVDDGSTDSSAAC--IASYHDTRII 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL-YAN 122 + N+IN G RN+ I LA+GEYI +D DD T RL+ +A + A + Sbjct: 61 SVKNEINQGIVVTRNRGIELARGEYIALMDCDDISTEERLAEQVAFLDTDQSVAAVAACI 120 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFY--KRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + V + + + +P R F + + + A ++ + A + Sbjct: 121 ECVDENGTFLSGWADDRSTVTPDQIRAFLPRANCVANSSLMIRAKILRQYRYRGGRDAVE 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP-----KKFSGYFHFYRKHKDKF 235 DYD+ LR+V + + K+ I + + +F R + Sbjct: 181 DYDLLLRLVSDGLQVAKIPAVLLYYRIVNSSFSSAGRSLPPEYRHLRAKNYFLRHALVNW 240 Query: 236 DRASKKYQLFTLYQIRNKRMTWRTLLTLL----SVRNGKRLA 273 S Q+ R+ + L LL N +LA Sbjct: 241 RLNSFCLQVSVSLLRDAARLAVKRLFRLLLGDWPDDNRDKLA 282 >UniRef50_B4ECE8 Glycosyltransferase n=3 Tax=Burkholderia cepacia complex RepID=B4ECE8_BURCJ Length = 434 Score = 234 bits (597), Expect = 3e-60, Method: Composition-based stats. Identities = 58/261 (22%), Positives = 99/261 (37%), Gaps = 11/261 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 P +SI + ++N + I+SV+ Q +WE+IIVDDCST + A D RI Sbjct: 11 RPRVSIIVTSYNYARFIGETIRSVIDQTLDSWELIIVDDCSTDDSW--NVIQAFQDRRIH 68 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + D N GACA N A+ LA+GE+I +D DD + ++L+V +A Q Sbjct: 69 AVRFDKNRGACAAYNTALSLARGEFIASLDSDDVFNEDKLAVQVAFLDQHPEVDICGTYL 128 Query: 124 YVCQGEVYSQPASLPLYPK------SPYSRRLFYKRNIIGNQ--VFTWAWRFKECLFDTE 175 P+ +Y S + N + + + + + F E Sbjct: 129 EEIDDAGNPVPSESHMYAPWFNKELDLNSPDCWIWENHLCHSGVLMRKSVHDRVGSFREE 188 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPK-KFSGYFHFYRKHKDK 234 L + D++ +LR + + + + + + + R Sbjct: 189 LTYSPDWNFWLRALTAGATFHVIPQELVRYRSHGSNITHRNFDALMWEYADTSARTLHPY 248 Query: 235 FDRASKKYQLFTLYQIRNKRM 255 R + + QI KR Sbjct: 249 LMRVGRPDLIAKNVQIFVKRF 269 >UniRef50_D0MF25 Glycosyl transferase family 2 n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MF25_RHOM4 Length = 320 Score = 234 bits (597), Expect = 4e-60, Method: Composition-based stats. Identities = 52/214 (24%), Positives = 96/214 (44%), Gaps = 11/214 (5%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P +S+ + +N ++ AI+SVL QDY E+I+VDD ST ++ V +P + Sbjct: 4 ERPTVSVVIAAYNAERWIAGAIRSVLGQDYPVLEVIVVDDGSTD--ATREVVAPFQEP-V 60 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 Y+ + N+G+ A RN I LA+GE + +D DD W P +L+ +A Q + Y + Sbjct: 61 RYVFQE-NAGSAAARNHGIRLARGELVAFLDADDLWLPGKLAAQVACLQAHPDCGWCYTD 119 Query: 123 DYVCQGEVYSQPASLPLY---PKSPYSRRLFYKRNI-IGNQVFTWAWRFKECLFDT--EL 176 ++ S+ L P+ L + + + F + Sbjct: 120 AWLVDASTRSKKVRLSQLAAMPEGWVLEALLLSNFVPFSSALVRREALEAVGGFREGPDR 179 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHG 210 + ++D+D++LR+ Y +V E ++ + Sbjct: 180 RISEDWDLWLRIAAHY-PVCRVAEPLVLIRDHAD 212 >UniRef50_A4SDY7 Glycosyl transferase, family 2 n=1 Tax=Chlorobium phaeovibrioides DSM 265 RepID=A4SDY7_PROVI Length = 345 Score = 233 bits (595), Expect = 5e-60, Method: Composition-based stats. Identities = 53/259 (20%), Positives = 93/259 (35%), Gaps = 5/259 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MKN P ++I MP N ++ AI S+L+Q ++++E +I+DD ST V + +P Sbjct: 1 MKN-PQVTILMPVHNGERYLREAIDSILKQSFTDFEFLIIDDGSTDGSTTI--VKSYENP 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I N N G V NQ + LA+G+YI +D DD +RL + ++ Sbjct: 58 RIHLIANPENRGTVHVLNQGLALAKGKYIARMDGDDISLTDRLKKQVGFMEKHPEVGVSG 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + + +L + V + K + L Q Sbjct: 118 TAMRLI-KKGTLKNTRTLPETDEELKIQLLFSTCFFHPTVIMRSDLAKAHPYPENLIYTQ 176 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DY+ + + + + E +H + + + + K Sbjct: 177 DYNYWTSLADKTA-FANIPETLLYFREHHTQTSSKKADMQVMNARKIRCGYLQKLFGVHN 235 Query: 241 KYQLFTLYQIRNKRMTWRT 259 + QI R Sbjct: 236 AAEHDIHNQIAENRTDLNL 254 >UniRef50_A1ZLY5 Glycosyl transferase domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZLY5_9SPHI Length = 337 Score = 233 bits (595), Expect = 5e-60, Method: Composition-based stats. Identities = 45/230 (19%), Positives = 82/230 (35%), Gaps = 7/230 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + PL+S+ MP +N ++++S+L Q Y+N+E II+DD ST + D Sbjct: 1 MTDQPLVSVVMPVYNASPYLAQSMQSILNQTYANFEFIIIDDGSTDDSLK--VIEKYTDR 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N G NQ +A+G+YI +D DD R+ + + Sbjct: 59 RIQVYQHSQNQGVITALNQGFGMAKGKYIARMDADDYCEITRMEKQVHLMEAQPDTDLCG 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKA 178 + L + +I+ + FD Sbjct: 119 TWVAYITQHSKTITQL--PVESEAIKAFLLWGNSIMHATTMFRRSFLLKHHLKFDQGFVH 176 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFY 228 A+DYD++ R + + E + N ++ + + + Sbjct: 177 AEDYDLWTRCLAIAHFI-NIPEVLYYIRQNDQQVSVRFKALQIKNTQLIH 225 >UniRef50_B2TQH5 WsaE n=1 Tax=Clostridium botulinum B str. Eklund 17B RepID=B2TQH5_CLOBB Length = 1106 Score = 233 bits (595), Expect = 6e-60, Method: Composition-based stats. Identities = 52/276 (18%), Positives = 97/276 (35%), Gaps = 17/276 (6%) Query: 6 LISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ---LQQYVTALNDPR 61 SI +P +N + I S + Q Y NWE+ + D S D R Sbjct: 575 KFSIVVPLYNTPKNFLCEMINSCINQTYGNWELCLA-DGSDKEHSYVSKVVEDYIKKDKR 633 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y + N G N+ I +A GEYI D DD + L ++ + F+Y Sbjct: 634 IKYKILEANRGISENTNECIKMAIGEYIALFDHDDVLHQSALFEYMKVICE-KNADFIYC 692 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQ 180 ++ +V + K +S N I + + LF E +Q Sbjct: 693 DEDKFDTDV---NMRFDPHFKPDFSIDNLRANNYICHFTVFKKTLIQETGLFRKEFDGSQ 749 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+D+ LR+ + + + ++ + K ++ K+ R Sbjct: 750 DHDMILRLTEKSKNIVHIPKILYHWRVSSNSVASDPYAKPYT-IEAGINAVKEHLSRCDL 808 Query: 241 KYQLFT------LYQIRNKRMTWRTLLTLLSVRNGK 270 K + + +Y+I+ + + + L+ ++ Sbjct: 809 KATVESSTVHPNIYRIKYSIIDNQLISILIPNKDHI 844 Score = 60.5 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 50/275 (18%), Positives = 102/275 (37%), Gaps = 21/275 (7%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVL-RQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 +N LISI +P + R I S+L + Y N E+II+++ ST + + Y T Sbjct: 830 DNQLISILIPNKDHIIDLSRCINSILEKSTYKNIEIIIIENNSTEQKTFEYYKTLEKYKN 889 Query: 62 ITYIHNDINSGACAVR--NQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLV---- 114 I + + N + A+G + +++D + + N L L + Q+ Sbjct: 890 IKVVTYESKGKFNYSAINNFGVRYAKGNQLLFLNNDIEIISENWLEEMLMYSQREDVGAI 949 Query: 115 THAFLYANDYVCQGEVYSQPASLPLYPKSPYSRR-------LFYKRNIIG----NQVFTW 163 Y ND + + +L + ++R LF+ RN+ G + Sbjct: 950 GAKLYYPNDTIQHAGLGLGILTLAGHYFRHFNRDATGYMGNLFFTRNVSGVTAACIMMKK 1009 Query: 164 AWRFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 + + FD + A D D+ +R+ A H +P++ + Sbjct: 1010 SIFNEINGFDETFEVAFNDVDLCMRIRKAGYLIVWTPYAEA-YHYESISRGTEDTPERKA 1068 Query: 223 GYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMTW 257 + R+ ++++ + K + + R + Sbjct: 1069 RFNGEVRRFQERWKKELKDGDHYYNVNLTLDREDF 1103 >UniRef50_Q72X17 Glycosyl transferase domain protein n=4 Tax=Bacillus cereus group RepID=Q72X17_BACC1 Length = 321 Score = 233 bits (594), Expect = 7e-60, Method: Composition-based stats. Identities = 49/216 (22%), Positives = 95/216 (43%), Gaps = 8/216 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P I++ MP +N ++ ++ S+L Q ++++E++I++D ST + + + +D RI Sbjct: 2 PKITVLMPVYNGEKYLRESVDSILNQTFTDFELLIINDGSTDSSM--EILNSYSDSRIRI 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N++N N+ I LA GEYI +D DD P RL + L + ++ A Sbjct: 60 VTNEVNLRLIKTLNKGIDLATGEYIARMDCDDIADPKRLEIQLQYMEKHPDVAVCGTGVK 119 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAAQDY 182 V ++ L+ + +I V KE +D A+DY Sbjct: 120 VI---GQNRKPWQISGSSELIRNCLYVRSCMIHPSVMMRTEFLKEHNIYYDLNYLHAEDY 176 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 ++F R+ +Y + +EE + + ++ Sbjct: 177 ELFQRLSEKY-KVINLEEPLLNYRWSETSVSSLNAS 211 >UniRef50_C0W8E0 2 family glycosyl transferase n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W8E0_9ACTO Length = 544 Score = 232 bits (593), Expect = 9e-60, Method: Composition-based stats. Identities = 57/279 (20%), Positives = 96/279 (34%), Gaps = 17/279 (6%) Query: 1 MK-NNPLISIYMPTWN-RQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN 58 M + P SI P + R I+SV Q +WE I+VDD S LQ A Sbjct: 1 MNMSAPFFSILTPVYKPRPDHLELTIESVRSQTCEDWEWILVDDASGDQAVLQVLRRAAQ 60 Query: 59 -DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 DPRI + + N A N A+ A+G +I +D DD P+ L+V A Sbjct: 61 RDPRIHVVERETNGHIVAASNDALARAKGNWIILVDHDDLLVPDSLTVIKAAIDAHPRAG 120 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQV-FTWAWRFKECLFDTEL 176 ++Y ++ + + K +S + +G+ + F Sbjct: 121 YIYTDEDKIDNDGNL----SQEFRKPDWSPERLRHQMYLGHLSGLRHDLVTQVGGFHEGF 176 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 +QD+D+ LR+ E V I G S+ K + + + + Sbjct: 177 DGSQDHDLALRVTEISEEVVHVPRTLYHWRIVPGSTAGDSTAKDY-ASVAGLKAVQAHLE 235 Query: 237 RASKKYQLF--------TLYQIRNKRMTWRTLLTLLSVR 267 R + Y + + + ++ R Sbjct: 236 RVGLDATMTAEPHPTIPYTYNLHRELPEDTLVSVVIPTR 274 Score = 52.8 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 41/294 (13%), Positives = 89/294 (30%), Gaps = 61/294 (20%) Query: 6 LISIYMPTW--------NRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTA 56 L+S+ +PT R+ L + AI+S+L + E+++V D T E L + Sbjct: 266 LVSVVIPTRGTQGRIWGKRRVLVVDAIRSLLTHTSHEQLEIVVVYDLDTPQEVLTELKEV 325 Query: 57 LNDPRITYIHNDINSGACAVR-------------NQAIMLAQGEYITGIDDDDEWTPNRL 103 GA N+ + + G+ + ++DD E +R Sbjct: 326 A--------------GASLTLLPYDAPFNFSDKCNRGFLASSGDVVVFLNDDTEIISDRF 371 Query: 104 --SVFLAHKQQL---VTHAFLYANDYV------CQGEVYSQPASLPLYPKSPYSRRLFYK 152 + ++ Y + + Q + P + L Sbjct: 372 VEELCAPLVEEDVAATGALLFYEDSTIQHAGLGFQDNEFVHPYLGAQKTDPGHFAELCVD 431 Query: 153 RNIIGNQ---VFTWAW-RFKECLFDTELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHI 207 + G V T + F +L + D D ++ + + + Sbjct: 432 HEVSGLTGAAVATRRETFEEIGGFTLDLPSNFNDVDFAFKLQHKGKRMIWLHDVQAY--- 488 Query: 208 NHGEMQITSSPKKFSGYFHFYRKHKDKFDR-ASKKYQLFTLYQIRNKRMTWRTL 260 + ++ + F D++++ + Y F ++ + R L Sbjct: 489 HFESKSRVNTVHGWEHQF-----ILDRWEQPKTDPYMPFATTELTEIQKALRLL 537 >UniRef50_A3W9Y7 Glucosyltransferase n=1 Tax=Erythrobacter sp. NAP1 RepID=A3W9Y7_9SPHN Length = 328 Score = 232 bits (593), Expect = 9e-60, Method: Composition-based stats. Identities = 60/287 (20%), Positives = 117/287 (40%), Gaps = 27/287 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ MP ++ ++ A++SVL Q ++++E+I VDD T V +DPRI Sbjct: 4 PKISVVMPIYHVEKFVGEAVRSVLDQTFADFELICVDDGGTDASM--DIVRGFSDPRIRI 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G RN I A+GE++ +D DD W P +L++ H YA Sbjct: 62 VCQ-ANRGLAGARNTGIANARGEFVALLDSDDVWHPEKLALHYIHLMANREVGVSYAGSR 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN---QVFTWAWRF-----------KEC 170 + ++ ++ + + N +GN V + + C Sbjct: 121 MIDQNGNVLRVAMRPKTGRVTAKDIICR-NPVGNGSAPVLRRSALDLAVFPHPEEPSRTC 179 Query: 171 LFDTELKAAQDYDIFLRMVVEYG-EPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 FD + ++D ++++R+ V++ ++ I G + + K++ + R Sbjct: 180 WFDESFRQSEDIEMWIRLAVKHEVVFAGIDGLLTDYRIIPGALSA-NVVKQYLSWTKMLR 238 Query: 230 KHK-------DKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNG 269 K + ++ ++ YQL L + + + L+S G Sbjct: 239 KLRGYAPEFVEQHGDTARAYQLRYLARRSVQLGNFELARDLMSKAVG 285 >UniRef50_Q21E22 B-glycosyltransferase-like protein n=2 Tax=Alteromonadales RepID=Q21E22_SACD2 Length = 341 Score = 232 bits (593), Expect = 9e-60, Method: Composition-based stats. Identities = 64/271 (23%), Positives = 104/271 (38%), Gaps = 23/271 (8%) Query: 1 MKN---NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL 57 M N P+ S+ MP +N + AI+SVL Q Y +E+I VDD T V+ Sbjct: 1 MNNTINKPIFSVVMPMYNVAKYVATAIESVLAQTYGQFELICVDDGCTDNT--LDIVSTY 58 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 DPRI I N G A RN I A+G Y+ +D DD W P +L + L+H + Sbjct: 59 KDPRIRVIRQ-KNMGLSAARNTGISHARGLYVALLDSDDYWAPTKLQMHLSHFRNNPKLG 117 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN---QVFTWAWRFKEC---- 170 Y+ E K+ +++ +F + N IGN V + + Sbjct: 118 VSYSASQFIDEEGSLLGIGQYPKLKNIHAQDIFCR-NPIGNGSAAVLRKSMLLQMGKHVY 176 Query: 171 --------LFDTELKAAQDYDIFLRMVVEY-GEPWKVEEATQILHINHGEMQITSSPKKF 221 FD L+ ++D + +LR + + + E +N + + + Sbjct: 177 NNGEVRLTYFDERLRQSEDVEFWLRAALTSDWQFMGIGEGLTFYRVNASGLSANLTNQYQ 236 Query: 222 SGYFHFYRKHKDKFDRASKKYQLFTLYQIRN 252 + + Y + + YQ R Sbjct: 237 AWRVSVKNNYHLNPAFFDTWYSMASAYQKRY 267 >UniRef50_Q13AK4 Glycosyl transferase, family 2 n=5 Tax=Rhodopseudomonas palustris RepID=Q13AK4_RHOPS Length = 361 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 48/220 (21%), Positives = 84/220 (38%), Gaps = 4/220 (1%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +P + Q+ AI SVL Q S+ E++++DD ST DPR+ Sbjct: 21 PQVSVILPVRDGQRWLCEAIDSVLTQTLSDLELVVIDDGSTDATPALLDEVRARDPRVIA 80 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + + G A N+ + A+ I +D DD P+RL+ + L Sbjct: 81 LRQER-EGLVAALNRGLAQARAPLIARLDADDIALPDRLARQCDYLHAHPDVVLLGGWAE 139 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTW-AWRFKECLFDTELKAAQDYD 183 + S+ + P L K I V + + + +A +DYD Sbjct: 140 IIDENGASRGKQMRPNPSG-LRETLARKSPFIHPTVMVRADAARRVGGYRSAFEAGEDYD 198 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG 223 +LR+ + GE + E + G + T ++ Sbjct: 199 FWLRLA-DAGEIAILPEVLIRYREHGGSVTRTRELRQIYS 237 >UniRef50_Q0BPG4 Glycosyltransferase n=3 Tax=Acetobacteraceae RepID=Q0BPG4_GRABC Length = 990 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 52/272 (19%), Positives = 97/272 (35%), Gaps = 10/272 (3%) Query: 8 SIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQ-LQQYVTALNDPRITYI 65 SI MP +N I A+ SV Q NWE++ ++D S+ A DPRI + Sbjct: 420 SIVMPVYNTPQDWLIEALNSVRAQWCDNWELLCINDASSEPHVETILRAYAQQDPRIRIL 479 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N G N + A G+Y+T +D DD P+ + L + F+Y+++ Sbjct: 480 RTGNNVGIARATNFGLRAASGQYVTFMDHDDYLEPDAVYHLLKAAGE-TQADFIYSDEAT 538 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDYDI 184 + S + +S + I + + + +D + + D D Sbjct: 539 TDENIASIAEVRG---RPAFSHDYYLSHPYIVHMLCLKRELAHQLGGWDESMAISADVDF 595 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS--KKY 242 LR++ V + G + + + H D+ + ++ Sbjct: 596 VLRIIEHARTIAHVPRVLYRWRTHSGSTGHSKKQQVMEATQKAIQAHLDRLKTGATVEEG 655 Query: 243 QLFTLYQIRNKRMTWRTLLTLLSVRNGKRLAD 274 F ++I + R L ++ +N L Sbjct: 656 VWFNQFRINWPKSQGRVL-IVIPTKNKADLVK 686 Score = 53.5 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 41/287 (14%), Positives = 85/287 (29%), Gaps = 55/287 (19%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYS--NWEMIIVDDCSTSWEQLQQYVTALND 59 K+ + I +PT N+ L A++S+ + ++ +++VD ST E + + Sbjct: 667 KSQGRVLIVIPTKNKADLVKTAVESIEKTTPPDVDYRIVVVDHESTEPESRKYFHALAK- 725 Query: 60 PRITYIHNDINSGACAVR--NQAIMLAQG---EYITGIDDD-----DEW----------- 98 +I + N+A+ G E++ +++D D W Sbjct: 726 ---RHIVMPYQGPFNYSKINNEAVKK-HGDDCEFVLFLNNDVEAITDNWIDRMRSLASRQ 781 Query: 99 ----------TPNRLSVFLAH------KQQLVTHAFLYANDYVCQGEVYSQPASLPLYPK 142 P++ + H + HAF + + Y+ + + L Sbjct: 782 DVGAVGALLLYPDK---RVQHAGVIMGFNESADHAFKFVDAYLNDKGRRNLGYNCSLSSV 838 Query: 143 SPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEA 201 Y + + + F+ D D+ LR+ E + Sbjct: 839 RDY------SAVTAACLMMRKSVFDQLGGFEPRFGVGFNDTDLCLRV-REANLKVLYDGT 891 Query: 202 TQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLFTLY 248 T + H T + ++H Y Y Sbjct: 892 TVLFHYESATRSQTKQVMHPEDTDYLLQRHAAILKNGDPFYNPNLSY 938 >UniRef50_Q97H39 Glycosyltransferase n=2 Tax=Clostridium acetobutylicum RepID=Q97H39_CLOAB Length = 336 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 56/253 (22%), Positives = 101/253 (39%), Gaps = 8/253 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK P +SI MP +N ++ AI+S+L Q Y+++E IIVDD ST A D Sbjct: 1 MKVYPKVSIVMPVYNSERYLAEAIESILDQTYNDFEFIIVDDGSTDESYNIISSYANKDN 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I + + G N+ I +A+G+YI +D DD NR+ + L Sbjct: 61 RIIVISRE-HRGLVDSLNEGINIARGKYIARMDADDISINNRIEKQFEFLELNKDVDILG 119 Query: 121 AND----YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT--WAWRFKECLFDT 174 + + + ++ + S ++F I + K + Sbjct: 120 TRIEAFGDIDEKQKTIYNSAFSIKFDSQNIEQVFLTSCAIPHPSVMFKKDSIVKLRGYRK 179 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 E A+DYD++LR + + +++E +++ + + D Sbjct: 180 EYDTAEDYDLWLRAIRNGYKIVRMDECLIKYRVHNKSKTAVEMFNP-KMVEYTMKAKIDY 238 Query: 235 FDRASKKYQLFTL 247 + +KK ++ L Sbjct: 239 INDTNKKDKVDYL 251 >UniRef50_A0KUZ3 Glycosyl transferase, family 2 n=7 Tax=Gammaproteobacteria RepID=A0KUZ3_SHESA Length = 249 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 59/249 (23%), Positives = 103/249 (41%), Gaps = 7/249 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+SI MP++N + +I+SV+ Q Y NWE++I DD S + D Sbjct: 1 MQEQPLVSIIMPSYNSAKTIAESIESVISQTYQNWELLITDDVSVDNTRDIVRSYCAKDS 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI D+NSGA A RN +I ++G YI +D DD W ++LS +A +L Y Sbjct: 61 RIKLFELDVNSGAGASRNNSIENSKGIYIAFLDSDDLWFSDKLSTQIAFM-KLNNVLLSY 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + E + P S + N+IG + + + ++ Q Sbjct: 120 SAYQKFSREGDGG----VIIPPSSVTYEELLTGNVIGCLTAIYNAKVLGKRYMPLIRKRQ 175 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D ++L ++ + + + + ++ G + + FYR + Sbjct: 176 DMGLWLNILEDIDMAYGISDVLAKYRVDTG--MTKNKLNILKWQWAFYRDVLGLNFFKTI 233 Query: 241 KYQLFTLYQ 249 K + Y+ Sbjct: 234 KCFILYAYK 242 >UniRef50_B9JW01 Glycosyl transferase n=1 Tax=Agrobacterium vitis S4 RepID=B9JW01_AGRVS Length = 810 Score = 231 bits (591), Expect = 1e-59, Method: Composition-based stats. Identities = 53/281 (18%), Positives = 101/281 (35%), Gaps = 17/281 (6%) Query: 2 KNNPLISIYMPTWNRQQL--AIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQY-VTALN 58 +N PL+S+ P + R + A++SVL Q Y N E+++VDD S E A Sbjct: 274 ENTPLVSVICPVY-RPRYRDFKMAVESVLCQTYKNIELVLVDDASNDPELDIILDEFARR 332 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D R+ I + N N AI + G+ I D DD + + Sbjct: 333 DARVKLIKSTKNLNISEATNVAIANSSGDLIAFFDHDDLLEFCAIQFMVDALAAN-DALL 391 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELK 177 +Y+++ KS ++ RL +N + + + A ++ T+ Sbjct: 392 VYSDEDKIDD----AGYFSEPNLKSDWNLRLLMAQNYVCHFLIVSAEMVRKAGKLHTKYN 447 Query: 178 AAQDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 AQD+D LR+ + + E T K++ + Sbjct: 448 GAQDHDFVLRLSEIIPENRIYHIPEILYHWRKTPNSTASTIDNKEY-AIKAGIAAVQAYV 506 Query: 236 DRASKKYQLFTLYQIRNKRMTWR----TLLTLLSVRNGKRL 272 DR S + + + N + ++ + ++ ++ + Sbjct: 507 DRNSIAASVTSRGNMTNYVVNYKEKSEKVSIIIPFKDQIDM 547 Score = 50.8 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 36/278 (12%), Positives = 82/278 (29%), Gaps = 29/278 (10%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + + +SI +P ++ + + + S+L+ Y N+++I+V++ S S E Sbjct: 530 EKSEKVSIIIPFKDQIDMTRKCVYSILQNTDYKNFDIILVNNFSLSIEADGFRDEIEKLD 589 Query: 61 RITYIHNDINSGACAVR--NQAIMLAQGEYITGIDDDDE-WTPNRLSVFLAHKQQLVTHA 117 + D N R N A + + +++D P L+ + + Sbjct: 590 NVR--MMDANFAFNYSRINNLAAQNLKADAFLFMNNDVIVSDPTWLTELVGELLRSSDIG 647 Query: 118 FLYANDYV-------------CQGEVYSQPASLPLYPKSPYSRRLFYKRNII----GNQV 160 + G + P + R + + + Sbjct: 648 AVGCKLLYENGTVQHAGIILGVNGGIADHPYKGHEAHDPGFMGRGLCSQELSAVTGACLL 707 Query: 161 FTWAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP 218 + FD A D D+ L++ + E + + +S Sbjct: 708 CSSDVFQAVSGFDELNFGVAFNDVDLCLKIRNLGKKVIWRAETVLE---HRESLSRSSDL 764 Query: 219 KKFSGYFHFYRKHKDKFDRASKKYQLFTLYQIRNKRMT 256 + F ++H ++ Y + R T Sbjct: 765 EPQK-IARFAKEHVAFKEKWQVVLDNAPYYNPKFSRRT 801 >UniRef50_C4Z1F8 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1F8_EUBE2 Length = 833 Score = 231 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 59/284 (20%), Positives = 107/284 (37%), Gaps = 16/284 (5%) Query: 4 NPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYV--TALNDP 60 P +SI +PT+N + I+SV+ Q YSNW++ I D S +L++ + D Sbjct: 301 EPKVSIIVPTYNTPIKYLEEMIQSVINQSYSNWQLCIA-DGSCGNAKLEKVLEDYHKKDS 359 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y D N G N A+ LA GE +D DD P+ L + Q +Y Sbjct: 360 RIVYKLLDSNKGIAGNTNAALELADGEITGLLDHDDTLEPDALYEVVKAFQD-KMVDAVY 418 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAA 179 ++ G + ++ K+ Y+ L N I + K F + A Sbjct: 419 TDEDKILGPDWI---NVDPNFKTDYNIDLLRSHNYITHFFCVKTELLKEIGGFKADYDGA 475 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QDYD+ LR + + + + ++ + K + + + +D R Sbjct: 476 QDYDVILRCTEKARKTEHIAKILYHWRMHDNSTAANPASKAY-CHEAGRKAVEDHLKRLG 534 Query: 240 KKYQLFTLYQIRNKRMTW-----RTLLTLLSVRNG-KRLADGIR 277 ++ R+ + + ++ ++ L +R Sbjct: 535 IPAKVELCKLFGGSRVIYETPGNPLVSIVIPNKDHIDDLDKCVR 578 Score = 73.6 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 78/216 (36%), Gaps = 25/216 (11%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLR-QDYSNWEMIIVDDCSTSWEQLQ---QYVTALND 59 NPL+SI +P + + ++S+ Y N E+IIV++ ST E + ++ Sbjct: 557 NPLVSIVIPNKDHIDDLDKCVRSLFNVNTYKNIEVIIVENNSTQKETFEYYDSIQKEYSN 616 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAF 118 ++ + N A+ N + A+G+YI +++D + P+ +S L + + Sbjct: 617 VKVLVWKREFNY--SAINNFGVKEAKGDYILLLNNDTEMIAPDSISDMLGYCMR-PDVGI 673 Query: 119 LYANDYVCQGEVYSQPASL-------PLYPKSPYSRRLFYKRNIIGN--------QVFTW 163 + A G + + + ++ + R + + + Sbjct: 674 VGAKLLYPDGTIQHAGVIIGLGGIAGHAFIGLDANQYGYMSRAYLSSDYSAVTAACLMIS 733 Query: 164 AWRF-KECLFDTELKAA-QDYDIFLRMVVEYGEPWK 197 + + + A D D +++ + Sbjct: 734 KEIYNEVGGLCEQYAVAFNDVDFCMKVRSKGYLVVY 769 >UniRef50_A2TW95 Glycosyltransferase n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TW95_9FLAO Length = 286 Score = 231 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 55/277 (19%), Positives = 105/277 (37%), Gaps = 5/277 (1%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N+P +SI +P +N+ +++SVL Q + WE IIV+D S + D R Sbjct: 2 NSPKVSIIVPCYNQAHFLNDSLQSVLNQIFDAWECIIVNDGSPDNTEEVAQKWCAKDVRF 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 +I N G + RN I +++G++I +D DD + L L + + Sbjct: 62 KHIK-KTNGGLSSARNAGIAISRGDFILPLDADDLIHQDYLQKLLPVIENNDAIEVVSCY 120 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK-AAQD 181 + + + + P L Y+ I+ ++ + +D +K +D Sbjct: 121 SIFFRDTIDNVVNTR--KPNGTTIDALLYENIIMATSLYRKESWNRVGGYDESMKDGFED 178 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG-YFHFYRKHKDKFDRASK 240 +D ++ ++ + E V E M + + Y+KH + + Sbjct: 179 WDFWISILKDGAEFKIVPEFLFYYRKAAASMLTNTVANHLESVTKYIYKKHPEVYINKFD 238 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 + Y I+ + + L L R GK + R Sbjct: 239 TTMDYVFYLIKKHKASEDNLKKSLEYRIGKAVTKPFR 275 >UniRef50_A0YLN1 Glycosyl transferase, family 2 n=2 Tax=Bacteria RepID=A0YLN1_9CYAN Length = 1237 Score = 231 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 53/263 (20%), Positives = 102/263 (38%), Gaps = 22/263 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +P +N+ +A+ SVLRQ Y ++E+IIV+D S+ + NDPRI Sbjct: 2 PRVSVVIPAYNQAAYISQAVDSVLRQTYQDFEIIIVNDGSSDQTVEKILE--NNDPRIRL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G N I AQGE I + DD + P +L + Q + + + Sbjct: 60 FSFEQNHGESIATNYGIQQAQGELIAILHSDDVFVPKKLEKQVLFLDQNPQYQAVLSYPQ 119 Query: 125 VCQGEVYSQPASLPLY---------PKSPYSRRLFYKRNIIG--NQVFTWAWRFKECLFD 173 + + + P S + + F K N + + + + +D Sbjct: 120 LIDSQGENLPPSNSFLDQVFLQQNRTRFQWLNTFFSKDNCLCQTSSLIRKNCYSQIGFYD 179 Query: 174 TELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 + + D+D ++R ++Y E + + E ++ + +H Sbjct: 180 SRFRQIPDFDFWVRFCLKY-ELYILPEPLVNYRVHQSNISGIKPE--------TIIRHNF 230 Query: 234 KFDRASKKYQLFTLYQIRNKRMT 256 + + K+Y +Y+ K Sbjct: 231 ELSQTLKRYFCQDVYENLLKIFP 253 Score = 166 bits (422), Expect = 7e-40, Method: Composition-based stats. Identities = 50/257 (19%), Positives = 86/257 (33%), Gaps = 29/257 (11%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL+S+ +PT+N + +KS Q Y + E+I+ DD S+ I + Sbjct: 361 PLVSLIIPTYNGETFLTETLKSAFSQTYPHLEIILSDDGSSDKTLEIAQSFQQKTS-IPF 419 Query: 65 -IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N G N +I AQG+YI + DD P+ + + +Q +++ Sbjct: 420 EIFPHPNYGLVENLNFSISQAQGKYIKFLFQDDLLEPHCIETLVNLAEQDSEIGLVFSAR 479 Query: 124 YVCQGEVYSQPA--------------SLPLYPKSPYSRRLFYKRNII---------GNQV 160 V E + + L N + V Sbjct: 480 RVFIDENSQSNSLCQAALNGGKDLHQQWSHLKPIQLGQDLLSDANCLKGQLNKIGEPTTV 539 Query: 161 FTWAWRF-KECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP- 218 F K FD L D D++ R++ +Y + V + L I+ + + Sbjct: 540 LIPKAVFEKVGGFDENLHQLLDGDLWFRIMGDY-KIGFVNQTLSSLRIHTQQQTQKNISA 598 Query: 219 -KKFSGYFHFYRKHKDK 234 + Y Y+K Sbjct: 599 GQNLKDYQRLYQKMLFH 615 >UniRef50_C4KBW6 Glycosyl transferase family 2 n=1 Tax=Thauera sp. MZ1T RepID=C4KBW6_THASP Length = 300 Score = 231 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 59/251 (23%), Positives = 99/251 (39%), Gaps = 10/251 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +S+ +P +N RA+ SVL QD++++E+I+VDD ST + + D Sbjct: 4 TTEPTVSVVIPAFNAAWCIRRAVDSVLAQDFTDFELIVVDDGSTDDTA--EVLRGYGDV- 60 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + + N G RN I LA+G YI +D DD W P +L +A Q AF A Sbjct: 61 LRLVSQP-NGGMSRARNAGIRLARGRYIAFLDADDRWLPAKLGRQVALLDQRPELAFCAA 119 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII---GNQVFTWAW-RFKECLFDTELK 177 + + + +F +I + V FD L Sbjct: 120 TASLEDPDGKPVGIWACQGDGTASVAEVFAAHALIAGGASSVLARREPVQALGGFDEALF 179 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 A+D D+++R+ G + EA ++ G + + + +G RK++ Sbjct: 180 GAEDTDLWIRLAACGG-FACIPEALVVVLKRPGSVSR-NRARMRAGALAMTRKNRHLLPA 237 Query: 238 ASKKYQLFTLY 248 + LY Sbjct: 238 DKQGAYWRALY 248 >UniRef50_C4V652 Family 2 glycosyltransferase n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V652_9FIRM Length = 253 Score = 231 bits (590), Expect = 2e-59, Method: Composition-based stats. Identities = 58/248 (23%), Positives = 93/248 (37%), Gaps = 8/248 (3%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ MP +N + AI+SV Q Y WE+I+VDD S + L D RI I Sbjct: 11 VSVIMPVYNDTKYIREAIRSVSCQTYQFWELIVVDDGS-QQDIKMVLTPLLQDHRIHLIR 69 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 + N G RN I A G Y+ +D DD W P++L L ++ A Y + Sbjct: 70 LERNRGVANARNAGIQQATGRYVAFLDSDDVWLPDKLERQLQFMRKTK-AAMSYTSYRRF 128 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFL 186 ++ +P+ S + + N IG L + +DY +L Sbjct: 129 CHDIQDAGKPVPIV--SDVTYDKLLRGNCIGCLTVMIDRSQCASLSMP-MVHHEDYAAWL 185 Query: 187 RMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLFT 246 + + + + + G + + K + YR + S Y + Sbjct: 186 NLAKTGVRIYGMTDDLARYRVRGGSLSA-NKLKSLLWTWRVYRDSQQLTMLRSLIYMV-- 242 Query: 247 LYQIRNKR 254 Y IR R Sbjct: 243 HYVIRGIR 250 >UniRef50_D2Q6R3 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q6R3_9BIFI Length = 280 Score = 231 bits (590), Expect = 2e-59, Method: Composition-based stats. Identities = 71/244 (29%), Positives = 113/244 (46%), Gaps = 8/244 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +PT+NR +IKSVL Q + ++E+I+VDD ST + V A D RI Y Sbjct: 2 PKVSVVIPTFNRADTIGDSIKSVLEQTFKDFEVIVVDDGSTDGTES--VVAAFGDSRIKY 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I D N+GACA RN I A GEYI D DD W P +L L + + + Sbjct: 60 IMQD-NAGACAARNNGIRHANGEYIAFQDSDDYWMPRKLERQLQNISLHDSKIDICKMHS 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELKAAQDYD 183 + +P + ++ +S RN + Q+ F++ LFD ++ QD+D Sbjct: 119 AGLPKDEYEPCRQAI--EANFSYEYLLGRNFVSTQMILARHELFEDNLFDVKIPRFQDWD 176 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 + LR++ +Y + +E +I+ + + K Y +K+ D+ Y Sbjct: 177 LMLRLLRDYKLSY-TDEVLVERYISADSISQ-NVHKALEAYSLIEKKYSDELSSHRDSYS 234 Query: 244 LFTL 247 LF Sbjct: 235 LFMY 238 >UniRef50_Q6MBF6 Putative glycosyltransferase n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBF6_PARUW Length = 376 Score = 231 bits (590), Expect = 2e-59, Method: Composition-based stats. Identities = 48/233 (20%), Positives = 88/233 (37%), Gaps = 5/233 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M ++PL+SI +P +N+ A+ SVL+Q Y N+E++IVDD ST D Sbjct: 3 MHSSPLVSILIPVFNQVPFIFEALNSVLQQTYPNFEVLIVDDGSTDGTASICQEFCQKDS 62 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R Y++ N G A RN+ I + GEY +D DD+ R++V L + T +Y Sbjct: 63 RFQYLYQ-TNKGPSAARNRGIAASHGEYFCLLDGDDKMDFQRIAVQLKVLTENPTLDIVY 121 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIG--NQVFTWAWRFKECLFDTELKA 178 + + RN+I + + + Sbjct: 122 TALLLIDSNGSTIGEMHGQEINPENFLATLLFRNVIPGPSTIMAKRECLTSHPYHEHFVH 181 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 A+DY++ +R+ + ++ + + +++ Sbjct: 182 AEDYELMIRLA-HFYRFQYIDLPLTYYRRHLNNLS-NHLKAHRQAELKVIQQY 232 >UniRef50_C6M2Q8 Glycosyltransferase n=2 Tax=Neisseria RepID=C6M2Q8_NEISI Length = 1239 Score = 231 bits (589), Expect = 2e-59, Method: Composition-based stats. Identities = 53/252 (21%), Positives = 91/252 (36%), Gaps = 15/252 (5%) Query: 1 MKNN---PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL 57 M + P +S+ +P +N ++ + I+SVL Q + N+E+IIVDD S Sbjct: 1 MNHQAQSPTVSVMIPYYNCKEYIVETIQSVLSQSHQNFEIIIVDDGSDPEHADYLREFLA 60 Query: 58 NDPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 + P I Y N G A RN A LA G+Y +D DD P+ + + + Sbjct: 61 DKPAIRYAVQ-NNQGVAAARNHAARLAGGKYFLFLDSDDLILPDYIEKCVTVLENNPDCK 119 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTEL 176 +Y + L P L + A F FD L Sbjct: 120 LVYPLAEYFDAQ-----EGLWNLPDYDGLESLLMGNRFPSSVSMHRAEDFAALGGFDENL 174 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK-----FSGYFHFYRKH 231 +D+D ++R++ G ++ E + + ++ + Y K+ Sbjct: 175 TTHEDWDFWIRLLSNGGTVIRIPEVLFRYRKRRDGSSLINRLEQNPDLNREDWQKVYEKN 234 Query: 232 KDKFDRASKKYQ 243 + F + Y Sbjct: 235 RALFMQYHLGYS 246 >UniRef50_A6Q2S9 Glycosyl transferase n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q2S9_NITSB Length = 272 Score = 231 bits (589), Expect = 3e-59, Method: Composition-based stats. Identities = 49/213 (23%), Positives = 85/213 (39%), Gaps = 7/213 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL S+ +P +NR RAI+SVL Q + ++E+I+VDD ST I Sbjct: 7 SPLFSVIIPVYNRANFIARAIESVLHQTFRDFELIVVDDGSTDETPKVLERYP-----IQ 61 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I + N G A RN+ I A+G+ I +D DDEW N L Q + ++ Sbjct: 62 VIRQE-NKGVSAARNRGIKAAKGKIIALLDSDDEWKKNHLRTHADFFQSHPEYTIHQTDE 120 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQDY 182 + + + I + V F + LF + + +DY Sbjct: 121 IWIRNGKFLNKKKIHQKKSGYIFYDSLRLCLISPSAVAIKKELFNEVGLFREDFEVCEDY 180 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 +++LR+ +Y + + + ++ Sbjct: 181 ELWLRITKKYPVGFTPIQTVIKYGGHEDQLSQK 213 >UniRef50_C6ARE2 Glycosyl transferase family 2 n=4 Tax=Rhizobium RepID=C6ARE2_RHILS Length = 390 Score = 231 bits (589), Expect = 3e-59, Method: Composition-based stats. Identities = 59/280 (21%), Positives = 104/280 (37%), Gaps = 19/280 (6%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N PL+S+ +P +N + R ++S RQ Y N E+I+V+D ST AL D RI Sbjct: 16 NGPLVSVVIPAFNASRYIERTLRSAGRQTYRNLEIIVVNDGSTDDTARVVEQVALADSRI 75 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-LVTHAFLYA 121 + N G A RN I A G ++ +D DD W ++ + + A +Y Sbjct: 76 RLLSTP-NRGVAAARNTGIREAAGRFVAFLDADDLWHHTKIEKQVNALNRLTPQWAAVYV 134 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN---QVFTWAWRFKECLFDTELK- 177 Y+ + + P + R + +GN + + FD+ Sbjct: 135 LHYIINSDDEILRSCRPDVARGYIFARHLTFK-YVGNGSALLVRRDVALEIGGFDSSYAA 193 Query: 178 ----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR--KH 231 +D D L++ Y + + E G M P R Sbjct: 194 AGIGGCEDLDFELKLAARY-QIEVIPERLVGYRQYPGNMSSNHLPMGRGALEVIRRSLAA 252 Query: 232 KDKFDRASKKYQL-----FTLYQIRNKRMTWRTLLTLLSV 266 + + + + + + ++ R R T+ +L T+ S+ Sbjct: 253 NPQLPQYATRSAINATQKYAFWEFRQARNTYLSLATIWSI 292 >UniRef50_A8ZWI7 Glycosyl transferase family 2 n=2 Tax=Desulfobacteraceae RepID=A8ZWI7_DESOH Length = 289 Score = 230 bits (588), Expect = 3e-59, Method: Composition-based stats. Identities = 59/233 (25%), Positives = 92/233 (39%), Gaps = 9/233 (3%) Query: 1 MKNN--PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN 58 M PL+S+ +PT+NR + A+ SVL QDY E+IIVDD S+ + Sbjct: 1 MNRTSLPLVSVVIPTYNRGPMVTEAVASVLAQDYPAIEIIIVDDGSSDDTPGRLSPLK-- 58 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 R+T I + N G A RN + A GEYI +D DD W P ++S A Sbjct: 59 -NRVTIITQE-NRGVSAARNAGVAHAGGEYIAFLDSDDRWLPEKISTQTAFFASRPDALI 116 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELK 177 + + P + + + V F FD L Sbjct: 117 CQTEETWIKNGKPLFPRARHKKRSGMIFEPSLELCLVSPSAVMMRKDFFLDVGGFDESLP 176 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 A +DYD++LR+ + + + +E I + Q+++ P + K Sbjct: 177 ACEDYDLWLRISMVH-PVYLIETPLVI-KQGGHDGQLSAMPGLDRYRIYAICK 227 >UniRef50_D0D9G6 Glycosyl transferase family 2 n=1 Tax=Citreicella sp. SE45 RepID=D0D9G6_9RHOB Length = 319 Score = 230 bits (588), Expect = 3e-59, Method: Composition-based stats. Identities = 62/269 (23%), Positives = 99/269 (36%), Gaps = 8/269 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + P +S+ +P NR L RAI SVL Q Y + E+I VDD ST + +DP Sbjct: 22 LSEVPTVSVVLPVHNRSSLVTRAIDSVLGQTYRDIELIAVDDASTDDTPDTLARYS-HDP 80 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R + N N GA RN I LA+G Y+ D DD W P +L L Q Y Sbjct: 81 RFRTVRNAQNLGAAGTRNVGIGLARGRYLAFQDSDDRWFPEKLETQLKVLQASADCRACY 140 Query: 121 AN--DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELK 177 + Y + Y PK + + +FD L+ Sbjct: 141 SGAVYYSKEHSYYIPREGAHGAPKGDIFADTLMINPTTPQTLLIERSLLTETGVFDDTLR 200 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF-- 235 +D+D+ LR+ V E +++ + + + + ++ + F Sbjct: 201 INEDWDLALRLA-RATPFAFVPEPLVMIYRTPDSVS-SHMARDTAFRVRMLDQYAEDFAH 258 Query: 236 DRASKKYQLFTLYQIRNKRMTWRTLLTLL 264 + A++ Q + L R + Sbjct: 259 NTAARSRQNYVLGSQYLALGDHRNAMRYF 287 >UniRef50_B4WF86 Glycosyl transferase, group 2 family protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4WF86_9CAUL Length = 606 Score = 230 bits (588), Expect = 4e-59, Method: Composition-based stats. Identities = 62/281 (22%), Positives = 103/281 (36%), Gaps = 11/281 (3%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 + PL+S+ +P + L + S+L Q Y N+E+I+VDD + D Sbjct: 79 LTARPLLSVVVPLYKTPYPLLRACLASLLDQAYPNFELILVDDAPAAPTGPLAEKLVNGD 138 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 R+ I ++N G N + A+GE+I ID DDE P L F+ H + Sbjct: 139 DRVRIIQPEVNGGISRATNLGVAAARGEFILFIDHDDELVPESLLSFVEHIVRHPEVDAW 198 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 Y++ C G +L + K +S IG+ + A + LFD+E Sbjct: 199 YSDQVTCDG----AGKTLHHFFKPEWSPTYLLGVMYIGHLLAVRAEICAKTLFDSEFDGV 254 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFH--FYRKHKDKFDR 237 QD++ LR+ + V A G + S K KH + R Sbjct: 255 QDFEFMLRVAEQTQRVGHVPGALYKWRATEGSLASGSDEKTGIDALQRLAVEKHLLRQGR 314 Query: 238 ASKKYQLFTLYQIRNKRMTWRT----LLTLLSVRNGKRLAD 274 + + RT + ++ N + + Sbjct: 315 TWRAVSSNRHRHRVLLKPGPRTVEPRISIIIPTCNQGEMIE 355 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 69/223 (30%), Gaps = 27/223 (12%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 P ISI +PT N+ ++ R + S+ Y ++E+I+VD + + + A + + Sbjct: 338 EPRISIIIPTCNQGEMIERCLDSIFAMTDYPDFEVIVVD----NRTTEPRALKAFDRHPV 393 Query: 63 T--YIHNDINSGACAVRNQAIMLAQGEYITGIDDD-DEWTPNRLSVFLAHKQQLVTHAF- 118 N A N + + GE++ +++D + + LS + + + A Sbjct: 394 KRVVFDQAFNY--SAANNAGVTASNGEFLLFLNNDTEVLDTDWLSDLVMYFEDRQVGAVG 451 Query: 119 ---------LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFY-----KRNIIGNQVFTWA 164 + V + + + + Sbjct: 452 PTLLYPARTVQHAGVVVGARGTADHVMRHFHEDWDGYAGSLVAAREASGVTAACLMMRRS 511 Query: 165 WRFKECLFDTEL-KAAQDYDIFLRMVVEYGEPWKVEEA-TQIL 205 F + K QD D+ L++ V + Sbjct: 512 LFDDIGGFSEDYAKHYQDVDLCLKIRERDLRILCVGQPRLIHY 554 >UniRef50_Q97H38 Glycosyltransferase n=1 Tax=Clostridium acetobutylicum RepID=Q97H38_CLOAB Length = 333 Score = 230 bits (587), Expect = 5e-59, Method: Composition-based stats. Identities = 52/281 (18%), Positives = 100/281 (35%), Gaps = 10/281 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +IS+ MP +N ++ +I+S+L+Q Y ++E IIV+D S A +D RI + Sbjct: 1 MISVIMPVYNCEKYLEESIESILKQTYRDFEFIIVNDGSNDKSIDIINKYANDDNRIVVV 60 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 D N G N+ I A+G Y+ +D DD P R + + + L Sbjct: 61 SRDNNMGMVYSLNEGIDRAKGSYVARMDADDIALPERFERQIEYLNKNKDVDILACKVEA 120 Query: 126 CQGEVYSQPASLPLYPKSPYS-----RRLFYKRNII-GNQVFTWAWRFKE-CLFDTELKA 178 Q + + LF + I V K ++ K Sbjct: 121 FGDVSREQKLEREHWYNVDLNNSESIESLFLENCYIAHPSVMMKMSVLKALGGYNLNYKR 180 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR- 237 +DY+++LR + + + +EE + +++ I + FS + + Sbjct: 181 TEDYNLWLRAIAKGYKIAMLEEKLMKIRLHNDSK-IHRDAEGFSSIRDIIQSRLEYVKEK 239 Query: 238 -ASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 K + ++ + + + D + Sbjct: 240 LKLKDFSYVIWGASNGGKIAYEKIKEVFPNAKLNGYIDKFK 280 >UniRef50_B1QL95 Glycosyltransferase, group 2 family n=2 Tax=Clostridium botulinum RepID=B1QL95_CLOBO Length = 337 Score = 230 bits (587), Expect = 5e-59, Method: Composition-based stats. Identities = 61/287 (21%), Positives = 104/287 (36%), Gaps = 19/287 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M N LISI MP +N ++ +I+S+L Q Y N+E II++D ST D Sbjct: 1 MSINSLISIIMPVYNAEKYLRESIESILNQTYKNFEFIIINDGSTDDSLKIINGYYKKDC 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I + N G N+ I +A+GEYI +D DD +RL + + + L Sbjct: 61 RIKIISRE-NKGLVYSLNEGISIAKGEYIARMDGDDVCNLDRLEKQIKYMKLNPNVDILG 119 Query: 121 ANDYVCQGEVYSQPASLPLYP-------KSPYSRRLFYKRNIIGNQVFTWAWRFKECLFD 173 + + + S Y K LF I KE F Sbjct: 120 SYVEIMADKEVSYRTINIAYEGFNVPINKDNIKNYLFLHCTICHPSSMMKKVFLKEINFY 179 Query: 174 TE-LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQI--TSSPKKFSGYFHFYRK 230 + K ++DYD+++R + ++E ++ + S Y + Sbjct: 180 SNRYKTSEDYDLWMRALKRGYNIDNIKEPLLKYRLHKESKSQVESKSNVVLIDYTNIRLD 239 Query: 231 HKDKFDRASKKYQL--------FTLYQIRNKRMTWRTLLTLLSVRNG 269 + ++ + Y + F I + + + + G Sbjct: 240 YIEELLQNKSTYVIWGASKGGEFVKDIIAKRFPNLKLKYYIDKYKIG 286 >UniRef50_C6MPB4 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M18 RepID=C6MPB4_9DELT Length = 307 Score = 229 bits (586), Expect = 5e-59, Method: Composition-based stats. Identities = 50/225 (22%), Positives = 79/225 (35%), Gaps = 5/225 (2%) Query: 11 MPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHNDIN 70 M +N Q A+ S+L Q +++E II+DD ST + DPRI I + N Sbjct: 1 MSVYNGSQYLREAVDSILNQTCTDFEFIIIDDGSTDGTAAILD--SCTDPRIRVIRH-KN 57 Query: 71 SGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVCQGEV 130 G N I AQG Y+ D DD P RL+ A + + V Sbjct: 58 QGLTKSLNLGIREAQGTYVARQDADDRSFPERLATQAAFLDSHPEIPLVGSAVKVISSAG 117 Query: 131 YSQPASLPLYPKSPYSRRL-FYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFLRMV 189 + + L Y G+ +F + +D AQDYD++LR Sbjct: 118 AELATFRHPTDSAEIATTLRTYNCFWHGSIMFRRDSFLELGGYDERFVTAQDYDMWLRFS 177 Query: 190 VEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 + + E G + + + R +++ Sbjct: 178 ERH-LLANLPEPLYAYRFTAGSVTVKRMVSQHRLAVLARRLAEER 221 >UniRef50_B4CZV6 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZV6_9BACT Length = 306 Score = 229 bits (586), Expect = 6e-59, Method: Composition-based stats. Identities = 58/261 (22%), Positives = 102/261 (39%), Gaps = 9/261 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M +S+ +P +N AI SVL Q + ++E+I+VDD ST DP Sbjct: 1 MPEQ--VSVVIPAYNYAHFLPEAIASVLAQTWEHFELIVVDDGSTDNTPEVC--ARYTDP 56 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFL 119 R+ + N+G A RN I A+ ++ +D DD W P L+ ++L T A + Sbjct: 57 RVRAVRQ-TNAGLSAARNTGIREARYPFVAFLDADDRWQPEFLATVFREFERLGPTFAAV 115 Query: 120 YANDYVCQ--GEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTEL 176 + GE + P L+ + + + + + F+E FDTEL Sbjct: 116 GTSCSRMNAIGEPLAPPRQNFLHTSELTVKSFCLRNRPLSSSIVVRRAVFEECGYFDTEL 175 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 ++++D D+++R+ + + E I+ + M + K + R K Sbjct: 176 RSSEDRDMWIRITARGHRFFYLGEPLAIIRRHPQNMSKNAPRMKRNSGLVLRRAWKAGAV 235 Query: 237 RASKKYQLFTLYQIRNKRMTW 257 I + W Sbjct: 236 PRGDVIFWLRTAAIHYFLVAW 256 >UniRef50_B4D2E1 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D2E1_9BACT Length = 372 Score = 229 bits (586), Expect = 6e-59, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 90/238 (37%), Gaps = 10/238 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL+++ +P N A++S+ Q + ++E++++DD ST + ++A+ DPR+ Sbjct: 7 SPLVTVLLPVRNGSAHLPAALESIFAQTFRDFELLVIDDGSTDSTP--EILSAVQDPRLR 64 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + N N G N+ + +A+GE++ D DD P RL + + + + Sbjct: 65 VVTNPQNIGLVPALNRGLEMARGEFVARQDHDDVSLPERLQKQVDYLRTHAGCVLVGTEA 124 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAAQD 181 G+ L + L + I + V ++ + ++D Sbjct: 125 TQTDGQGRRMYRLLRPHGAEDIRWYLCFDNAFIHSSVMFRREVVRQVFGGYAPSF-HSED 183 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHG----EMQITSSPKKFSGYFHFYRKHKDKF 235 Y ++ R+ E + E + + M + R++ Sbjct: 184 YALWSRIA-RSRETANLPEPLLLYREHSSSVTQSMSAADEVRFEEATGAIRRENLRAL 240 >UniRef50_C7NBG2 Glycosyl transferase family 2 n=4 Tax=Fusobacteriaceae RepID=C7NBG2_LEPBD Length = 255 Score = 229 bits (586), Expect = 6e-59, Method: Composition-based stats. Identities = 58/250 (23%), Positives = 100/250 (40%), Gaps = 7/250 (2%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +SI +P +N ++ + I+SVL Q Y NWEM+I++D ST + A D RI ++ Sbjct: 12 VSIIVPMYNAEKFIGKTIESVLAQTYQNWEMLIMNDVSTDNSLAIVSLYAKKDERIKIVN 71 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 + N G RN I LA G+YI +D DD W +L + ++ + Sbjct: 72 TEKNVGVVKGRNFLIDLASGKYIAFLDADDYWHNEKLEKQIKFMKE-KNASISCTEYTRV 130 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFL 186 + + K S K N +G + + + EL+ +DY ++L Sbjct: 131 KENEEKIN---DVIIKEEISYNDMLKNNYLGCLTVIYDAKKIGKRYFKELEKNEDYVLWL 187 Query: 187 RMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLFT 246 +V + + ++E + + ++ K + YRK + K F Sbjct: 188 EIVKDVNTIYGLKENLAYYRVLDNS-RSSNKVKTAKVRWEIYRKIEKL--SLLKSLYYFL 244 Query: 247 LYQIRNKRMT 256 Y IR Sbjct: 245 HYAIRAVLKN 254 >UniRef50_B4AW59 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AW59_9CHRO Length = 1162 Score = 229 bits (586), Expect = 6e-59, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 72/209 (34%), Gaps = 1/209 (0%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +S+ +P +N + IKS+ Q +WE II+DD S AL D R+ I Sbjct: 3 VSVIIPAYNAVETLEETIKSLQAQTLPHWEAIIIDDGSQDETAALATQLALEDNRLRVIT 62 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 NSG A RN+ I A +++ +D DD P L + ++ Sbjct: 63 Q-ANSGVSAARNRGIEEAHFDWLLFLDADDWIAPEYLEKMTGALTTDPSLDIIHCGWTFV 121 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIFL 186 + + + II + + A FD L +D+D + Sbjct: 122 APDGTLMGEKYTPDLIDLFPLLSYSSTFIIHSCIVRKALVIAAGGFDRSLAVCEDWDFWQ 181 Query: 187 RMVVEYGEPWKVEEATQILHINHGEMQIT 215 R+ V+E + + Sbjct: 182 RIARTGSRFGAVKEILAFYRMRPKSLSRN 210 >UniRef50_B4WMJ9 Glycosyl transferase, group 2 family protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WMJ9_9SYNE Length = 313 Score = 229 bits (585), Expect = 8e-59, Method: Composition-based stats. Identities = 54/231 (23%), Positives = 93/231 (40%), Gaps = 6/231 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 PL S+ +P +N Q ++SV +Q ++E+I++DD ST + V+ +DPR+ Sbjct: 2 PLASVIVPVYNGAQTIHATLESVFQQTAQDFELIVIDDGSTDST--LEVVSKFDDPRLRV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 N+G RN+ + A G+YI+ ID DD WTP +L++ L + A Y+ Sbjct: 60 FSY-CNAGVATSRNRGVDHASGKYISFIDADDLWTPKKLALQLEALESTPEAAVAYSWTN 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNI--IGNQVFTWAWRFKECLFDTELKAAQDY 182 E + + L + + N + FD L A D+ Sbjct: 119 YIDQEGRFIDKAQRVDFSGDVYAELLLRDFLESASNVTIRHQVFLEFGGFDLSLTGAADW 178 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 D FLR+ Y V + + M + ++ R ++ Sbjct: 179 DFFLRLAKHYSFVA-VPHLGVLYRLLSSSMSANLAIQEKECLTVLDRAYEQ 228 >UniRef50_C6BYR5 Glycosyl transferase family 2 n=5 Tax=Desulfovibrionales RepID=C6BYR5_DESAD Length = 292 Score = 228 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 52/230 (22%), Positives = 101/230 (43%), Gaps = 7/230 (3%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 N P +S+ +PT+NR RA++SVL Q ++++E ++VDD ST + + +D R Sbjct: 5 NNEPKVSVIIPTYNRADRVCRAVESVLAQKFTDFECLVVDDGSTDDTA--ERLAEFDDSR 62 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + + + N G A RN I A G++I +D DDEW P +L + ++ Sbjct: 63 LKILRQE-NKGVSAARNFGIAAAIGDFIALLDSDDEWVPEKLLKQIPFMEE-AGFEISQT 120 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQV-FTWAWRFKECLFDTELKAAQ 180 ++ + P+ + R + + V F+ + + FD ++ A + Sbjct: 121 DEIWIRKGKRVNQCKKHEKPEGMFFDRSLEMCMVSPSCVIFSRKFWDEIGPFDEDMPACE 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILH-INHGEMQITSSPKKFSGYFHFYR 229 DYD+++R ++Y + E I H ++ + + + Sbjct: 181 DYDLWIRAGLKY-PVGLLRERLTIKHGGRPDQLSNSVGCLDLYRIYAIVK 229 >UniRef50_A8FXL4 Glycosyltransferase involved in cell wall biogenesis-like protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FXL4_SHESH Length = 317 Score = 228 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 86/276 (31%), Positives = 133/276 (48%), Gaps = 9/276 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 + + P++S+YMPT NR+ L RA++SVL Q Y N+E+IIVDD S A ++P Sbjct: 29 LDHGPMVSVYMPTHNREHLLKRAVESVLAQTYQNFELIIVDDGSRDGSPDYLQSLASSEP 88 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ + I GACA RN AI A+G+YITG+DDDDE+ PNRL L+ + +AF+ Sbjct: 89 RVRFFCQPIAQGACAARNIAIKEAKGKYITGLDDDDEFLPNRLEQLLSSYDE--KYAFVC 146 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELKAA 179 + G + + + N NQV T F+ E A Sbjct: 147 QGTFWHYGSHKKALDANAM----IINLSQMLDYNYSTNQVLTETKRLQSIGGFNPEFPAC 202 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QDYD + R++++YG ++ A+ I+H H +I S GY F ++ + + Sbjct: 203 QDYDTWTRLILKYGAAKRIAGASYIIHQGHEGPRIISKSNMQRGYKKFSEEYGSLMSKGN 262 Query: 240 KKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADG 275 + Q + + R ++ L SVR G + Sbjct: 263 RINQ--KYLALVSSRKRFKLLDFFQSVRYGLVVKKC 296 >UniRef50_B8J3S6 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8J3S6_DESDA Length = 352 Score = 228 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 95/237 (40%), Gaps = 4/237 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + P I++ +P +N +Q A+ S+L Q ++N++++I++D ST A DPR Sbjct: 7 SHTPSITVLLPVFNGEQFIKEAVTSILAQSFTNFDLLIINDGSTDGTGEILQRLAAKDPR 66 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I IH N G A N+ I L E++ +D DD P+RL+V + + A Sbjct: 67 IR-IHQRENRGLIATLNEGIALCSSEFVARMDADDCALPHRLAVQYDYMTKHAETAVCST 125 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + + + Y ++ +R LF + + +D + A+D Sbjct: 126 G---MEEYETGRTLTWEGYGEAVRARLLFGCCLYHPTILARRSALLAVGGYDAAMPCAED 182 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 YD+++R+ + + ++ + + H ++ A Sbjct: 183 YDLWMRLTEAGYSLAVLPQVLLRYRVHPHLSRAAYKWQMGETTAHIQQRQLSLLGIA 239 >UniRef50_D1YZY1 Putative glycosyltransferase n=1 Tax=Methanocella paludicola SANAE RepID=D1YZY1_METPS Length = 299 Score = 228 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 49/227 (21%), Positives = 86/227 (37%), Gaps = 3/227 (1%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +SI +P +N+Q+ A+ SV+ Q +WE IIV+D ST + D R Y Sbjct: 2 PTVSIIIPAYNQQEYIQSALSSVIDQTLKDWEAIIVNDGSTDNTDIIAQEFIKKDSRFNY 61 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I+ D N G RN + LA+G+Y+ +D DD L + + ++ V + Sbjct: 62 IYQD-NKGLSEARNTGLKLAKGKYVVFLDSDDILDTQMLELTTGYLEKHVNIDIVNGAWD 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYD 183 + + Y L + + F FD L + +D+D Sbjct: 121 QIDDNGKAISRRFGPIIRKNYLYDLMLTNLFPVHSLLIKKAVFDYCGFFDVALNSHEDWD 180 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 ++LR + + + I+ M + + + K Sbjct: 181 MWLRALSNGYKMGYLNHLIAHYRIHPQSMT-KNKQRMINSTCQVLDK 226 >UniRef50_B7JHV6 Glycosyl transferase, group 2 family protein n=52 Tax=Bacillus RepID=B7JHV6_BACC0 Length = 266 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 57/266 (21%), Positives = 94/266 (35%), Gaps = 20/266 (7%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 P +S+ P++N + I SV Q Y NWEMIIVDD ST + D R Sbjct: 9 NKAPHVSVITPSYNSIRFIGETIVSVQNQSYENWEMIIVDDASTDESVTKIKEIIEGDSR 68 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I + N GA RN AI A+G YI +D DD W P++L L +++ +F YA Sbjct: 69 IRLVSLKENIGAAKARNIAIQEARGRYIAFLDSDDIWLPHKLKTQLLFMEEM-NVSFSYA 127 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA-- 179 + + + + Y IIG + Sbjct: 128 SYSLIDENGNELNRKVNVPKSVDY--HCLAGNTIIGCLTVIID--------RERIPHIEM 177 Query: 180 -----QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 +D ++L+++ E E +++ I + + K Y+ R + Sbjct: 178 PSVQPEDTALWLKLLHEGHEAKGIQQVLAKYRIVANSVSR-NKIKAAFRYWKLLRD-QKC 235 Query: 235 FDRASKKYQLFTLYQIRNKRMTWRTL 260 + Y ++ + Sbjct: 236 LNAVQIFYYFSKYAYHAYRKNKINVV 261 >UniRef50_C2Y456 Teichuronic acid biosynthesis glycosyl transferase n=2 Tax=Firmicutes RepID=C2Y456_BACCE Length = 253 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. Identities = 63/234 (26%), Positives = 103/234 (44%), Gaps = 4/234 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M L+SI P +N ++ I+SV+ QDY+NWE+IIVDD S +L ++ Sbjct: 1 MYMRELVSIITPVYNTEKYLRNTIESVINQDYANWELIIVDDYSKDRSRLIIEQYMESEK 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I D N G RN AI A G+YI +D DD W PN+LS+ L Q+ +AF + Sbjct: 61 RIKLIALDENRGVTFARNTAIKAANGKYIAFLDSDDLWHPNKLSIQLGFMQKN-NYAFSF 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 ++ + + Y RL N IG E + +L+ + Sbjct: 120 SSYEWMNEDGTRFNKIIEAPSVVDY-HRLLRGGNPIGCLTVVIDKEQVEDIEMPDLR-HE 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 DY +L ++ + + + + + ++ K ++ YRK++ Sbjct: 178 DYATWLSIMKSGVIAYGINQNLALYRKSSN-SLSSNKFKTIIWTWNVYRKNQKL 230 >UniRef50_D2YGE4 Putative uncharacterized protein n=1 Tax=Vibrio mimicus VM603 RepID=D2YGE4_VIBMI Length = 316 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 88/231 (38%), Gaps = 5/231 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK+ L+S+ MP +N ++ AI+S+L Q Y N E+I DD S ++ D Sbjct: 1 MKDK-LVSVIMPCFNCEEYLEEAIESILAQTYRNLEIICCDDGSQDTTFQILVKYSMLDE 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + N N G N I + G++I +D DD +R+ + + + Sbjct: 60 RIRVLKNVENRGLIYSLNLMIEESTGDFIARMDSDDISADDRIESLVKYLDDN-GLDCVG 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 + S L + L + + + T + K + + + Sbjct: 119 SYYTCF---GASNKVVLLPSEHEDIAFSLMFSNVMCHPSMLTRSCCLKSHKYSPDFSHVE 175 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKH 231 DY+++ R+V + + ++ ++ ++ ++ ++ Sbjct: 176 DYELWTRLVSSGFKFGNIPKSLLKYRVHSNQISKKKYNEQLPKLSIIKSQY 226 >UniRef50_B0G3Y0 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G3Y0_9FIRM Length = 618 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 59/273 (21%), Positives = 103/273 (37%), Gaps = 17/273 (6%) Query: 6 LISIYMPTWNRQQ-LAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN----DP 60 SI P +N + I I+SV Q Y+NWE+ + D S E D Sbjct: 101 KFSIITPLYNTPKKFLIELIESVENQTYANWELCLA-DGS-DKEHSYVGEFCKKKMSNDD 158 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI Y + N G N+ I LA G Y +D DD P+ L + F+Y Sbjct: 159 RIQYHALEKNEGISDNTNECIKLATGNYYALLDHDDLLHPSALYEVSKVIEA-KGADFIY 217 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKEC--LFDTELKA 178 ++ + +YS K +S+ N I + + + E Sbjct: 218 TDEVKFRKNIYSIDNPYYFNLKPGFSKYDLRSHNYICHLTVFSKQLLESEPVFYRHEFDG 277 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 +QD+D+ LRM + + + + I+ + + K + +R + + +R+ Sbjct: 278 SQDHDMVLRMTEKAKKIVHIPKVLYYWRIHENSVSMNLEVKSY-AVDSAFRAVQAQLERS 336 Query: 239 SKKYQLF------TLYQIRNKRMTWRTLLTLLS 265 +K ++ TLY+IR + + +L Sbjct: 337 GEKGKISNTKSFQTLYRIRYEMAKSPFISIVLH 369 Score = 50.5 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 83/259 (32%), Gaps = 46/259 (17%) Query: 1 MKNNPLISIYM-PTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALN 58 M +P ISI + + ++ R I+ ++ Q Y +E++ + Sbjct: 358 MAKSPFISIVLHGNF-SEKRVKRVIEKIVEQTDYPRYEVVYC-------TEKPLSDAEYK 409 Query: 59 DPRITYIHNDINSGACAVR--NQAIMLAQGEYITGIDDDDEWTPNR--LSVFLAH-KQQL 113 + ++ D + NQAI A GE+I +D D + N+ L + + ++ Sbjct: 410 GVPLIHVKIDRTIEHNNAQKWNQAISEASGEHIVLLDTD-VYPINKYWLQEMVMYSMKKD 468 Query: 114 VTHA---FLYANDYVCQGEV---YSQPASLPLYPKSPYSRRLFYKRNIIG---------- 157 V LY +D + + P L R + Y+ ++ Sbjct: 469 VAVVGPKILYEDDRIAYAGIALKDDYPDKLYYLCGHDSIREIGYEAMLLHVRRTGIVTSA 528 Query: 158 NQVFTWAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKV----------EEATQILHI 207 +F FD ++ +D D LR + + + E Sbjct: 529 CMMFEKEIWHDLGGFDAQMLEYEDADFCLR-AMNNIKGVNIWTCFAKLCSKRENLY---Q 584 Query: 208 NHGEMQITSSPKKFSGYFH 226 I++ KK+ Sbjct: 585 TKSSNAISAFVKKYQRIIE 603 >UniRef50_C7RJP4 Glycosyl transferase family 2 n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RJP4_9PROT Length = 312 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 52/279 (18%), Positives = 105/279 (37%), Gaps = 21/279 (7%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P++S+ +P +N + S+L Q+ + E+I+++D ST + +P + Sbjct: 2 PVVSVVIPAYNCALYIGETLASILAQEAAELEIIVINDGSTDETGAIA--RSFGEP-VRV 58 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I NSG CA RN I A+GE+I +D DD W PN+L+ LA ++ Sbjct: 59 IDQP-NSGVCAARNHGIREARGEFIALVDHDDYWLPNKLANQLAAFADNPQVDVVFTAFV 117 Query: 125 VCQGEVYSQ-------------PASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-C 170 + E P + ++ ++ + + + Sbjct: 118 WWRQESEGGRFPQPASFAAQAAPQGIDSDFSGWIYHQMLLDSWVLTSTALARSAVVRASG 177 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 FD L ++D+D +LR+ + K+ EAT + + + + + Sbjct: 178 GFDETLPFSEDWDFWLRVA-RTSQFLKLREATTLYRQHPHQGSR--VTRALDYRTRLLEQ 234 Query: 231 HKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNG 269 ++ +S+ + R + + L ++ G Sbjct: 235 ASRQWGLSSQDGRCVAPGVFRRQLAQYSASFGLGHLQGG 273 >UniRef50_D1YZY2 Putative glycosyltransferase n=1 Tax=Methanocella paludicola SANAE RepID=D1YZY2_METPS Length = 344 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 84/219 (38%), Gaps = 5/219 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ + T+NR+ + I S+ Q + ++E+IIVDD ST I Y Sbjct: 6 PKVSVIVATYNRENVISDTIDSIFNQTFQDYEIIIVDDGSTDNTNAILKKYGNK---INY 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 ++ + RN I LA+G+YI +D DD W P +L + + L Sbjct: 63 VNIYHSGSPSYARNTGIKLAKGKYIAILDSDDLWFPEKLEKQVNFMENHSDVGVLGTFFE 122 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDYD 183 + ++ L + L + I + V + K +D + +DYD Sbjct: 123 TVNDDNGNKKIYLLPTNPNILKWCLIFSNCIAHSSVIIRSEIIKMLGYYDKSVDFCEDYD 182 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS 222 +++R + E HI+ + ++ Sbjct: 183 LWIR-ASRITNIANLPEVLLSYHIHDSSISSVYFEEQEK 220 >UniRef50_O34234 Sugar transferase n=4 Tax=Vibrio cholerae RepID=O34234_VIBCH Length = 337 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 48/233 (20%), Positives = 84/233 (36%), Gaps = 5/233 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 ++P IS+ M +N ++ AI S+L+Q +S++E II++D ST D RI Sbjct: 2 SSPKISVVMSVYNGEKYLGEAIDSILKQTFSDFEFIIINDGSTDKTLEIIKSYMKKDDRI 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + + N G N+ + LA+G+YI +D DD +R + Sbjct: 62 VLVSRE-NKGLIVSLNEGLDLAKGQYIARMDADDISIKSRFEKQIEFLDSNPDIGVCGTW 120 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECL--FDTELKAAQ 180 V GE +L + V + +D K A+ Sbjct: 121 VEVF-GENIKSKKWKMPTQDPDLKAKLTFSVPFAHPTVMIRKKVIDKYKINYDLGYKDAE 179 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 DY ++ +Y V E + + + K+ F K ++ Sbjct: 180 DYKFWVDFS-KYTLFSNVPEILLRYRYHQESISRVADNKENKERFEIISKIQN 231 >UniRef50_A4BPR7 Glycosyl transferase, group 2 family protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BPR7_9GAMM Length = 348 Score = 226 bits (578), Expect = 4e-58, Method: Composition-based stats. Identities = 48/251 (19%), Positives = 94/251 (37%), Gaps = 4/251 (1%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 + P +++ +P NR AI SVL Q ++++E++I+DD S+ V + DPR+ Sbjct: 5 STPTVTVLIPVHNRVGYIRDAIDSVLAQTFADFELLIIDDASSDGSADA--VASYRDPRL 62 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + N+ N G RN+ I A+GEY+ +D DD P RL+ +A + HA + + Sbjct: 63 RLLRNETNLGIPGSRNRGIDEARGEYLAFLDSDDRARPERLARQVAFLRDHPDHAAVGSW 122 Query: 123 DYVCQGEVYSQPA-SLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + ++ + + + +K D Sbjct: 123 IEWMSQNGKPLGRIKRKAVAAEQIAAERLFRSCLENSAATARTAILRRYRHRERIKLGSD 182 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 YD++ R+ ++ + + G S + + + + Sbjct: 183 YDLWARIAADHKLAA-LPHVLTYRRQHGGRSTHGRSEEIKAWRLQIFAEQLGALGLPYSA 241 Query: 242 YQLFTLYQIRN 252 L Y++R Sbjct: 242 DDLERHYELRR 252 >UniRef50_Q2SD88 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SD88_HAHCH Length = 343 Score = 226 bits (578), Expect = 5e-58, Method: Composition-based stats. Identities = 62/289 (21%), Positives = 116/289 (40%), Gaps = 36/289 (12%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 PL+S+ MP +NR++ +AI+SVL Q ++++E+I+VDD ST D R+ Sbjct: 11 EAPLVSVIMPVYNREKTVAQAIESVLAQTFADFELIVVDDGSTDRSAEIVNGFT-QDARV 69 Query: 63 TYIHNDINSG-ACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLV----THA 117 Y NSG RN + LA+G+++ +D DD WTP +L +A +Q + Sbjct: 70 RY-RLQENSGRPSLARNSGLKLARGQWVAFLDSDDRWTPAKLERQIALLEQCAAQGVSLD 128 Query: 118 FLYANDYVCQGEVYSQPASLPLY-------------PKSPYSRRL------FYKRNIIGN 158 + ++ V + +V P+ Y ++ R Y Sbjct: 129 LVISDYEVMENDVLKHPSFFKAYNVNQRLAAAVEHTFPDGWTYRQRPFLQALYGLGFAAT 188 Query: 159 Q--VFTWAWRFKECLFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITS 216 Q + + + FD+EL A+D D+++ + + + + + Sbjct: 189 QAVLVRRSLLERVGGFDSELVFAEDNDLWMTISEQG-RVGCSKGIAYTYVHHGDNITSVK 247 Query: 217 SPKKFSGYFHFYRKHKDK-------FDRASKKYQLFTLYQIRNKRMTWR 258 S + ++ + KH + ++Y + L RN+ R Sbjct: 248 SDRFYTDTINVLFKHLATARKLGVPMEPLKERYANYYLSLCRNRLREQR 296 >UniRef50_Q6QW83 Putative glycosyl transferase n=1 Tax=Azospirillum brasilense RepID=Q6QW83_AZOBR Length = 296 Score = 226 bits (578), Expect = 5e-58, Method: Composition-based stats. Identities = 55/216 (25%), Positives = 90/216 (41%), Gaps = 6/216 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+S+ +P +NR RAI+SV QD S+WE+++VDD ST + D Sbjct: 8 MEE-PLVSVVIPAFNRAHTLRRAIESVRMQDVSSWELLVVDDGSTDGS--VDIPESFGDE 64 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI I + +N GA A RN + A+G YI +D DDEW P +L + + Sbjct: 65 RIRLIRHAVNRGAAAARNTGVAAARGCYIAFLDSDDEWLPGKLRAQIDALESGPDAPQAL 124 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI-IGNQVFTWAWRF-KECLFDTELKA 178 ++ E + P+ + L + G+ + F D L Sbjct: 125 CTGFLLHREETGRRTKRWPKPEGTWFETLLDGCYVSPGSTLMVRRDCFAAVGPLDETLPR 184 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQI 214 +D+D LR + + + ++H+ Sbjct: 185 LEDWDWLLRCMERFA-FDCLPVLGAVVHVGRPARSA 219 >UniRef50_Q8YSL1 Alr3073 protein n=2 Tax=Nostocaceae RepID=Q8YSL1_ANASP Length = 324 Score = 226 bits (578), Expect = 5e-58, Method: Composition-based stats. Identities = 60/281 (21%), Positives = 110/281 (39%), Gaps = 11/281 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ +P +N ++ + I SVL Q +S+ E+I+++D ST + + ++D R+ Sbjct: 2 PKISVIIPAYNAERTILETINSVLNQTFSDLEIIVINDGSTDRT--VEVLQNVDDARLKV 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + NS A RN I A G++I+ +D DD WTP++L + L+ Y+ Y Sbjct: 60 YSYE-NSRASGARNHGISHAVGDFISFLDADDLWTPDKLELQLSALNNHPEAGVAYSWTY 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--GNQVFTWAWRFKECLFDTELKAAQDY 182 + PLY + Y+ L + N + A FDT L++ +D+ Sbjct: 119 TIDDKGELLKPFEPLYEGNVYT-DLLLANFLTNGSNPLIRKAAIASIGEFDTTLRSGEDW 177 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHK----DKFDRA 238 D +LR+ + V++ + + + + + K + Sbjct: 178 DYWLRLAYK-WPFVVVKQHQILYRRSVTSKSFKLQIIREASLAILDKAMKVLPLELQYLK 236 Query: 239 SKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIRGR 279 Y + + T+ L IR R Sbjct: 237 KHSLSNIYRYNVELYLDSINNNSTVDIKYVIGNLLSYIRSR 277 >UniRef50_B6WY68 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WY68_9DELT Length = 349 Score = 226 bits (578), Expect = 5e-58, Method: Composition-based stats. Identities = 48/227 (21%), Positives = 91/227 (40%), Gaps = 6/227 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N P IS+ +P +N Q +A+ S+L+Q ++N+E+I++DD S+ + A D RI Sbjct: 2 NTPHISVLLPVYNNVQYLKKAVLSILQQSFTNFEIILIDDGSSDGTEHLAQQLAEQDARI 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + N G A N+ I L Q +I +D DD PNRLS+ A+ + + Sbjct: 62 RLVQRP-NKGLIASLNEGINLVQSPFIARMDADDFSFPNRLSIQYAYMTDHPDVVAVGSY 120 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRN-IIGNQVFTW-AWRFKECLFDTELKAAQ 180 + + ++ F ++ V K + E +A+ Sbjct: 121 IKFMDNQD--RMYRKKIFSSGQKLLDDFRWGCPLVHPAVMMRTDAVQKIGGYSPEFPSAE 178 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF 227 DY ++LRM+ + + ++ + + ++ Sbjct: 179 DYALWLRMLSCG-RMDTIPQVLLAYRVHGQSISHVHARQQRDSILRA 224 >UniRef50_B1ZSZ7 Glycosyl transferase family 2 n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZSZ7_OPITP Length = 373 Score = 226 bits (578), Expect = 5e-58, Method: Composition-based stats. Identities = 51/223 (22%), Positives = 88/223 (39%), Gaps = 4/223 (1%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + P +S+ MP +N + AI+SVL Q + ++E II+DD ST + A +DPR Sbjct: 22 TDKPALSVIMPAYNAGRFLAPAIESVLSQTWRDFEFIIIDDGSTDGTREAIERYAESDPR 81 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I N N G N+AI LA+ E+I +D DD P+R +A + + Sbjct: 82 IVSRPNPRNLGVTPTLNRAIALARAEWIARMDADDLSLPHRFERQMALVRAHPEVGLVTC 141 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNII--GNQVFTWAWRFKE-CLFDTELKA 178 + E P + + N + QV A +E + Sbjct: 142 PFNIIDAEDRQLPGWRGICFQQELLPFFLLFYNRLNAHGQVLYSARLVRELGGYRERYLR 201 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKF 221 ++ ++++RMV + E + +P ++ Sbjct: 202 SEATELWIRMVRAGSWAV-IPEPLYAWRAANPNSVTKQNPFRY 243 >UniRef50_C6E5U5 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M21 RepID=C6E5U5_GEOSM Length = 327 Score = 226 bits (577), Expect = 6e-58, Method: Composition-based stats. Identities = 51/245 (20%), Positives = 95/245 (38%), Gaps = 8/245 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S +P +N+ Q A++S+L Q + ++E+++VDD ST V + DPRI Sbjct: 6 PSVSWVVPVFNQAQYLAGALESMLAQTFEDFELVVVDDGSTDQSA--AIVRSFKDPRIVL 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N N G N+ I A G+YI +D DD TP+RL+ +A + A ++ Sbjct: 64 LENSCNQGVSRSLNRGIRAALGKYIARMDGDDLSTPDRLARQVAFMEDNPAIAVCGSHVE 123 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAAQDY 182 E + L V ++ +D + AAQDY Sbjct: 124 TFGAESRLVKR---PVGSAAIKCFLLAGPPFTHPSVMLRRSVLEQHRLFYDEGMGAAQDY 180 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D++ R++ + V+ ++ ++ ++ + + Sbjct: 181 DLWFRLL-QVAPGGNVDAVLLRHRLHEEQVSKDRWREQEANAAKVRGEVLSLLGVVFDAE 239 Query: 243 QLFTL 247 +L Sbjct: 240 ELERH 244 >UniRef50_B3PFA2 Glycosyl transferase, putative, gt2L n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PFA2_CELJU Length = 306 Score = 226 bits (577), Expect = 7e-58, Method: Composition-based stats. Identities = 57/281 (20%), Positives = 97/281 (34%), Gaps = 19/281 (6%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +S+ +P +N ++ A+ SVL Q N+E+IIVDD S + + ND R+ Sbjct: 4 KVSVIIPAYNAEKFLAFAVNSVLSQTMENFELIIVDDGSVDNTREIALKFSENDMRVKCY 63 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA-NDY 124 + N G RN A GE++ +D DD W N+L L Q +Y + Sbjct: 64 SIE-NRGRAGARNFGCRQADGEWLAFLDADDCWADNKLERQLESVQD--EVGLIYTERTW 120 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT--WAWRFKECLFDT--ELKAAQ 180 V + + Y N I FD K Q Sbjct: 121 VDENGDILENQPEKYELPQGYIYEKLIDGNYICTSSVILKRDLFLTVGGFDESPNYKNCQ 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+++R+ + + E ++ ++ K F + R D+ + Sbjct: 181 DYDLWIRISPLA-KFVSLRETLCFYRLHDD-----NAHKNFYSRYIGLRSCMDRLREVGQ 234 Query: 241 KYQLF-----TLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 Y L +R ++ L + + D + Sbjct: 235 NYNLINDGFLNRIDLREAKICESFSKVLFKSKRYDIVVDAL 275 >UniRef50_Q26CE6 Glycosyl transferase n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26CE6_9BACT Length = 250 Score = 226 bits (577), Expect = 7e-58, Method: Composition-based stats. Identities = 55/252 (21%), Positives = 90/252 (35%), Gaps = 4/252 (1%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 +SI P +N + VL Q +WE I+VDD S + A D RI Sbjct: 3 KVSIITPVYNASRFLEETAGCVLAQTLQDWEWILVDDQSKDNSREIMERLAKKDDRIKIF 62 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N G+ RN+AI AQG+YI +D DD W+P++L + + A Y Sbjct: 63 FLAKNQGSGPARNKAIQEAQGKYIAFLDSDDFWSPDKLKKHVDFMVEHN--AVFSHTSYG 120 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 E P + + K+ I + F +L+ QDY ++ Sbjct: 121 FTNEQGEVIRETYHVSDVPVTYSMLLKKTEISCLTAMYDQERIGKYFMPDLRRKQDYALW 180 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQLF 245 L ++ + ++E + G + K + F + +K Y F Sbjct: 181 LSILKDGHHSIPLDEELAMYRQVSGS-ATNNKFKLIIKHLKFLMEV-EKLSFFKSLYYSF 238 Query: 246 TLYQIRNKRMTW 257 K+ + Sbjct: 239 HWGINGLKKYYF 250 >UniRef50_A3W1Y5 Glycosyltransferase n=1 Tax=Roseovarius sp. 217 RepID=A3W1Y5_9RHOB Length = 300 Score = 226 bits (577), Expect = 7e-58, Method: Composition-based stats. Identities = 59/243 (24%), Positives = 100/243 (41%), Gaps = 14/243 (5%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 +PL+S+ +P +NR+ A+ SVL QDY E+I VDD ST + +++DPR+ Sbjct: 5 SPLVSVIIPCFNRETTVHEAVLSVLTQDYDPIELIAVDDNSTDGT--IAVLESIDDPRLR 62 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 +HN + G RN I + +I D DD W P +L + + Q Y Sbjct: 63 ILHNSGDRGPSQARNHGIRSSTAPWIAFQDSDDIWLPGKLKRQMDYLQD-SDFVAAYCGM 121 Query: 124 YVCQGE------VYSQPASLPLYPKSPYSRRLFYKRNIIGN--QVFTWAWRFKECLFDTE 175 + + Y +PA + L + N I V + FD Sbjct: 122 LIKKDTHPETPVKYRRPAPETSPLEGDILPSLALE-NYISTQMLVVRHDILDQIGGFDEA 180 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 L A D+++ LR+ +++ + ++ + +S K+ + H +KH D Sbjct: 181 LPALVDWELMLRVASLG-PVAFIDDDLVVQRMSENSLTHSSK-KRLAAQEHILKKHYDLL 238 Query: 236 DRA 238 R Sbjct: 239 GRY 241 >UniRef50_B7K4C4 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7K4C4_CYAP8 Length = 306 Score = 226 bits (576), Expect = 8e-58, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 89/229 (38%), Gaps = 5/229 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ M +N ++ +AI S+L Q ++N+E+II++D ST D RI Sbjct: 9 PDVSVLMTVYNSEKYIAKAITSILSQTFNNFELIIINDGSTDRSLSILQNFTQQDQRIRL 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 N N+G N+ ++ +G+++ +D DD P RL+ +A +Q + + + Sbjct: 69 F-NQENTGPAIASNKGLIHVKGKFVARMDADDIAFPTRLAKQVAFLEQNPEYVAVGSRVL 127 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFY--KRNIIGNQVFTWAW-RFKECLFDTELKAAQD 181 + + +S I V K + ++K++ D Sbjct: 128 LIDPDGWSIGPFAQKTDHEDIDAEHLAGSGGAICHPAVMIRTETLDKIGGYTEQMKSSID 187 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 D+FLR+ + + E ++ M + + + Sbjct: 188 RDLFLRLAEVG-KLANLPETLLKYRMHLKSMGNANRVLQRQMAAMAVKA 235 >UniRef50_D1W2F4 Glycosyltransferase, group 2 family protein n=2 Tax=Prevotella RepID=D1W2F4_9BACT Length = 251 Score = 226 bits (576), Expect = 8e-58, Method: Composition-based stats. Identities = 59/256 (23%), Positives = 100/256 (39%), Gaps = 8/256 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPR 61 N L+S+ +P +N + S+L+Q Y N+E++I DD ST ++ D R Sbjct: 2 NTDLVSVIIPCYNSGIYIKECVDSILQQTYQNFEILITDDGSTDISTIKLLQQITQCDKR 61 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I D NSG A RN +I A+G YI D DD+W PN+L+ + A +Y+ Sbjct: 62 INVFFLDGNSGPAAARNNSIKHAKGRYIAFCDSDDKWLPNKLTDQIPLFD-NKEVAIVYS 120 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQD 181 + + ++ + + K N IGN + K + +D Sbjct: 121 DYEKMDAQGKRNN--RIVHAPQSLTYAKYLKGNTIGNLTGIYDT-QKVGKVMQKNIHHED 177 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 Y ++L ++ T + I+ + + K + ++ YR H + S Sbjct: 178 YVLWLDILRHGWTALNTNTVTAVYRIHSNSITA-NKLKLCTWQWYIYR-HHEHLGLISSM 235 Query: 242 YQLFTLYQIRNKRMTW 257 Y Y R Sbjct: 236 YYYLH-YAYRAFVKFL 250 >UniRef50_A6TBF1 Putative glucuronosyltransferase n=1 Tax=Klebsiella pneumoniae subsp. pneumoniae MGH 78578 RepID=A6TBF1_KLEP7 Length = 291 Score = 226 bits (576), Expect = 8e-58, Method: Composition-based stats. Identities = 81/265 (30%), Positives = 125/265 (47%), Gaps = 8/265 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQ--YVTALN 58 M PL+S+Y+PT NR +L A+KSV+ Q Y N E+II +D S Sbjct: 1 MNKKPLVSVYIPTHNRAELLSNAVKSVINQTYKNIEIIICNDGSGDNTDDVVNKLQKEYK 60 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D Y+ ND GAC RN+ I A G Y+TG+DDDD + PNR+ + + Sbjct: 61 DIPFVYLKNDSPMGACFSRNRCIAAASGYYLTGLDDDDYFLPNRIESLIKVCNERK-IDL 119 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELK 177 + +N G + + + N++GNQ+ T F+ FDT Sbjct: 120 VCSNLIFKDGNKLKRGKN----NSGVITSIDMGYENLVGNQLLTRLSYFEDVGGFDTNFP 175 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 A QDYD++ R+++++G K EAT ++ + + +I++S K GY F KH + Sbjct: 176 AWQDYDLWYRIIMKFGPCIKTNEATYVMDVENDRKRISTSSKAHIGYRSFIEKHSETLSE 235 Query: 238 ASKKYQLFTLYQIRNKRMTWRTLLT 262 K RNK++ +TLL+ Sbjct: 236 EMKSSLFIRDLINRNKKIPLKTLLS 260 >UniRef50_A8FHL6 Glycosyltransferase n=5 Tax=Firmicutes RepID=A8FHL6_BACP2 Length = 281 Score = 226 bits (576), Expect = 8e-58, Method: Composition-based stats. Identities = 49/232 (21%), Positives = 89/232 (38%), Gaps = 9/232 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP-RI 62 P +SI M +N Q +I+S+L+Q Y NWE II DD ST + DP RI Sbjct: 5 KPKVSIIMGVYNCQDTVEESIESILQQTYDNWEFIICDDASTDGTYEKVLSYTKRDPERI 64 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 I N+ N A N+ + A+G+ I D DD P+RL + + + + Sbjct: 65 RLIRNEHNQRLAASLNRCLAEARGDLIARQDGDDISVPDRLEKQVHFLETHPEYDVVGTA 124 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN-QVFTWAWRFKEC-LFD--TELKA 178 V +L P R++ + + + A +K+ + + Sbjct: 125 MTVFDESGTKGVRALISAPD----RKVLARGTPFCHGTIMMRAAAYKDLNGYRSVKTTRR 180 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 +D D+++R + + ++E ++ + Q + + Sbjct: 181 MEDIDLWIRFFAAGRKGFNLQEPLYLVREDEAAFQRRKFQYSIDNAWLVLKA 232 >UniRef50_Q10VK0 Glycosyl transferase, family 2 n=3 Tax=Oscillatoriales RepID=Q10VK0_TRIEI Length = 1035 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 52/234 (22%), Positives = 90/234 (38%), Gaps = 11/234 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NP +S+ +P +N + +AI+SVL Q Y+++E+I+++D ST I Sbjct: 2 NPKVSVIIPVYNCELYIAQAIESVLNQTYTDYEIIVINDGSTDNTHQVLQPYMKK---IR 58 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 Y + + N G A RNQ I +A+GE I +D DD + +L +A + + Sbjct: 59 YFYQE-NKGLSATRNQGIKMAKGELIALLDADDLFLCYKLQEQVAIFDAQPNIGLVQSGW 117 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ--VFTWAWRFKECLFDTELKAAQD 181 V + P Y + K +F W K F+ L+ +D Sbjct: 118 RVVNEKGEKIEDIEPWYKSPKLDLVSWLKWKATNPSAMMFRKEWLEKVNGFNENLRRLED 177 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 +DI +R+ + + + + G M K D+ Sbjct: 178 FDIVIRLALASCQATWFPKVAVCYRQHSGNMTRN-----LLAQTEVEEKILDEL 226 Score = 220 bits (562), Expect = 4e-56, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 91/221 (41%), Gaps = 5/221 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ + T+N + AI S+ Q Y+++E+I++DD ST + I Y Sbjct: 716 PRVSVIIATYNNAHYILEAIASIFNQTYTSYEIIVIDDGSTDNTRQVLEPYLDK---ICY 772 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 ++ + N G RN + +AQGE+I+ +D DD + P++L+ +A + +++ Sbjct: 773 VYQE-NKGVSHARNLGLEIAQGEFISFLDADDFFLPDKLAKQVAVFDAHPSLGIVHSGWR 831 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYD 183 + + + + + + K FDT +D D Sbjct: 832 LVNKKGEKISDIELWHSSPELDLETWVVWKPVTISMMFSKSWIKSVGGFDTRWHHGEDID 891 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGY 224 + LR+ V+ E + + T +H S+ + S Sbjct: 892 LVLRLSVDGCEAMWLPKVTYCYRQHHCNATRKSTQQAASMM 932 Score = 168 bits (427), Expect = 2e-40, Method: Composition-based stats. Identities = 51/301 (16%), Positives = 95/301 (31%), Gaps = 69/301 (22%) Query: 1 MKNN-PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVT---- 55 M N P++S+ +PT+N ++AI+SV Q Y+ WE+II+DD ST Sbjct: 320 MPNGLPVVSVVIPTFNNADYILKAIESVCNQTYTLWEIIIIDDGSTDNTYQVLESYLNTT 379 Query: 56 ------------------------------------ALNDPRITYIHNDINSGACA---- 75 A + + N G+ Sbjct: 380 DFLQNSAARNRQQKTKINTQEKENNQDLILDLFQNSAKGNRQQKIEKNTQEKGSSQDLTL 439 Query: 76 --VR---------NQAIMLAQ--------GEYITGIDDDDEWTPNRLSVFLAHKQQLVTH 116 R NQ +A+ GEYI +D DD + P +L+ + + T Sbjct: 440 TFARCLIKYLYQENQGPSIARNYGIEIAQGEYIAFLDADDFFLPEKLAEQVGCFAEDSTL 499 Query: 117 AFLYANDYVCQGEVYSQPASLPLYPKSPYSRRL-FYKRNIIGNQVFTWAWRFKE-CLFDT 174 + + + P + + + + + + + K FD Sbjct: 500 TMVQTGWRFVDEKGNTIKDVEPWENSTELNLENWLIWQATLPSAMMFKSKWLKVNGGFDK 559 Query: 175 ELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 +D +I LR+ + G+ + + + + + +K + K D Sbjct: 560 RYFGIEDLEIVLRLALTGGKATWLRKVCVCYRQRNSSVSGLKNREKITQE---LEKLLDD 616 Query: 235 F 235 F Sbjct: 617 F 617 >UniRef50_A6LGW8 Glycosyltransferase family 2 n=3 Tax=Bacteroidales RepID=A6LGW8_PARD8 Length = 259 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 53/229 (23%), Positives = 93/229 (40%), Gaps = 5/229 (2%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 L+SI MP++N + + I S++ Q Y+NWE++I DDCST A D RI Sbjct: 7 LVSIIMPSYNSSKYIAKTIDSIVSQFYTNWELLITDDCSTDNTCEIIKEYAAYDQRIKLF 66 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 + N GA RN++I A+G YI D DD W P +L V L + Y++ Sbjct: 67 VMEENKGAGVARNKSIEEAKGRYIAFCDSDDRWKPEKLEVQLRFMVEN-RVEICYSSYLK 125 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYDIF 185 C + S S + + + IG + ++ L+ QD+ Sbjct: 126 CNENDKVFGIVIAP---SKISYKKMTRNDYIGFLTCIYDTHQIGKIYMPTLRKRQDWAWK 182 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 + ++ + + +++ + G + + K Y++ Sbjct: 183 ILLMKKCPIAYGIKDTLAYYRVREGSLS-NNKMKLIRYNVTVYKQILKY 230 >UniRef50_B8IAC9 Glycosyl transferase family 2 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IAC9_METNO Length = 450 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 52/229 (22%), Positives = 92/229 (40%), Gaps = 5/229 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M N L++I +P +N +AI+S L Q + + E+++VDD ST + + + D Sbjct: 1 MPNPNLVTIIIPVYNGGYFLSQAIESALAQTWPHVEVLVVDDGSTDDGETARTCRSFGD- 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI YI N G + N+ I A+G YI+ + DD + P ++ + + ++ Sbjct: 60 RIRYIW-KENGGVASALNRGIAGARGRYISWLSHDDLYDPRKVEIQMGALLAEPDTVIVF 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPY-SRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 + V +G+ S S + I G V F + FD L Sbjct: 119 GDYAVIRGDGSLLTEVRVGEGYSDRASLWSTLEGRINGCAVIVPRECFARHGTFDEGLPT 178 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHF 227 QDY+++ R + V ++ G+ TS + + Sbjct: 179 TQDYELWFRFAQHH-RFVHVPGYLVRHRVHEGQGSRTSRHLEEASLLWM 226 >UniRef50_A1ANW6 Glycosyl transferase, family 2 n=2 Tax=Desulfuromonadales RepID=A1ANW6_PELPD Length = 261 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 60/266 (22%), Positives = 106/266 (39%), Gaps = 12/266 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M P +S+ +P +N+ Q AI SVL Q +S+ E+I+VDD S+ ++ + ALN Sbjct: 1 MMPMPEVSVIIPCYNQGQYLHEAIDSVLGQTFSDLEIIVVDDGSSDP-ATREVLDALNRT 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 + N G A RN I A G YI +D DD P+ + LA + +Y Sbjct: 60 ATRLLRR-KNGGLAAARNSGINAAHGRYILPLDCDDRIAPDYIRQALAAFESDPLCGIVY 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTEL-KAA 179 E + LP + + R+ I + ++ + + +D L + Sbjct: 119 CRAEKFGAEQGT--WRLPPFSR----LRMGLGNVIFCSALYQKSDWQRVGGYDENLQRGW 172 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSP-KKFSGYFHFYRKHKDKFDRA 238 +D+D +L ++ + + M + P K + + KH D F Sbjct: 173 EDWDFWLALLELGLTVHCLPYVGFHYRKSESSMASSMDPVLKANLHRQLMHKHPDFFGLL 232 Query: 239 SK--KYQLFTLYQIRNKRMTWRTLLT 262 S+ + L Y++ + R + Sbjct: 233 SRTPQPLLEIYYRMAGSGIYRRIKRS 258 >UniRef50_C6EQF5 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni RepID=C6EQF5_CAMJE Length = 666 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 60/274 (21%), Positives = 111/274 (40%), Gaps = 10/274 (3%) Query: 1 MKN--NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN 58 M+N PLIS+ MPT+NR++ AI+S+L Q ++++E II+DDCS A Sbjct: 1 MENLEYPLISVIMPTFNREKYIKFAIESILNQTFTSFEFIIIDDCSNDATYDIIREYAKI 60 Query: 59 DPRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAF 118 D RI + N N G N+ + L++G+YI +DDDD P RL ++ + Sbjct: 61 DQRIIVLRNKANKGIVFSLNKGLQLSRGKYIARMDDDDISMPTRLEKQFQIMEKDDSIIV 120 Query: 119 LYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTEL 176 S + + +Y ++ ++ I V FK+ + E Sbjct: 121 AGTYFKYIGRNPDSFASWVNVYEHDMVKIKMMFECPICHPSVIMRKNFFKKYNLKYSQEY 180 Query: 177 KAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFS--GYFHFYRKHKDK 234 + A+DY ++L ++ G+ + E + + + K ++ Sbjct: 181 QYAEDYKLWLDVLQNGGKIINIPEILLLYRATNLSISRNKKTSNLQQIRSRSILFKCREL 240 Query: 235 FDRASKKYQLFTL----YQIRNKRMTWRTLLTLL 264 F K L ++ K+ ++ + + Sbjct: 241 FLDTKSKNLLRDFCNIPMELSKKQEIYKKIQRIQ 274 >UniRef50_Q8KI14 Similar to Glycosyl transferase n=1 Tax=Pseudomonas aeruginosa RepID=Q8KI14_PSEAE Length = 264 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 65/263 (24%), Positives = 109/263 (41%), Gaps = 8/263 (3%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 N L+SI P++N ++L R I+SVL Q + NWEMIIVDDCS + A D RI Sbjct: 5 NEGLVSIITPSYNAEKLIGRTIQSVLDQTFDNWEMIIVDDCSKDSTRSVVAAYAEKDSRI 64 Query: 63 TYIHNDINSGACAV-RNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + + N+GA A RN + A G++I +D DD W P +L + L + F Sbjct: 65 RLVGLEKNNGAPAAPRNIGVQHASGDWIAFLDADDIWHPRKLELQLEAIKS-TRARFSCT 123 Query: 122 N--DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA- 178 D+ E+ + +S++ K I + V E F+ +++ Sbjct: 124 QMVDFFDDKEIVFESVKTVPQQIVDFSQQRI-KGRIPTSSVLLDKELIMEFPFNVDMRYK 182 Query: 179 -AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 +DY +LR++ + K+ G++ + F +R++ Sbjct: 183 AVEDYHCWLRILSSETKCLKLNAPLLYYRRVVGQISGS-KKYMLERMFMLHREYPGGSIL 241 Query: 238 ASKKYQLFTLYQIRNKRMTWRTL 260 + Y L R+ + L Sbjct: 242 KATLYTLTHALGGFYYRILRKGL 264 >UniRef50_Q60B59 Glycosyl transferase, group 2 family protein n=1 Tax=Methylococcus capsulatus RepID=Q60B59_METCA Length = 341 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 52/254 (20%), Positives = 91/254 (35%), Gaps = 10/254 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P I++ +P +N +Q A+ S+L Q Y N+ ++ VDD ST + + ++ DPR++ Sbjct: 8 PRITVLLPVYNGEQYLREALDSLLTQTYRNFSLLAVDDGSTDSS--LEILRSIKDPRLSV 65 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 N N G N+++ L Q EY+ +D DD TP RL LA Sbjct: 66 KVNPRNRGLINTLNESLDLIQSEYVARMDADDIATPYRLEEQLAFMDAHPDVGLCGGYYE 125 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK--ECLFDTELKAAQDY 182 P S L + + N V + +D A+DY Sbjct: 126 RFTD--TESIVVRPPTTHEDISYALIFDNVMAHNTVMFRRSVLERYRLRYDPAFAYAEDY 183 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSG---YFHFYRKHKDKFDRAS 239 ++++R Y + + ++ H +R + R Sbjct: 184 ELWVRFA-RYSRLANLPRVLVRYRFHPANTSSRFRHEQQIAAAKIRHIHRTNLGLEPRPG 242 Query: 240 KKYQLFTLYQIRNK 253 L+ +R + Sbjct: 243 DAALHRQLFDMRFE 256 >UniRef50_Q052L4 Glycosyltransferase n=10 Tax=Leptospira RepID=Q052L4_LEPBL Length = 281 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 58/258 (22%), Positives = 102/258 (39%), Gaps = 9/258 (3%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 NPL+SI +P +N + ++S+LRQ Y WE+IIVDD S + N T Sbjct: 2 NPLVSIVIPCYNYGRYIHETVESILRQRYKYWEVIIVDDGSNDPFTISILEEYKNKSGFT 61 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 + SG RN I AQGE+I +D DD + L ++ ++ + +Y Sbjct: 62 VLSIPR-SGPSTARNIGIDTAQGEFILPLDSDDMIHEDYLLEAISAYEKNPSLGIVYCEA 120 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKA-AQDY 182 +LP Y + I + VF + + F+ +K +DY Sbjct: 121 EFF--GSIKGKWNLPEYR----FPNILLDNCIFVSAVFKKSDWKEVGGFNENMKNEWEDY 174 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 D +L ++ + G+ +++ + I H S Y Y HK+ + + ++ Sbjct: 175 DFWLSLIEKGGKVYRIPKVLFYYRIGHSSRSQRSLKHFLPLYMQLYENHKELY-LKNIQF 233 Query: 243 QLFTLYQIRNKRMTWRTL 260 + + + L Sbjct: 234 LFQRHLEAKRLEDEFLIL 251 >UniRef50_B9M558 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. FRC-32 RepID=B9M558_GEOSF Length = 302 Score = 225 bits (574), Expect = 1e-57, Method: Composition-based stats. Identities = 47/262 (17%), Positives = 95/262 (36%), Gaps = 10/262 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +SI +P +N ++I S L Q Y N E+I+++D ST + + D RI Y Sbjct: 10 PKVSIIIPVYNGSNYLNQSIDSALAQTYQNIEVIVINDGSTDDLATENIAKSYGD-RIRY 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ--LVTHAFLYAN 122 N G + N + G+Y + + DD + PN++ + ++ ++++ Sbjct: 69 FC-KNNGGVGSALNFGLDKMTGKYFSWLSHDDIYQPNKIERQIELLRKQTNPEGVIVFSD 127 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKAAQD 181 S + + S + + G + F + FD L QD Sbjct: 128 WENIDENSSSIGCRIIDQAEIASSIYVVMNCTVNGCTLLIPKKCFDETASFDENLPTTQD 187 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITS----SPKKFSGYFHFYRKHKDKFDR 237 YD++ +M +Y + + E+ I+ + + R D+ Sbjct: 188 YDLWFKMARKY-QFLHLPESLVRYRIHSQQDSYKHPEHVKEQNILHTNFISRLTVDEILN 246 Query: 238 ASKKYQLFTLYQIRNKRMTWRT 259 +F + + + + Sbjct: 247 LENSLPVFYIKRAIFYKYKFPV 268 >UniRef50_B0JGT1 Glycosyl transferase n=2 Tax=Microcystis aeruginosa RepID=B0JGT1_MICAN Length = 325 Score = 225 bits (574), Expect = 1e-57, Method: Composition-based stats. Identities = 50/240 (20%), Positives = 98/240 (40%), Gaps = 10/240 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + P +S+ +P +N+ + AI +VL Q Y N+E++++DD ST E + + P Sbjct: 1 MSHVPQVSVIIPCYNQGRYLDDAITAVLVQTYQNFEILVIDDGSTEPE-TIEILQDYQQP 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 + I + N G RN I AQG YI +D DD+ + L +A + +Y Sbjct: 60 KTRIIRTE-NQGVATARNLGIAQAQGTYILPLDADDKIADSYLEKAVALLEGNEQLGIVY 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK-AA 179 + E + + P+ + + I + +F + ++T L Sbjct: 119 C-----EAEYFGDRQGIWPLPEYEF-PDILVDNMIFCSGLFRKSDWETVGGYNTNLIYGW 172 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT-SSPKKFSGYFHFYRKHKDKFDRA 238 +D+D +L ++ E +++ E M + ++ Y + H + + Sbjct: 173 EDHDFWLSLIELGREVYRIPEVLFFYRQKADSMSRSMTAEHYIYSYTQLFYNHPQLYSQN 232 >UniRef50_Q39U31 Glycosyl transferase, family 2 n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39U31_GEOMG Length = 303 Score = 225 bits (574), Expect = 1e-57, Method: Composition-based stats. Identities = 63/261 (24%), Positives = 103/261 (39%), Gaps = 18/261 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M +S+ +PT+N A++SVL+Q YSN+E+IIV+D ST N+ Sbjct: 1 MN---KVSVVLPTYNCGTYVGVAVESVLQQTYSNYELIIVNDGSTDETDQALAPYLCNNG 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 I YI N G RN I A G+ I ID DD+W P +L + ++ ++ Sbjct: 58 HIRYIKQ-GNKGHAGARNTGIRAATGDVIAFIDSDDKWLPEKLGEQVQAMEEDPEVGLVH 116 Query: 121 ANDYVCQGEVYSQPAS-------LPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLF 172 N Y Q L Y + F K I V +F Sbjct: 117 CNVYGFGENQEVQVRGPLLTQEQLQGYSGYIFDNLYFRKIIITTTTVMIRKSCIDDVGMF 176 Query: 173 DTELK--AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 D + ++D ++FLR++ +Y + V + + + + G + Y K Sbjct: 177 DENMTRYGSEDRELFLRILWKY-KARYVNKPLAMYRNRSDSE-GQNYERMIKGQEYVYEK 234 Query: 231 HKDK--FDRASKKYQLFTLYQ 249 ++SK+ + +YQ Sbjct: 235 ITSLYGLPQSSKRAVMSKMYQ 255 >UniRef50_B7K6R4 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7K6R4_CYAP7 Length = 374 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. Identities = 49/267 (18%), Positives = 93/267 (34%), Gaps = 12/267 (4%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 + PL+S+ +P +N ++ R ++SVL Q Y N E++++DD S A D R Sbjct: 8 THLPLVSVIIPAYNAEKFIERTLRSVLSQTYQNIEVLVIDDGSQDKTAEIVQTIANQDQR 67 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQ-LVTHAFLY 120 + + NSG A RN I A+GE+I ID DD W P + + + + +Y Sbjct: 68 VILLRQ-ANSGVAAARNLGIQNAKGEFIAPIDADDIWYPENIEKQVQCMLEGGTSVGLVY 126 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNI--IGNQVFTWAWRFKECLFDTELK- 177 + A + L + + + ++ + K Sbjct: 127 SWSVDIDENDQLTGAFRVAEIEGDVYGTLVCHYFLGNASCALIRRTCLEEVSSYNCQFKT 186 Query: 178 ----AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITS--SPKKFSGYFHFYRKH 231 +D+++ LR+ Y + V + G M + + Sbjct: 187 HNLQGCEDWELALRIAENY-QFKAVRDFLVGYRKLPGTMSSNYQVMSRSHGFLLDIVAQR 245 Query: 232 KDKFDRASKKYQLFTLYQIRNKRMTWR 258 + + LY + ++ Sbjct: 246 QPNLPALISRLSKSNLYLYFARLSYFQ 272 >UniRef50_A3XRJ7 TuaG n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XRJ7_9FLAO Length = 256 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. Identities = 57/246 (23%), Positives = 97/246 (39%), Gaps = 6/246 (2%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+SI P++N ++ I SV +Q ++WE++I+DD S+ D RI Sbjct: 10 RPLVSIVTPSYNSEKFIAETIASVQKQTVTDWELLIIDDASSDDTVACVKKLREKDSRIH 69 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I N G RN I A+G+Y+T +D DD W P L + ++ +F++A+ Sbjct: 70 CIPLSENKGPAHARNLGIQKAKGKYLTFLDADDLWFPEFLERSITEAKKH---SFVFASY 126 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYD 183 + QP S K N I + +K QD Sbjct: 127 KRLDENL--QPYLSDFIVPDKVSYTDILKSNAISCLTAFINIEVLGKKYMPLIKKRQDMG 184 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++L+ + + + ++E I I + + + FYR+ + AS Y Sbjct: 185 LWLQYLKKIDYAYGIQEPLAIYRIRRNSLSRNKTG-LIKYQWQFYREVEQLSVLASAYYL 243 Query: 244 LFTLYQ 249 Y+ Sbjct: 244 TCWAYK 249 >UniRef50_Q47DD2 Glycosyl transferase, family 2:Polysaccharide deacetylase n=4 Tax=Proteobacteria RepID=Q47DD2_DECAR Length = 672 Score = 224 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 52/241 (21%), Positives = 91/241 (37%), Gaps = 20/241 (8%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P+IS+ +P +N ++ ++SVL QD+ E+I+VDD S+ P +T Sbjct: 360 PMISVVIPCYNAERWLGATLRSVLAQDWPRLEIIVVDDGSSDRSAELV---RTQFPGVTL 416 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + NSG RN I A+G++I +D DD W P +L Y + Sbjct: 417 V-QQKNSGVAVARNNGIANAKGDWIAFVDADDIWLPGKLHAQWQAFTLQPGGRMAYTAWH 475 Query: 125 VC-------------QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KEC 170 V S ASL P L + + V F + Sbjct: 476 VWPCTNPEPEAALLKDLAQRSTDASLWDGPTGWIYPDLLEDCCVWTSTVLMHRSLFDEIG 535 Query: 171 LFDTELKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRK 230 +FD +L+ +DYD++LR + ++ + + ++ + + + R Sbjct: 536 IFDEKLRIGEDYDLWLR-ASQVTPIVRIPKPLALYRMHPDSITKKAPDTNYQA-LVIVRA 593 Query: 231 H 231 Sbjct: 594 I 594 >UniRef50_C6XYA6 Glycosyl transferase family 2 n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XYA6_PEDHD Length = 338 Score = 224 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 56/249 (22%), Positives = 100/249 (40%), Gaps = 5/249 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P I+++M +N +IKS+L Q ++++E++IV+D ST ND RI Sbjct: 3 PKITVFMAAYNAANYINESIKSILSQTFTDFELLIVNDGSTDETVQLVLE--FNDSRIRL 60 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 +HND N G RN A+ A+GEY+ +D DD PNRL + Q+ +A + Sbjct: 61 VHNDKNRGLVYTRNVALKEARGEYLAILDSDDIALPNRLELQYRFLQENPGYALCGGHGK 120 Query: 125 VCQGEVY--SQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDY 182 V + + L + + + V KE + A+DY Sbjct: 121 VIDKDGKLLDDNRLIVPTGNEHIKMTLLFINTYVNSTVMYKTTVLKELHGYKDFAPAEDY 180 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY 242 ++F+R+ +Y ++E + + + ++ + + +K Sbjct: 181 ELFIRIAEKY-PVDNLDEVLVKYRDHESNTSVLHADTSWTKLYEIKKIQLTNLQIQPEKR 239 Query: 243 QLFTLYQIR 251 LY I Sbjct: 240 LTDALYSIL 248 >UniRef50_A4BIW6 Glycosyltransferase n=1 Tax=Reinekea blandensis MED297 RepID=A4BIW6_9GAMM Length = 254 Score = 224 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 62/257 (24%), Positives = 104/257 (40%), Gaps = 5/257 (1%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M N LI++ P +N + I SVL Q + N E I+VDDCS L D Sbjct: 1 MSNE-LITVITPVFNCEDFLEETIVSVLNQTHKNIEYILVDDCSVDNSSLIYQRFMEKDR 59 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ I NSGA RN+ I LA+GEYI +D DD+W P +L L ++ +F + Sbjct: 60 RVKSIKLLQNSGAAIARNKGIELAKGEYIAFLDSDDKWEPTKLEEQLMFMKENK-ASFSF 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSR-RLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 + S + K ++ L YK+ +G + L+ Sbjct: 119 TSYTSLNISNNSISKVDSKHQKIVFNYFDLLYKKITLGCSTVMLKSNSIGNMQMPNLRTG 178 Query: 180 QDYDIFLRMVV-EYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRA 238 QD+ +L ++ + E +++ I + + + ++ KK + R +++ Sbjct: 179 QDFAFWLLLLKNQIKEVHLLKKCLTIYTVRNDSIS-SNKIKKAIRQWSILRSNENLSTLK 237 Query: 239 SKKYQLFTLYQIRNKRM 255 + L Y R Sbjct: 238 AILPFLSYAYHATIGRR 254 >UniRef50_A4J8Z3 Glycosyl transferase, family 2 n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J8Z3_DESRM Length = 297 Score = 224 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 57/249 (22%), Positives = 103/249 (41%), Gaps = 5/249 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M L+S+ +PT+NR++ IR+I+SVL Q Y E+ ++DD ST + ++ Sbjct: 1 MNE--LVSVVIPTYNREKTIIRSIESVLNQTYKAIEVCVIDDASTDKTEKIVQEKYGDND 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 ++ Y NSGAC RN + +A G+YI +D DD++ P ++ + + +Q Sbjct: 59 KVIYYRLAKNSGACVARNTGVSIAHGKYIAFLDSDDKFVPQKIELQMLKLKQC-EADICV 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQ 180 +N + L + +I + F+ FD + Q Sbjct: 118 SNYKYTDRNKNEKIVYAEPKEGKQLYDSLLWCNSITTGTILGKRECFEHVQFDAAMPRYQ 177 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGE-MQITSSPKKFSGYFHFYRKHKDKFDRAS 239 D+D+ LR+ Y + VE+A T K Y K++ F+ + Sbjct: 178 DWDLALRLSQNYRFCY-VEKALMEQEFQTVSITSSTGHKKTLFALDRLYEKNRLGFEECN 236 Query: 240 KKYQLFTLY 248 + + + Sbjct: 237 EAFVQIHWF 245 >UniRef50_B0NG34 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NG34_EUBSP Length = 556 Score = 224 bits (572), Expect = 2e-57, Method: Composition-based stats. Identities = 51/272 (18%), Positives = 102/272 (37%), Gaps = 18/272 (6%) Query: 6 LISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTA---LNDPR 61 SI P +N + I ++SV RQ Y+NWE+ + D S + + V D R Sbjct: 64 KFSIITPLYNTPVEYLIELLESVERQTYTNWELCLA-DGSDDKHEYVEKVCKDWMNRDNR 122 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 I Y+ N G N+ + LA G++ +D DD + L + ++ F+Y+ Sbjct: 123 IIYMKLKKNLGISNNTNECLKLASGDFFGLLDHDDVLHESALFEMMCVIEK-TEADFIYS 181 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAA 179 ++ E+ KS + + N I + + L+ + + Sbjct: 182 DEVKFSKEISDAT---DFNFKSAFGKDELRSHNYICHFTVFSKELLENANELYRSNFDGS 238 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 QD+D+ LR+ + + ++ + + K + R K++ R + Sbjct: 239 QDHDMVLRLTERAKNIVHIPKVLYYWRVHPDSVSMNLDSKSY-AVDAAIRAIKEQLIREN 297 Query: 240 KKYQ------LFTLYQIRNKRMTWRTLLTLLS 265 + T+Y+I + ++ L+ Sbjct: 298 EHGSVSSNLPYRTIYRISYDILQHPKVIILVH 329 >UniRef50_C1DM76 Glycosyl transferase, family 2 protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DM76_AZOVD Length = 292 Score = 224 bits (571), Expect = 3e-57, Method: Composition-based stats. Identities = 79/281 (28%), Positives = 134/281 (47%), Gaps = 8/281 (2%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ PL+S+Y+PT NR + RA++SVL Q Y+N E+++ DD ST + A ++ Sbjct: 1 MRPAPLVSVYIPTRNRLEKLERALRSVLGQTYANHEILVCDDASTDGTFERISRLARSER 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 +I Y+ N GAC+ RN I A+GE+ITG+DDDDE+TP+RL L +AF+ Sbjct: 61 KIRYLRNPAPRGACSARNLGIFAARGEFITGLDDDDEFTPDRLETLLDAWDD--GYAFVC 118 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWA-WRFKECLFDTELKAA 179 +N + + + + ++ R +N+ NQV T F ++ Sbjct: 119 SNFWSQRPGQRLR--AYYPLQAQTFTLRQLLLKNLATNQVLTRTGRLQDIGGFREGVRRL 176 Query: 180 QDYDIFLRMVVEY-GEPWKVEEATQILHINH--GEMQITSSPKKFSGYFHFYRKHKDKFD 236 QD+D +LR+ Y G + ++H +H +++ S +++D +D Sbjct: 177 QDWDTWLRLCSGYGGRFHRARTPLYVMHHDHEAEAQRVSRSIALDVALEELCERNRDLYD 236 Query: 237 RASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 S++ L ++R + L L RN L +R Sbjct: 237 ARSRRLLLSHARELRGEFSLGDLWLNTLHRRNSGPLRIYLR 277 >UniRef50_Q74BU3 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74BU3_GEOSL Length = 295 Score = 223 bits (570), Expect = 4e-57, Method: Composition-based stats. Identities = 53/238 (22%), Positives = 93/238 (39%), Gaps = 10/238 (4%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 MK P +S+ + +N + S+L Q Y ++E++I++D ST ND Sbjct: 1 MK-QPKVSVIVTCYNYATYLEGCLASILNQTYQDFELVIINDGSTDNTDEVISRFLDND- 58 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 ++ YI N N+G N I A GE I +D DD W P +L + + + +Y Sbjct: 59 KVRYI-NQKNTGQAIATNNGIAAAAGELIAFLDADDLWEPTKLEKQVRLFNR-DSIGVVY 116 Query: 121 ANDYVCQGEV----YSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTE 175 + E + L ++ I + F + +F+ E Sbjct: 117 SRIRFMDAEGCTLDMQLEGKYYTPRSGRVTDALLFENFIPYSSTVVRKECFDKFGMFNPE 176 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKD 233 K D+D++LR+ EY E V+E I I H ++ + + + + Sbjct: 177 YKNGLDWDLWLRISREY-EFDFVDEYLLIYRIGHPGQLTSNVERSVRCADLIFDRFLE 233 >UniRef50_D1BX31 Glycosyl transferase family 2 n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BX31_XYLCX Length = 264 Score = 223 bits (570), Expect = 5e-57, Method: Composition-based stats. Identities = 50/230 (21%), Positives = 91/230 (39%), Gaps = 6/230 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +++ MP +N ++ + +SVL Q +S E++IVDD S D R+ Sbjct: 9 PTVTVVMPAFNSERTLRDSARSVLDQSFSALELVIVDDSSADRTAAIATELGAGDARVRL 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 I N + G RN AI A+G Y+ D DD W P++L L Q A +Y+ + Sbjct: 69 IRNPHSLGPAGARNAAISAARGRYVAFCDSDDLWLPSKLERQLEVATQ-TGAALVYSGYH 127 Query: 125 VCQGEVYS-----QPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAA 179 + +PA ++ + + RN++G + + A Sbjct: 128 RVDADFSGPASGFRPADRVVHVPTLLTHGALLHRNVVGCLTAMIDTEQTGPVSMPGIPGA 187 Query: 180 QDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR 229 +D+ ++LR++ E G ++E + + + R Sbjct: 188 EDWALWLRVLREGGTAAGIDEPLALYRTAQPGSHSARRWRAVLAVWRVLR 237 >UniRef50_Q74D09 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74D09_GEOSL Length = 310 Score = 223 bits (570), Expect = 5e-57, Method: Composition-based stats. Identities = 62/236 (26%), Positives = 101/236 (42%), Gaps = 7/236 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 +NPL+S+ +P+ NR L I + SV+ Q Y N E+I+VDD S +D R+ Sbjct: 4 HNPLVSVVIPSKNRPDLVIATLVSVIDQTYDNIEIIVVDDGS-DMPLAPLLKDRFSD-RV 61 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLV-THAFLYA 121 + ND + G RN+ A G+YIT +DDDD W P +L + + + +Y Sbjct: 62 LCLRNDQSLGGAVARNRGAQSAHGDYITFLDDDDLWLPKKLEMQVDAFSKHGVDIGVVYC 121 Query: 122 NDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQ 180 E P + S + F FD EL + Q Sbjct: 122 GFDFLCKE-EIVPRQNKFHNDYDLSIAVLSGCPFGSPTPLIRKHYFDMVGGFDRELPSCQ 180 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFD 236 D+D+++R+ G + V+E+ + ++ ++ T+ KK G KH D+ Sbjct: 181 DWDLWIRLSKVCG-FYPVKESLALYRVHGDQIS-TNLRKKIDGRKMVLAKHYDEIK 234 >UniRef50_UPI0001788F1F glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F1F Length = 1106 Score = 223 bits (570), Expect = 5e-57, Method: Composition-based stats. Identities = 58/279 (20%), Positives = 105/279 (37%), Gaps = 6/279 (2%) Query: 2 KNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 PL+SI +P +N + I S + Q Y + E++IVDDCS + T N+ + Sbjct: 575 NEYPLVSIIIPVYNNVEYLASCIDSAINQTYRHIEVVIVDDCSPDPKVSSILNTYKNNKK 634 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA 121 + N+ N G +N + ++G I +D DD N + L + + + + Sbjct: 635 VRLFKNEYNQGISKTQNICMAKSEGSIIAFLDCDDILELNAIEKCLEYWE--PGTKYSFT 692 Query: 122 NDYVCQGEVYSQPASLPLY-PKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELKAA 179 N + + PK R + K +F++E A Sbjct: 693 NRIHINEDSEEIGRFSCDHLPKDNIFEDHLDVRMYASHFKMISRDVFLKVGVFNSEYDGA 752 Query: 180 QDYDIFLRMVVEYGE--PWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDR 237 QDYD+ LR+ Y V E I+ + I S+ K+ + + K + Sbjct: 753 QDYDMVLRVAFHYPNSAFVHVPEFLYKHRIHTNQKSIKSNDKQKNMSIVISDQAKLRRSI 812 Query: 238 ASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGI 276 + + I + +TL + S++N ++ I Sbjct: 813 RDGVFHKLISFIIISFGKEDQTLQCIQSIKNTVKIPHEI 851 Score = 46.2 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 46/292 (15%), Positives = 92/292 (31%), Gaps = 33/292 (11%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYS-NWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 LIS + ++ ++ ++ I+S+ E+I+ ++ S+ P + Sbjct: 820 LISFIIISFGKEDQTLQCIQSIKN-TVKIPHEIILFENGSSESCVSFIKEHIEGIPGVKV 878 Query: 65 IHNDINSGACAVRNQAIMLAQ--GEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYA- 121 I+N N G R +A+ A YI+ +D+D E T L L ++ + Sbjct: 879 IYNSSNLGPAGGRKEAMKYATPNSYYIS-LDNDIEVTEGWLEELLVRAEESDDIGSVTCR 937 Query: 122 -----NDYVCQGEVYSQPASLPLYP-----KSPYSRRLFYKR----NIIGNQVFTWAWRF 167 + G + + KS Y + N +G +F W Sbjct: 938 VTFPSDVLQFTGGYEEISGNRVQFKLYNINKSVYDLSTLERYDTDWNPVGATLF-KGWFP 996 Query: 168 KECLFDTELKAAQDYD--IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYF 225 + + +D LR + I+H M KK Y Sbjct: 997 EVEGYP---NVYEDSAISYLLR--NSGKRLVNSPNSLL---IHHHIMFDEERNKKEQEYI 1048 Query: 226 HFYRKHKDKFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLADGIR 277 F ++ + S F I + + + +S + + + + Sbjct: 1049 RF--RYNPQLMLRSLAQFYFDYKLIIDDDYIYSSNNIDISKLSDQEVIETFE 1098 >UniRef50_Q8PXS5 Glycosyltransferase n=1 Tax=Methanosarcina mazei RepID=Q8PXS5_METMA Length = 311 Score = 223 bits (570), Expect = 5e-57, Method: Composition-based stats. Identities = 58/283 (20%), Positives = 110/283 (38%), Gaps = 11/283 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 LIS+ M +N ++ +I S+L Q Y NWE+II++D S+ + DPRI I Sbjct: 5 LISVVMSAYNSEKFISDSISSILDQTYENWELIIINDASSDNTLKIVNQFSEKDPRIKVI 64 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 N+ N G N I ++GE+I +D DD P+RL L H + + ++ Sbjct: 65 DNENNLGLTISLNIGINNSKGEFIARLDSDDFAEPSRLEKELDHLHAYPEAGLVGSGAHL 124 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAW-RFKECLFDTELKAAQDYDI 184 + + +R L I + + + + + +QDYD+ Sbjct: 125 INSSGNKIGSMNVMSQPYFVNRFLINLNPFIHSSIMVRKKALDNVGSYREKFRYSQDYDL 184 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKK-----FSGYFHFYRKHKDKFDRAS 239 LR+ +Y + + ++ + + + + F R++ S Sbjct: 185 ILRLSDKY-KLSNIALPLIRWRVSDSSVTMQHHTLQRIYADIAREFAIERRNSGHDSYES 243 Query: 240 KKY-QLFTLYQIRN---KRMTWRTLLTLLSVRNGKRLADGIRG 278 ++ L +IRN L + + + ++ I+G Sbjct: 244 TEFDTLIHEMRIRNQGRYLCDHGVYDLLFNKKYKEGCSELIKG 286 >UniRef50_B9M560 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. FRC-32 RepID=B9M560_GEOSF Length = 320 Score = 223 bits (570), Expect = 5e-57, Method: Composition-based stats. Identities = 51/274 (18%), Positives = 104/274 (37%), Gaps = 12/274 (4%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+SI +P +N +AI S L Q Y E+++++D S ++ + D +I Sbjct: 2 TPLVSIIIPVYNGSNYLKQAIDSALNQTYKYIEILVINDGSNDGGLTEEIARSYGD-KIR 60 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL-YAN 122 Y H N G N I LA G+YI+ + DD + P+++ ++ + + Y+N Sbjct: 61 Y-HCKPNGGVATALNTGINLAAGDYISWLSHDDLYLPHKIERQISTIASIGGSDVISYSN 119 Query: 123 DYVCQGEVYSQPASLPLYP-----KSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK 177 + K + +LF + + + F L+ Sbjct: 120 YETIDAKNNVLRTVRLETSEIYDCKLAFLLQLFISSIHGCSLLIPRKCFTEVGYFKESLR 179 Query: 178 AAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT----SSPKKFSGYFHFYRKHKD 233 QDYD++ R++ + + E ++ + + + Y D Sbjct: 180 TTQDYDLWFRLLQKGFAFTHLPEILIQSRFHNEQGTHSLYAVHLKEAEQLYLRAVNTFYD 239 Query: 234 KFDRASKKYQLFTLYQIRNKRMTWRTLLTLLSVR 267 F++ K + + +R +++ L S++ Sbjct: 240 DFEKLPIKKIVDLVIDLRERKLKQTANHVLKSIK 273 >UniRef50_C0ECJ2 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0ECJ2_9CLOT Length = 658 Score = 223 bits (569), Expect = 5e-57, Method: Composition-based stats. Identities = 60/264 (22%), Positives = 90/264 (34%), Gaps = 26/264 (9%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ + WN + R I+S+L Q Y N E+I DD ST A D R+ Sbjct: 3 PKVSVLIACWNSEHTIQRCIESILSQTYLNLEIIACDDGSTDGTFGLLQEFAQRDSRVIP 62 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + + +N GA + RN + A+G YI D DD +P+R+ + + AF+ A Sbjct: 63 LRSPVNRGAASARNLCLASARGAYIAIQDADDYSSPDRIKKEVDFLEHHPEFAFVSAGMV 122 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGN--QVFTWAWRFKECLFDT--ELKAAQ 180 S P+S R F + +F + E + Q Sbjct: 123 RFDEVGEW---SWYSPPRSVPLRWDFLSGPPFAHAVTLFRRTALQAVGGYRVAWETRRGQ 179 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 DYD+F R+ + + RK R Sbjct: 180 DYDLFFRLCAAGYRGANLPGFLYFYLQDRNAFS---------------RKQY----RYRV 220 Query: 241 KYQLFTLYQIRNKRMTWRTLLTLL 264 + L R RM R L L Sbjct: 221 AESVIRLKGYRQNRMLLRGLPYLF 244 >UniRef50_A5GEM2 Glycosyl transferase, family 2 n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GEM2_GEOUR Length = 365 Score = 223 bits (569), Expect = 5e-57, Method: Composition-based stats. Identities = 50/256 (19%), Positives = 98/256 (38%), Gaps = 6/256 (2%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRI 62 +P +SI +P +N ++ AI+S+L Q Y+++E+II++D ST ++ D RI Sbjct: 7 KSPSVSIILPVFNGEKYLQSAIESLLSQSYADFELIIINDGSTDQSLAIMTKYSICDKRI 66 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 + N N G N+ I L++GEYI +D DD P+RL +A ++ + Sbjct: 67 RIL-NQENKGLIYALNKGIDLSRGEYIARMDSDDISHPDRLKHQVAFLERNRDVGIVGTY 125 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE--CLFDTELKAAQ 180 + ++ +L + + V + +D A+ Sbjct: 126 IDLFDE--FNHTGIKFATKPEEIRAKLLFGCELAHPTVMIRKVFITKYDLRYDINDLHAE 183 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASK 240 D+ +++R Y V + L ++ ++ K K + Sbjct: 184 DFGLWVRCCT-YFNIANVPKILVRLRQHNSQVTKVYGSKTIQTSMEIILKQLLQLGITVT 242 Query: 241 KYQLFTLYQIRNKRMT 256 +L ++ M Sbjct: 243 PAELSIHTELCYGFMP 258 >UniRef50_B0G3X3 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G3X3_9FIRM Length = 533 Score = 223 bits (569), Expect = 5e-57, Method: Composition-based stats. Identities = 48/276 (17%), Positives = 96/276 (34%), Gaps = 12/276 (4%) Query: 5 PLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALN-DPRI 62 PL SI +P +N + SV Q Y W++ I + + E + + D RI Sbjct: 62 PLFSILVPVYNTPIPYLREMLDSVRNQTYQKWQLCIANANPENEEVARILNQYIEMDKRI 121 Query: 63 TYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAN 122 ++ N G N+A+ +A+G+YI +D DD + L +Y N Sbjct: 122 QTVNVPENLGIAQNTNKALSIAKGDYIGLLDHDDMLAADALFEVAKTVNDE-NADVIYTN 180 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQD 181 + K ++ + N I + A K F E AQD Sbjct: 181 EDKITMSGEKH---FQPNFKPEFNLDMLRSNNYICHFFIAKASLMKEIGGFRGEYNGAQD 237 Query: 182 YDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKK 241 +D+ R ++ + ++ K++ + + +D R + Sbjct: 238 HDMIFRCTERADNIVRIPKVLYHWRMHEQSTAENPESKRY-AFDAGKKVIEDHLKRCGES 296 Query: 242 YQLFTLYQIRNKRMTW----RTLLTLLSVRNGKRLA 273 ++ R+ + + ++++ V+ K + Sbjct: 297 AEVEMTEYPGFYRVKYAIKEKPRVSVVIVKETKSMI 332 >UniRef50_C6MKP2 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M18 RepID=C6MKP2_9DELT Length = 305 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 55/216 (25%), Positives = 87/216 (40%), Gaps = 14/216 (6%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M+ P +S+ MP +N + AI S+ Q Y NWE++IVDD ST +YV + DP Sbjct: 1 MQ-QPKVSVVMPVYNTRPYLEEAIASIFAQTYRNWELVIVDDFSTDGSW--EYVCGIRDP 57 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 R+ N+ N N+ I LA GEYI +D DD P R+ + + ++ + Sbjct: 58 RVRVARNERNMRNSYTLNRGIALATGEYIAKMDADDVSFPERIERQMEYLREHPEVDAVG 117 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRL-------FYKRNIIG-NQVFTWAWRFKECLF 172 Y ++ ++P R + F I + F + Sbjct: 118 CGLYRVDRDLKLITVNIPPAAHRDIIRFIAPGRKFVFGPSFPITDGCLVARRSWFLRWQY 177 Query: 173 DTELKAAQDYDIFLRMVVEYG-EPWKVEEATQILHI 207 D + AQD+D LR Y ++E + Sbjct: 178 DPAIPYAQDFDQNLR--SHYDSVFANLQEPLYLYRR 211 >UniRef50_Q6XQ52 Putative glycosyltransferase n=1 Tax=Escherichia coli RepID=Q6XQ52_ECOLX Length = 288 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 65/277 (23%), Positives = 115/277 (41%), Gaps = 7/277 (2%) Query: 1 MKN-NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M++ LI++ MP +N ++ +IKS+L Q YSN+E II++D S L+D Sbjct: 2 MEDTKELITVIMPVYNAEKYIYESIKSILVQSYSNFEFIIINDGSKDSSGKIIENF-LDD 60 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 RI YI N N G N+AI L++G YI +D DD P RL L + A + Sbjct: 61 TRIRYI-NRENKGLVFTLNEAISLSRGNYIARMDADDISHPLRLEKQLNFLLENPDIAVV 119 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRF-KECLFDTELKA 178 + ++ P LF+ + V + + + Sbjct: 120 GCSSFIIDQNSKVINTRKPPLTPLVNKALLFFGPTLTHPSVMFNKMLLGDQLYYSDKYLH 179 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYR--KHKDKFD 236 +DYD++LR++ +Y + +++ IN + T+ ++ Y K+ K+ Sbjct: 180 VEDYDLWLRLINQY-KIANIKDVLFYYRINESGVSQTNLFEQKINAAKAYSEIKYDGKYK 238 Query: 237 RASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKRLA 273 +K ++ L K ++ +L +L L Sbjct: 239 DLLEKLEVIHLRHEMEKSKVFQAVLYILLKYEHDSLL 275 >UniRef50_Q7NIJ4 Gll2189 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NIJ4_GLOVI Length = 312 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 59/251 (23%), Positives = 102/251 (40%), Gaps = 8/251 (3%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P +S+ +P +N + + + S L Q + ++E++IVDD ST +V + D R+ Sbjct: 2 PKVSVIIPAFNALRYLPQTVASALAQSFGDFEVLIVDDESTDGTA--DWVRTIADRRVRL 59 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N GA A RN AI A+GEYI +D DD W P +L +A +Y Sbjct: 60 LEQ-KNQGAAAARNAAIREARGEYIAFLDADDLWEPTKLERQVACLDARPEVGLVYTWVQ 118 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYD 183 V + + + +L NI + F+ LFD EL+ A+D++ Sbjct: 119 VIDQDGEPTGSLYRPGDEGMVWPQLVRANNIYPSAAMVRRQCFERVGLFDLELRFAEDWE 178 Query: 184 IFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKYQ 243 ++LR+ +Y + E + Q S +G K ++ + + Sbjct: 179 MWLRLAGDYA-FAVLREPLMRYRRHP---QCKSKKSSEAGLLRVMDKAFERGPEHLRHLK 234 Query: 244 LFTLYQIRNKR 254 +I Sbjct: 235 AECYGRIYLYL 245 >UniRef50_B8DIX4 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DIX4_DESVM Length = 363 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 50/215 (23%), Positives = 86/215 (40%), Gaps = 6/215 (2%) Query: 5 PLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 P IS+ + +N + AI+SVL Q +S++E+II+DD ST A D RI Sbjct: 9 PTISVGLSVYNAVRFLREAIESVLNQTFSDFELIIIDDGSTDGSDKIIREFAARDGRIRV 68 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + N G A N A+ LA+GE+ +D DD NR V L + + N Sbjct: 69 VTQ-QNKGLVASLNLALSLARGEFFARMDADDVCMNNRFQVQLEYLLAHPDVGVVGGNIA 127 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFK-ECLFDTELKAAQDY 182 +YP+ +R + + + + V + + + +DY Sbjct: 128 AIDEAGNFL--QSVVYPQGDDAREALLRGSPVAHPAVLARTAVLRLVGGYRAYYRHCEDY 185 Query: 183 DIFLRMVVEYGEPWKVEEATQILHINHGEMQITSS 217 D++LR+ V+ + + + +S Sbjct: 186 DLWLRVSECA-RLDNVDTCVLSYRHHRSSVSVINS 219 >UniRef50_A1AUG5 Glycosyl transferase, family 2 n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AUG5_PELPD Length = 308 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 57/255 (22%), Positives = 92/255 (36%), Gaps = 4/255 (1%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYI 65 IS+ M +N ++ ++I+SVL Q YS++E +IV+D ST +D RI + Sbjct: 2 KISVIMSAYNAEKHLTKSIESVLGQTYSDFEFLIVNDASTDDTPALLQKYQRSDNRIALL 61 Query: 66 HNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYV 125 HN N G N AI A+ I D DD P R S + + L + Sbjct: 62 HNRSNIGLTRSLNLAIQHARHNIIARQDADDISHPERFSKQIDFLKTHGEVMLLGTAGQL 121 Query: 126 CQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKE-CLFDTELKAAQDYDI 184 + +RL I V L+ E +AQDYD+ Sbjct: 122 IDDSGRILRNEPVISGSVRLKKRLQRCNQFIHGSVMMRRQCLNVIGLYREEFVSAQDYDL 181 Query: 185 FLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKFDRASKKY-- 242 FLR+ +Y + I+ + I S ++ +D+ + Sbjct: 182 FLRISEQYA-VDNLPVPLYQYRISPSTVSIAKSRQQQVCALIAQEASRDRRSGMPNNWDQ 240 Query: 243 QLFTLYQIRNKRMTW 257 ++ Y R + Sbjct: 241 LMYDAYSRRINSKRF 255 >UniRef50_Q9LA88 Putative glycosly transferase (Fragment) n=2 Tax=Aeromonas hydrophila RepID=Q9LA88_AERHY Length = 314 Score = 223 bits (568), Expect = 7e-57, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 88/233 (37%), Gaps = 4/233 (1%) Query: 4 NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRIT 63 PL+S+ +P +N + A+ S+ Q ++++E++++DD + A +D RI Sbjct: 6 KPLLSVLIPVYNCAEYIEEALASIQAQTFTDFEVLVIDDGCSDETPAIVRRIACHDQRIK 65 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYAND 123 I + N G N+ + LAQG ++ +D DD P+R S LA + + Sbjct: 66 IISRE-NRGIVDSLNEGVALAQGRWLARMDGDDICEPHRFSCQLAFLNAHPEIDIVGSWV 124 Query: 124 YVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELKAAQDYD 183 + LF + + F++ +D + +D + Sbjct: 125 KLFGARSEIWHHRREDIF--IKFMLLFRSSGFSHSSIMGKIELFQKYPYDKHYEDVEDTE 182 Query: 184 IFLRMVV-EYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 ++ RM + + + E ++ ++ ++ Y K+ Sbjct: 183 LWSRMALCRGVKFANIPEVLVHYRVHDAQVSKIKRARQMILYRQIMTKYMQTL 235 >UniRef50_C1VEY2 Glycosyl transferase n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1VEY2_9EURY Length = 332 Score = 223 bits (568), Expect = 7e-57, Method: Composition-based stats. Identities = 54/242 (22%), Positives = 103/242 (42%), Gaps = 9/242 (3%) Query: 6 LISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP--RIT 63 L+S+ +P + R + A++SV Q Y+ E+++VDDCS A D R+ Sbjct: 9 LVSVVVPAYGRPEYLDAAVESVNDQTYAPIELVVVDDCSPDP-IEPIVADAETDSLHRVH 67 Query: 64 YIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQL-VTHAFLYAN 122 + +++N GA A RN I ++GE++ +DDDD W P ++ + + F+Y Sbjct: 68 VVRHEVNRGANAARNTGIEASRGEFVAFLDDDDYWRPRKVEKQVRAFENASDEVGFVYTG 127 Query: 123 DYVCQGEVYSQPASLPLYPKSPYSRRLFYKR--NIIGNQVFTWAWRFKECLFDTELKAAQ 180 + + +P + + LF + + + + L DT + Q Sbjct: 128 QENVADDGTTTNYRVPE-TRGWVTHELFRGAPLCPFSSVMVRRSVIDEVGLLDTRFPSWQ 186 Query: 181 DYDIFLRMVVEYGEPWKVEEATQILHIN-HGEMQITSSPKKFSGYFHFYRKHKDKFDRAS 239 D + +LR+ V E + ++ HG++ K+ Y F KH+ + Sbjct: 187 DREWYLRVSEVCA-FETVTEPLAVRRVDSHGQISDNYEQKRDVSYPLFLEKHRPLARKYG 245 Query: 240 KK 241 ++ Sbjct: 246 RR 247 >UniRef50_B8HRA8 Glycosyl transferase family 2 n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HRA8_CYAP4 Length = 689 Score = 223 bits (568), Expect = 8e-57, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 76/210 (36%), Gaps = 2/210 (0%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIH 66 +SI +P +N ++S+L Q ++NWE I+V+D ST + D RI Sbjct: 9 VSIVIPAFNAADTIAETLQSLLDQTFTNWEAIVVNDGSTDDTVVVVQRFMARDLRIHLFT 68 Query: 67 NDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDYVC 126 + N G RN LAQG+ + +D DD P+ L + +Y Sbjct: 69 QE-NQGLSGARNSGAKLAQGDCVLFLDADDWIFPSHLERLTHRFLADPSLDVVYCGWVHA 127 Query: 127 QGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFKECLFDTELK-AAQDYDIF 185 + LP + Y +++ + + + FDT + +D+D + Sbjct: 128 LPDREYVLPELPTLSGDLFVPFCHYCVSLMSSFLVRRSLVDLVGPFDTSMHCGGEDWDFW 187 Query: 186 LRMVVEYGEPWKVEEATQILHINHGEMQIT 215 R+ V+E + Sbjct: 188 QRIARSGARFGLVKEVLVAYRARPNSLSRN 217 >UniRef50_B7RVP1 Glycosyl transferase, group 2 family n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RVP1_9GAMM Length = 657 Score = 223 bits (568), Expect = 8e-57, Method: Composition-based stats. Identities = 59/285 (20%), Positives = 112/285 (39%), Gaps = 18/285 (6%) Query: 1 MKNNPLISIYMPTWNRQ-QLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 +K PLISI +P +N ++ + SVL Q Y NWE+ I DD S+ + Y+ L D Sbjct: 121 LKYKPLISIIVPCYNPDIEMLQACVSSVLDQSYENWELCISDDASSDESAKK-YIQQLGD 179 Query: 60 --PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHA 117 +I N G N A+ LA+GE++ +D DD T + L + Sbjct: 180 AHSKIRIDLLKDNVGIALNSNAALALAKGEFVALLDQDDVLTKDALLEVAIALNKKQRPK 239 Query: 118 FLYANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQ-VFTWAWRFKECLFDTEL 176 +Y++ + ++ + K ++R FY N I + V F Sbjct: 240 LIYSD----EDKLDESGKRFAPHFKPDWNRDYFYSINYICHLAVMERTTLLDVEGFRKGY 295 Query: 177 KAAQDYDIFLRMVVEY--GEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDK 234 +QDYD+ LR + + + + + + G +++ K + + ++ Sbjct: 296 DGSQDYDLLLRFIANIEDKDIYHIPKILYRWRAHAG-STASAASAKSYSWEAGVKALQEH 354 Query: 235 FDRASKKYQLF-----TLYQIRNKRMTWRTLLT-LLSVRNGKRLA 273 D ++ + Y + + L++ ++ R+ K Sbjct: 355 LDSTNRAGVVNPRPQPNSYCVDWESPADSPLVSLIIPTRDHKDTL 399 Score = 87.8 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 76/217 (35%), Gaps = 31/217 (14%) Query: 3 NNPLISIYMPTWNRQQLAIRAIKSVLRQD-YSNWEMIIVDDCSTSWEQLQQYVTALNDPR 61 ++PL+S+ +PT + + I S+L+ Y N+E+++VD+ ST L + Sbjct: 382 DSPLVSLIIPTRDHKDTLCTCIDSILQNTSYENYEIVVVDNQSTCPLTLSYLEELKAHRK 441 Query: 62 ITYIHNDINSGACAVRNQAIMLAQGEYITGIDDD------DEWTPNRLSVFLAHKQQLVT 115 ++ I A+ N A+ +GEYI +++D D W N + + Sbjct: 442 VSVIRYPHAFNYSAITNFAVSQVRGEYIGLVNNDIEVINRD-WLTNMMKRAVR-----PG 495 Query: 116 HAFLYANDYVCQGEVYSQPA------------SLPLYPKSPYSRRLFYKRNI----IGNQ 159 + A Y + S Y RL +N+ Sbjct: 496 TGCVGAKLYYPDDTIQHAGVILGIGGVAGHSHKYFPNDSSGYYCRLILDQNLSAVTAACL 555 Query: 160 VFTWAWRFKECLFDT-ELKAA-QDYDIFLRMVVEYGE 194 V A + FD L A D D L++ Sbjct: 556 VVKKAIFDECGGFDEKNLTIAFNDIDFCLKVDARGYH 592 >UniRef50_A3W9Y6 Glucosyltransferase n=1 Tax=Erythrobacter sp. NAP1 RepID=A3W9Y6_9SPHN Length = 310 Score = 223 bits (568), Expect = 8e-57, Method: Composition-based stats. Identities = 54/276 (19%), Positives = 93/276 (33%), Gaps = 11/276 (3%) Query: 1 MKNNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDP 60 M + P S+ +P +N + SVL Q ++E+++VDD S+ A+ D Sbjct: 1 MMSAPKFSVVIPAYNAGATLRSTVVSVLAQTDQDFEIVVVDDGSSDGTLNAMLDLAVKDE 60 Query: 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLY 120 RI + NSG A RN LA+GE I +D DD+W +L+ + + Sbjct: 61 RIRIVSQP-NSGVSATRNYGASLAKGELIAFLDADDQWDREKLARHREMHRCDPLLDASF 119 Query: 121 ANDYVCQGEVYSQPASLPLYPKSPYSRRL--FYKRNIIG---NQVFTWAWRFKECLFDTE 175 A L N + N V FD Sbjct: 120 AQVEFAPERKGKLKRGRVRSSIPADYLDLGDVLVENAVCTTSNLVIKRDAFADLGGFDQT 179 Query: 176 LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQITSSPKKFSGYFHFYRKHKDKF 235 ++ A+D + R++ + E ++ + F +R+ + Sbjct: 180 MRYAEDQEFLARLLAHGCLIRGIPEPLVRYRMSEDGLSC-----DFEAMLSNWRRFASCW 234 Query: 236 DRASKKYQLFTLYQIRNKRMTWRTLLTLLSVRNGKR 271 S+ Q +Y R R ++ R+ R Sbjct: 235 IEGSELAQAEAIYCRYLTRRALRAGASMRITRSLAR 270 >UniRef50_A0LJ94 Glycosyl transferase, family 2 n=2 Tax=Deltaproteobacteria RepID=A0LJ94_SYNFM Length = 294 Score = 223 bits (568), Expect = 8e-57, Method: Composition-based stats. Identities = 57/213 (26%), Positives = 89/213 (41%), Gaps = 11/213 (5%) Query: 7 ISIYMPTWNRQQLAIRAIKSVLRQ--DYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITY 64 +S+ +PT+NR+ A+ SVL Q + E+I+VDD S+ V RI Sbjct: 9 VSVIVPTYNRRAFVKEAVASVLAQEGVFP--EIIVVDDGSSDGTDRALEVFG---ERIRR 63 Query: 65 IHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFLYANDY 124 + + GA A RN I +A GE++ +D DD W P++L L + Sbjct: 64 LRR-AHGGASAARNTGIRIAAGEWLAFLDSDDLWLPSKLRRQLDFLTAHPEFLVCQTEEI 122 Query: 125 VCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFTWAWRFK-ECLFDTELKAAQDYD 183 + V P P+ RL + + + V F LFD L A +DYD Sbjct: 123 WLRNGVRINPRKYHRKPRGHCFERLLDRCLVSPSAVMIHRDVFDSVGLFDENLPACEDYD 182 Query: 184 IFLRMVVEYGEPWKVEEATQILH-INHGEMQIT 215 ++LR+ Y VEE + + ++ T Sbjct: 183 LWLRIGCTYA-LGLVEEQLVVKRGGHPDQLSAT 214 >UniRef50_C6E5K3 Glycosyl transferase family 2 n=1 Tax=Geobacter sp. M21 RepID=C6E5K3_GEOSM Length = 318 Score = 222 bits (567), Expect = 9e-57, Method: Composition-based stats. Identities = 51/217 (23%), Positives = 88/217 (40%), Gaps = 7/217 (3%) Query: 1 MKN-NPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALND 59 M + +PL+SI +P +N AI S L Q Y N E+++V+D S + + + D Sbjct: 1 MSSFSPLVSIIIPVYNGSDYLREAIDSALAQSYGNIEILVVNDGSRDEGKTEAIAISYGD 60 Query: 60 PRITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQQLVTHAFL 119 +I YI N G + N I GEY T + DD + P + + ++ A + Sbjct: 61 -KIRYIP-KQNGGVASALNLGIREMTGEYFTWLSHDDLYYPAKTEEQVRLLEEAGGDAVV 118 Query: 120 YANDYVCQGEVYSQPASLPLYPKSPYSRRLFYKRNIIGNQVFT-WAWRFKECLFDTELKA 178 Y++ + YS L + I G V + + LFD L+ Sbjct: 119 YSDYEYIDPSGNYLGTKMAKSSGVKYS--LIMEGTINGCTVMIPRRYFDEIGLFDLALRT 176 Query: 179 AQDYDIFLRMVVEYGEPWKVEEATQILHINHGEMQIT 215 QD+D++ R+ Y + ++ + +T Sbjct: 177 TQDFDMWFRLAEHY-PFLHQAKVLVKSRVHPNQGSLT 212 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.174 0.572 Lambda K H 0.267 0.0534 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,929,233,514 Number of Sequences: 3077464 Number of extensions: 106026712 Number of successful extensions: 284971 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 10577 Number of HSP's successfully gapped in prelim test: 8289 Number of HSP's that attempted gapping in prelim test: 242636 Number of HSP's gapped (non-prelim): 20977 length of query: 279 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 152 effective length of database: 649,558,428 effective search space: 98732881056 effective search space used: 98732881056 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 93 (40.1 bits)