BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (359 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B5FN86 4-alpha-L-fucosyltransferase n=47 Tax=Enterobact... 635 0.0 UniRef50_A4TRB6 4-alpha-L-fucosyltransferase n=126 Tax=Enterobac... 485 e-136 UniRef50_B0BRE0 TDP-Fuc4NAc:lipid II Fuc4NAc transferase n=6 Tax... 333 8e-90 UniRef50_B8F5S7 4-alpha-L-fucosyltransferase n=2 Tax=Haemophilus... 317 6e-85 UniRef50_A9BJR5 Putative uncharacterized protein n=1 Tax=Petroto... 92 3e-17 UniRef50_Q21IU4 Glycosyltransferase-like protein n=1 Tax=Sacchar... 91 5e-17 UniRef50_A6QAJ2 Putative uncharacterized protein n=1 Tax=Sulfuro... 86 2e-15 UniRef50_B1XZR9 4-alpha-L-fucosyltransferase n=1 Tax=Leptothrix ... 84 9e-15 UniRef50_C3WR04 4-alpha-L-fucosyltransferase n=2 Tax=Fusobacteri... 83 2e-14 UniRef50_C1TR64 4-alpha-L-fucosyltransferase (Fuc4NAc transferas... 78 5e-13 UniRef50_A6VTI0 Putative uncharacterized protein n=1 Tax=Marinom... 76 2e-12 UniRef50_B7IGW7 Putative uncharacterized protein n=1 Tax=Thermos... 76 2e-12 UniRef50_C3XKD1 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 74 1e-11 UniRef50_C3XKC6 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 66 2e-09 UniRef50_C1FUB2 Putative uncharacterized protein n=1 Tax=Clostri... 56 2e-06 UniRef50_Q8KWB8 RB124 n=1 Tax=Ruegeria sp. PR1b RepID=Q8KWB8_9RHOB 55 3e-06 UniRef50_Q4ACG5 4-alpha-L-fucosyltransferase (Fragment) n=1 Tax=... 55 3e-06 UniRef50_Q0HKL5 Putative uncharacterized protein n=1 Tax=Shewane... 47 8e-04 UniRef50_D1PK33 Putative uncharacterized protein n=1 Tax=Subdoli... 45 0.004 UniRef50_C3RR99 Predicted protein n=1 Tax=Mollicutes bacterium D... 43 0.014 UniRef50_B5JRF7 Putative uncharacterized protein n=1 Tax=Verruco... 42 0.025 UniRef50_C9CTI5 Rb124 n=1 Tax=Silicibacter sp. TrichCH4B RepID=C... 42 0.043 UniRef50_B3CFK0 Putative uncharacterized protein n=1 Tax=Bactero... 41 0.080 >UniRef50_B5FN86 4-alpha-L-fucosyltransferase n=47 Tax=Enterobacteriaceae RepID=WECF_SALDC Length = 359 Score = 635 bits (1638), Expect = 0.0, Method: Compositional matrix adjust. Identities = 297/357 (83%), Positives = 329/357 (92%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPG 60 MTVLIHVLGSDIPHHN TVLRFFND LAATSEHAREFMV G+D+G ++SCPALS++F+ Sbjct: 1 MTVLIHVLGSDIPHHNHTVLRFFNDTLAATSEHAREFMVAGEDNGFTESCPALSLRFYGS 60 Query: 61 KKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGL 120 KK+LA+AVIAKAKANR+QRFFFHGQFN +LWLALLSGGIKP+QF+WHIWGADLYE+S+GL Sbjct: 61 KKALAQAVIAKAKANRRQRFFFHGQFNTSLWLALLSGGIKPAQFYWHIWGADLYEVSNGL 120 Query: 121 RYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQR 180 +++LFYPLRR+AQ RVGCVFATRGDLS+FA+ HP VRGELL+FPTRMDPSLN MA + QR Sbjct: 121 KFRLFYPLRRIAQGRVGCVFATRGDLSYFARQHPNVRGELLYFPTRMDPSLNAMAKECQR 180 Query: 181 EGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELF 240 GK+TILVGNSGDRSN+HIAALRAV+QQFGDTV VVVPMGYP NN+AYI+EVRQAGL LF Sbjct: 181 AGKLTILVGNSGDRSNQHIAALRAVYQQFGDTVNVVVPMGYPANNQAYIDEVRQAGLALF 240 Query: 241 SEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ 300 S ENLQILSEK+EFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQA IPCVLNR+NPFWQ Sbjct: 241 SAENLQILSEKMEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQADIPCVLNRDNPFWQ 300 Query: 301 DMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAIAARE 357 DM EQHLPVLFTTDDLNE +VREAQRQLASVDK+ I FFSPNYLQ W AL IAA E Sbjct: 301 DMAEQHLPVLFTTDDLNEQVVREAQRQLASVDKSGITFFSPNYLQPWHNALRIAAGE 357 >UniRef50_A4TRB6 4-alpha-L-fucosyltransferase n=126 Tax=Enterobacteriaceae RepID=WECF_YERPP Length = 361 Score = 485 bits (1249), Expect = e-136, Method: Compositional matrix adjust. Identities = 228/359 (63%), Positives = 280/359 (77%), Gaps = 2/359 (0%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAAT--SEHAREFMVVGKDDGLSDSCPALSVQFF 58 M L HVLGSDIPHHN TVLRFFND LA E R FMV K+ S P L + + Sbjct: 1 MITLTHVLGSDIPHHNLTVLRFFNDVLAKCLPVEQVRHFMVAAKETAPFSSFPQLDINTY 60 Query: 59 PGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSS 118 KK+LAEAVIA+A+A+R RFF+HGQFN TLWLALLSG IKP Q +WH+WGADLYE + Sbjct: 61 SDKKALAEAVIARAQADRSARFFWHGQFNATLWLALLSGKIKPGQVYWHVWGADLYEDAK 120 Query: 119 GLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDR 178 L+++LFY LRR+AQ R G VFATRGDL + + HP+V LL+FPTRMDP+L + D+ Sbjct: 121 SLKFRLFYLLRRIAQGRGGHVFATRGDLIHYQQRHPRVPASLLYFPTRMDPALTAINIDK 180 Query: 179 QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLE 238 G MTILVGNSGD +N HI AL+A+HQQFG V+V++PMGYP NNEAYIE+VRQAGL Sbjct: 181 PLAGPMTILVGNSGDTTNRHIEALKAIHQQFGPDVRVIIPMGYPANNEAYIEQVRQAGLA 240 Query: 239 LFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPF 298 LFS++NL+IL+E++ FD YL +LR+CDLGYFIF RQQGIGTLCLL Q G+P VL+R+NPF Sbjct: 241 LFSQDNLRILTEQIPFDDYLNILRECDLGYFIFNRQQGIGTLCLLTQFGVPFVLSRKNPF 300 Query: 299 WQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAIAARE 357 WQD+ EQH+PV F D L+E ++REAQRQLA +DK IAFF+PNY++GW++ALA+AA E Sbjct: 301 WQDLAEQHIPVFFYGDTLDEPMIREAQRQLAGLDKQAIAFFNPNYIEGWKQALALAAGE 359 >UniRef50_B0BRE0 TDP-Fuc4NAc:lipid II Fuc4NAc transferase n=6 Tax=Pasteurellaceae RepID=B0BRE0_ACTPJ Length = 356 Score = 333 bits (853), Expect = 8e-90, Method: Compositional matrix adjust. Identities = 165/351 (47%), Positives = 237/351 (67%), Gaps = 9/351 (2%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDAL-AATSEHAREFMVVGKDDGLSDSCPALSVQFFP 59 M + H+LGSDIPHHNRTVL FF D L +E F VVG+ L+ P L++Q F Sbjct: 1 MRPIYHILGSDIPHHNRTVLNFFRDQLLPKLTEQQHYFYVVGQQTLLTQY-PELNLQVFC 59 Query: 60 GKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSG 119 ++++ AV+ AK + +F HGQ+N LW+A+L G + + WHIWGADLYE +SG Sbjct: 60 SRQAITRAVVQTAKQVKTAKFVLHGQYNVWLWIAVLFGYLPACRCIWHIWGADLYEEASG 119 Query: 120 LRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTH---PKVRGELLFFPTRMDPSLNTMAN 176 ++KLFY +RRLAQ+++ ++ATRGDL+F AK H + +L+FPT+M S M + Sbjct: 120 WKFKLFYFIRRLAQQKLPVLWATRGDLTF-AKRHLNRTDTQDRVLYFPTKMG-SRAVMQD 177 Query: 177 DRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAG 236 R+ + + TIL+GNSGD SN H+AAL + Q + V++++PMGYP NN+ YIE+V++ Sbjct: 178 TRENQ-RFTILLGNSGDPSNRHLAALAQLKQSLAEDVRIIIPMGYPSNNQTYIEQVKRQA 236 Query: 237 LELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNREN 296 +ELF + +++L+EKL+F Y LL QCDLGYF F RQQ IGT+CLLIQ +P VL +EN Sbjct: 237 VELFPKHTVEVLTEKLDFTQYQQLLAQCDLGYFYFNRQQAIGTICLLIQQNVPLVLTKEN 296 Query: 297 PFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGW 347 PF DM +++P L+ +D+L VR+ ++QL + DKN I FF+P+Y + W Sbjct: 297 PFCIDMQAENVPFLY-SDELTIAKVRQVKQQLQNCDKNNIGFFAPHYNEQW 346 >UniRef50_B8F5S7 4-alpha-L-fucosyltransferase n=2 Tax=Haemophilus parasuis RepID=B8F5S7_HAEPS Length = 389 Score = 317 bits (811), Expect = 6e-85, Method: Compositional matrix adjust. Identities = 165/354 (46%), Positives = 226/354 (63%), Gaps = 10/354 (2%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPG 60 M +IHVLG++IPHHN T+L FF + L +A F VV + D LS++ L + +P Sbjct: 35 MANIIHVLGANIPHHNHTILNFFQNELLDELPNAFHFYVVSRSD-LSETFTLLDINSYPD 93 Query: 61 KKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGL 120 + L + VI A+ F HGQFN +LWLA+L G + ++ WHIWGADLYELSS L Sbjct: 94 EYLLTQEVIKIARKEPTAWFVLHGQFNTSLWLAILLGLVPANRCVWHIWGADLYELSSSL 153 Query: 121 RYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGEL---LFFPTRMDPSLNTMAND 177 +++LFYPLRRLAQ+++ ++ T GDL+ A K + L L+FPTRM + T Sbjct: 154 KFRLFYPLRRLAQRKIARLWGTLGDLNH-AYQQLKRKSSLDQRLYFPTRMPTNFPTKKCS 212 Query: 178 RQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGL 237 QR TIL+GNSGD SN+H+ L+ + + FG+ V++VVPMGYP N YI +VRQ Sbjct: 213 YQR----TILLGNSGDPSNQHLLGLKQIREIFGENVRIVVPMGYPAGNSKYIAQVRQQAE 268 Query: 238 ELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENP 297 + F + + IL++KLEF++YL LL QCDLGYF F RQQGIGT+CLLIQ IP ++R NP Sbjct: 269 QDFQQGQVNILTQKLEFESYLDLLSQCDLGYFPFERQQGIGTMCLLIQMNIPIAIHRANP 328 Query: 298 FWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRAL 351 F QD+ + + LF + N +I+R + QLA +DK+ I FF +Y Q W L Sbjct: 329 FQQDLQAEGISFLFADEISNSEIIR-VKSQLALLDKSKITFFPLSYKQEWLNCL 381 >UniRef50_A9BJR5 Putative uncharacterized protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BJR5_PETMO Length = 351 Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 80/344 (23%), Positives = 154/344 (44%), Gaps = 22/344 (6%) Query: 2 TVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGK 61 T ++H++ S ++N ++F + + + +++GK++G DS + F K Sbjct: 4 TEILHII-SGTSNYNINFIKF---VYSYFNIDKQRLIILGKNNGFYDS----KILFISKK 55 Query: 62 KSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYEL---SS 118 K + + + KA ++ F H F+P L L L + +W +WG DLY + Sbjct: 56 KEVFKLIKEMRKA---EKIFVHSLFSPHLVLLLFLQPWLLKKSYWVLWGGDLYYYKFRNK 112 Query: 119 GLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGEL--LFFPTRMD-PSLNTMA 175 L+ + +R+ K + A + AK K + + F+P +D L+ + Sbjct: 113 NLKSNFYEFIRKRVIKNFAHIVALVPGDYYLAKNWYKTKAQYHYAFYPNPIDYEYLDKIK 172 Query: 176 NDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQA 235 N ++ ++ I VGNS D N+HI L + + ++++ P+ Y ++ + + V +A Sbjct: 173 NSKKETDRIVIQVGNSADPMNKHIEILNKLSRFKEKNIEIITPLSY--GDQKWAKTVSEA 230 Query: 236 GLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRE 295 G +L+ E+ Q L E L + Y +L D+ F RQQ +G + L+ G + + Sbjct: 231 GKKLYGEK-YQPLLEFLPSEEYSKILNSVDIAIFNHDRQQALGNILALLYLGKKVFIKSD 289 Query: 296 NPFWQDMTEQHLPVL--FTTDDLNEDIVREAQRQLASVDKNTIA 337 W + L V + D+L D + + L ++ IA Sbjct: 290 ITPWDFFKGKGLIVFDTYGLDNLTFDELIYMDKNLKERNREIIA 333 >UniRef50_Q21IU4 Glycosyltransferase-like protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21IU4_SACD2 Length = 355 Score = 91.3 bits (225), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 72/236 (30%), Positives = 112/236 (47%), Gaps = 12/236 (5%) Query: 83 HGQFNPTLWLALLSGGIKPSQFFWHIWGADLY-ELSSGLRYKLFYPLRRLAQKRVGCVFA 141 HG F+ L L L++ S+ +W +WGADLY + + K+ LRR+ R+G V Sbjct: 83 HGLFDNKLILFLVANYTCLSKVYWVMWGADLYVKEEKCFKEKIVSKLRRVICSRLGGVVT 142 Query: 142 -TRGDLSFFAKTHPKVRG---ELLFFPTRM--DPSLNTMANDRQREGK-MTILVGNSGDR 194 RGD + A+ V G + + +P+ + +P + ++ G + ILVG+S D Sbjct: 143 YIRGDYQY-AQQRWGVVGRYCDCIMYPSNIYVEPEEERVQSNSDENGSSINILVGHSADP 201 Query: 195 SNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 SN H + D +++ P+ Y NE Y ++V Q G ELF ++ + L++ L Sbjct: 202 SNNHKCIFDMLADSGVDNMRIYAPLSY--GNEMYRDDVVQYGKELFGDD-FRPLTKFLPL 258 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVL 310 YLALL D+ F RQQ +G LI G + WQ E + VL Sbjct: 259 KDYLALLADIDIAIFDHKRQQAMGNTINLIGLGKTVYMRTNVTQWQLFNELGVAVL 314 >UniRef50_A6QAJ2 Putative uncharacterized protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QAJ2_SULNB Length = 355 Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 72/277 (25%), Positives = 128/277 (46%), Gaps = 16/277 (5%) Query: 76 RQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLR-YKLFYPL---RRL 131 + ++ HG F+ L L + +W +WG DLY ++ +K + L R++ Sbjct: 79 KAEKIILHGLFSDDLINYLYYHQYFLKKCYWVMWGGDLYGHIDPIKIWKNIFRLHRRRKV 138 Query: 132 AQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPS-LNTMANDRQRE-GKMTILVG 189 Q+ G + +GD K H G+ ++ M S L + + +E + I +G Sbjct: 139 VQEMGGLITYIKGDYELVCK-HYGAAGK--YYECFMYTSNLYKEYDIKHKEHSTINIQLG 195 Query: 190 NSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILS 249 NS D +N HI L + + G+ +K+ +P+ Y N+ Y +EV G ELF ++ L+ Sbjct: 196 NSADLTNNHIEVLNELRKYKGENIKIFIPLSY--GNQEYAKEVIAKGKELFGDK-FVALT 252 Query: 250 EKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPV 309 E + FD YL L + D+ F RQQ +G L+ G + + W+ + ++ + Sbjct: 253 EFMPFDKYLEFLGEIDIAIFAHKRQQAMGNTITLLGLGKKVYMRSDITPWKLFKDINVNI 312 Query: 310 LFTTDDLNEDIVREAQRQLASVDKNTIAFFS-PNYLQ 345 F +++ ++ E R + KN +FS NYL Sbjct: 313 -FDIENIELKLIAEKDR--LNNQKNIKEYFSRENYLN 346 >UniRef50_B1XZR9 4-alpha-L-fucosyltransferase n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1XZR9_LEPCP Length = 350 Score = 84.0 bits (206), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 75/257 (29%), Positives = 116/257 (45%), Gaps = 18/257 (7%) Query: 62 KSLAEAVIAKAKANRQQRFFFHG--QFNPTLWLALLSGGIKPSQFFWHIWGADLY---EL 116 K + AV+AK K N + HG F + LAL +K + +W IWGADLY ++ Sbjct: 60 KPIGLAVLAK-KMNLAGKIVLHGLTHFRLLVLLALQPWLLKKT--YWIIWGADLYAYQKI 116 Query: 117 SSGLRYKLFYPLRRLAQKRVG-CVFATRGDLSFFAKTHPKVRGE---LLFFPTRMDPSLN 172 + + +L LRR R+G V GD++ A+ +G+ L + + + L Sbjct: 117 GTSWQSRLKEMLRRFVIPRIGHLVTYVSGDVAL-ARQWYGAKGQHHDCLCYASNVYHHLE 175 Query: 173 TMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEV 232 R + ILVGNS DRSN H A+ +++ P+ Y ++ Y +EV Sbjct: 176 LPT--RNSGSNLQILVGNSADRSNNHDGIFAALLPHLETGLEIHAPLSY--GDQQYADEV 231 Query: 233 RQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVL 292 + G F + L E + + YL LL D+ F RQQG+G + L+ G + Sbjct: 232 TKFGSAKFGSK-FHALREFMPYGDYLKLLSGIDIAVFNHERQQGMGNIISLLGLGKTVYM 290 Query: 293 NRENPFWQDMTEQHLPV 309 + WQ +T L V Sbjct: 291 RKSTTSWQSLTNLGLTV 307 >UniRef50_C3WR04 4-alpha-L-fucosyltransferase n=2 Tax=Fusobacterium RepID=C3WR04_9FUSO Length = 352 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 64/237 (27%), Positives = 108/237 (45%), Gaps = 12/237 (5%) Query: 78 QRFFFHGQFNP--TLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKR 135 ++ +FHG F+P T+++ +K S +W IWG DLY + FY + + Sbjct: 80 EKIYFHGLFDPRVTIFIYFFRFFLKKS--YWIIWGGDLYSYKDRKKKSFFYNIEDYVKGN 137 Query: 136 V-GCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGK--MTILVGNSG 192 + G + +G+ + KV+G F+ + PS + ++EGK + + VGNS Sbjct: 138 MKGYISYIKGEFKL-VQEWFKVKGN--FYSSFTYPSNLYKKIEIRKEGKEGLWVQVGNSA 194 Query: 193 DRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKL 252 D SN H L + + +K+ + Y N E Y V + G ELF ++ IL+ + Sbjct: 195 DPSNNHFEILEKLSKFKDMNIKLFCILSYGGN-EEYKNRVIKRGSELFKDKFCPILNF-M 252 Query: 253 EFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPV 309 +FD Y+ L D+ F RQQ G + L+ L + +Q + E + V Sbjct: 253 KFDEYMNFLSSLDIAIFAHDRQQAFGNITSLLSMKKTVYLKEKVTTYQTLKEMGIKV 309 >UniRef50_C1TR64 4-alpha-L-fucosyltransferase (Fuc4NAc transferase) n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TR64_9BACT Length = 353 Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 70/278 (25%), Positives = 119/278 (42%), Gaps = 31/278 (11%) Query: 74 ANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYEL---SSGLRYKLFYPLRR 130 N+ FFH N G+ F W IWG DLY LR K+ + +++ Sbjct: 64 VNKYDHVFFHSIDNMISICFFARRGV---TFHWIIWGGDLYSSILPPFTLRKKIGFFIKK 120 Query: 131 LAQKRVGCVF-ATRGDLSFFAKTHPKVRGELLF-----------FPTRMDPSLNTMANDR 178 + R V A GD+S K + K +L+F P +D +L + R Sbjct: 121 VGLIRFKHVHTALEGDVSIARKLYNK---KLVFNRFVYPTLDESIPFDIDLALKNRGDGR 177 Query: 179 QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLE 238 + K+ I +GNS D SN H+ RA+ G +++ P+ Y ++ Y V + G E Sbjct: 178 K---KIKIQIGNSADPSNNHLEVFRAIKGHLGSDFEILCPLSY--GDQDYATNVIRVGKE 232 Query: 239 LFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPF 298 ++ ++ + L++ + +D Y+ L D RQQG+G L L + G + + Sbjct: 233 MWG-DSFRPLTDFMSYDNYVRELSSVDCLILNHKRQQGLGNLNLALSLGAKVFVRSDTTT 291 Query: 299 WQDMTEQHLPVLFTTDDLNE----DIVREAQRQLASVD 332 ++D + V T L E D + A+ +++ D Sbjct: 292 YKDYSSMGFKVYDTKKILRECLPSDFIFSAKTAVSNRD 329 >UniRef50_A6VTI0 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MWYL1 RepID=A6VTI0_MARMS Length = 347 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 62/228 (27%), Positives = 105/228 (46%), Gaps = 14/228 (6%) Query: 73 KANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLR---YKLFYPLR 129 K N+ ++ HG F+P L L L + +W +WG DLY G R +KL R Sbjct: 70 KMNQAKKVILHGLFDPVLILILFFMPWLLKKCYWVMWGGDLYVYQLGERNWIWKLREFFR 129 Query: 130 RLAQKRVG-CVFATRGDLSFFAKTHPKVRGELLF---FPTRMDPSLNTMANDRQREGKMT 185 R + +G V T GD+ A+ + +G+ + +P+ + + + + Sbjct: 130 RPVIRNMGYLVNGTTGDVDL-ARKWYRAKGQHISCFNYPSNIYKHYDVKT---KTHDTVN 185 Query: 186 ILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENL 245 I +GNS D +N+HI L + + +K+ V + Y ++ Y ++V G + F ++ + Sbjct: 186 IQLGNSADPTNQHIEILDQLVRFKEQNIKIFVVLSY--GDQDYAKKVITEGKKKFDDKFI 243 Query: 246 QILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLN 293 I +E + F+AYL L D+ F RQQ G L+ G LN Sbjct: 244 AI-TEMMPFEAYLEFLASIDVAVFNHNRQQAFGNTITLLGLGKKVFLN 290 >UniRef50_B7IGW7 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IGW7_THEAB Length = 358 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 55/202 (27%), Positives = 98/202 (48%), Gaps = 10/202 (4%) Query: 105 FWHIWGADLYEL----SSGLRYKLFYPLRRLAQKRV-GCVFATRGDLSFFAKTHPKVRGE 159 +W +WG DLY S +R K+ L+R K++ G + + D F + + Sbjct: 95 YWVVWGGDLYNYWLKDSHSVREKVLEKLKRKVIKKIYGIIALVQEDYLFAKEKYKTKAKY 154 Query: 160 LL-FFPTRMD-PSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVV 217 F+ +D L+T ++ + +E K TIL+GNS +N H L ++ + + K++ Sbjct: 155 YYAFYLNPVDFKMLDTFSDQKNKEEK-TILIGNSAAPTNNHFEILSSLSKYRLNNFKIIC 213 Query: 218 PMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGI 277 P+ Y + E YI++V + G++LF +N L+E L + Y +L D+ F RQQ + Sbjct: 214 PLSYGSSQE-YIKKVCEYGVKLFG-DNFIALTEFLSPEEYAKILANVDVAIFAHRRQQAL 271 Query: 278 GTLCLLIQAGIPCVLNRENPFW 299 G + L+ G + + W Sbjct: 272 GNILALLYLGKKVYIRSDISSW 293 >UniRef50_C3XKD1 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XKD1_9HELI Length = 356 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 71/232 (30%), Positives = 106/232 (45%), Gaps = 14/232 (6%) Query: 90 LWLALLSGGI-KPSQFFWHIWGADLY-ELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLS 147 L + LL+ I KP W +W AD+Y S L KL+ R L + GD + Sbjct: 86 LQMQLLACAIFKPK--VWIVWSADMYLRDCSNLLKKLYN--RFLVSRFAYLATPIEGDFA 141 Query: 148 FFAKTHP-KVRGELLFFPTRMDPSLNTMANDRQREGKMT-ILVGNSGDRSNEHIAALRAV 205 + K + FFP D + Q+E + T I VGNSG +N H+ L + Sbjct: 142 NYQKIWGFGAKNLRFFFPFSQDILKIPLT---QKESQTTWIQVGNSGHFTNRHLEVLEML 198 Query: 206 HQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCD 265 +K+V+P+ Y N + Y + V A E+F EE + IL E L F Y+ LL D Sbjct: 199 KCYKDKDIKIVIPLSYGCNKD-YQQSVESAYREVFGEEKIWILKENLPFVEYVKLLGYID 257 Query: 266 LGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPV-LFTTDDL 316 +G F Q+ + LL G C + +N + +M++ V +F T+DL Sbjct: 258 IGIFHHFVQEAGHNVMLLEAFGKKCYICSQNTLY-NMSKVVFNVKVFRTEDL 308 >UniRef50_C3XKC6 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XKC6_9HELI Length = 352 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 47/195 (24%), Positives = 85/195 (43%), Gaps = 9/195 (4%) Query: 105 FWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFP 164 +W +WG D L + + + VG + + GD + K + +GE+ + Sbjct: 109 YWVLWGGDF-----CLGKESYSRRHNFVLQNVGHLISIAGDYEYVKKEY-NTKGEVFYSK 162 Query: 165 T-RMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPP 223 + + N ++GK+ IL+GNS D N H L A+ ++++ P+ Y Sbjct: 163 SFYVSNVFNGELYLSNKDGKLVILIGNSADPLNLHKDILNALKPYRDSNIELICPLSYGS 222 Query: 224 NNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLL 283 N E Y +E+ + G +F + + L E + +AYL LL D+ F QQ G + L Sbjct: 223 NKE-YQDEIIEYGKNIFGAK-FKPLVEFMPLNAYLDLLSSLDIAIFAHKNQQAYGNIIQL 280 Query: 284 IQAGIPCVLNRENPF 298 + G + + + Sbjct: 281 LGMGKKVYMRKTTAY 295 >UniRef50_C1FUB2 Putative uncharacterized protein n=1 Tax=Clostridium botulinum A2 str. Kyoto RepID=C1FUB2_CLOBJ Length = 491 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 59/257 (22%), Positives = 108/257 (42%), Gaps = 31/257 (12%) Query: 100 KPSQFFWHIWGADLYELSSGLRYKLF----------YPLRRL--------AQKRVGCVFA 141 K ++ W +WG D+YE ++ Y + Y RL A K++ + Sbjct: 216 KEAELNWTVWGGDVYEYTNIEIYDQYTREFLIKNNLYIDERLKNSEYRINAIKKIDYILT 275 Query: 142 -TRGDLSFFAKTHPKVRGELLFFPTRMD------PSLNTMANDRQREGKMTILVGNSGDR 194 GD K + L FP D L + N +++ K L+GNSG Sbjct: 276 PIYGDYKIIKKNYN-TNARLKSFPFVYDIINYKNQCLKSAYNLKKK-YKYVFLLGNSGYP 333 Query: 195 SNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 S+ H+ + + + V+ P+ Y N+ YIE++ + ++ E + L+ +E Sbjct: 334 SSNHLDIIYKLKEIKNKNFCVLCPLSYG--NKNYIEKLIKVSKDILGERFIP-LNNFMEL 390 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTD 314 D Y A+L + D+ RQQ +G + LL+ G L + + + E+ + F + Sbjct: 391 DEYTAILDEVDVAIMNHNRQQAVGNMILLLYLGKKIFLKKSVTTFSFLQEKGFQI-FDIE 449 Query: 315 DLNEDIVREAQRQLASV 331 + ++I + QL S+ Sbjct: 450 NFVDNINSIERIQLNSL 466 >UniRef50_Q8KWB8 RB124 n=1 Tax=Ruegeria sp. PR1b RepID=Q8KWB8_9RHOB Length = 345 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 46/181 (25%), Positives = 82/181 (45%), Gaps = 19/181 (10%) Query: 102 SQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAK-THPKVRGEL 160 + W +WG DL+ L++ F R C+ + G+ + K T P+V G Sbjct: 91 AHVVWCVWGGDLHMLATAPGGVEFL-------NRFSCMISFYGETILYPKLTTPEVLGTC 143 Query: 161 LFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMG 220 + +A + E + I++GNSGD SN+H+ L + +F + + +P Sbjct: 144 Y--------KSDAVAAEEGGEKEKLIVLGNSGDPSNDHLYLLE-LASRFKEH-RFHLPFA 193 Query: 221 YPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTL 280 Y E Y + + Q EL + L + E L + Y +++ + ++ + RQQG+G L Sbjct: 194 YNVTPE-YRQSILQKAEELGMLDRLTLQEEMLPINEYNSIIARAEMVFTAHHRQQGLGLL 252 Query: 281 C 281 C Sbjct: 253 C 253 >UniRef50_Q4ACG5 4-alpha-L-fucosyltransferase (Fragment) n=1 Tax=Edwardsiella tarda RepID=Q4ACG5_EDWTA Length = 66 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 29/66 (43%), Positives = 39/66 (59%), Gaps = 2/66 (3%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALA--ATSEHAREFMVVGKDDGLSDSCPALSVQFF 58 MT LIHVL +DIPHHN T+LRFF+ L+ A + R FMVV +D L L ++ Sbjct: 1 MTTLIHVLSADIPHHNLTLLRFFDGMLSQRAATAPRRRFMVVARDAALVVDLTTLDIEAC 60 Query: 59 PGKKSL 64 ++L Sbjct: 61 VNYRAL 66 >UniRef50_Q0HKL5 Putative uncharacterized protein n=1 Tax=Shewanella sp. MR-4 RepID=Q0HKL5_SHESM Length = 394 Score = 47.4 bits (111), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 31/118 (26%), Positives = 57/118 (48%), Gaps = 8/118 (6%) Query: 186 ILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAY---IEEVRQAGLELFSE 242 IL+GNS +N H+ AL + ++ G + +++P+ Y +E Y I+E +LFS+ Sbjct: 217 ILLGNSATATNNHLEALDII-EKTGSSRTIILPLSY--GDEKYAKLIKEYINNNPQLFSQ 273 Query: 243 ENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ 300 Q+L + Y A++ C RQQ +G + ++ G L E+ ++ Sbjct: 274 --CQVLDNFMPLSEYNAIINSCGFVIMNHVRQQALGNIVAMMYRGSKVFLREESVLYK 329 >UniRef50_D1PK33 Putative uncharacterized protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PK33_9FIRM Length = 171 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 26/114 (22%), Positives = 51/114 (44%), Gaps = 3/114 (2%) Query: 187 LVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQ 246 ++GNS +SN H L + + + + +P+ Y + Y + + E + + Sbjct: 1 MIGNSATKSNRHEEVLNWLSKYSDKEITIYMPLSY--GDSEYRNRIIRISKEKYGISAVP 58 Query: 247 ILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ 300 I+ + + +Y+ L D+G RQQG+G + L+ G + RE W+ Sbjct: 59 IV-QYMNTLSYVKFLSTMDIGIINCNRQQGMGNILFLLALGKKVYIRRETTMWE 111 >UniRef50_C3RR99 Predicted protein n=1 Tax=Mollicutes bacterium D7 RepID=C3RR99_9MOLU Length = 372 Score = 43.1 bits (100), Expect = 0.014, Method: Compositional matrix adjust. Identities = 28/120 (23%), Positives = 53/120 (44%), Gaps = 3/120 (2%) Query: 174 MANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVR 233 + ND + K +L+G+ G N HI L+ + + + + +P+ Y + YI+ V Sbjct: 190 IENDINKRYKKRVLLGHRGTEENNHIEILKRLSKYNSENFDIFIPLSYGE--KKYIQNVE 247 Query: 234 QAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLN 293 E S+ N+ I+ + ++F Y L D+ F +G L +++ LN Sbjct: 248 NYVKEN-SKGNIVIIKQFMKFSEYAEFLSTIDIAIFDGYTSYALGNLGIILFFNKTVYLN 306 >UniRef50_B5JRF7 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JRF7_9BACT Length = 393 Score = 42.4 bits (98), Expect = 0.025, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 6/116 (5%) Query: 183 KMTILVGNSGDRSNEHIAALRAVHQ-QFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFS 241 K IL+GNS SN H+ AL+ + Q F T+K P+ Y + Y + ++ + ELF Sbjct: 219 KHQILLGNSATPSNNHLEALQLLKQLSFKGTIK--CPLSY--GDATYRDALKTSANELFG 274 Query: 242 EENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENP 297 + + + L D Y L+ + + RQQ +G + + G ++ +P Sbjct: 275 -ASFESIESYLPLDDYNRLIAESSVVVMNHYRQQALGNIITALWYGTRVFISDRSP 329 >UniRef50_C9CTI5 Rb124 n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CTI5_9RHOB Length = 353 Score = 41.6 bits (96), Expect = 0.043, Method: Compositional matrix adjust. Identities = 42/183 (22%), Positives = 80/183 (43%), Gaps = 25/183 (13%) Query: 102 SQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAK-THPKVRGEL 160 + W +WG DL+ ++ P C + G+L + + T P++ G Sbjct: 98 AHVVWCMWGGDLHMVAQA-------PAGFDFLNGFSCAISFCGELVRYPQITLPEIPGSC 150 Query: 161 LFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMG 220 ++D S + D+++ I++GNSGD SN+H+ L + +F D + +P Sbjct: 151 ----HKVDASTQSSDLDKEK----LIILGNSGDPSNDHLYMLE-LASRFKDH-RYHIPFA 200 Query: 221 Y---PPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGI 277 Y P I++ R G+ E + L + Y +++ + +L + RQQ + Sbjct: 201 YNGTPDYRARLIDKARDLGV----WEKTTLQEGMLPLEEYNSIIARAELYFAAHNRQQAL 256 Query: 278 GTL 280 G+L Sbjct: 257 GSL 259 >UniRef50_B3CFK0 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CFK0_9BACE Length = 382 Score = 40.8 bits (94), Expect = 0.080, Method: Compositional matrix adjust. Identities = 27/117 (23%), Positives = 55/117 (47%), Gaps = 3/117 (2%) Query: 185 TILVGNSGDRSNEHIAALRAVHQ-QFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEE 243 TI+VGNS SN H+ L + + D ++ + + Y + + Y+ EV A F ++ Sbjct: 213 TIMVGNSASYSNNHLYVLNFLKRMDLKDELRFTLVLSYGGSKQ-YVSEVENAYKSSFPQK 271 Query: 244 NLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ 300 +++L+ L Y + + RQ+ IGT+ + G+ ++ +P ++ Sbjct: 272 -VEVLTSYLPLQVYNQIFLKVRSMIMSAWRQESIGTIIMGFYLGVKVFMSERSPLYK 327 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B5FN86 4-alpha-L-fucosyltransferase n=47 Tax=Enterobact... 503 e-141 UniRef50_A4TRB6 4-alpha-L-fucosyltransferase n=126 Tax=Enterobac... 475 e-132 UniRef50_B0BRE0 TDP-Fuc4NAc:lipid II Fuc4NAc transferase n=6 Tax... 395 e-108 UniRef50_B8F5S7 4-alpha-L-fucosyltransferase n=2 Tax=Haemophilus... 387 e-106 UniRef50_A9BJR5 Putative uncharacterized protein n=1 Tax=Petroto... 304 4e-81 UniRef50_A6QAJ2 Putative uncharacterized protein n=1 Tax=Sulfuro... 270 6e-71 UniRef50_B1XZR9 4-alpha-L-fucosyltransferase n=1 Tax=Leptothrix ... 262 2e-68 UniRef50_Q21IU4 Glycosyltransferase-like protein n=1 Tax=Sacchar... 258 3e-67 UniRef50_A6VTI0 Putative uncharacterized protein n=1 Tax=Marinom... 248 2e-64 UniRef50_C3WR04 4-alpha-L-fucosyltransferase n=2 Tax=Fusobacteri... 239 9e-62 UniRef50_C1TR64 4-alpha-L-fucosyltransferase (Fuc4NAc transferas... 235 1e-60 UniRef50_C3XKC6 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 233 1e-59 UniRef50_B7IGW7 Putative uncharacterized protein n=1 Tax=Thermos... 218 2e-55 UniRef50_C3XKD1 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 200 7e-50 UniRef50_C1FUB2 Putative uncharacterized protein n=1 Tax=Clostri... 194 4e-48 UniRef50_Q8KWB8 RB124 n=1 Tax=Ruegeria sp. PR1b RepID=Q8KWB8_9RHOB 168 3e-40 UniRef50_Q0HKL5 Putative uncharacterized protein n=1 Tax=Shewane... 127 7e-28 UniRef50_Q4ACG5 4-alpha-L-fucosyltransferase (Fragment) n=1 Tax=... 84 1e-14 Sequences not found previously or not previously below threshold: UniRef50_C9CTI5 Rb124 n=1 Tax=Silicibacter sp. TrichCH4B RepID=C... 130 6e-29 UniRef50_C3XFT2 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 124 6e-27 UniRef50_D1PK33 Putative uncharacterized protein n=1 Tax=Subdoli... 119 2e-25 UniRef50_B5JRF7 Putative uncharacterized protein n=1 Tax=Verruco... 110 8e-23 UniRef50_B3CFK0 Putative uncharacterized protein n=1 Tax=Bactero... 98 5e-19 UniRef50_C3RR99 Predicted protein n=1 Tax=Mollicutes bacterium D... 95 3e-18 UniRef50_D1JV60 Putative uncharacterized protein n=1 Tax=Bactero... 66 2e-09 UniRef50_A2G6B1 Glycosyl transferase, group 1 family protein n=1... 57 8e-07 UniRef50_D1PK32 Putative uncharacterized protein n=1 Tax=Subdoli... 45 0.005 UniRef50_A4QXH2 Beta-1,4-mannosyltransferase, putative n=8 Tax=S... 44 0.011 UniRef50_C9BE40 Capsular polysaccharide biosynthesis protein n=1... 42 0.034 UniRef50_B9JE97 Glycosyltransferase protein n=1 Tax=Agrobacteriu... 42 0.048 UniRef50_B1CBD5 Putative uncharacterized protein n=1 Tax=Anaerof... 40 0.093 >UniRef50_B5FN86 4-alpha-L-fucosyltransferase n=47 Tax=Enterobacteriaceae RepID=WECF_SALDC Length = 359 Score = 503 bits (1296), Expect = e-141, Method: Composition-based stats. Identities = 297/357 (83%), Positives = 329/357 (92%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPG 60 MTVLIHVLGSDIPHHN TVLRFFND LAATSEHAREFMV G+D+G ++SCPALS++F+ Sbjct: 1 MTVLIHVLGSDIPHHNHTVLRFFNDTLAATSEHAREFMVAGEDNGFTESCPALSLRFYGS 60 Query: 61 KKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGL 120 KK+LA+AVIAKAKANR+QRFFFHGQFN +LWLALLSGGIKP+QF+WHIWGADLYE+S+GL Sbjct: 61 KKALAQAVIAKAKANRRQRFFFHGQFNTSLWLALLSGGIKPAQFYWHIWGADLYEVSNGL 120 Query: 121 RYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQR 180 +++LFYPLRR+AQ RVGCVFATRGDLS+FA+ HP VRGELL+FPTRMDPSLN MA + QR Sbjct: 121 KFRLFYPLRRIAQGRVGCVFATRGDLSYFARQHPNVRGELLYFPTRMDPSLNAMAKECQR 180 Query: 181 EGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELF 240 GK+TILVGNSGDRSN+HIAALRAV+QQFGDTV VVVPMGYP NN+AYI+EVRQAGL LF Sbjct: 181 AGKLTILVGNSGDRSNQHIAALRAVYQQFGDTVNVVVPMGYPANNQAYIDEVRQAGLALF 240 Query: 241 SEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ 300 S ENLQILSEK+EFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQA IPCVLNR+NPFWQ Sbjct: 241 SAENLQILSEKMEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQADIPCVLNRDNPFWQ 300 Query: 301 DMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAIAARE 357 DM EQHLPVLFTTDDLNE +VREAQRQLASVDK+ I FFSPNYLQ W AL IAA E Sbjct: 301 DMAEQHLPVLFTTDDLNEQVVREAQRQLASVDKSGITFFSPNYLQPWHNALRIAAGE 357 >UniRef50_A4TRB6 4-alpha-L-fucosyltransferase n=126 Tax=Enterobacteriaceae RepID=WECF_YERPP Length = 361 Score = 475 bits (1222), Expect = e-132, Method: Composition-based stats. Identities = 228/359 (63%), Positives = 280/359 (77%), Gaps = 2/359 (0%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAAT--SEHAREFMVVGKDDGLSDSCPALSVQFF 58 M L HVLGSDIPHHN TVLRFFND LA E R FMV K+ S P L + + Sbjct: 1 MITLTHVLGSDIPHHNLTVLRFFNDVLAKCLPVEQVRHFMVAAKETAPFSSFPQLDINTY 60 Query: 59 PGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSS 118 KK+LAEAVIA+A+A+R RFF+HGQFN TLWLALLSG IKP Q +WH+WGADLYE + Sbjct: 61 SDKKALAEAVIARAQADRSARFFWHGQFNATLWLALLSGKIKPGQVYWHVWGADLYEDAK 120 Query: 119 GLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDR 178 L+++LFY LRR+AQ R G VFATRGDL + + HP+V LL+FPTRMDP+L + D+ Sbjct: 121 SLKFRLFYLLRRIAQGRGGHVFATRGDLIHYQQRHPRVPASLLYFPTRMDPALTAINIDK 180 Query: 179 QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLE 238 G MTILVGNSGD +N HI AL+A+HQQFG V+V++PMGYP NNEAYIE+VRQAGL Sbjct: 181 PLAGPMTILVGNSGDTTNRHIEALKAIHQQFGPDVRVIIPMGYPANNEAYIEQVRQAGLA 240 Query: 239 LFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPF 298 LFS++NL+IL+E++ FD YL +LR+CDLGYFIF RQQGIGTLCLL Q G+P VL+R+NPF Sbjct: 241 LFSQDNLRILTEQIPFDDYLNILRECDLGYFIFNRQQGIGTLCLLTQFGVPFVLSRKNPF 300 Query: 299 WQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAIAARE 357 WQD+ EQH+PV F D L+E ++REAQRQLA +DK IAFF+PNY++GW++ALA+AA E Sbjct: 301 WQDLAEQHIPVFFYGDTLDEPMIREAQRQLAGLDKQAIAFFNPNYIEGWKQALALAAGE 359 >UniRef50_B0BRE0 TDP-Fuc4NAc:lipid II Fuc4NAc transferase n=6 Tax=Pasteurellaceae RepID=B0BRE0_ACTPJ Length = 356 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. Identities = 161/354 (45%), Positives = 233/354 (65%), Gaps = 7/354 (1%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDAL-AATSEHAREFMVVGKDDGLSDSCPALSVQFFP 59 M + H+LGSDIPHHNRTVL FF D L +E F VVG+ L+ P L++Q F Sbjct: 1 MRPIYHILGSDIPHHNRTVLNFFRDQLLPKLTEQQHYFYVVGQQTLLTQY-PELNLQVFC 59 Query: 60 GKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSG 119 ++++ AV+ AK + +F HGQ+N LW+A+L G + + WHIWGADLYE +SG Sbjct: 60 SRQAITRAVVQTAKQVKTAKFVLHGQYNVWLWIAVLFGYLPACRCIWHIWGADLYEEASG 119 Query: 120 LRYKLFYPLRRLAQKRVGCVFATRGDLSFFAK--THPKVRGELLFFPTRMDPSLNTMAND 177 ++KLFY +RRLAQ+++ ++ATRGDL+F + + +L+FPT+M + D Sbjct: 120 WKFKLFYFIRRLAQQKLPVLWATRGDLTFAKRHLNRTDTQDRVLYFPTKMG--SRAVMQD 177 Query: 178 RQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGL 237 + + TIL+GNSGD SN H+AAL + Q + V++++PMGYP NN+ YIE+V++ + Sbjct: 178 TRENQRFTILLGNSGDPSNRHLAALAQLKQSLAEDVRIIIPMGYPSNNQTYIEQVKRQAV 237 Query: 238 ELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENP 297 ELF + +++L+EKL+F Y LL QCDLGYF F RQQ IGT+CLLIQ +P VL +ENP Sbjct: 238 ELFPKHTVEVLTEKLDFTQYQQLLAQCDLGYFYFNRQQAIGTICLLIQQNVPLVLTKENP 297 Query: 298 FWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRAL 351 F DM +++P L+ +D+L VR+ ++QL + DKN I FF+P+Y + W L Sbjct: 298 FCIDMQAENVPFLY-SDELTIAKVRQVKQQLQNCDKNNIGFFAPHYNEQWLTLL 350 >UniRef50_B8F5S7 4-alpha-L-fucosyltransferase n=2 Tax=Haemophilus parasuis RepID=B8F5S7_HAEPS Length = 389 Score = 387 bits (995), Expect = e-106, Method: Composition-based stats. Identities = 162/353 (45%), Positives = 224/353 (63%), Gaps = 8/353 (2%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPG 60 M +IHVLG++IPHHN T+L FF + L +A F VV + D LS++ L + +P Sbjct: 35 MANIIHVLGANIPHHNHTILNFFQNELLDELPNAFHFYVVSRSD-LSETFTLLDINSYPD 93 Query: 61 KKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGL 120 + L + VI A+ F HGQFN +LWLA+L G + ++ WHIWGADLYELSS L Sbjct: 94 EYLLTQEVIKIARKEPTAWFVLHGQFNTSLWLAILLGLVPANRCVWHIWGADLYELSSSL 153 Query: 121 RYKLFYPLRRLAQKRVGCVFATRGDLSFFAKT--HPKVRGELLFFPTRMDPSLNTMANDR 178 +++LFYPLRRLAQ+++ ++ T GDL+ + + L+FPTRM + T Sbjct: 154 KFRLFYPLRRLAQRKIARLWGTLGDLNHAYQQLKRKSSLDQRLYFPTRMPTNFPTKKCSY 213 Query: 179 QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLE 238 QR TIL+GNSGD SN+H+ L+ + + FG+ V++VVPMGYP N YI +VRQ + Sbjct: 214 QR----TILLGNSGDPSNQHLLGLKQIREIFGENVRIVVPMGYPAGNSKYIAQVRQQAEQ 269 Query: 239 LFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPF 298 F + + IL++KLEF++YL LL QCDLGYF F RQQGIGT+CLLIQ IP ++R NPF Sbjct: 270 DFQQGQVNILTQKLEFESYLDLLSQCDLGYFPFERQQGIGTMCLLIQMNIPIAIHRANPF 329 Query: 299 WQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRAL 351 QD+ + + LF + N +I+R + QLA +DK+ I FF +Y Q W L Sbjct: 330 QQDLQAEGISFLFADEISNSEIIR-VKSQLALLDKSKITFFPLSYKQEWLNCL 381 >UniRef50_A9BJR5 Putative uncharacterized protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BJR5_PETMO Length = 351 Score = 304 bits (778), Expect = 4e-81, Method: Composition-based stats. Identities = 82/358 (22%), Positives = 158/358 (44%), Gaps = 24/358 (6%) Query: 2 TVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGK 61 T ++H++ S ++N ++F + + + +++GK++G DS + F K Sbjct: 4 TEILHII-SGTSNYNINFIKF---VYSYFNIDKQRLIILGKNNGFYDS----KILFISKK 55 Query: 62 KSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYEL---SS 118 K + + + KA ++ F H F+P L L L + +W +WG DLY + Sbjct: 56 KEVFKLIKEMRKA---EKIFVHSLFSPHLVLLLFLQPWLLKKSYWVLWGGDLYYYKFRNK 112 Query: 119 GLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGEL--LFFPTRMD-PSLNTMA 175 L+ + +R+ K + A + AK K + + F+P +D L+ + Sbjct: 113 NLKSNFYEFIRKRVIKNFAHIVALVPGDYYLAKNWYKTKAQYHYAFYPNPIDYEYLDKIK 172 Query: 176 NDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQA 235 N ++ ++ I VGNS D N+HI L + + ++++ P+ Y ++ + + V +A Sbjct: 173 NSKKETDRIVIQVGNSADPMNKHIEILNKLSRFKEKNIEIITPLSY--GDQKWAKTVSEA 230 Query: 236 GLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRE 295 G +L+ E+ Q L E L + Y +L D+ F RQQ +G + L+ G + + Sbjct: 231 GKKLYGEK-YQPLLEFLPSEEYSKILNSVDIAIFNHDRQQALGNILALLYLGKKVFIKSD 289 Query: 296 NPFWQDMTEQHLPVL--FTTDDLNEDIVREAQRQLASVDKNTIA--FFSPNYLQGWQR 349 W + L V + D+L D + + L ++ IA F + W+ Sbjct: 290 ITPWDFFKGKGLIVFDTYGLDNLTFDELIYMDKNLKERNREIIAKEFSEEKCAELWRN 347 >UniRef50_A6QAJ2 Putative uncharacterized protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QAJ2_SULNB Length = 355 Score = 270 bits (690), Expect = 6e-71, Method: Composition-based stats. Identities = 81/359 (22%), Positives = 159/359 (44%), Gaps = 27/359 (7%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPA-LSVQFFPGKK 62 ++H++ H+++ ++ F + E+ F+ + D+ + P +V + Sbjct: 6 IVHLI-----HNDKFIVPFMDFIAKHFDENEHLFVYLFDDNVVKYPIPESRNVLNLCNRY 60 Query: 63 SLAEAVIAKAK-----ANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELS 117 + + +K + ++ HG F+ L L + +W +WG DLY Sbjct: 61 LGRKNIFGLSKALNPLMEKAEKIILHGLFSDDLINYLYYHQYFLKKCYWVMWGGDLYGHI 120 Query: 118 SGLR-YKLFYPL---RRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPS-LN 172 ++ +K + L R++ Q+ G + +GD K H G+ ++ M S L Sbjct: 121 DPIKIWKNIFRLHRRRKVVQEMGGLITYIKGDYELVCK-HYGAAGK--YYECFMYTSNLY 177 Query: 173 TMANDRQRE-GKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEE 231 + + +E + I +GNS D +N HI L + + G+ +K+ +P+ Y N+ Y +E Sbjct: 178 KEYDIKHKEHSTINIQLGNSADLTNNHIEVLNELRKYKGENIKIFIPLSY--GNQEYAKE 235 Query: 232 VRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCV 291 V G ELF ++ + L+E + FD YL L + D+ F RQQ +G L+ G Sbjct: 236 VIAKGKELFGDKFV-ALTEFMPFDKYLEFLGEIDIAIFAHKRQQAMGNTITLLGLGKKVY 294 Query: 292 LNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFS-PNYLQGWQR 349 + + W+ + ++ + F +++ ++ E R + KN +FS NYL + Sbjct: 295 MRSDITPWKLFKDINVNI-FDIENIELKLIAEKDR--LNNQKNIKEYFSRENYLNQLRN 350 >UniRef50_B1XZR9 4-alpha-L-fucosyltransferase n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1XZR9_LEPCP Length = 350 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 84/341 (24%), Positives = 134/341 (39%), Gaps = 23/341 (6%) Query: 22 FFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGK--KSLAEAVIAKAKANRQQR 79 F + F + + D L + K + AV+AK K N + Sbjct: 19 FIELTRTRLAADRHHFFFIHRGD-LYQPGEKSRIHRLSDYAAKPIGLAVLAK-KMNLAGK 76 Query: 80 FFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELS---SGLRYKLFYPLRRLAQKRV 136 HG + L + L + +W IWGADLY + + +L LRR R+ Sbjct: 77 IVLHGLTHFRLLVLLALQPWLLKKTYWIIWGADLYAYQKIGTSWQSRLKEMLRRFVIPRI 136 Query: 137 GC-VFATRGDLSFFAKTHPKVRGE---LLFFPTRMDPSLNTMANDRQREGKMTILVGNSG 192 G V GD++ A+ +G+ L + + + L R + ILVGNS Sbjct: 137 GHLVTYVSGDVAL-ARQWYGAKGQHHDCLCYASNVYHHLELPT--RNSGSNLQILVGNSA 193 Query: 193 DRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKL 252 DRSN H A+ +++ P+ Y ++ Y +EV + G F + L E + Sbjct: 194 DRSNNHDGIFAALLPHLETGLEIHAPLSY--GDQQYADEVTKFGSAKFGSK-FHALREFM 250 Query: 253 EFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFT 312 + YL LL D+ F RQQG+G + L+ G + + WQ +T L V Sbjct: 251 PYGDYLKLLSGIDIAVFNHERQQGMGNIISLLGLGKTVYMRKSTTSWQSLTNLGLTV--- 307 Query: 313 TDDLNEDIVREAQRQLASVDKNTI-AFFSPNYL-QGWQRAL 351 ++ + +LA ++ I FS L W+ L Sbjct: 308 -GNIEDFKPAPLSSELAERNRQIIREQFSETKLVNQWRSIL 347 >UniRef50_Q21IU4 Glycosyltransferase-like protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21IU4_SACD2 Length = 355 Score = 258 bits (659), Expect = 3e-67, Method: Composition-based stats. Identities = 80/330 (24%), Positives = 135/330 (40%), Gaps = 14/330 (4%) Query: 25 DALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFFFHG 84 + L S F++ + L V + + + + HG Sbjct: 25 NELNCNSFDHEYFIIGYHEKYLLPEFGDFKVMPRAYTEKIGYLCRLFRRMMFADKVIVHG 84 Query: 85 QFNPTLWLALLSGGIKPSQFFWHIWGADLY-ELSSGLRYKLFYPLRRLAQKRVG-CVFAT 142 F+ L L L++ S+ +W +WGADLY + + K+ LRR+ R+G V Sbjct: 85 LFDNKLILFLVANYTCLSKVYWVMWGADLYVKEEKCFKEKIVSKLRRVICSRLGGVVTYI 144 Query: 143 RGDLSFFAKTHP--KVRGELLFFPTRMD--PSLNTMANDRQREGK-MTILVGNSGDRSNE 197 RGD + + + + +P+ + P + ++ G + ILVG+S D SN Sbjct: 145 RGDYQYAQQRWGVVGRYCDCIMYPSNIYVEPEEERVQSNSDENGSSINILVGHSADPSNN 204 Query: 198 HIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAY 257 H + D +++ P+ Y NE Y ++V Q G ELF ++ + L++ L Y Sbjct: 205 HKCIFDMLADSGVDNMRIYAPLSY--GNEMYRDDVVQYGKELFGDD-FRPLTKFLPLKDY 261 Query: 258 LALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLN 317 LALL D+ F RQQ +G LI G + WQ E + VL + Sbjct: 262 LALLADIDIAIFDHKRQQAMGNTINLIGLGKTVYMRTNVTQWQLFNELGVAVL-DLERFE 320 Query: 318 EDIVREAQRQLASVDKNTIAFFS-PNYLQG 346 ++ + QR+ + + +FS NY Sbjct: 321 GQLLTQEQRRKNN--QIIKKYFSVENYKNQ 348 >UniRef50_A6VTI0 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MWYL1 RepID=A6VTI0_MARMS Length = 347 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. Identities = 70/328 (21%), Positives = 132/328 (40%), Gaps = 20/328 (6%) Query: 12 IPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEA---V 68 I + + + F F++ + G + + FF + ++ + Sbjct: 8 ISNWSVFIPPFIEFVKEHFDYKRHVFLITDGNTG--HKLNSDANVFFSKRTLVSRLMHYI 65 Query: 69 IAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELS---SGLRYKLF 125 K N+ ++ HG F+P L L L + +W +WG DLY +KL Sbjct: 66 RVVIKMNQAKKVILHGLFDPVLILILFFMPWLLKKCYWVMWGGDLYVYQLGERNWIWKLR 125 Query: 126 YPLRRLAQKRVGC-VFATRGDLSFFAKTHPKVRGE---LLFFPTRMDPSLNTMANDRQRE 181 RR + +G V T GD+ A+ + +G+ +P+ + + + Sbjct: 126 EFFRRPVIRNMGYLVNGTTGDVDL-ARKWYRAKGQHISCFNYPSNIYKHYDVKT---KTH 181 Query: 182 GKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFS 241 + I +GNS D +N+HI L + + +K+ V + Y ++ Y ++V G + F Sbjct: 182 DTVNIQLGNSADPTNQHIEILDQLVRFKEQNIKIFVVLSY--GDQDYAKKVITEGKKKFD 239 Query: 242 EENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQD 301 ++ + I +E + F+AYL L D+ F RQQ G L+ G LN + Sbjct: 240 DKFIAI-TEMMPFEAYLEFLASIDVAVFNHNRQQAFGNTITLLGLGKKVFLNPASTLNGV 298 Query: 302 MTEQHLPVLFTTDDLNEDIVREAQRQLA 329 +E + + F + + + E +Q Sbjct: 299 FSEFGIQI-FNSKKIELTSLDEVTKQAN 325 >UniRef50_C3WR04 4-alpha-L-fucosyltransferase n=2 Tax=Fusobacterium RepID=C3WR04_9FUSO Length = 352 Score = 239 bits (611), Expect = 9e-62, Method: Composition-based stats. Identities = 65/279 (23%), Positives = 117/279 (41%), Gaps = 13/279 (4%) Query: 76 RQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKR 135 + ++ +FHG F+P + + + + +W IWG DLY + FY + + Sbjct: 78 KCEKIYFHGLFDPRVTIFIYFFRFFLKKSYWIIWGGDLYSYKDRKKKSFFYNIEDYVKGN 137 Query: 136 V-GCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGK--MTILVGNSG 192 + G + +G+ + KV+G F+ + PS + ++EGK + + VGNS Sbjct: 138 MKGYISYIKGEFKLV-QEWFKVKGN--FYSSFTYPSNLYKKIEIRKEGKEGLWVQVGNSA 194 Query: 193 DRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKL 252 D SN H L + + +K+ + Y NE Y V + G ELF ++ IL + Sbjct: 195 DPSNNHFEILEKLSKFKDMNIKLFCILSYG-GNEEYKNRVIKRGSELFKDKFCPIL-NFM 252 Query: 253 EFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFT 312 +FD Y+ L D+ F RQQ G + L+ L + +Q + E + V Sbjct: 253 KFDEYMNFLSSLDIAIFAHDRQQAFGNITSLLSMKKTVYLKEKVTTYQTLKEMGIKVRSF 312 Query: 313 TDDLNEDIVREAQRQLASVDKNTIA--FFSPNYLQGWQR 349 ++ + E ++ I F ++ W+ Sbjct: 313 DKLVDLE---EFDENTLENNRKIIEENFSEEKLIEQWEN 348 >UniRef50_C1TR64 4-alpha-L-fucosyltransferase (Fuc4NAc transferase) n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TR64_9BACT Length = 353 Score = 235 bits (600), Expect = 1e-60, Method: Composition-based stats. Identities = 71/292 (24%), Positives = 124/292 (42%), Gaps = 25/292 (8%) Query: 74 ANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYEL---SSGLRYKLFYPLRR 130 N+ FFH N G+ F W IWG DLY LR K+ + +++ Sbjct: 64 VNKYDHVFFHSIDNMISICFFARRGV---TFHWIIWGGDLYSSILPPFTLRKKIGFFIKK 120 Query: 131 LAQKRVGCV-FATRGDLSFFAKTHPK--VRGELLF------FPTRMDPSLNTMANDRQRE 181 + R V A GD+S K + K V ++ P +D +L + R+ Sbjct: 121 VGLIRFKHVHTALEGDVSIARKLYNKKLVFNRFVYPTLDESIPFDIDLALKNRGDGRK-- 178 Query: 182 GKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFS 241 K+ I +GNS D SN H+ RA+ G +++ P+ Y ++ Y V + G E++ Sbjct: 179 -KIKIQIGNSADPSNNHLEVFRAIKGHLGSDFEILCPLSY--GDQDYATNVIRVGKEMWG 235 Query: 242 EENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQD 301 ++ + L++ + +D Y+ L D RQQG+G L L + G + + ++D Sbjct: 236 -DSFRPLTDFMSYDNYVRELSSVDCLILNHKRQQGLGNLNLALSLGAKVFVRSDTTTYKD 294 Query: 302 MTEQHLPVLFTTDDLNE----DIVREAQRQLASVDKNTIAFFSPNYLQGWQR 349 + V T L E D + A+ +++ D F + + W++ Sbjct: 295 YSSMGFKVYDTKKILRECLPSDFIFSAKTAVSNRDCVFKNFSTQKSRELWEK 346 >UniRef50_C3XKC6 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XKC6_9HELI Length = 352 Score = 233 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 66/350 (18%), Positives = 126/350 (36%), Gaps = 18/350 (5%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKS 63 ++H+L S H+ + F +F+ V D V+ K Sbjct: 16 ILHILSSAT--HSVRFVEFMQK---YFDLKKHKFVYVRPDICKYGLSNFKEVEHISTLKQ 70 Query: 64 LAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYK 123 + + KA+ + HG + + L + +W +WG D L + Sbjct: 71 QLKLIYLMQKAD---KIILHGLWRHEVINLLYFQKWLLKKCYWVLWGGDF-----CLGKE 122 Query: 124 LFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGK 183 + + VG + + GD + K + + N ++GK Sbjct: 123 SYSRRHNFVLQNVGHLISIAGDYEYVKKEYNTKGEVFYSKSFYVSNVFNGELYLSNKDGK 182 Query: 184 MTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEE 243 + IL+GNS D N H L A+ ++++ P+ Y N E Y +E+ + G +F + Sbjct: 183 LVILIGNSADPLNLHKDILNALKPYRDSNIELICPLSYGSNKE-YQDEIIEYGKNIFGAK 241 Query: 244 NLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMT 303 + L E + +AYL LL D+ F QQ G + L+ G + + + ++ Sbjct: 242 -FKPLVEFMPLNAYLDLLSSLDIAIFAHKNQQAYGNIIQLLGMGKKVYMRK-TTAYNEVL 299 Query: 304 EQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAI 353 + L + D D+ R + + + N L ++ + Sbjct: 300 KNGLKIFDF--DSGVDLCRIDDSAIKNHHLTKNIYSLENMLSEFKELFGV 347 >UniRef50_B7IGW7 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IGW7_THEAB Length = 358 Score = 218 bits (555), Expect = 2e-55, Method: Composition-based stats. Identities = 77/357 (21%), Positives = 144/357 (40%), Gaps = 24/357 (6%) Query: 5 IHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSL 64 +H+L + ++ + F N S F+ K D L Sbjct: 3 LHILNDN--KYSDKFIEFINQ---NFSISDHHFVTFSKRPKYLDRGKVE----IVDIYKL 53 Query: 65 AEAVIAKAKANRQQRFFFHGQFNP-TLWLALLSGGIKPSQFFWHIWGADLYEL----SSG 119 ++ K + + F H F + L + +W +WG DLY S Sbjct: 54 SQIRWLYKKISNADKIFLHSLFRGGKSLMLFLFNRKNLYKTYWVVWGGDLYNYWLKDSHS 113 Query: 120 LRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELL--FFPTRMD-PSLNTMAN 176 +R K+ L+R K++ + A + FAK K + + F+ +D L+T ++ Sbjct: 114 VREKVLEKLKRKVIKKIYGIIALVQEDYLFAKEKYKTKAKYYYAFYLNPVDFKMLDTFSD 173 Query: 177 DRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAG 236 + +E K TIL+GNS +N H L ++ + + K++ P+ Y + + YI++V + G Sbjct: 174 QKNKEEK-TILIGNSAAPTNNHFEILSSLSKYRLNNFKIICPLSYGSS-QEYIKKVCEYG 231 Query: 237 LELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNREN 296 ++LF +N L+E L + Y +L D+ F RQQ +G + L+ G + + Sbjct: 232 VKLFG-DNFIALTEFLSPEEYAKILANVDVAIFAHRRQQALGNILALLYLGKKVYIRSDI 290 Query: 297 PFWQDMTEQHLPVLFTTDDLN----EDIVREAQRQLASVDKNTIAFFSPNYLQGWQR 349 W + V T + L+ + E + + + + F +Q W+ Sbjct: 291 SSWAFFNRFGIKVFDTKNILDGSEKDIFTFEEKIAIKNREIVLNEFSEERCVQLWKN 347 >UniRef50_C3XKD1 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XKD1_9HELI Length = 356 Score = 200 bits (508), Expect = 7e-50, Method: Composition-based stats. Identities = 75/299 (25%), Positives = 124/299 (41%), Gaps = 18/299 (6%) Query: 27 LAATSEHAREFMVV----GKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFFF 82 L + F V+ G+ + + +V + G K + + A+ + + Sbjct: 21 LNTYFADKKTFYVLVDKKGRKEFPKELLAYDNVMIYQGIKDFWKLLKI---ASGARVVVY 77 Query: 83 HGQF-NPTLWLALLSGGIKPSQFFWHIWGADLY-ELSSGLRYKLFYPLRRLAQKRVGCVF 140 + F L + LL+ I + W +W AD+Y S L KL+ R L + Sbjct: 78 NALFYRFFLQMQLLACAIFKPK-VWIVWSADMYLRDCSNLLKKLYN--RFLVSRFAYLAT 134 Query: 141 ATRGDLSFFAKTH-PKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHI 199 GD + + K + FFP D + ++ I VGNSG +N H+ Sbjct: 135 PIEGDFANYQKIWGFGAKNLRFFFPFSQDILKIPLT--QKESQTTWIQVGNSGHFTNRHL 192 Query: 200 AALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLA 259 L + +K+V+P+ Y N+ Y + V A E+F EE + IL E L F Y+ Sbjct: 193 EVLEMLKCYKDKDIKIVIPLSYGC-NKDYQQSVESAYREVFGEEKIWILKENLPFVEYVK 251 Query: 260 LLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPV-LFTTDDLN 317 LL D+G F Q+ + LL G C + +N + +M++ V +F T+DL Sbjct: 252 LLGYIDIGIFHHFVQEAGHNVMLLEAFGKKCYICSQNTLY-NMSKVVFNVKVFRTEDLE 309 >UniRef50_C1FUB2 Putative uncharacterized protein n=1 Tax=Clostridium botulinum A2 str. Kyoto RepID=C1FUB2_CLOBJ Length = 491 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 57/282 (20%), Positives = 111/282 (39%), Gaps = 29/282 (10%) Query: 77 QQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLF----------Y 126 + + + ++ K ++ W +WG D+YE ++ Y + Y Sbjct: 193 SKAIYIYCLYDYICEFICKYKIYKEAELNWTVWGGDVYEYTNIEIYDQYTREFLIKNNLY 252 Query: 127 PLRRL--------AQKRVGCV-FATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMAND 177 RL A K++ + GD K + L FP D Sbjct: 253 IDERLKNSEYRINAIKKIDYILTPIYGDYKIIKKNYN-TNARLKSFPFVYDIINYKNQCL 311 Query: 178 R-----QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEV 232 + +++ K L+GNSG S+ H+ + + + V+ P+ Y N+ YIE++ Sbjct: 312 KSAYNLKKKYKYVFLLGNSGYPSSNHLDIIYKLKEIKNKNFCVLCPLSY--GNKNYIEKL 369 Query: 233 RQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVL 292 + ++ E + L+ +E D Y A+L + D+ RQQ +G + LL+ G L Sbjct: 370 IKVSKDILGERFI-PLNNFMELDEYTAILDEVDVAIMNHNRQQAVGNMILLLYLGKKIFL 428 Query: 293 NRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKN 334 + + + E+ + F ++ ++I + QL S+ Sbjct: 429 KKSVTTFSFLQEKGFQI-FDIENFVDNINSIERIQLNSLKNQ 469 >UniRef50_Q8KWB8 RB124 n=1 Tax=Ruegeria sp. PR1b RepID=Q8KWB8_9RHOB Length = 345 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 49/219 (22%), Positives = 88/219 (40%), Gaps = 19/219 (8%) Query: 81 FFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVF 140 H F T + + + W +WG DL+ L++ F R C+ Sbjct: 70 IIHSLFLQTSFDIATQLLQQRAHVVWCVWGGDLHMLATAPGGVEF-------LNRFSCMI 122 Query: 141 ATRGDLSFFAK-THPKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHI 199 + G+ + K T P+V G + +A + E + I++GNSGD SN+H+ Sbjct: 123 SFYGETILYPKLTTPEVLGTCY--------KSDAVAAEEGGEKEKLIVLGNSGDPSNDHL 174 Query: 200 AALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLA 259 L + +F + + +P Y E Y + + Q EL + L + E L + Y + Sbjct: 175 YLLE-LASRFKEH-RFHLPFAYNVTPE-YRQSILQKAEELGMLDRLTLQEEMLPINEYNS 231 Query: 260 LLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPF 298 ++ + ++ + RQQG+G LC + Sbjct: 232 IIARAEMVFTAHHRQQGLGLLCSAYLNNCRVFMRHVITT 270 >UniRef50_C9CTI5 Rb124 n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CTI5_9RHOB Length = 353 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. Identities = 47/285 (16%), Positives = 93/285 (32%), Gaps = 38/285 (13%) Query: 26 ALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFFFHGQ 85 + ++ H + F+V +D S + ++ F P H Sbjct: 39 VVLPSNSHQKSFVVKAQDIWPSTATFTGNLAFAPE-----------------DIVIVHSL 81 Query: 86 FNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGD 145 F + + + W +WG DL+ ++ F C + G+ Sbjct: 82 FLQHSFEISKQLIQRRAHVVWCMWGGDLHMVAQAPAGFDF-------LNGFSCAISFCGE 134 Query: 146 LSFFAKTH-PKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRA 204 L + + P++ G ++ + + I++GNSGD SN+H+ L Sbjct: 135 LVRYPQITLPEIPGSC--------HKVDASTQSSDLDKEKLIILGNSGDPSNDHLYMLEL 186 Query: 205 VHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQC 264 + + +P Y Y + +L E + L + Y +++ + Sbjct: 187 ASRFKDH--RYHIPFAYNGTP-DYRARLIDKARDLGVWEKTTLQEGMLPLEEYNSIIARA 243 Query: 265 DLGYFIFARQQGIGTLCLLIQAGIPCVLNR--ENPFWQDMTEQHL 307 +L + RQQ +G+L + R P Q M Sbjct: 244 ELYFAAHNRQQALGSLASAYLNNTRVFMRRVITTPSGQTMANPGY 288 >UniRef50_Q0HKL5 Putative uncharacterized protein n=1 Tax=Shewanella sp. MR-4 RepID=Q0HKL5_SHESM Length = 394 Score = 127 bits (318), Expect = 7e-28, Method: Composition-based stats. Identities = 54/287 (18%), Positives = 93/287 (32%), Gaps = 56/287 (19%) Query: 70 AKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFF-WHIWGADLYE---LSSGLRYKLF 125 + + FHG L ALL + Q + W WG D Y S L L Sbjct: 63 QRKELKDYDLVIFHG-----LPCALLIPMVLLKQNYAWLGWGYDYYSRPFDSDLLAEPLV 117 Query: 126 YP------------------------------------LRRLAQKRVGCVFATRGDLSFF 149 P +LA + + Sbjct: 118 LPKTMEYTKTFINDENRSFDVLNHIVKSLIKMLVCSKSFYQLAMRNLKVFSPVLPQEYDL 177 Query: 150 AKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMT---ILVGNSGDRSNEHIAALRAVH 206 K + + + P + + Q + IL+GNS +N H+ AL + Sbjct: 178 VKEKYGLGKDTQYSPWNYGILERHIIKNIQLGEIYSANAILLGNSATATNNHLEALDII- 236 Query: 207 QQFGDTVKVVVPMGYPPNNEAY---IEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQ 263 ++ G + +++P+ Y +E Y I+E +LFS+ Q+L + Y A++ Sbjct: 237 EKTGSSRTIILPLSY--GDEKYAKLIKEYINNNPQLFSQ--CQVLDNFMPLSEYNAIINS 292 Query: 264 CDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVL 310 C RQQ +G + ++ G L E+ ++ V Sbjct: 293 CGFVIMNHVRQQALGNIVAMMYRGSKVFLREESVLYKYFKSMSAYVY 339 >UniRef50_C3XFT2 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XFT2_9HELI Length = 352 Score = 124 bits (310), Expect = 6e-27, Method: Composition-based stats. Identities = 47/275 (17%), Positives = 100/275 (36%), Gaps = 21/275 (7%) Query: 75 NRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQK 134 N ++ H F+ L + ++ +W +WG DLY + Sbjct: 84 NNIKKIICHSLFDEQLVDLFYNNRDLLNKSYWIMWGGDLYHPVRDEKNDFVRK------- 136 Query: 135 RVGCVFATRGDL-SFFAKTHPKVRGELLFFPTRMDPSLNTMAND--RQREGKMTILVGNS 191 DL +A + G+ + + P M ++ +++ +TI + NS Sbjct: 137 ---HFKGYHSDLDKEYALQTYGMEGK-FYRSFYIFPLSREMLDNTVKKQTDCVTIQINNS 192 Query: 192 GDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEK 251 D+S + L + + + V + Y + Y ++ G E+F + + L + Sbjct: 193 SDKST--LEMLDILAKFRDKDIVVRTVVSY--GDTRYNNDIIAKGREIFGNK-FEYLDKL 247 Query: 252 LEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLF 311 L Y L Q D+ A Q+G G + G+ + RE+ + + + Sbjct: 248 LSSHEYAQYLAQNDILILNQANQEGFGNTIASLYLGVKVFIRRESSVYGYLNNDGCHIYD 307 Query: 312 TTD--DLNEDIVREAQRQLASVDKNTIAFFSPNYL 344 + + +L+ D A+ + +++ +YL Sbjct: 308 SMNIVNLSFDEFISNPYSSANKKETREKYYNEDYL 342 >UniRef50_D1PK33 Putative uncharacterized protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PK33_9FIRM Length = 171 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 66/169 (39%), Gaps = 7/169 (4%) Query: 187 LVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQ 246 ++GNS +SN H L + + + + +P+ Y + Y + + E + + Sbjct: 1 MIGNSATKSNRHEEVLNWLSKYSDKEITIYMPLSY--GDSEYRNRIIRISKEKYGISAVP 58 Query: 247 ILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQH 306 I+ + + +Y+ L D+G RQQG+G + L+ G + RE W+ + Sbjct: 59 IV-QYMNTLSYVKFLSTMDIGIINCNRQQGMGNILFLLALGKKVYIRRETTMWESYCSKG 117 Query: 307 LPVLFTTD--DLNEDIVREAQRQLASVDKNT--IAFFSPNYLQGWQRAL 351 + +L + + ++N F Y++ W L Sbjct: 118 YTIFDAAKIPELTYEQFIHFTSKDKEKNENICDQTFTYEQYIREWNDFL 166 >UniRef50_B5JRF7 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JRF7_9BACT Length = 393 Score = 110 bits (275), Expect = 8e-23, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 85/222 (38%), Gaps = 8/222 (3%) Query: 127 PLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMTI 186 +R A +R+ C+ + K ++ L F T + K I Sbjct: 164 RDKRAAFQRINCIATHLPNEMEAIKKSLEIAPLWLNFSYYTIEDFKTDNCET-LPKKHQI 222 Query: 187 LVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQ 246 L+GNS SN H+ AL+ + Q + P+ Y + Y + ++ + ELF + + Sbjct: 223 LLGNSATPSNNHLEALQLLKQLSFKG-TIKCPLSY--GDATYRDALKTSANELFGA-SFE 278 Query: 247 ILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQH 306 + L D Y L+ + + RQQ +G + + G ++ +P + Sbjct: 279 SIESYLPLDDYNRLIAESSVVVMNHYRQQALGNIITALWYGTRVFISDRSPALLYFQKLG 338 Query: 307 LPVLFTTDDLNED--IVREAQRQ-LASVDKNTIAFFSPNYLQ 345 + DL +V + + L + + + A+ + +++ Sbjct: 339 CIIHSIERDLKSPKQLVSLSDTEKLTNRNILSQAYAAETFIK 380 >UniRef50_B3CFK0 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CFK0_9BACE Length = 382 Score = 97.9 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 41/274 (14%), Positives = 92/274 (33%), Gaps = 42/274 (15%) Query: 103 QFFWHIWGADLYE-----------LSSGLRYKLFYPLRRLAQKRVGCVFATRG------- 144 W ++GADLY + +RY + RR +G Sbjct: 92 HVCWEVYGADLYNQFLEPNGFKLYYTDPVRYDKYRVFRRYLPYLFKLALEVKGYKYQFNF 151 Query: 145 ----DLSFFAKTHPKVRGELLF-------------FPTRMDPSLNTMANDRQREGKM--- 184 + + ++ + + + + + ++ Sbjct: 152 QINKQFKYISHRINSIQHCCYYDVALIEQYASRKIYSYEVFNYSLSEVLGKLKDTPFFDG 211 Query: 185 -TILVGNSGDRSNEHIAALRAVHQQ-FGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSE 242 TI+VGNS SN H+ L + + D ++ + + Y + + Y+ EV A F + Sbjct: 212 DTIMVGNSASYSNNHLYVLNFLKRMDLKDELRFTLVLSYGGSKQ-YVSEVENAYKSSFPQ 270 Query: 243 ENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDM 302 + +++L+ L Y + + RQ+ IGT+ + G+ ++ +P ++ Sbjct: 271 K-VEVLTSYLPLQVYNQIFLKVRSMIMSAWRQESIGTIIMGFYLGVKVFMSERSPLYKWF 329 Query: 303 TEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTI 336 + V ED+ + ++ + Sbjct: 330 VDCGFNVFAIETAKEEDLDTPLSIKDKQRNREIV 363 >UniRef50_C3RR99 Predicted protein n=1 Tax=Mollicutes bacterium D7 RepID=C3RR99_9MOLU Length = 372 Score = 95.2 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 49/275 (17%), Positives = 98/275 (35%), Gaps = 29/275 (10%) Query: 56 QFFPGKKSLAEAVIAKAKANRQQRFFFH-----GQFNPTLWLALLSGGIKPSQFFWHIWG 110 F K + ++ ++ N + ++ H L+ LL+ ++ + WG Sbjct: 50 NVFDDLKQYSNVLLDESNKNLYKVYYKHCHLIISHSGEELYRILLTPKKIKNKVVYRYWG 109 Query: 111 A----DLYELSSGLRYKLFYPLRRLAQKRVGCVFATRG--------DLSFFAKTHPKVRG 158 E S L +++ K+ FA G DLS K + Sbjct: 110 GMRILQYDENSKTFGESLKLKVKKYILKKSFSEFAAIGIANITDIIDLSRILKK--DTKY 167 Query: 159 ELLFFPTRMD-------PSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGD 211 L + + + ND + K +L+G+ G N HI L+ + + + Sbjct: 168 YRLSYASNEYYDTVNKLKQKLDIENDINKRYKKRVLLGHRGTEENNHIEILKRLSKYNSE 227 Query: 212 TVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIF 271 + +P+ Y + YI+ V E S+ N+ I+ + ++F Y L D+ F Sbjct: 228 NFDIFIPLSY--GEKKYIQNVENYVKEN-SKGNIVIIKQFMKFSEYAEFLSTIDIAIFDG 284 Query: 272 ARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQH 306 +G L +++ LN + + ++ Sbjct: 285 YTSYALGNLGIILFFNKTVYLNENGVIAKALESEN 319 >UniRef50_Q4ACG5 4-alpha-L-fucosyltransferase (Fragment) n=1 Tax=Edwardsiella tarda RepID=Q4ACG5_EDWTA Length = 66 Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 29/66 (43%), Positives = 39/66 (59%), Gaps = 2/66 (3%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALA--ATSEHAREFMVVGKDDGLSDSCPALSVQFF 58 MT LIHVL +DIPHHN T+LRFF+ L+ A + R FMVV +D L L ++ Sbjct: 1 MTTLIHVLSADIPHHNLTLLRFFDGMLSQRAATAPRRRFMVVARDAALVVDLTTLDIEAC 60 Query: 59 PGKKSL 64 ++L Sbjct: 61 VNYRAL 66 >UniRef50_D1JV60 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JV60_9BACE Length = 276 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 69/217 (31%), Gaps = 39/217 (17%) Query: 70 AKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLR-------- 121 A NR Q H ++ IK +W WGADLY R Sbjct: 63 AVGDINRYQSIIIHFLSGDSVN---FLNRIKHHNIYWIAWGADLYSGLLEERGYKLYESV 119 Query: 122 ---------------YKLFYPLRRLA-----QKRVGCVFATRGDLSFFAK----THPKVR 157 YKL Y +RR V D + ++ Sbjct: 120 DILWRISKWKIPYFIYKLVYKIRRKINTDRMLTGAKKVHYFVPDSMYDEYPLLLSYYPEL 179 Query: 158 GELLFFPTRMDPSLNTMANDRQREGKM--TILVGNSGDRSNEHIAALRAVHQQFGDTVKV 215 L + P + + D + ++++GNS + H++ + + + Sbjct: 180 AHLEYRDFFYYPIDDILGEDLINTTSIGRSVIIGNSSSPTGNHLSVISYLRNFSLGDKDI 239 Query: 216 VVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKL 252 +VP+ Y N++Y V Q G+ F E + + + Sbjct: 240 IVPLSY--GNKSYAALVEQEGMYAFGENFKVVKNFFV 274 >UniRef50_A2G6B1 Glycosyl transferase, group 1 family protein n=1 Tax=Trichomonas vaginalis RepID=A2G6B1_TRIVA Length = 389 Score = 57.4 bits (137), Expect = 8e-07, Method: Composition-based stats. Identities = 52/283 (18%), Positives = 94/283 (33%), Gaps = 26/283 (9%) Query: 88 PTLWLALLSGGIKPSQFF--WHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGD 145 PTL L IK +F WH G + + + +K+ L + + Sbjct: 106 PTLPFCWLLRVIKGKRFVIDWHNLGWSILQCNKSRGWKVLKFLEYITGRWSDGNITVTNA 165 Query: 146 LSFFAKTHPKVRGELLFFPTRM-----DPSLNTMANDRQREGKMTILVGNSGDRS----- 195 L + H + P+ + + E + I+ S Sbjct: 166 LQAHLREHKIESAVVYDKPSNLFKPTRELRSKYAKQLNLEENSIWIMSSTSWTPDEDIDM 225 Query: 196 -NEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 N L + + ++ G PN A+I+EV G + + L + Sbjct: 226 INRTAEILDKELGEKKKNITFIIS-GKGPNQRAFIQEV--KGRNYMNIDFCYP---FLPY 279 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLF 311 + Y LL CD G + G +I AG+P + R + + + E +LF Sbjct: 280 EQYAELLGSCDAGVSLHKSSSGFDLPMKGLDMIGAGLPLLSVRYSCIDELVHEGVDGLLF 339 Query: 312 TTDDLNEDIVR----EAQRQLASVDKNTIAFFSPNYLQGWQRA 350 + +I+R E + + K +I + + W+RA Sbjct: 340 NDEQELANIIRSCFIEKTIDIEKIRKGSIEAGAEKWAGLWERA 382 >UniRef50_D1PK32 Putative uncharacterized protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PK32_9FIRM Length = 187 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 68/193 (35%), Gaps = 17/193 (8%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKS 63 +IHV+ D + + F +A F++ + + F+ Sbjct: 3 VIHVVHRD--KFTKGYINFMKTQMARY---EHCFIIQAEQKL---DLVDQNNVFYVKSFE 54 Query: 64 LAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELS-SGLRY 122 + A + F G FN L L + + + WGAD Y S Sbjct: 55 KTISNELLALMDESDAIIFSGIFNSIYLLKQLPRRL-LKKTYLQFWGADFYSYSEFRSPI 113 Query: 123 KLFYPLRRLAQKRV-----GCVFATRGDLSFFAKTHPKV-RGELLFFPTRMDPSLNTMAN 176 + Y L R +KR+ G +F +G+ + K R ++ PT + Sbjct: 114 HIRYYLHRFMRKRLYNSCAGHIFLIQGEYKKYEAIFGKFDRNFVVSMPTDYVKEIVQEIC 173 Query: 177 D-RQREGKMTILV 188 + RQ++ K TIL+ Sbjct: 174 ELRQKKIKKTILI 186 >UniRef50_A4QXH2 Beta-1,4-mannosyltransferase, putative n=8 Tax=Sordariomycetes RepID=A4QXH2_MAGGR Length = 486 Score = 43.5 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 80/205 (39%), Gaps = 19/205 (9%) Query: 165 TRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFG---DTVKVVVPM-- 219 +R+ S + A R ++ I+ S + L A+ Q D +++VP+ Sbjct: 247 SRILESRDLAAAIVDRRTRL-IVSSTSWTPDEDFNLLLSALVQYANSMQDDSQIIVPVVA 305 Query: 220 ---GYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQG 276 G P Y ++++ E N+ I + L F+ Y ALL DLG + G Sbjct: 306 VITGKGPQKAMYEAKIKKMA-EDGLVPNVTIRTAFLSFEDYAALLASADLGVCLHMSSSG 364 Query: 277 IGT---LCLLIQAGIPCVLNRENPFWQDMTEQHLP--VLFTTDDLNEDIVR----EAQRQ 327 + + + AG+P V + ++ + T +L ++ R E Q + Sbjct: 365 VDLPMKVVDMFGAGLPVVAYSAYESFSELVREGENGRGFETAGELTAELTRLLSVEGQEE 424 Query: 328 LASVDKNTIAFFSPNYLQGWQRALA 352 L + + + S + + W ++A Sbjct: 425 LKHLRQGAVLEGSRRWDEEWDASVA 449 >UniRef50_C9BE40 Capsular polysaccharide biosynthesis protein n=1 Tax=Enterococcus faecium 1,141,733 RepID=C9BE40_ENTFC Length = 367 Score = 42.0 bits (97), Expect = 0.034, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 80/248 (32%), Gaps = 18/248 (7%) Query: 81 FFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVF 140 H + + + F+ G Y + L + ++YPL + + + Sbjct: 84 IIHTHTPVASLITRIVCKNMNVKVFYTAHGFHFYRGAPKLNWLVYYPLEKYLSRFTDTIL 143 Query: 141 AT-RGDLSFFAKTHPKVRGELL---------FFPTRMDPSLNTMANDRQREGKMTILVGN 190 + D ++ + L+ + +++ + + IL Sbjct: 144 TINQEDYQIASQKFHSKKVYLINGVGIPIEKYKKIQINTMRKKAELGIKDKKTKIILSVG 203 Query: 191 SGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSE 250 +R+ H + A+ Q K + + Y + + +L EE + +L Sbjct: 204 ELNRNKNHTMVIEALKQFKDKNFKYFI---CGVGSLDYALKEKIKNSDL--EEKVVLLGY 258 Query: 251 KLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVL 310 + L +++ DL F R+ ++ + G+P V + + + L Sbjct: 259 R---TDVLEIMKISDLFVFPSKREGLPVSVMEAMSIGLPVVASNIRGNMDLIQDNIAGKL 315 Query: 311 FTTDDLNE 318 F + L E Sbjct: 316 FDVNALTE 323 >UniRef50_B9JE97 Glycosyltransferase protein n=1 Tax=Agrobacterium radiobacter K84 RepID=B9JE97_AGRRK Length = 994 Score = 41.6 bits (96), Expect = 0.048, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 55/146 (37%), Gaps = 22/146 (15%) Query: 146 LSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAV 205 A++ + + S + ++ ++ IL N H Sbjct: 777 FEHVARSTFNLNEN----SFIVFTSFDGASSISRKNPMAAILAFQKAFPRNTH------- 825 Query: 206 HQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCD 265 V++VV ++ + + VR++ L ++ + I +E L+ DAY LL CD Sbjct: 826 -----PDVQLVVKAMNALDDGLWRDCVRKSYL----DDRIHIRNEVLDRDAYYQLLACCD 876 Query: 266 LGYFIFARQQGIGTLCL-LIQAGIPC 290 + + R +G G L + GIP Sbjct: 877 VVLSMH-RAEGFGRLMAEAMAIGIPV 901 >UniRef50_B1CBD5 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1CBD5_9FIRM Length = 379 Score = 40.5 bits (93), Expect = 0.093, Method: Composition-based stats. Identities = 36/330 (10%), Positives = 105/330 (31%), Gaps = 32/330 (9%) Query: 23 FNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFFF 82 F D + + + + L D P L ++ V +A Sbjct: 41 FKDFYNKHNIN---ILQTDRGTFLFDRIPKLKTLV-----NMFRKVKKEAGNGMFDVIHI 92 Query: 83 HGQFNPTLWLALLSGGIKP-SQFFWHIWGADLYELSSGLRYKLFYPLRRLAQ------KR 135 H + + L ++ + WG+DL + K L + Sbjct: 93 HSVPSNFMITFLNKFIVRFGKKIVCTYWGSDLLSKTREQLMKAIPCLDKAECISYSSDGM 152 Query: 136 VGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQR-----EGKMTILVGN 190 GD + + + + + + + + ++ + + K+++ +G Sbjct: 153 DSYFHEVFGD--IYNEKIVRAKFGISIYDVIDEEKKHKTKDECKEFFNIEKDKISVAIGY 210 Query: 191 SGDRSNEHIAALRAVHQQFGD---TVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQI 247 +G +H+ + + + D + +V+ + Y + Y + + ++ + I Sbjct: 211 NGSLRQQHLRVINELSKLNSDILDKLNIVIQLSYGLTCDEYRQNIIDEISKINVKH--VI 268 Query: 248 LSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHL 307 ++ L D L D+ ++ + AG ++N +++ + + Sbjct: 269 INNFLNKDESAMLRIATDIFIHAQESDAFSASIQECVYAGS-ILVNPSWIMYKEFDDIGI 327 Query: 308 PVL----FTTDDLNEDIVREAQRQLASVDK 333 + F + + + + ++++ K Sbjct: 328 DYIKYNSFDELPMIIKDITDGKIKISNNGK 357 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B5FN86 4-alpha-L-fucosyltransferase n=47 Tax=Enterobact... 424 e-117 UniRef50_A4TRB6 4-alpha-L-fucosyltransferase n=126 Tax=Enterobac... 410 e-113 UniRef50_B0BRE0 TDP-Fuc4NAc:lipid II Fuc4NAc transferase n=6 Tax... 344 4e-93 UniRef50_B8F5S7 4-alpha-L-fucosyltransferase n=2 Tax=Haemophilus... 331 2e-89 UniRef50_B1XZR9 4-alpha-L-fucosyltransferase n=1 Tax=Leptothrix ... 282 1e-74 UniRef50_Q21IU4 Glycosyltransferase-like protein n=1 Tax=Sacchar... 282 2e-74 UniRef50_A6VTI0 Putative uncharacterized protein n=1 Tax=Marinom... 277 5e-73 UniRef50_A9BJR5 Putative uncharacterized protein n=1 Tax=Petroto... 276 9e-73 UniRef50_C3XKC6 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 276 1e-72 UniRef50_A6QAJ2 Putative uncharacterized protein n=1 Tax=Sulfuro... 275 2e-72 UniRef50_B7IGW7 Putative uncharacterized protein n=1 Tax=Thermos... 256 9e-67 UniRef50_C3WR04 4-alpha-L-fucosyltransferase n=2 Tax=Fusobacteri... 236 9e-61 UniRef50_C1TR64 4-alpha-L-fucosyltransferase (Fuc4NAc transferas... 224 4e-57 UniRef50_C3XKD1 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 207 6e-52 UniRef50_C9CTI5 Rb124 n=1 Tax=Silicibacter sp. TrichCH4B RepID=C... 204 4e-51 UniRef50_C3XFT2 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacte... 196 1e-48 UniRef50_Q8KWB8 RB124 n=1 Tax=Ruegeria sp. PR1b RepID=Q8KWB8_9RHOB 191 3e-47 UniRef50_A2G6B1 Glycosyl transferase, group 1 family protein n=1... 190 8e-47 UniRef50_C1FUB2 Putative uncharacterized protein n=1 Tax=Clostri... 189 1e-46 UniRef50_B5JRF7 Putative uncharacterized protein n=1 Tax=Verruco... 170 7e-41 UniRef50_Q0HKL5 Putative uncharacterized protein n=1 Tax=Shewane... 168 4e-40 UniRef50_C3RR99 Predicted protein n=1 Tax=Mollicutes bacterium D... 162 2e-38 UniRef50_B3CFK0 Putative uncharacterized protein n=1 Tax=Bactero... 159 9e-38 UniRef50_D1PK33 Putative uncharacterized protein n=1 Tax=Subdoli... 152 2e-35 UniRef50_D1JV60 Putative uncharacterized protein n=1 Tax=Bactero... 116 9e-25 UniRef50_Q4ACG5 4-alpha-L-fucosyltransferase (Fragment) n=1 Tax=... 69 2e-10 Sequences not found previously or not previously below threshold: UniRef50_A4QXH2 Beta-1,4-mannosyltransferase, putative n=8 Tax=S... 70 1e-10 UniRef50_Q22797 Putative uncharacterized protein n=3 Tax=Caenorh... 61 9e-08 UniRef50_A1DPC9 Beta-1,4-mannosyltransferase (Alg1), putative n=... 57 9e-07 UniRef50_A9VC87 Predicted protein n=1 Tax=Monosiga brevicollis R... 54 7e-06 UniRef50_C5DUF7 ZYRO0C16368p n=1 Tax=Zygosaccharomyces rouxii Re... 54 9e-06 UniRef50_P16661 Chitobiosyldiphosphodolichol beta-mannosyltransf... 53 2e-05 UniRef50_A0BGC6 Chromosome undetermined scaffold_106, whole geno... 52 2e-05 UniRef50_D2V0T5 Predicted protein (Fragment) n=1 Tax=Naegleria g... 52 3e-05 UniRef50_B5DX75 GA26165 n=6 Tax=Drosophila RepID=B5DX75_DROPS 52 4e-05 UniRef50_O13933 Chitobiosyldiphosphodolichol beta-mannosyltransf... 51 5e-05 UniRef50_Q6C3K2 Chitobiosyldiphosphodolichol beta-mannosyltransf... 51 6e-05 UniRef50_UPI000180BB10 PREDICTED: similar to beta-1,4-mannosyltr... 51 6e-05 UniRef50_B6JZQ7 Chitobiosyldiphosphodolichol beta-mannosyltransf... 51 7e-05 UniRef50_Q9VEE9 CG18012 n=9 Tax=Diptera RepID=Q9VEE9_DROME 51 8e-05 UniRef50_C9BE40 Capsular polysaccharide biosynthesis protein n=1... 50 1e-04 UniRef50_B1CBD5 Putative uncharacterized protein n=1 Tax=Anaerof... 50 2e-04 UniRef50_C4PYE2 Chitobiosyldiphosphodolichol alpha-mannosyltrans... 49 2e-04 UniRef50_C0SHG1 Chitobiosyldiphosphodolichol beta-mannosyltransf... 49 2e-04 UniRef50_B6HPF8 Pc22g00440 protein n=6 Tax=Eurotiomycetidae RepI... 49 2e-04 UniRef50_C5P1X3 Putative uncharacterized protein n=2 Tax=Coccidi... 49 3e-04 UniRef50_B0ELC7 Chitobiosyldiphosphodolichol beta-mannosyltransf... 47 7e-04 UniRef50_D1HCT1 Whole genome shotgun sequence of line PN40024, s... 47 9e-04 UniRef50_A4S8H0 Predicted protein n=1 Tax=Ostreococcus lucimarin... 47 0.001 UniRef50_C2RVV5 Glycosyltransferase n=1 Tax=Bacillus cereus BDRD... 47 0.001 UniRef50_D1PK32 Putative uncharacterized protein n=1 Tax=Subdoli... 46 0.002 UniRef50_UPI0000E12861 Os06g0564800 n=1 Tax=Oryza sativa Japonic... 46 0.002 UniRef50_A8QHU2 Glycosyl transferase, group 1 family protein n=1... 45 0.004 UniRef50_B8ELN5 Glycosyl transferase group 1 n=4 Tax=Alphaproteo... 45 0.006 UniRef50_B2WNH6 Chitobiosyldiphosphodolichol beta-mannosyltransf... 44 0.006 UniRef50_C6QI52 Glycosyl transferase group 1 n=1 Tax=Hyphomicrob... 44 0.008 UniRef50_UPI000186D588 Chitobiosyldiphosphodolichol beta-mannosy... 44 0.008 UniRef50_C1BUJ6 Glycosyltransferase ALG1-like n=4 Tax=Pancrustac... 44 0.009 UniRef50_Q23MP4 Chitobiosyldiphosphodolichol beta-mannosyltransf... 44 0.012 UniRef50_C6Y0M4 Putative uncharacterized protein n=1 Tax=Pedobac... 43 0.016 UniRef50_Q10QW6 Os03g0180700 protein n=5 Tax=Oryza sativa RepID=... 43 0.017 UniRef50_Q8XN63 Capsular polysaccharide biosynthesis protein n=3... 42 0.024 UniRef50_C2G144 Possible group 1 glycosyl transferase n=2 Tax=Sp... 42 0.025 UniRef50_B1CBE1 Putative uncharacterized protein n=1 Tax=Anaerof... 42 0.028 UniRef50_Q1ZPV1 Putative integrase n=2 Tax=Photobacterium RepID=... 42 0.030 UniRef50_Q64WD8 Putative uncharacterized protein n=3 Tax=Bactero... 42 0.042 UniRef50_P90522 Chitobiosyldiphosphodolichol beta-mannosyltransf... 42 0.047 UniRef50_Q6BS98 Chitobiosyldiphosphodolichol beta-mannosyltransf... 41 0.088 >UniRef50_B5FN86 4-alpha-L-fucosyltransferase n=47 Tax=Enterobacteriaceae RepID=WECF_SALDC Length = 359 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. Identities = 297/357 (83%), Positives = 329/357 (92%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPG 60 MTVLIHVLGSDIPHHN TVLRFFND LAATSEHAREFMV G+D+G ++SCPALS++F+ Sbjct: 1 MTVLIHVLGSDIPHHNHTVLRFFNDTLAATSEHAREFMVAGEDNGFTESCPALSLRFYGS 60 Query: 61 KKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGL 120 KK+LA+AVIAKAKANR+QRFFFHGQFN +LWLALLSGGIKP+QF+WHIWGADLYE+S+GL Sbjct: 61 KKALAQAVIAKAKANRRQRFFFHGQFNTSLWLALLSGGIKPAQFYWHIWGADLYEVSNGL 120 Query: 121 RYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQR 180 +++LFYPLRR+AQ RVGCVFATRGDLS+FA+ HP VRGELL+FPTRMDPSLN MA + QR Sbjct: 121 KFRLFYPLRRIAQGRVGCVFATRGDLSYFARQHPNVRGELLYFPTRMDPSLNAMAKECQR 180 Query: 181 EGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELF 240 GK+TILVGNSGDRSN+HIAALRAV+QQFGDTV VVVPMGYP NN+AYI+EVRQAGL LF Sbjct: 181 AGKLTILVGNSGDRSNQHIAALRAVYQQFGDTVNVVVPMGYPANNQAYIDEVRQAGLALF 240 Query: 241 SEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ 300 S ENLQILSEK+EFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQA IPCVLNR+NPFWQ Sbjct: 241 SAENLQILSEKMEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQADIPCVLNRDNPFWQ 300 Query: 301 DMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAIAARE 357 DM EQHLPVLFTTDDLNE +VREAQRQLASVDK+ I FFSPNYLQ W AL IAA E Sbjct: 301 DMAEQHLPVLFTTDDLNEQVVREAQRQLASVDKSGITFFSPNYLQPWHNALRIAAGE 357 >UniRef50_A4TRB6 4-alpha-L-fucosyltransferase n=126 Tax=Enterobacteriaceae RepID=WECF_YERPP Length = 361 Score = 410 bits (1053), Expect = e-113, Method: Composition-based stats. Identities = 228/359 (63%), Positives = 280/359 (77%), Gaps = 2/359 (0%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAAT--SEHAREFMVVGKDDGLSDSCPALSVQFF 58 M L HVLGSDIPHHN TVLRFFND LA E R FMV K+ S P L + + Sbjct: 1 MITLTHVLGSDIPHHNLTVLRFFNDVLAKCLPVEQVRHFMVAAKETAPFSSFPQLDINTY 60 Query: 59 PGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSS 118 KK+LAEAVIA+A+A+R RFF+HGQFN TLWLALLSG IKP Q +WH+WGADLYE + Sbjct: 61 SDKKALAEAVIARAQADRSARFFWHGQFNATLWLALLSGKIKPGQVYWHVWGADLYEDAK 120 Query: 119 GLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDR 178 L+++LFY LRR+AQ R G VFATRGDL + + HP+V LL+FPTRMDP+L + D+ Sbjct: 121 SLKFRLFYLLRRIAQGRGGHVFATRGDLIHYQQRHPRVPASLLYFPTRMDPALTAINIDK 180 Query: 179 QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLE 238 G MTILVGNSGD +N HI AL+A+HQQFG V+V++PMGYP NNEAYIE+VRQAGL Sbjct: 181 PLAGPMTILVGNSGDTTNRHIEALKAIHQQFGPDVRVIIPMGYPANNEAYIEQVRQAGLA 240 Query: 239 LFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPF 298 LFS++NL+IL+E++ FD YL +LR+CDLGYFIF RQQGIGTLCLL Q G+P VL+R+NPF Sbjct: 241 LFSQDNLRILTEQIPFDDYLNILRECDLGYFIFNRQQGIGTLCLLTQFGVPFVLSRKNPF 300 Query: 299 WQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAIAARE 357 WQD+ EQH+PV F D L+E ++REAQRQLA +DK IAFF+PNY++GW++ALA+AA E Sbjct: 301 WQDLAEQHIPVFFYGDTLDEPMIREAQRQLAGLDKQAIAFFNPNYIEGWKQALALAAGE 359 >UniRef50_B0BRE0 TDP-Fuc4NAc:lipid II Fuc4NAc transferase n=6 Tax=Pasteurellaceae RepID=B0BRE0_ACTPJ Length = 356 Score = 344 bits (881), Expect = 4e-93, Method: Composition-based stats. Identities = 160/354 (45%), Positives = 232/354 (65%), Gaps = 7/354 (1%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDAL-AATSEHAREFMVVGKDDGLSDSCPALSVQFFP 59 M + H+LGSDIPHHNRTVL FF D L +E F VVG+ L+ P L++Q F Sbjct: 1 MRPIYHILGSDIPHHNRTVLNFFRDQLLPKLTEQQHYFYVVGQQTLLTQY-PELNLQVFC 59 Query: 60 GKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSG 119 ++++ AV+ AK + +F HGQ+N LW+A+L G + + WHIWGADLYE +SG Sbjct: 60 SRQAITRAVVQTAKQVKTAKFVLHGQYNVWLWIAVLFGYLPACRCIWHIWGADLYEEASG 119 Query: 120 LRYKLFYPLRRLAQKRVGCVFATRGDLSFFAK--THPKVRGELLFFPTRMDPSLNTMAND 177 ++KLFY +RRLAQ+++ ++ATRGDL+F + + +L+FPT+M + D Sbjct: 120 WKFKLFYFIRRLAQQKLPVLWATRGDLTFAKRHLNRTDTQDRVLYFPTKMG--SRAVMQD 177 Query: 178 RQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGL 237 + + TIL+GNSGD SN H+AAL + Q + V++++PMGYP NN+ YIE+V++ + Sbjct: 178 TRENQRFTILLGNSGDPSNRHLAALAQLKQSLAEDVRIIIPMGYPSNNQTYIEQVKRQAV 237 Query: 238 ELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENP 297 ELF + +++L+EKL+F Y LL QCDLGYF F RQQ IGT+CLLIQ +P VL +ENP Sbjct: 238 ELFPKHTVEVLTEKLDFTQYQQLLAQCDLGYFYFNRQQAIGTICLLIQQNVPLVLTKENP 297 Query: 298 FWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRAL 351 F DM +++P L++ + L VR+ ++QL + DKN I FF+P+Y + W L Sbjct: 298 FCIDMQAENVPFLYSDE-LTIAKVRQVKQQLQNCDKNNIGFFAPHYNEQWLTLL 350 >UniRef50_B8F5S7 4-alpha-L-fucosyltransferase n=2 Tax=Haemophilus parasuis RepID=B8F5S7_HAEPS Length = 389 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 162/353 (45%), Positives = 224/353 (63%), Gaps = 8/353 (2%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPG 60 M +IHVLG++IPHHN T+L FF + L +A F VV + D LS++ L + +P Sbjct: 35 MANIIHVLGANIPHHNHTILNFFQNELLDELPNAFHFYVVSRSD-LSETFTLLDINSYPD 93 Query: 61 KKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGL 120 + L + VI A+ F HGQFN +LWLA+L G + ++ WHIWGADLYELSS L Sbjct: 94 EYLLTQEVIKIARKEPTAWFVLHGQFNTSLWLAILLGLVPANRCVWHIWGADLYELSSSL 153 Query: 121 RYKLFYPLRRLAQKRVGCVFATRGDLSFFAKT--HPKVRGELLFFPTRMDPSLNTMANDR 178 +++LFYPLRRLAQ+++ ++ T GDL+ + + L+FPTRM + T Sbjct: 154 KFRLFYPLRRLAQRKIARLWGTLGDLNHAYQQLKRKSSLDQRLYFPTRMPTNFPTKKCSY 213 Query: 179 QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLE 238 QR TIL+GNSGD SN+H+ L+ + + FG+ V++VVPMGYP N YI +VRQ + Sbjct: 214 QR----TILLGNSGDPSNQHLLGLKQIREIFGENVRIVVPMGYPAGNSKYIAQVRQQAEQ 269 Query: 239 LFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPF 298 F + + IL++KLEF++YL LL QCDLGYF F RQQGIGT+CLLIQ IP ++R NPF Sbjct: 270 DFQQGQVNILTQKLEFESYLDLLSQCDLGYFPFERQQGIGTMCLLIQMNIPIAIHRANPF 329 Query: 299 WQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRAL 351 QD+ + + LF + N +I+R + QLA +DK+ I FF +Y Q W L Sbjct: 330 QQDLQAEGISFLFADEISNSEIIR-VKSQLALLDKSKITFFPLSYKQEWLNCL 381 >UniRef50_B1XZR9 4-alpha-L-fucosyltransferase n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1XZR9_LEPCP Length = 350 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. Identities = 79/358 (22%), Positives = 134/358 (37%), Gaps = 26/358 (7%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGK-- 61 ++H++ + + F + F + + D L + Sbjct: 6 VLHIV-----EFEKFIPPFIELTRTRLAADRHHFFFIHRGD-LYQPGEKSRIHRLSDYAA 59 Query: 62 KSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSS--- 118 K + AV+A K N + HG + L + L + +W IWGADLY Sbjct: 60 KPIGLAVLA-KKMNLAGKIVLHGLTHFRLLVLLALQPWLLKKTYWIIWGADLYAYQKIGT 118 Query: 119 GLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGE---LLFFPTRMDPSLNTMA 175 + +L LRR R+G + A+ +G+ L + + + L Sbjct: 119 SWQSRLKEMLRRFVIPRIGHLVTYVSGDVALARQWYGAKGQHHDCLCYASNVYHHLELPT 178 Query: 176 NDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQA 235 R + ILVGNS DRSN H A+ +++ P+ Y ++ Y +EV + Sbjct: 179 --RNSGSNLQILVGNSADRSNNHDGIFAALLPHLETGLEIHAPLSY--GDQQYADEVTKF 234 Query: 236 GLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRE 295 G F + L E + + YL LL D+ F RQQG+G + L+ G + + Sbjct: 235 GSAKFGSK-FHALREFMPYGDYLKLLSGIDIAVFNHERQQGMGNIISLLGLGKTVYMRKS 293 Query: 296 NPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTI--AFFSPNYLQGWQRAL 351 WQ +T L V ++ + +LA ++ I F + W+ L Sbjct: 294 TTSWQSLTNLGLTV----GNIEDFKPAPLSSELAERNRQIIREQFSETKLVNQWRSIL 347 >UniRef50_Q21IU4 Glycosyltransferase-like protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21IU4_SACD2 Length = 355 Score = 282 bits (720), Expect = 2e-74, Method: Composition-based stats. Identities = 83/357 (23%), Positives = 142/357 (39%), Gaps = 16/357 (4%) Query: 3 VLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKK 62 ++H+ D + + F + L S F++ + L V + Sbjct: 7 NVVHICSFD--KFIKGFVEF--NELNCNSFDHEYFIIGYHEKYLLPEFGDFKVMPRAYTE 62 Query: 63 SLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLY-ELSSGLR 121 + + + HG F+ L L L++ S+ +W +WGADLY + + Sbjct: 63 KIGYLCRLFRRMMFADKVIVHGLFDNKLILFLVANYTCLSKVYWVMWGADLYVKEEKCFK 122 Query: 122 YKLFYPLRRLAQKRVGC-VFATRGDLSFFAKTHP--KVRGELLFFPTRMD--PSLNTMAN 176 K+ LRR+ R+G V RGD + + + + +P+ + P + + Sbjct: 123 EKIVSKLRRVICSRLGGVVTYIRGDYQYAQQRWGVVGRYCDCIMYPSNIYVEPEEERVQS 182 Query: 177 DRQREG-KMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQA 235 + G + ILVG+S D SN H + D +++ P+ Y NE Y ++V Q Sbjct: 183 NSDENGSSINILVGHSADPSNNHKCIFDMLADSGVDNMRIYAPLSY--GNEMYRDDVVQY 240 Query: 236 GLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRE 295 G ELF ++ + L++ L YLALL D+ F RQQ +G LI G + Sbjct: 241 GKELFGDD-FRPLTKFLPLKDYLALLADIDIAIFDHKRQQAMGNTINLIGLGKTVYMRTN 299 Query: 296 NPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALA 352 WQ E + VL + ++ + QR+ + F NY + A Sbjct: 300 VTQWQLFNELGVAVLD-LERFEGQLLTQEQRR-KNNQIIKKYFSVENYKNQLNKLYA 354 >UniRef50_A6VTI0 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MWYL1 RepID=A6VTI0_MARMS Length = 347 Score = 277 bits (708), Expect = 5e-73, Method: Composition-based stats. Identities = 70/347 (20%), Positives = 137/347 (39%), Gaps = 19/347 (5%) Query: 12 IPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEA---V 68 I + + + F F++ + G + + FF + ++ + Sbjct: 8 ISNWSVFIPPFIEFVKEHFDYKRHVFLITDGNTG--HKLNSDANVFFSKRTLVSRLMHYI 65 Query: 69 IAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELS---SGLRYKLF 125 K N+ ++ HG F+P L L L + +W +WG DLY +KL Sbjct: 66 RVVIKMNQAKKVILHGLFDPVLILILFFMPWLLKKCYWVMWGGDLYVYQLGERNWIWKLR 125 Query: 126 YPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGE---LLFFPTRMDPSLNTMANDRQREG 182 RR + +G + A+ + +G+ +P+ + + + Sbjct: 126 EFFRRPVIRNMGYLVNGTTGDVDLARKWYRAKGQHISCFNYPSNIYKHYDVKT---KTHD 182 Query: 183 KMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSE 242 + I +GNS D +N+HI L + + +K+ V + Y ++ Y ++V G + F + Sbjct: 183 TVNIQLGNSADPTNQHIEILDQLVRFKEQNIKIFVVLSY--GDQDYAKKVITEGKKKFDD 240 Query: 243 ENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDM 302 + + I +E + F+AYL L D+ F RQQ G L+ G LN + Sbjct: 241 KFIAI-TEMMPFEAYLEFLASIDVAVFNHNRQQAFGNTITLLGLGKKVFLNPASTLNGVF 299 Query: 303 TEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQR 349 +E + + F + + + E +Q A++ K F + ++ Q Sbjct: 300 SEFGIQI-FNSKKIELTSLDEVTKQ-ANIAKVKYHFSKDSLVKSLQS 344 >UniRef50_A9BJR5 Putative uncharacterized protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BJR5_PETMO Length = 351 Score = 276 bits (706), Expect = 9e-73, Method: Composition-based stats. Identities = 80/360 (22%), Positives = 156/360 (43%), Gaps = 24/360 (6%) Query: 2 TVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGK 61 T ++H++ S ++N ++F + + + +++GK++G DS + F K Sbjct: 4 TEILHII-SGTSNYNINFIKF---VYSYFNIDKQRLIILGKNNGFYDS----KILFISKK 55 Query: 62 KSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYEL---SS 118 K + + + KA ++ F H F+P L L L + +W +WG DLY + Sbjct: 56 KEVFKLIKEMRKA---EKIFVHSLFSPHLVLLLFLQPWLLKKSYWVLWGGDLYYYKFRNK 112 Query: 119 GLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLF--FPTRM-DPSLNTMA 175 L+ + +R+ K + A + AK K + + + +P + L+ + Sbjct: 113 NLKSNFYEFIRKRVIKNFAHIVALVPGDYYLAKNWYKTKAQYHYAFYPNPIDYEYLDKIK 172 Query: 176 NDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQA 235 N ++ ++ I VGNS D N+HI L + + ++++ P+ Y ++ + + V +A Sbjct: 173 NSKKETDRIVIQVGNSADPMNKHIEILNKLSRFKEKNIEIITPLSY--GDQKWAKTVSEA 230 Query: 236 GLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRE 295 G +L+ E+ Q L E L + Y +L D+ F RQQ +G + L+ G + + Sbjct: 231 GKKLYGEK-YQPLLEFLPSEEYSKILNSVDIAIFNHDRQQALGNILALLYLGKKVFIKSD 289 Query: 296 NPFWQDMTEQHLPVLFT--TDDLNEDIVREAQRQLASVDKNTI--AFFSPNYLQGWQRAL 351 W + L V T D+L D + + L ++ I F + W+ Sbjct: 290 ITPWDFFKGKGLIVFDTYGLDNLTFDELIYMDKNLKERNREIIAKEFSEEKCAELWRNIF 349 >UniRef50_C3XKC6 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XKC6_9HELI Length = 352 Score = 276 bits (705), Expect = 1e-72, Method: Composition-based stats. Identities = 65/350 (18%), Positives = 125/350 (35%), Gaps = 18/350 (5%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKS 63 ++H+L S H+ + F +F+ V D V+ K Sbjct: 16 ILHILSSAT--HSVRFVEFMQK---YFDLKKHKFVYVRPDICKYGLSNFKEVEHISTLKQ 70 Query: 64 LAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYK 123 + + KA + HG + + L + +W +WG D L + Sbjct: 71 QLKLIYLMQKA---DKIILHGLWRHEVINLLYFQKWLLKKCYWVLWGGDF-----CLGKE 122 Query: 124 LFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGK 183 + + VG + + GD + K + + N ++GK Sbjct: 123 SYSRRHNFVLQNVGHLISIAGDYEYVKKEYNTKGEVFYSKSFYVSNVFNGELYLSNKDGK 182 Query: 184 MTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEE 243 + IL+GNS D N H L A+ ++++ P+ Y N+ Y +E+ + G +F + Sbjct: 183 LVILIGNSADPLNLHKDILNALKPYRDSNIELICPLSYGS-NKEYQDEIIEYGKNIFGAK 241 Query: 244 NLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMT 303 + L E + +AYL LL D+ F QQ G + L+ G + + + ++ Sbjct: 242 -FKPLVEFMPLNAYLDLLSSLDIAIFAHKNQQAYGNIIQLLGMGKKVYMRK-TTAYNEVL 299 Query: 304 EQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRALAI 353 + L + D D+ R + + + N L ++ + Sbjct: 300 KNGLKIFDF--DSGVDLCRIDDSAIKNHHLTKNIYSLENMLSEFKELFGV 347 >UniRef50_A6QAJ2 Putative uncharacterized protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QAJ2_SULNB Length = 355 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 79/361 (21%), Positives = 155/361 (42%), Gaps = 27/361 (7%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPA-LSVQFFPGKK 62 ++H++ H+++ ++ F + E+ F+ + D+ + P +V + Sbjct: 6 IVHLI-----HNDKFIVPFMDFIAKHFDENEHLFVYLFDDNVVKYPIPESRNVLNLCNRY 60 Query: 63 SLAEAVIAKAK-----ANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELS 117 + + +K + ++ HG F+ L L + +W +WG DLY Sbjct: 61 LGRKNIFGLSKALNPLMEKAEKIILHGLFSDDLINYLYYHQYFLKKCYWVMWGGDLYGHI 120 Query: 118 SGLR-YKLFYPL---RRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNT 173 ++ +K + L R++ Q+ G + +GD K H G+ ++ M S Sbjct: 121 DPIKIWKNIFRLHRRRKVVQEMGGLITYIKGDYELVCK-HYGAAGK--YYECFMYTSNLY 177 Query: 174 MANDRQ--REGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEE 231 D + + I +GNS D +N HI L + + G+ +K+ +P+ Y N+ Y +E Sbjct: 178 KEYDIKHKEHSTINIQLGNSADLTNNHIEVLNELRKYKGENIKIFIPLSY--GNQEYAKE 235 Query: 232 VRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCV 291 V G ELF ++ + L+E + FD YL L + D+ F RQQ +G L+ G Sbjct: 236 VIAKGKELFGDKFV-ALTEFMPFDKYLEFLGEIDIAIFAHKRQQAMGNTITLLGLGKKVY 294 Query: 292 LNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFS-PNYLQGWQRA 350 + + W+ + ++ + +++ ++ E R + KN +FS NYL + Sbjct: 295 MRSDITPWKLFKDINVNIFDI-ENIELKLIAEKDR--LNNQKNIKEYFSRENYLNQLRNL 351 Query: 351 L 351 Sbjct: 352 F 352 >UniRef50_B7IGW7 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IGW7_THEAB Length = 358 Score = 256 bits (654), Expect = 9e-67, Method: Composition-based stats. Identities = 74/358 (20%), Positives = 142/358 (39%), Gaps = 22/358 (6%) Query: 5 IHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSL 64 +H+L + ++ + F N S F+ K D L Sbjct: 3 LHILNDN--KYSDKFIEFINQ---NFSISDHHFVTFSKRPKYLDRGKVE----IVDIYKL 53 Query: 65 AEAVIAKAKANRQQRFFFHGQFNP-TLWLALLSGGIKPSQFFWHIWGADLYEL----SSG 119 ++ K + + F H F + L + +W +WG DLY S Sbjct: 54 SQIRWLYKKISNADKIFLHSLFRGGKSLMLFLFNRKNLYKTYWVVWGGDLYNYWLKDSHS 113 Query: 120 LRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELL--FFPTRMDPSLNTMAND 177 +R K+ L+R K++ + A + FAK K + + F+ +D + +D Sbjct: 114 VREKVLEKLKRKVIKKIYGIIALVQEDYLFAKEKYKTKAKYYYAFYLNPVDFKMLDTFSD 173 Query: 178 RQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGL 237 ++ + + TIL+GNS +N H L ++ + + K++ P+ Y + + YI++V + G+ Sbjct: 174 QKNKEEKTILIGNSAAPTNNHFEILSSLSKYRLNNFKIICPLSYGSS-QEYIKKVCEYGV 232 Query: 238 ELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENP 297 +LF +N L+E L + Y +L D+ F RQQ +G + L+ G + + Sbjct: 233 KLFG-DNFIALTEFLSPEEYAKILANVDVAIFAHRRQQALGNILALLYLGKKVYIRSDIS 291 Query: 298 FWQDMTEQHLPVLFTTDDLN----EDIVREAQRQLASVDKNTIAFFSPNYLQGWQRAL 351 W + V T + L+ + E + + + + F +Q W+ Sbjct: 292 SWAFFNRFGIKVFDTKNILDGSEKDIFTFEEKIAIKNREIVLNEFSEERCVQLWKNVF 349 >UniRef50_C3WR04 4-alpha-L-fucosyltransferase n=2 Tax=Fusobacterium RepID=C3WR04_9FUSO Length = 352 Score = 236 bits (602), Expect = 9e-61, Method: Composition-based stats. Identities = 70/355 (19%), Positives = 137/355 (38%), Gaps = 25/355 (7%) Query: 15 HNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGK------------- 61 +++ + F + S+ F V+ + L +++++ K Sbjct: 3 NDKFINPFIDFINKNFSKEENIFFVIDGLETLKV-LNEKNIEWYISKGRNLKSILKKIIL 61 Query: 62 --KSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSG 119 + + + ++ +FHG F+P + + + + +W IWG DLY Sbjct: 62 LLQLPILYIKLFNYCRKCEKIYFHGLFDPRVTIFIYFFRFFLKKSYWIIWGGDLYSYKDR 121 Query: 120 LRYKLFYPLRRLAQKRV-GCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDR 178 + FY + + + G + +G+ + KV+G F+ + PS + Sbjct: 122 KKKSFFYNIEDYVKGNMKGYISYIKGEFKLV-QEWFKVKGN--FYSSFTYPSNLYKKIEI 178 Query: 179 QREGK--MTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAG 236 ++EGK + + VGNS D SN H L + + +K+ + Y NE Y V + G Sbjct: 179 RKEGKEGLWVQVGNSADPSNNHFEILEKLSKFKDMNIKLFCILSYG-GNEEYKNRVIKRG 237 Query: 237 LELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNREN 296 ELF ++ IL ++FD Y+ L D+ F RQQ G + L+ L + Sbjct: 238 SELFKDKFCPIL-NFMKFDEYMNFLSSLDIAIFAHDRQQAFGNITSLLSMKKTVYLKEKV 296 Query: 297 PFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRAL 351 +Q + E + V ++ + E + + F ++ W+ Sbjct: 297 TTYQTLKEMGIKVRSFDKLVDLEEFDENTLE-NNRKIIEENFSEEKLIEQWENIF 350 >UniRef50_C1TR64 4-alpha-L-fucosyltransferase (Fuc4NAc transferase) n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TR64_9BACT Length = 353 Score = 224 bits (571), Expect = 4e-57, Method: Composition-based stats. Identities = 79/363 (21%), Positives = 145/363 (39%), Gaps = 36/363 (9%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKS 63 ++HVL +++ + D F++ + SD + + Sbjct: 5 ILHVLPDST--YSQYFFKLTKDYAGNF------FVLYDYKNNFSDIPCLFKKKPRSRSER 56 Query: 64 LAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYE---LSSGL 120 L + + N+ FFH N G+ F W IWG DLY L Sbjct: 57 LFKLLDVV---NKYDHVFFHSIDNMISICFFARRGV---TFHWIIWGGDLYSSILPPFTL 110 Query: 121 RYKLFYPLRRLAQKRVGCV-FATRGDLSFFAKTHPK--VRGELLF------FPTRMDPSL 171 R K+ + ++++ R V A GD+S K + K V ++ P +D +L Sbjct: 111 RKKIGFFIKKVGLIRFKHVHTALEGDVSIARKLYNKKLVFNRFVYPTLDESIPFDIDLAL 170 Query: 172 NTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEE 231 + R+ K+ I +GNS D SN H+ RA+ G +++ P+ Y ++ Y Sbjct: 171 KNRGDGRK---KIKIQIGNSADPSNNHLEVFRAIKGHLGSDFEILCPLSY--GDQDYATN 225 Query: 232 VRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCV 291 V + G E++ ++ + L++ + +D Y+ L D RQQG+G L L + G Sbjct: 226 VIRVGKEMWG-DSFRPLTDFMSYDNYVRELSSVDCLILNHKRQQGLGNLNLALSLGAKVF 284 Query: 292 LNRENPFWQDMTEQHLPVLFTTDDLNE----DIVREAQRQLASVDKNTIAFFSPNYLQGW 347 + + ++D + V T L E D + A+ +++ D F + + W Sbjct: 285 VRSDTTTYKDYSSMGFKVYDTKKILRECLPSDFIFSAKTAVSNRDCVFKNFSTQKSRELW 344 Query: 348 QRA 350 ++ Sbjct: 345 EKI 347 >UniRef50_C3XKD1 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XKD1_9HELI Length = 356 Score = 207 bits (526), Expect = 6e-52, Method: Composition-based stats. Identities = 77/357 (21%), Positives = 138/357 (38%), Gaps = 23/357 (6%) Query: 15 HNRTVLRFFNDALAATSEHAREFMVV----GKDDGLSDSCPALSVQFFPGKKSLAEAVIA 70 + L + F V+ G+ + + +V + G K + + Sbjct: 9 YEDKFTPQLIYKLNTYFADKKTFYVLVDKKGRKEFPKELLAYDNVMIYQGIKDFWKLLKI 68 Query: 71 KAKANRQQRFFFHGQF-NPTLWLALLSGGIKPSQFFWHIWGADLY-ELSSGLRYKLFYPL 128 A+ + ++ F L + LL+ I + W +W AD+Y S L KL+ Sbjct: 69 ---ASGARVVVYNALFYRFFLQMQLLACAIFKPK-VWIVWSADMYLRDCSNLLKKLYN-- 122 Query: 129 RRLAQKRVGCVFATRGDLSFFAKTH-PKVRGELLFFPTRMDPSLNTMANDRQREGKMTIL 187 R L + GD + + K + FFP D + + I Sbjct: 123 RFLVSRFAYLATPIEGDFANYQKIWGFGAKNLRFFFPFSQDILKIPLTQ--KESQTTWIQ 180 Query: 188 VGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQI 247 VGNSG +N H+ L + +K+V+P+ Y N+ Y + V A E+F EE + I Sbjct: 181 VGNSGHFTNRHLEVLEMLKCYKDKDIKIVIPLSYGC-NKDYQQSVESAYREVFGEEKIWI 239 Query: 248 LSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHL 307 L E L F Y+ LL D+G F Q+ + LL G C + +N + +M++ Sbjct: 240 LKENLPFVEYVKLLGYIDIGIFHHFVQEAGHNVMLLEAFGKKCYICSQNTLY-NMSKVVF 298 Query: 308 PV-LFTTDDLN---EDIVREAQRQLASVDKNTIAFF--SPNYLQGWQRALAIAAREV 358 V +F T+DL + + + + + +F + ++ + +E+ Sbjct: 299 NVKVFRTEDLENSPFEEFIAWDSKDSRENMEKMRYFVSDEFFKDEIEKFYDVLQKEI 355 >UniRef50_C9CTI5 Rb124 n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CTI5_9RHOB Length = 353 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 47/287 (16%), Positives = 93/287 (32%), Gaps = 38/287 (13%) Query: 24 NDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFFFH 83 + ++ H + F+V +D S + ++ F P H Sbjct: 37 EIVVLPSNSHQKSFVVKAQDIWPSTATFTGNLAFAPE-----------------DIVIVH 79 Query: 84 GQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATR 143 F + + + W +WG DL+ ++ F C + Sbjct: 80 SLFLQHSFEISKQLIQRRAHVVWCMWGGDLHMVAQAPAGFDF-------LNGFSCAISFC 132 Query: 144 GDLSFFAKTH-PKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAAL 202 G+L + + P++ G ++ + + I++GNSGD SN+H+ L Sbjct: 133 GELVRYPQITLPEIPGSC--------HKVDASTQSSDLDKEKLIILGNSGDPSNDHLYML 184 Query: 203 RAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLR 262 + + +P Y Y + +L E + L + Y +++ Sbjct: 185 ELASRFKDH--RYHIPFAYNGTP-DYRARLIDKARDLGVWEKTTLQEGMLPLEEYNSIIA 241 Query: 263 QCDLGYFIFARQQGIGTLCLLIQAGIPCVLNR--ENPFWQDMTEQHL 307 + +L + RQQ +G+L + R P Q M Sbjct: 242 RAELYFAAHNRQQALGSLASAYLNNTRVFMRRVITTPSGQTMANPGY 288 >UniRef50_C3XFT2 4-alpha-L-fucosyltransferase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XFT2_9HELI Length = 352 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 53/335 (15%), Positives = 114/335 (34%), Gaps = 28/335 (8%) Query: 15 HNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKA 74 +++ F + + +++ K S P S S Sbjct: 31 NDKFNKPFVDFLNKYFDKREH--LILCKRTFNEFSFPEGSNVIEVMDYSGLNF-----SC 83 Query: 75 NRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQK 134 N ++ H F+ L + ++ +W +WG DLY + Sbjct: 84 NNIKKIICHSLFDEQLVDLFYNNRDLLNKSYWIMWGGDLYHPVRDEKNDFVRK------- 136 Query: 135 RVGCVFATRGDL-SFFAKTHPKVRGELLFFPTRMDPSLNTMAND--RQREGKMTILVGNS 191 DL +A + G+ + + P M ++ +++ +TI + NS Sbjct: 137 ---HFKGYHSDLDKEYALQTYGMEGK-FYRSFYIFPLSREMLDNTVKKQTDCVTIQINNS 192 Query: 192 GDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEK 251 D+S + L + + + V + Y + Y ++ G E+F + + L + Sbjct: 193 SDKS--TLEMLDILAKFRDKDIVVRTVVSY--GDTRYNNDIIAKGREIFGNK-FEYLDKL 247 Query: 252 LEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLF 311 L Y L Q D+ A Q+G G + G+ + RE+ + + + Sbjct: 248 LSSHEYAQYLAQNDILILNQANQEGFGNTIASLYLGVKVFIRRESSVYGYLNNDGCHIYD 307 Query: 312 TTD--DLNEDIVREAQRQLASVDKNTIAFFSPNYL 344 + + +L+ D A+ + +++ +YL Sbjct: 308 SMNIVNLSFDEFISNPYSSANKKETREKYYNEDYL 342 >UniRef50_Q8KWB8 RB124 n=1 Tax=Ruegeria sp. PR1b RepID=Q8KWB8_9RHOB Length = 345 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 55/257 (21%), Positives = 94/257 (36%), Gaps = 22/257 (8%) Query: 76 RQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKR 135 Q H F T + + + W +WG DL+ L++ F R Sbjct: 65 PQDITIIHSLFLQTSFDIATQLLQQRAHVVWCVWGGDLHMLATAPGGVEF-------LNR 117 Query: 136 VGCVFATRGDLSFFAK-THPKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDR 194 C+ + G+ + K T P+V G + +A + E + I++GNSGD Sbjct: 118 FSCMISFYGETILYPKLTTPEVLGTCY--------KSDAVAAEEGGEKEKLIVLGNSGDP 169 Query: 195 SNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 SN+H+ L + + +P Y E Y + + Q EL + L + E L Sbjct: 170 SNDHLYLLELASRFKEH--RFHLPFAYNVTPE-YRQSILQKAEELGMLDRLTLQEEMLPI 226 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNR--ENPFWQDMTEQHL-PVLF 311 + Y +++ + ++ + RQQG+G LC + P + M +L Sbjct: 227 NEYNSIIARAEMVFTAHHRQQGLGLLCSAYLNNCRVFMRHVITTPSGETMANPGYMHLLS 286 Query: 312 TTDDLNEDIVREAQRQL 328 DI L Sbjct: 287 YGYVDVADICSLEDEDL 303 >UniRef50_A2G6B1 Glycosyl transferase, group 1 family protein n=1 Tax=Trichomonas vaginalis RepID=A2G6B1_TRIVA Length = 389 Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats. Identities = 52/283 (18%), Positives = 93/283 (32%), Gaps = 26/283 (9%) Query: 88 PTLWLALLSGGIKPSQFF--WHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGD 145 PTL L IK +F WH G + + + +K+ L + + Sbjct: 106 PTLPFCWLLRVIKGKRFVIDWHNLGWSILQCNKSRGWKVLKFLEYITGRWSDGNITVTNA 165 Query: 146 LSFFAKTHPKVRGELLFFPTRMDP-----SLNTMANDRQREGKMTILVGNSGDRS----- 195 L + H + P+ + E + I+ S Sbjct: 166 LQAHLREHKIESAVVYDKPSNLFKPTRELRSKYAKQLNLEENSIWIMSSTSWTPDEDIDM 225 Query: 196 -NEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 N L + + ++ G PN A+I+EV G + + L + Sbjct: 226 INRTAEILDKELGEKKKNITFIIS-GKGPNQRAFIQEV--KGRNYMNIDFCYP---FLPY 279 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLF 311 + Y LL CD G + G +I AG+P + R + + + E +LF Sbjct: 280 EQYAELLGSCDAGVSLHKSSSGFDLPMKGLDMIGAGLPLLSVRYSCIDELVHEGVDGLLF 339 Query: 312 TTDDLNEDIVR----EAQRQLASVDKNTIAFFSPNYLQGWQRA 350 + +I+R E + + K +I + + W+RA Sbjct: 340 NDEQELANIIRSCFIEKTIDIEKIRKGSIEAGAEKWAGLWERA 382 >UniRef50_C1FUB2 Putative uncharacterized protein n=1 Tax=Clostridium botulinum A2 str. Kyoto RepID=C1FUB2_CLOBJ Length = 491 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. Identities = 65/362 (17%), Positives = 133/362 (36%), Gaps = 39/362 (10%) Query: 21 RFFNDALAATSEHAREFMVVGKDDG----LSDSCPALSVQFFPGKKSLAEAVIAKAKANR 76 RF + +F+++ D + DS +V+ + + Sbjct: 136 RFIELINNNLKQEEHKFIIIKNIDYNFRYMKDSLKYDNVEILYDRYFENKLYYYVK---N 192 Query: 77 QQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLF----------Y 126 + + + ++ K ++ W +WG D+YE ++ Y + Y Sbjct: 193 SKAIYIYCLYDYICEFICKYKIYKEAELNWTVWGGDVYEYTNIEIYDQYTREFLIKNNLY 252 Query: 127 PLRRL--------AQKRVGCV-FATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMAND 177 RL A K++ + GD K + L FP D Sbjct: 253 IDERLKNSEYRINAIKKIDYILTPIYGDYKIIKKNYN-TNARLKSFPFVYDIINYKNQCL 311 Query: 178 R-----QREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEV 232 + +++ K L+GNSG S+ H+ + + + V+ P+ Y N+ YIE++ Sbjct: 312 KSAYNLKKKYKYVFLLGNSGYPSSNHLDIIYKLKEIKNKNFCVLCPLSY--GNKNYIEKL 369 Query: 233 RQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVL 292 + ++ E + L+ +E D Y A+L + D+ RQQ +G + LL+ G L Sbjct: 370 IKVSKDILGERFI-PLNNFMELDEYTAILDEVDVAIMNHNRQQAVGNMILLLYLGKKIFL 428 Query: 293 NRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKN---TIAFFSPNYLQGWQR 349 + + + E+ + ++ ++I + QL S+ F + L+ ++ Sbjct: 429 KKSVTTFSFLQEKGFQIFDI-ENFVDNINSIERIQLNSLKNQEAVIKNFSNDKVLEIYKE 487 Query: 350 AL 351 Sbjct: 488 IF 489 >UniRef50_B5JRF7 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JRF7_9BACT Length = 393 Score = 170 bits (431), Expect = 7e-41, Method: Composition-based stats. Identities = 56/397 (14%), Positives = 119/397 (29%), Gaps = 69/397 (17%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATS---EHAREFMVVGKD----DGLSDSCPAL 53 M L+H+ + RF + ++ F V ++ + Sbjct: 1 MKPLLHIAADE---------RFIDRGISLFERALPGQNRFWVWQRNGENSLEFVKTLRPD 51 Query: 54 SVQFFPGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADL 113 + + A + FH ++ + + +W +GA+ Sbjct: 52 KDRVISHDTPWWK--HAFIDKTAYRAVLFHNLYSHPQIFLANNLPAELP-AYWLFFGAEY 108 Query: 114 YELSSGLRYKLFYPL------------------------------------------RRL 131 Y + PL +R Sbjct: 109 YNDPHFFKESTIGPLTLDLPRSKKTRDARSTPLKRLKAFLQNRLSRAYRAQNPRARDKRA 168 Query: 132 AQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNS 191 A +R+ C+ + K ++ L F T + K IL+GNS Sbjct: 169 AFQRINCIATHLPNEMEAIKKSLEIAPLWLNFSYYTIEDFKTDNCET-LPKKHQILLGNS 227 Query: 192 GDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEK 251 SN H+ AL+ + Q + P+ Y + Y + ++ + ELF + + + Sbjct: 228 ATPSNNHLEALQLLKQLSFKG-TIKCPLSY--GDATYRDALKTSANELFGA-SFESIESY 283 Query: 252 LEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLF 311 L D Y L+ + + RQQ +G + + G ++ +P + + Sbjct: 284 LPLDDYNRLIAESSVVVMNHYRQQALGNIITALWYGTRVFISDRSPALLYFQKLGCIIHS 343 Query: 312 TTDDLNED---IVREAQRQLASVDKNTIAFFSPNYLQ 345 DL + +L + + + A+ + +++ Sbjct: 344 IERDLKSPKQLVSLSDTEKLTNRNILSQAYAAETFIK 380 >UniRef50_Q0HKL5 Putative uncharacterized protein n=1 Tax=Shewanella sp. MR-4 RepID=Q0HKL5_SHESM Length = 394 Score = 168 bits (424), Expect = 4e-40, Method: Composition-based stats. Identities = 64/395 (16%), Positives = 116/395 (29%), Gaps = 66/395 (16%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPG 60 M + H+ + T + F F + +L Q Sbjct: 1 MLKIAHIAPDE--KFIETAVDIFETVYPC----QNTFYITSSKPWTFIEDNSLYRQLN-K 53 Query: 61 KKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFF-WHIWGADLYE---L 116 KK L + + FHG ALL + Q + W WG D Y Sbjct: 54 KKWLKLLFTQRKELKDYDLVIFHGL-----PCALLIPMVLLKQNYAWLGWGYDYYSRPFD 108 Query: 117 SSGLRYKLFYP------------------------------------LRRLAQKRVGCVF 140 S L L P +LA + + Sbjct: 109 SDLLAEPLVLPKTMEYTKTFINDENRSFDVLNHIVKSLIKMLVCSKSFYQLAMRNLKVFS 168 Query: 141 ATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMT---ILVGNSGDRSNE 197 K + + + P + + Q + IL+GNS +N Sbjct: 169 PVLPQEYDLVKEKYGLGKDTQYSPWNYGILERHIIKNIQLGEIYSANAILLGNSATATNN 228 Query: 198 HIAALRAVHQQFGDTVKVVVPMGYPPNNEAY---IEEVRQAGLELFSEENLQILSEKLEF 254 H+ AL + + G + +++P+ Y +E Y I+E +LFS+ Q+L + Sbjct: 229 HLEALDIIEK-TGSSRTIILPLSY--GDEKYAKLIKEYINNNPQLFSQ--CQVLDNFMPL 283 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTD 314 Y A++ C RQQ +G + ++ G L E+ ++ V + Sbjct: 284 SEYNAIINSCGFVIMNHVRQQALGNIVAMMYRGSKVFLREESVLYKYFKSMSAYVYSVQE 343 Query: 315 DLNEDIVREA---QRQLASVDKNTIAFFSPNYLQG 346 + + ++ + +S + Sbjct: 344 LEVNPSLLHSHLDTYEVEHNRLILKSVWSEKVILQ 378 >UniRef50_C3RR99 Predicted protein n=1 Tax=Mollicutes bacterium D7 RepID=C3RR99_9MOLU Length = 372 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 59/374 (15%), Positives = 130/374 (34%), Gaps = 44/374 (11%) Query: 5 IHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSL 64 IH++ ++ +L F ++ + F+ K+ F K Sbjct: 12 IHLMYGHDTKFSKLLLDFISNPENGFEINQHLFVTPYKN-------------VFDDLKQY 58 Query: 65 AEAVIAKAKANRQQRFFFH-----GQFNPTLWLALLSGGIKPSQFFWHIWGA----DLYE 115 + ++ ++ N + ++ H L+ LL+ ++ + WG E Sbjct: 59 SNVLLDESNKNLYKVYYKHCHLIISHSGEELYRILLTPKKIKNKVVYRYWGGMRILQYDE 118 Query: 116 LSSGLRYKLFYPLRRLAQKRVGCVFATRG--------DLSFFAKTHPKVRGELLFFPTRM 167 S L +++ K+ FA G DLS K + L + + Sbjct: 119 NSKTFGESLKLKVKKYILKKSFSEFAAIGIANITDIIDLSRILKK--DTKYYRLSYASNE 176 Query: 168 D-------PSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMG 220 + ND + K +L+G+ G N HI L+ + + + + +P+ Sbjct: 177 YYDTVNKLKQKLDIENDINKRYKKRVLLGHRGTEENNHIEILKRLSKYNSENFDIFIPLS 236 Query: 221 YPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTL 280 Y + YI+ V E S+ N+ I+ + ++F Y L D+ F +G L Sbjct: 237 Y--GEKKYIQNVENYVKEN-SKGNIVIIKQFMKFSEYAEFLSTIDIAIFDGYTSYALGNL 293 Query: 281 CLLIQAGIPCVLNRENPFWQDMTEQH--LPVLFTTDDLNEDIVREAQRQLASVDKNTIAF 338 +++ LN + + ++ + ++ + + + A+ + Sbjct: 294 GIILFFNKTVYLNENGVIAKALESENNDYKKISDIGKISFEEFSKPMKYPANYTSDLCII 353 Query: 339 FSPNYLQGWQRALA 352 + ++ W + LA Sbjct: 354 STEERIKNWNKLLA 367 >UniRef50_B3CFK0 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CFK0_9BACE Length = 382 Score = 159 bits (403), Expect = 9e-38, Method: Composition-based stats. Identities = 45/383 (11%), Positives = 112/383 (29%), Gaps = 55/383 (14%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKS 63 ++H++ + V+ + + ++ ++ Sbjct: 3 ILHLII------DHQVIERMLGVYENVFPYHNDVVIFSLTTDFKHLRKYKECPVI--LRN 54 Query: 64 LAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYE-------- 115 + H + + W ++GADLY Sbjct: 55 QGRKEGKVFDFSSYTHIIAHYLTMDMIDFIKSAPIDV--HVCWEVYGADLYNQFLEPNGF 112 Query: 116 ---LSSGLRYKLFYPLRRLAQKRVGCVFATRG-----------DLSFFAKTHPKVRGELL 161 + +RY + RR +G + + ++ Sbjct: 113 KLYYTDPVRYDKYRVFRRYLPYLFKLALEVKGYKYQFNFQINKQFKYISHRINSIQHCCY 172 Query: 162 F-------------FPTRMDPSLNTMANDRQREGKM----TILVGNSGDRSNEHIAALRA 204 + + + + + ++ TI+VGNS SN H+ L Sbjct: 173 YDVALIEQYASRKIYSYEVFNYSLSEVLGKLKDTPFFDGDTIMVGNSASYSNNHLYVLNF 232 Query: 205 VHQQ-FGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQ 263 + + D ++ + + Y ++ Y+ EV A F ++ +++L+ L Y + + Sbjct: 233 LKRMDLKDELRFTLVLSYG-GSKQYVSEVENAYKSSFPQK-VEVLTSYLPLQVYNQIFLK 290 Query: 264 CDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVRE 323 RQ+ IGT+ + G+ ++ +P ++ + V ED+ Sbjct: 291 VRSMIMSAWRQESIGTIIMGFYLGVKVFMSERSPLYKWFVDCGFNVFAIETAKEEDLDTP 350 Query: 324 AQRQLASVDKNTIAFFSPNYLQG 346 + ++ + Y + Sbjct: 351 LSIKDKQRNREIV---LERYNEE 370 >UniRef50_D1PK33 Putative uncharacterized protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PK33_9FIRM Length = 171 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 65/169 (38%), Gaps = 7/169 (4%) Query: 187 LVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQ 246 ++GNS +SN H L + + + + +P+ Y + Y + + E + + Sbjct: 1 MIGNSATKSNRHEEVLNWLSKYSDKEITIYMPLSY--GDSEYRNRIIRISKEKYGISAV- 57 Query: 247 ILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQH 306 + + + +Y+ L D+G RQQG+G + L+ G + RE W+ + Sbjct: 58 PIVQYMNTLSYVKFLSTMDIGIINCNRQQGMGNILFLLALGKKVYIRRETTMWESYCSKG 117 Query: 307 LPVLFTTD--DLNEDIVREAQRQLASVDKNT--IAFFSPNYLQGWQRAL 351 + +L + + ++N F Y++ W L Sbjct: 118 YTIFDAAKIPELTYEQFIHFTSKDKEKNENICDQTFTYEQYIREWNDFL 166 >UniRef50_D1JV60 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JV60_9BACE Length = 276 Score = 116 bits (291), Expect = 9e-25, Method: Composition-based stats. Identities = 47/260 (18%), Positives = 79/260 (30%), Gaps = 46/260 (17%) Query: 31 SEHAREFMVV---GKDDGLSDSCPALSVQFFP-GKKSLAEAVIAKAKANRQQRFFFHGQF 86 F+V+ G +V F K AV NR Q H Sbjct: 23 FPDENTFIVLVCKGLRSPKYVKKDFGNVLFLEYDTKLFWNAV---GDINRYQSIIIHFLS 79 Query: 87 NPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLR-----------------------YK 123 ++ IK +W WGADLY R YK Sbjct: 80 GDSVN---FLNRIKHHNIYWIAWGADLYSGLLEERGYKLYESVDILWRISKWKIPYFIYK 136 Query: 124 LFYPLRRLA-----QKRVGCVFATRGDLSFFAK----THPKVRGELLFFPTRMDPSLNTM 174 L Y +RR V D + ++ L + P + + Sbjct: 137 LVYKIRRKINTDRMLTGAKKVHYFVPDSMYDEYPLLLSYYPELAHLEYRDFFYYPIDDIL 196 Query: 175 ANDRQREGKM--TILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEV 232 D + ++++GNS + H++ + + ++VP+ Y N++Y V Sbjct: 197 GEDLINTTSIGRSVIIGNSSSPTGNHLSVISYLRNFSLGDKDIIVPLSY--GNKSYAALV 254 Query: 233 RQAGLELFSEENLQILSEKL 252 Q G+ F E + + + Sbjct: 255 EQEGMYAFGENFKVVKNFFV 274 >UniRef50_A4QXH2 Beta-1,4-mannosyltransferase, putative n=8 Tax=Sordariomycetes RepID=A4QXH2_MAGGR Length = 486 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 48/294 (16%), Positives = 91/294 (30%), Gaps = 45/294 (15%) Query: 106 WHIWGADLYELSSGLRYKLFYPLR-------RLAQKRVGCVFATRGDLSFFAKTHPKVRG 158 WH +G + + G R+ + R + A L Sbjct: 164 WHNYGWTILSGTRGARHPFVRISKLYECLFGRFGSANLTVTHAMARQLKRAPYGIKSPIV 223 Query: 159 ELLFFPTRMDP----------------SLNTMANDRQREGKMTILVGNSGDRSNEHIAAL 202 + P + +A I+ S + L Sbjct: 224 PMHDRPAAIFKPLNDPMAKLDILSRILESRDLAAAIVDRRTRLIVSSTSWTPDEDFNLLL 283 Query: 203 RAVHQQFG---DTVKVVVPM-----GYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 A+ Q D +++VP+ G P Y ++++ E N+ I + L F Sbjct: 284 SALVQYANSMQDDSQIIVPVVAVITGKGPQKAMYEAKIKKMA-EDGLVPNVTIRTAFLSF 342 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLP--V 309 + Y ALL DLG + G+ + AG+P V + ++ + Sbjct: 343 EDYAALLASADLGVCLHMSSSGVDLPMKVVDMFGAGLPVVAYSAYESFSELVREGENGRG 402 Query: 310 LFTTDDLNEDIVR----EAQRQLASVDKNTIAFFSPNYLQGWQ----RALAIAA 355 T +L ++ R E Q +L + + + S + + W R L +++ Sbjct: 403 FETAGELTAELTRLLSVEGQEELKHLRQGAVLEGSRRWDEEWDASVARILGLSS 456 >UniRef50_Q4ACG5 4-alpha-L-fucosyltransferase (Fragment) n=1 Tax=Edwardsiella tarda RepID=Q4ACG5_EDWTA Length = 66 Score = 69.0 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 29/66 (43%), Positives = 39/66 (59%), Gaps = 2/66 (3%) Query: 1 MTVLIHVLGSDIPHHNRTVLRFFNDALA--ATSEHAREFMVVGKDDGLSDSCPALSVQFF 58 MT LIHVL +DIPHHN T+LRFF+ L+ A + R FMVV +D L L ++ Sbjct: 1 MTTLIHVLSADIPHHNLTLLRFFDGMLSQRAATAPRRRFMVVARDAALVVDLTTLDIEAC 60 Query: 59 PGKKSL 64 ++L Sbjct: 61 VNYRAL 66 >UniRef50_Q22797 Putative uncharacterized protein n=3 Tax=Caenorhabditis RepID=Q22797_CAEEL Length = 491 Score = 60.6 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 65/183 (35%), Gaps = 20/183 (10%) Query: 181 EGKMTILVGNSGDRSNEHIAALRAVHQQFGDTV----KVVVPMGYPPNNEAYIEEVRQAG 236 + L S L A+ T+ +++ G P Y++E+ + Sbjct: 266 TRPIVFLSSTSWTPDERFEILLDALVAY-DKTIGLPRVLMIITGKGPLKAKYLQEIHEK- 323 Query: 237 LELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLN 293 + +N+ +L+ LE + Y +L DLG + G+ + A +P + Sbjct: 324 ----NLKNVDVLTPWLEAEDYPKILASADLGISLHTSTSGLDLPMKVVDMFGAKVPALAL 379 Query: 294 RENPFWQDMTEQHLPVLFTTDDLNEDIVREAQR-------QLASVDKNTIAFFSPNYLQG 346 + + + E+ LF + + E R +L + KNT ++ Sbjct: 380 KFKCIDELVEEKTNGYLFDDSEQLSRQIIELSRGFPNNCNELIRLKKNTQEQKFDSWEVM 439 Query: 347 WQR 349 W+R Sbjct: 440 WKR 442 >UniRef50_A1DPC9 Beta-1,4-mannosyltransferase (Alg1), putative n=6 Tax=Trichocomaceae RepID=A1DPC9_NEOFI Length = 461 Score = 57.1 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 42/290 (14%), Positives = 86/290 (29%), Gaps = 42/290 (14%) Query: 106 WHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFA---TRGDLSFFAKTHPKVRGELL- 161 WH +G + L G R+ L + + A ++ K H + +L Sbjct: 173 WHNFGYTILALKLGDRHPLVRFSKWYEKSFCRYATAHFCVTEAMASILKNHFGLTAPILP 232 Query: 162 --FFPTRMD----------------PSLNTMANDRQREGKMTILVGNSGDRSNEHIAALR 203 P P + + Q I+ S + + Sbjct: 233 LHDRPASHFQPIFDQSEQKSFLESLPETAPVKDLLQAGSLRVIVSSTSWTADEDFSLLID 292 Query: 204 AVHQQFGDTVK--------VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFD 255 A+ + + + G P E Y++++ E + I + L D Sbjct: 293 ALCRYSNLANTSKPALPAVLAIITGKGPQKEMYLKQI-SKLQEAGKLSKVTIRTTWLTTD 351 Query: 256 AYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLP--VL 310 Y LL LG + G+ + AG+P + W ++ + + Sbjct: 352 DYARLLASASLGISLHTSSSGVDLPMKVVDMFGAGLPVLGWDRFQAWPELVTEGVNGMGF 411 Query: 311 FTTDDLNEDIV--REAQRQLASVDKNTIAFFSPNYLQGWQ----RALAIA 354 ++ +L + +V E +L + + + W + L +A Sbjct: 412 GSSGELLDHLVDLFENPSKLEKIRTGARKESNRRWNDEWDPIAGKLLGLA 461 >UniRef50_A9VC87 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VC87_MONBE Length = 426 Score = 54.0 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 42/310 (13%), Positives = 95/310 (30%), Gaps = 52/310 (16%) Query: 88 PTLWLALLSGGIKPSQFF--WHIWGADLYELSSGLRYKLFYPLRRL-------------- 131 P L +A + + ++ WH +G + L +G R+ L+ + + Sbjct: 113 PVLPIAAVVSRCRGARLVVDWHNYGYTILALKTGTRHPLYNFAKFVETTFGPWGHRHLCV 172 Query: 132 ---AQKRVGCVFATRGDLSFFAKTHPKVRGEL---LFFPTRMDPSLNTMANDRQREGK-- 183 Q+ + + S + L + S + + REG Sbjct: 173 THAMQRDLKDNWNIEPLFSHLTHRVLCAAVQERPDLSQRFGISHSDDAQSALTTREGSNY 232 Query: 184 -------MTILVGNSGDRSNEHIAALRAVHQQFGDTVK-------VVVPMGYPPNNEAYI 229 I+ S + A+ + + + G P Y Sbjct: 233 KRRAGRPALIVSSTSWTPDEDFGILFEALQGYEQAAQRDSSLPHVLCIITGKGPERARYE 292 Query: 230 EEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQA 286 + V++ + + +++ L + Y LL DLG + G+ + + Sbjct: 293 QLVQEQA-----WQKVAVMTVWLALEDYPKLLASADLGISLHTSSSGLDLPMKVVDMFGS 347 Query: 287 GIPCVLNRENPFWQD-MTEQHLPVLFTTDDLNEDIVR-----EAQRQLASVDKNTIAFFS 340 GIP + + +++ V +L++ + +A +L + + AF Sbjct: 348 GIPVCAVDFQCLSELVVHDENGAVFKNAQELSQQLQELLRAPDANTKLGQLKTHVQAFRR 407 Query: 341 PNYLQGWQRA 350 + W + Sbjct: 408 QGWSANWNQV 417 >UniRef50_C5DUF7 ZYRO0C16368p n=1 Tax=Zygosaccharomyces rouxii RepID=C5DUF7_ZYGRO Length = 448 Score = 54.0 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 81/252 (32%), Gaps = 22/252 (8%) Query: 114 YELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNT 173 Y L+ K + + V R + F + + R LL P Sbjct: 198 YHLTVTKAMKKYLVAKFGLNPMRIAVLYDRPAVQFKPLQNDEERLRLLQEPFVAP--YIP 255 Query: 174 MANDRQREGKMTILVGNSGDRSNE------HIAALRAVHQQFGDTV-KVVV-PMGYPPNN 225 D + K+ I S + + +Q+F T+ K++ G P Sbjct: 256 QGFDINKGDKI-IATSTSFTPDEDLGILFGALKIYENSYQKFDHTLPKILCFVTGKGPLK 314 Query: 226 EAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL--- 282 E Y++EV++ F I L + Y L+ CD G + G+ Sbjct: 315 EKYVKEVQE-----FEWNRCHIEFLWLSAEDYPRLISLCDYGVSLHKSSSGLDLPMKILD 369 Query: 283 LIQAGIPCVLNRENPFWQDMTEQ--HLPVLFTTDDLNEDIVREAQRQLASV-DKNTIAFF 339 ++ G+P + + + +T L L + E I + + + K + Sbjct: 370 MLGCGVPAIAFNYDTLDELITHDINGLKFLDRRELHEELIFAVKDQNVNNRLKKGALLES 429 Query: 340 SPNYLQGWQRAL 351 + W+ A+ Sbjct: 430 RVRWNSSWETAM 441 >UniRef50_P16661 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=10 Tax=Saccharomycetaceae RepID=ALG1_YEAST Length = 449 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 62/202 (30%), Gaps = 19/202 (9%) Query: 165 TRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPM----- 219 T+ + +G I+ S + L A+ VK + Sbjct: 247 TKAFIKNYIRDDFDTEKGDKIIVTSTSFTPDEDIGILLGALKIYENSYVKFDSSLPKILC 306 Query: 220 ---GYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQG 276 G P E Y+++V + + + QI L + Y LL+ CD G + G Sbjct: 307 FITGKGPLKEKYMKQVEE-----YDWKRCQIEFVWLSAEDYPKLLQLCDYGVSLHTSSSG 361 Query: 277 IGTLCL---LIQAGIPCVLNRENPFWQDMTE--QHLPVLFTTDDLNEDIVREAQRQLASV 331 + + +G+P + + + L + + I L Sbjct: 362 LDLPMKILDMFGSGLPVIAMNYPVLDELVQHNVNGLKFVDRRELHESLIFAMKDADLYQK 421 Query: 332 DKNTIAFFSPN-YLQGWQRALA 352 K + + N + W+R + Sbjct: 422 LKKNVTQEAENRWQSNWERTMR 443 >UniRef50_A0BGC6 Chromosome undetermined scaffold_106, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0BGC6_PARTE Length = 433 Score = 52.5 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 186 ILVGNSGDRSNEHIAALRAVHQQFG-DTVK--------VVVPMGYPPNNEAYIEEVRQAG 236 I+ S + + ++A+ + ++ VV G P E + E + Sbjct: 247 IVSSTSWTKDEDFNILVQALQKYEDLANIEQGREYRKLYVVITGKGPMKEEFREIFQ--- 303 Query: 237 LELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLN 293 ++++ L+ D Y LL DLG + G+ + AG P Sbjct: 304 KCNICWNHVKVNLAWLDIDDYPKLLACADLGICLHYSSSGLDLPMKVVDMFGAGTPVFAK 363 Query: 294 RENPFWQDMTEQHLPVLFTTDDLNED----IVREAQRQLASVDKNTIAFFSPNYLQGWQR 349 N + + Q ++F T D D R + L + K F + + Q W+ Sbjct: 364 SFNAISELVQHQKNGIVFDTPDDLFDHLSQAFRFESQILQQLKKGVETFRTETFDQEWRT 423 >UniRef50_D2V0T5 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V0T5_NAEGR Length = 425 Score = 52.1 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 74/216 (34%), Gaps = 23/216 (10%) Query: 154 PKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTV 213 P++ G L R + ++ D R K+ + S + L ++ + Sbjct: 214 PEMFGSNLSDSERNTLFKDVLSIDTSRNFKLVV-SSTSWTEDEDFSILLSSIMELEKKLE 272 Query: 214 KV-------VVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDL 266 + + G P E Y++++ + + ++ + L + Y LL D+ Sbjct: 273 SISPSIYLEFIITGKGPQKEYYLKKI-----ASLNLKYCRVQTYFLSYADYSKLLASSDV 327 Query: 267 GYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDM-TEQHLPVLFTTDDL-----N 317 G + G+ + +G+P + + + +++ + ++ +L Sbjct: 328 GVCLHYSSSGLDLPMKVVDMFGSGLPVCAIKYLTLPELVKHDENGYIFDSSTNLTKYLEE 387 Query: 318 EDIVREAQRQLAS-VDKNTIAFFSPNYLQGWQRALA 352 I E +L S + F S + W +A Sbjct: 388 LLISPEGSSKLKSMRNHLKQNFQSHRWNDEWNSKVA 423 >UniRef50_B5DX75 GA26165 n=6 Tax=Drosophila RepID=B5DX75_DROPS Length = 447 Score = 51.7 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 59/191 (30%), Gaps = 27/191 (14%) Query: 180 REGKMTILVG-NSGDRSNEHIAALRAVHQQFGDTVK--------VVVPMGYPPNNEAYIE 230 + + ILV S + L+A+ + + V G P E Y Sbjct: 251 KPQRQAILVSSTSWTPDEDFGLLLKALQAYEKTALAEPQIYPALLCVITGKGPQKEQYEA 310 Query: 231 EVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAG 287 E+ + + I++ LE + Y ++L DLG + G+ + +G Sbjct: 311 EI-----AKMHWQKVSIVTPWLEIEDYPSILASADLGVCLHWSTSGLDLPMKVVDMFGSG 365 Query: 288 IPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASV----------DKNTIA 337 +P + + +F+ + +R + ++ Sbjct: 366 LPVCAYNFKCLDELVKHGENGFVFSDHHELAEQLRIWFENFPNNPSIQETQSRFGRSLQQ 425 Query: 338 FFSPNYLQGWQ 348 F + + W+ Sbjct: 426 FQELRWRESWR 436 >UniRef50_O13933 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=1 Tax=Schizosaccharomyces pombe RepID=ALG1_SCHPO Length = 424 Score = 51.3 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 28/270 (10%), Positives = 74/270 (27%), Gaps = 30/270 (11%) Query: 106 WHIWGADLYELSSGLRY---KLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGE-LL 161 WH +G + L G ++ KL + + + +T Sbjct: 153 WHNFGYSILALKLGKQHTFVKLLKIYEKYMARGAYAHLTVSKRMKDVLQTWGMNPCYVCY 212 Query: 162 FFPTRMDP----------SLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGD 211 P S+ + + ++ S + I L ++ Sbjct: 213 DRPPNHFTPIKNEQKKQMSIKKIPCEYNPSSTKLLITSTSWTPDED-IYILWEALNEYDK 271 Query: 212 TVK----VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLG 267 T+ +V+ G P E + + +++ ++ L + Y ++ DLG Sbjct: 272 TLDTPKLLVLITGKGPMKEEFSQYIKKH-----PLHKVRFCMPWLSIEDYPQVMACADLG 326 Query: 268 YFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVREA 324 + G+ L G+P + + + + ++ ++ Sbjct: 327 VCLHTSSSGLDLPMKVVDLFGCGVPVIALSYPTISELVHDGENGLIVNDSKALSKKMQYL 386 Query: 325 QRQLASVDKNTIAFFSP---NYLQGWQRAL 351 ++ + + W + + Sbjct: 387 LTHANELNSLKLGALKESEYRWDDEWNKVI 416 >UniRef50_Q6C3K2 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=1 Tax=Yarrowia lipolytica RepID=ALG1_YARLI Length = 463 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 50/368 (13%), Positives = 100/368 (27%), Gaps = 51/368 (13%) Query: 23 FNDALAATSEHAREF-MVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFF 81 F++ L +++ L V + + K Sbjct: 81 FDEILNNDLIKIHHIPLILNTRKLPFVVFGILKV-----IRQHWLLISLLYKLRGADYLL 135 Query: 82 F-HGQFNPTLWLALLSGGIK--PSQFF--WHIWGADLYELSSGLRYKLFYPLRRLAQKRV 136 + PTL + ++ WH +G + L + + + Sbjct: 136 VQNPPSIPTLGVVRFYNLFLSTRTKVVLDWHNFGYTILALKLPETHPMVKFAKFYEGFFG 195 Query: 137 G------CVFATRGDLSFFAKTHPKVRGELL----FFPTRMD-PSLNTMANDRQREGKMT 185 G CV G + + G + P P + D R+ K T Sbjct: 196 GRAFVHLCVTVLMGQ---AMRKTFGMSGRRIVPLHDRPAFHFKPLSESEKLDVLRDFKET 252 Query: 186 -----------ILVGNSGDRSNEHIAALRAVHQQFGDTVKV----VVPMGYPPNNEAYIE 230 I+ S L A+ + + V+ G P ++ Sbjct: 253 LYDDMTADHKIIVSSTSYTPDENFNILLDALALYDESKLDLPPLRVIITGKGPMMPEFLA 312 Query: 231 EVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAG 287 +V E + + I + LEF Y +L LG + G + G Sbjct: 313 KV-----EKLQLKRVSIRTAWLEFADYPRILGAAHLGVSLHESSSGYDLPMKVVDMFGCG 367 Query: 288 IPCVLNRENPFWQDM--TEQHLPVLFTTDDLNEDI-VREAQRQLASVDKNTIAFFSPNYL 344 IP V + + + V + N + + + +L ++ + + + Sbjct: 368 IPVVSVDYAALSELVKTNTNGVAVKGHVEMGNTFMSLFSNRGKLDNIKRGAMIESRNTWD 427 Query: 345 QGWQRALA 352 Q W + + Sbjct: 428 QTWVKTVG 435 >UniRef50_UPI000180BB10 PREDICTED: similar to beta-1,4-mannosyltransferase n=1 Tax=Ciona intestinalis RepID=UPI000180BB10 Length = 465 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 68/198 (34%), Gaps = 22/198 (11%) Query: 168 DPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVP------MGY 221 + + + + ++ S + L A+ Q + + + +P G Sbjct: 266 FTHMGELGVSMKSDRPAILISSTSWTEDEDFSVLLEAL-QYYEENTSLDLPNILCVITGK 324 Query: 222 PPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLC 281 P Y +++ + + ++I++ LE Y LL DLG + G+ Sbjct: 325 GPQKSYYQKQIAAK-----NWKRVEIITPWLEASDYPKLLGSADLGVSLHTSSSGLDLPM 379 Query: 282 L---LIQAGIPCVLNRENPFWQDM-TEQHLPVLFTTDDLNEDIVR------EAQRQLASV 331 + + +P N + + + V + +L++ +V + + L + Sbjct: 380 KVVDMFGSSLPVAAINFNCLSELVQHNVNGFVFENSAELSKQLVNIFSDFPQDRTTLNRL 439 Query: 332 DKNTIAFFSPNYLQGWQR 349 K F + + + W + Sbjct: 440 SKEVEKFRNITWNEAWDK 457 >UniRef50_B6JZQ7 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JZQ7_SCHJY Length = 428 Score = 50.9 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 40/283 (14%), Positives = 89/283 (31%), Gaps = 25/283 (8%) Query: 88 PTLWLALLSGGIKPSQFF--WHIWGADLYELSSGLRYKLFYPLRRLAQ---KRVGCVFAT 142 PT ALL S+ WH +G + L G + L ++ + Sbjct: 135 PTFVFALLMRFCFGSRIVIDWHNFGFSILALKLGKNHMLVKIMKAYELFLGRFAYKHLCV 194 Query: 143 RGDLSFFAKTHPKVRGELLFF--PTRMDPSLNTMANDR----QREGKMTILVGNSGDRSN 196 +S +L+ P+ P N + ++ S Sbjct: 195 SNAMSEVLGNWGLKPTYVLYDRPPSHFKPLSKKPYNLLGTAFNPKTCKLLVSSTSWTPDE 254 Query: 197 EHIAALRAVHQQFGD---TVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLE 253 + +A+ + + + V G P + +++ V++ ++++ L+ L Sbjct: 255 DIFVLYKALEEYDAQPNASPILAVITGKGPMKQDFLDHVKEH-----PLQHVRFLTPWLS 309 Query: 254 FDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQ--HLP 308 Y LL DLG + G+ L GIP + + + + Sbjct: 310 TGDYPRLLACADLGVSLHTSSSGVDLPMKVVDLFGCGIPVLSLPFPAITELVKDGRNGKI 369 Query: 309 VLFTTD-DLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQRA 350 V + + + ++L+S+ + ++ + + W Sbjct: 370 VGDAHEMAVTIQNLFTNTKELSSLKRGAMSESKHRWDEEWDTV 412 >UniRef50_Q9VEE9 CG18012 n=9 Tax=Diptera RepID=Q9VEE9_DROME Length = 446 Score = 50.9 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 56/187 (29%), Gaps = 30/187 (16%) Query: 185 TILVGNSGDRSNEHIAALRAVHQQFGDT----------VKVVVPMGYPPNNEAYIEEVRQ 234 ++ S + L+A+ + ++ G P E Y+ E+ Sbjct: 256 VLVSSTSWTPDEDFGILLKALQAYEETAQAEPLVYPSLLCIIT--GKGPQKEHYVAEI-- 311 Query: 235 AGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCV 291 E + + +++ LE + Y +L DLG + G+ + +G+P Sbjct: 312 ---EKLQWQKVSVITPWLEIEDYPTVLASADLGVCLHWSTSGLDLPMKVVDMFGSGLPVC 368 Query: 292 LNRENPFWQDMTEQHLPVLFTTDDLNEDIVRE-------AQRQLASV---DKNTIAFFSP 341 + + +F + +R L + + F Sbjct: 369 AYDFKCLDELVKHGENGFVFGDHVQLAEQLRIWFENFPKNPSILETRAGFQRKIQEFQEL 428 Query: 342 NYLQGWQ 348 + + W+ Sbjct: 429 RWRESWR 435 >UniRef50_C9BE40 Capsular polysaccharide biosynthesis protein n=1 Tax=Enterococcus faecium 1,141,733 RepID=C9BE40_ENTFC Length = 367 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 41/369 (11%), Positives = 98/369 (26%), Gaps = 38/369 (10%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKS 63 + H+ H ++ L + V + ++ + Sbjct: 18 IPHIEMLIEKGHEVSIACSIEQPLKPYFNDRN--IKVYQVPFSRQPLSKQNILSY----- 70 Query: 64 LAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYK 123 + + K N + H + + + F+ G Y + L + Sbjct: 71 --KMLKKIIKENGIE--IIHTHTPVASLITRIVCKNMNVKVFYTAHGFHFYRGAPKLNWL 126 Query: 124 LFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELL----------FFPTRMDPSLNT 173 ++YPL + + + + A + L + +++ Sbjct: 127 VYYPLEKYLSRFTDTILTINQEDYQIASQKFHSKKVYLINGVGIPIEKYKKIQINTMRKK 186 Query: 174 MANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVR 233 + + IL +R+ H + A+ Q K + + Y ++ Sbjct: 187 AELGIKDKKTKIILSVGELNRNKNHTMVIEALKQFKDKNFKYFIC---GVGSLDYA--LK 241 Query: 234 QAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCV-- 291 + EE + +L + L +++ DL F R+ ++ + G+P V Sbjct: 242 EKIKNSDLEEKVVLLGYR---TDVLEIMKISDLFVFPSKREGLPVSVMEAMSIGLPVVAS 298 Query: 292 -------LNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPNYL 344 L ++N + L L L + + + Sbjct: 299 NIRGNMDLIQDNIAGKLFDVNALTELTEILQLFFSGDMPLDIYSNNASMSIQNYSKEKVA 358 Query: 345 QGWQRALAI 353 I Sbjct: 359 HQIADIYEI 367 >UniRef50_B1CBD5 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1CBD5_9FIRM Length = 379 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 36/343 (10%), Positives = 99/343 (28%), Gaps = 23/343 (6%) Query: 31 SEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRF-FFHGQFNPT 89 ++ ++F + L P K+L K F H P+ Sbjct: 38 NKDFKDFYNKHNINILQTDRGTFLFDRIPKLKTLVNMFRKVKKEAGNGMFDVIHIHSVPS 97 Query: 90 LWLALLSGGI---KPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQ------KRVGCVF 140 ++ + WG+DL + K L + Sbjct: 98 NFMITFLNKFIVRFGKKIVCTYWGSDLLSKTREQLMKAIPCLDKAECISYSSDGMDSYFH 157 Query: 141 ATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMAND---RQREGKMTILVGNSGDRSNE 197 GD+ K + + K+++ +G +G + Sbjct: 158 EVFGDIYNEKIVRAKFGISIYDVIDEEKKHKTKDECKEFFNIEKDKISVAIGYNGSLRQQ 217 Query: 198 HIAALRAVHQQFGD---TVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 H+ + + + D + +V+ + Y + Y + + ++ + I++ L Sbjct: 218 HLRVINELSKLNSDILDKLNIVIQLSYGLTCDEYRQNIIDEISKINVKH--VIINNFLNK 275 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTD 314 D L D+ ++ + AG ++N +++ + + + Sbjct: 276 DESAMLRIATDIFIHAQESDAFSASIQECVYAGS-ILVNPSWIMYKEFDDIGIDYIKYNS 334 Query: 315 DLNEDIVREA----QRQLASVDKNTIAFFSPNYLQGWQRALAI 353 ++ + + ++++ K + ++ + L + Sbjct: 335 FDELPMIIKDITDGKIKISNNGKVRELYKKYSWNAVKEDWLKL 377 >UniRef50_C4PYE2 Chitobiosyldiphosphodolichol alpha-mannosyltransferase n=1 Tax=Schistosoma mansoni RepID=C4PYE2_SCHMA Length = 471 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 41/318 (12%), Positives = 85/318 (26%), Gaps = 72/318 (22%) Query: 88 PTLWLALLSGGIKPSQFF--WHIWGADLYE---LSSGLRYKLFYPLR-----RLAQKRVG 137 PT ++ L I WH +G L E S L +L+Y L R + Sbjct: 107 PTFFILWLFIKITGKNLVIDWHNYGYTLVELNAPSKSLFTRLYYILEVDFASRFMSRMSD 166 Query: 138 CVFATR--GDLSFFAKTHPKVRGELLFFP-------------------TRMDPSLNTMAN 176 V L + + P + + Sbjct: 167 RVANVCVSKALKYDMRMKSIEATVYYDRPAEEFKPTPVDVAHCLFLKLSDQYAVFRNKLD 226 Query: 177 ---------------DRQREGKMT-------ILVGNSGDRSNEH------IAALRAVHQQ 208 + + ++ S ++ + ++ Sbjct: 227 SCRFTRFTEITALPTNTKNNEPHWRPDRPALVVSSCSWTPDDDFTLAIKALEIYDKAAEK 286 Query: 209 FGDTVK--VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDL 266 + V G P Y + +++ + ++++++ LE+ Y L DL Sbjct: 287 LDSGLPSIVFAVTGRGPLQSYYAKLIQEQ-----NWKHVEVVMLWLEWSDYPVFLGCADL 341 Query: 267 GYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVRE 323 G I G+ L+ +P + + M + + F T + D + + Sbjct: 342 GLSIHRSSSGLDLPMKVVDLLGVNVPVLALGYATLSELMEDNKYGLCFETGEQLADQMCD 401 Query: 324 AQRQLASVDKNTIAFFSP 341 L +TI + Sbjct: 402 L---LKPRRNSTIQYTCE 416 >UniRef50_C0SHG1 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=7 Tax=Leotiomyceta RepID=C0SHG1_PARBP Length = 570 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 62/200 (31%), Gaps = 17/200 (8%) Query: 169 PSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKV--------VVPMG 220 P + D + ++ S + + A+ + + V V+ G Sbjct: 360 PDTSEFVRDMRNGACRLLVSSTSWTPDEDFSILIDALCRYSAISSTVNYDLPRLGVIITG 419 Query: 221 YPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTL 280 P + Y+ V E + I S L Y LL LG + G+ Sbjct: 420 KGPQRDMYLSRVANLMAE-GKLNKVVIKSAWLSLQDYAQLLASASLGVCLHTSTSGVDLP 478 Query: 281 CL---LIQAGIPCVLNRENPFWQDMTEQHLP-VLFTTDDL----NEDIVREAQRQLASVD 332 + AG+P V W ++ + + + F + D D+ + ++LA + Sbjct: 479 MKVVDMFGAGLPVVGWSRYESWPELVTEGINGLGFGSPDELLAHLLDLFGDDGKKLAVLR 538 Query: 333 KNTIAFFSPNYLQGWQRALA 352 + + + W Sbjct: 539 QGALQESERRWDDEWDAVAG 558 >UniRef50_B6HPF8 Pc22g00440 protein n=6 Tax=Eurotiomycetidae RepID=B6HPF8_PENCW Length = 462 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 58/191 (30%), Gaps = 21/191 (10%) Query: 180 REGKMTILVG-NSGDRSNEHIAALRAVHQQFGDTVKV--VVP------MGYPPNNEAYIE 230 + G + +LV S + + A+ + V +P G P E Y+ Sbjct: 269 KAGSLRVLVSSTSWTADEDFSVLIDALLRYSELATTVQPHLPEVLAIITGKGPQKEMYLG 328 Query: 231 EV--RQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQ 285 ++ + +L + + I + L Y LL LG + G+ + Sbjct: 329 QIAALEKASKL---QKVTIRTAWLSVPEYARLLASASLGVSLHTSSSGVDLPMKVVDMFG 385 Query: 286 AGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNED----IVREAQRQLASVDKNTIAFFSP 341 AG+P V W ++ + + L + +L S+ S Sbjct: 386 AGLPVVGWDRFEAWPELVTEGVNGLGFGSSKELAGHLVELFGKSDKLESLRLGAQKESSR 445 Query: 342 NYLQGWQRALA 352 + W Sbjct: 446 RWDDEWNPIAG 456 >UniRef50_C5P1X3 Putative uncharacterized protein n=2 Tax=Coccidioides RepID=C5P1X3_COCP7 Length = 513 Score = 48.6 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 50/306 (16%), Positives = 94/306 (30%), Gaps = 52/306 (16%) Query: 88 PTLWLALLSGGIKPSQFF--WHIWGADLYELSSGLRYKLFYPLR---------------- 129 PTL +A L+ ++ ++ WH +G + + G R+ + LR Sbjct: 202 PTLVMAQLACWLRNTRLIIDWHNFGYSILAMKLGPRHPMVKFLRFHEMTACRFATAHFCV 261 Query: 130 -----RLAQKRVGCVFATRGDLSFFAKTHPKV-------RGELLFFPTRMDPSLNTMAND 177 R+ Q+ + V P E F T + + N + Sbjct: 262 SKAMARMLQQEINLVAPI-----LVLHDRPPELFQPIVREDEKFAFLTSLPETKNFVKAY 316 Query: 178 RQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKV--------VVPMGYPPNNEAYI 229 R ++ S + + L A+ Q V V+ G P Y+ Sbjct: 317 RAGRQCELLVSSTSWTQDEDFSIFLDALCQYSTHAATVDAKLPDLYVIITGKGPLQRTYL 376 Query: 230 EEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL---LIQA 286 + E + I L Y LL LG + G+ + A Sbjct: 377 RAIAALTAE-GKLRKIHIQCAWLTIQDYAKLLACSSLGVCLHTSSSGVDLPMKVVDMFGA 435 Query: 287 GIPCVLNRENPFWQDMTEQHL--PVLFTTDDLN---EDIVREAQRQLASVDKNTIAFFSP 341 G+P V W ++ + + + ++L+ D++ E + QL + + Sbjct: 436 GLPVVAWDRYQAWPELITEGVDGKGFGSAEELSRHLIDLLGEDRSQLQWLRQGARNASKR 495 Query: 342 NYLQGW 347 + W Sbjct: 496 RWDDEW 501 >UniRef50_B0ELC7 Chitobiosyldiphosphodolichol beta-mannosyltransferase, putative n=2 Tax=Entamoeba RepID=B0ELC7_ENTDI Length = 456 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 62/205 (30%), Gaps = 22/205 (10%) Query: 165 TRMDPSLNTMANDRQREGKMTI--LVGNSGDRSNEHIAALRAVHQQFGDTVK----VVVP 218 + N ++ + I + S + A+ + + ++ Sbjct: 245 STFPKYSIPFINSLIQDDEKIICGVSSTSWTPDEDFSVLFDALLSYEKNKLNLPKLIIFI 304 Query: 219 MGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIG 278 G P E Y + + + + + + I+ L + Y LL CD G + + Sbjct: 305 TGKGPLREFYEKRIEEEKM-----KRVCIIPIWLSHEDYPYLLSSCDFGISLHQSSSQLD 359 Query: 279 TLCL---LIQAGIPCVLNRENPFWQDMTEQHL--PVLFTTDDLNEDIVREAQRQLA---- 329 + +P + ++ + + T+ L+E I+ Sbjct: 360 LPMKVLDMFGCSLPVLARGYQCLKDELVIEGVYGYCFDTSKQLSELIINIISDDKKSELF 419 Query: 330 --SVDKNTIAFFSPNYLQGWQRALA 352 S+ +N I + Q W+ + Sbjct: 420 FISMKQNVIENTKVTWSQNWKNVVR 444 >UniRef50_D1HCT1 Whole genome shotgun sequence of line PN40024, scaffold_1.assembly12x (Fragment) n=4 Tax=Magnoliophyta RepID=D1HCT1_VITVI Length = 484 Score = 47.1 bits (110), Expect = 9e-04, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 53/146 (36%), Gaps = 16/146 (10%) Query: 215 VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQ 274 + + G PN E Y E++RQ + + L + Y LL DLG + Sbjct: 329 LFIITGKGPNKEKYEEKIRQ-----LKLNRVAFRTMWLSAEDYPLLLGSADLGICLHTSS 383 Query: 275 QGIGTLCL---LIQAGIPCVLNRENPFWQDMT-EQHLPVLFTTDDLNEDIVREAQ----- 325 G+ + G+P + + + E++ + ++ +L +++ + Sbjct: 384 SGLDLPMKVVDMFGCGLPVCAVSYSCIEELVKVEKNGLLFSSSSELANELLMLFKGFPDN 443 Query: 326 -RQLASVDKNTIAF-FSPNYLQGWQR 349 L + + FS + W+R Sbjct: 444 CDALKLLRNGVVEAGFSARWDTEWER 469 >UniRef50_A4S8H0 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4S8H0_OSTLU Length = 419 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 59/204 (28%), Gaps = 25/204 (12%) Query: 167 MDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAV------HQQFGDTVK------ 214 +D L + + I+ S + L A + GD Sbjct: 212 LDRFLRGTHENMTKNKPRFIVSSTSWTPDEDFGVLLDAAVAYDARKRAKGDHASKSYPDI 271 Query: 215 VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQ 274 V++ G P Y +++ + LE + + L+ Y L LG + Sbjct: 272 VIIITGQGPRKTMYEKKINELALEHVAFRTVW-----LDAADYPRALANAHLGVSLHTSS 326 Query: 275 QGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASV 331 G+ + A +P R + + E VLF + + R + Sbjct: 327 SGLDLPMKIVDMFGASLPVAAMRYAVIGELVQEGVNGVLFADATELAAMFAKLLRGDERL 386 Query: 332 DKNTIAFFSPNYLQG-----WQRA 350 + + + + W+R Sbjct: 387 TLRALKHGAAKWGEQTWDDHWKRC 410 >UniRef50_C2RVV5 Glycosyltransferase n=1 Tax=Bacillus cereus BDRD-ST24 RepID=C2RVV5_BACCE Length = 353 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 36/314 (11%), Positives = 98/314 (31%), Gaps = 38/314 (12%) Query: 22 FFNDALAATSEHAREF----MVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQ 77 + + + +E+ M+ + + S ++ +P ++++ + A N+ Sbjct: 14 YLQEVVKYQNENREVLNLKVMLSDTNSDKNFSMNKEDIEAYPYERNIKSIIKAMFYVNKN 73 Query: 78 ------QRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRL 131 H F L K + + G +S L+ K++ + R+ Sbjct: 74 IKEHKPDIIHIHSTFAGFFVRVPLLFQKKRYKVVYCSHGWAFCMETSALKKKVYEIVERV 133 Query: 132 AQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNS 191 R + H + L + + + + + I Sbjct: 134 LATRTDKIINISSS------EHEEALKRGLSYEKCELIHNGISTDLHEGDIEYRI----- 182 Query: 192 GDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEK 251 D S ++ + +Q G + + Y + + ++ G + + ++++I S Sbjct: 183 -DPSKINLLFVGRFDRQKGLDILLKFFESY----QNHNIKLHIIGESILNNDDIEIPSNV 237 Query: 252 LEF--------DAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMT 303 + D+Y L I +R +G G + + ++ ++ Sbjct: 238 VSIGWINHEHIDSYYKLFD----AIIIPSRWEGFGLVAIEAMKNKKAIIVSNRGALPELA 293 Query: 304 EQHLPVLFTTDDLN 317 +F ++L+ Sbjct: 294 NTSNGYVFDLNNLD 307 >UniRef50_D1PK32 Putative uncharacterized protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PK32_9FIRM Length = 187 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 69/193 (35%), Gaps = 17/193 (8%) Query: 4 LIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKS 63 +IHV+ H ++ + N + + F++ + + F+ Sbjct: 3 VIHVV-----HRDKFTKGYINFMKTQMARYEHCFIIQAEQKL---DLVDQNNVFYVKSFE 54 Query: 64 LAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSS----- 118 + A + F G FN +++L + + WGAD Y S Sbjct: 55 KTISNELLALMDESDAIIFSGIFN-SIYLLKQLPRRLLKKTYLQFWGADFYSYSEFRSPI 113 Query: 119 GLRYKLFYPLRRLAQKR-VGCVFATRGDLSFFAKTHPK-VRGELLFFPTRMDPSLNTMAN 176 +RY L +R+ G +F +G+ + K R ++ PT + Sbjct: 114 HIRYYLHRFMRKRLYNSCAGHIFLIQGEYKKYEAIFGKFDRNFVVSMPTDYVKEIVQEIC 173 Query: 177 D-RQREGKMTILV 188 + RQ++ K TIL+ Sbjct: 174 ELRQKKIKKTILI 186 >UniRef50_UPI0000E12861 Os06g0564800 n=1 Tax=Oryza sativa Japonica Group RepID=UPI0000E12861 Length = 416 Score = 45.9 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 50/146 (34%), Gaps = 16/146 (10%) Query: 215 VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQ 274 + + G P+ + Y E++++ + + L + Y LL DLG + Sbjct: 265 LFIITGKGPDRKKYEEQIKR-----LKLRRVSFRTMWLASEDYPLLLGSADLGVSLHTSS 319 Query: 275 QGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVR-------EA 324 G+ + G+P + + + + +LF+T D + E Sbjct: 320 SGLDLPMKVVDMFGCGLPVCAASFSCIDELVKVNNNGLLFSTSSELADELTMLFKGFPEE 379 Query: 325 QRQLASVDKN-TIAFFSPNYLQGWQR 349 +L S+ S + W+R Sbjct: 380 CDELKSLKVGALNTGSSSKWSTEWER 405 >UniRef50_A8QHU2 Glycosyl transferase, group 1 family protein n=1 Tax=Brugia malayi RepID=A8QHU2_BRUMA Length = 594 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 80/238 (33%), Gaps = 25/238 (10%) Query: 106 WHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAK-THPKVRGELLFFP 164 W I A +Y+ ++ R K + + G+ F +P ++ + + Sbjct: 250 WDISAATVYDRPPAWSFRKLTDEERH--KFLLKLIDYGGEFEVFKAVNNPCLQIDCISME 307 Query: 165 TRMDPSLNTMANDRQREGKMTILVG-NSGDRSNEHIAALRAVHQQ-------FGDTVKVV 216 + + + R + +LV S + L A+ + Sbjct: 308 ETLISYRDNEGKVQLRNDRPLLLVSSTSWTEDEDFGLLLDALREFDNIAKLSSRTNPATR 367 Query: 217 VPM------GYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFI 270 +P G P Y+ + E +N++IL+ L+ + Y L+ D+G + Sbjct: 368 LPFITCIITGRGPLRSYYLGRI-----EHMQMQNVEILTPWLKAEDYPFLIGCADIGVSL 422 Query: 271 FARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQ 325 G+ ++ G+P + R + +++ H LF T I++ Sbjct: 423 HTSTSGLDLPMKVVDMLGCGLPVIAKRFGCIGELISDGHNGRLFDTSHELSHIIKTLS 480 >UniRef50_B8ELN5 Glycosyl transferase group 1 n=4 Tax=Alphaproteobacteria RepID=B8ELN5_METSB Length = 789 Score = 44.8 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 42/127 (33%), Gaps = 3/127 (2%) Query: 226 EAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFAR--QQGIGTLCLL 283 E Y E + + L E+++ L + ++ L + CD+ + Q GTL Sbjct: 241 EVYRESLIERVRALGVEDHVVFLDQFVDQSTLLEFIAMCDVYVTPYLNEAQMTSGTLAYS 300 Query: 284 IQAGIPCVLNRENPFWQDMTEQ-HLPVLFTTDDLNEDIVREAQRQLASVDKNTIAFFSPN 342 G V + + + L V F D V A + +S + Sbjct: 301 FGLGKAVVSTPYWHARELLADGRGLLVPFGDAGATGDAVAGLLTDDARREAMRKRAYSSS 360 Query: 343 YLQGWQR 349 W+R Sbjct: 361 RSMTWER 367 >UniRef50_B2WNH6 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=9 Tax=Leotiomyceta RepID=B2WNH6_PYRTR Length = 490 Score = 44.4 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 81/279 (29%), Gaps = 38/279 (13%) Query: 106 WHIWGADLYELSSGLRYKLF---YPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELL- 161 WH +G + + + L +L K ++ K V L Sbjct: 202 WHNFGYTILAMKLSPTHPLVQISEKYEKLFAKAATHHITVTNAMARVLKASYGVTASALH 261 Query: 162 ----------------FFPTRMDPSLNTMANDRQREGKMT--ILVGNSGDRSNEHIAALR 203 F R+ + A+ ++ S + L Sbjct: 262 DRPASIFQPITPEERSNFLARLPETAQHAADLSPTSQSPWKLVVSATSWTADEDFSLLLS 321 Query: 204 AVHQQFGD-TVKVVVP------MGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDA 256 A+ T K +P G P E Y+E+++Q E N+ I + L Sbjct: 322 ALVAYSAQCTSKTHLPKLLAIITGKGPQKEYYLEQIKQLNQEN-KLLNVVIKTAWLSHSD 380 Query: 257 YLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLP--VLF 311 Y LL DLG + G+ + AG+P V + W ++ +Q + Sbjct: 381 YALLLAAADLGVSLHTSSSGVDLPMKVVDMFGAGLPVVGWGKFEAWPELVKQGVNGLGFQ 440 Query: 312 TTDDLNEDIVREAQRQLASV---DKNTIAFFSPNYLQGW 347 + ++L + R + + + W Sbjct: 441 SEEELALQLEAFFDRDTRLRETLKRGALEESGHRWDDEW 479 >UniRef50_C6QI52 Glycosyl transferase group 1 n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QI52_9RHIZ Length = 379 Score = 44.0 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 61/379 (16%), Positives = 113/379 (29%), Gaps = 43/379 (11%) Query: 1 MTVLIHVL-GSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKD-DGLSDSCPALSVQFF 58 M ++HV+ G LR +H + V + + +V Sbjct: 1 MKRVMHVIAGLGTGGTEVMCLRLARHWQGRFDQHVLAWEVSSRSLERDFQQLSQTNVSVI 60 Query: 59 PGKK----SLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLY 114 P K + K + H P + A+ + G+ + W + Sbjct: 61 PPDKRTHLQRWRWIREKIAQVKPDAVLIHCFGIPHIISAVAAHGVGINSI--SAWAGN-- 116 Query: 115 ELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHP----KVRGELLFFPTRMDPS 170 S L +L + LA + V C S A+ + P +D + Sbjct: 117 PPSRSLISRLRFTAVLLASRVVQC--PVVSCSSAVAQEFGKLGIGMPARSAIVPNAIDVA 174 Query: 171 LNTMANDRQREGKM----TILVGNSGDRSNEHIAALRAVHQQFGD--TVKVVVPMGYPPN 224 + R+ + TI + + D +H L A + D ++ + Sbjct: 175 DILATARKSRDSRRDLTPTIAIVSRLDVIKDHATLLDAFAKIHRDIPNARLWI-----IG 229 Query: 225 NEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIG-TLCLL 283 + + + L ++ + +LL Q D+ F R +G G L Sbjct: 230 DGSLRTSLEAHARNLGISKST---TFFGNRTDVASLLGQADVFAFSTTRDEGFGIVLIEA 286 Query: 284 IQAGIPCVLNRENPFWQDMTEQHLPVLFTTDD-----------LNEDIVREAQRQLASVD 332 + AGIP V + + +L D LN +R A+ S+ Sbjct: 287 MAAGIPIVATDVAACREVLANGEAGLLVAPSDADALALALYNVLNTPELR-ARMSSNSLR 345 Query: 333 KNTIAFFSPNYLQGWQRAL 351 + + Q W+ L Sbjct: 346 RVRAEYSIERCAQRWETQL 364 >UniRef50_UPI000186D588 Chitobiosyldiphosphodolichol beta-mannosyltransferase, putative n=2 Tax=Eumetazoa RepID=UPI000186D588 Length = 430 Score = 44.0 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 51/401 (12%), Positives = 103/401 (25%), Gaps = 61/401 (15%) Query: 5 IHVLGSDIPHHNRTVLRFFNDALAATSEHARE--FMVVGKDDGLSDSCPALSVQFFPGKK 62 H L +N T + + + F + P + F Sbjct: 25 YHGLSFAREKYNVTFVGYSGSTPLKLLRDKKNVNFKYLYPCPNFKQYLPNVLAYIFKVIW 84 Query: 63 SLAEAVIAKAKANRQQ-RFFFHGQFNPTLWLALLSGGIKPSQFF--WHIWGADLYELSSG 119 + A + + P + + L + + WH + + LS G Sbjct: 85 QIVTLFYALLTIDVSDFLLIQNPPALPGIGVCFLYCKLFKVKLVIDWHNYAYSILALSVG 144 Query: 120 LRYKLF-------YPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGEL------------ 160 ++KL + + ++ + A R DL K + Sbjct: 145 DKHKLVKISKWYEFFIGSKSENNLCVTQAMRKDLMDNHKISAITFYDCPPDFFHCTTVEE 204 Query: 161 ---LFFPTRMDPSLNTMANDRQREGKMTILVGN----------------SGDRSNEHIAA 201 LF + + D + GN S + Sbjct: 205 KHNLFLSLGLKYKIFLNNCDSNETVFTKVNAGNKVVLKDDRPAFLISSTSWTEDEDFSIL 264 Query: 202 LRAVHQQFGDTVK-------VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEF 254 L A+ + G P E Y + + + + + +QI++ LE Sbjct: 265 LSALEMYEESKKCSSNLPNLICAITGKGPLKEYYSKIIEEK-----NWKYVQIVTPWLEA 319 Query: 255 DAYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLF 311 + Y + DLG + G+ + +P N + + + +F Sbjct: 320 EDYPLFIGSADLGVCLHKSSSGLDLPMKVVDMFGCSVPVCAINFNCLPELVKHELNGFIF 379 Query: 312 TTDDLNEDIV---REAQRQLASVDKNTIAFFSPNYLQGWQR 349 + E S + +I N ++ W Sbjct: 380 NDASELFTQIKSWFEDFPNSNSPKQKSIKENLSNSVKKWHD 420 >UniRef50_C1BUJ6 Glycosyltransferase ALG1-like n=4 Tax=Pancrustacea RepID=C1BUJ6_9MAXI Length = 260 Score = 44.0 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 27/220 (12%), Positives = 64/220 (29%), Gaps = 25/220 (11%) Query: 148 FFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQ 207 +K P+ + L+ T + + ++ S + L A+ Sbjct: 35 RLSKRIPEFKDPLVNSGTLFTEEFARDRVTLREDRPGLLVSSTSWTEDEDFGILLDALQV 94 Query: 208 QFGDTVK---------VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYL 258 + + V G P + Y + + +++ +++ LE + Y Sbjct: 95 YNDTSSDNSVGFLPHLICVITGKGPMKDKYKGIIASR-----NWQHITVITPWLEPEDYP 149 Query: 259 ALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDD 315 ++ DLG + G+ + G+P + + ++F T Sbjct: 150 LMIASADLGVCLHTSSSGLDLPMKVVDMFGCGLPVAAVNYPTSSELIKNGENGIVFDTSY 209 Query: 316 LNEDIV------REAQRQLASV--DKNTIAFFSPNYLQGW 347 +I+ Q +L N + + W Sbjct: 210 ELAEIIMGWFKGFPEQTELKYNTFSSNLEEYQCLRWSDYW 249 >UniRef50_Q23MP4 Chitobiosyldiphosphodolichol beta-mannosyltransferase, putative n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23MP4_TETTH Length = 465 Score = 43.6 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 52/153 (33%), Gaps = 20/153 (13%) Query: 175 ANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVK----------VVVPMGYPPN 224 ++++ + ++ S + + L A+ + ++ G P Sbjct: 233 KIIKKQQRPLLLVSSTSWTKDEDFSILLDAMQSYETEKEVNKQNSLYPKLHLLITGKGPE 292 Query: 225 NEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCL-- 282 E Y + + + S +N+QI + L+ + Y LL D+G + G+ Sbjct: 293 KERYEQIIEERKK---SWKNIQIQTVWLKAEDYPKLLASADVGICLHYSSSGLDLPMKVV 349 Query: 283 -LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTD 314 ++ + +P +Q +T+ Sbjct: 350 DMLGSNLPVF----AINYQWVTQLVFGKFINKQ 378 >UniRef50_C6Y0M4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y0M4_PEDHD Length = 372 Score = 43.2 bits (100), Expect = 0.016, Method: Composition-based stats. Identities = 38/352 (10%), Positives = 87/352 (24%), Gaps = 15/352 (4%) Query: 7 VLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAE 66 V+G + RF + D L+D QF+PG+ A Sbjct: 5 VIGDAFSPYTVAFSRFLKQHSPIITIDIINTRHNVSDIDLTDHIRGAYDQFYPGQSISAI 64 Query: 67 AVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSG--LRYKL 124 K + HG ++ L + IWG+D Y + + Sbjct: 65 TSQINEKICQYDVICLHGFWDVVLSIYERLNHKDLF-TVGVIWGSDFYRRNHDTMPLSGI 123 Query: 125 FYPLRRLAQKRVGCVFATRGDLSFFAKTH----PKVRGELLFFPTRMDPSLNTMANDRQR 180 F R+ + K + S + Sbjct: 124 FDRCDRVLVQTDEMEADLLKVYPLLPKKIRKCLFGIEPLESLTAMSGISSAKAKRSLGLA 183 Query: 181 EGKMTILVGNSGDRSNEHIAALRAVH---QQFGDTVKVVVPMGYPPNNEAYIEEVRQAGL 237 + G + H A + + + ++ P Y + Y+ + Sbjct: 184 SDSFVLTCGYNASPFQNHTAIIDQLTAIISRLPANHVLIFPFTYQK-DTRYMSLIENVMK 242 Query: 238 ELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENP 297 + S + + ++ + L D+ + + V+ Sbjct: 243 K--SPLTYYFIEDFMDSEQVSLLCLATDIFIQVQTT-DALSASMREHLFAKSIVITGGWL 299 Query: 298 FWQDMTEQHLPVLFTTD-DLNEDIVREAQRQLASVDKNTIAFFSPNYLQGWQ 348 +Q + + + + V + + +P+ + ++ Sbjct: 300 PYQILVRNGFYFETIDELNSLGTTITGILDNYTEVKRKVELYNTPDKFEQYR 351 >UniRef50_Q10QW6 Os03g0180700 protein n=5 Tax=Oryza sativa RepID=Q10QW6_ORYSJ Length = 473 Score = 42.8 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 48/146 (32%), Gaps = 16/146 (10%) Query: 215 VVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQ 274 + + G P+ Y E++++ + + L + Y LL DLG + Sbjct: 322 LFIITGKGPDRMKYEEQIKR-----LKLRRVAFRTMWLASEDYPLLLGSADLGVSLHTSS 376 Query: 275 QGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVR-------EA 324 G+ + G+P + + + + +LF+T D + E Sbjct: 377 SGLDLPMKVVDMFGCGLPVCAASFSCIDELVKINNNGLLFSTSSELADELMMLFKGFPEE 436 Query: 325 QRQLASVDKN-TIAFFSPNYLQGWQR 349 L S+ S + W+R Sbjct: 437 CDDLKSLKVGALNTGSSSKWSTEWER 462 >UniRef50_Q8XN63 Capsular polysaccharide biosynthesis protein n=3 Tax=Clostridium perfringens RepID=Q8XN63_CLOPE Length = 385 Score = 42.5 bits (98), Expect = 0.024, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 94/317 (29%), Gaps = 20/317 (6%) Query: 38 MVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSG 97 +++ K + + + S F K+ E + K N+ H L Sbjct: 45 VLINKGVKIFNIPFSRSPLSFGNIKAFKELIKL-QKENKYDIVHVHTPVASIYGRLLKIR 103 Query: 98 GIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVR 157 K + G + L + ++YP+ ++ K + KT + Sbjct: 104 FPKLKTIY-TAHGYHFLKGGPKLGWIIYYPIEKVMAKLTDVTININKEDYEITKTKLNPK 162 Query: 158 GELL----------FFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQ 207 L + P + + + +++ + + I ++A+ Sbjct: 163 KCYLVNGVGLDLNQYKPLSKEKQESKRKELGLEKDDFVVIMIAELNENKNQIQLIKAMEL 222 Query: 208 QFGDTVKVV-VPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDL 266 + + +G E +E+ GL+ N ++L + L+ ++ Sbjct: 223 LKDKYPNIKAISIGEGHKFEELQQEINNRGLK----NNFKLLGFR---TDVNELINISNI 275 Query: 267 GYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQR 326 G + R+ + L+ G V + + L +D + Sbjct: 276 GILLSYREGLPRNIMELMANGKRIVATDIRGNRDIVCNDFIGALVEVNDYEATAKAVEKF 335 Query: 327 QLASVDKNTIAFFSPNY 343 L S +KN I Y Sbjct: 336 YLTSANKNKILKEVERY 352 >UniRef50_C2G144 Possible group 1 glycosyl transferase n=2 Tax=Sphingobacterium spiritivorum RepID=C2G144_9SPHI Length = 429 Score = 42.5 bits (98), Expect = 0.025, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 54/155 (34%), Gaps = 17/155 (10%) Query: 201 ALRAVHQQFGDTVKVVVPMGYPPN-----NEAYIEEVRQAGLELFSEENLQILSEKLEFD 255 A+ AV + K ++ PN E Y E + EL E+ ++ + + Sbjct: 228 AIDAVASVKDNDFKYIILGSTHPNIIRHEGEIYRESLMDKVKELGIEDKVEFVDTFATEE 287 Query: 256 AYLALLRQCDLGYFIF--ARQQGIGTLCLLIQAGIPCVLNRENPFW--QDMTEQHLPVLF 311 + L CD+ + Q GTL I AG + P+W +D+ +LF Sbjct: 288 LLVQYLSACDIYVTPYPNENQISSGTLSFAIGAGAAVL---STPYWYAKDLLANDRGILF 344 Query: 312 TTDDLN-----EDIVREAQRQLASVDKNTIAFFSP 341 D +++ E +A N + Sbjct: 345 DFKDSEGLATIINLLLEEPLLMARYRSNAKLYGQE 379 >UniRef50_B1CBE1 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1CBE1_9FIRM Length = 419 Score = 42.5 bits (98), Expect = 0.028, Method: Composition-based stats. Identities = 43/361 (11%), Positives = 98/361 (27%), Gaps = 50/361 (13%) Query: 5 IHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSL 64 + + + H++ ++ N+ V + + + K++ Sbjct: 35 VKIFCASTIHNSDEIIDLNNNLYIEKKGKDEVPYVFVETTPYMGNGVSRIKNMLSYYKNV 94 Query: 65 AEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQFFWH--------IWGADLYEL 116 +AV K + + +P LAL++G +F +W L E Sbjct: 95 KKAVSEYIKKEGKPDVIYASSVHP---LALVAGIKIKKKFNNIPCISEVRDLWPESLVEY 151 Query: 117 ----SSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLN 172 + K+ Y + K+ V T + K L ++ ++ Sbjct: 152 GIIKRKSIIAKVLYKGEKWIYKKSDAVLFTIPGGENYIKDRNWTEAIPLKKVFYINNGVD 211 Query: 173 TMA------------NDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMG 220 D E I+ S N L + + + Sbjct: 212 LEEFEDNKNKYTFKDEDLNNENLFKIVYAGSIRDVNNVDEILDMAKIYKDKHLDNIKFII 271 Query: 221 YPPNNEAYIEEVRQAGLELFSEENL---QILSEKLEFDAYLALLRQCDLGYFIFARQQGI 277 Y E + + +L + + ++ + + F +L + DL Sbjct: 272 YGDGPRK--ESLEEVAQKLSLDNVVFKGRVEKKYIPF-----ILSKSDL----------- 313 Query: 278 GTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDIVREAQRQLASVDKNTIA 337 L I I N ++ M PV + +++ + + + Sbjct: 314 -NLISGISGNIGAYGVSWNKLFEYMAS-GKPVCANYNLGEFNLIEDNNIGICKKYNDLNK 371 Query: 338 F 338 + Sbjct: 372 Y 372 >UniRef50_Q1ZPV1 Putative integrase n=2 Tax=Photobacterium RepID=Q1ZPV1_PHOAS Length = 594 Score = 42.1 bits (97), Expect = 0.030, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 72/264 (27%), Gaps = 37/264 (14%) Query: 12 IPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAK 71 I HH+ ++ + + +V K DG S F + Sbjct: 4 ISHHHALFDKYSLSTDTSFTMPTIGTVVSVKKDGSPASYFEDDKWNFND-------LFNT 56 Query: 72 AKANRQQRFF-FHGQ-FNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLR 129 A FH Q NP L L L + +W IWG + L Sbjct: 57 KGATESDYIINFHSQKHNPELLLELKQ------RAYWLIWGG---------KGSLLEVEG 101 Query: 130 RLAQKRVGCVFATRGDLSFFAKTHPKVRGELLFFPTRMDPSLNTMANDRQREGKMTILVG 189 +++ + + + + + + + + + ++ + Sbjct: 102 AGTFRKIDSIINALNNFNPLLRIFKGTSINRFRYLSNEIVFSQIIESLKGVGERVIV--- 158 Query: 190 NSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILS 249 + ++ L V+ QF + +P +++ G I Sbjct: 159 ----EKLDVLSVLTQVNPQFPKHQRFTIPYQDGQTARTIAKKIAGKGRGHHPAVIPAIYE 214 Query: 250 EKLEF------DAYLALLRQCDLG 267 + L AYL L D+ Sbjct: 215 QFLSRTVNEVESAYLEFLNGRDIY 238 >UniRef50_Q64WD8 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=Q64WD8_BACFR Length = 392 Score = 41.7 bits (96), Expect = 0.042, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 68/240 (28%), Gaps = 19/240 (7%) Query: 45 GLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIKPSQF 104 L +F + + H N + + Sbjct: 55 ALIVPSFFKKNRFLKVLYQHILFRKYIKRLDCYDVVHIHYVENIIVRDIRFFSKYIRGKL 114 Query: 105 FWHIWGADLYELSSGLRYK---LFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGELL 161 IWG+D S + LF + + + + V Sbjct: 115 IVSIWGSDFLRASEDRKKNMTVLFNRADHITIASDKVIEEFKTCYQKSSFLSKIVLCRFG 174 Query: 162 FFPTRMDPSLNTMANDRQRE--------GKMTILVGNSGDRSNEHIAALRAVHQQ----- 208 P S+ + + K+ I +G + R HI + + + Sbjct: 175 LEPLESLISILRSGANARISKSKIGLCADKIVITIGYNASRLQHHIDIIENIERSPLLSP 234 Query: 209 FGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGY 268 F D V+ ++P+ YP + YI +++ L S+ + ++ + L + L D+ Sbjct: 235 FHDKVEFLLPVTYPE-DAEYIGIIKKTVLN--SKFHYNVIEQFLSDEDIAHLRVASDIFI 291 >UniRef50_P90522 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=1 Tax=Dictyostelium discoideum RepID=ALG1_DICDI Length = 493 Score = 41.7 bits (96), Expect = 0.047, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 9/132 (6%) Query: 196 NEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSEKLEFD 255 N + + + + ++ G P E Y E++ S + +I++ L+ + Sbjct: 289 NNKVEEAQDESVVLAENLLFII-TGKGPQKEYYQEKI-----NSLSLKKSRIITVWLDSE 342 Query: 256 AYLALLRQCDLGYFIFARQQGIGTLCL---LIQAGIPCVLNRENPFWQDMTEQHLPVLFT 312 Y LL CDLG + GI + +P + + + + LF Sbjct: 343 DYPKLLACCDLGVSLHNSSSGIDLPMKVVDMFGCCLPVLAIDFKCIGELVKVNYNGFLFK 402 Query: 313 TDDLNEDIVREA 324 D ++ + Sbjct: 403 DSDQLHQLLNQL 414 >UniRef50_Q6BS98 Chitobiosyldiphosphodolichol beta-mannosyltransferase n=13 Tax=Saccharomycetales RepID=ALG1_DEBHA Length = 472 Score = 40.5 bits (93), Expect = 0.088, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 69/197 (35%), Gaps = 25/197 (12%) Query: 177 DRQREGKMTILVG-NSGDRSNEHIAALRAVHQQFGDTVK------VVVPMGYPPNNEAYI 229 D Q K ILV S + L A++Q + +++ G P ++ Sbjct: 272 DIQNISKYKILVSSTSFTPDEDFNLLLSALNQYDNSLAERGLPPILIIITGKGPLKSQFL 331 Query: 230 EEVRQAGLELFSEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLL---IQA 286 ++V +L +N+ I + L + Y +L DL + GI + Sbjct: 332 QKV----KQLNFSDNVIIKNAWLSSEDYPLILSVADLSISLHTSSSGIDLPMKIVDFFGC 387 Query: 287 GIPCVLNRENPFWQDMTEQHLPVL-----FTTDDLNEDIVREAQRQLAS------VDKNT 335 GIP + R + +T ++ ++ + +++I R + + + Sbjct: 388 GIPVITLRFPAIGELVTHGTNGLITKSDKDSSVNESQEIYRLLTEAFKNDELLDKIKQGA 447 Query: 336 IAFFSPNYLQGWQRALA 352 + + + + W + Sbjct: 448 LKESNLRWEENWNNKMG 464 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.315 0.126 0.321 Lambda K H 0.267 0.0387 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,685,392,085 Number of Sequences: 3077464 Number of extensions: 55800251 Number of successful extensions: 235630 Number of sequences better than 1.0e-01: 115 Number of HSP's better than 0.1 without gapping: 57 Number of HSP's successfully gapped in prelim test: 119 Number of HSP's that attempted gapping in prelim test: 235346 Number of HSP's gapped (non-prelim): 192 length of query: 359 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 229 effective length of database: 640,326,036 effective search space: 146634662244 effective search space used: 146634662244 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.1 bits) S2: 93 (40.5 bits)