BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (319 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_O32333 Glucitol/sorbitol-specific phosphotransferase en... 341 2e-92 UniRef50_B3WB83 Phosphoenolpyruvate-dependent phosphotransferase... 329 9e-89 UniRef50_Q1WR02 PTS system, glucitol/sorbitol-specific IIBC comp... 327 4e-88 UniRef50_D1AQB1 Protein-N(Pi)-phosphohistidine--sugar phosphotra... 309 8e-83 UniRef50_Q5WCZ1 BC component PTS system glucitol/sorbitol-specif... 295 1e-78 UniRef50_A1WK11 Protein-N(Pi)-phosphohistidine--sugar phosphotra... 285 1e-75 UniRef50_B0MFH8 Putative uncharacterized protein n=1 Tax=Anaeros... 285 2e-75 UniRef50_B4TE82 Glucitol/sorbitol-specific phosphotransferase en... 258 3e-67 UniRef50_UPI0000D72417 hypothetical protein CdifQ_04002805 n=1 T... 229 1e-58 UniRef50_UPI0001B41E23 hypothetical protein LmonL_13604 n=1 Tax=... 183 6e-45 UniRef50_B5XMU0 Sorbitol phosphotransferase enzyme II n=18 Tax=B... 159 2e-37 UniRef50_B0MI94 Putative uncharacterized protein n=1 Tax=Anaeros... 147 5e-34 UniRef50_A9W9S1 Sorbitol phosphotransferase protein II domain pr... 110 6e-23 UniRef50_B0MI95 Putative uncharacterized protein n=1 Tax=Anaeros... 89 2e-16 UniRef50_Q8X3W5 PTS system, glucitol/sorbitol-specific IIB compo... 65 4e-09 >UniRef50_O32333 Glucitol/sorbitol-specific phosphotransferase enzyme IIB component n=129 Tax=Bacteria RepID=PTHB_CLOB8 Length = 336 Score = 341 bits (874), Expect = 2e-92, Method: Compositional matrix adjust. Identities = 179/334 (53%), Positives = 229/334 (68%), Gaps = 18/334 (5%) Query: 2 THIRIEKGTGGWGGPLELKATPGKK-IVYITAG-TRPAIVDKLAQLTGWQAIDGFKEGEP 59 I+I KG+GG+GGPL +K GK ++YIT G P IV+K+ LTG +A++GFK P Sbjct: 5 NAIKIVKGSGGFGGPLTVKPEEGKDTLLYITGGGAEPEIVEKIVNLTGCKAVNGFKTSVP 64 Query: 60 AEAEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITV 119 E +I + +IDCGGTLRCGIYP++RIPTIN+ GKSGPLA++I EDIYVS V I++ Sbjct: 65 EE-QIFLVIIDCGGTLRCGIYPQKRIPTINVMPVGKSGPLAKFITEDIYVSAVGLNQISL 123 Query: 120 VGDATPQPSSVGR---------DYDTSKKITEQ-----SDGLLAKVGMGMGSTVAVLFQS 165 D++ +P + Y KK+++ ++ K+GMG G V L+Q+ Sbjct: 124 -ADSSAEPIKSTKVPEEGKREFKYSADKKVSQSLAENSKSSIVQKIGMGAGKVVNTLYQA 182 Query: 166 GRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPL 225 GRD + +++ TILPFMAFV+ LIGII SG G+W A L PLA + +GL++L ICS PL Sbjct: 183 GRDAVQSMITTILPFMAFVAMLIGIIQGSGFGNWFAKILVPLAGNGIGLMILGFICSIPL 242 Query: 226 LSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQ 285 LS LGPGAVIAQ++G LIGV+IG G IPP LALPALFAIN Q ACDFIPVGL LAEA Sbjct: 243 LSALLGPGAVIAQIVGTLIGVEIGKGTIPPSLALPALFAINTQCACDFIPVGLGLAEAEP 302 Query: 286 DTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIYQ 319 +TV VGVPSVL SRF+ G P V +AW S +YQ Sbjct: 303 ETVEVGVPSVLYSRFMIGVPRVAVAWVASIGLYQ 336 >UniRef50_B3WB83 Phosphoenolpyruvate-dependent phosphotransferase system, sorbitol-specific IIBC component n=15 Tax=Firmicutes RepID=B3WB83_LACCB Length = 372 Score = 329 bits (843), Expect = 9e-89, Method: Compositional matrix adjust. Identities = 182/363 (50%), Positives = 225/363 (61%), Gaps = 49/363 (13%) Query: 4 IRIEKGTGGWGGPLELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEA 62 I++ KG+GG+GGPL + T K K +Y+T G RPAIVDK+ +LTG +A+DGFK P E Sbjct: 9 IQVVKGSGGYGGPLTITPTEQKHKFIYVTGGNRPAIVDKIVELTGMEAVDGFKTSIP-ED 67 Query: 63 EIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV-G 121 E VA+IDCGGTLRCGIYPK+ I TIN+ TGKSGPLA+YIV +YVS V IT + Sbjct: 68 ETAVAIIDCGGTLRCGIYPKKNILTINVLPTGKSGPLAKYIVPKLYVSNVDVNQITALPD 127 Query: 122 DATPQPS----------------------------------SVGRD----------YDTS 137 DA P S + +D +DT+ Sbjct: 128 DAVPDQSLNGVPFDQRGEAGKQHAALAASAASQATAIETQTTTAKDQEAADAREAKFDTN 187 Query: 138 KKITEQSD--GLLAKVGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSALIGIIMASG 195 K IT Q +A++G+G G +A Q+ +D++ T+L T++PFMAFV+ LIGII SG Sbjct: 188 KTITAQMKKPNFIARIGIGAGKVIATFNQAAKDSVQTMLNTVIPFMAFVALLIGIIQGSG 247 Query: 196 LGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIGVLIGVQIGLGNIPP 255 LG W A + PLA + GL+++ ICS P LSP LGPGAVIAQVIG LIGV+IG GNI P Sbjct: 248 LGSWFAKLMTPLAGNVFGLIVIGFICSLPFLSPILGPGAVIAQVIGTLIGVEIGRGNIQP 307 Query: 256 HLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPTVLIAWFVSG 315 ALPALFAIN Q A DFIPVGL L EA TV VGV SVL SRFL G P V++AW S Sbjct: 308 QYALPALFAINTQNAADFIPVGLGLEEADSKTVEVGVVSVLYSRFLNGVPRVVVAWLASF 367 Query: 316 FIY 318 +Y Sbjct: 368 GLY 370 >UniRef50_Q1WR02 PTS system, glucitol/sorbitol-specific IIBC component n=63 Tax=Bacteria RepID=Q1WR02_LACS1 Length = 341 Score = 327 bits (838), Expect = 4e-88, Method: Compositional matrix adjust. Identities = 181/336 (53%), Positives = 232/336 (69%), Gaps = 19/336 (5%) Query: 2 THIRIEKGTGGWGGPLELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPA 60 I+I KG+GGWGGP+ + T K K+VY+T G RP IVD + +LTG +AIDGFK P Sbjct: 5 NSIKIVKGSGGWGGPIVVTPTEEKHKVVYVTGGNRPDIVDTIVELTGMEAIDGFKTAIPD 64 Query: 61 EAEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV 120 E EI +A++DCGGTLRCGIYP + I T+N+ TGKSGPLA+YI ++YVS V + I++V Sbjct: 65 E-EIALAIVDCGGTLRCGIYPSKNILTVNVLPTGKSGPLAKYITPELYVSAVTPKQISLV 123 Query: 121 G-------------DATPQPSSVGRD--YDTSKKITEQSDG--LLAKVGMGMGSTVAVLF 163 AT + ++ ++ DTSK +T+Q G +AK+G+G+G VA Sbjct: 124 NAEDAEAIVKQQNEKATKEETTADKEDGIDTSKTLTQQGHGGGFIAKIGIGVGKVVATFN 183 Query: 164 QSGRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSF 223 Q+ ++++ TVL TI+PFMAFV+ LIGIIM SG+G+W A + PLA + GL++L ICS Sbjct: 184 QAAKESVQTVLNTIIPFMAFVALLIGIIMGSGIGNWFAKIMVPLAGNVWGLMILGFICSL 243 Query: 224 PLLSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEA 283 P LSP LGPGAVIAQVIG LIGV+IG G+IPP AL ALFAIN Q ACDFIPVGL L EA Sbjct: 244 PFLSPLLGPGAVIAQVIGTLIGVEIGKGHIPPQYALSALFAINTQNACDFIPVGLGLEEA 303 Query: 284 RQDTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIYQ 319 + +TV VGV SVL SRFL G P V +AW S +Y Sbjct: 304 KPETVEVGVTSVLYSRFLNGLPRVFVAWLASFGLYS 339 >UniRef50_D1AQB1 Protein-N(Pi)-phosphohistidine--sugar phosphotransferase n=2 Tax=Bacteria RepID=D1AQB1_SEBTE Length = 330 Score = 309 bits (792), Expect = 8e-83, Method: Compositional matrix adjust. Identities = 154/322 (47%), Positives = 219/322 (68%), Gaps = 13/322 (4%) Query: 4 IRIEKGTGGWGGPLELKATPGKKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEAE 63 +++ +G+ GWGGPL LK P KKIV +T + +K+A++ G + ++GFK P + E Sbjct: 5 LKVSRGSNGWGGPLYLKNEPEKKIVSMTGNFIDPVAEKIAEMLGLEVVNGFKH-TPQDTE 63 Query: 64 IGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVGDA 123 I +I+CGGTLRCGIYPK+R+PTIN+ GKSGPLA++I+EDIYVS V+E N+ +V Sbjct: 64 ILCVIINCGGTLRCGIYPKKRVPTINVTPVGKSGPLAEFILEDIYVSDVREGNLELVSGM 123 Query: 124 TPQPSSVGRDYDT----------SKKITEQSD--GLLAKVGMGMGSTVAVLFQSGRDTID 171 ++ R D+ S + TE+ L+ K+G+ +G+ V + FQSG++ +D Sbjct: 124 KENGTAEKRTKDSIEDAVVKKMESMERTEKFSFLRLIEKIGLNIGNVVTLFFQSGKEAVD 183 Query: 172 TVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLG 231 ++KT++PFMAF+S I II ++ +G IA L PL+ +GLV+LALIC P LSP LG Sbjct: 184 IMIKTVVPFMAFISVFIAIINSTAIGTTIAKLLTPLSGSIIGLVVLALICGIPFLSPILG 243 Query: 232 PGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVG 291 PGA IAQV+GVLIG +IG G I P ALPALFAIN Q DF+PVG+S+ EA+ +T+ +G Sbjct: 244 PGAAIAQVVGVLIGAKIGAGEISPVFALPALFAINVQVGADFLPVGMSMQEAKPETIEIG 303 Query: 292 VPSVLVSRFLTGAPTVLIAWFV 313 P+VL+SR LTG V+IA+ + Sbjct: 304 TPAVLLSRQLTGPLAVIIAYLI 325 >UniRef50_Q5WCZ1 BC component PTS system glucitol/sorbitol-specific enzyme II n=34 Tax=Bacteria RepID=Q5WCZ1_BACSK Length = 337 Score = 295 bits (756), Expect = 1e-78, Method: Compositional matrix adjust. Identities = 160/334 (47%), Positives = 219/334 (65%), Gaps = 21/334 (6%) Query: 4 IRIEKGTGGWGGPLELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEA 62 + I KG+GGWG L ++ K K+V IT G + ++A+LTG A+DGFK P E Sbjct: 6 VFISKGSGGWGTGLRIEPKGKKTKVVSITGGGIHPVAQRIAELTGAAAVDGFKNSVP-ED 64 Query: 63 EIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVGD 122 E+ VIDCGGT R G+YP +RIPT++I ++ SGPLA++I EDI+VSGV + +++ + Sbjct: 65 EMMCVVIDCGGTARIGLYPMKRIPTVDILASSPSGPLAKHIKEDIFVSGVTPKEVSL-AE 123 Query: 123 ATPQPSSVGRDYDTSK------------------KITEQSDGLLAKVGMGMGSTVAVLFQ 164 AT + + + ++ + Q+D L + G+G V +Q Sbjct: 124 ATEEKHDLAPAEEENRFNPANKEEFKEEYEKVKNEHRSQNDNWLMRFSKGIGKVTGVFYQ 183 Query: 165 SGRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFP 224 +GRD+ID ++K I+PFMAFVS LIGII +G+GD IA+ ++PLAS GL+++ +IC+ P Sbjct: 184 AGRDSIDMLIKNIIPFMAFVSMLIGIINYTGIGDLIANTMSPLASSIWGLIIIVMICTLP 243 Query: 225 LLSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEAR 284 LSP LGPGAVIAQVIGVLIG QI LGNIPP ALPALFAIN Q DF+PVGLSL EA+ Sbjct: 244 FLSPVLGPGAVIAQVIGVLIGSQIALGNIPPQFALPALFAINGQVGADFVPVGLSLGEAK 303 Query: 285 QDTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIY 318 +TV+ GVP+VL SR +TG V+IA+ S +Y Sbjct: 304 PETVQYGVPAVLYSRLITGVLAVVIAYVASFGMY 337 >UniRef50_A1WK11 Protein-N(Pi)-phosphohistidine--sugar phosphotransferase n=11 Tax=Bacteria RepID=A1WK11_VEREI Length = 334 Score = 285 bits (730), Expect = 1e-75, Method: Compositional matrix adjust. Identities = 151/301 (50%), Positives = 201/301 (66%), Gaps = 12/301 (3%) Query: 25 KKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEAEIGVAVIDCGGTLRCGIYPKRR 84 KI+ +T G P + +LA +TG +DGF+ P E+E+ V+DCGGT RCG+YP+++ Sbjct: 32 NKIMAVTGGEIPEVARQLAAMTGATVVDGFR-APPPESEVAAVVVDCGGTARCGVYPRKK 90 Query: 85 IPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV---GDATPQPSSVGRDYDTSKKIT 141 IPTIN+ GKSGPLA +I EDIYVSGV + +++ G A+P PS+ + + Sbjct: 91 IPTINLTPVGKSGPLADFITEDIYVSGVTLQTLSLAQGPGPASPGPSAAFAGSPPAVPLR 150 Query: 142 EQSD-------GLLAKVGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSALIGIIMAS 194 + G++ +G MG V VLF SGR +ID VL+ +LPFMAFV+ LIGII A+ Sbjct: 151 PGVEDRQGGLVGMITFIGRSMGGVVNVLFNSGRKSIDQVLRNVLPFMAFVTMLIGIINAT 210 Query: 195 GLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIG-VLIGVQIGLGNI 253 G+G +AH L PLA + GL++L+ IC P LSP LGPGAVIAQVIG +IG I G I Sbjct: 211 GVGTVLAHLLEPLAGNVFGLLVLSAICGLPFLSPILGPGAVIAQVIGAAIIGPAIATGTI 270 Query: 254 PPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPTVLIAWFV 313 PP +ALPALFA + Q CDF+PVGL+L EA+ DT+RVGVP+VL+SR + G +V IAW Sbjct: 271 PPAMALPALFAYDTQVGCDFVPVGLALGEAKPDTIRVGVPAVLISRQIMGPVSVAIAWAA 330 Query: 314 S 314 S Sbjct: 331 S 331 >UniRef50_B0MFH8 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MFH8_9FIRM Length = 331 Score = 285 bits (728), Expect = 2e-75, Method: Compositional matrix adjust. Identities = 154/329 (46%), Positives = 214/329 (65%), Gaps = 13/329 (3%) Query: 3 HIRIEKGTGGWGGPLELKATPGKKIV-YITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAE 61 + I KG+ GWGGPL + T +K+V +T G+ + +K+A L G +D F + Sbjct: 4 SVTITKGSSGWGGPLTVYETETRKVVASVTGGSIHPLAEKIASLLGVPVVDAFNNKVEPD 63 Query: 62 AEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVG 121 I +AV+DCGGTLRCG+YPK + TI+IH SGPL++YI ED +VSG + IT G Sbjct: 64 TII-IAVVDCGGTLRCGVYPKMGVKTIDIHPISPSGPLSKYIKEDNFVSGTTLDCITQSG 122 Query: 122 DATP-QPSSVGRDY-DTSKKITEQSD---------GLLAKVGMGMGSTVAVLFQSGRDTI 170 +A P +P G+ D ++ ++S + +G G+GS V V +Q+GRDT+ Sbjct: 123 EAAPSEPEVSGQAVPDLAEAEGKESSPAPGGNQFFNFITSMGRGIGSIVNVFYQAGRDTL 182 Query: 171 DTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFL 230 D VLK ILPFM FVS ++GII +G+GD +A+ + PLA GL+ L++ C+ P LSP L Sbjct: 183 DIVLKNILPFMIFVSIMVGIINYTGIGDLLANAIKPLAGSLPGLLGLSVFCAIPFLSPVL 242 Query: 231 GPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRV 290 GPGAVIAQV+GVL+GV+IG G IP +LPALFAIN+Q CDF+PV L+LAEA DTV Sbjct: 243 GPGAVIAQVVGVLLGVEIGKGTIPIAYSLPALFAINSQVGCDFVPVALTLAEAEDDTVSA 302 Query: 291 GVPSVLVSRFLTGAPTVLIAWFVSGFIYQ 319 GVP++L +R +TG VL+A+ +S +Y Sbjct: 303 GVPAMLFTRLVTGPAAVLVAYLLSFGLYS 331 >UniRef50_B4TE82 Glucitol/sorbitol-specific phosphotransferase enzyme iib component n=9 Tax=Enterobacteriaceae RepID=B4TE82_SALHS Length = 326 Score = 258 bits (658), Expect = 3e-67, Method: Compositional matrix adjust. Identities = 141/307 (45%), Positives = 195/307 (63%), Gaps = 10/307 (3%) Query: 17 LELKATPGKKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEAEIGVAVIDCGGTLR 76 L L GKK++ +T + ++ LT + ++GF + P + +I VI+C G+LR Sbjct: 17 LWLPVASGKKVLSLTGREIHPVAIEIGALTESEVVNGFSD-IPPDNDILCVVINCAGSLR 75 Query: 77 CGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVG-----DATPQPSSVG 131 CG+YP++ IPTIN+ T ++GPLAQ+I ED YVSGV E + +V D P+P SV Sbjct: 76 CGLYPQKGIPTINVLPTWRAGPLAQFISEDNYVSGVTIEQLVLVDTAEAPDGEPEPVSVT 135 Query: 132 --RDYDTS--KKITEQSDGLLAKVGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSAL 187 R T + E+ ++ + G +G +A+LF + R+ +D L+ ++PFMAFVS L Sbjct: 136 APRIITTPPPARGAEKLVRMVERTGTAVGHVIALLFAASREAVDVSLRNVIPFMAFVSLL 195 Query: 188 IGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIGVLIGVQ 247 I ++ + LG IAH L PLA+ GLV+L+LIC P LSP LGPGA I+QVIGV+IG Q Sbjct: 196 IALVQETALGSLIAHALTPLANSLWGLVLLSLICGIPFLSPVLGPGAAISQVIGVMIGTQ 255 Query: 248 IGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPTV 307 IG G I P ALPALFAIN Q CDF+PVGLS+ EA+ +T+ GVP+ L+SR LTG V Sbjct: 256 IGAGAISPAFALPALFAINVQVGCDFVPVGLSMQEAKPETIAKGVPAFLLSRQLTGPLAV 315 Query: 308 LIAWFVS 314 +I W S Sbjct: 316 IIGWLFS 322 >UniRef50_UPI0000D72417 hypothetical protein CdifQ_04002805 n=1 Tax=Clostridium difficile QCD-32g58 RepID=UPI0000D72417 Length = 258 Score = 229 bits (583), Expect = 1e-58, Method: Compositional matrix adjust. Identities = 123/250 (49%), Positives = 164/250 (65%), Gaps = 15/250 (6%) Query: 82 KRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV-GDATPQPSSVG--------R 132 K + S+ K+GPLA++I E+ YVS V I+VV G+ PQ S R Sbjct: 6 KEKNTYNKCKSSWKTGPLAKFITEEYYVSDVNPNCISVVDGEDMPQKSQENKSENKSSIR 65 Query: 133 DYDTSKKITEQSDGLLAK------VGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSA 186 D ++ ++ G AK +G G G V+ + +GRDTI V+ ++PFMAFVS Sbjct: 66 KPDNYDEVKSKAQGEYAKKNIILSIGQGAGQVVSKFYDAGRDTIQMVMNNVIPFMAFVSM 125 Query: 187 LIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIGVLIGV 246 L+GII+ASGLGDWIA ++PLA + GL+++++IC+ P LSP LGPGAVIAQV+G L+G Sbjct: 126 LMGIILASGLGDWIARVISPLAGNIGGLLIISVICTLPFLSPILGPGAVIAQVVGTLVGT 185 Query: 247 QIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPT 306 QIGLG IP +LALPALFAIN QA CDF+PVGLSL EA +TV GVP++ SR +TG Sbjct: 186 QIGLGAIPAYLALPALFAINGQAGCDFVPVGLSLGEAEPETVEYGVPALFYSRLITGPIA 245 Query: 307 VLIAWFVSGF 316 V+IA+ V+ F Sbjct: 246 VIIAYGVAVF 255 >UniRef50_UPI0001B41E23 hypothetical protein LmonL_13604 n=1 Tax=Listeria monocytogenes LO28 RepID=UPI0001B41E23 Length = 197 Score = 183 bits (465), Expect = 6e-45, Method: Compositional matrix adjust. Identities = 99/187 (52%), Positives = 127/187 (67%), Gaps = 11/187 (5%) Query: 112 VKEENITVVGDATPQP------SSVGRDY----DTSKKITEQSDGLLAKVGMGMGSTVAV 161 VK E+I V D + +P DY + K+ +E +G L K G+G + V Sbjct: 1 VKPEDIEV-RDISAEPGKPFDREKFKEDYAKIKEDQKEASELKEGFLVKFSRGIGKAMGV 59 Query: 162 LFQSGRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALIC 221 +Q+GRD++D ++K I+PFMAF+S +IGII +G+G IA+ L+PLA G++++A IC Sbjct: 60 FYQAGRDSVDMLIKNIIPFMAFISMIIGIINYTGIGKLIANTLSPLAGSLWGMLLIAFIC 119 Query: 222 SFPLLSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLA 281 S P LSP LGPGAVIAQVIGVLIG QI +G IP ALPALFAINAQA CDFIPVGLSLA Sbjct: 120 SLPFLSPILGPGAVIAQVIGVLIGSQIAIGAIPVTYALPALFAINAQAGCDFIPVGLSLA 179 Query: 282 EARQDTV 288 EA+ TV Sbjct: 180 EAKPKTV 186 >UniRef50_B5XMU0 Sorbitol phosphotransferase enzyme II n=18 Tax=Bacteria RepID=B5XMU0_KLEP3 Length = 572 Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 76/118 (64%), Positives = 89/118 (75%) Query: 3 HIRIEKGTGGWGGPLELKATPGKKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEA 62 H+ I KG GWGGPL + GKKI YIT G RP +VD+L++LTGW +ID FK GEP Sbjct: 4 HLLISKGCKGWGGPLTITLGQGKKIAYITGGIRPPVVDRLSELTGWPSIDVFKNGEPPAE 63 Query: 63 EIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV 120 EIG+ VIDCGGTLRCG+YPKR IPTIN+H TGKSGPLA++I E IYVSGV I +V Sbjct: 64 EIGLMVIDCGGTLRCGLYPKRGIPTINLHPTGKSGPLAEFIHEGIYVSGVTPACIEMV 121 >UniRef50_B0MI94 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MI94_9FIRM Length = 186 Score = 147 bits (371), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 80/188 (42%), Positives = 118/188 (62%), Gaps = 7/188 (3%) Query: 131 GRDYDTSKKITEQSDGLLAKVGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSALIGI 190 G++ T K + V G+G V + Q+ R + L+TI+PF+ FVS + + Sbjct: 6 GKNMKTVSK-------FMTTVAAGIGKFVMTVVQAARTSFKLCLETIIPFLIFVSTVFTL 58 Query: 191 IMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIGVLIGVQIGL 250 I ++G+G+ IA+ L+ LA+ P+GL+++ LI +FPLLSP +GPGAVI QVIG LIG I Sbjct: 59 ITSTGVGNIIANALSGLATSPIGLIVMGLIITFPLLSPIIGPGAVIPQVIGALIGGLIST 118 Query: 251 GNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPTVLIA 310 G +P +ALP +FAI+ DFIPVG+SL EA +TV VGV +VL S+F+ +LIA Sbjct: 119 GAVPLTMALPTVFAIHQPCGADFIPVGMSLCEAEPETVEVGVSAVLFSKFVVAPVEILIA 178 Query: 311 WFVSGFIY 318 + ++ Sbjct: 179 VIIGQLVF 186 >UniRef50_A9W9S1 Sorbitol phosphotransferase protein II domain protein n=2 Tax=Chloroflexus RepID=A9W9S1_CHLAA Length = 128 Score = 110 bits (276), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 53/120 (44%), Positives = 83/120 (69%), Gaps = 2/120 (1%) Query: 4 IRIEKGTGGWGGPLELKATPGKKIVY-ITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEA 62 +RI+KG GWGGPL ++ PG+ ++Y +T G + +A LTG + DGFK + + Sbjct: 10 VRIKKGPKGWGGPLIIEPKPGRDLIYSVTGGGIHPLAQHIANLTGGRPFDGFKS-KADFS 68 Query: 63 EIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVGD 122 EI VAVIDCGGT R G+YP +++PT++I+ T SGPL ++I E+ +VSGV+ E++ ++ + Sbjct: 69 EIAVAVIDCGGTARIGVYPMKKVPTVDIYPTSPSGPLMRFITEEYFVSGVRPEDVELIDE 128 >UniRef50_B0MI95 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MI95_9FIRM Length = 121 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 48/115 (41%), Positives = 76/115 (66%), Gaps = 4/115 (3%) Query: 6 IEKGTGGWGGPLELKATPGK-KIVYITA-GTRPAIVDKLAQLTGWQAIDGFKEGEPAEAE 63 + KG GG+G PLE+ A + K+V +T G P + +K+A + G +A+DGF++ P +AE Sbjct: 6 VSKGGGGFGTPLEISAEGKRTKVVCMTGVGIHP-LAEKIAGIIGGEAVDGFRKPVP-DAE 63 Query: 64 IGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENIT 118 +++CGG+LR G++PK+ + TIN++ TG SGP + E IYVSG +N+T Sbjct: 64 TACVIVNCGGSLRLGMFPKKGLKTINLNPTGPSGPFGSFCKEGIYVSGSTVKNVT 118 >UniRef50_Q8X3W5 PTS system, glucitol/sorbitol-specific IIB component and second of two IIC components; frag n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3W5_ECO57 Length = 68 Score = 64.7 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 29/31 (93%), Positives = 30/31 (96%) Query: 1 MTHIRIEKGTGGWGGPLELKATPGKKIVYIT 31 MTHIRIEKGTGGWGGPLEL+ TPGKKIVYIT Sbjct: 1 MTHIRIEKGTGGWGGPLELETTPGKKIVYIT 31 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q5WCZ1 BC component PTS system glucitol/sorbitol-specif... 410 e-113 UniRef50_Q1WR02 PTS system, glucitol/sorbitol-specific IIBC comp... 403 e-111 UniRef50_B3WB83 Phosphoenolpyruvate-dependent phosphotransferase... 400 e-110 UniRef50_O32333 Glucitol/sorbitol-specific phosphotransferase en... 397 e-109 UniRef50_D1AQB1 Protein-N(Pi)-phosphohistidine--sugar phosphotra... 393 e-108 UniRef50_B0MFH8 Putative uncharacterized protein n=1 Tax=Anaeros... 383 e-105 UniRef50_B4TE82 Glucitol/sorbitol-specific phosphotransferase en... 366 e-100 UniRef50_A1WK11 Protein-N(Pi)-phosphohistidine--sugar phosphotra... 349 6e-95 UniRef50_UPI0000D72417 hypothetical protein CdifQ_04002805 n=1 T... 302 8e-81 UniRef50_UPI0001B41E23 hypothetical protein LmonL_13604 n=1 Tax=... 230 4e-59 UniRef50_B0MI94 Putative uncharacterized protein n=1 Tax=Anaeros... 202 2e-50 UniRef50_B5XMU0 Sorbitol phosphotransferase enzyme II n=18 Tax=B... 179 1e-43 UniRef50_A9W9S1 Sorbitol phosphotransferase protein II domain pr... 157 4e-37 UniRef50_B0MI95 Putative uncharacterized protein n=1 Tax=Anaeros... 135 2e-30 UniRef50_Q8X3W5 PTS system, glucitol/sorbitol-specific IIB compo... 54 8e-06 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_Q5WCZ1 BC component PTS system glucitol/sorbitol-specific enzyme II n=34 Tax=Bacteria RepID=Q5WCZ1_BACSK Length = 337 Score = 410 bits (1053), Expect = e-113, Method: Composition-based stats. Identities = 160/335 (47%), Positives = 219/335 (65%), Gaps = 21/335 (6%) Query: 3 HIRIEKGTGGWGGPLELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAE 61 + I KG+GGWG L ++ K K+V IT G + ++A+LTG A+DGFK P E Sbjct: 5 AVFISKGSGGWGTGLRIEPKGKKTKVVSITGGGIHPVAQRIAELTGAAAVDGFKNSVP-E 63 Query: 62 AEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVG 121 E+ VIDCGGT R G+YP +RIPT++I ++ SGPLA++I EDI+VSGV + +++ Sbjct: 64 DEMMCVVIDCGGTARIGLYPMKRIPTVDILASSPSGPLAKHIKEDIFVSGVTPKEVSL-A 122 Query: 122 DATPQPSSVGRDYDTSK------------------KITEQSDGLLAKVGMGMGSTVAVLF 163 +AT + + + ++ + Q+D L + G+G V + Sbjct: 123 EATEEKHDLAPAEEENRFNPANKEEFKEEYEKVKNEHRSQNDNWLMRFSKGIGKVTGVFY 182 Query: 164 QSGRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSF 223 Q+GRD+ID ++K I+PFMAFVS LIGII +G+GD IA+ ++PLAS GL+++ +IC+ Sbjct: 183 QAGRDSIDMLIKNIIPFMAFVSMLIGIINYTGIGDLIANTMSPLASSIWGLIIIVMICTL 242 Query: 224 PLLSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEA 283 P LSP LGPGAVIAQVIGVLIG QI LGNIPP ALPALFAIN Q DF+PVGLSL EA Sbjct: 243 PFLSPVLGPGAVIAQVIGVLIGSQIALGNIPPQFALPALFAINGQVGADFVPVGLSLGEA 302 Query: 284 RQDTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIY 318 + +TV+ GVP+VL SR +TG V+IA+ S +Y Sbjct: 303 KPETVQYGVPAVLYSRLITGVLAVVIAYVASFGMY 337 >UniRef50_Q1WR02 PTS system, glucitol/sorbitol-specific IIBC component n=63 Tax=Bacteria RepID=Q1WR02_LACS1 Length = 341 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. Identities = 181/336 (53%), Positives = 232/336 (69%), Gaps = 19/336 (5%) Query: 2 THIRIEKGTGGWGGPLELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPA 60 I+I KG+GGWGGP+ + T K K+VY+T G RP IVD + +LTG +AIDGFK P Sbjct: 5 NSIKIVKGSGGWGGPIVVTPTEEKHKVVYVTGGNRPDIVDTIVELTGMEAIDGFKTAIPD 64 Query: 61 EAEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV 120 E EI +A++DCGGTLRCGIYP + I T+N+ TGKSGPLA+YI ++YVS V + I++V Sbjct: 65 E-EIALAIVDCGGTLRCGIYPSKNILTVNVLPTGKSGPLAKYITPELYVSAVTPKQISLV 123 Query: 121 G-------------DATPQPSSVGRD--YDTSKKITEQSDG--LLAKVGMGMGSTVAVLF 163 AT + ++ ++ DTSK +T+Q G +AK+G+G+G VA Sbjct: 124 NAEDAEAIVKQQNEKATKEETTADKEDGIDTSKTLTQQGHGGGFIAKIGIGVGKVVATFN 183 Query: 164 QSGRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSF 223 Q+ ++++ TVL TI+PFMAFV+ LIGIIM SG+G+W A + PLA + GL++L ICS Sbjct: 184 QAAKESVQTVLNTIIPFMAFVALLIGIIMGSGIGNWFAKIMVPLAGNVWGLMILGFICSL 243 Query: 224 PLLSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEA 283 P LSP LGPGAVIAQVIG LIGV+IG G+IPP AL ALFAIN Q ACDFIPVGL L EA Sbjct: 244 PFLSPLLGPGAVIAQVIGTLIGVEIGKGHIPPQYALSALFAINTQNACDFIPVGLGLEEA 303 Query: 284 RQDTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIYQ 319 + +TV VGV SVL SRFL G P V +AW S +Y Sbjct: 304 KPETVEVGVTSVLYSRFLNGLPRVFVAWLASFGLYS 339 >UniRef50_B3WB83 Phosphoenolpyruvate-dependent phosphotransferase system, sorbitol-specific IIBC component n=15 Tax=Firmicutes RepID=B3WB83_LACCB Length = 372 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. Identities = 182/365 (49%), Positives = 223/365 (61%), Gaps = 49/365 (13%) Query: 3 HIRIEKGTGGWGGPLELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAE 61 I++ KG+GG+GGPL + T K K +Y+T G RPAIVDK+ +LTG +A+DGFK P E Sbjct: 8 SIQVVKGSGGYGGPLTITPTEQKHKFIYVTGGNRPAIVDKIVELTGMEAVDGFKTSIP-E 66 Query: 62 AEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENIT-VV 120 E VA+IDCGGTLRCGIYPK+ I TIN+ TGKSGPLA+YIV +YVS V IT + Sbjct: 67 DETAVAIIDCGGTLRCGIYPKKNILTINVLPTGKSGPLAKYIVPKLYVSNVDVNQITALP 126 Query: 121 GDATPQPSSVG--------------------------------------------RDYDT 136 DA P S G +DT Sbjct: 127 DDAVPDQSLNGVPFDQRGEAGKQHAALAASAASQATAIETQTTTAKDQEAADAREAKFDT 186 Query: 137 SKKITEQSD--GLLAKVGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSALIGIIMAS 194 +K IT Q +A++G+G G +A Q+ +D++ T+L T++PFMAFV+ LIGII S Sbjct: 187 NKTITAQMKKPNFIARIGIGAGKVIATFNQAAKDSVQTMLNTVIPFMAFVALLIGIIQGS 246 Query: 195 GLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIGVLIGVQIGLGNIP 254 GLG W A + PLA + GL+++ ICS P LSP LGPGAVIAQVIG LIGV+IG GNI Sbjct: 247 GLGSWFAKLMTPLAGNVFGLIVIGFICSLPFLSPILGPGAVIAQVIGTLIGVEIGRGNIQ 306 Query: 255 PHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPTVLIAWFVS 314 P ALPALFAIN Q A DFIPVGL L EA TV VGV SVL SRFL G P V++AW S Sbjct: 307 PQYALPALFAINTQNAADFIPVGLGLEEADSKTVEVGVVSVLYSRFLNGVPRVVVAWLAS 366 Query: 315 GFIYQ 319 +Y Sbjct: 367 FGLYA 371 >UniRef50_O32333 Glucitol/sorbitol-specific phosphotransferase enzyme IIB component n=129 Tax=Bacteria RepID=PTHB_CLOB8 Length = 336 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. Identities = 177/334 (52%), Positives = 226/334 (67%), Gaps = 18/334 (5%) Query: 2 THIRIEKGTGGWGGPLELKATPGKK-IVYITAGT-RPAIVDKLAQLTGWQAIDGFKEGEP 59 I+I KG+GG+GGPL +K GK ++YIT G P IV+K+ LTG +A++GFK P Sbjct: 5 NAIKIVKGSGGFGGPLTVKPEEGKDTLLYITGGGAEPEIVEKIVNLTGCKAVNGFKTSVP 64 Query: 60 AEAEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITV 119 E +I + +IDCGGTLRCGIYP++RIPTIN+ GKSGPLA++I EDIYVS V I++ Sbjct: 65 EE-QIFLVIIDCGGTLRCGIYPQKRIPTINVMPVGKSGPLAKFITEDIYVSAVGLNQISL 123 Query: 120 VGDATPQPSSVGRDYDTSKKI--------------TEQSDGLLAKVGMGMGSTVAVLFQS 165 D++ +P + + K+ ++ K+GMG G V L+Q+ Sbjct: 124 -ADSSAEPIKSTKVPEEGKREFKYSADKKVSQSLAENSKSSIVQKIGMGAGKVVNTLYQA 182 Query: 166 GRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPL 225 GRD + +++ TILPFMAFV+ LIGII SG G+W A L PLA + +GL++L ICS PL Sbjct: 183 GRDAVQSMITTILPFMAFVAMLIGIIQGSGFGNWFAKILVPLAGNGIGLMILGFICSIPL 242 Query: 226 LSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQ 285 LS LGPGAVIAQ++G LIGV+IG G IPP LALPALFAIN Q ACDFIPVGL LAEA Sbjct: 243 LSALLGPGAVIAQIVGTLIGVEIGKGTIPPSLALPALFAINTQCACDFIPVGLGLAEAEP 302 Query: 286 DTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIYQ 319 +TV VGVPSVL SRF+ G P V +AW S +YQ Sbjct: 303 ETVEVGVPSVLYSRFMIGVPRVAVAWVASIGLYQ 336 >UniRef50_D1AQB1 Protein-N(Pi)-phosphohistidine--sugar phosphotransferase n=2 Tax=Bacteria RepID=D1AQB1_SEBTE Length = 330 Score = 393 bits (1011), Expect = e-108, Method: Composition-based stats. Identities = 154/327 (47%), Positives = 221/327 (67%), Gaps = 13/327 (3%) Query: 4 IRIEKGTGGWGGPLELKATPGKKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEAE 63 +++ +G+ GWGGPL LK P KKIV +T + +K+A++ G + ++GFK P + E Sbjct: 5 LKVSRGSNGWGGPLYLKNEPEKKIVSMTGNFIDPVAEKIAEMLGLEVVNGFKH-TPQDTE 63 Query: 64 IGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVGDA 123 I +I+CGGTLRCGIYPK+R+PTIN+ GKSGPLA++I+EDIYVS V+E N+ +V Sbjct: 64 ILCVIINCGGTLRCGIYPKKRVPTINVTPVGKSGPLAEFILEDIYVSDVREGNLELVSGM 123 Query: 124 TPQPSSVGRDYDT----------SKKITEQSD--GLLAKVGMGMGSTVAVLFQSGRDTID 171 ++ R D+ S + TE+ L+ K+G+ +G+ V + FQSG++ +D Sbjct: 124 KENGTAEKRTKDSIEDAVVKKMESMERTEKFSFLRLIEKIGLNIGNVVTLFFQSGKEAVD 183 Query: 172 TVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLG 231 ++KT++PFMAF+S I II ++ +G IA L PL+ +GLV+LALIC P LSP LG Sbjct: 184 IMIKTVVPFMAFISVFIAIINSTAIGTTIAKLLTPLSGSIIGLVVLALICGIPFLSPILG 243 Query: 232 PGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVG 291 PGA IAQV+GVLIG +IG G I P ALPALFAIN Q DF+PVG+S+ EA+ +T+ +G Sbjct: 244 PGAAIAQVVGVLIGAKIGAGEISPVFALPALFAINVQVGADFLPVGMSMQEAKPETIEIG 303 Query: 292 VPSVLVSRFLTGAPTVLIAWFVSGFIY 318 P+VL+SR LTG V+IA+ + ++ Sbjct: 304 TPAVLLSRQLTGPLAVIIAYLIGLGLF 330 >UniRef50_B0MFH8 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MFH8_9FIRM Length = 331 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 153/329 (46%), Positives = 210/329 (63%), Gaps = 13/329 (3%) Query: 3 HIRIEKGTGGWGGPLELKATPGKKIV-YITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAE 61 + I KG+ GWGGPL + T +K+V +T G+ + +K+A L G +D F + Sbjct: 4 SVTITKGSSGWGGPLTVYETETRKVVASVTGGSIHPLAEKIASLLGVPVVDAFNNKVEPD 63 Query: 62 AEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVG 121 I +AV+DCGGTLRCG+YPK + TI+IH SGPL++YI ED +VSG + IT G Sbjct: 64 T-IIIAVVDCGGTLRCGVYPKMGVKTIDIHPISPSGPLSKYIKEDNFVSGTTLDCITQSG 122 Query: 122 DATP-QPSSVGRDYD----------TSKKITEQSDGLLAKVGMGMGSTVAVLFQSGRDTI 170 +A P +P G+ + Q + +G G+GS V V +Q+GRDT+ Sbjct: 123 EAAPSEPEVSGQAVPDLAEAEGKESSPAPGGNQFFNFITSMGRGIGSIVNVFYQAGRDTL 182 Query: 171 DTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFL 230 D VLK ILPFM FVS ++GII +G+GD +A+ + PLA GL+ L++ C+ P LSP L Sbjct: 183 DIVLKNILPFMIFVSIMVGIINYTGIGDLLANAIKPLAGSLPGLLGLSVFCAIPFLSPVL 242 Query: 231 GPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRV 290 GPGAVIAQV+GVL+GV+IG G IP +LPALFAIN+Q CDF+PV L+LAEA DTV Sbjct: 243 GPGAVIAQVVGVLLGVEIGKGTIPIAYSLPALFAINSQVGCDFVPVALTLAEAEDDTVSA 302 Query: 291 GVPSVLVSRFLTGAPTVLIAWFVSGFIYQ 319 GVP++L +R +TG VL+A+ +S +Y Sbjct: 303 GVPAMLFTRLVTGPAAVLVAYLLSFGLYS 331 >UniRef50_B4TE82 Glucitol/sorbitol-specific phosphotransferase enzyme iib component n=9 Tax=Enterobacteriaceae RepID=B4TE82_SALHS Length = 326 Score = 366 bits (940), Expect = e-100, Method: Composition-based stats. Identities = 145/327 (44%), Positives = 203/327 (62%), Gaps = 10/327 (3%) Query: 1 MTHIRIEKGTGGWGGPLELKATPGKKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPA 60 M + I G GG+G L L GKK++ +T + ++ LT + ++GF + P Sbjct: 1 MNTVLIPPGPGGYGKGLWLPVASGKKVLSLTGREIHPVAIEIGALTESEVVNGFSD-IPP 59 Query: 61 EAEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV 120 + +I VI+C G+LRCG+YP++ IPTIN+ T ++GPLAQ+I ED YVSGV E + +V Sbjct: 60 DNDILCVVINCAGSLRCGLYPQKGIPTINVLPTWRAGPLAQFISEDNYVSGVTIEQLVLV 119 Query: 121 G-----DATPQPSSVGR----DYDTSKKITEQSDGLLAKVGMGMGSTVAVLFQSGRDTID 171 D P+P SV + E+ ++ + G +G +A+LF + R+ +D Sbjct: 120 DTAEAPDGEPEPVSVTAPRIITTPPPARGAEKLVRMVERTGTAVGHVIALLFAASREAVD 179 Query: 172 TVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLG 231 L+ ++PFMAFVS LI ++ + LG IAH L PLA+ GLV+L+LIC P LSP LG Sbjct: 180 VSLRNVIPFMAFVSLLIALVQETALGSLIAHALTPLANSLWGLVLLSLICGIPFLSPVLG 239 Query: 232 PGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVG 291 PGA I+QVIGV+IG QIG G I P ALPALFAIN Q CDF+PVGLS+ EA+ +T+ G Sbjct: 240 PGAAISQVIGVMIGTQIGAGAISPAFALPALFAINVQVGCDFVPVGLSMQEAKPETIAKG 299 Query: 292 VPSVLVSRFLTGAPTVLIAWFVSGFIY 318 VP+ L+SR LTG V+I W S ++ Sbjct: 300 VPAFLLSRQLTGPLAVIIGWLFSLGLF 326 >UniRef50_A1WK11 Protein-N(Pi)-phosphohistidine--sugar phosphotransferase n=11 Tax=Bacteria RepID=A1WK11_VEREI Length = 334 Score = 349 bits (897), Expect = 6e-95, Method: Composition-based stats. Identities = 153/313 (48%), Positives = 206/313 (65%), Gaps = 13/313 (4%) Query: 17 LELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEAEIGVAVIDCGGTL 75 L + T + KI+ +T G P + +LA +TG +DGF+ P E+E+ V+DCGGT Sbjct: 23 LVITPTAERNKIMAVTGGEIPEVARQLAAMTGATVVDGFR-APPPESEVAAVVVDCGGTA 81 Query: 76 RCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV---GDATPQPSSVGR 132 RCG+YP+++IPTIN+ GKSGPLA +I EDIYVSGV + +++ G A+P PS+ Sbjct: 82 RCGVYPRKKIPTINLTPVGKSGPLADFITEDIYVSGVTLQTLSLAQGPGPASPGPSAAFA 141 Query: 133 DYDTSKKITEQSD-------GLLAKVGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVS 185 + + + G++ +G MG V VLF SGR +ID VL+ +LPFMAFV+ Sbjct: 142 GSPPAVPLRPGVEDRQGGLVGMITFIGRSMGGVVNVLFNSGRKSIDQVLRNVLPFMAFVT 201 Query: 186 ALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIG-VLI 244 LIGII A+G+G +AH L PLA + GL++L+ IC P LSP LGPGAVIAQVIG +I Sbjct: 202 MLIGIINATGVGTVLAHLLEPLAGNVFGLLVLSAICGLPFLSPILGPGAVIAQVIGAAII 261 Query: 245 GVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGA 304 G I G IPP +ALPALFA + Q CDF+PVGL+L EA+ DT+RVGVP+VL+SR + G Sbjct: 262 GPAIATGTIPPAMALPALFAYDTQVGCDFVPVGLALGEAKPDTIRVGVPAVLISRQIMGP 321 Query: 305 PTVLIAWFVSGFI 317 +V IAW S + Sbjct: 322 VSVAIAWAASFML 334 >UniRef50_UPI0000D72417 hypothetical protein CdifQ_04002805 n=1 Tax=Clostridium difficile QCD-32g58 RepID=UPI0000D72417 Length = 258 Score = 302 bits (775), Expect = 8e-81, Method: Composition-based stats. Identities = 123/250 (49%), Positives = 164/250 (65%), Gaps = 15/250 (6%) Query: 82 KRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV-GDATPQPSSVG--------R 132 K + S+ K+GPLA++I E+ YVS V I+VV G+ PQ S R Sbjct: 6 KEKNTYNKCKSSWKTGPLAKFITEEYYVSDVNPNCISVVDGEDMPQKSQENKSENKSSIR 65 Query: 133 DYDTSKKITEQSDGLLAK------VGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSA 186 D ++ ++ G AK +G G G V+ + +GRDTI V+ ++PFMAFVS Sbjct: 66 KPDNYDEVKSKAQGEYAKKNIILSIGQGAGQVVSKFYDAGRDTIQMVMNNVIPFMAFVSM 125 Query: 187 LIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIGVLIGV 246 L+GII+ASGLGDWIA ++PLA + GL+++++IC+ P LSP LGPGAVIAQV+G L+G Sbjct: 126 LMGIILASGLGDWIARVISPLAGNIGGLLIISVICTLPFLSPILGPGAVIAQVVGTLVGT 185 Query: 247 QIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPT 306 QIGLG IP +LALPALFAIN QA CDF+PVGLSL EA +TV GVP++ SR +TG Sbjct: 186 QIGLGAIPAYLALPALFAINGQAGCDFVPVGLSLGEAEPETVEYGVPALFYSRLITGPIA 245 Query: 307 VLIAWFVSGF 316 V+IA+ V+ F Sbjct: 246 VIIAYGVAVF 255 >UniRef50_UPI0001B41E23 hypothetical protein LmonL_13604 n=1 Tax=Listeria monocytogenes LO28 RepID=UPI0001B41E23 Length = 197 Score = 230 bits (588), Expect = 4e-59, Method: Composition-based stats. Identities = 97/188 (51%), Positives = 126/188 (67%), Gaps = 11/188 (5%) Query: 112 VKEENITVVGDATPQP----------SSVGRDYDTSKKITEQSDGLLAKVGMGMGSTVAV 161 VK E+I V D + +P + + K+ +E +G L K G+G + V Sbjct: 1 VKPEDIE-VRDISAEPGKPFDREKFKEDYAKIKEDQKEASELKEGFLVKFSRGIGKAMGV 59 Query: 162 LFQSGRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALIC 221 +Q+GRD++D ++K I+PFMAF+S +IGII +G+G IA+ L+PLA G++++A IC Sbjct: 60 FYQAGRDSVDMLIKNIIPFMAFISMIIGIINYTGIGKLIANTLSPLAGSLWGMLLIAFIC 119 Query: 222 SFPLLSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLA 281 S P LSP LGPGAVIAQVIGVLIG QI +G IP ALPALFAINAQA CDFIPVGLSLA Sbjct: 120 SLPFLSPILGPGAVIAQVIGVLIGSQIAIGAIPVTYALPALFAINAQAGCDFIPVGLSLA 179 Query: 282 EARQDTVR 289 EA+ TV Sbjct: 180 EAKPKTVS 187 >UniRef50_B0MI94 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MI94_9FIRM Length = 186 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. Identities = 78/180 (43%), Positives = 115/180 (63%) Query: 139 KITEQSDGLLAKVGMGMGSTVAVLFQSGRDTIDTVLKTILPFMAFVSALIGIIMASGLGD 198 K + + V G+G V + Q+ R + L+TI+PF+ FVS + +I ++G+G+ Sbjct: 7 KNMKTVSKFMTTVAAGIGKFVMTVVQAARTSFKLCLETIIPFLIFVSTVFTLITSTGVGN 66 Query: 199 WIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVIGVLIGVQIGLGNIPPHLA 258 IA+ L+ LA+ P+GL+++ LI +FPLLSP +GPGAVI QVIG LIG I G +P +A Sbjct: 67 IIANALSGLATSPIGLIVMGLIITFPLLSPIIGPGAVIPQVIGALIGGLISTGAVPLTMA 126 Query: 259 LPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIY 318 LP +FAI+ DFIPVG+SL EA +TV VGV +VL S+F+ +LIA + ++ Sbjct: 127 LPTVFAIHQPCGADFIPVGMSLCEAEPETVEVGVSAVLFSKFVVAPVEILIAVIIGQLVF 186 >UniRef50_B5XMU0 Sorbitol phosphotransferase enzyme II n=18 Tax=Bacteria RepID=B5XMU0_KLEP3 Length = 572 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. Identities = 76/118 (64%), Positives = 89/118 (75%) Query: 3 HIRIEKGTGGWGGPLELKATPGKKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEA 62 H+ I KG GWGGPL + GKKI YIT G RP +VD+L++LTGW +ID FK GEP Sbjct: 4 HLLISKGCKGWGGPLTITLGQGKKIAYITGGIRPPVVDRLSELTGWPSIDVFKNGEPPAE 63 Query: 63 EIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVV 120 EIG+ VIDCGGTLRCG+YPKR IPTIN+H TGKSGPLA++I E IYVSGV I +V Sbjct: 64 EIGLMVIDCGGTLRCGLYPKRGIPTINLHPTGKSGPLAEFIHEGIYVSGVTPACIEMV 121 >UniRef50_A9W9S1 Sorbitol phosphotransferase protein II domain protein n=2 Tax=Chloroflexus RepID=A9W9S1_CHLAA Length = 128 Score = 157 bits (398), Expect = 4e-37, Method: Composition-based stats. Identities = 53/120 (44%), Positives = 83/120 (69%), Gaps = 2/120 (1%) Query: 4 IRIEKGTGGWGGPLELKATPGKKIVY-ITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEA 62 +RI+KG GWGGPL ++ PG+ ++Y +T G + +A LTG + DGFK + + Sbjct: 10 VRIKKGPKGWGGPLIIEPKPGRDLIYSVTGGGIHPLAQHIANLTGGRPFDGFK-SKADFS 68 Query: 63 EIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVGD 122 EI VAVIDCGGT R G+YP +++PT++I+ T SGPL ++I E+ +VSGV+ E++ ++ + Sbjct: 69 EIAVAVIDCGGTARIGVYPMKKVPTVDIYPTSPSGPLMRFITEEYFVSGVRPEDVELIDE 128 >UniRef50_B0MI95 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MI95_9FIRM Length = 121 Score = 135 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 47/122 (38%), Positives = 75/122 (61%), Gaps = 2/122 (1%) Query: 1 MTHIRIEKGTGGWGGPLELKATPGK-KIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEP 59 M + KG GG+G PLE+ A + K+V +T + +K+A + G +A+DGF++ P Sbjct: 1 MGKAIVSKGGGGFGTPLEISAEGKRTKVVCMTGVGIHPLAEKIAGIIGGEAVDGFRKPVP 60 Query: 60 AEAEIGVAVIDCGGTLRCGIYPKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITV 119 +AE +++CGG+LR G++PK+ + TIN++ TG SGP + E IYVSG +N+T Sbjct: 61 -DAETACVIVNCGGSLRLGMFPKKGLKTINLNPTGPSGPFGSFCKEGIYVSGSTVKNVTA 119 Query: 120 VG 121 Sbjct: 120 ED 121 >UniRef50_Q8X3W5 PTS system, glucitol/sorbitol-specific IIB component and second of two IIC components; frag n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3W5_ECO57 Length = 68 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 29/31 (93%), Positives = 30/31 (96%) Query: 1 MTHIRIEKGTGGWGGPLELKATPGKKIVYIT 31 MTHIRIEKGTGGWGGPLEL+ TPGKKIVYIT Sbjct: 1 MTHIRIEKGTGGWGGPLELETTPGKKIVYIT 31 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.317 0.148 0.420 Lambda K H 0.267 0.0451 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,342,539,688 Number of Sequences: 3077464 Number of extensions: 61174316 Number of successful extensions: 170293 Number of sequences better than 1.0e-01: 28 Number of HSP's better than 0.1 without gapping: 34 Number of HSP's successfully gapped in prelim test: 13 Number of HSP's that attempted gapping in prelim test: 170186 Number of HSP's gapped (non-prelim): 54 length of query: 319 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 191 effective length of database: 646,480,964 effective search space: 123477864124 effective search space used: 123477864124 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 93 (40.3 bits)