BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Schaffer, Alejandro A.,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge,
Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements",
Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (388 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score   E
Sequences producing significant alignments:                        (bits) Value

UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli Re...   345   2e-93
UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechlor...   299   1e-79
UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepI...   187   5e-46
UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriav...   165   3e-39
UniRef50_D1PA58 Putative chain length determinant protein n=1 Ta...   162   3e-38
UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification prote...   141   4e-32
UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralston...   129   2e-28
UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevote...   127   6e-28
UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumon...    84   9e-15
UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinoco...    80   2e-13
UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane prot...    63   1e-08
UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae ...    53   2e-05
UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, puta...    45   0.004
UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdema...    45   0.005
UniRef50_B8C4D4 Predicted protein n=1 Tax=Thalassiosira pseudona...    42   0.055
UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudom...    42   0.056
UniRef50_C0D232 Putative uncharacterized protein n=1 Tax=Clostri...    41   0.071

>UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli RepID=RFC_ECOLI
          Length = 388

 Score = 345 bits (884), Expect = 2e-93, Method: Composition-based stats.
 Identities = 388/388 (100%), Positives = 388/388 (100%)

Query: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60
           MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL
Sbjct: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60

Query: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120
           LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ
Sbjct: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120

Query: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180
           FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI
Sbjct: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180

Query: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240
           VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA
Sbjct: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240

Query: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300
           YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT
Sbjct: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300

Query: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360
           NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES
Sbjct: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360

Query: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388
           FMTNISSWIQITLCIIVFSQFLKAQKIK
Sbjct: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388

>UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47GK7_DECAR
          Length = 400

 Score = 299 bits (766), Expect =
1e-79, Method: Composition-based stats. Identities = 74/397 (18%), Positives = 155/397 (39%), Gaps = 31/397 (7%) Query: 2 IYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLL 61 + +V+S + + Y K + PA +I+ V L Y+I D DA + +L Sbjct: 1 MMIVVSFLCVAIVLLQYRAKKLMAPASINAVIWLAVSLTYQIQGDSLDVLRWDAIAV-VL 59 Query: 62 CNVLTFTLSCLLTESV---LDLNIRKVNNAIYSIPSKKVHNVGLLVISFS-MIYICMRLS 117 + TF + +E + L R + +P+ +G+L I + + + Sbjct: 60 VGIATFGFGAVCSERIHFALPQPSRSGGAPSFFLPAL----LGVLTIGLAGNLGRSLEYV 115 Query: 118 NYQFGTSLL----SYMNLIRDADVEDTSRNFS--AYMQPIILTTFALFIWSKKFTNTKVS 171 ++ G SL S+ +R+ + D +F +Y P+ A + K+ K + Sbjct: 116 HFVDGMSLFGGQNSWYGSLRNTLIADHHGSFGIWSYFLPLSYAAVAYLLCDKE----KSA 171 Query: 172 KTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVG----VNRVKHYVYLITAVGVLFSL 227 + + ++ +V + L TG+ + + + +V +N+ + L+ ++F + Sbjct: 172 RHYAIITSLVTFGYVFLATGRTFILLFVTILFVVVAYRGWLNKTRSIALLVPLFLIVFWV 231 Query: 228 YMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSH---VFWFFERLMGLLTGG 284 + + GG Y MY V+P+ F + + +H F ++ L Sbjct: 232 SPIIAGRVSGGGFNYFMMYFVAPLANFD--WGMHGVFACCTHGETTFRTIFAVLAKLGFN 289 Query: 285 VSMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISVK 342 V + L + + L NVYT F Y L + G + + R + + Sbjct: 290 VPVVELIQPWAGTKLSGNVYTVFMPYYRDFGLAGVALFLFFFGALHTWISRQASLDNPLA 349 Query: 343 IFY-SYFIYTFSFIFYHESFMTNISSWIQITLCIIVF 378 +F + F Y F+ + + + +S W+Q+ + + Sbjct: 350 VFLNAIFFYALVMQFFQDQYFSLLSQWVQMIFWMSLL 386 >UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepID=Q9AYY5_BPHK6 Length = 408 Score = 187 bits (475), Expect = 5e-46, Method: Composition-based stats. 
Identities = 52/262 (19%), Positives = 103/262 (39%), Gaps = 10/262 (3%) Query: 132 IRDADVEDTSRN----FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAII 187 +R A++ED + F P+I+ FA K K+S +L+F + Sbjct: 129 LRLANIEDDYQYPTFIFMPSFYPVIMAMFASICIFKSRWIDKISVCTWVLLFAIGTMGKF 188 Query: 188 LNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYL 247 ++F+ I + + ++ Y ++ A ++ Y + +AY Y+ Sbjct: 189 AVITPIMMFVTIYELKNGISMKKIFIYAPVVLACIIIMHFYRMSDDD-SATIAYIFGTYI 247 Query: 248 VSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV--SMSLHKEFVWVGLPTNVYTA 305 SP+IAF N + + F + + + ++V+V PTNVY+ Sbjct: 248 YSPLIAFGTLID-SGINWSGDYTLRFINAINYKIGISSVEPVKTILDYVYVPSPTNVYSV 306 Query: 306 FSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIFYHESFMT 363 + VI+G + ++ ++ V + YS + F E+ +T Sbjct: 307 MQPFYSDMGIYGVAFGAVIYGVLLSAIYSSAKAGNLVMLGLYSVLSVSLITQFMSETIIT 366 Query: 364 NISSWIQITLCIIVFSQFLKAQ 385 N+S I++ LC+ V +F + Sbjct: 367 NLSGNIKLLLCMFVVFRFFTKK 388 >UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LL33_RALME Length = 456 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. 
Identities = 43/224 (19%), Positives = 97/224 (43%), Gaps = 18/224 (8%) Query: 170 VSKTFTLLVFIVFIFAII----LNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLF 225 +TF +++F +F +I + G ++ V+++ +F++ + L+T GV Sbjct: 235 TGRTFFMMLFCFLLFPLIFRGKIKLGGVVIAGVVLAASFVM--------IALLTQRGVSA 286 Query: 226 SLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV 285 + + G+ L +Y +SPI A + S + + F FF L ++ + Sbjct: 287 TAS---VDDNVDGIIKTLRVYFLSPIFAMGSVFDGNGSATYGDYTFRFFYMLANVIGLNI 343 Query: 286 SMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRN-YISVK 342 + L +++V + TNV+T Y L + G L+ +++ Sbjct: 344 EIPPLIRDYVLIPDMTNVFTVMDPYYRDFGVGGVLIFAALSGLAHDALYEKAKSEGGPYI 403 Query: 343 IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQK 386 ++ F++ F+ + +M+ +S+WIQ+ ++F + +K Sbjct: 404 FIHAAFMFPLVMQFFQDMYMSLLSTWIQVVFWYMLFVRVNSGKK 447 >UniRef50_D1PA58 Putative chain length determinant protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA58_9BACT Length = 683 Score = 162 bits (409), Expect = 3e-38, Method: Composition-based stats. Identities = 79/374 (21%), Positives = 146/374 (39%), Gaps = 22/374 (5%) Query: 22 DIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLN 81 DIF P +I++++ + +SDI + D + + FT + + T ++ N Sbjct: 298 DIFAPWTLSLLIWSILGIIIAFSSDIID-PIQDVFYTNISVWLAIFTATSISTYLLMPAN 356 Query: 82 IR-KVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDT 140 K I + + L + + +Y+ ++ MN +R+ V Sbjct: 357 QDIKTGVKGIKINITIFNILFFLSMIMTPLYM-YQIYKIVTMFDSKDLMNNMRELAVNGN 415 Query: 141 SRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVII 200 F Y I +W + K + V + I N K F+V I Sbjct: 416 GHGFLNYTMVINQALLLTGLW----RFPYLPKWKIICVVGCCLAYAIANMEKLTFFLVFI 471 Query: 201 SYAFIVGVNRV-KHYVYLITAVGVLFSLYMLFLRGLPGGMAY--------YLSMYLVSPI 251 + F++ R+ K +I + ++ Y+ L Y +L MYL+SP Sbjct: 472 TIFFVLFERRIIKLRTIVICGILLIIGFYIFNLSRSDSDSDYQKNTSILDFLGMYLMSPP 531 Query: 252 IAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEF---VWVGLPTNVYTAFSD 308 +AF + +S+S S W L G S+ H++F V+V +PTNVYT Sbjct: 532 VAFGHLR-RTISDSFCSESLWTIYAYTNRLMGSGSIIQHEDFGEFVYVPMPTNVYTIMKP 590 
Query: 309 YVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIFYHESFMTNIS 366 + + I+G ++G ++R +RN + I Y+Y ++ + F+ E I Sbjct: 591 FYQDLGTIGVAFYAFIYGLVTGFIYRKARNGNAFGICMYTYLVFVLTMQFFDELIFVAIP 650 Query: 367 SWIQITLCIIVFSQ 380 ++Q + + Q Sbjct: 651 QFLQRMFLVYIICQ 664 >UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification protein n=2 Tax=Enterobacteriaceae RepID=Q1RA39_ECOUT Length = 396 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 99/243 (40%), Gaps = 7/243 (2%) Query: 144 FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVIISYA 203 + P+++ FA+ +K K S F + ++ + + +++I + Sbjct: 148 LMPAVYPLMMAMFAIVCLTKTSKLNKYSIYFWMFLYCIGTMGKFSILTPILTYLIIYDFK 207 Query: 204 FIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVS 263 + V + + LI + + + L +Y+ SPIIA + + S Sbjct: 208 HRLKVKKTIKFTLLIIILALTLHFTRMAENDHS-TFLSILGLYIYSPIIALGQL-NEVNS 265 Query: 264 NSASSHVFWFFERLMGLLT--GGVSMSLHKEFVWVGLPTNVYTAFSDYVYISAELSYLM- 320 + + F F + + + ++ ++ +V +PTNVYTA + + Sbjct: 266 SHFGEYTFRFIYAITNKIGLIKELPVNTILDYSYVPVPTNVYTALQPFYQDFGYTGIIFG 325 Query: 321 MVIHGCISGVLWRLSRNYISVKIFYSYFIYTFS--FIFYHESFMTNISSWIQITLCIIVF 378 V++G I L+ + + Y +++ S F+ E+ +TN++ +++ LC I+ Sbjct: 326 AVLYGLIYVSLYTAGVRGNNTQALLIYALFSVSSATAFFAETLVTNLAGNVKLVLCTILL 385 Query: 379 SQF 381 +F Sbjct: 386 WRF 388 >UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Q91_RALEJ Length = 410 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. 
Identities = 70/372 (18%), Positives = 149/372 (40%), Gaps = 22/372 (5%) Query: 26 PAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLNIRKV 85 P ++ +LLGY ++ D + + L+ L +L + L + Sbjct: 26 PFQIYFFVWFSLLLGYYLSRDSFISMSVEFVLLILTAKLLALLIMILCCRGLQGRGPEIR 85 Query: 86 NNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMN--LIRDADVEDTSRN 143 + + + ++ + ++ LV ++ + R + G S+ + + +R A ED Sbjct: 86 RHGVIARKTRFI-DLAQLVAIIALPLVYARATEIAGGESVFTVLGYIQLRAAMTEDGEGY 144 Query: 144 -FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQ-IVFMVIIS 201 AY+ + T ++ I+ + +N + + L V + L+TG+ I+F + ++ Sbjct: 145 GILAYLWVLSFVTTSVSIFLYRQSNLGFGRLW--LSVAVSLCYCYLSTGRTYILFFLCLA 202 Query: 202 YAFIVGVNRVKHYVYLITAVGVLFSLYMLF---------LRGLPGGMAYYLSM---YLVS 249 ++ V ++ LIT + + + G+ + +L Y ++ Sbjct: 203 LVPLMNVGAIRMRGLLITLIIFIALSVFVAGMTAKGISADDGIVENVESFLESMKGYTIA 262 Query: 250 PIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGG--VSMSLHKEFVWVGLPTNVYTAFS 307 P++AF S + F + L ++L K + +V PTNVYT + Sbjct: 263 PLLAFSRLVEWNPDLSWGENTFRLLISIQYALGISTLAPVALMKGYAFVPDPTNVYTVYE 322 Query: 308 DYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIF-YSYFIYTFSFIFYHESFMTNIS 366 Y + L L+ + + L+R +R V IF YS +Y F+ + + + +S Sbjct: 323 VYFRDFSYLGVLIPPVFLIVHYWLYRRARARGGVWIFYYSASVYPLVMQFFQDQYFSLLS 382 Query: 367 SWIQITLCIIVF 378 +WIQ+ + Sbjct: 383 TWIQVWFWYWLL 394 >UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJ75_9BACT Length = 401 Score = 127 bits (320), Expect = 6e-28, Method: Composition-based stats. 
Identities = 75/409 (18%), Positives = 159/409 (38%), Gaps = 57/409 (13%) Query: 3 YLVISVFLITAFICL--YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 L I+ ++ AF + Y+ +D+F P V ++ +LL + ++ ++D + + Sbjct: 1 MLFIAFLIVAAFTFVGWYITRDVFSPFVLQPGVWFGILLLFYLSDPELYPIIHDFPISLI 60 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNA---IYSIPSKKVHNVGLLVISFSMIYICMRLS 117 + + ++ + D ++ PS+ V + L + S+ I L Sbjct: 61 VWTISFLGVAYPTYYYLPDHSLISRRPPLLASVLTPSRLVLKLYLFIAIISVPLILYTLM 120 Query: 118 NYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLL 177 Y + M +R A +DT + V+ + Sbjct: 121 RYGMERGESNLMTYLRIASYDDTL----------------------DKPDLGVAYYTIGV 158 Query: 178 VFIVFIFAIILNTGK--QIVFMVIISYAFIVGVNRVKHYVYLITAVGVLF---------- 225 IVFIF + ++ K +I+ +VI A ++ +++ +V+L+ V VL+ Sbjct: 159 ALIVFIFIFVYSSKKWLKILAVVINVLAALISMSKTGFFVFLVPMVYVLYLRGKIKLRTI 218 Query: 226 ----------SLYMLFLRGLPGGMAYY-----LSMYLVSPIIAFQEFYFQQVSNSASSHV 270 S++ + R + + L++Y++SP +AF + + +V Sbjct: 219 GIILLIFIGFSIWFQYARSMASQQDSFSATSMLTIYIMSPCVAFDYYVEPASATHFGEYV 278 Query: 271 FWFFERLMGLLTGGV-SMSLHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCIS 328 F F+ +M L + +S +FV V TN YT + + L ++G Sbjct: 279 FRFYYAIMHSLGSNIEPVSNVLKFVGVPEETNTYTILYPFYHDFGLPGVGLFGGLYGAFY 338 Query: 329 GVLWRLSRNY-ISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCII 376 L++ +++ I Y+ F+ F E+ ++N S +Q + I+ Sbjct: 339 AFLYKRAQSGQNVYFILYACFLNYLILQFVQENILSNFSLNLQYVILIL 387 >UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LCF1_TOLAT Length = 407 Score = 83.9 bits (206), Expect = 9e-15, Method: Composition-based stats. 
Identities = 68/377 (18%), Positives = 137/377 (36%), Gaps = 19/377 (5%) Query: 21 KDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDL 80 K I P +++ V L +T D + N+ ++ + ++ LS +L L Sbjct: 18 KFIANPFKIFFMLWFFVFLTLYLTIDDWVEISNEFIVVNITTSLSVLLLSYILKGKTECL 77 Query: 81 NIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRD-ADVED 139 + + NA+ + + +I F +++ + ++ S + + + D Sbjct: 78 HSWRDENALNELFINRYLCFVFQIICFIGLFLAYYRVSLLIPDNIFSPVGYTKLRMSIGD 137 Query: 140 TSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVI 199 + ++ L+ + +V +L I + L+TG+ M Sbjct: 138 DTVDYGILAYFFTLSFIVTSLTIILRVRGEVGSIRLILSIISSLSYCYLSTGRTFFLMFF 197 Query: 200 ISYA---FIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYY----------LSMY 246 F++GV R +H V + + +LF L + L Y Sbjct: 198 CFSFAPLFVLGVVRRQHIVLISIVIVLLFILVAYLTGKGISEEQSFIENINSFLENLRSY 257 Query: 247 LVSPIIAFQEFYFQQVSNS--ASSHVFWFFERLMGLLTG--GVSMSLHKEFVWVGLPTNV 302 ++P++A Q ++S + F F + L G M L K+FV PTNV Sbjct: 258 TIAPVVAMNMLIEQISNSSIFYGEYSFRFIFVFLSKLGVYDGSVMPLIKDFVDTPYPTNV 317 Query: 303 YTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRN-YISVKIFYSYFIYTFSFIFYHESF 361 YT + YV + + VL+ YS +Y F+ + + Sbjct: 318 YTVYDIYVRDFGYFGFASVFFIMFFHFVLYEKCMLKGGFYVFLYSASLYPLIMQFFQDQY 377 Query: 362 MTNISSWIQITLCIIVF 378 ++ +S+WIQ+++ + Sbjct: 378 LSLLSTWIQVSVIYYIL 394 >UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1CYL3_DEIDV Length = 427 Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats. 
Identities = 71/390 (18%), Positives = 141/390 (36%), Gaps = 29/390 (7%) Query: 18 YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESV 77 + +D+ YPAV I++ + L+ Y + Y L+++ ++ + VL F+ LT S Sbjct: 20 FFSRDVRYPAVLQIIVWLVTLVVYVVERHRY-VSLSESVMLIIFLGVLGFSAGSFLTLSS 78 Query: 78 LDLNIRKVNNAIYS-------IPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMN 130 L + + + LL I+ + +YI Q G + +N Sbjct: 79 LGGRSLGSRKTVLLRMRHLSDVRLVLAFFLTLLAIAGAAVYIQTATQFAQTGPTQDLALN 138 Query: 131 LIRDADVEDTSRNFS---AYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAII 187 L V +Y PI+ T + + K L + + Sbjct: 139 LRYLTSVRGEVPPLMRLTSYALPILNTLSGFCLIYYRANRDKRVLPILFLAVASALVMSV 198 Query: 188 LNTGKQIVFMVIISYAFIVGV-NRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA------ 240 +TG+ I+ II I + ++ ++ + + S++ + L G+ Sbjct: 199 FSTGRGIILFFIIEMGIIYAMTSKRLRLRLVLLGLLMFLSIFYIGASVLGKGVDQNASLL 258 Query: 241 -------YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSM-SLHKE 292 +S+YL+S I+A V++ + F L L ++ L + Sbjct: 259 ESFPDLFSSISLYLLSGILALSVQLPTLVTDEGGVNTFRTIHALGRALGFDATVVPLVQA 318 Query: 293 FVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISVKIFY--SYFI 349 F + PTNVYT + Y+ L + + G + L+ R + + Sbjct: 319 FTNIPQPTNVYTIYLTYLKDFGWLGIFIFQFLFGILHATLFIAFRRTGGAVALFWLAILS 378 Query: 350 YTFSFIFYHESFMTNISSWIQITLCIIVFS 379 + + + + + +S+WIQ +FS Sbjct: 379 FPLLTQPFTDGYFSLMSTWIQYAFFSSLFS 408 >UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane protein n=1 Tax=Ralstonia solanacearum RepID=Q8XQM0_RALSO Length = 390 Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats. 
Identities = 40/242 (16%), Positives = 89/242 (36%), Gaps = 14/242 (5%) Query: 146 AYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFI 205 +Y +L FA+++ S+ + S F ++ F+ A I ++G+ ++ ++ S Sbjct: 133 SYYFSALLVVFAVYLVSQA---SHYSPGFLVIGFVAATVAAISSSGRTLLLLLFTSTPVS 189 Query: 206 VGV-----NRVKHYVYLITAVGVLFSLYM----LFLRGLPGGMAYYLSMYLVSPIIAFQE 256 + + + L+ L + F+ L + + L +Y+++ + +F Sbjct: 190 LYLQNKIRKKTFFASLLVFLCFFLALAVLNGKGAFINDLYSQITWNLEVYVLNGLASFNH 249 Query: 257 FYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPTNVYTAFSDYVYISAEL 316 F + + + R + L G + L FV P NVYTA + + L Sbjct: 250 FVTNNHPSFDGNILVPNLLRRIFELEGDA-IPLVLPFVETPFPGNVYTALYPWYHDGGAL 308 Query: 317 SYLMM-VIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCI 375 ++ + G S + +YS Y + + ++ W+ L Sbjct: 309 GLMVGFFLIGAFSQYFYHARHKSFKHTFYYSISAYALIMTIFQDQYIQAYPLWMMAILSP 368 Query: 376 IV 377 + Sbjct: 369 FL 370 >UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae RepID=RFC_SHIDY Length = 380 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. 
Identities = 33/229 (14%), Positives = 85/229 (37%), Gaps = 10/229 (4%) Query: 158 LFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYL 217 F + F + F+ A++ + ++ ++ V++ K +Y Sbjct: 145 SFCCMYLARHENKKNYFYCFTLLSFLLAVLSTSKIFLILFLVYIVGINSYVSKKKLLIYG 204 Query: 218 ITAVG-------VLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHV 270 + G +L + + L +YL S + AF + + + ++ Sbjct: 205 VFVFGLFALSSIILGKFSSDPEGKIISAIFDTLRVYLFSGLAAFNLYVEKN--ATLPENL 262 Query: 271 FWFFERLMGLLTGGVSMSLHKEFVWVGL-PTNVYTAFSDYVYISAELSYLMMVIHGCISG 329 + + + T + + ++ +G+ TNVYTAF+ + + +++ I Sbjct: 263 LLYPFKEVWGTTKDIPKTDILPWINIGVWDTNVYTAFAPWYQSLGLYAAIIIGILLGFYY 322 Query: 330 VLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVF 378 +W R ++V + ++ + +F+ E ++ + LC I+ Sbjct: 323 GIWFSFRQNLAVGFYQTFLCFPLLMLFFQEHYLLSWKMHFIYFLCAILL 371 >UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, putative n=2 Tax=Streptococcus RepID=A3CQX5_STRSV Length = 441 Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 61/176 (34%), Gaps = 10/176 (5%) Query: 214 YVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWF 273 +V+L+ + V + G+ L Y+ SPI A + S + Sbjct: 246 FVFLVLFMAVGTFVLNRVDSRAEFGILDNLIKYMGSPIQALDYYLKNPTLYSDNQVFGEN 305 Query: 274 FERLMGLLTGGVSMSLHKEFVWVG------LPTNVYTAFSDYVYISAELSYL-MMVIHGC 326 + + +S H+ ++ TNVYT + ++ S L + +G Sbjct: 306 TLIAIYGTLKSLGLSSHELTPFLPVVHFNDDKTNVYTIYYYFIKDFGYFSVLIFQLSYGF 365 Query: 327 ISGVLWRLSRNYISV---KIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFS 379 G + + I ++ F Y F+ E+ ++ +++ I + V Sbjct: 366 FYGSFYYSIKKRYFTPLKVIVFALFAYPLVISFFQETLLSLLTTHINRIIYAFVIY 421 >UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y8A4_9FIRM Length = 441 Score = 45.0 bits (105), Expect = 0.005, Method: Composition-based stats. 
Identities = 50/216 (23%), Positives = 79/216 (36%), Gaps = 21/216 (9%) Query: 193 QIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAY-YLSMYLVSPI 251 I+F V+ S + ++ + + V +F L + YLS+YL +PI Sbjct: 225 IIIFWVMHSKKHKISFKQILLIILAASLVVGMFQTIGELLGRVSAADFGGYLSVYLSAPI 284 Query: 252 IAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSL-HKEFVWVGL------PTNVYT 304 F SA F + L G + +S E V L NVYT Sbjct: 285 RNLDYFLN-NSFASADMFGKMTFYHAINYLGGKLEISSWIYELVLPPLRANGFVTGNVYT 343 Query: 305 AFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNY---------ISVKIFYSYFIYTFSF 354 F Y+Y + + M + G IS + + ++N I YSY Y +F Sbjct: 344 TFYAYIYDFGYVGVPIFMFLMGVISQLFYIKTKNNTKYLQRDRINIWIIIYSYIFYMLAF 403 Query: 355 IFYHESFMTNI--SSWIQITLCIIVFSQFLKAQKIK 388 F+ F I + + + + FL+ KIK Sbjct: 404 SFFSNKFYEGIFSIQFFKYLIYWSLIKLFLENVKIK 439 >UniRef50_B8C4D4 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4D4_THAPS Length = 948 Score = 41.5 bits (96), Expect = 0.055, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 51/133 (38%), Gaps = 12/133 (9%) Query: 104 VISFSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSK 163 VI+ + + + R + ++ T+ + LIR A V T + +I Sbjct: 654 VIAGAFVALSGRFAEFEDMTTQSRWEALIRLALVAGTMESI------LIHLMATAICCFV 707 Query: 164 KFTNTKVSKTFTLLVFIVFIF---AIILNTGKQIVFMV---IISYAFIVGVNRVKHYVYL 217 K TK + T++ F+ IF ++L ++ + + ++ +N ++ L Sbjct: 708 KGVKTKTTTLATVIAFLEGIFHSVPLVLIECLILLTLFGPSVFELPYLTTMNPALIWMTL 767 Query: 218 ITAVGVLFSLYML 230 + V + L Sbjct: 768 LPISIVYVWFFRL 780 >UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudomonas aeruginosa LESB58 RepID=B7V2X0_PSEA8 Length = 417 Score = 41.5 bits (96), Expect = 0.056, Method: Composition-based stats. 
Identities = 63/410 (15%), Positives = 150/410 (36%), Gaps = 30/410 (7%) Query: 3 YLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLC 62 L + LI A L + +P+V + I + L L+ + S I +++ L+ L Sbjct: 4 MLTGATLLIFAVAARLLARSAIHPSVAMPITWGLGLIAVSLASLIGFYRVESDALLIFLF 63 Query: 63 NVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFG 122 V++F+LS + + R ++ + LVI F + +I Y+ Sbjct: 64 GVMSFSLSAGCFSFLYNGYFRAPSSNFLFDSELRTRA---LVIFFCLAHIVFLTVIYRDL 120 Query: 123 TSL------LSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTL 176 +S+ +YM + E + S + T L + + K L Sbjct: 121 SSIAPTLREAAYMARAQSVSGEPVLSSLSMNYLQLGQTVIPLVVLL--YLRGKCGVLGFL 178 Query: 177 LVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNR----------VKHYVYLITAVGVLFS 226 + + ++ I+L +G+ + +++ FI + + + ++L+ AVG + + Sbjct: 179 AISVPWMGVILLASGRASLMQMLVGLFFIYILVKGSPSLKSLLVIGLAMFLVIAVGAVAT 238 Query: 227 LYMLFLRGLPGGMAYY-----LSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLL 281 + F G + ++ Y + + F +Y + F ++ + Sbjct: 239 SKIQFHEGDGISTLFIELYRHVAGYALQGPVLFDRYYQGSIHLEPYWSPLNGFCSILATV 298 Query: 282 TGGVSMSLHKE-FVWVG-LPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRNY 338 LH + + + NVY+ F L + +M ++G + + ++ Sbjct: 299 GLCQKPPLHLDFYEYAPGELGNVYSMFFSMYPHYGALGVIGVMALYGMLCSYAYCKAKKG 358 Query: 339 ISVK-IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQKI 387 + SY F + + T+ ++++T+ + + + ++ Sbjct: 359 SLYFTVLSSYLFSAIVFSLFSDQISTSWWFYVKMTIILGILCFVFRRDRM 408 >UniRef50_C0D232 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D232_9CLOT Length = 478 Score = 41.1 bits (95), Expect = 0.071, Method: Composition-based stats. 
Identities = 36/225 (16%), Positives = 76/225 (33%), Gaps = 22/225 (9%) Query: 175 TLLVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRG 234 + I + ++ + Q++ V ++ + V R YL+ ++ Y+ Sbjct: 243 AAVTGISILIPLLCVSRFQLILAVGMAVFTFISVRRTFRLKYLLILCALMVPAYLALTVL 302 Query: 235 LPGGMAYYLS-----------------MYLVSPIIAFQEFYFQQVSNSASSHVFWFFERL 277 ++Y MY+ + F Q S+S + + L Sbjct: 303 RSHSVSYLNGIFEMKNPHTPIFVTQPYMYIANNYDNFNCLVEQLGSHSMGLRMLFPVWAL 362 Query: 278 MGLLTGGVSMSLHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWR-LS 335 GL S+ FV T V + Y L L ++ G + +L+R + Sbjct: 363 TGLKFLNPSLVSFPIFVTKEELTTVTLIYDAYY-DFGMLGILLFGLLTGAVCALLYRLRT 421 Query: 336 RNYISV-KIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFS 379 R V + Y+ + F+ ++ +N S+W + + + Sbjct: 422 RASNPVCHVIYAQIAMYMALAFFT-TWFSNPSTWFYLAVTAAAYW 465 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli Re... 246 7e-64 UniRef50_D1PA58 Putative chain length determinant protein n=1 Ta... 232 2e-59 UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechlor... 215 2e-54 UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralston... 214 3e-54 UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriav... 185 2e-45 UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinoco... 183 8e-45 UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevote... 181 3e-44 UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumon... 181 4e-44 UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepI... 173 1e-41 UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane prot... 149 1e-34 UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification prote... 148 4e-34 UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae ... 
130 1e-28

Sequences not found previously or not previously below threshold:

UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, puta...    62   4e-08
UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdema...    47   9e-04
UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudom...    46   0.002
UniRef50_B5L442 Wzy n=1 Tax=Shigella boydii RepID=B5L442_SHIBO         45   0.004
UniRef50_B8F9T4 Putative uncharacterized protein n=1 Tax=Desulfa...    45   0.004
UniRef50_Q5FID5 Polysaccharide polymerase n=3 Tax=Lactobacillus ...    42   0.039
UniRef50_A5LJS7 Polysaccharide polymerase n=14 Tax=Streptococcus...    42   0.043

>UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli RepID=RFC_ECOLI
          Length = 388

 Score = 246 bits (629), Expect = 7e-64, Method: Composition-based stats.
 Identities = 388/388 (100%), Positives = 388/388 (100%)

Query: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60
           MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL
Sbjct: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60

Query: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120
           LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ
Sbjct: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120

Query: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180
           FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI
Sbjct: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180

Query: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240
           VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA
Sbjct: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240

Query: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300
           YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT
Sbjct: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300

Query: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360
NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES Sbjct: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360 Query: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388 FMTNISSWIQITLCIIVFSQFLKAQKIK Sbjct: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388 >UniRef50_D1PA58 Putative chain length determinant protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA58_9BACT Length = 683 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. Identities = 79/374 (21%), Positives = 146/374 (39%), Gaps = 22/374 (5%) Query: 22 DIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLN 81 DIF P +I++++ + +SDI + D + + FT + + T ++ N Sbjct: 298 DIFAPWTLSLLIWSILGIIIAFSSDIIDP-IQDVFYTNISVWLAIFTATSISTYLLMPAN 356 Query: 82 IR-KVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDT 140 K I + + L + + +Y+ ++ MN +R+ V Sbjct: 357 QDIKTGVKGIKINITIFNILFFLSMIMTPLYM-YQIYKIVTMFDSKDLMNNMRELAVNGN 415 Query: 141 SRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVII 200 F Y I +W + K + V + I N K F+V I Sbjct: 416 GHGFLNYTMVINQALLLTGLW----RFPYLPKWKIICVVGCCLAYAIANMEKLTFFLVFI 471 Query: 201 SYAFIVGVNR-VKHYVYLITAVGVLFSLYMLFLRGLPGGMAY--------YLSMYLVSPI 251 + F++ R +K +I + ++ Y+ L Y +L MYL+SP Sbjct: 472 TIFFVLFERRIIKLRTIVICGILLIIGFYIFNLSRSDSDSDYQKNTSILDFLGMYLMSPP 531 Query: 252 IAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKE---FVWVGLPTNVYTAFSD 308 +AF + +S+S S W L G S+ H++ FV+V +PTNVYT Sbjct: 532 VAFGHLR-RTISDSFCSESLWTIYAYTNRLMGSGSIIQHEDFGEFVYVPMPTNVYTIMKP 590 Query: 309 YVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIFYHESFMTNIS 366 + + I+G ++G ++R +RN + I Y+Y ++ + F+ E I Sbjct: 591 FYQDLGTIGVAFYAFIYGLVTGFIYRKARNGNAFGICMYTYLVFVLTMQFFDELIFVAIP 650 Query: 367 SWIQITLCIIVFSQ 380 ++Q + + Q Sbjct: 651 QFLQRMFLVYIICQ 664 >UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47GK7_DECAR Length = 400 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. 
Identities = 74/397 (18%), Positives = 154/397 (38%), Gaps = 31/397 (7%) Query: 2 IYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLL 61 + +V+S + + Y K + PA +I+ V L Y+I D DA + +L Sbjct: 1 MMIVVSFLCVAIVLLQYRAKKLMAPASINAVIWLAVSLTYQIQGDSLDVLRWDAIAV-VL 59 Query: 62 CNVLTFTLSCLLTESV---LDLNIRKVNNAIYSIPSKKVHNVGLLVISFS-MIYICMRLS 117 + TF + +E + L R + +P+ +G+L I + + + Sbjct: 60 VGIATFGFGAVCSERIHFALPQPSRSGGAPSFFLPAL----LGVLTIGLAGNLGRSLEYV 115 Query: 118 NYQFGTSLL----SYMNLIRDADVEDTSRNFS--AYMQPIILTTFALFIWSKKFTNTKVS 171 ++ G SL S+ +R+ + D +F +Y P+ A + K K + Sbjct: 116 HFVDGMSLFGGQNSWYGSLRNTLIADHHGSFGIWSYFLPLSYAAVAYLLCDK----EKSA 171 Query: 172 KTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVG----VNRVKHYVYLITAVGVLFSL 227 + + ++ +V + L TG+ + + + +V +N+ + L+ ++F + Sbjct: 172 RHYAIITSLVTFGYVFLATGRTFILLFVTILFVVVAYRGWLNKTRSIALLVPLFLIVFWV 231 Query: 228 YMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSH---VFWFFERLMGLLTGG 284 + + GG Y MY V+P+ F + + +H F ++ L Sbjct: 232 SPIIAGRVSGGGFNYFMMYFVAPLANFD--WGMHGVFACCTHGETTFRTIFAVLAKLGFN 289 Query: 285 VSMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISVK 342 V + L + + L NVYT F Y L + G + + R + + Sbjct: 290 VPVVELIQPWAGTKLSGNVYTVFMPYYRDFGLAGVALFLFFFGALHTWISRQASLDNPLA 349 Query: 343 IFY-SYFIYTFSFIFYHESFMTNISSWIQITLCIIVF 378 +F + F Y F+ + + + +S W+Q+ + + Sbjct: 350 VFLNAIFFYALVMQFFQDQYFSLLSQWVQMIFWMSLL 386 >UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Q91_RALEJ Length = 410 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. 
Identities = 68/373 (18%), Positives = 148/373 (39%), Gaps = 22/373 (5%) Query: 25 YPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLNIRK 84 P ++ +LLGY ++ D + + L+ L +L + L + Sbjct: 25 NPFQIYFFVWFSLLLGYYLSRDSFISMSVEFVLLILTAKLLALLIMILCCRGLQGRGPEI 84 Query: 85 VNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNL--IRDADVEDTSR 142 + + + ++ + ++ LV ++ + R + G S+ + + +R A ED Sbjct: 85 RRHGVIARKTRFI-DLAQLVAIIALPLVYARATEIAGGESVFTVLGYIQLRAAMTEDGEG 143 Query: 143 N-FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQ-IVFMVII 200 AY+ + T ++ I+ + +N + + L V + L+TG+ I+F + + Sbjct: 144 YGILAYLWVLSFVTTSVSIFLYRQSNLGFGRLW--LSVAVSLCYCYLSTGRTYILFFLCL 201 Query: 201 SYAFIVGVNRVKHYVYLITAVGVLFSLYMLF---------LRGLPGGMAYYLSM---YLV 248 + ++ V ++ LIT + + + G+ + +L Y + Sbjct: 202 ALVPLMNVGAIRMRGLLITLIIFIALSVFVAGMTAKGISADDGIVENVESFLESMKGYTI 261 Query: 249 SPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGG--VSMSLHKEFVWVGLPTNVYTAF 306 +P++AF S + F + L ++L K + +V PTNVYT + Sbjct: 262 APLLAFSRLVEWNPDLSWGENTFRLLISIQYALGISTLAPVALMKGYAFVPDPTNVYTVY 321 Query: 307 SDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVK-IFYSYFIYTFSFIFYHESFMTNI 365 Y + L L+ + + L+R +R V +YS +Y F+ + + + + Sbjct: 322 EVYFRDFSYLGVLIPPVFLIVHYWLYRRARARGGVWIFYYSASVYPLVMQFFQDQYFSLL 381 Query: 366 SSWIQITLCIIVF 378 S+WIQ+ + Sbjct: 382 STWIQVWFWYWLL 394 >UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LL33_RALME Length = 456 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 68/404 (16%), Positives = 154/404 (38%), Gaps = 21/404 (5%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 M++ + + L+ I + + P +++ V +G + D + D + Sbjct: 47 MLFACVLMALVGLLISVKIGPGRGNPFTAFFCVWSAVTVGAFLAEDTFLTISEDFWA--M 104 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 L L+ L C L S+ + ++ K ++G +V ++ I ++ S Sbjct: 105 LGAFLSIYLGCALYSSLRARVLPVESHISTISYRDKFVDIGQIVSVIALPIIYLKASEIA 164 Query: 121 FGTSLLSYMNL--IRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLV 178 S+ S +R A + + + ++ + ++ I +F + + +K + Sbjct: 165 -EMSVFSPAGYMALRAAFIAEVAASYGPIGYIVPVSFVIASIRLTQFLDKQTNKKNLAIA 223 Query: 179 FIVFIFAIILNTGKQIVFMVIISY-AFIVGVNRVKHYVYLITAVGVLFSLYMLFL----- 232 V + L TG+ M+ ++ ++K +I V + S M+ L Sbjct: 224 IAVALCMAYLTTGRTFFMMLFCFLLFPLIFRGKIKLGGVVIAGVVLAASFVMIALLTQRG 283 Query: 233 -------RGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV 285 G+ L +Y +SPI A + S + + F FF L ++ + Sbjct: 284 VSATASVDDNVDGIIKTLRVYFLSPIFAMGSVFDGNGSATYGDYTFRFFYMLANVIGLNI 343 Query: 286 SMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRN-YISVK 342 + L +++V + TNV+T Y L + G L+ +++ Sbjct: 344 EIPPLIRDYVLIPDMTNVFTVMDPYYRDFGVGGVLIFAALSGLAHDALYEKAKSEGGPYI 403 Query: 343 IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQK 386 ++ F++ F+ + +M+ +S+WIQ+ ++F + +K Sbjct: 404 FIHAAFMFPLVMQFFQDMYMSLLSTWIQVVFWYMLFVRVNSGKK 447 >UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1CYL3_DEIDV Length = 427 Score = 183 bits (465), Expect = 8e-45, Method: Composition-based stats. 
Identities = 71/390 (18%), Positives = 141/390 (36%), Gaps = 29/390 (7%) Query: 18 YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESV 77 + +D+ YPAV I++ + L+ Y + Y L+++ ++ + VL F+ LT S Sbjct: 20 FFSRDVRYPAVLQIIVWLVTLVVYVVERHRY-VSLSESVMLIIFLGVLGFSAGSFLTLSS 78 Query: 78 LDLNIRKVNNAIYS-------IPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMN 130 L + + + LL I+ + +YI Q G + +N Sbjct: 79 LGGRSLGSRKTVLLRMRHLSDVRLVLAFFLTLLAIAGAAVYIQTATQFAQTGPTQDLALN 138 Query: 131 LIRDADVEDTSRNFS---AYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAII 187 L V +Y PI+ T + + K L + + Sbjct: 139 LRYLTSVRGEVPPLMRLTSYALPILNTLSGFCLIYYRANRDKRVLPILFLAVASALVMSV 198 Query: 188 LNTGKQIVFMVIISYAFIVGV-NRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA------ 240 +TG+ I+ II I + ++ ++ + + S++ + L G+ Sbjct: 199 FSTGRGIILFFIIEMGIIYAMTSKRLRLRLVLLGLLMFLSIFYIGASVLGKGVDQNASLL 258 Query: 241 -------YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVS-MSLHKE 292 +S+YL+S I+A V++ + F L L + + L + Sbjct: 259 ESFPDLFSSISLYLLSGILALSVQLPTLVTDEGGVNTFRTIHALGRALGFDATVVPLVQA 318 Query: 293 FVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISVKIFY--SYFI 349 F + PTNVYT + Y+ L + + G + L+ R + + Sbjct: 319 FTNIPQPTNVYTIYLTYLKDFGWLGIFIFQFLFGILHATLFIAFRRTGGAVALFWLAILS 378 Query: 350 YTFSFIFYHESFMTNISSWIQITLCIIVFS 379 + + + + + +S+WIQ +FS Sbjct: 379 FPLLTQPFTDGYFSLMSTWIQYAFFSSLFS 408 >UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJ75_9BACT Length = 401 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. 
Identities = 67/391 (17%), Positives = 153/391 (39%), Gaps = 21/391 (5%) Query: 3 YLVISVFLITAFICL--YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 L I+ ++ AF + Y+ +D+F P V ++ +LL + ++ ++D + + Sbjct: 1 MLFIAFLIVAAFTFVGWYITRDVFSPFVLQPGVWFGILLLFYLSDPELYPIIHDFPISLI 60 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNA---IYSIPSKKVHNVGLLVISFSMIYICMRLS 117 + + ++ + D ++ PS+ V + L + S+ I L Sbjct: 61 VWTISFLGVAYPTYYYLPDHSLISRRPPLLASVLTPSRLVLKLYLFIAIISVPLILYTLM 120 Query: 118 NYQFGTSLLSYMNLIRDADVEDTS--RNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFT 175 Y + M +R A +DT + I + + K Sbjct: 121 RYGMERGESNLMTYLRIASYDDTLDKPDLGVAYYTIGVALIVFIFIFVYSS----KKWLK 176 Query: 176 LLVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYV--YLITAVGVLFSLYMLFLR 233 +L ++ + A +++ K F+ ++ +++ + +I + + FS++ + R Sbjct: 177 ILAVVINVLAALISMSKTGFFVFLVPMVYVLYLRGKIKLRTIGIILLIFIGFSIWFQYAR 236 Query: 234 GLPGGMAYY-----LSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV-SM 287 + + L++Y++SP +AF + + +VF F+ +M L + + Sbjct: 237 SMASQQDSFSATSMLTIYIMSPCVAFDYYVEPASATHFGEYVFRFYYAIMHSLGSNIEPV 296 Query: 288 SLHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNY-ISVKIFY 345 S +FV V TN YT + + L ++G L++ +++ I Y Sbjct: 297 SNVLKFVGVPEETNTYTILYPFYHDFGLPGVGLFGGLYGAFYAFLYKRAQSGQNVYFILY 356 Query: 346 SYFIYTFSFIFYHESFMTNISSWIQITLCII 376 + F+ F E+ ++N S +Q + I+ Sbjct: 357 ACFLNYLILQFVQENILSNFSLNLQYVILIL 387 >UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LCF1_TOLAT Length = 407 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. 
Identities = 68/377 (18%), Positives = 137/377 (36%), Gaps = 19/377 (5%) Query: 21 KDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDL 80 K I P +++ V L +T D + N+ ++ + ++ LS +L L Sbjct: 18 KFIANPFKIFFMLWFFVFLTLYLTIDDWVEISNEFIVVNITTSLSVLLLSYILKGKTECL 77 Query: 81 NIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRD-ADVED 139 + + NA+ + + +I F +++ + ++ S + + + D Sbjct: 78 HSWRDENALNELFINRYLCFVFQIICFIGLFLAYYRVSLLIPDNIFSPVGYTKLRMSIGD 137 Query: 140 TSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVI 199 + ++ L+ + +V +L I + L+TG+ M Sbjct: 138 DTVDYGILAYFFTLSFIVTSLTIILRVRGEVGSIRLILSIISSLSYCYLSTGRTFFLMFF 197 Query: 200 ISYA---FIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYY----------LSMY 246 F++GV R +H V + + +LF L + L Y Sbjct: 198 CFSFAPLFVLGVVRRQHIVLISIVIVLLFILVAYLTGKGISEEQSFIENINSFLENLRSY 257 Query: 247 LVSPIIAFQEFYFQQVSNS--ASSHVFWFFERLMGLLTG--GVSMSLHKEFVWVGLPTNV 302 ++P++A Q ++S + F F + L G M L K+FV PTNV Sbjct: 258 TIAPVVAMNMLIEQISNSSIFYGEYSFRFIFVFLSKLGVYDGSVMPLIKDFVDTPYPTNV 317 Query: 303 YTAFSDYVYISAELSYLMMVIHGCISGVLWRLSR-NYISVKIFYSYFIYTFSFIFYHESF 361 YT + YV + + VL+ YS +Y F+ + + Sbjct: 318 YTVYDIYVRDFGYFGFASVFFIMFFHFVLYEKCMLKGGFYVFLYSASLYPLIMQFFQDQY 377 Query: 362 MTNISSWIQITLCIIVF 378 ++ +S+WIQ+++ + Sbjct: 378 LSLLSTWIQVSVIYYIL 394 >UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepID=Q9AYY5_BPHK6 Length = 408 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. 
Identities = 67/393 (17%), Positives = 139/393 (35%), Gaps = 16/393 (4%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 +I + + + Y++ I P + + I+ D + + L Sbjct: 4 LIIIFVLCSAMHILAIKYMRCRITSPLSLSLFSWYFMAFTGIISYDSFYDFRVETFYC-L 62 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 L + + S + E K Y I K +++ I Sbjct: 63 LIWISLTSFSYAVFEIAQRNKPLK-----YKIKEKNRICSRYSLLAIPACLITAYEIYKV 117 Query: 121 FGTSLLSYMNLIRDADVEDTSRN----FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTL 176 ++++ +R A++ED + F P+I+ FA K K+S + Sbjct: 118 GSNGPVNFLLNLRLANIEDDYQYPTFIFMPSFYPVIMAMFASICIFKSRWIDKISVCTWV 177 Query: 177 LVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLP 236 L+F + ++F+ I + + ++ Y ++ A ++ Y + Sbjct: 178 LLFAIGTMGKFAVITPIMMFVTIYELKNGISMKKIFIYAPVVLACIIIMHFYRMSDDD-S 236 Query: 237 GGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV--SMSLHKEFV 294 +AY Y+ SP+IAF N + + F + + + ++V Sbjct: 237 ATIAYIFGTYIYSPLIAFGTLID-SGINWSGDYTLRFINAINYKIGISSVEPVKTILDYV 295 Query: 295 WVGLPTNVYTAFSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTF 352 +V PTNVY+ + VI+G + ++ ++ V + YS + Sbjct: 296 YVPSPTNVYSVMQPFYSDMGIYGVAFGAVIYGVLLSAIYSSAKAGNLVMLGLYSVLSVSL 355 Query: 353 SFIFYHESFMTNISSWIQITLCIIVFSQFLKAQ 385 F E+ +TN+S I++ LC+ V +F + Sbjct: 356 ITQFMSETIITNLSGNIKLLLCMFVVFRFFTKK 388 >UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane protein n=1 Tax=Ralstonia solanacearum RepID=Q8XQM0_RALSO Length = 390 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. 
Identities = 52/375 (13%), Positives = 122/375 (32%), Gaps = 17/375 (4%) Query: 23 IFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLNI 82 I P + +++ ++G+ + + + + +T + + Sbjct: 11 IVSPPIIFLAAWSMQVIGHTLLQRDFDKFSDHTWWLLAAAAFSFILGCAFVTFTYIGRRR 70 Query: 83 RKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDTSR 142 + K LL I+ + N +S +E + Sbjct: 71 PNRGITPSKLRGGKRAFWILLTA--YGIFGLAPIINILLDQGSISGARDAIVKGIESRND 128 Query: 143 NFS-AYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVIIS 201 + +Y +L FA+++ S+ S F ++ F+ A I ++G+ ++ ++ S Sbjct: 129 SIVRSYYFSALLVVFAVYLVSQAS---HYSPGFLVIGFVAATVAAISSSGRTLLLLLFTS 185 Query: 202 YAFIVGV-----NRVKHYVYLITAVGVLFSLYM----LFLRGLPGGMAYYLSMYLVSPII 252 + + + L+ L + F+ L + + L +Y+++ + Sbjct: 186 TPVSLYLQNKIRKKTFFASLLVFLCFFLALAVLNGKGAFINDLYSQITWNLEVYVLNGLA 245 Query: 253 AFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPTNVYTAFSDYVYI 312 +F F + + + R + L G + L FV P NVYTA + + Sbjct: 246 SFNHFVTNNHPSFDGNILVPNLLRRIFELEGDA-IPLVLPFVETPFPGNVYTALYPWYHD 304 Query: 313 SAELSYLMMV-IHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISSWIQI 371 L ++ + G S + +YS Y + + ++ W+ Sbjct: 305 GGALGLMVGFFLIGAFSQYFYHARHKSFKHTFYYSISAYALIMTIFQDQYIQAYPLWMMA 364 Query: 372 TLCIIVFSQFLKAQK 386 L + S + Sbjct: 365 ILSPFLASALTPKMR 379 >UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification protein n=2 Tax=Enterobacteriaceae RepID=Q1RA39_ECOUT Length = 396 Score = 148 bits (373), Expect = 4e-34, Method: Composition-based stats. 
Identities = 60/381 (15%), Positives = 139/381 (36%), Gaps = 12/381 (3%) Query: 9 FLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFT 68 + F L K I P + + L L+ D + + L+ + F Sbjct: 12 VIAIMFSLLGTKSRITSPLPLHFLPWLLTLIVGISNYDQFYEFNERSFYSLLIWFTVIFI 71 Query: 69 LSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSY 128 + +++ +N K +++ +Y + G + + Sbjct: 72 FYFI--GELVNYKRENINVYYGLSHIKYECKKYWIIVIPISLYTIFEIYMVGMGGADGFF 129 Query: 129 MNLIRDADVEDTSRN---FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFA 185 +NL +E + + P+++ FA+ +K K S F + ++ + Sbjct: 130 LNLRLANTLEGYTGKKFILMPAVYPLMMAMFAIVCLTKTSKLNKYSIYFWMFLYCIGTMG 189 Query: 186 IILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYYLSM 245 + +++I + + V + + LI + + + L + Sbjct: 190 KFSILTPILTYLIIYDFKHRLKVKKTIKFTLLIIILALTLHFTRMAENDH-STFLSILGL 248 Query: 246 YLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLT--GGVSMSLHKEFVWVGLPTNVY 303 Y+ SPIIA + + S+ + F F + + + ++ ++ +V +PTNVY Sbjct: 249 YIYSPIIALGQL-NEVNSSHFGEYTFRFIYAITNKIGLIKELPVNTILDYSYVPVPTNVY 307 Query: 304 TAFSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKIFYSYFIYTFS--FIFYHES 360 TA + + V++G I L+ + + Y +++ S F+ E+ Sbjct: 308 TALQPFYQDFGYTGIIFGAVLYGLIYVSLYTAGVRGNNTQALLIYALFSVSSATAFFAET 367 Query: 361 FMTNISSWIQITLCIIVFSQF 381 +TN++ +++ LC I+ +F Sbjct: 368 LVTNLAGNVKLVLCTILLWRF 388 >UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae RepID=RFC_SHIDY Length = 380 Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats. 
Identities = 37/262 (14%), Positives = 96/262 (36%), Gaps = 10/262 (3%) Query: 125 LLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIF 184 +L Y N + + + S + ++ F + F + F+ Sbjct: 112 ILLYNNHFSLKVMREGILDGSISGFGLGISLPLSFCCMYLARHENKKNYFYCFTLLSFLL 171 Query: 185 AIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVG-------VLFSLYMLFLRGLPG 237 A++ + ++ ++ V++ K +Y + G +L + Sbjct: 172 AVLSTSKIFLILFLVYIVGINSYVSKKKLLIYGVFVFGLFALSSIILGKFSSDPEGKIIS 231 Query: 238 GMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVG 297 + L +YL S + AF + + + ++ + + + T + + ++ +G Sbjct: 232 AIFDTLRVYLFSGLAAFNLYVEKN--ATLPENLLLYPFKEVWGTTKDIPKTDILPWINIG 289 Query: 298 L-PTNVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIF 356 + TNVYTAF+ + + +++ I +W R ++V + ++ + +F Sbjct: 290 VWDTNVYTAFAPWYQSLGLYAAIIIGILLGFYYGIWFSFRQNLAVGFYQTFLCFPLLMLF 349 Query: 357 YHESFMTNISSWIQITLCIIVF 378 + E ++ + LC I+ Sbjct: 350 FQEHYLLSWKMHFIYFLCAILL 371 >UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, putative n=2 Tax=Streptococcus RepID=A3CQX5_STRSV Length = 441 Score = 61.6 bits (148), Expect = 4e-08, Method: Composition-based stats. 
Identities = 54/400 (13%), Positives = 117/400 (29%), Gaps = 41/400 (10%) Query: 21 KDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDL 80 +D PA I+ + + S + L+ T+ + F + LL+ ++ Sbjct: 22 RDYANPAFIYLAIWMIASIFTAFYSSKWGEGLSLITVTVIFVGNAIFLMGVLLSSNLFAE 81 Query: 81 NIRKVNNAIYSIP----SKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDAD 136 + + + + V + + + Q L + L R Sbjct: 82 RKLDAKPSQIKVSNFFIILVLLFLAYAVRFIYSDLLYLAAQSKQVPGGLFKTIELARHMT 141 Query: 137 VEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVS-------KTFTLLVFIVFIFAIILN 189 + + + ++ F + S K LLV ++ + +L+ Sbjct: 142 TNYDFSLSRLSLNLLRVNFSLGIVFFYFFCESLFSGKDGIFYKGKLLLVSLISLGVSLLS 201 Query: 190 TGKQIVFMVIISYAFIVGV-------------NRVKHYVYLITAVGVLFSLYMLF----- 231 TG+ + ++ YA + + LI L + Sbjct: 202 TGRTELLGLVAGYAIVYMLFFSKYYSWRDSRYGVKLFRSLLIIGFVFLVLFMAVGTFVLN 261 Query: 232 --LRGLPGGMAYYLSMYLVSPIIAFQEFYFQ----QVSNSASSHVFWFFERLMGLLTGGV 285 G+ L Y+ SPI A + + + + L Sbjct: 262 RVDSRAEFGILDNLIKYMGSPIQALDYYLKNPTLYSDNQVFGENTLIAIYGTLKSLGLSS 321 Query: 286 -SMSLHKEFVWVGLP-TNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRNYISV- 341 ++ V TNVYT + ++ S L + +G G + + Sbjct: 322 HELTPFLPVVHFNDDKTNVYTIYYYFIKDFGYFSVLIFQLSYGFFYGSFYYSIKKRYFTP 381 Query: 342 --KIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFS 379 I ++ F Y F+ E+ ++ +++ I + V Sbjct: 382 LKVIVFALFAYPLVISFFQETLLSLLTTHINRIIYAFVIY 421 >UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y8A4_9FIRM Length = 441 Score = 47.4 bits (111), Expect = 9e-04, Method: Composition-based stats. 
Identities = 77/435 (17%), Positives = 144/435 (33%), Gaps = 49/435 (11%) Query: 3 YLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLC 62 YL IS+ +I K+DI +P + F L + + + +L T ++ Sbjct: 5 YLAISLIIILVLCFFITKQDIIHPTIAFIAPFTLAAIDLLYNINKWNVELKMNTYYVIIG 64 Query: 63 NVLTFTLSCLL------TESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICM-- 114 L F L+ + NI+ ++ IY +++ + L++ IC+ Sbjct: 65 GTLVFILATFIIDIGYKQLRKKCQNIKYKSDTIYKYNISQINKLLFLMLQIFTFLICLIG 124 Query: 115 ----RLSNYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFAL-FIWSKKFTNTK 169 G+ ++ + +A F+W Sbjct: 125 VIKVARRFGVSGSISELIAGYKNLKTFTTEDVGLGKFVNTLYDFCYASGFVWFYLVAKKY 184 Query: 170 VSKTFTLLVFIVFIFAIILNTGK-----------------QIVFMVIISYAFIVGVNRVK 212 + K + ++ + I + + I+F V+ S + ++ Sbjct: 185 IFKKKWDKLVLINLCLSIAISLEKGSRGGAIALLCSGAVMIIIFWVMHSKKHKISFKQIL 244 Query: 213 HYVYL-ITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVF 271 + VG+ ++ L R YLS+YL +PI F +++ Sbjct: 245 LIILAASLVVGMFQTIGELLGRVSAADFGGYLSVYLSAPIRNLDYFLNNSFASADMFGKM 304 Query: 272 WFFERLMGLLTGGVSMSLHKEFVWVGL------PTNVYTAFSDYVYISAELSY-LMMVIH 324 F+ + L S E V L NVYT F Y+Y + + M + Sbjct: 305 TFYHAINYLGGKLEISSWIYELVLPPLRANGFVTGNVYTTFYAYIYDFGYVGVPIFMFLM 364 Query: 325 GCISGVLWRLSRNYISV---------KIFYSYFIYTFSFIFYHESFMTNI--SSWIQITL 373 G IS + + ++N I YSY Y +F F+ F I + + + Sbjct: 365 GVISQLFYIKTKNNTKYLQRDRINIWIIIYSYIFYMLAFSFFSNKFYEGIFSIQFFKYLI 424 Query: 374 CIIVFSQFLKAQKIK 388 + FL+ KIK Sbjct: 425 YWSLIKLFLENVKIK 439 >UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudomonas aeruginosa LESB58 RepID=B7V2X0_PSEA8 Length = 417 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. 
Identities = 60/405 (14%), Positives = 144/405 (35%), Gaps = 20/405 (4%) Query: 3 YLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLC 62 L + LI A L + +P+V + I + L L+ + S I +++ L+ L Sbjct: 4 MLTGATLLIFAVAARLLARSAIHPSVAMPITWGLGLIAVSLASLIGFYRVESDALLIFLF 63 Query: 63 NVLTFTLSCLLTESVLDLNIRK-VNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQF 121 V++F+LS + + R +N ++ + V ++ + + Sbjct: 64 GVMSFSLSAGCFSFLYNGYFRAPSSNFLFDSELRTRALVIFFCLAHIVFLTVIYRDLSSI 123 Query: 122 GTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIV 181 +L + R V S M + L + + + K L + + Sbjct: 124 APTLREAAYMARAQSVSGEPVLSSLSMNYLQLGQTVIPLVVLLYLRGKCGVLGFLAISVP 183 Query: 182 FIFAIILNTGKQIVFMVIISYAFIVGVNR----------VKHYVYLITAVGVLFSLYMLF 231 ++ I+L +G+ + +++ FI + + + ++L+ AVG + + + F Sbjct: 184 WMGVILLASGRASLMQMLVGLFFIYILVKGSPSLKSLLVIGLAMFLVIAVGAVATSKIQF 243 Query: 232 -LRGLPGGMAYYL----SMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVS 286 + L + Y + + F +Y + F ++ + Sbjct: 244 HEGDGISTLFIELYRHVAGYALQGPVLFDRYYQGSIHLEPYWSPLNGFCSILATVGLCQK 303 Query: 287 MSLHKEFVWV--GLPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRNYISVK- 342 LH +F G NVY+ F L + +M ++G + + ++ Sbjct: 304 PPLHLDFYEYAPGELGNVYSMFFSMYPHYGALGVIGVMALYGMLCSYAYCKAKKGSLYFT 363 Query: 343 IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQKI 387 + SY F + + T+ ++++T+ + + + ++ Sbjct: 364 VLSSYLFSAIVFSLFSDQISTSWWFYVKMTIILGILCFVFRRDRM 408 >UniRef50_B5L442 Wzy n=1 Tax=Shigella boydii RepID=B5L442_SHIBO Length = 395 Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats. 
Identities = 61/402 (15%), Positives = 126/402 (31%), Gaps = 34/402 (8%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 M + + L+ + + L + KDI P+ + I+ +L + DI + L L Sbjct: 3 MTIFIAIIMLMYSGLLLSINKDIKSPSAILFFIWGGLLFLSGVNGDITNYTLCIILFSCL 62 Query: 61 LCNVLTFTLS--CLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSN 118 +V + E V NI + + +I F Y + Sbjct: 63 SFSVGALLAKPYAPIFEHVFSYNINNTKALVLITNIISFVILIYTLIVFLYYYKGGFTES 122 Query: 119 YQFGTSLLSY---MNLIRDADVEDTSRNFSAYMQPIILTTFALFIWS--KKFTNTKVSKT 173 Y + ++Y LI+ Y+ + ++ ++T Sbjct: 123 YINSRTEINYGDKSGLIKIYGYIYYLLYPLVYVWSFLYFKNKYKKIEEGERARLPYNNRT 182 Query: 174 FTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFL- 232 F + F +A++ +++ ++I + ++ L+T G++F Y+ L Sbjct: 183 FYFVFFTSLFYALLSTAKIKVLLLIIPIVFLRLFFQKISLKYLLVTVSGIIFFFYLSMLF 242 Query: 233 -------RGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV 285 G + Y L Y ++ I A + + G Sbjct: 243 LNKIDDSSGAFEALKYGLMNYSLANIFALDSVLQGKAVVIDCN--------------GDA 288 Query: 286 SMSLHKEFVWVGLPTNVYTAFSDY--VYISAELSYLMMVIHGCISGVLWRLSRNYISV-- 341 + L + TNV+T F S + + G L+ +++ Sbjct: 289 TCGLANFISYKEYKTNVFTIFYSLTKYSDF-LYSLIFFFVIGFFHSSLYNVAKKNKKTIS 347 Query: 342 KIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLK 383 + S + F F+ + +M + L +I F L Sbjct: 348 VVICSILYFPLLFQFFDQLYMLMFYIYAIAMLYVICFCSRLT 389 >UniRef50_B8F9T4 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9T4_DESAA Length = 449 Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats. 
Identities = 49/373 (13%), Positives = 120/373 (32%), Gaps = 30/373 (8%) Query: 17 LYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLT-- 74 L + KD+F P + L+ + S Y + + ++ +F L ++ Sbjct: 59 LLIDKDLFSPYTLFALAPFCTLIYDDFLSPSYLPLPDGLAVTCIIIGQASFLLGLKMSGV 118 Query: 75 -ESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQF-GTSLLSYMNLI 132 SV+ N + + + ++ I ++ + +R N QF + + + Sbjct: 119 VRSVIWRNKKGQKHQNRLASKDNYRLLFIVGIVPFLLSLSLRTMNIQFSAEGIDEFRSNF 178 Query: 133 RDADVEDTSRNFSA-----------YMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIV 181 + F+ Y L + + F ++F Sbjct: 179 TLPIISAALSCFTTAGLLGTARERRYFVFFSLIVTMVVVGLFTQAKGSAVLIFLTIIFAS 238 Query: 182 FIFAIILNTGKQIV--FMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPG-- 237 + + +T + + +++ + + G+ R ++ + + + +P Sbjct: 239 KKYWRLKSTSRIVFVSLILMFAMFQVYGMIRQGYFHSRVYVSSYEYYRQHKDIADIPRYL 298 Query: 238 GMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFW---FFERLMGLLTGGVSMSLHKEFV 294 + Y MY+ +P+ F + W +L T + +++ Sbjct: 299 QIFYKPYMYMETPLSNFAYLVENNFTPENGKLTAWPFISIFQLKRFYTIEKPVKPIRKW- 357 Query: 295 WVGLPTNVYTAFSDYVYISAELSYLM-MVIHGCISGVLWRLS--RNYISVKIFYSYFIYT 351 P N +T D+ + L+ + G I ++ + I + Y YF Y Sbjct: 358 ----PYNTHTFLGDFYLDFGVMGILILPFLLGLIVSAMYSRTLVNRDIIMDAVYLYFSYA 413 Query: 352 FSFIFYHESFMTN 364 +F+ F ++ Sbjct: 414 TFMMFFSNHFTSS 426 >UniRef50_Q5FID5 Polysaccharide polymerase n=3 Tax=Lactobacillus acidophilus RepID=Q5FID5_LACAC Length = 431 Score = 42.0 bits (97), Expect = 0.039, Method: Composition-based stats. 
Identities = 61/426 (14%), Positives = 134/426 (31%), Gaps = 40/426 (9%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 MI LVI++ ++ +++ PA ++ F + L I + + LN T + + Sbjct: 1 MILLVITLLGLSITSYYLNNRNLVSPAFLLSTTFFICSLVALINQNKWQLILNRKTYLVI 60 Query: 61 LCNVLTFTLSCLLTESVL--------DLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYI 112 +L F + L +L + K+N S + L++ +I Sbjct: 61 CGAILEFIIVTYLVNKLLSVVKFQYKNSKKSKLNAPYISTRKSYILFAIQLLLIIYVIRN 120 Query: 113 CMRLSNYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFI------WSKKFT 166 ++ S +N + S + A + + Sbjct: 121 LKEVTKINNIFQAASALNQSSLPNYIGQPIALSKIANIFLAFILASGLYTGYVFFLYIIV 180 Query: 167 NTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVI-------------ISYAFIVGVNRVKH 213 + +FI + + + ++M+I F + V Sbjct: 181 KKNFRFDLFINMFISILAPFVTGSRGNSIYMIISWVIYCYLILWKNNKLNFKMQFKFVMR 240 Query: 214 YVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQV----SNSASSH 269 ++ + +L L + YLS+Y+ + I EF ++ Sbjct: 241 ITLVLIILLLLLPLTAVLFGRRMDNWDEYLSIYIGAQIKNLNEFILNNNFPLQTSIFGQQ 300 Query: 270 VFWFFERLMGLL-TGGVS-MSLHKEFVWVGLP--TNVYTAFSDYVYISAELSY-----LM 320 F+ L+ L + L + +G NVYT F ++Y +M Sbjct: 301 TFFTIIPLVSKLIGLNIPSYKLDLPYQAIGSLSLGNVYTTFYPWLYDFGYKGVFLLTLIM 360 Query: 321 MVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQ 380 ++ I + + + Y Y + +F+ F ++S + + + + Sbjct: 361 AIVVEFIYHLALHSKLQFGLSILLYGYLGSFVALLFFSNKFYEGLNSTLILIIMSWILLI 420 Query: 381 FLKAQK 386 ++ QK Sbjct: 421 YIFKQK 426 >UniRef50_A5LJS7 Polysaccharide polymerase n=14 Tax=Streptococcus pneumoniae RepID=A5LJS7_STRPN Length = 447 Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats. 
Identities = 68/412 (16%), Positives = 138/412 (33%), Gaps = 48/412 (11%) Query: 22 DIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLN 81 D F PAV + I + + + + +++ +L+ T +L V TF + LLT+ Sbjct: 24 DFFQPAVILTIAYFISIASALVNRNVWGTELHFKTFYLILLGVATFVIVSLLTKLSYRPK 83 Query: 82 IRKVNNAIY-SIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDAD---- 136 + +++ I K+ V LL ++ M+++ +R S S+ N+ Sbjct: 84 VEGISHEELKEINPSKIIYVILLTLNLVMLFLYIREIQKVVLFSGRSFSNITDLISNYRY 143 Query: 137 -------VEDTSRNFSAYMQPIILTTFALFIWS----KKFTNTKVSKTFTLLVFIVFIFA 185 VE+ + II T + ++ T L+ +F Sbjct: 144 LSYYSNEVENRVSGMINQLSKIIPATTLISLYIFMNNYFITKQIKKNFIYLIPIAIFFVY 203 Query: 186 IILNTGKQIVFMVIISYAFIVGVNR----------------VKHYVYLITAVGVLFSLYM 229 I++ G+ + +++ I+ + + + + + F L Sbjct: 204 AIISGGRLPLIRLVVGSLLILYIYSVYGSPKSQLTKSFKMITRSLFTFLILIVLFFLLKF 263 Query: 230 LFLRGLPGGMAYYLSMYLVSPIIAFQEFY--FQQVSNSASSHVFWFFERLMGLLTGGVSM 287 + R Y++ Y+ I F F + + + F ++ L ++ Sbjct: 264 VLGRSSQEDFISYITRYMGGSIQLFDLFVIDPIRRNKELGAETFSGIYEMLAKLGFDNNI 323 Query: 288 SLHKEFVWVG---LPTNVYTAFSDYVYISAELSYLMMVIHGCISGVL-WRLSRNYISVK- 342 E+ NVYTA Y + ++ L + R+Y V Sbjct: 324 IKGLEWRVSPNYYSLGNVYTAIRRYYSDFGVIGIVICQSFTAWLYTLGYEKVRHYSLVTN 383 Query: 343 ------IFYSYFIYTFSFIFYHESFMTNISS---WIQITLCIIVFSQFLKAQ 385 I + Y + F ++ + IQI + +VF LK Q Sbjct: 384 VQRFRLILLAASFYPIFLNGIEDVFYISMVTIGYGIQIVIFYLVFWVLLKVQ 435 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriav... 234 5e-60 UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli Re... 208 3e-52 UniRef50_D1PA58 Putative chain length determinant protein n=1 Ta... 207 8e-52 UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane prot... 197 5e-49 UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralston... 195 3e-48 UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepI... 
  195   3e-48
UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechlor...   180   9e-44
UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification prote...   177   8e-43
UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevote...   166   1e-39
UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinoco...   159   2e-37
UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumon...   159   2e-37
UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdema...   153   1e-35
UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, puta...   132   3e-29
UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae ...   126   1e-27

Sequences not found previously or not previously below threshold:

UniRef50_Q5FID5 Polysaccharide polymerase n=3 Tax=Lactobacillus ...    52   3e-05
UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudom...    52   5e-05
UniRef50_UPI0001C37E26 polysaccharide polymerase n=1 Tax=Ruminoc...    51   8e-05
UniRef50_B3XL31 Putative uncharacterized protein n=1 Tax=Lactoba...    47   0.001
UniRef50_A5LJS7 Polysaccharide polymerase n=14 Tax=Streptococcus...    46   0.002
UniRef50_B8F9T4 Putative uncharacterized protein n=1 Tax=Desulfa...    45   0.003
UniRef50_Q2VJ26 O-antigen polymerase n=1 Tax=Escherichia coli Re...    43   0.023
UniRef50_C4FH14 Putative uncharacterized protein n=1 Tax=Bifidob...    40   0.099

>UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LL33_RALME
          Length = 456

 Score = 234 bits (596), Expect = 5e-60, Method: Composition-based stats.
Identities = 67/404 (16%), Positives = 154/404 (38%), Gaps = 21/404 (5%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 M++ + + L+ I + + P +++ V +G + D + D + Sbjct: 47 MLFACVLMALVGLLISVKIGPGRGNPFTAFFCVWSAVTVGAFLAEDTFLTISEDFWA--M 104 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 L L+ L C L S+ + ++ K ++G +V ++ I ++ S Sbjct: 105 LGAFLSIYLGCALYSSLRARVLPVESHISTISYRDKFVDIGQIVSVIALPIIYLKASEIA 164 Query: 121 FGTSLLSYMNL--IRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLV 178 S+ S +R A + + + ++ + ++ I +F + + +K + Sbjct: 165 -EMSVFSPAGYMALRAAFIAEVAASYGPIGYIVPVSFVIASIRLTQFLDKQTNKKNLAIA 223 Query: 179 FIVFIFAIILNTGKQIVFMVIISY-AFIVGVNRVKHYVYLITAVGVLFSLYMLFL----- 232 V + L TG+ M+ ++ ++K +I V + S M+ L Sbjct: 224 IAVALCMAYLTTGRTFFMMLFCFLLFPLIFRGKIKLGGVVIAGVVLAASFVMIALLTQRG 283 Query: 233 -------RGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV 285 G+ L +Y +SPI A + S + + F FF L ++ + Sbjct: 284 VSATASVDDNVDGIIKTLRVYFLSPIFAMGSVFDGNGSATYGDYTFRFFYMLANVIGLNI 343 Query: 286 SMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRN-YISVK 342 + L +++V + TNV+T Y + + G L+ +++ Sbjct: 344 EIPPLIRDYVLIPDMTNVFTVMDPYYRDFGVGGVLIFAALSGLAHDALYEKAKSEGGPYI 403 Query: 343 IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQK 386 ++ F++ F+ + +M+ +S+WIQ+ ++F + +K Sbjct: 404 FIHAAFMFPLVMQFFQDMYMSLLSTWIQVVFWYMLFVRVNSGKK 447 >UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli RepID=RFC_ECOLI Length = 388 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. 
 Identities = 388/388 (100%), Positives = 388/388 (100%)

Query: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60
           MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL
Sbjct: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60

Query: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120
           LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ
Sbjct: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120

Query: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180
           FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI
Sbjct: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180

Query: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240
           VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA
Sbjct: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240

Query: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300
           YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT
Sbjct: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300

Query: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360
           NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES
Sbjct: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360

Query: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388
           FMTNISSWIQITLCIIVFSQFLKAQKIK
Sbjct: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388

>UniRef50_D1PA58 Putative chain length determinant protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA58_9BACT
          Length = 683

 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats.
Identities = 78/374 (20%), Positives = 145/374 (38%), Gaps = 22/374 (5%) Query: 22 DIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLN 81 DIF P +I++++ + +SDI D + + FT + + T ++ N Sbjct: 298 DIFAPWTLSLLIWSILGIIIAFSSDIIDPI-QDVFYTNISVWLAIFTATSISTYLLMPAN 356 Query: 82 IR-KVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDT 140 K I + + L + + +Y+ ++ MN +R+ V Sbjct: 357 QDIKTGVKGIKINITIFNILFFLSMIMTPLYM-YQIYKIVTMFDSKDLMNNMRELAVNGN 415 Query: 141 SRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVII 200 F Y I +W + K + V + I N K F+V I Sbjct: 416 GHGFLNYTMVINQALLLTGLW----RFPYLPKWKIICVVGCCLAYAIANMEKLTFFLVFI 471 Query: 201 SYAFIVGVNR-VKHYVYLITAVGVLFSLYMLFLRGLPGG--------MAYYLSMYLVSPI 251 + F++ R +K +I + ++ Y+ L + +L MYL+SP Sbjct: 472 TIFFVLFERRIIKLRTIVICGILLIIGFYIFNLSRSDSDSDYQKNTSILDFLGMYLMSPP 531 Query: 252 IAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKE---FVWVGLPTNVYTAFSD 308 +AF + +S+S S W L G S+ H++ FV+V +PTNVYT Sbjct: 532 VAFGHLR-RTISDSFCSESLWTIYAYTNRLMGSGSIIQHEDFGEFVYVPMPTNVYTIMKP 590 Query: 309 YVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIFYHESFMTNIS 366 + + I+G ++G ++R +RN + I Y+Y ++ + F+ E I Sbjct: 591 FYQDLGTIGVAFYAFIYGLVTGFIYRKARNGNAFGICMYTYLVFVLTMQFFDELIFVAIP 650 Query: 367 SWIQITLCIIVFSQ 380 ++Q + + Q Sbjct: 651 QFLQRMFLVYIICQ 664 >UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane protein n=1 Tax=Ralstonia solanacearum RepID=Q8XQM0_RALSO Length = 390 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. 
Identities = 48/379 (12%), Positives = 116/379 (30%), Gaps = 15/379 (3%) Query: 18 YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESV 77 ++ I P + +++ ++G+ + + + + +T + Sbjct: 6 IMRDMIVSPPIIFLAAWSMQVIGHTLLQRDFDKFSDHTWWLLAAAAFSFILGCAFVTFTY 65 Query: 78 LDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDADV 137 + + K LL I+ + N +S + Sbjct: 66 IGRRRPNRGITPSKLRGGKRAFWILLTA--YGIFGLAPIINILLDQGSISGARDAIVKGI 123 Query: 138 EDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFM 197 E + + +F + S F ++ F+ A I ++G+ ++ + Sbjct: 124 ESRNDSIVRSYYF--SALLVVFAVYLVSQASHYSPGFLVIGFVAATVAAISSSGRTLLLL 181 Query: 198 VIISYAFIVGV-----NRVKHYVYLITAVGVLFSLYM----LFLRGLPGGMAYYLSMYLV 248 + S + + + L+ L + F+ L + + L +Y++ Sbjct: 182 LFTSTPVSLYLQNKIRKKTFFASLLVFLCFFLALAVLNGKGAFINDLYSQITWNLEVYVL 241 Query: 249 SPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPTNVYTAFSD 308 + + +F F + + + R + L G + L FV P NVYTA Sbjct: 242 NGLASFNHFVTNNHPSFDGNILVPNLLRRIFELEGDA-IPLVLPFVETPFPGNVYTALYP 300 Query: 309 YVYISAELSYLMM-VIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISS 367 + + L ++ + G S + +YS Y + + ++ Sbjct: 301 WYHDGGALGLMVGFFLIGAFSQYFYHARHKSFKHTFYYSISAYALIMTIFQDQYIQAYPL 360 Query: 368 WIQITLCIIVFSQFLKAQK 386 W+ L + S + Sbjct: 361 WMMAILSPFLASALTPKMR 379 >UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Q91_RALEJ Length = 410 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. 
Identities = 65/395 (16%), Positives = 146/395 (36%), Gaps = 18/395 (4%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 M L++ V P ++ +LLGY ++ D + + L+ L Sbjct: 1 MFELLLLVTFAGLLTASRRPLGFGNPFQIYFFVWFSLLLGYYLSRDSFISMSVEFVLLIL 60 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 +L + L + + + + ++ ++ LV ++ + R + Sbjct: 61 TAKLLALLIMILCCRGLQGRGPEIRRHGVIARKTR-FIDLAQLVAIIALPLVYARATEIA 119 Query: 121 FGTSLLSYMNLIRD-ADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVF 179 G S+ + + I+ A + + + +L+ + + + + L Sbjct: 120 GGESVFTVLGYIQLRAAMTEDGEGYGILAYLWVLSFVTTSVSIFLYRQSNLGFGRLWLSV 179 Query: 180 IVFIFAIILNTGKQ-IVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLF------- 231 V + L+TG+ I+F + ++ ++ V ++ LIT + + + Sbjct: 180 AVSLCYCYLSTGRTYILFFLCLALVPLMNVGAIRMRGLLITLIIFIALSVFVAGMTAKGI 239 Query: 232 --LRGLPGGMAYYLSM---YLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGG-- 284 G+ + +L Y ++P++AF S + F + L Sbjct: 240 SADDGIVENVESFLESMKGYTIAPLLAFSRLVEWNPDLSWGENTFRLLISIQYALGISTL 299 Query: 285 VSMSLHKEFVWVGLPTNVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVK-I 343 ++L K + +V PTNVYT + Y + L L+ + + L+R +R V Sbjct: 300 APVALMKGYAFVPDPTNVYTVYEVYFRDFSYLGVLIPPVFLIVHYWLYRRARARGGVWIF 359 Query: 344 FYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVF 378 +YS +Y F+ + + + +S+WIQ+ + Sbjct: 360 YYSASVYPLVMQFFQDQYFSLLSTWIQVWFWYWLL 394 >UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepID=Q9AYY5_BPHK6 Length = 408 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. 
Identities = 66/393 (16%), Positives = 139/393 (35%), Gaps = 16/393 (4%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 +I + + + Y++ I P + + I+ D + + L Sbjct: 4 LIIIFVLCSAMHILAIKYMRCRITSPLSLSLFSWYFMAFTGIISYDSFYDFRVETFYCLL 63 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 + + + S + E K Y I K +++ I Sbjct: 64 IW-ISLTSFSYAVFEIAQRNKPLK-----YKIKEKNRICSRYSLLAIPACLITAYEIYKV 117 Query: 121 FGTSLLSYMNLIRDADVEDTSRN----FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTL 176 ++++ +R A++ED + F P+I+ FA K K+S + Sbjct: 118 GSNGPVNFLLNLRLANIEDDYQYPTFIFMPSFYPVIMAMFASICIFKSRWIDKISVCTWV 177 Query: 177 LVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLP 236 L+F + ++F+ I + + ++ Y ++ A ++ Y + Sbjct: 178 LLFAIGTMGKFAVITPIMMFVTIYELKNGISMKKIFIYAPVVLACIIIMHFYRMSDDD-S 236 Query: 237 GGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV--SMSLHKEFV 294 +AY Y+ SP+IAF N + + F + + + ++V Sbjct: 237 ATIAYIFGTYIYSPLIAFGTLID-SGINWSGDYTLRFINAINYKIGISSVEPVKTILDYV 295 Query: 295 WVGLPTNVYTAFSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTF 352 +V PTNVY+ + VI+G + ++ ++ V + YS + Sbjct: 296 YVPSPTNVYSVMQPFYSDMGIYGVAFGAVIYGVLLSAIYSSAKAGNLVMLGLYSVLSVSL 355 Query: 353 SFIFYHESFMTNISSWIQITLCIIVFSQFLKAQ 385 F E+ +TN+S I++ LC+ V +F + Sbjct: 356 ITQFMSETIITNLSGNIKLLLCMFVVFRFFTKK 388 >UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47GK7_DECAR Length = 400 Score = 180 bits (456), Expect = 9e-44, Method: Composition-based stats. 
Identities = 72/395 (18%), Positives = 149/395 (37%), Gaps = 27/395 (6%) Query: 2 IYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLL 61 + +V+S + + Y K + PA +I+ V L Y+I D DA + L Sbjct: 1 MMIVVSFLCVAIVLLQYRAKKLMAPASINAVIWLAVSLTYQIQGDSLDVLRWDAIAVVL- 59 Query: 62 CNVLTFTLSCLLTESV---LDLNIRKVNNAIYSIPSKKVHNVGLLVISFSM-IYICMRLS 117 + TF + +E + L R + +P+ +G+L I + + + Sbjct: 60 VGIATFGFGAVCSERIHFALPQPSRSGGAPSFFLPAL----LGVLTIGLAGNLGRSLEYV 115 Query: 118 NYQFGTSLL----SYMNLIRDADVEDTSRNFS--AYMQPIILTTFALFIWSKKFTNTKVS 171 ++ G SL S+ +R+ + D +F +Y P+ A + K K + Sbjct: 116 HFVDGMSLFGGQNSWYGSLRNTLIADHHGSFGIWSYFLPLSYAAVAYLLCDK----EKSA 171 Query: 172 KTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVG----VNRVKHYVYLITAVGVLFSL 227 + + ++ +V + L TG+ + + + +V +N+ + L+ ++F + Sbjct: 172 RHYAIITSLVTFGYVFLATGRTFILLFVTILFVVVAYRGWLNKTRSIALLVPLFLIVFWV 231 Query: 228 YMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVS-NSASSHVFWFFERLMGLLTGGVS 286 + + GG Y MY V+P+ F + + F ++ L V Sbjct: 232 SPIIAGRVSGGGFNYFMMYFVAPLANFDWGMHGVFACCTHGETTFRTIFAVLAKLGFNVP 291 Query: 287 MS-LHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISV-KI 343 + L + + L NVYT F Y L + G + + R + + Sbjct: 292 VVELIQPWAGTKLSGNVYTVFMPYYRDFGLAGVALFLFFFGALHTWISRQASLDNPLAVF 351 Query: 344 FYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVF 378 + F Y F+ + + + +S W+Q+ + + Sbjct: 352 LNAIFFYALVMQFFQDQYFSLLSQWVQMIFWMSLL 386 >UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification protein n=2 Tax=Enterobacteriaceae RepID=Q1RA39_ECOUT Length = 396 Score = 177 bits (448), Expect = 8e-43, Method: Composition-based stats. 
Identities = 60/382 (15%), Positives = 139/382 (36%), Gaps = 12/382 (3%) Query: 9 FLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFT 68 + F L K I P + + L L+ D + + L+ + F Sbjct: 12 VIAIMFSLLGTKSRITSPLPLHFLPWLLTLIVGISNYDQFYEFNERSFYSLLIWFTVIFI 71 Query: 69 LSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSY 128 + +++ +N K +++ +Y + G + + Sbjct: 72 FYFI--GELVNYKRENINVYYGLSHIKYECKKYWIIVIPISLYTIFEIYMVGMGGADGFF 129 Query: 129 MNLIRDADVEDTSRN---FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFA 185 +NL +E + + P+++ FA+ +K K S F + ++ + Sbjct: 130 LNLRLANTLEGYTGKKFILMPAVYPLMMAMFAIVCLTKTSKLNKYSIYFWMFLYCIGTMG 189 Query: 186 IILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYYLSM 245 + +++I + + V + + LI + + + L + Sbjct: 190 KFSILTPILTYLIIYDFKHRLKVKKTIKFTLLIIILALTLHFTRMAENDH-STFLSILGL 248 Query: 246 YLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLT--GGVSMSLHKEFVWVGLPTNVY 303 Y+ SPIIA + + S+ + F F + + + ++ ++ +V +PTNVY Sbjct: 249 YIYSPIIALGQL-NEVNSSHFGEYTFRFIYAITNKIGLIKELPVNTILDYSYVPVPTNVY 307 Query: 304 TAFSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKIFYSYFIYTFS--FIFYHES 360 TA + + V++G I L+ + + Y +++ S F+ E+ Sbjct: 308 TALQPFYQDFGYTGIIFGAVLYGLIYVSLYTAGVRGNNTQALLIYALFSVSSATAFFAET 367 Query: 361 FMTNISSWIQITLCIIVFSQFL 382 +TN++ +++ LC I+ +F Sbjct: 368 LVTNLAGNVKLVLCTILLWRFT 389 >UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJ75_9BACT Length = 401 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. 
Identities = 67/396 (16%), Positives = 153/396 (38%), Gaps = 21/396 (5%) Query: 3 YLVISVFLITAFICL--YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 L I+ ++ AF + Y+ +D+F P V ++ +LL + ++ ++D + + Sbjct: 1 MLFIAFLIVAAFTFVGWYITRDVFSPFVLQPGVWFGILLLFYLSDPELYPIIHDFPISLI 60 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNA---IYSIPSKKVHNVGLLVISFSMIYICMRLS 117 + + ++ + D ++ PS+ V + L + S+ I L Sbjct: 61 VWTISFLGVAYPTYYYLPDHSLISRRPPLLASVLTPSRLVLKLYLFIAIISVPLILYTLM 120 Query: 118 NYQFGTSLLSYMNLIRDADVED--TSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFT 175 Y + M +R A +D + I + + K Sbjct: 121 RYGMERGESNLMTYLRIASYDDTLDKPDLGVAYYTIGVALIVFIFIFVYSS----KKWLK 176 Query: 176 LLVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYV--YLITAVGVLFSLYMLFLR 233 +L ++ + A +++ K F+ ++ +++ + +I + + FS++ + R Sbjct: 177 ILAVVINVLAALISMSKTGFFVFLVPMVYVLYLRGKIKLRTIGIILLIFIGFSIWFQYAR 236 Query: 234 GLPGGMAYY-----LSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV-SM 287 + + L++Y++SP +AF + + +VF F+ +M L + + Sbjct: 237 SMASQQDSFSATSMLTIYIMSPCVAFDYYVEPASATHFGEYVFRFYYAIMHSLGSNIEPV 296 Query: 288 SLHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNY-ISVKIFY 345 S +FV V TN YT + + L ++G L++ +++ I Y Sbjct: 297 SNVLKFVGVPEETNTYTILYPFYHDFGLPGVGLFGGLYGAFYAFLYKRAQSGQNVYFILY 356 Query: 346 SYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQF 381 + F+ F E+ ++N S +Q + I+ F Sbjct: 357 ACFLNYLILQFVQENILSNFSLNLQYVILILFPYIF 392 >UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1CYL3_DEIDV Length = 427 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. 
Identities = 71/392 (18%), Positives = 141/392 (35%), Gaps = 29/392 (7%) Query: 17 LYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTES 76 + +D+ YPAV I++ + L+ Y + Y L+++ ++ + VL F+ LT S Sbjct: 19 FFFSRDVRYPAVLQIIVWLVTLVVYVVERHRY-VSLSESVMLIIFLGVLGFSAGSFLTLS 77 Query: 77 VLDLNIRKVNNAIYS-------IPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYM 129 L + + + LL I+ + +YI Q G + + Sbjct: 78 SLGGRSLGSRKTVLLRMRHLSDVRLVLAFFLTLLAIAGAAVYIQTATQFAQTGPTQDLAL 137 Query: 130 NLIRDADVEDTSRNFS---AYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAI 186 NL V +Y PI+ T + + K L + Sbjct: 138 NLRYLTSVRGEVPPLMRLTSYALPILNTLSGFCLIYYRANRDKRVLPILFLAVASALVMS 197 Query: 187 ILNTGKQIVFMVIISYAFIVGV-NRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA----- 240 + +TG+ I+ II I + ++ ++ + + S++ + L G+ Sbjct: 198 VFSTGRGIILFFIIEMGIIYAMTSKRLRLRLVLLGLLMFLSIFYIGASVLGKGVDQNASL 257 Query: 241 --------YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVS-MSLHK 291 +S+YL+S I+A V++ + F L L + + L + Sbjct: 258 LESFPDLFSSISLYLLSGILALSVQLPTLVTDEGGVNTFRTIHALGRALGFDATVVPLVQ 317 Query: 292 EFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISVKIFY--SYF 348 F + PTNVYT + Y+ L + + G + L+ R + + Sbjct: 318 AFTNIPQPTNVYTIYLTYLKDFGWLGIFIFQFLFGILHATLFIAFRRTGGAVALFWLAIL 377 Query: 349 IYTFSFIFYHESFMTNISSWIQITLCIIVFSQ 380 + + + + + +S+WIQ +FS Sbjct: 378 SFPLLTQPFTDGYFSLMSTWIQYAFFSSLFSL 409 >UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LCF1_TOLAT Length = 407 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. 
Identities = 70/390 (17%), Positives = 140/390 (35%), Gaps = 19/390 (4%) Query: 8 VFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTF 67 + LI K I P +++ V L +T D + N+ ++ + ++ Sbjct: 5 LILINLIGYFINMKFIANPFKIFFMLWFFVFLTLYLTIDDWVEISNEFIVVNITTSLSVL 64 Query: 68 TLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLS 127 LS +L L+ + NA+ + + +I F +++ + ++ S Sbjct: 65 LLSYILKGKTECLHSWRDENALNELFINRYLCFVFQIICFIGLFLAYYRVSLLIPDNIFS 124 Query: 128 YMNLIRD-ADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAI 186 + + + D + ++ L+ + +V +L I + Sbjct: 125 PVGYTKLRMSIGDDTVDYGILAYFFTLSFIVTSLTIILRVRGEVGSIRLILSIISSLSYC 184 Query: 187 ILNTGKQIVFMVIISYA---FIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYY- 242 L+TG+ M F++GV R +H V + + +LF L + Sbjct: 185 YLSTGRTFFLMFFCFSFAPLFVLGVVRRQHIVLISIVIVLLFILVAYLTGKGISEEQSFI 244 Query: 243 ---------LSMYLVSPIIAFQEFYFQQVSNS--ASSHVFWFFERLMGLLTG--GVSMSL 289 L Y ++P++A Q ++S + F F + L G M L Sbjct: 245 ENINSFLENLRSYTIAPVVAMNMLIEQISNSSIFYGEYSFRFIFVFLSKLGVYDGSVMPL 304 Query: 290 HKEFVWVGLPTNVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSR-NYISVKIFYSYF 348 K+FV PTNVYT + YV + + VL+ YS Sbjct: 305 IKDFVDTPYPTNVYTVYDIYVRDFGYFGFASVFFIMFFHFVLYEKCMLKGGFYVFLYSAS 364 Query: 349 IYTFSFIFYHESFMTNISSWIQITLCIIVF 378 +Y F+ + +++ +S+WIQ+++ + Sbjct: 365 LYPLIMQFFQDQYLSLLSTWIQVSVIYYIL 394 >UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y8A4_9FIRM Length = 441 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. 
Identities = 77/435 (17%), Positives = 144/435 (33%), Gaps = 49/435 (11%) Query: 3 YLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLC 62 YL IS+ +I K+DI +P + F L + + + +L T ++ Sbjct: 5 YLAISLIIILVLCFFITKQDIIHPTIAFIAPFTLAAIDLLYNINKWNVELKMNTYYVIIG 64 Query: 63 NVLTFTLSCLL------TESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICM-- 114 L F L+ + NI+ ++ IY +++ + L++ IC+ Sbjct: 65 GTLVFILATFIIDIGYKQLRKKCQNIKYKSDTIYKYNISQINKLLFLMLQIFTFLICLIG 124 Query: 115 ----RLSNYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFAL-FIWSKKFTNTK 169 G+ ++ + +A F+W Sbjct: 125 VIKVARRFGVSGSISELIAGYKNLKTFTTEDVGLGKFVNTLYDFCYASGFVWFYLVAKKY 184 Query: 170 VSKTFTLLVFIVFIFAIILNTGK-----------------QIVFMVIISYAFIVGVNRVK 212 + K + ++ + I + + I+F V+ S + ++ Sbjct: 185 IFKKKWDKLVLINLCLSIAISLEKGSRGGAIALLCSGAVMIIIFWVMHSKKHKISFKQIL 244 Query: 213 HYVY-LITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVF 271 + VG+ ++ L R YLS+YL +PI F +++ Sbjct: 245 LIILAASLVVGMFQTIGELLGRVSAADFGGYLSVYLSAPIRNLDYFLNNSFASADMFGKM 304 Query: 272 WFFERLMGLLTGGVSMSLHKEFVWVGL------PTNVYTAFSDYVYISAELSY-LMMVIH 324 F+ + L S E V L NVYT F Y+Y + + M + Sbjct: 305 TFYHAINYLGGKLEISSWIYELVLPPLRANGFVTGNVYTTFYAYIYDFGYVGVPIFMFLM 364 Query: 325 GCISGVLWRLSRNYISV---------KIFYSYFIYTFSFIFYHESFMTNI--SSWIQITL 373 G IS + + ++N I YSY Y +F F+ F I + + + Sbjct: 365 GVISQLFYIKTKNNTKYLQRDRINIWIIIYSYIFYMLAFSFFSNKFYEGIFSIQFFKYLI 424 Query: 374 CIIVFSQFLKAQKIK 388 + FL+ KIK Sbjct: 425 YWSLIKLFLENVKIK 439 >UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, putative n=2 Tax=Streptococcus RepID=A3CQX5_STRSV Length = 441 Score = 132 bits (331), Expect = 3e-29, Method: Composition-based stats. 
Identities = 53/406 (13%), Positives = 118/406 (29%), Gaps = 41/406 (10%) Query: 15 ICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLT 74 + +D PA I+ + + S + L+ T+ + F + LL+ Sbjct: 16 TIKTVGRDYANPAFIYLAIWMIASIFTAFYSSKWGEGLSLITVTVIFVGNAIFLMGVLLS 75 Query: 75 ESVLDLNIRKVNNAIYSIP----SKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMN 130 ++ + + + + V + + + Q L + Sbjct: 76 SNLFAERKLDAKPSQIKVSNFFIILVLLFLAYAVRFIYSDLLYLAAQSKQVPGGLFKTIE 135 Query: 131 LIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVS-------KTFTLLVFIVFI 183 L R + + + ++ F + S K LLV ++ + Sbjct: 136 LARHMTTNYDFSLSRLSLNLLRVNFSLGIVFFYFFCESLFSGKDGIFYKGKLLLVSLISL 195 Query: 184 FAIILNTGKQIVFMVIISYAFIVGV-------------NRVKHYVYLITAVGVLFSLYML 230 +L+TG+ + ++ YA + + LI L + Sbjct: 196 GVSLLSTGRTELLGLVAGYAIVYMLFFSKYYSWRDSRYGVKLFRSLLIIGFVFLVLFMAV 255 Query: 231 F-------LRGLPGGMAYYLSMYLVSPIIAFQEFYFQ----QVSNSASSHVFWFFERLMG 279 G+ L Y+ SPI A + + + + Sbjct: 256 GTFVLNRVDSRAEFGILDNLIKYMGSPIQALDYYLKNPTLYSDNQVFGENTLIAIYGTLK 315 Query: 280 LLTGGV-SMSLHKEFVWVGLP-TNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSR 336 L ++ V TNVYT + ++ S + + +G G + + Sbjct: 316 SLGLSSHELTPFLPVVHFNDDKTNVYTIYYYFIKDFGYFSVLIFQLSYGFFYGSFYYSIK 375 Query: 337 NYISV---KIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFS 379 I ++ F Y F+ E+ ++ +++ I + V Sbjct: 376 KRYFTPLKVIVFALFAYPLVISFFQETLLSLLTTHINRIIYAFVIY 421 >UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae RepID=RFC_SHIDY Length = 380 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. 
Identities = 36/264 (13%), Positives = 95/264 (35%), Gaps = 10/264 (3%) Query: 123 TSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVF 182 +L Y N + + + S + ++ F + F + F Sbjct: 110 GIILLYNNHFSLKVMREGILDGSISGFGLGISLPLSFCCMYLARHENKKNYFYCFTLLSF 169 Query: 183 IFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFS-------LYMLFLRGL 235 + A++ + ++ ++ V++ K +Y + G+ + Sbjct: 170 LLAVLSTSKIFLILFLVYIVGINSYVSKKKLLIYGVFVFGLFALSSIILGKFSSDPEGKI 229 Query: 236 PGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVW 295 + L +YL S + AF + + + ++ + + + T + + ++ Sbjct: 230 ISAIFDTLRVYLFSGLAAFNLYVEKN--ATLPENLLLYPFKEVWGTTKDIPKTDILPWIN 287 Query: 296 VGL-PTNVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSF 354 +G+ TNVYTAF+ + + +++ I +W R ++V + ++ + Sbjct: 288 IGVWDTNVYTAFAPWYQSLGLYAAIIIGILLGFYYGIWFSFRQNLAVGFYQTFLCFPLLM 347 Query: 355 IFYHESFMTNISSWIQITLCIIVF 378 +F+ E ++ + LC I+ Sbjct: 348 LFFQEHYLLSWKMHFIYFLCAILL 371 >UniRef50_Q5FID5 Polysaccharide polymerase n=3 Tax=Lactobacillus acidophilus RepID=Q5FID5_LACAC Length = 431 Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats. 
Identities = 59/426 (13%), Positives = 131/426 (30%), Gaps = 40/426 (9%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 MI LVI++ ++ +++ PA ++ F + L I + + LN T + + Sbjct: 1 MILLVITLLGLSITSYYLNNRNLVSPAFLLSTTFFICSLVALINQNKWQLILNRKTYLVI 60 Query: 61 LCNVLTFT--------LSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYI 112 +L F L ++ + K+N S + L++ +I Sbjct: 61 CGAILEFIIVTYLVNKLLSVVKFQYKNSKKSKLNAPYISTRKSYILFAIQLLLIIYVIRN 120 Query: 113 CMRLSNYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFAL-----FIWSKKFTN 167 ++ S +N + S + A +++ Sbjct: 121 LKEVTKINNIFQAASALNQSSLPNYIGQPIALSKIANIFLAFILASGLYTGYVFFLYIIV 180 Query: 168 TKVSKTFTLLVFIVFIFAIILNTGKQ--------------IVFMVIISYAFIVGVNRVKH 213 K + + + I A + + ++ F + V Sbjct: 181 KKNFRFDLFINMFISILAPFVTGSRGNSIYMIISWVIYCYLILWKNNKLNFKMQFKFVMR 240 Query: 214 YVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQV----SNSASSH 269 ++ + +L L + YLS+Y+ + I EF ++ Sbjct: 241 ITLVLIILLLLLPLTAVLFGRRMDNWDEYLSIYIGAQIKNLNEFILNNNFPLQTSIFGQQ 300 Query: 270 VFWFFERLMGLLTGGVSMSLHKEF----VWVGLPTNVYTAFSDYVYISAELSY-----LM 320 F+ L+ L G S + + NVYT F ++Y +M Sbjct: 301 TFFTIIPLVSKLIGLNIPSYKLDLPYQAIGSLSLGNVYTTFYPWLYDFGYKGVFLLTLIM 360 Query: 321 MVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQ 380 ++ I + + + Y Y + +F+ F ++S + + + + Sbjct: 361 AIVVEFIYHLALHSKLQFGLSILLYGYLGSFVALLFFSNKFYEGLNSTLILIIMSWILLI 420 Query: 381 FLKAQK 386 ++ QK Sbjct: 421 YIFKQK 426 >UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudomonas aeruginosa LESB58 RepID=B7V2X0_PSEA8 Length = 417 Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats. 
Identities = 56/405 (13%), Positives = 138/405 (34%), Gaps = 20/405 (4%) Query: 3 YLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDI-YAFQLNDATLIFLL 61 L + LI A L + +P+V + I + L L+ + S I + +DA LIFL Sbjct: 4 MLTGATLLIFAVAARLLARSAIHPSVAMPITWGLGLIAVSLASLIGFYRVESDALLIFLF 63 Query: 62 CNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQF 121 + + + +N ++ + V ++ + + Sbjct: 64 GVMSFSLSAGCFSFLYNGYFRAPSSNFLFDSELRTRALVIFFCLAHIVFLTVIYRDLSSI 123 Query: 122 GTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIV 181 +L + R V S M + L + + + K L + + Sbjct: 124 APTLREAAYMARAQSVSGEPVLSSLSMNYLQLGQTVIPLVVLLYLRGKCGVLGFLAISVP 183 Query: 182 FIFAIILNTGKQIVFMVIISYAFIVGVNR----VKHYVYLITAVGVLFSLYMLFLRGLPG 237 ++ I+L +G+ + +++ FI + + +K + + A+ ++ ++ + + Sbjct: 184 WMGVILLASGRASLMQMLVGLFFIYILVKGSPSLKSLLVIGLAMFLVIAVGAVATSKIQF 243 Query: 238 GMAYYLSM-----------YLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVS 286 +S Y + + F +Y + F ++ + Sbjct: 244 HEGDGISTLFIELYRHVAGYALQGPVLFDRYYQGSIHLEPYWSPLNGFCSILATVGLCQK 303 Query: 287 MSLHKEFVW--VGLPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRNYISVK- 342 LH +F G NVY+ F L + +M ++G + + ++ Sbjct: 304 PPLHLDFYEYAPGELGNVYSMFFSMYPHYGALGVIGVMALYGMLCSYAYCKAKKGSLYFT 363 Query: 343 IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQKI 387 + SY F + + T+ ++++T+ + + + ++ Sbjct: 364 VLSSYLFSAIVFSLFSDQISTSWWFYVKMTIILGILCFVFRRDRM 408 >UniRef50_UPI0001C37E26 polysaccharide polymerase n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37E26 Length = 441 Score = 50.9 bits (120), Expect = 8e-05, Method: Composition-based stats. 
Identities = 56/435 (12%), Positives = 123/435 (28%), Gaps = 47/435 (10%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 ++ ++ T K PA F L D + L+ T + Sbjct: 3 LLLTFFTLCFETYISYKIFDKSFCSPAFIFCAGFTLASADLLTMVDYWKVDLHWNTYYVI 62 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVH--------------NVGLLVIS 106 L F L + L + + L ++ Sbjct: 63 TGGCLVFILISFAVKKALRSFKLDTKGISLDVFPTTRNEVNISKFMLLCVLAFNALSIMI 122 Query: 107 FSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFT 166 S I + S + L S+ + + + + ++ +I+ Sbjct: 123 LSYKVISVIRSFGMYTGILSSFGKYAAISKFSNRNVSLGFLNNLMLFLRAEGYIFGNLVV 182 Query: 167 NTKVSKTFTLLVFIVFIFAIILNTGKQI-----VFMVIISYAFIVGVNRVKHYVYLITAV 221 + ++ + +++T ++M++ + K I A Sbjct: 183 VAYFKNKKIDFLLLLCFVSCVISTFITGSRGGSIYMILSLVPSVYLCREKKILKKPIKAK 242 Query: 222 GVLFSLYMLFLRGLPGGMAY---------------YLSMYLVSPIIAFQEFYFQQVSNSA 266 ++ + F + Y+S+YL +PI F Q S Sbjct: 243 YIILLGLLGFGSLFLLQLMGSVMGRTMKEEVSPWAYISIYLGAPIQNLDYFLQQPHEASE 302 Query: 267 SSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLP------TNVYTAFSDYVYISAELSY-- 318 F+ ++ G + L +F + NVYT F + Y Sbjct: 303 VFGGTTFYYQIQHYAIGHNRLDLIYDFDLPFVKFNNHNAGNVYTIFYAFFYDFGYKGVVI 362 Query: 319 ---LMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFS-FIFYHESFMTNIS-SWIQITL 373 LM ++ I + R S+ + +++ F+ F ++ +++++ + Sbjct: 363 LTGLMALMIQTIYEYTLKSIRKPFSISRLFYMYMFPTIPLSFFSNKFYEGLTIAFVKMII 422 Query: 374 CIIVFSQFLKAQKIK 388 I+ L K K Sbjct: 423 FWIILYLALIQDKYK 437 >UniRef50_B3XL31 Putative uncharacterized protein n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XL31_LACRE Length = 426 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. 
Identities = 54/422 (12%), Positives = 130/422 (30%), Gaps = 37/422 (8%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIY--AFQLNDATLI 58 ++ L + + D PA +++A+ + I + +A I Sbjct: 2 ILGLFLIFIALVFITYQLFGHDFLAPAFLFCVMYAVSIGCALINYQQWGLNDYSQEAFNI 61 Query: 59 FLLCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSN 118 +L +L +S L+ +L N +V + I V L I+ ++ + ++ Sbjct: 62 YLFGALLFIFVSYLVKLLILPNNNLEVPDYTDRININTGITVVLTFINLIVLVLWVKNIK 121 Query: 119 YQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILT--------TFALFIWSKKFTNTKV 170 G LS + + ++ + + +T Sbjct: 122 AIGGGGSLSQALENYRINTSYSIGGAGMPGYLQQMSKITTVSGYIYGFILIYNIVHHTVK 181 Query: 171 SKTFTLLV--FIVFIFAIILNTGKQIVFMVIISYAFIVGVNR---------VKHYVYLIT 219 + L + I+++ + ++ + + VI SY + + + Sbjct: 182 KDDYYLCLPNTIIYVLFSLFDSNRLNILGVIASYVVYYYFMKSSKGKGFKTLIRLTEIFI 241 Query: 220 AVGVLFSLYMLFLRGLPGG---MAYYLSMYLVSPIIAFQEFYFQQVSNS--ASSHVFWFF 274 + ++F L + Y+SMY P+ F F V N+ F Sbjct: 242 VLLIVFYGVRLMIGRSSSQGNNFIEYISMYAGGPVKLFDMFIKDPVQNTGIWGKETFPSL 301 Query: 275 ERLMGLLTGGVSMSLHKE---FVWVGLPTNVYTAFSDYVYISAELSY-----LMMVIHGC 326 + L + + K+ F NVY+A+ +++ + + + C Sbjct: 302 LKTFRSLGYDIPQYISKKEFRFYNGINLGNVYSAYRNWLSDFGINGVYILQTIFALFYSC 361 Query: 327 ISGVLWRLS-RNYISVKIFYSYFIYTFSFIFYHESFMTNI--SSWIQITLCIIVFSQFLK 383 +L ++ + +I I Y Y + + ++ + + + Sbjct: 362 YYYLLRKVGYKKHILALIIYGYMAEAIFLHPIDDWLFSMFVSVGFVIYIVVFWLLYIMIT 421 Query: 384 AQ 385 + Sbjct: 422 KK 423 >UniRef50_A5LJS7 Polysaccharide polymerase n=14 Tax=Streptococcus pneumoniae RepID=A5LJS7_STRPN Length = 447 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. 
Identities = 66/412 (16%), Positives = 136/412 (33%), Gaps = 48/412 (11%) Query: 22 DIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLN 81 D F PAV + I + + + + +++ +L+ T +L V TF + LLT+ Sbjct: 24 DFFQPAVILTIAYFISIASALVNRNVWGTELHFKTFYLILLGVATFVIVSLLTKLSYRPK 83 Query: 82 IRKVNNAIYS-IPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDAD---- 136 + +++ I K+ V LL ++ M+++ +R S S+ N+ Sbjct: 84 VEGISHEELKEINPSKIIYVILLTLNLVMLFLYIREIQKVVLFSGRSFSNITDLISNYRY 143 Query: 137 -------VEDTSRNFSAYMQPIILTTFALFIWS----KKFTNTKVSKTFTLLVFIVFIFA 185 VE+ + II T + ++ T L+ +F Sbjct: 144 LSYYSNEVENRVSGMINQLSKIIPATTLISLYIFMNNYFITKQIKKNFIYLIPIAIFFVY 203 Query: 186 IILNTGKQIVFMVIISYAFIVGVNR----------------VKHYVYLITAVGVLFSLYM 229 I++ G+ + +++ I+ + + + + + F L Sbjct: 204 AIISGGRLPLIRLVVGSLLILYIYSVYGSPKSQLTKSFKMITRSLFTFLILIVLFFLLKF 263 Query: 230 LFLRGLPGGMAYYLSMYLVSPIIAFQEFY--FQQVSNSASSHVFWFFERLMGLLTGGVSM 287 + R Y++ Y+ I F F + + + F ++ L ++ Sbjct: 264 VLGRSSQEDFISYITRYMGGSIQLFDLFVIDPIRRNKELGAETFSGIYEMLAKLGFDNNI 323 Query: 288 SLHKEFVWVG---LPTNVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVK-- 342 E+ NVYTA Y + ++ L + S+ Sbjct: 324 IKGLEWRVSPNYYSLGNVYTAIRRYYSDFGVIGIVICQSFTAWLYTLGYEKVRHYSLVTN 383 Query: 343 ------IFYSYFIYTFSFIFYHESFMTNISS---WIQITLCIIVFSQFLKAQ 385 I + Y + F ++ + IQI + +VF LK Q Sbjct: 384 VQRFRLILLAASFYPIFLNGIEDVFYISMVTIGYGIQIVIFYLVFWVLLKVQ 435 >UniRef50_B8F9T4 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9T4_DESAA Length = 449 Score = 45.5 bits (106), Expect = 0.003, Method: Composition-based stats. 
Identities = 48/373 (12%), Positives = 114/373 (30%), Gaps = 30/373 (8%) Query: 17 LYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTE- 75 L + KD+F P + L+ + S Y + + ++ +F L ++ Sbjct: 59 LLIDKDLFSPYTLFALAPFCTLIYDDFLSPSYLPLPDGLAVTCIIIGQASFLLGLKMSGV 118 Query: 76 --SVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFG-TSLLSYMNLI 132 SV+ N + + + ++ I ++ + +R N QF + + + Sbjct: 119 VRSVIWRNKKGQKHQNRLASKDNYRLLFIVGIVPFLLSLSLRTMNIQFSAEGIDEFRSNF 178 Query: 133 RDADVEDTSRNF-----------SAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIV 181 + F Y L + + F ++F Sbjct: 179 TLPIISAALSCFTTAGLLGTARERRYFVFFSLIVTMVVVGLFTQAKGSAVLIFLTIIFAS 238 Query: 182 FIFAIILNTGKQ----IVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPG 237 + + +T + ++ M + + + H +++ + Sbjct: 239 KKYWRLKSTSRIVFVSLILMFAMFQVYGMIRQGYFHSRVYVSSYEYYRQHKDIADIPRYL 298 Query: 238 GMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHV---FWFFERLMGLLTGGVSMSLHKEFV 294 + Y MY+ +P+ F + F +L T + +++ Sbjct: 299 QIFYKPYMYMETPLSNFAYLVENNFTPENGKLTAWPFISIFQLKRFYTIEKPVKPIRKW- 357 Query: 295 WVGLPTNVYTAFSDYVYISAELSYLM-MVIHGCISGVLWRLS--RNYISVKIFYSYFIYT 351 P N +T D+ + L+ + G I ++ + I + Y YF Y Sbjct: 358 ----PYNTHTFLGDFYLDFGVMGILILPFLLGLIVSAMYSRTLVNRDIIMDAVYLYFSYA 413 Query: 352 FSFIFYHESFMTN 364 +F+ F ++ Sbjct: 414 TFMMFFSNHFTSS 426 >UniRef50_Q2VJ26 O-antigen polymerase n=1 Tax=Escherichia coli RepID=Q2VJ26_ECOLX Length = 412 Score = 42.8 bits (99), Expect = 0.023, Method: Composition-based stats. 
Identities = 53/399 (13%), Positives = 135/399 (33%), Gaps = 28/399 (7%) Query: 8 VFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAF-QLNDATLIFLLCNVLT 66 +L F+ L+K I +P IF+ VL I F +++ + L+ Sbjct: 13 FYLPFIFLKFGLRKKITHPPTLFCAIFSFVLSMAYIAHITLGFYEISYESYAIYGLGNLS 72 Query: 67 FTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLL 126 F ++T+++ NI+ + + + + + + + L Sbjct: 73 FIAGGMITDALNRRNIKNSTQLVINKQEITLLKILIFTLLLYFPIAYSEFKHATPDMPLA 132 Query: 127 SYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAI 186 + IR+ +E+ + I+ + +++ F+ K++ + L F++F Sbjct: 133 LKILRIRERGLEEQVYSTITNNMIILSSCLVMYMTFI-FSIKKINVYYYSLTFVLFCAYN 191 Query: 187 ILNTGKQIVFMVIISYAFIVGVN--------RVKHYVYLITAVGVLFSLYMLFLRGLPGG 238 + + + ++ I+ FI +N ++ +TA+ + + + + Sbjct: 192 FMTGTRAAIILISIACIFIYLLNTNKNNKTSKLFLLSVGMTAIILGALVAIFMGKDGMER 251 Query: 239 MAYYLS----------MYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMS 288 L+ Y V +I F + + + + + ++ +T Sbjct: 252 DESLLANLNKVVDNYFSYTVQGVILFDNYVIGREKITPNWDILSGSAEVINKITSSNIFK 311 Query: 289 LHKEFVWVG-----LPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRNYISVK 342 +F NV T + + + + +++G + +L+ V Sbjct: 312 TDSKFSEFSHFAKDKDGNVCTIYFAIYPLYGLVGVILFFLLYGTVCTILYN--HGPGIVS 369 Query: 343 IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQF 381 + Y T ++E NI I+ +++ F Sbjct: 370 VLVGYINATLCLNIFNEQVFINIIFTIKFICFLLLIRCF 408 >UniRef50_C4FH14 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FH14_9BIFI Length = 468 Score = 40.5 bits (93), Expect = 0.099, Method: Composition-based stats. 
Identities = 45/404 (11%), Positives = 120/404 (29%), Gaps = 54/404 (13%) Query: 22 DIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCL--------- 72 D P++ + FAL + + F L+ T + ++ + F + Sbjct: 24 DFAEPSILFVLGFALSVFNGLTNYKAWNFNLSLQTCMVVMVGAVVFMATAYGVKTLFHTI 83 Query: 73 ----LTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSY 128 ++ + + +V+S S++ + + + + Sbjct: 84 VVGDVSSRRYKEPRNITLPLWVYVAGLMFTCLSFIVVSRSIVALTLPYGGDGSLSKAIGL 143 Query: 129 MNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFT----NTKVSKTFTLLVFIVFIF 184 + + E S + + + A T F L+ + I Sbjct: 144 YDHLNKFSTEGVSISGIVSLLYLSTNAMAYVWLFFAMRSFVIRTLKKDYFALINALAAIP 203 Query: 185 AIILNTGK--QIVFMVIISYAFIVGVN--------RVKHYVYLITAVGVLF-------SL 227 +++ G+ I V +I+ R++ + VL L Sbjct: 204 MSLISGGRNSLIQLGVAAFAYWILFRRQNNHWQGVRLRFRTVAFFMIIVLAGLALFKPLL 263 Query: 228 YMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSM 287 ++ + YLS+Y+ +P+ F ++ S + + + + Sbjct: 264 SLMGREPGESTIYEYLSIYIGAPMKNLDAFLTGSMNPSLAVKSMKWGDMTLASTFASFPQ 323 Query: 288 S---LHKEFVWVG--------LPTNVYTAFSDYVYISAELSYLMMV-IHGCISGVLWRLS 335 +++ NV+T + +++ ++ V +S + + + Sbjct: 324 VFGHTVLDWLNWQPFQRYGNVDLGNVFTTYYAFIFDWGIAGAMLAVAFIAALSQLCYEST 383 Query: 336 RNYISVK--------IFYSYFIYTFSFIFYHESFMTNISSWIQI 371 + + Y Y +F F+ +M+ + + I + Sbjct: 384 VYALQYGSGSVPLSMMLYGAISYCCAFSFFSNRWMSTMLNQIML 427 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.327 0.132 0.315 Lambda K H 0.267 0.0409 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,586,399,645 Number of Sequences: 3077464 Number of extensions: 57112658 Number of successful extensions: 561968 Number of sequences better than 1.0e-01: 1000 Number of HSP's better than 0.1 without gapping: 81 Number of HSP's successfully gapped in prelim test: 4698 Number of HSP's that attempted gapping in prelim test: 551525 Number of HSP's gapped (non-prelim): 10925 length of query: 388 length of database: 
1,040,396,356 effective HSP length: 131 effective length of query: 257 effective length of database: 637,248,572 effective search space: 163772883004 effective search space used: 163772883004 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.3 bits) S2: 94 (40.8 bits)