BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (388 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli Re...   776   0.0
UniRef50_D1PA58 Putative chain length determinant protein n=1 Ta...    54   1e-05
UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumon...    43   0.020

>UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli
           RepID=RFC_ECOLI
          Length = 388

 Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 388/388 (100%), Positives = 388/388 (100%)

Query: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60
           MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL
Sbjct: 1   MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60

Query: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120
           LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ
Sbjct: 61  LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120

Query: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180
           FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI
Sbjct: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180

Query: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240
           VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA
Sbjct: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240

Query: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300
           YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT
Sbjct: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300

Query: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360
           NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES
Sbjct: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360

Query: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388
           FMTNISSWIQITLCIIVFSQFLKAQKIK
Sbjct: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388

>UniRef50_D1PA58 Putative chain length determinant protein n=1
           Tax=Prevotella copri DSM 18205 RepID=D1PA58_9BACT
          Length = 683

 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 65/134 (48%), Gaps = 6/134 (4%) Query: 242 YLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHK---EFVWVGL 298 +L MYL+SP +AF + +S+S S W L G S+ H+ EFV+V + Sbjct: 522 FLGMYLMSPPVAFGHLR-RTISDSFCSESLWTIYAYTNRLMGSGSIIQHEDFGEFVYVPM 580 Query: 299 PTNVYTAFSDYVYISAELSYLMMV-IHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIF 356 PTNVYT + + I+G ++G ++R +RN + I Y+Y ++ + F Sbjct: 581 PTNVYTIMKPFYQDLGTIGVAFYAFIYGLVTGFIYRKARNGNAFGICMYTYLVFVLTMQF 640 Query: 357 YHESFMTNISSWIQ 370 + E I ++Q Sbjct: 641 FDELIFVAIPQFLQ 654 >UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LCF1_TOLAT Length = 407 Score = 42.7 bits (99), Expect = 0.020, Method: Compositional matrix adjust. Identities = 51/191 (26%), Positives = 90/191 (47%), Gaps = 27/191 (14%) Query: 204 FIVGVNRVKHYVYLITAVGVLFSLYMLFL--RGLPGGMAYY---------LSMYLVSPII 252 F++GV R +H V LI+ V VL + + +L +G+ ++ L Y ++P++ Sbjct: 205 FVLGVVRRQHIV-LISIVIVLLFILVAYLTGKGISEEQSFIENINSFLENLRSYTIAPVV 263 Query: 253 AFQEFYFQQVSNSA---SSHVFWF---FERLMGLLTGGVSMSLHKEFVWVGLPTNVYTAF 306 A +Q+SNS+ + F F F +G+ G V M L K+FV PTNVYT + Sbjct: 264 AMNML-IEQISNSSIFYGEYSFRFIFVFLSKLGVYDGSV-MPLIKDFVDTPYPTNVYTVY 321 Query: 307 S----DYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHESFM 362 D+ Y + +M H + Y+ YS +Y F+ + ++ Sbjct: 322 DIYVRDFGYFGFASVFFIMFFHFVLYEKCMLKGGFYV---FLYSASLYPLIMQFFQDQYL 378 Query: 363 TNISSWIQITL 373 + +S+WIQ+++ Sbjct: 379 SLLSTWIQVSV 389 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli Re... 545 e-153 UniRef50_D1PA58 Putative chain length determinant protein n=1 Ta... 173 1e-41 Sequences not found previously or not previously below threshold: UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriav... 62 3e-08 UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepI... 
62 4e-08 UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification prote... 60 2e-07 UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevote... 56 2e-06 UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralston... 52 4e-05 UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechlor... 51 1e-04 UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumon... 45 0.006 >UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli RepID=RFC_ECOLI Length = 388 Score = 545 bits (1404), Expect = e-153, Method: Composition-based stats. Identities = 388/388 (100%), Positives = 388/388 (100%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL Sbjct: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ Sbjct: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 Query: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI Sbjct: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180 Query: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA Sbjct: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240 Query: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT Sbjct: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300 Query: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES Sbjct: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360 Query: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388 FMTNISSWIQITLCIIVFSQFLKAQKIK Sbjct: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 
388 >UniRef50_D1PA58 Putative chain length determinant protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA58_9BACT Length = 683 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. Identities = 53/208 (25%), Positives = 91/208 (43%), Gaps = 15/208 (7%) Query: 187 ILNTGKQIVFMVIISYAFIVGVNRV-KHYVYLITAVGVLFSLYMLFLRGLPGGMAY---- 241 I N K F+V I+ F++ R+ K +I + ++ Y+ L Y Sbjct: 458 IANMEKLTFFLVFITIFFVLFERRIIKLRTIVICGILLIIGFYIFNLSRSDSDSDYQKNT 517 Query: 242 ----YLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHK---EFV 294 +L MYL+SP +AF + +S+S S W L G S+ H+ EFV Sbjct: 518 SILDFLGMYLMSPPVAFGHLR-RTISDSFCSESLWTIYAYTNRLMGSGSIIQHEDFGEFV 576 Query: 295 WVGLPTNVYTAFSDYVYISAELSYLMMV-IHGCISGVLWRLSRNYISVKI-FYSYFIYTF 352 +V +PTNVYT + + I+G ++G ++R +RN + I Y+Y ++ Sbjct: 577 YVPMPTNVYTIMKPFYQDLGTIGVAFYAFIYGLVTGFIYRKARNGNAFGICMYTYLVFVL 636 Query: 353 SFIFYHESFMTNISSWIQITLCIIVFSQ 380 + F+ E I ++Q + + Q Sbjct: 637 TMQFFDELIFVAIPQFLQRMFLVYIICQ 664 >UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LL33_RALME Length = 456 Score = 62.3 bits (150), Expect = 3e-08, Method: Composition-based stats. 
Identities = 43/222 (19%), Positives = 97/222 (43%), Gaps = 18/222 (8%) Query: 172 KTFTLLVFIVFIFAII----LNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSL 227 +TF +++F +F +I + G ++ V+++ +F++ + L+T GV + Sbjct: 237 RTFFMMLFCFLLFPLIFRGKIKLGGVVIAGVVLAASFVM--------IALLTQRGVSATA 288 Query: 228 YMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSM 287 + G+ L +Y +SPI A + S + + F FF L ++ + + Sbjct: 289 S---VDDNVDGIIKTLRVYFLSPIFAMGSVFDGNGSATYGDYTFRFFYMLANVIGLNIEI 345 Query: 288 S-LHKEFVWVGLPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRN-YISVKIF 344 L +++V + TNV+T Y L + G L+ +++ Sbjct: 346 PPLIRDYVLIPDMTNVFTVMDPYYRDFGVGGVLIFAALSGLAHDALYEKAKSEGGPYIFI 405 Query: 345 YSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQK 386 ++ F++ F+ + +M+ +S+WIQ+ ++F + +K Sbjct: 406 HAAFMFPLVMQFFQDMYMSLLSTWIQVVFWYMLFVRVNSGKK 447 >UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepID=Q9AYY5_BPHK6 Length = 408 Score = 61.9 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 53/263 (20%), Positives = 107/263 (40%), Gaps = 18/263 (6%) Query: 132 IRDADVEDTSRN----FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAII 187 +R A++ED + F P+I+ FA K K+S +L+F + Sbjct: 129 LRLANIEDDYQYPTFIFMPSFYPVIMAMFASICIFKSRWIDKISVCTWVLLFAIGTMGKF 188 Query: 188 LNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYL 247 ++F+ I + + ++ Y ++ A ++ Y + +AY Y+ Sbjct: 189 AVITPIMMFVTIYELKNGISMKKIFIYAPVVLACIIIMHFYRMSDDD-SATIAYIFGTYI 247 Query: 248 VSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHK------EFVWVGLPTN 301 SP+IAF + +S + + R + + + +S + ++V+V PTN Sbjct: 248 YSPLIAFG-----TLIDSGINWSGDYTLRFINAINYKIGISSVEPVKTILDYVYVPSPTN 302 Query: 302 VYTAFSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIFYHE 359 VY+ + VI+G + ++ ++ V + YS + F E Sbjct: 303 VYSVMQPFYSDMGIYGVAFGAVIYGVLLSAIYSSAKAGNLVMLGLYSVLSVSLITQFMSE 362 Query: 360 SFMTNISSWIQITLCIIVFSQFL 382 + +TN+S I++ LC+ V +F Sbjct: 363 TIITNLSGNIKLLLCMFVVFRFF 385 >UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification protein n=2 Tax=Enterobacteriaceae RepID=Q1RA39_ECOUT Length = 
396 Score = 59.6 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 67/144 (46%), Gaps = 6/144 (4%) Query: 243 LSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGL--LTGGVSMSLHKEFVWVGLPT 300 L +Y+ SPIIA + + S+ + F F + L + ++ ++ +V +PT Sbjct: 246 LGLYIYSPIIALGQL-NEVNSSHFGEYTFRFIYAITNKIGLIKELPVNTILDYSYVPVPT 304 Query: 301 NVYTAFSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKIFYSYFIYTFS--FIFY 357 NVYTA + + V++G I L+ + + Y +++ S F+ Sbjct: 305 NVYTALQPFYQDFGYTGIIFGAVLYGLIYVSLYTAGVRGNNTQALLIYALFSVSSATAFF 364 Query: 358 HESFMTNISSWIQITLCIIVFSQF 381 E+ +TN++ +++ LC I+ +F Sbjct: 365 AETLVTNLAGNVKLVLCTILLWRF 388 >UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJ75_9BACT Length = 401 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 48/230 (20%), Positives = 99/230 (43%), Gaps = 30/230 (13%) Query: 177 LVFIVFIFAIILNTGK--QIVFMVIISYAFIVGVNRVKHYVYLITAVGVLF--------- 225 + IVFIF + ++ K +I+ +VI A ++ +++ +V+L+ V VL+ Sbjct: 158 VALIVFIFIFVYSSKKWLKILAVVINVLAALISMSKTGFFVFLVPMVYVLYLRGKIKLRT 217 Query: 226 -----------SLYMLFLRGLPGGMAYY-----LSMYLVSPIIAFQEFYFQQVSNSASSH 269 S++ + R + + L++Y++SP +AF + + + Sbjct: 218 IGIILLIFIGFSIWFQYARSMASQQDSFSATSMLTIYIMSPCVAFDYYVEPASATHFGEY 277 Query: 270 VFWFFERLMGLLTGGVS-MSLHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCI 327 VF F+ +M L + +S +FV V TN YT + + L ++G Sbjct: 278 VFRFYYAIMHSLGSNIEPVSNVLKFVGVPEETNTYTILYPFYHDFGLPGVGLFGGLYGAF 337 Query: 328 SGVLWRLSRNY-ISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCII 376 L++ +++ I Y+ F+ F E+ ++N S +Q + I+ Sbjct: 338 YAFLYKRAQSGQNVYFILYACFLNYLILQFVQENILSNFSLNLQYVILIL 387 >UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Q91_RALEJ Length = 410 Score = 51.9 bits (123), Expect = 4e-05, Method: Composition-based stats. 
Identities = 33/129 (25%), Positives = 58/129 (44%), Gaps = 3/129 (2%) Query: 246 YLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGG--VSMSLHKEFVWVGLPTNVY 303 Y ++P++AF S + F + L ++L K + +V PTNVY Sbjct: 259 YTIAPLLAFSRLVEWNPDLSWGENTFRLLISIQYALGISTLAPVALMKGYAFVPDPTNVY 318 Query: 304 TAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIF-YSYFIYTFSFIFYHESFM 362 T + Y + L L+ + + L+R +R V IF YS +Y F+ + + Sbjct: 319 TVYEVYFRDFSYLGVLIPPVFLIVHYWLYRRARARGGVWIFYYSASVYPLVMQFFQDQYF 378 Query: 363 TNISSWIQI 371 + +S+WIQ+ Sbjct: 379 SLLSTWIQV 387 >UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47GK7_DECAR Length = 400 Score = 50.7 bits (120), Expect = 1e-04, Method: Composition-based stats. Identities = 73/390 (18%), Positives = 152/390 (38%), Gaps = 31/390 (7%) Query: 2 IYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLL 61 + +V+S + + Y K + PA +I+ V L Y+I D DA + +L Sbjct: 1 MMIVVSFLCVAIVLLQYRAKKLMAPASINAVIWLAVSLTYQIQGDSLDVLRWDAIAV-VL 59 Query: 62 CNVLTFTLSCLLTESV---LDLNIRKVNNAIYSIPSKKVHNVGLLVISFS-MIYICMRLS 117 + TF + +E + L R + +P+ +G+L I + + + Sbjct: 60 VGIATFGFGAVCSERIHFALPQPSRSGGAPSFFLPAL----LGVLTIGLAGNLGRSLEYV 115 Query: 118 NYQFGTSLL----SYMNLIRDADVEDTSRNFS--AYMQPIILTTFALFIWSKKFTNTKVS 171 ++ G SL S+ +R+ + D +F +Y P+ A + K+ K + Sbjct: 116 HFVDGMSLFGGQNSWYGSLRNTLIADHHGSFGIWSYFLPLSYAAVAYLLCDKE----KSA 171 Query: 172 KTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVG----VNRVKHYVYLITAVGVLFSL 227 + + ++ +V + L TG+ + + + +V +N+ + L+ ++F + Sbjct: 172 RHYAIITSLVTFGYVFLATGRTFILLFVTILFVVVAYRGWLNKTRSIALLVPLFLIVFWV 231 Query: 228 YMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSH---VFWFFERLMGLLTGG 284 + + GG Y MY V+P+ F + + +H F ++ L Sbjct: 232 SPIIAGRVSGGGFNYFMMYFVAPLANFD--WGMHGVFACCTHGETTFRTIFAVLAKLGFN 289 Query: 285 VSMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRNYISVK 342 V + L + + L NVYT F Y + G + + R + + Sbjct: 290 VPVVELIQPWAGTKLSGNVYTVFMPYYRDFGLAGVALFLFFFGALHTWISRQASLDNPLA 349 Query: 343 IFY-SYFIYTFSFIFYHESFMTNISSWIQI 371 +F + F Y F+ + + + +S W+Q+ Sbjct: 350 
VFLNAIFFYALVMQFFQDQYFSLLSQWVQM 379 >UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LCF1_TOLAT Length = 407 Score = 44.6 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 46/192 (23%), Positives = 83/192 (43%), Gaps = 17/192 (8%) Query: 204 FIVGVNRVKHYVYLITAVGVLFSLYMLFL-RGLPGGMAYY---------LSMYLVSPIIA 253 F++GV R +H V + + +LF L +G+ ++ L Y ++P++A Sbjct: 205 FVLGVVRRQHIVLISIVIVLLFILVAYLTGKGISEEQSFIENINSFLENLRSYTIAPVVA 264 Query: 254 FQEFYFQQVSNS---ASSHVFWFFERLMGLL--TGGVSMSLHKEFVWVGLPTNVYTAFSD 308 +Q+SNS + F F + L G M L K+FV PTNVYT + Sbjct: 265 MNML-IEQISNSSIFYGEYSFRFIFVFLSKLGVYDGSVMPLIKDFVDTPYPTNVYTVYDI 323 Query: 309 YVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIF-YSYFIYTFSFIFYHESFMTNISS 367 YV + + VL+ +F YS +Y F+ + +++ +S+ Sbjct: 324 YVRDFGYFGFASVFFIMFFHFVLYEKCMLKGGFYVFLYSASLYPLIMQFFQDQYLSLLST 383 Query: 368 WIQITLCIIVFS 379 WIQ+++ + + Sbjct: 384 WIQVSVIYYILT 395 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli Re... 345 2e-93 UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechlor... 299 1e-79 UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepI... 187 5e-46 UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriav... 165 3e-39 UniRef50_D1PA58 Putative chain length determinant protein n=1 Ta... 162 3e-38 UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification prote... 141 4e-32 UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralston... 129 2e-28 UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevote... 127 6e-28 Sequences not found previously or not previously below threshold: UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumon... 84 9e-15 UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinoco... 
80 2e-13 UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane prot... 63 1e-08 UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae ... 53 2e-05 UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, puta... 45 0.004 UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdema... 45 0.005 UniRef50_B8C4D4 Predicted protein n=1 Tax=Thalassiosira pseudona... 42 0.055 UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudom... 42 0.056 UniRef50_C0D232 Putative uncharacterized protein n=1 Tax=Clostri... 41 0.071 >UniRef50_P37748 O-antigen polymerase n=5 Tax=Escherichia coli RepID=RFC_ECOLI Length = 388 Score = 345 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 388/388 (100%), Positives = 388/388 (100%) Query: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL Sbjct: 1 MIYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ Sbjct: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQ 120 Query: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI Sbjct: 121 FGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFI 180 Query: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA Sbjct: 181 VFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA 240 Query: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT Sbjct: 241 YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPT 300 Query: 301 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360 NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES Sbjct: 301 
NVYTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHES 360 Query: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388 FMTNISSWIQITLCIIVFSQFLKAQKIK Sbjct: 361 FMTNISSWIQITLCIIVFSQFLKAQKIK 388 >UniRef50_Q47GK7 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47GK7_DECAR Length = 400 Score = 299 bits (766), Expect = 1e-79, Method: Composition-based stats. Identities = 74/397 (18%), Positives = 155/397 (39%), Gaps = 31/397 (7%) Query: 2 IYLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLL 61 + +V+S + + Y K + PA +I+ V L Y+I D DA + +L Sbjct: 1 MMIVVSFLCVAIVLLQYRAKKLMAPASINAVIWLAVSLTYQIQGDSLDVLRWDAIAV-VL 59 Query: 62 CNVLTFTLSCLLTESV---LDLNIRKVNNAIYSIPSKKVHNVGLLVISFS-MIYICMRLS 117 + TF + +E + L R + +P+ +G+L I + + + Sbjct: 60 VGIATFGFGAVCSERIHFALPQPSRSGGAPSFFLPAL----LGVLTIGLAGNLGRSLEYV 115 Query: 118 NYQFGTSLL----SYMNLIRDADVEDTSRNFS--AYMQPIILTTFALFIWSKKFTNTKVS 171 ++ G SL S+ +R+ + D +F +Y P+ A + K+ K + Sbjct: 116 HFVDGMSLFGGQNSWYGSLRNTLIADHHGSFGIWSYFLPLSYAAVAYLLCDKE----KSA 171 Query: 172 KTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVG----VNRVKHYVYLITAVGVLFSL 227 + + ++ +V + L TG+ + + + +V +N+ + L+ ++F + Sbjct: 172 RHYAIITSLVTFGYVFLATGRTFILLFVTILFVVVAYRGWLNKTRSIALLVPLFLIVFWV 231 Query: 228 YMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSH---VFWFFERLMGLLTGG 284 + + GG Y MY V+P+ F + + +H F ++ L Sbjct: 232 SPIIAGRVSGGGFNYFMMYFVAPLANFD--WGMHGVFACCTHGETTFRTIFAVLAKLGFN 289 Query: 285 VSMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISVK 342 V + L + + L NVYT F Y L + G + + R + + Sbjct: 290 VPVVELIQPWAGTKLSGNVYTVFMPYYRDFGLAGVALFLFFFGALHTWISRQASLDNPLA 349 Query: 343 IFY-SYFIYTFSFIFYHESFMTNISSWIQITLCIIVF 378 +F + F Y F+ + + + +S W+Q+ + + Sbjct: 350 VFLNAIFFYALVMQFFQDQYFSLLSQWVQMIFWMSLL 386 >UniRef50_Q9AYY5 O-antigen modification protein n=2 Tax=root RepID=Q9AYY5_BPHK6 Length = 408 Score = 187 bits (475), Expect = 5e-46, Method: Composition-based stats. 
Identities = 52/262 (19%), Positives = 103/262 (39%), Gaps = 10/262 (3%) Query: 132 IRDADVEDTSRN----FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAII 187 +R A++ED + F P+I+ FA K K+S +L+F + Sbjct: 129 LRLANIEDDYQYPTFIFMPSFYPVIMAMFASICIFKSRWIDKISVCTWVLLFAIGTMGKF 188 Query: 188 LNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYL 247 ++F+ I + + ++ Y ++ A ++ Y + +AY Y+ Sbjct: 189 AVITPIMMFVTIYELKNGISMKKIFIYAPVVLACIIIMHFYRMSDDD-SATIAYIFGTYI 247 Query: 248 VSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV--SMSLHKEFVWVGLPTNVYTA 305 SP+IAF N + + F + + + ++V+V PTNVY+ Sbjct: 248 YSPLIAFGTLID-SGINWSGDYTLRFINAINYKIGISSVEPVKTILDYVYVPSPTNVYSV 306 Query: 306 FSDYVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIFYHESFMT 363 + VI+G + ++ ++ V + YS + F E+ +T Sbjct: 307 MQPFYSDMGIYGVAFGAVIYGVLLSAIYSSAKAGNLVMLGLYSVLSVSLITQFMSETIIT 366 Query: 364 NISSWIQITLCIIVFSQFLKAQ 385 N+S I++ LC+ V +F + Sbjct: 367 NLSGNIKLLLCMFVVFRFFTKK 388 >UniRef50_Q1LL33 Putative uncharacterized protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LL33_RALME Length = 456 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. 
Identities = 43/224 (19%), Positives = 97/224 (43%), Gaps = 18/224 (8%) Query: 170 VSKTFTLLVFIVFIFAII----LNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLF 225 +TF +++F +F +I + G ++ V+++ +F++ + L+T GV Sbjct: 235 TGRTFFMMLFCFLLFPLIFRGKIKLGGVVIAGVVLAASFVM--------IALLTQRGVSA 286 Query: 226 SLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGV 285 + + G+ L +Y +SPI A + S + + F FF L ++ + Sbjct: 287 TAS---VDDNVDGIIKTLRVYFLSPIFAMGSVFDGNGSATYGDYTFRFFYMLANVIGLNI 343 Query: 286 SMS-LHKEFVWVGLPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRN-YISVK 342 + L +++V + TNV+T Y L + G L+ +++ Sbjct: 344 EIPPLIRDYVLIPDMTNVFTVMDPYYRDFGVGGVLIFAALSGLAHDALYEKAKSEGGPYI 403 Query: 343 IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQK 386 ++ F++ F+ + +M+ +S+WIQ+ ++F + +K Sbjct: 404 FIHAAFMFPLVMQFFQDMYMSLLSTWIQVVFWYMLFVRVNSGKK 447 >UniRef50_D1PA58 Putative chain length determinant protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA58_9BACT Length = 683 Score = 162 bits (409), Expect = 3e-38, Method: Composition-based stats. Identities = 79/374 (21%), Positives = 146/374 (39%), Gaps = 22/374 (5%) Query: 22 DIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLN 81 DIF P +I++++ + +SDI + D + + FT + + T ++ N Sbjct: 298 DIFAPWTLSLLIWSILGIIIAFSSDIID-PIQDVFYTNISVWLAIFTATSISTYLLMPAN 356 Query: 82 IR-KVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDT 140 K I + + L + + +Y+ ++ MN +R+ V Sbjct: 357 QDIKTGVKGIKINITIFNILFFLSMIMTPLYM-YQIYKIVTMFDSKDLMNNMRELAVNGN 415 Query: 141 SRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVII 200 F Y I +W + K + V + I N K F+V I Sbjct: 416 GHGFLNYTMVINQALLLTGLW----RFPYLPKWKIICVVGCCLAYAIANMEKLTFFLVFI 471 Query: 201 SYAFIVGVNRV-KHYVYLITAVGVLFSLYMLFLRGLPGGMAY--------YLSMYLVSPI 251 + F++ R+ K +I + ++ Y+ L Y +L MYL+SP Sbjct: 472 TIFFVLFERRIIKLRTIVICGILLIIGFYIFNLSRSDSDSDYQKNTSILDFLGMYLMSPP 531 Query: 252 IAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEF---VWVGLPTNVYTAFSD 308 +AF + +S+S S W L G S+ H++F V+V +PTNVYT Sbjct: 532 VAFGHLR-RTISDSFCSESLWTIYAYTNRLMGSGSIIQHEDFGEFVYVPMPTNVYTIMKP 590 
Query: 309 YVYISAELSYLM-MVIHGCISGVLWRLSRNYISVKI-FYSYFIYTFSFIFYHESFMTNIS 366 + + I+G ++G ++R +RN + I Y+Y ++ + F+ E I Sbjct: 591 FYQDLGTIGVAFYAFIYGLVTGFIYRKARNGNAFGICMYTYLVFVLTMQFFDELIFVAIP 650 Query: 367 SWIQITLCIIVFSQ 380 ++Q + + Q Sbjct: 651 QFLQRMFLVYIICQ 664 >UniRef50_Q1RA39 Bacteriophage HK620 O-antigen modification protein n=2 Tax=Enterobacteriaceae RepID=Q1RA39_ECOUT Length = 396 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 99/243 (40%), Gaps = 7/243 (2%) Query: 144 FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVIISYA 203 + P+++ FA+ +K K S F + ++ + + +++I + Sbjct: 148 LMPAVYPLMMAMFAIVCLTKTSKLNKYSIYFWMFLYCIGTMGKFSILTPILTYLIIYDFK 207 Query: 204 FIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVS 263 + V + + LI + + + L +Y+ SPIIA + + S Sbjct: 208 HRLKVKKTIKFTLLIIILALTLHFTRMAENDHS-TFLSILGLYIYSPIIALGQL-NEVNS 265 Query: 264 NSASSHVFWFFERLMGLLT--GGVSMSLHKEFVWVGLPTNVYTAFSDYVYISAELSYLM- 320 + + F F + + + ++ ++ +V +PTNVYTA + + Sbjct: 266 SHFGEYTFRFIYAITNKIGLIKELPVNTILDYSYVPVPTNVYTALQPFYQDFGYTGIIFG 325 Query: 321 MVIHGCISGVLWRLSRNYISVKIFYSYFIYTFS--FIFYHESFMTNISSWIQITLCIIVF 378 V++G I L+ + + Y +++ S F+ E+ +TN++ +++ LC I+ Sbjct: 326 AVLYGLIYVSLYTAGVRGNNTQALLIYALFSVSSATAFFAETLVTNLAGNVKLVLCTILL 385 Query: 379 SQF 381 +F Sbjct: 386 WRF 388 >UniRef50_Q46Q91 Putative uncharacterized protein n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Q91_RALEJ Length = 410 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. 
Identities = 70/372 (18%), Positives = 149/372 (40%), Gaps = 22/372 (5%) Query: 26 PAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDLNIRKV 85 P ++ +LLGY ++ D + + L+ L +L + L + Sbjct: 26 PFQIYFFVWFSLLLGYYLSRDSFISMSVEFVLLILTAKLLALLIMILCCRGLQGRGPEIR 85 Query: 86 NNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMN--LIRDADVEDTSRN 143 + + + ++ + ++ LV ++ + R + G S+ + + +R A ED Sbjct: 86 RHGVIARKTRFI-DLAQLVAIIALPLVYARATEIAGGESVFTVLGYIQLRAAMTEDGEGY 144 Query: 144 -FSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQ-IVFMVIIS 201 AY+ + T ++ I+ + +N + + L V + L+TG+ I+F + ++ Sbjct: 145 GILAYLWVLSFVTTSVSIFLYRQSNLGFGRLW--LSVAVSLCYCYLSTGRTYILFFLCLA 202 Query: 202 YAFIVGVNRVKHYVYLITAVGVLFSLYMLF---------LRGLPGGMAYYLSM---YLVS 249 ++ V ++ LIT + + + G+ + +L Y ++ Sbjct: 203 LVPLMNVGAIRMRGLLITLIIFIALSVFVAGMTAKGISADDGIVENVESFLESMKGYTIA 262 Query: 250 PIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGG--VSMSLHKEFVWVGLPTNVYTAFS 307 P++AF S + F + L ++L K + +V PTNVYT + Sbjct: 263 PLLAFSRLVEWNPDLSWGENTFRLLISIQYALGISTLAPVALMKGYAFVPDPTNVYTVYE 322 Query: 308 DYVYISAELSYLMMVIHGCISGVLWRLSRNYISVKIF-YSYFIYTFSFIFYHESFMTNIS 366 Y + L L+ + + L+R +R V IF YS +Y F+ + + + +S Sbjct: 323 VYFRDFSYLGVLIPPVFLIVHYWLYRRARARGGVWIFYYSASVYPLVMQFFQDQYFSLLS 382 Query: 367 SWIQITLCIIVF 378 +WIQ+ + Sbjct: 383 TWIQVWFWYWLL 394 >UniRef50_C9LJ75 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJ75_9BACT Length = 401 Score = 127 bits (320), Expect = 6e-28, Method: Composition-based stats. 
Identities = 75/409 (18%), Positives = 159/409 (38%), Gaps = 57/409 (13%) Query: 3 YLVISVFLITAFICL--YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFL 60 L I+ ++ AF + Y+ +D+F P V ++ +LL + ++ ++D + + Sbjct: 1 MLFIAFLIVAAFTFVGWYITRDVFSPFVLQPGVWFGILLLFYLSDPELYPIIHDFPISLI 60 Query: 61 LCNVLTFTLSCLLTESVLDLNIRKVNNA---IYSIPSKKVHNVGLLVISFSMIYICMRLS 117 + + ++ + D ++ PS+ V + L + S+ I L Sbjct: 61 VWTISFLGVAYPTYYYLPDHSLISRRPPLLASVLTPSRLVLKLYLFIAIISVPLILYTLM 120 Query: 118 NYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLL 177 Y + M +R A +DT + V+ + Sbjct: 121 RYGMERGESNLMTYLRIASYDDTL----------------------DKPDLGVAYYTIGV 158 Query: 178 VFIVFIFAIILNTGK--QIVFMVIISYAFIVGVNRVKHYVYLITAVGVLF---------- 225 IVFIF + ++ K +I+ +VI A ++ +++ +V+L+ V VL+ Sbjct: 159 ALIVFIFIFVYSSKKWLKILAVVINVLAALISMSKTGFFVFLVPMVYVLYLRGKIKLRTI 218 Query: 226 ----------SLYMLFLRGLPGGMAYY-----LSMYLVSPIIAFQEFYFQQVSNSASSHV 270 S++ + R + + L++Y++SP +AF + + +V Sbjct: 219 GIILLIFIGFSIWFQYARSMASQQDSFSATSMLTIYIMSPCVAFDYYVEPASATHFGEYV 278 Query: 271 FWFFERLMGLLTGGV-SMSLHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCIS 328 F F+ +M L + +S +FV V TN YT + + L ++G Sbjct: 279 FRFYYAIMHSLGSNIEPVSNVLKFVGVPEETNTYTILYPFYHDFGLPGVGLFGGLYGAFY 338 Query: 329 GVLWRLSRNY-ISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCII 376 L++ +++ I Y+ F+ F E+ ++N S +Q + I+ Sbjct: 339 AFLYKRAQSGQNVYFILYACFLNYLILQFVQENILSNFSLNLQYVILIL 387 >UniRef50_C4LCF1 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LCF1_TOLAT Length = 407 Score = 83.9 bits (206), Expect = 9e-15, Method: Composition-based stats. 
Identities = 68/377 (18%), Positives = 137/377 (36%), Gaps = 19/377 (5%) Query: 21 KDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESVLDL 80 K I P +++ V L +T D + N+ ++ + ++ LS +L L Sbjct: 18 KFIANPFKIFFMLWFFVFLTLYLTIDDWVEISNEFIVVNITTSLSVLLLSYILKGKTECL 77 Query: 81 NIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMNLIRD-ADVED 139 + + NA+ + + +I F +++ + ++ S + + + D Sbjct: 78 HSWRDENALNELFINRYLCFVFQIICFIGLFLAYYRVSLLIPDNIFSPVGYTKLRMSIGD 137 Query: 140 TSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVI 199 + ++ L+ + +V +L I + L+TG+ M Sbjct: 138 DTVDYGILAYFFTLSFIVTSLTIILRVRGEVGSIRLILSIISSLSYCYLSTGRTFFLMFF 197 Query: 200 ISYA---FIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAYY----------LSMY 246 F++GV R +H V + + +LF L + L Y Sbjct: 198 CFSFAPLFVLGVVRRQHIVLISIVIVLLFILVAYLTGKGISEEQSFIENINSFLENLRSY 257 Query: 247 LVSPIIAFQEFYFQQVSNS--ASSHVFWFFERLMGLLTG--GVSMSLHKEFVWVGLPTNV 302 ++P++A Q ++S + F F + L G M L K+FV PTNV Sbjct: 258 TIAPVVAMNMLIEQISNSSIFYGEYSFRFIFVFLSKLGVYDGSVMPLIKDFVDTPYPTNV 317 Query: 303 YTAFSDYVYISAELSYLMMVIHGCISGVLWRLSRN-YISVKIFYSYFIYTFSFIFYHESF 361 YT + YV + + VL+ YS +Y F+ + + Sbjct: 318 YTVYDIYVRDFGYFGFASVFFIMFFHFVLYEKCMLKGGFYVFLYSASLYPLIMQFFQDQY 377 Query: 362 MTNISSWIQITLCIIVF 378 ++ +S+WIQ+++ + Sbjct: 378 LSLLSTWIQVSVIYYIL 394 >UniRef50_C1CYL3 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1CYL3_DEIDV Length = 427 Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats. 
Identities = 71/390 (18%), Positives = 141/390 (36%), Gaps = 29/390 (7%) Query: 18 YLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLCNVLTFTLSCLLTESV 77 + +D+ YPAV I++ + L+ Y + Y L+++ ++ + VL F+ LT S Sbjct: 20 FFSRDVRYPAVLQIIVWLVTLVVYVVERHRY-VSLSESVMLIIFLGVLGFSAGSFLTLSS 78 Query: 78 LDLNIRKVNNAIYS-------IPSKKVHNVGLLVISFSMIYICMRLSNYQFGTSLLSYMN 130 L + + + LL I+ + +YI Q G + +N Sbjct: 79 LGGRSLGSRKTVLLRMRHLSDVRLVLAFFLTLLAIAGAAVYIQTATQFAQTGPTQDLALN 138 Query: 131 LIRDADVEDTSRNFS---AYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAII 187 L V +Y PI+ T + + K L + + Sbjct: 139 LRYLTSVRGEVPPLMRLTSYALPILNTLSGFCLIYYRANRDKRVLPILFLAVASALVMSV 198 Query: 188 LNTGKQIVFMVIISYAFIVGV-NRVKHYVYLITAVGVLFSLYMLFLRGLPGGMA------ 240 +TG+ I+ II I + ++ ++ + + S++ + L G+ Sbjct: 199 FSTGRGIILFFIIEMGIIYAMTSKRLRLRLVLLGLLMFLSIFYIGASVLGKGVDQNASLL 258 Query: 241 -------YYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSM-SLHKE 292 +S+YL+S I+A V++ + F L L ++ L + Sbjct: 259 ESFPDLFSSISLYLLSGILALSVQLPTLVTDEGGVNTFRTIHALGRALGFDATVVPLVQA 318 Query: 293 FVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNYISVKIFY--SYFI 349 F + PTNVYT + Y+ L + + G + L+ R + + Sbjct: 319 FTNIPQPTNVYTIYLTYLKDFGWLGIFIFQFLFGILHATLFIAFRRTGGAVALFWLAILS 378 Query: 350 YTFSFIFYHESFMTNISSWIQITLCIIVFS 379 + + + + + +S+WIQ +FS Sbjct: 379 FPLLTQPFTDGYFSLMSTWIQYAFFSSLFS 408 >UniRef50_Q8XQM0 Putative o-antigen polymerase transmembrane protein n=1 Tax=Ralstonia solanacearum RepID=Q8XQM0_RALSO Length = 390 Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats. 
Identities = 40/242 (16%), Positives = 89/242 (36%), Gaps = 14/242 (5%) Query: 146 AYMQPIILTTFALFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFI 205 +Y +L FA+++ S+ + S F ++ F+ A I ++G+ ++ ++ S Sbjct: 133 SYYFSALLVVFAVYLVSQA---SHYSPGFLVIGFVAATVAAISSSGRTLLLLLFTSTPVS 189 Query: 206 VGV-----NRVKHYVYLITAVGVLFSLYM----LFLRGLPGGMAYYLSMYLVSPIIAFQE 256 + + + L+ L + F+ L + + L +Y+++ + +F Sbjct: 190 LYLQNKIRKKTFFASLLVFLCFFLALAVLNGKGAFINDLYSQITWNLEVYVLNGLASFNH 249 Query: 257 FYFQQVSNSASSHVFWFFERLMGLLTGGVSMSLHKEFVWVGLPTNVYTAFSDYVYISAEL 316 F + + + R + L G + L FV P NVYTA + + L Sbjct: 250 FVTNNHPSFDGNILVPNLLRRIFELEGDA-IPLVLPFVETPFPGNVYTALYPWYHDGGAL 308 Query: 317 SYLMM-VIHGCISGVLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCI 375 ++ + G S + +YS Y + + ++ W+ L Sbjct: 309 GLMVGFFLIGAFSQYFYHARHKSFKHTFYYSISAYALIMTIFQDQYIQAYPLWMMAILSP 368 Query: 376 IV 377 + Sbjct: 369 FL 370 >UniRef50_Q03584 O-antigen polymerase n=5 Tax=Enterobacteriaceae RepID=RFC_SHIDY Length = 380 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. 
Identities = 33/229 (14%), Positives = 85/229 (37%), Gaps = 10/229 (4%) Query: 158 LFIWSKKFTNTKVSKTFTLLVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYL 217 F + F + F+ A++ + ++ ++ V++ K +Y Sbjct: 145 SFCCMYLARHENKKNYFYCFTLLSFLLAVLSTSKIFLILFLVYIVGINSYVSKKKLLIYG 204 Query: 218 ITAVG-------VLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHV 270 + G +L + + L +YL S + AF + + + ++ Sbjct: 205 VFVFGLFALSSIILGKFSSDPEGKIISAIFDTLRVYLFSGLAAFNLYVEKN--ATLPENL 262 Query: 271 FWFFERLMGLLTGGVSMSLHKEFVWVGL-PTNVYTAFSDYVYISAELSYLMMVIHGCISG 329 + + + T + + ++ +G+ TNVYTAF+ + + +++ I Sbjct: 263 LLYPFKEVWGTTKDIPKTDILPWINIGVWDTNVYTAFAPWYQSLGLYAAIIIGILLGFYY 322 Query: 330 VLWRLSRNYISVKIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVF 378 +W R ++V + ++ + +F+ E ++ + LC I+ Sbjct: 323 GIWFSFRQNLAVGFYQTFLCFPLLMLFFQEHYLLSWKMHFIYFLCAILL 371 >UniRef50_A3CQX5 Oligosaccharide repeat unit polymerase Wzy, putative n=2 Tax=Streptococcus RepID=A3CQX5_STRSV Length = 441 Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 61/176 (34%), Gaps = 10/176 (5%) Query: 214 YVYLITAVGVLFSLYMLFLRGLPGGMAYYLSMYLVSPIIAFQEFYFQQVSNSASSHVFWF 273 +V+L+ + V + G+ L Y+ SPI A + S + Sbjct: 246 FVFLVLFMAVGTFVLNRVDSRAEFGILDNLIKYMGSPIQALDYYLKNPTLYSDNQVFGEN 305 Query: 274 FERLMGLLTGGVSMSLHKEFVWVG------LPTNVYTAFSDYVYISAELSYL-MMVIHGC 326 + + +S H+ ++ TNVYT + ++ S L + +G Sbjct: 306 TLIAIYGTLKSLGLSSHELTPFLPVVHFNDDKTNVYTIYYYFIKDFGYFSVLIFQLSYGF 365 Query: 327 ISGVLWRLSRNYISV---KIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFS 379 G + + I ++ F Y F+ E+ ++ +++ I + V Sbjct: 366 FYGSFYYSIKKRYFTPLKVIVFALFAYPLVISFFQETLLSLLTTHINRIIYAFVIY 421 >UniRef50_B9Y8A4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y8A4_9FIRM Length = 441 Score = 45.0 bits (105), Expect = 0.005, Method: Composition-based stats. 
Identities = 50/216 (23%), Positives = 79/216 (36%), Gaps = 21/216 (9%) Query: 193 QIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRGLPGGMAY-YLSMYLVSPI 251 I+F V+ S + ++ + + V +F L + YLS+YL +PI Sbjct: 225 IIIFWVMHSKKHKISFKQILLIILAASLVVGMFQTIGELLGRVSAADFGGYLSVYLSAPI 284 Query: 252 IAFQEFYFQQVSNSASSHVFWFFERLMGLLTGGVSMSL-HKEFVWVGL------PTNVYT 304 F SA F + L G + +S E V L NVYT Sbjct: 285 RNLDYFLN-NSFASADMFGKMTFYHAINYLGGKLEISSWIYELVLPPLRANGFVTGNVYT 343 Query: 305 AFSDYVYISAELSY-LMMVIHGCISGVLWRLSRNY---------ISVKIFYSYFIYTFSF 354 F Y+Y + + M + G IS + + ++N I YSY Y +F Sbjct: 344 TFYAYIYDFGYVGVPIFMFLMGVISQLFYIKTKNNTKYLQRDRINIWIIIYSYIFYMLAF 403 Query: 355 IFYHESFMTNI--SSWIQITLCIIVFSQFLKAQKIK 388 F+ F I + + + + FL+ KIK Sbjct: 404 SFFSNKFYEGIFSIQFFKYLIYWSLIKLFLENVKIK 439 >UniRef50_B8C4D4 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4D4_THAPS Length = 948 Score = 41.5 bits (96), Expect = 0.055, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 51/133 (38%), Gaps = 12/133 (9%) Query: 104 VISFSMIYICMRLSNYQFGTSLLSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSK 163 VI+ + + + R + ++ T+ + LIR A V T + +I Sbjct: 654 VIAGAFVALSGRFAEFEDMTTQSRWEALIRLALVAGTMESI------LIHLMATAICCFV 707 Query: 164 KFTNTKVSKTFTLLVFIVFIF---AIILNTGKQIVFMV---IISYAFIVGVNRVKHYVYL 217 K TK + T++ F+ IF ++L ++ + + ++ +N ++ L Sbjct: 708 KGVKTKTTTLATVIAFLEGIFHSVPLVLIECLILLTLFGPSVFELPYLTTMNPALIWMTL 767 Query: 218 ITAVGVLFSLYML 230 + V + L Sbjct: 768 LPISIVYVWFFRL 780 >UniRef50_B7V2X0 Putative uncharacterized protein n=1 Tax=Pseudomonas aeruginosa LESB58 RepID=B7V2X0_PSEA8 Length = 417 Score = 41.5 bits (96), Expect = 0.056, Method: Composition-based stats. 
Identities = 63/410 (15%), Positives = 150/410 (36%), Gaps = 30/410 (7%) Query: 3 YLVISVFLITAFICLYLKKDIFYPAVCVNIIFALVLLGYEITSDIYAFQLNDATLIFLLC 62 L + LI A L + +P+V + I + L L+ + S I +++ L+ L Sbjct: 4 MLTGATLLIFAVAARLLARSAIHPSVAMPITWGLGLIAVSLASLIGFYRVESDALLIFLF 63 Query: 63 NVLTFTLSCLLTESVLDLNIRKVNNAIYSIPSKKVHNVGLLVISFSMIYICMRLSNYQFG 122 V++F+LS + + R ++ + LVI F + +I Y+ Sbjct: 64 GVMSFSLSAGCFSFLYNGYFRAPSSNFLFDSELRTRA---LVIFFCLAHIVFLTVIYRDL 120 Query: 123 TSL------LSYMNLIRDADVEDTSRNFSAYMQPIILTTFALFIWSKKFTNTKVSKTFTL 176 +S+ +YM + E + S + T L + + K L Sbjct: 121 SSIAPTLREAAYMARAQSVSGEPVLSSLSMNYLQLGQTVIPLVVLL--YLRGKCGVLGFL 178 Query: 177 LVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNR----------VKHYVYLITAVGVLFS 226 + + ++ I+L +G+ + +++ FI + + + ++L+ AVG + + Sbjct: 179 AISVPWMGVILLASGRASLMQMLVGLFFIYILVKGSPSLKSLLVIGLAMFLVIAVGAVAT 238 Query: 227 LYMLFLRGLPGGMAYY-----LSMYLVSPIIAFQEFYFQQVSNSASSHVFWFFERLMGLL 281 + F G + ++ Y + + F +Y + F ++ + Sbjct: 239 SKIQFHEGDGISTLFIELYRHVAGYALQGPVLFDRYYQGSIHLEPYWSPLNGFCSILATV 298 Query: 282 TGGVSMSLHKE-FVWVG-LPTNVYTAFSDYVYISAELSYL-MMVIHGCISGVLWRLSRNY 338 LH + + + NVY+ F L + +M ++G + + ++ Sbjct: 299 GLCQKPPLHLDFYEYAPGELGNVYSMFFSMYPHYGALGVIGVMALYGMLCSYAYCKAKKG 358 Query: 339 ISVK-IFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFSQFLKAQKI 387 + SY F + + T+ ++++T+ + + + ++ Sbjct: 359 SLYFTVLSSYLFSAIVFSLFSDQISTSWWFYVKMTIILGILCFVFRRDRM 408 >UniRef50_C0D232 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D232_9CLOT Length = 478 Score = 41.1 bits (95), Expect = 0.071, Method: Composition-based stats. 
Identities = 36/225 (16%), Positives = 76/225 (33%), Gaps = 22/225 (9%) Query: 175 TLLVFIVFIFAIILNTGKQIVFMVIISYAFIVGVNRVKHYVYLITAVGVLFSLYMLFLRG 234 + I + ++ + Q++ V ++ + V R YL+ ++ Y+ Sbjct: 243 AAVTGISILIPLLCVSRFQLILAVGMAVFTFISVRRTFRLKYLLILCALMVPAYLALTVL 302 Query: 235 LPGGMAYYLS-----------------MYLVSPIIAFQEFYFQQVSNSASSHVFWFFERL 277 ++Y MY+ + F Q S+S + + L Sbjct: 303 RSHSVSYLNGIFEMKNPHTPIFVTQPYMYIANNYDNFNCLVEQLGSHSMGLRMLFPVWAL 362 Query: 278 MGLLTGGVSMSLHKEFVWVGLPTNVYTAFSDYVYISAELSY-LMMVIHGCISGVLWR-LS 335 GL S+ FV T V + Y L L ++ G + +L+R + Sbjct: 363 TGLKFLNPSLVSFPIFVTKEELTTVTLIYDAYY-DFGMLGILLFGLLTGAVCALLYRLRT 421 Query: 336 RNYISV-KIFYSYFIYTFSFIFYHESFMTNISSWIQITLCIIVFS 379 R V + Y+ + F+ ++ +N S+W + + + Sbjct: 422 RASNPVCHVIYAQIAMYMALAFFT-TWFSNPSTWFYLAVTAAAYW 465 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.330 0.140 0.357 Lambda K H 0.267 0.0422 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,366,930,490 Number of Sequences: 3077464 Number of extensions: 108054073 Number of successful extensions: 630751 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 23 Number of HSP's successfully gapped in prelim test: 627 Number of HSP's that attempted gapping in prelim test: 629636 Number of HSP's gapped (non-prelim): 1310 length of query: 388 length of database: 1,040,396,356 effective HSP length: 131 effective length of query: 257 effective length of database: 637,248,572 effective search space: 163772883004 effective search space used: 163772883004 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.4 bits) S2: 94 (40.8 bits)