BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (559 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_P37659 Uncharacterized protein yhjU n=118 Tax=Bacteria ...  1150   0.0
UniRef50_B5XN20 Cellulose biosynthesis protein BcsG n=11 Tax=Ent...   850   0.0
UniRef50_Q5DZ39 Predicted inner membrane protein n=11 Tax=Vibrio...   447   e-124
UniRef50_Q3IER9 Putative membrane protein ; putative endoglucana...   384   e-105
UniRef50_C5V4W5 Cellulose synthase operon protein YhjU n=1 Tax=G...   374   e-102
UniRef50_Q7NUM4 Putative uncharacterized protein n=1 Tax=Chromob...   348   2e-94
UniRef50_A4SY20 Putative uncharacterized protein n=1 Tax=Polynuc...   338   4e-91
UniRef50_B1Y241 Cellulose synthase operon protein YhjU n=1 Tax=L...   315   3e-84
UniRef50_B1JZP6 Cellulose synthase operon protein YhjU n=54 Tax=...   312   3e-83
UniRef50_D1T8Y6 Putative uncharacterized protein n=1 Tax=Burkhol...
305 3e-81 UniRef50_A6GMN7 Putative uncharacterized protein n=1 Tax=Limnoba... 280 1e-73 UniRef50_A6SYG8 Uncharacterized conserved protein n=1 Tax=Janthi... 235 3e-60 UniRef50_B8L0X3 Membrane protein ; endoglucanase BcsG n=5 Tax=Ga... 181 7e-44 UniRef50_B5WNI3 Putative uncharacterized protein n=1 Tax=Burkhol... 59 3e-07 >UniRef50_P37659 Uncharacterized protein yhjU n=118 Tax=Bacteria RepID=YHJU_ECOLI Length = 559 Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust. Identities = 559/559 (100%), Positives = 559/559 (100%) Query: 1 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR 60 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR Sbjct: 1 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR 60 Query: 61 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG 120 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG Sbjct: 61 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG 120 Query: 121 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA 180 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA Sbjct: 121 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA 180 Query: 181 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI 240 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI Sbjct: 181 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI 240 Query: 241 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ 300 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ Sbjct: 241 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ 300 Query: 301 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD 360 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD Sbjct: 301 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD 360 Query: 361 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE 420 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE Sbjct: 361 
GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE 420 Query: 421 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP 480 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP Sbjct: 421 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP 480 Query: 481 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ 540 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ Sbjct: 481 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ 540 Query: 541 YQDKPYVRLNGGDWVPYPQ 559 YQDKPYVRLNGGDWVPYPQ Sbjct: 541 YQDKPYVRLNGGDWVPYPQ 559 >UniRef50_B5XN20 Cellulose biosynthesis protein BcsG n=11 Tax=Enterobacteriaceae RepID=B5XN20_KLEP3 Length = 559 Score = 850 bits (2195), Expect = 0.0, Method: Compositional matrix adjust. Identities = 414/557 (74%), Positives = 469/557 (84%), Gaps = 3/557 (0%) Query: 5 TQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLH 64 T+ TA P LWQYWRGL GWNFYFLVKF LLWAGYLNFHP+LNLVF AFLL+P+PR LH Sbjct: 4 TKPTATPLPLWQYWRGLGGWNFYFLVKFALLWAGYLNFHPMLNLVFLAFLLVPIPREKLH 63 Query: 65 RLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFV 124 R+RHWIA+P+GFALFWHDTWLPGPE+++SQGSQ+AGFS Y+ DL+ RFINW M+GA FV Sbjct: 64 RIRHWIAIPLGFALFWHDTWLPGPETLLSQGSQIAGFSASYIWDLIVRFINWSMVGAFFV 123 Query: 125 LLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAA 184 LLV WLF+SQW+R+TVFV A+++WL V L P+F+LWPAGQPTT TT AA Sbjct: 124 LLVLWLFISQWLRVTVFVSAMVVWLAVSPLL-PAFTLWPAGQPTTAAATTAPANTGANAA 182 Query: 185 TGG--APVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINI 242 G +P D+P QT PPT+ANL WLN FY AE KRK+ FP LPADAQPF+LLVINI Sbjct: 183 AGTATSPASSDIPPQTEPPTSANLTNWLNGFYAAEQKRKTPFPDQLPADAQPFDLLVINI 242 Query: 243 CSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPA 302 CSLSWSDIEAAGLM HPLW HFDI FKNFNSATSYSGPAA+RLLRASCGQ SHTNLYQP+ Sbjct: 243 CSLSWSDIEAAGLMDHPLWKHFDIVFKNFNSATSYSGPAAVRLLRASCGQLSHTNLYQPS 302 Query: 303 NNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGS 362 DCYLF+NL+KLGF Q LM+GHNG FG FLKE+R GGMQS 
LMDQT LPV L FDGS Sbjct: 303 GADCYLFENLAKLGFNQQLMLGHNGLFGDFLKELRSLGGMQSPLMDQTGLPVSLQAFDGS 362 Query: 363 PVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDELD 422 PVY+D AVLNRWL E N RSATFYNTLPLHDGNH+PG SKTADYK RAQK FD+LD Sbjct: 363 PVYEDLAVLNRWLKTEEASNNPRSATFYNTLPLHDGNHFPGQSKTADYKVRAQKLFDDLD 422 Query: 423 AFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQ 482 FFTELEKSGRKVMVVVVPEHGGALKGD+MQVSGLRDIPSPSIT+VP VKFFGMKAPH+ Sbjct: 423 NFFTELEKSGRKVMVVVVPEHGGALKGDKMQVSGLRDIPSPSITNVPTAVKFFGMKAPHE 482 Query: 483 GAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQ 542 GAPI+I+QPSS+LA+S+LVVR LDGK+F+ED+V+W++ + LPQ+A VSEN+NA+VIQYQ Sbjct: 483 GAPIIIDQPSSYLAVSELVVRALDGKMFSEDSVNWQQYVANLPQSAAVSENANALVIQYQ 542 Query: 543 DKPYVRLNGGDWVPYPQ 559 KPYV+LNGG WVPYPQ Sbjct: 543 GKPYVQLNGGSWVPYPQ 559 >UniRef50_Q5DZ39 Predicted inner membrane protein n=11 Tax=Vibrionales RepID=Q5DZ39_VIBF1 Length = 542 Score = 447 bits (1150), Expect = e-124, Method: Compositional matrix adjust. Identities = 228/543 (41%), Positives = 332/543 (61%), Gaps = 18/543 (3%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL WN YF++K GL G ++FHP+ N AFLL+P+ L+ +R ++A+P G L Sbjct: 12 GLGWWNIYFIIKIGLFLQGIIDFHPIENFALVAFLLIPIRHKILNVIRQFLAVPFGLWLM 71 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 +D++LP + + SQ Q+ F+ YLI+L +RF++ + + +FVL+ A+ FL+Q RI+ Sbjct: 72 HYDSFLPPLDRLWSQMGQLLQFNLSYLIELASRFVSLETLLGLFVLVFAYYFLNQIFRIS 131 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 VFVV L+ ++L FS QP T + +A A V +T Sbjct: 132 VFVVITLI---AISLPSDLFS----SQPNTVANVSQQPESAETAQISDHQV-----DETG 179 Query: 200 PPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHP 259 P A LN F+ EA+R+ +FP++ P+ F+LL ++ICS++W DIE AGL SHP Sbjct: 180 PVNDAVLNNAKELFFRNEAQRRVSFPTTSPS--TDFDLLFLSICSVAWDDIEIAGLESHP 237 Query: 260 LWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ-PANNDCYLFDNLSKLGFT 318 L+ FD+ F NF++ATSYSGPA IRLLRASCGQ SH L++ PA+ C+LFDNL+KLGF Sbjct: 238 
LFKEFDVMFDNFSAATSYSGPAVIRLLRASCGQESHPELFKAPASKQCFLFDNLAKLGFQ 297 Query: 319 QHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVT 378 ++L++ H+G+F FL ++E+G +++ LM Q L FDGSP+Y D VLNRWLD Sbjct: 298 ENLLLNHDGKFDDFLGLLKEDGDLKAPLMSQAGLTQYQSAFDGSPIYRDKDVLNRWLDKR 357 Query: 379 EKDKNSRSATFYNTLPLHDGNHY---PGVSKTADYKARAQKFFDELDAFFTELEKSGRKV 435 EK ++ + YNT+ LHDGN G YK R + D+L FF EL+ S R + Sbjct: 358 EKSQDGPTVALYNTISLHDGNRIIKASGKVGLVSYKLRLKNLLDDLYDFFQELKASNRNI 417 Query: 436 MVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFL 495 +V++VPEHG ++GD+MQ++G+R+IPS +I PVG+K FG G+ I PSS+L Sbjct: 418 VVMLVPEHGAGMRGDKMQIAGMREIPSATIVHTPVGMKIFGQGMTRLGSTAHISAPSSYL 477 Query: 496 AISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 A+S LV R+++ I+ + +L LP T V++NS V++Y KPYV L+G W Sbjct: 478 AVSTLVSRIIEEDIYATKTFNTAELVKDLPTTKMVAQNSGTTVMEYNKKPYVSLDGTTWS 537 Query: 556 PYP 558 YP Sbjct: 538 EYP 540 >UniRef50_Q3IER9 Putative membrane protein ; putative endoglucanase BcsG n=2 Tax=Alteromonadales RepID=Q3IER9_PSEHT Length = 525 Score = 384 bits (987), Expect = e-105, Method: Compositional matrix adjust. 
Identities = 206/545 (37%), Positives = 309/545 (56%), Gaps = 30/545 (5%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL WN YFLVKF L + G + F L N AA + + +L+H+I L Sbjct: 5 GLGIWNLYFLVKFVLFYYGAIKFDFLSNAALAALFALTFSNSQVDKLKHFIGAVFAIVLL 64 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 + D+WLP + + Q + FS Y ++L R +N+ M+ +F++++ + + SQWIR T Sbjct: 65 YKDSWLPPIDRLTKQAGNIQDFSLGYFVELFGRIVNYDMLLGLFIIVICFWYTSQWIRFT 124 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 +A L+++ G +L P + A +V AP + TA Sbjct: 125 TVTIAGLIFI------GYQGALKP------------NDMAVSVQ---NAPNQEQEFSNTA 163 Query: 200 PPTTA--NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMS 257 T + + LN+FY +++ S FP + F+++V+NICSL+ +D+ A G+ Sbjct: 164 VVTQKLDSADQQLNDFYKQQSQLVSYFPDEY--NGTQFDVVVLNICSLAIADLNAIGVSL 221 Query: 258 HPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGF 317 ++S FDI F +FNSATSYSGPAAIRLLRASCGQTS L+ A C+LF+NL KLG+ Sbjct: 222 DDVYSDFDIVFSDFNSATSYSGPAAIRLLRASCGQTSQPALFDDAPEQCHLFNNLEKLGY 281 Query: 318 TQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDV 377 HL+M H+G F GF V++ G + S L D T+LPV FD P+Y D AVLN WL+ Sbjct: 282 DSHLVMNHDGHFDGFKDLVKKQGKLNSPLFDTTSLPVAQYSFDSKPIYSDEAVLNSWLE- 340 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPG---VSKTADYKARAQKFFDELDAFFTELEKSGRK 434 + D + A +YNT+ LHDGN ++ Y R Q F +++ F LEK GR Sbjct: 341 EQGDSCAPCAMYYNTISLHDGNQLANSRRMNSDESYPVRQQNLFSDINQFIKNLEKRGRN 400 Query: 435 VMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSF 494 VM+++VPEHG AL+GD++Q +GLR+IPSPSI VP +KF G P + I + SS+ Sbjct: 401 VMLMLVPEHGAALQGDKVQFAGLREIPSPSIVTVPAAIKFIGPDLPRM-SQITVANTSSY 459 Query: 495 LAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDW 554 A+S+L+ +V+ F + + +L S LP + V+EN+ +++ +PY++L+GG+W Sbjct: 460 FALSELITKVMKSNYFAGKSNNMAELVSQLPTSQKVAENAGTIMMYVNKRPYIQLDGGEW 519 Query: 555 VPYPQ 559 YP+ Sbjct: 520 TLYPR 524 >UniRef50_C5V4W5 Cellulose synthase operon protein YhjU n=1 Tax=Gallionella ferruginea ES-2 
RepID=C5V4W5_9PROT Length = 709 Score = 374 bits (960), Expect = e-102, Method: Compositional matrix adjust. Identities = 212/540 (39%), Positives = 302/540 (55%), Gaps = 40/540 (7%) Query: 24 WNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFWHDT 83 W +YF K L G + FHP+ NL+FAA LL+P+ L+RLR A + AL ++D+ Sbjct: 198 WGYYFAAKLALFGLGTITFHPMENLLFAALLLLPVSSRLLYRLRAIFATLLALALLYYDS 257 Query: 84 WLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVFVV 143 WLP +++Q SQ++ FS Y+++L RF +WQ G + L +A+ +S+ IR+ VF+V Sbjct: 258 WLPDIRRLITQASQLSDFSWAYVVELSGRFFSWQTTGLLLALSIAYWIVSRRIRVGVFIV 317 Query: 144 AILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAPPTT 203 L +L L G W NAA A V DM Sbjct: 318 -----LGMLMLWG-----WQ-------------NAARLSADKAILNVDLDM--------- 345 Query: 204 ANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPLWSH 263 N L +F+ EA+R F + DA PF+++ I++CSLSW D+ A GL HPLW Sbjct: 346 ---NKVLQDFFLKEAQRSILFVTP-QTDAVPFDVIFIHVCSLSWDDVRAVGLEDHPLWQR 401 Query: 264 FDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQHLMM 323 FDI K FNSA SYSGPAAI +RA CGQT H +Y + CYL ++L GF +L++ Sbjct: 402 FDILMKKFNSAASYSGPAAIHFMRAKCGQTEHGVMYTTVADKCYLMNSLQLSGFEPNLVL 461 Query: 324 GHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEKDKN 383 H+G+F FL +++++G + + M L V FD SPVYDD +VL+RWL + + Sbjct: 462 NHDGKFDDFLGQLKKHGRLNAPPMSLEGLEVAQHAFDRSPVYDDFSVLDRWLQTRQTSAS 521 Query: 384 SRSATFYNTLPLHDGNHYPGVSKTAD----YKARAQKFFDELDAFFTELEKSGRKVMVVV 439 SR A +YNT+ +HDGNH G ++D YK R KF DE D F +LE SGR+ +VV+ Sbjct: 522 SRVAMYYNTVSMHDGNHVSGADASSDTLVNYKNRLNKFLDETDRFLQKLETSGRRAVVVM 581 Query: 440 VPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFLAISD 499 VPEHG A++GD+ Q++GLR+IP+PSI+ VPVG+K G +G + I+QP+S+LAIS Sbjct: 582 VPEHGAAIRGDKRQIAGLREIPTPSISLVPVGIKMVGGGVQREGDALTIDQPTSYLAISH 641 Query: 500 LVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWVPYPQ 559 ++ R L+ F P T VS+N V + Q + Y+ W Y + Sbjct: 642 IIERTLEQSPFANGRFRSADYVQNYPHTRFVSQNETVTVAESQGQYYLSRGASQWDAYTE 701 >UniRef50_Q7NUM4 Putative 
uncharacterized protein n=1 Tax=Chromobacterium violaceum RepID=Q7NUM4_CHRVO Length = 526 Score = 348 bits (894), Expect = 2e-94, Method: Compositional matrix adjust. Identities = 213/544 (39%), Positives = 308/544 (56%), Gaps = 43/544 (7%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL GW+ YF++K L W G L+ HP +L FA LL+PL R L R +A P L Sbjct: 18 GLGGWSLYFILKLLLAWRGALSAHPAPDLAFALVLLLPLRRRWLRLARDALAWPAAAILL 77 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 ++D+WLP P ++ + S++ GFS YL++L R + ++ V+L +L LS+++R+ Sbjct: 78 YYDSWLPPPAALWRELSELKGFSLSYLMELAGRILTPTLLIGFTVVLAGYLLLSRFLRLG 137 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 V+A LL L+ + LW AA+ +A GGAP AQT Sbjct: 138 TLVMATLLALS-------AHELW----------QQRAPAASASSAFGGAPS-----AQTP 175 Query: 200 PPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQP-FELLVINICSLSWSDIEAAGLMSH 258 + L F+ +E +R+ F L A P F++L++++CSLSW D+ A G + Sbjct: 176 -------DQRLEGFFRSEQRRQVGFQGPL---ADPGFDVLLLHVCSLSWDDLRAVGFDNP 225 Query: 259 PLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFT 318 PL + FDI F FNSA SYSGPAA+R+LRASCGQ H+ LY+PA C+LF+NL+K GF Sbjct: 226 PLLARFDIVFDRFNSAASYSGPAALRVLRASCGQPRHSALYEPAPEQCFLFENLAKAGFK 285 Query: 319 QHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVT 378 L + H+G F FL+++R NG M L Q + V GFD SP+Y D A+LNRWL + Sbjct: 286 TELSLNHDGSFDSFLQQIRRNGRMNLPLTPQDGVAVGQRGFDSSPIYSDYAMLNRWLQLR 345 Query: 379 EKDKNSRSATFYNTLPLHDGNHY---PGVSKTADYKARAQKFFDELDAFFTELEKSGRKV 435 ++ + A +YNT+ LHDGN P + A YK RA + ++ F LE+ RK+ Sbjct: 346 LQEPDPHVAVYYNTISLHDGNRIPEAPALDTDASYKYRAGRLLRDIGQFIDLLEQDHRKM 405 Query: 436 MVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPI--VIEQPSS 493 ++++VPEHG AL+GD+ Q SGLR+IPSP IT VP +K G QG P + Q +S Sbjct: 406 ILLLVPEHGAALRGDKQQFSGLREIPSPLITTVPAAIKVIG----SQGRPAQYRVTQQAS 461 Query: 494 FLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGD 553 + AI+ ++ R+L F D + LP T VSEN+N V+Q + ++ +GG+ Sbjct: 462 
YTAIATILSRMLARSPFGAD-YQPESYAQDLPATPFVSENANFTVMQSGSRYLMQSSGGN 520 Query: 554 WVPY 557 W Y Sbjct: 521 WNDY 524 >UniRef50_A4SY20 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SY20_POLSQ Length = 514 Score = 338 bits (866), Expect = 4e-91, Method: Compositional matrix adjust. Identities = 191/538 (35%), Positives = 303/538 (56%), Gaps = 41/538 (7%) Query: 24 WNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFWHDT 83 W FYFL+K L + GY+NFH +NL FA L+ R L +++ W+A+PIG LF+ D+ Sbjct: 4 WAFYFLIKIILFYTGYINFHFFVNLAFALALIFSHARPRLLQIKRWVAIPIGIILFYFDS 63 Query: 84 WLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVFVV 143 LP +I+++ Q+ GFS +Y I+L+ R ++W+++ A+ V V + LS+ IR+T + Sbjct: 64 PLPPLRNIIAKLDQLLGFSFNYYIELLGRILDWRILVALAVSFVIYYALSKKIRMTT--I 121 Query: 144 AILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAPPTT 203 A+L +VL P S N++ + G V+G P+ Sbjct: 122 AMLAIFSVLL---PFHS----------------NSSMQAYDSDGQ-VIG-------IPSD 154 Query: 204 ANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPLWSH 263 A L L+ F+ E+ R + F S A Q F++LVIN+CSL+W D++ +PL+ Sbjct: 155 AVLTESLDGFFVEESGR-TGFNRSQKASGQAFDILVINVCSLAWDDLKYVKEEDNPLFKR 213 Query: 264 FDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQHLMM 323 F F +FNSA+SYSGP+ IRLLRAS GQ +LY+ D LF+NL GF + Sbjct: 214 FHYLFTSFNSASSYSGPSIIRLLRASRGQQDQRDLYKKPVEDSLLFNNLKTAGFQTQFAL 273 Query: 324 GHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEKDKN 383 H+G++G L+E+R +GG+ + L D L FDGS VY+D +VL+ W D K Sbjct: 274 NHDGKYGDLLQEIRTDGGLSAPLFDNKQATPYLRAFDGSQVYEDYSVLSNWWDARMKLPA 333 Query: 384 SRSATFYNTLPLHDGNHYPGVSKTAD----YKARAQKFFDELDAFFTELEKSGRKVMVVV 439 R A FYNT+ LHDGN ++ + Y R K +++D F+T++ SGR+V++V Sbjct: 334 ERVALFYNTITLHDGNRALDGARLENSVETYSRRLHKLLEDVDRFYTKVNNSGRQVVIVF 393 Query: 440 VPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIV---IEQPSSFLA 496 +PEHG A++ ++ ++ G+R+IPSP++T++PVG+ M +P I+QP+S+LA Sbjct: 394 IPEHGAAIRRNKNEIVGMREIPSPNVTNIPVGI----MLTNKSDSPFKTNRIDQPTSYLA 449 Query: 
497 ISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDW 554 S+L+ + + F + + LP T V+EN ++V++++ Y R N +W Sbjct: 450 TSELISKFVAKPPFGASTGNLEAYLKDLPSTRFVAENEDSVIMKFGASYYFRSNDINW 507 >UniRef50_B1Y241 Cellulose synthase operon protein YhjU n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y241_LEPCP Length = 529 Score = 315 bits (806), Expect = 3e-84, Method: Compositional matrix adjust. Identities = 188/542 (34%), Positives = 274/542 (50%), Gaps = 28/542 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + W+ YFLVK GL AG + LNL+FA L P R ++A P+ AL + Sbjct: 1 MGSWSLYFLVKLGLHVAGLIQLDVPLNLLFAVALAWPWAHPGWRRAWRFLAWPVAVALLY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 HD++ P I+SQ ++GFS YL++LV R IN Q++ A+ + W L Q +R+ Sbjct: 61 HDSFWPPATRILSQWQAISGFSFAYLVELVGRVINVQLLVAVAMGAALWWVLKQRLRLAT 120 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 +V L+ + L + GG AATG + Sbjct: 121 WVFVGLVAVAALP------------------SQHGGVTDLAQAATGADGTADRAADRATA 162 Query: 201 P--TTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSH 258 P L+ L FY+ E + P + F+LL++NICSLSW D+ AGL + Sbjct: 163 PLLDAGQLDQALQAFYDTERGKILRLPKD--GNVPGFDLLILNICSLSWDDLAFAGLRNA 220 Query: 259 PLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFT 318 P D+ F FNSA SYSGPA +RLL +CGQ + LY A DCYLF NL + G+ Sbjct: 221 PFMRRLDVVFDRFNSAASYSGPAVMRLLHGTCGQPAQHELYGGAVADCYLFRNLEQAGYR 280 Query: 319 QHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNR-WLDV 377 L++ H+G++ F E+R + G+ + + V + FDGSP+ DD L R W D Sbjct: 281 PALLLNHDGRYDNFSTELRRDSGLGLVPEQRFDAAVAMSSFDGSPIRDDGETLTRWWSDR 340 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPGV---SKTADYKARAQKFFDELDAFFTELEKSGRK 434 T A YN++ LHDGN PG+ S Y RA+K +L+ F +E SGR Sbjct: 341 TAAASGVPLAMLYNSITLHDGNRVPGIQSLSSLETYAPRARKLMADLERFAALVEASGRP 400 Query: 435 VMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGM--KAPHQGAPIVIEQPS 492 ++V+VPEHGGA++GD Q++GLR++P+P+IT VP GV GM + P+ +EQ S Sbjct: 401 
TVLVLVPEHGGAVRGDAQQIAGLRELPTPAITHVPAGVMLIGMGERRADGQEPVHVEQTS 460 Query: 493 SFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGG 552 S+L++ +V ++ G ++ LP V+EN +V++ YVR G Sbjct: 461 SYLSLFTVVAALMHGGPEVATPERLTEVAQALPPVEWVAENDKTIVLRRGPHTYVRDFEG 520 Query: 553 DW 554 W Sbjct: 521 RW 522 >UniRef50_B1JZP6 Cellulose synthase operon protein YhjU n=54 Tax=Burkholderiaceae RepID=B1JZP6_BURCC Length = 518 Score = 312 bits (799), Expect = 3e-83, Method: Compositional matrix adjust. Identities = 189/537 (35%), Positives = 281/537 (52%), Gaps = 30/537 (5%) Query: 24 WNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSL--HRLRHWIALPIGFALFWH 81 WN YF++KF L G L L NL FA L+ P S +R +A+ I L Sbjct: 4 WNLYFILKFALFATGRLQPFWLANLAFAVALVASAPIRSRAWRIVRQVVAVAIAVPLLAR 63 Query: 82 DTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVF 141 + P + +V+ F DY ++L+ R + + I +L+ + +++W+R+ F Sbjct: 64 ELHAPSLARLAEAAREVSTFRLDYWMELLPRLLPPVLALTIVGVLIVYFIVNRWLRVATF 123 Query: 142 VVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAPP 201 VVA+L+ + LW AG A VA GA V D P Sbjct: 124 VVAVLVVM----------PLWQAGSGLMARVVAPAQPQANVA---GATRV-DQPE----- 164 Query: 202 TTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPLW 261 + NA L F E++R+ F A F+++V++ICSLSW D++AA + +HP+ Sbjct: 165 ---DHNAALATFRAQESQRQVAFGHLGSDPAAQFDVIVLHICSLSWDDLDAAKVRNHPML 221 Query: 262 SHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQHL 321 SHFD F NF++A SYSGPAAIR+LRASCGQ +H +LY+PA C+LF L+ G+T Sbjct: 222 SHFDYLFTNFSTAASYSGPAAIRVLRASCGQEAHADLYKPAPQQCHLFSQLAGAGYTVQS 281 Query: 322 MMGHNGQFGGFLKEVRENGGM-QSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEK 380 ++ H+G F FL+ + +N G+ + ++ PV + FDGS + DD A L W Sbjct: 282 LLNHDGHFDNFLQVIHDNIGVADAPMISNAAAPVAMHAFDGSAIKDDYATLANWY-AQRA 340 Query: 381 DKNSRSATFYNTLPLHDGNHYPGVSKTA--DYKARAQKFFDELDAFFTELEKSGRKVMVV 438 A +YNT+ LHDGN G + T+ Y RA K + D + +SGR+ ++V Sbjct: 341 AVPGPVALYYNTISLHDGNRVVGSALTSIDSYPQRATKMMTDFDRLADLIAQSGRRAVIV 400 Query: 439 VVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFLAIS 498 VPEHG 
AL+GD+ Q++GLR+IP+P I PVGV+ G H GA VIEQP+SFLA++ Sbjct: 401 FVPEHGAALRGDKNQIAGLREIPTPRIVHGPVGVRLVGFTGNH-GATTVIEQPTSFLALA 459 Query: 499 DLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 L+ ++ F + + + LP+T + EN V +Q V+ G W+ Sbjct: 460 QLLSNLVSNSPF-KPGATLAQYAADLPRTRMIGENEGTVTMQTAAGYAVKTPDGVWI 515 >UniRef50_D1T8Y6 Putative uncharacterized protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T8Y6_9BURK Length = 523 Score = 305 bits (781), Expect = 3e-81, Method: Compositional matrix adjust. Identities = 191/541 (35%), Positives = 283/541 (52%), Gaps = 33/541 (6%) Query: 24 WNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMP--LPRYSLHRLRHWIALPIGFALFWH 81 WN YF++K L G+L L NL FA L++ L R +L +R+ L +G L +H Sbjct: 4 WNLYFILKLYLFAGGHLQPLWLANLGFALALVVTSTLRRRALRIVRNLAGLALGVPLVYH 63 Query: 82 DTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVF 141 + +P + + + FS Y ++L RF+ ++ A LV +L +++W+R+ F Sbjct: 64 EANVPPFSRLTEEFGNLTTFSYGYWLELAQRFLPPMLLLAALGALVGYLIVNRWVRVATF 123 Query: 142 VVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAPP 201 V+ L+ + LW GG A + P Sbjct: 124 VLIALV----------AIPLW----------HEGGVVLAQLRGASANAAANAGANGANNP 163 Query: 202 TTA---NLNAWLNNFYNAEAKRKSTFPSSLPADAQP-FELLVINICSLSWSDIEAAGLMS 257 A + NA L F E++R+ +F L AD F+++V++ICSLSW D++ A + Sbjct: 164 LAAQPLDHNAALAAFRTQESQRQVSF-GHLAADPNAQFDVIVLHICSLSWDDLDVAKARN 222 Query: 258 HPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGF 317 HPL S FD F NF++A SYSGPAAIR+LRASCGQ +H +LY+ A CYL +L++ G+ Sbjct: 223 HPLLSRFDYLFTNFSTAASYSGPAAIRVLRASCGQQAHADLYKNAPQQCYLLADLAQAGY 282 Query: 318 TQHLMMGHNGQFGGFLKEVRENGGMQS-ELMDQTNLPVILLGFDGSPVYDDTAVLNRWLD 376 T M+ H+G F FL+ + +N G+ + L+ T++PV + FDGSP+ DD L W Sbjct: 283 TPQTMLNHDGHFDNFLELIHDNAGVPNVPLIPNTSVPVAMHAFDGSPIRDDYETLAAWY- 341 Query: 377 VTEKDKNSRSATFYNTLPLHDGNHYPGVSKTA--DYKARAQKFFDELDAFFTELEKSGRK 434 A +YNT+ LHDGN P S T+ Y R K + D F + SGR+ Sbjct: 342 AQRASIAGPVALYYNTISLHDGNRLPNSSLTSIDSYPLRVNKLMSDFDRFADLIASSGRR 401 Query: 435 
VMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSF 494 ++V VPEHG AL+GD QV+GLR+IP+P I PVGV+ G + H G+ VI+ PSSF Sbjct: 402 AVIVFVPEHGAALRGDTNQVAGLREIPTPRIVHGPVGVRVVGFQGSH-GSTTVIDDPSSF 460 Query: 495 LAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDW 554 LA++ L+ ++ F + V + + LPQT V EN V ++ V+ G W Sbjct: 461 LALAQLLSNLVSNSPF-KPGVSLSQYATNLPQTQMVGENEGTVTMKTASGYVVKTPDGVW 519 Query: 555 V 555 V Sbjct: 520 V 520 >UniRef50_A6GMN7 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GMN7_9BURK Length = 567 Score = 280 bits (715), Expect = 1e-73, Method: Compositional matrix adjust. Identities = 188/556 (33%), Positives = 278/556 (50%), Gaps = 30/556 (5%) Query: 24 WNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLP-----RYSLHRLRHWIALPIGFAL 78 WN YFL+KFGL ++G L PL NL A L+ P + + LR+ + AL Sbjct: 4 WNLYFLIKFGLHFSGQLTLSPLWNLGLFALLIATNPAAYQNKQLMKVLRYLVFTGPAIAL 63 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFIN-WQMIGAIFVLLVAWLFLSQWIR 137 H+ L +++ Q + GFS DYL +L+ R I W + GA+ +V + L ++IR Sbjct: 64 LLHELGLVVSLALVDQIKALFGFSLDYLWELLKRTIQPWMLWGALLGFMVVRV-LDRYIR 122 Query: 138 ITVFV-VAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAA-----ATVAATGGAP-- 189 I+ +V ++ L + L P T + A A + A G Sbjct: 123 ISTWVGFGLVCILGIQAL--PFIQQQTQTNKTAALLEPQAQARENFSPAELLALGQVDFQ 180 Query: 190 ---VVGDMPAQTAPPTTA----NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINI 242 V D +QT A A L F++ + R P + F+++V+ I Sbjct: 181 KLRVARDQESQTKLRDNAYEGAGPGAVLAGFFDRQ--RNIALAPFSPVVSPDFDVIVLQI 238 Query: 243 CSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPA 302 CSLSW+D++ A HP D F+NFNSATSYSGPAAIRLLR CGQT+H LY A Sbjct: 239 CSLSWADLQYAKQSQHPTIRQADFVFENFNSATSYSGPAAIRLLRGKCGQTTHDALYSVA 298 Query: 303 NNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVREN-GGMQSELMDQTNLPVILLGFDG 361 NN C LF+ L +GF + + H+G+F F K V+ N GG +EL+ ++P + FDG Sbjct: 299 NNSCMLFEQLRNVGFEVEMGLNHDGRFQDFSKLVKTNLGGKATELVAHDDVPAGVQAFDG 358 Query: 362 SPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYP--GVSKTADYKARAQKFFD 419 S V D L W + + A +YN++ LHDGN P ++ Y R ++ + Sbjct: 359 
SRVGRDGDYLRAWWNKRIQQSGPAVAYYYNSITLHDGNRLPNSNLNSLNSYPLRLERMLN 418 Query: 420 ELDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPV-GVKFFGMK 478 ++ + E+ +S RK +VVVVPEHG L G+ Q+ GLR++P+P+IT VPV G Sbjct: 419 DIQSVLGEIRRSDRKALVVVVPEHGAGLTGEFGQLVGLRELPTPAITKVPVFGYWIAPGY 478 Query: 479 APHQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVV 538 P P+ ++Q S+ A+S+L+ R L + W L S LP T VS+ N V Sbjct: 479 TPASTGPVSVKQSVSYTALSELLARWLAQPAEQQQKPAWPVLLSDLPDTRFVSQQGNITV 538 Query: 539 IQYQDKPYVRLNGGDW 554 ++ Q +++ G W Sbjct: 539 MESQGSYWIKAPGAAW 554 >UniRef50_A6SYG8 Uncharacterized conserved protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6SYG8_JANMA Length = 507 Score = 235 bits (599), Expect = 3e-60, Method: Compositional matrix adjust. Identities = 158/539 (29%), Positives = 266/539 (49%), Gaps = 43/539 (7%) Query: 24 WNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFWHDT 83 W+ YF KF L + + F+ LNL+ A + + IA+ AL ++++ Sbjct: 4 WSLYFFAKFALYFNHAIKFNWYLNLLLAICVSFSFRHPRWRIAQQSIAIIAAIALLYYES 63 Query: 84 WLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVFVV 143 LP P+ ++++ + ++ +S YL++L RFIN + V+L ++ L+ +R + FV Sbjct: 64 SLPPPDRLLAEAANMSSYSFSYLLELFARFINLWYVVVFAVMLALYVLLAGRLRFSSFVF 123 Query: 144 AILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAPPTT 203 +L + +L G P + G Sbjct: 124 VGILLIPLLA---------QFGLPAQNLHLVAG--------------------------A 148 Query: 204 ANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPLWSH 263 + A L+ FY E +R+ + PS A Q F+++++ +LSW D+ + Sbjct: 149 QDPGAMLDTFYAQEKERRLSLPSIANAR-QAFDIVILQPGALSWDDLAFVDAPYPKFLNR 207 Query: 264 FDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQHLMM 323 FD+ NFNSATS+S AA RLLR SCGQ + L + C LF+NL + G+ +M Sbjct: 208 FDLVLLNFNSATSHSAAAAKRLLRGSCGQPGDSVLAEAPATGCSLFENLRQAGYDTAAVM 267 Query: 324 GHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEKDKN 383 HNG++ + + + G E + V + GFDG+PVYDD +VL++W + Sbjct: 268 NHNGRYNRYAETISAYAGTGVE--EGKWGSVAMHGFDGTPVYDDFSVLSKWWNKHHAHPG 325 Query: 384 
SRS-ATFYNTLPLHDGNHYPG---VSKTADYKARAQKFFDELDAFFTELEKSGRKVMVVV 439 + A +YN++ LHDGN PG ++ YK R K + D F T+LE S R V+V++ Sbjct: 326 GKPVALYYNSITLHDGNILPGPRAINSVQSYKPRLDKLLADFDHFVTQLEASNRPVVVIL 385 Query: 440 VPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQ-GAPIVIEQPSSFLAIS 498 +P HG A++GD++Q +G+R+IPSP +T VPV +K GM A + G P+ ++ +S+ + Sbjct: 386 MPAHGAAMRGDQLQAAGMREIPSPKLTLVPVAIKLIGMAAAKEAGPPLEVKHATSYFGVF 445 Query: 499 DLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWVPY 557 L+ ++ G+ + + +TA VSEN +V++ +D +R W+ Y Sbjct: 446 SLLADLMAGQANETAGKPLAERLQQVGETAFVSENEKIIVMRSKDGYIMRSAEDLWIKY 504 >UniRef50_B8L0X3 Membrane protein ; endoglucanase BcsG n=5 Tax=Gammaproteobacteria RepID=B8L0X3_9GAMM Length = 182 Score = 181 bits (459), Expect = 7e-44, Method: Compositional matrix adjust. Identities = 96/179 (53%), Positives = 118/179 (65%), Gaps = 7/179 (3%) Query: 387 ATFYNTLPLHDGNHYPGV---SKTADYKARAQKFFDELDAFFTELEKSGRKVMVVVVPEH 443 A FYNT+ LHDGN G S ADYKARA+ ++ F +EKSGR+ M+VVVPEH Sbjct: 2 ALFYNTISLHDGNRIVGADGRSNAADYKARAEMVLGDMAGFVDAVEKSGRRAMIVVVPEH 61 Query: 444 GGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFLAISDLVVR 503 G AL GDRMQ+ G+R+IPSPSIT VPVGVK GM AP G P I +PSS+LA+S+LV R Sbjct: 62 GAALHGDRMQIPGMREIPSPSITHVPVGVKLVGMGAPAAGGPRHIPEPSSYLAVSELVSR 121 Query: 504 V--LDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGD-WVPYPQ 559 V L+ + + +W L GLPQT VSEN A VI Y KP+++L G W PYP+ Sbjct: 122 VYALNAQSPPSER-NWDSLLKGLPQTPSVSENEGAKVIDYGGKPWLQLQGSQTWSPYPE 179 >UniRef50_B5WNI3 Putative uncharacterized protein n=1 Tax=Burkholderia sp. H160 RepID=B5WNI3_9BURK Length = 163 Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 46/131 (35%), Positives = 69/131 (52%), Gaps = 14/131 (10%) Query: 24 WNFYFLVKFGLLWAGYLNFHPL--LNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFWH 81 WNFYF+ K L AG PL LNL+FA LL+P L +R+ +A+ IG AL + Sbjct: 4 WNFYFIAKLYL--AGIGKLQPLWWLNLLFAIALLVPFGDRRLRVVRNLVAVVIGIALLYF 61 Query: 82 DTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVF 141 + L P M+ A + + L+ R + + A+ L VA+ +S+W+R+T F Sbjct: 62 E--LGEPAFSMAA----AHLPQQHALALIVRIVPLSTLVALATLFVAYYVVSRWVRVTTF 115 Query: 142 VV----AILLW 148 V+ AIL+W Sbjct: 116 VLISLFAILVW 126 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37659 Uncharacterized protein yhjU n=118 Tax=Bacteria ... 825 0.0 UniRef50_B5XN20 Cellulose biosynthesis protein BcsG n=11 Tax=Ent... 777 0.0 UniRef50_Q5DZ39 Predicted inner membrane protein n=11 Tax=Vibrio... 703 0.0 UniRef50_Q3IER9 Putative membrane protein ; putative endoglucana... 647 0.0 UniRef50_A4SY20 Putative uncharacterized protein n=1 Tax=Polynuc... 639 0.0 UniRef50_C5V4W5 Cellulose synthase operon protein YhjU n=1 Tax=G... 632 e-179 UniRef50_B1Y241 Cellulose synthase operon protein YhjU n=1 Tax=L... 629 e-178 UniRef50_D1T8Y6 Putative uncharacterized protein n=1 Tax=Burkhol... 628 e-178 UniRef50_B1JZP6 Cellulose synthase operon protein YhjU n=54 Tax=... 627 e-178 UniRef50_Q7NUM4 Putative uncharacterized protein n=1 Tax=Chromob... 618 e-175 UniRef50_A6GMN7 Putative uncharacterized protein n=1 Tax=Limnoba... 599 e-170 UniRef50_A6SYG8 Uncharacterized conserved protein n=1 Tax=Janthi... 563 e-159 UniRef50_B8L0X3 Membrane protein ; endoglucanase BcsG n=5 Tax=Ga... 243 9e-63 UniRef50_B5WNI3 Putative uncharacterized protein n=1 Tax=Burkhol... 130 2e-28 Sequences not found previously or not previously below threshold: UniRef50_UPI00016ABB33 hypothetical protein Bpseu9_34329 n=5 Tax... 
107 1e-21 UniRef50_Q3A304 Putative uncharacterized protein n=1 Tax=Pelobac... 44 0.015 UniRef50_D1PM20 Putative uncharacterized protein n=1 Tax=Subdoli... 43 0.028 >UniRef50_P37659 Uncharacterized protein yhjU n=118 Tax=Bacteria RepID=YHJU_ECOLI Length = 559 Score = 825 bits (2132), Expect = 0.0, Method: Composition-based stats. Identities = 559/559 (100%), Positives = 559/559 (100%) Query: 1 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR 60 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR Sbjct: 1 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR 60 Query: 61 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG 120 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG Sbjct: 61 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG 120 Query: 121 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA 180 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA Sbjct: 121 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA 180 Query: 181 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI 240 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI Sbjct: 181 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI 240 Query: 241 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ 300 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ Sbjct: 241 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ 300 Query: 301 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD 360 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD Sbjct: 301 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD 360 Query: 361 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE 420 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE Sbjct: 361 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE 420 Query: 421 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP 480 
LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP Sbjct: 421 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP 480 Query: 481 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ 540 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ Sbjct: 481 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ 540 Query: 541 YQDKPYVRLNGGDWVPYPQ 559 YQDKPYVRLNGGDWVPYPQ Sbjct: 541 YQDKPYVRLNGGDWVPYPQ 559 >UniRef50_B5XN20 Cellulose biosynthesis protein BcsG n=11 Tax=Enterobacteriaceae RepID=B5XN20_KLEP3 Length = 559 Score = 777 bits (2006), Expect = 0.0, Method: Composition-based stats. Identities = 414/557 (74%), Positives = 469/557 (84%), Gaps = 3/557 (0%) Query: 5 TQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLH 64 T+ TA P LWQYWRGL GWNFYFLVKF LLWAGYLNFHP+LNLVF AFLL+P+PR LH Sbjct: 4 TKPTATPLPLWQYWRGLGGWNFYFLVKFALLWAGYLNFHPMLNLVFLAFLLVPIPREKLH 63 Query: 65 RLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFV 124 R+RHWIA+P+GFALFWHDTWLPGPE+++SQGSQ+AGFS Y+ DL+ RFINW M+GA FV Sbjct: 64 RIRHWIAIPLGFALFWHDTWLPGPETLLSQGSQIAGFSASYIWDLIVRFINWSMVGAFFV 123 Query: 125 LLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAA 184 LLV WLF+SQW+R+TVFV A+++WL V L P+F+LWPAGQPTT TT AA Sbjct: 124 LLVLWLFISQWLRVTVFVSAMVVWLAVSPLL-PAFTLWPAGQPTTAAATTAPANTGANAA 182 Query: 185 TGG--APVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINI 242 G +P D+P QT PPT+ANL WLN FY AE KRK+ FP LPADAQPF+LLVINI Sbjct: 183 AGTATSPASSDIPPQTEPPTSANLTNWLNGFYAAEQKRKTPFPDQLPADAQPFDLLVINI 242 Query: 243 CSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPA 302 CSLSWSDIEAAGLM HPLW HFDI FKNFNSATSYSGPAA+RLLRASCGQ SHTNLYQP+ Sbjct: 243 CSLSWSDIEAAGLMDHPLWKHFDIVFKNFNSATSYSGPAAVRLLRASCGQLSHTNLYQPS 302 Query: 303 NNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGS 362 DCYLF+NL+KLGF Q LM+GHNG FG FLKE+R GGMQS LMDQT LPV L FDGS Sbjct: 303 GADCYLFENLAKLGFNQQLMLGHNGLFGDFLKELRSLGGMQSPLMDQTGLPVSLQAFDGS 362 Query: 363 
PVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDELD 422 PVY+D AVLNRWL E N RSATFYNTLPLHDGNH+PG SKTADYK RAQK FD+LD Sbjct: 363 PVYEDLAVLNRWLKTEEASNNPRSATFYNTLPLHDGNHFPGQSKTADYKVRAQKLFDDLD 422 Query: 423 AFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQ 482 FFTELEKSGRKVMVVVVPEHGGALKGD+MQVSGLRDIPSPSIT+VP VKFFGMKAPH+ Sbjct: 423 NFFTELEKSGRKVMVVVVPEHGGALKGDKMQVSGLRDIPSPSITNVPTAVKFFGMKAPHE 482 Query: 483 GAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQ 542 GAPI+I+QPSS+LA+S+LVVR LDGK+F+ED+V+W++ + LPQ+A VSEN+NA+VIQYQ Sbjct: 483 GAPIIIDQPSSYLAVSELVVRALDGKMFSEDSVNWQQYVANLPQSAAVSENANALVIQYQ 542 Query: 543 DKPYVRLNGGDWVPYPQ 559 KPYV+LNGG WVPYPQ Sbjct: 543 GKPYVQLNGGSWVPYPQ 559 >UniRef50_Q5DZ39 Predicted inner membrane protein n=11 Tax=Vibrionales RepID=Q5DZ39_VIBF1 Length = 542 Score = 703 bits (1815), Expect = 0.0, Method: Composition-based stats. Identities = 228/543 (41%), Positives = 332/543 (61%), Gaps = 18/543 (3%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL WN YF++K GL G ++FHP+ N AFLL+P+ L+ +R ++A+P G L Sbjct: 12 GLGWWNIYFIIKIGLFLQGIIDFHPIENFALVAFLLIPIRHKILNVIRQFLAVPFGLWLM 71 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 +D++LP + + SQ Q+ F+ YLI+L +RF++ + + +FVL+ A+ FL+Q RI+ Sbjct: 72 HYDSFLPPLDRLWSQMGQLLQFNLSYLIELASRFVSLETLLGLFVLVFAYYFLNQIFRIS 131 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 VFVV L+ ++ L FS QP T + +A A V +T Sbjct: 132 VFVVITLIAIS---LPSDLFS----SQPNTVANVSQQPESAETAQISDHQV-----DETG 179 Query: 200 PPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHP 259 P A LN F+ EA+R+ +FP++ P+ F+LL ++ICS++W DIE AGL SHP Sbjct: 180 PVNDAVLNNAKELFFRNEAQRRVSFPTTSPS--TDFDLLFLSICSVAWDDIEIAGLESHP 237 Query: 260 LWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ-PANNDCYLFDNLSKLGFT 318 L+ FD+ F NF++ATSYSGPA IRLLRASCGQ SH L++ PA+ C+LFDNL+KLGF Sbjct: 238 LFKEFDVMFDNFSAATSYSGPAVIRLLRASCGQESHPELFKAPASKQCFLFDNLAKLGFQ 297 Query: 319 
QHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVT 378 ++L++ H+G+F FL ++E+G +++ LM Q L FDGSP+Y D VLNRWLD Sbjct: 298 ENLLLNHDGKFDDFLGLLKEDGDLKAPLMSQAGLTQYQSAFDGSPIYRDKDVLNRWLDKR 357 Query: 379 EKDKNSRSATFYNTLPLHDGNHY---PGVSKTADYKARAQKFFDELDAFFTELEKSGRKV 435 EK ++ + YNT+ LHDGN G YK R + D+L FF EL+ S R + Sbjct: 358 EKSQDGPTVALYNTISLHDGNRIIKASGKVGLVSYKLRLKNLLDDLYDFFQELKASNRNI 417 Query: 436 MVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFL 495 +V++VPEHG ++GD+MQ++G+R+IPS +I PVG+K FG G+ I PSS+L Sbjct: 418 VVMLVPEHGAGMRGDKMQIAGMREIPSATIVHTPVGMKIFGQGMTRLGSTAHISAPSSYL 477 Query: 496 AISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 A+S LV R+++ I+ + +L LP T V++NS V++Y KPYV L+G W Sbjct: 478 AVSTLVSRIIEEDIYATKTFNTAELVKDLPTTKMVAQNSGTTVMEYNKKPYVSLDGTTWS 537 Query: 556 PYP 558 YP Sbjct: 538 EYP 540 >UniRef50_Q3IER9 Putative membrane protein ; putative endoglucanase BcsG n=2 Tax=Alteromonadales RepID=Q3IER9_PSEHT Length = 525 Score = 647 bits (1668), Expect = 0.0, Method: Composition-based stats. 
Identities = 204/545 (37%), Positives = 306/545 (56%), Gaps = 30/545 (5%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL WN YFLVKF L + G + F L N AA + + +L+H+I L Sbjct: 5 GLGIWNLYFLVKFVLFYYGAIKFDFLSNAALAALFALTFSNSQVDKLKHFIGAVFAIVLL 64 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 + D+WLP + + Q + FS Y ++L R +N+ M+ +F++++ + + SQWIR T Sbjct: 65 YKDSWLPPIDRLTKQAGNIQDFSLGYFVELFGRIVNYDMLLGLFIIVICFWYTSQWIRFT 124 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 +A L+++ P + A +V AP + TA Sbjct: 125 TVTIAGLIFIGYQGALKP------------------NDMAVSVQ---NAPNQEQEFSNTA 163 Query: 200 PPTTA--NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMS 257 T + + LN+FY +++ S FP + F+++V+NICSL+ +D+ A G+ Sbjct: 164 VVTQKLDSADQQLNDFYKQQSQLVSYFPDEY--NGTQFDVVVLNICSLAIADLNAIGVSL 221 Query: 258 HPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGF 317 ++S FDI F +FNSATSYSGPAAIRLLRASCGQTS L+ A C+LF+NL KLG+ Sbjct: 222 DDVYSDFDIVFSDFNSATSYSGPAAIRLLRASCGQTSQPALFDDAPEQCHLFNNLEKLGY 281 Query: 318 TQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDV 377 HL+M H+G F GF V++ G + S L D T+LPV FD P+Y D AVLN WL+ Sbjct: 282 DSHLVMNHDGHFDGFKDLVKKQGKLNSPLFDTTSLPVAQYSFDSKPIYSDEAVLNSWLE- 340 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPG---VSKTADYKARAQKFFDELDAFFTELEKSGRK 434 + D + A +YNT+ LHDGN ++ Y R Q F +++ F LEK GR Sbjct: 341 EQGDSCAPCAMYYNTISLHDGNQLANSRRMNSDESYPVRQQNLFSDINQFIKNLEKRGRN 400 Query: 435 VMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSF 494 VM+++VPEHG AL+GD++Q +GLR+IPSPSI VP +KF G P + I + SS+ Sbjct: 401 VMLMLVPEHGAALQGDKVQFAGLREIPSPSIVTVPAAIKFIGPDLPRM-SQITVANTSSY 459 Query: 495 LAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDW 554 A+S+L+ +V+ F + + +L S LP + V+EN+ +++ +PY++L+GG+W Sbjct: 460 FALSELITKVMKSNYFAGKSNNMAELVSQLPTSQKVAENAGTIMMYVNKRPYIQLDGGEW 519 Query: 555 VPYPQ 559 YP+ Sbjct: 520 TLYPR 524 >UniRef50_A4SY20 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. 
asymbioticus QLW-P1DMWA-1 RepID=A4SY20_POLSQ Length = 514 Score = 639 bits (1647), Expect = 0.0, Method: Composition-based stats. Identities = 189/544 (34%), Positives = 301/544 (55%), Gaps = 41/544 (7%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + W FYFL+K L + GY+NFH +NL FA L+ R L +++ W+A+PIG LF+ Sbjct: 1 MGLWAFYFLIKIILFYTGYINFHFFVNLAFALALIFSHARPRLLQIKRWVAIPIGIILFY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 D+ LP +I+++ Q+ GFS +Y I+L+ R ++W+++ A+ V V + LS+ IR+T Sbjct: 61 FDSPLPPLRNIIAKLDQLLGFSFNYYIELLGRILDWRILVALAVSFVIYYALSKKIRMTT 120 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 +A+L +VL P S N++ + G + Sbjct: 121 --IAMLAIFSVLL---PFHS----------------NSSMQAYDSDGQVI--------GI 151 Query: 201 PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPL 260 P+ A L L+ F+ E+ R + F S A Q F++LVIN+CSL+W D++ +PL Sbjct: 152 PSDAVLTESLDGFFVEESGR-TGFNRSQKASGQAFDILVINVCSLAWDDLKYVKEEDNPL 210 Query: 261 WSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQH 320 + F F +FNSA+SYSGP+ IRLLRAS GQ +LY+ D LF+NL GF Sbjct: 211 FKRFHYLFTSFNSASSYSGPSIIRLLRASRGQQDQRDLYKKPVEDSLLFNNLKTAGFQTQ 270 Query: 321 LMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEK 380 + H+G++G L+E+R +GG+ + L D L FDGS VY+D +VL+ W D K Sbjct: 271 FALNHDGKYGDLLQEIRTDGGLSAPLFDNKQATPYLRAFDGSQVYEDYSVLSNWWDARMK 330 Query: 381 DKNSRSATFYNTLPLHDGNHYPGV----SKTADYKARAQKFFDELDAFFTELEKSGRKVM 436 R A FYNT+ LHDGN + Y R K +++D F+T++ SGR+V+ Sbjct: 331 LPAERVALFYNTITLHDGNRALDGARLENSVETYSRRLHKLLEDVDRFYTKVNNSGRQVV 390 Query: 437 VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPI---VIEQPSS 493 +V +PEHG A++ ++ ++ G+R+IPSP++T++PVG+ M +P I+QP+S Sbjct: 391 IVFIPEHGAAIRRNKNEIVGMREIPSPNVTNIPVGI----MLTNKSDSPFKTNRIDQPTS 446 Query: 494 FLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGD 553 +LA S+L+ + + F + + LP T V+EN ++V++++ Y R N + Sbjct: 447 YLATSELISKFVAKPPFGASTGNLEAYLKDLPSTRFVAENEDSVIMKFGASYYFRSNDIN 506 Query: 554 WVPY 557 W + 
Sbjct: 507 WNLF 510 >UniRef50_C5V4W5 Cellulose synthase operon protein YhjU n=1 Tax=Gallionella ferruginea ES-2 RepID=C5V4W5_9PROT Length = 709 Score = 632 bits (1630), Expect = e-179, Method: Composition-based stats. Identities = 213/543 (39%), Positives = 303/543 (55%), Gaps = 40/543 (7%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 L W +YF K L G + FHP+ NL+FAA LL+P+ L+RLR A + AL + Sbjct: 195 LGLWGYYFAAKLALFGLGTITFHPMENLLFAALLLLPVSSRLLYRLRAIFATLLALALLY 254 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 +D+WLP +++Q SQ++ FS Y+++L RF +WQ G + L +A+ +S+ IR+ V Sbjct: 255 YDSWLPDIRRLITQASQLSDFSWAYVVELSGRFFSWQTTGLLLALSIAYWIVSRRIRVGV 314 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 F+V L +L L G W NAA A V DM Sbjct: 315 FIV-----LGMLMLWG-----WQ-------------NAARLSADKAILNVDLDM------ 345 Query: 201 PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPL 260 N L +F+ EA+R F + DA PF+++ I++CSLSW D+ A GL HPL Sbjct: 346 ------NKVLQDFFLKEAQRSILFVTP-QTDAVPFDVIFIHVCSLSWDDVRAVGLEDHPL 398 Query: 261 WSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQH 320 W FDI K FNSA SYSGPAAI +RA CGQT H +Y + CYL ++L GF + Sbjct: 399 WQRFDILMKKFNSAASYSGPAAIHFMRAKCGQTEHGVMYTTVADKCYLMNSLQLSGFEPN 458 Query: 321 LMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEK 380 L++ H+G+F FL +++++G + + M L V FD SPVYDD +VL+RWL + Sbjct: 459 LVLNHDGKFDDFLGQLKKHGRLNAPPMSLEGLEVAQHAFDRSPVYDDFSVLDRWLQTRQT 518 Query: 381 DKNSRSATFYNTLPLHDGNHYPGVSKTAD----YKARAQKFFDELDAFFTELEKSGRKVM 436 +SR A +YNT+ +HDGNH G ++D YK R KF DE D F +LE SGR+ + Sbjct: 519 SASSRVAMYYNTVSMHDGNHVSGADASSDTLVNYKNRLNKFLDETDRFLQKLETSGRRAV 578 Query: 437 VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFLA 496 VV+VPEHG A++GD+ Q++GLR+IP+PSI+ VPVG+K G +G + I+QP+S+LA Sbjct: 579 VVMVPEHGAAIRGDKRQIAGLREIPTPSISLVPVGIKMVGGGVQREGDALTIDQPTSYLA 638 Query: 497 ISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWVP 556 IS ++ R L+ F P T VS+N V + Q + Y+ W 
Sbjct: 639 ISHIIERTLEQSPFANGRFRSADYVQNYPHTRFVSQNETVTVAESQGQYYLSRGASQWDA 698 Query: 557 YPQ 559 Y + Sbjct: 699 YTE 701 >UniRef50_B1Y241 Cellulose synthase operon protein YhjU n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y241_LEPCP Length = 529 Score = 629 bits (1621), Expect = e-178, Method: Composition-based stats. Identities = 188/542 (34%), Positives = 273/542 (50%), Gaps = 28/542 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + W+ YFLVK GL AG + LNL+FA L P R ++A P+ AL + Sbjct: 1 MGSWSLYFLVKLGLHVAGLIQLDVPLNLLFAVALAWPWAHPGWRRAWRFLAWPVAVALLY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 HD++ P I+SQ ++GFS YL++LV R IN Q++ A+ + W L Q +R+ Sbjct: 61 HDSFWPPATRILSQWQAISGFSFAYLVELVGRVINVQLLVAVAMGAALWWVLKQRLRLAT 120 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 +V L+ + L + GG AATG + Sbjct: 121 WVFVGLVAVAALP------------------SQHGGVTDLAQAATGADGTADRAADRATA 162 Query: 201 P--TTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSH 258 P L+ L FY+ E + P + F+LL++NICSLSW D+ AGL + Sbjct: 163 PLLDAGQLDQALQAFYDTERGKILRLPKD--GNVPGFDLLILNICSLSWDDLAFAGLRNA 220 Query: 259 PLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFT 318 P D+ F FNSA SYSGPA +RLL +CGQ + LY A DCYLF NL + G+ Sbjct: 221 PFMRRLDVVFDRFNSAASYSGPAVMRLLHGTCGQPAQHELYGGAVADCYLFRNLEQAGYR 280 Query: 319 QHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWL-DV 377 L++ H+G++ F E+R + G+ + + V + FDGSP+ DD L RW D Sbjct: 281 PALLLNHDGRYDNFSTELRRDSGLGLVPEQRFDAAVAMSSFDGSPIRDDGETLTRWWSDR 340 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPGV---SKTADYKARAQKFFDELDAFFTELEKSGRK 434 T A YN++ LHDGN PG+ S Y RA+K +L+ F +E SGR Sbjct: 341 TAAASGVPLAMLYNSITLHDGNRVPGIQSLSSLETYAPRARKLMADLERFAALVEASGRP 400 Query: 435 VMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMK--APHQGAPIVIEQPS 492 ++V+VPEHGGA++GD Q++GLR++P+P+IT VP GV GM P+ +EQ S Sbjct: 401 TVLVLVPEHGGAVRGDAQQIAGLRELPTPAITHVPAGVMLIGMGERRADGQEPVHVEQTS 460 Query: 493 
SFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGG 552 S+L++ +V ++ G ++ LP V+EN +V++ YVR G Sbjct: 461 SYLSLFTVVAALMHGGPEVATPERLTEVAQALPPVEWVAENDKTIVLRRGPHTYVRDFEG 520 Query: 553 DW 554 W Sbjct: 521 RW 522 >UniRef50_D1T8Y6 Putative uncharacterized protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T8Y6_9BURK Length = 523 Score = 628 bits (1620), Expect = e-178, Method: Composition-based stats. Identities = 188/544 (34%), Positives = 281/544 (51%), Gaps = 31/544 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMP--LPRYSLHRLRHWIALPIGFAL 78 ++ WN YF++K L G+L L NL FA L++ L R +L +R+ L +G L Sbjct: 1 MTLWNLYFILKLYLFAGGHLQPLWLANLGFALALVVTSTLRRRALRIVRNLAGLALGVPL 60 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRI 138 +H+ +P + + + FS Y ++L RF+ ++ A LV +L +++W+R+ Sbjct: 61 VYHEANVPPFSRLTEEFGNLTTFSYGYWLELAQRFLPPMLLLAALGALVGYLIVNRWVRV 120 Query: 139 TVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQT 198 FV+ L+ + LW GG A + Sbjct: 121 ATFVLIALVAI----------PLW----------HEGGVVLAQLRGASANAAANAGANGA 160 Query: 199 APPTTA---NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGL 255 P A + NA L F E++R+ +F F+++V++ICSLSW D++ A Sbjct: 161 NNPLAAQPLDHNAALAAFRTQESQRQVSFGHLAADPNAQFDVIVLHICSLSWDDLDVAKA 220 Query: 256 MSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKL 315 +HPL S FD F NF++A SYSGPAAIR+LRASCGQ +H +LY+ A CYL +L++ Sbjct: 221 RNHPLLSRFDYLFTNFSTAASYSGPAAIRVLRASCGQQAHADLYKNAPQQCYLLADLAQA 280 Query: 316 GFTQHLMMGHNGQFGGFLKEVRENGGM-QSELMDQTNLPVILLGFDGSPVYDDTAVLNRW 374 G+T M+ H+G F FL+ + +N G+ L+ T++PV + FDGSP+ DD L W Sbjct: 281 GYTPQTMLNHDGHFDNFLELIHDNAGVPNVPLIPNTSVPVAMHAFDGSPIRDDYETLAAW 340 Query: 375 LDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTA--DYKARAQKFFDELDAFFTELEKSG 432 A +YNT+ LHDGN P S T+ Y R K + D F + SG Sbjct: 341 Y-AQRASIAGPVALYYNTISLHDGNRLPNSSLTSIDSYPLRVNKLMSDFDRFADLIASSG 399 Query: 433 RKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPS 492 R+ ++V VPEHG AL+GD QV+GLR+IP+P I PVGV+ G + H G+ VI+ PS Sbjct: 400 
RRAVIVFVPEHGAALRGDTNQVAGLREIPTPRIVHGPVGVRVVGFQGSH-GSTTVIDDPS 458 Query: 493 SFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGG 552 SFLA++ L+ ++ F + V + + LPQT V EN V ++ V+ G Sbjct: 459 SFLALAQLLSNLVSNSPF-KPGVSLSQYATNLPQTQMVGENEGTVTMKTASGYVVKTPDG 517 Query: 553 DWVP 556 WV Sbjct: 518 VWVD 521 >UniRef50_B1JZP6 Cellulose synthase operon protein YhjU n=54 Tax=Burkholderiaceae RepID=B1JZP6_BURCC Length = 518 Score = 627 bits (1616), Expect = e-178, Method: Composition-based stats. Identities = 184/541 (34%), Positives = 280/541 (51%), Gaps = 30/541 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLM--PLPRYSLHRLRHWIALPIGFAL 78 ++ WN YF++KF L G L L NL FA L+ P+ + +R +A+ I L Sbjct: 1 MTFWNLYFILKFALFATGRLQPFWLANLAFAVALVASAPIRSRAWRIVRQVVAVAIAVPL 60 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRI 138 + P + +V+ F DY ++L+ R + + I +L+ + +++W+R+ Sbjct: 61 LARELHAPSLARLAEAAREVSTFRLDYWMELLPRLLPPVLALTIVGVLIVYFIVNRWLRV 120 Query: 139 TVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQT 198 FVVA+L+ + LW AG A VA T Sbjct: 121 ATFVVAVLVVM----------PLWQAGSGLMARVVAPAQPQANVAGA------------T 158 Query: 199 APPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSH 258 + NA L F E++R+ F A F+++V++ICSLSW D++AA + +H Sbjct: 159 RVDQPEDHNAALATFRAQESQRQVAFGHLGSDPAAQFDVIVLHICSLSWDDLDAAKVRNH 218 Query: 259 PLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFT 318 P+ SHFD F NF++A SYSGPAAIR+LRASCGQ +H +LY+PA C+LF L+ G+T Sbjct: 219 PMLSHFDYLFTNFSTAASYSGPAAIRVLRASCGQEAHADLYKPAPQQCHLFSQLAGAGYT 278 Query: 319 QHLMMGHNGQFGGFLKEVRENGGM-QSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDV 377 ++ H+G F FL+ + +N G+ + ++ PV + FDGS + DD A L W Sbjct: 279 VQSLLNHDGHFDNFLQVIHDNIGVADAPMISNAAAPVAMHAFDGSAIKDDYATLANWY-A 337 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPGVSKTA--DYKARAQKFFDELDAFFTELEKSGRKV 435 A +YNT+ LHDGN G + T+ Y RA K + D + +SGR+ Sbjct: 338 QRAAVPGPVALYYNTISLHDGNRVVGSALTSIDSYPQRATKMMTDFDRLADLIAQSGRRA 397 Query: 436 MVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFL 495 ++V 
VPEHG AL+GD+ Q++GLR+IP+P I PVGV+ G H GA VIEQP+SFL Sbjct: 398 VIVFVPEHGAALRGDKNQIAGLREIPTPRIVHGPVGVRLVGFTGNH-GATTVIEQPTSFL 456 Query: 496 AISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 A++ L+ ++ F + + + LP+T + EN V +Q V+ G W+ Sbjct: 457 ALAQLLSNLVSNSPF-KPGATLAQYAADLPRTRMIGENEGTVTMQTAAGYAVKTPDGVWI 515 Query: 556 P 556 Sbjct: 516 D 516 >UniRef50_Q7NUM4 Putative uncharacterized protein n=1 Tax=Chromobacterium violaceum RepID=Q7NUM4_CHRVO Length = 526 Score = 618 bits (1594), Expect = e-175, Method: Composition-based stats. Identities = 204/543 (37%), Positives = 301/543 (55%), Gaps = 41/543 (7%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL GW+ YF++K L W G L+ HP +L FA LL+PL R L R +A P L Sbjct: 18 GLGGWSLYFILKLLLAWRGALSAHPAPDLAFALVLLLPLRRRWLRLARDALAWPAAAILL 77 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 ++D+WLP P ++ + S++ GFS YL++L R + ++ V+L +L LS+++R+ Sbjct: 78 YYDSWLPPPAALWRELSELKGFSLSYLMELAGRILTPTLLIGFTVVLAGYLLLSRFLRLG 137 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 V+A LL L+ + LW P + ++ G Sbjct: 138 TLVMATLLALS-------AHELWQQRAPAASASSAFG----------------------G 168 Query: 200 PPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHP 259 P+ + L F+ +E +R+ F L F++L++++CSLSW D+ A G + P Sbjct: 169 APSAQTPDQRLEGFFRSEQRRQVGFQGPLAD--PGFDVLLLHVCSLSWDDLRAVGFDNPP 226 Query: 260 LWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQ 319 L + FDI F FNSA SYSGPAA+R+LRASCGQ H+ LY+PA C+LF+NL+K GF Sbjct: 227 LLARFDIVFDRFNSAASYSGPAALRVLRASCGQPRHSALYEPAPEQCFLFENLAKAGFKT 286 Query: 320 HLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTE 379 L + H+G F FL+++R NG M L Q + V GFD SP+Y D A+LNRWL + Sbjct: 287 ELSLNHDGSFDSFLQQIRRNGRMNLPLTPQDGVAVGQRGFDSSPIYSDYAMLNRWLQLRL 346 Query: 380 KDKNSRSATFYNTLPLHDGNHY---PGVSKTADYKARAQKFFDELDAFFTELEKSGRKVM 436 ++ + A +YNT+ LHDGN P + A YK RA + ++ F LE+ RK++ Sbjct: 347 QEPDPHVAVYYNTISLHDGNRIPEAPALDTDASYKYRAGRLLRDIGQFIDLLEQDHRKMI 406 
Query: 437 VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPI--VIEQPSSF 494 +++VPEHG AL+GD+ Q SGLR+IPSP IT VP +K G QG P + Q +S+ Sbjct: 407 LLLVPEHGAALRGDKQQFSGLREIPSPLITTVPAAIKVIG----SQGRPAQYRVTQQASY 462 Query: 495 LAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDW 554 AI+ ++ R+L F D + LP T VSEN+N V+Q + ++ +GG+W Sbjct: 463 TAIATILSRMLARSPFGAD-YQPESYAQDLPATPFVSENANFTVMQSGSRYLMQSSGGNW 521 Query: 555 VPY 557 Y Sbjct: 522 NDY 524 >UniRef50_A6GMN7 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GMN7_9BURK Length = 567 Score = 599 bits (1544), Expect = e-170, Method: Composition-based stats. Identities = 188/556 (33%), Positives = 277/556 (49%), Gaps = 30/556 (5%) Query: 24 WNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLP-----RYSLHRLRHWIALPIGFAL 78 WN YFL+KFGL ++G L PL NL A L+ P + + LR+ + AL Sbjct: 4 WNLYFLIKFGLHFSGQLTLSPLWNLGLFALLIATNPAAYQNKQLMKVLRYLVFTGPAIAL 63 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFI-NWQMIGAIFVLLVAWLFLSQWIR 137 H+ L +++ Q + GFS DYL +L+ R I W + GA+ +V L ++IR Sbjct: 64 LLHELGLVVSLALVDQIKALFGFSLDYLWELLKRTIQPWMLWGALLGFMVVR-VLDRYIR 122 Query: 138 ITVFV-VAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAA-----ATVAATGGAP-- 189 I+ +V ++ L + L P T + A A + A G Sbjct: 123 ISTWVGFGLVCILGIQAL--PFIQQQTQTNKTAALLEPQAQARENFSPAELLALGQVDFQ 180 Query: 190 ---VVGDMPAQTAPPTTA----NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINI 242 V D +QT A A L F++ +R P + F+++V+ I Sbjct: 181 KLRVARDQESQTKLRDNAYEGAGPGAVLAGFFDR--QRNIALAPFSPVVSPDFDVIVLQI 238 Query: 243 CSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPA 302 CSLSW+D++ A HP D F+NFNSATSYSGPAAIRLLR CGQT+H LY A Sbjct: 239 CSLSWADLQYAKQSQHPTIRQADFVFENFNSATSYSGPAAIRLLRGKCGQTTHDALYSVA 298 Query: 303 NNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVREN-GGMQSELMDQTNLPVILLGFDG 361 NN C LF+ L +GF + + H+G+F F K V+ N GG +EL+ ++P + FDG Sbjct: 299 NNSCMLFEQLRNVGFEVEMGLNHDGRFQDFSKLVKTNLGGKATELVAHDDVPAGVQAFDG 358 Query: 362 SPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYP--GVSKTADYKARAQKFFD 419 S V D L W + + A +YN++ LHDGN P ++ Y R ++ + Sbjct: 359 
SRVGRDGDYLRAWWNKRIQQSGPAVAYYYNSITLHDGNRLPNSNLNSLNSYPLRLERMLN 418 Query: 420 ELDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPV-GVKFFGMK 478 ++ + E+ +S RK +VVVVPEHG L G+ Q+ GLR++P+P+IT VPV G Sbjct: 419 DIQSVLGEIRRSDRKALVVVVPEHGAGLTGEFGQLVGLRELPTPAITKVPVFGYWIAPGY 478 Query: 479 APHQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVV 538 P P+ ++Q S+ A+S+L+ R L + W L S LP T VS+ N V Sbjct: 479 TPASTGPVSVKQSVSYTALSELLARWLAQPAEQQQKPAWPVLLSDLPDTRFVSQQGNITV 538 Query: 539 IQYQDKPYVRLNGGDW 554 ++ Q +++ G W Sbjct: 539 MESQGSYWIKAPGAAW 554 >UniRef50_A6SYG8 Uncharacterized conserved protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6SYG8_JANMA Length = 507 Score = 563 bits (1451), Expect = e-159, Method: Composition-based stats. Identities = 158/542 (29%), Positives = 266/542 (49%), Gaps = 43/542 (7%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + W+ YF KF L + + F+ LNL+ A + + IA+ AL + Sbjct: 1 MQYWSLYFFAKFALYFNHAIKFNWYLNLLLAICVSFSFRHPRWRIAQQSIAIIAAIALLY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 +++ LP P+ ++++ + ++ +S YL++L RFIN + V+L ++ L+ +R + Sbjct: 61 YESSLPPPDRLLAEAANMSSYSFSYLLELFARFINLWYVVVFAVMLALYVLLAGRLRFSS 120 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 FV +L + +L G P + G Sbjct: 121 FVFVGILLIPLLA---------QFGLPAQNLHLVAG------------------------ 147 Query: 201 PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPL 260 + A L+ FY E +R+ + PS A Q F+++++ +LSW D+ Sbjct: 148 --AQDPGAMLDTFYAQEKERRLSLPSIANAR-QAFDIVILQPGALSWDDLAFVDAPYPKF 204 Query: 261 WSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQH 320 + FD+ NFNSATS+S AA RLLR SCGQ + L + C LF+NL + G+ Sbjct: 205 LNRFDLVLLNFNSATSHSAAAAKRLLRGSCGQPGDSVLAEAPATGCSLFENLRQAGYDTA 264 Query: 321 LMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEK 380 +M HNG++ + + + G E + V + GFDG+PVYDD +VL++W + Sbjct: 265 AVMNHNGRYNRYAETISAYAGTGVE--EGKWGSVAMHGFDGTPVYDDFSVLSKWWNKHHA 322 Query: 381 
DKNS-RSATFYNTLPLHDGNHYPG---VSKTADYKARAQKFFDELDAFFTELEKSGRKVM 436 A +YN++ LHDGN PG ++ YK R K + D F T+LE S R V+ Sbjct: 323 HPGGKPVALYYNSITLHDGNILPGPRAINSVQSYKPRLDKLLADFDHFVTQLEASNRPVV 382 Query: 437 VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQ-GAPIVIEQPSSFL 495 V+++P HG A++GD++Q +G+R+IPSP +T VPV +K GM A + G P+ ++ +S+ Sbjct: 383 VILMPAHGAAMRGDQLQAAGMREIPSPKLTLVPVAIKLIGMAAAKEAGPPLEVKHATSYF 442 Query: 496 AISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 + L+ ++ G+ + + +TA VSEN +V++ +D +R W+ Sbjct: 443 GVFSLLADLMAGQANETAGKPLAERLQQVGETAFVSENEKIIVMRSKDGYIMRSAEDLWI 502 Query: 556 PY 557 Y Sbjct: 503 KY 504 >UniRef50_B8L0X3 Membrane protein ; endoglucanase BcsG n=5 Tax=Gammaproteobacteria RepID=B8L0X3_9GAMM Length = 182 Score = 243 bits (621), Expect = 9e-63, Method: Composition-based stats. Identities = 95/178 (53%), Positives = 116/178 (65%), Gaps = 5/178 (2%) Query: 387 ATFYNTLPLHDGNHYPGV---SKTADYKARAQKFFDELDAFFTELEKSGRKVMVVVVPEH 443 A FYNT+ LHDGN G S ADYKARA+ ++ F +EKSGR+ M+VVVPEH Sbjct: 2 ALFYNTISLHDGNRIVGADGRSNAADYKARAEMVLGDMAGFVDAVEKSGRRAMIVVVPEH 61 Query: 444 GGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFLAISDLVVR 503 G AL GDRMQ+ G+R+IPSPSIT VPVGVK GM AP G P I +PSS+LA+S+LV R Sbjct: 62 GAALHGDRMQIPGMREIPSPSITHVPVGVKLVGMGAPAAGGPRHIPEPSSYLAVSELVSR 121 Query: 504 VLDGKIFTEDNV-DWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGD-WVPYPQ 559 V + + +W L GLPQT VSEN A VI Y KP+++L G W PYP+ Sbjct: 122 VYALNAQSPPSERNWDSLLKGLPQTPSVSENEGAKVIDYGGKPWLQLQGSQTWSPYPE 179 >UniRef50_B5WNI3 Putative uncharacterized protein n=1 Tax=Burkholderia sp. H160 RepID=B5WNI3_9BURK Length = 163 Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats. 
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 6/163 (3%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + WNFYF+ K L G L LNL+FA LL+P L +R+ +A+ IG AL + Sbjct: 1 MGLWNFYFIAKLYLAGIGKLQPLWWLNLLFAIALLVPFGDRRLRVVRNLVAVVIGIALLY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 + L P M+ + + L+ R + + A+ L VA+ +S+W+R+T Sbjct: 61 FE--LGEPAFSMAAA----HLPQQHALALIVRIVPLSTLVALATLFVAYYVVSRWVRVTT 114 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVA 183 FV+ L + V + + + + A Sbjct: 115 FVLISLFAILVWQGFSALTAQASSNVDSARACPPASPGRSEPA 157 >UniRef50_UPI00016ABB33 hypothetical protein Bpseu9_34329 n=5 Tax=pseudomallei group RepID=UPI00016ABB33 Length = 151 Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats. Identities = 39/133 (29%), Positives = 70/133 (52%), Gaps = 2/133 (1%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPL--PRYSLHRLRHWIALPIGFAL 78 ++ WN YF++K L AG+L + NL FA L + R S+ LRH +AL + L Sbjct: 1 MTFWNLYFVLKLYLFAAGHLKPLWIANLGFALALALSAPARRRSVQLLRHALALALAVPL 60 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRI 138 + + +P ++ + FS Y ++LV RF+ + A +++ +L +++W+R+ Sbjct: 61 MYREADVPPLARLVETLGGLRAFSAGYWMELVPRFVPPMLALAALGVVIGYLIVNRWLRV 120 Query: 139 TVFVVAILLWLNV 151 FV+ L+ L V Sbjct: 121 ATFVLLALIALPV 133 >UniRef50_Q3A304 Putative uncharacterized protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A304_PELCD Length = 213 Score = 43.9 bits (102), Expect = 0.015, Method: Composition-based stats. 
Identities = 22/112 (19%), Positives = 37/112 (33%), Gaps = 14/112 (12%) Query: 290 CGQTSHTNLYQPANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQ 349 C +T + P N + L K GF ++ +G F + G L+DQ Sbjct: 23 CPKTRQAVVIDPGGNGSSILAELEKQGFELKAVINTHGHFD--------HIGGNKTLIDQ 74 Query: 350 TNLPVILLGFDGSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHY 401 T ++L + P+ + D + T L DG+ Sbjct: 75 TGADLLLHA-EALPLLRGASSHAASFGCRAIDPSPEP-----TRLLQDGDRI 120 >UniRef50_D1PM20 Putative uncharacterized protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PM20_9FIRM Length = 450 Score = 43.1 bits (100), Expect = 0.028, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 45/146 (30%), Gaps = 20/146 (13%) Query: 18 WRGLS-GWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIA----L 72 WR L YF+ ++ + F L A LL ++ ++A + Sbjct: 135 WRNLGVAGALYFIARWVYFNGQNIWF-----LGLAVVLLAAKDVPLRRAMKAFLACGLPV 189 Query: 73 PIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFL 132 + L + GS F + G I L+ L Sbjct: 190 LALVEVLHFAGILAPGATSERDGSFRLMFGYGH----------PNTFGGIVFGLLLAWVL 239 Query: 133 SQWIRITVFVVAILLWLNVLTLAGPS 158 + +R+ +A + + V L GP+ Sbjct: 240 LRRVRLCWLEIAGVAGVGVFLLLGPA 265 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37659 Uncharacterized protein yhjU n=118 Tax=Bacteria ... 811 0.0 UniRef50_B5XN20 Cellulose biosynthesis protein BcsG n=11 Tax=Ent... 768 0.0 UniRef50_Q5DZ39 Predicted inner membrane protein n=11 Tax=Vibrio... 701 0.0 UniRef50_Q3IER9 Putative membrane protein ; putative endoglucana... 648 0.0 UniRef50_D1T8Y6 Putative uncharacterized protein n=1 Tax=Burkhol... 643 0.0 UniRef50_A4SY20 Putative uncharacterized protein n=1 Tax=Polynuc... 641 0.0 UniRef50_B1JZP6 Cellulose synthase operon protein YhjU n=54 Tax=... 638 0.0 UniRef50_C5V4W5 Cellulose synthase operon protein YhjU n=1 Tax=G... 
630 e-179 UniRef50_B1Y241 Cellulose synthase operon protein YhjU n=1 Tax=L... 626 e-178 UniRef50_Q7NUM4 Putative uncharacterized protein n=1 Tax=Chromob... 622 e-176 UniRef50_A6GMN7 Putative uncharacterized protein n=1 Tax=Limnoba... 599 e-170 UniRef50_A6SYG8 Uncharacterized conserved protein n=1 Tax=Janthi... 565 e-159 UniRef50_B8L0X3 Membrane protein ; endoglucanase BcsG n=5 Tax=Ga... 247 8e-64 UniRef50_B5WNI3 Putative uncharacterized protein n=1 Tax=Burkhol... 157 7e-37 UniRef50_UPI00016ABB33 hypothetical protein Bpseu9_34329 n=5 Tax... 127 1e-27 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P37659 Uncharacterized protein yhjU n=118 Tax=Bacteria RepID=YHJU_ECOLI Length = 559 Score = 811 bits (2094), Expect = 0.0, Method: Composition-based stats. Identities = 559/559 (100%), Positives = 559/559 (100%) Query: 1 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR 60 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR Sbjct: 1 MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPR 60 Query: 61 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG 120 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG Sbjct: 61 YSLHRLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIG 120 Query: 121 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA 180 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA Sbjct: 121 AIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAA 180 Query: 181 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI 240 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI Sbjct: 181 TVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI 240 Query: 241 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ 300 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ Sbjct: 241 NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ 300 Query: 301 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD 360 
PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD Sbjct: 301 PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFD 360 Query: 361 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE 420 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE Sbjct: 361 GSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDE 420 Query: 421 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP 480 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP Sbjct: 421 LDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP 480 Query: 481 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ 540 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ Sbjct: 481 HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQ 540 Query: 541 YQDKPYVRLNGGDWVPYPQ 559 YQDKPYVRLNGGDWVPYPQ Sbjct: 541 YQDKPYVRLNGGDWVPYPQ 559 >UniRef50_B5XN20 Cellulose biosynthesis protein BcsG n=11 Tax=Enterobacteriaceae RepID=B5XN20_KLEP3 Length = 559 Score = 768 bits (1982), Expect = 0.0, Method: Composition-based stats. 
Identities = 414/557 (74%), Positives = 469/557 (84%), Gaps = 3/557 (0%) Query: 5 TQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLH 64 T+ TA P LWQYWRGL GWNFYFLVKF LLWAGYLNFHP+LNLVF AFLL+P+PR LH Sbjct: 4 TKPTATPLPLWQYWRGLGGWNFYFLVKFALLWAGYLNFHPMLNLVFLAFLLVPIPREKLH 63 Query: 65 RLRHWIALPIGFALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFV 124 R+RHWIA+P+GFALFWHDTWLPGPE+++SQGSQ+AGFS Y+ DL+ RFINW M+GA FV Sbjct: 64 RIRHWIAIPLGFALFWHDTWLPGPETLLSQGSQIAGFSASYIWDLIVRFINWSMVGAFFV 123 Query: 125 LLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAA 184 LLV WLF+SQW+R+TVFV A+++WL V L P+F+LWPAGQPTT TT AA Sbjct: 124 LLVLWLFISQWLRVTVFVSAMVVWLAVSPLL-PAFTLWPAGQPTTAAATTAPANTGANAA 182 Query: 185 TGG--APVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINI 242 G +P D+P QT PPT+ANL WLN FY AE KRK+ FP LPADAQPF+LLVINI Sbjct: 183 AGTATSPASSDIPPQTEPPTSANLTNWLNGFYAAEQKRKTPFPDQLPADAQPFDLLVINI 242 Query: 243 CSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPA 302 CSLSWSDIEAAGLM HPLW HFDI FKNFNSATSYSGPAA+RLLRASCGQ SHTNLYQP+ Sbjct: 243 CSLSWSDIEAAGLMDHPLWKHFDIVFKNFNSATSYSGPAAVRLLRASCGQLSHTNLYQPS 302 Query: 303 NNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGS 362 DCYLF+NL+KLGF Q LM+GHNG FG FLKE+R GGMQS LMDQT LPV L FDGS Sbjct: 303 GADCYLFENLAKLGFNQQLMLGHNGLFGDFLKELRSLGGMQSPLMDQTGLPVSLQAFDGS 362 Query: 363 PVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTADYKARAQKFFDELD 422 PVY+D AVLNRWL E N RSATFYNTLPLHDGNH+PG SKTADYK RAQK FD+LD Sbjct: 363 PVYEDLAVLNRWLKTEEASNNPRSATFYNTLPLHDGNHFPGQSKTADYKVRAQKLFDDLD 422 Query: 423 AFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQ 482 FFTELEKSGRKVMVVVVPEHGGALKGD+MQVSGLRDIPSPSIT+VP VKFFGMKAPH+ Sbjct: 423 NFFTELEKSGRKVMVVVVPEHGGALKGDKMQVSGLRDIPSPSITNVPTAVKFFGMKAPHE 482 Query: 483 GAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQ 542 GAPI+I+QPSS+LA+S+LVVR LDGK+F+ED+V+W++ + LPQ+A VSEN+NA+VIQYQ Sbjct: 483 GAPIIIDQPSSYLAVSELVVRALDGKMFSEDSVNWQQYVANLPQSAAVSENANALVIQYQ 542 Query: 543 DKPYVRLNGGDWVPYPQ 559 KPYV+LNGG WVPYPQ 
Sbjct: 543 GKPYVQLNGGSWVPYPQ 559 >UniRef50_Q5DZ39 Predicted inner membrane protein n=11 Tax=Vibrionales RepID=Q5DZ39_VIBF1 Length = 542 Score = 701 bits (1809), Expect = 0.0, Method: Composition-based stats. Identities = 228/543 (41%), Positives = 332/543 (61%), Gaps = 18/543 (3%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL WN YF++K GL G ++FHP+ N AFLL+P+ L+ +R ++A+P G L Sbjct: 12 GLGWWNIYFIIKIGLFLQGIIDFHPIENFALVAFLLIPIRHKILNVIRQFLAVPFGLWLM 71 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 +D++LP + + SQ Q+ F+ YLI+L +RF++ + + +FVL+ A+ FL+Q RI+ Sbjct: 72 HYDSFLPPLDRLWSQMGQLLQFNLSYLIELASRFVSLETLLGLFVLVFAYYFLNQIFRIS 131 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 VFVV L+ ++ L FS QP T + +A A V +T Sbjct: 132 VFVVITLIAIS---LPSDLFS----SQPNTVANVSQQPESAETAQISDHQV-----DETG 179 Query: 200 PPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHP 259 P A LN F+ EA+R+ +FP++ P + F+LL ++ICS++W DIE AGL SHP Sbjct: 180 PVNDAVLNNAKELFFRNEAQRRVSFPTTSP--STDFDLLFLSICSVAWDDIEIAGLESHP 237 Query: 260 LWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ-PANNDCYLFDNLSKLGFT 318 L+ FD+ F NF++ATSYSGPA IRLLRASCGQ SH L++ PA+ C+LFDNL+KLGF Sbjct: 238 LFKEFDVMFDNFSAATSYSGPAVIRLLRASCGQESHPELFKAPASKQCFLFDNLAKLGFQ 297 Query: 319 QHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVT 378 ++L++ H+G+F FL ++E+G +++ LM Q L FDGSP+Y D VLNRWLD Sbjct: 298 ENLLLNHDGKFDDFLGLLKEDGDLKAPLMSQAGLTQYQSAFDGSPIYRDKDVLNRWLDKR 357 Query: 379 EKDKNSRSATFYNTLPLHDGNHY---PGVSKTADYKARAQKFFDELDAFFTELEKSGRKV 435 EK ++ + YNT+ LHDGN G YK R + D+L FF EL+ S R + Sbjct: 358 EKSQDGPTVALYNTISLHDGNRIIKASGKVGLVSYKLRLKNLLDDLYDFFQELKASNRNI 417 Query: 436 MVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFL 495 +V++VPEHG ++GD+MQ++G+R+IPS +I PVG+K FG G+ I PSS+L Sbjct: 418 VVMLVPEHGAGMRGDKMQIAGMREIPSATIVHTPVGMKIFGQGMTRLGSTAHISAPSSYL 477 Query: 496 AISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 A+S LV R+++ I+ + +L LP T V++NS V++Y KPYV 
L+G W Sbjct: 478 AVSTLVSRIIEEDIYATKTFNTAELVKDLPTTKMVAQNSGTTVMEYNKKPYVSLDGTTWS 537 Query: 556 PYP 558 YP Sbjct: 538 EYP 540 >UniRef50_Q3IER9 Putative membrane protein ; putative endoglucanase BcsG n=2 Tax=Alteromonadales RepID=Q3IER9_PSEHT Length = 525 Score = 648 bits (1672), Expect = 0.0, Method: Composition-based stats. Identities = 204/545 (37%), Positives = 306/545 (56%), Gaps = 30/545 (5%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL WN YFLVKF L + G + F L N AA + + +L+H+I L Sbjct: 5 GLGIWNLYFLVKFVLFYYGAIKFDFLSNAALAALFALTFSNSQVDKLKHFIGAVFAIVLL 64 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 + D+WLP + + Q + FS Y ++L R +N+ M+ +F++++ + + SQWIR T Sbjct: 65 YKDSWLPPIDRLTKQAGNIQDFSLGYFVELFGRIVNYDMLLGLFIIVICFWYTSQWIRFT 124 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 +A L+++ P + A +V AP + TA Sbjct: 125 TVTIAGLIFIGYQGALKP------------------NDMAVSVQ---NAPNQEQEFSNTA 163 Query: 200 PPTTA--NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMS 257 T + + LN+FY +++ S FP + F+++V+NICSL+ +D+ A G+ Sbjct: 164 VVTQKLDSADQQLNDFYKQQSQLVSYFPDEY--NGTQFDVVVLNICSLAIADLNAIGVSL 221 Query: 258 HPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGF 317 ++S FDI F +FNSATSYSGPAAIRLLRASCGQTS L+ A C+LF+NL KLG+ Sbjct: 222 DDVYSDFDIVFSDFNSATSYSGPAAIRLLRASCGQTSQPALFDDAPEQCHLFNNLEKLGY 281 Query: 318 TQHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDV 377 HL+M H+G F GF V++ G + S L D T+LPV FD P+Y D AVLN WL+ Sbjct: 282 DSHLVMNHDGHFDGFKDLVKKQGKLNSPLFDTTSLPVAQYSFDSKPIYSDEAVLNSWLE- 340 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPG---VSKTADYKARAQKFFDELDAFFTELEKSGRK 434 + D + A +YNT+ LHDGN ++ Y R Q F +++ F LEK GR Sbjct: 341 EQGDSCAPCAMYYNTISLHDGNQLANSRRMNSDESYPVRQQNLFSDINQFIKNLEKRGRN 400 Query: 435 VMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSF 494 VM+++VPEHG AL+GD++Q +GLR+IPSPSI VP +KF G P + I + SS+ Sbjct: 401 VMLMLVPEHGAALQGDKVQFAGLREIPSPSIVTVPAAIKFIGPDLPRM-SQITVANTSSY 459 Query: 495 
LAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDW 554 A+S+L+ +V+ F + + +L S LP + V+EN+ +++ +PY++L+GG+W Sbjct: 460 FALSELITKVMKSNYFAGKSNNMAELVSQLPTSQKVAENAGTIMMYVNKRPYIQLDGGEW 519 Query: 555 VPYPQ 559 YP+ Sbjct: 520 TLYPR 524 >UniRef50_D1T8Y6 Putative uncharacterized protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T8Y6_9BURK Length = 523 Score = 643 bits (1659), Expect = 0.0, Method: Composition-based stats. Identities = 186/544 (34%), Positives = 280/544 (51%), Gaps = 31/544 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMP--LPRYSLHRLRHWIALPIGFAL 78 ++ WN YF++K L G+L L NL FA L++ L R +L +R+ L +G L Sbjct: 1 MTLWNLYFILKLYLFAGGHLQPLWLANLGFALALVVTSTLRRRALRIVRNLAGLALGVPL 60 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRI 138 +H+ +P + + + FS Y ++L RF+ ++ A LV +L +++W+R+ Sbjct: 61 VYHEANVPPFSRLTEEFGNLTTFSYGYWLELAQRFLPPMLLLAALGALVGYLIVNRWVRV 120 Query: 139 TVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQT 198 FV+ L+ + + GG A + Sbjct: 121 ATFVLIALVAIPLW--------------------HEGGVVLAQLRGASANAAANAGANGA 160 Query: 199 APPTTA---NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGL 255 P A + NA L F E++R+ +F F+++V++ICSLSW D++ A Sbjct: 161 NNPLAAQPLDHNAALAAFRTQESQRQVSFGHLAADPNAQFDVIVLHICSLSWDDLDVAKA 220 Query: 256 MSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKL 315 +HPL S FD F NF++A SYSGPAAIR+LRASCGQ +H +LY+ A CYL +L++ Sbjct: 221 RNHPLLSRFDYLFTNFSTAASYSGPAAIRVLRASCGQQAHADLYKNAPQQCYLLADLAQA 280 Query: 316 GFTQHLMMGHNGQFGGFLKEVRENGGM-QSELMDQTNLPVILLGFDGSPVYDDTAVLNRW 374 G+T M+ H+G F FL+ + +N G+ L+ T++PV + FDGSP+ DD L W Sbjct: 281 GYTPQTMLNHDGHFDNFLELIHDNAGVPNVPLIPNTSVPVAMHAFDGSPIRDDYETLAAW 340 Query: 375 LDVTEKDKNSRSATFYNTLPLHDGNHYPGVSKTA--DYKARAQKFFDELDAFFTELEKSG 432 A +YNT+ LHDGN P S T+ Y R K + D F + SG Sbjct: 341 Y-AQRASIAGPVALYYNTISLHDGNRLPNSSLTSIDSYPLRVNKLMSDFDRFADLIASSG 399 Query: 433 RKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPS 492 R+ ++V VPEHG AL+GD QV+GLR+IP+P I PVGV+ G + H G+ VI+ PS Sbjct: 400 
RRAVIVFVPEHGAALRGDTNQVAGLREIPTPRIVHGPVGVRVVGFQGSH-GSTTVIDDPS 458 Query: 493 SFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGG 552 SFLA++ L+ ++ F + V + + LPQT V EN V ++ V+ G Sbjct: 459 SFLALAQLLSNLVSNSPF-KPGVSLSQYATNLPQTQMVGENEGTVTMKTASGYVVKTPDG 517 Query: 553 DWVP 556 WV Sbjct: 518 VWVD 521 >UniRef50_A4SY20 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SY20_POLSQ Length = 514 Score = 641 bits (1653), Expect = 0.0, Method: Composition-based stats. Identities = 189/544 (34%), Positives = 301/544 (55%), Gaps = 41/544 (7%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + W FYFL+K L + GY+NFH +NL FA L+ R L +++ W+A+PIG LF+ Sbjct: 1 MGLWAFYFLIKIILFYTGYINFHFFVNLAFALALIFSHARPRLLQIKRWVAIPIGIILFY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 D+ LP +I+++ Q+ GFS +Y I+L+ R ++W+++ A+ V V + LS+ IR+T Sbjct: 61 FDSPLPPLRNIIAKLDQLLGFSFNYYIELLGRILDWRILVALAVSFVIYYALSKKIRMTT 120 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 +A+L +VL P S N++ + G + Sbjct: 121 --IAMLAIFSVLL---PFHS----------------NSSMQAYDSDGQVI--------GI 151 Query: 201 PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPL 260 P+ A L L+ F+ E+ R + F S A Q F++LVIN+CSL+W D++ +PL Sbjct: 152 PSDAVLTESLDGFFVEESGR-TGFNRSQKASGQAFDILVINVCSLAWDDLKYVKEEDNPL 210 Query: 261 WSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQH 320 + F F +FNSA+SYSGP+ IRLLRAS GQ +LY+ D LF+NL GF Sbjct: 211 FKRFHYLFTSFNSASSYSGPSIIRLLRASRGQQDQRDLYKKPVEDSLLFNNLKTAGFQTQ 270 Query: 321 LMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEK 380 + H+G++G L+E+R +GG+ + L D L FDGS VY+D +VL+ W D K Sbjct: 271 FALNHDGKYGDLLQEIRTDGGLSAPLFDNKQATPYLRAFDGSQVYEDYSVLSNWWDARMK 330 Query: 381 DKNSRSATFYNTLPLHDGNHYPGV----SKTADYKARAQKFFDELDAFFTELEKSGRKVM 436 R A FYNT+ LHDGN + Y R K +++D F+T++ SGR+V+ Sbjct: 331 LPAERVALFYNTITLHDGNRALDGARLENSVETYSRRLHKLLEDVDRFYTKVNNSGRQVV 390 Query: 437 
VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPI---VIEQPSS 493 +V +PEHG A++ ++ ++ G+R+IPSP++T++PVG+ M +P I+QP+S Sbjct: 391 IVFIPEHGAAIRRNKNEIVGMREIPSPNVTNIPVGI----MLTNKSDSPFKTNRIDQPTS 446 Query: 494 FLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGD 553 +LA S+L+ + + F + + LP T V+EN ++V++++ Y R N + Sbjct: 447 YLATSELISKFVAKPPFGASTGNLEAYLKDLPSTRFVAENEDSVIMKFGASYYFRSNDIN 506 Query: 554 WVPY 557 W + Sbjct: 507 WNLF 510 >UniRef50_B1JZP6 Cellulose synthase operon protein YhjU n=54 Tax=Burkholderiaceae RepID=B1JZP6_BURCC Length = 518 Score = 638 bits (1646), Expect = 0.0, Method: Composition-based stats. Identities = 184/541 (34%), Positives = 280/541 (51%), Gaps = 30/541 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLM--PLPRYSLHRLRHWIALPIGFAL 78 ++ WN YF++KF L G L L NL FA L+ P+ + +R +A+ I L Sbjct: 1 MTFWNLYFILKFALFATGRLQPFWLANLAFAVALVASAPIRSRAWRIVRQVVAVAIAVPL 60 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRI 138 + P + +V+ F DY ++L+ R + + I +L+ + +++W+R+ Sbjct: 61 LARELHAPSLARLAEAAREVSTFRLDYWMELLPRLLPPVLALTIVGVLIVYFIVNRWLRV 120 Query: 139 TVFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQT 198 FVVA+L+ + LW AG A VA T Sbjct: 121 ATFVVAVLVVM----------PLWQAGSGLMARVVAPAQPQANVAGA------------T 158 Query: 199 APPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSH 258 + NA L F E++R+ F A F+++V++ICSLSW D++AA + +H Sbjct: 159 RVDQPEDHNAALATFRAQESQRQVAFGHLGSDPAAQFDVIVLHICSLSWDDLDAAKVRNH 218 Query: 259 PLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFT 318 P+ SHFD F NF++A SYSGPAAIR+LRASCGQ +H +LY+PA C+LF L+ G+T Sbjct: 219 PMLSHFDYLFTNFSTAASYSGPAAIRVLRASCGQEAHADLYKPAPQQCHLFSQLAGAGYT 278 Query: 319 QHLMMGHNGQFGGFLKEVRENGGM-QSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDV 377 ++ H+G F FL+ + +N G+ + ++ PV + FDGS + DD A L W Sbjct: 279 VQSLLNHDGHFDNFLQVIHDNIGVADAPMISNAAAPVAMHAFDGSAIKDDYATLANWY-A 337 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPGVSKTA--DYKARAQKFFDELDAFFTELEKSGRKV 435 A +YNT+ LHDGN G + T+ Y RA K + D + +SGR+ Sbjct: 338 
QRAAVPGPVALYYNTISLHDGNRVVGSALTSIDSYPQRATKMMTDFDRLADLIAQSGRRA 397 Query: 436 MVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFL 495 ++V VPEHG AL+GD+ Q++GLR+IP+P I PVGV+ G H GA VIEQP+SFL Sbjct: 398 VIVFVPEHGAALRGDKNQIAGLREIPTPRIVHGPVGVRLVGFTGNH-GATTVIEQPTSFL 456 Query: 496 AISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 A++ L+ ++ F + + + LP+T + EN V +Q V+ G W+ Sbjct: 457 ALAQLLSNLVSNSPF-KPGATLAQYAADLPRTRMIGENEGTVTMQTAAGYAVKTPDGVWI 515 Query: 556 P 556 Sbjct: 516 D 516 >UniRef50_C5V4W5 Cellulose synthase operon protein YhjU n=1 Tax=Gallionella ferruginea ES-2 RepID=C5V4W5_9PROT Length = 709 Score = 630 bits (1625), Expect = e-179, Method: Composition-based stats. Identities = 210/543 (38%), Positives = 300/543 (55%), Gaps = 40/543 (7%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 L W +YF K L G + FHP+ NL+FAA LL+P+ L+RLR A + AL + Sbjct: 195 LGLWGYYFAAKLALFGLGTITFHPMENLLFAALLLLPVSSRLLYRLRAIFATLLALALLY 254 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 +D+WLP +++Q SQ++ FS Y+++L RF +WQ G + L +A+ +S+ IR+ V Sbjct: 255 YDSWLPDIRRLITQASQLSDFSWAYVVELSGRFFSWQTTGLLLALSIAYWIVSRRIRVGV 314 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 F+V +L L NAA A V DM Sbjct: 315 FIVLGMLMLWGWQ-----------------------NAARLSADKAILNVDLDM------ 345 Query: 201 PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPL 260 N L +F+ EA+R F + DA PF+++ I++CSLSW D+ A GL HPL Sbjct: 346 ------NKVLQDFFLKEAQRSILFVTP-QTDAVPFDVIFIHVCSLSWDDVRAVGLEDHPL 398 Query: 261 WSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQH 320 W FDI K FNSA SYSGPAAI +RA CGQT H +Y + CYL ++L GF + Sbjct: 399 WQRFDILMKKFNSAASYSGPAAIHFMRAKCGQTEHGVMYTTVADKCYLMNSLQLSGFEPN 458 Query: 321 LMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEK 380 L++ H+G+F FL +++++G + + M L V FD SPVYDD +VL+RWL + Sbjct: 459 LVLNHDGKFDDFLGQLKKHGRLNAPPMSLEGLEVAQHAFDRSPVYDDFSVLDRWLQTRQT 518 Query: 381 
DKNSRSATFYNTLPLHDGNHYPGVSKTAD----YKARAQKFFDELDAFFTELEKSGRKVM 436 +SR A +YNT+ +HDGNH G ++D YK R KF DE D F +LE SGR+ + Sbjct: 519 SASSRVAMYYNTVSMHDGNHVSGADASSDTLVNYKNRLNKFLDETDRFLQKLETSGRRAV 578 Query: 437 VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFLA 496 VV+VPEHG A++GD+ Q++GLR+IP+PSI+ VPVG+K G +G + I+QP+S+LA Sbjct: 579 VVMVPEHGAAIRGDKRQIAGLREIPTPSISLVPVGIKMVGGGVQREGDALTIDQPTSYLA 638 Query: 497 ISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWVP 556 IS ++ R L+ F P T VS+N V + Q + Y+ W Sbjct: 639 ISHIIERTLEQSPFANGRFRSADYVQNYPHTRFVSQNETVTVAESQGQYYLSRGASQWDA 698 Query: 557 YPQ 559 Y + Sbjct: 699 YTE 701 >UniRef50_B1Y241 Cellulose synthase operon protein YhjU n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y241_LEPCP Length = 529 Score = 626 bits (1615), Expect = e-178, Method: Composition-based stats. Identities = 188/542 (34%), Positives = 273/542 (50%), Gaps = 28/542 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + W+ YFLVK GL AG + LNL+FA L P R ++A P+ AL + Sbjct: 1 MGSWSLYFLVKLGLHVAGLIQLDVPLNLLFAVALAWPWAHPGWRRAWRFLAWPVAVALLY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 HD++ P I+SQ ++GFS YL++LV R IN Q++ A+ + W L Q +R+ Sbjct: 61 HDSFWPPATRILSQWQAISGFSFAYLVELVGRVINVQLLVAVAMGAALWWVLKQRLRLAT 120 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 +V L+ + L + GG AATG + Sbjct: 121 WVFVGLVAVAALP------------------SQHGGVTDLAQAATGADGTADRAADRATA 162 Query: 201 P--TTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSH 258 P L+ L FY+ E + P + F+LL++NICSLSW D+ AGL + Sbjct: 163 PLLDAGQLDQALQAFYDTERGKILRLPKD--GNVPGFDLLILNICSLSWDDLAFAGLRNA 220 Query: 259 PLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFT 318 P D+ F FNSA SYSGPA +RLL +CGQ + LY A DCYLF NL + G+ Sbjct: 221 PFMRRLDVVFDRFNSAASYSGPAVMRLLHGTCGQPAQHELYGGAVADCYLFRNLEQAGYR 280 Query: 319 QHLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWL-DV 377 L++ H+G++ F E+R + G+ + + V + FDGSP+ DD L RW D Sbjct: 281 
PALLLNHDGRYDNFSTELRRDSGLGLVPEQRFDAAVAMSSFDGSPIRDDGETLTRWWSDR 340 Query: 378 TEKDKNSRSATFYNTLPLHDGNHYPGV---SKTADYKARAQKFFDELDAFFTELEKSGRK 434 T A YN++ LHDGN PG+ S Y RA+K +L+ F +E SGR Sbjct: 341 TAAASGVPLAMLYNSITLHDGNRVPGIQSLSSLETYAPRARKLMADLERFAALVEASGRP 400 Query: 435 VMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMK--APHQGAPIVIEQPS 492 ++V+VPEHGGA++GD Q++GLR++P+P+IT VP GV GM P+ +EQ S Sbjct: 401 TVLVLVPEHGGAVRGDAQQIAGLRELPTPAITHVPAGVMLIGMGERRADGQEPVHVEQTS 460 Query: 493 SFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGG 552 S+L++ +V ++ G ++ LP V+EN +V++ YVR G Sbjct: 461 SYLSLFTVVAALMHGGPEVATPERLTEVAQALPPVEWVAENDKTIVLRRGPHTYVRDFEG 520 Query: 553 DW 554 W Sbjct: 521 RW 522 >UniRef50_Q7NUM4 Putative uncharacterized protein n=1 Tax=Chromobacterium violaceum RepID=Q7NUM4_CHRVO Length = 526 Score = 622 bits (1604), Expect = e-176, Method: Composition-based stats. Identities = 204/543 (37%), Positives = 301/543 (55%), Gaps = 41/543 (7%) Query: 20 GLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALF 79 GL GW+ YF++K L W G L+ HP +L FA LL+PL R L R +A P L Sbjct: 18 GLGGWSLYFILKLLLAWRGALSAHPAPDLAFALVLLLPLRRRWLRLARDALAWPAAAILL 77 Query: 80 WHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRIT 139 ++D+WLP P ++ + S++ GFS YL++L R + ++ V+L +L LS+++R+ Sbjct: 78 YYDSWLPPPAALWRELSELKGFSLSYLMELAGRILTPTLLIGFTVVLAGYLLLSRFLRLG 137 Query: 140 VFVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTA 199 V+A LL L+ + LW P + ++ G Sbjct: 138 TLVMATLLALS-------AHELWQQRAPAASASSAFG----------------------G 168 Query: 200 PPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHP 259 P+ + L F+ +E +R+ F L F++L++++CSLSW D+ A G + P Sbjct: 169 APSAQTPDQRLEGFFRSEQRRQVGFQGPLAD--PGFDVLLLHVCSLSWDDLRAVGFDNPP 226 Query: 260 LWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQ 319 L + FDI F FNSA SYSGPAA+R+LRASCGQ H+ LY+PA C+LF+NL+K GF Sbjct: 227 LLARFDIVFDRFNSAASYSGPAALRVLRASCGQPRHSALYEPAPEQCFLFENLAKAGFKT 286 Query: 320 
HLMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTE 379 L + H+G F FL+++R NG M L Q + V GFD SP+Y D A+LNRWL + Sbjct: 287 ELSLNHDGSFDSFLQQIRRNGRMNLPLTPQDGVAVGQRGFDSSPIYSDYAMLNRWLQLRL 346 Query: 380 KDKNSRSATFYNTLPLHDGNHY---PGVSKTADYKARAQKFFDELDAFFTELEKSGRKVM 436 ++ + A +YNT+ LHDGN P + A YK RA + ++ F LE+ RK++ Sbjct: 347 QEPDPHVAVYYNTISLHDGNRIPEAPALDTDASYKYRAGRLLRDIGQFIDLLEQDHRKMI 406 Query: 437 VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPI--VIEQPSSF 494 +++VPEHG AL+GD+ Q SGLR+IPSP IT VP +K G QG P + Q +S+ Sbjct: 407 LLLVPEHGAALRGDKQQFSGLREIPSPLITTVPAAIKVIG----SQGRPAQYRVTQQASY 462 Query: 495 LAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDW 554 AI+ ++ R+L F D + LP T VSEN+N V+Q + ++ +GG+W Sbjct: 463 TAIATILSRMLARSPFGAD-YQPESYAQDLPATPFVSENANFTVMQSGSRYLMQSSGGNW 521 Query: 555 VPY 557 Y Sbjct: 522 NDY 524 >UniRef50_A6GMN7 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GMN7_9BURK Length = 567 Score = 599 bits (1545), Expect = e-170, Method: Composition-based stats. 
Identities = 188/559 (33%), Positives = 278/559 (49%), Gaps = 30/559 (5%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLP-----RYSLHRLRHWIALPIG 75 + WN YFL+KFGL ++G L PL NL A L+ P + + LR+ + Sbjct: 1 MLLWNLYFLIKFGLHFSGQLTLSPLWNLGLFALLIATNPAAYQNKQLMKVLRYLVFTGPA 60 Query: 76 FALFWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFI-NWQMIGAIFVLLVAWLFLSQ 134 AL H+ L +++ Q + GFS DYL +L+ R I W + GA+ +V L + Sbjct: 61 IALLLHELGLVVSLALVDQIKALFGFSLDYLWELLKRTIQPWMLWGALLGFMVVR-VLDR 119 Query: 135 WIRITVFV-VAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAA-----ATVAATGGA 188 +IRI+ +V ++ L + L P T + A A + A G Sbjct: 120 YIRISTWVGFGLVCILGIQAL--PFIQQQTQTNKTAALLEPQAQARENFSPAELLALGQV 177 Query: 189 P-----VVGDMPAQTAPPTTA----NLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLV 239 V D +QT A A L F++ +R P + F+++V Sbjct: 178 DFQKLRVARDQESQTKLRDNAYEGAGPGAVLAGFFDR--QRNIALAPFSPVVSPDFDVIV 235 Query: 240 INICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLY 299 + ICSLSW+D++ A HP D F+NFNSATSYSGPAAIRLLR CGQT+H LY Sbjct: 236 LQICSLSWADLQYAKQSQHPTIRQADFVFENFNSATSYSGPAAIRLLRGKCGQTTHDALY 295 Query: 300 QPANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVREN-GGMQSELMDQTNLPVILLG 358 ANN C LF+ L +GF + + H+G+F F K V+ N GG +EL+ ++P + Sbjct: 296 SVANNSCMLFEQLRNVGFEVEMGLNHDGRFQDFSKLVKTNLGGKATELVAHDDVPAGVQA 355 Query: 359 FDGSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNHYP--GVSKTADYKARAQK 416 FDGS V D L W + + A +YN++ LHDGN P ++ Y R ++ Sbjct: 356 FDGSRVGRDGDYLRAWWNKRIQQSGPAVAYYYNSITLHDGNRLPNSNLNSLNSYPLRLER 415 Query: 417 FFDELDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPV-GVKFF 475 +++ + E+ +S RK +VVVVPEHG L G+ Q+ GLR++P+P+IT VPV G Sbjct: 416 MLNDIQSVLGEIRRSDRKALVVVVPEHGAGLTGEFGQLVGLRELPTPAITKVPVFGYWIA 475 Query: 476 GMKAPHQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSN 535 P P+ ++Q S+ A+S+L+ R L + W L S LP T VS+ N Sbjct: 476 PGYTPASTGPVSVKQSVSYTALSELLARWLAQPAEQQQKPAWPVLLSDLPDTRFVSQQGN 535 Query: 536 AVVIQYQDKPYVRLNGGDW 554 V++ Q +++ G W Sbjct: 536 ITVMESQGSYWIKAPGAAW 554 >UniRef50_A6SYG8 Uncharacterized conserved protein n=1 Tax=Janthinobacterium sp. 
Marseille RepID=A6SYG8_JANMA Length = 507 Score = 565 bits (1455), Expect = e-159, Method: Composition-based stats. Identities = 158/542 (29%), Positives = 266/542 (49%), Gaps = 43/542 (7%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + W+ YF KF L + + F+ LNL+ A + + IA+ AL + Sbjct: 1 MQYWSLYFFAKFALYFNHAIKFNWYLNLLLAICVSFSFRHPRWRIAQQSIAIIAAIALLY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 +++ LP P+ ++++ + ++ +S YL++L RFIN + V+L ++ L+ +R + Sbjct: 61 YESSLPPPDRLLAEAANMSSYSFSYLLELFARFINLWYVVVFAVMLALYVLLAGRLRFSS 120 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP 200 FV +L + +L G P + G Sbjct: 121 FVFVGILLIPLLA---------QFGLPAQNLHLVAG------------------------ 147 Query: 201 PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDIEAAGLMSHPL 260 + A L+ FY E +R+ + PS A Q F+++++ +LSW D+ Sbjct: 148 --AQDPGAMLDTFYAQEKERRLSLPSIANAR-QAFDIVILQPGALSWDDLAFVDAPYPKF 204 Query: 261 WSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQH 320 + FD+ NFNSATS+S AA RLLR SCGQ + L + C LF+NL + G+ Sbjct: 205 LNRFDLVLLNFNSATSHSAAAAKRLLRGSCGQPGDSVLAEAPATGCSLFENLRQAGYDTA 264 Query: 321 LMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEK 380 +M HNG++ + + + G E + V + GFDG+PVYDD +VL++W + Sbjct: 265 AVMNHNGRYNRYAETISAYAGTGVE--EGKWGSVAMHGFDGTPVYDDFSVLSKWWNKHHA 322 Query: 381 DKNS-RSATFYNTLPLHDGNHYPG---VSKTADYKARAQKFFDELDAFFTELEKSGRKVM 436 A +YN++ LHDGN PG ++ YK R K + D F T+LE S R V+ Sbjct: 323 HPGGKPVALYYNSITLHDGNILPGPRAINSVQSYKPRLDKLLADFDHFVTQLEASNRPVV 382 Query: 437 VVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQ-GAPIVIEQPSSFL 495 V+++P HG A++GD++Q +G+R+IPSP +T VPV +K GM A + G P+ ++ +S+ Sbjct: 383 VILMPAHGAAMRGDQLQAAGMREIPSPKLTLVPVAIKLIGMAAAKEAGPPLEVKHATSYF 442 Query: 496 AISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWV 555 + L+ ++ G+ + + +TA VSEN +V++ +D +R W+ Sbjct: 443 GVFSLLADLMAGQANETAGKPLAERLQQVGETAFVSENEKIIVMRSKDGYIMRSAEDLWI 502 Query: 556 PY 557 Y Sbjct: 503 KY 504 >UniRef50_B8L0X3 Membrane 
protein ; endoglucanase BcsG n=5 Tax=Gammaproteobacteria RepID=B8L0X3_9GAMM Length = 182 Score = 247 bits (631), Expect = 8e-64, Method: Composition-based stats. Identities = 95/179 (53%), Positives = 116/179 (64%), Gaps = 5/179 (2%) Query: 386 SATFYNTLPLHDGNHYPGV---SKTADYKARAQKFFDELDAFFTELEKSGRKVMVVVVPE 442 A FYNT+ LHDGN G S ADYKARA+ ++ F +EKSGR+ M+VVVPE Sbjct: 1 MALFYNTISLHDGNRIVGADGRSNAADYKARAEMVLGDMAGFVDAVEKSGRRAMIVVVPE 60 Query: 443 HGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIEQPSSFLAISDLVV 502 HG AL GDRMQ+ G+R+IPSPSIT VPVGVK GM AP G P I +PSS+LA+S+LV Sbjct: 61 HGAALHGDRMQIPGMREIPSPSITHVPVGVKLVGMGAPAAGGPRHIPEPSSYLAVSELVS 120 Query: 503 RVLDGKIFTEDNV-DWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGD-WVPYPQ 559 RV + + +W L GLPQT VSEN A VI Y KP+++L G W PYP+ Sbjct: 121 RVYALNAQSPPSERNWDSLLKGLPQTPSVSENEGAKVIDYGGKPWLQLQGSQTWSPYPE 179 >UniRef50_B5WNI3 Putative uncharacterized protein n=1 Tax=Burkholderia sp. H160 RepID=B5WNI3_9BURK Length = 163 Score = 157 bits (398), Expect = 7e-37, Method: Composition-based stats. Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 6/163 (3%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW 80 + WNFYF+ K L G L LNL+FA LL+P L +R+ +A+ IG AL + Sbjct: 1 MGLWNFYFIAKLYLAGIGKLQPLWWLNLLFAIALLVPFGDRRLRVVRNLVAVVIGIALLY 60 Query: 81 HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITV 140 + L P M+ + + L+ R + + A+ L VA+ +S+W+R+T Sbjct: 61 FE--LGEPAFSMAAA----HLPQQHALALIVRIVPLSTLVALATLFVAYYVVSRWVRVTT 114 Query: 141 FVVAILLWLNVLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVA 183 FV+ L + V + + + + A Sbjct: 115 FVLISLFAILVWQGFSALTAQASSNVDSARACPPASPGRSEPA 157 >UniRef50_UPI00016ABB33 hypothetical protein Bpseu9_34329 n=5 Tax=pseudomallei group RepID=UPI00016ABB33 Length = 151 Score = 127 bits (318), Expect = 1e-27, Method: Composition-based stats. 
Identities = 39/138 (28%), Positives = 70/138 (50%), Gaps = 2/138 (1%) Query: 21 LSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPL--PRYSLHRLRHWIALPIGFAL 78 ++ WN YF++K L AG+L + NL FA L + R S+ LRH +AL + L Sbjct: 1 MTFWNLYFVLKLYLFAAGHLKPLWIANLGFALALALSAPARRRSVQLLRHALALALAVPL 60 Query: 79 FWHDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRI 138 + + +P ++ + FS Y ++LV RF+ + A +++ +L +++W+R+ Sbjct: 61 MYREADVPPLARLVETLGGLRAFSAGYWMELVPRFVPPMLALAALGVVIGYLIVNRWLRV 120 Query: 139 TVFVVAILLWLNVLTLAG 156 FV+ L+ L V Sbjct: 121 ATFVLLALIALPVWQAGS 138

Database: uniref50.fasta
  Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda     K        H
0.310      0.134    0.379

Lambda     K        H
0.267      0.0410   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,357,261,823
Number of Sequences: 3077464
Number of extensions: 143588339
Number of successful extensions: 447355
Number of sequences better than 1.0e-01: 24
Number of HSP's better than 0.1 without gapping: 45
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 447137
Number of HSP's gapped (non-prelim): 68
length of query: 559
length of database: 1,040,396,356
effective HSP length: 134
effective length of query: 425
effective length of database: 628,016,180
effective search space: 266906876500
effective search space used: 266906876500
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 96 (41.6 bits)
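
The summary figures above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the BLAST output. It assumes that the first Lambda/K pair holds the ungapped parameters and the second pair the gapped ones (the report does not label them), and that bit scores follow the usual Karlin-Altschul conversion S' = (lambda * S - ln K) / ln 2. The function names are hypothetical and chosen only for illustration.

```python
import math

# Values copied from the search summary above.
QUERY_LEN = 559
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 134  # "effective HSP length"

# Assumption: first Lambda/K pair is ungapped, second pair is gapped.
LAMBDA_UNGAPPED = 0.310
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert a raw X-dropoff value to bits: lambda * X / ln 2."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN))
    print(round(bit_score(96, LAMBDA_GAPPED, K_GAPPED), 1))   # compare with S2
    print(round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))        # compare with X1
    print(round(dropoff_bits(38, LAMBDA_GAPPED), 1))          # compare with X2
```

Under these assumptions the sketch reproduces the reported effective lengths (425 and 628,016,180), the effective search space (266906876500), and the bit values printed for S2, X1 and X2. Per-hit bit scores may differ slightly from this conversion, because the printed Lambda and K are rounded and composition-based statistics rescale the scoring per subject sequence.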