BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (745 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AFA6 Bacteriophage N4 adsorption protein B n=53 Tax=P... 1535 0.0 UniRef50_B7LP59 Bacteriophage N4 receptor, inner membrane subuni... 1333 0.0 UniRef50_B3GN83 Bacteriophage N4 adsorption NfrB-like protein n=... 681 0.0 UniRef50_B4SJA4 General secretory system II protein E domain pro... 424 e-117 UniRef50_B2UJM9 General secretory system II protein E domain pro... 400 e-110 UniRef50_A5P922 Bacteriophage N4 receptor, inner membrane subuni... 393 e-107 UniRef50_C6N7W0 Putative uncharacterized protein n=1 Tax=Legione... 339 3e-91 UniRef50_Q136N6 Bacteriophage N4 adsorption protein B n=1 Tax=Rh... 321 6e-86 UniRef50_B8JA61 Bacteriophage N4 adsorption protein B n=2 Tax=An... 289 3e-76 UniRef50_Q01UW7 Bacteriophage N4 receptor, outer membrane protei... 286 2e-75 UniRef50_Q1N778 Bacteriophage N4 adsorption protein B n=1 Tax=Sp... 278 7e-73 UniRef50_C8WHH7 General secretory system II protein E domain pro... 217 1e-54 UniRef50_Q1IXI4 Glycosyltransferase, NfrB-like protein n=2 Tax=D... 214 1e-53 UniRef50_Q2G4Z3 Bacteriophage N4 adsorption protein B n=2 Tax=Sp... 200 2e-49 UniRef50_Q1GVK0 Bacteriophage N4 adsorption protein B n=1 Tax=Sp... 199 5e-49 UniRef50_B8ICV2 General secretion pathway protein E n=2 Tax=Meth... 198 6e-49 UniRef50_D2LI28 General secretion pathway protein E n=1 Tax=Rhod... 189 4e-46 UniRef50_Q2NDA0 Probable inner membrane transmembrane protein n=... 115 7e-24 UniRef50_C6E323 Response regulator receiver protein n=3 Tax=Geob... 74 3e-11 UniRef50_B8FQB2 Type II secretion system protein E n=4 Tax=Clost... 70 3e-10 UniRef50_D1B5J6 Type II secretion system protein E n=3 Tax=Syner... 65 8e-09 UniRef50_C4L4L3 Type II secretion system protein E n=1 Tax=Exigu... 63 5e-08 UniRef50_B5E8D1 General secretory system II protein E domain pro... 62 7e-08 UniRef50_A0LUI4 Type II secretion system protein E n=4 Tax=Bacte... 62 9e-08 UniRef50_Q39ZG4 General secretory system II, protein E-like n=2 ... 62 1e-07 UniRef50_B9Y801 Putative uncharacterized protein n=1 Tax=Holdema... 60 2e-07 UniRef50_B1I3E7 Type II secretion system protein E n=4 Tax=Clost... 60 3e-07 UniRef50_C6CY56 Type II secretion system protein E n=3 Tax=Bacil... 59 6e-07 UniRef50_D1BGY1 Type II secretion system protein E (GspE) n=3 Ta... 59 6e-07 UniRef50_A9FI10 Family membership n=1 Tax=Sorangium cellulosum '... 59 6e-07 UniRef50_B5YDI2 Type IV pilus assembly protein PilB n=20 Tax=Bac... 59 1e-06 UniRef50_B8D2C7 Tfp pilus assembly protein PilB n=5 Tax=Firmicut... 59 1e-06 UniRef50_A1TEN6 Glycosyl transferase, family 2 n=7 Tax=Actinomyc... 58 1e-06 UniRef50_C8W5J2 Type II secretion system protein E n=2 Tax=Clost... 57 2e-06 UniRef50_D2R473 Type II secretion system protein E n=6 Tax=Planc... 57 2e-06 UniRef50_B0MQV8 Putative uncharacterized protein n=2 Tax=Clostri... 57 2e-06 UniRef50_Q1IRV6 Type II secretion system protein E n=24 Tax=Bact... 57 2e-06 UniRef50_B0MCL3 Putative uncharacterized protein n=1 Tax=Anaeros... 57 3e-06 UniRef50_B8E2U7 Type II secretion system protein E n=2 Tax=Dicty... 57 3e-06 UniRef50_C8WIR5 Type II secretion system protein E n=2 Tax=Bacte... 56 4e-06 UniRef50_Q1YKP1 Putative glycosyl transferase n=1 Tax=Aurantimon... 56 5e-06 UniRef50_C4DGM8 Glycosyl transferase n=1 Tax=Stackebrandtia nass... 56 5e-06 UniRef50_Q1J1R8 Tfp pilus assembly pathway, ATPase PilB n=7 Tax=... 56 6e-06 UniRef50_A3DDQ8 Type II secretion system protein E n=3 Tax=Clost... 55 8e-06 UniRef50_Q094V5 General secretion protein E N-terminal domain pr... 55 9e-06 UniRef50_C0QQ17 Type IV pilus assembly protein TapB n=2 Tax=Bact... 55 1e-05 UniRef50_D2R471 Type II secretion system protein E n=6 Tax=Planc... 55 1e-05 UniRef50_B9M6X5 General secretory system II protein E domain pro... 55 1e-05 UniRef50_A9G5Y5 Putative uncharacterized protein n=1 Tax=Sorangi... 55 1e-05 UniRef50_Q1D133 General secretion pathway protein E, N-terminal ... 55 1e-05 UniRef50_C6E8N7 General secretory system II protein E domain pro... 54 2e-05 UniRef50_Q1D9E1 General secretory pathway protein E n=17 Tax=Pro... 54 2e-05 UniRef50_UPI0001B50A66 glycosyl transferase family protein n=1 T... 54 2e-05 UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophi... 54 3e-05 UniRef50_C6PR94 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 53 4e-05 UniRef50_Q7UE44 General secretion pathway protein E n=1 Tax=Rhod... 53 5e-05 UniRef50_A5G3U1 General secretory system II, protein E domain pr... 53 5e-05 UniRef50_Q08RH0 Gspii_e N-terminal domain family n=1 Tax=Stigmat... 52 7e-05 UniRef50_B3E4P2 Response regulator receiver protein n=1 Tax=Geob... 52 8e-05 UniRef50_B0VJ41 Type IV pilus biogenesis protein PilB n=1 Tax=Ca... 52 8e-05 UniRef50_C6XAP3 Type II secretion system protein E n=1 Tax=Methy... 52 1e-04 UniRef50_Q1Q109 Strongly similar to general secretory system typ... 52 1e-04 UniRef50_Q13CF3 Glycosyl transferase, family 2 n=10 Tax=Bradyrhi... 52 1e-04 UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=F... 52 1e-04 UniRef50_B5E974 General secretory system II protein E domain pro... 52 1e-04 UniRef50_Q1D3E0 General secretion pathway protein E, N-terminal ... 51 1e-04 UniRef50_C0QER1 PilB n=10 Tax=Proteobacteria RepID=C0QER1_DESAH 51 2e-04 UniRef50_A9DFB5 Putative uncharacterized protein n=1 Tax=Hoeflea... 51 2e-04 UniRef50_B5YIG2 Type IV-A pilus assembly ATPase PilB n=4 Tax=Bac... 51 2e-04 UniRef50_A5GB67 Response regulator receiver protein n=4 Tax=Geob... 51 2e-04 UniRef50_A8URM8 Putative uncharacterized protein n=3 Tax=Hydroge... 51 2e-04 UniRef50_B6QYB3 Bacteriophage N4 receptor n=1 Tax=Pseudovibrio s... 51 2e-04 UniRef50_C6J4S3 Glycosyl transferase n=1 Tax=Paenibacillus sp. o... 51 2e-04 UniRef50_B5YHZ6 Type IV pilin n=1 Tax=Thermodesulfovibrio yellow... 50 2e-04 UniRef50_A3DEG0 Type II secretion system protein E n=5 Tax=Clost... 50 3e-04 UniRef50_Q0F0L3 Putative uncharacterized protein n=1 Tax=Maripro... 50 3e-04 UniRef50_B0TEE9 Type ii secretion system protein e, putative n=1... 50 3e-04 UniRef50_B3E1K8 Response regulator receiver protein n=5 Tax=Geob... 50 4e-04 UniRef50_Q3A899 Type II secretory pathway and PulE/Tfp pilus ass... 50 4e-04 UniRef50_Q3SKS0 Pilus assembly pathway ATPase PilB n=3 Tax=Prote... 50 4e-04 UniRef50_UPI0001C3693E pili biogenesis protein PilB-like ATPase ... 50 4e-04 UniRef50_Q181B0 Type IV pilus assembly protein n=5 Tax=Clostridi... 50 4e-04 UniRef50_Q0ASE8 Glycosyl transferase, family 2 n=1 Tax=Maricauli... 49 5e-04 UniRef50_C6E6L0 Response regulator receiver protein n=3 Tax=Geob... 49 6e-04 UniRef50_A1ALE6 General secretory system II, protein E domain pr... 49 6e-04 UniRef50_C1XWL4 Type II secretory pathway, ATPase PulE/Tfp pilus... 49 7e-04 UniRef50_A1SEL1 General secretory system II, protein E domain pr... 49 9e-04 UniRef50_A3VPZ4 Putative uncharacterized protein n=2 Tax=Bacteri... 48 0.001 UniRef50_Q0G6A6 Glycosyl transferase, family 2 n=1 Tax=Fulvimari... 47 0.002 UniRef50_D0LP33 General secretory system II protein E domain pro... 47 0.002 UniRef50_A3VHJ3 Glycosyl transferase, family 2 n=2 Tax=Rhodobact... 47 0.002 UniRef50_B1YJT8 Type II secretion system protein E n=5 Tax=Bacil... 47 0.002 UniRef50_Q39Q61 General secretory system II, protein E-like n=2 ... 47 0.003 UniRef50_A8ZYV7 Type II secretion system protein E n=11 Tax=Delt... 47 0.003 UniRef50_C8W151 Type II secretion system protein E n=1 Tax=Desul... 47 0.003 UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli s... 47 0.004 UniRef50_C6MU90 Type II secretion system protein E n=1 Tax=Geoba... 47 0.004 UniRef50_B9M3F1 General secretory system II protein E domain pro... 46 0.005 UniRef50_Q1D3E1 General secretion pathway protein E, N-terminal ... 46 0.005 UniRef50_A6Q5B1 General secretory pathway protein E n=3 Tax=Epsi... 46 0.005 UniRef50_Q7UEJ7 Probable general secretion pathway protein E n=1... 46 0.006 UniRef50_B2A7G1 Type II secretion system protein E n=5 Tax=Firmi... 46 0.007 UniRef50_B4UHA9 General secretory system II protein E domain pro... 45 0.007 UniRef50_B8JAJ3 General secretory system II protein E domain pro... 45 0.007 UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6... 45 0.009 UniRef50_Q1D416 General secretory system II protein E, N-termina... 45 0.010 UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaprot... 45 0.012 UniRef50_C1AA14 Type IV pilus assembly protein PilB n=1 Tax=Gemm... 45 0.012 UniRef50_Q1AWU6 Type II secretion system protein E n=1 Tax=Rubro... 44 0.018 UniRef50_A6M148 Type II secretion system protein E n=18 Tax=Clos... 44 0.018 UniRef50_Q08Q27 Serine/threonine kinase PKN11 n=1 Tax=Stigmatell... 44 0.021 UniRef50_D2LBM7 Glycosyl transferase family 2 n=1 Tax=Rhodomicro... 44 0.024 UniRef50_C3RLB0 Type II secretion system protein E n=2 Tax=Bacte... 44 0.027 UniRef50_Q098P0 Putative ATPase n=1 Tax=Stigmatella aurantiaca D... 44 0.027 UniRef50_Q01NU4 Type II secretion system protein E (GspE) n=1 Ta... 44 0.029 UniRef50_A6C0T8 Type IV fimbrial assembly protein PilB n=1 Tax=P... 44 0.030 UniRef50_C7IFY8 General secretory system II protein E domain pro... 44 0.031 UniRef50_A1AV36 General secretory system II, protein E domain pr... 43 0.036 UniRef50_B0T1N0 Putative uncharacterized protein n=1 Tax=Cauloba... 43 0.037 UniRef50_A3UGE7 Putative uncharacterized protein n=1 Tax=Oceanic... 43 0.039 UniRef50_B1Y3Z9 General secretory system II protein E domain pro... 43 0.042 UniRef50_Q08MP9 Gspii_e N-terminal domain family (Fragment) n=1 ... 43 0.044 UniRef50_C6MLR2 Type II secretion system protein E n=1 Tax=Geoba... 42 0.089 >UniRef50_P0AFA6 Bacteriophage N4 adsorption protein B n=53 Tax=Proteobacteria RepID=NFRB_ECO57 Length = 745 Score = 1535 bits (3973), Expect = 0.0, Method: Compositional matrix adjust. Identities = 745/745 (100%), Positives = 745/745 (100%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY Sbjct: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV Sbjct: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR Sbjct: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT Sbjct: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF Sbjct: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG Sbjct: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ Sbjct: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV Sbjct: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ Sbjct: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK Sbjct: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ Sbjct: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS Sbjct: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 Query: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 MQSLLLKAGLNTEQVAQLESENEGE Sbjct: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 >UniRef50_B7LP59 Bacteriophage N4 receptor, inner membrane subunit n=7 Tax=Bacteria RepID=B7LP59_ESCF3 Length = 750 Score = 1333 bits (3451), Expect = 0.0, Method: Compositional matrix adjust. Identities = 631/744 (84%), Positives = 687/744 (92%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 ++WLLD+F+TWLYGLK IAI LA++M ISGLDD FIDVVYW+RR+KR LSVYRRYPRM+Y Sbjct: 7 VEWLLDLFSTWLYGLKFIAIALAIMMLISGLDDLFIDVVYWLRRVKRSLSVYRRYPRMNY 66 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV Sbjct: 67 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 126 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSA FAFAGFILHDAEDVISPMELR Sbjct: 127 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSAKFAFAGFILHDAEDVISPMELR 186 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 LFNYLV+RKDLIQIPVYPFER+WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT Sbjct: 187 LFNYLVDRKDLIQIPVYPFERKWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 246 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 CFSRRA++ALLADGDGIAFDVQSLTEDYDIGFRLKEKGM+EIFVRFPVVD+ K E RK Sbjct: 247 CFSRRAISALLADGDGIAFDVQSLTEDYDIGFRLKEKGMSEIFVRFPVVDDGKTGEPRKL 306 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 Q RT NMICVREYFPDTF+TAVRQKSRWIIGIVFQGFKTHKWTS+L LNYFLWRDRKG Sbjct: 307 FQSKRTHNMICVREYFPDTFTTAVRQKSRWIIGIVFQGFKTHKWTSNLILNYFLWRDRKG 366 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 AISNF+SF+AMLV IQL+LL+ Y++ WP+AWHFLSIF+ SA TLLW+NF LMVNRIVQ Sbjct: 367 AISNFISFIAMLVFIQLMLLMLYQTFWPNAWHFLSIFTDSAAFTTLLWMNFALMVNRIVQ 426 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 RVIFVTGYYGLTQG+LSVLRL WGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV Sbjct: 427 RVIFVTGYYGLTQGILSVLRLCWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 486 Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 +G+ R+LRPLGQILLEN VITE QL+ AL NR++GLRLGGSMLMQGLI+A+QLAQALAEQ Sbjct: 487 SGENRALRPLGQILLENHVITETQLEQALTNRIQGLRLGGSMLMQGLITAQQLAQALAEQ 546 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 NGV WES+DAWQIP LI ++PASVALHYAVLPLR+E+D L+VGSEDGIDPVSLAAL+RK Sbjct: 547 NGVGWESVDAWQIPRYLIEQIPASVALHYAVLPLRIEDDVLVVGSEDGIDPVSLAALSRK 606 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 GR+VRYVIVLRGQ+VTGLRHWYARRRG D R +L AV +WLT QQ EIW+Q+V HQ Sbjct: 607 TGRQVRYVIVLRGQVVTGLRHWYARRRGRDARELLEQAVLRRWLTPQQQTEIWQQFVQHQ 666 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 FLFAE+LTTLGHINRSAIN LLLRHERS PLG FLV EGVISQETLDRVL+IQ+ LQVS Sbjct: 667 FLFAEVLTTLGHINRSAINALLLRHERSDRPLGAFLVAEGVISQETLDRVLSIQQNLQVS 726 Query: 721 MQSLLLKAGLNTEQVAQLESENEG 744 MQSLL AGL T Q+A+LE+++EG Sbjct: 727 MQSLLQAAGLTTMQIAELETDHEG 750 >UniRef50_B3GN83 Bacteriophage N4 adsorption NfrB-like protein n=1 Tax=Zymomonas mobilis subsp. mobilis RepID=B3GN83_ZYMMO Length = 729 Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust. Identities = 349/714 (48%), Positives = 461/714 (64%), Gaps = 9/714 (1%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 + IAI +A+++ + G+DD FID +W+R I R+ +Y YP ++L+ +EKPLAIM Sbjct: 20 FRYIAIFVAILVTLFGIDDIFIDSCFWIRSIYRRFFIYSHYPHADEKQLFSKNEKPLAIM 79 Query: 75 VPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 VPAW E GV+ NMA LAA TLDYENYHIFVGTYPNDP+TQ DVD V +++PNVHK+VCAR Sbjct: 80 VPAWREVGVVANMARLAAETLDYENYHIFVGTYPNDPETQNDVDAVVSQYPNVHKIVCAR 139 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQI 194 PGPTSKADCLNNV+DAI FE +A FAGFILHDAEDVISP+ELRLFNYLV RKD+IQI Sbjct: 140 PGPTSKADCLNNVIDAIFHFEEAAAIEFAGFILHDAEDVISPLELRLFNYLVARKDMIQI 199 Query: 195 PVYPF-EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 PVYPF + FT Y+DEFSE HGKDV VREAL GQVPSAGVGTCFSRRA+T LL + Sbjct: 200 PVYPFISDRFGDFTRNHYVDEFSEHHGKDVVVREALTGQVPSAGVGTCFSRRAITLLLKE 259 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE-QRKFLQHARTSNMICV 312 DG FD SLTEDYDI FRL +GM+ IF R+PV D ++K R + +ICV Sbjct: 260 SDGFPFDTTSLTEDYDISFRLYREGMSCIFARYPVTDPQYAFPIKQKIGMDRRYTQVICV 319 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 RE+FPD F AVRQKSRWI GIVFQG + W +NYFLWRDR+G I+N V FLA + Sbjct: 320 REHFPDHFKYAVRQKSRWITGIVFQGTRNLGWEHRAIMNYFLWRDRRGIITNIVGFLANI 379 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 ++ + LL +L W F+S+ S +A L LLW+N +++NR QR FVT YYG+ Sbjct: 380 LLFFVALLWIISALNLKGWSFMSVLSDNALLSVLLWVNGFILLNRAAQRCFFVTKYYGIK 439 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQ 492 QGL S R+ WGN++N A RA QV+ G+ +R+AWDKTTHDFPS+ R P+G Sbjct: 440 QGLTSPFRMVWGNIVNSFACIRAFWQVITIGNIKRMAWDKTTHDFPSIPVSRRE--PIGL 497 Query: 493 ILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 ++ + L+ L+ + RLG +L++GLI++EQLA+ALA Q + S + + Sbjct: 498 WMVAQNFLKNSDLEQVLQAPRQH-RLGQELLLRGLINSEQLAKALAHQASLKAVSFNIFY 556 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLR 612 + S LIA P +A YAVLP + L + E + P+SL ++R +G V +I + Sbjct: 557 LDSKLIAAFPRYLACRYAVLPFSQKGKALQLICEHALSPISLGVISRHIGLNVECLIAPQ 616 Query: 613 GQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGH 672 G++ GLR+WY +G+ P + + + L + E + F +IL T GH Sbjct: 617 GRVTLGLRYWYP-GQGNQPST---DRIIKELLKDPNNIEKQDTVCIYLAQFGDILQTTGH 672 Query: 673 INRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLL 726 I +L+ + + LG++LV +ISQE L+ L Q + + + +LL Sbjct: 673 IPEPIFAQVLIDFDPDKMKLGEYLVKRKLISQEILEECLKEQNKQEEMAEKVLL 726 >UniRef50_B4SJA4 General secretory system II protein E domain protein n=15 Tax=Proteobacteria RepID=B4SJA4_STRM5 Length = 715 Score = 424 bits (1091), Expect = e-117, Method: Compositional matrix adjust. Identities = 278/718 (38%), Positives = 388/718 (54%), Gaps = 58/718 (8%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRR--YPRMSYRELYKPDEKPLAIMVPAWNETGV 83 + IS LDD FIDV YWVR R L++ RR Y ++ +L + E+PLAIMVPAW E V Sbjct: 28 ILISSLDDLFIDVWYWVRESWRALTIKRRDAYKPLTQEDLLQRPEQPLAIMVPAWMEYDV 87 Query: 84 IGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADC 143 I M E LDY Y +FVGTYPND T +V+ + R+ + +V GPTSKADC Sbjct: 88 IAQMVENMINVLDYREYVVFVGTYPNDQQTIDEVERMRRRYKRLRRVEVPHDGPTSKADC 147 Query: 144 LNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREW 203 LN ++ AI ++E+ + FAG ILHD+EDV+ PMELR +NYL+ RKD+IQ+PV +REW Sbjct: 148 LNWLILAIFEYEKRHDIEFAGVILHDSEDVLHPMELRFYNYLLPRKDMIQLPVTSLDREW 207 Query: 204 THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQS 263 + Y+DEF+E H KD+ VRE+++G VPSAGVGTCFSRRA+ AL A D F+ S Sbjct: 208 YELVAGVYMDEFAEWHAKDLVVRESVSGMVPSAGVGTCFSRRALLALSAQTDNQPFNTDS 267 Query: 264 LTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF---LQHARTSNM-ICVREYFPDT 319 LTEDYD+G RL GM IF RFPV + + R F RT M +CVREYFPD Sbjct: 268 LTEDYDVGARLAAMGMQSIFARFPV--QFRVRRPSWFGWGPVRERTQQMALCVREYFPDN 325 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLL 379 F + RQK+RW++GI Q +++ W SL Y L RDRKG I++FVS +A ++ +QLLL Sbjct: 326 FRASYRQKARWVLGIGLQSWESLGWRGSLATKYLLARDRKGIITSFVSIIAYVIFLQLLL 385 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 + F S+F W M + L + R+VQR FV YG L+S+ Sbjct: 386 FWLLKMTGVWTMQFPSVFQPGTWQMNVALLTTAALATRVVQRFYFVNRLYGWEHALMSIP 445 Query: 440 RLFWGNLINFMANWRALKQVLQH---GDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLE 496 R+ GN+INFMA RA K L + G +R+ WDKT HDFP ++ + LG++L Sbjct: 446 RMVVGNMINFMATARAWKVFLAYLLFG--KRMVWDKTMHDFPDAAQLVQTRKQLGELLGT 503 Query: 497 NQVITEEQLDTALRNRVEGLR--LGGSMLMQGLISAEQLAQALAEQNGVAWESID----- 549 Q + E+L AL + G + LG +L QG + E LA+A+A Q + ID Sbjct: 504 WQAVEPERLQQALDQQHAGRQQPLGRILLTQGWLDDETLAEAIAFQGDLPRAVIDVDYLR 563 Query: 550 AWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK--VRY 607 A Q P S A + + +LPL + + + P AL ++ R + Sbjct: 564 ACQFPVS------ADACVQWRMLPLPPRQEGTLRLAVASPLPEEALALLKQETRSEHIEQ 617 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEIL 667 I +I GLR R+ H L N VP L ++L Sbjct: 618 SIARESEINAGLRLIGGDRQWH-----LDN-------------------VP---LLGDLL 650 Query: 668 TTLGHINRSAINVLL--LRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 + I+ + + L + +R +G +LV +G+ ++E + + + QR ++QS Sbjct: 651 VEMRLIDHARFEIALDDYKPQRDGR-IGDYLVKQGITTEEAVAQAMQEQRRRAATLQS 707 >UniRef50_B2UJM9 General secretory system II protein E domain protein n=12 Tax=cellular organisms RepID=B2UJM9_RALPJ Length = 703 Score = 400 bits (1028), Expect = e-110, Method: Compositional matrix adjust. Identities = 236/607 (38%), Positives = 338/607 (55%), Gaps = 20/607 (3%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 L + + +++ IS DDFF+D YWVR + R +S + L +E+ LAIM Sbjct: 13 LNTLTVATTLVILISTADDFFLDAFYWVRELWLWPQRGRTPVTISAQALRDREEQWLAIM 72 Query: 75 VPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN-VHKVVCA 133 VPAW E VI M E T++Y + IF G Y ND +T +V+ + R+P V + Sbjct: 73 VPAWKEYDVIAKMVENTLATMEYTRFIIFAGAYRNDAETTTEVERMVRRYPGRVVRAAVT 132 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQ 193 GPT KADCLN ++ I ++E FAG I+HD EDVI P+EL+ FNY + +DL+Q Sbjct: 133 HDGPTCKADCLNTIIQTIIRYEAGHGIRFAGVIMHDCEDVIHPLELKYFNYFISDQDLVQ 192 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 +PV ER+W + + TY+D+FSE H KD+ R+AL G VP AGV C+SRRA+ A++ Sbjct: 193 LPVLSLERKWYEWVAGTYMDDFSETHQKDLVARQALTGTVPGAGVALCYSRRAIEAVMKV 252 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPV------VDEAKEREQRKFLQHARTS 307 F+ +LTEDYD FRL+E GM E FV FPV V + R+ + + R Sbjct: 253 RGDAPFNTSTLTEDYDFSFRLRELGMREAFVHFPVCENTAPVADGTGRQPTHWWTNRRRE 312 Query: 308 ---NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISN 364 ++ REYFP TF TA RQ++RW++GI FQG+ W +L Y +RDRKG ++ Sbjct: 313 ARPQLLATREYFPSTFRTAYRQRARWVLGIAFQGWLQMGWKGNLITKYMFFRDRKGVLTA 372 Query: 365 FVSFLA-MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 S LA L + LL+ + + W A + G+ W+ LL +N L++NR+ QRV Sbjct: 373 LFSILAYALSLNYLLVAVLLDKGWVTASEG-AFVVGTIWMQDLLAINATLLINRLAQRVY 431 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQH---GDPRRVAWDKTTHDFPSV 480 FV G Q +L + RL N INF + RA K L + G P +AWDKT H + S Sbjct: 432 FVGRLNGPLQAVLCLPRLVVNNFINFFSVCRAWKIFLIYCFTGKP--IAWDKTQHTYLSN 489 Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAE 539 R+ LG+ LL+ +VIT+EQLD AL + G RLG ++ QGL++ + LA ALAE Sbjct: 490 DALGRTRCKLGETLLKWEVITQEQLDAALAIQQQTGRRLGQVLVQQGLVTPDTLADALAE 549 Query: 540 QNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL-ENDELIVGSEDGIDPVSLAALT 598 Q + S+ + +L +P +A+ + V+P + E+ L + + D +L L Sbjct: 550 QADLPRVSLTN-VVLGALADCLPRDLAVRHHVVPFSIGEDGSLNIAVSELPDGEALQELA 608 Query: 599 RKVGRKV 605 R GRKV Sbjct: 609 RAAGRKV 615 >UniRef50_A5P922 Bacteriophage N4 receptor, inner membrane subunit n=1 Tax=Erythrobacter sp. SD-21 RepID=A5P922_9SPHN Length = 698 Score = 393 bits (1009), Expect = e-107, Method: Compositional matrix adjust. Identities = 258/728 (35%), Positives = 382/728 (52%), Gaps = 64/728 (8%) Query: 10 TWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPR-MSYRELYKPDE 68 +++ + A+ +A ++ IS LDD F+D V+W+ KR+ + + PR +S L + E Sbjct: 4 SYIVAFECAALVVATLIAISSLDDLFVDSVFWIAMAKRRF-LGKGEPRTVSPETLIERPE 62 Query: 69 KPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 P+AIM+PAW E VI +M E A TL Y NY IF+G+Y NDP+T +V+++ AR+ V Sbjct: 63 APIAIMLPAWQEADVIASMVENAIHTLVYRNYFIFIGSYANDPETILEVEKLAARYGRVR 122 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER 188 V GPT KADCLN+++ I + E+ + FAG +LHD+EDV+ P+EL LFNYL+ Sbjct: 123 HVRVPHYGPTCKADCLNHIVADILRLEKEVDIEFAGLVLHDSEDVLHPLELHLFNYLLPS 182 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 +D+IQ+PV E+ +T F + TY+D+F+E H KD+ VR+ LA VPSAGVGTCFSRRA+ Sbjct: 183 RDMIQLPVVSLEQRFTDFVAGTYMDDFAESHAKDLVVRQMLAKSVPSAGVGTCFSRRAIE 242 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR--- 305 +L G+ F+ Q+LTEDYD+G RL ++G+ +PV E R+F R Sbjct: 243 VMLEAGE--PFNTQTLTEDYDVGSRLAKRGLNASIELYPV-----EFRSRQFGHFGRGPE 295 Query: 306 ----TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA 361 TS +CVRE+FP+TF + RQK+RWI+GI QG+ W S+ NYFL RDRK Sbjct: 296 RVGTTSKPLCVREHFPNTFRASYRQKARWILGIALQGWAQLGWDRSIVSNYFLCRDRKAL 355 Query: 362 ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLS------IFSGSAWLMTLLWLNFGLMV 415 I+ ++ LA +L L W F S IFS L N + Sbjct: 356 ITPTLAVLAY--------VLTAMYLGATIWSFASGGAAIPIFSNHPIASYLFSFNLFALA 407 Query: 416 NRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQ---HGDPRRVAWDK 472 R+VQRV FV Y T L+V R+ + INF A+ RA++ + G+P +AWDK Sbjct: 408 ARVVQRVYFVAKIYCWTHAFLAVPRMVVLSFINFAASVRAIRIFVGSKFSGNP--IAWDK 465 Query: 473 TTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAE 531 T H FPS + R LG+IL V++ L+ A L + EG LG ++ G+I + Sbjct: 466 TNHRFPSDEALGKEKRRLGEILRGWDVVSSPMLEKALLYQKREGGMLGDLLVRDGVIDED 525 Query: 532 QLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDP 591 L +A++ QN Q+P AE+ + + L R +L + GI Sbjct: 526 VLTEAISTQN----------QLPR---AELNLDMVCEHLDLLDRATMTQLQI-LPFGISS 571 Query: 592 VSLAALTRKVGRKVRYVIVLRGQIVTGLR-HWYARRRGHDPRAMLYNAVQHQWLTEQQAG 650 A L ++ ++ G R H + R + A L N + + Sbjct: 572 KGEALLAVAKPLACEQTRLIWSRMSKGYREHIVPQSRIKEILAALTNIPERNF------- 624 Query: 651 EIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSL-PLGKFLVTEGVISQETLDR 709 VP E+L + + + + LL + + +G+FLV +G I+Q TLD+ Sbjct: 625 -----PVPSVPRVHELLLSQKQLKKKELQNLLKDYNVARHGTIGQFLVAKGTITQATLDK 679 Query: 710 VLTIQREL 717 L ++ L Sbjct: 680 TLELRTSL 687 >UniRef50_C6N7W0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N7W0_9GAMM Length = 501 Score = 339 bits (869), Expect = 3e-91, Method: Compositional matrix adjust. Identities = 188/488 (38%), Positives = 276/488 (56%), Gaps = 13/488 (2%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 + L+ + ISG+DD F D YW+R + R L R Y ++Y +L + +E+ +A+++P W+ Sbjct: 18 VALSCLFIISGIDDLFFDGYYWIRYVFR-LWKTRGYKPLTYEQLAEKEEQMIAVLIPCWH 76 Query: 80 ETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 E GVIG M + ++DY NY++FVG YPNDP+T +V EV NV V+ PGPT+ Sbjct: 77 EAGVIGTMLKHNCYSIDYSNYYLFVGVYPNDPETVNEVQEVANLIKNVRCVIGTTPGPTN 136 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPF 199 KA LN + + + FE+ N +F+ F+ HD+ED+I PM +L+NYL+ RK++IQIPV+P Sbjct: 137 KAANLNGIYNYVKAFEKELNRSFSIFVFHDSEDIIHPMSFKLYNYLMPRKEMIQIPVFPL 196 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 E + +FT Y DEFSE H KD+ VRE++ G VPSAGVGT FSR A+ L F Sbjct: 197 EINYWNFTHWLYADEFSENHTKDIIVRESIHGHVPSAGVGTAFSRHALKLLEDPTTRTPF 256 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ---RKFLQHARTSNMICVREYF 316 SLTEDY ++ KG+ +IFV +V K R + RK T I R F Sbjct: 257 STDSLTEDYRTSLAIRIKGLKQIFVTETIV-RMKWRPRGFFRKGYVQKPTREYIATRALF 315 Query: 317 PDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQ 376 P ++ AVRQK+RWIIGIVFQ ++ +W + + L DRK I++F++ V + Sbjct: 316 PLEYTKAVRQKARWIIGIVFQEWQHTQWPKEWIIRFTLAHDRKSFITHFINGFGYFVFLF 375 Query: 377 LLL--LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 L+ L Y + P+ F+ W+ L+ +M+ R++QR+I + YG Sbjct: 376 WLVYSLCTYTN--PEYPSLQEQFNLHPWVWWLIVTVTLMMIERMIQRMIAIRRVYGWIPS 433 Query: 435 LLSVLRLFWGNLINFMANWRALKQVL----QHGDPRRVAWDKTTHDFPSVTGDTRSLRPL 490 LS+ R F+GNL+N A RA ++ +WDKT H FP T + + Sbjct: 434 FLSIPRTFYGNLLNLHALIRAYHVYYTTPKSQATSKQPSWDKTDHHFPGSHILTPYRKKI 493 Query: 491 GQILLENQ 498 G +LLE + Sbjct: 494 GDLLLEKK 501 >UniRef50_Q136N6 Bacteriophage N4 adsorption protein B n=1 Tax=Rhodopseudomonas palustris BisB5 RepID=Q136N6_RHOPS Length = 497 Score = 321 bits (823), Expect = 6e-86, Method: Compositional matrix adjust. Identities = 178/467 (38%), Positives = 254/467 (54%), Gaps = 6/467 (1%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 ++++ + AV++ +S +DD +D++YW RR+ R + + ++ E + P+A++ Sbjct: 15 VEIMLVVTAVLVALSSIDDLVVDLLYWGRRLTRP-NAFDATADLATMEAIP--QAPIAVI 71 Query: 75 VPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 +PAW E VI +M T YENYH+FVG Y ND T +V A+ VH VV R Sbjct: 72 IPAWQEHEVIFSMLAANQATTKYENYHLFVGAYQNDAATLTEVRRAEAQSNRVHLVVVPR 131 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQI 194 GPTSKADCLN V + + FE++ FAG +LHDAED+I P EL LFN++ D IQ+ Sbjct: 132 DGPTSKADCLNVVANGVFAFEQAKGIQFAGLVLHDAEDLIHPYELVLFNFMAHDNDFIQL 191 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 PV+ F+R Y+DEF+E H KD+PVR ++G VP AGV F R +AD Sbjct: 192 PVFSFKRPLRELVGGVYMDEFAESHLKDIPVRRMISGLVPCAGVAAFFGRDIALRTMADN 251 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 G F SLTEDYD RL G FV P + I RE Sbjct: 252 AGSLFRSDSLTEDYDFALRLGLLGARVNFVIAPASYTIDISSSTDLPEIVGRKLPIATRE 311 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 +FP++F A RQ++RW++GIVFQG ++ W + + Y L RDRK ++ + LA LV+ Sbjct: 312 FFPNSFVAAQRQRARWLMGIVFQGTRSFGWRGTTGIKYALLRDRKSILTAPLIMLAYLVL 371 Query: 375 IQLLLL-LAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 L+ + L + PD + + + LLWLNF ++ R++ R F YGL Sbjct: 372 FGLVSVNLYFRWYLPDEVNQFPLLQ-EPLVQQLLWLNFAFLIWRLLHRFYFTNRIYGLRH 430 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQHG-DPRRVAWDKTTHDFPS 479 GL+S+ RL GN +NF A RA + L H R+ WDKT H +P+ Sbjct: 431 GLMSIPRLPLGNFLNFFAVARACRLYLSHSLLGTRLVWDKTEHQYPT 477 >UniRef50_B8JA61 Bacteriophage N4 adsorption protein B n=2 Tax=Anaeromyxobacter RepID=B8JA61_ANAD2 Length = 502 Score = 289 bits (739), Expect = 3e-76, Method: Compositional matrix adjust. Identities = 169/469 (36%), Positives = 249/469 (53%), Gaps = 18/469 (3%) Query: 16 KVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMV 75 +V+A L + ++ LD+ FIDV Y RR+ R+ + +S L + + K AI++ Sbjct: 9 RVMAGPLGGAILLNQLDELFIDVNYLARRLHRRSATA-----VSAALLRRVEPKRTAILL 63 Query: 76 PAWNETGVIGNMAELAATTLDY--ENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 PAW E VI M EL + +D+ + Y F GTY NDP TQ VD AR V KVV Sbjct: 64 PAWREEDVIERMLELNVSRIDFPRDRYVFFCGTYQNDPATQARVDRAAARGWPVRKVVVP 123 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQ 193 GPTSKADCLN + + ER F ++HDAEDVI P+ LRL++ LV + + +Q Sbjct: 124 HAGPTSKADCLNWIYQGVVLHERERGTRFDILLMHDAEDVIHPLALRLYSLLVPKHEFVQ 183 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 PV+ + + + TYIDEF+E H K++PVR+A+ G +PSAGVG+ F RRA + Sbjct: 184 TPVFSLPLDASQVVAGTYIDEFAEHHLKELPVRQAIGGLIPSAGVGSAFERRAFEQIALA 243 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 FD SLTEDY+IG R + F + + + + E + I R Sbjct: 244 HAQQPFDPASLTEDYEIGLRFRLARRRTHFACYRIAADPDDPEAPAH------DDPIATR 297 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 EYFPD F +VRQ+SRWI+GI Q +++ W + Y LWRDRK ++N + L+ + Sbjct: 298 EYFPDRFQASVRQRSRWILGISLQTWESAGWQGPAAVRYCLWRDRKAVLTNALLALSYAL 357 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 + +++ + + +W I + LL +N + R+ ++ FV YG Sbjct: 358 LAYVVVRVWTAGMTGASWSPARIVPAGGLIQALLLVNLAGFLLRVGVKMGFVGRLYGARL 417 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQH---GDPRRVAWDKTTHDFPS 479 L + RL N+I+ A RA+ ++H G+P R W KT+H FPS Sbjct: 418 ATLCLPRLLVANVISLAATARAVVTYVRHLVTGEPLR--WVKTSHAFPS 464 >UniRef50_Q01UW7 Bacteriophage N4 receptor, outer membrane protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01UW7_SOLUE Length = 466 Score = 286 bits (732), Expect = 2e-75, Method: Compositional matrix adjust. Identities = 181/490 (36%), Positives = 244/490 (49%), Gaps = 47/490 (9%) Query: 20 ITLAVIMFISGLDDFFIDVV-YWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAW 78 + +AV + ISGLDD FI +V + R+ R+P S +L E+P+AI VP W Sbjct: 1 MPVAVWILISGLDDLFITMVGFATSRV--------RFPWPSSGDLKSAAEQPIAIFVPLW 52 Query: 79 NETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT 138 +E VIG M E + Y NYH+F G YPND T R V+ A P +H +C GPT Sbjct: 53 HEHRVIGRMLEHNLAAVRYGNYHVFAGVYPNDTPTLRAVELQAAVHPKIHTAICPHDGPT 112 Query: 139 SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYP 198 SK DCLN + + +E F +LHDAED+I P LRL N+ +++Q+PV P Sbjct: 113 SKGDCLNWIYQHMRAWEARHGTRFRVVVLHDAEDLIDPESLRLINWFSRDYEMVQVPVLP 172 Query: 199 FE---REWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGD 255 +EWTH Y DEF+E KD+PVR+ L G +PS GVGT F R A+ L + Sbjct: 173 LATAVKEWTH---GLYCDEFAEYQRKDIPVRQQLGGFLPSNGVGTGFGRDALERLADGRN 229 Query: 256 GIAFDVQSLTEDYDIGFRLKEKGMTEIF--VRFPVVDEAKEREQRKFLQHARTSNMICVR 313 G FD LTEDY+ G+ L E G +IF VRF R + R Sbjct: 230 GRPFDPACLTEDYETGYLLHELGCRQIFLPVRF------------------RENGPTATR 271 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 E+FP A+ Q++RW+ GI Q ++ H W L+ Y+ WRDRKG I N +S A L+ Sbjct: 272 EFFPRGARAAISQRTRWVTGIALQSWERHGWRVPLSQLYWFWRDRKGLIGNLLSPAANLL 331 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 + A+ + P AWH S WL L + + R YG Sbjct: 332 FLYGAGSYAFSTGHPSAWHLGSHI--PPWLAGSCRLTLAIAALQTGVRARSAALIYGWKF 389 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR----VAWDKTTHDFPSVTGDTRSLRP 489 LR+ WGNL+NF A AL + G+ RR +AW KT H +P+ T +R Sbjct: 390 AAGVPLRMVWGNLVNFAATAMAL---WEFGNSRRRGGGLAWRKTDHMYPTALA-TSGVR- 444 Query: 490 LGQILLENQV 499 Q LL+N + Sbjct: 445 -YQPLLKNPI 453 >UniRef50_Q1N778 Bacteriophage N4 adsorption protein B n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1N778_9SPHN Length = 470 Score = 278 bits (710), Expect = 7e-73, Method: Compositional matrix adjust. Identities = 178/472 (37%), Positives = 236/472 (50%), Gaps = 29/472 (6%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL-YKPDEKPLAIMVP 76 I + AV + I GLDD ID+ Y+ R+ R + +Y R+ RM+ EL + +A+ VP Sbjct: 21 ILLFAAVGLAIGGLDDLLIDIFYFGRKAWRDIVIYARHQRMTGPELPHSRRPGKIAVFVP 80 Query: 77 AWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 AW E+ VI M A + Y IFVG YPND T V V + + R G Sbjct: 81 AWQESNVIAAMLNHARDSWGEARYRIFVGVYPNDDATIDAVANVACDATWLTLCINDRAG 140 Query: 137 PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPV 196 PT+KADCLN + A+ E +F + ILHDAEDV+ E+RLF+++V+R DL+Q+PV Sbjct: 141 PTTKADCLNLLWRAMRAEEEQGDFRYKAIILHDAEDVVHADEIRLFDFMVDRFDLVQLPV 200 Query: 197 YPFERE---WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 P W + Y DEF+E HGK + VREAL VPSAGV F R + AL +D Sbjct: 201 LPLRGRGGWWRRAIADHYGDEFAESHGKLLSVREALGASVPSAGVACAFERDMLAALASD 260 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 FD SLTEDY+ G R+++ G FVR N+I R Sbjct: 261 EATGPFDPGSLTEDYEAGLRIRDMGGRSAFVRM----------------RDAYGNIIATR 304 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 E+FPD+ AVRQK+RW IGI G+ W + RDR+ ++ V F A L Sbjct: 305 EFFPDSIDAAVRQKARWTIGIALAGWDRLGWRGGPAEFWMRLRDRRAVLAALVLFAAYLT 364 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 ++ L + P LS L LLW N LM+ R+ R +FV YGL Sbjct: 365 LVLWATLALLALVIPFPARPLSPA-----LTGLLWFNLFLMLWRMAMRFLFVARAYGLRA 419 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQH--GDPRRVAWDKTTHDFPSVTGD 483 GL +V R N I +A RA+ L+ G P R WDKT H FP + D Sbjct: 420 GLGAVPRTLIANYIGILAARRAIFLYLRSLAGQPLR--WDKTQHRFPDLKTD 469 >UniRef50_C8WHH7 General secretory system II protein E domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WHH7_EGGLE Length = 711 Score = 217 bits (553), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 195/645 (30%), Positives = 278/645 (43%), Gaps = 112/645 (17%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 + LA I+F G DD DV R +K R+ + + K LA+++ AW+ Sbjct: 13 VALAFIVF--GADDVLWDVFALFRGTGKK--------RVKLSLINEKPPKMLAVVIAAWH 62 Query: 80 ETGVIGNMAELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA---R 134 E V+G + + + Y Y +F+G YPND T + R VVC Sbjct: 63 EDAVLGEVVDNLVASAQYPRSLYRVFLGVYPNDAATVAVARALEVRHGGT--VVCVVGDD 120 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQI 194 PGPTSKA +N+ + AI ++E + FA +HDAEDV+ P E ++ NYL++ D +Q Sbjct: 121 PGPTSKAANINHTVRAIREYEAERDVRFASVTIHDAEDVVHPNEFKMTNYLIDDYDALQF 180 Query: 195 PVYPFERE------WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 PV+P +R + TS TY DEF+E H + + +R+ L G VPSAG G RR + Sbjct: 181 PVFPLQRMPRLRLFFKTLTSSTYADEFAEHHFRTMVMRDEL-GFVPSAGTGFAIGRRVLD 239 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFV--RFPVVDEAKEREQRKFLQHART 306 A D SLTEDY + L+ +G +V + P VD RT Sbjct: 240 AFR---DEDLLPRNSLTEDYKLSLTLRMRGFRVHYVLEKVPRVD-----------ARGRT 285 Query: 307 S-NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNY----FLWRDRKGA 361 + I R FP TF AVRQK+RW+ GI Q L + FL++ K Sbjct: 286 VWDYIATRSLFPSTFKAAVRQKARWVYGITMQSASMADVFGKSELTFAERTFLYKGLKAK 345 Query: 362 ISNFV-----SFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 +NFV + LA ++ L ++P + S S W+ L +MV Sbjct: 346 FANFVLLPGYAVLAYFLVQTFAPQLELPVMYP-------LHSPSWWMCVFLLF---MMVE 395 Query: 417 RIVQRVIFVTGYYGLTQGLLSV-------LRLFWGNLINFMANWRALKQ----------- 458 R V R + YG S+ LRL WGNLIN A +RA +Q Sbjct: 396 RQVLRGRALANVYGWKTMAFSILLPPLFPLRLLWGNLINMCATFRAWRQKIAYVLLRGRE 455 Query: 459 -------VLQH----------------GDPRRV----------AWDKTTHDFPSVTGDTR 485 V++H GD + AW+KT H+F + R Sbjct: 456 AKAAAAPVVEHRGNAAEEEGERKPATDGDEAQTSNATSAQEGPAWNKTDHEFLPASVLER 515 Query: 486 SLRPLGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 R LG LLE + L+ A+ + R G+RLG +L QGL+ L QA A Q Sbjct: 516 YRRLLGDALLERGFVEPGHLEDAVGSARARGVRLGQELLRQGLVEERHLTQAYALQQQSM 575 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI 589 + + L+ MP + A +A LPL IV +D + Sbjct: 576 YVRAQPDLVLLELMDRMPFAAADRFAALPLVESEKGWIVAVDDDL 620 >UniRef50_Q1IXI4 Glycosyltransferase, NfrB-like protein n=2 Tax=Deinococci RepID=Q1IXI4_DEIGD Length = 670 Score = 214 bits (544), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 186/575 (32%), Positives = 265/575 (46%), Gaps = 53/575 (9%) Query: 57 RMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHI--FVGTYPNDPDTQ 114 R+ +L + LA+M+ AW E GV+ M E + Y + FVG YPND T Sbjct: 46 RLRPADLQQDHPSHLAVMIGAWQEAGVVTPMIESTLRLMHYPASRVEFFVGVYPNDLATL 105 Query: 115 RDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVI 174 +V + RFPNVH VV RPGPTSK+ LN V AI E F +HDAEDVI Sbjct: 106 PEVQALAERFPNVHCVVNERPGPTSKSQNLNGVYAAIKAHEARTGKPFDVIAVHDAEDVI 165 Query: 175 SPMELRLFNYLVERKDLIQIPV---YPFEREW--------THFTSM----TYIDEFSELH 219 P +L++ L++R ++Q+PV +P R W H T +Y DEF+E H Sbjct: 166 HPYTFQLYSTLLKRWKMVQLPVFALFPRGRAWGAGLRGLLRHLTGQIVTGSYADEFAEHH 225 Query: 220 GKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGM 279 + +P REAL +PSAG G RR V ALL + DG +L EDY++ RL +G Sbjct: 226 LRHLPAREALGLFLPSAGTGFAM-RREVMALLEE-DGQVLTEGALAEDYELALRLWRRG- 282 Query: 280 TEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQG- 338 + V F V + Q K + + VREYFP A+RQK RW GI Q Sbjct: 283 --VRVHFHVQPLPRLDTQGKL-----GRDYVAVREYFPTEVQAAIRQKGRWTYGITLQTP 335 Query: 339 FKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFS 398 + +L LW D+KG +N + L + + LLL + +H S Sbjct: 336 HRLRGLRLNLRDRLTLWHDQKGKYTNLIHLLGYPLSLTLLLAPLF------GFHLQS--- 386 Query: 399 GSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSV-------LRLFWGNLINFMA 451 ++ LL G+ R++ R V YGL Q L++ LR GN+IN +A Sbjct: 387 -NSLTRDLLLGVLGVTGWRMLMRAGAVGRIYGLRQALIATLCLPGLPLRWLAGNVINTLA 445 Query: 452 NWRALKQVL--QHGDPRRVA-WDKTTHDFPSVTGDTRSL-RPLGQILLENQVITEEQLDT 507 RA + L + G R A WDKT +++ R LG L + E +L Sbjct: 446 TLRAWRLFLFPERGQKRGTARWDKTERKAYVPDEVLQAVRRRLGDQWLFTGALRERELAR 505 Query: 508 ALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVA 566 LR R RLG + Q L+ Q+ ++LA+ G+ + ++ + ++ A Sbjct: 506 LLRVQRRAAARLGQLAVQQALVDEAQVRRSLAQTQGLMYLNLTPEMLDHRFLSAEQAQ-R 564 Query: 567 LHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 L A+L R D L+V S + P AL R++ Sbjct: 565 LDVAILGKR--GDRLLVASPHAVSPERCEALLREL 597 >UniRef50_Q2G4Z3 Bacteriophage N4 adsorption protein B n=2 Tax=Sphingomonadales RepID=Q2G4Z3_NOVAD Length = 488 Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 140/445 (31%), Positives = 213/445 (47%), Gaps = 29/445 (6%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMF-ISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 WL D WL L+ + A + F I D+ +D ++ + ++L+ +++ Sbjct: 4 WLADSAYQWLAVLEHELLLFAAVWFAIGAADELVMDGIW----LWQRLTGAGPTGQLAGN 59 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 K A+ VPAW E+ VIG M E+ I+VG Y ND +T + + Sbjct: 60 GRDKLSSMA-AVFVPAWRESAVIGPMVAHCLAVWPQEDLRIYVGCYRNDQETLNAL-TIV 117 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL 181 + P V VV R GPT+KADCLN + A+ Q ER + +LHDAED++ P L L Sbjct: 118 SEDPRVRVVVHDRDGPTTKADCLNRLYLAMRQDERRSGQRIGFIVLHDAEDMVHPAALAL 177 Query: 182 FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 + ++ D +Q+PV P + + + + Y DEF+E H +D+ VR+ + +PSAGVG Sbjct: 178 MDRALDTVDFVQLPVRPEPQASSPWVAGHYCDEFAEAHARDMVVRDHIGAGLPSAGVGCA 237 Query: 242 FSRRAVTALLA-DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 FSR A+ ++A G + F LTEDY+ G + E G F+R R+ R Sbjct: 238 FSRAAIERIVAVRGGALPFAADCLTEDYEAGMLVAETGGRSRFIRV--------RDAR-- 287 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 ++ RE+FPD + +VRQK+RW+ GI FQG+ W S + RDR+G Sbjct: 288 ------GELVATREFFPDGLAASVRQKTRWVHGIAFQGWDRLGWNRSAGDLWMRLRDRRG 341 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 + V A L + ++ E F+ L LL N ++ R+V Sbjct: 342 PLVALVLLAAYLALPLWPIVRFGEMA-----GFVVPVPPGPVLKGLLAFNLCSLIWRLVV 396 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGN 445 R +F YG +G+ SV R GN Sbjct: 397 RALFTGSEYGWIEGVRSVFRFPVGN 421 >UniRef50_Q1GVK0 Bacteriophage N4 adsorption protein B n=1 Tax=Sphingopyxis alaskensis RepID=Q1GVK0_SPHAL Length = 483 Score = 199 bits (505), Expect = 5e-49, Method: Compositional matrix adjust. Identities = 138/425 (32%), Positives = 197/425 (46%), Gaps = 39/425 (9%) Query: 66 PDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP 125 P E +AI VPAW+E + M D E++ ++VG YPND T V ++ AR Sbjct: 66 PIEGRIAIFVPAWDEAAALPAMLCRTLAAWDGEDFRLYVGCYPNDTATIYAVSQLVARDA 125 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYL 185 + V+ GPT+K D LN + A+ ER FA +LHDAED + EL L+ Sbjct: 126 RLRLVIGESEGPTTKGDNLNRLWAALCADERVEARRFAAIVLHDAEDHVHRHELALYRQH 185 Query: 186 VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 + ++QIPV P + Y DEF+E HGKD+PVR L +PSAGVG +R Sbjct: 186 LAHNAMVQIPVVPIIDRRARWIGGHYADEFAEAHGKDMPVRSRLGLPLPSAGVGCALTRS 245 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 A+ L + G F SLTEDY+IG + G+ FV D A +R Sbjct: 246 ALALLAMERGGCPFSSDSLTEDYEIGMVIGAYGLGARFV--DAADPAGDR---------- 293 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSS------------LTLNYF 353 I R FP AVRQKSRWI GI G+ W L + Sbjct: 294 ----IVSRGAFPGRIDAAVRQKSRWIAGIAMAGWDHLGWPGCRLGHKQRSTGRDLLARWM 349 Query: 354 LWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGL 413 LWRDR+ ++ + A +I + +A + L W+ + WL+ +N L Sbjct: 350 LWRDRRAPLAALILLAAYAGLILVAAGVAGQLLL--GWNAIEPGPTLQWLLV---VNALL 404 Query: 414 MVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALK---QVLQHGDPRRVAW 470 + R+ R+ F +G + +V R F N+I +A R++ ++L+ G+ V W Sbjct: 405 LGWRMALRIHFTARLHGWREASFAVPRAFVANIIAMLAARRSVLLYWRILRSGE---VVW 461 Query: 471 DKTTH 475 DKT H Sbjct: 462 DKTDH 466 >UniRef50_B8ICV2 General secretion pathway protein E n=2 Tax=Methylobacterium RepID=B8ICV2_METNO Length = 442 Score = 198 bits (504), Expect = 6e-49, Method: Compositional matrix adjust. Identities = 138/454 (30%), Positives = 215/454 (47%), Gaps = 40/454 (8%) Query: 28 ISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNM 87 +S LDD FID++ + I RK P ++ R D +A+ V W+E V+G M Sbjct: 18 VSSLDDAFIDIIAF--GILRK-----GLPGLAERT----DIPRIAVFVANWHEEEVLGKM 66 Query: 88 AELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN-VHKVVCARPGPTSKADCLNN 146 E + Y + +F+G YPND T R E+ A++P+ V ++ GPTSK LN Sbjct: 67 VEGNLARIPYPSVSLFLGVYPNDTGTLRVAKELEAKYPDRVTVIINTLNGPTSKGQMLNE 126 Query: 147 VLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWTHF 206 + + + E + A +LHD+EDVI P ++ + D IQ+PV+ R Sbjct: 127 MFQQVFEREDCPDIA----VLHDSEDVIDPRTFPIYAQYSQDHDFIQVPVFSLSRGKGLP 182 Query: 207 TSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTE 266 + TY+DEF+E H +++ VR A+ +PSAGVGT +++ + LA G ++TE Sbjct: 183 VASTYMDEFAERHTREMIVRNAVGAAIPSAGVGTAMTKKLLKYFLAT-RGQVLMSGTVTE 241 Query: 267 DYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQ 326 DY +G K G + A N + RE+FP T + +++Q Sbjct: 242 DYILGVEAKRAGFS-------------AAFAAVSADDASGLNYVATREFFPKTLAASIKQ 288 Query: 327 KSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESL 386 K+RW+ GI F+ W + YF RDRKG I+NF+ ++ + ++ ++L L S Sbjct: 289 KTRWVYGINFEATHKLGWEGNAWDKYFFVRDRKGIITNFLPPVSFVFLVLIVLGLIDPSE 348 Query: 387 WPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNL 446 PD +F S ++LN ++ R RV+ YG + R G Sbjct: 349 MPDPIE--PVFVAS------IYLNLAALIVRYTIRVVASHEVYGTYDLIGIAYRWPIGLY 400 Query: 447 INFMANWRALKQVLQHGD--PRRVAWDKTTHDFP 478 IN A +RA K + + + W KTTHD P Sbjct: 401 INAAAVFRAWKTYIGESQFATKPIVWSKTTHDLP 434 >UniRef50_D2LI28 General secretion pathway protein E n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LI28_RHOVA Length = 450 Score = 189 bits (480), Expect = 4e-46, Method: Compositional matrix adjust. Identities = 136/470 (28%), Positives = 215/470 (45%), Gaps = 52/470 (11%) Query: 22 LAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP-------LAIM 74 +++++ +S DD F+D+ LSV E P EKP + + Sbjct: 12 ISILINVSSFDDAFVDL----------LSVGIIRGNFGPPEDPSP-EKPTSSAIPDIGVF 60 Query: 75 VPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN-VHKVVCA 133 V W E V+G M E + + +++G YPND T + + A++P+ V +V + Sbjct: 61 VANWQEEDVLGRMVEGNLARIPISSVKLYLGVYPNDTGTLAVAEAMAAQYPDRVRVIVNS 120 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN-YLVERKDLI 192 GPTSK LN + + + A +LHD+ED+I P ++ Y E D I Sbjct: 121 MEGPTSKGQMLNEMFRQVYARPGAPEMA----VLHDSEDIIDPRTFGVYTAYAREGYDFI 176 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 Q+PV+ + TY+DEF+E H +++ VR A+ +PSAGVGTC +RR + + Sbjct: 177 QVPVFSLNSIKRSKVAATYMDEFAERHTREMVVRHAVGAMIPSAGVGTCMTRRLLEHFVR 236 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 + G +TEDY +G K G F R + + Sbjct: 237 E-RGFVLANGCVTEDYILGVEAKRAGFRSAFAAVSA-------------DELRGLDFVAT 282 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 REYFP +FS +V+QK+RW+ GI F+ W YF RDRKGAI+NF+ ++++ Sbjct: 283 REYFPKSFSASVKQKTRWVYGINFEATHKLGWGGDFWDKYFFMRDRKGAITNFLPPISLV 342 Query: 373 VMIQLLLLLAYE--SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 +LLA+E + + IF S ++LN + R + RVI YG Sbjct: 343 ----FWVLLAFEVVDIEQMPLDLVPIFQVS------IFLNMLALGLRYLMRVICCRDVYG 392 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGD--PRRVAWDKTTHDFP 478 + + +R +N +A WRA K + + + + W KT H+ P Sbjct: 393 INDFIGVAVRWPVSLTVNMLAVWRAWKTYVGESEYATKPIVWSKTEHELP 442 >UniRef50_Q2NDA0 Probable inner membrane transmembrane protein n=1 Tax=Erythrobacter litoralis HTCC2594 RepID=Q2NDA0_ERYLH Length = 318 Score = 115 bits (288), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 67/199 (33%), Positives = 99/199 (49%), Gaps = 6/199 (3%) Query: 32 DDFFIDVVY-WVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAEL 90 D+ +D V+ W+R R ++ R RE P E A+ +PAW E VIG E Sbjct: 33 DELIVDAVWLWLRLTGRGETIEVRR-----RERSLPLEGKSAVFIPAWQEANVIGTTVEH 87 Query: 91 AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDA 150 + ++VG Y NDP T + E P + V+ GPT+KADCLN + A Sbjct: 88 MLSAWPQRALRLYVGCYRNDPATLAAIVEAAPGDPRLRVVIHDCDGPTTKADCLNRLYRA 147 Query: 151 ITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWTHFTSMT 210 + + E + +LHDAED++ P L L + D IQ+PV P ++ + + Sbjct: 148 VEEDEARSGERCRMVLLHDAEDMVDPAALELCGRAIASADFIQLPVLPEPQKRSRWIGSH 207 Query: 211 YIDEFSELHGKDVPVREAL 229 Y +EF+E HGK + VR+AL Sbjct: 208 YCEEFAEAHGKAMVVRDAL 226 >UniRef50_C6E323 Response regulator receiver protein n=3 Tax=Geobacter RepID=C6E323_GEOSM Length = 283 Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 47/139 (33%), Positives = 76/139 (54%), Gaps = 2/139 (1%) Query: 489 PLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW-E 546 PLGQIL+++ +IT + L+ AL R G RLG + G+I+ E+L +ALA+Q+G+ + Sbjct: 4 PLGQILVQSGIITVKTLERALARQEGSGKRLGAILEEMGVITPEELVEALAQQSGMEMVK 63 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 I +P L+ +P VA++ V PL + L + D ID +L L R K+ Sbjct: 64 RITVQNVPGELLELVPGEVAINKLVFPLNRQEGVLAIAVSDPIDSETLDLLERHSCNKIV 123 Query: 607 YVIVLRGQIVTGLRHWYAR 625 V+ R +I+ ++ Y R Sbjct: 124 QVLAAREEILGAVKQHYLR 142 >UniRef50_B8FQB2 Type II secretion system protein E n=4 Tax=Clostridiales RepID=B8FQB2_DESHD Length = 573 Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 50/174 (28%), Positives = 84/174 (48%), Gaps = 16/174 (9%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG+IL+ + EEQL+ AL+ + GLRLG ++ Q +S E++ + + Q G+ + Sbjct: 10 LGEILIAGGALMEEQLNEALKLQKSLGLRLGEVLIRQNFVSEEEILRTIQRQLGLPAVDL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + + ++ +P SVA Y VLP+ N +L+V + D D ++ L G V+ Sbjct: 70 NRIFVTEKILKMIPESVARKYTVLPVDFTNGQLLVATSDPTDYYAIDDLRLASGMMVKPC 129 Query: 609 IVLRGQIVTGLRHWYA------------RRRGHDPRAMLYNAVQHQWLTEQQAG 650 + + I+ + +Y R++GHD A A Q LT QAG Sbjct: 130 VARKADILRAIDRFYGRSEAEKAVSDFVRQKGHDQVAA---AAQTPVLTVVQAG 180 >UniRef50_D1B5J6 Type II secretion system protein E n=3 Tax=Synergistaceae RepID=D1B5J6_THEAS Length = 559 Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 45/146 (30%), Positives = 81/146 (55%), Gaps = 3/146 (2%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALA 538 +T +T+ LR LG IL++ V+TE L+ AL ++ +RLG ++ G +S + LA+AL+ Sbjct: 1 MTTETKHLR-LGDILIQAGVLTESTLEAALAEQKMSSMRLGEILVKNGWVSEKHLAEALS 59 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLR-LENDELIVGSEDGIDPVSLAAL 597 Q V S+ ++ ++ +P ++A V+PL LEND+L+V + D ++ ++L L Sbjct: 60 RQLKVPLVSLSRYRPTPEVLKIVPENLARRLDVVPLSILENDKLLVATADPLNVMALDEL 119 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWY 623 GR++ I +I +Y Sbjct: 120 KMATGREIDISIATASEIRRAFDQFY 145 >UniRef50_C4L4L3 Type II secretion system protein E n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L4L3_EXISA Length = 556 Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 38/139 (27%), Positives = 72/139 (51%) Query: 485 RSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 R+ R LG++L+E +IT QLD AL + G +LG +++ I+ QL Q + EQ V Sbjct: 3 RTKRRLGEMLIEAALITTNQLDEALEQKRPGEKLGDALIRLNHITETQLIQMIHEQLHVP 62 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + ++ I ++ +P ++A + ++P L + L V + D +D +++ + + G Sbjct: 63 IIELYSYDINVTVTKLVPKALAQKHDIMPFELNGNTLHVATADPLDLIAIDDVRLQTGMN 122 Query: 605 VRYVIVLRGQIVTGLRHWY 623 + I R QI + +Y Sbjct: 123 IEIGIATREQIRKTISRYY 141 >UniRef50_B5E8D1 General secretory system II protein E domain protein n=5 Tax=Geobacter RepID=B5E8D1_GEOBB Length = 552 Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 34/123 (27%), Positives = 70/123 (56%), Gaps = 1/123 (0%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 +G+IL ++Q+ITE++L AL +V G R+G +++ G+++ E + ALA Q + + + Sbjct: 10 IGEILFKSQIITEQELSAALEEQKVSGCRVGEALVRLGVVAQEDIDWALANQLNIPYVRL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 I + +A++P +A Y++ P+ L EL + D ++ ++ +TR G ++ Sbjct: 70 KKENIDPAAVAKVPGQLARRYSLCPIFLSGSELSIAMADPLNKEAVEEITRVTGCQISIS 129 Query: 609 IVL 611 + L Sbjct: 130 VGL 132 >UniRef50_A0LUI4 Type II secretion system protein E n=4 Tax=Bacteria RepID=A0LUI4_ACIC1 Length = 553 Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 38/138 (27%), Positives = 71/138 (51%), Gaps = 1/138 (0%) Query: 487 LRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 ++ LG ILLE ++T EQL A ++ G LG ++ QG+++ QL ALA Q G+ + Sbjct: 1 MKQLGDILLEGGLVTPEQLAAAYAEHQRNGRSLGRVLVDQGILTEAQLVAALATQIGLRF 60 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 + I S ++ +P +V Y LP+ E+ +L+V D + ++ + G V Sbjct: 61 VDLTDVAIDGSAVSRVPEAVCRRYTALPIGYEDGKLVVAMADPANVFAIDDIRSITGLDV 120 Query: 606 RYVIVLRGQIVTGLRHWY 623 + V+ R ++ + ++ Sbjct: 121 KPVVATRADVLAAINRYH 138 >UniRef50_Q39ZG4 General secretory system II, protein E-like n=2 Tax=Geobacter RepID=Q39ZG4_GEOMG Length = 370 Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 44/140 (31%), Positives = 75/140 (53%), Gaps = 2/140 (1%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++L++ IT +QLD ALR++V G RLG +++ G I E+LA+ L+E+ V Sbjct: 5 LGEMLVKTGRITPDQLDEALRSQVIFGGRLGTNLVEMGCIDEEELARVLSEKLRVPCVDP 64 Query: 549 DA-WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 D I ++I +P V Y V+PLRLEN L + D D ++ + + G + Sbjct: 65 DELMNISPAIIEAVPLEVVEQYQVVPLRLENRRLFLVMADPSDLPAIDQIAFRTGHVIVP 124 Query: 608 VIVLRGQIVTGLRHWYARRR 627 ++ +++ L +Y +R Sbjct: 125 LVAPEIRLLMALEKYYGIKR 144 >UniRef50_B9Y801 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y801_9FIRM Length = 560 Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 43/139 (30%), Positives = 73/139 (52%), Gaps = 2/139 (1%) Query: 489 PLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 P+GQ+LLE ITEEQL++AL ++ G RLG ++ G I+ E+ +AL+ + V Sbjct: 5 PIGQLLLEQGYITEEQLNSALAHQKAHPGNRLGDVLIELGYITEEKKLKALSVRLNVPVY 64 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + S ++ + VA Y V+PL ++N+ L + + D +D +L + G V Sbjct: 65 EGFQINVNSDIVRLISEDVAKKYQVMPLEIKNNALQLATSDPLDFYALEDIKASCGIPVS 124 Query: 607 YVIVLRGQIVTGLRHWYAR 625 V+ + I +R YA+ Sbjct: 125 PVLAPKEMIENAIRRNYAQ 143 >UniRef50_B1I3E7 Type II secretion system protein E n=4 Tax=Clostridia RepID=B1I3E7_DESAP Length = 561 Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 39/137 (28%), Positives = 74/137 (54%), Gaps = 4/137 (2%) Query: 490 LGQILLENQVITEEQLDTALRNR--VEGLR--LGGSMLMQGLISAEQLAQALAEQNGVAW 545 LG L++ VIT+EQL+ AL+ + +G + LG +++ G + E +AQ +A QNGV + Sbjct: 9 LGMNLVKAGVITQEQLEEALKRQDPKKGGKGFLGATLVELGYCTEEDIAQVIARQNGVPY 68 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 S++ + + + VA Y LP+ +N +L+V + D ++L L GR++ Sbjct: 69 VSLETFAADPQAVGLIAPEVARRYRALPIGFQNGKLVVAMKQPRDVIALDDLRIITGREI 128 Query: 606 RYVIVLRGQIVTGLRHW 622 + V++ Q ++ + Sbjct: 129 QPVVIPDSQFDAAMQRY 145 >UniRef50_C6CY56 Type II secretion system protein E n=3 Tax=Bacillales RepID=C6CY56_PAESJ Length = 554 Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 39/135 (28%), Positives = 73/135 (54%), Gaps = 2/135 (1%) Query: 490 LGQILLENQVITEEQLDTALRNRVEGL-RLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG +L+E+ +I+EEQL AL + + +LG ++ QG I+ +QL + L Q G+ S+ Sbjct: 8 LGDLLVESAIISEEQLQKALLEQSKSKQKLGDLLIAQGYITEQQLIEVLEFQLGIPHVSL 67 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 +QI + +P S+A Y +PL+ + +L+V D +D ++ L G ++ Sbjct: 68 YKYQIDPEITQIIPESMAKRYQAIPLQKDGGKLMVAMADPLDYFAIEELRMSTGFRIEPA 127 Query: 609 IVLRGQIVTGL-RHW 622 I + ++ + RH+ Sbjct: 128 ISSKDELQRAIARHY 142 >UniRef50_D1BGY1 Type II secretion system protein E (GspE) n=3 Tax=Actinobacteridae RepID=D1BGY1_SANKS Length = 557 Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 34/102 (33%), Positives = 57/102 (55%), Gaps = 1/102 (0%) Query: 487 LRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 ++ LG+ILLE ++ E QL AL +V G LG ++ G++S QL ALA Q G+ + Sbjct: 1 MKQLGEILLEEGLVNEAQLMAALDEQVVRGTSLGRVLVELGVLSEGQLVSALAAQVGMQF 60 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSED 587 +D + + + ++ + +V Y VLP+ E D L++ D Sbjct: 61 VDLDTFPVDRAAVSRLTGAVCRRYTVLPIAFEGDALVLAMAD 102 >UniRef50_A9FI10 Family membership n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FI10_SORC5 Length = 563 Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 40/145 (27%), Positives = 75/145 (51%), Gaps = 1/145 (0%) Query: 488 RPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 +PLG+ILL+ + +T+ QL+ AL R +G+ L +++ G +S +AL+EQ+GV Sbjct: 8 KPLGRILLQQRAVTQPQLEQALLEARAKGVPLATNLIESGTVSEVAALKALSEQSGVPGI 67 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++ I S ++ +P +A + +LP+ + D ++V D + L G++V Sbjct: 68 DLNQVCIKLSDLSILPREIAAKHKLLPVLVREDRILVAMAAPADKKVIDELEFVTGKRVF 127 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDP 631 I L G +V + Y + +P Sbjct: 128 PYIALAGPLVRTIAAAYDMKEQGEP 152 >UniRef50_B5YDI2 Type IV pilus assembly protein PilB n=20 Tax=Bacteria RepID=B5YDI2_DICT6 Length = 873 Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 51/175 (29%), Positives = 89/175 (50%), Gaps = 10/175 (5%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++LLE +IT+EQLD AL + +G+RLG ++L L+ LA+ L+EQ + ++S+ Sbjct: 316 LGEVLLEKNLITKEQLDEALALSSKKGIRLGEALLELKLLDDVALAKLLSEQFDIPFKSL 375 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT--RKVGR-KV 605 +I L + A +LPL +N ++VG +DP ++ AL R V R +V Sbjct: 376 KEVKIDHDLAKLISPQKARENLILPLYRDNGRIVVGI---VDPSNILALDDLRMVTRSEV 432 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 VIV R +++ + + + +L + + E Q E+ + + Q Sbjct: 433 FPVIVPRNELIDAINQIWG---SEEVEKVLEEIIVQKEEEETQYQEVSLEEISSQ 484 >UniRef50_B8D2C7 Tfp pilus assembly protein PilB n=5 Tax=Firmicutes RepID=B8D2C7_HALOH Length = 558 Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 42/143 (29%), Positives = 75/143 (52%), Gaps = 2/143 (1%) Query: 484 TRS-LRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQN 541 TR+ ++ LG++LL+ ITE+QL+ AL+ + + G +LG ++ G ++ L Q L Q Sbjct: 2 TRTHIKKLGELLLDFNFITEKQLNEALKKQNKSGKKLGEILVESGYLNENDLIQVLEFQL 61 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 G+ ++ + I L +P ++A + V+PL +N +L V D + V++ + Sbjct: 62 GIPHADLNKYVINPHLAQYIPENIARRHNVVPLEKKNGKLKVAMVDPTNLVAIEDIEMTS 121 Query: 602 GRKVRYVIVLRGQIVTGLRHWYA 624 G KV +I R I L Y+ Sbjct: 122 GLKVEPLIASRKNIKMALNQIYS 144 >UniRef50_A1TEN6 Glycosyl transferase, family 2 n=7 Tax=Actinomycetales RepID=A1TEN6_MYCVP Length = 461 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 78/292 (26%), Positives = 118/292 (40%), Gaps = 50/292 (17%) Query: 59 SYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVD 118 +R K ++++PA +E V+G+ + A LD+ Y + V +DP+T+ Sbjct: 79 GFRRRSAGRPKGFSLLLPARHEQDVLGDTID-ALARLDHPLYEVIVIIGHDDPETEHVAR 137 Query: 119 EVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME 178 AR P + +VV P +K LN L + G + DAED + P Sbjct: 138 AAAARHPRIVRVVIDTNIPKNKPKALNTALP-------TCRGEIVG--VFDAEDEVHPRL 188 Query: 179 LRLFNYLVE--RKDLIQIPVYPFERE---WTHFTSMTYIDEF-SELHGKDVPVREALAGQ 232 LRL E R D++Q V + W+ + Y F S LH A Sbjct: 189 LRLVEARFEEARADVVQSGVQLMNIQTSWWSLRNCLEYYFWFRSRLH------FHADQRF 242 Query: 233 VPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEA 292 +P G T F+R TALL G +D L ED +IG RL +G R V + Sbjct: 243 IPLGG-NTVFAR---TALLRSVGG--WDRDCLAEDCEIGVRLSTRG-----ARVAVAYDP 291 Query: 293 KEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKW 344 K + RE P + V+Q++RW G + Q ++ +W Sbjct: 292 K----------------VVTREETPGSLRALVKQRTRWDQGFM-QVYRKGEW 326 >UniRef50_C8W5J2 Type II secretion system protein E n=2 Tax=Clostridiales RepID=C8W5J2_DESAS Length = 561 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 35/125 (28%), Positives = 70/125 (56%), Gaps = 6/125 (4%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-----GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 LG IL++ +IT+EQL+ AL+N+ E GL +G +++ G + + +A+ +AE++G+ Sbjct: 10 LGTILVQKGIITQEQLEDALKNQSEMKGKKGL-IGKTLVRLGYCTEDDIARVIAERSGIP 68 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + S++ +QI + + + Y LP+ +D+L+V D +S+ L G Sbjct: 69 YISLETYQIDPAAVTVLSIDNINRYKALPVSFADDKLVVAMNHPNDIMSIDDLRMLTGYD 128 Query: 605 VRYVI 609 ++ V+ Sbjct: 129 IKPVM 133 >UniRef50_D2R473 Type II secretion system protein E n=6 Tax=Planctomycetaceae RepID=D2R473_9PLAN Length = 572 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 40/144 (27%), Positives = 73/144 (50%), Gaps = 2/144 (1%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGV 543 ++R +GQI ++ I+++QL+ L + + G LG L++ EQL QALAEQ G+ Sbjct: 2 AIRRIGQIFVDMGFISDDQLEMLLEEQQQRPGTLLGKLAQEMSLVNEEQLVQALAEQMGM 61 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + IP ++ ++ S+A Y V+P++ ++EL V + D + L +G Sbjct: 62 QVVELGDITIPGDVLHKVTESMAQLYRVIPIKFSSNELTVATCDPQNITIQDELRSMLGY 121 Query: 604 KVRYVIVLRGQIVTGLRHWYARRR 627 +R VI I L +++ + Sbjct: 122 DIRVVIASETDIKKTLDRYFSSDK 145 >UniRef50_B0MQV8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0MQV8_9FIRM Length = 563 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 40/140 (28%), Positives = 73/140 (52%), Gaps = 2/140 (1%) Query: 489 PLGQILLENQVITEEQLDTAL-RNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 P+GQIL+EN + E+QL+ AL + R E G +LG +L G +S QLAQAL+ + V + Sbjct: 5 PIGQILVENGFLKEDQLEEALEKQRSEPGKKLGDVLLELGYVSETQLAQALSIRLKVPFI 64 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + +I + ++P ++A + ++ + L V ++D I+ L G ++ Sbjct: 65 DLTTTKIDIEAVKKIPEAIAKKNCCVAFQMTDSRLTVATDDPINFYIFEELKVISGMEIH 124 Query: 607 YVIVLRGQIVTGLRHWYARR 626 +I R I + Y+++ Sbjct: 125 AMIATRTAINETISKAYSQQ 144 >UniRef50_Q1IRV6 Type II secretion system protein E n=24 Tax=Bacteria RepID=Q1IRV6_ACIBL Length = 571 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 37/135 (27%), Positives = 67/135 (49%), Gaps = 1/135 (0%) Query: 490 LGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG +L+ +VIT EQL+ ALR + G RLG +++ G +S + + L+ Q GV ++ Sbjct: 5 LGDLLVREKVITAEQLEQALREQGSSGTRLGAALVKLGFLSDDDVTNFLSRQYGVPAINL 64 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + ++I S++ +P A Y +LPL L + D + ++ + G + V Sbjct: 65 NYFEIDPSVVKLIPYDTAKRYQILPLSRVGASLTIAMVDPTNVFAMDDIKFMTGFNIEPV 124 Query: 609 IVLRGQIVTGLRHWY 623 + I+ G+ Y Sbjct: 125 VASESAILEGIEKAY 139 >UniRef50_B0MCL3 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MCL3_9FIRM Length = 558 Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 32/138 (23%), Positives = 77/138 (55%), Gaps = 2/138 (1%) Query: 489 PLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 P+G++LL+ IT+EQ+D AL + E G RLG ++ I+ +Q+ +AL ++ ++ Sbjct: 5 PIGEVLLQYGYITKEQIDQALDYQKEHPGKRLGTILMELQFITEQQMLEALGQRLSLSHI 64 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 S+ ++ + S + ++P +A Y +L + +++ +L + D ++ ++ + + G +++ Sbjct: 65 SLGSYPVNSEAVEKIPRQLAFKYNILAVDMKDHQLYIAVNDPLNFYAMEDIRQLTGMQLK 124 Query: 607 YVIVLRGQIVTGLRHWYA 624 + + L ++YA Sbjct: 125 VFLAELSPLKKALEYFYA 142 >UniRef50_B8E2U7 Type II secretion system protein E n=2 Tax=Dictyoglomus RepID=B8E2U7_DICTD Length = 561 Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 36/123 (29%), Positives = 62/123 (50%), Gaps = 1/123 (0%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 +PLG+ LLE +IT+EQL+ AL + + G +LG ++ +G + E + + L Q+ + + Sbjct: 5 KPLGEYLLEQGLITKEQLEKALEEQKKTGAKLGQILIERGYVKPEDIGKVLERQSEIPYI 64 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 S+ QI L ++ Y +P++ E L V IDP + + R V +R Sbjct: 65 SLTEVQIDEKLAGSFSENLLRRYKFIPIKREAGVLHVAVVPPIDPAIINEIRRIVKSPIR 124 Query: 607 YVI 609 I Sbjct: 125 IFI 127 >UniRef50_C8WIR5 Type II secretion system protein E n=2 Tax=Bacteria RepID=C8WIR5_EGGLE Length = 568 Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 64/262 (24%), Positives = 115/262 (43%), Gaps = 33/262 (12%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG +L++ +ITE+QL AL+ + E RLG ++ +G+I+ L +AL Q GV + + Sbjct: 6 LGDVLIDAGLITEDQLGHALKQQKETKRRLGDELIAEGVITEAGLIEALQMQLGVEFVDL 65 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 A + L + +VA Y V+P+R DE+ + D ++ +++ A+ ++V + Sbjct: 66 SAIDLDPELSRVISKNVARQYNVVPVRTSPDEVCLAMSDPLNFMAIEAVKNATRKRVIPM 125 Query: 609 IVLRGQIVTGLRHWY---------------ARRRGHDPRAMLYNAVQHQWLTEQQAGEIW 653 + ++ + Y AR G D +A + T + Sbjct: 126 VTTHDSLMRAIMTLYGNEGAARAIEEMKRDARTTGAD------DASTGSFQTSTLGDDAD 179 Query: 654 RQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTI 713 Q P L I+ S I++ E + L + +GV L +LT+ Sbjct: 180 AQSAPTVRLVNSIIERAATERASDIHL-----EPREIDLHVRMRIDGV-----LRTILTV 229 Query: 714 QRELQVSMQS-LLLKAGLNTEQ 734 +ELQ S+ S L + G+NT + Sbjct: 230 PKELQASVISRLKIMGGMNTSE 251 >UniRef50_Q1YKP1 Putative glycosyl transferase n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YKP1_MOBAS Length = 681 Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 104/430 (24%), Positives = 171/430 (39%), Gaps = 89/430 (20%) Query: 72 AIMVPAWNETGVIGNMAELAATTLDYENYHIFVG--TYPNDPDTQRDVDEVCARFPNVHK 129 +++V + ETGVI + + + LD+ I + +DP T + A P + Sbjct: 297 SVLVALYQETGVIERLVA-SLSRLDWPTSRIEIKLVCEADDPATIGEARRATAGLPQF-E 354 Query: 130 VVCARPG-PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--LFNYLV 186 +V PG P +K LN VL + + E A L+DAED P +LR + Sbjct: 355 IVAVPPGEPRTKPKALNFVL-PLCRGEFVA--------LYDAEDEPDPGQLREAFHGFRN 405 Query: 187 ERKDL--IQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 DL +Q P+ + T + + E++ L + +P +P G F R Sbjct: 406 GPGDLACLQAPLVVRNGDQNWLTGL-FALEYAALFRRLLPWLARRRLPLPLGGTSNHFRR 464 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 +TA+ A+D ++TED D+G RL +G + P +++A ER Sbjct: 465 HCLTAV------GAWDSHNVTEDADLGMRLYREGWKIGTLTRPTLEDAPER--------- 509 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISN 364 + RQ++RW G + W + LWR+ G IS Sbjct: 510 ---------------WPVWYRQRTRWTKGWL------QTWLVHMRQPRRLWRE-LGPISF 547 Query: 365 ------FVSFLAMLVMIQLLLLLAYESLWPDAWH-----------FLSIFSGSAWLMTLL 407 FV LA ++ + L+L +L H L +F+ + + Sbjct: 548 AVFQMLFVGMLASALIQPVFLVLVLSTLVSALNHGLPGGLAGMIFALDLFNATGGFFAFV 607 Query: 408 WLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR 467 L+ + R +R + YY L L+W L+ +A+ RA+ Q+ + DP R Sbjct: 608 ALSLPAL--RPEERAT-LPKYYALVH-------LYW--LLIALASLRAVCQLAR--DPHR 653 Query: 468 VAWDKTTHDF 477 W+KT HD Sbjct: 654 --WEKTHHDL 661 >UniRef50_C4DGM8 Glycosyl transferase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DGM8_9ACTO Length = 476 Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 105/432 (24%), Positives = 173/432 (40%), Gaps = 89/432 (20%) Query: 70 PLAIMVPAWNETGVIGNM-AELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 P ++VP + E V+ + + L+A + I + +D +T D CA P Sbjct: 98 PYTVLVPLYREATVLPTLVSRLSALDYPRDRLQILLLIEADDAETL-DAAVTCATDPRFE 156 Query: 129 KVVCARPGPTSKADCLN-NVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFN 183 VV P +K N + A+ +F +++DAED P +LR F Sbjct: 157 IVVIPDSVPKTKPKACNIGLARAVGEF----------CVIYDAEDRPDPDQLRKAALAFR 206 Query: 184 YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSE-----LHGKDVPVREALAGQVPSAGV 238 +R +Q + + WT++ + + E++ LHG D R LA +P G Sbjct: 207 LSPQRVVCVQAELQ-YWNPWTNWLTRCFAAEYATNFSMTLHGMD---RYRLA--IPLGGT 260 Query: 239 GTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQR 298 F A+ AL +D ++TED D+G R+ +G VR V +E Sbjct: 261 SNHFRTDALRAL------GGWDPYNVTEDADLGIRIARRGWD---VRMMVSVTEEE---- 307 Query: 299 KFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD- 357 +AR N + RQ+SRWI G + W Y LWR+ Sbjct: 308 ---ANARLGNWL--------------RQRSRWIKGYL------QTWLVHSRRPYRLWREV 344 Query: 358 ---RKGAISNFVSFLA--------MLVMIQLLLLLAYESLWPDAWHFLSIFSGS-AWLMT 405 R A+ + F M M L L++ + L P + +++ G A L+ Sbjct: 345 GTRRSLAVHLTLGFATVTTLVNPVMWAMTILYLIVGPQPLEPLFPKY-NLYGGVIAMLLG 403 Query: 406 LLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDP 465 + + LM+ V+R +F LT + L+WG + +A ++AL Q+L+ Sbjct: 404 NALMCYTLMLG-CVRRGLFAAVRVMLT------IPLYWG--LMSLAAYKALIQLLRPS-- 452 Query: 466 RRVAWDKTTHDF 477 +R W+ T H Sbjct: 453 KRHYWELTEHGL 464 >UniRef50_Q1J1R8 Tfp pilus assembly pathway, ATPase PilB n=7 Tax=Bacteria RepID=Q1J1R8_DEIGD Length = 891 Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 42/142 (29%), Positives = 76/142 (53%), Gaps = 5/142 (3%) Query: 486 SLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 ++PLG++++E E++D AL + G RL +++ G +S E LA++LA Q G Sbjct: 332 KVKPLGEVIVELGFARAEEIDAALQKQNAGGGRLEDTLVQSGKLSPEMLARSLAAQLG-- 389 Query: 545 WESIDAWQ-IPSSLIAEM-PASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 +E +D Q P +A M P + A Y V+P+RL+ + L+V +D + +L L G Sbjct: 390 YEYLDPVQNPPDPQVALMIPEATARRYTVVPVRLQGEALVVAMKDPRNVFALDDLKLITG 449 Query: 603 RKVRYVIVLRGQIVTGLRHWYA 624 R++ ++ IV + ++ Sbjct: 450 REIVPAVMSEKDIVRLIERYFG 471 Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 66/248 (26%), Positives = 103/248 (41%), Gaps = 28/248 (11%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 G + S LGQ L+ +I E QL AL + G LG ++ QGL+S +QL + L Sbjct: 156 GAAGSSESGGKLGQRLISRGLINEAQLQVALDVQQQTGEALGHILVTQGLLSEDQLYEVL 215 Query: 538 AEQNGVAW-ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI-DPVSLA 595 AEQ G + + +Q ++ + + AL + +P+ DE G + DP Sbjct: 216 AEQAGAVYLRNPRDFQPGEEVLGSLLRADALRLSAVPV----DETAQGVTVVVSDPRRRD 271 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWYARR---------RGHDPRAMLYNAVQHQWLTE 646 L +GR V+ V+ G + + +Y +R +G RA L A+Q Q Sbjct: 272 ELEALIGRPVQLVLARPGDVEALIERYYPQRGRLGEQMVQQGSLSRAQLREALQVQA--- 328 Query: 647 QQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQET 706 + G++ E++ LG I+ L + L LV G +S E Sbjct: 329 -RGGKVKP--------LGEVIVELGFARAEEIDAALQKQNAGGGRLEDTLVQSGKLSPEM 379 Query: 707 LDRVLTIQ 714 L R L Q Sbjct: 380 LARSLAAQ 387 >UniRef50_A3DDQ8 Type II secretion system protein E n=3 Tax=Clostridium thermocellum RepID=A3DDQ8_CLOTH Length = 561 Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 34/137 (24%), Positives = 73/137 (53%), Gaps = 1/137 (0%) Query: 488 RPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG IL+E +I++EQLD AL+ + G +LG ++ +G+++ E + + L E+ GV Sbjct: 7 KGLGDILVEAGLISKEQLDKALKLQKKTGQKLGVLLVSEGIVTQEDIMRVLEEKIGVLRV 66 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 +++ I ++ + +P +A Y ++P+ ++ L V D ++ ++ + G +V Sbjct: 67 ALEECNIDPAVCSLIPEKLARRYELIPIAQKDGVLRVAMSDPLNVFAIDDIEDYTGMRVE 126 Query: 607 YVIVLRGQIVTGLRHWY 623 V+ I + +Y Sbjct: 127 PVVDFASSIKNAIDKYY 143 >UniRef50_Q094V5 General secretion protein E N-terminal domain protein (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094V5_STIAU Length = 154 Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 37/105 (35%), Positives = 64/105 (60%), Gaps = 4/105 (3%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++L++ V+ E QL AL + + G +LG ++ L+S + L +AL++Q + ++ Sbjct: 6 LGELLIKANVLQESQLKAALAEQAKWGGKLGEILVRMSLVSEDILVRALSKQLNIPAVNL 65 Query: 549 DAWQ-IPSSLIAEMPASVALHYAVLPLRLEND--ELIVGSEDGID 590 DA Q IP + A++PA A +AVLPL+L +D L+V D ++ Sbjct: 66 DAVQMIPPHVRAKVPAQTARDFAVLPLQLRDDGKTLVVAVADPLN 110 >UniRef50_C0QQ17 Type IV pilus assembly protein TapB n=2 Tax=Bacteria RepID=C0QQ17_PERMH Length = 558 Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 43/175 (24%), Positives = 81/175 (46%), Gaps = 24/175 (13%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 S +P+GQ+L E +TEEQ+ AL +++G LG + +S ++A+A+A Q+G Sbjct: 2 SRKPIGQLLKEFGYVTEEQIQVALEVQKIKGGLLGEILQELSFVSPREVAEAIARQSGRP 61 Query: 545 WESIDAWQIPSSL--IAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + ID Q P + + + ++A VLP ++ DEL + + D ++ ++R+ Sbjct: 62 Y--IDLSQYPPTRESLRILDKNIAKQLEVLPFEIDKDELHIAMTNPYDINAIDVVSRRTN 119 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYV 657 KV+ + D +L + H +L EQ + + Y+ Sbjct: 120 LKVKVYVA-------------------DKETLLKSIEIHYFLLEQPIDQTVKSYI 155 >UniRef50_D2R471 Type II secretion system protein E n=6 Tax=Planctomycetaceae RepID=D2R471_9PLAN Length = 573 Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 38/145 (26%), Positives = 73/145 (50%), Gaps = 1/145 (0%) Query: 492 QILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 +ILL +V++++QL+ + + L ++ G S E + +A+A+++G + + Sbjct: 9 EILLRRRVVSQDQLNEGRQVAKDTNANLSDVLIRLGYASGEDVMRAVAQEHGREYVDLSE 68 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIV 610 IP +I +P SVA A+LPL + D L V D D ++ L + RKV + Sbjct: 69 VTIPEDVIELVPESVARENAILPLSEDEDSLKVIVSDPYDIDTIEKLRFILNRKVDIALA 128 Query: 611 LRGQIVTGLRHWYARRRGHDPRAML 635 R +I+ + +Y++ G ++L Sbjct: 129 PREKILEAINKYYSQIEGESADSVL 153 >UniRef50_B9M6X5 General secretory system II protein E domain protein n=2 Tax=Geobacter RepID=B9M6X5_GEOSF Length = 389 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 39/140 (27%), Positives = 73/140 (52%), Gaps = 2/140 (1%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++L+E+ +IT+ +LD L+++V G ++G +++ G I E+LAQ L+++ GV Sbjct: 5 LGEMLVESGIITQAELDETLKSQVIFGGKIGTNLIEMGYIEEEELAQFLSKKLGVPCAGN 64 Query: 549 DAW-QIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + + + +P + Y V+PL L+ +L V ED + S+ ++ G V Sbjct: 65 EQLINLHPGALKLIPKEIVRKYRVVPLGLDKKKLYVAMEDPSNLASIDEISFMTGFIVMP 124 Query: 608 VIVLRGQIVTGLRHWYARRR 627 +I I+ L Y +R Sbjct: 125 LIATELSIILALEKHYGIKR 144 >UniRef50_A9G5Y5 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G5Y5_SORC5 Length = 527 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 42/140 (30%), Positives = 71/140 (50%), Gaps = 6/140 (4%) Query: 490 LGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LGQ+L++ ++IT++ L+ L + R +G RLG ++ QGLI+ QL Q L+ Q V W S+ Sbjct: 8 LGQLLVDARMITQDALERTLEQQRTDGRRLGTLLVEQGLINETQLTQILSHQLAVPWVSL 67 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPL-----RLENDELIVGSEDGIDPVSLAALTRKVGR 603 + L+ +P VA Y ++P+ R + + L V +D + + G Sbjct: 68 LHIEFSRQLLNLVPHDVAERYCLVPIYVRHVRNQGETLYVAMDDPTNEDGMKECMAFSGL 127 Query: 604 KVRYVIVLRGQIVTGLRHWY 623 VR +I I +R +Y Sbjct: 128 PVRAMIAPPSDIRNAIRVYY 147 >UniRef50_Q1D133 General secretion pathway protein E, N-terminal domain protein n=2 Tax=Cystobacterineae RepID=Q1D133_MYXXD Length = 293 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 38/99 (38%), Positives = 58/99 (58%), Gaps = 5/99 (5%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQN 541 +T S RPLG+ILLE V+ QL L + E + LG +++ +GL S + + LAEQ Sbjct: 9 ETGSRRPLGEILLEQGVLNRAQLRVGLVHHHEVHVPLGRALVREGLCSGADVLRGLAEQF 68 Query: 542 GVAWESIDAWQIP--SSLIAEMPASVALHYAVLPLRLEN 578 GV +++D + P S + +PA VA Y V+PLR++ Sbjct: 69 GV--DAVDLERTPPDSRRLNHIPARVARQYRVVPLRIDK 105 >UniRef50_C6E8N7 General secretory system II protein E domain protein n=3 Tax=Geobacter RepID=C6E8N7_GEOSM Length = 345 Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 36/96 (37%), Positives = 55/96 (57%), Gaps = 2/96 (2%) Query: 490 LGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGV-AWES 547 LG++LL+ +TE+QL+ L + + G RLG +++ GL+ E+LA+ L+EQ GV Sbjct: 5 LGEMLLKVGTLTEDQLEQVLNAQSIYGGRLGTNLVEMGLVEEEELARLLSEQLGVPCAHP 64 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIV 583 + IP SL+ P + Y VLPL L+ L V Sbjct: 65 SELSSIPESLLKMFPLELVQRYRVLPLALDGKRLTV 100 >UniRef50_Q1D9E1 General secretory pathway protein E n=17 Tax=Proteobacteria RepID=Q1D9E1_MYXXD Length = 605 Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 3/141 (2%) Query: 488 RPLGQILLE-NQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 RPLG+IL +TEE+L AL + E G R+G +++ +S E +A+AL Q + + Sbjct: 38 RPLGEILRAIVPSLTEEKLQEALAIQDEKGQRIGEALVGMKAVSEEDVAKALGHQLDLPY 97 Query: 546 -ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 I A ++ + L+ +P + A +LPL LE D + V D +D +L + +G+ Sbjct: 98 LARIFAEEVDAELVKRIPINFAKQSRILPLSLEGDTVAVAVADPLDTAALDHVRVLLGQS 157 Query: 605 VRYVIVLRGQIVTGLRHWYAR 625 V I L I + Y R Sbjct: 158 VSQRIALGSTITDAINSVYDR 178 >UniRef50_UPI0001B50A66 glycosyl transferase family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B50A66 Length = 439 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 74/300 (24%), Positives = 119/300 (39%), Gaps = 56/300 (18%) Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDY--ENYHIFVGTYPNDPDTQRDVDE 119 + + P ++++PA +E VI + E DY E +FV +D T + +E Sbjct: 53 DTFIPPRLSFSVLLPARHEEDVIQSTIERVVRA-DYPAELLEVFVICSQDDDGTVKKAEE 111 Query: 120 VCARFP--NVH--KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVIS 175 + +H +VV GP +K LN T ++AN F DAED I Sbjct: 112 KIDQLAREGLHNVRVVVFDDGPINKPHGLN------TALPQTANKVVTIF---DAEDDIH 162 Query: 176 PMELRLFNYLV--ERKDLIQIPVYPFEREWTHFTSMTYIDEF----SELHGKDVPVREAL 229 P RL N ++ ER ++Q V + ++++ ++ F S LH A Sbjct: 163 PKIFRLVNTVMVKERVRVVQAGVQLMNYQSNWYSTLNVLEYFFWFKSRLH------YHAH 216 Query: 230 AGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVV 289 G +P G F+R + L +D ++LTED D+G R+ G Sbjct: 217 HGSIPLGGNTVFFARELLLRL------GGWDDRNLTEDADMGLRISAMG----------- 259 Query: 290 DEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLT 349 ER + + + +E P T +RQ++RW G + Q K W T Sbjct: 260 ----ERVRVVY------DDRYVTKEETPPTLGHFIRQRTRWSQGFM-QTLKKGTWKKMPT 308 >UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophilus torridus RepID=Q6KZU9_PICTO Length = 395 Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 73/276 (26%), Positives = 117/276 (42%), Gaps = 59/276 (21%) Query: 67 DEKPL-AIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP 125 + KPL +I+VPA NE VIG E + Y+N+ +FV +D DT R + + R Sbjct: 44 NYKPLVSIIVPAKNEETVIGRCIE-SILGQAYDNFELFVVVDNSDDDTYR-IAKSYERDG 101 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--LFN 183 VH V R G +KA LN +++ E A + DA+ V+ L+ ++ Sbjct: 102 RVH--VFERHGNLTKASALNYAY-SMSHGEIIATY--------DADTVLEKNTLKNAVYG 150 Query: 184 YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQ------VPSAG 237 D++Q RE FT + IDE + V+ ++ G+ VP AG Sbjct: 151 MRYMDADVLQGYNTYINREENIFTRLAAIDE--------IIVKVSMIGRMYLHLFVPVAG 202 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F R + ++ +G LTED + G R+ A +R + Sbjct: 203 SNQYFKRETIR-IIGGWNG-----NFLTEDLESGVRM-----------------AAKRMR 239 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIG 333 +L A+ V + P T+S ++Q+ RW+ G Sbjct: 240 SAYLPSAK------VYQETPATYSEYIKQRIRWLRG 269 >UniRef50_C6PR94 Glycosyl transferase family 2 n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PR94_9CLOT Length = 743 Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 42/161 (26%), Positives = 77/161 (47%), Gaps = 8/161 (4%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALA 538 + + +S P+G++L+EN IT+EQL AL R G RLG +L G I E+L + LA Sbjct: 113 LNSNYKSKLPIGKMLVENNEITKEQLIKALDLQRKSGGRLGDILLFLGFIKPERLCRYLA 172 Query: 539 EQNGVA--WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAA 596 QN V ++ D ++ ++P +AL Y + L N+ ++ ++ + L Sbjct: 173 TQNNVGRIGKNFDI-----NVSKKLPYKLALKYNAIILNSRNNCYVIAVKELLSWKQLKE 227 Query: 597 LTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYN 637 + + + V V+ +I Y +++ + LY+ Sbjct: 228 IEGYLHKPVEQVLATMLEIDNFWNIVYRKKQSEESVFKLYD 268 >UniRef50_Q7UE44 General secretion pathway protein E n=1 Tax=Rhodopirellula baltica RepID=Q7UE44_RHOBA Length = 587 Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 40/142 (28%), Positives = 69/142 (48%), Gaps = 4/142 (2%) Query: 487 LRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 ++ +G IL+E ++T EQ+D A + G+ +G ++ Q LIS QL AL+EQ V + Sbjct: 1 MKRIGDILVELNILTNEQMDAAFAGKPRGVMIGDWLVRQSLISNAQLGAALSEQFSVPFV 60 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVG--SEDGIDPVSLAALTRKVGRK 604 ID + + +P A A + + + + L + + D I+ ++ A L G K Sbjct: 61 DIDFSSVNPQVARLLPEDFARSQASVAIDVSDRMLTLAMVAPDDIETIAEAELM--TGYK 118 Query: 605 VRYVIVLRGQIVTGLRHWYARR 626 +R V+ L + L Y R Sbjct: 119 IRPVVALEDDVRDLLNRIYDDR 140 >UniRef50_A5G3U1 General secretory system II, protein E domain protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5G3U1_GEOUR Length = 378 Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 39/136 (28%), Positives = 70/136 (51%), Gaps = 6/136 (4%) Query: 492 QILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 +LL+ +I EQ D AL+NRV G ++G S++ G + + LA+ L+++ V + IDA Sbjct: 7 DMLLDAGLINREQFDEALKNRVLYGGKIGTSLIELGYVREDDLARFLSKKLAVPF--IDA 64 Query: 551 ---WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 IP +I +P ++AL Y V+P+ + L + D D ++ ++ G + Sbjct: 65 DRLLTIPPEIIRLIPRNIALTYGVIPIHRDKKRLFLVMSDPADLKAIDEISFITGFIINP 124 Query: 608 VIVLRGQIVTGLRHWY 623 V ++V L +Y Sbjct: 125 VTAPEVRLVQALGKYY 140 >UniRef50_Q08RH0 Gspii_e N-terminal domain family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08RH0_STIAU Length = 590 Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 52/167 (31%), Positives = 87/167 (52%), Gaps = 12/167 (7%) Query: 490 LGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++L++ ++IT + L+ AL ++V G RLG ++L GL+S + LA+ L + +G A S Sbjct: 3 LGELLIQEKLITRQGLEEALESQVVHGGRLGTNLLELGLLSEKDLARLLGQLHGCAHASG 62 Query: 549 DAWQIPSSLIAEMPASVALHYA----VLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + P +L V L+ A LP+R++ L V + D L AL K G++ Sbjct: 63 ELTPEPQAL-----KLVNLNDADKRDYLPMRVDATRLSVAVMNPHDYAMLDALAFKTGKR 117 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGE 651 V V+V ++ LR + R RA+ NAV+ ++ +GE Sbjct: 118 VVPVVVPEFRMNQLLRRYCKAFRPL--RAIDMNAVRPSKTLQEASGE 162 >UniRef50_B3E4P2 Response regulator receiver protein n=1 Tax=Geobacter lovleyi SZ RepID=B3E4P2_GEOLS Length = 281 Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 4/166 (2%) Query: 490 LGQILLENQVITEEQLD--TALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ-NGVAWE 546 LG+IL+ +++ ++ AL NR E R G + +GLI+ +L+ ALAEQ N Sbjct: 7 LGEILVNKGILSPLTVERMIALANR-EQKRFGWFLEDKGLITGHELSAALAEQFNMKHLT 65 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 SI+ + P L++ + AL + + PLR E L++ D D + + G V Sbjct: 66 SIEQYSYPKELLSLITPETALEFNLFPLRQEGSNLLLAVTDPTDMRMAHTIAKNQGMTVV 125 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEI 652 +V R Y R+ P+ LT + EI Sbjct: 126 PAVVSREAFFAAFCKHYLGRQIQKPKGETVLIADDDKLTREMLKEI 171 >UniRef50_B0VJ41 Type IV pilus biogenesis protein PilB n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJ41_9BACT Length = 569 Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 37/140 (26%), Positives = 71/140 (50%), Gaps = 2/140 (1%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGV-AWES 547 LG IL+ ITEEQL AL + GL+LG +++ G ++ +L +AL +Q G + Sbjct: 10 LGDILVHEGYITEEQLKDALLKQGNFGLKLGETLIKLGYLTENELLEALHKQLGYDVVQD 69 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + + ++++ +P A VL LR E D ++V D + + +L + +G+ ++ Sbjct: 70 KELMDLDINIVSSIPEPYAKENKVLALREEGDGVVVAMTDPENLIVSDSLEKILGKNIKP 129 Query: 608 VIVLRGQIVTGLRHWYARRR 627 V++ + + +Y R Sbjct: 130 VLIGNSSLQDAIEKYYKSIR 149 >UniRef50_C6XAP3 Type II secretion system protein E n=1 Tax=Methylovorus sp. SIP3-4 RepID=C6XAP3_METSD Length = 816 Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 37/145 (25%), Positives = 74/145 (51%), Gaps = 2/145 (1%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQ 540 ++R + LG+ L + ++I+E+QL AL + E G+ LG ++ G++ + L LA++ Sbjct: 211 ESRPILKLGEALRQLELISEDQLQHALNKQKENRGIPLGRILVDMGIVDEQTLKGTLAKK 270 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 G+ + S+ + + I + A VA + ++PL + L+V ED ++ ++ + Sbjct: 271 LGIPYVSLSKFNFDPNAIRLIGAPVARKHLLIPLCMYEGALVVAFEDPMNVKAIDEVRFL 330 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYAR 625 K + R IVT + +Y R Sbjct: 331 TQMKTLPAMASREDIVTAIDSFYGR 355 >UniRef50_Q1Q109 Strongly similar to general secretory system type II protein, ATPase component n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q109_9BACT Length = 582 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 37/152 (24%), Positives = 74/152 (48%), Gaps = 4/152 (2%) Query: 478 PSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQA 536 P D + R GQ+L EN TE+Q+ AL + G LG ++ ++ Q+ Q Sbjct: 9 PDAKSDAQR-RLFGQLLKENGFATEDQIQEALAVQKQNGGLLGDILISMNYVTDPQIMQV 67 Query: 537 LAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE--LIVGSEDGIDPVSL 594 L+E GV +I+ ++P +I +PA++A Y ++P+ E ++ + + + + +L Sbjct: 68 LSEYLGVEIVNIEDREVPGDVINLVPAAIAQLYRIIPISYEQEKQVITIAQANALAIETL 127 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGLRHWYARR 626 L + V+ V+ + + L +Y ++ Sbjct: 128 DDLRLVLKLNVKPVLCHKDSVARALEKYYPKK 159 >UniRef50_Q13CF3 Glycosyl transferase, family 2 n=10 Tax=Bradyrhizobiaceae RepID=Q13CF3_RHOPS Length = 686 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 103/441 (23%), Positives = 164/441 (37%), Gaps = 79/441 (17%) Query: 55 YPRMSYRELYK-PDEK-PLAIMVPAWN--ETGVIGNMAELAATTLDYENYHIFVGTYPND 110 +PR + R L + PD P+ +V A + E V G +A + A E + + PND Sbjct: 266 WPRAAQRPLRRRPDATLPIYTVVAALHREERSVAGLVAAIEALDYPREKLDVILVIEPND 325 Query: 111 PDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFI-LHD 169 T+ + + R P++ ++ P +K LN L FA FI ++D Sbjct: 326 LATRAAIARLGPR-PHLRVLIAPPVAPQTKPKALNCAL----------AFARGSFIAVYD 374 Query: 170 AEDVISPMELRLFNYLVERKDLI------QIPVYPFEREWTHFTSMTYIDEFSELHGKDV 223 AED P +LR +R + + W S T+ E++ + + Sbjct: 375 AEDQPEPGQLRAALDAFDRHGATTACAQASLCIDNITHSWL---SRTFAAEYAGQFDRLL 431 Query: 224 PVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIF 283 P + +P G F + A+ +D ++TED D+GFRL G + Sbjct: 432 PGLSEMNLPLPLGGTSNHFRTDVLRAI------GGWDPYNVTEDADLGFRLARFGYRSVS 485 Query: 284 VRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHK 343 +EA P TF RQ++RW+ G + Sbjct: 486 FASTTYEEA------------------------PITFDNWRRQRARWMKGFI------QT 515 Query: 344 WTSSLTLNYFLWRD--RKGAIS-NFVSFLAMLVMI--QLLLLLAYESLWPDAWHFLSIFS 398 W + LWRD +G ++ N + +L + L L +A SL AW L Sbjct: 516 WLVHMRHPLRLWRDIGPRGVLALNLIVGGNLLTALVHPLFLGIALASL-AGAWLELPAVL 574 Query: 399 GSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRL----FWGNLINFMANWR 454 + L WL V V+ + G G Q L + L +W + +A W Sbjct: 575 QPSPPSPLHWLAIAAGYASTV--VVGLRGLAGRRQLRLGFVLLLTPAYW--ICLSIAAWC 630 Query: 455 ALKQVLQHGDPRRVAWDKTTH 475 A+ Q + R W+KT H Sbjct: 631 AVAQFVW----RPYYWEKTVH 647 >UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038DF5C Length = 405 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 74/296 (25%), Positives = 120/296 (40%), Gaps = 60/296 (20%) Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 ++YKP ++I+VPA NE VI E + Y N+ +FV + +T + E Sbjct: 52 KMYKP---MVSIIVPAKNEETVIKRTIE-SILNQTYTNFELFVVVDNSSDNTYKIAKEYE 107 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISP--MEL 179 +R V+ V RP SKA LN FE++ A + DA+ ++ P +E Sbjct: 108 SRDKRVN--VFNRPDGKSKASALNFC------FEKTKGEVIATY---DADTMLLPNTLEN 156 Query: 180 RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQ------V 233 ++ D++Q RE FT + IDE + V+ L G+ V Sbjct: 157 AVYGMNYFNVDVLQGYNSYINREENIFTRLAVIDE--------ILVKATLIGRTHFNLFV 208 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 P AG F R+ + ++ D LTED + R+ + Sbjct: 209 PVAGSNQYFKRKVIESIGGWDDNF------LTEDLESSIRI-----------------SN 245 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLT 349 R + +L A+ ++E P ++S RQ++RW+ G F + K S T Sbjct: 246 ARYKSAYLGSAK-----ALQET-PASYSEYFRQRTRWLRGYHQVFFHSKKRFSKFT 295 >UniRef50_B5E974 General secretory system II protein E domain protein n=2 Tax=Geobacter RepID=B5E974_GEOBB Length = 391 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 35/140 (25%), Positives = 72/140 (51%), Gaps = 2/140 (1%) Query: 490 LGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++L++ IT QL+ L+ + + G R G +++ G + ++LA L+++ G+A S Sbjct: 3 LGELLVDAGKITPTQLEETLKGQAIFGGRFGTNLVEMGYLDEQELAHFLSQKTGIAHTSP 62 Query: 549 DA-WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + +IP ++ +P Y V+P+ L N +L + D D ++ + G V Sbjct: 63 EQLMEIPPHVVGAVPEEYVRKYRVMPVALNNRKLTLAMLDPSDFQAIDEIAFATGYIVVP 122 Query: 608 VIVLRGQIVTGLRHWYARRR 627 VI ++++ + +Y +R Sbjct: 123 VIAPELRMLSAMEKYYGIKR 142 >UniRef50_Q1D3E0 General secretion pathway protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D3E0_MYXXD Length = 251 Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 34/105 (32%), Positives = 64/105 (60%), Gaps = 4/105 (3%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++L++ V+ E QL AL + + G +LG ++ L+S + L +AL++Q G+ ++ Sbjct: 6 LGELLIKANVLQESQLKAALAEQAKWGGKLGEILVRMSLVSEDILVRALSKQLGMPAVNL 65 Query: 549 DAWQ-IPSSLIAEMPASVALHYAVLPLRLEND--ELIVGSEDGID 590 DA Q + + A++PA A ++VLPL++ +D L+V D ++ Sbjct: 66 DAVQMVQPHVKAKIPAQTARDFSVLPLQVRDDGKSLVVAMSDPLN 110 >UniRef50_C0QER1 PilB n=10 Tax=Proteobacteria RepID=C0QER1_DESAH Length = 739 Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 40/163 (24%), Positives = 72/163 (44%), Gaps = 7/163 (4%) Query: 463 GDPRR-VAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGG 520 G P R A KT D S G R +G++L + IT Q +AL +++ G RL Sbjct: 5 GQPSRSTALTKTIKD-QSGAGKVR----IGELLSKEGQITSNQFQSALSQHKKTGTRLSS 59 Query: 521 SMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE 580 +L G I E + L + + ++ +P +A Y V PL ++ +E Sbjct: 60 VLLTMGFIDPETIINVLGRIYNYPVVRLADIKPDPKILKLLPFDIAKRYMVFPLGMKGEE 119 Query: 581 LIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWY 623 L+V + D ++ L ++VG+ ++ + ++ R +Y Sbjct: 120 LVVTMTEPTDTTAVEELQQEVGKTLKISVSTENDVIQAYRDFY 162 >UniRef50_A9DFB5 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DFB5_9RHIZ Length = 682 Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 91/381 (23%), Positives = 144/381 (37%), Gaps = 75/381 (19%) Query: 136 GPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLFNYLVERKDL 191 GP +K L L R A + ++DAED +P +L F ++ Sbjct: 341 GPRTKPKALQYAL-------RGARGSL--IAVYDAEDKPAPGQLLEAWATFRAGDDQLGC 391 Query: 192 IQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALL 251 +Q P+ +++ S + E+S L +P +P G F R A L Sbjct: 392 LQAPLAVANLR-SNWISGLFALEYSGLFRVLIPFLARTGMPIPLGGTSNHFKRAA----L 446 Query: 252 ADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMIC 311 + G +D ++TED D+G RL G RT + C Sbjct: 447 ENTGG--WDPHNVTEDADLGLRLHAYGY-------------------------RTGILKC 479 Query: 312 VR-EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-RKGAISNFVSFL 369 E P RQ++RW+ G W ++ W A F + Sbjct: 480 ATVESCPVQLDVWKRQRTRWLKGWA------QTWLVAMRNPVATWSSLGPAAFVVFQLLI 533 Query: 370 AMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMT-----LLWLNFGLMVNRIVQRVIF 424 A +++ L+ L Y + A+ F I SG A +++ LLW++ VN ++F Sbjct: 534 AGMLISALVHPLMYLFI---AFSFAWIASGHASVVSDMHAVLLWMD---AVNIFGNYLLF 587 Query: 425 -VTGYYGLTQGLLSVLRLFWGNLINF------MANWRALKQVLQHGDPRRVAWDKTTHDF 477 TG++ T S L+ W +I +A WRAL Q+L + W+KT HD Sbjct: 588 PATGWFAFTAYERSHLKRHWLLMIPAYWLLISLAGWRALTQLLANAH----LWEKTPHDA 643 Query: 478 PSVTGDTRSLRPLGQILLENQ 498 S + P + + ENQ Sbjct: 644 ESSAKTQQDPLPTCEAVPENQ 664 >UniRef50_B5YIG2 Type IV-A pilus assembly ATPase PilB n=4 Tax=Bacteria RepID=B5YIG2_THEYD Length = 572 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 30/109 (27%), Positives = 60/109 (55%), Gaps = 1/109 (0%) Query: 490 LGQILLENQVITEEQL-DTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 +G LL+ I+E+QL D ++EG+++G +++ G I+ ++L ++++E G I Sbjct: 12 IGVFLLKKGKISEKQLIDAQAVQKIEGIKIGAALIKLGYITEDELVESMSELYGYPVFKI 71 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 D+++I ++ +P V Y VLP E + + V D + ++L L Sbjct: 72 DSYKIDPLVVKLLPEDVIRKYKVLPFLREGNIIRVLITDPANEIALEQL 120 >UniRef50_A5GB67 Response regulator receiver protein n=4 Tax=Geobacter RepID=A5GB67_GEOUR Length = 280 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 31/105 (29%), Positives = 60/105 (57%), Gaps = 2/105 (1%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNG- 542 ++ + LG+I + + +ITE+ L+ AL R++ + ++G + +++ E+LA ALA Q G Sbjct: 2 KNRKRLGEIFVASGLITEKTLERALARSKRQNKKVGMVLEEIEMVTGEELASALAVQYGH 61 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSED 587 + + P L +P VA+ + + PL++EN++L V D Sbjct: 62 RVVSNFARYAFPPELFKLIPEDVAMQHLLFPLKIENNKLAVAMAD 106 >UniRef50_A8URM8 Putative uncharacterized protein n=3 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8URM8_9AQUI Length = 552 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 42/139 (30%), Positives = 71/139 (51%), Gaps = 5/139 (3%) Query: 488 RPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG++L E ++++QL+ AL ++ G LG ++ +S ++LAQA+A Q G E Sbjct: 4 KKLGELLQELGFLSQDQLEVALEVQKLNGESLGEILVDLSFVSPQELAQAIAHQAG--RE 61 Query: 547 SIDAWQIPSSLIA--EMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 ID P SL A + + A VLPL+LE+ L + D + + + RK G + Sbjct: 62 FIDLSLYPPSLDALRLIDRNTAKQLEVLPLKLEDGRLKLAVSDPFNVNIIDLVKRKTGLE 121 Query: 605 VRYVIVLRGQIVTGLRHWY 623 V + R I+ + +Y Sbjct: 122 VDIYVADRESILRSIEIYY 140 >UniRef50_B6QYB3 Bacteriophage N4 receptor n=1 Tax=Pseudovibrio sp. JE062 RepID=B6QYB3_9RHOB Length = 85 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 1/68 (1%) Query: 413 LMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQH-GDPRRVAWD 471 L+ +R++QR I+ +YGL LR F +IN++A RA+K ++H G + WD Sbjct: 3 LLAHRLIQRHIWTGYHYGLRALAPVTLRYFVSIIINYIAMMRAIKTWIRHLGTGEVIGWD 62 Query: 472 KTTHDFPS 479 KT HD+P Sbjct: 63 KTAHDYPD 70 >UniRef50_C6J4S3 Glycosyl transferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J4S3_9BACL Length = 413 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 77/327 (23%), Positives = 128/327 (39%), Gaps = 69/327 (21%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 L VI I+L VI+ + G+ F + + +YR+ ++ Y K A++ Sbjct: 2 LDVILISLQVILAVVGVYQFGLAL----------FGMYRKKNKVQYEP-----SKSFAVL 46 Query: 75 VPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 V A NE V+G + E L E Y +FV D D R ++ V Sbjct: 47 VAAHNEEKVVGALMENLKQMNYPKELYDVFVIC-----DNCSDNTANIVRSHGMNACVRT 101 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLI 192 P K + +L + + R + ++ DA++++ P LR + N L +I Sbjct: 102 NPNLRGKGYAIEWMLKQLWKMPRQ----YDAVVMFDADNLVHPDFLREMNNDLCAGARVI 157 Query: 193 Q--IPVYPFEREW---THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 Q I E W ++ S Y + +L ++ + L G G CF Sbjct: 158 QGYIDTKNPEDSWITASYGISYWYCNRLWQLSRTNLKMANFLGG------TGMCFE---- 207 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVD-EAKEREQRKFLQHART 306 T LL + I + SL ED + R E+G+ +PV + +AK +++ Sbjct: 208 TELLKE---IGWGATSLVEDLEFTMRCVERGV------YPVFNYDAKLFDEK-------- 250 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIG 333 P TF + RQ+ RW+ G Sbjct: 251 ----------PLTFKASARQRLRWMQG 267 >UniRef50_B5YHZ6 Type IV pilin n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YHZ6_THEYD Length = 546 Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 32/94 (34%), Positives = 53/94 (56%), Gaps = 2/94 (2%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG ILLE +++TE++L+ AL ++ G LG + G I++ +LA+ LA Q+G+ + +I Sbjct: 3 LGDILLEKKLLTEQELNIALNVQKITGQVLGKCLTSLGFITSSELAEVLAIQHGLEYINI 62 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELI 582 I L+ P +V LP+ E D +I Sbjct: 63 REHPIEMGLLKVFPKNVTESARFLPIE-ETDGII 95 >UniRef50_A3DEG0 Type II secretion system protein E n=5 Tax=Clostridia RepID=A3DEG0_CLOTH Length = 787 Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 39/158 (24%), Positives = 74/158 (46%), Gaps = 6/158 (3%) Query: 469 AWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGL 527 +D+ T++ + D +G IL+ VIT++QL+ AL + G +G ++ QG Sbjct: 201 GFDQNTYNESGIFKDK-----IGNILVRAGVITQDQLENALSIQKKSGGLIGQILVKQGY 255 Query: 528 ISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSED 587 I L + L +Q GV + I+ +I +I + ++A + V+P+ + L V D Sbjct: 256 IDRRSLYEFLQKQMGVEYVDIEGIEIDEDIIGLVSPNLAKTHKVIPIEKVDGNLKVAMSD 315 Query: 588 GIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYAR 625 ++ S+ L G ++ + QI L +Y + Sbjct: 316 PMNIFSIDDLRLTTGLEIIPCLADEEQISAQLEKYYGK 353 >UniRef50_Q0F0L3 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F0L3_9PROT Length = 804 Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 37/132 (28%), Positives = 67/132 (50%), Gaps = 2/132 (1%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 R +R LG+ILLE ++I E L AL + G RLG +L +I+ +QL LA++ + Sbjct: 248 RKMR-LGEILLEAKLINEADLKNALDEQKAHGHRLGEILLSTEVITEDQLLDVLAKKFRL 306 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 ++ ++I + A + +V Y +LP+ + L + D + + ++ K G+ Sbjct: 307 PTVDLETYEINPAAGALIERAVVEKYGILPIDTDAHSLTIALSDPMGLEAYDTISFKTGK 366 Query: 604 KVRYVIVLRGQI 615 KV V+ Q+ Sbjct: 367 KVHEVMAKASQL 378 >UniRef50_B0TEE9 Type ii secretion system protein e, putative n=12 Tax=Firmicutes RepID=B0TEE9_HELMI Length = 570 Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 33/111 (29%), Positives = 59/111 (53%), Gaps = 4/111 (3%) Query: 488 RPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 R LG +LLE +IT+EQL AL + G RLG +++ G ++ + + + L Q G+ Sbjct: 7 RKLGDLLLEYNLITDEQLQQALAEQKKRGERLGQTLVRLGFVTRQMINEVLEFQLGIPTI 66 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 S+ + + + +P S+ + LP++ + L V +DP++L AL Sbjct: 67 SLLQYPLHPEVFKLLPESLCRRHKCLPVKRSGNRLTVAM---VDPLNLPAL 114 >UniRef50_B3E1K8 Response regulator receiver protein n=5 Tax=Geobacter RepID=B3E1K8_GEOLS Length = 340 Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 41/146 (28%), Positives = 72/146 (49%), Gaps = 7/146 (4%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 D + LG IL+ ++I+E+ L+ AL R EG +LG + G+I+ +L +AL Q Sbjct: 61 ADINQKKQLGDILVRAKLISEKTLERALERQHTEGKKLGEVLEEMGVITELELVEALGRQ 120 Query: 541 NGVAWESIDAWQ---IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 G ++++ + P+ I +P + V PL+ ++ L V D D + + Sbjct: 121 FG--FKTVTDFSNRTYPAETINLLPTEFVMKRLVFPLKHKDMMLAVAITDPFDGETTDMI 178 Query: 598 TRKVGRKVRYVIVLRGQIVTGL-RHW 622 R G +V VI R +I+ + RH+ Sbjct: 179 ARITGLQVVPVIATRKEILEAIARHY 204 >UniRef50_Q3A899 Type II secretory pathway and PulE/Tfp pilus assembly pathway ATPase PilB n=14 Tax=Bacteria RepID=Q3A899_PELCD Length = 578 Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 34/110 (30%), Positives = 59/110 (53%), Gaps = 4/110 (3%) Query: 489 PLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 PLG+IL + +I EQL+ L R + EG+RLG + + G ++ LA+ALA Q + Sbjct: 9 PLGRILCDQGIINAEQLEHLLSRAKAEGVRLGEAGIEAGCLTDRDLARALARQFYFDYVD 68 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 ++ + S L+AE+ + + +PL + L + +DP ++A L Sbjct: 69 LENFVPDSELLAEISPELLPRFLFMPLHRDVHGLHIAV---VDPTAVAEL 115 >UniRef50_Q3SKS0 Pilus assembly pathway ATPase PilB n=3 Tax=Proteobacteria RepID=Q3SKS0_THIDA Length = 577 Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 35/137 (25%), Positives = 65/137 (47%), Gaps = 1/137 (0%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG +L+E +VI+ LD AL + G RLG ++ GL +AQALA Q + + + Sbjct: 20 LGDLLVEQKVISAADLDIALTAQKKSGRRLGRIIVESGLAGENDIAQALARQLAIPFVDL 79 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + S++ + + A + +PL ++ VG D D + + R + ++ Sbjct: 80 RKFNPDPSILQLLGETQARRFRAIPLGRREGDIFVGMADPTDLFAYDEVARLIEGGIQLA 139 Query: 609 IVLRGQIVTGLRHWYAR 625 +V G +++ + Y R Sbjct: 140 VVAEGDLLSAIDRLYRR 156 >UniRef50_UPI0001C3693E pili biogenesis protein PilB-like ATPase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C3693E Length = 572 Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 39/155 (25%), Positives = 72/155 (46%), Gaps = 6/155 (3%) Query: 490 LGQILLENQVITEEQLDTALRNRVEGL---RLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 +G++L+E IT+EQL+ L+ G RL + G I+ +L L G+ Sbjct: 6 IGEVLVEQGAITKEQLNEGLKLLKAGTNDRRLAEVLTDLGYITERELLDVLGRGMGLEVI 65 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++ + I + ++P +AL Y V+ + +E L V + D +D +L + +++ Sbjct: 66 DLEFFHIDERAVEKIPKQLALKYTVMAVSMEGSGLTVATADPLDLYALEDIRLVTNMRIQ 125 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQH 641 ++ R QI + Y+ G D RA A +H Sbjct: 126 LILAERTQIRHAIELNYS---GIDARAAARLASEH 157 >UniRef50_Q181B0 Type IV pilus assembly protein n=5 Tax=Clostridium difficile RepID=Q181B0_CLOD6 Length = 561 Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 36/136 (26%), Positives = 63/136 (46%), Gaps = 1/136 (0%) Query: 490 LGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 +G L+E ITEEQL AL + G RLG ++ +GLI + L L E + + Sbjct: 10 IGDKLVEKGYITEEQLKWALSEQKNSGKRLGEFLVQEGLIDSNLLISVLKELLDIESIFL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + +I + +P ++ Y V P +++ +++ + D D ++ + R G+ V Sbjct: 70 EGTEIDTLATKMVPENICKRYTVFPFKIDGNKICLAMSDPQDREAVQDVRRMSGKDVEIF 129 Query: 609 IVLRGQIVTGLRHWYA 624 I I + H YA Sbjct: 130 ISSTEDINKAIGHAYA 145 >UniRef50_Q0ASE8 Glycosyl transferase, family 2 n=1 Tax=Maricaulis maris MCS10 RepID=Q0ASE8_MARMM Length = 537 Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 96/416 (23%), Positives = 157/416 (37%), Gaps = 68/416 (16%) Query: 73 IMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQR-DVDEVCARFPNVHKVV 131 IMV + E V+ ++A A+ +DY + DT+ V A P ++ Sbjct: 138 IMVALYREAAVLPDLARGLAS-IDYPTDRVAFKLVLEADDTETIHVARRMALDPRFEIII 196 Query: 132 CARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVE 187 P +K LN L + RS +HDAED P +LR F + Sbjct: 197 VPPGNPRTKPRALNYAL----RLCRSELV-----TIHDAEDRPDPYQLRRAAEAFRVADQ 247 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 R +Q P+ + RE T T ++ + H +P+ + L +P G F R A+ Sbjct: 248 RLACVQAPLNWYNREETWLTRQFALEYAAHFHAL-LPLYQRLGWPLPLGGTSNHFRRDAL 306 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 + +D ++TED D+G RL G + ++EA R Q R Sbjct: 307 VRV------GGWDAWNVTEDADLGLRLHAAGYRCGLIEPKTLEEAPLRLVPWVKQRTR-- 358 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV- 366 ++ Y AVR+ + W W L L GA+++ + Sbjct: 359 ---WIKGYAQTIGVLAVRRDTPW----------RRVWPGMLVLG--------GAVASALL 397 Query: 367 -SFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 + L++ +I L E+ A F+ SA + + + RI Sbjct: 398 HAPLSLACLIALATRAGPEAASLPALAFMLAGYASAITCAAVAMRRAGLPVRIRD----- 452 Query: 426 TGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 L+ + +W L+ +A RAL+Q+L DP R W+KT H ++T Sbjct: 453 ----------LAGMPAYW--LLQTLAAARALRQLLT--DPHR--WEKTEHGVSAMT 492 >UniRef50_C6E6L0 Response regulator receiver protein n=3 Tax=Geobacter RepID=C6E6L0_GEOSM Length = 292 Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 39/189 (20%), Positives = 91/189 (48%), Gaps = 10/189 (5%) Query: 474 THDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQ 532 T D S+ ++ +PLG+I +E ++T+ ++ + + + +G+RLG + + GL+S E+ Sbjct: 2 TADIESLNTTPQTRKPLGEIFVERGLLTKVSVERLIDHAKSKGIRLGELLEVIGLVSPEE 61 Query: 533 LAQALAEQNGV-AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDP 591 LA+ALA Q + +++ +P +A+ + + PL++++ L + D Sbjct: 62 LAEALAIQYRCRKISDFSKYAYSPAMLRLIPMEMAVKHTIFPLKMDDGRLGLAVADP--- 118 Query: 592 VSLAALTRKVGRKVRYVIVL----RGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQ 647 ++ L R++ + + ++L R +I + Y + P V+ L+ + Sbjct: 119 -TMDELFRQIAAQHKVKLILYVATRMEINRAIARHYLGQPATGPEGKTILLVEDDQLSRE 177 Query: 648 QAGEIWRQY 656 +I ++ Sbjct: 178 MVAKILTKH 186 >UniRef50_A1ALE6 General secretory system II, protein E domain protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ALE6_PELPD Length = 550 Score = 48.9 bits (115), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 35/136 (25%), Positives = 68/136 (50%), Gaps = 1/136 (0%) Query: 490 LGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LGQIL +++I+E + AL + G R G +++ G+ + E + AL+ Q + + + Sbjct: 10 LGQILTASRIISEIDILAALEEQARSGCRFGEALVRLGVATQEDVDWALSSQLDIPYIRL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + IA +PA++A ++ +PL DEL + D ++ ++ AL G +V Sbjct: 70 KRELVDPGAIALVPAAMARRFSCIPLFRAGDELNIAIADPLNRAAIQALELATGLRVSIS 129 Query: 609 IVLRGQIVTGLRHWYA 624 + L +I+ + Y Sbjct: 130 VALLREIMEMVDECYG 145 >UniRef50_C1XWL4 Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB n=4 Tax=Bacteria RepID=C1XWL4_9DEIN Length = 888 Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 5/103 (4%) Query: 488 RPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 RPLG++L+E + E ++ +L + R G RL +++ G I E LA++LA Q G + Sbjct: 337 RPLGEVLVELGYVKPEDIEESLQKQRQGGGRLEDTLIQSGKIKPEMLARSLAAQLGYPY- 395 Query: 547 SIDAWQIP--SSLIAEMPASVALHYAVLPLRLENDELIVGSED 587 ID + P S++ +P + Y V P +EN L+V +D Sbjct: 396 -IDPLEQPPDPSVMMMVPEATVRRYHVFPHHMENGTLVVLMKD 437 Score = 42.0 bits (97), Expect = 0.097, Method: Compositional matrix adjust. Identities = 57/234 (24%), Positives = 98/234 (41%), Gaps = 24/234 (10%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 GD R LG LL+ ++ +E+L A+ R+R G L + GL+S ++AQA+ E Sbjct: 7 GDKR----LGAALLDMGLLEDEELQKAIERHREIGGSLAEIVAEMGLLSERRVAQAIEEI 62 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 G+ + +IPS + +PA A +P + L V + +D + L L Sbjct: 63 FGIPLVELSEVEIPSEAKSLIPAEKARDLEAIPFAFDGRLLRVALLNPLDNLVLEELED- 121 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 L GQI+ + A R Y +H + + P + Sbjct: 122 ----------LTGQIIEPYQTTRASFR--------YALAKHYPELGLEVPAPPKAATPAE 163 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 ++L G ++ A+ L E+S LG+ L+ +G++S+ L + L Q Sbjct: 164 VKLGDLLVKKGWLSPQALQAALAEQEKSGELLGRVLMQKGLVSELQLYQALAEQ 217 >UniRef50_A1SEL1 General secretory system II, protein E domain protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SEL1_NOCSJ Length = 606 Score = 48.5 bits (114), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 103/440 (23%), Positives = 169/440 (38%), Gaps = 93/440 (21%) Query: 62 ELYKPDEKPL---AIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVD 118 E+ DE+ L I+VP + E G++ + LDY + V + D + Sbjct: 227 EIAAIDERHLPTYTILVPLYKEAGIVPRLVR-DINALDYPRTRLDVKLL-----CEEDDE 280 Query: 119 EVCARF------PNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAED 172 E R P+ H VV P +K N L T ++ ++ DAED Sbjct: 281 ETVQRIRDLQLPPHFHLVVVPDSQPKTKPKACNYGLQLAT-----GDYC----VIFDAED 331 Query: 173 VISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREA 228 P +L+ F+ + E +Q + F ++ T+ + +E+S +P A Sbjct: 332 RPDPDQLKKAIIAFSRVPENVVCVQAKLNHFNQDQNMLTAW-FANEYSMHFELVLPAMGA 390 Query: 229 LAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPV 288 +P G F VTA L + A+D ++TED D+G RL +G + Sbjct: 391 AESPIPLGGTSNHF----VTAKLRELG--AWDPFNVTEDADLGIRLHREGYRTAMIDSTT 444 Query: 289 VDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSL 348 ++EA +++ N I RQ+SRW G + W + Sbjct: 445 LEEA----------NSQVPNWI--------------RQRSRWNKGYI------QTWLVHM 474 Query: 349 TLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLL- 407 + L + + F+SF L M +LL W A L +F+ + ++ L Sbjct: 475 RAPFALL--SQTGLKGFLSF--NLTMGSAFVLLLNPIFW--ALTTLYVFTQAGFIEQLFP 528 Query: 408 -WLNFGLMVNRIVQRVIFV---------TGYYGLTQ-GLLSVLRLFWGNLINFMANWRAL 456 + + V +FV G +GLT+ LLS L+WG + +W A Sbjct: 529 GIIFYAASALLFVGNFVFVYLNVAGSLHRGEFGLTRTALLS--PLYWG-----LMSWAAW 581 Query: 457 KQVLQ-HGDPRRVAWDKTTH 475 K +Q +P W+KT H Sbjct: 582 KGFIQLFTNP--FYWEKTVH 599 >UniRef50_A3VPZ4 Putative uncharacterized protein n=2 Tax=Bacteria RepID=A3VPZ4_9PROT Length = 512 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 60/248 (24%), Positives = 100/248 (40%), Gaps = 26/248 (10%) Query: 66 PDEK--PLAIMVPAWNETGVIGNMAELAATTLDY--ENYHIFVGTYPNDPDTQRDVDEVC 121 P+E+ P I+ P ++E + ++ + LDY E I + +D T C Sbjct: 130 PEERLPPFTILCPVYDEAESLPHLVG-SLLLLDYPRERLDIKIILEADDRATIAAARTHC 188 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL-- 179 R P V+ P +K LN+ L + ++ +++DAED +P +L Sbjct: 189 -RAPMFDLVLVPPSAPRTKPKALNHAL-----WTAKGDY----IVIYDAEDRPAPDQLTL 238 Query: 180 --RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 R F L + +Q + + R+ T T + + E++ L +P AL+ VP G Sbjct: 239 AARTFAALPDHIACLQCRLNYYNRDTTILTRL-FALEYALLFDMTLPGLAALSAPVPLGG 297 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 L+A G F+V TED D+G RL G + ++EA + Sbjct: 298 TSNILR---TDILMAVGGWDPFNV---TEDADLGLRLHRAGYETRLLNSTTLEEATDETG 351 Query: 298 RKFLQHAR 305 Q R Sbjct: 352 AWLRQRTR 359 >UniRef50_Q0G6A6 Glycosyl transferase, family 2 n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0G6A6_9RHIZ Length = 692 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 87/374 (23%), Positives = 146/374 (39%), Gaps = 85/374 (22%) Query: 128 HKVVCARPG-PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LF 182 ++V+ PG P +K LN+VL + + +F +L+DAED P +L + Sbjct: 354 YRVILVPPGNPRTKPKALNHVLPIV-----AGDF----LVLYDAEDEPHPGQLEEAYDRY 404 Query: 183 NYLVERKDLIQIP--VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 R +Q P + +R W TS+ + E++ L +P +P G Sbjct: 405 RASDARLACLQAPLVIRNGDRNW--LTSI-FAMEYAGLFRAFLPWLARHRLPIPLGGTSN 461 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F A+L + G +D ++TED D+G RLK G + P +++A Sbjct: 462 HFK----VAVLREVGG--WDSHNVTEDADLGMRLKRAGYDIETISSPTLEDA-------- 507 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P+T S V Q++RW+ G V Q + H L + R+ Sbjct: 508 ----------------PETVSVWVPQRTRWLKGWV-QTYAVHMRHPMLLM-------REL 543 Query: 361 AISNFVSFLAM---LVMIQLLLLLAYESL-------WPDAWHFLSIFSGSAWLMTLLWLN 410 + FV F + ++ LLL LA+ + W W S LL L+ Sbjct: 544 GVKRFVVFQLLFHGMITAALLLPLAFGLIGFTIWLQWSTGWERTSA-------TALLVLD 596 Query: 411 FGLMVNRIVQRVIFVTGYYGLTQ-----GLLSVLRLFWGNLINFMANWRALKQVLQHGDP 465 + V + + G ++ L + ++W L +A +R L Q+ ++ Sbjct: 597 LAIFVGGYLSFLALTLRGMGTSELRPCVKWLPFVPIYW--LCVSVAAYRGLFQLFKNPH- 653 Query: 466 RRVAWDKTTHDFPS 479 AW+KT H S Sbjct: 654 ---AWEKTAHGLAS 664 >UniRef50_D0LP33 General secretory system II protein E domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LP33_HALO1 Length = 497 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 35/88 (39%), Positives = 52/88 (59%), Gaps = 4/88 (4%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV--AWE 546 +GQ LL ++T EQLD A+ R GLRLG ++ G + A+QLA++L G+ A E Sbjct: 19 IGQRLLAESLVTREQLDEAVALQRRGGLRLGTVLIDLGYLDADQLARSLGRWYGMAPALE 78 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPL 574 + A + P SL A +P S+A + +PL Sbjct: 79 AHFAARNP-SLQARLPPSLAARFGAIPL 105 >UniRef50_A3VHJ3 Glycosyl transferase, family 2 n=2 Tax=Rhodobacterales RepID=A3VHJ3_9RHOB Length = 684 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 71/304 (23%), Positives = 125/304 (41%), Gaps = 33/304 (10%) Query: 64 YKPDEKPL-AIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 + P +P+ +I+VP + E + G + + L T E I + +D TQ + Sbjct: 286 HDPRRRPVVSILVPLYREREIAGRLVKRLERLTYPRELLDICLIVEEDDTLTQETLSN-- 343 Query: 122 ARFPN-VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFI-LHDAEDVISPMEL 179 AR P + ++ R G +K LN LD FA I ++DAED ++ Sbjct: 344 ARLPAWMRQITVPRGGVRTKPRALNFALD----------FARGTIIGVYDAEDAPDADQI 393 Query: 180 -RLFNYLVERKDLIQI--PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 R+ E + + + T++ S + E++ +P E + VP Sbjct: 394 DRIVARFAEAPPRVACLQGMLDYYNARTNWLSRCFTIEYATWFRIVLPGMEKMGFAVPLG 453 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER- 295 G T F RR V L +D ++TED D+G RL G T V +EA R Sbjct: 454 GT-TLFFRRGVLEQLG-----GWDAHNVTEDADLGIRLARLGYTTELVETVTKEEANCRV 507 Query: 296 -----EQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTL 350 ++ ++++ + + +R+ P + + W + IVF G +H + + Sbjct: 508 WPWIKQRSRWIKGYAMTYGVHMRD--PRRLWRELGARRFWGVQIVFLGTLSHLILAPVLW 565 Query: 351 NYFL 354 +Y+L Sbjct: 566 SYWL 569 >UniRef50_B1YJT8 Type II secretion system protein E n=5 Tax=Bacillales RepID=B1YJT8_EXIS2 Length = 554 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 31/136 (22%), Positives = 69/136 (50%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 + LG++LLE V+TE Q++ AL + +LG ++L G ++ +QL +AL Q + Sbjct: 6 KRLGEMLLEESVVTEAQIEEALSVKRTSEKLGDTLLRLGHLTEQQLIEALHHQLKIPVIQ 65 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + + + ++ + +A + ++P+ E + L + D +D +++ L + G + Sbjct: 66 LYNYPVDVAVTKLISKELAQRHTLVPVYREGNRLFIAMADPMDLIAIDDLRLQTGLMIEV 125 Query: 608 VIVLRGQIVTGLRHWY 623 + R +I + +Y Sbjct: 126 GLATRDEIRRTILKYY 141 >UniRef50_Q39Q61 General secretory system II, protein E-like n=2 Tax=Geobacter RepID=Q39Q61_GEOMG Length = 540 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 28/104 (26%), Positives = 56/104 (53%), Gaps = 3/104 (2%) Query: 487 LRP--LGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 +RP LG IL Q+I+E + AL + G R G +++ G+++ E + AL+ Q + Sbjct: 5 VRPGSLGDILFRCQIISENDIRAALDEQQTTGCRFGEALVKLGVVAQEDIDWALSNQLNI 64 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSED 587 + + + + +A +PA++A + ++PL + DE+ + D Sbjct: 65 PYVRLKPTMVDTEAVALIPAALARQHNLIPLIVTGDEISIAIAD 108 >UniRef50_A8ZYV7 Type II secretion system protein E n=11 Tax=Deltaproteobacteria RepID=A8ZYV7_DESOH Length = 577 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 35/128 (27%), Positives = 65/128 (50%), Gaps = 3/128 (2%) Query: 485 RSLRPLGQILLENQVITEEQLD--TALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 R+ + LG++L++ +TEE+L A + R GL+LG ++ +G++S + ++ Q G Sbjct: 11 RTRKKLGEMLVDAGYLTEERLTGYVAAQKR-SGLKLGQFLIREGVVSESMIVDLVSRQAG 69 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + + + L + +V+ Y +PLR N L+V D +D SL A+ + Sbjct: 70 IQRFDPAEFPVTMELAKSLAETVSRKYGAVPLRRGNHLLLVAMTDPLDIRSLDAIEDECD 129 Query: 603 RKVRYVIV 610 +V VI Sbjct: 130 LEVEPVIC 137 >UniRef50_C8W151 Type II secretion system protein E n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W151_DESAS Length = 545 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 36/142 (25%), Positives = 68/142 (47%), Gaps = 1/142 (0%) Query: 488 RPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG+ L++ V+ E+L ALR R +L ++ QG + QL L E + Sbjct: 4 KNLGEFLVDRGVLGREELAAALRAQRGSKKKLEELLVDQGYLQEAQLTPLLGEFFDMPVF 63 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 D + ++A +P VAL + ++P+ L+ ++L + + + V L L R G+++ Sbjct: 64 PADEVRFAPEVLATVPRPVALKHNIIPVALKENDLFIACSEPANSVILENLRRLTGKRLH 123 Query: 607 YVIVLRGQIVTGLRHWYARRRG 628 V++ + LR Y+ G Sbjct: 124 LVLMSSSGLAGVLRQAYSEDTG 145 >UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli subsp. xyli RepID=Q6ACB6_LEIXX Length = 421 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 59/220 (26%), Positives = 88/220 (40%), Gaps = 22/220 (10%) Query: 71 LAIMVPAWNETGVIGNMAELAATTLDY--ENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 +AI+VPAWNE VIG + TLDY E I+V + DT V E R+P Sbjct: 55 VAIVVPAWNEGAVIGASID-RLVTLDYPKEALRIYVVDDASTDDTSVVVRERAMRYPGNV 113 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER 188 + G KA LN+ ++ I A+ ++ DA+ + P LR + Sbjct: 114 FLFRREKGGQGKAHTLNHGIERIF-----ADDWMEALLIMDADVIYQPDSLRKMTRHLAD 168 Query: 189 KDLIQIPVYPFE----REW-THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 + + Y E R + T F S Y+ S+ + + L Q AG S Sbjct: 169 PKVGAVSAYIHEGSADRNYLTKFVSTEYV--LSQPTARR--AQNVLGAQACLAGGAQLHS 224 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIF 283 R + A+ G D +L ED F + +G +F Sbjct: 225 RENLIAI-----GGQVDTSTLAEDTITTFETQLRGKRVVF 259 >UniRef50_C6MU90 Type II secretion system protein E n=1 Tax=Geobacter sp. M18 RepID=C6MU90_9DELT Length = 724 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 34/134 (25%), Positives = 63/134 (47%), Gaps = 1/134 (0%) Query: 490 LGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESID 549 +G IL+ +T EQ+++A R E ++G ++ +GLI+ EQL +ALA + G + + Sbjct: 159 IGDILVNEGFVTREQVESA-RQAGERGKIGSVLIARGLITEEQLLKALASKFGSRFVDLS 217 Query: 550 AWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVI 609 +A + + VLPL+L+ +L V + + D + L + V+ Sbjct: 218 EVTPTPEALAVLQKQTVVRMQVLPLQLQGRKLTVATSEPTDLGIMDNLRFITNHHIELVV 277 Query: 610 VLRGQIVTGLRHWY 623 QI + +Y Sbjct: 278 SGSRQIAAAIDRYY 291 >UniRef50_B9M3F1 General secretory system II protein E domain protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M3F1_GEOSF Length = 383 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 36/133 (27%), Positives = 65/133 (48%), Gaps = 2/133 (1%) Query: 493 ILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESID-A 550 +LL +I +EQ + AL+NRV G ++G S++ G + E LA+ L ++ V + D Sbjct: 8 MLLNAGLINKEQFEEALKNRVLYGGKIGTSLIELGYLKEEDLARFLGKKLAVPFVGADRL 67 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIV 610 I +I +P +AL Y V+P+ + L + D D ++ L+ G + V Sbjct: 68 LNISPEIIELIPKELALTYGVIPIHRDKKRLYLVMSDPADLKAIDELSFTTGFIINPVAA 127 Query: 611 LRGQIVTGLRHWY 623 +++ L +Y Sbjct: 128 PELRLMQALGKYY 140 >UniRef50_Q1D3E1 General secretion pathway protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D3E1_MYXXD Length = 338 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 38/148 (25%), Positives = 76/148 (51%), Gaps = 11/148 (7%) Query: 488 RPLGQILLENQVITEEQLDTAL--RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 + +G++L+E V+TEEQ+ AL R RLG ++ QGL + +AQAL+ Q+ + + Sbjct: 3 KKIGELLVEAGVVTEEQVRVALGRRGAFGSHRLGEVLVAQGLCTPTHIAQALSAQHALPF 62 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLE----NDELIVGSEDGIDPVSLAALTRKV 601 ++ +IP+++ + + +LP R+E ++ ++V +D D + L ++ Sbjct: 63 VAL-PEEIPANVAGLVSVDFQSEHRILPFRMEVEGRSERILVAVDDPADVTLVDELRFQL 121 Query: 602 GRKVRYVIVLRGQIVTGLRHWYARRRGH 629 +++R + + L AR RG Sbjct: 122 RKQMRVFVAASDDLDAAL----ARARGE 145 >UniRef50_A6Q5B1 General secretory pathway protein E n=3 Tax=Epsilonproteobacteria RepID=A6Q5B1_NITSB Length = 560 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 31/118 (26%), Positives = 63/118 (53%), Gaps = 3/118 (2%) Query: 490 LGQILLENQVITEEQLDTALRNRVE---GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 LG +L++ +ITEEQL+ AL+ + E +LG +L +G ++ + L +AL++Q + + Sbjct: 8 LGDLLVKEGLITEEQLEQALKLQKEYGYTKKLGQILLEEGYVTQKDLLKALSKQLHLEFV 67 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + +I ++ P + +P + ++D L V + D ++ +L L R + K Sbjct: 68 DLYGEKIDFEKLSRYPLNTLKAAKAIPFKEDDDYLYVATSDPLNYEALELLERTIAMK 125 >UniRef50_Q7UEJ7 Probable general secretion pathway protein E n=1 Tax=Rhodopirellula baltica RepID=Q7UEJ7_RHOBA Length = 1283 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 35/144 (24%), Positives = 68/144 (47%), Gaps = 2/144 (1%) Query: 494 LLENQVITEEQLDTALRNRVEGLRLGGSMLMQ-GLISAEQLAQALAEQNGVAWESIDAWQ 552 ++ ++I+E Q D + E + ++L G + + +++ALAE G + +D Sbjct: 534 FVDRELISESQADHVMEAASECGKPYFTLLQDYGYAADDDMSRALAEIYGYQFVDLDNLS 593 Query: 553 IPSSLIAEMPASVALHYAVLPLRLEND-ELIVGSEDGIDPVSLAALTRKVGRKVRYVIVL 611 I ++I P S+A V+P+R + D L+ + ID ++ L + R + V+ Sbjct: 594 INEAIIELCPESIARENTVIPIREDFDGNLVFAMSNPIDLETIEKLRFILNRHIETVLAT 653 Query: 612 RGQIVTGLRHWYARRRGHDPRAML 635 IV + H+Y + G +ML Sbjct: 654 PDAIVEAINHFYGQIEGESADSML 677 >UniRef50_B2A7G1 Type II secretion system protein E n=5 Tax=Firmicutes RepID=B2A7G1_NATTJ Length = 571 Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust. Identities = 37/121 (30%), Positives = 66/121 (54%), Gaps = 1/121 (0%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG +LLE+ ITEE L AL ++ + G +LG S++ G+I+ E++ + L Q G+ S+ Sbjct: 9 LGDLLLESGAITEEDLKQALDHQNKSGQKLGASLVDLGIITEEEIIEVLEFQLGIPHVSL 68 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + +PA +A Y VLP+ + +L++ D ++ V++ + G +V V Sbjct: 69 SQYDTNRETATLIPAYLAERYQVLPIDNRSGKLVLAMGDPLNVVAIDDVKMATGMEVEPV 128 Query: 609 I 609 I Sbjct: 129 I 129 >UniRef50_B4UHA9 General secretory system II protein E domain protein n=2 Tax=Anaeromyxobacter RepID=B4UHA9_ANASK Length = 499 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 11/148 (7%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQN 541 D R LG++LLE VI QL +AL ++ + G+RLG +++ L + QAL+ + Sbjct: 2 DAIGKRRLGELLLEAGVIDATQLQSALGHQRQWGVRLGQALVDLKLAGEADIVQALSRKY 61 Query: 542 GVAWESIDAWQIPSSL---IAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 G +DA + P +L + +P AL V PL + L V D P +LA + Sbjct: 62 GYEVARLDALE-PYALELALRLVPREFALRNNVFPLGADTGTLAVAMSD---PTNLAVVD 117 Query: 599 R---KVGRKVRYVIVLRGQIVTGLRHWY 623 + GRKV+ I +I +R Y Sbjct: 118 ELRFRTGRKVKVCIGGDREIAAAVRDRY 145 >UniRef50_B8JAJ3 General secretory system II protein E domain protein n=1 Tax=Anaeromyxobacter dehalogenans 2CP-1 RepID=B8JAJ3_ANAD2 Length = 506 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 11/148 (7%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQN 541 D R LG++LLE VI QL +AL ++ + G+RLG +++ L + QAL+ + Sbjct: 2 DAIGKRRLGELLLEAGVIDATQLQSALGHQRQWGVRLGQALVDLKLAGEADIVQALSRKY 61 Query: 542 GVAWESIDAWQIPSSL---IAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 G +DA + P +L + +P AL V PL + L V D P +LA + Sbjct: 62 GYEVAHLDALE-PYALELALRLVPREFALRNNVFPLGADTGTLAVAMSD---PTNLAVVD 117 Query: 599 R---KVGRKVRYVIVLRGQIVTGLRHWY 623 + GRKV+ I +I +R Y Sbjct: 118 ELRFRTGRKVKVCIGGDREIAAAVRDRY 145 >UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6 Tax=Bacteria RepID=Q2JNJ5_SYNJB Length = 764 Score = 45.4 bits (106), Expect = 0.009, Method: Compositional matrix adjust. Identities = 105/452 (23%), Positives = 174/452 (38%), Gaps = 88/452 (19%) Query: 54 RYPRMSYRELYKPDEKPL---AIMVPAWNETGVIGNMAELAATTLDY--ENYHIFVGTYP 108 R+ +++ E+ D++ L I+VP + E V+ + + + + LDY E + + Sbjct: 365 RFHQITDEEVAALDDRDLPIYTILVPVYKEPEVMPILIK-SLSKLDYPHERLDVLILLEE 423 Query: 109 NDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILH 168 ND DT A+ P +++ P SK + F R ++ Sbjct: 424 NDRDTIEAAR--AAKPPRYVRLLLV---PDSKPKTKPKACNYGLAFARGEYLT-----IY 473 Query: 169 DAEDVISPMELRLFNYLVERKD----LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVP 224 DAED+ P +L+ + D +Q + F R T M + E+S +P Sbjct: 474 DAEDIPDPDQLKKAVIAFRKGDPSLVCVQAALNYFNRSENFLTRM-FTLEYSYWFDYLLP 532 Query: 225 VREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFV 284 E L +P G F T L + G +D ++TED D+G R + G T + Sbjct: 533 GLETLRMPIPLGGTSNHFR----TDRLRELQG--WDPFNVTEDADLGIRASQHGYTVGVI 586 Query: 285 RFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKW 344 +EA V+ + +RQ+SRWI G + Q + H Sbjct: 587 NSTTYEEAN----------------CAVKNW--------IRQRSRWIKGYM-QTWLVHNR 621 Query: 345 TSSLTLNYFLWRDRKGAISNFV--------SFLAMLVMIQLLLLLAYESLWPDAWHFLSI 396 +L RK + N++ SF L + LL Y L W ++ Sbjct: 622 NPLRSL-------RKLGLKNWLSYQFFIGGSFFTFLTSPIMWLLFIYWLLTRAHW-LQNL 673 Query: 397 FSGSAWLMTLLWLNF------GLMVNRIVQRVIFVTGYYGLT-QGLLSVLRLFWGNLINF 449 F +WL+ L N G+ +N + +F GYY L LL+ ++W ++ Sbjct: 674 F--PSWLVYLGLFNLLVGNAIGIYLNLV---AVFRRGYYDLAFYALLN--PIYWQ--LHS 724 Query: 450 MANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 MA + AL Q+ + W+KT H T Sbjct: 725 MAAYMALWQLFT----KPFYWEKTIHGLSKFT 752 >UniRef50_Q1D416 General secretory system II protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D416_MYXXD Length = 431 Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust. Identities = 48/162 (29%), Positives = 83/162 (51%), Gaps = 6/162 (3%) Query: 490 LGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG+ LL++ ++T E L+ AL +V G RLG +++ GL+S LA+AL + + A+ S Sbjct: 3 LGEQLLKDGLVTAEGLEEALEAQVVHGGRLGTNLVELGLLSEVDLAKALGKVHNSAFAS- 61 Query: 549 DAWQIPSSLIAEMPASV-ALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 +P E+ +S A LP+R++ L + + D +L A+ K G++V Sbjct: 62 -GEMVPDPKAMELVSSNHADDKEYLPMRVDATRLSIAVVNPHDFSTLDAIAFKTGKRVVP 120 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQA 649 V++ ++ LR + R RA+ NAV+ + QA Sbjct: 121 VVIPEFRMNQLLRRYCKAFRPL--RAVDMNAVRPRPSAGSQA 160 >UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaproteobacteria RepID=A5FY21_ACICJ Length = 642 Score = 45.1 bits (105), Expect = 0.012, Method: Compositional matrix adjust. Identities = 82/335 (24%), Positives = 130/335 (38%), Gaps = 66/335 (19%) Query: 166 ILHDAEDVISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGK 221 ++ DAED P +LR LFN +Q + F R+ T M + E+S+ Sbjct: 343 VIFDAEDSPEPDQLRKVVALFNASGPEVACVQARLNYFNRDDNFLTRM-FTLEYSQWFDY 401 Query: 222 DVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTE 281 +P L +P G F + L A+D ++TED D+G RL + G Sbjct: 402 LLPGLYRLNIPIPLGGTSNHFRTEVLHEL------GAWDPYNVTEDADLGIRLTQAGY-- 453 Query: 282 IFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKT 341 R VV+ E L + + Q+SRWI G + Sbjct: 454 ---RVAVVNSTTFEEANGVLH-------------------SWINQRSRWIKGYM------ 485 Query: 342 HKWTSSLTLNYFLWRDRKGAIS-----NFVSFLAMLVMIQLLLLLAY-ESLWPDAWHFLS 395 W + L+R R G + F+ F M +I LL + + S+ Sbjct: 486 QTWLVHMRRPVELYR-RLGPVGFLGFHMFIGFPPMTALINPLLWIMFLVSVIVGRSAVAG 544 Query: 396 IFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTG-----YYGLTQ-GLLSVLRLFWGNLINF 449 F G ++ L F LMV + + +YGL GLL+ +W +++ Sbjct: 545 FFPGPVLVLAL----FDLMVGNAMYVYFNIVAVAKRRWYGLVPWGLLA--PAYW--VLHS 596 Query: 450 MANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDT 484 +A ++AL Q++ + W+KTTH S T +T Sbjct: 597 VAAYKALLQLITNPH----YWEKTTHGTSSRTQET 627 >UniRef50_C1AA14 Type IV pilus assembly protein PilB n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AA14_GEMAT Length = 634 Score = 45.1 bits (105), Expect = 0.012, Method: Compositional matrix adjust. Identities = 34/136 (25%), Positives = 69/136 (50%), Gaps = 6/136 (4%) Query: 465 PRRVA-WDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGS 521 PR + + T P+ +RS LG +L+ +++ E L AL+ + G RLG + Sbjct: 11 PRTCSDGNHVTMAAPAAPLASRSTDRLGDLLVREGLLSRENLTKALQEQSAYPGQRLGLT 70 Query: 522 MLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDEL 581 ++ G++ ++ + LA Q + + +++ + L+ +PA +A + VLPL+ + +L Sbjct: 71 VVRLGMVPETEVVRMLARQYRMPAVDLARFEVDTRLLKLIPAELASKHTVLPLKRDGRQL 130 Query: 582 IVGSEDGIDPVSLAAL 597 V DP ++A + Sbjct: 131 TVAIA---DPTAMAVV 143 >UniRef50_Q1AWU6 Type II secretion system protein E n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWU6_RUBXD Length = 572 Score = 44.3 bits (103), Expect = 0.018, Method: Compositional matrix adjust. Identities = 37/120 (30%), Positives = 56/120 (46%), Gaps = 4/120 (3%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLR-LGGSMLMQGLISAEQLAQAL 537 TG R + +LL +TEEQL A+ + R LG ++ G +SAE+LA+A Sbjct: 6 GTTGGRERNRSVWSLLLSEGSLTEEQLHRAVEAQKHDPRDLGQILVSLGYVSAEELARAR 65 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 A + G+ + + + + +P V + LPLRLE L+ DP L AL Sbjct: 66 ARRLGLGYLEPSERDVDPAALGLVPERVLRRHRALPLRLEEGRLVAAL---ADPTDLQAL 122 >UniRef50_A6M148 Type II secretion system protein E n=18 Tax=Clostridiaceae RepID=A6M148_CLOB8 Length = 564 Score = 44.3 bits (103), Expect = 0.018, Method: Compositional matrix adjust. Identities = 31/115 (26%), Positives = 57/115 (49%), Gaps = 6/115 (5%) Query: 488 RPLGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 R LG IL+ IT QL AL++ R G +LG +L +I+ E + +A+ +Q G+ Sbjct: 6 RRLGNILVNAGKITGYQLQEALKSQRTLGKKLGEILLDSKIITEEDIIEAIEQQTGIKKV 65 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSED-----GIDPVSLAA 596 ++ I +P ++ Y ++P +N+++ V D ID V+++ Sbjct: 66 DLNTINFDRKAITLIPQNLCDKYLLIPFGFDNNKIKVALADPLNIFAIDDVAIST 120 >UniRef50_Q08Q27 Serine/threonine kinase PKN11 n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08Q27_STIAU Length = 990 Score = 43.9 bits (102), Expect = 0.021, Method: Compositional matrix adjust. Identities = 31/102 (30%), Positives = 57/102 (55%), Gaps = 6/102 (5%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL--RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 R+ R +G IL+ ++ E L+ AL + R+ G +LG ++ + L+ AE+L +AL+EQ+G Sbjct: 376 RAGRRIGDILVARGMLPPEALEQALTLQKRLGG-KLGQVLVGERLLEAEELVRALSEQSG 434 Query: 543 ---VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDEL 581 ++ E + +P+ LI +P + +P+ N EL Sbjct: 435 MPHISGERLQTMPVPAELIRLLPMEMCEKLCAVPVAQRNREL 476 >UniRef50_D2LBM7 Glycosyl transferase family 2 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LBM7_RHOVA Length = 643 Score = 43.9 bits (102), Expect = 0.024, Method: Compositional matrix adjust. Identities = 71/285 (24%), Positives = 118/285 (41%), Gaps = 37/285 (12%) Query: 8 FATWLYGLKVIAITLA----VIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 FAT GL + A+ A + + + L FF+ + R +++ P+ L Sbjct: 204 FATVAVGLVLGAVAFAPAETLTLASAMLSIFFLLTI--ALRAAAAVNIALPRPKAKEARL 261 Query: 64 YKPDEKP-LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 E P ++VP + ET ++ ++A A ++DY + + D RD E Sbjct: 262 LGDAELPRYTVLVPLYRETAILPHLAH-ALASIDYPAAKLDIKIVLEASD--RDTIEAAQ 318 Query: 123 R--FP-NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFI-LHDAEDVISPME 178 + FP NV VV P +K LN L +FA F+ ++DAED P + Sbjct: 319 KLAFPGNVDLVVVPDREPRTKPKALNYAL----------HFASGEFVVIYDAEDRPEPDQ 368 Query: 179 LRLFNYLVERK--DLIQIPV---YPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 LR + + DL+ + Y RE ++ S + E++ L +P+ + Sbjct: 369 LRKAATVFAQAPADLVCLQARLDYYNARE--NWLSRQFTIEYATLFRGLLPLLARFRLPL 426 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKG 278 P G F A+ + A+D ++TED D+G RL G Sbjct: 427 PLGGTSNHFRAAALREI------GAWDPYNVTEDADLGMRLARAG 465 >UniRef50_C3RLB0 Type II secretion system protein E n=2 Tax=Bacteria RepID=C3RLB0_9MOLU Length = 563 Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust. Identities = 31/148 (20%), Positives = 73/148 (49%), Gaps = 8/148 (5%) Query: 482 GDTRSLR--PLGQILLENQVITEEQLDTALR----NRVEGLRLGGSMLMQGLISAEQLAQ 535 G T+ +R P+G++L E I +EQL+ AL NR + RLG ++ G +S Q+ + Sbjct: 3 GRTKYMRNIPIGEVLKEYGYINDEQLNVALEAQKSNRSK--RLGQHLIDLGFVSEYQMLE 60 Query: 536 ALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA 595 AL+++ + ++ + ++P ++A Y ++ + L + +L + + D ++ + Sbjct: 61 ALSDKLAEPLIELSEIKVDIDAVQKIPRAMADKYNIIAIDLTDQQLTIVTSDPLNFYGIE 120 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWY 623 + G + + + ++ + +Y Sbjct: 121 DVRLVTGMHLNVCLATKAEVSKAIDRYY 148 >UniRef50_Q098P0 Putative ATPase n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q098P0_STIAU Length = 365 Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust. Identities = 43/134 (32%), Positives = 65/134 (48%), Gaps = 7/134 (5%) Query: 484 TRSLRP--LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQ 540 RS+R LG +L ++ E QL AL + G LG ++ G +A+Q+ + LA Q Sbjct: 20 ARSMRKKRLGDLLQAAGLVDELQLRAALGFHHKWGTPLGQVVVDLGFCTAQQVLELLANQ 79 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE---LIVGSEDGIDPVSLAAL 597 + +DA + L+ +P VA V+PLR E L+V + DPV+L + Sbjct: 80 AQLPMVDLDAEMLDPQLVEVLPVRVAESCRVIPLRQEGPRDSVLVVATAAPGDPVALDEV 139 Query: 598 TRKVGRKVRYVIVL 611 R G K R V +L Sbjct: 140 ARLTG-KTRVVTLL 152 >UniRef50_Q01NU4 Type II secretion system protein E (GspE) n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01NU4_SOLUE Length = 566 Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust. Identities = 36/136 (26%), Positives = 64/136 (47%), Gaps = 1/136 (0%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG+IL+E I E L+ AL ++E G +LG ++ GLI+ L AL++Q GV ++ Sbjct: 24 LGEILIERGKIDAEDLERALELQLERGDKLGKIVVDMGLIAQRDLLSALSDQMGVPLIAV 83 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 D + I + P+ L + L + D +D ++AA+ G +V+ Sbjct: 84 DGTPPNAPEIEGLSQRFLRQCRAFPVALNDSVLTIAMADPMDFETIAAVRAFSGLQVQTA 143 Query: 609 IVLRGQIVTGLRHWYA 624 + +I+ + Y Sbjct: 144 LASEQEILDAIDRNYG 159 >UniRef50_A6C0T8 Type IV fimbrial assembly protein PilB n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C0T8_9PLAN Length = 345 Score = 43.5 bits (101), Expect = 0.030, Method: Compositional matrix adjust. Identities = 35/148 (23%), Positives = 67/148 (45%), Gaps = 3/148 (2%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 + + QIL+E I ++Q L+ E GL + + ++AE QA+A G+ + Sbjct: 100 KSVEQILVEQGHIDKDQA-AELKEFAEKRGLTTRDAAVQMRFVNAETATQAMARSKGMPY 158 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 ++ + ++ ++P +A +LPL +++D L+V D L + Sbjct: 159 IDLEETIPDNGILLQLPQQMAKRNTILPLFIDDDMLLVACADQPTHELEDDLRMRYQVPA 218 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRA 633 R+V+ + I TG+ +YA D A Sbjct: 219 RWVLAMPRSINTGITKYYAAAEEQDDEA 246 >UniRef50_C7IFY8 General secretory system II protein E domain protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IFY8_9CLOT Length = 648 Score = 43.5 bits (101), Expect = 0.031, Method: Compositional matrix adjust. Identities = 35/120 (29%), Positives = 61/120 (50%), Gaps = 8/120 (6%) Query: 490 LGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG++LL+ ++I EQL+ L ++ + G+RLG S+L G I L LA Q+ + + + Sbjct: 344 LGKLLLDRELIKPEQLEIGLYHQKKFGIRLGESLLALGYIDETGLYSTLASQSAIDYYEL 403 Query: 549 --DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIV-----GSEDGIDPVSLAALTRKV 601 + ++ + I ++ A +PL + +DE +V S +GI V RKV Sbjct: 404 NPEKEKVDTKWIDKLSVRQAKALMAIPLGVSSDERLVIACSQTSREGITDVLQEIFNRKV 463 >UniRef50_A1AV36 General secretory system II, protein E domain protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AV36_PELPD Length = 368 Score = 43.1 bits (100), Expect = 0.036, Method: Compositional matrix adjust. Identities = 29/93 (31%), Positives = 49/93 (52%), Gaps = 2/93 (2%) Query: 500 ITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAW-ESIDAWQIPSSL 557 IT QL+ AL ++ G++LG ++ G + L +AL+ + GV + + IP L Sbjct: 15 ITNTQLEEALESQAGRGIKLGSALFELGYVEENALGRALSAKLGVPFVGRSELSSIPGDL 74 Query: 558 IAEMPASVALHYAVLPLRLENDELIVGSEDGID 590 I + S+A+ Y V+P +LE + L + D D Sbjct: 75 IRDFSRSMAVKYNVMPFKLERNRLGLAMSDPND 107 >UniRef50_B0T1N0 Putative uncharacterized protein n=1 Tax=Caulobacter sp. K31 RepID=B0T1N0_CAUSK Length = 492 Score = 43.1 bits (100), Expect = 0.037, Method: Compositional matrix adjust. Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 64/338 (18%) Query: 14 GLKVIAITLAVIMFISGLDD-------FFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 GL V A+T+ V + I FF+ + IK + R P ++ L Sbjct: 57 GLAVFALTVIVAVIIEPRTTMEAFHLLFFVGFMA-NSMIKLAAACTPRRPGVA-PSLPDE 114 Query: 67 DEKPLAIMVPAWNETGVIGNMAELAATTLDY--ENYHIFVGTYPNDPDTQRDVDEVCARF 124 D ++VP + E V + L LDY + + + ND +TQ + Sbjct: 115 DLPGYTLIVPLYREASVAAELV-LNLARLDYPRDRLQVLIVLEANDHETQAAFAAL--DL 171 Query: 125 PNVHKVVCARPG-PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL-- 181 P +V+ A PG P +K N L ER+ +++DAED P +LR Sbjct: 172 PVGFQVLIAPPGTPQTKPRACNIAL------ERAHG---EMVVIYDAEDAPHPAQLREAA 222 Query: 182 --FNYLVERKDLIQIP--VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 F R +Q P + P R F + E++ L +P P G Sbjct: 223 AGFAAGDRRLACLQAPLRIEPDPR----FLPDQFALEYAVLFEVFLPALARWRLPFPLGG 278 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F AV A+ +D ++TED DIGFRL +G + P + A Sbjct: 279 TSNHFRTEAVRAV------GGWDSYNVTEDADIGFRLAARGYQLDVITCPTFETA----- 327 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIV 335 P T T + Q++RWI G V Sbjct: 328 -------------------PTTMKTWIPQRARWIKGHV 346 >UniRef50_A3UGE7 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UGE7_9RHOB Length = 523 Score = 43.1 bits (100), Expect = 0.039, Method: Compositional matrix adjust. Identities = 57/212 (26%), Positives = 82/212 (38%), Gaps = 50/212 (23%) Query: 129 KVVCARP-GPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE 187 +VV P GP +K LN L Q R A ++DAED P +LR Sbjct: 169 RVVVLPPLGPMTKPRALNVAL----QTARGELVA-----VYDAEDAPHPDQLRQAAECFA 219 Query: 188 RKD---LIQIPVYPFEREWTHFTS---MTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 D +IQ P+ + R T+ + Y +F+ L +P+ L +P G Sbjct: 220 ADDRLGIIQAPLGWYNRTENWLTAQFALEYATQFNAL----LPLLARLGWPLPLGGTSNI 275 Query: 242 FSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFL 301 F + +AL+A G F+V TED D+GFR+ G V ++EA Sbjct: 276 FRQ---SALVACGGWDPFNV---TEDADLGFRMARSGWRAGLVAPGTLEEA--------- 320 Query: 302 QHARTSNMICVREYFPDTFSTAVRQKSRWIIG 333 P T Q+SRW+ G Sbjct: 321 ---------------PITLRAWTHQRSRWLKG 337 >UniRef50_B1Y3Z9 General secretory system II protein E domain protein n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y3Z9_LEPCP Length = 1017 Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust. Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 32/129 (24%) Query: 491 GQILLENQVITEEQLDTAL--RNRVEGLRLG----------------------------- 519 G+ L+E Q++T EQL TAL + R+ +RLG Sbjct: 227 GEQLVERQIVTPEQLLTALDKQARMPSVRLGEALVALGYLTDKQLQEALQLQRTDRVQPL 286 Query: 520 GSMLMQ-GLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN 578 G +L++ GL+ EQL ALA + G + + + +LI +PA A VLPL Sbjct: 287 GELLVEKGLVEGEQLRIALARKMGYPVVDVAGFPVDPALIPLLPAPAARRLQVLPLMRRG 346 Query: 579 DELIVGSED 587 L+V D Sbjct: 347 GRLVVAMHD 355 >UniRef50_Q08MP9 Gspii_e N-terminal domain family (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08MP9_STIAU Length = 625 Score = 43.1 bits (100), Expect = 0.044, Method: Compositional matrix adjust. Identities = 31/96 (32%), Positives = 56/96 (58%), Gaps = 2/96 (2%) Query: 490 LGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG +L+ +IT+ QLD L+ + + G RLG +++ + E+L + L+EQ+ +I Sbjct: 5 LGALLVRKGLITQTQLDEGLKAQMIYGGRLGTNLVELEFLDIEKLGEVLSEQSRYPQATI 64 Query: 549 DAWQ-IPSSLIAEMPASVALHYAVLPLRLENDELIV 583 ++ + + +A +PA++A +AV PL LE L V Sbjct: 65 QEFEAVTVATLATVPAALAEKHAVFPLHLEGRRLKV 100 >UniRef50_C6MLR2 Type II secretion system protein E n=1 Tax=Geobacter sp. M18 RepID=C6MLR2_9DELT Length = 748 Score = 42.0 bits (97), Expect = 0.089, Method: Compositional matrix adjust. Identities = 38/147 (25%), Positives = 73/147 (49%), Gaps = 19/147 (12%) Query: 490 LGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 +G IL+E+ ++T E ++ A ++ + + L++G ++M+GLI+ EQL ALA + + + + Sbjct: 180 VGDILVESGLVTRELVEAAFKSQKGKKLQVGELLIMKGLITEEQLLSALATKFRLRFVDL 239 Query: 549 DAWQIPS-SLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + IPS + + + +A V P+ LE L+V + P L +G +R+ Sbjct: 240 ETV-IPSDAALNAISEGLASRLKVFPISLEGRTLVVAT---CAPTDLT-----IGDNLRF 290 Query: 608 --------VIVLRGQIVTGLRHWYARR 626 V+ QI + +Y R Sbjct: 291 STNFATELVVAPSRQIAAAIEKYYRNR 317 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AFA6 Bacteriophage N4 adsorption protein B n=53 Tax=P... 984 0.0 UniRef50_B7LP59 Bacteriophage N4 receptor, inner membrane subuni... 951 0.0 UniRef50_B3GN83 Bacteriophage N4 adsorption NfrB-like protein n=... 780 0.0 UniRef50_B4SJA4 General secretory system II protein E domain pro... 633 e-180 UniRef50_A5P922 Bacteriophage N4 receptor, inner membrane subuni... 626 e-178 UniRef50_B2UJM9 General secretory system II protein E domain pro... 617 e-175 UniRef50_C6N7W0 Putative uncharacterized protein n=1 Tax=Legione... 508 e-142 UniRef50_Q136N6 Bacteriophage N4 adsorption protein B n=1 Tax=Rh... 478 e-133 UniRef50_Q01UW7 Bacteriophage N4 receptor, outer membrane protei... 459 e-127 UniRef50_Q1N778 Bacteriophage N4 adsorption protein B n=1 Tax=Sp... 447 e-124 UniRef50_B8JA61 Bacteriophage N4 adsorption protein B n=2 Tax=An... 446 e-123 UniRef50_C8WHH7 General secretory system II protein E domain pro... 444 e-123 UniRef50_Q1IXI4 Glycosyltransferase, NfrB-like protein n=2 Tax=D... 416 e-114 UniRef50_D2LI28 General secretion pathway protein E n=1 Tax=Rhod... 406 e-111 UniRef50_B8ICV2 General secretion pathway protein E n=2 Tax=Meth... 405 e-111 UniRef50_Q2G4Z3 Bacteriophage N4 adsorption protein B n=2 Tax=Sp... 399 e-109 UniRef50_Q1GVK0 Bacteriophage N4 adsorption protein B n=1 Tax=Sp... 371 e-101 UniRef50_Q0ASE8 Glycosyl transferase, family 2 n=1 Tax=Maricauli... 264 9e-69 UniRef50_A9DFB5 Putative uncharacterized protein n=1 Tax=Hoeflea... 263 1e-68 UniRef50_Q0G6A6 Glycosyl transferase, family 2 n=1 Tax=Fulvimari... 260 1e-67 UniRef50_Q1YKP1 Putative glycosyl transferase n=1 Tax=Aurantimon... 258 4e-67 UniRef50_C4DGM8 Glycosyl transferase n=1 Tax=Stackebrandtia nass... 258 5e-67 UniRef50_A1SEL1 General secretory system II, protein E domain pr... 258 7e-67 UniRef50_Q13CF3 Glycosyl transferase, family 2 n=10 Tax=Bradyrhi... 257 9e-67 UniRef50_A3VPZ4 Putative uncharacterized protein n=2 Tax=Bacteri... 234 1e-59 UniRef50_UPI0001B50A66 glycosyl transferase family protein n=1 T... 220 1e-55 UniRef50_Q2NDA0 Probable inner membrane transmembrane protein n=... 216 4e-54 UniRef50_A1TEN6 Glycosyl transferase, family 2 n=7 Tax=Actinomyc... 215 4e-54 UniRef50_C6PR94 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 211 6e-53 UniRef50_C6J4S3 Glycosyl transferase n=1 Tax=Paenibacillus sp. o... 210 2e-52 UniRef50_C8WIR5 Type II secretion system protein E n=2 Tax=Bacte... 208 5e-52 UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=F... 200 2e-49 UniRef50_C6CY56 Type II secretion system protein E n=3 Tax=Bacil... 197 1e-48 UniRef50_B8D2C7 Tfp pilus assembly protein PilB n=5 Tax=Firmicut... 193 2e-47 UniRef50_C4L4L3 Type II secretion system protein E n=1 Tax=Exigu... 192 4e-47 UniRef50_A0LUI4 Type II secretion system protein E n=4 Tax=Bacte... 191 8e-47 UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophi... 189 2e-46 UniRef50_Q1IRV6 Type II secretion system protein E n=24 Tax=Bact... 187 1e-45 UniRef50_B8FQB2 Type II secretion system protein E n=4 Tax=Clost... 187 1e-45 UniRef50_B5YDI2 Type IV pilus assembly protein PilB n=20 Tax=Bac... 184 1e-44 UniRef50_A3DEG0 Type II secretion system protein E n=5 Tax=Clost... 178 5e-43 UniRef50_B0MQV8 Putative uncharacterized protein n=2 Tax=Clostri... 178 1e-42 UniRef50_A3DDQ8 Type II secretion system protein E n=3 Tax=Clost... 177 1e-42 UniRef50_D1B5J6 Type II secretion system protein E n=3 Tax=Syner... 177 1e-42 UniRef50_B0TEE9 Type ii secretion system protein e, putative n=1... 177 1e-42 UniRef50_B0MCL3 Putative uncharacterized protein n=1 Tax=Anaeros... 175 5e-42 UniRef50_D1BGY1 Type II secretion system protein E (GspE) n=3 Ta... 172 4e-41 UniRef50_B9Y801 Putative uncharacterized protein n=1 Tax=Holdema... 171 7e-41 UniRef50_Q181B0 Type IV pilus assembly protein n=5 Tax=Clostridi... 171 1e-40 UniRef50_B1I3E7 Type II secretion system protein E n=4 Tax=Clost... 169 2e-40 UniRef50_C8W5J2 Type II secretion system protein E n=2 Tax=Clost... 168 7e-40 UniRef50_B8E2U7 Type II secretion system protein E n=2 Tax=Dicty... 168 8e-40 UniRef50_UPI0001C3693E pili biogenesis protein PilB-like ATPase ... 167 1e-39 UniRef50_C6XAP3 Type II secretion system protein E n=1 Tax=Methy... 166 2e-39 UniRef50_Q7UE44 General secretion pathway protein E n=1 Tax=Rhod... 163 3e-38 UniRef50_B9M6X5 General secretory system II protein E domain pro... 162 4e-38 UniRef50_C1XWL4 Type II secretory pathway, ATPase PulE/Tfp pilus... 162 4e-38 UniRef50_Q1J1R8 Tfp pilus assembly pathway, ATPase PilB n=7 Tax=... 162 6e-38 UniRef50_Q1D9E1 General secretory pathway protein E n=17 Tax=Pro... 161 1e-37 UniRef50_D2R471 Type II secretion system protein E n=6 Tax=Planc... 160 2e-37 UniRef50_A8URM8 Putative uncharacterized protein n=3 Tax=Hydroge... 160 2e-37 UniRef50_Q0F0L3 Putative uncharacterized protein n=1 Tax=Maripro... 159 3e-37 UniRef50_B5E974 General secretory system II protein E domain pro... 158 8e-37 UniRef50_Q39ZG4 General secretory system II, protein E-like n=2 ... 157 1e-36 UniRef50_C6E8N7 General secretory system II protein E domain pro... 157 2e-36 UniRef50_D2R473 Type II secretion system protein E n=6 Tax=Planc... 155 5e-36 UniRef50_B0VJ41 Type IV pilus biogenesis protein PilB n=1 Tax=Ca... 154 9e-36 UniRef50_Q3SKS0 Pilus assembly pathway ATPase PilB n=3 Tax=Prote... 154 1e-35 UniRef50_B5YIG2 Type IV-A pilus assembly ATPase PilB n=4 Tax=Bac... 154 1e-35 UniRef50_C0QER1 PilB n=10 Tax=Proteobacteria RepID=C0QER1_DESAH 154 1e-35 UniRef50_C6E6L0 Response regulator receiver protein n=3 Tax=Geob... 153 3e-35 UniRef50_B3E1K8 Response regulator receiver protein n=5 Tax=Geob... 153 3e-35 UniRef50_C6E323 Response regulator receiver protein n=3 Tax=Geob... 151 8e-35 UniRef50_C0QQ17 Type IV pilus assembly protein TapB n=2 Tax=Bact... 151 1e-34 UniRef50_B3E4P2 Response regulator receiver protein n=1 Tax=Geob... 151 1e-34 UniRef50_Q1Q109 Strongly similar to general secretory system typ... 151 1e-34 UniRef50_B5E8D1 General secretory system II protein E domain pro... 150 2e-34 UniRef50_A5G3U1 General secretory system II, protein E domain pr... 148 1e-33 UniRef50_A9G5Y5 Putative uncharacterized protein n=1 Tax=Sorangi... 145 5e-33 UniRef50_A5GB67 Response regulator receiver protein n=4 Tax=Geob... 145 7e-33 UniRef50_Q1D3E0 General secretion pathway protein E, N-terminal ... 144 1e-32 UniRef50_A1ALE6 General secretory system II, protein E domain pr... 139 3e-31 UniRef50_A9FI10 Family membership n=1 Tax=Sorangium cellulosum '... 139 4e-31 UniRef50_Q094V5 General secretion protein E N-terminal domain pr... 139 5e-31 UniRef50_Q3A899 Type II secretory pathway and PulE/Tfp pilus ass... 136 4e-30 UniRef50_Q08RH0 Gspii_e N-terminal domain family n=1 Tax=Stigmat... 134 1e-29 UniRef50_B5YHZ6 Type IV pilin n=1 Tax=Thermodesulfovibrio yellow... 127 1e-27 UniRef50_Q1D133 General secretion pathway protein E, N-terminal ... 113 2e-23 Sequences not found previously or not previously below threshold: UniRef50_D0B518 Glycosyl transferase, family 2 n=36 Tax=Brucella... 229 3e-58 UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaprot... 227 1e-57 UniRef50_Q7D1E2 Glycosyltransferase n=1 Tax=Agrobacterium tumefa... 219 3e-55 UniRef50_A8IJY6 Putative glycosyltransferase n=1 Tax=Azorhizobiu... 217 1e-54 UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6... 216 2e-54 UniRef50_B6JD78 Glycosyl transferase, family 2 n=1 Tax=Oligotrop... 215 6e-54 UniRef50_Q11MF6 Glycosyl transferase, group 2 family protein n=1... 215 7e-54 UniRef50_B0RVF2 Glycosyltransferase n=8 Tax=Proteobacteria RepID... 213 2e-53 UniRef50_D2LBM7 Glycosyl transferase family 2 n=1 Tax=Rhodomicro... 213 2e-53 UniRef50_C6XK76 Glycosyl transferase family 2 n=1 Tax=Hirschia b... 211 9e-53 UniRef50_B9JQZ7 Glycosyltransferase n=1 Tax=Agrobacterium vitis ... 210 2e-52 UniRef50_A7HQ64 Glycosyl transferase family 2 n=1 Tax=Parvibacul... 210 2e-52 UniRef50_D2M877 Glycosyl transferase family 2 n=1 Tax=Rhodopseud... 208 8e-52 UniRef50_Q0EZ04 Putative uncharacterized protein n=1 Tax=Maripro... 207 1e-51 UniRef50_A1B414 General secretory system II, protein E domain pr... 203 3e-50 UniRef50_D0L467 Glycosyl transferase family 2 n=1 Tax=Gordonia b... 202 4e-50 UniRef50_A4T169 Putative uncharacterized protein n=4 Tax=Mycobac... 202 4e-50 UniRef50_B9L0Q0 Glycosyl transferase, group 2 family protein n=2... 201 8e-50 UniRef50_A3UGE7 Putative uncharacterized protein n=1 Tax=Oceanic... 200 2e-49 UniRef50_B8HEE8 Glycosyl transferase family 2 n=1 Tax=Arthrobact... 199 4e-49 UniRef50_B5ZHV8 Glycosyl transferase family 2 n=2 Tax=Gluconacet... 198 8e-49 UniRef50_A4WQA1 General secretory system II, protein E domain pr... 196 2e-48 UniRef50_C6QAR9 Glycosyl transferase family 2 n=1 Tax=Hyphomicro... 196 3e-48 UniRef50_B9R454 Glycosyl transferase, group 2 family protein n=1... 194 9e-48 UniRef50_B8H475 N-acetylglucosaminyltransferase n=3 Tax=Caulobac... 193 2e-47 UniRef50_A3TTM2 Glycosyl transferase, family 2 n=1 Tax=Oceanicol... 193 2e-47 UniRef50_B4W7G8 Putative uncharacterized protein n=1 Tax=Brevund... 193 3e-47 UniRef50_A3VHJ3 Glycosyl transferase, family 2 n=2 Tax=Rhodobact... 193 3e-47 UniRef50_B5JBJ3 GSPII_E N-terminal domain family n=2 Tax=Octadec... 192 4e-47 UniRef50_Q28T00 Glycosyl transferase family 2 n=1 Tax=Jannaschia... 191 7e-47 UniRef50_A8LS31 Glycosyl transferase n=1 Tax=Dinoroseobacter shi... 191 1e-46 UniRef50_UPI0001B55850 glycosyl transferase family 2 n=1 Tax=Str... 190 1e-46 UniRef50_D1NA10 General secretion pathway protein E n=1 Tax=Vict... 189 2e-46 UniRef50_Q168M3 Glycosyl transferase, putative n=12 Tax=Rhodobac... 189 4e-46 UniRef50_A3V835 Glycosyltransferase, family 2 n=2 Tax=Rhodobacte... 188 7e-46 UniRef50_C0R5T9 Glycosyl transferase, group 2 family protein n=8... 188 8e-46 UniRef50_A6DXN0 Glycosyl transferase, group 2 family protein n=2... 187 1e-45 UniRef50_B6R4T3 Glycosyl transferase, family 2 n=1 Tax=Pseudovib... 187 2e-45 UniRef50_B0T1N0 Putative uncharacterized protein n=1 Tax=Cauloba... 186 2e-45 UniRef50_Q2FNF4 Glycosyl transferase, family 2 n=1 Tax=Methanosp... 185 5e-45 UniRef50_A7IHY2 Glycosyl transferase family 2 n=1 Tax=Xanthobact... 184 9e-45 UniRef50_A3JQ28 Glycosyl transferase, family 2 n=1 Tax=Rhodobact... 183 2e-44 UniRef50_A3XKW0 Glycosyltransferase related protein n=1 Tax=Leeu... 182 5e-44 UniRef50_Q8NU22 Glycosyltransferases, probably involved in cell ... 181 6e-44 UniRef50_Q0FJ05 Glycosyl transferase, group 2 family protein n=2... 181 7e-44 UniRef50_B7QPG7 Glycosyl transferase, group 2 family n=4 Tax=Rho... 181 9e-44 UniRef50_B8IGT7 Glycosyl transferase family protein n=9 Tax=Alph... 180 2e-43 UniRef50_B6B2W2 Glycosyl transferase, group 2 family protein n=1... 178 7e-43 UniRef50_Q1GE89 Glycosyl transferase family 2 n=3 Tax=Rhodobacte... 178 8e-43 UniRef50_B1YJT8 Type II secretion system protein E n=5 Tax=Bacil... 177 1e-42 UniRef50_Q1RIH7 Glycosyltransferase n=11 Tax=Rickettsia RepID=Q1... 174 8e-42 UniRef50_C8NRH6 Group 2 glycosyl transferase n=5 Tax=Corynebacte... 174 1e-41 UniRef50_B2IIL4 Glycosyl transferase family 2 n=2 Tax=Beijerinck... 171 8e-41 UniRef50_D0XKL7 Putative uncharacterized protein n=1 Tax=Brevund... 167 2e-39 UniRef50_C6CWA2 Glycosyl transferase family 2 n=4 Tax=Bacillales... 166 2e-39 UniRef50_A5CDZ1 Putative glycosyl transferase, group 2 n=2 Tax=O... 166 3e-39 UniRef50_C7M1J2 Glycosyl transferase family 2 n=1 Tax=Acidimicro... 166 4e-39 UniRef50_A8MGF8 Type II secretion system protein E n=1 Tax=Alkal... 161 8e-38 UniRef50_B7APV7 Putative uncharacterized protein n=2 Tax=Bacteri... 161 9e-38 UniRef50_C1AA14 Type IV pilus assembly protein PilB n=1 Tax=Gemm... 158 6e-37 UniRef50_Q15ZI3 Type II secretion system protein E n=119 Tax=Pro... 158 6e-37 UniRef50_B2A7G1 Type II secretion system protein E n=5 Tax=Firmi... 157 1e-36 UniRef50_A6M148 Type II secretion system protein E n=18 Tax=Clos... 157 2e-36 UniRef50_C6MLL4 General secretory system II protein E domain pro... 157 2e-36 UniRef50_C3RLB0 Type II secretion system protein E n=2 Tax=Bacte... 151 8e-35 UniRef50_C4Z588 Type IV pilus assembly protein PilB n=9 Tax=Clos... 150 2e-34 UniRef50_C8WRT0 Glycosyl transferase family 2 n=2 Tax=Alicycloba... 146 2e-33 UniRef50_C5SLZ1 Putative uncharacterized protein n=1 Tax=Asticca... 146 3e-33 UniRef50_Q1AWU6 Type II secretion system protein E n=1 Tax=Rubro... 144 1e-32 UniRef50_A7GMR7 Glycosyl transferase family 2 n=3 Tax=Bacillales... 144 1e-32 UniRef50_A6Q5B1 General secretory pathway protein E n=3 Tax=Epsi... 144 1e-32 UniRef50_Q2RY77 Type II secretion system protein E n=2 Tax=Prote... 143 2e-32 UniRef50_B9M3F1 General secretory system II protein E domain pro... 142 3e-32 UniRef50_C1XM86 Type II secretory pathway, ATPase PulE/Tfp pilus... 142 4e-32 UniRef50_Q47AJ5 Type II secretion system protein E:General secre... 142 4e-32 UniRef50_D0LLW4 General secretory pathway protein E n=2 Tax=Nann... 142 4e-32 UniRef50_Q0A8B9 Type II secretion system protein E n=1 Tax=Alkal... 142 6e-32 UniRef50_B1IIP9 Glycosyl transferase, group 2 family protein n=2... 141 1e-31 UniRef50_C7RLS9 Type II secretion system protein E n=3 Tax=Betap... 140 2e-31 UniRef50_C9RLV9 Type II secretion system protein E n=1 Tax=Fibro... 140 2e-31 UniRef50_Q01NU4 Type II secretion system protein E (GspE) n=1 Ta... 138 7e-31 UniRef50_D2RJA7 Glycosyl transferase family 2 n=10 Tax=Veillonel... 138 7e-31 UniRef50_D0LFN0 General secretory system II protein E domain pro... 138 8e-31 UniRef50_A4J3A4 Type II secretion system protein E n=1 Tax=Desul... 137 1e-30 UniRef50_B8FZG2 Glycosyl transferase family 2 n=4 Tax=Clostridia... 137 1e-30 UniRef50_A5D548 Glycosyltransferases, probably involved in cell ... 137 2e-30 UniRef50_C6PVI5 Glycosyl transferase family 2 n=2 Tax=Clostridiu... 137 2e-30 UniRef50_B2V1W7 Glycosyl transferase, group 2 family protein n=2... 137 2e-30 UniRef50_A2SK33 Type II secretory pathway ATPase PulE/Tfp pilus ... 137 2e-30 UniRef50_A8ZYV7 Type II secretion system protein E n=11 Tax=Delt... 137 2e-30 UniRef50_A6W755 Type II secretion system protein E n=3 Tax=Actin... 136 2e-30 UniRef50_A6BBY4 Msha biogenesis protein mshe n=1 Tax=Vibrio para... 136 3e-30 UniRef50_D2QZA7 Type II secretion system protein E n=1 Tax=Pirel... 136 3e-30 UniRef50_Q39PZ3 Response regulator receiver domain protein (CheY... 135 5e-30 UniRef50_D1B8K4 Glycosyl transferase family 2 n=1 Tax=Thermanaer... 135 5e-30 UniRef50_A1TUR3 General secretory pathway protein E n=5 Tax=Prot... 135 7e-30 UniRef50_Q21L22 Type II secretion system protein E n=4 Tax=Prote... 135 8e-30 UniRef50_A6C2Q8 Type II secretion system protein E n=3 Tax=Planc... 134 8e-30 UniRef50_A3CR73 PilB-like pili biogenesis ATPase, putative n=2 T... 134 1e-29 UniRef50_C6MU90 Type II secretion system protein E n=1 Tax=Geoba... 134 1e-29 UniRef50_A1AV36 General secretory system II, protein E domain pr... 133 2e-29 UniRef50_C9R8Y9 Glycosyl transferase family 2 n=1 Tax=Ammonifex ... 132 3e-29 UniRef50_B4UHA9 General secretory system II protein E domain pro... 132 4e-29 UniRef50_Q1JZD9 Response regulator receiver protein n=1 Tax=Desu... 131 7e-29 UniRef50_B8JAJ3 General secretory system II protein E domain pro... 131 9e-29 UniRef50_Q1D5Q5 General secretory system II protein E, N-termina... 131 1e-28 UniRef50_C6MLR2 Type II secretion system protein E n=1 Tax=Geoba... 131 1e-28 UniRef50_Q3A3E8 Putative type IV pilin n=1 Tax=Pelobacter carbin... 131 1e-28 UniRef50_A9BY08 General secretory pathway protein E n=2 Tax=Prot... 130 2e-28 UniRef50_C5V6N2 Type II secretion system protein E n=1 Tax=Galli... 130 2e-28 UniRef50_Q1D416 General secretory system II protein E, N-termina... 130 2e-28 UniRef50_Q08MP9 Gspii_e N-terminal domain family (Fragment) n=1 ... 130 2e-28 UniRef50_B3EAP3 General secretory system II protein E domain pro... 129 3e-28 UniRef50_D2QZD9 Type II secretion system protein E n=4 Tax=Planc... 129 4e-28 UniRef50_Q39Q61 General secretory system II, protein E-like n=2 ... 129 4e-28 UniRef50_B9XRT3 Type II secretion system protein E n=2 Tax=Verru... 128 7e-28 UniRef50_A1WFR3 Type II secretion system protein E (GspE) n=8 Ta... 127 2e-27 UniRef50_Q1CX93 General secretion protein E N-terminal domain pr... 127 2e-27 UniRef50_B2KC07 Type II secretion system protein E n=1 Tax=Elusi... 126 3e-27 UniRef50_B1Y3Z9 General secretory system II protein E domain pro... 124 2e-26 UniRef50_C5ESB9 Type II secretion system protein E:General secre... 123 2e-26 UniRef50_A3ZX01 General secretion pathway protein E n=3 Tax=Plan... 123 3e-26 UniRef50_Q2JEE3 Glycosyltransferases probably involved in cell w... 122 4e-26 UniRef50_Q1Q644 Strongly similar to general secretion pathway pr... 122 5e-26 UniRef50_Q08Q27 Serine/threonine kinase PKN11 n=1 Tax=Stigmatell... 122 6e-26 UniRef50_Q1D888 General secretory system II protein E, N-termina... 121 8e-26 UniRef50_D2MLP0 Glycosyltransferase, group 2 family protein n=1 ... 120 2e-25 UniRef50_C0A4E0 Type II secretion system protein E n=2 Tax=Chlam... 120 2e-25 UniRef50_B8F8X7 Response regulator receiver protein n=3 Tax=Prot... 120 2e-25 UniRef50_Q7UEJ7 Probable general secretion pathway protein E n=1... 119 3e-25 UniRef50_B2A563 Type II secretion system protein E n=5 Tax=cellu... 119 3e-25 UniRef50_Q2IEU7 General secretory system II, protein E-like n=3 ... 119 4e-25 UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococ... 119 4e-25 UniRef50_C4ZLP1 Type II secretion system protein E n=3 Tax=Betap... 117 1e-24 UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolob... 117 1e-24 UniRef50_D0LM65 General secretory system II protein E domain pro... 117 1e-24 UniRef50_D1WQZ8 Glycosyltransferase probably involved in cell wa... 117 1e-24 UniRef50_B8DWB6 Predicted glycosyltransferase n=10 Tax=Bifidobac... 117 2e-24 UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium... 116 2e-24 UniRef50_A1ASU1 Response regulator receiver protein n=1 Tax=Pelo... 116 3e-24 UniRef50_C8W151 Type II secretion system protein E n=1 Tax=Desul... 116 4e-24 UniRef50_B3E9S8 Type II secretion system protein E n=2 Tax=Geoba... 115 7e-24 UniRef50_Q098P0 Putative ATPase n=1 Tax=Stigmatella aurantiaca D... 114 1e-23 UniRef50_A5KT43 Type II secretion system protein E n=6 Tax=candi... 114 1e-23 UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 114 1e-23 UniRef50_A7HJC2 Type II secretion system protein E n=11 Tax=Ther... 114 1e-23 UniRef50_Q1AST4 Glycosyl transferase, family 2 n=1 Tax=Rubrobact... 114 2e-23 UniRef50_A1SFQ9 Type II secretion system protein E n=1 Tax=Nocar... 114 2e-23 UniRef50_B8JGM0 General secretory system II protein E domain pro... 114 2e-23 UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofil... 114 2e-23 UniRef50_A3WSK4 Glycosyl transferase, family 2 n=1 Tax=Nitrobact... 112 4e-23 UniRef50_Q1D3E1 General secretion pathway protein E, N-terminal ... 112 4e-23 UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosami... 112 5e-23 UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bact... 112 6e-23 UniRef50_Q09DM8 Serine/threonine-protein kinase Pkn6 n=1 Tax=Sti... 112 6e-23 UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio... 111 9e-23 UniRef50_C8WGD9 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 111 1e-22 UniRef50_UPI00016B268C type II secretion system protein E n=1 Ta... 111 2e-22 UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococc... 111 2e-22 UniRef50_A9GDR9 General secretion pathway protein E n=1 Tax=Sora... 110 2e-22 UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus Re... 110 2e-22 UniRef50_Q749Y3 Type IV pilus assembly protein, putative n=6 Tax... 110 2e-22 >UniRef50_P0AFA6 Bacteriophage N4 adsorption protein B n=53 Tax=Proteobacteria RepID=NFRB_ECO57 Length = 745 Score = 984 bits (2543), Expect = 0.0, Method: Composition-based stats. Identities = 745/745 (100%), Positives = 745/745 (100%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY Sbjct: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV Sbjct: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR Sbjct: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT Sbjct: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF Sbjct: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG Sbjct: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ Sbjct: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV Sbjct: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ Sbjct: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK Sbjct: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ Sbjct: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS Sbjct: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 Query: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 MQSLLLKAGLNTEQVAQLESENEGE Sbjct: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 >UniRef50_B7LP59 Bacteriophage N4 receptor, inner membrane subunit n=7 Tax=Bacteria RepID=B7LP59_ESCF3 Length = 750 Score = 951 bits (2458), Expect = 0.0, Method: Composition-based stats. Identities = 631/744 (84%), Positives = 687/744 (92%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 ++WLLD+F+TWLYGLK IAI LA++M ISGLDD FIDVVYW+RR+KR LSVYRRYPRM+Y Sbjct: 7 VEWLLDLFSTWLYGLKFIAIALAIMMLISGLDDLFIDVVYWLRRVKRSLSVYRRYPRMNY 66 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV Sbjct: 67 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 126 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSA FAFAGFILHDAEDVISPMELR Sbjct: 127 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSAKFAFAGFILHDAEDVISPMELR 186 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 LFNYLV+RKDLIQIPVYPFER+WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT Sbjct: 187 LFNYLVDRKDLIQIPVYPFERKWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 246 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 CFSRRA++ALLADGDGIAFDVQSLTEDYDIGFRLKEKGM+EIFVRFPVVD+ K E RK Sbjct: 247 CFSRRAISALLADGDGIAFDVQSLTEDYDIGFRLKEKGMSEIFVRFPVVDDGKTGEPRKL 306 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 Q RT NMICVREYFPDTF+TAVRQKSRWIIGIVFQGFKTHKWTS+L LNYFLWRDRKG Sbjct: 307 FQSKRTHNMICVREYFPDTFTTAVRQKSRWIIGIVFQGFKTHKWTSNLILNYFLWRDRKG 366 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 AISNF+SF+AMLV IQL+LL+ Y++ WP+AWHFLSIF+ SA TLLW+NF LMVNRIVQ Sbjct: 367 AISNFISFIAMLVFIQLMLLMLYQTFWPNAWHFLSIFTDSAAFTTLLWMNFALMVNRIVQ 426 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 RVIFVTGYYGLTQG+LSVLRL WGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV Sbjct: 427 RVIFVTGYYGLTQGILSVLRLCWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 486 Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 +G+ R+LRPLGQILLEN VITE QL+ AL NR++GLRLGGSMLMQGLI+A+QLAQALAEQ Sbjct: 487 SGENRALRPLGQILLENHVITETQLEQALTNRIQGLRLGGSMLMQGLITAQQLAQALAEQ 546 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 NGV WES+DAWQIP LI ++PASVALHYAVLPLR+E+D L+VGSEDGIDPVSLAAL+RK Sbjct: 547 NGVGWESVDAWQIPRYLIEQIPASVALHYAVLPLRIEDDVLVVGSEDGIDPVSLAALSRK 606 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 GR+VRYVIVLRGQ+VTGLRHWYARRRG D R +L AV +WLT QQ EIW+Q+V HQ Sbjct: 607 TGRQVRYVIVLRGQVVTGLRHWYARRRGRDARELLEQAVLRRWLTPQQQTEIWQQFVQHQ 666 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 FLFAE+LTTLGHINRSAIN LLLRHERS PLG FLV EGVISQETLDRVL+IQ+ LQVS Sbjct: 667 FLFAEVLTTLGHINRSAINALLLRHERSDRPLGAFLVAEGVISQETLDRVLSIQQNLQVS 726 Query: 721 MQSLLLKAGLNTEQVAQLESENEG 744 MQSLL AGL T Q+A+LE+++EG Sbjct: 727 MQSLLQAAGLTTMQIAELETDHEG 750 >UniRef50_B3GN83 Bacteriophage N4 adsorption NfrB-like protein n=1 Tax=Zymomonas mobilis subsp. mobilis RepID=B3GN83_ZYMMO Length = 729 Score = 780 bits (2013), Expect = 0.0, Method: Composition-based stats. Identities = 349/714 (48%), Positives = 461/714 (64%), Gaps = 9/714 (1%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 + IAI +A+++ + G+DD FID +W+R I R+ +Y YP ++L+ +EKPLAIM Sbjct: 20 FRYIAIFVAILVTLFGIDDIFIDSCFWIRSIYRRFFIYSHYPHADEKQLFSKNEKPLAIM 79 Query: 75 VPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 VPAW E GV+ NMA LAA TLDYENYHIFVGTYPNDP+TQ DVD V +++PNVHK+VCAR Sbjct: 80 VPAWREVGVVANMARLAAETLDYENYHIFVGTYPNDPETQNDVDAVVSQYPNVHKIVCAR 139 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQI 194 PGPTSKADCLNNV+DAI FE +A FAGFILHDAEDVISP+ELRLFNYLV RKD+IQI Sbjct: 140 PGPTSKADCLNNVIDAIFHFEEAAAIEFAGFILHDAEDVISPLELRLFNYLVARKDMIQI 199 Query: 195 PVYPF-EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 PVYPF + FT Y+DEFSE HGKDV VREAL GQVPSAGVGTCFSRRA+T LL + Sbjct: 200 PVYPFISDRFGDFTRNHYVDEFSEHHGKDVVVREALTGQVPSAGVGTCFSRRAITLLLKE 259 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE-QRKFLQHARTSNMICV 312 DG FD SLTEDYDI FRL +GM+ IF R+PV D ++K R + +ICV Sbjct: 260 SDGFPFDTTSLTEDYDISFRLYREGMSCIFARYPVTDPQYAFPIKQKIGMDRRYTQVICV 319 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 RE+FPD F AVRQKSRWI GIVFQG + W +NYFLWRDR+G I+N V FLA + Sbjct: 320 REHFPDHFKYAVRQKSRWITGIVFQGTRNLGWEHRAIMNYFLWRDRRGIITNIVGFLANI 379 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 ++ + LL +L W F+S+ S +A L LLW+N +++NR QR FVT YYG+ Sbjct: 380 LLFFVALLWIISALNLKGWSFMSVLSDNALLSVLLWVNGFILLNRAAQRCFFVTKYYGIK 439 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQ 492 QGL S R+ WGN++N A RA QV+ G+ +R+AWDKTTHDFPS+ R P+G Sbjct: 440 QGLTSPFRMVWGNIVNSFACIRAFWQVITIGNIKRMAWDKTTHDFPSIPVSRRE--PIGL 497 Query: 493 ILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 ++ + L+ L+ + RLG +L++GLI++EQLA+ALA Q + S + + Sbjct: 498 WMVAQNFLKNSDLEQVLQAPRQH-RLGQELLLRGLINSEQLAKALAHQASLKAVSFNIFY 556 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLR 612 + S LIA P +A YAVLP + L + E + P+SL ++R +G V +I + Sbjct: 557 LDSKLIAAFPRYLACRYAVLPFSQKGKALQLICEHALSPISLGVISRHIGLNVECLIAPQ 616 Query: 613 GQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGH 672 G++ GLR+WY +G+ P + + + L + E + F +IL T GH Sbjct: 617 GRVTLGLRYWYPG-QGNQPST---DRIIKELLKDPNNIEKQDTVCIYLAQFGDILQTTGH 672 Query: 673 INRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLL 726 I +L+ + + LG++LV +ISQE L+ L Q + + + +LL Sbjct: 673 IPEPIFAQVLIDFDPDKMKLGEYLVKRKLISQEILEECLKEQNKQEEMAEKVLL 726 >UniRef50_B4SJA4 General secretory system II protein E domain protein n=15 Tax=Proteobacteria RepID=B4SJA4_STRM5 Length = 715 Score = 633 bits (1632), Expect = e-180, Method: Composition-based stats. Identities = 266/708 (37%), Positives = 380/708 (53%), Gaps = 38/708 (5%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRR--YPRMSYRELYKPDEKPLAIMVPAWNETGV 83 + IS LDD FIDV YWVR R L++ RR Y ++ +L + E+PLAIMVPAW E V Sbjct: 28 ILISSLDDLFIDVWYWVRESWRALTIKRRDAYKPLTQEDLLQRPEQPLAIMVPAWMEYDV 87 Query: 84 IGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADC 143 I M E LDY Y +FVGTYPND T +V+ + R+ + +V GPTSKADC Sbjct: 88 IAQMVENMINVLDYREYVVFVGTYPNDQQTIDEVERMRRRYKRLRRVEVPHDGPTSKADC 147 Query: 144 LNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREW 203 LN ++ AI ++E+ + FAG ILHD+EDV+ PMELR +NYL+ RKD+IQ+PV +REW Sbjct: 148 LNWLILAIFEYEKRHDIEFAGVILHDSEDVLHPMELRFYNYLLPRKDMIQLPVTSLDREW 207 Query: 204 THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQS 263 + Y+DEF+E H KD+ VRE+++G VPSAGVGTCFSRRA+ AL A D F+ S Sbjct: 208 YELVAGVYMDEFAEWHAKDLVVRESVSGMVPSAGVGTCFSRRALLALSAQTDNQPFNTDS 267 Query: 264 LTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ-RKFLQHARTSNM-ICVREYFPDTFS 321 LTEDYD+G RL GM IF RFPV + RT M +CVREYFPD F Sbjct: 268 LTEDYDVGARLAAMGMQSIFARFPVQFRVRRPSWFGWGPVRERTQQMALCVREYFPDNFR 327 Query: 322 TAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLL 381 + RQK+RW++GI Q +++ W SL Y L RDRKG I++FVS +A ++ +QLLL Sbjct: 328 ASYRQKARWVLGIGLQSWESLGWRGSLATKYLLARDRKGIITSFVSIIAYVIFLQLLLFW 387 Query: 382 AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRL 441 + F S+F W M + L + R+VQR FV YG L+S+ R+ Sbjct: 388 LLKMTGVWTMQFPSVFQPGTWQMNVALLTTAALATRVVQRFYFVNRLYGWEHALMSIPRM 447 Query: 442 FWGNLINFMANWRALKQVLQHGD-PRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVI 500 GN+INFMA RA K L + +R+ WDKT HDFP ++ + LG++L Q + Sbjct: 448 VVGNMINFMATARAWKVFLAYLLFGKRMVWDKTMHDFPDAAQLVQTRKQLGELLGTWQAV 507 Query: 501 TEEQLDTALRNRVEGL--RLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLI 558 E+L AL + G LG +L QG + E LA+A+A Q + ID + Sbjct: 508 EPERLQQALDQQHAGRQQPLGRILLTQGWLDDETLAEAIAFQGDLPRAVID-VDYLRACQ 566 Query: 559 AEMPASVALHYAVLPLR-LENDELIVGSEDGIDPVSLAALTRKVGRK-VRYVIVLRGQIV 616 + A + + +LPL + L + + +LA L ++ + + I +I Sbjct: 567 FPVSADACVQWRMLPLPPRQEGTLRLAVASPLPEEALALLKQETRSEHIEQSIARESEIN 626 Query: 617 TGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRS 676 GLR R+ H + + L ++L + I+ + Sbjct: 627 AGLRLIGGDRQWH---------------------------LDNVPLLGDLLVEMRLIDHA 659 Query: 677 AINVLLLRHER-SSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 + L ++ +G +LV +G+ ++E + + + QR ++QS Sbjct: 660 RFEIALDDYKPQRDGRIGDYLVKQGITTEEAVAQAMQEQRRRAATLQS 707 >UniRef50_A5P922 Bacteriophage N4 receptor, inner membrane subunit n=1 Tax=Erythrobacter sp. SD-21 RepID=A5P922_9SPHN Length = 698 Score = 626 bits (1615), Expect = e-178, Method: Composition-based stats. Identities = 245/717 (34%), Positives = 372/717 (51%), Gaps = 36/717 (5%) Query: 10 TWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEK 69 +++ + A+ +A ++ IS LDD F+D V+W+ KR+ +S L + E Sbjct: 4 SYIVAFECAALVVATLIAISSLDDLFVDSVFWIAMAKRRFLGKGEPRTVSPETLIERPEA 63 Query: 70 PLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 P+AIM+PAW E VI +M E A TL Y NY IF+G+Y NDP+T +V+++ AR+ V Sbjct: 64 PIAIMLPAWQEADVIASMVENAIHTLVYRNYFIFIGSYANDPETILEVEKLAARYGRVRH 123 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERK 189 V GPT KADCLN+++ I + E+ + FAG +LHD+EDV+ P+EL LFNYL+ + Sbjct: 124 VRVPHYGPTCKADCLNHIVADILRLEKEVDIEFAGLVLHDSEDVLHPLELHLFNYLLPSR 183 Query: 190 DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTA 249 D+IQ+PV E+ +T F + TY+D+F+E H KD+ VR+ LA VPSAGVGTCFSRRA+ Sbjct: 184 DMIQLPVVSLEQRFTDFVAGTYMDDFAESHAKDLVVRQMLAKSVPSAGVGTCFSRRAIEV 243 Query: 250 LLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER--EQRKFLQHARTS 307 +L G F+ Q+LTEDYD+G RL ++G+ +PV +++ R + TS Sbjct: 244 MLE--AGEPFNTQTLTEDYDVGSRLAKRGLNASIELYPVEFRSRQFGHFGRGPERVGTTS 301 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVS 367 +CVRE+FP+TF + RQK+RWI+GI QG+ W S+ NYFL RDRK I+ ++ Sbjct: 302 KPLCVREHFPNTFRASYRQKARWILGIALQGWAQLGWDRSIVSNYFLCRDRKALITPTLA 361 Query: 368 FLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTG 427 LA ++ L + A + IFS L N + R+VQRV FV Sbjct: 362 VLAYVLTAMYLGATIWSFASGGA--AIPIFSNHPIASYLFSFNLFALAARVVQRVYFVAK 419 Query: 428 YYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHG-DPRRVAWDKTTHDFPSVTGDTRS 486 Y T L+V R+ + INF A+ RA++ + +AWDKT H FPS + Sbjct: 420 IYCWTHAFLAVPRMVVLSFINFAASVRAIRIFVGSKFSGNPIAWDKTNHRFPSDEALGKE 479 Query: 487 LRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 R LG+IL V++ L+ A L + EG LG ++ G+I + L +A++ QN + Sbjct: 480 KRRLGEILRGWDVVSSPMLEKALLYQKREGGMLGDLLVRDGVIDEDVLTEAISTQNQLPR 539 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 ++ + + L R +L + GI A L Sbjct: 540 AELNL-------------DMVCEHLDLLDRATMTQLQI-LPFGISSKGEALLAVAKPLAC 585 Query: 606 RYVIVLRGQIVTGLR-HWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFA 664 ++ ++ G R H + R + A L N + + VP Sbjct: 586 EQTRLIWSRMSKGYREHIVPQSRIKEILAALTNIPERNF------------PVPSVPRVH 633 Query: 665 EILTTLGHINRSAINVLLLRHERSSL-PLGKFLVTEGVISQETLDRVLTIQRELQVS 720 E+L + + + + LL + + +G+FLV +G I+Q TLD+ L ++ L S Sbjct: 634 ELLLSQKQLKKKELQNLLKDYNVARHGTIGQFLVAKGTITQATLDKTLELRTSLIES 690 >UniRef50_B2UJM9 General secretory system II protein E domain protein n=12 Tax=cellular organisms RepID=B2UJM9_RALPJ Length = 703 Score = 617 bits (1591), Expect = e-175, Method: Composition-based stats. Identities = 233/625 (37%), Positives = 343/625 (54%), Gaps = 16/625 (2%) Query: 11 WLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP 70 ++ L + + +++ IS DDFF+D YWVR + R +S + L +E+ Sbjct: 9 YVALLNTLTVATTLVILISTADDFFLDAFYWVRELWLWPQRGRTPVTISAQALRDREEQW 68 Query: 71 LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVHK 129 LAIMVPAW E VI M E T++Y + IF G Y ND +T +V+ + R+P V + Sbjct: 69 LAIMVPAWKEYDVIAKMVENTLATMEYTRFIIFAGAYRNDAETTTEVERMVRRYPGRVVR 128 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERK 189 GPT KADCLN ++ I ++E FAG I+HD EDVI P+EL+ FNY + + Sbjct: 129 AAVTHDGPTCKADCLNTIIQTIIRYEAGHGIRFAGVIMHDCEDVIHPLELKYFNYFISDQ 188 Query: 190 DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTA 249 DL+Q+PV ER+W + + TY+D+FSE H KD+ R+AL G VP AGV C+SRRA+ A Sbjct: 189 DLVQLPVLSLERKWYEWVAGTYMDDFSETHQKDLVARQALTGTVPGAGVALCYSRRAIEA 248 Query: 250 LLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVD------EAKEREQRKFLQH 303 ++ F+ +LTEDYD FRL+E GM E FV FPV + + R+ + + Sbjct: 249 VMKVRGDAPFNTSTLTEDYDFSFRLRELGMREAFVHFPVCENTAPVADGTGRQPTHWWTN 308 Query: 304 AR---TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 R ++ REYFP TF TA RQ++RW++GI FQG+ W +L Y +RDRKG Sbjct: 309 RRREARPQLLATREYFPSTFRTAYRQRARWVLGIAFQGWLQMGWKGNLITKYMFFRDRKG 368 Query: 361 AISNFVSFLAMLV-MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIV 419 ++ S LA + + LL+ + + W A + G+ W+ LL +N L++NR+ Sbjct: 369 VLTALFSILAYALSLNYLLVAVLLDKGWVTA-SEGAFVVGTIWMQDLLAINATLLINRLA 427 Query: 420 QRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHG-DPRRVAWDKTTHDFP 478 QRV FV G Q +L + RL N INF + RA K L + + +AWDKT H + Sbjct: 428 QRVYFVGRLNGPLQAVLCLPRLVVNNFINFFSVCRAWKIFLIYCFTGKPIAWDKTQHTYL 487 Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 S R+ LG+ LL+ +VIT+EQLD AL + G RLG ++ QGL++ + LA AL Sbjct: 488 SNDALGRTRCKLGETLLKWEVITQEQLDAALAIQQQTGRRLGQVLVQQGLVTPDTLADAL 547 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL-ENDELIVGSEDGIDPVSLAA 596 AEQ + S+ + +L +P +A+ + V+P + E+ L + + D +L Sbjct: 548 AEQADLPRVSLTNVVL-GALADCLPRDLAVRHHVVPFSIGEDGSLNIAVSELPDGEALQE 606 Query: 597 LTRKVGRKVRYVIVLRGQIVTGLRH 621 L R GRKV + ++ L Sbjct: 607 LARAAGRKVACFMACDHEMSAELAQ 631 >UniRef50_C6N7W0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N7W0_9GAMM Length = 501 Score = 508 bits (1307), Expect = e-142, Method: Composition-based stats. Identities = 186/488 (38%), Positives = 272/488 (55%), Gaps = 9/488 (1%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVP 76 + L+ + ISG+DD F D YW+R + R L R Y ++Y +L + +E+ +A+++P Sbjct: 15 YFLVALSCLFIISGIDDLFFDGYYWIRYVFR-LWKTRGYKPLTYEQLAEKEEQMIAVLIP 73 Query: 77 AWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 W+E GVIG M + ++DY NY++FVG YPNDP+T +V EV NV V+ PG Sbjct: 74 CWHEAGVIGTMLKHNCYSIDYSNYYLFVGVYPNDPETVNEVQEVANLIKNVRCVIGTTPG 133 Query: 137 PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPV 196 PT+KA LN + + + FE+ N +F+ F+ HD+ED+I PM +L+NYL+ RK++IQIPV Sbjct: 134 PTNKAANLNGIYNYVKAFEKELNRSFSIFVFHDSEDIIHPMSFKLYNYLMPRKEMIQIPV 193 Query: 197 YPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDG 256 +P E + +FT Y DEFSE H KD+ VRE++ G VPSAGVGT FSR A+ L Sbjct: 194 FPLEINYWNFTHWLYADEFSENHTKDIIVRESIHGHVPSAGVGTAFSRHALKLLEDPTTR 253 Query: 257 IAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ---RKFLQHARTSNMICVR 313 F SLTEDY ++ KG+ +IFV +V K R + RK T I R Sbjct: 254 TPFSTDSLTEDYRTSLAIRIKGLKQIFVTETIV-RMKWRPRGFFRKGYVQKPTREYIATR 312 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 FP ++ AVRQK+RWIIGIVFQ ++ +W + + L DRK I++F++ V Sbjct: 313 ALFPLEYTKAVRQKARWIIGIVFQEWQHTQWPKEWIIRFTLAHDRKSFITHFINGFGYFV 372 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 + L+ P+ F+ W+ L+ +M+ R++QR+I + YG Sbjct: 373 FLFWLVYSLCTYTNPEYPSLQEQFNLHPWVWWLIVTVTLMMIERMIQRMIAIRRVYGWIP 432 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQ----HGDPRRVAWDKTTHDFPSVTGDTRSLRP 489 LS+ R F+GNL+N A RA ++ +WDKT H FP T + Sbjct: 433 SFLSIPRTFYGNLLNLHALIRAYHVYYTTPKSQATSKQPSWDKTDHHFPGSHILTPYRKK 492 Query: 490 LGQILLEN 497 +G +LLE Sbjct: 493 IGDLLLEK 500 >UniRef50_Q136N6 Bacteriophage N4 adsorption protein B n=1 Tax=Rhodopseudomonas palustris BisB5 RepID=Q136N6_RHOPS Length = 497 Score = 478 bits (1229), Expect = e-133, Method: Composition-based stats. Identities = 178/468 (38%), Positives = 254/468 (54%), Gaps = 6/468 (1%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAI 73 ++++ + AV++ +S +DD +D++YW RR+ R + + ++ E + P+A+ Sbjct: 14 VVEIMLVVTAVLVALSSIDDLVVDLLYWGRRLTRP-NAFDATADLATME--AIPQAPIAV 70 Query: 74 MVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 ++PAW E VI +M T YENYH+FVG Y ND T +V A+ VH VV Sbjct: 71 IIPAWQEHEVIFSMLAANQATTKYENYHLFVGAYQNDAATLTEVRRAEAQSNRVHLVVVP 130 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQ 193 R GPTSKADCLN V + + FE++ FAG +LHDAED+I P EL LFN++ D IQ Sbjct: 131 RDGPTSKADCLNVVANGVFAFEQAKGIQFAGLVLHDAEDLIHPYELVLFNFMAHDNDFIQ 190 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 +PV+ F+R Y+DEF+E H KD+PVR ++G VP AGV F R +AD Sbjct: 191 LPVFSFKRPLRELVGGVYMDEFAESHLKDIPVRRMISGLVPCAGVAAFFGRDIALRTMAD 250 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 G F SLTEDYD RL G FV P + I R Sbjct: 251 NAGSLFRSDSLTEDYDFALRLGLLGARVNFVIAPASYTIDISSSTDLPEIVGRKLPIATR 310 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 E+FP++F A RQ++RW++GIVFQG ++ W + + Y L RDRK ++ + LA LV Sbjct: 311 EFFPNSFVAAQRQRARWLMGIVFQGTRSFGWRGTTGIKYALLRDRKSILTAPLIMLAYLV 370 Query: 374 MIQLLLL-LAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 + L+ + L + PD + + + LLWLNF ++ R++ R F YGL Sbjct: 371 LFGLVSVNLYFRWYLPDEVNQFPLLQ-EPLVQQLLWLNFAFLIWRLLHRFYFTNRIYGLR 429 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHG-DPRRVAWDKTTHDFPS 479 GL+S+ RL GN +NF A RA + L H R+ WDKT H +P+ Sbjct: 430 HGLMSIPRLPLGNFLNFFAVARACRLYLSHSLLGTRLVWDKTEHQYPT 477 >UniRef50_Q01UW7 Bacteriophage N4 receptor, outer membrane protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01UW7_SOLUE Length = 466 Score = 459 bits (1180), Expect = e-127, Method: Composition-based stats. Identities = 172/484 (35%), Positives = 235/484 (48%), Gaps = 35/484 (7%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 + +AV + ISGLDD FI +V + + R+P S +L E+P+AI VP W+ Sbjct: 1 MPVAVWILISGLDDLFITMVGFA-------TSRVRFPWPSSGDLKSAAEQPIAIFVPLWH 53 Query: 80 ETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 E VIG M E + Y NYH+F G YPND T R V+ A P +H +C GPTS Sbjct: 54 EHRVIGRMLEHNLAAVRYGNYHVFAGVYPNDTPTLRAVELQAAVHPKIHTAICPHDGPTS 113 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPF 199 K DCLN + + +E F +LHDAED+I P LRL N+ +++Q+PV P Sbjct: 114 KGDCLNWIYQHMRAWEARHGTRFRVVVLHDAEDLIDPESLRLINWFSRDYEMVQVPVLPL 173 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 +T Y DEF+E KD+PVR+ L G +PS GVGT F R A+ L +G F Sbjct: 174 ATAVKEWTHGLYCDEFAEYQRKDIPVRQQLGGFLPSNGVGTGFGRDALERLADGRNGRPF 233 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDT 319 D LTEDY+ G+ L E G +IF R + RE+FP Sbjct: 234 DPACLTEDYETGYLLHELGCRQIF----------------LPVRFRENGPTATREFFPRG 277 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLL 379 A+ Q++RW+ GI Q ++ H W L+ Y+ WRDRKG I N +S A L+ + Sbjct: 278 ARAAISQRTRWVTGIALQSWERHGWRVPLSQLYWFWRDRKGLIGNLLSPAANLLFLYGAG 337 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 A+ + P AWH S WL L + + R YG L Sbjct: 338 SYAFSTGHPSAWHLGSHIP--PWLAGSCRLTLAIAALQTGVRARSAALIYGWKFAAGVPL 395 Query: 440 RLFWGNLINFMANWRALKQVLQHGDPRR----VAWDKTTHDFPSVTGDTRSLRPLGQILL 495 R+ WGNL+NF A AL + G+ RR +AW KT H +P+ + Q LL Sbjct: 396 RMVWGNLVNFAATAMALWEF---GNSRRRGGGLAWRKTDHMYPTALATSGVR---YQPLL 449 Query: 496 ENQV 499 +N + Sbjct: 450 KNPI 453 >UniRef50_Q1N778 Bacteriophage N4 adsorption protein B n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1N778_9SPHN Length = 470 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. Identities = 176/483 (36%), Positives = 237/483 (49%), Gaps = 25/483 (5%) Query: 5 LDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL- 63 A + I + AV + I GLDD ID+ Y+ R+ R + +Y R+ RM+ EL Sbjct: 8 AGTAALLVAVHHEILLFAAVGLAIGGLDDLLIDIFYFGRKAWRDIVIYARHQRMTGPELP 67 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 + +A+ VPAW E+ VI M A + Y IFVG YPND T V V Sbjct: 68 HSRRPGKIAVFVPAWQESNVIAAMLNHARDSWGEARYRIFVGVYPNDDATIDAVANVACD 127 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN 183 + + R GPT+KADCLN + A+ E +F + ILHDAEDV+ E+RLF+ Sbjct: 128 ATWLTLCINDRAGPTTKADCLNLLWRAMRAEEEQGDFRYKAIILHDAEDVVHADEIRLFD 187 Query: 184 YLVERKDLIQIPVYPFERE---WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 ++V+R DL+Q+PV P W + Y DEF+E HGK + VREAL VPSAGV Sbjct: 188 FMVDRFDLVQLPVLPLRGRGGWWRRAIADHYGDEFAESHGKLLSVREALGASVPSAGVAC 247 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F R + AL +D FD SLTEDY+ G R+++ G FVR Sbjct: 248 AFERDMLAALASDEATGPFDPGSLTEDYEAGLRIRDMGGRSAFVRM-------------- 293 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 N+I RE+FPD+ AVRQK+RW IGI G+ W + RDR+ Sbjct: 294 --RDAYGNIIATREFFPDSIDAAVRQKARWTIGIALAGWDRLGWRGGPAEFWMRLRDRRA 351 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 ++ V F A L ++ L + P LS L LLW N LM+ R+ Sbjct: 352 VLAALVLFAAYLTLVLWATLALLALVIPFPARPLS-----PALTGLLWFNLFLMLWRMAM 406 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 R +FV YGL GL +V R N I +A RA+ L+ + + WDKT H FP + Sbjct: 407 RFLFVARAYGLRAGLGAVPRTLIANYIGILAARRAIFLYLRSLAGQPLRWDKTQHRFPDL 466 Query: 481 TGD 483 D Sbjct: 467 KTD 469 >UniRef50_B8JA61 Bacteriophage N4 adsorption protein B n=2 Tax=Anaeromyxobacter RepID=B8JA61_ANAD2 Length = 502 Score = 446 bits (1146), Expect = e-123, Method: Composition-based stats. Identities = 166/469 (35%), Positives = 245/469 (52%), Gaps = 14/469 (2%) Query: 16 KVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMV 75 +V+A L + ++ LD+ FIDV Y RR+ R+ + +S L + + K AI++ Sbjct: 9 RVMAGPLGGAILLNQLDELFIDVNYLARRLHRRSATA-----VSAALLRRVEPKRTAILL 63 Query: 76 PAWNETGVIGNMAELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 PAW E VI M EL + +D+ Y F GTY NDP TQ VD AR V KVV Sbjct: 64 PAWREEDVIERMLELNVSRIDFPRDRYVFFCGTYQNDPATQARVDRAAARGWPVRKVVVP 123 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQ 193 GPTSKADCLN + + ER F ++HDAEDVI P+ LRL++ LV + + +Q Sbjct: 124 HAGPTSKADCLNWIYQGVVLHERERGTRFDILLMHDAEDVIHPLALRLYSLLVPKHEFVQ 183 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 PV+ + + + TYIDEF+E H K++PVR+A+ G +PSAGVG+ F RRA + Sbjct: 184 TPVFSLPLDASQVVAGTYIDEFAEHHLKELPVRQAIGGLIPSAGVGSAFERRAFEQIALA 243 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 FD SLTEDY+IG R + F + + + + E + I R Sbjct: 244 HAQQPFDPASLTEDYEIGLRFRLARRRTHFACYRIAADPDDPEAPAH------DDPIATR 297 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 EYFPD F +VRQ+SRWI+GI Q +++ W + Y LWRDRK ++N + L+ + Sbjct: 298 EYFPDRFQASVRQRSRWILGISLQTWESAGWQGPAAVRYCLWRDRKAVLTNALLALSYAL 357 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 + +++ + + +W I + LL +N + R+ ++ FV YG Sbjct: 358 LAYVVVRVWTAGMTGASWSPARIVPAGGLIQALLLVNLAGFLLRVGVKMGFVGRLYGARL 417 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQH-GDPRRVAWDKTTHDFPSVT 481 L + RL N+I+ A RA+ ++H + W KT+H FPS Sbjct: 418 ATLCLPRLLVANVISLAATARAVVTYVRHLVTGEPLRWVKTSHAFPSAE 466 >UniRef50_C8WHH7 General secretory system II protein E domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WHH7_EGGLE Length = 711 Score = 444 bits (1143), Expect = e-123, Method: Composition-based stats. Identities = 187/668 (27%), Positives = 271/668 (40%), Gaps = 90/668 (13%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 I +A+ + G DD DV R +K R+ + + K LA+++ A Sbjct: 9 IGFFVALAFIVFGADDVLWDVFALFRGTGKK--------RVKLSLINEKPPKMLAVVIAA 60 Query: 78 WNETGVIGNMAELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPN-VHKVVCAR 134 W+E V+G + + + Y Y +F+G YPND T + R V VV Sbjct: 61 WHEDAVLGEVVDNLVASAQYPRSLYRVFLGVYPNDAATVAVARALEVRHGGTVVCVVGDD 120 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQI 194 PGPTSKA +N+ + AI ++E + FA +HDAEDV+ P E ++ NYL++ D +Q Sbjct: 121 PGPTSKAANINHTVRAIREYEAERDVRFASVTIHDAEDVVHPNEFKMTNYLIDDYDALQF 180 Query: 195 PVYPFERE------WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 PV+P +R + TS TY DEF+E H + + +R+ L G VPSAG G RR + Sbjct: 181 PVFPLQRMPRLRLFFKTLTSSTYADEFAEHHFRTMVMRDEL-GFVPSAGTGFAIGRRVLD 239 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 A D SLTEDY + L+ +G +V V R + + Sbjct: 240 AF---RDEDLLPRNSLTEDYKLSLTLRMRGFRVHYVLEKV--------PRVDARGRTVWD 288 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNY----FLWRDRKGAISN 364 I R FP TF AVRQK+RW+ GI Q L + FL++ K +N Sbjct: 289 YIATRSLFPSTFKAAVRQKARWVYGITMQSASMADVFGKSELTFAERTFLYKGLKAKFAN 348 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF 424 FV V+ L+ L + S S W+ L +MV R V R Sbjct: 349 FVLLPGYAVLAYFLVQTFAPQLELPVM--YPLHSPSWWMCVFLLF---MMVERQVLRGRA 403 Query: 425 VTGYYGLTQGLLSV-------LRLFWGNLINFMANWRALKQVLQH--------------- 462 + YG S+ LRL WGNLIN A +RA +Q + + Sbjct: 404 LANVYGWKTMAFSILLPPLFPLRLLWGNLINMCATFRAWRQKIAYVLLRGREAKAAAAPV 463 Query: 463 -----------------------------GDPRRVAWDKTTHDFPSVTGDTRSLRPLGQI 493 AW+KT H+F + R R LG Sbjct: 464 VEHRGNAAEEEGERKPATDGDEAQTSNATSAQEGPAWNKTDHEFLPASVLERYRRLLGDA 523 Query: 494 LLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 LLE + L+ A+ R G+RLG +L QGL+ L QA A Q + Sbjct: 524 LLERGFVEPGHLEDAVGSARARGVRLGQELLRQGLVEERHLTQAYALQQQSMYVRAQPDL 583 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLR 612 + L+ MP + A +A LPL IV +D + L +G ++ Sbjct: 584 VLLELMDRMPFAAADRFAALPLVESEKGWIVAVDDDLSCAERDELAFLLGEPTFFLFSST 643 Query: 613 GQIVTGLR 620 ++ Sbjct: 644 ADLLEAFE 651 >UniRef50_Q1IXI4 Glycosyltransferase, NfrB-like protein n=2 Tax=Deinococci RepID=Q1IXI4_DEIGD Length = 670 Score = 416 bits (1068), Expect = e-114, Method: Composition-based stats. Identities = 178/602 (29%), Positives = 257/602 (42%), Gaps = 60/602 (9%) Query: 57 RMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHI--FVGTYPNDPDTQ 114 R+ +L + LA+M+ AW E GV+ M E + Y + FVG YPND T Sbjct: 46 RLRPADLQQDHPSHLAVMIGAWQEAGVVTPMIESTLRLMHYPASRVEFFVGVYPNDLATL 105 Query: 115 RDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVI 174 +V + RFPNVH VV RPGPTSK+ LN V AI E F +HDAEDVI Sbjct: 106 PEVQALAERFPNVHCVVNERPGPTSKSQNLNGVYAAIKAHEARTGKPFDVIAVHDAEDVI 165 Query: 175 SPMELRLFNYLVERKDLIQIPV---YPFEREW------------THFTSMTYIDEFSELH 219 P +L++ L++R ++Q+PV +P R W + +Y DEF+E H Sbjct: 166 HPYTFQLYSTLLKRWKMVQLPVFALFPRGRAWGAGLRGLLRHLTGQIVTGSYADEFAEHH 225 Query: 220 GKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGM 279 + +P REAL +PSAG G R + L + DG +L EDY++ RL +G+ Sbjct: 226 LRHLPAREALGLFLPSAGTGFAMRREVMALL--EEDGQVLTEGALAEDYELALRLWRRGV 283 Query: 280 TEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGF 339 F P+ R Q + + VREYFP A+RQK RW GI Q Sbjct: 284 RVHFHVQPL--------PRLDTQGKLGRDYVAVREYFPTEVQAAIRQKGRWTYGITLQTP 335 Query: 340 K-THKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFS 398 +L LW D+KG +N + L + + LLL + Sbjct: 336 HRLRGLRLNLRDRLTLWHDQKGKYTNLIHLLGYPLSLTLLLAPLFGF----------HLQ 385 Query: 399 GSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLS-------VLRLFWGNLINFMA 451 ++ LL G+ R++ R V YGL Q L++ LR GN+IN +A Sbjct: 386 SNSLTRDLLLGVLGVTGWRMLMRAGAVGRIYGLRQALIATLCLPGLPLRWLAGNVINTLA 445 Query: 452 NWRALKQVL--QHGDPRRVA-WDKTTHDFPSVTGDTRS-LRPLGQILLENQVITEEQLDT 507 RA + L + G R A WDKT ++ R LG L + E +L Sbjct: 446 TLRAWRLFLFPERGQKRGTARWDKTERKAYVPDEVLQAVRRRLGDQWLFTGALRERELAR 505 Query: 508 ALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVA 566 LR R RLG + Q L+ Q+ ++LA+ G+ + ++ + ++ A A Sbjct: 506 LLRVQRRAAARLGQLAVQQALVDEAQVRRSLAQTQGLMYLNLTPEMLDHRFLS---AEQA 562 Query: 567 LHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG-------RKVRYVIVLRGQIVTGL 619 V L D L+V S + P AL R++ + R + Sbjct: 563 QRLDVAILGKRGDRLLVASPHAVSPERCEALLRELRFCLRAPELPITVYATSRQSLRAAY 622 Query: 620 RH 621 R Sbjct: 623 RR 624 >UniRef50_D2LI28 General secretion pathway protein E n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LI28_RHOVA Length = 450 Score = 406 bits (1044), Expect = e-111, Method: Composition-based stats. Identities = 134/481 (27%), Positives = 214/481 (44%), Gaps = 48/481 (9%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP------ 70 + + +++++ +S DD F+D+ LSV E P EKP Sbjct: 7 TLMLFISILINVSSFDDAFVDL----------LSVGIIRGNFGPPEDPSP-EKPTSSAIP 55 Query: 71 -LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVH 128 + + V W E V+G M E + + +++G YPND T + + A++P V Sbjct: 56 DIGVFVANWQEEDVLGRMVEGNLARIPISSVKLYLGVYPNDTGTLAVAEAMAAQYPDRVR 115 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF-NYLVE 187 +V + GPTSK LN + + + A +LHD+ED+I P ++ Y E Sbjct: 116 VIVNSMEGPTSKGQMLNEMFRQVYARPGAPEMA----VLHDSEDIIDPRTFGVYTAYARE 171 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 D IQ+PV+ + TY+DEF+E H +++ VR A+ +PSAGVGTC +RR + Sbjct: 172 GYDFIQVPVFSLNSIKRSKVAATYMDEFAERHTREMVVRHAVGAMIPSAGVGTCMTRRLL 231 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 + + G +TEDY +G K G F R Sbjct: 232 EHFVRER-GFVLANGCVTEDYILGVEAKRAGFRSAFAAVSA-------------DELRGL 277 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVS 367 + + REYFP +FS +V+QK+RW+ GI F+ W YF RDRKGAI+NF+ Sbjct: 278 DFVATREYFPKSFSASVKQKTRWVYGINFEATHKLGWGGDFWDKYFFMRDRKGAITNFLP 337 Query: 368 FLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTG 427 ++++ + L + + + IF S ++LN + R + RVI Sbjct: 338 PISLVFWVLLAFEVV--DIEQMPLDLVPIFQVS------IFLNMLALGLRYLMRVICCRD 389 Query: 428 YYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGD--PRRVAWDKTTHDFPSVTGDTR 485 YG+ + +R +N +A WRA K + + + + W KT H+ P R Sbjct: 390 VYGINDFIGVAVRWPVSLTVNMLAVWRAWKTYVGESEYATKPIVWSKTEHELPDDLMSAR 449 Query: 486 S 486 Sbjct: 450 R 450 >UniRef50_B8ICV2 General secretion pathway protein E n=2 Tax=Methylobacterium RepID=B8ICV2_METNO Length = 442 Score = 405 bits (1041), Expect = e-111, Method: Composition-based stats. Identities = 133/459 (28%), Positives = 211/459 (45%), Gaps = 40/459 (8%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIG 85 + +S LDD FID++ + + + P ++ R D +A+ V W+E V+G Sbjct: 16 INVSSLDDAFIDIIAFG-------ILRKGLPGLAERT----DIPRIAVFVANWHEEEVLG 64 Query: 86 NMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVHKVVCARPGPTSKADCL 144 M E + Y + +F+G YPND T R E+ A++P V ++ GPTSK L Sbjct: 65 KMVEGNLARIPYPSVSLFLGVYPNDTGTLRVAKELEAKYPDRVTVIINTLNGPTSKGQML 124 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWT 204 N + + + E + A +LHD+EDVI P ++ + D IQ+PV+ R Sbjct: 125 NEMFQQVFEREDCPDIA----VLHDSEDVIDPRTFPIYAQYSQDHDFIQVPVFSLSRGKG 180 Query: 205 HFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSL 264 + TY+DEF+E H +++ VR A+ +PSAGVGT +++ + LA G ++ Sbjct: 181 LPVASTYMDEFAERHTREMIVRNAVGAAIPSAGVGTAMTKKLLKYFLA-TRGQVLMSGTV 239 Query: 265 TEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAV 324 TEDY +G K G A N + RE+FP T + ++ Sbjct: 240 TEDYILGVEAKRAG-------------FSAAFAAVSADDASGLNYVATREFFPKTLAASI 286 Query: 325 RQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYE 384 +QK+RW+ GI F+ W + YF RDRKG I+NF+ ++ + ++ ++L L Sbjct: 287 KQKTRWVYGINFEATHKLGWEGNAWDKYFFVRDRKGIITNFLPPVSFVFLVLIVLGLIDP 346 Query: 385 SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWG 444 S PD + ++LN ++ R RV+ YG + R G Sbjct: 347 SEMPDPIE--------PVFVASIYLNLAALIVRYTIRVVASHEVYGTYDLIGIAYRWPIG 398 Query: 445 NLINFMANWRALKQVLQHGD--PRRVAWDKTTHDFPSVT 481 IN A +RA K + + + W KTTHD P Sbjct: 399 LYINAAAVFRAWKTYIGESQFATKPIVWSKTTHDLPENF 437 >UniRef50_Q2G4Z3 Bacteriophage N4 adsorption protein B n=2 Tax=Sphingomonadales RepID=Q2G4Z3_NOVAD Length = 488 Score = 399 bits (1026), Expect = e-109, Method: Composition-based stats. Identities = 150/499 (30%), Positives = 226/499 (45%), Gaps = 39/499 (7%) Query: 3 WLLDVFATWLYGLKV-IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 WL D WL L+ + + AV I D+ +D ++ + ++L+ +++ Sbjct: 4 WLADSAYQWLAVLEHELLLFAAVWFAIGAADELVMDGIW----LWQRLTGAGPTGQLAGN 59 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 K A+ VPAW E+ VIG M E+ I+VG Y ND +T + + Sbjct: 60 GRDKL-SSMAAVFVPAWRESAVIGPMVAHCLAVWPQEDLRIYVGCYRNDQETLNALT-IV 117 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL 181 + P V VV R GPT+KADCLN + A+ Q ER + +LHDAED++ P L L Sbjct: 118 SEDPRVRVVVHDRDGPTTKADCLNRLYLAMRQDERRSGQRIGFIVLHDAEDMVHPAALAL 177 Query: 182 FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 + ++ D +Q+PV P + + + + Y DEF+E H +D+ VR+ + +PSAGVG Sbjct: 178 MDRALDTVDFVQLPVRPEPQASSPWVAGHYCDEFAEAHARDMVVRDHIGAGLPSAGVGCA 237 Query: 242 FSRRAVTALLADGDGI-AFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 FSR A+ ++A G F LTEDY+ G + E G F+R Sbjct: 238 FSRAAIERIVAVRGGALPFAADCLTEDYEAGMLVAETGGRSRFIRV-------------- 283 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 ++ RE+FPD + +VRQK+RW+ GI FQG+ W S + RDR+G Sbjct: 284 --RDARGELVATREFFPDGLAASVRQKTRWVHGIAFQGWDRLGWNRSAGDLWMRLRDRRG 341 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 + V A L + ++ E F+ L LL N ++ R+V Sbjct: 342 PLVALVLLAAYLALPLWPIVRFGEMAG-----FVVPVPPGPVLKGLLAFNLCSLIWRLVV 396 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS- 479 R +F YG +G+ SV R GN+I MA RA ++ + WD T H Sbjct: 397 RALFTGSEYGWIEGVRSVFRFPVGNIIAIMAARRAAVAYVRVLFGGALTWDHTLHCAHPV 456 Query: 480 ---------VTGDTRSLRP 489 + R RP Sbjct: 457 QAGVGLASHASSTQRPRRP 475 >UniRef50_Q1GVK0 Bacteriophage N4 adsorption protein B n=1 Tax=Sphingopyxis alaskensis RepID=Q1GVK0_SPHAL Length = 483 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. Identities = 136/447 (30%), Positives = 192/447 (42%), Gaps = 33/447 (7%) Query: 54 RYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDT 113 R R P E +AI VPAW+E + M D E++ ++VG YPND T Sbjct: 54 RGQRRGETARAPPIEGRIAIFVPAWDEAAALPAMLCRTLAAWDGEDFRLYVGCYPNDTAT 113 Query: 114 QRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDV 173 V ++ AR + V+ GPT+K D LN + A+ ER FA +LHDAED Sbjct: 114 IYAVSQLVARDARLRLVIGESEGPTTKGDNLNRLWAALCADERVEARRFAAIVLHDAEDH 173 Query: 174 ISPMELRLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 + EL L+ + ++QIPV P + Y DEF+E HGKD+PVR L + Sbjct: 174 VHRHELALYRQHLAHNAMVQIPVVPIIDRRARWIGGHYADEFAEAHGKDMPVRSRLGLPL 233 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 PSAGVG +R A+ L + G F SLTEDY+IG + G+ FV Sbjct: 234 PSAGVGCALTRSALALLAMERGGCPFSSDSLTEDYEIGMVIGAYGLGARFVD-------- 285 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSS------ 347 + I R FP AVRQKSRWI GI G+ W Sbjct: 286 --------AADPAGDRIVSRGAFPGRIDAAVRQKSRWIAGIAMAGWDHLGWPGCRLGHKQ 337 Query: 348 ------LTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSA 401 L + LWRDR+ ++ + A +I + +A + L + Sbjct: 338 RSTGRDLLARWMLWRDRRAPLAALILLAAYAGLILVAAGVAGQLLLGW-----NAIEPGP 392 Query: 402 WLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQ 461 L LL +N L+ R+ R+ F +G + +V R F N+I +A R++ + Sbjct: 393 TLQWLLVVNALLLGWRMALRIHFTARLHGWREASFAVPRAFVANIIAMLAARRSVLLYWR 452 Query: 462 HGDPRRVAWDKTTHDFPSVTGDTRSLR 488 V WDKT H + +R Sbjct: 453 ILRSGEVVWDKTDHSETGLAVADAPVR 479 >UniRef50_Q0ASE8 Glycosyl transferase, family 2 n=1 Tax=Maricaulis maris MCS10 RepID=Q0ASE8_MARMM Length = 537 Score = 264 bits (675), Expect = 9e-69, Method: Composition-based stats. Identities = 95/452 (21%), Positives = 164/452 (36%), Gaps = 70/452 (15%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDY 97 ++ + R +V + + D IMV + E V+ ++A ++DY Sbjct: 103 ALFTGLIVLRLAAVLAKPAWLDAPGCADGDLPTATIMVALYREAAVLPDLARG-LASIDY 161 Query: 98 ENYHI--FVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFE 155 + + +D +T + A P ++ P +K LN L Sbjct: 162 PTDRVAFKLVLEADDTETIHVARRM-ALDPRFEIIIVPPGNPRTKPRALNYALRLC---- 216 Query: 156 RSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTY 211 +HDAED P +LR F +R +Q P+ + RE T T + Sbjct: 217 -----RSELVTIHDAEDRPDPYQLRRAAEAFRVADQRLACVQAPLNWYNREETWLTRQ-F 270 Query: 212 IDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIG 271 E++ +P+ + L +P G F R A+ + +D ++TED D+G Sbjct: 271 ALEYAAHFHALLPLYQRLGWPLPLGGTSNHFRRDALVRV------GGWDAWNVTEDADLG 324 Query: 272 FRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWI 331 RL G + ++EA R Q R ++ Y AVR+ + W Sbjct: 325 LRLHAAGYRCGLIEPKTLEEAPLRLVPWVKQRTR-----WIKGYAQTIGVLAVRRDTPW- 378 Query: 332 IGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV--SFLAMLVMIQLLLLLAYESLWPD 389 W L L GA+++ + + L++ +I L E+ Sbjct: 379 ---------RRVWPGMLVLG--------GAVASALLHAPLSLACLIALATRAGPEAASLP 421 Query: 390 AWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINF 449 A F+ SA + + + RI L+ + +W L+ Sbjct: 422 ALAFMLAGYASAITCAAVAMRRAGLPVRIRD---------------LAGMPAYW--LLQT 464 Query: 450 MANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 +A RAL+Q+L DP R W+KT H ++T Sbjct: 465 LAAARALRQLLT--DPHR--WEKTEHGVSAMT 492 >UniRef50_A9DFB5 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DFB5_9RHIZ Length = 682 Score = 263 bits (673), Expect = 1e-68, Method: Composition-based stats. Identities = 99/510 (19%), Positives = 176/510 (34%), Gaps = 80/510 (15%) Query: 12 LYGLKVIAITLAVIMFISGLDDF--FIDVVYWVRRIKRKLSVY---RRYPRMSYRELYKP 66 L + + I +I +S L + +++ + R ++ RR + + Sbjct: 212 LALMLCLCIAAFLIWPLSALTVLHVVLTMLFSAGILLRLSALAMALRRPDTVRGGSDIET 271 Query: 67 DEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP 125 +++P ++E + + + A + D T ++ P Sbjct: 272 RMPIYTLLIPLYDEAAMAPALVARIDALRWPKSLLDVKYICEAGDEATIEALEAQ-DLGP 330 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RL 181 V GP +K L L + ++DAED +P +L Sbjct: 331 ECEIVRVPAFGPRTKPKALQYALR---------GARGSLIAVYDAEDKPAPGQLLEAWAT 381 Query: 182 FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 F ++ +Q P+ +++ S + E+S L +P +P G Sbjct: 382 FRAGDDQLGCLQAPLAVANLR-SNWISGLFALEYSGLFRVLIPFLARTGMPIPLGGTSNH 440 Query: 242 FSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFL 301 F R A+ + +D ++TED D+G RL G ++ Sbjct: 441 FKRAALE------NTGGWDPHNVTEDADLGLRLHAYGYRTGILK---------------- 478 Query: 302 QHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-RKG 360 E P RQ++RW+ G W ++ W Sbjct: 479 --------CATVESCPVQLDVWKRQRTRWLKGWA------QTWLVAMRNPVATWSSLGPA 524 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWL-----MTLLWLNFGLMV 415 A F +A +++ L+ L Y + A+ F I SG A + LLW++ V Sbjct: 525 AFVVFQLLIAGMLISALVHPLMYLFI---AFSFAWIASGHASVVSDMHAVLLWMD---AV 578 Query: 416 NRIVQRVIF-VTGYYGLTQGLLSVLRLFWGNLIN------FMANWRALKQVLQHGDPRRV 468 N ++F TG++ T S L+ W +I +A WRAL Q+L + Sbjct: 579 NIFGNYLLFPATGWFAFTAYERSHLKRHWLLMIPAYWLLISLAGWRALTQLLANAH---- 634 Query: 469 AWDKTTHDFPSVTGDTRSLRPLGQILLENQ 498 W+KT HD S + P + + ENQ Sbjct: 635 LWEKTPHDAESSAKTQQDPLPTCEAVPENQ 664 >UniRef50_Q0G6A6 Glycosyl transferase, family 2 n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0G6A6_9RHIZ Length = 692 Score = 260 bits (664), Expect = 1e-67, Method: Composition-based stats. Identities = 89/499 (17%), Positives = 163/499 (32%), Gaps = 72/499 (14%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 W++ F +++ ++ V + R+ S + P Sbjct: 238 WVMSSFWQVAMTFHLLSAFCFILWIC------LRSVAAFGRQDGTIASELKSGPNKGGGT 291 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 P +++V ET V+ + L +F+ +D T + Sbjct: 292 RPAPI---YSVVVALHKETEVVARLVSALDNLKWPKSCLEVFLVCEADDHATVALCRKHT 348 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR- 180 ++ P +K LN+VL + A +L+DAED P +L Sbjct: 349 EGKLQYRVILVPPGNPRTKPKALNHVLPIV---------AGDFLVLYDAEDEPHPGQLEE 399 Query: 181 ---LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 + R +Q P+ + TS+ + E++ L +P +P G Sbjct: 400 AYDRYRASDARLACLQAPLVIRNGDRNWLTSI-FAMEYAGLFRAFLPWLARHRLPIPLGG 458 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F + + +D ++TED D+G RLK G + P +++A Sbjct: 459 TSNHFKVAVLREV------GGWDSHNVTEDADLGMRLKRAGYDIETISSPTLEDA----- 507 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 P+T S V Q++RW+ G V + + L R+ Sbjct: 508 -------------------PETVSVWVPQRTRWLKGWV------QTYAVHMRHPMLLMRE 542 Query: 358 RKGA-ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 F ++ LLL LA+ + W S LL L+ + V Sbjct: 543 LGVKRFVVFQLLFHGMITAALLLPLAFGLIGFTIWLQWSTGWERTSATALLVLDLAIFVG 602 Query: 417 RIVQRVIFVTGYYGLTQG-----LLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWD 471 + + G ++ L + ++W L +A +R L Q+ ++ AW+ Sbjct: 603 GYLSFLALTLRGMGTSELRPCVKWLPFVPIYW--LCVSVAAYRGLFQLFKN----PHAWE 656 Query: 472 KTTHDFPSVTGDTRSLRPL 490 KT H S + Sbjct: 657 KTAHGLASRADKLDPRHRM 675 >UniRef50_Q1YKP1 Putative glycosyl transferase n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YKP1_MOBAS Length = 681 Score = 258 bits (660), Expect = 4e-67, Method: Composition-based stats. Identities = 98/441 (22%), Positives = 163/441 (36%), Gaps = 85/441 (19%) Query: 59 SYRELYKPDEKPLAIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDV 117 ++ +++V + ETGVI + L+ I + +DP T + Sbjct: 284 QGADVEAGPLPVYSVLVALYQETGVIERLVASLSRLDWPTSRIEIKLVCEADDPATIGEA 343 Query: 118 DEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 A P V P +K LN VL + + E L+DAED P Sbjct: 344 RRATAGLPQFEIVAVPPGEPRTKPKALNFVLP-LCRGE--------FVALYDAEDEPDPG 394 Query: 178 ELRL----FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 +LR F +Q P+ + T + + E++ L + +P + Sbjct: 395 QLREAFHGFRNGPGDLACLQAPLVVRNGDQNWLTGL-FALEYAALFRRLLPWLARRRLPL 453 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 P G F R +TA+ A+D ++TED D+G RL +G + P +++A Sbjct: 454 PLGGTSNHFRRHCLTAV------GAWDSHNVTEDADLGMRLYREGWKIGTLTRPTLEDA- 506 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF 353 P+ + RQ++RW G + W + Sbjct: 507 -----------------------PERWPVWYRQRTRWTKGWL------QTWLVHMRQPRR 537 Query: 354 LWRDRKGAISN------FVSFLAMLVMIQLLLLLAYESLWPDAWHFLS-----------I 396 LWR+ G IS FV LA ++ + L+L +L H L + Sbjct: 538 LWREL-GPISFAVFQMLFVGMLASALIQPVFLVLVLSTLVSALNHGLPGGLAGMIFALDL 596 Query: 397 FSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRAL 456 F+ + + L+ + R +R + YY L L+W L+ +A+ RA+ Sbjct: 597 FNATGGFFAFVALSLPAL--RPEERAT-LPKYYALVH-------LYW--LLIALASLRAV 644 Query: 457 KQVLQHGDPRRVAWDKTTHDF 477 Q+ + DP R W+KT HD Sbjct: 645 CQLAR--DPHR--WEKTHHDL 661 >UniRef50_C4DGM8 Glycosyl transferase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DGM8_9ACTO Length = 476 Score = 258 bits (659), Expect = 5e-67, Method: Composition-based stats. Identities = 93/435 (21%), Positives = 160/435 (36%), Gaps = 77/435 (17%) Query: 70 PLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 P ++VP + E V+ + L+A + I + +D +T D CA P Sbjct: 98 PYTVLVPLYREATVLPTLVSRLSALDYPRDRLQILLLIEADDAETL-DAAVTCATDPRFE 156 Query: 129 KVVCARPGPTSKADCLN-NVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFN 183 VV P +K N + A+ +F +++DAED P +LR F Sbjct: 157 IVVIPDSVPKTKPKACNIGLARAVGEF----------CVIYDAEDRPDPDQLRKAALAFR 206 Query: 184 YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 +R +Q + + WT++ + + E++ + + +P G F Sbjct: 207 LSPQRVVCVQAELQYWN-PWTNWLTRCFAAEYATNFSMTLHGMDRYRLAIPLGGTSNHFR 265 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 A+ AL +D ++TED D+G R+ +G + +EA R Sbjct: 266 TDALRAL------GGWDPYNVTEDADLGIRIARRGWDVRMMVSVTEEEANAR-------- 311 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD----RK 359 +RQ+SRWI G + W Y LWR+ R Sbjct: 312 ----------------LGNWLRQRSRWIKGYL------QTWLVHSRRPYRLWREVGTRRS 349 Query: 360 GAISNFVSFLA--------MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNF 411 A+ + F M M L L++ + L P + A L+ + + Sbjct: 350 LAVHLTLGFATVTTLVNPVMWAMTILYLIVGPQPLEPLFPKYNLYGGVIAMLLGNALMCY 409 Query: 412 GLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWD 471 LM+ V+R +F LT + L+WG + +A ++AL Q+L+ +R W+ Sbjct: 410 TLMLG-CVRRGLFAAVRVMLT------IPLYWGLM--SLAAYKALIQLLRPS--KRHYWE 458 Query: 472 KTTHDFPSVTGDTRS 486 T H + Sbjct: 459 LTEHGLVRSEDTVET 473 >UniRef50_A1SEL1 General secretory system II, protein E domain protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SEL1_NOCSJ Length = 606 Score = 258 bits (658), Expect = 7e-67, Method: Composition-based stats. Identities = 100/484 (20%), Positives = 175/484 (36%), Gaps = 82/484 (16%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL---AIM 74 + +AV+ S L + Y R R L + ++ E+ DE+ L I+ Sbjct: 186 METAIAVVAACSLLY--LVVSFYKFRLTLRALGTHLET-DVTDEEIAAIDERHLPTYTIL 242 Query: 75 VPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 VP + E G++ + + A + + +D +T + + ++ P+ H VV Sbjct: 243 VPLYKEAGIVPRLVRDINALDYPRTRLDVKLLCEEDDEETVQRIRDL-QLPPHFHLVVVP 301 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERK 189 P +K N L T ++ DAED P +L+ F+ + E Sbjct: 302 DSQPKTKPKACNYGLQLATG---------DYCVIFDAEDRPDPDQLKKAIIAFSRVPENV 352 Query: 190 DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTA 249 +Q + F ++ T+ + +E+S +P A +P G F + Sbjct: 353 VCVQAKLNHFNQDQNMLTA-WFANEYSMHFELVLPAMGAAESPIPLGGTSNHFVTAKLRE 411 Query: 250 LLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNM 309 L A+D ++TED D+G RL +G + ++EA Sbjct: 412 L------GAWDPFNVTEDADLGIRLHREGYRTAMIDSTTLEEAN---------------- 449 Query: 310 ICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFL 369 +RQ+SRW G + W + + L + + F+SF Sbjct: 450 --------SQVPNWIRQRSRWNKGYI------QTWLVHMRAPFALLS--QTGLKGFLSF- 492 Query: 370 AMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLL--WLNFGLMVNRIVQRVIFV-- 425 L M +LL W A L +F+ + ++ L + + V +FV Sbjct: 493 -NLTMGSAFVLLLNPIFW--ALTTLYVFTQAGFIEQLFPGIIFYAASALLFVGNFVFVYL 549 Query: 426 -------TGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 G +GLT+ L + L+WG + +W A K +Q W+KT H Sbjct: 550 NVAGSLHRGEFGLTRTAL-LSPLYWG-----LMSWAAWKGFIQL-FTNPFYWEKTVHGLD 602 Query: 479 SVTG 482 G Sbjct: 603 EGHG 606 Score = 96.4 bits (238), Expect = 3e-18, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 62/147 (42%), Gaps = 1/147 (0%) Query: 492 QILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 Q+L + +IT++QL A L G LG ++ I+ + L AL+E + + Sbjct: 5 QMLTRSGLITDDQLQRAMLEYSRTGDPLGDILVSHEAITEDVLVAALSEMYQMQRVGLAG 64 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIV 610 + + +P +A + +P+ +D +++ +D A + +G R ++ Sbjct: 65 FTPDFEVARRLPERLAHTFQAVPVAATDDLVLLAVARPLDTEQAAEVEEALGSPFRQLLA 124 Query: 611 LRGQIVTGLRHWYARRRGHDPRAMLYN 637 R ++ ++ +AR +L Sbjct: 125 NRTELDQLVQRVHARHYAEVSTRLLME 151 >UniRef50_Q13CF3 Glycosyl transferase, family 2 n=10 Tax=Bradyrhizobiaceae RepID=Q13CF3_RHOPS Length = 686 Score = 257 bits (657), Expect = 9e-67, Method: Composition-based stats. Identities = 93/440 (21%), Positives = 154/440 (35%), Gaps = 71/440 (16%) Query: 55 YPRMSYRELYKPDEK--PLAIMVPAWN--ETGVIGNMAELAATTLDYENYHIFVGTYPND 110 +PR + R L + + P+ +V A + E V G +A + A E + + PND Sbjct: 266 WPRAAQRPLRRRPDATLPIYTVVAALHREERSVAGLVAAIEALDYPREKLDVILVIEPND 325 Query: 111 PDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDA 170 T+ + + R P++ ++ P +K LN L + ++DA Sbjct: 326 LATRAAIARLGPR-PHLRVLIAPPVAPQTKPKALNCALAFA---------RGSFIAVYDA 375 Query: 171 EDVISPMELRLFNYLVERKDLIQIPVYP---FEREWTHFTSMTYIDEFSELHGKDVPVRE 227 ED P +LR +R + + S T+ E++ + +P Sbjct: 376 EDQPEPGQLRAALDAFDRHGATTACAQASLCIDNITHSWLSRTFAAEYAGQFDRLLPGLS 435 Query: 228 ALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFP 287 + +P G F + A+ +D ++TED D+GFRL G + Sbjct: 436 EMNLPLPLGGTSNHFRTDVLRAI------GGWDPYNVTEDADLGFRLARFGYRSVSFAST 489 Query: 288 VVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSS 347 E P TF RQ++RW+ G + W Sbjct: 490 ------------------------TYEEAPITFDNWRRQRARWMKGFI------QTWLVH 519 Query: 348 LTLNYFLWRD--RKGAISNFVSFLAMLVMI---QLLLLLAYESLWPDAWHFLSIFSGSAW 402 + LWRD +G ++ + L+ L L +A SL AW L + Sbjct: 520 MRHPLRLWRDIGPRGVLALNLIVGGNLLTALVHPLFLGIALASL-AGAWLELPAVLQPSP 578 Query: 403 LMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL----RLFWGNLINFMANWRALKQ 458 L WL V V+ + G G Q L + +W + +A W A+ Q Sbjct: 579 PSPLHWLAIAAGYASTV--VVGLRGLAGRRQLRLGFVLLLTPAYW--ICLSIAAWCAVAQ 634 Query: 459 VLQHGDPRRVAWDKTTHDFP 478 + R W+KT H Sbjct: 635 FVW----RPYYWEKTVHGVA 650 >UniRef50_A3VPZ4 Putative uncharacterized protein n=2 Tax=Bacteria RepID=A3VPZ4_9PROT Length = 512 Score = 234 bits (596), Expect = 1e-59, Method: Composition-based stats. Identities = 89/471 (18%), Positives = 178/471 (37%), Gaps = 69/471 (14%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEK--PLAIMV 75 I L ++ ++ F +R LS+ + P+E+ P I+ Sbjct: 82 IIGPLQKLIALNLGVTLFYLFQAGLRSAVLSLSLAQPRRLTLPPPPIAPEERLPPFTILC 141 Query: 76 PAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 P ++E + ++ L E I + +D T C R P V+ Sbjct: 142 PVYDEAESLPHLVGSLLLLDYPRERLDIKIILEADDRATIAAARTHC-RAPMFDLVLVPP 200 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLFNYLVERKD 190 P +K LN+ L + ++ +++DAED +P +L R F L + Sbjct: 201 SAPRTKPKALNHAL-----WTAKGDY----IVIYDAEDRPAPDQLTLAARTFAALPDHIA 251 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 +Q + + R+ T T + + E++ L +P AL+ VP G + A+ Sbjct: 252 CLQCRLNYYNRDTTILTRL-FALEYALLFDMTLPGLAALSAPVPLGGTSNILRTDILMAV 310 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 +D ++TED D+G RL G + ++EA + Sbjct: 311 ------GGWDPFNVTEDADLGLRLHRAGYETRLLNSTTLEEATDETG------------- 351 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG-----AISNF 365 +RQ++RW+ G Q + H + T + + G ++ Sbjct: 352 -----------AWLRQRTRWMKGF-MQTWLVHSRRAPRTGRFGHFLTVHGVVGGTVLAAL 399 Query: 366 VSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 ++ +A + +L + + + L++ + +A+L L + +M+ + +R Sbjct: 400 INPVAWAIYGAWILGV--DGIARLFPTPLNVLALTAFLGGNLLHLYMMMIAPLRRR---- 453 Query: 426 TGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 ++ L + L+W ++ +A +RAL Q+++ R W+KT H Sbjct: 454 --WHDLVPYA-VLSPLYW--ILQSVAAYRALWQLIR----RPSYWEKTKHG 495 >UniRef50_D0B518 Glycosyl transferase, family 2 n=36 Tax=Brucellaceae RepID=D0B518_BRUME Length = 630 Score = 229 bits (584), Expect = 3e-58, Method: Composition-based stats. Identities = 82/455 (18%), Positives = 154/455 (33%), Gaps = 70/455 (15%) Query: 36 IDVVYWVRRIKRKLSVYRRYPRMSYREL--YKPDEKP-LAIMVPAWNETGVIGNMAE-LA 91 + + ++ + R + R+ + E+ +KP + P +I+VP + E V+ + L Sbjct: 205 MSLFFFGCVLIRLFAAASGK-RLQFTEIAPFKPRDLPVYSILVPLYREKDVVAQLIAALN 263 Query: 92 ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVHKVVCARPGPTSKADCLNNVLDA 150 I + +D +T + C P N V+ GP +K LN L Sbjct: 264 RLNWPRSKLDIKLVCEKDDYETIAAIR--CNTMPSNFELVLVPPGGPRTKPKALNYALQF 321 Query: 151 ITQFERSANFAFAGFILHDAEDVISPMEL----RLFNYLVERKDLIQIPVYPFEREWTHF 206 + DAED P +L + F + +Q P+ Sbjct: 322 A---------RGEIVAVFDAEDRPHPDQLLEAWQAFRRGGSKLACVQAPLIIGNFRRNLL 372 Query: 207 TSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTE 266 T M + E++ L +P +P G F R + + +D ++TE Sbjct: 373 TRM-FAFEYAVLFRGLLPWLARRGLVIPLGGTSNHFRRSCLEQV------GGWDAYNVTE 425 Query: 267 DYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQ 326 D D+G RL G + +++A P+ RQ Sbjct: 426 DADLGMRLARFGYRIDVISRGTIEDA------------------------PEEHGVWHRQ 461 Query: 327 KSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR---KGAISNFVSFLAMLVMIQLLLLLAY 383 ++RWI G + W WR+ + +S + + + L+L Sbjct: 462 RTRWIKGWM------QTWLVHGRQPMNTWRELGWWRFVVSQIYTLGIIGSALLHPLMLLM 515 Query: 384 ESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV--TGYYGLTQGLLSVLRL 441 + F + WL+ L +N + G L++ + Sbjct: 516 LAGLCLRMAFGPLTPQGLWLLALDVINILMAYMSFHMLGAKTMEPTELGGYAYFLAIP-I 574 Query: 442 FWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 +W ++ ++ WRA+ Q+++ W+KT H Sbjct: 575 YW--VLISLSAWRAVWQLVRQ----PHLWEKTPHQ 603 >UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaproteobacteria RepID=A5FY21_ACICJ Length = 642 Score = 227 bits (579), Expect = 1e-57, Method: Composition-based stats. Identities = 96/490 (19%), Positives = 166/490 (33%), Gaps = 77/490 (15%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRY----PRMSYRELYKPDEKPLAIM 74 L ++ +S L ++ R + + RR L D ++ Sbjct: 205 LTLLGIMATVSALY----ASLFAFRGVLTIVGSGRRTDISVNASELAALKDGDLPVFTVL 260 Query: 75 VPAWNETGVIGNMAELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 VP + E V+ + + + LDY + + D +T + A + Sbjct: 261 VPMYREAEVLPILVD-SIRRLDYPRAKLDVKLVLEAGDTETIEAAKALGAED-LFEIIRV 318 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVER 188 P +K N L ++ DAED P +LR LFN Sbjct: 319 PDSQPKTKPKACNYALRFA---------RGEYTVIFDAEDSPEPDQLRKVVALFNASGPE 369 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 +Q + F R+ T M + E+S+ +P L +P G F + Sbjct: 370 VACVQARLNYFNRDDNFLTRM-FTLEYSQWFDYLLPGLYRLNIPIPLGGTSNHFRTEVLH 428 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 L A+D ++TED D+G RL + G V +EA Sbjct: 429 EL------GAWDPYNVTEDADLGIRLTQAGYRVAVVNSTTFEEAN--------------- 467 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI-----S 363 + + Q+SRWI G + W + L+R R G + Sbjct: 468 ---------GVLHSWINQRSRWIKGYM------QTWLVHMRRPVELYR-RLGPVGFLGFH 511 Query: 364 NFVSFLAM-LVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLL-WLNFGLMVNRIVQR 421 F+ F M ++ LL ++ S+ F G ++ L + M Sbjct: 512 MFIGFPPMTALINPLLWIMFLVSVIVGRSAVAGFFPGPVLVLALFDLMVGNAMYVYFNIV 571 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 + +YGL + +W +++ +A ++AL Q++ + W+KTTH S T Sbjct: 572 AVAKRRWYGLVP-WGLLAPAYW--VLHSVAAYKALLQLITN----PHYWEKTTHGTSSRT 624 Query: 482 GDTRSLRPLG 491 +T + G Sbjct: 625 QETIASLKAG 634 Score = 57.1 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 66/152 (43%), Gaps = 7/152 (4%) Query: 492 QILLENQVITEEQLDTALRNRVEGLR--LGGSMLMQGLISAEQLAQALAEQNGVAWESID 549 L++ +T Q D A+ + R L + +++ + A+ L+ +G+ + Sbjct: 21 DRLVQAGSVTPLQRDEAI-HTAYAWRVSLPDVLSAMYRVTSLRWARELSAASGLPLVDLR 79 Query: 550 AWQIPSSLIAEMPASVALHYAVLPLRLENDELIV-GSEDGIDPVSLAAL--TRKVGRKVR 606 A +L+AE + L + +P R + D ++V + + DP ++ AL R G V Sbjct: 80 ASPCDHTLLAEAEHDLYLRHLFIPWRRQPDGVVVIATLNPEDP-AIRALMRERMPGCHVE 138 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNA 638 +VI + ++ ++ + R L+N Sbjct: 139 FVITSKFDLIWAVQRIFDPELSEAAREALFNR 170 >UniRef50_UPI0001B50A66 glycosyl transferase family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B50A66 Length = 439 Score = 220 bits (561), Expect = 1e-55, Method: Composition-based stats. Identities = 96/456 (21%), Positives = 162/456 (35%), Gaps = 82/456 (17%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLD 96 V+Y + + R + + + P ++++PA +E VI + E Sbjct: 32 VLYLMLYTWDRRDAER---KARAPDTFIPPRLSFSVLLPARHEEDVIQSTIERVVRADYP 88 Query: 97 YENYHIFVGTYPNDPDTQRDVDEVCARFPN--VH--KVVCARPGPTSKADCLNNVLDAIT 152 E +FV +D T + +E + +H +VV GP +K LN L Sbjct: 89 AELLEVFVICSQDDDGTVKKAEEKIDQLAREGLHNVRVVVFDDGPINKPHGLNTALPQ-- 146 Query: 153 QFERSANFAFAGFILHDAEDVISPMELRLFNYLV--ERKDLIQIPVYPFEREWTHFTSMT 210 A + DAED I P RL N ++ ER ++Q V + ++++ Sbjct: 147 -------TANKVVTIFDAEDDIHPKIFRLVNTVMVKERVRVVQAGVQLMNYQSNWYSTLN 199 Query: 211 YIDEF----SELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTE 266 ++ F S LH A G +P G F+R + L +D ++LTE Sbjct: 200 VLEYFFWFKSRLH------YHAHHGSIPLGGNTVFFARELLLRL------GGWDDRNLTE 247 Query: 267 DYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQ 326 D D+G R+ G V + +E P T +RQ Sbjct: 248 DADMGLRISAMGERVRVV---------------------YDDRYVTKEETPPTLGHFIRQ 286 Query: 327 KSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESL 386 ++RW G Q K W T W LA V++ Sbjct: 287 RTRWSQGF-MQTLKKGTWKKMPTRKQR-W-------------LAFYVLVFPRGQALLGLY 331 Query: 387 WPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV---TGYYGLTQGLLSVLRLFW 443 P + + I + +L L+V + +++ + T +GL +VLR+ Sbjct: 332 LPISLGMILILKVPVLIALCSYLPVLLLVAHFLVQMVGLYEFTDAHGLEASPKAVLRMAI 391 Query: 444 G----NLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 ++ A RA+++ L R W+KT H Sbjct: 392 AWFPFQMVLAYAALRAMRRQLA----GRHDWEKTQH 423 >UniRef50_Q7D1E2 Glycosyltransferase n=1 Tax=Agrobacterium tumefaciens str. C58 RepID=Q7D1E2_AGRT5 Length = 515 Score = 219 bits (558), Expect = 3e-55, Method: Composition-based stats. Identities = 76/418 (18%), Positives = 134/418 (32%), Gaps = 66/418 (15%) Query: 71 LAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 I+V + E VI + L I + +D T + + P++ Sbjct: 143 YTILVALYREEAVIEQLVSALERLDWPRSRLDIKLVCEADDGATIEAIRRINPG-PHMEI 201 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYL 185 V P +K L L + +++DAED P +LR F Sbjct: 202 VQVPPSEPRTKPKALTYAL---------SGARGTFVVVYDAEDRPHPQQLREAYAAFRNQ 252 Query: 186 VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 E +Q P+ + + S + E++ L +P +P G F Sbjct: 253 PEDMACVQAPLI-ISNASSSWLSACFALEYAGLFRCMLPALATHGLPLPLGGTSNHFRTA 311 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 A+ A+D ++TED D+G RL G +R +++A Sbjct: 312 ALRR------AGAWDPYNVTEDADLGLRLHRLGYRCGVIRRQTLEDA------------- 352 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-RKGAISN 364 P + + Q++RW G + W + R A Sbjct: 353 -----------PTSLPVWLNQRTRWFKGWL------QSWLVMTRTPFATARTMGWFAYMT 395 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGS--AWLMTLLWLNFGLMVNRIVQRV 422 F + +++ L L + SL A W L +++ +V V Sbjct: 396 FQLLIGGMLLSSLTHPLLFVSLVFMAIAIRENGVDLLFRWQGALFFIDALNIVGSYTIFV 455 Query: 423 IFVTGYYGLTQGLLS-----VLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + + + L+W L+ +A WRA+ ++ R W+KT H Sbjct: 456 LMGRSRMIAYERRQVGRRWLAMPLYW--LMLSVAAWRAVVEL----KTRPFVWNKTPH 507 >UniRef50_A8IJY6 Putative glycosyltransferase n=1 Tax=Azorhizobium caulinodans ORS 571 RepID=A8IJY6_AZOC5 Length = 645 Score = 217 bits (553), Expect = 1e-54, Method: Composition-based stats. Identities = 86/467 (18%), Positives = 144/467 (30%), Gaps = 65/467 (13%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 L ++ + L + ++ R + R L +I+VP + E Sbjct: 208 PLETLLIVQSL----LSSIFLASATLRIATCLARPEEPPPLGLPDAALPLYSIIVPLYRE 263 Query: 81 TGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 V+ + E I + P+D A P +V GP + Sbjct: 264 ERVLPRLVRALQAIDYPPEKLDIKIVVEPDDAP-VHAALARMALPPWFEIIVAPDVGPRT 322 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIP 195 K LN L + ++ DAEDV P +L+ F +Q Sbjct: 323 KPKALNCALPF---------TRGSFVVVFDAEDVPDPDQLKRALAAFRQGGRNLACVQAR 373 Query: 196 VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGD 255 + + T + S + E++ +P L + G F R + + Sbjct: 374 LSVENADET-WISRLFAAEYAGQFDVLLPGLAQLRMPILLGGTSNHFRRSMLELI----- 427 Query: 256 GIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREY 315 A+D ++TED D+G RL G T + +EA Sbjct: 428 -GAWDPYNVTEDADLGVRLARAGWTTAVIGSSTAEEA----------------------- 463 Query: 316 FPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRK-GAISNFVSFLAMLVM 374 P T + +RQ++RW+ G L R+ G + + A Sbjct: 464 -PITRAAWMRQRTRWLKGWA------QTLLVHGRQPLRLVRELGWGNLVPLLLLTAGPFA 516 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWL----MTLLWLNFGLMVNRIVQRVIFVTGYYG 430 LL L L D + + + L L N + G Sbjct: 517 SALLHPLCVAWLIADVVRGVFLTTPGTTLGVVATALSLTNLAIGYGAAAWSCGLGLKRRG 576 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDF 477 ++ L + L+ +A WRA+ Q++ R WDKT H Sbjct: 577 QFALAPILILLPFYWLLLSVAAWRAVVQLI----VRPYWWDKTEHGL 619 >UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6 Tax=Bacteria RepID=Q2JNJ5_SYNJB Length = 764 Score = 216 bits (550), Expect = 2e-54, Method: Composition-based stats. Identities = 86/465 (18%), Positives = 151/465 (32%), Gaps = 76/465 (16%) Query: 34 FFIDVVYWVRRIKRKL----SVYRRYPRMSYRELYKPDEKPL---AIMVPAWNETGVIGN 86 I++ Y + + L R+ +++ E+ D++ L I+VP + E V+ Sbjct: 341 LLINLFYVASILFKLLLSLVGSADRFHQITDEEVAALDDRDLPIYTILVPVYKEPEVMPI 400 Query: 87 MAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN-VHKVVCARPGPTSKADCL 144 + + L+ +E + + ND DT A+ P V ++ P +K Sbjct: 401 LIKSLSKLDYPHERLDVLILLEENDRDTIEAAR--AAKPPRYVRLLLVPDSKPKTKPKAC 458 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIPVYPFE 200 N L ++DAED+ P +L+ F +Q + F Sbjct: 459 NYGLAFA---------RGEYLTIYDAEDIPDPDQLKKAVIAFRKGDPSLVCVQAALNYFN 509 Query: 201 REWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFD 260 R T M + E+S +P E L +P G F + L +D Sbjct: 510 RSENFLTRM-FTLEYSYWFDYLLPGLETLRMPIPLGGTSNHFRTDRLREL------QGWD 562 Query: 261 VQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTF 320 ++TED D+G R + G T + +EA Sbjct: 563 PFNVTEDADLGIRASQHGYTVGVINSTTYEEANCA------------------------V 598 Query: 321 STAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISN-----FVSFLAMLVMI 375 +RQ+SRWI G + W R K + N F + + Sbjct: 599 KNWIRQRSRWIKGYM------QTWLVHNRNPLRSLR--KLGLKNWLSYQFFIGGSFFTFL 650 Query: 376 QLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGL--MVNRIVQRVIFVTGYYGLTQ 433 ++ W +WL+ L N + + + V Y Sbjct: 651 TSPIMWLLFIYWLLTRAHWLQNLFPSWLVYLGLFNLLVGNAIGIYLNLVAVFRRGYYDLA 710 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 + ++W ++ MA + AL Q+ + W+KT H Sbjct: 711 FYALLNPIYWQ--LHSMAAYMALWQLFT----KPFYWEKTIHGLS 749 Score = 69.8 bits (169), Expect = 3e-10, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 59/189 (31%), Gaps = 9/189 (4%) Query: 455 ALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRV 513 A+ Q ++H P + + + ++L +T EQ AL R Sbjct: 111 AIWQSVKHLCPEAEKLWEIPATEKQILTVILERKLAAELLGSR--LTLEQWQQALEIRRR 168 Query: 514 EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES----IDAWQIPSSLIAEMPASVALHY 569 G LG + +S + LA G S SL + V + + Sbjct: 169 TGSSLGQVLTRLSYLSTPEYLSILAHLLGYPAVSELMGTGLLHRDESLSRQFDPEVMMRH 228 Query: 570 AVLPLR-LENDELIVGSEDGIDPVSLAAL-TRKVGRKVRYVIVLRGQIVTGLRHWYARRR 627 PL + L V D +D V L + + G ++ V+ I L R Sbjct: 229 LFYPLSWTDEHTLTVMVNDPLDWVVDELLYSWRPGLRIEKVLGTEQDITQLLSQDQGSRF 288 Query: 628 GHDPRAMLY 636 + L Sbjct: 289 SQEAVYKLM 297 >UniRef50_Q2NDA0 Probable inner membrane transmembrane protein n=1 Tax=Erythrobacter litoralis HTCC2594 RepID=Q2NDA0_ERYLH Length = 318 Score = 216 bits (549), Expect = 4e-54, Method: Composition-based stats. Identities = 67/199 (33%), Positives = 99/199 (49%), Gaps = 6/199 (3%) Query: 32 DDFFIDVVY-WVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAEL 90 D+ +D V+ W+R R ++ R RE P E A+ +PAW E VIG E Sbjct: 33 DELIVDAVWLWLRLTGRGETIEVRR-----RERSLPLEGKSAVFIPAWQEANVIGTTVEH 87 Query: 91 AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDA 150 + ++VG Y NDP T + E P + V+ GPT+KADCLN + A Sbjct: 88 MLSAWPQRALRLYVGCYRNDPATLAAIVEAAPGDPRLRVVIHDCDGPTTKADCLNRLYRA 147 Query: 151 ITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWTHFTSMT 210 + + E + +LHDAED++ P L L + D IQ+PV P ++ + + Sbjct: 148 VEEDEARSGERCRMVLLHDAEDMVDPAALELCGRAIASADFIQLPVLPEPQKRSRWIGSH 207 Query: 211 YIDEFSELHGKDVPVREAL 229 Y +EF+E HGK + VR+AL Sbjct: 208 YCEEFAEAHGKAMVVRDAL 226 >UniRef50_A1TEN6 Glycosyl transferase, family 2 n=7 Tax=Actinomycetales RepID=A1TEN6_MYCVP Length = 461 Score = 215 bits (548), Expect = 4e-54, Method: Composition-based stats. Identities = 92/472 (19%), Positives = 165/472 (34%), Gaps = 71/472 (15%) Query: 10 TWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEK 69 WL L ++ +++++FI ++W+ R +R K Sbjct: 40 QWL--LYIVMTVISLLLFIVAA-----TTLWWMLHAWRSPESLHST---GFRRRSAGRPK 89 Query: 70 PLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 ++++PA +E V+G+ + A LD+ Y + V +DP+T+ AR P + + Sbjct: 90 GFSLLLPARHEQDVLGDTID-ALARLDHPLYEVIVIIGHDDPETEHVARAAAARHPRIVR 148 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE-- 187 VV P +K LN L + DAED + P LRL E Sbjct: 149 VVIDTNIPKNKPKALNTALPTC---------RGEIVGVFDAEDEVHPRLLRLVEARFEEA 199 Query: 188 RKDLIQIPVYPFEREWTHFT---SMTYIDEF-SELHGKDVPVREALAGQVPSAGVGTCFS 243 R D++Q V + + ++ + Y F S LH A +P G T F+ Sbjct: 200 RADVVQSGVQLMNIQTSWWSLRNCLEYYFWFRSRLH------FHADQRFIPLGG-NTVFA 252 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 R A+ + +D L ED +IG RL +G P Sbjct: 253 RTALLRSV-----GGWDRDCLAEDCEIGVRLSTRGARVAVAYDP---------------- 291 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAIS 363 + RE P + V+Q++RW G Q ++ +W + + + ++ Sbjct: 292 -----KVVTREETPGSLRALVKQRTRWDQGF-MQVYRKGEWRKLPSRRQRMLA--RYTLA 343 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 A ++ + + + P LS + L+T+ + + Sbjct: 344 MPFLQAATGALVPIAIACMFVLKVPVPLTLLSFLPLAPTLVTVAV--EAAALGEFGKEFG 401 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 Q L + + L+ A +++ G W+KT H Sbjct: 402 IRIRL--WDQVRLVLGAFPYQLLLAAAAVRSVWRELRGQG-----GWEKTEH 446 >UniRef50_B6JD78 Glycosyl transferase, family 2 n=1 Tax=Oligotropha carboxidovorans OM5 RepID=B6JD78_OLICO Length = 669 Score = 215 bits (546), Expect = 6e-54, Method: Composition-based stats. Identities = 90/498 (18%), Positives = 164/498 (32%), Gaps = 73/498 (14%) Query: 10 TWLYGLKVIAITLAVIMF---ISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 W YGLK++A T A ++ ++ L + + + + R + R Sbjct: 217 AWRYGLKLLAFTSAFMLAPTLLTQLSGAVLAIWFLLFNSLRLAGAFAGGERTPRSPRIPD 276 Query: 67 DEKPL-AIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARF 124 + PL +MV ++E + ++ + LAA E I + +D +TQ + + Sbjct: 277 AQLPLYTVMVALYHEGPSVAHLVQSLAALDYPREKLDILLLLEADDIETQAALSRL-HLP 335 Query: 125 PNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR---- 180 NV ++ GP +K LN L + + DAED P +LR Sbjct: 336 GNVQTLIVPPFGPRTKPKALNAGLMSA---------RGEFTAVFDAEDRPDPSQLRDAID 386 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 F + +Q + + M + E++ + +P G Sbjct: 387 AFRHHHTDVACVQASLCIDNSADSWLACM-FTAEYAGQFDVFLRGFSQFGLPLPLGGSSN 445 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F + + +D ++TED D+GFRL +G + Sbjct: 446 HFRTDVLREV------GGWDAYNVTEDADLGFRLARRGYRAVMFDST------------- 486 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 E P +RQ+SRW+ G + W + L + + Sbjct: 487 -----------TYEEAPAHTGAWLRQRSRWMKGWM------QTWIVHMRSPRRLIK--QS 527 Query: 361 AISNFVSF--------LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFG 412 ++ F + L L L+ E+ ++ Sbjct: 528 GLAGFFTLNLLVGGNVLTALAYPILIAACLLEAGLAATGSTAVAMFSGPFIELHFTTIAA 587 Query: 413 LMVNRIVQRVIFVTGYYGLTQGL-LSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWD 471 ++ +V ++ + L L + L+W L +A WRAL+Q+L W+ Sbjct: 588 GYLSTVVVSLMGLARRGLLRHAWVLVLTPLYWAWL--SIAAWRALRQLLHD----PYHWE 641 Query: 472 KTTHDFPSVTGDTRSLRP 489 KT H + R R Sbjct: 642 KTEHGLARHSRLARHQRK 659 >UniRef50_Q11MF6 Glycosyl transferase, group 2 family protein n=1 Tax=Chelativorans sp. BNC1 RepID=Q11MF6_MESSB Length = 656 Score = 215 bits (546), Expect = 7e-54, Method: Composition-based stats. Identities = 77/443 (17%), Positives = 143/443 (32%), Gaps = 76/443 (17%) Query: 52 YRRYPRMSYRELYKPDEKP-LAIMVPAWNETGVIGNM-AELAATTLDYENYHIFVGTYPN 109 R + +P E P ++V + E V+ + L I + + Sbjct: 238 AREPSPPAVLTTMRPAEMPVYTVLVALYREADVVPELLVSLGRIVWPRSKLEIKLVCESD 297 Query: 110 DPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHD 169 D +T + + + GP +K L+ L +T +L D Sbjct: 298 DTETLAAIRAQE-LHSYIEVIEVPPHGPRTKPKALSYALPLVTG---------EFVVLFD 347 Query: 170 AEDVISPMEL-----RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVP 224 AED PM+L R + +Q P+ R + + + + E++ L +P Sbjct: 348 AEDRPHPMQLVEAWERFRSNESGDLACLQAPLMITNRGES-WIASMFAFEYAALFRGILP 406 Query: 225 VREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFV 284 +P G F R + + +D ++TED D+G RL G + Sbjct: 407 WLAKRDLVLPLGGTSNHFRRALLERV------GGWDPCNVTEDADLGLRLARMGYKTGTI 460 Query: 285 RFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKW 344 P ++A P T + Q++RW G + W Sbjct: 461 SSPTYEDA------------------------PTTACIWLPQRTRWFKGWM------QTW 490 Query: 345 TSSLTLNYFLWRD--RKGAISNFVSFLAMLV--MIQLLLLLAYESLWPDAWHFLSIFSGS 400 + L+R+ R + + + M V + ++ L + + Sbjct: 491 LVHMRDVPRLYRELGRTSFLVTQILTMGMWVSALAYAAFPISAAVLLVIMLAQDNPVNHY 550 Query: 401 AWLMTLLWLN-------FGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANW 453 L+ L +N F + R + + + + + ++W L A W Sbjct: 551 TALLALDAVNVIFGHGAFLALGWRTLPK---TEQHGLWRHMM--WIPVYWALL--SAAAW 603 Query: 454 RALKQVLQHGDPRRVAWDKTTHD 476 RAL Q+ + W+KT H Sbjct: 604 RALWQLYR----CPHVWEKTPHR 622 >UniRef50_B0RVF2 Glycosyltransferase n=8 Tax=Proteobacteria RepID=B0RVF2_XANCB Length = 635 Score = 213 bits (543), Expect = 2e-53, Method: Composition-based stats. Identities = 86/482 (17%), Positives = 161/482 (33%), Gaps = 69/482 (14%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVP 76 + IT+ V++ + L F + ++ + ++ + ++ L D ++VP Sbjct: 206 ITLITINVLVALGFLATFGLKLLLVWFGSRHRIDIKVTEDEVAA--LRDDDLPVYTVLVP 263 Query: 77 AWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 + E V+ + A LDY + + +D +T ++ + Sbjct: 264 MYKEPEVLP-ILANALRKLDYPISKLDVKLVLEADDFETIEAAKKLG-LEAFFEIIRVPP 321 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKD 190 P +K N L ++DAED P +L+ F + Sbjct: 322 SQPKTKPKACNYAL---------HFARGELLTIYDAEDKPEPDQLKRVVAAFRKAEKDVV 372 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 IQ + + + T M + E++ +P E L +P G F + + Sbjct: 373 CIQARLNYYNADENWLTRM-FTLEYTLWFDFYLPALEYLRIPIPLGGTSNHFRLDVLRQV 431 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 A +D ++TED D+G RL + G V +EA Sbjct: 432 RA------WDPYNVTEDADLGVRLIQNGYRVNVVNSTTFEEANV---------------- 469 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-RKGAISNFVSFL 369 + +RQ+SRW+ G + W + L+R F F+ Sbjct: 470 --------SIPNWIRQRSRWLKGYM------QTWLVHMRDPVHLYRSTGFKGFWGFQFFI 515 Query: 370 AMLVMIQLLLLLAYESLWPDAWHFLSIFSG--SAWLMTLLWLNFG---LMVNRIVQRVIF 424 I L + + + IF WL T+ +N + F Sbjct: 516 GGNFFIALGVPVMWTLCLISMLSGARIFDATFPPWLATISLVNLLLANAFFVYVTLVAAF 575 Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDT 484 Y+ L L + +W L+ +A ++ L Q++++ W+KTTH + Sbjct: 576 KRDYFKLAPYAL-TVPFYW--LLQSIAAYKGLWQLIRN----PFYWEKTTHGISKHSEQE 628 Query: 485 RS 486 R Sbjct: 629 RR 630 Score = 76.4 bits (186), Expect = 4e-12, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 60/144 (41%), Gaps = 2/144 (1%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALA 538 R LG+ L+ VIT+EQL AL + RLG +L Q + A++ +A Sbjct: 13 APDIGRERGLLGRSLVSAGVITDEQLRAALALQQRWNSRLGDVILAQRGVPAQRFYAIVA 72 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 G+ + + L+ V +LP R E+ L++ D DP A Sbjct: 73 AHFGLQFVDLVQQPPDPELLTATDLDVYAQRLILPWRREDGVLVLAVADP-DPALFAWAR 131 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHW 622 G +VR+V + I+ L+ + Sbjct: 132 EHYGAQVRFVGTAKFDIIWSLQRY 155 >UniRef50_D2LBM7 Glycosyl transferase family 2 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LBM7_RHOVA Length = 643 Score = 213 bits (541), Expect = 2e-53, Method: Composition-based stats. Identities = 93/489 (19%), Positives = 166/489 (33%), Gaps = 75/489 (15%) Query: 8 FATWLYGLKVIAITLA---VIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELY 64 FAT GL + A+ A + S + F + +R +++ P+ L Sbjct: 204 FATVAVGLVLGAVAFAPAETLTLASAMLSIFFLLTIALRAAA-AVNIALPRPKAKEARLL 262 Query: 65 KPDEKP-LAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVC 121 E P ++VP + ET ++ ++A A ++DY I + +D DT ++ Sbjct: 263 GDAELPRYTVLVPLYRETAILPHLA-HALASIDYPAAKLDIKIVLEASDRDTIEAAQKL- 320 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR- 180 A NV VV P +K LN L + +++DAED P +LR Sbjct: 321 AFPGNVDLVVVPDREPRTKPKALNYALHFASG---------EFVVIYDAEDRPEPDQLRK 371 Query: 181 ---LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 +F +Q + + + + E++ L +P+ +P G Sbjct: 372 AATVFAQAPADLVCLQARLDYYNARENWLSRQ-FTIEYATLFRGLLPLLARFRLPLPLGG 430 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F A+ + A+D ++TED D+G RL G + +EA R Sbjct: 431 TSNHFRAAALREI------GAWDPYNVTEDADLGMRLARAGYRTGTLESTTWEEACCRPM 484 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 ++Q++RW+ G + + + + Sbjct: 485 P------------------------WLKQRTRWLKGWM------QTFGVHMRRPREAMSE 514 Query: 358 RKGAISNFVSFLAML---VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLM 414 ++ F+SF A ++ L + Y + + A LLW+ Sbjct: 515 --LGVAGFLSFHAYFAGIIVSALAHPVFYILMLYEVLQGRLFAGEGAMENLLLWIAAVNF 572 Query: 415 VNRIVQRVIFVT-GYYGLTQGLL----SVLRLFWGNLINFMANWRALKQVLQHGDPRRVA 469 V + G L + ++W L A +RAL Q+ Sbjct: 573 VGGYAANIALSAFAVAGTRHRHLMLHVMFIPVYW--LFVSAAAYRALWQLFH----APFH 626 Query: 470 WDKTTHDFP 478 W+KT H Sbjct: 627 WEKTEHGVS 635 >UniRef50_C6PR94 Glycosyl transferase family 2 n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PR94_9CLOT Length = 743 Score = 211 bits (538), Expect = 6e-53, Method: Composition-based stats. Identities = 89/480 (18%), Positives = 171/480 (35%), Gaps = 89/480 (18%) Query: 30 GLDDFFIDVVYWVRRIKRKLSVYRRYP----RMSYRELYKPDEKPL---AIMVPAWNETG 82 ++ FF + ++ +K + + Y + E+ DEK L I++P + E Sbjct: 308 SINLFFQSIYAFMTTLKLYIVIKGSYKDKQLHFTTEEIEAIDEKELPTYTILIPVYKEKE 367 Query: 83 VIGNMAELAATTLDYENYHIFV--GTYPNDPDTQRDVDEVCARFPNVH-KVVCARPGPTS 139 VI + + +DY Y + V +D +T V + + P + ++ + P + Sbjct: 368 VIKTLIK-NIENIDYPKYKLDVCILLEEDDDETISTVKAM--KLPEYYSMIIVPKNTPKT 424 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIP 195 K N L +++DAED +L+ F L + IQ Sbjct: 425 KPKACNYGLI---------RARGKYVVIYDAEDRPESDQLKKVYLSFKKLPKNYVCIQSK 475 Query: 196 VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGD 255 + F + T + + E+S + + +P G F + + Sbjct: 476 LNYFNSDQNFLTRL-FTQEYSMWFELLLVGIMQIKTPIPLGGTSNHFKIEFLKEV----- 529 Query: 256 GIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREY 315 A+D ++TED D+G RL +KG V +EA Sbjct: 530 -GAWDPFNVTEDADLGVRLFKKGYNTAVVDSRTWEEANSD-------------------- 568 Query: 316 FPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMI 375 S +RQ+SRWI G + W + L++ + FV + AM++ Sbjct: 569 ----LSNWIRQRSRWIKGYM------QTWFVHMRHPVQLYKS--LGLKGFVGYQAMILGT 616 Query: 376 QLLLLL-----------------AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI 418 LL L+ S++P +++++ F + N M I Sbjct: 617 PLLPLINPIFWLMLILWYTTKASWIRSMFPGVFYYIAAFQLFFGNFMFTYTNAVGMYWVI 676 Query: 419 VQRVIFVTGYYGLTQGLLSVL-RLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDF 477 + + ++L ++W ++ +A ++AL Q++ + W+KT H Sbjct: 677 RDCSLKKEQPFSYRLVKYALLSPIYW--ILMSVAAYKALIQLI----IKPFYWEKTNHGL 730 Score = 136 bits (341), Expect = 5e-30, Method: Composition-based stats. Identities = 41/159 (25%), Positives = 74/159 (46%), Gaps = 4/159 (2%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALA 538 + + +S P+G++L+EN IT+EQL AL R G RLG +L G I E+L + LA Sbjct: 113 LNSNYKSKLPIGKMLVENNEITKEQLIKALDLQRKSGGRLGDILLFLGFIKPERLCRYLA 172 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 QN V ++ ++P +AL Y + L N+ ++ ++ + L + Sbjct: 173 TQNNVGRI---GKNFDINVSKKLPYKLALKYNAIILNSRNNCYVIAVKELLSWKQLKEIE 229 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYN 637 + + V V+ +I Y +++ + LY+ Sbjct: 230 GYLHKPVEQVLATMLEIDNFWNIVYRKKQSEESVFKLYD 268 Score = 71.7 bits (174), Expect = 1e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 65/203 (32%), Gaps = 32/203 (15%) Query: 489 PLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 +G+ LL+ I+EEQL+ AL+ +G ++ G IS +QLA+ + + Sbjct: 8 RIGESLLKQGYISEEQLEIALKIQEKTNKLIGNILVESGFISQQQLAEYI---TNRQFSK 64 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 I + I I A+ Y + L ++ +G Sbjct: 65 IGEYLIYVKAITPDQLKQAIKYQ----EVNGGRL-------------GSILVSLGF---- 103 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEIL 667 I G+ Y + V++ +T++Q + +IL Sbjct: 104 -------INQGVLDNYLNSNYKSKLPIGKMLVENNEITKEQLIKALDLQRKSGGRLGDIL 156 Query: 668 TTLGHINRSAINVLLLRHERSSL 690 LG I + L Sbjct: 157 LFLGFIKPERLCRYLATQNNVGR 179 >UniRef50_C6XK76 Glycosyl transferase family 2 n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XK76_HIRBI Length = 488 Score = 211 bits (536), Expect = 9e-53, Method: Composition-based stats. Identities = 76/426 (17%), Positives = 141/426 (33%), Gaps = 66/426 (15%) Query: 70 PLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 +++P ++E V + A + +F T +D T+ + + + N Sbjct: 121 KFTLLIPLYHEQAVASRSVSAMEALNYPADKLEVFYLTEEDDKATESALKKAI-KHQNFK 179 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER 188 + + P +K LN L T ++DAED+ P +L + Sbjct: 180 IISVPKHAPRTKPKALNYGLQFSTG---------DIVTVYDAEDIPHPQQLLAAAQAFQN 230 Query: 189 ----KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 +IQ P++ + E + + + + E++ +P + +P G F + Sbjct: 231 GGTNLAVIQAPLHAYNGEES-WIASQFDLEYAIHFDVWLPAMTKMGWPIPLGGTSNHFKK 289 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 + + A+D ++TED D+G+RL G + + P Sbjct: 290 NVLEKV------GAWDPFNVTEDADLGYRLALNGYSAGMIELP----------------- 326 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLN-YFLWRDRKGAIS 363 RE P + + Q++RWI G Q T+ LWR I+ Sbjct: 327 -------TREEAPINLAQWLPQRTRWIKG-HIQSLAVLSRKPFETIKSLGLWRSLGCLIT 378 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 + L + LLL LAY + A + L+ ++ + + + V Sbjct: 379 FVSAILTAGLHGPLLLYLAYSII--TAPNTLNPLHLIPIILAFSSVILAALASSAVTHCF 436 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 L +W + A RA+ ++ R W KT H Sbjct: 437 ----------KPLLTAPFYWP--LMSFAFIRAIWEL----HTRPYIWSKTQHGISKSKIP 480 Query: 484 TRSLRP 489 P Sbjct: 481 LLHKEP 486 >UniRef50_C6J4S3 Glycosyl transferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J4S3_9BACL Length = 413 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. Identities = 86/485 (17%), Positives = 160/485 (32%), Gaps = 86/485 (17%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 L VI I+L VI+ + G+ F + + +YR+ ++ Y K A++ Sbjct: 2 LDVILISLQVILAVVGVYQFGLAL----------FGMYRKKNKVQYEP-----SKSFAVL 46 Query: 75 VPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 V A NE V+G + E E Y +FV D D R ++ V Sbjct: 47 VAAHNEEKVVGALMENLKQMNYPKELYDVFVIC-----DNCSDNTANIVRSHGMNACVRT 101 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLI 192 P K + +L + + R + ++ DA++++ P LR + N L +I Sbjct: 102 NPNLRGKGYAIEWMLKQLWKMPRQ----YDAVVMFDADNLVHPDFLREMNNDLCAGARVI 157 Query: 193 QIPV--YPFEREW---THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 Q + E W ++ S Y + +L ++ + L G G CF + Sbjct: 158 QGYIDTKNPEDSWITASYGISYWYCNRLWQLSRTNLKMANFL------GGTGMCFETELL 211 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 + + SL ED + R E+G+ +F +AK +++ Sbjct: 212 KEI-------GWGATSLVEDLEFTMRCVERGVYPVF-----NYDAKLFDEK--------- 250 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIG---IVFQGFKTHKWTSSLTLNYFLWRDRKGAISN 364 P TF + RQ+ RW+ G + + F W S + A+ Sbjct: 251 ---------PLTFKASARQRLRWMQGHFTVARRYFFPLLWKS---IKERNMVKLDMALYG 298 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWL-NFGLMVNRIVQRVI 423 ++ +L + + +SL + W+ + + N + + + + Sbjct: 299 ANVYIVLLTFLLTAFIWVDQSLMQEPHVKTLYGYLPMWVSYVAIVANVFIFLMAMFLEKV 358 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 Y L +W W T H + Sbjct: 359 KSKKVYAYLVLFPIYLISWWPI------------TFYAFFTQNNKQWSHTQHTRVVRLEE 406 Query: 484 TRSLR 488 +S + Sbjct: 407 VQSNK 411 >UniRef50_B9JQZ7 Glycosyltransferase n=1 Tax=Agrobacterium vitis S4 RepID=B9JQZ7_AGRVS Length = 664 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. Identities = 72/432 (16%), Positives = 144/432 (33%), Gaps = 72/432 (16%) Query: 71 LAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 ++V + E+ +I + + I + +D DT + E ++ Sbjct: 282 YTVLVALYRESSMIPQLIDGLRRLDWPVSRLDIKLVCEADDLDTLGALAE-ADIPAHIEI 340 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYL 185 V GP +K L+ L + +L+DAED P +L+ F Sbjct: 341 VPTPPIGPRTKPKALSYAL---------SGARGDFLVLYDAEDRPHPAQLKEAYAHFLSR 391 Query: 186 VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 +Q P+ + + + S + E++ L +P+ +P G F Sbjct: 392 PPEVACLQAPLIIANGDES-WISALFALEYAALFRGTLPMLAYHGMPLPLGGTSNHFRIE 450 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 A+ D A+D ++TED D+G RL G + +++A Sbjct: 451 ALK------DVGAWDPYNVTEDADLGLRLFRAGYRCETITRQTLEDA------------- 491 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG-AISN 364 P + + Q+SRW G + W + ++ G A + Sbjct: 492 -----------PVSSRIWMGQRSRWFKGWL------QTWLIVMREPRVACKEMGGSAFAV 534 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTL--LWLNFGLMVNRIVQRV 422 F + +++ L L + + + L L W++ ++ + + Sbjct: 535 FHLMIGGMLLSSLSHPALLLFLTMTVYSMANPPADGIPLRDLTVFWIDLVNILGSYLIFL 594 Query: 423 I--------FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 F G + L+W L+ +A WRA+ ++ + W+KT Sbjct: 595 ALGRAAMTEFERRRIGRRYL---FIPLYW--LMTSIAAWRAMIEL----KTKPFFWNKTP 645 Query: 475 HDFPSVTGDTRS 486 H + ++ Sbjct: 646 HAPRGNERNRKA 657 >UniRef50_A7HQ64 Glycosyl transferase family 2 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQ64_PARL1 Length = 652 Score = 210 bits (533), Expect = 2e-52, Method: Composition-based stats. Identities = 81/417 (19%), Positives = 133/417 (31%), Gaps = 64/417 (15%) Query: 71 LAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFP-NV 127 +MVP + E V+ + A LDY I +D +T + R P + Sbjct: 272 YTVMVPLFREASVLP-ILATALRELDYPASKLDIKFIFEESDVETYEAAKAL--RLPDHF 328 Query: 128 HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFN 183 +V P +K N L +++DAED P +L+ F Sbjct: 329 EFIVVPTSFPQTKPKACNFALPFA---------RGEFLVIYDAEDAPEPQQLKKAVSAFR 379 Query: 184 YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 E+ +Q + + T + E++ +P L +P G T F Sbjct: 380 LGDEKLACVQAQLNYYNWRENWLTRQ-FALEYAAFFDLMLPTMARLRLPIPLGGTSTHFR 438 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 + + A+D ++TED D+G R G +R +EA + Sbjct: 439 TELLR------NAGAWDPNNVTEDADLGLRFALHGYRCSIIRSTTEEEANCK-------- 484 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR--DRKGA 361 VRQ+SRWI G + + + L+R +G Sbjct: 485 ----------------LPNWVRQRSRWIKGWM------QTYLVRMRHPVRLYRALGLRGF 522 Query: 362 ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 I F + + LL Y L + L L V Sbjct: 523 I-GFQVLIGGSTLSSLLHPFLYLGLIVPLIESGLAGDLTG-LTVFHLLVLVSGYALAVSA 580 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 + GL Q + L + L+ A ++AL Q++ + W+KT H Sbjct: 581 GLAAASARGLPQLFIHTLTMPAYWLLLSFAAYKALWQLV----VKPFHWEKTDHGIS 633 Score = 72.1 bits (175), Expect = 7e-11, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 60/163 (36%), Gaps = 3/163 (1%) Query: 464 DPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSM 522 + R++ K P S LG++ +E ++ E L AL + G LG + Sbjct: 11 EQARLSERKPGRRAPRGRPQPGSRPLLGEMAVEAGLVQPEALAPALEKQAHWGGPLGRIL 70 Query: 523 LMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELI 582 + G + +A+ Q G+ + + +SL + L LP R E I Sbjct: 71 VSIGAMRVADVARLYGRQRGLPFVDLQEEPHETSLASSERLDFYLREMCLPWRRRAGETI 130 Query: 583 VGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYAR 625 + D + A+T + GR + + I + + + Sbjct: 131 YVAADPDRSRA--AITGQEGRPLPVFVTSPRDISRTVTRAFGQ 171 >UniRef50_C8WIR5 Type II secretion system protein E n=2 Tax=Bacteria RepID=C8WIR5_EGGLE Length = 568 Score = 208 bits (530), Expect = 5e-52, Method: Composition-based stats. Identities = 60/258 (23%), Positives = 113/258 (43%), Gaps = 21/258 (8%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG +L++ +ITE+QL AL+ + E RLG ++ +G+I+ L +AL Q GV + Sbjct: 4 KRLGDVLIDAGLITEDQLGHALKQQKETKRRLGDELIAEGVITEAGLIEALQMQLGVEFV 63 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + A + L + +VA Y V+P+R DE+ + D ++ +++ A+ ++V Sbjct: 64 DLSAIDLDPELSRVISKNVARQYNVVPVRTSPDEVCLAMSDPLNFMAIEAVKNATRKRVI 123 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLY---------NAVQHQWLTEQQAGEIWRQYV 657 ++ ++ + Y + +A + T + Q Sbjct: 124 PMVTTHDSLMRAIMTLYGNEGAARAIEEMKRDARTTGADDASTGSFQTSTLGDDADAQSA 183 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P L I+ S I++ E + L + +GV L +LT+ +EL Sbjct: 184 PTVRLVNSIIERAATERASDIHL-----EPREIDLHVRMRIDGV-----LRTILTVPKEL 233 Query: 718 QVSMQS-LLLKAGLNTEQ 734 Q S+ S L + G+NT + Sbjct: 234 QASVISRLKIMGGMNTSE 251 >UniRef50_D2M877 Glycosyl transferase family 2 n=1 Tax=Rhodopseudomonas palustris DX-1 RepID=D2M877_RHOPA Length = 706 Score = 208 bits (528), Expect = 8e-52, Method: Composition-based stats. Identities = 93/460 (20%), Positives = 162/460 (35%), Gaps = 76/460 (16%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDE-KPLAIMVPA-WNETGVIGNMAE-LAATT 94 ++++ +L R + +PD P+ +V A + E +G + E L A Sbjct: 268 AIWFIGFAGLRLLASLWPRRAPPISVRRPDADLPVYTVVAALYREADSVGPLVEALEALD 327 Query: 95 LDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQF 154 E + + P+D T+ + + R P++ ++ P +K LN L Sbjct: 328 YPPEKLDLILVIEPDDLFTRAALARLKPR-PHLRVLIAPAVAPKTKPKALNYALAFA--- 383 Query: 155 ERSANFAFAGFILHDAEDVISPMELRLFNYLVERK----DLIQ--IPVYPFEREWTHFTS 208 + + DAED P +LR + +Q + + W S Sbjct: 384 ------RGSFIAVFDAEDRPDPGQLRAALAAFDGAGRETACVQASLCIDNLTHSW---LS 434 Query: 209 MTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDY 268 T++ E++ +P AL +P G F + A+ A+D ++TED Sbjct: 435 RTFLAEYAGQFDLFLPGLAALGLPLPLGGSSNHFRTDVLRAI------GAWDPHNVTEDA 488 Query: 269 DIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKS 328 D+GFRL G E P TF +RQ+S Sbjct: 489 DLGFRLARLGYRCGTFAST------------------------TYEEAPLTFGNWLRQRS 524 Query: 329 RWIIGIVFQGFKTHKWTSSLTLNYFLWRD---------RKGAISNFVSFLAMLVMIQLLL 379 RW+ G + W + LWR+ N +S LA +++ + L Sbjct: 525 RWMKGWI------QTWEVHMRHPLRLWRETGIGGVLALNLLLGGNVLSALAYPLLLMIAL 578 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 + A + + + + + L ++ G+ +V + V G+L + Sbjct: 579 MSAADWADSSPN---WLAADTPTALHWLAISSGVASTIVVGLLGLVRRRQWRHAGVLMLT 635 Query: 440 RLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS 479 L+W L +A WRAL + WDKT H S Sbjct: 636 PLYW--LCLSIAAWRALAHYVW----CPYRWDKTQHGVAS 669 >UniRef50_Q0EZ04 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EZ04_9PROT Length = 684 Score = 207 bits (526), Expect = 1e-51, Method: Composition-based stats. Identities = 87/505 (17%), Positives = 167/505 (33%), Gaps = 80/505 (15%) Query: 10 TWLYGLKVIAITLAVIMFISGLDDF------FIDVVYWVRRIKRKLSVYRRYPRMSYRE- 62 T+L+ ++I + I I L F+ + + +R + + + + + E Sbjct: 233 TFLFVWMTLSIIILAIWPIQSLVVLNLSISAFLMLNFGLRMLLGWVGGEKHFDQYVTDEE 292 Query: 63 ---LYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDV 117 L D I++P ++E + N A TLDY I + D +T Sbjct: 293 VLALDDRDLPVYTILLPMFHEAATLPN-IAQALRTLDYPLSKLDIKLILEQEDDETIDAA 351 Query: 118 DEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 E+ + P +K N L ++D ED P Sbjct: 352 KELG-LEGIFEIIRVPESLPQTKPKACNYAL---------HFSRGEMATIYDGEDAPEPD 401 Query: 178 ELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 +L+ F E +IQ + F T M + E+S +P + L + Sbjct: 402 QLKKAVIAFRKSPENTAVIQGRLNYFNVAENWLTRM-FTMEYSLWFDFYLPALDYLRIPI 460 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 P G F + + +D ++TED D G RL + G + +EA Sbjct: 461 PLGGTSNHFKMSVLREM------GGWDPYNVTEDCDFGVRLTQAGYRVGVMNSTTFEEAN 514 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF 353 ++ +RQ+SRW+ G + + + + Sbjct: 515 ------------------------NSIPNWIRQRSRWLKGYM------QSYLVHMRSPFK 544 Query: 354 LWRD-RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSG--SAWLMTLLWLN 410 L+ + F F+ ++ +L + + F ++++ + N Sbjct: 545 LYGELGHVGFWGFQFFIGGTIVSAMLTPVLFLMYIIWLLTSTFAFDPYFPSFVLYITLFN 604 Query: 411 FGL---MVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR 467 + M+ + F Y+GL L + +W L+ A ++ Q++ + Sbjct: 605 LLIANGMLIYLFMLSGFKRRYFGLIPWAL-TVPFYW--LLQSWAGYKGFWQLIHN----P 657 Query: 468 VAWDKTTHD---FPSVTGDTRSLRP 489 W+KT H F T+ +P Sbjct: 658 FYWEKTHHGLTSFEVTHSATQPEKP 682 Score = 80.6 bits (197), Expect = 2e-13, Method: Composition-based stats. Identities = 39/181 (21%), Positives = 71/181 (39%), Gaps = 14/181 (7%) Query: 477 FPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQ 535 F D +G +L+ V++ EQ++ A++ + E G RLG +L +G +S + Q Sbjct: 56 FYDALTDHFRRGRIGDLLVSKGVLSNEQMEEAVQIQSEWGTRLGDIILAKGWVSPYVMGQ 115 Query: 536 ALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA 595 LAE + + SL+ SV +P + E+ L V D Sbjct: 116 VLAEHFDKPHVDLMDRRPDISLLDRSKLSVYSENLFMPWQREDGLLKVAVVD-----VTP 170 Query: 596 ALTRKVGRKVRY----VIVLRGQIVTGLRH----WYARRRGHDPRAMLYNAVQHQWLTEQ 647 L + V V ++ + I+ L+ +Y+ + H+ L + T + Sbjct: 171 ELLKIVNETVDEPFDFIVTSKFDIIWLLQEIGGTYYSGKAVHELANTLPQYSASEVFTVK 230 Query: 648 Q 648 Q Sbjct: 231 Q 231 >UniRef50_A1B414 General secretory system II, protein E domain protein n=7 Tax=Rhodobacteraceae RepID=A1B414_PARDP Length = 698 Score = 203 bits (515), Expect = 3e-50, Method: Composition-based stats. Identities = 96/495 (19%), Positives = 166/495 (33%), Gaps = 68/495 (13%) Query: 12 LYGLKVIAITL--AVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEK 69 L + VIA+ V+ I+ + +RR +R + R + Sbjct: 212 LAPIAVIALLTGWTVLTLIASAALKLLSFAAILRRHRRDRTKAEAMARDAIPPPEMTAPL 271 Query: 70 P-LAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNV 127 P +++MVP + E + + L+ E I + D T +++ AR P Sbjct: 272 PVISVMVPLFAEADIAEKLIGRLSRLDYPRELMDILIVVEETDSVTCAALED--ARLPRW 329 Query: 128 HKVVCARPGP-TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLF 182 +VV GP +K LN L+ + + DAED P +L R F Sbjct: 330 LRVVKVPDGPVRTKPRALNYALNFC---------RGSIIGVWDAEDRPEPGQLLKVARGF 380 Query: 183 NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 ++ +Q + + T++ + + E++ + AL VP G F Sbjct: 381 HFAPPEVVCLQGVLDYYN-PRTNWLARAFTIEYASWFRGTLAGAAALDLVVPLGGTTLFF 439 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 R A+ + A+D ++TED D+G RL +G + +EA R Sbjct: 440 RREALEEV------GAWDAWNVTEDADLGVRLTRRGYRTRMLDTVTHEEANCRLIP---- 489 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG-- 360 V+Q+SRW+ G W + LWRD Sbjct: 490 --------------------WVKQRSRWLKGFAM------TWGVHMRDPVALWRDLGARR 523 Query: 361 ----AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 + F S L+ L P + + +L+ F Sbjct: 524 FIGLQVQLFASVSQYLLAPVLWSFWLLSLGLPHPMRGMLSGMLGGNAIAILFTLFVASEL 583 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 + ++ G L V L + +A W+A+ +V+ + WDKT H Sbjct: 584 LNIAIGLWAVRGRGHRHLLPWVPTLHLYFPLGCLAAWKAIYEVVA----KPFYWDKTQHG 639 Query: 477 -FPSVTGDTRSLRPL 490 F + + P+ Sbjct: 640 IFEAGQEEAPEPAPI 654 Score = 80.2 bits (196), Expect = 2e-13, Method: Composition-based stats. Identities = 33/139 (23%), Positives = 54/139 (38%), Gaps = 1/139 (0%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 R LRPLGQIL+E+ + L AL + + RLG +L G + E L +AL+ Q Sbjct: 27 VSARDLRPLGQILIEDGAVDPRNLFKALVMRQRQSARLGEILLANGWVREEALIRALSRQ 86 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 + + A L+ M A + L +P R + + +L + Sbjct: 87 WRASVLDLKALPPDPRLVDAMGAQLCLAEGAVPWRRVGGVTFIATARPEGFQALQDRLPQ 146 Query: 601 VGRKVRYVIVLRGQIVTGL 619 VR ++ + Sbjct: 147 DFGAVRMLLCSENAAREAI 165 >UniRef50_D0L467 Glycosyl transferase family 2 n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L467_GORB4 Length = 512 Score = 202 bits (513), Expect = 4e-50, Method: Composition-based stats. Identities = 80/434 (18%), Positives = 144/434 (33%), Gaps = 68/434 (15%) Query: 59 SYRELYKPDEKPLAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDV 117 R + D P ++VPA+ E V+G++ + + + + +D T Sbjct: 117 QARSIPDDDLPPYTVLVPAYGEPEVVGDLIAAVESIEYPRDKLQVLLLLEEDDEPTIVAA 176 Query: 118 DEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 V A V V+ P +K N L T + DAED P+ Sbjct: 177 RAVEA-SGIVTVVLTPPADPRTKPKACNYGLHFATG---------DIVTIFDAEDQPDPL 226 Query: 178 ELRLFNYLVERKD-----LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQ 232 +LR ++ D +Q + T D + G +P Sbjct: 227 QLRRAVHVFTHIDDDSVVCVQGKLSFHNSRDNILTEWFTAD-YGIWFGFLLPGMMVSRAP 285 Query: 233 VPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEA 292 +P G F R + + A+D ++TED D+G R+ + G + ++EA Sbjct: 286 IPLGGTSNHFRRDVLDRI------GAWDPFNVTEDADLGVRIADSGYRTAVLDSVTLEEA 339 Query: 293 KEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNY 352 +RQ+SRW G + W + Sbjct: 340 NVDAI------------------------NWIRQRSRWYKGYL------QTWLVHMRHPV 369 Query: 353 FLWRD-------RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMT 405 LWR R + +A + M+ L+L+ + +F G + + Sbjct: 370 RLWRILGTVAWLRFTLLIAGTPLIACVNMLFWLILVL--WVAGQPPVVADLFPGPIYYLA 427 Query: 406 LLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDP 465 L+ L FG + + G + ++S L + L+ +A + + Q+L + Sbjct: 428 LISLIFGNGAAIYMN--LIAIRENGRSDLVVSALLVPAYWLLMSVAAIKGVWQILVN--- 482 Query: 466 RRVAWDKTTHDFPS 479 W+KT H + Sbjct: 483 -PSYWEKTFHGLST 495 >UniRef50_A4T169 Putative uncharacterized protein n=4 Tax=Mycobacterium RepID=A4T169_MYCGI Length = 475 Score = 202 bits (513), Expect = 4e-50, Method: Composition-based stats. Identities = 87/490 (17%), Positives = 159/490 (32%), Gaps = 73/490 (14%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAI 73 L + LA + S +D +W+ + R + + Sbjct: 46 VLPAVTTVLAALYLASTIDR------HWLLVQGLRSPSLLTISDEEARAVPDNQLPVYTV 99 Query: 74 MVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 ++P +NE ++ N+ + I + +D T+R + V ++ Sbjct: 100 LLPVYNEPSIVHNLIAGVGRLEYPKDKLEILLLVEEDDIATRRAMATTELEA--VRLILV 157 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVER 188 P +K N + + ++DAED+ P++LR F L + Sbjct: 158 PNSQPKTKPKACNYGM-------ATPGLKGEMVTIYDAEDIPDPLQLRKTVVAFQQLPDN 210 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 IQ + F E T + E+ + G +P EA VP G Sbjct: 211 VGCIQARLGYFNEEQNLLTR-WFSMEYDQWFGMTLPAVEAAGCVVPLGGTSNHMRTSVWR 269 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 A+ +D ++TED D+G RL G + ++EA Sbjct: 270 AI------GGWDEFNVTEDADLGVRLARAGYRTRILDSVTLEEANSDVL----------- 312 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR--DRKGAI---- 362 +RQ+SRW G + L LW KG + Sbjct: 313 -------------NWIRQRSRWYKGYL------QTMLVHLRHPAALWSQVGGKGILRLLN 353 Query: 363 -SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 + V +A++ ++ + A+ PD + +T+ + L V + Sbjct: 354 MTGAVPIVAVINLVFWATMAAWVLGRPDVVELAFPGATYYVYLTMYVVGAPLSVFMGLIV 413 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS-V 480 + Y L + L+W ++ +A +A+ Q++ R W+KT H V Sbjct: 414 TQRLGKPYMWWAAAL--VPLYW--MLQSIAALKAVFQLVT----RPQFWEKTVHGLSDTV 465 Query: 481 TGDTRSLRPL 490 + RP Sbjct: 466 DVPNSTGRPT 475 >UniRef50_B9L0Q0 Glycosyl transferase, group 2 family protein n=2 Tax=Bacteria RepID=B9L0Q0_THERP Length = 635 Score = 201 bits (511), Expect = 8e-50, Method: Composition-based stats. Identities = 75/431 (17%), Positives = 136/431 (31%), Gaps = 69/431 (16%) Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVD 118 + L + ++VP + E V+ ++ E LDY I + +D +T Sbjct: 247 QALRDDELPMYTVLVPVYREANVVPHLIE-NLRNLDYPASKLEILLLIEEDDEETLAAAK 305 Query: 119 EVCARFPN-VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 AR P V +V P +K N L ++DAED P Sbjct: 306 --AARPPETVTFIVVPNGLPKTKPKACNVGLLFA---------RGEFLTIYDAEDRPEPD 354 Query: 178 ELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 +L+ F +Q + + T M + E+S + + L + Sbjct: 355 QLKKAILAFRKGSPDLVCVQAALNYYNATENLLTRM-FTLEYSYWFDYVLTGLDRLRLPI 413 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 P G F + L +D ++TED D+G R +G + +EA Sbjct: 414 PLGGTSNHFRVGRLREL------GGWDPFNVTEDADLGIRAAARGYRVGVINSTTWEEAN 467 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF 353 + +RQ+SRWI G + Sbjct: 468 ------------------------NHVGNWIRQRSRWIKGYL------QTVLVHTRHPLR 497 Query: 354 LWRD---RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN 410 L R R + A + L+ L + W ++ + N Sbjct: 498 LVRTAGIRNTFGFVLLIGGAPFAFLSLIPLWSLTLTWIVTRTHAFDILFPPVVLYISLFN 557 Query: 411 FGLMVNRIV---QRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR 467 + ++ F Y L L + +W +++ +A+++A+ Q++ + Sbjct: 558 LLIGNGVMIYLGMLAGFKRRRYQLIPFAL-LNPFYW--ILHSIASYKAVWQLIT----KP 610 Query: 468 VAWDKTTHDFP 478 W+KT H Sbjct: 611 FYWEKTRHGLS 621 Score = 109 bits (273), Expect = 3e-22, Method: Composition-based stats. Identities = 32/147 (21%), Positives = 59/147 (40%), Gaps = 2/147 (1%) Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAE 539 TRS +G+ L+ ++ E L+ AL R G R+G ++ GL+ +QL Q LAE Sbjct: 11 PQVTRSRERIGEALVSRGLLRPEDLERALEYQRRTGDRIGRILIALGLVKRQQLYQVLAE 70 Query: 540 QNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTR 599 G + + + + L + P+R +E+ V + + + Sbjct: 71 LWGHPYVDLLREPLDARLARLFDPEALVQRRCFPVRRVGNEVFVATAEPPGAELEEYIRS 130 Query: 600 KVGR-KVRYVIVLRGQIVTGLRHWYAR 625 +G VR ++ I +R + Sbjct: 131 VLGSVTVRPLVTSEWDIDYAIRTIFRD 157 >UniRef50_A3UGE7 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UGE7_9RHOB Length = 523 Score = 200 bits (509), Expect = 2e-49, Method: Composition-based stats. Identities = 81/446 (18%), Positives = 144/446 (32%), Gaps = 60/446 (13%) Query: 48 KLSVYRRYPRMSYRELYKPDEKP-LAIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVG 105 +++ PR + R + P ++++V +E V+ + L+ I + Sbjct: 88 RIAAAIMPPRYADRTTVSDQDLPVISVIVALHDEARVLPGLIAALSRLNYPRSKLDIILA 147 Query: 106 TYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGF 165 +D T+ + R + VV GP +K LN L Sbjct: 148 LEAHDQPTRAAARALAGRK-ALRVVVLPPLGPMTKPRALNVALQTA---------RGELV 197 Query: 166 ILHDAEDVISPMELRLFN---YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKD 222 ++DAED P +LR +R +IQ P+ + R T+ + E++ Sbjct: 198 AVYDAEDAPHPDQLRQAAECFAADDRLGIIQAPLGWYNRTENWLTAQ-FALEYATQFNAL 256 Query: 223 VPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEI 282 +P+ L +P G F + A+ A +D ++TED D+GFR+ G Sbjct: 257 LPLLARLGWPLPLGGTSNIFRQSALVA------CGGWDPFNVTEDADLGFRMARSGWRAG 310 Query: 283 FVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTH 342 V ++EA P T Q+SRW+ G Sbjct: 311 LVAPGTLEEA------------------------PITLRAWTHQRSRWLKG------HFI 340 Query: 343 KWTSSLTLNYFLWRDR--KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGS 400 W + L G S + LA ++ + L+ Sbjct: 341 TWLVHMRDPRGLVDALGWGGVTSLTFTVLANMMSALIHAPSLLMMGAGALLLGLAPGWSV 400 Query: 401 AWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVL 460 W + + I V + + + +W L+ A +AL+++ Sbjct: 401 LWTVGAALMTCAYASAMICAGVAARRAGFSPRLSHMLSMPAYW--LLQAPAALKALRELP 458 Query: 461 QHGDPRRVAWDKTTHDFPSVTGDTRS 486 + WDKT H +T Sbjct: 459 RQ----PYLWDKTQHGVSRARRETPD 480 >UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038DF5C Length = 405 Score = 200 bits (508), Expect = 2e-49, Method: Composition-based stats. Identities = 89/455 (19%), Positives = 158/455 (34%), Gaps = 55/455 (12%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAI 73 L+ I + L I + + F I + + + + + + + L ++I Sbjct: 3 ILQGIGLALIAIGLVYVIYQFPI-IYFGYKDFTKYDIDFSKLNEAQFSGLKMYKP-MVSI 60 Query: 74 MVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 +VPA NE VI E + Y N+ +FV + +T + E +R V+ V Sbjct: 61 IVPAKNEETVIKRTIE-SILNQTYTNFELFVVVDNSSDNTYKIAKEYESRDKRVN--VFN 117 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL--RLFNYLVERKDL 191 RP SKA LN FE++ A +DA+ ++ P L ++ D+ Sbjct: 118 RPDGKSKASALNFC------FEKTKGEVIAT---YDADTMLLPNTLENAVYGMNYFNVDV 168 Query: 192 IQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALL 251 +Q RE FT + IDE L + R VP AG F R+ + ++ Sbjct: 169 LQGYNSYINREENIFTRLAVIDEI--LVKATLIGRTHFNLFVPVAGSNQYFKRKVIESI- 225 Query: 252 ADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMIC 311 +D LTED + R+ ++ Sbjct: 226 -----GGWDDNFLTEDLESSIRISNARYKSAYLGSAKAL--------------------- 259 Query: 312 VREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAM 371 + P ++S RQ++RW+ G F + K S T L ++ S + Sbjct: 260 --QETPASYSEYFRQRTRWLRGYHQVFFHSKKRFSKFTDFDALM----IVLAPTFSGILF 313 Query: 372 LVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGL 431 + + LL Y + + ++ ++ L L+ R Q I++ Y Sbjct: 314 FGWLYISLLNFYNPFVHSMRTYFISLILISLIIYVVALVLVLIKKR--QNFIYIPLIY-- 369 Query: 432 TQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPR 466 L+ L + + R +V + G Sbjct: 370 IYLTLNSLIAIYTLFLEITGAKRVWHKVKKTGKTT 404 >UniRef50_B8HEE8 Glycosyl transferase family 2 n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HEE8_ARTCA Length = 664 Score = 199 bits (505), Expect = 4e-49, Method: Composition-based stats. Identities = 82/490 (16%), Positives = 161/490 (32%), Gaps = 81/490 (16%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDE--K 69 L + V+ + ++ L F D ++ K + +RR PD Sbjct: 220 LTAVNVVFLVSIGFKTVASLRQPF-DALHDRSAAKARAREFRRRGLPVEEVARIPDADLP 278 Query: 70 PLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN-V 127 I++P + E +I + + V +D +T +R P V Sbjct: 279 VYTILIPVFREANIIDKLLSNLGQLDYPRSKLDVLVLLEEDDTETIEAAKR--SRPPEYV 336 Query: 128 HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF----- 182 +V R P +K N L +++DAED P +LR Sbjct: 337 RILVVPRGEPQTKPRACNYGLTFA---------RGEYVVIYDAEDRPDPGQLRAAIHAFR 387 Query: 183 ------NYLVERKD---LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 YL + +Q + F + T + + E++ +P + + Sbjct: 388 KDAFERQYLDPDRRPLICVQAALNYFNADQNVLTRL-FTIEYTHWFDSMLPGLDRSGIPL 446 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 P G F R + + A+D ++TED D+G R +G + +EA Sbjct: 447 PLGGTSNHFDTRLLRLV------GAWDPWNVTEDADLGLRAAVEGYRVGVINSTTWEEAC 500 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF 353 ++Q++RWI G + + Sbjct: 501 ------------------------SQVPAWIKQRTRWIKGYMVTAA------VNTRNTLR 530 Query: 354 LWRDRKGAISNFVSFLAMLV-----MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLW 408 ++ ++ V FL +++ + L+L + + ++F+ + L+ + Sbjct: 531 YI--QRTGVAGAVGFLGLILGTPLAFLAYPLVLGFTIVTYVGYNFVGLVLPEWLLVGGVV 588 Query: 409 LNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL-RLFWGNLINFMANWRALKQVLQHGDPRR 467 IV + +G + ++L +W +++ +A WRA Q+L Sbjct: 589 SMLFGNAMMIVVSGVATWRRHGWRIAIFALLNPAYW--VLHSVAAWRAAWQMLTSPHK-- 644 Query: 468 VAWDKTTHDF 477 W+KT H Sbjct: 645 --WEKTPHGL 652 Score = 70.6 bits (171), Expect = 2e-10, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 62/177 (35%), Gaps = 6/177 (3%) Query: 478 PSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQA 536 P G R LGQ LL+ +IT +QLD AL R EG LG ++++ ++ + + Sbjct: 17 PLAAGPGRQSLALGQTLLQAGLITTDQLDRALQRAATEGGLLGRHIILETGLNRRHVYEV 76 Query: 537 LAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAA 596 LAEQ + + +L+ + S LP LE+ L V + AA Sbjct: 77 LAEQWDAPLVDLVSHPSDDALLERLQFSEVSEPGWLPWHLEDGVLTVATAVKPSEEIRAA 136 Query: 597 LTRKVGRKVRYVIVLRG-----QIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQ 648 R G I R+ L + LT Q Sbjct: 137 AMRATGATDVVFRTTTDWDINHSIQRAFRNHLLYESAERLAEELPDGSARTALTRWQ 193 >UniRef50_B5ZHV8 Glycosyl transferase family 2 n=2 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=B5ZHV8_GLUDA Length = 624 Score = 198 bits (503), Expect = 8e-49, Method: Composition-based stats. Identities = 83/427 (19%), Positives = 139/427 (32%), Gaps = 61/427 (14%) Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVD 118 R L D I+VP + E V+ + A L+Y + + +D +T Sbjct: 238 RSLEDRDFPVYTILVPMYKEPDVLPILV-NAIRNLEYPQSKLDVKLVLEEDDIETIAAAR 296 Query: 119 EVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME 178 ++ A + P +K N L ++DAED + Sbjct: 297 KL-ALEATFEIICVPPSEPRTKPKACNYALRFA---------RGEYLTIYDAEDKPEATQ 346 Query: 179 LR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVP 234 L F L + IQ + + T M + E++ +P E + +P Sbjct: 347 LEKVLVAFRKLPDNVVCIQARLNYYNATENWLTRM-FTLEYTAWFDFYLPALEYMRIPIP 405 Query: 235 SAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKE 294 G F A+ A+ A +D ++TED D+G RL ++G V +EA Sbjct: 406 LGGTSNHFKISALRAVHA------WDPYNVTEDADLGVRLTQRGWKVAVVDSTTFEEANV 459 Query: 295 REQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFL 354 + +RQ+SRW+ G + + + Sbjct: 460 ------------------------SIPNWIRQRSRWLKGYM------QTYLVHMRSPLAF 489 Query: 355 WRDRKG-AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSG--SAWLMTLLWLNF 411 +R G F F+ M LL + + L SG S +M L LN Sbjct: 490 YRKTGGTGFWGFQFFIGGTFMTALLAPIFWVFFILFTLFGLKAGSGVFSGRIMALNALNL 549 Query: 412 GLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWD 471 L +V + + L L + +A ++ L Q+L + W+ Sbjct: 550 LLGNGFLVYTYVLCSFKRNYRHLALYALTTPVYWALQSIAAYKGLFQLLY----KPFYWE 605 Query: 472 KTTHDFP 478 KT H Sbjct: 606 KTQHGLS 612 Score = 66.3 bits (160), Expect = 4e-09, Method: Composition-based stats. Identities = 25/110 (22%), Positives = 43/110 (39%), Gaps = 4/110 (3%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 R GQ L+ ++TE QLD A++ ++ RLG +L +G + + LA Sbjct: 4 VTQPDKRLFGQFLVSRNILTEAQLDEAIQTQKLWKSRLGDIILAKGWLKPRRFYHLLATF 63 Query: 541 NGVAWESIDAWQIPSSLI-AEMPASVALHYAVLPLRLE-NDELIVGSEDG 588 + + + L M A + LP R + +I+ D Sbjct: 64 FDLEFVDLMGHPPDPDLFDRAMIDEYARR-SFLPWRRSADGAIILALADP 112 >UniRef50_C6CY56 Type II secretion system protein E n=3 Tax=Bacillales RepID=C6CY56_PAESJ Length = 554 Score = 197 bits (501), Expect = 1e-48, Method: Composition-based stats. Identities = 56/249 (22%), Positives = 108/249 (43%), Gaps = 15/249 (6%) Query: 487 LRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 + LG +L+E+ +I+EEQL AL +LG ++ QG I+ +QL + L Q G+ Sbjct: 5 KKRLGDLLVESAIISEEQLQKALLEQSKSKQKLGDLLIAQGYITEQQLIEVLEFQLGIPH 64 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 S+ +QI + +P S+A Y +PL+ + +L+V D +D ++ L G ++ Sbjct: 65 VSLYKYQIDPEITQIIPESMAKRYQAIPLQKDGGKLMVAMADPLDYFAIEELRMSTGFRI 124 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE 665 I + ++ + Y + +M + E + EI + P L + Sbjct: 125 EPAISSKDELQRAIARHYGLQ-----DSMSQMMIDLPTQEEIRETEITDEDSPVVRLVNQ 179 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLL 725 ++ + S I+V + + +GV+ E R L Q + ++ L Sbjct: 180 MIQQAVQLRASDIHV-----DPGETSVTIRYRIDGVLRTE---RALPKQMQGFITA-RLK 230 Query: 726 LKAGLNTEQ 734 + + LN + Sbjct: 231 IMSKLNIAE 239 >UniRef50_A4WQA1 General secretory system II, protein E domain protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WQA1_RHOS5 Length = 628 Score = 196 bits (499), Expect = 2e-48, Method: Composition-based stats. Identities = 87/478 (18%), Positives = 164/478 (34%), Gaps = 63/478 (13%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 + + +++ L F RI ++ RR P ++++V Sbjct: 193 LMLAPGLVVLALSLWALFAMTCGTALRIATAIATLRRRP-ADPPCPPLLRLPIVSVIVAL 251 Query: 78 WNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 + E + G + L ++ I + D T++ + E P + V+ Sbjct: 252 YQEEDIAGRLVARLGRIDYPHDRLEILLVVEEADLRTRKALVE-ARLPPWMRIVISPAGA 310 Query: 137 PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL----FNYLVERKDLI 192 +K LN LD + ++DAED P ++R F+ + + Sbjct: 311 IRTKPRALNVALDHC---------RGSIVGVYDAEDAPDPDQIRRVVEGFSRRGSQVACL 361 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 Q + + T++ S + E++ +P + L VP G F R A+ L Sbjct: 362 QGQLDYYN-PRTNWLSRCFTIEYASWFRLMLPGLDRLGLAVPLGGTTLFFRREALEDL-- 418 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 A+D ++TED D+G RL G + +EA R Sbjct: 419 ----GAWDAHNVTEDADLGIRLARHGYRTDLIDTVTGEEANCRALP-------------- 460 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-RKGAISNFVSFLAM 371 ++Q+SRWI G + W + LWR + F Sbjct: 461 ----------WIKQRSRWIKGFMM------TWAVHMRDPVLLWRQLGPWRFAGFQVMFLG 504 Query: 372 LVMIQLL--LLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVT-GY 428 + LL +L ++ L H ++ + L ++ L G I ++ + Sbjct: 505 SLSQTLLAPVLWSFWLLALGLPHPVTPLLSTPALWAIVGLLLGAEGTSIALGILALRLTR 564 Query: 429 YGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRS 486 + L + + L+ N + A ++AL ++L+ WDKT H + + Sbjct: 565 HKLNPLWVPTMHLY--NPLATFAAYKALWELLR----APFYWDKTRHGLFDGSSRGPA 616 Score = 64.8 bits (156), Expect = 1e-08, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 47/131 (35%), Gaps = 1/131 (0%) Query: 490 LGQILLENQVITEEQLDTALRNRVEGLR-LGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG +LL + ++ ALR L +L +G + E++ A Q+G+ Sbjct: 18 LGVMLLRQGHLAPHRIMGALRRSSGHAAGLADVLLAEGAMDEEEILALTARQSGLPLLDP 77 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 LI + L +LP+ +++ + L ++ +V V Sbjct: 78 ATGAADPRLIDRLGVRTCLRETLLPVHDVGGAVLIAAPSPESFRRHGPLLGQLFGRVIPV 137 Query: 609 IVLRGQIVTGL 619 + R I L Sbjct: 138 LATRTAIEGAL 148 >UniRef50_C6QAR9 Glycosyl transferase family 2 n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QAR9_9RHIZ Length = 506 Score = 196 bits (497), Expect = 3e-48, Method: Composition-based stats. Identities = 78/468 (16%), Positives = 146/468 (31%), Gaps = 61/468 (13%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 L + + L I V V + + + +++VP + E Sbjct: 86 ALILWWVVLALPFLMIATVRLVAVWYVVRRQPKHWRGPLDDRRFDARLPTFSVLVPVYKE 145 Query: 81 TGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT 138 V+ + A +DY I T +D T++ + + N+ + P Sbjct: 146 EAVVPGLVA-AMRRIDYPPDRVEILFITEEHDQPTRQALLQS-NLAQNMRVLTVPAGHPQ 203 Query: 139 SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQI 194 +K LN L ++DAED+ +LR R +Q Sbjct: 204 TKPRALNFALQEAGGI---------LVAVYDAEDIPDRDQLRRAAAAFVAGGPRLACVQA 254 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 + + + + F+ + E+ L +P L +P G F R + Sbjct: 255 QLTIYNAKQSFFSRQ-FALEYKALFSGLLPALAFLKLPIPLGGTSNHFRRDLLRK----- 308 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 +D ++TED D+G R+ G +R +EA Sbjct: 309 -CGGWDPFNVTEDADLGIRIARLGYDVAVIRSETSEEA---------------------- 345 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG-AISNFVSFLAMLV 373 P + T Q++RWI G + + + LWRD F + ++ Sbjct: 346 --PTEWRTWCGQRTRWIKGWI------QTYLVHMRHPLRLWRDLGTWQFIGFQIMIGGMI 397 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 + L+ Y + A ++ A L + + + I Sbjct: 398 LSILVHPWFYVLIVNKAMSGAALMPAGAALQWIFSAHLMIGYGAAWLLTIVTARGSISGL 457 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 L ++W + A +RA+ ++ R W+KT H + Sbjct: 458 WAAIWLPIYWLAI--SWAAYRAVIDLI----FRPFHWEKTAHGAGASH 499 >UniRef50_B9R454 Glycosyl transferase, group 2 family protein n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9R454_9RHOB Length = 617 Score = 194 bits (493), Expect = 9e-48, Method: Composition-based stats. Identities = 88/506 (17%), Positives = 159/506 (31%), Gaps = 79/506 (15%) Query: 12 LYGLKVIAITLAVIMF-----ISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 L + +I LAV + ISG F + V + + + +L Sbjct: 162 LPAVLLILCFLAVFVLGVSQMISGALLFVVTGVVSLACFGAGIIRFVCAQSSQEEDLVYH 221 Query: 67 DEKPLA----------IMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQR 115 +PL+ ++VP + E GV+ ++ L A + I + +D +T Sbjct: 222 LPEPLSSGLIIWPRYTVLVPLYREAGVVPDLLRALNALNYPRDRLQILLLMETDDLETAA 281 Query: 116 DVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVIS 175 + E + +V P +K L+ L A T + DAED Sbjct: 282 ALPEDLPSH--IEALVVPDGTPRTKPRALDYGLAAATG---------TYVTVFDAEDRPD 330 Query: 176 PMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAG 231 P +L+ LF +Q + T F S + E++ L + +P Sbjct: 331 PDQLKKAAFLFAKGPAELACLQARLVVDNANET-FISRQFALEYACLFDQLLPWLFRHRW 389 Query: 232 QVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDE 291 P G F A+ A+ +D ++TED D+G RL+ G + Sbjct: 390 PFPLGGTSNHFRISALHAV------GGWDRYNVTEDADLGVRLERLGFRLGVL------- 436 Query: 292 AKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLN 351 E P T + Q++RW G + Sbjct: 437 -----------------PCQTLEEAPVTLKAWLAQRARWHKGWL------QTIFVHARSP 473 Query: 352 YFLWRD----RKGAISN-FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTL 406 L D R ++ F+ ++ + + +L SL H + +M + Sbjct: 474 RRLLSDLGAVRTAVLAALFLGTFLLIALHPVFFVLLTGSLLGYYDHTYFFGNIVLTVMFV 533 Query: 407 LWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPR 466 G +G+ + ++W + A +RA+ ++ Sbjct: 534 SGAAAGYAGALFALWTGARRRGHGIRLLDAPGVLIYW--MFAGFAFYRAVWELA----SA 587 Query: 467 RVAWDKTTHDFPSVTGDTRSLRPLGQ 492 W+KT H +PL + Sbjct: 588 PYRWNKTEHGVSRQRTSLTDWKPLPE 613 >UniRef50_B8H475 N-acetylglucosaminyltransferase n=3 Tax=Caulobacter RepID=B8H475_CAUCN Length = 497 Score = 193 bits (491), Expect = 2e-47, Method: Composition-based stats. Identities = 79/444 (17%), Positives = 144/444 (32%), Gaps = 63/444 (14%) Query: 39 VYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE 98 ++ + + R + PR L + D ++ P + E V+ + + +DY Sbjct: 101 IFLLGGLTRLAAAMTPLPRHHSPALAEADLPSYTLITPLYREAEVLPELVA-SLAAIDYP 159 Query: 99 NYHI--FVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG-PTSKADCLNNVLDAITQFE 155 + + +D T+ + P+ +V+ PG P +K N L E Sbjct: 160 RDRLQALIVLEADDEVTRAAARAL--DLPSFIQVLVVPPGTPRTKPRACNYAL------E 211 Query: 156 RSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTY 211 R+ +++DAED+ P +LR F R +Q P+ + ++ F + Sbjct: 212 RARG---DLVVIYDAEDMPDPGQLREAAARFAASDARLACLQAPLRIEDPGFSLFLPSQF 268 Query: 212 IDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIG 271 E++ +P P G F + + A+D ++TED D+G Sbjct: 269 RLEYAAHFEVLLPALARWGLPFPLGGTSNHFKIAPLREI------GAWDPYNVTEDADVG 322 Query: 272 FRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWI 331 FRL G + P + A P T + Q++RWI Sbjct: 323 FRLAAAGYRLDVIHRPTWETA------------------------PTTRAQWFPQRARWI 358 Query: 332 IGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAW 391 G R + AI+ ++ + L + ++ Sbjct: 359 KG------HMQTLAVHARGPVP--RQPRNAIALILTLAQSVASSHLHGPVMGVAIALALV 410 Query: 392 HFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMA 451 FL + L+ G + + L + +W L +A Sbjct: 411 DFLPDAAFQIPPHDLVLYFAGWGAAALAGARGVMRAGGRPKALHLLGMPAYW--LCQSVA 468 Query: 452 NWRALKQVLQHGDPRRVAWDKTTH 475 +AL Q + WDKT H Sbjct: 469 AVKALHQFVT----APHHWDKTLH 488 >UniRef50_B8D2C7 Tfp pilus assembly protein PilB n=5 Tax=Firmicutes RepID=B8D2C7_HALOH Length = 558 Score = 193 bits (491), Expect = 2e-47, Method: Composition-based stats. Identities = 57/257 (22%), Positives = 106/257 (41%), Gaps = 15/257 (5%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQN 541 ++ LG++LL+ ITE+QL+ AL+ + G +LG ++ G ++ L Q L Q Sbjct: 2 TRTHIKKLGELLLDFNFITEKQLNEALKKQNKSGKKLGEILVESGYLNENDLIQVLEFQL 61 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 G+ ++ + I L +P ++A + V+PL +N +L V D + V++ + Sbjct: 62 GIPHADLNKYVINPHLAQYIPENIARRHNVVPLEKKNGKLKVAMVDPTNLVAIEDIEMTS 121 Query: 602 GRKVRYVIVLRGQIVTGLRHWYA--RRRGHDPRAMLYNAVQHQWLTEQQAG-EIWRQYVP 658 G KV +I R I L Y+ + A L + + + P Sbjct: 122 GLKVEPLIASRKNIKMALNQIYSVNDSDAAEVFASLNEVTTKTNEEPELNELKEMIEDAP 181 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 L I+ + S I++ E + +GV L +T+ + Q Sbjct: 182 IVRLANLIINQAIQMKASDIHI-----EPQEDQVRVRYRVDGV-----LRENMTVPKHSQ 231 Query: 719 VSMQS-LLLKAGLNTEQ 734 ++ S L + A L+ + Sbjct: 232 AALISRLKIIADLDITE 248 >UniRef50_A3TTM2 Glycosyl transferase, family 2 n=1 Tax=Oceanicola batsensis HTCC2597 RepID=A3TTM2_9RHOB Length = 650 Score = 193 bits (490), Expect = 2e-47, Method: Composition-based stats. Identities = 84/427 (19%), Positives = 134/427 (31%), Gaps = 64/427 (14%) Query: 71 LAIMVPAWNE---TGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNV 127 ++I+VP + E V+ E + +D T R A P + Sbjct: 261 MSILVPLYREDRVASVLPRRLER--LDYPRARLDVIFVLEESDDVT-RAALAAAALPPWI 317 Query: 128 HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFI-LHDAEDVISPMELRL----F 182 V P +K +N L +F I ++DAED P +LR F Sbjct: 318 RIVTVPDGQPRTKPRAMNYAL----------DFCIGDIIGIYDAEDAPEPDQLRKVAAGF 367 Query: 183 NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 +Q + + + + + E++ +P L VP G F Sbjct: 368 AAASGETACLQGALDYYNAAEN-WITRCFTIEYNTWFRLVMPGMAKLGFAVPLGGTTAFF 426 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 R A+ A+ A+D ++TED D+G RL G + +EA R Sbjct: 427 RRDALEAV------GAWDAHNVTEDADLGMRLARAGYRTRVIDTATGEEASARPVS---- 476 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD--RKG 360 VRQ+SRW+ G + + + L RD Sbjct: 477 --------------------WVRQRSRWLKGYLM------TYAVHMRRPRALLRDLGPWQ 510 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 + FL ++ L LL L I + M LL F ++ Sbjct: 511 FLGFQAHFLTAILHFALAPLLWLFWLVIFGVDLPLIAIDTGPAMRLLATAFLGFELLVMT 570 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 T + L + +A W+AL ++ R WDKT H Sbjct: 571 LGAIATRGPRHRFLWPWIPSLHLYWPMGTLAMWKALVELA----CRPFYWDKTEHGHTLT 626 Query: 481 TGDTRSL 487 D ++ Sbjct: 627 EEDQPTV 633 Score = 43.6 bits (101), Expect = 0.029, Method: Composition-based stats. Identities = 21/104 (20%), Positives = 35/104 (33%), Gaps = 4/104 (3%) Query: 525 QGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVG 584 +G ++ +L LA G + L L + LP R D + + Sbjct: 66 EGRLTGPRLLTELAALTGAQPVDLGLTPADPGLARRFDGPTCLRHECLPWRQTADTIWIA 125 Query: 585 SEDGIDPV-SLAALTRKVG---RKVRYVIVLRGQIVTGLRHWYA 624 + +LA L G + R V+ R I +GL + Sbjct: 126 TARPERFDRALADLMPGTGPHPPETRMVLADRSAIQSGLAKVHG 169 >UniRef50_B4W7G8 Putative uncharacterized protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4W7G8_9CAUL Length = 461 Score = 193 bits (489), Expect = 3e-47, Method: Composition-based stats. Identities = 81/483 (16%), Positives = 146/483 (30%), Gaps = 65/483 (13%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFI--DVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEK 69 + L + ++ S L FI + + + R V + Sbjct: 24 VASLATALVAAGIVWPRSTLSASFIGIQMGFVASALWRAALVIACLRPTPSSPKPSRWPR 83 Query: 70 PLAIMVPAWNETGVIGNMAELAATTLDYENYHI--FVGTYPNDPDTQRDVDEVCARFPNV 127 I+ +E V+ + + + +DY + F+ +D T D + R + Sbjct: 84 -YTILAALHDEAAVVPQLIQR-LSKIDYPRRQLEGFLVLEAHDQATI-DAAKAARRPGWL 140 Query: 128 HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV- 186 +V P +K LN+ L T ++DAED P +LR Sbjct: 141 RILVVPPGAPKTKPRALNHALAFATG---------ELLTIYDAEDEPDPGQLREAASRFA 191 Query: 187 --ERKDLIQIPVYPFER--EWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 + +Q P+ R E + F + E++ L ++P L P G Sbjct: 192 GQPQLGCLQAPLRIRRRNAELSTFLDRQFAFEYAALFEVNLPGMAKLNLPFPLGGTSNHL 251 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 A+ + +D ++TED D+GF+L G + P + Sbjct: 252 RTAALRRV------GGWDAYNVTEDADLGFKLWSAGWRLGVLESPTWEAP---------- 295 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI 362 P + Q++RW+ G + W L R + ++ Sbjct: 296 --------------PGALERWLPQRTRWLKGYM------QTWGVHTRAPRALGRRGQLSL 335 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV 422 + + + L + A + +L + Sbjct: 336 AMTLGAAIVSAASHAPTLAWLIAALMVALNIGMAPVVPLASFAVLAVGVIAAWMGCAVGA 395 Query: 423 IFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 Y LT L + +W L +A AL +++ AWDKT HD Sbjct: 396 RRAGQDYRLTDMLAA--PAYWSLL--SLAFIHALWRLI----VAPYAWDKTAHDAEIDAE 447 Query: 483 DTR 485 T Sbjct: 448 CTS 450 >UniRef50_A3VHJ3 Glycosyl transferase, family 2 n=2 Tax=Rhodobacterales RepID=A3VHJ3_9RHOB Length = 684 Score = 193 bits (489), Expect = 3e-47, Method: Composition-based stats. Identities = 90/433 (20%), Positives = 153/433 (35%), Gaps = 67/433 (15%) Query: 64 YKPDEKPL-AIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 + P +P+ +I+VP + E + G + L T E I + +D TQ + Sbjct: 286 HDPRRRPVVSILVPLYREREIAGRLVKRLERLTYPRELLDICLIVEEDDTLTQETLSN-- 343 Query: 122 ARFP-NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL- 179 AR P + ++ R G +K LN LD ++DAED ++ Sbjct: 344 ARLPAWMRQITVPRGGVRTKPRALNFALDFA---------RGTIIGVYDAEDAPDADQID 394 Query: 180 ---RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 F R +Q + + T++ S + E++ +P E + VP Sbjct: 395 RIVARFAEAPPRVACLQGMLDYYN-ARTNWLSRCFTIEYATWFRIVLPGMEKMGFAVPLG 453 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G F R + L +D ++TED D+G RL G T V +EA R Sbjct: 454 GTTLFFRRGVLEQL------GGWDAHNVTEDADLGIRLARLGYTTELVETVTKEEANCRV 507 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 ++Q+SRWI G + + LWR Sbjct: 508 WP------------------------WIKQRSRWIKGYAM------TYGVHMRDPRRLWR 537 Query: 357 D---RK--GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNF 411 + R+ G F+ L+ L++ +L +Y L H L+ + + Sbjct: 538 ELGARRFWGVQIVFLGTLSHLILAP--VLWSYWLLAFGLPHPLAEVMPGWVVWAMFATFL 595 Query: 412 GLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWD 471 I +I V+ G L V L + +A + L ++++ + WD Sbjct: 596 TAEAINIAVGMIAVSTP-GRRFLKLWVPTLHVYFPLASLAALKGLGEIVR----KPFFWD 650 Query: 472 KTTHDFPSVTGDT 484 KT H + +T Sbjct: 651 KTQHGHDDLVEET 663 Score = 72.5 bits (176), Expect = 5e-11, Method: Composition-based stats. Identities = 28/131 (21%), Positives = 50/131 (38%), Gaps = 1/131 (0%) Query: 490 LGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG+ L++ ++ E L A + R LRLG +L +G ++ LA+ALA+ + Sbjct: 62 LGEKLVDMGILRPEDLLAARKARAGTALRLGDVLLARGFVTPYTLARALAKVYDTTLVDL 121 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 LI + L +P + D I+ + + + R V Sbjct: 122 RRDPPDVRLIDAVGLDACLRLGFVPWKRVGDTTIIACACPDRFARIRSGLPESFGACRMV 181 Query: 609 IVLRGQIVTGL 619 + +I L Sbjct: 182 VAQGNEIDAAL 192 >UniRef50_C4L4L3 Type II secretion system protein E n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L4L3_EXISA Length = 556 Score = 192 bits (488), Expect = 4e-47, Method: Composition-based stats. Identities = 53/252 (21%), Positives = 108/252 (42%), Gaps = 17/252 (6%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 R+ R LG++L+E +IT QLD AL + G +LG +++ I+ QL Q + EQ V Sbjct: 2 RRTKRRLGEMLIEAALITTNQLDEALEQKRPGEKLGDALIRLNHITETQLIQMIHEQLHV 61 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + ++ I ++ +P ++A + ++P L + L V + D +D +++ + + G Sbjct: 62 PIIELYSYDINVTVTKLVPKALAQKHDIMPFELNGNTLHVATADPLDLIAIDDVRLQTGM 121 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 + I R QI + +Y + L ++ ++ I R P L Sbjct: 122 NIEIGIATREQIRKTISRYY------EMDHSLVEILKEDAPEIERQETISRDDAPIIRLV 175 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ-VSMQ 722 ++ + S I+ + L +G + ET+ +++Q V + Sbjct: 176 NQLFLSAIDQRASDIH-----FDPHEKQLHVRFRIDGDLRTETV-----YPKKIQSVMLT 225 Query: 723 SLLLKAGLNTEQ 734 L + + L+ + Sbjct: 226 RLKVMSNLDITE 237 >UniRef50_B5JBJ3 GSPII_E N-terminal domain family n=2 Tax=Octadecabacter antarcticus RepID=B5JBJ3_9RHOB Length = 631 Score = 192 bits (488), Expect = 4e-47, Method: Composition-based stats. Identities = 87/499 (17%), Positives = 163/499 (32%), Gaps = 74/499 (14%) Query: 2 DWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRK-LSVYRRYPRMSY 60 DW W GL V+ ++ ++ + F + I + L V + Sbjct: 164 DWNTGKAFRW--GLGVVLAIMSCLIVWPQISFFVLCGWAVFTLILKTILKVAAAVVHLFP 221 Query: 61 RELYKPDEKP-------LAIMVPAWNETGVIGNMAELAATTLDYENYHIFV--GTYPNDP 111 + P + I+VP + E + G + E + LDY + V +D Sbjct: 222 KPTSPMPANPQLAHLPIVTILVPLFRERDIAGTLIER-LSRLDYPTDRLDVCLVLEADDG 280 Query: 112 DTQRDVDEVCARFP-NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDA 170 TQ + + P + + +K LN L + ++DA Sbjct: 281 TTQNALAAT--QLPFWMRAIKVPLGTLQTKPRALNYALCFAKG---------SIIGVYDA 329 Query: 171 EDVISPMELRL----FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVR 226 ED +P ++ + F + +Q + + +++ + + E++ +P Sbjct: 330 EDAPAPDQIHIVVNRFAQRGQDVACLQGQLDFYN-SHSNWLARCFTVEYATWFRIMLPGL 388 Query: 227 EALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRF 286 E L +P G F R + L +D ++TED D+G RL G + Sbjct: 389 ERLGLAIPLGGTTLFFRREILEEL------GGWDAHNVTEDADLGIRLARHGYRTEIIDT 442 Query: 287 PVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTS 346 +EA R ++Q+SRW+ G + Sbjct: 443 VTQEEANARAWP------------------------WIKQRSRWLKGYAI------TYGV 472 Query: 347 SLTLNYFLWRDRKG-AISNFVSFLAMLVMIQLL--LLLAYESLWPDAWHFLSIFSGSAWL 403 + LWRD A + LL LL ++ + H L + Sbjct: 473 HMRSPLKLWRDLGAWRFFGLQLLFAGTISQFLLAPLLWSFWLMLLGLPHPLDNVLSTNVT 532 Query: 404 MTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHG 463 +T + + I I VT G + L + + +A ++ + ++ Sbjct: 533 LTFAAIFLLSEIINITVSAIAVTSP-GKYDLIKWTPTLHFYFPLAALAAYKGVIELAT-- 589 Query: 464 DPRRVAWDKTTHDFPSVTG 482 + WDKT+H + T Sbjct: 590 --KPFYWDKTSHGIFAPTQ 606 Score = 45.5 bits (106), Expect = 0.008, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 45/150 (30%), Gaps = 16/150 (10%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSML------MQGL--ISAE 531 + + R G +L +N V A + R G + GL + E Sbjct: 1 MANKLAAPRATGDVLGDNGV----GAARAFTQKDVNRR-GSVAIAVALGHRLGLPSTNPE 55 Query: 532 QLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDP 591 Q+ LAE LI A + L + VLP R + + V + Sbjct: 56 QIEAGLAET---CVVDPVITPADPRLIERFGAELCLKHRVLPWRSVSGRVTVLATSPDHF 112 Query: 592 VSLAALTRKVGRKVRYVIVLRGQIVTGLRH 621 + + V + IV + L Sbjct: 113 LRIRDALVVVFGPIHLAIVTTNNLDAALSR 142 >UniRef50_Q28T00 Glycosyl transferase family 2 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28T00_JANSC Length = 654 Score = 191 bits (485), Expect = 7e-47, Method: Composition-based stats. Identities = 78/449 (17%), Positives = 137/449 (30%), Gaps = 57/449 (12%) Query: 39 VYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDY 97 R R + P S + L+ P ++I+VP + E V G + E Sbjct: 247 FAAFGRTLRGAASPSATPETSAQFLHNPT---VSILVPLFREPEVAGALVERLRRLDYPR 303 Query: 98 ENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERS 157 E I + +DP T + + +V R P +K LN VL+ Sbjct: 304 ERLDIILAVEEDDPLTLSALQ-TGTLPAWMRAIVVPRGSPQTKPRALNYVLNYA------ 356 Query: 158 ANFAFAGFILHDAEDVISPMELRL----FNYLVERKDLIQIPVYPFEREWTHFTSMTYID 213 ++DAED P +++ F + +Q + + + + + Sbjct: 357 ---RGDIVGIYDAEDRPEPDQIQRVVQRFAEVPADVACLQGRLDYYNARHNWLSRL-FTV 412 Query: 214 EFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFR 273 E++ +P + L VP G R + A+D ++TED ++G R Sbjct: 413 EYAAWFRVLLPGVQRLGLVVPLGGTTVFLRRNVLE------GVGAWDAHNVTEDAELGLR 466 Query: 274 LKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIG 333 L G V +EA ++Q+SRW+ G Sbjct: 467 LARAGYQTEIVETTTFEEANAATLP------------------------WIKQRSRWLKG 502 Query: 334 IVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHF 393 + + +L WR I + L L L + WH Sbjct: 503 YLMTWGAAMRRPRALLNELGPWRFAWLQIQFAGAVLGFLTAPLLWSFMLKPFGV---WHP 559 Query: 394 LSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANW 453 + L + ++ + + L + L +A W Sbjct: 560 MDGVMSPFAYGVLGVVMVSGLIGSVAISFYACRAKH-LRHLRPIAPLVEPYYLFGTIAAW 618 Query: 454 RALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 L +++ + W KTTH T Sbjct: 619 IGLFELIA----KPFFWAKTTHGKFGATQ 643 Score = 80.2 bits (196), Expect = 3e-13, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 59/145 (40%), Gaps = 6/145 (4%) Query: 492 QILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 ++LL ++T +Q+ A +R L LG ++ QG IS +L LA+ GV + Sbjct: 48 ELLLARGLVTPDQMRAAQDASRGTSLSLGEILIAQGAISEPELLSTLAQTYGVGIADLSG 107 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV--GRKVRYV 608 +SL +PA+ ++ + + L++ + + L + G+++ V Sbjct: 108 DLPDTSLAPLLPAAASITAEAVIWKRAGSALVIATSRP---DRIQDLRALLPGGQRIMTV 164 Query: 609 IVLRGQIVTGLRHWYARRRGHDPRA 633 + R QI R Y Sbjct: 165 LASRNQITEAQRKLYGPHLARKAEG 189 >UniRef50_A0LUI4 Type II secretion system protein E n=4 Tax=Bacteria RepID=A0LUI4_ACIC1 Length = 553 Score = 191 bits (485), Expect = 8e-47, Method: Composition-based stats. Identities = 57/250 (22%), Positives = 107/250 (42%), Gaps = 15/250 (6%) Query: 487 LRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 ++ LG ILLE ++T EQL A ++ G LG ++ QG+++ QL ALA Q G+ + Sbjct: 1 MKQLGDILLEGGLVTPEQLAAAYAEHQRNGRSLGRVLVDQGILTEAQLVAALATQIGLRF 60 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 + I S ++ +P +V Y LP+ E+ +L+V D + ++ + G V Sbjct: 61 VDLTDVAIDGSAVSRVPEAVCRRYTALPIGYEDGKLVVAMADPANVFAIDDIRSITGLDV 120 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE 665 + V+ R ++ + ++ D A + + L + + + P Sbjct: 121 KPVVATRADVLAAINRYHRADEELDDLTSTLAAEETEDL---ASLDEVVEDAPIVKFVNL 177 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS-L 724 ++T S I++ E + L +GV L V+ R +Q + S L Sbjct: 178 LITQAVQDRASDIHI-----EPTERDLRVRFRIDGV-----LHEVMRSPRNIQSGVISRL 227 Query: 725 LLKAGLNTEQ 734 + A +N + Sbjct: 228 KIMADMNIAE 237 >UniRef50_A8LS31 Glycosyl transferase n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LS31_DINSH Length = 635 Score = 191 bits (484), Expect = 1e-46, Method: Composition-based stats. Identities = 80/486 (16%), Positives = 148/486 (30%), Gaps = 71/486 (14%) Query: 12 LYGLKVIAITLAVIMFISGLDD----------FFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 + GL + + A ++ I GL I + + + LS R Sbjct: 194 MMGLVCLYLATAALVLIPGLVTSALMWITLAVLAITMGFKLMLAVACLSARPPPQPPPER 253 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 +P ++I+VP E+ V + + D T+ + ++ Sbjct: 254 NKTRPPLPAMSILVPLLRESEVAEKLLNNLDRLRYPRALLDVLFVVEAEDDVTKNALSQL 313 Query: 121 CARFP-NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL 179 R P + R +K LN L ++DAED P +L Sbjct: 314 --RLPQGFRMLEVPRGTVQTKPRALNFALPFC---------RGEIVGIYDAEDRPDPDQL 362 Query: 180 ----RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPS 235 F + +Q + F + + + E++ G + + L +P Sbjct: 363 LKVAEGFRHAAPEVACLQGRLDFFNTRFN-LIARCFTAEYAGWFGLFLQGLDRLGLPIPL 421 Query: 236 AGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER 295 G R+ + + +D ++TED D+G RL G + +EA R Sbjct: 422 GGTTLFLRRKVLEEV------GPWDAHNVTEDADLGMRLYRHGYRVSLIDTVTQEEANVR 475 Query: 296 EQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW 355 ++Q+SRWI G + + + LW Sbjct: 476 IWP------------------------WIKQRSRWIKGYM------ATYAVHMRSPRALW 505 Query: 356 RD-RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLM 414 R G + ++ L + L + + A L L ++ Sbjct: 506 RALGPGGFAALQCLFLGSILSALTMPLLLWLWLGHLGAPIVPNAVLATLPPAHVLGPVML 565 Query: 415 VNRIVQRVIFVTGYYGL--TQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDK 472 V ++ G + L L++ +A +AL ++L + WDK Sbjct: 566 GIEAVNLALWAAGVRAARHRHLWPMLPLLHVYFLMSSIAALKALVELLY----KPFYWDK 621 Query: 473 TTHDFP 478 T H Sbjct: 622 TDHGIA 627 Score = 52.5 bits (124), Expect = 5e-05, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 52/173 (30%), Gaps = 1/173 (0%) Query: 464 DPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITE-EQLDTALRNRVEGLRLGGSM 522 P R D P L +L+ I+ E+L R++G R + Sbjct: 7 QPTRDTGDGLLAPVPRKPAPRPERELLADLLVRRGDISPAERLQAVHMARLQGRRPLEVL 66 Query: 523 LMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELI 582 +G ++ E L +A E + LI L + +LP R + + Sbjct: 67 EAEGWLAPELLLEARCEMYRAGRIDPEQAPADPDLIGAYGIDWCLTHGILPWRRLSGATV 126 Query: 583 VGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML 635 + + +AA + + I +R +A L Sbjct: 127 YLVAEPEEFADIAATLPPEAGQPVLAVAGADAIAEAIRAAHAPHLIARAETAL 179 >UniRef50_UPI0001B55850 glycosyl transferase family 2 n=1 Tax=Streptomyces sp. C RepID=UPI0001B55850 Length = 486 Score = 190 bits (483), Expect = 1e-46, Method: Composition-based stats. Identities = 93/466 (19%), Positives = 145/466 (31%), Gaps = 75/466 (16%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPD-EKPL-AIMVPAWNETGV 83 M + L + + + + P ++ PD E P ++VPA+ E GV Sbjct: 77 MAVLALLTAIVTAYSLLHVVLMLTGLGSGGPLAGADDVPLPDAELPFYTVLVPAYREAGV 136 Query: 84 IGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKA 141 IG + LDY + V +DP T R V R P V V P +K Sbjct: 137 IGGLVRH-LAELDYPPDRLEVLVLVERHDPGTARAVPA-AGRPPFVRLVRLPPGPPQTKP 194 Query: 142 DCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE----RKDLIQIPVY 197 +N L ++ DAED P +LR +Q + Sbjct: 195 RSVNLGLLLA---------RGELLVVFDAEDRPDPGQLRRVAARFAARGADLACVQAQLL 245 Query: 198 PFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGI 257 T + E++ +P L VP G F + A+ Sbjct: 246 FHNAAGNWLTRQ-FAMEYALRFTLALPGLVRLGMPVPLGGTSNHFRTATLRAV------G 298 Query: 258 AFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFP 317 +D ++TED D+G R G + +EA + Sbjct: 299 GWDAWNVTEDADLGMRCAAMGHRTETIGSVTWEEALGAVRP------------------- 339 Query: 318 DTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRK-------GAISNFVSFLA 370 VRQ++RW G + + + +++FV L Sbjct: 340 -----YVRQRTRWFKGFLLTTVVHTRRPRRTVSRFGGRGLLTLLGIVAGAPVTSFVQPL- 393 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 L + L+ L P L + + L W R R ++ Sbjct: 394 -LAALTLIGLCGLSW-SPAGAGLLLPSVAAQAVAALAWTAITFTAAR---RAGLGAPWHA 448 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 L L SVL F A WRA+ Q++ R +W+KT H Sbjct: 449 LLTPLCSVLWWF--------AAWRAVHQLV----FSRFSWEKTPHG 482 >UniRef50_D1NA10 General secretion pathway protein E n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1NA10_9BACT Length = 527 Score = 189 bits (481), Expect = 2e-46, Method: Composition-based stats. Identities = 82/511 (16%), Positives = 152/511 (29%), Gaps = 115/511 (22%) Query: 37 DVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL---AIMVPAWNETGVIGNMAELA-A 92 V++ + +S E+ DEK L I++P ++E + + Sbjct: 68 AAVFFRGGAAVLSWFGQGEEIVSDAEVAALDEKELPVYTILLPLYHEANIAEKIVRNMGR 127 Query: 93 TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAIT 152 E + + +D +T+ ++ P V P +K N L Sbjct: 128 LDYPKEKLDVKLLLEADDDETRLALERTG-LPPYCEVVTVPDAPPRTKPRACNFGL---- 182 Query: 153 QFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKD------LIQIPVYPFEREWTHF 206 R A F+ +++DAED P +L+ Y+V R+D +Q + F Sbjct: 183 ---RRARGEFS--VIYDAEDAPEPDQLKK-AYIVFRRDQEKKVLCVQGKLNYFNARHNLL 236 Query: 207 TSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTE 266 T + + E+S + + +P G F + + +D ++TE Sbjct: 237 TRL-FTVEYSTYFDLTLSGYQLFNLPLPLGGTSNHFRTAELREV------GGWDPFNVTE 289 Query: 267 DYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQ 326 D D+G R+ E+G V +EA +RQ Sbjct: 290 DCDLGIRIYERGYKTRLVNSTTYEEANA------------------------HVWNWIRQ 325 Query: 327 KSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA--------MLVMIQLL 378 +SRW+ G + + + R G F FLA + +I Sbjct: 326 RSRWVKGFI------QTHLVHYRNPFLTVK-RLGLYGAFGGFLAVGGSAMMMLTNLIFWT 378 Query: 379 LLLAYESLWPDAWH------------------------------------------FLSI 396 +LL Y L + Sbjct: 379 VLLIYAGLLIHGFSHGLGLYDQIVGPHLPGGAYEGIRLGGMSFRAWPLVYYGQGEDPFWA 438 Query: 397 FSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRAL 456 + L + F + + V + + ++ +W L+ +A W+ Sbjct: 439 VFSQIFFAGSLIMLFANFIFIGIGVAACVKRKFYYLIPVSLLMPFYW--LLISIAAWKGF 496 Query: 457 KQVLQHGDPRRVAWDKTTHDFPSVTGDTRSL 487 Q+ + W+KT H + L Sbjct: 497 IQIFT----KPFYWEKTIHGLTTDPITEEEL 523 >UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophilus torridus RepID=Q6KZU9_PICTO Length = 395 Score = 189 bits (481), Expect = 2e-46, Method: Composition-based stats. Identities = 84/387 (21%), Positives = 145/387 (37%), Gaps = 66/387 (17%) Query: 67 DEKPL-AIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP 125 + KPL +I+VPA NE VIG E + Y+N+ +FV +D DT R + + R Sbjct: 44 NYKPLVSIIVPAKNEETVIGRCIE-SILGQAYDNFELFVVVDNSDDDTYR-IAKSYERDG 101 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--LFN 183 VH V R G +KA LN +++ E +DA+ V+ L+ ++ Sbjct: 102 RVH--VFERHGNLTKASALNYAY-SMSHGE--------IIATYDADTVLEKNTLKNAVYG 150 Query: 184 YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 D++Q RE FT + IDE + + R L VP AG F Sbjct: 151 MRYMDADVLQGYNTYINREENIFTRLAAIDEI--IVKVSMIGRMYLHLFVPVAGSNQYFK 208 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 R + + ++ LTED + G R+ K M ++ Sbjct: 209 RETIRII------GGWNGNFLTEDLESGVRMAAKRMRSAYLPSAK--------------- 247 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAIS 363 V + P T+S ++Q+ RW+ G + K S L+ Sbjct: 248 --------VYQETPATYSEYIKQRIRWLRGYHQVLLHSKKELSGLSG------------- 286 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 + L +++ +L + S++ +F + + S + L M+ I Sbjct: 287 --LDILMIVLAPTFSGILLFSSIYISILNFYNPYVHSMRTYFISLLFIFFMIYIIA---- 340 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFM 450 FV + L + ++ ++N + Sbjct: 341 FVLALIKKKENALYIPLVYIYVVLNAI 367 >UniRef50_Q168M3 Glycosyl transferase, putative n=12 Tax=Rhodobacterales RepID=Q168M3_ROSDO Length = 635 Score = 189 bits (479), Expect = 4e-46, Method: Composition-based stats. Identities = 83/485 (17%), Positives = 158/485 (32%), Gaps = 60/485 (12%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYP--RMSY 60 W F + + L + + + + + LS R R S Sbjct: 188 WAAMAFCSLVSALVFAPAWSVTALALWAVITLLMTSTLKAAALFIHLSGARAVQGARASG 247 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDE 119 + P ++++VP E + G + L+ T ++ + D T++ + Sbjct: 248 KPFRMP---RVSVLVPLLKEKEIAGQLIARLSQLTYPKSLLNVVLVLEEGDTLTRQTIAR 304 Query: 120 VCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFI-LHDAEDVISPME 178 V G T+K LN L +F I + DAED + Sbjct: 305 TTLPDWMSVIEVPEAGGLTTKPRALNYAL----------DFCKGSIIGVWDAEDWPEADQ 354 Query: 179 LR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVP 234 + FN + +Q + + + + + E++ +P + VP Sbjct: 355 IEKVVTRFNTAPDNVVCLQGVLDYYNSRSSWL-ARCFTIEYAIWWRIVMPGIARMGLVVP 413 Query: 235 SAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKE 294 G F R A+ L +D ++TED D+G RL G + Sbjct: 414 LGGTTLFFKRTALEEL------GGWDAHNVTEDADLGVRLARHGYKTELL---------- 457 Query: 295 REQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFL 354 RE VRQ+SRW+ G + F + +L + Sbjct: 458 --------------PTVTREEATSRPWAWVRQRSRWLKGFMITYFVHMRRPGALLRDLGF 503 Query: 355 WRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLM 414 WR G + F++ ++ +L H +++ G+ + L Sbjct: 504 WR-FMGVQTIFLAAVSQFAAAPVLWSFWLTFFGVA--HPVAMTLGAPVMWGLAGFFIATE 560 Query: 415 VNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 ++ V+ V+G G + V + + + +A ++AL ++++ WDKT Sbjct: 561 ALSLLLGVVAVSGK-GHRHLIPFVPSMMFYFTLGTVAAYKALWELVR----APFFWDKTQ 615 Query: 475 HDFPS 479 H Sbjct: 616 HGVSV 620 Score = 87.1 bits (214), Expect = 2e-15, Method: Composition-based stats. Identities = 36/137 (26%), Positives = 56/137 (40%), Gaps = 1/137 (0%) Query: 489 PLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 P+G++LL+ I L AL R LG M+ +GLI+ + + ALA Q Sbjct: 22 PIGRVLLDQGKIASNDLTHALNLQRRIDAPLGDIMISEGLINKKDVLSALAAQARAEGAD 81 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 ++ + +PAS+ L + V+P R + L V + L R++ Sbjct: 82 LELDPPEMEMANRLPASLCLRFGVVPWREDARALYVATSSPAGFTQLLDACGPQSRQLFP 141 Query: 608 VIVLRGQIVTGLRHWYA 624 VIV QI Y Sbjct: 142 VIVDDAQIQAHQSRLYG 158 >UniRef50_A3V835 Glycosyltransferase, family 2 n=2 Tax=Rhodobacteraceae RepID=A3V835_9RHOB Length = 633 Score = 188 bits (477), Expect = 7e-46, Method: Composition-based stats. Identities = 79/431 (18%), Positives = 142/431 (32%), Gaps = 70/431 (16%) Query: 60 YRELYKPDEKPLAIMVPAWNETGVIGNM-AELAATTLDYENYHIFVGTYPNDPDTQRDVD 118 R + + D + +++P + ET + ++ LAA + + +D T+ + Sbjct: 237 PRPVAEADLPVITMLIPLYRETAIASHLLVRLAALRYPRALLDVCLVLEQDDATTRATLA 296 Query: 119 EVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME 178 + ++ +K LN L + + DAED +P + Sbjct: 297 RT-QLPGWIRAIIVPPGQVKTKPRALNYALPFA---------RGSIIGVWDAEDAPAPDQ 346 Query: 179 L----RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVP 234 L R F +Q + + T + E++ +P L VP Sbjct: 347 LHVVARHFAAAGAHVACLQGVLDYYNAGTNWLTR-CFTIEYAAWFRVVLPGLARLGLVVP 405 Query: 235 SAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKE 294 G F R + L A+D ++TED D+G RL +G + ++EA Sbjct: 406 LGGTTLFFRRSVLETL------GAWDAHNVTEDADLGLRLARRGYVTALIPTLTMEEANG 459 Query: 295 REQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFL 354 R V+Q++RW+ G + + L Sbjct: 460 RAWP------------------------WVKQRARWLKGYAI------TYGVHMRSPVRL 489 Query: 355 WRDRK-----GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWL 409 RD G F+ L++ ++ L L L + +G+ W MT ++ Sbjct: 490 LRDLGLWQFIGVQVLFLGTLSLFALMPLFWSLWLIPLGVAHPLHGWLSAGAFWAMTYAFI 549 Query: 410 ---NFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPR 466 L+VN + +G L + L +A +R L ++ R Sbjct: 550 AAEALALLVNLVGLHKAGRLRLWGWALTLPAYFPL------GTLAAYRGLAELAT----R 599 Query: 467 RVAWDKTTHDF 477 WDKT H Sbjct: 600 PFYWDKTAHGV 610 Score = 42.5 bits (98), Expect = 0.063, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 41/119 (34%), Gaps = 2/119 (1%) Query: 516 LRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLR 575 + L +L + + +A+ LA+ +G ++LIA A L VLP R Sbjct: 48 VPLSDILLRDYNLPSRTIARYLAKAHGAQVVDPTRRPADATLIARWGARDCLRRGVLPWR 107 Query: 576 LENDELIVGSEDGIDPVSL-AALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRA 633 + V + + L + GR +R I ++ + + D Sbjct: 108 AAQGTVTVLTTCPTRFAAARPDLEQVFGR-IRMAITTEAKLTDAIATLHGPIFAADAET 165 >UniRef50_C0R5T9 Glycosyl transferase, group 2 family protein n=8 Tax=Wolbachia RepID=C0R5T9_WOLWR Length = 532 Score = 188 bits (477), Expect = 8e-46, Method: Composition-based stats. Identities = 71/429 (16%), Positives = 154/429 (35%), Gaps = 69/429 (16%) Query: 56 PRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVG--TYPNDPDT 113 ++ Y +L + D I++PA+ E+ VI + E + +LDY + V +D +T Sbjct: 161 QKVDYSKLNEEDFPIYTILLPAFKESAVIEQLIE-SIESLDYPKSKLDVKLQVESDDQET 219 Query: 114 QRDVDEVC-ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAED 172 +++ ++ + P +KA N + +++DA+D Sbjct: 220 LAAIEKYTLPQY--FEVIKVPHSLPRTKAKSCNYAMSFA---------RGKYAVIYDADD 268 Query: 173 VISPMELRL----FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREA 228 P++L+ FN ++ +Q + + + T + E+ +P + Sbjct: 269 KPDPLQLKKALIEFNKGDDKLACVQAKLNYYNCDCNFLTKS-FSLEYMNWFQYLLPGFQK 327 Query: 229 LAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPV 288 + +P G FS + + + +D S+TED D+G RL + G + Sbjct: 328 MNMPMPLGGSSNHFSVKILRKM------FFWDAYSVTEDADLGLRLAQMGYKTRMIDSET 381 Query: 289 VDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSL 348 + E P ++Q++RWI G + + L Sbjct: 382 L------------------------EESPIAVFAWIKQRARWIKGYM------QTYIVHL 411 Query: 349 TLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLW 408 L++ + ++++ L + A + + LS+ L+ Sbjct: 412 KNIKSLYKH---------TGFKGILLLNLFVGSAAFLFFTTPFLLLSLILTKVLNELFLY 462 Query: 409 LNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRV 468 + V ++ VI V + + +L++ +A + AL + + + + Sbjct: 463 YFVVVYVTNLILLVIAVKQQKMPFYFYIVSIFFPVYSLLHSVAAFLALWEFILYPER--- 519 Query: 469 AWDKTTHDF 477 W+KT H Sbjct: 520 -WNKTQHGL 527 >UniRef50_Q1IRV6 Type II secretion system protein E n=24 Tax=Bacteria RepID=Q1IRV6_ACIBL Length = 571 Score = 187 bits (475), Expect = 1e-45, Method: Composition-based stats. Identities = 53/258 (20%), Positives = 105/258 (40%), Gaps = 22/258 (8%) Query: 489 PLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG +L+ +VIT EQL+ ALR + G RLG +++ G +S + + L+ Q GV + Sbjct: 4 RLGDLLVREKVITAEQLEQALREQGSSGTRLGAALVKLGFLSDDDVTNFLSRQYGVPAIN 63 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 ++ ++I S++ +P A Y +LPL L + D + ++ + G + Sbjct: 64 LNYFEIDPSVVKLIPYDTAKRYQILPLSRVGASLTIAMVDPTNVFAMDDIKFMTGFNIEP 123 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ----------WLTEQQAGEIWRQYV 657 V+ I+ G+ Y D +++ + + + + E + Sbjct: 124 VVASESAILEGIEKAYNTAPEEDLESVMASMGEGEASDIEVQADMEEADSADLERAAEEA 183 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P L ILT S I++ E +G+ L ++ +L Sbjct: 184 PIVKLVNMILTEAVKKGASDIHM-----EPYEKEYRVRFRIDGI-----LQTMMNPPMKL 233 Query: 718 QVSMQS-LLLKAGLNTEQ 734 + ++ S + + A L+ + Sbjct: 234 RDAIISRVKIMAKLDISE 251 >UniRef50_A6DXN0 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacteraceae RepID=A6DXN0_9RHOB Length = 643 Score = 187 bits (475), Expect = 1e-45, Method: Composition-based stats. Identities = 76/420 (18%), Positives = 145/420 (34%), Gaps = 64/420 (15%) Query: 70 PLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 ++I+VP + ET + + L+ T + + D T++ + P V Sbjct: 252 KVSILVPLFRETEIAHALIARLSRLTYPKCLLDVILVLEEEDALTRQTL-AGIDLPPWVR 310 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLFNY 184 V+ P +K +N LD + DAED ++ R F Sbjct: 311 PVIVPDGKPRTKPRAMNYALDFC---------QGDIIGIFDAEDAPEADQITIIARRFQQ 361 Query: 185 LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 + + +Q + + + + E++ +P L +P G F R Sbjct: 362 VPQEVACLQGILDYYNPGQNWL-ARCFTIEYAAWFRTLMPGMARLGLAIPLGGTTLYFRR 420 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 + L +D ++TED D+GFRL G + +EA R Sbjct: 421 DVLEEL------GGWDAHNVTEDADLGFRLARHGYRTEMIHTVTEEEANCRAWP------ 468 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW-----RDRK 359 ++Q+SRW+ G + + + L+ R Sbjct: 469 ------------------WIKQRSRWLKGYM------TTYLVHMRQPRLLYAQLGPRKFW 504 Query: 360 GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIV 419 G ++FVS L+ ++ +L ++ + H L A L+ L L + V + Sbjct: 505 GFQAHFVSALSQFLLAP--VLWSFWLVLFGLPHPLDTVVPHALLVALGSLFLLVEVLNVS 562 Query: 420 QRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS 479 + V+G + + + + +A ++AL +++ + WDKTTH Sbjct: 563 IHMASVSGPR-HRHLMAWAPTMHFYTPLGTIAAYKALYELI----LKPFFWDKTTHGLSV 617 >UniRef50_B8FQB2 Type II secretion system protein E n=4 Tax=Clostridiales RepID=B8FQB2_DESHD Length = 573 Score = 187 bits (474), Expect = 1e-45, Method: Composition-based stats. Identities = 59/266 (22%), Positives = 108/266 (40%), Gaps = 23/266 (8%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 + + LG+IL+ + EEQL+ AL+ + GLRLG ++ Q +S E++ + + Q Sbjct: 2 AIRQERKRLGEILIAGGALMEEQLNEALKLQKSLGLRLGEVLIRQNFVSEEEILRTIQRQ 61 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 G+ ++ + ++ +P SVA Y VLP+ N +L+V + D D ++ L Sbjct: 62 LGLPAVDLNRIFVTEKILKMIPESVARKYTVLPVDFTNGQLLVATSDPTDYYAIDDLRLA 121 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRA---------MLYNAVQHQWLTEQQAG- 650 G V+ + + I+ + +Y R + + A Q LT QAG Sbjct: 122 SGMMVKPCVARKADILRAIDRFYGRSEAEKAVSDFVRQKGHDQVAAAAQTPVLTVVQAGG 181 Query: 651 -EIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDR 709 + P I+ + S I++ E L +GV L Sbjct: 182 ETADEEATPIIKFLNTIIENAVNNFASDIHI-----EPVDDELRVRFRIDGV-----LRE 231 Query: 710 VLTIQRELQVSMQS-LLLKAGLNTEQ 734 ++ + + S + + A LN + Sbjct: 232 IMRTPVGMTGPVVSRVKIMADLNIAE 257 >UniRef50_B6R4T3 Glycosyl transferase, family 2 n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R4T3_9RHOB Length = 672 Score = 187 bits (474), Expect = 2e-45, Method: Composition-based stats. Identities = 77/483 (15%), Positives = 157/483 (32%), Gaps = 83/483 (17%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKR-------------KLSVYRRYPRMSYRELYK 65 +T++ I F+ + F DV YW I Y + EL Sbjct: 236 LLTVSPIFFLGTI-GFAFDVFYWALFILLTLSLGAIGLLRIASFFTYSDKQDFAVPELEN 294 Query: 66 PDEKPLAIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARF 124 I+VP + E+ + + + L A E + +D TQ+++ ++ + Sbjct: 295 WP--HYTILVPLYKESAICRQLVDALDALDYPKEALDVIFLVEQDDELTQKNLRKLLRKS 352 Query: 125 PNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--LF 182 + ++ P +K L+ L A + DAED P +L+ + Sbjct: 353 --MRMIILPPGKPQTKPRALSVGLAATKG---------EFVTVFDAEDRPEPQQLKKAIC 401 Query: 183 NYLVERKD--LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 + +E D +Q + + + + E++ L +P +P G Sbjct: 402 QFALEGHDVACLQAAL-SIDHAKDGWLVRQFAFEYAALFDVFLPFLSRKNLLLPLGGTSN 460 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F A+ + +D ++TED D+ R +G + Sbjct: 461 HFRVSALRKV------GGWDPFNVTEDADLAVRFARQGFRTRTLNSS------------- 501 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 E P T + Q++RW G + L ++ + Sbjct: 502 -----------TYEEAPLTLKAWLHQRTRWHKGWI------QTLAVHLRNPRLTYK--QL 542 Query: 361 AISNFVSFL-----AMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV 415 +NF L ML + L + H + + A+ + L+ + Sbjct: 543 GATNFALLLLTFFGGMLCLWAAPLTALMFAEVLWGVHQTGLQAFDAFSIYALFCFLFGLG 602 Query: 416 NRIVQRVIFVT-GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 +V + + ++ + L+W ++ +A+++AL + + + W KT Sbjct: 603 GTVVTVLQGSAKRGFRPRGWEIASIPLYW--VLGCIASYKALFEFI----VKPHYWRKTE 656 Query: 475 HDF 477 H Sbjct: 657 HGI 659 >UniRef50_B0T1N0 Putative uncharacterized protein n=1 Tax=Caulobacter sp. K31 RepID=B0T1N0_CAUSK Length = 492 Score = 186 bits (473), Expect = 2e-45, Method: Composition-based stats. Identities = 96/479 (20%), Positives = 151/479 (31%), Gaps = 79/479 (16%) Query: 13 YGLKVIAITLAVIMFIS------GLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 GL V A+T+ V + I F IK + R P ++ L Sbjct: 56 VGLAVFALTVIVAVIIEPRTTMEAFHLLFFVGFMANSMIKLAAACTPRRPGVAPS-LPDE 114 Query: 67 DEKPLAIMVPAWNETGVIGNMAELAATTLDYENYH--IFVGTYPNDPDTQRDVDEVCARF 124 D ++VP + E V + L LDY + + ND +TQ + Sbjct: 115 DLPGYTLIVPLYREASVAAELV-LNLARLDYPRDRLQVLIVLEANDHETQAAFAAL-DLP 172 Query: 125 PNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL--- 181 ++ P +K N L ER+ +++DAED P +LR Sbjct: 173 VGFQVLIAPPGTPQTKPRACNIAL------ERAHGE---MVVIYDAEDAPHPAQLREAAA 223 Query: 182 -FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 F R +Q P+ F + E++ L +P P G Sbjct: 224 GFAAGDRRLACLQAPLRIEPDP--RFLPDQFALEYAVLFEVFLPALARWRLPFPLGGTSN 281 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F AV A+ +D ++TED DIGFRL +G + P + A Sbjct: 282 HFRTEAVRAV------GGWDSYNVTEDADIGFRLAARGYQLDVITCPTFETA-------- 327 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P T T + Q++RWI G + RD G Sbjct: 328 ----------------PTTMKTWIPQRARWIKG------HVQTLAVLARGP--IVRDPPG 363 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLL----WLNFGLMVN 416 + ++ + L L + L + + +L W + Sbjct: 364 LAALVLTLALSVASSHLHGPLLAWLVLSWLGSMLDLCPPVPAMDWMLVYFGWTCAAIAGA 423 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + +R G Q L +L + + A +AL Q + WDKT H Sbjct: 424 QAQRRA-------GHRQRPLPLLGAVFYWPLQSFAATKALWQFV----VAPFHWDKTPH 471 >UniRef50_Q2FNF4 Glycosyl transferase, family 2 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNF4_METHJ Length = 848 Score = 185 bits (469), Expect = 5e-45, Method: Composition-based stats. Identities = 72/443 (16%), Positives = 146/443 (32%), Gaps = 86/443 (19%) Query: 71 LAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEV--------- 120 ++VP ++E ++ ++ + A + + + D +T ++ Sbjct: 242 YTVLVPLFHEQEMLPHILQNIANINYPRDKLDVKILMEEEDTETIEKARKLGLFGNVEEI 301 Query: 121 -----CARFPNV----HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAE 171 + H VV + T+K N L +++DAE Sbjct: 302 ISPMSEPEYHAFLSIFHPVVIPKADITTKPRACNYGLK---------RSRGEFVVIYDAE 352 Query: 172 DVISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVRE 227 D+ +L+ F L + +Q + + T + E+S + + + Sbjct: 353 DLPDRDQLKKVVIAFQRLGPKYACVQCLLNFYNPRKNMLTR-WFSIEYSYYYDFYIQGLD 411 Query: 228 ALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFP 287 + +P G F + + L A+D ++TED D+G R+ K + + Sbjct: 412 KIDAPIPLGGTSNHFRMKTLREL------GAWDPYNVTEDADLGMRIARKKLHTAVLNSH 465 Query: 288 VVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSS 347 +EA R +RQ+SRW+ G V W + Sbjct: 466 TYEEAVTRVPS------------------------WIRQRSRWVKGFVI------TWFVT 495 Query: 348 LTLNYFLWRDRKGAISNFVSFLA------MLVMIQLLLLLAYESLWPDAWHFLSIFSGSA 401 + + +D I NF F L ++ L L L + + + +F S F Sbjct: 496 MRHPIKVLKD--IGIKNFFIFQTGFGGNFYLPLMNLFLWLVFAAGFIIPEYFSSWF-DFW 552 Query: 402 WLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL--RLFWGNLINFMANWRALKQV 459 + N + + ++ T LL ++W ++ + W+ Q+ Sbjct: 553 PFAAIAVFNLLIGNLFFLTMMVVATWKEKQRDLLLYAFFSPIYW--ILMSIGAWKGTLQL 610 Query: 460 LQHGDPRRVAWDKTTHDFPSVTG 482 + + W+KT+H V Sbjct: 611 I----FKPYKWEKTSHGTEIVHE 629 Score = 47.1 bits (110), Expect = 0.002, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 41/113 (36%) Query: 530 AEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI 589 + LA++ G+A+ D + + +P +V + L + L V + + + Sbjct: 46 PDDFYAYLADRLGLAFMERDTLFANPRIGSVLPYAVGEETLIALLESKPTYLKVATANPL 105 Query: 590 DPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ 642 D L G+K+ V+ I+T Y + + L + + Sbjct: 106 DTPLFTRLEEIFGKKIEKVVTPLDAILTITDTSYKGPHAYSALSELVDRQPDE 158 >UniRef50_A7IHY2 Glycosyl transferase family 2 n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7IHY2_XANP2 Length = 678 Score = 184 bits (468), Expect = 9e-45, Method: Composition-based stats. Identities = 82/477 (17%), Positives = 147/477 (30%), Gaps = 66/477 (13%) Query: 17 VIAITLAVIMFISG-----LDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 V+ + LA ++ ++ + +V+ R + P L Sbjct: 219 VVGLPLAGLVALAPEQGVLAVQALLSLVFLGWVSLRLAACAYDAPPDPPPTLDDRQLPVY 278 Query: 72 AIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 +++VP + E + ++ L A E I + +D T+ + + P + +V Sbjct: 279 SLLVPLYREAASVPHLVAALGALDYPPEKLDIKLVVEADDAGTRAAIAALT-LPPQMEEV 337 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLV 186 GP +K L L A + ++DAED+ P +LR F Sbjct: 338 PVPAVGPRTKPKALEVALAAA---------RGSFVAIYDAEDLPEPDQLRRALEAFRTGG 388 Query: 187 ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 + +Q + + + + + ++ E++ +P+ AL + G F RR Sbjct: 389 PKIACVQARLAIDNGDDS-WIAASFAAEYAAQFDVLLPMLSALGLPILLGGTSNHFRRRV 447 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 + + +D ++TED D+G RL G + Sbjct: 448 LDEV------GGWDPFNVTEDADLGIRLARAGWQTRVISST------------------- 482 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-----RKGA 361 E P T V Q++RW+ G L L D Sbjct: 483 -----TYEEAPVTARAWVGQRTRWLKGWA------QTLLVHLRQPGALMADLGVGPALAL 531 Query: 362 ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 + A ++ + L L A L + Sbjct: 532 LLLAAGPFAAALVHPFCVALLLADLLRGVIGLPRGSMAEALTSALTFTTLFAGYAGTAAI 591 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 GL VL + + L+ A WRAL ++L+ R W KT H Sbjct: 592 TYVGLRRRARVPGLKVVLGIPFYWLLLSAAAWRALIELLR----RPHHWQKTEHGVA 644 >UniRef50_B5YDI2 Type IV pilus assembly protein PilB n=20 Tax=Bacteria RepID=B5YDI2_DICT6 Length = 873 Score = 184 bits (467), Expect = 1e-44, Method: Composition-based stats. Identities = 62/265 (23%), Positives = 117/265 (44%), Gaps = 20/265 (7%) Query: 477 FPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQ 535 T T + LG++LLE +IT+EQLD AL + +G+RLG ++L L+ LA+ Sbjct: 303 AKPSTRRTGRRKLLGEVLLEKNLITKEQLDEALALSSKKGIRLGEALLELKLLDDVALAK 362 Query: 536 ALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA 595 L+EQ + ++S+ +I L + A +LPL +N ++VG D + ++L Sbjct: 363 LLSEQFDIPFKSLKEVKIDHDLAKLISPQKARENLILPLYRDNGRIVVGIVDPSNILALD 422 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQ 655 L +V VIV R +++ + + + +L + + E Q E+ + Sbjct: 423 DLRMVTRSEVFPVIVPRNELIDAINQIWG---SEEVEKVLEEIIVQKEEEETQYQEVSLE 479 Query: 656 YVPHQ-----FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRV 710 + Q L IL S I++ E + + +GV L + Sbjct: 480 EISSQEGPIAKLVNSILVDAVKRGASDIHI-----EPTEKNVRVRFRIDGV-----LHEI 529 Query: 711 LTIQRELQVSMQS-LLLKAGLNTEQ 734 + IQ+ Q ++ S + + + ++ + Sbjct: 530 MFIQKRFQAAIVSRIKIMSDMDISE 554 Score = 77.9 bits (190), Expect = 1e-12, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 71/207 (34%), Gaps = 23/207 (11%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 R +G IL ++ LD L N + G + +L +GLIS E+L L+E G Sbjct: 9 RLIG-ILKSRNIVPAAILDNILSN-LRGKDIQEILLEEGLISKEKLVDLLSEILGWKVLV 66 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 ++ +P + + +PL +E + VG + P ++ + G V Sbjct: 67 GKEFKPNEEAAKSIPPFLTKFHNFIPLGIEEKTIKVGFFPPVKPTAIEDIRLLTGYDVEP 126 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEIL 667 ++ G + + L A + + E + + Sbjct: 127 YLL-------------KISSGEEDLSSLIAAPKTTNVDIDLEPEKVEEPLKG-------- 165 Query: 668 TTLGHINRSAINVLLLRHERSSLPLGK 694 +G + S +N ++ L + Sbjct: 166 IEIGTWDESELNKIIEELTPEGGILEE 192 >UniRef50_A3JQ28 Glycosyl transferase, family 2 n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JQ28_9RHOB Length = 637 Score = 183 bits (465), Expect = 2e-44, Method: Composition-based stats. Identities = 76/490 (15%), Positives = 149/490 (30%), Gaps = 85/490 (17%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIK--RKLSVYRRYPRMSYRELYKPDEK 69 + IA+ ++ + F+ ++ ++ R + S + P E Sbjct: 201 IAVFTAIALAPTLLFTVLFCLASFVLLMNTGFKLWVTAAFMRGRELAKTSTSAIISPPEN 260 Query: 70 P----LAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARF 124 ++I++P ++ET + + +A I D T + A+ Sbjct: 261 MRLPTVSILIPLFHETDIAERLVIRMAKIRYPPALLDIMFLVEEADHAT--KLALCQAKV 318 Query: 125 PNVHKVVCARPGP-TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL---- 179 P +++ GP +K +N L + ++DAED ++ Sbjct: 319 PQNMRIITVPDGPIRTKPRAMNYALPLC---------RGSIIGIYDAEDAPESDQILKIV 369 Query: 180 RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVG 239 F + +Q + + + + E++ +P + L +P G Sbjct: 370 AKFQTSSPKVACLQARLDFYNTSRNWL-ARCFTVEYATWFCVILPGLQCLKMPIPLGGTS 428 Query: 240 TCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRK 299 F R + + A+D ++TED D+G RL G V +EA R Sbjct: 429 VFFRRNVLEKV------GAWDAHNVTEDADLGMRLARNGFKTELVNSTTYEEANCRPWP- 481 Query: 300 FLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR- 358 ++Q+SRW+ G + + L RD Sbjct: 482 -----------------------WIKQRSRWLKGYGL------TYFVMMRKPLQLIRDVG 512 Query: 359 -----------KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLL 407 G ++ F+ +L L + L S L + ++ Sbjct: 513 FINFCGVQILFLGTLTGFILAPVLLSFWFLTMGLPNPSAGMLPQWLLWALLFLFIMAEIV 572 Query: 408 WLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR 467 ++ GL+ I + + MA+ A K + + Sbjct: 573 TISVGLLSISIRGNAKGLGKWVPTMHFYFP------------MASIGAFKAIYEIAT-AP 619 Query: 468 VAWDKTTHDF 477 WDKT H Sbjct: 620 FYWDKTQHGA 629 Score = 47.1 bits (110), Expect = 0.003, Method: Composition-based stats. Identities = 14/130 (10%), Positives = 42/130 (32%), Gaps = 7/130 (5%) Query: 494 LLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 L++ ++ AL + + L ++ G + + + + LA+ + ++ Sbjct: 36 LVKANLLCAADAKRALAMSAIHDASLPDILIFNGFCAGKDVYKNLAKIWNAPFLEKGQFE 95 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV---RYVI 609 + ++ L LP+ ++ L + K + + Sbjct: 96 SDVETLRKIGVEFCLRNKCLPIIDKDGNQFFALSQP---DKFDDLVQFTPFKQKTKKMAV 152 Query: 610 VLRGQIVTGL 619 V +I+ + Sbjct: 153 VEENEIIDTV 162 >UniRef50_A3XKW0 Glycosyltransferase related protein n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKW0_9FLAO Length = 631 Score = 182 bits (461), Expect = 5e-44, Method: Composition-based stats. Identities = 89/459 (19%), Positives = 162/459 (35%), Gaps = 89/459 (19%) Query: 56 PRMSYRELYKP--DEKPL-AIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDP 111 ++S EL K DE P I +P + E+ VI +A E + + +D Sbjct: 226 QKISQTELSKINADELPYYTIQLPVFKESEVIYKLASNLQNLDYPKEKLDVKLLIESDDE 285 Query: 112 DTQRDVDEVCARFP-NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDA 170 T V + +FP V+ P +K N L F + ++DA Sbjct: 286 VTFNAVKNL--KFPCIFDPVIIPYAQPKTKPKACNYGL----HFSKGK-----YLTIYDA 334 Query: 171 EDVISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVR 226 ED+ +L+ LF+ L E +IQ + F + + T M + E+S +P Sbjct: 335 EDIPDSDQLKMVHALFSKLPEEYIVIQCALNYFNKTENYLTRM-FTLEYSYWFDYMLPGL 393 Query: 227 EALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRF 286 + L +P G F + L +D ++TED D+G R KG + Sbjct: 394 DGLKVPIPLGGTSNHFKFDRLIEL------GGWDGFNVTEDADLGIRAYAKGYKVTVLNS 447 Query: 287 PVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTS 346 +EA + F +RQ+SRWI G + + Sbjct: 448 TTYEEAN------------------------NAFYNWIRQRSRWIKGYM------QTYLV 477 Query: 347 SLTLNYFLWRDRKGAISNFVSF-----------LAMLVMIQL----------LLLLAYES 385 + L+R+ ++ F+ F LA +++ L + S Sbjct: 478 HMRNPSKLYRE--VGLNGFLGFQFFIGGTFFTFLAYPILLLLFLFYIFLTLDISSYVVGS 535 Query: 386 LWPDAWHFLSIFSGSA--WLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFW 443 L + F + + ++ L++ I +F Y L ++ ++W Sbjct: 536 LNTEIIDFFKLIFPEWVIIISVFNFMAGNLLMIYINMIAVFRRKSYSLILYAITNP-IYW 594 Query: 444 GNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 L++ ++ ++ L Q++ + W+KT H Sbjct: 595 --LMHSISAYKGLFQLI----SKPFYWEKTNHGLTKDHK 627 >UniRef50_Q8NU22 Glycosyltransferases, probably involved in cell wall biogenesis n=3 Tax=Corynebacterium glutamicum RepID=Q8NU22_CORGL Length = 487 Score = 181 bits (460), Expect = 6e-44, Method: Composition-based stats. Identities = 73/423 (17%), Positives = 131/423 (30%), Gaps = 67/423 (15%) Query: 69 KPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNV 127 K ++VPA+ E VI + A + + +D T + Sbjct: 120 KTYTVLVPAYGEPEVIAQLLASMHAFDYPKHLLQVLLMLEEDDLPTIAAAEAAGV-DQVA 178 Query: 128 HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFN 183 + P +K N L T + DAED+ P++LR F Sbjct: 179 TIIKVPPAQPRTKPKACNYGLHFATG---------EIVTIFDAEDMPDPLQLRRVVVAFE 229 Query: 184 YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 +Q + T+ + E+ +P + VP G Sbjct: 230 RSASNTVCVQSRLSYRNARQNLLTA-WFTIEYDVWFNFLLPGVMRMNAPVPLGGTSNHLL 288 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 + L A+D ++TED D+G R+ KG + + +EA Sbjct: 289 TGVLKDL------GAWDPFNVTEDADLGVRIAAKGYSTAVLDSVTWEEANSDTI------ 336 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD------ 357 +RQ+SRW G + W + +L ++ Sbjct: 337 ------------------NWLRQRSRWYKGYL------QTWLVYMRRPKWLVQELGIIPA 372 Query: 358 -RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 R + +A+L ++ L L + P + F + + L+ L G Sbjct: 373 VRFTFLMAGTPIIAVLNLLFWYLSLTWILGQPGTIEQM--FPPAVYYPALVCLVVGNAAT 430 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 + + G L + L+W L+ +A + Q++ R W+KT H Sbjct: 431 IFMNLIGCREGRDPLLLIAVLTFPLYW--LLMSIAALKGTWQLIT----RPSYWEKTAHG 484 Query: 477 FPS 479 + Sbjct: 485 LEA 487 >UniRef50_Q0FJ05 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacteraceae RepID=Q0FJ05_9RHOB Length = 616 Score = 181 bits (460), Expect = 7e-44, Method: Composition-based stats. Identities = 80/452 (17%), Positives = 146/452 (32%), Gaps = 60/452 (13%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLD 96 V + +I R ++ + + ++++VP + E + + L+ Sbjct: 198 AVLILAQITRLAALLASRRSVPDAPVGPVRLPRISLLVPLFREERIAAALLDRLSRLDYP 257 Query: 97 YENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT-SKADCLNNVLDAITQFE 155 + + +D T+ + R P +V+ GP +K LN L Sbjct: 258 RNRLEVLLLLEASDDTTRAALAAT--RLPPWLRVIEVPGGPIATKPRALNYGLTFA---- 311 Query: 156 RSANFAFAGFILHDAEDVISPMEL----RLFNYLVERKDLIQIPVYPFEREWTHFTSMTY 211 ++DAED +P +L F +Q + + ++ S + Sbjct: 312 -----QGDIVGIYDAEDSPAPDQLLKVAGHFCRAPPETACLQGILDFYN-PHANWLSRCF 365 Query: 212 IDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIG 271 E++ +P L +P G F R A+ + +D ++TED D+G Sbjct: 366 TIEYATWFRLVLPGLARLGFPIPLGGTTVFFRREALDRV------GGWDAHNVTEDADLG 419 Query: 272 FRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWI 331 RL G V + RE + +RQ+SRW+ Sbjct: 420 IRLARFGYVTELV------------------------PLVTREEANNRTWPWIRQRSRWL 455 Query: 332 IGIVFQGFKTHKWTSSLTLNYFLWRDRKG--AISNFVSFLAMLVMIQLLLLLAYESLWPD 389 G + W L RD + V FL ++ L L L Sbjct: 456 KGYMV------TWLVHSRRPLTLLRDLGAWRFVGMQVLFLTTILQFLLAPALWSFWLLLL 509 Query: 390 AWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINF 449 W ++ + LL+ F + + + L V LF + Sbjct: 510 GWQPAALAILTEAQKQLLFGGFLVAEAISLMVSVAAVARSPHQGLLPWVPTLFLYFPLAT 569 Query: 450 MANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 +A ++AL +++ R WDKT H + Sbjct: 570 VAIYKALLELVT----RPFFWDKTMHGHSAPD 597 >UniRef50_B7QPG7 Glycosyl transferase, group 2 family n=4 Tax=Rhodobacteraceae RepID=B7QPG7_9RHOB Length = 661 Score = 181 bits (459), Expect = 9e-44, Method: Composition-based stats. Identities = 82/457 (17%), Positives = 148/457 (32%), Gaps = 58/457 (12%) Query: 35 FIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATT 94 FI + +R ++ + + +P ++++VP + E IG Sbjct: 255 FISHIQTIRSRRQIELPIASRSKFTSPRHRRP---MISVLVPLYKEAE-IGRALLRRLCK 310 Query: 95 LDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVHKVVCA-RPGPTSKADCLNNVLDAIT 152 L Y + V + D CA P + G T+K +N L+ Sbjct: 311 LTYPRSLLEVLLVLEEEDDITRDAIRCADLPDWFRVIEVPAHGGLTTKPRAMNYALNFC- 369 Query: 153 QFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKD----LIQIPVYPFEREWTHFTS 208 + DAED P +L + D +Q + + + S Sbjct: 370 --------RGEIIGIWDAEDAPEPDQLDHVAAAFAKGDGALACLQGALDYYNPTQN-WIS 420 Query: 209 MTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDY 268 + E++ +P L VP G R + + +D ++TED Sbjct: 421 RCFTLEYASWFRIVLPGIARLGLVVPLGGTTLFIRRDVLEQV------GGWDAHNVTEDA 474 Query: 269 DIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKS 328 D+G RL G + +EA R V+Q+S Sbjct: 475 DLGVRLSRFGYRTDMLPTSTYEEANCRPW------------------------AWVKQRS 510 Query: 329 RWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWP 388 RW+ G + H + L L WR G + F+ + ++ L +L Sbjct: 511 RWLKGF-MVTYLVHMRSPRLLLKQLGWRQFLGLQAFFLGTVGQFLLAPCLWSFWLITLGL 569 Query: 389 DAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLIN 448 + +G+ +L + + F L+ I T G L L + + Sbjct: 570 PHPTAPLLPTGATYLAGVSLVFFELLGMVIAITAACAT---GRRSLALWAPSLIFYFPMG 626 Query: 449 FMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTR 485 +A ++AL +++ + WDKTTH + + Sbjct: 627 VIAVYKALYELI----LKPFYWDKTTHGHSPKSRTQK 659 Score = 83.3 bits (204), Expect = 3e-14, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 63/158 (39%), Gaps = 1/158 (0%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 R L Q L+++ ++++ + AL + + + ++ +G S +Q+ AL+ Q Sbjct: 42 TKQVGRRTLEQRLIQDHAVSKDHVIRALTLQQHQRAPIDRILVSEGWASQDQVLDALSSQ 101 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 + + + + L+A PA L ++VLP ++V D ++ Sbjct: 102 HKIPKVDLSGHNPQARLLARKPAGFWLRHSVLPWMQLGQTVVVAVSDPNKLNTIRTDLAA 161 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNA 638 +V V+ QI L + + + + L A Sbjct: 162 SFGEVMPVLASESQIQQLLVSHFRKDLAVEASSRLPLA 199 >UniRef50_B8IGT7 Glycosyl transferase family protein n=9 Tax=Alphaproteobacteria RepID=B8IGT7_METNO Length = 678 Score = 180 bits (456), Expect = 2e-43, Method: Composition-based stats. Identities = 88/462 (19%), Positives = 150/462 (32%), Gaps = 85/462 (18%) Query: 37 DVVYWVRRIKR-KLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTL 95 ++++ + R + P Y D ++V E V+ ++ A L Sbjct: 200 NLLFLGMMLFRLAAVIEPPLPVPDYPRAADADLPVYTVLVALHREAAVVPHLI-GALERL 258 Query: 96 DYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQ 153 DY + + +D +T + A P + VV P +K LN L Sbjct: 259 DYPAAKLDVKLVLEADDAETAGALAAR-ALPPWIEIVVAPPGLPRTKPRALNVALALA-- 315 Query: 154 FERSANFAFAGFILHDAEDVISPMELRL----FNYLVERKDLIQIPVYPFEREWTHFTSM 209 +++DAEDV P +LR+ F R +Q + + T Sbjct: 316 -------RGEYLVVYDAEDVPDPGQLRMAAAIFAGANPRTACLQGRLVIDNTGDSWLTR- 367 Query: 210 TYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYD 269 + E++ L +P A VP G T F + L +D ++TED D Sbjct: 368 CFTLEYTALFDVLIPALAAWRLPVPLGGTTTHFRTATLRTL------HGWDAWNVTEDAD 421 Query: 270 IGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSR 329 +G RL G + E P +RQ+ R Sbjct: 422 LGLRLALAGYHV------------------------GDLPLSTEEEAPAEIRPWLRQRVR 457 Query: 330 WIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPD 389 W+ G V Q TH R A + + L L + L+ +L Sbjct: 458 WMKGFV-QTTITHSR-------------RPAAAATALGPLGSLCALALIPGTVVSALVYP 503 Query: 390 AWHFLS---IFSGSAWLMTLLWLNFGLMVNRI------------VQRVIFVTGYYGLTQG 434 A ++ I A W+N + + R ++GL Sbjct: 504 ALLAVAGWRIVLSPAEPDPSFWVNMVTALGLVLFGAGLAALLLPAARGCLQRRWWGLLP- 562 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 + VL +++G + +A W +L +++ W+KT H Sbjct: 563 WICVLPVYYGLM--SVAAWLSLAELV----LAPSRWNKTEHG 598 >UniRef50_A3DEG0 Type II secretion system protein E n=5 Tax=Clostridia RepID=A3DEG0_CLOTH Length = 787 Score = 178 bits (452), Expect = 5e-43, Method: Composition-based stats. Identities = 53/286 (18%), Positives = 105/286 (36%), Gaps = 35/286 (12%) Query: 469 AWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGL 527 +D+ T++ + D +G IL+ VIT++QL+ AL + G +G ++ QG Sbjct: 201 GFDQNTYNESGIFKD-----KIGNILVRAGVITQDQLENALSIQKKSGGLIGQILVKQGY 255 Query: 528 ISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSED 587 I L + L +Q GV + I+ +I +I + ++A + V+P+ + L V D Sbjct: 256 IDRRSLYEFLQKQMGVEYVDIEGIEIDEDIIGLVSPNLAKTHKVIPIEKVDGNLKVAMSD 315 Query: 588 GIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYAR-------RRGHDPRAMLYN--- 637 ++ S+ L G ++ + QI L +Y + + A L Sbjct: 316 PMNIFSIDDLRLTTGLEIIPCLADEEQISAQLEKYYGKASRKTSAKEIEQKVADLDEEIK 375 Query: 638 --------AVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSS 689 + + + P + I S I++ E Sbjct: 376 KVNEKIAVEITQTEDEDTTIDISDLENAPIVKMVNIIFQKAVATRASDIHI-----EPQE 430 Query: 690 LPLGKFLVTEGVISQETLDRVLTIQRE-LQVSMQSLLLKAGLNTEQ 734 + +G L ++ R+ L + + + +GLN + Sbjct: 431 DCVLIRFRIDG-----QLVEIMRYDRKILSSIVARIKIISGLNIAE 471 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 50/255 (19%), Positives = 105/255 (41%), Gaps = 16/255 (6%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 R + +ILLE V+ L A R + +L GL+S + + A A + G+ Sbjct: 5 DTRGIDEILLEMGVLKIVDLKKAWDIQRESNKNIEDVLLELGLVSQKDIMHANAVKMGIP 64 Query: 545 WESIDAWQI-PSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + + +QI SS+ + ++A Y V+P+ EN L V D D + + Sbjct: 65 FVDLSTYQISDSSVPLLITRNIANRYKVIPIEKENGVLTVAMSDPTDIFCIDDIRLATAL 124 Query: 604 KVRYVIVLRGQIVTGLRHWYA-RRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 +++ V+ +I + ++ ++ + + N Q++ + E + + Sbjct: 125 EIKPVLADVKEIERLIVEYFGEEKKPQESKLKAENEEQNKKEELLKMEEELLGREIYNNI 184 Query: 663 FAEILTTLGHINRSAINVLLLRHERS--------SLPLGKFLVTEGVISQETLDRVLTIQ 714 A++ ++ +++ +G LV GVI+Q+ L+ L+IQ Sbjct: 185 KADV-----ETREPEFDLASKGFDQNTYNESGIFKDKIGNILVRAGVITQDQLENALSIQ 239 Query: 715 RELQVSMQSLLLKAG 729 ++ + +L+K G Sbjct: 240 KKSGGLIGQILVKQG 254 >UniRef50_B6B2W2 Glycosyl transferase, group 2 family protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B2W2_9RHOB Length = 588 Score = 178 bits (451), Expect = 7e-43, Method: Composition-based stats. Identities = 74/485 (15%), Positives = 162/485 (33%), Gaps = 65/485 (13%) Query: 4 LLDVFATWL--YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 +L + WL GL AV+M +G+ ++ + ++ + P R Sbjct: 140 VLALALQWLTFAGLYRAIFGFAVVMVFTGI---------VIKTAAAFIQLFVKEPEKQIR 190 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 +++++P +ET ++ + T + + +D T+ ++ Sbjct: 191 AAPTTKLPRVSLLIPLHDETEILEALLRHIGMLTYPETLLDVILIVEASDTLTRSHLETT 250 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 + ++ T+K +N L ++DAED +P +L Sbjct: 251 -DLPNWMRILIVPAGKITTKPRAMNYALPFC---------RGDIIGIYDAEDAPNPDQLS 300 Query: 181 ----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 LF E+ IQ + + S + E++ +P + +P Sbjct: 301 EVVDLFARTSEKTACIQAVLDFYNARSNAL-SRFFAIEYATWFRLILPGIARMGFAIPLG 359 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G F R + L +D ++TED D+G RL G + ++EA + Sbjct: 360 GTSVFFRRNVLEKL------DGWDAHNVTEDADLGIRLARAGYQTTLISSVTLEEANNKA 413 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 ++Q+SRW+ G + F + L W+ Sbjct: 414 WP------------------------WIKQRSRWLKGYLITYFTHMRRPFRLLFELGAWK 449 Query: 357 DRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 G + F+S ++ ++ L+ W + + +L T++ + Sbjct: 450 FL-GFQAFFLSSISQYMLAPLIWSCMALYFGAPQW--MDATVPAEFLNTIMVAFIVMWFT 506 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 ++ V+G + + + + +A ++ +++ + W+KT H Sbjct: 507 STTVYIVAVSGEL-HRHLIPWIPFMSAYMALGTLAIYKGAWELIT----KPFYWEKTAHG 561 Query: 477 FPSVT 481 V Sbjct: 562 HSDVE 566 Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 38/99 (38%) Query: 522 MLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDEL 581 ++ QG IS +L + LA QN + + + + +L + L +P R E + Sbjct: 2 LVSQGQISETELYETLAIQNSLPFIDMQSDIPDLTLQTGLDPHQCLRLQCVPWRREGETT 61 Query: 582 IVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLR 620 I+ + + A V+ I R I +R Sbjct: 62 ILTTSSPEQFENAKATLPAHLHPVKMAIATRSHITEAIR 100 >UniRef50_Q1GE89 Glycosyl transferase family 2 n=3 Tax=Rhodobacteraceae RepID=Q1GE89_SILST Length = 509 Score = 178 bits (451), Expect = 8e-43, Method: Composition-based stats. Identities = 76/443 (17%), Positives = 139/443 (31%), Gaps = 72/443 (16%) Query: 58 MSYRELYKPDEKP-----LAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDP 111 + R E P ++++VP + E + ++ T + + + ND Sbjct: 121 AAARPDQPKSETPKDLPQMSMLVPLYREAEIGKHLLRRLCRLTYPRDRLEVLLVLEENDD 180 Query: 112 DTQRDVDEVCARFP-NVHKVVCARPG-PTSKADCLNNVLDAITQFERSANFAFAGFILHD 169 T+ V CA P V G T+K +N L+ + D Sbjct: 181 VTRNAVK--CADLPDWFRVVEVPGDGTLTTKPRAMNYALNFC---------RGEIIGIWD 229 Query: 170 AEDVISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPV 225 AED +P +L F + Q + + + S + E++ + Sbjct: 230 AEDAPAPDQLESAASAFAHAPPDVVCFQGILDFYN-PSRNLISRCFTLEYAGWFRVLLQG 288 Query: 226 REALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVR 285 L +P G R A+ L A+D ++TED D+G R+ + Sbjct: 289 IARLGLVIPLGGTTLFIRRDALEQL------GAWDAHNVTEDADLGVRIARACYRTEMLP 342 Query: 286 FPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWT 345 +EA R ++Q+SRW+ G + + Sbjct: 343 TTTYEEANSRITP------------------------WIKQRSRWLKGFMM------TYL 372 Query: 346 SSLTLNYFLWRD---RK--GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGS 400 + L RD R+ G + F+ L ++ +L +L S+ Sbjct: 373 VHMRAPKALLRDVGWRRFWGLQAFFLGTLGQFLLAPVLWSFWLVALGVSHPLEASL-PRD 431 Query: 401 AWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVL 460 + + L F ++N + G + ++ A ++AL +V Sbjct: 432 MLSVAVGALVFFEVLNLCIWYCG--ARASGRPVLAFCAPLMPLYFILGCFAAYKALWEVF 489 Query: 461 QHGDPRRVAWDKTTHDFPSVTGD 483 WDKT H T + Sbjct: 490 A----APFFWDKTAHGDHGGTTE 508 >UniRef50_B0MQV8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0MQV8_9FIRM Length = 563 Score = 178 bits (450), Expect = 1e-42, Method: Composition-based stats. Identities = 53/251 (21%), Positives = 102/251 (40%), Gaps = 14/251 (5%) Query: 489 PLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 P+GQIL+EN + E+QL+ AL + G +LG +L G +S QLAQAL+ + V + Sbjct: 5 PIGQILVENGFLKEDQLEEALEKQRSEPGKKLGDVLLELGYVSETQLAQALSIRLKVPFI 64 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + +I + ++P ++A + ++ + L V ++D I+ L G ++ Sbjct: 65 DLTTTKIDIEAVKKIPEAIAKKNCCVAFQMTDSRLTVATDDPINFYIFEELKVISGMEIH 124 Query: 607 YVIVLRGQIVTGLRHWYARRRGH---DPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 +I R I + Y+++ D Y + + P L Sbjct: 125 AMIATRTAINETISKAYSQQTVSNVMDNLNKEYTGNTDSVIQDDPESGERVDNAPIVKLV 184 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 I+ T +N S I++ E L +G E ++++ + Sbjct: 185 NTIVETSFRMNASDIHI-----EPFKDRTRIRLRVDG----ELIEQMKVKPAAHNSLITR 235 Query: 724 LLLKAGLNTEQ 734 + + G+N + Sbjct: 236 IKILGGMNIAE 246 >UniRef50_A3DDQ8 Type II secretion system protein E n=3 Tax=Clostridium thermocellum RepID=A3DDQ8_CLOTH Length = 561 Score = 177 bits (449), Expect = 1e-42, Method: Composition-based stats. Identities = 44/236 (18%), Positives = 99/236 (41%), Gaps = 6/236 (2%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 + + LG IL+E +I++EQLD AL+ + G +LG ++ +G+++ E + + L E+ G Sbjct: 3 KQKRKGLGDILVEAGLISKEQLDKALKLQKKTGQKLGVLLVSEGIVTQEDIMRVLEEKIG 62 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 V +++ I ++ + +P +A Y ++P+ ++ L V D ++ ++ + G Sbjct: 63 VLRVALEECNIDPAVCSLIPEKLARRYELIPIAQKDGVLRVAMSDPLNVFAIDDIEDYTG 122 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 +V V+ I + +Y + + + + + L Sbjct: 123 MRVEPVVDFASSIKNAIDKYYRTQHVLVEPVKEKGILFKIDEETIELESVEAENESASML 182 Query: 663 FAEILTTL-----GHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTI 713 I+ G I+ + L R+ + + + TE + L ++ I Sbjct: 183 LNSIIEQAIRNGSGDIHIEPLQNALKIRFRTDGQMHEVMRTEIGMLNGVLAKIKAI 238 Score = 46.3 bits (108), Expect = 0.005, Method: Composition-based stats. Identities = 16/52 (30%), Positives = 30/52 (57%) Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 +IL G I++ ++ L +++ LG LV+EG+++QE + RVL + Sbjct: 9 LGDILVEAGLISKEQLDKALKLQKKTGQKLGVLLVSEGIVTQEDIMRVLEEK 60 >UniRef50_B1YJT8 Type II secretion system protein E n=5 Tax=Bacillales RepID=B1YJT8_EXIS2 Length = 554 Score = 177 bits (449), Expect = 1e-42, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 96/222 (43%), Gaps = 13/222 (5%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 + LG++LLE V+TE Q++ AL + +LG ++L G ++ +QL +AL Q + Sbjct: 4 KRKRLGEMLLEESVVTEAQIEEALSVKRTSEKLGDTLLRLGHLTEQQLIEALHHQLKIPV 63 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 + + + ++ + +A + ++P+ E + L + D +D +++ L + G + Sbjct: 64 IQLYNYPVDVAVTKLISKELAQRHTLVPVYREGNRLFIAMADPMDLIAIDDLRLQTGLMI 123 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLT--EQQAGEIWRQYVPHQFLF 663 + R +I + +Y D + L ++ +T + + R+ P L Sbjct: 124 EVGLATRDEIRRTILKYY------DIDSSLRELLESDEMTISDTSRDTVTREDAPIIRLV 177 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQE 705 +IL S I++ + L + +G + E Sbjct: 178 NQILENGISQRASDIHM-----DPQETSLSIRIRIDGELRTE 214 >UniRef50_D1B5J6 Type II secretion system protein E n=3 Tax=Synergistaceae RepID=D1B5J6_THEAS Length = 559 Score = 177 bits (449), Expect = 1e-42, Method: Composition-based stats. Identities = 58/258 (22%), Positives = 113/258 (43%), Gaps = 14/258 (5%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALA 538 +T +T+ LR LG IL++ V+TE L+ AL ++ +RLG ++ G +S + LA+AL+ Sbjct: 1 MTTETKHLR-LGDILIQAGVLTESTLEAALAEQKMSSMRLGEILVKNGWVSEKHLAEALS 59 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLR-LENDELIVGSEDGIDPVSLAAL 597 Q V S+ ++ ++ +P ++A V+PL LEND+L+V + D ++ ++L L Sbjct: 60 RQLKVPLVSLSRYRPTPEVLKIVPENLARRLDVVPLSILENDKLLVATADPLNVMALDEL 119 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYV 657 GR++ I +I +Y + + + + + ++ Sbjct: 120 KMATGREIDISIATASEIRRAFDQFYRVQATLEEAMVEVMDEKRGAESSLNLVDVSADDA 179 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P L I+ S I++ E +G L L R L Sbjct: 180 PVVKLVNSIMEQAVKEGTSDIHI-----EVFERSARVRYRIDG-----ALFDSLEYPRNL 229 Query: 718 QVSMQS-LLLKAGLNTEQ 734 ++ S + + +G++ + Sbjct: 230 HPAVCSRIKIMSGMDISE 247 >UniRef50_B0TEE9 Type ii secretion system protein e, putative n=12 Tax=Firmicutes RepID=B0TEE9_HELMI Length = 570 Score = 177 bits (449), Expect = 1e-42, Method: Composition-based stats. Identities = 49/258 (18%), Positives = 100/258 (38%), Gaps = 19/258 (7%) Query: 486 SLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + R LG +LLE +IT+EQL AL + G RLG +++ G ++ + + + L Q G+ Sbjct: 5 NRRKLGDLLLEYNLITDEQLQQALAEQKKRGERLGQTLVRLGFVTRQMINEVLEFQLGIP 64 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 S+ + + + +P S+ + LP++ + L V D ++ +L + + Sbjct: 65 TISLLQYPLHPEVFKLLPESLCRRHKCLPVKRSGNRLTVAMVDPLNLPALDDIKMTTNLE 124 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQY-------V 657 + IV ++ Y + ++ E+ ++ Sbjct: 125 IDAAIVAEDELEQVFEKIYGLNEEQEADIKRLEVEANRAEEERSIIDLGELERMTAVGDA 184 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P + IL S I++ E + +G+ L VLT+ + Sbjct: 185 PIIRVVNTILQQAAKEGASDIHL-----EPQEGGVRVRYRADGI-----LRHVLTLPKAA 234 Query: 718 QVSMQS-LLLKAGLNTEQ 734 ++ S + L A +N + Sbjct: 235 HPALLSRIKLLAKMNIAE 252 >UniRef50_B0MCL3 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MCL3_9FIRM Length = 558 Score = 175 bits (444), Expect = 5e-42, Method: Composition-based stats. Identities = 49/253 (19%), Positives = 114/253 (45%), Gaps = 19/253 (7%) Query: 489 PLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 P+G++LL+ IT+EQ+D AL + E G RLG ++ I+ +Q+ +AL ++ ++ Sbjct: 5 PIGEVLLQYGYITKEQIDQALDYQKEHPGKRLGTILMELQFITEQQMLEALGQRLSLSHI 64 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 S+ ++ + S + ++P +A Y +L + +++ +L + D ++ ++ + + G +++ Sbjct: 65 SLGSYPVNSEAVEKIPRQLAFKYNILAVDMKDHQLYIAVNDPLNFYAMEDIRQLTGMQLK 124 Query: 607 YVIVLRGQIVTGLRHWY----ARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 + + L ++Y AR+ A +L + E P L Sbjct: 125 VFLAELSPLKKALEYFYAEVSARQAARQANETTQEAEDISFLDD--MDEEADSDAPIIKL 182 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ 722 ++ + N S I++ E + +G I +T++R L S+ Sbjct: 183 LNTLVLRAYNTNASDIHI-----EPFEKETVVRMRIDGTIVDY-----VTLKRSLHASLT 232 Query: 723 S-LLLKAGLNTEQ 734 + + + G++ + Sbjct: 233 ARIKIMGGMDIAE 245 >UniRef50_Q1RIH7 Glycosyltransferase n=11 Tax=Rickettsia RepID=Q1RIH7_RICBR Length = 573 Score = 174 bits (442), Expect = 8e-42, Method: Composition-based stats. Identities = 73/482 (15%), Positives = 167/482 (34%), Gaps = 77/482 (15%) Query: 8 FATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPD 67 F +L L + + + IS + + +V+ I+ + + R++ EL Sbjct: 152 FVVFLVILTYVPVLFHIANNISYFVQNVLKSLLFVKAIRDYKPLEVKQARINVEELPI-- 209 Query: 68 EKPLAIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN 126 I+VP + E + ++ + + + + +D +++ + Sbjct: 210 ---YTILVPLYKELSKLRSIIKNISLINYPDSKLDVKIIIEDDDYLMIKEI-ALYNLPAY 265 Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLF 182 H ++ + P +K LN L+ +++DAED P +L +F Sbjct: 266 FHVILVPQSSPRTKPKALNYALEY---------SRGEYVVVYDAEDKPEPDQLLKALAMF 316 Query: 183 NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 L +Q + + + T M + E+S + L P G F Sbjct: 317 KSLAPDFICLQAKLNFYNKNENVLTKM-FNLEYSLWFEYILKGLSLLKLPTPLGGTSNHF 375 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 + L +D ++TED +IG R+ + + ++EA Sbjct: 376 KADILRKL------GGWDAHNVTEDAEIGLRIYSQNYKVTILDSYTLEEA---------- 419 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI 362 P++ + Q+SRWI G + F + +D+ + Sbjct: 420 --------------PNSLGNWLNQRSRWIKGFLQTFFV-----------FIAQKDKYKKL 454 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI---- 418 + ++ I + + L+ + W + SI ++ +WL + Sbjct: 455 TLLQ-----IITIYIFIGLSTYNFWCLPFIIFSIIINKNPIIDYVWLVNSIFSLLYLYGT 509 Query: 419 VQRVIFVTGYYGLTQGLLSVLRLFWGN--LINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 V ++ + +G + + + W +++ +A+++A+ +++ W+KT H Sbjct: 510 VIYILKNSLKFGKIKFQDLIALVLWAGYFILHTIASYKAVFEII----FCPFKWNKTKHG 565 Query: 477 FP 478 Sbjct: 566 VS 567 >UniRef50_C8NRH6 Group 2 glycosyl transferase n=5 Tax=Corynebacterineae RepID=C8NRH6_COREF Length = 478 Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats. Identities = 86/478 (17%), Positives = 152/478 (31%), Gaps = 78/478 (16%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDE--KPLAIM 74 ++ L +M++ L D FI R+ R ++ R L P E K I+ Sbjct: 64 IVIAGLCTLMYVVTLTDRFI----MFRKGLRADAIMRVT---DEEALAVPVERLKAYTIL 116 Query: 75 VPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVV- 131 VPA+ E VI + A + DY + + +D T + A + ++ Sbjct: 117 VPAYGEPEVITQLV-TAMNSFDYPPHLLQVLLLLEEDDLPTIEAAER--ANLGEISTIIK 173 Query: 132 CARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVE 187 P +K N L T + DAED+ P++LR F Sbjct: 174 VPPAQPRTKPKACNYGLHFATG---------EIVTIFDAEDIPDPLQLRRVVVAFENSPA 224 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 +Q + T + E+ +P + VP G + Sbjct: 225 NTVCVQSRLSYRNARQNLLT-GWFTIEYDVWFNFLLPGIMRMQAPVPLGGTSNHLVTEVL 283 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 L A+D ++TED D+G R+ +G + +EA Sbjct: 284 REL------GAWDPYNVTEDADLGVRIAARGYRTAVLDSVTWEEANSDTI---------- 327 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVS 367 +RQ+SRW G + W + +L R+ G + Sbjct: 328 --------------NWLRQRSRWYKGYL------QTWLVYMRRPRWLVREL-GVLPALRF 366 Query: 368 FLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLL---WLNFGLMVNRIVQRVIF 424 M + +L W + +A L+ L ++ N + Sbjct: 367 TFLMAGTPIVAVLNLLFWYLSLTWILGQPATIAAMFPPLVYYPALICLILGNAATMYMNL 426 Query: 425 VTGYYGLTQGLLSVL---RLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS 479 + G L+ + ++W L+ +A + Q++ R W+KT H + Sbjct: 427 IGCREGRDPLLVVAVLTFPVYW--LLMSIAALKGTWQLIT----RPSYWEKTAHGLEA 478 >UniRef50_D1BGY1 Type II secretion system protein E (GspE) n=3 Tax=Actinobacteridae RepID=D1BGY1_SANKS Length = 557 Score = 172 bits (436), Expect = 4e-41, Method: Composition-based stats. Identities = 53/223 (23%), Positives = 96/223 (43%), Gaps = 13/223 (5%) Query: 487 LRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 ++ LG+ILLE ++ E QL AL +V G LG ++ G++S QL ALA Q G+ + Sbjct: 1 MKQLGEILLEEGLVNEAQLMAALDEQVVRGTSLGRVLVELGVLSEGQLVSALAAQVGMQF 60 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 +D + + + ++ + +V Y VLP+ E D L++ D + +++ + G +V Sbjct: 61 VDLDTFPVDRAAVSRLTGAVCRRYTVLPIAFEGDALVLAMADPGNVLAVDDVRSSTGMQV 120 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTE----QQAGEIWRQYVPHQF 661 V+ + + + R D L NA + + + G+ P Sbjct: 121 LPVVATHEDLSRAIDRFV---RADDEMDNLTNAFTEEQRVDDVDLSKIGDSVDDDAPIVR 177 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQ 704 I+T S I++ E S L +GV+ + Sbjct: 178 YVNLIVTQAITDRASDIHI-----EPSEHDLRVRYRIDGVLHE 215 >UniRef50_B9Y801 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y801_9FIRM Length = 560 Score = 171 bits (434), Expect = 7e-41, Method: Composition-based stats. Identities = 54/257 (21%), Positives = 103/257 (40%), Gaps = 23/257 (8%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 P+GQ+LLE ITEEQL++AL ++ G RLG ++ G I+ E+ +AL+ + V Sbjct: 4 LPIGQLLLEQGYITEEQLNSALAHQKAHPGNRLGDVLIELGYITEEKKLKALSVRLNVPV 63 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 + S ++ + VA Y V+PL ++N+ L + + D +D +L + G V Sbjct: 64 YEGFQINVNSDIVRLISEDVAKKYQVMPLEIKNNALQLATSDPLDFYALEDIKASCGIPV 123 Query: 606 RYVIVLRGQIVTGLRHWYAR-------RRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVP 658 V+ + I +R YA+ + + L+E P Sbjct: 124 SPVLAPKEMIENAIRRNYAQANVSSAIDEIQKDLTEDQDLALNDELSELTQ---RVDNAP 180 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQET-LDRVLTIQREL 717 ++ S I++ E + T+GV+ + T + + + Sbjct: 181 VVKFVNNMIRQAYETGASDIHI-----EPFEMTTVIRFRTDGVLHEFTRIAKSVH----- 230 Query: 718 QVSMQSLLLKAGLNTEQ 734 + + + +N + Sbjct: 231 DALITRIKIMGNMNIAE 247 >UniRef50_B2IIL4 Glycosyl transferase family 2 n=2 Tax=Beijerinckiaceae RepID=B2IIL4_BEII9 Length = 650 Score = 171 bits (434), Expect = 8e-41, Method: Composition-based stats. Identities = 76/469 (16%), Positives = 146/469 (31%), Gaps = 70/469 (14%) Query: 34 FFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP-----LAIMVPAWNETGVIGNMA 88 + ++++ + R + + R L++P + +I+V E ++ + Sbjct: 200 LVLAILFFGMLVLR---LAAGAASLGPRSLWRPHVRDATLPLYSIVVALHREARIVPQLV 256 Query: 89 ELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNN 146 + A +DY + + +D +T + P +V P +K LN Sbjct: 257 D-ALERIDYPRAKLEVKLVIEADDRETL-EALRKARLSPLYEIIVAPSGWPRTKPRALNV 314 Query: 147 VLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIPVYPFERE 202 L + + DAED P++LR F + +Q + E Sbjct: 315 ALPLL---------RGTFVTVFDAEDEPDPLQLRHAAEYFLASPKTLACLQARLVIDNVE 365 Query: 203 WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQ 262 + T + + E++ L + L +P G F + A+ +D Sbjct: 366 DSWLTRL-FSIEYAVLFDVLLEGMSELRLPLPLGGSSNHFRADVLRAV------HGWDAW 418 Query: 263 SLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFST 322 ++TED D+G RL G + ++EA P Sbjct: 419 NVTEDADLGMRLARNGYRTATLASQTLEEA------------------------PARLDA 454 Query: 323 AVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD---RKGAISNFVSFLAMLVMIQLLL 379 Q+ RW+ G + G L L + ++G + +L + + Sbjct: 455 WFSQRRRWLKGWMQTG------GVLLRDPRRLLAETGMKQGGALLLLLVGLVLAPLLWPI 508 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVT-GYYGLTQGLLSV 438 L + + L+L L+ + ++ L L + Sbjct: 509 LTGVTLYQWMSGGLPEPTNWLGIFAATLFLAVSLLGVGSTLWLSWLGMRRRNLLNLWLFL 568 Query: 439 LRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSL 487 + L+ A W AL +L R W KT H R+ Sbjct: 569 PLILPYYLLISCAAWAALYDLL----VRPFHWRKTEHGLARTRASRRAP 613 >UniRef50_Q181B0 Type IV pilus assembly protein n=5 Tax=Clostridium difficile RepID=Q181B0_CLOD6 Length = 561 Score = 171 bits (433), Expect = 1e-40, Method: Composition-based stats. Identities = 51/247 (20%), Positives = 99/247 (40%), Gaps = 11/247 (4%) Query: 489 PLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 +G L+E ITEEQL AL + G RLG ++ +GLI + L L E + Sbjct: 9 RIGDKLVEKGYITEEQLKWALSEQKNSGKRLGEFLVQEGLIDSNLLISVLKELLDIESIF 68 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 ++ +I + +P ++ Y V P +++ +++ + D D ++ + R G+ V Sbjct: 69 LEGTEIDTLATKMVPENICKRYTVFPFKIDGNKICLAMSDPQDREAVQDVRRMSGKDVEI 128 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEIL 667 I I + H YA + YN + + E E P L IL Sbjct: 129 FISSTEDINKAIGHAYAHSEINKAMTE-YNKNRTGGVRETVILEEDVNAAPIVRLVNNIL 187 Query: 668 TTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLLK 727 + S I++ E+S + +G++ + R+ + + + + + Sbjct: 188 ENAVRMEASDIHI-----EQSENYMRVRFRIDGMLREYM--RMNSAPYK--AVISRIKIM 238 Query: 728 AGLNTEQ 734 + +N + Sbjct: 239 SDINISE 245 >UniRef50_B1I3E7 Type II secretion system protein E n=4 Tax=Clostridia RepID=B1I3E7_DESAP Length = 561 Score = 169 bits (429), Expect = 2e-40, Method: Composition-based stats. Identities = 55/250 (22%), Positives = 107/250 (42%), Gaps = 20/250 (8%) Query: 490 LGQILLENQVITEEQLDTALRNR--VEGLR--LGGSMLMQGLISAEQLAQALAEQNGVAW 545 LG L++ VIT+EQL+ AL+ + +G + LG +++ G + E +AQ +A QNGV + Sbjct: 9 LGMNLVKAGVITQEQLEEALKRQDPKKGGKGFLGATLVELGYCTEEDIAQVIARQNGVPY 68 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 S++ + + + VA Y LP+ +N +L+V + D ++L L GR++ Sbjct: 69 VSLETFAADPQAVGLIAPEVARRYRALPIGFQNGKLVVAMKQPRDVIALDDLRIITGREI 128 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE 665 + V++ Q ++ + + + + E ++ P L Sbjct: 129 QPVVIPDSQFDAAMQRY---SQSGLEVELAAAEEEVAEEVVAGLDEAAQR--PAVQLANA 183 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS-L 724 I + S +++ E L +GV L +L R L S+ S + Sbjct: 184 IFNQAVWASASDVHI-----EPLEKSLRVRFRIDGV-----LHNILQPPRHLHASLVSRI 233 Query: 725 LLKAGLNTEQ 734 + A ++ + Sbjct: 234 KVMANMDIAE 243 >UniRef50_C8W5J2 Type II secretion system protein E n=2 Tax=Clostridiales RepID=C8W5J2_DESAS Length = 561 Score = 168 bits (426), Expect = 7e-40, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 105/247 (42%), Gaps = 20/247 (8%) Query: 490 LGQILLENQVITEEQLDTALRNRVE--GLR--LGGSMLMQGLISAEQLAQALAEQNGVAW 545 LG IL++ +IT+EQL+ AL+N+ E G + +G +++ G + + +A+ +AE++G+ + Sbjct: 10 LGTILVQKGIITQEQLEDALKNQSEMKGKKGLIGKTLVRLGYCTEDDIARVIAERSGIPY 69 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 S++ +QI + + + Y LP+ +D+L+V D +S+ L G + Sbjct: 70 ISLETYQIDPAAVTVLSIDNINRYKALPVSFADDKLVVAMNHPNDIMSIDDLRMLTGYDI 129 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE 665 + V+ ++ + + + + + + P L Sbjct: 130 KPVMTSDTELEATIEKY-----SRESLDVEQEDDDVDAYNDLANESVDDADRPAIQLANM 184 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS-L 724 IL+ S I++ +GV L ++ + R++ ++ S + Sbjct: 185 ILSQALSARASDIHIEPYEKNSR-----VRFRIDGV-----LHDIMQVPRKMHATLTSRI 234 Query: 725 LLKAGLN 731 + A ++ Sbjct: 235 KVMANMD 241 >UniRef50_B8E2U7 Type II secretion system protein E n=2 Tax=Dictyoglomus RepID=B8E2U7_DICTD Length = 561 Score = 168 bits (425), Expect = 8e-40, Method: Composition-based stats. Identities = 50/252 (19%), Positives = 103/252 (40%), Gaps = 13/252 (5%) Query: 485 RSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 + +PLG+ LLE +IT+EQL+ AL + G +LG ++ +G + E + + L Q+ + Sbjct: 2 KEKKPLGEYLLEQGLITKEQLEKALEEQKKTGAKLGQILIERGYVKPEDIGKVLERQSEI 61 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + S+ QI L ++ Y +P++ E L V IDP + + R V Sbjct: 62 PYISLTEVQIDEKLAGSFSENLLRRYKFIPIKREAGVLHVAVVPPIDPAIINEIRRIVKS 121 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 +R I + + + + + + + E I + P L Sbjct: 122 PIRIFITTDKEFNQIISRLFPLEKTTLSVVQDFQRTAPEPMVETLPA-IGVEEAPIVRLV 180 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ-VSMQ 722 + I+ + N S I++ + + G+ L ++ I +E+Q + Sbjct: 181 SSIINEAINRNASDIHL-----DPQEKEMKVRYRIHGI-----LYDIMAIPKEIQDAVVT 230 Query: 723 SLLLKAGLNTEQ 734 + + +G++ + Sbjct: 231 RIKVISGMDIAE 242 >UniRef50_UPI0001C3693E pili biogenesis protein PilB-like ATPase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C3693E Length = 572 Score = 167 bits (423), Expect = 1e-39, Method: Composition-based stats. Identities = 52/233 (22%), Positives = 95/233 (40%), Gaps = 15/233 (6%) Query: 490 LGQILLENQVITEEQLDTALRNRVEG---LRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 +G++L+E IT+EQL+ L+ G RL + G I+ +L L G+ Sbjct: 6 IGEVLVEQGAITKEQLNEGLKLLKAGTNDRRLAEVLTDLGYITERELLDVLGRGMGLEVI 65 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++ + I + ++P +AL Y V+ + +E L V + D +D +L + +++ Sbjct: 66 DLEFFHIDERAVEKIPKQLALKYTVMAVSMEGSGLTVATADPLDLYALEDIRLVTNMRIQ 125 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEI----WRQYVPHQFL 662 ++ R QI + Y+ G D RA A +H + A + P L Sbjct: 126 LILAERTQIRHAIELNYS---GIDARAAARLASEHAVFSRTFAENLMVNSDEDQAPIVRL 182 Query: 663 FAEILTTLGHINRSAINVLLLRHE-----RSSLPLGKFLVTEGVISQETLDRV 710 +L + N S I++ +E R L ++ I Q + R Sbjct: 183 LNSLLLKGYNTNASDIHIEPYENETVVRMRRDGMLIPYMTLSPAIHQGIVART 235 >UniRef50_D0XKL7 Putative uncharacterized protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XKL7_9CAUL Length = 469 Score = 167 bits (422), Expect = 2e-39, Method: Composition-based stats. Identities = 86/470 (18%), Positives = 152/470 (32%), Gaps = 84/470 (17%) Query: 22 LAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNET 81 M I G F+ V W RI L+ ++ D I+V +E Sbjct: 45 TTGAMLIGGAQLAFVVVAGW--RILLTLAPAPAPGDVAPGS----DLPRYTILVALLDEA 98 Query: 82 GVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 VI + + +DY F+ +D +T D R + ++ P + Sbjct: 99 AVIDQLVGR-LSRIDYPAHRLEAFLLLEAHDHETI-DAAWHADRPDWMSILIAPPGDPRT 156 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE-----RKDLIQI 194 K LN L A T ++DAED P++LR R +Q Sbjct: 157 KPRALNVGLAAATG---------ELVTVYDAEDDPDPLQLREAAARFAADPSGRLSALQA 207 Query: 195 PVYPFEREWTH--FTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 P+ T F + E++ L +P L P G F + A+ Sbjct: 208 PLRIRTATRTRTPFLDRQFAIEYASLFEVTLPAMARLGLPFPMGGTSNHFRASWLRAV-- 265 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 +D ++TED D+GFRL G +R P + Sbjct: 266 ----GGWDAHNVTEDADLGFRLWRAGGRLGVIRHPTHEPP-------------------- 301 Query: 313 REYFPDTFSTAVRQKSRWIIGIVF------QGFKTHKWTSSLTLNYFLWRDRKGAISNFV 366 P + + Q++RW+ G + + F + +W + L + A + + Sbjct: 302 ----PGGLGSWLPQRTRWLKGFMQTLGVHTRSFGSLRWQGVVALLMTILVSLASAAVHAI 357 Query: 367 SFLAMLVMIQLLLLLAYES-LWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 S + ++ + E + L + + +AW+ L+ + R+ + Sbjct: 358 SLAWVTALVLVAAAAGLEPRASLFSLGVLGLGTVAAWISALIGARRAGLSYRMSDAAMA- 416 Query: 426 TGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 +W L +A + A ++ AW+KT H Sbjct: 417 --------------PFYWSLL--TLAFFHAFVRLFYE----PFAWNKTRH 446 >UniRef50_C6CWA2 Glycosyl transferase family 2 n=4 Tax=Bacillales RepID=C6CWA2_PAESJ Length = 412 Score = 166 bits (421), Expect = 2e-39, Method: Composition-based stats. Identities = 80/474 (16%), Positives = 155/474 (32%), Gaps = 90/474 (18%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 I + ++ + GL F+ W R+ +L P +K A++ Sbjct: 2 FNTIMLVFQIVFALLGLYQLFLTCFGWHRK---------------KEDLSHPPQKTFALL 46 Query: 75 VPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 V A NE V+G + E E + IFV T V+ + V+ V Sbjct: 47 VAAHNEEQVVGALIENLLKLKYPRELFDIFVICDNCTDGTVDIVNS----YDGVYACVRN 102 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLI 192 K + +L + + +S + G + DA+++++ L+ + N L+ +I Sbjct: 103 NKNQRGKGYAVEWMLKELWKMPKS----YDGVAIFDADNLVATDFLQYMNNDLINGHRVI 158 Query: 193 QIPV--YPFEREW---THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 Q + W + + + + +L ++ + L G G CF + + Sbjct: 159 QGYLDTKNPNDSWISSANAINYWFCNRLWQLPRTNLGLANFL------GGTGMCFDAKLL 212 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 + + SL ED + R ++G+ F V + K Sbjct: 213 QEM-------GWGATSLVEDLEFTVRCIQRGIYPKFNFEAKVFDEK-------------- 251 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKT----HKWTSSLTLNYFLWRDRKGAIS 363 P TF + RQ+ RW+ G F + W + D + Sbjct: 252 ---------PITFQASARQRLRWMQG-HFDVTRKYMLPLLWQG-IKERSMTKIDASLYVF 300 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 N ++LA + ++ + L + S+++ WL+ M +Q + Sbjct: 301 NAYNYLAGF---FIAAIIWGDMLLFGGNNVESVYNLLP-----FWLSIPYMAYVFIQIPL 352 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANW--RALKQVLQHGDPRRVAWDKTTH 475 Y LR+ + F +W + + + W T H Sbjct: 353 ---SMYMAKVPWKLYLRIP--TFLLFTVSWWPITVHAFFTQNNKK---WSHTQH 398 >UniRef50_C6XAP3 Type II secretion system protein E n=1 Tax=Methylovorus sp. SIP3-4 RepID=C6XAP3_METSD Length = 816 Score = 166 bits (421), Expect = 2e-39, Method: Composition-based stats. Identities = 50/281 (17%), Positives = 111/281 (39%), Gaps = 35/281 (12%) Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALA 538 ++R + LG+ L + ++I+E+QL AL + E G+ LG ++ G++ + L LA Sbjct: 209 HQESRPILKLGEALRQLELISEDQLQHALNKQKENRGIPLGRILVDMGIVDEQTLKGTLA 268 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 ++ G+ + S+ + + I + A VA + ++PL + L+V ED ++ ++ + Sbjct: 269 KKLGIPYVSLSKFNFDPNAIRLIGAPVARKHLLIPLCMYEGALVVAFEDPMNVKAIDEVR 328 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYARR-----------------RGHDPRAMLYNAVQH 641 K + R IVT + +Y R + + M + + Sbjct: 329 FLTQMKTLPAMASREDIVTAIDSFYGRSGAFEFSKSKDDMLDFDLKSSNVAGMQIDDLAT 388 Query: 642 QWLTEQQAGEIWRQYVPH-------QFLFAEILTTLGHINRSAINVLLLRHERSSLPLGK 694 + +E+ + + P L +++ S I++ Sbjct: 389 KLFSEENSMQFESAEEPVAESDNTLVQLVNKMILDAYQDGVSDIHIETY---PDRRNTQV 445 Query: 695 FLVTEGVISQETLDRVLTIQRELQVSMQS-LLLKAGLNTEQ 734 +G TL + L I + ++ S + + + L+ + Sbjct: 446 RFRKDG-----TLVQYLEIPSNFRNALISRIKIMSQLDISE 481 >UniRef50_A5CDZ1 Putative glycosyl transferase, group 2 n=2 Tax=Orientia tsutsugamushi RepID=A5CDZ1_ORITB Length = 583 Score = 166 bits (420), Expect = 3e-39, Method: Composition-based stats. Identities = 68/424 (16%), Positives = 133/424 (31%), Gaps = 70/424 (16%) Query: 71 LAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 I+VP ++E + ++ + + + +D T ++ + H Sbjct: 219 YTILVPLYHEVEKLRDIVKAIELLNYPKNRIEVKIIIEEDDVYTMLELTTM-HLAHYFHV 277 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLFNYL 185 + P +K LN ++ I ++DAED P +L F L Sbjct: 278 IKVPFSFPQTKPKALNYAMNYIVG---------EYITVYDAEDSPEPDQLLKVIYHFQNL 328 Query: 186 VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 IQ + + + T + I E+ + L V G + Sbjct: 329 QPDYQCIQARINFYNKNENVLTKLMSI-EYCLWFDFFLYGLTCLGLPVTLGGTSNHCRAK 387 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 + +L +D ++TED D+G R+ G + EA + Sbjct: 388 MLKSL------GYWDAYNVTEDADLGLRIYIAGFKTAVIDSYTYGEAVIDCKGWLH---- 437 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNF 365 Q+SRWI G + F + ++ Sbjct: 438 --------------------QRSRWIKGFIQTSFVFMSYNKNIRNR-------------- 463 Query: 366 VSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN-FGLMVNRIVQRVIF 424 + A + + +L L+ W I L T+LW N + V I Sbjct: 464 LGLCANICICLFILFSPLMFLFIPLWLISGIIDNDCTLGTILWYNMLFALAYMHVMSWIA 523 Query: 425 VTGYYG-----LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS 479 + G Q LL + ++++ +A+++A+ ++ + W+KT H Sbjct: 524 LCRIKGHWSNLTLQDLLCFIIWPLYSMLHVIASYKAIFELC----VKPFKWNKTKHGVSR 579 Query: 480 VTGD 483 + + Sbjct: 580 ININ 583 >UniRef50_C7M1J2 Glycosyl transferase family 2 n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M1J2_ACIFD Length = 431 Score = 166 bits (419), Expect = 4e-39, Method: Composition-based stats. Identities = 79/413 (19%), Positives = 129/413 (31%), Gaps = 63/413 (15%) Query: 71 LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 +A+++PA +E V+G E Y+ I +D T + + P+ +V Sbjct: 54 MAVLIPARHEETVLGATLER-LARQPYDALRIIAIVGHDDQATHAVAERAASAHPDRIEV 112 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM---ELRLFNYLVE 187 V P SK L IT E + + + DAED ++ + Sbjct: 113 VVDHHWPKSKPAAL------ITGMEAAGSAELIAIV--DAEDDVAEGFFALAAAEFAMRP 164 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 D++Q V + F + + E+ + +P +A G VP G F + Sbjct: 165 GLDVLQGGVLLVNLSSSWFATRS-AVEYYLWYSSRLPW-QARHGVVPLGGNTCVFRSHTL 222 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 + +D +LTED D+G RL G R Sbjct: 223 HEV------GLWDPNALTEDADMGIRLATCGAQIA---------------------VRFE 255 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQ----GFKTHKWTSSLTLNYFLWRDRKGAIS 363 + +E P + VRQ++RW G + ++ W + L + A++ Sbjct: 256 EALATQEETPLSLRAFVRQRTRWDQGFIQVLRKGSWRKLPWRRRALALFTLAAPFQQALT 315 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 + +AML AW L L L L Sbjct: 316 GVLIPVAML-----------------AWGLTRRLPVDLVLFAFLSLFAELGALSFEVIAA 358 Query: 424 F-VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + G + L L G + + AL L W KT H Sbjct: 359 ARLRRLRGERPRIRDALSLVLGLIPYQLVLVTALLVALVRELRGERGWAKTEH 411 >UniRef50_Q7UE44 General secretion pathway protein E n=1 Tax=Rhodopirellula baltica RepID=Q7UE44_RHOBA Length = 587 Score = 163 bits (412), Expect = 3e-38, Method: Composition-based stats. Identities = 56/255 (21%), Positives = 103/255 (40%), Gaps = 17/255 (6%) Query: 487 LRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 ++ +G IL+E ++T EQ+D A + G+ +G ++ Q LIS QL AL+EQ V + Sbjct: 1 MKRIGDILVELNILTNEQMDAAFAGKPRGVMIGDWLVRQSLISNAQLGAALSEQFSVPFV 60 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ID + + +P A A + + + + L + D ++A G K+R Sbjct: 61 DIDFSSVNPQVARLLPEDFARSQASVAIDVSDRMLTLAMVAPDDIETIAEAELMTGYKIR 120 Query: 607 YVIVLRGQIVTGLRHWYARR---RGHDPRAMLYNAVQHQWLTEQ---QAGEIWRQYVPHQ 660 V+ L + L Y R R + +TE+ + ++ P Sbjct: 121 PVVALEDDVRDLLNRIYDDRAFARQTIVDMKFAEMAESGEVTEEDELAMSAVSQEDAPVV 180 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L IL+ S I++ E + +G L V+TI ++ S Sbjct: 181 KLVQAILSGAVSAGASDIHL-----EPHKPEMRVRYRVDG-----ELQVVMTIPNHIEDS 230 Query: 721 MQS-LLLKAGLNTEQ 734 + S + + ++T + Sbjct: 231 VISRIKVMGDMDTTE 245 >UniRef50_B9M6X5 General secretory system II protein E domain protein n=2 Tax=Geobacter RepID=B9M6X5_GEOSF Length = 389 Score = 162 bits (410), Expect = 4e-38, Method: Composition-based stats. Identities = 38/142 (26%), Positives = 73/142 (51%), Gaps = 2/142 (1%) Query: 488 RPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 LG++L+E+ +IT+ +LD L+++ + G ++G +++ G I E+LAQ L+++ GV Sbjct: 3 LKLGEMLVESGIITQAELDETLKSQVIFGGKIGTNLIEMGYIEEEELAQFLSKKLGVPCA 62 Query: 547 SIDA-WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 + + + +P + Y V+PL L+ +L V ED + S+ ++ G V Sbjct: 63 GNEQLINLHPGALKLIPKEIVRKYRVVPLGLDKKKLYVAMEDPSNLASIDEISFMTGFIV 122 Query: 606 RYVIVLRGQIVTGLRHWYARRR 627 +I I+ L Y +R Sbjct: 123 MPLIATELSIILALEKHYGIKR 144 >UniRef50_C1XWL4 Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB n=4 Tax=Bacteria RepID=C1XWL4_9DEIN Length = 888 Score = 162 bits (410), Expect = 4e-38, Method: Composition-based stats. Identities = 58/314 (18%), Positives = 112/314 (35%), Gaps = 21/314 (6%) Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFM--ANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 V + W + N R + ++Q G R A + SV Sbjct: 276 VARLLDKPAKFMLATPKVWEGIFNKAYPEKARLGETLVQKGKIDREALQ----EALSVQR 331 Query: 483 DTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQN 541 RPLG++L+E + E ++ +L + R G RL +++ G I E LA++LA Q Sbjct: 332 RLGKTRPLGEVLVELGYVKPEDIEESLQKQRQGGGRLEDTLIQSGKIKPEMLARSLAAQL 391 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 G + S++ +P + Y V P +EN L+V +D + ++ L Sbjct: 392 GYPYIDPLEQPPDPSVMMMVPEATVRRYHVFPHHMENGTLVVLMKDPRNIFAIDDLKMIT 451 Query: 602 GRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQF 661 R++ + I + Y G D + + + E+ + Sbjct: 452 KREILPAVSTETAINKLIERSYGG--GGDLDELTKEFEKKKKQEEEVSTSALDDNAV-VR 508 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM 721 L I+ S I++ E + + +G L + + + ++ Sbjct: 509 LVNNIIREAYLQEASDIHI-----EPRQQEILVRIRVDG-----NLREYMKLPKGAGPAI 558 Query: 722 QS-LLLKAGLNTEQ 734 S + + A L+ + Sbjct: 559 ASRVKIMANLDIAE 572 Score = 132 bits (331), Expect = 6e-29, Method: Composition-based stats. Identities = 50/240 (20%), Positives = 93/240 (38%), Gaps = 20/240 (8%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALA 538 + T + LG LL+ ++ +E+L A+ E G L + GL+S ++AQA+ Sbjct: 1 MHVLTIGDKRLGAALLDMGLLEDEELQKAIERHREIGGSLAEIVAEMGLLSERRVAQAIE 60 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 E G+ + +IPS + +PA A +P + L V + +D + L L Sbjct: 61 EIFGIPLVELSEVEIPSEAKSLIPAEKARDLEAIPFAFDGRLLRVALLNPLDNLVLEELE 120 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVP 658 G+ + R L Y + + P Sbjct: 121 DLTGQIIEPYQTTRASFRYALAKHYP-------------------ELGLEVPAPPKAATP 161 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 + ++L G ++ A+ L E+S LG+ L+ +G++S+ L + L Q ++ Sbjct: 162 AEVKLGDLLVKKGWLSPQALQAALAEQEKSGELLGRVLMQKGLVSELQLYQALAEQAGIE 221 Score = 130 bits (326), Expect = 3e-28, Method: Composition-based stats. Identities = 49/232 (21%), Positives = 90/232 (38%), Gaps = 12/232 (5%) Query: 489 PLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG +L++ ++ + L AL G LG ++ +GL+S QL QALAEQ G+ + + Sbjct: 165 KLGDLLVKKGWLSPQALQAALAEQEKSGELLGRVLMQKGLVSELQLYQALAEQAGIEFLN 224 Query: 548 -----IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 D + + A + AL Y +P+ +E + V D + R + Sbjct: 225 ELLKEGDLPEPQPEVTALFLRTDALRYQAVPVDMEGKTVRVILADPRHR---EEVARLLD 281 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 + ++++ Y + + + + L E + ++ + Sbjct: 282 KPAKFMLATPKVWEGIFNKAYPEKARLGETLVQKGKIDREALQEALS---VQRRLGKTRP 338 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 E+L LG++ I L + + L L+ G I E L R L Q Sbjct: 339 LGEVLVELGYVKPEDIEESLQKQRQGGGRLEDTLIQSGKIKPEMLARSLAAQ 390 >UniRef50_Q1J1R8 Tfp pilus assembly pathway, ATPase PilB n=7 Tax=Bacteria RepID=Q1J1R8_DEIGD Length = 891 Score = 162 bits (409), Expect = 6e-38, Method: Composition-based stats. Identities = 54/285 (18%), Positives = 111/285 (38%), Gaps = 26/285 (9%) Query: 458 QVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGL 516 Q++Q G R + V ++PLG++++E E++D AL + G Sbjct: 308 QMVQQGSLSRAQL----REALQVQARGGKVKPLGEVIVELGFARAEEIDAALQKQNAGGG 363 Query: 517 RLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL 576 RL +++ G +S E LA++LA Q G + + +P + A Y V+P+RL Sbjct: 364 RLEDTLVQSGKLSPEMLARSLAAQLGYEYLDPVQNPPDPQVALMIPEATARRYTVVPVRL 423 Query: 577 ENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLY 636 + + L+V +D + +L L GR++ ++ IV + ++ G+ A L Sbjct: 424 QGEALVVAMKDPRNVFALDDLKLITGREIVPAVMSEKDIVRLIERYF----GNQDMANLN 479 Query: 637 NAVQHQWLTEQQAGEIWRQYVPH------QFLFAEILTTLGHINRSAINVLLLRHERSSL 690 + + T + E + + ++ S I++ E + Sbjct: 480 QRLAAESKTREARKEADLDFSAGLDDNAVVRVVDNLIREAALQEASDIHI-----EPTES 534 Query: 691 PLGKFLVTEGVISQETLDRVLTIQR-ELQVSMQSLLLKAGLNTEQ 734 + +G L + + Q + + + GL+ + Sbjct: 535 AVRVRYRVDG-----ALREQPELPKGSAQSILARIKIMGGLDIAE 574 Score = 153 bits (385), Expect = 3e-35, Method: Composition-based stats. Identities = 63/248 (25%), Positives = 100/248 (40%), Gaps = 26/248 (10%) Query: 478 PSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQA 536 G + S LGQ L+ +I E QL AL + G LG ++ QGL+S +QL + Sbjct: 155 EGAAGSSESGGKLGQRLISRGLINEAQLQVALDVQQQTGEALGHILVTQGLLSEDQLYEV 214 Query: 537 LAEQNGVAWE-SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA 595 LAEQ G + + +Q ++ + + AL + +P+ + V D Sbjct: 215 LAEQAGAVYLRNPRDFQPGEEVLGSLLRADALRLSAVPVDETAQGVTVVVSDP---RRRD 271 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWYARR---------RGHDPRAMLYNAVQHQWLTE 646 L +GR V+ V+ G + + +Y +R +G RA L A+Q Q Sbjct: 272 ELEALIGRPVQLVLARPGDVEALIERYYPQRGRLGEQMVQQGSLSRAQLREALQVQA--- 328 Query: 647 QQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQET 706 + G++ E++ LG I+ L + L LV G +S E Sbjct: 329 -RGGKVKP--------LGEVIVELGFARAEEIDAALQKQNAGGGRLEDTLVQSGKLSPEM 379 Query: 707 LDRVLTIQ 714 L R L Q Sbjct: 380 LARSLAAQ 387 Score = 134 bits (336), Expect = 2e-29, Method: Composition-based stats. Identities = 55/228 (24%), Positives = 98/228 (42%), Gaps = 18/228 (7%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 R LG ILLE +T+ L AL E G RL ++ G + +++A+A+ E G+ Sbjct: 8 RRLGAILLEQGYVTDTDLQKALVRHAEVGGRLADILIESGQVGEKRIARAIEEALGIPLV 67 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++ ++ +A + A A P LE L V D + V++ AL G + Sbjct: 68 NLLVVTPDAAALAAIRAETAKQMQAFPFALEGQTLRVALVDPLSSVAIEALEDDSGLNIE 127 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEI 666 LR Q++ + +Y ++ + + + G+ + Sbjct: 128 PYQALRDQVLWSIATYYPE------LGLMPVLPEGAAGSSESGGK-----------LGQR 170 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 L + G IN + + V L +++ LG LVT+G++S++ L VL Q Sbjct: 171 LISRGLINEAQLQVALDVQQQTGEALGHILVTQGLLSEDQLYEVLAEQ 218 >UniRef50_A8MGF8 Type II secretion system protein E n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MGF8_ALKOO Length = 560 Score = 161 bits (407), Expect = 8e-38, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 93/241 (38%), Gaps = 13/241 (5%) Query: 497 NQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPS 555 +T+ QLD AL + G +LG ++ + + + + + L Q G+ +D +++ Sbjct: 15 AGKLTQSQLDNALDIQKKTGKKLGEIVVSEKYTTEDDIIEVLEFQLGIPHVDLDKYEVNP 74 Query: 556 SLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQI 615 ++ +P ++ Y ++ + ++ LIV D ++ +L + V ++ VI + ++ Sbjct: 75 TVATLIPENIVRRYELIAIDKKDTILIVAMTDPLNIFALDDVKLFVKSDIQPVISTKEKL 134 Query: 616 VTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR-QYVPHQFLFAEILTTLGHIN 674 + + +Y+ + + E+ P L I+ Sbjct: 135 IKAIDKFYSSETTKKALEEFEENFLPINTDDIEESELLEVTTAPIVKLLNSIIEQAVKER 194 Query: 675 RSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTI-QRELQVSMQSLLLKAGLNTE 733 S I++ E + + +G L ++T+ + + + + + +N Sbjct: 195 ASDIHI-----EPYAEDIRVRFRIDG-----DLREIMTLAKNSMSGIVTRIKIIGKMNIA 244 Query: 734 Q 734 + Sbjct: 245 E 245 >UniRef50_B7APV7 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B7APV7_9BACE Length = 566 Score = 161 bits (407), Expect = 9e-38, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 100/259 (38%), Gaps = 17/259 (6%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQN 541 + R LG +L+ +I + QL+ AL+ + E G LG ++ G ++ E++ L E Sbjct: 2 NYRKKIRLGDVLMSRGLINQNQLNMALKEQKEKGRMLGEMLVELGYVTQEKINDILCEML 61 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE---LIVGSEDGIDPVSLAALT 598 + + + + ++ +P V Y ++P+R + + + V D ++ +++ + Sbjct: 62 NIEFIDLQVEEPEENVRDLIPEEVMRKYTLVPMRYDKNNAGVIQVAMADPMNILAMDDIN 121 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRA--MLYNAVQHQWLTEQQAGEIWRQY 656 G++V + I + +++ + + + E++ + + Sbjct: 122 IITGKQVAPYLANASDIRAYFDRVFGKKQAQNIAEMYKKEQGLVQEESEEEKLRKEDVEN 181 Query: 657 VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRE 716 P L I+ S I++ E + +GV L V+ + Sbjct: 182 APIVQLVNSIIEQAARQRASDIHI-----EPFEESIRVRYRVDGV-----LREVIEYDKS 231 Query: 717 -LQVSMQSLLLKAGLNTEQ 734 L L + +G++ + Sbjct: 232 LLGAITARLKIMSGMDISE 250 >UniRef50_Q1D9E1 General secretory pathway protein E n=17 Tax=Proteobacteria RepID=Q1D9E1_MYXXD Length = 605 Score = 161 bits (407), Expect = 1e-37, Method: Composition-based stats. Identities = 61/257 (23%), Positives = 106/257 (41%), Gaps = 21/257 (8%) Query: 488 RPLGQILLE-NQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 RPLG+IL +TEE+L AL + E G R+G +++ +S E +A+AL Q + + Sbjct: 38 RPLGEILRAIVPSLTEEKLQEALAIQDEKGQRIGEALVGMKAVSEEDVAKALGHQLDLPY 97 Query: 546 -ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 I A ++ + L+ +P + A +LPL LE D + V D +D +L + +G+ Sbjct: 98 LARIFAEEVDAELVKRIPINFAKQSRILPLSLEGDTVAVAVADPLDTAALDHVRVLLGQS 157 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGH------DPRAMLYNAVQHQWLTEQQAGEIWRQYVP 658 V I L I + Y R + +A+ H+ + + + P Sbjct: 158 VSQRIALGSTITDAINSVYDRSVNETEQLVDEMETQDLDAIAHELDEPKDLLDEDDE-AP 216 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 L +L S I++ E L +GV L V+ + Q Sbjct: 217 VIRLVNSVLFRAAKERASDIHI-----EPMERELLVRFRVDGV-----LQEVIKPPKRYQ 266 Query: 719 VSMQS-LLLKAGLNTEQ 734 ++ S + + LN + Sbjct: 267 NAIVSRVKVMGQLNIAE 283 >UniRef50_D2R471 Type II secretion system protein E n=6 Tax=Planctomycetaceae RepID=D2R471_9PLAN Length = 573 Score = 160 bits (405), Expect = 2e-37, Method: Composition-based stats. Identities = 49/251 (19%), Positives = 104/251 (41%), Gaps = 17/251 (6%) Query: 492 QILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 +ILL +V++++QL+ + + L ++ G S E + +A+A+++G + + Sbjct: 9 EILLRRRVVSQDQLNEGRQVAKDTNANLSDVLIRLGYASGEDVMRAVAQEHGREYVDLSE 68 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIV 610 IP +I +P SVA A+LPL + D L V D D ++ L + RKV + Sbjct: 69 VTIPEDVIELVPESVARENAILPLSEDEDSLKVIVSDPYDIDTIEKLRFILNRKVDIALA 128 Query: 611 LRGQIVTGLRHWYARRRGHDPRAML-------YNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 R +I+ + +Y++ G ++L + + + + P L Sbjct: 129 PREKILEAINKYYSQIEGESADSVLQEFTDTAIDFTETEATKVTSNEAVDENSAPIVRLV 188 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 ++ + S I+V E + +G++ + R +R L + Sbjct: 189 QLMIGEAVQLRASDIHV-----EPFEEIVRIRYRIDGILHK----RDSPPRRLLAAIVSR 239 Query: 724 LLLKAGLNTEQ 734 + + A ++ + Sbjct: 240 IKILAKMDIAE 250 >UniRef50_A8URM8 Putative uncharacterized protein n=3 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8URM8_9AQUI Length = 552 Score = 160 bits (405), Expect = 2e-37, Method: Composition-based stats. Identities = 52/252 (20%), Positives = 105/252 (41%), Gaps = 25/252 (9%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + + LG++L E ++++QL+ AL ++ G LG ++ +S ++LAQA+A Q G Sbjct: 2 ARKKLGELLQELGFLSQDQLEVALEVQKLNGESLGEILVDLSFVSPQELAQAIAHQAGRE 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + + + + + + A VLPL+LE+ L + D + + + RK G + Sbjct: 62 FIDLSLYPPSLDALRLIDRNTAKQLEVLPLKLEDGRLKLAVSDPFNVNIIDLVKRKTGLE 121 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFA 664 V + R I+ + +Y L ++ + + GE Sbjct: 122 VDIYVADRESILRSIEIYY-----EQLERPLEEQIE-EIVKRAPTGEAD------VPKLV 169 Query: 665 EILTTLGHI-NRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 ++ G I + +++ SL F +GV L +I EL S+ S Sbjct: 170 DLFMNEGIIERATDVHI-----SPESLASHVFYRIDGV-----LHHYFSIPAELHPSLVS 219 Query: 724 -LLLKAGLNTEQ 734 + + +G++ + Sbjct: 220 RVKIISGMDISE 231 >UniRef50_Q0F0L3 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F0L3_9PROT Length = 804 Score = 159 bits (403), Expect = 3e-37, Method: Composition-based stats. Identities = 57/267 (21%), Positives = 113/267 (42%), Gaps = 13/267 (4%) Query: 472 KTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISA 530 + TH ++ +R LG+ILLE ++I E L AL + G RLG +L +I+ Sbjct: 234 QDTHLDSALHMQSRRKMRLGEILLEAKLINEADLKNALDEQKAHGHRLGEILLSTEVITE 293 Query: 531 EQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGID 590 +QL LA++ + ++ ++I + A + +V Y +LP+ + L + D + Sbjct: 294 DQLLDVLAKKFRLPTVDLETYEINPAAGALIERAVVEKYGILPIDTDAHSLTIALSDPMG 353 Query: 591 PVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAG 650 + ++ K G+KV V+ Q+ + + D + + ++ E Q+ Sbjct: 354 LEAYDTISFKTGKKVHEVMAKASQLELKIAQFLKEDLADDELSCEFLHQENDEEDEPQSA 413 Query: 651 EIWRQY---VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETL 707 + Q P L I+ N S I++L + + L L + ++S+ +L Sbjct: 414 QEMTQSAEDAPIVRLVNRIIRNGLRKNASDIHIL---PQAKKITLAYRLNGQ-LLSENSL 469 Query: 708 DRVLTIQRELQVSMQSLLLKAGLNTEQ 734 DR Q + + G++ + Sbjct: 470 DRGSHKQ-----IAARIKILCGMDISE 491 Score = 64.0 bits (154), Expect = 2e-08, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 66/236 (27%), Gaps = 60/236 (25%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALRNRVEGLR-LGGSMLMQGLISAEQLAQALAEQNG 542 LG +L++ +V+ EE + AL ++ + LG + QG +S L +AL Q Sbjct: 129 QEKPERLGDMLIDQEVLGEEDVRKALDFQISSMPSLGNVLKDQGKVSENDLDEALKTQ-K 187 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + + + + Sbjct: 188 LQRMRLGDLLVHHEFV-------------------------------------------- 203 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ-WLTEQQAGEIWRQYVPHQF 661 + L + L + + + + Sbjct: 204 --------TESDVNEALEE-----QSRSVGTPLGKILIDTGKVQDTHLDSALHMQSRRKM 250 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 EIL IN + + L + LG+ L++ VI+++ L VL + L Sbjct: 251 RLGEILLEAKLINEADLKNALDEQKAHGHRLGEILLSTEVITEDQLLDVLAKKFRL 306 >UniRef50_C1AA14 Type IV pilus assembly protein PilB n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AA14_GEMAT Length = 634 Score = 158 bits (400), Expect = 6e-37, Method: Composition-based stats. Identities = 48/270 (17%), Positives = 97/270 (35%), Gaps = 22/270 (8%) Query: 477 FPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLA 534 P+ +RS LG +L+ +++ E L AL+ + G RLG +++ G++ ++ Sbjct: 24 APAAPLASRSTDRLGDLLVREGLLSRENLTKALQEQSAYPGQRLGLTVVRLGMVPETEVV 83 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSL 594 + LA Q + + +++ + L+ +PA +A + VLPL+ + +L V D + Sbjct: 84 RMLARQYRMPAVDLARFEVDTRLLKLIPAELASKHTVLPLKRDGRQLTVAIADPTAMAVV 143 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGH---------DPRAMLYNAVQHQWLT 645 L + V+ + + Y H + + Sbjct: 144 DDLKFITRYDIVPVLAGEYSMRAAIEKHYEANEIHMQSLLQDIAADDDDIEVLDNQDDMV 203 Query: 646 EQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQE 705 + P L IL H S I+ E L +G Sbjct: 204 DASVLAAQVDEAPVVKLINAILGDAVHKGASDIH-----FECFEHELRVRYRIDG----- 253 Query: 706 TLDRVLTIQRELQVSMQS-LLLKAGLNTEQ 734 L V+ +++ ++ S + + LN + Sbjct: 254 ALQEVMKPPMKMRAALISRFKIMSSLNIAE 283 >UniRef50_Q15ZI3 Type II secretion system protein E n=119 Tax=Proteobacteria RepID=Q15ZI3_PSEA6 Length = 637 Score = 158 bits (400), Expect = 6e-37, Method: Composition-based stats. Identities = 55/254 (21%), Positives = 99/254 (38%), Gaps = 11/254 (4%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQN 541 ++ LG +L++ +I+EEQL L + G +LG ++ G ++ QL L++ Sbjct: 2 KPKAKIRLGDLLVQEGIISEEQLMQTLSAQKQSGRKLGYMLIELGFMTENQLLTFLSQHL 61 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 GV + +++ + +P A Y L L + D L+VG D D +L L+ + Sbjct: 62 GVPLIDVTQYRVSVEAVLLLPEVQARRYRALVLDDKGDHLLVGMSDPADLAALDILSGVL 121 Query: 602 GRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAM-LYNAVQHQWLTEQQAGEIWRQYVPHQ 660 + V+ +V Q+ +Y R A L Q + G Q Sbjct: 122 PKPVKVAVVSDAQLFQAYDRFYRRTEDIASFAQELAEEYQDDEEFDFDTGVDNEQDTAVA 181 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L I S I++ E S L L +GV+ + ++ + Sbjct: 182 RLLQSIFEDALQTKASDIHI-----EPDSEMLRIRLRVDGVLQE----NIIKEKNIASAL 232 Query: 721 MQSLLLKAGLNTEQ 734 + L L +GL+ + Sbjct: 233 VLRLKLMSGLDISE 246 >UniRef50_B5E974 General secretory system II protein E domain protein n=2 Tax=Geobacter RepID=B5E974_GEOBB Length = 391 Score = 158 bits (399), Expect = 8e-37, Method: Composition-based stats. Identities = 35/141 (24%), Positives = 71/141 (50%), Gaps = 2/141 (1%) Query: 489 PLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG++L++ IT QL+ L+ + G R G +++ G + ++LA L+++ G+A S Sbjct: 2 RLGELLVDAGKITPTQLEETLKGQAIFGGRFGTNLVEMGYLDEQELAHFLSQKTGIAHTS 61 Query: 548 IDA-WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + +IP ++ +P Y V+P+ L N +L + D D ++ + G V Sbjct: 62 PEQLMEIPPHVVGAVPEEYVRKYRVMPVALNNRKLTLAMLDPSDFQAIDEIAFATGYIVV 121 Query: 607 YVIVLRGQIVTGLRHWYARRR 627 VI ++++ + +Y +R Sbjct: 122 PVIAPELRMLSAMEKYYGIKR 142 >UniRef50_Q39ZG4 General secretory system II, protein E-like n=2 Tax=Geobacter RepID=Q39ZG4_GEOMG Length = 370 Score = 157 bits (397), Expect = 1e-36, Method: Composition-based stats. Identities = 43/142 (30%), Positives = 75/142 (52%), Gaps = 2/142 (1%) Query: 488 RPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 LG++L++ IT +QLD ALR++ + G RLG +++ G I E+LA+ L+E+ V Sbjct: 3 LRLGEMLVKTGRITPDQLDEALRSQVIFGGRLGTNLVEMGCIDEEELARVLSEKLRVPCV 62 Query: 547 SIDA-WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 D I ++I +P V Y V+PLRLEN L + D D ++ + + G + Sbjct: 63 DPDELMNISPAIIEAVPLEVVEQYQVVPLRLENRRLFLVMADPSDLPAIDQIAFRTGHVI 122 Query: 606 RYVIVLRGQIVTGLRHWYARRR 627 ++ +++ L +Y +R Sbjct: 123 VPLVAPEIRLLMALEKYYGIKR 144 >UniRef50_B2A7G1 Type II secretion system protein E n=5 Tax=Firmicutes RepID=B2A7G1_NATTJ Length = 571 Score = 157 bits (397), Expect = 1e-36, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 105/259 (40%), Gaps = 18/259 (6%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 + + LG +LLE+ ITEE L AL ++ G +LG S++ G+I+ E++ + L Q G Sbjct: 3 RQKKQRLGDLLLESGAITEEDLKQALDHQNKSGQKLGASLVDLGIITEEEIIEVLEFQLG 62 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + S+ + +PA +A Y VLP+ + +L++ D ++ V++ + G Sbjct: 63 IPHVSLSQYDTNRETATLIPAYLAERYQVLPIDNRSGKLVLAMGDPLNVVAIDDVKMATG 122 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEI------WRQY 656 +V VI +I + + + + + + A Sbjct: 123 MEVEPVIASPREIEGEINRHFGIQDSVEKAIEEIEGSAEEEAESEIAATEEEELSNLETN 182 Query: 657 VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRE 716 P + +++ S I++ E + + +GV L V T R Sbjct: 183 APVVKVVNSLVSQAYEQGASDIHI-----EPTKQGMQIRYRIDGV-----LHNVATPPRY 232 Query: 717 LQVSMQS-LLLKAGLNTEQ 734 + + S + + AG++ + Sbjct: 233 AKDLLISRVKIMAGMDITK 251 >UniRef50_A6M148 Type II secretion system protein E n=18 Tax=Clostridiaceae RepID=A6M148_CLOB8 Length = 564 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. Identities = 46/252 (18%), Positives = 101/252 (40%), Gaps = 12/252 (4%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 R LG IL+ IT QL AL+++ G +LG +L +I+ E + +A+ +Q G+ Sbjct: 4 EKRRLGNILVNAGKITGYQLQEALKSQRTLGKKLGEILLDSKIITEEDIIEAIEQQTGIK 63 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 ++ I +P ++ Y ++P +N+++ V D ++ ++ + G + Sbjct: 64 KVDLNTINFDRKAITLIPQNLCDKYLLIPFGFDNNKIKVALADPLNIFAIDDVAISTGFE 123 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQ--YVPHQFL 662 + I + I + +Y+ ++ ++ L +QA + + P + Sbjct: 124 IESFISRKADIKKFIGIYYSSQQVNNAAIQLAKESTKAVKNGKQAIDEMSEVNSAPVVKM 183 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ 722 + + S I++ E + +G + L I+ L + Sbjct: 184 VDYLFRNSVEMKTSDIHI-----EPFENEIRIRYRIDGKLQTV---NTLGIE-SLGPLVT 234 Query: 723 SLLLKAGLNTEQ 734 + + AGLN + Sbjct: 235 RIKILAGLNIAE 246 >UniRef50_C6E8N7 General secretory system II protein E domain protein n=3 Tax=Geobacter RepID=C6E8N7_GEOSM Length = 345 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. Identities = 52/224 (23%), Positives = 91/224 (40%), Gaps = 14/224 (6%) Query: 489 PLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG++LL+ +TE+QL+ L + + G RLG +++ GL+ E+LA+ L+EQ GV Sbjct: 4 RLGEMLLKVGTLTEDQLEQVLNAQSIYGGRLGTNLVEMGLVEEEELARLLSEQLGVPCAH 63 Query: 548 I-DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + IP SL+ P + Y VLPL L+ L V + D +L + G + Sbjct: 64 PSELSSIPESLLKMFPLELVQRYRVLPLALDGKRLTVAMTNPSDFKALEDIAFVTGMIII 123 Query: 607 YVIVLRGQIVTGLRHWYARRR---------GHDPRAMLYNAVQHQWLTEQQAGEIWRQYV 657 + ++ L + +R G R A + G + + Sbjct: 124 PRVCSELRLSIALERIFGVKRPMRYIPVEGGARSRFAATLAERGSADPAWDGGAV--CHT 181 Query: 658 PHQFLFAEILTTLGH-INRSAINVLLLRHERSSLPLGKFLVTEG 700 + ++ L + S + +L + G FL +G Sbjct: 182 SERVSLEDLSERLAKAVGESEVVQAVLSYLAGEFDRGAFLRLKG 225 >UniRef50_C6MLL4 General secretory system II protein E domain protein n=1 Tax=Geobacter sp. M18 RepID=C6MLL4_9DELT Length = 450 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. Identities = 40/144 (27%), Positives = 69/144 (47%), Gaps = 2/144 (1%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 LG +L++ IT QLD L+ + G R G +++ G + LA+ L+++ GV Sbjct: 2 QKMRLGDMLVQAGKITPAQLDETLKGQAIFGGRFGTNLVEMGYLDEHDLAEFLSQKTGVP 61 Query: 545 WESIDAW-QIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + + IP+++I +P A Y +P+ L N +L + D D +L + G Sbjct: 62 HAAPEQLLDIPANIIKLIPFDCAKKYRAVPIALNNRKLTLAMVDPTDLHALDEIAFATGY 121 Query: 604 KVRYVIVLRGQIVTGLRHWYARRR 627 + VI +IVT L +Y +R Sbjct: 122 IIVPVIAPELRIVTALEKYYQIKR 145 >UniRef50_D2R473 Type II secretion system protein E n=6 Tax=Planctomycetaceae RepID=D2R473_9PLAN Length = 572 Score = 155 bits (392), Expect = 5e-36, Method: Composition-based stats. Identities = 56/263 (21%), Positives = 103/263 (39%), Gaps = 24/263 (9%) Query: 486 SLRPLGQILLENQVITEEQLDTALR--NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 ++R +GQI ++ I+++QL+ L + G LG L++ EQL QALAEQ G+ Sbjct: 2 AIRRIGQIFVDMGFISDDQLEMLLEEQQQRPGTLLGKLAQEMSLVNEEQLVQALAEQMGM 61 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + IP ++ ++ S+A Y V+P++ ++EL V + D + L +G Sbjct: 62 QVVELGDITIPGDVLHKVTESMAQLYRVIPIKFSSNELTVATCDPQNITIQDELRSMLGY 121 Query: 604 KVRYVIVLRGQIVTGLRHWYARRR-----------GHDPRAMLYNAVQHQWLTEQQAGEI 652 +R VI I L +++ + +AV + + E Sbjct: 122 DIRVVIASETDIKKTLDRYFSSDKDTVDSIVGELEADSELKKAMDAVAKNGAVDLTSVEA 181 Query: 653 WRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLT 712 P + L +L + S I+ E + +GV L ++ Sbjct: 182 LADSAPVRKLLNMVLLLAIKDHASDIH-----FEPFEDEFRIRIKADGV-----LFEMVP 231 Query: 713 IQRELQ-VSMQSLLLKAGLNTEQ 734 R L + + A L+ + Sbjct: 232 PPRHLAFAITTRIKVMANLDIAE 254 >UniRef50_B0VJ41 Type IV pilus biogenesis protein PilB n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJ41_9BACT Length = 569 Score = 154 bits (390), Expect = 9e-36, Method: Composition-based stats. Identities = 54/254 (21%), Positives = 100/254 (39%), Gaps = 18/254 (7%) Query: 489 PLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE- 546 LG IL+ ITEEQL AL + GL+LG +++ G ++ +L +AL +Q G Sbjct: 9 RLGDILVHEGYITEEQLKDALLKQGNFGLKLGETLIKLGYLTENELLEALHKQLGYDVVQ 68 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + + ++++ +P A VL LR E D ++V D + + +L + +G+ ++ Sbjct: 69 DKELMDLDINIVSSIPEPYAKENKVLALREEGDGVVVAMTDPENLIVSDSLEKILGKNIK 128 Query: 607 YVIVLRGQIVTGLRHWYARRRG----HDPRAMLY-NAVQHQWLTEQQAGEIWRQYVPHQF 661 V++ + + +Y R D AV A P Sbjct: 129 PVLIGNSSLQDAIEKYYKSIRTTTEVEDAVGGFEFVAVDEDENEITIAAATEDVDAPVVK 188 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM 721 + I+ + I++ E + +G L V+T + S+ Sbjct: 189 MINLIINEAIKAGATDIHI-----EPLTKISRIRYRVDG-----ALREVMTPPIGMHPSL 238 Query: 722 QSL-LLKAGLNTEQ 734 SL + + LN + Sbjct: 239 ISLVKVMSKLNIAE 252 >UniRef50_Q3SKS0 Pilus assembly pathway ATPase PilB n=3 Tax=Proteobacteria RepID=Q3SKS0_THIDA Length = 577 Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats. Identities = 51/256 (19%), Positives = 100/256 (39%), Gaps = 17/256 (6%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 R LG +L+E +VI+ LD AL + G RLG ++ GL +AQALA Q + Sbjct: 15 RQKIRLGDLLVEQKVISAADLDIALTAQKKSGRRLGRIIVESGLAGENDIAQALARQLAI 74 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + + + S++ + + A + +PL ++ VG D D + + R + Sbjct: 75 PFVDLRKFNPDPSILQLLGETQARRFRAIPLGRREGDIFVGMADPTDLFAYDEVARLIEG 134 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIW-----RQYVP 658 ++ +V G +++ + Y RR D + + +E + + P Sbjct: 135 GIQLAVVAEGDLLSAIDRLY--RRTDDIHGLTEELARDMGESEASIIGLDALGEGQADAP 192 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 L + +N S +++ E L +GV+ ++T + Sbjct: 193 VVRLLQTLFEDALQVNASDVHI-----EPQEKQLMIRFRIDGVLHRQTEADLRIAP---- 243 Query: 719 VSMQSLLLKAGLNTEQ 734 L + +GL+ + Sbjct: 244 ALALRLKIVSGLDISE 259 >UniRef50_B5YIG2 Type IV-A pilus assembly ATPase PilB n=4 Tax=Bacteria RepID=B5YIG2_THEYD Length = 572 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. Identities = 52/264 (19%), Positives = 113/264 (42%), Gaps = 19/264 (7%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALA 538 T T +G LL+ I+E+QL A ++EG+++G +++ G I+ ++L ++++ Sbjct: 2 STKLTSERLTIGVFLLKKGKISEKQLIDAQAVQKIEGIKIGAALIKLGYITEDELVESMS 61 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 E G ID+++I ++ +P V Y VLP E + + V D + ++L L Sbjct: 62 ELYGYPVFKIDSYKIDPLVVKLLPEDVIRKYKVLPFLREGNIIRVLITDPANEIALEQLK 121 Query: 599 -RKVGRKVRYVIVLRGQIVTGLRHWYARRRG-----HDPRAMLYNAVQHQWLTEQQAGEI 652 G K+ + I + ++ ++ +AVQ +T+ + + Sbjct: 122 FFLSGFKILFYIGKDSDFKNLINKFFGEEGAEIYSKETVHELVESAVQEPSITQPEEEQA 181 Query: 653 WRQY-VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 + P L +I+ S I++ E + +GV L +L Sbjct: 182 ILEVDAPLIRLVNQIIVNAISKRASDIHI-----EPFEDNIYIRYRIDGV-----LHDIL 231 Query: 712 TIQRELQVSMQS-LLLKAGLNTEQ 734 T+ +L+ ++ + + + A ++ + Sbjct: 232 TLPPKLKSALITRIKIMANMDISE 255 >UniRef50_C0QER1 PilB n=10 Tax=Proteobacteria RepID=C0QER1_DESAH Length = 739 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. Identities = 50/277 (18%), Positives = 101/277 (36%), Gaps = 30/277 (10%) Query: 463 GDPRR-VAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRN-RVEGLRLGG 520 G P R A KT D +G++L + IT Q +AL + G RL Sbjct: 5 GQPSRSTALTKTIKDQSGA-----GKVRIGELLSKEGQITSNQFQSALSQHKKTGTRLSS 59 Query: 521 SMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE 580 +L G I E + L + + ++ +P +A Y V PL ++ +E Sbjct: 60 VLLTMGFIDPETIINVLGRIYNYPVVRLADIKPDPKILKLLPFDIAKRYMVFPLGMKGEE 119 Query: 581 LIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQ 640 L+V + D ++ L ++VG+ ++ + ++ R +Y + ++ Sbjct: 120 LVVTMTEPTDTTAVEELQQEVGKTLKISVSTENDVIQAYRDFYKISEEQYREFIHFD--- 176 Query: 641 HQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHE---RSSLPLGKFL- 696 + + V F +++ + + ++ S P+ K + Sbjct: 177 --------DEKEDDEPVTSVEDFGSLVSEAAGELEIEPDDNVSDYDEFRASDAPIIKLVN 228 Query: 697 ------VTEGV--ISQETLDRVLTIQRELQVSMQSLL 725 + +GV I E +R L ++ L S+ + Sbjct: 229 GILIKAINDGVSDIHIEPFERSLQVRYRLDGSLYKAM 265 >UniRef50_C6E6L0 Response regulator receiver protein n=3 Tax=Geobacter RepID=C6E6L0_GEOSM Length = 292 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 89/189 (47%), Gaps = 10/189 (5%) Query: 474 THDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQ 532 T D S+ ++ +PLG+I +E ++T+ ++ + + + +G+RLG + + GL+S E+ Sbjct: 2 TADIESLNTTPQTRKPLGEIFVERGLLTKVSVERLIDHAKSKGIRLGELLEVIGLVSPEE 61 Query: 533 LAQALAEQNGV-AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDP 591 LA+ALA Q + +++ +P +A+ + + PL++++ L + D Sbjct: 62 LAEALAIQYRCRKISDFSKYAYSPAMLRLIPMEMAVKHTIFPLKMDDGRLGLAVADP--- 118 Query: 592 VSLAALTRKVGRKVRY----VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQ 647 ++ L R++ + + + R +I + Y + P V+ L+ + Sbjct: 119 -TMDELFRQIAAQHKVKLILYVATRMEINRAIARHYLGQPATGPEGKTILLVEDDQLSRE 177 Query: 648 QAGEIWRQY 656 +I ++ Sbjct: 178 MVAKILTKH 186 >UniRef50_B3E1K8 Response regulator receiver protein n=5 Tax=Geobacter RepID=B3E1K8_GEOLS Length = 340 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 80/222 (36%), Gaps = 2/222 (0%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 S D + LG IL+ ++I+E+ L+ AL EG +LG + G+I+ +L +AL Sbjct: 58 SPFADINQKKQLGDILVRAKLISEKTLERALERQHTEGKKLGEVLEEMGVITELELVEAL 117 Query: 538 AEQNGVAWE-SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAA 596 Q G P+ I +P + V PL+ ++ L V D D + Sbjct: 118 GRQFGFKTVTDFSNRTYPAETINLLPTEFVMKRLVFPLKHKDMMLAVAITDPFDGETTDM 177 Query: 597 LTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQY 656 + R G +V VI R +I+ + Y + V+ + Sbjct: 178 IARITGLQVVPVIATRKEILEAIARHYLNAPVNPDAGNTILVVEDSPTVATVVQAALVKE 237 Query: 657 VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVT 698 + + + + L +++ + L L Sbjct: 238 GYNVIICKDGIEALKTTLTHRPQLVITDAQMPKLDGHGLLRA 279 Score = 72.1 bits (175), Expect = 7e-11, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 39/85 (45%), Gaps = 5/85 (5%) Query: 488 RPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG IL++ ++I+ + L+ AL + RLG + G+I+ +L +AL Q+ + Sbjct: 3 KQLGDILVDTELISRKTLERALERQKATAKRLGQVLEEMGVITEAELMEALVSQHS-PFA 61 Query: 547 SIDAWQIPSSLI---AEMPASVALH 568 I+ + ++ + Sbjct: 62 DINQKKQLGDILVRAKLISEKTLER 86 >UniRef50_C6E323 Response regulator receiver protein n=3 Tax=Geobacter RepID=C6E323_GEOSM Length = 283 Score = 151 bits (382), Expect = 8e-35, Method: Composition-based stats. Identities = 50/212 (23%), Positives = 90/212 (42%), Gaps = 2/212 (0%) Query: 489 PLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE- 546 PLGQIL+++ +IT + L+ AL R G RLG + G+I+ E+L +ALA+Q+G+ Sbjct: 4 PLGQILVQSGIITVKTLERALARQEGSGKRLGAILEEMGVITPEELVEALAQQSGMEMVK 63 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 I +P L+ +P VA++ V PL + L + D ID +L L R K+ Sbjct: 64 RITVQNVPGELLELVPGEVAINKLVFPLNRQEGVLAIAVSDPIDSETLDLLERHSCNKIV 123 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEI 666 V+ R +I+ ++ Y R + ++ V + + + Sbjct: 124 QVLAAREEILGAVKQHYLRDEASENVSLKILLVDDSAGLSCDVESALKNEGYQVYTARDG 183 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGKFLVT 698 + L +++L + + Sbjct: 184 VEGLKIAFSQRPDLILCDAGAPKMDGYALMRA 215 >UniRef50_C3RLB0 Type II secretion system protein E n=2 Tax=Bacteria RepID=C3RLB0_9MOLU Length = 563 Score = 151 bits (382), Expect = 8e-35, Method: Composition-based stats. Identities = 46/251 (18%), Positives = 102/251 (40%), Gaps = 18/251 (7%) Query: 489 PLGQILLENQVITEEQLDTALRNRVEG--LRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 P+G++L E I +EQL+ AL + RLG ++ G +S Q+ +AL+++ Sbjct: 12 PIGEVLKEYGYINDEQLNVALEAQKSNRSKRLGQHLIDLGFVSEYQMLEALSDKLAEPLI 71 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + ++ + ++P ++A Y ++ + L + +L + + D ++ + + G + Sbjct: 72 ELSEIKVDIDAVQKIPRAMADKYNIIAIDLTDQQLTIVTSDPLNFYGIEDVRLVTGMHLN 131 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDP--RAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFA 664 + + ++ + +Y D L V L E P L Sbjct: 132 VCLATKAEVSKAIDRYYNDVAALDIADDIKLNTIVVEDTLDLFNESEDDT---PVVKLVN 188 Query: 665 EILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ-S 723 +L+ N S I++ E + + +G L LT+Q+ +Q S+ Sbjct: 189 TLLSRGYVNNASDIHI-----EPFEDKVIIRMRVDG-----MLVDYLTLQKNIQNSLIVR 238 Query: 724 LLLKAGLNTEQ 734 + + + L+ + Sbjct: 239 IKILSNLDIAE 249 >UniRef50_C0QQ17 Type IV pilus assembly protein TapB n=2 Tax=Bacteria RepID=C0QQ17_PERMH Length = 558 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 94/228 (41%), Gaps = 35/228 (15%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 S +P+GQ+L E +TEEQ+ AL +++G LG + +S ++A+A+A Q+G Sbjct: 2 SRKPIGQLLKEFGYVTEEQIQVALEVQKIKGGLLGEILQELSFVSPREVAEAIARQSGRP 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + + + + + ++A VLP ++ DEL + + D ++ ++R+ K Sbjct: 62 YIDLSQYPPTRESLRILDKNIAKQLEVLPFEIDKDELHIAMTNPYDINAIDVVSRRTNLK 121 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYV------- 657 V+ + + ++ + Y +L EQ + + Y+ Sbjct: 122 VKVYVADKETLLKSIEIHY-------------------FLLEQPIDQTVKSYIEKAKTGT 162 Query: 658 --PHQFLFAEILTTLGHINRS-AINVLLLRHERSSLPLGKFLVTEGVI 702 F + + I+R+ I++ S F +G++ Sbjct: 163 LGTELPKFIDTVLNHAIIDRATDIHI-----SPESAASHIFFRIDGIM 205 >UniRef50_B3E4P2 Response regulator receiver protein n=1 Tax=Geobacter lovleyi SZ RepID=B3E4P2_GEOLS Length = 281 Score = 151 bits (380), Expect = 1e-34, Method: Composition-based stats. Identities = 51/204 (25%), Positives = 84/204 (41%), Gaps = 4/204 (1%) Query: 486 SLRPLGQILLENQVITEEQLDT--ALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 + + LG+IL+ +++ ++ AL NR E R G + +GLI+ +L+ ALAEQ + Sbjct: 3 AKKLLGEILVNKGILSPLTVERMIALANR-EQKRFGWFLEDKGLITGHELSAALAEQFNM 61 Query: 544 AWE-SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 SI+ + P L++ + AL + + PLR E L++ D D + + G Sbjct: 62 KHLTSIEQYSYPKELLSLITPETALEFNLFPLRQEGSNLLLAVTDPTDMRMAHTIAKNQG 121 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 V +V R Y R+ P+ LT + EI L Sbjct: 122 MTVVPAVVSREAFFAAFCKHYLGRQIQKPKGETVLIADDDKLTREMLKEILVSNGFRVLL 181 Query: 663 FAEILTTLGHINRSAINVLLLRHE 686 A+ + I S V+L E Sbjct: 182 AADGMEAYKEIVASRPQVVLTDKE 205 >UniRef50_Q1Q109 Strongly similar to general secretory system type II protein, ATPase component n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q109_9BACT Length = 582 Score = 151 bits (380), Expect = 1e-34, Method: Composition-based stats. Identities = 49/268 (18%), Positives = 107/268 (39%), Gaps = 23/268 (8%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 R GQ+L EN TE+Q+ AL + G LG ++ ++ Q+ Q L Sbjct: 9 PDAKSDAQRRLFGQLLKENGFATEDQIQEALAVQKQNGGLLGDILISMNYVTDPQIMQVL 68 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE--LIVGSEDGIDPVSLA 595 +E GV +I+ ++P +I +PA++A Y ++P+ E ++ + + + + +L Sbjct: 69 SEYLGVEIVNIEDREVPGDVINLVPAAIAQLYRIIPISYEQEKQVITIAQANALAIETLD 128 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAG----- 650 L + V+ V+ + + L +Y ++ +L + + + +G Sbjct: 129 DLRLVLKLNVKPVLCHKDSVARALEKYYPKKH-ESVEQLLLEFKEDKSYAQSVSGNYIDI 187 Query: 651 ---EIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETL 707 + P + + S I+ +E +GV+ + Sbjct: 188 EELKKMASTAPVKKWVGLMFLYAVLDKASDIH-----YEAFEDSFRVRYRIDGVLYER-- 240 Query: 708 DRVLTIQRELQVSMQS-LLLKAGLNTEQ 734 ++ REL + + S + + AG++ + Sbjct: 241 ---VSPPRELGIPINSRIKVMAGMDISE 265 >UniRef50_B5E8D1 General secretory system II protein E domain protein n=5 Tax=Geobacter RepID=B5E8D1_GEOBB Length = 552 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 36/139 (25%), Positives = 74/139 (53%), Gaps = 1/139 (0%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 +G+IL ++Q+ITE++L AL +V G R+G +++ G+++ E + ALA Q + + + Sbjct: 10 IGEILFKSQIITEQELSAALEEQKVSGCRVGEALVRLGVVAQEDIDWALANQLNIPYVRL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 I + +A++P +A Y++ P+ L EL + D ++ ++ +TR G ++ Sbjct: 70 KKENIDPAAVAKVPGQLARRYSLCPIFLSGSELSIAMADPLNKEAVEEITRVTGCQISIS 129 Query: 609 IVLRGQIVTGLRHWYARRR 627 + L +I Y + Sbjct: 130 VGLIREIREMHDAMYGPDQ 148 >UniRef50_C4Z588 Type IV pilus assembly protein PilB n=9 Tax=Clostridia RepID=C4Z588_EUBE2 Length = 615 Score = 150 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 40/260 (15%), Positives = 97/260 (37%), Gaps = 18/260 (6%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQN 541 + R LG +L++ +I E QL TAL R R +G LG ++ G + + +AL + Sbjct: 2 NYRKKIRLGDVLVKKGIIDENQLQTALSRQREQGKMLGEMVIALGYATQRDINEALCDSL 61 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL---ENDELIVGSEDGIDPVSLAALT 598 G+ + + + +++ + ++ Y ++PL + V D + +++ + Sbjct: 62 GIDFVDMRETDVSEDVLSMLDENIMRKYTLVPLGDAPDNPGAIRVAMADPTNILAMDDIN 121 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYV- 657 G++V V+ I + +++ + + E + + R+ + Sbjct: 122 IVTGKQVVPVLANASDINAFFDKAFGQKQAQSIVDLYKKEQGDVFKEETKEDKARREEIE 181 Query: 658 --PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQR 715 P L ++ S I++ E + +G L ++ Sbjct: 182 NAPIVQLINSVIEQAVRQRASDIHI-----EPMEKSIRVRYRIDG-----NLREIIDYDN 231 Query: 716 E-LQVSMQSLLLKAGLNTEQ 734 L + + +G++ + Sbjct: 232 TLLGAITTRIKIMSGMDISE 251 >UniRef50_A5G3U1 General secretory system II, protein E domain protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5G3U1_GEOUR Length = 378 Score = 148 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 43/176 (24%), Positives = 77/176 (43%), Gaps = 6/176 (3%) Query: 492 QILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 +LL+ +I EQ D AL+NRV G ++G S++ G + + LA+ L+++ V + D Sbjct: 7 DMLLDAGLINREQFDEALKNRVLYGGKIGTSLIELGYVREDDLARFLSKKLAVPFIDADR 66 Query: 551 W-QIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVI 609 IP +I +P ++AL Y V+P+ + L + D D ++ ++ G + V Sbjct: 67 LLTIPPEIIRLIPRNIALTYGVIPIHRDKKRLFLVMSDPADLKAIDEISFITGFIINPVT 126 Query: 610 VLRGQIVTGLRHWY----ARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQF 661 ++V L +Y RR R + + T R P+ Sbjct: 127 APEVRLVQALGKYYDYEVDRRYAQIIRRIEEEKPTAKPTTTVPRPAPARMETPYVP 182 >UniRef50_C8WRT0 Glycosyl transferase family 2 n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WRT0_ALIAD Length = 417 Score = 146 bits (369), Expect = 2e-33, Method: Composition-based stats. Identities = 88/444 (19%), Positives = 151/444 (34%), Gaps = 79/444 (17%) Query: 49 LSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGT 106 LSVY + R R + +K AI++PA NE VIG + + + Y Y + V Sbjct: 29 LSVYGIWHR--RRPITHAPQKRFAIIIPAHNEECVIGPLLD-SLKRQTYPAHLYDVHVI- 84 Query: 107 YPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFI 166 D D AR V K + +L + + + + Sbjct: 85 ----ADNCTDGTAERARAHGAIVHVRENRAEQGKGYAIEWMLARLKEM----GARYDAIV 136 Query: 167 LHDAEDVISPMELRLFNY-LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPV 225 + DA++++ P L + N L +IQ + + + S++ + + Sbjct: 137 MFDADNLVHPDFLAIMNDHLCSGDRVIQGYLDTKN-PFDSWISVSLAISYWFDNRLWQYA 195 Query: 226 REALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVR 285 R L G G C + + + LTED + G R +G+ ++ Sbjct: 196 RARLHLPCTLGGTGLCIDYPLLQEM-------GWKATGLTEDLEFGIRCVRRGIIPVWAH 248 Query: 286 FPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWT 345 V + K P +F+ + RQ+ RW G FQ + H Sbjct: 249 DARVYDEK-----------------------PTSFAASFRQRLRWQQG-HFQCAREH--- 281 Query: 346 SSLTLNYFL--WRDRKG-----AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFS 398 + FL R+R AI F +ML+ +++L L PD S + Sbjct: 282 ---LVPMFLEGLRERNLAKIDMAIYLFQPMRSMLLFAGAMIVLGLHYLSPDPTDAAS--N 336 Query: 399 GSAWLMTLLWLN----------FGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLIN 448 +A ++T LW+ L++ R+ R F LL WG + Sbjct: 337 PAALMVTNLWVAVNVILFLEVPLALLLERVNWRAYFAL-------PLLPFFLWTWGPVTL 389 Query: 449 FMANWRALKQVLQHGDPRRVAWDK 472 R+ + R + D+ Sbjct: 390 QAYFTRSNRTWYHTVHKRAIRLDE 413 >UniRef50_C5SLZ1 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SLZ1_9CAUL Length = 309 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 72/353 (20%), Positives = 107/353 (30%), Gaps = 59/353 (16%) Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LF 182 + V P +K LN L ++DAED+ +P +LR F Sbjct: 1 MRVVRVESGIPLTKPRALNLALYRACG---------DLLAIYDAEDIPAPSQLREAAAAF 51 Query: 183 NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 L +Q P+ P F + + E++ +P P G F Sbjct: 52 ADLPAHIACLQAPLRPAG--SRGFIARQFAAEYAVQFDMLLPALHHFGLTFPLGGTSNHF 109 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 A+ A A+D ++TED D+ +RL G + P Sbjct: 110 RAPALKA------AGAWDAHNVTEDADLAYRLVRLGYGCGLIDAP--------------- 148 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI 362 RE P+ T + Q++RWI G Q H S L A+ Sbjct: 149 ---------TRESPPEDTRTWLPQRTRWIKG-HMQTLLVHT-RSLSDLPVM------TAL 191 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV 422 S + +L H G L L L G +I Sbjct: 192 GLMFSLGLNVFSALFYAPFMALTLCQGLLHLWQPDLGGVGLPDLTLLMCGRGFAQIALDT 251 Query: 423 IFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 L+ L ++WG + AL Q++ H WDKT H Sbjct: 252 GASRAGLKLSLWDRLSLPVYWG--LQSFGALFALYQLMAH----PFHWDKTEH 298 >UniRef50_A9G5Y5 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G5Y5_SORC5 Length = 527 Score = 145 bits (366), Expect = 5e-33, Method: Composition-based stats. Identities = 55/232 (23%), Positives = 92/232 (39%), Gaps = 13/232 (5%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 LGQ+L++ ++IT++ L+ L + G RLG ++ QGLI+ QL Q L+ Q V Sbjct: 4 PRLRLGQLLVDARMITQDALERTLEQQRTDGRRLGTLLVEQGLINETQLTQILSHQLAVP 63 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPL-----RLENDELIVGSEDGIDPVSLAALTR 599 W S+ + L+ +P VA Y ++P+ R + + L V +D + + Sbjct: 64 WVSLLHIEFSRQLLNLVPHDVAERYCLVPIYVRHVRNQGETLYVAMDDPTNEDGMKECMA 123 Query: 600 KVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPH 659 G VR +I I +R +Y G R+ A Q + P Sbjct: 124 FSGLPVRAMIAPPSDIRNAIRVYY----GARARSAPPPAPAKQAQVSEPQSRARPADPPP 179 Query: 660 QFLFAEILTTLGHINRSAINVLLLRH---ERSSLPLGKFLVTEGVISQETLD 708 + T H + + R ++ G LV + + ET D Sbjct: 180 SSELSGAPETARHAEDTPLTPSTPEPLLLTRPAMRSGARLVEDAGPAIETSD 231 >UniRef50_A5GB67 Response regulator receiver protein n=4 Tax=Geobacter RepID=A5GB67_GEOUR Length = 280 Score = 145 bits (365), Expect = 7e-33, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 85/204 (41%), Gaps = 2/204 (0%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 ++ + LG+I + + +ITE+ L+ AL R++ + ++G + +++ E+LA ALA Q G Sbjct: 2 KNRKRLGEIFVASGLITEKTLERALARSKRQNKKVGMVLEEIEMVTGEELASALAVQYGH 61 Query: 544 AWES-IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 S + P L +P VA+ + + PL++EN++L V D + ++ + Sbjct: 62 RVVSNFARYAFPPELFKLIPEDVAMQHLLFPLKIENNKLAVAMADPTETKIVSNIAANND 121 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 + I R I + Y R + + L + + L Sbjct: 122 LTLVPFIATRRDIFAAIARHYLGRDLSASQEKSVLVAEDNKLVYTMLSNVLSKEGYRVIL 181 Query: 663 FAEILTTLGHINRSAINVLLLRHE 686 + + + +V++ E Sbjct: 182 ALDGMDAYKSAIAESPHVIITDKE 205 >UniRef50_Q1AWU6 Type II secretion system protein E n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWU6_RUBXD Length = 572 Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats. Identities = 57/246 (23%), Positives = 95/246 (38%), Gaps = 14/246 (5%) Query: 493 ILLENQVITEEQLDTALRNRVEGLR-LGGSMLMQGLISAEQLAQALAEQNGVAWESIDAW 551 +LL +TEEQL A+ + R LG ++ G +SAE+LA+A A + G+ + Sbjct: 20 LLLSEGSLTEEQLHRAVEAQKHDPRDLGQILVSLGYVSAEELARARARRLGLGYLEPSER 79 Query: 552 QIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVL 611 + + + +P V + LPLRLE L+ D D +L L G V V+ Sbjct: 80 DVDPAALGLVPERVLRRHRALPLRLEEGRLVAALADPTDLQALDDLRMLSGYPVTPVVAT 139 Query: 612 RGQIVTG-LRHWYARRRGHDPRAMLYNAVQHQWLTE-QQAGEIWRQYVPHQFLFAEILTT 669 I ++ + R + + + P L + IL Sbjct: 140 EEAIRRLQIKLFAVDERVTGILREAELREAREEDDDLDLGAGAGAEERPVIRLVSSILQQ 199 Query: 670 LGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS-LLLKA 728 S I++ E L + +G+ L V++I LQ + S L L + Sbjct: 200 AISDGASDIHL-----EPRPGRLAVRVRVDGL-----LREVMSIPHRLQSGVISRLKLVS 249 Query: 729 GLNTEQ 734 GL+ + Sbjct: 250 GLDIAE 255 >UniRef50_Q1D3E0 General secretion pathway protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D3E0_MYXXD Length = 251 Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats. Identities = 41/170 (24%), Positives = 77/170 (45%), Gaps = 6/170 (3%) Query: 489 PLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG++L++ V+ E QL AL + + G +LG ++ L+S + L +AL++Q G+ + Sbjct: 5 KLGELLIKANVLQESQLKAALAEQAKWGGKLGEILVRMSLVSEDILVRALSKQLGMPAVN 64 Query: 548 IDAWQ-IPSSLIAEMPASVALHYAVLPLRLENDE--LIVGSEDGIDPVSLAALTRKVGRK 604 +DA Q + + A++PA A ++VLPL++ +D L+V D ++ L L + Sbjct: 65 LDAVQMVQPHVKAKIPAQTARDFSVLPLQVRDDGKSLVVAMSDPLNVRMLDELRAITKCR 124 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR 654 + + R I Y + H+ N + Sbjct: 125 IIPNVAGRTSIARAFARIY--EQNHELEDADTNFKVVDAQGRTVVKNLKD 172 >UniRef50_A7GMR7 Glycosyl transferase family 2 n=3 Tax=Bacillales RepID=A7GMR7_BACCN Length = 437 Score = 144 bits (363), Expect = 1e-32, Method: Composition-based stats. Identities = 74/482 (15%), Positives = 156/482 (32%), Gaps = 93/482 (19%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 LY + I L + L + ++ +V++ + SY + Sbjct: 17 LYVMIAFLIFLTQFQGVLSLYQVVVSLLGFVKK------------KNSYVLDHDIAHTRF 64 Query: 72 AIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 AI+V A NE VI + + E Y I V +T + V E + H Sbjct: 65 AILVCAHNEEKVIEQIVKNLKKIDYPKEKYDIHVICDNCTDNTAQIVRENQVKAWERH-- 122 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV--ER 188 K L + + + E+ + ++ DA++V+S L++ N + E+ Sbjct: 123 ---DNQKRGKGYGLEWMFQNLFRLEKEQQEVYDAVVILDADNVVSRNFLQVLNAKLVKEK 179 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 +++Q + ++ S +Y + + R L G G CF+ + Sbjct: 180 YEVVQAYLDSKN-PKDNWISKSYAIAYWSTNRLYQLSRGKLGLSAQLGGTGMCFTMNILK 238 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFR-LKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 + + +SLTED + + + KG + + + K Sbjct: 239 EI-------GWGTESLTEDLEFTAKYILAKGRAVGWAHDAKIYDEK-------------- 277 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHK-WTSSLTLNYFLWRDRKGAISNFV 366 P F + RQ+ RW+ G + + + T + + N + Sbjct: 278 ---------PTDFKVSFRQRIRWMQGHMDCMVRYSGPLLKNFTQTFNM---------NAI 319 Query: 367 SFLAMLV------MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNF------GLM 414 LV + ++L + + + ++ ++ W+ L+ ++F L+ Sbjct: 320 DMFIYLVQPTRTMLSVNSIILFFVTYYDLLPSYIMLYVLHPWIWLLIAVSFYILPIIALL 379 Query: 415 VNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 + V +I+ Y + ++ L + W T Sbjct: 380 QEKKVTNIIWTPIVYIFGFSWVPII-------------------FLGFIKRKEKVWVHTP 420 Query: 475 HD 476 H+ Sbjct: 421 HN 422 >UniRef50_A6Q5B1 General secretory pathway protein E n=3 Tax=Epsilonproteobacteria RepID=A6Q5B1_NITSB Length = 560 Score = 144 bits (362), Expect = 1e-32, Method: Composition-based stats. Identities = 48/254 (18%), Positives = 104/254 (40%), Gaps = 19/254 (7%) Query: 485 RSLRPLGQILLENQVITEEQLDTALR-NRVEG--LRLGGSMLMQGLISAEQLAQALAEQN 541 + LG +L++ +ITEEQL+ AL+ + G +LG +L +G ++ + L +AL++Q Sbjct: 3 QQKIRLGDLLVKEGLITEEQLEQALKLQKEYGYTKKLGQILLEEGYVTQKDLLKALSKQL 62 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 + + + +I ++ P + +P + ++D L V + D ++ +L L R + Sbjct: 63 HLEFVDLYGEKIDFEKLSRYPLNTLKAAKAIPFKEDDDYLYVATSDPLNYEALELLERTI 122 Query: 602 GRK-VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 K ++ + I + R + + + + + + + + + Sbjct: 123 AMKPIKLYLAFEDDI----EAIFHRLEILEKTKEIVE--EVKKELKSEGVKKAGEESAVE 176 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L I+ H S I++ E + +GV+ ET L I L Sbjct: 177 RLIYLIIEDSIHRRASDIHI-----EPDEKRSSVRVRVDGVLY-ETFVFDLEIYNALDT- 229 Query: 721 MQSLLLKAGLNTEQ 734 + L G++ + Sbjct: 230 --RIKLLGGMDISE 241 >UniRef50_Q2RY77 Type II secretion system protein E n=2 Tax=Proteobacteria RepID=Q2RY77_RHORT Length = 688 Score = 143 bits (360), Expect = 2e-32, Method: Composition-based stats. Identities = 50/245 (20%), Positives = 89/245 (36%), Gaps = 14/245 (5%) Query: 490 LGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG LL +I+ +Q+ A + + G LG ++ G I+ LA+ LA+ +G Sbjct: 133 LGDQLLAGGLISRDQMRVAHIEQKRSGAPLGQVLVDLGFITDGVLAEVLAQSSGHDRFDA 192 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR--KVR 606 + + +A +P A V P+ L L + D +D V++ + R GR ++ Sbjct: 193 ASTLADPAALAPLPEREARRLRVFPVGLRGSLLRLAMVDPLDVVAIDEVRRHYGRDVRIE 252 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEI 666 V+ +I + Y D +L P L + Sbjct: 253 PVVSSEAEIAQAIDRSYGHVLALD--GVLRELETLGAAGADPESLRDTDSHPVVRLVNAL 310 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLL 726 L S ++ E L L +G +SQ ++ ++ Q L + Sbjct: 311 LLDAVKRRASDLH-----FEPQGAFLRVRLRIDGTLSQ----TLIFHKQYWPAVAQRLKI 361 Query: 727 KAGLN 731 AG+N Sbjct: 362 LAGMN 366 >UniRef50_B9M3F1 General secretory system II protein E domain protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M3F1_GEOSF Length = 383 Score = 142 bits (359), Expect = 3e-32, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 65/135 (48%), Gaps = 2/135 (1%) Query: 492 QILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 +LL +I +EQ + AL+NRV G ++G S++ G + E LA+ L ++ V + D Sbjct: 7 DMLLNAGLINKEQFEEALKNRVLYGGKIGTSLIELGYLKEEDLARFLGKKLAVPFVGADR 66 Query: 551 W-QIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVI 609 I +I +P +AL Y V+P+ + L + D D ++ L+ G + V Sbjct: 67 LLNISPEIIELIPKELALTYGVIPIHRDKKRLYLVMSDPADLKAIDELSFTTGFIINPVA 126 Query: 610 VLRGQIVTGLRHWYA 624 +++ L +Y Sbjct: 127 APELRLMQALGKYYD 141 >UniRef50_C1XM86 Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB n=2 Tax=Thermaceae RepID=C1XM86_MEIRU Length = 547 Score = 142 bits (359), Expect = 4e-32, Method: Composition-based stats. Identities = 49/251 (19%), Positives = 94/251 (37%), Gaps = 22/251 (8%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 LG++L+ +T EQL AL + LG +L +G I +L Q LA+Q Sbjct: 2 EKLKLGELLVRLGKLTPEQLSMALEEQQRRQEPLGQVLLQKGWIRESELYQVLADQQRAP 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 S++A +I + + A VLPLRL+ L V D L L + G++ Sbjct: 62 LISLEAVEIQPEALNLLDRRFAREKQVLPLRLDGARLHVAMAHPADLALLDELRFRTGKE 121 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFA 664 + + +I+ L ++ + + ++ + P L Sbjct: 122 IVPYLASDREILQTLDDLLQPQQSLGAV----------QMEARPGPDLTIEQAPAIELAD 171 Query: 665 EILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQ-ETLDRVLTIQRELQVSMQS 723 E++ S +++ E + + +GV+ + LD+ L + Sbjct: 172 ELVRKALGARASDLHL-----EPQESYVRVRIRVDGVLQEIHRLDKGLEAP-----LVAR 221 Query: 724 LLLKAGLNTEQ 734 + AG++ + Sbjct: 222 FKVLAGMDIAE 232 >UniRef50_Q47AJ5 Type II secretion system protein E:General secretory system II, protein E, N-terminal n=4 Tax=cellular organisms RepID=Q47AJ5_DECAR Length = 568 Score = 142 bits (359), Expect = 4e-32, Method: Composition-based stats. Identities = 52/271 (19%), Positives = 104/271 (38%), Gaps = 21/271 (7%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALA 538 ++ RPLGQIL+ +++E+QL AL + +G ++ G ++ L QAL+ Sbjct: 1 MSTTALQRRPLGQILISEGILSEDQLRIALLEQMKQNQPIGKLLVSLGFVTEATLRQALS 60 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLE--NDELIVGSEDGIDPVSLAA 596 E G + I + +P +A + +LPL + N L + D D V L Sbjct: 61 ENLGKQSIDLSHAVIDPQALKLVPRDLAKRHHLLPLDYDRTNRRLALAISDINDIVGLDR 120 Query: 597 LTRKV--GRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR 654 + ++ G ++ ++ +I + +Y D +L+ + + Sbjct: 121 VRSQLEEGTEIETLLAGESEIDHAIDQYYGHELSID--GILHEIETGEIDWHSLSATDNE 178 Query: 655 QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 P L ILT S I+ E + L +G L ++ + Sbjct: 179 YSQPVVRLIDSILTDAVKREASDIH-----FEPEANFLRIRYRIDG-----MLRQIRALH 228 Query: 715 REL-QVSMQSLLLKAGLNTEQVAQLESENEG 744 + + + +G+N +A++ + +G Sbjct: 229 KSYWPAMTVRIKVLSGMN---IAEMRAPQDG 256 >UniRef50_D0LLW4 General secretory pathway protein E n=2 Tax=Nannocystineae RepID=D0LLW4_HALO1 Length = 610 Score = 142 bits (358), Expect = 4e-32, Method: Composition-based stats. Identities = 55/254 (21%), Positives = 103/254 (40%), Gaps = 14/254 (5%) Query: 486 SLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 LG+IL+ + + E L+ AL R + EG LG ++ + + +AL Q+ + Sbjct: 31 RPLLLGEILMRDAGLRPEHLERALARQQDEGGLLGEILVRLQAVEEAAVMRALGVQHDMP 90 Query: 545 WES--IDAWQIPSSLIAEMPASVALHYAVLPLRLENDE-LIVGSEDGIDPVSLAALTRKV 601 + DA + + LI ++P + A + VLP+R++ DE + V D ++ L ++ + Sbjct: 91 VATELPDAESVDAELIDKIPINFAKTHRVLPIRIDADENVEVLVSDPLEVEVLDDISVLL 150 Query: 602 GRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQ-YVPHQ 660 GR V V+ +IV + Y R RG A + E+ + P Sbjct: 151 GRAVEGVLCPPSRIVDLINKVYGRLRGGAELAEKQDVEDEYGDDEELVDILDLTDEAPII 210 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 +L S I++ E + +GV+ + ++ L Sbjct: 211 RWVNSLLFHAIKERASDIHI-----EPGEKEVMVRYRVDGVLREHKRAH----RQYLPSI 261 Query: 721 MQSLLLKAGLNTEQ 734 + + + AGLN + Sbjct: 262 IARVKIMAGLNIAE 275 >UniRef50_Q0A8B9 Type II secretion system protein E n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0A8B9_ALHEH Length = 587 Score = 142 bits (357), Expect = 6e-32, Method: Composition-based stats. Identities = 57/267 (21%), Positives = 89/267 (33%), Gaps = 15/267 (5%) Query: 474 THDFPS--VTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLIS 529 HD + + +R LG ILLE +I E L AL L+LG ++ I+ Sbjct: 6 EHDLEQALIKARGKQVRRLGGILLERGLIDETTLRAALDTHRAQPHLQLGRWLVEHRHIT 65 Query: 530 AEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI 589 EQL AL EQ G+ + + + +P + L VLPL L+ + Sbjct: 66 REQLEDALCEQLGIPRVDLAGFVAKPEVAGLIPYEMCLRLNVLPLARHRSVLMAATATPT 125 Query: 590 DPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAV--QHQWLTEQ 647 D LA L G V V+ QI + + Y M + + L Sbjct: 126 DEELLANLRFHTGLNVEPVLAPPHQISSAINRSYKSLAIGGEEGMDTLLTTDEDRDLRRD 185 Query: 648 QAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETL 707 Q E P L ++ S I+ L +G I Sbjct: 186 QEIESQASSRPVVRLVNTVILQAISRGASDIH-----FMPRENDLAVMFRIDGAIQ---- 236 Query: 708 DRVLTIQRELQVSMQSLLLKAGLNTEQ 734 L + +L + + + +N + Sbjct: 237 RVRLVDKAQLAAVVARIKILGRMNIAE 263 >UniRef50_B1IIP9 Glycosyl transferase, group 2 family protein n=20 Tax=Clostridium RepID=B1IIP9_CLOBK Length = 424 Score = 141 bits (354), Expect = 1e-31, Method: Composition-based stats. Identities = 62/352 (17%), Positives = 117/352 (33%), Gaps = 54/352 (15%) Query: 35 FIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAE-LAAT 93 FI + Y + + +YR+ D+ A++V A NE VIGN+ E L Sbjct: 21 FISMYYLIISL---FGIYRKKNN-----KNIGDKTKFALIVAAHNEELVIGNIIESLKMM 72 Query: 94 TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP--TSKADCLNNVLDAI 151 D + Y IFV T E A +V R K L + + I Sbjct: 73 DYDKKLYDIFVIADNCTDKTAEIAREKGA-------IVRERFDKKRRGKGYALEWMFNII 125 Query: 152 TQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLIQIPVYPFEREWTHFTSMT 210 + E+ + + DA++++ L+ + + + ++Q + + + + Sbjct: 126 FKMEKK----YDAIAVFDADNLVHKNFLKEMNKKMCKGYKVVQGYLDSKN-PEDTWITGS 180 Query: 211 YIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDI 270 Y F + R L G G C + L + LTED + Sbjct: 181 YSIAFWSCNRMFQLARYNLGLSSQLGGTGFCIDTDILKEL-------GWGATCLTEDLEF 233 Query: 271 GFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRW 330 ++ G + ++ + K P T + RQ+ RW Sbjct: 234 SCKIILNGYKVGWAHDAIIYDEK-----------------------PLTLGQSWRQRKRW 270 Query: 331 IIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLA 382 + G + + F + A+ + F+A+L+ + ++ L Sbjct: 271 MQGFADVSSRYFFKLMKKAIKNFNFTAFDCALYSIQPFVAILLGLSAIIGLF 322 >UniRef50_C7RLS9 Type II secretion system protein E n=3 Tax=Betaproteobacteria RepID=C7RLS9_9PROT Length = 571 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 52/263 (19%), Positives = 97/263 (36%), Gaps = 21/263 (7%) Query: 488 RPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 RPLGQIL+ +++E+QL AL +G ++ G +S L AL+E G Sbjct: 12 RPLGQILISKGILSEDQLRIALLEQMKSNRPIGKLLVTLGFVSEATLRDALSESLGKQSV 71 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRL--ENDELIVGSEDGIDPVSLAALTRKVGRK 604 + I + +P +A + +L L EN L + D D V+L + G + Sbjct: 72 DLSNAIIDPLALKLVPRDLAKRHHLLALDYDAENQRLTLAIADINDIVALDKIRSLAGDE 131 Query: 605 VRY--VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 + ++ +I + +Y D +L + P L Sbjct: 132 IEIDTLLAGETEIDRAIDQYYGHELSID--GILNEIETGEIDFRGLQSSADEYSQPVVRL 189 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL-QVSM 721 ILT + S I+ E + L +G L ++ ++ + Sbjct: 190 IDSILTDAVKHDASDIH-----FEPEASFLRIRYRIDG-----MLRQIRSLHKTYWPAMA 239 Query: 722 QSLLLKAGLNTEQVAQLESENEG 744 + + +G+N +A+ + +G Sbjct: 240 VRIKVLSGMN---IAETRAPQDG 259 >UniRef50_C9RLV9 Type II secretion system protein E n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLV9_FIBSS Length = 641 Score = 140 bits (352), Expect = 2e-31, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 88/235 (37%), Gaps = 11/235 (4%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQL 533 D V + + + +G++LL+ IT++QL+ AL + G RLG ++ I ++L Sbjct: 58 DILGVQMQSVTKKRIGEMLLDQGFITQDQLNEALEKQKTSGGKRLGRVLVDLKFIDEKKL 117 Query: 534 AQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEND---ELIVGSEDGID 590 L Q V + +D ++ + +P ++PL + D L+V D + Sbjct: 118 TDILCCQFEVPYVKLDTIKLDEKVYEFIPEDQCKANKIVPLYVTKDARQALVVAMADPTN 177 Query: 591 PVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWY-ARRRGHDPRAMLYNAV-QHQWLTEQQ 648 ++ KV R V V+ I + + + A L + + T ++ Sbjct: 178 VRLRDSIKFKVKRNVDVVMASEQDIKKTIDTLFAGHGPAEESLAELIGGSGEDELETVER 237 Query: 649 AGEIWRQYVPHQ---FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEG 700 + +I+TTL H + + + E L +G Sbjct: 238 GNGNSDEPELTDEEGRQVVKIVTTLIHEAIAR-HASDIHLEPQETFLKLRYRIDG 291 >UniRef50_A1ALE6 General secretory system II, protein E domain protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ALE6_PELPD Length = 550 Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats. Identities = 35/136 (25%), Positives = 68/136 (50%), Gaps = 1/136 (0%) Query: 490 LGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LGQIL +++I+E + AL + G R G +++ G+ + E + AL+ Q + + + Sbjct: 10 LGQILTASRIISEIDILAALEEQARSGCRFGEALVRLGVATQEDVDWALSSQLDIPYIRL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + IA +PA++A ++ +PL DEL + D ++ ++ AL G +V Sbjct: 70 KRELVDPGAIALVPAAMARRFSCIPLFRAGDELNIAIADPLNRAAIQALELATGLRVSIS 129 Query: 609 IVLRGQIVTGLRHWYA 624 + L +I+ + Y Sbjct: 130 VALLREIMEMVDECYG 145 >UniRef50_A9FI10 Family membership n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FI10_SORC5 Length = 563 Score = 139 bits (350), Expect = 4e-31, Method: Composition-based stats. Identities = 40/147 (27%), Positives = 76/147 (51%), Gaps = 1/147 (0%) Query: 486 SLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + +PLG+ILL+ + +T+ QL+ AL R +G+ L +++ G +S +AL+EQ+GV Sbjct: 6 NKKPLGRILLQQRAVTQPQLEQALLEARAKGVPLATNLIESGTVSEVAALKALSEQSGVP 65 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 ++ I S ++ +P +A + +LP+ + D ++V D + L G++ Sbjct: 66 GIDLNQVCIKLSDLSILPREIAAKHKLLPVLVREDRILVAMAAPADKKVIDELEFVTGKR 125 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDP 631 V I L G +V + Y + +P Sbjct: 126 VFPYIALAGPLVRTIAAAYDMKEQGEP 152 >UniRef50_Q094V5 General secretion protein E N-terminal domain protein (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094V5_STIAU Length = 154 Score = 139 bits (349), Expect = 5e-31, Method: Composition-based stats. Identities = 42/139 (30%), Positives = 72/139 (51%), Gaps = 4/139 (2%) Query: 489 PLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG++L++ V+ E QL AL + + G +LG ++ L+S + L +AL++Q + + Sbjct: 5 KLGELLIKANVLQESQLKAALAEQAKWGGKLGEILVRMSLVSEDILVRALSKQLNIPAVN 64 Query: 548 IDAWQ-IPSSLIAEMPASVALHYAVLPLRLENDE--LIVGSEDGIDPVSLAALTRKVGRK 604 +DA Q IP + A++PA A +AVLPL+L +D L+V D ++ L L + Sbjct: 65 LDAVQMIPPHVRAKVPAQTARDFAVLPLQLRDDGKTLVVAVADPLNVRHLDELRAITRCR 124 Query: 605 VRYVIVLRGQIVTGLRHWY 623 + + R I Y Sbjct: 125 IVPNVAGRTSIARAFARLY 143 >UniRef50_Q01NU4 Type II secretion system protein E (GspE) n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01NU4_SOLUE Length = 566 Score = 138 bits (348), Expect = 7e-31, Method: Composition-based stats. Identities = 58/262 (22%), Positives = 103/262 (39%), Gaps = 19/262 (7%) Query: 478 PSVTGDTRSLR--PLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLA 534 +G +R LG+IL+E I E L+ AL ++E G +LG ++ GLI+ L Sbjct: 10 EPESGTNSGVRYMRLGEILIERGKIDAEDLERALELQLERGDKLGKIVVDMGLIAQRDLL 69 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSL 594 AL++Q GV ++D + I + P+ L + L + D +D ++ Sbjct: 70 SALSDQMGVPLIAVDGTPPNAPEIEGLSQRFLRQCRAFPVALNDSVLTIAMADPMDFETI 129 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR 654 AA+ G +V+ + +I+ + Y D + + Q + + Sbjct: 130 AAVRAFSGLQVQTALASEQEILDAIDRNYGE---SDQKTFIGEGDDEQANADLEHLRDMA 186 Query: 655 QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVI-SQETLDRVLTI 713 P L ++ S I++ E +GV+ +QE Sbjct: 187 SEAPVIRLVNAMIADAIEKRASDIHI-----EPFEKEFRIRFRVDGVLFAQE------NP 235 Query: 714 QRELQVSMQS-LLLKAGLNTEQ 734 REL+ ++ S L L A LN + Sbjct: 236 PRELKAAIISRLKLMAKLNIAE 257 >UniRef50_D2RJA7 Glycosyl transferase family 2 n=10 Tax=Veillonellaceae RepID=D2RJA7_ACIFE Length = 428 Score = 138 bits (348), Expect = 7e-31, Method: Composition-based stats. Identities = 75/469 (15%), Positives = 145/469 (30%), Gaps = 82/469 (17%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 +I + L V++ F + Y+V +S++ PR +++ P A++ Sbjct: 5 FDIIMVPLQVLI-------VFFTIYYFV------ISLFGILPRKKEKKILTPKTT-FAVI 50 Query: 75 VPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR-FPNVHKVVC 132 V A NE VIG + E E Y IFV T + A + ++ Sbjct: 51 VAAHNEEKVIGELVENLHMLRYPDELYDIFVIADNCKDHTAEVARKAGALVYERFNQEEV 110 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDL 191 K L + + +R + + DA++++ P L+ + N + + L Sbjct: 111 ------GKGFALEWMFRQLFALDRQ----YDAVAIFDADNLVHPDFLKEMNNRFCKGERL 160 Query: 192 IQIP--VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTA 249 IQ V W S T+ F ++ + + G G + + Sbjct: 161 IQGYLDVKNPNDSW---VSGTFAINFWIVNHVWHLAKYTIGLSSVFGGTGMVIATEVLKK 217 Query: 250 LLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNM 309 + LTED + + +G+ + + ++ + K Sbjct: 218 -------YGWKATCLTEDMEFTMKCLLEGIPTTWCQDAIIYDEK---------------- 254 Query: 310 ICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD---RKGAISNFV 366 P TF + Q+ RW G F + W L RD G I F Sbjct: 255 -------PQTFKASWNQRKRWAQG-QFDVAGRYMW--KLLKEGIRKRDIVILDGVIDVFQ 304 Query: 367 SFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVT 426 + ++ +L Y + ++ W ++ + + I+ ++ Sbjct: 305 PYFMLISTFFVLCSTIYNFVPFYTNVLYALLPYHVW--QVIGVAQYAIPAIILFKINAAP 362 Query: 427 GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + +L W + +L W T H Sbjct: 363 KSW-FYTLFYPLLLYSWVPI-----------TILGFFHRHEHVWSHTIH 399 >UniRef50_D0LFN0 General secretory system II protein E domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LFN0_HALO1 Length = 244 Score = 138 bits (347), Expect = 8e-31, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 95/243 (39%), Gaps = 18/243 (7%) Query: 486 SLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + + +G+IL++ V+ E L TAL R G LG +++ LI+ E L AL++Q Sbjct: 2 ARKRIGEILVQAGVLDAEGLKTALLEQRRWGGPLGRTLVDLDLITEEALVDALSKQLNFP 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 +D + ++ +P +++ + ++P + E L V + + + L + Sbjct: 62 TVDLDTVDVAPEVLELVPGELSVQHHMMPFKREGKFLDVAMSEPTNLGIIDELRIRTQLN 121 Query: 605 VRYVIVLRGQIVTGLRHWYAR---RRGHDPRAMLYNAVQHQWLTEQQAGEIWR------- 654 VR + I L+ ++ + +PR A Q + + GE Sbjct: 122 VRPYLAGPKMIERALQRYHGDILPDKSDNPRTAT-GAPAAQARSANRRGEADAVQPHTGS 180 Query: 655 ------QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLD 708 QY P E S + LL R E + LV +G+ ++E + Sbjct: 181 NPALRHQYGPRSAQAREEEIRSLQERISQLEALLSRDEDVLRKVLSLLVDKGIATREEIL 240 Query: 709 RVL 711 L Sbjct: 241 ERL 243 >UniRef50_A4J3A4 Type II secretion system protein E n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J3A4_DESRM Length = 554 Score = 137 bits (346), Expect = 1e-30, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 91/246 (36%), Gaps = 22/246 (8%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 +G ILLE I+++QL AL +R LG ++ G ++ +Q++Q L Q + E I Sbjct: 10 IGNILLEKGAISQQQLREALNNHRQTDQPLGQVLVDLGYVTKKQVSQYLDYQEQMEEEVI 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 +I +L+ + Y + PL + ++L V + + +++ L + V Sbjct: 70 HIQEIDKALLKLFSEQILRRYKIFPLFKKGNKLTVAMAEPANVIAIDDLKVISNLDIVPV 129 Query: 609 IVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILT 668 V I + +Y + + + + VP L +I+ Sbjct: 130 EVQEQIIELAIDLYYDITKREVNDKEQRLVITDE------------EEVPIIQLVYQIID 177 Query: 669 TLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLLKA 728 S I++ E + +G++ L L + + + + + Sbjct: 178 RAIDQGASDIHI-----EPQEKRVRIRYRIDGML---ILGMELA-PTLDKAIISRIKIMS 228 Query: 729 GLNTEQ 734 LN + Sbjct: 229 QLNIAE 234 >UniRef50_B8FZG2 Glycosyl transferase family 2 n=4 Tax=Clostridiales RepID=B8FZG2_DESHD Length = 425 Score = 137 bits (345), Expect = 1e-30, Method: Composition-based stats. Identities = 85/487 (17%), Positives = 151/487 (31%), Gaps = 92/487 (18%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 MD++ V + L +I I + +I+ F + + RR +K+ Sbjct: 1 MDFMSGVTGSHL--FNMIMIPVQLIIIFMTFYYFVLSMFGLFRRPDKKV----------- 47 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDE 119 EK A++V A NE VIG + + E Y +FV T Sbjct: 48 ----LEPEKSFALVVAAHNEEAVIGPLVDNLLNLDYPKELYDVFVVADNCTDKTALIAKN 103 Query: 120 VCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL 179 A K L + + + ER + I+ DA+++++ L Sbjct: 104 AGALVHQ-----RFNNEKRGKGYALEWMFHRLFKLER----HYDAVIIFDADNLVNETFL 154 Query: 180 -RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALA-GQVPSAG 237 + + L + ++Q + + + + T+ F + R G Sbjct: 155 VEMNSKLCQGHQIVQCYLDSKN-PYDTWVTNTFSITFWLSNRLLQLARYNTGFLNNVLGG 213 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 G C S + + L + SLTED + + G+ + +V + K Sbjct: 214 TGMCISTKVLKDL-------GWGATSLTEDLEFTMKALISGIKTTWAHDAIVYDEK---- 262 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 P TF A Q+ RW G V + L Y +++ Sbjct: 263 -------------------PLTFIQAWNQRKRWAQGQVDVAGRYF-----FPLIYKAFKE 298 Query: 358 RK-----GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAW----LMTLLW 408 RK A+ F L M+ + + L F + S W +L++ Sbjct: 299 RKLMYFDAAVHLFQPALVMIATFFMFVNLISGLQSSYTQVFNVVMPWSGWQILSAFSLVF 358 Query: 409 LNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRV 468 L + R+ R Y V W ++ L + + Sbjct: 359 PVAALALERLPWRAYAGLILY-------PVFIYSWIPIV-----------FLGFVNRKDK 400 Query: 469 AWDKTTH 475 +W T H Sbjct: 401 SWSHTKH 407 >UniRef50_A5D548 Glycosyltransferases, probably involved in cell wall biogenesis n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D548_PELTS Length = 411 Score = 137 bits (345), Expect = 2e-30, Method: Composition-based stats. Identities = 71/420 (16%), Positives = 117/420 (27%), Gaps = 64/420 (15%) Query: 27 FISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGN 86 I F+ + + I YRR+ E P AI+V A NE VIG Sbjct: 3 AIFYATQVFLTLFTFYHFIISLYGFYRRH-----EECLLPPSSRFAIVVAAHNEEKVIGE 57 Query: 87 MAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLN 145 + E Y ++V D D AR K L Sbjct: 58 LIRNLNELDYPKELYDVYVV-----ADNCTDSTAKIAREKGAVVFERFNKAERGKPYALE 112 Query: 146 NVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY-LVERKDLIQIPVYPFEREWT 204 I + + + + DA++++ L + N L++ + +IQ + T Sbjct: 113 FAFSKIFE----SGIPYDAVCVFDADNLVDTNFLTVMNAHLLKGEKIIQGYLDTKNAGDT 168 Query: 205 HFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSL 264 T Y+ + L G G C S + + + SL Sbjct: 169 WITKSIYVSYILTNRFLQLSKYN-LGLTCALGGTGMCLSVDVLKR-------YGWGMTSL 220 Query: 265 TEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAV 324 TED + + G+ + V + K P T + Sbjct: 221 TEDLEFQTKALLNGIKVTWAHDARVYDEK-----------------------PLTLMQSW 257 Query: 325 RQKSRWIIGIVFQGFKTHKWTSS----------------LTLNYFLWRDRKGAISN-FVS 367 RQ+ RW+ G + L YFL G I+N F+ Sbjct: 258 RQRKRWMQGHTNVAGRYVARLVREGIRTRNFAMIDGAVYLIQPYFLMFTGIGLITNIFMG 317 Query: 368 FLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTG 427 +L L++ + + L++ + V V F Sbjct: 318 PDQILDRPVWLVVGFFAQFFYFGLGLALERVKPVVYWWLIFYPIFALTWIPVAYVGFAMR 377 >UniRef50_C6PVI5 Glycosyl transferase family 2 n=2 Tax=Clostridium RepID=C6PVI5_9CLOT Length = 512 Score = 137 bits (345), Expect = 2e-30, Method: Composition-based stats. Identities = 70/448 (15%), Positives = 141/448 (31%), Gaps = 53/448 (11%) Query: 16 KVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMV 75 + + + + + + + + Y + +S++ Y + EK A++V Sbjct: 10 EFVYNFVICGINVFQVSVIILTMYYLI------ISLFGFYKKEDKEAENCKPEKKFALLV 63 Query: 76 PAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 A NE VIG + + L + Y IFV D D AR + Sbjct: 64 AAHNEEMVIGKIVDSLKELDYPKDLYDIFVI-----ADNCTDKTAEIARKHGGNVYERNV 118 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY-LVERKDLIQ 193 K L + + + + + + DA++++S L NY L++ ++Q Sbjct: 119 SDKRGKGYALEWMFARVFKM----DTKYDAIAIFDADNLVSKNFLNEMNYKLLKGYKVVQ 174 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + + + + + +Y F + R L G G C + L Sbjct: 175 GYIDSKNPDDS-WITQSYSISFWTANRLFQLGRSNLGLSSQIGGTGFCMDTETLKKL--- 230 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 + LTED + +L G + +V + K Sbjct: 231 ----GWGSTCLTEDLEFTCKLVLNGHKVGWAHNAIVYDEK-------------------- 266 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 P T + Q+ RW+ G + + + A+ + + + Sbjct: 267 ---PLTLKQSWNQRKRWMQGFADVFSRFFVRLMKRAVKERSFITLDCALYTMQPYFTLFM 323 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYY---- 429 ++ L D IF + + + M+ I Q +F Sbjct: 324 AASAVISLIKSFSGIDVITLDGIFRDIFYSSQSSYSQYAWMIFSIGQ-FLFTPIVMLLET 382 Query: 430 GLTQGLLSVLRLFWGNLINFMANWRALK 457 L++ + V L+ N + F +R L+ Sbjct: 383 KLSKKMFGVFALYSLNAVVFSEIFRYLR 410 >UniRef50_B2V1W7 Glycosyl transferase, group 2 family protein n=29 Tax=Clostridium RepID=B2V1W7_CLOBA Length = 476 Score = 137 bits (344), Expect = 2e-30, Method: Composition-based stats. Identities = 49/361 (13%), Positives = 116/361 (32%), Gaps = 47/361 (13%) Query: 24 VIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGV 83 + + + F+ ++ + + R+ + +Y K A+++ A NE V Sbjct: 5 IFTITTTIFQIFVFILTLYYMVLGFFGLIRKKEKKNYTP-----NKKFALLIAAHNEEVV 59 Query: 84 IGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKAD 142 IG + E + Y +FV D D ++ V+ K Sbjct: 60 IGKLIESMLNLNYPKDMYDVFVI-----ADNCTDNTAKISKEYGVNVCERFNKDKRGKGY 114 Query: 143 CLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLIQIPVYPFER 201 L + D +++ ++ + + DA++++ L+ + + + + ++Q + Sbjct: 115 ALEWMFDKLSKMKKQ----YDAVAIFDADNLVHKDFLQEINSKMNDGYKVVQGYIDSKN- 169 Query: 202 EWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDV 261 + + +Y F + RE + G G + L + Sbjct: 170 PEDSWIAASYSIAFWTQNRMFQLARENVGFSNQIGGTGFAIETETLKEL-------GWGA 222 Query: 262 QSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFS 321 LTED + +L G + ++ + K P S Sbjct: 223 TCLTEDLEFTCKLVLNGEKVGWAHDAIIYDEK-----------------------PLKLS 259 Query: 322 TAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLL 381 + Q+ RW+ G + + ++ W A+ F+ +++ +L + Sbjct: 260 QSWTQRKRWMQGFTDVASRYFWRLTKKSIKERKWYIFDCALYVLQPFITLMLAASAVLTI 319 Query: 382 A 382 Sbjct: 320 I 320 >UniRef50_A2SK33 Type II secretory pathway ATPase PulE/Tfp pilus assembly pathway ATPase PilB n=3 Tax=Burkholderiales RepID=A2SK33_METPP Length = 841 Score = 137 bits (344), Expect = 2e-30, Method: Composition-based stats. Identities = 45/247 (18%), Positives = 82/247 (33%), Gaps = 22/247 (8%) Query: 477 FPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRV--EGLRLGGSMLMQGLISAEQLA 534 V + +G+ L+ +I +QLD AL + G+ +G ++ G +S + L Sbjct: 230 EAIVQQSRMPMVRIGEALIALGMIDHQQLDEALEQQKVDRGVPIGELLVRLGRVSRQDLQ 289 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSL 594 ALA + G ++ I + + ++P A LPL L + L+V ED +L Sbjct: 290 TALARKMGYPIVDASSFPIEADALRKLPFPTAQRLNALPLLLRDTTLVVAVEDPSKRGAL 349 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQH------------- 641 + KV V+ I L Y+R L + + Sbjct: 350 DEIEFASQCKVAPVLARHHDIQATLHSAYSRIGVEIAGLALDDGPEPEPADAGKLLESLE 409 Query: 642 ----QWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLV 697 + +I + L ++ S I++ L Sbjct: 410 REGADQNARENEKQIEQSDNSLVRLIHTMIIEAYTQGVSDIHIENY---PGREKLKIRFR 466 Query: 698 TEGVISQ 704 +GV+ Sbjct: 467 KDGVLKP 473 >UniRef50_A8ZYV7 Type II secretion system protein E n=11 Tax=Deltaproteobacteria RepID=A8ZYV7_DESOH Length = 577 Score = 137 bits (344), Expect = 2e-30, Method: Composition-based stats. Identities = 56/263 (21%), Positives = 103/263 (39%), Gaps = 24/263 (9%) Query: 484 TRSLRPLGQILLENQVITEEQLD-TALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 R+ + LG++L++ +TEE+L + GL+LG ++ +G++S + ++ Q G Sbjct: 10 KRTRKKLGEMLVDAGYLTEERLTGYVAAQKRSGLKLGQFLIREGVVSESMIVDLVSRQAG 69 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + + + L + +V+ Y +PLR N L+V D +D SL A+ + Sbjct: 70 IQRFDPAEFPVTMELAKSLAETVSRKYGAVPLRRGNHLLLVAMTDPLDIRSLDAIEDECD 129 Query: 603 RKVRYVIVLRGQIVTGLRHWYARR------RGHDPRAMLYNAVQHQWLTEQQAGEIW--- 653 +V VI + Y R G+D + + + + A EI Sbjct: 130 LEVEPVICTEQEFSHLFTQVYGTRIDGFAGEGYD-LTETMDYGEDEEPADAGATEISSLQ 188 Query: 654 --RQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 + P L +L S I++ + L +GV L V Sbjct: 189 HMAEEAPVVRLVNALLAQAVRQGASDIHI-----SPEKRYVQVRLRVDGV-----LHEVP 238 Query: 712 TIQRELQVSMQS-LLLKAGLNTE 733 + L +S+ S L + A L+ Sbjct: 239 APPKTLFLSIVSRLKILANLDIS 261 >UniRef50_A6W755 Type II secretion system protein E n=3 Tax=Actinomycetales RepID=A6W755_KINRD Length = 591 Score = 136 bits (343), Expect = 2e-30, Method: Composition-based stats. Identities = 54/259 (20%), Positives = 103/259 (39%), Gaps = 18/259 (6%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALRNRV--EG--LRLGGSMLMQGLISAEQLAQALAE 539 T R LG +L+E ++ E LD AL + EG RLG ++ G++S +LAQ LAE Sbjct: 27 TPVRRRLGDVLVEKGLLVPEDLDVALAEQRNVEGPRRRLGQILVELGMVSEAELAQCLAE 86 Query: 540 QNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTR 599 + + + ++ +P +VA VL L + L+V + D + ++L + Sbjct: 87 LLQLEHVDLSRLTLAPDVVRLLPRAVAERCRVLVLDKTPEYLLVAAADPTNVLALDDVKL 146 Query: 600 KV-GRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAG--EIWRQY 656 ++ V+ + QI L ++ + + + A Sbjct: 147 YTRTPELHVVVAMDSQIRDQLARAWSLTEDTSQVSRMVQDATEDDDEDPLAALNGSVDDD 206 Query: 657 VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRE 716 P L IL+ + S I++ E L +G+ L V++ + Sbjct: 207 APIVKLVNRILSDAVRLRCSDIHL-----ESQRDQLRVRFRVDGL-----LRDVMSAPKR 256 Query: 717 LQVSMQS-LLLKAGLNTEQ 734 + S+ S + + +GL+ + Sbjct: 257 VAPSVISRIKIISGLDISE 275 >UniRef50_A6BBY4 Msha biogenesis protein mshe n=1 Tax=Vibrio parahaemolyticus AQ3810 RepID=A6BBY4_VIBPA Length = 228 Score = 136 bits (343), Expect = 3e-30, Method: Composition-based stats. Identities = 31/147 (21%), Positives = 69/147 (46%), Gaps = 2/147 (1%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQN 541 + + LG +L+E +++E+Q+ AL R G +LG +++ G I+ +Q+ + L++Q Sbjct: 2 KIQLRKRLGDLLVEEGIVSEDQIQQALSAQRSTGQKLGDALIDLGFITEKQMLEFLSQQL 61 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 G+ + + + + +P A + + D L V D D + +L + Sbjct: 62 GLPLIDLGRAPVDADAVQILPEVHARRLRAMVVARNGDTLRVAMSDPADLFTQESLMNLL 121 Query: 602 G-RKVRYVIVLRGQIVTGLRHWYARRR 627 G + ++I Q+++ +Y R + Sbjct: 122 GEYNLEFIIASERQLISSFDRYYRRTK 148 >UniRef50_D2QZA7 Type II secretion system protein E n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QZA7_9PLAN Length = 596 Score = 136 bits (342), Expect = 3e-30, Method: Composition-based stats. Identities = 43/263 (16%), Positives = 88/263 (33%), Gaps = 29/263 (11%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVEGLR---LGGSMLMQGLISAEQLAQALAEQNG 542 + LG IL+E +T L AL ++ R LG ++ L + +Q+ + LA+ Sbjct: 7 PPQRLGNILIERGYLTVAHLQQALDHQQRAGRGKLLGEILVELSLCTEDQVMECLAQVYC 66 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 V + ++ ++ +P V PL L V + + L + G Sbjct: 67 VPYAKLEQRLSDPRIVELLPREYIEKNLVFPLFRIQQTLTVAVTEPSNLFLLEEIRGLTG 126 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYV----- 657 V+ V I + ++ + ++ TE E + + Sbjct: 127 LTVQIVASSAKDIRRMITTL-----PDSKTFVIEDIIEDNSQTEVTLIESAVEDISDSTE 181 Query: 658 -----PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLT 712 P L ++ S I++ E + + +G L + L Sbjct: 182 CAGQSPVIRLVNYVIYHAVKEGASDIHI-----EPAERCVRVRYRIDGK-----LYKSLE 231 Query: 713 IQRELQVSMQS-LLLKAGLNTEQ 734 + L ++ S + + A L+ + Sbjct: 232 VPLNLLGAVTSRIKIMASLDISE 254 >UniRef50_Q3A899 Type II secretory pathway and PulE/Tfp pilus assembly pathway ATPase PilB n=14 Tax=Bacteria RepID=Q3A899_PELCD Length = 578 Score = 136 bits (341), Expect = 4e-30, Method: Composition-based stats. Identities = 57/236 (24%), Positives = 94/236 (39%), Gaps = 12/236 (5%) Query: 486 SLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 PLG+IL + +I EQL+ L R + EG+RLG + + G ++ LA+ALA Q Sbjct: 6 RREPLGRILCDQGIINAEQLEHLLSRAKAEGVRLGEAGIEAGCLTDRDLARALARQFYFD 65 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + ++ + S L+AE+ + + +PL + L + D L L +G Sbjct: 66 YVDLENFVPDSELLAEISPELLPRFLFMPLHRDVHGLHIAVVDPTAVAELDLLESLLGVP 125 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGH---DPRAMLYNAVQHQWLTEQQA-GEIWRQYVPHQ 660 + VIV ++ L +R L+ + + E + +I P Sbjct: 126 LNLVIVPESRLRKVLEGDEGSKRRLREVSEDFKLHLIKETERGEEVLSLDKIGDDASPII 185 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQET--LDRVLTIQ 714 L L + S I+V E S + +GV+ Q T LD Q Sbjct: 186 RLVDSTLLDGLNRRASDIHV-----ESSQDGVNIKYRVDGVLYQATDLLDSQFQDQ 236 >UniRef50_Q39PZ3 Response regulator receiver domain protein (CheY-like) n=6 Tax=Deltaproteobacteria RepID=Q39PZ3_GEOMG Length = 823 Score = 135 bits (340), Expect = 5e-30, Method: Composition-based stats. Identities = 53/333 (15%), Positives = 122/333 (36%), Gaps = 36/333 (10%) Query: 439 LRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQ 498 LRL + A + K++ + ++ +F + + +S+ LG IL E Sbjct: 147 LRLTVSLALQQYALIQENKKLKEIAKAQQTK----IRNFAGLFDEDKSM--LGSILTEAG 200 Query: 499 VITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLI 558 VI ++ L+ + G L +++ + + ++ + L GV + + I ++ Sbjct: 201 VIRKDDFAAVLQGKKPGELLVDALVRTSVSTEAKILKTLQNHLGVEFIDLREANITPGVV 260 Query: 559 AEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTG 618 +P + + ++P+RL+ + L + D D + ++R +G KV ++ +I Sbjct: 261 RCLPRDMCDRHRLIPVRLDGNRLTIAMADPSDIFKIDNISRVLGLKVMPLLSTSSEIQAQ 320 Query: 619 LRHWY---------ARRRGHDPRAMLY-----NAVQHQWLTEQQAGEI--WRQYVPHQFL 662 L Y D L + V + E+ + P + Sbjct: 321 LARIYGEGAAAIVSGGGEELDEFGELEPLDEIDIVIEDEEADVSVDELIGSSKVPPIIRV 380 Query: 663 FAEILTTLGHINRSAINV------LLLRHERSSLPLGKFLVTEGVISQETLDRV------ 710 +++ S I++ ++R+ L L + I + RV Sbjct: 381 VNAVISEAVRYRASDIHIEPKTKCTVIRYRIDGL-LHGKIRIPSDIHAAVVSRVKILAKM 439 Query: 711 -LTIQRELQVSMQSLLLKAGLNTEQVAQLESEN 742 ++ +R+ Q ++ + +V+ L + N Sbjct: 440 DISERRKPQDGRITVKAGTRIVDMRVSTLPTMN 472 >UniRef50_D1B8K4 Glycosyl transferase family 2 n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B8K4_THEAS Length = 438 Score = 135 bits (340), Expect = 5e-30, Method: Composition-based stats. Identities = 76/468 (16%), Positives = 142/468 (30%), Gaps = 78/468 (16%) Query: 24 VIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGV 83 + + GL D +Y R + P+ S + A+++PA NE V Sbjct: 21 CLYLLFGL--LIADGIYQFVVSFRGWWTPKAPPKAS-------RYRRFAVLIPAHNEARV 71 Query: 84 IGNMAELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKA 141 IG + E + DY Y +FV D D A ++ + K Sbjct: 72 IGPLLE-SLKEQDYPKDCYRVFVSC-----DNCTDHTAQVAALHGAVPLIRTDTTKSGKT 125 Query: 142 DCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERK---DLIQIPVYP 198 + L I E ++ DA+++ S L N +E + +Q + Sbjct: 126 WNVRWALTQIPMDE------VDALVMFDADNLASRDFLSRMNDYMEAHPEAEAVQGVLDV 179 Query: 199 FEREWTHFT-----SMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + T + Y + F +L R G G + + Sbjct: 180 KNPDDNWLTKAYALAYWYTNRFWQL------ARSNWGLSCTLGGTGLVIRSSTLRRI--- 230 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 ++++SLTED ++ RL G + VV + K Sbjct: 231 ----GWNLESLTEDLEMSTRLILSGSRVHWNEHAVVYDEK-------------------- 266 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFL---A 370 P + +VRQ++RW+ G + W + +R R+ + +L A Sbjct: 267 ---PLDYRISVRQRTRWMQGHYWVC-----WRYGMEALKMFFRTRRLQYLDLFLYLLAPA 318 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 + L + + A L W+ F + ++ G Sbjct: 319 KACISLLAMFAGMAYTVINNAILFPTLESKAPTTPLEWMAFVGLPVAMILAHCLFVALVG 378 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVL---QHGDPRRVAWDKTTH 475 + + + ++ + + +L + W KT H Sbjct: 379 PSMHRRRLCLGYVKDVFGYFLFGLSWIPILFKAAFLAKDQGVWVKTEH 426 >UniRef50_A1TUR3 General secretory pathway protein E n=5 Tax=Proteobacteria RepID=A1TUR3_ACIAC Length = 578 Score = 135 bits (339), Expect = 7e-30, Method: Composition-based stats. Identities = 50/264 (18%), Positives = 92/264 (34%), Gaps = 16/264 (6%) Query: 474 THDFPSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQ 532 P+ LG++L+++ ++ L+ AL + G LG ++ GL+S Sbjct: 8 DRTEPTADAVAVQQPLLGELLVQSGKLSARDLERALSAQQEMGGLLGRVLVRLGLVSETD 67 Query: 533 LAQALAEQNGVAWESIDAWQ-IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDP 591 + QAL+ Q G+ S + + + + +P +V PL +E+ L V D Sbjct: 68 VIQALSRQLGIPLISANDFPDLMPEVEGLLPE-FLQANSVYPLSVEDGRLHVAMAVPQDA 126 Query: 592 VSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGE 651 + AL G V + L I L + + + + Sbjct: 127 FVVKALHLATGLSVVPRLALESDIEKALAE--PVEQAGEEEGDDGFGDGADGGDFVEHLK 184 Query: 652 IWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 P L I+ + + S I++ E L +GVI L Sbjct: 185 DLASEAPVIRLVNAIIGRVIDLRASDIHL-----EPFDDGLHVRYRVDGVIQLGEL---- 235 Query: 712 TIQRELQVSMQS-LLLKAGLNTEQ 734 + L ++ S + L A L+ + Sbjct: 236 -VPPRLSAAVSSRVKLLAHLDIAE 258 >UniRef50_Q21L22 Type II secretion system protein E n=4 Tax=Proteobacteria RepID=Q21L22_SACD2 Length = 563 Score = 135 bits (339), Expect = 8e-30, Method: Composition-based stats. Identities = 51/266 (19%), Positives = 102/266 (38%), Gaps = 21/266 (7%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 + + LG +L++ V++ +Q+ A+ R LG + G ++ + LAE G Sbjct: 3 KEKKRLGDMLVDQGVVSPDQVVIAITEQRKTKKPLGQVFIDLGFVTEHVIRDTLAETFGQ 62 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRL--ENDELIVGSEDGIDPVSLAALTRKV 601 + + S +A +P +VA V+P+ + EL + D D + L + + Sbjct: 63 ESIDLSSAVPDSEALAMVPKNVATRNNVVPISFNQRSSELRLAMADVYDVMVLDRIRSIL 122 Query: 602 GRKV--RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPH 659 V ++ +I + L +Y D + W + + G+ + Q P Sbjct: 123 PLDVDIVPLLATETEIRSALDLFYGYELSVDGILKEIETGEVDWHSLEAVGQEYSQ--PL 180 Query: 660 QFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL-Q 718 L +L+ + S I+ E L +GV L +V ++ ++ Sbjct: 181 VRLVDAVLSDAVKRDVSDIH-----FEPERGFLRLRYRIDGV-----LRQVRSLHKDYWS 230 Query: 719 VSMQSLLLKAGLNTEQVAQLESENEG 744 L + A LN +A+ + +G Sbjct: 231 AIAVRLKVMANLN---IAETRTPQDG 253 >UniRef50_A6C2Q8 Type II secretion system protein E n=3 Tax=Planctomycetaceae RepID=A6C2Q8_9PLAN Length = 582 Score = 134 bits (338), Expect = 8e-30, Method: Composition-based stats. Identities = 47/264 (17%), Positives = 98/264 (37%), Gaps = 20/264 (7%) Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEG---LRLGGSMLMQGLISAEQLAQAL 537 + LG +L+ + IT EQL++AL + +G LG ++ + +Q+ + L Sbjct: 4 NSLLQPKMRLGDLLVYKEYITLEQLESALEAQSQGDGSQLLGELLVNNEYCTEDQVLECL 63 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 A + + + +D+ S + +P + VLPL + L V + + + L Sbjct: 64 ALEYRIPYVQLDSRMFDSKIFDILPRDFVEKHTVLPLFKVRNVLTVAVAEPTNVFLVDQL 123 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ-WLTEQQAGEI---- 652 +++ V +I ++ + ++ +A L E+ +I Sbjct: 124 RDLTKTEIQIVAASAREIRRMVQTYMPNTNVFVIDDIIDDANGTNVELIEESIDDIGFDA 183 Query: 653 -WRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 + P L I+ S I++ E + L +GV L + L Sbjct: 184 EFAGQSPIIKLVNYIIYNAVREGASDIHI-----EPTEQQLRVRYRVDGV-----LQQAL 233 Query: 712 TIQRELQVSMQS-LLLKAGLNTEQ 734 L ++ S + + A L+ + Sbjct: 234 EPPVHLAPAVSSRIKIMASLDISE 257 >UniRef50_Q08RH0 Gspii_e N-terminal domain family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08RH0_STIAU Length = 590 Score = 134 bits (338), Expect = 1e-29, Method: Composition-based stats. Identities = 48/164 (29%), Positives = 84/164 (51%), Gaps = 4/164 (2%) Query: 489 PLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG++L++ ++IT + L+ AL ++V G RLG ++L GL+S + LA+ L + +G A S Sbjct: 2 RLGELLIQEKLITRQGLEEALESQVVHGGRLGTNLLELGLLSEKDLARLLGQLHGCAHAS 61 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + + + + A LP+R++ L V + D L AL K G++V Sbjct: 62 GEL-TPEPQALKLVNLNDADKRDYLPMRVDATRLSVAVMNPHDYAMLDALAFKTGKRVVP 120 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGE 651 V+V ++ LR + R RA+ NAV+ ++ +GE Sbjct: 121 VVVPEFRMNQLLRRYCKAFRPL--RAIDMNAVRPSKTLQEASGE 162 >UniRef50_A3CR73 PilB-like pili biogenesis ATPase, putative n=2 Tax=Firmicutes RepID=A3CR73_STRSV Length = 560 Score = 134 bits (337), Expect = 1e-29, Method: Composition-based stats. Identities = 33/249 (13%), Positives = 91/249 (36%), Gaps = 18/249 (7%) Query: 493 ILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAW 551 IL++ +IT Q + L++ ++L ++ +G ++ E + + ++ V ++ + Sbjct: 6 ILVQFNLITAAQKEEILQDMPQSNMQLERYLISKGYVTEEDMLKVMSYYYRVPHVNLSQF 65 Query: 552 QIPSSLIAEMPASVALHYAVLPLRLEND------ELIVGSEDGIDPVSLAALTRKVGRKV 605 I + ++ VA + ++P+ + +L+V D + ++L + V Sbjct: 66 VIEKEAVEKVSEKVAKRHGLIPISFTDGEEGEEPKLVVAMADPSNYIALDDVKIVSKMAV 125 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE 665 + R I + +Y+ +G + + E ++ + P L Sbjct: 126 EPYVTFRDDIEKYIDQYYS--KGEEAQQAATEIEGFNVDEEIVEEDLEIKNAPVVRLIDS 183 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLL 725 I++ S I++ E + +G + + + + Sbjct: 184 IISQAIKTRTSDIHI-----EPFEKVVRVRFRVDGTLVENMQLKA----NAHSAIATRIK 234 Query: 726 LKAGLNTEQ 734 + +GL+ + Sbjct: 235 IMSGLDIAE 243 >UniRef50_C6MU90 Type II secretion system protein E n=1 Tax=Geobacter sp. M18 RepID=C6MU90_9DELT Length = 724 Score = 134 bits (337), Expect = 1e-29, Method: Composition-based stats. Identities = 58/343 (16%), Positives = 110/343 (32%), Gaps = 19/343 (5%) Query: 398 SGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALK 457 + + RI +F G++G + + I A Sbjct: 68 PSGGLFEHVELVTGETFALRIQPDQVFSNGFFGNSATGSAPPERV---FIVSSAVRAREP 124 Query: 458 QVLQHGDPRRVAWDKTTHDFPSVTGDTRSL--RPLGQILLENQVITEEQLDTALRNRVEG 515 + +T + +G IL+ +T EQ+++A R E Sbjct: 125 DLPAPAAEVSPETAQTEGGEAGDLLTHQRAHDSRIGDILVNEGFVTREQVESA-RQAGER 183 Query: 516 LRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLR 575 ++G ++ +GLI+ EQL +ALA + G + + +A + + VLPL+ Sbjct: 184 GKIGSVLIARGLITEEQLLKALASKFGSRFVDLSEVTPTPEALAVLQKQTVVRMQVLPLQ 243 Query: 576 LENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML 635 L+ +L V + + D + L + V+ QI + +Y + + Sbjct: 244 LQGRKLTVATSEPTDLGIMDNLRFITNHHIELVVSGSRQIAAAIDRYYNNDAAPSLDSFI 303 Query: 636 YNAVQHQWLTEQQAGEIWRQYVPH--QFLFAEILTTLGHINRSAINVLLLRHERSSLPLG 693 V Q + + E L IL S ++ L + PL Sbjct: 304 TEMVDDQPVVVEAVEEEENLEPDSKVVALVNRILVEAYQRTVSDVH---LEPKLFKGPLL 360 Query: 694 KFLVTEG--VISQETLDRVLTIQRELQVSMQSLLLKAGLNTEQ 734 +G V+ E + L + A L+ + Sbjct: 361 IRYRIDGECVLCHEI------PPVHKSAIISRLKVMAKLDISE 397 >UniRef50_A1AV36 General secretory system II, protein E domain protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AV36_PELPD Length = 368 Score = 133 bits (335), Expect = 2e-29, Method: Composition-based stats. Identities = 52/248 (20%), Positives = 88/248 (35%), Gaps = 28/248 (11%) Query: 500 ITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE-SIDAWQIPSSL 557 IT QL+ AL ++ G++LG ++ G + L +AL+ + GV + + IP L Sbjct: 15 ITNTQLEEALESQAGRGIKLGSALFELGYVEENALGRALSAKLGVPFVGRSELSSIPGDL 74 Query: 558 IAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVT 617 I + S+A+ Y V+P +LE + L + D D +L + G V+ I +I Sbjct: 75 IRDFSRSMAVKYNVMPFKLERNRLGLAMSDPNDFRALEDIAFMTGCVVQPYIAPDVRISD 134 Query: 618 GLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSA 677 +Y G L + + P Q E Sbjct: 135 AQARYYRISGGESRYRRLADLRRRN-----------SPPCPGQAAMPEEEQRPRQDEAVE 183 Query: 678 INVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLLKAGLNTEQVAQ 737 L + L EG S+ + + S L +T++V Sbjct: 184 YE--------DFSCLNEALAGEGSCSETIARPAVPQRT-------SGKLARAGSTDEVGD 228 Query: 738 LESENEGE 745 L E+ G+ Sbjct: 229 LLIEHMGQ 236 >UniRef50_C9R8Y9 Glycosyl transferase family 2 n=1 Tax=Ammonifex degensii KC4 RepID=C9R8Y9_AMMDK Length = 415 Score = 132 bits (333), Expect = 3e-29, Method: Composition-based stats. Identities = 83/465 (17%), Positives = 135/465 (29%), Gaps = 88/465 (18%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 I L + + GL + + RR++ + P A+++ A Sbjct: 4 ILFILQLALASYGLYHILLSLFSLYRRVEDYSAT--------------PPRHSFAVVIAA 49 Query: 78 WNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 NE VIG + + + E Y +FV T + A V R Sbjct: 50 HNEEKVIGELIKSIFRSDYPRELYEVFVIADNCTDRTAEIARSLGAT-------VIERYN 102 Query: 137 P--TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKD-LIQ 193 P K L I R F F++ DA++++SP L++ N+ + R + +IQ Sbjct: 103 PHERGKGYALEYGFQRIFALPRK----FDAFVILDADNLVSPHFLQVMNHRLARGEKIIQ 158 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + + T + Y+ + R L G G C + + Sbjct: 159 GYLDTKNPDDTWISRSIYVGYLISNRFCQL-ARHNLGLSCALGGTGMCIATEVLKRF--- 214 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 + + +LTED + + G+ + V + K Sbjct: 215 ----GWGMTTLTEDLEFQTKALLCGLRVTWAHDAAVYDEK-------------------- 250 Query: 314 EYFPDTFSTAVRQKSRWIIG---IVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 P T + RQ+ RW+ G + + F W RD K Sbjct: 251 ---PLTLKQSWRQRQRWMQGHCQVAGRYFFRLMWEG------IRTRDFKKIDGALYLLRP 301 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 M+ L L LSIF + L W G + + + Sbjct: 302 YFTMMVGLAAL------------LSIFEFDWSRIDLWWFVKGFSGQYLYMALALLLERAP 349 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 L L W A R AW T H Sbjct: 350 LRAYL-------WLLYYPIFALTWIPITYAGFIYRHRRAWCHTQH 387 >UniRef50_B4UHA9 General secretory system II protein E domain protein n=2 Tax=Anaeromyxobacter RepID=B4UHA9_ANASK Length = 499 Score = 132 bits (332), Expect = 4e-29, Method: Composition-based stats. Identities = 43/160 (26%), Positives = 70/160 (43%), Gaps = 3/160 (1%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 R LG++LLE VI QL +AL ++ + G+RLG +++ L + QAL+ + G Sbjct: 5 GKRRLGELLLEAGVIDATQLQSALGHQRQWGVRLGQALVDLKLAGEADIVQALSRKYGYE 64 Query: 545 WESIDAWQI--PSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 +DA + + +P AL V PL + L V D + + L + G Sbjct: 65 VARLDALEPYALELALRLVPREFALRNNVFPLGADTGTLAVAMSDPTNLAVVDELRFRTG 124 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ 642 RKV+ I +I +R Y + A+ +A Sbjct: 125 RKVKVCIGGDREIAAAVRDRYPHDHAIEAIALDLDADDPP 164 >UniRef50_Q1JZD9 Response regulator receiver protein n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1JZD9_DESAC Length = 287 Score = 131 bits (330), Expect = 7e-29, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 67/171 (39%), Gaps = 2/171 (1%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + G++L++ VI E L AL + G RLG + Q +IS +A LA Q G+ Sbjct: 3 KRKKFGEVLVDEGVIDENILQRALSQQAGTGKRLGQILEEQQVISERDIALVLARQFGLK 62 Query: 545 WE-SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 +I P ++ + + AL + PL++E L + + +D +L L+ G Sbjct: 63 TVKNIADHNFPDKILDLVDSEKALQKLIFPLKVEEKTLYLAMVNPLDMETLDTLSFGTGL 122 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR 654 ++ + +I + Y + + + A I Sbjct: 123 RIVPYLTTTQEIHAAINRHYMKSIQVPAEGKWWRIMLVDTQLPALAASISA 173 >UniRef50_B8JAJ3 General secretory system II protein E domain protein n=1 Tax=Anaeromyxobacter dehalogenans 2CP-1 RepID=B8JAJ3_ANAD2 Length = 506 Score = 131 bits (330), Expect = 9e-29, Method: Composition-based stats. Identities = 43/160 (26%), Positives = 70/160 (43%), Gaps = 3/160 (1%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 R LG++LLE VI QL +AL ++ + G+RLG +++ L + QAL+ + G Sbjct: 5 GKRRLGELLLEAGVIDATQLQSALGHQRQWGVRLGQALVDLKLAGEADIVQALSRKYGYE 64 Query: 545 WESIDAWQI--PSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 +DA + + +P AL V PL + L V D + + L + G Sbjct: 65 VAHLDALEPYALELALRLVPREFALRNNVFPLGADTGTLAVAMSDPTNLAVVDELRFRTG 124 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ 642 RKV+ I +I +R Y + A+ +A Sbjct: 125 RKVKVCIGGDREIAAAVRDRYPHDHAIEAIALDLDADDPP 164 >UniRef50_Q1D5Q5 General secretory system II protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D5Q5_MYXXD Length = 308 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 66/148 (44%), Gaps = 3/148 (2%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQAL 537 R LG+IL++ +++E QL +AL + + G +LG +++ G + + AL Sbjct: 61 PSVSLPGRKRRLGEILMDAGLLSETQLRSALAEQRKWGGKLGLTLVQMGYVDESSMVHAL 120 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLE--NDELIVGSEDGIDPVSLA 595 + Q + ++ + ++ + A +A Y V P+ + L V + D + + Sbjct: 121 SRQLAIPTVDLEQHAASAVVLQALRADIAERYTVFPIAADPATKTLTVATADPTNVEAFQ 180 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWY 623 L G++++ V+ I +R Y Sbjct: 181 ELAFHCGQRLQVVVSSASSIERAIRRHY 208 >UniRef50_C6MLR2 Type II secretion system protein E n=1 Tax=Geobacter sp. M18 RepID=C6MLR2_9DELT Length = 748 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 50/255 (19%), Positives = 96/255 (37%), Gaps = 18/255 (7%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVEGLRL--GGSMLMQGLISAEQLAQALAEQNGVAW 545 +G IL+E+ ++T E ++ A +++ G +L G ++M+GLI+ EQL ALA + + + Sbjct: 178 CRVGDILVESGLVTRELVEAAFKSQK-GKKLQVGELLIMKGLITEEQLLSALATKFRLRF 236 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 ++ + + + +A V P+ LE L+V + D L Sbjct: 237 VDLETVIPSDAALNAISEGLASRLKVFPISLEGRTLVVATCAPTDLTIGDNLRFSTNFAT 296 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQF---- 661 V+ QI + +Y R D L N+++ + T E+ + + Sbjct: 297 ELVVAPSRQIAAAIEKYYRNR--VDTVDTLLNSMKGEAETVTIEEEVDDSRLLFEPDSKI 354 Query: 662 --LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQV 719 L IL + S I+ E + L+ + E + Sbjct: 355 ISLVNRILIDAYNRGASDIH-----FEPGIGT--EPLIIRYRVDGECVSAHKVAASYKGP 407 Query: 720 SMQSLLLKAGLNTEQ 734 + + A LN + Sbjct: 408 IAVRIKIMADLNIAE 422 >UniRef50_Q3A3E8 Putative type IV pilin n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A3E8_PELCD Length = 199 Score = 131 bits (328), Expect = 1e-28, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 78/201 (38%), Gaps = 13/201 (6%) Query: 486 SLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 S LG++LL+ +++E +L AL + + LG S++ G +S + L L + Sbjct: 4 SKLKLGELLLDAGLVSERELKAALCYQKNQRCLLGASLVKLGFLSDDNLLDFLEHSLQLE 63 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLEND----ELIVGSEDGIDPVSLAALTRK 600 +D + ++A +P AL + V P+ L + D + ++ AL Sbjct: 64 RVDLDGFLPVPEVLAYVPEDRALAFTVFPIERCQGHGGPALRMAMADPCNLTAIDALEFM 123 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 G V+ V+ I ++ Y DP A + + + E V Sbjct: 124 TGLNVQPVLASEQSIHAAIQRCYVNNPAPDPCA--------EVTLDSKTSETTMVSVEKF 175 Query: 661 FLFAEILTTLGHINRSAINVL 681 E+L G ++ + +L Sbjct: 176 NKLVELLQAKGLLSLDEVRML 196 >UniRef50_A9BY08 General secretory pathway protein E n=2 Tax=Proteobacteria RepID=A9BY08_DELAS Length = 575 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 58/268 (21%), Positives = 105/268 (39%), Gaps = 22/268 (8%) Query: 473 TTHDFPSVTGDTRSLR-PLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISA 530 T H + G ++ LG+ LL + E L AL+ + E G LG ++ G+++ Sbjct: 4 TNHVMEPIAGVEAPIKGRLGERLLSAGKLNERDLQNALQAQQELGGYLGQVLVQLGVVAE 63 Query: 531 EQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVA---LHYAVLPLRLENDELIVGSED 587 +AQAL+EQ + W + + L+ ++P +A V P+ L++ L V Sbjct: 64 TDVAQALSEQLHMRWLRAEEF---PDLLPDVPGLLASFLDAQCVCPISLQDGVLEVAMSV 120 Query: 588 GIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQ 647 DP AL G +++ V+ L I L G + + +A ++ Sbjct: 121 PQDPFITKALRLATGLQIKPVLALEADIRKALSEAGQEPEGEEGQDWESDATGGDFVEHL 180 Query: 648 QAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETL 707 + P L I++ ++ S I++ E L +GVI L Sbjct: 181 ---KDLASEAPVIRLVTGIISRAIELHASDIHL-----EPFEAGLQVRYRMDGVIHAAEL 232 Query: 708 DRVLTIQRELQVSMQS-LLLKAGLNTEQ 734 + L ++ S + L A L+ + Sbjct: 233 -----VPPRLSAAVGSRVKLLAHLDIAE 255 >UniRef50_C5V6N2 Type II secretion system protein E n=1 Tax=Gallionella ferruginea ES-2 RepID=C5V6N2_9PROT Length = 789 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 46/253 (18%), Positives = 95/253 (37%), Gaps = 12/253 (4%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVEGLR--LGGSMLMQGLISAEQLAQALAEQNGV 543 ++ LG +L+ +IT E L AL + + LG ++ G+++ E + LA++ G+ Sbjct: 209 TVLKLGDMLVNENLITHEVLQEALAKQRTDKKVALGEILINMGVVNTEVVQSMLAKKLGI 268 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + ++ + + S +P + Y++LPL + LIV E+ + L L Sbjct: 269 PFVNVRKFFVEPSTFHLVPINFVTKYSILPLYHTDHSLIVAMENPLLWEPLNELRVITHL 328 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPH-QFL 662 V V+ R IV + + L + E+ E+ + L Sbjct: 329 TVVPVLAAREDIVFVINELRNEKTSRQRIDELAVGMTFDGGGEESGEELVAETDNTLVGL 388 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ 722 +++ S I++ +GVIS + + ++ Sbjct: 389 VNKMMLDAYDQGVSDIHIETY---PDKRNTQVRFRRDGVIS-----KYFEFPPAFKKALI 440 Query: 723 S-LLLKAGLNTEQ 734 S + + A L+ + Sbjct: 441 SRIKIMAKLDISE 453 >UniRef50_Q1D416 General secretory system II protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D416_MYXXD Length = 431 Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats. Identities = 53/219 (24%), Positives = 98/219 (44%), Gaps = 11/219 (5%) Query: 489 PLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG+ LL++ ++T E L+ AL +V G RLG +++ GL+S LA+AL + + A+ S Sbjct: 2 RLGEQLLKDGLVTAEGLEEALEAQVVHGGRLGTNLVELGLLSEVDLAKALGKVHNSAFAS 61 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + + + ++ A LP+R++ L + + D +L A+ K G++V Sbjct: 62 GEMVP-DPKAMELVSSNHADDKEYLPMRVDATRLSIAVVNPHDFSTLDAIAFKTGKRVVP 120 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEIL 667 V++ ++ LR + R RA+ NAV+ + QA P +++ Sbjct: 121 VVIPEFRMNQLLRRYCKAFRPL--RAVDMNAVRPRPSAGSQAELAKAAETPP-----DLM 173 Query: 668 TTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQET 706 + +S L S +G+ + GV E Sbjct: 174 SEEEF--QSVYASALRGGADSDGDMGEEEIITGVEVLEA 210 >UniRef50_Q08MP9 Gspii_e N-terminal domain family (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08MP9_STIAU Length = 625 Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 70/143 (48%), Gaps = 2/143 (1%) Query: 489 PLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG +L+ +IT+ QLD L+ + + G RLG +++ + E+L + L+EQ+ + Sbjct: 4 KLGALLVRKGLITQTQLDEGLKAQMIYGGRLGTNLVELEFLDIEKLGEVLSEQSRYPQAT 63 Query: 548 IDAWQ-IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 I ++ + + +A +PA++A +AV PL LE L V D + AL G ++ Sbjct: 64 IQEFEAVTVATLATVPAALAEKHAVFPLHLEGRRLKVAMASPSDMEHVDALAFATGLRIV 123 Query: 607 YVIVLRGQIVTGLRHWYARRRGH 629 IV ++ L Y R Sbjct: 124 PCIVPELRLYIYLEKRYGIVRPE 146 >UniRef50_B3EAP3 General secretory system II protein E domain protein n=1 Tax=Geobacter lovleyi SZ RepID=B3EAP3_GEOLS Length = 550 Score = 129 bits (325), Expect = 3e-28, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 84/208 (40%), Gaps = 4/208 (1%) Query: 490 LGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG IL +++I+E + AL + R G +++ G+++ E + AL+ Q + + + Sbjct: 10 LGAILYNSRIISEADITAALEEQQRSSSRFGEALVSLGIVTQEDIDWALSNQLDIPYIRL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 I ++ +P + + ++PL DEL + D ++ ++ A G ++ Sbjct: 70 KQEMIDPEALSLLPPHLCRIHQLIPLIRAGDELSIAIADPLNKEAVTAAAEASGCRINLS 129 Query: 609 IVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ--FLFAEI 666 + L +I L Y R D + + L+ A +Q + + Sbjct: 130 VALIREINEMLDLCYGLPR-EDLLGFSSGLLSPEQLSSINADSSGQQLINSLLAYSIQHQ 188 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGK 694 LT+ + + R +S LG+ Sbjct: 189 LTSFSFRPLEDLISISGRSGATSHELGQ 216 >UniRef50_D2QZD9 Type II secretion system protein E n=4 Tax=Planctomycetaceae RepID=D2QZD9_9PLAN Length = 573 Score = 129 bits (324), Expect = 4e-28, Method: Composition-based stats. Identities = 47/254 (18%), Positives = 95/254 (37%), Gaps = 18/254 (7%) Query: 491 GQILLENQVITEEQLDTALRN---RVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 +ILL +I + QLD A +G R + + G +S E +AL ++ G+ + Sbjct: 4 CEILLRRGLIDKRQLDQARGQANGHGDGARQIEAAIQLGFVSEEAALRALGDEVGIEYVD 63 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + +I SL+ P + ++ P+ + +L+V + + D L ++ G V Sbjct: 64 LTEAEIDLSLLKIFPHRLIHRQSLFPISKTDGQLVVATSNPFDLYPLDEVSAATGLAVMP 123 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIW-------RQYVPHQ 660 V+ R +I ++ ++ A + L E Q Sbjct: 124 VLAARAEIAKLIKRHLGVG-SETVEGLVAQAQEEAALELVGDIETDGSELSEMAQEASVV 182 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L EIL + S +++ + S L + +G++ + + I R Sbjct: 183 RLVNEILLEAIELRASDVHI---ESQPSGLAI--RYRVDGMLQSQPIPP--EIHRFEAAI 235 Query: 721 MQSLLLKAGLNTEQ 734 + L + + LN + Sbjct: 236 VSRLKIMSRLNIAE 249 >UniRef50_Q39Q61 General secretory system II, protein E-like n=2 Tax=Geobacter RepID=Q39Q61_GEOMG Length = 540 Score = 129 bits (324), Expect = 4e-28, Method: Composition-based stats. Identities = 40/175 (22%), Positives = 79/175 (45%), Gaps = 1/175 (0%) Query: 490 LGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG IL Q+I+E + AL + G R G +++ G+++ E + AL+ Q + + + Sbjct: 10 LGDILFRCQIISENDIRAALDEQQTTGCRFGEALVKLGVVAQEDIDWALSNQLNIPYVRL 69 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + + +A +PA++A + ++PL + DE+ + D ++ +LAA+ + G V Sbjct: 70 KPTMVDTEAVALIPAALARQHNLIPLIVTGDEISIAIADPLNTAALAAVEKAAGCPVSVS 129 Query: 609 IVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 + L +I +Y D A L+ ++V + LF Sbjct: 130 VGLLREIREMQEIFYGPPEETDILGFESAAFPASVLSAINHDLTGAKFVDYLLLF 184 >UniRef50_B9XRT3 Type II secretion system protein E n=2 Tax=Verrucomicrobia RepID=B9XRT3_9BACT Length = 580 Score = 128 bits (322), Expect = 7e-28, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 92/262 (35%), Gaps = 25/262 (9%) Query: 489 PLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 + L+E+ ++T +Q++ L + EG RL ++ + ++S + + ++ + Sbjct: 11 RIADALVEDGLLTSKQVEELLEQQKKEGTRLLKLVVEKAIVSEQDMTVSMGRVLNTPPIN 70 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + I + +P VA +Y V+P+ ++L + D ++ +++ + R V Sbjct: 71 LSRISIIPEVADLLPREVAHNYKVIPVSRLENKLFLAMADPLNVLAIDDVKRLTKLDVVP 130 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEI--------------- 652 +I I+ L + A + G + + + E+ Sbjct: 131 MIASEKSIIDKLSNIDASKSGSMQDIIDDAKKAAEEEKDPDNIEVSGIAVEDVNLDQLAA 190 Query: 653 WRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLT 712 + P L IL S I++ E + +G LD Sbjct: 191 SSEEAPVIKLANLILVQAIKDRASDIHI-----EPFEKVIRLRYRVDG----ALLDVTPP 241 Query: 713 IQRELQVSMQSLLLKAGLNTEQ 734 ++ L + + L+ + Sbjct: 242 PKQMQLALASRLKIMSSLDIAE 263 >UniRef50_B5YHZ6 Type IV pilin n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YHZ6_THEYD Length = 546 Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats. Identities = 35/136 (25%), Positives = 66/136 (48%), Gaps = 1/136 (0%) Query: 489 PLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG ILLE +++TE++L+ AL ++ G LG + G I++ +LA+ LA Q+G+ + + Sbjct: 2 RLGDILLEKKLLTEQELNIALNVQKITGQVLGKCLTSLGFITSSELAEVLAIQHGLEYIN 61 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 I I L+ P +V LP+ + + + + + V+L + G+K + Sbjct: 62 IREHPIEMGLLKVFPKNVTESARFLPIEETDGIIKIAVTEPSNIVALDKVRTITGKKAKP 121 Query: 608 VIVLRGQIVTGLRHWY 623 + + L Y Sbjct: 122 YLTDEEGFIDILEKAY 137 >UniRef50_A1WFR3 Type II secretion system protein E (GspE) n=8 Tax=cellular organisms RepID=A1WFR3_VEREI Length = 578 Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats. Identities = 56/264 (21%), Positives = 94/264 (35%), Gaps = 18/264 (6%) Query: 474 THDFPSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQ 532 D R +G++L+++ ++ L+ AL + G LG + GL+S Sbjct: 9 PADLQPQDAPLPRPR-IGELLVQSGKLSARDLERALSAQQEMGGLLGRVFVRLGLVSDAD 67 Query: 533 LAQALAEQNGVAWES-IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDP 591 +AQAL+ Q G+ S D + + P +A V PLRLE D+L V D Sbjct: 68 VAQALSAQLGIPLVSEHDFPDLLPEVEGLRPEFLAA-NNVCPLRLEGDQLHVAMAVPQDA 126 Query: 592 VSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGE 651 + AL G + + L I L P+ + + + Sbjct: 127 FVVKALHLATGHAIVPYLALESAIDKALAE--PANAVPQPQDDGFGDGLDGSDFVEHLKD 184 Query: 652 IWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 P L I+ + + S I++ E L +GVI L Sbjct: 185 -LASEAPVIRLVNTIIGRVIDLRASDIHL-----EPFDDGLHVRYRIDGVIHPGEL---- 234 Query: 712 TIQRELQVSMQS-LLLKAGLNTEQ 734 + L ++ S + L A L+ + Sbjct: 235 -VPPRLSAAVNSRVKLLAHLDIAE 257 >UniRef50_Q1CX93 General secretion protein E N-terminal domain protein n=3 Tax=Cystobacterineae RepID=Q1CX93_MYXXD Length = 296 Score = 127 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 33/143 (23%), Positives = 68/143 (47%), Gaps = 5/143 (3%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + + +G++LLE + I+ QL+ L +R G RLG +++ QG I+ LA AL++ G+ Sbjct: 2 ARKRIGELLLEQRAISVAQLEAGLAAHRKSGQRLGATLIAQGAITEATLADALSQALGLP 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN----DELIVGSEDGIDPVSLAALTRK 600 + A + + + A + + P+ LE+ +L+V D ++ ++ + Sbjct: 62 RVDLAATTPEWAAVHMLRARFCEQHDLFPVALESTGGRKQLVVAMSDPLNVTAVEEIEFT 121 Query: 601 VGRKVRYVIVLRGQIVTGLRHWY 623 G KV + + + +Y Sbjct: 122 TGLKVSPRVAPLSTVRGAILRYY 144 >UniRef50_B2KC07 Type II secretion system protein E n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KC07_ELUMP Length = 576 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 75/211 (35%), Gaps = 14/211 (6%) Query: 488 RPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 R L IL+++ +I EQL A L + LG + + G + EQ+ AL++ V + Sbjct: 3 RKLDDILIDSGIINAEQLKKATLYANQNNVSLGDATIKLGFATEEQITIALSKHFSVPYA 62 Query: 547 SIDA-WQIPSS---LIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 S + IP L + A VLPL LE L + D + + + G Sbjct: 63 SKENNILIPEKEQNLQDVVNEKFARENMVLPLFLEEGVLAIAMYDPSNVFLVDNVKMMTG 122 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHD---------PRAMLYNAVQHQWLTEQQAGEIW 653 ++ I + QI+T + +Y + + + + + Sbjct: 123 YDIQPFIASKSQILTAIDVFYGGKDLIEEVVVGNAAVKEEEGDDIEVISVEGKLDLDTLK 182 Query: 654 RQYVPHQFLFAEILTTLGHINRSAINVLLLR 684 + L IL S I++ + Sbjct: 183 GSGSHYIKLVNAILKQAISERTSDIHLEMFD 213 >UniRef50_B1Y3Z9 General secretory system II protein E domain protein n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y3Z9_LEPCP Length = 1017 Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 34/139 (24%), Positives = 59/139 (42%), Gaps = 2/139 (1%) Query: 489 PLGQILLENQVITEEQLDTALRNRVEGL--RLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 LG+ L+ +T++QL AL+ + LG ++ +GL+ EQL ALA + G Sbjct: 255 RLGEALVALGYLTDKQLQEALQLQRTDRVQPLGELLVEKGLVEGEQLRIALARKMGYPVV 314 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + + + +LI +PA A VLPL L+V D + L R + Sbjct: 315 DVAGFPVDPALIPLLPAPAARRLQVLPLMRRGGRLVVAMHDASQQSVIEELQRLTQSHIA 374 Query: 607 YVIVLRGQIVTGLRHWYAR 625 + + + Y++ Sbjct: 375 PTLAGGTGLAEAIERAYSQ 393 >UniRef50_C5ESB9 Type II secretion system protein E:General secretory system II n=3 Tax=Clostridiales RepID=C5ESB9_9FIRM Length = 570 Score = 123 bits (309), Expect = 2e-26, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 96/252 (38%), Gaps = 18/252 (7%) Query: 492 QILLENQVITEEQLDTAL--RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESID 549 ++L+E++ +T QL+ A+ + + G ++ G ++ + + + A ++ + +D Sbjct: 9 ELLVEHRYLTRSQLERAMYFKEQEPGKTAEQILMDLGYVTEDAVMECAATRDNLQVTDLD 68 Query: 550 AWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVI 609 +++I + + A H ++P+ E L+V + ID + G +V+ V+ Sbjct: 69 SYKIDLKAADMVSPAFAGHNRIIPIDFEEGRLVVAASYPIDMDVIDETATLTGMEVKVVL 128 Query: 610 VLRGQIVTGLRHWYARRRGH-----DPRAMLYNAVQHQWLTEQQ--AGEIWRQYVPHQFL 662 + + Y G +P + A Q + + P + Sbjct: 129 GTSAAVGRAIDRTYENTAGGNTGLMNPASGTPGAGIMDTRNRQMELVLKERVEGAPVVRM 188 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ 722 I+ + N S I+V E + L + G + + L ++ + Sbjct: 189 VNAIIENAYNRNASDIHV-----EPGNHELVIRMRINGDL---IVHTTLEMEYHRP-MIT 239 Query: 723 SLLLKAGLNTEQ 734 L L AG++ + Sbjct: 240 RLKLMAGMDIAE 251 >UniRef50_A3ZX01 General secretion pathway protein E n=3 Tax=Planctomycetaceae RepID=A3ZX01_9PLAN Length = 575 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 89/253 (35%), Gaps = 17/253 (6%) Query: 491 GQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 G+IL+ + +I+ EQL+ R + + G + E +AL + G+ + + Sbjct: 5 GEILVRHGLISAEQLEIVRREQKLPGDAIERAVELGFVDEEDALKALGVEVGLDFVDLTT 64 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIV 610 I SL+ +P + ++ PL+ N +IV + D D L + G V V+ Sbjct: 65 ADIDLSLLKTLPQRLIYRQSLFPLQRRNGSVIVATSDPFDLYPLDEVAAVTGLSVVPVLA 124 Query: 611 LRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIW---------RQYVPHQF 661 R +I ++ G +L + + +I Q Sbjct: 125 SRVEIAKLIKANLGVG-GETVEGLLALKEEDAGSDIELLDDIESDGSELSEMAQEASVVR 183 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM 721 L EI+ S +++ E L +G++ + + I + Sbjct: 184 LVNEIMFEAIESRASDVHI-----ESQGSGLVVRYRIDGMLHSQPVPP--EINYFQAAII 236 Query: 722 QSLLLKAGLNTEQ 734 L + + LN + Sbjct: 237 SRLKIMSRLNIAE 249 >UniRef50_Q2JEE3 Glycosyltransferases probably involved in cell wall biogenesis-like n=2 Tax=Frankia RepID=Q2JEE3_FRASC Length = 644 Score = 122 bits (306), Expect = 4e-26, Method: Composition-based stats. Identities = 91/494 (18%), Positives = 150/494 (30%), Gaps = 89/494 (18%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL--YKPDEKPL 71 + + L I L + + VR +R R+ + +L Sbjct: 190 AAEHHHVVLLAAWTIVSLLMLLVASLTLVRGQYSW----QRPERVHHVDLTGNLAPRNRF 245 Query: 72 AIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPN--DPDTQRDVDEVCARFPNVH 128 +++VPA +E V+G + + V + D +T+R +E+ R NV Sbjct: 246 SLIVPARDEP-VLGRTLTQILAGDYPGDLVELVVMVSYDEVDQETRRVAEEIAGRHSNVR 304 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER 188 VV SK L + T + DAE +++ LR N L R Sbjct: 305 -VVAPEGSRRSKPLSLEDARRHCTG---------DLVGVVDAESLLAGGLLRYVNTLALR 354 Query: 189 KDLI---QIPVYPFE---------------REWTHF----TSMTY---IDEFSELHGKDV 223 + Q V R + H+ TS E+ + Sbjct: 355 HADVGIFQGGVQLMNARATAWRRAADHSPIRAFLHWLDAGTSWWRARNCLEYYIWFMSRL 414 Query: 224 PVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIF 283 +A A +P G R + L +DV LTED D+G R G+ Sbjct: 415 RF-QARARFIPLGGNTVFIRRTVLERL------GGWDVSCLTEDCDLGVRASAAGIPTAV 467 Query: 284 VRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHK 343 P + RE P++ + + Q++RW++G Q Sbjct: 468 FYHP---------------------DLTTREETPESLTKLIIQRTRWMMGF-MQVLFKGD 505 Query: 344 WTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWL 403 W + + V L M L +L SL + + Sbjct: 506 WRALPGARQRIM---------AVEMLTMPFFQALAGVLLPVSLVLTLFLAAPTGLVIVFW 556 Query: 404 MTLLWLNFGLMVNRIVQRVIFVTGYYGLT--QGLLSVLRLFWGNLINFMANWRALKQVLQ 461 + + + R L + VL L A RA ++L+ Sbjct: 557 LPFGATVMTVFSEQAAFREFAEAYGLDLRRWDSVRLVLCAPLYQLALSAAAVRATARLLR 616 Query: 462 HGDPRRVAWDKTTH 475 RV W+KT+H Sbjct: 617 ----GRVEWEKTSH 626 >UniRef50_Q1Q644 Strongly similar to general secretion pathway protein E n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q644_9BACT Length = 557 Score = 122 bits (306), Expect = 5e-26, Method: Composition-based stats. Identities = 54/248 (21%), Positives = 88/248 (35%), Gaps = 20/248 (8%) Query: 491 GQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESID 549 G++LLE I + L+ A + G +LG ++ G +S E L A + + Sbjct: 10 GEVLLEIGKINRQDLERAFEAQKQTGQKLGRILIDLGTVSEEDLRLAYSRWLEIPVWEKK 69 Query: 550 AWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVI 609 ++ +P VLPL L+ + L + D D + + A+ GR+VR Sbjct: 70 KTDTYP-MLENVPKVFLTTNRVLPLSLDENVLDIALADPQDTLLIEAIALSTGREVRVFA 128 Query: 610 VLRGQIVTGLRHWY--ARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEIL 667 I + L Y D A ++ A E P L IL Sbjct: 129 GTERDISSSLEKLYETGVSEEEDAMASSVEMMEDIEQLRDMASE-----APVIRLVNSIL 183 Query: 668 TTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS-LLL 726 T + S I++ + +GV L + REL S+ S + + Sbjct: 184 TKAIEVGASDIHIEVFERNTR-----LRYRVDGV-----LGELAPPPRELYNSIVSRIKI 233 Query: 727 KAGLNTEQ 734 A LN + Sbjct: 234 MAKLNIAE 241 >UniRef50_Q08Q27 Serine/threonine kinase PKN11 n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08Q27_STIAU Length = 990 Score = 122 bits (305), Expect = 6e-26, Method: Composition-based stats. Identities = 38/144 (26%), Positives = 67/144 (46%), Gaps = 5/144 (3%) Query: 485 RSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 R+ R +G IL+ ++ E L+ AL + G +LG ++ + L+ AE+L +AL+EQ+G+ Sbjct: 376 RAGRRIGDILVARGMLPPEALEQALTLQKRLGGKLGQVLVGERLLEAEELVRALSEQSGM 435 Query: 544 AWES---IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 S + +P+ LI +P + +P+ N EL + D + AL Sbjct: 436 PHISGERLQTMPVPAELIRLLPMEMCEKLCAVPVAQRNRELYCAVLEPRDLKVMDALKFA 495 Query: 601 VG-RKVRYVIVLRGQIVTGLRHWY 623 G V + I +R +Y Sbjct: 496 TGTISVHGLFATESAIRRAIRRFY 519 >UniRef50_Q1D888 General secretory system II protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D888_MYXXD Length = 2136 Score = 121 bits (304), Expect = 8e-26, Method: Composition-based stats. Identities = 35/141 (24%), Positives = 61/141 (43%), Gaps = 2/141 (1%) Query: 489 PLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE- 546 LG IL+ +IT+ QLD ALR + + G RLG +++ ++ + LA L E Sbjct: 4 KLGAILVRKGLITQAQLDEALRAQLIYGGRLGSNLVELDILDIDTLAMVLGEMCRYPVAQ 63 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 D P +++ +PA++A + PL E L V ++ AL G ++ Sbjct: 64 EADFVAAPDAVLQLLPAAMAEKHQAFPLEQEGRRLKVAMASPLEIEHADALGFITGLRIV 123 Query: 607 YVIVLRGQIVTGLRHWYARRR 627 + ++ Y +R Sbjct: 124 PYVTPELRLFQFQELRYGIKR 144 >UniRef50_D2MLP0 Glycosyltransferase, group 2 family protein n=1 Tax=Bulleidia extructa W1219 RepID=D2MLP0_9FIRM Length = 428 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 70/404 (17%), Positives = 129/404 (31%), Gaps = 52/404 (12%) Query: 33 DFFIDVVYWVRRI-KRKLSVYRRYPRMSYRELYKPDEKPL---AIMVPAWNETGVIGNMA 88 + F DV++ V + Y Y Y+ P+ K L A+ + A NE VIG + Sbjct: 8 ELFTDVIFVVMTVAYLYQFFYIIYSIFKYKVPVMPEAKRLHRYAVFISARNERNVIGELL 67 Query: 89 ELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNN 146 + + T DY Y I+V D D AR + K LN Sbjct: 68 D-SLTNQDYPRDKYDIYV-----TADNCTDDTAQVARDHGAYAFERFNDEKKGKGYALNE 121 Query: 147 VLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE--RKDLIQIPVYPFEREWT 204 + + + + ++ DA++++ L+ N + + D + Sbjct: 122 MYHQVIALKGQGYYE--AVVVFDADNIVDAQFLKEMNKTFDTGKYDALTTYRNSKNFGQN 179 Query: 205 HFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSL 264 T+ Y F R L Q +G G S + + + L Sbjct: 180 WLTA-AYSLWFMHEARHLNYARMMLGAQCMISGTGFVVSTKLMDI------NEGWPYYLL 232 Query: 265 TEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAV 324 TED +G + ++ + + P T+ A Sbjct: 233 TEDIQFSVASTLQGFHIGYCDTAILYDEQ-----------------------PATWKQAW 269 Query: 325 RQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA---ISNFVSFLAMLVMIQLLLLL 381 RQ+ RW G + + L ++R+ A I V ++L + ++L L Sbjct: 270 RQRLRWAKGFYQIDGR---YLGPLASGVVKGKNRRLAFYDILMTVLPSSLLTVALIILAL 326 Query: 382 AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 ++++I + L L+ L G + + + Sbjct: 327 WVLVSSSVMPYYVAIVFQNEMLWYLIKLIGGSWIGLTLMAFVTT 370 >UniRef50_C0A4E0 Type II secretion system protein E n=2 Tax=Chlamydiae/Verrucomicrobia group RepID=C0A4E0_9BACT Length = 574 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 48/257 (18%), Positives = 98/257 (38%), Gaps = 25/257 (9%) Query: 492 QILLENQVITEEQLDTALRNRVEGLRLG-------GSMLMQGLISAEQLAQALAEQNGVA 544 Q+ LE ++T+EQ+DTA E L ++ + I+ + +++ LA++ G+ Sbjct: 10 QLALEKGLLTQEQIDTARAIVAEHTDLTQAPPKPLEVLIREHQITPQIISKMLADEFGMP 69 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + + L+ + ++A + V P+ + L V D +D ++ LT + Sbjct: 70 TVDLHTVRPAPELLKTLNRTLANRFKVFPIEQQGQTLKVAISDPLDVDTIDNLTHLLQLT 129 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQW----LTEQQAGEIWRQYV--P 658 V + I + +Y + A+L + LT AG + V P Sbjct: 130 VDPAVAPLADIEQCIERYYG-KEAESLDALLQDFSATDESQISLTTPAAGTAAGEDVDAP 188 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 L + + S I++ E +GV L V + LQ Sbjct: 189 IIKLVYQTILEAIQRRASDIHL-----EPMEKRFRVRYRIDGV-----LIEVDGPPKRLQ 238 Query: 719 VSMQS-LLLKAGLNTEQ 734 +++ S + + A ++ + Sbjct: 239 LAVISRVKIMANISIAE 255 >UniRef50_B8F8X7 Response regulator receiver protein n=3 Tax=Proteobacteria RepID=B8F8X7_DESAA Length = 804 Score = 120 bits (300), Expect = 2e-25, Method: Composition-based stats. Identities = 36/281 (12%), Positives = 105/281 (37%), Gaps = 25/281 (8%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVEGLR-LGGSMLMQGLISAEQLAQALAEQNGVA 544 + +G++L++ ++T+E L+ A + + + L ++ L + + + E+ + Sbjct: 178 NRSQIGRMLVKRNLLTKEDLEKAQQVQARSDKILPAILMEMELADEKTIMDVMEEELKIN 237 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + + + L + +P+ + + ++PL+ + +L+ D D V + L G Sbjct: 238 RVNPAEFTASAPLASLIPSEICEKHLLVPLKRMDGQLVTAMADPTDLVKIDELRFLTGMP 297 Query: 605 VRYVIVLRGQIVTGLRHWYAR----------RRGHDPRAMLYNAVQHQWLTEQQAGEIWR 654 ++ + I ++ Y DP + ++ + + + Sbjct: 298 IKPALATHEDIRKKVQELYGGESALNSVISEIELMDPTETIEIILEEDDVVDVDELLKSK 357 Query: 655 QYVPHQFLFAEILTTLGHINRSAINV------LLLRHERSSLPLGKFLVTEGVISQETLD 708 P + I++ S +++ L++R+ L L + + + + Sbjct: 358 DQPPAIRIVNSIISDALRHGASDVHIEPKTKYLMVRYRIDDL-LQEKIRIPMAMHPPIVS 416 Query: 709 RV-------LTIQRELQVSMQSLLLKAGLNTEQVAQLESEN 742 R+ +T +R+ Q ++ +++ L + N Sbjct: 417 RIKVMSELDITERRKPQDGRVTVKASTKTVDMRISSLPTVN 457 >UniRef50_Q7UEJ7 Probable general secretion pathway protein E n=1 Tax=Rhodopirellula baltica RepID=Q7UEJ7_RHOBA Length = 1283 Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 83/215 (38%), Gaps = 7/215 (3%) Query: 471 DKTTHDFPSVTGDTRSLRPLGQIL---LENQVITEEQLDTALRNRVE-GLRLGGSMLMQG 526 +++ D TR+ + L ++ ++I+E Q D + E G + G Sbjct: 508 EQSERDPYWAHFSTRTPPIDDETLSKFVDRELISESQADHVMEAASECGKPYFTLLQDYG 567 Query: 527 LISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLE-NDELIVGS 585 + + +++ALAE G + +D I ++I P S+A V+P+R + + L+ Sbjct: 568 YAADDDMSRALAEIYGYQFVDLDNLSINEAIIELCPESIARENTVIPIREDFDGNLVFAM 627 Query: 586 EDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLT 645 + ID ++ L + R + V+ IV + H+Y + G +ML Sbjct: 628 SNPIDLETIEKLRFILNRHIETVLATPDAIVEAINHFYGQIEGESADSMLQEFTDSAIDF 687 Query: 646 EQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINV 680 + VP+ + + ++ Sbjct: 688 TETMDAGCESVVPNPVAAND--SETALSPPEQLDA 720 >UniRef50_B2A563 Type II secretion system protein E n=5 Tax=cellular organisms RepID=B2A563_NATTJ Length = 561 Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats. Identities = 31/248 (12%), Positives = 95/248 (38%), Gaps = 18/248 (7%) Query: 494 LLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 +++ ++++ QL+ AL + G RL ++ IS Q L ++ G+ + + Sbjct: 1 MIDYNILSKAQLNEALIVQKRTGNRLSSIVIDLQFISENAWVQLLEKKLGLPRIELSDYC 60 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLR 612 + + +P + + P +L + + + + + ++ + + + G++V + Sbjct: 61 VENDTAYILPFHIVEQNRIFPFKLNQNNVAIATSEPLNILIIDEIKLITGKEVEIWLATP 120 Query: 613 GQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQY---VPHQFLFAEILTT 669 G+I ++ ++ + + L + + + + + + P + ++ Sbjct: 121 GEIDREIKRFFDIKEVVEKEVQLISKESVTNIIDNKNSDQNLNFENEAPAIKIIDTVIKL 180 Query: 670 LGHINRSAINVLLLRHERSSLPLGKFLVTEGVI---------SQETLDRVLTIQRELQVS 720 S I+ E +S + +G++ +QE L + I + ++ Sbjct: 181 AIDQEASDIH-----FEPTSSDMLVRFRVDGMMRQITCFPKHTQELLISRIKIMTNMDIT 235 Query: 721 MQSLLLKA 728 ++ L Sbjct: 236 VKRLPQDG 243 >UniRef50_Q2IEU7 General secretory system II, protein E-like n=3 Tax=Anaeromyxobacter RepID=Q2IEU7_ANADE Length = 182 Score = 119 bits (298), Expect = 4e-25, Method: Composition-based stats. Identities = 31/141 (21%), Positives = 61/141 (43%), Gaps = 5/141 (3%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + +G++L+ I QL++AL ++ G RLG S++ G + + A+ Q GV Sbjct: 2 ARMRIGELLVAQGAIDAVQLESALAHQRRWGGRLGRSIVSLGFLGEPIVLGAVGAQLGVP 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRL----ENDELIVGSEDGIDPVSLAALTRK 600 + + +P +++ +P + VLPL ++V D D L +T Sbjct: 62 FMELGDRHVPPAVLRLLPEKLIRTRKVLPLSRVNEPRGAAVVVALADPADLGVLDEITFA 121 Query: 601 VGRKVRYVIVLRGQIVTGLRH 621 G +V+ V+ + + Sbjct: 122 TGLRVKPVLAAEDDLEQAIAR 142 >UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococcaceae RepID=Q9UY40_PYRAB Length = 447 Score = 119 bits (298), Expect = 4e-25, Method: Composition-based stats. Identities = 88/489 (17%), Positives = 158/489 (32%), Gaps = 72/489 (14%) Query: 8 FATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKL------SVYRRYPRMSYR 61 F + LY +I I LA+++ + + +++ + S+ +RYP Sbjct: 8 FQSALYLYILIIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTLLMASLGKRYPYDETG 67 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 + E + +++PA NE VI + DY N + + + T+ ++E+ Sbjct: 68 FNLEFLEPLVYVLIPAHNEERVIYKTVR-SVLGQDYRNMKVILINDNSTDRTRDIMEEIN 126 Query: 122 ARFPNVHKVV-CARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 ++P ++ SK LN L+ I ++ N+ F + DA+ +I P L+ Sbjct: 127 RKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVF----ILDADYLIPPNALK 182 Query: 181 ----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 + + IQ V P T ++ + V A+ G + Sbjct: 183 TLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLE-------RLVGFNVAIEGDMKLN 235 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G A+ F S+TED D+ R G + Sbjct: 236 ENGKYGGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYH----------- 284 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 + E +T ++Q+SRW G Q H W R Sbjct: 285 ------------GVIGWEEAVETLRDYIKQRSRWAQG-HLQVMIDHYWPVM--------R 323 Query: 357 DRKGAISNFVS---FLAMLVMIQLLLLLAYESL----WPDAWHFLSI---FSGSAWLMTL 406 I +F+ ++ LV + L + S F S S + L Sbjct: 324 SCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITGAPPLSFARPKLFLSVSIFTFLL 383 Query: 407 LWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPR 466 W + +R Y + L L + + R L ++L Sbjct: 384 FWFSVAYSNWVEKKRH---NYYVPWSFVALYPLYFMVFVIAGVIYTMRGLIRLL----VG 436 Query: 467 RVAWDKTTH 475 R+ W+KT Sbjct: 437 RLHWEKTKR 445 >UniRef50_C4ZLP1 Type II secretion system protein E n=3 Tax=Betaproteobacteria RepID=C4ZLP1_THASP Length = 571 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 57/266 (21%), Positives = 105/266 (39%), Gaps = 25/266 (9%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 + P+GQIL+ +I E+QL AL R LG ++ G +S L +ALA ++G+ Sbjct: 2 NAPLPIGQILIAAGLIGEDQLRIALHEQRGRARPLGRVLVELGFVSEAALREALAARSGL 61 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN--DELIVGSEDGIDPVSLAALTRKV 601 + + IA +P ++A + +LPL+ + LIV D D V+L L ++ Sbjct: 62 PCVDLASALADPDAIARVPQALARRHRLLPLQYDAARHRLIVAMADAHDIVALDRLRAEL 121 Query: 602 GRK--VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPH 659 G V + ++ + Y + + + + AG Q V Sbjct: 122 GPDVHVELRLAGDNELGRAIEQHYGQASSIE----DMVRELERRAGQPIAGARDPQLV-- 175 Query: 660 QFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL-Q 718 L +L S +++ E + L +G TL +V + + Sbjct: 176 VRLVDALLAEAAARGASDLHL-----EPEAGFLRVRHRIDG-----TLRQVRAMHKSCWA 225 Query: 719 VSMQSLLLKAGLNTEQVAQLESENEG 744 + + AG++ +A+ S +G Sbjct: 226 ELAVRIKVLAGMD---IAESRSPQDG 248 >UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolobaceae RepID=A4YD58_METS5 Length = 395 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 76/469 (16%), Positives = 146/469 (31%), Gaps = 94/469 (20%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 L I I L++I+ I + + ++Y + + +++ Sbjct: 2 LDDIVIGLSIIVSIWSVYN-------------SAFAIYGLSWKSDEPKTSSGPS--FSLL 46 Query: 75 VPAWNETGVIGNMAELAATT-LDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 VP NE V+G + E D Y I V + +T ++ + + V Sbjct: 47 VPVRNEEKVLGRLLERLVNQEYDRSKYEIIVLEDGSTDNTLGVCNKFSEMYSIIKCVHLE 106 Query: 134 RPGPTS-KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV---ERK 189 + + K+ LN L + DA+ V L R Sbjct: 107 KSNVVNGKSRALNYGLKI---------SRGDIIGVFDADTVPRLDVLGYVAQKFISNSRV 157 Query: 190 DLIQIPVYPFEREWTHFTSMTYIDE-FSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 +Q + P + + ++E FSE + R VP G + R A+ Sbjct: 158 GGVQGRLVPINVRESIVARLASLEELFSEYS---ISGRARAGLFVPLEGTCSFVRRDALE 214 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 + ++ LTED D+ +L ++ Sbjct: 215 KV------GGWNENVLTEDLDLSLKLTSLNYLIVYSPS---------------------- 246 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSF 368 + P TFS+ VRQ+ RW G F+ F WR + Sbjct: 247 -VQSWREVPVTFSSLVRQRLRWYRG-NFELTMRISRFK------FTWR---------LVD 289 Query: 369 LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN--FGLMVNRIVQRVIFVT 426 AMLV + ++L+ + + F+ + + ++ + L++ ++ R Sbjct: 290 AAMLVGTPVFMVLSLANY---SLVFIYSYQLHVLIAAIISFSSMMTLLLIIMISR----- 341 Query: 427 GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + ++ + +NF + + VL+ W KT Sbjct: 342 -----RHMIETIYIILSALYLNFTISLHLISIVLELA-GAPKGWSKTER 384 >UniRef50_D0LM65 General secretory system II protein E domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LM65_HALO1 Length = 479 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 31/143 (21%), Positives = 65/143 (45%), Gaps = 2/143 (1%) Query: 489 PLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG++L+ + ++ QL+ AL + EG RLG ++ GLI A+ + L + G+ + Sbjct: 2 KLGEMLIRDGCVSAPQLERALARQAQEGGRLGTILVEMGLIDADTVTVYLGLELGIPIAT 61 Query: 548 -IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + + + + + A + +P+ +++ ++I +D D L L R G ++ Sbjct: 62 GATLERAKRTAVRLLTPAQARQFRCIPIIVQDRQIIAALDDPHDLEVLDELYRLTGYRIL 121 Query: 607 YVIVLRGQIVTGLRHWYARRRGH 629 + +I L +Y R Sbjct: 122 PRVAPEIRIFYYLERYYGIPRPQ 144 >UniRef50_D1WQZ8 Glycosyltransferase probably involved in cell wall biogenesis-like protein n=2 Tax=Streptomyces RepID=D1WQZ8_9ACTO Length = 446 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 76/454 (16%), Positives = 138/454 (30%), Gaps = 57/454 (12%) Query: 50 SVYRRYPRMSYRELYKPDEKP--LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTY 107 R+ R D VP+ +E VI D+ H++V Sbjct: 22 WAGSRHAYARRRPTEPGDPAHHDWHFFVPSRDEETVIATTV--TRLRTDFPAAHVWVIDD 79 Query: 108 PNDPDTQRDVDEVCARFPNVHKVVCARPGPT-SKADCLNNVLDAITQFERSANFAFAGFI 166 +D T V + A V V RP K LN +A+ + + Sbjct: 80 ASDDRTGPIVSALAAEDTYVRLVSRRRPDARIGKGAALNAAYEAMNAHLGEVDRSRVVVC 139 Query: 167 LHDAEDVISPMELRLF----NYLVERKDLIQIPVYPFE------REWTHFTSMTYI---- 212 + DA+ +SP + +QI V + Y Sbjct: 140 VVDADGRLSPDAPAHVSGPDGFADPETGGVQIGVRMRNVDDARPLPERGRIANAYARLLI 199 Query: 213 ----DEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDY 268 EF+ + + G V G G A+ + A + +L EDY Sbjct: 200 RMQDAEFA-ASNTAMQLLRRRTGSVGLGGNGQFTRLTALDRIAAAERR-PWKQDALLEDY 257 Query: 269 DIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKS 328 ++G +++ G V + +E P T + Q++ Sbjct: 258 ELGMQMRLAGYRVTHV----------------------PDAWVTQEALPRT-RRFLTQRT 294 Query: 329 RWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWP 388 RW G Q + + ++ R ++ F +A L ++ L +L + L Sbjct: 295 RWAQG-NIQCVRYAGRI--IGSRHYRARGVLESLYTFFQPIAHLTVLALTAVLVFILLTG 351 Query: 389 DAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLIN 448 L A + +L + ++ V R F LT L + + + Sbjct: 352 VTGAVLFAAWPLALALGILSVVPFVL-WGPVYRKEFAPDRSRLTGVLWGITLWLYAYHL- 409 Query: 449 FMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 F+ + R ++L+ + W KT + + Sbjct: 410 FIVSARGSVRLLR----GKTGWAKTRRNAETAAT 439 >UniRef50_B8DWB6 Predicted glycosyltransferase n=10 Tax=Bifidobacterium RepID=B8DWB6_BIFA0 Length = 428 Score = 117 bits (293), Expect = 2e-24, Method: Composition-based stats. Identities = 70/432 (16%), Positives = 139/432 (32%), Gaps = 57/432 (13%) Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 P +K A+++ A NE V+GN+ + + + I++ D T A Sbjct: 39 AAPMDKRYAVLISARNEEQVVGNLIRDIQSQSYPSKLIDIWLVADNCDDGT-----AQLA 93 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 R H V K L +L+A+ + A+ + F + DA++ + Sbjct: 94 RDLGCHVVERFNQQQVGKGYALTYLLNAMI--DSKASDQYDAFFVFDADNRLDKHYFEEM 151 Query: 183 NYLVE-RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 N + ++ +S + F R L G G Sbjct: 152 NKAYQSGFRILTSYRNSVNLSENWVSSGS-ALWFIRESRFVSASRMWLGNSCHVGGTGFM 210 Query: 242 FSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFL 301 FS+ + + LTED + G +V ++ + + Sbjct: 211 FSQEVMRR------NQGWKFHLLTEDLEFTMDSVLHGDRIGYVGSAILYDEQ-------- 256 Query: 302 QHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA 361 P TF+ + RQ+ RW G Q F+ + +L RD Sbjct: 257 ---------------PVTFAQSWRQRLRWSKGF-LQVFRYYG--PALVRRAIQERDFSSI 298 Query: 362 -ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWL---MTLLWLNFGLMVNR 417 ++ F+ +L +I++LL + + +W + L +++++ + Sbjct: 299 DLTLFICPFTVLAIIRVLLGTIFAACGFISWSSQGAALFNWMLGVVSSMVFMMVLAGLTM 358 Query: 418 IVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDF 477 +V+R L LS ++ + + +++A+ + W H Sbjct: 359 VVERKQIGASNRELFAYALSFP-IYILSYVPI--SFQAVF--------AKAQWKPIEHQG 407 Query: 478 PSVTGDTRSLRP 489 S D R Sbjct: 408 SSGAEDPRIREQ 419 >UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium RepID=A1UIF2_MYCSK Length = 789 Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats. Identities = 76/455 (16%), Positives = 150/455 (32%), Gaps = 72/455 (15%) Query: 54 RYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPD 112 R R+ + ++ +++++ A+NE VI EL + + + + Sbjct: 397 RQNRLRWNDIGDDQLPMVSVVLAAFNEEKVIARTIAELRRSDYPRSRFEVVAVNDGSTDG 456 Query: 113 TQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAED 172 T R + E+ +P + V A G K+ +NN ++ + + DA+ Sbjct: 457 TLRILTELARDWPKLRVVDQANSG---KSSAINNGINHASA-------VSTVMVTMDADT 506 Query: 173 VISPMELRLFNYLVERK------DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVR 226 + P +R R + + R T+ ++ S + + R Sbjct: 507 LFRPDTIRNLARHFARHTHGRQVGAVAGHIKVGNRR-NLLTAWQSLEYISGICVTRMAER 565 Query: 227 EALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRF 286 A + G + +SR A+ + F ++ ED D L+ +G + Sbjct: 566 LLNAISI-VPGACSAWSRTALEEI------GGFCDDTMAEDCDATLALQRRGYRILQENN 618 Query: 287 PVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTS 346 + D P+T +Q+ RW G + +K Sbjct: 619 AIADT-----------------------EAPETIRALAKQRKRWTYGNIQALWKHRA--- 652 Query: 347 SLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTL 406 L+R R GA+ + L + ++ LL A +S+ +G+ + L Sbjct: 653 ------MLFRPRYGALG--LVALPYAALSLIVPLLFMPLTIVAA--GMSLAAGNWQSIAL 702 Query: 407 LWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRL---FWGNLINFMANWRALKQVLQHG 463 + I + + ++ V R+ + + + +RA+K + Sbjct: 703 FAGFVAALHMIISITAVAMARERAWHLLVVPVYRIIYEPLRAYLLYASAYRAIKGTI--- 759 Query: 464 DPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQ 498 VAWDK R RP+ IL Q Sbjct: 760 ----VAWDKLERRNTVSAFVERH-RPMPPILGAQQ 789 >UniRef50_A1ASU1 Response regulator receiver protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ASU1_PELPD Length = 274 Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 77/202 (38%), Gaps = 2/202 (0%) Query: 485 RSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 + + LG IL+E Q++T + L + R G ++ GL++ ++LA+ALA Q + Sbjct: 2 KQWKKLGHILIEEQILTPVAVARLLAIAKRHNTRFGWTLEDLGLVTGDELAKALARQFEL 61 Query: 544 AWE-SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 ++ L+ L V PL+LE+ L+V D D +L G Sbjct: 62 RRVTNLVNGSYSQELLNTFTVEFVLEQVVFPLKLEDKVLVVAVADPTDLKTLHNFGANHG 121 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 ++ + R +I + +Y + + + V L + ++ + + Sbjct: 122 VQIMPCVASRREIHEAICTFYLGKNIQESKQTTVLVVDDDVLIQTILRDMLNSHGYRVVI 181 Query: 663 FAEILTTLGHINRSAINVLLLR 684 + + ++L Sbjct: 182 AKDGIEGFRETIACRPQIILTD 203 >UniRef50_C8W151 Type II secretion system protein E n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W151_DESAS Length = 545 Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 89/218 (40%), Gaps = 20/218 (9%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + + LG+ L++ V+ E+L ALR R +L ++ QG + QL L E + Sbjct: 2 TKKNLGEFLVDRGVLGREELAAALRAQRGSKKKLEELLVDQGYLQEAQLTPLLGEFFDMP 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 D + ++A +P VAL + ++P+ L+ ++L + + + V L L R G++ Sbjct: 62 VFPADEVRFAPEVLATVPRPVALKHNIIPVALKENDLFIACSEPANSVILENLRRLTGKR 121 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFA 664 + V++ + LR Y+ G +AV T + L Sbjct: 122 LHLVLMSSSGLAGVLRQAYSEDTG--------DAVTADEETAVTGTD------DAIKLLE 167 Query: 665 EILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVI 702 ++ S +++ E S L + +G++ Sbjct: 168 GLIIKAVAQRASDLHL-----EPLSDGLRVRMRVDGML 200 >UniRef50_B3E9S8 Type II secretion system protein E n=2 Tax=Geobacter RepID=B3E9S8_GEOLS Length = 644 Score = 115 bits (288), Expect = 7e-24, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 84/242 (34%), Gaps = 20/242 (8%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLA 534 D V R L ILL+ +I +++L A +R + L ++L I EQLA Sbjct: 50 DLSLVLDQKNKRRKLADILLKEGMIDQQKLSQARELSRQNDIPLERALLKLRFIDEEQLA 109 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSL 594 +++A Q + + I + + L + + A + ++P+ + + L + + + Sbjct: 110 RSVAAQFDLPYVEIHSISLDPDLSRYISSVYAQKHMLVPISMIGNTLTLAMAQPLRHHDI 169 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLY----------------NA 638 L + K+ VI + L+ Y AM + Sbjct: 170 RQLEDNIRLKIISVIASEASVQRALKMLYRVDLSSSSSAMDEVNLDLVPDSISELLNKTS 229 Query: 639 VQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVT 698 + E++ ++ + L +I+ S I++ + + Sbjct: 230 AIDEPEVEEEVRKVTEKDSVIVKLVNKIIYDAYMKKASDIHI---EPYPGKRDVTVRIRI 286 Query: 699 EG 700 +G Sbjct: 287 DG 288 >UniRef50_Q098P0 Putative ATPase n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q098P0_STIAU Length = 365 Score = 114 bits (286), Expect = 1e-23, Method: Composition-based stats. Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 5/143 (3%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + LG +L ++ E QL AL + G LG ++ G +A+Q+ + LA Q + Sbjct: 24 RKKRLGDLLQAAGLVDELQLRAALGFHHKWGTPLGQVVVDLGFCTAQQVLELLANQAQLP 83 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE---LIVGSEDGIDPVSLAALTRKV 601 +DA + L+ +P VA V+PLR E L+V + DPV+L + R Sbjct: 84 MVDLDAEMLDPQLVEVLPVRVAESCRVIPLRQEGPRDSVLVVATAAPGDPVALDEVARLT 143 Query: 602 GR-KVRYVIVLRGQIVTGLRHWY 623 G+ +V ++ I + Y Sbjct: 144 GKTRVVTLLATDAAISQAIDRLY 166 >UniRef50_A5KT43 Type II secretion system protein E n=6 Tax=candidate division TM7 RepID=A5KT43_9BACT Length = 593 Score = 114 bits (285), Expect = 1e-23, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 94/244 (38%), Gaps = 10/244 (4%) Query: 492 QILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 Q+L T++QL + + L + I+ ++L Q A+ + + ID Sbjct: 10 QLLRNTGRATDDQLRVLQKEQATTHRPLQELAVRHQYITPKELTQLYAKTVDIPFIEIDP 69 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIV 610 IP+ + +P VA Y + ++EN + ED D +++ L +++G +R I Sbjct: 70 RDIPTDALKLLPERVARQYNAIVFKIENGIKFLAMEDPDDIQAVSFLEKQLGSDIRLHIA 129 Query: 611 LRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTL 670 I+ + + + + + EQ + E + P +L Sbjct: 130 THENILRAIESYRSDVGKELSEVIQVERAEGAEGAEQVSEEDIAEDSPIAQTVNLLLEYA 189 Query: 671 GHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLLKAGL 730 N S I++ E + +GV+ + ++R+ Q+ L + + + + L Sbjct: 190 IRSNASDIHI-----EPREDFVQIRYRIDGVLQE--VNRL--PQKVLNALVSRIKILSNL 240 Query: 731 NTEQ 734 ++ Sbjct: 241 KIDE 244 >UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVJ2_9BACT Length = 1181 Score = 114 bits (285), Expect = 1e-23, Method: Composition-based stats. Identities = 82/483 (16%), Positives = 143/483 (29%), Gaps = 76/483 (15%) Query: 7 VFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 +FA L+ ++I V + I V +V + + Y R R Sbjct: 747 MFALIFGILRFLSIAFVVAIA------LGIARVAFVTSLAIWV-----YFRSKPRGRPIE 795 Query: 67 DEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN 126 + ++I++PA+NE V+G DY + I + T V++ A Sbjct: 796 NPPLVSIIIPAYNEQSVVGRTIRSVLAN-DYPHMEIIFVDDGSTDGTADAVEQEFAGHEK 854 Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL-RLFN-Y 184 V V G KA LN+ + +++ E + DA+ + RL + Sbjct: 855 VRVVRQVNGG---KASALNHGI-LVSKGE--------IIVGLDADTQFRKETITRLIRHF 902 Query: 185 LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 R + V R + + E+ D L G + R Sbjct: 903 RDPRVGAVAGNVKVGNRI--NLITRWQALEYITSQNVDRLAYAQLNAVTVVPGAIGAWRR 960 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 A+ + + +L ED D+ +R++ KG V Sbjct: 961 TALDEV------GGYLTDTLAEDMDLTWRIRRKGWKIETEAGAVALT------------- 1001 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTH------KWTSSLTLNYFLWRDR 358 P T +Q+ RW G + +K W + L LW Sbjct: 1002 ----------EAPATTQAFFKQRFRWSFGTLQCLWKHRRALFRYGWFGWVGLP-TLW--- 1047 Query: 359 KGAISNFVSFLAMLVMIQLLLLLAYESLWPD----AWHFLSIFSGSAWLMTLLWLNFGLM 414 F ++ + L L ++ S W + A L L Sbjct: 1048 -LFQILFQVIAPLVDLQVLYSLWSFGSSWFSEHYLGIVNQAATPPGALLQQTLLFYALFY 1106 Query: 415 VNRIVQRVIFVT-GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 + + V G L+ F+ + + W+A+ L+ R W K Sbjct: 1107 AVELAGAAVAVAIDRDGWRLLPWLFLQRFFYRQLMYAVLWKAV---LRAFLGERTGWGKL 1163 Query: 474 THD 476 Sbjct: 1164 ERR 1166 >UniRef50_A7HJC2 Type II secretion system protein E n=11 Tax=Thermotogaceae RepID=A7HJC2_FERNB Length = 569 Score = 114 bits (285), Expect = 1e-23, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 104/254 (40%), Gaps = 16/254 (6%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + LG +L+E +IT +L+ AL + + LG ++ G + + + +ALAEQ G+ Sbjct: 9 KPKKLGDVLIEKGIITPFELEKALETQSQLKKPLGEVLVQMGYCTWDDIVRALAEQYGIE 68 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA-ALTRKVGR 603 I +I + + + P + ++P+ + +L VG + D ++ L ++ + Sbjct: 69 AC-IGEVKIDNEFVKKFPKELIQELKIVPISERDGKLFVGISNVYDIPNVKRRLKFRMNK 127 Query: 604 KVRYVIVLRGQIVTGLRH-WYARRRGHDPRAMLYNAV-QHQWLTEQQAGEIWRQYVPHQF 661 V + + + + + V Q + ++ +I + P Sbjct: 128 DVDFCLFSPSVFESMYNTVIHGMSSSVFAEQIGDAIVQQQEEEKTEETEKISEEDTPVVK 187 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM 721 L + I++ ++ S I++ E + +GV L R+ R + ++ Sbjct: 188 LVSNIVSHAIELDASDIHI-----EPQRKNVVVRYRVDGV-----LRRITEYPRNMHSAV 237 Query: 722 QS-LLLKAGLNTEQ 734 S + + +GL+ + Sbjct: 238 VSRIKILSGLDIVE 251 >UniRef50_Q1AST4 Glycosyl transferase, family 2 n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AST4_RUBXD Length = 425 Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats. Identities = 77/414 (18%), Positives = 130/414 (31%), Gaps = 60/414 (14%) Query: 71 LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 ++VPA NE VI N+ + V ++ T V P V + Sbjct: 41 FVLLVPALNEERVISRTLSSLLRL--RGNFLVLVIDDASEDGTVAAVRPFLE-NPRVRLL 97 Query: 131 VCARP-GPTSKADCLNNVLDAITQFERSA--NFAFAGFILHDAEDVISPMELRLFN--YL 185 K LN + AI + S + DA+ + P L + Sbjct: 98 RQPPEEARRGKGHVLNAGVGAIRRMRISEYFGAENIIVTVFDADARVEPGFLDAVAPCFA 157 Query: 186 VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 R +Q V + + T + EF+ G+ + + L G G G C Sbjct: 158 DPRVAGVQSAVRMYNADANLLTFWQNL-EFAIW-GRVMCRAKNLLGSATLGGNGQCVRLS 215 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 A+ L + + SLTED D+ RL G + Sbjct: 216 ALEGLGEE----PWQAASLTEDLDLSLRLLASGGGLLRFCPSAT---------------- 255 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNF 365 +E P VRQ+SRW+ G ++ L D ++ F Sbjct: 256 -----VWQEAVP-ELGRLVRQRSRWMQG-HLVCWQHLPRLLRGALPLRARLD----LAVF 304 Query: 366 VSFLAMLVMIQLLLLL----AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 + A ++ + L LL + + L++ +++ L + + R+ R Sbjct: 305 LLLPATVLPVGLASLLSWQQLLSGVGGWSLSGLALTYAIGFVVAPLAVAYLA---RVEGR 361 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + + +G L W F+A+ A +L R W KT Sbjct: 362 GVLRSLLHG---HLFVFYSAVW-----FLASVAAWWNIL----LGRRGWAKTAR 403 >UniRef50_A1SFQ9 Type II secretion system protein E n=1 Tax=Nocardioides sp. JS614 RepID=A1SFQ9_NOCSJ Length = 563 Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 89/246 (36%), Gaps = 18/246 (7%) Query: 494 LLENQVITEEQLDTA--LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAW 551 L+E IT +QL A L + L +L G + + + A G+ + + + Sbjct: 15 LVEGGWITRDQLSEAGRLADERSQTVL-EVLLESGWVDRTTVVRTAAASAGLEYVELTDF 73 Query: 552 QIPSSLIAEMPASVALHYAVLPLRLENDELIVGSE--DGIDPVSLAALTRKVGRKVRYVI 609 + + ++ +PA A VLPL E+ EL+V D L+R +VR+ I Sbjct: 74 IVDMAAVSLLPAEFARRTGVLPLVHEDGELLVAVSVRQAGDIELKDDLSRLTRSRVRFAI 133 Query: 610 VLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTT 669 R I + Y R G A + + E+ + P ++ Sbjct: 134 AGRSDIDARINQVY-RAEGELTDITSDLAPEDEVDDLSTLTEVSDE-APVVRFVNLLINQ 191 Query: 670 LGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS-LLLKA 728 + S I++ E + + +GV L + +Q + S L + A Sbjct: 192 AINDRASDIHI-----EPTERDMRVRYRIDGV-----LHDAHRSPKSIQNGVISRLKIMA 241 Query: 729 GLNTEQ 734 +N + Sbjct: 242 EMNIAE 247 >UniRef50_B8JGM0 General secretory system II protein E domain protein n=2 Tax=Anaeromyxobacter RepID=B8JGM0_ANAD2 Length = 426 Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats. Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 2/156 (1%) Query: 487 LRPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 LG++LL T + AL N+ + G RLG ++L G + LA+AL +Q+GV Sbjct: 2 RTRLGELLLRAGACTPAAIRDALENQVIFGGRLGTNLLELGAVDEGALARALGQQHGVPA 61 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 S + ++ +A + A VA V+P L L V + D D L + GR+V Sbjct: 62 LSGE-VRLEPEAVAVLRAEVADRCDVVPFLLAGRRLAVLAVDPSDLRVLDEVAFAAGREV 120 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQH 641 ++ ++ LR Y R + +++V+ Sbjct: 121 HALVAPEARVWALLRRAYGIERQLRGIEVDFDSVRR 156 >UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S0Z5_THEPD Length = 420 Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 122/411 (29%), Gaps = 65/411 (15%) Query: 70 PLAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 + ++VP+ +E + + E + V +D + V R+P Sbjct: 68 RVTVIVPSKDEGRRVERCLNAILSSDYPLEKLEVIVVDASSDGYVEEIVRRAGERYPGAV 127 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE- 187 +++ P K LN L T + DA+ V +R +E Sbjct: 128 RLIREEE-PRGKPAALNRALREATG---------EVVAVFDADSVPERDAIRRAVKHLEE 177 Query: 188 -RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 +Q + + +E + H + RE L VP G R A Sbjct: 178 PGVAAVQGKTLVLNERESVLARVASKEEKAWFHA-LIRGRERLGLFVPLTGSCQFVKRSA 236 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 + + + +L ED ++ L +G + Sbjct: 237 LEEV------GGWREDALAEDLELSMDLLARGYRVKYA---------------------- 268 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV 366 N + + P + + Q++RW G + + R+G + + Sbjct: 269 -NDVVSWQEAPTSLRSLAVQRNRWYRGYMEAFARHL---------RLALAGRRGLDAAIL 318 Query: 367 SFLAMLVMIQLLLL--LAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF 424 S L+ + LL + + P HF + + A L + + + + + Sbjct: 319 SAGPYLMALSLLAVAAWLASTALPHVNHFSTPAALVAALNAVSLFSVSVAL------ALS 372 Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + V+ +W L + AL + + R W +T Sbjct: 373 ERPVSAKNLAWVPVIYAYWFTL-----SAVALHALAEIILRRPRVWRRTPK 418 >UniRef50_Q1D133 General secretion pathway protein E, N-terminal domain protein n=2 Tax=Cystobacterineae RepID=Q1D133_MYXXD Length = 293 Score = 113 bits (283), Expect = 2e-23, Method: Composition-based stats. Identities = 46/173 (26%), Positives = 73/173 (42%), Gaps = 10/173 (5%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQN 541 +T S RPLG+ILLE V+ QL L + E + LG +++ +GL S + + LAEQ Sbjct: 9 ETGSRRPLGEILLEQGVLNRAQLRVGLVHHHEVHVPLGRALVREGLCSGADVLRGLAEQF 68 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN--------DELIVGSEDGIDPVS 593 GV ++ S + +PA VA Y V+PLR++ + L + + + Sbjct: 69 GVDAVDLERTPPDSRRLNHIPARVARQYRVVPLRIDKVLLDQGEREVLHIALPAPVSLDA 128 Query: 594 LAALTRKVGR-KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLT 645 + A+ G +V I + L Y +P A L Sbjct: 129 VDAVRAVSGMPRVEAHIASDAALARALADLYGIETPAEPPMPPSLAPGGPLLL 181 >UniRef50_A3WSK4 Glycosyl transferase, family 2 n=1 Tax=Nitrobacter sp. Nb-311A RepID=A3WSK4_9BRAD Length = 627 Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 67/214 (31%), Gaps = 24/214 (11%) Query: 37 DVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL--------AIMVPAWNETGVIGNMA 88 D+ V + + R P+ ++ + E + ++ Sbjct: 239 DIWSSVLALWFLAFIGLRLAASFLPRRSARCAPPIPDSCLPVYTVIAALYREASSVASLL 298 Query: 89 E-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNV 147 + A E + + +D +T+ + + P+V ++ GP +K LN Sbjct: 299 RAIEALDYPREKLDVIIVIELDDLETRAALARLGP-MPHVQVLLAPTEGPRTKPKALNCA 357 Query: 148 LDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIPVYPFEREW 203 L + + DAED P +LR F +Q + R Sbjct: 358 LPFA---------RGSFTAVFDAEDRPDPGQLRAALDAFRTQGVDVACVQASLCIENRSD 408 Query: 204 THFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 + + M + E++ +P + +P G Sbjct: 409 SWLSRM-FAAEYAGQFDVFLPGLASFGVPLPLGG 441 >UniRef50_Q1D3E1 General secretion pathway protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D3E1_MYXXD Length = 338 Score = 112 bits (280), Expect = 4e-23, Method: Composition-based stats. Identities = 34/141 (24%), Positives = 72/141 (51%), Gaps = 7/141 (4%) Query: 487 LRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + +G++L+E V+TEEQ+ AL R RLG ++ QGL + +AQAL+ Q+ + Sbjct: 2 RKKIGELLVEAGVVTEEQVRVALGRRGAFGSHRLGEVLVAQGLCTPTHIAQALSAQHALP 61 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLE----NDELIVGSEDGIDPVSLAALTRK 600 + ++ +IP+++ + + +LP R+E ++ ++V +D D + L + Sbjct: 62 FVAL-PEEIPANVAGLVSVDFQSEHRILPFRMEVEGRSERILVAVDDPADVTLVDELRFQ 120 Query: 601 VGRKVRYVIVLRGQIVTGLRH 621 + +++R + + L Sbjct: 121 LRKQMRVFVAASDDLDAALAR 141 >UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase n=76 Tax=Firmicutes RepID=B7HFD6_BACC4 Length = 433 Score = 112 bits (280), Expect = 5e-23, Method: Composition-based stats. Identities = 75/471 (15%), Positives = 148/471 (31%), Gaps = 74/471 (15%) Query: 13 YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLA 72 + ++ + I A + + F+ + + R ++ ++ + Sbjct: 24 FSIEYVLIFTAFLFSGLLVYYSFLTIAGLIHRNSKR------------KDRTLEHYPSVD 71 Query: 73 IMVPAWNETGVIGNMAELAATTLDYE-NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVV 131 I +PA NE VI + E A ++Y I++ + +T D+ + ++ + Sbjct: 72 IFIPAHNEGIVIKDTLE-AMAKIEYPGKLTIYLLNDNSQDETPEIGDDFDKAYAHICHIR 130 Query: 132 CARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL---FNYLVER 188 P K+ LN L ++ + F ++DA++ P LR+ E Sbjct: 131 VPPGEPKGKSRVLNYGLS-------ISDGEY--FCVYDADNQPEPHALRMLVEHAETTED 181 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 V T M EF R L G R A+ Sbjct: 182 AVGAVGHVRTVNENRNWLTRMI-SLEFQIFQLLMQSGRWLLFQTGSLTGTNMLLRRSALE 240 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 L +D ++ ED ++ R+ +KG V V Sbjct: 241 EL------GGYDPYAIAEDAELTLRITQKGYLLPIVPESV-------------------- 274 Query: 309 MICVREYFPDTFSTAVRQKSRWIIG---IVFQGFKTHKWTSSLTLNYFLWRDRKGAISNF 365 E P+ ++Q++RW+ G I+ + F + + K + + Sbjct: 275 ---TWEQEPEHLKILIKQRTRWLQGNLYILEKMFSSLSFFK-----------GKLLVHSL 320 Query: 366 VSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 L +V L L ++W + + +W + + V Sbjct: 321 QQVLVYVVF---WLFLIISNVWFVIGLLGIFQIQYSIPLLFMWYVAYITYVSQLFSAQSV 377 Query: 426 TGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR-VAWDKTTH 475 + T +SV+ F + R+L L+ ++ + WDKT Sbjct: 378 ERTFTPTNIFISVIMYFTYAQLFTYLFIRSLILYLRAKSKKQVIGWDKTVR 428 >UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bacteria RepID=C4I9P5_BURPS Length = 520 Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats. Identities = 89/477 (18%), Positives = 151/477 (31%), Gaps = 68/477 (14%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 ++ + A +L+ ++ + A ++ GLD F + R YR + Sbjct: 98 VISLCAAYLWVFALLTLVYASRHYVFGLDRLF------------------KPQRAPYRAI 139 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 D + + V A NE V+ + L ATT E I + +T+ +DEV A Sbjct: 140 THADWPEITVFVAAHNEEAVVADCLTALLATTYPRERLTIVPVNDRSTDNTRALIDEVQA 199 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-- 180 R P + K G KA L + L I ++ DA+ + P L+ Sbjct: 200 RAPELIKPFHRESGKPGKAAALKDALREI---------RGDIMVVFDADYLPRPGLLKEL 250 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 + + + V P + + E + + + R L G Sbjct: 251 VAPFFDPEVGAVMGRVVPQNADRNLLARLL-DLERAGGYQVNQQARNNLGLVPQYGGTVG 309 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 + A+ A+ + +L ED D+ +RL +++ Sbjct: 310 GVRKSALDAV------GGWRDDTLAEDTDMTYRLLLSNWRTVYL---------------- 347 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHK-WTSSLTLNYFLWRDRK 359 N E P+ + RQ +RW G F+ S D Sbjct: 348 -------NHAECYEEVPERWPVRARQLTRWAKGHNQTLFRYLIPLLRSPVTPRRCRLDGA 400 Query: 360 GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIV 419 + FV + + + L L + D+ + S A + NFG+ +V Sbjct: 401 LLLGVFVMPALLALAWGIALALYLTNGI-DSLVLGLLVSVFALFAFSTFGNFGVFFEIVV 459 Query: 420 QRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR-VAWDKTTH 475 G L V G + A AL + RR + WDKT Sbjct: 460 -----AARLDGRATRLRLVPVNVVGFCVTIAAVVAALWGLALDALLRRELRWDKTER 511 >UniRef50_Q09DM8 Serine/threonine-protein kinase Pkn6 n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09DM8_STIAU Length = 918 Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats. Identities = 42/168 (25%), Positives = 75/168 (44%), Gaps = 7/168 (4%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLA 534 R LG +L+ ++E QL L R +G +LG ++ +GL++ E + Sbjct: 408 QLAPPRARPTVRRQLGDLLIAAGKLSEAQLHAMLERQRRDGGKLGEWLVAEGLVTDEDVV 467 Query: 535 QALAEQNGVAWES---IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDP 591 A++EQ G+ + + + +P+ L++ +P A + +PL L EL+ + + Sbjct: 468 AAISEQLGIPFIAEHQLRHLPVPTPLLSLLPLEHAARFEAVPLTLHGKELVCAMREPENL 527 Query: 592 VSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAV 639 L+ L G VR V+ G I + +Y G DP A + A Sbjct: 528 DRLSELQFLTGYAVRGVLASDGAIRRAINRFYL---GEDPPAGMDWAS 572 Score = 44.8 bits (104), Expect = 0.014, Method: Composition-based stats. Identities = 13/54 (24%), Positives = 29/54 (53%) Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 ++L G ++ + ++ +L R R LG++LV EG+++ E + ++ Q Sbjct: 420 RQLGDLLIAAGKLSEAQLHAMLERQRRDGGKLGEWLVAEGLVTDEDVVAAISEQ 473 >UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L983_9DELT Length = 1140 Score = 111 bits (278), Expect = 9e-23, Method: Composition-based stats. Identities = 84/478 (17%), Positives = 143/478 (29%), Gaps = 90/478 (18%) Query: 29 SGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP---LAIMVPAWNETGVIG 85 +GL F+ L+V + ++ R P +A++VPA+NE V+ Sbjct: 723 AGLQYLFVTGTILGLGRLLILAVLAVFEKVRGRRRPVSGPAPDLSVAVVVPAYNEEKVVL 782 Query: 86 NMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLN 145 + + I V + T R + E A V V G K LN Sbjct: 783 QTVQSLLACQHPATFEIIVVDDGSTDATYRVLCEALAGEKLVTIVTKPNGG---KPAALN 839 Query: 146 NVLDAITQFERSANFAFAGFILHDAEDVISPME-LRLFNY-LVERKDLIQIPVYPFEREW 203 + + + DA+ V + LRL ++ + + R Sbjct: 840 HGIALTRA---------DIVVTLDADTVFARDTILRLADWFRDPKVGAVAGNAKVGNRI- 889 Query: 204 THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQS 263 +F + E+ D L G + R V A F ++ Sbjct: 890 -NFLTRCQALEYVTSQNLDRRALTVLDSVTVVPGAVGAWRREVVEA------AGGFSGET 942 Query: 264 LTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTA 323 L ED D+ R++ G T + V PDT Sbjct: 943 LAEDADLTIRIQRMGHTVAYEDRAVALT-----------------------EAPDTMRGF 979 Query: 324 VRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSF--------------- 368 +RQ+ RW+ G + +K L+R R G + F Sbjct: 980 LRQRFRWMFGTLQVAWKHKD---------ALFRPRYGLLGFFGLPNIWLYQIFFQIISPV 1030 Query: 369 ----LAMLVMIQLLLLLAYESLW-PDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 LA + +L L + + W PDA + + +L ++ R + + Sbjct: 1031 MDLWLAYTCLKSWVLWLWHPATWDPDALFRVLFYYALFMAADILAGLVAFLLERGEDKRL 1090 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 L R F+ L+ +A L+ +L R + W K + Sbjct: 1091 L---------AWLVPQRFFYRQLMYVVA----LRTLLASLRGREMGWSKLERKATVDS 1135 >UniRef50_C8WGD9 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WGD9_EGGLE Length = 444 Score = 111 bits (276), Expect = 1e-22, Method: Composition-based stats. Identities = 70/435 (16%), Positives = 124/435 (28%), Gaps = 74/435 (17%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 +LD F + + + + + + I + V R+ +EL Sbjct: 1 MLDQFFSQISFVDIFNFCVFLTFTICYTYQLYYVFVVLTRK---------------PKEL 45 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 A ++ A NE+ VIG++ + E +FV DT R E A Sbjct: 46 TAKKNHKFAAVISARNESAVIGDLIHSIKVQNYPSELIDVFVIADNCTDDTARVAREAGA 105 Query: 123 RFPNVHKVVCARPGPT--SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 +V R K L+ I ER A+ + + + DA++V+ R Sbjct: 106 -------IVFPRSNDKEVGKGYALDYGFQCIR--ERYADKGYEAYFVFDADNVLDVNYFR 156 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 N + + +++ S Y F R L +G G Sbjct: 157 EMNKTFDNGAKASTSYRNSKNYDSNWISAGYAVWFLREAKFLNQARLTLNTSCAVSGTGF 216 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 + + + LTED + +G + ++ + + Sbjct: 217 FIAADIIEK------NGGWKWHLLTEDIEFSANSILEGTRISYTPTAILYDEQ------- 263 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSS----------LTL 350 P TF + Q+ RW G + + Sbjct: 264 ----------------PITFRDSWNQRFRWAKGFYQVFWHYGARLAKGIAVNPKGARFAC 307 Query: 351 NYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN 410 L G + VS L +++ L L A + A SI LN Sbjct: 308 YDMLMTIAPGMLLTIVSVLFNAIIVFLSLTGAMSTGIMVASSLSSIL--------FCLLN 359 Query: 411 FGLMVNRIVQRVIFV 425 + + + FV Sbjct: 360 YFIFMFMFGVLTTFV 374 >UniRef50_UPI00016B268C type II secretion system protein E n=1 Tax=candidate division TM7 single-cell isolate TM7c RepID=UPI00016B268C Length = 586 Score = 111 bits (276), Expect = 2e-22, Method: Composition-based stats. Identities = 41/248 (16%), Positives = 100/248 (40%), Gaps = 15/248 (6%) Query: 489 PLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 L ++L+ +I + +D AL+ L + +GL+ E L +A+ +GV + + Sbjct: 11 KLIELLINEGLIEKSVIDDALKRASDNNKPLFSLLSEEGLLDNELLVHGVAQVSGVPYVN 70 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + I +++ + + VA + +PL + L V D + ++ L+ ++ R ++ Sbjct: 71 LSNSVISQDILSLLSSDVAERFMAVPLAEVQNRLAVAMIDANNVQAVDYLSNRIQRPIKV 130 Query: 608 VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEIL 667 + + L + + + A Q + L+E + Q P + IL Sbjct: 131 FMASEESVRHVLDQY---KTDLSSVNVAAEASQEESLSEAGNIKTIVQDSPISRALSTIL 187 Query: 668 TTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS-LLL 726 + S +++ E L +GV L ++ + + ++ ++ S + + Sbjct: 188 EYAVKSHASDVHI-----EPLEKALKIRCRVDGV-----LREIMQLPKSIEPALVSRIKI 237 Query: 727 KAGLNTEQ 734 + L ++ Sbjct: 238 LSNLKIDE 245 >UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A3S6_THESM Length = 388 Score = 111 bits (276), Expect = 2e-22, Method: Composition-based stats. Identities = 72/448 (16%), Positives = 136/448 (30%), Gaps = 69/448 (15%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP-LAIMVPAWNETGVIGNMAELAA-TTL 95 ++Y + L++ ++ P+E P + I++PA NE VI + A Sbjct: 1 MIYPIFVYYIVLTIAGLRYNSRFKRPEIPEELPSVTILIPARNEGLVIRDTLRAMANLDY 60 Query: 96 DYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFE 155 + + + + DT + +EV +P KV+ G + K+ LN L Sbjct: 61 PKDKLEVLLLDDGSTDDTAKIAEEVSKDYP-FIKVIRVEGGGSGKSYVLNYGLKLAKG-- 117 Query: 156 RSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDE 214 ++DA++ P L+ L L + + V T E Sbjct: 118 -------EVIAVYDADNRPEPGALKDLVAMLSDETPAVTGKVKTMNWNRNILTR-FICME 169 Query: 215 FSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRL 274 + + G + + L +D ++L ED ++ FR+ Sbjct: 170 YLYFQLAGQAGKSKFYKTAILPGTNFVIRKELLEEL------GGWDEEALAEDLELSFRI 223 Query: 275 KEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGI 334 G + V E P++ RQ++RW G Sbjct: 224 ILTGKKIAYTPLAV-----------------------TWEQEPESLRVWFRQRTRWAAGN 260 Query: 335 VFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFL 394 V + K + + R F FL ++V L + + + F+ Sbjct: 261 VHTVKEYVKRFKE--IPSWGLR--------FDLFLTLMVYYLLAMAVIVADV-----AFV 305 Query: 395 SIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWR 454 ++ S + W G + + IF Y G + W + + Sbjct: 306 ALLLTSGSVTWFTWSVLGFVYLAFLLE-IFAGLYDGKIKSPGC-----WLLALLMYHTYS 359 Query: 455 ALKQV-----LQHGDPRRVAWDKTTHDF 477 + + L + W KT Sbjct: 360 QIWILISLAGLWEARRAKKVWYKTPRTA 387 >UniRef50_A9GDR9 General secretion pathway protein E n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GDR9_SORC5 Length = 592 Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 90/256 (35%), Gaps = 18/256 (7%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQLAQALAEQN 541 + + LG+IL+ V+ E+L R G L ++ + +A+ALAE+ Sbjct: 2 SLQQQQYLGEILIRRGVVPAERLAPLYDTVRERGQPLADLIVSSNIADEASIAKALAEEC 61 Query: 542 GV---AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 V IDA +P L + +P + A + +LPL + + D +D ++ + Sbjct: 62 EVGLLPRIDIDA--VPLELASRVPITYAKQHKILPLAEHDGVVYCAVADPLDTTAIDDVR 119 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVP 658 G+ V + ++ + + R+ + A++ L + + P Sbjct: 120 ALFGKPVEIAVATSDAVLNAINRIWERKEDGGAKLEGDTALEEDNLVDIIDSDDD---AP 176 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 + S I++ E + +G E ++ + Sbjct: 177 IIAWVNGLFAQAVRERASDIHI-----EPEERDVVVRYRIDG----ELYVAKRASKQFMA 227 Query: 719 VSMQSLLLKAGLNTEQ 734 + + + A LN + Sbjct: 228 PIIARVKIMAALNIAE 243 >UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NHH7_GLOVI Length = 426 Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats. Identities = 78/454 (17%), Positives = 142/454 (31%), Gaps = 64/454 (14%) Query: 37 DVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP-LAIMVPAWNETGVIGNMAELAATTL 95 V W+ L R +S R P +P ++++V A NE V + L Sbjct: 33 ASVVWLAAGLTGLYAVRVLFALSPRPKSDPAYRPRVSVLVAAKNEQAVAAQLVA-MLRRL 91 Query: 96 DYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT-SKADCLNNVLDAITQF 154 DY ++ +++ + T + + E + +H V K+ LN Sbjct: 92 DYPDFEVWIADDGSTDRTYQRLLEAGRGWQALHLVRRIPERSRPGKSAVLN--------- 142 Query: 155 ERSANFAFAGFILHDAEDVISPMELRLF--NYLVERKDLIQIPVYPFEREWTHFTSMTYI 212 E ++ DA+ + P L + V +Q+ ++ +T Sbjct: 143 ELRERATGDILVVFDADARVEPDFLSRTVPLFAVSSVGALQVRKRVHNADFNFWTRGQSA 202 Query: 213 DEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGF 272 + + + R A+ G G G A+ A+ ++ ++T+D D+ Sbjct: 203 EMLLDAFYQQ--QRAAIGGTAELRGNGQLVRAAALEAV------GGWNEATVTDDLDLTL 254 Query: 273 RLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWII 332 RL G F P CV E T+S RQ+SRW Sbjct: 255 RLHLGGWQIAFASDP-----------------------CVDEEGVTTWSALWRQRSRWAE 291 Query: 333 GIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWH 392 G + + D+ + IQ L+ +A A+ Sbjct: 292 GGFQRYLDYAPRLFGGAMGTTKTVDQ-----------LIFCTIQYLMPVAAVLDLLFAFQ 340 Query: 393 FLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMAN 452 + + ++ +R + V L ++ L W +I Sbjct: 341 RGAAPLLTPLVLVATVFTVCGFYFGQRERGVQVGRAM-LETLAGTIYFLHWFPVILVKLA 399 Query: 453 WRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRS 486 AL +P+++ W KT H T D + Sbjct: 400 RTAL-------EPKKLVWVKTAHQGEYGTADFKP 426 >UniRef50_Q749Y3 Type IV pilus assembly protein, putative n=6 Tax=cellular organisms RepID=Q749Y3_GEOSL Length = 645 Score = 110 bits (274), Expect = 2e-22, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 87/239 (36%), Gaps = 21/239 (8%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALA 538 + T + LG IL+ +++ EE+L+ A + + +G L ++ L+ E LA+ +A Sbjct: 54 ILEQTGKRQKLGDILIRERLVDEERLNQARVAAKRDGSTLERALRKLRLVEEEPLAKTIA 113 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 Q +++ I+ +I L + + A ++P+ + + + I L L Sbjct: 114 TQYDLSFVHINTLEIEPDLARCINPNYAQRQRIVPISRIGNTITLAMAYPIKLHELKELE 173 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYA-----------------RRRGHDPRAMLYNAVQH 641 + + ++ VI + +I+ + Y G + A + Sbjct: 174 QSIKSRIIPVIAMESEIIQAQQRLYKTAASAAHALTLDEADLEIAPGSIVDILSSGAGED 233 Query: 642 QWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEG 700 + + + I + L +I+ S I++ + + +G Sbjct: 234 EPDIDDEVRTITERDSVIVKLVNKIIFDAHQNRASDIHI---EPYPGKNDVIVRMRVDG 289 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AFA6 Bacteriophage N4 adsorption protein B n=53 Tax=P... 629 e-178 UniRef50_B7LP59 Bacteriophage N4 receptor, inner membrane subuni... 611 e-173 UniRef50_B3GN83 Bacteriophage N4 adsorption NfrB-like protein n=... 473 e-131 UniRef50_B2UJM9 General secretory system II protein E domain pro... 430 e-118 UniRef50_B4SJA4 General secretory system II protein E domain pro... 410 e-112 UniRef50_A5P922 Bacteriophage N4 receptor, inner membrane subuni... 399 e-109 UniRef50_C6N7W0 Putative uncharacterized protein n=1 Tax=Legione... 333 2e-89 UniRef50_Q01UW7 Bacteriophage N4 receptor, outer membrane protei... 317 8e-85 UniRef50_C8WHH7 General secretory system II protein E domain pro... 313 1e-83 UniRef50_Q136N6 Bacteriophage N4 adsorption protein B n=1 Tax=Rh... 309 3e-82 UniRef50_Q1N778 Bacteriophage N4 adsorption protein B n=1 Tax=Sp... 297 8e-79 UniRef50_B8JA61 Bacteriophage N4 adsorption protein B n=2 Tax=An... 295 7e-78 UniRef50_Q1IXI4 Glycosyltransferase, NfrB-like protein n=2 Tax=D... 281 7e-74 UniRef50_Q2G4Z3 Bacteriophage N4 adsorption protein B n=2 Tax=Sp... 281 8e-74 UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaprot... 265 5e-69 UniRef50_Q1IRV6 Type II secretion system protein E n=24 Tax=Bact... 264 7e-69 UniRef50_Q0G6A6 Glycosyl transferase, family 2 n=1 Tax=Fulvimari... 263 2e-68 UniRef50_B8ICV2 General secretion pathway protein E n=2 Tax=Meth... 261 6e-68 UniRef50_D2LI28 General secretion pathway protein E n=1 Tax=Rhod... 260 1e-67 UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6... 260 2e-67 UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 259 2e-67 UniRef50_A0LUI4 Type II secretion system protein E n=4 Tax=Bacte... 258 8e-67 UniRef50_B8D2C7 Tfp pilus assembly protein PilB n=5 Tax=Firmicut... 257 9e-67 UniRef50_Q0EZ04 Putative uncharacterized protein n=1 Tax=Maripro... 256 2e-66 UniRef50_A8IJY6 Putative glycosyltransferase n=1 Tax=Azorhizobiu... 256 3e-66 UniRef50_C6CY56 Type II secretion system protein E n=3 Tax=Bacil... 255 4e-66 UniRef50_B6JD78 Glycosyl transferase, family 2 n=1 Tax=Oligotrop... 255 4e-66 UniRef50_B0RVF2 Glycosyltransferase n=8 Tax=Proteobacteria RepID... 255 5e-66 UniRef50_C6PR94 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 254 7e-66 UniRef50_Q1YKP1 Putative glycosyl transferase n=1 Tax=Aurantimon... 254 9e-66 UniRef50_B5JBJ3 GSPII_E N-terminal domain family n=2 Tax=Octadec... 254 1e-65 UniRef50_Q1GVK0 Bacteriophage N4 adsorption protein B n=1 Tax=Sp... 253 1e-65 UniRef50_D0B518 Glycosyl transferase, family 2 n=36 Tax=Brucella... 253 2e-65 UniRef50_C6XK76 Glycosyl transferase family 2 n=1 Tax=Hirschia b... 253 2e-65 UniRef50_D2LBM7 Glycosyl transferase family 2 n=1 Tax=Rhodomicro... 253 2e-65 UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Fi... 252 3e-65 UniRef50_C6PVI5 Glycosyl transferase family 2 n=2 Tax=Clostridiu... 251 7e-65 UniRef50_Q13CF3 Glycosyl transferase, family 2 n=10 Tax=Bradyrhi... 251 7e-65 UniRef50_C4DGM8 Glycosyl transferase n=1 Tax=Stackebrandtia nass... 250 1e-64 UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Ko... 250 1e-64 UniRef50_A9DFB5 Putative uncharacterized protein n=1 Tax=Hoeflea... 249 2e-64 UniRef50_B1IIP9 Glycosyl transferase, group 2 family protein n=2... 249 2e-64 UniRef50_D1B5J6 Type II secretion system protein E n=3 Tax=Syner... 249 3e-64 UniRef50_A1SEL1 General secretory system II, protein E domain pr... 249 4e-64 UniRef50_B7APV7 Putative uncharacterized protein n=2 Tax=Bacteri... 248 4e-64 UniRef50_A6DXN0 Glycosyl transferase, group 2 family protein n=2... 248 6e-64 UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase,... 248 6e-64 UniRef50_Q168M3 Glycosyl transferase, putative n=12 Tax=Rhodobac... 248 6e-64 UniRef50_B5YDI2 Type IV pilus assembly protein PilB n=20 Tax=Bac... 248 7e-64 UniRef50_A3VPZ4 Putative uncharacterized protein n=2 Tax=Bacteri... 248 8e-64 UniRef50_A4WQA1 General secretory system II, protein E domain pr... 247 1e-63 UniRef50_Q0ASE8 Glycosyl transferase, family 2 n=1 Tax=Maricauli... 247 1e-63 UniRef50_A3TTM2 Glycosyl transferase, family 2 n=1 Tax=Oceanicol... 247 2e-63 UniRef50_A3VHJ3 Glycosyl transferase, family 2 n=2 Tax=Rhodobact... 245 4e-63 UniRef50_Q28T00 Glycosyl transferase family 2 n=1 Tax=Jannaschia... 245 4e-63 UniRef50_B9L0Q0 Glycosyl transferase, group 2 family protein n=2... 245 4e-63 UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonada... 245 4e-63 UniRef50_Q7D1E2 Glycosyltransferase n=1 Tax=Agrobacterium tumefa... 245 4e-63 UniRef50_C6J4S3 Glycosyl transferase n=1 Tax=Paenibacillus sp. o... 244 6e-63 UniRef50_A7HQ64 Glycosyl transferase family 2 n=1 Tax=Parvibacul... 244 7e-63 UniRef50_Q0FJ05 Glycosyl transferase, group 2 family protein n=2... 244 1e-62 UniRef50_C6QAR9 Glycosyl transferase family 2 n=1 Tax=Hyphomicro... 243 1e-62 UniRef50_B6B2W2 Glycosyl transferase, group 2 family protein n=1... 243 2e-62 UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetyl... 243 3e-62 UniRef50_A1B414 General secretory system II, protein E domain pr... 243 3e-62 UniRef50_C4L4L3 Type II secretion system protein E n=1 Tax=Exigu... 242 4e-62 UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio... 242 5e-62 UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum... 241 6e-62 UniRef50_A3DEG0 Type II secretion system protein E n=5 Tax=Clost... 241 6e-62 UniRef50_B5ZHV8 Glycosyl transferase family 2 n=2 Tax=Gluconacet... 241 7e-62 UniRef50_B8FQB2 Type II secretion system protein E n=4 Tax=Clost... 241 1e-61 UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular... 240 1e-61 UniRef50_B0TEE9 Type ii secretion system protein e, putative n=1... 240 1e-61 UniRef50_C0R5T9 Glycosyl transferase, group 2 family protein n=8... 240 2e-61 UniRef50_B8HEE8 Glycosyl transferase family 2 n=1 Tax=Arthrobact... 240 2e-61 UniRef50_A8MGF8 Type II secretion system protein E n=1 Tax=Alkal... 240 2e-61 UniRef50_B0MQV8 Putative uncharacterized protein n=2 Tax=Clostri... 240 2e-61 UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Aci... 240 2e-61 UniRef50_A7GMR7 Glycosyl transferase family 2 n=3 Tax=Bacillales... 240 2e-61 UniRef50_C8WIR5 Type II secretion system protein E n=2 Tax=Bacte... 239 2e-61 UniRef50_B7QPG7 Glycosyl transferase, group 2 family n=4 Tax=Rho... 239 2e-61 UniRef50_D2RJA7 Glycosyl transferase family 2 n=10 Tax=Veillonel... 239 2e-61 UniRef50_B9Y801 Putative uncharacterized protein n=1 Tax=Holdema... 239 3e-61 UniRef50_A8LS31 Glycosyl transferase n=1 Tax=Dinoroseobacter shi... 239 4e-61 UniRef50_B2V1W7 Glycosyl transferase, group 2 family protein n=2... 239 4e-61 UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltran... 238 5e-61 UniRef50_D0L467 Glycosyl transferase family 2 n=1 Tax=Gordonia b... 238 5e-61 UniRef50_C4Z588 Type IV pilus assembly protein PilB n=9 Tax=Clos... 238 5e-61 UniRef50_C9R8Y9 Glycosyl transferase family 2 n=1 Tax=Ammonifex ... 238 5e-61 UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Str... 238 8e-61 UniRef50_D1BGY1 Type II secretion system protein E (GspE) n=3 Ta... 237 1e-60 UniRef50_Q181B0 Type IV pilus assembly protein n=5 Tax=Clostridi... 237 1e-60 UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Str... 237 1e-60 UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobi... 237 1e-60 UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1... 237 1e-60 UniRef50_A4T169 Putative uncharacterized protein n=4 Tax=Mycobac... 236 2e-60 UniRef50_A5D548 Glycosyltransferases, probably involved in cell ... 236 3e-60 UniRef50_B9R454 Glycosyl transferase, group 2 family protein n=1... 236 3e-60 UniRef50_D1NA10 General secretion pathway protein E n=1 Tax=Vict... 236 3e-60 UniRef50_C6CWA2 Glycosyl transferase family 2 n=4 Tax=Bacillales... 236 4e-60 UniRef50_A3V835 Glycosyltransferase, family 2 n=2 Tax=Rhodobacte... 235 4e-60 UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria ... 235 5e-60 UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycos... 234 9e-60 UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaprot... 234 9e-60 UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococc... 234 9e-60 UniRef50_B0MCL3 Putative uncharacterized protein n=1 Tax=Anaeros... 234 1e-59 UniRef50_B6R4T3 Glycosyl transferase, family 2 n=1 Tax=Pseudovib... 234 1e-59 UniRef50_Q2FNF4 Glycosyl transferase, family 2 n=1 Tax=Methanosp... 234 1e-59 UniRef50_B2A7G1 Type II secretion system protein E n=5 Tax=Firmi... 234 1e-59 UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibul... 234 1e-59 UniRef50_B1YJT8 Type II secretion system protein E n=5 Tax=Bacil... 234 1e-59 UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methano... 233 2e-59 UniRef50_D2R471 Type II secretion system protein E n=6 Tax=Planc... 233 2e-59 UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter galla... 233 2e-59 UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=F... 233 2e-59 UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3... 233 2e-59 UniRef50_Q1GE89 Glycosyl transferase family 2 n=3 Tax=Rhodobacte... 233 2e-59 UniRef50_A3JQ28 Glycosyl transferase, family 2 n=1 Tax=Rhodobact... 233 2e-59 UniRef50_C8W5J2 Type II secretion system protein E n=2 Tax=Clost... 233 3e-59 UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide dea... 233 3e-59 UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosami... 233 3e-59 UniRef50_A5CDZ1 Putative glycosyl transferase, group 2 n=2 Tax=O... 232 4e-59 UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax... 231 6e-59 UniRef50_Q1RIH7 Glycosyltransferase n=11 Tax=Rickettsia RepID=Q1... 231 8e-59 UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bact... 231 8e-59 UniRef50_C1AA14 Type IV pilus assembly protein PilB n=1 Tax=Gemm... 231 9e-59 UniRef50_A7IHY2 Glycosyl transferase family 2 n=1 Tax=Xanthobact... 231 9e-59 UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromon... 230 1e-58 UniRef50_A3UGE7 Putative uncharacterized protein n=1 Tax=Oceanic... 230 1e-58 UniRef50_B1I3E7 Type II secretion system protein E n=4 Tax=Clost... 230 2e-58 UniRef50_B8FZG2 Glycosyl transferase family 2 n=4 Tax=Clostridia... 230 2e-58 UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglom... 229 2e-58 UniRef50_B8E2U7 Type II secretion system protein E n=2 Tax=Dicty... 229 2e-58 UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA... 229 3e-58 UniRef50_Q01NU4 Type II secretion system protein E (GspE) n=1 Ta... 229 3e-58 UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium... 229 3e-58 UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobiu... 229 4e-58 UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter R... 229 4e-58 UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=St... 229 4e-58 UniRef50_C3RLB0 Type II secretion system protein E n=2 Tax=Bacte... 228 5e-58 UniRef50_C5S5F1 Glycosyl transferase family 2 n=1 Tax=Allochroma... 228 7e-58 UniRef50_Q8NU22 Glycosyltransferases, probably involved in cell ... 228 8e-58 UniRef50_B8IGT7 Glycosyl transferase family protein n=9 Tax=Alph... 227 1e-57 UniRef50_B5YIG2 Type IV-A pilus assembly ATPase PilB n=4 Tax=Bac... 227 1e-57 UniRef50_D2M877 Glycosyl transferase family 2 n=1 Tax=Rhodopseud... 227 1e-57 UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4... 227 2e-57 UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobac... 226 2e-57 UniRef50_B4W7G8 Putative uncharacterized protein n=1 Tax=Brevund... 226 2e-57 UniRef50_UPI0001B55850 glycosyl transferase family 2 n=1 Tax=Str... 226 2e-57 UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC... 226 2e-57 UniRef50_Q11MF6 Glycosyl transferase, group 2 family protein n=1... 226 3e-57 UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophi... 226 3e-57 UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria R... 226 3e-57 UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococ... 226 4e-57 UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibacul... 225 5e-57 UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum in... 225 6e-57 UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospir... 224 7e-57 UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporan... 224 7e-57 UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobac... 224 8e-57 UniRef50_B9JQZ7 Glycosyltransferase n=1 Tax=Agrobacterium vitis ... 224 8e-57 UniRef50_B8H475 N-acetylglucosaminyltransferase n=3 Tax=Caulobac... 224 1e-56 UniRef50_A6M148 Type II secretion system protein E n=18 Tax=Clos... 224 1e-56 UniRef50_A6C2Q8 Type II secretion system protein E n=3 Tax=Planc... 223 2e-56 UniRef50_A3CR73 PilB-like pili biogenesis ATPase, putative n=2 T... 223 2e-56 UniRef50_A1TUR3 General secretory pathway protein E n=5 Tax=Prot... 222 3e-56 UniRef50_C8WRT0 Glycosyl transferase family 2 n=2 Tax=Alicycloba... 222 3e-56 UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID... 222 3e-56 UniRef50_Q1D9E1 General secretory pathway protein E n=17 Tax=Pro... 222 4e-56 UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobac... 222 4e-56 UniRef50_B0T1N0 Putative uncharacterized protein n=1 Tax=Cauloba... 222 4e-56 UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=No... 222 5e-56 UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID... 222 5e-56 UniRef50_B0VJ41 Type IV pilus biogenesis protein PilB n=1 Tax=Ca... 221 8e-56 UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatu... 221 8e-56 UniRef50_A1TEN6 Glycosyl transferase, family 2 n=7 Tax=Actinomyc... 221 9e-56 UniRef50_D2R473 Type II secretion system protein E n=6 Tax=Planc... 221 9e-56 UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospor... 221 9e-56 UniRef50_C8WGD9 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 221 1e-55 UniRef50_C6XAP3 Type II secretion system protein E n=1 Tax=Methy... 221 1e-55 UniRef50_Q02IY3 Probable glucosyl transferase n=6 Tax=Proteobact... 220 1e-55 UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacter... 220 1e-55 UniRef50_A3XKW0 Glycosyltransferase related protein n=1 Tax=Leeu... 220 1e-55 UniRef50_C1XWL4 Type II secretory pathway, ATPase PulE/Tfp pilus... 220 2e-55 UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=1... 220 2e-55 UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 220 2e-55 UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaprot... 219 2e-55 UniRef50_Q1J1R8 Tfp pilus assembly pathway, ATPase PilB n=7 Tax=... 219 2e-55 UniRef50_D1B8K4 Glycosyl transferase family 2 n=1 Tax=Thermanaer... 219 3e-55 UniRef50_D0LLW4 General secretory pathway protein E n=2 Tax=Nann... 219 3e-55 UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain prot... 219 3e-55 UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum bor... 219 4e-55 UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales Re... 218 5e-55 UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli Re... 218 5e-55 UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacte... 218 5e-55 UniRef50_D2MLP0 Glycosyltransferase, group 2 family protein n=1 ... 218 6e-55 UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus Re... 217 9e-55 UniRef50_A3DDQ8 Type II secretion system protein E n=3 Tax=Clost... 217 1e-54 UniRef50_Q15ZI3 Type II secretion system protein E n=119 Tax=Pro... 217 1e-54 UniRef50_B2IIL4 Glycosyl transferase family 2 n=2 Tax=Beijerinck... 217 1e-54 UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridi... 217 1e-54 UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma ... 217 1e-54 UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba ... 217 2e-54 UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomic... 216 3e-54 UniRef50_D2QZA7 Type II secretion system protein E n=1 Tax=Pirel... 216 3e-54 UniRef50_Q39PZ3 Response regulator receiver domain protein (CheY... 216 3e-54 UniRef50_Q1Q644 Strongly similar to general secretion pathway pr... 215 6e-54 UniRef50_Q886Q3 Glycosyl transferase, group 2 family protein n=2... 214 8e-54 UniRef50_Q7UE44 General secretion pathway protein E n=1 Tax=Rhod... 214 9e-54 UniRef50_D2QZD9 Type II secretion system protein E n=4 Tax=Planc... 214 1e-53 UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteo... 214 1e-53 UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus... 214 1e-53 UniRef50_B9XRT3 Type II secretion system protein E n=2 Tax=Verru... 214 1e-53 UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnop... 214 1e-53 UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholde... 213 3e-53 UniRef50_C0QER1 PilB n=10 Tax=Proteobacteria RepID=C0QER1_DESAH 212 3e-53 UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2... 212 3e-53 UniRef50_D1RA94 Putative uncharacterized protein n=1 Tax=Parachl... 212 4e-53 UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC... 212 5e-53 UniRef50_Q1AWU6 Type II secretion system protein E n=1 Tax=Rubro... 212 5e-53 UniRef50_C8NRH6 Group 2 glycosyl transferase n=5 Tax=Corynebacte... 212 6e-53 UniRef50_C8W724 Glycosyl transferase family 2 n=4 Tax=Coriobacte... 211 6e-53 UniRef50_A4J3A4 Type II secretion system protein E n=1 Tax=Desul... 211 6e-53 UniRef50_Q47AJ5 Type II secretion system protein E:General secre... 211 7e-53 UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochl... 211 7e-53 UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentispha... 211 1e-52 UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacte... 210 1e-52 UniRef50_Q1Q109 Strongly similar to general secretory system typ... 210 1e-52 UniRef50_C9RLV9 Type II secretion system protein E n=1 Tax=Fibro... 210 2e-52 UniRef50_A1WFR3 Type II secretion system protein E (GspE) n=8 Ta... 210 2e-52 UniRef50_Q0A8B9 Type II secretion system protein E n=1 Tax=Alkal... 209 2e-52 UniRef50_B0S8Q3 Type II secretory pathway ATPase, protein E n=7 ... 209 3e-52 UniRef50_A3ZX01 General secretion pathway protein E n=3 Tax=Plan... 209 3e-52 UniRef50_A6NRQ7 Putative uncharacterized protein n=4 Tax=Bacteri... 209 3e-52 UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobial... 209 4e-52 UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2... 209 4e-52 UniRef50_A5G0G3 Glycosyl transferase, family 2 n=1 Tax=Acidiphil... 209 4e-52 UniRef50_A8ZYV7 Type II secretion system protein E n=11 Tax=Delt... 209 4e-52 UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1... 209 4e-52 UniRef50_A6W755 Type II secretion system protein E n=3 Tax=Actin... 209 4e-52 UniRef50_UPI0001B50A66 glycosyl transferase family protein n=1 T... 208 6e-52 UniRef50_A9BY08 General secretory pathway protein E n=2 Tax=Prot... 208 8e-52 UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli s... 207 8e-52 UniRef50_B8F8X7 Response regulator receiver protein n=3 Tax=Prot... 207 9e-52 UniRef50_D0KDT2 Glycosyl transferase family 2 n=3 Tax=cellular o... 207 1e-51 UniRef50_Q3SKS0 Pilus assembly pathway ATPase PilB n=3 Tax=Prote... 207 1e-51 UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum... 207 1e-51 UniRef50_B8DWB6 Predicted glycosyltransferase n=10 Tax=Bifidobac... 207 1e-51 UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolob... 207 2e-51 UniRef50_C7RLS9 Type II secretion system protein E n=3 Tax=Betap... 207 2e-51 Sequences not found previously or not previously below threshold: UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA... 219 4e-55 >UniRef50_P0AFA6 Bacteriophage N4 adsorption protein B n=53 Tax=Proteobacteria RepID=NFRB_ECO57 Length = 745 Score = 629 bits (1622), Expect = e-178, Method: Composition-based stats. Identities = 745/745 (100%), Positives = 745/745 (100%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY Sbjct: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV Sbjct: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR Sbjct: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT Sbjct: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF Sbjct: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG Sbjct: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ Sbjct: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV Sbjct: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ Sbjct: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK Sbjct: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ Sbjct: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS Sbjct: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 Query: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 MQSLLLKAGLNTEQVAQLESENEGE Sbjct: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 >UniRef50_B7LP59 Bacteriophage N4 receptor, inner membrane subunit n=7 Tax=Bacteria RepID=B7LP59_ESCF3 Length = 750 Score = 611 bits (1576), Expect = e-173, Method: Composition-based stats. Identities = 631/744 (84%), Positives = 687/744 (92%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 ++WLLD+F+TWLYGLK IAI LA++M ISGLDD FIDVVYW+RR+KR LSVYRRYPRM+Y Sbjct: 7 VEWLLDLFSTWLYGLKFIAIALAIMMLISGLDDLFIDVVYWLRRVKRSLSVYRRYPRMNY 66 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV Sbjct: 67 RELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 126 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSA FAFAGFILHDAEDVISPMELR Sbjct: 127 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSAKFAFAGFILHDAEDVISPMELR 186 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 LFNYLV+RKDLIQIPVYPFER+WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT Sbjct: 187 LFNYLVDRKDLIQIPVYPFERKWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 246 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 CFSRRA++ALLADGDGIAFDVQSLTEDYDIGFRLKEKGM+EIFVRFPVVD+ K E RK Sbjct: 247 CFSRRAISALLADGDGIAFDVQSLTEDYDIGFRLKEKGMSEIFVRFPVVDDGKTGEPRKL 306 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 Q RT NMICVREYFPDTF+TAVRQKSRWIIGIVFQGFKTHKWTS+L LNYFLWRDRKG Sbjct: 307 FQSKRTHNMICVREYFPDTFTTAVRQKSRWIIGIVFQGFKTHKWTSNLILNYFLWRDRKG 366 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 AISNF+SF+AMLV IQL+LL+ Y++ WP+AWHFLSIF+ SA TLLW+NF LMVNRIVQ Sbjct: 367 AISNFISFIAMLVFIQLMLLMLYQTFWPNAWHFLSIFTDSAAFTTLLWMNFALMVNRIVQ 426 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 RVIFVTGYYGLTQG+LSVLRL WGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV Sbjct: 427 RVIFVTGYYGLTQGILSVLRLCWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 486 Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 +G+ R+LRPLGQILLEN VITE QL+ AL NR++GLRLGGSMLMQGLI+A+QLAQALAEQ Sbjct: 487 SGENRALRPLGQILLENHVITETQLEQALTNRIQGLRLGGSMLMQGLITAQQLAQALAEQ 546 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 NGV WES+DAWQIP LI ++PASVALHYAVLPLR+E+D L+VGSEDGIDPVSLAAL+RK Sbjct: 547 NGVGWESVDAWQIPRYLIEQIPASVALHYAVLPLRIEDDVLVVGSEDGIDPVSLAALSRK 606 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 GR+VRYVIVLRGQ+VTGLRHWYARRRG D R +L AV +WLT QQ EIW+Q+V HQ Sbjct: 607 TGRQVRYVIVLRGQVVTGLRHWYARRRGRDARELLEQAVLRRWLTPQQQTEIWQQFVQHQ 666 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 FLFAE+LTTLGHINRSAIN LLLRHERS PLG FLV EGVISQETLDRVL+IQ+ LQVS Sbjct: 667 FLFAEVLTTLGHINRSAINALLLRHERSDRPLGAFLVAEGVISQETLDRVLSIQQNLQVS 726 Query: 721 MQSLLLKAGLNTEQVAQLESENEG 744 MQSLL AGL T Q+A+LE+++EG Sbjct: 727 MQSLLQAAGLTTMQIAELETDHEG 750 >UniRef50_B3GN83 Bacteriophage N4 adsorption NfrB-like protein n=1 Tax=Zymomonas mobilis subsp. mobilis RepID=B3GN83_ZYMMO Length = 729 Score = 473 bits (1217), Expect = e-131, Method: Composition-based stats. Identities = 344/714 (48%), Positives = 455/714 (63%), Gaps = 9/714 (1%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 + IAI +A+++ + G+DD FID +W+R I R+ +Y YP ++L+ +EKPLAIM Sbjct: 20 FRYIAIFVAILVTLFGIDDIFIDSCFWIRSIYRRFFIYSHYPHADEKQLFSKNEKPLAIM 79 Query: 75 VPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 VPAW E GV+ NMA LAA TLDYENYHIFVGTYPNDP+TQ DVD V +++PNVHK+VCAR Sbjct: 80 VPAWREVGVVANMARLAAETLDYENYHIFVGTYPNDPETQNDVDAVVSQYPNVHKIVCAR 139 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQI 194 PGPTSKADCLNNV+DAI FE +A FAGFILHDAEDVISP+ELRLFNYLV RKD+IQI Sbjct: 140 PGPTSKADCLNNVIDAIFHFEEAAAIEFAGFILHDAEDVISPLELRLFNYLVARKDMIQI 199 Query: 195 PVYPFEREW-THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 PVYPF + FT Y+DEFSE HGKDV VREAL GQVPSAGVGTCFSRRA+T LL + Sbjct: 200 PVYPFISDRFGDFTRNHYVDEFSEHHGKDVVVREALTGQVPSAGVGTCFSRRAITLLLKE 259 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE-QRKFLQHARTSNMICV 312 DG FD SLTEDYDI FRL +GM+ IF R+PV D ++K R + +ICV Sbjct: 260 SDGFPFDTTSLTEDYDISFRLYREGMSCIFARYPVTDPQYAFPIKQKIGMDRRYTQVICV 319 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 RE+FPD F AVRQKSRWI GIVFQG + W +NYFLWRDR+G I+N V FLA + Sbjct: 320 REHFPDHFKYAVRQKSRWITGIVFQGTRNLGWEHRAIMNYFLWRDRRGIITNIVGFLANI 379 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 ++ + LL +L W F+S+ S +A L LLW+N +++NR QR FVT YYG+ Sbjct: 380 LLFFVALLWIISALNLKGWSFMSVLSDNALLSVLLWVNGFILLNRAAQRCFFVTKYYGIK 439 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQ 492 QGL S R+ WGN++N A RA QV+ G+ +R+AWDKTTHDF + P+G Sbjct: 440 QGLTSPFRMVWGNIVNSFACIRAFWQVITIGNIKRMAWDKTTHDF--PSIPVSRREPIGL 497 Query: 493 ILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 ++ + L+ L+ + RLG +L++GLI++EQLA+ALA Q + S + + Sbjct: 498 WMVAQNFLKNSDLEQVLQAPRQH-RLGQELLLRGLINSEQLAKALAHQASLKAVSFNIFY 556 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLR 612 + S LIA P +A YAVLP + L + E + P+SL ++R +G V +I + Sbjct: 557 LDSKLIAAFPRYLACRYAVLPFSQKGKALQLICEHALSPISLGVISRHIGLNVECLIAPQ 616 Query: 613 GQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGH 672 G++ GLR+WY + + + + L + E + F +IL T GH Sbjct: 617 GRVTLGLRYWYPGQGNQPST----DRIIKELLKDPNNIEKQDTVCIYLAQFGDILQTTGH 672 Query: 673 INRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLL 726 I +L+ + + LG++LV +ISQE L+ L Q + + + +LL Sbjct: 673 IPEPIFAQVLIDFDPDKMKLGEYLVKRKLISQEILEECLKEQNKQEEMAEKVLL 726 >UniRef50_B2UJM9 General secretory system II protein E domain protein n=12 Tax=cellular organisms RepID=B2UJM9_RALPJ Length = 703 Score = 430 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 229/624 (36%), Positives = 335/624 (53%), Gaps = 14/624 (2%) Query: 11 WLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP 70 ++ L + + +++ IS DDFF+D YWVR + R +S + L +E+ Sbjct: 9 YVALLNTLTVATTLVILISTADDFFLDAFYWVRELWLWPQRGRTPVTISAQALRDREEQW 68 Query: 71 LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVHK 129 LAIMVPAW E VI M E T++Y + IF G Y ND +T +V+ + R+P V + Sbjct: 69 LAIMVPAWKEYDVIAKMVENTLATMEYTRFIIFAGAYRNDAETTTEVERMVRRYPGRVVR 128 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERK 189 GPT KADCLN ++ I ++E FAG I+HD EDVI P+EL+ FNY + + Sbjct: 129 AAVTHDGPTCKADCLNTIIQTIIRYEAGHGIRFAGVIMHDCEDVIHPLELKYFNYFISDQ 188 Query: 190 DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTA 249 DL+Q+PV ER+W + + TY+D+FSE H KD+ R+AL G VP AGV C+SRRA+ A Sbjct: 189 DLVQLPVLSLERKWYEWVAGTYMDDFSETHQKDLVARQALTGTVPGAGVALCYSRRAIEA 248 Query: 250 LLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVD---------EAKEREQRKF 300 ++ F+ +LTEDYD FRL+E GM E FV FPV + + Sbjct: 249 VMKVRGDAPFNTSTLTEDYDFSFRLRELGMREAFVHFPVCENTAPVADGTGRQPTHWWTN 308 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 + ++ REYFP TF TA RQ++RW++GI FQG+ W +L Y +RDRKG Sbjct: 309 RRREARPQLLATREYFPSTFRTAYRQRARWVLGIAFQGWLQMGWKGNLITKYMFFRDRKG 368 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 ++ S LA + + LL+ + G+ W+ LL +N L++NR+ Q Sbjct: 369 VLTALFSILAYALSLNYLLVAVLLDKGWVTASEGAFVVGTIWMQDLLAINATLLINRLAQ 428 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHG-DPRRVAWDKTTHDFPS 479 RV FV G Q +L + RL N INF + RA K L + + +AWDKT H + S Sbjct: 429 RVYFVGRLNGPLQAVLCLPRLVVNNFINFFSVCRAWKIFLIYCFTGKPIAWDKTQHTYLS 488 Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALA 538 R+ LG+ LL+ +VIT+EQLD AL + G RLG ++ QGL++ + LA ALA Sbjct: 489 NDALGRTRCKLGETLLKWEVITQEQLDAALAIQQQTGRRLGQVLVQQGLVTPDTLADALA 548 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL-ENDELIVGSEDGIDPVSLAAL 597 EQ + S+ + +L +P +A+ + V+P + E+ L + + D +L L Sbjct: 549 EQADLPRVSLTNV-VLGALADCLPRDLAVRHHVVPFSIGEDGSLNIAVSELPDGEALQEL 607 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRH 621 R GRKV + ++ L Sbjct: 608 ARAAGRKVACFMACDHEMSAELAQ 631 >UniRef50_B4SJA4 General secretory system II protein E domain protein n=15 Tax=Proteobacteria RepID=B4SJA4_STRM5 Length = 715 Score = 410 bits (1054), Expect = e-112, Method: Composition-based stats. Identities = 263/708 (37%), Positives = 377/708 (53%), Gaps = 38/708 (5%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRR--YPRMSYRELYKPDEKPLAIMVPAWNETGV 83 + IS LDD FIDV YWVR R L++ RR Y ++ +L + E+PLAIMVPAW E V Sbjct: 28 ILISSLDDLFIDVWYWVRESWRALTIKRRDAYKPLTQEDLLQRPEQPLAIMVPAWMEYDV 87 Query: 84 IGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADC 143 I M E LDY Y +FVGTYPND T +V+ + R+ + +V GPTSKADC Sbjct: 88 IAQMVENMINVLDYREYVVFVGTYPNDQQTIDEVERMRRRYKRLRRVEVPHDGPTSKADC 147 Query: 144 LNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREW 203 LN ++ AI ++E+ + FAG ILHD+EDV+ PMELR +NYL+ RKD+IQ+PV +REW Sbjct: 148 LNWLILAIFEYEKRHDIEFAGVILHDSEDVLHPMELRFYNYLLPRKDMIQLPVTSLDREW 207 Query: 204 THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQS 263 + Y+DEF+E H KD+ VRE+++G VPSAGVGTCFSRRA+ AL A D F+ S Sbjct: 208 YELVAGVYMDEFAEWHAKDLVVRESVSGMVPSAGVGTCFSRRALLALSAQTDNQPFNTDS 267 Query: 264 LTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNM--ICVREYFPDTFS 321 LTEDYD+G RL GM IF RFPV + + + +CVREYFPD F Sbjct: 268 LTEDYDVGARLAAMGMQSIFARFPVQFRVRRPSWFGWGPVRERTQQMALCVREYFPDNFR 327 Query: 322 TAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLL 381 + RQK+RW++GI Q +++ W SL Y L RDRKG I++FVS +A ++ +QLLL Sbjct: 328 ASYRQKARWVLGIGLQSWESLGWRGSLATKYLLARDRKGIITSFVSIIAYVIFLQLLLFW 387 Query: 382 AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRL 441 + F S+F W M + L + R+VQR FV YG L+S+ R+ Sbjct: 388 LLKMTGVWTMQFPSVFQPGTWQMNVALLTTAALATRVVQRFYFVNRLYGWEHALMSIPRM 447 Query: 442 FWGNLINFMANWRALKQVLQHGD-PRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVI 500 GN+INFMA RA K L + +R+ WDKT HDFP ++ + LG++L Q + Sbjct: 448 VVGNMINFMATARAWKVFLAYLLFGKRMVWDKTMHDFPDAAQLVQTRKQLGELLGTWQAV 507 Query: 501 TEEQLDTALRNRVEG--LRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLI 558 E+L AL + G LG +L QG + E LA+A+A Q + ID + Sbjct: 508 EPERLQQALDQQHAGRQQPLGRILLTQGWLDDETLAEAIAFQGDLPRAVID-VDYLRACQ 566 Query: 559 AEMPASVALHYAVLPL-RLENDELIVGSEDGIDPVSLAALTRKVGRK-VRYVIVLRGQIV 616 + A + + +LPL + L + + +LA L ++ + + I +I Sbjct: 567 FPVSADACVQWRMLPLPPRQEGTLRLAVASPLPEEALALLKQETRSEHIEQSIARESEIN 626 Query: 617 TGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRS 676 GLR R+ H L ++L + I+ + Sbjct: 627 AGLRLIGGDRQWHLDN---------------------------VPLLGDLLVEMRLIDHA 659 Query: 677 AINVLLLRHERSSL-PLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 + L ++ +G +LV +G+ ++E + + + QR ++QS Sbjct: 660 RFEIALDDYKPQRDGRIGDYLVKQGITTEEAVAQAMQEQRRRAATLQS 707 >UniRef50_A5P922 Bacteriophage N4 receptor, inner membrane subunit n=1 Tax=Erythrobacter sp. SD-21 RepID=A5P922_9SPHN Length = 698 Score = 399 bits (1024), Expect = e-109, Method: Composition-based stats. Identities = 239/718 (33%), Positives = 368/718 (51%), Gaps = 38/718 (5%) Query: 7 VFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 + +++ + A+ +A ++ IS LDD F+D V+W+ KR+ +S L + Sbjct: 1 MVESYIVAFECAALVVATLIAISSLDDLFVDSVFWIAMAKRRFLGKGEPRTVSPETLIER 60 Query: 67 DEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN 126 E P+AIM+PAW E VI +M E A TL Y NY IF+G+Y NDP+T +V+++ AR+ Sbjct: 61 PEAPIAIMLPAWQEADVIASMVENAIHTLVYRNYFIFIGSYANDPETILEVEKLAARYGR 120 Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV 186 V V GPT KADCLN+++ I + E+ + FAG +LHD+EDV+ P+EL LFNYL+ Sbjct: 121 VRHVRVPHYGPTCKADCLNHIVADILRLEKEVDIEFAGLVLHDSEDVLHPLELHLFNYLL 180 Query: 187 ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 +D+IQ+PV E+ +T F + TY+D+F+E H KD+ VR+ LA VPSAGVGTCFSRRA Sbjct: 181 PSRDMIQLPVVSLEQRFTDFVAGTYMDDFAESHAKDLVVRQMLAKSVPSAGVGTCFSRRA 240 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVD--EAKEREQRKFLQHA 304 + + G F+ Q+LTEDYD+G RL ++G+ +PV R + Sbjct: 241 IE--VMLEAGEPFNTQTLTEDYDVGSRLAKRGLNASIELYPVEFRSRQFGHFGRGPERVG 298 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISN 364 TS +CVRE+FP+TF + RQK+RWI+GI QG+ W S+ NYFL RDRK I+ Sbjct: 299 TTSKPLCVREHFPNTFRASYRQKARWILGIALQGWAQLGWDRSIVSNYFLCRDRKALITP 358 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF 424 ++ LA ++ L + A + IFS L N + R+VQRV F Sbjct: 359 TLAVLAYVLTAMYLGATIWSFASGGA--AIPIFSNHPIASYLFSFNLFALAARVVQRVYF 416 Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGD-PRRVAWDKTTHDFPSVTGD 483 V Y T L+V R+ + INF A+ RA++ + +AWDKT H FPS Sbjct: 417 VAKIYCWTHAFLAVPRMVVLSFINFAASVRAIRIFVGSKFSGNPIAWDKTNHRFPSDEAL 476 Query: 484 TRSLRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 + R LG+IL V++ L+ A L + EG LG ++ G+I + L +A++ QN Sbjct: 477 GKEKRRLGEILRGWDVVSSPMLEKALLYQKREGGMLGDLLVRDGVIDEDVLTEAISTQNQ 536 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN-DELIVGSEDGIDPVSLAALTRKV 601 + ++ + L + + +LP + + E ++ + + Sbjct: 537 LPRAELNLDMVCEHL-DLLDRATMTQLQILPFGISSKGEALLAVAKPL---ACEQTRLI- 591 Query: 602 GRKVRYVIVLRGQIVTGLRHW-YARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 ++ G R + R + A L N + + VP Sbjct: 592 ----------WSRMSKGYREHIVPQSRIKEILAALTNIPERNF------------PVPSV 629 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSL-PLGKFLVTEGVISQETLDRVLTIQREL 717 E+L + + + + LL + + +G+FLV +G I+Q TLD+ L ++ L Sbjct: 630 PRVHELLLSQKQLKKKELQNLLKDYNVARHGTIGQFLVAKGTITQATLDKTLELRTSL 687 >UniRef50_C6N7W0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N7W0_9GAMM Length = 501 Score = 333 bits (853), Expect = 2e-89, Method: Composition-based stats. Identities = 184/490 (37%), Positives = 271/490 (55%), Gaps = 7/490 (1%) Query: 13 YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLA 72 + + + L+ + ISG+DD F D YW+R + R L R Y ++Y +L + +E+ +A Sbjct: 11 FIMWYFLVALSCLFIISGIDDLFFDGYYWIRYVFR-LWKTRGYKPLTYEQLAEKEEQMIA 69 Query: 73 IMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 +++P W+E GVIG M + ++DY NY++FVG YPNDP+T +V EV NV V+ Sbjct: 70 VLIPCWHEAGVIGTMLKHNCYSIDYSNYYLFVGVYPNDPETVNEVQEVANLIKNVRCVIG 129 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLI 192 PGPT+KA LN + + + FE+ N +F+ F+ HD+ED+I PM +L+NYL+ RK++I Sbjct: 130 TTPGPTNKAANLNGIYNYVKAFEKELNRSFSIFVFHDSEDIIHPMSFKLYNYLMPRKEMI 189 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 QIPV+P E + +FT Y DEFSE H KD+ VRE++ G VPSAGVGT FSR A+ L Sbjct: 190 QIPVFPLEINYWNFTHWLYADEFSENHTKDIIVRESIHGHVPSAGVGTAFSRHALKLLED 249 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ--RKFLQHARTSNMI 310 F SLTEDY ++ KG+ +IFV +V RK T I Sbjct: 250 PTTRTPFSTDSLTEDYRTSLAIRIKGLKQIFVTETIVRMKWRPRGFFRKGYVQKPTREYI 309 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 R FP ++ AVRQK+RWIIGIVFQ ++ +W + + L DRK I++F++ Sbjct: 310 ATRALFPLEYTKAVRQKARWIIGIVFQEWQHTQWPKEWIIRFTLAHDRKSFITHFINGFG 369 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 V + L+ P+ F+ W+ L+ +M+ R++QR+I + YG Sbjct: 370 YFVFLFWLVYSLCTYTNPEYPSLQEQFNLHPWVWWLIVTVTLMMIERMIQRMIAIRRVYG 429 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQ----HGDPRRVAWDKTTHDFPSVTGDTRS 486 LS+ R F+GNL+N A RA ++ +WDKT H FP T Sbjct: 430 WIPSFLSIPRTFYGNLLNLHALIRAYHVYYTTPKSQATSKQPSWDKTDHHFPGSHILTPY 489 Query: 487 LRPLGQILLE 496 + +G +LLE Sbjct: 490 RKKIGDLLLE 499 >UniRef50_Q01UW7 Bacteriophage N4 receptor, outer membrane protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01UW7_SOLUE Length = 466 Score = 317 bits (813), Expect = 8e-85, Method: Composition-based stats. Identities = 165/461 (35%), Positives = 225/461 (48%), Gaps = 26/461 (5%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 + +AV + ISGLDD FI +V + + R+P S +L E+P+AI VP W+ Sbjct: 1 MPVAVWILISGLDDLFITMVGFA-------TSRVRFPWPSSGDLKSAAEQPIAIFVPLWH 53 Query: 80 ETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 E VIG M E + Y NYH+F G YPND T R V+ A P +H +C GPTS Sbjct: 54 EHRVIGRMLEHNLAAVRYGNYHVFAGVYPNDTPTLRAVELQAAVHPKIHTAICPHDGPTS 113 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPF 199 K DCLN + + +E F +LHDAED+I P LRL N+ +++Q+PV P Sbjct: 114 KGDCLNWIYQHMRAWEARHGTRFRVVVLHDAEDLIDPESLRLINWFSRDYEMVQVPVLPL 173 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 +T Y DEF+E KD+PVR+ L G +PS GVGT F R A+ L +G F Sbjct: 174 ATAVKEWTHGLYCDEFAEYQRKDIPVRQQLGGFLPSNGVGTGFGRDALERLADGRNGRPF 233 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDT 319 D LTEDY+ G+ L E G +IF+ R + RE+FP Sbjct: 234 DPACLTEDYETGYLLHELGCRQIFLP----------------VRFRENGPTATREFFPRG 277 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLL 379 A+ Q++RW+ GI Q ++ H W L+ Y+ WRDRKG I N +S A L+ + Sbjct: 278 ARAAISQRTRWVTGIALQSWERHGWRVPLSQLYWFWRDRKGLIGNLLSPAANLLFLYGAG 337 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 A+ + P AWH S WL L + + R YG L Sbjct: 338 SYAFSTGHPSAWHLGSHIP--PWLAGSCRLTLAIAALQTGVRARSAALIYGWKFAAGVPL 395 Query: 440 RLFWGNLINFMANWRALKQV-LQHGDPRRVAWDKTTHDFPS 479 R+ WGNL+NF A AL + +AW KT H +P+ Sbjct: 396 RMVWGNLVNFAATAMALWEFGNSRRRGGGLAWRKTDHMYPT 436 >UniRef50_C8WHH7 General secretory system II protein E domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WHH7_EGGLE Length = 711 Score = 313 bits (803), Expect = 1e-83, Method: Composition-based stats. Identities = 182/706 (25%), Positives = 271/706 (38%), Gaps = 90/706 (12%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAI 73 + I +A+ + G DD DV R R+ + + K LA+ Sbjct: 5 VVYWIGFFVALAFIVFGADDVLWDVFALFRGT--------GKKRVKLSLINEKPPKMLAV 56 Query: 74 MVPAWNETGVIGNMAELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPN-VHKV 130 ++ AW+E V+G + + + Y Y +F+G YPND T + R V V Sbjct: 57 VIAAWHEDAVLGEVVDNLVASAQYPRSLYRVFLGVYPNDAATVAVARALEVRHGGTVVCV 116 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKD 190 V PGPTSKA +N+ + AI ++E + FA +HDAEDV+ P E ++ NYL++ D Sbjct: 117 VGDDPGPTSKAANINHTVRAIREYEAERDVRFASVTIHDAEDVVHPNEFKMTNYLIDDYD 176 Query: 191 LIQIPVYPFER------EWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 +Q PV+P +R + TS TY DEF+E H + + +R+ L VPSAG G R Sbjct: 177 ALQFPVFPLQRMPRLRLFFKTLTSSTYADEFAEHHFRTMVMRDELG-FVPSAGTGFAIGR 235 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 R + A + SLTEDY + L+ +G +V V R + Sbjct: 236 RVLDAFRDEDLL---PRNSLTEDYKLSLTLRMRGFRVHYVLEKV--------PRVDARGR 284 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSS----LTLNYFLWRDRKG 360 + I R FP TF AVRQK+RW+ GI Q FL++ K Sbjct: 285 TVWDYIATRSLFPSTFKAAVRQKARWVYGITMQSASMADVFGKSELTFAERTFLYKGLKA 344 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 +NFV V+ L+ L + + + +MV R V Sbjct: 345 KFANFVLLPGYAVLAYFLVQTFAPQLELPVM-----YPLHSPSWWMCVFLLFMMVERQVL 399 Query: 421 RVIFVTGYYGLTQGLLSV-------LRLFWGNLINFMANWRALKQVLQHGDPR------- 466 R + YG S+ LRL WGNLIN A +RA +Q + + R Sbjct: 400 RGRALANVYGWKTMAFSILLPPLFPLRLLWGNLINMCATFRAWRQKIAYVLLRGREAKAA 459 Query: 467 -------------------------------------RVAWDKTTHDFPSVTGDTRSLRP 489 AW+KT H+F + R R Sbjct: 460 AAPVVEHRGNAAEEEGERKPATDGDEAQTSNATSAQEGPAWNKTDHEFLPASVLERYRRL 519 Query: 490 LGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI 548 LG LLE + L+ A+ R G+RLG +L QGL+ L QA A Q + Sbjct: 520 LGDALLERGFVEPGHLEDAVGSARARGVRLGQELLRQGLVEERHLTQAYALQQQSMYVRA 579 Query: 549 DAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYV 608 + L+ MP + A +A LPL IV +D + L +G ++ Sbjct: 580 QPDLVLLELMDRMPFAAADRFAALPLVESEKGWIVAVDDDLSCAERDELAFLLGEPTFFL 639 Query: 609 IVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR 654 ++ A + A + + + Sbjct: 640 FSSTADLLEAFEGALAFDNAAEAPQPAGAATLLEETSVELPQAGMA 685 >UniRef50_Q136N6 Bacteriophage N4 adsorption protein B n=1 Tax=Rhodopseudomonas palustris BisB5 RepID=Q136N6_RHOPS Length = 497 Score = 309 bits (791), Expect = 3e-82, Method: Composition-based stats. Identities = 177/468 (37%), Positives = 251/468 (53%), Gaps = 6/468 (1%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAI 73 ++++ + AV++ +S +DD +D++YW RR+ R + + + P+A+ Sbjct: 14 VVEIMLVVTAVLVALSSIDDLVVDLLYWGRRLTRPNAF---DATADLATMEAIPQAPIAV 70 Query: 74 MVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 ++PAW E VI +M T YENYH+FVG Y ND T +V A+ VH VV Sbjct: 71 IIPAWQEHEVIFSMLAANQATTKYENYHLFVGAYQNDAATLTEVRRAEAQSNRVHLVVVP 130 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQ 193 R GPTSKADCLN V + + FE++ FAG +LHDAED+I P EL LFN++ D IQ Sbjct: 131 RDGPTSKADCLNVVANGVFAFEQAKGIQFAGLVLHDAEDLIHPYELVLFNFMAHDNDFIQ 190 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 +PV+ F+R Y+DEF+E H KD+PVR ++G VP AGV F R +AD Sbjct: 191 LPVFSFKRPLRELVGGVYMDEFAESHLKDIPVRRMISGLVPCAGVAAFFGRDIALRTMAD 250 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 G F SLTEDYD RL G FV P + I R Sbjct: 251 NAGSLFRSDSLTEDYDFALRLGLLGARVNFVIAPASYTIDISSSTDLPEIVGRKLPIATR 310 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 E+FP++F A RQ++RW++GIVFQG ++ W + + Y L RDRK ++ + LA LV Sbjct: 311 EFFPNSFVAAQRQRARWLMGIVFQGTRSFGWRGTTGIKYALLRDRKSILTAPLIMLAYLV 370 Query: 374 MIQLL-LLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 + L+ + L + PD + + + LLWLNF ++ R++ R F YGL Sbjct: 371 LFGLVSVNLYFRWYLPDEVNQFPLLQE-PLVQQLLWLNFAFLIWRLLHRFYFTNRIYGLR 429 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHG-DPRRVAWDKTTHDFPS 479 GL+S+ RL GN +NF A RA + L H R+ WDKT H +P+ Sbjct: 430 HGLMSIPRLPLGNFLNFFAVARACRLYLSHSLLGTRLVWDKTEHQYPT 477 >UniRef50_Q1N778 Bacteriophage N4 adsorption protein B n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1N778_9SPHN Length = 470 Score = 297 bits (761), Expect = 8e-79, Method: Composition-based stats. Identities = 175/484 (36%), Positives = 236/484 (48%), Gaps = 25/484 (5%) Query: 5 LDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELY 64 A + I + AV + I GLDD ID+ Y+ R+ R + +Y R+ RM+ EL Sbjct: 8 AGTAALLVAVHHEILLFAAVGLAIGGLDDLLIDIFYFGRKAWRDIVIYARHQRMTGPELP 67 Query: 65 KPDEK-PLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 +A+ VPAW E+ VI M A + Y IFVG YPND T V V Sbjct: 68 HSRRPGKIAVFVPAWQESNVIAAMLNHARDSWGEARYRIFVGVYPNDDATIDAVANVACD 127 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN 183 + + R GPT+KADCLN + A+ E +F + ILHDAEDV+ E+RLF+ Sbjct: 128 ATWLTLCINDRAGPTTKADCLNLLWRAMRAEEEQGDFRYKAIILHDAEDVVHADEIRLFD 187 Query: 184 YLVERKDLIQIPVYPFEREWTHF---TSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 ++V+R DL+Q+PV P + + Y DEF+E HGK + VREAL VPSAGV Sbjct: 188 FMVDRFDLVQLPVLPLRGRGGWWRRAIADHYGDEFAESHGKLLSVREALGASVPSAGVAC 247 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F R + AL +D FD SLTEDY+ G R+++ G FVR Sbjct: 248 AFERDMLAALASDEATGPFDPGSLTEDYEAGLRIRDMGGRSAFVR--------------- 292 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 N+I RE+FPD+ AVRQK+RW IGI G+ W + RDR+ Sbjct: 293 -MRDAYGNIIATREFFPDSIDAAVRQKARWTIGIALAGWDRLGWRGGPAEFWMRLRDRRA 351 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 ++ V F A L ++ L + P LS L LLW N LM+ R+ Sbjct: 352 VLAALVLFAAYLTLVLWATLALLALVIPFPARPLS-----PALTGLLWFNLFLMLWRMAM 406 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 R +FV YGL GL +V R N I +A RA+ L+ + + WDKT H FP + Sbjct: 407 RFLFVARAYGLRAGLGAVPRTLIANYIGILAARRAIFLYLRSLAGQPLRWDKTQHRFPDL 466 Query: 481 TGDT 484 D Sbjct: 467 KTDP 470 >UniRef50_B8JA61 Bacteriophage N4 adsorption protein B n=2 Tax=Anaeromyxobacter RepID=B8JA61_ANAD2 Length = 502 Score = 295 bits (754), Expect = 7e-78, Method: Composition-based stats. Identities = 165/468 (35%), Positives = 243/468 (51%), Gaps = 14/468 (2%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVP 76 V+A L + ++ LD+ FIDV Y RR+ R+ + +S L + + K AI++P Sbjct: 10 VMAGPLGGAILLNQLDELFIDVNYLARRLHRRSATA-----VSAALLRRVEPKRTAILLP 64 Query: 77 AWNETGVIGNMAELAATTLDYENYH--IFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 AW E VI M EL + +D+ F GTY NDP TQ VD AR V KVV Sbjct: 65 AWREEDVIERMLELNVSRIDFPRDRYVFFCGTYQNDPATQARVDRAAARGWPVRKVVVPH 124 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQI 194 GPTSKADCLN + + ER F ++HDAEDVI P+ LRL++ LV + + +Q Sbjct: 125 AGPTSKADCLNWIYQGVVLHERERGTRFDILLMHDAEDVIHPLALRLYSLLVPKHEFVQT 184 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 PV+ + + + TYIDEF+E H K++PVR+A+ G +PSAGVG+ F RRA + Sbjct: 185 PVFSLPLDASQVVAGTYIDEFAEHHLKELPVRQAIGGLIPSAGVGSAFERRAFEQIALAH 244 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 FD SLTEDY+IG R + F + + + + E + I RE Sbjct: 245 AQQPFDPASLTEDYEIGLRFRLARRRTHFACYRIAADPDDPEAPA------HDDPIATRE 298 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 YFPD F +VRQ+SRWI+GI Q +++ W + Y LWRDRK ++N + L+ ++ Sbjct: 299 YFPDRFQASVRQRSRWILGISLQTWESAGWQGPAAVRYCLWRDRKAVLTNALLALSYALL 358 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 +++ + + +W I + LL +N + R+ ++ FV YG Sbjct: 359 AYVVVRVWTAGMTGASWSPARIVPAGGLIQALLLVNLAGFLLRVGVKMGFVGRLYGARLA 418 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQH-GDPRRVAWDKTTHDFPSVT 481 L + RL N+I+ A RA+ ++H + W KT+H FPS Sbjct: 419 TLCLPRLLVANVISLAATARAVVTYVRHLVTGEPLRWVKTSHAFPSAE 466 >UniRef50_Q1IXI4 Glycosyltransferase, NfrB-like protein n=2 Tax=Deinococci RepID=Q1IXI4_DEIGD Length = 670 Score = 281 bits (719), Expect = 7e-74, Method: Composition-based stats. Identities = 169/602 (28%), Positives = 242/602 (40%), Gaps = 60/602 (9%) Query: 57 RMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHI--FVGTYPNDPDTQ 114 R+ +L + LA+M+ AW E GV+ M E + Y + FVG YPND T Sbjct: 46 RLRPADLQQDHPSHLAVMIGAWQEAGVVTPMIESTLRLMHYPASRVEFFVGVYPNDLATL 105 Query: 115 RDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVI 174 +V + RFPNVH VV RPGPTSK+ LN V AI E F +HDAEDVI Sbjct: 106 PEVQALAERFPNVHCVVNERPGPTSKSQNLNGVYAAIKAHEARTGKPFDVIAVHDAEDVI 165 Query: 175 SPMELRLFNYLVERKDLIQIPV---YPFEREWTHFTSMT------------YIDEFSELH 219 P +L++ L++R ++Q+PV +P R W Y DEF+E H Sbjct: 166 HPYTFQLYSTLLKRWKMVQLPVFALFPRGRAWGAGLRGLLRHLTGQIVTGSYADEFAEHH 225 Query: 220 GKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGM 279 + +P REAL +PSAG G R + L + DG +L EDY++ RL +G+ Sbjct: 226 LRHLPAREALGLFLPSAGTGFAMRREVMALL--EEDGQVLTEGALAEDYELALRLWRRGV 283 Query: 280 TEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGI-VFQG 338 F P+ R Q + + VREYFP A+RQK RW GI + Sbjct: 284 RVHFHVQPL--------PRLDTQGKLGRDYVAVREYFPTEVQAAIRQKGRWTYGITLQTP 335 Query: 339 FKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFS 398 + +L LW D+KG +N + L A F Sbjct: 336 HRLRGLRLNLRDRLTLWHDQKGKYTNLIHLLGYP----------LSLTLLLAPLFGFHLQ 385 Query: 399 GSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLS-------VLRLFWGNLINFMA 451 ++ LL G+ R++ R V YGL Q L++ LR GN+IN +A Sbjct: 386 SNSLTRDLLLGVLGVTGWRMLMRAGAVGRIYGLRQALIATLCLPGLPLRWLAGNVINTLA 445 Query: 452 NWRALKQVL----QHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDT 507 RA + L + R LG L + E +L Sbjct: 446 TLRAWRLFLFPERGQKRGTARWDKTERKAYVPDEVLQAVRRRLGDQWLFTGALRERELAR 505 Query: 508 ALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVA 566 LR R RLG + Q L+ Q+ ++LA+ G+ + ++ + + A A Sbjct: 506 LLRVQRRAAARLGQLAVQQALVDEAQVRRSLAQTQGLMYLNLTPEMLDH---RFLSAEQA 562 Query: 567 LHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG-------RKVRYVIVLRGQIVTGL 619 V L D L+V S + P AL R++ + R + Sbjct: 563 QRLDVAILGKRGDRLLVASPHAVSPERCEALLRELRFCLRAPELPITVYATSRQSLRAAY 622 Query: 620 RH 621 R Sbjct: 623 RR 624 >UniRef50_Q2G4Z3 Bacteriophage N4 adsorption protein B n=2 Tax=Sphingomonadales RepID=Q2G4Z3_NOVAD Length = 488 Score = 281 bits (718), Expect = 8e-74, Method: Composition-based stats. Identities = 147/481 (30%), Positives = 222/481 (46%), Gaps = 29/481 (6%) Query: 3 WLLDVFATWLYGLKV-IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 WL D WL L+ + + AV I D+ +D ++ + ++L+ +++ Sbjct: 4 WLADSAYQWLAVLEHELLLFAAVWFAIGAADELVMDGIW----LWQRLTGAGPTGQLAGN 59 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 A+ VPAW E+ VIG M E+ I+VG Y ND +T + + Sbjct: 60 G-RDKLSSMAAVFVPAWRESAVIGPMVAHCLAVWPQEDLRIYVGCYRNDQETLNALT-IV 117 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL 181 + P V VV R GPT+KADCLN + A+ Q ER + +LHDAED++ P L L Sbjct: 118 SEDPRVRVVVHDRDGPTTKADCLNRLYLAMRQDERRSGQRIGFIVLHDAEDMVHPAALAL 177 Query: 182 FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 + ++ D +Q+PV P + + + + Y DEF+E H +D+ VR+ + +PSAGVG Sbjct: 178 MDRALDTVDFVQLPVRPEPQASSPWVAGHYCDEFAEAHARDMVVRDHIGAGLPSAGVGCA 237 Query: 242 FSRRAVTALLADGDGI-AFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 FSR A+ ++A G F LTEDY+ G + E G F+R Sbjct: 238 FSRAAIERIVAVRGGALPFAADCLTEDYEAGMLVAETGGRSRFIR--------------- 282 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 ++ RE+FPD + +VRQK+RW+ GI FQG+ W S + RDR+G Sbjct: 283 -VRDARGELVATREFFPDGLAASVRQKTRWVHGIAFQGWDRLGWNRSAGDLWMRLRDRRG 341 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 + V A L + ++ E F+ L LL N ++ R+V Sbjct: 342 PLVALVLLAAYLALPLWPIVRFGEMAG-----FVVPVPPGPVLKGLLAFNLCSLIWRLVV 396 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 R +F YG +G+ SV R GN+I MA RA ++ + WD T H V Sbjct: 397 RALFTGSEYGWIEGVRSVFRFPVGNIIAIMAARRAAVAYVRVLFGGALTWDHTLHCAHPV 456 Query: 481 T 481 Sbjct: 457 Q 457 >UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaproteobacteria RepID=A5FY21_ACICJ Length = 642 Score = 265 bits (677), Expect = 5e-69, Method: Composition-based stats. Identities = 86/475 (18%), Positives = 147/475 (30%), Gaps = 69/475 (14%) Query: 24 VIMFISGLDDFFIDVVYWVRRIKRKLSVYRRY----PRMSYRELYKPDEKPLAIMVPAWN 79 ++ I ++ R + + RR L D ++VP + Sbjct: 206 TLLGIMATVSALYASLFAFRGVLTIVGSGRRTDISVNASELAALKDGDLPVFTVLVPMYR 265 Query: 80 ETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT 138 E V+ + + + + D +T + A + P Sbjct: 266 EAEVLPILVDSIRRLDYPRAKLDVKLVLEAGDTETIEAAKALGAED-LFEIIRVPDSQPK 324 Query: 139 SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQI 194 +K N L ++ DAED P +LR L +Q Sbjct: 325 TKPKACNYALRFA---------RGEYTVIFDAEDSPEPDQLRKVVALFNASGPEVACVQA 375 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 + F R+ T M + E+S+ +P L +P G F + L Sbjct: 376 RLNYFNRDDNFLTRM-FTLEYSQWFDYLLPGLYRLNIPIPLGGTSNHFRTEVLHEL---- 430 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 A+D ++TED D+G RL + G V +EA Sbjct: 431 --GAWDPYNVTEDADLGIRLTQAGYRVAVVNSTTFEEANGV------------------- 469 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR----KGAISNFVSFLA 370 + + Q+SRWI G + W + L+R F+ F Sbjct: 470 -----LHSWINQRSRWIKGYMQ------TWLVHMRRPVELYRRLGPVGFLGFHMFIGFPP 518 Query: 371 MLVMI-QLLLLLAYESLWPDAWHFLSIFSGSAWLMTLL-WLNFGLMVNRIVQRVIFVTGY 428 M +I LL ++ S+ F G ++ L + M + + Sbjct: 519 MTALINPLLWIMFLVSVIVGRSAVAGFFPGPVLVLALFDLMVGNAMYVYFNIVAVAKRRW 578 Query: 429 YGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 YGL + +W +A ++AL Q++ W+KTTH S T + Sbjct: 579 YGLVP-WGLLAPAYWVLH--SVAAYKALLQLI----TNPHYWEKTTHGTSSRTQE 626 Score = 84.7 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 63/153 (41%), Gaps = 3/153 (1%) Query: 492 QILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDA 550 L++ +T Q D A+ + L + +++ + A+ L+ +G+ + A Sbjct: 21 DRLVQAGSVTPLQRDEAIHTAYAWRVSLPDVLSAMYRVTSLRWARELSAASGLPLVDLRA 80 Query: 551 WQIPSSLIAEMPASVALHYAVLPLRLE-NDELIVGSEDGIDPVSLAALT-RKVGRKVRYV 608 +L+AE + L + +P R + + +++ + + DP A + R G V +V Sbjct: 81 SPCDHTLLAEAEHDLYLRHLFIPWRRQPDGVVVIATLNPEDPAIRALMRERMPGCHVEFV 140 Query: 609 IVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQH 641 I + ++ ++ + R L+N Sbjct: 141 ITSKFDLIWAVQRIFDPELSEAAREALFNRNPE 173 >UniRef50_Q1IRV6 Type II secretion system protein E n=24 Tax=Bacteria RepID=Q1IRV6_ACIBL Length = 571 Score = 264 bits (676), Expect = 7e-69, Method: Composition-based stats. Identities = 53/270 (19%), Positives = 105/270 (38%), Gaps = 25/270 (9%) Query: 488 RPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG +L+ +VIT EQL+ ALR + G RLG +++ G +S + + L+ Q GV Sbjct: 3 QRLGDLLVREKVITAEQLEQALREQGSSGTRLGAALVKLGFLSDDDVTNFLSRQYGVPAI 62 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 +++ ++I S++ +P A Y +LPL L + D + ++ + G + Sbjct: 63 NLNYFEIDPSVVKLIPYDTAKRYQILPLSRVGASLTIAMVDPTNVFAMDDIKFMTGFNIE 122 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ----------WLTEQQAGEIWRQY 656 V+ I+ G+ Y D +++ + + + + E + Sbjct: 123 PVVASESAILEGIEKAYNTAPEEDLESVMASMGEGEASDIEVQADMEEADSADLERAAEE 182 Query: 657 VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRE 716 P L ILT S I++ E +G+ L ++ + Sbjct: 183 APIVKLVNMILTEAVKKGASDIHM-----EPYEKEYRVRFRIDGI-----LQTMMNPPMK 232 Query: 717 L-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 L + + + A L+ + +G Sbjct: 233 LRDAIISRVKIMAKLDISE---KRLPQDGR 259 >UniRef50_Q0G6A6 Glycosyl transferase, family 2 n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0G6A6_9RHIZ Length = 692 Score = 263 bits (672), Expect = 2e-68, Method: Composition-based stats. Identities = 83/501 (16%), Positives = 154/501 (30%), Gaps = 68/501 (13%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 W++ F +++ ++ F ++ ++ + Sbjct: 238 WVMSSFWQVAMTFHLLSAFCFILWICLRSVAAF---------GRQDGTIASELKSGPNKG 288 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 +++V ET V+ + +F+ +D T + Sbjct: 289 GGTRPAPIYSVVVALHKETEVVARLVSALDNLKWPKSCLEVFLVCEADDHATVALCRKHT 348 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR- 180 ++ P +K LN+VL + +L+DAED P +L Sbjct: 349 EGKLQYRVILVPPGNPRTKPKALNHVLPIVA---------GDFLVLYDAEDEPHPGQLEE 399 Query: 181 ---LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 + R +Q P+ + TS + E++ L +P +P G Sbjct: 400 AYDRYRASDARLACLQAPLVIRNGDRNWLTS-IFAMEYAGLFRAFLPWLARHRLPIPLGG 458 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F + + +D ++TED D+G RLK G + P +++ Sbjct: 459 TSNHFKVAVLREV------GGWDSHNVTEDADLGMRLKRAGYDIETISSPTLED------ 506 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 P+T S V Q++RW+ G V + + L R+ Sbjct: 507 ------------------APETVSVWVPQRTRWLKGWVQ------TYAVHMRHPMLLMRE 542 Query: 358 RKGA-ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 F ++ LLL LA+ + W S LL L+ + V Sbjct: 543 LGVKRFVVFQLLFHGMITAALLLPLAFGLIGFTIWLQWSTGWERTSATALLVLDLAIFVG 602 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFW---GNLINFMANWRALKQVLQHGDPRRVAWDKT 473 + + G ++ V L + L +A +R L Q+ AW+KT Sbjct: 603 GYLSFLALTLRGMGTSELRPCVKWLPFVPIYWLCVSVAAYRGLFQLF----KNPHAWEKT 658 Query: 474 THDFPSVTGDTRSLRPLGQIL 494 H S + I Sbjct: 659 AHGLASRADKLDPRHRMEPIF 679 >UniRef50_B8ICV2 General secretion pathway protein E n=2 Tax=Methylobacterium RepID=B8ICV2_METNO Length = 442 Score = 261 bits (668), Expect = 6e-68, Method: Composition-based stats. Identities = 132/464 (28%), Positives = 206/464 (44%), Gaps = 40/464 (8%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIG 85 + +S LDD FID++ + K + R D +A+ V W+E V+G Sbjct: 16 INVSSLDDAFIDIIAFGILRKGLPGLAERT-----------DIPRIAVFVANWHEEEVLG 64 Query: 86 NMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVHKVVCARPGPTSKADCL 144 M E + Y + +F+G YPND T R E+ A++P V ++ GPTSK L Sbjct: 65 KMVEGNLARIPYPSVSLFLGVYPNDTGTLRVAKELEAKYPDRVTVIINTLNGPTSKGQML 124 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWT 204 N + + + E +LHD+EDVI P ++ + D IQ+PV+ R Sbjct: 125 NEMFQQVFEREDCP----DIAVLHDSEDVIDPRTFPIYAQYSQDHDFIQVPVFSLSRGKG 180 Query: 205 HFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSL 264 + TY+DEF+E H +++ VR A+ +PSAGVGT +++ + LA G ++ Sbjct: 181 LPVASTYMDEFAERHTREMIVRNAVGAAIPSAGVGTAMTKKLLKYFLATR-GQVLMSGTV 239 Query: 265 TEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAV 324 TEDY +G K G A N + RE+FP T + ++ Sbjct: 240 TEDYILGVEAKRAG-------------FSAAFAAVSADDASGLNYVATREFFPKTLAASI 286 Query: 325 RQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYE 384 +QK+RW+ GI F+ W + YF RDRKG I+NF+ ++ + ++ ++L L Sbjct: 287 KQKTRWVYGINFEATHKLGWEGNAWDKYFFVRDRKGIITNFLPPVSFVFLVLIVLGLIDP 346 Query: 385 SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWG 444 S PD + ++LN ++ R RV+ YG + R G Sbjct: 347 SEMPDPIE--------PVFVASIYLNLAALIVRYTIRVVASHEVYGTYDLIGIAYRWPIG 398 Query: 445 NLINFMANWRALKQVLQHGD--PRRVAWDKTTHDFPSVTGDTRS 486 IN A +RA K + + + W KTTHD P Sbjct: 399 LYINAAAVFRAWKTYIGESQFATKPIVWSKTTHDLPENFMTATR 442 >UniRef50_D2LI28 General secretion pathway protein E n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LI28_RHOVA Length = 450 Score = 260 bits (665), Expect = 1e-67, Method: Composition-based stats. Identities = 125/474 (26%), Positives = 209/474 (44%), Gaps = 34/474 (7%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVP 76 + + +++++ +S DD F+D++ + + S + + + V Sbjct: 7 TLMLFISILINVSSFDDAFVDLL----SVGIIRGNFGPPEDPSPEKPTSSAIPDIGVFVA 62 Query: 77 AWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP-NVHKVVCARP 135 W E V+G M E + + +++G YPND T + + A++P V +V + Sbjct: 63 NWQEEDVLGRMVEGNLARIPISSVKLYLGVYPNDTGTLAVAEAMAAQYPDRVRVIVNSME 122 Query: 136 GPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF-NYLVERKDLIQI 194 GPTSK LN + + + +LHD+ED+I P ++ Y E D IQ+ Sbjct: 123 GPTSKGQMLNEMFRQVYARPGAP----EMAVLHDSEDIIDPRTFGVYTAYAREGYDFIQV 178 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 PV+ + TY+DEF+E H +++ VR A+ +PSAGVGTC +RR + + + Sbjct: 179 PVFSLNSIKRSKVAATYMDEFAERHTREMVVRHAVGAMIPSAGVGTCMTRRLLEHFVRER 238 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 +TEDY +G K G F R + + RE Sbjct: 239 GF-VLANGCVTEDYILGVEAKRAGFRSAFA-------------AVSADELRGLDFVATRE 284 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 YFP +FS +V+QK+RW+ GI F+ W YF RDRKGAI+NF+ ++++ Sbjct: 285 YFPKSFSASVKQKTRWVYGINFEATHKLGWGGDFWDKYFFMRDRKGAITNFLPPISLVFW 344 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 + L + + + IF S ++LN + R + RVI YG+ Sbjct: 345 VLLAFEVV--DIEQMPLDLVPIFQVS------IFLNMLALGLRYLMRVICCRDVYGINDF 396 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGD--PRRVAWDKTTHDFPSVTGDTRS 486 + +R +N +A WRA K + + + + W KT H+ P R Sbjct: 397 IGVAVRWPVSLTVNMLAVWRAWKTYVGESEYATKPIVWSKTEHELPDDLMSARR 450 >UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6 Tax=Bacteria RepID=Q2JNJ5_SYNJB Length = 764 Score = 260 bits (664), Expect = 2e-67, Method: Composition-based stats. Identities = 81/467 (17%), Positives = 141/467 (30%), Gaps = 64/467 (13%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE---LYKPDEKPLAIMVPA 77 + + L + F + + + R+ +++ E L D I+VP Sbjct: 332 PWTTLTILVLLINLFYVASILFKLLLSLVGSADRFHQITDEEVAALDDRDLPIYTILVPV 391 Query: 78 WNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN-VHKVVCARP 135 + E V+ + + + +E + + ND DT A+ P V ++ Sbjct: 392 YKEPEVMPILIKSLSKLDYPHERLDVLILLEENDRDTIEAARA--AKPPRYVRLLLVPDS 449 Query: 136 GPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDL 191 P +K N L ++DAED+ P +L+ F Sbjct: 450 KPKTKPKACNYGLAFA---------RGEYLTIYDAEDIPDPDQLKKAVIAFRKGDPSLVC 500 Query: 192 IQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALL 251 +Q + F R T M + E+S +P E L +P G F + L Sbjct: 501 VQAALNYFNRSENFLTRM-FTLEYSYWFDYLLPGLETLRMPIPLGGTSNHFRTDRLRELQ 559 Query: 252 ADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMIC 311 +D ++TED D+G R + G T + +EA Sbjct: 560 ------GWDPFNVTEDADLGIRASQHGYTVGVINSTTYEEANCA---------------- 597 Query: 312 VREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG---AISNFVSF 368 +RQ+SRWI G + W R F Sbjct: 598 --------VKNWIRQRSRWIKGYMQ------TWLVHNRNPLRSLRKLGLKNWLSYQFFIG 643 Query: 369 LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGY 428 + + ++ W +WL+ L N + + + Sbjct: 644 GSFFTFLTSPIMWLLFIYWLLTRAHWLQNLFPSWLVYLGLFNLLVGNAIGIYLNLVAVFR 703 Query: 429 YGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 G L ++ MA + AL Q+ + W+KT H Sbjct: 704 RGYYDLAFYALLNPIYWQLHSMAAYMALWQLF----TKPFYWEKTIH 746 Score = 86.6 bits (213), Expect = 4e-15, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 62/206 (30%), Gaps = 15/206 (7%) Query: 449 FMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTA 508 F A W+++K + + + + LG L T EQ A Sbjct: 109 FPAIWQSVKHLCPEAEKLWEIPATEKQILTVILERKLAAELLGSRL------TLEQWQQA 162 Query: 509 LR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESI----DAWQIPSSLIAEMPA 563 L R G LG + +S + LA G S SL + Sbjct: 163 LEIRRRTGSSLGQVLTRLSYLSTPEYLSILAHLLGYPAVSELMGTGLLHRDESLSRQFDP 222 Query: 564 SVALHYAVLPLRLEN-DELIVGSEDGIDPVSLAAL--TRKVGRKVRYVIVLRGQIVTGLR 620 V + + PL + L V D +D + L + + G ++ V+ I L Sbjct: 223 EVMMRHLFYPLSWTDEHTLTVMVNDPLDW-VVDELLYSWRPGLRIEKVLGTEQDITQLLS 281 Query: 621 HWYARRRGHDPRAMLYNAVQHQWLTE 646 R + L + + + Sbjct: 282 QDQGSRFSQEAVYKLMARLPEESASR 307 >UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVJ2_9BACT Length = 1181 Score = 259 bits (663), Expect = 2e-67, Method: Composition-based stats. Identities = 78/480 (16%), Positives = 141/480 (29%), Gaps = 70/480 (14%) Query: 7 VFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 +FA L+ ++I V + + F+ + Y R R Sbjct: 747 MFALIFGILRFLSIAFVVAIALGIARVAFVTSLAI-----------WVYFRSKPRGRPIE 795 Query: 67 DEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN 126 + ++I++PA+NE V+G DY + I + T V++ A Sbjct: 796 NPPLVSIIIPAYNEQSVVGRTIRSVLAN-DYPHMEIIFVDDGSTDGTADAVEQEFAGHEK 854 Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NY 184 V V KA LN+ + + DA+ + ++ Sbjct: 855 VRVVRQVNG---GKASALNHGI---------LVSKGEIIVGLDADTQFRKETITRLIRHF 902 Query: 185 LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 R + V R + + E+ D L G + R Sbjct: 903 RDPRVGAVAGNVKVGNRI--NLITRWQALEYITSQNVDRLAYAQLNAVTVVPGAIGAWRR 960 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 A+ + + +L ED D+ +R++ KG V Sbjct: 961 TALDEV------GGYLTDTLAEDMDLTWRIRRKGWKIETEAGAVALT------------- 1001 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISN 364 P T +Q+ RW G + +K + +F W Sbjct: 1002 ----------EAPATTQAFFKQRFRWSFGTLQCLWKHR--RALFRYGWFGWVGLPTLWLF 1049 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAW--HFLSIFSGSA------WLMTLLWLNFGLMVN 416 + F + ++ L +L + S + H+L I + +A TLL+ V Sbjct: 1050 QILFQVIAPLVDLQVLYSLWSFGSSWFSEHYLGIVNQAATPPGALLQQTLLFYALFYAVE 1109 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 V G L+ F+ + + W+A+ L+ R W K Sbjct: 1110 LAGAAVAVAIDRDGWRLLPWLFLQRFFYRQLMYAVLWKAV---LRAFLGERTGWGKLERR 1166 >UniRef50_A0LUI4 Type II secretion system protein E n=4 Tax=Bacteria RepID=A0LUI4_ACIC1 Length = 553 Score = 258 bits (658), Expect = 8e-67, Method: Composition-based stats. Identities = 52/260 (20%), Positives = 106/260 (40%), Gaps = 16/260 (6%) Query: 487 LRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 ++ LG ILLE ++T EQL A ++ G LG ++ QG+++ QL ALA Q G+ + Sbjct: 1 MKQLGDILLEGGLVTPEQLAAAYAEHQRNGRSLGRVLVDQGILTEAQLVAALATQIGLRF 60 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 + I S ++ +P +V Y LP+ E+ +L+V D + ++ + G V Sbjct: 61 VDLTDVAIDGSAVSRVPEAVCRRYTALPIGYEDGKLVVAMADPANVFAIDDIRSITGLDV 120 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE 665 + V+ R ++ + ++ D A + + + + + + P Sbjct: 121 KPVVATRADVLAAINRYHRADEELDDLTSTLAAEETE---DLASLDEVVEDAPIVKFVNL 177 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLL 725 ++T S I++ E + L +GV+ + + + + L Sbjct: 178 LITQAVQDRASDIHI-----EPTERDLRVRFRIDGVLHEVM----RSPRNIQSGVISRLK 228 Query: 726 LKAGLNTEQVAQLESENEGE 745 + A +N + +G Sbjct: 229 IMADMNIAE---RRVPQDGR 245 >UniRef50_B8D2C7 Tfp pilus assembly protein PilB n=5 Tax=Firmicutes RepID=B8D2C7_HALOH Length = 558 Score = 257 bits (657), Expect = 9e-67, Method: Composition-based stats. Identities = 56/269 (20%), Positives = 107/269 (39%), Gaps = 18/269 (6%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQ 540 ++ LG++LL+ ITE+QL+ AL+ + G +LG ++ G ++ L Q L Q Sbjct: 1 MTRTHIKKLGELLLDFNFITEKQLNEALKKQNKSGKKLGEILVESGYLNENDLIQVLEFQ 60 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 G+ ++ + I L +P ++A + V+PL +N +L V D + V++ + Sbjct: 61 LGIPHADLNKYVINPHLAQYIPENIARRHNVVPLEKKNGKLKVAMVDPTNLVAIEDIEMT 120 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPR---AMLYNAVQHQWLTEQQAGEIWRQYV 657 G KV +I R I L Y+ ++ + E + + Sbjct: 121 SGLKVEPLIASRKNIKMALNQIYSVNDSDAAEVFASLNEVTTKTNEEPELNELKEMIEDA 180 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P L I+ + S I++ E + +GV L +T+ + Sbjct: 181 PIVRLANLIINQAIQMKASDIHI-----EPQEDQVRVRYRVDGV-----LRENMTVPKHS 230 Query: 718 QVS-MQSLLLKAGLNTEQVAQLESENEGE 745 Q + + L + A L+ + +G Sbjct: 231 QAALISRLKIIADLDITE---RRVPQDGR 256 >UniRef50_Q0EZ04 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EZ04_9PROT Length = 684 Score = 256 bits (654), Expect = 2e-66, Method: Composition-based stats. Identities = 83/499 (16%), Positives = 158/499 (31%), Gaps = 71/499 (14%) Query: 10 TWLYGLKVIAITLAVIMFISGLDDF------FIDVVYWVRRIKRKLSVYRRYPRMSYRE- 62 T+L+ ++I + I I L F+ + + +R + + + + + E Sbjct: 233 TFLFVWMTLSIIILAIWPIQSLVVLNLSISAFLMLNFGLRMLLGWVGGEKHFDQYVTDEE 292 Query: 63 ---LYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDV 117 L D I++P ++E + N A TLDY I + D +T Sbjct: 293 VLALDDRDLPVYTILLPMFHEAATLPN-IAQALRTLDYPLSKLDIKLILEQEDDETIDAA 351 Query: 118 DEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 E+ + P +K N L ++D ED P Sbjct: 352 KELG-LEGIFEIIRVPESLPQTKPKACNYALHF---------SRGEMATIYDGEDAPEPD 401 Query: 178 ELR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 +L+ F E +IQ + F T M + E+S +P + L + Sbjct: 402 QLKKAVIAFRKSPENTAVIQGRLNYFNVAENWLTRM-FTMEYSLWFDFYLPALDYLRIPI 460 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 P G F + + +D ++TED D G RL + G + Sbjct: 461 PLGGTSNHFKMSVLREM------GGWDPYNVTEDCDFGVRLTQAGYRVGVMNST------ 508 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF 353 E ++ +RQ+SRW+ G + + + + Sbjct: 509 ------------------TFEEANNSIPNWIRQRSRWLKGYMQ------SYLVHMRSPFK 544 Query: 354 LWRDRKGA-ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFG 412 L+ + F F+ ++ +L + + F L F Sbjct: 545 LYGELGHVGFWGFQFFIGGTIVSAMLTPVLFLMYIIWLLTSTFAFDPYFPSFVLYITLFN 604 Query: 413 LMVNRIVQRVIFVTGYYGLTQGLLSVLRL--FWGNLINFMANWRALKQVLQHGDPRRVAW 470 L++ + +F+ + L L + L+ A ++ Q++ + W Sbjct: 605 LLIANGMLIYLFMLSGFKRRYFGLIPWALTVPFYWLLQSWAGYKGFWQLIHN----PFYW 660 Query: 471 DKTTHDFPSVTGDTRSLRP 489 +KT H S + +P Sbjct: 661 EKTHHGLTSFEVTHSATQP 679 Score = 94.0 bits (232), Expect = 2e-17, Method: Composition-based stats. Identities = 39/179 (21%), Positives = 72/179 (40%), Gaps = 6/179 (3%) Query: 477 FPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQ 535 F D +G +L+ V++ EQ++ A++ + E G RLG +L +G +S + Q Sbjct: 56 FYDALTDHFRRGRIGDLLVSKGVLSNEQMEEAVQIQSEWGTRLGDIILAKGWVSPYVMGQ 115 Query: 536 ALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA 595 LAE + + SL+ SV +P + E+ L V D P L Sbjct: 116 VLAEHFDKPHVDLMDRRPDISLLDRSKLSVYSENLFMPWQREDGLLKVAVVDVT-PELLK 174 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRH----WYARRRGHDPRAMLYNAVQHQWLTEQQAG 650 + V +++ + I+ L+ +Y+ + H+ L + T +Q Sbjct: 175 IVNETVDEPFDFIVTSKFDIIWLLQEIGGTYYSGKAVHELANTLPQYSASEVFTVKQLT 233 >UniRef50_A8IJY6 Putative glycosyltransferase n=1 Tax=Azorhizobium caulinodans ORS 571 RepID=A8IJY6_AZOC5 Length = 645 Score = 256 bits (653), Expect = 3e-66, Method: Composition-based stats. Identities = 78/469 (16%), Positives = 141/469 (30%), Gaps = 51/469 (10%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 + G V L+ + + + ++ R + R L Sbjct: 195 MAGAAVATAGLSAPLETLLIVQSLLSSIFLASATLRIATCLARPEEPPPLGLPDAALPLY 254 Query: 72 AIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 +I+VP + E V+ + E I + P+D + + A P + Sbjct: 255 SIIVPLYREERVLPRLVRALQAIDYPPEKLDIKIVVEPDDAPVHAALARM-ALPPWFEII 313 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE--- 187 V GP +K LN L + ++ DAEDV P +L+ Sbjct: 314 VAPDVGPRTKPKALNCALPF---------TRGSFVVVFDAEDVPDPDQLKRALAAFRQGG 364 Query: 188 -RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 +Q + + T + S + E++ +P L + G F R Sbjct: 365 RNLACVQARLSVENADET-WISRLFAAEYAGQFDVLLPGLAQLRMPILLGGTSNHFRRSM 423 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 + + A+D ++TED D+G RL G T + Sbjct: 424 LELI------GAWDPYNVTEDADLGVRLARAGWTTAVIGSSTA----------------- 460 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV 366 E P T + +RQ++RW+ G + W + + Sbjct: 461 -------EEAPITRAAWMRQRTRWLKGWAQTLLVH-GRQPLRLVRELGWGNLVPLLLLTA 512 Query: 367 SFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVT 426 A ++ L + + + + L N + Sbjct: 513 GPFASALLHPLCVAWLIADVVRGVFLTTPGTTLGVVATALSLTNLAIGYGAAAWSCGLGL 572 Query: 427 GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 G ++ L + L+ +A WRA+ Q++ R WDKT H Sbjct: 573 KRRGQFALAPILILLPFYWLLLSVAAWRAVVQLIV----RPYWWDKTEH 617 >UniRef50_C6CY56 Type II secretion system protein E n=3 Tax=Bacillales RepID=C6CY56_PAESJ Length = 554 Score = 255 bits (652), Expect = 4e-66, Method: Composition-based stats. Identities = 55/261 (21%), Positives = 110/261 (42%), Gaps = 20/261 (7%) Query: 487 LRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 + LG +L+E+ +I+EEQL AL +LG ++ QG I+ +QL + L Q G+ Sbjct: 5 KKRLGDLLVESAIISEEQLQKALLEQSKSKQKLGDLLIAQGYITEQQLIEVLEFQLGIPH 64 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 S+ +QI + +P S+A Y +PL+ + +L+V D +D ++ L G ++ Sbjct: 65 VSLYKYQIDPEITQIIPESMAKRYQAIPLQKDGGKLMVAMADPLDYFAIEELRMSTGFRI 124 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE 665 I + ++ + Y + +M + E + EI + P L + Sbjct: 125 EPAISSKDELQRAIARHYGLQ-----DSMSQMMIDLPTQEEIRETEITDEDSPVVRLVNQ 179 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ-SL 724 ++ + S I+V + + +GV+ E + +++Q + L Sbjct: 180 MIQQAVQLRASDIHV-----DPGETSVTIRYRIDGVLRTER-----ALPKQMQGFITARL 229 Query: 725 LLKAGLNTEQVAQLESENEGE 745 + + LN + +G Sbjct: 230 KIMSKLNIAE---RRLPQDGR 247 >UniRef50_B6JD78 Glycosyl transferase, family 2 n=1 Tax=Oligotropha carboxidovorans OM5 RepID=B6JD78_OLICO Length = 669 Score = 255 bits (652), Expect = 4e-66, Method: Composition-based stats. Identities = 83/499 (16%), Positives = 156/499 (31%), Gaps = 65/499 (13%) Query: 10 TWLYGLKVIAITLAVIM---FISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY-RELYK 65 W YGLK++A T A ++ ++ L + + + + R + R + Sbjct: 217 AWRYGLKLLAFTSAFMLAPTLLTQLSGAVLAIWFLLFNSLRLAGAFAGGERTPRSPRIPD 276 Query: 66 PDEKPLAIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARF 124 +MV ++E + ++ + AA E I + +D +TQ + + Sbjct: 277 AQLPLYTVMVALYHEGPSVAHLVQSLAALDYPREKLDILLLLEADDIETQAALSRL-HLP 335 Query: 125 PNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR---- 180 NV ++ GP +K LN L + + DAED P +LR Sbjct: 336 GNVQTLIVPPFGPRTKPKALNAGLMSA---------RGEFTAVFDAEDRPDPSQLRDAID 386 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 F + +Q + + + + E++ + +P G Sbjct: 387 AFRHHHTDVACVQASLCIDNSADSWL-ACMFTAEYAGQFDVFLRGFSQFGLPLPLGGSSN 445 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F + + +D ++TED D+GFRL +G + Sbjct: 446 HFRTDVLREV------GGWDAYNVTEDADLGFRLARRGYRAVMFDST------------- 486 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD--R 358 E P +RQ+SRW+ G + W + L + Sbjct: 487 -----------TYEEAPAHTGAWLRQRSRWMKGWMQ------TWIVHMRSPRRLIKQSGL 529 Query: 359 KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHF---LSIFSGSAWLMTLLWLNFGLMV 415 G + + ++ +L L ++ S + L + Sbjct: 530 AGFFTLNLLVGGNVLTALAYPILIAACLLEAGLAATGSTAVAMFSGPFIELHFTTIAAGY 589 Query: 416 NRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 V + GL + ++ +A WRAL+Q+L W+KT H Sbjct: 590 LSTVVVSLMGLARRGLLRHAWVLVLTPLYWAWLSIAAWRALRQLLHD----PYHWEKTEH 645 Query: 476 DFPSVTGDTRSLRPLGQIL 494 + R R + Sbjct: 646 GLARHSRLARHQRKEAARM 664 Score = 42.0 bits (97), Expect = 0.085, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 28/73 (38%) Query: 507 TALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVA 566 A R + G+ ++ G+IS LA G+ + + +++ A Sbjct: 72 AARRAAILGIGAEQVLIRNGVISEADYIARLAHHCGLPLADFNRLRRSDCALSDAQLRYA 131 Query: 567 LHYAVLPLRLEND 579 + +LP++ + Sbjct: 132 AWHRLLPIKGRDG 144 >UniRef50_B0RVF2 Glycosyltransferase n=8 Tax=Proteobacteria RepID=B0RVF2_XANCB Length = 635 Score = 255 bits (651), Expect = 5e-66, Method: Composition-based stats. Identities = 81/480 (16%), Positives = 152/480 (31%), Gaps = 65/480 (13%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE----LYKPDEKPLAIMV 75 + ++ I+ L + ++ + R + L D ++V Sbjct: 203 FPIITLITINVLVALGFLATFGLKLLLVWFGSRHRIDIKVTEDEVAALRDDDLPVYTVLV 262 Query: 76 PAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 P + E V+ + A LDY + + +D +T ++ + Sbjct: 263 PMYKEPEVLP-ILANALRKLDYPISKLDVKLVLEADDFETIEAAKKLG-LEAFFEIIRVP 320 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV----ERK 189 P +K N L ++DAED P +L+ + Sbjct: 321 PSQPKTKPKACNYALHFA---------RGELLTIYDAEDKPEPDQLKRVVAAFRKAEKDV 371 Query: 190 DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTA 249 IQ + + + T M + E++ +P E L +P G F + Sbjct: 372 VCIQARLNYYNADENWLTRM-FTLEYTLWFDFYLPALEYLRIPIPLGGTSNHFRLDVLRQ 430 Query: 250 LLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNM 309 + A +D ++TED D+G RL + G V +EA Sbjct: 431 VRA------WDPYNVTEDADLGVRLIQNGYRVNVVNSTTFEEANV--------------- 469 Query: 310 ICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-RKGAISNFVSF 368 + +RQ+SRW+ G + W + L+R F F Sbjct: 470 ---------SIPNWIRQRSRWLKGYMQ------TWLVHMRDPVHLYRSTGFKGFWGFQFF 514 Query: 369 LAMLVMIQLLLLLAYESLWPDAWHFLSIF--SGSAWLMTLLWLNFGLMVNRIVQRVIFVT 426 + I L + + + IF + WL T+ +N L V + Sbjct: 515 IGGNFFIALGVPVMWTLCLISMLSGARIFDATFPPWLATISLVNLLLANAFFVYVTLVAA 574 Query: 427 GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRS 486 + L + + L+ +A ++ L Q++++ W+KTTH + R Sbjct: 575 FKRDYFKLAPYALTVPFYWLLQSIAAYKGLWQLIRN----PFYWEKTTHGISKHSEQERR 630 Score = 99.7 bits (247), Expect = 3e-19, Method: Composition-based stats. Identities = 38/158 (24%), Positives = 64/158 (40%), Gaps = 2/158 (1%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 R LG+ L+ VIT+EQL AL + RLG +L Q + A++ + Sbjct: 12 RAPDIGRERGLLGRSLVSAGVITDEQLRAALALQQRWNSRLGDVILAQRGVPAQRFYAIV 71 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 A G+ + + L+ V +LP R E+ L++ D DP A Sbjct: 72 AAHFGLQFVDLVQQPPDPELLTATDLDVYAQRLILPWRREDGVLVLAVADP-DPALFAWA 130 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML 635 G +VR+V + I+ L+ + + + +L Sbjct: 131 REHYGAQVRFVGTAKFDIIWSLQRYADEQLTDNALNLL 168 >UniRef50_C6PR94 Glycosyl transferase family 2 n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PR94_9CLOT Length = 743 Score = 254 bits (650), Expect = 7e-66, Method: Composition-based stats. Identities = 76/492 (15%), Positives = 157/492 (31%), Gaps = 81/492 (16%) Query: 27 FISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY-------RELYKPDEKPLAIMVPAWN 79 + ++ FF + ++ +K + + Y + + + I++P + Sbjct: 305 TLLSINLFFQSIYAFMTTLKLYIVIKGSYKDKQLHFTTEEIEAIDEKELPTYTILIPVYK 364 Query: 80 ETGVIGNMAELAATTLDYENY--HIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP 137 E VI + + +DY Y + + +D +T V + ++ + P Sbjct: 365 EKEVIKTLIKN-IENIDYPKYKLDVCILLEEDDDETISTVKAM-KLPEYYSMIIVPKNTP 422 Query: 138 TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL----FNYLVERKDLIQ 193 +K N L +++DAED +L+ F L + IQ Sbjct: 423 KTKPKACNYGL---------IRARGKYVVIYDAEDRPESDQLKKVYLSFKKLPKNYVCIQ 473 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + F + T + + E+S + + +P G F + + Sbjct: 474 SKLNYFNSDQNFLTRL-FTQEYSMWFELLLVGIMQIKTPIPLGGTSNHFKIEFLKEV--- 529 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 A+D ++TED D+G RL +KG V Sbjct: 530 ---GAWDPFNVTEDADLGVRLFKKGYNTAVVDSR------------------------TW 562 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR---------KGAISN 364 E S +RQ+SRWI G + W + L++ + Sbjct: 563 EEANSDLSNWIRQRSRWIKGYMQ------TWFVHMRHPVQLYKSLGLKGFVGYQAMILGT 616 Query: 365 FVSFLAMLVMIQLLLLL------AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI 418 + L + +L+L S++P +++++ F + N M I Sbjct: 617 PLLPLINPIFWLMLILWYTTKASWIRSMFPGVFYYIAAFQLFFGNFMFTYTNAVGMYWVI 676 Query: 419 VQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 + + + L ++ +A ++AL Q++ + W+KT H Sbjct: 677 RDCSLKKEQPFSYR-LVKYALLSPIYWILMSVAAYKALIQLII----KPFYWEKTNHGLT 731 Query: 479 SVTGDTRSLRPL 490 + + Sbjct: 732 EIRERNFGSLDV 743 Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats. Identities = 41/165 (24%), Positives = 74/165 (44%), Gaps = 4/165 (2%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 + + +S P+G++L+EN IT+EQL AL R G RLG +L G I E+L + L Sbjct: 112 YLNSNYKSKLPIGKMLVENNEITKEQLIKALDLQRKSGGRLGDILLFLGFIKPERLCRYL 171 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 A QN V ++ ++P +AL Y + L N+ ++ ++ + L + Sbjct: 172 ATQNNVGRI---GKNFDINVSKKLPYKLALKYNAIILNSRNNCYVIAVKELLSWKQLKEI 228 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ 642 + + V V+ +I Y +++ + LY+ Sbjct: 229 EGYLHKPVEQVLATMLEIDNFWNIVYRKKQSEESVFKLYDDQPEN 273 Score = 50.0 bits (118), Expect = 3e-04, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 1/57 (1%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 +G+ LL+ I+EEQL+ AL+ +G ++ G IS +QLA+ + Sbjct: 1 MIQERTNRIGESLLKQGYISEEQLEIALKIQEKTNKLIGNILVESGFISQQQLAEYI 57 >UniRef50_Q1YKP1 Putative glycosyl transferase n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YKP1_MOBAS Length = 681 Score = 254 bits (649), Expect = 9e-66, Method: Composition-based stats. Identities = 86/462 (18%), Positives = 145/462 (31%), Gaps = 61/462 (13%) Query: 41 WVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAEL-AATTLDYEN 99 RI L ++ +++V + ETGVI + + Sbjct: 266 LAIRIVAALPAPPEPEPEQGADVEAGPLPVYSVLVALYQETGVIERLVASLSRLDWPTSR 325 Query: 100 YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSAN 159 I + +DP T + A P V P +K LN VL Sbjct: 326 IEIKLVCEADDPATIGEARRATAGLPQFEIVAVPPGEPRTKPKALNFVLPLC-------- 377 Query: 160 FAFAGFILHDAEDVISPMELRL----FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEF 215 L+DAED P +LR F +Q P+ + T + + E+ Sbjct: 378 -RGEFVALYDAEDEPDPGQLREAFHGFRNGPGDLACLQAPLVVRNGDQNWLTGL-FALEY 435 Query: 216 SELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLK 275 + L + +P +P G F R +TA+ A+D ++TED D+G RL Sbjct: 436 AALFRRLLPWLARRRLPLPLGGTSNHFRRHCLTAV------GAWDSHNVTEDADLGMRLY 489 Query: 276 EKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIV 335 +G + P +++ P+ + RQ++RW G + Sbjct: 490 REGWKIGTLTRPTLED------------------------APERWPVWYRQRTRWTKGWL 525 Query: 336 FQGFKTHKWTSSLTLNYFLWRDRKGA-----ISNFVSFLAMLVMIQLLLLLAYESLWPDA 390 W + LWR+ FV LA ++ + L+L +L Sbjct: 526 Q------TWLVHMRQPRRLWRELGPISFAVFQMLFVGMLASALIQPVFLVLVLSTLVSAL 579 Query: 391 WHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFM 450 H L + + L N V + + L+ + Sbjct: 580 NHGLP-GGLAGMIFALDLFNATGGFFAFVALSLPALRPEERATLPKYYALVHLYWLLIAL 638 Query: 451 ANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQ 492 A+ RA+ Q+ + W+KT HD + Sbjct: 639 ASLRAVCQLARD----PHRWEKTHHDLRARANLHDQRYRTEP 676 >UniRef50_B5JBJ3 GSPII_E N-terminal domain family n=2 Tax=Octadecabacter antarcticus RepID=B5JBJ3_9RHOB Length = 631 Score = 254 bits (648), Expect = 1e-65, Method: Composition-based stats. Identities = 77/503 (15%), Positives = 153/503 (30%), Gaps = 70/503 (13%) Query: 2 DWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLS--------VYR 53 DW W GL V+ ++ ++ + F + I + + ++ Sbjct: 164 DWNTGKAFRW--GLGVVLAIMSCLIVWPQISFFVLCGWAVFTLILKTILKVAAAVVHLFP 221 Query: 54 RYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPD 112 + + I+VP + E + G + E + + + + D Sbjct: 222 KPTSPMPANPQLAHLPIVTILVPLFRERDIAGTLIERLSRLDYPTDRLDVCLVLEA-DDG 280 Query: 113 TQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAED 172 T ++ + + +K LN L + ++DAED Sbjct: 281 TTQNALAATQLPFWMRAIKVPLGTLQTKPRALNYALCFAK---------GSIIGVYDAED 331 Query: 173 VISPMELRLFNYLVE----RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREA 228 +P ++ + +Q + + + + E++ +P E Sbjct: 332 APAPDQIHIVVNRFAQRGQDVACLQGQLDFYNSHSNWL-ARCFTVEYATWFRIMLPGLER 390 Query: 229 LAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPV 288 L +P G F R + L +D ++TED D+G RL G + Sbjct: 391 LGLAIPLGGTTLFFRREILEEL------GGWDAHNVTEDADLGIRLARHGYRTEIIDTVT 444 Query: 289 VDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSL 348 +EA R ++Q+SRW+ G + + Sbjct: 445 QEEANARAWP------------------------WIKQRSRWLKGYAI------TYGVHM 474 Query: 349 TLNYFLWRDRKGA-ISNFVSFLAMLVMIQLL--LLLAYESLWPDAWHFLSIFSGSAWLMT 405 LWRD A + LL LL ++ + H L + +T Sbjct: 475 RSPLKLWRDLGAWRFFGLQLLFAGTISQFLLAPLLWSFWLMLLGLPHPLDNVLSTNVTLT 534 Query: 406 LLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDP 465 + + I I VT G + L + + +A ++ + ++ Sbjct: 535 FAAIFLLSEIINITVSAIAVTSP-GKYDLIKWTPTLHFYFPLAALAAYKGVIEL----AT 589 Query: 466 RRVAWDKTTHDFPSVTGDTRSLR 488 + WDKT+H + T + Sbjct: 590 KPFYWDKTSHGIFAPTQPAPTSP 612 Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 17/105 (16%), Positives = 29/105 (27%), Gaps = 15/105 (14%) Query: 532 QLAQALAEQNGVA---------------WESIDAWQIPSSLIAEMPASVALHYAVLPLRL 576 +A AL + G+ LI A + L + VLP R Sbjct: 38 AIAVALGHRLGLPSTNPEQIEAGLAETCVVDPVITPADPRLIERFGAELCLKHRVLPWRS 97 Query: 577 ENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRH 621 + + V + + + V + IV + L Sbjct: 98 VSGRVTVLATSPDHFLRIRDALVVVFGPIHLAIVTTNNLDAALSR 142 >UniRef50_Q1GVK0 Bacteriophage N4 adsorption protein B n=1 Tax=Sphingopyxis alaskensis RepID=Q1GVK0_SPHAL Length = 483 Score = 253 bits (647), Expect = 1e-65, Method: Composition-based stats. Identities = 142/487 (29%), Positives = 204/487 (41%), Gaps = 38/487 (7%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 W +++ WL L L + I ++ + L + R R Sbjct: 8 WQVELSTGWLEWL-----VLGAGRELMLFASVGILLIGLDDLLLDALWLATRGQRRGETA 62 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 P E +AI VPAW+E + M D E++ ++VG YPND T V ++ A Sbjct: 63 RAPPIEGRIAIFVPAWDEAAALPAMLCRTLAAWDGEDFRLYVGCYPNDTATIYAVSQLVA 122 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 R + V+ GPT+K D LN + A+ ER FA +LHDAED + EL L+ Sbjct: 123 RDARLRLVIGESEGPTTKGDNLNRLWAALCADERVEARRFAAIVLHDAEDHVHRHELALY 182 Query: 183 NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 + ++QIPV P + Y DEF+E HGKD+PVR L +PSAGVG Sbjct: 183 RQHLAHNAMVQIPVVPIIDRRARWIGGHYADEFAEAHGKDMPVRSRLGLPLPSAGVGCAL 242 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 +R A+ L + G F SLTEDY+IG + G+ FV Sbjct: 243 TRSALALLAMERGGCPFSSDSLTEDYEIGMVIGAYGLGARFVDAA--------------- 287 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTS------------SLTL 350 + I R FP AVRQKSRWI GI G+ W L Sbjct: 288 -DPAGDRIVSRGAFPGRIDAAVRQKSRWIAGIAMAGWDHLGWPGCRLGHKQRSTGRDLLA 346 Query: 351 NYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN 410 + LWRDR+ ++ + A +I + +A + L + L LL +N Sbjct: 347 RWMLWRDRRAPLAALILLAAYAGLILVAAGVAGQLLLGW-----NAIEPGPTLQWLLVVN 401 Query: 411 FGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAW 470 L+ R+ R+ F +G + +V R F N+I +A R++ + V W Sbjct: 402 ALLLGWRMALRIHFTARLHGWREASFAVPRAFVANIIAMLAARRSVLLYWRILRSGEVVW 461 Query: 471 DKTTHDF 477 DKT H Sbjct: 462 DKTDHSE 468 >UniRef50_D0B518 Glycosyl transferase, family 2 n=36 Tax=Brucellaceae RepID=D0B518_BRUME Length = 630 Score = 253 bits (646), Expect = 2e-65, Method: Composition-based stats. Identities = 77/481 (16%), Positives = 145/481 (30%), Gaps = 62/481 (12%) Query: 7 VFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVY--RRYPRMSYRELY 64 F + + + M + + ++ + R + +R Sbjct: 176 AFVIAMAVYAFLGCIVNWPMKTMLALHVAMSLFFFGCVLIRLFAAASGKRLQFTEIAPFK 235 Query: 65 KPDEKPLAIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 D +I+VP + E V+ + L I + +D +T + Sbjct: 236 PRDLPVYSILVPLYREKDVVAQLIAALNRLNWPRSKLDIKLVCEKDDYETIAAIR-CNTM 294 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL---- 179 N V+ GP +K LN L + DAED P +L Sbjct: 295 PSNFELVLVPPGGPRTKPKALNYALQFA---------RGEIVAVFDAEDRPHPDQLLEAW 345 Query: 180 RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVG 239 + F + +Q P+ T M + E++ L +P +P G Sbjct: 346 QAFRRGGSKLACVQAPLIIGNFRRNLLTRM-FAFEYAVLFRGLLPWLARRGLVIPLGGTS 404 Query: 240 TCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRK 299 F R + + +D ++TED D+G RL G + +++ Sbjct: 405 NHFRRSCLEQV------GGWDAYNVTEDADLGMRLARFGYRIDVISRGTIED-------- 450 Query: 300 FLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR- 358 P+ RQ++RWI G + W WR+ Sbjct: 451 ----------------APEEHGVWHRQRTRWIKGWMQ------TWLVHGRQPMNTWRELG 488 Query: 359 --KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 + +S + + + L+L + F + WL+ L +N + Sbjct: 489 WWRFVVSQIYTLGIIGSALLHPLMLLMLAGLCLRMAFGPLTPQGLWLLALDVINILMAYM 548 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 L L + ++ ++ WRA+ Q+++ W+KT H Sbjct: 549 SFHMLGAKTMEPTELGGYA-YFLAIPIYWVLISLSAWRAVWQLVRQ----PHLWEKTPHQ 603 Query: 477 F 477 Sbjct: 604 P 604 >UniRef50_C6XK76 Glycosyl transferase family 2 n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XK76_HIRBI Length = 488 Score = 253 bits (646), Expect = 2e-65, Method: Composition-based stats. Identities = 77/472 (16%), Positives = 153/472 (32%), Gaps = 64/472 (13%) Query: 25 IMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 IM + + + + I + + ++ +++P ++E V Sbjct: 76 IMVVFSIFLLLQVIFRFYAAIISIFNSPKTKSTSPSKKSVSFALPKFTLLIPLYHEQAVA 135 Query: 85 GNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADC 143 A + +F T +D T+ + + + N + + P +K Sbjct: 136 SRSVSAMEALNYPADKLEVFYLTEEDDKATESALKKAI-KHQNFKIISVPKHAPRTKPKA 194 Query: 144 LNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKD----LIQIPVYPF 199 LN L T ++DAED+ P +L + +IQ P++ + Sbjct: 195 LNYGLQFST---------GDIVTVYDAEDIPHPQQLLAAAQAFQNGGTNLAVIQAPLHAY 245 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 E + + + + E++ +P + +P G F + + + A+ Sbjct: 246 NGEES-WIASQFDLEYAIHFDVWLPAMTKMGWPIPLGGTSNHFKKNVLEKV------GAW 298 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDT 319 D ++TED D+G+RL G + + P RE P Sbjct: 299 DPFNVTEDADLGYRLALNGYSAGMIELP------------------------TREEAPIN 334 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLL 379 + + Q++RWI G + + + LWR I+ + L + LLL Sbjct: 335 LAQWLPQRTRWIKGHIQSLAVLSRKPFETIKSLGLWRSLGCLITFVSAILTAGLHGPLLL 394 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 LAY + A + L+ ++ + + + V T +L Sbjct: 395 YLAYSII--TAPNTLNPLHLIPIILAFSSVILAALASSAV------------THCFKPLL 440 Query: 440 RLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLG 491 + + A RA+ ++ R W KT H P+ Sbjct: 441 TAPFYWPLMSFAFIRAIWEL----HTRPYIWSKTQHGISKSKIPLLHKEPVT 488 >UniRef50_D2LBM7 Glycosyl transferase family 2 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LBM7_RHOVA Length = 643 Score = 253 bits (646), Expect = 2e-65, Method: Composition-based stats. Identities = 86/489 (17%), Positives = 153/489 (31%), Gaps = 60/489 (12%) Query: 6 DVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYK 65 A L V + S + F + +R R R L Sbjct: 205 ATVAVGLVLGAVAFAPAETLTLASAMLSIFFLLTIALRAAAAVNIALPRPKAKEARLLGD 264 Query: 66 PDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCAR 123 + ++VP + ET ++ ++ A ++DY I + +D DT ++ A Sbjct: 265 AELPRYTVLVPLYRETAILPHL-AHALASIDYPAAKLDIKIVLEASDRDTIEAAQKL-AF 322 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--- 180 NV VV P +K LN L + +++DAED P +LR Sbjct: 323 PGNVDLVVVPDREPRTKPKALNYALHFAS---------GEFVVIYDAEDRPEPDQLRKAA 373 Query: 181 -LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVG 239 +F +Q + + S + E++ L +P+ +P G Sbjct: 374 TVFAQAPADLVCLQARLDYYNARENWL-SRQFTIEYATLFRGLLPLLARFRLPLPLGGTS 432 Query: 240 TCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRK 299 F A+ + A+D ++TED D+G RL G + +EA R Sbjct: 433 NHFRAAALREI------GAWDPYNVTEDADLGMRLARAGYRTGTLESTTWEEACCRPMP- 485 Query: 300 FLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRK 359 ++Q++RW+ G + + Sbjct: 486 -----------------------WLKQRTRWLKGWMQTFGVHMRRPREAMSEL-----GV 517 Query: 360 GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIV 419 +F ++ A +++ L + Y + + A LLW+ V Sbjct: 518 AGFLSFHAYFAGIIVSALAHPVFYILMLYEVLQGRLFAGEGAMENLLLWIAAVNFVGGYA 577 Query: 420 QRV---IFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 + F +L V+ + L A +RAL Q+ W+KT H Sbjct: 578 ANIALSAFAVAGTRHRHLMLHVMFIPVYWLFVSAAAYRALWQLFH----APFHWEKTEHG 633 Query: 477 FPSVTGDTR 485 + Sbjct: 634 VSRLARPFP 642 >UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Firmicutes RepID=A7GCY8_CLOBL Length = 420 Score = 252 bits (644), Expect = 3e-65, Method: Composition-based stats. Identities = 68/458 (14%), Positives = 136/458 (29%), Gaps = 56/458 (12%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIG 85 +FI L ++ + + Y + + L ++I+VPA NE VIG Sbjct: 10 LFIFSLVSIWMLLFVNIILSLAGYRYYLKTLNSELKGLKNEKYPKVSILVPAHNEEKVIG 69 Query: 86 NMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARP---GPTSKA 141 + + + V + +T++ + ++ + + + + G K+ Sbjct: 70 RTVKSILLLNYPKDKMELIVINDNSSDNTKKILKQIQKEYRSYNFKIINTDNITGGRGKS 129 Query: 142 DCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYL---VERKDLIQIPVYP 198 + LN + ++DA++ L+ E + Sbjct: 130 NALNIGYKH---------SSGDFIAVYDADNTPDKNALKYLMETIIEDEHLGAVIGKFRT 180 Query: 199 FEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIA 258 ++ T I E R L G + + L Sbjct: 181 RNKDRNMLTRFINI-ETLSFQWMCQAGRWNLLNLCTIPGTNFVVRKNIIQKLN------G 233 Query: 259 FDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPD 318 +D +++ ED +I FR+ E G FV + V E P+ Sbjct: 234 WDPKAIAEDTEISFRIYELGYKIKFVPYSV-----------------------TWEQEPE 270 Query: 319 TFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLL 378 +Q++RW G ++ K K N F + F F + L Sbjct: 271 NLKVWFKQRTRWAKGNIYVLLKYFK-------NMFKGTSKDIIFDIFYFFSVYFLF--LS 321 Query: 379 LLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSV 438 ++ + L+ L L + L V + + G +L Sbjct: 322 SVIISDILFIVGIFLNINLHVIGNFNVLWILAYVLFVLEVSLTLTLEKGESNKENLILVP 381 Query: 439 LRLFWGNLINFMANWRALKQVLQ-HGDPRRVAWDKTTH 475 + F + + R + Q ++ + + W KT Sbjct: 382 IMYFTYCQMWMIVALRGIIQYIRDKLFKKEIKWYKTER 419 >UniRef50_C6PVI5 Glycosyl transferase family 2 n=2 Tax=Clostridium RepID=C6PVI5_9CLOT Length = 512 Score = 251 bits (641), Expect = 7e-65, Method: Composition-based stats. Identities = 67/447 (14%), Positives = 138/447 (30%), Gaps = 51/447 (11%) Query: 16 KVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMV 75 + + + + + + + + Y + +S++ Y + EK A++V Sbjct: 10 EFVYNFVICGINVFQVSVIILTMYYLI------ISLFGFYKKEDKEAENCKPEKKFALLV 63 Query: 76 PAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 A NE VIG + + + Y IFV T + + Sbjct: 64 AAHNEEMVIGKIVDSLKELDYPKDLYDIFVIADNCTDKTAEIARK-----HGGNVYERNV 118 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY-LVERKDLIQ 193 K L + + + + + + DA++++S L NY L++ ++Q Sbjct: 119 SDKRGKGYALEWMFARVFKM----DTKYDAIAIFDADNLVSKNFLNEMNYKLLKGYKVVQ 174 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + + + T +Y F + R L G G C + L Sbjct: 175 GYIDSKNPDDSWIT-QSYSISFWTANRLFQLGRSNLGLSSQIGGTGFCMDTETLKKL--- 230 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 + LTED + +L G + +V + K Sbjct: 231 ----GWGSTCLTEDLEFTCKLVLNGHKVGWAHNAIVYDEK-------------------- 266 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 P T + Q+ RW+ G + + + A+ + + + Sbjct: 267 ---PLTLKQSWNQRKRWMQGFADVFSRFFVRLMKRAVKERSFITLDCALYTMQPYFTLFM 323 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV---IFVTGYYG 430 ++ L D IF + + + M+ I Q + I + Sbjct: 324 AASAVISLIKSFSGIDVITLDGIFRDIFYSSQSSYSQYAWMIFSIGQFLFTPIVMLLETK 383 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALK 457 L++ + V L+ N + F +R L+ Sbjct: 384 LSKKMFGVFALYSLNAVVFSEIFRYLR 410 >UniRef50_Q13CF3 Glycosyl transferase, family 2 n=10 Tax=Bradyrhizobiaceae RepID=Q13CF3_RHOPS Length = 686 Score = 251 bits (641), Expect = 7e-65, Method: Composition-based stats. Identities = 85/490 (17%), Positives = 148/490 (30%), Gaps = 61/490 (12%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYR-RYPRMSYRELYKPDEKPLAI 73 + M GL + V + R + + R + R + Sbjct: 228 FAGLLGLAMPAMIAPGLVANLLAVWFMGFATLRLAACFWPRAAQRPLRRRPDATLPIYTV 287 Query: 74 MVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 + E + + A E + + PND T+ + + R P++ ++ Sbjct: 288 VAALHREERSVAGLVAAIEALDYPREKLDVILVIEPNDLATRAAIARLGPR-PHLRVLIA 346 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKD-- 190 P +K LN L + ++DAED P +LR +R Sbjct: 347 PPVAPQTKPKALNCALAFA---------RGSFIAVYDAEDQPEPGQLRAALDAFDRHGAT 397 Query: 191 --LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 Q + + S T+ E++ + +P + +P G F + Sbjct: 398 TACAQASLCIDNITHSWL-SRTFAAEYAGQFDRLLPGLSEMNLPLPLGGTSNHFRTDVLR 456 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 A+ +D ++TED D+GFRL G + S Sbjct: 457 AI------GGWDPYNVTEDADLGFRLARFGYRSV------------------------SF 486 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR--KGAISNFV 366 E P TF RQ++RW+ G + W + LWRD +G ++ + Sbjct: 487 ASTTYEEAPITFDNWRRQRARWMKGFIQ------TWLVHMRHPLRLWRDIGPRGVLALNL 540 Query: 367 SFLAMLVMIQLLLLLAYESL--WPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF 424 L+ + L +L AW L + L WL V + Sbjct: 541 IVGGNLLTALVHPLFLGIALASLAGAWLELPAVLQPSPPSPLHWLAIAAGYASTVVVGLR 600 Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDT 484 + +L + +A W A+ Q + R W+KT H Sbjct: 601 GLAGRRQLRLGFVLLLTPAYWICLSIAAWCAVAQFVW----RPYYWEKTVHGVAKRAKAP 656 Query: 485 RSLRPLGQIL 494 G + Sbjct: 657 LPGVAAGPAI 666 >UniRef50_C4DGM8 Glycosyl transferase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DGM8_9ACTO Length = 476 Score = 250 bits (639), Expect = 1e-64, Method: Composition-based stats. Identities = 78/468 (16%), Positives = 143/468 (30%), Gaps = 59/468 (12%) Query: 25 IMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 + + + + S + S + P ++VP + E V+ Sbjct: 53 AIAATVVLYLAVIGFKLAMIAGSGRSSALHFDPGSLTVIADAALPPYTVLVPLYREATVL 112 Query: 85 GNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADC 143 + L+A + I + +D +T CA P VV P +K Sbjct: 113 PTLVSRLSALDYPRDRLQILLLIEADDAETLDAAV-TCATDPRFEIVVIPDSVPKTKPKA 171 Query: 144 LNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQIPVYPF 199 N L +++DAED P +LR +R +Q + + Sbjct: 172 CNIGLARA---------VGEFCVIYDAEDRPDPDQLRKAALAFRLSPQRVVCVQAELQYW 222 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 T + E++ + + +P G F A+ AL + Sbjct: 223 NPWTNWLTR-CFAAEYATNFSMTLHGMDRYRLAIPLGGTSNHFRTDALRAL------GGW 275 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDT 319 D ++TED D+G R+ +G + +EA R Sbjct: 276 DPYNVTEDADLGIRIARRGWDVRMMVSVTEEEANAR------------------------ 311 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD----RKGAISNFVSFLAMLVMI 375 +RQ+SRWI G + W Y LWR+ R A+ + F + ++ Sbjct: 312 LGNWLRQRSRWIKGYLQ------TWLVHSRRPYRLWREVGTRRSLAVHLTLGFATVTTLV 365 Query: 376 QLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGL 435 ++ L L + ++ GL + Sbjct: 366 NPVMWAMTILYLIVGPQPLEPLFPKYNLYGGVIAMLLGNALMCYTLMLGCVRR-GLFAAV 424 Query: 436 LSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 +L + + +A ++AL Q+L+ +R W+ T H Sbjct: 425 RVMLTIPLYWGLMSLAAYKALIQLLRPS--KRHYWELTEHGLVRSEDT 470 >UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IUP8_ACIBL Length = 1154 Score = 250 bits (639), Expect = 1e-64, Method: Composition-based stats. Identities = 76/477 (15%), Positives = 142/477 (29%), Gaps = 55/477 (11%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRK-----LSVYRRYPRMSYRELYKPDE--K 69 A+ ++ + G FI V++V + + Y R+ L + Sbjct: 715 WAAMVASLSFILFGAVSQFIIAVFFVGDVLMTGRLVFIGTLAIYDRIRGPRLTADPDYRP 774 Query: 70 PLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 +A+++PA+NE VI + DY V + T V+++ A K Sbjct: 775 AVAVLIPAYNEEKVIERTVR-SVLDSDYPKLRAIVIDDGSKDATVEVVEQLFAAEIASGK 833 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV--E 187 V + KA LN L+ +T+ F+ DA+ +I+P + L Sbjct: 834 VTLLTKPNSGKAAALNYGLEFVTE---------EIFVGIDADTIIAPDAIGLLVPHFQNP 884 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 + I R +T E+ + + G + AV Sbjct: 885 KIAAIAGNAKVGNR-VNWWTR-WQALEYITSQNFERRALDVFGAVSVVPGAIGAWRTEAV 942 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 A + ++ ED D+ L + G + + Sbjct: 943 LA------AGKYHHDTVAEDADLTMALLQDGYRVEYEDLALAYT---------------- 980 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVS 367 P T + +RQ+ RW GI+ +K + I + Sbjct: 981 -------EAPSTANGLMRQRFRWSFGIMQSVYKHRSAFKQGGALGWFALP-NVVIFQILL 1032 Query: 368 FLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTG 427 L + + L A W H S S++ +L+ L+++ + + F Sbjct: 1033 PLVSPFIDLMFLFGAGSYAWNRYMHPEST-DPSSFHKLVLYFALFLVIDFVASTIAFTLE 1091 Query: 428 YY---GLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 G L + + +K + + + AWDK Sbjct: 1092 RRQPGGQKDFWLLAHVWLQRFAYRQLFSIVLIKTLKRAIEGGEFAWDKLERMASVKP 1148 >UniRef50_A9DFB5 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DFB5_9RHIZ Length = 682 Score = 249 bits (637), Expect = 2e-64, Method: Composition-based stats. Identities = 90/505 (17%), Positives = 164/505 (32%), Gaps = 68/505 (13%) Query: 12 LYGLKVIAITLAVIMFISGLDDF--FIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDE- 68 L + + I +I +S L + +++ + R ++ R E Sbjct: 212 LALMLCLCIAAFLIWPLSALTVLHVVLTMLFSAGILLRLSALAMALRRPDTVRGGSDIET 271 Query: 69 --KPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP 125 +++P ++E + + A + D T ++ P Sbjct: 272 RMPIYTLLIPLYDEAAMAPALVARIDALRWPKSLLDVKYICEAGDEATIEALEA-QDLGP 330 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RL 181 V GP +K L L + ++DAED +P +L Sbjct: 331 ECEIVRVPAFGPRTKPKALQYALR---------GARGSLIAVYDAEDKPAPGQLLEAWAT 381 Query: 182 FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 F ++ +Q P+ +++ S + E+S L +P +P G Sbjct: 382 FRAGDDQLGCLQAPLAVANL-RSNWISGLFALEYSGLFRVLIPFLARTGMPIPLGGTSNH 440 Query: 242 FSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFL 301 F R A+ + +D ++TED D+G RL G ++ V Sbjct: 441 FKRAALE------NTGGWDPHNVTEDADLGLRLHAYGYRTGILKCATV------------ 482 Query: 302 QHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR-KG 360 E P RQ++RW+ G W ++ W Sbjct: 483 ------------ESCPVQLDVWKRQRTRWLKGWAQ------TWLVAMRNPVATWSSLGPA 524 Query: 361 AISNFVSFLAMLVMIQLL-----LLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV 415 A F +A +++ L+ L +A+ W + H + A L+ + +N Sbjct: 525 AFVVFQLLIAGMLISALVHPLMYLFIAFSFAWIASGHASVVSDMHAVLLWMDAVNIFGNY 584 Query: 416 NRIVQRVIFVTGYYGLTQGLL-SVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 F Y + +L + L+ +A WRAL Q+L + W+KT Sbjct: 585 LLFPATGWFAFTAYERSHLKRHWLLMIPAYWLLISLAGWRALTQLLANAH----LWEKTP 640 Query: 475 HDFPSVTGDTRSLRPLGQILLENQV 499 HD S + P + + ENQ Sbjct: 641 HDAESSAKTQQDPLPTCEAVPENQR 665 >UniRef50_B1IIP9 Glycosyl transferase, group 2 family protein n=20 Tax=Clostridium RepID=B1IIP9_CLOBK Length = 424 Score = 249 bits (637), Expect = 2e-64, Method: Composition-based stats. Identities = 70/453 (15%), Positives = 138/453 (30%), Gaps = 48/453 (10%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLD 96 +VY++ +S++ Y + + + + A++V A NE VIGN+ E D Sbjct: 18 IVYFISMYYLIISLFGIYRKKNNKNIGDKT--KFALIVAAHNEELVIGNIIESLKMMDYD 75 Query: 97 YENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFER 156 + Y IFV T E A K L + + I + E+ Sbjct: 76 KKLYDIFVIADNCTDKTAEIAREKGA-----IVRERFDKKRRGKGYALEWMFNIIFKMEK 130 Query: 157 SANFAFAGFILHDAEDVISPMELRLFNY-LVERKDLIQIPVYPFEREWTHFTSMTYIDEF 215 + + DA++++ L+ N + + ++Q + E T T Y F Sbjct: 131 K----YDAIAVFDADNLVHKNFLKEMNKKMCKGYKVVQGYLDSKNPEDTWITGS-YSIAF 185 Query: 216 SELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLK 275 + R L G G C + L + LTED + ++ Sbjct: 186 WSCNRMFQLARYNLGLSSQLGGTGFCIDTDILKEL-------GWGATCLTEDLEFSCKII 238 Query: 276 EKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIV 335 G + ++ + K P T + RQ+ RW+ G Sbjct: 239 LNGYKVGWAHDAIIYDEK-----------------------PLTLGQSWRQRKRWMQGFA 275 Query: 336 FQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLS 395 + + F + A+ + F+A+L+ + ++ L + Sbjct: 276 DVSSRYFFKLMKKAIKNFNFTAFDCALYSIQPFVAILLGLSAIIGLFQYVIKATNIVNNF 335 Query: 396 IFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRA 455 + L+ +++ I T + + + L + + A Sbjct: 336 NNIVYSIDFNLI----TILIILFSLFQILYTPLILILEKKFTFKVLLYYIVYPIYAITWF 391 Query: 456 LKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLR 488 + D W T H + + Sbjct: 392 PISIQGIMDKNNKEWSHTIHTRSMNIDELEKVN 424 >UniRef50_D1B5J6 Type II secretion system protein E n=3 Tax=Synergistaceae RepID=D1B5J6_THEAS Length = 559 Score = 249 bits (636), Expect = 3e-64, Method: Composition-based stats. Identities = 54/260 (20%), Positives = 107/260 (41%), Gaps = 16/260 (6%) Query: 489 PLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 LG IL++ V+TE L+ AL ++ +RLG ++ G +S + LA+AL+ Q V S Sbjct: 9 RLGDILIQAGVLTESTLEAALAEQKMSSMRLGEILVKNGWVSEKHLAEALSRQLKVPLVS 68 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLR-LENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + ++ ++ +P ++A V+PL LEND+L+V + D ++ ++L L GR++ Sbjct: 69 LSRYRPTPEVLKIVPENLARRLDVVPLSILENDKLLVATADPLNVMALDELKMATGREID 128 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEI 666 I +I +Y + + + + + ++ P L I Sbjct: 129 ISIATASEIRRAFDQFYRVQATLEEAMVEVMDEKRGAESSLNLVDVSADDAPVVKLVNSI 188 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM-QSLL 725 + S I++ E +G L L R L ++ + Sbjct: 189 MEQAVKEGTSDIHI-----EVFERSARVRYRIDG-----ALFDSLEYPRNLHPAVCSRIK 238 Query: 726 LKAGLNTEQVAQLESENEGE 745 + +G++ + +G Sbjct: 239 IMSGMDISE---RRKPQDGR 255 >UniRef50_A1SEL1 General secretory system II, protein E domain protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SEL1_NOCSJ Length = 606 Score = 249 bits (635), Expect = 4e-64, Method: Composition-based stats. Identities = 76/486 (15%), Positives = 154/486 (31%), Gaps = 62/486 (12%) Query: 7 VFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE---- 62 V + + + + + + VV + + ++ E Sbjct: 171 VTTAIVVVMCAVIWPMETAIAVVAACSLLYLVVSFYKFRLTLRALGTHLETDVTDEEIAA 230 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 + + I+VP + E G++ + + A + + +D +T + + ++ Sbjct: 231 IDERHLPTYTILVPLYKEAGIVPRLVRDINALDYPRTRLDVKLLCEEDDEETVQRIRDLQ 290 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR- 180 P+ H VV P +K N L T ++ DAED P +L+ Sbjct: 291 -LPPHFHLVVVPDSQPKTKPKACNYGLQLAT---------GDYCVIFDAEDRPDPDQLKK 340 Query: 181 ---LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 F+ + E +Q + F ++ T+ + +E+S +P A +P G Sbjct: 341 AIIAFSRVPENVVCVQAKLNHFNQDQNMLTA-WFANEYSMHFELVLPAMGAAESPIPLGG 399 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F + L A+D ++TED D+G RL +G + ++EA + Sbjct: 400 TSNHFVTAKLREL------GAWDPFNVTEDADLGIRLHREGYRTAMIDSTTLEEANSQ-- 451 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW-- 355 +RQ+SRW G + W + + L Sbjct: 452 ----------------------VPNWIRQRSRWNKGYIQ------TWLVHMRAPFALLSQ 483 Query: 356 RDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV 415 KG +S ++ + V++ + A +L+ + + Sbjct: 484 TGLKGFLSFNLTMGSAFVLLLNPIFWALTTLYVFTQAGFIEQLFPGIIFYAASALLFVGN 543 Query: 416 NRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 V + + + G + L + A W+ Q+ W+KT H Sbjct: 544 FVFVYLNVAGSLHRGEFGLTRTALLSPLYWGLMSWAAWKGFIQLF----TNPFYWEKTVH 599 Query: 476 DFPSVT 481 Sbjct: 600 GLDEGH 605 Score = 113 bits (284), Expect = 2e-23, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 64/156 (41%), Gaps = 1/156 (0%) Query: 488 RPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + Q+L + +IT++QL A L G LG ++ I+ + L AL+E + Sbjct: 1 MQMAQMLTRSGLITDDQLQRAMLEYSRTGDPLGDILVSHEAITEDVLVAALSEMYQMQRV 60 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + + + +P +A + +P+ +D +++ +D A + +G R Sbjct: 61 GLAGFTPDFEVARRLPERLAHTFQAVPVAATDDLVLLAVARPLDTEQAAEVEEALGSPFR 120 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ 642 ++ R ++ ++ +AR +L + Sbjct: 121 QLLANRTELDQLVQRVHARHYAEVSTRLLMETRPEE 156 >UniRef50_B7APV7 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B7APV7_9BACE Length = 566 Score = 248 bits (634), Expect = 4e-64, Method: Composition-based stats. Identities = 42/271 (15%), Positives = 103/271 (38%), Gaps = 20/271 (7%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 + R LG +L+ +I + QL+ AL + +G LG ++ G ++ E++ L E Sbjct: 1 MNYRKKIRLGDVLMSRGLINQNQLNMALKEQKEKGRMLGEMLVELGYVTQEKINDILCEM 60 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN---DELIVGSEDGIDPVSLAAL 597 + + + + ++ +P V Y ++P+R + + V D ++ +++ + Sbjct: 61 LNIEFIDLQVEEPEENVRDLIPEEVMRKYTLVPMRYDKNNAGVIQVAMADPMNILAMDDI 120 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAM--LYNAVQHQWLTEQQAGEIWRQ 655 G++V + I + +++ + M + + E++ + + Sbjct: 121 NIITGKQVAPYLANASDIRAYFDRVFGKKQAQNIAEMYKKEQGLVQEESEEEKLRKEDVE 180 Query: 656 YVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQR 715 P L I+ S I++ E + +GV L V+ + Sbjct: 181 NAPIVQLVNSIIEQAARQRASDIHI-----EPFEESIRVRYRVDGV-----LREVIEYDK 230 Query: 716 ELQVSMQ-SLLLKAGLNTEQVAQLESENEGE 745 L ++ L + +G++ + +G Sbjct: 231 SLLGAITARLKIMSGMDISE---KRKPQDGR 258 >UniRef50_A6DXN0 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacteraceae RepID=A6DXN0_9RHOB Length = 643 Score = 248 bits (633), Expect = 6e-64, Method: Composition-based stats. Identities = 78/465 (16%), Positives = 152/465 (32%), Gaps = 64/465 (13%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 + I + F +V +I ++ ++I+VP + E Sbjct: 203 FPITVFSIFAVWACFTLIVSAGLKIAAFVAQTSGRSDAPPAPSSPQPLPKVSILVPLFRE 262 Query: 81 TGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 T + + L+ T + + D T++ + P V V+ P + Sbjct: 263 TEIAHALIARLSRLTYPKCLLDVILVLEEEDALTRQTL-AGIDLPPWVRPVIVPDGKPRT 321 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLFNYLVERKDLIQIP 195 K +N LD + DAED ++ R F + + +Q Sbjct: 322 KPRAMNYALDFC---------QGDIIGIFDAEDAPEADQITIIARRFQQVPQEVACLQGI 372 Query: 196 VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGD 255 + + + + E++ +P L +P G F R + L Sbjct: 373 LDYYNPGQNWL-ARCFTIEYAAWFRTLMPGMARLGLAIPLGGTTLYFRRDVLEEL----- 426 Query: 256 GIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREY 315 +D ++TED D+GFRL G + +EA R Sbjct: 427 -GGWDAHNVTEDADLGFRLARHGYRTEMIHTVTEEEANCRAWP----------------- 468 Query: 316 FPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW-----RDRKGAISNFVSFLA 370 ++Q+SRW+ G + + + L+ R G ++FVS L+ Sbjct: 469 -------WIKQRSRWLKGYM------TTYLVHMRQPRLLYAQLGPRKFWGFQAHFVSALS 515 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 ++ +L ++ + H L A L+ L L + V + + Sbjct: 516 QFLLAP--VLWSFWLVLFGLPHPLDTVVPHALLVALGSLFLLVEVLNVSIHM-ASVSGPR 572 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + + + + +A ++AL +++ + WDKTTH Sbjct: 573 HRHLMAWAPTMHFYTPLGTIAAYKALYELIL----KPFFWDKTTH 613 Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 60/172 (34%), Gaps = 7/172 (4%) Query: 472 KTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTA-LRNRVEGLRLGGSMLMQGLISA 530 + H +G + PLG+ L+ I+ A L R L + +GL + Sbjct: 5 ELRHMRAQGSGARVPINPLGRELVRAGKISRSDATLAELVRRHCDTSLDRILQAEGLATE 64 Query: 531 EQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGID 590 + L A A + S + + + + L + VLP++ ++ + Sbjct: 65 DDLLAAHARRLSSTRLSAEDVMRAELVESGLDPRFLLKHGVLPIQSAAGVPVLATGGPDS 124 Query: 591 PVSLAALTRKVGRKV---RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAV 639 LA+L + RK+ R V+ R I + H + A + Sbjct: 125 ---LASLRPALPRKLDRARVVMAPRHAIQDRIAHEHRDILRSMAEARVPEIE 173 >UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase, group 2 family protein n=2 Tax=Bacteroidetes RepID=C0YGU0_9FLAO Length = 1132 Score = 248 bits (633), Expect = 6e-64, Method: Composition-based stats. Identities = 61/442 (13%), Positives = 121/442 (27%), Gaps = 59/442 (13%) Query: 7 VFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 V AT +YG+ + L I + GL + + + R+ + Sbjct: 717 VLATIIYGVSHFLVALFTIFIVLGLIRLLLMAYWAFK--------ERKKEKKLGEFPVLE 768 Query: 67 DEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN 126 ++I+VPA+NE I + + Y N+ I + + T P Sbjct: 769 SYPKVSIIVPAYNEEVNIVSSLQN-LLKQTYPNFDIIMVDDGSKDSTYDKAKAAFPDHPK 827 Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV 186 + KA LN + + DA+ + ++ Sbjct: 828 LKIFTKRNG---GKATALNFGISQ---------TDAEYVVCIDADTKLQQDAVKYLIARF 875 Query: 187 ------ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 E+ + V R ++ + E++ D + G Sbjct: 876 LNSDPEEKIAAVAGNVKVGNRV--NWLTKWQAIEYTTSQNFDRLAYANINAITVIPGAIG 933 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F R V + +L ED DI ++ + G T V Sbjct: 934 AFKRSVVIE------TGGYSSDTLAEDCDITVKILKAGYTVANENRAVAVT--------- 978 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF-LWRDRK 359 P+T ++Q+ RW GI+ +K + + LW Sbjct: 979 --------------EAPETVKQFLKQRFRWTYGIMQMFWKQRQTFLNPRYKGLGLWAMPN 1024 Query: 360 GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIV 419 + ++ + ++ + + IF + L+ + Sbjct: 1025 ILLFQYIIPFFSPLADVIMFFGILSGNGDKIFTYYLIFLLVDASLALIAFIMQREKLINL 1084 Query: 420 QRVIFVTGYYGLTQGLLSVLRL 441 +I Y ++ L Sbjct: 1085 LYIIPQRFGYRWLMYIVLFKSL 1106 >UniRef50_Q168M3 Glycosyl transferase, putative n=12 Tax=Rhodobacterales RepID=Q168M3_ROSDO Length = 635 Score = 248 bits (633), Expect = 6e-64, Method: Composition-based stats. Identities = 80/491 (16%), Positives = 160/491 (32%), Gaps = 56/491 (11%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 W F + + L + + + + + LS R Sbjct: 188 WAAMAFCSLVSALVFAPAWSVTALALWAVITLLMTSTLKAAALFIHLSGARAVQGARASG 247 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 ++++VP E + G + L+ T ++ + D T++ + Sbjct: 248 KPFRM-PRVSVLVPLLKEKEIAGQLIARLSQLTYPKSLLNVVLVLEEGDTLTRQTIART- 305 Query: 122 ARFPNVHKVVCARPGP-TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 + + G T+K LN LD + + DAED ++ Sbjct: 306 TLPDWMSVIEVPEAGGLTTKPRALNYALDFCK---------GSIIGVWDAEDWPEADQIE 356 Query: 181 ----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 FN + +Q + + + + + E++ +P + VP Sbjct: 357 KVVTRFNTAPDNVVCLQGVLDYYNSRSSWL-ARCFTIEYAIWWRIVMPGIARMGLVVPLG 415 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G F R A+ L +D ++TED D+G RL G + +EA R Sbjct: 416 GTTLFFKRTALEEL------GGWDAHNVTEDADLGVRLARHGYKTELLPTVTREEATSRP 469 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 VRQ+SRW+ G + F + +L + WR Sbjct: 470 W------------------------AWVRQRSRWLKGFMITYFVHMRRPGALLRDLGFWR 505 Query: 357 DRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 G + F++ ++ +L ++ + H +++ G+ + L Sbjct: 506 -FMGVQTIFLAAVSQFAAAP--VLWSFWLTFFGVAHPVAMTLGAPVMWGLAGFFIATEAL 562 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 ++ V+ V+G G + V + + + +A ++AL ++++ WDKT H Sbjct: 563 SLLLGVVAVSGK-GHRHLIPFVPSMMFYFTLGTVAAYKALWELVR----APFFWDKTQHG 617 Query: 477 FPSVTGDTRSL 487 T Sbjct: 618 VSVQTDPAPRK 628 Score = 95.9 bits (237), Expect = 5e-18, Method: Composition-based stats. Identities = 36/146 (24%), Positives = 56/146 (38%), Gaps = 1/146 (0%) Query: 488 RPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 P+G++LL+ I L AL R LG M+ +GLI+ + + ALA Q Sbjct: 21 PPIGRVLLDQGKIASNDLTHALNLQRRIDAPLGDIMISEGLINKKDVLSALAAQARAEGA 80 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++ + +PAS+ L + V+P R + L V + L R++ Sbjct: 81 DLELDPPEMEMANRLPASLCLRFGVVPWREDARALYVATSSPAGFTQLLDACGPQSRQLF 140 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPR 632 VIV QI Y Sbjct: 141 PVIVDDAQIQAHQSRLYGNELAQGAA 166 >UniRef50_B5YDI2 Type IV pilus assembly protein PilB n=20 Tax=Bacteria RepID=B5YDI2_DICT6 Length = 873 Score = 248 bits (632), Expect = 7e-64, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 111/275 (40%), Gaps = 15/275 (5%) Query: 474 THDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQ 532 T T + LG++LLE +IT+EQLD AL + +G+RLG ++L L+ Sbjct: 300 KKVAKPSTRRTGRRKLLGEVLLEKNLITKEQLDEALALSSKKGIRLGEALLELKLLDDVA 359 Query: 533 LAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPV 592 LA+ L+EQ + ++S+ +I L + A +LPL +N ++VG D + + Sbjct: 360 LAKLLSEQFDIPFKSLKEVKIDHDLAKLISPQKARENLILPLYRDNGRIVVGIVDPSNIL 419 Query: 593 SLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWL--TEQQAG 650 +L L +V VIV R +++ + + + + + E Sbjct: 420 ALDDLRMVTRSEVFPVIVPRNELIDAINQIWGSEEVEKVLEEIIVQKEEEETQYQEVSLE 479 Query: 651 EIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRV 710 EI Q P L IL S I++ E + + +GV+ + + Sbjct: 480 EISSQEGPIAKLVNSILVDAVKRGASDIHI-----EPTEKNVRVRFRIDGVLHEIMFIQ- 533 Query: 711 LTIQRELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 +R + + + + ++ + +G Sbjct: 534 ---KRFQAAIVSRIKIMSDMDISE---RRIPQDGR 562 Score = 80.9 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 61/189 (32%), Gaps = 15/189 (7%) Query: 485 RSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 L IL ++ LD L N + G + +L +GLIS E+L L+E G Sbjct: 5 NINERLIGILKSRNIVPAAILDNILSN-LRGKDIQEILLEEGLISKEKLVDLLSEILGWK 63 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 ++ +P + + +PL +E + VG + P ++ + G Sbjct: 64 VLVGKEFKPNEEAAKSIPPFLTKFHNFIPLGIEEKTIKVGFFPPVKPTAIEDIRLLTGYD 123 Query: 605 VRYVI----VLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 V + + + + + + L + L + G Sbjct: 124 VEPYLLKISSGEEDLSSLIA----APKTTNVDIDLEPEKVEEPLKGIEIGTWDESE---- 175 Query: 661 FLFAEILTT 669 +I+ Sbjct: 176 --LNKIIEE 182 >UniRef50_A3VPZ4 Putative uncharacterized protein n=2 Tax=Bacteria RepID=A3VPZ4_9PROT Length = 512 Score = 248 bits (632), Expect = 8e-64, Method: Composition-based stats. Identities = 86/484 (17%), Positives = 159/484 (32%), Gaps = 61/484 (12%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDE--KPLAIMV 75 I L ++ ++ F +R LS+ + P+E P I+ Sbjct: 82 IIGPLQKLIALNLGVTLFYLFQAGLRSAVLSLSLAQPRRLTLPPPPIAPEERLPPFTILC 141 Query: 76 PAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 P ++E + ++ L E I + +D T C R P V+ Sbjct: 142 PVYDEAESLPHLVGSLLLLDYPRERLDIKIILEADDRATIAAARTHC-RAPMFDLVLVPP 200 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLFNYLVERKD 190 P +K LN+ L +++DAED +P +L R F L + Sbjct: 201 SAPRTKPKALNHAL---------WTAKGDYIVIYDAEDRPAPDQLTLAARTFAALPDHIA 251 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 +Q + + R+ T T + + E++ L +P AL+ VP G + A+ Sbjct: 252 CLQCRLNYYNRDTTILTRL-FALEYALLFDMTLPGLAALSAPVPLGGTSNILRTDILMAV 310 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 +D ++TED D+G RL G + ++EA + Sbjct: 311 ------GGWDPFNVTEDADLGLRLHRAGYETRLLNSTTLEEATDET-------------- 350 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 +RQ++RW+ G + W R G + Sbjct: 351 ----------GAWLRQRTRWMKGFMQ------TWLVHSRRAPRT--GRFGHFLTVHGVVG 392 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTG-YY 429 V+ L+ +A+ I + +L L L N + ++ + Sbjct: 393 GTVLAALINPVAWAIYGAWILGVDGIARLFPTPLNVLALTAFLGGNLLHLYMMMIAPLRR 452 Query: 430 GLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRP 489 + + ++ +A +RAL Q+++ R W+KT H +T Sbjct: 453 RWHDLVPYAVLSPLYWILQSVAAYRALWQLIR----RPSYWEKTKHGRGLSPEETLRWHS 508 Query: 490 LGQI 493 + Sbjct: 509 TLRQ 512 >UniRef50_A4WQA1 General secretory system II, protein E domain protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WQA1_RHOS5 Length = 628 Score = 247 bits (630), Expect = 1e-63, Method: Composition-based stats. Identities = 85/476 (17%), Positives = 153/476 (32%), Gaps = 61/476 (12%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 + + +++ L F RI ++ RR P ++++V Sbjct: 193 LMLAPGLVVLALSLWALFAMTCGTALRIATAIATLRRRP-ADPPCPPLLRLPIVSVIVAL 251 Query: 78 WNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 + E + G + ++ I + D T++ + E P + V+ Sbjct: 252 YQEEDIAGRLVARLGRIDYPHDRLEILLVVEEADLRTRKALVE-ARLPPWMRIVISPAGA 310 Query: 137 PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER----KDLI 192 +K LN LD + ++DAED P ++R R + Sbjct: 311 IRTKPRALNVALDHC---------RGSIVGVYDAEDAPDPDQIRRVVEGFSRRGSQVACL 361 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 Q + + S + E++ +P + L VP G F R A+ Sbjct: 362 QGQLDYYNPRTNWL-SRCFTIEYASWFRLMLPGLDRLGLAVPLGGTTLFFRREALE---- 416 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 D A+D ++TED D+G RL G + +EA R Sbjct: 417 --DLGAWDAHNVTEDADLGIRLARHGYRTDLIDTVTGEEANCRALP-------------- 460 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA-ISNFVSFLAM 371 ++Q+SRWI G + W + LWR + F Sbjct: 461 ----------WIKQRSRWIKGFMM------TWAVHMRDPVLLWRQLGPWRFAGFQVMFLG 504 Query: 372 LVMIQLL--LLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYY 429 + LL +L ++ L H ++ + L ++ L G I ++ Sbjct: 505 SLSQTLLAPVLWSFWLLALGLPHPVTPLLSTPALWAIVGLLLGAEGTSIALGILA-LRLT 563 Query: 430 GLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTR 485 L V + N + A ++AL ++L+ WDKT H + Sbjct: 564 RHKLNPLWVPTMHLYNPLATFAAYKALWELLR----APFYWDKTRHGLFDGSSRGP 615 Score = 76.2 bits (186), Expect = 4e-12, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 48/141 (34%), Gaps = 1/141 (0%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLR-LGGSMLMQGLISAEQLAQALA 538 + LG +LL + ++ ALR L +L +G + E++ A Sbjct: 8 PLPEAPVPDALGVMLLRQGHLAPHRIMGALRRSSGHAAGLADVLLAEGAMDEEEILALTA 67 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 Q+G+ LI + L +LP+ +++ + L Sbjct: 68 RQSGLPLLDPATGAADPRLIDRLGVRTCLRETLLPVHDVGGAVLIAAPSPESFRRHGPLL 127 Query: 599 RKVGRKVRYVIVLRGQIVTGL 619 ++ +V V+ R I L Sbjct: 128 GQLFGRVIPVLATRTAIEGAL 148 >UniRef50_Q0ASE8 Glycosyl transferase, family 2 n=1 Tax=Maricaulis maris MCS10 RepID=Q0ASE8_MARMM Length = 537 Score = 247 bits (630), Expect = 1e-63, Method: Composition-based stats. Identities = 87/474 (18%), Positives = 152/474 (32%), Gaps = 76/474 (16%) Query: 22 LAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNET 81 + V + ++ + R +V + + D IMV + E Sbjct: 87 ITVPTLTLTIASLVSCALFTGLIVLRLAAVLAKPAWLDAPGCADGDLPTATIMVALYREA 146 Query: 82 GVIGNMAELAATTLDYENYHI--FVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 V+ ++A ++DY + + +D +T + A P ++ P + Sbjct: 147 AVLPDLARG-LASIDYPTDRVAFKLVLEADDTETIHVARRM-ALDPRFEIIIVPPGNPRT 204 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL----FNYLVERKDLIQIP 195 K LN L +HDAED P +LR F +R +Q P Sbjct: 205 KPRALNYALRLC---------RSELVTIHDAEDRPDPYQLRRAAEAFRVADQRLACVQAP 255 Query: 196 VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGD 255 + + RE T T + E++ +P+ + L +P G F R A+ + Sbjct: 256 LNWYNREETWLTR-QFALEYAAHFHALLPLYQRLGWPLPLGGTSNHFRRDALVRV----- 309 Query: 256 GIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREY 315 +D ++TED D+G RL G + + E Sbjct: 310 -GGWDAWNVTEDADLGLRLHAAGYRCGLIEPKTL------------------------EE 344 Query: 316 FPDTFSTAVRQKSRWIIGIVFQG-----FKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 P V+Q++RWI G + W L A + + L+ Sbjct: 345 APLRLVPWVKQRTRWIKGYAQTIGVLAVRRDTPWRRVW--PGMLVLGGAVASALLHAPLS 402 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 + +I L E+ A F+ SA + + + RI Sbjct: 403 LACLIALATRAGPEAASLPALAFMLAGYASAITCAAVAMRRAGLPVRIRDLAG------- 455 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDT 484 + L+ +A RAL+Q+L W+KT H ++T Sbjct: 456 ----------MPAYWLLQTLAAARALRQLL----TDPHRWEKTEHGVSAMTRSP 495 >UniRef50_A3TTM2 Glycosyl transferase, family 2 n=1 Tax=Oceanicola batsensis HTCC2597 RepID=A3TTM2_9RHOB Length = 650 Score = 247 bits (630), Expect = 2e-63, Method: Composition-based stats. Identities = 88/479 (18%), Positives = 144/479 (30%), Gaps = 62/479 (12%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 LA + + + V + L R P L ++I+VP + E Sbjct: 215 ALARFLVAGAILALVVASVTKLSAAITHLLRPERPPPPCPEHL----LPVMSILVPLYRE 270 Query: 81 ---TGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP 137 V+ E + +D T+ A P + V P Sbjct: 271 DRVASVLPRRLER--LDYPRARLDVIFVLEESDDVTRAA-LAAAALPPWIRIVTVPDGQP 327 Query: 138 TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE----RKDLIQ 193 +K +N LD ++DAED P +LR +Q Sbjct: 328 RTKPRAMNYALDFCI---------GDIIGIYDAEDAPEPDQLRKVAAGFAAASGETACLQ 378 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + + T + E++ +P L VP G F R A+ A+ Sbjct: 379 GALDYYNAAENWITR-CFTIEYNTWFRLVMPGMAKLGFAVPLGGTTAFFRRDALEAV--- 434 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 A+D ++TED D+G RL G + +EA R Sbjct: 435 ---GAWDAHNVTEDADLGMRLARAGYRTRVIDTATGEEASARPV---------------- 475 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 + VRQ+SRW+ G + + +L + W+ + FL ++ Sbjct: 476 --------SWVRQRSRWLKGYLMTYAVHMRRPRALLRDLGPWQ----FLGFQAHFLTAIL 523 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 L LL L I + M LL F ++ T Sbjct: 524 HFALAPLLWLFWLVIFGVDLPLIAIDTGPAMRLLATAFLGFELLVMTLGAIATRGPRHRF 583 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQ 492 + L + +A W+AL ++ R WDKT H D ++ + Sbjct: 584 LWPWIPSLHLYWPMGTLAMWKALVEL----ACRPFYWDKTEHGHTLTEEDQPTVPIAPE 638 Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 35/116 (30%), Gaps = 4/116 (3%) Query: 524 MQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIV 583 +G ++ +L LA G + L L + LP R D + + Sbjct: 65 AEGRLTGPRLLTELAALTGAQPVDLGLTPADPGLARRFDGPTCLRHECLPWRQTADTIWI 124 Query: 584 GSEDGIDP-VSLAALTRKVG---RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML 635 + +LA L G + R V+ R I +GL + Sbjct: 125 ATARPERFDRALADLMPGTGPHPPETRMVLADRSAIQSGLAKVHGPELQRRMVTRT 180 >UniRef50_A3VHJ3 Glycosyl transferase, family 2 n=2 Tax=Rhodobacterales RepID=A3VHJ3_9RHOB Length = 684 Score = 245 bits (626), Expect = 4e-63, Method: Composition-based stats. Identities = 86/477 (18%), Positives = 153/477 (32%), Gaps = 68/477 (14%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP----LA 72 V+ + +V ++ + RR+ + + P + ++ Sbjct: 236 VLLTAPTAAWAMLTGWAILTLIVNTGVKLAAAVIHTRRHRHAAPIRVVGPHDPRRRPVVS 295 Query: 73 IMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVV 131 I+VP + E + G + + T E I + +D TQ + + ++ Sbjct: 296 ILVPLYREREIAGRLVKRLERLTYPRELLDICLIVEEDDTLTQETLSN-ARLPAWMRQIT 354 Query: 132 CARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLFNYLVE 187 R G +K LN LD ++DAED ++ F Sbjct: 355 VPRGGVRTKPRALNFALDFA---------RGTIIGVYDAEDAPDADQIDRIVARFAEAPP 405 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 R +Q + + S + E++ +P E + VP G F R + Sbjct: 406 RVACLQGMLDYYNARTNWL-SRCFTIEYATWFRIVLPGMEKMGFAVPLGGTTLFFRRGVL 464 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 L +D ++TED D+G RL G T V +EA R Sbjct: 465 EQL------GGWDAHNVTEDADLGIRLARLGYTTELVETVTKEEANCRVWP--------- 509 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-----RKGAI 362 ++Q+SRWI G + + LWR+ G Sbjct: 510 ---------------WIKQRSRWIKGYAM------TYGVHMRDPRRLWRELGARRFWGVQ 548 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV 422 F+ L+ L++ +L +Y L H L+ + + I + Sbjct: 549 IVFLGTLSHLILAP--VLWSYWLLAFGLPHPLAEVMPGWVVWAMFATFLTAEAINIAVGM 606 Query: 423 IFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS 479 I V+ G L V L + +A + L ++++ + WDKT H Sbjct: 607 IAVSTP-GRRFLKLWVPTLHVYFPLASLAALKGLGEIVR----KPFFWDKTQHGHDD 658 Score = 90.1 bits (222), Expect = 3e-16, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 59/179 (32%), Gaps = 1/179 (0%) Query: 486 SLRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 LG+ L++ ++ E L A + R LRLG +L +G ++ LA+ALA+ Sbjct: 58 RKSGLGEKLVDMGILRPEDLLAARKARAGTALRLGDVLLARGFVTPYTLARALAKVYDTT 117 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + LI + L +P + D I+ + + + Sbjct: 118 LVDLRRDPPDVRLIDAVGLDACLRLGFVPWKRVGDTTIIACACPDRFARIRSGLPESFGA 177 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 R V+ +I L RR + + + A +W L Sbjct: 178 CRMVVAQGNEIDAALVSLRHRRMVTQAETRVAEDMSCRTWDTVGAARLWLTLAGAVALV 236 >UniRef50_Q28T00 Glycosyl transferase family 2 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28T00_JANSC Length = 654 Score = 245 bits (626), Expect = 4e-63, Method: Composition-based stats. Identities = 83/484 (17%), Positives = 148/484 (30%), Gaps = 54/484 (11%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 +L A + GL ++ I LA ++F + F + V R L Sbjct: 209 ILAGVAVAVAGLLLVPILLAQVVFGVAVVVFIANCVLKFAAFGRTLRGAASPSATPETSA 268 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 ++I+VP + E V G + E E I + +DP T + Sbjct: 269 QFLHNPTVSILVPLFREPEVAGALVERLRRLDYPRERLDIILAVEEDDPLTLSALQ-TGT 327 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 + +V R P +K LN VL+ ++DAED P +++ Sbjct: 328 LPAWMRAIVVPRGSPQTKPRALNYVLNYA---------RGDIVGIYDAEDRPEPDQIQRV 378 Query: 183 NYLV----ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGV 238 +Q + + + + + E++ +P + L VP G Sbjct: 379 VQRFAEVPADVACLQGRLDYYNARHNWLSRL-FTVEYAAWFRVLLPGVQRLGLVVPLGGT 437 Query: 239 GTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQR 298 R + + A+D ++TED ++G RL G V +EA Sbjct: 438 TVFLRRNVLEGV------GAWDAHNVTEDAELGLRLARAGYQTEIVETTTFEEANAATLP 491 Query: 299 KFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR 358 ++Q+SRW+ G + + +L WR Sbjct: 492 ------------------------WIKQRSRWLKGYLMTWGAAMRRPRALLNELGPWRFA 527 Query: 359 KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI 418 I + L L L ++ WH + L + ++ + Sbjct: 528 WLQIQFAGAVLGFLTAPLL---WSFMLKPFGVWHPMDGVMSPFAYGVLGVVMVSGLIGSV 584 Query: 419 VQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 + L + L +A W L +++ + W KTTH Sbjct: 585 AISFYACRAKH-LRHLRPIAPLVEPYYLFGTIAAWIGLFELIA----KPFFWAKTTHGKF 639 Query: 479 SVTG 482 T Sbjct: 640 GATQ 643 Score = 91.3 bits (225), Expect = 1e-16, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 61/161 (37%), Gaps = 6/161 (3%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQLAQALA 538 + + + ++LL ++T +Q+ A R L LG ++ QG IS +L LA Sbjct: 36 PSPERSEPAGVRELLLARGLVTPDQMRAAQDASRGTSLSLGEILIAQGAISEPELLSTLA 95 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 + GV + +SL +PA+ ++ + + L++ + + L Sbjct: 96 QTYGVGIADLSGDLPDTSLAPLLPAAASITAEAVIWKRAGSALVIATSRP---DRIQDLR 152 Query: 599 RKV--GRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYN 637 + G+++ V+ R QI R Y Sbjct: 153 ALLPGGQRIMTVLASRNQITEAQRKLYGPHLARKAEGRAPE 193 >UniRef50_B9L0Q0 Glycosyl transferase, group 2 family protein n=2 Tax=Bacteria RepID=B9L0Q0_THERP Length = 635 Score = 245 bits (626), Expect = 4e-63, Method: Composition-based stats. Identities = 67/421 (15%), Positives = 128/421 (30%), Gaps = 55/421 (13%) Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDE 119 + L + ++VP + E V+ ++ E I + +D +T Sbjct: 247 QALRDDELPMYTVLVPVYREANVVPHLIENLRNLDYPASKLEILLLIEEDDEETLAAAKA 306 Query: 120 VCARFPN-VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME 178 AR P V +V P +K N L ++DAED P + Sbjct: 307 --ARPPETVTFIVVPNGLPKTKPKACNVGLLFA---------RGEFLTIYDAEDRPEPDQ 355 Query: 179 LR----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVP 234 L+ F +Q + + T M + E+S + + L +P Sbjct: 356 LKKAILAFRKGSPDLVCVQAALNYYNATENLLTRM-FTLEYSYWFDYVLTGLDRLRLPIP 414 Query: 235 SAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKE 294 G F + L +D ++TED D+G R +G + Sbjct: 415 LGGTSNHFRVGRLREL------GGWDPFNVTEDADLGIRAAARGYRVGVINST------- 461 Query: 295 REQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFL 354 E + +RQ+SRWI G + T Sbjct: 462 -----------------TWEEANNHVGNWIRQRSRWIKGYLQTVLVH---TRHPLRLVRT 501 Query: 355 WRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLM 414 R + A + L+ L + W ++ + N + Sbjct: 502 AGIRNTFGFVLLIGGAPFAFLSLIPLWSLTLTWIVTRTHAFDILFPPVVLYISLFNLLIG 561 Query: 415 VNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 ++ + Q + L + +++ +A+++A+ Q++ + W+KT Sbjct: 562 NGVMIYLGMLAGFKRRRYQLIPFALLNPFYWILHSIASYKAVWQLI----TKPFYWEKTR 617 Query: 475 H 475 H Sbjct: 618 H 618 Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 32/148 (21%), Positives = 59/148 (39%), Gaps = 2/148 (1%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALA 538 TRS +G+ L+ ++ E L+ AL R G R+G ++ GL+ +QL Q LA Sbjct: 10 QPQVTRSRERIGEALVSRGLLRPEDLERALEYQRRTGDRIGRILIALGLVKRQQLYQVLA 69 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 E G + + + + L + P+R +E+ V + + + Sbjct: 70 ELWGHPYVDLLREPLDARLARLFDPEALVQRRCFPVRRVGNEVFVATAEPPGAELEEYIR 129 Query: 599 RKVGR-KVRYVIVLRGQIVTGLRHWYAR 625 +G VR ++ I +R + Sbjct: 130 SVLGSVTVRPLVTSEWDIDYAIRTIFRD 157 >UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonadaceae RepID=C8WBS4_ZYMMN Length = 1126 Score = 245 bits (626), Expect = 4e-63, Method: Composition-based stats. Identities = 63/483 (13%), Positives = 143/483 (29%), Gaps = 56/483 (11%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 W +F+ + + A+ + + L F+ ++ R+ R+ + Sbjct: 697 WNFALFSCLGASVIALRWIFAIAITLGILRALFLSAFSIIQ--------ARKENRLIFPP 748 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 + E+ +++++PA+NE VI + + N + V + ++ + V+ A Sbjct: 749 I--DPERTISVLIPAFNEEAVIEASIRRVLASAEVNNIEVIVIDDGSTDNSSQIVESQFA 806 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 P V + + KA LN+ + I DA+ P + Sbjct: 807 DDPRVQLIRLSNG---GKARALNHGVQKAK---------GEIIIALDADTHFEPRTIARL 854 Query: 183 N--YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 + + + R + + E+ + L G Sbjct: 855 TRWFSDPKLGAVAGNAKVGNRI--NLITRWQALEYITAQNLERRATVLLNAMTVVPGAVG 912 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 + + + F Q+L ED D+ ++E + + V Sbjct: 913 AWRAETLRQV------GGFPDQTLAEDQDLTIIIQEHDWAVRYDPYAVAWT--------- 957 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P+T RQ+ RW G + +K + + Sbjct: 958 --------------EAPETIRALARQRFRWAFGTLQCLWKHWSIIKNRRPKGLAYIGLPQ 1003 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWL-MTLLWLNFGLMVNRIV 419 ++ + F + +I L L+++ + + + G M W F L+ Sbjct: 1004 SLIFQIGFATISPIIDLALVISIMATVFAVYQHGWVQQGDDLQKMAAYWSVFTLIDLMSG 1063 Query: 420 QRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS 479 + + L + + I + +AL Q ++ +++ H Sbjct: 1064 VVAFALERKEKWSLLWLLIPQRIGYRQIMYYVVIKALTQAIRGPKVGWDKLERSGHVQTE 1123 Query: 480 VTG 482 Sbjct: 1124 SNK 1126 >UniRef50_Q7D1E2 Glycosyltransferase n=1 Tax=Agrobacterium tumefaciens str. C58 RepID=Q7D1E2_AGRT5 Length = 515 Score = 245 bits (626), Expect = 4e-63, Method: Composition-based stats. Identities = 76/473 (16%), Positives = 144/473 (30%), Gaps = 63/473 (13%) Query: 22 LAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNET 81 +A+ + L +++ + + + + ++ + I+V + E Sbjct: 95 IALAWLHATLSMLYLNTLLFRLFALIHIPRE-TDVETALSLRHENELPVYTILVALYREE 153 Query: 82 GVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSK 140 VI + I + +D T + + P++ V P +K Sbjct: 154 AVIEQLVSALERLDWPRSRLDIKLVCEADDGATIEAIRRINPG-PHMEIVQVPPSEPRTK 212 Query: 141 ADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQIPV 196 L L + +++DAED P +LR E +Q P+ Sbjct: 213 PKALTYAL---------SGARGTFVVVYDAEDRPHPQQLREAYAAFRNQPEDMACVQAPL 263 Query: 197 YPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDG 256 + ++ + E++ L +P +P G F A+ Sbjct: 264 IISNASSSWLSA-CFALEYAGLFRCMLPALATHGLPLPLGGTSNHFRTAALRR------A 316 Query: 257 IAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYF 316 A+D ++TED D+G RL G +R +++ Sbjct: 317 GAWDPYNVTEDADLGLRLHRLGYRCGVIRRQTLED------------------------A 352 Query: 317 PDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-RKGAISNFVSFLAMLVMI 375 P + + Q++RW G + W + R A F + +++ Sbjct: 353 PTSLPVWLNQRTRWFKGWLQ------SWLVMTRTPFATARTMGWFAYMTFQLLIGGMLLS 406 Query: 376 QLLLLLAYESLWPDAWHFL-----SIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 L L + SL A +F L + LN V Y Sbjct: 407 SLTHPLLFVSLVFMAIAIRENGVDLLFRWQGALFFIDALNIVGSYTIFVLMGRSRMIAYE 466 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 Q L + L+ +A WRA+ ++ R W+KT H + Sbjct: 467 RRQVGRRWLAMPLYWLMLSVAAWRAVVEL----KTRPFVWNKTPHVPVAKDKT 515 >UniRef50_C6J4S3 Glycosyl transferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J4S3_9BACL Length = 413 Score = 244 bits (624), Expect = 6e-63, Method: Composition-based stats. Identities = 81/466 (17%), Positives = 155/466 (33%), Gaps = 58/466 (12%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 L VI I+L VI+ + G+ F + + +YR+ ++ Y K A++ Sbjct: 2 LDVILISLQVILAVVGVYQFGLALF----------GMYRKKNKVQYEP-----SKSFAVL 46 Query: 75 VPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 V A NE V+G + E E Y +FV +T V ++ V Sbjct: 47 VAAHNEEKVVGALMENLKQMNYPKELYDVFVICDNCSDNTANIVRS-----HGMNACVRT 101 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN-YLVERKDLI 192 P K + +L + + R + ++ DA++++ P LR N L +I Sbjct: 102 NPNLRGKGYAIEWMLKQLWKMPR----QYDAVVMFDADNLVHPDFLREMNNDLCAGARVI 157 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 Q + E + T+ Y + + R L G G CF + + Sbjct: 158 QGYIDTKNPEDSWITAS-YGISYWYCNRLWQLSRTNLKMANFLGGTGMCFETELLKEI-- 214 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 + SL ED + R E+G+ +F + + K Sbjct: 215 -----GWGATSLVEDLEFTMRCVERGVYPVFNYDAKLFDEK------------------- 250 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 P TF + RQ+ RW+ G + ++ A+ ++ +L Sbjct: 251 ----PLTFKASARQRLRWMQGHFTVARRYFFPLLWKSIKERNMVKLDMALYGANVYIVLL 306 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 + + +SL + H +++ ++ + + + + + + V Sbjct: 307 TFLLTAFIWVDQSLMQEP-HVKTLYGYLPMWVSYVAIVANVFIFLMAMFLEKVKSKKVYA 365 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 +L + L I F A + + H RV + Sbjct: 366 YLVLFPIYLISWWPITFYAFFTQNNKQWSHTQHTRVVRLEEVQSNK 411 >UniRef50_A7HQ64 Glycosyl transferase family 2 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQ64_PARL1 Length = 652 Score = 244 bits (624), Expect = 7e-63, Method: Composition-based stats. Identities = 85/483 (17%), Positives = 145/483 (30%), Gaps = 62/483 (12%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRR--YPRMSYRELYKPDEKPLAIMVPAW 78 L+ G+ I + ++ L+ R +Y D +MVP + Sbjct: 220 LLSGATATIGITFLLIATLRYMSIFIGLLAEPTREELAFSNYGVPLDKDLPVYTVMVPLF 279 Query: 79 NETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 E V+ + A LDY I +D +T + + +V Sbjct: 280 REASVLP-ILATALRELDYPASKLDIKFIFEESDVETYEAAKAL-RLPDHFEFIVVPTSF 337 Query: 137 PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLI 192 P +K N L +++DAED P +L+ F E+ + Sbjct: 338 PQTKPKACNFALPFA---------RGEFLVIYDAEDAPEPQQLKKAVSAFRLGDEKLACV 388 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 Q + + T + E++ +P L +P G T F + Sbjct: 389 QAQLNYYNWRENWLTR-QFALEYAAFFDLMLPTMARLRLPIPLGGTSTHFRTELLR---- 443 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 + A+D ++TED D+G R G +R +EA + Sbjct: 444 --NAGAWDPNNVTEDADLGLRFALHGYRCSIIRSTTEEEANCK----------------- 484 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR-KGAISNFVSFLAM 371 VRQ+SRWI G + + + L+R F + Sbjct: 485 -------LPNWVRQRSRWIKGWMQ------TYLVRMRHPVRLYRALGLRGFIGFQVLIGG 531 Query: 372 LVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGL 431 + LL Y L + L L V + GL Sbjct: 532 STLSSLLHPFLYLGLIVPLIESGLAGDLTG-LTVFHLLVLVSGYALAVSAGLAAASARGL 590 Query: 432 TQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLG 491 Q + L + L+ A ++AL Q++ + W+KT H + Sbjct: 591 PQLFIHTLTMPAYWLLLSFAAYKALWQLVV----KPFHWEKTDHGISRMLPARLKGPVSS 646 Query: 492 QIL 494 Q + Sbjct: 647 QFV 649 Score = 94.3 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 61/176 (34%), Gaps = 3/176 (1%) Query: 461 QHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRV-EGLRLG 519 + R++ K P S LG++ +E ++ E L AL + G LG Sbjct: 8 ARVEQARLSERKPGRRAPRGRPQPGSRPLLGEMAVEAGLVQPEALAPALEKQAHWGGPLG 67 Query: 520 GSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEND 579 ++ G + +A+ Q G+ + + +SL + L LP R Sbjct: 68 RILVSIGAMRVADVARLYGRQRGLPFVDLQEEPHETSLASSERLDFYLREMCLPWRRRAG 127 Query: 580 ELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML 635 E I + D + A+T + GR + + I + + + L Sbjct: 128 ETIYVAADPDRSRA--AITGQEGRPLPVFVTSPRDISRTVTRAFGQALTDRAIFHL 181 >UniRef50_Q0FJ05 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacteraceae RepID=Q0FJ05_9RHOB Length = 616 Score = 244 bits (622), Expect = 1e-62, Method: Composition-based stats. Identities = 76/451 (16%), Positives = 139/451 (30%), Gaps = 58/451 (12%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLD 96 V + +I R ++ + + ++++VP + E + + L+ Sbjct: 198 AVLILAQITRLAALLASRRSVPDAPVGPVRLPRISLLVPLFREERIAAALLDRLSRLDYP 257 Query: 97 YENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFER 156 + + +D T+ + P + + +K LN L Sbjct: 258 RNRLEVLLLLEASDDTTRAALAAT-RLPPWLRVIEVPGGPIATKPRALNYGLTFA----- 311 Query: 157 SANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQIPVYPFEREWTHFTSMTYI 212 ++DAED +P +L +Q + + S + Sbjct: 312 ----QGDIVGIYDAEDSPAPDQLLKVAGHFCRAPPETACLQGILDFYNPHANWL-SRCFT 366 Query: 213 DEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGF 272 E++ +P L +P G F R A+ + +D ++TED D+G Sbjct: 367 IEYATWFRLVLPGLARLGFPIPLGGTTVFFRREALDRV------GGWDAHNVTEDADLGI 420 Query: 273 RLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWII 332 RL G V +EA R +RQ+SRW+ Sbjct: 421 RLARFGYVTELVPLVTREEANNRTWP------------------------WIRQRSRWLK 456 Query: 333 GIVFQGFKTHKWTSSLTLNYFLWRDRKG--AISNFVSFLAMLVMIQLLLLLAYESLWPDA 390 G + W L RD + V FL ++ L L L Sbjct: 457 GYMV------TWLVHSRRPLTLLRDLGAWRFVGMQVLFLTTILQFLLAPALWSFWLLLLG 510 Query: 391 WHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFM 450 W ++ + LL+ F + + + L V LF + + Sbjct: 511 WQPAALAILTEAQKQLLFGGFLVAEAISLMVSVAAVARSPHQGLLPWVPTLFLYFPLATV 570 Query: 451 ANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 A ++AL +++ R WDKT H + Sbjct: 571 AIYKALLELV----TRPFFWDKTMHGHSAPD 597 Score = 55.4 bits (132), Expect = 9e-06, Method: Composition-based stats. Identities = 18/106 (16%), Positives = 30/106 (28%), Gaps = 2/106 (1%) Query: 532 QLAQALAEQNGVAWESID--AWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI 589 Q A+A A ++ +L +P L YA LP R + L++ + Sbjct: 45 QFAEAPAPLWRGPVLHLERNQRPPDPALAPLLPPETCLRYAALPWRQVGNTLLLATARPE 104 Query: 590 DPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML 635 + A V I + + D L Sbjct: 105 RFDAARAALPPGCDDVVMATASMADIHAEIAARHGPALARDAETRL 150 >UniRef50_C6QAR9 Glycosyl transferase family 2 n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QAR9_9RHIZ Length = 506 Score = 243 bits (621), Expect = 1e-62, Method: Composition-based stats. Identities = 78/468 (16%), Positives = 143/468 (30%), Gaps = 59/468 (12%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 L + + L I V V + + + +++VP + E Sbjct: 86 ALILWWVVLALPFLMIATVRLVAVWYVVRRQPKHWRGPLDDRRFDARLPTFSVLVPVYKE 145 Query: 81 TGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 V+ + + I T +D T++ + + N+ + P + Sbjct: 146 EAVVPGLVAAMRRIDYPPDRVEILFITEEHDQPTRQALLQ-SNLAQNMRVLTVPAGHPQT 204 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQIP 195 K LN L ++DAED+ +LR R +Q Sbjct: 205 KPRALNFALQEAGGI---------LVAVYDAEDIPDRDQLRRAAAAFVAGGPRLACVQAQ 255 Query: 196 VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGD 255 + + + + F S + E+ L +P L +P G F R + Sbjct: 256 LTIYNAKQSFF-SRQFALEYKALFSGLLPALAFLKLPIPLGGTSNHFRRDLLRK------ 308 Query: 256 GIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREY 315 +D ++TED D+G R+ G +R E Sbjct: 309 CGGWDPFNVTEDADLGIRIARLGYDVAVIRS------------------------ETSEE 344 Query: 316 FPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA-ISNFVSFLAMLVM 374 P + T Q++RWI G + + + LWRD F + +++ Sbjct: 345 APTEWRTWCGQRTRWIKGWIQ------TYLVHMRHPLRLWRDLGTWQFIGFQIMIGGMIL 398 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 L+ Y + A ++ A L + + + I G G Sbjct: 399 SILVHPWFYVLIVNKAMSGAALMPAGAALQWIFSAHLMIGYGAAWLLTIVTAR--GSISG 456 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 L + + L L A +RA+ ++ R W+KT H + Sbjct: 457 LWAAIWLPIYWLAISWAAYRAVIDLI----FRPFHWEKTAHGAGASHR 500 >UniRef50_B6B2W2 Glycosyl transferase, group 2 family protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B2W2_9RHOB Length = 588 Score = 243 bits (621), Expect = 2e-62, Method: Composition-based stats. Identities = 77/506 (15%), Positives = 164/506 (32%), Gaps = 65/506 (12%) Query: 4 LLDVFATWL--YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 +L + WL GL AV+M +G+ ++ + ++ + P R Sbjct: 140 VLALALQWLTFAGLYRAIFGFAVVMVFTGI---------VIKTAAAFIQLFVKEPEKQIR 190 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 +++++P +ET ++ + T + + +D T+ + E Sbjct: 191 AAPTTKLPRVSLLIPLHDETEILEALLRHIGMLTYPETLLDVILIVEASDTLTRSHL-ET 249 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 + ++ T+K +N L ++DAED +P +L Sbjct: 250 TDLPNWMRILIVPAGKITTKPRAMNYALPFC---------RGDIIGIYDAEDAPNPDQLS 300 Query: 181 LFNYLVERK----DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 L R IQ + + S + E++ +P + +P Sbjct: 301 EVVDLFARTSEKTACIQAVLDFYNARSNAL-SRFFAIEYATWFRLILPGIARMGFAIPLG 359 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G F R + L +D ++TED D+G RL G + ++EA + Sbjct: 360 GTSVFFRRNVLEKLD------GWDAHNVTEDADLGIRLARAGYQTTLISSVTLEEANNKA 413 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 ++Q+SRW+ G + F + L W+ Sbjct: 414 WP------------------------WIKQRSRWLKGYLITYFTHMRRPFRLLFELGAWK 449 Query: 357 DRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 G + F+S ++ ++ L+ W + + +L T++ + Sbjct: 450 -FLGFQAFFLSSISQYMLAPLIWSCMALYFGAPQW--MDATVPAEFLNTIMVAFIVMWFT 506 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 ++ V+G + + + + +A ++ +++ + W+KT H Sbjct: 507 STTVYIVAVSGEL-HRHLIPWIPFMSAYMALGTLAIYKGAWELI----TKPFYWEKTAHG 561 Query: 477 FPSVTGDTRSLRPLGQILLENQVITE 502 V + L TE Sbjct: 562 HSDVEIVHSNSEETESSLSLVTKATE 587 Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 38/99 (38%) Query: 522 MLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDEL 581 ++ QG IS +L + LA QN + + + + +L + L +P R E + Sbjct: 2 LVSQGQISETELYETLAIQNSLPFIDMQSDIPDLTLQTGLDPHQCLRLQCVPWRREGETT 61 Query: 582 IVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLR 620 I+ + + A V+ I R I +R Sbjct: 62 ILTTSSPEQFENAKATLPAHLHPVKMAIATRSHITEAIR 100 >UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetylase fusion protein n=66 Tax=Bacillus RepID=A0RGJ4_BACAH Length = 1119 Score = 243 bits (619), Expect = 3e-62, Method: Composition-based stats. Identities = 59/484 (12%), Positives = 133/484 (27%), Gaps = 74/484 (15%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 + VF+ Y ++ V + + F+ W ++ + R Sbjct: 703 YNKAVFSGAGYFKHILTTIFYVAIGLGIFRFVFLIYFAW-----------KQKRKTLSRY 751 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 ++ + +++++ A+NE VI + Y + + V + T + + E Sbjct: 752 IHSSYQPFVSVVIAAYNEEKVIAKTIR-SILDSKYGEFEVIVVDDGSTDGTSKVMQETFY 810 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 + P V + K+ +N + DA+ +I+ + L Sbjct: 811 KHPKVRFIQKENG---GKSSAMNLGFQQ---------SRGEIIVTLDADTIIAQDAISLM 858 Query: 183 NYLVE--RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 E + V R + + E+ + + L G Sbjct: 859 VRHFEDQNVAAVSGNVKVGNRR--NLLTTWQHVEYITGFNLERRAFDELNCITVVPGAIG 916 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 + ++ V +L ED D+ +G ++ Sbjct: 917 AWRKKNVVE------SGYLSEDTLAEDTDLTITFLRQGHRIVYEEKAYAFT--------- 961 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P+ + ++Q+ RW G + +K K + Sbjct: 962 --------------ESPEDVKSLIKQRYRWSYGTLQCLWKHRKALCNSKHK--------- 998 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 + F+A+ M +L + + + D + +FS + + F LM Sbjct: 999 ----TLGFIALPNMWLFQYVLQFIAPFADILMIIGLFSSDPLKVLGFYFVFFLMDLLASL 1054 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 + L + R + + +K + V W+K Sbjct: 1055 FAFKLEEENPKPLVWLILQRFIYRQ----FMTYVVVKSIFSSIRGIAVGWNKLKRVGSVE 1110 Query: 481 TGDT 484 Sbjct: 1111 HSTE 1114 >UniRef50_A1B414 General secretory system II, protein E domain protein n=7 Tax=Rhodobacteraceae RepID=A1B414_PARDP Length = 698 Score = 243 bits (619), Expect = 3e-62, Method: Composition-based stats. Identities = 79/496 (15%), Positives = 146/496 (29%), Gaps = 62/496 (12%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 +L + + + ++ + + S R + + Sbjct: 207 VLGLLLAPIAVIALLTGWTVLTLIASAALKLLSFAAILRRHRRDRTKAEAMARDAIPPPE 266 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 +++MVP + E + + L+ E I + D T +++ Sbjct: 267 MTAPLPVISVMVPLFAEADIAEKLIGRLSRLDYPRELMDILIVVEETDSVTCAALED-AR 325 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL--- 179 + V +K LN L+ + + DAED P +L Sbjct: 326 LPRWLRVVKVPDGPVRTKPRALNYALNFC---------RGSIIGVWDAEDRPEPGQLLKV 376 Query: 180 -RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGV 238 R F++ +Q + + + + E++ + AL VP G Sbjct: 377 ARGFHFAPPEVVCLQGVLDYYNPRTNWL-ARAFTIEYASWFRGTLAGAAALDLVVPLGGT 435 Query: 239 GTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQR 298 F R A+ + A+D ++TED D+G RL +G + +EA R Sbjct: 436 TLFFRREALEEV------GAWDAWNVTEDADLGVRLTRRGYRTRMLDTVTHEEANCRLIP 489 Query: 299 KFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR 358 V+Q+SRW+ G W + LWRD Sbjct: 490 ------------------------WVKQRSRWLKGFAM------TWGVHMRDPVALWRDL 519 Query: 359 KGA------ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFG 412 + F S L+ L P + + +L+ F Sbjct: 520 GARRFIGLQVQLFASVSQYLLAPVLWSFWLLSLGLPHPMRGMLSGMLGGNAIAILFTLFV 579 Query: 413 LMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDK 472 + ++ G L V L + +A W+A+ +V+ + WDK Sbjct: 580 ASELLNIAIGLWAVRGRGHRHLLPWVPTLHLYFPLGCLAAWKAIYEVVA----KPFYWDK 635 Query: 473 TTHDFPSVTGDTRSLR 488 T H + Sbjct: 636 TQHGIFEAGQEEAPEP 651 Score = 86.6 bits (213), Expect = 3e-15, Method: Composition-based stats. Identities = 34/145 (23%), Positives = 55/145 (37%), Gaps = 1/145 (0%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLA 534 D R LRPLGQIL+E+ + L AL + + RLG +L G + E L Sbjct: 21 DPAETVVSARDLRPLGQILIEDGAVDPRNLFKALVMRQRQSARLGEILLANGWVREEALI 80 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSL 594 +AL+ Q + + A L+ M A + L +P R + + +L Sbjct: 81 RALSRQWRASVLDLKALPPDPRLVDAMGAQLCLAEGAVPWRRVGGVTFIATARPEGFQAL 140 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGL 619 + VR ++ + Sbjct: 141 QDRLPQDFGAVRMLLCSENAAREAI 165 >UniRef50_C4L4L3 Type II secretion system protein E n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L4L3_EXISA Length = 556 Score = 242 bits (617), Expect = 4e-62, Method: Composition-based stats. Identities = 53/262 (20%), Positives = 108/262 (41%), Gaps = 18/262 (6%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 R+ R LG++L+E +IT QLD AL + G +LG +++ I+ QL Q + EQ V Sbjct: 2 RRTKRRLGEMLIEAALITTNQLDEALEQKRPGEKLGDALIRLNHITETQLIQMIHEQLHV 61 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + ++ I ++ +P ++A + ++P L + L V + D +D +++ + + G Sbjct: 62 PIIELYSYDINVTVTKLVPKALAQKHDIMPFELNGNTLHVATADPLDLIAIDDVRLQTGM 121 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 + I R QI + +Y + L ++ ++ I R P L Sbjct: 122 NIEIGIATREQIRKTISRYY------EMDHSLVEILKEDAPEIERQETISRDDAPIIRLV 175 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 ++ + S I+ + L +G + ET+ ++ V + Sbjct: 176 NQLFLSAIDQRASDIH-----FDPHEKQLHVRFRIDGDLRTETVY----PKKIQSVMLTR 226 Query: 724 LLLKAGLNTEQVAQLESENEGE 745 L + + L+ + +G Sbjct: 227 LKVMSNLDITE---SRLPQDGR 245 >UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L983_9DELT Length = 1140 Score = 242 bits (617), Expect = 5e-62, Method: Composition-based stats. Identities = 79/480 (16%), Positives = 137/480 (28%), Gaps = 56/480 (11%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELY---KPDE 68 L L + + +GL F+ L+V + ++ R + Sbjct: 706 LVALGNRVGFFLLFAWSAGLQYLFVTGTILGLGRLLILAVLAVFEKVRGRRRPVSGPAPD 765 Query: 69 KPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 +A++VPA+NE V+ + + I V + T R + E A V Sbjct: 766 LSVAVVVPAYNEEKVVLQTVQSLLACQHPATFEIIVVDDGSTDATYRVLCEALAGEKLVT 825 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN--YLV 186 V K LN+ + A + DA+ V + + + Sbjct: 826 IVTKPNG---GKPAALNHGI---------ALTRADIVVTLDADTVFARDTILRLADWFRD 873 Query: 187 ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 + + R +F + E+ D L G + R Sbjct: 874 PKVGAVAGNAKVGNRI--NFLTRCQALEYVTSQNLDRRALTVLDSVTVVPGAVGAWRREV 931 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 V A F ++L ED D+ R++ G T + V Sbjct: 932 VEA------AGGFSGETLAEDADLTIRIQRMGHTVAYEDRAVALT--------------- 970 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV 366 PDT +RQ+ RW+ G + +K + + Sbjct: 971 --------EAPDTMRGFLRQRFRWMFGTLQVAWKHKDALFRPRYGLLGFFGLPNIWLYQI 1022 Query: 367 SF-LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF- 424 F + VM L +S WH + A L + + + + V F Sbjct: 1023 FFQIISPVMDLWLAYTCLKSWVLWLWHPA-TWDPDALFRVLFYYALFMAADILAGLVAFL 1081 Query: 425 -VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 G L R F+ L+ + AL+ +L R + W K + Sbjct: 1082 LERGEDKRLLAWLVPQRFFYRQLMYVV----ALRTLLASLRGREMGWSKLERKATVDSRR 1137 >UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LWG0_HALLT Length = 509 Score = 241 bits (616), Expect = 6e-62, Method: Composition-based stats. Identities = 76/485 (15%), Positives = 139/485 (28%), Gaps = 53/485 (10%) Query: 2 DWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 W A YG V+ + SG+ F I +++ +VY R + Sbjct: 73 SWKGLCVAFVCYGFGVVTLGWVQPSLTSGVYLFAIALIFLYYWFIALAAVY-HNQRYHSQ 131 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDE 119 + ++I+VPA+NE G I A DY I V + +T + Sbjct: 132 DAPPEPSASISIIVPAYNEEGYIQRTI-TALLDADYPDGKREIIVIDDGSTDNTCAEARA 190 Query: 120 VCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL 179 + V K LN L + + DA+ V L Sbjct: 191 FESE-----TVSVVTKDNGGKYSALNYGLLFASN---------EIILTVDADSVPEKDAL 236 Query: 180 RLFNY--LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 + + + V + R + E++ + + + G Sbjct: 237 KQMVAPLSDQSVGAVASTVTIWNRGS--LLTGCQQLEYTIGVNVYRRMLDLFGIVMVVPG 294 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 + R + + FD Q+LTED+DI ++ G Sbjct: 295 CLGAYRRDVLDEIQ------GFDPQTLTEDFDITVKVLRAGYEVR--------------- 333 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF-LWR 356 S+ V PD+ Q+ RW G FK S T + + Sbjct: 334 ---------SSEARVYTEAPDSLRDLYNQRLRWYRGNYMTIFKHRGVLSEPTTGFLYRFA 384 Query: 357 DRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 + A V++ L++ + F +L+ L + Sbjct: 385 FPLRLVELLFLPFASWVILGLIVKILLSGFVIQVVSLFIFFLSIIFLIAALGVYIEGEDW 444 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 R++ Y L + L L ++ RA + + P + ++ + Sbjct: 445 RLLWYTPLFLVGYKHFHDALLLKSLADVLLGRNLSWTRATRIEQRSNQPTQSNQVESDAE 504 Query: 477 FPSVT 481 + Sbjct: 505 VETAD 509 >UniRef50_A3DEG0 Type II secretion system protein E n=5 Tax=Clostridia RepID=A3DEG0_CLOTH Length = 787 Score = 241 bits (616), Expect = 6e-62, Method: Composition-based stats. Identities = 49/284 (17%), Positives = 98/284 (34%), Gaps = 31/284 (10%) Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAE 539 +G IL+ VIT++QL+ AL + G +G ++ QG I L + L + Sbjct: 208 NESGIFKDKIGNILVRAGVITQDQLENALSIQKKSGGLIGQILVKQGYIDRRSLYEFLQK 267 Query: 540 QNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTR 599 Q GV + I+ +I +I + ++A + V+P+ + L V D ++ S+ L Sbjct: 268 QMGVEYVDIEGIEIDEDIIGLVSPNLAKTHKVIPIEKVDGNLKVAMSDPMNIFSIDDLRL 327 Query: 600 KVGRKVRYVIVLRGQIVTGLRHWYAR-------RRGHDPRAMLYN-----------AVQH 641 G ++ + QI L +Y + + A L + Sbjct: 328 TTGLEIIPCLADEEQISAQLEKYYGKASRKTSAKEIEQKVADLDEEIKKVNEKIAVEITQ 387 Query: 642 QWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGV 701 + + P + I S I++ E + +G Sbjct: 388 TEDEDTTIDISDLENAPIVKMVNIIFQKAVATRASDIHI-----EPQEDCVLIRFRIDGQ 442 Query: 702 ISQETLDRVLTIQRELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 + + ++ L + + + +GLN + +G Sbjct: 443 LVEIMRYD----RKILSSIVARIKIISGLNIAE---KRIPQDGR 479 Score = 125 bits (313), Expect = 7e-27, Method: Composition-based stats. Identities = 49/254 (19%), Positives = 95/254 (37%), Gaps = 14/254 (5%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 R + +ILLE V+ L A R + +L GL+S + + A A + G+ Sbjct: 5 DTRGIDEILLEMGVLKIVDLKKAWDIQRESNKNIEDVLLELGLVSQKDIMHANAVKMGIP 64 Query: 545 WESIDAWQI-PSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + + +QI SS+ + ++A Y V+P+ EN L V D D + + Sbjct: 65 FVDLSTYQISDSSVPLLITRNIANRYKVIPIEKENGVLTVAMSDPTDIFCIDDIRLATAL 124 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 +++ V+ +I + ++ + + + E E + Sbjct: 125 EIKPVLADVKEIERLIVEYFGEEKKPQESKLKAENEEQNKKEELLKMEEELLGREI---Y 181 Query: 664 AEILTTLGHINRSAINVLLLRHE--------RSSLPLGKFLVTEGVISQETLDRVLTIQR 715 I ++ + +G LV GVI+Q+ L+ L+IQ+ Sbjct: 182 NNIKAD-VETREPEFDLASKGFDQNTYNESGIFKDKIGNILVRAGVITQDQLENALSIQK 240 Query: 716 ELQVSMQSLLLKAG 729 + + +L+K G Sbjct: 241 KSGGLIGQILVKQG 254 >UniRef50_B5ZHV8 Glycosyl transferase family 2 n=2 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=B5ZHV8_GLUDA Length = 624 Score = 241 bits (615), Expect = 7e-62, Method: Composition-based stats. Identities = 87/487 (17%), Positives = 158/487 (32%), Gaps = 64/487 (13%) Query: 5 LDVFATWLYGL-KVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 VF +LY + + + FF+ + +++++ V R + R L Sbjct: 183 AMVFLLFLYLAPERTLFAANLAAGLVFFASFFLKFMLSCAAVRQEVDVKVRDSEI--RSL 240 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTYPNDPDTQRDVDEVC 121 D I+VP + E V+ + A L+Y + + +D +T ++ Sbjct: 241 EDRDFPVYTILVPMYKEPDVLPILV-NAIRNLEYPQSKLDVKLVLEEDDIETIAAARKL- 298 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR- 180 A + P +K N L ++DAED +L Sbjct: 299 ALEATFEIICVPPSEPRTKPKACNYALRFA---------RGEYLTIYDAEDKPEATQLEK 349 Query: 181 ---LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 F L + IQ + + T M + E++ +P E + +P G Sbjct: 350 VLVAFRKLPDNVVCIQARLNYYNATENWLTRM-FTLEYTAWFDFYLPALEYMRIPIPLGG 408 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F A+ A+ A +D ++TED D+G RL ++G V +EA Sbjct: 409 TSNHFKISALRAVHA------WDPYNVTEDADLGVRLTQRGWKVAVVDSTTFEEANV--- 459 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 + +RQ+SRW+ G + + + +R Sbjct: 460 ---------------------SIPNWIRQRSRWLKGYMQ------TYLVHMRSPLAFYRK 492 Query: 358 RKG-AISNFVSFLA--MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLM 414 G F F+ + + + + L+ S +M L LN L Sbjct: 493 TGGTGFWGFQFFIGGTFMTALLAPIFWVFFILFTLFGLKAGSGVFSGRIMALNALNLLLG 552 Query: 415 VNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 +V + + L L + +A ++ L Q+L + W+KT Sbjct: 553 NGFLVYTYVLCSFKRNYRHLALYALTTPVYWALQSIAAYKGLFQLLY----KPFYWEKTQ 608 Query: 475 HDFPSVT 481 H T Sbjct: 609 HGLSKHT 615 Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 54/155 (34%), Gaps = 3/155 (1%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQN 541 R GQ L+ ++TE QLD A++ ++ RLG +L +G + + LA Sbjct: 5 TQPDKRLFGQFLVSRNILTEAQLDEAIQTQKLWKSRLGDIILAKGWLKPRRFYHLLATFF 64 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLE-NDELIVGSEDGIDPVSLAALTRK 600 + + + L + LP R + +I+ D + L + Sbjct: 65 DLEFVDLMGHPPDPDLFDRAMIDEYARRSFLPWRRSADGAIILALADPS-EDTFNWLRAR 123 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML 635 G+ +V R V L+ D L Sbjct: 124 YGQNACFVGTGRFDTVWLLQKMGNAALSDDALNAL 158 >UniRef50_B8FQB2 Type II secretion system protein E n=4 Tax=Clostridiales RepID=B8FQB2_DESHD Length = 573 Score = 241 bits (614), Expect = 1e-61, Method: Composition-based stats. Identities = 56/275 (20%), Positives = 106/275 (38%), Gaps = 26/275 (9%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 + + LG+IL+ + EEQL+ AL+ + GLRLG ++ Q +S E++ + + Q G Sbjct: 4 RQERKRLGEILIAGGALMEEQLNEALKLQKSLGLRLGEVLIRQNFVSEEEILRTIQRQLG 63 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + ++ + ++ +P SVA Y VLP+ N +L+V + D D ++ L G Sbjct: 64 LPAVDLNRIFVTEKILKMIPESVARKYTVLPVDFTNGQLLVATSDPTDYYAIDDLRLASG 123 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRA----------MLYNAVQHQWLTEQQAGEI 652 V+ + + I+ + +Y R + + A Q GE Sbjct: 124 MMVKPCVARKADILRAIDRFYGRSEAEKAVSDFVRQKGHDQVAAAAQTPVLTVVQAGGET 183 Query: 653 WRQYV-PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 + P I+ + S I++ E L +GV L ++ Sbjct: 184 ADEEATPIIKFLNTIIENAVNNFASDIHI-----EPVDDELRVRFRIDGV-----LREIM 233 Query: 712 TIQREL-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + + A LN + +G Sbjct: 234 RTPVGMTGPVVSRVKIMADLNIAE---RRLPQDGR 265 >UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular organisms RepID=Q1IMJ5_ACIBL Length = 546 Score = 240 bits (613), Expect = 1e-61, Method: Composition-based stats. Identities = 66/501 (13%), Positives = 143/501 (28%), Gaps = 60/501 (11%) Query: 4 LLDVFATWLY---GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 +LD LY + + I+ I + R + R + + Sbjct: 28 VLDTTFKGLYQANAFDLCLLIPYFIVLI------ILAAYGVHRYQLVWMYYRNRKNKTTD 81 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDE 119 + + + + +P +NE VI + E + I V + +T E Sbjct: 82 PPQHFAELPRVTVQLPIFNEQYVIDRLVEAVCKLDYPKDKLDIQVLDD-STDETVEVARE 140 Query: 120 VCARFPNV---HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISP 176 V R+ + + KA L + + DA+ V Sbjct: 141 VVERYAALGNPISYIHRTNRHGFKAGALQEGMAVCK---------GEFIAIFDADFVPPA 191 Query: 177 MELRLF--NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVP 234 L+ ++ ++Q R ++ T + I + R Sbjct: 192 DFLQKCIHHFAEPEIGMVQTRWTHLNRNYSFLTEVEAIL-LDGHFVLEHGGRSRKGVFFN 250 Query: 235 SAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKE 294 G + ++A+ + +LTED D+ +R + KG ++ Sbjct: 251 FNGTAGMWRKQAIEE------AGGWQHDTLTEDTDLSYRAQVKGWRFKYL---------- 294 Query: 295 REQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFL 354 + P + Q++RW G++ K + + + Sbjct: 295 -------------QDVECPAELPIEMTAFKTQQARWAKGLIQCSKKVLPFLYRSDVPRRV 341 Query: 355 WRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLM 414 + ++ +S+ M+V+ L+L + + L I T +F L Sbjct: 342 KVEAWYHLTANISYPLMIVLSALMLPAMVLRFYQGWFQMLYIDMPLFLASTFSISSFYL- 400 Query: 415 VNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLI-NFMANWRALKQVLQHGDPRRVAWDKT 473 + Q+ ++ + L +++ L G + N A A+ Sbjct: 401 ---VSQKELYPKTWLRTFMYLPALMALGIGLTVTNTKAVLEAIVGKQSAFARTPKYRVTN 457 Query: 474 THDFPSVTGDTRSLRPLGQIL 494 + R + + Sbjct: 458 KGEKSIAAKKYRKRLGIIPWI 478 >UniRef50_B0TEE9 Type ii secretion system protein e, putative n=12 Tax=Firmicutes RepID=B0TEE9_HELMI Length = 570 Score = 240 bits (613), Expect = 1e-61, Method: Composition-based stats. Identities = 49/273 (17%), Positives = 101/273 (36%), Gaps = 22/273 (8%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 + + R LG +LLE +IT+EQL AL + G RLG +++ G ++ + + + L Q Sbjct: 1 MASTNRRKLGDLLLEYNLITDEQLQQALAEQKKRGERLGQTLVRLGFVTRQMINEVLEFQ 60 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 G+ S+ + + + +P S+ + LP++ + L V D ++ +L + Sbjct: 61 LGIPTISLLQYPLHPEVFKLLPESLCRRHKCLPVKRSGNRLTVAMVDPLNLPALDDIKMT 120 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR------ 654 ++ IV ++ Y + ++ E+ ++ Sbjct: 121 TNLEIDAAIVAEDELEQVFEKIYGLNEEQEADIKRLEVEANRAEEERSIIDLGELERMTA 180 Query: 655 -QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTI 713 P + IL S I++ E + +G+ L VLT+ Sbjct: 181 VGDAPIIRVVNTILQQAAKEGASDIHL-----EPQEGGVRVRYRADGI-----LRHVLTL 230 Query: 714 QRELQ-VSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + L A +N + +G Sbjct: 231 PKAAHPALLSRIKLLAKMNIAE---KRLPQDGR 260 >UniRef50_C0R5T9 Glycosyl transferase, group 2 family protein n=8 Tax=Wolbachia RepID=C0R5T9_WOLWR Length = 532 Score = 240 bits (612), Expect = 2e-61, Method: Composition-based stats. Identities = 69/482 (14%), Positives = 157/482 (32%), Gaps = 74/482 (15%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRY---------PRMSYRELY 64 + I+ ++ +F ++ ++ L + ++ Y +L Sbjct: 110 IFFASFFIVTSILTLTEKYTYFALMLIFIIGCSSYLFKFIATIFNLCETSNQKVDYSKLN 169 Query: 65 KPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 + D I++PA+ E+ VI + E + + +D +T +++ Sbjct: 170 EEDFPIYTILLPAFKESAVIEQLIESIESLDYPKSKLDVKLQVESDDQETLAAIEKY-TL 228 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--- 180 + P +KA N + +++DA+D P++L+ Sbjct: 229 PQYFEVIKVPHSLPRTKAKSCNYAMSFA---------RGKYAVIYDADDKPDPLQLKKAL 279 Query: 181 -LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVG 239 FN ++ +Q + + + T ++ E+ +P + + +P G Sbjct: 280 IEFNKGDDKLACVQAKLNYYNCDCNFLT-KSFSLEYMNWFQYLLPGFQKMNMPMPLGGSS 338 Query: 240 TCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRK 299 FS + + + +D S+TED D+G RL + G + Sbjct: 339 NHFSVKILRKMFF------WDAYSVTEDADLGLRLAQMGYKTRMIDS------------- 379 Query: 300 FLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRK 359 E P ++Q++RWI G + K + ++ Sbjct: 380 -----------ETLEESPIAVFAWIKQRARWIKGYMQTYIVHLK-NIKSLYKHTGFKG-- 425 Query: 360 GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIV 419 ++++ L + A + + LS+ L+ + V ++ Sbjct: 426 ------------ILLLNLFVGSAAFLFFTTPFLLLSLILTKVLNELFLYYFVVVYVTNLI 473 Query: 420 QRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPS 479 VI V + + +L++ +A + AL + + + W+KT H Sbjct: 474 LLVIAVKQQKMPFYFYIVSIFFPVYSLLHSVAAFLALWEFILY----PERWNKTQHGLWK 529 Query: 480 VT 481 Sbjct: 530 QN 531 >UniRef50_B8HEE8 Glycosyl transferase family 2 n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HEE8_ARTCA Length = 664 Score = 240 bits (612), Expect = 2e-61, Method: Composition-based stats. Identities = 75/490 (15%), Positives = 148/490 (30%), Gaps = 65/490 (13%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVV-YWVRRIKRKLSVYRRYPRMSYRELYKPDEKP 70 L + V+ + ++ L F + + + + R P + D Sbjct: 220 LTAVNVVFLVSIGFKTVASLRQPFDALHDRSAAKARAREFRRRGLPVEEVARIPDADLPV 279 Query: 71 LAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN-VH 128 I++P + E +I + + V +D +T +R P V Sbjct: 280 YTILIPVFREANIIDKLLSNLGQLDYPRSKLDVLVLLEEDDTETIEAAKR--SRPPEYVR 337 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER 188 +V R P +K N L +++DAED P +LR + + Sbjct: 338 ILVVPRGEPQTKPRACNYGLTFA---------RGEYVVIYDAEDRPDPGQLRAAIHAFRK 388 Query: 189 KD--------------LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVP 234 +Q + F + T + + E++ +P + +P Sbjct: 389 DAFERQYLDPDRRPLICVQAALNYFNADQNVLTRL-FTIEYTHWFDSMLPGLDRSGIPLP 447 Query: 235 SAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKE 294 G F R + + A+D ++TED D+G R +G + Sbjct: 448 LGGTSNHFDTRLLRLV------GAWDPWNVTEDADLGLRAAVEGYRVGVINST------- 494 Query: 295 REQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFL 354 E ++Q++RWI G + TL Y Sbjct: 495 -----------------TWEEACSQVPAWIKQRTRWIKGYMVTAAVNT----RNTLRYIQ 533 Query: 355 WRDRKGAISNFVSFLAM-LVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGL 413 GA+ L L + L+L + + ++F+ + L+ + Sbjct: 534 RTGVAGAVGFLGLILGTPLAFLAYPLVLGFTIVTYVGYNFVGLVLPEWLLVGGVVSMLFG 593 Query: 414 MVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 IV + +G + ++L +++ +A WRA Q+L Sbjct: 594 NAMMIVVSGVATWRRHGWRIAIFALLN-PAYWVLHSVAAWRAAWQMLTSPHKWEKTPHGL 652 Query: 474 THDFPSVTGD 483 ++ Sbjct: 653 DEEYHDDGRW 662 Score = 88.9 bits (219), Expect = 7e-16, Method: Composition-based stats. Identities = 36/161 (22%), Positives = 62/161 (38%), Gaps = 2/161 (1%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQN 541 R LGQ LL+ +IT +QLD AL+ EG LG ++++ ++ + + LAEQ Sbjct: 22 PGRQSLALGQTLLQAGLITTDQLDRALQRAATEGGLLGRHIILETGLNRRHVYEVLAEQW 81 Query: 542 GVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKV 601 + + +L+ + S LP LE+ L V + AA R Sbjct: 82 DAPLVDLVSHPSDDALLERLQFSEVSEPGWLPWHLEDGVLTVATAVKPSEEIRAAAMRAT 141 Query: 602 GR-KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQH 641 G V + I ++ + ++ L + Sbjct: 142 GATDVVFRTTTDWDINHSIQRAFRNHLLYESAERLAEELPD 182 >UniRef50_A8MGF8 Type II secretion system protein E n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MGF8_ALKOO Length = 560 Score = 240 bits (612), Expect = 2e-61, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 95/251 (37%), Gaps = 16/251 (6%) Query: 498 QVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSS 556 +T+ QLD AL + G +LG ++ + + + + + L Q G+ +D +++ + Sbjct: 16 GKLTQSQLDNALDIQKKTGKKLGEIVVSEKYTTEDDIIEVLEFQLGIPHVDLDKYEVNPT 75 Query: 557 LIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIV 616 + +P ++ Y ++ + ++ LIV D ++ +L + V ++ VI + +++ Sbjct: 76 VATLIPENIVRRYELIAIDKKDTILIVAMTDPLNIFALDDVKLFVKSDIQPVISTKEKLI 135 Query: 617 TGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR-QYVPHQFLFAEILTTLGHINR 675 + +Y+ + + E+ P L I+ Sbjct: 136 KAIDKFYSSETTKKALEEFEENFLPINTDDIEESELLEVTTAPIVKLLNSIIEQAVKERA 195 Query: 676 SAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTI-QRELQVSMQSLLLKAGLNTEQ 734 S I++ E + + +G L ++T+ + + + + + +N + Sbjct: 196 SDIHI-----EPYAEDIRVRFRIDG-----DLREIMTLAKNSMSGIVTRIKIIGKMNIAE 245 Query: 735 VAQLESENEGE 745 +G Sbjct: 246 ---KRIPQDGR 253 >UniRef50_B0MQV8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0MQV8_9FIRM Length = 563 Score = 240 bits (612), Expect = 2e-61, Method: Composition-based stats. Identities = 54/266 (20%), Positives = 105/266 (39%), Gaps = 17/266 (6%) Query: 485 RSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNG 542 P+GQIL+EN + E+QL+ AL + G +LG +L G +S QLAQAL+ + Sbjct: 1 MKNIPIGQILVENGFLKEDQLEEALEKQRSEPGKKLGDVLLELGYVSETQLAQALSIRLK 60 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 V + + +I + ++P ++A + ++ + L V ++D I+ L G Sbjct: 61 VPFIDLTTTKIDIEAVKKIPEAIAKKNCCVAFQMTDSRLTVATDDPINFYIFEELKVISG 120 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAML---YNAVQHQWLTEQQAGEIWRQYVPH 659 ++ +I R I + Y+++ + L Y + + P Sbjct: 121 MEIHAMIATRTAINETISKAYSQQTVSNVMDNLNKEYTGNTDSVIQDDPESGERVDNAPI 180 Query: 660 QFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQV 719 L I+ T +N S I++ E L +G E ++++ Sbjct: 181 VKLVNTIVETSFRMNASDIHI-----EPFKDRTRIRLRVDG----ELIEQMKVKPAAHNS 231 Query: 720 SMQSLLLKAGLNTEQVAQLESENEGE 745 + + + G+N + +G Sbjct: 232 LITRIKILGGMNIAE---KRIPLDGR 254 >UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7J6_ACIC5 Length = 627 Score = 240 bits (612), Expect = 2e-61, Method: Composition-based stats. Identities = 73/486 (15%), Positives = 144/486 (29%), Gaps = 55/486 (11%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 L + A ++ L + R I R P+ + + Sbjct: 117 LYHWNLFDAAMLTPYFLVMIILSFYGVHRYIMVWEYYRFRKRATKEPPKEFPELPRVTVQ 176 Query: 75 VPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARF-----PNVH 128 +P +NE VI + E + I V + +TQ + ++ P V+ Sbjct: 177 LPIFNEQFVIDRLIEAICAMDYPRDRLEIQVLDD-STDETQAVAAALVKKYQEQGQPIVY 235 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLV 186 R G KA L+ L + DA+ V SP L ++ Sbjct: 236 LHRTNRQG--YKAGALDEGLKVAK---------GEFVAIFDADFVPSPDWLMKVIHHFSD 284 Query: 187 ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 ++Q R+++ T + I + R G + R A Sbjct: 285 PAIGMVQTRWTHLNRDYSFLTQVEAIL-LDGHFVLEHGARSRAGVFFNFNGTAGMWRRTA 343 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 + D + +LTED D+ +R + G ++ Sbjct: 344 I------TDAGGWQHDTLTEDTDLSYRAQLVGWKFKYL---------------------- 375 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV 366 + P + Q++RW G++ K L+ W ++ A + Sbjct: 376 -QDVECPAELPIEMTAFKTQQARWAKGLIQTSKKIMPQVLRADLS---WHEKLEAWYHLT 431 Query: 367 SFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVT 426 + ++ +MI L +LL + + + L + + Q++++ Sbjct: 432 ANISYPLMIVLSILLLPTEIIQFHQGWFQMLFIDFPLFAASTFSIASFYM-VSQQILYPH 490 Query: 427 GYYGLTQGLLSVLRLFWGNLI-NFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTR 485 ++ L ++ L G + N A AL + + + T + Sbjct: 491 RWFRTLCYLPFLMALGIGLTLTNSKAVIEALLGIKSSFKRTPKYRVQAKGERSKATKYRK 550 Query: 486 SLRPLG 491 L L Sbjct: 551 RLGILP 556 >UniRef50_A7GMR7 Glycosyl transferase family 2 n=3 Tax=Bacillales RepID=A7GMR7_BACCN Length = 437 Score = 240 bits (612), Expect = 2e-61, Method: Composition-based stats. Identities = 71/492 (14%), Positives = 152/492 (30%), Gaps = 67/492 (13%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 + ++ + LY + I L + L + ++ +V + SY Sbjct: 6 ISFVSYIGRVLLYVMIAFLIFLTQFQGVLSLYQVVVSLLGFV------------KKKNSY 53 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDE 119 + AI+V A NE VI + + E Y I V +T + V E Sbjct: 54 VLDHDIAHTRFAILVCAHNEEKVIEQIVKNLKKIDYPKEKYDIHVICDNCTDNTAQIVRE 113 Query: 120 VCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL 179 + K L + + + E+ + ++ DA++V+S L Sbjct: 114 NQVKAW-----ERHDNQKRGKGYGLEWMFQNLFRLEKEQQEVYDAVVILDADNVVSRNFL 168 Query: 180 RLFNYLV--ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 ++ N + E+ +++Q + + + S +Y + + R L G Sbjct: 169 QVLNAKLVKEKYEVVQAYLDSKNPKDN-WISKSYAIAYWSTNRLYQLSRGKLGLSAQLGG 227 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFR-LKEKGMTEIFVRFPVVDEAKERE 296 G CF+ + + + +SLTED + + + KG + + + K Sbjct: 228 TGMCFTMNILKEI-------GWGTESLTEDLEFTAKYILAKGRAVGWAHDAKIYDEK--- 277 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 P F + RQ+ RW+ G + + F Sbjct: 278 --------------------PTDFKVSFRQRIRWMQGHMDCMVRYSGPLLKNFTQTFNMN 317 Query: 357 DRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 I ML + ++L + + + ++ ++ W+ L+ ++F ++ Sbjct: 318 AIDMFIYLVQPTRTMLSVNSIILF--FVTYYDLLPSYIMLYVLHPWIWLLIAVSFYILPI 375 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 + V + W ++ L + W T H+ Sbjct: 376 IALL-------------QEKKVTNIIWTPIVYIFGFSWVPIIFLGFIKRKEKVWVHTPHN 422 Query: 477 FPSVTGDTRSLR 488 + + Sbjct: 423 RVMDRENMIKME 434 >UniRef50_C8WIR5 Type II secretion system protein E n=2 Tax=Bacteria RepID=C8WIR5_EGGLE Length = 568 Score = 239 bits (611), Expect = 2e-61, Method: Composition-based stats. Identities = 60/269 (22%), Positives = 114/269 (42%), Gaps = 24/269 (8%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG +L++ +ITE+QL AL+ + E RLG ++ +G+I+ L +AL Q GV + Sbjct: 4 KRLGDVLIDAGLITEDQLGHALKQQKETKRRLGDELIAEGVITEAGLIEALQMQLGVEFV 63 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + A + L + +VA Y V+P+R DE+ + D ++ +++ A+ ++V Sbjct: 64 DLSAIDLDPELSRVISKNVARQYNVVPVRTSPDEVCLAMSDPLNFMAIEAVKNATRKRVI 123 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAML---------YNAVQHQWLTEQQAGEIWRQYV 657 ++ ++ + Y + +A + T + Q Sbjct: 124 PMVTTHDSLMRAIMTLYGNEGAARAIEEMKRDARTTGADDASTGSFQTSTLGDDADAQSA 183 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P L I+ S I++ E + L + +GV L +LT+ +EL Sbjct: 184 PTVRLVNSIIERAATERASDIHL-----EPREIDLHVRMRIDGV-----LRTILTVPKEL 233 Query: 718 QVSMQ-SLLLKAGLNTEQVAQLESENEGE 745 Q S+ L + G+NT + +G Sbjct: 234 QASVISRLKIMGGMNTSE---RRVPQDGR 259 >UniRef50_B7QPG7 Glycosyl transferase, group 2 family n=4 Tax=Rhodobacteraceae RepID=B7QPG7_9RHOB Length = 661 Score = 239 bits (611), Expect = 2e-61, Method: Composition-based stats. Identities = 74/457 (16%), Positives = 145/457 (31%), Gaps = 58/457 (12%) Query: 35 FIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAA-T 93 FI + +R ++ + + + ++++VP + E + + Sbjct: 255 FISHIQTIRSRRQIELPIASRSKFTSPRHRR---PMISVLVPLYKEAEIGRALLRRLCKL 311 Query: 94 TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA-RPGPTSKADCLNNVLDAIT 152 T + + D T+ + + G T+K +N L+ Sbjct: 312 TYPRSLLEVLLVLEEEDDITRDAIR-CADLPDWFRVIEVPAHGGLTTKPRAMNYALNFC- 369 Query: 153 QFERSANFAFAGFILHDAEDVISPMELRLFNYLVER----KDLIQIPVYPFEREWTHFTS 208 + DAED P +L + +Q + + + S Sbjct: 370 --------RGEIIGIWDAEDAPEPDQLDHVAAAFAKGDGALACLQGALDYYNPTQN-WIS 420 Query: 209 MTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDY 268 + E++ +P L VP G R + + +D ++TED Sbjct: 421 RCFTLEYASWFRIVLPGIARLGLVVPLGGTTLFIRRDVLEQV------GGWDAHNVTEDA 474 Query: 269 DIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKS 328 D+G RL G + +EA R V+Q+S Sbjct: 475 DLGVRLSRFGYRTDMLPTSTYEEANCRPW------------------------AWVKQRS 510 Query: 329 RWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWP 388 RW+ G + + H + L L WR G + F+ + ++ L +L Sbjct: 511 RWLKGFM-VTYLVHMRSPRLLLKQLGWRQFLGLQAFFLGTVGQFLLAPCLWSFWLITLGL 569 Query: 389 DAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLIN 448 + +G+ +L + + F L+ I T G L L + + Sbjct: 570 PHPTAPLLPTGATYLAGVSLVFFELLGMVIAITAACAT---GRRSLALWAPSLIFYFPMG 626 Query: 449 FMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTR 485 +A ++AL +++ + WDKTTH + + Sbjct: 627 VIAVYKALYELIL----KPFYWDKTTHGHSPKSRTQK 659 Score = 109 bits (273), Expect = 4e-22, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 63/164 (38%), Gaps = 1/164 (0%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLA 534 R L Q L+++ ++++ + AL + + + ++ +G S +Q+ Sbjct: 36 RATRPLTKQVGRRTLEQRLIQDHAVSKDHVIRALTLQQHQRAPIDRILVSEGWASQDQVL 95 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSL 594 AL+ Q+ + + + L+A PA L ++VLP ++V D ++ Sbjct: 96 DALSSQHKIPKVDLSGHNPQARLLARKPAGFWLRHSVLPWMQLGQTVVVAVSDPNKLNTI 155 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNA 638 +V V+ QI L + + + + L A Sbjct: 156 RTDLAASFGEVMPVLASESQIQQLLVSHFRKDLAVEASSRLPLA 199 >UniRef50_D2RJA7 Glycosyl transferase family 2 n=10 Tax=Veillonellaceae RepID=D2RJA7_ACIFE Length = 428 Score = 239 bits (611), Expect = 2e-61, Method: Composition-based stats. Identities = 61/446 (13%), Positives = 131/446 (29%), Gaps = 59/446 (13%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 +I + L V++ + F I + + PR +++ P A++ Sbjct: 5 FDIIMVPLQVLIVFFTIYYFVISLFGIL-------------PRKKEKKILTPKTT-FAVI 50 Query: 75 VPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 V A NE VIG + E E Y IFV T + A Sbjct: 51 VAAHNEEKVIGELVENLHMLRYPDELYDIFVIADNCKDHTAEVARKAGAL-----VYERF 105 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN-YLVERKDLI 192 K L + + + + + DA++++ P L+ N + + LI Sbjct: 106 NQEEVGKGFALEWMFRQLFAL----DRQYDAVAIFDADNLVHPDFLKEMNNRFCKGERLI 161 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 Q + + + S T+ F ++ + + G G + + Sbjct: 162 QGYLDVKNPNDS-WVSGTFAINFWIVNHVWHLAKYTIGLSSVFGGTGMVIATEVLKKY-- 218 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 + LTED + + +G+ + + ++ + K Sbjct: 219 -----GWKATCLTEDMEFTMKCLLEGIPTTWCQDAIIYDEK------------------- 254 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 P TF + Q+ RW G + + G I F + ++ Sbjct: 255 ----PQTFKASWNQRKRWAQGQFDVAGRYMWKLLKEGIRKRDIVILDGVIDVFQPYFMLI 310 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV-IFVTGYYGL 431 +L Y + ++ W ++ + + I+ ++ ++ Sbjct: 311 STFFVLCSTIYNFVPFYTNVLYALLPYHVW--QVIGVAQYAIPAIILFKINAAPKSWFYT 368 Query: 432 TQGLLSVLRLFWGNLINFMANWRALK 457 L + ++ F + Sbjct: 369 LFYPLLLYSWVPITILGFFHRHEHVW 394 >UniRef50_B9Y801 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y801_9FIRM Length = 560 Score = 239 bits (610), Expect = 3e-61, Method: Composition-based stats. Identities = 55/267 (20%), Positives = 102/267 (38%), Gaps = 18/267 (6%) Query: 485 RSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNG 542 P+GQ+LLE ITEEQL++AL ++ G RLG ++ G I+ E+ +AL+ + Sbjct: 1 MKNLPIGQLLLEQGYITEEQLNSALAHQKAHPGNRLGDVLIELGYITEEKKLKALSVRLN 60 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 V + S ++ + VA Y V+PL ++N+ L + + D +D +L + G Sbjct: 61 VPVYEGFQINVNSDIVRLISEDVAKKYQVMPLEIKNNALQLATSDPLDFYALEDIKASCG 120 Query: 603 RKVRYVIVLRGQIVTGLRHWYAR----RRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVP 658 V V+ + I +R YA+ + + L E P Sbjct: 121 IPVSPVLAPKEMIENAIRRNYAQANVSSAIDEIQKDLTEDQDLALNDELSELTQRVDNAP 180 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 ++ S I++ E + T+GV+ + T + Sbjct: 181 VVKFVNNMIRQAYETGASDIHI-----EPFEMTTVIRFRTDGVLHEFT----RIAKSVHD 231 Query: 719 VSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + +N + +G Sbjct: 232 ALITRIKIMGNMNIAE---KRIPQDGH 255 >UniRef50_A8LS31 Glycosyl transferase n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LS31_DINSH Length = 635 Score = 239 bits (609), Expect = 4e-61, Method: Composition-based stats. Identities = 75/492 (15%), Positives = 147/492 (29%), Gaps = 61/492 (12%) Query: 4 LLDVFATWLYGLKVIAIT--LAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 ++ + +L ++ I + + L I + + + LS R Sbjct: 194 MMGLVCLYLATAALVLIPGLVTSALMWITLAVLAITMGFKLMLAVACLSARPPPQPPPER 253 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 +P ++I+VP E+ V + + D T+ + ++ Sbjct: 254 NKTRPPLPAMSILVPLLRESEVAEKLLNNLDRLRYPRALLDVLFVVEAEDDVTKNALSQL 313 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL- 179 + R +K LN L ++DAED P +L Sbjct: 314 -RLPQGFRMLEVPRGTVQTKPRALNFALPFC---------RGEIVGIYDAEDRPDPDQLL 363 Query: 180 ---RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 F + +Q + F + + + E++ G + + L +P Sbjct: 364 KVAEGFRHAAPEVACLQGRLDFFNTRFN-LIARCFTAEYAGWFGLFLQGLDRLGLPIPLG 422 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G R+ + + +D ++TED D+G RL G + +EA R Sbjct: 423 GTTLFLRRKVLEEV------GPWDAHNVTEDADLGMRLYRHGYRVSLIDTVTQEEANVRI 476 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 ++Q+SRWI G + + + LWR Sbjct: 477 WP------------------------WIKQRSRWIKGYM------ATYAVHMRSPRALWR 506 Query: 357 DR-KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV 415 G + ++ L + L + + A L L ++ Sbjct: 507 ALGPGGFAALQCLFLGSILSALTMPLLLWLWLGHLGAPIVPNAVLATLPPAHVLGPVMLG 566 Query: 416 NRIVQRVIFVTGYYG--LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 V ++ G + L L++ +A +AL ++L + WDKT Sbjct: 567 IEAVNLALWAAGVRAARHRHLWPMLPLLHVYFLMSSIAALKALVELLY----KPFYWDKT 622 Query: 474 THDFPSVTGDTR 485 H + Sbjct: 623 DHGIAMDVPEPP 634 Score = 75.1 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 53/179 (29%), Gaps = 1/179 (0%) Query: 460 LQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITE-EQLDTALRNRVEGLRL 518 + P R D P L +L+ I+ E+L R++G R Sbjct: 3 VISLQPTRDTGDGLLAPVPRKPAPRPERELLADLLVRRGDISPAERLQAVHMARLQGRRP 62 Query: 519 GGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN 578 + +G ++ E L +A E + LI L + +LP R + Sbjct: 63 LEVLEAEGWLAPELLLEARCEMYRAGRIDPEQAPADPDLIGAYGIDWCLTHGILPWRRLS 122 Query: 579 DELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYN 637 + + + +AA + + I +R +A L Sbjct: 123 GATVYLVAEPEEFADIAATLPPEAGQPVLAVAGADAIAEAIRAAHAPHLIARAETALPG 181 >UniRef50_B2V1W7 Glycosyl transferase, group 2 family protein n=29 Tax=Clostridium RepID=B2V1W7_CLOBA Length = 476 Score = 239 bits (609), Expect = 4e-61, Method: Composition-based stats. Identities = 61/438 (13%), Positives = 140/438 (31%), Gaps = 47/438 (10%) Query: 24 VIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGV 83 + + + F+ ++ + + R+ + +Y K A+++ A NE V Sbjct: 5 IFTITTTIFQIFVFILTLYYMVLGFFGLIRKKEKKNYTP-----NKKFALLIAAHNEEVV 59 Query: 84 IGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKAD 142 IG + E + Y +FV +T + E V+ K Sbjct: 60 IGKLIESMLNLNYPKDMYDVFVIADNCTDNTAKISKEY-----GVNVCERFNKDKRGKGY 114 Query: 143 CLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN-YLVERKDLIQIPVYPFER 201 L + D +++ ++ + + DA++++ L+ N + + ++Q + Sbjct: 115 ALEWMFDKLSKMKK----QYDAVAIFDADNLVHKDFLQEINSKMNDGYKVVQGYIDSKNP 170 Query: 202 EWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDV 261 E + + + +Y F + RE + G G + L + Sbjct: 171 EDS-WIAASYSIAFWTQNRMFQLARENVGFSNQIGGTGFAIETETLKEL-------GWGA 222 Query: 262 QSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFS 321 LTED + +L G + ++ + K P S Sbjct: 223 TCLTEDLEFTCKLVLNGEKVGWAHDAIIYDEK-----------------------PLKLS 259 Query: 322 TAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLL 381 + Q+ RW+ G + + ++ W A+ F+ +++ +L + Sbjct: 260 QSWTQRKRWMQGFTDVASRYFWRLTKKSIKERKWYIFDCALYVLQPFITLMLAASAVLTI 319 Query: 382 AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRL 441 LS + ++ + F L+ I ++ + ++ L Sbjct: 320 IQVDTSGHNIFVLSQAAEGHVVLGVGIKVFALVQFIITPLILAIENKVSKGFFAMTALYS 379 Query: 442 FWGNLINFMANWRALKQV 459 LI ++ A Q+ Sbjct: 380 TNLFLIPYILRIMAEYQL 397 >UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltransferase n=1 Tax=Streptomyces avermitilis RepID=Q82DY8_STRAW Length = 790 Score = 238 bits (608), Expect = 5e-61, Method: Composition-based stats. Identities = 63/454 (13%), Positives = 129/454 (28%), Gaps = 72/454 (15%) Query: 27 FISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGN 86 SGL + V+ + + R +P++++VPA+NE I N Sbjct: 392 LASGLVVVGVAVMGRFAMMLVLARTHYRQRNKRRFSWGPEITRPVSVIVPAYNEKECIEN 451 Query: 87 MAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNN 146 + + I V + T V+ + R PNV + K LNN Sbjct: 452 TI-NSLAQSTHP-IEIIVVDDGSTDGTADIVEAM--RIPNVRVLRQENA---GKPAALNN 504 Query: 147 VLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLVERKDLIQIPVYPFEREWT 204 + + + ++ D + V +R + + + R+ Sbjct: 505 GVRNAS---------YDIVVMMDGDTVFEADTVRRLVQPFADDEVGAVAGNAKVGNRD-- 553 Query: 205 HFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSL 264 E+ D + + L G F R AV + +L Sbjct: 554 TVIGAWQHIEYVMGFNLDRRMYDLLRCMPTIPGAIGAFRREAVLEV------GGMSEDTL 607 Query: 265 TEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAV 324 ED DI + G ++ P + Sbjct: 608 AEDTDITIAMHRGGWRVVYEEHARAWT-----------------------EAPGSLKQLW 644 Query: 325 RQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYE 384 Q+ RW G + +K K + + R + + L++ ++ Sbjct: 645 SQRYRWSYGTMQALWKHRKSLTDRGPSGRFGR------------------VGMPLVVIFQ 686 Query: 385 SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI-----FVTGYYGLTQGLLSVL 439 + P + +F+ + + W + + +V +++ F L+ L Sbjct: 687 IVTPVFAPLIDVFTVYSMIFVDFWASLLAWLAVLVVQLVCAAYAFRLDREKYRYLLMMPL 746 Query: 440 RLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 + + ++ + L G R +T Sbjct: 747 QQLAYRQMMYLVLIHSCITALTGGRLRWQKLKRT 780 >UniRef50_D0L467 Glycosyl transferase family 2 n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L467_GORB4 Length = 512 Score = 238 bits (608), Expect = 5e-61, Method: Composition-based stats. Identities = 82/483 (16%), Positives = 156/483 (32%), Gaps = 64/483 (13%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 LL + + + +A + + + V V + ++ R R + Sbjct: 62 LLGAVVSVVALVLFPVGAVATFVSVVTVAYVITLVDRLVIFRRGLVNGAIRVTDEQARSI 121 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 D P ++VPA+ E V+G++ + + + + +D T V A Sbjct: 122 PDDDLPPYTVLVPAYGEPEVVGDLIAAVESIEYPRDKLQVLLLLEEDDEPTIVAARAVEA 181 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 V V+ P +K N L T + DAED P++LR Sbjct: 182 -SGIVTVVLTPPADPRTKPKACNYGLHFAT---------GDIVTIFDAEDQPDPLQLRRA 231 Query: 183 NYLV-----ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 ++ + +Q + T + ++ G +P +P G Sbjct: 232 VHVFTHIDDDSVVCVQGKLSFHNSRDNILTE-WFTADYGIWFGFLLPGMMVSRAPIPLGG 290 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 F R + + A+D ++TED D+G R+ + G + ++EA Sbjct: 291 TSNHFRRDVLDRI------GAWDPFNVTEDADLGVRIADSGYRTAVLDSVTLEEANVDAI 344 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 +RQ+SRW G + W + LWR Sbjct: 345 ------------------------NWIRQRSRWYKGYLQ------TWLVHMRHPVRLWRI 374 Query: 358 RKGAISNFVSFL-----AMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFG 412 + L + + L L+ + +F G + + L+ L FG Sbjct: 375 LGTVAWLRFTLLIAGTPLIACVNMLFWLILVLWVAGQPPVVADLFPGPIYYLALISLIFG 434 Query: 413 LMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDK 472 + + G + ++S L + L+ +A + + Q+L + W+K Sbjct: 435 NGAAIYMNLIA--IRENGRSDLVVSALLVPAYWLLMSVAAIKGVWQILVN----PSYWEK 488 Query: 473 TTH 475 T H Sbjct: 489 TFH 491 >UniRef50_C4Z588 Type IV pilus assembly protein PilB n=9 Tax=Clostridia RepID=C4Z588_EUBE2 Length = 615 Score = 238 bits (608), Expect = 5e-61, Method: Composition-based stats. Identities = 42/272 (15%), Positives = 97/272 (35%), Gaps = 21/272 (7%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 + R LG +L++ +I E QL TAL R R +G LG ++ G + + +AL + Sbjct: 1 MNYRKKIRLGDVLVKKGIIDENQLQTALSRQREQGKMLGEMVIALGYATQRDINEALCDS 60 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL---ENDELIVGSEDGIDPVSLAAL 597 G+ + + + +++ + ++ Y ++PL + V D + +++ + Sbjct: 61 LGIDFVDMRETDVSEDVLSMLDENIMRKYTLVPLGDAPDNPGAIRVAMADPTNILAMDDI 120 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRA---MLYNAVQHQWLTEQQAGEIWR 654 G++V V+ I + +++ V + E +A Sbjct: 121 NIVTGKQVVPVLANASDINAFFDKAFGQKQAQSIVDLYKKEQGDVFKEETKEDKARREEI 180 Query: 655 QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 + P L ++ S I++ E + +G L ++ Sbjct: 181 ENAPIVQLINSVIEQAVRQRASDIHI-----EPMEKSIRVRYRIDG-----NLREIIDYD 230 Query: 715 RE-LQVSMQSLLLKAGLNTEQVAQLESENEGE 745 L + + +G++ + +G Sbjct: 231 NTLLGAITTRIKIMSGMDISE---KRKPQDGR 259 >UniRef50_C9R8Y9 Glycosyl transferase family 2 n=1 Tax=Ammonifex degensii KC4 RepID=C9R8Y9_AMMDK Length = 415 Score = 238 bits (608), Expect = 5e-61, Method: Composition-based stats. Identities = 78/478 (16%), Positives = 138/478 (28%), Gaps = 68/478 (14%) Query: 28 ISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNM 87 I + + + S+YRR S P A+++ A NE VIG + Sbjct: 4 ILFILQLALASYGLYHILLSLFSLYRRVEDYS----ATPPRHSFAVVIAAHNEEKVIGEL 59 Query: 88 AELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNN 146 + + E Y +FV T + A + P K L Sbjct: 60 IKSIFRSDYPRELYEVFVIADNCTDRTAEIARSLGA-----TVIERYNPHERGKGYALEY 114 Query: 147 VLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER-KDLIQIPVYPFEREWTH 205 I R F F++ DA++++SP L++ N+ + R + +IQ + + T Sbjct: 115 GFQRIFALPRK----FDAFVILDADNLVSPHFLQVMNHRLARGEKIIQGYLDTKNPDDTW 170 Query: 206 FTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLT 265 + Y+ + + R L G G C + + + + +LT Sbjct: 171 ISRSIYVG-YLISNRFCQLARHNLGLSCALGGTGMCIATEVLKRF-------GWGMTTLT 222 Query: 266 EDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVR 325 ED + + G+ + V + K P T + R Sbjct: 223 EDLEFQTKALLCGLRVTWAHDAAVYDEK-----------------------PLTLKQSWR 259 Query: 326 QKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYES 385 Q+ RW+ G + + ++ GA+ + M+V + LL Sbjct: 260 QRQRWMQGHCQVAGRYFFRLMWEGIRTRDFKKIDGALYLLRPYFTMMVGLAALL------ 313 Query: 386 LWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGN 445 SIF + L W G + + + L L W Sbjct: 314 ---------SIFEFDWSRIDLWWFVKGFSGQYLYMALALLLERAPLRAYL-------WLL 357 Query: 446 LINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEE 503 A R AW T H ++ + + Sbjct: 358 YYPIFALTWIPITYAGFIYRHRRAWCHTQHVRNISWEQLKAWHRYRMQRKWHGAENPD 415 >UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Streptomyces RepID=B5HQE3_9ACTO Length = 900 Score = 238 bits (606), Expect = 8e-61, Method: Composition-based stats. Identities = 70/445 (15%), Positives = 127/445 (28%), Gaps = 61/445 (13%) Query: 34 FFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDE---KPLAIMVPAWNETGVIGNMAEL 90 I V + R L + RR+ R+ + + +P+ ++VPA+NE I N E Sbjct: 502 LVIVGVAVMGRFGMMLILARRHYRLRNKRRFSWGPTVTRPVTVIVPAYNEKECIANTLE- 560 Query: 91 AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDA 150 + + + I V + T E AR + V R K LNN + Sbjct: 561 SLSKSTHP-IEIIVVDDGSSDGTSEISRE-AARALGMTNVRVIRQDNAGKPAALNNGVRN 618 Query: 151 ITQFERSANFAFAGFILHDAEDVISPMELR--LFNYLVERKDLIQIPVYPFEREWTHFTS 208 + + I+ D + V P + + + + R+ Sbjct: 619 AS---------YDIVIMMDGDTVFEPDAVHQLVQPFADPEVGAVAGNAKVGNRD--TVIG 667 Query: 209 MTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDY 268 E+ D + + L G F R AV + +L ED Sbjct: 668 AWQHIEYVMGFNLDRRMYDLLRCMPTIPGAIGAFRREAVLQV------GGMSEDTLAEDT 721 Query: 269 DIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKS 328 DI + G ++ P + Q+ Sbjct: 722 DITIAIHRAGRRVVYQEHARAWT-----------------------EAPGSLKQLWSQRY 758 Query: 329 RWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWP 388 RW G + +K K + + R V +++ + + A Sbjct: 759 RWSYGTMQALWKHRKSLTDKGPSGRFGR---------VGMPLVVIFQIVTPVFAPLIDVF 809 Query: 389 DAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLIN 448 A+ + I +A L L + R+ Y L+ L+ + Sbjct: 810 TAYSMIFIDFRAALYAWLAVLGIQFVCAAYAFRLDKEKYRY----LLMMPLQQLAYRQMM 865 Query: 449 FMANWRALKQVLQHGDPRRVAWDKT 473 ++ + L G R +T Sbjct: 866 YLVLIHSCITALTGGRLRWQKLKRT 890 >UniRef50_D1BGY1 Type II secretion system protein E (GspE) n=3 Tax=Actinobacteridae RepID=D1BGY1_SANKS Length = 557 Score = 237 bits (605), Expect = 1e-60, Method: Composition-based stats. Identities = 51/261 (19%), Positives = 104/261 (39%), Gaps = 14/261 (5%) Query: 487 LRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 ++ LG+ILLE ++ E QL AL +V G LG ++ G++S QL ALA Q G+ + Sbjct: 1 MKQLGEILLEEGLVNEAQLMAALDEQVVRGTSLGRVLVELGVLSEGQLVSALAAQVGMQF 60 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKV 605 +D + + + ++ + +V Y VLP+ E D L++ D + +++ + G +V Sbjct: 61 VDLDTFPVDRAAVSRLTGAVCRRYTVLPIAFEGDALVLAMADPGNVLAVDDVRSSTGMQV 120 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQH-QWLTEQQAGEIWRQYVPHQFLFA 664 V+ + + + D + Q + + G+ P Sbjct: 121 LPVVATHEDLSRAIDRFVRADDEMDNLTNAFTEEQRVDDVDLSKIGDSVDDDAPIVRYVN 180 Query: 665 EILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSL 724 I+T S I++ E S L +GV+ + + + + + Sbjct: 181 LIVTQAITDRASDIHI-----EPSEHDLRVRYRIDGVLHEM----QRSPKNITGGVISRV 231 Query: 725 LLKAGLNTEQVAQLESENEGE 745 + + ++ + +G Sbjct: 232 KILSDIDIAE---RRKPQDGR 249 >UniRef50_Q181B0 Type IV pilus assembly protein n=5 Tax=Clostridium difficile RepID=Q181B0_CLOD6 Length = 561 Score = 237 bits (605), Expect = 1e-60, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 101/265 (38%), Gaps = 14/265 (5%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 +G L+E ITEEQL AL + G RLG ++ +GLI + L L E Sbjct: 2 KPVAKKVRIGDKLVEKGYITEEQLKWALSEQKNSGKRLGEFLVQEGLIDSNLLISVLKEL 61 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 + ++ +I + +P ++ Y V P +++ +++ + D D ++ + R Sbjct: 62 LDIESIFLEGTEIDTLATKMVPENICKRYTVFPFKIDGNKICLAMSDPQDREAVQDVRRM 121 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 G+ V I I + H YA + YN + + E E P Sbjct: 122 SGKDVEIFISSTEDINKAIGHAYAHSEINKAMTE-YNKNRTGGVRETVILEEDVNAAPIV 180 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L IL + S I++ E+S + +G++ + R+ + + Sbjct: 181 RLVNNILENAVRMEASDIHI-----EQSENYMRVRFRIDGMLREYM--RMNSAPYK--AV 231 Query: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 + + + + +N + +G Sbjct: 232 ISRIKIMSDINISE---KRIPQDGR 253 >UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Streptomyces RepID=B5HP10_9ACTO Length = 741 Score = 237 bits (605), Expect = 1e-60, Method: Composition-based stats. Identities = 72/465 (15%), Positives = 129/465 (27%), Gaps = 57/465 (12%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIG 85 + + GL V+ + ++ R R P +P++++VPA+NE I Sbjct: 326 VLVVGLSIIGSLVIGRFALMLLLSGIHARRVRRKGFRWGAPVTQPVSVLVPAYNEAKCIE 385 Query: 86 NMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLN 145 N + ++ + V + T R V+ + PNV V K LN Sbjct: 386 NTVR-SLMASEHP-IEVLVIDDGSSDGTARIVEAMG--LPNVRVVRQLNA---GKPAALN 438 Query: 146 NVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLVERKDLIQIPVYPFEREW 203 L AN ++ D + V P +R + R + R+ Sbjct: 439 RGL---------ANARHDIIVMMDGDTVFEPSTVRELVQPFGDPRVGAVAGNAKVGNRDS 489 Query: 204 THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQS 263 E+ D + + L G F R A+ + + Sbjct: 490 --LIGAWQHIEYVMGFNLDRRMYDVLRCMPTIPGAVGAFRRSALERV------GGMSDDT 541 Query: 264 LTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTA 323 L ED DI + G ++ P++ Sbjct: 542 LAEDTDITMAMHRDGWRVVYAEKARAWT-----------------------EAPESVQQL 578 Query: 324 VRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAY 383 Q+ RW G + +K + + R +S F +V+ LL L Sbjct: 579 WSQRYRWSYGTMQAIWKHRRALVERGPSGRFGRVGLPLVSLF------MVVAPLLAPLID 632 Query: 384 ESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFW 443 L + + AWL L + R Q +L ++ Sbjct: 633 VFLLYGVVFGPTQKTIVAWLGVLAIQVVCAAYAFRLDRERMTHLISLPLQQILYRQLMYV 692 Query: 444 GNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLR 488 L +++ + L+ RR + Sbjct: 693 VLLQSWITALTGGR--LRWQKLRRTGVVEAPGGPVPRQRARSESD 735 >UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2G7Y8_NOVAD Length = 1101 Score = 237 bits (604), Expect = 1e-60, Method: Composition-based stats. Identities = 72/449 (16%), Positives = 125/449 (27%), Gaps = 47/449 (10%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 +A LA+ ++ F + ++ + S R E +++++PA Sbjct: 677 VAAFLALDGLVTLFSWLFFVAIALGIARAVIMAGLAWWQSRSPRAEPPAFEPTVSVIIPA 736 Query: 78 WNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP 137 WNE VI E + DY + V + T V P V + Sbjct: 737 WNEERVIAASVERVLAS-DYPALQVIVADDGSKDATSAVVARHFGHDPRVTLLTL---AN 792 Query: 138 TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN--YLVERKDLIQIP 195 KA LN L T I DA+ P+ +R + R + Sbjct: 793 GGKAAALNRALRHAT---------GEVVIALDADTQFEPLTIRRLARWFADPRIGAVAGD 843 Query: 196 VYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGD 255 R + + E+ + G + R A+ ++ Sbjct: 844 ARVGNRV--NLVTRWQAVEYITAQNLERRALAGFDAMTVVPGAVGAWRRAALDSV----- 896 Query: 256 GIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREY 315 + +L ED D+ ++ KG + V Sbjct: 897 -GGYPENTLAEDQDLTIAIQRKGWRVTYDPRAVAWT-----------------------E 932 Query: 316 FPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMI 375 P TF RQ+ RW G + +K K +S A ++F A+ +I Sbjct: 933 APQTFRALARQRYRWAFGTLQCLWKHRKVITSRKPAGLGLVGLPQAWLFQIAFAAISPLI 992 Query: 376 QLLLLLAYESLWPDAWHFLSIFSGSAWL-MTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 L+ + S + M + W F + + Sbjct: 993 DGALIASIISTVVRVVQHGWAQTQGDLGRMAIYWSLFTAIDVICGWIAYRLDDKRPPYPA 1052 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHG 463 L V + I + RAL + Sbjct: 1053 HLLVAQRIVYRQIMYWVVLRALASAIGGW 1081 >UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I0Z1_9SPHI Length = 489 Score = 237 bits (604), Expect = 1e-60, Method: Composition-based stats. Identities = 74/480 (15%), Positives = 137/480 (28%), Gaps = 44/480 (9%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAW 78 I + +++ I L FI + + R S + + +P + Sbjct: 1 MIFIYLLIGIYTLGLLFIFIYSLAQGNLLWNFWKARKWLASTPMKEMDTWPKVTVQLPIF 60 Query: 79 NETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP 137 NE V+ + E AA E I + + +T + E +P V+ R Sbjct: 61 NELYVVDRLIEAAANLNYPKELLEIQLLDD-STDETVDLIQEKIKNYPEVNFQYIHRQDR 119 Query: 138 TS-KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL--RLFNYLVERKDLIQI 194 KA L L N + DA+ V P L L + E+ ++Q Sbjct: 120 VGFKAGALKEGL---------VNAEGEFIAIFDADFVPDPDFLLKTLPYFSSEKVGMVQS 170 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 R ++ T + + R + G G + + + Sbjct: 171 RWTHLNRSYSLLTRL-QAFALDAHFLIEQMGRNYQHAFINFNGTGGVWRKSCIL------ 223 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 D + +LTED D+ +R + KG I+ I Sbjct: 224 DSGNWHDDTLTEDLDLSYRAQRKGWEFIYRPE-----------------------IESPA 260 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 P S Q+ RW G K S L + + + N F+A+L++ Sbjct: 261 ELPPIMSAVKSQQFRWTKGGAECARKHISGVMSQKLPFRVKFHAFAHLFNSSIFIAILLV 320 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 + + + + L +G + ++ L+ +R + Q Sbjct: 321 SLSSIGVWWAGIKGMIPERLFQLAGIFMIGFVIIAGVYLVSYFYARRSFLKSLGQVFWQL 380 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQIL 494 + + +L N A W L ++ + P Sbjct: 381 PIFLSVSMALSLHNSQAVWEGLTGKKSPFIRTPKFNLESGKQGLRNNLYIKFKIPATTYF 440 >UniRef50_A4T169 Putative uncharacterized protein n=4 Tax=Mycobacterium RepID=A4T169_MYCGI Length = 475 Score = 236 bits (603), Expect = 2e-60, Method: Composition-based stats. Identities = 82/489 (16%), Positives = 151/489 (30%), Gaps = 72/489 (14%) Query: 13 YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLA 72 L + LA + S +D +W+ + R + Sbjct: 45 LVLPAVTTVLAALYLASTIDR------HWLLVQGLRSPSLLTISDEEARAVPDNQLPVYT 98 Query: 73 IMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVV 131 +++P +NE ++ N+ + I + +D T+R + V ++ Sbjct: 99 VLLPVYNEPSIVHNLIAGVGRLEYPKDKLEILLLVEEDDIATRRAM--ATTELEAVRLIL 156 Query: 132 CARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR----LFNYLVE 187 P +K N + + ++DAED+ P++LR F L + Sbjct: 157 VPNSQPKTKPKACNYGM-------ATPGLKGEMVTIYDAEDIPDPLQLRKTVVAFQQLPD 209 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 IQ + F E T + E+ + G +P EA VP G Sbjct: 210 NVGCIQARLGYFNEEQNLLTR-WFSMEYDQWFGMTLPAVEAAGCVVPLGGTSNHMRTSVW 268 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 A+ +D ++TED D+G RL G + ++EA Sbjct: 269 RAI------GGWDEFNVTEDADLGVRLARAGYRTRILDSVTLEEANSDVL---------- 312 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD-------RKG 360 +RQ+SRW G + L LW R Sbjct: 313 --------------NWIRQRSRWYKGYLQTM------LVHLRHPAALWSQVGGKGILRLL 352 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 ++ V +A++ ++ + A+ PD + +T+ + L V + Sbjct: 353 NMTGAVPIVAVINLVFWATMAAWVLGRPDVVELAFPGATYYVYLTMYVVGAPLSVFMGLI 412 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 + Y L ++ +A +A+ Q++ R W+KT H Sbjct: 413 VTQRLGKPYMWWAAALVP----LYWMLQSIAALKAVFQLV----TRPQFWEKTVHGLSDT 464 Query: 481 TGDTRSLRP 489 S Sbjct: 465 VDVPNSTGR 473 >UniRef50_A5D548 Glycosyltransferases, probably involved in cell wall biogenesis n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D548_PELTS Length = 411 Score = 236 bits (602), Expect = 3e-60, Method: Composition-based stats. Identities = 68/458 (14%), Positives = 126/458 (27%), Gaps = 51/458 (11%) Query: 27 FISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGN 86 I F+ + + I YRR + E P AI+V A NE VIG Sbjct: 3 AIFYATQVFLTLFTFYHFIISLYGFYRR-----HEECLLPPSSRFAIVVAAHNEEKVIGE 57 Query: 87 MAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLN 145 + E Y ++V T + E A K L Sbjct: 58 LIRNLNELDYPKELYDVYVVADNCTDSTAKIAREKGA-----VVFERFNKAERGKPYALE 112 Query: 146 NVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV-ERKDLIQIPVYPFEREWT 204 I + + + DA++++ L + N + + + +IQ + T Sbjct: 113 FAFSKIF----ESGIPYDAVCVFDADNLVDTNFLTVMNAHLLKGEKIIQGYLDTKNAGDT 168 Query: 205 HFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSL 264 T Y+ + + + L G G C S + + + SL Sbjct: 169 WITKSIYVS-YILTNRFLQLSKYNLGLTCALGGTGMCLSVDVLKRY-------GWGMTSL 220 Query: 265 TEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAV 324 TED + + G+ + V + K P T + Sbjct: 221 TEDLEFQTKALLNGIKVTWAHDARVYDEK-----------------------PLTLMQSW 257 Query: 325 RQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYE 384 RQ+ RW+ G + + + GA+ +M + L+ Sbjct: 258 RQRKRWMQGHTNVAGRYVARLVREGIRTRNFAMIDGAVYLIQP---YFLMFTGIGLITNI 314 Query: 385 SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWG 444 + PD ++ + + GL + R+ V + +Y + + + G Sbjct: 315 FMGPDQILDRPVWLVVGFFAQFFYFGLGLALERVKPVVYWWLIFYPIFALT-WIPVAYVG 373 Query: 445 NLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 + W + Sbjct: 374 FAMRKNKEWTHTLHFRNIKHENLPNLYLPARANGRRSS 411 >UniRef50_B9R454 Glycosyl transferase, group 2 family protein n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9R454_9RHOB Length = 617 Score = 236 bits (601), Expect = 3e-60, Method: Composition-based stats. Identities = 85/506 (16%), Positives = 157/506 (31%), Gaps = 79/506 (15%) Query: 12 LYGLKVIAITLAVIMF-----ISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL--Y 64 L + +I LAV + ISG F + V + + + +L + Sbjct: 162 LPAVLLILCFLAVFVLGVSQMISGALLFVVTGVVSLACFGAGIIRFVCAQSSQEEDLVYH 221 Query: 65 KPDE--------KPLAIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQR 115 P+ ++VP + E GV+ ++ L A + I + +D +T Sbjct: 222 LPEPLSSGLIIWPRYTVLVPLYREAGVVPDLLRALNALNYPRDRLQILLLMETDDLETAA 281 Query: 116 DVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVIS 175 + E ++ +V P +K L+ L A T + DAED Sbjct: 282 ALPE--DLPSHIEALVVPDGTPRTKPRALDYGLAAAT---------GTYVTVFDAEDRPD 330 Query: 176 PMELRLFNYLVER----KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAG 231 P +L+ +L + +Q + T F S + E++ L + +P Sbjct: 331 PDQLKKAAFLFAKGPAELACLQARLVVDNANET-FISRQFALEYACLFDQLLPWLFRHRW 389 Query: 232 QVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDE 291 P G F A+ A+ +D ++TED D+G RL+ G + Sbjct: 390 PFPLGGTSNHFRISALHAV------GGWDRYNVTEDADLGVRLERLGFRLGVLP------ 437 Query: 292 AKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLN 351 E P T + Q++RW G + F Sbjct: 438 ------------------CQTLEEAPVTLKAWLAQRARWHKGWLQTIF------VHARSP 473 Query: 352 YFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNF 411 L D + + A+ + LL+ L L + + + ++ Sbjct: 474 RRLLSDLGAVRTAVL--AALFLGTFLLIALHPVFFVLLTGSLLGYYDHTYFFGNIVLTVM 531 Query: 412 GLMVNRIVQRVIFVTGYYGLTQ-----GLLSVLRLFWGNLINFMANWRALKQVLQHGDPR 466 + + G + LL + + A +RA+ ++ Sbjct: 532 FVSGAAAGYAGALFALWTGARRRGHGIRLLDAPGVLIYWMFAGFAFYRAVWELAS----A 587 Query: 467 RVAWDKTTHDFPSVTGDTRSLRPLGQ 492 W+KT H +PL + Sbjct: 588 PYRWNKTEHGVSRQRTSLTDWKPLPE 613 >UniRef50_D1NA10 General secretion pathway protein E n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1NA10_9BACT Length = 527 Score = 236 bits (601), Expect = 3e-60, Method: Composition-based stats. Identities = 70/526 (13%), Positives = 144/526 (27%), Gaps = 104/526 (19%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE---LYKPDEKPL 71 + + I V++ + +S E L + + Sbjct: 51 WDYFVFFVTGYLMIWYA-----AAVFFRGGAAVLSWFGQGEEIVSDAEVAALDEKELPVY 105 Query: 72 AIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 I++P ++E + + E + + +D +T+ ++ P V Sbjct: 106 TILLPLYHEANIAEKIVRNMGRLDYPKEKLDVKLLLEADDDETRLALERTG-LPPYCEVV 164 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF-----NYL 185 P +K N L +++DAED P +L+ Sbjct: 165 TVPDAPPRTKPRACNFGLRRA---------RGEFSVIYDAEDAPEPDQLKKAYIVFRRDQ 215 Query: 186 VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 ++ +Q + F T + + E+S + + +P G F Sbjct: 216 EKKVLCVQGKLNYFNARHNLLTRL-FTVEYSTYFDLTLSGYQLFNLPLPLGGTSNHFRTA 274 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 + + +D ++TED D+G R+ E+G V Sbjct: 275 ELREV------GGWDPFNVTEDCDLGIRIYERGYKTRLVNST------------------ 310 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNF 365 E +RQ+SRW+ G + ++ L+ G ++ Sbjct: 311 ------TYEEANAHVWNWIRQRSRWVKGFIQTHLVHYRNPFLTVKRLGLYGAFGGFLAVG 364 Query: 366 VSFLAMLV-MIQLLLLLAYESLWPDAWH-------------------------------- 392 S + ML +I +LL Y L + Sbjct: 365 GSAMMMLTNLIFWTVLLIYAGLLIHGFSHGLGLYDQIVGPHLPGGAYEGIRLGGMSFRAW 424 Query: 393 ----------FLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLF 442 + L + F + + V + + ++ + Sbjct: 425 PLVYYGQGEDPFWAVFSQIFFAGSLIMLFANFIFIGIGVAACVKRKFYYLIPVSLLMPFY 484 Query: 443 WGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLR 488 W + +A W+ Q+ + W+KT H + L Sbjct: 485 WLLI--SIAAWKGFIQIF----TKPFYWEKTIHGLTTDPITEEELH 524 >UniRef50_C6CWA2 Glycosyl transferase family 2 n=4 Tax=Bacillales RepID=C6CWA2_PAESJ Length = 412 Score = 236 bits (601), Expect = 4e-60, Method: Composition-based stats. Identities = 73/466 (15%), Positives = 148/466 (31%), Gaps = 58/466 (12%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 I + ++ + GL F+ W R +L P +K A++ Sbjct: 2 FNTIMLVFQIVFALLGLYQLFLTCFGWHR---------------KKEDLSHPPQKTFALL 46 Query: 75 VPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 V A NE V+G + E E + IFV T V+ + V+ V Sbjct: 47 VAAHNEEQVVGALIENLLKLKYPRELFDIFVICDNCTDGTVDIVNS----YDGVYACVRN 102 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN-YLVERKDLI 192 K + +L + + +S + G + DA+++++ L+ N L+ +I Sbjct: 103 NKNQRGKGYAVEWMLKELWKMPKS----YDGVAIFDADNLVATDFLQYMNNDLINGHRVI 158 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 Q + + + S + + R L G G CF + + + Sbjct: 159 QGYLDTKNPNDS-WISSANAINYWFCNRLWQLPRTNLGLANFLGGTGMCFDAKLLQEM-- 215 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 + SL ED + R ++G+ F V + K Sbjct: 216 -----GWGATSLVEDLEFTVRCIQRGIYPKFNFEAKVFDEK------------------- 251 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 P TF + RQ+ RW+ G K + ++ F ++ L Sbjct: 252 ----PITFQASARQRLRWMQGHFDVTRKYMLPLLWQGIKERSMTKIDASLYVFNAY-NYL 306 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 + ++ + L + S+++ + +++ ++ + + + + V L Sbjct: 307 AGFFIAAIIWGDMLLFGGNNVESVYNLLPFWLSIPYMAYVFIQIPLSMYMAKVPWKLYLR 366 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 + + W I A + + H RV + Sbjct: 367 IPTFLLFTVSW-WPITVHAFFTQNNKKWSHTQHTRVIRLEDVQSKQ 411 >UniRef50_A3V835 Glycosyltransferase, family 2 n=2 Tax=Rhodobacteraceae RepID=A3V835_9RHOB Length = 633 Score = 235 bits (600), Expect = 4e-60, Method: Composition-based stats. Identities = 78/481 (16%), Positives = 153/481 (31%), Gaps = 55/481 (11%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAI 73 + + L ++ L + + + + + R R + + D + + Sbjct: 192 IMACAVLDLGALIIAVTLWAMVVLIATTLLKAAAVIIGMRARAH-DPRPVAEADLPVITM 250 Query: 74 MVPAWNETGVIGNM-AELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 ++P + ET + ++ LAA + + +D T+ + + ++ Sbjct: 251 LIPLYRETAIASHLLVRLAALRYPRALLDVCLVLEQDDATTRATLARTQ-LPGWIRAIIV 309 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE----R 188 +K LN L + + DAED +P +L + Sbjct: 310 PPGQVKTKPRALNYALPFA---------RGSIIGVWDAEDAPAPDQLHVVARHFAAAGAH 360 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 +Q + + T + E++ +P L VP G F R + Sbjct: 361 VACLQGVLDYYNAGTNWLTR-CFTIEYAAWFRVVLPGLARLGLVVPLGGTTLFFRRSVLE 419 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 L A+D ++TED D+G RL +G + ++EA R Sbjct: 420 TL------GAWDAHNVTEDADLGLRLARRGYVTALIPTLTMEEANGRAWP---------- 463 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSF 368 V+Q++RW+ G + L + LW G F+ Sbjct: 464 --------------WVKQRARWLKGYAITYGVHMRSPVRLLRDLGLW-QFIGVQVLFLGT 508 Query: 369 LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGY 428 L++ ++ L L L + +G+ W MT ++ + + + Sbjct: 509 LSLFALMPLFWSLWLIPLGVAHPLHGWLSAGAFWAMTYAFIAAEALALLVNLVGL---HK 565 Query: 429 YGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLR 488 G + L L + +A +R L ++ R WDKT H + Sbjct: 566 AGRLRLWGWALTLPAYFPLGTLAAYRGLAEL----ATRPFYWDKTAHGVVLTGLKPPANL 621 Query: 489 P 489 P Sbjct: 622 P 622 Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 40/118 (33%), Gaps = 2/118 (1%) Query: 517 RLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL 576 L +L + + +A+ LA+ +G ++LIA A L VLP R Sbjct: 49 PLSDILLRDYNLPSRTIARYLAKAHGAQVVDPTRRPADATLIARWGARDCLRRGVLPWRA 108 Query: 577 ENDELIVGSEDGIDPVSLA-ALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRA 633 + V + + L + G ++R I ++ + + D Sbjct: 109 AQGTVTVLTTCPTRFAAARPDLEQVFG-RIRMAITTEAKLTDAIATLHGPIFAADAET 165 >UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria sp. R11 RepID=B7QVL5_9RHOB Length = 1140 Score = 235 bits (599), Expect = 5e-60, Method: Composition-based stats. Identities = 65/456 (14%), Positives = 132/456 (28%), Gaps = 58/456 (12%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAW 78 + LAV++ + F+ + +R R R P+ +++PA+ Sbjct: 722 IVPLAVLLGVLRALTLFVLAI-----------RSKRLSRSDLRHRTGDFTAPVTVVIPAY 770 Query: 79 NETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT 138 NE I + DY N + V + T V + P V + Sbjct: 771 NEEKSILKTI-YSVLDSDYPNLSVLVVDDGSTDATYDLVSKTYLNNPKVQILRQPNG--- 826 Query: 139 SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY--LVERKDLIQIPV 196 K N +T + DA+ +++P +R + + + Sbjct: 827 GKWKAANLAFAHVTT---------DYVVAIDADTIVAPDAIRRLMQPLRNPKVGAVAGKI 877 Query: 197 YPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDG 256 + + E++ D E + + G + AV Sbjct: 878 MVGN--SNNLLTKLEKLEYTVAQNIDRRAYETINAIMVVPGAFGAWRTAAVRE------C 929 Query: 257 IAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYF 316 + Q+L ED D+ L E G Sbjct: 930 GYYSSQTLAEDTDLTISLLEAGYVVRAAEKAYAYT-----------------------EA 966 Query: 317 PDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQ 376 P + T ++Q+ RW IGI+ +K T L + + L ++ Sbjct: 967 PASVGTLMKQRMRWSIGILQSAWKHRA-TIRKGHAIGLVGLTDLVLFGVIMPLLGPIIDL 1025 Query: 377 LLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLL 436 LL+L+ + F+ +++ ++L L+ + + L Sbjct: 1026 LLVLMLARFITSFDGTSFDAFTARDYIVLSIFLALPLLEMIMADYAVRSEPSTPRRMVFL 1085 Query: 437 SVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDK 472 +L + + +R+L ++L + Sbjct: 1086 LLLNRLIYRQLLIINVYRSLWRILTGRLTGWHKLRR 1121 >UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycosyl transferase, group 2 family protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F4G9_ACIC5 Length = 1170 Score = 234 bits (597), Expect = 9e-60, Method: Composition-based stats. Identities = 67/464 (14%), Positives = 131/464 (28%), Gaps = 49/464 (10%) Query: 25 IMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 ++F+ + D + + + + +R R E +AI++PA+NE VI Sbjct: 748 LIFVFFVGDVLMSGRLLIIGLFALIERFRTR-----RIPPGVYEPAVAILIPAYNEEKVI 802 Query: 85 GNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCL 144 A DY + H+ V + T E A+ K+ KA+ L Sbjct: 803 VRTIRSAL-NSDYPHLHVVVIDDGSTDRTLEVAREAYAQEIADGKLTVLTKPNAGKAEAL 861 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLVERKDLIQIPVYPFERE 202 N L I ++ DA+ VI+ + ++ + + + Sbjct: 862 NFGLRQI---------REEVYVGIDADTVIAVDAVSKLVRHFADPKVGAVAGNAKVGNK- 911 Query: 203 WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQ 262 +T E+ + + G + AV A G + + Sbjct: 912 VNLWTR-WQALEYITGQNFERRALDLFNVVTVVPGAIGAWRTAAVLA------GGCYPLN 964 Query: 263 SLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFST 322 ++ ED D+ L E+G I+ + P + Sbjct: 965 TVAEDADLTMNLVEQGYKVIYEDHALAFT-----------------------EAPINANG 1001 Query: 323 AVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLA 382 +RQ+ RW G + FK + + F + + + + + L Sbjct: 1002 LMRQRFRWSFGTLQAVFKHRQAFRTNRAMGFFALPNIVVFQILLPLASPFIDLLFAVSLI 1061 Query: 383 YESLW-PDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRL 441 + S S L+ L ++ + G L Sbjct: 1062 QFLINKHYHPETASAASFDKLLIYFLAFIVIDFFTSLLAFSLEPRHPANKGDGWLLFHIW 1121 Query: 442 FWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTR 485 + + + + + D R WDK + Sbjct: 1122 LQRFSYRQIFSIVLFRTLKRAIDGRPFNWDKLERTAKMSRQTEK 1165 >UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaproteobacteria RepID=A4SIQ6_AERS4 Length = 416 Score = 234 bits (597), Expect = 9e-60, Method: Composition-based stats. Identities = 67/463 (14%), Positives = 137/463 (29%), Gaps = 56/463 (12%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 ++ L+ I + + +V+ + +R ++ L P + +MVP Sbjct: 1 MSGFLSGIAIFTLAYPSMMAMVWICGGLYYYFQWEQRDIALARTGLVLPRYPKVTLMVPC 60 Query: 78 WNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP 137 +NE + Y + + + DT + +D + A+ + + Sbjct: 61 YNEGANVEETISH-LLRQRYPDLEVLAINDGSKDDTGQRLDRLAAQDARLTVL---HQHN 116 Query: 138 TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV---ERKDLIQI 194 KA LNN L+ + D + V+ +R + + Sbjct: 117 QGKAMALNNGLERAK---------GEILVGIDGDAVLDHDAVRWMVKHFIESPKVGAVTG 167 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 P R + EFS + G + +GV F + AV A+ Sbjct: 168 --NPRVRTRSTIIGKIQTGEFSSIIGLIKRAQRIYGTVFTVSGVVVAFRKSAVEAV---- 221 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 + +TED DI ++L+ G + + Sbjct: 222 --GGWSTDMVTEDIDISWKLQLAGWLIHYQPQAL-----------------------CWV 256 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 P+T +Q+ RW G + L WR+R + +L+ ++ Sbjct: 257 LMPETVRGLYKQRLRWAQGGAEVILRY-------GLQAMRWRNR-HFWLLLLEYLSSVLW 308 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 +LLLA L L+ W ++ L+ + + Sbjct: 309 CYSMLLLALIWLIRPDLGELAEGQLFQWT-GVIMTAICLLQFGVSIFIDNHYDKTMARSF 367 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDF 477 + + L+N + A+ + + + W+ Sbjct: 368 VWCIWYPLIYWLLNLVTVIVAVPKAILRRRGQLAVWESPDRGE 410 >UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A3S6_THESM Length = 388 Score = 234 bits (597), Expect = 9e-60, Method: Composition-based stats. Identities = 63/446 (14%), Positives = 133/446 (29%), Gaps = 64/446 (14%) Query: 31 LDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAEL 90 + F+ + R S ++R + + I++PA NE VI + Sbjct: 2 IYPIFVYYIVLTIAGLRYNSRFKR-------PEIPEELPSVTILIPARNEGLVIRDTLRA 54 Query: 91 AA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLD 149 A + + + + DT + +EV +P + + G + K+ LN L Sbjct: 55 MANLDYPKDKLEVLLLDDGSTDDTAKIAEEVSKDYPFIKVIRVE-GGGSGKSYVLNYGLK 113 Query: 150 AITQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLIQIPVYPFEREWTHFTS 208 ++DA++ P L+ L L + + V T Sbjct: 114 LAK---------GEVIAVYDADNRPEPGALKDLVAMLSDETPAVTGKVKTMNWNRNILTR 164 Query: 209 MTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDY 268 E+ + G + + L +D ++L ED Sbjct: 165 FI-CMEYLYFQLAGQAGKSKFYKTAILPGTNFVIRKELLEEL------GGWDEEALAEDL 217 Query: 269 DIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKS 328 ++ FR+ G + V E P++ RQ++ Sbjct: 218 ELSFRIILTGKKIAYTPLAV-----------------------TWEQEPESLRVWFRQRT 254 Query: 329 RWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWP 388 RW G V + K +++ F FL ++V L + + + Sbjct: 255 RWAAGNVHTVKEYVKR----------FKEIPSWGLRFDLFLTLMVYYLLAMAVIVADV-- 302 Query: 389 DAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLIN 448 F+++ S + W G + + + + +L L + + Sbjct: 303 ---AFVALLLTSGSVTWFTWSVLGFVYLAFLLEIFAGLYDGKIKSPGCWLLALLMYHTYS 359 Query: 449 FMANWRALKQVLQHGDPRRVAWDKTT 474 + +L + + ++V + Sbjct: 360 QIWILISLAGLWEARRAKKVWYKTPR 385 >UniRef50_B0MCL3 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MCL3_9FIRM Length = 558 Score = 234 bits (597), Expect = 1e-59, Method: Composition-based stats. Identities = 47/266 (17%), Positives = 109/266 (40%), Gaps = 18/266 (6%) Query: 485 RSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALAEQNG 542 P+G++LL+ IT+EQ+D AL + E G RLG ++ I+ +Q+ +AL ++ Sbjct: 1 MKNIPIGEVLLQYGYITKEQIDQALDYQKEHPGKRLGTILMELQFITEQQMLEALGQRLS 60 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 ++ S+ ++ + S + ++P +A Y +L + +++ +L + D ++ ++ + + G Sbjct: 61 LSHISLGSYPVNSEAVEKIPRQLAFKYNILAVDMKDHQLYIAVNDPLNFYAMEDIRQLTG 120 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNA--VQHQWLTEQQAGEIWRQYVPHQ 660 +++ + + L ++YA E P Sbjct: 121 MQLKVFLAELSPLKKALEYFYAEVSARQAARQANETTQEAEDISFLDDMDEEADSDAPII 180 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L ++ + N S I++ E + +G I +T++R L S Sbjct: 181 KLLNTLVLRAYNTNASDIHI-----EPFEKETVVRMRIDGTIV-----DYVTLKRSLHAS 230 Query: 721 MQ-SLLLKAGLNTEQVAQLESENEGE 745 + + + G++ + +G Sbjct: 231 LTARIKIMGGMDIAE---KRIPQDGH 253 >UniRef50_B6R4T3 Glycosyl transferase, family 2 n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R4T3_9RHOB Length = 672 Score = 234 bits (597), Expect = 1e-59, Method: Composition-based stats. Identities = 72/488 (14%), Positives = 149/488 (30%), Gaps = 67/488 (13%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVR-----------RIKRKLSVYRRYPRMSYRELYKPD 67 +T++ I F+ + F DV YW + R S + + + + Sbjct: 236 LLTVSPIFFLGTIGFAF-DVFYWALFILLTLSLGAIGLLRIASFFTYSDKQDFAVPELEN 294 Query: 68 EKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN 126 I+VP + E+ + + + A E + +D TQ+++ ++ + Sbjct: 295 WPHYTILVPLYKESAICRQLVDALDALDYPKEALDVIFLVEQDDELTQKNLRKLLRKS-- 352 Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV 186 + ++ P +K L+ L A + DAED P +L+ Sbjct: 353 MRMIILPPGKPQTKPRALSVGLAA---------TKGEFVTVFDAEDRPEPQQLKKAICQF 403 Query: 187 E----RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 +Q + + + E++ L +P +P G F Sbjct: 404 ALEGHDVACLQAALSIDHAKDGWLVR-QFAFEYAALFDVFLPFLSRKNLLLPLGGTSNHF 462 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 A+ + +D ++TED D+ R +G + Sbjct: 463 RVSALRKV------GGWDPFNVTEDADLAVRFARQGFRTRTLNSS--------------- 501 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI 362 E P T + Q++RW G + + L Y A+ Sbjct: 502 ---------TYEEAPLTLKAWLHQRTRWHKGWIQTLAVH---LRNPRLTYKQLGATNFAL 549 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV 422 F ML + L + H + + A+ + L+ + +V + Sbjct: 550 LLLTFFGGMLCLWAAPLTALMFAEVLWGVHQTGLQAFDAFSIYALFCFLFGLGGTVVTVL 609 Query: 423 IFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 G + + ++ +A+++AL + + + W KT H G Sbjct: 610 QGSAKR-GFRPRGWEIASIPLYWVLGCIASYKALFEFIV----KPHYWRKTEHGIVRHRG 664 Query: 483 DTRSLRPL 490 + + Sbjct: 665 KIKDAEGV 672 >UniRef50_Q2FNF4 Glycosyl transferase, family 2 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNF4_METHJ Length = 848 Score = 234 bits (596), Expect = 1e-59, Method: Composition-based stats. Identities = 72/501 (14%), Positives = 148/501 (29%), Gaps = 82/501 (16%) Query: 19 AITLAVIMFISGLDDFFIDV--VYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVP 76 L V+ I + F ++ Y + + L D ++VP Sbjct: 188 YFWLFVLFTIVNITYFVMNPVKFYVSMQGMMGEKNVIHISDEDIQNLKDEDLPIYTVLVP 247 Query: 77 AWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVC-------------- 121 ++E ++ ++ + A + + + D +T ++ Sbjct: 248 LFHEQEMLPHILQNIANINYPRDKLDVKILMEEEDTETIEKARKLGLFGNVEEIISPMSE 307 Query: 122 ----ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 A H VV + T+K N L +++DAED+ Sbjct: 308 PEYHAFLSIFHPVVIPKADITTKPRACNYGLK---------RSRGEFVVIYDAEDLPDRD 358 Query: 178 ELRLFNYLV----ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 +L+ + +Q + + T + E+S + + + + + Sbjct: 359 QLKKVVIAFQRLGPKYACVQCLLNFYNPRKNMLTR-WFSIEYSYYYDFYIQGLDKIDAPI 417 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 P G F + + L A+D ++TED D+G R+ K + + Sbjct: 418 PLGGTSNHFRMKTLREL------GAWDPYNVTEDADLGMRIARKKLHTAVLNS------- 464 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF 353 E + +RQ+SRW+ G V W ++ Sbjct: 465 -----------------HTYEEAVTRVPSWIRQRSRWVKGFVI------TWFVTMRHPIK 501 Query: 354 LWRDRKGAISNFVSFLA-----MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLW 408 + +D I NF F + + L L + + S + + Sbjct: 502 VLKDI--GIKNFFIFQTGFGGNFYLPLMNLFLWLVFAAGFIIPEYFSSWFDFWPFAAIAV 559 Query: 409 LNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRV 468 N + + ++ T LL ++ + W+ Q++ + Sbjct: 560 FNLLIGNLFFLTMMVVATWKEKQRDLLLYAFFSPIYWILMSIGAWKGTLQLI----FKPY 615 Query: 469 AWDKTTHDFPSVTGDTRSLRP 489 W+KT+H V P Sbjct: 616 KWEKTSHGTEIVHEQLLIEHP 636 Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 41/113 (36%) Query: 530 AEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI 589 + LA++ G+A+ D + + +P +V + L + L V + + + Sbjct: 46 PDDFYAYLADRLGLAFMERDTLFANPRIGSVLPYAVGEETLIALLESKPTYLKVATANPL 105 Query: 590 DPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQ 642 D L G+K+ V+ I+T Y + + L + + Sbjct: 106 DTPLFTRLEEIFGKKIEKVVTPLDAILTITDTSYKGPHAYSALSELVDRQPDE 158 >UniRef50_B2A7G1 Type II secretion system protein E n=5 Tax=Firmicutes RepID=B2A7G1_NATTJ Length = 571 Score = 234 bits (596), Expect = 1e-59, Method: Composition-based stats. Identities = 48/271 (17%), Positives = 104/271 (38%), Gaps = 19/271 (7%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQ 540 + + LG +LLE+ ITEE L AL ++ G +LG S++ G+I+ E++ + L Q Sbjct: 1 MFRQKKQRLGDLLLESGAITEEDLKQALDHQNKSGQKLGASLVDLGIITEEEIIEVLEFQ 60 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 G+ S+ + +PA +A Y VLP+ + +L++ D ++ V++ + Sbjct: 61 LGIPHVSLSQYDTNRETATLIPAYLAERYQVLPIDNRSGKLVLAMGDPLNVVAIDDVKMA 120 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEI------WR 654 G +V VI +I + + + + + + A Sbjct: 121 TGMEVEPVIASPREIEGEINRHFGIQDSVEKAIEEIEGSAEEEAESEIAATEEEELSNLE 180 Query: 655 QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 P + +++ S I++ E + + +GV+ + Sbjct: 181 TNAPVVKVVNSLVSQAYEQGASDIHI-----EPTKQGMQIRYRIDGVLHNVA----TPPR 231 Query: 715 RELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + + AG++ + +G Sbjct: 232 YAKDLLISRVKIMAGMDIT---KKRIPQDGR 259 >UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EEN3_9RHOB Length = 1088 Score = 234 bits (596), Expect = 1e-59, Method: Composition-based stats. Identities = 74/449 (16%), Positives = 138/449 (30%), Gaps = 49/449 (10%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 LAV S L F V + + + ++ R + E + I++PA+NE Sbjct: 671 FLAVGNSWSLLQIAFWTV-FAIGICRSISLLFWAARRRRHAPPLSKHEPSVTIVIPAYNE 729 Query: 81 TGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSK 140 VI A + +Y ++ I V + DT A P V + K Sbjct: 730 ARVIEKCIRKALYS-EYGDFDIIVVDDGSTDDTYEKAISF-AYHPLVTVLRQP---NRGK 784 Query: 141 ADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFE 200 A LN LD I DA+ I+P + L + + + Sbjct: 785 AAALNAALDEA---------QSEILICIDADSQIAPDAVSLLAAHFKDPKVGAVAGRVVV 835 Query: 201 REWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFD 260 + + E+ + +E L G + A+ F Sbjct: 836 GNRDNLLTRLQALEYITAQAVERRAKEYLNAITVVPGAIGAWRTTALME------AGIFS 889 Query: 261 VQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTF 320 ++LTED D+ + I+ V P + Sbjct: 890 TETLTEDADMTMAMIRSDYQVIYEDRAVATT-----------------------ETPRSL 926 Query: 321 STAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLL 380 S + Q+ RW +G++ G+K T + + +A L + L+LL Sbjct: 927 SALMTQRLRWSLGMMQAGWKHLGATVERRNLGLVALPDLVVFGYLMPLIAPLADLFLVLL 986 Query: 381 LA--YESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSV 438 + + ++ F S+ + + L + I R+ LL Sbjct: 987 VIEFFTNIGATDQDFASVITNPLIIAYLALPALEIASAVIAFRL---DPTEDRRLLLLLP 1043 Query: 439 LRLFWGNLINFMANWRALKQVLQHGDPRR 467 ++ + + +++ RAL + + Sbjct: 1044 VQRIFYRQVLYVSVIRALWRAVTGSLTNW 1072 >UniRef50_B1YJT8 Type II secretion system protein E n=5 Tax=Bacillales RepID=B1YJT8_EXIS2 Length = 554 Score = 234 bits (596), Expect = 1e-59, Method: Composition-based stats. Identities = 44/262 (16%), Positives = 103/262 (39%), Gaps = 18/262 (6%) Query: 485 RSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 + LG++LLE V+TE Q++ AL + +LG ++L G ++ +QL +AL Q + Sbjct: 3 MKRKRLGEMLLEESVVTEAQIEEALSVKRTSEKLGDTLLRLGHLTEQQLIEALHHQLKIP 62 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 + + + ++ + +A + ++P+ E + L + D +D +++ L + G Sbjct: 63 VIQLYNYPVDVAVTKLISKELAQRHTLVPVYREGNRLFIAMADPMDLIAIDDLRLQTGLM 122 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFA 664 + + R +I + +Y ++ + R+ P L Sbjct: 123 IEVGLATRDEIRRTILKYYDIDSSLRELLESDEMTI----SDTSRDTVTREDAPIIRLVN 178 Query: 665 EILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM-QS 723 +IL S I++ + L + +G + E +++Q + Sbjct: 179 QILENGISQRASDIHM-----DPQETSLSIRIRIDGELRTEN-----NYPKQIQNILTTR 228 Query: 724 LLLKAGLNTEQVAQLESENEGE 745 + + + L+ + +G Sbjct: 229 IKVMSELDITE---SRLPQDGR 247 >UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9AE19_METSM Length = 461 Score = 233 bits (595), Expect = 2e-59, Method: Composition-based stats. Identities = 67/471 (14%), Positives = 150/471 (31%), Gaps = 60/471 (12%) Query: 16 KVIAITLAVIMFISGLDDFFIDVVYWVRRI--KRKLSVYRRYPRMSYRELYKPDEKPLAI 73 + + + ISG+ + + W+ I L + + + + +++ Sbjct: 37 SIFLLVTCSALIISGILARNVTWLQWLLLIPTLTMLFLAIISTKKQEKPIPYEKPPFVSL 96 Query: 74 MVPAWNETGVIGNMAELAATTLDY-----ENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 ++PA NE I + + +DY N+ + V + T + E+ P + Sbjct: 97 IIPAHNEEYTIAQTV-TSISKIDYTLNGKPNFELIVVNDGSTDSTGEKLSELKKDIPILR 155 Query: 129 KVVC-ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV- 186 V K LN+ L + DA+ + L + + Sbjct: 156 IVTRKPPKSGKGKGFVLNDALSL---------SKGEIIGVFDADTQVEKDFLNIVMPYLN 206 Query: 187 -ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 + +Q V + ++ +M ++ EF E G + ++ L G G ++ Sbjct: 207 NPKVQGVQTRVKMYNKDENFLANMQHV-EF-ESFGNTLIAKDNLGKSGFLGGNGQFVKKQ 264 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 A+ DG +D ++TED ++ ++ KG + Sbjct: 265 AIL------DGEKWDGFAVTEDLNLSVKILLKGGQIRYCGET------------------ 300 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNF 365 V + + + +Q++RW IG F L + G I + Sbjct: 301 -----AVYQEAVTDWKSFFKQRTRWAIGNFETIFIYLPKILKSPLP---LIKKYGIIEHI 352 Query: 366 VSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 + L + ++ ++ +H ++I A L+ + + + + Sbjct: 353 SFYAFNLFIFFGFIVSMLNAISWFIFHGVTIIRMDAPLIVGIISTIAFIPGISI--ALLR 410 Query: 426 TGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 L + + + + K +++ + WDKT H Sbjct: 411 EKAGPLKFIKDIIGYWIYCFHLIPLFFETMFKMIIR----KERKWDKTKHK 457 >UniRef50_D2R471 Type II secretion system protein E n=6 Tax=Planctomycetaceae RepID=D2R471_9PLAN Length = 573 Score = 233 bits (595), Expect = 2e-59, Method: Composition-based stats. Identities = 50/270 (18%), Positives = 105/270 (38%), Gaps = 20/270 (7%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 +ILL +V++++QL+ + + L ++ G S E + +A+A+++G Sbjct: 1 MARKGDFTEILLRRRVVSQDQLNEGRQVAKDTNANLSDVLIRLGYASGEDVMRAVAQEHG 60 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + + IP +I +P SVA A+LPL + D L V D D ++ L + Sbjct: 61 REYVDLSEVTIPEDVIELVPESVARENAILPLSEDEDSLKVIVSDPYDIDTIEKLRFILN 120 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYN-------AVQHQWLTEQQAGEIWRQ 655 RKV + R +I+ + +Y++ G ++L + + + Sbjct: 121 RKVDIALAPREKILEAINKYYSQIEGESADSVLQEFTDTAIDFTETEATKVTSNEAVDEN 180 Query: 656 YVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQR 715 P L ++ + S I+V E + +G++ + R +R Sbjct: 181 SAPIVRLVQLMIGEAVQLRASDIHV-----EPFEEIVRIRYRIDGILHK----RDSPPRR 231 Query: 716 ELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 L + + + A ++ + +G Sbjct: 232 LLAAIVSRIKILAKMDIAE---RRRPQDGR 258 >UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter gallaeciensis RepID=A9EU26_9RHOB Length = 1136 Score = 233 bits (594), Expect = 2e-59, Method: Composition-based stats. Identities = 72/466 (15%), Positives = 139/466 (29%), Gaps = 62/466 (13%) Query: 9 ATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDE 68 ATW+ + AV++ + F+ + RR R R Sbjct: 712 ATWVA----FLVPFAVLLGVIRALVLFVMAI-----------RSRRADRADLRHRTGDFS 756 Query: 69 KPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH 128 P+ +++PA+NE I + DY N + V + DT V + PNV Sbjct: 757 APVTVVIPAYNEEKSILKTI-YSVLESDYPNLSVLVVDDGSTDDTHGLVTKTYKDNPNVQ 815 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY--LV 186 + K N +T + DA+ +++P +R Sbjct: 816 ILRQPNG---GKWKAANLAFSHVTT---------DYVVAIDADTIVAPDAIRRLMQPLRN 863 Query: 187 ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 + + + + + E++ D E + + G + A Sbjct: 864 PQVGAVAGKIMVGN--SNNLLTKLEKLEYTVAQNIDRRAYETINAIMVVPGAFGAWRTEA 921 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 V + Q+L ED D+ L E+G Sbjct: 922 VRK------CGYYSSQTLAEDTDLTISLLEQGYEVRAAERAYAYT--------------- 960 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV 366 P + T ++Q+ RW IGI+ +K T L I + Sbjct: 961 --------EAPASVGTLMKQRMRWSIGILQSAWKHRS-TIRKGHAVGLVGLTDLVIFGVI 1011 Query: 367 SFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVT 426 L ++ LL+L+ + F+ + + ++L L+ + + Sbjct: 1012 MPLLGPIIDLLLVLMLVRFVSGFDGTTFDAFTVRDYAVLSIFLALPLLEMIMADYAVRSE 1071 Query: 427 GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDK 472 + LL +L + + +R+L ++L + Sbjct: 1072 PTMPRSMVLLLLLNRLIYRQLLIINVYRSLWRILTGRLTGWHKLRR 1117 >UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038DF5C Length = 405 Score = 233 bits (594), Expect = 2e-59, Method: Composition-based stats. Identities = 77/462 (16%), Positives = 145/462 (31%), Gaps = 66/462 (14%) Query: 14 GLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAI 73 L+ I + L I + + F I +Y+ + K + + K + ++I Sbjct: 3 ILQGIGLALIAIGLVYVIYQFPI--IYFGYKDFTKYDIDFSKLNEAQFSGLKMYKPMVSI 60 Query: 74 MVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 +VPA NE VI E + Y N+ +FV + +T + E +R V+ Sbjct: 61 IVPAKNEETVIKRTIE-SILNQTYTNFELFVVVDNSSDNTYKIAKEYESRDKRVNVFNRP 119 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--LFNYLVERKDL 191 SKA LN + +DA+ ++ P L ++ D+ Sbjct: 120 DG--KSKASALNFCFE---------KTKGEVIATYDADTMLLPNTLENAVYGMNYFNVDV 168 Query: 192 IQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALL 251 +Q RE FT + IDE + R VP AG F R+ + ++ Sbjct: 169 LQGYNSYINREENIFTRLAVIDEILV--KATLIGRTHFNLFVPVAGSNQYFKRKVIESI- 225 Query: 252 ADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMIC 311 +D LTED + R+ ++ Sbjct: 226 -----GGWDDNFLTEDLESSIRISNARYKSAYLGSAKAL--------------------- 259 Query: 312 VREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAM 371 + P ++S RQ++RW+ G F + K S D + + Sbjct: 260 --QETPASYSEYFRQRTRWLRGYHQVFFHSKKRFSKF-------TDFDALMIVLAPTFSG 310 Query: 372 LVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGL 431 ++ L + P + F + ++++ ++V Sbjct: 311 ILFFGWLYISLLNFYNPFVHSMRTYFISLILISLIIYVVALVLVLI------------KK 358 Query: 432 TQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 Q + + ++ +N + L + KT Sbjct: 359 RQNFIYIPLIYIYLTLNSLIAIYTLFLEITGAKRVWHKVKKT 400 >UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3 Tax=Streptomyces RepID=UPI0001B4D705 Length = 700 Score = 233 bits (594), Expect = 2e-59, Method: Composition-based stats. Identities = 63/466 (13%), Positives = 120/466 (25%), Gaps = 62/466 (13%) Query: 17 VIAITLAVIMFISGLDDFF--IDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 +A ++ I+G + +V++ R R+L+ +R + P+ ++ Sbjct: 286 TFTNAMAWVLAIAGALGLLRLVTLVFFARAHVRRLTRFR-----PGSPWLREVNDPVTVI 340 Query: 75 VPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 VPA+NE I + + + I V + T P + + Sbjct: 341 VPAYNEEAGIEATVRSLLAS-THPHLQIIVVDDGSTDRTADLAT--WIDDPRISVIRQIN 397 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLVERKDLI 192 KA LN L ++ DA+ V + + Sbjct: 398 S---GKATALNTGLAHAAH---------DIVVMVDADTVFEADAVHQLIQPLAHPAIGAV 445 Query: 193 QIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLA 252 R E+ D + E L G F R A+ + Sbjct: 446 SGNTKVGNRRS--LLGRWQHLEYVFGFNLDRRMFEVLECMPTVPGAIGAFRRDALMGV-- 501 Query: 253 DGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICV 312 +L ED D+ L G ++ V Sbjct: 502 ----GGVSEDTLAEDTDLTMSLWRAGWRVVYEETAVAWT--------------------- 536 Query: 313 REYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 P + RQ+ RW G + +K + L R + F L ++ Sbjct: 537 --EVPTSLRQLWRQRYRWCFGTLQSMWKHRRAAVELGPAGRFGRRGLSYLVLFQVLLPLI 594 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 + FLS + L + + + + + Sbjct: 595 AP-------IVDLFALYGALFLSPVQSAGIWCAFLAVQLICAGYALRLDGERMRSLWAMP 647 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 L +L + +I + ++ H R + Sbjct: 648 FQLFVYRQLMYLVVIQSVVAALLGARLTWHRMHRSGTAAQQLRSHD 693 >UniRef50_Q1GE89 Glycosyl transferase family 2 n=3 Tax=Rhodobacteraceae RepID=Q1GE89_SILST Length = 509 Score = 233 bits (594), Expect = 2e-59, Method: Composition-based stats. Identities = 72/489 (14%), Positives = 144/489 (29%), Gaps = 55/489 (11%) Query: 2 DWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 D+ + G + ++ ++ + V + R + + Sbjct: 70 DYRAARAPAFATGTLLCLFSILAPHLVTAVLAVASLVTLLMFTALRISGLLAAARPDQPK 129 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEV 120 D ++++VP + E + ++ T + + + ND T+ V + Sbjct: 130 SETPKDLPQMSMLVPLYREAEIGKHLLRRLCRLTYPRDRLEVLLVLEENDDVTRNAV-KC 188 Query: 121 CARFPNVHKVVCA-RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL 179 V T+K +N L+ + DAED +P +L Sbjct: 189 ADLPDWFRVVEVPGDGTLTTKPRAMNYALNFC---------RGEIIGIWDAEDAPAPDQL 239 Query: 180 R----LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPS 235 F + Q + + S + E++ + L +P Sbjct: 240 ESAASAFAHAPPDVVCFQGILDFYNPSRN-LISRCFTLEYAGWFRVLLQGIARLGLVIPL 298 Query: 236 AGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER 295 G R A+ L A+D ++TED D+G R+ + Sbjct: 299 GGTTLFIRRDALEQL------GAWDAHNVTEDADLGVRIARACYRTEMLPTT-------- 344 Query: 296 EQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW 355 E + ++Q+SRW+ G + + +L L W Sbjct: 345 ----------------TYEEANSRITPWIKQRSRWLKGFMMTYLVHMRAPKAL-LRDVGW 387 Query: 356 RDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV 415 R G + F+ L ++ +L +L S+ + + L F ++ Sbjct: 388 RRFWGLQAFFLGTLGQFLLAPVLWSFWLVALGVSHPLEASL-PRDMLSVAVGALVFFEVL 446 Query: 416 NRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 N + G + ++ A ++AL +V WDKT H Sbjct: 447 NLCIWYCGA--RASGRPVLAFCAPLMPLYFILGCFAAYKALWEVFA----APFFWDKTAH 500 Query: 476 DFPSVTGDT 484 T + Sbjct: 501 GDHGGTTEH 509 >UniRef50_A3JQ28 Glycosyl transferase, family 2 n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JQ28_9RHOB Length = 637 Score = 233 bits (594), Expect = 2e-59, Method: Composition-based stats. Identities = 72/485 (14%), Positives = 154/485 (31%), Gaps = 71/485 (14%) Query: 11 WLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIK--RKLSVYRRYPRMSYRELYKPDE 68 + IA+ ++ + F+ ++ ++ R + S + P E Sbjct: 200 IIAVFTAIALAPTLLFTVLFCLASFVLLMNTGFKLWVTAAFMRGRELAKTSTSAIISPPE 259 Query: 69 ----KPLAIMVPAWNETGVIGNM-AELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 ++I++P ++ET + + +A I D T+ + A+ Sbjct: 260 NMRLPTVSILIPLFHETDIAERLVIRMAKIRYPPALLDIMFLVEEADHATKLAL--CQAK 317 Query: 124 FP-NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME-LRL 181 P N+ + +K +N L + ++DAED + L++ Sbjct: 318 VPQNMRIITVPDGPIRTKPRAMNYALPLC---------RGSIIGIYDAEDAPESDQILKI 368 Query: 182 FNYL---VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGV 238 + +Q + + + + E++ +P + L +P G Sbjct: 369 VAKFQTSSPKVACLQARLDFYNTSRNWL-ARCFTVEYATWFCVILPGLQCLKMPIPLGGT 427 Query: 239 GTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQR 298 F R + + A+D ++TED D+G RL G V +EA R Sbjct: 428 SVFFRRNVLEKV------GAWDAHNVTEDADLGMRLARNGFKTELVNSTTYEEANCRPWP 481 Query: 299 KFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD- 357 ++Q+SRW+ G + + L RD Sbjct: 482 ------------------------WIKQRSRWLKGYG------LTYFVMMRKPLQLIRDV 511 Query: 358 ----RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGL 413 G F+ L ++ +LL + ++ + W + L++ + Sbjct: 512 GFINFCGVQILFLGTLTGFILAPVLLSFWFLTMGLPNPSAGMLPQWLLWALLFLFIMAEI 571 Query: 414 MVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 + + I + G V + + + + ++A+ ++ WDKT Sbjct: 572 VTISVGLLSISIRG--NAKGLGKWVPTMHFYFPMASIGAFKAIYEI----ATAPFYWDKT 625 Query: 474 THDFP 478 H Sbjct: 626 QHGAF 630 Score = 57.7 bits (138), Expect = 1e-06, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 42/127 (33%), Gaps = 1/127 (0%) Query: 494 LLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 L++ ++ AL + + L ++ G + + + + LA+ + ++ Sbjct: 36 LVKANLLCAADAKRALAMSAIHDASLPDILIFNGFCAGKDVYKNLAKIWNAPFLEKGQFE 95 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLR 612 + ++ L LP+ ++ L T + + +V Sbjct: 96 SDVETLRKIGVEFCLRNKCLPIIDKDGNQFFALSQPDKFDDLVQFTPFKQKTKKMAVVEE 155 Query: 613 GQIVTGL 619 +I+ + Sbjct: 156 NEIIDTV 162 >UniRef50_C8W5J2 Type II secretion system protein E n=2 Tax=Clostridiales RepID=C8W5J2_DESAS Length = 561 Score = 233 bits (593), Expect = 3e-59, Method: Composition-based stats. Identities = 45/263 (17%), Positives = 105/263 (39%), Gaps = 23/263 (8%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE----GLRLGGSMLMQGLISAEQLAQALAEQNGV 543 LG IL++ +IT+EQL+ AL+N+ E +G +++ G + + +A+ +AE++G+ Sbjct: 8 NFLGTILVQKGIITQEQLEDALKNQSEMKGKKGLIGKTLVRLGYCTEDDIARVIAERSGI 67 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + S++ +QI + + + Y LP+ +D+L+V D +S+ L G Sbjct: 68 PYISLETYQIDPAAVTVLSIDNINRYKALPVSFADDKLVVAMNHPNDIMSIDDLRMLTGY 127 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 ++ V+ ++ + + + + + + P L Sbjct: 128 DIKPVMTSDTELEATIEKY-----SRESLDVEQEDDDVDAYNDLANESVDDADRPAIQLA 182 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ- 722 IL+ S I++ E +GV+ ++ + R++ ++ Sbjct: 183 NMILSQALSARASDIHI-----EPYEKNSRVRFRIDGVLH-----DIMQVPRKMHATLTS 232 Query: 723 SLLLKAGLNTEQVAQLESENEGE 745 + + A ++ +G Sbjct: 233 RIKVMANMDIA---DRRVPQDGR 252 >UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide deacetylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ACV5_GEMAT Length = 1123 Score = 233 bits (593), Expect = 3e-59, Method: Composition-based stats. Identities = 70/444 (15%), Positives = 128/444 (28%), Gaps = 48/444 (10%) Query: 42 VRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYH 101 + I +V R R + R ++++VPA+NE VIG + + + Y + Sbjct: 724 LLLIGTLATVQRFSKRYARRAADAEWLPRVSVLVPAYNEGRVIGRTVQ-SVLSQAYPDLE 782 Query: 102 IFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFA 161 + V + DT +H V KA LN + T Sbjct: 783 VVVVDDGSSDDTHDAA-SHATDDARLHVVRQTNA---GKAAALNTGIAMAT--------- 829 Query: 162 FAGFILHDAEDVISPMELRLFNY--LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELH 219 ++ DA+ +++P +R R + R + + E+ Sbjct: 830 GEVIVVIDADTILAPDAIRHLVRPLADARVGAVAGNAKVGNRI--NLLTRWQAVEYVTSQ 887 Query: 220 GKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGM 279 D L G + R AV D F +L ED D+ L G Sbjct: 888 NLDRRAFVMLNCITVVPGAIGAWRRSAVL------DAGGFRTDTLAEDQDLTLTLLRGGH 941 Query: 280 TEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGF 339 V P+TF ++Q+ RW G + + Sbjct: 942 KVALAEQAVALT-----------------------EAPETFGALLKQRFRWSFGTLQCAW 978 Query: 340 KTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSG 399 K + F + + LL + L +A + Sbjct: 979 KHRGALMRRDAGALGMVGLPNIWLFQLLFPLLAPAADVALLASLARLLLEAPALGVHAAW 1038 Query: 400 SAWLMTLLWLNFGLMV-NRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQ 458 + L L++ + L+Q LL L+ + ++A +A++ Sbjct: 1039 AHAEPVFLLYALFLLIDTFTAVLGVAFEKGEPLSQALLVPLQRVAYRQVLYVALLKAMRA 1098 Query: 459 VLQHGDPRRVAWDKTTHDFPSVTG 482 ++ P ++T Sbjct: 1099 AVKGWAPGWGKLERTGRVQALPQK 1122 >UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase n=76 Tax=Firmicutes RepID=B7HFD6_BACC4 Length = 433 Score = 233 bits (593), Expect = 3e-59, Method: Composition-based stats. Identities = 73/471 (15%), Positives = 144/471 (30%), Gaps = 74/471 (15%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 W+ F+ ++ + I A + + F+ + + R ++ ++ Sbjct: 19 WISITFS-----IEYVLIFTAFLFSGLLVYYSFLTIAGLIHRNSKR------------KD 61 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE-NYHIFVGTYPNDPDTQRDVDEVC 121 + I +PA NE VI + E A ++Y I++ + +T D+ Sbjct: 62 RTLEHYPSVDIFIPAHNEGIVIKDTLE-AMAKIEYPGKLTIYLLNDNSQDETPEIGDDFD 120 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL 181 + ++ + P K+ LN L F ++DA++ P LR+ Sbjct: 121 KAYAHICHIRVPPGEPKGKSRVLNYGLSI---------SDGEYFCVYDADNQPEPHALRM 171 Query: 182 FNYL----VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAG 237 + + V T M EF R L G Sbjct: 172 LVEHAETTEDAVGAV-GHVRTVNENRNWLTRMI-SLEFQIFQLLMQSGRWLLFQTGSLTG 229 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 R A+ L +D ++ ED ++ R+ +KG V V Sbjct: 230 TNMLLRRSALEEL------GGYDPYAIAEDAELTLRITQKGYLLPIVPESV--------- 274 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 E P+ ++Q++RW+ G ++ K S + Sbjct: 275 --------------TWEQEPEHLKILIKQRTRWLQGNLYILEKMFSSLS--------FFK 312 Query: 358 RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNR 417 K + + L +V L L ++W + + +W + Sbjct: 313 GKLLVHSLQQVLVYVVFW---LFLIISNVWFVIGLLGIFQIQYSIPLLFMWYVAYITYVS 369 Query: 418 IVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRV 468 + V + T +SV+ F + R+L L+ ++V Sbjct: 370 QLFSAQSVERTFTPTNIFISVIMYFTYAQLFTYLFIRSLILYLRAKSKKQV 420 >UniRef50_A5CDZ1 Putative glycosyl transferase, group 2 n=2 Tax=Orientia tsutsugamushi RepID=A5CDZ1_ORITB Length = 583 Score = 232 bits (592), Expect = 4e-59, Method: Composition-based stats. Identities = 75/490 (15%), Positives = 151/490 (30%), Gaps = 75/490 (15%) Query: 10 TWLYGLKVIAITLAVIMFISGLDDFFID-VVYWVRRIKRKL----SVYRRYPRMSYRELY 64 ++ I +TL+ S FI ++Y+ + + L ++ R +S Sbjct: 153 NYVLLTGKIILTLSFFHQFSREGFLFIMHLLYFAHGVLKLLLFKVAMLRYQLHLSSLYYS 212 Query: 65 KPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 I+VP ++E + ++ + + + +D T ++ Sbjct: 213 VELFPFYTILVPLYHEVEKLRDIVKAIELLNYPKNRIEVKIIIEEDDVYTMLELT-TMHL 271 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN 183 H + P +K LN ++ I ++DAED P +L Sbjct: 272 AHYFHVIKVPFSFPQTKPKALNYAMNYI---------VGEYITVYDAEDSPEPDQLLKVI 322 Query: 184 YLV----ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVG 239 Y IQ + + + T + I E+ + L V G Sbjct: 323 YHFQNLQPDYQCIQARINFYNKNENVLTKLMSI-EYCLWFDFFLYGLTCLGLPVTLGGTS 381 Query: 240 TCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRK 299 + + +L +D ++TED D+G R+ G + Sbjct: 382 NHCRAKMLKSL------GYWDAYNVTEDADLGLRIYIAGFKTAVIDS------------- 422 Query: 300 FLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRK 359 + Q+SRWI G + F + ++ Sbjct: 423 -----------YTYGEAVIDCKGWLHQRSRWIKGFIQTSFVFMSYNKNIRNR-------- 463 Query: 360 GAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN-FGLMVNRI 418 + A + + +L L+ W I L T+LW N + Sbjct: 464 ------LGLCANICICLFILFSPLMFLFIPLWLISGIIDNDCTLGTILWYNMLFALAYMH 517 Query: 419 VQRVIFVTGYYGLT-----QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 V I + G Q LL + ++++ +A+++A+ ++ + W+KT Sbjct: 518 VMSWIALCRIKGHWSNLTLQDLLCFIIWPLYSMLHVIASYKAIFELCV----KPFKWNKT 573 Query: 474 THDFPSVTGD 483 H + + Sbjct: 574 KHGVSRININ 583 >UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax=Bacillus RepID=YDAM_BACSU Length = 420 Score = 231 bits (590), Expect = 6e-59, Method: Composition-based stats. Identities = 62/458 (13%), Positives = 133/458 (29%), Gaps = 57/458 (12%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR-ELYKPDEKPLAIMVPAWNETGVI 84 +F L ++ ++Y + ++ Y + R + + +++++PA NE VI Sbjct: 5 LFFISLSLIWVMLLYHMFLMQGGFRHYMTFERNIPKWRENMKELPKVSVLIPAHNEEVVI 64 Query: 85 GNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVH-KVVCARPGPTSKAD 142 + + I V + T V+E ++ + + K+ Sbjct: 65 RQTLKAMVNLYYPKDRLEIIVVNDNSSDRTGDIVNEFSEKYDFIKMVITKPPNAGKGKSS 124 Query: 143 CLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN---YLVERKDLIQIPVYPF 199 LN+ A ++DA++ M + E+ + Sbjct: 125 ALNSGF---------AESNGDVICVYDADNTPEKMAVYYLVLGLMNDEKAGAVVGKFRVI 175 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 T T I E R G R + L + Sbjct: 176 NAAKTLLTRFINI-ETICFQWMAQGGRWKWFKIATIPGTNFAIRRSIIEKL------GGW 228 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDT 319 D ++L ED ++ R+ G F + E P+T Sbjct: 229 DDKALAEDTELTIRVYNLGYHIRFFPAAI-----------------------TWEQEPET 265 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLL 379 + RQ++RW G + K L ++ + F+ F +++ + + Sbjct: 266 WKVWWRQRTRWARGNQYVVLKFLAQFFKLKRKRIIFDLFYFFFTYFLFFFGVIMSNAIFV 325 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 + + L +L +LW+ + V + + Q V Sbjct: 326 VNLFYDLHLSV----------GFLAMILWILAFFLFMTEVMITLSIEKTEMNKQNFFIVF 375 Query: 440 RLF--WGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 ++ + + + ++ + V W KT Sbjct: 376 LMYFTYSQAWIVLVIYSLFVEIKHRLFKQEVKWYKTER 413 >UniRef50_Q1RIH7 Glycosyltransferase n=11 Tax=Rickettsia RepID=Q1RIH7_RICBR Length = 573 Score = 231 bits (589), Expect = 8e-59, Method: Composition-based stats. Identities = 72/488 (14%), Positives = 165/488 (33%), Gaps = 77/488 (15%) Query: 8 FATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPD 67 F +L L + + + IS + + +V+ I+ + + R++ EL Sbjct: 152 FVVFLVILTYVPVLFHIANNISYFVQNVLKSLLFVKAIRDYKPLEVKQARINVEEL---- 207 Query: 68 EKPLAIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPN 126 I+VP + E + ++ + + + + +D +++ + Sbjct: 208 -PIYTILVPLYKELSKLRSIIKNISLINYPDSKLDVKIIIEDDDYLMIKEI-ALYNLPAY 265 Query: 127 VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL----RLF 182 H ++ + P +K LN L+ +++DAED P +L +F Sbjct: 266 FHVILVPQSSPRTKPKALNYALEY---------SRGEYVVVYDAEDKPEPDQLLKALAMF 316 Query: 183 NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 L +Q + + + T M + E+S + L P G F Sbjct: 317 KSLAPDFICLQAKLNFYNKNENVLTKM-FNLEYSLWFEYILKGLSLLKLPTPLGGTSNHF 375 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 + L +D ++TED +IG R+ + + Sbjct: 376 KADILRKL------GGWDAHNVTEDAEIGLRIYSQNYKVTILDS---------------- 413 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI 362 E P++ + Q+SRWI G + F + +D+ + Sbjct: 414 --------YTLEEAPNSLGNWLNQRSRWIKGFLQTFFV-----------FIAQKDKYKKL 454 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI---- 418 + ++ I + + L+ + W + SI ++ +WL + Sbjct: 455 TLLQ-----IITIYIFIGLSTYNFWCLPFIIFSIIINKNPIIDYVWLVNSIFSLLYLYGT 509 Query: 419 VQRVIFVTGYYGLTQGLLSVLRLFW--GNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 V ++ + +G + + + W +++ +A+++A+ +++ W+KT H Sbjct: 510 VIYILKNSLKFGKIKFQDLIALVLWAGYFILHTIASYKAVFEII----FCPFKWNKTKHG 565 Query: 477 FPSVTGDT 484 + Sbjct: 566 VSLEDFEE 573 >UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bacteria RepID=C4I9P5_BURPS Length = 520 Score = 231 bits (589), Expect = 8e-59, Method: Composition-based stats. Identities = 80/467 (17%), Positives = 142/467 (30%), Gaps = 65/467 (13%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 ++ + A +L+ ++ + A ++ GLD F + R YR + Sbjct: 98 VISLCAAYLWVFALLTLVYASRHYVFGLDRLF------------------KPQRAPYRAI 139 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAATT-LDYENYHIFVGTYPNDPDTQRDVDEVCA 122 D + + V A NE V+ + T E I + +T+ +DEV A Sbjct: 140 THADWPEITVFVAAHNEEAVVADCLTALLATTYPRERLTIVPVNDRSTDNTRALIDEVQA 199 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-- 180 R P + K G KA L + L I ++ DA+ + P L+ Sbjct: 200 RAPELIKPFHRESGKPGKAAALKDALREI---------RGDIMVVFDADYLPRPGLLKEL 250 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 + + + V P + + E + + + R L G Sbjct: 251 VAPFFDPEVGAVMGRVVPQNADRNLLARLLD-LERAGGYQVNQQARNNLGLVPQYGGTVG 309 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 + A+ A+ + +L ED D+ +RL +++ Sbjct: 310 GVRKSALDAV------GGWRDDTLAEDTDMTYRLLLSNWRTVYLNHA------------- 350 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 E P+ + RQ +RW G F+ + R Sbjct: 351 ----------ECYEEVPERWPVRARQLTRWAKGHNQTLFRYLIPLLRSPVTPRRCRLDGA 400 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 + A+L + + L Y + D+ + S A + NFG+ +V Sbjct: 401 LLLGVFVMPALLALAWGIALALYLTNGIDSLVLGLLVSVFALFAFSTFGNFGVFFEIVVA 460 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR 467 G L V G + A AL + RR Sbjct: 461 -----ARLDGRATRLRLVPVNVVGFCVTIAAVVAALWGLALDALLRR 502 >UniRef50_C1AA14 Type IV pilus assembly protein PilB n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AA14_GEMAT Length = 634 Score = 231 bits (588), Expect = 9e-59, Method: Composition-based stats. Identities = 48/280 (17%), Positives = 96/280 (34%), Gaps = 25/280 (8%) Query: 478 PSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQ 535 P+ +RS LG +L+ +++ E L AL+ + G RLG +++ G++ ++ + Sbjct: 25 PAAPLASRSTDRLGDLLVREGLLSRENLTKALQEQSAYPGQRLGLTVVRLGMVPETEVVR 84 Query: 536 ALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA 595 LA Q + + +++ + L+ +PA +A + VLPL+ + +L V D + Sbjct: 85 MLARQYRMPAVDLARFEVDTRLLKLIPAELASKHTVLPLKRDGRQLTVAIADPTAMAVVD 144 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDP---------RAMLYNAVQHQWLTE 646 L + V+ + + Y H + + + Sbjct: 145 DLKFITRYDIVPVLAGEYSMRAAIEKHYEANEIHMQSLLQDIAADDDDIEVLDNQDDMVD 204 Query: 647 QQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQET 706 P L IL H S I+ E L +G Sbjct: 205 ASVLAAQVDEAPVVKLINAILGDAVHKGASDIH-----FECFEHELRVRYRIDG-----A 254 Query: 707 LDRVLTIQREL-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 L V+ ++ + + + LN + +G Sbjct: 255 LQEVMKPPMKMRAALISRFKIMSSLNIAE---RRVPQDGR 291 >UniRef50_A7IHY2 Glycosyl transferase family 2 n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7IHY2_XANP2 Length = 678 Score = 231 bits (588), Expect = 9e-59, Method: Composition-based stats. Identities = 81/486 (16%), Positives = 148/486 (30%), Gaps = 56/486 (11%) Query: 17 VIAITLAVIMFI-----SGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 V+ + LA ++ + + +V+ R + P L Sbjct: 219 VVGLPLAGLVALAPEQGVLAVQALLSLVFLGWVSLRLAACAYDAPPDPPPTLDDRQLPVY 278 Query: 72 AIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 +++VP + E + ++ A E I + +D T+ + + P + +V Sbjct: 279 SLLVPLYREAASVPHLVAALGALDYPPEKLDIKLVVEADDAGTRAAIAAL-TLPPQMEEV 337 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV---- 186 GP +K L L A + ++DAED+ P +LR Sbjct: 338 PVPAVGPRTKPKALEVALAAA---------RGSFVAIYDAEDLPEPDQLRRALEAFRTGG 388 Query: 187 ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 + +Q + + + + + ++ E++ +P+ AL + G F RR Sbjct: 389 PKIACVQARLAIDNGDDS-WIAASFAAEYAAQFDVLLPMLSALGLPILLGGTSNHFRRRV 447 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 + + +D ++TED D+G RL G + Sbjct: 448 LDEV------GGWDPFNVTEDADLGIRLARAGWQTRVISST------------------- 482 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV 366 E P T V Q++RW+ G + +L + + + Sbjct: 483 -----TYEEAPVTARAWVGQRTRWLKGWAQTLLVHLRQPGALMADLGVGPALALLLLAAG 537 Query: 367 SFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVT 426 F A LV + L L A L + Sbjct: 538 PFAAALV-HPFCVALLLADLLRGVIGLPRGSMAEALTSALTFTTLFAGYAGTAAITYVGL 596 Query: 427 GYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRS 486 GL VL + + L+ A WRAL ++L+ R W KT H + Sbjct: 597 RRRARVPGLKVVLGIPFYWLLLSAAAWRALIELLR----RPHHWQKTEHGVARHRVAQAA 652 Query: 487 LRPLGQ 492 Sbjct: 653 ALRRSP 658 >UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VIY0_POLNA Length = 476 Score = 230 bits (587), Expect = 1e-58, Method: Composition-based stats. Identities = 78/451 (17%), Positives = 130/451 (28%), Gaps = 49/451 (10%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 LA + I ++Y VR ++ R Y ++ + + + A N Sbjct: 56 FPLAATLASVLFLIVVIMMLYAVRHFIFTINRLLGEQRHPYLDIAIARWPMITVFIAAHN 115 Query: 80 ETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT 138 E VI E T + I + T +D ARFP+ G Sbjct: 116 EEKVIAGCIEALLNTDYPADQLKIIPVNDRSTDRTGAIIDRYVARFPSRISPFHRTLGKA 175 Query: 139 SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--LFNYLVERKDLIQIPV 196 K+ L + L I+ DA+ V L+ + + V Sbjct: 176 GKSAALKDALAFA---------EGDIAIIFDADYVPGRGLLKQLAAPFFDPEVGAVMGRV 226 Query: 197 YPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDG 256 P T M E S + D R + G AV A+ Sbjct: 227 VPVNSGANLLTRMLD-LERSGGYQVDQQARMNMNLLPQYGGTVGGVRLSAVEAV------ 279 Query: 257 IAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYF 316 + +L ED DI +RL G ++ SN E Sbjct: 280 GGWHDDTLAEDTDITYRLMFNGWKTVY-----------------------SNRSECYEEV 316 Query: 317 PDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQ 376 P+ + ++Q RW G + W Y R + FL L+M+ Sbjct: 317 PEEWRVRIKQVKRWAKGHNQVMARY--WWQFACSPYLTLAQRIDGLLLLFVFLIPLLMLI 374 Query: 377 LLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLL 436 L+ + A ++ NF ++ ++ G + L Sbjct: 375 GWGLVLGLYFLNAGSMLSQLIPIFALMVYGTLGNFAAFFEIVIAVLL-----DGHRKRLR 429 Query: 437 SVLRLFWGNLINFMANWRALKQVLQHGDPRR 467 + G L++ A A+ + G +R Sbjct: 430 LLPLNILGFLVSLFAISGAVVSLALDGLFKR 460 >UniRef50_A3UGE7 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UGE7_9RHOB Length = 523 Score = 230 bits (587), Expect = 1e-58, Method: Composition-based stats. Identities = 77/503 (15%), Positives = 144/503 (28%), Gaps = 59/503 (11%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAW 78 + + + L + R + + D ++++V Sbjct: 60 ILFTLLGVTALTLVYLIAALSVAGACALRIAAAIMPPRYADRTTVSDQDLPVISVIVALH 119 Query: 79 NETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP 137 +E V+ + L+ I + +D T+ A + VV GP Sbjct: 120 DEARVLPGLIAALSRLNYPRSKLDIILALEAHDQPTRAAAR-ALAGRKALRVVVLPPLGP 178 Query: 138 TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN---YLVERKDLIQI 194 +K LN L ++DAED P +LR +R +IQ Sbjct: 179 MTKPRALNVALQTA---------RGELVAVYDAEDAPHPDQLRQAAECFAADDRLGIIQA 229 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 P+ + R T+ + E++ +P+ L +P G F + A+ A Sbjct: 230 PLGWYNRTENWLTA-QFALEYATQFNALLPLLARLGWPLPLGGTSNIFRQSALVA----- 283 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 +D ++TED D+GFR+ G V E Sbjct: 284 -CGGWDPFNVTEDADLGFRMARSGWRAGLV------------------------APGTLE 318 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR--KGAISNFVSFLAML 372 P T Q+SRW+ G W + L G S + LA + Sbjct: 319 EAPITLRAWTHQRSRWLKGHFI------TWLVHMRDPRGLVDALGWGGVTSLTFTVLANM 372 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 + + L+ W + + I V + Sbjct: 373 MSALIHAPSLLMMGAGALLLGLAPGWSVLWTVGAALMTCAYASAMICAGVAARRAGFSPR 432 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQH---GDPRRVAWDKTTHDFPSVTGDTRSLRP 489 + + +W A +AL+++ + D + + + P Sbjct: 433 LSHMLSMPAYWLLQAP--AALKALRELPRQPYLWDKTQHGVSRARRETPDDASPYAPAHG 490 Query: 490 LGQILLENQVITEEQ-LDTALRN 511 + + + + Q L +A Sbjct: 491 RRSRPVRARRLAQRQALKSAQAA 513 >UniRef50_B1I3E7 Type II secretion system protein E n=4 Tax=Clostridia RepID=B1I3E7_DESAP Length = 561 Score = 230 bits (586), Expect = 2e-58, Method: Composition-based stats. Identities = 57/269 (21%), Positives = 109/269 (40%), Gaps = 23/269 (8%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALRNR--VEGLR--LGGSMLMQGLISAEQLAQAL 537 LG L++ VIT+EQL+ AL+ + +G + LG +++ G + E +AQ + Sbjct: 1 MLKPGANLLGMNLVKAGVITQEQLEEALKRQDPKKGGKGFLGATLVELGYCTEEDIAQVI 60 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 A QNGV + S++ + + + VA Y LP+ +N +L+V + D ++L L Sbjct: 61 ARQNGVPYVSLETFAADPQAVGLIAPEVARRYRALPIGFQNGKLVVAMKQPRDVIALDDL 120 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYV 657 GR+++ V++ Q ++ + + A + + E AG Sbjct: 121 RIITGREIQPVVIPDSQFDAAMQRY-----SQSGLEVELAAAEEEVAEEVVAGLDEAAQR 175 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P L I + S +++ E L +GV+ +L R L Sbjct: 176 PAVQLANAIFNQAVWASASDVHI-----EPLEKSLRVRFRIDGVLHN-----ILQPPRHL 225 Query: 718 QVS-MQSLLLKAGLNTEQVAQLESENEGE 745 S + + + A ++ + +G Sbjct: 226 HASLVSRIKVMANMDIAE---RRVPQDGR 251 >UniRef50_B8FZG2 Glycosyl transferase family 2 n=4 Tax=Clostridiales RepID=B8FZG2_DESHD Length = 425 Score = 230 bits (586), Expect = 2e-58, Method: Composition-based stats. Identities = 78/495 (15%), Positives = 140/495 (28%), Gaps = 74/495 (14%) Query: 1 MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY 60 MD++ V + L +I I + +I+ F + + RR Sbjct: 1 MDFMSGVTGSHL--FNMIMIPVQLIIIFMTFYYFVLSMFGLFRR---------------P 43 Query: 61 RELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDE 119 + EK A++V A NE VIG + + E Y +FV T Sbjct: 44 DKKVLEPEKSFALVVAAHNEEAVIGPLVDNLLNLDYPKELYDVFVVADNCTDKTALIAKN 103 Query: 120 VCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL 179 A K L + + + ER + I+ DA+++++ L Sbjct: 104 AGAL-----VHQRFNNEKRGKGYALEWMFHRLFKLER----HYDAVIIFDADNLVNETFL 154 Query: 180 RLFN-YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALA-GQVPSAG 237 N L + ++Q + + + + T+ F + R G Sbjct: 155 VEMNSKLCQGHQIVQCYLDSKNP-YDTWVTNTFSITFWLSNRLLQLARYNTGFLNNVLGG 213 Query: 238 VGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQ 297 G C S + + L + SLTED + + G+ + +V + K Sbjct: 214 TGMCISTKVLKDL-------GWGATSLTEDLEFTMKALISGIKTTWAHDAIVYDEK---- 262 Query: 298 RKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD 357 P TF A Q+ RW G V + Sbjct: 263 -------------------PLTFIQAWNQRKRWAQGQVDVAGRYFFPLIYKAFKERKLMY 303 Query: 358 RKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNR 417 A+ F L M+ + + L F + S W + Sbjct: 304 FDAAVHLFQPALVMIATFFMFVNLISGLQSSYTQVFNVVMPWSGWQI------LSAFSLV 357 Query: 418 IVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDF 477 + + L + +F + I + L + + +W T H Sbjct: 358 FPVAALALERLPWRAYAGLILYPVFIYSWIPIV--------FLGFVNRKDKSWSHTKHTR 409 Query: 478 PSVTGDTRSLRPLGQ 492 D + + Sbjct: 410 SIKYDDVVKEKKVSS 424 >UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8E0Z1_DICTD Length = 399 Score = 229 bits (585), Expect = 2e-58, Method: Composition-based stats. Identities = 67/430 (15%), Positives = 135/430 (31%), Gaps = 51/430 (11%) Query: 49 LSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVG--T 106 L+ R+ Y+++ D ++++VP NE V N A DY I + Sbjct: 4 LNRAFGEQRLGYQDIIDSDLPYVSVLVPMHNEEKVAEN-VLNALLNTDYPKDRIEIIPID 62 Query: 107 YPNDPDTQRDVDEVCARFPNVHK-VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGF 165 + T+ +++ +++P++ K + P K LN+ L Sbjct: 63 DNSTDRTREILEDYSSKYPHLIKPLYRGSYLPRGKPSALNDALKVA---------EGEII 113 Query: 166 ILHDAEDVISPMELR--LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDV 223 I+ DA+ + +R ++L ++ V P T + + E + D Sbjct: 114 IVFDADYIPPKGIIRDLAVSFLDPEVGVVMGRVVPLNISKNLLTRL-FDLERIGGYQVDQ 172 Query: 224 PVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIF 283 R L G F + + L F+ + L ED ++ + G+ + Sbjct: 173 QARYNLKLIPQFGGTVGGFRKELILKL------GGFNPKILAEDTELTIKAYINGVKVCY 226 Query: 284 VRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHK 343 E P+T+ +Q RW G F+ Sbjct: 227 TNRA-----------------------ECYEEAPETWEVRAKQIRRWSRGHNQVMFRYL- 262 Query: 344 WTSSLTLNYFLWRDR-KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAW 402 + Y R++ G V ++ L +I L+ + L S F + Sbjct: 263 -LPLIKSPYLSLREKVDGVFLLCVYLISPLFLIGLVDSIVLFFLGEMQILGSSFFI---F 318 Query: 403 LMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQH 462 L I + Y + L + F+ + + + ++ ++ Sbjct: 319 LSAYNTFGNFAPFYEIGLGALLDGATYRIFLLPLPLFNFFFNMWYSSLGFFDSVLDLITR 378 Query: 463 GDPRRVAWDK 472 DP ++ Sbjct: 379 RDPVWHKTER 388 >UniRef50_B8E2U7 Type II secretion system protein E n=2 Tax=Dictyoglomus RepID=B8E2U7_DICTD Length = 561 Score = 229 bits (585), Expect = 2e-58, Method: Composition-based stats. Identities = 51/264 (19%), Positives = 105/264 (39%), Gaps = 16/264 (6%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 + +PLG+ LLE +IT+EQL+ AL + G +LG ++ +G + E + + L Q+ Sbjct: 1 MKEKKPLGEYLLEQGLITKEQLEKALEEQKKTGAKLGQILIERGYVKPEDIGKVLERQSE 60 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + + S+ QI L ++ Y +P++ E L V IDP + + R V Sbjct: 61 IPYISLTEVQIDEKLAGSFSENLLRRYKFIPIKREAGVLHVAVVPPIDPAIINEIRRIVK 120 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFL 662 +R I + + + + + + + E I + P L Sbjct: 121 SPIRIFITTDKEFNQIISRLFPLEKTTLSVVQDFQRTAPEPMVETLPA-IGVEEAPIVRL 179 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ-VSM 721 + I+ + N S I++ + + G+ L ++ I +E+Q + Sbjct: 180 VSSIINEAINRNASDIHL-----DPQEKEMKVRYRIHGI-----LYDIMAIPKEIQDAVV 229 Query: 722 QSLLLKAGLNTEQVAQLESENEGE 745 + + +G++ + +G Sbjct: 230 TRIKVISGMDIAE---KRRPQDGR 250 >UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA n=1 Tax=Lysinibacillus sphaericus C3-41 RepID=B1HSU1_LYSSC Length = 403 Score = 229 bits (584), Expect = 3e-58, Method: Composition-based stats. Identities = 59/416 (14%), Positives = 124/416 (29%), Gaps = 56/416 (13%) Query: 67 DEKPLAIMVPAWNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP 125 D +++ +PA NE VI + + + V + +T + +FP Sbjct: 27 DVPSVSVFIPAHNEALVIEQTLRAMSRLYYPKDKLEVIVINDNSSDETGNIALQYAEKFP 86 Query: 126 NVHKVV-CARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME---LRL 181 + + K+ LN+ L A+ +++DA++ M L + Sbjct: 87 FMRVIETVEPNKGKGKSSALNSAL---------ADSTGDIVVVYDADNTPERMAVWYLVM 137 Query: 182 FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 + T T I E R G Sbjct: 138 GLMNDPKAAATVGKFRVINAAETWLTRFINI-ETICFQWMAQGGRWKWFKVATIPGTNFA 196 Query: 242 FSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFL 301 R + L +DV++L ED ++ R+ G F + Sbjct: 197 IRRTVLEQL------GGWDVKALAEDTELTIRVYNLGYHIRFFPKAI------------- 237 Query: 302 QHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA 361 E P+T +Q++RW G + K +L ++ Sbjct: 238 ----------TWEQEPETLKVWWKQRTRWARGNQYVVLKFLSQFFTLKRKSIIFDLFYFF 287 Query: 362 ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 + F+ F +++ L ++ + + + +LW+ L+ V Sbjct: 288 FTYFLFFFGVILSNALFVINLFYDIGLTVGD----------IALVLWVLAFLLFLGEVMI 337 Query: 422 VIFVTGYYGLTQGLLSVLRLF--WGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + + + L V+ ++ + + + W ++ + + V W KT Sbjct: 338 TLSIEKTEMNKKNFLYVILMYFTYSQMWIVLVVWSLFLEIKRMFTGQEVQWYKTER 393 >UniRef50_Q01NU4 Type II secretion system protein E (GspE) n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01NU4_SOLUE Length = 566 Score = 229 bits (584), Expect = 3e-58, Method: Composition-based stats. Identities = 54/260 (20%), Positives = 96/260 (36%), Gaps = 18/260 (6%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 LG+IL+E I E L+ AL ++E G +LG ++ GLI+ L AL++Q GV Sbjct: 22 MRLGEILIERGKIDAEDLERALELQLERGDKLGKIVVDMGLIAQRDLLSALSDQMGVPLI 81 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++D + I + P+ L + L + D +D ++AA+ G +V+ Sbjct: 82 AVDGTPPNAPEIEGLSQRFLRQCRAFPVALNDSVLTIAMADPMDFETIAAVRAFSGLQVQ 141 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEI 666 + +I+ + Y D + + Q + + P L + Sbjct: 142 TALASEQEILDAIDRNYGES---DQKTFIGEGDDEQANADLEHLRDMASEAPVIRLVNAM 198 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL-QVSMQSLL 725 + S I++ E +GV+ + REL + L Sbjct: 199 IADAIEKRASDIHI-----EPFEKEFRIRFRVDGVLFAQE-----NPPRELKAAIISRLK 248 Query: 726 LKAGLNTEQVAQLESENEGE 745 L A LN + +G Sbjct: 249 LMAKLNIAE---RRLPQDGR 265 >UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium RepID=A1UIF2_MYCSK Length = 789 Score = 229 bits (584), Expect = 3e-58, Method: Composition-based stats. Identities = 63/455 (13%), Positives = 141/455 (30%), Gaps = 68/455 (14%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 L + ++ + ++Y + + R R+ + ++ +++++ A+NE Sbjct: 368 VLGFLFWLGMGSLTVMSLLYLILALV----CQYRQNRLRWNDIGDDQLPMVSVVLAAFNE 423 Query: 81 TGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 VI + + + T R + E+ +P + V + Sbjct: 424 EKVIARTIAELRRSDYPRSRFEVVAVNDGSTDGTLRILTELARDWPKLRVV---DQANSG 480 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER------KDLIQ 193 K+ +NN ++ + + DA+ + P +R R + Sbjct: 481 KSSAINNGINHASAVST-------VMVTMDADTLFRPDTIRNLARHFARHTHGRQVGAVA 533 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + R + + E+ L G + +SR A+ + Sbjct: 534 GHIKVGNRR--NLLTAWQSLEYISGICVTRMAERLLNAISIVPGACSAWSRTALEEI--- 588 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 F ++ ED D L+ +G + + D Sbjct: 589 ---GGFCDDTMAEDCDATLALQRRGYRILQENNAIADT---------------------- 623 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 P+T +Q+ RW G + +K L+R R GA+ A L Sbjct: 624 -EAPETIRALAKQRKRWTYGNIQALWKHRA---------MLFRPRYGALGLVALPYAALS 673 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 +I LL + + +S+ +G+ + L + I + + Sbjct: 674 LIVPLLFMPLTIVAAG----MSLAAGNWQSIALFAGFVAALHMIISITAVAMARERAWHL 729 Query: 434 GLLSVLRLFW---GNLINFMANWRALKQVLQHGDP 465 ++ V R+ + + + + +RA+K + D Sbjct: 730 LVVPVYRIIYEPLRAYLLYASAYRAIKGTIVAWDK 764 >UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LDP7_RHOVA Length = 1170 Score = 229 bits (583), Expect = 4e-58, Method: Composition-based stats. Identities = 67/481 (13%), Positives = 135/481 (28%), Gaps = 53/481 (11%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKRKL---SVYRRYPRMSYRELYKPDEKPLAIMV 75 + + +F + L FI + L + Y R + D+ +++++ Sbjct: 711 YMFYTLNLFQNTLTTLFIAAIALGLGRLVVLCGLAAYGNRRRKRRADPPYADDLSVSVLI 770 Query: 76 PAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARP 135 PA+NE VI + ++ + V + T V A P V + Sbjct: 771 PAFNEAKVITASIRQILASS-HQKLEVIVIDDGSTDGTADVVRGEFADDPRVSLMHTPNG 829 Query: 136 GPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN--YLVERKDLIQ 193 KA +N L T ++ DA+ P+ + + + + Sbjct: 830 ---GKARAINLALAQAT---------GDIVVVLDADTQFEPLTISRLVRWFADPKVGAVA 877 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 R + + E+ + L G + R A+ Sbjct: 878 GNAKVGNRI--NVLTRWQALEYITAQNLERRALATLDCITVVPGAVGAWRREAIM----- 930 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 F +L ED D+ ++ G +F + Sbjct: 931 -GLGGFPSNTLAEDQDLTISVQRAGYKVLFDADALAWT---------------------- 967 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 PDT +Q+ RW G + +K + ++ + Sbjct: 968 -EAPDTLGGLAKQRFRWAFGTLQCLWKHRSANLNPRYGALGMIALPQVWLFQIALALISP 1026 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV-NRIVQRVIFVTGYYGLT 432 ++ LLLL+ D S F+ + +TL + + V + + Sbjct: 1027 LVDLLLLVQVVRTGIDYLQHGSQFNSENFTITLTYYAVFMTVDLSAALIAFLLEKREDRS 1086 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQ 492 VL+ F + + +++ ++ R V W K + R Sbjct: 1087 LLWWLVLQRFGYRQVMYYVVAKSV---VKALQGRVVGWGKLERKATVRAMEPREADVRPA 1143 Query: 493 I 493 Sbjct: 1144 P 1144 >UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter RepID=B0T3D0_CAUSK Length = 1124 Score = 229 bits (583), Expect = 4e-58, Method: Composition-based stats. Identities = 62/460 (13%), Positives = 130/460 (28%), Gaps = 50/460 (10%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 + V + ++ L I + L++ R+ S L +++++P +N Sbjct: 703 LFRGVKVALTALFLTAIALGLARLVFLACLALVHRWTHQSPENLDPETGPLVSVLIPCFN 762 Query: 80 ETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 E VI + +++N + V + +T ++V P V + Sbjct: 763 EEKVIAASVARILES-EWKNLEVLVLDDGSKDNTAQEVRRAHGDDPRVTLLSFENG---G 818 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL-RLFNYL-VERKDLIQIPVY 197 KA +N L + DA+ + P + RL + + Sbjct: 819 KARAVNRGLAIAK---------GDYVVALDADTLFPPKTIGRLIRWFQDPTIGAVAGNAI 869 Query: 198 PFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGI 257 R + + E+ + AL G + + + AL Sbjct: 870 VGNRV--NMVTRWQALEYVTAQNLERRALAALGAVTVVPGAVGAWRKSVLDAL------G 921 Query: 258 AFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFP 317 + +L ED D+ + G F P Sbjct: 922 GYPSDTLAEDQDLTIACQRAGWKVAFDPAAQAFT-----------------------EAP 958 Query: 318 DTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQL 377 DT ++Q+ RW G + +K S + + ++ L Sbjct: 959 DTVGGLLKQRFRWSFGTLQCVWKHRAALFSPKTPALGFVALPQIWLFQILLAVAAPLVDL 1018 Query: 378 LLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV-NRIVQRVIFVTGYYGLTQGLL 436 ++ + S A +S + LL+ ++V + + Sbjct: 1019 AVVWSLISGVYGAIAHPVEWSPDDTIQGLLYWAVFILVDLSAGALGMALEKRAPWADLPY 1078 Query: 437 SVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHD 476 ++ F + + +++ L RV W K Sbjct: 1079 LPVQRFGYRQLMYYVVVKSV---LTAARGGRVGWGKLERR 1115 >UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=Streptomyces RepID=B5GIB2_9ACTO Length = 767 Score = 229 bits (583), Expect = 4e-58, Method: Composition-based stats. Identities = 69/463 (14%), Positives = 128/463 (27%), Gaps = 62/463 (13%) Query: 13 YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLA 72 + V A + ++GL V+ + + R R P +P+ Sbjct: 326 WVWLVAASDSVTGVLVTGLAVTGSLVLARFGLMLLLSFAHARRTRRRGFAWGVPVTEPVT 385 Query: 73 IMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 ++VPA+NE + + + ++ I V + DT V+ + P V V Sbjct: 386 VLVPAYNEAKCVTATV-TSLSRSEHP-VEIIVIDDGSTDDTAGIVERLG--LPGVRVVRQ 441 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLVERKD 190 K LN + + ++ D + V P +R + R Sbjct: 442 ENA---GKPAALNRGIAHASH---------DIIVMMDGDTVFEPATVRELVQPFGDPRVG 489 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 + R+ E+ D + + L G F R A+T + Sbjct: 490 AVAGNAKVGNRDS--LIGAWQHIEYVMGFNLDRRMYDVLRCMPTIPGAVGAFRRDALTRV 547 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 +L ED DI + G ++ Sbjct: 548 ------GGMSEDTLAEDTDITMAIHRDGWRVVYAEKARAWT------------------- 582 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 P++ + Q+ RW G + +K + R +S F Sbjct: 583 ----EAPESVAQLWSQRYRWSYGTMQAIWKHRHAVLERGASGRFGRVGLPLVSLF----- 633 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 +V+ LL L L + + AWL L F Sbjct: 634 -MVLAPLLAPLIDVFLLYGIVFGPTGRTLLAWLGVLAVQAVCAAYA-------FRLDKEP 685 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 + + L+ + ++ ++ L G R +T Sbjct: 686 MRHLVSLPLQQLLYRQLMYVVLLQSWITALTGGRLRWQKLRRT 728 >UniRef50_C3RLB0 Type II secretion system protein E n=2 Tax=Bacteria RepID=C3RLB0_9MOLU Length = 563 Score = 228 bits (582), Expect = 5e-58, Method: Composition-based stats. Identities = 44/268 (16%), Positives = 102/268 (38%), Gaps = 17/268 (6%) Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVEG--LRLGGSMLMQGLISAEQLAQALA 538 P+G++L E I +EQL+ AL + RLG ++ G +S Q+ +AL+ Sbjct: 4 RTKYMRNIPIGEVLKEYGYINDEQLNVALEAQKSNRSKRLGQHLIDLGFVSEYQMLEALS 63 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 ++ + ++ + ++P ++A Y ++ + L + +L + + D ++ + + Sbjct: 64 DKLAEPLIELSEIKVDIDAVQKIPRAMADKYNIIAIDLTDQQLTIVTSDPLNFYGIEDVR 123 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVP 658 G + + + ++ + +Y D + T E P Sbjct: 124 LVTGMHLNVCLATKAEVSKAIDRYYNDVAALDIADDIKLNTIVVEDTLDLFNE-SEDDTP 182 Query: 659 HQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ 718 L +L+ N S I++ E + + +G L LT+Q+ +Q Sbjct: 183 VVKLVNTLLSRGYVNNASDIHI-----EPFEDKVIIRMRVDG-----MLVDYLTLQKNIQ 232 Query: 719 -VSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + + L+ + +G Sbjct: 233 NSLIVRIKILSNLDIAE---KRLPQDGH 257 >UniRef50_C5S5F1 Glycosyl transferase family 2 n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S5F1_CHRVI Length = 879 Score = 228 bits (581), Expect = 7e-58, Method: Composition-based stats. Identities = 67/438 (15%), Positives = 133/438 (30%), Gaps = 49/438 (11%) Query: 1 MDWLLDVFATWLYG-----LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRY 55 + + + A W+ A + ++ + G + ++ + + R Sbjct: 346 ITYAISTAAVWVVYDYTRQYMTPATAIVGVLLLIGGVGVIVLLMAEAHEWAESVWLRRWR 405 Query: 56 PRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPN-DPDTQ 114 R + +++ VPA+NE + LDY ++ + V DP Sbjct: 406 RPFPLRSVPDDQLPFVSVHVPAYNEPPELLKETLDGLAALDYPHFEVLVIDNNTKDPAVW 465 Query: 115 RDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVI 174 V + CAR + P KA LN L + + DA+ ++ Sbjct: 466 EPVRDYCARLGERFRFFHVDPLAGYKAGALNFALRH-------TDPRADVVAVIDADYIV 518 Query: 175 SPMELRLF--NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQ 232 P LR + ++Q P + F +M + E+ + R Sbjct: 519 RPPWLRHLVPAFGDPEVAIVQAPQDYRDAHQNAFKAMC-MAEYRGFFHLGMVTRNERNAI 577 Query: 233 VPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEA 292 + G T R+ + A+ + +TED ++G RL E G +++ Sbjct: 578 IQ-HGTMTMIRRQTLDAVD------GWAEWCITEDAELGLRLFEGGHKALYIPCTYG--- 627 Query: 293 KEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNY 352 + PDTF+ +Q+ RW G V + L Sbjct: 628 --------------------QGLMPDTFADFRKQRYRWAYGAVRILLHHRRELLGLRGTS 667 Query: 353 FLWRDRKGAISNFVSFLA---MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWL 409 R ++ ++ + A L+ L + + + + + L Sbjct: 668 LSLGQRYHFVAGWLPWFADGFNLLFNFAALAWSVAMVMAPDTITPPYMTIALVPLVLFLF 727 Query: 410 NFGLMVNRIVQRVIFVTG 427 + +RV Sbjct: 728 KMSKSLFLYRRRVTATLR 745 >UniRef50_Q8NU22 Glycosyltransferases, probably involved in cell wall biogenesis n=3 Tax=Corynebacterium glutamicum RepID=Q8NU22_CORGL Length = 487 Score = 228 bits (581), Expect = 8e-58, Method: Composition-based stats. Identities = 79/485 (16%), Positives = 146/485 (30%), Gaps = 69/485 (14%) Query: 7 VFATWLYGLKVIAITLAVIMFIS-GLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE--- 62 F + G ++ I I F+ ++ + R R + + Sbjct: 52 AFIAVVVGFILMLIFARQAALIGLSATCTFMYLITLLDRFIMFSRGIRAESIIQVSDEDA 111 Query: 63 --LYKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDE 119 + K ++VPA+ E VI + A + + +D T + Sbjct: 112 LAFPEDKLKTYTVLVPAYGEPEVIAQLLASMHAFDYPKHLLQVLLMLEEDDLPTIAAAEA 171 Query: 120 VCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL 179 + P +K N L T + DAED+ P++L Sbjct: 172 AGV-DQVATIIKVPPAQPRTKPKACNYGLHFAT---------GEIVTIFDAEDMPDPLQL 221 Query: 180 RLFNYLVERKD----LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPS 235 R ER +Q + T+ + E+ +P + VP Sbjct: 222 RRVVVAFERSASNTVCVQSRLSYRNARQNLLTA-WFTIEYDVWFNFLLPGVMRMNAPVPL 280 Query: 236 AGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER 295 G + D A+D ++TED D+G R+ KG + + +EA Sbjct: 281 GGTSNHLLTGVLK------DLGAWDPFNVTEDADLGVRIAAKGYSTAVLDSVTWEEANSD 334 Query: 296 EQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW 355 +RQ+SRW G + W + +L Sbjct: 335 TI------------------------NWLRQRSRWYKGYLQ------TWLVYMRRPKWLV 364 Query: 356 RDRKGAISNFVSFLA-----MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN 410 ++ + +FL + V+ L L+ + +F + + L+ L Sbjct: 365 QELGIIPAVRFTFLMAGTPIIAVLNLLFWYLSLTWILGQPGTIEQMFPPAVYYPALVCLV 424 Query: 411 FGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAW 470 G + + G L + L+W + +A + Q++ R W Sbjct: 425 VGNAATIFMNLIGCREGRDPLLLIAVLTFPLYWLLM--SIAALKGTWQLI----TRPSYW 478 Query: 471 DKTTH 475 +KT H Sbjct: 479 EKTAH 483 >UniRef50_B8IGT7 Glycosyl transferase family protein n=9 Tax=Alphaproteobacteria RepID=B8IGT7_METNO Length = 678 Score = 227 bits (579), Expect = 1e-57, Method: Composition-based stats. Identities = 75/453 (16%), Positives = 134/453 (29%), Gaps = 58/453 (12%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDY 97 + + + + P Y D ++V E V+ ++ A LDY Sbjct: 202 LFLGMMLFRLAAVIEPPLPVPDYPRAADADLPVYTVLVALHREAAVVPHLI-GALERLDY 260 Query: 98 E--NYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFE 155 + + +D +T + A P + VV P +K LN L Sbjct: 261 PAAKLDVKLVLEADDAETAGAL-AARALPPWIEIVVAPPGLPRTKPRALNVALALA---- 315 Query: 156 RSANFAFAGFILHDAEDVISPMELRLFNY----LVERKDLIQIPVYPFEREWTHFTSMTY 211 +++DAEDV P +LR+ R +Q + + T + Sbjct: 316 -----RGEYLVVYDAEDVPDPGQLRMAAAIFAGANPRTACLQGRLVIDNTGDSWLTR-CF 369 Query: 212 IDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIG 271 E++ L +P A VP G T F + L +D ++TED D+G Sbjct: 370 TLEYTALFDVLIPALAAWRLPVPLGGTTTHFRTATLRTL------HGWDAWNVTEDADLG 423 Query: 272 FRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWI 331 RL G + E P +RQ+ RW+ Sbjct: 424 LRLALAGYHVG------------------------DLPLSTEEEAPAEIRPWLRQRVRWM 459 Query: 332 IGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAW 391 G V + ++ G++ +V + L + W Sbjct: 460 KGFVQTTITHSRRPAAAA----TALGPLGSLCALALIPGTVVSALVYPALLAVAGWRIVL 515 Query: 392 HFLSIFSGSA--WLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINF 449 + L + FG + ++ L + L + Sbjct: 516 SPAEPDPSFWVNMVTALGLVLFGAGLAALLLPAARGCLQRRWWGLLPWICVLPVYYGLMS 575 Query: 450 MANWRALKQVLQHGDPRRVAWDKTTHDFPSVTG 482 +A W +L +++ W+KT H + Sbjct: 576 VAAWLSLAELVL----APSRWNKTEHGRARTSR 604 Score = 42.3 bits (98), Expect = 0.077, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 46/157 (29%), Gaps = 13/157 (8%) Query: 487 LRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 PL L Q + + L A+ G ++L GL+S + +ALA G + Sbjct: 14 RFPLELAFLLWQGVRPDILIHAMGEAHRAGTDGATALLRAGLMSEDAYYRALARALGSPY 73 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLEN-DELIVGSEDGIDPVSLAALTRKVGRK 604 ++ + P SV PLR L++ L GR Sbjct: 74 --LEDLALAPGA--RYPDSVLA--GAAPLRPGPWGTLVLA---PQGAAIAELLRHGGGRP 124 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQH 641 I ++ + H + L Sbjct: 125 --PAITAPARLRAAVLHLHGAEAARRAAETLEVRAPD 159 >UniRef50_B5YIG2 Type IV-A pilus assembly ATPase PilB n=4 Tax=Bacteria RepID=B5YIG2_THEYD Length = 572 Score = 227 bits (579), Expect = 1e-57, Method: Composition-based stats. Identities = 52/276 (18%), Positives = 107/276 (38%), Gaps = 22/276 (7%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQAL 537 T T +G LL+ I+E+QL A ++EG+++G +++ G I+ ++L +++ Sbjct: 1 MSTKLTSERLTIGVFLLKKGKISEKQLIDAQAVQKIEGIKIGAALIKLGYITEDELVESM 60 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 +E G ID+++I ++ +P V Y VLP E + + V D + ++L L Sbjct: 61 SELYGYPVFKIDSYKIDPLVVKLLPEDVIRKYKVLPFLREGNIIRVLITDPANEIALEQL 120 Query: 598 T-RKVGRKVRYVIVLRGQIVTGLRHWYARRRGH----DPRAMLYNAVQHQWLTEQQAGE- 651 G K+ + I + ++ + L + + Q E Sbjct: 121 KFFLSGFKILFYIGKDSDFKNLINKFFGEEGAEIYSKETVHELVESAVQEPSITQPEEEQ 180 Query: 652 -IWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRV 710 I P L +I+ S I++ E + +GV+ + Sbjct: 181 AILEVDAPLIRLVNQIIVNAISKRASDIHI-----EPFEDNIYIRYRIDGVLH-----DI 230 Query: 711 LTIQREL-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 LT+ +L + + + A ++ + +G Sbjct: 231 LTLPPKLKSALITRIKIMANMDISE---RRLPQDGR 263 >UniRef50_D2M877 Glycosyl transferase family 2 n=1 Tax=Rhodopseudomonas palustris DX-1 RepID=D2M877_RHOPA Length = 706 Score = 227 bits (578), Expect = 1e-57, Method: Composition-based stats. Identities = 89/477 (18%), Positives = 158/477 (33%), Gaps = 66/477 (13%) Query: 16 KVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMV 75 V+ + L + ++ + + R+ L R P R D ++ Sbjct: 250 AVMVLPLCLDPEVTTAFLAIWFIGFAGLRLLASLWPRRAPPISVRRP--DADLPVYTVVA 307 Query: 76 PAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 + E +G + E A E + + P+D T+ + + R P++ ++ Sbjct: 308 ALYREADSVGPLVEALEALDYPPEKLDLILVIEPDDLFTRAALARLKPR-PHLRVLIAPA 366 Query: 135 PGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKD---- 190 P +K LN L + + DAED P +LR + Sbjct: 367 VAPKTKPKALNYALAFA---------RGSFIAVFDAEDRPDPGQLRAALAAFDGAGRETA 417 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 +Q + + S T++ E++ +P AL +P G F + A+ Sbjct: 418 CVQASLCIDNLTHSWL-SRTFLAEYAGQFDLFLPGLAALGLPLPLGGSSNHFRTDVLRAI 476 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 A+D ++TED D+GFRL G Sbjct: 477 ------GAWDPHNVTEDADLGFRLARLGYRCGTFAST----------------------- 507 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD------RKGAISN 364 E P TF +RQ+SRW+ G + W + LWR+ + Sbjct: 508 -TYEEAPLTFGNWLRQRSRWMKGWIQ------TWEVHMRHPLRLWRETGIGGVLALNLLL 560 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF 424 + L+ L LL++ + + + + + L ++ G+ +V + Sbjct: 561 GGNVLSALAYPLLLMIALMSAADWADSSPNWLAADTPTALHWLAISSGVASTIVVGLLGL 620 Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 V G+L + L+W +A WRAL + WDKT H S Sbjct: 621 VRRRQWRHAGVLMLTPLYWL--CLSIAAWRALAHYVW----CPYRWDKTQHGVASRP 671 >UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4_PEDPA Length = 414 Score = 227 bits (578), Expect = 2e-57, Method: Composition-based stats. Identities = 71/461 (15%), Positives = 135/461 (29%), Gaps = 58/461 (12%) Query: 22 LAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNET 81 +++ I + I + P E + +MVPA NE Sbjct: 3 FFLMVAIIAIWGTLIINLILTVAGYTWYLQEAPKPDKQLPE----KIPFVTVMVPAHNEG 58 Query: 82 GVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARP---GP 137 VI + +++Y I V + ++ + EV F V G Sbjct: 59 IVIVKTVLSLLSFDYPHDHYEIIVINDNSSDNSAELLKEVQNDFGEERLKVINTDNVTGG 118 Query: 138 TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY---LVERKDLIQI 194 K++ LN L + ++DA++ LRL +R + Sbjct: 119 KGKSNALNIGLKQAK---------GSVIAIYDADNTPERGALRLLVAELLGNDRLAAVIG 169 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 +E T T I E R+ L G R + + Sbjct: 170 KFRTRNKEATILTRFINI-ETLSFQWMAQAGRQRLFKLCTIPGTNYVIRREILEKI---- 224 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 +DV++L ED +I FR+ + G F V E Sbjct: 225 --GGWDVKALAEDTEISFRVYQMGYQIKFQPRAV-----------------------TWE 259 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 P T Q++RW+ G ++ K K + + F+ A+++ Sbjct: 260 QEPQTLDVWFHQRTRWVKGNIYVVVKNAKLLFKKAGRPIRFDLLYFLSTYFLLMTALILS 319 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 + +L D F + +M ++ + + R Sbjct: 320 DVVFVLSVAGLAHSDLQGFSNALWLFGIIMFIISTFVTVTTEKGEMRFS--------NIL 371 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + + L + + +A + + + + ++ W KT Sbjct: 372 YIIFMYLVYSQMWLAVAAYGMVGYIREQVFHKQAKWYKTKR 412 >UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobacteria RepID=Q30SK0_SULDN Length = 433 Score = 226 bits (577), Expect = 2e-57, Method: Composition-based stats. Identities = 76/485 (15%), Positives = 142/485 (29%), Gaps = 57/485 (11%) Query: 5 LDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELY 64 ++ F W + + + F+ ++ + I +RY S EL Sbjct: 1 METFFIWWHYIWHAMLGYVF------YYPLFMSTLWMIGAIFFYYKSEKRYVEQSIPELR 54 Query: 65 KPDEKP-LAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 + + ++I++P +NE A + Y + + + DT + + Sbjct: 55 ENESWAGVSILIPCYNEGENAIETITYAL-DVIYPEFEVIAINDGSKDDTLDILLSLAKD 113 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN 183 P + V KA L S + D + +I P L Sbjct: 114 NPRLKVV--NLAQNQGKALALQAG---------SLVAKHEFLVCIDGDALIDPYCLYWMA 162 Query: 184 YLV---ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 + R T + EFS + G + + +GV T Sbjct: 163 KHFIRYPEVAAVTGNPRIRNR--TSLLGKIQVGEFSSIVGMIKRAQRSFGRLFTVSGVIT 220 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F + AV + + LTED DI ++L+ G F +V Sbjct: 221 GFRKSAVHEV------GYWSPDMLTEDIDITWKLQRNGWDVRFEPKSLVWI--------- 265 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P+T +Q+ RW +G FK K + LW Sbjct: 266 --------------LMPETIKGLWKQRLRWAMGGAQVMFKNFKVLFLYKQTH-LW---GL 307 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 I +S + M+ ++L+ + L+P + S +++ + L+ + + Sbjct: 308 MIELLLSMVWAYTMVLVILVWIFGLLFPIGYLAPSESPILPDQGSVILIGACLVQFGVSK 367 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 + + F +N AL +VL R Sbjct: 368 WLDGHYDEGLGKNYFWMIWYPFAFWFLNLFTAVAALPKVLFGKKGRARWVSPDRGVHQQS 427 Query: 481 TGDTR 485 T + Sbjct: 428 TKSKK 432 >UniRef50_B4W7G8 Putative uncharacterized protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4W7G8_9CAUL Length = 461 Score = 226 bits (577), Expect = 2e-57, Method: Composition-based stats. Identities = 78/485 (16%), Positives = 148/485 (30%), Gaps = 63/485 (12%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDV-VYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP 70 + L + ++ S L FI + + +V + ++ R + Sbjct: 24 VASLATALVAAGIVWPRSTLSASFIGIQMGFVASALWRAALVIACLRPTPSSPKPSRWPR 83 Query: 71 LAIMVPAWNETGVIGNMAELAATTLDYENYHI--FVGTYPNDPDTQRDVDEVCARFPNVH 128 I+ +E V+ + + + +DY + F+ +D T R + Sbjct: 84 YTILAALHDEAAVVPQLIQR-LSKIDYPRRQLEGFLVLEAHDQATIDAAKA-ARRPGWLR 141 Query: 129 KVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV-- 186 +V P +K LN+ L T ++DAED P +LR Sbjct: 142 ILVVPPGAPKTKPRALNHALAFAT---------GELLTIYDAEDEPDPGQLREAASRFAG 192 Query: 187 -ERKDLIQIPVYPF--EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 + +Q P+ E + F + E++ L ++P L P G Sbjct: 193 QPQLGCLQAPLRIRRRNAELSTFLDRQFAFEYAALFEVNLPGMAKLNLPFPLGGTSNHLR 252 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 A+ + +D ++TED D+GF+L G + P Sbjct: 253 TAALRRV------GGWDAYNVTEDADLGFKLWSAGWRLGVLESP---------------- 290 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAIS 363 E P + Q++RW+ G + W L R + +++ Sbjct: 291 --------TWEAPPGALERWLPQRTRWLKGYMQ------TWGVHTRAPRALGRRGQLSLA 336 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 + + L + A + L + L G++ + V Sbjct: 337 MTLGAAIVSAASHAPTLAWLIAALMVALNIGMA--PVVPLASFAVLAVGVIAAWMGCAVG 394 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 + +W + +A AL +++ AWDKT HD Sbjct: 395 ARRAGQDYRLTDMLAAPAYWS--LLSLAFIHALWRLIV----APYAWDKTAHDAEIDAEC 448 Query: 484 TRSLR 488 T + Sbjct: 449 TSVMT 453 >UniRef50_UPI0001B55850 glycosyl transferase family 2 n=1 Tax=Streptomyces sp. C RepID=UPI0001B55850 Length = 486 Score = 226 bits (576), Expect = 2e-57, Method: Composition-based stats. Identities = 84/463 (18%), Positives = 133/463 (28%), Gaps = 64/463 (13%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAW 78 LA++ I V+ L L + ++VPA+ Sbjct: 77 MAVLALLTAIVTAYSLLHVVLML-----TGLGSGGPLAGADDVPLPDAELPFYTVLVPAY 131 Query: 79 NETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGP 137 E GVIG + A + + V +DP T R V R P V V P Sbjct: 132 REAGVIGGLVRHLAELDYPPDRLEVLVLVERHDPGTARAVPAAG-RPPFVRLVRLPPGPP 190 Query: 138 TSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE----RKDLIQ 193 +K +N L ++ DAED P +LR +Q Sbjct: 191 QTKPRSVNLGLLLA---------RGELLVVFDAEDRPDPGQLRRVAARFAARGADLACVQ 241 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 + T + E++ +P L VP G F + A+ Sbjct: 242 AQLLFHNAAGNWLTR-QFAMEYALRFTLALPGLVRLGMPVPLGGTSNHFRTATLRAV--- 297 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 +D ++TED D+G R G + Sbjct: 298 ---GGWDAWNVTEDADLGMRCAAMGHRTETIGS------------------------VTW 330 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 E VRQ++RW G + + + + + V Sbjct: 331 EEALGAVRPYVRQRTRWFKGFLLTTVVHTRRPRRTVSRFGGRGLLTLLGIVAGAPVTSFV 390 Query: 374 MIQL-LLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 L L L + + S + L + + +R ++ L Sbjct: 391 QPLLAALTLIGLCGLSWSPAGAGLLLPSVAAQAVAALAWTAITFTAARRAGLGAPWHALL 450 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 L SVL F A WRA+ Q++ R +W+KT H Sbjct: 451 TPLCSVLWWF--------AAWRAVHQLV----FSRFSWEKTPH 481 >UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC n=146 Tax=Bacteria RepID=PGAC_ECOLI Length = 441 Score = 226 bits (576), Expect = 2e-57, Method: Composition-based stats. Identities = 70/450 (15%), Positives = 136/450 (30%), Gaps = 54/450 (12%) Query: 31 LDDFFIDVVYWVRRIKRKLSVYRRYPR-MSYRELYKPDEKPLAIMVPAWNETGVIGNMAE 89 FF+ +++ V + + R +P + D ++I++P +NE + Sbjct: 36 FWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETI- 94 Query: 90 LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLD 149 AA YEN + + T+ +D + A+ P++ + KA L Sbjct: 95 HAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI--HLAQNQGKAIALKTGAA 152 Query: 150 AITQFERSANFAFAGFILHDAEDVISPMELRLFNY---LVERKDLIQIPVYPFEREWTHF 206 A + D + ++ R + P R + Sbjct: 153 AAKS---------EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG--NPRIRTRSTL 201 Query: 207 TSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTE 266 + E+S + G + +GV F R A+ + + +TE Sbjct: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV------GYWSDDMITE 255 Query: 267 DYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQ 326 D DI ++L+ T + + P+T +Q Sbjct: 256 DIDISWKLQLNQWTIFYEPRALCWI-----------------------LMPETLKGLWKQ 292 Query: 327 KSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESL 386 + RW G K + ++ +F ++ I + LA L Sbjct: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 Query: 387 WPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNL 446 + H + + L TL L F IV +I + LT L ++ Sbjct: 353 NIELTHIAATHTAGILLCTLCLLQF------IVSLMIENRYEHNLTSSLFWIIWFPVIFW 406 Query: 447 INFMA-NWRALKQVLQHGDPRRVAWDKTTH 475 + +A + +V+ +R W Sbjct: 407 MLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 >UniRef50_Q11MF6 Glycosyl transferase, group 2 family protein n=1 Tax=Chelativorans sp. BNC1 RepID=Q11MF6_MESSB Length = 656 Score = 226 bits (576), Expect = 3e-57, Method: Composition-based stats. Identities = 72/476 (15%), Positives = 140/476 (29%), Gaps = 62/476 (13%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 L + + ++ + L +R + + P + + Sbjct: 200 LIIVAFALAPFSALVILHLLLSVLFLACVLLRVAVSQEARE-PSPPAVLTTMRPAEMPVY 258 Query: 72 AIMVPAWNETGVIGNM-AELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 ++V + E V+ + L I + +D +T + + + Sbjct: 259 TVLVALYREADVVPELLVSLGRIVWPRSKLEIKLVCESDDTETLAAIRA-QELHSYIEVI 317 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE--- 187 GP +K L+ L +T +L DAED PM+L Sbjct: 318 EVPPHGPRTKPKALSYALPLVT---------GEFVVLFDAEDRPHPMQLVEAWERFRSNE 368 Query: 188 --RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 +Q P+ R + + + + E++ L +P +P G F R Sbjct: 369 SGDLACLQAPLMITNRGES-WIASMFAFEYAALFRGILPWLAKRDLVLPLGGTSNHFRRA 427 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 + + +D ++TED D+G RL G + P Sbjct: 428 LLERV------GGWDPCNVTEDADLGLRLARMGYKTGTISSP------------------ 463 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRD--RKGAIS 363 E P T + Q++RW G + W + L+R+ R + Sbjct: 464 ------TYEDAPTTACIWLPQRTRWFKGWMQ------TWLVHMRDVPRLYRELGRTSFLV 511 Query: 364 NFVSFLAMLV--MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 + + M V + ++ L + + L+ L +N + Sbjct: 512 TQILTMGMWVSALAYAAFPISAAVLLVIMLAQDNPVNHYTALLALDAVNVIFGHGAFLAL 571 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDF 477 ++ + + A WRAL Q+ + W+KT H Sbjct: 572 GWRTLPKTEQHGLWRHMMWIPVYWALLSAAAWRALWQLYR----CPHVWEKTPHRP 623 >UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophilus torridus RepID=Q6KZU9_PICTO Length = 395 Score = 226 bits (575), Expect = 3e-57, Method: Composition-based stats. Identities = 85/446 (19%), Positives = 152/446 (34%), Gaps = 65/446 (14%) Query: 30 GLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAE 89 G+ + +VY + + L Y + + + ++I+VPA NE VIG E Sbjct: 8 GILLIVVGIVYSIYQFPILLIGYLHFHDYDIDFDWDNYKPLVSIIVPAKNEETVIGRCIE 67 Query: 90 LAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLD 149 + Y+N+ +FV +D DT R R VH G +KA LN Sbjct: 68 -SILGQAYDNFELFVVVDNSDDDTYRIAKSY-ERDGRVHVFERH--GNLTKASALNYAY- 122 Query: 150 AITQFERSANFAFAGFILHDAEDVISPMELR--LFNYLVERKDLIQIPVYPFEREWTHFT 207 + +DA+ V+ L+ ++ D++Q RE FT Sbjct: 123 --------SMSHGEIIATYDADTVLEKNTLKNAVYGMRYMDADVLQGYNTYINREENIFT 174 Query: 208 SMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTED 267 + IDE + + R L VP AG F R + + ++ LTED Sbjct: 175 RLAAIDE--IIVKVSMIGRMYLHLFVPVAGSNQYFKRETIRII------GGWNGNFLTED 226 Query: 268 YDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQK 327 + G R+ K M ++ V + P T+S ++Q+ Sbjct: 227 LESGVRMAAKRMRSAYLPSAKVY-----------------------QETPATYSEYIKQR 263 Query: 328 SRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLW 387 RW+ G + K S L+ + L +++ +L + S++ Sbjct: 264 IRWLRGYHQVLLHSKKELSGLSG---------------LDILMIVLAPTFSGILLFSSIY 308 Query: 388 PDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLI 447 +F + + S + L M+ I FV + L + ++ ++ Sbjct: 309 ISILNFYNPYVHSMRTYFISLLFIFFMIYIIA----FVLALIKKKENALYIPLVYIYVVL 364 Query: 448 NFMANWRALKQVLQHGDPRRVAWDKT 473 N + + + KT Sbjct: 365 NAIISIYTFFLEILGVKRVWYKVKKT 390 >UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria RepID=C8WT25_ALIAD Length = 410 Score = 226 bits (575), Expect = 3e-57, Method: Composition-based stats. Identities = 69/450 (15%), Positives = 134/450 (29%), Gaps = 56/450 (12%) Query: 31 LDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAEL 90 L F + +V+ V RR P+ ++I++P NE V+ + Sbjct: 8 LYPFVMSIVWMVGGCVYAW---RRERHPFAESPDLPETPFVSILIPCHNEGDVLEDTIGR 64 Query: 91 AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDA 150 LDY Y I + DT+ ++ + A V V KA LN L Sbjct: 65 MLQ-LDYPAYEIVALNDGSTDDTRAVLERMAACDARVRVVNLPVQ--RGKARALNAGL-- 119 Query: 151 ITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV-----ERKDLIQIPVYPFEREWTH 205 + DA+ V++ LR + ER + R Sbjct: 120 -------VASRGEILVTVDADAVLAKDALRFLVWHFVAPGSERVGAVTGNPRIRNR--GT 170 Query: 206 FTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLT 265 + E++ + G + L + +GV F +RA+ D +D +T Sbjct: 171 LLGKIQVLEYASIIGLIKRAQRVLGKIMTVSGVIAAFRKRAL------VDCGMWDEDMVT 224 Query: 266 EDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVR 325 +D + ++L+ + + + P+ + +R Sbjct: 225 DDIAVSWKLERRAWDIRYEPRALCFMW-----------------------APERLRSLIR 261 Query: 326 QKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYES 385 Q++RW G V + + + + + A L ++ L+ LA++ Sbjct: 262 QRARWAQGGVEVLIRNASVLWTWKNRRMIPLYVEELLG---IAWAYLWVVSLVWTLAFDV 318 Query: 386 LWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGN 445 W +++ WL + + + + Y G Sbjct: 319 AHGIPWTYVA--ETGTWLGLTSLVQTSVALWIEQRYERDSLWRYYFYAIWYPAAYWMIGA 376 Query: 446 LINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + A +A + R W Sbjct: 377 FVVVWAVPKACWAMWAARRGRYATWKSPDR 406 >UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococcaceae RepID=Q9UY40_PYRAB Length = 447 Score = 226 bits (575), Expect = 4e-57, Method: Composition-based stats. Identities = 78/482 (16%), Positives = 147/482 (30%), Gaps = 58/482 (12%) Query: 8 FATWLYGLKVIAITLAVIMFISGLDDFFIDVVYW------VRRIKRKLSVYRRYPRMSYR 61 F + LY +I I LA+++ + + +++ + S+ +RYP Sbjct: 8 FQSALYLYILIIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTLLMASLGKRYPYDETG 67 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 + E + +++PA NE VI + DY N + + + T+ ++E+ Sbjct: 68 FNLEFLEPLVYVLIPAHNEERVIYKTVR-SVLGQDYRNMKVILINDNSTDRTRDIMEEIN 126 Query: 122 ARFPN-VHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 ++P V + SK LN L+ I ++ N + DA+ +I P L+ Sbjct: 127 RKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPN----YVFILDADYLIPPNALK 182 Query: 181 L----FNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 + IQ V P F + E + L Sbjct: 183 TLVSIMESAPQYVIGIQGNVRPRNFRKN-FVTKFITLERLVGFNVAIEGDMKLNENGKYG 241 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G A+ F S+TED D+ R G + + Sbjct: 242 GTV------ALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGW------ 289 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 E +T ++Q+SRW G + + + + R Sbjct: 290 -----------------EEAVETLRDYIKQRSRWAQGHLQVMIDHY---------WPVMR 323 Query: 357 DRKGAISNF---VSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGL 413 I +F ++ LV + L + S F+ +++ F L Sbjct: 324 SCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITGAPPLSFARPKLFLSVSIFTFLL 383 Query: 414 MVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 + + V ++ +A + L R+ W+KT Sbjct: 384 FWFSVAYSNWVEKKRHNYYVPWSFVALYPLYFMVFVIAGVIYTMRGLIRLLVGRLHWEKT 443 Query: 474 TH 475 Sbjct: 444 KR 445 >UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQK0_PARL1 Length = 917 Score = 225 bits (573), Expect = 5e-57, Method: Composition-based stats. Identities = 78/495 (15%), Positives = 145/495 (29%), Gaps = 49/495 (9%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 + ++ TW+ + A + L + + +V I + P + E Sbjct: 405 VTGLYFTWVGITVWSFLFFAQGLL---LIVLLAEAIEFVEVIWTRHGSRHFKP---FDEN 458 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPN-DPDTQRDVDEVCA 122 ++I VP NE + A LDY+NY + V DP+ + V + CA Sbjct: 459 AVSPTAMVSIHVPIHNEPPEMVRETLQALANLDYDNYEVLVLDNNTVDPEVWQPVRDYCA 518 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 + + P KA LN L+ + D++ + P L++ Sbjct: 519 QLGPRFRFFHLENWPGFKAGALNFGLE-------KTAEEAEIIAVIDSDYQVEPSWLKVL 571 Query: 183 NYLVE--RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 + +Q P +R + F +M Y E++ + R + G T Sbjct: 572 VPYFDKQDVGFVQGPQDYRDRHESAFKNMAYW-EYAGFFHIGMVQRNNFNAIIQ-HGTMT 629 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 + A+ + + + ED ++G +L G ++V Sbjct: 630 QVRKSALKRV------GGWAEWCICEDAELGIKLYRAGYDSVYVNHSFG----------- 672 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 R PDT S + Q+ RW G V + + + Sbjct: 673 ------------RGLTPDTLSGYITQRFRWAYGAVQIVKHHWDALAPWASSKKGLTGAQR 720 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 F++ L LL S+ AW S + L G V + V+ Sbjct: 721 --YYFLAGWLPWFADGLALLFTTASIVLSAWALYQPHMVSLPVAAFLIPTIGSFVFKFVR 778 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 + + S+ + L ++ P + Sbjct: 779 SLWLYAVRVRDCSFIESLGAGVAALGLTHTVAKAMLNGMITTSKPFIRTPKCEDKPPLAA 838 Query: 481 TGDTRSLRPLGQILL 495 + +LL Sbjct: 839 AFIQVREETVMLVLL 853 >UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DW74_METI4 Length = 480 Score = 225 bits (573), Expect = 6e-57, Method: Composition-based stats. Identities = 71/497 (14%), Positives = 142/497 (28%), Gaps = 61/497 (12%) Query: 25 IMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 F+ FI +Y + I R L + + + + + I +P +NE V+ Sbjct: 4 FWFLILALVLFIYGIYRMSLILR-LWMGSHRDKKAPTDALFYTYPEVTIQLPIYNEKSVV 62 Query: 85 GNMAELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARF--PNVHKVVCARPGPTS- 139 + A +DY I + + +T + + + R Sbjct: 63 ERLL-HAVCKIDYPKNKMEIQIIDD-STDETTAIISKWVCEYQKKGFDIYQLRRGTREGF 120 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN--YLVERKDLIQIPVY 197 KA L L+ + DA+ + P L+ + ++Q Sbjct: 121 KAGGLQYGLE---------RSKGEFIAIFDADFLPPPSFLKETLPYFRSRDVGMVQARWG 171 Query: 198 PFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGI 257 R+ + T + PVR G + ++ + D Sbjct: 172 YLNRQASLLTR-CQALFLDGHFLLEQPVRYKYNLFFNFNGTAGIWRKKCI------IDAG 224 Query: 258 AFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFP 317 ++ +LTED D+ +R + KG ++ + V P Sbjct: 225 GWEGDTLTEDLDLSYRAQFKGWKFVYTPQ-----------------------MVVPSELP 261 Query: 318 DTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQL 377 Q+ RW G + K S +Y L +G + +V + + Sbjct: 262 SPIVAFRTQQHRWAKGAIQTAKKHL--FSLFKGSYSLGSKIEGLFHLLAHSIHPIVAVLV 319 Query: 378 LLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLS 437 +L P F S +L+ L VIF++ + Sbjct: 320 ILNAISFFCSPLPQSFTLEVS------GMLFSVISLFYLSYFAVVIFLSK----NLEAGA 369 Query: 438 VLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLEN 497 +L L + + + K V+ + + +T + + L Sbjct: 370 LLILPFSMAVALGMTFANTKSVIDGLFGKNNVFVRTPKNGFFNSDKPIYKVEHEITLPLL 429 Query: 498 QVITEEQLDTALRNRVE 514 + + AL ++ Sbjct: 430 ETLFAAVFGIALYQAIQ 446 >UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospira RepID=B5W069_SPIMA Length = 561 Score = 224 bits (572), Expect = 7e-57, Method: Composition-based stats. Identities = 63/410 (15%), Positives = 118/410 (28%), Gaps = 59/410 (14%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSV-------YRRYPRMSYRELYKPDEKP 70 AI L++I + + + + + L + E + Sbjct: 111 AAIALSLIWSLMIIGHLISWGYWLILGLTGLLVIQAIRIVWAGHRGLDINPEHRPEELPF 170 Query: 71 LAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 ++++V A NE VIGN+ + Y ++V + T + E+ + +H Sbjct: 171 VSLLVAAKNEEAVIGNLVKNLCALNYPSHCYELWVIDDNSSDRTPIVLQELAKEYQQLHI 230 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLVE 187 + K+ LN L + DA+ + L+ + E Sbjct: 231 LHRDENATGGKSGALNQALPL---------TRGKILGVFDADATVDSDLLQQVIPKFQAE 281 Query: 188 RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAV 247 + +Q+ + T + + R A+ G G G R A+ Sbjct: 282 QVGAVQLQKAIANSNFNFLTRCQASEMALDAF--FQKQRVAVGGIGELRGNGEFIRRAAL 339 Query: 248 TALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTS 307 + + Q++T+D D+ RL F+ VV Sbjct: 340 ES------CGGWCEQTITDDLDLTIRLHLDHWDIEFLDTSVVF----------------- 376 Query: 308 NMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVS 367 E + Q++RW G + K F+ +R G F Sbjct: 377 ------EEGVTNWVALWHQRNRWAEGGYQRYLDYGK---------FIIANRMGVGKTFDL 421 Query: 368 FLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNR 417 F ++ L + + I S L L L + R Sbjct: 422 FGFLITQYILPMAAIPDLFMSLILRRPPITSPLTLLAVSLSLIGMFIGLR 471 >UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AU81_STRRD Length = 752 Score = 224 bits (572), Expect = 7e-57, Method: Composition-based stats. Identities = 60/451 (13%), Positives = 123/451 (27%), Gaps = 59/451 (13%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 ++ I+ ++G+ + +++++ R R + R P+ + ++VPA Sbjct: 352 FVTAMSWILVVAGVITL-LRLLFFLVLAWVHARRVRGGKRRAGRAPAWPEPPAVTVIVPA 410 Query: 78 WNETGVIGNMAELAATTLDYEN-YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 +NE I + DY + V + DT + P V + Sbjct: 411 YNEAAGIEATVR-SLVNTDYPGVLEVVVVDDGSSDDTAAIAASLG--LPGVRVIRQENG- 466 Query: 137 PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY--LVERKDLIQI 194 K LN + + ++ D + V P + + Sbjct: 467 --GKPSALNTGIAHASH---------DILVMVDGDTVFEPATIGHLVRPLSDPAVGAVSG 515 Query: 195 PVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADG 254 R E+ D + L G F R A+ + Sbjct: 516 NTKVGNRR--GMIGRWQHIEYVIGFNLDRRAFDLLGCMPTVPGAIGAFRRSALQEI---- 569 Query: 255 DGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVRE 314 V +L ED D+ + G ++ + Sbjct: 570 --GGVSVDTLAEDTDLTMAMCRGGWRVVYEENALAWT----------------------- 604 Query: 315 YFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 P + S RQ+ RW G + +K ++T R G ++ F V+ Sbjct: 605 EAPTSLSQLWRQRYRWCYGTLQAMWKHR---RAITEPSPFGRRCLGYLTLFQ------VV 655 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 + LL + + + W +L F + R + Q Sbjct: 656 LPLLAPVVDVMAVYSVVMGDPLPVVAVWAGFVLVQAFSGWYALRLDRERASVLWVLPLQQ 715 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGDP 465 + ++ + + ++ Q Sbjct: 716 FVYRQLMYLVVIQSVATAVLGVRLRWQTIRR 746 >UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobacteria RepID=Q110Z2_TRIEI Length = 502 Score = 224 bits (572), Expect = 8e-57, Method: Composition-based stats. Identities = 73/488 (14%), Positives = 136/488 (27%), Gaps = 72/488 (14%) Query: 6 DVFATWLYGLKVIAITLAVI--MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 WL I + L I GL R L + + E Sbjct: 67 AALMLWLIWTTTIILHLLSWGYWIILGLTGLLSVQF------LRILFAKPKLAPKTLSEE 120 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 + ++++V A NE VI + + +Y ++V + T ++++ Sbjct: 121 NFTEWPYISLLVAAKNEEAVIRKLVKNMLALDYPTNSYELWVIDDNSTDKTPLLLEQLAR 180 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 + + + + K+ LN + + + DA+ ++P L+ Sbjct: 181 EYEQLKVIRRSPDAGGGKSGALNAAIPFVK---------GKILGVFDADAQVTPDLLQKV 231 Query: 183 NYLVER--KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 L R +QI +T + + R A+ G G G Sbjct: 232 VPLFAREEVGAVQIRKAIANAGINFWTKGQSAEMVVDGF--FQEQRIAIGGIGELRGNGQ 289 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 A+ ++ Q++T+D D+ RL ++ FP V E Sbjct: 290 FVRMNALEE------CGGWNEQTITDDLDLTIRLHLNQWDIDYLAFPAVTEEGVTSPI-- 341 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 Q+SRW G + W L + Sbjct: 342 ---------------------ALWHQRSRWAEGGYQRYLDY--WKLILRNRMRFSK---- 374 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIV- 419 + + ++ L + + L L I S ++ L + R Sbjct: 375 ---TWDLWQFLVTQYLLSVAAVPDFLMSIILRRLPITSPLTVFTVMVSLLGMFIGLRRTR 431 Query: 420 ---QRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHG--------DPRRV 468 + + L LF L + + L + G P+R+ Sbjct: 432 KQQMNLAKEEKVMEFNSSKDNPLSLFLTLLESVRGTFYMLHWFVVMGVTIARMSILPKRL 491 Query: 469 AWDKTTHD 476 W KT H Sbjct: 492 KWVKTVHR 499 >UniRef50_B9JQZ7 Glycosyltransferase n=1 Tax=Agrobacterium vitis S4 RepID=B9JQZ7_AGRVS Length = 664 Score = 224 bits (572), Expect = 8e-57, Method: Composition-based stats. Identities = 71/496 (14%), Positives = 147/496 (29%), Gaps = 64/496 (12%) Query: 8 FATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRR--YPRMSYRELYK 65 F +A + ++Y R ++ R Sbjct: 217 FWLGSLLTATLAACSTFGYDALAVMHILTSLLYLCMLAFRAATLAYRIGAAAPPPALPAS 276 Query: 66 PDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARF 124 + ++V + E+ +I + + I + +D DT + E Sbjct: 277 VELPVYTVLVALYRESSMIPQLIDGLRRLDWPVSRLDIKLVCEADDLDTLGALAE-ADIP 335 Query: 125 PNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY 184 ++ V GP +K L+ L + +L+DAED P +L+ Sbjct: 336 AHIEIVPTPPIGPRTKPKALSYAL---------SGARGDFLVLYDAEDRPHPAQLKEAYA 386 Query: 185 LV----ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 +Q P+ + + + S + E++ L +P+ +P G Sbjct: 387 HFLSRPPEVACLQAPLIIANGDES-WISALFALEYAALFRGTLPMLAYHGMPLPLGGTSN 445 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F A+ + A+D ++TED D+G RL G + +++ Sbjct: 446 HFRIEALKDV------GAWDPYNVTEDADLGLRLFRAGYRCETITRQTLED--------- 490 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P + + Q+SRW G + W + ++ G Sbjct: 491 ---------------APVSSRIWMGQRSRWFKGWLQ------TWLIVMREPRVACKEMGG 529 Query: 361 -AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWL-----NFGLM 414 A + F + +++ L L + + + L L N Sbjct: 530 SAFAVFHLMIGGMLLSSLSHPALLLFLTMTVYSMANPPADGIPLRDLTVFWIDLVNILGS 589 Query: 415 VNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 + + + L + L+ +A WRA+ ++ + W+KT Sbjct: 590 YLIFLALGRAAMTEFERRRIGRRYLFIPLYWLMTSIAAWRAMIEL----KTKPFFWNKTP 645 Query: 475 HDFPSVTGDTRSLRPL 490 H + ++ Sbjct: 646 HAPRGNERNRKAQPNK 661 >UniRef50_B8H475 N-acetylglucosaminyltransferase n=3 Tax=Caulobacter RepID=B8H475_CAUCN Length = 497 Score = 224 bits (570), Expect = 1e-56, Method: Composition-based stats. Identities = 73/454 (16%), Positives = 136/454 (29%), Gaps = 59/454 (12%) Query: 35 FIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAAT- 93 ++ + + R + PR L + D ++ P + E V+ + A Sbjct: 97 LFFFIFLLGGLTRLAAAMTPLPRHHSPALAEADLPSYTLITPLYREAEVLPELVASLAAI 156 Query: 94 TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQ 153 + + +D T+ + + +V P +K N L+ Sbjct: 157 DYPRDRLQALIVLEADDEVTRAAARAL-DLPSFIQVLVVPPGTPRTKPRACNYALERA-- 213 Query: 154 FERSANFAFAGFILHDAEDVISPMELR----LFNYLVERKDLIQIPVYPFEREWTHFTSM 209 +++DAED+ P +LR F R +Q P+ + ++ F Sbjct: 214 -------RGDLVVIYDAEDMPDPGQLREAAARFAASDARLACLQAPLRIEDPGFSLFLPS 266 Query: 210 TYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYD 269 + E++ +P P G F + + A+D ++TED D Sbjct: 267 QFRLEYAAHFEVLLPALARWGLPFPLGGTSNHFKIAPLREI------GAWDPYNVTEDAD 320 Query: 270 IGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSR 329 +GFRL G + P E P T + Q++R Sbjct: 321 VGFRLAAAGYRLDVIHRP------------------------TWETAPTTRAQWFPQRAR 356 Query: 330 WIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPD 389 WI G + + R + AI+ ++ + L + ++ Sbjct: 357 WIKGHMQTLAVHARGPVP--------RQPRNAIALILTLAQSVASSHLHGPVMGVAIALA 408 Query: 390 AWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINF 449 FL + L+ G + + L + +W Sbjct: 409 LVDFLPDAAFQIPPHDLVLYFAGWGAAALAGARGVMRAGGRPKALHLLGMPAYWL--CQS 466 Query: 450 MANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 +A +AL Q + WDKT H S Sbjct: 467 VAAVKALHQFV----TAPHHWDKTLHTPRSGRPR 496 >UniRef50_A6M148 Type II secretion system protein E n=18 Tax=Clostridiaceae RepID=A6M148_CLOB8 Length = 564 Score = 224 bits (570), Expect = 1e-56, Method: Composition-based stats. Identities = 45/264 (17%), Positives = 101/264 (38%), Gaps = 15/264 (5%) Query: 485 RSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGV 543 R LG IL+ IT QL AL+++ G +LG +L +I+ E + +A+ +Q G+ Sbjct: 3 TEKRRLGNILVNAGKITGYQLQEALKSQRTLGKKLGEILLDSKIITEEDIIEAIEQQTGI 62 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 ++ I +P ++ Y ++P +N+++ V D ++ ++ + G Sbjct: 63 KKVDLNTINFDRKAITLIPQNLCDKYLLIPFGFDNNKIKVALADPLNIFAIDDVAISTGF 122 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQ--YVPHQF 661 ++ I + I + +Y+ ++ ++ L +QA + + P Sbjct: 123 EIESFISRKADIKKFIGIYYSSQQVNNAAIQLAKESTKAVKNGKQAIDEMSEVNSAPVVK 182 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM 721 + + + S I++ E + +G + + L + Sbjct: 183 MVDYLFRNSVEMKTSDIHI-----EPFENEIRIRYRIDGKLQTVNTLGI----ESLGPLV 233 Query: 722 QSLLLKAGLNTEQVAQLESENEGE 745 + + AGLN + +G Sbjct: 234 TRIKILAGLNIAE---KRIPQDGR 254 >UniRef50_A6C2Q8 Type II secretion system protein E n=3 Tax=Planctomycetaceae RepID=A6C2Q8_9PLAN Length = 582 Score = 223 bits (568), Expect = 2e-56, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 98/277 (35%), Gaps = 23/277 (8%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVEG---LRLGGSMLMQGLISAEQLAQ 535 + + LG +L+ + IT EQL++AL + +G LG ++ + +Q+ + Sbjct: 2 NTNSLLQPKMRLGDLLVYKEYITLEQLESALEAQSQGDGSQLLGELLVNNEYCTEDQVLE 61 Query: 536 ALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLA 595 LA + + + +D+ S + +P + VLPL + L V + + + Sbjct: 62 CLALEYRIPYVQLDSRMFDSKIFDILPRDFVEKHTVLPLFKVRNVLTVAVAEPTNVFLVD 121 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQ 655 L +++ V +I ++ + ++ +A +++ + Sbjct: 122 QLRDLTKTEIQIVAASAREIRRMVQTYMPNTNVFVIDDIIDDANGTNVELIEESIDDIGF 181 Query: 656 YV------PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDR 709 P L I+ S I++ E + L +GV L + Sbjct: 182 DAEFAGQSPIIKLVNYIIYNAVREGASDIHI-----EPTEQQLRVRYRVDGV-----LQQ 231 Query: 710 VLTIQRELQVSM-QSLLLKAGLNTEQVAQLESENEGE 745 L L ++ + + A L+ + +G Sbjct: 232 ALEPPVHLAPAVSSRIKIMASLDISE---RRLPQDGR 265 >UniRef50_A3CR73 PilB-like pili biogenesis ATPase, putative n=2 Tax=Firmicutes RepID=A3CR73_STRSV Length = 560 Score = 223 bits (568), Expect = 2e-56, Method: Composition-based stats. Identities = 35/265 (13%), Positives = 93/265 (35%), Gaps = 21/265 (7%) Query: 488 RPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 L IL++ +IT Q + L++ ++L ++ +G ++ E + + ++ V Sbjct: 1 MALIAILVQFNLITAAQKEEILQDMPQSNMQLERYLISKGYVTEEDMLKVMSYYYRVPHV 60 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLEND------ELIVGSEDGIDPVSLAALTRK 600 ++ + I + ++ VA + ++P+ + +L+V D + ++L + Sbjct: 61 NLSQFVIEKEAVEKVSEKVAKRHGLIPISFTDGEEGEEPKLVVAMADPSNYIALDDVKIV 120 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 V + R I + +Y +G + + E ++ + P Sbjct: 121 SKMAVEPYVTFRDDIEKYIDQYY--SKGEEAQQAATEIEGFNVDEEIVEEDLEIKNAPVV 178 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L I++ S I++ E + +G + + + Sbjct: 179 RLIDSIISQAIKTRTSDIHI-----EPFEKVVRVRFRVDGTLVENMQLKA----NAHSAI 229 Query: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 + + +GL+ + +G Sbjct: 230 ATRIKIMSGLDIAE---RRIPQDGR 251 >UniRef50_A1TUR3 General secretory pathway protein E n=5 Tax=Proteobacteria RepID=A1TUR3_ACIAC Length = 578 Score = 222 bits (567), Expect = 3e-56, Method: Composition-based stats. Identities = 50/280 (17%), Positives = 90/280 (32%), Gaps = 17/280 (6%) Query: 468 VAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQG 526 P+ LG++L+++ ++ L+ AL + G LG ++ G Sbjct: 2 TTTILPDRTEPTADAVAVQQPLLGELLVQSGKLSARDLERALSAQQEMGGLLGRVLVRLG 61 Query: 527 LISAEQLAQALAEQNGVAWESIDAWQ-IPSSLIAEMPASVALHYAVLPLRLENDELIVGS 585 L+S + QAL+ Q G+ S + + + + +P +V PL +E+ L V Sbjct: 62 LVSETDVIQALSRQLGIPLISANDFPDLMPEVEGLLP-EFLQANSVYPLSVEDGRLHVAM 120 Query: 586 EDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLT 645 D + AL G V + L I L + + Sbjct: 121 AVPQDAFVVKALHLATGLSVVPRLALESDIEKALAE--PVEQAGEEEGDDGFGDGADGGD 178 Query: 646 EQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQE 705 + + P L I+ + + S I++ E L +GVI Sbjct: 179 FVEHLKDLASEAPVIRLVNAIIGRVIDLRASDIHL-----EPFDDGLHVRYRVDGVIQLG 233 Query: 706 TLDRVLTIQRELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 L R + L A L+ + +G Sbjct: 234 ELV----PPRLSAAVSSRVKLLAHLDIAE---RRLPQDGR 266 >UniRef50_C8WRT0 Glycosyl transferase family 2 n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WRT0_ALIAD Length = 417 Score = 222 bits (567), Expect = 3e-56, Method: Composition-based stats. Identities = 76/452 (16%), Positives = 138/452 (30%), Gaps = 45/452 (9%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIG 85 + D V V + LSVY + R R + +K AI++PA NE VIG Sbjct: 6 LLWDTFYDALKLVTGLVALYQIVLSVYGIWHR--RRPITHAPQKRFAIIIPAHNEECVIG 63 Query: 86 NMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCL 144 + + T Y + V T A V K + Sbjct: 64 PLLDSLKRQTYPAHLYDVHVIADNCTDGTAERARAHGA-----IVHVRENRAEQGKGYAI 118 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYL-VERKDLIQIPVYPFEREW 203 +L + + + ++ DA++++ P L + N +IQ + Sbjct: 119 EWMLARLKEM----GARYDAIVMFDADNLVHPDFLAIMNDHLCSGDRVIQGYLDTKNPFD 174 Query: 204 THFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQS 263 + + S++ + + R L G G C + + + Sbjct: 175 S-WISVSLAISYWFDNRLWQYARARLHLPCTLGGTGLCIDYPLLQEM-------GWKATG 226 Query: 264 LTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTA 323 LTED + G R +G+ ++ V + K P +F+ + Sbjct: 227 LTEDLEFGIRCVRRGIIPVWAHDARVYDEK-----------------------PTSFAAS 263 Query: 324 VRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAY 383 RQ+ RW G + L AI F +ML+ +++L Sbjct: 264 FRQRLRWQQGHFQCAREHLVPMFLEGLRERNLAKIDMAIYLFQPMRSMLLFAGAMIVLGL 323 Query: 384 ESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG-LLSVLRLF 442 L PD S + + +N L + + ++ + LL Sbjct: 324 HYLSPDPTDAASNPAALMVTNLWVAVNVILFLEVPLALLLERVNWRAYFALPLLPFFLWT 383 Query: 443 WGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 WG + R+ + R + D+ Sbjct: 384 WGPVTLQAYFTRSNRTWYHTVHKRAIRLDELR 415 >UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID=B7I546_ACIB5 Length = 416 Score = 222 bits (567), Expect = 3e-56, Method: Composition-based stats. Identities = 71/461 (15%), Positives = 139/461 (30%), Gaps = 55/461 (11%) Query: 25 IMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 I+F+ G +I +W + + + L++++PA+NE VI Sbjct: 6 ILFLFGFLGIWIPQAFWAWLSYQAWKYSKTAEKELQNLPIPERWPVLSVLIPAYNEGVVI 65 Query: 85 GNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARP-GPTSKAD 142 + +A E+Y + + + +T + + +P + V + G K+ Sbjct: 66 EDTLHAIAQQDYPAESYEVLLINDGSKDNTLEIAENLAKIYPCIKIVNVPKGMGGKGKSR 125 Query: 143 CLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY---LVERKDLIQIPVYPF 199 LNN L +++DA+ P +RL ++ + V Sbjct: 126 TLNNGLPHAK---------GELIVVYDADSTPEPDCVRLLAQTLLADKKLVAVNGKVRTR 176 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 + + T EF R G R A+ L F Sbjct: 177 NWQDSILTRFI-AIEFIFFQWIFQGGRWQRFELSTLMGTNYVIWRDALETL------GGF 229 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDT 319 D +SL +D ++ FR+ +V + + + P + Sbjct: 230 DEKSLVDDTEMSFRIFIGQKRIKWVPYAIGWQQD-----------------------PPS 266 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLL 379 S V+Q+SRW G + K +L + + + I ++ F+ L + L Sbjct: 267 LSVFVKQRSRWTQGNFYVTRKYL--PVALRTPFPIGIEILNNIMCYILFVPALFWSHITL 324 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 L L L F+ L L++ + + Sbjct: 325 TL--GLLDIAGISVLGPFTLLWGLSFCLYVAQMWFTLSL-------EKVKPELYFYSVLS 375 Query: 440 RLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 + + + F+ A + + W KT Sbjct: 376 YVSYSQIFLFIVFKAAFDMLKNKIQGNSLQWYKTERSKEKK 416 >UniRef50_Q1D9E1 General secretory pathway protein E n=17 Tax=Proteobacteria RepID=Q1D9E1_MYXXD Length = 605 Score = 222 bits (566), Expect = 4e-56, Method: Composition-based stats. Identities = 55/255 (21%), Positives = 99/255 (38%), Gaps = 23/255 (9%) Query: 500 ITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAW-ESIDAWQIPSSL 557 +TEE+L AL + E G R+G +++ +S E +A+AL Q + + I A ++ + L Sbjct: 51 LTEEKLQEALAIQDEKGQRIGEALVGMKAVSEEDVAKALGHQLDLPYLARIFAEEVDAEL 110 Query: 558 IAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVT 617 + +P + A +LPL LE D + V D +D +L + +G+ V I L I Sbjct: 111 VKRIPINFAKQSRILPLSLEGDTVAVAVADPLDTAALDHVRVLLGQSVSQRIALGSTITD 170 Query: 618 GLRHWYAR------RRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLG 671 + Y R + + +A+ H+ + + P L +L Sbjct: 171 AINSVYDRSVNETEQLVDEMETQDLDAIAHELDEPKDLLDEDD-EAPVIRLVNSVLFRAA 229 Query: 672 HINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQ-VSMQSLLLKAGL 730 S I++ E L +GV L V+ + Q + + + L Sbjct: 230 KERASDIHI-----EPMERELLVRFRVDGV-----LQEVIKPPKRYQNAIVSRVKVMGQL 279 Query: 731 NTEQVAQLESENEGE 745 N + +G Sbjct: 280 NIAE---KRLPQDGR 291 >UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobacteria RepID=A6Q1D6_NITSB Length = 438 Score = 222 bits (566), Expect = 4e-56, Method: Composition-based stats. Identities = 61/431 (14%), Positives = 124/431 (28%), Gaps = 55/431 (12%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 +L F + + I + + + F +V + + S++R R + Sbjct: 21 ILYSFYQYYILFNSMFIKIGYGIILG-----FTSLVIFRYMLLLFFSIFRTIQRSAEETY 75 Query: 64 YKPD---EKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 ++++VPA+NE I + T +Y N I V + +T + Sbjct: 76 QIDPKQRYPKVSVIVPAYNEAKTIATSI-SSLLTQNYPNLEIIVVDDGSSDETYFKAKQF 134 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 ++ R K+ +N ++ + DA+ IS + Sbjct: 135 -EHNEFCKEIRVFRKKNEGKSKAINYGIE---------RSTGELIFVMDADSKISQNAIL 184 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 L E D+ + + + + E+ E + L G Sbjct: 185 LLARHFEDPDIAAVAGSVYVSNQVNLITKLQALEYIEGLNMVRNGQAFLKAVNIIPGPVG 244 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F + A+ + +D + ED D+ +L KG F V Sbjct: 245 MFRKNAL------YNVGLYDHDTFAEDCDVTLKLIAKGYKIDFEPEAVAYT--------- 289 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSL-----TLNYFLW 355 P+ ++Q+ RW GI+ K + + Sbjct: 290 --------------EAPENLLDLIKQRYRWTRGILQAIRKHRNLLWNFKENTTASMVMWY 335 Query: 356 RDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN--FGL 413 + F + ++ + W +I + L +L GL Sbjct: 336 MLFESLFWPFADIWVNIFVLYWAVTSGMSIFIFYWWSIFTILDVAGALYCILLTGEKLGL 395 Query: 414 MVNRIVQRVIF 424 + + R+ F Sbjct: 396 VFYAVYYRIFF 406 >UniRef50_B0T1N0 Putative uncharacterized protein n=1 Tax=Caulobacter sp. K31 RepID=B0T1N0_CAUSK Length = 492 Score = 222 bits (566), Expect = 4e-56, Method: Composition-based stats. Identities = 94/481 (19%), Positives = 147/481 (30%), Gaps = 69/481 (14%) Query: 13 YGLKVIAITLAVIMFIS------GLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKP 66 GL V A+T+ V + I F IK + R P ++ L Sbjct: 56 VGLAVFALTVIVAVIIEPRTTMEAFHLLFFVGFMANSMIKLAAACTPRRPGVAPS-LPDE 114 Query: 67 DEKPLAIMVPAWNETGVIGNMA-ELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFP 125 D ++VP + E V + LA + + + ND +TQ + Sbjct: 115 DLPGYTLIVPLYREASVAAELVLNLARLDYPRDRLQVLIVLEANDHETQAAFAAL-DLPV 173 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYL 185 ++ P +K N L+ +++DAED P +LR Sbjct: 174 GFQVLIAPPGTPQTKPRACNIALERA---------HGEMVVIYDAEDAPHPAQLREAAAG 224 Query: 186 VE----RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 R +Q P+ F + E++ L +P P G Sbjct: 225 FAAGDRRLACLQAPLRIEPDPR--FLPDQFALEYAVLFEVFLPALARWRLPFPLGGTSNH 282 Query: 242 FSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFL 301 F AV A+ +D ++TED DIGFRL +G + P Sbjct: 283 FRTEAVRAV------GGWDSYNVTEDADIGFRLAARGYQLDVITCP-------------- 322 Query: 302 QHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA 361 E P T T + Q++RWI G V + A Sbjct: 323 ----------TFETAPTTMKTWIPQRARWIKGHVQTLAVLARGPIVRDPPGLAALVLTLA 372 Query: 362 ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 +S S L ++ L+L L L+ W + + +R Sbjct: 373 LSVASSHLHGPLLAWLVLSWLGSMLDLCPPVPA----MDWMLVYFGWTCAAIAGAQAQRR 428 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 G Q L +L + + A +AL Q + WDKT H +V Sbjct: 429 A-------GHRQRPLPLLGAVFYWPLQSFAATKALWQFVV----APFHWDKTPHTPRTVN 477 Query: 482 G 482 Sbjct: 478 P 478 >UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=Nostocaceae RepID=UPI0001C1680B Length = 467 Score = 222 bits (565), Expect = 5e-56, Method: Composition-based stats. Identities = 70/475 (14%), Positives = 138/475 (29%), Gaps = 71/475 (14%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRY----PRMSYRELYKPDEKPLAI 73 + L ++ + ++ + L V+ + + + D +++ Sbjct: 52 AILVLTMVWGGTIALHLVSWGFAFILGLTTILGVHALRIILVRPRHHHKQIQGDLPSVSV 111 Query: 74 MVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 +V A NE VI + + Y +++ + T + ++ + ++ Sbjct: 112 LVSAKNEQAVIDRLVHNLCSLEYPHGEYEVWLIDDHSTDKTPEILAQLQQDYKQLNVFRR 171 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVER--KD 190 K+ LN VL + DA+ +SP L +R Sbjct: 172 DANATGGKSGALNQVLP---------MTKGEIIAVFDADAQVSPDLLLQVIPTFQREKVG 222 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 +Q+ + +T + L R A+ G G G R A+ Sbjct: 223 AVQVRKAIANAKENFWTRGQMAE--MALDTWFQQQRTAIGGLGELRGNGQFVRREAL--- 277 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 D ++ +++T+D D+ RL G + +P Sbjct: 278 ---NDCGGWNEETITDDLDLTIRLNLTGWDIECMFYP----------------------- 311 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 V E Q++RW G + + D + Sbjct: 312 PVLEEGVTNVVALWHQRNRWAEGGYQRYLDYWDLILKGRMRAGKTVD---------LLIF 362 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 ML+M + + L H I A + L + + ++R Sbjct: 363 MLIMYIIPTAAVPDLLMSLIRHRPPIL---APITGLSVTMSFIGMFSGLRRTRQDQKNSN 419 Query: 431 LTQGLL-----SVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 LL SV L W +++ +L+ P+R+ W KT H Sbjct: 420 YLMLLLQTIRGSVYMLHWLVVMSSTTARVSLR-------PKRLKWVKTVHTGSQH 467 >UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID=A4CJ64_9FLAO Length = 494 Score = 222 bits (565), Expect = 5e-56, Method: Composition-based stats. Identities = 65/445 (14%), Positives = 128/445 (28%), Gaps = 49/445 (11%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVP 76 IA + I ++ L FF + + R + L + + I +P Sbjct: 4 TIAYFIIAIYSLALLLIFFYSLSQLNLLLNYLGFKRRNKEAPKFNLLDPKEIPYVTIQLP 63 Query: 77 AWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPD---TQRDVDEVCARFPNVHKVVC 132 +NE V+ + E A I V D T ++E+ + ++ + Sbjct: 64 IYNEEYVVERLLENIARIEYPKSKLEIQVLDDSTDDSVEQTAAMIEELQKQGLDIQHIRR 123 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR--LFNYLVERKD 190 KA L L + DA+ + L+ + + E Sbjct: 124 ENREG-FKAGALKEGLKIAK---------GDFIAIFDADFLPDADWLKKTVIYFKDEEIG 173 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 ++Q R+++ T + R + + G + + + Sbjct: 174 VVQTRWGHINRDYSTLT-KIQAFALDAHFTLEQVGRNSKGHFINFNGTAGIWRKECIL-- 230 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 D ++ +LTED D+ +R + K ++ + Sbjct: 231 ----DAGNWEGDTLTEDLDLSYRAQLKNWKFKYLED-----------------------V 263 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 P S A Q+ RW G F+ W N G + S + Sbjct: 264 ETPAELPVVISAARSQQFRWNKGGAE-NFRKTVWNVVKAKNIPFKTKFHGVMHLLNSSMF 322 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMT--LLWLNFGLMVNRIVQRVIFVTGY 428 + V I LL + + H IF +++ + ++ + +Q F Sbjct: 323 LCVFIVALLSIPMLYIKNTFGHLDWIFEVTSFFIVSTIILFVCYWFTYKSIQGSSFDNFV 382 Query: 429 YGLTQGLLSVLRLFWGNLINFMANW 453 + +L N +A Sbjct: 383 DYIKLFFTFFSVALGFSLHNTVAVL 407 >UniRef50_B0VJ41 Type IV pilus biogenesis protein PilB n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJ41_9BACT Length = 569 Score = 221 bits (563), Expect = 8e-56, Method: Composition-based stats. Identities = 54/271 (19%), Positives = 101/271 (37%), Gaps = 21/271 (7%) Query: 483 DTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQN 541 LG IL+ ITEEQL AL + GL+LG +++ G ++ +L +AL +Q Sbjct: 3 YNPQFARLGDILVHEGYITEEQLKDALLKQGNFGLKLGETLIKLGYLTENELLEALHKQL 62 Query: 542 GVAWE-SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 G + + ++++ +P A VL LR E D ++V D + + +L + Sbjct: 63 GYDVVQDKELMDLDINIVSSIPEPYAKENKVLALREEGDGVVVAMTDPENLIVSDSLEKI 122 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYN-----AVQHQWLTEQQAGEIWRQ 655 +G+ ++ V++ + + +Y R AV A Sbjct: 123 LGKNIKPVLIGNSSLQDAIEKYYKSIRTTTEVEDAVGGFEFVAVDEDENEITIAAATEDV 182 Query: 656 YVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQR 715 P + I+ + I++ E + +G L V+T Sbjct: 183 DAPVVKMINLIINEAIKAGATDIHI-----EPLTKISRIRYRVDG-----ALREVMTPPI 232 Query: 716 ELQVSMQSL-LLKAGLNTEQVAQLESENEGE 745 + S+ SL + + LN + +G Sbjct: 233 GMHPSLISLVKVMSKLNIAE---RRLPQDGH 260 >UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IL87_ACIBL Length = 422 Score = 221 bits (563), Expect = 8e-56, Method: Composition-based stats. Identities = 66/448 (14%), Positives = 131/448 (29%), Gaps = 50/448 (11%) Query: 27 FISGLDDFFIDVVY--WVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 + + +Y + + LS++ R + ++I+VPA+ E I Sbjct: 19 TVVYVYALRFYGLYPILMSWVWISLSLFFRRRQEDTEMEMSGPAPMVSILVPAFAEAETI 78 Query: 85 GNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCL 144 + E A LDY NY + + + +T V + P + + + KA L Sbjct: 79 DDTIE-ALLKLDYPNYEVILVNDCSPDNTAEVVRQYLD-DPRIRLL--NKQVNEGKAMAL 134 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWT 204 N+ L ++ DA+ ++S L + + P R Sbjct: 135 NDALPMC---------RGEILVVIDADIIVSRDLLNYMVPHFAGTRVAAVTGNPRVRNRV 185 Query: 205 HFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSL 264 EFS + + L + +G R A+ L F Sbjct: 186 SILQHLQAVEFSSIVSMQRRAQRVLGRVLTVSGAVFAVRRSALLEL------GGFTPHMA 239 Query: 265 TEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAV 324 TED D+ +RL+ K + VV P + Sbjct: 240 TEDIDLTWRLQMKFWDVRYEPRAVVWMQ-----------------------VPLSLRELW 276 Query: 325 RQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYE 384 +Q+ RW G+V + + ++ + + S L V + + Sbjct: 277 KQRKRWARGLVQVLKRHREVPTNWKMRRMW----PIFYESIFSILWSYVFVLMTSYWLIS 332 Query: 385 SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWG 444 A +S F +M L + V R + + + Sbjct: 333 LAVGYAPRGVSPFPNFWGMMIATTCLLQLFIGAWVDRQY--DPGIMWSFPEAVFYPVIYW 390 Query: 445 NLINFMANWRALKQVLQHGDPRRVAWDK 472 L+ + ++ + + + + + Sbjct: 391 MLMALITSFYTIPALFKKPPRVQTWRIR 418 >UniRef50_A1TEN6 Glycosyl transferase, family 2 n=7 Tax=Actinomycetales RepID=A1TEN6_MYCVP Length = 461 Score = 221 bits (563), Expect = 9e-56, Method: Composition-based stats. Identities = 82/471 (17%), Positives = 150/471 (31%), Gaps = 56/471 (11%) Query: 13 YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLA 72 + L ++ +++++FI + + W + +RR K + Sbjct: 41 WLLYIVMTVISLLLFIVAATTLWWMLHAWRSPESLHSTGFRRR--------SAGRPKGFS 92 Query: 73 IMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVC 132 +++PA +E V+G+ + A LD+ Y + V +DP+T+ AR P + +VV Sbjct: 93 LLLPARHEQDVLGDTID-ALARLDHPLYEVIVIIGHDDPETEHVARAAAARHPRIVRVVI 151 Query: 133 ARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE--RKD 190 P +K LN L + DAED + P LRL E R D Sbjct: 152 DTNIPKNKPKALNTALPTC---------RGEIVGVFDAEDEVHPRLLRLVEARFEEARAD 202 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 ++Q V + + + S+ E+ + A +P G + ++ Sbjct: 203 VVQSGVQLMNIQTSWW-SLRNCLEYYFWFRSRLHF-HADQRFIPLGGNTVFARTALLRSV 260 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 +D L ED +IG RL +G P + Sbjct: 261 ------GGWDRDCLAEDCEIGVRLSTRGARVAVAYDP---------------------KV 293 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 RE P + V+Q++RW G + K L R ++ A Sbjct: 294 VTREETPGSLRALVKQRTRWDQGFMQVYRKGEWRKLPSRRQRMLARYT---LAMPFLQAA 350 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG 430 ++ + + + P LS + L+T+ L F Sbjct: 351 TGALVPIAIACMFVLKVPVPLTLLSFLPLAPTLVTVAVEAAALGEFG----KEFGIRIRL 406 Query: 431 LTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVT 481 Q L + + L+ A +++ G + Sbjct: 407 WDQVRLVLGAFPYQLLLAAAAVRSVWRELRGQGGWEKTEHVNAHRAGGREE 457 >UniRef50_D2R473 Type II secretion system protein E n=6 Tax=Planctomycetaceae RepID=D2R473_9PLAN Length = 572 Score = 221 bits (563), Expect = 9e-56, Method: Composition-based stats. Identities = 56/275 (20%), Positives = 103/275 (37%), Gaps = 27/275 (9%) Query: 485 RSLRPLGQILLENQVITEEQLDTALR--NRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 ++R +GQI ++ I+++QL+ L + G LG L++ EQL QALAEQ G Sbjct: 1 MAIRRIGQIFVDMGFISDDQLEMLLEEQQQRPGTLLGKLAQEMSLVNEEQLVQALAEQMG 60 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + + IP ++ ++ S+A Y V+P++ ++EL V + D + L +G Sbjct: 61 MQVVELGDITIPGDVLHKVTESMAQLYRVIPIKFSSNELTVATCDPQNITIQDELRSMLG 120 Query: 603 RKVRYVIVLRGQIVTGLRHWY-----------ARRRGHDPRAMLYNAVQHQWLTEQQAGE 651 +R VI I L ++ +AV + + E Sbjct: 121 YDIRVVIASETDIKKTLDRYFSSDKDTVDSIVGELEADSELKKAMDAVAKNGAVDLTSVE 180 Query: 652 IWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 P + L +L + S I+ E + +GV+ ++ Sbjct: 181 ALADSAPVRKLLNMVLLLAIKDHASDIH-----FEPFEDEFRIRIKADGVLF-----EMV 230 Query: 712 TIQRELQ-VSMQSLLLKAGLNTEQVAQLESENEGE 745 R L + + A L+ + +G Sbjct: 231 PPPRHLAFAITTRIKVMANLDIAE---RRMPQDGR 262 >UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SD50_9ACTO Length = 776 Score = 221 bits (563), Expect = 9e-56, Method: Composition-based stats. Identities = 69/464 (14%), Positives = 126/464 (27%), Gaps = 65/464 (14%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 W V + + L V + G+ ++ +V + RR Sbjct: 355 WRGHVLVGAVRVADGMFGMLGVFFVLVGVLTVGRTLLLFVMAPRHAARRRRRDWSWGP-- 412 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 P +P++++VPA+NE I A + V + T V + Sbjct: 413 ---PVTEPVSVIVPAYNEREGIAAAVRSLALGDHPGGIEVVVVDDGSTDGTADIVAAL-- 467 Query: 123 RFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF 182 R PNV V K LN + ++ D + + P +R Sbjct: 468 RLPNVRVVRKPNG---GKPSALNTGVALA---------RHDLIVMVDGDTIFEPDSVRRL 515 Query: 183 --NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 + ++ V R + E+ D + E L G Sbjct: 516 VQPFADPGVGVVAGNVKVGNRR--GLIAKWQHIEYVIGFNLDRRLYETLRCMPTVPGAIG 573 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 F R+A+ + +L ED D+ L G ++ Sbjct: 574 AFRRQALEQV------GGMTDDTLAEDTDVTIALGRAGWHIVYEESARAWT--------- 618 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P T +Q+ RW G + +K + + R Sbjct: 619 --------------EAPTTVGQLWKQRYRWSYGTLQAMWKHRRSVIDSGRSGRFGRRCLS 664 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 ++ F V++ L + + AWL L ++ Sbjct: 665 FLTLF------GVLLPLAAPVIDLLAIYGLIFLDRSDTVVAWLAMLALQFLTAVLA---- 714 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGD 464 F L + L+ F + ++ +A+ L G Sbjct: 715 ---FRLDREKLGVLWVLPLQQFVYRQVMYLVLLQAVGTALTGGR 755 >UniRef50_C8WGD9 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WGD9_EGGLE Length = 444 Score = 221 bits (562), Expect = 1e-55, Method: Composition-based stats. Identities = 75/479 (15%), Positives = 136/479 (28%), Gaps = 74/479 (15%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 +LD F + + + + + + I + V R +EL Sbjct: 1 MLDQFFSQISFVDIFNFCVFLTFTICYTYQLYYVFVVLTR---------------KPKEL 45 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDPDTQRDVDEVCA 122 A ++ A NE+ VIG++ E +FV DT R E A Sbjct: 46 TAKKNHKFAAVISARNESAVIGDLIHSIKVQNYPSELIDVFVIADNCTDDTARVAREAGA 105 Query: 123 RFPNVHKVVCARPGPT--SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 +V R K L+ I ER A+ + + + DA++V+ R Sbjct: 106 -------IVFPRSNDKEVGKGYALDYGFQCIR--ERYADKGYEAYFVFDADNVLDVNYFR 156 Query: 181 LFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 N + + +++ S Y F R L +G G Sbjct: 157 EMNKTFDNGAKASTSYRNSKNYDSNWISAGYAVWFLREAKFLNQARLTLNTSCAVSGTGF 216 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 + + + LTED + +G + ++ + + Sbjct: 217 FIAADIIEK------NGGWKWHLLTEDIEFSANSILEGTRISYTPTAILYDEQ------- 263 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSS----------LTL 350 P TF + Q+ RW G + + Sbjct: 264 ----------------PITFRDSWNQRFRWAKGFYQVFWHYGARLAKGIAVNPKGARFAC 307 Query: 351 NYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN 410 L G + VS L +++ L L A + A SI LN Sbjct: 308 YDMLMTIAPGMLLTIVSVLFNAIIVFLSLTGAMSTGIMVASSLSSIL--------FCLLN 359 Query: 411 FGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVA 469 + + + FV + VL +F + AL +++ + + + Sbjct: 360 YFIFMFMFGVLTTFVEWDSIRSTTGKKVLYMFTFPVFMMTYIPIALVALVKKCNWKPIK 418 >UniRef50_C6XAP3 Type II secretion system protein E n=1 Tax=Methylovorus sp. SIP3-4 RepID=C6XAP3_METSD Length = 816 Score = 221 bits (562), Expect = 1e-55, Method: Composition-based stats. Identities = 50/297 (16%), Positives = 110/297 (37%), Gaps = 38/297 (12%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQL 533 ++R + LG+ L + ++I+E+QL AL + E G+ LG ++ G++ + L Sbjct: 204 QKAIRHQESRPILKLGEALRQLELISEDQLQHALNKQKENRGIPLGRILVDMGIVDEQTL 263 Query: 534 AQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVS 593 LA++ G+ + S+ + + I + A VA + ++PL + L+V ED ++ + Sbjct: 264 KGTLAKKLGIPYVSLSKFNFDPNAIRLIGAPVARKHLLIPLCMYEGALVVAFEDPMNVKA 323 Query: 594 LAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHD-----------------PRAMLY 636 + + K + R IVT + +Y R + M Sbjct: 324 IDEVRFLTQMKTLPAMASREDIVTAIDSFYGRSGAFEFSKSKDDMLDFDLKSSNVAGMQI 383 Query: 637 NAVQHQWLTEQQAGEIWRQYVPH-------QFLFAEILTTLGHINRSAINVLLLRHERSS 689 + + + +E+ + + P L +++ S I++ Sbjct: 384 DDLATKLFSEENSMQFESAEEPVAESDNTLVQLVNKMILDAYQDGVSDIHIETY---PDR 440 Query: 690 LPLGKFLVTEGVISQETLDRVLTIQ-RELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 +G TL + L I + + + + L+ + +G+ Sbjct: 441 RNTQVRFRKDG-----TLVQYLEIPSNFRNALISRIKIMSQLDISE---RRKPQDGK 489 >UniRef50_Q02IY3 Probable glucosyl transferase n=6 Tax=Proteobacteria RepID=Q02IY3_PSEAB Length = 869 Score = 220 bits (561), Expect = 1e-55, Method: Composition-based stats. Identities = 80/496 (16%), Positives = 149/496 (30%), Gaps = 48/496 (9%) Query: 3 WLL-DVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR 61 W+ D + + L + G I + + + +R Sbjct: 358 WIAYDYSQQYSTWFSLTVGALLGV----GALGVVIVLFTEAHELAEAVWTRKRRRPFLPI 413 Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPN-DPDTQRDVDEV 120 + ++I VP +NE + A LDY +Y + V DP + V+ Sbjct: 414 TAAQAYRPKVSIHVPCYNEPPELLKQTLDALARLDYPDYEVLVIDNNTRDPAVWQPVEAH 473 Query: 121 CARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR 180 CAR + P KA LN L + DA+ + P LR Sbjct: 474 CARLGERFRFFHVAPLEGFKAGALNFALGHVAADVEVVAV-------IDADYCVDPDWLR 526 Query: 181 LF--NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGV 238 ++ R ++Q P ++ + F + Y E+ + R + G Sbjct: 527 HMVPHFGDPRIAVVQSPQDYRDQHESAFKRLCYA-EYKGFFHIGMVTRNDRDAIIE-HGT 584 Query: 239 GTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQR 298 T R + L + +TED ++G R+ EKG++ + Sbjct: 585 MTMIRRSVLDELR-------WPEWCITEDAELGLRVFEKGLSAAYFERSYG--------- 628 Query: 299 KFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR 358 + PDTF +Q+ RW G + + R Sbjct: 629 --------------KGVMPDTFIDFKKQRFRWAYGAIQIMKRHTDALLRGRGPDG-SRLT 673 Query: 359 KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI 418 +G +FV+ + L + +L A + L+ L L ++ Sbjct: 674 RGQRYHFVAGWLPWIADGLNIFFTLGALLWSAAMIIVPKRVDPPLLIFAILPLALFAFKV 733 Query: 419 VQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFP 478 + + G+ L +L + +A V + R +++H Sbjct: 734 GKILFLYRRTVGVDLRDSFFAALAGLSLSHTIAKAVLYGFVTRGIPFFRTPKMRSSHGLL 793 Query: 479 SVTGDTRSLRPLGQIL 494 + R + +L Sbjct: 794 VALAEAREEVFVMLLL 809 >UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8II97_METNO Length = 1120 Score = 220 bits (561), Expect = 1e-55, Method: Composition-based stats. Identities = 68/430 (15%), Positives = 126/430 (29%), Gaps = 48/430 (11%) Query: 37 DVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP-LAIMVPAWNETGVIGNMAELAATTL 95 V+ R + R S R+ + +A++VPA+NE VI + + Sbjct: 712 TVLAIFRLTLIIIGATAHGLRGSRRDPPEGWRPRGIAVLVPAYNEEIVILKTIQTLLAST 771 Query: 96 DYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFE 155 E I V + +T V FPN V KA LN L Sbjct: 772 IAEQIEIIVIDDGSTDNTAAVVRTA---FPNTAAVQIYTKANGGKAAALNYGLQ------ 822 Query: 156 RSANFAFAGFILHDAEDVISPMELRLFN--YLVERKDLIQIPVYPFEREWTHFTSMTYID 213 + + D + V+ P + + + + V R+ + Sbjct: 823 ---KTSTEIIVAIDGDTVLLPDAIEHLARHFADPKIGAVAGTVSVGNRK--TLIARFQAL 877 Query: 214 EFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFR 273 E++ D + + G + R A+ A+ + +L ED D+ Sbjct: 878 EYTMSQNLDRRAFQLINAIGVVPGAIGAWRREALMAV------GGYSSDTLAEDADLTIS 931 Query: 274 LKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIG 333 L+ G + P+ ++Q+ RW+ G Sbjct: 932 LELAGWKVVCEPRARALT-----------------------EAPERLRAFLKQRFRWMFG 968 Query: 334 IVFQGFKTHKW-TSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWH 392 + +K + + LA L+ + L+ + + Sbjct: 969 TLQVAYKHAPASLRRPRGISLILIPNVLLFQFLFTLLAPLMDLILIFSVVSSVVDITLIG 1028 Query: 393 FLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYG-LTQGLLSVLRLFWGNLINFMA 451 S G+ L+ WL F + + + G L VL+ F + ++ Sbjct: 1029 SRSEGYGTLELLMAYWLVFQVFDFLAGCAALLLHGKSPEWRLLPLLVLQRFCYRQLLYIT 1088 Query: 452 NWRALKQVLQ 461 R L L+ Sbjct: 1089 AIRTLLTALR 1098 >UniRef50_A3XKW0 Glycosyltransferase related protein n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKW0_9FLAO Length = 631 Score = 220 bits (561), Expect = 1e-55, Method: Composition-based stats. Identities = 73/471 (15%), Positives = 145/471 (30%), Gaps = 76/471 (16%) Query: 40 YWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYE 98 + + + K ++ + ++ + I +P + E+ VI +A E Sbjct: 213 FLLTVVGAKSETIQKISQTELSKINADELPYYTIQLPVFKESEVIYKLASNLQNLDYPKE 272 Query: 99 NYHIFVGTYPNDPDTQRDVDEVCARFPN-VHKVVCARPGPTSKADCLNNVLDAITQFERS 157 + + +D T V + +FP V+ P +K N L Sbjct: 273 KLDVKLLIESDDEVTFNAVKNL--KFPCIFDPVIIPYAQPKTKPKACNYGLHF------- 323 Query: 158 ANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQIPVYPFEREWTHFTSMTYID 213 ++DAED+ +L++ + L E +IQ + F + + T M + Sbjct: 324 --SKGKYLTIYDAEDIPDSDQLKMVHALFSKLPEEYIVIQCALNYFNKTENYLTRM-FTL 380 Query: 214 EFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFR 273 E+S +P + L +P G F + L +D ++TED D+G R Sbjct: 381 EYSYWFDYMLPGLDGLKVPIPLGGTSNHFKFDRLIEL------GGWDGFNVTEDADLGIR 434 Query: 274 LKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIG 333 KG + E + F +RQ+SRWI G Sbjct: 435 AYAKGYKVTVLNST------------------------TYEEANNAFYNWIRQRSRWIKG 470 Query: 334 IVFQGFKTHKWTSSLTLNYFLWRD-RKGAISNFVSFLAMLVMIQLLLLLAYE-------- 384 + + + L+R+ F F+ L + Sbjct: 471 YMQ------TYLVHMRNPSKLYREVGLNGFLGFQFFIGGTFFTFLAYPILLLLFLFYIFL 524 Query: 385 -------SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLS 437 + + F ++ F M ++ I + + L Sbjct: 525 TLDISSYVVGSLNTEIIDFFKLIFPEWVIIISVFNFMAGNLLMIYINMIAVFRRKSYSLI 584 Query: 438 VLRL--FWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRS 486 + + L++ ++ ++ L Q++ + W+KT H Sbjct: 585 LYAITNPIYWLMHSISAYKGLFQLIS----KPFYWEKTNHGLTKDHKPEIE 631 >UniRef50_C1XWL4 Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB n=4 Tax=Bacteria RepID=C1XWL4_9DEIN Length = 888 Score = 220 bits (560), Expect = 2e-55, Method: Composition-based stats. Identities = 49/277 (17%), Positives = 99/277 (35%), Gaps = 18/277 (6%) Query: 471 DKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGL-RLGGSMLMQGLIS 529 + + SV RPLG++L+E + E ++ +L+ + +G RL +++ G I Sbjct: 320 REALQEALSVQRRLGKTRPLGEVLVELGYVKPEDIEESLQKQRQGGGRLEDTLIQSGKIK 379 Query: 530 AEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI 589 E LA++LA Q G + S++ +P + Y V P +EN L+V +D Sbjct: 380 PEMLARSLAAQLGYPYIDPLEQPPDPSVMMMVPEATVRRYHVFPHHMENGTLVVLMKDPR 439 Query: 590 DPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQA 649 + ++ L R++ + I + Y D L + + E++ Sbjct: 440 NIFAIDDLKMITKREILPAVSTETAINKLIERSYGGGGDLD---ELTKEFEKKKKQEEEV 496 Query: 650 GEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDR 709 L I+ S I++ E + + +G L Sbjct: 497 STSALDDNAVVRLVNNIIREAYLQEASDIHI-----EPRQQEILVRIRVDG-----NLRE 546 Query: 710 VLTIQREL-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + + + A L+ + +G Sbjct: 547 YMKLPKGAGPAIASRVKIMANLDIAE---RRLPQDGR 580 Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats. Identities = 34/138 (24%), Positives = 58/138 (42%), Gaps = 1/138 (0%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 + LG LL+ ++ +E+L A+ E G L + GL+S ++AQA+ E G+ Sbjct: 9 KRLGAALLDMGLLEDEELQKAIERHREIGGSLAEIVAEMGLLSERRVAQAIEEIFGIPLV 68 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + +IPS + +PA A +P + L V + +D + L L G+ + Sbjct: 69 ELSEVEIPSEAKSLIPAEKARDLEAIPFAFDGRLLRVALLNPLDNLVLEELEDLTGQIIE 128 Query: 607 YVIVLRGQIVTGLRHWYA 624 R L Y Sbjct: 129 PYQTTRASFRYALAKHYP 146 Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats. Identities = 50/233 (21%), Positives = 88/233 (37%), Gaps = 14/233 (6%) Query: 489 PLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE- 546 LG +L++ ++ + L AL + G LG ++ +GL+S QL QALAEQ G+ + Sbjct: 165 KLGDLLVKKGWLSPQALQAALAEQEKSGELLGRVLMQKGLVSELQLYQALAEQAGIEFLN 224 Query: 547 ----SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 D + + A + AL Y +P+ +E + V D + R + Sbjct: 225 ELLKEGDLPEPQPEVTALFLRTDALRYQAVPVDMEGKTVRVILADP---RHREEVARLLD 281 Query: 603 RKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQ-QAGEIWRQYVPHQF 661 + ++++ Y + + + + L E + P Sbjct: 282 KPAKFMLATPKVWEGIFNKAYPEKARLGETLVQKGKIDREALQEALSVQRRLGKTRP--- 338 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 E+L LG++ I L + + L L+ G I E L R L Q Sbjct: 339 -LGEVLVELGYVKPEDIEESLQKQRQGGGRLEDTLIQSGKIKPEMLARSLAAQ 390 >UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=11 Tax=Pseudomonadaceae RepID=Q4KHI6_PSEF5 Length = 863 Score = 220 bits (560), Expect = 2e-55, Method: Composition-based stats. Identities = 70/495 (14%), Positives = 143/495 (28%), Gaps = 56/495 (11%) Query: 6 DVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYK 65 D + + L + G FI ++ + + +++R E Sbjct: 362 DYSQQYSTWFSLTVGFLLAL----GALGVFIVLLTEAHELAEAVWIHKRRREFLPVEGDS 417 Query: 66 PDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPN-DPDTQRDVDEVCARF 124 ++I VP +NE + A LDY ++ + + DP V C Sbjct: 418 SYRPKVSIHVPCYNEPPEMVKQTLNALANLDYPDFEVLIIDNNTKDPAVWEPVQAYCETL 477 Query: 125 PNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF-- 182 K P K LN +L + + D++ + L+ Sbjct: 478 GPRFKFFHVAPLAGFKGGALNYLLPHTAKD-------AEVIAVIDSDYCVDRNWLKHMVP 530 Query: 183 NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 ++ + ++Q P ++ + F + Y E+ + R + G T Sbjct: 531 HFADPKIAIVQSPQDYRDQNESTFKKLCYA-EYKGFFHIGMVTRNDRDAIIQ-HGTMTMT 588 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 R + L + + ED ++G R+ EKG++ + Sbjct: 589 RRTVLEEL-------GWADWCICEDAELGLRVFEKGLSAAYHHESYG------------- 628 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI 362 + PDTF +Q+ RW G + + R + Sbjct: 629 ----------KGLMPDTFIDFKKQRFRWAYGAIQIIKRHTASLLRGKNTELTRGQRYHFL 678 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI--VQ 420 + ++ ++A + + ++ W I LL + + V Sbjct: 679 AGWLPWVAD-------GMNIFFTVGALLWSAAMIIVPQRVDPPLLIFAIPPLALFVFKVG 731 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDP-RRVAWDKTTHDFPS 479 ++IF+ + G ++ L P R + H F Sbjct: 732 KIIFLYRRAVGVNLKDAFCAALAGLALSHTIAKAVLYGFFTSSIPFFRTPKNADNHGFWV 791 Query: 480 VTGDTRSLRPLGQIL 494 + R + +L Sbjct: 792 AISEAREELFIMLLL 806 >UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLY4_FIBSS Length = 517 Score = 220 bits (560), Expect = 2e-55, Method: Composition-based stats. Identities = 74/464 (15%), Positives = 144/464 (31%), Gaps = 59/464 (12%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSY-RELYKPDEKP 70 LY + V+ + V + I G ++ +Y + RK + R + Y RE D Sbjct: 6 LYAMFVVYVIAGVGLVIYGFSCYY--SIYLFLKNSRKTRLSDRKAILKYYREHSLADLPQ 63 Query: 71 LAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPD----TQRDVDEVCARFP 125 + +P +NE + + E + + I V + + T++ V E+ AR Sbjct: 64 VTTQLPVFNEANCVERLLEAVCAIDYPKDKHEIQVLDD-STDECYEVTKKKVAELAARGY 122 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMEL-RLFNY 184 ++ + KA L + + DA+ V L + Y Sbjct: 123 DIKLIHRTN-RKDFKAGALKEGMAVAK---------GEFLAIFDADFVPEKDFLLKTVPY 172 Query: 185 --LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCF 242 + + L+Q R + T + + R + G + Sbjct: 173 LVMDPQVGLVQGRWGHLNRTESGLT-LAQSIGIDGHFVIEQSARSWGKLFMNFNGTAGVW 231 Query: 243 SRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQ 302 + A+ G ++ +LTED D+ +R + G FV Sbjct: 232 RKDAI------YGGGGWEGDTLTEDMDLSYRSQLAGWKMKFV------------------ 267 Query: 303 HARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAI 362 + V P+ + Q+ RW G + K + R + GAI Sbjct: 268 -----FDVIVPAELPNDINAFKAQQFRWAKGSIQTAIKILPKVLRSKVP---LRVKIGAI 319 Query: 363 SNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV 422 + + M+ L + + A+ + ++ + ++ V Sbjct: 320 LHTTHYSIHPCMLFTALCAWP---LLAFFEPVGHLPTWAYTVGFAFIFLAAIAPSVLYFV 376 Query: 423 IFVTGYY-GLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDP 465 Y G LLS+ L + ++N RA+ + Sbjct: 377 AQRCSGYTGWKIRLLSLPILMALGVGIAVSNSRAVFAAVLGTKG 420 >UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaproteobacteria RepID=A5GEA8_GEOUR Length = 492 Score = 219 bits (559), Expect = 2e-55, Method: Composition-based stats. Identities = 71/459 (15%), Positives = 132/459 (28%), Gaps = 63/459 (13%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVY--RRYPRMSYRELYKPDE-KPLAIM 74 + + ++ +Y V R+ +Y + R + P+E + + Sbjct: 1 MLSAIIPVLTAIHFAALLGLCLYGVHRLWLIYCLYMPKGSERSTPAPFAAPEEFPSVTVQ 60 Query: 75 VPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCA---RFPNVHKV 130 +P +NE V + + AA E I V +D DT R VD+ A + V Sbjct: 61 LPLYNERFVAERLLDAAAGLDWPRERLEIQVLDD-SDDDTCRLVDQRAAWWRKQGVAITV 119 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN--YLVER 188 V KA L N L + DA+ + P L + + Sbjct: 120 VRRTSRDGYKAGALANGLATA---------HGEYIAVFDADFIPPPDFLHATMPWFRNQD 170 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 ++Q + + FT + + + VR G + R A+ Sbjct: 171 VGMVQTRWSFCNADHSWFTGIQSLL-LGPHFSIEHRVRYRQGLFFNFNGTAGVWRRSAIE 229 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 + + ++TED D+ +R + G ++ Sbjct: 230 S------AGGWQSDTVTEDLDLSYRAQLAGWRFVY-----------------------RE 260 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSF 368 V P T + Q+ RW G + K L + + + Sbjct: 261 ECQVPSELPVTMAALRSQQQRWAKGSIQTARKILPRLLQERLPPAVKIE---------AM 311 Query: 369 LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGY 428 ++ I LL + A + L L L + ++ Sbjct: 312 AHLMANIYWLLGMIVMLTLYPAVTWRVGIGLHQVLRIDLPLFLATSGAIMSYFLL----- 366 Query: 429 YGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRR 467 Y + G S+LR + ++ + G R Sbjct: 367 YSIRSGSKSLLRHVVLLPALTIGLAPSISLSVLKGLFRP 405 >UniRef50_Q1J1R8 Tfp pilus assembly pathway, ATPase PilB n=7 Tax=Bacteria RepID=Q1J1R8_DEIGD Length = 891 Score = 219 bits (559), Expect = 2e-55, Method: Composition-based stats. Identities = 47/279 (16%), Positives = 106/279 (37%), Gaps = 17/279 (6%) Query: 471 DKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLIS 529 + V ++PLG++++E E++D AL+ + G RL +++ G +S Sbjct: 317 RAQLREALQVQARGGKVKPLGEVIVELGFARAEEIDAALQKQNAGGGRLEDTLVQSGKLS 376 Query: 530 AEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGI 589 E LA++LA Q G + + +P + A Y V+P+RL+ + L+V +D Sbjct: 377 PEMLARSLAAQLGYEYLDPVQNPPDPQVALMIPEATARRYTVVPVRLQGEALVVAMKDPR 436 Query: 590 DPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQA 649 + +L L GR++ ++ IV + ++ + + L + + ++ Sbjct: 437 NVFALDDLKLITGREIVPAVMSEKDIVRLIERYFGNQDMANLNQRLAAESKTREARKEAD 496 Query: 650 GEIWR--QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETL 707 + + ++ S I++ E + + +G L Sbjct: 497 LDFSAGLDDNAVVRVVDNLIREAALQEASDIHI-----EPTESAVRVRYRVDG-----AL 546 Query: 708 DRVLTIQR-ELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 + + Q + + + GL+ + +G Sbjct: 547 REQPELPKGSAQSILARIKIMGGLDIAE---RRVPQDGR 582 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 55/228 (24%), Positives = 91/228 (39%), Gaps = 8/228 (3%) Query: 489 PLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE- 546 LGQ L+ +I E QL AL + G LG ++ QGL+S +QL + LAEQ G + Sbjct: 166 KLGQRLISRGLINEAQLQVALDVQQQTGEALGHILVTQGLLSEDQLYEVLAEQAGAVYLR 225 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 + +Q ++ + + AL + +P+ + V D L +GR V+ Sbjct: 226 NPRDFQPGEEVLGSLLRADALRLSAVPVDETAQGVTVVVSDP---RRRDELEALIGRPVQ 282 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEI 666 V+ G + + +Y +R + + ++ L E + E+ Sbjct: 283 LVLARPGDVEALIERYYPQRGRLGEQMVQQGSLSRAQLREALQVQARGGK---VKPLGEV 339 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 + LG I+ L + L LV G +S E L R L Q Sbjct: 340 IVELGFARAEEIDAALQKQNAGGGRLEDTLVQSGKLSPEMLARSLAAQ 387 Score = 109 bits (273), Expect = 3e-22, Method: Composition-based stats. Identities = 49/229 (21%), Positives = 91/229 (39%), Gaps = 20/229 (8%) Query: 488 RPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 R LG ILLE +T+ L AL E G RL ++ G + +++A+A+ E G+ Sbjct: 8 RRLGAILLEQGYVTDTDLQKALVRHAEVGGRLADILIESGQVGEKRIARAIEEALGIPLV 67 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++ ++ +A + A A P LE L V D + V++ AL G + Sbjct: 68 NLLVVTPDAAALAAIRAETAKQMQAFPFALEGQTLRVALVDPLSSVAIEALEDDSGLNIE 127 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAE- 665 LR Q++ + +Y ++ + + + G++ + L + Sbjct: 128 PYQALRDQVLWSIATYYP------ELGLMPVLPEGAAGSSESGGKLGQ------RLISRG 175 Query: 666 ILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 ++ + E L VT+G++S++ L VL Q Sbjct: 176 LINEAQLQVALDVQ--QQTGEALGHIL----VTQGLLSEDQLYEVLAEQ 218 >UniRef50_D1B8K4 Glycosyl transferase family 2 n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B8K4_THEAS Length = 438 Score = 219 bits (558), Expect = 3e-55, Method: Composition-based stats. Identities = 65/492 (13%), Positives = 139/492 (28%), Gaps = 68/492 (13%) Query: 5 LDVFATWL--YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE 62 + F+ W + ++ L ++ G+ F + W Sbjct: 6 MFYFSVWWGWWLVQCCLYLLFGLLIADGIYQFVVSFRGW---------------WTPKAP 50 Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 + A+++PA NE VIG + E + Y +FV T + Sbjct: 51 PKASRYRRFAVLIPAHNEARVIGPLLESLKEQDYPKDCYRVFVSCDNCTDHTAQVAA--- 107 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL 181 ++ + K + L I ++ DA+++ S L Sbjct: 108 --LHGAVPLIRTDTTKSGKTWNVRWALTQI------PMDEVDALVMFDADNLASRDFLSR 159 Query: 182 FNYLV---ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGV 238 N + + +Q + + T Y + + R G Sbjct: 160 MNDYMEAHPEAEAVQGVLDVKNPDDNWLT-KAYALAYWYTNRFWQLARSNWGLSCTLGGT 218 Query: 239 GTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQR 298 G + + ++++SLTED ++ RL G + VV + K Sbjct: 219 GLVIRSSTLRRI-------GWNLESLTEDLEMSTRLILSGSRVHWNEHAVVYDEK----- 266 Query: 299 KFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDR 358 P + +VRQ++RW+ G + ++ + + Sbjct: 267 ------------------PLDYRISVRQRTRWMQGHYWVCWRYGMEALKMFFRTRRLQYL 308 Query: 359 KGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI 418 + A + ++ + +AY + A L W+ F + + Sbjct: 309 DLFLYLLAPAKACISLLAMFAGMAYTVINNAIL--FPTLESKAPTTPLEWMAFVGLPVAM 366 Query: 419 VQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVL---QHGDPRRVAWDKTTH 475 + G + + + ++ + + +L + W KT H Sbjct: 367 ILAHCLFVALVGPSMHRRRLCLGYVKDVFGYFLFGLSWIPILFKAAFLAKDQGVWVKTEH 426 Query: 476 DFPSVTGDTRSL 487 G + Sbjct: 427 TRSISIGQVTNR 438 >UniRef50_D0LLW4 General secretory pathway protein E n=2 Tax=Nannocystineae RepID=D0LLW4_HALO1 Length = 610 Score = 219 bits (558), Expect = 3e-55, Method: Composition-based stats. Identities = 54/261 (20%), Positives = 105/261 (40%), Gaps = 17/261 (6%) Query: 490 LGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES- 547 LG+IL+ + + E L+ AL R + EG LG ++ + + +AL Q+ + + Sbjct: 35 LGEILMRDAGLRPEHLERALARQQDEGGLLGEILVRLQAVEEAAVMRALGVQHDMPVATE 94 Query: 548 -IDAWQIPSSLIAEMPASVALHYAVLPLRLE-NDELIVGSEDGIDPVSLAALTRKVGRKV 605 DA + + LI ++P + A + VLP+R++ ++ + V D ++ L ++ +GR V Sbjct: 95 LPDAESVDAELIDKIPINFAKTHRVLPIRIDADENVEVLVSDPLEVEVLDDISVLLGRAV 154 Query: 606 RYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIW-RQYVPHQFLFA 664 V+ +IV + Y R RG A + E+ + P Sbjct: 155 EGVLCPPSRIVDLINKVYGRLRGGAELAEKQDVEDEYGDDEELVDILDLTDEAPIIRWVN 214 Query: 665 EILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSL 724 +L S I++ E + +GV+ + ++ L + + Sbjct: 215 SLLFHAIKERASDIHI-----EPGEKEVMVRYRVDGVLREHK----RAHRQYLPSIIARV 265 Query: 725 LLKAGLNTEQVAQLESENEGE 745 + AGLN + +G Sbjct: 266 KIMAGLNIAE---KRLPQDGR 283 >UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain protein n=4 Tax=Actinomycetales RepID=A1R2V1_ARTAT Length = 431 Score = 219 bits (558), Expect = 3e-55, Method: Composition-based stats. Identities = 70/451 (15%), Positives = 136/451 (30%), Gaps = 53/451 (11%) Query: 25 IMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 I+ ++ + + + R + RY +S R L ++++VP +NE VI Sbjct: 18 ILGFGMAKIVYVPMALYFDWLYRSICARHRYSVLSDRPL-------VSVIVPGYNEAVVI 70 Query: 85 GNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCL 144 E + Y + + + +T ++ + ++ V + K L Sbjct: 71 TGCVESILAS-RYLRLEVILVDDGSTDETASIMEGLAQQYDRVRFL---SQANAGKGAAL 126 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYLVERKDLIQIPVYPFERE 202 N + A + DA+ V +P L + + + P + Sbjct: 127 NCGIAAAL---------GDILMFVDADGVFAPDTLIHMLEGFDDPKVGAVCGDDRPVNLD 177 Query: 203 WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQ 262 +M G + +G F V L F Sbjct: 178 R--LQTMMLAILSHVGTGLVRRALSLMNCLPIVSGNIGAFRSDLVREL------GGFHED 229 Query: 263 SLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFST 322 +L ED ++ +R+ + G F +V P T Sbjct: 230 TLGEDLELTWRVYKAGYRVRFQPKALVY-----------------------AESPSTMGG 266 Query: 323 AVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLA 382 RQ+ RW G++ S F AI+ + + LV++ LL L Sbjct: 267 LWRQRVRWSRGLLQTLRLHSGMLGSRRYGMFGAFLVFNAITMVLIPILQLVVLALLPFLY 326 Query: 383 YESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLF 442 + P L+I ++L + F + +NR + + F+ + V Sbjct: 327 VAGMGPVPAEVLAILGWLGVFVSLALIVFSVGLNRSWRDLRFLWTLPLWPFYSVFVGLAL 386 Query: 443 WGNLINFMANWRALKQVLQHGDPRRVAWDKT 473 ++ + A LQ + V + Sbjct: 387 ASAIVKEIRGSPARWNKLQRTGIKSVTATHS 417 >UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA n=63 Tax=Staphylococcaceae RepID=ICAA_STAA8 Length = 412 Score = 219 bits (557), Expect = 4e-55, Method: Composition-based stats. Identities = 60/461 (13%), Positives = 134/461 (29%), Gaps = 56/461 (12%) Query: 25 IMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVI 84 F+ + + V I + RY ++ + + + ++ +NE+ I Sbjct: 3 FFNFLLFYPVFMSIYWIVGSIYFYFTREIRYSLNKKPDINVDELEGITFLLACYNESETI 62 Query: 85 GNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCL 144 + L YE I + + +T + ++ N + KA+ L Sbjct: 63 EDTL-SNVLALKYEKKEIIIINDGSSDNTAELIYKIKE---NNDFIFVDLQENRGKANAL 118 Query: 145 NNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN---YLVERKDLIQIPVYPFER 201 N + + + + DA+ ++ + + P R Sbjct: 119 NQGIKQAS---------YDYVMCLDADTIVDQDAPYYMIENFKHDPKLGAVTG--NPRIR 167 Query: 202 EWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDV 261 + E++ L G + +GV T F + AV + +D Sbjct: 168 NKSSILGKIQTIEYASLIGCIKRSQTLAGAVNTISGVFTLFKKSAVVDV------GYWDT 221 Query: 262 QSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFS 321 +TED + ++L +G + + P+T Sbjct: 222 DMITEDIAVSWKLHLRGYRIKYEPLAM-----------------------CWMLVPETLG 258 Query: 322 TAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLL 381 +Q+ RW G + + S++ F I F +++L + +LL L Sbjct: 259 GLWKQRVRWAQGGHEVLLR--DFFSTMKTKRFPL-----YILMFEQIISILWVYIVLLYL 311 Query: 382 AYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRL 441 Y + + + + + + + + V + + + Y L + Sbjct: 312 GYLFITANFLDYTFMTYSFSIFLLSSFTMTFINVIQFTVALFIDSRYEKKNMAGLIFVSW 371 Query: 442 F--WGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 + +IN A + L+ W Sbjct: 372 YPTVYWIINAAVVLVAFPKALKRKKGGYATWSSPDRGNTQR 412 >UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V8R5_9EURY Length = 567 Score = 219 bits (557), Expect = 4e-55, Method: Composition-based stats. Identities = 69/481 (14%), Positives = 136/481 (28%), Gaps = 72/481 (14%) Query: 8 FATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYR--------RYPRMS 59 + WL + + +F + + V+ + + V R S Sbjct: 133 LSIWLAIVLSALVLTGGFVFGWSVPTPYEMAVFAGFVVIAFIVVVVFPLTIVQMRGREES 192 Query: 60 YRELYKPDEKPLAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVD 118 L D ++++VPA+NE+ +G+ + + + V + T + Sbjct: 193 DHTLDDDDAPLVSVLVPAYNESNYVGDCLDSILASDYPTDRLEVIVIDDGSTDGTYAEAS 252 Query: 119 EVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME 178 +V K LN L + + DA+ +++P Sbjct: 253 AY-----RNDRVSVFHRSNGGKHAALNLGL---------SCSRGDVVVAVDADSILAPSA 298 Query: 179 LRLFN---YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPS 235 LR R + V + E+ L + Sbjct: 299 LRTAVEQLQSDPRLGAVAGTVVVNNAD--GIVGSVQALEYVLGINTLRRAFSYLGTVMVI 356 Query: 236 AGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER 295 G F R A++ + +D ++TED+D+ RL + G Sbjct: 357 PGCLGVFRREALSEV------GGYDPDTVTEDFDLTVRLLKAGWRVE------------- 397 Query: 296 EQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW 355 + V P + + + Q+ RW G + K S Sbjct: 398 -----------LSEALVYTEAPFSLTDLLNQRLRWTRGNIQTLLKHRDVFSEPA------ 440 Query: 356 RDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV 415 F+ A + +L + + S+ F++I +G A+L L + L V Sbjct: 441 -------YGFLHRFAFPLSALSILFVPFASIVVTTMIFVAILNG-AFLGVALVAAYFLFV 492 Query: 416 NRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 V + + L L + R +L+ + R + + Sbjct: 493 LLFVAAMALDLSDGDWRLLAYAPLHLVGYRQFLDVIVIRTALMLLRGTNKRWESVTRERQ 552 Query: 476 D 476 Sbjct: 553 Q 553 >UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales RepID=C7MUK1_SACVD Length = 1099 Score = 218 bits (556), Expect = 5e-55, Method: Composition-based stats. Identities = 77/488 (15%), Positives = 134/488 (27%), Gaps = 69/488 (14%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELY----KPD 67 + G +A+ ++ +S L + V L V R P Sbjct: 280 VSGSAFVAVVNGALVVVSVLKWLLVAVGVLTVLRLLLLVVVAGRHAARRRGARSCWGPPV 339 Query: 68 EKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNV 127 +P++++VPA+NE I A + I V + T V+ + V Sbjct: 340 TEPVSVIVPAYNEAANIEATVRSAVASTH--PVEIIVVDDGSTDGTADLVEGLG--LSGV 395 Query: 128 HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--NYL 185 + KA LN + A + + ++ D + V P + + Sbjct: 396 RVLRRP---NRGKAAALNTGIAAAS---------YDLIVMVDGDTVFEPNTVHELVQPFA 443 Query: 186 VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 + V RE + E+ D V E + G G F R Sbjct: 444 DPEVGAVSGNVKIANRE--TLLARLQHIEYVVGFNVDRRVHEVMRSMPTVPGAGGAFRRS 501 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 A+ + Q+L ED D+ + G +F V Sbjct: 502 ALLQV------GGLSAQTLAEDTDLTISIGRAGWRTVFQEKAV----------------- 538 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNF 365 P T RQ+ RW G + +K K + R + F Sbjct: 539 ------TWTEAPTTVRQLWRQRFRWTFGTLQALWKHRKAIVQRGAAGRVGRFGMLHVVCF 592 Query: 366 VSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 L M+ + D + + W LLWL L + F Sbjct: 593 QVLLPMIAPVI------------DVFLVYGVLFLDPWTTVLLWLTM-LGIQAAAAAYAFH 639 Query: 426 TGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTR 485 T L + + ++ ++L RV W + + Sbjct: 640 LDGERKTVLWLLPAQQLIYRQLMYVVLMQSL---AAAASGVRVRWQHMRRSGLTRFPAVQ 696 Query: 486 SLRPLGQI 493 + + + Sbjct: 697 AAQSVPAP 704 >UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli RepID=C7TL81_LACRL Length = 417 Score = 218 bits (556), Expect = 5e-55, Method: Composition-based stats. Identities = 70/467 (14%), Positives = 137/467 (29%), Gaps = 64/467 (13%) Query: 16 KVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMV 75 ++ + I I ++ + Y+ + P +++MV Sbjct: 6 WIMLFAIGAIWLILMVNVILVVAGYFEYMKMTQQPEPSLPPTPP----------MVSVMV 55 Query: 76 PAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCAR 134 PA NE VI E + Y I V + ++ + ++ R+P+ V Sbjct: 56 PAHNEGVVIVKTVESLLRFDYPQDRYEIIVINDNSSDNSAALLRDLQHRYPDRQLHVINT 115 Query: 135 P---GPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN---YLVER 188 G K++ LN L + ++DA++ LR+ + Sbjct: 116 DAVTGGKGKSNALNIGLTKA---------QGSVLAIYDADNTPEFGALRILVSELMADDG 166 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 + ++ T T I E R+ L G G R + Sbjct: 167 LGAVIGKFRTRNKQATWLTRFINI-ETLSFQWMAQAGRQHLFGLCTIPGTNYVIRRSLID 225 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 + +DV++L ED +I F + G F V Sbjct: 226 KI------GGWDVKALAEDTEISFHVYMNGARIKFQPKAV-------------------- 259 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSF 368 E P T Q++RW+ G ++ K + F+ Sbjct: 260 ---TWEQEPQTLDVWFHQRTRWVKGNIYVILKNSALLFQKRGRPIRFDLIYFLSIYFLLM 316 Query: 369 LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGY 428 ++++ + +L L FS WL+ +L + ++ G Sbjct: 317 TSLVLSDAVFILSTAGWAHVG----LKGFSTGLWLLAILLFIVSTFITISTEKGEMTVGN 372 Query: 429 YGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 G+ + W +A + + + + ++ W KT Sbjct: 373 VGIIGLMYITYSQLWL----AVALYGMVAYIREQLFHQQAHWYKTKR 415 >UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacteraceae RepID=Q1GJ85_SILST Length = 1002 Score = 218 bits (556), Expect = 5e-55, Method: Composition-based stats. Identities = 58/457 (12%), Positives = 123/457 (26%), Gaps = 54/457 (11%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 V + F + + + +R + +A+++PA NE Sbjct: 586 FSLVAWGQDAIVILFWLALGIGVVRSVAILLLAVLNWRGHRTISL-TTPKVAVIIPAHNE 644 Query: 81 TGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSK 140 VI + + + DY+N I V + +T ++ ++ +V K Sbjct: 645 EKVIRSCIQ-SVRASDYKNLEIIVVDDGSSDNTLNEIFAF----SHMREVRLISQPNQGK 699 Query: 141 ADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV--ERKDLIQIPVYP 198 LN L N + + DA+ I + R + + Sbjct: 700 WSALNRAL---------MNTSAEIVVCIDADTQIEKSAIGHMVRHFDNPRIGAVAGKIIA 750 Query: 199 FEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIA 258 + + E++ + + + G + G + A+ Sbjct: 751 GN--KVNLLTRLQALEYTTAQNVERKAFDLINGMLVVPGALGAWRVAALRK------AGH 802 Query: 259 FDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPD 318 F +++TED D+ + G + P+ Sbjct: 803 FSDETMTEDTDLTIEVNRAGYRIAYEPLARGYT-----------------------EVPE 839 Query: 319 TFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLL 378 ++Q+ RW G+ +K K + LA + L Sbjct: 840 RIGQLLKQRLRWSFGMFQSAWKHKKAMFEGRSVGLISIPDMFIFGYLFPLLAPI--ADLF 897 Query: 379 LLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSV 438 + + + W L +L + I + + LL Sbjct: 898 VAILLYQMVSGGWD-SGAVGAQNMQYLLAYLTLPALEFVIAAFALARDKDESMWSLLLFP 956 Query: 439 LRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 ++ I + + RA+ L+ R +W Sbjct: 957 VQRVLYRPILYYSVIRAI---LRAITGRLFSWGAQKR 990 >UniRef50_D2MLP0 Glycosyltransferase, group 2 family protein n=1 Tax=Bulleidia extructa W1219 RepID=D2MLP0_9FIRM Length = 428 Score = 218 bits (556), Expect = 6e-55, Method: Composition-based stats. Identities = 68/456 (14%), Positives = 137/456 (30%), Gaps = 62/456 (13%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 ++ + V+M ++ L FF + + + +R R A+ Sbjct: 7 FELFTDVIFVVMTVAYLYQFFYIIYSIFKYKVPVMPEAKRLHR-------------YAVF 53 Query: 75 VPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 + A NE VIG + + + Y I+V DT AR + Sbjct: 54 ISARNERNVIGELLDSLTNQDYPRDKYDIYVTADNCTDDT-----AQVARDHGAYAFERF 108 Query: 134 RPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE--RKDL 191 K LN + + + + ++ DA++++ L+ N + + D Sbjct: 109 NDEKKGKGYALNEMYHQVIALKGQGY--YEAVVVFDADNIVDAQFLKEMNKTFDTGKYDA 166 Query: 192 IQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALL 251 + T+ Y F R L Q +G G S + + Sbjct: 167 LTTYRNSKNFGQNWLTA-AYSLWFMHEARHLNYARMMLGAQCMISGTGFVVSTKLMD--- 222 Query: 252 ADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMIC 311 + LTED +G + ++ + + Sbjct: 223 ---INEGWPYYLLTEDIQFSVASTLQGFHIGYCDTAILYDEQ------------------ 261 Query: 312 VREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAM 371 P T+ A RQ+ RW G + +S + R I V ++ Sbjct: 262 -----PATWKQAWRQRLRWAKGFYQIDGRYLGPLASGVVKGKNRRLAFYDILMTVLPSSL 316 Query: 372 LVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGL 431 L + ++L L ++++I + L L+ L G + + + + Sbjct: 317 LTVALIILALWVLVSSSVMPYYVAIVFQNEMLWYLIKLIGGSWIGLTLMAFVTTVQEWKR 376 Query: 432 TQG---------LLSVLRLFWGNLINFMANWRALKQ 458 LL + L I+ +A ++ ++ Sbjct: 377 IPATKVEKLGACLLFPVYLLSYIPISIVAIFKKVEW 412 >UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NHH7_GLOVI Length = 426 Score = 217 bits (554), Expect = 9e-55, Method: Composition-based stats. Identities = 78/482 (16%), Positives = 145/482 (30%), Gaps = 68/482 (14%) Query: 8 FATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPD 67 T L ++ LA+ + +G ++ R L P+ Sbjct: 10 VQTALVWSALVLFVLALHLIPTGASVVWLAAGLTGLYAVRVLFALSPRPKSDPAYR---- 65 Query: 68 EKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNV 127 ++++V A NE V + LDY ++ +++ + T + + E + + Sbjct: 66 -PRVSVLVAAKNEQAVAAQLVA-MLRRLDYPDFEVWIADDGSTDRTYQRLLEAGRGWQAL 123 Query: 128 HKVVCARPGPT-SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV 186 H V K+ LN E ++ DA+ + P L L Sbjct: 124 HLVRRIPERSRPGKSAVLN---------ELRERATGDILVVFDADARVEPDFLSRTVPLF 174 Query: 187 E--RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 +Q+ ++ +T + + R A+ G G G Sbjct: 175 AVSSVGALQVRKRVHNADFNFWTRGQSAEMLLDAF--YQQQRAAIGGTAELRGNGQLVRA 232 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 A+ A+ ++ ++T+D D+ RL G F P Sbjct: 233 AALEAV------GGWNEATVTDDLDLTLRLHLGGWQIAFASDP----------------- 269 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISN 364 CV E T+S RQ+SRW G + + D+ Sbjct: 270 ------CVDEEGVTTWSALWRQRSRWAEGGFQRYLDYAPRLFGGAMGTTKTVDQL----- 318 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF 424 + IQ L+ +A A+ + + ++ +R + Sbjct: 319 ------IFCTIQYLMPVAAVLDLLFAFQRGAAPLLTPLVLVATVFTVCGFYFGQRERGV- 371 Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDT 484 G L ++ L W +I AL +P+++ W KT H T D Sbjct: 372 QVGRAMLETLAGTIYFLHWFPVILVKLARTAL-------EPKKLVWVKTAHQGEYGTADF 424 Query: 485 RS 486 + Sbjct: 425 KP 426 >UniRef50_A3DDQ8 Type II secretion system protein E n=3 Tax=Clostridium thermocellum RepID=A3DDQ8_CLOTH Length = 561 Score = 217 bits (554), Expect = 1e-54, Method: Composition-based stats. Identities = 45/265 (16%), Positives = 103/265 (38%), Gaps = 13/265 (4%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 + + LG IL+E +I++EQLD AL+ + G +LG ++ +G+++ E + + L E+ Sbjct: 1 MQKQKRKGLGDILVEAGLISKEQLDKALKLQKKTGQKLGVLLVSEGIVTQEDIMRVLEEK 60 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 GV +++ I ++ + +P +A Y ++P+ ++ L V D ++ ++ + Sbjct: 61 IGVLRVALEECNIDPAVCSLIPEKLARRYELIPIAQKDGVLRVAMSDPLNVFAIDDIEDY 120 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 G +V V+ I + +Y + + + + + Sbjct: 121 TGMRVEPVVDFASSIKNAIDKYYRTQHVLVEPVKEKGILFKIDEETIELESVEAENESAS 180 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L I+ I++ E L T+G + + + L Sbjct: 181 MLLNSIIEQAIRNGSGDIHI-----EPLQNALKIRFRTDGQMHEVMRTEI----GMLNGV 231 Query: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 + + G+N + + +G Sbjct: 232 LAKIKAICGMN---MNEKAVPQDGR 253 >UniRef50_Q15ZI3 Type II secretion system protein E n=119 Tax=Proteobacteria RepID=Q15ZI3_PSEA6 Length = 637 Score = 217 bits (553), Expect = 1e-54, Method: Composition-based stats. Identities = 56/266 (21%), Positives = 101/266 (37%), Gaps = 14/266 (5%) Query: 482 GDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQ 540 ++ LG +L++ +I+EEQL L + G +LG ++ G ++ QL L++ Sbjct: 1 MKPKAKIRLGDLLVQEGIISEEQLMQTLSAQKQSGRKLGYMLIELGFMTENQLLTFLSQH 60 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 GV + +++ + +P A Y L L + D L+VG D D +L L+ Sbjct: 61 LGVPLIDVTQYRVSVEAVLLLPEVQARRYRALVLDDKGDHLLVGMSDPADLAALDILSGV 120 Query: 601 VGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRA-MLYNAVQHQWLTEQQAGEIWRQYVPH 659 + + V+ +V Q+ +Y R A L Q + G Q Sbjct: 121 LPKPVKVAVVSDAQLFQAYDRFYRRTEDIASFAQELAEEYQDDEEFDFDTGVDNEQDTAV 180 Query: 660 QFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQV 719 L I S I++ E S L L +GV+ + ++ + Sbjct: 181 ARLLQSIFEDALQTKASDIHI-----EPDSEMLRIRLRVDGVLQEN----IIKEKNIASA 231 Query: 720 SMQSLLLKAGLNTEQVAQLESENEGE 745 + L L +GL+ + +G Sbjct: 232 LVLRLKLMSGLDISE---KRLPQDGR 254 >UniRef50_B2IIL4 Glycosyl transferase family 2 n=2 Tax=Beijerinckiaceae RepID=B2IIL4_BEII9 Length = 650 Score = 217 bits (553), Expect = 1e-54, Method: Composition-based stats. Identities = 71/465 (15%), Positives = 138/465 (29%), Gaps = 56/465 (12%) Query: 31 LDDFFIDVVYWVRRIKRKLSVYRRYPRMS--YRELYKPDEKPLAIMVPAWNETGVIGNMA 88 + ++++ + R + S + +I+V E ++ + Sbjct: 197 FGALVLAILFFGMLVLRLAAGAASLGPRSLWRPHVRDATLPLYSIVVALHREARIVPQLV 256 Query: 89 ELAATTLDYEN--YHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNN 146 + A +DY + + +D +T + + P +V P +K LN Sbjct: 257 D-ALERIDYPRAKLEVKLVIEADDRETLEALRK-ARLSPLYEIIVAPSGWPRTKPRALNV 314 Query: 147 VLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV----ERKDLIQIPVYPFERE 202 L + + DAED P++LR + +Q + E Sbjct: 315 ALPLL---------RGTFVTVFDAEDEPDPLQLRHAAEYFLASPKTLACLQARLVIDNVE 365 Query: 203 WTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQ 262 + T + + E++ L + L +P G F + A+ +D Sbjct: 366 DSWLTRL-FSIEYAVLFDVLLEGMSELRLPLPLGGSSNHFRADVLRAV------HGWDAW 418 Query: 263 SLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFST 322 ++TED D+G RL G + E P Sbjct: 419 NVTEDADLGMRLARNGYRTATLAS------------------------QTLEEAPARLDA 454 Query: 323 AVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLA 382 Q+ RW+ G + G L L + + L + +++ LL Sbjct: 455 WFSQRRRWLKGWMQTG------GVLLRDPRRLLAETGMKQGGALLLLLVGLVLAPLLWPI 508 Query: 383 YESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLF 442 + W + + WL F + V ++++ + LL++ Sbjct: 509 LTGVTLYQWMSGGLPEPTNWLGIFAATLFLAVSLLGVGSTLWLSWLGMRRRNLLNLWLFL 568 Query: 443 WGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSL 487 L ++ A L R W KT H R+ Sbjct: 569 PLILPYYLLISCAAWAALYDLLVRPFHWRKTEHGLARTRASRRAP 613 >UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridia RepID=A4J4W8_DESRM Length = 425 Score = 217 bits (553), Expect = 1e-54, Method: Composition-based stats. Identities = 65/459 (14%), Positives = 134/459 (29%), Gaps = 57/459 (12%) Query: 21 TLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNE 80 L I + + L + +++ + + R+ ++ E+ P+ I++P NE Sbjct: 10 LLHGIGYFTFLYPVTMSIIWVIGGLYFWWYRERKARTNNWPEV----WPPVTILIPCHNE 65 Query: 81 TGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARF-PNVHKVVCARPGPTS 139 I A + ++Y + + + +T + + E + P+ H + Sbjct: 66 EISIATT-CHALSKVNYPDLRVVFIDDASTDNTAQIIREWLRQEVPSFHLLRLTT--NQG 122 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV---ERKDLIQIPV 196 KA LN L ++ DA+ +I+P L+ R + Sbjct: 123 KAKALNCGLQVAVHTP--------ITVVIDADTLITPDTLKWLIAPFIRQPRLGAVSGNP 174 Query: 197 YPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDG 256 RE + EF+ + G + +L + +G T F + L Sbjct: 175 LVGNRE--NLLENLQTAEFASILGLIKRSQRSLGRMLTVSGCITAFCTDTLRQL------ 226 Query: 257 IAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYF 316 F +S TED DI + ++ F + Sbjct: 227 GGFSSRSATEDIDITWAIQRNFYEVWFEPRAIAYIQ-----------------------V 263 Query: 317 PDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFV-SFLAMLVMI 375 P T Q+ RW +G + WR +FV S+L ++ Sbjct: 264 PKTIKEFWHQRCRWALGGWHLLRSHWDIFTHWR-----WRRLWPVYLDFVISYLWSFCLV 318 Query: 376 QLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGL 435 LL L P + + + ++ ++ Y + Sbjct: 319 IGTLLWLVTYLIPSKPAIGLTPIPAWYGSVISFVCLVQFGAALLANHRHDHKMYQ-SFFW 377 Query: 436 LSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 + +F+ + W + + + Sbjct: 378 IPWYPIFFFCIGALTVVWTSCRGLFGDLVTVGKWKSPAR 416 >UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QUU5_9SPHI Length = 508 Score = 217 bits (553), Expect = 1e-54, Method: Composition-based stats. Identities = 54/456 (11%), Positives = 115/456 (25%), Gaps = 51/456 (11%) Query: 46 KRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFV 104 + RR S + L + +P +NE V+ + + + I V Sbjct: 30 YLRSEKKRRALAQSAADYSPEALPRLTVQLPVYNELYVVERLIDAVVLLKYPKDKLDIQV 89 Query: 105 GTYPNDPDTQRDVDEVCARFP--NVHKVVCARPGPTS-KADCLNNVLDAITQFERSANFA 161 + +T + A + RP KA L L Sbjct: 90 LDD-STDETVSIIARKVAEYKKQGFDIEHIRRPERKGFKAGALAYGLTLAK--------- 139 Query: 162 FAGFILHDAEDVISPMEL--RLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELH 219 + DA+ V P L + ++ + ++Q +++ + + Sbjct: 140 GEFVAIFDADFVPDPEFLLKTVPHFADPKVAIVQTRWEHLNEDFS-LITQLQAFGLNAHF 198 Query: 220 GKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGM 279 + R A G G + + A+ D + +LTED D+ +R + +G Sbjct: 199 TVEQSGRYAAGLLANFNGTGGVWRKVAI------ADAGGWQSDTLTEDLDLSYRAQLRGW 252 Query: 280 TEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGF 339 ++ + P + Q+ RW+ G Sbjct: 253 KFVY-----------------------REDVGSPAELPVAMNALKSQQYRWMKGAAECAR 289 Query: 340 KTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSG 399 K + F + I +L+L + Sbjct: 290 KLFVNVLKTPG-----VSLSMKLHAFFHLFSSATFILVLILGVMSVPLIYIRSQHPEWEW 344 Query: 400 SAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQV 459 ++ L N +++ V F+ G + ++ ++ Sbjct: 345 VFVVINLFQFNLLILITFYGIPVWFLKGANKARLAWYFPMYSSLMMGLSLHNTIAVIEGY 404 Query: 460 LQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILL 495 + P + + L Sbjct: 405 MGRKTPFVRTPKFNVKTAADSWAANKYISRRINWLT 440 >UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZC52_NATMA Length = 549 Score = 217 bits (552), Expect = 2e-54, Method: Composition-based stats. Identities = 67/452 (14%), Positives = 127/452 (28%), Gaps = 61/452 (13%) Query: 11 WLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP 70 WL + + L ++ I F + + +I R V P Sbjct: 105 WLTPSPLALVHLGAVVLIFVYYWFIAFIALFHDQIGRSKYV------------PNPPYPQ 152 Query: 71 LAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK 129 + +++PA+NE G +G + + I + DT + A V Sbjct: 153 ITVLIPAYNEEGYVGRTIQSLLDANYPADALEIIAVDDGSTDDTLAEASAFAAASEQVSV 212 Query: 130 VVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNY---LV 186 V K LN L + DA+ ++ L+ Sbjct: 213 V---SKANGGKYSALNYGLLFAA---------GDIIVTVDADSIVDRDALKHIVAPFAAD 260 Query: 187 ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRA 246 + + V + R+ + E++ + + G + R Sbjct: 261 DDIGAVASNVTIWNRDS--LITRCQQLEYTIGVNIYRRALDYFGIVMVVPGCLGAYRREV 318 Query: 247 VTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHART 306 ++ + A +D +LTED+D+ ++ G V Sbjct: 319 LSEVFA------YDPDTLTEDFDVTMKVLRAGYRVSVSDARV------------------ 354 Query: 307 SNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF-LWRDRKGAISNF 365 P T+ RQ+ RW G K + + Y + F Sbjct: 355 ------YTEAPATWGDLYRQRLRWYRGNYMTIIKHWSVVTDSSYGYLNRIALPFRLVEMF 408 Query: 366 VSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFV 425 A V++ +L L + F+ +L+ L + R++ + Sbjct: 409 FLPFASFVVLAYILWLIAAGHVLTVFAVFVFFTSIVFLIAALGIQIEGEDWRLLVYAPLL 468 Query: 426 TGYYGLTQGLLSVLRLFWGNLINFMANWRALK 457 Y L+V LF + RA + Sbjct: 469 VVGYKQFHDALNVKCLFDVLTSPELGWTRAAR 500 >UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomicrobia RepID=B2UMM8_AKKM8 Length = 505 Score = 216 bits (550), Expect = 3e-54, Method: Composition-based stats. Identities = 64/411 (15%), Positives = 125/411 (30%), Gaps = 48/411 (11%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPA 77 +++ I + L Y R+ ++ + + + + +P Sbjct: 1 MSLNYNFIWLLCYLLVLVGLAGYGFHRLSIVYLYWKNRNNKPQPKARFQELPVVTVQLPM 60 Query: 78 WNETGVIGNMAEL-AATTLDYENYHIFVGTYPNDPDT---QRDVDEVCARFPNVHKVVCA 133 +NE V+ + E AA + I + D T R V+E+ +R V Sbjct: 61 FNEKFVVDRLLESVAALDYPQDKLEIQILDDSTDDTTEQCYRKVEELKSR--GFDAVCIH 118 Query: 134 RPGPTS-KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN--YLVERKD 190 R T KA L ++ DA+ V P L+ + E Sbjct: 119 RTDRTGFKAGALEAATKVAK---------GEFLLILDADFVPEPDLLQKTIHFFTDENVG 169 Query: 191 LIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTAL 250 L+Q RE+ T + R G + + + Sbjct: 170 LVQTRWGHINREYNLLTR-IQGMYLDGHFAMEQTARNRSGRFFTFNGTAGIWRKCVI--- 225 Query: 251 LADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMI 310 GD + +LTED D+ +R++ +G I++ V Sbjct: 226 ---GDAGGWSHDTLTEDMDLSYRVQLRGWRFIYLNDVV---------------------- 260 Query: 311 CVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLA 370 P Q+ RW G + K + ++ S+L Sbjct: 261 -TPAELPVDMDGFKSQQHRWTKGSIQVCQKILLDIWRSNAPLKAKVEATTHLTCNYSYLL 319 Query: 371 MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 + ++ L+ + + + + F+ + + + MT + + M +IV R Sbjct: 320 LALLCFLVYPICTQRIPENETVFMWFVNVALFFMTSVAVCIFYMSAQIVVR 370 >UniRef50_D2QZA7 Type II secretion system protein E n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QZA7_9PLAN Length = 596 Score = 216 bits (550), Expect = 3e-54, Method: Composition-based stats. Identities = 40/271 (14%), Positives = 83/271 (30%), Gaps = 22/271 (8%) Query: 484 TRSLRPLGQILLENQVITEEQLDTALRNRVEGLR---LGGSMLMQGLISAEQLAQALAEQ 540 + LG IL+E +T L AL ++ R LG ++ L + +Q+ + LA+ Sbjct: 5 PAPPQRLGNILIERGYLTVAHLQQALDHQQRAGRGKLLGEILVELSLCTEDQVMECLAQV 64 Query: 541 NGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRK 600 V + ++ ++ +P V PL L V + + L + Sbjct: 65 YCVPYAKLEQRLSDPRIVELLPREYIEKNLVFPLFRIQQTLTVAVTEPSNLFLLEEIRGL 124 Query: 601 VGRKVRYVIVLRGQIVTGL-----RHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQ 655 G V+ V I + + + + + + + Sbjct: 125 TGLTVQIVASSAKDIRRMITTLPDSKTFVIEDIIEDNSQTEVTLIESAVEDISDSTECAG 184 Query: 656 YVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ- 714 P L ++ S I++ E + + +G L + L + Sbjct: 185 QSPVIRLVNYVIYHAVKEGASDIHI-----EPAERCVRVRYRIDGK-----LYKSLEVPL 234 Query: 715 RELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 L + + A L+ + +G Sbjct: 235 NLLGAVTSRIKIMASLDISE---RRLPQDGR 262 >UniRef50_Q39PZ3 Response regulator receiver domain protein (CheY-like) n=6 Tax=Deltaproteobacteria RepID=Q39PZ3_GEOMG Length = 823 Score = 216 bits (549), Expect = 3e-54, Method: Composition-based stats. Identities = 41/318 (12%), Positives = 108/318 (33%), Gaps = 30/318 (9%) Query: 445 NLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQ 504 + + + ++ + +A + T LG IL E VI ++ Sbjct: 147 LRLTVSLALQQYALIQENKKLKEIAKAQQTKIRNFAGLFDEDKSMLGSILTEAGVIRKDD 206 Query: 505 LDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPAS 564 L+ + G L +++ + + ++ + L GV + + I ++ +P Sbjct: 207 FAAVLQGKKPGELLVDALVRTSVSTEAKILKTLQNHLGVEFIDLREANITPGVVRCLPRD 266 Query: 565 VALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWY- 623 + + ++P+RL+ + L + D D + ++R +G KV ++ +I L Y Sbjct: 267 MCDRHRLIPVRLDGNRLTIAMADPSDIFKIDNISRVLGLKVMPLLSTSSEIQAQLARIYG 326 Query: 624 --------ARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQY-------VPHQFLFAEILT 668 D L + + E + ++ P + +++ Sbjct: 327 EGAAAIVSGGGEELDEFGELEPLDEIDIVIEDEEADVSVDELIGSSKVPPIIRVVNAVIS 386 Query: 669 TLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQ-SLLLK 727 S I++ E + +G++ + + I ++ ++ + + Sbjct: 387 EAVRYRASDIHI-----EPKTKCTVIRYRIDGLLHGK-----IRIPSDIHAAVVSRVKIL 436 Query: 728 AGLNTEQVAQLESENEGE 745 A ++ + +G Sbjct: 437 AKMDISE---RRKPQDGR 451 >UniRef50_Q1Q644 Strongly similar to general secretion pathway protein E n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q644_9BACT Length = 557 Score = 215 bits (547), Expect = 6e-54, Method: Composition-based stats. Identities = 53/260 (20%), Positives = 91/260 (35%), Gaps = 19/260 (7%) Query: 488 RPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 +G++LLE I + L+ A + G +LG ++ G +S E L A + + Sbjct: 7 NNVGEVLLEIGKINRQDLERAFEAQKQTGQKLGRILIDLGTVSEEDLRLAYSRWLEIPVW 66 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ++ +P VLPL L+ + L + D D + + A+ GR+VR Sbjct: 67 EKKKTDTYP-MLENVPKVFLTTNRVLPLSLDENVLDIALADPQDTLLIEAIALSTGREVR 125 Query: 607 YVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEI 666 I + L Y + AM A + + + + P L I Sbjct: 126 VFAGTERDISSSLEKLYETGVSEEEDAM---ASSVEMMEDIEQLRDMASEAPVIRLVNSI 182 Query: 667 LTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL-QVSMQSLL 725 LT + S I++ E +GV L + REL + + Sbjct: 183 LTKAIEVGASDIHI-----EVFERNTRLRYRVDGV-----LGELAPPPRELYNSIVSRIK 232 Query: 726 LKAGLNTEQVAQLESENEGE 745 + A LN + +G Sbjct: 233 IMAKLNIAE---KRLPQDGR 249 >UniRef50_Q886Q3 Glycosyl transferase, group 2 family protein n=2 Tax=Pseudomonas syringae pv. tomato RepID=Q886Q3_PSESM Length = 842 Score = 214 bits (546), Expect = 8e-54, Method: Composition-based stats. Identities = 65/448 (14%), Positives = 141/448 (31%), Gaps = 48/448 (10%) Query: 4 LLDVFATWLYGLKV-----IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRM 58 + T L G+ + +++ +A+ + + + I + Sbjct: 334 IFAALYTLLVGVGISYAQPLSMWVALPIALVWVTSLLIGTGIQGYEFLESCWGPEKPRSF 393 Query: 59 SYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPN-DPDTQRDV 117 Y ++I VP +NE + + A LDY N+ + + DP+ + Sbjct: 394 PPLRAYPGPLPKVSIHVPCYNEPPDMVKLTLDALQRLDYPNFEVLIIDNNTQDPEVWEPI 453 Query: 118 DEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 ++ C + ++ P K+ LN +LD + DA+ + Sbjct: 454 EQYCRQLGPRFRLFHVNPLSGFKSGALNYLLDYTAKD-------AEIVAAIDADYCVHRH 506 Query: 178 ELRLFN--YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPS 235 L+ + +IQ+P + + + F E+ + +R + Sbjct: 507 WLKHMAPYFACPDIAVIQVPQDYRDGDDSLFKRCCQA-EYRVFFNIGMVIRNDHDAIIQ- 564 Query: 236 AGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER 295 G T + L + S+ ED ++G R+ E G + +V Sbjct: 565 HGTMTLIRNSVLQRLR-------WAEWSICEDAELGLRILENGFSTGYVAISYG------ 611 Query: 296 EQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW 355 + PDTF +Q+ RW G++ + + T Sbjct: 612 -----------------KGLIPDTFMDFKKQRYRWAYGVIQILKRHTGSLIAGTCEALTP 654 Query: 356 RDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDA-WHFLSIFSGSAWLMTLLWLNFGLM 414 R I+ ++ ++A + L + + S+ L LL G+ Sbjct: 655 IQRYHFIAGWMPWIAGGINYFLAIAVLLWSMAMIIQPDTLEPVPWIFSSSLLLMFVLGVC 714 Query: 415 VNRIVQRVIFVTGYYGLTQGLLSVLRLF 442 + + + T +++ + L+ Sbjct: 715 KAISLYQRLASTDIKDAFAAIIASMALY 742 >UniRef50_Q7UE44 General secretion pathway protein E n=1 Tax=Rhodopirellula baltica RepID=Q7UE44_RHOBA Length = 587 Score = 214 bits (546), Expect = 9e-54, Method: Composition-based stats. Identities = 55/266 (20%), Positives = 103/266 (38%), Gaps = 20/266 (7%) Query: 487 LRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWE 546 ++ +G IL+E ++T EQ+D A + G+ +G ++ Q LIS QL AL+EQ V + Sbjct: 1 MKRIGDILVELNILTNEQMDAAFAGKPRGVMIGDWLVRQSLISNAQLGAALSEQFSVPFV 60 Query: 547 SIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVR 606 ID + + +P A A + + + + L + D ++A G K+R Sbjct: 61 DIDFSSVNPQVARLLPEDFARSQASVAIDVSDRMLTLAMVAPDDIETIAEAELMTGYKIR 120 Query: 607 YVIVLRGQIVTGLRHWYARRR------GHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQ 660 V+ L + L Y R A + + + E + ++ P Sbjct: 121 PVVALEDDVRDLLNRIYDDRAFARQTIVDMKFAEMAESGEVTEEDELAMSAVSQEDAPVV 180 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L IL+ S I++ E + +G L V+TI ++ S Sbjct: 181 KLVQAILSGAVSAGASDIHL-----EPHKPEMRVRYRVDG-----ELQVVMTIPNHIEDS 230 Query: 721 MQ-SLLLKAGLNTEQVAQLESENEGE 745 + + + ++T + +G Sbjct: 231 VISRIKVMGDMDTTE---NRRPQDGH 253 >UniRef50_D2QZD9 Type II secretion system protein E n=4 Tax=Planctomycetaceae RepID=D2QZD9_9PLAN Length = 573 Score = 214 bits (545), Expect = 1e-53, Method: Composition-based stats. Identities = 46/264 (17%), Positives = 93/264 (35%), Gaps = 19/264 (7%) Query: 491 GQILLENQVITEEQLDTALRN---RVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWES 547 +ILL +I + QLD A +G R + + G +S E +AL ++ G+ + Sbjct: 4 CEILLRRGLIDKRQLDQARGQANGHGDGARQIEAAIQLGFVSEEAALRALGDEVGIEYVD 63 Query: 548 IDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRY 607 + +I SL+ P + ++ P+ + +L+V + + D L ++ G V Sbjct: 64 LTEAEIDLSLLKIFPHRLIHRQSLFPISKTDGQLVVATSNPFDLYPLDEVSAATGLAVMP 123 Query: 608 VIVLRGQIVTGLRHWYA------RRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQF 661 V+ R +I ++ + + T+ Q Sbjct: 124 VLAARAEIAKLIKRHLGVGSETVEGLVAQAQEEAALELVGDIETDGSELSEMAQEASVVR 183 Query: 662 LFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM 721 L EIL + S +++ E L +G++ + + I R + Sbjct: 184 LVNEILLEAIELRASDVHI-----ESQPSGLAIRYRVDGMLQSQPIPP--EIHRFEAAIV 236 Query: 722 QSLLLKAGLNTEQVAQLESENEGE 745 L + + LN + +G Sbjct: 237 SRLKIMSRLNIAE---KRLPQDGR 257 >UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteobacteria RepID=C4KBM8_THASP Length = 868 Score = 214 bits (545), Expect = 1e-53, Method: Composition-based stats. Identities = 65/425 (15%), Positives = 124/425 (29%), Gaps = 53/425 (12%) Query: 11 WLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP 70 +L +I + L + + + +K R P + + Sbjct: 371 YLTQRDLIGLVLLIGATCMTAAVLLSHGFEFGEVLFKKKWARRFTPLPPHPP---EQQPF 427 Query: 71 LAIMVPAWNE--TGVIGNMAELAATTLDYENYHIFVGTYPNDPDT-QRDVDEVCARFPNV 127 ++I + +NE VI + + ++Y+N+ + + + + ++ CA Sbjct: 428 VSIHLACYNEPPEMVIA-TID-SLAQMNYQNFEVLILDNNTRDEALWKPLERRCAELGPR 485 Query: 128 HKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV- 186 + P KA LN L + + DA+ V+ P L Sbjct: 486 FRFFHLANWPGFKAGALNYGLK-------VTDPRAEVVGVVDADYVVDPDWLACLVPHFD 538 Query: 187 -ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRR 245 ++Q P + E F M EF + R + G T RR Sbjct: 539 QPEVAVVQAPQAHRDWEGQPFKRMCNW-EFDGFFRIGMHHRNERNALIQ-HGTMTMVRRR 596 Query: 246 AVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHAR 305 A+ + + + ED ++G RL EKG ++ Sbjct: 597 ALEEV------GGWSEWCICEDTELGLRLIEKGYDTRYIDH------------------- 631 Query: 306 TSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNF 365 I R P F+ Q+ RW G + + + R ++ + Sbjct: 632 ----ILGRGLTPSGFAAIKSQRFRWAFGAMQILKAHL--PHMIGRSTLNLAQRYHFLTGW 685 Query: 366 VSFLA---MLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRV 422 ++L LV LL L L + S + + + L + + Sbjct: 686 FAWLGDALQLVFAFGSLLWTLGILLFPKAFGLPVVSLALPIFGFMIFKAALGPILYRRTM 745 Query: 423 IFVTG 427 Sbjct: 746 DCPWK 750 >UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L4E1_KORCO Length = 489 Score = 214 bits (544), Expect = 1e-53, Method: Composition-based stats. Identities = 69/473 (14%), Positives = 134/473 (28%), Gaps = 48/473 (10%) Query: 4 LLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYREL 63 LL + L + + + G+ F ++ W + L+ Y + Sbjct: 62 LLIIVPISLALVASLESLSTFETILYGIISFGFTLISWHYLLLVPLAAYYKRMEEIEARK 121 Query: 64 YKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCAR 123 +++++PA NE VIG+ + DYE + V + T + Sbjct: 122 PLLYRPLVSVIIPARNEEKVIGSTIR-SVLESDYEPKEVVVVDDGSTDRTFEIAS--IYQ 178 Query: 124 FPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN 183 P V + G KA LN L +L DA+ +IS ++ Sbjct: 179 GPKVKVLRRELGG-RGKARALNFGLRFA---------RGEVIVLMDADTIISRDAIKELV 228 Query: 184 Y--LVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTC 241 R + V + + E+ L +G Sbjct: 229 RKLQDPRVSAVAGNVMVRN--KVNLLTKLQAIEYIATFHLFRKGLSVLGAVPIISGALGA 286 Query: 242 FSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFL 301 F R + + +D +LTED+D+ + + G Sbjct: 287 FRRNVLES------SGLYDADTLTEDFDVTLKALKSG----------------------- 317 Query: 302 QHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGA 361 + + S+ P+ + RQ+ RW G K S + Sbjct: 318 KIVQASSYALAFTEAPEKLKSLYRQRLRWYRGAYEVLIKHRDAFS--LSGLTMLDFSLIL 375 Query: 362 ISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQR 421 ++N + L + I +++ L + +FS L+ L L + + Sbjct: 376 MNNLIIPLIDFLSIISIIIAILRGLIWPLIIQIILFSTLQILINLFILQLAEERDISLVF 435 Query: 422 VIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTT 474 + Y +L + F + + + + G Sbjct: 436 LPLFAIGYKQFHEILMLKCFFDVIIARMRGKSFSWTFIERRGLEEPKLRAGPQ 488 >UniRef50_B9XRT3 Type II secretion system protein E n=2 Tax=Verrucomicrobia RepID=B9XRT3_9BACT Length = 580 Score = 214 bits (544), Expect = 1e-53, Method: Composition-based stats. Identities = 41/284 (14%), Positives = 95/284 (33%), Gaps = 30/284 (10%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTALRNRV-EGLRLGGSMLMQGLISAEQLAQAL 537 + L+E+ ++T +Q++ L + EG RL ++ + ++S + + ++ Sbjct: 1 MPPVIKSFGERIADALVEDGLLTSKQVEELLEQQKKEGTRLLKLVVEKAIVSEQDMTVSM 60 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 ++ I + +P VA +Y V+P+ ++L + D ++ +++ + Sbjct: 61 GRVLNTPPINLSRISIIPEVADLLPREVAHNYKVIPVSRLENKLFLAMADPLNVLAIDDV 120 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGE------ 651 R V +I I+ L + A + G + + + E Sbjct: 121 KRLTKLDVVPMIASEKSIIDKLSNIDASKSGSMQDIIDDAKKAAEEEKDPDNIEVSGIAV 180 Query: 652 ---------IWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVI 702 + P L IL S I++ E + +G Sbjct: 181 EDVNLDQLAASSEEAPVIKLANLILVQAIKDRASDIHI-----EPFEKVIRLRYRVDG-- 233 Query: 703 SQETLDRVLTIQRELQ-VSMQSLLLKAGLNTEQVAQLESENEGE 745 L V +++Q L + + L+ + +G Sbjct: 234 ---ALLDVTPPPKQMQLALASRLKIMSSLDIAE---RRLPQDGR 271 >UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJA9_PLALI Length = 533 Score = 214 bits (544), Expect = 1e-53, Method: Composition-based stats. Identities = 66/493 (13%), Positives = 139/493 (28%), Gaps = 61/493 (12%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 L L V + + ++ + L F + YW R + + + + R + + + Sbjct: 2 LLPLIVTLLFVTLVNTVFQLTQFDLAYRYW-RSVWK-------KQKPTSRPIDREHLPAV 53 Query: 72 AIMVPAWNETGVIGNMAELAATTLDYENYHIFV-GTYPNDPDTQRDV----DEVCARFPN 126 I +P +NE+ + + E A + +DY + V + + + +E+ P Sbjct: 54 TIQLPMFNESIIAPRILE-AVSRIDYPRDRLQVQILDDSTDHSPEIIAGILEELRQSQPE 112 Query: 127 VHKVVCARPGPTS-KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYL 185 ++ R KA L + +T + DA+ + P L Sbjct: 113 LNIEYLHRTDRQGFKAGALQAAMPLVT---------GEFIAIFDADFIPQPDFLTHLLPY 163 Query: 186 V--ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 ++Q + T H + R + G + Sbjct: 164 FDSPEVAVVQSRWGHLNAHDSVLTQAQQ-FFLDGHHSVEQNGRNRAGYFITFNGTAGIWQ 222 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 R A+ A + +L ED D+ +R + G ++V Sbjct: 223 RSAMEA------AGGWSADTLVEDLDLSYRTQSLGYRIVYVED----------------- 259 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAIS 363 P++ S Q RW G G K L Sbjct: 260 ------YVTPGELPNSVSGLRVQLFRWFKGNAQVGLKILG--KVWKQPLPLSVKIHATAQ 311 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 F F + ++ LL+ A + A + +W+ L+V R+ Sbjct: 312 LFAPFTMLSSLVMLLITGALPLILHAAPEHAGLVKLCYM--GFVWVPAVLLVYG-TPRIR 368 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 F G + + L + ++ ++ ++ + + Sbjct: 369 FDEGPWYIRLAKLVPRTFVFMAMMTGLSCQSSIAVLEAVFKRANQWVVTPKGFSQQSSKK 428 Query: 484 TRSLRPLGQILLE 496 + + + Sbjct: 429 KVRRKLAWYVWPD 441 >UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholderiales RepID=B2UK24_RALPJ Length = 425 Score = 213 bits (542), Expect = 3e-53, Method: Composition-based stats. Identities = 70/467 (14%), Positives = 137/467 (29%), Gaps = 54/467 (11%) Query: 19 AITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRE--LYKPDEKPLAIMVP 76 + I FF+ + + + + R R+ + L ++I+VP Sbjct: 4 LLAKRWISGFVFYYPFFMSYFWMIGGLLHYFLLERGTRRIQHPLALLGVKTYPKVSIIVP 63 Query: 77 AWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPG 136 +NE + + + Y NY I + T ++E+ A++P + VV + Sbjct: 64 CYNEEANVREVISH-LARMRYPNYDIIAVNDGSSDRTGERLNELAAQYPQL--VVIHQSS 120 Query: 137 PTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV---ERKDLIQ 193 KA I + + D + ++ + + + Sbjct: 121 NQGKA---------IGLTTAAQVTDAEYLMCIDGDSILDVDAIAWMIRHLMENPAVGAVT 171 Query: 194 IPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLAD 253 P R + + EFS + G ++ + +GV F +RA+ + Sbjct: 172 G--NPRIRTRSTLLGRMQVGEFSSIVGLIKRTQQVYGRLLTVSGVVVMFRKRAIEEV--- 226 Query: 254 GDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVR 313 + LTED DI ++L+ G T + + Sbjct: 227 ---GYWSNDMLTEDIDISWKLQVGGWTIRYEPRAL-----------------------SW 260 Query: 314 EYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLV 373 P+TF +Q+ RW G + K SL +W + V ML+ Sbjct: 261 ILMPETFRGLYKQRLRWAKGGIQALIKYAPAMLSLRQ-SMMWPIFFEYALSVVWAYNMLL 319 Query: 374 MIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQ 433 +I +L + + P+ L L + + + L Sbjct: 320 VITWSVLGLFVDMPPEWRMEAFPRWHGTLLFITCVLQLLIGCFIDRRYDDGI-----LRY 374 Query: 434 GLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 + +V ++N + V R W Sbjct: 375 FIDTVWYPVAFWILNLVTTVIGFPAVAFQRQRARARWTSPDRGIQQQ 421 >UniRef50_C0QER1 PilB n=10 Tax=Proteobacteria RepID=C0QER1_DESAH Length = 739 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 45/297 (15%), Positives = 94/297 (31%), Gaps = 40/297 (13%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRN-RVEGLRLGGSMLMQGLISAEQLA 534 +G++L + IT Q +AL + G RL +L G I E + Sbjct: 14 TKTIKDQSGAGKVRIGELLSKEGQITSNQFQSALSQHKKTGTRLSSVLLTMGFIDPETII 73 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSL 594 L + + ++ +P +A Y V PL ++ +EL+V + D ++ Sbjct: 74 NVLGRIYNYPVVRLADIKPDPKILKLLPFDIAKRYMVFPLGMKGEELVVTMTEPTDTTAV 133 Query: 595 AALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQA----- 649 L ++VG+ ++ + ++ R +Y + ++ + Sbjct: 134 EELQQEVGKTLKISVSTENDVIQAYRDFYKISEEQYREFIHFDDEKEDDEPVTSVEDFGS 193 Query: 650 --------------------GEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSS 689 E P L IL + S I++ E Sbjct: 194 LVSEAAGELEIEPDDNVSDYDEFRASDAPIIKLVNGILIKAINDGVSDIHI-----EPFE 248 Query: 690 LPLGKFLVTEGVISQETLDRVLTIQRELQ-VSMQSLLLKAGLNTEQVAQLESENEGE 745 L +G +L + + + ++ + L + A L+ + +G Sbjct: 249 RSLQVRYRLDG-----SLYKAMNLPLTIKNAVISRLKILAELDIAE---RRVPQDGR 297 >UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacterales RepID=B6BG61_9RHOB Length = 1140 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 55/424 (12%), Positives = 117/424 (27%), Gaps = 50/424 (11%) Query: 58 MSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDV 117 R + +A+++PA NE VI E + Y+N I V + DT +V Sbjct: 757 RRERFTPLQRQPKVAVVIPAHNEAKVIAQSIESVRASG-YKNLEIIVVDDGSTDDTLLEV 815 Query: 118 DEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPM 177 + + V + K LN + + + DA+ + Sbjct: 816 LKFGHK-SEVRLISQP---NQGKWSALNRAIQ---------STDAEFAVCIDADTQVCKD 862 Query: 178 ELRLF--NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPS 235 + ++ + + + R + + E++ + + + G + Sbjct: 863 AITHLVRHFADPKTGAVAGKIIAGNRV--NLLTRLQAFEYATSQNIERKAFDLINGILVV 920 Query: 236 AGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKER 295 G + A+ F ++LTED D+ ++ G +F Sbjct: 921 PGAIGAWRVEALRK------AGFFSEETLTEDTDLTIQVNRAGYNVVFEPKAKAYT---- 970 Query: 296 EQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLW 355 P+ ++Q+ RW +G+ +K + + Sbjct: 971 -------------------EVPENVGQLLKQRLRWSLGMFQSAWKHKRAIIEGRSIGLVS 1011 Query: 356 RDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMV 415 LA + + ++ +L S +L + Sbjct: 1012 ISDMFVFGYVFPLLAPIADLFVIFMLYNLMAGGWTGDVGSTQQVQTTQYLWAFLALPALE 1071 Query: 416 NRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 I I LL L+ + + + R++ L+ R W Sbjct: 1072 LLIAAIAITTDKDESNWSLLLFPLQRLAYRPLLYFSVIRSI---LRAVTGRLANWGSVKR 1128 Query: 476 DFPS 479 Sbjct: 1129 HGRD 1132 >UniRef50_D1RA94 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA94_9CHLA Length = 620 Score = 212 bits (540), Expect = 4e-53, Method: Composition-based stats. Identities = 72/467 (15%), Positives = 142/467 (30%), Gaps = 57/467 (12%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFI------DVVYWVRRIKRKLSVYRRYP 56 WL ++ + + I + L I + VR R Sbjct: 109 WLSELVMASFHYFEHWLSAAIFITLMGSLGCLMIFNTVMLSWITLVRYSDLYFRFPRLKR 168 Query: 57 RMSYRELYKPDEKPLAIMVPAWNE--TGVIGNMAELAATTLDYENYHIFVGTYPNDPDT- 113 ++ + K + ++I +P +NE VI + A + +Y ++ + V T Sbjct: 169 GLTIADQSKKNSPFVSIHIPCFNEPPELVIETL--NAISRFNYPHFEVIVLDNNTKDPTV 226 Query: 114 QRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDV 173 V+ C + + +KA LN L + DA+ V Sbjct: 227 WAPVEAHCLQLGERFRFYHIDKLAGAKAGALNACLK-------CTASQAELIAVFDADYV 279 Query: 174 ISPMEL-RLFNYL-VERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAG 231 L RL + + +Q + + +H+ + Y E+ ++P + Sbjct: 280 AKEDFLSRLVGFFDDPKIGFVQSCQDYRDWDHSHYQAACYY-EYETHFKLELPGQNEWDV 338 Query: 232 QVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDE 291 G R A+ + + LTED ++ R+ G +++ Sbjct: 339 TYTI-GTMCLIRRTALDEV------GGWAEWCLTEDSEVAVRIHALGFAGYYLKETFGY- 390 Query: 292 AKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLN 351 P+TF + Q+ RW G V Q K + N Sbjct: 391 ----------------------GLIPETFESYKLQRFRWSAGPVQQIQKHWRLYLPWAKN 428 Query: 352 YFLWRDRKG-AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTL-LWL 409 + G + F + + + + + LW S A L + + Sbjct: 429 GLSLAQKFGEIFHSLSIFFSESLSFLINIPILCICLWFAIVKQQSFILPKAVLWAIPIVF 488 Query: 410 NFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRAL 456 ++ N + R++ + + LLS L +A ++A Sbjct: 489 IKNILCNWLSIRLLGGS----WKEYLLSALAARSLIFTRNVAFYKAW 531 >UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC n=10 Tax=Pasteurellaceae RepID=A3N3L7_ACTP2 Length = 411 Score = 212 bits (539), Expect = 5e-53, Method: Composition-based stats. Identities = 59/461 (12%), Positives = 132/461 (28%), Gaps = 57/461 (12%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 + L + + + + L + ++ ++ + +++MVP +N Sbjct: 1 MILEIFSLFVFAYPAVMAFYWAFAGLTYFLFKEKLKVPPNFDQMKHEEVPLVSLMVPCYN 60 Query: 80 ETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTS 139 E+ + L Y NY + + T +D+ R +V + Sbjct: 61 ESDNLDEAIPH-LLNLKYPNYELIFINDGSKDHTGEIIDKWAKRDKR---IVALHQANSG 116 Query: 140 KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV---ERKDLIQIPV 196 KA LNN L D + V+ L + R + Sbjct: 117 KASALNNGLRIA---------RGKYVGCIDGDAVLDYKALDYMVQALESNPRYGAVTGNP 167 Query: 197 YPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDG 256 R + + EFS + G + + +GV F + + + Sbjct: 168 RVRNR--STILGRLQVSEFSSIIGLIKRAQCLMGTIFTVSGVCCLFRKDIMFEI------ 219 Query: 257 IAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYF 316 + +TED D+ ++++ G + + Sbjct: 220 GGWSTNMITEDIDVSWKIQTSGYDIFYEPRAL-----------------------CWVLM 256 Query: 317 PDTFSTAVRQKSRWIIGIVFQGFKT--HKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVM 374 P+T + +Q+ RW G K W + ++ I V+ + ++ Sbjct: 257 PETINGLFKQRLRWAQGGAETMMKYFPQIWRLKNRRLWPMF------IEYIVTAIWASLL 310 Query: 375 IQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQG 434 + +LL Y ++ + L + L F + + + G Sbjct: 311 LVSILLSIYNLIFDNQIGLLDWAELKPSIAILFIAFFTQLSISLYIDNRYEKGVVKYVFS 370 Query: 435 LLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 + L+W +N + + + + + W Sbjct: 371 CIWYPWLYWS--LNTITLLCGIPKAIFRNKTKLAVWTSPDR 409 >UniRef50_Q1AWU6 Type II secretion system protein E n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWU6_RUBXD Length = 572 Score = 212 bits (539), Expect = 5e-53, Method: Composition-based stats. Identities = 57/256 (22%), Positives = 94/256 (36%), Gaps = 17/256 (6%) Query: 494 LLENQVITEEQLDTALRNRVEGLR-LGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQ 552 LL +TEEQL A+ + R LG ++ G +SAE+LA+A A + G+ + Sbjct: 21 LLSEGSLTEEQLHRAVEAQKHDPRDLGQILVSLGYVSAEELARARARRLGLGYLEPSERD 80 Query: 553 IPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLR 612 + + + +P V + LPLRLE L+ D D +L L G V V+ Sbjct: 81 VDPAALGLVPERVLRRHRALPLRLEEGRLVAALADPTDLQALDDLRMLSGYPVTPVVATE 140 Query: 613 GQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTE--QQAGEIWRQYVPHQFLFAEILTTL 670 I +A + + + + P L + IL Sbjct: 141 EAIRRLQIKLFAVDERVTGILREAELREAREEDDDLDLGAGAGAEERPVIRLVSSILQQA 200 Query: 671 GHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQV-SMQSLLLKAG 729 S I++ E L + +G+ L V++I LQ + L L +G Sbjct: 201 ISDGASDIHL-----EPRPGRLAVRVRVDGL-----LREVMSIPHRLQSGVISRLKLVSG 250 Query: 730 LNTEQVAQLESENEGE 745 L+ + +G Sbjct: 251 LDIAE---RRLPQDGR 263 >UniRef50_C8NRH6 Group 2 glycosyl transferase n=5 Tax=Corynebacterineae RepID=C8NRH6_COREF Length = 478 Score = 212 bits (539), Expect = 6e-53, Method: Composition-based stats. Identities = 84/468 (17%), Positives = 147/468 (31%), Gaps = 66/468 (14%) Query: 17 VIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVP 76 ++ L +M++ L D FI R+ R ++ R + + K I+VP Sbjct: 64 IVIAGLCTLMYVVTLTDRFI----MFRKGLRADAIMRVTDEEALA-VPVERLKAYTILVP 118 Query: 77 AWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNV-HKVVCARP 135 A+ E VI + A + DY + + V + D A + + Sbjct: 119 AYGEPEVITQLV-TAMNSFDYPPHLLQVLLLLEEDDLPTIEAAERANLGEISTIIKVPPA 177 Query: 136 GPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE----RKDL 191 P +K N L T + DAED+ P++LR E Sbjct: 178 QPRTKPKACNYGLHFAT---------GEIVTIFDAEDIPDPLQLRRVVVAFENSPANTVC 228 Query: 192 IQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALL 251 +Q + T + E+ +P + VP G + L Sbjct: 229 VQSRLSYRNARQNLLT-GWFTIEYDVWFNFLLPGIMRMQAPVPLGGTSNHLVTEVLREL- 286 Query: 252 ADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMIC 311 A+D ++TED D+G R+ +G + +EA Sbjct: 287 -----GAWDPYNVTEDADLGVRIAARGYRTAVLDSVTWEEANSDTI-------------- 327 Query: 312 VREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAM 371 +RQ+SRW G + W + +L R+ G + M Sbjct: 328 ----------NWLRQRSRWYKGYLQ------TWLVYMRRPRWLVREL-GVLPALRFTFLM 370 Query: 372 LVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLW---LNFGLMVNRIVQRVIFVTGY 428 + +L W + +A L++ L ++ N + + Sbjct: 371 AGTPIVAVLNLLFWYLSLTWILGQPATIAAMFPPLVYYPALICLILGNAATMYMNLIGCR 430 Query: 429 YGLTQGLLS-VLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTH 475 G L+ VL L+ +A + Q++ R W+KT H Sbjct: 431 EGRDPLLVVAVLTFPVYWLLMSIAALKGTWQLI----TRPSYWEKTAH 474 >UniRef50_C8W724 Glycosyl transferase family 2 n=4 Tax=Coriobacteriaceae RepID=C8W724_ATOPD Length = 435 Score = 211 bits (538), Expect = 6e-53, Method: Composition-based stats. Identities = 61/454 (13%), Positives = 125/454 (27%), Gaps = 51/454 (11%) Query: 20 ITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWN 79 + + I+ + + F + Y+ + + +++ ++ + A + A N Sbjct: 8 LGITPIVVFNFIIWLFFTLAYFYQIVYILRVMFKGEVKLPEA----KKQHRYAFFIAAHN 63 Query: 80 ETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPT 138 E VIGN+ E +FV T + + A Sbjct: 64 EEPVIGNLVRSILSQDYPRELMDVFVVADACTDKTAEEARKAGA-----ITWERNDLARK 118 Query: 139 SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYP 198 K+ ++ D I + FI+ DA++++SP L++ N + L+ Sbjct: 119 GKSWVMDYGFDRI---LNEYGDKYEAFIVMDADNLVSPSYLKIMNQAFDAGYLVCTSYRN 175 Query: 199 FEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIA 258 + + + S Y F R + +G G S R + Sbjct: 176 SKNFDSSWVSSAYATWFMREAKFLNNARMMMGTSCAVSGSGWMVSSRIIK------GMHG 229 Query: 259 FDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPD 318 +D +LTED + + DE P Sbjct: 230 WDFHTLTEDIQFSTFCCAHNIQIGYAPAEFFDEQ------------------------PL 265 Query: 319 TFSTAVRQKSRWIIGIVFQGFKT------HKWTSSLTLNYFLWRDRKGAISNFVSFLAML 372 TF + Q+ RW G F + L G I + +S A + Sbjct: 266 TFKASWTQRMRWTKGFYQVFFSYGFDLLKGIFKGQFASYDMLMTIAPGMILSLLS--AFI 323 Query: 373 VMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLT 432 LL+ + +++ F ++ Sbjct: 324 NGTYLLVGYLSHGFVATDAEIAMSVGSLVMTVFSMYVVFFILALITTISEYKHFHVKKKW 383 Query: 433 QGLLSVLRLFWGNLINFMANWRALKQVLQHGDPR 466 + ++ + AL + ++ + Sbjct: 384 RIFTNLFTFPIFMMTYIPITVAALFKKVEWVPTK 417 >UniRef50_A4J3A4 Type II secretion system protein E n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J3A4_DESRM Length = 554 Score = 211 bits (538), Expect = 6e-53, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 91/261 (34%), Gaps = 25/261 (9%) Query: 486 SLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVA 544 +G ILLE I+++QL AL +R LG ++ G ++ +Q++Q L Q + Sbjct: 6 PKSLIGNILLEKGAISQQQLREALNNHRQTDQPLGQVLVDLGYVTKKQVSQYLDYQEQME 65 Query: 545 WESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRK 604 E I +I +L+ + Y + PL + ++L V + + +++ L Sbjct: 66 EEVIHIQEIDKALLKLFSEQILRRYKIFPLFKKGNKLTVAMAEPANVIAIDDLKVISNLD 125 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFA 664 + V V I + +Y + + + + VP L Sbjct: 126 IVPVEVQEQIIELAIDLYYDITKREVNDKEQRLVITDE------------EEVPIIQLVY 173 Query: 665 EILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSL 724 +I+ S I++ E + +G++ + + + Sbjct: 174 QIIDRAIDQGASDIHI-----EPQEKRVRIRYRIDGMLILGM----ELAPTLDKAIISRI 224 Query: 725 LLKAGLNTEQVAQLESENEGE 745 + + LN + +G Sbjct: 225 KIMSQLNIAE---KRVPQDGR 242 >UniRef50_Q47AJ5 Type II secretion system protein E:General secretory system II, protein E, N-terminal n=4 Tax=cellular organisms RepID=Q47AJ5_DECAR Length = 568 Score = 211 bits (538), Expect = 7e-53, Method: Composition-based stats. Identities = 51/272 (18%), Positives = 101/272 (37%), Gaps = 21/272 (7%) Query: 480 VTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALA 538 ++ RPLGQIL+ +++E+QL AL + +G ++ G ++ L QAL+ Sbjct: 1 MSTTALQRRPLGQILISEGILSEDQLRIALLEQMKQNQPIGKLLVSLGFVTEATLRQALS 60 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPL--RLENDELIVGSEDGIDPVSLAA 596 E G + I + +P +A + +LPL N L + D D V L Sbjct: 61 ENLGKQSIDLSHAVIDPQALKLVPRDLAKRHHLLPLDYDRTNRRLALAISDINDIVGLDR 120 Query: 597 LTRKV--GRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR 654 + ++ G ++ ++ +I + +Y D +L+ + + Sbjct: 121 VRSQLEEGTEIETLLAGESEIDHAIDQYYGHELSID--GILHEIETGEIDWHSLSATDNE 178 Query: 655 QYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQ 714 P L ILT S I+ E + L +G L ++ + Sbjct: 179 YSQPVVRLIDSILTDAVKREASDIH-----FEPEANFLRIRYRIDG-----MLRQIRALH 228 Query: 715 REL-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + +G+N ++ + +G Sbjct: 229 KSYWPAMTVRIKVLSGMNIAEM---RAPQDGR 257 >UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C9M4_ACAM1 Length = 492 Score = 211 bits (538), Expect = 7e-53, Method: Composition-based stats. Identities = 76/473 (16%), Positives = 145/473 (30%), Gaps = 51/473 (10%) Query: 26 MFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIG 85 +++ L I + + I R ++RR + D + I +P +NE V+ Sbjct: 16 LYLGILTLIAIYSFHKISIIWRYY-LHRRREISPLHKFSDADLPQVTIQLPLFNEMYVVD 74 Query: 86 NMAEL-AATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHK---VVCARPGPTSKA 141 + E AA + I V + +T+ H + KA Sbjct: 75 RLLEAVAALEYPVDKLQIQVLDD-STDETREICRAKVRELKQRHLNIDYIHRCDRKGYKA 133 Query: 142 DCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFN--YLVERKDLIQIPVYPF 199 L L + T ++ DA+ V SP L + + ++Q Sbjct: 134 GALAYGLQSAT---------GDLVMIFDADFVPSPDTLINMVHYFANPKVGMVQARWGHI 184 Query: 200 EREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAF 259 R ++ T + R G + + + D + Sbjct: 185 NRHYSILTE-IQALMLDGHFVTEQTSRNRSGCFFNFNGTAGIWRIQTIE------DAGGW 237 Query: 260 DVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDT 319 ++TED D+ +R + KG I++ I V P Sbjct: 238 QHTTVTEDLDLSYRAQLKGWECIYLPN-----------------------IVVPAELPME 274 Query: 320 FSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLL 379 ++ Q+ RW G K + + + ++N ++L +LV++ L L Sbjct: 275 MNSFKSQQFRWAKGASQVAKKLLLPILTSNAPGHVKLEAFFHLTNNFNYLLLLVLLLLSL 334 Query: 380 LLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVL 439 L W + +L+T L L V + QR + LT L ++ Sbjct: 335 PYQL-FLAETGWRYGLAIHLPLFLITTLSLLAFYSVAQEEQR--GQNSPWKLTSNLFLLM 391 Query: 440 RLFWGNLIN-FMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLG 491 + G IN +A + L +V + + + + + Sbjct: 392 SVGIGLSINQSLAVYDGLFRVGRDFVRTPKHGVTSNEEDWKTRKYRAARNLVP 444 >UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGB3_9BACT Length = 396 Score = 211 bits (536), Expect = 1e-52, Method: Composition-based stats. Identities = 63/421 (14%), Positives = 127/421 (30%), Gaps = 56/421 (13%) Query: 62 ELYKPDEKPLAIMVPAWNETGVIGNMAE-LAATTLDYENYHIFVGTYPNDPDTQRDVDEV 120 E+ ++++VPA NE VI + + + + + T+ +D Sbjct: 18 EIDLEYNPKVSVLVPAHNEEAVIEGCLDCMNKLEYKTGQLEVIILNDRSSDGTKELIDNF 77 Query: 121 CARFP--NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME 178 + P ++ KA + E A ++ DA+ + Sbjct: 78 LCKNPQSHIRAHHRPMSSEPGKAAAM---------KEIIATLKSEIIVIFDADYLPQADL 128 Query: 179 LRLF--NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 ++ + + V + T + E + D VR Sbjct: 129 IKRLISPFKDPQVGATMGRVVTYNANANIMTKLID-LERRSGYAIDQNVRNYFDLLPQFG 187 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G A+ + +D ++LTED D+ ++L G ++ Sbjct: 188 GTTGGIRLSALEDV------GGWDTRTLTEDTDLTYKLYLNGYKIKYLNAA--------- 232 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWR 356 E P+T+ +Q RW G K T L + Sbjct: 233 --------------ACYEETPETWQARYKQVRRWAYGHNDCMIKHLIPTLMHKDKNLLRK 278 Query: 357 DRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVN 416 + + A L+++ ++ L + SA L+TL L G Sbjct: 279 LDALLLLTIYAAPAALLVLSIVAFLFGNI----------SVNMSASLITLFLLFCGFGNF 328 Query: 417 RIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVL--QHGDPRRVAWDKTT 474 ++ + + +F + I+ +A+ AL + + G + ++WDKT Sbjct: 329 SPFFQMFAACIKDRQPHCIRYIPYIFVSSTISMLASTHALLLLPIEKLGLKKSLSWDKTL 388 Query: 475 H 475 Sbjct: 389 R 389 >UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2RZV9_SALRD Length = 510 Score = 210 bits (535), Expect = 1e-52, Method: Composition-based stats. Identities = 69/461 (14%), Positives = 117/461 (25%), Gaps = 59/461 (12%) Query: 18 IAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKP------- 70 + L V + + Y + L D Sbjct: 1 MLAALEVAVPALYAVAIVVLTAYGGNLLWLALVHAASERLRDGPVPDPDDLPVPDDDWPV 60 Query: 71 LAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCA-RFPNVH 128 + + +P +NE V + + I V D T+R V + V+ Sbjct: 61 VTVQLPLYNEAEVAHRLIDACVQLDYPRSRLDIQVLDDSTDATTERVARRVAHWQAEGVN 120 Query: 129 KVVCARPGPTS-KADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV- 186 R T KA L N L + DA+ V P LR Sbjct: 121 ITHVRRDDRTGYKAGALANGLQRA---------RGDLIAIFDADFVPRPSFLRRLVPRFF 171 Query: 187 --ERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSR 244 ++Q R+ + T + VRE + G + R Sbjct: 172 DAPDLGMVQARWGHLNRDDSLLT-KVQAFGLDAHFAIEQRVRELAGCFLNFNGTAGVWRR 230 Query: 245 RAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHA 304 + D + +LTED D+ +R + +G +V Sbjct: 231 ACIE------DAGGWAHDTLTEDLDLSYRAQLQGWRLTYVPAA----------------- 267 Query: 305 RTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISN 364 P + Q+ RW G K WR + + Sbjct: 268 ------EAPAELPPDMNALRAQQFRWAKGGAETALKLTGRLWRSAQP---WRVKLEGTFH 318 Query: 365 FVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIF 424 + A ++ L A L H + L + QR + Sbjct: 319 LTAHFAFPFILLAALTHAPLLLLKGIGHGPGEVYFAVMGFGLFGFAGFFLAQLFAQRAL- 377 Query: 425 VTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDP 465 + + L + G + ++N A+ Q L+ D Sbjct: 378 ---HPDWRRRLRLFVPFMAGTMGLSLSNTSAVWQALRGTDT 415 >UniRef50_Q1Q109 Strongly similar to general secretory system type II protein, ATPase component n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q109_9BACT Length = 582 Score = 210 bits (535), Expect = 1e-52, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 107/281 (38%), Gaps = 24/281 (8%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALR-NRVEGLRLGGSMLMQGLISAEQLA 534 R GQ+L EN TE+Q+ AL + G LG ++ ++ Q+ Sbjct: 6 KSKPDAKSDAQRRLFGQLLKENGFATEDQIQEALAVQKQNGGLLGDILISMNYVTDPQIM 65 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDE--LIVGSEDGIDPV 592 Q L+E GV +I+ ++P +I +PA++A Y ++P+ E ++ + + + + Sbjct: 66 QVLSEYLGVEIVNIEDREVPGDVINLVPAAIAQLYRIIPISYEQEKQVITIAQANALAIE 125 Query: 593 SLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWL-------T 645 +L L + V+ V+ + + L +Y ++ + +L + Sbjct: 126 TLDDLRLVLKLNVKPVLCHKDSVARALEKYYPKKHESVEQLLLEFKEDKSYAQSVSGNYI 185 Query: 646 EQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQE 705 + + + P + + S I+ +E +GV+ + Sbjct: 186 DIEELKKMASTAPVKKWVGLMFLYAVLDKASDIH-----YEAFEDSFRVRYRIDGVLYER 240 Query: 706 TLDRVLTIQRELQVSM-QSLLLKAGLNTEQVAQLESENEGE 745 ++ REL + + + + AG++ + +G Sbjct: 241 -----VSPPRELGIPINSRIKVMAGMDISE---RRLPQDGR 273 >UniRef50_C9RLV9 Type II secretion system protein E n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLV9_FIBSS Length = 641 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 94/280 (33%), Gaps = 28/280 (10%) Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALA 538 + + + +G++LL+ IT++QL+ AL + G RLG ++ I ++L L Sbjct: 63 QMQSVTKKRIGEMLLDQGFITQDQLNEALEKQKTSGGKRLGRVLVDLKFIDEKKLTDILC 122 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLEND---ELIVGSEDGIDPVSLA 595 Q V + +D ++ + +P ++PL + D L+V D + Sbjct: 123 CQFEVPYVKLDTIKLDEKVYEFIPEDQCKANKIVPLYVTKDARQALVVAMADPTNVRLRD 182 Query: 596 ALTRKVGRKVRYVIVLRGQIVTGLRHWY-ARRRGHDPRAMLYNAVQHQWLTE-------- 646 ++ KV R V V+ I + + + A L L Sbjct: 183 SIKFKVKRNVDVVMASEQDIKKTIDTLFAGHGPAEESLAELIGGSGEDELETVERGNGNS 242 Query: 647 QQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQET 706 + + + ++ + S I++ E L +G Sbjct: 243 DEPELTDEEGRQVVKIVTTLIHEAIARHASDIHL-----EPQETFLKLRYRIDG-----D 292 Query: 707 LDRVLTIQRELQV-SMQSLLLKAGLNTEQVAQLESENEGE 745 L + I L + + L + ++ + +G Sbjct: 293 LQVMSPIPARLMPQILSRIKLLSKMDIAE---KRKPLDGR 329 >UniRef50_A1WFR3 Type II secretion system protein E (GspE) n=8 Tax=cellular organisms RepID=A1WFR3_VEREI Length = 578 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. Identities = 50/268 (18%), Positives = 84/268 (31%), Gaps = 16/268 (5%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQAL 537 +G++L+++ ++ L+ AL + G LG + GL+S +AQAL Sbjct: 13 QPQDAPLPRPRIGELLVQSGKLSARDLERALSAQQEMGGLLGRVFVRLGLVSDADVAQAL 72 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAAL 597 + Q G+ S + + + V PLRLE D+L V D + AL Sbjct: 73 SAQLGIPLVSEHDFPDLLPEVEGLRPEFLAANNVCPLRLEGDQLHVAMAVPQDAFVVKAL 132 Query: 598 TRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYV 657 G + + L I L + + Sbjct: 133 HLATGHAIVPYLALESAIDKALAE---PANAVPQPQDDGFGDGLDGSDFVEHLKDLASEA 189 Query: 658 PHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQREL 717 P L I+ + + S I++ E L +GVI L R Sbjct: 190 PVIRLVNTIIGRVIDLRASDIHL-----EPFDDGLHVRYRIDGVIHPGELV----PPRLS 240 Query: 718 QVSMQSLLLKAGLNTEQVAQLESENEGE 745 + L A L+ + +G Sbjct: 241 AAVNSRVKLLAHLDIAE---RRLPQDGR 265 >UniRef50_Q0A8B9 Type II secretion system protein E n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0A8B9_ALHEH Length = 587 Score = 209 bits (533), Expect = 2e-52, Method: Composition-based stats. Identities = 57/269 (21%), Positives = 91/269 (33%), Gaps = 16/269 (5%) Query: 481 TGDTRSLRPLGQILLENQVITEEQLDTALRNRVE--GLRLGGSMLMQGLISAEQLAQALA 538 + +R LG ILLE +I E L AL L+LG ++ I+ EQL AL Sbjct: 15 KARGKQVRRLGGILLERGLIDETTLRAALDTHRAQPHLQLGRWLVEHRHITREQLEDALC 74 Query: 539 EQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT 598 EQ G+ + + + +P + L VLPL L+ + D LA L Sbjct: 75 EQLGIPRVDLAGFVAKPEVAGLIPYEMCLRLNVLPLARHRSVLMAATATPTDEELLANLR 134 Query: 599 RKVGRKVRYVIVLRGQIVTGLRHWYAR--RRGHDPRAMLYNAVQHQWLTEQQAGEIWRQY 656 G V V+ QI + + Y G + L + + L Q E Sbjct: 135 FHTGLNVEPVLAPPHQISSAINRSYKSLAIGGEEGMDTLLTTDEDRDLRRDQEIESQASS 194 Query: 657 VPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRE 716 P L ++ S I+ L +G I + L + + Sbjct: 195 RPVVRLVNTVILQAISRGASDIH-----FMPRENDLAVMFRIDGAIQRVRLVD----KAQ 245 Query: 717 LQVSMQSLLLKAGLNTEQVAQLESENEGE 745 L + + + +N + +G Sbjct: 246 LAAVVARIKILGRMNIAE---KRLPQDGH 271 >UniRef50_B0S8Q3 Type II secretory pathway ATPase, protein E n=7 Tax=Leptospira RepID=B0S8Q3_LEPBA Length = 558 Score = 209 bits (533), Expect = 3e-52, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 99/264 (37%), Gaps = 18/264 (6%) Query: 487 LRPLGQILLENQVITEEQLDTALRNR-VEGLRLGGSMLMQGLISAEQLAQALAEQNGVAW 545 + LGQILLE+ ++T + L+ + + L L + +GL S + +ALA+ + + + Sbjct: 2 RKSLGQILLEDGILTIKDLEDISKQQEKTNLPLTHIIQKKGLASETDILKALAKLHRMEF 61 Query: 546 ESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALT-RKVGRK 604 + + +++P + ++P ++ ++ V + D D + + G + Sbjct: 62 YDKLEFVASDEIFSKIPLKLVQRSKIVPFLVKGKKVFVATSDPTDLHPMDDMRSFLKGYE 121 Query: 605 VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWR--QYVPHQFL 662 +++V+ +I+ + + + M L++ + P + Sbjct: 122 IQFVLATENEIMRIVHSQFDKTTAEAKEMMDEMDGSFGDLSDAFESDALDLSNEAPIIKM 181 Query: 663 FAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSM- 721 IL+ S I++ E + +GV L +VL + + Sbjct: 182 VNVILSQAVSERASDIHI-----EPFEKSVIVRYRVDGV-----LQKVLNPPKSYLAGIS 231 Query: 722 QSLLLKAGLNTEQVAQLESENEGE 745 + + + LN + +G Sbjct: 232 TRIKIMSNLNIAE---NRLPQDGR 252 >UniRef50_A3ZX01 General secretion pathway protein E n=3 Tax=Planctomycetaceae RepID=A3ZX01_9PLAN Length = 575 Score = 209 bits (532), Expect = 3e-52, Method: Composition-based stats. Identities = 43/262 (16%), Positives = 88/262 (33%), Gaps = 18/262 (6%) Query: 492 QILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAW 551 +IL+ + +I+ EQL+ R + + G + E +AL + G+ + + Sbjct: 6 EILVRHGLISAEQLEIVRREQKLPGDAIERAVELGFVDEEDALKALGVEVGLDFVDLTTA 65 Query: 552 QIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVL 611 I SL+ +P + ++ PL+ N +IV + D D L + G V V+ Sbjct: 66 DIDLSLLKTLPQRLIYRQSLFPLQRRNGSVIVATSDPFDLYPLDEVAAVTGLSVVPVLAS 125 Query: 612 RGQIVTGLRHWYARRRGH--------DPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLF 663 R +I ++ + A + ++ Q L Sbjct: 126 RVEIAKLIKANLGVGGETVEGLLALKEEDAGSDIELLDDIESDGSELSEMAQEASVVRLV 185 Query: 664 AEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQS 723 EI+ S +++ E L +G++ + + I + Sbjct: 186 NEIMFEAIESRASDVHI-----ESQGSGLVVRYRIDGMLHSQPVPP--EINYFQAAIISR 238 Query: 724 LLLKAGLNTEQVAQLESENEGE 745 L + + LN + +G Sbjct: 239 LKIMSRLNIAE---KRLPQDGR 257 >UniRef50_A6NRQ7 Putative uncharacterized protein n=4 Tax=Bacteria RepID=A6NRQ7_9BACE Length = 427 Score = 209 bits (532), Expect = 3e-52, Method: Composition-based stats. Identities = 65/449 (14%), Positives = 128/449 (28%), Gaps = 45/449 (10%) Query: 34 FFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEK--PLAIMVPAWNETGVIGNMAELA 91 FFI V++ V + + + R + + K A ++ A NE GVIG + + Sbjct: 12 FFIAVLFTVLYFYQLVYLGVGLVRRKHPPRLPENCKFHRYAAVISARNEEGVIGELIQCL 71 Query: 92 -ATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDA 150 + ++V DT AR K L+ + Sbjct: 72 KQQNYPSDLLDVYVIADNCTDDTA-----GAARAAGAIVYEREDQRLKGKGYALDWLFHH 126 Query: 151 ITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVE--RKDLIQIPVYPFEREWTHFTS 208 + ER + +++ DA++++ +R N + + + D + ++ Sbjct: 127 LAAEER---DVYDAYLIFDADNLVDKNFVREMNRVFDTGKYDALTSYRNSKNFGDNWISA 183 Query: 209 MTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDY 268 Y F R L +G G S + + + LTED Sbjct: 184 G-YSIWFLREARFLSYPRMLLGSNCHVSGTGFLVSAKVIRE------NGGWPYHLLTEDI 236 Query: 269 DIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKS 328 + KG + VV + + P TF + Q+ Sbjct: 237 EFSVSSAVKGFRIGYCDAAVVYDEQ-----------------------PTTFRQSWDQRL 273 Query: 329 RWIIGIVFQGFKTH--KWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESL 386 RW G K L+ W ++ L L + ++ L Sbjct: 274 RWSKGFYQVDMKYTLPLLKGCTRLDRTSWSCYDMLMTVAPGMLLTLAVFLFNGIICAACL 333 Query: 387 WPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNL 446 + + S M F + + + + VL +F + Sbjct: 334 MEPPYIARYVISMCLEFMGSSLFTFYMGLLAYGIITVLSEWKNISAPAVKKVLYVFLFPV 393 Query: 447 INFMANWRALKQVLQHGDPRRVAWDKTTH 475 F +L +++ + + + Sbjct: 394 FMFTYIPISLAALVRRVEWKPIYHSSAKQ 422 >UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobiales RepID=A7INQ0_XANP2 Length = 905 Score = 209 bits (532), Expect = 4e-52, Method: Composition-based stats. Identities = 71/452 (15%), Positives = 126/452 (27%), Gaps = 50/452 (11%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYR- 61 WL V WL + + + + L ++Y + + + R Sbjct: 363 WLAMVVDYWLNHYVTGGDYVTLALSVVMLVPLVFVLLYRIEEMAAIAFGSGPRRLIDARK 422 Query: 62 ----ELYKPDEKPLAIMVPAWNET-GVIGNMAELAATTLDYENYH-IFVGTYPNDPDTQR 115 ++I VPA+ E ++ + A L+Y N+ I + DP Sbjct: 423 AAVVPTVPSRFPKVSIHVPAYREPPEMLKQTID-ALAALEYPNFEAIIIINNTPDPAMVE 481 Query: 116 DVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVIS 175 V E CA K + A KA L LDA + DA+ V++ Sbjct: 482 PVREYCAALGERFKFINAEKVAGFKAGALRIALDATA-------PDAEIIGVIDADYVVT 534 Query: 176 PMELRLFN--YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQV 233 P L+ + L+Q P + + + E++ + R V Sbjct: 535 PDWLKELVPVFDDPTVGLVQAPQDHRDADRSLLHEAMNA-EYAGFFDIGMVQRNEDDAIV 593 Query: 234 PSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAK 293 G R A+ + ++ ED D+G + E G + R Sbjct: 594 -VHGTMCLIRRAAMLE------AGNWSSDTICEDTDLGLTIAENGWKTHYTRKRYGY--- 643 Query: 294 EREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYF 353 PD+F +Q+ RW G K W L Sbjct: 644 --------------------GLLPDSFEAFKKQRHRWAYGGFQIIKKH--WRKFLPNRSR 681 Query: 354 LWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGL 413 L +K + ++ +A + P F + +L Sbjct: 682 LTTAQKRHFVLGWISWLGSESVGAVMAIASLAFVPFVLLFGVSVPAHVLTLPILITFLVY 741 Query: 414 MVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGN 445 +++ + + V G Sbjct: 742 LMHFVSLYRLRVETTPMRMLGAAVAASAVQYT 773 >UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2 Tax=Synechococcus RepID=Q2JKN6_SYNJB Length = 493 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 65/437 (14%), Positives = 126/437 (28%), Gaps = 71/437 (16%) Query: 53 RRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELA-ATTLDYENYHIFVGTYPNDP 111 +R + EL +A+++PA NE+ V+ + + ++ + Sbjct: 113 QRDGQSGLSELGGLALPRVAVLIPAKNESAVLPRLLHSLTQLRYPTSHLELWAIDDNSSD 172 Query: 112 DTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAE 171 T + E P++ K+ LN VL ++ DA+ Sbjct: 173 ATPEVLREAQKWIPHLRVYRRQPGRGGGKSGALNEVLPL---------TQGEIILVCDAD 223 Query: 172 DVISPMELRLF------------NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELH 219 V+ L + +Q+ +T + S+ + Sbjct: 224 AVVPSDFLARTLPLFVQVGSLRSRFSRRTVGAVQVRKALSNPSVNFWTLGQVAEMASDAY 283 Query: 220 GKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGM 279 R A+ G G G R + ++ +LT+D D+ F+L G+ Sbjct: 284 --FQQQRVAVRGIGELRGNGQLVRRDVLEK------CGGWNEATLTDDLDLTFKLHLAGV 335 Query: 280 TEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGF 339 F+ P + E ++ + Q+ RW G + Sbjct: 336 DIAFLPEP-----------------------AIVEEGVTSWKSLWHQRCRWAEGGYQRYL 372 Query: 340 KTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSG 399 W L R+ ++ V A + LL + S Sbjct: 373 DY--WPGIL--------GRRMGMAKTVDLWAFFISQYLLPMALVPDTLWVLLTGHSSVLL 422 Query: 400 SAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQV 459 L L+ + R +++V + G L ++ + + R Sbjct: 423 P--LNALVMGCLTVAFYRGLRQVEGLRGRRLWWHTALGLVYMLHWLPVMIATTARMCV-- 478 Query: 460 LQHGDPRRVAWDKTTHD 476 P+R+ W KT H Sbjct: 479 ----QPKRLRWVKTVHH 491 >UniRef50_A5G0G3 Glycosyl transferase, family 2 n=1 Tax=Acidiphilium cryptum JF-5 RepID=A5G0G3_ACICJ Length = 903 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 68/419 (16%), Positives = 123/419 (29%), Gaps = 43/419 (10%) Query: 42 VRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYH 101 ++ R +++ + NE + A LDYEN+ Sbjct: 415 FDLVETLFGRVRMRHFEPVPAAPGTKLPKVSLHLAICNEPPEMVKQTLNALAALDYENFE 474 Query: 102 IFVGTYPN-DPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANF 160 + V DP V CAR + P KA LN L Sbjct: 475 VLVIDNNTKDPAVWEPVAAHCARLGKQFRFFTLGKHPGYKAGALNFALR-------ETAP 527 Query: 161 AFAGFILHDAEDVISPMELRLF--NYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSEL 218 + D++ ++ P LR + Q P + + + F M + E++ Sbjct: 528 DAEIVGVLDSDYIVDPDWLRCMVPAFADPNVGFTQSPQDYRDNDGSLFKRMMFW-EYAGF 586 Query: 219 HGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKG 278 + R + G T + A+ A + +TED ++G RL +G Sbjct: 587 FHIGMVNRNERNAVIQ-HGTMTLIRKAALDA------EGGWAEWCITEDSELGLRLFREG 639 Query: 279 MTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQG 338 ++ S R PD F+ +Q+ RW G + Sbjct: 640 YEAVY-----------------------SKRSFGRGVMPDDFNAFRKQRYRWAYGAMRIS 676 Query: 339 FKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFS 398 + K S R ++ ++ ++ + + LLL S F Sbjct: 677 RRHWKAFLSPFDRTLTIGQRWHFVTGWLPWIGDALGLAFLLLGLAWSAGLILDPVRFEFP 736 Query: 399 GSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALK 457 +++ + L +V V G L + + + A W+ L Sbjct: 737 ILLFMLPSIGLFAFKIVQIFALYAARVPCGVG--DRLGAAVAGLALSHTIGKAVWKGLF 793 >UniRef50_A8ZYV7 Type II secretion system protein E n=11 Tax=Deltaproteobacteria RepID=A8ZYV7_DESOH Length = 577 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 51/274 (18%), Positives = 96/274 (35%), Gaps = 25/274 (9%) Query: 484 TRSLRPLGQILLENQVITEEQLD-TALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNG 542 R+ + LG++L++ +TEE+L + GL+LG ++ +G++S + ++ Q G Sbjct: 10 KRTRKKLGEMLVDAGYLTEERLTGYVAAQKRSGLKLGQFLIREGVVSESMIVDLVSRQAG 69 Query: 543 VAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVG 602 + + + L + +V+ Y +PLR N L+V D +D SL A+ + Sbjct: 70 IQRFDPAEFPVTMELAKSLAETVSRKYGAVPLRRGNHLLLVAMTDPLDIRSLDAIEDECD 129 Query: 603 RKVRYVIVLRGQIVTGLRHWYARR----------RGHDPRAMLYNAVQHQWLTEQQAGEI 652 +V VI + Y R TE + + Sbjct: 130 LEVEPVICTEQEFSHLFTQVYGTRIDGFAGEGYDLTETMDYGEDEEPADAGATEISSLQH 189 Query: 653 WRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLT 712 + P L +L S I++ + L +GV+ V Sbjct: 190 MAEEAPVVRLVNALLAQAVRQGASDIHIS-----PEKRYVQVRLRVDGVLH-----EVPA 239 Query: 713 IQREL-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 + L + L + A L+ +G Sbjct: 240 PPKTLFLSIVSRLKILANLDIS---VSRIPQDGR 270 >UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WM17_9SYNE Length = 475 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 62/489 (12%), Positives = 139/489 (28%), Gaps = 74/489 (15%) Query: 3 WLLDVFATWLYGLKVIAITLAVIMFISGLDDFF---------IDVVYWVRRIKRKLSVYR 53 WLL F + + V++ + L F + V + + + Sbjct: 43 WLLTAFLHKVSWGHEFVLIATVVLAVYALRVIFARPQSPPIPLPSVESLTQTSIDRASLE 102 Query: 54 RYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPD 112 L ++++V A NE VIG++ E + Y +++ + Sbjct: 103 AVDEAGEIPLDSKSWPYVSLLVAAKNEEQVIGSLVESLLHIDYPTDRYDLWIIDDYSTDA 162 Query: 113 TQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAED 172 T +D + R ++ + K+ LN V + DA+ Sbjct: 163 TPEILDNLVKRHRQLNVIHRGPGAIGGKSGALNLVWPQ---------TKGDLLAVFDADA 213 Query: 173 VISPMELRLFNYLVE------RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVR 226 +S LR + + + +Q+ +T + + + R Sbjct: 214 QVSSDLLRYVVPMFDPEKGGKKTGAVQVRKAIANATKNFWTRGQKAEMALDCY--MQQRR 271 Query: 227 EALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRF 286 A+ G G G R A+ ++ +++T+D D+ +L + + Sbjct: 272 IAVGGIGELRGNGQFVRREAI------AQCGGWNEETITDDLDLTIQLHLQQWDIGLLFA 325 Query: 287 PVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTS 346 P V E Q++RW G + Sbjct: 326 PAVGEEGVTNPL-----------------------ALWHQRNRWAEGGFQRYLDY----- 357 Query: 347 SLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTL 406 + + + L ++ +L L+A L + + Sbjct: 358 -----WRQLGKNQLGWGKSLDMLGFWIIQYMLPLVALPDLVIALIR-----RQTPVYAPI 407 Query: 407 LWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPR 466 L + + + + + ++ + + A++ ++ + Sbjct: 408 TILAVCMSGIGMFTTLRQSERTSVWSAIVQTIRGSIYMLHWLVVIGSMAIRISVRQ---K 464 Query: 467 RVAWDKTTH 475 R+ W KT H Sbjct: 465 RLKWVKTAH 473 >UniRef50_A6W755 Type II secretion system protein E n=3 Tax=Actinomycetales RepID=A6W755_KINRD Length = 591 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 52/277 (18%), Positives = 101/277 (36%), Gaps = 21/277 (7%) Query: 477 FPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE----GLRLGGSMLMQGLISAEQ 532 T R LG +L+E ++ E LD AL + RLG ++ G++S + Sbjct: 20 PAQGQPATPVRRRLGDVLVEKGLLVPEDLDVALAEQRNVEGPRRRLGQILVELGMVSEAE 79 Query: 533 LAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPV 592 LAQ LAE + + + ++ +P +VA VL L + L+V + D + + Sbjct: 80 LAQCLAELLQLEHVDLSRLTLAPDVVRLLPRAVAERCRVLVLDKTPEYLLVAAADPTNVL 139 Query: 593 SLAALTRKVGRK-VRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAG- 650 +L + + V+ + QI L ++ + + + A Sbjct: 140 ALDDVKLYTRTPELHVVVAMDSQIRDQLARAWSLTEDTSQVSRMVQDATEDDDEDPLAAL 199 Query: 651 -EIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDR 709 P L IL+ + S I++ E L +G+ L Sbjct: 200 NGSVDDDAPIVKLVNRILSDAVRLRCSDIHL-----ESQRDQLRVRFRVDGL-----LRD 249 Query: 710 VLTIQRELQVSMQ-SLLLKAGLNTEQVAQLESENEGE 745 V++ + + S+ + + +GL+ + +G Sbjct: 250 VMSAPKRVAPSVISRIKIISGLDISE---RRIPQDGR 283 >UniRef50_UPI0001B50A66 glycosyl transferase family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B50A66 Length = 439 Score = 208 bits (530), Expect = 6e-52, Method: Composition-based stats. Identities = 86/445 (19%), Positives = 152/445 (34%), Gaps = 60/445 (13%) Query: 38 VVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLD 96 V+Y + + R + + + P ++++PA +E VI + E Sbjct: 32 VLYLMLYTWDRRDAER---KARAPDTFIPPRLSFSVLLPARHEEDVIQSTIERVVRADYP 88 Query: 97 YENYHIFVGTYPNDPDTQRDVDEVCARFPN----VHKVVCARPGPTSKADCLNNVLDAIT 152 E +FV +D T + +E + +VV GP +K LN L Sbjct: 89 AELLEVFVICSQDDDGTVKKAEEKIDQLAREGLHNVRVVVFDDGPINKPHGLNTALPQTA 148 Query: 153 QFERSANFAFAGFILHDAEDVISPMELRLFNYLV--ERKDLIQIPVYPFEREWTHFTSMT 210 + DAED I P RL N ++ ER ++Q V + ++++ Sbjct: 149 NK---------VVTIFDAEDDIHPKIFRLVNTVMVKERVRVVQAGVQLMNYQSNWYSTLN 199 Query: 211 YIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDI 270 + E+ + A G +P G F+R + L +D ++LTED D+ Sbjct: 200 -VLEYFFWFKSRLH-YHAHHGSIPLGGNTVFFARELLLRL------GGWDDRNLTEDADM 251 Query: 271 GFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRW 330 G R+ G V + +E P T +RQ++RW Sbjct: 252 GLRISAMGERVRVV---------------------YDDRYVTKEETPPTLGHFIRQRTRW 290 Query: 331 IIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDA 390 G + K W T + L + + I L ++L + Sbjct: 291 SQGFMQT-LKKGTWKKMPTRKQRWLAFYVLVFPRGQALLGLYLPISLGMILILKV----- 344 Query: 391 WHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFM 450 + I S + LL +F + + + + L + F ++ Sbjct: 345 --PVLIALCSYLPVLLLVAHFLVQMVGLYEFTDAHGLEASPKAVLRMAIAWFPFQMVLAY 402 Query: 451 ANWRALKQVLQHGDPRRVAWDKTTH 475 A RA+++ L R W+KT H Sbjct: 403 AALRAMRRQLA----GRHDWEKTQH 423 >UniRef50_A9BY08 General secretory pathway protein E n=2 Tax=Proteobacteria RepID=A9BY08_DELAS Length = 575 Score = 208 bits (529), Expect = 8e-52, Method: Composition-based stats. Identities = 57/274 (20%), Positives = 99/274 (36%), Gaps = 22/274 (8%) Query: 476 DFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVE-GLRLGGSMLMQGLISAEQLA 534 P + LG+ LL + E L AL+ + E G LG ++ G+++ +A Sbjct: 8 MEPIAGVEAPIKGRLGERLLSAGKLNERDLQNALQAQQELGGYLGQVLVQLGVVAETDVA 67 Query: 535 QALAEQNGVAWESIDAWQIPSSLIAEMP---ASVALHYAVLPLRLENDELIVGSEDGIDP 591 QAL+EQ + W + + L+ ++P AS V P+ L++ L V DP Sbjct: 68 QALSEQLHMRWLRAEEF---PDLLPDVPGLLASFLDAQCVCPISLQDGVLEVAMSVPQDP 124 Query: 592 VSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGE 651 AL G +++ V+ L I L G + + +A ++ + Sbjct: 125 FITKALRLATGLQIKPVLALEADIRKALSEAGQEPEGEEGQDWESDATGGDFVE---HLK 181 Query: 652 IWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVL 711 P L I++ ++ S I++ E L +GVI L Sbjct: 182 DLASEAPVIRLVTGIISRAIELHASDIHL-----EPFEAGLQVRYRMDGVIHAAELV--- 233 Query: 712 TIQRELQVSMQSLLLKAGLNTEQVAQLESENEGE 745 R + L A L+ + +G Sbjct: 234 -PPRLSAAVGSRVKLLAHLDIAE---RRLPQDGR 263 >UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli subsp. xyli RepID=Q6ACB6_LEIXX Length = 421 Score = 207 bits (528), Expect = 8e-52, Method: Composition-based stats. Identities = 74/442 (16%), Positives = 136/442 (30%), Gaps = 51/442 (11%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 L L+ I + + +I+ ++GL + + P + Sbjct: 4 LGALQWIGLVVCLILALTGLIPVVAAAATFFVIPLHAWINHYHK--------AAPYLPRV 55 Query: 72 AIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKV 130 AI+VPAWNE VIG + E I+V + DT V E R+P + Sbjct: 56 AIVVPAWNEGAVIGASIDRLVTLDYPKEALRIYVVDDASTDDTSVVVRERAMRYPGNVFL 115 Query: 131 VCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYL--VER 188 G KA LN+ ++ I A+ ++ DA+ + P LR + Sbjct: 116 FRREKGGQGKAHTLNHGIERIF-----ADDWMEALLIMDADVIYQPDSLRKMTRHLADPK 170 Query: 189 KDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVT 248 + ++ + + T E+ + L Q AG SR + Sbjct: 171 VGAVSAYIHEGSADRNYLT-KFVSTEYVLSQPTARRAQNVLGAQACLAGGAQLHSRENLI 229 Query: 249 ALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSN 308 A+ G D +L ED F + +G +F Sbjct: 230 AI-----GGQVDTSTLAEDTITTFETQLRGKRVVFEPHA--------------------- 263 Query: 309 MICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSF 368 V P T +Q+ RW G V + K + + L GA + + + Sbjct: 264 --HVLAEEPRTIDALWKQRLRWARGNVSVTARYSKVWFRPSRKHHL-----GAWTFGIVW 316 Query: 369 LAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLN-FGLMVNRIVQRVIFVTG 427 ++ ++ ++++ + H + L L +++ V Sbjct: 317 FSLWLLPLIMVISSIGLAGLLFLHNGFATTVFRILWISAALAYLFVLILGSQTDVGTTRK 376 Query: 428 YYGLTQGLLSVLRLFWGNLINF 449 G ++ + F Sbjct: 377 EVGHVMLFPGIVNMLVMLTALF 398 >UniRef50_B8F8X7 Response regulator receiver protein n=3 Tax=Proteobacteria RepID=B8F8X7_DESAA Length = 804 Score = 207 bits (528), Expect = 9e-52, Method: Composition-based stats. Identities = 32/292 (10%), Positives = 97/292 (33%), Gaps = 25/292 (8%) Query: 466 RRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLR-LGGSMLM 524 + ++T + + +G++L++ ++T+E L+ A + + + L ++ Sbjct: 158 KNRQKEQTKKIKTLTKCFSANRSQIGRMLVKRNLLTKEDLEKAQQVQARSDKILPAILME 217 Query: 525 QGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVG 584 L + + + E+ + + + + L + +P+ + + ++PL+ + +L+ Sbjct: 218 MELADEKTIMDVMEEELKINRVNPAEFTASAPLASLIPSEICEKHLLVPLKRMDGQLVTA 277 Query: 585 SEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWL 644 D D V + L G ++ + I ++ Y + + Sbjct: 278 MADPTDLVKIDELRFLTGMPIKPALATHEDIRKKVQELYGGESALNSVISEIELMDPTET 337 Query: 645 TE----------QQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGK 694 E + P + I++ S +++ E + L Sbjct: 338 IEIILEEDDVVDVDELLKSKDQPPAIRIVNSIISDALRHGASDVHI-----EPKTKYLMV 392 Query: 695 FLVTEGVISQETLDRVLTIQRELQ-VSMQSLLLKAGLNTEQVAQLESENEGE 745 + + L + I + + + + + L+ + +G Sbjct: 393 RYRID-----DLLQEKIRIPMAMHPPIVSRIKVMSELDITE---RRKPQDGR 436 >UniRef50_D0KDT2 Glycosyl transferase family 2 n=3 Tax=cellular organisms RepID=D0KDT2_PECWW Length = 610 Score = 207 bits (527), Expect = 1e-51, Method: Composition-based stats. Identities = 73/461 (15%), Positives = 144/461 (31%), Gaps = 61/461 (13%) Query: 1 MDWLLDVFATWL-YGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMS 59 + + L + ++ +G + I I L + ++ + F+ + ++ + Sbjct: 104 LSYALSFYEIFVGHGFQNIFIALVLFSTVTSVYSFYYSPFSQNALFYLDFKINKKLTHVQ 163 Query: 60 YRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDT-QRDVD 118 + + + I +P ++E I + LDY+NY + V + R V+ Sbjct: 164 --TVSRVCSPKVTIHLPCYSEPPEIVITTLNSILELDYDNYEVIVIDNNTQDEALWRPVE 221 Query: 119 EVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPME 178 + C K +KA LN L+ + + DA+ +I P Sbjct: 222 KHCDMLGGKFKFHHVPVLSGAKAGALNYALNITS-------SDTELIAVIDADYIIEPDF 274 Query: 179 LRLFN--YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSA 236 +R + + E +Q + + +H Y H ++P + Sbjct: 275 IRRYVEIFKDENVGFVQTSHDYYNYQSSHVMEGAYYFW-VLFHKIELPSYTEINSAFTV- 332 Query: 237 GVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKERE 296 G + + + +D +LTED ++ R+ +G +V Sbjct: 333 GTMCILRKNILEKV------GGWDETALTEDSELAVRMHAQGH-VGYV------------ 373 Query: 297 QRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLT------- 349 R P TFS +Q+ RW G V Q K W L Sbjct: 374 ----------FADTVGRGLIPTTFSDMKKQQMRWTAGPVQQLLKH--WRLYLGFSSENKM 421 Query: 350 ---LNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTL 406 R + S +++ VM+ + + L + + + IFS + + Sbjct: 422 TLNQKIMEIRHSTSRMKPVFSLISLHVMVMVSIFLLSGDVNLSIPYEMVIFSLCMLMGSF 481 Query: 407 L---WLNFGLMVNRI--VQRVIFVTGYYGLTQGLLSVLRLF 442 + RI V + T + LF Sbjct: 482 VEKMAYFRLFNCTRILPVLYHGIMARALEWTFICGVISPLF 522 >UniRef50_Q3SKS0 Pilus assembly pathway ATPase PilB n=3 Tax=Proteobacteria RepID=Q3SKS0_THIDA Length = 577 Score = 207 bits (527), Expect = 1e-51, Method: Composition-based stats. Identities = 49/265 (18%), Positives = 97/265 (36%), Gaps = 16/265 (6%) Query: 485 RSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGV 543 R LG +L+E +VI+ LD AL + G RLG ++ GL +AQALA Q + Sbjct: 15 RQKIRLGDLLVEQKVISAADLDIALTAQKKSGRRLGRIIVESGLAGENDIAQALARQLAI 74 Query: 544 AWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGR 603 + + + S++ + + A + +PL ++ VG D D + + R + Sbjct: 75 PFVDLRKFNPDPSILQLLGETQARRFRAIPLGRREGDIFVGMADPTDLFAYDEVARLIEG 134 Query: 604 KVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQ---YVPHQ 660 ++ +V G +++ + Y R + + + P Sbjct: 135 GIQLAVVAEGDLLSAIDRLYRRTDDIHGLTEELARDMGESEASIIGLDALGEGQADAPVV 194 Query: 661 FLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVS 720 L + +N S +++ E L +GV+ ++T + Sbjct: 195 RLLQTLFEDALQVNASDVHI-----EPQEKQLMIRFRIDGVLHRQTEADLRIAP----AL 245 Query: 721 MQSLLLKAGLNTEQVAQLESENEGE 745 L + +GL+ + +G Sbjct: 246 ALRLKIVSGLDISE---KRLPQDGR 267 >UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZQ2_SORC5 Length = 521 Score = 207 bits (527), Expect = 1e-51, Method: Composition-based stats. Identities = 75/491 (15%), Positives = 150/491 (30%), Gaps = 61/491 (12%) Query: 12 LYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPL 71 + L + + V++ +SG + +V R + K++ + ++ R+L P+ Sbjct: 1 MLSLLLCVLYFGVLIGLSGYGLHRLHLVVLCRLNRAKITRAQEVAALTDRDL-----PPV 55 Query: 72 AIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARF-----P 125 I +P +NE+ V + + A + I V + +TQ V R Sbjct: 56 TIQLPLFNESTVAARLLDAVAKMDYPRDKLEIQVLDD-STDETQGLVRAHVERLRALGLD 114 Query: 126 NVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLF--N 183 V+ R G KA L+ L + DA+ + P +R + Sbjct: 115 AVYLHRVDRVG--YKAGALDAGLKIAK---------GELVAIFDADFIPQPDFVRSIVGH 163 Query: 184 YLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFS 243 + ++Q R+ + T H + R +G G + Sbjct: 164 FEDPTVGMVQTRWGHLNRDVSILT-QVQALMLDGHHLVENRARFGAGLLFNFSGTGGMWR 222 Query: 244 RRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQH 303 + A+ + +LTED D+ +R + G ++ Sbjct: 223 KDAIRE------AGGWQHDTLTEDLDLSYRAQLAGYRFVY-------------------- 256 Query: 304 ARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAIS 363 + P+ S Q+ RW G V K S L+ R A Sbjct: 257 ---REDVVSPAELPEDISALRAQQYRWAKGTVQTARKLMATVLSAKLS---LGQRIEAFF 310 Query: 364 NFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVI 423 + A +++ L +LL + A L++ + L T + + + Sbjct: 311 HLTPHFAYPLLVLLSVLLLPALVLFPAADTLTMIAIDLPLCTATTGSLAAFYM-LAETAQ 369 Query: 424 FVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGD 483 + + + + + + A L+ + G+ R V D Sbjct: 370 GRSRWGAVRRLPMLIALGTGLAPYLSKAVIEGLRSM--SGEFVRTPKQGDNKGRYKVRTD 427 Query: 484 TRSLRPLGQIL 494 + +L Sbjct: 428 LPITEAMLAVL 438 >UniRef50_B8DWB6 Predicted glycosyltransferase n=10 Tax=Bifidobacterium RepID=B8DWB6_BIFA0 Length = 428 Score = 207 bits (526), Expect = 1e-51, Method: Composition-based stats. Identities = 57/431 (13%), Positives = 126/431 (29%), Gaps = 44/431 (10%) Query: 63 LYKPDEKPLAIMVPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVC 121 P +K A+++ A NE V+GN+ + + I++ D T + ++ Sbjct: 38 PAAPMDKRYAVLISARNEEQVVGNLIRDIQSQSYPSKLIDIWLVADNCDDGTAQLARDL- 96 Query: 122 ARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRL 181 H V K L +L+A+ + A+ + F + DA++ + Sbjct: 97 ----GCHVVERFNQQQVGKGYALTYLLNAM--IDSKASDQYDAFFVFDADNRLDKHYFEE 150 Query: 182 FNYLVE-RKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGT 240 N + ++ +S F R L G G Sbjct: 151 MNKAYQSGFRILTSYRNSVNLSENWVSSG-SALWFIRESRFVSASRMWLGNSCHVGGTGF 209 Query: 241 CFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKF 300 FS+ + + LTED + G +V ++ + + Sbjct: 210 MFSQEVMRR------NQGWKFHLLTEDLEFTMDSVLHGDRIGYVGSAILYDEQ------- 256 Query: 301 LQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKG 360 P TF+ + RQ+ RW G + + + Sbjct: 257 ----------------PVTFAQSWRQRLRWSKGFLQVFRYYGPALVRRAIQERDFSSID- 299 Query: 361 AISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQ 420 ++ F+ +L +I++LL + + +W + L + + F +++ + Sbjct: 300 -LTLFICPFTVLAIIRVLLGTIFAACGFISWSSQGAALFNWMLGVVSSMVFMMVLAGLTM 358 Query: 421 RVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSV 480 V + L L +++A+ + + ++ Sbjct: 359 VVERKQIGASNRELFAYALSFPIYILSYVPISFQAVF---AKAQWKPIEHQGSSGAEDPR 415 Query: 481 TGDTRSLRPLG 491 + Sbjct: 416 IREQDREERAM 426 >UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolobaceae RepID=A4YD58_METS5 Length = 395 Score = 207 bits (526), Expect = 2e-51, Method: Composition-based stats. Identities = 64/434 (14%), Positives = 129/434 (29%), Gaps = 71/434 (16%) Query: 15 LKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIM 74 L I I L++I+ I + + + E +++ Sbjct: 2 LDDIVIGLSIIVSIWSVYN---SAFAIYGLSWKS------------DEPKTSSGPSFSLL 46 Query: 75 VPAWNETGVIGNMAELAA-TTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCA 133 VP NE V+G + E D Y I V + +T ++ + + V Sbjct: 47 VPVRNEEKVLGRLLERLVNQEYDRSKYEIIVLEDGSTDNTLGVCNKFSEMYSIIKCVHLE 106 Query: 134 RPGPT-SKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLV---ERK 189 + K+ LN L + DA+ V L R Sbjct: 107 KSNVVNGKSRALNYGLKI---------SRGDIIGVFDADTVPRLDVLGYVAQKFISNSRV 157 Query: 190 DLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTA 249 +Q + P + + ++E + R VP G + R A+ Sbjct: 158 GGVQGRLVPINVRESIVARLASLEEL--FSEYSISGRARAGLFVPLEGTCSFVRRDALEK 215 Query: 250 LLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNM 309 + ++ LTED D+ +L ++ Sbjct: 216 V------GGWNENVLTEDLDLSLKLTSLNYLIVYSPS----------------------- 246 Query: 310 ICVREYFPDTFSTAVRQKSRWIIGI----VFQGFKTHKWTS---SLTLNYFLWRDRKGAI 362 + P TFS+ VRQ+ RW G + W ++ + ++ A Sbjct: 247 VQSWREVPVTFSSLVRQRLRWYRGNFELTMRISRFKFTWRLVDAAMLVGTPVFMVLSLAN 306 Query: 363 SNFVSFLAMLVMIQLLLLLAYE----SLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRI 418 + V + + + + ++++ L + +++ L+LNF + ++ I Sbjct: 307 YSLVFIYSYQLHVLIAAIISFSSMMTLLLIIMISRRHMIETIYIILSALYLNFTISLHLI 366 Query: 419 VQRVIFVTGYYGLT 432 + G + Sbjct: 367 SIVLELAGAPKGWS 380 >UniRef50_C7RLS9 Type II secretion system protein E n=3 Tax=Betaproteobacteria RepID=C7RLS9_9PROT Length = 571 Score = 207 bits (526), Expect = 2e-51, Method: Composition-based stats. Identities = 51/273 (18%), Positives = 96/273 (35%), Gaps = 21/273 (7%) Query: 479 SVTGDTRSLRPLGQILLENQVITEEQLDTAL-RNRVEGLRLGGSMLMQGLISAEQLAQAL 537 + RPLGQIL+ +++E+QL AL +G ++ G +S L AL Sbjct: 3 NAPDAHPHSRPLGQILISKGILSEDQLRIALLEQMKSNRPIGKLLVTLGFVSEATLRDAL 62 Query: 538 AEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRL--ENDELIVGSEDGIDPVSLA 595 +E G + I + +P +A + +L L EN L + D D V+L Sbjct: 63 SESLGKQSVDLSNAIIDPLALKLVPRDLAKRHHLLALDYDAENQRLTLAIADINDIVALD 122 Query: 596 ALTRKVGRKVRY--VIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIW 653 + G ++ ++ +I + +Y D +L + Sbjct: 123 KIRSLAGDEIEIDTLLAGETEIDRAIDQYYGHELSID--GILNEIETGEIDFRGLQSSAD 180 Query: 654 RQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTI 713 P L ILT + S I+ E + L +G L ++ ++ Sbjct: 181 EYSQPVVRLIDSILTDAVKHDASDIH-----FEPEASFLRIRYRIDG-----MLRQIRSL 230 Query: 714 QREL-QVSMQSLLLKAGLNTEQVAQLESENEGE 745 + + + +G+N + + +G Sbjct: 231 HKTYWPAMAVRIKVLSGMNIAE---TRAPQDGR 260 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.136 0.373 Lambda K H 0.267 0.0415 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,849,837,847 Number of Sequences: 3077464 Number of extensions: 155704295 Number of successful extensions: 754755 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 2094 Number of HSP's successfully gapped in prelim test: 7522 Number of HSP's that attempted gapping in prelim test: 729597 Number of HSP's gapped (non-prelim): 13410 length of query: 745 length of database: 1,040,396,356 effective HSP length: 136 effective length of query: 609 effective length of database: 621,861,252 effective search space: 378713502468 effective search space used: 378713502468 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.0 bits) S2: 97 (42.0 bits)