BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (254 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobact...    527   e-148
UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebalde...    191   1e-47
UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bactero...    155   2e-36
UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobac...    147   4e-34
UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylo...    146   5e-34
UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteri...    140   5e-32
UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobi...    138   1e-31
UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepI...    134   4e-30
UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea...    128   2e-28
UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteob...    127   3e-28
UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcu...    124   2e-27
UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitino...    124   3e-27
UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelo...    123   7e-27
UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acineto...    121   2e-26
UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomi...    121   2e-26
UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Auranti...    120   6e-26
UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC...    118   2e-25
UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptuniu...    118   2e-25
UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmat...    117   3e-25
UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythro...    114   3e-24
UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodob...    112   1e-23
UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacter...    111   2e-23
UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter c...    111   3e-23
UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychro...    110   4e-23
UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax...    108   2e-22
UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydro...    106   6e-22
UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoc...    106   7e-22
UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetoba...    105   1e-21
UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=...    100   5e-20
UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucell...     82   2e-14
UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legione...     65   2e-09
UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legione...     62   1e-08
UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella b...     49   2e-04
UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseifl...
47 4e-04 >UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobacteriaceae RepID=YIGE_ECOLI Length = 254 Score = 527 bits (1357), Expect = e-148, Method: Compositional matrix adjust. Identities = 254/254 (100%), Positives = 254/254 (100%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY Sbjct: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 Query: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE Sbjct: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK Sbjct: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ Sbjct: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 Query: 241 RYPFVTMISVERKG 254 RYPFVTMISVERKG Sbjct: 241 RYPFVTMISVERKG 254 >UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AL67_SEBTE Length = 266 Score = 191 bits (486), Expect = 1e-47, Method: Compositional matrix adjust. 
Identities = 96/217 (44%), Positives = 134/217 (61%), Gaps = 4/217 (1%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + D TV Y + E +KMYW+ N +A+ L + + N+ ++ A NGGIY E Sbjct: 52 IEDRGFTV--YKPDLNKEIIKMYWKDENNKAYSELSKFIQE-NTGNKINFATNGGIYSEE 108 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 Y P GLYIEN + +NLA GEGNF+++P GVFY+ ++ I AF+ ++ I +A Q Sbjct: 109 YEPNGLYIENHKIISKINLADGEGNFYMQPNGVFYIQNNQPKISESKAFEYNENISYATQ 168 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 SGP+L+ENGVIN +I N S KIR+ VGI++ FL+S + NFYDF+ YA KLN Sbjct: 169 SGPLLIENGVINKKIGKNSESFKIRSAVGIDRENKVFFLMSSEKINFYDFSKYALDKLNC 228 Query: 218 EQLLYLDGTISHMYM-KGGAIPWQRYPFVTMISVERK 253 + LL+LDG IS MY IP Q YPF +I+ E++ Sbjct: 229 KDLLFLDGAISKMYFADEKKIPEQDYPFAVIITSEKR 265 >UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=Q11X50_CYTH3 Length = 244 Score = 155 bits (391), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 78/193 (40%), Positives = 119/193 (61%), Gaps = 3/193 (1%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM-AMNGGIYDESYAP 100 T+ V +YTV+PQ + ++ YW+ NGE ++ L A + S+G + A NGG+Y E +P Sbjct: 28 TIDVISYTVDPQKDNLQFYWKNDNGEILKSIKKLKAYVESKGSTLLFATNGGMYKEDRSP 87 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RLDAFKTSKEIQFAVQSG 159 LGL+I+NG+ LN A G+GNF+++P GVFY+ D ++ + + F + I+FA QSG Sbjct: 88 LGLFIQNGKTVTPLNKAKGQGNFYMQPNGVFYITNDNEAVICKTEDFINNGNIKFATQSG 147 Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQ 219 PM++ N I+P + IRNGVGI + +F +S++ NF+DFA Y + L E Sbjct: 148 PMIIVNNQIHPSFIKGSKNLNIRNGVGILPNKKIIFAMSEKEVNFFDFALYFQ-NLGCEN 206 Query: 220 LLYLDGTISHMYM 232 LYLDG +S Y+ Sbjct: 207 ALYLDGFVSRSYL 219 >UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9JX75_AGRVS Length = 274 Score = 147 bits (370), Expect = 4e-34, Method: Compositional matrix adjust. 
Identities = 80/223 (35%), Positives = 126/223 (56%), Gaps = 5/223 (2%) Query: 33 ADDCALSDPTLTVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQV-QMA 88 A++ + D T AY V +P T ++++ + A+G+ +G AL + + Q + A Sbjct: 46 AEEQSCRDQTENGFAYRVCRFDPATRTIRIFNRNADGDVYGGFEALRSQLWQQRLILTFA 105 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 +NGG+Y +P+GL+++ G + A G GNF+++P GVF++ G++ F+T Sbjct: 106 VNGGMYHSDLSPVGLFVDYGMTRKTAETADGWGNFYLKPNGVFFLKDGHAGVLETGQFET 165 Query: 149 SK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDF 207 K E FA QSGPML+ +GV++P+ P S KIRNGVGI+ G VF+LS+ FYD Sbjct: 166 QKIEADFATQSGPMLVIDGVLHPKFLPTSDSLKIRNGVGIDASGQVVFVLSKDPVRFYDM 225 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 A + + +L LYLDGTIS + + YP +I+V Sbjct: 226 AAFFRDRLGAANALYLDGTISSLAEPMAGRIDRAYPLGPIIAV 268 >UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylobacterium extorquens group RepID=A9W4Y6_METEP Length = 258 Score = 146 bits (369), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 76/189 (40%), Positives = 118/189 (62%), Gaps = 6/189 (3%) Query: 49 TVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIEN 107 TV+ + ERV+++W +G +G+L +L + QG ++ AMN G+YD+ AP+GLY+E+ Sbjct: 54 TVDLRRERVRLFWLGTDGLPYGSLSSL---ADRQGPRLSFAMNAGMYDKGQAPVGLYVED 110 Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI-QFAVQSGPMLMENG 166 G++ + A+G GNF ++P GVFYV GD+ G++ + +K FA QSGPML+ +G Sbjct: 111 GRELKGASTANGPGNFHLKPNGVFYVKGDRAGVLDTGRYLRAKPAPDFATQSGPMLVIDG 170 Query: 167 VINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDG 225 I+P+I + S KIRNGVG+ G+ AVF +S++ F FA K L+LDG Sbjct: 171 KIHPKISADGPSQKIRNGVGVRDGGHVAVFAISERPVTFGAFARLFKDSFGCRNALFLDG 230 Query: 226 TISHMYMKG 234 ++S +Y G Sbjct: 231 SVSSLYAPG 239 >UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteria RepID=C5CWT4_VARPS Length = 238 Score = 140 bits (352), Expect = 5e-32, Method: Compositional matrix adjust. 
Identities = 75/197 (38%), Positives = 121/197 (61%), Gaps = 10/197 (5%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANG---EAWGTLHALLADINSQGQVQMAMNGGIYDE 96 +P TV ++ + ER++++ +G + + L A LA N Q + AMN G+Y Sbjct: 25 EPRYTV--VKIDVRRERLELFLHDDSGAPFKRFDRLEAWLAARNRQ--LVFAMNAGMYHA 80 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKE-IQF 154 ++P+GL ++ G+++ LNLA+G GNFF++P GVF V+ +V + KE ++ Sbjct: 81 DFSPVGLLVQEGREEAPLNLAAGAGNFFLKPNGVFLVSDAGPRVVESSEYAALPKEGVRL 140 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK 214 A QSGP+L+ GV++P P+ S KIRNGVG++ H A+F++S+Q NFY+FA Y + Sbjct: 141 ATQSGPLLLRRGVVHPAFIPDSDSRKIRNGVGVSGH-TAIFVISEQPVNFYEFALYFRDV 199 Query: 215 LNVEQLLYLDGTISHMY 231 L+ LYLDGT+S ++ Sbjct: 200 LHCRDALYLDGTVSALH 216 >UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobiales RepID=B2II06_BEII9 Length = 269 Score = 138 bits (348), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 70/223 (31%), Positives = 124/223 (55%), Gaps = 2/223 (0%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T L R+FL L L A A L++ + + + ++++WQ+ G+ +G Sbjct: 17 IFTKLLMRVFLPLFLSAGTAWAEPCLPLTEEGINYVVCRFDTKRSDLRLFWQQPGGQPYG 76 Query: 71 TLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 L A + +G+ ++ AMN G++ E +P+GLYI+ G+ N+ +G GNF ++P G Sbjct: 77 GFAPLRAQLQPKGETLEFAMNAGMFQEDLSPVGLYIQEGRLLHPANMRNGPGNFHMKPNG 136 Query: 130 VFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGIN 188 +FY + G++ F ++ + +A QSGP+L+ N ++P+I P S KIRNGVG+ Sbjct: 137 IFYFSQTSAGVMETGRFLQSGLKPDYATQSGPLLVANNQLHPKIEPTGTSEKIRNGVGVR 196 Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMY 231 + +F +S+ F+ FA + +L+ L+LDG+IS +Y Sbjct: 197 DNHEVIFAISEAPVTFFRFARLFRDRLHCPDALFLDGSISSLY 239 >UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepID=Q98NI9_RHILO Length = 263 Score = 134 bits (336), Expect = 4e-30, Method: Compositional matrix adjust. 
Identities = 64/184 (34%), Positives = 107/184 (58%), Gaps = 2/184 (1%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM-AMNGGIYDESYAPLGLYIENG 108 V+P+ ++++W+ G+ + +LH L A + G+ + A+N G+Y P+GLY+E G Sbjct: 56 VDPKLYSIELFWKDPVGKPFQSLHNLDAAQRAAGRTMLFAINAGMYHPDLRPVGLYVERG 115 Query: 109 QQKVALNLASGEGNFFIRPGGVFYVAGDKVGI-VRLDAFKTSKEIQFAVQSGPMLMENGV 167 ++ + SG GNF ++P G+FY++G K + D + +A QSGPML+ +G Sbjct: 116 REMAGVRTGSGSGNFSLQPNGIFYISGGKAAVRATRDFVRKRPSTDYATQSGPMLVIDGQ 175 Query: 168 INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTI 227 ++P+ + S K R+GVG+ K G AVF +S NF+ FA + L + L+LDGTI Sbjct: 176 LHPKFQSDGTSRKTRDGVGVRKDGVAVFAISNGTVNFHTFARLFRDALGCDNALFLDGTI 235 Query: 228 SHMY 231 S ++ Sbjct: 236 SSLF 239 >UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9D6B9_9RHIZ Length = 286 Score = 128 bits (321), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 67/190 (35%), Positives = 109/190 (57%), Gaps = 6/190 (3%) Query: 49 TVNPQTERVKMYWQKANGEAWGTLHA----LLADINSQGQVQMAMNGGIYDESYAPLGLY 104 T++PQT +++ ++ G+ G++ A L A + ++ +AMN G+Y +P+GLY Sbjct: 70 TIDPQTHDMRLVYRDRMGDVLGSVSAVVDQLAAGAGTDHKLVLAMNAGMYHADMSPVGLY 129 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGD-KVGIVRLDAFKTSK-EIQFAVQSGPML 162 +EN + ALN G GNFF++P GVF+V D G++ DA+ + ++A QSGPML Sbjct: 130 VENSVEIAALNRDDGFGNFFLKPNGVFFVLKDGNAGVLETDAYAEADLSPEYATQSGPML 189 Query: 163 MENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLY 222 + +GVI+PR P+ S IRNGVG+ G VF +++ + FA + E L+ Sbjct: 190 VIDGVIHPRFLPDGTSKFIRNGVGVRPDGKVVFAITRDRVSLGSFARLFRDVAGCENALF 249 Query: 223 LDGTISHMYM 232 DG +S + + Sbjct: 250 FDGAVSSLAL 259 >UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteobacteria RepID=A9CIN9_AGRT5 Length = 254 Score = 127 bits (320), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 69/187 (36%), Positives = 108/187 (57%), Gaps = 6/187 (3%) Query: 48 YTV---NPQTERVKMYWQK-ANGEAWGTLHALLADINSQGQVQM-AMNGGIYDESYAPLG 102 YTV +P +++Y Q +G+ + L + + Q + AMNGG+Y Y+P+G Sbjct: 40 YTVCSFDPAKNTIRIYDQDHVSGQGYRNFADLSSALWRQHMFSVFAMNGGMYHSDYSPVG 99 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPM 161 L++ENG ++ ++ G GNF + P GVFY+ G+ G++ +A+ + FA QSGPM Sbjct: 100 LFVENGVERSPVSTRGGWGNFHLLPNGVFYLDGNTAGVLETEAYLAADPKPDFATQSGPM 159 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLL 221 L+ +G ++PR P+ S K RNGVG+++ G F +S+ FYDF + L+ L Sbjct: 160 LVIDGKLHPRFLPDSDSLKRRNGVGVSRDGMVHFAISETTVRFYDFGTLFRDVLDAPNAL 219 Query: 222 YLDGTIS 228 YLDGTIS Sbjct: 220 YLDGTIS 226 >UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcus RepID=Q1IX28_DEIGD Length = 317 Score = 124 bits (312), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 77/221 (34%), Positives = 122/221 (55%), Gaps = 9/221 (4%) Query: 15 NLKRIFLALTLLPLFAVA-ADDCALSDPTLTVQAYTV---NPQTERVKMYWQK-ANGEAW 69 N+ RIF+ LLPL A + A + T YTV + + + ++++W+ A G+ + Sbjct: 77 NVLRIFV--LLLPLTACSQAGGLDVRRVTAEGMLYTVAAVDLKRDHLRLHWKNPATGQPY 134 Query: 70 GTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 T + A + G QV A N GIY PLGL++E G+ + LN A GNF + P Sbjct: 135 RTFAEVSARLRKDGEQVLFATNSGIYGPGLEPLGLHVEEGRTLIGLNNARSGGNFALLPN 194 Query: 129 GVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF+V G++ G+ A++ + + FA QSGP+L++ G ++P + +S K+R+GVG+ Sbjct: 195 GVFWVKGNQAGVTETQAYRRLNIQPTFATQSGPLLVQGGRLHPAFNKGSSSFKVRSGVGV 254 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS 228 + G F +S NF+ FA + + L LYLDG+IS Sbjct: 255 CRDGRVRFAVSAGPVNFHSFAVFFRDVLGCPDALYLDGSIS 295 >UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PF78_CHIPD Length = 273 Score = 124 bits (311), Expect = 3e-27, Method: Compositional matrix adjust. 
Identities = 73/198 (36%), Positives = 117/198 (59%), Gaps = 12/198 (6%) Query: 46 QAYTVNPQTERVKMYWQKANGEA-WGTLHALLADI--NSQGQVQMAMNGGIYDESYAPLG 102 A VNP + ++W A+ + + ++ AL D+ + + M NGG++ ++ P+G Sbjct: 58 DAIVVNPAVSDISLHWLSADQQTPYKSIQAL-QDVLLEKKKDILMITNGGMFMKNNIPVG 116 Query: 103 LYIENGQQKVALNLASGE-GNFFIRPGGVFYV--AGDKVGIVRLDAFKTSK---EIQFAV 156 L+I G++ ++ A+ + GNF+++P GVFY+ G V D K S+ +I A Sbjct: 117 LFISQGRELRPIDAATDQPGNFYMQPNGVFYLDHTGPHVSTTT-DYLKRSRAHSKIVAAT 175 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFYDFACYAKAKL 215 QSGPML+ G+IN + +P + +R+GVGI +GN VF++S++A T FYDFA KA+ Sbjct: 176 QSGPMLVSKGIINAKFNPGSVNRNLRSGVGILSNGNVVFIISKEAQTTFYDFASIFKARF 235 Query: 216 NVEQLLYLDGTISHMYMK 233 + LYLDG IS MY+K Sbjct: 236 GCKDALYLDGAISKMYLK 253 >UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EW16_DICNV Length = 263 Score = 123 bits (308), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 73/212 (34%), Positives = 116/212 (54%), Gaps = 13/212 (6%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQ 110 P+ ++++ WQ GE + T+H L + ++G QV MN GI++++ P GL+IE Sbjct: 51 PEHDKIRFLWQNDRGENYQTMHHALRALTNEGYQVHFLMNAGIFNQNAQPAGLWIEKKAL 110 Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RLDAFKTSKEIQFAVQSGPMLMENGVIN 169 LN SG+GNF I+P GVFY+ +K I+ + + +AVQSGP+L+ +G IN Sbjct: 111 LRPLNRRSGKGNFHIQPNGVFYLTQEKAHIITTVQWHNNPPKADYAVQSGPLLIIDGAIN 170 Query: 170 PRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYL 223 R+ N ++ RN V ++K F+++ + N Y FA +A + +Q LYL Sbjct: 171 SRLPKNHKAAYKRNAVCVDKARRVYFVITTRYDDGAHFPNLYRFA-HALQTIGCQQALYL 229 Query: 224 DGTISHMY--MKGGAIPWQRYPFVTMISVERK 253 DG++S Y M+ WQ+ F MI+V K Sbjct: 230 DGSLSDFYLPMESSRFHWQK--FAGMIAVVSK 259 >UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJ31_ACIJU Length = 252 Score = 121 bits (304), Expect = 2e-26, Method: Compositional matrix adjust. 
Identities = 65/170 (38%), Positives = 105/170 (61%), Gaps = 4/170 (2%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI 125 G+ + + +D+ + +++ AMN G+Y ++ P+GLYIE ++ LN ++G GNFF+ Sbjct: 60 GDFYQKFSNIQSDLAACKELRFAMNAGMYHPNFEPVGLYIEKKKKLSELNESTGFGNFFM 119 Query: 126 RPGGVFYVAGDKVGIVR--LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRN 183 +P GV V D ++ D + + FA QSGPML+ G+IN + + S KIRN Sbjct: 120 QPNGV-VVWNDHGAVIHSTADYKRANFTANFATQSGPMLVHKGLINSQFIKDSNSLKIRN 178 Query: 184 GVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 GVG+ + + F++S+Q NFY FA + K +L V++ LYLDG+IS +Y+K Sbjct: 179 GVGV-RDDHLYFVISEQRINFYQFAKFFKHQLRVDEALYLDGSISSLYLK 227 >UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LB58_RHOVA Length = 247 Score = 121 bits (304), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 64/180 (35%), Positives = 99/180 (55%), Gaps = 2/180 (1%) Query: 57 VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNL 116 V+++WQK +G + L AL G++ A+NGG++ Y P+GL++ENG++ V N Sbjct: 46 VRLFWQKPDGGPYTYLSALPKTDERGGRLAFALNGGMFHPDYKPVGLHVENGRELVRANT 105 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPN 175 G GNF +RP G+FY + G++ AF K + FA QSGPML+ +G ++PRI Sbjct: 106 RPGPGNFHLRPNGIFYFGEAEAGVMETGAFLKKKPKANFATQSGPMLVIDGKLHPRIAKA 165 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD-GTISHMYMKG 234 S+K R+GV + + VF +S F F + L L+LD GT +++ G Sbjct: 166 NVSAKPRDGVCVRGDKSVVFAISDGGVPFDTFMRLFRDGLKCRNALFLDGGTAPALFVPG 225 >UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Aurantimonadaceae RepID=Q0G184_9RHIZ Length = 268 Score = 120 bits (300), Expect = 6e-26, Method: Compositional matrix adjust. 
Identities = 64/190 (33%), Positives = 106/190 (55%), Gaps = 4/190 (2%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI 125 G + T A ++ G+V +AMN G+Y E P+GL +++G+ L +G GNF + Sbjct: 80 GRPYETFEKAAASLS--GEVVLAMNAGMYHEDRRPVGLTVQDGRIVKKAVLGTGSGNFSL 137 Query: 126 RPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNG 184 RP G+FY+ + + + + S + A QSGPML+ G ++PR P S +RNG Sbjct: 138 RPNGIFYLEDGRAFVRETERYLGESHDPVLATQSGPMLLIGGKVHPRFIPTSDSLYVRNG 197 Query: 185 VGINKHGNAVFL-LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 VG+++ G VFL L+++ NFYDFA + + + V+ L+ DG +S + + I ++R Sbjct: 198 VGVSEDGRTVFLALTRKPINFYDFALFFRDTVGVKDALFFDGQVSSLSYRAANIAYRRDR 257 Query: 244 FVTMISVERK 253 M+ V +K Sbjct: 258 LGPMLLVTKK 267 >UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6Z2_9GAMM Length = 304 Score = 118 bits (295), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 77/212 (36%), Positives = 111/212 (52%), Gaps = 13/212 (6%) Query: 48 YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIE 106 Y +P +V ++W+ A+G A+ L L + G +V MN GIY E+ P GL+IE Sbjct: 91 YQADP--AQVSLHWKTADGSAYANLATLKRSLEQSGARVAFLMNAGIYSENDTPAGLWIE 148 Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMEN 165 GQ V LN +G+GNF I+P GVFY+ K I A+ + +AVQSGP+L+ + Sbjct: 149 RGQTLVPLNRKNGKGNFHIQPNGVFYIERGKARIQTSAAYHIGNHHPDWAVQSGPLLLLD 208 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ---ATNFYDFACYAKA--KLNVEQL 220 G NPR N++S RN V F+L++ + + F +A+A L Sbjct: 209 GKPNPRFVKNLSSPHKRNAVCTTADNRLYFILTEDYDLGSEWPSFHRFAEALQHLGCHDA 268 Query: 221 LYLDGTISHMYMKG--GAIPWQRYPFVTMISV 250 LYLDGT+S Y+ G G W Y V +I+V Sbjct: 269 LYLDGTLSGWYIPGIAGTFHWTHY--VGIIAV 298 >UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWY7_HYPNA Length = 249 Score = 118 bits (295), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 65/183 (35%), Positives = 105/183 (57%), Gaps = 5/183 (2%) Query: 55 ERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVA 113 + ++++ + G +G L + S+G + AMN G+Y + P+GLYIE G+ ++ Sbjct: 46 DTIRLFLRDETGVPFGQFDRLANHVASKGGNLVFAMNAGMYHDDRRPVGLYIEEGEAEMN 105 Query: 114 LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS-KEI--QFAVQSGPMLMENGVINP 170 L + G GNF + P GVF++ K G+ AF KE +FA QSGPML+ +G ++P Sbjct: 106 LVRSPGPGNFGMLPNGVFWIDAGKAGVSETLAFDERFKETPPRFATQSGPMLVIDGALHP 165 Query: 171 RIHPNVASSKIRNGVGINKHGNAV-FLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 ++P+ S + RNGVG+++ G V F++S NF+ FA + +L LYLDG +S Sbjct: 166 ALNPDGTSLRRRNGVGVSEDGRQVYFVISDVPVNFHSFARLFRDELGTPNALYLDGAVSK 225 Query: 230 MYM 232 Y+ Sbjct: 226 AYV 228 >UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093S1_STIAU Length = 278 Score = 117 bits (294), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 62/193 (32%), Positives = 107/193 (55%), Gaps = 4/193 (2%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLG 102 T Y V+ +++ Y+Q+ +G + +L L + +G ++ A N G++ + P+G Sbjct: 63 TYDTYEVDLTQSKLRFYFQQPDGTPFSSLGNLRGWLQGRGKRLVFATNAGMFTPARRPVG 122 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT--SKEIQFAVQSGP 160 LY+E+G++ V LN GNFF++P VF+V GI+ A+ ++ +A QSGP Sbjct: 123 LYVEDGREFVGLNTQEEAGNFFLKPNAVFFVTETGAGILESSAYAAHPPAKVLYATQSGP 182 Query: 161 MLMENGVINPRIHPNVAS-SKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQ 219 L+ +G ++P + S R+GVGI VF ++QQA N ++FA + + + + Sbjct: 183 ALLLHGQMHPAFREGSRNLSPRRSGVGIVTPTRVVFAMTQQAVNLHEFASFFRDQFGCQD 242 Query: 220 LLYLDGTISHMYM 232 LYLDG +S MY+ Sbjct: 243 ALYLDGVVSRMYL 255 >UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythrobacter RepID=Q2NAA1_ERYLH Length = 277 Score = 114 bits (285), Expect = 3e-24, Method: Compositional matrix adjust. 
Identities = 65/182 (35%), Positives = 100/182 (54%), Gaps = 3/182 (1%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 A LA+ S V A+N G++D P+G Y+E+ ++ ALN G GNF ++P GVFY Sbjct: 97 AKLAEGRSSAPV-FAVNAGMFDGDGKPIGYYVEDSERLQALNTNDGAGNFHLKPNGVFYG 155 Query: 134 AGDKVGIVRLDAF--KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 + + + ++F S QF QSGPML+ +G ++P I + S +IRNGVG+++ G Sbjct: 156 SNGEWRVRTTESFLANVSDRPQFGTQSGPMLLIDGKLHPEISEDGPSRQIRNGVGVDRQG 215 Query: 192 NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVE 251 A F++S+ +F FA + + N LYLDG +S ++ R P MI VE Sbjct: 216 RAHFVISEGPISFGKFARFFRDVANTPNALYLDGNVSGLWDPANDRMDARAPIGPMIVVE 275 Query: 252 RK 253 + Sbjct: 276 TR 277 >UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodobacterales RepID=B9KP42_RHOSK Length = 245 Score = 112 bits (280), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 68/217 (31%), Positives = 116/217 (53%), Gaps = 11/217 (5%) Query: 28 LFAVAADDCALSDP-----TLTVQAYTVNPQT--ERVKMYWQKANGEAWGTLHALLADIN 80 LFA+ CA ++P T Y++ + ++++ +G +G+ + + ++ Sbjct: 9 LFALWPAACATAEPACRDLTFEGTRYSLCEAQAGDDIRIFQTAPDGRPYGSFERINSALD 68 Query: 81 SQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 +G Q+ AMN G+Y P+GL IE ++ L ++G GNF + P GVF V GD Sbjct: 69 GEGRQLAFAMNAGMYHADRRPVGLLIEEEVERAPLVTSAGPGNFGLLPNGVFCV-GDGFR 127 Query: 140 IVRLDAFKTSKE-IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLL 197 ++ +F + + A QSGPML+ G ++PR + S IRNGVG++ G AVF + Sbjct: 128 VIESRSFAAERPACRHASQSGPMLVIGGELHPRFLVHSDSRYIRNGVGVSADGRRAVFAI 187 Query: 198 SQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 S + F++F + +L + + LY DG+IS +Y +G Sbjct: 188 SNRPVTFHEFGRLFRDELGLPEALYFDGSISRLYDRG 224 >UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26CZ6_9BACT Length = 241 Score = 111 bits (278), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 66/200 (33%), Positives = 104/200 (52%), Gaps = 6/200 (3%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGI 93 D + D ++ + Q ++++YW + + T L + Q + + AMN G+ Sbjct: 24 DLIIKDDRFHIKVIDLTKQ--KLQLYWLDQDNKPIETFEQLNMHVKQQDKRLVYAMNAGM 81 Query: 94 YDESYAPLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVAGD-KVGIVRLDAFKTSKE 151 Y + ++P GLYIENG L+ + G GNF+++P GVFY+ D K + Sbjct: 82 YLKDHSPQGLYIENGTIHKQLDTVTVGYGNFYLQPNGVFYLTQDGKAQVTATPQLSNFSN 141 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA 211 I +A QSGPML+ N I+P + + IRN VGI G + +S++ NFYDFA + Sbjct: 142 ITYATQSGPMLVINDTIHPAFNKGSKNVHIRNAVGILPDGRILLAISKEKINFYDFATFF 201 Query: 212 KAKLNVEQLLYLDGTISHMY 231 K + + LYLDG +S +Y Sbjct: 202 KNQ-GCKNALYLDGFVSRIY 220 >UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter cryohalolentis K5 RepID=Q1QCK8_PSYCK Length = 276 Score = 111 bits (277), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 64/196 (32%), Positives = 111/196 (56%), Gaps = 11/196 (5%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANG-EAWGTLHALLADINSQGQVQMAMNGGIYDESYAP 100 T +Q+ + + + ++WQ+++ + T LL+ + ++ AMN G+Y+E+YAP Sbjct: 56 TCHIQSDLLTNKRYSLALFWQQSDSRQPLLTFDNLLSTLPPSQSLKFAMNAGMYNENYAP 115 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL-DAFKTSKEIQ------ 153 +G + ++ ALNL G GNF + P GV + DK G V++ ++ +++++ Sbjct: 116 IGYTVIKSEEIRALNLKEGGGNFHLLPNGVLW--WDKSGKVQITESNALAEQLKNGIAQP 173 Query: 154 -FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 +A QSGPML+ N I+P+ P+ S+KIRNG+G+ G+ F+ S+ FY FA K Sbjct: 174 LYATQSGPMLVINDAIHPQFDPDGTSAKIRNGIGVCSDGSLQFVNSEAPVAFYQFASLFK 233 Query: 213 AKLNVEQLLYLDGTIS 228 +L L+LDG I+ Sbjct: 234 NELKCPNALFLDGGIA 249 >UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WGQ7_PSYWF Length = 309 Score = 110 bits (276), Expect = 4e-23, Method: Compositional matrix adjust. 
Identities = 68/191 (35%), Positives = 103/191 (53%), Gaps = 13/191 (6%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI 105 QA PQ V Q + E L+ D+ +++ A N G+YD ++AP+G + Sbjct: 91 QAGENQPQAAIVD---QDKSHEPLYKFDTLIKDLPKDSELKFAANAGMYDGNFAPIGYTV 147 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV------RLDAFKTSKEIQ--FAVQ 157 G+Q ++LNL G GNF + P GV + DK V +LDA S E + +A Q Sbjct: 148 IQGRQILSLNLKQGGGNFHLLPNGVLWW--DKANHVHITESTQLDAMLKSGEAKPWYATQ 205 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 SGPML+ +G I+P+ + + S KIRNGVG+ F+ S++ NFY FA + K L+ Sbjct: 206 SGPMLVIDGHIHPKFNSDSTSKKIRNGVGVCDGSQIHFVTSREPVNFYQFARFFKEDLHC 265 Query: 218 EQLLYLDGTIS 228 + L+LDG ++ Sbjct: 266 DNALFLDGGVA 276 >UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax=Rhizobiales RepID=Q1MEZ5_RHIL3 Length = 258 Score = 108 bits (270), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 58/185 (31%), Positives = 96/185 (51%), Gaps = 8/185 (4%) Query: 49 TVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIEN 107 T+ P ++++W+ A+G + +L + ++G+ + A+N G+Y ++P+GLY+EN Sbjct: 44 TLEPGKADLRLFWKNADGAPYRAFSSLAEAVRAEGRTLAFAVNAGMYRADFSPMGLYVEN 103 Query: 108 GQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGP 160 G++ N E NF+ +P GVF++ GI+ D F K + +FA QSGP Sbjct: 104 GRELNPANTTEAESSSGQVPNFYKKPNGVFFLGETGAGILPTDEFLKRRPKARFATQSGP 163 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 ML+ +NP R+GVG + G F +S+ NF+DFA + L Sbjct: 164 MLVIANKLNPIFIVGSTDRTRRSGVGTCERGAVRFAISEDRVNFHDFARLFRDHLKCPDA 223 Query: 221 LYLDG 225 L+LDG Sbjct: 224 LFLDG 228 >UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PYM1_9GAMM Length = 271 Score = 106 bits (265), Expect = 6e-22, Method: Compositional matrix adjust. 
Identities = 63/181 (34%), Positives = 96/181 (53%), Gaps = 16/181 (8%) Query: 59 MYWQKANGEA------WGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 ++WQ + + + TL L + AMN G+YD ++AP+G + NG+Q Sbjct: 64 LHWQNPSSASHPLLLTFTTLRDYLVSEQPAKTLLFAMNAGMYDSNFAPIGYTVINGKQIR 123 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI------QFAVQSGPMLMENG 166 ALNL G GNF + P GVF+ D+ G ++ +K++ FA QSGPML+ +G Sbjct: 124 ALNLKQGGGNFHLMPNGVFW--QDRQGFYITESQSMAKKLASGAKPTFATQSGPMLVIDG 181 Query: 167 VINPRIHPNVASSKIRNGVGINKH--GNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 I+P N S K RNG+G+ H F++S +FY+FA K++L + L+LD Sbjct: 182 NIHPAFDANSTSRKYRNGIGVCGHNPSRVKFVISDTPVSFYEFADLFKSQLGCDNALFLD 241 Query: 225 G 225 G Sbjct: 242 G 242 >UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B5U0_PARDP Length = 251 Score = 106 bits (264), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 74/234 (31%), Positives = 114/234 (48%), Gaps = 14/234 (5%) Query: 12 ITLNLKR----IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNP----QTERVKMYWQK 63 + ++LKR F AL + L A+A C D Q Y + Q ++++ Sbjct: 1 MKIDLKRRLGLAFGALIAMTLPALAGI-CEKRD--FDGQGYVICTLTAGQEPGLRLWLNG 57 Query: 64 ANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNF 123 +G G A+ + + AMN G+Y + P+GLY+ +G + L A G GNF Sbjct: 58 PDGRTLGDFTAVRRTLAQGESLGFAMNAGMYHPDFTPVGLYVSDGVSQHDLVTAGGGGNF 117 Query: 124 FIRPGGVFYVAGDK-VGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 + P GVF G + ++ AF K + + + A QSGPML+ +G ++PR + S I Sbjct: 118 GMLPNGVFCAGGARPYQVIESRAFAKAAPDCRLATQSGPMLVIDGALHPRFLVDSDSRYI 177 Query: 182 RNGVGINKHG-NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 RNGVG++ G A F +S +A F+ F + L LY DG+IS +Y G Sbjct: 178 RNGVGVSPDGQTAWFAISDRAVTFHQFGRLFRDGLGARDALYFDGSISRLYAPG 231 >UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetobacter RepID=B2HYZ5_ACIBC Length = 204 Score = 105 bits (262), Expect = 1e-21, Method: Compositional matrix adjust. 
Identities = 60/171 (35%), Positives = 94/171 (54%), Gaps = 13/171 (7%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 ++ + NPQT+ + + + + + + Q+ AMNGG++ ++P+GLY Sbjct: 45 LRLFLKNPQTD-----------QYYKSFDNIQYQLKACEQLTFAMNGGMFHSGFSPVGLY 93 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLM 163 IENG++ LN G GNFF++P GV + I+ + +K + +A QSGPML+ Sbjct: 94 IENGRESQPLNEDKGWGNFFLQPNGVLAWNDKQAVILTTEQYKAKVFQPDYATQSGPMLV 153 Query: 164 ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK 214 NG INP N S KIRNGVG+ K+ F++S+ NFY FA + + K Sbjct: 154 INGKINPLFLANSDSKKIRNGVGV-KNNKLYFVISKNRVNFYSFAQFFQKK 203 >UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744B4D Length = 235 Score = 100 bits (249), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 60/183 (32%), Positives = 97/183 (53%), Gaps = 11/183 (6%) Query: 56 RVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVAL 114 R+ + W +G+ G+ LL + QG+ ++ A N GIY+ P GL I G++ V L Sbjct: 29 RLDLRWLGQDGKPLGSFGPLLQEAARQGRRIEFATNAGIYERGPKPCGLTIAGGKELVPL 88 Query: 115 NLASGEGNFFIRPGGVFYVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLMENGVINPR 171 NLA GEGNF++ P GVFY+ D+ G + + ++ + + A QSGP+L+ G I+P Sbjct: 89 NLAKGEGNFYLHPNGVFYLD-DQTGAGVMTGAEYGQSGLQPRLATQSGPILLRQGKIHPA 147 Query: 172 IHPNVASSKIRNGVGIN-KHGNAVFLLSQQATNFYDFACYAKAK-----LNVEQLLYLDG 225 + N + ++RN VG+ G VF++S + + + L + L+LDG Sbjct: 148 FNFNSPNRRLRNAVGVRASDGQVVFVMSDREDRVKGRVTFHQLSRFFLHLGCQDALFLDG 207 Query: 226 TIS 228 IS Sbjct: 208 DIS 210 >UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucella sp. 83/13 RepID=D1CZ42_9RHIZ Length = 248 Score = 81.6 bits (200), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 44/135 (32%), Positives = 74/135 (54%), Gaps = 7/135 (5%) Query: 98 YAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLDAF-KTSK 150 ++PLGL+I +G+++ + A + NF+ +P G+F++ G++ + F K Sbjct: 84 FSPLGLFIADGKEQSPIQPAGAKTSDKPVPNFYKKPNGIFFLDESGAGLLPTEQFVKRRP 143 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 ++ A QSGPML+ +NP A R+GVG+ K G F++S A NF+DFA + Sbjct: 144 KVWLATQSGPMLVIENRLNPIFIIGSADKSRRSGVGVCKDGVIHFVVSDDAVNFHDFARF 203 Query: 211 AKAKLNVEQLLYLDG 225 + +L L+LDG Sbjct: 204 FRDRLECPNALFLDG 218 >UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REU9_LEGLO Length = 260 Score = 64.7 bits (156), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 12/115 (10%) Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +++NGG +D + PLGL I NG+ + L S GVF++ +K I L F Sbjct: 98 LSINGGFFDHKFNPLGLRITNGKLENPLKRISW--------WGVFFIKNNKAYISSLRQF 149 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 + +I FA+QSGP L+ N I P + P +A R+ +GI G + L++ A Sbjct: 150 QYDNDIDFAIQSGPRLLVNRKI-PSLKPGIAE---RSALGITADGKIILLVTTNA 200 >UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legionella RepID=Q5WVS5_LEGPL Length = 258 Score = 62.4 bits (150), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 39/115 (33%), Positives = 56/115 (48%), Gaps = 12/115 (10%) Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +++NGG +D + PLGL I N +Q+ L S G+FYV +K I + F Sbjct: 98 LSINGGFFDHEFNPLGLRINNKKQENPLKRISW--------WGIFYVKDNKPRITNIRNF 149 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 I FA+QSGP L+ G I P + VA R +GI G + L++ A Sbjct: 150 HYDSNIDFAIQSGPRLLIRGNI-PSLKAGVAD---RTALGITDDGKVIILVTTNA 200 >UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella burnetii RepID=A9KDD2_COXBN Length = 255 Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 30/112 (26%), Positives = 56/112 (50%), Gaps = 12/112 (10%) Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +A+NGG + + PLGL I + + +L S G+F + ++ I + Sbjct: 95 LAINGGFFTPNLEPLGLRISDNKVLSSLKRISW--------WGIFMIKNNRAAITSPQNY 146 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + S EI FA+Q+GP L+ +G I P++ S R+ +G+ G+ + ++ Sbjct: 147 RYSPEINFAIQAGPRLIIDGRI-PQLR---GGSAQRSALGVTPTGDIIIAIT 194 >UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseiflexus RepID=A5USB9_ROSS1 Length = 282 Score = 47.4 bits (111), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 52/221 (23%), Positives = 99/221 (44%), Gaps = 29/221 (13%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGE---AWGTLHALLADINSQGQVQMAMNGGIYD 95 SDP + + A ++P T R+++ + + W H L +A+NGG + Sbjct: 72 SDPPVPIYAVRLDPATIRLRIRYAPDAPQPLRTWFVAHRPL----------VAVNGGFFT 121 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD-KVGI--VRLDAFKTSKEI 152 L + +G G + GG+ A D +V I +R + + + + Sbjct: 122 AENRATALIVSDGTVY---------GTSYAGFGGMLAAAPDGRVWIQALRDEPYDPNIPL 172 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQATNFYDFACY- 210 A+QS PML+ G + I+ N ++ R V I++ G + ++ A + + A + Sbjct: 173 DQAIQSFPMLIYPGGVVASINDNGQRAR-RTVVAIDRAGRVLLIVCPTSAFSLQELATWL 231 Query: 211 AKAKLNVEQLLYLDG-TISHMYMKGGAIPWQRYPFVTMISV 250 A + + +++ L LDG + S +++ GA+ WQ F + SV Sbjct: 232 ASSDMEIDRALNLDGGSSSGIFVNAGAVRWQIDSFAALPSV 272 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobact... 372 e-102 UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobi... 319 6e-86 UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobac... 289 5e-77 UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylo... 281 2e-74 UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteob... 
279 7e-74 UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcu... 279 8e-74 UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepI... 277 3e-73 UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bactero... 274 2e-72 UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptuniu... 268 1e-70 UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebalde... 266 7e-70 UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacter... 264 3e-69 UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoc... 264 3e-69 UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodob... 264 3e-69 UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea... 262 6e-69 UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmat... 256 5e-67 UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomi... 256 5e-67 UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Auranti... 253 3e-66 UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter c... 252 7e-66 UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteri... 252 1e-65 UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax... 251 2e-65 UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythro... 248 1e-64 UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acineto... 246 4e-64 UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydro... 246 6e-64 UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychro... 241 2e-62 UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC... 240 3e-62 UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelo... 239 6e-62 UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitino... 232 1e-59 UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetoba... 220 3e-56 UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=... 212 9e-54 UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucell... 
190 3e-47 UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseifl... 173 5e-42 UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella b... 148 2e-34 UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legione... 143 4e-33 UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legione... 142 8e-33 Sequences not found previously or not previously below threshold: UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpeto... 94 4e-18 UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chlorof... 81 4e-14 UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochr... 76 1e-12 UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoni... 74 4e-12 UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-la... 69 2e-10 UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victiva... 68 4e-10 UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lacto... 66 1e-09 UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkerma... 65 3e-09 UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=... 63 6e-09 UniRef50_C6J074 Copper amine oxidase domain-containing protein n... 63 1e-08 UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-la... 63 1e-08 UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-la... 63 1e-08 UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=B... 63 1e-08 UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=... 61 4e-08 UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 T... 61 5e-08 UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmati... 59 2e-07 UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY... 59 2e-07 UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alic... 58 2e-07 UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-la... 58 4e-07 UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtili... 57 6e-07 UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=P... 
57 6e-07 UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bactero... 55 2e-06 UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=B... 55 2e-06 UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 55 3e-06 UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Trepone... 55 3e-06 UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bactero... 54 5e-06 UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc ... 53 7e-06 UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=... 53 9e-06 UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Trepone... 53 9e-06 UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellacea... 53 1e-05 UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 53 1e-05 UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 68... 53 1e-05 UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=... 52 1e-05 UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related t... 52 2e-05 UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=... 52 2e-05 UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57... 52 2e-05 UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacter... 52 2e-05 UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfo... 51 3e-05 UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=T... 51 3e-05 UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=A... 51 3e-05 UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Dein... 51 4e-05 UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=S... 51 4e-05 UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactoba... 51 5e-05 UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 51 5e-05 UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillu... 50 6e-05 UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related t... 
50 6e-05 UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=F... 50 7e-05 UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinom... 50 7e-05 UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C... 50 9e-05 UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=... 50 1e-04 UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiob... 50 1e-04 UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bactero... 49 2e-04 UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillon... 49 2e-04 UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 ... 49 2e-04 UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chrooco... 48 2e-04 UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natr... 48 2e-04 UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bactero... 48 4e-04 UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanoth... 47 5e-04 UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostr... 47 5e-04 UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonel... 47 6e-04 UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=A... 47 6e-04 UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiob... 47 6e-04 UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryoc... 47 7e-04 UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteri... 47 7e-04 UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bactero... 47 7e-04 UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomacu... 46 9e-04 UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN... 46 0.001 UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synecho... 46 0.001 UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bactero... 46 0.001 UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostri... 46 0.001 UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax... 
46 0.001 UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing pro... 46 0.001 UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya... 46 0.002 UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 ... 46 0.002 UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobi... 45 0.002 UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobaci... 45 0.002 UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=c... 45 0.002 UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Cop... 45 0.002 UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria Rep... 45 0.002 UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=No... 45 0.002 UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacilla... 45 0.002 UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya... 45 0.002 UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paeniba... 45 0.003 UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevote... 45 0.003 UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microco... 45 0.003 UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostri... 45 0.003 UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q... 45 0.003 UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sp... 45 0.003 UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Breviba... 44 0.004 UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=... 44 0.004 UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Breviba... 44 0.004 UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryoc... 44 0.004 UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya... 44 0.004 UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 44 0.005 UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimi... 44 0.006 UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=No... 
43 0.008 UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfo... 43 0.008 UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bactero... 43 0.008 UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cave... 43 0.010 UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elon... 43 0.011 UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus seleniti... 43 0.011 UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtili... 43 0.012 UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermos... 43 0.012 UniRef50_UPI000190570B hypothetical protein RetlG_24562 n=1 Tax=... 43 0.012 UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=... 43 0.013 UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcy... 43 0.013 UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mir... 42 0.018 UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus Re... 42 0.020 UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synecho... 42 0.022 UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoa... 41 0.026 UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=P... 41 0.028 UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=C... 41 0.042 UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bactero... 41 0.051 UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora eryth... 40 0.056 UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synecho... 40 0.070 >UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobacteriaceae RepID=YIGE_ECOLI Length = 254 Score = 372 bits (955), Expect = e-102, Method: Composition-based stats. 
Identities = 254/254 (100%), Positives = 254/254 (100%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY Sbjct: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 Query: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE Sbjct: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK Sbjct: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ Sbjct: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 Query: 241 RYPFVTMISVERKG 254 RYPFVTMISVERKG Sbjct: 241 RYPFVTMISVERKG 254 >UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobiales RepID=B2II06_BEII9 Length = 269 Score = 319 bits (818), Expect = 6e-86, Method: Composition-based stats. 
Identities = 73/245 (29%), Positives = 129/245 (52%), Gaps = 2/245 (0%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T L R+FL L L A A L++ + + + ++++WQ+ G+ +G Sbjct: 17 IFTKLLMRVFLPLFLSAGTAWAEPCLPLTEEGINYVVCRFDTKRSDLRLFWQQPGGQPYG 76 Query: 71 TLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 L A + +G+ ++ AMN G++ E +P+GLYI+ G+ N+ +G GNF ++P G Sbjct: 77 GFAPLRAQLQPKGETLEFAMNAGMFQEDLSPVGLYIQEGRLLHPANMRNGPGNFHMKPNG 136 Query: 130 VFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGIN 188 +FY + G++ F ++ + +A QSGP+L+ N ++P+I P S KIRNGVG+ Sbjct: 137 IFYFSQTSAGVMETGRFLQSGLKPDYATQSGPLLVANNQLHPKIEPTGTSEKIRNGVGVR 196 Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + +F +S+ F+ FA + +L+ L+LDG+IS +Y Q P ++ Sbjct: 197 DNHEVIFAISEAPVTFFRFARLFRDRLHCPDALFLDGSISSLYAPSLNRDDQWRPIGPIV 256 Query: 249 SVERK 253 K Sbjct: 257 GAVSK 261 >UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9JX75_AGRVS Length = 274 Score = 289 bits (740), Expect = 5e-77, Method: Composition-based stats. 
Identities = 85/243 (34%), Positives = 133/243 (54%), Gaps = 6/243 (2%) Query: 16 LKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV---NPQTERVKMYWQKANGEAWGTL 72 L I L + P A A++ + D T AY V +P T ++++ + A+G+ +G Sbjct: 30 LFAILSPLVISPERA-EAEEQSCRDQTENGFAYRVCRFDPATRTIRIFNRNADGDVYGGF 88 Query: 73 HALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 AL + + Q + A+NGG+Y +P+GL+++ G + A G GNF+++P GVF Sbjct: 89 EALRSQLWQQRLILTFAVNGGMYHSDLSPVGLFVDYGMTRKTAETADGWGNFYLKPNGVF 148 Query: 132 YVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 ++ G++ F+T K E FA QSGPML+ +GV++P+ P S KIRNGVGI+ Sbjct: 149 FLKDGHAGVLETGQFETQKIEADFATQSGPMLVIDGVLHPKFLPTSDSLKIRNGVGIDAS 208 Query: 191 GNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 G VF+LS+ FYD A + + +L LYLDGTIS + + YP +I+V Sbjct: 209 GQVVFVLSKDPVRFYDMAAFFRDRLGAANALYLDGTISSLAEPMAGRIDRAYPLGPIIAV 268 Query: 251 ERK 253 + Sbjct: 269 VDQ 271 >UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylobacterium extorquens group RepID=A9W4Y6_METEP Length = 258 Score = 281 bits (719), Expect = 2e-74, Method: Composition-based stats. 
Identities = 76/235 (32%), Positives = 123/235 (52%), Gaps = 4/235 (1%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 + + P A A+ TV+ + ERV+++W +G +G+L +L Sbjct: 26 VPVQAQPAPAAKGPCQAVEFEGQPYTVCTVDLRRERVRLFWLGTDGLPYGSLSSL--ADR 83 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 ++ AMN G+YD+ AP+GLY+E+G++ + A+G GNF ++P GVFYV GD+ G+ Sbjct: 84 QGPRLSFAMNAGMYDKGQAPVGLYVEDGRELKGASTANGPGNFHLKPNGVFYVKGDRAGV 143 Query: 141 VRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA-VFLLS 198 + + + FA QSGPML+ +G I+P+I + S KIRNGVG+ G+ VF +S Sbjct: 144 LDTGRYLRAKPAPDFATQSGPMLVIDGKIHPKISADGPSQKIRNGVGVRDGGHVAVFAIS 203 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 ++ F FA K L+LDG++S +Y G P ++ + Sbjct: 204 ERPVTFGAFARLFKDSFGCRNALFLDGSVSSLYAPGLGRSDLSRPLGPLVGAVGR 258 >UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteobacteria RepID=A9CIN9_AGRT5 Length = 254 Score = 279 bits (714), Expect = 7e-74, Method: Composition-based stats. Identities = 69/212 (32%), Positives = 114/212 (53%), Gaps = 3/212 (1%) Query: 45 VQAYTVNPQTERVKMYWQK-ANGEAWGTLHALLADINSQGQV-QMAMNGGIYDESYAPLG 102 + +P +++Y Q +G+ + L + + Q AMNGG+Y Y+P+G Sbjct: 40 YTVCSFDPAKNTIRIYDQDHVSGQGYRNFADLSSALWRQHMFSVFAMNGGMYHSDYSPVG 99 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPM 161 L++ENG ++ ++ G GNF + P GVFY+ G+ G++ +A+ + FA QSGPM Sbjct: 100 LFVENGVERSPVSTRGGWGNFHLLPNGVFYLDGNTAGVLETEAYLAADPKPDFATQSGPM 159 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLL 221 L+ +G ++PR P+ S K RNGVG+++ G F +S+ FYDF + L+ L Sbjct: 160 LVIDGKLHPRFLPDSDSLKRRNGVGVSRDGMVHFAISETTVRFYDFGTLFRDVLDAPNAL 219 Query: 222 YLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 YLDGTIS + + Q + +I+V + Sbjct: 220 YLDGTISSVDIPAMNRRDQLFSMGPIIAVVDR 251 >UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcus RepID=Q1IX28_DEIGD Length = 317 Score = 279 bits (713), Expect = 8e-74, Method: Composition-based stats. 
Identities = 79/245 (32%), Positives = 127/245 (51%), Gaps = 11/245 (4%) Query: 15 NLKRIFLALTLLPLFAVA-ADDCALSDPTLTVQAYTV---NPQTERVKMYWQKAN-GEAW 69 N+ RIF+ LLPL A + A + T YTV + + + ++++W+ G+ + Sbjct: 77 NVLRIFV--LLLPLTACSQAGGLDVRRVTAEGMLYTVAAVDLKRDHLRLHWKNPATGQPY 134 Query: 70 GTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 T + A + G QV A N GIY PLGL++E G+ + LN A GNF + P Sbjct: 135 RTFAEVSARLRKDGEQVLFATNSGIYGPGLEPLGLHVEEGRTLIGLNNARSGGNFALLPN 194 Query: 129 GVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF+V G++ G+ A++ + + FA QSGP+L++ G ++P + +S K+R+GVG+ Sbjct: 195 GVFWVKGNQAGVTETQAYRRLNIQPTFATQSGPLLVQGGRLHPAFNKGSSSFKVRSGVGV 254 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G F +S NF+ FA + + L LYLDG+IS Q F + Sbjct: 255 CRDGRVRFAVSAGPVNFHSFAVFFRDVLGCPDALYLDGSISAYATPDADT--QVADFAGI 312 Query: 248 ISVER 252 ++ R Sbjct: 313 WTISR 317 >UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepID=Q98NI9_RHILO Length = 263 Score = 277 bits (709), Expect = 3e-73, Method: Composition-based stats. 
Identities = 71/233 (30%), Positives = 117/233 (50%), Gaps = 5/233 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAY---TVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + D +Y V+P+ ++++W+ G+ + +LH L A + Sbjct: 29 MAFSQWFVSLPPCRDFAFEATSYLICEVDPKLYSIELFWKDPVGKPFQSLHNLDAAQRAA 88 Query: 83 GQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 G+ + A+N G+Y P+GLY+E G++ + SG GNF ++P G+FY++G K + Sbjct: 89 GRTMLFAINAGMYHPDLRPVGLYVERGREMAGVRTGSGSGNFSLQPNGIFYISGGKAAVR 148 Query: 142 RLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 F + +A QSGPML+ +G ++P+ + S K R+GVG+ K G AVF +S Sbjct: 149 ATRDFVRKRPSTDYATQSGPMLVIDGQLHPKFQSDGTSRKTRDGVGVRKDGVAVFAISNG 208 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 NF+ FA + L + L+LDGTIS ++ + MI V RK Sbjct: 209 TVNFHTFARLFRDALGCDNALFLDGTISSLFAPAIGRNDDYWNLGPMIGVFRK 261 >UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=Q11X50_CYTH3 Length = 244 Score = 274 bits (701), Expect = 2e-72, Method: Composition-based stats. Identities = 83/217 (38%), Positives = 124/217 (57%), Gaps = 3/217 (1%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDES 97 T+ V +YTV+PQ + ++ YW+ NGE ++ L A + S+G + A NGG+Y E Sbjct: 25 QQDTIDVISYTVDPQKDNLQFYWKNDNGEILKSIKKLKAYVESKGSTLLFATNGGMYKED 84 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RLDAFKTSKEIQFAV 156 +PLGL+I+NG+ LN A G+GNF+++P GVFY+ D ++ + + F + I+FA Sbjct: 85 RSPLGLFIQNGKTVTPLNKAKGQGNFYMQPNGVFYITNDNEAVICKTEDFINNGNIKFAT 144 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 QSGPM++ N I+P + IRNGVGI + +F +S++ NF+DFA Y + L Sbjct: 145 QSGPMIIVNNQIHPSFIKGSKNLNIRNGVGILPNKKIIFAMSEKEVNFFDFALYFQN-LG 203 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 E LYLDG +S Y+ F MI V K Sbjct: 204 CENALYLDGFVSRSYLLEKKWLQTDGEFGVMIGVTEK 240 >UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWY7_HYPNA Length = 249 Score = 268 bits (686), Expect = 1e-70, Method: Composition-based stats. 
Identities = 66/226 (29%), Positives = 112/226 (49%), Gaps = 5/226 (2%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ-GQVQMAMNG 91 S L + + + ++++ + G +G L + S+ G + AMN Sbjct: 24 GPCQTRSFENLPYLVCSFDASQDTIRLFLRDETGVPFGQFDRLANHVASKGGNLVFAMNA 83 Query: 92 GIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKE 151 G+Y + P+GLYIE G+ ++ L + G GNF + P GVF++ K G+ AF + Sbjct: 84 GMYHDDRRPVGLYIEEGEAEMNLVRSPGPGNFGMLPNGVFWIDAGKAGVSETLAFDERFK 143 Query: 152 ---IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNFYDF 207 +FA QSGPML+ +G ++P ++P+ S + RNGVG+++ G F++S NF+ F Sbjct: 144 ETPPRFATQSGPMLVIDGALHPALNPDGTSLRRRNGVGVSEDGRQVYFVISDVPVNFHSF 203 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 A + +L LYLDG +S Y+ ++ V R+ Sbjct: 204 ARLFRDELGTPNALYLDGAVSKAYVPALERSETGLDMGPIVGVIRE 249 >UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AL67_SEBTE Length = 266 Score = 266 bits (679), Expect = 7e-70, Method: Composition-based stats. 
Identities = 96/217 (44%), Positives = 134/217 (61%), Gaps = 4/217 (1%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + D TV Y + E +KMYW+ N +A+ L + + N+ ++ A NGGIY E Sbjct: 52 IEDRGFTV--YKPDLNKEIIKMYWKDENNKAYSELSKFIQE-NTGNKINFATNGGIYSEE 108 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 Y P GLYIEN + +NLA GEGNF+++P GVFY+ ++ I AF+ ++ I +A Q Sbjct: 109 YEPNGLYIENHKIISKINLADGEGNFYMQPNGVFYIQNNQPKISESKAFEYNENISYATQ 168 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 SGP+L+ENGVIN +I N S KIR+ VGI++ FL+S + NFYDF+ YA KLN Sbjct: 169 SGPLLIENGVINKKIGKNSESFKIRSAVGIDRENKVFFLMSSEKINFYDFSKYALDKLNC 228 Query: 218 EQLLYLDGTISHMYM-KGGAIPWQRYPFVTMISVERK 253 + LL+LDG IS MY IP Q YPF +I+ E++ Sbjct: 229 KDLLFLDGAISKMYFADEKKIPEQDYPFAVIITSEKR 265 >UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26CZ6_9BACT Length = 241 Score = 264 bits (674), Expect = 3e-69, Method: Composition-based stats. Identities = 69/219 (31%), Positives = 110/219 (50%), Gaps = 6/219 (2%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ-GQVQMAMNGGI 93 D + D ++ ++ +++++YW + + T L + Q ++ AMN G+ Sbjct: 24 DLIIKDDRFHIKV--IDLTKQKLQLYWLDQDNKPIETFEQLNMHVKQQDKRLVYAMNAGM 81 Query: 94 YDESYAPLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVAGD-KVGIVRLDAFKTSKE 151 Y + ++P GLYIENG L+ + G GNF+++P GVFY+ D K + Sbjct: 82 YLKDHSPQGLYIENGTIHKQLDTVTVGYGNFYLQPNGVFYLTQDGKAQVTATPQLSNFSN 141 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA 211 I +A QSGPML+ N I+P + + IRN VGI G + +S++ NFYDFA + Sbjct: 142 ITYATQSGPMLVINDTIHPAFNKGSKNVHIRNAVGILPDGRILLAISKEKINFYDFATFF 201 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 K + + LYLDG +S +Y + F MI V Sbjct: 202 KNQ-GCKNALYLDGFVSRIYDPTINVEQMDGHFGVMIGV 239 >UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B5U0_PARDP Length = 251 Score = 264 bits (674), Expect = 3e-69, Method: Composition-based stats. 
Identities = 75/248 (30%), Positives = 116/248 (46%), Gaps = 14/248 (5%) Query: 12 ITLNLKR----IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNP----QTERVKMYWQK 63 + ++LKR F AL + L A+A C D Q Y + Q ++++ Sbjct: 1 MKIDLKRRLGLAFGALIAMTLPALAGI-CEKRD--FDGQGYVICTLTAGQEPGLRLWLNG 57 Query: 64 ANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNF 123 +G G A+ + + AMN G+Y + P+GLY+ +G + L A G GNF Sbjct: 58 PDGRTLGDFTAVRRTLAQGESLGFAMNAGMYHPDFTPVGLYVSDGVSQHDLVTAGGGGNF 117 Query: 124 FIRPGGVFYVAGDKVG-IVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 + P GVF G + ++ AF K + + + A QSGPML+ +G ++PR + S I Sbjct: 118 GMLPNGVFCAGGARPYQVIESRAFAKAAPDCRLATQSGPMLVIDGALHPRFLVDSDSRYI 177 Query: 182 RNGVGINKHGN-AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 RNGVG++ G A F +S +A F+ F + L LY DG+IS +Y G Sbjct: 178 RNGVGVSPDGQTAWFAISDRAVTFHQFGRLFRDGLGARDALYFDGSISRLYAPGLGRADF 237 Query: 241 RYPFVTMI 248 +I Sbjct: 238 GRRLGPII 245 >UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodobacterales RepID=B9KP42_RHOSK Length = 245 Score = 264 bits (674), Expect = 3e-69, Method: Composition-based stats. 
Identities = 71/242 (29%), Positives = 121/242 (50%), Gaps = 8/242 (3%) Query: 17 KRIFLALTLLPLF--AVAADDCALSDPTLTVQAYTVNPQT--ERVKMYWQKANGEAWGTL 72 R LA L L+ A A + A D T Y++ + ++++ +G +G+ Sbjct: 1 MRTRLAAILFALWPAACATAEPACRDLTFEGTRYSLCEAQAGDDIRIFQTAPDGRPYGSF 60 Query: 73 HALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 + + ++ +G+ + AMN G+Y P+GL IE ++ L ++G GNF + P GVF Sbjct: 61 ERINSALDGEGRQLAFAMNAGMYHADRRPVGLLIEEEVERAPLVTSAGPGNFGLLPNGVF 120 Query: 132 YVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 V GD ++ +F + A QSGPML+ G ++PR + S IRNGVG++ Sbjct: 121 CV-GDGFRVIESRSFAAERPACRHASQSGPMLVIGGELHPRFLVHSDSRYIRNGVGVSAD 179 Query: 191 G-NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 G AVF +S + F++F + +L + + LY DG+IS +Y +G P ++ Sbjct: 180 GRRAVFAISNRPVTFHEFGRLFRDELGLPEALYFDGSISRLYDRGARRSDWGTPMGPIVG 239 Query: 250 VE 251 + Sbjct: 240 LV 241 >UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9D6B9_9RHIZ Length = 286 Score = 262 bits (671), Expect = 6e-69, Method: Composition-based stats. 
Identities = 68/234 (29%), Positives = 112/234 (47%), Gaps = 6/234 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA----LLADINS 81 + T++PQT +++ ++ G+ G++ A L A + Sbjct: 47 MTKPDWPEGCVEQVFEGARAILCTIDPQTHDMRLVYRDRMGDVLGSVSAVVDQLAAGAGT 106 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-AGDKVGI 140 ++ +AMN G+Y +P+GLY+EN + ALN G GNFF++P GVF+V G+ Sbjct: 107 DHKLVLAMNAGMYHADMSPVGLYVENSVEIAALNRDDGFGNFFLKPNGVFFVLKDGNAGV 166 Query: 141 VRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + DA+ ++A QSGPML+ +GVI+PR P+ S IRNGVG+ G VF +++ Sbjct: 167 LETDAYAEADLSPEYATQSGPMLVIDGVIHPRFLPDGTSKFIRNGVGVRPDGKVVFAITR 226 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + FA + E L+ DG +S + + P + V + Sbjct: 227 DRVSLGSFARLFRDVAGCENALFFDGAVSSLALGSKMEIDSEEPAGPVAVVVAR 280 >UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093S1_STIAU Length = 278 Score = 256 bits (654), Expect = 5e-67, Method: Composition-based stats. 
Identities = 78/262 (29%), Positives = 126/262 (48%), Gaps = 23/262 (8%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPT------------LTVQAYTVNP 52 LLIG G+ T A LL A +L PT T Y V+ Sbjct: 19 LLIGSGLGT-------GATHLLAAPHTPAATRSLQTPTGRVAARRIAYRGNTYDTYEVDL 71 Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQK 111 +++ Y+Q+ +G + +L L + +G ++ A N G++ + P+GLY+E+G++ Sbjct: 72 TQSKLRFYFQQPDGTPFSSLGNLRGWLQGRGKRLVFATNAGMFTPARRPVGLYVEDGREF 131 Query: 112 VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ--FAVQSGPMLMENGVIN 169 V LN GNFF++P VF+V GI+ A+ + +A QSGP L+ +G ++ Sbjct: 132 VGLNTQEEAGNFFLKPNAVFFVTETGAGILESSAYAAHPPAKVLYATQSGPALLLHGQMH 191 Query: 170 PRIHPNVAS-SKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS 228 P + S R+GVGI VF ++QQA N ++FA + + + + LYLDG +S Sbjct: 192 PAFREGSRNLSPRRSGVGIVTPTRVVFAMTQQAVNLHEFASFFRDQFGCQDALYLDGVVS 251 Query: 229 HMYMKGGAIPWQRYPFVTMISV 250 MY+ F MI++ Sbjct: 252 RMYLPALGRDELDGDFGAMIAI 273 >UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LB58_RHOVA Length = 247 Score = 256 bits (654), Expect = 5e-67, Method: Composition-based stats. 
Identities = 74/240 (30%), Positives = 116/240 (48%), Gaps = 6/240 (2%) Query: 19 IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTER---VKMYWQKANGEAWGTLHAL 75 F+A+ + AA YT+ + V+++WQK +G + L AL Sbjct: 6 AFIAMAAFCGSSEAAA-QTCKPYAFEGNGYTLCEASLDRFAVRLFWQKPDGGPYTYLSAL 64 Query: 76 LADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 G++ A+NGG++ Y P+GL++ENG++ V N G GNF +RP G+FY Sbjct: 65 PKTDERGGRLAFALNGGMFHPDYKPVGLHVENGRELVRANTRPGPGNFHLRPNGIFYFGE 124 Query: 136 DKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAV 194 + G++ AF K + FA QSGPML+ +G ++PRI S+K R+GV + + V Sbjct: 125 AEAGVMETGAFLKKKPKANFATQSGPMLVIDGKLHPRIAKANVSAKPRDGVCVRGDKSVV 184 Query: 195 FLLSQQATNFYDFACYAKAKLNVEQLLYLD-GTISHMYMKGGAIPWQRYPFVTMISVERK 253 F +S F F + L L+LD GT +++ G + MI+V K Sbjct: 185 FAISDGGVPFDTFMRLFRDGLKCRNALFLDGGTAPALFVPGTRSGNVLFGLGPMIAVYEK 244 >UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Aurantimonadaceae RepID=Q0G184_9RHIZ Length = 268 Score = 253 bits (647), Expect = 3e-66, Method: Composition-based stats. 
Identities = 67/229 (29%), Positives = 114/229 (49%), Gaps = 5/229 (2%) Query: 28 LFAVAADDCALSDPT-LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 L A C ++ + V + + + G + T A + G+V Sbjct: 41 LPAGHEGICRIAMAGSVETILCEVPLSSFDLHLRALDDAGRPYETFEKAAASL--SGEVV 98 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +AMN G+Y E P+GL +++G+ L +G GNF +RP G+FY+ + + + + Sbjct: 99 LAMNAGMYHEDRRPVGLTVQDGRIVKKAVLGTGSGNFSLRPNGIFYLEDGRAFVRETERY 158 Query: 147 -KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF-LLSQQATNF 204 S + A QSGPML+ G ++PR P S +RNGVG+++ G VF L+++ NF Sbjct: 159 LGESHDPVLATQSGPMLLIGGKVHPRFIPTSDSLYVRNGVGVSEDGRTVFLALTRKPINF 218 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 YDFA + + + V+ L+ DG +S + + I ++R M+ V +K Sbjct: 219 YDFALFFRDTVGVKDALFFDGQVSSLSYRAANIAYRRDRLGPMLLVTKK 267 >UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter cryohalolentis K5 RepID=Q1QCK8_PSYCK Length = 276 Score = 252 bits (644), Expect = 7e-66, Method: Composition-based stats. 
Identities = 70/237 (29%), Positives = 118/237 (49%), Gaps = 8/237 (3%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANG-EAWGTLHALLADINSQ 82 T ++ + + + T +Q+ + + + ++WQ+++ + T LL+ + Sbjct: 38 TASTDWSCQSHNTPFAYSTCHIQSDLLTNKRYSLALFWQQSDSRQPLLTFDNLLSTLPPS 97 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-AGDKVGIV 141 ++ AMN G+Y+E+YAP+G + ++ ALNL G GNF + P GV + KV I Sbjct: 98 QSLKFAMNAGMYNENYAPIGYTVIKSEEIRALNLKEGGGNFHLLPNGVLWWDKSGKVQIT 157 Query: 142 RLDAFKTSKE-----IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 +A + +A QSGPML+ N I+P+ P+ S+KIRNG+G+ G+ F+ Sbjct: 158 ESNALAEQLKNGIAQPLYATQSGPMLVINDAIHPQFDPDGTSAKIRNGIGVCSDGSLQFV 217 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRYPFVTMISVER 252 S+ FY FA K +L L+LDG I S +Y ++ V + VE Sbjct: 218 NSEAPVAFYQFASLFKNELKCPNALFLDGGIASALYAPTIDKHDKKEMGVMIGLVES 274 >UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteria RepID=C5CWT4_VARPS Length = 238 Score = 252 bits (643), Expect = 1e-65, Method: Composition-based stats. Identities = 70/214 (32%), Positives = 122/214 (57%), Gaps = 6/214 (2%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESY 98 +P TV ++ + ER++++ +G + L A + ++ + + AMN G+Y + Sbjct: 25 EPRYTV--VKIDVRRERLELFLHDDSGAPFKRFDRLEAWLAARNRQLVFAMNAGMYHADF 82 Query: 99 APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKE--IQFAV 156 +P+GL ++ G+++ LNLA+G GNFF++P GVF V+ +V + + ++ A Sbjct: 83 SPVGLLVQEGREEAPLNLAAGAGNFFLKPNGVFLVSDAGPRVVESSEYAALPKEGVRLAT 142 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 QSGP+L+ GV++P P+ S KIRNGVG++ H A+F++S+Q NFY+FA Y + L+ Sbjct: 143 QSGPLLLRRGVVHPAFIPDSDSRKIRNGVGVSGH-TAIFVISEQPVNFYEFALYFRDVLH 201 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 LYLDGT+S ++ ++ V Sbjct: 202 CRDALYLDGTVSALHSLALRRSDFTRELGPILGV 235 >UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax=Rhizobiales RepID=Q1MEZ5_RHIL3 Length = 258 Score = 251 bits (641), Expect = 2e-65, Method: Composition-based stats. 
Identities = 60/205 (29%), Positives = 99/205 (48%), Gaps = 11/205 (5%) Query: 33 ADDCALSDPTLT---VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMA 88 A A + T+ P ++++W+ A+G + +L + ++G+ + A Sbjct: 25 AHAQACEQESFEEAKYVVCTLEPGKADLRLFWKNADGAPYRAFSSLAEAVRAEGRTLAFA 84 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVR 142 +N G+Y ++P+GLY+ENG++ N E NF+ +P GVF++ GI+ Sbjct: 85 VNAGMYRADFSPMGLYVENGRELNPANTTEAESSSGQVPNFYKKPNGVFFLGETGAGILP 144 Query: 143 LDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 D F K + +FA QSGPML+ +NP R+GVG + G F +S+ Sbjct: 145 TDEFLKRRPKARFATQSGPMLVIANKLNPIFIVGSTDRTRRSGVGTCERGAVRFAISEDR 204 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGT 226 NF+DFA + L L+LDG Sbjct: 205 VNFHDFARLFRDHLKCPDALFLDGG 229 >UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythrobacter RepID=Q2NAA1_ERYLH Length = 277 Score = 248 bits (634), Expect = 1e-64, Method: Composition-based stats. Identities = 61/190 (32%), Positives = 97/190 (51%), Gaps = 4/190 (2%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI 125 G + L A+N G++D P+G Y+E+ ++ ALN G GNF + Sbjct: 90 GPPHRSFAKLAEG--RSSAPVFAVNAGMFDGDGKPIGYYVEDSERLQALNTNDGAGNFHL 147 Query: 126 RPGGVFYVAGDKVGIVRLDAFKTS--KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRN 183 +P GVFY + + + ++F + QF QSGPML+ +G ++P I + S +IRN Sbjct: 148 KPNGVFYGSNGEWRVRTTESFLANVSDRPQFGTQSGPMLLIDGKLHPEISEDGPSRQIRN 207 Query: 184 GVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 GVG+++ G A F++S+ +F FA + + N LYLDG +S ++ R P Sbjct: 208 GVGVDRQGRAHFVISEGPISFGKFARFFRDVANTPNALYLDGNVSGLWDPANDRMDARAP 267 Query: 244 FVTMISVERK 253 MI VE + Sbjct: 268 IGPMIVVETR 277 >UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJ31_ACIJU Length = 252 Score = 246 bits (629), Expect = 4e-64, Method: Composition-based stats. 
Identities = 70/231 (30%), Positives = 121/231 (52%), Gaps = 6/231 (2%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN-GEAWGTLHALLADINSQG 83 +FA D V V+ + ++++ + G+ + + +D+ + Sbjct: 21 ATTVFAFEYQSIKFEDVQFEV--IKVDDLKD-LQLFLKNPRIGDFYQKFSNIQSDLAACK 77 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +++ AMN G+Y ++ P+GLYIE ++ LN ++G GNFF++P GV I Sbjct: 78 ELRFAMNAGMYHPNFEPVGLYIEKKKKLSELNESTGFGNFFMQPNGVVVWNDHGAVIHST 137 Query: 144 DAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT 202 +K + FA QSGPML+ G+IN + + S KIRNGVG+ + F++S+Q Sbjct: 138 ADYKRANFTANFATQSGPMLVHKGLINSQFIKDSNSLKIRNGVGVRDD-HLYFVISEQRI 196 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 NFY FA + K +L V++ LYLDG+IS +Y+K ++Y ++ + + Sbjct: 197 NFYQFAKFFKHQLRVDEALYLDGSISSLYLKDIQRNDRKYNLGPIVGLTHQ 247 >UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PYM1_9GAMM Length = 271 Score = 246 bits (628), Expect = 6e-64, Method: Composition-based stats. Identities = 69/233 (29%), Positives = 107/233 (45%), Gaps = 15/233 (6%) Query: 34 DDCALSDPTLTVQAYTVNPQ-TERVKMYWQKANGE------AWGTLHALLADINSQGQVQ 86 DC ++ + ++WQ + + TL L + Sbjct: 38 PDCQRKSQPFDYSICELDAKNAANFSLHWQNPSSASHPLLLTFTTLRDYLVSEQPAKTLL 97 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 AMN G+YD ++AP+G + NG+Q ALNL G GNF + P GVF+ I + Sbjct: 98 FAMNAGMYDSNFAPIGYTVINGKQIRALNLKQGGGNFHLMPNGVFWQDRQGFYITESQSM 157 Query: 147 KTS----KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG--NAVFLLSQQ 200 + FA QSGPML+ +G I+P N S K RNG+G+ H F++S Sbjct: 158 AKKLASGAKPTFATQSGPMLVIDGNIHPAFDANSTSRKYRNGIGVCGHNPSRVKFVISDT 217 Query: 201 ATNFYDFACYAKAKLNVEQLLYLD-GTISHMYMKGGAIPWQRYPFVTMISVER 252 +FY+FA K++L + L+LD G+ S +Y + + +Y MI+V + Sbjct: 218 PVSFYEFADLFKSQLGCDNALFLDGGSASALYSQTLSRNDNKY-MGVMIAVTQ 269 >UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychrobacter sp. 
PRwf-1 RepID=A5WGQ7_PSYWF Length = 309 Score = 241 bits (615), Expect = 2e-62, Method: Composition-based stats. Identities = 64/195 (32%), Positives = 99/195 (50%), Gaps = 10/195 (5%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI 105 QA PQ V Q + E L+ D+ +++ A N G+YD ++AP+G + Sbjct: 91 QAGENQPQAAIVD---QDKSHEPLYKFDTLIKDLPKDSELKFAANAGMYDGNFAPIGYTV 147 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKT-----SKEIQFAVQSG 159 G+Q ++LNL G GNF + P GV + + V I + +A QSG Sbjct: 148 IQGRQILSLNLKQGGGNFHLLPNGVLWWDKANHVHITESTQLDAMLKSGEAKPWYATQSG 207 Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQ 219 PML+ +G I+P+ + + S KIRNGVG+ F+ S++ NFY FA + K L+ + Sbjct: 208 PMLVIDGHIHPKFNSDSTSKKIRNGVGVCDGSQIHFVTSREPVNFYQFARFFKEDLHCDN 267 Query: 220 LLYLDGTI-SHMYMK 233 L+LDG + S +Y Sbjct: 268 ALFLDGGVASALYAP 282 >UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6Z2_9GAMM Length = 304 Score = 240 bits (613), Expect = 3e-62, Method: Composition-based stats. 
Identities = 78/215 (36%), Positives = 109/215 (50%), Gaps = 15/215 (6%) Query: 48 YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIE 106 Y +P V ++W+ A+G A+ L L + G +V MN GIY E+ P GL+IE Sbjct: 91 YQADPAQ--VSLHWKTADGSAYANLATLKRSLEQSGARVAFLMNAGIYSENDTPAGLWIE 148 Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMEN 165 GQ V LN +G+GNF I+P GVFY+ K I A+ + +AVQSGP+L+ + Sbjct: 149 RGQTLVPLNRKNGKGNFHIQPNGVFYIERGKARIQTSAAYHIGNHHPDWAVQSGPLLLLD 208 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQ 219 G NPR N++S RN V F+L++ +F+ FA A L Sbjct: 209 GKPNPRFVKNLSSPHKRNAVCTTADNRLYFILTEDYDLGSEWPSFHRFAE-ALQHLGCHD 267 Query: 220 LLYLDGTISHMYMKG--GAIPWQRYPFVTMISVER 252 LYLDGT+S Y+ G G W Y V +I+V Sbjct: 268 ALYLDGTLSGWYIPGIAGTFHWTHY--VGIIAVTT 300 >UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EW16_DICNV Length = 263 Score = 239 bits (610), Expect = 6e-62, Method: Composition-based stats. Identities = 79/262 (30%), Positives = 128/262 (48%), Gaps = 23/262 (8%) Query: 12 ITLNLKRIFLALTLLP-LFAVAADDCALSDP------TLTVQAY---TVNPQTERVKMYW 61 + + L++I + + L L AA Q+ P+ ++++ W Sbjct: 1 MLVALRKIIVPVILSSFLLETAAAHLDFKKVAGGNFARFHHQSVDYAVFMPEHDKIRFLW 60 Query: 62 QKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Q GE + T+H L + ++G QV MN GI++++ P GL+IE LN SG+ Sbjct: 61 QNDRGENYQTMHHALRALTNEGYQVHFLMNAGIFNQNAQPAGLWIEKKALLRPLNRRSGK 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRL-DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 GNF I+P GVFY+ +K I+ + +AVQSGP+L+ +G IN R+ N ++ Sbjct: 121 GNFHIQPNGVFYLTQEKAHIITTVQWHNNPPKADYAVQSGPLLIIDGAINSRLPKNHKAA 180 Query: 180 KIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 RN V ++K F+++ + N Y FA +A + +Q LYLDG++S Y+ Sbjct: 181 YKRNAVCVDKARRVYFVITTRYDDGAHFPNLYRFA-HALQTIGCQQALYLDGSLSDFYLP 239 Query: 234 --GGAIPWQRYPFVTMISVERK 253 WQ+ F MI+V K Sbjct: 240 MESSRFHWQK--FAGMIAVVSK 259 >UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 
RepID=C7PF78_CHIPD Length = 273 Score = 232 bits (591), Expect = 1e-59, Method: Composition-based stats. Identities = 71/217 (32%), Positives = 118/217 (54%), Gaps = 8/217 (3%) Query: 45 VQAYTVNPQTERVKMYWQKANGE-AWGTLHALLADI-NSQGQVQMAMNGGIYDESYAPLG 102 A VNP + ++W A+ + + ++ AL + + + M NGG++ ++ P+G Sbjct: 57 YDAIVVNPAVSDISLHWLSADQQTPYKSIQALQDVLLEKKKDILMITNGGMFMKNNIPVG 116 Query: 103 LYIENGQQKVALNLASG-EGNFFIRPGGVFYVAGDKVGIVRLDAF----KTSKEIQFAVQ 157 L+I G++ ++ A+ GNF+++P GVFY+ + + + +I A Q Sbjct: 117 LFISQGRELRPIDAATDQPGNFYMQPNGVFYLDHTGPHVSTTTDYLKRSRAHSKIVAATQ 176 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFYDFACYAKAKLN 216 SGPML+ G+IN + +P + +R+GVGI +GN VF++S++A T FYDFA KA+ Sbjct: 177 SGPMLVSKGIINAKFNPGSVNRNLRSGVGILSNGNVVFIISKEAQTTFYDFASIFKARFG 236 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + LYLDG IS MY+K F MI+V + Sbjct: 237 CKDALYLDGAISKMYLKNSRPGDLNGDFGAMIAVTAR 273 >UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetobacter RepID=B2HYZ5_ACIBC Length = 204 Score = 220 bits (561), Expect = 3e-56, Method: Composition-based stats. 
Identities = 61/205 (29%), Positives = 106/205 (51%), Gaps = 4/205 (1%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA-NGEAWG 70 + + + I + + A+A + + + T E+++++ + + + Sbjct: 1 MKILVLCI-VNFIIFTQSALALEYRQIRNTTDDQFEVIEISNLEQLRLFLKNPQTDQYYK 59 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + + + Q+ AMNGG++ ++P+GLYIENG++ LN G GNFF++P GV Sbjct: 60 SFDNIQYQLKACEQLTFAMNGGMFHSGFSPVGLYIENGRESQPLNEDKGWGNFFLQPNGV 119 Query: 131 FYVAGDKVGIVRLDAFKTS-KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 + I+ + +K + +A QSGPML+ NG INP N S KIRNGVG+ K Sbjct: 120 LAWNDKQAVILTTEQYKAKVFQPDYATQSGPMLVINGKINPLFLANSDSKKIRNGVGV-K 178 Query: 190 HGNAVFLLSQQATNFYDFACYAKAK 214 + F++S+ NFY FA + + K Sbjct: 179 NNKLYFVISKNRVNFYSFAQFFQKK 203 >UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744B4D Length = 235 Score = 212 bits (540), Expect = 9e-54, Method: Composition-based stats. Identities = 62/209 (29%), Positives = 104/209 (49%), Gaps = 12/209 (5%) Query: 55 ERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVA 113 R+ + W +G+ G+ LL + QG+ ++ A N GIY+ P GL I G++ V Sbjct: 28 SRLDLRWLGQDGKPLGSFGPLLQEAARQGRRIEFATNAGIYERGPKPCGLTIAGGKELVP 87 Query: 114 LNLASGEGNFFIRPGGVFYVAG-DKVGIVR-LDAFKTSKEIQFAVQSGPMLMENGVINPR 171 LNLA GEGNF++ P GVFY+ G++ + ++ + + A QSGP+L+ G I+P Sbjct: 88 LNLAKGEGNFYLHPNGVFYLDDQTGAGVMTGAEYGQSGLQPRLATQSGPILLRQGKIHPA 147 Query: 172 IHPNVASSKIRNGVGINK-HGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLD 224 + N + ++RN VG+ G VF++S + F+ + + L + L+LD Sbjct: 148 FNFNSPNRRLRNAVGVRASDGQVVFVMSDREDRVKGRVTFHQLSRFFL-HLGCQDALFLD 206 Query: 225 GTISH-MYMKGGAIPWQRYPFVTMISVER 252 G IS ++ F M + + Sbjct: 207 GDISDFLFHPPAGAAVTPNTFAGMFVLWK 235 >UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucella sp. 83/13 RepID=D1CZ42_9RHIZ Length = 248 Score = 190 bits (484), Expect = 3e-47, Method: Composition-based stats. 
Identities = 46/165 (27%), Positives = 81/165 (49%), Gaps = 10/165 (6%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLDAF-KT 148 ++PLGL+I +G+++ + A + NF+ +P G+F++ G++ + F K Sbjct: 82 AGFSPLGLFIADGKEQSPIQPAGAKTSDKPVPNFYKKPNGIFFLDESGAGLLPTEQFVKR 141 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 ++ A QSGPML+ +NP A R+GVG+ K G F++S A NF+DFA Sbjct: 142 RPKVWLATQSGPMLVIENRLNPIFIIGSADKSRRSGVGVCKDGVIHFVVSDDAVNFHDFA 201 Query: 209 CYAKAKLNVEQLLYLD-GTISHMYMKGGAIPWQ--RYPFVTMISV 250 + + +L L+LD G + +Y + M ++ Sbjct: 202 RFFRDRLECPNALFLDGGGGAGLYDPALGRNDMSWHGGYGPMFAL 246 >UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseiflexus RepID=A5USB9_ROSS1 Length = 282 Score = 173 bits (438), Expect = 5e-42, Method: Composition-based stats. Identities = 50/220 (22%), Positives = 98/220 (44%), Gaps = 23/220 (10%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 SDP + + A ++P T R+++ + + T + +A+NGG + Sbjct: 70 DSSDPPVPIYAVRLDPATIRLRIRYAPDAPQPLRTW-------FVAHRPLVAVNGGFFTA 122 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD-KVGI--VRLDAFKTSKEIQ 153 L + +G G + GG+ A D +V I +R + + + + Sbjct: 123 ENRATALIVSDGTVY---------GTSYAGFGGMLAAAPDGRVWIQALRDEPYDPNIPLD 173 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQATNFYDFACY-A 211 A+QS PML+ G + I+ N ++ R V I++ G + ++ A + + A + A Sbjct: 174 QAIQSFPMLIYPGGVVASINDNGQRAR-RTVVAIDRAGRVLLIVCPTSAFSLQELATWLA 232 Query: 212 KAKLNVEQLLYLD-GTISHMYMKGGAIPWQRYPFVTMISV 250 + + +++ L LD G+ S +++ GA+ WQ F + SV Sbjct: 233 SSDMEIDRALNLDGGSSSGIFVNAGAVRWQIDSFAALPSV 272 >UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella burnetii RepID=A9KDD2_COXBN Length = 255 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. 
Identities = 48/227 (21%), Positives = 88/227 (38%), Gaps = 22/227 (9%) Query: 26 LPLFAVAADDCALSDPTL--TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + V + S P L + A+ +NP+ + ++ A Sbjct: 36 MAYTVVTPAFSSESRPGLFTHLYAWKINPRQYHFNIV----TAKSLQQTALYAAQAAKIK 91 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+NGG + + PLGL I + + +L S G+F + ++ I Sbjct: 92 DTVLAINGGFFTPNLEPLGLRISDNKVLSSLKRISW--------WGIFMIKNNRAAITSP 143 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-- 201 ++ S EI FA+Q+GP L+ +G I P++ A R+ +G+ G+ + ++ Sbjct: 144 QNYRYSPEINFAIQAGPRLIIDGRI-PQLRGGSA---QRSALGVTPTGDIIIAITDNNLL 199 Query: 202 TNFYDFACYAKAKLNVEQLLYLD-GTISHMYMKGGAIPWQRYPFVTM 247 A + KL L LD GT S +++ Q + Sbjct: 200 LTATQLAILLQ-KLGCSNALNLDGGTSSQLFVHTNNFSLQIPSLRPV 245 >UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REU9_LEGLO Length = 260 Score = 143 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 51/200 (25%), Positives = 89/200 (44%), Gaps = 19/200 (9%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP 100 P V + V+ + ++ + K + +++ + +++NGG +D + P Sbjct: 56 PWSHVYVFRVDLKKNKLGLVNAKNLSLKYASVNQFAEHSKA----LLSINGGFFDHKFNP 111 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 LGL I NG+ + L S GVF++ +K I L F+ +I FA+QSGP Sbjct: 112 LGLRITNGKLENPLKRISW--------WGVFFIKNNKAYISSLRQFQYDNDIDFAIQSGP 163 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-ATNFYDFACYAKA-KLNVE 218 L+ N I P + P +A R+ +GI G + L++ A A ++ L+ Sbjct: 164 RLLVNRKI-PSLKPGIAE---RSALGITADGKIILLVTTNAAMTTNKLAHLLRSPPLSCM 219 Query: 219 QLLYLDGTISH-MYMKGGAI 237 + LDG S +Y G+ Sbjct: 220 DAINLDGGSSSQLYAHIGSF 239 >UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legionella RepID=Q5WVS5_LEGPL Length = 258 Score = 142 bits (359), Expect = 8e-33, Method: Composition-based stats. 
Identities = 52/212 (24%), Positives = 88/212 (41%), Gaps = 19/212 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L P + L +P + + ++ ++ + K + ++ + Sbjct: 40 LTPGIEYQDLEGGLLNPWSHIHVFRIDLNKNQMALVTAKNLAQKNASVDQFAEHSKA--- 96 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 +++NGG +D + PLGL I N +Q+ L S G+FYV +K I + Sbjct: 97 -LLSINGGFFDHEFNPLGLRINNKKQENPLKRISW--------WGIFYVKDNKPRITNIR 147 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-ATN 203 F I FA+QSGP L+ G I P + VA R +GI G + L++ A + Sbjct: 148 NFHYDSNIDFAIQSGPRLLIRGNI-PSLKAGVAD---RTALGITDDGKVIILVTTNAAMS 203 Query: 204 FYDFACYAKA-KLNVEQLLYLDGTISH-MYMK 233 A ++ L+ + LDG S +Y K Sbjct: 204 TRQLAQIMRSPPLSCSDAINLDGGSSSQLYAK 235 >UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B1E5_HERA2 Length = 272 Score = 94.2 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 83/227 (36%), Gaps = 26/227 (11%) Query: 33 ADDCALSDPTLTV---QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM 89 + + Q V+P R+++ + A+ G + A + + Sbjct: 57 EPGLEFREIGYDITNVQILRVDPAYFRLRVGYDVASP---GRVSEWAAALKP----VAVI 109 Query: 90 NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL---DAF 146 NGG +D L I +G G + GG+ V +R + Sbjct: 110 NGGYFDAQGRATALTIFDGVI---------NGTSYDGFGGMLAVDSADGWSLRSLREQPY 160 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFY 205 +++ + A+QS PML+ +G + + + ++ R+ V I++ G + ++ Sbjct: 161 DSTEVLNQALQSAPMLVVHGAAIEQPNDDGDRAR-RSVVAIDQTGRLLLMVCSWPSFTLT 219 Query: 206 DFACYA-KAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISV 250 D + + K L ++ L LDG S + + + V + V Sbjct: 220 DLSQWLVKQDLAIDAALNLDGGSSTGLVVASENRSFNLDSLVRVPQV 266 >UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chloroflexus RepID=A9WEC1_CHLAA Length = 265 Score = 81.1 bits (199), Expect = 4e-14, Method: Composition-based stats. 
Identities = 41/217 (18%), Positives = 85/217 (39%), Gaps = 24/217 (11%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 L P L VQ ++P R + + + L A + G V A+NGG +D+ Sbjct: 59 LEAPGLPVQVVRIDPAHVRFVVGYDPTSPLT------LSAWVARYGAVA-AINGGFFDQQ 111 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG---DKVGIVRLDAFKTSKEIQF 154 P+ L I N Q G ++ GG+F + + + + Sbjct: 112 GEPVALLISNQQVF---------GYSYVDQGGMFAIDEQGKPHLWSLADQPYD-GTPFVQ 161 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFYDFACY-AK 212 A+Q P+L+ + R+ + ++++G + +++ A + +++ + A Sbjct: 162 AIQGWPLLVRTNG-EAAYTDDDGQRARRSAIALDRNGYVLLIVAPGATFSLAEWSQFLAS 220 Query: 213 AKLNVEQLLYLD-GTISHMYMKGGAIPWQRYPFVTMI 248 A L++E + LD G+ S + + + F + Sbjct: 221 ADLDIEIAVNLDGGSSSGLIAQSDQGGVRVDSFTPLP 257 >UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6T1_CHRVI Length = 272 Score = 76.1 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 71/203 (34%), Gaps = 20/203 (9%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP 100 T+ + + R+ + + + + G + A+NGG + P Sbjct: 62 RTVRAHLALFDSRRYRLAVLDLGPD---LASASDWPEHTRAAGLLA-AVNGGFFHADGQP 117 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 LGL I G++ GV Y + + R F++S I VQSGP Sbjct: 118 LGLVIAGGERLNRFETVK-------LLSGVLYGDARGIHLERRARFQSSPGIDALVQSGP 170 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY-----AKAKL 215 L+E G + + S R + + + V ++ + A A A Sbjct: 171 YLVEQGRAVRGLSTHDVSR--RTFIATDWRRHWVLGATRDGLTLAELAEALATPGALAPW 228 Query: 216 NVEQLLYLDGTISH--MYMKGGA 236 VE+ L LDG S ++ G Sbjct: 229 PVERALNLDGGTSTGFLFDPGAG 251 >UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZJ8_9BACT Length = 251 Score = 74.2 bits (181), Expect = 4e-12, Method: Composition-based stats. 
Identities = 50/245 (20%), Positives = 85/245 (34%), Gaps = 41/245 (16%) Query: 16 LKRIFLALTLLPLF------AV---------AADDCALSDPTLTVQAYTVNPQTERVKMY 60 + R F+ L +L L A A + ++ + A V Sbjct: 1 MYRFFVCLLVLALTTQLASAAWVLKESADRPAPTELEFTERHVQGDAGDVTLWVVTF--- 57 Query: 61 WQKANGEAWGTLHALLADI----NSQGQVQMA-MNGGIYDESYAPLGLYIENGQQKVALN 115 A+ + S+ + +A +NGG + PLGL + G + L Sbjct: 58 --NPKACAFAVMDNPTGAFDLGTASEKRGALAGVNGGYFHPDRTPLGLVVRQGVEIHPLE 115 Query: 116 LASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN 175 A GV V + + R AFK S ++ A+Q+GP L+E P + Sbjct: 116 RAK-------LLSGVLSVMPTTITLQRTGAFKGSSAVREALQAGPFLIEKEKPIPGLEAT 168 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL-----NVEQLLYLDGTIS-H 229 ++ R V N G FL+ + T A + + + LDG S Sbjct: 169 KEAA--RTVVFQNAKGRCGFLICKS-TTLAGMADLLATSSIFPEGKIIRAMNLDGGTSTA 225 Query: 230 MYMKG 234 ++++G Sbjct: 226 LWVRG 230 >UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KSV8_9BACE Length = 390 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 38/181 (20%), Positives = 68/181 (37%), Gaps = 19/181 (10%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASG 119 + ++ + + A LA S +V A+NG + + P G+Y NG + Sbjct: 183 FLGTSSSYYYVSRDAALAYDKSGSRVLAAVNGDFFAKDGTPQGIYYRNGTCLKGTMTDN- 241 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRL--DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA 177 F F + +K I+ + + IQ AV LM NG + P+ V Sbjct: 242 VCTF-------FAITKNKRAIIGSYDEYDSYKENIQEAVGGRVRLMTNGNVLPQ---TVT 291 Query: 178 SSKIRNGVGINKHGNAVFLLSQQATNFYDFA-CYAK-----AKLNVEQLLYLDGTISHMY 231 + + R +G+ L++ +Y YA+ L + + LDG S + Sbjct: 292 ALEPRTAIGVTDDNVVYILVADGRNFWYSNGMRYAEMGAVMKALGAKNAINLDGGGSSTF 351 Query: 232 M 232 + Sbjct: 352 I 352 >UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N9W8_9BACT Length = 275 Score = 67.6 bits (164), Expect = 4e-10, Method: Composition-based stats. 
Identities = 31/193 (16%), Positives = 70/193 (36%), Gaps = 23/193 (11%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG-- 102 + + T +++ + +G + T+ +++ A+N G + P+G Sbjct: 61 ISVLRADLSTPGLRLGLAECDGGNYETVSHFGRRLDA----LAAVNAGFFAMKGNPMGVR 116 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKTSKEIQFAVQSGPM 161 +G+ A +L + F + + IV F + + AV + Sbjct: 117 YLKIDGKVLNA-DLGGDPERAY------FVLDQTGRPAIVGPADF-APERCRSAVYGNRL 168 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-----TNFYDFACYAKAKLN 216 L+++G + P + + R G++ + + ++ +A F + A K L Sbjct: 169 LLKDGKVPP--LGDDKARHPRTAAGLSGNTLLLVVIDGRARESAGVTFAELATLLKD-LG 225 Query: 217 VEQLLYLDGTISH 229 + LDG S Sbjct: 226 CTDAVNLDGGGSS 238 >UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lactobacillus rhamnosus RepID=C7TED9_LACRG Length = 1561 Score = 66.1 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 73/201 (36%), Gaps = 22/201 (10%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS----QGQVQMAMNGGIYD-ESYA 99 + ++P+ + N + + N+ QV A+N Y+ + A Sbjct: 140 YYSVALDPKNPNTTLLAGMPNDGTKPGMQTVRNQANAAISHGQQVVAAVNADYYNMATGA 199 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 PLG ++NG + + + E F I+ G + + ++Q AV G Sbjct: 200 PLGNVVKNGTEIYSA-PDTNEAFFGIKKDGTPMIG------TAATYQQRKGDLQQAV-GG 251 Query: 160 P-MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYA 211 P + +++G +N ++ VGI G F++ + DFA Sbjct: 252 PSIFVKDGKVNATQVAGSEGNEPCTAVGIKADGTVFFVVIDGRQAPLSTGISVGDFAKLM 311 Query: 212 KAKLNVEQLLYLDGTISHMYM 232 + L+LDG S ++ Sbjct: 312 IER-GAVNALFLDGGGSATFV 331 >UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNL7_AKKM8 Length = 249 Score = 64.6 bits (156), Expect = 3e-09, Method: Composition-based stats. 
Identities = 45/205 (21%), Positives = 77/205 (37%), Gaps = 29/205 (14%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEA-WGTLHALLADINSQGQVQMAMNGGIYDESY 98 L V + + T R+ + + + +G+L + + +NGG + Sbjct: 34 RDKLNVYFFRSD--THRLLVRDEGSVKTPRYGSLDKAM----RKSPCVAGVNGGFFSADA 87 Query: 99 --APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-----RLDAFKTSKE 151 PLGL +++G++ L G+F + GV Y G + G+ L + Sbjct: 88 GGTPLGLVVQDGKRLSPLAT----GSFAVS--GVVY-EGGRDGLTLVRSSVLRRMRRLPA 140 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY- 210 +Q A+Q GP L+ENG + N S R + + +S + A + Sbjct: 141 MQAAIQGGPFLVENGSAVKGL--NAQKSTYRTFIATDGGRRWCIGVSSS-LTLKELAAWL 197 Query: 211 ----AKAKLNVEQLLYLDGTISHMY 231 A VE L LDG S + Sbjct: 198 AAPGALGNFRVETALNLDGGSSSAF 222 >UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744905 Length = 251 Score = 63.4 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 59/153 (38%), Gaps = 16/153 (10%) Query: 90 NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 NGG + + PLGL + +G + +S G GVF V + +V D + Sbjct: 82 NGGYFTPDFLPLGLEVSDGVRSGTFQRSSLLG-------GVFLVRHGRPAMVWKDEYIEQ 134 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFAC 209 K + +Q+GP L+ G+ + + R + ++ GN + + + Sbjct: 135 KGVTQLLQAGPRLVHAGLPVAGLEA--TKRRARTFILTDQAGNWALGTCKS-VTLRELSD 191 Query: 210 YAKAK-----LNVEQLLYLDGTIS-HMYMKGGA 236 + + V++ L DG S ++ + Sbjct: 192 LLSTRALLPEVTVKRALNFDGGNSTGLWWRAEG 224 >UniRef50_C6J074 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J074_9BACL Length = 406 Score = 63.0 bits (152), Expect = 1e-08, Method: Composition-based stats. 
Identities = 41/215 (19%), Positives = 84/215 (39%), Gaps = 36/215 (16%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE--SY 98 + + Q T++ +V++ A G+A G + L + + + + +A+NG ++ Sbjct: 190 RSFSTQMVTISLMDPKVRLKVALA-GDAVGKVEEL-SSLAKRHKAVVAINGTFFNAYTDN 247 Query: 99 A---PLGLYIENGQQKVALNLASGEGNFFIRPGGVF-YVAGDKVGIVRLDAFKTSKEI-- 152 A P G + G+ K+ + +F Y + ++ D F + Sbjct: 248 AYKAPYGYIVSGGELKMKASGDKRT---------IFTYDSNLLARLIPGDDFNDAFNAGT 298 Query: 153 -QFAVQSGPMLMENGVI----------NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 + A+Q+GP L+ NG + +P+I + R+ +G+ + + L+ Sbjct: 299 MEGALQAGPRLVVNGKVAVDVKAEGFKDPKILTGGGA---RSALGLTRDHKLIL-LTTGG 354 Query: 202 TNFYDFACYAKAKLNVEQLLYLD-GTISHMYMKGG 235 A K + Q + LD G S +Y G Sbjct: 355 ATIPQLAEIMK-QAGAYQAMNLDGGASSGLYYNGK 388 >UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-layer protein n=2 Tax=Lactobacillus rhamnosus RepID=C2JZN3_LACRH Length = 559 Score = 62.6 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 72/202 (35%), Gaps = 25/202 (12%) Query: 45 VQAYTVNPQTERVKMYWQKA-NGEAWGT---LHALLADINSQGQVQMAMNGGIYD-ESYA 99 + +NP+ ++ +G G A I + QV A+NG ++ S Sbjct: 147 YYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASAAIKNGHQVVAAVNGDLFKIASGV 206 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA--FKTSKEIQFAVQ 157 P G I++G + A A F + D I+ + K ++Q A+ Sbjct: 207 PTGNVIKDGVELHAATSARES---------FFGIKKDGTPIIGDEQTYQKVKGDLQQALG 257 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL-------SQQATNFYDFACY 210 +L+ +G +N S+ R VGI G F++ + + D A Sbjct: 258 GRNILVADGKVN-ETKAIGTDSEPRTAVGIKADGTVFFVVVDGRQAPTSNGLSMVDLANL 316 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 + L LDG S Y+ Sbjct: 317 MIQR-GAVTALNLDGGGSSTYV 337 >UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein (Fragment) n=1 Tax=Lactobacillus rhamnosus HN001 RepID=B2KU41_LACRH Length = 470 Score = 62.6 bits (151), Expect = 1e-08, Method: Composition-based stats. 
Identities = 42/202 (20%), Positives = 72/202 (35%), Gaps = 25/202 (12%) Query: 45 VQAYTVNPQTERVKMYWQKA-NGEAWGT---LHALLADINSQGQVQMAMNGGIYD-ESYA 99 + +NP+ ++ +G G A I + QV A+NG ++ S Sbjct: 147 YYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASAAIKNGHQVVAAVNGDLFKIASGV 206 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA--FKTSKEIQFAVQ 157 P G I++G + A A F + D I+ + K ++Q A+ Sbjct: 207 PTGNVIKDGVELHAATSARES---------FFGIKKDGTPIIGDEQTYQKVKGDLQQALG 257 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL-------SQQATNFYDFACY 210 +L+ +G +N S+ R VGI G F++ + + D A Sbjct: 258 GRNILVADGKVN-ETKAIGTDSEPRTAVGIKADGTVFFVVVDGRQAPTSNGLSMVDLANL 316 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 + L LDG S Y+ Sbjct: 317 MIQR-GAVTALNLDGGGSSTYV 337 >UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=Bacteroides RepID=C3QHD0_9BACE Length = 311 Score = 62.6 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 63/178 (35%), Gaps = 19/178 (10%) Query: 76 LADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 LA +V A+NG + P G+Y NG + F F V Sbjct: 120 LAHDKQGSRVLAAVNGDFFATDGTPQGIYYRNGVCLKNTMTDN-VCTF-------FAVTK 171 Query: 136 DKVGIVRL--DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA 193 K ++ + EIQ AV LM NG + P+ + + + R +G+ + Sbjct: 172 GKKAVIGSYDEYDTYKDEIQEAVGGRVRLMTNGNVLPQ---TLTALEPRTAIGVTDNNVV 228 Query: 194 VFLLSQQATNFYDFA-CYAK-----AKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 L++ +Y YA+ L + + LDG S ++ ++ F Sbjct: 229 YILVADGRNFWYSNGMRYAEMGAVMKALGAKDAINLDGGGSSTFIIRSKAGFEENRFA 286 >UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=Rhizobium etli 8C-3 RepID=UPI00019038D8 Length = 91 Score = 60.7 bits (146), Expect = 4e-08, Method: Composition-based stats. 
Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 4/68 (5%) Query: 33 ADDCALSDPTLT---VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMA 88 A T T+ ++++W+ A+GE + +L + ++G+ + A Sbjct: 24 ARAQQCGQETFDEAKYVVCTLEVGKVDLRLFWKGADGEPYRAFSSLADAVRAEGRKLIFA 83 Query: 89 MNGGIYDE 96 +N G+Y Sbjct: 84 VNAGMYRA 91 >UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A82C Length = 377 Score = 60.7 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 66/180 (36%), Gaps = 34/180 (18%) Query: 76 LADINSQGQVQMAMNGGIYDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 L I + +A+NG +D + P G + G + + + Sbjct: 194 LRSIAKRSNAVVAINGTFFDAYTSGAYKAPYGYLVSKGNIFHKASGDNRT---------I 244 Query: 131 F-YVAGDKVGIVRLDAFK---TSKEIQFAVQSGPMLMENGVI----------NPRIHPNV 176 F Y + + ++ FK + ++ A+Q+GP L+ NG + +P+I Sbjct: 245 FTYDSNNLATMMPGLDFKSVYETGRMEGALQAGPRLLTNGKVTLDVKKEGFKDPKILTGG 304 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD-GTISHMYMKGG 235 + R+ +GI K + L+ A K + Q + LD G S +Y G Sbjct: 305 GA---RSALGITKDHKLIL-LTTGGATIPQLAEIMK-QAGAYQAMNLDGGASSGLYYNGS 359 >UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ABL2_GEMAT Length = 311 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. 
Identities = 38/220 (17%), Positives = 81/220 (36%), Gaps = 35/220 (15%) Query: 30 AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM 89 A + TV ++P + + +G+A A + N+ +A+ Sbjct: 69 AEWPVQLGARGISTTVIVVDIDPARIALTLEIA-RDGDAL----APWSLDNAPKDAVIAL 123 Query: 90 NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 N G + + P G + ++ A + F I + I+R D + Sbjct: 124 NAGQFTDDG-PWGWVVHRQREWQAPGVGPLSAAFVID-------TAGRAAILRADEIAEA 175 Query: 150 KEI---QFAVQSGPMLMENGVINPRI----HPNVASSKIRNGVGINKHGNAVFLLSQ--- 199 + + A+QS P+++ +G + P + ++ IR +G+ G+ + L++ Sbjct: 176 RRRGGWEEALQSFPLILNDGALPPGLCAPGAVDLEHRDIRLTLGVLPDGHVLLALTRYAG 235 Query: 200 ----------QATNFYDFACYAKAKLNVEQLLYLDGTISH 229 T + A + +L V + + LDG +S Sbjct: 236 VGSAGNRLPIGPTT-GEMATIMR-ELGVARAVMLDGGLSA 273 >UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY15_CLONN Length = 436 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 45/228 (19%), Positives = 77/228 (33%), Gaps = 37/228 (16%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + V P ++++ + + + + + ++I + A+N G + + Sbjct: 200 DIKTNRFNGKMLIV-PNSKKIVIGFNEESP---SKVGKTTSEIAKENNAICAINAGGFTD 255 Query: 97 S------------------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK- 137 P G+ I NG+ N G N I G + K Sbjct: 256 DVSGKSAEVVLNPDSGYETRKPCGILIHNGEFVY--NDDKGRKNEKIDIVG--FSKRGKL 311 Query: 138 -VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 VG L+ K + I+ AV GP L+ +G + R +G + G+ +FL Sbjct: 312 IVGKYTLEELK-NINIKEAVSFGPALIVDGNPVNILGDGGWGVAPRTAIGQRRDGSVLFL 370 Query: 197 LSQQATNFYDFACYAKA------KLNVEQLLYLD-GTISHMYMKGGAI 237 + F K K LD GT+S MY K I Sbjct: 371 VIDGR-GFKSMGATIKDVQDIMKKYGAVNASNLDGGTVSTMYYKDKVI 417 >UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alicyclobacillus acidocaldarius RepID=B7DMS1_9BACL Length = 354 Score = 58.4 bits (140), Expect = 2e-07, Method: Composition-based stats. 
Identities = 43/172 (25%), Positives = 77/172 (44%), Gaps = 23/172 (13%) Query: 95 DESYAPL---GLYIE--NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 P+ G IE G+ K + G+ I V + +K F Sbjct: 193 HPGITPIPPDGYDIEIGAGEAKTPIVTRVHVGDPAILTDTVLALPSEKPV-----PFAAY 247 Query: 150 KEIQFAVQSGPMLMENGVIN--PRIH----PNVASSK-IRNGVGINKHGNAVFLLSQQAT 202 A+ +GPML++NG I+ P + P++ +++ +R+ VGI++ G+ +FL +A Sbjct: 248 PN---AIGAGPMLVQNGRIDVEPSLEGLDEPDILNAETLRSVVGIDRAGHLIFLTIHEA- 303 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 N + A AKA L + + LDG S ++ +G + + T I V ++ Sbjct: 304 NVWQEASIAKA-LGLWDAMNLDGGSSVGLWYEGRYLTPPKRALATAIVVVQR 354 >UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Clostridium tetani RepID=Q892K3_CLOTE Length = 708 Score = 57.6 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 42/169 (24%), Positives = 61/169 (36%), Gaps = 25/169 (14%) Query: 76 LADINSQGQVQMAMNGGIY-DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 A I S V +NG Y + P+G+ +NG+ A + A NFF GV Sbjct: 119 KASIASGDNVVGGVNGDFYYTVTGEPIGIVYKNGKAVKANHAAEW--NFF----GVL--- 169 Query: 135 GDKVGIVRLD--AFKTSKEIQFAVQSGPMLMENGVIN--PRIHPNVASSKIRNGVGINKH 190 D I+ + +Q A+ +L+ G I P I + R VGI K Sbjct: 170 EDGTPIIGDGKKYNEVKDSLQEALGGNAILVREGRIYQTPSI---GGYREPRTAVGIKKD 226 Query: 191 GNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 G F+ + D A L + L LDG S ++ Sbjct: 227 GTIFFVTVDGRQEGHSAGISMPDLAQLMID-LGAVEALNLDGGGSSTFV 274 >UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtilis ortholog n=6 Tax=Clostridium RepID=Q97FU3_CLOAB Length = 354 Score = 56.8 bits (136), Expect = 6e-07, Method: Composition-based stats. 
Identities = 38/194 (19%), Positives = 62/194 (31%), Gaps = 31/194 (15%) Query: 77 ADINSQGQVQMAMNGGIYDESYA------------PLGLYIENGQQKVALNLAS---GEG 121 ++I A+NGG + E+ + P G+ I +G+ N +G Sbjct: 155 SEIAKHNNALAAVNGGGFQENSSGSKVVWTGTGALPTGIIISDGKVVYPKNPDQLSIQKG 214 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG--VINPRI--HPNVA 177 I GV V + + ++ + A+ GP L+ NG + Sbjct: 215 TAAITKSGVLVVGDHSIREL------LNENVVEAINFGPTLIVNGVDQTRDSFGNSIDSQ 268 Query: 178 SSKIRNGVGINKHGNAVFLLSQQATNFYDFACY-----AKAKLNVEQLLYLDGTIS-HMY 231 ++ R +G K G + L A + N + LDG S MY Sbjct: 269 GAQPRTAIGQRKDGAILLLTVDGRQGLQMGATIKDIQKIMEQENAYNAVNLDGGASTTMY 328 Query: 232 MKGGAIPWQRYPFV 245 G I F Sbjct: 329 YNGHVINNPCDKFG 342 >UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT14_PEDHD Length = 303 Score = 56.8 bits (136), Expect = 6e-07, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 68/206 (33%), Gaps = 27/206 (13%) Query: 45 VQAYTVNPQTERVKMYWQKANGEA-WGTLHALLAD---INSQGQVQMAMNGGIY-DESYA 99 + ++ + VK+ +G+ + +V +NG + SY Sbjct: 72 IFILKIDLKNPDVKLQAATPYDAPGYGSQTVPEMAKYVDAANNRVIAGINGDFFNTSSYV 131 Query: 100 PLGLYIENGQQKVAL------NLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ 153 PLG+ + G G I G Y+ D +++ Sbjct: 132 PLGIIYKKGVAIKPAFTDNTDKPQQGLSFLGILANGKPYIGDK-----ETDYPTIKSQLK 186 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 A+ +G L+++ +I ++ + R GVGI F++ N+ + Sbjct: 187 EALGAGVFLVKD---YKKITQSIPTVDPRTGVGITDDDLVYFIVVDGRNFYNSNGINYQE 243 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYM 232 A V+ + LDG S +M Sbjct: 244 MGKIMYA-FGVKNAVNLDGGGSSTFM 268 >UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3R3L4_9BACE Length = 431 Score = 54.9 bits (131), Expect = 2e-06, Method: Composition-based stats. 
Identities = 32/195 (16%), Positives = 63/195 (32%), Gaps = 55/195 (28%) Query: 89 MNGGIYDESYAPLGLYIENGQQKVA----LNLASGEGNFFIRP---------GGVFYV-- 133 MNGG + + A + L N ++ + G N P G F V Sbjct: 211 MNGGYFASNGATVSLLYRNNVMLAPNLQSMSRSDGTSNVAFYPTRSAFGEIENGKFEVNW 270 Query: 134 ---------------AGDKVGI--VRLDAFKTSK-----EIQFAVQSGPMLMENGV---- 167 + +K G+ +++ + + + + A+ GP+L++NG+ Sbjct: 271 VYTVSSGQTYAYPAPSPNKSGVSPMQIPSVNYPEGASIWKAKNAIGGGPVLLKNGLYKNT 330 Query: 168 -----INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA--------TNFYDFACYAKAK 214 + S+ R+ +GI +F + + + A + Sbjct: 331 WEAELFDTASGIGPTSNNPRSAIGITGDNRLIFFVCEGRNKTPNVPGFTLEEVAYILRD- 389 Query: 215 LNVEQLLYLDGTISH 229 L + LDG S Sbjct: 390 LGCLDAMNLDGGGSS 404 >UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=Bacteria RepID=C6D6X3_PAESJ Length = 344 Score = 54.9 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 40/176 (22%), Positives = 60/176 (34%), Gaps = 27/176 (15%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 + I S A+NG Y + G+ I NG G F + D Sbjct: 154 STIASNNNAVFAINGDYY--GFRSDGVVIRNGTVYRDEPARIGLAMF----NDGTMKSYD 207 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI-----NPRIHPNVASSKI-----RNGVG 186 + D F+ GP L+ +G I + I N + I R G+G Sbjct: 208 EEETSTDDLLAQGVTNAFSF--GPALVTDGEIAGDFSHVEIDKNFGNRSIQNSNPRTGIG 265 Query: 187 INKHGNAVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLD-GTISHMYMKG 234 + + VF++ + Y +FA K +L + LD G S MY G Sbjct: 266 MISANHYVFVVVDGRSTGYSRGMTLTEFADLFK-ELGATEAYNLDGGGSSTMYFMG 320 >UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=4 Tax=Alicyclobacillus acidocaldarius RepID=C8WTH1_ALIAD Length = 352 Score = 54.5 bits (130), Expect = 3e-06, Method: Composition-based stats. 
Identities = 45/222 (20%), Positives = 77/222 (34%), Gaps = 35/222 (15%) Query: 38 LSDPTLTVQAYTV-NPQTERV-KMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 L +PT V +P+ RV + GE ++ + G + GG D Sbjct: 123 LHEPTFNAFILLVKDPKRIRVVATKYLHVRGET------VMQMVQDSGAIAGINAGGFVD 176 Query: 96 ESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK--T 148 ++ P G+ I +G+ S +P V + I + Sbjct: 177 TNWQGTGAYPQGITITDGKLVSMTGSPS-------QPQPVIAFTKEGQMIAGTYSLNQLR 229 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 S ++ V GP+L+ENG P + + R +G K G + L++ Sbjct: 230 SLDVWQCVGFGPVLVENGK--PTVSAENYAVNPRTAIGQTKDGTVILLVTDGRYATGPND 287 Query: 203 ---NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 +F D A + + + LDG S ++ G W R Sbjct: 288 VGASFADVARIML-QFHADIAANLDGGSSATFVYKG-RMWNR 327 >UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73Q09_TREDE Length = 293 Score = 54.5 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 75/208 (36%), Gaps = 28/208 (13%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI--NSQGQVQMAMNGGIYDE 96 D L + A ++ ++K+ + + + + +A+N ++ Sbjct: 62 EDYPLIIHAVKIDLTNPKLKIVVTEPALFNSKGMVKRETTLSFARRHNTVIALNAAFFNV 121 Query: 97 -------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KT 148 PLG++I+ +NL+ F + G + ++ + I+ Sbjct: 122 ISFSFSLRGEPLGIHIDKK-----INLSKP----FPKYGALCFLDDNSAFIIESQNTEDI 172 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN----- 203 +I++AV SG ++ P I R VG+ G ++L + N Sbjct: 173 KADIEYAV-SGNRIILKNG-KPIITNISKKENSRTCVGLADGGKTLYLFFAEGENKKKSR 230 Query: 204 --FYDFACYAKAKLNVEQLLYLDGTISH 229 YD A + KL + ++LDG S Sbjct: 231 GITYDQAHFFMKKLGAQDAIHLDGGGSS 258 >UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0G9_BACOV Length = 621 Score = 53.8 bits (128), Expect = 5e-06, Method: Composition-based stats. 
Identities = 35/172 (20%), Positives = 60/172 (34%), Gaps = 20/172 (11%) Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 I + A+N G Y S P + + KVA + S + GV + + Sbjct: 108 IAKDKKALFAIN-GSYSISGNPSTFTMVDKVVKVASTIESAS-----KVNGVIAIDAEGS 161 Query: 139 ----GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV--ASSKIRNGVGINKHGN 192 D E + A+ SGPML+ G + + R+ +GI G Sbjct: 162 VDVKSCTFSDYTDVEDEYESALASGPMLLMEGKVC-SFPQDAIYTQRMARSVIGITAQGK 220 Query: 193 AVFLLSQQATN------FYDFACYAKAKLNVEQLLYL-DGTISHMYMKGGAI 237 + L A + A + L ++ + L DG+ S ++ G + Sbjct: 221 MMLLTIDGAITGNADGATLEEAAFIAKTLGMKNAVCLADGSSSTLWTSGKGV 272 >UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J8B3_NOSP7 Length = 276 Score = 53.4 bits (127), Expect = 7e-06, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 63/193 (32%), Gaps = 34/193 (17%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYI-----------ENGQQKVALNLASGEGNFFI 125 + + + +N G +D + Y+ EN + NL S F Sbjct: 62 EEFAQKHRAVAILNAGFFDPANQKTTSYVILQRKLVADPKENERLVNNPNLKSYLSQIFN 121 Query: 126 RPGGVFYVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM----------ENGVINPRI 172 R Y G V ++ + ++ A+ +GP L+ + N R Sbjct: 122 RTEFRRYSCGQTVRYDIVLHSASQPAGCQLVDAIGAGPSLLPELTLEKEGFVDNA-NKRD 180 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLD- 224 R VGI G+ V ++ Q + A + K L ++ + LD Sbjct: 181 ALGSNQPNARTAVGITHDGSVVLVMVAQKPSAPANGISLPALANFMK-TLGADKAMNLDG 239 Query: 225 GTISHMYMKGGAI 237 G+ S +Y G Sbjct: 240 GSSSSLYYNGKTF 252 >UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XS52_9DEIN Length = 294 Score = 53.0 bits (126), Expect = 9e-06, Method: Composition-based stats. 
Identities = 37/194 (19%), Positives = 70/194 (36%), Gaps = 23/194 (11%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPL 101 + V A VN V + G +L + + ++ A+NGG + ++ P Sbjct: 80 VPVLAVHVNLAHPEVSIRSLLPPPGV-GRGGEVLQRLAWRTRLVAAINGGYFHPRTFWPA 138 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK-VGIVRLDAFKTSKEIQFAVQSGP 160 G + G Q V ++ + + DK ++ E A +GP Sbjct: 139 GDLVVGGHQLVKGSIQTA-----------LAITPDKRARVMVGPQTWRGYETVIA--NGP 185 Query: 161 MLMENGVIN--PRI----HPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK 214 ++ G + PR P + R+ VG+ +F+ ++ + AK Sbjct: 186 YILRRGRLVVTPRAEGYNDPAIWGRARRSAVGVVNERYLIFVSTKMELTLSELGKVM-AK 244 Query: 215 LNVEQLLYLDGTIS 228 L ++ + LDG S Sbjct: 245 LGAKEAIVLDGGSS 258 >UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNM8_9SPIO Length = 306 Score = 53.0 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 58/173 (33%), Gaps = 22/173 (12%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI 125 GE I + N ++ +G++I + ++ N G F+ Sbjct: 113 GETTRDFALRHNTIAAFNAAPFKTNSLLFSIYRTIVGIHITDFRRMSMPNERYGALLFY- 171 Query: 126 RPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNG 184 I+ S ++++AV ++ NG I P+ + R Sbjct: 172 --------KDKTARIIGSQTEDALSADVRYAVGGFWTILRNGTIVPQ---KLHRRDSRTA 220 Query: 185 VGINKHGNAVFLLS--------QQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 VG+ G +F+ + + +F + A + L + L LDG S Sbjct: 221 VGLADSGKTLFVAAVEGENKRKSRGLSFEETAMLMQ-TLGADDALQLDGGSSS 272 >UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellaceae RepID=C9KQW2_9FIRM Length = 503 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. 
Identities = 21/91 (23%), Positives = 33/91 (36%), Gaps = 13/91 (14%) Query: 151 EIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQA--- 201 + F + GP L+ENG ++ ++ R+ VGI K G + + Sbjct: 385 NMDFIIGCGPRLVENGRVHVTVDEEDFPADIRIGRAPRSAVGITKDGRYLLAVVDGRQSH 444 Query: 202 ---TNFYDFACYAKAKLNVEQLLYLDGTISH 229 D+A K + L LDG S Sbjct: 445 SVGLTLTDWAKL-LVKFGAQDALNLDGGGSS 474 >UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YKN4_ANASP Length = 245 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 64 ANGEAWGTLHALLADIN-----SQGQVQMAM-NGGIYDE-SYAPLGLYIENGQQKV-ALN 115 + AL A ++ +Q A+ N G +D + + GQ + Sbjct: 8 PANSPFVVTGALSAKVSTVEEFAQKHRAFAIFNAGFFDPANQKSTSYVVVTGQMVADPKD 67 Query: 116 LASGEGNFFIRP--GGVF-------YVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM 163 N ++P +F Y+ G + ++ + + A+ +GP L+ Sbjct: 68 NERLVNNPQLKPYLNLIFNRSEFRRYLCGQTTRYDITLHNESPPANCRLVDAIGAGPRLL 127 Query: 164 ENGVINPR-IHPN--------VASSKIRNGVGINKHGNAVFLL--------SQQATNFYD 206 P N R VGI G+ + ++ + Sbjct: 128 PKLTSVPEGFVDNAKGRDALLSKQLNARTAVGITSEGSIILVMVAQKPSKPKNSGISLVQ 187 Query: 207 FACYAKAKLNVEQLLYLD-GTISHMYMKGGAI 237 A K KL + LD G+ S +Y G A Sbjct: 188 LADLMK-KLGASAAMNLDGGSSSSLYYNGKAF 218 >UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74396_SYNY3 Length = 610 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. 
Identities = 26/129 (20%), Positives = 48/129 (37%), Gaps = 21/129 (16%) Query: 127 PGGVFYVAG--DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-------- 176 P GV V + G +AF A GP+L++ G + ++ Sbjct: 471 PAGVLAVGTTLNVNGRSTPEAFNAFPNGMGA---GPLLIDQGRMV--LNATGEGFSSAFQ 525 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMY 231 R+ + ++++GN + + S + +FA + +L L LDG S Sbjct: 526 QQRASRSAIAVDRNGNIILVASHNRVGGAGASLGEFAQILQ-QLGAVNALNLDGGSSTSL 584 Query: 232 MKGGAIPWQ 240 GG + + Sbjct: 585 ALGGQLLDR 593 >UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B2F Length = 325 Score = 52.2 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 77/216 (35%), Gaps = 34/216 (15%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 TV +NP + + ++ +G A T LA + A+ D + PL Sbjct: 91 RETVNVIEINPANYQFQTSFK--DGFALTTAKERLATERA----AFAITANFRDPAGKPL 144 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF-AVQSGP 160 GL + G Q+ G F+V K F+ + + A Q P Sbjct: 145 GLVVHEGTQRNP--TFPAWT-------GYFFVKAGKPWFGPKSLFEETPGVLQEASQGYP 195 Query: 161 MLMENGVINPRIH---------PNVASSKIRNGVGINKHGNAVFLLSQQA--TNFYDFAC 209 LM+N + + R G+ ++GN VF+LS N + Sbjct: 196 SLMKN---HTVFSYVDLPSTRYFDGNRVTYRALAGMKQNGNIVFILSGTGGVMNVSEVTA 252 Query: 210 YAKAKLNVEQLLYLDGTIS---HMYMKGGAIPWQRY 242 A+ +LNV+ LDG + + + G A + + Sbjct: 253 LAQ-RLNVQHATLLDGGRALQYSLKLHGAARHFTAF 287 >UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 Tax=Micrococcineae RepID=D2NR45_9MICC Length = 356 Score = 52.2 bits (124), Expect = 2e-05, Method: Composition-based stats. 
Identities = 38/184 (20%), Positives = 63/184 (34%), Gaps = 28/184 (15%) Query: 64 ANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNF 123 AN + + +++ S+ A+NG Y + G+ I NG G F Sbjct: 151 ANNKFGQNIIDTPSNMASEHNGIWAINGDYY--GFRTTGIVIRNGVVYRDSGAREGLA-F 207 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS-GPMLMENGVINPRI---------- 172 + R G V +A E + S GP L+++ I I Sbjct: 208 Y-RDGSVKLYDE-----TATNAQTLVSEGVWNTLSFGPALVKDSAIVDGIDSVEVDTNFG 261 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDG 225 + ++ ++ R GVG+ + VF++ +FA K L LDG Sbjct: 262 NHSIQGNQPRTGVGVLGTNHLVFIVVDGRSTNYSRGVTMPEFAQMFKD-LGCVSAYNLDG 320 Query: 226 TISH 229 S Sbjct: 321 GGSS 324 >UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VRM0_9FIRM Length = 361 Score = 52.2 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 32/134 (23%), Positives = 48/134 (35%), Gaps = 26/134 (19%) Query: 112 VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV-INP 170 V L L +GN + G + ++ E+ A GPML++NG + Sbjct: 232 VELVLVDSKGNETFKYNG------------QDISYSKVTELVAA---GPMLLQNGKNVVA 276 Query: 171 RIHPNVASSKI------RNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 N KI R+ +GI K+G + L N A L + LD Sbjct: 277 ESKNNYKEGKINSATGQRSAIGITKNGKVILL--TAVANVDKLA-LIMNDLGCIDAMNLD 333 Query: 225 GTISH-MYMKGGAI 237 G S ++ G I Sbjct: 334 GGASSALFANGKVI 347 >UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57_ANASP Length = 660 Score = 51.8 bits (123), Expect = 2e-05, Method: Composition-based stats. 
Identities = 38/229 (16%), Positives = 71/229 (31%), Gaps = 49/229 (21%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQV---QMAMNGG---------------I 93 P R + W G+ + +L + + + +A+N G + Sbjct: 411 PILNRGAIAW-NDAGQFYFGRLSLQETLATSSNLRVPILALNSGYVQNGIARYTPAWGKM 469 Query: 94 YDE--SYAPLGLYIENGQQKVAL-NLASGEGNFFIRPGGVFYVAGDKVGIVRLD-AFKTS 149 Y + + ++N + +G+ NF I G V T Sbjct: 470 YTPLTDNERI-VIVQNNKITNQFPGNKAGQTNFPIPNNGYLLTLRGNATTVASQLPVGTD 528 Query: 150 KEIQFAVQ------------SGPMLMENGVIN-----PRI-HPNVASSKIRNGVGINKHG 191 +I A +GP+L++N I + + +A +R+G+ + Sbjct: 529 VQITSATTPGEFNRYPHIIGAGPLLLQNSQIVLDAKSEQFSNAFIAERAVRSGICTTANN 588 Query: 192 NAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKG 234 + N + A K L L LDG S +Y+ G Sbjct: 589 TLLIAAVHNRAGGPGPNLAEHAQLMKL-LGCVNALNLDGGSSTSLYLSG 636 >UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacteroides RepID=Q8A0T0_BACTN Length = 308 Score = 51.8 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 65/180 (36%), Gaps = 30/180 (16%) Query: 76 LADINSQGQVQMAMNGGIYDES------YAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 ++ I + Q A+NG +D + + +G + + L L G + + G Sbjct: 109 ISRIARKHQAIGAINGSYFDMTKGNSVCFLKVGSQVVDTTSLDELKLRV-TGAVYEKKGK 167 Query: 130 VFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG------VINPRIHPNVASSKIRN 183 V + D+ +K +K A SGP+++++G N R+ Sbjct: 168 VKLIPWDR---QIEKNYKKNKGSVLA--SGPLMLKDGEYYDWSQCNANFIET---KHPRS 219 Query: 184 GVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGG 235 + + + G +F+ N + A L + L LDG S +++ G Sbjct: 220 AICLTEEGKILFVTVDGRSPENAVGINIPELAHL-LHVLGGKDALNLDGGGSTALWLSGA 278 >UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8J2Y6_DESDA Length = 429 Score = 51.5 bits (122), Expect = 3e-05, Method: Composition-based stats. 
Identities = 40/207 (19%), Positives = 69/207 (33%), Gaps = 23/207 (11%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + + L+D + A ++P + + +G G L L G + A Sbjct: 131 PGLDFGEFQLTDSEALLTALRIDPAHFDFILCARSQDG---GNLRPLNQWAEQYG-LTAA 186 Query: 89 MNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFI-------RPGGVFYVAGDKVGI 140 +N +Y G +NG + G FF+ PG D Sbjct: 187 INASMYLPDGITSTGYMRQNGH-HNNKRVVQRFGAFFVAGPDSPDLPGAAIVDRDDPQWE 245 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 R+ + + +Q+ M + I P I + V + G +FL +Q Sbjct: 246 QRIGQY------RLVIQNYRMTSADRRI--LWSPGGPHYSI-SAVAQDGDGRILFLHCRQ 296 Query: 201 ATNFYDFA-CYAKAKLNVEQLLYLDGT 226 Y FA LNV ++Y++G Sbjct: 297 PVEAYAFAQQLLHLPLNVRTVMYVEGG 323 >UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HRE9_9FIRM Length = 487 Score = 51.5 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 35/92 (38%), Gaps = 13/92 (14%) Query: 150 KEIQFAVQSGPMLMENGVIN-----PRIHPNVAS-SKIRNGVGINKHGNAVFLL------ 197 + A+ +GPML++NG I +VA R +G+ K G + ++ Sbjct: 364 DKTVHALGAGPMLLKNGSIYLTTKIEEFGSDVAGGRAPRTALGLTKDGRVLLVVVDGRQP 423 Query: 198 SQQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 + + A +L + LDG S Sbjct: 424 TSAGMTLLELA-LFLQELGAVDAMNLDGGGSS 454 >UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEQ2_9FIRM Length = 470 Score = 51.1 bits (121), Expect = 3e-05, Method: Composition-based stats. 
Identities = 31/132 (23%), Positives = 52/132 (39%), Gaps = 23/132 (17%) Query: 124 FIRPGGVFYVAGDKVGI-VRLDAFKTSKEIQFAVQSGPMLMENGVIN--------PRIHP 174 F+RP V GDKV I L + K +GP+L+ +G++N P Sbjct: 329 FLRPLAV----GDKVKIKTSLGSPLADKAPSVGT-AGPLLVYDGLVNVTASLEEIPSDIA 383 Query: 175 NVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTIS 228 + R VGI K G + +++ + A Y +L ++ + DG S Sbjct: 384 DG--RAPRTAVGIKKDGTILVVVADGRSSRSAGMTLPELARY-LIQLGADRAMNFDGGGS 440 Query: 229 HMYMKGGAIPWQ 240 + GA+ + Sbjct: 441 SEMVVNGAVKNR 452 >UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Deinococcus deserti VCD115 RepID=C1CWE2_DEIDV Length = 442 Score = 51.1 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 76/216 (35%), Gaps = 25/216 (11%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE- 96 L + VQ V+ + V + + L A + + + Q +NG + Sbjct: 216 LRPLNIPVQLVRVDLRHRDVLVAPVLPHAGLVFGLGARVGQLAQRSGAQALINGSYFHPR 275 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI--VRLDAFKTSKEIQF 154 +YAP G + G+ + P + ++ I + + + Sbjct: 276 TYAPAGDIVMQGRML----------TWGRIPMALAITPDNRATIRATTTPLLRRPLDTTW 325 Query: 155 -----AVQSGPMLMENGVINPRIH-----PNVASSKIRNGVGINKHGNAVFLLSQQATNF 204 + +GP ++ G ++ + P + R+ VG++ + + V + ++ Sbjct: 326 RGMETVIATGPRIVTGGAVHTNYNQVFRDPALFGRAARSAVGLSSNRDLVMVSTRVRLTT 385 Query: 205 YDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPW 239 + +L V++ L LDG S + G A+ Sbjct: 386 TEMGKVM-TRLGVKEALLLDGGSSAGLAWNGRAVLD 420 >UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V4S8_9FIRM Length = 491 Score = 51.1 bits (121), Expect = 4e-05, Method: Composition-based stats. 
Identities = 35/158 (22%), Positives = 55/158 (34%), Gaps = 32/158 (20%) Query: 99 APLGL-YIENGQQKVALNLASGEGNFFIRPGG-----------VFYVAGDKVGIVRLDAF 146 P GL Y+ + +N I P G F AG +VG + Sbjct: 310 NPYGLEYVIRNGRVAEINTND----SLIPPDGYVVSVHGTLMDAFAAAGVRVGDPAVLTE 365 Query: 147 KTSKEIQFAVQ---SGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLL 197 + AVQ +GP L+ENG ++ + ++ R VG+ + GN +F + Sbjct: 366 DLGEPWNRAVQVLGAGPRLVENGSVHVTAGEEQFPGDIRYGRAPRTAVGVTQKGNILFAV 425 Query: 198 SQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH 229 +FA + V + LDG S Sbjct: 426 VDGRQSHSHGLTLTEFADL-LVQFGVRDAINLDGGGSS 462 >UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactobacillales RepID=C4G6X0_ABIDE Length = 345 Score = 50.7 bits (120), Expect = 5e-05, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 58/176 (32%), Gaps = 29/176 (16%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 + + + +A+NG Y + G I+NG + E P Y G Sbjct: 157 SSMATNNNAILAVNGDYYGADRS--GYVIKNGVIYRNTVRSDSE-----YPDLAVYKDGS 209 Query: 137 KVGIVRLDAFKTSKEIQ-----FAVQSGPMLMENGVINPRIHPNVA-----SSKIRNGVG 186 I + FA GP L+ENG I + N + R +G Sbjct: 210 FKIIYETEVTAEELLADGVVNLFAF--GPSLVENGEI--SVDQNTEVRQAMTKNPRTAIG 265 Query: 187 INKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLD-GTISHMYMKGG 235 I + + ++S T + Y+ A K + LD G S MY G Sbjct: 266 IVDKNHYILVVSDGRTSESEGLSLYELAEVLK-EYGATTAYNLDGGGSSTMYFNGN 320 >UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RLV8_ACIFE Length = 477 Score = 50.7 bits (120), Expect = 5e-05, Method: Composition-based stats. 
Identities = 24/117 (20%), Positives = 43/117 (36%), Gaps = 13/117 (11%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGI 187 GD V + + + +GP L+ +G + I ++A R GVGI Sbjct: 342 TGDPVKVTQTLGNAAADSAPSVGSAGPQLVRDGRVQVTSEEEEIADDIALGRAPRTGVGI 401 Query: 188 NKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 K G + +++ +F Y +L ++ + DG S + G I Sbjct: 402 KKDGTVLVVVADGRSDDSVGMTLTEFGRYF-VQLGADRAMNFDGGGSSEMVVNGKIM 457 >UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BF40_9BACI Length = 657 Score = 50.3 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 11/81 (13%) Query: 158 SGPMLMENGVINPRIHPNVASSK---IRNGVGINKHGNAVFLLS-------QQATNFYDF 207 SGP+L+ NG ++ + PN ++ R V I+K + VFL++ + N +F Sbjct: 278 SGPLLVNNGKVDLGMDPNSTRARERAPRTAVAIDKTMSKVFLVTVDGRLAESKGMNLTEF 337 Query: 208 ACYAKAKLNVEQLLYLDGTIS 228 A Y KL + L LDG S Sbjct: 338 AQY-LVKLGAYKALNLDGGGS 357 >UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=12 Tax=Firmicutes RepID=A4VXL8_STRSY Length = 312 Score = 50.3 bits (119), Expect = 6e-05, Method: Composition-based stats. 
Identities = 44/211 (20%), Positives = 71/211 (33%), Gaps = 24/211 (11%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGT-LHALLADINSQGQVQMAMNGGIYDESYAP 100 T Y + Q + +GT + A ++ + +A+NG Y + Sbjct: 88 TNNTTVYVADIQVSSPEYLKTALAQNTYGTNVTAKTSETAAANNAILAVNGDYYGANS-- 145 Query: 101 LGLYIENGQQKVALNLASGE-GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G I+NG + G+ I G F V + I + A G Sbjct: 146 TGYVIKNGVLYRDTVRDNAAYGDLAIYADGSFEVIYENE-ITAQELIDKGVVNLLAF--G 202 Query: 160 PMLMENGVINPRIHPNVA------SSKIRNGVGINKHGNAVFLLSQQAT------NFYDF 207 P L+ENG I SS R+ +GI + + +++ T + Y Sbjct: 203 PSLVENGEIV---VDTSTEVGRAMSSNPRSAIGIIDENHYIIVVADGRTSESQGLSLYQL 259 Query: 208 ACYAKAKLNVEQLLYLD-GTISHMYMKGGAI 237 A K + + LD G S +Y G I Sbjct: 260 AEVMK-QYGAQTAYNLDGGGSSTLYFNGQVI 289 >UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=Firmicutes RepID=C2KZT9_9FIRM Length = 438 Score = 49.9 bits (118), Expect = 7e-05, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 76/199 (38%), Gaps = 20/199 (10%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 D + V V T + + G + +D+ + +A+NG Y + Sbjct: 217 DSNIYVADVEVTDGTSILSAFANNTYGR---NITDTTSDMAEENNAVLAINGDYYGARQS 273 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G I NG + ++GE + G + +++ D+ K+ + G Sbjct: 274 --GYVIRNGVVYRS-QGSNGEDMVISKDGSLSFISESD---TTTDSL-IQKQTWQVLSFG 326 Query: 160 PMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACY 210 P+L+ENG + + V +S R +G + +F++S T + Y+ A + Sbjct: 327 PVLVENGQVAVSENDEVGMAMASNPRTAIGTVAKNHYLFVVSDGRTSESAGLSLYELANF 386 Query: 211 AKAKLNVEQLLYLDGTISH 229 K+ L + LDG S Sbjct: 387 MKS-LGATNVYNLDGGGSS 404 >UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WLU9_9ACTO Length = 447 Score = 49.9 bits (118), Expect = 7e-05, Method: Composition-based stats. 
Identities = 49/218 (22%), Positives = 71/218 (32%), Gaps = 36/218 (16%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 T+T TV T+ + AN + + + I S A+NG Y + Sbjct: 223 TVTYYVATVKL-TDATALKSAFANNQFGRNITQKTSTIASNNNAIFAINGDYY--GFRSS 279 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI---VRLDAFKTSKEIQFAVQS 158 G+ I NG +G FY V I + K KE + S Sbjct: 280 GIVIRNGVVYRDDGARAGLA---------FY-RDGSVKIYDETSTNGQKLVKEGVWNTLS 329 Query: 159 -GPMLMENGVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQA------ 201 GP L++NG I I ++ ++ R VG K G VF++ Sbjct: 330 FGPSLVKNGKIVEGIDDVEIDTNFGNHSIQGNQPRTLVGAKKDGTLVFVVVDGRDAGYSR 389 Query: 202 -TNFYDFACYAKAKLNVEQLLYLD-GTISHMYMKGGAI 237 + A + LD G S MY G I Sbjct: 390 GVTMTEAAKIMLEQ-GCVTAYNLDGGGSSTMYFNGEVI 426 >UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C5_CLONN Length = 335 Score = 49.5 bits (117), Expect = 9e-05, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 58/180 (32%), Gaps = 42/180 (23%) Query: 88 AMNGGIYDESY-------------APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 A+N G + + P G+ I NG+ +R +A Sbjct: 152 AINAGGFVANNASSKDANPSETNGNPGGILISNGEIVYN----------NLRNNEKICIA 201 Query: 135 GDKV-GIVRLDAFKTSK----EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 G GI+ + + + ++ AV GP L+ NG + R +G K Sbjct: 202 GITADGILLVGNYNLDEMMKLNVKDAVSFGPALIVNGQKTITSGDGGWGTAPRTAIGQRK 261 Query: 190 HGNAVFLLSQQ------ATNFYDFACYAKAKL---NVEQLLYLDGTISH-MYMKGGAIPW 239 G+ +FL+ A + + L + LDG S MY G I Sbjct: 262 DGSILFLVIDGKYIGRLAVTLREL----QDILYEYGAYNAVNLDGGSSSTMYYNGKVISE 317 >UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C43112 Length = 762 Score = 49.5 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 23/115 (20%), Positives = 44/115 (38%), Gaps = 19/115 (16%) Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLS-------- 198 ++ +F + +GP L+ NG + + + R VG + G +FL++ Sbjct: 292 RDAEFILATGPTLVRNGQTSISMSTSSPFARERAPRTAVGASSDGTKLFLVTIDGRQSGY 351 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + A Y ++ + + LDG G RYP+ +SV + Sbjct: 352 SNGVTIPELAAYMRS-IGAHNAINLDGG-------GSTTMVARYPWADHVSVVNR 398 >UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GEE0_9FIRM Length = 379 Score = 49.5 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 70/198 (35%), Gaps = 32/198 (16%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES--YAPLGLYIEN 107 V+P RV + +G ++ I + +A+N + + ++P+ + Sbjct: 181 VDPTKLRVAF-----AHDEYGAPRKPVSKIANSNNAILAINASGFSGNVPFSPV---VRE 232 Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKV--GIVRLDAFKTSKEIQFAVQSGPMLMEN 165 G+ + G I G+ +G + ++ A + I F P+L+ N Sbjct: 233 GEVYSMDINHTPMG---ITACGMLMDSGKRGVEQMIEDGAHQV---ITFR----PVLVRN 282 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVE 218 G + N + R +G ++G+ +F++ N D A + Sbjct: 283 GQMTST-AQNNNTIHPRTAIGQKENGDLIFIVVDGRRNNWSTGINLGDLAQIFIDE-GAA 340 Query: 219 QLLYLDGTIS-HMYMKGG 235 LDG S +Y G Sbjct: 341 WAYNLDGGGSTTLYFNGK 358 >UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M8_9BACE Length = 329 Score = 49.1 bits (116), Expect = 2e-04, Method: Composition-based stats. 
Identities = 33/203 (16%), Positives = 59/203 (29%), Gaps = 54/203 (26%) Query: 78 DINSQGQVQMAMNGGIY-DESYAPLGLYIENGQQKVALNLASG----------EGNFFIR 126 Q + + MNGG + + + L G+ A+N G F + Sbjct: 101 WQAEQQKYPIIMNGGYFVMGAGKSVSLLCREGEVL-AVNSQEEIRSQKSYYPTRGIFQLS 159 Query: 127 PGGVF-----YVAGDKVGIVRLDA------FKTSK-------------EIQFAVQSGPML 162 G F Y D V ++ + A+ GP+L Sbjct: 160 KNGYFSTDWAYTTTDGVTYTYEQPSPNKSGYEPQPAPSAYFPTRGVKLNAETAIGGGPIL 219 Query: 163 MENGVINPRIHPNV---------ASSKIRNGVGINKHGNAVFLLSQQA--------TNFY 205 +++G + + S R +G+ + +F + + N Sbjct: 220 LKDGSVRNTFIEELFDEESGVAPESYHPRTAIGVTANNKVIFFVCEGRSVTEGVKGMNMA 279 Query: 206 DFACYAKAKLNVEQLLYLDGTIS 228 A K+ L + LDG S Sbjct: 280 MMANILKS-LGCVDAMNLDGGGS 301 >UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillonellaceae RepID=D1BL19_VEIPT Length = 312 Score = 48.8 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 53/175 (30%), Gaps = 25/175 (14%) Query: 88 AMNGGIYDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG + + P G + +G+ + ++ E V +V K G + Sbjct: 135 AINGGGFHDPNGTGTGRLPYGFILHDGEYVIGKDVGPDED--------VDFVGFSKAGNL 186 Query: 142 RLDAFKT----SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 + + + GP L+ +G R +G K G +FL+ Sbjct: 187 IAGNYNKTQLGDMKAMEGITFGPPLIVDGKKMITEGDGGWGVGPRTAIGQKKDGTVLFLV 246 Query: 198 SQQATNFYDFACYAKA------KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 Y + + LDG S +Y+ G + Sbjct: 247 IDGRQPGYSIGATLRDVQDILFEKGCYIAANLDGGSSSTLYLNGKVVNKPADLLG 301 >UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4R7_9CLOT Length = 894 Score = 48.8 bits (115), Expect = 2e-04, Method: Composition-based stats. 
Identities = 35/206 (16%), Positives = 70/206 (33%), Gaps = 28/206 (13%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD----INSQGQVQMAMNGGIYD-E 96 + ++ + + V + N E+ L + + V +N Y+ Sbjct: 64 RIESFVIEIDTKNKNVSIEASTPNDESAYGLQPVRKQAEALLAKGENVVAGVNADFYNMA 123 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFA 155 + P G+ +++G N F I + I + F ++ A Sbjct: 124 TGEPNGVLLKDGVIIK--NHPESRKFFGI-------LKDGSAVIGDYNKFNEVKDNVEEA 174 Query: 156 VQSGPMLMENGVI--NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 + +L+++G + P+ A + R VGI +GN F+ + D Sbjct: 175 LGGNAILVKDGQVFETPQ---TGADKEPRTAVGIKSNGNVFFITVDGRQEPYSAGLSMDD 231 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYM 232 A + + Q L LDG S ++ Sbjct: 232 LAQLMIS-MGAIQALNLDGGGSTTHL 256 >UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chroococcales RepID=B7KAU9_CYAP7 Length = 644 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 34/94 (36%), Gaps = 12/94 (12%) Query: 158 SGPMLMENGVIN-----PRIHPNVASSKI-RNGVGINKHGNAVFLLSQQAT-----NFYD 206 +GP+L+ NG I + + K R+ + + G + + + + Sbjct: 532 AGPLLLLNGQIVLDVASEQFSKGFQNQKASRSAIATTRDGKLMVVAVHNRVGGSGASLPE 591 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 A ++ L L LDG S GG + + Sbjct: 592 LAQILQS-LGAVDALNLDGGSSTSLALGGQLIDR 624 >UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A8G9_NATTJ Length = 718 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 36/97 (37%), Gaps = 14/97 (14%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPR-----IHPN-VASSKIRNGVGINKHGNAVFLLSQQ 200 K +++ FA+ GP ++E G ++ R I N R VG+ + G + Sbjct: 596 KNVEDVVFALGGGPRILEKGEVDIRSMEEVISDNVSQGRSPRTAVGVTRDGQLLLTAVDG 655 Query: 201 A-------TNFYDFACYAKAKLNVEQLLYLDGTISHM 230 + + K + + L LDG S M Sbjct: 656 RQSGLSIGMTLEELGNFMKDR-GAQDALNLDGGGSTM 691 Score = 42.2 bits (98), Expect = 0.016, Method: Composition-based stats. 
Identities = 18/104 (17%), Positives = 33/104 (31%), Gaps = 16/104 (15%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 + + ++P + +G + L + + A+NGG Y + P+ Sbjct: 396 PIKIHELRLDPHGDVKPELIMAQDG--FSGFERLDSMAKRNNAIA-AINGGFYWRAGHPI 452 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPG--GVFYVAGDKVGIVRL 143 GLYI + + P FY + I R Sbjct: 453 GLYISDQRLIRE-----------PMPNRSAFFYSKDGEATIERT 485 >UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AQ96_9BACE Length = 305 Score = 47.6 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 67/208 (32%), Gaps = 22/208 (10%) Query: 43 LTVQAYTVNPQTERVKMYWQK-ANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 Y + Q A+G + + + I +A+NG Y + Sbjct: 78 YDTSIYVADIQLADASYLRAGLADGTFGRNVTEVTSQIAQDSNAILAINGDFY--GFRNK 135 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ--S- 158 G + NG +GN + V Y G I + + + A Q S Sbjct: 136 GYVMRNGYLYRETAQQGRQGNS-RQEDLVIYEDGHMDVIEENEVAAQTLKDSGASQIFSF 194 Query: 159 GPMLMENGVINPRIHPNVA-----SSKIRNGVGINKHGNAVFLLSQQAT------NFYDF 207 GP L++NG I + N S R +G+ + + +S T Y Sbjct: 195 GPGLIKNGNI--TVDENSEVEQSMQSNPRTAIGMITPLHYIMAVSDGRTEASEGLTLYQL 252 Query: 208 ACYAKAKLNVEQLLYLD-GTISHMYMKG 234 A K + + LD G S M+ G Sbjct: 253 AQIMKGQ-DCVTAYNLDGGGSSTMWFNG 279 >UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HTR4_CYAP4 Length = 603 Score = 47.2 bits (111), Expect = 5e-04, Method: Composition-based stats. 
Identities = 25/107 (23%), Positives = 41/107 (38%), Gaps = 15/107 (14%) Query: 146 FKTSKEIQFAVQSGPMLMENGVI--NP---RIHPNVASSK-IRNGVGINKHGNAVFLLSQ 199 F +I A GP+L+E G I NP + + + + R+G+G G + + + Sbjct: 477 FNRFPQILGA---GPLLLERGQIVLNPDLEQFGNGLDAQQAPRSGIGRTSTGQILLVTTH 533 Query: 200 QAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 ++A K L L LDG S GG + + Sbjct: 534 NRIGGAGPTLAEWAAILK-TLGAVDALNLDGGSSTALYLGGQLLDRH 579 >UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostridium RepID=A6LS70_CLOB8 Length = 356 Score = 47.2 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 66/229 (28%), Gaps = 39/229 (17%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 +P ++ + G + I A+NGG + + + Sbjct: 141 YYLVVKDPTRVKIGV------SSKLGVEGETTSTIAENNDAIAAINGGAFT-DQSSAAQW 193 Query: 105 IENGQQKVALNLASGE-----------GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ 153 NG + + GE F I GV V + +L + + Sbjct: 194 TGNGGLASGIVMTGGEVKVNDVGDNPTTTFGIDKNGVMVVGD--YTVEKLKELGIQEALS 251 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 F GP L+ NG + + + +G K G+ + L+ + Sbjct: 252 F----GPALIINGNMVKINGDGGFGTAPKTAIGQMKDGSIILLVIDGREIGSIGATLKEL 307 Query: 208 ACYAKAKLNVEQLLYLDGTIS-HMYM-------KGGAIPWQRYPFVTMI 248 +L + LDG S +Y ++ + P ++ Sbjct: 308 QEIM-HQLGAWNAMNLDGGKSTTLYYYGEVRNKPSNSMGERTIPTAVIV 355 >UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FXK4_9FIRM Length = 305 Score = 47.2 bits (111), Expect = 6e-04, Method: Composition-based stats. 
Identities = 41/186 (22%), Positives = 65/186 (34%), Gaps = 31/186 (16%) Query: 67 EAWG-TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI 125 E +G + A + +A+NG Y G I NGQ + + + + I Sbjct: 109 ETYGRNVKAKTSTTAQSVNAVLAVNGDYYGA--RDAGYVIRNGQLLRSDSQDPNQEDLVI 166 Query: 126 RPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS---GPMLMENGVINPRIHPNVAS---- 178 G F + + G + AVQ GP L+E+ + + Sbjct: 167 YQDGSFEII--REGDITAQELLNKG----AVQVLSFGPALIEDSQV----AVDSTDEVGK 216 Query: 179 ---SKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLD-GTIS 228 S R +GI + V ++S T + + A + K +L V LD G S Sbjct: 217 AMASNPRTAIGIIDDKHYVLVVSDGRTDESKGLSLKELADFMK-ELKVTTAYNLDGGGSS 275 Query: 229 HMYMKG 234 MY G Sbjct: 276 TMYFNG 281 >UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TKB7_ALKMQ Length = 236 Score = 47.2 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 67/191 (35%), Gaps = 27/191 (14%) Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGL-YIENGQQKVALNLASGEGNFF 124 + + + N ++ A+NGG +D + P G+ Y+++G S G+ F Sbjct: 35 QPVQQIRHSYFEANGYKRIG-AVNGGFFDGNRTLPYGMFYVDSGFLLSE----SWAGDAF 89 Query: 125 I---RPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN----PRIHPNVA 177 + G ++ ++ K+ +A+ L+ G +N + Sbjct: 90 LELVHENGKLHIDDITANQLKTKY----KKANWAISLSYSLVVGGKMNIMKGDKFPFTNQ 145 Query: 178 SSKIRNGVGINKHGNAVFLLSQQATNFY------DFACYAKAKLNVEQLLYLDGTISHMY 231 S R +G + N +F++++ + A +L + DG S Sbjct: 146 S-HPRTLIG-DNQENYIFVVTEGRMTKEKGLTAVESARVML-ELGCNTAINADGGGSSAM 202 Query: 232 MKGGAIPWQRY 242 G I + Y Sbjct: 203 DVEGKIQNKYY 213 >UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67T45_SYMTH Length = 921 Score = 46.8 bits (110), Expect = 6e-04, Method: Composition-based stats. 
Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 10/105 (9%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 ++ K G +++ S + +A+ L+ +G + + + AS + R+ VG + G Sbjct: 240 FLDPLKPGDPVTVSYRPSPAVAWAIGGQNYLVRDGAVVSGL--DNASRRPRSAVGFSADG 297 Query: 192 NAVFLL-----SQQAT--NFYDFACYAKAKLNVEQLLYLDGTISH 229 ++LL S ++ + A + K+ L LDG S Sbjct: 298 RRMYLLVIEGDSSRSVGATLAEMAAFMKS-FGAANALELDGGGSS 341 >UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0BZE5_ACAM1 Length = 584 Score = 46.8 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 41/113 (36%), Gaps = 17/113 (15%) Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHP--NVASSKIRNGVGINKHGNA 193 +F I A GP+L+ NG + + P + S+ R+G+G G Sbjct: 457 TTPASFNGFPNIVGA---GPLLVSNGQVVLNAKAEKFRPPFDTQSAP-RSGIGQTADGTI 512 Query: 194 VFL-----LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + +S ++A + +L L LDG S GG + + Sbjct: 513 LLAAVHNQVSGPGPTLKEWALIMQ-RLGSVNALNLDGGSSTSLYLGGQLLDRH 564 >UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteria RepID=Q4UP44_XANC8 Length = 439 Score = 46.8 bits (110), Expect = 7e-04, Method: Composition-based stats. 
Identities = 39/237 (16%), Positives = 74/237 (31%), Gaps = 39/237 (16%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA-LLADIN 80 LTL P + P + + ++ T +++ + G A Sbjct: 173 PLTLAPGVRYWRQAIGGAQP-VMLHIAQIDLTTPGLQLVGTPGDRSDGGEFRATPTTAFV 231 Query: 81 SQGQVQMAMNGGIY---------DESYAPL--------GLYIENGQQKVALNLASGEGNF 123 G + +A+N + D+ + P GL IE G+ A + Sbjct: 232 RDGALTLAINADYFLPFDGGHLLDKPFVPAAGQGVTAEGLAIEAGRTDSAAATSD----- 286 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV---ASSK 180 R V+ VR+ + V +GP+L+ +G PR + Sbjct: 287 -PRVNAALCVSQRDA--VRIVRGSCPAGSRLGVGAGPLLLLDGKRQPREASRAAYYDGPE 343 Query: 181 IRNGVGINKHGN-AVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISH 229 R+ VG+++ G+ +++ +L + LDG S Sbjct: 344 PRSAVGLDRSGHTLWMVVADGRQPGYSAGMTLDALTAVF-EQLGAHAAINLDGGGSS 399 >UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IP98_9BACE Length = 536 Score = 46.8 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 65/213 (30%), Gaps = 70/213 (32%) Query: 83 GQVQMAMNGG-IYDESYAPLGLYIENGQQKVALNLASGE----------------GNFFI 125 G + +NGG Y+ ++ G+ + N + +++ G G F + Sbjct: 297 GDCYLLVNGGYFYNGNH--TGIAVINSIKSGSVSAVRGSLKTGDTEYNSMYNVTRGTFGV 354 Query: 126 ----RPG----------GVFYVA--------GDKVGIVRLDAFKTSKE--IQFAVQSGPM 161 +P VFY +K GIV + T+ ++A+ +GP+ Sbjct: 355 DASGKPNVVWTGTDASSNVFYFDRPLPSVKGENKYGIVTNENPTTAISWSPKYALSAGPV 414 Query: 162 LMENGVINPRIHPNVASSKI--------------------RNGVGINKHGNAVFLLSQQA 201 L+++ I + R +G + G V + Sbjct: 415 LLKDKKIPFDFTETSKGTDYYLSNYEIIPYDIFGANVTPDRTAIGYREDGKVVIFICDGR 474 Query: 202 T------NFYDFACYAKAKLNVEQLLYLDGTIS 228 + A K L + LDG S Sbjct: 475 ITASGGATLTELAQIMK-GLGCVGAINLDGGGS 506 >UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3R0_PELTS Length = 485 Score = 46.4 bits (109), Expect = 9e-04, Method: Composition-based stats. 
Identities = 28/102 (27%), Positives = 43/102 (42%), Gaps = 15/102 (14%) Query: 146 FKTSKEIQF-AVQSG-PMLMENGVI------NPRIHPNVASSKIRNGVGINKHGNAVFLL 197 FK +++ +F A S P+L+ NG I P++ R+ VG+ V Sbjct: 376 FKENQQARFKATISNYPLLLSNGAIALGDITEPKLTIGAP----RSFVGVTWDNILVMGT 431 Query: 198 SQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIP 238 A N ++ A K L ++ L LDG S +Y G I Sbjct: 432 VDSA-NVWELAEVTKN-LGLKDALNLDGGASCGLYYDGAYIR 471 >UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN59_9BACE Length = 315 Score = 46.4 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 39/102 (38%), Gaps = 10/102 (9%) Query: 134 AGDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN 192 +K+ I ++ ++ SG +++ +G I+ +P + R +G + G+ Sbjct: 179 KDNKMVIADSVEYRGMQYNLKEVTGSGVIVLRDGEISGATYPGID---PRTCLGYSDDGH 235 Query: 193 AVFLLSQQATNFYDFACYAKA------KLNVEQLLYLDGTIS 228 F+++ FY + L + LDG S Sbjct: 236 VYFMVADGRVEFYSYGLTYPEMGSIMKALGCSWAVNLDGGGS 277 >UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WS35_9SYNE Length = 687 Score = 46.4 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 50/168 (29%), Gaps = 26/168 (15%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS----- 158 ++N + ++ + P Y+ + +F+ + + QS Sbjct: 507 TVQNHEVIAQKSMGKAGSSSVPIPRDGGYLLALRSYRSAGQSFQPGTPVLLSSQSQPAVF 566 Query: 159 ---------GPMLMENGVI--NPRIHPNVAS----SKIRNGVGINKHGNAVFLLSQQAT- 202 GP+L+ + I NP++ + + R VG G + Sbjct: 567 EQYPNMIGGGPLLVRDRNIVLNPQLEGFSTNFIQGAAPRTAVGKTSDGTWIIATMHDRVG 626 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A K +L L LDG S GG + + Sbjct: 627 GRGPTLTETAYIMK-QLGAVDALNLDGGSSSSLYLGGQLLNRHPRTAA 673 >UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V127_BACUN Length = 277 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. 
Identities = 24/109 (22%), Positives = 42/109 (38%), Gaps = 18/109 (16%) Query: 158 SGPMLMENGVI------NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNF 204 SGP+++++G + N + R+ V + + G + ++ N Sbjct: 165 SGPLMLKDGQVCDLSGTNRNFV---DTKHPRSAVALTREGKILLIVVDGRRKGKAEGINI 221 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + A + L E L LDG S + GA+P + S ERK Sbjct: 222 PELAHMIR-ILGGEDALNLDGGGSST-LWSGALPDKGIANTPSGSAERK 268 >UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostridium RepID=B8I4Q1_CLOCE Length = 346 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 67/227 (29%), Gaps = 33/227 (14%) Query: 37 ALSDPTLTVQAYTVN-PQTERVKMYWQKA-NGEAWGTLHALLADINSQGQVQMAMNGGIY 94 + + V+ P +V + +GE ++ + A+NGG + Sbjct: 123 DVESRNFKGKMIIVDDPTRIKVGYSSKMPRSGETTSSIARRNGAVA-------AINGGGF 175 Query: 95 ------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI-VRLDAFK 147 +G I NG+ N + + + + + + A Sbjct: 176 IDKGWAGTGGVAIGFVISNGKYISGKLT-----NNYTKRDTIAFTKDGMLIVGKHSQAEL 230 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGNAVFLLSQQATNFY 205 I+ + GP L+ NG P I+ R +G + G+ + L+ + Sbjct: 231 AKYNIKEGISFGPPLIVNGK--PTINKGDGGWGISPRTAIGQKEDGSVMLLVIDGR-SLK 287 Query: 206 DFACYAKA------KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 F K + LDG S MY G + Sbjct: 288 SFGATLKEVQDIMLEHGAVNAANLDGGSSATMYYDGKVVNTPSDALG 334 >UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3362 Length = 356 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. 
Identities = 32/163 (19%), Positives = 49/163 (30%), Gaps = 24/163 (14%) Query: 86 QMAMNGGIYDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 +A+N G +++ PLG+ + GQ K N+ S VF + Sbjct: 182 TLAINAGGFEDIGGVGNGGTPLGIVMSEGQLKYG-NVNSSYDLIGFDNNNVFVIGQ---- 236 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN-VASSKIRNGVGINKHGNAVFLLS 198 + I+ AV GP L+ NG P R +G G + L+ Sbjct: 237 --MTGQQAIDRGIRDAVSFGPFLILNGT--PLEVSGMGGGLNPRTAIGQRADGAVLLLII 292 Query: 199 QQATNFYDFACYAKA------KLNVEQLLYLDGTISH-MYMKG 234 + LDG S +Y G Sbjct: 293 DGRQT-HSLGASMNDLINVMLDFGAVNAANLDGGGSTVLYYDG 334 >UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing protein n=2 Tax=Deinococcus RepID=Q1IXP5_DEIGD Length = 444 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 60/176 (34%), Gaps = 25/176 (14%) Query: 89 MNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR--LDA 145 +NG + SYAP G + G+ G P + ++ I+ Sbjct: 269 VNGSYFHPRSYAPAGDLVVQGRLLA-------WGRI---PVALAITPDNRAAIMTSTTPL 318 Query: 146 FKTSKEIQF-----AVQSGPMLMENGVINPRI-----HPNVASSKIRNGVGINKHGNAVF 195 E+ + + +GP ++ G + + P + R+ VG+ + + VF Sbjct: 319 LGRPLEVSWHGMETVIATGPRILNGGTVVRQYASAFRDPALFGRAARSAVGLKSNRDLVF 378 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISV 250 + + + A+L V L LDG S + G A+ I V Sbjct: 379 VTTHAKLTTTEMGKVM-ARLGVRDALLLDGGSSAGLAWNGQAVLDSVRKVAYGIGV 433 >UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YND3_9CYAN Length = 304 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. 
Identities = 33/207 (15%), Positives = 71/207 (34%), Gaps = 24/207 (11%) Query: 41 PTLTVQAYTVNPQTERVKMYW----QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 L + ++ T +++ Q + + ++ + +Q+A+NG + Sbjct: 60 DPLMIHIVKIDLTTPGIELLVTPGEQGEDDQDIS--AQTTSEFLQKHYLQLAINGSFFHP 117 Query: 97 SY--APLGLYIENGQQKVALNLASGEGNFFI---RPGGVFYVAGDKVGIVRLDAFKTSKE 151 Y P+ Y +G++ A +G + + V ++ K + F T + Sbjct: 118 FYVHNPIDYYPNSGERVNIFGQAISQGKIYSIVNKGWSVLCISPKKKAEI---YFDTCPK 174 Query: 152 IQFAVQSGPMLMENGV--INPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNFYD-- 206 +G +++ + I + + R V I+K G +L ++Y Sbjct: 175 NTLQGIAGNLILIDQGQPIKVKKFSDANQKFPRTAVAIDKTGETLWLILIDGRQSWYSKG 234 Query: 207 --FACY---AKAKLNVEQLLYLDGTIS 228 A + VE L DG S Sbjct: 235 VTLATLTNIIQELDGVETALNFDGGGS 261 >UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 Tax=Nocardioides sp. JS614 RepID=A1SN25_NOCSJ Length = 420 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 56/161 (34%), Gaps = 24/161 (14%) Query: 92 GIYDESY-APLGLYIENGQQKV--------ALNLASGEGNFFIRP-GGVFYVA-GDKVGI 140 GIY + G + GQ + + +P G+ ++ G+ + Sbjct: 224 GIYTPRWGRTAGYGVTQGQTERVRAVTVVNGRVRTNRAKLSHDQPIKGLLFIGRGEGAKV 283 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGV---INPRIHPNVASS--KIRNGVGINKH-GNAV 194 +R T ++++++Q P + +G ++ I + R VG++ G + Sbjct: 284 LRKLPKHTRIKVRWSLQGRPQMAISGNNFLVHDGIIRAIDDREMHPRTAVGVDSDTGEVL 343 Query: 195 FLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH 229 L+ + A L ++ + LDG S Sbjct: 344 LLVVDGRQADSRGYTMVELANLMVD-LGADEAVNLDGGGSS 383 >UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobium RepID=B3PTF7_RHIE6 Length = 325 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 38/207 (18%), Positives = 68/207 (32%), Gaps = 23/207 (11%) Query: 27 PLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P F VA A + V+P R + + + + + Sbjct: 83 PGFEVAELPVLADGREVDRIFLSRVDPARFRFVTHNAAPGDK---GIDEWEKTLPNA--- 136 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL-- 143 + +NG +D+ P +I G + G F D I L Sbjct: 137 VLIVNGSYFDKHGRPDTPFISEGIAMGPRQYDARA--------GAFTADKDTAEIRDLSH 188 Query: 144 -DAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 D A+ S P+L+ ++G + ++ R V + G V +++A Sbjct: 189 QDWQTAFVGASNAMVSYPLLIGDDGQTH--VNVKSRWLANRTFVAKDDLGRVVIGTTKEA 246 Query: 202 -TNFYDFACYAK-AKLNVEQLLYLDGT 226 + A + K + LN++ L LDG Sbjct: 247 FFSLDRLAQFLKTSPLNLKVALNLDGG 273 >UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobacillus RepID=C9RVV6_GEOSY Length = 652 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 30/79 (37%), Gaps = 11/79 (13%) Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKA 213 L+ +G + P V R VGI+K+GN + ++ + A Sbjct: 384 RLVADGKVQPFSIEGV---HPRTAVGIDKNGNVMLIVVDGRQPAYSQGMTLNELAKLM-H 439 Query: 214 KLNVEQLLYLDGTISHMYM 232 +L + LDG S ++ Sbjct: 440 ELGAVDAMTLDGGGSSTFV 458 >UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=cyanobacterium UCYN-A RepID=UPI0001C3370C Length = 438 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 20/93 (21%), Positives = 37/93 (39%), Gaps = 12/93 (12%) Query: 159 GPMLMENGVI-----NPRIHPNVASSKI-RNGVGINKHGNAVFL-----LSQQATNFYDF 207 GP+L+ +G I + + + K R+ +GI + + ++ N + Sbjct: 329 GPLLINDGSISLNVKDEKFTKSFQKQKASRSAIGITNKDKTILVTVHNSINSNGVNLNEM 388 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 A + KL L LDG S + GG + + Sbjct: 389 AQIMQ-KLGSINALNLDGGGSTSLVLGGRLIDR 420 >UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y710_COPPD Length = 485 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 21/86 (24%), Positives = 33/86 (38%), Gaps = 10/86 (11%) Query: 158 SGPMLMENGVI--NPR-----IHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 +GP L+ NG I +P I R+ +GI ++ +++ D A Sbjct: 384 AGPRLITNGEITLDPASELLDIPKITGQPLTRSALGITQNNEL-LMVTVSKCTIQDLATI 442 Query: 211 AKAKLNVEQLLYLDGTIS-HMYMKGG 235 K L + LDG S +Y G Sbjct: 443 MKD-LGAYNAMNLDGGASTSLYANGK 467 >UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria RepID=B5VVA8_SPIMA Length = 789 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 31/88 (35%), Gaps = 12/88 (13%) Query: 159 GPMLMENGVINPRIHPNV------ASSKIRNGVGINKHGNAVFLLSQQATN-----FYDF 207 GP+L++N I IR+ VG+ G + + N + Sbjct: 682 GPLLVQNRNIVVNAEAEGFNYWFGQQLAIRSAVGVTATGEVLMVTVHNRVNGAGPSLTEM 741 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGG 235 A + +L + LDG S + GG Sbjct: 742 AKLMQ-QLGAIDAINLDGGSSTSLVLGG 768 >UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=Nostocaceae RepID=UPI0001C164F4 Length = 300 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. 
Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 6/83 (7%) Query: 153 QFAVQSGPMLMENGV--INPRI----HPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 F++ SGP L+ NG +NPR+ P V + +R +G ++ G +FL + + + Sbjct: 170 WFSITSGPRLLRNGEVSVNPRLEGFKDPAVLGTSLRTAIGFSEDGKRLFLANFDEKLYLE 229 Query: 207 FACYAKAKLNVEQLLYLDGTISH 229 A + + + LDG S Sbjct: 230 EEAEAMKAIGCYEAMNLDGGPSR 252 >UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HN11_LYSSC Length = 815 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 38/89 (42%), Gaps = 12/89 (13%) Query: 151 EIQFAVQSGPMLMENGVINPRIHPNV---ASSKIRNGVGINKHGNAVFLLS--------Q 199 + QF + +GPML+ NG ++ + N ++ R V ++ G V L++ Sbjct: 266 DAQFILAAGPMLVRNGQVDISMPTNSGFASTRSPRTAVAVDATGTKVSLITIDGRLSGHS 325 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTIS 228 N D A + + + + LDG S Sbjct: 326 NGVNLSDLASHLIS-IGATSAINLDGGGS 353 >UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YL57_9CYAN Length = 620 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 37/102 (36%), Gaps = 12/102 (11%) Query: 158 SGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVFL-LSQQAT----NFYD 206 +GP+L+++G I R IR+ VG + + + + N + Sbjct: 508 AGPLLLQSGEIVLDAPSERFSEAFSNQQAIRSAVGRTPDNKLLLVAVHNRPLGSGPNLTE 567 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 A + KL + L LDG S GG + + I Sbjct: 568 LAQILQ-KLGAVEALNLDGGSSTSLYLGGELIDRPAQTAAPI 608 >UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IYX5_9BACL Length = 347 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. 
Identities = 36/217 (16%), Positives = 69/217 (31%), Gaps = 33/217 (15%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S + TV +P R+ + ++ GE + + + G + GG D Sbjct: 100 EISGKSYHGYVLTVNDPTKIRLGVPAKRGKGE------KVSSMVARTGALAGVNGGGFAD 153 Query: 96 ES-----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 + + P+G+ I G+ + V DK G + + + Sbjct: 154 PNWKGNGFKPIGVVISRGKLYYNGISSGAATQI---------VGLDKQGKMIAGKYTLEE 204 Query: 151 EIQFAVQSG----PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 + +Q P ++ NG R R +G + G +F++ Sbjct: 205 LDKLGIQEAVTFQPRIIVNGKGQIRSQKEGWGIAPRTAMGQREDGAILFVVIDGRQPGYS 264 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + YD + LDG S + +K G Sbjct: 265 IGASLYDVQQIMLER-GAVIAANLDGGSSTVLVKEGG 300 >UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PX63_9BACT Length = 294 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 69/210 (32%), Gaps = 35/210 (16%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 TV + P+ ++ A+G A + ++ + + + +NG + + Sbjct: 66 TVTVAEITPKRS-LEFDIAIADG------GATVGEMAQRTKALVGINGSYFGMNKRSAIT 118 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR-----LDAFKTSKEIQFAVQS 158 Y+ G+ + + +R G G K+ I+ + A S Sbjct: 119 YLRQGRTVLDTTTTAE---LALRVTGAIRTHGRKLRIMPWNKEIERRYHCRHGSTLA--S 173 Query: 159 GPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQA-------TNFY 205 G +L+ G I +S R+ + + G +F+ N Sbjct: 174 GHLLLYRGQ---SILLRSSSMGFVVKKHPRSAIALTSRGTVLFVTVDGRHPGYAGGMNLI 230 Query: 206 DFACYAKAKLNVEQLLYLDGTIS-HMYMKG 234 + + +L + LDG S ++ KG Sbjct: 231 EL-RHFLQQLGCTDAINLDGGGSTTLWAKG 259 >UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VX04_9CYAN Length = 681 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. 
Identities = 21/89 (23%), Positives = 35/89 (39%), Gaps = 13/89 (14%) Query: 158 SGPMLMENGVIN-----PRI-HPNVASSKIRNGVGINKHGNAVFLLSQQAT-----NFYD 206 +GP+L++N I + S IR+ +GI +G + N + Sbjct: 570 AGPLLLQNRQIVLDAKAENFSNAFAQQSAIRSAIGITANGTLIIAAMHNRVGGRGPNLTE 629 Query: 207 FACYAKAKLNVEQLLYLDGTIS-HMYMKG 234 A + +L L LDG S +Y+ G Sbjct: 630 TAQLMQ-QLGAVDALNLDGGSSTGLYLGG 657 >UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostridium RepID=B2V2N5_CLOBA Length = 348 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 64/194 (32%), Gaps = 31/194 (15%) Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIY-----------DESYAPLGLYIENGQQKVALN 115 + G L +++ + A+NGG + P G I +G+ + Sbjct: 143 KYLGKLGQKTSEMAEEHNAIAAINGGSFVDKSSDGITYAGTGGQPGGFVISSGKVVYPI- 201 Query: 116 LASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG-----PMLMENGVINP 170 G+ N V IV K++ VQ P ++ NG+ Sbjct: 202 ---GKCNEHSVEN-VIAFTKKGQLIVGNHTLAELKKLD--VQEAMCFREPNVIINGIRQH 255 Query: 171 RIHPNVASSKIRNGVGINKHGNAVFL------LSQQATNFYDFACYAKAKLNVEQLLYLD 224 + + R VG + G +FL LS+ Y+ +++ LD Sbjct: 256 KKEDYIDGINPRTAVGQKEDGTVLFLALDGRKLSKPGATIYEVQEIMRSR-GAINAGMLD 314 Query: 225 GTIS-HMYMKGGAI 237 G S MY KG I Sbjct: 315 GGYSTTMYYKGDVI 328 >UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q9_CLOCE Length = 952 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 33/88 (37%), Gaps = 11/88 (12%) Query: 151 EIQFAVQSGPMLMENGVINPRI--HPNVAS-SKIRNGVGINKHGNAVFLLSQQA------ 201 ++ A+ G ML+++ + +P S R +G +K G + + + Sbjct: 268 NMKMALTGGAMLVKDDKVLTSFSHNPVSPSTRASRTAIGTSKDGKTLIVAAVDGRSSASI 327 Query: 202 -TNFYDFACYAKAKLNVEQLLYLDGTIS 228 + A Y +L L LDG S Sbjct: 328 GMTQSELASYM-HELGCANALNLDGGGS 354 >UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LSB3_9FIRM Length = 475 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. 
Identities = 17/92 (18%), Positives = 34/92 (36%), Gaps = 13/92 (14%) Query: 158 SGPMLMENGVIN-----PRIHPNVAS-SKIRNGVGINKHGNAVFLLSQQA------TNFY 205 +GPML+++G+ + P++A R G+ G+ + + Sbjct: 364 AGPMLVKDGIAHVTATEEEFPPDIARGRAPRTAFGVTAEGHYLLAVVDGRQPHSIGCTLQ 423 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 + A + Q + DG S + GG + Sbjct: 424 EMAE-FMLQFGAVQAINFDGGGSSALVVGGEL 454 >UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFU4_BREBN Length = 359 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 59/183 (32%), Gaps = 36/183 (19%) Query: 73 HALLADINSQGQVQMAMNGGIYDES----YAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 G + +A+N G + PLG+ + +G+ + N A Sbjct: 172 ETTSQAGKRTGSI-LAINAGGFMSDKQGNLTPLGITVVDGKIRTFSNNAKLS-------- 222 Query: 129 GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG--------PMLMENGVINPRIHPNVASSK 180 F +K +V + KT +I Q G P L++ G P + + Sbjct: 223 --FVGFNNKGHLVGT-SIKTQAQI---TQQGILQGASFLPRLLQGGKRLPIPREWANARQ 276 Query: 181 IRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLD-GTISHMYM 232 R +G +G+ + ++ + A + +V LD G S Y Sbjct: 277 PRTLIGHFDNGDLLLIVIDGRRDGWSNGVTLEE-AQRKLQEWHVVDAYNLDGGGSSAFYY 335 Query: 233 KGG 235 G Sbjct: 336 NGK 338 >UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001694670 Length = 363 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. 
Identities = 37/217 (17%), Positives = 67/217 (30%), Gaps = 21/217 (9%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI 105 Y +P++ RV + +K GE ++ + G+ +AP+G + Sbjct: 124 MMYVFDPRSIRVVVPGKKGEGERITSMVERTGAVAGVNGGGFIDPDGL-GNGFAPIGAIL 182 Query: 106 ENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 G+ + G + K I +L A K S+ + F P ++ Sbjct: 183 SGGKVLYNDQKEDIPQHIVGFTDKGTLVI--GKYSIDQLRAMKVSEAVSF----YPRVIA 236 Query: 165 NGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ--QATNFYDFACYAKAKL---NVEQ 219 NG R +G G +F++ QA + + L Sbjct: 237 NGKPLITKGDGGWGRAPRTALGQRADGTVIFVVIDGRQAHSVGATLREVQDLLLEQGCIN 296 Query: 220 LLYLDGTISH--------MYMKGGAIPWQRYPFVTMI 248 +LDG S + +R P +I Sbjct: 297 AGFLDGGASSEMVKDRKLLTQPSSRYGERRLPSGFLI 333 >UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZEQ6_BREBN Length = 356 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 63/200 (31%), Gaps = 24/200 (12%) Query: 48 YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIEN 107 Y +P R+ + +K G+ I A G Y + G+ I Sbjct: 146 YISDPSRVRLVVTNRKDRGDLLDEFVNKTGAIGIVNASGFADPDG-YGKGARAYGVVIHE 204 Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ---SG-PMLM 163 G+ N SGE + G + ++ + ++ V+ S P L+ Sbjct: 205 GKILQGYNPRSGETALGLTYDG----------KLITGSYSAEQLVKMGVRDAVSFRPQLI 254 Query: 164 ENGV-INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKL 215 NG + + R +G + G VF + + D A + Sbjct: 255 VNGKNMFEGKPAKSWGIQPRTAIGQKEDGTIVFAVIDGRQPGHSIGASMNDMAELLAER- 313 Query: 216 NVEQLLYLDGTISHMYMKGG 235 V + +DG S M + G Sbjct: 314 GVVTAMAMDGGSSSMMLHNG 333 >UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CAS6_ACAM1 Length = 279 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. 
Identities = 39/231 (16%), Positives = 70/231 (30%), Gaps = 51/231 (22%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 LS T+ V +P V++ + L + + +NGG +D Sbjct: 28 QELSQATVHVLRIPNHP-RYTVRL-------DVVDGLQTVADFAQGTPKPVAVINGGYFD 79 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV-GIVRLDAFKTSKEIQF 154 + YI G Q LA N + V K+ ++ + + Sbjct: 80 PANQLTTSYIRRGGQI----LADPTQNSRLVDNPDLKVYLPKILNRSEFRQYQCGAKTTY 135 Query: 155 AVQS-----------------GPMLM-------------ENGVINPRIHPNVASSKIRNG 184 A+ S GP L+ +G + R R+ Sbjct: 136 AITSYNQPIPPDCTLNYALGAGPQLLPQLTSQAEGFTDSVDGQVI-RDAIGSRQPNARSA 194 Query: 185 VGINKHGNAVFLL------SQQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 VGI G +++L ++ + + A + + + L LDG S Sbjct: 195 VGITDKGEVIWVLVEQQSATKPGLSLPELADFMEQQ-GAASALNLDGGSSS 244 >UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YXN3_9CYAN Length = 775 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 59/184 (32%), Gaps = 41/184 (22%) Query: 89 MNGGI--YDES----YAPLGL-----YIENGQQKVALNLASGEGNFFIRPGGVF------ 131 + GI Y Y PL L +EN Q + + I G Sbjct: 571 VKAGIARYTPDWGKSYTPLTLNEVIITVENNQLSRQIESNDDQTPIEIPQNGYLLTFRSF 630 Query: 132 ------YVAGDKVGIV---RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------ 176 + G K+ I F I A GP+L++ G I Sbjct: 631 RSALSAFPLGGKIAITAKTTPSEFNQYPHILGA---GPLLLQQGQIVVDAEAEGFNIWFA 687 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMY 231 IR+G+G+ +G+ + + + + A + +L + L LDG S Sbjct: 688 KQRAIRSGIGVTANGDLLIVTVHNRVGGPGPDLTELAQLIQ-QLGAVEGLNLDGGSSTSL 746 Query: 232 MKGG 235 + GG Sbjct: 747 ILGG 750 >UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6I7_THEAS Length = 486 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. 
Identities = 36/155 (23%), Positives = 55/155 (35%), Gaps = 21/155 (13%) Query: 91 GGIYDESYAPL-GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA------GDKVGIVRL 143 GG Y L L +++G + A F + G A GD + +VR Sbjct: 308 GGAYRPGNQALLSLSVKDGIVQDEPQGAD----FTLLANGRAAEALGSLNIGDTLQLVRR 363 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS----SKIRNGVGINKHGNAVFLLS- 198 AF + + +Q GPM++EN R S R VGI++ G ++ Sbjct: 364 FAFPAFEACRLVIQGGPMIVENRRYVNRSEGLSRSIRERRHPRTLVGIDEQGLVFMVIDG 423 Query: 199 ----QQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 + A A + + L LDG S Sbjct: 424 RNGHSSGVTLEEAANLALEE-GLVAALNLDGGGSS 457 >UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LY43_ACIFD Length = 397 Score = 43.7 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 60/174 (34%), Gaps = 28/174 (16%) Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 G W + + + + + A N G + G Y G+ V L +G + Sbjct: 182 PPGGGPWPYMAPITNPVAAD--LVAAFNSGFRMQDAN--GGYYAYGRTAVPL--RNGAAS 235 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI- 181 F I GV + K I Q+ L+ NG INP N + I Sbjct: 236 FVISTSGVPTIE------TWTHGNHVPKGIAVVRQNLIPLISNGRINP--LVNSTNFAIW 287 Query: 182 -----------RNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+GVGI ++G V++ + + A A++ + LD Sbjct: 288 GATVGNQLLVWRSGVGITRNGALVYV-TGPGLSVASLARL-LARVGAVNAMELD 339 >UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C16068 Length = 613 Score = 43.4 bits (101), Expect = 0.008, Method: Composition-based stats. 
Identities = 20/111 (18%), Positives = 35/111 (31%), Gaps = 15/111 (13%) Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAV 194 F T I A GP+L++N I + + +R+ + + N + Sbjct: 490 TTPGEFNTYPHIIGA---GPLLIQNQRIVVDAKAEKFSQAFIKERAVRSAICTTNNDNLI 546 Query: 195 FLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A + K+ L LDG S GG + + Sbjct: 547 LAAVNNRVGGWGPTLEEHAQLMQ-KIGCTNALNLDGGSSTSLYLGGQLLDR 596 >UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LNU2_DESBD Length = 276 Score = 43.4 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 84/248 (33%), Gaps = 41/248 (16%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN--- 65 + + T + +A + + A L + + Q + + + + ++ Sbjct: 7 RAVFTCLVLCAPVASLHAEEWRLLAPGLELREFLIPDQVGDLEGRQSGMAVLRIDSDRFD 66 Query: 66 ---GEAWGT--LHALLADINSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLAS 118 G A GT + ++ G V + +N G++ G Y+ + + Sbjct: 67 VALGSALGTGRMRSMQEWARHSGFVAV-INAGMFRADDRMRSTG-YMRDAAVMI------ 118 Query: 119 GEGNFFIRPG-GVFYVAGDK-VGIVRLDAFKTSKEIQF----AVQSGPMLMENGVINPRI 172 N FI P G F + + L + + A G +++N + R Sbjct: 119 ---NSFIHPNYGAFLAFQPRDPSLPALRWVDRKSDPDWQAVLADYDG--IIQNYRLISRE 173 Query: 173 HPN----VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS 228 N + +++ G +F+ + + ++FA L LD I Sbjct: 174 RENLWEPSDRRHSGAAIAMDREGRLLFIHCRARLSLHEFAQALID-------LPLD-LIG 225 Query: 229 HMYMKGGA 236 MY++GGA Sbjct: 226 AMYVEGGA 233 >UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L610_BACV8 Length = 287 Score = 43.4 bits (101), Expect = 0.008, Method: Composition-based stats. 
Identities = 29/179 (16%), Positives = 64/179 (35%), Gaps = 32/179 (17%) Query: 77 ADINSQGQVQMAMNGGIYD--ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 + + Q + A+NG + E ++ Y+ + + +R G ++ Sbjct: 88 SQLAEQSRSSAAINGSYFSIKEGFSTC--YLRKNEAVIDTTTTEER---HLRVNGAVHMV 142 Query: 135 GDKVGIVR--LDAFKTSKEIQ---FAVQSGPMLMENGV------INPRIHPNVASSKIRN 183 + + I+ + K + A SGP+LM++G I+ + R+ Sbjct: 143 DNNIRIIPWNDENEKKGFPLDGDILA--SGPLLMQDGKTCDFTTIDREF---SETRHPRS 197 Query: 184 GVGINKHGNAVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTIS-HMYMKG 234 + + K G+ + + + + A + L L LDG S +++ G Sbjct: 198 AIALTKEGDIMLVAVDGRAEGHADGMSIAELAYLLR-ILKAHCALNLDGGGSTTLWVNG 255 >UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C0E0_BEUC1 Length = 1327 Score = 43.0 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 17/75 (22%), Positives = 27/75 (36%), Gaps = 8/75 (10%) Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TNFYDFACYAKAK 214 L+E+G I V R VG ++ G A F++ + A+ Sbjct: 304 LLEDGEITSATGGYVDVRHPRTAVGFDETGTTAYFVVVDGRQSHSIGMTLPELGR-FLAQ 362 Query: 215 LNVEQLLYLDGTISH 229 L + + LDG S Sbjct: 363 LGADDAINLDGGGSS 377 >UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DHU5_THEEB Length = 575 Score = 43.0 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 42/107 (39%), Gaps = 13/107 (12%) Query: 145 AFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVAS-SKIRNGVGINKHGNAVFLLS 198 AF I A GP+L+E G + + + + + R+ +G G+ V++ + Sbjct: 453 AFNRIPNIVGA---GPLLVEQGRVVLNAALEQFGAGLDAQAAPRSAMGNRSDGSIVWVTT 509 Query: 199 QQATNF--YDFACYAK--AKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 A +A+ +L + + LDG S GG + + Sbjct: 510 HNRIGGMGPTLAEWAQIVHRLGLINAVNLDGGSSTALYLGGVLVDRH 556 Score = 39.9 bits (92), Expect = 0.084, Method: Composition-based stats. 
Identities = 19/95 (20%), Positives = 32/95 (33%), Gaps = 2/95 (2%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P L V +NPQ +++ + + L + ++ + Q Sbjct: 242 PGLRWQQQTVILGTRQFPVDLLIINPQQPGLRLRPLEISPTTLVGLATVP-ELAQRWQAA 300 Query: 87 MAMNGGIYDESY-APLGLYIENGQQKVALNLASGE 120 A+NGG ++ APLG G L G Sbjct: 301 AAINGGFFNRDRQAPLGAIRREGNWLSGPILNRGA 335 >UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8W171_9BACI Length = 750 Score = 43.0 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 11/80 (13%) Query: 159 GPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLLSQQATNFY-------DFA 208 GP+L++NG ++ + + ++ R+G+GI+ GN +F+ + Y FA Sbjct: 290 GPLLVQNGRVDITMSSSASTYSVPNPRSGIGIDAQGNTMFVTVDGRQSGYSQGMTIPQFA 349 Query: 209 CYAKAKLNVEQLLYLDGTIS 228 Y + + + LDG S Sbjct: 350 NYMRDQ-GAVMAINLDGGGS 368 >UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtilis ortholog n=2 Tax=Clostridium RepID=Q97FU6_CLOAB Length = 347 Score = 42.6 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 58/179 (32%), Gaps = 24/179 (13%) Query: 77 ADINSQGQVQMAMNGGIYDE----------SYAPLGLYIENGQQKVALNLASGEGNFFIR 126 ++ + + A+NGG + + P G + NGQ + ++ Sbjct: 148 REMAKRYKAVAAINGGYFKDTSPNKQSGGVGAIPTGFIMSNGQIVYPQDNSNWSEITSEE 207 Query: 127 PGGVFYVAGDK----VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS-SKI 181 + D G D S I+ AV + P L++NG I N S + Sbjct: 208 ENRALTIDKDGNLQVGGTYSPDQLIKSG-IREAVITEPYLIKNGK--NTIQANSVSGTNP 264 Query: 182 RNGVGINKHGNAVFLLSQQATNFYDFAC-----YAKAKLNVEQLLYLDGTIS-HMYMKG 234 R +G + +F++ A KL LDG S MY G Sbjct: 265 RTAIGQRADKSIIFMVIDGRQGVKLGATVGDVQVLMHKLGAVNAACLDGGGSTAMYYNG 323 >UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IEY1_THEAB Length = 535 Score = 42.6 bits (99), Expect = 0.012, Method: Composition-based stats. 
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 20/99 (20%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIR--NGVGINKHGNAVFLLS-QQATN 203 I+ A+ +GP+L+ENG + K+R NG+ ++K + ++ + + Sbjct: 415 DFPFPIKHAIGAGPLLIENGKK----LIDSDEEKLRYGNGLALSKTSRTIIAITKEGKVD 470 Query: 204 F-------------YDFACYAKAKLNVEQLLYLDGTISH 229 F YD A + + LDG S Sbjct: 471 FIVIEGYNDSPGMNYDIATEFLLEKGYFYAMMLDGGGSS 509 >UniRef50_UPI000190570B hypothetical protein RetlG_24562 n=1 Tax=Rhizobium etli GR56 RepID=UPI000190570B Length = 56 Score = 42.6 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 12/25 (48%), Positives = 17/25 (68%) Query: 146 FKTSKEIQFAVQSGPMLMENGVINP 170 K + + +FA QSGPML+ G +NP Sbjct: 1 MKLATKSRFATQSGPMLVIAGNLNP 25 >UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4EC3 Length = 279 Score = 42.6 bits (99), Expect = 0.013, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 66/199 (33%), Gaps = 34/199 (17%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALL-ADINSQGQVQMAMNGGIYDE------- 96 A ++ + + NG+ G L + + ++Q+A+N + Sbjct: 52 GHAVRIDLKAAGIGFLATPGNGDRPGETDGLKTSTFLKRHKLQLAINAAPFGPIHKDEEK 111 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPG--GVFYVAGDKVGIVRLDAFKTSKEIQF 154 +G+ + G+ +PG + ++ I F I+ Sbjct: 112 EQDVVGVQVSGGKLVSPA-----------QPGYPALLLAKDNRARI-AAPPFDLEG-IEN 158 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN--AVFLLSQQATNFYDFACYAK 212 AV ++++ G + S R G++ G + ++ + +F D A A+ Sbjct: 159 AVGGFHIVLKGGEV----LTGDKSIHPRTAAGVSADGKTLVLLVIDGRQKDFSDGATTAE 214 Query: 213 -----AKLNVEQLLYLDGT 226 L + + LDG Sbjct: 215 VGEWLKALGCAEGINLDGG 233 >UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcystis aeruginosa RepID=B0JGJ2_MICAN Length = 607 Score = 42.6 bits (99), Expect = 0.013, Method: Composition-based stats. 
Identities = 20/103 (19%), Positives = 35/103 (33%), Gaps = 21/103 (20%) Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVIN---------PRIHPNVASSKIRNGVGINKH 190 + F +I A GP+L++NG I P AS R+ + +++ Sbjct: 480 VTLPGDFANYPQILGA---GPLLLQNGRIVLDGNAEKFSPAFQNQQAS---RSAIAVSRE 533 Query: 191 GNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTIS 228 G + + + A + + L LDG S Sbjct: 534 GKILLVAIHNRVGGRGATLGELARILLL-MAAKDGLNLDGGSS 575 Score = 41.8 bits (97), Expect = 0.021, Method: Composition-based stats. Identities = 30/159 (18%), Positives = 52/159 (32%), Gaps = 19/159 (11%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P L V ++P+ ++ + AN + L INS+ Sbjct: 275 PGLIWNQKYIQLDQDWFPVTWLEIDPRNPQITIKPITANSTSMRG-TNPLITINSESNAV 333 Query: 87 MAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 +NGG ++ + PLG +G+ L G +A D G +R+D Sbjct: 334 AMINGGFFNRNNQLPLGAIRVDGKWLSGPILNRGA------------IAWDNRGKIRIDR 381 Query: 146 FKTSKEIQFAV-QSGPMLMENGVINPRIHPNVASSKIRN 183 + + A Q P+ +N S R+ Sbjct: 382 LSLEETLITATGQRFPLT----QLNSAFLTAGCSRYTRD 416 >UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WLB3_ACTMD Length = 1118 Score = 42.2 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 21/96 (21%), Positives = 32/96 (33%), Gaps = 13/96 (13%) Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 A + A+ +L+ + + P+ S R VG + G +FLL+ Sbjct: 283 STRAGDGGSAPRAAIGGNQVLLRDSEVVA---PDDPS-HPRTAVGFSADGRRMFLLTVDG 338 Query: 202 --------TNFYDFACYAKAKLNVEQLLYLDGTISH 229 N D A + L L LDG S Sbjct: 339 RQSAHLLGLNLKDVAEALRD-LGAHNALNLDGGGSS 373 >UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NGC8_GLOVI Length = 540 Score = 41.8 bits (97), Expect = 0.020, Method: Composition-based stats. 
Identities = 24/136 (17%), Positives = 49/136 (36%), Gaps = 21/136 (15%) Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ---------SGPMLMENGVIN-----P 170 + P G+ VA + L A + VQ +GP+L++N + Sbjct: 388 LPPDGLLLVARSEPLRTALRAVAAGTPVVLDVQPSGSGSLLGAGPLLVQNDKLVLDAQGE 447 Query: 171 RIHPNVASSKI-RNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLD 224 R P+V + + R + + + ++ + + +A + + L LD Sbjct: 448 RFRPDVRAPGVARTAIA-RRGSLGILAVAARNGWAAGLSLESWANLLLQQFQADDALNLD 506 Query: 225 GTISHMYMKGGAIPWQ 240 G S + GG + + Sbjct: 507 GGGSSGFYLGGRLRDR 522 >UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XK15_SYNP2 Length = 595 Score = 41.8 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 32/103 (31%), Gaps = 19/103 (18%) Query: 159 GPMLMENGVIN---------PRIHPNVASSKIRNGVGINKHGNAVFLL------SQQATN 203 GP+L++NG + + AS R+ + + + + Sbjct: 486 GPLLLKNGQVVLNGQAEQFSTAFNIQSAS---RSAIARTRDNKILLVTLHGAAEETAGAT 542 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 ++A + +L L LDG S G + + Sbjct: 543 LNEWANILR-RLGATDALNLDGGGSSALALGANLSDRHPTTAG 584 Score = 41.4 bits (96), Expect = 0.027, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 36/89 (40%), Gaps = 3/89 (3%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 LP ++ A S + V ++P ++++ + L LL ++ + Sbjct: 256 LPGVQWRQENFAASSGPVRVTWLEIDPTQRQLQLKPITPDNNTIVGLAPLLIQADTNQAI 315 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVA 113 A+N G ++ + PLG ++ + + Sbjct: 316 A-AINAGFFNRNNQYPLG-IVQGNRALRS 342 >UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoanaerobacterales RepID=Q8RCE6_THETN Length = 815 Score = 41.4 bits (96), Expect = 0.026, Method: Composition-based stats. 
Identities = 21/94 (22%), Positives = 35/94 (37%), Gaps = 10/94 (10%) Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 I F+ +I+ AV G +L++ G I P + R +G K V +++ Sbjct: 280 ITTNPPFE---DIKMAVSGGTILVKGGKIYP-FTHEIKGYAARTAIGYTKDKRYVLMVTV 335 Query: 200 QATNF-----YDFACYAKAKLNVEQLLYLDGTIS 228 + + A + L L LDG S Sbjct: 336 DGPPYRGMTQEELASLMLS-LGAYDALNLDGGGS 368 >UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D0A3_PAESJ Length = 349 Score = 41.4 bits (96), Expect = 0.028, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 62/203 (30%), Gaps = 35/203 (17%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG-GIYDESY-----APLGLY 104 NP ++ +GE + + A+N G D A G+ Sbjct: 122 NPNRVKLVSSKLSDHGEQIFVIAKRAKALA-------AINASGFVDLDGHGNGGASTGVV 174 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ--SG--P 160 IE+G K F K G++ + ++ + VQ +G P Sbjct: 175 IEDGVIKSQNKNTKE-----------FVAGITKDGVMITGKYSANELVNLGVQYAAGFKP 223 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY--AKAKL--- 215 L+ NG R +G G+ +F++ A + L Sbjct: 224 QLIVNGQKMVE-GDGGWGWGPRTAIGQKADGSIIFVVIDGRQTRSVGASIKEVQDLLYER 282 Query: 216 NVEQLLYLD-GTISHMYMKGGAI 237 + +D G+ S MY G I Sbjct: 283 GAVNAMCMDGGSSSSMYFNGDNI 305 >UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I064_CLOCE Length = 383 Score = 41.1 bits (95), Expect = 0.042, Method: Composition-based stats. 
Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 26/155 (16%) Query: 103 LYIENGQQKV------ALNLASG------EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 L ++N + +N+ G G+ P + GDKV I + Sbjct: 208 LIVDNNKIISIIESTKEVNIKKGMYVISFYGDKSSLPDKIGLKTGDKVNIRIEPYLGYNY 267 Query: 151 EIQFAVQSGPMLMENGV-INPRIHP---NVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 + A + G ML++NG + P + + R +GI +G V +++ Y Sbjct: 268 Q---AYECGSMLVKNGKSVVPERDKWAGTLGNRDPRTVIGIKTNGKIVLVVADGRQPGYS 324 Query: 207 FACY------AKAKLNVEQLLYLD-GTISHMYMKG 234 K+ V LD G S M + G Sbjct: 325 EGMTGKEMGEFLVKIGVRDAAMLDGGATSQMIING 359 >UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LVE9_BACOV Length = 332 Score = 40.7 bits (94), Expect = 0.051, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 27/85 (31%), Gaps = 15/85 (17%) Query: 159 GPMLMENGVINPRIHP------NVASSKIRNGVGINKHGNAVFLLSQQ--------ATNF 204 GP+L+ +G I ++ R+ +GI + + + Sbjct: 220 GPVLLLDGNIKNTYEEEILSDIGATVNRPRSAIGITNDKKMILFVCEGDGMTTGVAGMTT 279 Query: 205 YDFACYAKAKLNVEQLLYLDGTISH 229 + A K L + LDG S Sbjct: 280 ENVANIMK-TLGCTDAINLDGGGSS 303 >UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FD37_SACEN Length = 519 Score = 40.3 bits (93), Expect = 0.056, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 30/91 (32%), Gaps = 21/91 (23%) Query: 159 GPMLMENGV----------IN--PRIHPN-VASSKIRNGVGINKHGNAVFLLSQQATN-- 203 GP L+ +G I+ P R+ +G++ G + ++ Sbjct: 399 GPELVRDGQVRINLQEDGIIHDAPSFAYTWGLKRNPRSVIGVDAQGRVILATTEGRMPGF 458 Query: 204 -----FYDFACYAKAKLNVEQLLYLDGTISH 229 + A + +A L + LDG S Sbjct: 459 SDGWGLPEAAEFVRA-LGAVDAMALDGGGSA 488 >UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N4C8_SYNP6 Length = 605 Score = 40.3 bits (93), Expect = 0.070, Method: Composition-based stats. 
Identities = 21/98 (21%), Positives = 38/98 (38%), Gaps = 15/98 (15%) Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVI--NPRIHPNVASSKI----RNGVGINKHGNAVF 195 + F A+ GP+L+++G + NP+ + +I R+ +G+ G V Sbjct: 475 QPSQFDRFPH---ALGGGPLLVKSGRVVVNPQAEGFSRAFEIEAAPRSAIGLMPDGRLVL 531 Query: 196 LLSQ-----QATNFYDFACYAKAKLNVEQLLYLDGTIS 228 + + Q A + +L V L DG S Sbjct: 532 VAAHEQNQGQGPTLPQMAAIMQ-QLGVVDALNFDGGSS 568 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobact... 282 7e-75 UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobi... 268 1e-70 UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylo... 241 1e-62 UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobac... 239 9e-62 UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepI... 233 4e-60 UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteob... 224 3e-57 UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptuniu... 223 5e-57 UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bactero... 222 1e-56 UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomi... 220 4e-56 UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acineto... 217 3e-55 UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacter... 217 3e-55 UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea... 217 3e-55 UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydro... 214 2e-54 UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcu... 214 2e-54 UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmat... 214 2e-54 UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteri... 211 2e-53 UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodob... 211 3e-53 UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter c... 
209 5e-53 UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoc... 209 6e-53 UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebalde... 209 6e-53 UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax... 209 7e-53 UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Auranti... 202 7e-51 UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelo... 201 3e-50 UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitino... 197 2e-49 UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythro... 197 3e-49 UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychro... 193 5e-48 UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetoba... 191 2e-47 UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC... 189 6e-47 UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=... 180 5e-44 UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legione... 165 1e-39 UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legione... 164 3e-39 UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucell... 160 3e-38 UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella b... 153 5e-36 UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseifl... 142 2e-32 UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostr... 134 2e-30 UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=B... 133 7e-30 UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinom... 133 7e-30 UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-la... 132 1e-29 UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-la... 132 2e-29 UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C... 130 4e-29 UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bactero... 130 5e-29 UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=P... 130 6e-29 UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY... 
130 6e-29 UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-la... 129 7e-29 UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lacto... 129 9e-29 UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtili... 127 3e-28 UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostri... 127 3e-28 UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpeto... 126 6e-28 UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactoba... 123 5e-27 UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-la... 123 6e-27 UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiob... 122 9e-27 UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chlorof... 122 9e-27 UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 ... 122 1e-26 UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related t... 122 1e-26 UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonel... 120 3e-26 UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochr... 119 7e-26 UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=B... 119 1e-25 UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victiva... 118 1e-25 UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Trepone... 118 1e-25 UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=F... 117 2e-25 UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillon... 117 4e-25 UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 117 4e-25 UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoni... 116 7e-25 UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Dein... 113 4e-24 UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related t... 112 9e-24 UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Trepone... 112 9e-24 UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 T... 111 3e-23 UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing pro... 
109 8e-23 UniRef50_C6J074 Copper amine oxidase domain-containing protein n... 109 9e-23 UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya... 109 1e-22 UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN... 108 2e-22 UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bactero... 106 7e-22 UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc ... 106 7e-22 UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bactero... 105 1e-21 UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteri... 105 2e-21 UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacter... 104 3e-21 UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax... 103 4e-21 UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkerma... 102 1e-20 UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bactero... 102 1e-20 UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=... 102 1e-20 UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bactero... 101 3e-20 UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 100 5e-20 UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmati... 100 9e-20 UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=... 99 2e-19 UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=... 96 9e-19 UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfo... 96 1e-18 UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57... 95 2e-18 UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natr... 95 2e-18 UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobi... 95 2e-18 UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=A... 95 2e-18 UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bactero... 93 8e-18 UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synecho... 93 9e-18 UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alic... 
92 2e-17 UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobaci... 91 4e-17 UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=T... 91 4e-17 UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=S... 90 4e-17 UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryoc... 90 6e-17 UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=A... 90 7e-17 UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanoth... 87 4e-16 UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 ... 87 5e-16 UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellacea... 87 7e-16 UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=... 87 8e-16 UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 86 1e-15 UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chrooco... 85 2e-15 UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=... 85 3e-15 UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=c... 84 4e-15 UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 68... 83 9e-15 UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiob... 82 1e-14 UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillu... 80 4e-14 UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Cop... 77 6e-13 UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomacu... 74 5e-12 Sequences not found previously or not previously below threshold: UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=E... 120 5e-26 UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtili... 114 2e-24 UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfi... 111 2e-23 UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostri... 111 2e-23 UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Helioba... 108 2e-22 UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Breviba... 106 8e-22 UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostr... 
106 1e-21 UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paeniba... 104 2e-21 UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostri... 102 2e-20 UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=... 100 5e-20 UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bactero... 98 2e-19 UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyri... 97 4e-19 UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostri... 97 7e-19 UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobac... 96 8e-19 UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bactero... 95 2e-18 UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bactero... 95 3e-18 UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostri... 94 3e-18 UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bactero... 94 4e-18 UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 94 4e-18 UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=P... 94 5e-18 UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bactero... 94 5e-18 UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter... 92 1e-17 UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothr... 92 2e-17 UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacte... 92 2e-17 UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanoth... 91 3e-17 UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Breviba... 91 3e-17 UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacill... 90 4e-17 UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=S... 90 5e-17 UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria Rep... 90 7e-17 UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobac... 89 1e-16 UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microco... 89 2e-16 UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein y... 
87 4e-16 UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacte... 87 4e-16 UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminoc... 87 4e-16 UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bactero... 87 5e-16 UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevote... 87 5e-16 UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=B... 87 5e-16 UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryante... 87 7e-16 UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya... 86 1e-15 UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax... 86 1e-15 UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanoth... 85 1e-15 UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine... 85 2e-15 UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryoc... 84 3e-15 UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 84 4e-15 UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia... 84 5e-15 UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 84 5e-15 UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 ... 84 6e-15 UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sp... 83 9e-15 UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdema... 83 1e-14 UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=P... 82 1e-14 UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanoth... 82 1e-14 UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellu... 82 1e-14 UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium h... 82 2e-14 UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=No... 82 2e-14 UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya... 82 3e-14 UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax... 81 3e-14 UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desu... 
81 4e-14 UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus seleniti... 80 5e-14 UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 80 5e-14 UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bactero... 80 6e-14 UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoa... 80 9e-14 UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n... 80 9e-14 UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synecho... 80 9e-14 UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Ac... 79 1e-13 UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteri... 79 1e-13 UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=A... 79 2e-13 UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermu... 78 2e-13 UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9... 78 2e-13 UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elon... 78 3e-13 UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synecho... 78 3e-13 UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=No... 78 4e-13 UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillu... 77 5e-13 UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bactero... 77 8e-13 UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q... 76 8e-13 UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=... 76 9e-13 UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiat... 76 1e-12 UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 75 2e-12 UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachl... 75 3e-12 UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcy... 75 3e-12 UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanoth... 74 4e-12 UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacteri... 74 4e-12 UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 
74 5e-12 UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, ... 73 8e-12 UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinis... 73 8e-12 UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=... 73 9e-12 UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacilla... 73 1e-11 UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthros... 72 1e-11 UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alph... 72 1e-11 UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related t... 72 2e-11 UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis R... 72 2e-11 UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related t... 72 2e-11 UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Breviba... 72 2e-11 UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroh... 72 2e-11 UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyog... 72 2e-11 UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alka... 72 2e-11 UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrop... 71 3e-11 UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostri... 71 3e-11 UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cave... 71 4e-11 UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmati... 71 4e-11 UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=C... 71 4e-11 UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=C... 70 7e-11 UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomacu... 70 8e-11 UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales R... 70 9e-11 UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 ... 69 1e-10 UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bactero... 69 2e-10 UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candida... 68 2e-10 UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=C... 
68 3e-10 UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaerom... 68 3e-10 UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingo... 68 3e-10 UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus R... 68 4e-10 UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 67 4e-10 UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A... 67 4e-10 UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfo... 67 4e-10 UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 T... 67 5e-10 UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfo... 67 6e-10 UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=L... 67 6e-10 UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora eryth... 67 7e-10 UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces R... 67 8e-10 UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=... 67 8e-10 UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borre... 67 8e-10 UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermot... 66 9e-10 UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synecho... 66 9e-10 UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_... 66 9e-10 UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mir... 66 1e-09 UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synecho... 66 1e-09 UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Ta... 66 1e-09 UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinom... 65 2e-09 UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opituta... 65 3e-09 UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 ... 64 4e-09 UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanob... 64 4e-09 UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum ... 64 4e-09 UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmoto... 
64 5e-09 UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquet... 64 5e-09 UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfo... 63 8e-09 UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synecho... 63 8e-09 UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfo... 63 9e-09 UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrop... 63 1e-08 UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibac... 63 1e-08 UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervido... 63 1e-08 >UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobacteriaceae RepID=YIGE_ECOLI Length = 254 Score = 282 bits (722), Expect = 7e-75, Method: Composition-based stats. Identities = 254/254 (100%), Positives = 254/254 (100%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY Sbjct: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 Query: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE Sbjct: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK Sbjct: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ Sbjct: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 Query: 241 RYPFVTMISVERKG 254 RYPFVTMISVERKG Sbjct: 241 RYPFVTMISVERKG 254 >UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobiales RepID=B2II06_BEII9 Length = 269 Score = 268 bits (685), Expect = 1e-70, Method: Composition-based stats. 
Identities = 73/245 (29%), Positives = 127/245 (51%), Gaps = 2/245 (0%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T L R+FL L L A A L++ + + + ++++WQ+ G+ +G Sbjct: 17 IFTKLLMRVFLPLFLSAGTAWAEPCLPLTEEGINYVVCRFDTKRSDLRLFWQQPGGQPYG 76 Query: 71 TLHALLADINSQGQVQ-MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 L A + +G+ AMN G++ E +P+GLYI+ G+ N+ +G GNF ++P G Sbjct: 77 GFAPLRAQLQPKGETLEFAMNAGMFQEDLSPVGLYIQEGRLLHPANMRNGPGNFHMKPNG 136 Query: 130 VFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGIN 188 +FY + G++ F ++ + +A QSGP+L+ N ++P+I P S KIRNGVG+ Sbjct: 137 IFYFSQTSAGVMETGRFLQSGLKPDYATQSGPLLVANNQLHPKIEPTGTSEKIRNGVGVR 196 Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + +F +S+ F+ FA + +L+ L+LDG+IS +Y Q P ++ Sbjct: 197 DNHEVIFAISEAPVTFFRFARLFRDRLHCPDALFLDGSISSLYAPSLNRDDQWRPIGPIV 256 Query: 249 SVERK 253 K Sbjct: 257 GAVSK 261 >UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylobacterium extorquens group RepID=A9W4Y6_METEP Length = 258 Score = 241 bits (616), Expect = 1e-62, Method: Composition-based stats. 
Identities = 76/235 (32%), Positives = 123/235 (52%), Gaps = 4/235 (1%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 + + P A A+ TV+ + ERV+++W +G +G+L +L Sbjct: 26 VPVQAQPAPAAKGPCQAVEFEGQPYTVCTVDLRRERVRLFWLGTDGLPYGSLSSL--ADR 83 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 ++ AMN G+YD+ AP+GLY+E+G++ + A+G GNF ++P GVFYV GD+ G+ Sbjct: 84 QGPRLSFAMNAGMYDKGQAPVGLYVEDGRELKGASTANGPGNFHLKPNGVFYVKGDRAGV 143 Query: 141 VRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA-VFLLS 198 + + + FA QSGPML+ +G I+P+I + S KIRNGVG+ G+ VF +S Sbjct: 144 LDTGRYLRAKPAPDFATQSGPMLVIDGKIHPKISADGPSQKIRNGVGVRDGGHVAVFAIS 203 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 ++ F FA K L+LDG++S +Y G P ++ + Sbjct: 204 ERPVTFGAFARLFKDSFGCRNALFLDGSVSSLYAPGLGRSDLSRPLGPLVGAVGR 258 >UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9JX75_AGRVS Length = 274 Score = 239 bits (609), Expect = 9e-62, Method: Composition-based stats. 
Identities = 81/246 (32%), Positives = 131/246 (53%), Gaps = 4/246 (1%) Query: 12 ITLNLKRIFLALTLLPLFAVAAD--DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 I + L I L + P A A + ++ + +P T ++++ + A+G+ + Sbjct: 26 IVVWLFAILSPLVISPERAEAEEQSCRDQTENGFAYRVCRFDPATRTIRIFNRNADGDVY 85 Query: 70 GTLHALLADINSQGQVQ-MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 G AL + + Q + A+NGG+Y +P+GL+++ G + A G GNF+++P Sbjct: 86 GGFEALRSQLWQQRLILTFAVNGGMYHSDLSPVGLFVDYGMTRKTAETADGWGNFYLKPN 145 Query: 129 GVFYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF++ G++ F+T K E FA QSGPML+ +GV++P+ P S KIRNGVGI Sbjct: 146 GVFFLKDGHAGVLETGQFETQKIEADFATQSGPMLVIDGVLHPKFLPTSDSLKIRNGVGI 205 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G VF+LS+ FYD A + + +L LYLDGTIS + + YP + Sbjct: 206 DASGQVVFVLSKDPVRFYDMAAFFRDRLGAANALYLDGTISSLAEPMAGRIDRAYPLGPI 265 Query: 248 ISVERK 253 I+V + Sbjct: 266 IAVVDQ 271 >UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepID=Q98NI9_RHILO Length = 263 Score = 233 bits (595), Expect = 4e-60, Method: Composition-based stats. 
Identities = 72/247 (29%), Positives = 123/247 (49%), Gaps = 3/247 (1%) Query: 10 GMITLNLKRIFLALTLLPLFAVA-ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEA 68 G + L + + + V+ + + V+P+ ++++W+ G+ Sbjct: 15 GAVKAALPQAVASTMAFSQWFVSLPPCRDFAFEATSYLICEVDPKLYSIELFWKDPVGKP 74 Query: 69 WGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 + +LH L A + G+ + A+N G+Y P+GLY+E G++ + SG GNF ++P Sbjct: 75 FQSLHNLDAAQRAAGRTMLFAINAGMYHPDLRPVGLYVERGREMAGVRTGSGSGNFSLQP 134 Query: 128 GGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVG 186 G+FY++G K + F + +A QSGPML+ +G ++P+ + S K R+GVG Sbjct: 135 NGIFYISGGKAAVRATRDFVRKRPSTDYATQSGPMLVIDGQLHPKFQSDGTSRKTRDGVG 194 Query: 187 INKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + K G AVF +S NF+ FA + L + L+LDGTIS ++ + Sbjct: 195 VRKDGVAVFAISNGTVNFHTFARLFRDALGCDNALFLDGTISSLFAPAIGRNDDYWNLGP 254 Query: 247 MISVERK 253 MI V RK Sbjct: 255 MIGVFRK 261 >UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteobacteria RepID=A9CIN9_AGRT5 Length = 254 Score = 224 bits (570), Expect = 3e-57, Method: Composition-based stats. 
Identities = 69/213 (32%), Positives = 114/213 (53%), Gaps = 3/213 (1%) Query: 44 TVQAYTVNPQTERVKMYWQK-ANGEAWGTLHALLADINSQGQV-QMAMNGGIYDESYAPL 101 + +P +++Y Q +G+ + L + + Q AMNGG+Y Y+P+ Sbjct: 39 RYTVCSFDPAKNTIRIYDQDHVSGQGYRNFADLSSALWRQHMFSVFAMNGGMYHSDYSPV 98 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGP 160 GL++ENG ++ ++ G GNF + P GVFY+ G+ G++ +A+ + FA QSGP Sbjct: 99 GLFVENGVERSPVSTRGGWGNFHLLPNGVFYLDGNTAGVLETEAYLAADPKPDFATQSGP 158 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 ML+ +G ++PR P+ S K RNGVG+++ G F +S+ FYDF + L+ Sbjct: 159 MLVIDGKLHPRFLPDSDSLKRRNGVGVSRDGMVHFAISETTVRFYDFGTLFRDVLDAPNA 218 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 LYLDGTIS + + Q + +I+V + Sbjct: 219 LYLDGTISSVDIPAMNRRDQLFSMGPIIAVVDR 251 >UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWY7_HYPNA Length = 249 Score = 223 bits (567), Expect = 5e-57, Method: Composition-based stats. Identities = 66/238 (27%), Positives = 112/238 (47%), Gaps = 5/238 (2%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 A+ S L + + + ++++ + G +G L + Sbjct: 12 GAILSACNEVEEGPCQTRSFENLPYLVCSFDASQDTIRLFLRDETGVPFGQFDRLANHVA 71 Query: 81 -SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 G + AMN G+Y + P+GLYIE G+ ++ L + G GNF + P GVF++ K G Sbjct: 72 SKGGNLVFAMNAGMYHDDRRPVGLYIEEGEAEMNLVRSPGPGNFGMLPNGVFWIDAGKAG 131 Query: 140 IVRLDAFKTSKE---IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVF 195 + AF + +FA QSGPML+ +G ++P ++P+ S + RNGVG+++ G F Sbjct: 132 VSETLAFDERFKETPPRFATQSGPMLVIDGALHPALNPDGTSLRRRNGVGVSEDGRQVYF 191 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 ++S NF+ FA + +L LYLDG +S Y+ ++ V R+ Sbjct: 192 VISDVPVNFHSFARLFRDELGTPNALYLDGAVSKAYVPALERSETGLDMGPIVGVIRE 249 >UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=Q11X50_CYTH3 Length = 244 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. 
Identities = 83/217 (38%), Positives = 124/217 (57%), Gaps = 3/217 (1%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDES 97 T+ V +YTV+PQ + ++ YW+ NGE ++ L A + S+G + A NGG+Y E Sbjct: 25 QQDTIDVISYTVDPQKDNLQFYWKNDNGEILKSIKKLKAYVESKGSTLLFATNGGMYKED 84 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA-GDKVGIVRLDAFKTSKEIQFAV 156 +PLGL+I+NG+ LN A G+GNF+++P GVFY+ ++ I + + F + I+FA Sbjct: 85 RSPLGLFIQNGKTVTPLNKAKGQGNFYMQPNGVFYITNDNEAVICKTEDFINNGNIKFAT 144 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 QSGPM++ N I+P + IRNGVGI + +F +S++ NF+DFA Y + L Sbjct: 145 QSGPMIIVNNQIHPSFIKGSKNLNIRNGVGILPNKKIIFAMSEKEVNFFDFALYFQN-LG 203 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 E LYLDG +S Y+ F MI V K Sbjct: 204 CENALYLDGFVSRSYLLEKKWLQTDGEFGVMIGVTEK 240 >UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LB58_RHOVA Length = 247 Score = 220 bits (560), Expect = 4e-56, Method: Composition-based stats. Identities = 71/239 (29%), Positives = 114/239 (47%), Gaps = 4/239 (1%) Query: 19 IFLALTLLPLFAVAAD--DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL 76 F+A+ + AA + + V+++WQK +G + L AL Sbjct: 6 AFIAMAAFCGSSEAAAQTCKPYAFEGNGYTLCEASLDRFAVRLFWQKPDGGPYTYLSALP 65 Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 G++ A+NGG++ Y P+GL++ENG++ V N G GNF +RP G+FY Sbjct: 66 KTDERGGRLAFALNGGMFHPDYKPVGLHVENGRELVRANTRPGPGNFHLRPNGIFYFGEA 125 Query: 137 KVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 + G++ AF K + FA QSGPML+ +G ++PRI S+K R+GV + + VF Sbjct: 126 EAGVMETGAFLKKKPKANFATQSGPMLVIDGKLHPRIAKANVSAKPRDGVCVRGDKSVVF 185 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 +S F F + L L+LDG + +++ G + MI+V K Sbjct: 186 AISDGGVPFDTFMRLFRDGLKCRNALFLDGGTAPALFVPGTRSGNVLFGLGPMIAVYEK 244 >UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJ31_ACIJU Length = 252 Score = 217 bits (553), Expect = 3e-55, Method: 
Composition-based stats. Identities = 67/234 (28%), Positives = 122/234 (52%), Gaps = 4/234 (1%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN-GEAWGTLHALLADIN 80 + A + ++ + + V+ ++++ + G+ + + +D+ Sbjct: 16 CMVFQATTVFAFEYQSIKFEDVQFEVIKVDDLK-DLQLFLKNPRIGDFYQKFSNIQSDLA 74 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + +++ AMN G+Y ++ P+GLYIE ++ LN ++G GNFF++P GV I Sbjct: 75 ACKELRFAMNAGMYHPNFEPVGLYIEKKKKLSELNESTGFGNFFMQPNGVVVWNDHGAVI 134 Query: 141 VRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 +K + FA QSGPML+ G+IN + + S KIRNGVG+ + F++S+ Sbjct: 135 HSTADYKRANFTANFATQSGPMLVHKGLINSQFIKDSNSLKIRNGVGVRDD-HLYFVISE 193 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 Q NFY FA + K +L V++ LYLDG+IS +Y+K ++Y ++ + + Sbjct: 194 QRINFYQFAKFFKHQLRVDEALYLDGSISSLYLKDIQRNDRKYNLGPIVGLTHQ 247 >UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26CZ6_9BACT Length = 241 Score = 217 bits (552), Expect = 3e-55, Method: Composition-based stats. 
Identities = 68/222 (30%), Positives = 109/222 (49%), Gaps = 6/222 (2%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ-GQVQMAMNGG 92 D + D ++ ++ +++++YW + + T L + Q ++ AMN G Sbjct: 23 QDLIIKDDRFHIKV--IDLTKQKLQLYWLDQDNKPIETFEQLNMHVKQQDKRLVYAMNAG 80 Query: 93 IYDESYAPLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVA-GDKVGIVRLDAFKTSK 150 +Y + ++P GLYIENG L+ + G GNF+++P GVFY+ K + Sbjct: 81 MYLKDHSPQGLYIENGTIHKQLDTVTVGYGNFYLQPNGVFYLTQDGKAQVTATPQLSNFS 140 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 I +A QSGPML+ N I+P + + IRN VGI G + +S++ NFYDFA + Sbjct: 141 NITYATQSGPMLVINDTIHPAFNKGSKNVHIRNAVGILPDGRILLAISKEKINFYDFATF 200 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 K + + LYLDG +S +Y + F MI V Sbjct: 201 FKNQ-GCKNALYLDGFVSRIYDPTINVEQMDGHFGVMIGVSD 241 >UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9D6B9_9RHIZ Length = 286 Score = 217 bits (552), Expect = 3e-55, Method: Composition-based stats. Identities = 66/234 (28%), Positives = 113/234 (48%), Gaps = 6/234 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS---- 81 + T++PQT +++ ++ G+ G++ A++ + + Sbjct: 47 MTKPDWPEGCVEQVFEGARAILCTIDPQTHDMRLVYRDRMGDVLGSVSAVVDQLAAGAGT 106 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-AGDKVGI 140 ++ +AMN G+Y +P+GLY+EN + ALN G GNFF++P GVF+V G+ Sbjct: 107 DHKLVLAMNAGMYHADMSPVGLYVENSVEIAALNRDDGFGNFFLKPNGVFFVLKDGNAGV 166 Query: 141 VRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + DA+ ++A QSGPML+ +GVI+PR P+ S IRNGVG+ G VF +++ Sbjct: 167 LETDAYAEADLSPEYATQSGPMLVIDGVIHPRFLPDGTSKFIRNGVGVRPDGKVVFAITR 226 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + FA + E L+ DG +S + + P + V + Sbjct: 227 DRVSLGSFARLFRDVAGCENALFFDGAVSSLALGSKMEIDSEEPAGPVAVVVAR 280 >UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PYM1_9GAMM Length = 271 Score = 214 bits (546), Expect = 2e-54, Method: Composition-based 
stats. Identities = 69/233 (29%), Positives = 106/233 (45%), Gaps = 15/233 (6%) Query: 34 DDCALSDPTLTVQAYTVNPQTE-RVKMYWQKANGE------AWGTLHALLADINSQGQVQ 86 DC ++ + ++WQ + + TL L + Sbjct: 38 PDCQRKSQPFDYSICELDAKNAANFSLHWQNPSSASHPLLLTFTTLRDYLVSEQPAKTLL 97 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 AMN G+YD ++AP+G + NG+Q ALNL G GNF + P GVF+ I + Sbjct: 98 FAMNAGMYDSNFAPIGYTVINGKQIRALNLKQGGGNFHLMPNGVFWQDRQGFYITESQSM 157 Query: 147 KTS----KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG--NAVFLLSQQ 200 + FA QSGPML+ +G I+P N S K RNG+G+ H F++S Sbjct: 158 AKKLASGAKPTFATQSGPMLVIDGNIHPAFDANSTSRKYRNGIGVCGHNPSRVKFVISDT 217 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRYPFVTMISVER 252 +FY+FA K++L + L+LDG S +Y + + +Y MI+V + Sbjct: 218 PVSFYEFADLFKSQLGCDNALFLDGGSASALYSQTLSRNDNKY-MGVMIAVTQ 269 >UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcus RepID=Q1IX28_DEIGD Length = 317 Score = 214 bits (546), Expect = 2e-54, Method: Composition-based stats. Identities = 79/245 (32%), Positives = 126/245 (51%), Gaps = 11/245 (4%) Query: 15 NLKRIFLALTLLPLFAV-AADDCALSDPTLTVQAYTV---NPQTERVKMYWQKAN-GEAW 69 N+ RIF+ LLPL A A + T YTV + + + ++++W+ G+ + Sbjct: 77 NVLRIFV--LLLPLTACSQAGGLDVRRVTAEGMLYTVAAVDLKRDHLRLHWKNPATGQPY 134 Query: 70 GTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 T + A + G QV A N GIY PLGL++E G+ + LN A GNF + P Sbjct: 135 RTFAEVSARLRKDGEQVLFATNSGIYGPGLEPLGLHVEEGRTLIGLNNARSGGNFALLPN 194 Query: 129 GVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF+V G++ G+ A++ + + FA QSGP+L++ G ++P + +S K+R+GVG+ Sbjct: 195 GVFWVKGNQAGVTETQAYRRLNIQPTFATQSGPLLVQGGRLHPAFNKGSSSFKVRSGVGV 254 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G F +S NF+ FA + + L LYLDG+IS Q F + Sbjct: 255 CRDGRVRFAVSAGPVNFHSFAVFFRDVLGCPDALYLDGSISAYATPDADT--QVADFAGI 312 Query: 248 ISVER 252 ++ R Sbjct: 313 WTISR 317 >UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 
RepID=Q093S1_STIAU Length = 278 Score = 214 bits (545), Expect = 2e-54, Method: Composition-based stats. Identities = 76/264 (28%), Positives = 124/264 (46%), Gaps = 23/264 (8%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPT------------LTVQAYTVNP 52 LLIG G+ T A LL A +L PT T Y V+ Sbjct: 19 LLIGSGLGT-------GATHLLAAPHTPAATRSLQTPTGRVAARRIAYRGNTYDTYEVDL 71 Query: 53 QTERVKMYWQKANGEAWGTLHALLADIN-SQGQVQMAMNGGIYDESYAPLGLYIENGQQK 111 +++ Y+Q+ +G + +L L + ++ A N G++ + P+GLY+E+G++ Sbjct: 72 TQSKLRFYFQQPDGTPFSSLGNLRGWLQGRGKRLVFATNAGMFTPARRPVGLYVEDGREF 131 Query: 112 VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--EIQFAVQSGPMLMENGVIN 169 V LN GNFF++P VF+V GI+ A+ ++ +A QSGP L+ +G ++ Sbjct: 132 VGLNTQEEAGNFFLKPNAVFFVTETGAGILESSAYAAHPPAKVLYATQSGPALLLHGQMH 191 Query: 170 PRIHPNVASSKIR-NGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS 228 P + R +GVGI VF ++QQA N ++FA + + + + LYLDG +S Sbjct: 192 PAFREGSRNLSPRRSGVGIVTPTRVVFAMTQQAVNLHEFASFFRDQFGCQDALYLDGVVS 251 Query: 229 HMYMKGGAIPWQRYPFVTMISVER 252 MY+ F MI++ Sbjct: 252 RMYLPALGRDELDGDFGAMIAISE 275 >UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteria RepID=C5CWT4_VARPS Length = 238 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. 
Identities = 67/210 (31%), Positives = 118/210 (56%), Gaps = 4/210 (1%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLG 102 ++ + ER++++ +G + L A + ++ + + AMN G+Y ++P+G Sbjct: 27 RYTVVKIDVRRERLELFLHDDSGAPFKRFDRLEAWLAARNRQLVFAMNAGMYHADFSPVG 86 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKE--IQFAVQSGP 160 L ++ G+++ LNLA+G GNFF++P GVF V+ +V + + ++ A QSGP Sbjct: 87 LLVQEGREEAPLNLAAGAGNFFLKPNGVFLVSDAGPRVVESSEYAALPKEGVRLATQSGP 146 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 +L+ GV++P P+ S KIRNGVG++ H A+F++S+Q NFY+FA Y + L+ Sbjct: 147 LLLRRGVVHPAFIPDSDSRKIRNGVGVSGH-TAIFVISEQPVNFYEFALYFRDVLHCRDA 205 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 LYLDGT+S ++ ++ V Sbjct: 206 LYLDGTVSALHSLALRRSDFTRELGPILGV 235 >UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodobacterales RepID=B9KP42_RHOSK Length = 245 Score = 211 bits (536), Expect = 3e-53, Method: Composition-based stats. Identities = 69/241 (28%), Positives = 116/241 (48%), Gaps = 6/241 (2%) Query: 17 KRIFLALTLLPLF--AVAADDCALSDPTLTVQAYTVNPQT--ERVKMYWQKANGEAWGTL 72 R LA L L+ A A + A D T Y++ + ++++ +G +G+ Sbjct: 1 MRTRLAAILFALWPAACATAEPACRDLTFEGTRYSLCEAQAGDDIRIFQTAPDGRPYGSF 60 Query: 73 HALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 + + ++ +G+ + AMN G+Y P+GL IE ++ L ++G GNF + P GVF Sbjct: 61 ERINSALDGEGRQLAFAMNAGMYHADRRPVGLLIEEEVERAPLVTSAGPGNFGLLPNGVF 120 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 V I + A QSGPML+ G ++PR + S IRNGVG++ G Sbjct: 121 CVGDGFRVIESRSFAAERPACRHASQSGPMLVIGGELHPRFLVHSDSRYIRNGVGVSADG 180 Query: 192 -NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 AVF +S + F++F + +L + + LY DG+IS +Y +G P ++ + Sbjct: 181 RRAVFAISNRPVTFHEFGRLFRDELGLPEALYFDGSISRLYDRGARRSDWGTPMGPIVGL 240 Query: 251 E 251 Sbjct: 241 V 241 >UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter cryohalolentis K5 RepID=Q1QCK8_PSYCK Length = 276 Score = 209 bits (533), Expect = 5e-53, Method: 
Composition-based stats. Identities = 69/237 (29%), Positives = 117/237 (49%), Gaps = 9/237 (3%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANG-EAWGTLHALLADINSQ 82 T ++ + + + T +Q+ + + + ++WQ+++ + T LL+ + Sbjct: 38 TASTDWSCQSHNTPFAYSTCHIQSDLLTNKRYSLALFWQQSDSRQPLLTFDNLLSTLPPS 97 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIV 141 ++ AMN G+Y+E+YAP+G + ++ ALNL G GNF + P GV + KV I Sbjct: 98 QSLKFAMNAGMYNENYAPIGYTVIKSEEIRALNLKEGGGNFHLLPNGVLWWDKSGKVQIT 157 Query: 142 RLDAFKTSKE-----IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 +A + +A QSGPML+ N I+P+ P+ S+KIRNG+G+ G+ F+ Sbjct: 158 ESNALAEQLKNGIAQPLYATQSGPMLVINDAIHPQFDPDGTSAKIRNGIGVCSDGSLQFV 217 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRYPFVTMISVER 252 S+ FY FA K +L L+LDG I S +Y ++ MI + Sbjct: 218 NSEAPVAFYQFASLFKNELKCPNALFLDGGIASALYAPTIDKHDKK-EMGVMIGLVE 273 >UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B5U0_PARDP Length = 251 Score = 209 bits (533), Expect = 6e-53, Method: Composition-based stats. 
Identities = 71/250 (28%), Positives = 113/250 (45%), Gaps = 8/250 (3%) Query: 12 ITLNLKR----IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNP-QTERVKMYWQKANG 66 + ++LKR F AL + L A+A T+ Q ++++ +G Sbjct: 1 MKIDLKRRLGLAFGALIAMTLPALAGICEKRDFDGQGYVICTLTAGQEPGLRLWLNGPDG 60 Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIR 126 G A+ + + AMN G+Y + P+GLY+ +G + L A G GNF + Sbjct: 61 RTLGDFTAVRRTLAQGESLGFAMNAGMYHPDFTPVGLYVSDGVSQHDLVTAGGGGNFGML 120 Query: 127 PGGVFYVAGDKVG-IVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNG 184 P GVF G + ++ AF K + + + A QSGPML+ +G ++PR + S IRNG Sbjct: 121 PNGVFCAGGARPYQVIESRAFAKAAPDCRLATQSGPMLVIDGALHPRFLVDSDSRYIRNG 180 Query: 185 VGINKHGNA-VFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 VG++ G F +S +A F+ F + L LY DG+IS +Y G Sbjct: 181 VGVSPDGQTAWFAISDRAVTFHQFGRLFRDGLGARDALYFDGSISRLYAPGLGRADFGRR 240 Query: 244 FVTMISVERK 253 +I + Sbjct: 241 LGPIIGYVGQ 250 >UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AL67_SEBTE Length = 266 Score = 209 bits (532), Expect = 6e-53, Method: Composition-based stats. 
Identities = 95/217 (43%), Positives = 133/217 (61%), Gaps = 4/217 (1%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + D T Y + E +KMYW+ N +A+ L + + N+ ++ A NGGIY E Sbjct: 52 IEDRGFT--VYKPDLNKEIIKMYWKDENNKAYSELSKFIQE-NTGNKINFATNGGIYSEE 108 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 Y P GLYIEN + +NLA GEGNF+++P GVFY+ ++ I AF+ ++ I +A Q Sbjct: 109 YEPNGLYIENHKIISKINLADGEGNFYMQPNGVFYIQNNQPKISESKAFEYNENISYATQ 168 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 SGP+L+ENGVIN +I N S KIR+ VGI++ FL+S + NFYDF+ YA KLN Sbjct: 169 SGPLLIENGVINKKIGKNSESFKIRSAVGIDRENKVFFLMSSEKINFYDFSKYALDKLNC 228 Query: 218 EQLLYLDGTISHMYM-KGGAIPWQRYPFVTMISVERK 253 + LL+LDG IS MY IP Q YPF +I+ E++ Sbjct: 229 KDLLFLDGAISKMYFADEKKIPEQDYPFAVIITSEKR 265 >UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax=Rhizobiales RepID=Q1MEZ5_RHIL3 Length = 258 Score = 209 bits (532), Expect = 7e-53, Method: Composition-based stats. Identities = 63/255 (24%), Positives = 110/255 (43%), Gaps = 14/255 (5%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLT---VQAYTVNPQTERVKMYWQKANGEA 68 + ++ + LT A A + T+ P ++++W+ A+G Sbjct: 4 LKHSVLAAAIMLTATMTSLDQAHAQACEQESFEEAKYVVCTLEPGKADLRLFWKNADGAP 63 Query: 69 WGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG------ 121 + +L + ++G+ + A+N G+Y ++P+GLY+ENG++ N E Sbjct: 64 YRAFSSLAEAVRAEGRTLAFAVNAGMYRADFSPMGLYVENGRELNPANTTEAESSSGQVP 123 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 NF+ +P GVF++ GI+ D F K + +FA QSGPML+ +NP Sbjct: 124 NFYKKPNGVFFLGETGAGILPTDEFLKRRPKARFATQSGPMLVIANKLNPIFIVGSTDRT 183 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPW 239 R+GVG + G F +S+ NF+DFA + L L+LDG +Y Sbjct: 184 RRSGVGTCERGAVRFAISEDRVNFHDFARLFRDHLKCPDALFLDGGRGVGLYNPDMGHND 243 Query: 240 --QRYPFVTMISVER 252 + + + Sbjct: 244 WSWHGGYGPIFGLVE 258 >UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Aurantimonadaceae RepID=Q0G184_9RHIZ Length = 268 
Score = 202 bits (515), Expect = 7e-51, Method: Composition-based stats. Identities = 68/229 (29%), Positives = 115/229 (50%), Gaps = 5/229 (2%) Query: 28 LFAVAADDCALSDPT-LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 L A C ++ + V + + + G + T A + G+V Sbjct: 41 LPAGHEGICRIAMAGSVETILCEVPLSSFDLHLRALDDAGRPYETFEKAAASL--SGEVV 98 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +AMN G+Y E P+GL +++G+ L +G GNF +RP G+FY+ + + + + Sbjct: 99 LAMNAGMYHEDRRPVGLTVQDGRIVKKAVLGTGSGNFSLRPNGIFYLEDGRAFVRETERY 158 Query: 147 -KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL-LSQQATNF 204 S + A QSGPML+ G ++PR P S +RNGVG+++ G VFL L+++ NF Sbjct: 159 LGESHDPVLATQSGPMLLIGGKVHPRFIPTSDSLYVRNGVGVSEDGRTVFLALTRKPINF 218 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 YDFA + + + V+ L+ DG +S + + I ++R M+ V +K Sbjct: 219 YDFALFFRDTVGVKDALFFDGQVSSLSYRAANIAYRRDRLGPMLLVTKK 267 >UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EW16_DICNV Length = 263 Score = 201 bits (510), Expect = 3e-50, Method: Composition-based stats. 
Identities = 78/260 (30%), Positives = 126/260 (48%), Gaps = 19/260 (7%) Query: 12 ITLNLKRIFLALTLLP-LFAVAADDCALSDP------TLTVQAY---TVNPQTERVKMYW 61 + + L++I + + L L AA Q+ P+ ++++ W Sbjct: 1 MLVALRKIIVPVILSSFLLETAAAHLDFKKVAGGNFARFHHQSVDYAVFMPEHDKIRFLW 60 Query: 62 QKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Q GE + T+H L + ++G QV MN GI++++ P GL+IE LN SG+ Sbjct: 61 QNDRGENYQTMHHALRALTNEGYQVHFLMNAGIFNQNAQPAGLWIEKKALLRPLNRRSGK 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRL-DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 GNF I+P GVFY+ +K I+ + +AVQSGP+L+ +G IN R+ N ++ Sbjct: 121 GNFHIQPNGVFYLTQEKAHIITTVQWHNNPPKADYAVQSGPLLIIDGAINSRLPKNHKAA 180 Query: 180 KIRNGVGINKHGNAVFLLS----QQA--TNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 RN V ++K F+++ A N Y FA A + +Q LYLDG++S Y+ Sbjct: 181 YKRNAVCVDKARRVYFVITTRYDDGAHFPNLYRFAH-ALQTIGCQQALYLDGSLSDFYLP 239 Query: 234 GGAIPWQRYPFVTMISVERK 253 + + F MI+V K Sbjct: 240 MESSRFHWQKFAGMIAVVSK 259 >UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PF78_CHIPD Length = 273 Score = 197 bits (502), Expect = 2e-49, Method: Composition-based stats. 
Identities = 71/228 (31%), Positives = 119/228 (52%), Gaps = 8/228 (3%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE-AWGTLHALLADI-NSQGQVQMAMNG 91 + + A VNP + ++W A+ + + ++ AL + + + M NG Sbjct: 46 GEITFTHNGQQYDAIVVNPAVSDISLHWLSADQQTPYKSIQALQDVLLEKKKDILMITNG 105 Query: 92 GIYDESYAPLGLYIENGQQKVALNLA-SGEGNFFIRPGGVFYVAGDKVGIVRLDAF---- 146 G++ ++ P+GL+I G++ ++ A GNF+++P GVFY+ + + Sbjct: 106 GMFMKNNIPVGLFISQGRELRPIDAATDQPGNFYMQPNGVFYLDHTGPHVSTTTDYLKRS 165 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFY 205 + +I A QSGPML+ G+IN + +P + +R+GVGI +GN VF++S++A T FY Sbjct: 166 RAHSKIVAATQSGPMLVSKGIINAKFNPGSVNRNLRSGVGILSNGNVVFIISKEAQTTFY 225 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 DFA KA+ + LYLDG IS MY+K F MI+V + Sbjct: 226 DFASIFKARFGCKDALYLDGAISKMYLKNSRPGDLNGDFGAMIAVTAR 273 >UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythrobacter RepID=Q2NAA1_ERYLH Length = 277 Score = 197 bits (500), Expect = 3e-49, Method: Composition-based stats. Identities = 67/226 (29%), Positives = 104/226 (46%), Gaps = 11/226 (4%) Query: 33 ADDCALSDPTLTVQA---YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM 89 A + A T P R+ G + L A+ Sbjct: 58 AAESACERLTFQEVVLTHCVAVPAKHRITTVL----GPPHRSFAKLAEG--RSSAPVFAV 111 Query: 90 NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF--K 147 N G++D P+G Y+E+ ++ ALN G GNF ++P GVFY + + + ++F Sbjct: 112 NAGMFDGDGKPIGYYVEDSERLQALNTNDGAGNFHLKPNGVFYGSNGEWRVRTTESFLAN 171 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDF 207 S QF QSGPML+ +G ++P I + S +IRNGVG+++ G A F++S+ +F F Sbjct: 172 VSDRPQFGTQSGPMLLIDGKLHPEISEDGPSRQIRNGVGVDRQGRAHFVISEGPISFGKF 231 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 A + + N LYLDG +S ++ R P MI VE + Sbjct: 232 ARFFRDVANTPNALYLDGNVSGLWDPANDRMDARAPIGPMIVVETR 277 >UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WGQ7_PSYWF Length = 309 Score = 193 bits (491), Expect = 5e-48, Method: Composition-based stats. 
Identities = 61/207 (29%), Positives = 101/207 (48%), Gaps = 8/207 (3%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + + Q + E L+ D+ +++ A N G+YD ++AP+G + G+Q + Sbjct: 95 NQPQAAIVDQDKSHEPLYKFDTLIKDLPKDSELKFAANAGMYDGNFAPIGYTVIQGRQIL 154 Query: 113 ALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKT-----SKEIQFAVQSGPMLMENG 166 +LNL G GNF + P GV + + V I + +A QSGPML+ +G Sbjct: 155 SLNLKQGGGNFHLLPNGVLWWDKANHVHITESTQLDAMLKSGEAKPWYATQSGPMLVIDG 214 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT 226 I+P+ + + S KIRNGVG+ F+ S++ NFY FA + K L+ + L+LDG Sbjct: 215 HIHPKFNSDSTSKKIRNGVGVCDGSQIHFVTSREPVNFYQFARFFKEDLHCDNALFLDGG 274 Query: 227 -ISHMYMKGGAIPWQRYPFVTMISVER 252 S +Y A ++ M+ + Sbjct: 275 VASALYAPDVAAQEEKN-MGVMVGLIE 300 >UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetobacter RepID=B2HYZ5_ACIBC Length = 204 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 61/205 (29%), Positives = 106/205 (51%), Gaps = 4/205 (1%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA-NGEAWG 70 + + + I + + A+A + + + T E+++++ + + + Sbjct: 1 MKILVLCI-VNFIIFTQSALALEYRQIRNTTDDQFEVIEISNLEQLRLFLKNPQTDQYYK 59 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + + + Q+ AMNGG++ ++P+GLYIENG++ LN G GNFF++P GV Sbjct: 60 SFDNIQYQLKACEQLTFAMNGGMFHSGFSPVGLYIENGRESQPLNEDKGWGNFFLQPNGV 119 Query: 131 FYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 + I+ + +K + +A QSGPML+ NG INP N S KIRNGVG+ K Sbjct: 120 LAWNDKQAVILTTEQYKAKVFQPDYATQSGPMLVINGKINPLFLANSDSKKIRNGVGV-K 178 Query: 190 HGNAVFLLSQQATNFYDFACYAKAK 214 + F++S+ NFY FA + + K Sbjct: 179 NNKLYFVISKNRVNFYSFAQFFQKK 203 >UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6Z2_9GAMM Length = 304 Score = 189 bits (481), Expect = 6e-47, Method: Composition-based stats. 
Identities = 75/219 (34%), Positives = 110/219 (50%), Gaps = 11/219 (5%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAP 100 + Y +P V ++W+ A+G A+ L L + G +V MN GIY E+ P Sbjct: 85 NVRYGIYQADPAQ--VSLHWKTADGSAYANLATLKRSLEQSGARVAFLMNAGIYSENDTP 142 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSG 159 GL+IE GQ V LN +G+GNF I+P GVFY+ K I A+ + +AVQSG Sbjct: 143 AGLWIERGQTLVPLNRKNGKGNFHIQPNGVFYIERGKARIQTSAAYHIGNHHPDWAVQSG 202 Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKA 213 P+L+ +G NPR N++S RN V F+L++ +F+ FA + Sbjct: 203 PLLLLDGKPNPRFVKNLSSPHKRNAVCTTADNRLYFILTEDYDLGSEWPSFHRFAEALQH 262 Query: 214 KLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 L LYLDGT+S Y+ G A + +V +I+V Sbjct: 263 -LGCHDALYLDGTLSGWYIPGIAGTFHWTHYVGIIAVTT 300 >UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744B4D Length = 235 Score = 180 bits (456), Expect = 5e-44, Method: Composition-based stats. Identities = 63/227 (27%), Positives = 106/227 (46%), Gaps = 13/227 (5%) Query: 38 LSDPTLTVQAYTVNPQTE-RVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYD 95 + V+ R+ + W +G+ G+ LL + QG+ ++ A N GIY+ Sbjct: 10 IEFEGAIYHVLRVDRADFSRLDLRWLGQDGKPLGSFGPLLQEAARQGRRIEFATNAGIYE 69 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAF-KTSKEIQ 153 P GL I G++ V LNLA GEGNF++ P GVFY+ G++ + ++ + + Sbjct: 70 RGPKPCGLTIAGGKELVPLNLAKGEGNFYLHPNGVFYLDDQTGAGVMTGAEYGQSGLQPR 129 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK-HGNAVFLLSQ------QATNFYD 206 A QSGP+L+ G I+P + N + ++RN VG+ G VF++S F+ Sbjct: 130 LATQSGPILLRQGKIHPAFNFNSPNRRLRNAVGVRASDGQVVFVMSDREDRVKGRVTFHQ 189 Query: 207 FACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVER 252 + + L + L+LDG IS ++ F M + + Sbjct: 190 LSRFFLH-LGCQDALFLDGDISDFLFHPPAGAAVTPNTFAGMFVLWK 235 >UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legionella RepID=Q5WVS5_LEGPL Length = 258 Score = 165 bits (418), Expect = 1e-39, Method: Composition-based stats. 
Identities = 53/233 (22%), Positives = 91/233 (39%), Gaps = 27/233 (11%) Query: 11 MITLNLKRIFLALT---------LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 + + IF +T L P + L +P + + ++ ++ + Sbjct: 17 FLLILALAIFTPMTSYSASDWQELTPGIEYQDLEGGLLNPWSHIHVFRIDLNKNQMALVT 76 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 K + ++ + +++NGG +D + PLGL I N +Q+ L S Sbjct: 77 AKNLAQKNASVDQF----AEHSKALLSINGGFFDHEFNPLGLRINNKKQENPLKRISW-- 130 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 G+FYV +K I + F I FA+QSGP L+ G I P + VA Sbjct: 131 ------WGIFYVKDNKPRITNIRNFHYDSNIDFAIQSGPRLLIRGNI-PSLKAGVAD--- 180 Query: 182 RNGVGINKHGNAVFLLSQQ-ATNFYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 R +GI G + L++ A + A ++ L+ + LDG S Sbjct: 181 RTALGITDDGKVIILVTTNAAMSTRQLAQIMRSPPLSCSDAINLDGGSSSQLY 233 >UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REU9_LEGLO Length = 260 Score = 164 bits (414), Expect = 3e-39, Method: Composition-based stats. Identities = 54/223 (24%), Positives = 93/223 (41%), Gaps = 19/223 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L P + P V + V+ + ++ + K + ++ + Sbjct: 40 LSPGIEYQDLAGGILAPWSHVYVFRVDLKKNKLGLVNAKNLSLKYASV----NQFAEHSK 95 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 +++NGG +D + PLGL I NG+ + L S GVF++ +K I L Sbjct: 96 ALLSINGGFFDHKFNPLGLRITNGKLENPLKRISW--------WGVFFIKNNKAYISSLR 147 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-ATN 203 F+ +I FA+QSGP L+ N I P + P +A R+ +GI G + L++ A Sbjct: 148 QFQYDNDIDFAIQSGPRLLVNRKI-PSLKPGIAE---RSALGITADGKIILLVTTNAAMT 203 Query: 204 FYDFACYAKA-KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPF 244 A ++ L+ + LDG S +Y G+ + F Sbjct: 204 TNKLAHLLRSPPLSCMDAINLDGGSSSQLYAHIGSFLLNVHGF 246 >UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucella sp. 83/13 RepID=D1CZ42_9RHIZ Length = 248 Score = 160 bits (406), Expect = 3e-38, Method: Composition-based stats. 
Identities = 46/167 (27%), Positives = 81/167 (48%), Gaps = 10/167 (5%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLDAF-KT 148 ++PLGL+I +G+++ + A + NF+ +P G+F++ G++ + F K Sbjct: 82 AGFSPLGLFIADGKEQSPIQPAGAKTSDKPVPNFYKKPNGIFFLDESGAGLLPTEQFVKR 141 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 ++ A QSGPML+ +NP A R+GVG+ K G F++S A NF+DFA Sbjct: 142 RPKVWLATQSGPMLVIENRLNPIFIIGSADKSRRSGVGVCKDGVIHFVVSDDAVNFHDFA 201 Query: 209 CYAKAKLNVEQLLYLD-GTISHMYMKGGAIPWQ--RYPFVTMISVER 252 + + +L L+LD G + +Y + M ++ Sbjct: 202 RFFRDRLECPNALFLDGGGGAGLYDPALGRNDMSWHGGYGPMFALIE 248 >UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella burnetii RepID=A9KDD2_COXBN Length = 255 Score = 153 bits (387), Expect = 5e-36, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 86/227 (37%), Gaps = 22/227 (9%) Query: 26 LPLFAVAADDCALSDPTL--TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + V + S P L + A+ +NP+ + ++ A Sbjct: 36 MAYTVVTPAFSSESRPGLFTHLYAWKINPRQYHFNIVTA----KSLQQTALYAAQAAKIK 91 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+NGG + + PLGL I + + +L S G+F + ++ I Sbjct: 92 DTVLAINGGFFTPNLEPLGLRISDNKVLSSLKRISW--------WGIFMIKNNRAAITSP 143 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-- 201 ++ S EI FA+Q+GP L+ +G I P++ A R+ +G+ G+ + ++ Sbjct: 144 QNYRYSPEINFAIQAGPRLIIDGRI-PQLRGGSAQ---RSALGVTPTGDIIIAITDNNLL 199 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTM 247 A KL L LDG S +++ Q + Sbjct: 200 LTATQLA-ILLQKLGCSNALNLDGGTSSQLFVHTNNFSLQIPSLRPV 245 >UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseiflexus RepID=A5USB9_ROSS1 Length = 282 Score = 142 bits (357), Expect = 2e-32, Method: Composition-based stats. 
Identities = 47/221 (21%), Positives = 92/221 (41%), Gaps = 23/221 (10%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 SDP + + A ++P T R+++ + + T + +A+NGG + Sbjct: 70 DSSDPPVPIYAVRLDPATIRLRIRYAPDAPQPLRTW-------FVAHRPLVAVNGGFFTA 122 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-AGDKVGI--VRLDAFKTSKEIQ 153 L + +G G + GG+ +V I +R + + + + Sbjct: 123 ENRATALIVSDGTVY---------GTSYAGFGGMLAAAPDGRVWIQALRDEPYDPNIPLD 173 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQATNFYDFACYA- 211 A+QS PML+ G + I+ N R V I++ G + ++ A + + A + Sbjct: 174 QAIQSFPMLIYPGGVVASINDNGQ-RARRTVVAIDRAGRVLLIVCPTSAFSLQELATWLA 232 Query: 212 KAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVE 251 + + +++ L LDG S +++ GA+ WQ F + SV Sbjct: 233 SSDMEIDRALNLDGGSSSGIFVNAGAVRWQIDSFAALPSVI 273 >UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostridium RepID=A6LS70_CLOB8 Length = 356 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 61/218 (27%), Gaps = 30/218 (13%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-------- 96 +P ++ + + G + I A+NGG + + Sbjct: 141 YYLVVKDPTRVKIGVSSK------LGVEGETTSTIAENNDAIAAINGGAFTDQSSAAQWT 194 Query: 97 --SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 G+ + G+ KV + F I GV V IQ Sbjct: 195 GNGGLASGIVMTGGEVKVNDVGDNPTTTFGIDKNGVMVVGD------YTVEKLKELGIQE 248 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFA 208 A+ GP L+ NG + + + +G K G+ + L+ + Sbjct: 249 ALSFGPALIINGNMVKINGDGGFGTAPKTAIGQMKDGSIILLVIDGREIGSIGATLKELQ 308 Query: 209 CYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 +L + LDG S +Y G Sbjct: 309 EIM-HQLGAWNAMNLDGGKSTTLYYYGEVRNKPSNSMG 345 >UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=Bacteroides RepID=C3QHD0_9BACE Length = 311 Score = 133 bits (334), Expect = 7e-30, Method: Composition-based stats. 
Identities = 45/236 (19%), Positives = 80/236 (33%), Gaps = 25/236 (10%) Query: 24 TLLPLF-AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE----AWGTLH-ALLA 77 TL P A+ + + + + + V+ + + M E + LA Sbjct: 62 TLAPGVKALEMEILSATGMAVKMFVLEVDLKDTHLTMKASSPKDEGKLKTKQQMTLQALA 121 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 +V A+NG + P G+Y NG + F + G K Sbjct: 122 HDKQGSRVLAAVNGDFFATDGTPQGIYYRNGVCLKNTMTDNVCTFFAV-------TKGKK 174 Query: 138 VGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 I D + EIQ AV LM NG + P+ + + + R +G+ + L Sbjct: 175 AVIGSYDEYDTYKDEIQEAVGGRVRLMTNGNVLPQ---TLTALEPRTAIGVTDNNVVYIL 231 Query: 197 LSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 ++ +Y + KA L + + LDG S ++ ++ F Sbjct: 232 VADGRNFWYSNGMRYAEMGAVMKA-LGAKDAINLDGGGSSTFIIRSKAGFEENRFA 286 >UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WLU9_9ACTO Length = 447 Score = 133 bits (334), Expect = 7e-30, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 68/221 (30%), Gaps = 28/221 (12%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 + T+T TV T+ + AN + + + I S A+NG Y + Sbjct: 221 NNTVTYYVATVKL-TDATALKSAFANNQFGRNITQKTSTIASNNNAIFAINGDYY--GFR 277 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G+ I NG +G F+ Y + + + G Sbjct: 278 SSGIVIRNGVVYRDDGARAGLA-FYRDGSVKIYDE-----TSTNGQKLVKEGVWNTLSFG 331 Query: 160 PMLMENGVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 P L++NG I I ++ ++ R VG K G VF++ Sbjct: 332 PSLVKNGKIVEGIDDVEIDTNFGNHSIQGNQPRTLVGAKKDGTLVFVVVDGRDAGYSRGV 391 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 + A + LDG S MY G I Sbjct: 392 TMTEAAKIMLEQ-GCVTAYNLDGGGSSTMYFNGEVINEPSN 431 >UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KSV8_9BACE Length = 390 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. 
Identities = 39/194 (20%), Positives = 69/194 (35%), Gaps = 19/194 (9%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASG 119 + ++ + + A LA S +V A+NG + + P G+Y NG + Sbjct: 183 FLGTSSSYYYVSRDAALAYDKSGSRVLAAVNGDFFAKDGTPQGIYYRNGTCLKGTMTDNV 242 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVAS 178 F I + I D + + IQ AV LM NG + P+ V + Sbjct: 243 CTFFAITKN-------KRAIIGSYDEYDSYKENIQEAVGGRVRLMTNGNVLPQ---TVTA 292 Query: 179 SKIRNGVGINKHGNAVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMY 231 + R +G+ L++ +Y + KA L + + LDG S + Sbjct: 293 LEPRTAIGVTDDNVVYILVADGRNFWYSNGMRYAEMGAVMKA-LGAKNAINLDGGGSSTF 351 Query: 232 MKGGAIPWQRYPFV 245 + ++ F Sbjct: 352 IIRKIAGFEDGRFA 365 >UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-layer protein n=2 Tax=Lactobacillus rhamnosus RepID=C2JZN3_LACRH Length = 559 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 78/222 (35%), Gaps = 22/222 (9%) Query: 24 TLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL----HALLAD 78 TL P + S + +NP+ ++ A + A Sbjct: 125 TLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASAA 184 Query: 79 INSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 I + QV A+NG ++ S P G I++G + A ++ E F I+ G + ++ Sbjct: 185 IKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGDEQ 243 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 K ++Q A+ +L+ +G +N S+ R VGI G F++ Sbjct: 244 T------YQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFVV 296 Query: 198 SQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + D A + L LDG S Y+ Sbjct: 297 VDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYV 337 >UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C5_CLONN Length = 335 Score = 130 bits (327), Expect = 4e-29, Method: Composition-based stats. 
Identities = 39/232 (16%), Positives = 67/232 (28%), Gaps = 35/232 (15%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 + + NP+ +V E G+ + + + A+N G + Sbjct: 105 REIHGDKFKGHLLVIKNPKKIKVGY------NEHLGSKGETTSAMAKRYNSIAAINAGGF 158 Query: 95 -------------DESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + + P G+ I NG+ L I G+ V + Sbjct: 159 VANNASSKDANPSETNGNPGGILISNGEIVYNNLRNNEKICIAGITADGILLVGNYNLDE 218 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + ++ AV GP L+ NG + R +G K G+ +FL+ Sbjct: 219 M------MKLNVKDAVSFGPALIVNGQKTITSGDGGWGTAPRTAIGQRKDGSILFLVIDG 272 Query: 201 ------ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 A + + + LDG S MY G I Sbjct: 273 KYIGRLAVTLRELQDILY-EYGAYNAVNLDGGSSSTMYYNGKVISEPYKSTG 323 >UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AQ96_9BACE Length = 305 Score = 130 bits (326), Expect = 5e-29, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 70/222 (31%), Gaps = 17/222 (7%) Query: 41 PTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 Y + Q + A+G + + + I +A+NG Y + Sbjct: 76 REYDTSIYVADIQLADASYLRAGLADGTFGRNVTEVTSQIAQDSNAILAINGDFY--GFR 133 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ-- 157 G + NG +GN + V Y G I + + + A Q Sbjct: 134 NKGYVMRNGYLYRETAQQGRQGNS-RQEDLVIYEDGHMDVIEENEVAAQTLKDSGASQIF 192 Query: 158 -SGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 GP L++NG I + V S R +G+ + + +S Y Sbjct: 193 SFGPGLIKNGNITVDENSEVEQSMQSNPRTAIGMITPLHYIMAVSDGRTEASEGLTLYQL 252 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A K + + LDG S G + + + + IS Sbjct: 253 AQIMKGQ-DCVTAYNLDGGGSSTMWFNGEVVNKPTSYGSKIS 293 >UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT14_PEDHD Length = 303 Score = 130 bits (326), Expect = 6e-29, Method: Composition-based stats. 
Identities = 34/213 (15%), Positives = 68/213 (31%), Gaps = 27/213 (12%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD----INSQGQVQMAMNGGIYDES-YA 99 + ++ + VK+ + + +V +NG ++ S Y Sbjct: 72 IFILKIDLKNPDVKLQAATPYDAPGYGSQTVPEMAKYVDAANNRVIAGINGDFFNTSSYV 131 Query: 100 PLGLYIENGQQKVALNLAS------GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ 153 PLG+ + G + G I G Y+ D +++ Sbjct: 132 PLGIIYKKGVAIKPAFTDNTDKPQQGLSFLGILANGKPYIGDK-----ETDYPTIKSQLK 186 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 A+ +G L+++ +I ++ + R GVGI F++ N+ + Sbjct: 187 EALGAGVFLVKD---YKKITQSIPTVDPRTGVGITDDDLVYFIVVDGRNFYNSNGINYQE 243 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 A V+ + LDG S +M Sbjct: 244 MGKIMYA-FGVKNAVNLDGGGSSTFMIKHPRVD 275 >UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY15_CLONN Length = 436 Score = 130 bits (326), Expect = 6e-29, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 72/230 (31%), Gaps = 33/230 (14%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + V P ++++ + + + + + ++I + A+N G + + Sbjct: 200 DIKTNRFNGKMLIV-PNSKKIVIGFNEESP---SKVGKTTSEIAKENNAICAINAGGFTD 255 Query: 97 S------------------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 P G+ I NG+ N G N I G F G + Sbjct: 256 DVSGKSAEVVLNPDSGYETRKPCGILIHNGEFVY--NDDKGRKNEKIDIVG-FSKRGKLI 312 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + I+ AV GP L+ +G + R +G + G+ +FL+ Sbjct: 313 VGKYTLEELKNINIKEAVSFGPALIVDGNPVNILGDGGWGVAPRTAIGQRRDGSVLFLVI 372 Query: 199 QQA------TNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQR 241 D K K LDG +S MY K I Sbjct: 373 DGRGFKSMGATIKDVQDIMK-KYGAVNASNLDGGTVSTMYYKDKVINKPC 421 >UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein (Fragment) n=1 Tax=Lactobacillus rhamnosus HN001 RepID=B2KU41_LACRH Length = 470 Score = 129 bits (325), Expect = 7e-29, Method: Composition-based stats. 
Identities = 45/222 (20%), Positives = 78/222 (35%), Gaps = 22/222 (9%) Query: 24 TLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL----HALLAD 78 TL P + S + +NP+ ++ A + A Sbjct: 125 TLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASAA 184 Query: 79 INSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 I + QV A+NG ++ S P G I++G + A ++ E F I+ G + ++ Sbjct: 185 IKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGDEQ 243 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 K ++Q A+ +L+ +G +N S+ R VGI G F++ Sbjct: 244 T------YQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFVV 296 Query: 198 SQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + D A + L LDG S Y+ Sbjct: 297 VDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYV 337 >UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lactobacillus rhamnosus RepID=C7TED9_LACRG Length = 1561 Score = 129 bits (324), Expect = 9e-29, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 71/200 (35%), Gaps = 20/200 (10%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS----QGQVQMAMNGGIYDE-SYA 99 + ++P+ + N + + N+ QV A+N Y+ + A Sbjct: 140 YYSVALDPKNPNTTLLAGMPNDGTKPGMQTVRNQANAAISHGQQVVAAVNADYYNMATGA 199 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 PLG ++NG + + + E F I+ G + + ++Q AV Sbjct: 200 PLGNVVKNGTEIYSA-PDTNEAFFGIKKDGTPMIG------TAATYQQRKGDLQQAVGGP 252 Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAK 212 + +++G +N ++ VGI G F++ + DFA Sbjct: 253 SIFVKDGKVNATQVAGSEGNEPCTAVGIKADGTVFFVVIDGRQAPLSTGISVGDFAKLMI 312 Query: 213 AKLNVEQLLYLDGTISHMYM 232 + L+LDG S ++ Sbjct: 313 ER-GAVNALFLDGGGSATFV 331 >UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtilis ortholog n=6 Tax=Clostridium RepID=Q97FU3_CLOAB Length = 354 Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats. 
Identities = 40/238 (16%), Positives = 72/238 (30%), Gaps = 38/238 (15%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + + + +P ++ + G ++I A+NGG Sbjct: 117 ECKKIQGNKFSGLMLVIHDPTKVKIGYTSK------LGVEGETTSEIAKHNNALAAVNGG 170 Query: 93 IYDESYA------------PLGLYIENGQQKVALNLAS---GEGNFFIRPGGVFYVAGDK 137 + E+ + P G+ I +G+ N +G I GV V Sbjct: 171 GFQENSSGSKVVWTGTGALPTGIIISDGKVVYPKNPDQLSIQKGTAAITKSGVLVVGDHS 230 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENG--VINPRIHP--NVASSKIRNGVGINKHGNA 193 + + ++ + A+ GP L+ NG + ++ R +G K G Sbjct: 231 IREL------LNENVVEAINFGPTLIVNGVDQTRDSFGNSIDSQGAQPRTAIGQRKDGAI 284 Query: 194 VFLLSQQATNFYDFACY-----AKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + L A + N + LDG S MY G I F Sbjct: 285 LLLTVDGRQGLQMGATIKDIQKIMEQENAYNAVNLDGGASTTMYYNGHVINNPCDKFG 342 >UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostridium RepID=B8I4Q1_CLOCE Length = 346 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 59/227 (25%), Gaps = 27/227 (11%) Query: 34 DDCALSDPTLTVQAYTVN-PQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + + V+ P +V + + I + A+NGG Sbjct: 120 EYFDVESRNFKGKMIIVDDPTRIKVGYSSKMPRS------GETTSSIARRNGAVAAINGG 173 Query: 93 IY------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI-VRLDA 145 + +G I NG+ N + + + + + + A Sbjct: 174 GFIDKGWAGTGGVAIGFVISNGKYISGKLT-----NNYTKRDTIAFTKDGMLIVGKHSQA 228 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA---- 201 I+ + GP L+ NG R +G + G+ + L+ Sbjct: 229 ELAKYNIKEGISFGPPLIVNGKPTINKGDGGWGISPRTAIGQKEDGSVMLLVIDGRSLKS 288 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + LDG S MY G + Sbjct: 289 FGATLKEVQDIMLEH-GAVNAANLDGGSSATMYYDGKVVNTPSDALG 334 >UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B1E5_HERA2 Length = 272 Score = 126 bits (317), Expect = 6e-28, Method: Composition-based stats. 
Identities = 42/238 (17%), Positives = 84/238 (35%), Gaps = 26/238 (10%) Query: 22 ALTLLPLFAVAADDCALSDPTL---TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 T+ + + VQ V+P R+++ + A+ G + A Sbjct: 46 PTTIDNQWQTLEPGLEFREIGYDITNVQILRVDPAYFRLRVGYDVASP---GRVSEWAAA 102 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + +NGG +D L I +G G + GG+ V Sbjct: 103 LKP----VAVINGGYFDAQGRATALTIFDGVI---------NGTSYDGFGGMLAVDSADG 149 Query: 139 GIVRL---DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 +R + +++ + A+QS PML+ +G + + + R+ V I++ G + Sbjct: 150 WSLRSLREQPYDSTEVLNQALQSAPMLVVHGAAIEQPNDDGD-RARRSVVAIDQTGRLLL 208 Query: 196 LLSQQA-TNFYDFACYA-KAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 ++ D + + K L ++ L LDG S + + + V + V Sbjct: 209 MVCSWPSFTLTDLSQWLVKQDLAIDAALNLDGGSSTGLVVASENRSFNLDSLVRVPQV 266 >UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactobacillales RepID=C4G6X0_ABIDE Length = 345 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 63/214 (29%), Gaps = 22/214 (10%) Query: 41 PTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 Y + + + A + + + + +A+NG Y + Sbjct: 120 RKNNTTVYVADIKLSDSSYLKTALAYDSFGTNVTETTSSMATNNNAILAVNGDYYGADRS 179 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV--- 156 G I+NG + E P Y G I + V Sbjct: 180 --GYVIKNGVIYRNTVRSDSE-----YPDLAVYKDGSFKIIYETEVTAEELLADGVVNLF 232 Query: 157 QSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 GP L+ENG I+ + V R +GI + + ++S + Y+ Sbjct: 233 AFGPSLVENGEISVDQNTEVRQAMTKNPRTAIGIVDKNHYILVVSDGRTSESEGLSLYEL 292 Query: 208 ACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQ 240 A K + LDG S MY G + Sbjct: 293 AEVLK-EYGATTAYNLDGGGSSTMYFNGNIVNNP 325 >UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Clostridium tetani RepID=Q892K3_CLOTE Length = 708 Score = 123 bits (309), Expect = 6e-27, Method: Composition-based stats. 
Identities = 40/195 (20%), Positives = 68/195 (34%), Gaps = 21/195 (10%) Query: 53 QTERVKMYWQKANGEAWGTLHALL----ADINSQGQVQMAMNGGIY-DESYAPLGLYIEN 107 + RV + N + + +++ A I S V +NG Y + P+G+ +N Sbjct: 92 TSSRVGVKAGTPNNKDSYGMQSVIMQAKASIASGDNVVGGVNGDFYYTVTGEPIGIVYKN 151 Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV 167 G+ A N A+ F + G + K + +Q A+ +L+ G Sbjct: 152 GKAVKA-NHAAEWNFFGVLEDGTPIIGDGK------KYNEVKDSLQEALGGNAILVREGR 204 Query: 168 INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQL 220 I + + R VGI K G F+ + D A L + Sbjct: 205 IY-QTPSIGGYREPRTAVGIKKDGTIFFVTVDGRQEGHSAGISMPDLAQLMID-LGAVEA 262 Query: 221 LYLDGTISHMYMKGG 235 L LDG S ++ Sbjct: 263 LNLDGGGSSTFVSRK 277 >UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GEE0_9FIRM Length = 379 Score = 122 bits (307), Expect = 9e-27, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 65/201 (32%), Gaps = 24/201 (11%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQ 109 V+P RV + +G ++ I + +A+N + + P + G+ Sbjct: 181 VDPTKLRVAF-----AHDEYGAPRKPVSKIANSNNAILAINASGFSGN-VPFSPVVREGE 234 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 + G I G+ +G + ++ + P+L+ NG + Sbjct: 235 VYSMDINHTPMG---ITACGMLMDSGKRGVEQMIED-----GAHQVITFRPVLVRNGQM- 285 Query: 170 PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLY 222 N + R +G ++G+ +F++ N D A + Sbjct: 286 TSTAQNNNTIHPRTAIGQKENGDLIFIVVDGRRNNWSTGINLGDLAQIFIDE-GAAWAYN 344 Query: 223 LDGTIS-HMYMKGGAIPWQRY 242 LDG S +Y G + Sbjct: 345 LDGGGSTTLYFNGKVLNKPSD 365 >UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chloroflexus RepID=A9WEC1_CHLAA Length = 265 Score = 122 bits (307), Expect = 9e-27, Method: Composition-based stats. 
Identities = 40/222 (18%), Positives = 84/222 (37%), Gaps = 26/222 (11%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 L P L VQ ++P R + + + L A + G A+NGG Sbjct: 55 AFRQLEAPGLPVQVVRIDPAHVRFVVGYDPTSPLT------LSAWVARYG-AVAAINGGF 107 Query: 94 YDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV---GIVRLDAFKTSK 150 +D+ P+ L I N Q G ++ GG+F + + + + Sbjct: 108 FDQQGEPVALLISNQQVF---------GYSYVDQGGMFAIDEQGKPHLWSLADQPYDGTP 158 Query: 151 EIQFAVQSGPMLME-NGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFYDFA 208 A+Q P+L+ NG + R+ + ++++G + +++ A + +++ Sbjct: 159 -FVQAIQGWPLLVRTNGE--AAYTDDDGQRARRSAIALDRNGYVLLIVAPGATFSLAEWS 215 Query: 209 CYAKA-KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMI 248 + + L++E + LDG S + + + F + Sbjct: 216 QFLASADLDIEIAVNLDGGSSSGLIAQSDQGGVRVDSFTPLP 257 >UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4R7_9CLOT Length = 894 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 67/203 (33%), Gaps = 22/203 (10%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD----INSQGQVQMAMNGGIYDE- 96 + ++ + + V + N E+ L + + V +N Y+ Sbjct: 64 RIESFVIEIDTKNKNVSIEASTPNDESAYGLQPVRKQAEALLAKGENVVAGVNADFYNMA 123 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + P G+ +++G N F I G + + ++ A+ Sbjct: 124 TGEPNGVLLKDGVIIK--NHPESRKFFGILKDGSAVIGD------YNKFNEVKDNVEEAL 175 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFAC 209 +L+++G + A + R VGI +GN F+ + D A Sbjct: 176 GGNAILVKDGQVFET-PQTGADKEPRTAVGIKSNGNVFFITVDGRQEPYSAGLSMDDLAQ 234 Query: 210 YAKAKLNVEQLLYLDGTISHMYM 232 + + Q L LDG S ++ Sbjct: 235 LMIS-MGAIQALNLDGGGSTTHL 256 >UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=12 Tax=Firmicutes RepID=A4VXL8_STRSY Length = 312 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. 
Identities = 43/211 (20%), Positives = 71/211 (33%), Gaps = 18/211 (8%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGT-LHALLADINSQGQVQMAMNGGIYDESYAP 100 T Y + Q + +GT + A ++ + +A+NG Y + Sbjct: 88 TNNTTVYVADIQVSSPEYLKTALAQNTYGTNVTAKTSETAAANNAILAVNGDYYGAN--S 145 Query: 101 LGLYIENGQQKVALNLASGE-GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G I+NG + G+ I G F V + K + + G Sbjct: 146 TGYVIKNGVLYRDTVRDNAAYGDLAIYADGSFEVIYENEI---TAQELIDKGVVNLLAFG 202 Query: 160 PMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACY 210 P L+ENG I V SS R+ +GI + + +++ + Y A Sbjct: 203 PSLVENGEIVVDTSTEVGRAMSSNPRSAIGIIDENHYIIVVADGRTSESQGLSLYQLAEV 262 Query: 211 AKAKLNVEQLLYLDGTISH-MYMKGGAIPWQ 240 K + + LDG S +Y G I Sbjct: 263 MK-QYGAQTAYNLDGGGSSTLYFNGQVINNP 292 >UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FXK4_9FIRM Length = 305 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 76/235 (32%), Gaps = 19/235 (8%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVK----MYWQKANGEAWGTLHALLADI 79 T+ A D A++ T + ++ T +K + A+ + A + Sbjct: 63 TVNTATAYEDDTKAIAIDTYERNSTQIHVATVTIKGDASIKTALADETYGRNVKAKTSTT 122 Query: 80 NSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 +A+NG Y G I NGQ + + + + I G F + + Sbjct: 123 AQSVNAVLAVNGDYY--GARDAGYVIRNGQLLRSDSQDPNQEDLVIYQDGSFEIIREGDI 180 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNAVFL 196 +K + GP L+E+ + V + R +GI + V + Sbjct: 181 ---TAQELLNKGAVQVLSFGPALIEDSQVAVDSTDEVGKAMASNPRTAIGIIDDKHYVLV 237 Query: 197 LSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 +S + + A + K +L V LDG S G I + Sbjct: 238 VSDGRTDESKGLSLKELADFMK-ELKVTTAYNLDGGGSSTMYFNGQIINKPTTNG 291 >UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=Enterococcus faecium RepID=C2HB28_ENTFC Length = 308 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. 
Identities = 34/219 (15%), Positives = 63/219 (28%), Gaps = 18/219 (8%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWG-TLHALLADINSQGQVQMAMNGGIYDES 97 S+ Y + +G + + I + Q +A+NG Y Sbjct: 83 SERVDETTVYVADITVSDSSYLKTALANNTYGRNIKETTSAIAQEQQAILAINGDYY--G 140 Query: 98 YAPLGLYIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + G + NG + I G F + + +++Q + Sbjct: 141 FRDKGYVLRNGTLYRDTPSDDETKEDLVIDKNGDFSIIKEAE---TSAEKLVEEDVQQVL 197 Query: 157 QSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 GP L+ENG + S R + + + ++S+ + + Sbjct: 198 SFGPALVENGEVTVSEDEEVSQSMKSNPRTAIAQVGTNHYLVVVSEGRTDDSQGLSLSEL 257 Query: 208 ACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 A K + LDG S +Y G I Sbjct: 258 ATVLKNH-GAKTAYNLDGGGSTTLYFNGKVINQTVGGSG 295 >UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6T1_CHRVI Length = 272 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 74/222 (33%), Gaps = 20/222 (9%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 P+ + + T+ + + R+ + + + + Sbjct: 43 PALEAPISHSERTLESSTGRTVRAHLALFDSRRYRLAVLDLGPD---LASASDWPEHTRA 99 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 G + A+NGG + PLGL I G++ GV Y + + Sbjct: 100 AG-LLAAVNGGFFHADGQPLGLVIAGGERLNRFETVK-------LLSGVLYGDARGIHLE 151 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 R F++S I VQSGP L+E G + + S R + + + V ++ Sbjct: 152 RRARFQSSPGIDALVQSGPYLVEQGRAVRGLSTHDVSR--RTFIATDWRRHWVLGATRDG 209 Query: 202 TNFYDFACYA-----KAKLNVEQLLYLDGTISH--MYMKGGA 236 + A A VE+ L LDG S ++ G Sbjct: 210 LTLAELAEALATPGALAPWPVERALNLDGGTSTGFLFDPGAG 251 >UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=Bacteria RepID=C6D6X3_PAESJ Length = 344 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. 
Identities = 39/219 (17%), Positives = 67/219 (30%), Gaps = 28/219 (12%) Query: 44 TVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG 102 + Y + ++ + A + + I S A+NG Y + G Sbjct: 120 MITYYVADVAFNSKMNLLTAFAKDSFGTNITQNTSTIASNNNAVFAINGDYY--GFRSDG 177 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML 162 + I NG G F + D+ D + A GP L Sbjct: 178 VVIRNGTVYRDEPARIGLAMF----NDGTMKSYDEEETSTDDLLAQ--GVTNAFSFGPAL 231 Query: 163 MENGVINPRI----------HPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFY 205 + +G I + ++ +S R G+G+ + VF++ Sbjct: 232 VTDGEIAGDFSHVEIDKNFGNRSIQNSNPRTGIGMISANHYVFVVVDGRSTGYSRGMTLT 291 Query: 206 DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 +FA K +L + LDG S MY G + Sbjct: 292 EFADLFK-ELGATEAYNLDGGGSSTMYFMGRVVNNPLGK 329 >UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N9W8_9BACT Length = 275 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 65/197 (32%), Gaps = 25/197 (12%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG-- 102 + + T +++ + +G + T+ + A+N G + P+G Sbjct: 61 ISVLRADLSTPGLRLGLAECDGGNYETVSHFGRRL----DALAAVNAGFFAMKGNPMGVR 116 Query: 103 LYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 +G+ A L F + G + IV F + + AV + Sbjct: 117 YLKIDGKVLNADLGGDPERAYFVLDQTG-------RPAIVGPADF-APERCRSAVYGNRL 168 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKL 215 L+++G + P + + R G++ + + ++ F + A K L Sbjct: 169 LLKDGKVPP--LGDDKARHPRTAAGLSGN-TLLLVVIDGRARESAGVTFAELATLLKD-L 224 Query: 216 NVEQLLYLDGTISHMYM 232 + LDG S Sbjct: 225 GCTDAVNLDGGGSSTMW 241 >UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73Q09_TREDE Length = 293 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. 
Identities = 35/212 (16%), Positives = 73/212 (34%), Gaps = 28/212 (13%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI--NSQGQVQMAMNGG 92 D L + A ++ ++K+ + + + + +A+N Sbjct: 58 HIKYEDYPLIIHAVKIDLTNPKLKIVVTEPALFNSKGMVKRETTLSFARRHNTVIALNAA 117 Query: 93 IYDE-------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 ++ PLG++I+ F + G + ++ + I+ Sbjct: 118 FFNVISFSFSLRGEPLGIHIDKKINLSKP---------FPKYGALCFLDDNSAFIIESQN 168 Query: 146 F-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN- 203 +I++AV ++++NG P I R VG+ G ++L + N Sbjct: 169 TEDIKADIEYAVSGNRIILKNGK--PIITNISKKENSRTCVGLADGGKTLYLFFAEGENK 226 Query: 204 ------FYDFACYAKAKLNVEQLLYLDGTISH 229 YD A + KL + ++LDG S Sbjct: 227 KKSRGITYDQAHFFMKKLGAQDAIHLDGGGSS 258 >UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=Firmicutes RepID=C2KZT9_9FIRM Length = 438 Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 78/221 (35%), Gaps = 18/221 (8%) Query: 39 SDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 Y + + T+ + AN + +D+ + +A+NG Y Sbjct: 212 RYRAYDSNIYVADVEVTDGTSILSAFANNTYGRNITDTTSDMAEENNAVLAINGDYYGAR 271 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 + G I NG + ++GE + G + +++ K+ + Sbjct: 272 QS--GYVIRNGVVYRS-QGSNGEDMVISKDGSLSFISESD----TTTDSLIQKQTWQVLS 324 Query: 158 SGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQA------TNFYDFA 208 GP+L+ENG + + V +S R +G + +F++S + Y+ A Sbjct: 325 FGPVLVENGQVAVSENDEVGMAMASNPRTAIGTVAKNHYLFVVSDGRTSESAGLSLYELA 384 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 + K+ L + LDG S + G + IS Sbjct: 385 NFMKS-LGATNVYNLDGGGSSTMVFQGEVVNNPTTNGNKIS 424 >UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillonellaceae RepID=D1BL19_VEIPT Length = 312 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. 
Identities = 32/226 (14%), Positives = 63/226 (27%), Gaps = 32/226 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + +P+ +V ++I A+NGG + + Sbjct: 90 IQSARYVGYILEIPDPRRIQVGT------AANIQEKGDTTSNIAKMNNAVAAINGGGFHD 143 Query: 97 ------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-- 148 P G + +G+ + ++ E V +V K G + + Sbjct: 144 PNGTGTGRLPYGFILHDGEYVIGKDVGPDED--------VDFVGFSKAGNLIAGNYNKTQ 195 Query: 149 --SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 + + GP L+ +G R +G K G +FL+ Y Sbjct: 196 LGDMKAMEGITFGPPLIVDGKKMITEGDGGWGVGPRTAIGQKKDGTVLFLVIDGRQPGYS 255 Query: 207 FACYAKA------KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 + + LDG S +Y+ G + Sbjct: 256 IGATLRDVQDILFEKGCYIAANLDGGSSSTLYLNGKVVNKPADLLG 301 >UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=4 Tax=Alicyclobacillus acidocaldarius RepID=C8WTH1_ALIAD Length = 352 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. 
Identities = 47/243 (19%), Positives = 82/243 (33%), Gaps = 40/243 (16%) Query: 36 CALSDPTLTVQAYTV-NPQTERV-KMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 L +PT V +P+ RV + GE ++ + G + GG Sbjct: 121 ITLHEPTFNAFILLVKDPKRIRVVATKYLHVRGET------VMQMVQDSGAIAGINAGGF 174 Query: 94 YDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK- 147 D ++ P G+ I +G+ S +P V + I + Sbjct: 175 VDTNWQGTGAYPQGITITDGKLVSMTGSPS-------QPQPVIAFTKEGQMIAGTYSLNQ 227 Query: 148 -TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 S ++ V GP+L+ENG P + + R +G K G + L++ Sbjct: 228 LRSLDVWQCVGFGPVLVENGK--PTVSAENYAVNPRTAIGQTKDGTVILLVTDGRYATGP 285 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ------RYPFVTMISVE 251 +F D A + + + LDG S ++ G + + T I V Sbjct: 286 NDVGASFADVARIML-QFHADIAANLDGGSSATFVYKGRMWNRPVDILGARAVATSIVVM 344 Query: 252 RKG 254 +G Sbjct: 345 PEG 347 >UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZJ8_9BACT Length = 251 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. 
Identities = 50/245 (20%), Positives = 83/245 (33%), Gaps = 41/245 (16%) Query: 16 LKRIFLALTLLPLF------AV---------AADDCALSDPTLTVQAYTVNPQTERVKMY 60 + R F+ L +L L A A + ++ + A V Sbjct: 1 MYRFFVCLLVLALTTQLASAAWVLKESADRPAPTELEFTERHVQGDAGDVTLWVVTF--- 57 Query: 61 WQKANGEAWGTLHALLADI----NSQGQVQMA-MNGGIYDESYAPLGLYIENGQQKVALN 115 A+ + S+ + +A +NGG + PLGL + G + L Sbjct: 58 --NPKACAFAVMDNPTGAFDLGTASEKRGALAGVNGGYFHPDRTPLGLVVRQGVEIHPLE 115 Query: 116 LASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN 175 A GV V + + R AFK S ++ A+Q+GP L+E P + Sbjct: 116 RAK-------LLSGVLSVMPTTITLQRTGAFKGSSAVREALQAGPFLIEKEKPIPGLEA- 167 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL-----NVEQLLYLDGTISH- 229 R V N G FL+ + T A + + + LDG S Sbjct: 168 -TKEAARTVVFQNAKGRCGFLICKS-TTLAGMADLLATSSIFPEGKIIRAMNLDGGTSTA 225 Query: 230 MYMKG 234 ++++G Sbjct: 226 LWVRG 230 >UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtilis ortholog n=2 Tax=Clostridium RepID=Q97FU6_CLOAB Length = 347 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. 
Identities = 40/226 (17%), Positives = 69/226 (30%), Gaps = 27/226 (11%) Query: 40 DPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-- 96 D T + +P ++ Q G + ++ + + A+NGG + + Sbjct: 116 DGKFTANVLIIKDPNRVKIGYAAQ------IGYVGETTREMAKRYKAVAAINGGYFKDTS 169 Query: 97 --------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK---VGIVRLDA 145 P G + NGQ + ++ + D VG Sbjct: 170 PNKQSGGVGAIPTGFIMSNGQIVYPQDNSNWSEITSEEENRALTIDKDGNLQVGGTYSPD 229 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFY 205 I+ AV + P L++NG N +V+ + R +G + +F++ Sbjct: 230 QLIKSGIREAVITEPYLIKNGK-NTIQANSVSGTNPRTAIGQRADKSIIFMVIDGRQGVK 288 Query: 206 DFA-----CYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 A KL LDG S MY G I Sbjct: 289 LGATVGDVQVLMHKLGAVNAACLDGGGSTAMYYNGEIINNPSNATG 334 >UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Deinococcus deserti VCD115 RepID=C1CWE2_DEIDV Length = 442 Score = 113 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 41/257 (15%), Positives = 87/257 (33%), Gaps = 26/257 (10%) Query: 10 GMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 GM L +R+ + + A L + VQ V+ + V + + Sbjct: 189 GMKILVAQRVPVPIPPRA-TGKAVTFKQLRPLNIPVQLVRVDLRHRDVLVAPVLPHAGLV 247 Query: 70 GTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 L A + + + Q +NG + +YAP G + G+ + P Sbjct: 248 FGLGARVGQLAQRSGAQALINGSYFHPRTYAPAGDIVMQGRML----------TWGRIPM 297 Query: 129 GVFYVAGDKVGI--VRLDAFKTSKEIQF-----AVQSGPMLMENGVINPRIH-----PNV 176 + ++ I + + + + +GP ++ G ++ + P + Sbjct: 298 ALAITPDNRATIRATTTPLLRRPLDTTWRGMETVIATGPRIVTGGAVHTNYNQVFRDPAL 357 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGG 235 R+ VG++ + + V + ++ + +L V++ L LDG S + G Sbjct: 358 FGRAARSAVGLSSNRDLVMVSTRVRLTTTEMGKVM-TRLGVKEALLLDGGSSAGLAWNGR 416 Query: 236 AIPWQRYPFVTMISVER 252 A+ I V Sbjct: 417 AVLDSMRKVSYGIGVFT 433 >UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 
Tax=Micrococcineae RepID=D2NR45_9MICC Length = 356 Score = 112 bits (281), Expect = 9e-24, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 69/214 (32%), Gaps = 27/214 (12%) Query: 45 VQAYTVNPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 V ++ + + + + AN + + +++ S+ A+NG Y + G+ Sbjct: 131 VVSFVADIKLDNATLLRSAFANNKFGQNIIDTPSNMASEHNGIWAINGDYY--GFRTTGI 188 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 I NG G F+ Y S+ + + GP L+ Sbjct: 189 VIRNGVVYRDSGAREGLA-FYRDGSVKLYDE-----TATNAQTLVSEGVWNTLSFGPALV 242 Query: 164 ENGVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 ++ I I ++ ++ R GVG+ + VF++ + Sbjct: 243 KDSAIVDGIDSVEVDTNFGNHSIQGNQPRTGVGVLGTNHLVFIVVDGRSTNYSRGVTMPE 302 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 FA K L LDG S + + + Sbjct: 303 FAQMFKD-LGCVSAYNLDGGGSSAMVFNNKLVNR 335 >UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNM8_9SPIO Length = 306 Score = 112 bits (281), Expect = 9e-24, Method: Composition-based stats. 
Identities = 37/223 (16%), Positives = 71/223 (31%), Gaps = 32/223 (14%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN----------GEAWGTLHALLAD 78 + A D A L V ++ V + +A GE Sbjct: 66 PGIEAADIADPQLPLIVHIVKIDLLNPSVSVITSEAALFKNTRGRIRGETTRDFALRHNT 125 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 I + N ++ +G++I + ++ N G F+ Sbjct: 126 IAAFNAAPFKTNSLLFSIYRTIVGIHITDFRRMSMPNERYGALLFY---------KDKTA 176 Query: 139 GIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 I+ S ++++AV ++ NG I P+ + R VG+ G +F+ Sbjct: 177 RIIGSQTEDALSADVRYAVGGFWTILRNGTIVPQ---KLHRRDSRTAVGLADSGKTLFVA 233 Query: 198 S--------QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + +F + A + L + L LDG S + Sbjct: 234 AVEGENKRKSRGLSFEETAMLMQ-TLGADDALQLDGGSSSTLV 275 >UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfitobacterium hafniense RepID=B8FUP3_DESHD Length = 350 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 57/224 (25%), Gaps = 28/224 (12%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S V NPQ R+ Q G +++ +N G + Sbjct: 124 EVSGKGFQGYLLKVGNPQRVRLAATDQ------LGDRGLKVSEFVENNHAVAGINAGGFA 177 Query: 96 E------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + P G+ I G+ N + I V V Sbjct: 178 DPGGVSFGGTPTGILITEGKIIHKDNWET-YSLIGITKHDVLVVGR------YTLEQIEE 230 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFAC 209 I+ AV GP L+ NG R +G G + L+ Sbjct: 231 LGIRDAVSFGPALIVNGEPMITYGDGGWGIAPRTAIGQTHDGTILLLVIDGRQ-LGSLGA 289 Query: 210 YAKA------KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 K + LDG S + +G P+ Sbjct: 290 TLKDVQDILIEHGAVNGANLDGGSSSTLVYEGEVKNKPSSPYGP 333 >UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostridium botulinum RepID=B1BC21_CLOBO Length = 326 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. 
Identities = 37/235 (15%), Positives = 68/235 (28%), Gaps = 35/235 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 L + + NP+ RV + G + ++I A+NGG + + Sbjct: 100 LENSRFKAYLMEISNPKKVRVGY------AKKLGKVGEPTSEIAKDFNAIAAINGGSFTD 153 Query: 97 SYA-----------PLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 + P G+ + +G+ ++ + G + + +R Sbjct: 154 ETSNGTKYSGTGAFPEGVIMSHGKVIWKTVSTNTKIDIIAFNNEGKLILGKYTINELR-- 211 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ---- 200 A+ P L+ +G A R +G K G +FL++ Sbjct: 212 ----KLNCIEALCYKPSLIVDGKKAKIKGDGGAGMAPRTAIGQKKDGTILFLVADGTMFK 267 Query: 201 --ATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV--TMISV 250 + K LDG S MY G I + S+ Sbjct: 268 RDGLRMDELQDILYEK-GAYNATNLDGGSSATMYYDGEVINNPCDSVGERPIPSI 321 >UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A82C Length = 377 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 80/229 (34%), Gaps = 28/229 (12%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA- 99 + + Q TV+ +V++ A +A L I + +A+NG +D + Sbjct: 161 RSFSAQVVTVSLLHPKVELDVVLAGNKAGKVED--LRSIAKRSNAVVAINGTFFDAYTSG 218 Query: 100 ----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 P G + G + + + + G + + + ++ A Sbjct: 219 AYKAPYGYLVSKGNIFHKASGDNRTIFTYDSNNLATMMPG-----LDFKSVYETGRMEGA 273 Query: 156 VQSGPMLMENGVI----------NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFY 205 +Q+GP L+ NG + +P+I + R+ +GI K + L + Sbjct: 274 LQAGPRLLTNGKVTLDVKKEGFKDPKILTGGGA---RSALGITKDHKLILLTT-GGATIP 329 Query: 206 DFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVERK 253 A K + Q + LDG S +Y G + I V+ K Sbjct: 330 QLAEIMK-QAGAYQAMNLDGGASSGLYYNGSYLTTPGRQISNAIVVKYK 377 >UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing protein n=2 Tax=Deinococcus RepID=Q1IXP5_DEIGD Length = 444 Score = 109 bits (273), Expect = 8e-23, Method: Composition-based stats. 
Identities = 39/230 (16%), Positives = 74/230 (32%), Gaps = 25/230 (10%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 L + VQ V+ + V + A ++ + Q +NG + Sbjct: 217 QLKALNIPVQVLRVDLRHRNVLVAPVLPRTGLGTAGGARVSTLARTSGAQAVVNGSYFHP 276 Query: 97 -SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR--LDAFKTSKEIQ 153 SYAP G + G+ + P + ++ I+ E+ Sbjct: 277 RSYAPAGDLVVQGRLLA----------WGRIPVALAITPDNRAAIMTSTTPLLGRPLEVS 326 Query: 154 F-----AVQSGPMLMENGVINPRI-----HPNVASSKIRNGVGINKHGNAVFLLSQQATN 203 + + +GP ++ G + + P + R+ VG+ + + VF+ + Sbjct: 327 WHGMETVIATGPRILNGGTVVRQYASAFRDPALFGRAARSAVGLKSNRDLVFVTTHAKLT 386 Query: 204 FYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVER 252 + A+L V L LDG S + G A+ I V Sbjct: 387 TTEMGKVM-ARLGVRDALLLDGGSSAGLAWNGQAVLDSVRKVAYGIGVFT 435 >UniRef50_C6J074 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J074_9BACL Length = 406 Score = 109 bits (272), Expect = 9e-23, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 82/224 (36%), Gaps = 22/224 (9%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY-- 98 + + Q T++ +V++ A G+A G + L + + + + +A+NG ++ Sbjct: 190 RSFSTQMVTISLMDPKVRLKVALA-GDAVGKVEEL-SSLAKRHKAVVAINGTFFNAYTDN 247 Query: 99 ---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 AP G + G+ K+ + + + GD DAF ++ A Sbjct: 248 AYKAPYGYIVSGGELKMKASGDKRTIFTYDSNLLARLIPGDDF----NDAFNAGT-MEGA 302 Query: 156 VQSGPMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 +Q+GP L+ NG + + R+ +G+ + + L + A Sbjct: 303 LQAGPRLVVNGKVAVDVKAEGFKDPKILTGGGARSALGLTRDHKLILLTT-GGATIPQLA 361 Query: 209 CYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVE 251 K + Q + LDG S +Y G + + V Sbjct: 362 EIMK-QAGAYQAMNLDGGASSGLYYNGKYLTQPGRKISNALIVT 404 >UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YND3_9CYAN Length = 304 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. 
Identities = 32/217 (14%), Positives = 69/217 (31%), Gaps = 24/217 (11%) Query: 39 SDPTLTVQAYTVNPQTERVKMYW----QKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 L + ++ T +++ Q + + ++ + +Q+A+NG + Sbjct: 58 KPDPLMIHIVKIDLTTPGIELLVTPGEQGEDDQDIS--AQTTSEFLQKHYLQLAINGSFF 115 Query: 95 DESY--APLGLYIENGQQKVALNLASGEGNFFI---RPGGVFYVAGDKVGIVRLDAFKTS 149 Y P+ Y +G++ A +G + + V ++ K + F T Sbjct: 116 HPFYVHNPIDYYPNSGERVNIFGQAISQGKIYSIVNKGWSVLCISPKKKAEI---YFDTC 172 Query: 150 KEIQFAVQSGPMLMEN-GV-INPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA----- 201 + +G +++ + G I + + R V I+K G +L Sbjct: 173 PKNTLQGIAGNLILIDQGQPIKVKKFSDANQKFPRTAVAIDKTGETLWLILIDGRQSWYS 232 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + VE L DG S + Sbjct: 233 KGVTLATLTNIIQELDGVETALNFDGGGSTTLVISEG 269 >UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN59_9BACE Length = 315 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 66/233 (28%), Gaps = 21/233 (9%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA----- 77 + L + + + ++ V + Sbjct: 61 VVALGVTETDVHFQKADSRSTHIFIIDIDLNEPGVSLEVGMPYDADVRNNFQRQTLTEMA 120 Query: 78 --DINSQGQVQMAMNGGIYDESYAPL-GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 +V +N +D S + G NG + E + Sbjct: 121 DYADRPWHRVAAMINADFWDVSTMDIRGPIHRNGVILKNSFIFK-ETLPQQALSFIALTK 179 Query: 135 GDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA 193 +K+ I ++ ++ SG +++ +G I+ +P + R +G + G+ Sbjct: 180 DNKMVIADSVEYRGMQYNLKEVTGSGVIVLRDGEISGATYPGID---PRTCLGYSDDGHV 236 Query: 194 VFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 F+++ FY + KA L + LDG S + I Sbjct: 237 YFMVADGRVEFYSYGLTYPEMGSIMKA-LGCSWAVNLDGGGSTQMLIRHPIAD 288 >UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TEY5_HELMI Length = 327 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. 
Identities = 39/223 (17%), Positives = 65/223 (29%), Gaps = 27/223 (12%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 + T + V +P +V + + G + ++ + A+NGG + Sbjct: 98 DIQGYRFTGKVMIVHDPLRIKVAVSSK------LGEAGETVPEMARREGAVAAINGGGFI 151 Query: 95 DESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 D + P G+ + GQ ++ E I G V +R Sbjct: 152 DPNGQGNGAYPDGITVSRGQFISVIDEDQKENIIGITKKGQMIVGRYSARELRS------ 205 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------N 203 +I V GP L+ NG R G+G G+ + ++ Sbjct: 206 MDISEVVTFGPPLVVNGRPTITSGDGGWGVAPRTGIGQRSDGSIIMVVIDGRQIGSIGAT 265 Query: 204 FYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + K LDG S M G I F Sbjct: 266 LRELQDLLL-KYGAVTAGNLDGGASTTMVYNGKVINQPSSVFG 307 >UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V127_BACUN Length = 277 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 74/224 (33%), Gaps = 28/224 (12%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLG 102 V + ++P+ R + E + A+NG +D + + Sbjct: 59 EVSIFEISPKRYRFDVLVHNPKEET--------SIAARHAGAVAAINGSYFDMKAGNSVC 110 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR---LDAFKTSKEIQFAVQSG 159 ++G + G G + ++ ++ + + + + SG Sbjct: 111 YLRKDGVVIDTTST----GVLATVSNGAVLIKKGRLELIPWSKQEEKACTLKKGTVLASG 166 Query: 160 PMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFAC 209 P+++++G + N V + R+ V + + G + ++ N + A Sbjct: 167 PLMLKDGQVCDLSGTNRNFVDTKHPRSAVALTREGKILLIVVDGRRKGKAEGINIPELAH 226 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 L E L LDG S G A+P + S ERK Sbjct: 227 -MIRILGGEDALNLDGGGSSTLWSG-ALPDKGIANTPSGSAERK 268 >UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J8B3_NOSP7 Length = 276 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. 
Identities = 37/195 (18%), Positives = 64/195 (32%), Gaps = 32/195 (16%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYAPLGLYI-----------ENGQQKVALNLASGEGN 122 A + + + + +N G +D + Y+ EN + NL S Sbjct: 59 ATVEEFAQKHRAVAILNAGFFDPANQKTTSYVILQRKLVADPKENERLVNNPNLKSYLSQ 118 Query: 123 FFIRPGGVFYVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM------ENG---VINP 170 F R Y G V ++ + ++ A+ +GP L+ + G N Sbjct: 119 IFNRTEFRRYSCGQTVRYDIVLHSASQPAGCQLVDAIGAGPSLLPELTLEKEGFVDNANK 178 Query: 171 RIHPNVASSKIRNGVGINKHGNAVFLLSQ-------QATNFYDFACYAKAKLNVEQLLYL 223 R R VGI G+ V ++ + A + K L ++ + L Sbjct: 179 RDALGSNQPNARTAVGITHDGSVVLVMVAQKPSAPANGISLPALANFMK-TLGADKAMNL 237 Query: 224 DGT-ISHMYMKGGAI 237 DG S +Y G Sbjct: 238 DGGSSSSLYYNGKTF 252 >UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZEQ6_BREBN Length = 356 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 58/198 (29%), Gaps = 16/198 (8%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI 105 Y +P R+ + +K G+ I A + Y + G+ I Sbjct: 144 IMYISDPSRVRLVVTNRKDRGDLLDEFVNKTGAIGIVNASGFA-DPDGYGKGARAYGVVI 202 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 G+ N SGE + G ++ AV P L+ N Sbjct: 203 HEGKILQGYNPRSGETALGLTYDGKLITG------SYSAEQLVKMGVRDAVSFRPQLIVN 256 Query: 166 GV-INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNV 217 G + + R +G + G VF + + D A + V Sbjct: 257 GKNMFEGKPAKSWGIQPRTAIGQKEDGTIVFAVIDGRQPGHSIGASMNDMAELLAER-GV 315 Query: 218 EQLLYLDGTISHMYMKGG 235 + +DG S M + G Sbjct: 316 VTAMAMDGGSSSMMLHNG 333 >UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostridium RepID=A7GCS1_CLOBL Length = 339 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. 
Identities = 38/222 (17%), Positives = 64/222 (28%), Gaps = 37/222 (16%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 ++ + NPQ ++ + G + + + + A+NGG + Sbjct: 116 DINTAKFDGYILEIKNPQKVKIGYT------KYMGKMGERTSKMAERHGAVAAVNGGGFR 169 Query: 96 E-----------SYAPLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRL 143 + P GL I NG+ + + N G+ V V + Sbjct: 170 DVSSTGKLWTGTGAYPEGLVISNGKVIYNDFKSGQKVNVTAFTKEGLLVVGDHTVDEL-- 227 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-- 201 + A+ L+ NG P R +G + G V L+ Sbjct: 228 ----LKMGVVEALSFRNTLIINGKPIPY----NEGINPRTAIGQKQDGTIVLLVIDGRRG 279 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIP 238 + + V LDG S MY KG I Sbjct: 280 IKQGATLEEVENILLQR-GVVNASNLDGGSSSTMYYKGKVIN 320 >UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M8_9BACE Length = 329 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 59/209 (28%), Gaps = 53/209 (25%) Query: 78 DINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQK---------VALNLASGEGNFFIRP 127 Q + + MNGG + + L G+ + G F + Sbjct: 101 WQAEQQKYPIIMNGGYFVMGAGKSVSLLCREGEVLAVNSQEEIRSQKSYYPTRGIFQLSK 160 Query: 128 GGVF-----YVAGDKVGIVRLDA------FKTSK-------------EIQFAVQSGPMLM 163 G F Y D V ++ + A+ GP+L+ Sbjct: 161 NGYFSTDWAYTTTDGVTYTYEQPSPNKSGYEPQPAPSAYFPTRGVKLNAETAIGGGPILL 220 Query: 164 ENGVINPRIHPNV---------ASSKIRNGVGINKHGNAVFLLSQQA--------TNFYD 206 ++G + + S R +G+ + +F + + N Sbjct: 221 KDGSVRNTFIEELFDEESGVAPESYHPRTAIGVTANNKVIFFVCEGRSVTEGVKGMNMAM 280 Query: 207 FACYAKAKLNVEQLLYLDGTISH-MYMKG 234 A K+ L + LDG S M + G Sbjct: 281 MANILKS-LGCVDAMNLDGGGSTCMLVNG 308 >UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteria RepID=Q4UP44_XANC8 Length = 439 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. 
Identities = 39/257 (15%), Positives = 76/257 (29%), Gaps = 30/257 (11%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA-LLADIN 80 LTL P + P + + ++ T +++ + G A Sbjct: 173 PLTLAPGVRYWRQAIGGAQP-VMLHIAQIDLTTPGLQLVGTPGDRSDGGEFRATPTTAFV 231 Query: 81 SQGQVQMAMNGGIY---------DESYAPL-GLYIE-NGQQKVALNLASGEGNFFIRPGG 129 G + +A+N + D+ + P G + G A S R Sbjct: 232 RDGALTLAINADYFLPFDGGHLLDKPFVPAAGQGVTAEGLAIEAGRTDSAAATSDPRVNA 291 Query: 130 VFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV---ASSKIRNGVG 186 V+ + + + V +GP+L+ +G PR + R+ VG Sbjct: 292 ALCVSQRDAVRIVRGS--CPAGSRLGVGAGPLLLLDGKRQPREASRAAYYDGPEPRSAVG 349 Query: 187 INKHGN-AVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMY---MKGG 235 +++ G+ +++ +L + LDG S + G Sbjct: 350 LDRSGHTLWMVVADGRQPGYSAGMTLDALTAVF-EQLGAHAAINLDGGGSSTLAARVDGD 408 Query: 236 AIPWQRYPFVTMISVER 252 R + ER Sbjct: 409 VRALNRPIHTGIPGRER 425 >UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IYX5_9BACL Length = 347 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. 
Identities = 36/222 (16%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S + TV +P R+ + ++ GE ++ A + +NGG + Sbjct: 100 EISGKSYHGYVLTVNDPTKIRLGVPAKRGKGEKVSSMVARTGALA-------GVNGGGFA 152 Query: 96 E------SYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 + + P+G+ I G+ ++ + + G + + Sbjct: 153 DPNWKGNGFKPIGVVISRGKLYYNGISSGAATQIVGLDKQGKMIAGKYTLEELD------ 206 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 IQ AV P ++ NG R R +G + G +F++ Sbjct: 207 KLGIQEAVTFQPRIIVNGKGQIRSQKEGWGIAPRTAMGQREDGAILFVVIDGRQPGYSIG 266 Query: 203 -NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + YD + LDG S + +K G + Sbjct: 267 ASLYDVQQIMLER-GAVIAANLDGGSSTVLVKEGGEIVNKPS 307 >UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacteroides RepID=Q8A0T0_BACTN Length = 308 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 75/214 (35%), Gaps = 36/214 (16%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES---- 97 T ++ +NP+T K G A+ ++ I + Q A+NG +D + Sbjct: 82 TQSINILEINPKT-------GKKIGIAFTGQLEKISRIARKHQAIGAINGSYFDMTKGNS 134 Query: 98 --YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + +G + + L L G + + G V + D+ +K +K A Sbjct: 135 VCFLKVGSQVVDTTSLDELKLRV-TGAVYEKKGKVKLIPWDR---QIEKNYKKNKGSVLA 190 Query: 156 VQSGPMLMENG------VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 SGP+++++G N R+ + + + G +F+ Sbjct: 191 --SGPLMLKDGEYYDWSQCNANFIET---KHPRSAICLTEEGKILFVTVDGRSPENAVGI 245 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 N + A L + L LDG S GA Sbjct: 246 NIPELAHLL-HVLGGKDALNLDGGGSTALWLSGA 278 >UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3362 Length = 356 Score = 103 bits (258), Expect = 4e-21, Method: Composition-based stats. 
Identities = 39/227 (17%), Positives = 64/227 (28%), Gaps = 27/227 (11%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S T V +P + +G+ G + I + +A+N G ++ Sbjct: 137 EVSGSTFAGTMVVVTDPSR-----VFVGTSGDYKGEAGINVPAICDKYGATLAINAGGFE 191 Query: 96 E------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + PLG+ + GQ K N+ S VF + Sbjct: 192 DIGGVGNGGTPLGIVMSEGQLKYG-NVNSSYDLIGFDNNNVFVIGQ------MTGQQAID 244 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFAC 209 + I+ AV GP L+ NG R +G G + L+ + Sbjct: 245 RGIRDAVSFGPFLILNGTPLEVSGMGG-GLNPRTAIGQRADGAVLLLIIDGRQT-HSLGA 302 Query: 210 YAKA------KLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 LDG S + G I + V Sbjct: 303 SMNDLINVMLDFGAVNAANLDGGGSTVLYYDGEIKNKISSIYGARGV 349 >UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNL7_AKKM8 Length = 249 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 42/201 (20%), Positives = 76/201 (37%), Gaps = 19/201 (9%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEA-WGTLHALLADINSQGQVQMAMNGGIYDES- 97 L V + + T R+ + + + +G+L + + +NGG + Sbjct: 34 RDKLNVYFFRSD--THRLLVRDEGSVKTPRYGSLDKAM----RKSPCVAGVNGGFFSADA 87 Query: 98 -YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 PLGL +++G++ L S + + GG + + ++R + +Q A+ Sbjct: 88 GGTPLGLVVQDGKRLSPLATGSFAVSGVVYEGGRDGLTLVRSSVLR--RMRRLPAMQAAI 145 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK-- 214 Q GP L+ENG + N S R + + +S + A + A Sbjct: 146 QGGPFLVENGSAVKGL--NAQKSTYRTFIATDGGRRWCIGVSSS-LTLKELAAWLAAPGA 202 Query: 215 LN---VEQLLYLDGTISHMYM 232 L VE L LDG S + Sbjct: 203 LGNFRVETALNLDGGSSSAFW 223 >UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3R3L4_9BACE Length = 431 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. 
Identities = 34/218 (15%), Positives = 64/218 (29%), Gaps = 56/218 (25%) Query: 80 NSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVA----LNLASGEGNFFIRP-------- 127 + + MNGG + + A + L N ++ + G N P Sbjct: 202 AESVKPAIVMNGGYFASNGATVSLLYRNNVMLAPNLQSMSRSDGTSNVAFYPTRSAFGEI 261 Query: 128 -GGVFYV-----------------AGDKVGI----VRLDAFKTSK---EIQFAVQSGPML 162 G F V + +K G+ + + + + A+ GP+L Sbjct: 262 ENGKFEVNWVYTVSSGQTYAYPAPSPNKSGVSPMQIPSVNYPEGASIWKAKNAIGGGPVL 321 Query: 163 MENGVINPRIHP---------NVASSKIRNGVGINKHGNAVFLLSQQA--------TNFY 205 ++NG+ S+ R+ +GI +F + + Sbjct: 322 LKNGLYKNTWEAELFDTASGIGPTSNNPRSAIGITGDNRLIFFVCEGRNKTPNVPGFTLE 381 Query: 206 DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 + A + L + LDG S M + G Sbjct: 382 EVAYILRD-LGCLDAMNLDGGGSSCMLVNGQETIKPSD 418 >UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XS52_9DEIN Length = 294 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 41/253 (16%), Positives = 79/253 (31%), Gaps = 27/253 (10%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 LI + RI T + V + V A VN V + Sbjct: 48 LISNTIHPGQRLRIRPPATSFSVKLVTRPVLK-----VPVLAVHVNLAHPEVSIRSLLPP 102 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFF 124 +L + + ++ A+NGG + ++ P G + G Q V ++ + Sbjct: 103 PGVGRG-GEVLQRLAWRTRLVAAINGGYFHPRTFWPAGDLVVGGHQLVKGSIQTALA--- 158 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVAS 178 + ++ E A +GP ++ G + P + Sbjct: 159 -------ITPDKRARVMVGPQTWRGYETVIA--NGPYILRRGRLVVTPRAEGYNDPAIWG 209 Query: 179 SKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAI 237 R+ VG+ +F+ ++ + AKL ++ + LDG S + KG + Sbjct: 210 RARRSAVGVVNERYLIFVSTKMELTLSELGKVM-AKLGAKEAIVLDGGSSTGLVWKGETL 268 Query: 238 PWQRYPFVTMISV 250 I + Sbjct: 269 IRPGRALSYGIGI 281 >UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostridium RepID=B2V2N5_CLOBA Length = 348 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. 
Identities = 38/223 (17%), Positives = 71/223 (31%), Gaps = 32/223 (14%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 + + NP +V M + G L +++ + A+NGG + Sbjct: 118 DIHTDRYDGYMLEIENPHKVKVAMT------KYLGKLGQKTSEMAEEHNAIAAINGGSFV 171 Query: 95 ----------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RL 143 P G I +G+ + G+ N + + ++ + Sbjct: 172 DKSSDGITYAGTGGQPGGFVISSGKVVYPI----GKCNEHSVENVIAFTKKGQLIVGNHT 227 Query: 144 DAFKTSKEIQFAVQSG-PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA- 201 A ++Q A+ P ++ NG+ + + R VG + G +FL Sbjct: 228 LAELKKLDVQEAMCFREPNVIINGIRQHKKEDYIDGINPRTAVGQKEDGTVLFLALDGRK 287 Query: 202 -----TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIP 238 Y+ +++ LDG S MY KG I Sbjct: 288 LSKPGATIYEVQEIMRSR-GAINAGMLDGGYSTTMYYKGDVIN 329 >UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0G9_BACOV Length = 621 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 37/179 (20%), Positives = 64/179 (35%), Gaps = 20/179 (11%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 A + I + A+NG Y S P + + KVA + S + GV + Sbjct: 103 AKTSMIAKDKKALFAINGS-YSISGNPSTFTMVDKVVKVASTIESAS-----KVNGVIAI 156 Query: 134 AGDKVGIVRL----DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV--ASSKIRNGVGI 187 + V+ D E + A+ SGPML+ G + + R+ +GI Sbjct: 157 DAEGSVDVKSCTFSDYTDVEDEYESALASGPMLLMEGKVC-SFPQDAIYTQRMARSVIGI 215 Query: 188 NKHGNAVFLLSQQATN------FYDFACYAKAKLNVEQLLYL-DGTISHMYMKGGAIPW 239 G + L A + A + L ++ + L DG+ S ++ G + Sbjct: 216 TAQGKMMLLTIDGAITGNADGATLEEAAFIAKTLGMKNAVCLADGSSSTLWTSGKGVVN 274 >UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001694670 Length = 363 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. 
Identities = 30/208 (14%), Positives = 62/208 (29%), Gaps = 15/208 (7%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 D + Y +P++ RV + +K GE ++ + ++ +A Sbjct: 118 DYWVGKMMYVFDPRSIRVVVPGKKGEGERITSMVERTGAVAGVNGGGF-IDPDGLGNGFA 176 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR-LDAFKTSKEIQFAVQS 158 P+G + G+ I V + + I + + ++ AV Sbjct: 177 PIGAILSGGKVLYNDQKED------IPQHIVGFTDKGTLVIGKYSIDQLRAMKVSEAVSF 230 Query: 159 GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAK 212 P ++ NG R +G G +F++ + Sbjct: 231 YPRVIANGKPLITKGDGGWGRAPRTALGQRADGTVIFVVIDGRQAHSVGATLREVQDLLL 290 Query: 213 AKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + +LDG S +K + Q Sbjct: 291 EQ-GCINAGFLDGGASSEMVKDRKLLTQ 317 >UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YKN4_ANASP Length = 245 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 62/212 (29%), Gaps = 39/212 (18%) Query: 64 ANGEAWGTLHALLA------DINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKV-ALN 115 + AL A + + + N G +D + + GQ + Sbjct: 8 PANSPFVVTGALSAKVSTVEEFAQKHRAFAIFNAGFFDPANQKSTSYVVVTGQMVADPKD 67 Query: 116 LASGEGNFFIRP--GGVF-------YVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM 163 N ++P +F Y+ G + ++ + + A+ +GP L+ Sbjct: 68 NERLVNNPQLKPYLNLIFNRSEFRRYLCGQTTRYDITLHNESPPANCRLVDAIGAGPRLL 127 Query: 164 ENGVINP-RIHPNVASS--------KIRNGVGINKHGNAVFLLS--------QQATNFYD 206 P N R VGI G+ + ++ + Sbjct: 128 PKLTSVPEGFVDNAKGRDALLSKQLNARTAVGITSEGSIILVMVAQKPSKPKNSGISLVQ 187 Query: 207 FACYAKAKLNVEQLLYLDGT-ISHMYMKGGAI 237 A K KL + LDG S +Y G A Sbjct: 188 LADLMK-KLGASAAMNLDGGSSSSLYYNGKAF 218 >UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ABL2_GEMAT Length = 311 Score = 99.7 bits (247), Expect = 9e-20, Method: Composition-based stats. 
Identities = 37/220 (16%), Positives = 80/220 (36%), Gaps = 33/220 (15%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 +A + TV ++P + + +G+A A + N+ +A Sbjct: 68 WAEWPVQLGARGISTTVIVVDIDPARIALTLEIA-RDGDAL----APWSLDNAPKDAVIA 122 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-- 146 +N G + + P G + ++ A + F I + I+R D Sbjct: 123 LNAGQFTDDG-PWGWVVHRQREWQAPGVGPLSAAFVID-------TAGRAAILRADEIAE 174 Query: 147 -KTSKEIQFAVQSGPMLMENGVINPRI----HPNVASSKIRNGVGINKHGNAVFLLSQ-- 199 + + A+QS P+++ +G + P + ++ IR +G+ G+ + L++ Sbjct: 175 ARRRGGWEEALQSFPLILNDGALPPGLCAPGAVDLEHRDIRLTLGVLPDGHVLLALTRYA 234 Query: 200 ------QAT----NFYDFACYAKAKLNVEQLLYLDGTISH 229 + A + +L V + + LDG +S Sbjct: 235 GVGSAGNRLPIGPTTGEMATIMR-ELGVARAVMLDGGLSA 273 >UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744905 Length = 251 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 78/232 (33%), Gaps = 28/232 (12%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVN-----PQTERVKMYWQKANGEAWGTLHALLA 77 + + L A L A V P + + A +H Sbjct: 3 VIVETLPAQWTVRSQAGPVKLPGGAIQVKKQLAGPTEAELNLILFTAGKYEMRVVHQPER 62 Query: 78 ----DINSQGQVQMAM---NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + ++ + A+ NGG + + PLGL + +G + +S G GV Sbjct: 63 DKGVSLATKMRELGAIAGCNGGYFTPDFLPLGLEVSDGVRSGTFQRSSLLG-------GV 115 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 F V + +V D + K + +Q+GP L+ G+ + + R + ++ Sbjct: 116 FLVRHGRPAMVWKDEYIEQKGVTQLLQAGPRLVHAGLPVAGLEA--TKRRARTFILTDQA 173 Query: 191 GNAVFLLSQQATNFYDFACYAKAK-----LNVEQLLYLDGTISH-MYMKGGA 236 GN + + + + + V++ L DG S ++ + Sbjct: 174 GNWALGTCKS-VTLRELSDLLSTRALLPEVTVKRALNFDGGNSTGLWWRAEG 224 >UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B3CE38_9BACE Length = 285 Score = 98.2 bits (243), Expect = 2e-19, Method: Composition-based stats. 
Identities = 32/219 (14%), Positives = 64/219 (29%), Gaps = 27/219 (12%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 A+ +L V + P+ R + E ++ + +A+NG Sbjct: 49 EAEFVSLYGVPQHVTILEIKPERHRFDILIHSPKEET--------SNAARRSGAVVAING 100 Query: 92 GIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 ++ + + ++G G G + K+ I+ Sbjct: 101 SYFNIKQGTSICYLRKDGVVVDTTAT----GVLSTVSNGAVKIDKGKLDIIAWKKQDEKT 156 Query: 151 ---EIQFAVQSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA--- 201 + + SGP+++ +G N V + R+ V + K G + Sbjct: 157 CEQKEGSILVSGPLMLLDGKTCDLSACNRSFVQTKHPRSAVALMKDGTVFLIAVAGRFEG 216 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 N + + L + L LDG S A Sbjct: 217 KAEGINIPELTHLLR-VLGARKALNLDGGGSTTLWSASA 254 >UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ICA6_CLOBU Length = 568 Score = 97.4 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 54/182 (29%), Gaps = 26/182 (14%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY---------- 94 +P+ +V + + + I A+NGG + Sbjct: 401 YYMEIKDPKRIKVGVAVK------LNEEGQTASKIAQNYNAVAAINGGGFLDQSSTGYWN 454 Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK--TSKEI 152 P+G+ + G+ + + +F + + IV + + K + Sbjct: 455 GTGGIPVGIIMSKGEVIYNDVEETEKTE-------LFAIDKQRQMIVGTYSVEDLKEKGV 507 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 Q AV GP L+ +G ++ R +G + G + L+ K Sbjct: 508 QEAVSFGPSLIIDGKMSEMTGDGGWGIAPRTAIGQKEDGTIILLVIDGR-GIGSLGATLK 566 Query: 213 AK 214 Sbjct: 567 ET 568 >UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYU6_9CLOT Length = 369 Score = 96.7 bits (239), Expect = 7e-19, Method: Composition-based stats. 
Identities = 34/223 (15%), Positives = 59/223 (26%), Gaps = 24/223 (10%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQ-KANGEAWGTLHALLADINSQGQVQM---AMNG 91 + D V NP ++ + K G+ + + + NG Sbjct: 146 EIDDTKFHACILEVKNPTRMKIGYTNKLKEVGQKTSEIAEENGAAAAINGGGFTDKSSNG 205 Query: 92 GIYDESYA-PLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTS 149 ++ + A P G+ I NG+ + + N G V V + Sbjct: 206 KLWTGTGAYPQGIVISNGKVVYSDVKNNEAVNVTAFTKDGKLIVGDHTVSEL------LR 259 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TN 203 + A+ L+ NG R +G G + L+ + Sbjct: 260 DNVTEAISFRNSLIINGKPVALAEEG---LNPRTAIGQKADGTIIMLVIDGRKGLKAGAS 316 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 + + LDG S MY G I Sbjct: 317 LKEVQNILLQR-GALNASSLDGGSSSTMYFNGEVINDPCDWNG 358 >UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XXH4_PEDHD Length = 289 Score = 96.3 bits (238), Expect = 8e-19, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 58/174 (33%), Gaps = 15/174 (8%) Query: 74 ALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 + ++ A+NG +D ++ + G+ L + + V Sbjct: 85 KTTSTFGTENNALAAVNGSFFDVKNGGSVDFIKVGGKVLAENRLEKNDSRARHQQAAV-V 143 Query: 133 VAGDKVGIVR---LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV--ASSKIRNGVGI 187 ++ K+ + + ++ + + SGP+LM NG + + S R +GI Sbjct: 144 ISNGKLALKKWDGTADWEQRLTEENVLLSGPLLMLNGT-DEALDSTSFSRSRHPRTAIGI 202 Query: 188 NKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 +G + L + + A K L + LDG S G Sbjct: 203 KPNGRILLLTVDGRNSNSAGMSLTELAKTMKW-LGCTSSINLDGGGSTTLWVSG 255 >UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B2F Length = 325 Score = 96.3 bits (238), Expect = 9e-19, Method: Composition-based stats. 
Identities = 47/214 (21%), Positives = 80/214 (37%), Gaps = 28/214 (13%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 TV +NP + + ++ +G A T LA + A+ D + PL Sbjct: 91 RETVNVIEINPANYQFQTSFK--DGFALTTAKERLATE----RAAFAITANFRDPAGKPL 144 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI-QFAVQSGP 160 GL + G Q+ F G F+V K F+ + + Q A Q P Sbjct: 145 GLVVHEGTQRNPT---------FPAWTGYFFVKAGKPWFGPKSLFEETPGVLQEASQGYP 195 Query: 161 MLMENGVI------NPRIHPNVASSKIRNGVGINKHGNAVFLL--SQQATNFYDFACYAK 212 LM+N + + + R G+ ++GN VF+L + N + A+ Sbjct: 196 SLMKNHTVFSYVDLPSTRYFDGNRVTYRALAGMKQNGNIVFILSGTGGVMNVSEVTALAQ 255 Query: 213 AKLNVEQLLYLDGTIS---HMYMKGGAIPWQRYP 243 +LNV+ LDG + + + G A + + Sbjct: 256 -RLNVQHATLLDGGRALQYSLKLHGAARHFTAFN 288 >UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8J2Y6_DESDA Length = 429 Score = 96.3 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 68/208 (32%), Gaps = 23/208 (11%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + + L+D + A ++P + + +G L+ Q + A Sbjct: 131 PGLDFGEFQLTDSEALLTALRIDPAHFDFILCARSQDGGNLRPLNQW----AEQYGLTAA 186 Query: 89 MNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFI-------RPGGVFYVAGDKVGI 140 +N +Y G +NG + G FF+ PG D Sbjct: 187 INASMYLPDGITSTGYMRQNGH-HNNKRVVQRFGAFFVAGPDSPDLPGAAIVDRDDPQWE 245 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 R+ + + +Q+ M + I P I + V + G +FL +Q Sbjct: 246 QRIGQY------RLVIQNYRMTSADRRI--LWSPGGPHYSI-SAVAQDGDGRILFLHCRQ 296 Query: 201 ATNFYDFA-CYAKAKLNVEQLLYLDGTI 227 Y FA LNV ++Y++G Sbjct: 297 PVEAYAFAQQLLHLPLNVRTVMYVEGGG 324 >UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57_ANASP Length = 660 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. 
Identities = 39/239 (16%), Positives = 70/239 (29%), Gaps = 46/239 (19%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQV---QMAMNGGIYDES----------- 97 P R + W G+ + +L + + + +A+N G Sbjct: 411 PILNRGAIAW-NDAGQFYFGRLSLQETLATSSNLRVPILALNSGYVQNGIARYTPAWGKM 469 Query: 98 YAPLG-----LYIENGQQKVAL-NLASGEGNFFIRPGGVFYVAGDKVGIVRLD-AFKTSK 150 Y PL + ++N + +G+ NF I G V T Sbjct: 470 YTPLTDNERIVIVQNNKITNQFPGNKAGQTNFPIPNNGYLLTLRGNATTVASQLPVGTDV 529 Query: 151 EIQFA------------VQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGN 192 +I A + +GP+L++N I + +A +R+G+ + Sbjct: 530 QITSATTPGEFNRYPHIIGAGPLLLQNSQIVLDAKSEQFSNAFIAERAVRSGICTTANNT 589 Query: 193 AVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + N + A K L L LDG S G + + Sbjct: 590 LLIAAVHNRAGGPGPNLAEHAQLMKL-LGCVNALNLDGGSSTSLYLSGQLLDRYPNTAA 647 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 50/146 (34%), Gaps = 9/146 (6%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T L V VNP+T + + N + +L + Sbjct: 320 ITWATGLRWRQQFVNLGTNRFPVVLLEVNPRTIGLTLKPIVTNPDTLVGTAPILQT-AQR 378 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG ++ + PLG +N Q + L G G FY ++ + Sbjct: 379 YFAVGAINGGYFNRNNRYPLGAIRQNNQWLSSPILNR--GAIAWNDAGQFYF--GRLSLQ 434 Query: 142 RLDAFKTSKE-IQFAVQSGPMLMENG 166 A ++ A+ SG ++NG Sbjct: 435 ETLATSSNLRVPILALNSGY--VQNG 458 >UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK4_BACOV Length = 326 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. 
Identities = 26/207 (12%), Positives = 60/207 (28%), Gaps = 27/207 (13%) Query: 46 QAYTVNPQTERVKMYWQKANGEA------WGTLHALLADINSQGQVQMAMNGGIYDES-- 97 V+ V + + + + +V + NG Y + Sbjct: 93 IIAEVDLNK-NVTIVTSTPDNKPEVGKILQQVTVQAEKAEAAGRKVILGTNGDFYSKKND 151 Query: 98 -YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK-TSKEIQFA 155 + P GL+ ++G + F++ + I + FK +E+ A Sbjct: 152 LWIPGGLFYKDGVAIKTEIGWEADHVFYM-------LKDGTAHITSVPEFKLVEREVVHA 204 Query: 156 VQSGPMLMENGVINPRI--HPNVASSKIRNGVGINKHGN-AVFLLSQQATNFYDFACYAK 212 + ++++G + + N R VG++ + Y + Sbjct: 205 IGGWQRMVQDGEVVKNFTVNDNAMQFHPRTFVGVSADNRKVYLFVVDGRQPEYSNGMRLE 264 Query: 213 AKL------NVEQLLYLDGTISHMYMK 233 + Q +DG S ++ Sbjct: 265 DMMLLCQGAGCYQAFNMDGGGSTTMVR 291 >UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A8G9_NATTJ Length = 718 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 34/107 (31%), Gaps = 14/107 (13%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQ 200 K +++ FA+ GP ++E G ++ R R VG+ + G + Sbjct: 596 KNVEDVVFALGGGPRILEKGEVDIRSMEEVISDNVSQGRSPRTAVGVTRDGQLLLTAVDG 655 Query: 201 A-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + + K + + L LDG S M Sbjct: 656 RQSGLSIGMTLEELGNFMKDR-GAQDALNLDGGGSTMMWFDNEFQNN 701 Score = 50.0 bits (118), Expect = 8e-05, Method: Composition-based stats. 
Identities = 21/145 (14%), Positives = 44/145 (30%), Gaps = 25/145 (17%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + + ++P + + + + L + + A+NGG Y Sbjct: 391 GQENGPIKIHELRLDP---HGDVKPELIMAQDGFSGFERLDSMAKRNNAIAAINGGFYWR 447 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + P+GLYI + + FY + I R A Sbjct: 448 AGHPIGLYISDQRLIREPMPNRS---------AFFYSKDGEATIERT-----------AF 487 Query: 157 QSGPMLMENGVINPRIHPNVASSKI 181 G M +++ IN + + + Sbjct: 488 NGGLMYIDD--INTNLSIDGVNRSR 510 >UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobium RepID=B3PTF7_RHIE6 Length = 325 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 67/207 (32%), Gaps = 23/207 (11%) Query: 27 PLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P F VA A + V+P R + + + + Sbjct: 83 PGFEVAELPVLADGREVDRIFLSRVDPARFRFVTHNAAPGDK---GIDEWEKTLP---NA 136 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL-- 143 + +NG +D+ P +I G + G F D I L Sbjct: 137 VLIVNGSYFDKHGRPDTPFISEGIAMGPRQYDARA--------GAFTADKDTAEIRDLSH 188 Query: 144 -DAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 D A+ S P+L+ ++G + ++ R V + G V +++A Sbjct: 189 QDWQTAFVGASNAMVSYPLLIGDDGQTH--VNVKSRWLANRTFVAKDDLGRVVIGTTKEA 246 Query: 202 -TNFYDFACYAK-AKLNVEQLLYLDGT 226 + A + K + LN++ L LDG Sbjct: 247 FFSLDRLAQFLKTSPLNLKVALNLDGG 273 >UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TKB7_ALKMQ Length = 236 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. 
Identities = 27/179 (15%), Positives = 61/179 (34%), Gaps = 18/179 (10%) Query: 80 NSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + + A+NGG +D + P G++ + ++ + A + G ++ Sbjct: 47 ANGYKRIGAVNGGFFDGNRTLPYGMFYVDSGFLLSESWAGDAFLELVHENGKLHIDDITA 106 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVIN----PRIHPNVASSKIRNGVGINKHGNAV 194 ++ K+ +A+ L+ G +N + S R +G + N + Sbjct: 107 NQLKTKY----KKANWAISLSYSLVVGGKMNIMKGDKFPFTNQS-HPRTLIG-DNQENYI 160 Query: 195 FLLSQQATN------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 F++++ + A +L + DG S G I + Y + Sbjct: 161 FVVTEGRMTKEKGLTAVESARVML-ELGCNTAINADGGGSSAMDVEGKIQNKYYDNRAV 218 >UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L610_BACV8 Length = 287 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 20/184 (10%) Query: 75 LLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 + + Q + A+NG + + +N +R G ++ Sbjct: 86 TTSQLAEQSRSSAAINGSYFSIKEGFSTCYLRKNEAVIDTTTTEER----HLRVNGAVHM 141 Query: 134 AGDKVGIVRLDAFKTSKEIQF---AVQSGPMLMENGVINPRIHPN---VASSKIRNGVGI 187 + + I+ + K + SGP+LM++G + + R+ + + Sbjct: 142 VDNNIRIIPWNDENEKKGFPLDGDILASGPLLMQDGKTCDFTTIDREFSETRHPRSAIAL 201 Query: 188 NKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPW 239 K G+ + + + + A + L L LDG S +++ G + Sbjct: 202 TKEGDIMLVAVDGRAEGHADGMSIAELAYLLR-ILKAHCALNLDGGGSTTLWVNGQVVNH 260 Query: 240 QRYP 243 Sbjct: 261 PSDN 264 >UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RID5_CLOCL Length = 347 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. 
Identities = 40/225 (17%), Positives = 67/225 (29%), Gaps = 39/225 (17%) Query: 38 LSDPTLTVQAYTV-NPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + + +P + V + NG+ +++ A+NGG + Sbjct: 119 IEHDRYIAHILEIKDPTKIKAVMTKYVGKNGQK-------TSEMALDYDAIAAINGGAFA 171 Query: 96 E-----------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL- 143 + P G I NG N + V + K+ + Sbjct: 172 DVSASGQKWAGNGAIPGGFVITNGAIVYPKENV----NKYDVQNVVAFTKEGKLVVGDYC 227 Query: 144 DAFKTSKEIQFAVQSGP-MLMENG--VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + + A+ P ++ +G I ++ + R +G G V L+ Sbjct: 228 INDLMAMGVTEAMCFRPPSIIIDGVAQITDKLQDG---TNPRTAIGQKADGTVVLLVIDG 284 Query: 201 AT------NFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIP 238 T YD K LNV LDG S MY G I Sbjct: 285 RTLSMPGATLYDVQQIFKD-LNVVNAGNLDGGYSSTMYFNGEIIN 328 >UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7ASL4_9BACE Length = 367 Score = 94.3 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 67/230 (29%), Gaps = 18/230 (7%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + D + D T + + +++ + + ++A I ++ Sbjct: 134 SEEQSQDIEIVDIKGTTYRGKLMIIKDPSRVFVGTV-PQFFEGDGKVVAKIAARYNAVGG 192 Query: 89 MNGGIYDES-----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +NGG + + P+GL + +G+ F + + V Sbjct: 193 VNGGEFVDGELTYTAMPVGLVMTDGRIVNGDTATRCHVTGFTKDN-ILVVGNMTGQQALD 251 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT- 202 + I ++ GP L+ NG R VG G + L Sbjct: 252 MGMRDCVSISSSI--GPFLIINGEAQDVSGVGG-GLNPRTAVGQRADGAVLLLAIDGRQA 308 Query: 203 -----NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 +F D + + +DG S MY +G I P Sbjct: 309 NSLGASFADLLYIMQ-QYGAVNASTMDGGTSTQMYYEGSVINTPYSPTGP 357 >UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WU56_ALIAD Length = 296 Score = 94.0 bits (232), Expect = 4e-18, Method: 
Composition-based stats. Identities = 34/226 (15%), Positives = 62/226 (27%), Gaps = 35/226 (15%) Query: 38 LSDPTLTVQAYTV-NPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + +P T V +P+ V+ + GE +NGG + Sbjct: 77 IHEPNFTAYVLWVRDPRRVEIVETRYAGDVGETVEQFVN-------DWHAVAGVNGGSFT 129 Query: 96 E------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + G+ I NG+ + F G + A + Sbjct: 130 DTNWQGTGGLVQGIVISNGRILKRASGPESIVGFT--------ADGRLISGTYTLAELQA 181 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------- 201 + A+ GP L++ G ++ R +G G + +++ Sbjct: 182 MGVTQALMFGPTLVDRG-VDQIQGAGDWGYAPRTAIGQTADGTVILMVTDGRELHGPADI 240 Query: 202 -TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + D A + L LDG S + G I Sbjct: 241 GASLGDIARLMIS-LGAVTAANLDGGSSATLVYDGCLINQPTDILG 285 >UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D0A3_PAESJ Length = 349 Score = 94.0 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 53/172 (30%), Gaps = 22/172 (12%) Query: 79 INSQGQVQMAMNGGIYDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 I + + A+N + + A G+ IE+G K N + E I GV Sbjct: 143 IAKRAKALAAINASGFVDLDGHGNGGASTGVVIEDGVIKSQ-NKNTKEFVAGITKDGVMI 201 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN 192 + + +Q+A P L+ NG R +G G+ Sbjct: 202 TGKYSANEL------VNLGVQYAAGFKPQLIVNGQKMVE-GDGGWGWGPRTAIGQKADGS 254 Query: 193 AVFLLSQQATN------FYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAI 237 +F++ + + + +DG S MY G I Sbjct: 255 IIFVVIDGRQTRSVGASIKEVQDLLYER-GAVNAMCMDGGSSSSMYFNGDNI 305 >UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK2_BACOV Length = 315 Score = 93.6 bits (231), Expect = 5e-18, Method: Composition-based stats. 
Identities = 28/225 (12%), Positives = 69/225 (30%), Gaps = 28/225 (12%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLH-------ALLADINSQGQVQMAMNGGIYDE 96 + T++ + A + V + +NG Y + Sbjct: 77 HIFVATIDLNELTFTPATKDDKNVPATGPESSAPLPIHAFAAEANGKTVWLGVNGDYYAD 136 Query: 97 S-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + +GL+ ++G + + + G YV +A + A Sbjct: 137 NPRRVMGLFYKDGVCINSQYFEGHDEVLYQLKNGETYVGQ------ADEALAHEANLLHA 190 Query: 156 VQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGNAVFL-LSQQA---------TN 203 + +L+++GV+ ++ ++ R VG+++ +++ + Sbjct: 191 LGGYGLLVKDGVVQNFYEEMGDLQNTHPRTSVGLSQDRKTMYVFVVDGRRKDSFFALGLT 250 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 A KA + + LDG S + + P ++ Sbjct: 251 LPHLATMMKA-VGCYNAINLDGGGSTTLII-RKVNDGGKPTFPIL 293 >UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IP98_9BACE Length = 536 Score = 93.2 bits (230), Expect = 8e-18, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 78/277 (28%), Gaps = 72/277 (25%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 TL + L+ + T +V++ + A T+ A N G Sbjct: 242 TLPAEIELYETTSNLNGSNFHAWYAIGDLSTGKVEVRVHIPSSPA--TIDTQSASFN--G 297 Query: 84 QVQMAMNGGIYDESYAPLGLYIEN----------------GQQKVALNLASGEGNFFIRP 127 + +NGG + + G+ + N G + G F + Sbjct: 298 DCYLLVNGGYF-YNGNHTGIAVINSIKSGSVSAVRGSLKTGDTEYNSMYNVTRGTFGVDA 356 Query: 128 GG--------------VFYVA--------GDKVGIVRLDAFKTSKE--IQFAVQSGPMLM 163 G VFY +K GIV + T+ ++A+ +GP+L+ Sbjct: 357 SGKPNVVWTGTDASSNVFYFDRPLPSVKGENKYGIVTNENPTTAISWSPKYALSAGPVLL 416 Query: 164 ENGVINPRIHPNVASSKI--------------------RNGVGINKHGNAVFLLSQQAT- 202 ++ I + R +G + G V + Sbjct: 417 KDKKIPFDFTETSKGTDYYLSNYEIIPYDIFGANVTPDRTAIGYREDGKVVIFICDGRIT 476 Query: 203 -----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 + A K L + LDG S + G Sbjct: 477 ASGGATLTELAQIMKG-LGCVGAINLDGGGSTGMVVG 512 >UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synechococcus sp. 
PCC 7335 RepID=B4WS35_9SYNE Length = 687 Score = 92.8 bits (229), Expect = 9e-18, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 47/168 (27%), Gaps = 26/168 (15%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS----- 158 ++N + ++ + P Y+ + +F+ + + QS Sbjct: 507 TVQNHEVIAQKSMGKAGSSSVPIPRDGGYLLALRSYRSAGQSFQPGTPVLLSSQSQPAVF 566 Query: 159 ---------GPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVFLLSQQAT- 202 GP+L+ + I N + + R VG G + Sbjct: 567 EQYPNMIGGGPLLVRDRNIVLNPQLEGFSTNFIQGAAPRTAVGKTSDGTWIIATMHDRVG 626 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A K +L L LDG S GG + + Sbjct: 627 GRGPTLTETAYIMK-QLGAVDALNLDGGSSSSLYLGGQLLNRHPRTAA 673 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 47/142 (33%), Gaps = 15/142 (10%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P ++ V + V P + + + A + ++ + Q Sbjct: 349 APGLRWRQQYINVNQHRFPVYMFIVRPNPDALTLRPIHAASNTAIGIEPIVTT-AKRAQA 407 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 A+N G ++ + PLG GQ L G +A + G + +D Sbjct: 408 IGAVNAGFFNRNNQLPLGAVRSAGQWISGPILGRG------------AMAWNDSGELVID 455 Query: 145 AFKTSKEIQFAVQ-SGPMLMEN 165 F S+ + V + P+L N Sbjct: 456 RFALSESVTTGVGEAFPILAVN 477 >UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT12_PEDHD Length = 646 Score = 92.4 bits (228), Expect = 1e-17, Method: Composition-based stats. 
Identities = 38/229 (16%), Positives = 70/229 (30%), Gaps = 29/229 (12%) Query: 23 LTLLPL-FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN- 80 +T+ P + + + VN +V M + A Sbjct: 74 ITVAPGVTETDIHYTDTAGKAMHLFILKVNLNEPQVFMEVATPFNLPAYARQTVPAQAAE 133 Query: 81 ---SQGQVQMAMNGGIYDE-SYAPLGLYIENGQQK------VALNLASGEGNFFIRPGGV 130 + V +NG +D + P+G+ +NG L F + V Sbjct: 134 IDTATHMVIAGINGDFFDTSTGIPMGIVHKNGSIVKSTFNDNTLKPQQAVSFFGVTENNV 193 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 + + S ++ + SG ML+ N + + + R VG + + Sbjct: 194 PIID------FKSGYAALSSQLYNSTGSGVMLVNN---HLPVSQPYTAIDPRTSVGYDDN 244 Query: 191 GNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 G F++ N+ A NV+ + LDG S +M Sbjct: 245 GIVYFVVIDGRDAPYSNGMNYAQLTSAFMA-FNVKNAVNLDGGGSSTFM 292 >UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alicyclobacillus acidocaldarius RepID=B7DMS1_9BACL Length = 354 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 70/164 (42%), Gaps = 20/164 (12%) Query: 100 PLGLYIE--NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 P G IE G+ K + G+ I V + +K F A+ Sbjct: 201 PDGYDIEIGAGEAKTPIVTRVHVGDPAILTDTVLALPSEKPV-----PFAAYPN---AIG 252 Query: 158 SGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 +GPML++NG I+ + + +R+ VGI++ G+ +FL +A N + A Sbjct: 253 AGPMLVQNGRIDVEPSLEGLDEPDILNAETLRSVVGIDRAGHLIFLTIHEA-NVWQEASI 311 Query: 211 AKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 AKA L + + LDG S ++ +G + + T I V ++ Sbjct: 312 AKA-LGLWDAMNLDGGSSVGLWYEGRYLTPPKRALATAIVVVQR 354 >UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CYN3_HALOH Length = 833 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. 
Identities = 31/162 (19%), Positives = 56/162 (34%), Gaps = 22/162 (13%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + +P + +NG A + F V G+ +K+I+ A Sbjct: 667 KDGSPGTVIPDNGFIIQAHGQSRQFLKLFKEGDKVVLQNNFGPGLT-------NKDIKMA 719 Query: 156 VQSGPMLMENGVIN-----PRIHPNV-ASSKIRNGVGINKHGNAVFLLSQQA-------T 202 + +GP L++NG I P++ R +GI + + + Sbjct: 720 LGAGPTLIKNGKIYITGKAEGFQPDILRGRAPRTALGITSGNHLIMVTVDGRQPGFSIGM 779 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 + A + K NV Q + LDG S M ++G + Sbjct: 780 TLEELAQFML-KYNVVQAMNLDGGASSRMVVRGYTMNNPSDK 820 Score = 43.1 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 2/69 (2%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 + ++ + + + A+G+ L L + + S + +NGG Y + PLGL+ Sbjct: 521 ITMLDLDLNNDFLYVEPFLASGK-LSGLSDL-SQVVSGKKALAGINGGFYSYTGRPLGLF 578 Query: 105 IENGQQKVA 113 + NG+ Sbjct: 579 MINGEIVSE 587 >UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z4Z5_EUBE2 Length = 388 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 65/225 (28%), Gaps = 19/225 (8%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 D + D + + + +++ G ++ADI + +NGG Sbjct: 161 PDIEIVDVKGATYSGKLMIVKDPSRLFVGTVPEFTNGN-GMVVADIAKRYDAIGGVNGGE 219 Query: 94 YDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 + + P+GL +++G+ S I + + + Sbjct: 220 FVDGETTYTAMPIGLVMKDGEILNDNGGTSHVT--GITFDNKLVLGNMNAAKAKELNIRD 277 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 I + GP L+ NG + + R +G G + L Sbjct: 278 CVSISNHI--GPFLIVNGEAQDIVGIAG-GTNPRTAIGQTADGKILLLAVDGRQPNSIGA 334 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 F D A+ +DG S MY G I P Sbjct: 335 TFSDLQDIM-AQYGAVNASTMDGGTSTQMYYDGEVINVPYSPTGP 378 >UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanothece sp. 
PCC 7822 RepID=B4AZH7_9CHRO Length = 298 Score = 91.3 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 37/273 (13%), Positives = 79/273 (28%), Gaps = 58/273 (21%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYT--------VNPQTERVKMYWQK 63 + +K I ++L + + A + P + A + V Sbjct: 1 MNRLVKLIIISLIMGVVSACTPTQSSSEKPQRSESAVVQPEPLYKVYDLPQSTVHTL-TI 59 Query: 64 ANGEAWGTLHALLADI------NSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNL 116 + L + + A+NGG +D + I G+ Sbjct: 60 PVDSPYQVTVTLARSLETVENLAKKQGAMAAINGGFFDPNNGKTTSYIIHQGKIIADPKN 119 Query: 117 ASGEGNFFIRPGGVFYVAGD-----------KVGIVRLDAFKTSK-----EIQFAVQSGP 160 P Y+ + +F ++ ++ +GP Sbjct: 120 NERL---MKNPDLTRYLDKILNRSEWRRYQCGATVRYSISFHNQPTLTGCQLLDSLGAGP 176 Query: 161 MLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ------- 200 L+ NG + + + R +GI +G+ +++++ Q Sbjct: 177 RLLPEMTAQTEGFIDLVNGTMI-KDALGLKEPNARTAIGITANGDLIWIMAAQKAHSSRA 235 Query: 201 -ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A + K L V++ L LDG S + Sbjct: 236 TGLSLLELAEFLK-TLGVQEALNLDGGSSSTFY 267 >UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFU4_BREBN Length = 359 Score = 90.9 bits (224), Expect = 3e-17, Method: Composition-based stats. 
Identities = 28/218 (12%), Positives = 66/218 (30%), Gaps = 26/218 (11%) Query: 37 ALSDPTLTVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 +L + V +P ++ + + + + + +A+N G Sbjct: 137 SLQEGGYRGYMAKVRLNDPNALKMVL-----ANNSVKSKGETTSQAGKRTGSILAINAGG 191 Query: 94 YDES----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + PLG+ + +G+ + + G ++ A T Sbjct: 192 FMSDKQGNLTPLGITVVDGK-IRTFSNNAKLSFVGFNNKGHLVGTS-----IKTQAQITQ 245 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 + I P L++ G P + + R +G +G+ + ++ Sbjct: 246 QGILQGASFLPRLLQGGKRLPIPREWANARQPRTLIGHFDNGDLLLIVIDGRRDGWSNGV 305 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A + +V LDG S + G + + Sbjct: 306 TLEE-AQRKLQEWHVVDAYNLDGGGSSAFYYNGKLLNK 342 >UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobacillus RepID=C9RVV6_GEOSY Length = 652 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 39/107 (36%), Gaps = 13/107 (12%) Query: 135 GDKVGIVRLDAFKTSK--EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN 192 + + + + ++ A+ L+ +G + P ++ R VGI+K+GN Sbjct: 356 KEGDAVEISLQYDQPEWSGVKEALGGRYRLVADGKVQP---FSIEGVHPRTAVGIDKNGN 412 Query: 193 AVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + ++ + A +L + LDG S ++ Sbjct: 413 VMLIVVDGRQPAYSQGMTLNELAKLM-HELGAVDAMTLDGGGSSTFV 458 Score = 50.4 bits (119), Expect = 6e-05, Method: Composition-based stats. 
Identities = 20/124 (16%), Positives = 39/124 (31%), Gaps = 4/124 (3%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG---TLHALLA 77 ++ + + + V ++ ER+ + +N + G L Sbjct: 137 VSTRIASGVEKEEMEIVGARGKQHVYKLDIDTSNERMAIETALSNDQVLGIEPVLEQAKR 196 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGD 136 G V A+NG + + +P L + G+ A+ F I G + Sbjct: 197 YDGRDGIVLAAVNGDYFKQDGSPTDLMVHRGEIVITNTTPAAERTIFGISADGKPMIGNP 256 Query: 137 KVGI 140 V I Sbjct: 257 DVQI 260 >UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HRE9_9FIRM Length = 487 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 35/103 (33%), Gaps = 13/103 (12%) Query: 150 KEIQFAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVFLLSQQA-- 201 + A+ +GPML++NG I + R +G+ K G + ++ Sbjct: 364 DKTVHALGAGPMLLKNGSIYLTTKIEEFGSDVAGGRAPRTALGLTKDGRVLLVVVDGRQP 423 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A +L + LDG S + + + Sbjct: 424 TSAGMTLLELA-LFLQELGAVDAMNLDGGGSSEMVINDKVVNK 465 >UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacillus cereus group RepID=B7H7U4_BACC4 Length = 365 Score = 90.5 bits (223), Expect = 4e-17, Method: Composition-based stats. 
Identities = 26/186 (13%), Positives = 52/186 (27%), Gaps = 18/186 (9%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDE------SYAPLGLYIENGQQKVALNLASG 119 G ++ + + +A+N + + G+ IENG+ + Sbjct: 135 GTQGANRGEKISVMAKRNHALVAVNASGFADETGRGGGNVATGIVIENGKAIDTNMDRNA 194 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 + G+ K++ A P L+ NG S Sbjct: 195 PTIITGLTKFGQMITGN-----YSTQQLLDKQVVSAAGFMPQLIVNGEKMITEGDGGWGS 249 Query: 180 KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA------KLNVEQLLYLDGTISHMYMK 233 R+ + + G +FL+ + K + + +DG S Sbjct: 250 APRSIMAQKEDGTIMFLVIDGRQT-HSIGATLKECQDILYEKGAINAMAMDGGSSATLYL 308 Query: 234 GGAIPW 239 GG + Sbjct: 309 GGKVIN 314 >UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V4S8_9FIRM Length = 491 Score = 90.5 bits (223), Expect = 4e-17, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 51/176 (28%), Gaps = 24/176 (13%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ--KVALNLASGEGNFFIRPGGVFY 132 + + P G I NG+ + + + G Sbjct: 288 NAERGADNLIIYNRAYGRSTGTNPYGLEYVIRNGRVAEINTNDSLIPPDGYVVSVHGTLM 347 Query: 133 -------VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SS 179 V ++ D + + +GP L+ENG ++ + ++ Sbjct: 348 DAFAAAGVRVGDPAVLTEDLGEPWNRAVQVLGAGPRLVENGSVHVTAGEEQFPGDIRYGR 407 Query: 180 KIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH 229 R VG+ + GN +F + +FA + V + LDG S Sbjct: 408 APRTAVGVTQKGNILFAVVDGRQSHSHGLTLTEFADLL-VQFGVRDAINLDGGGSS 462 Score = 41.2 bits (95), Expect = 0.037, Method: Composition-based stats. 
Identities = 17/110 (15%), Positives = 29/110 (26%), Gaps = 6/110 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L A D +T +P RV+ A ++ I Sbjct: 164 LAAGLTQREYVYADEDGPVTAYFIEADPARYRVR----PALARGIIPGRQTVSGIAQDTN 219 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 A+N + S +G+ +G + F + P F Sbjct: 220 AAAAINASYFALSGELIGITKIDGTVVSSTYFDRSA--FGVMPDNSFVFG 267 >UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=Sphingobacterium spiritivorum RepID=C5PL46_9SPHI Length = 288 Score = 90.5 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 58/202 (28%), Gaps = 19/202 (9%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPLG 102 + ++ Q + + T S+ A+NG ++ ++ Sbjct: 59 EINFIEIDLQKIKQPIRLAG-----LQTGFKNTTTFASEANALAAINGAFFNTKTGGGTT 113 Query: 103 LYIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIVRLDA----FKTSKEIQFAVQ 157 L N Q L G+ R K+ I++ D + ++ + Sbjct: 114 LVRINKQLINETVLKEGKSPKRSFRSNAALAFDTKKIVIIKGDDRDSTWDKKIKMPNVMT 173 Query: 158 SGPMLMENG-VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACY 210 GP+L+ + + R+ + + + + + + + Sbjct: 174 CGPLLLHKSHRAYLDSNAFNNNRHPRSAIALTTEHKLILITVDGRNAQAYGMSLIELSNV 233 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 K L + L LDG S Sbjct: 234 MKW-LKGKDALNLDGGGSTTLY 254 >UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0BZE5_ACAM1 Length = 584 Score = 90.1 bits (222), Expect = 6e-17, Method: Composition-based stats. 
Identities = 29/166 (17%), Positives = 52/166 (31%), Gaps = 24/166 (14%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYV-----------AGDKVGIVRLDAFKTSKE 151 + + N Q ++ +F I G V +G ++ I + Sbjct: 406 ITVINNQVVSEKV-STSTKSFAIPKNGYLLVLRSFDVGGALASGTQLQIQTATTPASFNG 464 Query: 152 IQFAVQSGPMLMENGVIN-----PRI-HPNVASSKIRNGVGINKHGNAVFLLSQQAT--- 202 V +GP+L+ NG + + P S R+G+G G + Sbjct: 465 FPNIVGAGPLLVSNGQVVLNAKAEKFRPPFDTQSAPRSGIGQTADGTILLAAVHNQVSGP 524 Query: 203 --NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 ++A + +L L LDG S GG + + Sbjct: 525 GPTLKEWALIMQ-RLGSVNALNLDGGSSTSLYLGGQLLDRHPVTAA 569 Score = 44.7 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 2/95 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P +L + V +NPQT +K+ N A +H LL+ + QV Sbjct: 249 APGILRQERVISLGNKQYPVTWLALNPQTPGLKLQPIWGNRNALLGIHPLLS-MAQGNQV 307 Query: 86 QMAMNGGIYDESY-APLGLYIENGQQKVALNLASG 119 A+N G ++ + PLG +NGQ + L G Sbjct: 308 AAAINAGYFNRNNKTPLGAIRQNGQWISSPILNRG 342 >UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria RepID=B5VVA8_SPIMA Length = 789 Score = 90.1 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 49/162 (30%), Gaps = 22/162 (13%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + P + NG V + S F + G ++ I + Sbjct: 629 DDQTPTAI-PTNGYLLVFRSFRSAVSAFGV---------GSRLTITATTTPSEFIDFPHI 678 Query: 156 VQSGPMLMENGVINPRIHPNV------ASSKIRNGVGINKHGNAVFLLSQQATN-----F 204 + GP+L++N I IR+ VG+ G + + N Sbjct: 679 MGGGPLLVQNRNIVVNAEAEGFNYWFGQQLAIRSAVGVTATGEVLMVTVHNRVNGAGPSL 738 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A + +L + LDG S + GG + + Sbjct: 739 TEMAKLMQ-QLGAIDAINLDGGSSTSLVLGGHLLNRTPDTAA 779 >UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEQ2_9FIRM Length = 470 Score = 90.1 bits (222), Expect = 7e-17, Method: Composition-based stats. 
Identities = 37/192 (19%), Positives = 63/192 (32%), Gaps = 32/192 (16%) Query: 76 LADINSQGQVQMAMNGGIYDESYAPLGL--YIENGQQKVALNLAS-------------GE 120 L + + + NG G+ I NG+ S G Sbjct: 266 LNRMRLENDLIFYNNGYDDTTDTNAAGVEVAIRNGRVIKTGTTGSMPMSWNMTVLSGHGT 325 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN------PRIHP 174 F+RP V GDKV I + + +GP+L+ +G++N Sbjct: 326 AADFLRPLAV----GDKVKIKTSLGSPLADKAPSVGTAGPLLVYDGLVNVTASLEEIPSD 381 Query: 175 NVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTIS 228 R VGI K G + +++ + A Y +L ++ + DG S Sbjct: 382 IADGRAPRTAVGIKKDGTILVVVADGRSSRSAGMTLPELARYLI-QLGADRAMNFDGGGS 440 Query: 229 HMYMKGGAIPWQ 240 + GA+ + Sbjct: 441 SEMVVNGAVKNR 452 >UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWN0_PEDHD Length = 328 Score = 89.0 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 64/202 (31%), Gaps = 18/202 (8%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG----QVQMAMNGGIYDES 97 + + V+ +T ++++ + L L + A+NG + + Sbjct: 99 PVRIFIMEVDMKTPKLEIQAMAPYNDYINGLQRLSEMCRDNELPGTNIVAAVNGDTFSTT 158 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 AP L+ N + A+G F G + G ++ +I+ AV Sbjct: 159 GAPTSLFYINNRVYYGTV-ATGRTFFAAMKDGTIVIGGKDTK--GVERPVDKAQIKNAVG 215 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-------ATNFYDFACY 210 G + + I + S+ R +G N + ++ D Sbjct: 216 -GNQWLVDNNIKATLTDATISA--RTAIGYNANKVIYAIVVDGSQATYSNGLTLVDLRDI 272 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 A L + + LDG S + Sbjct: 273 M-AALGTKDAVNLDGASSSTLV 293 >UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VX04_9CYAN Length = 681 Score = 88.6 bits (218), Expect = 2e-16, Method: Composition-based stats. 
Identities = 44/239 (18%), Positives = 74/239 (30%), Gaps = 45/239 (18%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ--VQMAMNGGIYDES-----------Y 98 P R + W + +G I + G+ + +N G Y Sbjct: 431 PILNRGAIAWTDQHQFKFGRFSLQETLITANGERFPSLFLNSGYVQAGISRYTPAWGVTY 490 Query: 99 APLG-----LYIENGQQKVAL-NLASGEGNFFIRPGGVFYV--AGDKVGIVRLDAFKTSK 150 PL ++N Q L +GE +F I G D I + T+ Sbjct: 491 TPLTDNEVIWVVQNNQITAQLPGGVAGEESFVIPVNGYLLTHRGHDPNAIAKSLTLGTTV 550 Query: 151 EIQF------------AVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGN 192 +I+ + +GP+L++N I + S IR+ +GI +G Sbjct: 551 QIEQKTLPVEFNDYPHILGAGPLLLQNRQIVLDAKAENFSNAFAQQSAIRSAIGITANGT 610 Query: 193 AVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + N + A + +L L LDG S GG + + Sbjct: 611 LIIAAMHNRVGGRGPNLTETAQLMQ-QLGAVDALNLDGGSSTGLYLGGHLLDRSPHTAA 668 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 32/96 (33%), Gaps = 2/96 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P L D + V+P+ +VK+ ++ L+ + Sbjct: 343 APGIWWRQRTVTLGDHQFPLVWLEVDPKNPQVKLSPMWSHPTTQVGTAPLIKT-AQLWKA 401 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 A+NGG ++ + PLG +G L G Sbjct: 402 AAAINGGFFNRNNQLPLGAIRRDGYWYSGPILNRGA 437 >UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein yomE n=2 Tax=root RepID=YOME_BACSU Length = 644 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. 
Identities = 31/239 (12%), Positives = 65/239 (27%), Gaps = 33/239 (13%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQT-------------ERVKMYWQKANGEAWGTLHA 74 F V + + + V P+T + + T Sbjct: 58 YFTVTSSFKQDATLGIEYYVTKVTPKTTEAKKSMVQKTFAYDFEKSIDPTSSYFGTTNRE 117 Query: 75 LLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 + + + + +A+N + + +GL I++G + A G VF+ Sbjct: 118 TVLSMAKRKRSVVAINASGWRSNGEVMGLQIKDGVLYKDYDAAGYTGAEAC----VFF-D 172 Query: 135 GDKVGIVRLDAFKTS----KEIQFAVQSGPMLMENGVINPRIH---PNVASSKIRNGVGI 187 + + K + + G L+++ ++ R +G Sbjct: 173 DGTMKVYGNREVDADILISKGARNSFAFGIWLVKDSKPRTAQMTTWADLNVKHPRQAIGQ 232 Query: 188 NKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPW 239 G V + YD ++ LDG S ++G I Sbjct: 233 RSDGTLVIITVDGRSLRSSGITAYDMPSLFLSE-GCINAFLLDGGGSSQTAVEGKYINN 290 >UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HTR4_CYAP4 Length = 603 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 45/159 (28%), Gaps = 21/159 (13%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYV---------AGDKVGIVRLDAFKTSKEIQF 154 + + A+ I P G V + I Sbjct: 423 VVNDRVAGQQTASANTPTPILIPPNGYLLVLRDVPLPVFGEGSLQIQMNALPADFNRFPQ 482 Query: 155 AVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQAT-----N 203 + +GP+L+E G I + + R+G+G G + + + Sbjct: 483 ILGAGPLLLERGQIVLNPDLEQFGNGLDAQQAPRSGIGRTSTGQILLVTTHNRIGGAGPT 542 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 ++A K L L LDG S GG + + Sbjct: 543 LAEWAAILK-TLGAVDALNLDGGSSTALYLGGQLLDRHP 580 Score = 43.5 bits (101), Expect = 0.007, Method: Composition-based stats. 
Identities = 16/100 (16%), Positives = 34/100 (34%), Gaps = 2/100 (2%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 + A+ + +NP +++ + + L + Sbjct: 260 NILWSSGVRWREQTLAVGGDRYPLTWLEINPHQAGLQLRPIWNQPDTLVGIQPLPR-LAQ 318 Query: 82 QGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 + QV A+NGG ++ + PLG ++G + L G Sbjct: 319 RWQVAAAINGGFFNRNQQVPLGAIRQSGSWISSPILNRGA 358 >UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z6E6_EUBE2 Length = 360 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 74/225 (32%), Gaps = 34/225 (15%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 +S + + + +P +V + WG L +I S +NGG+Y Sbjct: 119 EISGRSFFGKMLIIKDPSQVKVGTTY------PWGDYGKELHEIVSGAGAIAGVNGGLYV 172 Query: 95 ---DESYAPLGLYIENGQQ-KVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKTS 149 + +PLG+ +++G+ + + SG + + V D + +++ Sbjct: 173 SSGNRGGSPLGIVVQDGKITYNSPSALSGLYLIGLNKDNLLVVKDIDGMSAADFESYVNE 232 Query: 150 KEIQFAVQSG-----------PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 I+ AV P+++ N + + + R +G G + L++ Sbjct: 233 AGIRDAVAFQEESSDSNNHFVPLIINNEARV--LKGQGSGANPRTAIGQRVDGAILLLVT 290 Query: 199 QQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 D + + LDG S + G Sbjct: 291 DGRGASGHLGATASDLISVMQ-EYGAVNAANLDGGSSSTMVYNGG 334 >UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JBU1_9FIRM Length = 291 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. 
Identities = 34/182 (18%), Positives = 63/182 (34%), Gaps = 16/182 (8%) Query: 65 NGEAWGTLHALLADINSQGQVQMAMNGGIYDESY---APLGLYIENGQQKVALNLASGEG 121 + +G +D S + +NG +D +PLG+ I+NG + Sbjct: 100 SNGTYGGSRQTTSDAVSSNGGIIGVNGSAFDYGTGKPSPLGMCIKNGIIYGDYMTSYSV- 158 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 ++ G Y G++ + + + GP+L+++G Sbjct: 159 -MAVKKDGTIYTPAQ--GLMGKNLLAAGVKDTY--NFGPVLIKDGEAQLPWTET-EKYYP 212 Query: 182 RNGVGINKHGNAVFLLSQ----QATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGA 236 R VG+ K + V L++ N +D K+ LDG S +Y G Sbjct: 213 RTAVGMVKPNDYVLLVTDTGSYNGLNHWDMVNIFKS-YGCTYAYNLDGGGSATLYFNGKV 271 Query: 237 IP 238 + Sbjct: 272 MN 273 >UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LVE9_BACOV Length = 332 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 58/212 (27%), Gaps = 50/212 (23%) Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQK--VALNLASGEGN----------FFIRPG 128 + + +NGG + E L L NG+ A N F Sbjct: 112 EENNSTIVINGGFFYEG--SLSLIWRNGEMVCKNNDVTAEDWTNGPFWYPVLAAFCEMND 169 Query: 129 GVF-------------YV----AGDKVGIVRLDAFKTSKEIQFA---VQSGPMLMENGVI 168 G F Y + K + F ++ + A + GP+L+ +G I Sbjct: 170 GSFKSMWTYTTLSNVTYWYSEPSPVKSETTPDENFPSTGTVLNAKTGIGGGPVLLLDGNI 229 Query: 169 NPRIHP------NVASSKIRNGVGINKHGNAVFLLSQQ--------ATNFYDFACYAKAK 214 ++ R+ +GI + + + + A K Sbjct: 230 KNTYEEEILSDIGATVNRPRSAIGITNDKKMILFVCEGDGMTTGVAGMTTENVANIMK-T 288 Query: 215 LNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 L + LDG S M + G Sbjct: 289 LGCTDAINLDGGGSSCMLVNGQETIKTSDSSG 320 >UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 Tax=Nocardioides sp. JS614 RepID=A1SN25_NOCSJ Length = 420 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. 
Identities = 26/170 (15%), Positives = 55/170 (32%), Gaps = 24/170 (14%) Query: 92 GIYDES-YAPLGLYIENGQ--------QKVALNLASGEGNFFIRP-GGVFYVA-GDKVGI 140 GIY G + GQ + +P G+ ++ G+ + Sbjct: 224 GIYTPRWGRTAGYGVTQGQTERVRAVTVVNGRVRTNRAKLSHDQPIKGLLFIGRGEGAKV 283 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGV---INPRIHPNVASS--KIRNGVGINKH-GNAV 194 +R T ++++++Q P + +G ++ I + R VG++ G + Sbjct: 284 LRKLPKHTRIKVRWSLQGRPQMAISGNNFLVHDGIIRAIDDREMHPRTAVGVDSDTGEVL 343 Query: 195 FLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 L+ + A L ++ + LDG S + Sbjct: 344 LLVVDGRQADSRGYTMVELANLMVD-LGADEAVNLDGGGSSTMVGKNRRG 392 >UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PX63_9BACT Length = 294 Score = 87.0 bits (214), Expect = 5e-16, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 68/207 (32%), Gaps = 29/207 (14%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 TV + P+ ++ A+G A + ++ + + + +NG + + Sbjct: 66 TVTVAEITPKRS-LEFDIAIADG------GATVGEMAQRTKALVGINGSYFGMNKRSAIT 118 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR-----LDAFKTSKEIQFAVQS 158 Y+ G+ + + +R G G K+ I+ + A S Sbjct: 119 YLRQGRTVLDTTTTAELA---LRVTGAIRTHGRKLRIMPWNKEIERRYHCRHGSTLA--S 173 Query: 159 GPMLMENGV---INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFA 208 G +L+ G + V R+ + + G +F+ N + Sbjct: 174 GHLLLYRGQSILLRSSSMGFVVKKHPRSAIALTSRGTVLFVTVDGRHPGYAGGMNLIEL- 232 Query: 209 CYAKAKLNVEQLLYLDGTIS-HMYMKG 234 + +L + LDG S ++ KG Sbjct: 233 RHFLQQLGCTDAINLDGGGSTTLWAKG 259 >UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=Bacillales RepID=C6J7B9_9BACL Length = 355 Score = 87.0 bits (214), Expect = 5e-16, Method: Composition-based stats. 
Identities = 43/236 (18%), Positives = 73/236 (30%), Gaps = 37/236 (15%) Query: 34 DDCALSDPTLTVQAYTVNPQTER---VKMYWQKANG-------EAWGTLHALLADINSQG 83 + + ++ Y VNP T R +K+ + + + G + + G Sbjct: 116 PFETIQSDRIRIELYKVNPGTYRGYAMKIRLKSPDAMKMTLGKDRLGGAETTMQAVQRYG 175 Query: 84 QVQMAMNGGIYDESYA--PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 V GG D PL I NGQ F +F+V ++ G + Sbjct: 176 AVAGINAGGFADSRGQRYPLSTTILNGQYVNG---------FEPSYKDLFFVGLNQSGQL 226 Query: 142 RLDAFKTSK-----EIQFAVQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGNAV 194 F+ + + +F P+L++NGV P R +G K + Sbjct: 227 IGGKFQNKESLDKLKPKFGASFVPILLQNGVKLPIPDKWKTSPLRAPRTVIGNYKDDQLL 286 Query: 195 FLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRY 242 L+ + L V+ LDG S + + G I Sbjct: 287 VLVVDGDNEKGRSGATLEELQNKLAN-LGVQDAYNLDGGGSSSLVVNGRVINHPSD 341 >UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LDL7_9FIRM Length = 400 Score = 86.6 bits (213), Expect = 7e-16, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 56/202 (27%), Gaps = 23/202 (11%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWG-TLHALLADINSQGQVQMAMNGGIYDES 97 + + + + +G L+ ++ + + +A+NG Y + Sbjct: 170 EKYGTQISYVLADIYVGDITCLRTAFAQDTYGVGYSEKLSGMSDRMKAVLAVNGDSYSNN 229 Query: 98 Y-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 G I NG + + V G + A Sbjct: 230 RHRNNGTIIRNGVIYRSQATDAETC--------VLNWDGTMDIYTPDQMDIQKLIERGAY 281 Query: 157 QS---GPMLM-ENGVINPRIH--PNVASSKIRNGVGINKHGNAVFLLSQQA------TNF 204 QS GP L+ ENG + S R +G + G+ LL Sbjct: 282 QSWVFGPSLLDENGKAKDSFLTWDYIRQSHPRTAIGYYEPGHYCLLLVDGRQKASRGMFL 341 Query: 205 YDFACYAKAKLNVEQLLYLDGT 226 + A +L + LDG Sbjct: 342 DEMAQLF-EELGCKAAYNLDGG 362 >UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellaceae RepID=C9KQW2_9FIRM Length = 503 Score = 86.6 bits (213), Expect = 7e-16, Method: Composition-based stats. 
Identities = 22/116 (18%), Positives = 40/116 (34%), Gaps = 13/116 (11%) Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINK 189 ++ + + + F + GP L+ENG ++ ++ R+ VGI K Sbjct: 370 GDPVMIEENLGDGWQNMDFIIGCGPRLVENGRVHVTVDEEDFPADIRIGRAPRSAVGITK 429 Query: 190 HGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 G + + D+A K + L LDG S + G + Sbjct: 430 DGRYLLAVVDGRQSHSVGLTLTDWAKLL-VKFGAQDALNLDGGGSSDLVVNGDVQN 484 >UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VRM0_9FIRM Length = 361 Score = 86.6 bits (213), Expect = 8e-16, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 49/151 (32%), Gaps = 11/151 (7%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G I G+ V + + V + + ++ V +GPM Sbjct: 207 GYIIYFGKDSVDKSYIDQRFKLGRKVELVLVDSKGNETFKYNGQDISYSKVTELVAAGPM 266 Query: 162 LMENGV-INPRIHPNVASSKI------RNGVGINKHGNAVFLLSQQATNFYDFACYAKAK 214 L++NG + N KI R+ +GI K+G + L + N A Sbjct: 267 LLQNGKNVVAESKNNYKEGKINSATGQRSAIGITKNGKVILLTA--VANVDKLALIMND- 323 Query: 215 LNVEQLLYLDGT-ISHMYMKGGAIPWQRYPF 244 L + LDG S ++ G I Sbjct: 324 LGCIDAMNLDGGASSALFANGKVIKNAGRNL 354 >UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RLV8_ACIFE Length = 477 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. 
Identities = 28/137 (20%), Positives = 49/137 (35%), Gaps = 18/137 (13%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGI 187 GD V + + + +GP L+ +G + I ++A R GVGI Sbjct: 342 TGDPVKVTQTLGNAAADSAPSVGSAGPQLVRDGRVQVTSEEEEIADDIALGRAPRTGVGI 401 Query: 188 NKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQ 240 K G + +++ +F Y +L ++ + DG S M + G + Sbjct: 402 KKDGTVLVVVADGRSDDSVGMTLTEFGRYF-VQLGADRAMNFDGGGSSEMVVNGKIMNDP 460 Query: 241 RY----PFVTMISVERK 253 P + V RK Sbjct: 461 SDGTERPVRVALGVFRK 477 >UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YXN3_9CYAN Length = 775 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 53/167 (31%), Gaps = 24/167 (14%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVF------------YVAGDKVGIVRLDAFKTSK 150 + +EN Q + + I G + G K+ I Sbjct: 596 ITVENNQLSRQIESNDDQTPIEIPQNGYLLTFRSFRSALSAFPLGGKIAITAKTTPSEFN 655 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNV------ASSKIRNGVGINKHGNAVFLLSQQATN- 203 + + +GP+L++ G I IR+G+G+ +G+ + + Sbjct: 656 QYPHILGAGPLLLQQGQIVVDAEAEGFNIWFAKQRAIRSGIGVTANGDLLIVTVHNRVGG 715 Query: 204 ----FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A + +L + L LDG S + GG + + Sbjct: 716 PGPDLTELAQLIQ-QLGAVEGLNLDGGSSTSLILGGHLLNRTADTAA 761 >UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC335A Length = 366 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. 
Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 31/209 (14%) Query: 51 NPQTERVKMY--WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY----APLGLY 104 +P V W N +G L ++ + +NGG Y P GL Sbjct: 139 DPSKVSVATIYPWSDENKSKYGV---TLGELVTNAGAIAGINGGEYCSDGNWGGRPKGLV 195 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKTSKEIQFAVQS----- 158 + NG+ + + G+ + + + + + +++ ++ I+ V Sbjct: 196 VSNGELQYN-SPQWGDVMVGFNEDNILVIKDLNGMSVGQIEEMVKTERIRDCVSFKDIDD 254 Query: 159 -----GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 L+ NG + I+ + + + R +G G + ++ D Sbjct: 255 GDSNHFTKLIING-VATEINGSGSGANPRTCIGQRADGTVLMFVTDGRGASGHIGATAAD 313 Query: 207 FACYAKAKLNVEQLLYLDGT-ISHMYMKG 234 K + +DG S MY KG Sbjct: 314 LISVMK-EYGAVNAANIDGGSSSSMYYKG 341 >UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HPJ4_CYAP4 Length = 338 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 68/219 (31%), Gaps = 18/219 (8%) Query: 27 PLFAVAADDCALSDPTL-TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P V + L+ P L +N T +K + L ++ + + Sbjct: 53 PFVGVTYINRRLTSPRLLNQHIVLINLATTGLKFRVTSPAADGSTALEKTIS-FTRRSKA 111 Query: 86 QMAMNGGIY----DESYAPLGLYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGDKVGI 140 Q+ +NG + LGL +G+ + N G NF F +G Sbjct: 112 QIGINGNFFQALSSTRAKVLGLAASSGRVYSSWSNGYQGAINFSSNRTATFVTPPSGLG- 170 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 T + + P+L++NG N R+ +G+ ++ + Sbjct: 171 TTTVPLLTPYNLVSGL---PVLVKNGQNVTVGVANPNEYAARSVIGLTQNQQLLLFAVDG 227 Query: 201 A-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 N + A + V + LDG S + Sbjct: 228 PRSNVSTGMNQIELADLLISDFKVVHAVNLDGGGSSTLV 266 >UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E45D54 Length = 447 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. 
Identities = 37/220 (16%), Positives = 59/220 (26%), Gaps = 27/220 (12%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYA 99 VN V + +G A + Q +A+N G ++ S A Sbjct: 102 ERAPGHIVRVNSPARTVSVLEPFDSGGCTNHHRATVDSTAKQDNCLVAVNAGFFNPRSGA 161 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G + NG+ N + IR G + IQ G Sbjct: 162 CYGNVVSNGRLV-QTNGGLQNAHLGIRADGTLVFGYLSE---ENVLQTENPFIQLVGGVG 217 Query: 160 PMLMENGVINPRIHPNVA---------------SSKIRNGVGINKHGNAVFLLSQQ---- 200 L+ +G I R VG ++ G V + Sbjct: 218 W-LLRDGEIYVEESKKAECGDTEEASSVDLFFNMLSARVAVGSDEKGRLVIAVIDGQTLK 276 Query: 201 -ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + FA + + V + DG S ++ G I Sbjct: 277 RGLSLLSFAKWLLSH-GVTNAINFDGGGSATFVVNGTIVN 315 >UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chroococcales RepID=B7KAU9_CYAP7 Length = 644 Score = 85.1 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 65/237 (27%), Gaps = 43/237 (18%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ--VQMAMNGGIYDES-----------Y 98 P R + W G L I + G + +N G Y Sbjct: 395 PILNRGAIAWNDRGQVKMGRLRLQETVITNGGNRLPVLYLNSGYVQSGMARYTRDWGATY 454 Query: 99 APLG-----LYIENGQQKVALNLASGEGNFFIRPGG--VFYVAGDKV-----GIVRLDAF 146 PL + ++N Q N P + + + V I Sbjct: 455 TPLSDDELIITVQNNQVISQRQGGKAGQNVIPIPNDGYLLAIRKNSVPASALTIGTSLNL 514 Query: 147 KTSK------EIQFAVQSGPMLMENGVIN-----PRIHPNVASSKI-RNGVGINKHGNAV 194 ++ + +GP+L+ NG I + + K R+ + + G + Sbjct: 515 ESGTIPADFNNYPHILGAGPLLLLNGQIVLDVASEQFSKGFQNQKASRSAIATTRDGKLM 574 Query: 195 FLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + + + A ++ L L LDG S GG + + Sbjct: 575 VVAVHNRVGGSGASLPELAQILQS-LGAVDALNLDGGSSTSLALGGQLIDRSPVTAA 630 Score = 46.6 bits (109), Expect = 9e-04, Method: Composition-based stats. 
Identities = 16/99 (16%), Positives = 32/99 (32%), Gaps = 2/99 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T P L + V ++ ++ + + +N + + I Sbjct: 304 ITWTPGLIWRQKIIPLKGDSFPVTWLDIDLKSPNIFLKPVTSNPDTLEG-TEPIVTIGRN 362 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 A+NGG ++ + PLG N + L G Sbjct: 363 TTASAAINGGFFNRNNRLPLGAIRTNNRWVSGPILNRGA 401 >UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C43112 Length = 762 Score = 84.7 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 53/158 (33%), Gaps = 25/158 (15%) Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF------KTSKEIQFAVQSGPMLMENG 166 N + F I G + V + ++ +F + +GP L+ NG Sbjct: 249 GANATIPKDGFVISANGGPFRDALTGVSVGDELTVEASINDAWRDAEFILATGPTLVRNG 308 Query: 167 VINPRIHPNVA---SSKIRNGVGINKHGNAVFLLS--------QQATNFYDFACYAKAKL 215 + + + R VG + G +FL++ + A Y ++ + Sbjct: 309 QTSISMSTSSPFARERAPRTAVGASSDGTKLFLVTIDGRQSGYSNGVTIPELAAYMRS-I 367 Query: 216 NVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + LDG S + RYP+ +SV + Sbjct: 368 GAHNAINLDGGGSTTMV-------ARYPWADHVSVVNR 398 Score = 42.0 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 43/108 (39%), Gaps = 6/108 (5%) Query: 45 VQAYTVNPQTER--VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPL 101 VQ V + +++Y+ G T +A+ +V A+N Y+ + P+ Sbjct: 69 VQVLDVQYRNPNVGLELYYPTPIGRVQTTSQQAMANTYENHRVVGAVNASFYNMSNGMPV 128 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 L +EN + L++ +G F ++ G + L F+T Sbjct: 129 NLLVENNKILNYGVLSNDQGGPV---NAPFAFGVNRNGALTLTDFQTK 173 >UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CAS6_ACAM1 Length = 279 Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats. 
Identities = 36/224 (16%), Positives = 66/224 (29%), Gaps = 45/224 (20%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 TV + P R + +G T+ +NGG +D + Sbjct: 34 TVHVLRI-PNHPRYTVRLDVVDG--LQTVADFAQGTPKP---VAVINGGYFDPANQLTTS 87 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYV--------------AGDKVGIVRLDAFKTS 149 YI G Q +A + P Y+ + Sbjct: 88 YIRRGGQILADPTQNSR--LVDNPDLKVYLPKILNRSEFRQYQCGAKTTYAITSYNQPIP 145 Query: 150 KE--IQFAVQSGPMLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNAV 194 + + +A+ +GP L+ +G + R R+ VGI G + Sbjct: 146 PDCTLNYALGAGPQLLPQLTSQAEGFTDSVDGQVI-RDAIGSRQPNARSAVGITDKGEVI 204 Query: 195 FLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 ++L +Q + + A + + + L LDG S + Sbjct: 205 WVLVEQQSATKPGLSLPELADFMEQQ-GAASALNLDGGSSSSLV 247 >UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=cyanobacterium UCYN-A RepID=UPI0001C3370C Length = 438 Score = 83.9 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 43/129 (33%), Gaps = 12/129 (9%) Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI-----NPRIHPNVA-SSKIRNG 184 + G + I K ++ + GP+L+ +G I + + + R+ Sbjct: 301 LFFIGSTLKIESKTVPKKFNQLSHILGGGPLLINDGSISLNVKDEKFTKSFQKQKASRSA 360 Query: 185 VGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 +GI + + N + A + KL L LDG S + GG + Sbjct: 361 IGITNKDKTILVTVHNSINSNGVNLNEMAQIMQ-KLGSINALNLDGGGSTSLVLGGRLID 419 Query: 240 QRYPFVTMI 248 + I Sbjct: 420 RFPVTAAKI 428 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. 
Identities = 18/94 (19%), Positives = 36/94 (38%), Gaps = 4/94 (4%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PL 101 V ++ ++ +V + + + L +I + +V A+NGG ++ + PL Sbjct: 121 FPVNLLEIDNKSSKVILRPIT-SNLNGQIGTSSLEEIAKKWRVVAAINGGFFNRNNRLPL 179 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 G N + L G G G F++ Sbjct: 180 GAIRHNNDWLSSPIL--GRGAVGWNENGKFFIDH 211 >UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=29 Tax=Chordata RepID=NAGPA_HUMAN Length = 515 Score = 83.9 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 64/224 (28%), Gaps = 33/224 (14%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI---NSQGQVQMAMNGGIY 94 D + E ++ + G G A + ++A NGG + Sbjct: 85 FRDRAVAGHLTR---AVEPLRTFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFF 141 Query: 95 DES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ 153 + LG + + ++ + F IR G + + T Sbjct: 142 RMNSGECLGNVVSDERRVSSSGGLQNA-QFGIRRDGTLVTG----YLSEEEVLDTENPFV 196 Query: 154 FAVQSGPMLMENGVINP---------------RIHPNVASSKIRNGVGINKHGNAVFLLS 198 + L+ NG I V R +G ++ G V + Sbjct: 197 QLLSGVVWLIRNGSIYINESQATECDETQETGSFSKFVNVISARTAIGHDRKGQLVLFHA 256 Query: 199 QQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 N ++ A + + +V + LDG S ++ G + Sbjct: 257 DGHTEQRGINLWEMAEFLLKQ-DVVNAINLDGGGSATFVLNGTL 299 >UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CND1_9FIRM Length = 454 Score = 83.9 bits (206), Expect = 5e-15, Method: Composition-based stats. 
Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 20/186 (10%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPL-GLYIENGQQKVALNLASGEGNF 123 G +G ++ + + +NG + S P G + + ++G Sbjct: 233 GGTYGNPRRTVSQELADHNGVLGINGSGFSYSSGIPAPGKSMIKDRTVYEDVYSNGNIMC 292 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV---INPRIHPNVASSK 180 GG+F G+ + + + + GP L+ENG I+ + Sbjct: 293 VTGEGGMFTAP---AGMTVQEMLQRDVKDTYC--FGPTLVENGEAFEISEQFQQTY--RY 345 Query: 181 IRNGVGINKHGNAVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 R VG+ G+ ++ + + L+ E LDG S + Sbjct: 346 QRTAVGMISPGDYYLVIVDGKGVGGSQGMTYEELQQVFLD-LDCEYAYNLDGGGSTTLVF 404 Query: 234 GGAIPW 239 G + Sbjct: 405 KGRVIN 410 >UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I1S0_DESAP Length = 345 Score = 83.9 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 65/223 (29%), Gaps = 35/223 (15%) Query: 37 ALSDPTLTVQAYTVNPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 L V P +++ +++ GE + G V GG Y Sbjct: 122 ELKGIGYRGYIAKVKPFDPGVLRVTYREGPGET------TSEAVRRTGAVLGVNGGGFYR 175 Query: 96 ------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK-- 147 P+G + +G+ G F +F+ D G + F Sbjct: 176 APVDGLMHTLPIGNTMVDGKLV---------GGFQPPREDLFFAGFDGRGRLVGGIFNDR 226 Query: 148 ---TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA--- 201 + V P+L+++ P + R +G +G+ + ++ Sbjct: 227 TALLGTGARQGVSFVPILIKDRQPVPIPEKWRNQRQPRTILGEYANGDLIMIVVDGRQAD 286 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 D K V LDG S +++ G I + Sbjct: 287 WSSGVTLEDL-QVTLIKFGVIDAYNLDGGGSSVFVFGNQILNR 328 >UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 Tax=Trichoplax adhaerens RepID=B3RIP6_TRIAD Length = 344 Score = 83.6 bits (205), Expect = 6e-15, Method: Composition-based stats. 
Identities = 38/205 (18%), Positives = 65/205 (31%), Gaps = 31/205 (15%) Query: 48 YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIE 106 +P + Q G L + +A N G ++ E+ G I Sbjct: 12 VVEDPLRTISVLEPQNTGGCNMSKLSTVADTARKAH-CYVAENAGFFNTETGGCYGNIIS 70 Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM-LMEN 165 NG+ N+ + NF IR G V G + + + + SG + L+ N Sbjct: 71 NGRLVRLTNVQNV--NFGIRKNGSIIV-----GYLTEEEILDKENPFVQLVSGVIWLVRN 123 Query: 166 GVINPR---------------IHPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFY 205 G + + + R +G +++GN + + + N Y Sbjct: 124 GKSYVKESMKMESNKHEETGTLKQFIEVKSARTAIGHDRNGNVMLMQIEGQTNARGLNLY 183 Query: 206 DFACYAKAKLNVEQLLYLDGTISHM 230 DFA + LDG S Sbjct: 184 DFAKKLIKS-GFVNAINLDGGGSST 207 >UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LSB3_9FIRM Length = 475 Score = 83.2 bits (204), Expect = 9e-15, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 36/97 (37%), Gaps = 13/97 (13%) Query: 155 AVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQAT------ 202 + +GPML+++G+ + P++A R G+ G+ + + Sbjct: 361 VIGAGPMLVKDGIAHVTATEEEFPPDIARGRAPRTAFGVTAEGHYLLAVVDGRQPHSIGC 420 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + A + + Q + DG S + GG + Sbjct: 421 TLQEMAEFML-QFGAVQAINFDGGGSSALVVGGELEN 456 >UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74396_SYNY3 Length = 610 Score = 82.8 bits (203), Expect = 9e-15, Method: Composition-based stats. 
Identities = 26/136 (19%), Positives = 49/136 (36%), Gaps = 17/136 (12%) Query: 127 PGGVFYVAG--DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------AS 178 P GV V + G +AF + +GP+L++ G + Sbjct: 471 PAGVLAVGTTLNVNGRSTPEAFNAFPN---GMGAGPLLIDQGRMVLNATGEGFSSAFQQQ 527 Query: 179 SKIRNGVGINKHGNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 R+ + ++++GN + + S + +FA + +L L LDG S Sbjct: 528 RASRSAIAVDRNGNIILVASHNRVGGAGASLGEFAQILQ-QLGAVNALNLDGGSSTSLAL 586 Query: 234 GGAIPWQRYPFVTMIS 249 GG + + +S Sbjct: 587 GGQLLDRSPVTAARVS 602 Score = 42.3 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 28/76 (36%), Gaps = 2/76 (2%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 +S V T+NP++ + + AN A L I + + Sbjct: 278 GITWQQRFVNISGGQFPVTTVTINPRSPGISLRPLMANP-TMAQGTAPLVTIARDQRAAV 336 Query: 88 AMNGGIYDESYA-PLG 102 A+N G ++ + PLG Sbjct: 337 AINAGFFNRNNQLPLG 352 >UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdemania filiformis DSM 12042 RepID=B9YC35_9FIRM Length = 368 Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 61/218 (27%), Gaps = 26/218 (11%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 L T + V +P V +G ++ + MN G + Sbjct: 147 IDLKGTTFEGKLMIVHDPSRVFVACNPNMDSGAPGYSVEKYIEL----NDAIAGMNAGGF 202 Query: 95 DESY------APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 +++ G+ I +G+ + + I V Sbjct: 203 EDAGGNGNGGTAYGIVIHDGKLISG-SPSEFTPVIGINNANQLVVGD------MTAQQAL 255 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 +I+ AV GP+ ++N + + R +G G + ++ Sbjct: 256 DYDIRDAVTFGPVFIKNWEVVFESGRH-PGLNPRTVIGQRYDGAFLLMVLDGRQPSSFGS 314 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + D + + + LDG S + + G Sbjct: 315 TYQDIIDIMQ-QYDAVNAANLDGGNSTVMVYDGETLNT 351 >UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. 
JDR-2 RepID=C6CV17_PAESJ Length = 355 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 59/217 (27%), Gaps = 19/217 (8%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + A + ++ + G LA + G V GG D Sbjct: 132 IKADNFQSYAMKIKLKSGDAMKMVLGND--KVGGAETTLAAVQRYGAVAGVNAGGFADGG 189 Query: 98 YA--PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 PL I NG + FF+ + G K + ++F Sbjct: 190 GKRYPLSTTILNGDYVEGFEP-TRADLFFVGLNASNKLVGGKF---TSKQQLDNLNVKFG 245 Query: 156 VQSGPMLMENGVI--NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 P+L++NG P + + R + K G + +++ + Sbjct: 246 ASFVPVLLKNGSPTTIPSKWQSSPTRAPRTVIANYKDGQLLIIVADGRNEGGSSGATLAE 305 Query: 207 FACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRY 242 +L LDG S M G I Sbjct: 306 M-QILLQRLGAVDGYNLDGGGSSSMIWNGRVINKPSD 341 >UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB3_CYAP4 Length = 304 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. 
Identities = 43/275 (15%), Positives = 78/275 (28%), Gaps = 49/275 (17%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA 64 L+IG G+I + T L + Y++ + Sbjct: 11 LVIGLGLIGSTTACTQTSTTASSAPVAPTPPQPLQ-----YKVYSLPHSKIHTLVI---P 62 Query: 65 NGEAWGTLH------ALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQK-VALNL 116 G + LA Q Q +NGG +D + + G+Q + Sbjct: 63 AGSTYEVTAAIAPDVQPLATFAQQHQAIAVLNGGFFDPVNGKSTSHVVLAGKQVANPQDN 122 Query: 117 ASGEGNFFIRP-----------GGVFYVAGDKVGIVR-LDAFKTSKEIQFAVQSGPMLME 164 N + P + I R E+ A+ +GP L+ Sbjct: 123 ERLIQNPDLIPYLPLILNRSELRNYRCAGQIRYEISRHDKPIPPGCELLMALGAGPQLLP 182 Query: 165 N------------GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ--------QATNF 204 G R R+ +G+ G+ V+L+ + Sbjct: 183 QNTSVQEGFMAYSGETITRDSLGSLYPNARSAIGLKADGSLVWLMVAERSDANQPGGLSL 242 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + A + ++ L V + + LDG S + G + Sbjct: 243 PELAQFMQS-LGVVKGMNLDGGSSASFYYQGQTHF 276 >UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67T45_SYMTH Length = 921 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 42/108 (38%), Gaps = 10/108 (9%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 ++ K G +++ S + +A+ L+ +G + + + AS + R+ VG + G Sbjct: 240 FLDPLKPGDPVTVSYRPSPAVAWAIGGQNYLVRDGAVVSGL--DNASRRPRSAVGFSADG 297 Query: 192 -NAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 L+ + + A + K+ L LDG S + Sbjct: 298 RRMYLLVIEGDSSRSVGATLAEMAAFMKS-FGAANALELDGGGSSTIV 344 >UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellum RepID=A3DHF5_CLOTH Length = 929 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. 
Identities = 24/149 (16%), Positives = 45/149 (30%), Gaps = 24/149 (16%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK------------- 150 ++NG + G I G ++ L FK Sbjct: 213 VVDNGTVV---EIRQGLPAVEIPQNGYVIISRGANAQFLLQHFKVGDPVEISFSTVLDWQ 269 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TN 203 +I+ AV +L+++G I + ++ R G +K G + + Sbjct: 270 KIEMAVTGSAILVKDGQIPEKFSYEISGVHPRTAAGTSKSGKELILVTVDGRQAASKGMT 329 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + A + L + LDG S + Sbjct: 330 QRELANLMLS-LGAYNAINLDGGGSTSMV 357 >UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium hafniense RepID=B8G1I8_DESHD Length = 747 Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 58/218 (26%), Gaps = 31/218 (14%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY---DESYA--PL 101 +P+ + + E L ++ G + GGIY +E P Sbjct: 530 MLISDPKRVTLAV-----TEEIGTVEEKLTDMVSRSGAIAGINAGGIYLSLEEGNEVFPD 584 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G+ ++NG+ + G V + K IQ V P Sbjct: 585 GITVQNGEVVYNNAGDQAVEFIGLDAEGKLITGPMNVQEI------KEKNIQEGVGFSPP 638 Query: 162 LMENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFA 208 L +NG R R G+G G +F++ D Sbjct: 639 LADNGTTLVREGKPAVPGDGGWGIAPRAGIGQRADGTLIFMVIDGRDPDWSIGATLKDME 698 Query: 209 CYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + + + L G M G + F Sbjct: 699 NLFL-EYGAVEAVNLSGGSMVEMVYDGKVLNKVSNIFG 735 >UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C16068 Length = 613 Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats. 
Identities = 35/244 (14%), Positives = 65/244 (26%), Gaps = 57/244 (23%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADI------NSQGQVQMAMNGGIYD---------- 95 P R + W GE + +L + + +N G Sbjct: 367 PILNRGAIAWNYQ-GEFYFGRLSLNETLIVDQDNKQTSLPVLFLNSGYVQNGIARYTFAW 425 Query: 96 -ESYAPLG-----LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI--------- 140 +Y PL + ++NG+ + G I G + Sbjct: 426 GPNYVPLTNNETIITVQNGKI---TKQSPPGGAISIPGDGYLLILRGTAVSKTSLLSVGT 482 Query: 141 -------VRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGI 187 F T I + +GP+L++N I + + +R+ + Sbjct: 483 KVNLESSTTPGEFNTYPHI---IGAGPLLIQNQRIVVDAKAEKFSQAFIKERAVRSAICT 539 Query: 188 NKHGNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 + N + + A + K+ L LDG S GG + + Sbjct: 540 TNNDNLILAAVNNRVGGWGPTLEEHAQLMQ-KIGCTNALNLDGGSSTSLYLGGQLLDRFP 598 Query: 243 PFVT 246 Sbjct: 599 NTAA 602 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 58/200 (29%), Gaps = 34/200 (17%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T L + V +N +T + + N + L + Sbjct: 276 ITWSKGLRWQQKFINLDKDSFPVVWLEINRKTSGLNLQPILPNPQTQTGTAPLTLT-AQR 334 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASG------EGNFF---IRPGGVFY 132 A+NGG ++ + PLG +N Q L G +G F+ + Sbjct: 335 YSAMAAINGGYFNRNNQLPLGAVRQNDQWISGPILNRGAIAWNYQGEFYFGRLSLNETLI 394 Query: 133 VAGDK-----VGIVRLDAFKTSKEIQFAVQSGP-----------MLMENGVINPRIHPNV 176 V D + + + ++ GP + ++NG I + P Sbjct: 395 VDQDNKQTSLPVLFLNSGYVQNGIARYTFAWGPNYVPLTNNETIITVQNGKITKQSPPGG 454 Query: 177 ASSKIRNGVGINKHGNAVFL 196 + I G + L Sbjct: 455 -------AISIPGDGYLLIL 467 >UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YL57_9CYAN Length = 620 Score = 81.6 bits (200), Expect = 3e-14, Method: Composition-based stats. 
Identities = 22/106 (20%), Positives = 36/106 (33%), Gaps = 12/106 (11%) Query: 154 FAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVFLLSQQAT----- 202 + +GP+L+++G I R IR+ VG + + Sbjct: 504 QILAAGPLLLQSGEIVLDAPSERFSEAFSNQQAIRSAVGRTPDNKLLLVAVHNRPLGSGP 563 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 N + A + KL + L LDG S GG + + I Sbjct: 564 NLTELAQILQ-KLGAVEALNLDGGSSTSLYLGGELIDRPAQTAAPI 608 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 35/109 (32%), Gaps = 11/109 (10%) Query: 22 ALTLLPLFAVAADDCALSDP---------TLTVQAYTVNPQTERVKMYWQKANGEAWGTL 72 + +P + + V ++ ++ + + + L Sbjct: 270 NILWMPGLRWRQQYIEIPNSQPTASSLPNRFPVFWLEIDLTAPQLSLKPILSRNTSRVGL 329 Query: 73 HALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 LL S+ Q A+NGG ++ + PLG G+ + L G Sbjct: 330 APLLKT-ASRSQALAAINGGFFNRNTLFPLGAIRRQGRWLSSPILNRGA 377 >UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax=Rhizobium etli RepID=UPI00019088BB Length = 332 Score = 81.2 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 65/203 (32%), Gaps = 17/203 (8%) Query: 28 LFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 F VA A + ++P R ++ + + + Sbjct: 91 GFEVAELPVLADGREVDRIFLSRIDPMRFRFVVHNASQGDK---GIDEWEHALPK---AV 144 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 + +NG YD P +I G + G FF + D Sbjct: 145 LIVNGSYYDMHGRPDTPFISEGVAMGPRQYDAKAGAFFADAASADIRD-----LTHQDWG 199 Query: 147 KTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNF 204 A+ S P+L+ ++G + ++ R V + G + +++A + Sbjct: 200 SALAGATNAMVSYPLLIGDDGQTH--VNVKSRWLANRTFVAKDGSGRILIGTTKEAFFSL 257 Query: 205 YDFACYAK-AKLNVEQLLYLDGT 226 A + K + L+++ L LDG Sbjct: 258 DRLAEFLKASPLDLKVALNLDGG 280 >UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J956_DESRM Length = 480 Score = 80.9 bits (198), Expect = 4e-14, Method: Composition-based stats. 
Identities = 27/137 (19%), Positives = 47/137 (34%), Gaps = 16/137 (11%) Query: 119 GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG-----VINPRIH 173 G G+ + GV G K ++ I+ + PML+E G +N + Sbjct: 211 GWGSSAGQLVGV--AEGTKARVITEMPEDWQ-NIRHVLTGSPMLVEGGLPVDQAVNEGLW 267 Query: 174 PNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTI 227 +V R +G+ G + ++ + A L Q + LDG Sbjct: 268 GSVLKYSPRTALGVTAQGKVLLVVVDGRQESSAGLTLEEMAYLMID-LGAVQAVGLDGGG 326 Query: 228 SH-MYMKGGAIPWQRYP 243 S M++KG + Sbjct: 327 SSEMWVKGKIVNNPSDK 343 Score = 40.4 bits (93), Expect = 0.052, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 41/131 (31%), Gaps = 17/131 (12%) Query: 24 TLLPLFAVAADDCAL------------SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT 71 L + A AAD A + V+P + ++ G Sbjct: 16 MLWAVPAWAADQLAKGVQYRSFERNNWEGKPIKGHILEVDPGVKYTEIR--PVMGNEVFG 73 Query: 72 LHALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 L+ + + A+NGG +D PLG I +G+ + + +F + G Sbjct: 74 QRENLSKMAQRTGAIAAVNGGFFDMGSGVPLGNLIIDGKPEY--ISDILKTSFGFKTSGG 131 Query: 131 FYVAGDKVGIV 141 + I Sbjct: 132 LKLGYLAPKIT 142 >UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BF40_9BACI Length = 657 Score = 80.5 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 45/116 (38%), Gaps = 11/116 (9%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKH 190 + ++ K ++ + SGP+L+ NG ++ + PN R V I+K Sbjct: 254 KPGDTVEIAINIDDKWKNSEYMLASGPLLVNNGKVDLGMDPNSTRARERAPRTAVAIDKT 313 Query: 191 GNAVFLLS-QQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + VFL++ N +FA Y KL + L LDG S + Sbjct: 314 MSKVFLVTVDGRLAESKGMNLTEFAQYL-VKLGAYKALNLDGGGSTAIIARKNGND 368 >UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8W171_9BACI Length = 750 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. 
Identities = 31/145 (21%), Positives = 59/145 (40%), Gaps = 18/145 (12%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF-------AVQ 157 I N ++ N + G+ FI G + G GI D + +I+ + Sbjct: 229 ITNMKEYGRRNASPIPGDGFIISGHGNRLDGLLDGIRAGDDIEVKVDIEDRWKDAEMIMA 288 Query: 158 SGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLLSQQA-------TNFYDF 207 +GP+L++NG ++ + + ++ R+G+GI+ GN +F+ F Sbjct: 289 TGPLLVQNGRVDITMSSSASTYSVPNPRSGIGIDAQGNTMFVTVDGRQSGYSQGMTIPQF 348 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYM 232 A Y + + + LDG S + Sbjct: 349 ANYMRDQ-GAVMAINLDGGGSTTMV 372 >UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=1 Tax=Naegleria gruberi RepID=D2V2G1_NAEGR Length = 558 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 56/193 (29%), Gaps = 36/193 (18%) Query: 62 QKANGEAWGTLHALLADINSQGQ----VQMAMNGGIYD-ESYAPLGLYIENGQQKVALNL 116 + G + + A + DI N G ++ + LG + +G+ Sbjct: 189 RDLRGGCYYNVTAPVRDIAKYHANGYFCHYTTNAGFFNTHKHTCLGNVVSDGRISH--VS 246 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV 176 + NF I G +++ D ++ + L+ G + Sbjct: 247 TNHNVNFGITKDGKYFIGY-------TDENTKLEDFDQMISGVIWLVRKGESYVDESSKI 299 Query: 177 AS---------------SKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKL 215 R+ +G +K G V + Y+ A +L Sbjct: 300 EDMSIQETGNAKRFITVRASRSALGHDKEGRLVLVSIDGDGNHNKGPTLYELATLMI-EL 358 Query: 216 NVEQLLYLDGTIS 228 VE + LDG S Sbjct: 359 GVENAINLDGGGS 371 >UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L611_BACV8 Length = 308 Score = 80.1 bits (196), Expect = 6e-14, Method: Composition-based stats. 
Identities = 33/231 (14%), Positives = 63/231 (27%), Gaps = 38/231 (16%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 A ++S V V+ + + + L+ + + A+NG Sbjct: 51 AGYDSISQAHQNVDVLEVDLTSPSYDIQL------VYEEHGDSLSSVAERNNAAAAING- 103 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI---VRLDAFKTS 149 +Y +I+ G + +A N + + G F D I D+ S Sbjct: 104 ----TYEAEASFIKIGGRLLAQNRLDSTHIRYWKHEGAFLFDDDNKNIDIRFASDSTFLS 159 Query: 150 KEIQFAVQSGPMLMENGVINP-RIHPNVAS-----------------SKIRNGVGINKHG 191 + PML++N NV R V + +H Sbjct: 160 HPAANILSGAPMLIDNNDPVGLNFTGNVEGMDLNKLDYEDFRRHQGVRHPRTAVALTEHK 219 Query: 192 NAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + + + + + + L LDG S + Sbjct: 220 KLLLITVDGRSTQAAGMSANELTRFLLTYFCPQSALNLDGGGSTTMWIASS 270 >UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoanaerobacterales RepID=Q8RCE6_THETN Length = 815 Score = 79.7 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 34/95 (35%), Gaps = 10/95 (10%) Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS- 198 I F ++I+ AV G +L++ G I P + R +G K V +++ Sbjct: 280 ITTNPPF---EDIKMAVSGGTILVKGGKIYP-FTHEIKGYAARTAIGYTKDKRYVLMVTV 335 Query: 199 QQA----TNFYDFACYAKAKLNVEQLLYLDGTISH 229 + A + L L LDG S Sbjct: 336 DGPPYRGMTQEELASLMLS-LGAYDALNLDGGGST 369 Score = 42.3 bits (98), Expect = 0.015, Method: Composition-based stats. Identities = 10/93 (10%), Positives = 33/93 (35%), Gaps = 3/93 (3%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPL 101 + + ++ + + + + + + ++ + A+NG +D ++ + Sbjct: 85 ININILKIDLKDPYLDLSVIFSPSGIKERM--PIREMANSYGAVAAINGDFFDTKTGFVI 142 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 G +++G F+I G Y+ Sbjct: 143 GATVKDGNLITDPASNGKMATFYIDKTGTPYID 175 >UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BA0C Length = 621 Score = 79.7 bits (195), Expect = 9e-14, Method: Composition-based stats. 
Identities = 35/186 (18%), Positives = 60/186 (32%), Gaps = 17/186 (9%) Query: 70 GTLHALLADINSQGQVQMAMNGGIYDESYAP-LGLYIENGQQKVALNLASGEGNFFIRPG 128 G + +GQ +A NGG ++ LG I G+ + + +F I Sbjct: 123 GNQLDTSVNAARRGQCFVAQNGGYFNTKTQSCLGNVISRGRTLHTSDATNA--HFGILSN 180 Query: 129 GVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVAS--------S 179 G V +R F + + V++G +E V Sbjct: 181 GSIVVGYISDADLRRLNFTNLVGGVIWLVRNGTSFVEESVSMESSDTEETGTLRYFSDVQ 240 Query: 180 KIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 R +G +KHG V + N F+ + L++ + LDG S + Sbjct: 241 SARTAIGHDKHGWVVLVQVDGQTGARGVNLNSFSKFLIEDLHLVNAINLDGGGSATLVIN 300 Query: 235 GAIPWQ 240 G + Sbjct: 301 GTLANT 306 >UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N4C8_SYNP6 Length = 605 Score = 79.7 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 48/167 (28%), Gaps = 25/167 (14%) Query: 104 YIENGQQKVALNLASGEGN-FFIRPGGVF------------YVAGDKVGIVRLDAFKTSK 150 ++ + N F I G V G + +++ Sbjct: 421 TVQGDRVVSQSQADKAGSNRFTIPRNGYLIVLRSANSLRTSLVNGTTIQVLQQAQPSQFD 480 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQ----- 199 A+ GP+L+++G + S R+ +G+ G V + + Sbjct: 481 RFPHALGGGPLLVKSGRVVVNPQAEGFSRAFEIEAAPRSAIGLMPDGRLVLVAAHEQNQG 540 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 Q A + +L V L DG S + G + + Sbjct: 541 QGPTLPQMAAIMQ-QLGVVDALNFDGGSSTSLIVNGQLVNRARGSAA 586 Score = 47.0 bits (110), Expect = 7e-04, Method: Composition-based stats. 
Identities = 14/96 (14%), Positives = 30/96 (31%), Gaps = 2/96 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 L + + T V ++ + V++ A + +L + Sbjct: 263 LAGTQLQQRQVTVDGATFPVFVIQLDLRQPNVRLAPIWAGNGSLEG-TQVLQAVARDRGA 321 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 +A+N G ++ + PLG + L G Sbjct: 322 AIAINAGFFNRNNRLPLGAIRRDNIWYSGPILNRGA 357 >UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEV6_ACHLI Length = 520 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 14/100 (14%), Positives = 32/100 (32%), Gaps = 15/100 (15%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQ 200 + ++ A+ +G +L+++G + ++ S R +G G F++ Sbjct: 278 NGFENVRNAIGTGQLLVKDGAVQHAAFKSLPSNNMAHFRHPRTAIGQKADGTVFFIVVDG 337 Query: 201 A--------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + K LDG S + Sbjct: 338 RDALSGKYGVKYSELGELMKMH-GAVTAFNLDGGGSSTML 376 >UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0C332_ACAM1 Length = 306 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. 
Identities = 31/226 (13%), Positives = 62/226 (27%), Gaps = 28/226 (12%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK--ANGEAWGTLHALLADINSQ 82 L S V ++ +++ + + +Q Sbjct: 40 LFAGITYQRQVYT-SPRPYIVHIAKIDLTHPGIRVIATPGQPADDDNEFRAQPTSAFLTQ 98 Query: 83 GQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRP---GGVFYVAG-D 136 ++Q+AMN G + P G + L + G + P V Sbjct: 99 FRLQLAMNAGYFYHFNEKTPWDYAPHTGGRVNVLGQSISMGQPYSPPQKQWPVLCFDQSQ 158 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR--IHPNVASSKIRNGVGINKHGN-A 193 + IV + AV +L +P + + R+ +++ G Sbjct: 159 RGRIVATG--HCPSDTLHAVAGNYIL------HPDQPLQLDSDKPYARSIAALDQTGTTL 210 Query: 194 VFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 ++ F D K ++ + L LDG S + Sbjct: 211 WLIVVDGKQPDYSEGATFADIEQLIK-QIGADIALNLDGGGSTTLV 255 >UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=Alkaliphilus RepID=A6TVJ8_ALKMQ Length = 942 Score = 78.5 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 48/144 (33%), Gaps = 12/144 (8%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 P NG VA +G G + + +I+ A Sbjct: 219 RRRQPATGIPSNGYVLVASQTETGWGRAGHLFDNLKVGDRLTLHQEIQPNLN---QIELA 275 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNFY------DFA 208 + G +L+++G + +VA + R+ +GI++ + + +NFY + Sbjct: 276 LGGGTLLVKDGQA-AHLTQSVAGAHPRSAIGISRDRKQVILVTIDGRSNFYHGVDGRELG 334 Query: 209 CYAKAKLNVEQLLYLDGTISHMYM 232 L + +DG S + Sbjct: 335 NILLG-LGAHDAIIMDGGGSTTMI 357 >UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AA51_CARHZ Length = 356 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. 
Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 24/153 (15%) Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 P G + G NL + I P ++ ++ A+ +++G Sbjct: 211 PQGFVLNTGSLCPPDNLLNSNVTLKIEP-------ENQENVLWSKAYA-------VLEAG 256 Query: 160 PMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 P L++ G I S R+G+G+ K+ + + ++A + Sbjct: 257 PYLVKEGKIIADPLKENFTHYKIKDGSFARSGIGVTKNKKLLLVTV-NRATIKEWAIIMQ 315 Query: 213 AKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPF 244 KL + LDG S +Y+ G + Sbjct: 316 -KLGAYYAMNLDGGASSGLYVNGKYLTKPGRLL 347 Score = 40.4 bits (93), Expect = 0.053, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 6/113 (5%) Query: 7 IGKGMITLNLKRIFLALTLLPLF-AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 I G I L +F L L V +++ TV+ V+ T+++KM A Sbjct: 6 IALGAILLVYIILFSQLALGANSYQVFEKKLKINNKNFTVKGVIVDLNTKKLKMQTVLAK 65 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIY---DESYAPLGLYIENGQQKVALN 115 + G + +L + + + + +NG + D P G + +G+ N Sbjct: 66 NQ-IGQVESLESMVKRKKGLIG-INGAFFSAYDAYKEPYGNLMIDGRLIRKGN 116 >UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9_NEIME Length = 256 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. 
Identities = 30/239 (12%), Positives = 69/239 (28%), Gaps = 31/239 (12%) Query: 16 LKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHAL 75 + IF+ + A C + +N + + + + + Sbjct: 7 ILSIFILSFFNSEYTYAQSLCIQQSSQNHIHIAKINLNCKGINLIATQEADK-----GMT 61 Query: 76 LADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 ++ + + +A+NG + Y P GL I + + + Sbjct: 62 VSQFARKYRTDIAINGSFFRTGYFPFGLAITDHKTWDKTRDVQKRVFLACNRQNRCMIED 121 Query: 136 D----------KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGV 185 K+ + +F + + P+ G + + + + R V Sbjct: 122 KNMVSKVDDSWKLAVSGWQSFNPATKKFECSDDDPV----GCTHIKFI----TKQPRTMV 173 Query: 186 GIN-KHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 G++ K ++ + A A L + + + LDG S +KG Sbjct: 174 GLDEKRNYLYLVVIDGRLPKFKGATLNELGQLA-ASLKLTKAINLDGGGSSTMVKGYNR 231 >UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DHU5_THEEB Length = 575 Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 49/148 (33%), Gaps = 21/148 (14%) Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 G VA N S NF G V + I V +GP+L+E Sbjct: 420 SEGFLLVARNFNSALANFPP---------GAAVQLETTAVPAAFNRIPNIVGAGPLLVEQ 470 Query: 166 GVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQAT-----NFYDFACYAKAK 214 G + + + + R+ +G G+ V++ + ++A + Sbjct: 471 GRVVLNAALEQFGAGLDAQAAPRSAMGNRSDGSIVWVTTHNRIGGMGPTLAEWAQIV-HR 529 Query: 215 LNVEQLLYLDGTISHMYMKGGAIPWQRY 242 L + + LDG S GG + + Sbjct: 530 LGLINAVNLDGGSSTALYLGGVLVDRHG 557 Score = 52.0 bits (123), Expect = 2e-05, Method: Composition-based stats. 
Identities = 18/96 (18%), Positives = 31/96 (32%), Gaps = 2/96 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P L V +NPQ +++ + + L + ++ + Q Sbjct: 241 APGLRWQQQTVILGTRQFPVDLLIINPQQPGLRLRPLEISPTTLVGLATVP-ELAQRWQA 299 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 A+NGG ++ PLG G L G Sbjct: 300 AAAINGGFFNRDRQAPLGAIRREGNWLSGPILNRGA 335 >UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XK15_SYNP2 Length = 595 Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 37/117 (31%), Gaps = 16/117 (13%) Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVF 195 ++F T I V GP+L++NG + + S R+ + + + Sbjct: 472 TPNSFATLPNI---VGGGPLLLKNGQVVLNGQAEQFSTAFNIQSASRSAIARTRDNKILL 528 Query: 196 LLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + ++A + +L L LDG S G + + Sbjct: 529 VTLHGAAEETAGATLNEWANILR-RLGATDALNLDGGGSSALALGANLSDRHPTTAG 584 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 53/179 (29%), Gaps = 18/179 (10%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 LP ++ A S + V ++P ++++ + L LL Q Sbjct: 256 LPGVQWRQENFAASSGPVRVTWLEIDPTQRQLQLKPITPDNNTIVGLAPLLIQ-ADTNQA 314 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 A+N G ++ + PLG+ ++ + + I G Sbjct: 315 IAAINAGFFNRNNQYPLGI-VQGNRALRSG---------PILNRGAVAWDNAGRWEFDRL 364 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPR--IHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 +T + G L+ +G + ++ S+ V V + Sbjct: 365 KVETDIVAGNGERVGVELINSGYVKAGAALYDRAWGSRYTTAV----DHEIVLTVMTSG 419 >UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=Nostocaceae RepID=UPI0001C164F4 Length = 300 Score = 77.8 bits (190), Expect = 4e-13, Method: Composition-based stats. 
Identities = 40/230 (17%), Positives = 78/230 (33%), Gaps = 28/230 (12%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA------DINSQGQVQMAMNGGIYD 95 + ++ + + + N + + + ++ + +NG Sbjct: 56 GIPFYQTIIDLEDPNILLTIGLPNSANFANTISRTNGDENFDQLVARSGAAVVVNGTFAY 115 Query: 96 ESYAPL--GLYIENGQQKVALNLASGEGNFFIRPG-GVFYVAGDKVGIVRLDAFKTSK-- 150 + G + G+ S NF G GV G+K ++ + Sbjct: 116 TNPQKTVMGNLVAGGRSLK----YSPWENFGTTLGLGV----GNKPEMITARVEGRPEWN 167 Query: 151 EIQFAVQSGPMLMENGVI--NPRI----HPNVASSKIRNGVGINKHGNAVFLL-SQQATN 203 + F++ SGP L+ NG + NPR+ P V + +R +G ++ G +FL + Sbjct: 168 KHWFSITSGPRLLRNGEVSVNPRLEGFKDPAVLGTSLRTAIGFSEDGKRLFLANFDEKLY 227 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-QRYPFVTMISVER 252 + A KA + + + LDG S I +I V Sbjct: 228 LEEEAEAMKA-IGCYEAMNLDGGPSRALASDNVILVPPARKLTNVILVYD 276 >UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017896CA Length = 2050 Score = 77.0 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 59/168 (35%), Gaps = 23/168 (13%) Query: 72 LHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 L + + S + M + + D+ +P+G GQ ++ + + ++ G Sbjct: 230 LDIVSGRVASGETLTMKVVSVLKDQGNSPIG----QGQVVLSASGSQRSKLAGLKAG--- 282 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 G + ++ ++ A+ ML+++GV+ + R VG G Sbjct: 283 --DEVTAGFQLDNEWQ---DVTMAIGGTVMLVKDGVVQ---QHTDPAVHPRTVVGTKADG 334 Query: 192 NAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + V N+ + +L V L LDG S ++ Sbjct: 335 SVVLFEVDGRQPGFSEGLNYIELGE-MLQELGVVNALNLDGGGSATFV 381 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. 
Identities = 21/107 (19%), Positives = 42/107 (39%), Gaps = 6/107 (5%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG--TLHALLA-DINSQG 83 P + V +P +++ +G+ +G + + + + Sbjct: 70 PGATYTWANMQKGSGEQKVHMVEFDPSQGNLELQPGLTDGKVYGMQGVSKMASDADKAGN 129 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 +V A+NG YD + PLGL++ +G+ + SG F I+ G Sbjct: 130 RVIAAVNGDFYDMSTGIPLGLFMGDGELL--TDPPSGRNAFGIKQDG 174 >UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y710_COPPD Length = 485 Score = 77.0 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 47/162 (29%), Gaps = 16/162 (9%) Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 P G I G +V G Y V I + +G Sbjct: 332 PDGYVIHLGGTEVRFKDRFEVGTRLS------YRDIYDVRNSSNPEMWQEGVIWGTLSAG 385 Query: 160 PMLMENGVI--NPR-----IHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 P L+ NG I +P I R+ +GI ++ + + D A K Sbjct: 386 PRLITNGEITLDPASELLDIPKITGQPLTRSALGITQNNELLMVTV-SKCTIQDLATIMK 444 Query: 213 AKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 L + LDG S +Y G + + V + Sbjct: 445 D-LGAYNAMNLDGGASTSLYANGKFLATPTRKISNALMVLPR 485 >UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IEV9_9BACE Length = 343 Score = 76.6 bits (187), Expect = 8e-13, Method: Composition-based stats. 
Identities = 31/246 (12%), Positives = 62/246 (25%), Gaps = 56/246 (22%) Query: 45 VQAYTVNPQTERVK----MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY---DES 97 + + + + + + G ++ + + +NGG++ Sbjct: 79 AYIAVADMSKAKFEVLGDIAFSQEANGYGGKSIHTPSEFYESSKAPVVINGGLFFYSAGF 138 Query: 98 YAPLGLYIENGQQKVALNL---ASGEGNFFIRPGGVFYVAGDKVGIVRLDA--------- 145 Y L I GQ ++ G + Sbjct: 139 YYSQNLVIREGQLLAPNQNYYSKDWVTMWYPTLGAFCQMKDGTFQTTWTYQASDGINYCY 198 Query: 146 -------------------FKTSKEIQFAVQS--GP-MLMENGVINPR-----IHPNVAS 178 F + A G +L+ G I + + AS Sbjct: 199 PAPADNDINKDPLQAPSSTFPNGAKALEATTGIGGVTVLLRAGEIKNTYVEEMLDISAAS 258 Query: 179 SKIRNGVGINKHGNAVFLLSQQA--------TNFYDFACYAKAKLNVEQLLYLDGTISH- 229 ++ R +GI + + + + + A K L + L LDG S Sbjct: 259 NQPRTAIGITTNKKMIIFVCEGRNMTEGVAGLTTANVAKVMKD-LGCTEALNLDGGGSSC 317 Query: 230 MYMKGG 235 M + G Sbjct: 318 MLVNGK 323 >UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q9_CLOCE Length = 952 Score = 76.2 bits (186), Expect = 8e-13, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 45/157 (28%), Gaps = 27/157 (17%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF-------- 154 + +E+G N + + G + D F +++ Sbjct: 210 MVVEDG-IVKEFNENKPSMD--MPKNGFVVLGAGSHIQYLKDNFNVGDPVEYNITMNVDT 266 Query: 155 -----AVQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLL-SQQA---- 201 A+ G ML+++ + N S R +G +K G + + Sbjct: 267 NNMKMALTGGAMLVKDDKVLTSFSHNPVSPSTRASRTAIGTSKDGKTLIVAAVDGRSSAS 326 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + A Y +L L LDG S + Sbjct: 327 IGMTQSELASYM-HELGCANALNLDGGGSTTLVARKQ 362 >UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4EC3 Length = 279 Score = 76.2 bits (186), Expect = 9e-13, Method: Composition-based stats. 
Identities = 29/208 (13%), Positives = 65/208 (31%), Gaps = 32/208 (15%) Query: 45 VQAYTVNPQTERVKMYWQKANGE-AWGTLHALLADINSQGQVQMAMNGGIYDESYAP--- 100 A ++ + + NG+ T + + ++Q+A+N + + Sbjct: 52 GHAVRIDLKAAGIGFLATPGNGDRPGETDGLKTSTFLKRHKLQLAINAAPFGPIHKDEEK 111 Query: 101 ----LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 +G+ + G+ + ++ I F + I+ AV Sbjct: 112 EQDVVGVQVSGGKLVSPAQPGYP---------ALLLAKDNRARI-AAPPFDL-EGIENAV 160 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQAT-------NFYDFA 208 ++++ G + S R G++ G V L+ + Sbjct: 161 GGFHIVLKGGEV----LTGDKSIHPRTAAGVSADGKTLVLLVIDGRQKDFSDGATTAEVG 216 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + KA L + + LDG + + GA Sbjct: 217 EWLKA-LGCAEGINLDGGGTTTLVVAGA 243 >UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C442_9GAMM Length = 299 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 73/223 (32%), Gaps = 29/223 (13%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL-ADINSQGQVQMAMNGGI 93 + + + + +V+ ++ G + A + ++ ++Q+A+NG Sbjct: 44 EVRQTPRPIIIHFISVDLTKPNIRFLVTPGEVRDDGEIGARTTSQFLTEFKLQLAINGNF 103 Query: 94 YDESYAPL-------GLYIENGQQKVALNLASGEGNFFIRPGGVF---YVAGDKVGIVRL 143 + + PL + G + LAS G + + F Y++ D Sbjct: 104 FYP-FHPLFSVDFWNAYPKKRGDPVYVVGLASSHGQVYSQTKKSFETLYISADNQA---- 158 Query: 144 DAFKTSKEIQF-AVQSGPMLMENGVINPRIHPN--VASSKIRNGVGINKHGNAVFL-LSQ 199 F+TS + A+ + ++ G I R + ++K + + + Sbjct: 159 -RFQTSIGPLYHAISGRELFIKQGKIQGPFPKGAFNEKPYPRTALALDKTAKTLMIFVVD 217 Query: 200 Q-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + A ++ + L LDG S + G Sbjct: 218 GKLKNYSEGVTLMELADIVQS-YGADMALNLDGGGSSTLVMEG 259 >UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YKH7_ANASP Length = 314 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. 
Identities = 35/228 (15%), Positives = 66/228 (28%), Gaps = 30/228 (13%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL-------------HALLADINSQ 82 L + T++ T +K + + ++ ++ Sbjct: 55 IESKPRPLIIHIVTIDLNTPGIKPFITPDIENLSKNVGVGKQAIIDNETKARTTSEFVAE 114 Query: 83 GQVQMAMNGGIYDE--SYAPLGLYIENGQQKVALNLASGEGNFFIRP---GGVFYVAGDK 137 QV++A+NG + P Y +G L G + V + Sbjct: 115 FQVKLAINGSYFYPFKEVTPWHYYPHSGDTTKVLGQTISNGKIYANKKSSWYVLCFDNNN 174 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--RNGVGINKHGN-AV 194 + + K + +L+ G I+ N ++ K R I+K G Sbjct: 175 QAQIPGGE-ECPKNTIQGLAGDDVLVFQGKPKINIYANSSADKPYSRVVAAIDKTGKKLW 233 Query: 195 FLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 +L Y + + AKL V + LDG S + Sbjct: 234 LVLVDGKQPLYSEGFTKRELTQFI-AKLGVYNAINLDGGGSTTLVVAN 280 >UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R528_9CHLA Length = 380 Score = 74.7 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 36/108 (33%), Gaps = 17/108 (15%) Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQQ---- 200 +I V P+L+ G + S R VGI ++GN +F++ Sbjct: 255 DIVHIVGGTPILVRGGRLVTDFSAEQTGSHFLNVRLARTAVGILENGNWLFVVVDGFYKN 314 Query: 201 -----ATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRY 242 D A + KL + L L G S M +K + Sbjct: 315 IWNTKGITIPDLAELMQ-KLGCVEALNLCGGKCSTMVLKNVVVNDPPD 361 >UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcystis aeruginosa RepID=B0JGJ2_MICAN Length = 607 Score = 74.7 bits (182), Expect = 3e-12, Method: Composition-based stats. 
Identities = 24/163 (14%), Positives = 52/163 (31%), Gaps = 27/163 (16%) Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV------------GIVRLDAFKT 148 GL ++ + LN + + I G + + F Sbjct: 429 TGLVVQGDRVTEKLNNLFPQDSIKIPENGYLVICRKTDISLNIGERVNLDSVTLPGDFAN 488 Query: 149 SKEIQFAVQSGPMLMENGVIN-----PRIHPNVASSKI-RNGVGINKHGNAVFLLSQQAT 202 +I + +GP+L++NG I + P + + R+ + +++ G + + Sbjct: 489 YPQI---LGAGPLLLQNGRIVLDGNAEKFSPAFQNQQASRSAIAVSREGKILLVAIHNRV 545 Query: 203 -----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A + + L LDG S G + + Sbjct: 546 GGRGATLGELARILLL-MAAKDGLNLDGGSSTGIALAGYLLDR 587 Score = 51.2 bits (121), Expect = 3e-05, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 35/95 (36%), Gaps = 2/95 (2%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P L V ++P+ ++ + AN + + L+ INS+ Sbjct: 275 PGLIWNQKYIQLDQDWFPVTWLEIDPRNPQITIKPITANSTSMRGTNPLI-TINSESNAV 333 Query: 87 MAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 +NGG ++ + PLG +G+ L G Sbjct: 334 AMINGGFFNRNNQLPLGAIRVDGKWLSGPILNRGA 368 >UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B1X2V5_CYAA5 Length = 309 Score = 74.3 bits (181), Expect = 4e-12, Method: Composition-based stats. 
Identities = 26/190 (13%), Positives = 59/190 (31%), Gaps = 32/190 (16%) Query: 74 ALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKV-ALNLASGEGNFFIRPGGVF 131 + D + + +NGG +D + I+ G+ N N + P Sbjct: 89 KTVEDFAQETEAIAVLNGGFFDPVNSQTTSYVIKEGEAIADPSNNPRLMDNPQLEPYLKQ 148 Query: 132 YVAGDK------------VGIVRLDAFKTSKEIQFAVQSGPMLM-------------ENG 166 + + + + ++ ++ GP L+ NG Sbjct: 149 ILNRSEFRRYQCNELTRYAITYHQEPVPENCQLTESIGGGPQLLPNLSAEEEAFFESVNG 208 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY----AKAKLNVEQLLY 222 + R + + R + I G+ ++++ +Q + + L+V + Sbjct: 209 QV-TRDPLGLERANARTAIAITSSGDVLWIMVEQTSPSTGLSLLKLREFLESLDVTSAMN 267 Query: 223 LDGTISHMYM 232 LDG S + Sbjct: 268 LDGGSSSSFF 277 >UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XE16_9BACT Length = 398 Score = 74.3 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 45/144 (31%), Gaps = 24/144 (16%) Query: 109 QQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI 168 + ++++ I+PG + + + + + AV P+L+ +G Sbjct: 232 KMIISIDPKLASRFAGIQPGTILHFSTGTSRDIA--------KADTAVGGRPLLLVHGKE 283 Query: 169 NPRIHPNVAS-----SKIRNGVGINKHGNAVF-LLSQQA-------TNFYDFACYAKAKL 215 + R +G + F ++ + + A + + L Sbjct: 284 LETSKQKGNNAATIVRHPRTALG--WNARYFFLVVVDGRQKELSMGMSSQELAHFM-STL 340 Query: 216 NVEQLLYLDGTISHMYMKGGAIPW 239 + + LDG S + G + Sbjct: 341 GCTEAMNLDGGGSTTFWLDGKVVN 364 Score = 40.4 bits (93), Expect = 0.051, Method: Composition-based stats. 
Identities = 30/166 (18%), Positives = 62/166 (37%), Gaps = 16/166 (9%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 +++++ + + + + ++ ++ + + + + A G+ G Sbjct: 19 LVSIHARAELTPIFSSLVPGLDYAHITETNHPWSIHVARLERSHKELDLVSTLAQGKIVG 78 Query: 71 ---TLHALLADINSQGQVQMAMNGGIYDE-----SYAPLGLYIENGQQKVALNLAS---- 118 + + G+ +A+NG + PLGL I NG+ A N AS Sbjct: 79 LSSVANQVKTFPAGSGKPLVAVNGDFFVIAKGPYQGDPLGLQILNGELVSAPNGASFWKD 138 Query: 119 GEGNFFIRPG----GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 EGN F+ + G+K+ + +TSK + F GP Sbjct: 139 AEGNLFLDNVQSKFSILLPKGEKIPFGLNEQRQTSKAVLFTPAFGP 184 >UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6I7_THEAS Length = 486 Score = 73.9 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 61/172 (35%), Gaps = 28/172 (16%) Query: 93 IYDESYAP-----LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA------GDKVGIV 141 Y +Y P L L +++G +F + G A GD + +V Sbjct: 306 FYGGAYRPGNQALLSLSVKDGIV----QDEPQGADFTLLANGRAAEALGSLNIGDTLQLV 361 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS----SKIRNGVGINKHGNAVFLL 197 R AF + + +Q GPM++EN R S R VGI++ G VF++ Sbjct: 362 RRFAFPAFEACRLVIQGGPMIVENRRYVNRSEGLSRSIRERRHPRTLVGIDEQG-LVFMV 420 Query: 198 SQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 + A A + + L LDG S M +G + Sbjct: 421 IDGRNGHSSGVTLEEAANLALEE-GLVAALNLDGGGSSQMIWRGVTVNIPSD 471 >UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3R0_PELTS Length = 485 Score = 73.9 bits (180), Expect = 5e-12, Method: Composition-based stats. 
Identities = 36/178 (20%), Positives = 58/178 (32%), Gaps = 26/178 (14%) Query: 94 YDESYAPLG---LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD------ 144 Y P G + + NG + G I G G+ Sbjct: 315 YKYDTTPPGRTAVVVRNG-----IVTGIRSGQVEIPEDGYVIWYGENNYERDDQFSAGRQ 369 Query: 145 -----AFKTSKEIQF--AVQSGPMLMENGVIN--PRIHPNVASSKIRNGVGINKHGNAVF 195 FK +++ +F + + P+L+ NG I P + R+ VG+ V Sbjct: 370 VDYRVTFKENQQARFKATISNYPLLLSNGAIALGDITEPKLTIGAPRSFVGVTWDNILVM 429 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVER 252 A N ++ A K L ++ L LDG S +Y G I + V + Sbjct: 430 GTVDSA-NVWELAEVTKN-LGLKDALNLDGGASCGLYYDGAYIRQPGRLLSNCLVVIQ 485 >UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001923977 Length = 290 Score = 73.2 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 54/186 (29%), Gaps = 32/186 (17%) Query: 80 NSQGQVQMAMNGGIYDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 Q ++A+N G ++ G I NG N NF IR G + Sbjct: 106 AKQQNCRIAVNAGFFNPFETDKDYGKCYGNIISNGNLVQD-NGGIQNANFGIRSDGTLVI 164 Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG---------------VINPRIHPNVAS 178 V K + +Q G +L NG + Sbjct: 165 GYLPEKEVID---KKNPFLQLLSGVGWIL-RNGSSYLKESEKAECKESETTGTLDKFFNV 220 Query: 179 SKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 R +G + G+ + N Y+ Y K K+ + + DG S Y++ Sbjct: 221 KSARTMIGYDAKGHVHIVQFDGKTGKSGINLYEAVEYLK-KIGLINAINFDGGGSATYVQ 279 Query: 234 GGAIPW 239 I Sbjct: 280 DSIILN 285 >UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XD34_SALTO Length = 430 Score = 73.2 bits (178), Expect = 8e-12, Method: Composition-based stats. 
Identities = 22/95 (23%), Positives = 32/95 (33%), Gaps = 8/95 (8%) Query: 153 QFAVQSGPMLMENGVIN-PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFY 205 FAV L+++G I P + R G G V + T Sbjct: 314 TFAVNGRYRLVKDGQIVAPSGSDSFFDRHPRTIAGTTLDGKIVLVTIDGRQTTSVGTTMT 373 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A A A L + + LDG S G++ Q Sbjct: 374 ETASVA-AALGMHDAVNLDGGGSTTMSVEGSLVNQ 407 >UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7E39 Length = 660 Score = 73.2 bits (178), Expect = 9e-12, Method: Composition-based stats. Identities = 22/218 (10%), Positives = 53/218 (24%), Gaps = 40/218 (18%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 V ++ + ++ + A L+ + + +NG E Sbjct: 75 RQQVNVLEIDLSSPDYELEFVSAPQL------DSLSSVALKHDAVAGINGTYELE----A 124 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA---GDKVGIVRLDAFKTSKEIQFAVQS 158 NG + L G ++ G + Y G ++ + I Sbjct: 125 SFVKVNGSIISPITLPEGHLRYWKHEGAIAYDGYKVEIGYGTKESYSYNSMPNI---FSG 181 Query: 159 GPMLMENGVIN-PRIHPNVAS-----------------SKIRNGVGINKHGNAVFLLSQQ 200 P+L+++ ++ R V + + + + Sbjct: 182 APVLIDDYQPVGKTFIGDITGINLNSLDGEDYRRHQGVRHPRTAVALTEQNKLLLVTVDG 241 Query: 201 A------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + L +DG S Sbjct: 242 RADLAAGMTAKELTSFINQYFKPQHALNVDGGGSTTMY 279 >UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HN11_LYSSC Length = 815 Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats. 
Identities = 21/103 (20%), Positives = 37/103 (35%), Gaps = 15/103 (14%) Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHG-NAVFLLSQQA----- 201 + QF + +GPML+ NG ++ + N + R V ++ G + Sbjct: 266 DAQFILAAGPMLVRNGQVDISMPTNSGFASTRSPRTAVAVDATGTKVSLITIDGRLSGHS 325 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYM---KGGAIPW 239 N D A + + + + LDG S + GG Sbjct: 326 NGVNLSDLASHLIS-IGATSAINLDGGGSTAMVARNPGGYFAN 367 >UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthrospira RepID=B5W3X9_SPIMA Length = 812 Score = 72.4 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 18/115 (15%), Positives = 35/115 (30%), Gaps = 23/115 (20%) Query: 156 VQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHG-----------NAVFLLS 198 + +GP+L+ + IR+ VG+ + + + ++ Sbjct: 689 LGAGPLLLRGNQVVLDARAENFSDAFNTQRAIRSAVGLKTNTPGRSGSDSPAVSLLLVVV 748 Query: 199 QQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + + A K +L L LDG S GG + + I Sbjct: 749 HPRLGGPGPSLAELAELMK-QLGATDALNLDGGSSTGLYLGGYLLDRPPQTAAPI 802 Score = 42.7 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 13/99 (13%), Positives = 33/99 (33%), Gaps = 4/99 (4%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 V ++ + + + + + + L+ + S + A+NGG ++ Sbjct: 458 GRESARFPVVWLEIDLNNQGISLQPILSRPGSRSGVSPLV-HVASSTRAAAAINGGFFNR 516 Query: 97 SYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 + PLG + L G + + F++ Sbjct: 517 NNQYPLGAIRHQNRWLSGPILNRGAIAWTDQNQ--FFID 553 >UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alpha-N- acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30FBA Length = 249 Score = 72.4 bits (176), Expect = 1e-11, Method: Composition-based stats. 
Identities = 26/196 (13%), Positives = 55/196 (28%), Gaps = 33/196 (16%) Query: 82 QGQVQMAMNGGIYDES-YAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 + A+ G + + PLG G A G V D Sbjct: 51 ENDRPEAIVAGFFVRDPHLPLGEVRVGGVPVVHEPVAAPWAGRRA-------CVHVDGEI 103 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVIN--------------PRIHPNVAS-SKIRNG 184 + VQ+GP+L+ +G + ++ + R Sbjct: 104 RIAPREELADVGGGDLVQAGPLLVRDGTAAIVDGEDREGFSAGASQFDSDITAERHPRCA 163 Query: 185 VGINKHGNAVFLLSQQATN-------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 +G+++ + + + + A + + + LDG S + G + Sbjct: 164 LGVSED-ELLAVCCDGRRSGVDAGLDLAELARLMVS-FGAREAINLDGGGSATLVHRGHL 221 Query: 238 PWQRYPFVTMISVERK 253 + Y + E + Sbjct: 222 LNRPYADRDQPAPESR 237 >UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=2 Tax=Actinomycetales RepID=D2ASL7_STRRD Length = 1138 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 54/194 (27%), Gaps = 27/194 (13%) Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-----PLGLYIENGQQKVALNL 116 G + TL I G G Y A + + G + Sbjct: 183 ATPAGGSPITLTQFNQLIQGNGVGLFTPLWGSYGRGRAVEGAAAVTEVVLEGGVVTEVRT 242 Query: 117 ASGEGNFFI-------RPGGVFYVAGDKVGIVRLDAFKTSK----EIQFAVQSGPMLMEN 165 ++G G R G +A K G ++ ++ AV +L+++ Sbjct: 243 SAGSGPIPAGTAILLGRDAGASALAALKPGDRVEVRYQPKPSEGGAVKAAVGGSQILVKD 302 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TNFYDFACYAKAKLNVE 218 G + + + R VG + G L + A+L Sbjct: 303 G--VAQTSADNT-AHPRTAVGFSADGRKMYLLTVDGRQTDSRGVTLTELGA-MMAELGAH 358 Query: 219 QLLYLDGTISHMYM 232 L LDG S + Sbjct: 359 DALNLDGGGSSTML 372 >UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V9Y5_MONBE Length = 298 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. 
Identities = 33/189 (17%), Positives = 63/189 (33%), Gaps = 27/189 (14%) Query: 73 HALLADINSQGQVQMAMNGGIYDESYAP---LGLYIENGQQKVALNLASGEGNFFIRPGG 129 H +++ + A N G + + P G I +G + + NF + G Sbjct: 95 HRTVSEQAKLLTCEYATNAGFF--DFTPPACEGNLITDGVSIQ--HPCPNQVNFGRKL-G 149 Query: 130 VFYVA---GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP----RIHPN---VASS 179 + GD++ I + + + G L+ +G P V+ Sbjct: 150 MTCPDSTQGDRIVIGYMQEADIADLTELITGRGW-LIRHGQAYTNQSREFTPTDSFVSEK 208 Query: 180 KIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMY-M 232 R +G+ K G + L+ + ++ A +L+V Q + LDG S Sbjct: 209 APRTALGLTKDGAILSLVVDGIEEELVGPDLHEMASLLL-ELDVVQAINLDGGGSSTAVY 267 Query: 233 KGGAIPWQR 241 +G Sbjct: 268 QGHVFNMPH 276 >UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AUR4_STRRD Length = 487 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 50/183 (27%), Gaps = 17/183 (9%) Query: 73 HALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVF 131 + G ++ ++ G + G + + + V Sbjct: 290 EEFGTKTAADGGAEIVVDAQGRIVKARAAGGVVPRGTYVLHGTGIMATWLLEHAQETSVM 349 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS-------SKIRNG 184 + KV +R + + G L+ NG + + + R Sbjct: 350 KL-DTKVIDLRTERAVPLTPETHIMGGGVGLLRNGRVRISAKADGHASVVMMLRRHPRTM 408 Query: 185 VGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 VG+ K G + + + A + L +Q + DG S + G + Sbjct: 409 VGVTKSGGLILATVDGRNPGVTVGASMVEAAQLMRW-LGAKQAINFDGGGSTAMVVGHKV 467 Query: 238 PWQ 240 + Sbjct: 468 INR 470 >UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z816_BREBN Length = 1054 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. 
Identities = 23/135 (17%), Positives = 47/135 (34%), Gaps = 12/135 (8%) Query: 129 GVFYVAGDKVGIVRLDAFKTSK---EIQFAVQSGPMLMENGVINPRIHPN--VASSKIRN 183 G F VG ++T+ ++ AV +L++ G + + S R Sbjct: 254 GAFLKQNFPVGATAAVEYQTTPQTLNLKQAVGGNVILVDQGKALTSFQADKSITSKTART 313 Query: 184 GVGINKHGNAVFLLS---QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 VG+++ G +++++ Q + A A+L + + DG S + Sbjct: 314 SVGVSQDGKTLYMVTIDASQGVYLDELAKIM-AELGSYRAVNFDGGGSTTMA-TRMLGET 371 Query: 241 RYPFV--TMISVERK 253 ER+ Sbjct: 372 HANLANKPSGGAERR 386 Score = 40.4 bits (93), Expect = 0.061, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 40/133 (30%), Gaps = 7/133 (5%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 + + ++ +T+ V+ V++ + + + Sbjct: 51 GTTLQKYTKSFANQVVTIMVTKVDLNNPYVEVKPVYGTKGKLTD-KQTVTQMARETGAIA 109 Query: 88 AMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 A+N + + AP G+ +++ + ++ L S + + V G Sbjct: 110 AINADFFHMTKRGAPFGIVMKDDELISSMGLVSYWYALGLTGDKMAIVDKFGFG----GK 165 Query: 146 FKTSKEIQFAVQS 158 +++Q Sbjct: 166 VTAPNGATYSIQG 178 >UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QZA6_CHLT3 Length = 280 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. 
Identities = 33/199 (16%), Positives = 71/199 (35%), Gaps = 15/199 (7%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +NP+ K+ + + ++ A Q + A+N G++ Sbjct: 59 QIYVIRINPEHYAFKLMCASEHAKTPLSVKAWC----KQHGLISAINAGMFQADMLSAVS 114 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV--RLDAFKTSKEIQFAVQSGPM 161 ++N L+ F P K I+ + + K + + G Sbjct: 115 LMKNFAHINNPRLSKDNTIFAFNPTKK---DLPKAQIIDRTVQNYDALKSVYQSQFQGIR 171 Query: 162 LMENGVINP-RIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQ 219 ++ G N + P+ S +G + GN +F+ S+ +DF +++++ Sbjct: 172 MIAPGRKNVWQEQPDEWSIA---ALGSDGDGNILFIFSRSPYTVHDFINILLELPIDIQR 228 Query: 220 LLYLDGTI-SHMYMKGGAI 237 +YLDG + +Y I Sbjct: 229 AMYLDGGAVAQLYFSNKHI 247 >UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyoglomus RepID=B5YE82_DICT6 Length = 691 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 54/150 (36%), Gaps = 23/150 (15%) Query: 98 YAPLGLY--IENGQQKVALNLAS---GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 G+ I G +K + + G ++ +F + + I + Sbjct: 188 GKTSGIVSNIYYGVKKTPIKENTCIISLGGTALKYLPLFSIGKEIEIITEC---NPPIPL 244 Query: 153 QFAVQSGPMLMENGVINPR------IHPNV-ASSKIRNGVGINKHGNAVFLLSQQA---- 201 + A+ GP+L++NG I N+ S R +GI K+ + F++ + Sbjct: 245 KEAIGGGPILLKNGDIVLGNTDELAFDNNIVNSRHPRTIIGI-KNNSIYFIVIEGRKENS 303 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISH 229 + + K ++ + + +DG S Sbjct: 304 AGVSLKEACEILK-EMGINDAINMDGGGSS 332 >UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TUG6_ALKMQ Length = 491 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. 
Identities = 26/112 (23%), Positives = 45/112 (40%), Gaps = 10/112 (8%) Query: 150 KEIQFAVQSGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQAT 202 KE+ A+ +GP L++NGVI + + R+ +G+ K V Sbjct: 258 KEVTSAIGAGPTLIKNGVITANGLSEGFFEDEILTNRGQRSFIGVTKENKLVMGTVPS-V 316 Query: 203 NFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVERK 253 + + A AK +L + Q + LDG S + K + I + +K Sbjct: 317 SVKELAEIAK-ELGLYQAINLDGGASSGLIYKDRMVHAPGRLLSNAIVITKK 367 >UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LEU6_SYNFM Length = 300 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 72/209 (34%), Gaps = 14/209 (6%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY-APL 101 + ++P+ K+ N T + A+N G+Y E A + Sbjct: 76 YRITVVRIDPRYYAFKLINASENTREKMTAREWSRQF----NLIAAVNAGMYQEDGLASV 131 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGP 160 G ++N L + P G V ++ F + ++ + VQS Sbjct: 132 GY-MKNFDHVNNPRLGRDKTVLAFNPSG-PDVPEVQIIDRECQDFNSLRQKYRTFVQSIR 189 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQ 219 M+ + R S+ +G ++ G + L + +DF L++++ Sbjct: 190 MISCDRKNVWRQQAGRWSTV---AIGTDETGKVLLLFCRSPITVHDFIEVLLTLPLSLQR 246 Query: 220 LLYLDGT-ISHMYMK-GGAIPWQRYPFVT 246 +YL+G + +Y+ G + + Sbjct: 247 AMYLEGGPQASLYLSTGKTTLERYGSWEP 275 >UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostridia RepID=A4XGY7_CALS8 Length = 877 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. 
Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 9/87 (10%) Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL-SQQA------T 202 ++I+ A L+++G I P +A R+ +GI+K G ++L+ Sbjct: 266 EKIKAAASGNTFLLKDGKI-PSFTHEIAGRHPRSAIGIDKTGRYLYLVAVDGRNGKSIGL 324 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH 229 + + A + ++ ++V + LDG S Sbjct: 325 SQGELASFLQS-IDVWTAINLDGGYST 350 >UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C0E0_BEUC1 Length = 1327 Score = 70.8 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 10/91 (10%) Query: 151 EIQFAVQSGPM--LMENGVINPRIHPNVASSKIRNGVGINKHGNA-VFLLSQQA------ 201 ++ A+ P L+E+G I V R VG ++ G F++ Sbjct: 291 DVAVALGGAPEDWLLEDGEITSATGGYVDVRHPRTAVGFDETGTTAYFVVVDGRQSHSIG 350 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A+L + + LDG S + Sbjct: 351 MTLPELGRFL-AQLGADDAINLDGGGSSEMV 380 >UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A670_GEMAT Length = 426 Score = 70.8 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 43/156 (27%), Gaps = 22/156 (14%) Query: 99 APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS 158 G +G V + R V +V + + + + + Sbjct: 255 RSGGAIPRDGALLVGTGDRAAGVAAMSRFDTV------RVHLNTWPRLTSQRAPKAVIGG 308 Query: 159 GPMLMENGVINPR--------IHPNVASSKIRNGVGINKHGNA-VFLLSQQA------TN 203 P+++++G I N + R + +++ G + Sbjct: 309 WPLVLQDGENVAARAATLEGTISRNAEARHPRTAIAVSRSGQTAWLVTVDGRATNSVGMT 368 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + A + + L L DG S + G + Sbjct: 369 LVELAEFLR-TLGAWHALNFDGGGSTTMVIDGRVVN 403 >UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I064_CLOCE Length = 383 Score = 70.8 bits (172), Expect = 4e-11, Method: Composition-based stats. 
Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 31/221 (14%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA----PLG 102 ++ R + ++ E+ G + + + + + Sbjct: 148 VLILDKMGARFETFYSNIFLESKGNRVKINEMNRVGKNDDIILYIDKFGNTNRAEVKSTS 207 Query: 103 LYIENGQQKVALNLASG------------EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 L ++N + + G+ P + GDKV I + Sbjct: 208 LIVDNNKIISIIESTKEVNIKKGMYVISFYGDKSSLPDKIGLKTGDKVNIRIEPYLGYNY 267 Query: 151 EIQFAVQSGPMLMENGV-INP---RIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 A + G ML++NG + P + + + R +GI +G V +++ Sbjct: 268 ---QAYECGSMLVKNGKSVVPERDKWAGTLGNRDPRTVIGIKTNGKIVLVVADGRQPGYS 324 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + + K+ V LDG + + G I + Sbjct: 325 EGMTGKEMGEFL-VKIGVRDAAMLDGGATSQMIINGRIQNR 364 Score = 41.2 bits (95), Expect = 0.031, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 2/68 (2%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +P+ ERV+ + +G L+DI + A+NGG + + P G+ Sbjct: 74 EIYMLEFDPRDERVEFKPALSFDNIFG--FEKLSDICKRNGAYAAVNGGFFYQFGDPAGM 131 Query: 104 YIENGQQK 111 +GQ Sbjct: 132 VAIDGQML 139 >UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IFA0_9CLOT Length = 385 Score = 70.1 bits (170), Expect = 7e-11, Method: Composition-based stats. 
Identities = 34/221 (15%), Positives = 68/221 (30%), Gaps = 31/221 (14%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP----LG 102 V+ + R + ++ E G + + + + + Sbjct: 150 VLIVDKKGARFETFYSNITLEHKGNKIKINDMNRIGKNNDIVLYNDKFGSTNRAEIKNTT 209 Query: 103 LYIENGQQ------------KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 + ++N + +N+ S G P + AGDKV I Sbjct: 210 IIVDNNVITTLVESTKEVNIRKGMNVISFYGGKESIPEKMGLKAGDKVNIRMEPYLGYRY 269 Query: 151 EIQFAVQSGPMLMENGVIN----PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 A + G ML+++G + + + R +GI G + L++ Sbjct: 270 ---QAYECGSMLVKDGKTVVPERDKWAGTLGNRDPRTVIGIKTDGKIIMLVADGRQPGYS 326 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + Y KL V + LDG S + G++ + Sbjct: 327 EGMTGKEMGEYL-VKLGVRDVAMLDGGASSQMIINGSLRNR 366 Score = 44.7 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 52/157 (33%), Gaps = 20/157 (12%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +P+ ERV+ + +G L+DI + + A+NGG + + P G+ Sbjct: 76 EIYMLEFDPRDERVEFKPALSFDNIFG--FEKLSDICKRNEAYAAINGGFFYQFGEPTGM 133 Query: 104 YIENGQQKVA--------LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF- 154 +GQ A + G G+K+ I ++ + +I Sbjct: 134 VAIDGQMLTASTGLSPVLIVDKKGARFETFYSNITLEHKGNKIKINDMNRIGKNNDIVLY 193 Query: 155 ---------AVQSGPMLMENGVINPRIHPNVASSKIR 182 A ++ + + + + IR Sbjct: 194 NDKFGSTNRAEIKNTTIIVDNNVITTLVESTKEVNIR 230 >UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3T7_PELTS Length = 887 Score = 70.1 bits (170), Expect = 8e-11, Method: Composition-based stats. 
Identities = 26/154 (16%), Positives = 50/154 (32%), Gaps = 29/154 (18%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF--------- 154 ++NG + L G I P G + L+ ++ + Sbjct: 219 VVKNGVVQQVLTDQPG---VPIPPDGYVLRGHGQAARFILENLPAGSKVSYTYSVMPQGD 275 Query: 155 ----AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL------------LS 198 AV +L+E G + N+A R G++K G ++L + Sbjct: 276 KLFAAVGGQALLVEEGRLPAYFTQNIAGKHARTAAGVSKDGKTLYLVAVEKQSASDGTVV 335 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A + + + V + + LDG S Sbjct: 336 SRGMTQEELAEFLIS-IGVWRAVNLDGGGSTTLA 368 >UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales RepID=C9N2Q2_9ACTO Length = 1163 Score = 69.7 bits (169), Expect = 9e-11, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 48/158 (30%), Gaps = 25/158 (15%) Query: 96 ESYAPLGLY-IENGQQKVALNLASGEGNFFIRPGGVFYVA-------------GDKVGIV 141 + P+ + +G+ ++ G G+ + V V GD V I Sbjct: 240 DDARPVAEVAVRDGEVVS---VSDGPGSGPVPEDTVVLVGREAGAGLLAALEPGDPVKIA 296 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS--- 198 + AV +L+ +G ++ R VG ++ G + +++ Sbjct: 297 YRARTDGGAVPRTAVGGRELLVVDGAAQNHDGEGNNTAAPRTAVGFSEDGRTMQVVTVDG 356 Query: 199 ----QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + L LDG S + Sbjct: 357 RQTDSGGVTLTELGEMMR-RAGSYSALNLDGGGSSTLV 393 >UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31921 Length = 1426 Score = 69.3 bits (168), Expect = 1e-10, Method: Composition-based stats. 
Identities = 22/146 (15%), Positives = 49/146 (33%), Gaps = 22/146 (15%) Query: 104 YIENGQQKVALNLASG----EGNFFI--RPGGVFYVAGDKVGIVRLDAF----KTSKEIQ 153 + +G+ + G+F++ R + + G A+ ++++Q Sbjct: 211 LVTDGRVVAVSDGVGAGEIPAGSFYLVGRESAADAIRALRAGDEVRLAYGLSGDVAQQLQ 270 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL-LSQQA------TNFYD 206 FA+ +L+ +G + + S R +G G + L ++ Sbjct: 271 FAIGGNEVLVRDGQVV----GSDQSVHPRTAIGFKDGGRTLLLFVADGRQTQVLGMTTQK 326 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYM 232 A + E + LDG S + Sbjct: 327 VAQLLRDA-GAETAMNLDGGGSTTLV 351 >UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0H0_BACOV Length = 354 Score = 68.5 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 70/228 (30%), Gaps = 50/228 (21%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 V ++ + + K+ + NG++ + +NGG E +Y Sbjct: 108 VNVLEIDLLSNKYKVEFTYNNGDSL-------STTAQVRGAIGGINGGYEQE-----AIY 155 Query: 105 IE-NGQQKVALNLASGEGNFFIRPGGVFYVAG---------DKVGIVRLDAFKTSKEIQF 154 I NG + L G + + G Y G + G +D +K ++ Sbjct: 156 IRINGTNISEVTLPEGHLR-YWKHDGALYSDGKSDIGIIYGGRNGKAAIDTYK-QHSAKY 213 Query: 155 AVQSGPMLMENGVINPRIHPNVAS------------------SKIRNGVGINKHGNAVFL 196 + S P L+++ + R V + + + + + Sbjct: 214 LLASAPTLIDDYNPLGETFVGNYTMEQLESFDYEDYRRHQGVRHPRTVVAVTEDKDLLLV 273 Query: 197 LSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGA 236 + + + K N + L +DG S MY+KG Sbjct: 274 TIDGRWAGKAEGMSAKEVTLFLKKHFNPQYALNMDGGGSTTMYVKGKG 321 >UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01TI8_SOLUE Length = 340 Score = 68.1 bits (165), Expect = 2e-10, Method: Composition-based stats. 
Identities = 37/274 (13%), Positives = 76/274 (27%), Gaps = 53/274 (19%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI 79 F+ +TL+ + T+ +N + + G T+ D Sbjct: 34 FVGVTLITRTETSP-------RAETMHIAEINLNAPGIGVKLTSP-GGTLETVRQTTLDY 85 Query: 80 NSQGQVQMAMNGGIY----DESYAP--LGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 +Q Q+A+NG + + +GL NG + + Sbjct: 86 LNQEHAQLAINGEFFLPFPSSDFNSMLIGLAASNGNVYSSFEAPVQSYAIVTDAPALNID 145 Query: 134 AGDKVGIVRLDA-------FKTSKEIQFAVQSGPMLMENG---------VINPR--IHPN 175 + IV + + + + ++ NG +P + P Sbjct: 146 QSNHASIVHDNTSFVDGKHVLENVTLWNTIAGSAQIITNGVASIPTYLDATHPNGLLTPG 205 Query: 176 VASS-----------KIRNGVGINKHGNAVFLLS------QQATNFYDFACYAKAKLNVE 218 +S R +G+++ +FL + + + A ++ Sbjct: 206 GPASYSNSNSWYNLINARTVIGLSQDNQTLFLFTVDNAGGSRGMTLPEVANLLIGDYSIY 265 Query: 219 QLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 L LDG S A+ I+V Sbjct: 266 NALNLDGGGSTSM----AMQDPVTGMGRFINVSS 295 >UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=Clostridium thermocellum RepID=A3DIP4_CLOTH Length = 382 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 33/113 (29%), Gaps = 16/113 (14%) Query: 154 FAVQSGPMLMENGVINPRIHPN----VASSKIRNGVGINKHGNAVFLLSQQATNFY---- 205 A + G L+ +G + + + + R +G+ G V + Y Sbjct: 266 QAYECGSWLVRDGQVVAVDRDDWVGLLTNRDPRTAIGVKHDGKVVLVTVDGRQPGYSVGL 325 Query: 206 ---DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW----QRYPFVTMISVE 251 + A Y L ++ LDG S + + I V Sbjct: 326 SSRELAGYLL-TLGIKDAAMLDGGASTQMIVQNKTVNRLPARERMLGGGIVVV 377 >UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaeromyxobacter RepID=A7HB86_ANADF Length = 287 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. 
Identities = 38/243 (15%), Positives = 80/243 (32%), Gaps = 21/243 (8%) Query: 18 RIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 R + LF + ++P +K+ A GE GTL A Sbjct: 44 RTLEPGLEMGLFDGPPAGEEAR----PIAVVRIDPARFELKLLNASAPGE--GTLRTARA 97 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 G A+N +Y E Y + ++ P Sbjct: 98 WAERAG-ASAAINASMYQEDYRTSVSLMRTRHHVNQRRVSKDRSVLAFDP---LARGASP 153 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIR----NGVGINKHGNA 193 V I+ D +++ A Q+ L+++ + NV + R +G++ G Sbjct: 154 VRIIDRD----CDDLERAAQTYGTLVQSIRLVSCDRKNVWAPSARRFSAAAIGVDAKGRV 209 Query: 194 VFLLSQQATNFYDFA-CYAKAKLNVEQLLYLDGT-ISHMYMKGGAI-PWQRYPFVTMISV 250 +F+ ++ ++ + + Q +Y++G + ++++GG F + Sbjct: 210 LFIHARTPWPVHELVNALLALPIELRQAMYVEGGPEAQLFVRGGGRQHEWVGGFEHVPQA 269 Query: 251 ERK 253 E + Sbjct: 270 ENR 272 >UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FS46_9SPHI Length = 341 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 61/213 (28%), Gaps = 24/213 (11%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN-----SQGQVQMAMNGGIYDE 96 +++ ++ ++ E L S G++ +A+NG Sbjct: 102 PVSMHVLEIDLSKPKLAAQALGPFNEVLYATQILPEMAKYNESGSGGKMMVAINGDAVLT 161 Query: 97 SYA----PLGLYIENGQQKVALNLASGEGN---FFIRPGGVFYVAGDKVGIVRLDAFKTS 149 S P G YI G+Q + F + GV ++ +A + Sbjct: 162 SGTTVNAPSGSYIRYGRQIKTNTTTATAFTIPYFAVTKAGVPFIGNRPSATYPAEAVDLN 221 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-------AT 202 + L+ N + I A+ R +GIN + ++ Sbjct: 222 TIYHLVSGTNW-LVFNNNL---ITSTTATVSARTAIGINADKKVICVVVDGGDDAFSTGI 277 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 D K L + + +G +K Sbjct: 278 TLNDLGIVMK-TLGSSRAFFTNGGNFSAMVKRK 309 >UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus RepID=Q2JUI0_SYNJA Length = 411 Score = 67.8 bits (164), Expect = 4e-10, Method: Composition-based stats. 
Identities = 16/128 (12%), Positives = 38/128 (29%), Gaps = 16/128 (12%) Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------ASSKIRNGVG 186 + GD++ + + + +GP+L+ +G + R+ + Sbjct: 258 IPGDRLRLDWTVDPLELEAYPHILGAGPLLLLDGQVVLDAELEGFQPLFRRQQAARSAIC 317 Query: 187 INK---HGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 + + + L++ + A + +L L LDG S + G Sbjct: 318 LRQGQPDNRDLLLVAAGNAQENQGLTLLEMAQLLR-QLGCRHALNLDGGRSSTLVLGEEA 376 Query: 238 PWQRYPFV 245 Sbjct: 377 VNLEPEIG 384 Score = 41.2 bits (95), Expect = 0.031, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 4/92 (4%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 L D + V V+ +++ W L L A +G A+NGG ++ + Sbjct: 63 LEDRRILVSVVAVSLAAGQLRPIWADPAS--LVGLGELPAFSRERG-AVAAINGGFFNRN 119 Query: 98 -YAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 PLG G+ + L G +F Sbjct: 120 TRQPLGAIRLEGRWISSPILGRGAIAWFDAAN 151 >UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULM2_9CAUD Length = 556 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 60/202 (29%), Gaps = 21/202 (10%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY-APLGLYIENGQQKVALNLAS 118 ++ + A+N G+++ S P+G I NG + + S Sbjct: 339 LALTSSDGSLSGTKRPTLRYAKDNDTIFAVNAGLFNVSTVEPVGQLIINGISLINTPMTS 398 Query: 119 GEGNFFIRPGGV--FYVAGDKVGIVRLDAFKTSK----EIQFAVQSGPMLMENGVINPRI 172 G I P + + T+ +++AV + L++N I Sbjct: 399 DNG-VTINPNECYPLAIDANGDLTTYPRNADTADMIAAGVKYAVTAWGKLVDNFEIATTD 457 Query: 173 HPN---VASSKIRNGVGINKHGNAVFLLSQ---QATN------FYDFACYAKAKLNVEQL 220 N IR +G ++G + + + A K V+ Sbjct: 458 IENEIVHNGRYIRQSIGQYQNGYYCVCTVDMTRGSVTNEAGLYYKELAQIFVDK-GVKFA 516 Query: 221 LYLDGTISHMYMKGGAIPWQRY 242 LDG S + G Y Sbjct: 517 FSLDGGGSAETVLGKRQLNPIY 538 >UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A4FAG7_SACEN Length = 434 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. 
Identities = 25/147 (17%), Positives = 48/147 (32%), Gaps = 21/147 (14%) Query: 103 LYIENGQQKV----ALNLASGEGNFFI--RPGGVFYVAGDKVGIVRLDAFKTSK----EI 152 + + +G+ A G+F + R GV + K G ++ + E Sbjct: 257 IVVRDGKVAEVRPEPGAGAIAAGDFVLVGREDGVGELDDLKPGDPVSVDYQLAPVGVPEF 316 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNFY 205 +F V P+L +G P + + + R G + G + Sbjct: 317 RFVVGGFPIL-RDGTALPGL--DDQALAPRTSAGASADGKRVYLVAMDGRSQVSAGLTVS 373 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYM 232 + A K + + + LDG S + Sbjct: 374 ELADLLK-RSGADDAVNLDGGGSTTLV 399 >UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfovibrio vulgaris RepID=A1VEZ3_DESVV Length = 311 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 53/182 (29%), Gaps = 9/182 (4%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES-YAPLGLYI 105 A ++P ++ G +L A + + A+N +Y G Sbjct: 74 ALRIDPNLWDFSLHTATGEGGYPLSLGAWAEKL----NLGAAINSSMYLPDVRTSTGFLK 129 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 F P D + + VQ+ ++ N Sbjct: 130 AGEHVNNPRVTTKFGSFFVAAPDDPTLPQADLLDRAIDPWAERLPHYNMVVQNYRLISTN 189 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLD 224 I I VG + G +FL ++ + FA A L++ ++Y++ Sbjct: 190 RRI--LWPQGGPEYSI-AAVGQDGSGAILFLHCREPMTAHAFASMLLALPLDIHDVMYVE 246 Query: 225 GT 226 G Sbjct: 247 GG 248 >UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VYL6_9CYAN Length = 299 Score = 67.0 bits (162), Expect = 5e-10, Method: Composition-based stats. 
Identities = 30/248 (12%), Positives = 68/248 (27%), Gaps = 29/248 (11%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT------LHALLA 77 + P+ A + + + + ++ + AN G + Sbjct: 32 LVSPVAAESVSFRRSTILGVPLYQTHIDLTNPDTFIAIGLANNSTLGNHQGAIGEESFGN 91 Query: 78 DINSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 + +A +G + + +G + G + +R Sbjct: 92 MVRRYHAAVVA-SGTFFSKKDPKRLMGNMVSAGTFLKYSPWENYGTTLGLRV-------- 142 Query: 136 DKVGIVRLDAFKTSKEI---QFAVQSGPMLMENGVI-----NPRI-HPNVASSKIRNGVG 186 + + F++ GP L+ G + + P V R +G Sbjct: 143 GNQPELVTARVDGKPDWGQHWFSLTGGPRLLRKGKVWLAPRSEGFTDPRVMGVAHRCAIG 202 Query: 187 INKHGN-AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPF 244 G V + + A +A + + + +DG S +Y +G + + Sbjct: 203 FPASGKKLVLVTFLAPLPLWREAKVMRA-IGCSEAMNIDGGSSSALYHRGRILVNPKRML 261 Query: 245 VTMISVER 252 I V Sbjct: 262 TNAIVVYD 269 >UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LNU2_DESBD Length = 276 Score = 67.0 bits (162), Expect = 6e-10, Method: Composition-based stats. 
Identities = 25/240 (10%), Positives = 70/240 (29%), Gaps = 24/240 (10%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN--- 65 + + T + +A + + A L + + Q + + + + ++ Sbjct: 7 RAVFTCLVLCAPVASLHAEEWRLLAPGLELREFLIPDQVGDLEGRQSGMAVLRIDSDRFD 66 Query: 66 ---GEAWGTLHALLADINSQGQ-VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASG 119 G A GT ++ +N G++ G + + Sbjct: 67 VALGSALGTGRMRSMQEWARHSGFVAVINAGMFRADDRMRSTGYMRDAAVMINSF----- 121 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF--AVQSGPMLMENGVINPRIHPNVA 177 G + L + + + +++N + R N+ Sbjct: 122 ---IHPNYGAFLAFQPRDPSLPALRWVDRKSDPDWQAVLADYDGIIQNYRLISRERENLW 178 Query: 178 SSKIR----NGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 R + +++ G +F+ + + ++FA L++ +Y++G Sbjct: 179 EPSDRRHSGAAIAMDREGRLLFIHCRARLSLHEFAQALIDLPLDLIGAMYVEGGADAAMY 238 >UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=Lactococcus lactis RepID=A9QSN5_LACLK Length = 303 Score = 67.0 bits (162), Expect = 6e-10, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 66/184 (35%), Gaps = 13/184 (7%) Query: 59 MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLA 117 + + ++ +++ + + MN ++ + G I NG+ N Sbjct: 117 LKTATSADSPVVSMSEVISKYPNS----LIMNASGFNMTTGKITGFQINNGKLFKDWNSD 172 Query: 118 SGEGN-FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV 176 N F G + D + K + + G +L+++G P Sbjct: 173 KRATNAFVFNKNG----SSDIYNSTTPASEILKKGAEMSFSFGSILIKDGKSLPS--DGT 226 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + +I + +G +K N ++S +T + + KL++E + +DG S G Sbjct: 227 VNWEIHSFIGNDKDNNIYLIISDTSTGYQSIMEKFQ-KLHLENVQVMDGGGSSQMSLNGQ 285 Query: 237 IPWQ 240 I + Sbjct: 286 IIYP 289 >UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FD37_SACEN Length = 519 Score = 66.6 bits (161), Expect = 7e-10, Method: Composition-based stats. 
Identities = 24/165 (14%), Positives = 44/165 (26%), Gaps = 24/165 (14%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G G L A+ R G +V A V GP Sbjct: 344 GPVPAGGTVVQGLGQAAEWLVAHARAGEPLWVDQQ--IREESGAPLRLGPSDDIVNGGPE 401 Query: 162 LMENGVINPRIHPNV-------------ASSKIRNGVGINKHGNAVFLLSQQATN----- 203 L+ +G + + + R+ +G++ G + ++ Sbjct: 402 LVRDGQVRINLQEDGIIHDAPSFAYTWGLKRNPRSVIGVDAQGRVILATTEGRMPGFSDG 461 Query: 204 --FYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + A + +A L + LDG S M + + Sbjct: 462 WGLPEAAEFVRA-LGAVDAMALDGGGSAGMVVDDRVVTTPSDATG 505 >UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces RepID=Q9L2D5_STRCO Length = 428 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 45/183 (24%), Gaps = 14/183 (7%) Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 A+ + + +++N P G G+ + Sbjct: 229 DADARFTEDDDPGAEAVVAADGTVLSLNPNGRGGVTVPTG-----GRVLQGTGTGADWLR 283 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIR 182 PG D + P L+ N + R Sbjct: 284 AHATPGTDLAFEERLHDERFGDDIPLDSSVDVVNGHYP-LVHNAQY--AYTGQNTAVDPR 340 Query: 183 NGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 + + ++ G +F+ + +FA L L +DG S + A+ Sbjct: 341 SAIAVDGPGRTLFVTATGKSGRNGVTLDEFARILLD-LGAVDGLNMDGGGSTTLVVEQAV 399 Query: 238 PWQ 240 + Sbjct: 400 VNR 402 >UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744904 Length = 258 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. 
Identities = 34/193 (17%), Positives = 61/193 (31%), Gaps = 24/193 (12%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLY 104 Q T + +V++ ++ + E +H + + + NGG +D ++AP GL Sbjct: 50 QVVTFDASKVKVEVLARQ-DRETALPMHRWMTEA----RAIAGCNGGYFDPATFAPSGLQ 104 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--EIQFAVQSGPML 162 + G G G F V K I E + VQ P+L Sbjct: 105 VVEGLATGKYQQFGEWG-------GGFGVRSGKAQIWTEQEILAMPTFEAESFVQCSPVL 157 Query: 163 MENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK------LN 216 + +G + R + + ++ + A + Sbjct: 158 V-DG-VRRFTGAGEDVRARRTFIAHDGGARWALGVTSG-IGLRELAELLVNQGAGLLGFK 214 Query: 217 VEQLLYLDGTISH 229 V + L LDG S Sbjct: 215 VSRALNLDGGPST 227 >UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borrelia RepID=B5RQG1_BORRA Length = 269 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 51/200 (25%), Gaps = 31/200 (15%) Query: 48 YTVNPQTERVKMYWQKANGEA----WGTLHALLADINSQGQVQMAMNGGIYDESYA---P 100 V + + +K K + + + +V +A+N Y P Sbjct: 46 VIVKIKNKDLKFIISKPIYDTKMNNYYFKGQTTSQFLISNKVDIAINTSPYTIKGTMFYP 105 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 G+YI N + G + K + Sbjct: 106 NGIYIYNKKLISHAKKDQGIIIIKNNQI------------ILNPKHNEIKNSDYGFGGFF 153 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA-------TNFYDFACYAK 212 L++NG N R +G +K + + + + + A Sbjct: 154 SLIKNGKYTKNFKEN---KHPRTIIGTDKENKHLYLITVEGRGTNNSKGISLNE-AIDLS 209 Query: 213 AKLNVEQLLYLDGTISHMYM 232 V + LDG S + Sbjct: 210 LSYGVTNSINLDGGGSSTLV 229 >UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermotogaceae RepID=A5ILT0_THEP1 Length = 553 Score = 66.2 bits (160), Expect = 9e-10, Method: Composition-based stats. 
Identities = 25/129 (19%), Positives = 44/129 (34%), Gaps = 20/129 (15%) Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINP-------RIHPNVA-SSKIRNGVGINKHGNAV 194 I+ AV+ GP+L++NG P R +A + R + K G Sbjct: 420 SLQPNIPLRIKQAVEGGPLLIQNGAPIPDAWEEKARYGGGIAYAKAPRTVIA-TKDGKLW 478 Query: 195 FLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTM 247 FL+ + + + + ++ E + +DG S M + G + Sbjct: 479 FLVFEGYNHITRGLTYDELVDFLISR-GFEDAMCVDGGSSSVMAVAGSLFGRTENSTAAI 537 Query: 248 ---ISVERK 253 I V K Sbjct: 538 PVGIVVWEK 546 >UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 7805 RepID=A4CSS0_SYNPV Length = 549 Score = 66.2 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 48/175 (27%), Gaps = 19/175 (10%) Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 S L + I +G+ + + G VA + + + + + Sbjct: 368 YRSLSGEELAILIRDGRVTDQFSKTELARGVPLPEGASLVVARARAPLPAKPGDEVAIRL 427 Query: 153 ---------QFAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVF-- 195 + + GP+L++ G + + + R VG + + Sbjct: 428 KVSSPVGERRQVMAGGPLLLKEGQVVLRGRQEGFSSGFLGQAAPRTVVGQDPKHRWMLTL 487 Query: 196 -LLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 LS + A +L + L LDG S + I Sbjct: 488 EGLSGSDPTLLE-TTLALQQLGLSDALNLDGGSSTTMLIANRTVMTGRGVPPRIQ 541 >UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_BPSP1 Length = 437 Score = 66.2 bits (160), Expect = 9e-10, Method: Composition-based stats. 
Identities = 26/219 (11%), Positives = 57/219 (26%), Gaps = 18/219 (8%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES- 97 ++ +K+ N + ++ + N I++ + Sbjct: 30 KTDYFITHVPNLDKNGNLIKLRHGFQNDLINSGVGETARSFCNRHSASLVANASIWNTNN 89 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 G+ I++G+ + + V + I Sbjct: 90 GLIRGVQIQDGKVIQDAKDTNSYTLGIKSDNTLVMYPPS----VTAEQVLADGCIDAITA 145 Query: 158 SGPMLMENGVINP----RIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 PM +++G NV RN + + + +FL + + D Sbjct: 146 FYPM-IQDGAAFDLSGVTTVSNVTEHHPRNVIAQLPNKDLLFLTCEGRTKANQGMTYDDM 204 Query: 208 ACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 A+ V LDG S ++G + Sbjct: 205 IRILLAR-GVTTAYCLDGGGSSQTVVRGHLVNNPLDNNG 242 >UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WLB3_ACTMD Length = 1118 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 32/98 (32%), Gaps = 13/98 (13%) Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQA 201 A + A+ +L+ + + P S R VG + G +FLL+ Sbjct: 284 TRAGDGGSAPRAAIGGNQVLLRDSEVVAPDDP----SHPRTAVGFSADGRRMFLLTVDGR 339 Query: 202 -------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 N D A + L L LDG S + Sbjct: 340 QSAHLLGLNLKDVAEALRD-LGAHNALNLDGGGSSTLV 376 >UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WHW3_9SYNE Length = 335 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. 
Identities = 33/203 (16%), Positives = 56/203 (27%), Gaps = 48/203 (23%) Query: 74 ALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIE----------NGQQKVALNLASGEGN 122 A + + +NGG +D + I N + NL Sbjct: 98 ATIEAFAERTNADYIINGGFFDPHNGKTTSHLISQEQTVSDPADNERLINNSNLGQYMAQ 157 Query: 123 FFIRPGGVFYVAGDKVGIVRLD-------------------AFKTSKEIQFAVQSGPMLM 163 R Y + R EI A+ +GP L+ Sbjct: 158 ILNRSEFRVYRCRQASVVERGGLEGSLTEEAVVYDITFHNAPPPDGCEIDTAIGAGPQLL 217 Query: 164 ------------ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYD 206 + I R R+ +G+ G ++ ++ + Sbjct: 218 PADTSWVEGFIDYDDGILFRDAIGSRQPNARSAIGLYPDGAIALIMVEKSASSIGMTLLE 277 Query: 207 FACYAKAKLNVEQLLYLDGTISH 229 A +AK+ L + +LL LDG S Sbjct: 278 LADFAKS-LGITKLLNLDGGSSS 299 >UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VTW3_9FIRM Length = 765 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 42/107 (39%), Gaps = 12/107 (11%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP--RIHPNVASSKIRNGVGINKHGN 192 GDK+ I + S ++ + +L++N I P + ++ ++ R +GI G Sbjct: 246 GDKLKITYDIYPQKSWKML--IGGHSLLVDNSKIRPYKKDINSIGGTRARTCIGIADGGK 303 Query: 193 -AVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + + + + L ++ L LDG S + Sbjct: 304 SVYIVSCEGRTKRSSGMSLNELSNFMVN-LGCQRALNLDGGGSTAMV 349 >UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A4FAL4_SACEN Length = 1118 Score = 65.1 bits (157), Expect = 2e-09, Method: Composition-based stats. 
Identities = 15/88 (17%), Positives = 30/88 (34%), Gaps = 11/88 (12%) Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNF 204 + AV +L+ +GV+ + + R G + G + Sbjct: 301 PKAAVGGNKVLLRDGVVQ---QVDDTALHPRTAAGFSADGTRMWLVTIDGRQADSRGMTE 357 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYM 232 + A + ++ L + L LDG S + Sbjct: 358 RELAEHLRS-LGADDALNLDGGGSSTLL 384 >UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AEZ6_9BACT Length = 421 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 38/104 (36%), Gaps = 6/104 (5%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 + + L A +++++ AV +L+ G + + R+ VG+ G Sbjct: 284 AILDINWRLTDLPAGVHTRDVRDAVSGNVILIAAGRLQEGGGAFWTTRHPRSAVGVAADG 343 Query: 192 -NAVFLLSQQATNFY---DFACY--AKAKLNVEQLLYLDGTISH 229 A+ +L + F D + A L + LDG S Sbjct: 344 RRALLVLVDGRSLFSAGMDLSALRDYLAHLGAHDAVNLDGGGSS 387 >UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 n=1 Tax=Synechococcus sp. RCC307 RepID=A5GW09_SYNR3 Length = 563 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 42/123 (34%), Gaps = 10/123 (8%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNV-ASSKIRNGVGIN 188 GD V + R K E+ +Q GP+L+ G + R R+ VG + Sbjct: 430 GDGVSLERSMVPKAFAELPNLIQGGPLLLNQGKVVLNGKAERFSSAFMRQKAPRSVVGSD 489 Query: 189 KHGNAVFLLSQQAT---NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 + + Q + A + KL ++Q L LDG S M F Sbjct: 490 DELIWLLAVEGQGNAGPTLRETAELMQ-KLGLKQALNLDGGSSTRLMVRNRGQSSGRGFG 548 Query: 246 TMI 248 I Sbjct: 549 AAI 551 >UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanobacteria RepID=Q7U4D6_SYNPX Length = 589 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. 
Identities = 28/162 (17%), Positives = 48/162 (29%), Gaps = 19/162 (11%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYV---------AGDKVGIVRLDAFKTSKEIQ 153 L ++ G+ + AS I G V ++ + Sbjct: 418 LLVQGGRVTQRFDRASIRRGVLIPADGDLVVARGGTPLPAKPGDAVMLSQRTTSGLGDQA 477 Query: 154 FAVQSGPMLMENGVIN-----PRIHPNVAS-SKIRNGVGINKHGNAVFLL---SQQATNF 204 + GP+LM+ G I P+ + + R VG G + L + Sbjct: 478 NVLGGGPLLMQGGQIVLNGRAEGFSPDFLALAAPRTVVGQGTGGTWLLALRGAAGSDPTL 537 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A A +L ++ L LDG S + G Sbjct: 538 LETA-LAAQQLGLKDALNLDGGSSTTVVVAGRTVMNGRGSAP 578 >UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VW07_DESAS Length = 921 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 8/86 (9%) Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ------ATNF 204 ++ A+ +L+++G + P + + R VGI ++L + + T Sbjct: 269 NLRAALGGNTLLVQDGQLAP-FTQEITGNYARTAVGIMPDNKTLYLAAAENGNGSVGTTQ 327 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHM 230 A + A L V + + LDG S Sbjct: 328 TGMAEFLLA-LGVNRAVNLDGGGSTT 352 Score = 43.1 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 28/86 (32%), Gaps = 2/86 (2%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLG 102 + A V+ VK+ +L + G +NGG + ++ P+G Sbjct: 58 RIYAIKVDLSNPYVKIDTMIGADGTLNKAQSLTGMTSRTG-AVAGINGGFFQMKNHRPIG 116 Query: 103 LYIENGQQKVALNLASGEGNFFIRPG 128 L NG + + F + Sbjct: 117 LEFSNGNLVSSPAMREDMPGFAVTNN 142 >UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CET4_KOSOT Length = 558 Score = 63.9 bits (154), Expect = 5e-09, Method: Composition-based stats. 
Identities = 22/103 (21%), Positives = 42/103 (40%), Gaps = 14/103 (13%) Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQ-- 200 +++++FA++ GP+++ G + S R +GI K G +F++ Sbjct: 439 NEKLKFAIEGGPLIISRGKPVTEYEKSFYSSSLLDIRAPRTLIGITKSGTLMFMIIDGYQ 498 Query: 201 ----ATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIP 238 F + + K N E L+ +DG S + KG Sbjct: 499 MKSYGLTFKEMVEFFTDK-NFEYLMCVDGGKSSALVFKGEVFS 540 Score = 43.1 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 39/101 (38%), Gaps = 13/101 (12%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQK---ANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +S + + A ++P+ + +GE+ ++ +NGG + Sbjct: 249 VSGRRIILTALELDPERFDIHPVLANGRIPSGESLLSMAKRYDAFA-------VINGGYF 301 Query: 95 DES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 D S + P+GL IE+G+ +L FF G + Sbjct: 302 DPSSFYPIGLLIEDGKLISLPSLERPL--FFQTEDGKMGIG 340 >UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6C8_9BACT Length = 603 Score = 63.9 bits (154), Expect = 5e-09, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 33/103 (32%), Gaps = 12/103 (11%) Query: 151 EIQFAVQSGPMLMENGVI---NPRIHPN-VASSKIRNGVGINKHGNAVFLLSQQATNFY- 205 A+Q GP+L+++G I N I + R VG + ++ Sbjct: 459 GTVGALQGGPLLLKDGKIQRMNEGIAVGVINRRHPRTLVGRIGKTVWWLAV-DGRAPWHS 517 Query: 206 -----DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 D A L LL LDG S + G + Sbjct: 518 SGLTLDEATTLGQYLGFTDLLNLDGGGSTELLYHGYPVNKPSD 560 >UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30YC1_DESDG Length = 383 Score = 63.1 bits (152), Expect = 8e-09, Method: Composition-based stats. 
Identities = 32/245 (13%), Positives = 67/245 (27%), Gaps = 54/245 (22%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 +A D + TV ++P+ +++Y G T + + Sbjct: 93 GLELAESSAVFRDTSGTVALLRIDPRHYSLQLYTISEQGGPPQTPSEW----AALYNLDA 148 Query: 88 AMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNF----FIRPGGVFYVAGDKVGIVRL 143 +N ++ + Y+ NG + G+F + P G + Sbjct: 149 VINASMFLPDGSTSTGYMRNGTAANNSRINQRFGSFLVFSPLPPHAAASDGQPPAGGTQP 208 Query: 144 DAFKTS--------------------------------------KEIQFAVQSGPMLMEN 165 D + + + VQ+ M+ + Sbjct: 209 DPYAPAAAHTTAARNTPNDAGSDNQQLPAADVLDRYADDWQTLLPRYRGVVQNFRMISAD 268 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA---KLNVEQLLY 222 P S I +G + G +F+ S+ T + + Y L +Y Sbjct: 269 RK--PLWPEEGDSFSI-AAIGKDTQGRILFIHSRAQTTVRELSEYLLDICPSLGAT--MY 323 Query: 223 LDGTI 227 ++G Sbjct: 324 VEGGA 328 >UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WFN8_9SYNE Length = 309 Score = 63.1 bits (152), Expect = 8e-09, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 51/184 (27%), Gaps = 18/184 (9%) Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNF- 123 + TL + ++Q+A+N ++ P G+ + LA +G Sbjct: 97 QPHETLAQKTSSFLKTHRLQLAVNANFFNPFNETTPWQYSPREGELTNLVGLAISDGQIV 156 Query: 124 --FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 + + I + + + AV +G L P + + Sbjct: 157 SPGDKNYPALCFLEGRAEIRDEGV--CAPDTKQAV-AGLRLNLENRPPPDV-ETIYKFYP 212 Query: 182 RNGVGINKHGN-AVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 ++ G LL + A + +A L + LDG S Sbjct: 213 VCVAALDAEGTTLWLLLVDGKQPLYSEGMTRPEVADFLQA-LGATTAVQLDGGGSTTLAI 271 Query: 234 GGAI 237 Sbjct: 272 ASER 275 >UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X0Z8_DESRD Length = 302 Score = 63.1 bits (152), Expect = 9e-09, Method: Composition-based stats. 
Identities = 26/187 (13%), Positives = 62/187 (33%), Gaps = 10/187 (5%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + ++P+ R +Y A A T+ + + A+N +Y E Sbjct: 63 ELTVLRIDPEFFRFVLYSASAERGADRTVRQWVE----DKNLVAAINASMYWEDRETSTG 118 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA--VQSGPM 161 + N + G FF+ + + L+ + Q+A +Q+ + Sbjct: 119 LMTNFGHVNNGRVHPEFGAFFVANPRRAQLPPVDILDRSLEQQWRKRVAQYATIIQNYRL 178 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA-CYAKAKLNVEQL 220 L G + V + G+ +F+L + + + L++ Sbjct: 179 LDAKGE---NVWQASRQEHSSAAVAEDSQGHILFILQHEPVSVHALGSRLENLSLDLSTA 235 Query: 221 LYLDGTI 227 ++++G + Sbjct: 236 MFVEGGV 242 >UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AWB0_SYNWW Length = 497 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 53/174 (30%), Gaps = 24/174 (13%) Query: 102 GLYIENGQQKVALNLA---SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF---- 154 G+ + NG + G + G Y+ ++ + ++ + F Sbjct: 168 GVIVSNGHVSSITTSSFNIPENGFAIVYNGASSYLVDERYKVGDEVYYEVIIKPTFTNPS 227 Query: 155 -------AVQSGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 A+ +GP L+ NG + SS R+ +G G + Sbjct: 228 DWEEVQCAIGAGPSLIINGNVTASGEEEGFFEAKINTSSSPRSFIGATADGRIIMGNMDA 287 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 AT A ++ + + LDG S +Y + ++ + Sbjct: 288 ATLKKAAAAC--QRMGLVNAMCLDGGYSIALYYASAGVSLAGRDINNGLAFVGR 339 >UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TM75_9MICO Length = 1151 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. 
Identities = 15/93 (16%), Positives = 37/93 (39%), Gaps = 11/93 (11%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQA---- 201 ++ A+ ML+++ V+ P+ + R +G + G+ +F+L+ Sbjct: 308 NEGANLKMAISGNTMLLKDNVVLPQTDKAI---HPRTAIGFDADGSTMFVLTVDGRMAAS 364 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + K ++ + LDG S + Sbjct: 365 RGMTYAETGAFLK-EVGATSGINLDGGGSSTML 396 >UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN47_FERNB Length = 528 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 41/113 (36%), Gaps = 14/113 (12%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNV---ASSKIRNGVGIN 188 + +++ AV +GP+L+++ I ++ + R + I Sbjct: 402 GADVSVELYTDNGYKVKNAVGAGPLLIQDKKIIQDAAEEKLRYGGGIPTTRASRTIIAI- 460 Query: 189 KHGNAVFLLSQQ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 K G + + NF + A + +K E + LDG S + G + Sbjct: 461 KDGKVHLITIEGTNGTGMNFDEAAQFLLSK-GYESAMMLDGGGSTGMVYAGKL 512 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.136 0.343 Lambda K H 0.267 0.0416 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,392,485,134 Number of Sequences: 3077464 Number of extensions: 57340453 Number of successful extensions: 142531 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 193 Number of HSP's successfully gapped in prelim test: 309 Number of HSP's that attempted gapping in prelim test: 141244 Number of HSP's gapped (non-prelim): 678 length of query: 254 length of database: 1,040,396,356 effective HSP length: 126 effective length of query: 128 effective length of database: 652,635,892 effective search space: 83537394176 effective search space used: 83537394176 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 91 (39.6 bits)
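The effective-length figures in the statistics footer above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not BLAST's own code: the function names are hypothetical, and it assumes the simple length correction (subtract the effective HSP length from the query and from each database sequence) and the standard bit-score relation E ≈ search space × 2^(−bits). With composition-based statistics the E-values printed by BLAST can differ slightly from this approximation.

```python
def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Recompute the footer's effective lengths (hypothetical helper).

    Assumes the HSP length is subtracted once from the query and once
    from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def evalue_from_bits(bit_score, search_space):
    """Approximate E-value from a bit score: E = N * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the footer of this report.
eff_query, eff_db, space = effective_lengths(
    query_len=254,
    db_letters=1_040_396_356,
    db_seqs=3_077_464,
    hsp_len=126,
)
print(eff_query, eff_db, space)

# A 62.8-bit hit is reported with Expect = 1e-08 in this output.
print(f"{evalue_from_bits(62.8, space):.1e}")
```

Under these assumptions the three computed values match the footer's "effective length of query", "effective length of database", and "effective search space" entries, and the approximate E-value for a 62.8-bit hit is on the order of 1e-08, in line with the reported value.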