BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (326 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

UniRef50_P39370 Uncharacterized protein yjhS n=156 Tax=root RepI...   688   0.0
UniRef50_Q9FCW8 ORF616 n=118 Tax=root RepID=Q9FCW8_ECOLX              380   e-104
UniRef50_D2TQ80 Hypothetical prophage protein n=1 Tax=Citrobacte...   372   e-102
UniRef50_B2U4Z6 YjhS n=22 Tax=root RepID=B2U4Z6_SHIB3                 365   2e-99
UniRef50_Q6KCY2 Hypothetical adenine-specific methylase YfcB n=1...    81   4e-14
UniRef50_D2QHB3 Putative uncharacterized protein n=1 Tax=Spiroso...    50   1e-04
UniRef50_Q7UGU5 Probable acetyl xylan esterase AxeA n=2 Tax=Plan...    49   3e-04
UniRef50_Q07GI7 Conserved domain protein n=1 Tax=Roseobacter den...    43   0.017
UniRef50_C5WRX2 Putative uncharacterized protein Sb01g000530 n=2...
41 0.049 >UniRef50_P39370 Uncharacterized protein yjhS n=156 Tax=root RepID=YJHS_ECOLI Length = 326 Score = 688 bits (1775), Expect = 0.0, Method: Compositional matrix adjust. Identities = 326/326 (100%), Positives = 326/326 (100%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN Sbjct: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR Sbjct: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG Sbjct: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE Sbjct: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 Query: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS 300 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS Sbjct: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS 300 Query: 301 HFSTAARRGIISDRFVEAILQFWRER 326 HFSTAARRGIISDRFVEAILQFWRER Sbjct: 301 HFSTAARRGIISDRFVEAILQFWRER 326 >UniRef50_Q9FCW8 ORF616 n=118 Tax=root RepID=Q9FCW8_ECOLX Length = 616 Score = 380 bits (975), Expect = e-104, Method: Compositional matrix adjust. 
Identities = 179/324 (55%), Positives = 221/324 (68%), Gaps = 4/324 (1%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 ++ PD+YYV+ +AGQSNAMAYGEGLPLPD DAP PRIKQLAR + PGG C +N Sbjct: 55 VSGATEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN 114 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIP HC HDVQDM +HP A + QYG VGQ LHIA+KLLP+IP+NAGIL+VPCCR Sbjct: 115 DIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCR 174 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFT G+EGT+S GAS D+ RWG PLYQDL++RT+AAL KNP+N L CWMQG Sbjct: 175 GGSAFTQGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQG 234 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHS 238 EFD+ + +A P F M+ FR DL +++Q + D PW CGDTT+YWK + Sbjct: 235 EFDMSAATHAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQ 294 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGE--RGLTNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 Y+ +YG Y+N + FV F G TNAP EDPD ++GYYG+A R+ N ++ Sbjct: 295 YDTVYGGYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGNQVSS 354 Query: 297 LRSSHFSTAARRGIISDRFVEAIL 320 R +HFS+ ARR II DR AIL Sbjct: 355 NRPTHFSSWARRSIIPDRLATAIL 378 >UniRef50_D2TQ80 Hypothetical prophage protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TQ80_CITRO Length = 683 Score = 372 bits (955), Expect = e-102, Method: Compositional matrix adjust. 
Identities = 183/318 (57%), Positives = 215/318 (67%), Gaps = 5/318 (1%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 P+YYYVL +AGQSN MAYGEGLPLPD D P PRIKQLAR + P G C +NDIIP Sbjct: 112 PEYYYVLPLAGQSNGMAYGEGLPLPDSFDRPEPRIKQLARRSTVTPDGTSCTYNDIIPAD 171 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 HC HDVQDM G +HP A + QYG VGQ LHIA+KLLP+IP NAGIL+VPCCRG SAFT Sbjct: 172 HCLHDVQDMSGINHPKADLSKGQYGCVGQGLHIAKKLLPYIPQNAGILLVPCCRGASAFT 231 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 G +G++SE GAS D+ RWG PLYQDL+SRTRAAL KNP+N+ L WMQGE DL Sbjct: 232 TGDDGSFSEVSGASADSSRWGAGKPLYQDLLSRTRAALEKNPKNRLLAVVWMQGEADL-A 290 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 S H F MV+ FR DL +Q N PW CGDTT+YWK + YE +YG Sbjct: 291 SGSQQHNGLFTAMVQQFRTDLSPLAAQCVSGNAGTVPWICGDTTYYWKNTYATQYETVYG 350 Query: 245 NYQNNVLANIIFVDF--QQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHF 302 Y+N NI FV F + G+ TNAP EDPD ++ GYYG+A R+ ++ + R SHF Sbjct: 351 AYKNLTAQNIFFVPFLTDENGQNTPTNAPAEDPDIVAVGYYGAASRTQGSFVSTQRDSHF 410 Query: 303 STAARRGIISDRFVEAIL 320 S+ ARRGIISDR AIL Sbjct: 411 SSWARRGIISDRLSSAIL 428 >UniRef50_B2U4Z6 YjhS n=22 Tax=root RepID=B2U4Z6_SHIB3 Length = 462 Score = 365 bits (936), Expect = 2e-99, Method: Compositional matrix adjust. 
Identities = 178/325 (54%), Positives = 219/325 (67%), Gaps = 6/325 (1%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 ++A P+YY+V+ +AGQSN M+YGEGLPLP D P PRIKQLAR + PGG C +N Sbjct: 54 ISATSDPEYYFVVVLAGQSNGMSYGEGLPLPGTYDRPDPRIKQLARRSTVTPGGAACKYN 113 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIP HC HDVQDM +HP A + QYGTVGQ LHIA+KLLPFIP NAGIL+VPCCR Sbjct: 114 DIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCR 173 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFT G++GTYS+ GAS ++ RWG D PLY+DL+ RT+AAL KNP+N WMQG Sbjct: 174 GGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALEKNPKNVLFAVVWMQG 233 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHS 238 EFD + A+H F +V+ FR DL Q + PW CGDTT++WK+ + Sbjct: 234 EFDFGGTP-ANHAAQFGALVDKFRADLADMAGQCVGGSAGGVPWICGDTTYFWKQKNEST 292 Query: 239 YEAIYGNYQNNVLANIIFVDFQ--QQGERGLTNAPDEDPDDLSTGYYGSAYR-SPENWTT 295 Y+ +YG+Y+N NI FV F + G TN P+EDPD GYYGS +R S WT+ Sbjct: 293 YQTVYGSYKNKTEKNIHFVPFMTDENGVNVPTNKPEEDPDIPGIGYYGSKWRDSSATWTS 352 Query: 296 ALRSSHFSTAARRGIISDRFVEAIL 320 R+SHFS+ ARRGIISDR AIL Sbjct: 353 QDRASHFSSWARRGIISDRLATAIL 377 >UniRef50_Q6KCY2 Hypothetical adenine-specific methylase YfcB n=1 Tax=Escherichia coli RepID=Q6KCY2_ECOLX Length = 166 Score = 81.3 bits (199), Expect = 4e-14, Method: Compositional matrix adjust. 
Identities = 50/121 (41%), Positives = 62/121 (51%), Gaps = 13/121 (10%) Query: 9 YYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 ++YV+ +AGQSN MAYGEG+PLPD D P R+KQLAR PGG C FN+IIP H Sbjct: 13 FWYVIALAGQSNGMAYGEGIPLPDTLDKPESRVKQLARRKTITPGGKECKFNEIIPADHA 72 Query: 69 PHDVQ-----DMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 ++ +G H + N Q Q LH FIP G + VP R Sbjct: 73 LNNTVFFAGGQTRGAH--VFRNIQRQVERRQHQLH------GFIPRVIGAVAVPDIRRAE 124 Query: 124 A 124 A Sbjct: 125 A 125 >UniRef50_D2QHB3 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QHB3_9SPHI Length = 264 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 57/231 (24%), Positives = 80/231 (34%), Gaps = 54/231 (23%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDA-PHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 P + + GQSN G +P+ ED PH RI L + P P HF+ Sbjct: 27 PPRLKLFLLIGQSNMAGRG----IPEAEDKQPHQRIWMLTKEQTWVPARDPLHFD----- 77 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 P VG L A+KL+ I ++PC +GGS Sbjct: 78 --------------KPAVIG-------VGPGLAFAQKLVN-ADKKVNIGLIPCAQGGSGI 115 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 G Y T + Y D + R + AL + G W QGE D Sbjct: 116 DVWVPGAYYA-----------ATKSYPYDDAIKRAKKALE---TGELAGILWHQGESDSQ 161 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 T A + + +V R DL Q N+ P+F G ++ + P Sbjct: 162 TEKAAVYGEKLTALVSRIRTDL-----QAENV---PFFVGTLGDFYVQKHP 204 >UniRef50_Q7UGU5 Probable acetyl xylan esterase AxeA n=2 Tax=Planctomycetaceae RepID=Q7UGU5_RHOBA Length = 298 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 50/226 (22%), Positives = 77/226 (34%), Gaps = 49/226 (21%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 A + P ++ +AGQSN G+ + D + PHPR+ + P P HF+ Sbjct: 54 TAQLPPTGLHLFLLAGQSNMAGRGK---IADEDLQPHPRVLVFNKAGEWAPAIAPLHFDK 110 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 P G + TVG ++PC G Sbjct: 111 -------PRIAGVGLGRTFAIEYAENNPQATVG--------------------LIPCAVG 143 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 GS+ G + E T+T Y D + R + A+ + G W QGE Sbjct: 144 GSSLDVWQPGGFHE-----------STNTHPYDDCMKRMQQAIVA---GELKGILWHQGE 189 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYH-----SQLNNITDAPW 222 D + ++ N + E FR + + QL T+ PW Sbjct: 190 SDSNPALSKTYQSKLNELFERFRTEFGSPNVPIVIGQLGQFTEKPW 235 >UniRef50_Q07GI7 Conserved domain protein n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GI7_ROSDO Length = 617 Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust. Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 20/135 (14%) Query: 74 DMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTY 133 D PLA + + G +G + A L PD +L +PC +G + F+ G+ Sbjct: 113 DGPATSRPLA-HTGARLGNMGLDIQFAIDYLSDKPD-VTLLFIPCAQGATGFSNGA---- 166 Query: 134 SERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHP 193 W LY +R AA+ NP+ F G W QGE D T ++ Sbjct: 167 ------------WNPGDWLYNRETARINAAMNANPEFLFQGFLWHQGETD--TGIPGTYG 212 Query: 194 QHFNHMVEAFRRDLK 208 ++++ RRD+ Sbjct: 213 GLLDNLIAGLRRDVT 227 >UniRef50_C5WRX2 Putative uncharacterized protein Sb01g000530 n=2 Tax=Sorghum bicolor RepID=C5WRX2_SORBI Length = 278 Score = 41.2 bits (95), Expect = 0.049, Method: Compositional matrix adjust. 
Identities = 54/220 (24%), Positives = 82/220 (37%), Gaps = 68/220 (30%) Query: 11 YVLTVAGQSNAMAYG-------EGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDII 63 V +AGQSN G +G+ PD AP PRI +L+ P + + Sbjct: 31 LVFLLAGQSNMGGRGGATNGTWDGVVPPD--CAPSPRILRLS---------PSLRWEEAR 79 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLL---PFIPDNAGILIVPCCR 120 H D+ ++ G VG + A LL +P +A + +VPC + Sbjct: 80 EPLHAGIDLHNVLG---------------VGPGMPFAHALLRRHGRVPPHAVVGLVPCAQ 124 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNP-----------Q 169 G + + W TPLY ++ R RAALA N Sbjct: 125 GATPIAS------------------WSRGTPLYDRMLKRARAALANNNNNNNNNNNNAGS 166 Query: 170 NKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQ 209 ++ W QGE D + A + +EAF RD+++ Sbjct: 167 SRLAALLWYQGEADTIRRQDAD---VYTSRMEAFVRDVRR 203 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39370 Uncharacterized protein yjhS n=156 Tax=root RepI... 531 e-149 UniRef50_Q9FCW8 ORF616 n=118 Tax=root RepID=Q9FCW8_ECOLX 509 e-143 UniRef50_B2U4Z6 YjhS n=22 Tax=root RepID=B2U4Z6_SHIB3 499 e-140 UniRef50_D2TQ80 Hypothetical prophage protein n=1 Tax=Citrobacte... 494 e-138 UniRef50_Q7UGU5 Probable acetyl xylan esterase AxeA n=2 Tax=Plan... 218 2e-55 UniRef50_D2QHB3 Putative uncharacterized protein n=1 Tax=Spiroso... 212 1e-53 UniRef50_Q6KCY2 Hypothetical adenine-specific methylase YfcB n=1... 139 1e-31 Sequences not found previously or not previously below threshold: UniRef50_D1N2H8 Putative uncharacterized protein n=2 Tax=Victiva... 128 3e-28 UniRef50_A3I3A6 Probable acetyl xylan esterase AxeA n=1 Tax=Algo... 121 3e-26 UniRef50_B5JF90 Conserved domain protein n=1 Tax=Verrucomicrobia... 119 9e-26 UniRef50_Q01TY8 Putative uncharacterized protein n=1 Tax=Candida... 119 1e-25 UniRef50_A6CAJ7 Probable acetyl xylan esterase AxeA n=1 Tax=Plan... 115 2e-24 UniRef50_C6XV32 Putative uncharacterized protein n=1 Tax=Pedobac... 
109 2e-22
UniRef50_UPI00017448C4 hypothetical protein VspiD_04945 n=1 Tax=...   105   2e-21
UniRef50_C0ABN2 Putative uncharacterized protein n=2 Tax=Opituta...   104   5e-21
UniRef50_B9MVW2 Predicted protein n=11 Tax=Magnoliophyta RepID=B...   101   4e-20
UniRef50_B9XLT7 Putative uncharacterized protein n=1 Tax=bacteri...   100   1e-19
UniRef50_A8FC47 Possible acetylxylan esterase n=19 Tax=Bacteria ...    91   7e-17
UniRef50_C5WRX2 Putative uncharacterized protein Sb01g000530 n=2...    91   7e-17
UniRef50_A9RQK4 Predicted protein n=1 Tax=Physcomitrella patens ...    86   2e-15
UniRef50_B0BZZ0 Putative uncharacterized protein n=1 Tax=Acaryoc...    85   3e-15
UniRef50_Q8L9J9 Probable carbohydrate esterase At4g34215 n=10 Ta...    81   4e-14
UniRef50_C0ACS7 Putative uncharacterized protein n=1 Tax=Opituta...    81   5e-14
UniRef50_Q84M79 Os03g0857600 protein n=4 Tax=Poaceae RepID=Q84M7...    79   2e-13
UniRef50_A6CD12 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A6C...    79   3e-13
UniRef50_Q7XSV9 OSJNBa0027H06.16 protein n=5 Tax=Poaceae RepID=Q...    78   3e-13
UniRef50_B8I0M1 Carbohydrate binding family 6 n=6 Tax=Clostridiu...    78   4e-13
UniRef50_C7J1I1 Os04g0110400 protein n=2 Tax=Poaceae RepID=C7J1I...    78   5e-13
UniRef50_Q9LF91 Putative uncharacterized protein F8J2_180 n=1 Ta...    75   2e-12
UniRef50_C0A5B6 Putative uncharacterized protein n=1 Tax=Opituta...    73   2e-11
UniRef50_A9GML8 Iduronate-2-sulfatase n=1 Tax=Sorangium cellulos...    71   6e-11
UniRef50_Q9F106 Carbohydrate binding family 6 n=1 Tax=Fibrobacte...    71   7e-11
UniRef50_Q8A041 Acetyl xylan esterase A n=8 Tax=Bacteroides RepI...    70   8e-11
UniRef50_Q07GI7 Conserved domain protein n=1 Tax=Roseobacter den...    70   9e-11
UniRef50_C7NVN3 Carbohydrate-binding family V/XII n=1 Tax=Halorh...    70   2e-10
UniRef50_A6C656 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris...    66   2e-09
UniRef50_B5JIZ7 Conserved domain protein n=1 Tax=Verrucomicrobia...    65   3e-09
UniRef50_B0RC94 Putative surface-anchored protein n=1 Tax=Clavib...    65   3e-09
UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal...    65   4e-09
UniRef50_D2R5Y4 Putative uncharacterized protein n=1 Tax=Pirellu...    59   3e-07
UniRef50_A3K171 Probable acetyl xylan esterase AxeA n=2 Tax=Bact...    58   4e-07
UniRef50_C5BRL9 Acetylxylan esterase / xylanase n=1 Tax=Teredini...    55   3e-06
UniRef50_C6VVT7 Putative uncharacterized protein n=2 Tax=Dyadoba...    54   6e-06
UniRef50_A6DRT7 Putative uncharacterized protein n=1 Tax=Lentisp...    54   6e-06
UniRef50_C1ZKK3 Putative uncharacterized protein n=1 Tax=Plancto...    54   8e-06
UniRef50_C9RS24 Carbohydrate binding family 6 n=1 Tax=Fibrobacte...    53   1e-05
UniRef50_B4DB21 Putative uncharacterized protein n=1 Tax=Chthoni...    53   1e-05
UniRef50_UPI00016C0614 hypothetical protein Epulo_09645 n=1 Tax=...    52   2e-05
UniRef50_C9RKV3 Putative uncharacterized protein n=1 Tax=Fibroba...    52   3e-05
UniRef50_B2UM46 Putative uncharacterized protein n=1 Tax=Akkerma...    52   3e-05
UniRef50_C3QEP9 Glycoside hydrolase family 43 n=6 Tax=root RepID...    51   4e-05
UniRef50_A6DJ18 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane...    51   4e-05
UniRef50_Q11TG0 CHU large protein; candidate polyfunctional acet...    51   4e-05
UniRef50_C9RND1 Putative uncharacterized protein n=1 Tax=Fibroba...    51   6e-05
UniRef50_C9RLV5 Carbohydrate binding family 6 n=1 Tax=Fibrobacte...    51   8e-05
UniRef50_B7AIJ5 Putative uncharacterized protein n=1 Tax=Bactero...    51   8e-05
UniRef50_A6DIF1 Acetyl xylan esterase A n=1 Tax=Lentisphaera ara...    50   1e-04
UniRef50_A9GHE3 Putative uncharacterized protein n=1 Tax=Sorangi...    50   1e-04
UniRef50_UPI0001C367C9 hypothetical protein ChatD1_04961 n=1 Tax...    50   1e-04
UniRef50_D2R922 Putative uncharacterized protein n=1 Tax=Pirellu...    49   2e-04
UniRef50_A0LNW1 Putative uncharacterized protein n=1 Tax=Syntrop...    48   5e-04
UniRef50_UPI0001BC8395 sialate O-acetylesterase n=1 Tax=Bacteroi...    48   5e-04
UniRef50_B5JPM4 Conserved domain protein n=1 Tax=Verrucomicrobia...
44 0.007 UniRef50_A8ITT3 Predicted protein n=1 Tax=Chlamydomonas reinhard... 43 0.016 >UniRef50_P39370 Uncharacterized protein yjhS n=156 Tax=root RepID=YJHS_ECOLI Length = 326 Score = 531 bits (1367), Expect = e-149, Method: Composition-based stats. Identities = 326/326 (100%), Positives = 326/326 (100%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN Sbjct: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR Sbjct: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG Sbjct: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE Sbjct: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 Query: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS 300 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS Sbjct: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS 300 Query: 301 HFSTAARRGIISDRFVEAILQFWRER 326 HFSTAARRGIISDRFVEAILQFWRER Sbjct: 301 HFSTAARRGIISDRFVEAILQFWRER 326 >UniRef50_Q9FCW8 ORF616 n=118 Tax=root RepID=Q9FCW8_ECOLX Length = 616 Score = 509 bits (1309), Expect = e-143, Method: Composition-based stats. 
Identities = 179/324 (55%), Positives = 221/324 (68%), Gaps = 4/324 (1%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 ++ PD+YYV+ +AGQSNAMAYGEGLPLPD DAP PRIKQLAR + PGG C +N Sbjct: 55 VSGATEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN 114 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIP HC HDVQDM +HP A + QYG VGQ LHIA+KLLP+IP+NAGIL+VPCCR Sbjct: 115 DIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCR 174 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFT G+EGT+S GAS D+ RWG PLYQDL++RT+AAL KNP+N L CWMQG Sbjct: 175 GGSAFTQGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQG 234 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHS 238 EFD+ + +A P F M+ FR DL +++Q + D PW CGDTT+YWK + Sbjct: 235 EFDMSAATHAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQ 294 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGE--RGLTNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 Y+ +YG Y+N + FV F G TNAP EDPD ++GYYG+A R+ N ++ Sbjct: 295 YDTVYGGYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGNQVSS 354 Query: 297 LRSSHFSTAARRGIISDRFVEAIL 320 R +HFS+ ARR II DR AIL Sbjct: 355 NRPTHFSSWARRSIIPDRLATAIL 378 >UniRef50_B2U4Z6 YjhS n=22 Tax=root RepID=B2U4Z6_SHIB3 Length = 462 Score = 499 bits (1285), Expect = e-140, Method: Composition-based stats. 
Identities = 178/325 (54%), Positives = 219/325 (67%), Gaps = 6/325 (1%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 ++A P+YY+V+ +AGQSN M+YGEGLPLP D P PRIKQLAR + PGG C +N Sbjct: 54 ISATSDPEYYFVVVLAGQSNGMSYGEGLPLPGTYDRPDPRIKQLARRSTVTPGGAACKYN 113 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIP HC HDVQDM +HP A + QYGTVGQ LHIA+KLLPFIP NAGIL+VPCCR Sbjct: 114 DIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCR 173 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFT G++GTYS+ GAS ++ RWG D PLY+DL+ RT+AAL KNP+N WMQG Sbjct: 174 GGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALEKNPKNVLFAVVWMQG 233 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHS 238 EFD + A+H F +V+ FR DL Q + PW CGDTT++WK+ + Sbjct: 234 EFDFGGTP-ANHAAQFGALVDKFRADLADMAGQCVGGSAGGVPWICGDTTYFWKQKNEST 292 Query: 239 YEAIYGNYQNNVLANIIFVDFQ--QQGERGLTNAPDEDPDDLSTGYYGSAYR-SPENWTT 295 Y+ +YG+Y+N NI FV F + G TN P+EDPD GYYGS +R S WT+ Sbjct: 293 YQTVYGSYKNKTEKNIHFVPFMTDENGVNVPTNKPEEDPDIPGIGYYGSKWRDSSATWTS 352 Query: 296 ALRSSHFSTAARRGIISDRFVEAIL 320 R+SHFS+ ARRGIISDR AIL Sbjct: 353 QDRASHFSSWARRGIISDRLATAIL 377 >UniRef50_D2TQ80 Hypothetical prophage protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TQ80_CITRO Length = 683 Score = 494 bits (1272), Expect = e-138, Method: Composition-based stats. 
Identities = 183/322 (56%), Positives = 216/322 (67%), Gaps = 5/322 (1%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 + P+YYYVL +AGQSN MAYGEGLPLPD D P PRIKQLAR + P G C +NDI Sbjct: 108 SSTEPEYYYVLPLAGQSNGMAYGEGLPLPDSFDRPEPRIKQLARRSTVTPDGTSCTYNDI 167 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 IP HC HDVQDM G +HP A + QYG VGQ LHIA+KLLP+IP NAGIL+VPCCRG Sbjct: 168 IPADHCLHDVQDMSGINHPKADLSKGQYGCVGQGLHIAKKLLPYIPQNAGILLVPCCRGA 227 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 SAFT G +G++SE GAS D+ RWG PLYQDL+SRTRAAL KNP+N+ L WMQGE Sbjct: 228 SAFTTGDDGSFSEVSGASADSSRWGAGKPLYQDLLSRTRAALEKNPKNRLLAVVWMQGEA 287 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHSYE 240 DL S H F MV+ FR DL +Q N PW CGDTT+YWK + YE Sbjct: 288 DL-ASGSQQHNGLFTAMVQQFRTDLSPLAAQCVSGNAGTVPWICGDTTYYWKNTYATQYE 346 Query: 241 AIYGNYQNNVLANIIFVDF--QQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALR 298 +YG Y+N NI FV F + G+ TNAP EDPD ++ GYYG+A R+ ++ + R Sbjct: 347 TVYGAYKNLTAQNIFFVPFLTDENGQNTPTNAPAEDPDIVAVGYYGAASRTQGSFVSTQR 406 Query: 299 SSHFSTAARRGIISDRFVEAIL 320 SHFS+ ARRGIISDR AIL Sbjct: 407 DSHFSSWARRGIISDRLSSAIL 428 >UniRef50_Q7UGU5 Probable acetyl xylan esterase AxeA n=2 Tax=Planctomycetaceae RepID=Q7UGU5_RHOBA Length = 298 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. 
Identities = 50/226 (22%), Positives = 77/226 (34%), Gaps = 49/226 (21%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 A + P ++ +AGQSN G+ + D + PHPR+ + P P HF+ Sbjct: 54 TAQLPPTGLHLFLLAGQSNMAGRGK---IADEDLQPHPRVLVFNKAGEWAPAIAPLHFDK 110 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 P G + TVG ++PC G Sbjct: 111 -------PRIAGVGLGRTFAIEYAENNPQATVG--------------------LIPCAVG 143 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 GS+ G + E T+T Y D + R + A+ + G W QGE Sbjct: 144 GSSLDVWQPGGFHE-----------STNTHPYDDCMKRMQQAIVA---GELKGILWHQGE 189 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYH-----SQLNNITDAPW 222 D + ++ N + E FR + + QL T+ PW Sbjct: 190 SDSNPALSKTYQSKLNELFERFRTEFGSPNVPIVIGQLGQFTEKPW 235 >UniRef50_D2QHB3 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QHB3_9SPHI Length = 264 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. Identities = 55/231 (23%), Positives = 78/231 (33%), Gaps = 54/231 (23%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDA-PHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 P + + GQSN G +P+ ED PH RI L + P P HF+ Sbjct: 27 PPRLKLFLLIGQSNMAGRG----IPEAEDKQPHQRIWMLTKEQTWVPARDPLHFDK---- 78 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 P VG L A+KL+ I ++PC +GGS Sbjct: 79 ---------------PAVIG-------VGPGLAFAQKLVN-ADKKVNIGLIPCAQGGSGI 115 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 G Y T + Y D + R + AL + G W QGE D Sbjct: 116 DVWVPGAYYA-----------ATKSYPYDDAIKRAKKALE---TGELAGILWHQGESDSQ 161 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 T A + + +V R DL + P+F G ++ + P Sbjct: 162 TEKAAVYGEKLTALVSRIRTDL--------QAENVPFFVGTLGDFYVQKHP 204 >UniRef50_Q6KCY2 Hypothetical adenine-specific methylase YfcB n=1 Tax=Escherichia coli RepID=Q6KCY2_ECOLX Length = 166 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. 
Identities = 50/121 (41%), Positives = 62/121 (51%), Gaps = 13/121 (10%) Query: 9 YYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 ++YV+ +AGQSN MAYGEG+PLPD D P R+KQLAR PGG C FN+IIP H Sbjct: 13 FWYVIALAGQSNGMAYGEGIPLPDTLDKPESRVKQLARRKTITPGGKECKFNEIIPADHA 72 Query: 69 PHDV-----QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 ++ +G H + N Q Q LH FIP G + VP R Sbjct: 73 LNNTVFFAGGQTRGAH--VFRNIQRQVERRQHQLH------GFIPRVIGAVAVPDIRRAE 124 Query: 124 A 124 A Sbjct: 125 A 125 >UniRef50_D1N2H8 Putative uncharacterized protein n=2 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N2H8_9BACT Length = 461 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 56/286 (19%), Positives = 98/286 (34%), Gaps = 60/286 (20%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 + ++++ +AGQSN G P + PHPR+ R P P H++ Sbjct: 29 ENFHLILLAGQSNMAGRGVISP---SDRIPHPRVLMQNRQGEWVPAVEPVHYDK------ 79 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + VG A +L P + ++P GGS + Sbjct: 80 ---------------------DFAGVGPGRSFAIRLAASDPA-ITVGLIPAACGGSPIAS 117 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTS 187 G Y E+ T + Y D V RTR A+ + QGE D S Sbjct: 118 WQPGAYHEQ-----------TQSHPYDDAVRRTRRAMK---DGTLKAILFHQGEADCYGS 163 Query: 188 DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI--YGN 245 + + ++ + R++L P+ G + + +E + +++ Sbjct: 164 APNQYRERLFTLIRSLRQELGAPA--------CPFIIGQLSRFPQETWSEGKKSVDAAHR 215 Query: 246 YQNNVLANIIFVDFQQ---QGERGLTNAPDEDPDDLSTGYYGSAYR 288 L + FV +Q +R +AP + + YYG+ R Sbjct: 216 AAAAELPEVGFVSSEQLTSNPDRIHFDAPSQ--REFGRRYYGTYRR 259 >UniRef50_A3I3A6 Probable acetyl xylan esterase AxeA n=1 Tax=Algoriphagus sp. PR1 RepID=A3I3A6_9SPHI Length = 274 Score = 121 bits (304), Expect = 3e-26, Method: Composition-based stats. 
Identities = 48/237 (20%), Positives = 79/237 (33%), Gaps = 56/237 (23%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDA-PHPRIKQLARFAHTHPGGPPCHFN 60 + + +++ + GQSN G L + D HPR+ L + P HF+ Sbjct: 28 SQKSEKENFHLYLLMGQSNMAGRG----LVEAIDTLSHPRVWMLDSTMNWVLARDPMHFD 83 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 VG L + + P + I ++P Sbjct: 84 K---------------------------PVAGVGLGLTFGKIMANENP-SVKIGLIPTAV 115 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGS+ A D+ T T Y D++ R + AL G W QG Sbjct: 116 GGSSINAW-----------FKDSIHNQTKTFPYNDMIDRAKKAL---GDGTLKGILWHQG 161 Query: 181 EFDLMTSDY-ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 E D + A++P F M+++ ++DL I P G+ ++ P Sbjct: 162 ESDTRNEESIANYPAKFYAMIDSLQKDLG--------IEPVPIVMGEIGHFFYGRAP 210 >UniRef50_B5JF90 Conserved domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JF90_9BACT Length = 265 Score = 119 bits (299), Expect = 9e-26, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 72/220 (32%), Gaps = 51/220 (23%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 D ++++ +AGQSN G+ + +P++ L + P H++ + Sbjct: 33 DSFHLILLAGQSNMAGRGD---MEGPRVESNPQVLALDKEGRWVVAKDPLHWDKSV---- 85 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 VG L AR+ L P I ++P GGS ++ Sbjct: 86 -----------------------AGVGLGLSFAREYLKDHP-GVTIGLIPAACGGSPISS 121 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTS 187 G Y ++ TD+ Y D + R A G W QGE D Sbjct: 122 WEAGAYFDQ-----------TDSHPYDDALKRVSRATQ---DGTLKGVLWHQGESDSHEG 167 Query: 188 DYASHPQHFNHMVEAFRR-----DLKQYHSQLNNITDAPW 222 + +++ FR DL QL + W Sbjct: 168 LSDLYEAKLEGLIKRFRVEWDREDLPVILGQLGQF-EVKW 206 >UniRef50_Q01TY8 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01TY8_SOLUE Length = 252 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. 
Identities = 47/224 (20%), Positives = 72/224 (32%), Gaps = 60/224 (26%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 + + GQSN G + +++ P PR+ L + P P HF+ P Sbjct: 21 IFLLIGQSNMAGRG---VVEEQDRQPIPRVFMLNKAMEWVPAIDPVHFDK-------PDI 70 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 K+L NA I +VP GG++ G Sbjct: 71 AGVGLARTF--------------------GKVLAAADPNASIGLVPAAFGGTSLEEWKVG 110 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM-TSDYA 190 LY++ V R + A++ K G W QGE D + Sbjct: 111 G------------------KLYEEAVRRAKFAMS---SGKLRGILWHQGEADAGKKELAS 149 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKEN 234 S+ Q F+ M+ R DL + D P G + E+ Sbjct: 150 SYRQRFSAMITQLRADLGEP--------DVPVVVGQLGEFLSES 185 >UniRef50_A6CAJ7 Probable acetyl xylan esterase AxeA n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAJ7_9PLAN Length = 278 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 79/246 (32%), Gaps = 62/246 (25%) Query: 2 NAIIS-PDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 A + + +++ + GQSN G+ P + HPR+ +L + + P P HF+ Sbjct: 41 TAELPEKEKFHIYLLIGQSNMAGRGKVDP---ASNKAHPRVLKLDKAGNWVPATDPLHFD 97 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 P G T+G ++P Sbjct: 98 K-------PKIAGVGPGSGFGPVIADAYPEVTIG--------------------LIPAAV 130 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GG+ + +G LY+ V + + GA W QG Sbjct: 131 GGTPLSRWVKGG------------------DLYERAVKLAK---ENQKKGVIKGAIWHQG 169 Query: 181 EFD-LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKEN-FPHS 238 E D Y S+ + + M+ R DL + D P+ G+ ++ P Sbjct: 170 EGDSSNPKLYNSYQKRLSGMIADLRTDLGEP--------DMPFVMGELGEFFTRPGAPTV 221 Query: 239 YEAIYG 244 +A++G Sbjct: 222 NQALHG 227 >UniRef50_C6XV32 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV32_PEDHD Length = 276 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. 
Identities = 48/298 (16%), Positives = 77/298 (25%), Gaps = 70/298 (23%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 + + GQSN G L + P+ + P H++ Sbjct: 41 PGPELEIYLLLGQSNMAGRGPLLAEYTAMEQPNVLVW--DSEGKWIIARHPLHYD----- 93 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 + + VG L + P N I +VPC GG+ Sbjct: 94 ---------------------KPKVAGVGPGLSFGFAMARSKP-NVRIGLVPCAVGGTNI 131 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 G + T+T + D R R A+ G W QGE + Sbjct: 132 DVWKPGAMDK-----------ATNTHPFDDAEMRIREAMK---YGVVKGMIWHQGEANSG 177 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGN 245 + + N ++ R+ + P G+ + +Y+ Sbjct: 178 AQNMIGYLDKLNELITRIRKMVGN--------EKLPVVVGELG-----RYKTNYQQF--- 221 Query: 246 YQNNVLANIIFVDFQQQGERGLTNAPDEDP------DDLSTGYYGSAYRSPENWTTAL 297 N +LA T+ D D S YG Y W Sbjct: 222 --NKMLAG---APQMIPNLALATSESLVDKGDLTHFDSPSATAYGKRYAEKMLWLQQN 274 >UniRef50_UPI00017448C4 hypothetical protein VspiD_04945 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448C4 Length = 650 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. 
Identities = 45/231 (19%), Positives = 67/231 (29%), Gaps = 60/231 (25%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 + + + + GQSN G LPL DR R+ + + PG P H + Sbjct: 413 EKETFDLYLLIGQSNMAGRG-LLPLEDRLSR--ERVLKFSARNAWAPGVEPLHTDK---- 465 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 P G T+G ++PC GG+ Sbjct: 466 ---PAVAGAGLGMSFARQMAEAKPKVTIG--------------------LIPCAVGGTPL 502 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 +G LY + R R A+ G W QGE D Sbjct: 503 DRWVKGG------------------DLYAAALVRAREAMK---SGNLKGILWHQGEADSG 541 Query: 186 T-SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENF 235 + S+ Q MV+ R DL D P+ G+ + + Sbjct: 542 SEEKAGSYAQRLAGMVKDLRADLG--------AGDVPFVAGELGEFLERTN 584 >UniRef50_C0ABN2 Putative uncharacterized protein n=2 Tax=Opitutaceae bacterium TAV2 RepID=C0ABN2_9BACT Length = 301 Score = 104 bits (259), Expect = 5e-21, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 72/230 (31%), Gaps = 63/230 (27%) Query: 4 IISP--DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 P + + + + GQSN G P + P R+ L + G P HF+ Sbjct: 43 ATPPKAENFDLYLLVGQSNMSGRGRVTP---ADSQPDTRVLVLGKDGEWLLQGEPVHFD- 98 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 T+ VG A+++ P I ++PC G Sbjct: 99 --------------------------TRNAAVGLGFAFAKRMADHSP-GVTIGLIPCAVG 131 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 + G LY++ V R A + G W QGE Sbjct: 132 ATPQKRWMPGG------------------DLYEEAVRRAGIAQQ---SGRLRGILWHQGE 170 Query: 182 FDLMT-SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 + + ++ ++ +VE FRRDL N P+ G+ + Sbjct: 171 SETGSLVRSKAYGENLAKIVEGFRRDL--------NAPGVPFVAGELGEF 212 >UniRef50_B9MVW2 Predicted protein n=11 Tax=Magnoliophyta RepID=B9MVW2_POPTR Length = 297 Score = 101 bits (250), Expect = 4e-20, Method: Composition-based stats. 
Identities = 44/222 (19%), Positives = 73/222 (32%), Gaps = 59/222 (26%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYG-------EGLP----LPDREDAPHPRIKQLARFAH 49 ++ + + + +AGQSN G G+P + + P+P I +L+ Sbjct: 17 ISEQLPQN---IFILAGQSNMAGRGGVVNNTKNGIPSWDGIVPVQCQPNPSILRLSASLT 73 Query: 50 THPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPD 109 P H A + VG + A +L +P+ Sbjct: 74 WVQAHEPLH------------------------ADIDYNKTNGVGPGMSFANAILTKVPN 109 Query: 110 NAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQ 169 I +VPC GG++ + ++G + LY LV RT+ AL + Sbjct: 110 FGSIGLVPCAIGGTSISEWAKGGF------------------LYDQLVRRTQFALQRG-- 149 Query: 170 NKFLGACWMQGEFDLM-TSDYASHPQHFNHMVEAFRRDLKQY 210 W QGE D D ++ + R DL Sbjct: 150 GVIGAMLWYQGESDTQIREDADAYKGRLDRFFIDLRADLGYP 191 >UniRef50_B9XLT7 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XLT7_9BACT Length = 266 Score = 99.6 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 45/228 (19%), Positives = 70/228 (30%), Gaps = 60/228 (26%) Query: 10 YYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 + + + GQSN G+ + + HPR+ L P Sbjct: 34 FQIYLLMGQSNMAGRGK---VGLEDTTTHPRVLLLNTNNTWELAMEPVT----------- 79 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 D + +G VG L + + N I +VPC GG+ + Sbjct: 80 KDRKAGRG---------------VGPGLAFGKSMAEK-NSNVTIGLVPCAVGGTPLSRWQ 123 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD- 188 G LY + V+R + A+ G W QGE D Sbjct: 124 RGG------------------DLYSNAVARAKVAVK---DGALAGVLWHQGENDSSDKGL 162 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 S+ + + M+ FR D+ Q T+ P G + E P Sbjct: 163 AESYGKRLSEMIHDFRTDVGQ--------TNLPVVVGQIGEFLYERGP 202 >UniRef50_A8FC47 Possible acetylxylan esterase n=19 Tax=Bacteria RepID=A8FC47_BACP2 Length = 276 Score = 90.7 bits (223), Expect = 7e-17, Method: Composition-based stats. 
Identities = 42/249 (16%), Positives = 64/249 (25%), Gaps = 79/249 (31%) Query: 13 LTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDV 72 + GQSN G +P + RI L R P HF+ Sbjct: 4 FLLIGQSNMAGRGFKHEVPPIYNE---RIMML-RNGRWQMMTEPIHFD------------ 47 Query: 73 QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGT 132 VG A A + I ++PC GGS+ Sbjct: 48 ---------------RPVAGVGLAASFAETWCKD-HEGEKIGLIPCAEGGSSID------ 85 Query: 133 YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASH 192 W D L++ +S A ++ G W QGE D Y + Sbjct: 86 ------------EWSRDGALFRHAISEATFAKE---NSELAGILWHQGESDSQDGKYKEY 130 Query: 193 PQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW------------------KEN 234 + + R +L + P G + + Sbjct: 131 DEKIRRLFHEIRTELSVPN--------IPLVIGGLGDFLGKVAFGAGCVEYQLINEELQK 182 Query: 235 FPHSYEAIY 243 + H +E Y Sbjct: 183 YAHRHENCY 191 >UniRef50_C5WRX2 Putative uncharacterized protein Sb01g000530 n=2 Tax=Sorghum bicolor RepID=C5WRX2_SORBI Length = 278 Score = 90.7 bits (223), Expect = 7e-17, Method: Composition-based stats. Identities = 55/222 (24%), Positives = 74/222 (33%), Gaps = 66/222 (29%) Query: 12 VLTVAGQSNAMAYG-------EGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIP 64 V +AGQSN G +G+ PD AP PRI +L+ P H Sbjct: 32 VFLLAGQSNMGGRGGATNGTWDGVVPPDC--APSPRILRLSPSLRWEEAREPLH------ 83 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLL---PFIPDNAGILIVPCCRG 121 H+V VG + A LL +P +A + +VPC +G Sbjct: 84 AGIDLHNVLG------------------VGPGMPFAHALLRRHGRVPPHAVVGLVPCAQG 125 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNP-----------QN 170 + + S G TPLY ++ R RAALA N + Sbjct: 126 ATPIASWSRG------------------TPLYDRMLKRARAALANNNNNNNNNNNNAGSS 167 Query: 171 KFLGACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYH 211 + W QGE D D + V RRDL Sbjct: 168 RLAALLWYQGEADTIRRQDADVYTSRMEAFVRDVRRDLGMPD 209 >UniRef50_A9RQK4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RQK4_PHYPA Length = 263 Score = 85.7 bits (210), Expect = 2e-15, Method: Composition-based stats. 
Identities = 53/272 (19%), Positives = 82/272 (30%), Gaps = 39/272 (14%) Query: 9 YYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 + + ++GQSN G G+ +D R P I+ L Sbjct: 7 GFEIFILSGQSNMSGRG-GMQTIVAKDGSTSRKW-----DGIVPAECAAEPGSILRLNKN 60 Query: 69 PHDVQDMQGYHHPLATNHQTQYGT-VGQALHIARKLL-----PFIPDNAGILIVPCCRGG 122 + + H P + T VG L A LL P I +VPC GG Sbjct: 61 L----EWEEAHEPTHIDIDTSKACGVGPGLVFAASLLRARKYKVKPTGPQIGLVPCAIGG 116 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 ++ +G LY ++ RT+AAL K W QGE Sbjct: 117 TSIVQWEKG------------------RVLYNHMIQRTKAALEKG--GTLKALLWYQGES 156 Query: 183 DL-MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 D S + Q R DL ++ + + W Y + A Sbjct: 157 DAVEKSLADHYEQRLVTFFNHVRTDLNNHNLPIIQVA-INWPAAPHPEY-VNKVRSAQRA 214 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDE 273 + ++ L + + + T A E Sbjct: 215 ALDHVKHLHLVDALGLPLLSDHIHLTTEAQTE 246 >UniRef50_B0BZZ0 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0BZZ0_ACAM1 Length = 302 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. 
Identities = 47/218 (21%), Positives = 74/218 (33%), Gaps = 52/218 (23%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 + +AGQSN G PL HP++ H P Sbjct: 59 LYVLAGQSNMTGRG---PLDAESSKTHPQVFVFGNDYRWHLAKDPL-------------- 101 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 + G P++ + + VG + A LL +A I ++PC RGGS Sbjct: 102 -DSIDGQVDPVS--QEGKAPGVGPGMTFASALLKH-DKDAVIGLIPCARGGSTIQEWQ-- 155 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY-- 189 R ++ LY + R RAA + + G + QGE D + Sbjct: 156 -------------RNLSENSLYGSCLKRLRAA---SLMGQLEGMLFFQGEADALDQKQFS 199 Query: 190 ------ASHPQHFNHMVEAFRRDLKQYH-----SQLNN 216 + F +E+FR D KQ + +Q+ + Sbjct: 200 HLSLSPQQWSKKFEKFIESFRLDTKQENLPIVFAQIGS 237 >UniRef50_Q8L9J9 Probable carbohydrate esterase At4g34215 n=10 Tax=Magnoliophyta RepID=CAES_ARATH Length = 260 Score = 81.5 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 72/224 (32%), Gaps = 59/224 (26%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLP-----------LPDREDAPHPRIKQLARFAH 49 + + I P+ + ++GQSN G + E AP+ I +L+ Sbjct: 15 IQSPIPPN--QIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72 Query: 50 THPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIP- 108 P H + VG + A + + Sbjct: 73 WEEAHEPLH------------------------VDIDTGKVCGVGPGMAFANAVKNRLET 108 Query: 109 DNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNP 168 D+A I +VPC GG+A G + LY+ +V RT + + Sbjct: 109 DSAVIGLVPCASGGTAIKEWERG------------------SHLYERMVKRTEES--RKC 148 Query: 169 QNKFLGACWMQGEFD-LMTSDYASHPQHFNHMVEAFRRDLKQYH 211 + W QGE D L D S+ + + +++ R DL Sbjct: 149 GGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPS 192 >UniRef50_C0ACS7 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0ACS7_9BACT Length = 520 Score = 81.1 bits (198), Expect = 5e-14, Method: Composition-based stats. 
Identities = 46/232 (19%), Positives = 70/232 (30%), Gaps = 35/232 (15%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 V +AGQSN G L PHP I+ + P H +P Sbjct: 139 VWLLAGQSNMEGGG---LLAASVARPHPFIRAFSLARVWRQAADPLH----VPWESQEAA 191 Query: 72 VQDMQGYHHP-LATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + D + + +T G LH R++L + ++ RG + Sbjct: 192 LNDGKPFTREQAEDYRRTSRVGAGVGLHFGREML--LRSGVPQGLICAARGATRMEQWLP 249 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 R G LY ++ RA G W QGE D A Sbjct: 250 A-----------RGRDG-GAGLYGAMLRSVRATGQP-----VAGVLWHQGEGDSPRERAA 292 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI 242 + Q ++ A RRDL PW + E ++ ++ Sbjct: 293 LYSQRMRKLIAAVRRDLGLPR--------LPWIFAQLARVYGERPDCAWNSV 336 >UniRef50_Q84M79 Os03g0857600 protein n=4 Tax=Poaceae RepID=Q84M79_ORYSJ Length = 266 Score = 78.8 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 49/208 (23%), Positives = 66/208 (31%), Gaps = 51/208 (24%) Query: 12 VLTVAGQSNAMAYGEGLPLP-----DREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 + + GQSN G P E AP PRI +L+ P Sbjct: 32 IFLLGGQSNMGGRGGATNGPWDGVVPPECAPSPRILRLSPELRWEEAREPL--------- 82 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 H DV ++ G VG + A L IP + I +VPC +GG+ Sbjct: 83 HAGIDVHNVLG---------------VGPGMSFAHALFRAIPPSTVIGLVPCAQGGTPIA 127 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSR---TRAALAKNPQNKFLGACWMQGEFD 183 + G T LY+ +V R A + W QGE D Sbjct: 128 NWTRG------------------TELYERMVGRGRAAMATAGAGAGARMGALLWYQGEAD 169 Query: 184 L-MTSDYASHPQHFNHMVEAFRRDLKQY 210 D + + MV RRDL Sbjct: 170 TIRREDAEVYARKMEGMVRDVRRDLALP 197 >UniRef50_A6CD12 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A6CD12_9PLAN Length = 331 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. 
Identities = 44/227 (19%), Positives = 74/227 (32%), Gaps = 24/227 (10%) Query: 10 YYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 Y V + GQSN YG LPD P + A++ P P + Sbjct: 50 YQVYFLGGQSNMDGYGYAKDLPDDLKQSVPGVMIF--HANSAPDAVP------VDGRGLW 101 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 +++ G T G L A+ L P+ A I ++ RGG++ + Sbjct: 102 SELKPGHGVGFKSDGKENTYSNRFGVELSFAKTLQQLAPE-ANIALIKISRGGTSIAVEA 160 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK--------FLGACWMQGE 181 G + G Y ++ + AL ++ G WMQGE Sbjct: 161 AGNFGCWDPDFEKGTGKGQGINQYDHFLAGMKRALQTTDIDQDGEADTLIPAGIVWMQGE 220 Query: 182 FDL--MTSDYASHPQHFNHMVEAFRR-----DLKQYHSQLNNITDAP 221 D + + +++ R DL ++++ D P Sbjct: 221 SDAAYTEEIAKDYEANLKRLMDLIRATLYADDLPVVIGRISDSGDNP 267 >UniRef50_Q7XSV9 OSJNBa0027H06.16 protein n=5 Tax=Poaceae RepID=Q7XSV9_ORYSJ Length = 282 Score = 78.4 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 67/218 (30%), Gaps = 50/218 (22%) Query: 1 MNAIISPDYYY------VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGG 54 ++A +SP Y + +AGQSN G PHPR+ +LA P Sbjct: 33 LSAFLSPSSPYAHRPKLLFLLAGQSNMAGRGALARPLPPPYLPHPRLLRLAASRRWVPAA 92 Query: 55 PPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGIL 114 PP H A + +G A+ A +LL + Sbjct: 93 PPLH------------------------ADIDTHKTCGLGPAMPFAHRLLLQTDSEEVLG 128 Query: 115 IVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLG 174 +VPC GG+ + G PLY+ A + Sbjct: 129 LVPCAVGGTRIWMWARG------------------QPLYE-AAVARARAAVADGGGAIGA 169 Query: 175 ACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYH 211 W QGE D D S+ +V R DL + Sbjct: 170 VLWFQGESDTIELDDARSYGGKMERLVADLRADLHLPN 207 >UniRef50_B8I0M1 Carbohydrate binding family 6 n=6 Tax=Clostridium RepID=B8I0M1_CLOCE Length = 780 Score = 78.0 bits (190), Expect = 4e-13, Method: Composition-based stats. 
Identities = 39/229 (17%), Positives = 68/229 (29%), Gaps = 54/229 (23%) Query: 3 AIISP--DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 P ++ + GQSN Y + PR+ L + G ++ Sbjct: 540 GTTEPTTPKFHCFLLLGQSNMAGYAAAQA---SDKVEDPRVLVLGYDNNAALGRVTDKWD 596 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 P H + VG + ++ +P I ++PC Sbjct: 597 VACPPLHA-------------------SWLDAVGPGDWFGKTMIQKVPSGDTIGLIPCAI 637 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 G + GT Y +++R + A K G + QG Sbjct: 638 SGEKIETFMKSG--------------GTK---YNWIINRAKLAQEKG--GVIDGIIFHQG 678 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 E + S P +VE R+DL N+ + P+ G+ + Sbjct: 679 ESNSG---DPSWPGKVKTLVEDLRKDL--------NLGNVPFIAGELLY 716 >UniRef50_C7J1I1 Os04g0110400 protein n=2 Tax=Poaceae RepID=C7J1I1_ORYSJ Length = 252 Score = 78.0 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 67/218 (30%), Gaps = 50/218 (22%) Query: 1 MNAIISPDYYY------VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGG 54 ++A +SP Y + +AGQSN G PHPR+ +LA P Sbjct: 36 LSAFLSPSSPYAHRPKLLFLLAGQSNMAGRGALARPLPPPYLPHPRLLRLAASRRWVPAA 95 Query: 55 PPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGIL 114 PP H A + +G A+ A +LL + Sbjct: 96 PPLH------------------------ADIDTHKTCGLGPAMPFAHRLLLQTDSEEVLG 131 Query: 115 IVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLG 174 +VPC GG+ + G PLY+ A + Sbjct: 132 LVPCAVGGTRIWMWARG------------------QPLYE-AAVARARAAVADGGGAIGA 172 Query: 175 ACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYH 211 W QGE D D S+ +V R DL + Sbjct: 173 VLWFQGESDTIELDDARSYGGKMERLVADLRADLHLPN 210 >UniRef50_Q9LF91 Putative uncharacterized protein F8J2_180 n=1 Tax=Arabidopsis thaliana RepID=Q9LF91_ARATH Length = 169 Score = 75.3 bits (183), Expect = 2e-12, Method: Composition-based stats. 
Identities = 31/177 (17%), Positives = 55/177 (31%), Gaps = 46/177 (25%) Query: 36 APHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQ 95 +P I +L P H + I + VG Sbjct: 27 RSNPSILRLTSKLEWKEAKEPLHVDIDI------------------------NKTNGVGP 62 Query: 96 ALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQD 155 + A +++ + +VPC GG+ + +G + LY++ Sbjct: 63 GMPFANRVVN---RFGQVGLVPCSIGGTKLSQWQKGEF------------------LYEE 101 Query: 156 LVSRTRAALAKNPQNKFLGACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYH 211 V R +AA+A + W QGE D D + + + R DL+ + Sbjct: 102 TVKRAKAAMASGGGGSYRAVLWYQGESDTVDMVDASVYKKRLVKFFSDLRNDLQHPN 158 >UniRef50_C0A5B6 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A5B6_9BACT Length = 646 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 46/214 (21%), Positives = 63/214 (29%), Gaps = 34/214 (15%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 V +AGQSN G G PHP I+ P H P C +D Sbjct: 111 VWLLAGQSNM--EGCGFMDSPHCARPHPLIRAFTMAREWRQAADPLHIRWESP-DSCHND 167 Query: 72 VQDMQGYHHPLATNHQ-TQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 G Q + G + + +V GG++ + Sbjct: 168 -----GATWDRTRAEQHRRTALRGAGVGLPFAHEMLARSGVPQALVCTAHGGTSMEQWNP 222 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 ++ G D LY ++ RA G W QGE D A Sbjct: 223 --LHKKLG----------DGSLYGSMLLSMRATGQPC-----AGVLWYQGESDTAAPLAA 265 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFC 224 + +V A RRDL+Q D PW Sbjct: 266 IYTDRMKKLVAATRRDLRQP--------DLPWII 291 >UniRef50_A9GML8 Iduronate-2-sulfatase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GML8_SORC5 Length = 453 Score = 71.1 bits (172), Expect = 6e-11, Method: Composition-based stats. 
Identities = 72/334 (21%), Positives = 107/334 (32%), Gaps = 68/334 (20%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +AGQSN + YG G LP E P + GGP + P Sbjct: 25 KVFVLAGQSNMVGYGVGRQLP-VELQSQPDVWYDHYNPDAREGGP---YAAATSADWGPL 80 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF-TAGS 129 + + + G + R + P++ I IV +GG+ Sbjct: 81 E--------------PKGEARRYGPEITFGRAIAAAYPEH-RIAIVKMAQGGTNLVDHWG 125 Query: 130 EGT------------YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQN----KFL 173 G Y G A G Y + V+R ALA+ + Sbjct: 126 RGLAPDPEVLYKSQLYHALLGKLDSATYEGDRALRYPEEVTRLDGALARLESEGHPYEIA 185 Query: 174 GACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 WMQGE + S S+ + A R DL P G + Sbjct: 186 ALVWMQGENEAGWSAAFSYGNTLRGFIAAIRADLGVP--------GLPVVLGRVSD---N 234 Query: 234 NFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENW 293 +P + I ANI V Q +EDP A+ +++ Sbjct: 235 LYPANGGPIAAG----KEANIDAVRAAQ------VTVAEEDPRV--------AWVDTDDF 276 Query: 294 T--TALRSSHFSTAARRGIISDRFVEAILQFWRE 325 T + + HF +AA + ++ +RF EA L RE Sbjct: 277 TVRSPDDAYHFDSAAYQ-LLGERFAEAYLALVRE 309 >UniRef50_Q9F106 Carbohydrate binding family 6 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=Q9F106_FIBSS Length = 539 Score = 70.7 bits (171), Expect = 7e-11, Method: Composition-based stats. 
Identities = 44/234 (18%), Positives = 74/234 (31%), Gaps = 42/234 (17%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 NA P++ ++ GQSN G D + HPR+K A Sbjct: 29 NAAPDPNF-HIYIAYGQSNM--EGNARNFTDVDKKEHPRVKMFA---------------- 69 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPD---NAGILIVPC 118 T CP + G +P G+ L +A + D N I I+P Sbjct: 70 ---TTSCPSLGRPTVGEMYPAV----PPMFKCGEGLSVADWFGRHMADSLPNVTIGIIPV 122 Query: 119 CRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQ-DLVSR-TRAALAKNPQNKFLGAC 176 +GG++ Y ++ + G + + R A + G Sbjct: 123 AQGGTSIRLFDPDDYKNYLNSAESWLKNGAKAYGDDGNAMGRIIEVAKKAQEKGVIKGII 182 Query: 177 WMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLN-NITDAPWFCGDTTW 229 + QGE D S++ ++ + QL N + P+ G+ Sbjct: 183 FHQGETDGGMSNWEQI----------VKKTYEYMLKQLGLNAEETPFVAGEMVD 226 >UniRef50_Q8A041 Acetyl xylan esterase A n=8 Tax=Bacteroides RepID=Q8A041_BACTN Length = 267 Score = 70.3 bits (170), Expect = 8e-11, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 62/234 (26%), Gaps = 57/234 (24%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLP-LPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 A + GQSN G+ P + D + L P P Sbjct: 24 AEKPLKTLDLYLCIGQSNMAGRGKLSPEVMDTL----QNVYLLNADDQFEPAVNPL---- 75 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + + VG A A+ + + ++ RG Sbjct: 76 -----------------NRYSTIGKGLSWQQVGPAYGFAKTM---ATKKHPVGLIVNARG 115 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 GS+ + + ++ G Y + + R + A+ W QGE Sbjct: 116 GSSIRSWVKNA--KQSGGY------------YDEAIRRAKEAMK---YGTLKAIIWHQGE 158 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT-WYWKEN 234 D + + + ++ R DL D P G W W + Sbjct: 159 ADCHHPEA--YKEKIIQLMTDLRNDLGMP--------DLPVVVGQIAQWNWTKK 202 >UniRef50_Q07GI7 Conserved domain protein n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GI7_ROSDO Length = 617 Score = 70.3 bits (170), Expect = 9e-11, Method: Composition-based stats. 
Identities = 50/228 (21%), Positives = 73/228 (32%), Gaps = 48/228 (21%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGE---GLPLPDREDAPHPRIKQLARFAHTHPGGPPCHF 59 A P +V + GQSN + G PD G Sbjct: 58 AAAQPRETHVFALMGQSNMIGRAAFDGGAKWPD---------------GTLQIGRGGDED 102 Query: 60 NDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCC 119 IIP + D PLA G +G + A L PD +L +PC Sbjct: 103 GAIIPA----RNPADGPATSRPLAHTGAR-LGNMGLDIQFAIDYLSDKPD-VTLLFIPCA 156 Query: 120 RGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQ 179 +G + F+ G+ W LY +R AA+ NP+ F G W Q Sbjct: 157 QGATGFSNGA----------------WNPGDWLYNRETARINAAMNANPEFLFQGFLWHQ 200 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 GE D ++ ++++ RRD+ P+ G Sbjct: 201 GETDTGIPG--TYGGLLDNLIAGLRRDVTA------ATPTTPFILGGL 240 >UniRef50_C7NVN3 Carbohydrate-binding family V/XII n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NVN3_HALUD Length = 523 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 45/228 (19%), Positives = 75/228 (32%), Gaps = 41/228 (17%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 P + + GQSN G P+ ++ HPRI LA Sbjct: 61 TTATDPSNLDLYLLFGQSNMEGQG---PIEAQDRETHPRIHVLADKT------------- 104 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 CP+ ++ G + YG +G + A+ ++ +PD+ I +VP Sbjct: 105 ------CPNLDRE-YGEWYLAEPPLNRCYGKLGPGDYFAKSMIEEMPDDRSIGLVPAAVS 157 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 G+ +G R+ + G Y+ +V A F G + QGE Sbjct: 158 GADIALFEKGAPIGRNDRDIPSQFDGG----YEWMVDLAETAQQ---VGTFRGILFHQGE 210 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 + +VE R DL I + P+ G+ + Sbjct: 211 TNTN---DQQWTDQVQGIVEDLRADLG--------IGNVPFLAGEMLY 247 >UniRef50_A6C656 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C656_9PLAN Length = 667 Score = 66.1 bits (159), Expect = 2e-09, Method: Composition-based stats. 
Identities = 41/208 (19%), Positives = 68/208 (32%), Gaps = 49/208 (23%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 + P + +AGQSN ++ G LP++ P + P Sbjct: 16 SAKEPAIDKLFLLAGQSNMVSQGTLAELPEQLQQPPTNVY-FWSNGTWIP---------- 64 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 YH+ +A + G L IA +L PD I ++ +GG Sbjct: 65 ---------------YHNKVAYVKPGKE--FGPELAIAHELSRAFPDE-KIGLIKHAKGG 106 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 +A PL + L + A K + WMQGE Sbjct: 107 TAIRLWQP------------------RMPLVRGLFQKLDDA-QKAGGGEVAALFWMQGER 147 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQY 210 D + A + + F ++++A R+ Q Sbjct: 148 DARFHEPA-YAKKFQNLIQAVRQKSDQP 174 >UniRef50_B5JIZ7 Conserved domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JIZ7_9BACT Length = 296 Score = 64.9 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 73/237 (30%), Gaps = 35/237 (14%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 A + Y V + GQSN + +G LP D RI ++ F+ G ++ Sbjct: 10 ATANAKTYKVYFLGGQSNMVGFGHENELPGDLDR---RIYEVPIFS-----GSSKMDENL 61 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 + G G + A +LL P+ I I+ GG Sbjct: 62 AGGDGKWTTLGIGFGLGSDFVDGQYVLSDRFGPEITFADELLKIAPEE-NIAIIKYAWGG 120 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQN--------KFLG 174 +A G G +G+ R Y + R ALA + G Sbjct: 121 TALLDGVSG-----YGSWDPKVR---KLNQYDYFLKTVRKALAARDIDNDGEHDLLVPAG 172 Query: 175 ACWMQGEFDL--MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 WMQGE D + ++ ++ +++ R P G T Sbjct: 173 IIWMQGEADAFESQAASQAYQENLANLMSLMRAAFHD--------NSLPIVIGRITD 221 >UniRef50_B0RC94 Putative surface-anchored protein n=1 Tax=Clavibacter michiganensis subsp. sepedonicus RepID=B0RC94_CLAMS Length = 654 Score = 64.9 bits (156), Expect = 3e-09, Method: Composition-based stats. 
Identities = 52/301 (17%), Positives = 90/301 (29%), Gaps = 32/301 (10%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 + + V+ + GQSNA G G D + QL + Sbjct: 375 VAGAQDGVGHDVVAILGQSNAQGGGFGYD--PAIDVAQDGLDQLV--GDWQDK----DWG 426 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 ++P V + P VG + R LL +L+VP + Sbjct: 427 RVVPAEDSLKHVTTWRMTDRPKL---------VGPGMTFGRALLADSAPGRRVLLVPAAQ 477 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 G ++ T + T LY + ++ ALA +P N+ + W QG Sbjct: 478 GSTSLTRVDAVQKFTWDPSPEQGSVEAGLTNLYANATTQIDNALALDPDNRLVAIIWAQG 537 Query: 181 EFDLMTSDYA-SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS- 238 E D A + V+ R L+ P+ G W + Sbjct: 538 ESDANAISSAPTAAGRVAAKVKYADRLLELESGLAVRYGPVPFLVGGMVPEWIGSDAARQ 597 Query: 239 -YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE-------DPDDLSTGYYGSAYRSP 290 +A++ ++ + +V G G N + + G+Y + R Sbjct: 598 DIDAVHQGLRSLRKE-VAYVP----GVSGHANEGEAFIHYDAVGARMMGAGFYAAYLRQT 652 Query: 291 E 291 Sbjct: 653 G 653 >UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA8_RHOBA Length = 745 Score = 64.9 bits (156), Expect = 4e-09, Method: Composition-based stats. 
Identities = 41/227 (18%), Positives = 71/227 (31%), Gaps = 41/227 (18%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 D++ V +AGQSN G+ L + + + + G F +P Sbjct: 42 DHHDVYLLAGQSNMDGRGQVSDLSEEQ-----------KQST----GDAIIFYRSVPRES 86 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 G+ P T G + AR + P N + ++ +GG++ A Sbjct: 87 DGWQTL-APGFSVPPKYKGDLPSPTFGPEIGFARSMSNANP-NQKLALIKGSKGGTSLRA 144 Query: 128 -GSEGTYSERHGASHDACRWGTDTPLYQDLVSR----TRAALAKNPQNKFLGACWMQGEF 182 G + P Y+D + T+ + Q G W QGE Sbjct: 145 DWKPGVQGDPKSQG----------PRYRDFIETIRMATKQLSDRGDQFTIRGLLWHQGES 194 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 D +S + + ++ R D+ D P G+ Sbjct: 195 DSKSSTER-YRRRLEELIVRIREDVGVP--------DLPVVVGEVFD 232 >UniRef50_D2R5Y4 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Y4_9PLAN Length = 319 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 72/221 (32%), Gaps = 29/221 (13%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 ++ + + V +AGQSN +G+ P + P Sbjct: 26 SLTAAETLKVFVLAGQSNMQGHGKVKADPKANGGQGSLEWLVKE----SPKKADFKHLVT 81 Query: 63 IPLTHCPHDVQDM--QGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 D + G L + +G L + I + +L+V Sbjct: 82 DSGDWVSRDDVQIWYLGRQGKLTAGYGASEEMMGPELGFGHVVGNAIDEP--VLLVKLAW 139 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQ---------NK 171 GG + G + GT P YQ+++++T+ L P+ + Sbjct: 140 GGKSL--GQD---------FRPPSSGGTVGPYYQEIITQTKTVLKDLPKLFPEYASHQAE 188 Query: 172 FLGACWMQGEFD-LMTSDYASHPQHFNHMVEAFRRDLKQYH 211 +G W QG D + + + ++ ++V R+DL + Sbjct: 189 LVGFGWHQGWNDRINQAFNDEYEKNLANLVRDLRKDLSAPN 229 >UniRef50_A3K171 Probable acetyl xylan esterase AxeA n=2 Tax=Bacteria RepID=A3K171_9RHOB Length = 920 Score = 58.4 bits (139), Expect = 4e-07, Method: Composition-based stats. 
Identities = 46/218 (21%), Positives = 67/218 (30%), Gaps = 49/218 (22%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGG--PPCHFNDIIPLTHCP 69 VL + GQSN + G HP A + G C L H Sbjct: 681 VLPLIGQSNMVGQGVFDGGAG-----HP-----ATYKQWLQAGSLAACT----AHLDHLS 726 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 D +M G ++ A P NA ++ VPC G+ + Sbjct: 727 SDAGEM------------------GLSVQFAIDFAAEFP-NAQLIFVPCAVSGTGY---- 763 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 + G RWG LY V RT + + PQ G GE D S Sbjct: 764 --GFGNLQGGVE-LGRWGIGDDLYLAAVRRTDEVMRRYPQCVLGGILHHDGEDDAENST- 819 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 A+ + + +R + +Q P+ G+ Sbjct: 820 ANFADLLDEAIGGYRTSIVGASAQ------TPFVVGEI 851 >UniRef50_C5BRL9 Acetylxylan esterase / xylanase n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BRL9_TERTT Length = 952 Score = 55.3 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 70/229 (30%), Gaps = 39/229 (17%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 +++ + GQSN G+ + ++ + + + GG Sbjct: 38 PDPNFHIYLMFGQSNMEGQGQ---ISSQDQQVPTGLLAMQADNNCTVGGAS--------- 85 Query: 66 THCPHDVQDMQGYHHPLATNHQTQY----GTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + + PL + T + G +G + R +L + +V Sbjct: 86 ------YGEWRTATPPLIRCYNTAHAWNNGGLGPGDYFGRTMLENSGAGVRVGLVGAAYQ 139 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGT---DTPLYQDLVSRTRAALAKNPQNKFLGACWM 178 G + + G+ + G+ Y ++ R A G + Sbjct: 140 GQSINFFRKNC--AALGSCQPSGANGSVPGGAGGYAWMLDLARKAQE---DGVIKGIIFH 194 Query: 179 QGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 QGE D + ++ N +V R DL + ++ P+ G+ Sbjct: 195 QGESD---TGSSTWSSRVNEVVTDLRTDLGL------SASEVPFIAGEM 234 >UniRef50_C6VVT7 Putative uncharacterized protein n=2 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VVT7_DYAFD Length = 618 Score = 54.1 bits (128), Expect = 6e-06, Method: Composition-based stats. 
Identities = 47/222 (21%), Positives = 65/222 (29%), Gaps = 38/222 (17%) Query: 12 VLTVAGQSNAMA-YGEGLPLPDREDAPHPRIKQLARFAHTHPGGP-PCHFNDIIPLTHCP 69 V +AGQSNA + LP P + P F ++ PL Sbjct: 133 VFIIAGQSNAQGIKDQSYKLPSGAGIPE---WVVGASEDKTCTRKLPESFTNLFPL---- 185 Query: 70 HDVQDMQGYHHPL--ATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + D H PL N YG +G+ + A +P NA GS+ T Sbjct: 186 -NTADDMKKHGPLGPTGNSVWAYGVLGKLISDANGGMPVAFFNA-------ATAGSSVTE 237 Query: 128 GSEGT-----YSERHGASHDACRWGTDTPLYQDLVSRTRAALAK--NPQNKFLGA---CW 177 +G GA G +D + AL N G W Sbjct: 238 WKQGADGVEAKHPYTGAQVCLGYMGGSVIP-KDYYGQPYTALKTALNYYGSLYGVRAVLW 296 Query: 178 MQGEFDLMT--------SDYASHPQHFNHMVEAFRRDLKQYH 211 QGE D S A + ++ R D + Sbjct: 297 HQGEADADPNVNAIYKASSAADYQSKLQAVIAKSRSDFAAPN 338 >UniRef50_A6DRT7 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DRT7_9BACT Length = 252 Score = 54.1 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 110 NAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTP-------LYQDLVSRTRA 162 +LIV GG + D T P +Y+ ++ + Sbjct: 52 KENVLIVKEAIGGRPIRMWVH-DWKAAPYWKIDPNIPNTKNPQPKENGVMYKSMMKKITK 110 Query: 163 ALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 A + K + CWMQGE D A + + + + D + T + Sbjct: 111 ATQ-GKKPKAIAFCWMQGERDSRERHSAVYERSLKALFSQIKADFPE--------TPIVF 161 Query: 223 FCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGY 282 G + + K+N +A+Y ++ + A ++ + E D L+TG Sbjct: 162 VIGKLSDFGKDNK----QALYPEWEEIIAAQ------KKVAKDTPNCKIIETHD-LNTGD 210 Query: 283 YGSAYRSPENWTTALRSSHFSTAARRGIISDRFVEAILQFWRER 326 +++ E H + + I+ RF EA ++ +++ Sbjct: 211 SPPHWKTKEIRKYVD-DLHMTNEGYK-ILGTRFAEAAIELLKKQ 252 >UniRef50_C1ZKK3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKK3_PLALI Length = 1077 Score = 53.7 bits (127), Expect = 8e-06, Method: Composition-based stats. 
Identities = 43/255 (16%), Positives = 78/255 (30%), Gaps = 47/255 (18%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYG--------------EGLPLPDREDAPHPRIKQL-A 45 + A V +AGQSN +G L + ++K+L Sbjct: 782 IAANSEAKPLKVFILAGQSNMEGHGVVSMDGKRDYNGGKGNLVWSMKHSQSAEKLKRLKN 841 Query: 46 RFAHTHPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLP 105 D + ++ D G + +G L + Sbjct: 842 EKGEWV-------IRDDVQISFKVDDKVRKGGLT--IGYTGYGGSSHIGPELGFGFVMGD 892 Query: 106 FIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALA 165 ++ + L++ GG + G P Y +V RAALA Sbjct: 893 YLDEPV--LLIKTAWGGKSLFV-----------DFRPPSSGGQVGPYYTKMVEEVRAALA 939 Query: 166 K--NPQNKFLGACWMQGEFDLMTSDY-ASHPQHFNHMVEAFRRDLKQYH-----SQLNNI 217 + + + + G W QG D+ A + Q+ ++V+ R++ + +L N Sbjct: 940 ELGDQKYEIAGFVWQQGWNDMCEKPAIAEYAQNLVNLVKDLRKEFDSPNLPVVVGELGNG 999 Query: 218 TDAPWFCGDTTWYWK 232 P GD + K Sbjct: 1000 G--PVTSGDMFEFRK 1012 >UniRef50_C9RS24 Carbohydrate binding family 6 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RS24_FIBSS Length = 552 Score = 53.3 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 61/203 (30%), Gaps = 21/203 (10%) Query: 4 IISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDII 63 +++ GQSN G+ +P D+ +AP I LA N Sbjct: 29 AAPNPNFHIYIAYGQSNMAGNGDIVPSEDQAEAPKNFI-MLA------------SHNANA 75 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 + G +P + + A + R + +P + I+P G Sbjct: 76 SQRSGKTNQSIKTGEWYPAIPPMFHPFENLSPADYFGRAMADSLP-GVTVGIIPVAIGAV 134 Query: 124 AFTAGSEGTYSER-HGASHDACRWGTDTPLYQDLVSR-TRAALAKNPQNKFLGACWMQGE 181 + A + Y G D WG + R A G + QGE Sbjct: 135 SIRAFDKDQYEAYFRGDGKDIMNWGWPKDYDNNPPGRILELAKKAKEVGVIKGFIFHQGE 194 Query: 182 FDLMTSD-----YASHPQHFNHM 199 D ++ Y ++ + + Sbjct: 195 SDGTDANWRKTVYKTYKDVIDAL 217 >UniRef50_B4DB21 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DB21_9BACT Length = 384 Score = 53.3 bits (126), Expect = 1e-05, Method: Composition-based stats. 
Identities = 44/232 (18%), Positives = 69/232 (29%), Gaps = 62/232 (26%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 V VAGQSN+ YGE +++ R+ L P Sbjct: 126 VFVVAGQSNSANYGE-----EKQTTQTGRVTALDGRG-WQLANDP--------------- 164 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 P A + + L A + +P I V C GG++ Sbjct: 165 --------QPGAAGSRGSFM---PPLGDALEERFHVP----IGFVACGVGGTSVREWLPQ 209 Query: 132 TYS-------ERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL 184 E W + LY L++ +A P F W QGE D Sbjct: 210 GVVFPNPPTVESRVVRLAGGTWESKGQLYAKLLASMKAV---GPHG-FRAVLWHQGESDA 265 Query: 185 M------TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 T + ++ ++ RR++ DAPWF +++ Sbjct: 266 NQQDTSRTLPGKLYREYLEKIIRESRREVG---------WDAPWFVAQASYH 308 >UniRef50_UPI00016C0614 hypothetical protein Epulo_09645 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0614 Length = 246 Score = 52.2 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 65/223 (29%), Gaps = 58/223 (26%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 ++ AGQSN G G + D AP + + ND + Sbjct: 12 IIIAAGQSNXEGAGIGX-VEDPY-APKNNVWSM---------------NDFVIAKATERI 54 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLP--FIPDNAGILIVPCCRGGSAFTAGS 129 + LH A + L + + ILI+ +GG+ F Sbjct: 55 KGNTLRGRFV---------------LHFAXEYLNAGLLDADREILIISAAQGGTGFAT-- 97 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 W L ++ T+ AL N +NK + W QGE ++ Sbjct: 98 --------------HEWNRGDALAVRMLEMTKTALELNTENKIVAXLWHQGEREV----- 138 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWK 232 SH + R L + + P D WK Sbjct: 139 -SHNMTAEAHLNNVRILLGDLQAAFGK--NFPMITADLVPIWK 178 >UniRef50_C9RKV3 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RKV3_FIBSS Length = 409 Score = 52.2 bits (123), Expect = 3e-05, Method: Composition-based stats. 
Identities = 32/237 (13%), Positives = 58/237 (24%), Gaps = 51/237 (21%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLAR-------FAHTHPG 53 M +++ GQSN G+ D + Sbjct: 18 MANAAPDPNFHIYLAFGQSNMEGQGDVGSQDKTVDERFQVLWAANNGFCSGKTKGKWATA 77 Query: 54 GPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGI 113 PP HC Q +G + R ++ + Sbjct: 78 VPPL--------AHC--------------------QGAKLGPTDYFGRTMVEKTDSKIKV 109 Query: 114 LIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTP---LYQDLVSRTRAALAKNPQN 170 ++ G + + Y+ + + Y L+ + A Sbjct: 110 GVIVVAVAGCSIQLFDKDGYANYARSQQSWMTQRINEYGGNPYGRLIEMAKKAQE---DG 166 Query: 171 KFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 G + QGE D + P + + +DL L N D P+ G+ Sbjct: 167 VIKGIIFHQGETD---AGDGQWPSKVKKVYDNIIKDLG-----LGN--DVPFLAGEV 213 >UniRef50_B2UM46 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UM46_AKKM8 Length = 303 Score = 51.8 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 50/325 (15%), Positives = 95/325 (29%), Gaps = 69/325 (21%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 V+ + GQSNA G +P RI +++ + T Sbjct: 37 VILIGGQSNATGQGYVNNIPPCF-KTDKRIL--------------LYYSGSLKGTEPAEQ 81 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 + PL+ ++ G L + L P +I G + F + G Sbjct: 82 LV-------PLSPASESP-DRFGVELSLGTALQKKFPQKKWAIIKHARSGSNLFRQWNPG 133 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQN----KFLGACWMQGEFDL--- 184 S+ Y L+ R + + W QGE D Sbjct: 134 KTSQDKQGEE-----------YVKLLRTVRNGMEALKKQGHAPVLKAMVWQQGEGDARDI 182 Query: 185 -MTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHSYEA 241 + S+ + N++++ R DL+ ++ P FP E Sbjct: 183 AGIKNALSYGANLNNLIKRIRADLEAPGLAFIYGSVLPVPALA---------RFPGR-EK 232 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 + ++ + + P +D S + R+P T +H Sbjct: 233 VRQGQKDVAEESRTSL-----SVNNAVYVPADDLQLRSMDF-----RTPYPTDTVHLGTH 282 Query: 302 FSTAARRGIISDRFVEAILQFWRER 326 ++ +RF A+ + W ++ Sbjct: 283 
GVL-----VLGERFASALEKLWGQK 302 >UniRef50_C3QEP9 Glycoside hydrolase family 43 n=6 Tax=root RepID=C3QEP9_9BACE Length = 638 Score = 51.4 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 65/224 (29%), Gaps = 39/224 (17%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND---IIP 64 +++ GQSN + C+ N+ ++ Sbjct: 22 PNFHIYLCLGQSNMEGNAK------------------------IEAQDTCNVNERFLMMA 57 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 CP + ++ + + G + A + R L+ +PDN + ++ GG Sbjct: 58 AVDCPSLGRVKGQWYKAVPPLVRCHTG-LTPADYFGRTLVERLPDNIKVGVINVAVGGCR 116 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRA-ALAKNPQNKFLGACWMQGEFD 183 E E H AS T + R + A+ G QGE + Sbjct: 117 IELFDEEN-CEEHIASQPEWLKNTAKAYGNNPYRRLKELAVEAQKAGVIKGILLHQGESN 175 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 + PQ + E RDL D P G+ Sbjct: 176 ---TGDKEWPQKVKRVYENLLRDLNL------QAKDVPLLAGEV 210 >UniRef50_A6DJ18 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ18_9BACT Length = 229 Score = 51.4 bits (121), Expect = 4e-05, Method: Composition-based stats. 
Identities = 38/210 (18%), Positives = 69/210 (32%), Gaps = 36/210 (17%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 + ++GQSNA G G L + P + FN +T P Sbjct: 28 LFILSGQSNAGGNGNGDELKQSQKELDPEVL--------------LAFNSGQFMTMAPIK 73 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 + ++ + Q G L ++KL P++ + RGG++ A G Sbjct: 74 KKKVK---------FKIQNSIFGTELSFSKKLKQAYPNDIIAICKVGIRGGTSIVAW--G 122 Query: 132 TYSERHGASHDACRWGTDTP----LYQDLVSRTRAALAKNPQNK------FLGACWMQGE 181 R G + G + LYQ+++ + + K G W Q E Sbjct: 123 KDRTRPGWKEELKALGIEEASQRMLYQEIIDGVNKGIENLKKRKDVKEVIISGMWWCQTE 182 Query: 182 FDLM-TSDYASHPQHFNHMVEAFRRDLKQY 210 D ++ ++ + + R+D Sbjct: 183 RDSSFVEFSKAYEKNLTNFINNLRQDFNTP 212 >UniRef50_Q11TG0 CHU large protein; candidate polyfunctional acetylxylan esterase/b-xylosidase/a-L-arabinofuranosidase, CBM9 module, Glycoside Hydrolase Family 43 protein and Carbohydrate Esterase Family 6 protein n=11 Tax=Bacteroidetes RepID=Q11TG0_CYTH3 Length = 1585 Score = 51.4 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 68/223 (30%), Gaps = 36/223 (16%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 +++ GQSN G G+ + A + R + + G C Sbjct: 27 PNFHIYLTFGQSNM--EGNGVIEAQDQTAVNSRFQVM--------GAVNCTGTKS----- 71 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG---SA 124 + P+ + +G + R ++ +P N + +VP GG + Sbjct: 72 --YTTGKWTTATAPIVRCNTG----LGPLDYFGRTMVSNLPANIKVGVVPVAIGGCDIAL 125 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL 184 F + G+Y + Y LV + A G + QGE + Sbjct: 126 FDKVNYGSYVATAPSWMIGTINQYGGNPYARLVEVAKLA---QKDGVIKGILFHQGETN- 181 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 + P + + +DL ++ P+ G+ Sbjct: 182 --NGQQDWPAKVKAIYDNLIKDLGLDPAK------TPFLAGEL 216 >UniRef50_C9RND1 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. 
succinogenes S85 RepID=C9RND1_FIBSS Length = 341 Score = 50.7 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 63/226 (27%), Gaps = 32/226 (14%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 ++ GQSN + D + +PR L Sbjct: 21 PNLHIYLAYGQSNMSGQA---TITDTDRQTNPRFLVL----------------------R 55 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + G +P A VG RK++ +PD+ + + GG + Sbjct: 56 AGNHSNQKVGEFYPAAPPMGHSGSKVGIVDFFGRKMIKELPDSITVAVANVAIGGQSIDL 115 Query: 128 GSE--GTYSERHGASHDACRWGTDTPLYQ-DLVSR-TRAALAKNPQNKFLGACWMQGEFD 183 + ++ + + W Y D+ R + G + QGE D Sbjct: 116 FDKDRNAAYVQNAKNKNDTWWIQYLNEYGGDVHKRIVEMGKIAKQKGVIKGFLFHQGEAD 175 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 P+ + + F +L+ + + GD W Sbjct: 176 YQ---MKDWPERVKKVYDQFIEELELDPEKTPILLGELAPTGDLGW 218 >UniRef50_C9RLV5 Carbohydrate binding family 6 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLV5_FIBSS Length = 524 Score = 50.7 bits (119), Expect = 8e-05, Method: Composition-based stats. 
Identities = 36/228 (15%), Positives = 68/228 (29%), Gaps = 34/228 (14%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 + +++ GQSN + + + R K A Sbjct: 28 SEAAPDPNFHIYIAYGQSNMGGTADAQ---SADKVENSRFKIFATQK------------- 71 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 C ++ G +P + T+ A R + +P N I I+P G Sbjct: 72 ------CSGKGRNTLGDVYPAVPSLFNCGNTISVADWFGRTMADSMP-NVTIGIIPVAVG 124 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRT--RAALAKNPQNKFLGACWMQ 179 G++ + Y + + V++T A + G + Q Sbjct: 125 GASIKLFDQDQYKTYLSTAETWLQNYAKEYASDGNVTKTIIDIAKKAQEKGVIKGFIFHQ 184 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 GE D SD+ +V+ R D+ + + P+ G+ Sbjct: 185 GETDGGYSDWPK-------IVKKTRDDI--LKALDMSSDTVPFVAGEL 223 >UniRef50_B7AIJ5 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIJ5_9BACE Length = 1019 Score = 50.7 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 65/229 (28%), Gaps = 41/229 (17%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAP--HPRIKQLARFAHTHPGGPPCHFNDII 63 +++ GQSN E +P +D +PR + +A G + I Sbjct: 29 PDPNFFIYLCIGQSNM----EAGAVPAEQDKDFNNPRFQFMAAVDMPKLGREMGKWYTAI 84 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 P P+ +G RK++ +P + ++ G+ Sbjct: 85 P----------------PICREGNN----LGPVDFFGRKMIDILPSEYHVGVINVSVAGA 124 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTP---LYQDLVSRTRAALAKNPQNKFLGACWMQG 180 Y + D + Y+ LV+ R A G QG Sbjct: 125 KIQLWDREDYKDYIDNERDWMKNIVSQYGGNPYERLVNMARLA---QKDGVIKGILMHQG 181 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 E + P+ + + +DL Q P G+ + Sbjct: 182 ESNSEDPL---WPERVKKIYDNLCKDLNLNPKQ------TPLLAGELKY 221 >UniRef50_A6DIF1 Acetyl xylan esterase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DIF1_9BACT Length = 240 Score = 50.3 bits (118), Expect = 1e-04, Method: Composition-based stats. 
Identities = 18/59 (30%), Positives = 29/59 (49%) Query: 153 YQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYH 211 YQ ++ + + L K P+ + CWMQGE D ++ ++ FRRDLK+ Sbjct: 93 YQPILDQYKELLKKYPKPASVTFCWMQGESDAQGRVSVAYKDSLKLLISNFRRDLKRPD 151 >UniRef50_A9GHE3 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GHE3_SORC5 Length = 346 Score = 49.9 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 29/224 (12%), Positives = 62/224 (27%), Gaps = 57/224 (25%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLA----RFAHTHPGGPPCHFNDII 63 +++ + GQSN + R+K L + PP Sbjct: 115 PTFHIFMLMGQSNMAGVAAKQA---SDQNSDQRLKVLGGCNQPAGQWNLANPPL------ 165 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 + CP + + +V + + LL + + I ++ G Sbjct: 166 --SDCPGESRINLST-------------SVDPGIWFGKTLLGKLREGDTIGLIGTAESGE 210 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 + G + + +++ + A +F G + QGE D Sbjct: 211 SINTFISGGSHHQTILNK---------------IAKAKTA----ENARFAGIIFHQGETD 251 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 + +S P + + + D P+ G+ Sbjct: 252 ---TGQSSWPGKVVQLYNEMKAAWGVDY-------DVPFILGEL 285 >UniRef50_UPI0001C367C9 hypothetical protein ChatD1_04961 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C367C9 Length = 254 Score = 49.9 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 36/224 (16%), Positives = 62/224 (27%), Gaps = 59/224 (26%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 +L GQSN G+ P+ + + P Sbjct: 5 ILLFMGQSNMAGRGDYRLAPEVLPGAAYEYRAVTEPDTLVP------------------- 45 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQAL---HIARKLLPFIPDNAG--ILIVPCCRGGSAFT 126 P N + G + +A + G I+ V C +GGS Sbjct: 46 ------LTEPFGVNENREGGVFEPGMKTGSMAAAFVNACYRKTGRPIIAVSCSKGGSRIQ 99 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFL----GACWMQG-- 180 +TP ++D +R +A L+ + G W QG Sbjct: 100 EWQP------------------ETPYFKDAAARYQACLSFVQSRQIAVHSTGMVWCQGCT 141 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQ---YHSQLNNITDAP 221 D + A + + +A + L + Q+ N + P Sbjct: 142 NADDGMAK-AEYKEKTKAFFQAVKS-LGVDKIFLIQIGNHREFP 183 >UniRef50_D2R922 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R922_9PLAN Length = 240 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 49/142 (34%), Gaps = 15/142 (10%) Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAG---ILIVPCCRGGSAFTAG 128 D +G H L + L A+ +P + + G ++IV R G Sbjct: 24 ASDTRGVHLILLSGQSNM-----ANLDPAQVFIPEVERHFGAENVVIVKVARSGQPIRRW 78 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 Y + + D LY+ L++ AL P WMQGE D+ Sbjct: 79 ----YKKWTVTGKQNPKEIGD--LYEQLMATAEKALNGRPIQS-ATLIWMQGERDVKERL 131 Query: 189 YASHPQHFNHMVEAFRRDLKQY 210 A + F MVE + DL Sbjct: 132 SAHYKVAFLGMVEQLKTDLDVP 153 >UniRef50_A0LNW1 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LNW1_SYNFM Length = 261 Score = 48.0 bits (112), Expect = 5e-04, Method: Composition-based stats. 
Identities = 35/185 (18%), Positives = 62/185 (33%), Gaps = 27/185 (14%) Query: 60 NDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCC 119 + P + V DM P Q G + + + P ++++ Sbjct: 47 DLRTPHQLKEYHVLDMDYSEKPRIVQTFDQRSHFGPEVRFVQLYVKANPSR-EVILLKMV 105 Query: 120 RGGSAFTAGSE---GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGAC 176 + GS T S G Y + G LY+ LV A+ ++ G Sbjct: 106 KNGSGMTRWSPKWPGEYDQWTGD------------LYRILVDFVIEAVDGRDV-EWGGFL 152 Query: 177 WMQGEFDLM-TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENF 235 ++QGE D + ++ Q+ ++V R DL + +P E + Sbjct: 153 FVQGENDSVYPERARAYVQNLRNLVNRLREDLGAPKMPVMTSEVSPVL---------EKY 203 Query: 236 PHSYE 240 PH Y+ Sbjct: 204 PHQYQ 208 >UniRef50_UPI0001BC8395 sialate O-acetylesterase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC8395 Length = 478 Score = 47.6 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 32/150 (21%), Positives = 50/150 (33%), Gaps = 26/150 (17%) Query: 96 ALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGT------YSERHGASHDACRWGTD 149 + ARKLL + + +V GGS + Y H +A G + Sbjct: 185 GYYFARKLLSSLD--VPVGLVVDAYGGSPIQSWIPYAETLKPLYKAEHETLQEAVEKGKE 242 Query: 150 TPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQ 209 P Y L S A + K G W QGE ++ D + +V ++R+ K Sbjct: 243 KPEYNMLSSLYNAMVHPLIDYKIRGWLWYQGEANVG--DAGRYIAMMKDLVSSWRKKWK- 299 Query: 210 YHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 P+ Y+ + P Y Sbjct: 300 --------AKLPF-------YYVQIAPFQY 314 >UniRef50_B5JPM4 Conserved domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPM4_9BACT Length = 468 Score = 44.1 bits (102), Expect = 0.007, Method: Composition-based stats. 
Identities = 56/318 (17%), Positives = 98/318 (30%), Gaps = 28/318 (8%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 V ++GQSN LPL DA P + A P F H Sbjct: 102 VWLLSGQSNM-----ELPLAGWPDAEPPCPIEGGPEAIAAADHPQIRFIIAGQKPAASHQ 156 Query: 72 VQDMQGYHHPLATNHQT----QYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + P T ++ VG A K +P I ++ GGSA A Sbjct: 157 AEISSNSPLPAWTVCHPDTVPEFSAVGYFFARALKEKVRVP----IGLIQSTWGGSACEA 212 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRT---RAALAKNPQNKFLGACWMQGEFDL 184 + + + + +P D + + +A G W QGE ++ Sbjct: 213 WTPSHALKTLEDFRNLAPFAPQSP--DDNYTPSVLFNGMIAPLAPFTLAGILWYQGESNV 270 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 + F M++++R Q APW Y ++ P ++A Sbjct: 271 G--RHQQLETLFPAMIKSWRATFDQPELPFYFAHIAPW-----AGYERDTLPKFWQAQAS 323 Query: 245 NYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFST 304 L N G+ + P + P A R+ + + ++ S Sbjct: 324 A---LDLPNTALAVTIDCGDSANIHPPHKKPIGERFAQLALANRTDGYSSQSTGPNYQSR 380 Query: 305 AARRGIISDRFVEAILQF 322 + + ++ F + +Q Sbjct: 381 SIQDSQLTLHFSTSDIQL 398 >UniRef50_A8ITT3 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8ITT3_CHLRE Length = 304 Score = 42.9 bits (99), Expect = 0.016, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 34/98 (34%), Gaps = 21/98 (21%) Query: 90 YGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF-TAGSEGTYSERHGASHDACRWGT 148 Y + G L R LL + + + VP GG+ G Sbjct: 168 YDSCGPDLGFGRVLLQ-LGVSGRVGFVPTAAGGTNLADMWCPGC---------------- 210 Query: 149 DTPLYQDLVSRTRAAL-AKNPQNKFLGACWMQGEFDLM 185 PLY+D+ A+ A P + G W+QGE D Sbjct: 211 --PLYKDMAQTVVRAMRAAGPNARLRGMLWVQGESDAN 246 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39370 Uncharacterized protein yjhS n=156 Tax=root RepI... 
378 e-103 UniRef50_Q9FCW8 ORF616 n=118 Tax=root RepID=Q9FCW8_ECOLX 364 3e-99 UniRef50_B2U4Z6 YjhS n=22 Tax=root RepID=B2U4Z6_SHIB3 353 4e-96 UniRef50_D2TQ80 Hypothetical prophage protein n=1 Tax=Citrobacte... 345 1e-93 UniRef50_D1N2H8 Putative uncharacterized protein n=2 Tax=Victiva... 204 4e-51 UniRef50_D2QHB3 Putative uncharacterized protein n=1 Tax=Spiroso... 202 2e-50 UniRef50_Q7UGU5 Probable acetyl xylan esterase AxeA n=2 Tax=Plan... 195 2e-48 UniRef50_A3I3A6 Probable acetyl xylan esterase AxeA n=1 Tax=Algo... 190 7e-47 UniRef50_C6XV32 Putative uncharacterized protein n=1 Tax=Pedobac... 187 5e-46 UniRef50_A6CAJ7 Probable acetyl xylan esterase AxeA n=1 Tax=Plan... 182 1e-44 UniRef50_B5JF90 Conserved domain protein n=1 Tax=Verrucomicrobia... 178 3e-43 UniRef50_A9GML8 Iduronate-2-sulfatase n=1 Tax=Sorangium cellulos... 174 3e-42 UniRef50_B2UM46 Putative uncharacterized protein n=1 Tax=Akkerma... 174 4e-42 UniRef50_B9MVW2 Predicted protein n=11 Tax=Magnoliophyta RepID=B... 171 4e-41 UniRef50_Q9F106 Carbohydrate binding family 6 n=1 Tax=Fibrobacte... 169 1e-40 UniRef50_B9XLT7 Putative uncharacterized protein n=1 Tax=bacteri... 169 2e-40 UniRef50_A9RQK4 Predicted protein n=1 Tax=Physcomitrella patens ... 169 2e-40 UniRef50_Q01TY8 Putative uncharacterized protein n=1 Tax=Candida... 168 2e-40 UniRef50_UPI00017448C4 hypothetical protein VspiD_04945 n=1 Tax=... 166 1e-39 UniRef50_C9RS24 Carbohydrate binding family 6 n=1 Tax=Fibrobacte... 165 1e-39 UniRef50_C0ABN2 Putative uncharacterized protein n=2 Tax=Opituta... 165 2e-39 UniRef50_B0RC94 Putative surface-anchored protein n=1 Tax=Clavib... 164 4e-39 UniRef50_Q11TG0 CHU large protein; candidate polyfunctional acet... 161 4e-38 UniRef50_C0ACS7 Putative uncharacterized protein n=1 Tax=Opituta... 160 6e-38 UniRef50_C9RND1 Putative uncharacterized protein n=1 Tax=Fibroba... 159 1e-37 UniRef50_B8I0M1 Carbohydrate binding family 6 n=6 Tax=Clostridiu... 
158 3e-37 UniRef50_C7NVN3 Carbohydrate-binding family V/XII n=1 Tax=Halorh... 157 4e-37 UniRef50_B7AIJ5 Putative uncharacterized protein n=1 Tax=Bactero... 157 4e-37 UniRef50_A6CD12 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A6C... 157 6e-37 UniRef50_C9RLV5 Carbohydrate binding family 6 n=1 Tax=Fibrobacte... 157 7e-37 UniRef50_Q8L9J9 Probable carbohydrate esterase At4g34215 n=10 Ta... 156 8e-37 UniRef50_C5BRL9 Acetylxylan esterase / xylanase n=1 Tax=Teredini... 156 1e-36 UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 153 6e-36 UniRef50_C3QEP9 Glycoside hydrolase family 43 n=6 Tax=root RepID... 153 8e-36 UniRef50_C9RKV3 Putative uncharacterized protein n=1 Tax=Fibroba... 153 8e-36 UniRef50_C0A5B6 Putative uncharacterized protein n=1 Tax=Opituta... 152 1e-35 UniRef50_C5WRX2 Putative uncharacterized protein Sb01g000530 n=2... 152 2e-35 UniRef50_Q8A041 Acetyl xylan esterase A n=8 Tax=Bacteroides RepI... 150 5e-35 UniRef50_A6DRT7 Putative uncharacterized protein n=1 Tax=Lentisp... 150 8e-35 UniRef50_B0BZZ0 Putative uncharacterized protein n=1 Tax=Acaryoc... 149 1e-34 UniRef50_A8FC47 Possible acetylxylan esterase n=19 Tax=Bacteria ... 148 2e-34 UniRef50_Q84M79 Os03g0857600 protein n=4 Tax=Poaceae RepID=Q84M7... 140 7e-32 UniRef50_D2R5Y4 Putative uncharacterized protein n=1 Tax=Pirellu... 139 1e-31 UniRef50_Q7XSV9 OSJNBa0027H06.16 protein n=5 Tax=Poaceae RepID=Q... 138 2e-31 UniRef50_C1ZKK3 Putative uncharacterized protein n=1 Tax=Plancto... 135 3e-30 UniRef50_C7J1I1 Os04g0110400 protein n=2 Tax=Poaceae RepID=C7J1I... 134 5e-30 UniRef50_B5JIZ7 Conserved domain protein n=1 Tax=Verrucomicrobia... 133 8e-30 UniRef50_A6C656 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris... 130 7e-29 UniRef50_Q9LF91 Putative uncharacterized protein F8J2_180 n=1 Ta... 127 7e-28 UniRef50_Q07GI7 Conserved domain protein n=1 Tax=Roseobacter den... 123 8e-27 UniRef50_A6DJ18 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 
123 1e-26 UniRef50_A0LNW1 Putative uncharacterized protein n=1 Tax=Syntrop... 118 2e-25 UniRef50_A9GHE3 Putative uncharacterized protein n=1 Tax=Sorangi... 117 5e-25 UniRef50_UPI0001BC8395 sialate O-acetylesterase n=1 Tax=Bacteroi... 114 6e-24 UniRef50_C6VVT7 Putative uncharacterized protein n=2 Tax=Dyadoba... 108 2e-22 UniRef50_A3K171 Probable acetyl xylan esterase AxeA n=2 Tax=Bact... 100 9e-20 UniRef50_UPI0001C367C9 hypothetical protein ChatD1_04961 n=1 Tax... 99 3e-19 UniRef50_UPI00016C0614 hypothetical protein Epulo_09645 n=1 Tax=... 97 8e-19 UniRef50_B4DB21 Putative uncharacterized protein n=1 Tax=Chthoni... 93 1e-17 UniRef50_D2R922 Putative uncharacterized protein n=1 Tax=Pirellu... 88 6e-16 UniRef50_A6DIF1 Acetyl xylan esterase A n=1 Tax=Lentisphaera ara... 83 1e-14 UniRef50_Q6KCY2 Hypothetical adenine-specific methylase YfcB n=1... 83 1e-14 Sequences not found previously or not previously below threshold: UniRef50_Q11VQ5 CHU large protein; candidate bifunctional acetyl... 157 4e-37 UniRef50_C3QXI5 Glycoside hydrolase family 43 protein n=5 Tax=Ba... 148 2e-34 UniRef50_O13495 Acetylxylan esterase n=2 Tax=Neocallimastigaceae... 143 8e-33 UniRef50_A5FD34 Candidate bifunctional acetylxylan esterase/feru... 125 2e-27 UniRef50_Q2YI73 Acetlyxylan esterase (Fragment) n=1 Tax=unidenti... 122 2e-26 UniRef50_UPI00016C5503 hypothetical protein GobsU_26641 n=1 Tax=... 112 2e-23 UniRef50_C1ZJF8 Putative uncharacterized protein n=5 Tax=Bacteri... 108 2e-22 UniRef50_C3PX29 Predicted protein n=4 Tax=Bacteroides RepID=C3PX... 103 8e-21 UniRef50_A6DFN7 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 102 2e-20 UniRef50_UPI00016C3AF5 hypothetical protein GobsU_22277 n=1 Tax=... 100 1e-19 UniRef50_A6DFJ1 Sialate O-acetylesterase n=1 Tax=Lentisphaera ar... 97 9e-19 UniRef50_C6XSW9 Sialate O-acetylesterase n=1 Tax=Pedobacter hepa... 94 9e-18 UniRef50_D2R930 Putative uncharacterized protein n=1 Tax=Pirellu... 
93 1e-17 UniRef50_A5FCG2 Sialate O-acetylesterase n=1 Tax=Flavobacterium ... 93 1e-17 UniRef50_A7V9B4 Putative uncharacterized protein n=2 Tax=Bactero... 92 2e-17 UniRef50_A3ZRY8 Iduronate-2-sulfatase n=1 Tax=Blastopirellula ma... 92 3e-17 UniRef50_C7PT40 Sialate O-acetylesterase n=3 Tax=Bacteria RepID=... 90 1e-16 UniRef50_B5CXD4 Putative uncharacterized protein n=1 Tax=Bactero... 89 2e-16 UniRef50_A5FA09 Sialate O-acetylesterase n=1 Tax=Flavobacterium ... 89 3e-16 UniRef50_UPI0001BC840D sialic acid-specific 9-O-acetylesterase n... 89 3e-16 UniRef50_C6Y155 Sialate O-acetylesterase n=1 Tax=Pedobacter hepa... 87 1e-15 UniRef50_B3C638 Putative uncharacterized protein n=1 Tax=Bactero... 86 1e-15 UniRef50_B5Y539 Predicted protein n=1 Tax=Phaeodactylum tricornu... 86 2e-15 UniRef50_B7ALN1 Putative uncharacterized protein n=1 Tax=Bactero... 85 2e-15 UniRef50_A6DRW1 Acetyl xylan esterase A n=1 Tax=Lentisphaera ara... 85 3e-15 UniRef50_D2R5K6 Putative uncharacterized protein n=1 Tax=Pirellu... 85 4e-15 UniRef50_C9KWT4 Sialic acid-specific 9-O-acetylesterase n=16 Tax... 85 4e-15 UniRef50_C0D050 Putative uncharacterized protein n=2 Tax=Clostri... 85 5e-15 UniRef50_A6L837 Sialic acid-specific 9-O-acetylesterase n=8 Tax=... 84 7e-15 UniRef50_B7AIP4 Putative uncharacterized protein n=3 Tax=Bactero... 84 8e-15 UniRef50_A6DFR1 Acetylxylan esterase related enzyme n=1 Tax=Lent... 83 1e-14 UniRef50_C6Y117 Sialate O-acetylesterase n=2 Tax=Pedobacter hepa... 83 1e-14 UniRef50_A6DGA5 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 83 1e-14 UniRef50_B5JPM4 Conserved domain protein n=1 Tax=Verrucomicrobia... 83 1e-14 UniRef50_B3CHU5 Putative uncharacterized protein n=1 Tax=Bactero... 82 2e-14 UniRef50_C7PN81 Sialate O-acetylesterase n=1 Tax=Chitinophaga pi... 82 3e-14 UniRef50_Q8A331 Putative sialic acid-specific acetylesterase n=4... 82 4e-14 UniRef50_Q8A2Y4 Sialic acid-specific 9-O-acetylesterase n=10 Tax... 
81 5e-14 UniRef50_B9XHK5 Autotransporter-associated beta strand repeat pr... 81 5e-14 UniRef50_UPI000180C4CE PREDICTED: similar to LOC495015 protein n... 81 5e-14 UniRef50_B7ALS5 Putative uncharacterized protein n=1 Tax=Bactero... 81 5e-14 UniRef50_A6DQW4 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 81 6e-14 UniRef50_C5SLN9 Sialate O-acetylesterase n=1 Tax=Asticcacaulis e... 81 7e-14 UniRef50_A6DFP9 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 81 7e-14 UniRef50_B7AHV8 Putative uncharacterized protein n=1 Tax=Bactero... 80 7e-14 UniRef50_A9UYV9 Predicted protein n=2 Tax=Monosiga brevicollis R... 80 9e-14 UniRef50_C6XUV3 Sialate O-acetylesterase n=1 Tax=Pedobacter hepa... 80 1e-13 UniRef50_C0A824 Sialate O-acetylesterase n=1 Tax=Opitutaceae bac... 80 1e-13 UniRef50_B4REE2 Putative uncharacterized protein n=1 Tax=Phenylo... 80 1e-13 UniRef50_C3RDR8 Putative uncharacterized protein n=2 Tax=Bactero... 80 1e-13 UniRef50_B3CHU8 Putative uncharacterized protein n=2 Tax=Bactero... 80 1e-13 UniRef50_A6DN35 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 80 1e-13 UniRef50_A5ZHN2 Putative uncharacterized protein n=1 Tax=Bactero... 79 2e-13 UniRef50_C6XXZ6 Sialate O-acetylesterase n=1 Tax=Pedobacter hepa... 79 2e-13 UniRef50_UPI000196870D hypothetical protein BACCELL_00054 n=1 Ta... 79 2e-13 UniRef50_B0NPW7 Putative uncharacterized protein n=1 Tax=Bactero... 79 2e-13 UniRef50_UPI0001BC816D sialic acid-specific 9-O-acetylesterase n... 79 2e-13 UniRef50_B5JQ65 Conserved domain protein n=1 Tax=Verrucomicrobia... 79 2e-13 UniRef50_A6L7S8 Sialate O-acetylesterase n=10 Tax=Bacteroidales ... 78 3e-13 UniRef50_A7V893 Putative uncharacterized protein n=4 Tax=Bactero... 78 3e-13 UniRef50_A7LY32 Putative uncharacterized protein n=1 Tax=Bactero... 78 3e-13 UniRef50_C1F9V4 Sialate O-acetylesterase homolog n=1 Tax=Acidoba... 78 3e-13 UniRef50_C6Y404 Sialate O-acetylesterase n=1 Tax=Pedobacter hepa... 
78 3e-13 UniRef50_C1ZI44 Putative uncharacterized protein n=1 Tax=Plancto... 78 4e-13 UniRef50_C5RDR3 LPXTG-motif cell wall anchor domain protein n=2 ... 78 4e-13 UniRef50_D2QHH9 Putative uncharacterized protein n=1 Tax=Spiroso... 78 4e-13 UniRef50_A6DKT6 Putative uncharacterized protein n=1 Tax=Lentisp... 78 4e-13 UniRef50_D2QYI9 Sialate O-acetylesterase n=1 Tax=Pirellula stale... 78 4e-13 UniRef50_D2QX68 Sialate O-acetylesterase n=1 Tax=Pirellula stale... 77 8e-13 UniRef50_C6VZW5 Putative uncharacterized protein n=1 Tax=Dyadoba... 76 2e-12 UniRef50_A4AN28 Sialate O-acetylesterase n=1 Tax=Flavobacteriale... 76 2e-12 UniRef50_C3QSB9 Sialate O-acetylesterase n=8 Tax=Bacteroides Rep... 76 2e-12 UniRef50_UPI0001BC8367 sialic acid-specific 9-O-acetylesterase n... 75 2e-12 UniRef50_B3CGC9 Putative uncharacterized protein n=2 Tax=Bactero... 75 3e-12 UniRef50_D1PH08 Putative sialate O-acetylesterase n=1 Tax=Prevot... 75 3e-12 UniRef50_Q1LUX8 Novel protein (Zgc:56454) n=3 Tax=Danio rerio Re... 75 3e-12 UniRef50_A6DH60 Sialic acid-specific 9-O-acetylesterase n=2 Tax=... 75 3e-12 UniRef50_A6LG42 Sialate O-acetylesterase n=21 Tax=Bacteroidetes ... 75 5e-12 UniRef50_C2FWW0 Putative uncharacterized protein n=2 Tax=Sphingo... 74 5e-12 UniRef50_B7FW97 Predicted protein n=1 Tax=Phaeodactylum tricornu... 74 5e-12 UniRef50_A6DF57 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 74 6e-12 UniRef50_A6DS94 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 74 7e-12 UniRef50_B4CVC6 Sialate O-acetylesterase n=1 Tax=Chthoniobacter ... 73 1e-11 UniRef50_C6W5L8 Putative uncharacterized protein n=1 Tax=Dyadoba... 73 1e-11 UniRef50_A4AM20 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 73 1e-11 UniRef50_C6Z4B3 Polysaccharide deacetylase n=10 Tax=Bacteroides ... 73 1e-11 UniRef50_B1ZNK9 Sialate O-acetylesterase n=1 Tax=Opitutus terrae... 72 2e-11 UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-... 
72 2e-11 UniRef50_D1NBH8 Putative uncharacterized protein n=1 Tax=Victiva... 72 2e-11 UniRef50_B7G104 Predicted protein n=1 Tax=Phaeodactylum tricornu... 72 2e-11 UniRef50_D2QKK0 Conserved repeat domain protein n=1 Tax=Spirosom... 72 3e-11 UniRef50_UPI00016E1BD2 UPI00016E1BD2 related cluster n=2 Tax=Tak... 72 4e-11 UniRef50_Q9HAT2 Sialate O-acetylesterase n=16 Tax=Tetrapoda RepI... 72 4e-11 UniRef50_C3Z127 Putative uncharacterized protein n=3 Tax=Branchi... 72 4e-11 UniRef50_A7S3W8 Predicted protein (Fragment) n=1 Tax=Nematostell... 71 4e-11 UniRef50_UPI00016C4ED6 sialic acid-specific 9-O-acetylesterase n... 71 4e-11 UniRef50_C6VZW6 Putative uncharacterized protein n=1 Tax=Dyadoba... 71 4e-11 UniRef50_A5FC32 Sialate O-acetylesterase n=2 Tax=Flavobacteriace... 71 5e-11 UniRef50_A0ZZI1 Sialic acid-specific 9-O-acetylesterase n=6 Tax=... 71 5e-11 UniRef50_D1N4T8 Sialate O-acetylesterase n=1 Tax=Victivallis vad... 71 6e-11 UniRef50_Q7UL92 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 70 8e-11 UniRef50_UPI00019689D5 hypothetical protein BACCELL_02528 n=1 Ta... 70 1e-10 UniRef50_Q1IR02 Sialate O-acetylesterase n=1 Tax=Candidatus Kori... 70 1e-10 UniRef50_Q022C5 Sialate O-acetylesterase n=1 Tax=Candidatus Soli... 70 1e-10 UniRef50_D2QFV7 Conserved repeat domain protein n=1 Tax=Spirosom... 70 1e-10 UniRef50_D2EUP1 Sialate O-acetylesterase n=1 Tax=Bacteroides sp.... 70 1e-10 UniRef50_B2UQD5 Putative uncharacterized protein n=1 Tax=Akkerma... 70 2e-10 UniRef50_A8ITT3 Predicted protein n=1 Tax=Chlamydomonas reinhard... 70 2e-10 UniRef50_UPI0001BC7C56 hypothetical protein BacD2_14780 n=1 Tax=... 69 3e-10 UniRef50_C6VYE2 Putative uncharacterized protein n=1 Tax=Dyadoba... 68 3e-10 UniRef50_D2QD65 Putative uncharacterized protein n=1 Tax=Spiroso... 68 5e-10 UniRef50_Q7UPP2 Sialic acid-specific 9-O-acetylesterase n=4 Tax=... 67 8e-10 UniRef50_D1N7W8 Putative uncharacterized protein n=1 Tax=Victiva... 
67 8e-10 UniRef50_UPI000196858D hypothetical protein BACCELL_00130 n=1 Ta... 67 1e-09 UniRef50_A7M023 Putative uncharacterized protein n=1 Tax=Bactero... 66 1e-09 UniRef50_C0A737 Sialate O-acetylesterase n=1 Tax=Opitutaceae bac... 66 2e-09 UniRef50_D2QD66 Putative uncharacterized protein n=1 Tax=Spiroso... 66 2e-09 UniRef50_A0YRB5 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 65 3e-09 UniRef50_D2QJG4 Putative uncharacterized protein n=1 Tax=Spiroso... 65 3e-09 UniRef50_C3Y8H9 Putative uncharacterized protein n=1 Tax=Branchi... 65 3e-09 UniRef50_C0ADE1 Sialate O-acetylesterase n=1 Tax=Opitutaceae bac... 65 3e-09 UniRef50_C1E516 Sialic-acid o-acetylesterase-like protein n=1 Ta... 64 7e-09 UniRef50_A6ECR5 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 64 9e-09 UniRef50_C0ACS1 Sialate O-acetylesterase n=1 Tax=Opitutaceae bac... 63 9e-09 UniRef50_A6DF88 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 63 1e-08 UniRef50_D2R456 Putative uncharacterized protein n=1 Tax=Pirellu... 63 1e-08 UniRef50_D2QHE7 Putative uncharacterized protein n=1 Tax=Spiroso... 63 2e-08 UniRef50_A6DLA5 Sialic-acid O-acetylesterase n=1 Tax=Lentisphaer... 62 2e-08 UniRef50_D2BR70 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 62 4e-08 UniRef50_C0A8R8 Sialate O-acetylesterase n=1 Tax=Opitutaceae bac... 62 4e-08 UniRef50_B1GS50 Maturation/adhesion protein n=1 Tax=Salmonella p... 62 4e-08 UniRef50_UPI00019275B4 PREDICTED: similar to cytosolic sialic ac... 61 5e-08 UniRef50_Q7UHJ8 Sialic-acid O-acetylesterase n=1 Tax=Rhodopirell... 61 5e-08 UniRef50_Q8A042 Polysaccharide deacetylase n=7 Tax=Bacteroides R... 61 5e-08 UniRef50_C9PT41 Sialic acid-specific 9-O-acetylesterase n=3 Tax=... 61 5e-08 UniRef50_D2Q6U1 Sialic acid-specific 9-O-acetylesterase n=6 Tax=... 61 6e-08 UniRef50_UPI0001AEC524 hypothetical protein AmacA2_21053 n=1 Tax... 61 7e-08 UniRef50_UPI0001924939 PREDICTED: similar to predicted protein, ... 
60 7e-08 UniRef50_A3HTJ4 Putative sialate O-acetylesterase n=1 Tax=Algori... 60 8e-08 UniRef50_UPI0001924174 PREDICTED: similar to predicted protein n... 59 2e-07 UniRef50_Q21HS7 Glycoside hydrolase family 2, sugar binding n=1 ... 59 2e-07 UniRef50_C3QEP5 Sialic acid-specific 9-O-acetylesterase n=2 Tax=... 59 2e-07 UniRef50_C6LL91 Putative ExsB n=1 Tax=Bryantella formatexigens D... 59 2e-07 UniRef50_C6Y1K6 Putative uncharacterized protein n=1 Tax=Pedobac... 59 3e-07 UniRef50_B7AFZ4 Putative uncharacterized protein n=1 Tax=Bactero... 58 4e-07 UniRef50_UPI0001924DD2 PREDICTED: similar to predicted protein, ... 58 5e-07 UniRef50_A2RKE5 Sialic acid-specific 9-O-acetylesterase n=2 Tax=... 58 6e-07 UniRef50_C4XH28 Putative uncharacterized protein n=2 Tax=Desulfo... 58 6e-07 UniRef50_UPI0001923B19 PREDICTED: similar to predicted protein n... 58 6e-07 UniRef50_C3QLH3 Sialic acid-specific 9-O-acetylesterase n=11 Tax... 57 7e-07 UniRef50_C6VWZ0 Putative uncharacterized protein n=1 Tax=Dyadoba... 57 8e-07 UniRef50_B8I0P3 Putative uncharacterized protein n=5 Tax=Bacteri... 57 9e-07 UniRef50_UPI0001744488 sialic acid-specific 9-O-acetylesterase n... 57 9e-07 UniRef50_B2IIR7 Putative uncharacterized protein n=1 Tax=Beijeri... 57 1e-06 UniRef50_UPI0001968E9D hypothetical protein BACCELL_02606 n=1 Ta... 57 1e-06 UniRef50_B8DVE1 Sialic acid-specific 9-O-acetylesterase n=4 Tax=... 57 1e-06 UniRef50_B9XAZ4 Putative uncharacterized protein n=1 Tax=bacteri... 56 2e-06 UniRef50_A6KWF2 Sialic acid-specific 9-O-acetylesterase n=8 Tax=... 56 2e-06 UniRef50_D2U9U1 Putative hydrolase protein n=1 Tax=Xanthomonas a... 55 3e-06 UniRef50_C3QG04 Sialic acid-specific 9-O-acetylesterase n=4 Tax=... 55 3e-06 UniRef50_A8A1H8 Putative uncharacterized protein n=1 Tax=Escheri... 55 5e-06 UniRef50_Q0FSF8 Putative hemagglutinin-related protein n=3 Tax=R... 54 7e-06 UniRef50_C0AED8 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 
54 8e-06 UniRef50_UPI0001925DCB PREDICTED: similar to predicted protein n... 54 8e-06 UniRef50_C5BNV0 ExsB n=1 Tax=Teredinibacter turnerae T7901 RepID... 54 9e-06 UniRef50_C6IKL7 Sialic acid-specific 9-O-acetylesterase n=9 Tax=... 53 9e-06 UniRef50_UPI0001BC87BE hypothetical protein BacD2_14083 n=1 Tax=... 53 1e-05 UniRef50_Q15XN0 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 53 1e-05 UniRef50_D1NUY7 Sugar binding domain protein, glycosyl hydrolase... 53 1e-05 UniRef50_UPI0001C35E1E hypothetical protein ChatD1_22041 n=1 Tax... 53 2e-05 UniRef50_A1A0H7 Putative secreted protein n=5 Tax=Bifidobacteriu... 53 2e-05 UniRef50_A6AXJ1 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 53 2e-05 UniRef50_C3PXV2 Sialic acid-specific 9-O-acetylesterase n=6 Tax=... 53 2e-05 UniRef50_B0MQZ6 Putative uncharacterized protein n=2 Tax=Clostri... 52 2e-05 UniRef50_C0BIJ4 Putative uncharacterized protein n=1 Tax=Flavoba... 52 3e-05 UniRef50_D2KFQ9 Beta-galactosidase/beta-glucuronidase (Fragment)... 52 4e-05 UniRef50_A6KYD0 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 52 4e-05 UniRef50_C5SG53 Putative uncharacterized protein n=1 Tax=Asticca... 52 4e-05 UniRef50_A9KN08 Putative uncharacterized protein n=1 Tax=Clostri... 51 5e-05 UniRef50_C3QAB6 Sialic acid-specific 9-O-acetylesterase n=3 Tax=... 51 5e-05 UniRef50_Q07P27 Putative uncharacterized protein n=1 Tax=Rhodops... 51 6e-05 UniRef50_D2QK09 Putative uncharacterized protein n=1 Tax=Spiroso... 51 6e-05 UniRef50_C6X165 Putative uncharacterized protein n=1 Tax=Flavoba... 51 7e-05 UniRef50_B4D814 Sialate O-acetylesterase n=2 Tax=Chthoniobacter ... 51 7e-05 UniRef50_B3PDW1 Sialic acid-specific 9-O-acetylesterase n=1 Tax=... 50 1e-04 UniRef50_C6XQG7 Putative uncharacterized protein n=1 Tax=Hirschi... 50 1e-04 UniRef50_C7PN71 Putative uncharacterized protein n=1 Tax=Chitino... 50 2e-04 UniRef50_A6E6P0 Sialic acid-specific 9-O-acetylesterase n=3 Tax=... 
50 2e-04 >UniRef50_P39370 Uncharacterized protein yjhS n=156 Tax=root RepID=YJHS_ECOLI Length = 326 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 326/326 (100%), Positives = 326/326 (100%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN Sbjct: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR Sbjct: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG Sbjct: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE Sbjct: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 Query: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS 300 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS Sbjct: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS 300 Query: 301 HFSTAARRGIISDRFVEAILQFWRER 326 HFSTAARRGIISDRFVEAILQFWRER Sbjct: 301 HFSTAARRGIISDRFVEAILQFWRER 326 >UniRef50_Q9FCW8 ORF616 n=118 Tax=root RepID=Q9FCW8_ECOLX Length = 616 Score = 364 bits (933), Expect = 3e-99, Method: Composition-based stats. 
Identities = 179/324 (55%), Positives = 221/324 (68%), Gaps = 4/324 (1%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 ++ PD+YYV+ +AGQSNAMAYGEGLPLPD DAP PRIKQLAR + PGG C +N Sbjct: 55 VSGATEPDWYYVIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN 114 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIP HC HDVQDM +HP A + QYG VGQ LHIA+KLLP+IP+NAGIL+VPCCR Sbjct: 115 DIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCR 174 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFT G+EGT+S GAS D+ RWG PLYQDL++RT+AAL KNP+N L CWMQG Sbjct: 175 GGSAFTQGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQG 234 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHS 238 EFD+ + +A P F M+ FR DL +++Q + D PW CGDTT+YWK + Sbjct: 235 EFDMSAATHAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQ 294 Query: 239 YEAIYGNYQNNVLANIIFVDFQQ--QGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 Y+ +YG Y+N + FV F G TNAP EDPD ++GYYG+A R+ N ++ Sbjct: 295 YDTVYGGYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGNQVSS 354 Query: 297 LRSSHFSTAARRGIISDRFVEAIL 320 R +HFS+ ARR II DR AIL Sbjct: 355 NRPTHFSSWARRSIIPDRLATAIL 378 >UniRef50_B2U4Z6 YjhS n=22 Tax=root RepID=B2U4Z6_SHIB3 Length = 462 Score = 353 bits (906), Expect = 4e-96, Method: Composition-based stats. 
Identities = 176/330 (53%), Positives = 218/330 (66%), Gaps = 6/330 (1%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 ++A P+YY+V+ +AGQSN M+YGEGLPLP D P PRIKQLAR + PGG C +N Sbjct: 54 ISATSDPEYYFVVVLAGQSNGMSYGEGLPLPGTYDRPDPRIKQLARRSTVTPGGAACKYN 113 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 DIIP HC HDVQDM +HP A + QYGTVGQ LHIA+KLLPFIP NAGIL+VPCCR Sbjct: 114 DIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCR 173 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 GGSAFT G++GTYS+ GAS ++ RWG D PLY+DL+ RT+AAL KNP+N WMQG Sbjct: 174 GGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALEKNPKNVLFAVVWMQG 233 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHS 238 EFD + +H F +V+ FR DL Q + PW CGDTT++WK+ + Sbjct: 234 EFDFGGTPA-NHAAQFGALVDKFRADLADMAGQCVGGSAGGVPWICGDTTYFWKQKNEST 292 Query: 239 YEAIYGNYQNNVLANIIFVDFQ--QQGERGLTNAPDEDPDDLSTGYYGSAYRS-PENWTT 295 Y+ +YG+Y+N NI FV F + G TN P+EDPD GYYGS +R WT+ Sbjct: 293 YQTVYGSYKNKTEKNIHFVPFMTDENGVNVPTNKPEEDPDIPGIGYYGSKWRDSSATWTS 352 Query: 296 ALRSSHFSTAARRGIISDRFVEAILQFWRE 325 R+SHFS+ ARRGIISDR AIL + Sbjct: 353 QDRASHFSSWARRGIISDRLATAILSCAGK 382 >UniRef50_D2TQ80 Hypothetical prophage protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TQ80_CITRO Length = 683 Score = 345 bits (885), Expect = 1e-93, Method: Composition-based stats. 
Identities = 183/322 (56%), Positives = 216/322 (67%), Gaps = 5/322 (1%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 + P+YYYVL +AGQSN MAYGEGLPLPD D P PRIKQLAR + P G C +NDI Sbjct: 108 SSTEPEYYYVLPLAGQSNGMAYGEGLPLPDSFDRPEPRIKQLARRSTVTPDGTSCTYNDI 167 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 IP HC HDVQDM G +HP A + QYG VGQ LHIA+KLLP+IP NAGIL+VPCCRG Sbjct: 168 IPADHCLHDVQDMSGINHPKADLSKGQYGCVGQGLHIAKKLLPYIPQNAGILLVPCCRGA 227 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 SAFT G +G++SE GAS D+ RWG PLYQDL+SRTRAAL KNP+N+ L WMQGE Sbjct: 228 SAFTTGDDGSFSEVSGASADSSRWGAGKPLYQDLLSRTRAALEKNPKNRLLAVVWMQGEA 287 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL--NNITDAPWFCGDTTWYWKENFPHSYE 240 DL S H F MV+ FR DL +Q N PW CGDTT+YWK + YE Sbjct: 288 DL-ASGSQQHNGLFTAMVQQFRTDLSPLAAQCVSGNAGTVPWICGDTTYYWKNTYATQYE 346 Query: 241 AIYGNYQNNVLANIIFVDFQ--QQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALR 298 +YG Y+N NI FV F + G+ TNAP EDPD ++ GYYG+A R+ ++ + R Sbjct: 347 TVYGAYKNLTAQNIFFVPFLTDENGQNTPTNAPAEDPDIVAVGYYGAASRTQGSFVSTQR 406 Query: 299 SSHFSTAARRGIISDRFVEAIL 320 SHFS+ ARRGIISDR AIL Sbjct: 407 DSHFSSWARRGIISDRLSSAIL 428 >UniRef50_D1N2H8 Putative uncharacterized protein n=2 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N2H8_9BACT Length = 461 Score = 204 bits (518), Expect = 4e-51, Method: Composition-based stats. 
Identities = 54/290 (18%), Positives = 97/290 (33%), Gaps = 60/290 (20%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 + ++++ +AGQSN G + + PHPR+ R P P H++ Sbjct: 29 ENFHLILLAGQSNMAGRG---VISPSDRIPHPRVLMQNRQGEWVPAVEPVHYDKD----- 80 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + VG A +L P + ++P GGS + Sbjct: 81 ----------------------FAGVGPGRSFAIRLAASDPA-ITVGLIPAACGGSPIAS 117 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTS 187 G Y E+ + Y D V RTR A+ + QGE D S Sbjct: 118 WQPGAYHEQTQS-----------HPYDDAVRRTRRAMK---DGTLKAILFHQGEADCYGS 163 Query: 188 DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI--YGN 245 + + ++ + R++L P+ G + + +E + +++ Sbjct: 164 APNQYRERLFTLIRSLRQELGAPA--------CPFIIGQLSRFPQETWSEGKKSVDAAHR 215 Query: 246 YQNNVLANIIFVDFQQ---QGERGLTNAPDEDPDDLSTGYYGSAYRSPEN 292 L + FV +Q +R +AP + + YYG+ R Sbjct: 216 AAAAELPEVGFVSSEQLTSNPDRIHFDAPSQ--REFGRRYYGTYRRLTAP 263 >UniRef50_D2QHB3 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QHB3_9SPHI Length = 264 Score = 202 bits (512), Expect = 2e-50, Method: Composition-based stats. 
Identities = 62/319 (19%), Positives = 95/319 (29%), Gaps = 83/319 (26%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 P + + GQSN G + PH RI L + P P HF+ Sbjct: 27 PPRLKLFLLIGQSNMAGRG---IPEAEDKQPHQRIWMLTKEQTWVPARDPLHFDK----- 78 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 VG L A+KL+ I ++PC +GGS Sbjct: 79 ---------------------PAVIGVGPGLAFAQKLVNA-DKKVNIGLIPCAQGGSGID 116 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 G Y T + Y D + R + AL + G W QGE D T Sbjct: 117 VWVPGAYYA-----------ATKSYPYDDAIKRAKKALET---GELAGILWHQGESDSQT 162 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNY 246 A + + +V R DL + P+F G ++ + P + Sbjct: 163 EKAAVYGEKLTALVSRIRTDL--------QAENVPFFVGTLGDFYVQKHP-----VAAQI 209 Query: 247 QNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAA 306 + A + S T ++HF T++ Sbjct: 210 NTILEALPKTIPNM-------------------------YAVSASGLTDKGDTTHFDTSS 244 Query: 307 RRGIISDRFVEAILQFWRE 325 R + RF +A L ++ Sbjct: 245 ART-LGRRFADAYLAQSKK 262 >UniRef50_Q7UGU5 Probable acetyl xylan esterase AxeA n=2 Tax=Planctomycetaceae RepID=Q7UGU5_RHOBA Length = 298 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. 
Identities = 56/294 (19%), Positives = 101/294 (34%), Gaps = 55/294 (18%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 A + P ++ +AGQSN G+ + D + PHPR+ + P P HF+ Sbjct: 54 TAQLPPTGLHLFLLAGQSNMAGRGK---IADEDLQPHPRVLVFNKAGEWAPAIAPLHFDK 110 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + VG A + P A + ++PC G Sbjct: 111 --------------------------PRIAGVGLGRTFAIEYAENNPQ-ATVGLIPCAVG 143 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 GS+ G + E T+T Y D + R + A+ + G W QGE Sbjct: 144 GSSLDVWQPGGFHE-----------STNTHPYDDCMKRMQQAIVA---GELKGILWHQGE 189 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 D + ++ N + E FR + + P G + ++ + S + Sbjct: 190 SDSNPALSKTYQSKLNELFERFRTEFGSPN--------VPIVIGQLGQFTEKPWDESRKL 241 Query: 242 IYGNYQNN--VLANIIFVDFQQQGERG-LTNAPDEDPDDLSTGYYGSAYRSPEN 292 + ++ + N +FV G +G T+ E + Y+ + + + Sbjct: 242 VDQAHRTLPDRMTNTVFVHSDGLGHKGDQTHFSAEAYREFGHRYFLAYQQLTGS 295 >UniRef50_A3I3A6 Probable acetyl xylan esterase AxeA n=1 Tax=Algoriphagus sp. PR1 RepID=A3I3A6_9SPHI Length = 274 Score = 190 bits (481), Expect = 7e-47, Method: Composition-based stats. 
Identities = 53/325 (16%), Positives = 99/325 (30%), Gaps = 85/325 (26%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 + + +++ + GQSN G + + HPR+ L + P HF+ Sbjct: 28 SQKSEKENFHLYLLMGQSNMAGRGLVEAI---DTLSHPRVWMLDSTMNWVLARDPMHFDK 84 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 VG L + + P I ++P G Sbjct: 85 ---------------------------PVAGVGLGLTFGKIMANENPS-VKIGLIPTAVG 116 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 GS+ A D+ T T Y D++ R + AL G W QGE Sbjct: 117 GSSINAW-----------FKDSIHNQTKTFPYNDMIDRAKKAL---GDGTLKGILWHQGE 162 Query: 182 FDLMTSDY-ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 D + A++P F M+++ ++DL I P G+ ++ Sbjct: 163 SDTRNEESIANYPAKFYAMIDSLQKDLG--------IEPVPIVMGEIGHFFYGRA----- 209 Query: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSS 300 + + E+P G ++ S+ Sbjct: 210 -----------------PLAKNMNDTFSQIASENPCIDLVRSDGLNHK--------GDST 244 Query: 301 HFSTAARRGIISDRFVEAILQFWRE 325 HF + + ++ R+ E ++ ++ Sbjct: 245 HFDSNSYH-VLGMRYAEKMIDLQKD 268 >UniRef50_C6XV32 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV32_PEDHD Length = 276 Score = 187 bits (474), Expect = 5e-46, Method: Composition-based stats. 
Identities = 48/299 (16%), Positives = 75/299 (25%), Gaps = 70/299 (23%) Query: 5 ISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIP 64 + + GQSN G L + P+ + P H++ Sbjct: 40 KPGPELEIYLLLGQSNMAGRGPLLAEYTAMEQPNVLVW--DSEGKWIIARHPLHYDK--- 94 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 + VG L + P N I +VPC GG+ Sbjct: 95 -----------------------PKVAGVGPGLSFGFAMARSKP-NVRIGLVPCAVGGTN 130 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL 184 G T+T + D R R A+ G W QGE + Sbjct: 131 IDVWKPGA-----------MDKATNTHPFDDAEMRIREAMK---YGVVKGMIWHQGEANS 176 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 + + N ++ R+ + P G+ + +Y+ Sbjct: 177 GAQNMIGYLDKLNELITRIRKMVGN--------EKLPVVVGELG-----RYKTNYQQF-- 221 Query: 245 NYQNNVLANIIFVDFQQQGERGLTNAPDEDP------DDLSTGYYGSAYRSPENWTTAL 297 N +LA T+ D D S YG Y W Sbjct: 222 ---NKMLAG---APQMIPNLALATSESLVDKGDLTHFDSPSATAYGKRYAEKMLWLQQN 274 >UniRef50_A6CAJ7 Probable acetyl xylan esterase AxeA n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAJ7_9PLAN Length = 278 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. 
Identities = 50/292 (17%), Positives = 93/292 (31%), Gaps = 68/292 (23%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 + + +++ + GQSN G+ P + HPR+ +L + + P P HF+ Sbjct: 42 AELPEKEKFHIYLLIGQSNMAGRGKVDP---ASNKAHPRVLKLDKAGNWVPATDPLHFDK 98 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + VG + P+ I ++P G Sbjct: 99 --------------------------PKIAGVGPGSGFGPVIADAYPE-VTIGLIPAAVG 131 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 G+ + +G LY+ V + A + GA W QGE Sbjct: 132 GTPLSRWVKGG------------------DLYERAV---KLAKENQKKGVIKGAIWHQGE 170 Query: 182 FD-LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKEN-FPHSY 239 D Y S+ + + M+ R DL + D P+ G+ ++ P Sbjct: 171 GDSSNPKLYNSYQKRLSGMIADLRTDLGEP--------DMPFVMGELGEFFTRPGAPTVN 222 Query: 240 EAIYGNYQNN---VLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYR 288 +A++G + +A+ + + NA E + Y + Sbjct: 223 QALHGIAKEVPATAVASSKGLPAKSDQV--HFNAESE--REFGKRYAAQMLK 270 >UniRef50_B5JF90 Conserved domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JF90_9BACT Length = 265 Score = 178 bits (450), Expect = 3e-43, Method: Composition-based stats. 
Identities = 53/290 (18%), Positives = 94/290 (32%), Gaps = 59/290 (20%) Query: 5 ISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIP 64 S D ++++ +AGQSN G+ + +P++ L + P H++ + Sbjct: 30 PSQDSFHLILLAGQSNMAGRGD---MEGPRVESNPQVLALDKEGRWVVAKDPLHWDKSV- 85 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 VG L AR+ L P I ++P GGS Sbjct: 86 --------------------------AGVGLGLSFAREYLKDHP-GVTIGLIPAACGGSP 118 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL 184 ++ G Y ++ TD+ Y D + R A G W QGE D Sbjct: 119 ISSWEAGAYFDQ-----------TDSHPYDDALKRVSRATQ---DGTLKGVLWHQGESDS 164 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 + +++ FR + + D P G + + H E Sbjct: 165 HEGLSDLYEAKLEGLIKRFRVEWDR--------EDLPVILGQLGQFEVKWGKHIEEVNRA 216 Query: 245 NYQ-NNVLANIIFVD---FQQQGERGLTNAPDEDPDDLSTGYYGSAYRSP 290 + L ++ FV + +G+ ++ + YYG R+ Sbjct: 217 TKRVAKRLEHVGFVSSKNLESKGDALHFSSAA--LQEFGKRYYGRFKRAS 264 >UniRef50_A9GML8 Iduronate-2-sulfatase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GML8_SORC5 Length = 453 Score = 174 bits (441), Expect = 3e-42, Method: Composition-based stats. 
Identities = 72/345 (20%), Positives = 109/345 (31%), Gaps = 68/345 (19%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 ++A V +AGQSN + YG G LP P + GGP + Sbjct: 15 LSAPAIAQPVKVFVLAGQSNMVGYGVGRQLPVEL-QSQPDVWYDHYNPDAREGGP---YA 70 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 P + + + G + R + P++ I IV + Sbjct: 71 AATSADWGPLEP--------------KGEARRYGPEITFGRAIAAAYPEH-RIAIVKMAQ 115 Query: 121 GGSAF-TAGSEGT------------YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKN 167 GG+ G Y G A G Y + V+R ALA+ Sbjct: 116 GGTNLVDHWGRGLAPDPEVLYKSQLYHALLGKLDSATYEGDRALRYPEEVTRLDGALARL 175 Query: 168 PQ----NKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWF 223 + WMQGE + S S+ + A R DL P Sbjct: 176 ESEGHPYEIAALVWMQGENEAGWSAAFSYGNTLRGFIAAIRADLGVP--------GLPVV 227 Query: 224 CGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYY 283 G + +P + I ANI V Q +EDP Sbjct: 228 LGRVSD---NLYPANGGPIAAG----KEANIDAVRAAQ------VTVAEEDPRV------ 268 Query: 284 GSAYRSPENWT--TALRSSHFSTAARRGIISDRFVEAILQFWRER 326 A+ +++T + + HF +AA + ++ +RF EA L RE Sbjct: 269 --AWVDTDDFTVRSPDDAYHFDSAAYQ-LLGERFAEAYLALVREE 310 >UniRef50_B2UM46 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UM46_AKKM8 Length = 303 Score = 174 bits (440), Expect = 4e-42, Method: Composition-based stats. 
Identities = 50/334 (14%), Positives = 95/334 (28%), Gaps = 70/334 (20%) Query: 6 SPDYYY-----VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 + V+ + GQSNA G +P RI +++ Sbjct: 26 PAPPLHADEVNVILIGGQSNATGQGYVNNIPPCF-KTDKRIL--------------LYYS 70 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 + T + PL+ ++ G L + L P +I Sbjct: 71 GSLKGTEPAEQLV-------PLSPASESP-DRFGVELSLGTALQKKFPQKKWAIIKHARS 122 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQ----NKFLGAC 176 G + F + G S+ Y L+ R + + Sbjct: 123 GSNLFRQWNPGKTSQDKQGEE-----------YVKLLRTVRNGMEALKKQGHAPVLKAMV 171 Query: 177 WMQGEFD----LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWK 232 W QGE D + S+ + N++++ R DL+ + P Sbjct: 172 WQQGEGDARDIAGIKNALSYGANLNNLIKRIRADLEAPGLAFIYGSVLPVPA-------L 224 Query: 233 ENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPEN 292 FP E + ++ + + P +D S + R+P Sbjct: 225 ARFPGR-EKVRQGQKDVAEESRTSL-----SVNNAVYVPADDLQLRSMDF-----RTPYP 273 Query: 293 WTTALRSSHFSTAARRGIISDRFVEAILQFWRER 326 T +H ++ +RF A+ + W ++ Sbjct: 274 TDTVHLGTHGVL-----VLGERFASALEKLWGQK 302 >UniRef50_B9MVW2 Predicted protein n=11 Tax=Magnoliophyta RepID=B9MVW2_POPTR Length = 297 Score = 171 bits (432), Expect = 4e-41, Method: Composition-based stats. 
Identities = 50/319 (15%), Positives = 92/319 (28%), Gaps = 71/319 (22%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLP-----------LPDREDAPHPRIKQLARFAH 49 ++ + + + +AGQSN G + + + P+P I +L+ Sbjct: 17 ISEQLPQN---IFILAGQSNMAGRGGVVNNTKNGIPSWDGIVPVQCQPNPSILRLSASLT 73 Query: 50 THPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPD 109 P H A + VG + A +L +P+ Sbjct: 74 WVQAHEPLH------------------------ADIDYNKTNGVGPGMSFANAILTKVPN 109 Query: 110 NAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQ 169 I +VPC GG++ + ++G + LY LV RT+ AL + Sbjct: 110 FGSIGLVPCAIGGTSISEWAKGGF------------------LYDQLVRRTQFALQRG-- 149 Query: 170 NKFLGACWMQGEFDLM-TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT 228 W QGE D D ++ + R DL + + G+ Sbjct: 150 GVIGAMLWYQGESDTQIREDADAYKGRLDRFFIDLRADLGYPTLPIIQVA---LASGEGP 206 Query: 229 WYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGY-----Y 283 + + N N + + + T A + L+ + Sbjct: 207 YVEIVRNA----QLGINLPNVQCVDAKGLPLEPDRVHLTTPAQVQLGQTLTDAFLQSLSS 262 Query: 284 GSAYRSPENWTTALRSSHF 302 + + HF Sbjct: 263 PIHIANNSCRRFSNLMFHF 281 >UniRef50_Q9F106 Carbohydrate binding family 6 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=Q9F106_FIBSS Length = 539 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. 
Identities = 50/324 (15%), Positives = 88/324 (27%), Gaps = 69/324 (21%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 +++ GQSN G D + HPR+K A Sbjct: 28 ANAAPDPNFHIYIAYGQSNME--GNARNFTDVDKKEHPRVKMFAT--------------- 70 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 T CP + G +P + A R + +P N I I+P +G Sbjct: 71 ----TSCPSLGRPTVGEMYPAVPPMFKCGEGLSVADWFGRHMADSLP-NVTIGIIPVAQG 125 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQ-DLVSRT-RAALAKNPQNKFLGACWMQ 179 G++ Y ++ + G + + R A + G + Q Sbjct: 126 GTSIRLFDPDDYKNYLNSAESWLKNGAKAYGDDGNAMGRIIEVAKKAQEKGVIKGIIFHQ 185 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 GE D S+ Q E + L N + P+ G+ Sbjct: 186 GETDGGMSN---WEQIVKKTYEYMLKQLGL------NAEETPFVAGEMVD---------- 226 Query: 240 EAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRS 299 G + + + ++ YG + Sbjct: 227 ----GGSCAGFSSRVRGLSKYIANFGVASSKG-----------YG----------SKGDG 261 Query: 300 SHFSTAARRGIISDRFVEAILQFW 323 HF+ RG + R+ + +L+ Sbjct: 262 LHFTVEGYRG-MGLRYAQQMLKLI 284 >UniRef50_B9XLT7 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XLT7_9BACT Length = 266 Score = 169 bits (426), Expect = 2e-40, Method: Composition-based stats. 
Identities = 43/228 (18%), Positives = 67/228 (29%), Gaps = 60/228 (26%) Query: 10 YYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 + + + GQSN G+ + HPR+ L P + Sbjct: 34 FQIYLLMGQSNMAGRGKVGL---EDTTTHPRVLLLNTNNTWELAMEPVTKD--------- 81 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + VG L + + N I +VPC GG+ + Sbjct: 82 -----------------RKAGRGVGPGLAFGKSMAEKN-SNVTIGLVPCAVGGTPLSRWQ 123 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD- 188 G LY + V+R + A+ G W QGE D Sbjct: 124 RGG------------------DLYSNAVARAKVAVK---DGALAGVLWHQGENDSSDKGL 162 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 S+ + + M+ FR D+ Q T+ P G + E P Sbjct: 163 AESYGKRLSEMIHDFRTDVGQ--------TNLPVVVGQIGEFLYERGP 202 >UniRef50_A9RQK4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RQK4_PHYPA Length = 263 Score = 169 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 53/273 (19%), Positives = 82/273 (30%), Gaps = 39/273 (14%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 + + ++GQSN G G+ +D R P I+ L Sbjct: 6 KGFEIFILSGQSNMSGRG-GMQTIVAKDGSTSRKW-----DGIVPAECAAEPGSILRLNK 59 Query: 68 CPHDVQDMQGYHHPLATNHQTQYG-TVGQALHIARKLLPF-----IPDNAGILIVPCCRG 121 + + H P + T VG L A LL P I +VPC G Sbjct: 60 NL----EWEEAHEPTHIDIDTSKACGVGPGLVFAASLLRARKYKVKPTGPQIGLVPCAIG 115 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 G++ +G LY ++ RT+AAL K W QGE Sbjct: 116 GTSIVQWEKGRV------------------LYNHMIQRTKAALEKG--GTLKALLWYQGE 155 Query: 182 FDL-MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 D S + Q R DL ++ + + W Y + Sbjct: 156 SDAVEKSLADHYEQRLVTFFNHVRTDLNNHNLPIIQVA-INWPAAPHPEY-VNKVRSAQR 213 Query: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE 273 A + ++ L + + + T A E Sbjct: 214 AALDHVKHLHLVDALGLPLLSDHIHLTTEAQTE 246 >UniRef50_Q01TY8 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01TY8_SOLUE Length = 252 Score = 168 bits 
(425), Expect = 2e-40, Method: Composition-based stats. Identities = 61/314 (19%), Positives = 95/314 (30%), Gaps = 87/314 (27%) Query: 10 YYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 + + + GQSN G + +++ P PR+ L + P P HF+ Sbjct: 19 HEIFLLIGQSNMAGRG---VVEEQDRQPIPRVFMLNKAMEWVPAIDPVHFDKPDIA---- 71 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 VG A + L P NA I +VP GG++ Sbjct: 72 ----------------------GVGLARTFGKVLAAADP-NASIGLVPAAFGGTSLEEWK 108 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM-TSD 188 G LY++ V R + A++ K G W QGE D Sbjct: 109 VGGK------------------LYEEAVRRAKFAMS---SGKLRGILWHQGEADAGKKEL 147 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQN 248 +S+ Q F+ M+ R DL + D P G + E+ + + Sbjct: 148 ASSYRQRFSAMITQLRADLGEP--------DVPVVVGQLGEFLSESATPR-----SPFAS 194 Query: 249 NVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARR 308 V + V SA+ S T+ HF ++R Sbjct: 195 VVDEQLATVPLTVPH---------------------SAFVSSNGLTSNADHLHFDARSQR 233 Query: 309 GIISDRFVEAILQF 322 R+ A L Sbjct: 234 E-FGRRYALAFLSI 246 >UniRef50_UPI00017448C4 hypothetical protein VspiD_04945 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448C4 Length = 650 Score = 166 bits (419), Expect = 1e-39, Method: Composition-based stats. 
Identities = 49/296 (16%), Positives = 84/296 (28%), Gaps = 70/296 (23%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 ++ + + + + GQSN G LP + R+ + + PG P H + Sbjct: 410 SMPEKETFDLYLLIGQSNMAGRG---LLPLEDRLSRERVLKFSARNAWAPGVEPLHTDK- 465 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 G + AR++ P I ++PC GG Sbjct: 466 -------------------------PAVAGAGLGMSFARQMAEAKP-KVTIGLIPCAVGG 499 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 + +G LY + R R A+ G W QGE Sbjct: 500 TPLDRWVKGG------------------DLYAAALVRAREAMK---SGNLKGILWHQGEA 538 Query: 183 DLM-TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 D S+ Q MV+ R DL D P+ G+ + + + Sbjct: 539 DSGSEEKAGSYAQRLAGMVKDLRADLG--------AGDVPFVAGELGEFLERTNKEGRPS 590 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDP------DDLSTGYYGSAYRSPE 291 + V + + + +A + D S +G Y + Sbjct: 591 FWP----VVNEQLATLPGLVPNADVVDSAGLKHKGDGVHFDTPSLREFGVRYATAM 642 >UniRef50_C9RS24 Carbohydrate binding family 6 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RS24_FIBSS Length = 552 Score = 165 bits (418), Expect = 1e-39, Method: Composition-based stats. 
Identities = 42/300 (14%), Positives = 76/300 (25%), Gaps = 25/300 (8%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 +++ GQSN G+ +P D+ +AP I N Sbjct: 27 ANAAPNPNFHIYIAYGQSNMAGNGDIVPSEDQAEAPKNFIML-------------ASHNA 73 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + G +P + + A + R + +P + I+P G Sbjct: 74 NASQRSGKTNQSIKTGEWYPAIPPMFHPFENLSPADYFGRAMADSLP-GVTVGIIPVAIG 132 Query: 122 GSAFTAGSEGTYSER-HGASHDACRWGTDTPLYQDLVSRT-RAALAKNPQNKFLGACWMQ 179 + A + Y G D WG + R A G + Q Sbjct: 133 AVSIRAFDKDQYEAYFRGDGKDIMNWGWPKDYDNNPPGRILELAKKAKEVGVIKGFIFHQ 192 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 GE D A+ + + L + + P+ G+ + Sbjct: 193 GESDG---TDANWRKTVYKTYKDVIDALGL------DENEVPFVAGELLQEGQNCCSSKN 243 Query: 240 EAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRS 299 I QN + Q + + +L Y + + Sbjct: 244 GGIAQLKQNFKKFGLASSKGLQGNGKDPYHFGRAGVIELGKRYCSEMLKLIDKTIDPDAP 303 >UniRef50_C0ABN2 Putative uncharacterized protein n=2 Tax=Opitutaceae bacterium TAV2 RepID=C0ABN2_9BACT Length = 301 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. 
Identities = 51/313 (16%), Positives = 88/313 (28%), Gaps = 77/313 (24%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 + + + + + GQSN G P + P R+ L + G P HF+ Sbjct: 42 VATPPKAENFDLYLLVGQSNMSGRGRVTP---ADSQPDTRVLVLGKDGEWLLQGEPVHFD 98 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 T+ VG A+++ P I ++PC Sbjct: 99 ---------------------------TRNAAVGLGFAFAKRMADHSP-GVTIGLIPCAV 130 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 G + G LY++ V R A + G W QG Sbjct: 131 GATPQKRWMPGG------------------DLYEEAVRRAGIAQQ---SGRLRGILWHQG 169 Query: 181 EFDLMT-SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 E + + ++ ++ +VE FRRDL P+ G+ + Sbjct: 170 ESETGSLVRSKAYGENLAKIVEGFRRDLNAP--------GVPFVAGELGEFLYMKSEER- 220 Query: 240 EAIYGNYQNNVLANIIFVDFQQQGERGLTNAP-----------DEDPDDLSTGYYGSAYR 288 V I + + +A E + YY + Sbjct: 221 ----AANAKIVNEQINRLPALVPNTAVIPSAGLGHRGDGTHFNAEAQREFGRRYYEAMVA 276 Query: 289 SPENWTTALRSSH 301 + TT + +H Sbjct: 277 LRKKTTTTQKVAH 289 >UniRef50_B0RC94 Putative surface-anchored protein n=1 Tax=Clavibacter michiganensis subsp. sepedonicus RepID=B0RC94_CLAMS Length = 654 Score = 164 bits (414), Expect = 4e-39, Method: Composition-based stats. 
Identities = 52/301 (17%), Positives = 90/301 (29%), Gaps = 32/301 (10%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 + + V+ + GQSNA G G D + QL + Sbjct: 375 VAGAQDGVGHDVVAILGQSNAQGGGFGYD--PAIDVAQDGLDQLV--GDWQDK----DWG 426 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 ++P V + P VG + R LL +L+VP + Sbjct: 427 RVVPAEDSLKHVTTWRMTDRPKL---------VGPGMTFGRALLADSAPGRRVLLVPAAQ 477 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 G ++ T + T LY + ++ ALA +P N+ + W QG Sbjct: 478 GSTSLTRVDAVQKFTWDPSPEQGSVEAGLTNLYANATTQIDNALALDPDNRLVAIIWAQG 537 Query: 181 EFDLMT-SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 E D S + V+ R L+ P+ G W + Sbjct: 538 ESDANAISSAPTAAGRVAAKVKYADRLLELESGLAVRYGPVPFLVGGMVPEWIGSDAARQ 597 Query: 240 --EAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE-------DPDDLSTGYYGSAYRSP 290 +A++ ++ + +V G G N + + G+Y + R Sbjct: 598 DIDAVHQGLRSLRKE-VAYVP----GVSGHANEGEAFIHYDAVGARMMGAGFYAAYLRQT 652 Query: 291 E 291 Sbjct: 653 G 653 >UniRef50_Q11TG0 CHU large protein; candidate polyfunctional acetylxylan esterase/b-xylosidase/a-L-arabinofuranosidase, CBM9 module, Glycoside Hydrolase Family 43 protein and Carbohydrate Esterase Family 6 protein n=11 Tax=Bacteroidetes RepID=Q11TG0_CYTH3 Length = 1585 Score = 161 bits (406), Expect = 4e-38, Method: Composition-based stats. 
Identities = 44/325 (13%), Positives = 85/325 (26%), Gaps = 69/325 (21%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 + + +++ GQSN G + A + R + + Sbjct: 20 IQSYAQDPNFHIYLTFGQSNMEGNGVIEA--QDQTAVNSRFQVMGAVN------------ 65 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 C G +G + R ++ +P N + +VP Sbjct: 66 -------CTGTKSYTTGKWTTATAPIVRCNTGLGPLDYFGRTMVSNLPANIKVGVVPVAI 118 Query: 121 GGSAFTAGSE---GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACW 177 GG + G+Y + Y LV + A G + Sbjct: 119 GGCDIALFDKVNYGSYVATAPSWMIGTINQYGGNPYARLVEVAKLA---QKDGVIKGILF 175 Query: 178 MQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPH 237 QGE + D P + + +DL ++ P+ G+ + Sbjct: 176 HQGETNNGQQD---WPAKVKAIYDNLIKDLGLDPAK------TPFLAGELVTTAQGGACG 226 Query: 238 SYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTAL 297 + +I N + N V +G Sbjct: 227 GHNSIIAKLPNVI-PNAHVVSAAGLPHKG------------------------------- 254 Query: 298 RSSHFSTAARRGIISDRFVEAILQF 322 + HF+ A+ R +R+ + +L Sbjct: 255 DNLHFTPASYRT-FGERYAQLMLTL 278 >UniRef50_C0ACS7 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0ACS7_9BACT Length = 520 Score = 160 bits (404), Expect = 6e-38, Method: Composition-based stats. 
Identities = 47/279 (16%), Positives = 78/279 (27%), Gaps = 37/279 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +AGQSN G L PHP I+ + P H +P Sbjct: 138 DVWLLAGQSNMEGGG---LLAASVARPHPFIRAFSLARVWRQAADPLH----VPWESQEA 190 Query: 71 DVQDMQGYHHP-LATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + D + + +T G LH R++L + ++ RG + Sbjct: 191 ALNDGKPFTREQAEDYRRTSRVGAGVGLHFGREML--LRSGVPQGLICAARGATRMEQWL 248 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 + LY ++ RA G W QGE D Sbjct: 249 PARGRD------------GGAGLYGAMLRSVRATGQP-----VAGVLWHQGEGDSPRERA 291 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNN 249 A + Q ++ A RRDL PW + E ++ ++ + Sbjct: 292 ALYSQRMRKLIAAVRRDLGLPR--------LPWIFAQLARVYGERPDCAWNSVQEQQRAL 343 Query: 250 --VLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSA 286 + ++ V + E +A Sbjct: 344 ADRIHDVALVATVDLSLDDFIHLSAEAHPRFGARLARAA 382 >UniRef50_C9RND1 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RND1_FIBSS Length = 341 Score = 159 bits (401), Expect = 1e-37, Method: Composition-based stats. 
Identities = 47/330 (14%), Positives = 90/330 (27%), Gaps = 71/330 (21%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 + + ++ GQSN + D + +PR L Sbjct: 14 VASFAQDPNLHIYLAYGQSNMSGQA---TITDTDRQTNPRFLVL---------------- 54 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 + G +P A VG RK++ +PD+ + + Sbjct: 55 ------RAGNHSNQKVGEFYPAAPPMGHSGSKVGIVDFFGRKMIKELPDSITVAVANVAI 108 Query: 121 GGSAFTAGSEG--TYSERHGASHDACRWGTDTPLY-QDLVSR-TRAALAKNPQNKFLGAC 176 GG + + ++ + + W Y D+ R + G Sbjct: 109 GGQSIDLFDKDRNAAYVQNAKNKNDTWWIQYLNEYGGDVHKRIVEMGKIAKQKGVIKGFL 168 Query: 177 WMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 + QGE D P+ + + F +L+ + + GD W Sbjct: 169 FHQGEADYQM---KDWPERVKKVYDQFIEELELDPEKTPILLGELAPTGDLGW------- 218 Query: 237 HSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 +A E D + GY SA P Sbjct: 219 ------------------------------RNDAVKEAADLIPNGYVISAQGCPAI-KEP 247 Query: 297 LRSSHFSTAARRGIISDRFVEAILQFWRER 326 + HF+ + + +R+ E +L+ + + Sbjct: 248 NYTLHFTRDGYQT-LGERYAEKMLELLKAQ 276 >UniRef50_B8I0M1 Carbohydrate binding family 6 n=6 Tax=Clostridium RepID=B8I0M1_CLOCE Length = 780 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. 
Identities = 37/229 (16%), Positives = 66/229 (28%), Gaps = 54/229 (23%) Query: 3 AIISP--DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 P ++ + GQSN Y + PR+ L + G ++ Sbjct: 540 GTTEPTTPKFHCFLLLGQSNMAGYAAAQA---SDKVEDPRVLVLGYDNNAALGRVTDKWD 596 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 P H VG + ++ +P I ++PC Sbjct: 597 VACPPLHASWLDA-------------------VGPGDWFGKTMIQKVPSGDTIGLIPCAI 637 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 G + + Y +++R + A K G + QG Sbjct: 638 SGEKIETFMK-----------------SGGTKYNWIINRAKLAQEKG--GVIDGIIFHQG 678 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 E + S P +VE R+DL N+ + P+ G+ + Sbjct: 679 ESNSGDP---SWPGKVKTLVEDLRKDL--------NLGNVPFIAGELLY 716 >UniRef50_C7NVN3 Carbohydrate-binding family V/XII n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NVN3_HALUD Length = 523 Score = 157 bits (397), Expect = 4e-37, Method: Composition-based stats. Identities = 53/320 (16%), Positives = 96/320 (30%), Gaps = 75/320 (23%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 P + + GQSN G P+ ++ HPRI LA Sbjct: 61 TTATDPSNLDLYLLFGQSNMEGQG---PIEAQDRETHPRIHVLADKT------------- 104 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 CP+ ++ G + YG +G + A+ ++ +PD+ I +VP Sbjct: 105 ------CPNLDRE-YGEWYLAEPPLNRCYGKLGPGDYFAKSMIEEMPDDRSIGLVPAAVS 157 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 G+ +G R+ + G Y+ +V A F G + QGE Sbjct: 158 GADIALFEKGAPIGRNDRDIPSQFDGG----YEWMVDLAETAQQ---VGTFRGILFHQGE 210 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 + +VE R DL I + P+ G+ + Sbjct: 211 TNTND---QQWTDQVQGIVEDLRADLG--------IGNVPFLAGEMLY-----------D 248 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 G + + + + ++ D +H Sbjct: 249 SAGGCCGSHNTEVNELPDVIENAHVVSAEGLAGQDY----------------------AH 286 Query: 302 FSTAARRGIISDRFVEAILQ 321 F++ A R + R+ +L+ Sbjct: 287 
FTSEAYRE-LGRRYAAEMLE 305 >UniRef50_Q11VQ5 CHU large protein; candidate bifunctional acetylxylan esterase/xylanase, CBM4 module, Glycoside Hydrolase Family 10 protein and Carbohydrate Esterase Family 6 protein n=3 Tax=Bacteria RepID=Q11VQ5_CYTH3 Length = 1414 Score = 157 bits (397), Expect = 4e-37, Method: Composition-based stats. Identities = 42/328 (12%), Positives = 82/328 (25%), Gaps = 76/328 (23%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 A +++ GQSN G P+ ++ R + + Sbjct: 21 QAFSQDPNFHIYLCFGQSNMEGQG---PIEAQDQTVDSRFRVMQA--------------- 62 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 Q +G + + R+++ +P N + IV Sbjct: 63 -------VSCTGQPQNAWRTATPPIARCNTKIGPSDYFGREMVKNLPANIKVGIVHVSVA 115 Query: 122 GSAFTAGSEGTYSERHGASHDA------CRWGTDTPLYQDLVSRTRAALAKNPQNKFLGA 175 G + Y+ + A Y LV + A G Sbjct: 116 GCKIELFDKTNYNTYLNSLSSADQYIKTTAGQYGGNPYGRLVELAKLA---QKDGVIKGI 172 Query: 176 CWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENF 235 QGE + + N++ + DL + P G Sbjct: 173 LLHQGESNTGD---QAWATKVNNVYKNLLADLGL------TAANVPLLAGQVVD------ 217 Query: 236 PHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTT 295 A G ++ I + ++++ D Sbjct: 218 -----AAQGGLCASMNTTINALPNTIPTAHVISSSGCTD--------------------- 251 Query: 296 ALRSSHFSTAARRGIISDRFVEAILQFW 323 + HF+TA R ++ R+ + +L Sbjct: 252 QSDNLHFTTAGYR-LLGTRYAQTMLSLL 278 >UniRef50_B7AIJ5 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIJ5_9BACE Length = 1019 Score = 157 bits (397), Expect = 4e-37, Method: Composition-based stats. 
Identities = 43/325 (13%), Positives = 82/325 (25%), Gaps = 69/325 (21%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 +++ GQSN +P +D +PR + +A G Sbjct: 26 ESKPDPNFFIYLCIGQSNME--AGAVPAEQDKDFNNPRFQFMAAVDMPKLGRE------- 76 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 G + + +G RK++ +P + ++ G Sbjct: 77 -------------MGKWYTAIPPICREGNNLGPVDFFGRKMIDILPSEYHVGVINVSVAG 123 Query: 123 SAFTAGSEGTYSERHGASHDACRWGT---DTPLYQDLVSRTRAALAKNPQNKFLGACWMQ 179 + Y + D + Y+ LV+ R A G Q Sbjct: 124 AKIQLWDREDYKDYIDNERDWMKNIVSQYGGNPYERLVNMARLA---QKDGVIKGILMHQ 180 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 GE + P+ + + +DL Q P G+ + Sbjct: 181 GESNSEDPL---WPERVKKIYDNLCKDLNLNPKQ------TPLLAGEL------KYAEQG 225 Query: 240 EAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRS 299 + + + ++ E + Sbjct: 226 GVCAAFNSSIMPK----LPKVLPNAHIISALGCE---------------------STGDQ 260 Query: 300 SHFSTAARRGIISDRFVEAILQFWR 324 HFST R ++ RF + +LQ Sbjct: 261 FHFSTEGMR-LLGYRFADKMLQLQG 284 >UniRef50_A6CD12 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A6CD12_9PLAN Length = 331 Score = 157 bits (395), Expect = 6e-37, Method: Composition-based stats. 
Identities = 53/328 (16%), Positives = 96/328 (29%), Gaps = 54/328 (16%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 Y V + GQSN YG LPD P + + P P + Sbjct: 47 AKTYQVYFLGGQSNMDGYGYAKDLPDDLKQSVPGVMIFHANS--APDAVP------VDGR 98 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 +++ G T G L A+ L P+ A I ++ RGG++ Sbjct: 99 GLWSELKPGHGVGFKSDGKENTYSNRFGVELSFAKTLQQLAPE-ANIALIKISRGGTSIA 157 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK--------FLGACWM 178 + G + G Y ++ + AL ++ G WM Sbjct: 158 VEAAGNFGCWDPDFEKGTGKGQGINQYDHFLAGMKRALQTTDIDQDGEADTLIPAGIVWM 217 Query: 179 QGEFDL--MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 QGE D + + +++ R L D P G + Sbjct: 218 QGESDAAYTEEIAKDYEANLKRLMDLIRATL--------YADDLPVVIGRISDSGDNP-- 267 Query: 237 HSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 ++ + + A FV ++D + S + + + Sbjct: 268 --EGKVWKHGEIVRAAQAAFV--------------EKDKR-------AALVTSTDEYGYS 304 Query: 297 LRSSHFSTAARRGIISDRFVEAILQFWR 324 R H+++ + F +A+ + R Sbjct: 305 DR-WHYNSEGYLD-LGRNFAQALWKVPR 330 >UniRef50_C9RLV5 Carbohydrate binding family 6 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLV5_FIBSS Length = 524 Score = 157 bits (395), Expect = 7e-37, Method: Composition-based stats. 
Identities = 36/228 (15%), Positives = 68/228 (29%), Gaps = 34/228 (14%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 + +++ GQSN + + + R K A Sbjct: 28 SEAAPDPNFHIYIAYGQSNMGGTADAQ---SADKVENSRFKIFATQK------------- 71 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 C ++ G +P + T+ A R + +P N I I+P G Sbjct: 72 ------CSGKGRNTLGDVYPAVPSLFNCGNTISVADWFGRTMADSMP-NVTIGIIPVAVG 124 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRT--RAALAKNPQNKFLGACWMQ 179 G++ + Y + + V++T A + G + Q Sbjct: 125 GASIKLFDQDQYKTYLSTAETWLQNYAKEYASDGNVTKTIIDIAKKAQEKGVIKGFIFHQ 184 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 GE D SD+ +V+ R D+ + + P+ G+ Sbjct: 185 GETDGGYSDWP-------KIVKKTRDDI--LKALDMSSDTVPFVAGEL 223 >UniRef50_Q8L9J9 Probable carbohydrate esterase At4g34215 n=10 Tax=Magnoliophyta RepID=CAES_ARATH Length = 260 Score = 156 bits (394), Expect = 8e-37, Method: Composition-based stats. Identities = 47/286 (16%), Positives = 86/286 (30%), Gaps = 67/286 (23%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEG-----------LPLPDREDAPHPRIKQLARFAH 49 + + I P+ + ++GQSN G + E AP+ I +L+ Sbjct: 15 IQSPIPPN--QIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72 Query: 50 THPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIP- 108 P H + + VG + A + + Sbjct: 73 WEEAHEPLHVD------------------------IDTGKVCGVGPGMAFANAVKNRLET 108 Query: 109 DNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNP 168 D+A I +VPC GG+A G + LY+ +V RT ++ Sbjct: 109 DSAVIGLVPCASGGTAIKEWERG------------------SHLYERMVKRTEE--SRKC 148 Query: 169 QNKFLGACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 + W QGE D+ D S+ + + +++ R DL + + Sbjct: 149 GGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVA----IASGG 204 Query: 228 TWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE 273 + K + N V + + + T A + Sbjct: 205 GYIDKVREA----QLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQVQ 246 >UniRef50_C5BRL9 Acetylxylan esterase / xylanase n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BRL9_TERTT Length = 
952 Score = 156 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 46/325 (14%), Positives = 92/325 (28%), Gaps = 76/325 (23%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 +++ + GQSN G+ + ++ + + + GG Sbjct: 38 PDPNFHIYLMFGQSNMEGQGQ---ISSQDQQVPTGLLAMQADNNCTVGGAS--------- 85 Query: 66 THCPHDVQDMQGYHHPLATNHQTQY----GTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + + PL + T + G +G + R +L + +V Sbjct: 86 ------YGEWRTATPPLIRCYNTAHAWNNGGLGPGDYFGRTMLENSGAGVRVGLVGAAYQ 139 Query: 122 GSAFTAGSEGTYSERHGASHDACRWG---TDTPLYQDLVSRTRAALAKNPQNKFLGACWM 178 G + + G+ + G Y ++ R A G + Sbjct: 140 GQSINFFRKN--CAALGSCQPSGANGSVPGGAGGYAWMLDLARKAQE---DGVIKGIIFH 194 Query: 179 QGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS 238 QGE D ++ N +V R DL + ++ P+ G+ Sbjct: 195 QGESDTG---SSTWSSRVNEVVTDLRTDLGL------SASEVPFIAGEMVP--------- 236 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALR 298 G + A + + ++ G+Y SA + Sbjct: 237 -----GACCTSHDARVHEIPS-----------------VVANGHYVSAAG-----LGSRD 269 Query: 299 SSHFSTAARRGIISDRFVEAILQFW 323 HF+ A R I R+ +L+ Sbjct: 270 QYHFNAAGYREI-GRRYANKMLELI 293 >UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA8_RHOBA Length = 745 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. 
Identities = 53/321 (16%), Positives = 91/321 (28%), Gaps = 77/321 (23%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 D++ V +AGQSN G+ L + + G F +P Sbjct: 41 ADHHDVYLLAGQSNMDGRGQVSDLSEEQKQS---------------TGDAIIFYRSVPRE 85 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 G+ P T G + AR + P N + ++ +GG++ Sbjct: 86 SDGWQTLA-PGFSVPPKYKGDLPSPTFGPEIGFARSMSNANP-NQKLALIKGSKGGTSLR 143 Query: 127 A-GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALA----KNPQNKFLGACWMQGE 181 A G + + P Y+D + R A + Q G W QGE Sbjct: 144 ADWKPGVQGDPK----------SQGPRYRDFIETIRMATKQLSDRGDQFTIRGLLWHQGE 193 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 D +S + + ++ R D+ D P G+ K Sbjct: 194 SDSKSST-ERYRRRLEELIVRIREDVGVP--------DLPVVVGEVFDNGKR-------- 236 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 R A + S E TT +H Sbjct: 237 --------------------DNVRTAIQAVAAASSTVG-------LVSSEGTTTWDPGTH 269 Query: 302 FSTAARRGIISDRFVEAILQF 322 F + + ++ +R+ A+ + Sbjct: 270 FDARS-QLLLGERYAVAMSEL 289 >UniRef50_C3QEP9 Glycoside hydrolase family 43 n=6 Tax=root RepID=C3QEP9_9BACE Length = 638 Score = 153 bits (386), Expect = 8e-36, Method: Composition-based stats. 
Identities = 53/328 (16%), Positives = 84/328 (25%), Gaps = 68/328 (20%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 + A +++ GQSN + + R +A G Sbjct: 15 IFAFAQDPNFHIYLCLGQSNMEGNAKIEA--QDTCNVNERFLMMAAVDCPSLGR------ 66 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 ++G + + + A + R L+ +PDN + ++ Sbjct: 67 --------------VKGQWYKAVPPLVRCHTGLTPADYFGRTLVERLPDNIKVGVINVAV 112 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRA-ALAKNPQNKFLGACWMQ 179 GG E E H AS T + R + A+ G Q Sbjct: 113 GGCRIELFDEEN-CEEHIASQPEWLKNTAKAYGNNPYRRLKELAVEAQKAGVIKGILLHQ 171 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 GE + PQ + E RDL D P G+ Sbjct: 172 GESNTGD---KEWPQKVKRVYENLLRDLNL------QAKDVPLLAGEV------------ 210 Query: 240 EAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDED-PDDLSTGYYGSAYRSPENWTTALR 298 V Q G N P + T Y + P A Sbjct: 211 -----------------VHADQNGRCASMNEIINTLPQVILTAYVIPSSGCPA----AED 249 Query: 299 SSHFSTAARRGIISDRFVEAILQFWRER 326 + HF+ R + R+ E L + Sbjct: 250 NLHFTAEGYRK-LGVRYAEKRLLLLEKE 276 >UniRef50_C9RKV3 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RKV3_FIBSS Length = 409 Score = 153 bits (386), Expect = 8e-36, Method: Composition-based stats. 
Identities = 42/329 (12%), Positives = 86/329 (26%), Gaps = 70/329 (21%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 M +++ GQSN G+ ++ R + L + G Sbjct: 18 MANAAPDPNFHIYLAFGQSNMEGQGDVG---SQDKTVDERFQVLWAANNGFCSGKT---- 70 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 PLA Q +G + R ++ + ++ Sbjct: 71 -----------KGKWATAVPPLAHC---QGAKLGPTDYFGRTMVEKTDSKIKVGVIVVAV 116 Query: 121 GGSAFTAGSEGTYSERHGASHDACR---WGTDTPLYQDLVSRTRAALAKNPQNKFLGACW 177 G + + Y+ + Y L+ + A G + Sbjct: 117 AGCSIQLFDKDGYANYARSQQSWMTQRINEYGGNPYGRLIEMAKKAQE---DGVIKGIIF 173 Query: 178 MQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPH 237 QGE D P + + +DL + D P+ G+ + Sbjct: 174 HQGETDAGD---GQWPSKVKKVYDNIIKDLGLGN-------DVPFLAGEVLRSGSSKGAN 223 Query: 238 SYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTAL 297 + NI + QQ + + + L G Sbjct: 224 N--------------NIAKLP--QQSKNFYVVSSEGFNQALGDG---------------- 251 Query: 298 RSSHFSTAARRGIISDRFVEAILQFWRER 326 ++ HF++ R R+ E +++ ++ Sbjct: 252 QNVHFTSQEYRD-FGKRYAEKMIEVLGDK 279 >UniRef50_C0A5B6 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A5B6_9BACT Length = 646 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. 
Identities = 47/237 (19%), Positives = 67/237 (28%), Gaps = 32/237 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +AGQSN G G PHP I+ P H P C + Sbjct: 110 DVWLLAGQSNME--GCGFMDSPHCARPHPLIRAFTMAREWRQAADPLHIRWESP-DSCHN 166 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D +T G L A ++L +V GG++ + Sbjct: 167 DGATWDRTRAEQHR--RTALRGAGVGLPFAHEML--ARSGVPQALVCTAHGGTSMEQWNP 222 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 + D LY ++ RA G W QGE D A Sbjct: 223 ------------LHKKLGDGSLYGSMLLSMRATGQPC-----AGVLWYQGESDTAAPLAA 265 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQ 247 + +V A RRDL+Q D PW + ++ + Sbjct: 266 IYTDRMKKLVAATRRDLRQP--------DLPWIIVQLARVLGIRPETGWNSVQEQQR 314 >UniRef50_C5WRX2 Putative uncharacterized protein Sb01g000530 n=2 Tax=Sorghum bicolor RepID=C5WRX2_SORBI Length = 278 Score = 152 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 57/294 (19%), Positives = 85/294 (28%), Gaps = 69/294 (23%) Query: 12 VLTVAGQSNAMAYGEGLP-----LPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 V +AGQSN G + + AP PRI +L+ P H Sbjct: 32 VFLLAGQSNMGGRGGATNGTWDGVVPPDCAPSPRILRLSPSLRWEEAREPLH-------- 83 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLP---FIPDNAGILIVPCCRGGS 123 A VG + A LL +P +A + +VPC +G + Sbjct: 84 ----------------AGIDLHNVLGVGPGMPFAHALLRRHGRVPPHAVVGLVPCAQGAT 127 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKN-----------PQNKF 172 + S G TPLY ++ R RAALA N ++ Sbjct: 128 PIASWSRG------------------TPLYDRMLKRARAALANNNNNNNNNNNNAGSSRL 169 Query: 173 LGACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 W QGE D D + V RRDL + + G + Sbjct: 170 AALLWYQGEADTIRRQDADVYTSRMEAFVRDVRRDLGMPDLLVIQVG---LATGQGKFVD 226 Query: 232 KENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGS 285 +++ N + + T A + L+ Y Sbjct: 227 IVREAQRRVSLH----NVKYVDAKGLPVASDYTHLTTPAQVQLGKMLAASYLAP 276 >UniRef50_Q8A041 Acetyl xylan esterase A n=8 Tax=Bacteroides RepID=Q8A041_BACTN 
Length = 267 Score = 150 bits (379), Expect = 5e-35, Method: Composition-based stats. Identities = 42/323 (13%), Positives = 78/323 (24%), Gaps = 79/323 (24%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 A + GQSN G+ L + L P P Sbjct: 24 AEKPLKTLDLYLCIGQSNMAGRGK---LSPEVMDTLQNVYLLNADDQFEPAVNPL----- 75 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 + + VG A A+ + + ++ RGG Sbjct: 76 ----------------NRYSTIGKGLSWQQVGPAYGFAKTMATK---KHPVGLIVNARGG 116 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 S+ + + Y + + R + A+ W QGE Sbjct: 117 SSIRSWVKNAK--------------QSGGYYDEAIRRAKEAMK---YGTLKAIIWHQGEA 159 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI 242 D + + + ++ R DL D P G + P+ E Sbjct: 160 DCHHPEA--YKEKIIQLMTDLRNDLGMP--------DLPVVVGQIAQWNWTKKPYIPEGT 209 Query: 243 YGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHF 302 + + + F+ E L HF Sbjct: 210 KP-FNDMIKEISTFLPH-------SACVSSEGLTPL----------------KDETDPHF 245 Query: 303 STAARRGIISDRFVEAILQFWRE 325 AA + + R+ + + + ++ Sbjct: 246 D-AASQITLGKRYAKEVKKLIKK 267 >UniRef50_A6DRT7 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DRT7_9BACT Length = 252 Score = 150 bits (377), Expect = 8e-35, Method: Composition-based stats. 
Identities = 38/224 (16%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 110 NAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTP-------LYQDLVSRTRA 162 +LIV GG + D T P +Y+ ++ + Sbjct: 52 KENVLIVKEAIGGRPIRMWVHD-WKAAPYWKIDPNIPNTKNPQPKENGVMYKSMMKKITK 110 Query: 163 ALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 A + K + CWMQGE D A + + + + D + T + Sbjct: 111 ATQ-GKKPKAIAFCWMQGERDSRERHSAVYERSLKALFSQIKADFPE--------TPIVF 161 Query: 223 FCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGY 282 G + + K+ + +A+Y ++ + A ++ + E D L+TG Sbjct: 162 VIGKLSDFGKD----NKQALYPEWEEIIAAQ------KKVAKDTPNCKIIETHD-LNTGD 210 Query: 283 YGSAYRSPENWTTALRSSHFSTAARRGIISDRFVEAILQFWRER 326 +++ E H + + I+ RF EA ++ +++ Sbjct: 211 SPPHWKTKEIRKYVD-DLHMTNEGYK-ILGTRFAEAAIELLKKQ 252 >UniRef50_B0BZZ0 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0BZZ0_ACAM1 Length = 302 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 61/322 (18%), Positives = 99/322 (30%), Gaps = 88/322 (27%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 + +AGQSN G PL HP++ H P Sbjct: 59 LYVLAGQSNMTGRG---PLDAESSKTHPQVFVFGNDYRWHLAKDPLD------------- 102 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 + G P++ + + VG + A LL +A I ++PC RGGS Sbjct: 103 --SIDGQVDPVS--QEGKAPGVGPGMTFASALLKH-DKDAVIGLIPCARGGSTIQEWQ-- 155 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY-- 189 R ++ LY + R RAA + + G + QGE D + Sbjct: 156 -------------RNLSENSLYGSCLKRLRAA---SLMGQLEGMLFFQGEADALDQKQFS 199 Query: 190 ------ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIY 243 + F +E+FR D KQ + P + N + + Sbjct: 200 HLSLSPQQWSKKFEKFIESFRLDTKQEN--------LPIVFAQIGSHDAPNLLTQWNVVK 251 Query: 244 GNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFS 303 +N L ++ + DDL+ Y H++ Sbjct: 252 KQQENIQLPHVAMI----------------TTDDLALEDY----------------VHYT 279 Query: 304 TAARRGIISDRFVEAILQFWRE 325 T + R I RF A ++ + Sbjct: 
280 TKSYRTI-GQRFANAYIKLTEK 300 >UniRef50_A8FC47 Possible acetylxylan esterase n=19 Tax=Bacteria RepID=A8FC47_BACP2 Length = 276 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 72/296 (24%), Gaps = 90/296 (30%) Query: 13 LTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDV 72 + GQSN G +P + RI L R P HF+ Sbjct: 4 FLLIGQSNMAGRGFKHEVPPIY---NERIMML-RNGRWQMMTEPIHFD------------ 47 Query: 73 QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGT 132 VG A A + I ++PC GGS+ S Sbjct: 48 ---------------RPVAGVGLAASFAETWCKD-HEGEKIGLIPCAEGGSSIDEWSRDG 91 Query: 133 YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASH 192 L++ +S A + + G W QGE D Y + Sbjct: 92 ------------------ALFRHAISEATFAKENS---ELAGILWHQGESDSQDGKYKEY 130 Query: 193 PQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWK------------------EN 234 + + R +L + P G + + Sbjct: 131 DEKIRRLFHEIRTELSVPN--------IPLVIGGLGDFLGKVAFGAGCVEYQLINEELQK 182 Query: 235 FPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSP 290 + H +E Y + + NA + YY + R Sbjct: 183 YAHRHENCYY---------VTAKGLIPNPDGIHINAMSQ--RIFGLRYYEAFRRKQ 227 >UniRef50_C3QXI5 Glycoside hydrolase family 43 protein n=5 Tax=Bacteroidales RepID=C3QXI5_9BACE Length = 641 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. 
Identities = 47/323 (14%), Positives = 80/323 (24%), Gaps = 71/323 (21%) Query: 5 ISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIP 64 +Y+ GQSN ++ R + LA + G Sbjct: 19 AQDANFYIYLCLGQSNMEGNARY---EAQDTLVDARFQVLAAVDNKELGR---------- 65 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 ++G +P + A + R L+ +P + I +V GG Sbjct: 66 ----------VKGEWYPARAPLCRPNTGLTPADYFGRTLVENLPPHVRIGVVHVAIGGCR 115 Query: 125 FTAGSEGTYSERHGASHDACRW---GTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 + E + D D Y LV R A G QGE Sbjct: 116 IELFQKDKCEEYIKTAPDWMVNTLKEYDNDPYTRLVKMARIA---QKSGVIKGILLHQGE 172 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 + PQ + + DL + P G+ A Sbjct: 173 SNTGD---KEWPQKVKSVYDNLLADLHL------QADEVPLIAGEVV-----------NA 212 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 +G + I + + +++ + A H Sbjct: 213 DHGGVCAGMNEVIAMLPQVIKNCAIVSSKGL---------------------SCAPDHLH 251 Query: 302 FSTAARRGIISDRFVEAILQFWR 324 F A R ++ R+ L Sbjct: 252 FDAAGYR-VLGRRYAAQALHLMG 273 >UniRef50_O13495 Acetylxylan esterase n=2 Tax=Neocallimastigaceae RepID=O13495_NEOPA Length = 393 Score = 143 bits (360), Expect = 8e-33, Method: Composition-based stats. 
Identities = 39/323 (12%), Positives = 82/323 (25%), Gaps = 70/323 (21%) Query: 4 IISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDII 63 +++ GQSN G P+ ++ R + ++ + + Sbjct: 22 AAPDPNFHIYLAFGQSNMEGQG---PIGSQDRTVDKRFQMISTVSGCN------------ 66 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 G + G +G + R L+ +P + + G Sbjct: 67 ---------GRQMGNWYDAVPPLANCDGKLGPVDYFGRTLVKKLPQEIKVGVAVVAVAGC 117 Query: 124 AFTAGSEGTYSERH-GASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 + Y + Y L+ + A G QGE Sbjct: 118 DIQLFEKNNYRNYRLESYMQGRVNAYGGNPYGRLIEVAKKAQQ---VGVIKGILLHQGET 174 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI 242 + + P + E +DL N D P G+ ++ Sbjct: 175 NTG---QQNWPNRVKAVYEDMLKDLGL------NAKDVPLLAGEVV-----------QSN 214 Query: 243 YGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHF 302 G ++ + I + +++ HF Sbjct: 215 QGGQCGSMNSIIQKLPSVIPTAHVISSQGL---------------------GQQGDGLHF 253 Query: 303 STAARRGIISDRFVEAILQFWRE 325 S+ A R +R+ + +L+ + Sbjct: 254 SSQAYRT-FGERYADEMLKILGD 275 >UniRef50_Q84M79 Os03g0857600 protein n=4 Tax=Poaceae RepID=Q84M79_ORYSJ Length = 266 Score = 140 bits (352), Expect = 7e-32, Method: Composition-based stats. 
Identities = 53/280 (18%), Positives = 78/280 (27%), Gaps = 58/280 (20%) Query: 12 VLTVAGQSNAMAYGEGLPLP-----DREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 + + GQSN G P E AP PRI +L+ P H Sbjct: 32 IFLLGGQSNMGGRGGATNGPWDGVVPPECAPSPRILRLSPELRWEEAREPLH-------- 83 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 A VG + A L IP + I +VPC +GG+ Sbjct: 84 ----------------AGIDVHNVLGVGPGMSFAHALFRAIPPSTVIGLVPCAQGGTPIA 127 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSR---TRAALAKNPQNKFLGACWMQGEFD 183 + G T LY+ +V R A + W QGE D Sbjct: 128 NWTRG------------------TELYERMVGRGRAAMATAGAGAGARMGALLWYQGEAD 169 Query: 184 L-MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI 242 D + + MV RRDL + + + E + +A+ Sbjct: 170 TIRREDAEVYARKMEGMVRDVRRDLALPELLVIQVG----IATGQGKF-VEPVREAQKAV 224 Query: 243 YGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGY 282 + + T A + L+ Y Sbjct: 225 R--LPFLKYVDAKGLPIANDYTHLTTPAQVKLGKLLAKAY 262 >UniRef50_D2R5Y4 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Y4_9PLAN Length = 319 Score = 139 bits (349), Expect = 1e-31, Method: Composition-based stats. 
Identities = 49/335 (14%), Positives = 102/335 (30%), Gaps = 58/335 (17%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 ++ + + V +AGQSN +G+ P + P Sbjct: 26 SLTAAETLKVFVLAGQSNMQGHGKVKADPKANGGQGSLEWLVKES----PKKADFKHLVT 81 Query: 63 IPLTHCPHDVQDM--QGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 D + G L + +G L + I + +L+V Sbjct: 82 DSGDWVSRDDVQIWYLGRQGKLTAGYGASEEMMGPELGFGHVVGNAIDE--PVLLVKLAW 139 Query: 121 GGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQ---------NK 171 GG + GT P YQ+++++T+ L P+ + Sbjct: 140 GGKSLGQ-----------DFRPPSSGGTVGPYYQEIITQTKTVLKDLPKLFPEYASHQAE 188 Query: 172 FLGACWMQGEFD-LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 +G W QG D + + + ++ ++V R+DL + + + +T Sbjct: 189 LVGFGWHQGWNDRINQAFNDEYEKNLANLVRDLRKDLSAPNMK--------FVVAETG-- 238 Query: 231 WKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSP 290 P + + + A ++++ + +Y SP Sbjct: 239 --MTGPTE---THPRALSLMKAQAAVAEYEEFRGNVAF--------VGTRDFYRPKEESP 285 Query: 291 ENWTTALRSSHF-STAARRGIISDRFVEAILQFWR 324 + + H+ S A +I D A+L+ + Sbjct: 286 -----SGQGYHWNSNAETYYLIGDGMGHAMLKLLK 315 >UniRef50_Q7XSV9 OSJNBa0027H06.16 protein n=5 Tax=Poaceae RepID=Q7XSV9_ORYSJ Length = 282 Score = 138 bits (348), Expect = 2e-31, Method: Composition-based stats. 
Identities = 54/272 (19%), Positives = 87/272 (31%), Gaps = 51/272 (18%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 + +AGQSN G PHPR+ +LA P PP H Sbjct: 50 LFLLAGQSNMAGRGALARPLPPPYLPHPRLLRLAASRRWVPAAPPLH------------- 96 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 A + +G A+ A +LL + +VPC GG+ + G Sbjct: 97 -----------ADIDTHKTCGLGPAMPFAHRLLLQTDSEEVLGLVPCAVGGTRIWMWARG 145 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-MTSDYA 190 PLY+ V+R A + W QGE D D Sbjct: 146 Q------------------PLYEAAVARA-RAAVADGGGAIGAVLWFQGESDTIELDDAR 186 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNV 250 S+ +V R DL + + + G+ + + + + I N N + Sbjct: 187 SYGGKMERLVADLRADLHLPNLLVIQVG---LASGE--GNYTDIVREAQKNI--NIPNVL 239 Query: 251 LANIIFVDFQQQGERGLTNAPDEDPDDLSTGY 282 L + + + + T A + + L+ Y Sbjct: 240 LVDAMGLPLRDDQLHLSTEAQLQLGNMLAEAY 271 >UniRef50_C1ZKK3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKK3_PLALI Length = 1077 Score = 135 bits (338), Expect = 3e-30, Method: Composition-based stats. 
Identities = 43/335 (12%), Positives = 87/335 (25%), Gaps = 54/335 (16%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDRE--DAPHPRIKQL----ARFAHTHPGG 54 + A V +AGQSN +G R+ + + + Sbjct: 782 IAANSEAKPLKVFILAGQSNMEGHGVVSMDGKRDYNGGKGNLVWSMKHSQSAEKLKRLKN 841 Query: 55 PPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGIL 114 + + ++ + +G L + ++ + +L Sbjct: 842 EKGEWVIRDDVQISFKVDDKVRKGGLTIGYTGYGGSSHIGPELGFGFVMGDYLDE--PVL 899 Query: 115 IVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALA--KNPQNKF 172 ++ GG + G P Y +V RAALA + + + Sbjct: 900 LIKTAWGGKSL-----------FVDFRPPSSGGQVGPYYTKMVEEVRAALAELGDQKYEI 948 Query: 173 LGACWMQGEFDLMTSDY-ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 G W QG D+ A + Q+ ++V+ R++ + P G+ Sbjct: 949 AGFVWQQGWNDMCEKPAIAEYAQNLVNLVKDLRKEFDSPN--------LPVVVGELG--- 997 Query: 232 KENFPHSYEAIYGN-YQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSP 290 + + A T + +G + Sbjct: 998 ------NGGPVTSGDMFEFRKAQEQGTGQINNALFIKTTDFARPAELSPNTTHGHHW--- 1048 Query: 291 ENWTTALRSSHFSTAARRGIISDRFVEAILQFWRE 325 F A +I + E + Q +E Sbjct: 1049 -----------FGNAESYFLIGEALGEGMKQLLKE 1072 >UniRef50_C7J1I1 Os04g0110400 protein n=2 Tax=Poaceae RepID=C7J1I1_ORYSJ Length = 252 Score = 134 bits (336), Expect = 5e-30, Method: Composition-based stats. 
Identities = 45/207 (21%), Positives = 65/207 (31%), Gaps = 44/207 (21%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 + +AGQSN G PHPR+ +LA P PP H Sbjct: 53 LFLLAGQSNMAGRGALARPLPPPYLPHPRLLRLAASRRWVPAAPPLH------------- 99 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 A + +G A+ A +LL + +VPC GG+ + G Sbjct: 100 -----------ADIDTHKTCGLGPAMPFAHRLLLQTDSEEVLGLVPCAVGGTRIWMWARG 148 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-MTSDYA 190 PLY+ V+R A + W QGE D D Sbjct: 149 Q------------------PLYEAAVARA-RAAVADGGGAIGAVLWFQGESDTIELDDAR 189 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNI 217 S+ +V R DL + + + Sbjct: 190 SYGGKMERLVADLRADLHLPNLLVIQV 216 >UniRef50_B5JIZ7 Conserved domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JIZ7_9BACT Length = 296 Score = 133 bits (334), Expect = 8e-30, Method: Composition-based stats. Identities = 53/330 (16%), Positives = 84/330 (25%), Gaps = 60/330 (18%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 A + Y V + GQSN + +G LP D + + G ++ Sbjct: 10 ATANAKTYKVYFLGGQSNMVGFGHENELPGDLDRRIYEVPIFS--------GSSKMDENL 61 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 + G G + A +LL P+ I I+ GG Sbjct: 62 AGGDGKWTTLGIGFGLGSDFVDGQYVLSDRFGPEITFADELLKIAPEE-NIAIIKYAWGG 120 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQN--------KFLG 174 +A G G Y Y + R ALA + G Sbjct: 121 TALLDGVSG-YGSWDPKVR-------KLNQYDYFLKTVRKALAARDIDNDGEHDLLVPAG 172 Query: 175 ACWMQGEFDLMTSDYAS--HPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWK 232 WMQGE D S AS + ++ +++ R P G T Sbjct: 173 IIWMQGEADAFESQAASQAYQENLANLMSLMRAAFHDNS--------LPIVIGRITD--- 221 Query: 233 ENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPEN 292 + + + A F E YG Sbjct: 222 -SRSGQSKPVMKYSHTVRQAQKAFAAQDPFASLSTVTEQLE---------YG-------- 263 Query: 293 WTTALRSSHFSTAARRGIISDRFVEAILQF 322 + H+ + ++ F + I Sbjct: 264 ---EHDAWHYLSEGY-LLMGRDFAQEIHSL 289 >UniRef50_A6C656 Iduronate-2-sulfatase 
n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C656_9PLAN Length = 667 Score = 130 bits (326), Expect = 7e-29, Method: Composition-based stats. Identities = 52/321 (16%), Positives = 100/321 (31%), Gaps = 80/321 (24%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 + P + +AGQSN ++ G LP++ P + P + Sbjct: 16 SAKEPAIDKLFLLAGQSNMVSQGTLAELPEQLQQPPTNVY-FWSNGTWIPYHNKVAYVK- 73 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 G L IA +L PD I ++ +GG Sbjct: 74 --------------------------PGKEFGPELAIAHELSRAFPDE-KIGLIKHAKGG 106 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 +A PL + L + A + WMQGE Sbjct: 107 TAIRLWQP------------------RMPLVRGLFQKLDDAQKAG-GGEVAALFWMQGER 147 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKEN-FPHSYEA 241 D + A + + F ++++A R+ Q + P G + E + Sbjct: 148 DARFHEPA-YAKKFQNLIQAVRQKSDQP--------ELPVVFGRISRIIPEREYTDQIRQ 198 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 I + LAN++ +D T+A + P++++ + + +H Sbjct: 199 IQQQVAD-ELANVVMID---------TDALERKPEEITVNGKPTKFL-----------AH 237 Query: 302 FSTAARRGIISDRFVEAILQF 322 +S+ + + + +A L+ Sbjct: 238 YSSRG-QIDLGMQLAQAYLKL 257 >UniRef50_Q9LF91 Putative uncharacterized protein F8J2_180 n=1 Tax=Arabidopsis thaliana RepID=Q9LF91_ARATH Length = 169 Score = 127 bits (317), Expect = 7e-28, Method: Composition-based stats. 
Identities = 33/209 (15%), Positives = 60/209 (28%), Gaps = 57/209 (27%) Query: 21 AMAYGEGLP-----------LPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 G + E +P I +L P H + I Sbjct: 1 MAGRGGVYNDTATNTTVWDGVIPPECRSNPSILRLTSKLEWKEAKEPLHVDIDI------ 54 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + VG + A +++ + +VPC GG+ + Sbjct: 55 ------------------NKTNGVGPGMPFANRVVNRF---GQVGLVPCSIGGTKLSQWQ 93 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-MTSD 188 +G + LY++ V R +AA+A + W QGE D D Sbjct: 94 KGEF------------------LYEETVKRAKAAMASGGGGSYRAVLWYQGESDTVDMVD 135 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNI 217 + + + R DL+ + + + Sbjct: 136 ASVYKKRLVKFFSDLRNDLQHPNLPIIQV 164 >UniRef50_A5FD34 Candidate bifunctional acetylxylan esterase/feruloyl esterase; Carbohydrate esterase family 1/carbohydrate esterase family 6 n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FD34_FLAJ1 Length = 560 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 48/322 (14%), Positives = 85/322 (26%), Gaps = 72/322 (22%) Query: 5 ISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIP 64 ++V GQSN + + P + + R + L+ G Sbjct: 22 SQDPNFHVYLSFGQSNMEGFAKIE--PQDKTGVNERFQVLSAVDCPEMGRE--------- 70 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 +G + + + R ++ +P N + +V GG Sbjct: 71 -----------KGKWYTAVPPLCRCTTGLTPMDYFGRTMISNLPQNIKVGVVNVAVGGCK 119 Query: 125 FTAGSEGTYSERHGASHDACRWGTD---TPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 + + S D + Y LV + A + G QGE Sbjct: 120 IELFDKNNFESYVANSPDWLKNIVKQYDGNPYGRLVEMAKIA---QKKGVIKGILLHQGE 176 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 + + PQ + + +DL + P G+T Sbjct: 177 SNTGDTL---WPQKVKIVYDNLIKDLNLDPKK------VPLLSGET-------------- 213 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDED-PDDLSTGYYGSAYRSPENWTTALRSS 300 V+ Q G+ G N P L Y S+ + Sbjct: 214 ---------------VNEDQNGKCGSMNKIIAKLPQVLPNSYIISSKGCKADADF----L 254 Query: 301 HFSTAARRGIISDRFVEAILQF 322 HFS R + R+ + 
+L Sbjct: 255 HFSPEGYRE-LGKRYADKMLSL 275 >UniRef50_Q07GI7 Conserved domain protein n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GI7_ROSDO Length = 617 Score = 123 bits (308), Expect = 8e-27, Method: Composition-based stats. Identities = 46/227 (20%), Positives = 67/227 (29%), Gaps = 42/227 (18%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 A P +V + GQSN + G Sbjct: 57 QAAAQPRETHVFALMGQSNMIGRAAFDGGAK------------WPDGTLQIGRGGDEDGA 104 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 IIP + D PLA G +G + A L PD +L +PC +G Sbjct: 105 IIPA----RNPADGPATSRPLAHTGAR-LGNMGLDIQFAIDYLSDKPD-VTLLFIPCAQG 158 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 + F W LY +R AA+ NP+ F G W QGE Sbjct: 159 ATGF----------------SNGAWNPGDWLYNRETARINAAMNANPEFLFQGFLWHQGE 202 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT 228 D ++ ++++ RRD+ P+ G Sbjct: 203 TDTGIPG--TYGGLLDNLIAGLRRDV------TAATPTTPFILGGLA 241 >UniRef50_A6DJ18 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ18_9BACT Length = 229 Score = 123 bits (307), Expect = 1e-26, Method: Composition-based stats. 
Identities = 38/214 (17%), Positives = 69/214 (32%), Gaps = 36/214 (16%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 + ++GQSNA G G L + P + FN +T P Sbjct: 28 LFILSGQSNAGGNGNGDELKQSQKELDPEVL--------------LAFNSGQFMTMAPIK 73 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 + ++ + Q G L ++KL P++ + RGG++ A G Sbjct: 74 KKKVK---------FKIQNSIFGTELSFSKKLKQAYPNDIIAICKVGIRGGTSIVAW--G 122 Query: 132 TYSERHGASHDACRWGTDTP----LYQDLVSRTRAALAKNPQNK------FLGACWMQGE 181 R G + G + LYQ+++ + + K G W Q E Sbjct: 123 KDRTRPGWKEELKALGIEEASQRMLYQEIIDGVNKGIENLKKRKDVKEVIISGMWWCQTE 182 Query: 182 FDLM-TSDYASHPQHFNHMVEAFRRDLKQYHSQL 214 D ++ ++ + + R+D Sbjct: 183 RDSSFVEFSKAYEKNLTNFINNLRQDFNTPRLPF 216 >UniRef50_Q2YI73 Acetlyxylan esterase (Fragment) n=1 Tax=unidentified microorganism RepID=Q2YI73_9ZZZZ Length = 297 Score = 122 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 40/319 (12%), Positives = 79/319 (24%), Gaps = 68/319 (21%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 +Y+ GQSN G + D + +P R ++ Sbjct: 2 DPKFYIYLCIGQSNMEGQG---VIEDCDLSPDERFLMMST-------------------- 38 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 G + + A + R ++ + + + +V GG Sbjct: 39 --LDCGTRKLGQWYRAIPPLARCDTHLCPADYFGRTMVANLDEGKRVGVVVVAIGGINID 96 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSR-TRAALAKNPQNKFLGACWMQGEFDLM 185 + G +++ + + + R A G QGE D Sbjct: 97 LYDPDGWQSYVGTMNESWQINAVNAYGGNPLGRLLECAREAQKSGVIKGILLHQGENDAY 156 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGN 245 + Q + E +L N D P G+ Sbjct: 157 ---SSVWLQKVKKVYENLLAELNL------NAEDVPLIAGEVG----------------- 190 Query: 246 YQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTA 305 + + G+ A + + L + S T + HF + Sbjct: 191 ---------------NEDQNGICCAANNTINRLPQTIPTAHVVSSVGCTLQSDNLHFDSK 235 Query: 306 ARRGIISDRFVEAILQFWR 324 R + R+ + +L Sbjct: 236 GYRK-LGRRYAKTMLATMG 253 >UniRef50_A0LNW1 Putative uncharacterized 
protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LNW1_SYNFM Length = 261 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 41/271 (15%), Positives = 83/271 (30%), Gaps = 35/271 (12%) Query: 27 GLPLPDREDAPHPRIKQLARFAHTHPG--GPPCHFNDIIPLTHCPHDVQDMQGYHHPLAT 84 + + P + +A + G + P + V DM P Sbjct: 12 VMLVYPSFALSEPIVFIVAGQSTAKCGLVEEVATGDLRTPHQLKEYHVLDMDYSEKPRIV 71 Query: 85 NHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDAC 144 Q G + + + P ++++ + GS T S + G Sbjct: 72 QTFDQRSHFGPEVRFVQLYVKANPSR-EVILLKMVKNGSGMTRWSP----KWPGEYDQWT 126 Query: 145 RWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-MTSDYASHPQHFNHMVEAF 203 LY+ LV A+ ++ G ++QGE D ++ Q+ ++V Sbjct: 127 -----GDLYRILVDFVIEAVDGRDV-EWGGFLFVQGENDSVYPERARAYVQNLRNLVNRL 180 Query: 204 RRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANII-------F 256 R DL + +P E +PH Y+ + N L Sbjct: 181 REDLGAPKMPVMTSEVSPVL---------EKYPHQYQ-VNQAKINAALTGRDMFVVSNSA 230 Query: 257 VDFQQQGERGLTNAPDEDPDDLSTGYYGSAY 287 + +++ G ++A + L ++ + Sbjct: 231 LGYREDGIHFTSDAVLQ----LGMRFFWTYR 257 >UniRef50_A9GHE3 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GHE3_SORC5 Length = 346 Score = 117 bits (292), Expect = 5e-25, Method: Composition-based stats. 
Identities = 44/326 (13%), Positives = 85/326 (26%), Gaps = 94/326 (28%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLA----RFAHTHPGGPPCH 58 A S +++ + GQSN + R+K L + PP Sbjct: 110 APNSRPTFHIFMLMGQSNMAGVAAKQA---SDQNSDQRLKVLGGCNQPAGQWNLANPPL- 165 Query: 59 FNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPC 118 + CP + + V + + LL + + I ++ Sbjct: 166 -------SDCPGESRINLSTS-------------VDPGIWFGKTLLGKLREGDTIGLIGT 205 Query: 119 CRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWM 178 G + G + + +++ + A +F G + Sbjct: 206 AESGESINTFISGGSHHQTILNK---------------IAKAKTA----ENARFAGIIFH 246 Query: 179 QGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS 238 QGE D S S P + + + D P+ G+ Sbjct: 247 QGETDTGQS---SWPGKVVQLYNEMKAAWGVDY-------DVPFILGELP---------- 286 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALR 298 N + D L GY + S E + Sbjct: 287 ---------------------AGGCCSVHNNLVHQAADMLPDGY----WISQEGTKVMDQ 321 Query: 299 SSHFSTAARRGIISDRFVEAILQFWR 324 HF A+ ++ R+ E +++ + Sbjct: 322 -YHFDHASV-VLMGTRYGEKMIEALK 345 >UniRef50_UPI0001BC8395 sialate O-acetylesterase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC8395 Length = 478 Score = 114 bits (284), Expect = 6e-24, Method: Composition-based stats. 
Identities = 48/264 (18%), Positives = 79/264 (29%), Gaps = 36/264 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + P R+ P + Sbjct: 109 EVWFCSGQSNME--------MIMRNDPQWRLYVDNANEEIAVADYPGIRFMTVQRNESFT 160 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + ++ + + QT G + ARKLL + + +V GGS + Sbjct: 161 ALDEVLTEGWQVCSP-QTVGGLSAVGYYFARKLLSSLD--VPVGLVVDAYGGSPIQSWIP 217 Query: 131 GT------YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL 184 Y H +A G + P Y L S A + K G W QGE ++ Sbjct: 218 YAETLKPLYKAEHETLQEAVEKGKEKPEYNMLSSLYNAMVHPLIDYKIRGWLWYQGEANV 277 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 D + +V ++R+ K P+ Y+ + P Y Y Sbjct: 278 G--DAGRYIAMMKDLVSSWRKKWKA---------KLPF-------YYVQIAPFQY-PGYQ 318 Query: 245 NYQNNVLANIIFVDFQQQGERGLT 268 + LA + + Q G+ Sbjct: 319 KEKWAELAEVQSMALQTISSSGMV 342 >UniRef50_UPI00016C5503 hypothetical protein GobsU_26641 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5503 Length = 546 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. 
Identities = 40/258 (15%), Positives = 72/258 (27%), Gaps = 39/258 (15%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT---- 66 V +AGQSN + + +K L + T Sbjct: 29 KVFVLAGQSNMEGHSVTDLSGKDYNDGKGTLKTLLDDPDRAKLVRHLRNDRGDWGTRDDV 88 Query: 67 HCPHDVQDMQGYHHPLATNH--QTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 + + PL G L L + +L++ GG + Sbjct: 89 WVRYQRERGPLLAGPLGMGFSVYGDKHHFGPELQFGHVLGDHFEN--QVLLIKTAWGGKS 146 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALA---------KNPQNKFLGA 175 + G P Y +++ RAALA + + G Sbjct: 147 L-----------YKDFRPPSSGGEVGPYYTKMIADVRAALANLKAEFPKYADQGYELAGF 195 Query: 176 CWMQGEFDLMTSDYA--SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 W QG D + + A + Q+ ++++ R +LK P G+ T W + Sbjct: 196 VWYQGWNDGVDAKKAVPEYEQNLVNLIKDVRTELKAPK--------LPVVIGELTGPWVD 247 Query: 234 NFPHSYEAIYGNYQNNVL 251 P ++ + Sbjct: 248 -APGAWSDLRKAQAAAAE 264 >UniRef50_C6VVT7 Putative uncharacterized protein n=2 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VVT7_DYAFD Length = 618 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 61/222 (27%), Gaps = 36/222 (16%) Query: 11 YVLTVAGQSNAMA-YGEGLPLPDREDAPHPRIKQLARFAHTHPGGP-PCHFNDIIPLTHC 68 V +AGQSNA + LP P + P F ++ PL Sbjct: 132 DVFIIAGQSNAQGIKDQSYKLPSGAGIPE---WVVGASEDKTCTRKLPESFTNLFPL--- 185 Query: 69 PHDVQDMQGYHHPLATNHQ--TQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 + D H PL YG +G+ + A +P + GS+ T Sbjct: 186 --NTADDMKKHGPLGPTGNSVWAYGVLGKLISDANGGMP-------VAFFNAATAGSSVT 236 Query: 127 AGSEGT-----YSERHGASHDACRWGTD---TPLYQDLVSRTRAALAKNPQ-NKFLGACW 177 +G GA G Y + + AL W Sbjct: 237 EWKQGADGVEAKHPYTGAQVCLGYMGGSVIPKDYYGQPYTALKTALNYYGSLYGVRAVLW 296 Query: 178 MQGEFDLMT--------SDYASHPQHFNHMVEAFRRDLKQYH 211 QGE D S A + ++ R D + Sbjct: 297 HQGEADADPNVNAIYKASSAADYQSKLQAVIAKSRSDFAAPN 338 >UniRef50_C1ZJF8 Putative uncharacterized protein n=5 Tax=Bacteria RepID=C1ZJF8_PLALI Length = 427 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. 
Identities = 49/399 (12%), Positives = 90/399 (22%), Gaps = 85/399 (21%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEG--LPLPDREDAPHPRIKQLARFAHTHPGGPPCHFN 60 A V +AGQSN + + P +K++ + Sbjct: 33 ACAEGKPLKVFILAGQSNMEGHARVETFDYIGDDPVTLPLLKKMRGADGKPVVCEGVWIS 92 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 + + N Q G +G + + +L++ Sbjct: 93 YFTGSGDKNGEGFGPLTAGYGSRRNPQEDGGKIGPEFTFGIAMDAAFEE--PVLLIKTAW 150 Query: 121 GGSAFTA-GSEGTYSERHGASHDAC---------------RWGTDTPLYQDLVSRTRAAL 164 GG + + + Y+ +V + L Sbjct: 151 GGKSLNTDFRPPSAGPYVFNEKQLSDFRKQGKDIESIQKAKAEETGHYYRLMVDHVKHVL 210 Query: 165 AKNP----------QNKFLGACWMQGEFD----------LMTSDYASHPQHFNHMVEAFR 204 + P + G WMQG D + YA++ + H + R Sbjct: 211 SDIPRVCPQYDEKQGYELSGFVWMQGWNDLVDTGTYPNRSEPNGYAAYSEVMAHFIRDVR 270 Query: 205 RDLKQYHSQLNNITDAPWFCGDTTWYWKEN------------------------------ 234 +DL P+ G + Sbjct: 271 KDLNAPQM--------PFVIGVLGVDGGKRNLQTANFRAAMAAPAMLPEFRGNVAAVETA 322 Query: 235 --FPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYY----GSAYR 288 + AI Y N + + +E Y A Sbjct: 323 PYWAEELGAIAQKYDQVRQMNYFLNSKHKDHANADGSMTEEQKRAYLKQYEEKLISPAEV 382 Query: 289 SPENWTTALRSSHFSTAARRGI-ISDRFVEAILQFWRER 326 + + H+ A+ I + F EA L+ E Sbjct: 383 TLWKRGASNAGYHYLGCAKTFAQIGNAFAEANLKLLNEE 421 >UniRef50_C3PX29 Predicted protein n=4 Tax=Bacteroides RepID=C3PX29_9BACE Length = 267 Score = 103 bits (256), Expect = 8e-21, Method: Composition-based stats. 
Identities = 42/295 (14%), Positives = 71/295 (24%), Gaps = 62/295 (21%) Query: 5 ISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIP 64 V V GQSNA +P R++ Sbjct: 20 AQQKEIDVYLVGGQSNATGQAYVKNIPASFKI-DTRVRIYYSR----------------- 61 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 + + +PL +T+ G L + KL P LI G + Sbjct: 62 ----FLNKGEGSEQWNPLCQASETK-NKFGIELSLGTKLQSLYPKPQIALIKHALSGSNL 116 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAA----LAKNPQNKFLGACWMQG 180 + + G + Y + + + A + + W QG Sbjct: 117 YQQWNPGN-----------RQKNIRGEEYINFIKTVKDAIISLKQQGYRPIIKAMVWQQG 165 Query: 181 EFD----LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 E D + + + +E R++ + T P T Sbjct: 166 EADARDIAGMEQSRQYSSNLKNFIEQIRKEFNSENMLFVYGTVIPIAASRFT-------- 217 Query: 237 HSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNA---PDEDPDDLSTGYYGSAYR 288 + V V E + NA P +D L Y + Sbjct: 218 ---------GRELVRKAQFAVSNNSNSEFSVNNALLIPADDLQMLYNDYQIQHLK 263 >UniRef50_A6DFN7 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN7_9BACT Length = 471 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. 
Identities = 34/236 (14%), Positives = 62/236 (26%), Gaps = 24/236 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + R+ + A + + Sbjct: 101 DVWLCSGQSNMAWRAGAYAKNVQGMENKIRLFMVKESAEL-----------MSVSKADLN 149 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + P + + VG L+ A +L I I ++ C GGS Sbjct: 150 RDVKVGMSWSPCTPKNSVAFSGVG--LNFALRLQQDID--VPIGLIHCAVGGSRIECWMS 205 Query: 131 GTYSERHGASHDACRW-------GTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 + + R G + + + L K G W QGE + Sbjct: 206 ENELLKEAQTEKLMRNFLNNIKAGERFTANRSIAALYNTMLKPLIGFKIKGVLWYQGEAN 265 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 + + ++ +R Q + AP+ G + + Y Sbjct: 266 --RRQGSQYKDLLKRLISEWRDQWGQGDFPFYYVQIAPFDYGKMLFTSNQLRESQY 319 >UniRef50_A3K171 Probable acetyl xylan esterase AxeA n=2 Tax=Bacteria RepID=A3K171_9RHOB Length = 920 Score = 100 bits (247), Expect = 9e-20, Method: Composition-based stats. Identities = 52/282 (18%), Positives = 83/282 (29%), Gaps = 52/282 (18%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 VL + GQSN + G HP A + G L H D Sbjct: 681 VLPLIGQSNMVGQGVFDGGAG-----HP-----ATYKQWLQAGSLAACT--AHLDHLSSD 728 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 +M G ++ A P NA ++ VPC G+ + Sbjct: 729 AGEM------------------GLSVQFAIDFAAEFP-NAQLIFVPCAVSGTGY------ 763 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYAS 191 + G RWG LY V RT + + PQ G GE D S + Sbjct: 764 GFGNLQGGVE-LGRWGIGDDLYLAAVRRTDEVMRRYPQCVLGGILHHDGEDDAENSTA-N 821 Query: 192 HPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT-WYWKENFPHS---YEAIYGNYQ 247 + + +R + + P+ G+ FP A+ G Sbjct: 822 FADLLDEAIGGYRTSI------VGASAQTPFVVGEIAQDLDTGTFPLRDTINAALAGLPG 875 Query: 248 NNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRS 289 + + + NA + + Y+ +A++S Sbjct: 876 RLIHTACASSSGLVRQDFAHFNAASQ--RTFGSRYF-TAWQS 914 >UniRef50_UPI00016C3AF5 hypothetical protein GobsU_22277 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AF5 
Length = 353 Score = 99.7 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 47/364 (12%), Positives = 85/364 (23%), Gaps = 70/364 (19%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 +P V +AGQSN L + + G D Sbjct: 20 ARADAPKPVKVFLLAGQSNMEGKAPNALLDSQAADAKTKDLFAHLR-----AGDKWAVRD 74 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + + G+ P G G L L + +L+V G Sbjct: 75 DVFVKFLDRKGPLTVGFGSP---------GRTGAELEFGTALGNHFDE--PVLLVKAAWG 123 Query: 122 GSAF-------------TAGSEGTYSERHGASHDACRWGTDTPL--------------YQ 154 G + A E + T Y+ Sbjct: 124 GHSLYKQFRPPAAGLPADAVLEKELKQARDRVRQNNEKNKKTDPLPTLDEIKKGYGESYR 183 Query: 155 DLVSRTRAALA---------KNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRR 205 +++ + A K + G W QG D + + H++ R+ Sbjct: 184 AMLAEVKGATENMGTLFPALKGRPFELAGFVWFQGWNDQYNGAETEYEANLKHLINDVRK 243 Query: 206 DLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGER 265 DLK P + I + + Sbjct: 244 DLKAPK--------LPVVIAAMGQNGSKPATGPMLVIQKAQLAMNEVPEYRGNVKAVRTD 295 Query: 266 GLTNAPDEDPDDLSTGYYGSAYRSPENWTTA--LRSSHF-STAARRGIISDRFVEAILQF 322 L + E+ + + + E W H+ +A I +A+L+ Sbjct: 296 VLVDTAAEEL-------FPTWRENKEKWDKIGGDFPYHYLGSAIWFNRIGKSLADAMLEL 348 Query: 323 WRER 326 + + Sbjct: 349 HKNK 352 >UniRef50_UPI0001C367C9 hypothetical protein ChatD1_04961 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C367C9 Length = 254 Score = 98.5 bits (243), Expect = 3e-19, Method: Composition-based stats. 
Identities = 34/224 (15%), Positives = 59/224 (26%), Gaps = 57/224 (25%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 +L GQSN G+ P+ + + P Sbjct: 4 DILLFMGQSNMAGRGDYRLAPEVLPGAAYEYRAVTEPDTLVP------------------ 45 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQAL---HIARKLLPFIPD--NAGILIVPCCRGGSAF 125 P N + G + +A + I+ V C +GGS Sbjct: 46 -------LTEPFGVNENREGGVFEPGMKTGSMAAAFVNACYRKTGRPIIAVSCSKGGSRI 98 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFL----GACWMQGE 181 +TP ++D +R +A L+ + G W QG Sbjct: 99 QEWQP------------------ETPYFKDAAARYQACLSFVQSRQIAVHSTGMVWCQGC 140 Query: 182 FDLMTSDY-ASHPQHFNHMVEAFRRDLKQYH---SQLNNITDAP 221 + A + + +A + L Q+ N + P Sbjct: 141 TNADDGMAKAEYKEKTKAFFQAVKS-LGVDKIFLIQIGNHREFP 183 >UniRef50_UPI00016C0614 hypothetical protein Epulo_09645 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0614 Length = 246 Score = 97.0 bits (239), Expect = 8e-19, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 65/229 (28%), Gaps = 58/229 (25%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 S + ++ AGQSN G G AP + + ND + Sbjct: 6 SKKKFDIIIAAGQSNXEGAGIGXVEDP--YAPKNNVWSM---------------NDFVIA 48 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPF--IPDNAGILIVPCCRGGS 123 + LH A + L + + ILI+ +GG+ Sbjct: 49 KATERIKGNTLRGRF---------------VLHFAXEYLNAGLLDADREILIISAAQGGT 93 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 F W L ++ T+ AL N +NK + W QGE + Sbjct: 94 GFAT----------------HEWNRGDALAVRMLEMTKTALELNTENKIVAXLWHQGERE 137 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWK 232 + SH + R L + + P D WK Sbjct: 138 V------SHNMTAEAHLNNVRILLGDLQAAFGK--NFPMITADLVPIWK 178 >UniRef50_A6DFJ1 Sialate O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFJ1_9BACT Length = 583 Score = 97.0 bits (239), Expect = 9e-19, Method: Composition-based stats. 
Identities = 34/245 (13%), Positives = 65/245 (26%), Gaps = 27/245 (11%) Query: 11 YVLTVAGQSNAMA--YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 + GQSN + A + Sbjct: 103 DIWLCGGQSNMDYDIRSYVNWGRKPHKEYTSIVNNFASYDQIRLAKMNKISTLPNQYDVP 162 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + + +G + N + G+ A+KL + I ++ +GGS Sbjct: 163 FVEDKTFKGKWMKASDNKELILGSSAVGFVFAKKLQADL--KIPIGLIDANKGGSFIKFW 220 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-MTS 187 + G S A ++ ++ G W QGE D Sbjct: 221 EPPHALKARGESRPA------RNMFNSMLGSY------AHGFPIKGFIWYQGESDAINLQ 268 Query: 188 DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQ 247 + + F M+E +R + K + P+ + E P+ + Y + Sbjct: 269 KAQEYEKTFKTMIEGWRHEFKDP--------EMPFLFVQLASF--ERNPYMHGITYPVLR 318 Query: 248 NNVLA 252 + A Sbjct: 319 DAQTA 323 >UniRef50_C6XSW9 Sialate O-acetylesterase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XSW9_PEDHD Length = 469 Score = 93.5 bits (230), Expect = 9e-18, Method: Composition-based stats. 
Identities = 40/258 (15%), Positives = 72/258 (27%), Gaps = 31/258 (12%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN G P L G + Sbjct: 105 EVWICSGQSNMSMTMRGFPKQP----------ILNADGIIAAAGNAQLRLFKLQRAASLE 154 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + D++G + + + + A L + + ++ GG+ A Sbjct: 155 PLNDVKGNWNTASPESAAAFSAM--AYQFGEMLQRRL--KVPVGLILTAVGGTQIQAWMS 210 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 ++ + T ++ + +A + K GA W QGE + D A Sbjct: 211 SAQLQQFKEVNIPLSLDTAAAPHKMPTALYNGMVAPLLKFKIKGAIWCQGEAN--RDDPA 268 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNV 250 + Q F MV +R+D D P+ Y+ + P + Sbjct: 269 LYGQLFPAMVAGWRKDWNIP--------DFPF-------YYVQIAPLNSRDKRPTIVLVR 313 Query: 251 LANIIFVDFQQQGERGLT 268 A +D + T Sbjct: 314 EAQQKALDKIPNSDMVCT 331 >UniRef50_D2R930 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R930_9PLAN Length = 901 Score = 93.1 bits (229), Expect = 1e-17, Method: Composition-based stats. 
Identities = 39/273 (14%), Positives = 64/273 (23%), Gaps = 60/273 (21%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPP--CHFN 60 A S V +AGQSN + D P P G P C Sbjct: 511 AGNSAKPLKVFILAGQSNMQGHAAISTFDSLADDPQTAPLLKTMRG---PDGKPRICEKV 567 Query: 61 DIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR 120 I + D+ L +G + + + +LI+ Sbjct: 568 WISSVGCLGDAYSDLTEAKGKLTAGFGAPEHKIGPEFTFGLTMEQRL--SEPVLIIKTSW 625 Query: 121 GGSAF-------------------TAGSE------GTYSERHGASHDACRWGTDTPLYQD 155 GG + + G A + Y++ Sbjct: 626 GGRSLHTDFRPPSAGPFVLAKETQELWDKHPEGAHGIPKAEDRPKFYAEKTAATGVFYRE 685 Query: 156 LVSRTRAALA----------KNPQNKFLGACWMQGEFDL----------MTSDYASHPQH 195 +++ + L +N + G W QG D Y + Sbjct: 686 MIAHVKMVLKDIKRVVPDYDENQGYELAGFVWFQGFNDYVDGGVYPKQNTAGGYELYADL 745 Query: 196 FNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT 228 H + R+DL P+ G Sbjct: 746 LGHFIRDVRKDLSAPK--------LPFVIGVMG 770 >UniRef50_A5FCG2 Sialate O-acetylesterase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FCG2_FLAJ1 Length = 474 Score = 93.1 bits (229), Expect = 1e-17, Method: Composition-based stats. 
Identities = 48/337 (14%), Positives = 94/337 (27%), Gaps = 53/337 (15%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN +PL + P IP Sbjct: 105 EVWLCSGQSNME-----MPLKGFQGQP-----VKNGNEIIVRSANKNIRLITIPRATVLE 154 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG-S 129 +QD +G + + + A + L + N + ++ GGS+ A + Sbjct: 155 PLQDFEGKWEEASPKSTSNFSA--TAWYFGSLLQEVL--NVPVGLIHVSYGGSSMEAWMN 210 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 + + A + + + L+ G W QGE + Sbjct: 211 QEMLKDFASAKIPTTKEELAKDPNRVPTTLFNGMLSPVIGYGIKGCIWYQGESN--YERA 268 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY---------WKENFPHSY- 239 + + MV ++R Q D P++ + + E + +Y Sbjct: 269 SEYTALMKKMVSSWRALWNQ--------GDFPFYFAQIAPFNYASFHPKDYLEKYNSAYL 320 Query: 240 -EAIYGNYQNN------------VLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSA 286 EA + NI +D ++ G+R A + Sbjct: 321 REAQLKASKEIPNSAMAVLMDVGEENNIHPMDKEKGGDRLAFQALARTYGIEGFEFESPK 380 Query: 287 YRSPEN-----WTTALRSSHFSTAARRGIISDRFVEA 318 Y+S E + ++ T+ + ++ A Sbjct: 381 YKSMEIKDGAVTVSFDDVNNGITSYDKEVLGFEIAGA 417 >UniRef50_B4DB21 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DB21_9BACT Length = 384 Score = 92.7 bits (228), Expect = 1e-17, Method: Composition-based stats. 
Identities = 43/233 (18%), Positives = 67/233 (28%), Gaps = 62/233 (26%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V VAGQSN+ YGE +++ R+ L P Sbjct: 125 EVFVVAGQSNSANYGE-----EKQTTQTGRVTALDGRG-WQLANDP-------------- 164 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 G+ L A + +P I V C GG++ Sbjct: 165 ------------QPGAAGSRGSFMPPLGDALEERFHVP----IGFVACGVGGTSVREWLP 208 Query: 131 GT-------YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 E W + LY L++ +A P F W QGE D Sbjct: 209 QGVVFPNPPTVESRVVRLAGGTWESKGQLYAKLLASMKAV---GPHG-FRAVLWHQGESD 264 Query: 184 LMTSD------YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 D + ++ ++ RR++ DAPWF +++ Sbjct: 265 ANQQDTSRTLPGKLYREYLEKIIRESRREVG---------WDAPWFVAQASYH 308 >UniRef50_A7V9B4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7V9B4_BACUN Length = 473 Score = 92.0 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 39/275 (14%), Positives = 67/275 (24%), Gaps = 39/275 (14%) Query: 11 YVLTVAGQSNAM----AYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 V +GQSN + + +D I A P H + Sbjct: 105 EVWLASGQSNMEMPLKGFAGCCIMNGADD-----IIVSAENKGVRMFTVPKHQTYELQTD 159 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 C P + A + A L + + I+ GGS Sbjct: 160 -CKGSWNISSIETAPDFSAT---------AYYFATSLSRAL--RIPVGIICSAYGGSTVE 207 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 E + + A L G W QGE ++ Sbjct: 208 GWISRELLENYPDISLNPDSIERLHPMLRPMLMYNAMLKPLQNYTIKGFIWYQGESNVGR 267 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNY 246 ++ + MV+ +R++ G+ +Y+ E P++Y Sbjct: 268 H--ETYAERLADMVQLWRKEWGL---------------GELPFYFAEIAPYAYGGTQQEK 310 Query: 247 QN-NVLANIIFVDFQQQGERGLTNAPDEDPDDLST 280 A TN E + + Sbjct: 311 AAYLREAQFRAQSLIPNSGMISTNDLVEPYEMYNI 345 >UniRef50_A3ZRY8 Iduronate-2-sulfatase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZRY8_9PLAN Length = 293 Score = 91.6 bits (225), Expect = 3e-17, Method: 
Composition-based stats. Identities = 46/341 (13%), Positives = 90/341 (26%), Gaps = 80/341 (23%) Query: 1 MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPD--REDAPHPRIKQLARFAHTHPGGPPCH 58 + P +V ++GQSN G+ LP+ + PH F G Sbjct: 18 IALAAQP---HVYLLSGQSNMQGIGKLADLPESVPHEMPHAFFWNGKTFEPLVLGKT--- 71 Query: 59 FNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPC 118 T+ G G + A ++ + ++ Sbjct: 72 --------------------------KISTRAGEFGPEVGFALQMASA---EHPVYLIKY 102 Query: 119 CRGGSAFTA-GSEGTYSERHGASHDACRWGTDTP-------LYQDLVSRTRAAL----AK 166 G + GT++ + LYQ ++ R L + Sbjct: 103 HASGMPLYHGWNGGTWAGTEPGPKRRNFYPGAAAADPNVGTLYQQMLKMYRTGLAQLAKQ 162 Query: 167 NPQNKFLGACWMQGEFDLMTSDYA-SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCG 225 + G WMQGE D + A ++ + + + DLK + P G Sbjct: 163 GEAPQVKGFLWMQGEMDAKGKESAITYAANLKRLRDRLAADLKLD-------ANLPMVYG 215 Query: 226 DTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGS 285 +E + + + D P+ + Sbjct: 216 QVLP---------HEPAHARFTHRTETRQQMADADADS-----GKPEA--------ITAA 253 Query: 286 AYRSPENWTTALRSSHFSTAARRGIISDRFVEAILQFWRER 326 S + + + H+S A + + + A+ + Sbjct: 254 KMVSTDGFEVLPDTVHYSAAG-QLRLGEAMAAAMKPLVEAK 293 >UniRef50_C7PT40 Sialate O-acetylesterase n=3 Tax=Bacteria RepID=C7PT40_CHIPD Length = 486 Score = 90.0 bits (221), Expect = 1e-16, Method: Composition-based stats. 
Identities = 37/275 (13%), Positives = 79/275 (28%), Gaps = 45/275 (16%) Query: 11 YVLTVAGQSNAM----AYGEGLPLPDREDAPH-PRIKQLARFAHTHPGGPPCHFNDIIPL 65 V +GQSN +G+ L + P I+ T Sbjct: 108 EVWIASGQSNMEMPLKGWGKILNHDQEIAVANYPEIRLFQLKHTTSTA------------ 155 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 DV G ++ +VG AR++ + ++ + ++ GG+ Sbjct: 156 --PQEDVNPWDGGWQACTPQSIPEFSSVG--YFFAREI--YTHEHIPVGVIHTSWGGTVA 209 Query: 126 TAGSEG-------TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWM 178 A + G +++ A + + A + F G W Sbjct: 210 EAWTSGESIKKIPAFADTVKAFESLPTPLSPSNP-NKATLLYNAMIHPLLPYAFRGVIWY 268 Query: 179 QGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS 238 QGE + + + F M++ +R+ D P++ + Sbjct: 269 QGEGNA--ERAYQYREVFPTMIKDWRKQWHN--------GDFPFYFVQLANF----KDKQ 314 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE 273 + + ++ A ++ + G + D Sbjct: 315 TQPVESDWAELREAQLMTLSLPNTGMATAVDIGDA 349 >UniRef50_B5CXD4 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CXD4_9BACE Length = 468 Score = 89.3 bits (219), Expect = 2e-16, Method: Composition-based stats. 
Identities = 37/258 (14%), Positives = 64/258 (24%), Gaps = 49/258 (18%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + P P Sbjct: 109 EVWLCSGQSNMEWSANMG--------------IMNGEQEVKQAACPSVRIFHNPKQGADT 154 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D + + A AR L + + I+ GG+ + Sbjct: 155 PQTDCRTKWEVATPESMRKTSA--TAYFFARYLTEHL--KVPVGILVSAWGGTPAEVWTP 210 Query: 131 GTYSERHG------ASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL 184 E+ LY ++ + G W QGE + Sbjct: 211 AEIVEKDEILSKNKLKDYPWWPIKPGVLYNQMI-------HPLVPYQIAGCIWYQGESN- 262 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 + S+ + + MVEA+R+D D P+ Y+ + PH+Y A Sbjct: 263 -HENAPSYARLVSKMVEAWRKDFG---------WDFPF-------YYVQIAPHTYGAKNN 305 Query: 245 NYQNNVLANIIFVDFQQQ 262 + + + Sbjct: 306 TPALLREQQELMLGQVKN 323 >UniRef50_A5FA09 Sialate O-acetylesterase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FA09_FLAJ1 Length = 459 Score = 88.5 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 70/246 (28%), Gaps = 36/246 (14%) Query: 11 YVLTVAGQSNA---MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 V +GQSN ++G + ++A + I+ + + Sbjct: 104 EVWVCSGQSNMEMSASWGIENGDEEVKNAANSNIRFFT-----------------VSKST 146 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 ++ G + VG A++L + N I ++ GG+ Sbjct: 147 AVTPQNNVLGNWVESTPETMKYFSAVG--YFFAKRLREDL-KNVPIGLISSNWGGTPAEI 203 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQN-KFLGACWMQGEFDLMT 186 + + + + R A+ K G W QGE ++ Sbjct: 204 WMPEEVVQNDPVLLENAKKLNEQEYGPHQPGRAYNAMIAPITGFKIAGTIWYQGESNVG- 262 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNY 246 + + + ++ ++R+ + P++ Y S + + Sbjct: 263 --SLVYDKTLSALIASWRKAWND---------EFPFYFVQIAPYKNGTNNFSNVTVRNSQ 311 Query: 247 QNNVLA 252 + + Sbjct: 312 RKILKE 317 >UniRef50_UPI0001BC840D sialic acid-specific 9-O-acetylesterase n=1 Tax=Bacteroides sp. 
D2 RepID=UPI0001BC840D Length = 615 Score = 88.5 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 62/221 (28%), Gaps = 45/221 (20%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN +PL E P +++ A H Sbjct: 125 EVWICSGQSNME-----MPLGGWEGCPVDNMEEAVTNADKHSEIRMLTVESNSATVSQNE 179 Query: 71 DVQDMQGYHHPLATNHQTQYGTVG-QALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 D Q A + A +L + N + +V GGS + Sbjct: 180 LKGDWL-------VASSGQVKRFSAVAYYFALELQEKL--NVPVGLVVAAWGGSDIESWL 230 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 G Y + L + G W QGE ++ + Sbjct: 231 PGGDK------------------YNGM-------LYPCHKYAAKGFLWYQGESNVW--KW 263 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 + ++ +V+++R + L + P++ + Y Sbjct: 264 YEYQKNMKELVKSWRSLWE---GSLGTGENMPFYYAEIAPY 301 >UniRef50_D2R922 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R922_9PLAN Length = 240 Score = 87.7 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 59/197 (29%), Gaps = 25/197 (12%) Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAG---ILIVPCCRGGSAFTAG 128 D +G H L + L A+ +P + + G ++IV R G Sbjct: 24 ASDTRGVHLILLSGQSNMAN-----LDPAQVFIPEVERHFGAENVVIVKVARSGQPIRRW 78 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 Y + + D LY+ L++ AL P WMQGE D+ Sbjct: 79 ----YKKWTVTGKQNPKEIGD--LYEQLMATAEKALNGRPIQS-ATLIWMQGERDVKERL 131 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQN 248 A + F MVE + DL + G + N +++I Q Sbjct: 132 SAHYKVAFLGMVEQLKTDLDVPQ--------INYIIGRISDSGPGNKD--WDSIRDVQQK 181 Query: 249 NVLANIIFVDFQQQGER 265 + Sbjct: 182 LGESGPHSAWINTDDLN 198 >UniRef50_C6Y155 Sialate O-acetylesterase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y155_PEDHD Length = 477 Score = 86.6 bits (212), Expect = 1e-15, Method: Composition-based stats. 
Identities = 27/237 (11%), Positives = 58/237 (24%), Gaps = 32/237 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + P ++ Sbjct: 106 EVWLCSGQSNME---YAMHKLKTMKKPLNEKLGF-PANEVANAHNNLIRIFVVNRKDLAK 161 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + ++ + A++L + + + ++ GSA Sbjct: 162 PDTNPKNWNVAADPALKNSSA---AGYFFAKELQKQL--HVPVGMITSAVSGSAIEPWIP 216 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 A+ A L D + G W QGE + + Sbjct: 217 --------AAILAGPDFQGQYLGSDPGKFYPTMIEPLMPFAIKGVIWYQGETNCFRRENI 268 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQ 247 ++ ++ ++RR K ++Y+ + P Y N + Sbjct: 269 TYSYKMKALINSWRRGWKNP---------------GMSFYYVQIAPFQYSKTMQNVE 310 >UniRef50_B3C638 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3C638_9BACE Length = 465 Score = 86.2 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 58/213 (27%), Gaps = 21/213 (9%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + + LP P LA P ++ P Sbjct: 90 EVWLCSGQSNMV---YKMKLPGNYALPAKGE-NLAALELRKPANEMIRV-FVVRRDDKPV 144 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + G + + L + + I+ GS + Sbjct: 145 SWKVADGESLAEVSA---------VGYFFGKALQEQLD--VPVGIITAAVNGSRIETWTS 193 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLV--SRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 E + G ++ R + G W QGE + M D Sbjct: 194 KEAYEHSPVFGPQLKEGEGKM--DGMIPGGRYETMVLPLIPFAIKGCLWYQGESNCMIRD 251 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAP 221 + + + +V+++R + ++ AP Sbjct: 252 -RQYAEKYQVLVDSWREAFNVPGAPFYSVLLAP 283 >UniRef50_B5Y539 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y539_PHATR Length = 333 Score = 85.8 bits (210), Expect = 2e-15, Method: Composition-based stats. 
Identities = 40/312 (12%), Positives = 75/312 (24%), Gaps = 45/312 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +AGQSN + H I + Sbjct: 36 KVFLLAGQSNMVGMASV---------QHLEILINDHNITHNDFREDLWNGTGFRSRDDVF 86 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + L + G L + DN P G + Sbjct: 87 VKYNDR--VGKLEPGYGASVSKFGPELGFGWTVGDAFTDN------PLGGGRNLAVDFRP 138 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAK----------NPQNKFLGACWMQG 180 E ++G + Y+ ++ L + + G W QG Sbjct: 139 PLSGEGQFPDVKPSKYGWE---YRQMIHAILDGLEAIQEIYPDYCEDQGYQLCGFVWFQG 195 Query: 181 EFDL-MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 D+ + + +++ RR+ + P+ G+ + HS Sbjct: 196 WNDMLSWPFVREYGFNLANLIRDIRRETDEPS--------LPFVVGELGMHGNLTGDHST 247 Query: 240 EAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRS 299 A + + + Q + +P + Y Y A Sbjct: 248 AATRVKTIRAMEQGVTLLSEFQNNTIFVKTSPYV---INNGTKYNKIYHYNG---RADTY 301 Query: 300 SHFSTAARRGII 311 H A RG++ Sbjct: 302 YHMGKAFGRGLL 313 >UniRef50_B7ALN1 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7ALN1_9BACE Length = 442 Score = 85.4 bits (209), Expect = 2e-15, Method: Composition-based stats. 
Identities = 42/253 (16%), Positives = 74/253 (29%), Gaps = 37/253 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPH---PRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 V +GQSN +++ PH I+ + P Sbjct: 76 EVWLCSGQSNMEWNSAKGIQDVKDELPHANNSEIRFFTVEKQ----------TSLFPQDD 125 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 C +G + + VG +KL + I ++ GG+ Sbjct: 126 C-------RGRWMVCSRETLKLFSAVG--YFFGKKLNQELDS--PIGLINSSWGGTNIET 174 Query: 128 GSEGTYSERHGAS-HDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 E H T T + A + + G W QGE +L Sbjct: 175 WMPHETMEEHPEFAKALQTLSTSTGWDISPSTTYNAMIHPLMPIRLAGIIWYQGEANLAN 234 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS--YEAIYG 244 +Y + F M++A+R Q + + P++ Y + F + Sbjct: 235 GNY--YASMFTSMIKAWR--------QGFSDDELPFYYVQIAPYARYKFASRAAAQVREQ 284 Query: 245 NYQNNVLANIIFV 257 Y+ + L N+ V Sbjct: 285 QYEVSKLPNVGMV 297 >UniRef50_A6DRW1 Acetyl xylan esterase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DRW1_9BACT Length = 236 Score = 85.4 bits (209), Expect = 3e-15, Method: Composition-based stats. 
Identities = 30/235 (12%), Positives = 68/235 (28%), Gaps = 41/235 (17%) Query: 93 VGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPL 152 + + + ++++ +GG Y + A + + + L Sbjct: 39 MDPNIAFIPAVEKAF-GKENVVVIHDAQGGQPIRRW----YKDWEPA--NGEKPTSTGML 91 Query: 153 YQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHS 212 Y+ ++ + + K + + WMQGE D + + ++E DL + Sbjct: 92 YKRMMRKI-LPIVKKHKFSSVTFVWMQGEADAQAKHGDVYKKSLIGLIEQLSNDLGR--- 147 Query: 213 QLNNITDAPWFCGDTT--WYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNA 270 D G + + FPH + I +N Sbjct: 148 -----KDLNVVIGRLSDCDLTNKRFPH-WTMIRDIQMEVAESNPRSAWVNTDDLND---- 197 Query: 271 PDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGIISDRFVEAILQFWRE 325 G G ++ H+S + + +RF E ++ ++ Sbjct: 198 --------GKGENGRKLKNN---------LHYSIEGYKQ-LGERFAEKSIELIKK 234 >UniRef50_D2R5K6 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5K6_9PLAN Length = 373 Score = 84.7 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 64/240 (26%), Gaps = 35/240 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V + GQSN + +G P + +K+ ++ + + H Sbjct: 57 KVYIMLGQSNMLGFGRVGP-KELNGTLEYMVKEKGKYPYLIDDAGAWTTRQDVRYVHVMD 115 Query: 71 DVQDMQGYHHPLATNHQ---TQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF-- 125 T + G L L F + +L++ C G + Sbjct: 116 QRGVDYKDMEKFGDVRNEWLTPNKSFGPELGFGHVLGTFHEE--PVLLLKACIGNRSLGW 173 Query: 126 -------TAGSEGTYSERHGASHDACRWGTDTPL----------YQDLVSRTRAALAKNP 168 G + W T Y + +A L Sbjct: 174 DLLPPGSERFEHEG-KIYAGYKDVSNFWEKGTEPKPVSWYAGRQYDADTAHAKAVLKNLE 232 Query: 169 QN---------KFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITD 219 + + G W QG D + + Q+ H+++ R+D +++ T Sbjct: 233 KYYPGYKGQGYEVAGFVWWQGHKDQNAAHAGRYEQNLVHLIKTLRKDFDAPNAKFVVATG 292 >UniRef50_C9KWT4 Sialic acid-specific 9-O-acetylesterase n=16 Tax=Bacteroides RepID=C9KWT4_9BACE Length = 478 Score = 84.7 bits (207), Expect = 4e-15, Method: Composition-based stats. 
Identities = 40/306 (13%), Positives = 82/306 (26%), Gaps = 51/306 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 + +GQSN +P+ ++ P P + T Sbjct: 111 ELWLCSGQSNME-----MPMKGFKNQP-----VENANMDILRSKNPNIRLFTVKRTSTFT 160 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D+ G + + A + R + + + +V GGSA A Sbjct: 161 PQTDVAGTWKEATPANVRDFSA--TAYYFGRLINEILD--VPVGLVVAAWGGSACEAWMT 216 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 + + + + L G W QGE + + Sbjct: 217 ADWLKAFPDAKIPQSEADIKSKNRTPTVLYNGMLHPLIGMTMKGVIWYQGEDN--WNRAH 274 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNV 250 ++ F ++ +R + KQ D P+ Y+ + P+ Y I + + Sbjct: 275 TYADMFTTLINGWRAEWKQ--------GDFPF-------YYCQIAPYDYGIITEKGKEVI 319 Query: 251 LANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGI 310 N ++ Q + ++G + A++ I Sbjct: 320 --NSAYLREAQAKV---------EHRVPNSGM--AVLLDAGMEKGIH-------PAKKQI 359 Query: 311 ISDRFV 316 +R Sbjct: 360 AGERLA 365 >UniRef50_C0D050 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C0D050_9CLOT Length = 260 Score = 84.7 bits (207), Expect = 5e-15, Method: Composition-based stats. 
Identities = 27/208 (12%), Positives = 42/208 (20%), Gaps = 48/208 (23%) Query: 8 DYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 + + GQSN G P P Sbjct: 7 KEHDLFLFLGQSNMAGRGVTSP-------------------QWPESAPALTPGAGYEYR- 46 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQAL---HIARKLLPF--IPDNAGILIVPCCRGG 122 D + P N G + + + ++ V +GG Sbjct: 47 AISDPGRLHPASEPFGVNENNPDGICEPGMKTGSMVTAFINAYYARTKIPVIGVSASKGG 106 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKF----LGACWM 178 SA D D + R + + + W Sbjct: 107 SAIGQWQ------------------GDGDYLSDALMRLKRTGKFLKEQEITVRHRYMLWC 148 Query: 179 QGEFDLMTSDY-ASHPQHFNHMVEAFRR 205 QGE D + F +M R Sbjct: 149 QGETDGDLGTSPEDYKARFTNMFSQLRE 176 >UniRef50_A6L837 Sialic acid-specific 9-O-acetylesterase n=8 Tax=Bacteroidales RepID=A6L837_PARD8 Length = 473 Score = 83.9 bits (205), Expect = 7e-15, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 51/229 (22%), Gaps = 39/229 (17%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + H + + Sbjct: 107 EVWFGSGQSNME---------------------MPLRGFWHCPIEGGNHAIATSGKYKNS 145 Query: 71 D---VQDMQGYHHPLATNHQTQYGTVGQ---------ALHIARKLLPFIPDNAGILIVPC 118 G P ++ A A L + N + I+ C Sbjct: 146 IRYATIQRVGALEPKDYPVGGEWKVCDPRNAPEFGATAYFFATLLTEVL--NVPVGIINC 203 Query: 119 CRGGSAFTAGSE-GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACW 177 GGS + L + G W Sbjct: 204 SWGGSTVEGWLPKDILRNYSDIDLSLAGNDEKIHPMLQPMIMYNGMLKPASKYTVRGFLW 263 Query: 178 MQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGD 226 QGE ++ D + + MVE +R Q + AP+ G+ Sbjct: 264 YQGESNVGHPD---YAKRLATMVEHWRGLWGQEELPFYLVEIAPYEYGE 309 >UniRef50_B7AIP4 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B7AIP4_9BACE Length = 466 Score = 83.9 bits (205), Expect = 8e-15, Method: Composition-based stats. 
Identities = 50/293 (17%), Positives = 80/293 (27%), Gaps = 54/293 (18%) Query: 11 YVLTVAGQSNA---MAYGEGLPLPDREDA----PHPRIKQLARFAHTHPGGPPCHFNDII 63 V +GQSN M P+ DA + +I+ D I Sbjct: 110 EVWLCSGQSNMDMRMGGRYDEPVIGSLDAIVTSRNSQIRMFTVDGKM----------DSI 159 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 PL C Q+ P T A KL + + I+ GGS Sbjct: 160 PLADCEGRWQEASSETVPDFTA---------VGYFFACKLNSVL--GVPVGIIHASYGGS 208 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 A E D + LY + L+ G W QGE + Sbjct: 209 RVEAWMS---KESIEPYKDLQEVRNGSILYNGM-------LSPVIGYGIRGCLWYQGEAN 258 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIY 243 + D + Q F +V +RR K I + P++ + + Sbjct: 259 IDAPD--LYTQLFPALVNDWRRQWK--------IGEFPFYYAQIAPFNYNKGEEKGKN-S 307 Query: 244 GNYQNNVLANIIFVDF-----QQQGERGLTNAPDEDPDDLSTGYYGSAYRSPE 291 + + + ++ T P E + Y + R+ Sbjct: 308 AFLREAQMKCLRYIPSSGMIVLTDVGDAHTIHPMEKETVGNRFAYLALGRTYG 360 >UniRef50_A6DFR1 Acetylxylan esterase related enzyme n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFR1_9BACT Length = 240 Score = 83.5 bits (204), Expect = 1e-14, Method: Composition-based stats. 
Identities = 30/217 (13%), Positives = 67/217 (30%), Gaps = 44/217 (20%) Query: 112 GILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK 171 +++V +GG + Y E + +Y L++ T+ A++ + Sbjct: 58 KVIVVRVAQGGQPISKW----YKEWKSSK--GEVDPKKGAIYDKLMTETKKAISGQKIST 111 Query: 172 FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW-- 229 WMQGE D + + + + + DL + TD + G + Sbjct: 112 V-TFIWMQGEADSKAGNGDVYLKSLKGLQKQLEDDLGR--------TDINFVIGRLSDSG 162 Query: 230 -YWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYR 288 + K P + A + F +++P + + Sbjct: 163 FFKKGTIPRENSK----WAEVQKAQMDF--------------AEQNPR--------AYWI 196 Query: 289 SPENWTTALRSSHFSTAARRGIISDRFVEAILQFWRE 325 +++ H+ + +VE L+ +E Sbjct: 197 DTDDFNGEKNELHYIKPEGYEQLGKAYVETALKALKE 233 >UniRef50_A6DIF1 Acetyl xylan esterase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DIF1_9BACT Length = 240 Score = 83.1 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 39/139 (28%), Gaps = 13/139 (9%) Query: 104 LPFIPDNAGILIVPCCRGGSAFTAGSEG-----TYSERHGASHDACRWGTDTPLYQDLVS 158 + + I+ + G YQ ++ Sbjct: 39 AENLFKDEKIVYIKKSMGSQYIHRWLPEWDEIAKSKGLEENHRKKILRDGKVLYYQPILD 98 Query: 159 RTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNIT 218 + + L K P+ + CWMQGE D ++ ++ FRRDLK+ Sbjct: 99 QYKELLKKYPKPASVTFCWMQGESDAQGRVSVAYKDSLKLLISNFRRDLKRP-------- 150 Query: 219 DAPWFCGDTTWYWKENFPH 237 D G Y + Sbjct: 151 DMNVVIGRIADYALDKKEC 169 >UniRef50_C6Y117 Sialate O-acetylesterase n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y117_PEDHD Length = 482 Score = 83.1 bits (203), Expect = 1e-14, Method: Composition-based stats. 
Identities = 32/226 (14%), Positives = 60/226 (26%), Gaps = 23/226 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V GQSN +P+ + P + P +P + Sbjct: 111 EVWFCGGQSNME-----MPMKGFKGQP-----VIGSNEAILKAANPQIRLYTVPRSSVTE 160 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + A + A + R L + + ++ GGS+ A Sbjct: 161 RQDNSKASEWKAAAPETVANFSA-TAYYFGRLLSEML--QVPVGLINDSYGGSSIEAWMS 217 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 + + T + + + L G W QGE + Sbjct: 218 PAQLKAFPEVKIPAKTDTIKEVSRTPTTLYNGMLYPVIGYGIRGVIWYQGESNYDR--AD 275 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 + + F MV+A+R + P++ Y P Sbjct: 276 RYEELFPAMVKAWREAWGM--------GEFPFYYAQIAPYNYAQLP 313 >UniRef50_A6DGA5 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGA5_9BACT Length = 1889 Score = 83.1 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 43/295 (14%), Positives = 71/295 (24%), Gaps = 33/295 (11%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 V +AGQSN G L + + H I P Sbjct: 20 PQAEVDVFIIAGQSNVNGRGLVSNLSNDQKTQEAMFY-----GSWHKFTNNASVASINPQ 74 Query: 66 THCPHDVQDMQGYHH--PLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 Q + G + + G + A + I I+ GS Sbjct: 75 LFSGWRSQTIAGETRNDGNISENFGSSAWFGPEVGFAARANEINLSPNPIAIIKYAVDGS 134 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK---FLGACWMQG 180 A + ++ A+ P F G W QG Sbjct: 135 AINNSVNPQLAGTTSDWDTVATGEYAGDCWRGFQIALDKAIGDLPVGTTPNFKGMVWWQG 194 Query: 181 EFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 E + N + R L H + N +D P + + + + Sbjct: 195 ESGTNAAA-------LNTFIGEVRSHLATNHG-VQNASDFPVVITGSFAWGADLKSLVSD 246 Query: 241 A------------IYGNYQNNVL---ANIIFVDFQQQGERGLTNAPDEDPDDLST 280 G + N L N +D G + + D+++T Sbjct: 247 PDDDIGYIDPNDFGQGGWANVHLGSGENGQSLDVDGNGVNDMFDIGQAYADEMAT 301 Score = 78.9 bits (192), Expect = 3e-13, Method: Composition-based stats. 
Identities = 42/236 (17%), Positives = 67/236 (28%), Gaps = 30/236 (12%) Query: 2 NAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 N V +AGQSNA G L + + + +F++ Sbjct: 1200 NTPTDSTEVDVYIIAGQSNAYGQGLISDLNIDQQIQNALFFTSWHEIDGN-AESQQYFSN 1258 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 ++P T + + G L ++ + I I+ G Sbjct: 1259 VLPFTEAGFSKGNPGQSNL-------GGSDYFGPELGFVNRINELGINANPIAIIKYAVG 1311 Query: 122 GSAFT------AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGA 175 G++ T E+ G C G + L + S AKN G Sbjct: 1312 GTSLTYNPAVSDWDLNETGEQDGD----CWRGFQSALSNGITSL----EAKNYVPNIKGV 1363 Query: 176 CWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 W QGE +D N + R L + + N P+ T W Sbjct: 1364 IWWQGESGTSATD-------LNSFIAEVRNHLNV-NYGVQNSAQLPFVITGTDNAW 1411 Score = 51.5 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 61/250 (24%), Gaps = 36/250 (14%) Query: 2 NAIISPDYYYVLTVAGQSNAMA--------YGEGLPLPDREDAPHPRIKQLARFAHTHPG 53 + +S Y + + N G P K + P Sbjct: 571 STPVSDPTYDITFI----NVDGSQTVQNVEEGTVANPPAGASTVD---KTFTAWPTIDPA 623 Query: 54 GPPCHFNDIIPLTHCPHDVQDM---QGYHHPLATNHQTQYGTVGQALHIARKLLPF-IPD 109 + + ++ D+ G + + + G I KL + Sbjct: 624 YENAVYEALYEPVSSGGNLVDVFIATGQSNAAYPLYDGEENKFGFGRGIQTKLTESGLFS 683 Query: 110 NAGILIVPCCRGGSAFTAGS----EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALA 165 N +++ G YS D G L ++ +A Sbjct: 684 NPTVVL--AGEPSRPIAYWWAEFWPGAYSGYDSNFFDIDNSGEGQ-----LEAKINEIIA 736 Query: 166 KNPQNKFLGACWMQGEFD------LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITD 219 +F G W QGE D + A + +N ++ DL+ Sbjct: 737 NGDTPRFRGFFWFQGESDGLGPYATADTSQADYETRWNGLLAQLDSDLQAAGVSNTEYKF 796 Query: 220 APWFCGDTTW 229 ++ Sbjct: 797 VVNTVAESGD 806 >UniRef50_B5JPM4 Conserved domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPM4_9BACT Length = 468 Score = 83.1 bits (203), Expect = 1e-14, Method: Composition-based stats. 
Identities = 53/303 (17%), Positives = 81/303 (26%), Gaps = 19/303 (6%) Query: 11 YVLTVAGQSNAMA--YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V ++GQSN G P P A Sbjct: 101 EVWLLSGQSNMELPLAGWPDAEPPCPIEGGPEAIAAADHPQIRFIIAGQKPAASHQAEIS 160 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + HP + AR L + I ++ GGSA A Sbjct: 161 SNSPLPAWTVCHPDTVPEFSA-----VGYFFARALKEKV--RVPIGLIQSTWGGSACEAW 213 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRT-RAALAKNPQNKFLGACWMQGEFDLMTS 187 + + + + +P S +A G W QGE ++ Sbjct: 214 TPSHALKTLEDFRNLAPFAPQSPDDNYTPSVLFNGMIAPLAPFTLAGILWYQGESNVGRH 273 Query: 188 DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGD---TTWYWKENFPH---SYEA 241 F M++++R Q APW + +W+ A Sbjct: 274 --QQLETLFPAMIKSWRATFDQPELPFYFAHIAPWAGYERDTLPKFWQAQASALDLPNTA 331 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 + ANI + GER A D S+ G Y+S + + H Sbjct: 332 LAVTIDCGDSANIHPPHKKPIGERFAQLALANRTDGYSSQSTGPNYQSRSIQDSQ-LTLH 390 Query: 302 FST 304 FST Sbjct: 391 FST 393 >UniRef50_Q6KCY2 Hypothetical adenine-specific methylase YfcB n=1 Tax=Escherichia coli RepID=Q6KCY2_ECOLX Length = 166 Score = 82.7 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 50/121 (41%), Positives = 62/121 (51%), Gaps = 13/121 (10%) Query: 9 YYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 ++YV+ +AGQSN MAYGEG+PLPD D P R+KQLAR PGG C FN+IIP H Sbjct: 13 FWYVIALAGQSNGMAYGEGIPLPDTLDKPESRVKQLARRKTITPGGKECKFNEIIPADHA 72 Query: 69 PHDV-----QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 ++ +G H + N Q Q LH FIP G + VP R Sbjct: 73 LNNTVFFAGGQTRGAH--VFRNIQRQVERRQHQLH------GFIPRVIGAVAVPDIRRAE 124 Query: 124 A 124 A Sbjct: 125 A 125 >UniRef50_B3CHU5 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CHU5_9BACE Length = 465 Score = 82.3 bits (201), Expect = 2e-14, Method: Composition-based stats. 
Identities = 44/286 (15%), Positives = 75/286 (26%), Gaps = 40/286 (13%) Query: 11 YVLTVAGQSNAMAY-GEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V +GQSN G P + G P + Sbjct: 109 EVWLCSGQSNMDMRVGGRYSDP-----------VIGSLDVIVTSGNPGIRMFTVGSKMTS 157 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + D +G ++ ++ G ARKL + + I+ GGS A Sbjct: 158 EPLTDCKGGWQEASSETVPEFSAAG--YFFARKLNQVL--GIPVGIIHASYGGSRVEAWM 213 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 E D + LY + L+ G W QGE ++ D Sbjct: 214 S---KEGVAPYKDLPDVHNASILYNGM-------LSPVVGYGIRGCLWYQGEANVDAPD- 262 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIY----GN 245 + Q F +V +R+ I + P++ + + Sbjct: 263 -LYTQLFPSLVSDWRKQWG--------IGEFPFYYAQIAPFNYNKGEGKGKNSAYLREAQ 313 Query: 246 YQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPE 291 + L + T P E + Y + R+ Sbjct: 314 VKCLHLIPSSGMVVLTDVGDDRTIHPMEKETVGNRFAYLALGRTYG 359 >UniRef50_C7PN81 Sialate O-acetylesterase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PN81_CHIPD Length = 469 Score = 82.0 bits (200), Expect = 3e-14, Method: Composition-based stats. 
Identities = 43/291 (14%), Positives = 78/291 (26%), Gaps = 29/291 (9%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P P + + + R P +PL + Sbjct: 106 EVWVCSGQSNMERQ--LGPRPPQLPITN---WEQERDK----ANYPLIREYYVPLKYSAE 156 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + D+ + + VG +L + + + GG+ + Sbjct: 157 KIPDVHQQWTVCSPQTAADFSAVG--YFFTSELYKKL--KIPVGFIFTAYGGTPAEDWTS 212 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKF-----LGACWMQGEFDLM 185 A D + + + ++ L G W QGE + Sbjct: 213 EEALNSDPALADFSKNYDNIKYGYMPQGKKKSGLYNGMIYPILPYAVKGVAWYQGEAN-- 270 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGN 245 +P ++M+ +R D Q D P+ Y + Sbjct: 271 NELPGIYPAILSNMIRNWRTDFGQ--------GDIPFLIVQIAPYKDMTPEIREAQLLVT 322 Query: 246 YQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 Q A I+ D + +N L+T G Y+ P + Sbjct: 323 KQVKNTALIVTTDCGDPKDIHPSNKRPVGER-LATAAIGMVYQQPGEYAGP 372 >UniRef50_Q8A331 Putative sialic acid-specific acetylesterase n=4 Tax=Bacteroides RepID=Q8A331_BACTN Length = 479 Score = 81.6 bits (199), Expect = 4e-14, Method: Composition-based stats. 
Identities = 48/312 (15%), Positives = 79/312 (25%), Gaps = 55/312 (17%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAP---HPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 V +GQSN +P+ + P I A+ + + Sbjct: 105 EVWFCSGQSNME-----MPMGGFDRQPVRGTNDIIAKAKPSTPIRMYTTDSKDGRWVRQF 159 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 V+D QG + + V + + AR + + + IV GGS A Sbjct: 160 SKTPVEDCQGEWLENTPVNVSHISAV--SYYFARYIQEVL--EVPVGIVVSTWGGSKIEA 215 Query: 128 GSEGTYSERHGASHDACRWGTD--TPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 + + + +A G W QGE + Sbjct: 216 WMSRESIKPFSSIDLSILDNDAEVKNPTATPCVLYNGKIAPLTNFAVRGFLWYQGESN-- 273 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE----A 241 + + V R + G+ +Y+ + P YE Sbjct: 274 RDNADLYQSLMPAFVADLRAKWGR---------------GELPFYFVQIAPFDYEGADGT 318 Query: 242 IYGNYQNNVLAN--------------------IIFVDFQQQGERGLTNAPDEDPDDLSTG 281 + L N I VD + G R A + G Sbjct: 319 SAARLREVQLQNMKDIPNSGMVTTMDVGHPVFIHPVDKETVGNRLAYWALAQTYGMKGFG 378 Query: 282 YYGSAYRSPENW 293 Y Y+S E Sbjct: 379 YAPPVYKSMEIQ 390 >UniRef50_Q8A2Y4 Sialic acid-specific 9-O-acetylesterase n=10 Tax=Bacteroides RepID=Q8A2Y4_BACTN Length = 477 Score = 81.2 bits (198), Expect = 5e-14, Method: Composition-based stats. 
Identities = 38/244 (15%), Positives = 68/244 (27%), Gaps = 21/244 (8%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN +G + + + +P T Sbjct: 105 EVWICSGQSNMEMTMKGNMGQPIDHSLETLLNAGNYRDRIR--------FITVPRTKGVK 156 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + D +G +++ T + AR+L + + + +V GGS A Sbjct: 157 ERTDFEGAKWEVSSPETTMDCSA-AGYFFARQLTETL--HLPVGLVINSWGGSRIEAWMT 213 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 ++ L L G W QGE +L +Y Sbjct: 214 EETLASVKGANIEAAKNPKLDTNSRLQCLYNIMLLPVKNYTARGFLWYQGESNLF--NYQ 271 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNV 250 + MV+ +R + D P++ + N + A+ Q Sbjct: 272 IYAPMMTAMVQLWRNVWETP--------DMPFYYVQIAPHKYGNSRNINSALLQEAQMKA 323 Query: 251 LANI 254 L I Sbjct: 324 LQTI 327 >UniRef50_B9XHK5 Autotransporter-associated beta strand repeat protein n=1 Tax=bacterium Ellin514 RepID=B9XHK5_9BACT Length = 1318 Score = 81.2 bits (198), Expect = 5e-14, Method: Composition-based stats. 
Identities = 36/250 (14%), Positives = 68/250 (27%), Gaps = 33/250 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V AGQSN + LA P N +P T Sbjct: 163 DVWLCAGQSNMQ-----------FSLNEIGVTNLAAE-IADSANYPAIRNFSVPFTSSLT 210 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHI-ARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 ++ A T G A + AR++ + I I+ GS + Sbjct: 211 AETNLPSGSWQGAGPSTT--GGFTAAGYFTAREIYKQ--QHIPIGIIRSAWAGSEIKSWL 266 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAAL-AKNPQNKFLGACWMQGEFDLMTSD 188 + + +D +S A+ + W QGE+++ Sbjct: 267 DPLFVSEICDFTQPIFDQAGQTPGRDTISGPYNAMIRPLSPFRIKAVEWYQGEYNVGWP- 325 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI-YGNYQ 247 + + ++ +R Q + P+ + +A+ G++ Sbjct: 326 -EQYSRLLPGLMSNWRSLFGQPN--------LPFVIIQLPNFGN----TQSQAVETGSWA 372 Query: 248 NNVLANIIFV 257 A + V Sbjct: 373 ELREAQLKTV 382 >UniRef50_UPI000180C4CE PREDICTED: similar to LOC495015 protein n=3 Tax=Ciona intestinalis RepID=UPI000180C4CE Length = 538 Score = 81.2 bits (198), Expect = 5e-14, Method: Composition-based stats. 
Identities = 49/316 (15%), Positives = 88/316 (27%), Gaps = 32/316 (10%) Query: 11 YVLTVAGQSNA-----MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 V GQSN MA+ L P+ R+ + P H + Sbjct: 124 DVWVCGGQSNMKFTVAMAFNASYELSLAPKYPNVRVMTVGDDRS----NEPLHDFKNLEQ 179 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 + G + + R L F+ I +V GG+ Sbjct: 180 PWSVPSKHSLTGDNVEWTYFSSLCWL-------YGRYLYDFL--KVPIGLVSSNWGGTPI 230 Query: 126 TAGSE-------GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWM 178 S G + R T T + + + A + GA W Sbjct: 231 ETWSSTDALKKCGLHLCDEPDFKLEMRNRTITKVPRSNSALWNAMIHPMLNMTIKGAIWY 290 Query: 179 QGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS 238 QGE + + + F M+ +R ++ P+ + + + P+ Sbjct: 291 QGETNYDYHNA-EYACSFPAMINDWRDKW-LEGTRGQTDKKFPFGFVQISTTTESDRPND 348 Query: 239 YEAIYGNYQNNVLANIIFVDFQQ-QGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTAL 297 + Y + AN +V ++ + D D + G + ++ L Sbjct: 349 GQ--YPVIRWRQTANYGYVPNEKMRNTFMAVAMDLADNDSPTGGIHPRDKQTVAARL--L 404 Query: 298 RSSHFSTAARRGIISD 313 S R+ I D Sbjct: 405 LGSMRVAYGRKDINPD 420 >UniRef50_B7ALS5 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7ALS5_9BACE Length = 526 Score = 81.2 bits (198), Expect = 5e-14, Method: Composition-based stats. 
Identities = 39/274 (14%), Positives = 66/274 (24%), Gaps = 81/274 (29%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPP---CHFNDIIPLTH 67 V GQSN R ++ HF + + + Sbjct: 113 EVWLATGQSNME-----------------RKLFMSENGEAVVRSANNQNIHFLIVPQVYY 155 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 H+V + A + A KL + N I I+ C RGG+ A Sbjct: 156 KGHNVNKEMKWQTATAPQVANMSAI---GYYFACKLQKEL--NVPIGIICCYRGGTPAEA 210 Query: 128 GSEGTYSERHGASHDACRWGTDTP-----LYQDLVSRTR--------------------- 161 + ER LY+ + + Sbjct: 211 WVKEETLERDDKLRPILDNYRKVAIMDDTLYEKKMEEYKVLYRVYNDSVTMGYKNVVRPF 270 Query: 162 ----------------AALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRR 205 L + G W QGE + + F ++ +R Sbjct: 271 EPVGPKHHKRPCGLYHTMLQRIIPYTAKGVIWYQGEGNA--ERAEQYRVLFPALIRQWRT 328 Query: 206 DLKQYHSQLNNITDAPWFCGDTT----WYWKENF 235 D Q D P++ + +W++ Sbjct: 329 DFHQP--------DLPFYFVQLSNFDHPFWQDRP 354 >UniRef50_A6DQW4 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQW4_9BACT Length = 527 Score = 80.8 bits (197), Expect = 6e-14, Method: Composition-based stats. 
Identities = 41/278 (14%), Positives = 75/278 (26%), Gaps = 51/278 (18%) Query: 11 YVLTVAGQSNAMAY-GEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V AGQSN G+ P P + D +I ++ + G L+ Sbjct: 102 EVWLCAGQSNMAGRFGDKSPFPAKYDK--KKISRMRYNGKGNKGWDSIDEKSAKSLSR-- 157 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 A + L + N I ++ GG+ A Sbjct: 158 -------------------------VAFYFGVNLYEEL--NIPIGLITRHNGGTPMQAWM 190 Query: 130 EGTYSE--------RHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 +E G +A + Y D + GA W QGE Sbjct: 191 NADDAEVARKALNIPEGWREEAKKQRKPGYQYNDKLKEV-------IPYAIRGAIWYQGE 243 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT-TWYWKENFPHSYE 240 + ++ + + H ++++R+D + P++ T N+ + Sbjct: 244 RNAKSNTALEYDKLTVHFLDSWRKDWGERSGL--ETRKFPFYYIQIPTDIHLRNYEFPWV 301 Query: 241 AIYGNYQNNVLANIIFVDFQQQGERGL-TNAPDEDPDD 277 + N F +QG + Sbjct: 302 RDRQRRALEITENTGMAIFWEQGPGLHPADKSLAGKRL 339 >UniRef50_C5SLN9 Sialate O-acetylesterase n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SLN9_9CAUL Length = 488 Score = 80.8 bits (197), Expect = 7e-14, Method: Composition-based stats. 
Identities = 36/277 (12%), Positives = 69/277 (24%), Gaps = 44/277 (15%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + + A P + Sbjct: 111 EVWLASGQSNME---------KPFRNQPGQKDVINADAELAGANMPQIRLFKVKKARSTT 161 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG-S 129 D++G A ++L N + ++ GG+ + Sbjct: 162 PAADVEGQWVITTPETLDTSRFSAVAYVFGKRL--HTTLNTPVGLIDSTWGGTRIEPWTA 219 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTS-- 187 + A D+ + ++ G W QGE +LMT+ Sbjct: 220 PEGMAAVPTLKRYADVKPGQKVDGSDISGLYNSMISPLAPFGLKGVIWYQGESNLMTAGE 279 Query: 188 ---DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 A + ++ +R+ Q D +Y+ + PH Y + Sbjct: 280 FPGGAADYTDKMEALIGGWRKVFGQ----------------DLAFYYVQLAPHLYHVVRP 323 Query: 245 NYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTG 281 +A+ + A +TG Sbjct: 324 AQ----VASAEGLPQMW-------EAQTAALRIPNTG 349 >UniRef50_A6DFP9 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFP9_9BACT Length = 516 Score = 80.8 bits (197), Expect = 7e-14, Method: Composition-based stats. 
Identities = 40/343 (11%), Positives = 75/343 (21%), Gaps = 88/343 (25%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQL-ARFAHTHPGGPPCHFNDIIPLTHCP 69 V +GQSN L + P + P Sbjct: 107 DVWLCSGQSNME-------------------WALRGTNGSSEKANYPQIRLFRVKTNPSP 147 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + + G A + L + N I ++ GG+ A + Sbjct: 148 EVQTETDASWTACTQKSARDFS--GTAYFFGKTLHEEL--NVPIGLIQSAVGGTCIEAWT 203 Query: 130 -------------------------------------EGTYSERHGASHDACRWGTD--- 149 + Y E+ + + G Sbjct: 204 EKEQQLSDPMSVHRIKLGDKAIEGFDSAVAKKENQKLQSDYQEKLASWEKGGKNGKRPRP 263 Query: 150 -----TPLYQDLV--SRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEA 202 PLY + + GA W QGE + + + ++ Sbjct: 264 PRLKTNPLYSTNYPGNLYNGMIKPLLPFSIKGAIWYQGEANSRPGQASDYHNDLKRLIST 323 Query: 203 FRRDLKQYHSQLNNITDAPWFCGDTTWYWKEN--------FPHSYEAIYGNYQNNVLANI 254 +R+D Q D P++ + + + E Q+ + Sbjct: 324 WRKDWGQ--------GDFPFYFVQLPNFGERTTEPIQEHPWVTIREQFLKTAQSLENTGM 375 Query: 255 IFVDFQQQGERGL-TNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 + + N D S + + WT Sbjct: 376 AITIDIGEAKNLHPRNKRDVGKRLAMVALGKSYNKKDQVWTGP 418 >UniRef50_B7AHV8 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AHV8_9BACE Length = 478 Score = 80.4 bits (196), Expect = 7e-14, Method: Composition-based stats. 
Identities = 44/234 (18%), Positives = 72/234 (30%), Gaps = 31/234 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V V GQSN G P +DA +I + F Sbjct: 110 DVWFVGGQSNMQMNFHGNPDQPVQDA--QKILLRCKHKGIRL------FRVNNGYAISAS 161 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D + G + ++ ++ VG KL I +V GGS+ A + Sbjct: 162 DTLTIDGKWTSASPDNVKEFSVVG--YIFGEKLHELTD--IPIGLVQSAHGGSSAEAWLD 217 Query: 131 GTYSERHGASH-DACRWGTDTPLYQ-DLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 ER G + R D Y + LA G W QGE ++ Sbjct: 218 YETLERFGGFDLNLERKKIDPIWYAFEPTVLYNKMLAPMLPLTVKGVIWYQGESNVERP- 276 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAI 242 + F +V +RR + P+ Y+ + P++++ + Sbjct: 277 -EQYRMLFPELVRTWRRYFADDN--------LPF-------YYVQIAPYNHQKV 314 >UniRef50_A9UYV9 Predicted protein n=2 Tax=Monosiga brevicollis RepID=A9UYV9_MONBE Length = 970 Score = 80.0 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 39/276 (14%), Positives = 74/276 (26%), Gaps = 33/276 (11%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + ++A + P + Sbjct: 184 EVYLCSGQSNME---------FPLNIATNGTYEMADAVNW-----PQFRLFRVSHNTAET 229 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 +V+++ G A + + + AR + + + +V GG+ A Sbjct: 230 EVRNVTGTWALNAPDVAGAFSAL--CYLTARDVQRLRNSSMPVGLVQVTWGGTRVEAWMS 287 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 E A A WGT + + A + + A W QGE + ++ Sbjct: 288 ---KEALAACPAAPAWGTGSSPQNQPTALWNGMAAPVIEMTYRMALWYQGETNAQGANST 344 Query: 191 SHPQ-HFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNN 249 + F M+ +R D P+ P + Y Sbjct: 345 DYYACMFQSMIADWR--------GRAGYGDLPFLYMQLPPSLLPTDPANITT---GYSEI 393 Query: 250 VLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGS 285 A ++ + + T D +G Sbjct: 394 RQAQLLTLPHTRGPSD--TTGMVVGLDLGGVSMWGP 427 >UniRef50_C6XUV3 Sialate O-acetylesterase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XUV3_PEDHD Length = 468 Score = 80.0 bits (195), 
Expect = 1e-13, Method: Composition-based stats. Identities = 33/260 (12%), Positives = 69/260 (26%), Gaps = 44/260 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 + +GQSN ++ P R + ++ + + C Sbjct: 101 ELWLCSGQSNMQWNALNDLKEMKDVLPGIRNSNIRFLNVSNIASAYPQDDLVNSWQLCDS 160 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 A ++ + N + I+ GG+ + Sbjct: 161 TSASTFSA----------------IGYFFAEEISKRL--NVPVGIINASWGGTCAEVWTP 202 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRT-RAALAKNPQNKFLGACWMQGEFDLMTSDY 189 G A + P +L + + G W QGE ++ + Y Sbjct: 203 GELVMNDELLLKASQLKKVAPRKPNLPGYAWNSMVHPLVGYTIAGVLWYQGEDNV--ASY 260 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNN 249 S+ + F M++++R+ + P++ Y Y+N Sbjct: 261 DSYERLFAMMIKSWRKSWND---------EFPFYFAQIAPY--------------TYKNK 297 Query: 250 VLANIIFVDFQQQGERGLTN 269 L ++ QQ Sbjct: 298 ELPKAAYLREQQTFTALHNE 317 >UniRef50_C0A824 Sialate O-acetylesterase n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A824_9BACT Length = 485 Score = 80.0 bits (195), Expect = 1e-13, Method: Composition-based stats. 
Identities = 39/320 (12%), Positives = 72/320 (22%), Gaps = 61/320 (19%) Query: 11 YVLTVAGQSNAMA--YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN + + A +P I+Q L Sbjct: 124 DVWLCSGQSNMEWQVRKVTNAIDEIAAANYPGIRQFKVPERFA-----------STLLSP 172 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + P T A L N ++ GG+ A Sbjct: 173 YELEGEWIVCSPPTVGKDFTA-----VGYFFALDLHRRY--NIPQGLIKSAWGGTPVEAW 225 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 + D + A + G W QGE + D Sbjct: 226 MSIGALALADTAPAVAPLVPDNR--RAPAGAYNAMIHPLIPVALRGVLWYQGEANARNPD 283 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQN 248 + F ++ +R+ + + P+ + + ++ I Sbjct: 284 G--YGALFRTLITDWRQRWQSP--------ELPFLFVQLPNHGS-SEKTNWAKIREGQ-- 330 Query: 249 NVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARR 308 A+++ + P + + + ARR Sbjct: 331 ---ASVLALPATSMAVTIDLGDPKDIH----------------PRNKQDVAHRLALLARR 371 Query: 309 GII-------SDRFVEAILQ 321 I RF A + Sbjct: 372 MIFDEDVMAEGPRFASAKFE 391 >UniRef50_B4REE2 Putative uncharacterized protein n=1 Tax=Phenylobacterium zucineum HLK1 RepID=B4REE2_PHEZH Length = 267 Score = 80.0 bits (195), Expect = 1e-13, Method: Composition-based stats. 
Identities = 53/271 (19%), Positives = 84/271 (30%), Gaps = 60/271 (22%) Query: 12 VLTVAGQSNAMAYG-EGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 ++ VAGQSNA+ YG LP +P P ++ Sbjct: 32 IVVVAGQSNALGYGLTAADLPPSLASPDPDVRI--------------------------W 65 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF----- 125 D Q T Q G G AR PD A + +V RG + Sbjct: 66 DGARFQPMAAGRNTGFGPQPGAWGPEAGFARAWRAAHPD-APLHVVKFARGSTPLAASPG 124 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQN-KFLGACWMQGEFDL 184 S GT A+ + + +AALA N + + W+QGE D Sbjct: 125 RDWSPGTQELFAAATTE--------------IEEAKAALAVNGGPARVVAILWVQGEADA 170 Query: 185 -MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW--YWKENFPHSYEA 241 + A++ + +++A RRD ++AP G T + + A Sbjct: 171 VDPAKAAAYGPNLAGLIQAIRRDW---------SSEAPIVVGQTGPGLPYAKAVRAGQAA 221 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPD 272 + + + + Q G Sbjct: 222 VASPEGRVAVVDTGPLPRQADGLHIAAEGQA 252 >UniRef50_C3RDR8 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3RDR8_9BACE Length = 485 Score = 79.6 bits (194), Expect = 1e-13, Method: Composition-based stats. 
Identities = 49/310 (15%), Positives = 83/310 (26%), Gaps = 64/310 (20%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V AGQSN +P+ P + + L Sbjct: 117 EVWVCAGQSNVQ-----MPVKGFIGQP-----VIGSCDAIAMASSKEKIRLLTVLRRENS 166 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG-S 129 ++Q+ L T A A+ + + + I+ GG+ + Sbjct: 167 ELQEECQSTSWLETTPSNVKEFSAVAYFFAKYIQNIL--QVPVGIICSAAGGTNIERWMN 224 Query: 130 EGTYSE-RHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 + Y S D + LY ++ G W QGE +++ Sbjct: 225 KNDYLAIYPDKSMDEVKNVRAGELYNTMI-------YPLHLFNIKGIIWYQGESNVLNPH 277 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY---EAIYGN 245 + Q F MV ++R W G+ +Y+ + P+ Y +A Sbjct: 278 --EYKQLFIRMVASWREK---------------WALGEFPFYYVQIAPYQYSDKQASPIG 320 Query: 246 YQNNVLAN-----------------------IIFVDFQQQGERGLTNAPDEDPDDLSTGY 282 A I D G+R A + + Y Sbjct: 321 AAELRQAQLEAMTIIPNSGMVVTSDVGDAKHIHPADKPNVGKRLALWALAKTYNIPGIPY 380 Query: 283 YGSAYRSPEN 292 G Y+S E Sbjct: 381 CGPIYKSAEI 390 >UniRef50_B3CHU8 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B3CHU8_9BACE Length = 484 Score = 79.6 bits (194), Expect = 1e-13, Method: Composition-based stats. 
Identities = 34/253 (13%), Positives = 67/253 (26%), Gaps = 27/253 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPR---IKQLARFAHTHPGGPPCHFNDIIPLTH 67 V +GQSN +P+ + P I A+ + + Sbjct: 110 EVWFCSGQSNME-----MPMGGFDRQPAQGTNDIIAKAKASTPIRIYNTDSKDGRWIRQS 164 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 +D G + + A + AR + + + I+ GGS A Sbjct: 165 SKTPQEDCLGEWLENTPENVSHTSA--AAYYFARYIQEVLD--VPVGIIISTLGGSKVEA 220 Query: 128 GSEGTYSERHGASHDACRWGTDT--PLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 + + + L A +A G W QGE + Sbjct: 221 WMSREAISPFKSIDLSILDNDEKIKNLTNTPCVLYNAKIAPFLNFAIKGFLWYQGESN-- 278 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGN 245 + + V+ R N + P++ + + N+ + Sbjct: 279 RDNADLYKDLMPAFVKDLRSKW--------NRGEFPFYFVEIAPF---NYEGADGTSAAR 327 Query: 246 YQNNVLANIIFVD 258 + L N+ + Sbjct: 328 MREVQLQNMKDIP 340 >UniRef50_A6DN35 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DN35_9BACT Length = 494 Score = 79.6 bits (194), Expect = 1e-13, Method: Composition-based stats. 
Identities = 37/314 (11%), Positives = 82/314 (26%), Gaps = 57/314 (18%) Query: 11 YVLTVAGQSN--AMAYG--EGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 V AGQSN + P+ D + +L IP + Sbjct: 112 DVWLCAGQSNMRMSLRACLKSKPVADAHKKAGNNLLRL----------------CTIPKS 155 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 D+ + + +G + + + I ++ GG+ Sbjct: 156 VSDELQTDVDCNWAVATRDTSLPFSAIG--YLFGQTIQEELA--VPIGVINGSWGGTFIE 211 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK------------FLG 174 ++ + + R + + P G Sbjct: 212 QWMPSDVVKKRADCKAFNKKVDE--------RRKKDPNSSGPGGHFNGMIGPIMPYGLKG 263 Query: 175 ACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKEN 234 W QGE ++ ++++ + M+ +RR K + AP+ G ++ Sbjct: 264 VLWYQGEGNVW--GFSTYKHKISTMIADWRRLFKSPKLPFIMTSLAPF--GVRKDKPIDS 319 Query: 235 FPHSY-EAIYGNYQNNVLANIIFVD--FQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPE 291 + E + Q+ A +I + Q+ + + + A Sbjct: 320 HSSRFAEGLAEVEQSIENAWLITIPDGGMQKDIHPPF------KEIPAQRFSAMALAKVY 373 Query: 292 NWTTALRSSHFSTA 305 + F + Sbjct: 374 KKAGVYKGPVFKSW 387 >UniRef50_A5ZHN2 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZHN2_9BACE Length = 474 Score = 79.3 bits (193), Expect = 2e-13, Method: Composition-based stats. 
Identities = 45/319 (14%), Positives = 80/319 (25%), Gaps = 35/319 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN +P+ P + P +I L Sbjct: 104 EVWLCSGQSNME-----MPVKGFRGQP------VEGSCDAIATALPSDNIRMITLKINSS 152 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 P + + L + N + ++ GGS + Sbjct: 153 QTVLDDCVATPWVESTPANVADFSATAYFFASYLRKV-LNVPVGVICSSWGGSKIESWIN 211 Query: 131 GTYSERHGASHDACRWGTD----TPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 D + A + Q G W QGE +L Sbjct: 212 KEVYTEKFPEISLSVLTKDPKDIARPKDEPTLLYNAMIHPIKQFTIKGTIWYQGESNLNN 271 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY-WKENFPHSYEAIYGN 245 + + F MV ++R++ Q + P++ Y + EA Sbjct: 272 P--QVYKRLFPAMVRSWRKEWNQ--------GEFPFYYVQIAPYDYGRKNADKTEAAEIR 321 Query: 246 YQNN----VLANIIFVDFQQQGERG---LTNAPDEDPDDLSTGYYGSAYRSPENWTTALR 298 + N V G R ++ + RS ++ L Sbjct: 322 QVQLECLKEIPNAGMVVTADIGNRTCVHPSDKESVGKRLALWALAKTYQRSGTPYSGPLY 381 Query: 299 SSHFSTAARRGIISDRFVE 317 S F+ + I+ + E Sbjct: 382 KS-FTIDKEKIIVEFDYAE 399 >UniRef50_C6XXZ6 Sialate O-acetylesterase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XXZ6_PEDHD Length = 466 Score = 79.3 bits (193), Expect = 2e-13, Method: Composition-based stats. 
Identities = 41/274 (14%), Positives = 75/274 (27%), Gaps = 52/274 (18%) Query: 11 YVLTVAGQSNAM----AYGE-----GLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND 61 V +GQSN +G L D P R+ ++ + P Sbjct: 107 EVWLCSGQSNMEMPVKGFGNQPITNSNELLSDADEPGVRLFRIEKNMSRTP--------- 157 Query: 62 IIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRG 121 + ++ + Q+ VG AR L + + I+ G Sbjct: 158 ----------LTELNAKWEHSNSETTGQFSAVG--YQFARMLQQKL--KVPVGIIQSAYG 203 Query: 122 GSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 G+ A G + T + + A + GA W QGE Sbjct: 204 GTIIEAWM--DKKSFAGFTDVKIPADTVKMIKNEPFVLFNAMINPIVGFNIKGALWYQGE 261 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 + ++ + MV+ +R CGD ++Y+ + P++Y Sbjct: 262 NN--WFTPDTYDKKMEAMVKEWRSIWG---------------CGDFSFYYVQLAPNAYPN 304 Query: 242 IYGNYQNNVLANIIFVDFQQQ-GERGLTNAPDED 274 + G +A + Sbjct: 305 GKDKLPVIYEKQAKAMQLIPNSGMAVSVDAGSQT 338 >UniRef50_UPI000196870D hypothetical protein BACCELL_00054 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196870D Length = 388 Score = 78.9 bits (192), Expect = 2e-13, Method: Composition-based stats. 
Identities = 35/259 (13%), Positives = 67/259 (25%), Gaps = 42/259 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN +PL P L IP Sbjct: 107 EVWLASGQSNMS-----MPLKGYYCQP-----VLGSTEAILSSAKKQIHFINIPTLAAYK 156 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 ++ + + + + VG A L + + I+ GGS A Sbjct: 157 PLEHFEAEWVKASPENVAECTAVG--WFFADFLQRNMD--VPVGIINASYGGSNIEAWMN 212 Query: 131 GTYSER------HGASHDACRWGTDTP--LYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 ++ S + W ++ P LY ++ G W QGE Sbjct: 213 AEACKQFDDIPVPPLSDETSPWISNVPTVLYNGMI-------HPLVGYGIKGIIWYQGES 265 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPH---SY 239 ++ + MV +R + P++ Y + + + Sbjct: 266 NIFNVP--RYAPSVAAMVSKWREAWGL--------GEVPFYYAQIAPYDYKEWNFFTPQW 315 Query: 240 EAIYGNYQNNVLANIIFVD 258 + + + + Sbjct: 316 PEVSAYQREAQRQCLTLIP 334 >UniRef50_B0NPW7 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NPW7_BACSE Length = 903 Score = 78.9 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 64/228 (28%), Gaps = 34/228 (14%) Query: 11 YVLTVAGQSNA--MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN M D +A +++ A + + + L H Sbjct: 162 EVWLCSGQSNMEFMLKQAATAQTDIPEANDDKLRLFDMKARWRTNAVAWNASVLDSLNHL 221 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + + + A + + L + N + ++ GGS A Sbjct: 222 HYYKSTVWKSCTSATAASFSA-----VAYYFGKMLRDSL--NVPVGLICNAVGGSPTEAW 274 Query: 129 S----------------------EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAK 166 + +R + Y+ + + Sbjct: 275 IDRHTLDCEFPDILYDWTKNDFIQDWVRKRAALNIKHSTNKQQRHPYEPCF-LFESGILP 333 Query: 167 NPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL 214 + G W QGE + + H + F +++++R + +Q + Sbjct: 334 LSKYPLKGVIWYQGESNTHNKEA--HEKLFKLLIDSWRSNWEQPNLPF 379 >UniRef50_UPI0001BC816D sialic acid-specific 9-O-acetylesterase n=1 Tax=Bacteroides sp. 
D2 RepID=UPI0001BC816D Length = 480 Score = 78.9 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 61/230 (26%), Gaps = 28/230 (12%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN G + + + + T Sbjct: 108 EVWICSGQSNMDMRMMGNTGQPIDRSLETILHAGNYRNRIR--------FIAVSRTKDVQ 159 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D +G ++ + A A+++ + + +V GGS + Sbjct: 160 QRTDFEGRKWEVSAPEAVMTCSA-VAYFFAKQVTEVLD--IPVGLVISSWGGSRIESWMN 216 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 + ++ L L G W QGE + +Y Sbjct: 217 EKTLASIDGVDIEAVRSSKLKMHHRLECMYDTMLWPVRNFTARGFLWYQGESN--IFNYY 274 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE 240 + MV+ +R + D P+ Y+ + PH Y+ Sbjct: 275 CYAPMMTAMVQLWREVWEAP--------DMPF-------YYVQIAPHKYK 309 >UniRef50_B5JQ65 Conserved domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JQ65_9BACT Length = 902 Score = 78.9 bits (192), Expect = 2e-13, Method: Composition-based stats. 
Identities = 60/319 (18%), Positives = 90/319 (28%), Gaps = 55/319 (17%) Query: 11 YVLTVAGQSNAM----AYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 V + GQSN + G P D A + I+ Sbjct: 125 EVWLLGGQSNMALPLSGWSFGDPPAPVSDG-----------AAAIAAANYPNIRLIVVGE 173 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVG----QALHIARKLLPFIPDNAGILIVPCCRGG 122 + + ++ H LA+ + VG +LL + + I ++ G Sbjct: 174 NSASEPEEDITPHWALASWTKCSPSNVGNFSAIGYWFGERLLQDL--SVPIGLIQAPWSG 231 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVS-RTRAALAKNPQNKFLGACWMQGE 181 S+ A ER + + S +A GA W QGE Sbjct: 232 SSCEAWLPAADLERVANYRGQGPFISTGNSDNQTPSVNYNGMIAPIVPFTIAGALWYQGE 291 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT---WYWKENFPHS 238 ++ Q F M+ A+R KQ D P++ YW+ P Sbjct: 292 TNMGR--AEELSQLFPQMITAWRNQWKQ--------GDFPFYFAQLAPYDDYWQLQLPEF 341 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDP------DDLSTGYYG-------- 284 +EA L N V G+ + D+ P YG Sbjct: 342 WEA---QASALHLPNTGLVTTIDVGDAENIHPGDKAPIAHRFAQLALARAYGQTGFIASS 398 Query: 285 SAYRSPENWTTALRSSHFS 303 YRS T S H S Sbjct: 399 PLYRST---TVVDNSLHLS 414 >UniRef50_A6L7S8 Sialate O-acetylesterase n=10 Tax=Bacteroidales RepID=A6L7S8_BACV8 Length = 687 Score = 78.5 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 30/228 (13%), Positives = 60/228 (26%), Gaps = 34/228 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDRE--DAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN Y ++ A + I+ A + + L H Sbjct: 302 EVWLCSGQSNMEFYLNWSATAAQDVPQAANSNIRFYDMKARWRTDAVEWDASVLDSLNHL 361 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA---- 124 + P + + + +KL + + ++ GGS Sbjct: 362 QYYKDTRWTVCSPETAGNFSAVA-----YYFGKKLQDSL--QVPVGLICNAIGGSPTEAW 414 Query: 125 -------------FTAGSEGTYSERHGASHDA-----CRWGTDTPLYQDLVSRTRAALAK 166 + + + A Y+ + + Sbjct: 415 IDRSTLEYRFPAILRNWMQNDFIQDWVRGRAALNVKQSSNKLQRHPYEPCY-LYESGIRP 473 Query: 167 NPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL 214 Q G W QGE + + H + F +VE++R++ Sbjct: 474 LEQFPVKGFIWYQGESNAHNREA--HEKLFGLLVESWRKNWGDAELPF 519 >UniRef50_A7V893 Putative uncharacterized protein n=4 Tax=Bacteroides RepID=A7V893_BACUN Length = 952 Score = 78.5 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 32/273 (11%), Positives = 67/273 (24%), Gaps = 56/273 (20%) Query: 11 YVLTVAGQSNA---MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 V +GQSN +G + +A P ++ P P + Sbjct: 116 EVWLCSGQSNMEWSANHGIKNGDEETANAHCPNLRIFHVARIAAP----------FPQEN 165 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 C + A R + + + I+ GG+ Sbjct: 166 CFSQWTQCTPETMRSTSA---------LAYFFGRNIQEELD--VPVGIIVAAWGGTTAET 214 Query: 128 GSE------GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 + + + LY ++ P G W QGE Sbjct: 215 WTPRECVMSDPVLCDYPYESNPWFPAETGTLYNSMIYPVM------PYG-IAGCIWYQGE 267 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 + +S+ + ++ ++R + P+ Y + P Y + Sbjct: 268 ANQGR--ASSYARVMQRLIGSWRT---------GFNKEFPF-------YLVQIAPFQYHS 309 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDED 274 + + + +T + D Sbjct: 310 KDNGPALLREQQAM-LPEMLDKVKMITVSDLVD 341 >UniRef50_A7LY32 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LY32_BACOV Length = 465 Score = 78.5 bits (191), Expect = 3e-13, Method: 
Composition-based stats. Identities = 33/214 (15%), Positives = 56/214 (26%), Gaps = 24/214 (11%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN G+P+ P + + P + + Sbjct: 104 EVWLCSGQSNM-----GMPMKGFPGQP---VAYANDYITRAKKKVPLRIYTVSNHSSAVP 155 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + T A + L + + IV GS Sbjct: 156 LEHSGGQWKCHDSGAVANCSAT---AYFFGKYLQEVL--GIPVGIVISAWNGSNIETWMS 210 Query: 131 GTYSERHGASHDACRWGTDTP--LYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 A R TP LY ++ R G W QGE + + Sbjct: 211 RESFRALDILSVAKRPVHQTPSLLYNAMIYPIRNL-------AVKGMIWYQGEAN--RNK 261 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 + + F V+ R ++ + AP+ Sbjct: 262 PEEYARLFPAFVQDIRNTFQKPELPFYYVQIAPF 295 >UniRef50_C1F9V4 Sialate O-acetylesterase homolog n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F9V4_ACIC5 Length = 509 Score = 78.5 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 48/335 (14%), Positives = 82/335 (24%), Gaps = 81/335 (24%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P + + P +PL P+ Sbjct: 113 DVWFASGQSNMQ--------IPLIGFPGSAVIR-NAKEEIAQANHPEIRLLFVPLKSSPY 163 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG-- 128 D G + V A AR L + + I IV GG+ + Sbjct: 164 PRDDQGGTWTHCTPETARTFSAV--AYFFARDLEQHL--HVPIGIVDATWGGTPIESWMS 219 Query: 129 ------------------------------------------SEGTYSERHGASHDACRW 146 + G H D W Sbjct: 220 LRSLASDAAFMPVFMKRAEFAAQQTNLKTILAQEKAEDAKAVAAGKPKPEHPWHPDQQSW 279 Query: 147 GTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTS-DYASHPQHFNHMVEAFRR 205 + LY +++ G W QGE + + + + + F M+E +R Sbjct: 280 NP-SYLYNAMIA-------PETPYTIRGFLWYQGETNSDDAWVPSLYRRLFPAMIEDWRT 331 Query: 206 DLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQ-NNVLANIIFVDFQQQGE 264 + + P+ + ++ + + I + L N +G Sbjct: 332 RWHE--------GELPFLYVQISSFY--SPQEHWGEIRNAQRLTLALRNTGMAVSLDKGL 381 Query: 265 RG---LTNAPDEDPDDLSTGYYGSAYRSPENWTTA 296 R + L+ Y AY W Sbjct: 382 
RNNIHPPDKQTVSHR-LALAAYHLAYGEDGTWEGP 415 >UniRef50_C6Y404 Sialate O-acetylesterase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y404_PEDHD Length = 689 Score = 78.1 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 41/293 (13%), Positives = 78/293 (26%), Gaps = 42/293 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAP-HPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V +GQSN E A P +L +F N + Sbjct: 304 DVWLCSGQSNMAFTLSASETGKSELADIKPNNMRLFKFTQYAETNNAPWDNKTL---EQV 360 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG- 128 + ++ G P ++ VG + +KL + ++ GGS + Sbjct: 361 NRLKYFSGNWTPTNAGTAAEFSAVG--YYFGKKLTAE--SGVPVGLIQLAVGGSTIESWI 416 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK----------------- 171 T W + + R L + K Sbjct: 417 DRYTMEHDEQLVDVLADWRKSDFIQEWARGRADVNLKQATNPKQRHPYEPVYNYEAGISK 476 Query: 172 -----FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGD 226 G W QGE ++ D + F +V+++R+ D P++ Sbjct: 477 LIEFPISGVIWYQGESNVQNVD--LYKHTFPVLVQSWRQKWGY---------DFPFYFVQ 525 Query: 227 TTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLS 279 + + ++P+ +A + + E D L+ Sbjct: 526 LSGLNRPSWPYFRDAQRELQKAIKNIWMAVSSDLGDSLNVHPKRKKEVGDRLA 578 >UniRef50_C1ZI44 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZI44_PLALI Length = 393 Score = 78.1 bits (190), Expect = 4e-13, Method: Composition-based stats. 
Identities = 45/348 (12%), Positives = 99/348 (28%), Gaps = 67/348 (19%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V + GQSN + G+ D + +KQ + +N + + Sbjct: 80 KVFILMGQSNMLEMGKVAG--DTDGTLEHAVKQEGLYPFLID--DAGKWNTHADVRNVAV 135 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCR---------- 120 G T +G I +L + + +L++ Sbjct: 136 MGSGGPGKTQFRINAWLTVGNKIGVEQGIGHQLGNALDE--PVLLLKSAIGNRSLGWDLL 193 Query: 121 --GGSAFTAGSE--GTYSERHGASHDACRWGTDTPL----------YQDLVSRTRAALAK 166 G S++ G G RW T ++R + L Sbjct: 194 PPGSSSYEFVDPKDGKTYIYAGYGQSPDRWEKGTEPKAINWKAGLQLDGDIARAKEVLND 253 Query: 167 NPQ-------NKFLGACWMQGEFD-LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNIT 218 + + G W QG+ D + + Q+ +++ R++ ++ Sbjct: 254 LGKYYPGANEYEIAGFFWWQGDKDRYNAGHASRYEQNLVNLIATLRKEFNAPQAK----- 308 Query: 219 DAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDL 278 + C ++N P E + + P + P+ Sbjct: 309 ---FVCATLGQTDRDN-PTGNEKYLLEAK------------------LAISDPQKHPELQ 346 Query: 279 STGYYGSAYRSPENWTTALRSSHFSTAARRGIISDRFVEAILQFWRER 326 T + Y P + ++ + + + A + EA+++ +E+ Sbjct: 347 GT--VATVYTHPLSMGSSSNAHYGNNAKTYMNVGLAMGEAMVKLLKEQ 392 >UniRef50_C5RDR3 LPXTG-motif cell wall anchor domain protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RDR3_CLOCL Length = 1235 Score = 78.1 bits (190), Expect = 4e-13, Method: Composition-based stats. 
Identities = 38/233 (16%), Positives = 71/233 (30%), Gaps = 50/233 (21%) Query: 11 YVLTVAGQSNAMAY--GEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN G + +++ +P I+ ++ + ++ Sbjct: 542 DVWLCSGQSNMAFQLTGTLDSENEIKNSDYPNIR---------------YYTVPVKTSYV 586 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 P D + + V A ARKL + N I +V GG+ Sbjct: 587 PLSNIDNAQGWKVCSPSTSGNLSAV--AYFFARKLTTDL--NVPIGVVFAAEGGTRAEQW 642 Query: 129 SEGT-------YSERHGASHDACRWGTDTP----LYQDLVSRTRAALAKNPQNKFLGACW 177 + Y A T LY +++ P G W Sbjct: 643 TSYESLQNIPEYVAASNAIKSCSTEIDATSSPNVLYNGMIAPV------APYG-LKGVLW 695 Query: 178 MQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 QGE + AS+ + +++ +R++ N+ D P+ + Sbjct: 696 YQGESNWGD---ASYERLLPNLIADWRKNF--------NVKDLPFILVQLPGF 737 >UniRef50_D2QHH9 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QHH9_9SPHI Length = 521 Score = 78.1 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 45/345 (13%), Positives = 79/345 (22%), Gaps = 57/345 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDII----PLT 66 V AGQSN P+ + A + P ++ L+ Sbjct: 114 DVWICAGQSNMA-----FPVASDQFAAQTLRQSGNGSLRLFNKLPALSTYNVAYKPNELS 168 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 H D + + VG + + + + N I ++ GGS Sbjct: 169 HLHPDAFYKPASWQEADSLAIRLFSAVG--YYFGQAVQQSL--NIPIGLINVAVGGSPTE 224 Query: 127 AGSEGTYSERHGASHD--ACRWGTDTPLYQDLVSR------------------------- 159 A W + L + R Sbjct: 225 AWMRPGSGLADPTIQPVFKGDWWQNPVLEPWCIQRGHENLDNLIQAGYKPPHDSLGYHHP 284 Query: 160 ------TRAALAKNPQNKFLGACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYHS 212 +AA+ + G W QGE + + H F +V +R Sbjct: 285 FKPGFLYQAAIVPLLRLPIRGVLWYQGESNALSLARAQQHGHLFARLVGDWRDQW----- 339 Query: 213 QLNNITDAPWFCGDTTWYWKEN--FPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNA 270 D P++ + E ++ + + T+ Sbjct: 340 ---QQGDFPFYVCQLSSIGTEKGYKSENWPWFRDSQRRLAQRLPNVGMAVTSDVGNPTDV 396 Query: 271 PDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGIISDRF 315 + + A L A R I+ RF Sbjct: 
397 HPTNKRVVGQRLAREALVKTYGQQAVLTPEIVEVARVRNGITLRF 441 >UniRef50_A6DKT6 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKT6_9BACT Length = 352 Score = 78.1 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 39/296 (13%), Positives = 74/296 (25%), Gaps = 62/296 (20%) Query: 1 MNAIISPDYY----YVLTVAGQSNAMAYGEGLPLPD---REDAPHPRIKQL--ARFAHTH 51 + A P V + GQSN + +G+ D + + Sbjct: 25 VGANTKPANMSKPVKVFILMGQSNMLGFGKISGGKDGCLDYAVKNEGLYPFLQDAKGRWI 84 Query: 52 PGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNA 111 ++ + + + + N G +G + I + Sbjct: 85 TRQD---VRNVFTMGSGGPQSRGGVKKNEFMTINK----GKIGPEIGIGHYMGNLY--GE 135 Query: 112 GILIVPCCRG-----------GSAFTAGSEGTYSE-------RHGASHDACRWGTDTP-- 151 +LI+ C G GS + G RW T Sbjct: 136 PVLILKSCIGNRSLGWDLLPPGSPSYVFEDKDKKSKQMKTYVYAGYGQSPDRWEKGTQAK 195 Query: 152 --------LYQDLVSRTRAALAKNPQ-------NKFLGACWMQGEFD-LMTSDYASHPQH 195 Y ++R + L + + G W QG+ D T + Q+ Sbjct: 196 AINWKAGVQYDGDIARAKNVLNNLDKYYPGAKSYEVAGFFWWQGDKDRYNTGHAIKYEQN 255 Query: 196 FNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVL 251 ++ +A R+D ++ KE + + I Sbjct: 256 LVNLFKALRKDFNSPKAK--------MVVATLGQTKKETAKGNEKLILDAMFALEK 303 >UniRef50_D2QYI9 Sialate O-acetylesterase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QYI9_9PLAN Length = 539 Score = 78.1 bits (190), Expect = 4e-13, Method: Composition-based stats. 
Identities = 37/220 (16%), Positives = 55/220 (25%), Gaps = 30/220 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + DR I+ F I Sbjct: 143 EVWLASGQSNMAM--TLKSVADRLTQAQDDIRAAD--------HSSLRFRRIDEPASRET 192 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + H + A + A KL + + I+ RGG+ Sbjct: 193 VADIPPKSWTVCSPTHAGSFSA--AAFYFASKLQRELD--VPVGIIDSSRGGTPIEPFIP 248 Query: 131 GTYSERHGASHDACRWGTDTPLYQ--DLVS--RTRAA-----------LAKNPQNKFLGA 175 + H G L L R R A LA Q GA Sbjct: 249 REAFQGHPTLEQELALGDREDLLGIWKLAGGVRARDANWLPGRLFHSRLAPMKQFAVRGA 308 Query: 176 CWMQGEFDLM-TSDYASHPQHFNHMVEAFRRDLKQYHSQL 214 W QGE + D + +++ +R + Sbjct: 309 IWYQGESNCGIEEDPRDYQHKMRGLIQGWRDAFGNRSMPV 348 >UniRef50_D2QX68 Sialate O-acetylesterase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX68_9PLAN Length = 477 Score = 77.0 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 58/230 (25%), Gaps = 30/230 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P +E G P Sbjct: 108 EVWICSGQSNMEWSVAASDNPQKEAEAANFPLIRMIKVEKAVAGEP-------------- 153 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D++G + + + VG RKL + + ++ GG+ A + Sbjct: 154 -QTDIKGAWQVCSPSTVPGFSAVG--YFFGRKLHQDLD--VPVGMINTSWGGTICEAWTS 208 Query: 131 GTYSERHGASHDACRWGTDTPLY----QDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 + A LA GA W QGE + Sbjct: 209 KEALAASEPLKFMTERQLNIDPAKMNPNQPTVLYNAMLAPLVPYGIRGAIWYQGESNKGR 268 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCG-----DTTWYW 231 + F M+ +R+ Q + AP+ G + W Sbjct: 269 --AEQYRTLFPVMISDWRKQFGQGDFPFGFVQLAPFDYGGSDPRELAEQW 316 >UniRef50_C6VZW5 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VZW5_DYAFD Length = 720 Score = 76.2 bits (185), Expect = 2e-12, Method: Composition-based stats. 
Identities = 37/197 (18%), Positives = 62/197 (31%), Gaps = 20/197 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +AGQSNA +G P P + R+ + + + G D +P T Sbjct: 127 EVFLIAGQSNAQGL-KGKPTPPGAN--DDRVLYIDNYENDPDGRYNDLLTDPVPPTFSKI 183 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 +G +G L+ + N +L + G++ T +E Sbjct: 184 TSDIKTMSPR---GQTAWCWGALG------DLLVSKL--NVPVLFINAAWEGTSVTNWAE 232 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-MTSDY 189 +R + Y +L R A N WMQGE D + Sbjct: 233 SASGQRTVSY--YGYKYNTGMPYANL--RISARNYGNQYG-LRAVLWMQGETDGFFGTPS 287 Query: 190 ASHPQHFNHMVEAFRRD 206 A + +++ D Sbjct: 288 ALYRTSLQKIIDQLSID 304 >UniRef50_A4AN28 Sialate O-acetylesterase n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AN28_9FLAO Length = 515 Score = 75.8 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 62/235 (26%), Gaps = 37/235 (15%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRI-KQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V AGQSN + D + RF +T G + Sbjct: 118 DVWLCAGQSNME---WSMEREMHFDKEKKNVNLPSLRFYNTTYAGKNIYNKSFNDSLLSL 174 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + D ++ + + + + +++L + I ++ GG+ A Sbjct: 175 LNENDFYDGRWAVSDSISIKRMSA-VGYYFGKEILEQVD--VPIGLINMAIGGAPIEAFI 231 Query: 130 EGTY--SERHGASHDACRWGTDTPLYQDLVSR---------------------------T 160 +S W + L + + R Sbjct: 232 GRNVMEGNSLFSSKVKGNWLDNNVLPEWIRERGHQNVGDIQVLDGDELGPNHAFKPGFAY 291 Query: 161 RAALAKNPQNKFLGACWMQGEFDLMT-SDYASHPQHFNHMVEAFRRDLKQYHSQL 214 A + + G W QGE + + + ++E +R+ KQ Sbjct: 292 SAGIKPLFKLPIKGIIWYQGESNAQEIERVNEYGELQKLLIEDYRQKWKQPEMPF 346 >UniRef50_C3QSB9 Sialate O-acetylesterase n=8 Tax=Bacteroides RepID=C3QSB9_9BACE Length = 809 Score = 75.8 bits (184), Expect = 2e-12, Method: Composition-based stats. 
Identities = 46/322 (14%), Positives = 93/322 (28%), Gaps = 36/322 (11%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIK----QLARFAHTHPGGPPCHFNDIIPLT 66 V GQSN P+ A +P++K L P + Sbjct: 74 EVWLCTGQSNME-----FPV-----ARNPQVKWKTGMLNEAEEMKDADFPEIRLFHVEHQ 123 Query: 67 HCPHDVQDM-QGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 P D ++ G + + VG RKL + + + ++ GG+ Sbjct: 124 LAPDDEKEDCVGKWVVCNPENLKDFSAVG--FVFGRKLYKEL--STPVGLIQSTWGGTHA 179 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQD------LVSRTRAALAKNPQNKFLGACWMQ 179 + + E + D + + + ++ + +A G W Q Sbjct: 180 ESWTSMKVMENNPLYADVLKQYSKEKVSREKDKCKVPATLWNGMIAPMVGYTVKGNIWYQ 239 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSY 239 GE + Y + + F +++ ++R++ Q D P++ ++K+ Sbjct: 240 GESNS--VRYEKYQEVFTNLINSWRKEWNQP--------DMPFYFVQIAPHYKQPAGIRE 289 Query: 240 EAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRS 299 + + V + L+ Y ++ L Sbjct: 290 AQLKTWLSGLENIGMAVVTDAADSTDIHPRNKVAPGERLAAWALAKQYGKKIVYSGPLYK 349 Query: 300 SHFSTAARRGIISDRFVEAILQ 321 S R + F E LQ Sbjct: 350 S-MKVNGREITLDFAFAEGGLQ 370 >UniRef50_UPI0001BC8367 sialic acid-specific 9-O-acetylesterase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC8367 Length = 481 Score = 75.4 bits (183), Expect = 2e-12, Method: Composition-based stats. 
Identities = 34/222 (15%), Positives = 60/222 (27%), Gaps = 32/222 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGP-PCHFNDIIPLTHCP 69 V +GQSN +P+ P + P +P T Sbjct: 104 EVWICSGQSNME-----MPVHGFYGQP-----VVGSLEEIVEASQYPDIRMFTLPPTPAA 153 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 D +G + VG + L + N I ++ GG A Sbjct: 154 EPQDDCRGSWLKSTPESVRDFSAVG--YFFGKNLNKVL--NIPIGLITPNCGGIAIEPWM 209 Query: 130 --------EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 G + + L+ +++ R G W QGE Sbjct: 210 TAEAIRETAGINQKLAFTPQVQTEAANASYLFNGMIAPIR-------NFTGRGFIWYQGE 262 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWF 223 + +Y + + MV +R++ K + P+ Sbjct: 263 SN--QHNYFDYDKLQVSMVNLWRKEWKNEDMPFYYVQLVPFP 302 >UniRef50_B3CGC9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B3CGC9_9BACE Length = 472 Score = 75.4 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 39/322 (12%), Positives = 85/322 (26%), Gaps = 68/322 (21%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHP----GGPPCHFNDIIPLT 66 V +GQSN + + P IP Sbjct: 101 DVWLCSGQSNMQ-------------------WVVNNVTNAEVEKKNANYPQIRTLNIPRR 141 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 + + + ++ G A A+K+ N I I+ GG+ Sbjct: 142 MELSPKDTISATWLVCSPENVGRFS--GVAYFFAKKVYEET--NIPIGIINSSWGGTIVE 197 Query: 127 AGSEGTYSERHGASHDACRWGTDT--PLYQDLVSRTRAALAKNP-------------QNK 171 + + + P + L + + A + Sbjct: 198 TWTSLEAANTLPQKRLDRYNKNEKLFPPTEYLTRKNKEAKRNDYPSLVYNAMIHPLLSFS 257 Query: 172 FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 G W QGE ++ + + M+ +R ++ P++ + Sbjct: 258 IKGVLWYQGENNVG--NAEPYTDWLTCMIGDWRNRWN---------SELPFYIIQLPNF- 305 Query: 232 KENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPE 291 + + ++ ++ ++ V G + + DP DL + Sbjct: 306 ---DSINKKPLWAEMRDAQSK-VLAVP----GTHLIVTSDLGDPYDLH-----PRNKQEV 352 Query: 292 NWTTALRSSHFSTAARRGIISD 313 AL++ H+ I+S+ Sbjct: 353 GMRAALQALHYE-YGYSDIVSE 373 >UniRef50_D1PH08 Putative sialate 
O-acetylesterase n=1 Tax=Prevotella copri DSM 18205 RepID=D1PH08_9BACT Length = 484 Score = 75.4 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 75/249 (30%), Gaps = 37/249 (14%) Query: 11 YVLTVAGQSNAMA--YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V AGQSN G G + + + + ++ H P + + Sbjct: 106 EVWVCAGQSNMEMPVKGFGNCPVEGYNKA---VLEANQYKGVHYVKIP-----SVMSSKP 157 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 D +P + G A+ + + + ++ +GGS + Sbjct: 158 LDDANCEWKEVNPETVGDASATGYF-----FAQVINKTLD--IPVGLIMANKGGSRVESW 210 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQD----LVSRTRAALAKNPQNKFLGACWMQGEFDL 184 + Y +++ T P ++ + G + QG ++ Sbjct: 211 LDRDYLKKNTKEDLDSVKMTKNPKFKWDFLYPLLWGNGTFHPILNYSVRGILFYQGCSNV 270 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 D + + +V +RRD KQ G+ +Y+ + P+ + G Sbjct: 271 GDPD-GQYTKRLADLVAQWRRDFKQ---------------GELPFYFVQIAPYHNGDVNG 314 Query: 245 NYQNNVLAN 253 ++ + Sbjct: 315 DWGPKLREQ 323 >UniRef50_Q1LUX8 Novel protein (Zgc:56454) n=3 Tax=Danio rerio RepID=Q1LUX8_DANRE Length = 509 Score = 75.0 bits (182), Expect = 3e-12, Method: Composition-based stats. 
Identities = 40/310 (12%), Positives = 80/310 (25%), Gaps = 39/310 (12%) Query: 11 YVLTVAGQSNAM---AYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 + +GQSN + P ++ +P + Sbjct: 113 DIWLCSGQSNMAFTVGQVINATEELTMASKFPDVRIFQAALEQSKKELTDLAGVEVPWSK 172 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + + + H A R L I +V GG+ A Sbjct: 173 PTPGLLGGKDFSHFSAVCWL-----------FGRYLYEK--RKYPIGLVHSSWGGTPVEA 219 Query: 128 GSEG---TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL 184 S + + + L+ ++ GA W QGE + Sbjct: 220 WSSPRALQKCGLKSSVISEQNTWSSSVLWNAMI-------HPLLNMTITGAIWYQGEANA 272 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 + + F M++ +R S+ D P+ Y ++ + I Sbjct: 273 NYNRDK-YNCTFPGMIDDWRMAFH-EGSEGQTALDFPFGFVQLCTYKTKDPTDGFREIRW 330 Query: 245 NYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFST 304 + A+ FV ++ + + PD+ S + S + Sbjct: 331 HQT----ADYGFVPNKRM-KNTFMAVAVDVPDEKSP------WGSIHPEDKQDVAFRLVL 379 Query: 305 AARRGIISDR 314 AR ++ Sbjct: 380 GARAVAYGEK 389 >UniRef50_A6DH60 Sialic acid-specific 9-O-acetylesterase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DH60_9BACT Length = 541 Score = 75.0 bits (182), Expect = 3e-12, Method: Composition-based stats. 
Identities = 34/252 (13%), Positives = 66/252 (26%), Gaps = 59/252 (23%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN L P P + P Sbjct: 104 EVWICSGQSNMDWR--LTQLTKPARDPFYNPISEYVKKEIETANDPLLRQIEVKKEVSPD 161 Query: 71 -DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 +++++ G P+A + + G AR+L + N + ++ C GG+ Sbjct: 162 AELKNINGNWVPVAPANSINFTATG--YFFARELRKTL--NVPVGLIKCAWGGTLVEPWV 217 Query: 130 E---------------GTYSERHGASHDACRWGTDTPLYQDLVSRTRA----ALAKNPQ- 169 + + ++Q+ ++ A K + Sbjct: 218 PMPKYQTNDELKAFYAEEKVAKLEKASKWWTPEKAKAMHQEKLAEWETQKAQAKEKGKKF 277 Query: 170 -------------------------------NKFLGACWMQGEFDLMTSDYASHPQHFNH 198 GA W QGE + + + +HF+ Sbjct: 278 NKRKPRMIQDPAKSNRIPSTLYNGMIAPLVPYAVKGAIWYQGESNAGYQNDK-YQKHFSA 336 Query: 199 MVEAFRRDLKQY 210 ++E +R Q Sbjct: 337 LIEGWRTAWDQD 348 >UniRef50_A6LG42 Sialate O-acetylesterase n=21 Tax=Bacteroidetes RepID=A6LG42_PARD8 Length = 691 Score = 74.6 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 29/239 (12%), Positives = 60/239 (25%), Gaps = 46/239 (19%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDA-----PHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 V +GQSN +++ H +I+ + Sbjct: 303 EVWLCSGQSNMAFRVNESVKEEQQQQLDYAKQHSQIRLFDLKPRW----ETYAVEWDASV 358 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA- 124 + +Q + T + R L + + ++ GGS Sbjct: 359 LDSLNRLQYYHDAQWEVCDTRNTARFSA-IGFAFGRMLADSL--QVPVGLILNAVGGSPT 415 Query: 125 ----------------FTAGSEGTY-----SERHGASHDACRWGTDTPLYQDLVSRTRAA 163 ++ + ER + Y+ A Sbjct: 416 EAWIDRKTLEFEFPDILQDWTKNDFIQDWVRERAALNIKQASNPLQRHPYEPCY-LFEAG 474 Query: 164 LAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 + + G W QGE + + H + F +V ++R++ D P+ Sbjct: 475 IQPLHRYPIKGIIWYQGESNA--HNMEVHERLFPLLVNSWRQNWNA---------DLPF 522 >UniRef50_C2FWW0 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FWW0_9SPHI Length = 507 Score = 74.3 bits (180), Expect = 5e-12, Method: Composition-based stats. 
Identities = 46/333 (13%), Positives = 84/333 (25%), Gaps = 71/333 (21%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + R L H P + T P Sbjct: 112 EVWLCSGQSNMD--------FPVAKSTGWRTGILDEEQHMKEADFPEIRLFHVRQTLSPK 163 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + + + A RK+ + ++ GG+ + + Sbjct: 164 VPLEDCEGEWMICNPDNLKEFSA-VAFFFGRKIYRQT--KLPVGLIQTTWGGTHAESWTP 220 Query: 131 GTYSERHGASHDACRWGTDTPL--------YQDLVSRTRAALAKNPQ------------- 169 + + YQ + A +K Q Sbjct: 221 MPVMQHNPLYTTLIEEQNKAEESYPQDSLIYQKALKDYEEARSKGQQLPKKPKEPLGIYH 280 Query: 170 -----------------NKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHS 212 G W QGE + Y + Q F++M++++R KQ Sbjct: 281 NKALATLWNGMVNPLVPYTIKGVIWYQGESNS--VRYQDYQQVFSNMIQSWRTAWKQ--- 335 Query: 213 QLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGL-TNAP 271 D P++ ++K+ P EA +Q+ ++ + N Sbjct: 336 -----KDMPFYFVQIAPHYKQ-PPEIREAQLKTWQSVQHTGMVVITDVGDSTDIHPRNKQ 389 Query: 272 DEDPDD----------LSTGYYGSAYRSPENWT 294 +S Y G YR + Sbjct: 390 VPGERLANWALAKEYKISVAYSGPLYRKKKIQK 422 >UniRef50_B7FW97 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FW97_PHATR Length = 396 Score = 74.3 bits (180), Expect = 5e-12, Method: Composition-based stats. 
Identities = 42/295 (14%), Positives = 80/295 (27%), Gaps = 29/295 (9%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPD------REDAPHPRIKQLARFAHTHPGGPP 56 A V +AG++N Y L D + R+ R+ H G Sbjct: 18 AKKRGKPVKVFILAGEANVEGYASLSHLHDLVTGQHTLNVTETRLDGPGRYQHLRDGYGQ 77 Query: 57 CHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIV 116 D + +T+ + + G + + +++V Sbjct: 78 WSTRDDVFVTYEHERHSGWKYGPLDVTHWGAAP-NVFGPEVEFGHVMGNAY--VEPVILV 134 Query: 117 PCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK---FL 173 G + + +W + ++ L + ++ Sbjct: 135 KAAWGKRSLAK----DFRPPSATGETGFQWYRMQTGIANTFAQIANILGEEYKHADIDIG 190 Query: 174 GACWMQGEFDL-MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWK 232 G W G DL ++ A + + H V R L + P + Sbjct: 191 GIVWWHGYTDLWNQANAAEYESNLEHFVRDLRSTLHRPL--------LPIVIAELGGSGA 242 Query: 233 ENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGL-TNAPDEDPDDLSTGYYGSA 286 S I +AN+ ++ R P + D++T YYG A Sbjct: 243 N---ASRREIRMRDAQQRVANLAEWNYTTSYVRTASFAVPSKPFLDINTHYYGRA 294 >UniRef50_A6DF57 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF57_9BACT Length = 522 Score = 74.3 bits (180), Expect = 6e-12, Method: Composition-based stats. 
Identities = 35/308 (11%), Positives = 74/308 (24%), Gaps = 76/308 (24%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + + P + Sbjct: 104 DVWLCSGQSNMEWTVNNS---------------MNKDQEIASANNPMIRHFKANHAMNER 148 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS- 129 D + + VG + AR++ + + I+ GG+ Sbjct: 149 PADDNGAQWRICTPQNAAHFTAVG--YYFAREIYKN--EKVPMGILSVNWGGTRVEPWVT 204 Query: 130 EGTYSERHGASHDACR-------WGTDTPLYQDLVSRTRAALAKNPQNKFL--------- 173 + + A + + Y+ + +A +AK+ Sbjct: 205 PEGFHMVPELKNLAMKIDAVNPKKPSGNAAYKKYIQEMKAWIAKSESALIKMEGLTAAPK 264 Query: 174 ----------------------------GACWMQGEFDLMTSDYASHPQHFNHMVEAFRR 205 G W QGE + D + +++ +R+ Sbjct: 265 RPSLGTSHQSPTYLYNAMINPIAPAALKGILWYQGESNGNEGDS--YYHKKQALIKGWRK 322 Query: 206 DLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGER 265 Q D P++ + K N H+ + A + + + G Sbjct: 323 LFNQP--------DLPFYYVQLANFQKSNPNHALGG--DGWARLRQAQLDTLQVENTGMA 372 Query: 266 GLTNAPDE 273 T+ + Sbjct: 373 LATDIGEA 380 >UniRef50_A6DS94 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DS94_9BACT Length = 556 Score = 73.9 bits (179), Expect = 7e-12, Method: Composition-based stats. 
Identities = 34/248 (13%), Positives = 66/248 (26%), Gaps = 50/248 (20%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN M Y + +D + + + + + + Sbjct: 121 EVWFCSGQSNMM-YTLEMLSLKTKDVGYESVLKFMKDEKEQAKDEFLRQIKVPNVASALE 179 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 +D +G + + G A A++L + + IV GG + Sbjct: 180 TKKDFEGQWLASTPQNNGSFS--GTAYFFAKQLRQHLDR--PVGIVNVTWGGKRIESFIP 235 Query: 131 -----GTYSERHGASHDACRWGTDTPL----YQDLVSRTRAALAKNPQ------------ 169 ++ D L YQ + + R A+ N + Sbjct: 236 PSEFNQAIHKQLLTKIQKQVKSYDAQLASKKYQQAMVKYREAIKLNRKKGLPRPKRPIMS 295 Query: 170 -----------------------NKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRD 206 GA W QGE + +H ++ +R Sbjct: 296 VVPSANSATPASIFNGMVNPVIPYAIRGAIWYQGES-HNPNRVEAHRGLLKSLIRGWRAK 354 Query: 207 LKQYHSQL 214 +Q + + Sbjct: 355 WQQGNFPV 362 >UniRef50_B4CVC6 Sialate O-acetylesterase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVC6_9BACT Length = 544 Score = 73.5 bits (178), Expect = 1e-11, Method: Composition-based stats. 
Identities = 45/344 (13%), Positives = 81/344 (23%), Gaps = 93/344 (27%) Query: 11 YVLTVAGQSNAMAY--GEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN G + + A P+I+ P C Sbjct: 104 EVWVASGQSNMEFTLSGAKDHDAEIKAADFPQIRMFTVQKSA----------KTEPADDC 153 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 G + VG AR+L + I I+ GG+ Sbjct: 154 -------VGKWEICTPQTSPHFSAVG--YFFARRLYQTL--KEPIGIIHTSWGGTPAEFW 202 Query: 129 SEGTYSERHGASHD---ACRWGTDTPL-----YQDLVSRTRAALAKNP------------ 168 + R A + Y+ ++ + A P Sbjct: 203 TPKPILARDPAFKPAFDSWEKAVANYPQAKEKYEKDLAEWKEATKTPPADGKPAPKAPRP 262 Query: 169 ----------------------QNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRD 206 G W QGE + D + + F M+ ++R++ Sbjct: 263 PRGGDAFGSPGCLYNGMVAPLVHYTIRGTIWYQGEANAGAPD--LYKKLFPTMIRSWRKE 320 Query: 207 LKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERG 266 I D P+ + + + E N+ A + +D G Sbjct: 321 W--------QIEDFPFLYVQLANFMQRHA----EPTDTNWARLREAQLETLDVPHTGMAV 368 Query: 267 LTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGI 310 + D ++ + A + Sbjct: 369 TIDIGD--------------SKNIHPTDKQDVGLRLALWAEATV 398 >UniRef50_C6W5L8 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W5L8_DYAFD Length = 501 Score = 73.5 bits (178), Expect = 1e-11, Method: Composition-based stats. 
Identities = 41/284 (14%), Positives = 73/284 (25%), Gaps = 47/284 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHP-RIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V +GQSN PL + P R GP + D+ + Sbjct: 110 DVWLCSGQSNMD-----FPLKAAQTGPEELRKGSFNANIRLLKYGPAVPWGDVAWDSTAL 164 Query: 70 HDV---QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 V QG + VG + +K+ N + ++ GGS Sbjct: 165 ATVNRFGFFQGEWKTADAASVGAFSAVG--YYFGQKI--AAETNVPVGLIQVAVGGSPTE 220 Query: 127 AGSEGTYSERHGASHDA-CRWGTDTPLYQDLVSRTR----------------------AA 163 + + + G W + R A Sbjct: 221 SWIDRQLLAQDGRFTGLLSNWPHSQQVMPWPRERAERNTGSKHFEAQQHPFKPGYSFAAG 280 Query: 164 LAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWF 223 +A G W QGE ++ D H F +V+++R+ P++ Sbjct: 281 IAPLTSFPIAGVIWYQGESNV--HDIPLHEALFETLVKSWRQHWGHV---------LPFY 329 Query: 224 CGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGL 267 + + N+P ++ A + Sbjct: 330 YAQLSGIERPNWPEFRDSQRQMLARIPHAGMAVSYDVGDSLDVH 373 >UniRef50_A4AM20 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AM20_9FLAO Length = 468 Score = 73.1 bits (177), Expect = 1e-11, Method: Composition-based stats. 
Identities = 30/218 (13%), Positives = 59/218 (27%), Gaps = 22/218 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN L P + Sbjct: 107 EVWIASGQSNMQ-------WTPTNG-------LLNAEEEIKNANFPNIRFFQVDQHTAKF 152 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 ++ G + +V A AR + + + + ++ GG+ Sbjct: 153 PQENTPGKWMECTPETMKDFSSV--AYFFARNIQDRL--SFPMGMISSNWGGTPIETWIP 208 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAAL-AKNPQNKFLGACWMQGEFDLMTSDY 189 A + P + + A+ + G W QGE + + Sbjct: 209 SELINGDMELKKAATKVEEKPWWPNDAGLAYNAMIHPLSKFNIAGCIWYQGESN--RQNP 266 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT 227 S+ + F +++++R DL + AP+ G Sbjct: 267 NSYYKSFPLLIKSWR-DLWRKDFSFYFAQIAPFKYGKM 303 >UniRef50_C6Z4B3 Polysaccharide deacetylase n=10 Tax=Bacteroides RepID=C6Z4B3_9BACE Length = 503 Score = 72.7 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 35/280 (12%), Positives = 72/280 (25%), Gaps = 43/280 (15%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 +V+ AGQSN LP A + + +C Sbjct: 27 HVIITAGQSNTDGRTPNEDLPAYIKA---------------LATDTLTYAEGA-YRYCQI 70 Query: 71 DVQDMQGYHHPLAT-----NHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 D +G P + + +LL +V GG++ Sbjct: 71 AQNDGKGEFIPFWPRAKRSGKNNMWAFDAVTYYWLEQLLQEKFY-----VVKWAVGGTSI 125 Query: 126 ---------TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQN-KFLGA 175 S Q++ L++ + Sbjct: 126 APDYNASKGRFWSAAPEWLAQAKPTSDGGNSLLLSFIQEIDMCIDKTLSRLKDGYQIDAF 185 Query: 176 CWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW---YWK 232 W QGE D + + ++ MV R L + + + + P+ G Y+ Sbjct: 186 LWHQGESD--YAKSKDYYRNLKTMVAYVRMHLTEKTGK--DYSRLPFIFGTVARSNKYFS 241 Query: 233 ENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPD 272 ++ + + N L ++ + ++ + Sbjct: 242 REVENAMKQLAAEDPNMHLIDMSGAELLNDRLHFTAHSAE 281 >UniRef50_B1ZNK9 Sialate O-acetylesterase n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZNK9_OPITP Length = 492 Score = 72.3 bits (175), Expect = 2e-11, Method: Composition-based stats. 
Identities = 42/308 (13%), Positives = 66/308 (21%), Gaps = 78/308 (25%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V V+GQSN P R P + Sbjct: 92 EVWLVSGQSNME-------WPVALLREDERQLAAVD----LPLVRQLKIERAVASQPAET 140 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS- 129 P + + +G AR+L + + I+ GG+ A Sbjct: 141 AKTSG---WQPALRDKVGDFSAIG--YFFARELHRKL--GVPVGIINSSWGGTEIEAWMS 193 Query: 130 --------------------------------EGTYSERHGASHDACRWGTDTPL----- 152 A A T PL Sbjct: 194 DLARQSTSVGAAMEARWQQAKSEWPPERVARYPAEMEAWQKAEEQARATKTKNPLPWPQP 253 Query: 153 --YQDLVSR----TRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRD 206 D R A +A G W QGE ++ A + + F M+ +R + Sbjct: 254 PASDDSPRRPGGLYNAMIAPLRPCALRGFVWYQGESNVGR--AAEYAELFPAMIRTWRAN 311 Query: 207 LKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERG 266 P+ Y +N + A ++ Sbjct: 312 WGDEA--------LPFLFVQIPDYADDNPGGRQ------WARLREAQTHALELPNTAMAV 357 Query: 267 LTNAPDED 274 + D D Sbjct: 358 AIDVGDPD 365 >UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-O-acetylesterase n=9 Tax=Bacteroidales RepID=Q8AAL7_BACTN Length = 884 Score = 72.3 bits (175), Expect = 2e-11, Method: Composition-based stats. 
Identities = 34/218 (15%), Positives = 57/218 (26%), Gaps = 25/218 (11%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAP--HPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN +P+ P + ++ N L Sbjct: 106 EVWFCSGQSNME-----MPVKGFRGQPVYGSQPYIVSANPKRPLRLYTVKNNWSTTLKEE 160 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 D Q + +A T Y L + + ++ C S A Sbjct: 161 GIDGQWSEASSEEVADFSATAY-------FFGNLLQQSLD--VPVGLIHCSWSMSKIEAW 211 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRT----RAALAKNPQNKFLGACWMQGEFDL 184 + E + T+ ++ A + G W QGE + Sbjct: 212 MD---KETLSHFSEVTLPDTNQDKFEWAAGTPTLLWNAMVNPWEGFPVKGVIWYQGEANT 268 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 D + + F MV +R + APW Sbjct: 269 --PDPTLYKKLFPAMVSQWRNFFHNAEMPFYYVQIAPW 304 >UniRef50_D1NBH8 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1NBH8_9BACT Length = 723 Score = 72.3 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 39/349 (11%), Positives = 76/349 (21%), Gaps = 81/349 (23%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDRE--DAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN ++ D+ +PR++ H P Sbjct: 103 EVWLCSGQSNMAWRLNQSEGAEQAIRDSANPRLRLFQVERHWGQVAPE------------ 150 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 G + + VG R+L + + ++ GG+ Sbjct: 151 -----QGTGRWRVSSPESSGTFSGVG--YFFGRRLAAEL--EVTVGLIDVSWGGTRIEPW 201 Query: 129 SE----GTYSERHGASHDAC--------------RWGTDTPLYQDLVSRT---------- 160 G Y + + A +W Y V Sbjct: 202 ISPAELGNYPQLAELNRQAQLFDPASAAHRERLEQWLAACETYNAAVKEALARKSPPPLP 261 Query: 161 -------------------RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVE 201 R + G W QGE + D + + + Sbjct: 262 PKFPAELRCASRDDVGMLFRWMIRPLTPLAVGGTIWYQGEAN--RLDGLVYAEKLKALAT 319 Query: 202 AFRRDLKQYHSQLNNITDAPWFCGDTTWY-WKENFPHSYEAIYGNYQNNVLANIIFVDFQ 260 +RR+ + P++ + + P + +++ Sbjct: 320 GWRREFASP--------EMPFYLVQLAPFAYARQNPVALASVWAAQSRAAREIPNAGMAV 371 Query: 261 QQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRG 309 L + + G A R + G Sbjct: 372 
INDVGNLRDIHPVKKRPVGERLAGLALRRTFGREVPAEFPEPESWKVEG 420 >UniRef50_B7G104 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G104_PHATR Length = 403 Score = 71.9 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 38/267 (14%), Positives = 68/267 (25%), Gaps = 54/267 (20%) Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLL-----PFIPDNAGILIVPCCR 120 P PL+ N G L + L + ++ R Sbjct: 144 DVTPPIAVQEFMSAVPLSPNS-GCGNPYGPELVLGHTLGFLPDGKESGSDLSFIMPKVSR 202 Query: 121 GGSAFT-AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQ 179 GG+ S+ + + W Q Sbjct: 203 GGTQIRGNWSKAEGDLWSTLQSRIAH-----------IDSVSTQCQTGSGCSWDAFVWFQ 251 Query: 180 GEFDLMTS-DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS 238 GE D M + ++ + R +L ++ + P +++ Sbjct: 252 GENDSMDQLNAENYEGDLITFLADVRAELFAAGTRYAAPEEIPVVIVQIGSFFRAR---E 308 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALR 298 + + Q +V A+ F A DDL T Y Sbjct: 309 FGTVVARAQASVAASDAF-------------ASIVWTDDLGTFY---------------- 339 Query: 299 SSHFSTAARRGIISDRFVEAILQFWRE 325 H+ A+ + II DR A+ W++ Sbjct: 340 --HYD-ASSQLIIGDRVARALEGLWKD 363 >UniRef50_D2QKK0 Conserved repeat domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QKK0_9SPHI Length = 990 Score = 71.9 bits (174), Expect = 3e-11, Method: Composition-based stats. 
Identities = 61/295 (20%), Positives = 92/295 (31%), Gaps = 53/295 (17%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +AGQSNA G P +D R+ + + +IPL Sbjct: 123 EVFIIAGQSNAEG-GFQRPPSSVDD----RVMCVDFRQDSL-------SEQLIPLQFSHI 170 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 G P +A++L N IL + GG++ + Sbjct: 171 SYGTSIGPSQPPHIYSI-------LGDKLAQRL------NVPILFLGAALGGTSSADWQQ 217 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 + A L V+RT A W QGE DL +S Sbjct: 218 SAAGNMGTGRNSAVYRRMGAVL-LHYVTRTGA----------RAVLWHQGESDLHSST-Q 265 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNV 250 ++ + +++E R+ L W ++ + + A N Sbjct: 266 TYFDNIKYVIEKSRQQLG--------GKPLAWAVSRASYIFGQTSSSVIAA------QNQ 311 Query: 251 LANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTA 305 L N +F F G+T PD DDL G G R W +L +S F A Sbjct: 312 LINSVFNVFAGPATDGIT-GPDNRFDDLHFGGNGLY-RFASAWDESLTASFFQNA 364 >UniRef50_UPI00016E1BD2 UPI00016E1BD2 related cluster n=2 Tax=Takifugu rubripes RepID=UPI00016E1BD2 Length = 511 Score = 71.6 bits (173), Expect = 4e-11, Method: Composition-based stats. 
Identities = 50/301 (16%), Positives = 85/301 (28%), Gaps = 44/301 (14%) Query: 11 YVLTVAGQSNAMAY-----GEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 + GQSN L P R +AR + +P Sbjct: 87 DIWLCGGQSNMAFQTSRIFNSLEELNLVAKYPDVRPFMVARDWSGTELTDIHYIRVKLPW 146 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 + + + H + G+ LH K I ++ C GG+ Sbjct: 147 SVPSSGTSFIHSFVHSVVAKFSAVCWLFGRYLHDTLKY--------PIGLIDSCWGGTPV 198 Query: 126 TAGSEG--------TYSERHGASHDACRWGTDTPLYQDL---------VSRT-------R 161 A S Y++ A Y+ L +SR Sbjct: 199 EAWSSSRALQQCGLDYNDEFRAFLKKTIPRQTVMNYKILHFQPFKSIVLSRYYKNSVLWN 258 Query: 162 AALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAP 221 A + + GA W QGE + + F M++ +R + N D P Sbjct: 259 AMIHPLVKMTIKGAIWYQGESNANYHQDK-YNCSFPAMIDDWRMAFHRGSG-GNTAADFP 316 Query: 222 WFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTG 281 + + Y K + S+ I + A+ F + ++ + PD+ S G Sbjct: 317 FGFVQLSTYIKYSTDDSFPNIRWHQT----ADFGFAPNLRM-QKTFMAVAMDLPDETSKG 371 Query: 282 Y 282 Sbjct: 372 D 372 >UniRef50_Q9HAT2 Sialate O-acetylesterase n=16 Tax=Tetrapoda RepID=SIAE_HUMAN Length = 523 Score = 71.6 bits (173), Expect = 4e-11, Method: Composition-based stats. 
Identities = 32/218 (14%), Positives = 57/218 (26%), Gaps = 24/218 (11%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + + Q R P D++ + Sbjct: 119 DVWLCSGQSNMQMT-VLQIFNATRELSNTAAYQSVRILSVSPIQAEQELEDLVAVDLQWS 177 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 H + R L + I ++ GG+ A S Sbjct: 178 KPTSENLGHGYFKYMSAVCWL-------FGRHLYDTL--QYPIGLIASSWGGTPIEAWSS 228 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTR------AALAKNPQNKFLGACWMQGEFDL 184 G + G Y + ++ A + G W QGE ++ Sbjct: 229 GRSLKACGVPKQGSI------PYDSVTGPSKHSVLWNAMIHPLCNMTLKGVVWYQGESNI 282 Query: 185 MTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 + + F ++E +R + SQ P+ Sbjct: 283 NY-NTDLYNCTFPALIEDWRETFHR-GSQGQTERFFPF 318 >UniRef50_C3Z127 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Z127_BRAFL Length = 522 Score = 71.6 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 49/312 (15%), Positives = 82/312 (26%), Gaps = 35/312 (11%) Query: 11 YVLTVAGQSNAM---AYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 V +GQSN E A P I+ P + D I Sbjct: 110 DVWVCSGQSNMEFTVRQAFNASYAIAEAADFPDIRLFTA--DLVQSDKPLYDLDKILQPW 167 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + G + R L + I +V GG+ A Sbjct: 168 SVASSDSVGGADWKYFSAVCW---------FYGRDLYNHL--GYPIGLVATSWGGTPVEA 216 Query: 128 G------SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGE 181 + + H S D G + A + GA W QGE Sbjct: 217 WSSPTVLEKCNITNSHEESEDLQYAGPEVGGPSGHSVLWNAMVHPLLNMTITGAIWYQGE 276 Query: 182 FDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 + D ++ F M++++R++ S P+ + K P Sbjct: 277 ANTGHPD--TYSCSFPGMIDSWRKEW-YMGSGGQTDPYFPFGFVQLSTTGK---PSDTGM 330 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 + + + A+ +V + A D D + + R +S Sbjct: 331 GFPAIRWHQTADYGYVP--NPAMPNVFMAVAVDLPDAKSPFGSIHPRDK-----QDVASR 383 Query: 302 FSTAARRGIISD 313 AAR + Sbjct: 384 LVLAARAVAYGE 395 >UniRef50_A7S3W8 Predicted protein (Fragment) n=1 
Tax=Nematostella vectensis RepID=A7S3W8_NEMVE Length = 500 Score = 71.2 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 42/282 (14%), Positives = 73/282 (25%), Gaps = 24/282 (8%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P A + F FN PL Sbjct: 117 DVWVCSGQSNMDFSITQTNNPKEAAAEANHYLHIRLFT-------AERFNSTSPLYELKA 169 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHI-ARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 Q +Y + L + I ++ GG+A S Sbjct: 170 IRQLWSVASSASINGGAWKY--FSAVCWFYGKNLFDRL--QYPIGLISTTWGGTAIEEWS 225 Query: 130 -EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 + ++ S D+ + T P + G W QGE + Sbjct: 226 SPDSLAKCGIQSFDSSKEKTLGPFPNGGSGLYNGMVHPFLNISIYGVIWYQGESNSGAP- 284 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKEN------FPHSY-EA 241 ++ F M+E +RR + + + P+ + Y + + + Sbjct: 285 -QTYNCTFPAMIEDWRRKW-FSGTSHDTDPEFPFGFVQLSSYTSDPTLIDGFPAIRWAQT 342 Query: 242 IYGNY-QNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGY 282 Y N+ + N+ G D G+ Sbjct: 343 ADQGYVPNSSMRNVFMAVAMDLGNATSPYGSIHPTDKRDVGF 384 >UniRef50_UPI00016C4ED6 sialic acid-specific 9-O-acetylesterase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4ED6 Length = 535 Score = 71.2 bits (172), Expect = 4e-11, Method: Composition-based stats. 
Identities = 43/316 (13%), Positives = 79/316 (25%), Gaps = 85/316 (26%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDA--PHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN P+ +P ++ T P H Sbjct: 114 EVWVCSGQSNMEWSVNAGESPEDVKKGAENPNLRLFTVQKRTAP--------------HP 159 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 D D++ + + T G A H +KL + + ++ GG+ A Sbjct: 160 IQDQNDLKHFTKWSVSGPDTVGGFSAVAYHFGQKLQKEL--GVPVGLIHTSWGGTPAQAW 217 Query: 129 ------SEGTYSERHGASHDACRWGTDTPLYQDL--------------VSRTRAALAKNP 168 + + S A G + + + +AA P Sbjct: 218 ASLEALDADPSLKYYADSARAAVKGYENYDAKKAQADYDTSLAKWKENADKLKAAGKPVP 277 Query: 169 Q------------------------------NKFLGACWMQGEFDLMTSDYASHPQHFNH 198 + K GA W QGE + + F Sbjct: 278 KEPTKPGATPPALGPGTPGVLYNAMIYPLLNFKVKGAIWYQGESNAG--KAFEYRTLFTA 335 Query: 199 MVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVD 258 M++ +R+ + P+ + Y + A Sbjct: 336 MIKDWRKQFN---------CELPFMLVQLAPFRGGASGVDYAELRDAQLYATKA------ 380 Query: 259 FQQQGERGLTNAPDED 274 + G +T+ +E Sbjct: 381 LPKTGIAVITDVGNET 396 >UniRef50_C6VZW6 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VZW6_DYAFD Length = 1068 Score = 71.2 bits (172), Expect = 4e-11, Method: Composition-based stats. 
Identities = 46/262 (17%), Positives = 80/262 (30%), Gaps = 32/262 (12%) Query: 11 YVLTVAGQSNAMAYGEGLPLP----DREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 V +AGQSN Y +G P P D + A R+ +A P + P Sbjct: 122 EVFMIAGQSNGEGYRDGQPNPGDIWDAQGAGDDRVSVVAHSTVPDQANLP-SGDSNFPYP 180 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 + H +D + +G +G +L+ + +L GS Sbjct: 181 NFGHLDKD---SNISPRGKTAWCWGRLG------DRLVSKL--GVPVLFFNVAWYGSHVG 229 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-M 185 A E R + + + + ++ R + G W+QGE D Sbjct: 230 AWRESINGGRPKSVYADAYFDPAGMPFGNMRDVIRRYTSLTG---MRGVLWIQGEADTDN 286 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGN 245 + S+ ++EA R + Q D W T++ S + I G Sbjct: 287 RTGTDSYFNDLKAVIEASRNESGQ---------DISWMVSQTSYIRGN---TSNQVIAGQ 334 Query: 246 YQNNVLANIIFVDFQQQGERGL 267 + +F + Sbjct: 335 GRVISEVPNVFQGPLTDLIQTP 356 >UniRef50_A5FC32 Sialate O-acetylesterase n=2 Tax=Flavobacteriaceae RepID=A5FC32_FLAJ1 Length = 511 Score = 71.2 bits (172), Expect = 5e-11, Method: Composition-based stats. 
Identities = 46/284 (16%), Positives = 80/284 (28%), Gaps = 66/284 (23%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P+ E +K P + L Sbjct: 110 EVWLCSGQSNM-----FFPVGREEGTWKTGVK--NYEEEVKNASFPNIRLFTVALNASQK 162 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 ++D+ G + + + V A R L + N I ++ GG+ A + Sbjct: 163 PLEDVTGNWKICSPENIKTFSAV--AYFFGRDLYQKL--NVPIGLISTSWGGTKAEAWTA 218 Query: 131 GTYSERHGASHDACRWGTD---------TPLYQDLVSRTRAALAKNPQNK---------- 171 E A + Y L + A+ A P+ + Sbjct: 219 QNILEDETAFLPILQEDAKNEKIHQEKLEAYYLALTNERIASAANAPKGQLKKPKKEPNK 278 Query: 172 -----------------FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL 214 GA W QGE + + F MV+++R + KQ Sbjct: 279 TSYVLYNAMLHPIVNYTIKGAIWYQGESNSG--KAYLYRSLFPAMVKSWRDEWKQ----- 331 Query: 215 NNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVD 258 D P+ Y+ + PH + + L ++ + Sbjct: 332 ---GDFPF-------YYVQITPHKGQN--AEIREAQLMSLKTIP 363 >UniRef50_A0ZZI1 Sialic acid-specific 9-O-acetylesterase n=6 Tax=Bifidobacterium RepID=A0ZZI1_BIFAA Length = 624 Score = 71.2 bits (172), Expect = 5e-11, Method: Composition-based stats. 
Identities = 46/343 (13%), Positives = 70/343 (20%), Gaps = 93/343 (27%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V GQSN L L + E P +P T + Sbjct: 133 EVWLAGGQSNME-----LELRNSE----------HADEALEDCADPLLRFYNVPKTGVIN 177 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + + V A + ARKL + + + IV C GG++ + Sbjct: 178 RNAENTSSWQESSPENSGVMSAV--AYYFARKLRSELDPDLPVGIVDCYIGGTSISCWMS 235 Query: 131 GTYSERHGASHD--------------------ACRWGTDTPLYQDLVSRTR--------- 161 A D W + + V R Sbjct: 236 EDALNSSDAGRDYLTRYQRAIAGKTQQQFELETSEWQSQMDAWNAAVETVRQTNPNATSS 295 Query: 162 ----------------------------AALAKNPQNKFLGACWMQGEFDLMTSDYASHP 193 A L + G W QGE D S + Sbjct: 296 ELSEQCGTCPWPPPLTPTSQWRPCGPFHAMLERIMPYSLAGFLWYQGEEDEQYSGS--YR 353 Query: 194 QHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLAN 253 + M+ +R + P+ W + + A Sbjct: 354 ELLGMMIGEWRALW---------SENLPFLIVQL-PQWINGKTAADGNDPMRWPVLREAQ 403 Query: 254 IIFVDFQQQGERGLT-------NAPDEDPDDLSTGYYGSAYRS 289 T N D A R Sbjct: 404 WDAAQSIDNVYAICTIDCGEYDNIHPLDKRTPGERLANCALRQ 446 >UniRef50_D1N4T8 Sialate O-acetylesterase n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N4T8_9BACT Length = 973 Score = 70.8 bits (171), Expect = 6e-11, Method: Composition-based stats. 
Identities = 43/301 (14%), Positives = 81/301 (26%), Gaps = 65/301 (21%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPC-HFNDIIPLTHCP 69 V +GQSN + R + P + P Sbjct: 110 EVWLCSGQSNME---------MPLWGGNARFRHYNGDKVAAESNYPLIRIAQMRPYGWSQ 160 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 D + P+ ++ + G R+L + I ++ GG+ + Sbjct: 161 FPRDDFKMSWQPVRPDNIAPFSAAG--FFFGRELFKALD--IPIGLISSHWGGTRIEPWT 216 Query: 130 EGTYSERHGASHDACR------WGTDTPL---------YQDLVSRTRAALAKNPQ----- 169 E + R GT YQ+ +++ + A AKN Sbjct: 217 PPAGFEAVPELANIARSVNAKLPGTKDYQEINAKVVRDYQEWLAKYQDAAAKNQPVPQPP 276 Query: 170 ----------------------------NKFLGACWMQGEFDLMTSDYASHPQHFNHMVE 201 GA W QGE +L D A + + +++ Sbjct: 277 AFPPELKPYENHQQPTVLYNRMLYPFVPFAMRGAIWYQGEANLG--DGALYEKKMEALLK 334 Query: 202 AFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQ 261 +++ + +L AP+ G + +A ++ +A I V Sbjct: 335 GWKQIFRNPDFKLYFAQLAPFNYGGDATRLPRVWEAQ-QAFADKTKDAGMAVINDVGNIS 393 Query: 262 Q 262 Sbjct: 394 D 394 >UniRef50_Q7UL92 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Rhodopirellula baltica RepID=Q7UL92_RHOBA Length = 544 Score = 70.4 bits (170), Expect = 8e-11, Method: Composition-based stats. 
Identities = 40/291 (13%), Positives = 73/291 (25%), Gaps = 74/291 (25%) Query: 11 YVLTVAGQSNAM----AYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 V +GQSN +G D A +P+I+ +P Sbjct: 161 EVWICSGQSNMQWKMRGFGVDHFKEDVVRAKYPQIRFCD-----------------VPQM 203 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 D+Q + + VG +L + I ++ GGS+ Sbjct: 204 LALEGQDDVQAKWTTCSPQTVLNFSAVG--YFFGSRLHQELD--VPIGLISTNWGGSSAE 259 Query: 127 AGS-EGTYSERHGASHDACRWGTDTP------------------------LYQDLVSRTR 161 A E + LY ++ Sbjct: 260 AWVSPEVLKEHFPEFDELFATNAKLADEVGITFGRGQKTPRGLNQRNPSVLYNSMIR--- 316 Query: 162 AALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAP 221 F G W QGE ++ + F ++ +R D P Sbjct: 317 ----PLIPFSFRGVIWYQGESNVKQP--EQYRTLFPALIRDWRSRWG--------AGDFP 362 Query: 222 WFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPD 272 + Y+ + P +Y+ + A ++ + G + D Sbjct: 363 F-------YFVQIAPFAYKQEPISAAYLREAQLMSLSEPNTGMVVTMDIGD 406 >UniRef50_UPI00019689D5 hypothetical protein BACCELL_02528 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI00019689D5 Length = 274 Score = 70.0 bits (169), Expect = 1e-10, Method: Composition-based stats. 
Identities = 47/319 (14%), Positives = 80/319 (25%), Gaps = 85/319 (26%) Query: 13 LTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDV 72 +AGQSNA G D G + + Sbjct: 24 WLIAGQSNASGMG---------DRRTSMKYYSKECFDYVQSGDSLKILQDPVGENGKYFG 74 Query: 73 QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGT 132 + G P A L +++V RGGSA + E Sbjct: 75 KANSGSISP----------------SFAWNLNKMT--GDSVVVVSAARGGSACSTTGETI 116 Query: 133 YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDL-----MTS 187 Y PL+ +++ +A+ KF G W+QGE D Sbjct: 117 YGTWAEK--------GVLPLFDAAMAKCNSAI-AVTGLKFSGVIWLQGERDANAINDGKM 167 Query: 188 DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQ 247 D + +++ FR+ L D P++ T + + Y + + Sbjct: 168 DGEDYEAALRNLILRFRKHLHD--------ADLPFYI-VLTGQYVDRPQEGYHQVRAAQR 218 Query: 248 NNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTAL-----RSSHF 302 +G + + W H+ Sbjct: 219 RLSEK-----------------------------MHGVHLVAADPWLFPQMNMMTDDIHY 249 Query: 303 STAARRGIISDRFVEAILQ 321 S A +I + I + Sbjct: 250 SQYAY-NLIGETIARQIYE 267 >UniRef50_Q1IR02 Sialate O-acetylesterase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IR02_ACIBL Length = 503 Score = 69.6 bits (168), Expect = 1e-10, Method: Composition-based stats. 
Identities = 38/309 (12%), Positives = 70/309 (22%), Gaps = 82/309 (26%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P+ P + + + F + P Sbjct: 103 DVWVASGQSNME-----YPMEGWGGTPKQNLDEFPKANFPTLR----FFQTQHAYSDHPL 153 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG-- 128 ++ V A + A+ L+ + + I+ GGS A Sbjct: 154 MDIPKPAKWVACTPETAKKFSAV--AYYFAKNLIEK--EKVPVGIMEADWGGSVAEAWTS 209 Query: 129 -------------------------------------------SEGTYSERHGASHDACR 145 ++G D Sbjct: 210 LDGLSSKAGLMPIFANRATMMDKYVDEAEIIGPQEQRLKDEAKAKGQPEPSFPWHPDPHS 269 Query: 146 WGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRR 205 W LY ++S G W QGE + + + F M+ +R Sbjct: 270 WAPSE-LYNAMIS-------PLTPYPIRGVIWYQGESNSAYDRAPHYAELFQTMIRDWRN 321 Query: 206 DLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGER 265 D P+ + Y H + + + + + G Sbjct: 322 HWGV--------GDFPFLFVQISAYKSSEAEH--------WGSLRQTQLESLALRNTGMA 365 Query: 266 GLTNAPDED 274 + + D Sbjct: 366 VTIDVGNPD 374 >UniRef50_Q022C5 Sialate O-acetylesterase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q022C5_SOLUE Length = 493 Score = 69.6 bits (168), Expect = 1e-10, Method: Composition-based stats. 
Identities = 34/267 (12%), Positives = 61/267 (22%), Gaps = 69/267 (25%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + P ++ Sbjct: 95 EVWVGSGQSNMEFKLQNAN---------------NHDEEIANANYPMIHLFLVKRAVADQ 139 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D+ G + + V R L + + + ++ GG+ + Sbjct: 140 PADDVVGTWQVCSPASAKAFSAV--EYFFGRHLQQNL--HVPMGLIESDWGGTPAESWIS 195 Query: 131 GTYSERHGASHDACR-WGTDTPLYQDLVSRTRAALAKNPQNK------------------ 171 E + W Y +R ALA P+ Sbjct: 196 RQAVESDASLKFVTENWDKVLANYPAAKTRYETALAAWPKAVEEAKAAGKTPPNKPALPQ 255 Query: 172 -----------------------FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLK 208 GA W QGE + ++ + + F M++ +R Sbjct: 256 GPGHQNTPAGLYNAMIAPLVPYGIRGAIWYQGESNANEANAWRYRRLFGAMIQDWRNRWG 315 Query: 209 QYHSQLNNITDAPWFCGDTTWYWKENF 235 Q D P+ Y + Sbjct: 316 Q--------GDFPFLFVQLANYKSNPY 334 >UniRef50_D2QFV7 Conserved repeat domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QFV7_9SPHI Length = 792 Score = 69.6 bits (168), Expect = 1e-10, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 59/223 (26%), Gaps = 28/223 (12%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V AGQSN+ G G R+ + H +P PP + P Sbjct: 121 EVFITAGQSNSRGLGIGDN---DLGTNTDRVNAIDSINHYYP-QPPSLPALVSSGDPMPV 176 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + ++ +G +I + N + S + Sbjct: 177 PRYKALTAARRIFPMAESSWGWGELGDYIVNRF------NVPVAFYVAGWDASTIDNWYK 230 Query: 131 GTYSERH-GASHDACRWGTDTPLYQDL--VSRTRAALAKNPQNKFLGACWMQGEF--DLM 185 A + + Y +L V R A+A W QGE D+ Sbjct: 231 TANGIATCNAYYCVGGDWPNLQPYTNLKNVLRYYGAVAG-----VRAVLWHQGEAEGDIA 285 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT 228 S ++ ++ R D N PW + Sbjct: 286 ASSIPNYANLLKAVIAKSRADF--------NGWSLPWMVARAS 320 >UniRef50_D2EUP1 Sialate O-acetylesterase n=1 Tax=Bacteroides sp. D20 RepID=D2EUP1_9BACE Length = 474 Score = 69.6 bits (168), Expect = 1e-10, Method: Composition-based stats. 
Identities = 33/271 (12%), Positives = 63/271 (23%), Gaps = 49/271 (18%) Query: 13 LTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDV 72 +GQSN + ++ Sbjct: 119 WLCSGQSNME---------------YCFKWRVDDITDRSTLFDNKKIRFFKVAKSSSAYP 163 Query: 73 QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGT 132 + + + + +V A ++L + N I ++ GG+A Sbjct: 164 VERIQGKWEICSPETAEDFSV-VAFCFGKRLNEEL-GNLPIGLIGSYWGGTAIEPWM-DE 220 Query: 133 YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASH 192 ++ RH + + T S A + G W QGE + + + Sbjct: 221 FTLRHEKLEEKTKALTAGWAPTANSSLYNAMIHPIINYTIAGVVWYQGEAN--NERHQDY 278 Query: 193 PQHFNHMVEAFRRDLKQYHSQLNNITDAPW----------------------------FC 224 F+ M+ +R + + PW Sbjct: 279 GVMFDAMIRGWRNAFHH-YLPFYFVQITPWSGYADKNAAYLREQQADVAATLRNTGMVVA 337 Query: 225 GDTTWYWKENFPHSYEAIYGNYQNNVLANII 255 GD + P + N L N Sbjct: 338 GDLVNDLTDIHPSLKRQVGERLANMALKNSY 368 >UniRef50_B2UQD5 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQD5_AKKM8 Length = 518 Score = 69.6 bits (168), Expect = 2e-10, Method: Composition-based stats. 
Identities = 46/327 (14%), Positives = 89/327 (27%), Gaps = 50/327 (15%) Query: 6 SPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 S + + GQSN++ +G P + ++++ Sbjct: 18 SAKELKIFLLTGQSNSLGAVKGSPASPELLK-------------KYEPKETLYWHENFGQ 64 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPF-IPDNAGILIVPCCRGGSA 124 A +G A L +A + +V R G Sbjct: 65 REGVFPGASTSWEQVRPAMPRYNGNLCMGPEYGFAFTLEKNGWFKDADVAVVKASRDGGD 124 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALA-----KNPQNKFLGACWMQ 179 + + + Y+ LV + A A K + +F G ++Q Sbjct: 125 NSHWQKNGQA------------------YRTLVQAVKNACAGVDRSKYSKVEFAGLLYLQ 166 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGD-TTWYWKENFPHS 238 GE + TS S F ++ DLK + + + G+ W K Sbjct: 167 GESNAGTSVPES-ASRFLELLGNLAADLK-PYGDTSALAAQKAVLGENANWAGKNESDPE 224 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALR 298 + G+ + + +Q + + P Y P+ Sbjct: 225 TGNLTGSLEGRDTE-VQGKTTRQVMKDLAESRPSLG--------YAPTRDLPKLTAGDQM 275 Query: 299 SSHFSTAARRGIISDRFVEAILQFWRE 325 H+S ++ I RF + + Sbjct: 276 GVHYSGQSQISI-GARFAYEAARMAGK 301 >UniRef50_A8ITT3 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8ITT3_CHLRE Length = 304 Score = 69.6 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 49/171 (28%), Gaps = 38/171 (22%) Query: 18 QSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDVQDMQG 77 QSNA+ P ++ + P + D P C +G Sbjct: 113 QSNAVGENMQGSRPACCSPVPGKLLTFNLGNN-----PTNQWRDATPCVGCIS-----RG 162 Query: 78 YHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF-TAGSEGTYSER 136 + Y + G L R LL + VP GG+ G Sbjct: 163 ANPAF-------YDSCGPDLGFGRVLLQLGVSG-RVGFVPTAAGGTNLADMWCPGC---- 210 Query: 137 HGASHDACRWGTDTPLYQDLVSRTRAAL-AKNPQNKFLGACWMQGEFDLMT 186 PLY+D+ A+ A P + G W+QGE D Sbjct: 211 --------------PLYKDMAQTVVRAMRAAGPNARLRGMLWVQGESDANN 247 >UniRef50_UPI0001BC7C56 hypothetical protein BacD2_14780 n=1 Tax=Bacteroides sp. 
D2 RepID=UPI0001BC7C56 Length = 568 Score = 68.9 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 44/315 (13%), Positives = 88/315 (27%), Gaps = 48/315 (15%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 + + + +V+ VAGQSN LP+ IK + + G Sbjct: 29 STRAQEPAHVIIVAGQSNTDGRVPVADLPEY-------IKSMGIDSTGFAKGA------- 74 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 +C + G P L ++ GG Sbjct: 75 --YKYCKISQNRVDGKFVPFWPRRNRW------GYDAVTYYLLEQLYQKEFYVIKWAVGG 126 Query: 123 SAFTAGS---EGTY----SERHGASHDACRWGTDTPLY--QDLVSRTRAALAKNPQN-KF 172 ++ T + G Y E + + G L Q + + L+ P+ Sbjct: 127 TSITPENTDSRGGYWSATPEWLAQNTPTAKKGKSLLLSFTQQISNSISKTLSHLPEGYHI 186 Query: 173 LGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWK 232 W QGE D + ++ ++V R L + + + ++ P+ G K Sbjct: 187 DAFLWHQGESDSAYGP--DYYENLKNVVSYVRDHLTRKTGE--DYSELPFIFGSVAKSNK 242 Query: 233 ENFP---HSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRS 289 + + + +N L ++ + D S Y G Sbjct: 243 RYNAEVEAAMKRLASEDKNAYLIDMSKATLLKDRLHF---------DKTSAEYLGKQMYD 293 Query: 290 PENWTTALRSSHFST 304 +++ + + Sbjct: 294 TMIQASSVNVTSMQS 308 >UniRef50_C6VYE2 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VYE2_DYAFD Length = 665 Score = 68.5 bits (165), Expect = 3e-10, Method: Composition-based stats. 
Identities = 44/293 (15%), Positives = 80/293 (27%), Gaps = 39/293 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V VAGQSNA G P+ A H ++ + F + +P + L CP Sbjct: 127 EVFVVAGQSNATG---GDSNPNGPGAAHDQVNSVD-FQNVNPANSTITPYPDVQLP-CPA 181 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 V N+ +G K+ ++I + + Sbjct: 182 FVHLDAQTKMAPFGNYAWCWG------SFGDKIYEKF--RVPVMIFNAGWSSTGINNWKQ 233 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK-FLGACWMQGEFD--LMTS 187 T S + T P R AL W QGE D L Sbjct: 234 TTDPNGITTSAFGYTFPTGLP-----FGHLRLALNNYIAQLGVRAVLWHQGETDNLLEQP 288 Query: 188 DYASHPQHFNHM---VEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYG 244 ++ ++ + + V A R + + W + + N + Sbjct: 289 GDDTYSRYLSGLWDVVNASRNLSGKPNLA--------WVVARASRFTVNNISRVSTNVVN 340 Query: 245 NYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTAL 297 + + ++ Q E + + D++ +R + Sbjct: 341 AQNELINNDGLYPHVFQGPETDPYYSIEYRHDEV-------HFRGDGVTQSPD 386 >UniRef50_D2QD65 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QD65_9SPHI Length = 989 Score = 68.1 bits (164), Expect = 5e-10, Method: Composition-based stats. 
Identities = 46/278 (16%), Positives = 81/278 (29%), Gaps = 41/278 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V VAGQSNA +DAP+P L + P P T Sbjct: 123 EVFVVAGQSNAQG--------IHQDAPNP----LNDLVNCVNYRYPDQGFPNEPPTPVFT 170 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + G+ +G +G +A++L IL G+ + Sbjct: 171 QLDNSSGFTIAPRGMGSWAWGQLG--DILAKRL------RVPILFFNAAFTGTFVRNWRD 222 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNK-FLGACWMQGEFDLMTSDY 189 G ++ Y +L + AL + W QGE D + + Sbjct: 223 SA--PEGGVAYGPGGAYPARQPYINL----KLALQFYANSLGVRAVLWQQGESDNLYNTS 276 Query: 190 A-SHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT--TWYWKENFPHSYEAIYGNY 246 + +++ R++ ++ W + P +A N Sbjct: 277 KDQYVNDLQYVINQSRQEYN---------SNTSWVVARVSYGDFTGGVDPAIIDA--QNQ 325 Query: 247 QNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYG 284 + AN+ + P DP+ + Y G Sbjct: 326 VISTTANVFAGPNTDVIQIPRQRPPRNDPEGVHFDYNG 363 >UniRef50_Q7UPP2 Sialic acid-specific 9-O-acetylesterase n=4 Tax=Bacteria RepID=Q7UPP2_RHOBA Length = 732 Score = 66.9 bits (161), Expect = 8e-10, Method: Composition-based stats. 
Identities = 46/341 (13%), Positives = 83/341 (24%), Gaps = 90/341 (26%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + + P + +P P Sbjct: 116 EVWLCSGQSNMEWRVQSS---------------VNAAEEIASANFPQIRHMKVPRVPSPV 160 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + D+ + Q+ G +AR+L + N I +V GG+ + Sbjct: 161 AMDDVPAPWQVCSPETVGQFTAAG--YFMARRLHQEL--NVPIGLVNSSWGGTRIEPWTP 216 Query: 131 ----GTYSERHGASHDACRWGTDTPLYQDL-----------VSRTRAALAKNPQNK---- 171 E S + + ++ V++ AALA N + Sbjct: 217 PVGFEGVEELKDISESVTQRTPGSEPFKSALRGHLQSTKAWVAKAEAALANNTFIEPAPA 276 Query: 172 -----------------------------FLGACWMQGEFDLMTSDYASHPQHFNHMVEA 202 GA W QGE + + + ++E Sbjct: 277 YPSSLTPFTSHGQPTTLYNGMIHPLIHMPIRGAIWYQGEAN--HREGMLYTAKMRALIEG 334 Query: 203 FRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQ 262 +R Q P+ Y+ + P+ Y G+ F + Q Sbjct: 335 WRAKWNQ--------GPFPF-------YFVQIAPYHYGDESGSVLAK------FWEAQTA 373 Query: 263 GERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFS 303 D + + A + H Sbjct: 374 ALAIPNTGMVVTNDIATVNDIHPPNKQDVGKRLADLALHHD 414 >UniRef50_D1N7W8 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N7W8_9BACT Length = 476 Score = 66.9 bits (161), Expect = 8e-10, Method: Composition-based stats. 
Identities = 37/256 (14%), Positives = 60/256 (23%), Gaps = 41/256 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDR---EDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 V AGQSN YG R E P+P I+ P Sbjct: 102 DVWLCAGQSNMQ-YGLQSITGSRKLIEAFPNPDIRLFQVPNVW----------SRTPQAD 150 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF-- 125 P + + R L + N I ++ GG Sbjct: 151 VKAQWNLCSPQTLPRFSA---------VGYLVGRDLQSKL--NVPIGLINISWGGCRIES 199 Query: 126 ------------TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFL 173 T G +++ + A + Sbjct: 200 MTAPESFAAVSVTQGVADEVAKQIADLKSKKDADLRKDKQRLPNVLFNAMVHPLTPFAVR 259 Query: 174 GACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 G W QGE + S+ + + + +R + + PW G Sbjct: 260 GMLWYQGEDN--HSEGMRYAEKLRALAHTWRTYFANPDMPIFIVQLPPWQYGREKSTLIP 317 Query: 234 NFPHSYEAIYGNYQNN 249 NF + + + N Sbjct: 318 NFWAAQQHFAEHDSNA 333 >UniRef50_UPI000196858D hypothetical protein BACCELL_00130 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196858D Length = 465 Score = 66.6 bits (160), Expect = 1e-09, Method: Composition-based stats. 
Identities = 43/292 (14%), Positives = 76/292 (26%), Gaps = 26/292 (8%) Query: 11 YVLTVAGQSNAMAYGEGLPLPD-REDAPHPRIKQLARFAHTHPGGPPCH-FNDIIPLTHC 68 V +GQSN +P+ + +K + A T P C Sbjct: 106 EVWLCSGQSNME-----MPIKGFKNQRVDGSLKAIVTSADTQIRLCQIKRIASDAPQESC 160 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + + + K+L + I I+ GGSA A Sbjct: 161 ESEWKT----------CGPESVANFSATGYFYAKMLRQV-LGVPIGIIEADWGGSAIEAW 209 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRT-RAALAKNPQNKFLGACWMQGEFDLMTS 187 + + P Q L ++ + G W G + Sbjct: 210 MSNESLQSIPQQLKTSKNIRKKPQIQHLPNKLFNGMIHPIIGYGIKGVIWWHGANNSRQ- 268 Query: 188 DYASHPQHFNHMVEAFRRDL--KQYHSQLNNITDAPW--FCGDTTWYWKENFPHSYEAIY 243 Y ++ F MV +R + L + P+ G ++ Sbjct: 269 -YYNYELLFRTMVNDWRNRWGIGDFSFNLAQLAPYPFNNVMGYMREAQVNCARNTPNCDI 327 Query: 244 GNYQNNVLANIIFVDFQQ-QGERGLTNAPDEDPDDLSTGYYGSAYRSPENWT 294 + + + ++ GER A + Y G Y+S E Sbjct: 328 AILLDISDSTFMHGPIKEVVGERFAYIALAQTYGMKGFEYTGPIYKSMEIKK 379 >UniRef50_A7M023 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7M023_BACOV Length = 643 Score = 66.2 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 36/156 (23%), Gaps = 21/156 (13%) Query: 120 RGGSAF---TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGAC 176 G T LY ++ G Sbjct: 374 IAGQTIPLTAMWQHKIGCTMKRIPSTIGFQNEPTGLYNSMI-------HPLRNYGIRGII 426 Query: 177 WMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 W QGE D + +H +V +R N + P+ Y ++ Sbjct: 427 WYQGESDTGPEGSKHYERHLIDLVNDWRTQW--------NNKNLPFVIVQLANY-QQRSK 477 Query: 237 HSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPD 272 E+ GN Q + + G + + Sbjct: 478 VPVES--GNAQVREAQRKASLQLKNVGLATAIDLGE 511 >UniRef50_C0A737 Sialate O-acetylesterase n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A737_9BACT Length = 521 Score = 65.8 bits (158), Expect = 2e-09, Method: Composition-based stats. 
Identities = 35/271 (12%), Positives = 61/271 (22%), Gaps = 78/271 (28%) Query: 11 YVLTVAGQSNAMA--YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V GQSN +A P+I+Q + P Sbjct: 110 DVWLCGGQSNMEWTVRKSAGAAEAIAEADLPQIRQF----------------RVRPAISD 153 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + G + ++ +G +AR L ++ C GG+A Sbjct: 154 EPLSEPKDGKWVVCSPKTVGEFTALG--FFVARDLFRG--SGVPQGLINCSWGGTAIETW 209 Query: 129 SEGTYSERHGASH---------------------------------------------DA 143 G Sbjct: 210 LSGNAYASGARPELSVVKARWEKRRANYPAAKLAYDQAKAAFEEEKRAAEAAGASFTKRP 269 Query: 144 CRWGTDTPLYQDLVSRT-RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEA 202 R + P + + SR A L A W QGE + + F ++E Sbjct: 270 PRAPSGGPQDRAVPSRAFNAMLNPLAGYGVRAALWYQGEAN--WQFPREYAGLFAALIED 327 Query: 203 FRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 +R+ + P+ + + Sbjct: 328 WRQAWGSP--------ELPFVFMQLPGFGGD 350 >UniRef50_D2QD66 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QD66_9SPHI Length = 727 Score = 65.8 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 61/207 (29%), Gaps = 29/207 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V + GQSNA + + +D R+ +A L+ P Sbjct: 130 EVFIITGQSNAQGFQNYGAVGAVDD----RVNCVAYDNT-----------KANSLSDPPA 174 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + Q+ + G ++ L+ N IL + G+ +E Sbjct: 175 PTFQQLTATSLIGPRGQSAW-CWG---YLGDLLVKQY--NVPILFINTAWVGTTIQNWTE 228 Query: 131 ---GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTS 187 G ++ A Y +L++ R + W QGEFD Sbjct: 229 SSLGKVTKNLFALGTPDENFPAGMPYGNLITALRYYCSLQG---LRAVLWQQGEFDNFPL 285 Query: 188 DY--ASHPQHFNHMVEAFRRDLKQYHS 212 + + +V R D +Y + Sbjct: 286 RSTRQDYANNMQFLVNKTRNDTDRYPA 312 >UniRef50_A0YRB5 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YRB5_9CYAN Length = 567 Score = 65.4 bits (157), Expect = 3e-09, Method: Composition-based stats. 
Identities = 30/271 (11%), Positives = 59/271 (21%), Gaps = 37/271 (13%) Query: 11 YVLTVAGQSNA----MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLT 66 V GQSN + + P R+ + P Sbjct: 114 EVWVAGGQSNMWWFVSRSDHSQQEISQANFPKIRVWDANTSPQ----------ENGWPAN 163 Query: 67 HCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFT 126 + P A++L + N I IV Sbjct: 164 TPQKTIPAKWELTTPKTVKDFPATAYF-----FAKELHQNL--NVPIGIVHLAVPNREIE 216 Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 + W + + G W QGE + Sbjct: 217 TFLSQPLLAAN-FPETIAFWQLEKDAKTRPAQLFNGMVYPAIPYTPRGFIWWQGESNAKE 275 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW-KENFPHSYEAIYGN 245 + F +++ +R ++ P+ + + K+ +P + Sbjct: 276 --AMQYRTLFPSLIQEWRSLWGNDNA--------PFLFVELANFLEKQTYPVEDDP---- 321 Query: 246 YQNNVLANIIFVDFQQQGERGLTNAPDEDPD 276 + A + + DE+ D Sbjct: 322 WPRLRSAQKEALKLPNTAMISTIDILDENND 352 >UniRef50_D2QJG4 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QJG4_9SPHI Length = 672 Score = 65.4 bits (157), Expect = 3e-09, Method: Composition-based stats. 
Identities = 42/278 (15%), Positives = 84/278 (30%), Gaps = 53/278 (19%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V + GQSNA+ +P C Sbjct: 113 DVFLIMGQSNAVGQYGDYTFRSTFCRT---FGIHNFNNTYNPA----------DTAWCLT 159 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + ++ G L ++++ ++ GG+ T+ Sbjct: 160 NTKEGLTCLW-------------GMEL---QRMIKEN-QGIPTAVINGAVGGTTITS--- 199 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 A+ DA + + + LY L+ R A W QGE + ++ Sbjct: 200 -------HANRDASQPTSLSTLYGRLLYRASKAGVARQA---KAMIWRQGEAEA-ANNPD 248 Query: 191 SHPQHFNHMVEAFRRDL----KQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNY 246 + + F + +++D K YHSQ+N +T+ G Y + + + I+ + Sbjct: 249 VYVRTFPQLYSYWKQDYPGLKKIYHSQINILTNNVVNAGVLRDYQRRS-----KYIFSDN 303 Query: 247 QNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYG 284 + + + E G E ++ +YG Sbjct: 304 EPIATVGLSGYEGIHYNEAGHYQFGLELYRLIARDFYG 341 >UniRef50_C3Y8H9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y8H9_BRAFL Length = 537 Score = 65.4 bits (157), Expect = 3e-09, Method: Composition-based stats. 
Identities = 43/322 (13%), Positives = 83/322 (25%), Gaps = 50/322 (15%) Query: 11 YVLTVAGQSNA-----MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 V +GQSN M + + + PH R+ P D + Sbjct: 121 DVWVCSGQSNMQFTVMMGFNSTEEILAANNYPHIRLFTAGLVGSDT----PLSNLDRVEQ 176 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 + G + R L + I +V G+ Sbjct: 177 PWSVASAASVGGGMWNYFSALCW---------FYGRDLYDTL--QYPIGLVASSFEGTPI 225 Query: 126 TAG------------SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFL 173 + + +++ +H G ++ A + Sbjct: 226 ESWSSPEVLQKCKVNDTLSKTDKLSVNHPNNIPGVSGDHQDSVL--WNAMIHPILNMTIK 283 Query: 174 GACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 GA W QGE + D + F M++++R + P+ + Sbjct: 284 GAIWYQGESNTDHPDS--YLCLFPAMIDSWRSSW-YLGTGGQTDPTFPFGFMQIS----G 336 Query: 234 NFPHSYEAIYGNYQNNVLANIIFVDF-QQQGERGLTNAPDEDPDDLSTGYYGSAYRSPEN 292 N S + Y + + A+ +V + + PD + + Sbjct: 337 NVLSSTDLGYPEIRWHQTADYGYVPNPKMEKVFMAVAMDLWRPDSP--------WLTVHP 388 Query: 293 WTTALRSSHFSTAARRGIISDR 314 + S AAR D+ Sbjct: 389 QDKQDMGTRLSLAARAVAYGDK 410 >UniRef50_C0ADE1 Sialate O-acetylesterase n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0ADE1_9BACT Length = 520 Score = 65.0 bits (156), Expect = 3e-09, Method: Composition-based stats. 
Identities = 32/309 (10%), Positives = 67/309 (21%), Gaps = 78/309 (25%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + + P + + P Sbjct: 105 DVWLCSGQSNMVWHVNQAE---------------GGRDDAKTANFPALRHFKVTRAKSPT 149 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG-S 129 + +++G + + V A + R L + ++ G+ A S Sbjct: 150 PLTELEGEWQVASPKTVRGWSAV--AFYFGRALHQQ-RGGTPVGLINTAWSGTDIEAWIS 206 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRA----------------ALAKNPQNK-- 171 + RW Y + + LA Q K Sbjct: 207 SESIDTSPVGPAVRARWRELATDYPARAQKHKTDFAAWEKARAEAEAAGTLANFKQRKPR 266 Query: 172 --------------------------FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRR 205 G W QGE + + + ++ +R Sbjct: 267 APVGPGSSSEPSVIFNGEIHPLIPYALAGMIWYQGENNT--PRSSEYAALQRSLIRDWRA 324 Query: 206 DLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGER 265 + P+ N+ S++ + + + G Sbjct: 325 RWESPS--------LPFLYVQLP-----NYAASHKPLGTGWAAFRDEQARVLSEPATGMA 371 Query: 266 GLTNAPDED 274 + + Sbjct: 372 ITIDVGESG 380 >UniRef50_C1E516 Sialic-acid o-acetylesterase-like protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E516_9CHLO Length = 772 Score = 64.2 bits (154), Expect = 7e-09, Method: Composition-based stats. 
Identities = 36/282 (12%), Positives = 61/282 (21%), Gaps = 65/282 (23%) Query: 11 YVLTVAGQSNA-----MAYGEGLPLPDREDAPHP-RIKQLA-------RFAHTHPGGPPC 57 V AGQSN ++ + + RI +A R Sbjct: 150 DVWLCAGQSNMQFTVAASFDAAAHIAESATVGDGIRIATVAMTAADVERHDVAAATAGDA 209 Query: 58 HFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHI-ARKLLPFIPDNAGILIV 116 + P+ + G A +L + ++ Sbjct: 210 AYEKSAWAKAKPNAFNPPATPDPTAPYGGWSDMGWFSAACWFHGLELYERSFREVPVGLI 269 Query: 117 PCCRGGSAFTAGS-----------------------EGTYSERHGASHDACRWGTDTPLY 153 GG A S G + R S Y Sbjct: 270 VAAWGGQAIEPFSSKEALADETCGGTRGPSGTPGFGPGVNARRRETSSIVRPEERPGEPY 329 Query: 154 QDLVSRT------------------RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQH 195 + + +A + G W QGE + + S+ Sbjct: 330 RRIADTHPDEESPPVFPVFGPSALWNGMIAPLLNTRLKGVAWYQGEAN--WATPESYACR 387 Query: 196 FNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPH 237 F M+ +R P+ Y K ++ H Sbjct: 388 FPAMIADWRARFASPRM--------PFVFVQLAAYPKRDYSH 421 >UniRef50_A6ECR5 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Pedobacter sp. BAL39 RepID=A6ECR5_9SPHI Length = 492 Score = 63.9 bits (153), Expect = 9e-09, Method: Composition-based stats. 
Identities = 42/275 (15%), Positives = 80/275 (29%), Gaps = 53/275 (19%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDR--EDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 + +GQSN + + + +DA P I+ L+ + Sbjct: 112 DLWLCSGQSNMQFAVKEMTGAEAVIQDADQPNIRLLSVGLNFS----------------- 154 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVG-QALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 + G+ + RKL + + I ++ G S+ A Sbjct: 155 ---ADPIDGFSGKWQQCSSSSVAGFSAVGYCFGRKLYEEL--HIPIGLIFSGIGASSVQA 209 Query: 128 GSE-----------GTYSERHGASHDACRWGTDTPLYQDLVS---RTRAALAKNPQNKFL 173 TY + + + + ++ +V A + Sbjct: 210 YLPREVLSGDALLDQTYLQPYLSDPKSKEKVDGGFSFEKVVRPFLLYNAMINPLTNLSIK 269 Query: 174 GACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 G CW QGE + + S+ +V+ +R KQ + P++ Y+ E Sbjct: 270 GFCWYQGEANHLER--ESYTLATQTLVKTWRERFKQ--------GELPFYYVQIAPYFHE 319 Query: 234 N--FPHSYEAIYGNYQNNV--LANIIFVDFQQQGE 264 +A + Q V L N V G+ Sbjct: 320 KEDPKFGTDAFFREAQERVGQLNNTFMVSTMDVGD 354 >UniRef50_C0ACS1 Sialate O-acetylesterase n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0ACS1_9BACT Length = 538 Score = 63.5 bits (152), Expect = 9e-09, Method: Composition-based stats. 
Identities = 43/326 (13%), Positives = 74/326 (22%), Gaps = 80/326 (24%) Query: 2 NAIISPDYYY------VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGP 55 +A P V +GQSN + D D A Sbjct: 109 SATPEPVRLKNILVGEVWLASGQSNMA-----WLVKDTTD----------ARAIIDASAN 153 Query: 56 PCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILI 115 P +P + D G + + V A H ARKL + + Sbjct: 154 PAVRTYNLPRVIADDPLADAPGKWRVASPSTTGWMSAV--AYHFARKL--NADLGVPVAV 209 Query: 116 VPCCRGGSAFTAGSEGTYSERHG--------ASHDACRWGTDTP---------------- 151 + RG + + + +++ D W Sbjct: 210 ICSSRGSTRIQSWIPLSVFDQNPAFAADRDQWQQDLTAWPQKKASWEKKLDDWKSRAAAA 269 Query: 152 ------------------LYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHP 193 Y A +A + G W QGE A + Sbjct: 270 RSAGKPEPKRPEQPPGPGHYNQPAGYYNAMIAPVTKVPIRGFLWYQGE--ANARRAARYR 327 Query: 194 QHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLAN 253 + ++ ++R + D P+ Y F + Sbjct: 328 ETLPALITSWRETWGR--------GDLPFLIVQLAAYRAPGFAPDGLE-RAGLREAQALA 378 Query: 254 IIFVDFQQQGERGLTNAPD--EDPDD 277 + + + P+ E P D Sbjct: 379 VERLPATGLVVTLDVSGPESKEHPRD 404 >UniRef50_A6DF88 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF88_9BACT Length = 524 Score = 63.5 bits (152), Expect = 1e-08, Method: Composition-based stats. 
Identities = 39/308 (12%), Positives = 65/308 (21%), Gaps = 79/308 (25%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN G P + T Sbjct: 104 EVWICSGQSNMQWRLSG---------------AFNSKEEIAAAKFPAIRQLNLTRTAANL 148 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D++G + Y VG AR L + I I+ GG+ Sbjct: 149 PRTDVKGEWTVCSPETAKDYTAVG--YFFARSLYKSL--KIPIGIIDASWGGTGIEPWIP 204 Query: 131 G-------------------------TYSERHGASHDACRWGTD-------TPLYQDLVS 158 + + + W L ++ Sbjct: 205 ELGFNMVAELEKDKIELHKILPVNDKNKQKWNDYLDELESWLPGAEKRVSQGSLPLNMPD 264 Query: 159 RTRAALAKNPQNK---------------FLGACWMQGEFDLMTSDYASHPQHFNHMVEAF 203 R + + G W QGE DY + ++E++ Sbjct: 265 RPDSGMPVTHHGMTKIFNSMIHPIIPYGVRGVVWYQGESSWGQGDY--YFFKKKALIESW 322 Query: 204 RRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQG 263 R Q + P++ K N Y A ++ G Sbjct: 323 RELWGQ--------GEFPFYTVQLANLHKFNEKAEGG---DGYAKIREAQKRSLELPNTG 371 Query: 264 ERGLTNAP 271 + Sbjct: 372 LAVTYDIG 379 >UniRef50_D2R456 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R456_9PLAN Length = 1282 Score = 63.1 bits (151), Expect = 1e-08, Method: Composition-based stats. 
Identities = 31/202 (15%), Positives = 55/202 (27%), Gaps = 44/202 (21%) Query: 13 LTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDV 72 + GQSNA+A G P + Sbjct: 933 YLIIGQSNAVATDFGKENP-------------------LVASDWVRTFGATSGDPQGARL 973 Query: 73 QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGT 132 + G + + Q G G L R+L+ + I ++ GG+ Sbjct: 974 KQW-GNAEARSPEGKLQIGYWGMEL--GRRLVES--EKLPICLINGAVGGTRIDQHQRNA 1028 Query: 133 YSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD------LMT 186 ++ Y L+ R + A + W QGE D Sbjct: 1029 DDPTDVSTI-----------YGRLLWRVQQAKLTHG---IRAILWHQGENDQGADGPTGG 1074 Query: 187 SDYASHPQHFNHMVEAFRRDLK 208 Y ++ Q F + ++++D Sbjct: 1075 YGYETYRQFFVDLAGSWKQDYP 1096 >UniRef50_D2QHE7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QHE7_9SPHI Length = 661 Score = 62.7 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 58/207 (28%), Gaps = 42/207 (20%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V + GQSNA L L D + + + + P+ Sbjct: 122 DVYIIHGQSNA------LALSDFDG-----LYSFNFN------------DRYMRNVAYPY 158 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 Q +P + G L + R +L ++ GG+ +A Sbjct: 159 LGLPSQMSWYPAKQPFASVG---GLGLTLQRLILENY--GIPTCVINGAMGGTPISA--- 210 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYA 190 + D Y DL++R + A W QGE D S Sbjct: 211 -------LSVRDPLNHANPITFYGDLLNRAQWAGVAKQT---KAIIWKQGEEDAG-SGLP 259 Query: 191 SHPQHFNHMVEAFRRDLKQYHSQLNNI 217 +P F + FR D + I Sbjct: 260 GYPAKFATLYNQFREDYGNARIYVGQI 286 >UniRef50_A6DLA5 Sialic-acid O-acetylesterase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLA5_9BACT Length = 482 Score = 62.3 bits (149), Expect = 2e-08, Method: Composition-based stats. 
Identities = 35/229 (15%), Positives = 59/229 (25%), Gaps = 49/229 (21%) Query: 11 YVLTVAGQSNA-MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V +GQSN M +G P IK++A A N+ + Sbjct: 101 EVWICSGQSNMQMGWGGI-----------PEIKKMAVNAKNI---RTFKVNNTVSYQEED 146 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + A A L I I I+ G S+ Sbjct: 147 YCQGSWVEAAPGSAVAAA-----------FAVNLQKSI--ICPIGIIQASWGSSSVEGWM 193 Query: 130 EGTYSERHGASHDACRWGTDTPLY--------------QDLVSRTR------AALAKNPQ 169 +E+ ++ RTR A + Sbjct: 194 PMDMAEKLPHFKKELEDCRANDKDKVAEILAKKKMSGKDNIFLRTRPNLLYNAMMHPLIP 253 Query: 170 NKFLGACWMQGEFDLMTSDYA-SHPQHFNHMVEAFRRDLKQYHSQLNNI 217 G W QGE + + + + Q + +R++ + QL + Sbjct: 254 YSLRGVVWYQGEANTKSVEAMLQYGQTLPMWFQRYRQEWDNENLQLMAV 302 >UniRef50_D2BR70 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Lactococcus lactis subsp. lactis KF147 RepID=D2BR70_LACLK Length = 615 Score = 61.5 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 54/179 (30%), Gaps = 27/179 (15%) Query: 108 PDNAGILIVPCCRGGSAFTA---GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAAL 164 P IL+V G + S + T LY +++ + Sbjct: 346 PSKPHILLV----GENRLDLNHGWKIRRSSTLPERYKEYFINYEPTGLYNGMIATLQKL- 400 Query: 165 AKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFC 224 KF W QGE D + ++ F ++E++R+ KQ + P+ Sbjct: 401 ------KFAAILWYQGESDAGSP--QNYGPRFRELIESWRKLFKQPN--------LPFLY 444 Query: 225 GDTTWYWKEN---FPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLST 280 E + E + + A ++ + + + N D L+ Sbjct: 445 VQLPNCDTEKEADWARLREEQKEGLKISRTAMVVTIGDGEDDDLHPLNKKDVAHKLLNA 503 Score = 58.1 bits (138), Expect = 4e-07, Method: Composition-based stats. 
Identities = 29/232 (12%), Positives = 59/232 (25%), Gaps = 34/232 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V + GQSN +R +P P +P + Sbjct: 87 DVFLLGGQSNMQ------LWMERLKTRYP--------DEIEQAKNPWIRYFEVPQEPSFN 132 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 +++ + G A++ F D I ++ GG+ A Sbjct: 133 NIKTELILGQWKRAIGEDLKNLSGIGYFFAKE--KFSEDGIPIGLITTAVGGTPLNAWLS 190 Query: 131 GTYSERHGASHDACRWGTDTPLY----QDLVSRTRAALAKNPQNKFLGACWMQGEFD--- 183 + + Y Q+L + + K + G Q D Sbjct: 191 EESLTKFNSL-PPYYNALKNKEYLKEIQNLDKLYQDSYQKLCEETDEGL--HQSWQDPNF 247 Query: 184 -----LMTSDYASHPQHFN-HMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 S + + + + R+ L+ + + + + G T Sbjct: 248 DDRDWAEISLNETWIEKYTFPGILWLRKKLQISDEFIGKMGELRF--GTMTD 297 >UniRef50_C0A8R8 Sialate O-acetylesterase n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A8R8_9BACT Length = 554 Score = 61.5 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 43/346 (12%), Positives = 76/346 (21%), Gaps = 76/346 (21%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN A P + Sbjct: 149 EVWLASGQSNMEWRVSNTD---------------AGKFEMAFANNPLIRHYGTKKKVSST 193 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + +G + VG + A L + + I+ GG+ A + Sbjct: 194 PLATAEGKWERTTPETVGSFTAVG--YYFAADLYRAL--GVPVGIINSSWGGTRIEAWMD 249 Query: 131 GTYSERHGAS---HDACRWGTDTPLYQDLVSRTRAALAKNPQ------------------ 169 +++ RW Y +R ++ K Sbjct: 250 AASADKTKGPAFAEIHTRWDKTLADYPAAKTRYEQSVKKWEAARDAAKAAGKPFADRRPS 309 Query: 170 -------------------------NKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFR 204 G W QG + + M+ +R Sbjct: 310 APAGSSGHPAEPSGLYNGMIAPHVPYALRGFIWYQGCSNTGRHY--EYRDFQTAMITGWR 367 Query: 205 RDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGE 264 + Q D P++ Y + A Q LA Sbjct: 368 QQFAQ--------GDVPFYWAQLANYKGQGPDKLEYAFLRGAQTECLALPKTGQAVIIDI 419 Query: 265 RGLTNAPDEDPDDLSTGYYGSAYRSPENWTT-ALRSSHFSTAARRG 309 +T+ + D+ A ++ T+ F+ A R G Sbjct: 420 
GNVTDIHPRNKSDVGRRLALVALKNDYGKTSLVDSGPIFAAAVREG 465 >UniRef50_B1GS50 Maturation/adhesion protein n=1 Tax=Salmonella phage Vi II-E1 RepID=B1GS50_9CAUD Length = 796 Score = 61.5 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 38/282 (13%), Positives = 71/282 (25%), Gaps = 57/282 (20%) Query: 11 YVLTVA--GQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +A GQSNA+ +G P P + ++ + Sbjct: 170 KVYIIAITGQSNAVGANKGGPNPANDKI--------------------VIWDGVTGGWGS 209 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + P N L A +L+ + I+ GG Sbjct: 210 SDYTKPPLSRSTPNGNNGNNNIA-----LAFAHRLVDE-HKAEKVFIIYDAVGGRPIEDW 263 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNP-----QNKFLGACWMQGEFD 183 G ++ Y + S+ AAL + + + QGE + Sbjct: 264 ---------------MANGVNSERYAAIKSKIEAALVSQEIVATGKTEIDFLVFAQGEEN 308 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT--WYWKENFPHSYEA 241 +T + + + FR + P F + + + + Sbjct: 309 ALTDTVTDYQAKLTTLDKQFRAE-------SWMSDTTPMFIMGMSGLHMRYQVWQAQVDY 361 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYY 283 +N + N + Q + GYY Sbjct: 362 CENYNRNCIYVNSAGLKTQYDVDNTGDYTHWLGESLWEHGYY 403 >UniRef50_UPI00019275B4 PREDICTED: similar to cytosolic sialic acid 9-O-acetylesterase homolog, partial n=1 Tax=Hydra magnipapillata RepID=UPI00019275B4 Length = 351 Score = 61.2 bits (146), Expect = 5e-08, Method: Composition-based stats. 
Identities = 32/184 (17%), Positives = 58/184 (31%), Gaps = 18/184 (9%) Query: 110 NAGILIVPCCRGGSAFTAGS-EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNP 168 I ++ +GGS + S T + S + D L+ ++S Sbjct: 21 KVPIGLISSNQGGSCIESWSSPQTLKVCNATSKYPVTFNNDNVLWNAMIS-------PFL 73 Query: 169 QNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTT 228 + GA W QGE + + + F M+ +R++ NI P+ Sbjct: 74 KTTIYGAIWYQGEQNA--INPEGYNCTFPAMINGWRKEWSDGTGGETNI-KFPFGFVQLA 130 Query: 229 WYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYR 288 + P + + A +V +QQ A D D ++ Y R Sbjct: 131 SFNDGTTPG-----FPTLRWLQTAGYGYVPNKQQ--ENTFMAVAMDLADNNSPYGSIHPR 183 Query: 289 SPEN 292 + Sbjct: 184 DKAD 187 >UniRef50_Q7UHJ8 Sialic-acid O-acetylesterase n=1 Tax=Rhodopirellula baltica RepID=Q7UHJ8_RHOBA Length = 527 Score = 61.2 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 35/325 (10%), Positives = 64/325 (19%), Gaps = 83/325 (25%) Query: 1 MNAIISPDYYYV-----------------LTVAGQSNAMAYGEGLPLPDREDAPHPRIKQ 43 M+A P V +GQSN +P A I+ Sbjct: 102 MSASADPKTLQVSSPDSTVSFANVVVGEVWICSGQSNMQFSTAAVPEIQSLTATSENIRC 161 Query: 44 LARFAHTHPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKL 103 D + + P + A A L Sbjct: 162 FEVKRTVAMTQ---------------QDRLEGKWTEQPPNSA---------VAFSFAHFL 197 Query: 104 LPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSR---- 159 + I+ C G S+ A +E QD ++ Sbjct: 198 EQAGD--VPVGIILTCWGSSSIEAWMPREMTETVPHFQTMMEEFDADTATQDRIASILNG 255 Query: 160 -------------------TRAALAKNPQNKFLGACWMQGEFD-------------LMTS 187 A + G W QGE + S Sbjct: 256 KKPWSRTDDIFLRRQSNILYNAMIHPLVPYACRGLVWYQGERNTQSMFGMLKDPWFSRNS 315 Query: 188 DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQ 247 + ++ +R++ + G + ++ Sbjct: 316 GMLKYGDTLVEWIKRYRKEWGNEDMHFLIV----MLPGYFKPLPTGPQKGAEHPSTHSWA 371 Query: 248 NNVLANIIFVDFQQQGERGLTNAPD 272 + + +D + D Sbjct: 372 WMRESQLQSLDLPHTSVVNTIDLGD 396 >UniRef50_Q8A042 Polysaccharide deacetylase n=7 Tax=Bacteroides RepID=Q8A042_BACTN Length = 541 Score = 61.2 bits (146), Expect = 5e-08, 
Method: Composition-based stats. Identities = 31/207 (14%), Positives = 62/207 (29%), Gaps = 24/207 (11%) Query: 115 IVPCCRGGSAFTA----------GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAAL 164 ++ GG+A A S + L ++ + L Sbjct: 62 VIKWAIGGTAIAAPVTTPFRGTYWSADPKWLAENTATSEKGKSLLLSLIANIDASIDQTL 121 Query: 165 AKNPQN-KFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWF 223 +K Q + W QGE D + Q+ +V R L + + + ++ P+ Sbjct: 122 SKLKQGYQIDAFVWHQGESD--YEHGKEYYQNLKGVVSYVRNHLTEKTGK--DYSELPFI 177 Query: 224 CGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQ-QGERGLTNAPDEDPDDLSTGY 282 G + K E + + + A +I + + G++ N + + Sbjct: 178 FGTVSRKNKRYNSDVEEGMRRYAKEDKNAYLIDMSEAELLGDKLHFNQVS--AESMGKQV 235 Query: 283 YGSAYRSPENWTTALRSSHFSTAARRG 309 Y + T H A +G Sbjct: 236 Y------EQIKKTLSDDPHVYVAKYKG 256 >UniRef50_C9PT41 Sialic acid-specific 9-O-acetylesterase n=3 Tax=Prevotella RepID=C9PT41_9BACT Length = 481 Score = 61.2 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 25/227 (11%), Positives = 53/227 (23%), Gaps = 29/227 (12%) Query: 11 YVLTVAGQSNAMA--YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V AGQSN G + IP Sbjct: 113 EVWVCAGQSNMEMPVRGFNECPVEGYQEA-----------VWASANMRGVRYVKIPARMS 161 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + D + + VG A++L + + +V +GG+ + Sbjct: 162 SVPLDDTPCQWVDVTPKTVSDCSAVG--FFFAQRLAQVL--QMPVGLVLANKGGTMVESW 217 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRT--RAALAKNPQNKFLGACWMQGEFDLMT 186 +H + L G + QG ++ Sbjct: 218 LNVDNLRKHTDEPTDSTLIARKYPTEWLRPLLWGNGTFHPIVNYGVRGVLYYQGCANV-D 276 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 + ++ + + + +R+ + P + + E Sbjct: 277 HNPTTYGERLKLLAKQWRKHF---------SDNMPMLLVEIAPHCYE 314 >UniRef50_D2Q6U1 Sialic acid-specific 9-O-acetylesterase n=6 Tax=Bifidobacterium RepID=D2Q6U1_9BIFI Length = 538 Score = 60.8 bits (145), Expect = 6e-08, Method: Composition-based stats. 
Identities = 41/286 (14%), Positives = 71/286 (24%), Gaps = 47/286 (16%) Query: 11 YVLTVAGQSNAM-AYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHF----NDIIPL 65 V AGQSN Y + D + HF +++ Sbjct: 140 DVFVAAGQSNMELNYTQYYSSGDDSNWNWGGGLISRGDLPKLLTDANVHFVVADHNVDNT 199 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 D G+ +TN Q A A +L N I I+ GG+A Sbjct: 200 DFPLIDANKTSGWLTADSTNSQHLSY---LAQQFAMQL-RAKRTNIPIGIIQTSWGGTAI 255 Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 + +G +Y + ++ + G W QG D Sbjct: 256 SRHVQGG------------------DIYANHIA-------PLTGFRVAGVLWYQGCNDAS 290 Query: 186 T-SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT----TWYWKENFPHSYE 240 T S + ++ +R+ + + P+ + + +N Sbjct: 291 TLSTSLDYESQMTALINQYRKVFDE--------STLPFLYVQLARWSGYQYTQNVRQGQL 342 Query: 241 AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSA 286 N AN+ + D L Sbjct: 343 RTLDNANLRNSANVAMTVSIDTDKGTSKVIHPLGKDILGARMAAQY 388 >UniRef50_UPI0001AEC524 hypothetical protein AmacA2_21053 n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC524 Length = 644 Score = 60.8 bits (145), Expect = 7e-08, Method: Composition-based stats. 
Identities = 26/248 (10%), Positives = 52/248 (20%), Gaps = 20/248 (8%) Query: 30 LPDREDAPHPRIKQLARFAHT----HPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLATN 85 L D E P + + ++ + + + Sbjct: 283 LTDAEQKPINGVHWFRKRFQWPHHLVGEQATLVLGALVDADTTYVNGTKVGQTTYKYPPR 342 Query: 86 HQT-QYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDAC 144 T + + L ++G + G+ T Sbjct: 343 RYTVPADVLKPGSNTITIRLESHRGHSGFVSDKEYWIGAGETRVDLNGEWHYKLGVVHEH 402 Query: 145 RWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFR 204 T Y+ + A L G W QGE + D ++ F M+E +R Sbjct: 403 LQQTTFIQYKPM-GLFNAMLNPLTSLPIKGVIWYQGESNTANPD--NYGDMFKTMIEDWR 459 Query: 205 RDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGE 264 P+ + P + + G Sbjct: 460 NKWNNP--------TLPFLFVQLANF----NPRLSTPAQSGWAELREQQASALSLPNTGM 507 Query: 265 RGLTNAPD 272 + + Sbjct: 508 AVAIDVGE 515 Score = 47.3 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 40/149 (26%), Gaps = 15/149 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 + +GQSN LPL R P I P +N P + Sbjct: 107 DLWLASGQSNM-----VLPLA-RIKEKFPDIVANESRPLIREFSVPMQYNFKQP--NIEV 158 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + + AR + + N + IV GG+ + Sbjct: 159 KDAQWKSADKLIERESFSAVSFF-----FARAIQDDV--NVPVGIVLSAVGGTPIQSWMS 211 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSR 159 ++ + + ++S+ Sbjct: 212 EDALASFPEDLKEAKYFQNDNVINSIISQ 240 >UniRef50_UPI0001924939 PREDICTED: similar to predicted protein, partial n=7 Tax=Hydra magnipapillata RepID=UPI0001924939 Length = 1059 Score = 60.4 bits (144), Expect = 7e-08, Method: Composition-based stats. 
Identities = 49/292 (16%), Positives = 81/292 (27%), Gaps = 40/292 (13%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAP---HPRIKQLARFAHTHPGGPPCHFNDII 63 P Y VL VAGQSN YG H RI Q A + PC + Sbjct: 707 PRKYKVLIVAGQSN-TYYGREWGHSAAITTVNYMHRRIVQFAMDNGQNDVLLPCSLRLEV 765 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 + + + T L I + ++I+PC G Sbjct: 766 NESDNMCYAGYGSILAGLIMQDADTSNT------------LNIIHSDESLMILPCMLGAR 813 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 F+ Y +G +++L R R L+ +F G W QGE D Sbjct: 814 GFS----DDYFMPYGTG------------FRNLTRRIRYILSHY-NAEFCGMTWAQGEND 856 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITD-------APWFCGDTTWYWKENFP 236 + ++ + ++ R + +Q +T+ P+ W Sbjct: 857 SIDGWNNNYRHILVNFIQTIRDYIGYSSNQFYTLTNSQAKSNTIPFLTFQMRPGWVSANS 916 Query: 237 HSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYR 288 + ++D + Y GS + Sbjct: 917 ARATPVQNALSTIGEYIPYAASISITNLPPSMRITNDDTVHYDSRYKGSEWA 968 >UniRef50_A3HTJ4 Putative sialate O-acetylesterase n=1 Tax=Algoriphagus sp. PR1 RepID=A3HTJ4_9SPHI Length = 635 Score = 60.4 bits (144), Expect = 8e-08, Method: Composition-based stats. Identities = 16/158 (10%), Positives = 38/158 (24%), Gaps = 24/158 (15%) Query: 120 RGGSAFTA---GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGAC 176 +G + + + T +Y ++ G Sbjct: 368 QGSNKISLEGNWAYSTNIVEPQLPKVEKVNWKPGMMYNSMI-------NPLVSYPIKGVI 420 Query: 177 WMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 W QGE + + + F+ M+ +R D P+ + + P Sbjct: 421 WYQGESNAGR--ADEYEELFSAMITNWRSKWGL--------GDIPFLFVQLANFMERKSP 470 Query: 237 HSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDED 274 ++ A + G + + + Sbjct: 471 ----QPESDWAYLREAQSKTLSLPNTGMAVIIDIGEAG 504 >UniRef50_UPI0001924174 PREDICTED: similar to predicted protein n=3 Tax=Hydra magnipapillata RepID=UPI0001924174 Length = 1210 Score = 59.2 bits (141), Expect = 2e-07, Method: Composition-based stats. 
Identities = 56/265 (21%), Positives = 84/265 (31%), Gaps = 44/265 (16%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAP---HPRIKQLARFAHTHPGGPPCHFNDII 63 P Y VL VAGQSN YG D A H RI Q PC + Sbjct: 748 PHKYKVLIVAGQSN-TYYGREWGNSDAITAVNYMHRRIVQFGMDNGQTDSLLPC----SL 802 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 L D GY LA I L I + ++I+PC GG Sbjct: 803 RLDVNESDSMCYAGYGSILAGLIM--------QDAIGSNTLNMINSDECLMILPCMLGGM 854 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 F+ Y +G +++L R R L + +F G W QGE D Sbjct: 855 GFS----NDYFMPYGTG------------FKNLTRRIRYIL-THYNAEFCGVTWSQGETD 897 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITD-------APWFCGDTTWYWKENFP 236 + ++ + ++ R + +Q + + P+ W Sbjct: 898 SVPGWCDNYRYILMNFIQTIRDYVGYSSNQFATLANSQVKSNTIPFITFQMNPTWV---- 953 Query: 237 HSYEAIYGNYQNNVLANIIFVDFQQ 261 + A QN + + ++ F Sbjct: 954 AANSAFASPVQNALTSIGDYIPFTS 978 >UniRef50_Q21HS7 Glycoside hydrolase family 2, sugar binding n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21HS7_SACD2 Length = 661 Score = 58.8 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 30/110 (27%), Gaps = 14/110 (12%) Query: 153 YQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHS 212 Y + A LA + G W QGE ++ + +++++R Q Sbjct: 418 YTAPLGYYNAMLAPLLTTQIKGVVWYQGESNVGR--AEEYKSLMATLIKSWRAGFNQP-- 473 Query: 213 QLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQ 262 + P+ + ++ A V+ Sbjct: 474 ------ELPFIVVQLANFL----EAQAAPSESHWAELREAQRQIVNSTNN 513 Score = 44.6 bits (103), Expect = 0.005, Method: Composition-based stats. 
Identities = 25/155 (16%), Positives = 42/155 (27%), Gaps = 16/155 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN D P R+K P +P + + Sbjct: 110 DVWLTSGQSNM-------------DLPMRRVKH-TYPLDVAEAHLPQVRVFTVPSQYDFN 155 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 V + LA N + A+ L + I I+ GG+ Sbjct: 156 QVHNDVSGGQWLAVNPNNIEQFSAVSYFFAKALHQA--KHIPIGIINSSMGGTTAEGWLS 213 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALA 165 + + + D L++ + A A Sbjct: 214 EESLQAFPTHYQKAKQYQDKHYLAQLINSDKKAQA 248 >UniRef50_C3QEP5 Sialic acid-specific 9-O-acetylesterase n=2 Tax=Bacteroides RepID=C3QEP5_9BACE Length = 633 Score = 58.8 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 43/173 (24%), Gaps = 34/173 (19%) Query: 146 WGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRR 205 + LY ++ K G W QGE + + ++ +R Sbjct: 397 KSAGSGLYNGMI-------YPIKDYKIKGTIWYQGETNAGHP--QGYATLLESLITNWRE 447 Query: 206 DLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGER 265 + + P+ + K+ + G + A + Sbjct: 448 LWEMP--------EMPFLLVQLPNFMKK----QMQPSDGGWARLREAQ----------LQ 485 Query: 266 GLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGIISDRFVEA 318 N P L+ Y + + AR+ ++ V + Sbjct: 486 IAMNVP---HTTLAVTYDVGEWNDIHPLNKKAVAHRLFLGARKVAYGEKLVSS 535 >UniRef50_C6LL91 Putative ExsB n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LL91_9FIRM Length = 872 Score = 58.8 bits (140), Expect = 2e-07, Method: Composition-based stats. 
Identities = 30/251 (11%), Positives = 54/251 (21%), Gaps = 38/251 (15%) Query: 24 YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLA 83 YG G P I + PG +P Sbjct: 494 YGNGRVTP---GKALDIIWEDVCSDDWMPGQKEA---GAMPGRTDAGTASQADASGRKAQ 547 Query: 84 TNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDA 143 + + D+A P GG+ + A Sbjct: 548 PEDGRKLRQTTSEAEPGAAEGADLADSA-----PVALGGT----WEYRILAACGPAPEQV 598 Query: 144 CRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAF 203 T T L+ +V+ W QGE + S + Q +++ + Sbjct: 599 FLNRTPTGLFNGMVA-------PCQPYTVSAVVWYQGESN--DSHPGDYAQLLAGLIQDW 649 Query: 204 RRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQG 263 RR D P+ + P + A + Sbjct: 650 RRGF--------RREDLPFVVVQLPNCGVDIAPG------DAWPLIREAQLSAERLPYVA 695 Query: 264 ERGLTNAPDED 274 + +++ Sbjct: 696 VTVNLDIGEDN 706 >UniRef50_C6Y1K6 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1K6_PEDHD Length = 657 Score = 58.8 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 45/174 (25%), Gaps = 17/174 (9%) Query: 103 LLPFIPDNAGILIVPCCRGGSAFTA--GSEGTYSERHGASHDACRWGTDTPLYQDLVSRT 160 ++ + G+ S + A+ +P V Sbjct: 355 VIRVFDIGGKGGMYSGAIWGNPILLGKWSYQQDLKIDAATFPKPHVVNVSPFSTPAV-LY 413 Query: 161 RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDA 220 +A Q G W QGE + + Q F M+ +R+ KQ D Sbjct: 414 NGNIAPVTQMAIKGFIWYQGESNAGR--AVEYRQLFPAMIRDWRKQFKQ--------GDV 463 Query: 221 PWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDED 274 P+ Y E E ++ A + G L + + Sbjct: 464 PFIFAQLANYMAEKP----EPGESDWAELREAQAKALALPNTGMAVLIDIGEAG 513 >UniRef50_B7AFZ4 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AFZ4_9BACE Length = 623 Score = 58.1 bits (138), Expect = 4e-07, Method: Composition-based stats. 
Identities = 26/197 (13%), Positives = 57/197 (28%), Gaps = 37/197 (18%) Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 ++ + G + + G+ LY ++ + GA W QGE + Sbjct: 372 NQNEVMKYAGKLKNLKKAGSG--LYNGMI-------HPISNYQVKGAIWYQGESNSGR-- 420 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQN 248 ++ +++ +R K D P+ Y +++ + + Sbjct: 421 SQTYASLLEALIQNWRELWKMP--------DMPFLLVQLPNYMEKSD----KPSDSGWAR 468 Query: 249 NVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARR 308 A + N P L+ Y + + ARR Sbjct: 469 IREAQF----------KTALNVP---HTALAVTYDVGEWNDIHPLNKKAVAQRLFLGARR 515 Query: 309 GIISDRFVEAILQFWRE 325 + ++ V A ++E Sbjct: 516 LVYGEK-VTASGPLYKE 531 >UniRef50_UPI0001924DD2 PREDICTED: similar to predicted protein, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001924DD2 Length = 1230 Score = 57.7 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 73/246 (29%), Gaps = 40/246 (16%) Query: 7 PDYYYVLTVAGQSNAMAYGEGLPLPDREDAP---HPRIKQLARFAHTHPGGPPCHFNDII 63 P Y VL VAGQSN YG H RI Q A + PC + Sbjct: 979 PRKYKVLIVAGQSN-TYYGREWGHSAAITTVNYMHRRIVQFAMDNGQNDVFLPCSLRLEV 1037 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 + + + T L I + ++I+PC G Sbjct: 1038 NESDNMCYAGYGSILAGLIMQDADTSNT------------LNIIHSDESLMILPCMLGAR 1085 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 F+ Y +G +++L R R L+ +F G W QGE D Sbjct: 1086 GFS----DDYFMPYGTG------------FRNLTRRIRYILSHY-NAEFCGMTWAQGEND 1128 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITD-------APWFCGDTTWYWKENFP 236 + ++ + ++ R + +Q +T+ P+ W Sbjct: 1129 SIDGWNNNYRHILVNFIQTIRDYIGYSSNQFYTLTNSQAKSNTIPFLTFQMRPGWVSANS 1188 Query: 237 HSYEAI 242 + Sbjct: 1189 ARATPV 1194 >UniRef50_A2RKE5 Sialic acid-specific 9-O-acetylesterase n=2 Tax=Lactococcus lactis subsp. cremoris RepID=A2RKE5_LACLM Length = 452 Score = 57.7 bits (137), Expect = 6e-07, Method: Composition-based stats. 
Identities = 26/207 (12%), Positives = 50/207 (24%), Gaps = 30/207 (14%) Query: 73 QDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCC--RGGSAFTA--- 127 G + + + L IA +L + P G Sbjct: 135 GKKVGSTDYKYPPRNYKISKLTKNLTIAIRLKVYNAPGGITSSKPHILLVGEKYLDLNHG 194 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTS 187 S T LY +++ + KF+ W QGE D Sbjct: 195 WKIRRSSTLPERHKAYFINYEPTGLYNGMIA-------PLQKLKFVAILWYQGESDAGQP 247 Query: 188 DYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQ 247 ++ F ++E++R KQ + P+ E ++ Sbjct: 248 --KTYGTRFRELIESWRILFKQPN--------LPFLYVQLPNCETEKEA--------DWA 289 Query: 248 NNVLANIIFVDFQQQGERGLTNAPDED 274 + + ++D Sbjct: 290 GLREEQKEALKISRTAMVVTIGDGEDD 316 >UniRef50_C4XH28 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=C4XH28_DESMR Length = 337 Score = 57.7 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 46/322 (14%), Positives = 75/322 (23%), Gaps = 88/322 (27%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 A ++ D V V GQSNA + ++ P + A Sbjct: 92 AALAKDRTMVALVFGQSNAS-----NTVDPGYESVQP-VYAFA-DGVCTKARDALPGATG 144 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 + P + A ++ RGG Sbjct: 145 TKGSSWPRLGDKLIAGGFYDA-----------------------------VIFANIARGG 175 Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 S+ G L L S +A LA + QGE Sbjct: 176 SSILEWGPGGRHNAV--------------LLASLDSLAKAGLAPTH------VLFHQGEA 215 Query: 183 DLMTSDY-ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 D + +++ R + + ++ + A Sbjct: 216 DCALGVAGPDYADMLAAVIDQIRSKVGPAPDV--------VVARTSQYFDLVCGDAANPA 267 Query: 242 IYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSH 301 + + A D Q+ G D D L ++ H Sbjct: 268 CFKTCPALIQAQTAAADPQRHVLSGP------DTDRL------------VPYSDRNDGYH 309 Query: 302 FSTAARRGIISDRFVEAILQFW 323 F+ A +DRF EA L Sbjct: 310 FTAQA-----ADRFAEAWLPLL 326 >UniRef50_UPI0001923B19 PREDICTED: similar to predicted protein n=3 Tax=Hydra magnipapillata 
RepID=UPI0001923B19 Length = 605 Score = 57.7 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 73/246 (29%), Gaps = 40/246 (16%) Query: 7 PDYYYVLTVAGQSNAM-AY--GEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDII 63 P Y VL VAGQSN G + + H RI Q A + PC + Sbjct: 354 PHKYKVLIVAGQSNTYYGREWGNSNAIT-TVNYMHRRIVQFAMDNTQNDMLLPCSLRLEV 412 Query: 64 PLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGS 123 + + + T L I + ++I+PC G Sbjct: 413 NESDNMCYAGYGSILAGLIMQDADTSNT------------LNIINSDESLMILPCMLGAR 460 Query: 124 AFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD 183 F+ Y +G +++L R R L+ +F G W QGE D Sbjct: 461 GFS----DDYFMPYGTG------------FRNLTRRIRYILSHY-NAEFCGMTWAQGEND 503 Query: 184 LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITD-------APWFCGDTTWYWKENFP 236 + S+ + ++ R + +Q T+ P+ W Sbjct: 504 SIDGWNNSYRYILLNFIQTIRDYIGYSSNQFYTFTNSQAKSNTIPFLTFQMRPGWVSANS 563 Query: 237 HSYEAI 242 + Sbjct: 564 ARATPV 569 >UniRef50_C3QLH3 Sialic acid-specific 9-O-acetylesterase n=11 Tax=Bacteria RepID=C3QLH3_9BACE Length = 549 Score = 57.3 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 33/102 (32%), Gaps = 12/102 (11%) Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 EG + R GA A T V A +A G W QGE ++ + Sbjct: 396 EGNWKYRLGAPMPAAPGQTAFHY--KPVGLYNAMIAPLLNYTVSGIIWYQGESNVSRRN- 452 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 + M+ +R+ + D P++ + + Sbjct: 453 -EYKDLLTAMIADWRQHWNRP--------DMPFYVIELADFL 485 Score = 52.7 bits (124), Expect = 2e-05, Method: Composition-based stats. 
Identities = 32/225 (14%), Positives = 55/225 (24%), Gaps = 30/225 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN LP+ D I + + P +N P T P Sbjct: 105 DVWVCSGQSNME-----LPVSRVTDRFRDEISADSDYPMVRYIKTPLLYNFHAPQTDIP- 158 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 P + + + + I+ GGS A Sbjct: 159 --GIFWKAMTPENVMSFSALVYFFTKDYFQKT-------KVPVGIINSSVGGSPVEAWIS 209 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVS------RT-RAALAKNPQNKFLGACWM-QGEF 182 + + R L + + R AL + + W G Sbjct: 210 EEGLKPFPYYLNEKRIYESDDLVESMKKEESKKSRAWNVALYQGDKGMHETIPWYAAGYD 269 Query: 183 DLMTSD----YASHPQHFNHMVEA---FRRDLKQYHSQLNNITDA 220 D + + + + + FR+D + Q Sbjct: 270 DSDWTPTDLFASGWATNGLNTINGSHWFRKDFQVSGQQAGEKATL 314 >UniRef50_C6VWZ0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VWZ0_DYAFD Length = 680 Score = 57.3 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 44/312 (14%), Positives = 86/312 (27%), Gaps = 53/312 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V ++GQSN+ + RI L P+ Sbjct: 143 DVYILSGQSNSTGF-FAERDTSMFCRTFGRI--------------------TDNLNTTPY 181 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGI--LIVPCCRGGSAFTAG 128 + D +N VG I ++ + + +G+ ++ Sbjct: 182 NAADTLW----ALSNMHPYNNGVG---SIGFEIQKQLMEKSGVPNCLINAGY------HW 228 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 S YS ++ + T Y ++ R + A + + QGE + Sbjct: 229 SS-AYSHSLRTENNPTDFTTG---YGRMLYRAQKAGVAH---AVKAYIFRQGESESYGEG 281 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW---FCGDTTWYWKENFPHSYEAIYGN 245 + HF+ + + + D D + G ++ P IY + Sbjct: 282 -GNWEGHFDVLYKNLKTDFAGLEKLYVYQIDIIYHSSLIGVLIRDYQRRLP----DIYPD 336 Query: 246 YQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTA 305 N D G + E + +YGS + + + F Sbjct: 337 IDNLATVGTTGFDGLHYNRDGNKQSAFELSRLMLRDFYGS--KDTTDIASPNLRKAFYNN 394 Query: 306 ARRGIISDRFVE 317 A + ++ F E Sbjct: 395 AEKNSLTLVFDE 406 >UniRef50_B8I0P3 Putative 
uncharacterized protein n=5 Tax=Bacteria RepID=B8I0P3_CLOCE Length = 635 Score = 56.9 bits (135), Expect = 9e-07, Method: Composition-based stats. Identities = 14/146 (9%), Positives = 35/146 (23%), Gaps = 21/146 (14%) Query: 127 AGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMT 186 ++ A LY +++ G W QGE + Sbjct: 381 EWQYMIGAKSSPMPAPAFVQWRPLGLYNGMIA-------PVTSYSIKGFIWYQGESNTKN 433 Query: 187 SDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNY 246 + ++ +R+ N+ + P+ + S + ++ Sbjct: 434 PG--EYENLLKALIADWRQKW--------NMGNLPFLYVQLPNFM----EASEIPMESSW 479 Query: 247 QNNVLANIIFVDFQQQGERGLTNAPD 272 A + G + + Sbjct: 480 AELREAQRRTLSVPCTGMAVAIDLGE 505 Score = 48.4 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 67/235 (28%), Gaps = 35/235 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN D +P + + P ++ P T Sbjct: 96 DVWLCSGQSNME------MKMDSVKDTYPDEIVHSCNDYIRHFLVPVKYDFEKPQTDLE- 148 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + + P + T G A KL N I ++ GGS A Sbjct: 149 -AGIWEAAN-PESILDFTATGYF-----FALKLFEKY--NIPIGLINASLGGSPAEAWLS 199 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRA-------ALAKNPQN-KFLGACWMQGEF 182 +++ + ++ ++ + AL +N + K + E+ Sbjct: 200 ENALREFPEHYESAKQLSNRDYLDKVLREDQESAEAWYTALNQNDEGLKSNDIPFHDTEY 259 Query: 183 DL----MTSDYASHPQ----HFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 D + +FN +V FR+++ + + G+ Sbjct: 260 DAPFWQNIKVPSYWEDEGVGNFNGVV-WFRKEIDIPSTLADKPARL--VLGNIVD 311 >UniRef50_UPI0001744488 sialic acid-specific 9-O-acetylesterase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744488 Length = 566 Score = 56.9 bits (135), Expect = 9e-07, Method: Composition-based stats. 
Identities = 36/318 (11%), Positives = 76/318 (23%), Gaps = 85/318 (26%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDRE--DAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN + P++E A HP+I+ + ++ Sbjct: 102 EVWVCSGQSNMEWTVDRAVNPEQEIPAANHPKIRLFSVPK---------------AVSDV 146 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA---- 124 P + + + + VG R+L + + ++ GG+ Sbjct: 147 PLKDIEKRPSWQVCSPETVKSFSAVG--YFFGRELNKALD--VPVGLINTSWGGTRAEAW 202 Query: 125 ---------------FTAGS---------------------------EGTYSERHGASHD 142 TA + + Sbjct: 203 TSKPALEAVPTCAAIITAWDDFLKTYNAQEARAKHEAAAAATKEKIEKIKAENAKPGATQ 262 Query: 143 ACRWGTDTPLYQDLVSRTRAALAKNPQ------NKFLGACWMQGEFDLMTSDYASHPQHF 196 G P ++ R A+ N GA W QGE + + Sbjct: 263 QPVPGAPRPFENQQTTQHRPAVLFNGMVAPIVPYTVKGAIWYQGESN--QPRAVQYQTLM 320 Query: 197 NHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANI-I 255 ++ +R+ S ++ + + A + Sbjct: 321 PTLIADWRKQWGDELS---------FYIVQLAGFGNGRTWPQPVGAADTWAELQWAQLQT 371 Query: 256 FVDFQQQGERGLTNAPDE 273 + ++ G + +E Sbjct: 372 ALRVKKSGLAVANDIGEE 389 >UniRef50_B2IIR7 Putative uncharacterized protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IIR7_BEII9 Length = 289 Score = 56.9 bits (135), Expect = 1e-06, Method: Composition-based stats. 
Identities = 25/192 (13%), Positives = 46/192 (23%), Gaps = 58/192 (30%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHD 71 V+ V GQSN + P + L+ Sbjct: 72 VILVIGQSNGANFAYSYSKAQD------------------PDAAAFYNGRCYALSDPLPG 113 Query: 72 VQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEG 131 +G P A + ++ GG++ SE Sbjct: 114 GDGGRGSQWPA----------------FADLFRAKF--GRSVTLISAGWGGTSVRDWSEN 155 Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYAS 191 + +S+ + L + K W QGE D +D + Sbjct: 156 GFDAY-------------------ALSQAKLVLDA--KGKIDAIFWQQGESDPN-TDAET 193 Query: 192 HPQHFNHMVEAF 203 + +++ F Sbjct: 194 YAARLQVVLDRF 205 >UniRef50_UPI0001968E9D hypothetical protein BACCELL_02606 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968E9D Length = 640 Score = 56.5 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 50/183 (27%), Gaps = 23/183 (12%) Query: 132 TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYAS 191 + + Y L+ A LA + G W QGE + D Sbjct: 378 GDWKYRVSQIIETTETLGPNSYPSLL--YNAMLAPLTKFPIKGTIWYQGESNSG--DAYR 433 Query: 192 HPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGN---YQN 248 + F +M+ +R+ K + +YW + A + Sbjct: 434 YRSLFPNMIRDWRKQWKY---------------AEMPFYWVQLANFMKPAEEPGQSDWAE 478 Query: 249 NVLANIIFVDFQQQGERGLTNAPD-EDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAAR 307 + + ++ GE + + ED + G T + FS Sbjct: 479 LRESQHLTLELPYTGEALAIDIGETEDIHPRNKQDVGHRLALNALAKTYGKEVEFSGPEY 538 Query: 308 RGI 310 + + Sbjct: 539 QSM 541 >UniRef50_B8DVE1 Sialic acid-specific 9-O-acetylesterase n=4 Tax=Bifidobacterium animalis subsp. lactis RepID=B8DVE1_BIFA0 Length = 565 Score = 56.5 bits (134), Expect = 1e-06, Method: Composition-based stats. 
Identities = 28/283 (9%), Positives = 56/283 (19%), Gaps = 76/283 (26%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V GQSN P P + + Sbjct: 121 EVWLAGGQSNIEFELHNSEF---------------GKEAVADAHDPLLRFYNTPKSARVN 165 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + +G + ++L + + + +V C GG++ +A Sbjct: 166 LATESASGWQTAEAPQVAHMSAIG--YYFGKQLRDALANGIAVGVVDCYIGGTSISAWMS 223 Query: 131 GTYSERHGASHD--------------------ACRWGTDTPLYQDLVSRTRAALAKNPQN 170 E+ G W + V+ + Q Sbjct: 224 EHLLEQTGLGRKYLQAYRDAIAGKTDEEMLAAQTAWQQVFDKWNADVAAVKEEHPDYSQP 283 Query: 171 KFLGA-------------------------------------CWMQGEFDLMTSDYASHP 193 + W QGE D + A + Sbjct: 284 QIDAMLGPCPWPPPVTPFAERRPYALYEAMIRRVAPYTLAGILWYQGEED--ELNAAGYG 341 Query: 194 QHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFP 236 + ++ +R + W + +P Sbjct: 342 ELLRGLIAEWRTTWHDDALPFLVVQLPQWIAAANAEHDPLRWP 384 >UniRef50_B9XAZ4 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XAZ4_9BACT Length = 663 Score = 55.8 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 48/216 (22%), Gaps = 37/216 (17%) Query: 94 GQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTP-- 151 G + + P G + W + P Sbjct: 367 GPGGLTGKPAQMHVAPRKDAGATPISLAG----QWQMHDSAPLAKLPAVPQIWNRNNPNV 422 Query: 152 ---LYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLK 208 LY ++S GA W QGE + + + M++ +R + Sbjct: 423 STVLYNGMIS-------PLLPFGIKGAIWYQGESNAGR--AEQYRKLLPAMIQDWRNRFE 473 Query: 209 QYHSQLNNITDAPWFCGDTTWY-------WKENFPHSYEAIYGNYQNNVLANIIFVDFQQ 261 + P++ + + EA +N A + Sbjct: 474 V--------GEFPFYIVQLAAFTRTAAEPRNNEWAELREAQALTAKNVPHAGLAVAIDIG 525 Query: 262 QGERGLTNAPDEDPDDLS----TGYYGSAYRSPENW 293 + L+ YG S W Sbjct: 526 DAADIHPKDKLDVGRRLALCALADTYGKKIESSGPW 561 Score = 49.6 bits (116), Expect = 2e-04, Method: Composition-based stats. 
Identities = 31/219 (14%), Positives = 50/219 (22%), Gaps = 25/219 (11%) Query: 11 YVLTVAGQSNA-MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V +GQSN M G D A P+I+ L +P Sbjct: 112 DVWICSGQSNMEMGIGACNVTNDIAQANFPQIRLLTVPRLITTKPAQTLECQWLPCNPAN 171 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 G+ R+L + I ++ GG+ A + Sbjct: 172 VMKGQWAGFSAA--------------GFFFGRELQQEL--KIPIGLIHTSWGGTVAEAWT 215 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFL-GACWMQGEFDLMTSD 188 + G + ++ N W +G+ D Sbjct: 216 SQEGLKPLGDFNAQLEQVNKVNTLNKDEDPSKELEKWYAHNDPGTAQHWEKGDADYSNWK 275 Query: 189 YASHPQHFNHM-------VEAFRRDLKQYHSQLNNITDA 220 AS PQ + + FR Sbjct: 276 SASMPQAWEQAGLPNYDGIVWFRHTFNLPDDWTGKDLTL 314 >UniRef50_A6KWF2 Sialic acid-specific 9-O-acetylesterase n=8 Tax=Bacteroides RepID=A6KWF2_BACV8 Length = 645 Score = 55.8 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 56/201 (27%), Gaps = 36/201 (17%) Query: 11 YVLTVAGQSNA---MAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTH 67 V +GQSN + +P I+ + P + P Sbjct: 101 DVWLCSGQSNMELTAGRVTDKFAEEIARDENPMIRYV---------KIPLGNDLHGPKDD 151 Query: 68 CPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 P D A + A A+++ + IV GGS+ A Sbjct: 152 LP--GADWMSLTKETAPSFSA------LAYFFAKEMYRET--QVPVGIVNSSWGGSSVEA 201 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRT--------RAALAKNPQNKFLGACWMQ 179 ++ R ++ Y++L +R+ AL K + G CW + Sbjct: 202 WMSEEALQKFP-RQLHERDLFNSDEYRELCNRSGQMMNRFWDTALYKGDRGLHDGICWNR 260 Query: 180 GEFD-----LMTSDYASHPQH 195 E D + + Sbjct: 261 PELDDTDWQTVDMFSKEWGRK 281 Score = 55.4 bits (131), Expect = 3e-06, Method: Composition-based stats. 
Identities = 27/189 (14%), Positives = 50/189 (26%), Gaps = 34/189 (17%) Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 + E ++ T +Y ++S R F GA W QGE + Sbjct: 385 SRWKYQLGCEMPARTNSVSFQNVPTGMYNSMISPLRNL-------AFTGALWYQGETNTG 437 Query: 186 TSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGN 245 + + + M+ +R L + P+F + K E + N Sbjct: 438 RPN--EYEELLAAMIIDWREKLAD--------KELPFFIVQLANFMK----THSEPVESN 483 Query: 246 YQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTA 305 + A Q + A D + + S Sbjct: 484 WAALREAQR------QVALKVPNAALAVAIDL-------GEWNDIHPLNKKELARRISLL 530 Query: 306 ARRGIISDR 314 +R + D+ Sbjct: 531 VKRRVYGDK 539 >UniRef50_D2U9U1 Putative hydrolase protein n=1 Tax=Xanthomonas albilineans RepID=D2U9U1_XANAL Length = 641 Score = 55.4 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 44/160 (27%), Gaps = 4/160 (2%) Query: 160 TRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITD 219 A + G W QGE + + + F M+ +R + Q + Sbjct: 410 YNAMIHPLQPFPVRGVIWYQGESNATALGALRYREQFAAMIRHWREERGQPQLPFLWVQL 469 Query: 220 APWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLS 279 A + G T + E+ A ++ +D G+ + D Sbjct: 470 ANFRAGADTDDLS-PWALLRESQSKALALPATAQVVTIDIGTPGDIHPPDKQDVGHRLAL 528 Query: 280 TGYYGSAYRSPENWTTALRSSHFSTAARR---GIISDRFV 316 + + + L +HF R ++ Sbjct: 529 AARHVAYGETLIYTAPVLADAHFDHGQARLRFDLLGSALA 568 Score = 45.7 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 23/121 (19%), Positives = 32/121 (26%), Gaps = 20/121 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P + + +P T P Sbjct: 110 DVWLASGQSNME---------------MPLVMARDGAREVAAATDSQLRDFKVPHTWSPQ 154 Query: 71 -DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGS 129 + G H + V A AR+L I I I+ GSA A Sbjct: 155 PQPRLTGGTWQATTPEHAHAFSAV--AYFFARELREQI--GVPIGIIDSTWSGSAIEAWM 210 Query: 130 E 130 + Sbjct: 211 D 211 >UniRef50_C3QG04 Sialic acid-specific 9-O-acetylesterase n=4 Tax=Bacteroides RepID=C3QG04_9BACE Length = 318 Score = 55.4 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 36/128 (28%), Gaps = 18/128 (14%) Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 S+ ++ ++ G + + + L G + QGE Sbjct: 36 SSLAEYNQLNAEQKKYIDKPVEPMGRKN--FHRPIGLSETMLNTVIPYTLKGFLFYQGES 93 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDT------TWYWKENFP 236 + + A + + F M+ +R Q D P+ T YW E Sbjct: 94 NT--ARGAQYRKLFPAMINEWRTAWGQ--------GDIPFLFIQLPRFETKTRYWYELRE 143 Query: 237 HSYEAIYG 244 Y + Sbjct: 144 AQYLTSHH 151 >UniRef50_A8A1H8 Putative uncharacterized protein n=1 Tax=Escherichia coli HS RepID=A8A1H8_ECOHS Length = 664 Score = 54.6 bits (129), Expect = 5e-06, Method: Composition-based stats. 
Identities = 41/270 (15%), Positives = 70/270 (25%), Gaps = 50/270 (18%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 ++ AGQSNA Y PD + P + ++ L+ Sbjct: 270 FIFVTAGQSNARGY-----CPDADQT--------------IVAATPIYPDNAFMLSGGVR 310 Query: 71 DVQDMQGYHHPL---ATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTA 127 PL + + G A R + L + C + G A+ Sbjct: 311 RTGTRSTTLVPLVEAVSGTDKETAASGLANTFIRDMAAATGIMPRTLSIVCAQSGQAYEY 370 Query: 128 GSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGAC----WMQGEFD 183 G YQ L+ + +L WMQGE D Sbjct: 371 QKRGNQV------------------YQYLLDSIEDCVTACKARGWLPIVLCVDWMQGESD 412 Query: 184 LMTSDYAS--HPQHFNHMVEAFRRDL----KQYHSQLNNITDAPWFCGDTTWYWKENFPH 237 S + D+ Q + IT + + + Sbjct: 413 EDWSGLREGMYESRMRQYQRQITSDIIARTGQNEPPIIAITQLGYVNDGHGAFTGQYARL 472 Query: 238 SYEAIYGNYQNNVLANIIFVDFQQQGERGL 267 + ++G Q ++ ++ DF G Sbjct: 473 ASTRLHGKEQFRLVNSLYQYDFISDGLHLT 502 >UniRef50_Q0FSF8 Putative hemagglutinin-related protein n=3 Tax=Roseovarius sp. HTCC2601 RepID=Q0FSF8_9RHOB Length = 624 Score = 54.2 bits (128), Expect = 7e-06, Method: Composition-based stats. 
Identities = 42/252 (16%), Positives = 70/252 (27%), Gaps = 43/252 (17%) Query: 11 YVLTVAGQSNAMA-----YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPL 65 V+ V G SN+ +G+ +P RE A PRI + P P Sbjct: 356 DVIIVGGDSNSANATSERFGDEIPTSARETAFDPRIWYM----------PCLRATGNYPT 405 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 T V + + + +A L+ + L+V Sbjct: 406 TDSVRHVPQ-----PCIEPVAAVEARRMSPVHAVAGALVGWSAARGRPLLV--------M 452 Query: 126 TAGSEG-TYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFD- 183 G G + + T + ++ +L++ A A P ++ +G W QG D Sbjct: 453 ALGDPGSGFMNTEDWRKSSAVATTGSRMWSELLAMKAALDALGPAHEIVGMVWSQGANDL 512 Query: 184 --LMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEA 241 S V R D+ D P ++ E P+ Sbjct: 513 FGGDYSVSGQWMDQMRQFVSDLRSDI----------ADVPMVMWSVGQHY-EPAPYDGRG 561 Query: 242 IYGNYQNNVLAN 253 L Sbjct: 562 AAMRAAQLRLDQ 573 >UniRef50_C0AED8 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AED8_9BACT Length = 703 Score = 53.8 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 38/148 (25%), Gaps = 20/148 (13%) Query: 141 HDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMV 200 R T LY +++ G W QGE D S + ++ Sbjct: 420 QPPERRKTPAFLYNGMIA-------PLVPYALRGVIWYQGESDT--SRAWQYRTALPLLI 470 Query: 201 EAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQ 260 + +RR D P+ Y A+ Y A + Sbjct: 471 DDWRRQWD--------ANDFPFLICQLANYG---AKTQNPAVPAPYAELREAQTLATRTL 519 Query: 261 QQGERGLTNAPDEDPDDLSTGYYGSAYR 288 + + E+ D +A R Sbjct: 520 PHTSQAVLIDLGEELDIHYRDKRPAAER 547 Score = 45.4 bits (105), Expect = 0.003, Method: Composition-based stats. 
Identities = 18/118 (15%), Positives = 31/118 (26%), Gaps = 17/118 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN L DA HF P Sbjct: 112 EVWLASGQSNME-----FALKRTTDATQTIATSANARLR--------HFTVPKKTAAEPA 158 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 ++G + + +G H +R+L + + ++ GG+ Sbjct: 159 PSDTLEGRWQIASPDTIADLTAIG--YHFSRELQQQLD--VPVGLIHASWGGTPIAPW 212 >UniRef50_UPI0001925DCB PREDICTED: similar to predicted protein n=2 Tax=Hydra magnipapillata RepID=UPI0001925DCB Length = 760 Score = 53.8 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 72/241 (29%), Gaps = 40/241 (16%) Query: 12 VLTVAGQSNAMAYGEGLPLPDREDAP---HPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 VL VAGQSN YG H RI Q A + PC + + Sbjct: 425 VLIVAGQSN-TYYGREWGHSAAITNVNYMHRRIVQFAMNNGQNDVLLPCSLRLEVNESDN 483 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + + T L I + ++I+PC GG F+ Sbjct: 484 MCYAGYGSILAGLIMQDATTSNT------------LNIINSDESLMILPCMLGGKGFS-- 529 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 Y +G +++L R R L+ +F G W QGE D + Sbjct: 530 --DDYFMPYGTG------------FKNLTRRIRYILSHY-NAEFCGITWSQGETDSIAGW 574 Query: 189 YASHPQHFNHMVEAFRRDLKQYHSQLNNITD-------APWFCGDTTWYWKENFPHSYEA 241 ++ + ++ R + +Q +T+ P+ W Sbjct: 575 NDNYSHILVNFIQTIRDYIGYSSNQFYTLTNSQAKSNTIPFVTFQMQPEWVAANSTMATP 634 Query: 242 I 242 + Sbjct: 635 V 635 >UniRef50_C5BNV0 ExsB n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BNV0_TERTT Length = 654 Score = 53.8 bits (127), Expect = 9e-06, Method: Composition-based stats. 
Identities = 20/114 (17%), Positives = 36/114 (31%), Gaps = 13/114 (11%) Query: 120 RGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQ 179 G + +G + A+ + + + A LA G W Q Sbjct: 387 VGDTKIDL--KGEWK-YKVANVIEPPKPKRFIPWNEPLGCYNAMLAPLFNMNIKGVIWYQ 443 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKE 233 GE + + F H+++++RRD Q D P+ Y + Sbjct: 444 GESNTGNP--QEYATLFPHLIKSWRRDWNQ--------GDFPFLFVQLANYMSD 487 >UniRef50_C6IKL7 Sialic acid-specific 9-O-acetylesterase n=9 Tax=Bacteroides RepID=C6IKL7_9BACE Length = 662 Score = 53.5 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 28/113 (24%), Gaps = 15/113 (13%) Query: 161 RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDA 220 A L GA W QGE + + M+ +R D Sbjct: 428 NAMLNPLIPYAIKGAIWYQGESNAGE--AFQYRDLMPLMITDWRNRWGY---------DF 476 Query: 221 PWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE 273 P++ + + + A + Q G + +E Sbjct: 477 PFYMVQLASF----TAKQTAPVESTWAELREAQTRTLHLQNTGMAVAIDIGEE 525 Score = 50.0 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 27/231 (11%), Positives = 55/231 (23%), Gaps = 41/231 (17%) Query: 11 YVLTVAGQSNAM----AYGEGLPLPDREDAPH--PRIKQLARFAHTHPGGPPCHFNDIIP 64 V +GQSN +G+ ++ + P I+ L P Sbjct: 105 EVWICSGQSNMEMQVEGWGKVKNYEQEKEEANNYPNIRFLLVENAMSPTPVENITVKENG 164 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 C R L + N I ++ GG+ Sbjct: 165 WQVCTSKSVADFSA----------------AGYFFGRDLNKY--RNVPIGLIDTSWGGTI 206 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPL--------YQDLVSRTRAALAKNPQNKFLG-A 175 + + P +++ V +A + + + G A Sbjct: 207 IETWTSNEALSTIPSMKKRLEALVGLPASQEGRKKKFEEDVETWKAEVERIDKGCVNGEA 266 Query: 176 CWMQGEFDLMTSDYASHPQ-HFNHMVEAFRRDLKQYHSQLNNITDAPWFCG 225 W + A+ +++ +DL + + G Sbjct: 267 IWA-----APDFNDAAWKSMKVPGLMQE--QDLPGFSGLVWFRKTIDIPAG 310 >UniRef50_UPI0001BC87BE hypothetical protein BacD2_14083 n=1 Tax=Bacteroides sp. 
D2 RepID=UPI0001BC87BE Length = 640 Score = 53.5 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 60/189 (31%), Gaps = 30/189 (15%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN + + ++++A + FN I + H P Sbjct: 105 EVWICSGQSNMEFR---------LRSANHAVEEIAAANYLQIR----SFNVIQEMRHTPK 151 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 + +++G + + + VG AR+L + N I + GG+ Sbjct: 152 N--NLKGKWEVCSPTSASDFSAVG--YFFARELYQKL--NIPIGFINSSWGGTDIETWIS 205 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF------DL 184 + + +P +++ + R+ + Q GE D Sbjct: 206 MEVMDHFPKYEKSLS-RMRSPEFEEYIKRSDKVKTEFEQ----AILNEPGETEKWYSEDT 260 Query: 185 MTSDYASHP 193 T ++ H Sbjct: 261 STENWKKHA 269 Score = 46.9 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 13/102 (12%), Positives = 31/102 (30%), Gaps = 15/102 (14%) Query: 172 FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 G W QGE + + F +++ +R + P++ + Sbjct: 421 MKGVIWYQGENNAGR--ANEYIDLFPALIKDWRNRWD---------CEFPFYWVQLANFM 469 Query: 232 KENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE 273 P + ++ N A + G+ + + +E Sbjct: 470 ---APAR-QPSESHWANLRDAQSKTLALPYTGQAVIIDIGEE 507 >UniRef50_Q15XN0 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XN0_PSEA6 Length = 635 Score = 53.5 bits (126), Expect = 1e-05, Method: Composition-based stats. 
Identities = 17/139 (12%), Positives = 34/139 (24%), Gaps = 21/139 (15%) Query: 134 SERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHP 193 W LY ++ G W QGE ++ A + Sbjct: 390 KAEDPLPMPPMLWQKPGVLYNAMI-------HPLIGFPLKGVIWYQGESNVGQ--AAQYA 440 Query: 194 QHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLAN 253 F +++++R Q + P+ + ++ A Sbjct: 441 SLFTSLMKSWRERWGQ--------KELPFVYAQLA----NIHGIKEQPAPSDWALLREAQ 488 Query: 254 IIFVDFQQQGERGLTNAPD 272 + L +A D Sbjct: 489 TQALALPHTAMAVLMDAGD 507 >UniRef50_D1NUY7 Sugar binding domain protein, glycosyl hydrolase family 2 n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NUY7_9BIFI Length = 835 Score = 53.5 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 29/123 (23%), Gaps = 10/123 (8%) Query: 157 VSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNN 216 ++ A LA W QGE + D + M++ +R + Sbjct: 550 MALFNAMLAPCFDYAVRAVLWYQGESNTGP-DSKWYQAMLEAMIKLWRANWNVDR----- 603 Query: 217 ITDAPWFCGDTTWYWKE-NFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDP 275 P+F E + + + A + N Sbjct: 604 ---LPFFIVQLPELLTECADDGGWPTVREAQWRIMDAYTGQPLYSLDACDEYGNEGGAGD 660 Query: 276 DDL 278 D Sbjct: 661 DVP 663 >UniRef50_UPI0001C35E1E hypothetical protein ChatD1_22041 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35E1E Length = 642 Score = 52.7 bits (124), Expect = 2e-05, Method: Composition-based stats. 
Identities = 29/170 (17%), Positives = 43/170 (25%), Gaps = 28/170 (16%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIP--LTHC 68 V +AGQSN P R+K G P +P Sbjct: 94 DVWLLAGQSNMQ-------------LPMERVKY-RYPEEYREGASPLIRQFAVPICWNFD 139 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + G A + + VG A+KL N + +V GG+ A Sbjct: 140 SAQAELSGGEWKTAAAEYTPVFSAVG--FFFAKKLYERY--NVPVGLVLTAVGGTPVQAW 195 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWM 178 S + LV R + A + + Sbjct: 196 M----SREALREFPDELEKAEALRAPGLVKRIQRA----DEERIAAWWRH 237 Score = 47.7 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 29/92 (31%), Gaps = 7/92 (7%) Query: 128 GSEGTYSERHGASH----DACRWGTDTPLYQDL-VSRTRAALAKNPQNKFLGACWMQGEF 182 +G S+ G + ++ + + +A A G CW QGE Sbjct: 367 WEDGMESDISGGWEYRRGAVMEPLEEQTFFERMPLGMYQAMTAPLHDFPVRGICWYQGEM 426 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQL 214 + +P +F+ + +R + Sbjct: 427 NADEPGA--YPGYFSRLTADWREKWNNPRLPV 456 >UniRef50_A1A0H7 Putative secreted protein n=5 Tax=Bifidobacterium RepID=A1A0H7_BIFAA Length = 659 Score = 52.7 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 30/108 (27%), Gaps = 17/108 (15%) Query: 123 SAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEF 182 S + ++R D RW T LY + LA W QGE Sbjct: 389 SGIWQYAVTAEADRDCPFEDFVRW-KPTGLYNAM-------LAPCFPYAVRAVLWYQGES 440 Query: 183 DLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 + + M++ +R Q D P+ + Sbjct: 441 NTGDR-AMQYGDELKAMIQLWRVKWHQP--------DMPFLIVQLPKF 479 >UniRef50_A6AXJ1 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Vibrio parahaemolyticus AQ3810 RepID=A6AXJ1_VIBPA Length = 516 Score = 52.7 bits (124), Expect = 2e-05, Method: Composition-based stats. 
Identities = 26/206 (12%), Positives = 45/206 (21%), Gaps = 38/206 (18%) Query: 25 GEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDVQDMQGYHHPLAT 84 G R + + + PL Sbjct: 184 GCNWGGTPACAWVEERYLRNTPAQVWLDEYDAKLKGIDLEEDKQLYLNHPNSDTSQPLKV 243 Query: 85 NHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDAC 144 + L A + + + + G H Sbjct: 244 VEGLAGKIMYPGLSEAEQAEMVADAAS-----------------DQNQKAPLSGGPHHQY 286 Query: 145 RWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFR 204 R G LY+++VSR A K G W QGE D + F+ +++ +R Sbjct: 287 RPGG---LYKNMVSRITA-------FKVRGVIWYQGESDS--PHSDVYFSTFSQLIQCWR 334 Query: 205 RDLKQYHSQLNNITDAPWFCGDTTWY 230 P+ + Sbjct: 335 EAWG---------ECLPFLFVQLAPF 351 >UniRef50_C3PXV2 Sialic acid-specific 9-O-acetylesterase n=6 Tax=Bacteroides RepID=C3PXV2_9BACE Length = 651 Score = 52.7 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 33/114 (28%), Gaps = 15/114 (13%) Query: 161 RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDA 220 A + G W QGE ++ + F ++ +RR +D Sbjct: 426 NAMVKPWTAFPIKGVIWYQGEANVGR--SEQYGDLFPALITDWRRQW---------RSDF 474 Query: 221 PWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDED 274 P++ + E+ ++ + A + Q G + D Sbjct: 475 PFYFVQLANF-MESKKIQPDS---EWAALREAQTKALKLDQVGMAVTIDIGLAD 524 Score = 45.7 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 33/238 (13%), Positives = 65/238 (27%), Gaps = 53/238 (22%) Query: 11 YVLTVAGQSNA----MAYGEGLPLPDREDAPH-PRIKQLARFAHTHPGGPPCHFNDIIPL 65 V +GQSN +G+ + + P I+ + +IPL Sbjct: 105 EVWFCSGQSNMEMPVAGWGKVMNYEQEITEANYPSIRLFQVKKN----------TSVIPL 154 Query: 66 THCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAF 125 + + Q + A AR L + N + ++ C GG+ Sbjct: 155 SDMESTMGGWQECSSATIPEFSS------LAYFYARSLWKEL--NVPVGVIDCTWGGTPA 206 Query: 126 TAGSEGTYSERHGASHDACRW---------------GTDTPLYQDLVSRTRAALAKNPQN 170 A + ++ + + +Q L S+ + ++ Sbjct: 207 EAWTSYETLKQVLGFREELAKMEQLDFDPIRMEKAYNQERSEWQSLFSKEDKGMEEDKPC 266 Query: 171 KFL-----GACWMQGEFDLMTSDYASHPQH-FNHM--VEAFRRDLKQYHSQLNNITDA 220 G W D+ DY ++ + V FRR + + Sbjct: 267 WIAPDLSEGQWW-----DMCLPDY--WEKNGLKNFDGVVWFRRSFEIPAEWIGKPLKL 317 >UniRef50_B0MQZ6 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0MQZ6_9FIRM Length = 508 Score = 52.3 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 28/87 (32%), Gaps = 17/87 (19%) Query: 150 TPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQ 209 T LY +V R G + QGE D ++ + F +++ +R D Sbjct: 272 TALYDSMVKRV-------CPYTIKGVIYYQGETDDNRP--KTYFKLFKALIQLWRDDWGD 322 Query: 210 YHSQLNNITDAPWFCGDTTWYWKENFP 236 + P+ + + P Sbjct: 323 D--------ELPFMFVQLPMHRYKADP 341 >UniRef50_C0BIJ4 Putative uncharacterized protein n=1 Tax=Flavobacteria bacterium MS024-2A RepID=C0BIJ4_9BACT Length = 691 Score = 51.9 bits (122), Expect = 3e-05, Method: Composition-based stats. 
Identities = 19/108 (17%), Positives = 33/108 (30%), Gaps = 14/108 (12%) Query: 172 FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 GA W QGE ++ ++ + + F M+ +R + P++ Y Sbjct: 483 IKGALWYQGESNVG--NFQEYQELFTGMIGDWRERWGY---------EFPFYFAQIAPYT 531 Query: 232 KENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERG---LTNAPDEDPD 276 S+E ++ N V GE N D Sbjct: 532 YTEADRSHELREAQRKSLTTPNTGMVITLDIGEEKDIHPANKQDVGLR 579 >UniRef50_D2KFQ9 Beta-galactosidase/beta-glucuronidase (Fragment) n=1 Tax=Cellulosilyticum ruminicola RepID=D2KFQ9_9FIRM Length = 645 Score = 51.5 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 59/235 (25%), Gaps = 34/235 (14%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 + GQSN LP+ + ++ P ++ PL Sbjct: 90 DLWLCGGQSNME-----LPISRVMERYKEEVETY-ENNKIRMFQVPMTYDFNGPLE---- 139 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 G L ++ + VG A+ L I ++ GG+ A Sbjct: 140 --NIRAGIWQRLNQSNAKNFTAVG--YFFAKALYEKY--QVPIGLIHTAIGGTPIEAWMS 193 Query: 131 GTYSERHGASHDACRWGTDTP-----LYQDLVS------RTRAA-LAKNPQNKFLGACW- 177 ++ A + D L D+ R L+ + G + Sbjct: 194 EEALKQFPAYIEQSAKYKDKAFIQKTLVDDITRINAWYNRLDEVDLSLKERELIKGRWYD 253 Query: 178 ---MQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTW 229 M+G D + +R+ ++ + G Sbjct: 254 AREMEGWKDCELPSFLVDENLVEAGSVWYRKVVQVPENMAGKAGRI--LLGTIVD 306 Score = 50.0 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 15/96 (15%) Query: 157 VSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNN 216 + +A KF G W QGE + D + F +++ +R+ L Sbjct: 399 MGVYNGMIAPLKNIKFKGVIWYQGESNGYYPD--DYKTLFEGLIKDWRKTL--------R 448 Query: 217 ITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLA 252 +D P+ N+ E Y N Sbjct: 449 SSDLPFLYVQLP-----NYATLSEDYYLQQANLTDE 479 >UniRef50_A6KYD0 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6KYD0_BACV8 Length = 651 Score = 51.5 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 33/114 (28%), Gaps = 15/114 (13%) Query: 161 RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDA 220 A + G W QGE ++ + F ++ +RR +D Sbjct: 426 NAMVKPWTAFPIKGVIWYQGEANVGRP--EQYGDLFPALITDWRRQW---------RSDF 474 Query: 221 PWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDED 274 P++ + E+ ++ + A + Q G + D Sbjct: 475 PFYFVQLANF-MESKEIQPDS---EWAALREAQTKALKLDQVGMAVTIDIGLAD 524 >UniRef50_C5SG53 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SG53_9CAUL Length = 649 Score = 51.5 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 43/171 (25%), Gaps = 29/171 (16%) Query: 142 DACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVE 201 LY +++ G W QGE + S+ + + + Sbjct: 414 PWLPTSGLATLYNGMIA-------PIAPYTLKGVAWYQGEANA--SNAREYSRLLPALFA 464 Query: 202 AFRRDLKQYHSQLNNITDAPWFCGDTTWYW---KENFPHSYEAIYGNYQ--NNVLANIIF 256 +R +Q + P + P + + + + AN Sbjct: 465 DWRTSFRQPN--------LPIVVVQLANFGPVVNGPGPSQWAELRESQRLSVKADANTAL 516 Query: 257 VDFQQQGERG---LTNAPDEDPD----DLSTGYYGSAYRSPENWTTALRSS 300 V G+R T Y + SPE + R + Sbjct: 517 VVSLDVGDRTDIHPTQKKVVGERVALGMRKAAYGEAVSLSPEPVSATRRGT 567 Score = 43.4 bits (100), Expect = 0.012, Method: Composition-based stats. 
Identities = 30/236 (12%), Positives = 57/236 (24%), Gaps = 38/236 (16%) Query: 11 YVLTVAGQSNAM--AYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN + ++ + RI+ + P G Sbjct: 108 DVFLCSGQSNMEFTTKYATNAYNEILNSANDRIRFVTVEKDAQPAG-------------- 153 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 P + V ++++ L I ++ GG+ + Sbjct: 154 PLSDLPKSPAWRAVGPETTGDSSAV--CYYMSKTLAAKT--GVPIGMIHSSWGGTMIQSW 209 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSD 188 + + + LY + K Q +D Sbjct: 210 LSQSTLRQLKSYDTGLETLK---LYARDPQAAQKGWQKVT----------QAWWDTHEPQ 256 Query: 189 YASHPQHFNHMVEAFRRDLKQYH-SQLNNITDAPWFCG--DTTWYWKENFPHSYEA 241 A Q + D K ++ + P G WY E + +A Sbjct: 257 AAEKRQWATTAYDD--GDWKTMDTTRFWEESGDPALAGFDGVVWYRTEVTLTAAQA 310 >UniRef50_A9KN08 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KN08_CLOPH Length = 639 Score = 51.1 bits (120), Expect = 5e-05, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 47/211 (22%), Gaps = 39/211 (18%) Query: 103 LLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRA 162 +P P G+ + S T LYQ ++ Sbjct: 359 FVPDKPYGIRYGSKFIDLSGT----WQFRIGATCETLSPQTFFNYMPTSLYQGMI----- 409 Query: 163 ALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 G + QGE + S + + F M++ +R N + P+ Sbjct: 410 --YPLRNYAIKGILFYQGESN--DSHPEKYEELFRAMIKDWRTLF--------NNQNLPF 457 Query: 223 FCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGY 282 + H N+ A + A D Sbjct: 458 LYVQLANFG----DHKRFNTGTNWAYLREAQRHIL-------NIPNTAMAVTLDI----- 501 Query: 283 YGSAYRSPENWTTALRSSHFSTAARRGIISD 313 Y + AAR + + Sbjct: 502 --GEYNDLHPQNKQAVGKRLALAARAILYGE 530 Score = 51.1 bits (120), Expect = 5e-05, Method: Composition-based stats. 
Identities = 20/153 (13%), Positives = 43/153 (28%), Gaps = 14/153 (9%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFND------IIP 64 V ++GQSN LP+ + + Q + CH N+ +P Sbjct: 82 DVYLLSGQSNM-----VLPISRTINK-NKLNWQADSLFINNNADEICHANEKHIRFFTVP 135 Query: 65 LTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA 124 + D L+ +T + A+++ + ++ GG+ Sbjct: 136 KQYQFDAPADDLEEGIWLSVTPETILPMSAVGYYFAKEIHDRYD--VPVGLIEAAVGGAP 193 Query: 125 FTAGSEGTYSERHGASHDACRWGTDTPLYQDLV 157 A + G + ++ Sbjct: 194 IEAFISEETLHQFGRYDSNIKQNKQKEYVDSVI 226 >UniRef50_C3QAB6 Sialic acid-specific 9-O-acetylesterase n=3 Tax=Bacteroides RepID=C3QAB6_9BACE Length = 639 Score = 51.1 bits (120), Expect = 5e-05, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 44/164 (26%), Gaps = 24/164 (14%) Query: 11 YVLTVAGQSNAMA--YGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHC 68 V +GQSN + A +P+I+ + Sbjct: 104 EVWICSGQSNMEFRLRSANHATEEVAAANYPQIRSFN-----------------VIQEMG 146 Query: 69 PHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 D++G + + + VG AR+L + N I + GG+ Sbjct: 147 HTPKTDLKGKWEVCSPASASDFSAVG--YFFARELYQKL--NIPIGFINSSWGGTDIETW 202 Query: 129 SEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKF 172 E + + +++ + + + Q Sbjct: 203 MSMEVIEHFPKYEKSLA-RMRSSEFEEYIKHSDKVKKEFEQAII 245 Score = 45.7 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 11/103 (10%), Positives = 32/103 (31%), Gaps = 15/103 (14%) Query: 172 FLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYW 231 G W QGE + + + F +++ +R + P++ + Sbjct: 420 MKGVIWYQGENNA--ARANEYIDLFPALIKDWRSRWN---------NEFPFYWVQLANFM 468 Query: 232 KENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDED 274 + + ++ N + G+ + + +E+ Sbjct: 469 ----SPAKQPSESHWANLRDTQSKTLALPHTGQAVIIDIGEEN 507 >UniRef50_Q07P27 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisA53 RepID=Q07P27_RHOP5 Length = 399 Score = 50.8 bits (119), Expect = 6e-05, Method: Composition-based stats. 
Identities = 47/327 (14%), Positives = 71/327 (21%), Gaps = 88/327 (26%) Query: 3 AIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDI 62 I VL AGQSN G PH F+ Sbjct: 139 EQIPRQNLAVLLTAGQSNISNTGAPDGGAKTLYQPHNVFYNFDP------------FDGK 186 Query: 63 IPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGG 122 + P G + + +G L + +LI P G Sbjct: 187 CYIAQNPALGTGGDGENVAV---------RLGDELIDRKIYQN-------VLIAPIAISG 230 Query: 123 SAFTAGSE--GTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQG 180 + G Y E ++ R P G W QG Sbjct: 231 TYLEEWRARGGKYFEVVLSALAGLREHGLEP---------------------TGILWHQG 269 Query: 181 EFDLMTSDYASHPQ----------HFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWY 230 EF+ + + + R L+ +AP F T Sbjct: 270 EFNALAFTANTAEDATQLTVTTPMREAARLSYIRNYLEIIAGLRAADANAPIFVATATR- 328 Query: 231 WKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSP 290 G + + + + G + P S G + + + Sbjct: 329 -----------CGGAQDEIIRSAQMSIPNPTLGIYAGPDTDLIGPSMRSDGCHMTHAGT- 376 Query: 291 ENWTTALRSSHFSTAARRGIISDRFVE 317 H A DR E Sbjct: 377 --------DQHARMWA------DRLSE 389 >UniRef50_D2QK09 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QK09_9SPHI Length = 652 Score = 50.8 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 31/112 (27%), Gaps = 14/112 (12%) Query: 161 RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDA 220 A L G W QGE + + + F M++ +R+ D Sbjct: 410 NAMLNPLIPYAIQGTIWYQGESNAGR--AYQYRKTFPLMIQDWRQHWGY---------DF 458 Query: 221 PWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPD 272 P+ + N + + A + + G ++ + Sbjct: 459 PFLFVQLASFNAANGDSRRGS---GWAELREAQTMTLQLPNTGMAVTSDIGE 507 Score = 49.6 bits (116), Expect = 1e-04, Method: Composition-based stats. 
Identities = 24/162 (14%), Positives = 44/162 (27%), Gaps = 21/162 (12%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P P ++ Sbjct: 105 EVWICSGQSNME---------------WPLAAAANAKTEIPLANYPNIRQLLVKKDLSLT 149 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 ++++G Q+ VG A++L + N I ++ GG+ + Sbjct: 150 PKENIEGSWSVCTPATAPQFTAVG--YFFAKQLQKEL--NVPIGLINTSWGGTHSETWTS 205 Query: 131 GTYSERHGASH--DACRWGTDTPLYQDLVSRTRAALAKNPQN 170 +H TD + + +RT+A L Sbjct: 206 REAMNQHDELKLVAGKLPATDEEVIKSGAARTQALLKTQQGG 247 >UniRef50_C6X165 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X165_FLAB3 Length = 637 Score = 50.8 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 16/119 (13%), Positives = 32/119 (26%), Gaps = 13/119 (10%) Query: 120 RGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQ 179 GG +G + + G + G ++ A + GA W Q Sbjct: 374 IGGQKTDL--KGEWKYKVGTKMNRSAPGQTFIRWK-PTGLYNAMIHPLINYNIKGALWYQ 430 Query: 180 GEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS 238 GE ++ + M+ +R +Q + P + + Sbjct: 431 GESNIGKPM--EYGDLLTTMITDWRLKFRQP--------ELPVVVVQLANFMEPKAQPQ 479 Score = 50.0 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 31/160 (19%), Positives = 45/160 (28%), Gaps = 22/160 (13%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V V+GQSN P R+K L P +P + Sbjct: 104 DVWLVSGQSNME-------------LPMYRVKPL-YEDELSSANNPNIRFFAVPQKYNFK 149 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG-S 129 + Q A N +T A A+ + + IV GGS A Sbjct: 150 NAQQNLDGGKWEAMNAKTISNFSAVAYFFAKNIHEKY--KIPVGIVNASLGGSPIQAWMD 207 Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQ 169 T + +A RW DL+ +T + Sbjct: 208 PVTLKKYPEYLAEAERWKN-----DDLIKQTETSERALSD 242 >UniRef50_B4D814 Sialate O-acetylesterase n=2 Tax=Chthoniobacter flavus Ellin428 RepID=B4D814_9BACT Length = 586 Score = 50.8 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 16/119 (13%), Positives = 35/119 (29%), Gaps = 7/119 (5%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAP-HPRIKQLARFAHTHPGGPPCHFNDIIPLTHCP 69 V +GQSN + + D + +P+I+ + + P DI Sbjct: 118 DVWICSGQSNMELGSKAVMSTDEFNKAGNPQIRLFSVPKYIAPAPA----RDITTAPQGF 173 Query: 70 HDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAG 128 + Q + G + ++ + + ++ C GG+ Sbjct: 174 PLLGTWQVCTPDTLSKTGEWSGFPAVGYYFGSEIQKYT--QQPVGLIGTCWGGTRINCW 230 Score = 45.7 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 14/122 (11%), Positives = 29/122 (23%), Gaps = 18/122 (14%) Query: 130 EGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLMTSDY 189 T L+ +++ GA W QGE + Sbjct: 314 PRAPIGPKEPRDPVHNNQTSAALFNGMIA-------PLIPFGIKGAVWYQGESNADNP-- 364 Query: 190 ASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNN 249 A + ++ +R Q + P+ + P E+ + + Sbjct: 365 AFYKIALPALINDWRTQWGQ--------GNFPFMIVQLPNFG-NPKPEPSESYWAGTREA 415 Query: 250 VL 251 Sbjct: 416 QA 417 >UniRef50_B3PDW1 Sialic acid-specific 9-O-acetylesterase n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PDW1_CELJU Length = 672 Score = 50.0 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 28/185 (15%), Positives = 49/185 (26%), Gaps = 36/185 (19%) Query: 108 PDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKN 167 +L+ G + +G + R A + + T YQ S A LA Sbjct: 369 DKRYALLLSDAATGEETVSL--QGQWHYRIAARMEPMKPSTTLH-YQPA-SLFNAKLAPA 424 Query: 168 PQNKFLGACWMQGEFDLMTSDYA--------------------SHPQHFNHMVEAFRRDL 207 G W QGE ++ +D + F ++ +RR Sbjct: 425 LPMAIKGVIWYQGESNVDRADAQGVHRQPGGQCAEPSCAVSTSEYRYLFADLIRDWRRQF 484 Query: 208 KQYHSQLNNITDAPWFCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGL 267 Q D P+ + P E + A ++ Sbjct: 485 NQ--------GDFPFLFVQLASFL----PARDEPTESKWAQLREAQRHTLELPNTAMAVA 532 Query: 268 TNAPD 272 +A + Sbjct: 533 IDAGE 537 Score = 46.1 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 40/152 (26%), Gaps = 16/152 (10%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V AGQSN +R +P + P + PL Sbjct: 104 DVWIAAGQSNME------QPLNRVRYRYPEVLANTEQPRIREFNVPVAYAFKGPLEDY-- 155 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 QG + VG A+ LL I I+ GGS A Sbjct: 156 ----TQGQWKSATPEQIAGFSAVG--FFFAQTLLEQT--RVPIGIISIPVGGSPAEAWMS 207 Query: 131 GTYSERHGASHDACRWGTDTPLYQDLVSRTRA 162 + + D Q +++ +A Sbjct: 208 EQALAAYPHYLKQLQPFKDDAHVQATIAQDKA 239 >UniRef50_C6XQG7 Putative uncharacterized protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XQG7_HIRBI Length = 671 Score = 49.6 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 31/101 (30%), Gaps = 13/101 (12%) Query: 126 TAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWMQGEFDLM 185 + ++ +VS KF A W QGE + Sbjct: 412 WQWQLIENGPSSMDRAPWETMNGLSGIHNGMVS-------PLDGLKFKAAFWYQGESNA- 463 Query: 186 TSDYASHPQHFNHMVEAFR----RDLKQYHSQLNNITDAPW 222 S+ + Q +++ +R DL QL N + P+ Sbjct: 464 -SNPEDYEQMLAGLIKGWRLMFTSDLPVVVIQLANYGNLPF 503 Score = 48.4 bits (113), Expect = 3e-04, Method: Composition-based stats. 
Identities = 26/211 (12%), Positives = 41/211 (19%), Gaps = 57/211 (27%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V +GQSN P + L P + + Sbjct: 128 DVYLCSGQSNME---------------FPVSRALNPNRE-IPAADGSNIRLFQMPMRSFN 171 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 Q + K L + I +V GG+ A Sbjct: 172 APQKSVTGDVSWEMASAKTVENFSAVCWFSAKELEKK-SDTPIGLVQAAWGGTRIEAWMN 230 Query: 131 GTYSERHG-----------------------ASHDACRWGTDTP----LYQDLVSRTRAA 163 G W + P +++ A Sbjct: 231 EASLAETGLVSDELEMLQNYRETPKAAISKFGERLNTWWDSTFPSASKPWENASQSIDQA 290 Query: 164 LAKNPQ-------------NKFLGACWMQGE 181 K P+ ++ G W Q E Sbjct: 291 WKKAPETMGNYQAWGIEEIEEYKGLIWFQTE 321 >UniRef50_C7PN71 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PN71_CHIPD Length = 637 Score = 49.6 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 20/137 (14%), Positives = 33/137 (24%), Gaps = 16/137 (11%) Query: 11 YVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPH 70 V AGQSN + + H P + IP T Sbjct: 106 DVWLCAGQSNMV----------HQMTLHSERYAF----DITNANFPEIRHFKIPQTTNLK 151 Query: 71 DVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSE 130 D + + A AR L + + ++ GG+ A + Sbjct: 152 APSDDLSAAYWKSATPTDVADFSAVAYFFARNLYQRY--HVPVGLINASVGGTPIEAWTS 209 Query: 131 GTYSERHGASHDACRWG 147 + + R Sbjct: 210 KEGLKDFPDIQTSIRNN 226 Score = 47.3 bits (110), Expect = 7e-04, Method: Composition-based stats. 
Identities = 11/110 (10%), Positives = 24/110 (21%), Gaps = 14/110 (12%) Query: 163 ALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPW 222 LA GA W QGE ++ + + ++ +R P Sbjct: 409 MLAPIIPYTIKGALWYQGESNMGNP--QEYSRLLPALINDWRNKWHNPA--------LPV 458 Query: 223 FCGDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPD 272 + Y ++ + + + Sbjct: 459 LFVQLPGFL----EVQYLPSESSWATLRESQRKSGSLPHTAMAIAIDLGE 504 >UniRef50_A6E6P0 Sialic acid-specific 9-O-acetylesterase n=3 Tax=Bacteroidetes RepID=A6E6P0_9SPHI Length = 652 Score = 49.6 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 42/155 (27%), Gaps = 15/155 (9%) Query: 119 CRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLGACWM 178 GG T E Y A R + A + Q GA W Sbjct: 376 RSGGEQMTLAGEWNYRVGLDLKTIASRPLAEDGP-NRPTVLFNAMINPFVQFNIRGAIWY 434 Query: 179 QGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHS 238 QGE + + F M++ +R+ NI D P++ + K Sbjct: 435 QGESNADR--AYQYRDLFPTMIKDWRKQW--------NIGDFPFYFVQLANFMK----AD 480 Query: 239 YEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDE 273 + + A + G + + D Sbjct: 481 EQPQESAWAELREAQSKTLTLPNTGMAVIIDIGDA 515 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.301 0.108 0.280 Lambda K H 0.267 0.0335 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,518,647,940 Number of Sequences: 3077464 Number of extensions: 53152686 Number of successful extensions: 139072 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 96 Number of HSP's successfully gapped in prelim test: 281 Number of HSP's that attempted gapping in prelim test: 138133 Number of HSP's gapped (non-prelim): 558 length of query: 326 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 197 effective length of database: 643,403,500 effective search space: 126750489500 effective search 
space used: 126750489500 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 92 (40.3 bits)
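The figures in the statistics footer above can be cross-checked against each other. The sketch below is a minimal Python check, not part of BLAST itself; the function and variable names are illustrative. It assumes the usual BLAST length correction (the effective HSP length is subtracted from the query and from every database sequence) and the standard bit-score conversion S' = (lambda * S - ln K) / ln 2. It also assumes that the first "Lambda K H" line holds the ungapped parameters (used for S1) and the second the gapped parameters (used for S2).

```python
import math

# Figures copied from the statistics footer of this report.
QUERY_LENGTH = 326
DB_LETTERS = 1_040_396_356
DB_SEQUENCES = 3_077_464
EFFECTIVE_HSP_LENGTH = 129

# Assumption: first Lambda/K pair is ungapped, second is gapped.
UNGAPPED = (0.301, 0.108)
GAPPED = (0.267, 0.0335)


def effective_search_space(query_len, db_letters, db_sequences, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_sequences * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query, eff_db, space)

s1_bits = bit_score(41, *UNGAPPED)  # report prints S1: 41 (21.0 bits)
s2_bits = bit_score(92, *GAPPED)    # report prints S2: 92 (40.3 bits)
print(round(s1_bits, 1), round(s2_bits, 1))
```

Under these assumptions the computed values agree with the report: an effective query length of 197, an effective database length of 643,403,500, and an effective search space of 126750489500.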