BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (111 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P63747 Ethanolamine utilization protein eutS n=127 Tax=... 223 1e-57 UniRef50_A1TZZ8 Microcompartments protein n=15 Tax=Bacteria RepI... 145 3e-34 UniRef50_P0A1D2 Propanediol utilization protein pduU n=93 Tax=Ba... 131 7e-30 UniRef50_A8MJE6 Microcompartments protein n=24 Tax=Bacteria RepI... 128 6e-29 UniRef50_B7LTX3 Putative propanediol utilization protein: polyhe... 128 7e-29 UniRef50_C4FNS4 Putative uncharacterized protein n=1 Tax=Veillon... 127 7e-29 UniRef50_A5I3N7 Ethanolamine utilization protein EutS homolog n=... 119 3e-26 UniRef50_C0F0N5 Putative uncharacterized protein n=1 Tax=Eubacte... 116 2e-25 UniRef50_C4G9K1 Putative uncharacterized protein n=1 Tax=Shuttle... 100 2e-20 UniRef50_A9KN55 Microcompartments protein n=1 Tax=Clostridium ph... 98 6e-20 UniRef50_A9KPD2 Microcompartments protein n=3 Tax=Firmicutes Rep... 93 3e-18 UniRef50_A6M0P4 Microcompartments protein n=6 Tax=Bacteria RepID... 91 1e-17 UniRef50_A5ZM05 Putative uncharacterized protein n=4 Tax=Bacteri... 84 1e-15 UniRef50_UPI0001693372 ethanolamine utilization protein EutS n=1... 71 1e-11 >UniRef50_P63747 Ethanolamine utilization protein eutS n=127 Tax=Bacteria RepID=EUTS_ECOL6 Length = 111 Score = 223 bits (569), Expect = 1e-57, Method: Compositional matrix adjust. Identities = 111/111 (100%), Positives = 111/111 (100%) Query: 1 MDKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLAL 60 MDKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLAL Sbjct: 1 MDKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLAL 60 Query: 61 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS Sbjct: 61 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 >UniRef50_A1TZZ8 Microcompartments protein n=15 Tax=Bacteria RepID=A1TZZ8_MARAV Length = 120 Score = 145 bits (366), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 72/108 (66%), Positives = 88/108 (81%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALKAA 63 ERIIQE+VPGKQVTLAHL+A+P E+L K+GV A AIGI+TLTPGETA+IAGD+A K A Sbjct: 13 ERIIQEYVPGKQVTLAHLLANPTEDLCHKVGVEHAEAIGILTLTPGETAIIAGDVATKFA 72 Query: 64 DVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 V IGFLDRFSGALV+ GS+G+VEEALS +S L L Y +C+ T++ Sbjct: 73 SVKIGFLDRFSGALVLTGSIGSVEEALSAILSTLESSLGYRVCQATRT 120 >UniRef50_P0A1D2 Propanediol utilization protein pduU n=93 Tax=Bacteria RepID=PDUU_SALTI Length = 116 Score = 131 bits (329), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 62/109 (56%), Positives = 85/109 (77%), Gaps = 1/109 (0%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDA-GAIGIMTLTPGETAMIAGDLALKA 62 +R+IQE+VPGKQVTLAHLIA+PG++L KK+G+ DA AIGI+T+TP E ++IA D+A K+ Sbjct: 8 DRMIQEYVPGKQVTLAHLIANPGKDLFKKLGLQDAVSAIGILTITPSEASIIACDIATKS 67 Query: 63 ADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 V IGFLDRF+GA+V+ G V AVE AL Q LG ++ +T C +T++ Sbjct: 68 GAVEIGFLDRFTGAVVLTGDVSAVEYALKQVTRTLGEMMQFTTCSITRT 116 >UniRef50_A8MJE6 Microcompartments protein n=24 Tax=Bacteria RepID=A8MJE6_ALKOO Length = 117 Score = 128 bits (321), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 61/108 (56%), Positives = 84/108 (77%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALKAA 63 ER+IQE VPGKQVT+AH+IA P E++ +++G+ + GAIGI+TL+P ETA+IA D+ KAA Sbjct: 10 ERVIQESVPGKQVTIAHVIASPMEDIYERLGIDNQGAIGILTLSPYETAIIAADIVTKAA 69 Query: 64 DVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 DV IGFLDRF+G+L+I G V +VE ALS + L LLN+T +T++ Sbjct: 70 DVEIGFLDRFTGSLIITGDVQSVETALSAANNTLRDLLNFTTAPITRT 117 >UniRef50_B7LTX3 Putative propanediol utilization protein: polyhedral bodies pduU n=2 Tax=Bacteria RepID=B7LTX3_ESCF3 Length = 124 Score = 128 bits (321), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 60/109 (55%), Positives = 84/109 (77%), Gaps = 1/109 (0%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDA-GAIGIMTLTPGETAMIAGDLALKA 62 ER+IQE+VPGKQVTLAHLIA+PG++L K+G+P+ A+GI+T+TP E ++IA D+A K+ Sbjct: 16 ERMIQEYVPGKQVTLAHLIANPGKDLFSKLGLPNTVSALGILTITPSEASIIACDIATKS 75 Query: 63 ADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 V IGFLDRF+G++V+ G V AVE AL Q LG +L +T C +T++ Sbjct: 76 GAVEIGFLDRFTGSVVLTGDVSAVEYALKQVTRTLGEMLRFTACPITRT 124 >UniRef50_C4FNS4 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FNS4_9FIRM Length = 133 Score = 127 bits (320), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 62/111 (55%), Positives = 85/111 (76%), Gaps = 2/111 (1%) Query: 3 KERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAG--AIGIMTLTPGETAMIAGDLAL 60 K+RIIQEFVPGKQVT+AH+IA+P +L +K+G+ + G AIGI+T+TPGE +IA D+A Sbjct: 23 KQRIIQEFVPGKQVTIAHVIANPKPDLFRKMGLEEKGRNAIGILTITPGEGTIIAADIAS 82 Query: 61 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 K+ D+ IGF+DRFSGAL+I G V AVE AL + GL R+L + ++T+S Sbjct: 83 KSGDIEIGFIDRFSGALLITGDVSAVESALRSVIEGLQRILGFFPTDLTRS 133 >UniRef50_A5I3N7 Ethanolamine utilization protein EutS homolog n=24 Tax=Clostridiales RepID=A5I3N7_CLOBH Length = 117 Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 56/108 (51%), Positives = 81/108 (75%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALKAA 63 +R+IQE VPGKQ+T+AH+IA P ++ ++G+ + GAIGI+TL+P ETA+IA D+A KA+ Sbjct: 10 QRVIQESVPGKQITIAHVIASPMPDIYDRLGIDEKGAIGILTLSPFETAIIAADIATKAS 69 Query: 64 DVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 DV IGFLDRF+G++VI G V +VE ALS L +L +T +T++ Sbjct: 70 DVEIGFLDRFTGSVVISGDVQSVETALSAVNDTLKDMLGFTTAPITRT 117 >UniRef50_C0F0N5 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F0N5_9FIRM Length = 125 Score = 116 bits (291), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 53/112 (47%), Positives = 76/112 (67%), Gaps = 2/112 (1%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDA--GAIGIMTLTPGETAMIAGDLA 59 DK RIIQE+VPGKQ+TLAH++ P + KK+G+PD AIGI+T+TP E +IA D+ Sbjct: 14 DKSRIIQEYVPGKQITLAHMVTRPKASVYKKLGLPDDYHDAIGILTITPSEATIIAADIC 73 Query: 60 LKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 KAA + +GF+DRFSG+++I GS VE AL+ ++ L + +TK+ Sbjct: 74 TKAASIKLGFVDRFSGSVIIVGSTSNVESALNSVITTFYNKLGFDTVSITKT 125 >UniRef50_C4G9K1 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9K1_9FIRM Length = 120 Score = 100 bits (249), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 48/115 (41%), Positives = 81/115 (70%), Gaps = 5/115 (4%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGV--PD---AGAIGIMTLTPGETAMIAG 56 +K+RIIQE VPGKQ++LAH++A+P L +K+G+ PD AIG+MT++P ETA+I Sbjct: 6 NKQRIIQELVPGKQISLAHIMANPDPILFEKLGISAPDEEGGSAIGVMTISPAETAVIIA 65 Query: 57 DLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 DLA+K++ V + +D G+L++ G+V VE +++ + R+L +++C +TK+ Sbjct: 66 DLAVKSSGVILELIDLEQGSLIVTGTVSQVESSIAAVMEYCSRMLGFSVCPVTKT 120 >UniRef50_A9KN55 Microcompartments protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KN55_CLOPH Length = 119 Score = 98.2 bits (243), Expect = 6e-20, Method: Compositional matrix adjust. Identities = 51/115 (44%), Positives = 74/115 (64%), Gaps = 5/115 (4%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAG-----AIGIMTLTPGETAMIAG 56 DK RI+QE VPGKQ++LAH+IA P L +K+ AIGI+T++P ET +I Sbjct: 5 DKLRIVQELVPGKQISLAHIIAAPDPLLMEKLCFQKENERMKAAIGILTMSPAETVIIGA 64 Query: 57 DLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 DLALKA+ V + +D SG LV G+V VE A++ V R L++T+C++T++ Sbjct: 65 DLALKASGVTLQNVDYASGTLVFTGTVSEVEAAMNAVVEYSNRTLSFTVCDITRT 119 >UniRef50_A9KPD2 Microcompartments protein n=3 Tax=Firmicutes RepID=A9KPD2_CLOPH Length = 113 Score = 92.8 bits (229), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 58/113 (51%), Positives = 83/113 (73%), Gaps = 2/113 (1%) Query: 1 MDKE--RIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDL 58 M++E R+I+E VPGKQVT+AH+IA P E+ + +G+ GAIGI+TL+P ET++IA D+ Sbjct: 1 MEEEMRRVIEESVPGKQVTIAHVIASPIREVYECLGIDQLGAIGILTLSPFETSIIAADI 60 Query: 59 ALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 A KAADV +GFLDRF+G+++I G V +VE AL+ L R L Y + +TK+ Sbjct: 61 AAKAADVEVGFLDRFTGSVIIAGDVDSVETALNAVCDILKRGLGYMVPVVTKT 113 >UniRef50_A6M0P4 Microcompartments protein n=6 Tax=Bacteria RepID=A6M0P4_CLOB8 Length = 126 Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 53/115 (46%), Positives = 78/115 (67%), Gaps = 5/115 (4%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVP-----DAGAIGIMTLTPGETAMIAG 56 D RIIQE VPGKQ+T+AH+IA P + KK+G+ + AIGI+T+TP ET++IA Sbjct: 12 DLMRIIQETVPGKQITMAHVIASPDSIIYKKLGLDPKIDYNRAAIGILTVTPSETSIIAA 71 Query: 57 DLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 D+A+KA+ + IGF+DRFSG L+I G++ V AL ++ L + +C +TK+ Sbjct: 72 DIAIKASAIEIGFIDRFSGTLIITGTISDVAIALESILNYAKNTLGFNVCNLTKA 126 >UniRef50_A5ZM05 Putative uncharacterized protein n=4 Tax=Bacteria RepID=A5ZM05_9FIRM Length = 143 Score = 84.0 bits (206), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 49/118 (41%), Positives = 72/118 (61%), Gaps = 9/118 (7%) Query: 3 KERIIQEFVPGKQVTLAHLI--AHPG--EELAKKIGVPDAG-----AIGIMTLTPGETAM 53 K R+ + VPGK + AH+I + P + LA IGV + +IGIM +TP E + Sbjct: 26 KLRLTRVRVPGKSLDFAHVITPSDPAVYQNLALNIGVHEGEDHTGESIGIMRVTPWEAIV 85 Query: 54 IAGDLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 +A D+A+KAADV +GF+DRFSGAL+I G + V A+ + V L++ +CE+ KS Sbjct: 86 VATDVAVKAADVQVGFMDRFSGALIILGGLSQVMTAVEEVVKFFRDELHFDVCEIHKS 143 >UniRef50_UPI0001693372 ethanolamine utilization protein EutS n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693372 Length = 68 Score = 70.9 bits (172), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 31/68 (45%), Positives = 50/68 (73%) Query: 44 MTLTPGETAMIAGDLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNY 103 MT+TP E ++I D+A KAA V IGF+DRFSG+LVI G V +V+ A+++ ++GL +L + Sbjct: 1 MTITPSEASIIGADIATKAAGVQIGFVDRFSGSLVITGDVSSVDSAVNEVLNGLQNILGF 60 Query: 104 TLCEMTKS 111 + +T++ Sbjct: 61 SAAAITRT 68 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P63747 Ethanolamine utilization protein eutS n=127 Tax=... 162 3e-39 UniRef50_B7LTX3 Putative propanediol utilization protein: polyhe... 158 6e-38 UniRef50_C4FNS4 Putative uncharacterized protein n=1 Tax=Veillon... 156 1e-37 UniRef50_A5I3N7 Ethanolamine utilization protein EutS homolog n=... 156 2e-37 UniRef50_A8MJE6 Microcompartments protein n=24 Tax=Bacteria RepI... 155 4e-37 UniRef50_P0A1D2 Propanediol utilization protein pduU n=93 Tax=Ba... 155 5e-37 UniRef50_A1TZZ8 Microcompartments protein n=15 Tax=Bacteria RepI... 152 3e-36 UniRef50_C0F0N5 Putative uncharacterized protein n=1 Tax=Eubacte... 148 4e-35 UniRef50_A9KN55 Microcompartments protein n=1 Tax=Clostridium ph... 141 9e-33 UniRef50_C4G9K1 Putative uncharacterized protein n=1 Tax=Shuttle... 140 9e-33 UniRef50_A5ZM05 Putative uncharacterized protein n=4 Tax=Bacteri... 134 8e-31 UniRef50_A6M0P4 Microcompartments protein n=6 Tax=Bacteria RepID... 128 8e-29 UniRef50_A9KPD2 Microcompartments protein n=3 Tax=Firmicutes Rep... 127 1e-28 UniRef50_UPI0001693372 ethanolamine utilization protein EutS n=1... 90 2e-17 Sequences not found previously or not previously below threshold: UniRef50_C0EV75 Putative uncharacterized protein n=1 Tax=Eubacte... 41 0.008 UniRef50_Q0TSP7 Ethanolamine utilization protein n=8 Tax=Clostri... 41 0.012 UniRef50_B8DC54 Microcompartment protein family n=20 Tax=Listeri... 40 0.027 UniRef50_B8FDM5 Microcompartments protein n=3 Tax=Deltaproteobac... 39 0.057 UniRef50_Q892D8 Putative ethanolamine utilization protein n=1 Ta... 39 0.059 UniRef50_C0ESY9 Putative uncharacterized protein n=1 Tax=Eubacte... 39 0.063 UniRef50_Q845B4 PduK protein n=1 Tax=Lactobacillus collinoides R... 38 0.079 CONVERGED! >UniRef50_P63747 Ethanolamine utilization protein eutS n=127 Tax=Bacteria RepID=EUTS_ECOL6 Length = 111 Score = 162 bits (411), Expect = 3e-39, Method: Composition-based stats. Identities = 111/111 (100%), Positives = 111/111 (100%) Query: 1 MDKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLAL 60 MDKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLAL Sbjct: 1 MDKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLAL 60 Query: 61 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS Sbjct: 61 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 >UniRef50_B7LTX3 Putative propanediol utilization protein: polyhedral bodies pduU n=2 Tax=Bacteria RepID=B7LTX3_ESCF3 Length = 124 Score = 158 bits (399), Expect = 6e-38, Method: Composition-based stats. Identities = 60/109 (55%), Positives = 84/109 (77%), Gaps = 1/109 (0%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDA-GAIGIMTLTPGETAMIAGDLALKA 62 ER+IQE+VPGKQVTLAHLIA+PG++L K+G+P+ A+GI+T+TP E ++IA D+A K+ Sbjct: 16 ERMIQEYVPGKQVTLAHLIANPGKDLFSKLGLPNTVSALGILTITPSEASIIACDIATKS 75 Query: 63 ADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 V IGFLDRF+G++V+ G V AVE AL Q LG +L +T C +T++ Sbjct: 76 GAVEIGFLDRFTGSVVLTGDVSAVEYALKQVTRTLGEMLRFTACPITRT 124 >UniRef50_C4FNS4 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FNS4_9FIRM Length = 133 Score = 156 bits (396), Expect = 1e-37, Method: Composition-based stats. Identities = 62/111 (55%), Positives = 85/111 (76%), Gaps = 2/111 (1%) Query: 3 KERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAG--AIGIMTLTPGETAMIAGDLAL 60 K+RIIQEFVPGKQVT+AH+IA+P +L +K+G+ + G AIGI+T+TPGE +IA D+A Sbjct: 23 KQRIIQEFVPGKQVTIAHVIANPKPDLFRKMGLEEKGRNAIGILTITPGEGTIIAADIAS 82 Query: 61 KAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 K+ D+ IGF+DRFSGAL+I G V AVE AL + GL R+L + ++T+S Sbjct: 83 KSGDIEIGFIDRFSGALLITGDVSAVESALRSVIEGLQRILGFFPTDLTRS 133 >UniRef50_A5I3N7 Ethanolamine utilization protein EutS homolog n=24 Tax=Clostridiales RepID=A5I3N7_CLOBH Length = 117 Score = 156 bits (396), Expect = 2e-37, Method: Composition-based stats. Identities = 56/110 (50%), Positives = 82/110 (74%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALK 61 + +R+IQE VPGKQ+T+AH+IA P ++ ++G+ + GAIGI+TL+P ETA+IA D+A K Sbjct: 8 NIQRVIQESVPGKQITIAHVIASPMPDIYDRLGIDEKGAIGILTLSPFETAIIAADIATK 67 Query: 62 AADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 A+DV IGFLDRF+G++VI G V +VE ALS L +L +T +T++ Sbjct: 68 ASDVEIGFLDRFTGSVVISGDVQSVETALSAVNDTLKDMLGFTTAPITRT 117 >UniRef50_A8MJE6 Microcompartments protein n=24 Tax=Bacteria RepID=A8MJE6_ALKOO Length = 117 Score = 155 bits (392), Expect = 4e-37, Method: Composition-based stats. Identities = 61/108 (56%), Positives = 84/108 (77%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALKAA 63 ER+IQE VPGKQVT+AH+IA P E++ +++G+ + GAIGI+TL+P ETA+IA D+ KAA Sbjct: 10 ERVIQESVPGKQVTIAHVIASPMEDIYERLGIDNQGAIGILTLSPYETAIIAADIVTKAA 69 Query: 64 DVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 DV IGFLDRF+G+L+I G V +VE ALS + L LLN+T +T++ Sbjct: 70 DVEIGFLDRFTGSLIITGDVQSVETALSAANNTLRDLLNFTTAPITRT 117 >UniRef50_P0A1D2 Propanediol utilization protein pduU n=93 Tax=Bacteria RepID=PDUU_SALTI Length = 116 Score = 155 bits (391), Expect = 5e-37, Method: Composition-based stats. Identities = 62/109 (56%), Positives = 85/109 (77%), Gaps = 1/109 (0%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDA-GAIGIMTLTPGETAMIAGDLALKA 62 +R+IQE+VPGKQVTLAHLIA+PG++L KK+G+ DA AIGI+T+TP E ++IA D+A K+ Sbjct: 8 DRMIQEYVPGKQVTLAHLIANPGKDLFKKLGLQDAVSAIGILTITPSEASIIACDIATKS 67 Query: 63 ADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 V IGFLDRF+GA+V+ G V AVE AL Q LG ++ +T C +T++ Sbjct: 68 GAVEIGFLDRFTGAVVLTGDVSAVEYALKQVTRTLGEMMQFTTCSITRT 116 >UniRef50_A1TZZ8 Microcompartments protein n=15 Tax=Bacteria RepID=A1TZZ8_MARAV Length = 120 Score = 152 bits (384), Expect = 3e-36, Method: Composition-based stats. Identities = 72/110 (65%), Positives = 89/110 (80%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALK 61 + ERIIQE+VPGKQVTLAHL+A+P E+L K+GV A AIGI+TLTPGETA+IAGD+A K Sbjct: 11 NIERIIQEYVPGKQVTLAHLLANPTEDLCHKVGVEHAEAIGILTLTPGETAIIAGDVATK 70 Query: 62 AADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 A V IGFLDRFSGALV+ GS+G+VEEALS +S L L Y +C+ T++ Sbjct: 71 FASVKIGFLDRFSGALVLTGSIGSVEEALSAILSTLESSLGYRVCQATRT 120 >UniRef50_C0F0N5 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F0N5_9FIRM Length = 125 Score = 148 bits (375), Expect = 4e-35, Method: Composition-based stats. Identities = 53/112 (47%), Positives = 76/112 (67%), Gaps = 2/112 (1%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDA--GAIGIMTLTPGETAMIAGDLA 59 DK RIIQE+VPGKQ+TLAH++ P + KK+G+PD AIGI+T+TP E +IA D+ Sbjct: 14 DKSRIIQEYVPGKQITLAHMVTRPKASVYKKLGLPDDYHDAIGILTITPSEATIIAADIC 73 Query: 60 LKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 KAA + +GF+DRFSG+++I GS VE AL+ ++ L + +TK+ Sbjct: 74 TKAASIKLGFVDRFSGSVIIVGSTSNVESALNSVITTFYNKLGFDTVSITKT 125 >UniRef50_A9KN55 Microcompartments protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KN55_CLOPH Length = 119 Score = 141 bits (355), Expect = 9e-33, Method: Composition-based stats. Identities = 51/115 (44%), Positives = 74/115 (64%), Gaps = 5/115 (4%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAG-----AIGIMTLTPGETAMIAG 56 DK RI+QE VPGKQ++LAH+IA P L +K+ AIGI+T++P ET +I Sbjct: 5 DKLRIVQELVPGKQISLAHIIAAPDPLLMEKLCFQKENERMKAAIGILTMSPAETVIIGA 64 Query: 57 DLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 DLALKA+ V + +D SG LV G+V VE A++ V R L++T+C++T++ Sbjct: 65 DLALKASGVTLQNVDYASGTLVFTGTVSEVEAAMNAVVEYSNRTLSFTVCDITRT 119 >UniRef50_C4G9K1 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9K1_9FIRM Length = 120 Score = 140 bits (354), Expect = 9e-33, Method: Composition-based stats. Identities = 46/115 (40%), Positives = 79/115 (68%), Gaps = 5/115 (4%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGV-----PDAGAIGIMTLTPGETAMIAG 56 +K+RIIQE VPGKQ++LAH++A+P L +K+G+ AIG+MT++P ETA+I Sbjct: 6 NKQRIIQELVPGKQISLAHIMANPDPILFEKLGISAPDEEGGSAIGVMTISPAETAVIIA 65 Query: 57 DLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 DLA+K++ V + +D G+L++ G+V VE +++ + R+L +++C +TK+ Sbjct: 66 DLAVKSSGVILELIDLEQGSLIVTGTVSQVESSIAAVMEYCSRMLGFSVCPVTKT 120 >UniRef50_A5ZM05 Putative uncharacterized protein n=4 Tax=Bacteria RepID=A5ZM05_9FIRM Length = 143 Score = 134 bits (338), Expect = 8e-31, Method: Composition-based stats. Identities = 43/118 (36%), Positives = 67/118 (56%), Gaps = 9/118 (7%) Query: 3 KERIIQEFVPGKQVTLAHLIAHPGEELAKKIGV---------PDAGAIGIMTLTPGETAM 53 K R+ + VPGK + AH+I + + + + +IGIM +TP E + Sbjct: 26 KLRLTRVRVPGKSLDFAHVITPSDPAVYQNLALNIGVHEGEDHTGESIGIMRVTPWEAIV 85 Query: 54 IAGDLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 +A D+A+KAADV +GF+DRFSGAL+I G + V A+ + V L++ +CE+ KS Sbjct: 86 VATDVAVKAADVQVGFMDRFSGALIILGGLSQVMTAVEEVVKFFRDELHFDVCEIHKS 143 >UniRef50_A6M0P4 Microcompartments protein n=6 Tax=Bacteria RepID=A6M0P4_CLOB8 Length = 126 Score = 128 bits (321), Expect = 8e-29, Method: Composition-based stats. Identities = 53/115 (46%), Positives = 78/115 (67%), Gaps = 5/115 (4%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVP-----DAGAIGIMTLTPGETAMIAG 56 D RIIQE VPGKQ+T+AH+IA P + KK+G+ + AIGI+T+TP ET++IA Sbjct: 12 DLMRIIQETVPGKQITMAHVIASPDSIIYKKLGLDPKIDYNRAAIGILTVTPSETSIIAA 71 Query: 57 DLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 D+A+KA+ + IGF+DRFSG L+I G++ V AL ++ L + +C +TK+ Sbjct: 72 DIAIKASAIEIGFIDRFSGTLIITGTISDVAIALESILNYAKNTLGFNVCNLTKA 126 >UniRef50_A9KPD2 Microcompartments protein n=3 Tax=Firmicutes RepID=A9KPD2_CLOPH Length = 113 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 56/110 (50%), Positives = 80/110 (72%) Query: 2 DKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALK 61 + R+I+E VPGKQVT+AH+IA P E+ + +G+ GAIGI+TL+P ET++IA D+A K Sbjct: 4 EMRRVIEESVPGKQVTIAHVIASPIREVYECLGIDQLGAIGILTLSPFETSIIAADIAAK 63 Query: 62 AADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNYTLCEMTKS 111 AADV +GFLDRF+G+++I G V +VE AL+ L R L Y + +TK+ Sbjct: 64 AADVEVGFLDRFTGSVIIAGDVDSVETALNAVCDILKRGLGYMVPVVTKT 113 >UniRef50_UPI0001693372 ethanolamine utilization protein EutS n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693372 Length = 68 Score = 90.4 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 31/68 (45%), Positives = 50/68 (73%) Query: 44 MTLTPGETAMIAGDLALKAADVHIGFLDRFSGALVIYGSVGAVEEALSQTVSGLGRLLNY 103 MT+TP E ++I D+A KAA V IGF+DRFSG+LVI G V +V+ A+++ ++GL +L + Sbjct: 1 MTITPSEASIIGADIATKAAGVQIGFVDRFSGSLVITGDVSSVDSAVNEVLNGLQNILGF 60 Query: 104 TLCEMTKS 111 + +T++ Sbjct: 61 SAAAITRT 68 >UniRef50_C0EV75 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EV75_9FIRM Length = 258 Score = 41.4 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 30/105 (28%), Positives = 44/105 (41%), Gaps = 15/105 (14%) Query: 19 AHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALKAADVHI----------G 68 +H+ GE L + G P A G++ P ++A D +KAA V I Sbjct: 131 SHVTPRAGEILHQIFGTPLGKAFGVIGAAPAGIGIVAVDQCMKAAPVDIVWYGSPSHNLT 190 Query: 69 FLDRFSGALVIYGSVGAVEEALSQTVSG---LGRLLNYTLCEMTK 110 ++ FS I G V AV++AL L R+ T +TK Sbjct: 191 MMNEFSAG--ISGDVSAVQKALEAGKEVGCELLRVCGITPISITK 233 >UniRef50_Q0TSP7 Ethanolamine utilization protein n=8 Tax=Clostridium perfringens RepID=Q0TSP7_CLOP1 Length = 254 Score = 41.1 bits (95), Expect = 0.012, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 34/66 (51%), Gaps = 3/66 (4%) Query: 38 AGAIGIMTLTPGETAMIAGDLALKAADVHIGFLDRFSGALV---IYGSVGAVEEALSQTV 94 A+G++ A+ A D ALKAA V + +++ +G +V I G V AV+ A+ Sbjct: 2 GKALGLIEFIGYVPAICAADTALKAASVDLIGIEKVTGGIVTVKITGDVDAVQAAVESAE 61 Query: 95 SGLGRL 100 + R+ Sbjct: 62 NYASRI 67 >UniRef50_B8DC54 Microcompartment protein family n=20 Tax=Listeria RepID=B8DC54_LISMH Length = 173 Score = 39.9 bits (92), Expect = 0.027, Method: Composition-based stats. Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 3/57 (5%) Query: 39 GAIGIMTLTPGETAMIAGDLALKAADVHIGFLDRFSGALV---IYGSVGAVEEALSQ 92 A+G++ +T A++A D LKAA+V + + SG L + G VGAV A+ Sbjct: 4 EALGLIEVTGFLGAVVAADTCLKAANVELIQCEVISGGLTTVELTGDVGAVNAAIEA 60 >UniRef50_B8FDM5 Microcompartments protein n=3 Tax=Deltaproteobacteria RepID=B8FDM5_DESAA Length = 183 Score = 38.7 bits (89), Expect = 0.057, Method: Composition-based stats. Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 6/94 (6%) Query: 4 ERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALKAA 63 RI EF+ L H+ ++ + VP+ AIG++ +A++A D A+KAA Sbjct: 61 LRIADEFI-ADHFVLTHIHPDVFPAISSFVIVPEIRAIGVIETYSSSSAVMAADKAVKAA 119 Query: 64 DVHIGFLDRFSG-----ALVIYGSVGAVEEALSQ 92 +V++ + +G + + G +G+V+ A+ Sbjct: 120 NVNLIDIRLATGLAGKAVVTLTGRIGSVQAAVEA 153 >UniRef50_Q892D8 Putative ethanolamine utilization protein n=1 Tax=Clostridium tetani RepID=Q892D8_CLOTE Length = 185 Score = 38.7 bits (89), Expect = 0.059, Method: Composition-based stats. Identities = 25/105 (23%), Positives = 54/105 (51%), Gaps = 11/105 (10%) Query: 14 KQVTLAHLIAHPGEELAKKIG----VPDAGAIGIMTLTPGETAMIAGDLALKAADVHIGF 69 K++ +++I E++ K + + + AIG+M + ++++A D+ALK +D+ + Sbjct: 71 KKLVESNIITSVHEDILKALKKRPIIKEFNAIGVMEFSTVTSSLVALDIALKTSDIQLVK 130 Query: 70 LDRFSGA-----LVIYGSVGAVEEALSQTVS--GLGRLLNYTLCE 107 L +G +I GSV +VEE + + R+++ T+ Sbjct: 131 LVLGNGIAGKAYFIINGSVSSVEEGVKSAQNCVNSKRIVHSTVIP 175 >UniRef50_C0ESY9 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0ESY9_9FIRM Length = 196 Score = 38.7 bits (89), Expect = 0.063, Method: Composition-based stats. Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 9/99 (9%) Query: 9 EFVPGKQVTLAHLIAHPGEELAKKIG----VPDAGAIGIMTLTPGETAMIAGDLALKAAD 64 E V G + H++ + EE+ I A+G + TA+IA D A+KA+ Sbjct: 74 ELVSGIYLIDTHILDNIREEVLPAIAGVTPTQQIQAVGAIETISALTAIIAADTAVKASK 133 Query: 65 VHIGFLDRF-----SGALVIYGSVGAVEEALSQTVSGLG 98 V I L G LVI G V +V A++ ++ LG Sbjct: 134 VSIVDLRIARGLGGKGYLVITGEVSSVRSAVNACLAKLG 172 >UniRef50_Q845B4 PduK protein n=1 Tax=Lactobacillus collinoides RepID=Q845B4_LACCL Length = 231 Score = 38.4 bits (88), Expect = 0.079, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 4/64 (6%) Query: 40 AIGIMTLTPGETAMIAGDLALKAADVHIGFLDRFSG----ALVIYGSVGAVEEALSQTVS 95 +IG + + A++A D LK ADV + +++ G + + G V AV+ A+ + Sbjct: 3 SIGYLEVEGLPGAIVAADRMLKTADVDLKYVENTKGGGWITVSVTGDVAAVQAAIDAGIG 62 Query: 96 GLGR 99 LG Sbjct: 63 ELGD 66 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.316 0.149 0.402 Lambda K H 0.267 0.0451 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 426,902,586 Number of Sequences: 3077464 Number of extensions: 15993899 Number of successful extensions: 44923 Number of sequences better than 1.0e-01: 47 Number of HSP's better than 0.1 without gapping: 29 Number of HSP's successfully gapped in prelim test: 34 Number of HSP's that attempted gapping in prelim test: 44850 Number of HSP's gapped (non-prelim): 77 length of query: 111 length of database: 1,040,396,356 effective HSP length: 79 effective length of query: 32 effective length of database: 797,276,700 effective search space: 25512854400 effective search space used: 25512854400 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 87 (38.0 bits)