BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (346 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76228 Inner membrane protein ynjI n=42 Tax=Bacteria Re... 724 0.0 UniRef50_C7BNF7 Inner membrane protein ynji n=1 Tax=Photorhabdus... 118 3e-25 UniRef50_Q7MZC0 Similar to unknown protein YnjI of Escherichia c... 114 5e-24 UniRef50_D0LG06 Putative uncharacterized protein n=1 Tax=Haliang... 103 9e-21 >UniRef50_P76228 Inner membrane protein ynjI n=42 Tax=Bacteria RepID=YNJI_ECOLI Length = 346 Score = 724 bits (1868), Expect = 0.0, Method: Compositional matrix adjust. Identities = 346/346 (100%), Positives = 346/346 (100%) Query: 1 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT 60 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT Sbjct: 1 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT 60 Query: 61 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI 120 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI Sbjct: 61 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI 120 Query: 121 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY 180 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY Sbjct: 121 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY 180 Query: 181 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW 240 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW Sbjct: 181 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW 240 Query: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC Sbjct: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 Query: 301 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS 346 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS Sbjct: 301 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS 346 >UniRef50_C7BNF7 Inner membrane protein ynji n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BNF7_PHOAA Length = 381 Score = 118 bits (296), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 94/349 (26%), Positives = 154/349 (44%), Gaps = 27/349 (7%) Query: 1 MKKVLLQNHPGSEKYSFNGWEIFN--------SNFERMIKE--NKAMLLCKWGFYLTCVV 50 M K LLQN P ++ G++I S F R++ N ++ + V Sbjct: 1 MAKELLQNQPHYRQFPVKGYQIIKNILCEKRKSPFLRLLYTVINILFVIAVGMGIILSVA 60 Query: 51 AVMFVFAAITSNGLNERGLITAGCSFLYLLIMMGLIV-RAGFKAKKEQLHYYQAKGIEPL 109 V+ + + + L G L ++I++ +I+ R + + EQ YYQ G+ L Sbjct: 61 IVLSIMGDVYVPDKDYLLLTKVGSVALVVVILLFVIIYRIVKRPEWEQRRYYQQAGLSLL 120 Query: 110 SIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 EK Q L+L ++ WSETLE +P + D + Y++LP + + + L Sbjct: 121 PEEKRQVLRLNIVSDYWLGFWSETLEHYPLQSRVAHDDYCYYLLPLSA---AQEHQSQLY 177 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANH 229 WGI D E Y ++ G H + F ++ + ++ +L K DYI C+ Sbjct: 178 SDWGILDEEGYMKMLTGLWHGVHSKH-FAVDVALSDGKMFEVLAKLVEVTPDYIRKCSRS 236 Query: 230 SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 +G A L+W +L I +S F G I EE+AW ++ + +E+F S +D+ N Sbjct: 237 VNGHPPA-LVWGFDLWLAIVLSRNCFCAGYISEEMAWENMLKTADYIYEIFGSFDDFYTN 295 Query: 290 SQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 ++G YW + T LE QF +Y C WPI +PW Sbjct: 296 FRLGNTYWSNDFDK---TKGRLE-------QF-NYYKLHCDWPIAKLPW 333 >UniRef50_Q7MZC0 Similar to unknown protein YnjI of Escherichia coli n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZC0_PHOLL Length = 380 Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 93/349 (26%), Positives = 156/349 (44%), Gaps = 31/349 (8%) Query: 3 KVLLQNHPGSEKYSFNGWEI------------FNSNFERMIKENKAMLLCKWGFYLTCVV 50 K LLQN P ++ G+++ F S +I + +C G L+ V Sbjct: 2 KELLQNQPHYRQFPVKGYQVMKHIISEKRKLPFLSALYALINILFVIAVCM-GIVLS-VA 59 Query: 51 AVMFVFAAITSNGLNERGLITAGCSFLY-LLIMMGLIVRAGFKAKKEQLHYYQAKGIEPL 109 V+F+ ++ + L G L+ +++++ +I R + + EQ YYQ G+ L Sbjct: 60 IVLFIIGNVSVPDTDYLLLAQVGGVALFGVMLLLVIIYRIVKRPEWEQRRYYQQAGLSLL 119 Query: 110 SIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 +K Q L+L ++ WSETLE +P + D+++Y +LP + + L Sbjct: 120 PEDKRQVLRLNIVGDYWLGFWSETLEHYPLQSRVAHDSYRYCLLPLSP---TQEHQSQLY 176 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANH 229 WGI D E Y ++ G H + F + + ++ +L K DYI CA Sbjct: 177 SDWGIIDEEGYMKMLTGLWEGVHSKH-FAIDAALSDGKMFKVLAKLVEVTPDYIHKCAKP 235 Query: 230 SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 + + A L+W +L I +S F G I EE+AW ++ + +E+F S +++ N Sbjct: 236 INKRPPA-LVWGFDLWLAIVLSRNCFCAGYISEEMAWKNMLKTADYIYEIFGSFDEFYTN 294 Query: 290 SQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 ++G YW R K LE QF +Y C WPI ++PW Sbjct: 295 FRLGNAYWSNDFDRSK---ERLE-------QF-NYYKSHCDWPIASLPW 332 >UniRef50_D0LG06 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LG06_HALO1 Length = 357 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 86/337 (25%), Positives = 145/337 (43%), Gaps = 29/337 (8%) Query: 5 LLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAITSNGL 64 +LQNHPG +Y + + FE + K + F ++ + A+ NGL Sbjct: 1 MLQNHPGDSRYPVT--SNWLTPFEHLSKRRASAARAALTF------GIITIGVAVLGNGL 52 Query: 65 NERG---LITAGCSFLYLLIMMGLIVRAGFK---------AKKEQLHYYQAKGIEPLSIE 112 L + + +Y + +GL++ A F ++EQ YY+ + + E Sbjct: 53 LSEAVEPLPASAFAVVYAIGALGLLLFAVFAWLVSQGATARRREQQRYYELGRVPEFTEE 112 Query: 113 KLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRR--ESLED 170 + A QL A WSETLE WP GK F ++ I+ K ESL+ Sbjct: 113 QRSAFQLDAVNAV--GLWSETLETWPCAARLGKVASGSAAASFVTLPILPKEEALESLDG 170 Query: 171 QWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHS 230 WG+ +E + L+G H A + + ++ L + P + + Sbjct: 171 DWGVLSAEGCRRTIADLLAGMHSAGFAEVARGPDGDAMLTRLAELTGLPLERVR-ATLQP 229 Query: 231 SGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNS 290 + + A+LIWA +L+ ++ ++ AF G +E AW I+ ASR AH LF S ED+ +N Sbjct: 230 ANRRPARLIWAWDLARVVPLARKAFMAGLFDEAQAWDAILSASRPAHALFASVEDFYENY 289 Query: 291 QMGFLYW----HICCYRRKLTDAELEACYRYDKQFWE 323 ++G +W R + DA L++ + W+ Sbjct: 290 RIGHAFWSNDYQGARTRAERIDAFLQSQLPVRRAVWQ 326 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76228 Inner membrane protein ynjI n=42 Tax=Bacteria Re... 515 e-145 UniRef50_C7BNF7 Inner membrane protein ynji n=1 Tax=Photorhabdus... 465 e-129 UniRef50_Q7MZC0 Similar to unknown protein YnjI of Escherichia c... 453 e-126 UniRef50_D0LG06 Putative uncharacterized protein n=1 Tax=Haliang... 375 e-102 Sequences not found previously or not previously below threshold: UniRef50_A5CUH0 Putative uncharacterized protein n=1 Tax=Claviba... 116 2e-24 UniRef50_C9LQ50 Putative uncharacterized protein n=1 Tax=Dialist... 67 1e-09 UniRef50_Q6MI61 Putative uncharacterized protein n=1 Tax=Bdellov... 61 5e-08 UniRef50_B0MZC9 Putative uncharacterized protein n=1 Tax=Alistip... 60 2e-07 UniRef50_Q2S9A8 Putative uncharacterized protein n=1 Tax=Hahella... 59 2e-07 UniRef50_Q5Z2V3 Putative uncharacterized protein n=1 Tax=Nocardi... 59 2e-07 UniRef50_B0NPN4 Putative uncharacterized protein n=1 Tax=Bactero... 56 2e-06 UniRef50_B9Y3D4 Putative uncharacterized protein n=1 Tax=Holdema... 56 2e-06 UniRef50_A5FIQ4 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 55 3e-06 UniRef50_C5EQ99 Putative uncharacterized protein n=1 Tax=Clostri... 55 4e-06 UniRef50_D1QRT4 Putative uncharacterized protein n=1 Tax=Prevote... 55 5e-06 UniRef50_C7M7R2 Putative uncharacterized protein n=1 Tax=Capnocy... 54 6e-06 UniRef50_C6DJA3 Putative uncharacterized protein n=4 Tax=Pectoba... 54 8e-06 UniRef50_D1PWQ7 Putative uncharacterized protein n=1 Tax=Prevote... 53 1e-05 UniRef50_C7PJR4 Putative uncharacterized protein n=1 Tax=Chitino... 53 1e-05 UniRef50_A8RHY8 Putative uncharacterized protein n=1 Tax=Clostri... 53 2e-05 UniRef50_D1QRT3 Putative uncharacterized protein n=1 Tax=Prevote... 50 1e-04 UniRef50_D0GJQ3 Putative liporotein n=1 Tax=Leptotrichia goodfel... 50 2e-04 UniRef50_Q5LDR8 Putative uncharacterized protein n=6 Tax=Bactero... 50 2e-04 UniRef50_C9Q0E0 Putative uncharacterized protein n=1 Tax=Prevote... 49 2e-04 UniRef50_B1EFD5 Putative uncharacterized protein n=1 Tax=Escheri... 49 2e-04 UniRef50_A7BBF7 Putative uncharacterized protein n=1 Tax=Actinom... 49 3e-04 UniRef50_B1KGR1 Putative uncharacterized protein n=1 Tax=Shewane... 48 4e-04 UniRef50_C6PMJ3 Putative uncharacterized protein n=1 Tax=Clostri... 48 6e-04 UniRef50_C8UAU2 Putative uncharacterized protein n=3 Tax=Escheri... 47 7e-04 UniRef50_Q5LIZ2 Putative uncharacterized protein n=5 Tax=Bactero... 47 7e-04 UniRef50_D1AQB9 Putative uncharacterized protein n=1 Tax=Sebalde... 47 8e-04 UniRef50_C4DP30 Putative uncharacterized protein n=1 Tax=Stackeb... 47 0.001 UniRef50_B8F949 Putative uncharacterized protein n=1 Tax=Desulfa... 47 0.001 UniRef50_A5FIQ3 Putative uncharacterized protein n=1 Tax=Flavoba... 47 0.001 UniRef50_Q8A2P3 Putative uncharacterized protein n=10 Tax=Bacter... 46 0.001 UniRef50_D0L2R2 Serine/threonine protein kinase-related protein ... 46 0.002 UniRef50_UPI00019694B1 hypothetical protein BACCELL_01818 n=1 Ta... 46 0.003 UniRef50_C7PJ30 Putative uncharacterized protein n=1 Tax=Chitino... 45 0.004 UniRef50_UPI0001B4FA8B hypothetical protein ShygA5_09679 n=1 Tax... 43 0.015 UniRef50_C0Z9V0 Putative uncharacterized protein n=1 Tax=Breviba... 41 0.048 UniRef50_A6VV45 Putative uncharacterized protein n=1 Tax=Marinom... 41 0.065 >UniRef50_P76228 Inner membrane protein ynjI n=42 Tax=Bacteria RepID=YNJI_ECOLI Length = 346 Score = 515 bits (1327), Expect = e-145, Method: Composition-based stats. Identities = 346/346 (100%), Positives = 346/346 (100%) Query: 1 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT 60 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT Sbjct: 1 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT 60 Query: 61 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI 120 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI Sbjct: 61 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI 120 Query: 121 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY 180 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY Sbjct: 121 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY 180 Query: 181 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW 240 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW Sbjct: 181 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW 240 Query: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC Sbjct: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 Query: 301 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS 346 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS Sbjct: 301 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS 346 >UniRef50_C7BNF7 Inner membrane protein ynji n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BNF7_PHOAA Length = 381 Score = 465 bits (1196), Expect = e-129, Method: Composition-based stats. Identities = 92/349 (26%), Positives = 152/349 (43%), Gaps = 27/349 (7%) Query: 1 MKKVLLQNHPGSEKYSFNGWEIFN--------SNFERMIKE--NKAMLLCKWGFYLTCVV 50 M K LLQN P ++ G++I S F R++ N ++ + V Sbjct: 1 MAKELLQNQPHYRQFPVKGYQIIKNILCEKRKSPFLRLLYTVINILFVIAVGMGIILSVA 60 Query: 51 AVMFVFAAITSNGLNERGLITAGCSFLYLLIMMG-LIVRAGFKAKKEQLHYYQAKGIEPL 109 V+ + + + L G L ++I++ +I R + + EQ YYQ G+ L Sbjct: 61 IVLSIMGDVYVPDKDYLLLTKVGSVALVVVILLFVIIYRIVKRPEWEQRRYYQQAGLSLL 120 Query: 110 SIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 EK Q L+L ++ WSETLE +P + D + Y++LP + + + L Sbjct: 121 PEEKRQVLRLNIVSDYWLGFWSETLEHYPLQSRVAHDDYCYYLLPLSA---AQEHQSQLY 177 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANH 229 WGI D E Y ++ G H + F ++ + ++ +L K DYI C+ Sbjct: 178 SDWGILDEEGYMKMLTGLWHGVHSKH-FAVDVALSDGKMFEVLAKLVEVTPDYIRKCSRS 236 Query: 230 SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 +G A L+W +L I +S F G I EE+AW ++ + +E+F S +D+ N Sbjct: 237 VNGHPPA-LVWGFDLWLAIVLSRNCFCAGYISEEMAWENMLKTADYIYEIFGSFDDFYTN 295 Query: 290 SQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 ++G YW + T LE + +Y C WPI +PW Sbjct: 296 FRLGNTYWSNDFDK---TKGRLEQ--------FNYYKLHCDWPIAKLPW 333 >UniRef50_Q7MZC0 Similar to unknown protein YnjI of Escherichia coli n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZC0_PHOLL Length = 380 Score = 453 bits (1164), Expect = e-126, Method: Composition-based stats. Identities = 89/349 (25%), Positives = 153/349 (43%), Gaps = 31/349 (8%) Query: 3 KVLLQNHPGSEKYSFNGWEI------------FNSNFERMIKENKAMLLCKWGFYLTCVV 50 K LLQN P ++ G+++ F S +I + +C + V Sbjct: 2 KELLQNQPHYRQFPVKGYQVMKHIISEKRKLPFLSALYALINILFVIAVCMG--IVLSVA 59 Query: 51 AVMFVFAAITSNGLNERGLITAGCSFLY-LLIMMGLIVRAGFKAKKEQLHYYQAKGIEPL 109 V+F+ ++ + L G L+ +++++ +I R + + EQ YYQ G+ L Sbjct: 60 IVLFIIGNVSVPDTDYLLLAQVGGVALFGVMLLLVIIYRIVKRPEWEQRRYYQQAGLSLL 119 Query: 110 SIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 +K Q L+L ++ WSETLE +P + D+++Y +LP + + L Sbjct: 120 PEDKRQVLRLNIVGDYWLGFWSETLEHYPLQSRVAHDSYRYCLLPLSP---TQEHQSQLY 176 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANH 229 WGI D E Y ++ G H + F + + ++ +L K DYI CA Sbjct: 177 SDWGIIDEEGYMKMLTGLWEGVHSKH-FAIDAALSDGKMFKVLAKLVEVTPDYIHKCAKP 235 Query: 230 SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 + + A L+W +L I +S F G I EE+AW ++ + +E+F S +++ N Sbjct: 236 INKRPPA-LVWGFDLWLAIVLSRNCFCAGYISEEMAWKNMLKTADYIYEIFGSFDEFYTN 294 Query: 290 SQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 ++G YW R K LE + +Y C WPI ++PW Sbjct: 295 FRLGNAYWSNDFDRSK---ERLEQ--------FNYYKSHCDWPIASLPW 332 >UniRef50_D0LG06 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LG06_HALO1 Length = 357 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 86/337 (25%), Positives = 145/337 (43%), Gaps = 29/337 (8%) Query: 5 LLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAITSNGL 64 +LQNHPG +Y + + FE + K + F ++ + A+ NGL Sbjct: 1 MLQNHPGDSRYPVT--SNWLTPFEHLSKRRASAARAALTF------GIITIGVAVLGNGL 52 Query: 65 NERG---LITAGCSFLYLLIMMGLIVRAGFK---------AKKEQLHYYQAKGIEPLSIE 112 L + + +Y + +GL++ A F ++EQ YY+ + + E Sbjct: 53 LSEAVEPLPASAFAVVYAIGALGLLLFAVFAWLVSQGATARRREQQRYYELGRVPEFTEE 112 Query: 113 KLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRR--ESLED 170 + A QL A WSETLE WP GK F ++ I+ K ESL+ Sbjct: 113 QRSAFQLDAVNAV--GLWSETLETWPCAARLGKVASGSAAASFVTLPILPKEEALESLDG 170 Query: 171 QWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHS 230 WG+ +E + L+G H A + + ++ L + P + + Sbjct: 171 DWGVLSAEGCRRTIADLLAGMHSAGFAEVARGPDGDAMLTRLAELTGLPLERVR-ATLQP 229 Query: 231 SGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNS 290 + + A+LIWA +L+ ++ ++ AF G +E AW I+ ASR AH LF S ED+ +N Sbjct: 230 ANRRPARLIWAWDLARVVPLARKAFMAGLFDEAQAWDAILSASRPAHALFASVEDFYENY 289 Query: 291 QMGFLYWHIC----CYRRKLTDAELEACYRYDKQFWE 323 ++G +W R + DA L++ + W+ Sbjct: 290 RIGHAFWSNDYQGARTRAERIDAFLQSQLPVRRAVWQ 326 >UniRef50_A5CUH0 Putative uncharacterized protein n=1 Tax=Clavibacter michiganensis subsp. michiganensis NCPPB 382 RepID=A5CUH0_CLAM3 Length = 348 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 61/362 (16%), Positives = 122/362 (33%), Gaps = 37/362 (10%) Query: 6 LQNHPGSEKYSFNGWEIFNSNFERMI------KENKAMLLCKWGFYLTCVVAVMFVFAAI 59 LQNHPG ++ + + K +A+L+ + + A Sbjct: 3 LQNHPGEVEFPVRARVRYARMLDAQRLRPGGGKGTRALLVTALALPFGAAGVALGILVAT 62 Query: 60 TSNGLNERGLITAGCSFLYLLIMMGL-IVRA---GFKAKKEQLHYYQAKGIEPLSIEKLQ 115 + + + + M+ IV +++QL Y I P+++E+ Q Sbjct: 63 SGEPEGGPAMPIVLFALGMGIGMLVASIVFQQIDARAPRRDQLDYVAQARIRPVTLEEQQ 122 Query: 116 ALQLIAPYRFYHKQWSETLEFWPRKPEP----------GKDTFQYHVLPFDSIDIISKRR 165 L L A + W+ +L F P E G + ++ LP ++ ++ R Sbjct: 123 LLALDAVSDYSFGGWNSSLAFQPTWAEMPAELRTTHADGANGHEWVGLPMTTL---AQHR 179 Query: 166 ESLEDQWGIEDSESYCALMEH-FLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYIS 224 +L+ Q+ I + + G A + + E E++++ + I Sbjct: 180 AALDTQFRIASRDDIELFVADALTQGPQSARFAELAVSEEAERMVSRMAALTGRSEFEII 239 Query: 225 DCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEE 284 D G+ L+ A + I A+ G + + AW I + ++ + Sbjct: 240 DLTRPHDGRPPVLLL-AGDSERTIGAIRYAYMAGYLPADDAWALIRQIGARVFATYDGWD 298 Query: 285 DYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVK 344 Y + + + R DA +++ R WP VPW ++ Sbjct: 299 AYWADVSLALAF------RTDSLDA-VQSQRRVRDAL-----VASAWPAATVPWPGAATP 346 Query: 345 YS 346 S Sbjct: 347 RS 348 >UniRef50_C9LQ50 Putative uncharacterized protein n=1 Tax=Dialister invisus DSM 15470 RepID=C9LQ50_9FIRM Length = 259 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 74/240 (30%), Gaps = 39/240 (16%) Query: 104 KGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISK 163 K + PL + L + A + ++LE K E Sbjct: 40 KKLSPL---QQSVLNIGAVNAEQTMFYCDSLETGSEKEEI-------------------- 76 Query: 164 RRESLEDQWGIEDSESYCALMEHFLSGDHGANT-----FKANMEEA-------PEQVIAL 211 R SL + I D ES +E L H F A + + P++ + Sbjct: 77 -RNSLAAYYDIIDEESALHTLEWLLERGHRVYFDAIKLFSAGISPSITDEILTPDERLDT 135 Query: 212 ---LNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHY 268 + I S + + A ++ ++ I+ F+ G I EE AW+Y Sbjct: 136 PRYMKNMKEMIESLIEKGYISSQADLQNQSVLAWDMGRLVLIARCCFECGYITEEKAWYY 195 Query: 269 IMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKK 328 + A +K ++ +++ +G W E R + W+ Sbjct: 196 MEEAHKKCCAVYGDWKEFAAGYVIGRCMWGGMKQMPGGIMGIAEGLLRDPESPWQKARLH 255 >UniRef50_Q6MI61 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MI61_BDEBA Length = 256 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 61/177 (34%), Gaps = 20/177 (11%) Query: 168 LEDQWGIEDSESYCALMEHFLSGDHGANTFKANME-EAPEQVIAL-LNKFAV-FPSDYIS 224 LE+ WG+ D ES +E+ + H + + A+ + KF F D Sbjct: 80 LEEFWGVSDRESCQKTLENIRTQGHRTKFNVLRSALPSDGSIDAVSMEKFRQIFRFDLEE 139 Query: 225 DCANHSSGKSSAKL---------------IWAAELSWMISISSTAFQNGTIEEELAWHYI 269 D S +KL I A + S I + +F G + + AW I Sbjct: 140 DQELQMSDADYSKLALWVQRTNKYLKEPGILAWDASRYIHLVRLSFVAGHLSDIQAWSEI 199 Query: 270 MLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYS 326 + + F++ D+ ++ +G +W A E + W+ S Sbjct: 200 LKLAPIVEGRFDNWMDFSQSFLIGRTFWSGADD--PRVKAICEKLLGHPASPWQFIS 254 >UniRef50_B0MZC9 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MZC9_9BACT Length = 342 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 66/181 (36%), Gaps = 13/181 (7%) Query: 154 PFDSIDIISKRRESLEDQWGIEDSESYCALMEHFL-SGDHGANTFKANMEEAPEQVIALL 212 +S +E LE+ W + D +S + L G H + E ++ I + Sbjct: 160 SLESTAGTDTLKEMLEEWWEVTDRKSALETISWLLNEGQHAG--ADPALAEIRQRGIEAI 217 Query: 213 NKFAVFPSDY-------ISDCANHSSGKSSAKL---IWAAELSWMISISSTAFQNGTIEE 262 + D I++ + + A L + A +L ++++ AF G I E Sbjct: 218 TEEEKADEDSKIGDAFTIAEFVMGVNETTEADLPETVLAWDLVRAVNMARWAFICGYINE 277 Query: 263 ELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFW 322 + W I + A E F S E+Y + +G W + D + A + W Sbjct: 278 DEMWEAIRTTAGIAKESFSSWEEYGNSFAVGRGIWRGETDDYETADEVVGALLNKEDSPW 337 Query: 323 E 323 + Sbjct: 338 K 338 >UniRef50_Q2S9A8 Putative uncharacterized protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S9A8_HAHCH Length = 394 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 60/186 (32%), Gaps = 8/186 (4%) Query: 142 EPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKAN- 200 D V + + I +E L+D W + D ES ++ L G H A K Sbjct: 191 RARHDLLGSEV---TTPESIQSWKEGLKDWWSVTDRESLLETLDWLLEGGHRAGFNKLRE 247 Query: 201 --MEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNG 258 M+ EQ A S I + ++ IS+ + G Sbjct: 248 QVMQLDSEQYQAAWEANEDEELRANMKIIRRYSNALGEPGIASWDIGRYISLCRWGYLVG 307 Query: 259 TIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAE--LEACYR 316 + EE AW I+ A+R ++ S +G +YW +T L Sbjct: 308 MLTEEEAWERILHAARAVQGMYHSWRQMGLGYVVGRMYWQSDASDEHMTKFFNLLRRQTV 367 Query: 317 YDKQFW 322 +W Sbjct: 368 SPDSYW 373 >UniRef50_Q5Z2V3 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5Z2V3_NOCFA Length = 278 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 63/184 (34%), Gaps = 31/184 (16%) Query: 166 ESLEDQWGIEDSESYCALMEHFLSGDHGA--------NTFKANMEEAPE---------QV 208 ++L WGI D A ME L G H T N E Sbjct: 74 DTLTGAWGITDGTEAQASMEQLLDGMHAPLYALVHPLVTASINASERDRFGERADRHRAF 133 Query: 209 IALLNKFAVF--PSDYISDC-----------ANHSSGKSSAKLIWAAELSWMISISSTAF 255 + + F P + D H + + I A +L+ +++++ +F Sbjct: 134 LRQVASFRGMDNPESLVRDYDIWSQAIKIGFTEHLARPLPSD-IHAWDLARVVAVARMSF 192 Query: 256 QNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACY 315 G IE ++AW Y+M A A + + + G+ YW C +L + ++ Sbjct: 193 TAGYIEADVAWGYLMRALPLAQRKYRNWRQFGDAYLTGWTYWQACEDLAELKNGGVDRRM 252 Query: 316 RYDK 319 + Sbjct: 253 ELLR 256 >UniRef50_B0NPN4 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NPN4_BACSE Length = 116 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 22/94 (23%), Positives = 43/94 (45%), Gaps = 1/94 (1%) Query: 230 SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 +G+ +I A +L ++++ A+ I E+ WH + +A+ A E F S E+Y ++ Sbjct: 18 PAGQMPQSVI-AWDLVRLVNLGRWAYLCDYIREDEMWHIMQVAADTALEHFSSWEEYGRS 76 Query: 290 SQMGFLYWHICCYRRKLTDAELEACYRYDKQFWE 323 MG WH + +E + + W+ Sbjct: 77 FIMGRGVWHGDPTDSETAYEIVELLLKNGESPWK 110 >UniRef50_B9Y3D4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y3D4_9FIRM Length = 367 Score = 55.6 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 52/152 (34%), Gaps = 7/152 (4%) Query: 165 RESLEDQWGIED----SESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPS 220 ++ LE WGI + +E L + AN +E + +V P Sbjct: 187 KDMLESSWGITNPAELTEKLRELTTAGHQAKYSRYQAAANPQELMDDPEDEEELESVLP- 245 Query: 221 DYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELF 280 A + K I + ++ + G I EE +W ++ + K ++F Sbjct: 246 --CWKLAQYFKDKLPENYILGWDYGRAATVVRWGYTVGYINEEDSWAWLDQIAEKMIDVF 303 Query: 281 ESEEDYQKNSQMGFLYWHICCYRRKLTDAELE 312 +S ++ + G L+W + E Sbjct: 304 DSWTEFGLSYVFGSLFWIAAFDGEEGISERFE 335 >UniRef50_A5FIQ4 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FIQ4_FLAJ1 Length = 256 Score = 55.2 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 61/182 (33%), Gaps = 16/182 (8%) Query: 165 RESLEDQWGIEDSESYCALMEHFLS--GDHGANTFKANMEEAPEQVI-----ALLNKFAV 217 ++ L+ W I D S ++ S G H E +++ L Sbjct: 74 KQMLQQYWSISDLNSGMKQVQELTSKNGMHSKEFVDQVKELGIDKMSKQEFETKLAAITD 133 Query: 218 FPSDYISDCANHSSGKSSA-KLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKA 276 P I + + I +L + ++ + G +E A + S++ Sbjct: 134 -PEQKIHLQLLYDAYTDLGYNAILGWDLGRANFLLTSFYVAGFNDENTALDKALEVSKRI 192 Query: 277 HELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNV 336 + F+S ++Y ++ G+LYW + + Y + F K + P V Sbjct: 193 QKTFKSWDEYNRSYMYGYLYWSNEDPKDS------SSKYAERQGFISELKKDTKSPF-QV 245 Query: 337 PW 338 W Sbjct: 246 KW 247 >UniRef50_C5EQ99 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ99_9FIRM Length = 329 Score = 54.8 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 60/162 (37%), Gaps = 13/162 (8%) Query: 165 RESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYIS 224 + +E WGI D +S +++ + H K E Q ++ A + S Sbjct: 146 KSVMERDWGIRDRQSAQSMISWLENEGHNQALLKYYEEHDLGQYETDIDLNASWDSGQGE 205 Query: 225 DCANHSSGKSSAKLIW---------AAELSWMISISSTAFQNGTIEEELAWHYIMLASRK 275 ++ + +A + + + S + + + G E A + +K Sbjct: 206 ISDGEAARQMAAYMGYRTYGAYAASGWDYSRALMLLGQCYVAGYYTYEEAMDKSLELGKK 265 Query: 276 AHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRY 317 +F S ED+ ++ GF+YW R T+ + E YR Sbjct: 266 LQSMFPSWEDFMQSYMYGFVYWS----RSDPTEPQSEFQYRV 303 >UniRef50_D1QRT4 Putative uncharacterized protein n=1 Tax=Prevotella oris F0302 RepID=D1QRT4_9BACT Length = 302 Score = 54.8 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 55/320 (17%), Positives = 107/320 (33%), Gaps = 46/320 (14%) Query: 30 MIKENKAMLLCKWGFYLTCVVAVMFVFAAITSNGLNERGLITAGCSFLYLLIMMGLIVRA 89 M K+ L+ GF A+ F + + + G++T + +++ + R Sbjct: 1 MKKKIIFCLIAMLGFLSQQAFALRFRVRVPSGSSESSNGVVTWIVYAVIAVVVAVTLYRY 60 Query: 90 GFKAKKEQLHY---YQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKD 146 FK + + + LS E+ + + L A Y K + T++ K Sbjct: 61 FFKIRGFLRQFKGDFLMDESSSLSKEQQRKMLLGAVYAVIDKGYLNTIKTGLEK------ 114 Query: 147 TFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHF---LSGDHGANTFKANMEE 203 +R + LE W I +S + + D+ N +A + Sbjct: 115 ---------------EEREDRLEKDWNICTHDSAVDALNGLKIACTKDYSPNIGEA-FKL 158 Query: 204 APEQVIALLNKFAVFPSDYISDCANHSSG--KSSAKLI----------------WAAELS 245 ++ I + + CA K L+ A E + Sbjct: 159 KEQKAIEKYLRETFVNPNDAKACAKQIERAFKHIGNLVKEGIVRDEAEFSRIGGVAFEAT 218 Query: 246 WMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRK 305 +++I+ ++ I E+ W Y+ A +AH+ S EDY K+ +G W Y Sbjct: 219 RLVAIARMCAESKYISEQEMWEYVDFADEQAHKSLTSWEDYGKSYVIGDCLWGADSYDLG 278 Query: 306 LTDAELEACYRYDKQFWEHY 325 + + K W+ + Sbjct: 279 QSSKIIRKLINDPKSPWKLF 298 >UniRef50_C7M7R2 Putative uncharacterized protein n=1 Tax=Capnocytophaga ochracea DSM 7271 RepID=C7M7R2_CAPOD Length = 240 Score = 54.4 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 76/226 (33%), Gaps = 37/226 (16%) Query: 72 AGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWS 131 G FL+ L + V +K+ H Q PL+ E+++ L A +Y + Sbjct: 9 IGAVFLFALKIYVNKVYTKKHLQKQIGHINQ----NPLTEEQIRLLTFGAILTYYRGE-- 62 Query: 132 ETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGD 191 ++L +I+ ++ L QW I D+ S + L+ Sbjct: 63 -------------------NLLNLIPTEILETYQKGLRRQWEITDTASAKETISDLLAQK 103 Query: 192 HGANTFKANMEEAPE--QVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMIS 249 + +PE ++ + K + + K +A +L S Sbjct: 104 RSLQFRHLLTQTSPELSKIQKQIAKGLGIELAQVE----------AVKSAYAWDLCRAAS 153 Query: 250 ISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFL 295 ++ + I E W + S A E ++ ++Y + +G Sbjct: 154 LAKWCYWCQYITETEMWDILQKVSEIAKEQGKNWQEYTISFLLGRT 199 >UniRef50_C6DJA3 Putative uncharacterized protein n=4 Tax=Pectobacterium RepID=C6DJA3_PECCP Length = 234 Score = 54.0 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 65/213 (30%), Gaps = 29/213 (13%) Query: 149 QYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQV 208 +Y FD SL + WGI E +++ G H Sbjct: 22 KYGARFFDPTFYEPGETISLSNSWGITSREGLISMINDMTDGGHAERLA---------YY 72 Query: 209 IALLNKFAVFPSDYISDCANHSSGKSSAKL-------------IWAAELSWMISISSTAF 255 L + S++ CAN S A + I A +L M +S Sbjct: 73 YHLWHHLTA--SEWQQHCANQSEEAQGALMLVTETAALCGEGGIRAWDLGRMSFLSRVGL 130 Query: 256 QNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYW----HICCYRRKLTDAEL 311 NG I E+ + +A + S E+Y +G YW ++ + Sbjct: 131 LNGWISEKENLWIHTRLADRARYYYRSWENYYAAFLIGRTYWLSSDEEDPECQRYIFSNA 190 Query: 312 EACYRYDKQFWEHYSKKCRWPIRNVPWGASSVK 344 Y Q Y+ PI ++ W ++ Sbjct: 191 SQNPDYIDQIGTLYT-HPDCPIHDLDWDVDPIE 222 >UniRef50_D1PWQ7 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PWQ7_9BACT Length = 235 Score = 53.3 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 77/223 (34%), Gaps = 29/223 (13%) Query: 75 SFLYLLIMMGLIVRAGFK--AKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSE 132 + + +LI+ ++ + AKK + + PL+ EK +L+ SE Sbjct: 8 AIVAVLIIAFIVFEIVSRLIAKKNLKAFLASHPQTPLTEEKK---RLLVFGAILSCYRSE 64 Query: 133 TLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDH 192 + L + D +++ + L++QW I E + L+ + Sbjct: 65 DI------------------LSIITDDNMNEYKTGLQEQWSINGREDALETLNALLNLEQ 106 Query: 193 GANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISS 252 + + A L + +D + + +A ++ ++S++ Sbjct: 107 ST---EVDEVLAQRGSSEELIELQTLIADGLKTDLAQV---RTTTSTYAWDVCRLVSLAK 160 Query: 253 TAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFL 295 + I E W Y+ + KA L + DY + MG Sbjct: 161 WCYWLQYISEAEMWKYLNEGAVKASSLGKDWNDYTVSFLMGRA 203 >UniRef50_C7PJR4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJR4_CHIPD Length = 286 Score = 53.3 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 63/189 (33%), Gaps = 15/189 (7%) Query: 148 FQYHVLPFDSIDIISKR--RESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEA- 204 +Y + PF D ++ + +L + W I + + L H A + Sbjct: 97 SRYFIFPFKPGDAAGEKDAKATLAEYWDIHNVAGLEKSLTWLLDEGHQAQYAQYRKVLDE 156 Query: 205 PEQVIALLN-------KFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQN 257 A LN K I +H + SSA I A +L+ I+ A+Q Sbjct: 157 NGGASADLNTLDLNKYKLTKEDLAGIQFIKDHYTSFSSAG-IKAWDLARYINNICVAYQA 215 Query: 258 GTIE--EELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACY 315 G E +AW +M A + A + + Y + +G +W E+ Sbjct: 216 GYFNRGEAMAW--LMKAPQVAQARYSDWKAYFNDFLLGREFWGGGEADNARFKEEVTGML 273 Query: 316 RYDKQFWEH 324 + + Sbjct: 274 EGKYSIYSY 282 >UniRef50_A8RHY8 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RHY8_9CLOT Length = 409 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 66/177 (37%), Gaps = 15/177 (8%) Query: 159 DIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLN--KFA 216 D+++ ++SLE+ W + D + + ++ L+ H +TF +M E I + + + Sbjct: 226 DVMAMEQKSLEEWWSVTDRATADSTLDWILTEGH-RDTFAEDMAYLEEAGIRDIAPNERS 284 Query: 217 VFPSDYISDCANHSSGKSS---------AKLIWAAELSWMISISSTAFQNGTIEEELAWH 267 F D A+ + + I + +++ S + G E+ A Sbjct: 285 AFLLDQFQMTADEAQNYADMFGFYEQYGPDAIAGWDYCRAMNLMSFYYLAGYYTEQEALD 344 Query: 268 YIMLASRKAHELFESEEDYQKNSQMGFLYW---HICCYRRKLTDAELEACYRYDKQF 321 + +R LFES +D + G+ YW R D + Y F Sbjct: 345 KSLEIARTMQPLFESWDDLMSSYMRGYEYWAEESADERRALYEDLKTREDNPYSVDF 401 >UniRef50_D1QRT3 Putative uncharacterized protein n=1 Tax=Prevotella oris F0302 RepID=D1QRT3_9BACT Length = 301 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 35/85 (41%) Query: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 A + ++ ++ + G I EE W+Y+ A AH S ED+ K+ +G W Sbjct: 213 AWDAGRLVFVARMCAEEGWITEEELWNYVDAADEIAHRTLTSWEDFGKSYIIGRCLWCGT 272 Query: 301 CYRRKLTDAELEACYRYDKQFWEHY 325 ++ + Y K W+ + Sbjct: 273 ANYFEVMAGYAKKMYTNPKSPWKTF 297 >UniRef50_D0GJQ3 Putative liporotein n=1 Tax=Leptotrichia goodfellowii F0264 RepID=D0GJQ3_9FUSO Length = 225 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 39/176 (22%), Positives = 64/176 (36%), Gaps = 9/176 (5%) Query: 143 PGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFL-SGDHGANTFKANM 201 ++ F Y LP D D E L W I D +S ++ L G G Sbjct: 38 LTRNDFPYDRLP-DKKDCDKCSVEVLSRDWDITDKKSATETLDILLDEGTRGEVDLILPE 96 Query: 202 EEAPEQVIALLNKFAVFPSDYISDCANHSSGKSS-----AKLIWAAELSWMISISSTAFQ 256 ++P + + A + I D + G + K I A + +I+++ A+ Sbjct: 97 LKSPNALSGEYAEVAA-TYNNIRDALVNDYGYTKEEVDNVKTISAWDYDRLINVARFAYD 155 Query: 257 NGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLY-WHICCYRRKLTDAEL 311 G I EE W YI KA ++S + Y +G + K+ +L Sbjct: 156 AGYITEEEMWSYIDKTVVKARNDYDSWKSYFAGVMLGRGITFSNNFSENKIVADKL 211 >UniRef50_Q5LDR8 Putative uncharacterized protein n=6 Tax=Bacteroides RepID=Q5LDR8_BACFN Length = 262 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 84/266 (31%), Gaps = 45/266 (16%) Query: 80 LIMMGLIVRAGFKAKKEQLHYYQAKGIEP---LSIEKLQALQLIAPYRFYHKQWSETLEF 136 +I++ + R + A ++ + +Q I P L+ + + L + + Y + +L Sbjct: 14 IIIVYYLYRHVWPAVRKFIRLFQGIRINPRSHLTEAEYKKLSVGSLYALQQGAYLNSL-- 71 Query: 137 WPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHF------LSG 190 ++DI K L D WGI +++ +E+ Sbjct: 72 --------------------TLDIKDKLPTILADWWGICNAQDAKQTLEYLGKKGFAYYF 111 Query: 191 DHGANTFKANMEEAPEQVI--------------ALLNKFAVFPSDYISDCANHSSGKSSA 236 H F + EEA +++ L+ + + Sbjct: 112 PHVYQAFLLDDEEAKDRIFQQHMDSQEDYDKAVEQLHNLEDCYDELLECGTITCREDLLR 171 Query: 237 KLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLY 296 + + + ++ + I E+ AWHYI A H F S D+ + +G Sbjct: 172 YGVTGWDAGRLNFMARACYDMKYISEDEAWHYINHAYEMVHSHFSSWHDFAMSYVIGRAL 231 Query: 297 WHICCYRRKLTDAELEACYRYDKQFW 322 W E + +K W Sbjct: 232 WGGKSASNSGMMYMAEDLLKSEKSPW 257 >UniRef50_C9Q0E0 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9Q0E0_9BACT Length = 298 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 74/249 (29%), Gaps = 47/249 (18%) Query: 76 FLYLLIMMGLIVRAGFKAKK------EQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQ 129 +++L + + ++ + K H Y+ PL+ ++ + + L A Y Sbjct: 39 YIWLGLTLFILYKIIRHRDKIVGLYKLMFHVYRLTPNNPLTTDQQRKILLSAIYS----- 93 Query: 130 WSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLS 189 + ++ + R LE WGI D S + Sbjct: 94 ----------------QQQCSKLDSLNTKLSTAYRTGMLEGGWGITDRASAIETLNFLKD 137 Query: 190 GDHGANT---FKANMEEAPEQVIALLNKF------AVFPSDYISDCANHSSGKSSAKLIW 240 H KA + E + + A ++I + K+I Sbjct: 138 EGHRYYFPYVVKALQQPTQEALNNYIKDIIQNEDRAERAVEFIQNVFYSLDELKKWKVIG 197 Query: 241 -----------AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 A + + I ++ I EE W Y+ A AH S DY + Sbjct: 198 SVDEFINIGVDAWDSGRLSFIVRLCYEAKLISEEETWQYLDAADDIAHNTLSSWNDYSTS 257 Query: 290 SQMGFLYWH 298 +G W+ Sbjct: 258 YILGRAMWN 266 >UniRef50_B1EFD5 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1EFD5_9ESCH Length = 273 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 16/140 (11%) Query: 172 WGIEDSESYCALMEHFLSGDHGANTFK-----------ANMEEAPEQVIALLN-KFAVFP 219 WGI+D ++ + G H + + E+ + + A ++ K + Sbjct: 91 WGIKDISMGMEMIRSLVDGRHNEQFLQEFYNITENVINLDNEQNWQTLFANISDKKLLIK 150 Query: 220 SDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHEL 279 + D S I A +LS + + + G I+E+ + + K + Sbjct: 151 MRVMHDAFLDFGNNS----ILAWDLSRANHLLADYYLAGWIDEQRYMKEVFDVTLKIQKS 206 Query: 280 FESEEDYQKNSQMGFLYWHI 299 F S +++ K+ G+L+W Sbjct: 207 FSSWDEFNKSYLYGYLWWSG 226 >UniRef50_A7BBF7 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BBF7_9ACTO Length = 353 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 49/140 (35%), Gaps = 11/140 (7%) Query: 168 LEDQWGIEDSESYCALMEHFLSGDHGANTFKA----NMEEAPEQVIALLNKFAVFPSD-- 221 L+ WGI + ES + L H + + IA L+K A + Sbjct: 123 LDRDWGITNRESLIRQIYSLLRAGHREDFAALRERCARPSWADTEIARLSKTADSSMEDW 182 Query: 222 ----YISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAH 277 I ++ G S A +L +++ G + E+ AW + + +R Sbjct: 183 ERRWRIRRFLDNDRGIQSLDFA-AWDLIRAANLTRAGAGLGWLSEDEAWDTLAIINRALQ 241 Query: 278 ELFESEEDYQKNSQMGFLYW 297 + S E+ + ++ W Sbjct: 242 FSYSSWEETWEAFRITRWLW 261 >UniRef50_B1KGR1 Putative uncharacterized protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KGR1_SHEWM Length = 233 Score = 48.3 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 49/152 (32%), Gaps = 22/152 (14%) Query: 171 QWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSD-YISDCANH 229 WGIE E Y +++ H + Q+ LN + D YI +++ Sbjct: 59 DWGIETREEYLNMLKWLREEGHNRSYM---------QMQDHLNTLSESAIDAYIDAHSHN 109 Query: 230 SSGKSSAKL------------IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAH 277 +S +L I A + +SI G E+ W I S + Sbjct: 110 VDRQSCLQLVRNYRHTLNIGGIGAWDDGRYVSICRWGASLGLFSEDECWEKIKQISLRVQ 169 Query: 278 ELFESEEDYQKNSQMGFLYWHICCYRRKLTDA 309 + ++S + + G +W D Sbjct: 170 QSYDSWHSFALSYIAGRQFWRNDATESFAKDE 201 >UniRef50_C6PMJ3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PMJ3_9CLOT Length = 222 Score = 47.9 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 60/185 (32%), Gaps = 9/185 (4%) Query: 145 KDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKAN---M 201 DT + ++I R ++ WGI + + L HG + + Sbjct: 32 HDTLCGCE---KTPELIEDNRHMMKRDWGISNKADLLGSLNWLLKDGHGIDFLQERYFFS 88 Query: 202 EEAPEQVIALLNKFAVFPSDYIS-DCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTI 260 + + LN+ S YI + + + A + I + I Sbjct: 89 TLSETEQNTYLNRLDKNSSKYIQYSLIKNYDKITPNAGVIAWDYGRYIFLCRDGVFLNYI 148 Query: 261 EEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYW--HICCYRRKLTDAELEACYRYD 318 E AW+ ++ ++ A + + S +Y G W ++ + ++++ Sbjct: 149 SSEEAWNLMLKVAKLAQKAYSSWREYGLAYIAGRQTWLKNMSADSAEEQVSKIKNLIIDK 208 Query: 319 KQFWE 323 + W Sbjct: 209 ESPWN 213 >UniRef50_C8UAU2 Putative uncharacterized protein n=3 Tax=Escherichia RepID=C8UAU2_ECO10 Length = 520 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 94/279 (33%), Gaps = 36/279 (12%) Query: 75 SFLYLLIMMGLIVRAGFKAKKEQLHYYQAK----GIEPLSIEKLQALQLIAPYRFYHKQW 130 + L++ + ++ G+ ++A I+ + + A+ + APY Sbjct: 224 VAIILILALVAVLWIGYGLVFAGHRLFKASCKDPSIQKIPAAERWAMAVGAPYAIAGN-- 281 Query: 131 SETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSG 190 W R D ++ + ESL D WG+ D +S + Sbjct: 282 ----NHWAR--RVVNDD------SVEAENDCKSEVESLADMWGVFDRDSLLEQLLALFVA 329 Query: 191 DHGANTFKA--NMEEAPE-------QVIALLNKFAVFPSDYISDCANHSSGKSSAKLI-- 239 H + + N E P+ Q +A K + + + + I Sbjct: 330 GHRSVYAEQIKNDSEMPDAEYRAFAQQLAQNAKSSSEAKERLWQLQMVRKNRRKICDIDF 389 Query: 240 WAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHI 299 A ++ + + ++ Q G I + + MLA+R+ + + S + + + YW Sbjct: 390 CAWDMVRFVMLCNSGAQVGYITQREMVDFSMLAARRVQQHYRSWRELAGHFLLARWYWK- 448 Query: 300 CCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 +R L L + K ++ P R +PW Sbjct: 449 ATDKRHLITHIL-----FKKAITSLLKEQGS-PWRTLPW 481 >UniRef50_Q5LIZ2 Putative uncharacterized protein n=5 Tax=Bacteroides RepID=Q5LIZ2_BACFN Length = 235 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 34/68 (50%) Query: 239 IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWH 298 + A ++ M+ ++ A+ G I+E LAW+YI A ++ + F + K+ +G Sbjct: 146 VLAWDMGRMVCLTRIAYDAGFIDESLAWNYICSAGQQCIQAFNDWTEVGKSFLLGQAMEA 205 Query: 299 ICCYRRKL 306 +++L Sbjct: 206 TEKRKQEL 213 >UniRef50_D1AQB9 Putative uncharacterized protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AQB9_SEBTE Length = 251 Score = 47.5 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 49/132 (37%), Gaps = 4/132 (3%) Query: 168 LEDQWGIEDSESYCALMEHFLSGDHGAN----TFKANMEEAPEQVIALLNKFAVFPSDYI 223 L W + D+ + +E L+ H A + EA E+ + + + Sbjct: 89 LRSAWKVTDTATAKETLESLLAEGHRAEGDPMLTELRTPEAAEKNTEEFQAYEDVKKNLV 148 Query: 224 SDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESE 283 + + K I A + +++++ + G I E+ W YI A +A + S Sbjct: 149 DNYGYTAEQVDGIKTISAWDYDRLVNVARFSHSAGYITEQEMWDYINKAVTQAKNDYNSW 208 Query: 284 EDYQKNSQMGFL 295 E+Y +G Sbjct: 209 EEYFAGVMLGRT 220 >UniRef50_C4DP30 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DP30_9ACTO Length = 359 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 53/185 (28%), Gaps = 40/185 (21%) Query: 154 PFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLN 213 PF S+ R +LE WGI D E A +E H + Sbjct: 207 PFKSLGRTGISR-ALERDWGIRDREGMVAQIESLARDGHREQFAQ--------------- 250 Query: 214 KFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLAS 273 A P Y+ A + + + + F G +EE W ++ + Sbjct: 251 --AGIPGKYL-----------------AWDYARALWMQRMGFILGWFDEEYCWDTMLPLA 291 Query: 274 RKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPI 333 R + + G W K + + ++ + KC W Sbjct: 292 RDVQRHYSGWAEMNHWYLEGRRLWSAAVSDGKPDPVQAQRERTAERLAAD---PKCPWNF 348 Query: 334 RNVPW 338 +PW Sbjct: 349 --LPW 351 >UniRef50_B8F949 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F949_DESAA Length = 218 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 44/98 (44%), Gaps = 11/98 (11%) Query: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 A + +I + ++ G I + AW +IM +++ + + S E Y + ++GF YW+ Sbjct: 123 AWDHGNLIQAARWSYSAGYISSDDAWDWIMSSAKTIQDNYSSWEHYGFHWRLGFEYWN-- 180 Query: 301 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + LT + EA W + W + +PW Sbjct: 181 -DGQPLTSSFREAGA------WLLMNSASPW--KKLPW 209 >UniRef50_A5FIQ3 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FIQ3_FLAJ1 Length = 306 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 62/185 (33%), Gaps = 18/185 (9%) Query: 148 FQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPE- 206 Q+ F+ S +E LE+ W I D E ++E H + + + + Sbjct: 136 HQHPTATFEINGYKSDLKEMLENAWNITDHEDAVEILEWLKDEGHRGEDAEIDEDTEDDI 195 Query: 207 -QVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELA 265 + L D G + I + ++ + S++ + G I+++ Sbjct: 196 KEYEQALAPALKLNID--------EDGNPTD--IDSWDIERIGSVARYCYAAGYIDQQTC 245 Query: 266 WHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC--CYRRKLTDAELEACYRYDKQFWE 323 Y+ A + A E + + +Y + G + + + L + K W Sbjct: 246 LQYLETARKMAKERYNNWSEYAASFMTGRAFMYGGSPIDFATVILEMLSS----KKSIWN 301 Query: 324 HYSKK 328 Y K Sbjct: 302 TYPLK 306 >UniRef50_Q8A2P3 Putative uncharacterized protein n=10 Tax=Bacteroides RepID=Q8A2P3_BACTN Length = 226 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 28/61 (45%) Query: 239 IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWH 298 + +++ ++ ++ A G I +E AW YI AS E+ + E+ K+ +G Sbjct: 146 VIGWDMAQVVGLARAAKDCGYITKEQAWEYIEQASTLCSEILRTPEEIDKSFLIGGAMKS 205 Query: 299 I 299 Sbjct: 206 N 206 >UniRef50_D0L2R2 Serine/threonine protein kinase-related protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L2R2_GORB4 Length = 584 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 65/189 (34%), Gaps = 19/189 (10%) Query: 165 RESLEDQWGIEDSESYCALMEHFLSGDHG--------------ANTFKANMEEAPEQVIA 210 R+ L D WGI D+ES ++ G T + + ++++ Sbjct: 396 RKKLRDTWGIVDAESADETVDLLQLGMDAPDYDPTLRTIRNVAGGTPRGALVHERDRILR 455 Query: 211 LLNKFAVFPSDYISDCANHSSGKSS--AKLIWAAELSWMISISSTAFQNGTIEEELAWHY 268 D + A+ + + A +LS ++ I G ++ ++AW Sbjct: 456 AAPGLIPSILDTVLTVASSTRDFPEEIPGSVAAWDLSRLVIIVRYCVFLGYLDPDVAWSI 515 Query: 269 IMLASRKAHELFESEEDYQKNSQMGFLY--WHICCYRRKLTDA-ELEACYRYDKQFWEHY 325 ++ A R+A ++ Y ++G + + D E+ + + Sbjct: 516 VVDAGRRAAGVYPHWGAYAAGFEVGRALSRAEGDRHPARAADGVFAESRPIILRLLSDPT 575 Query: 326 SKKCRWPIR 334 S R P+R Sbjct: 576 SPWIRLPLR 584 >UniRef50_UPI00019694B1 hypothetical protein BACCELL_01818 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI00019694B1 Length = 291 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 33/67 (49%) Query: 227 ANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDY 286 + + + I A +L ++S++ ++ G + EE AW YI A +K E F ++ Sbjct: 190 TIEITNEDLQRGILAWDLGHLVSLARVSYDYGLLAEEEAWKYIEFAGKKCRETFACWKEI 249 Query: 287 QKNSQMG 293 K+ +G Sbjct: 250 GKSFLLG 256 >UniRef50_C7PJ30 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJ30_CHIPD Length = 280 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 39/263 (14%), Positives = 80/263 (30%), Gaps = 58/263 (22%) Query: 75 SFLYLLIMMGLIVRAGFKA---------KKEQLHYYQAKGIEPLSIE------KLQALQL 119 + +Y++ ++ +++ +A ++EQ + +S + Q + Sbjct: 11 AAVYIVFVIIKLMKLNSRAKEIAAKAMKEREQQRDKAVEDEPLISEDGVLSLHDRQYIAC 70 Query: 120 IAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSES 179 A + + +TLE + E L W I + Sbjct: 71 GANLIYLRGERLDTLETDTEQDEIRH---------------------MLRRDWHINTRDK 109 Query: 180 YCALMEHFLSGDHGANTFKA-------NMEEAPEQVIALLNKFA----VFPSDY----IS 224 + ++ + H + E PE + L FA P + IS Sbjct: 110 LLSTIDGLATRGHRVYFKPIWQILTTLPVRERPEALDKLQQDFAAKGDDVPIEQYAANIS 169 Query: 225 DCANHS-------SGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAH 277 +C H GK +L I++ + G + E + YI ++ Sbjct: 170 ECYKHLREISDCFEGKKCKLDALTWDLGRAINLCRWGYDAGFLSREESMRYIRKFGKELL 229 Query: 278 ELFESEEDYQKNSQMGFLYWHIC 300 + S + +N +GF W Sbjct: 230 HNYTSWANLGENYLIGFAMWTGD 252 >UniRef50_UPI0001B4FA8B hypothetical protein ShygA5_09679 n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4FA8B Length = 212 Score = 43.2 bits (100), Expect = 0.015, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 32/107 (29%), Gaps = 4/107 (3%) Query: 232 GKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQ 291 G A +++ I+ +G I+E AW + + S ++Y + Sbjct: 102 GHPPL----AWDIARYADITRYGLASGYIDEPTAWRLLREVVAPVARTYGSWKEYADDFM 157 Query: 292 MGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 G L W + + D + P + PW Sbjct: 158 TGRLAWMRALHGTENEDWPVSQEDTARAVQRLVDPMNQDSPWQRTPW 204 >UniRef50_C0Z9V0 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z9V0_BREBN Length = 209 Score = 41.3 bits (95), Expect = 0.048, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 60/172 (34%), Gaps = 16/172 (9%) Query: 169 EDQWGIEDSESYCALMEHFLSGDHGA-----NTFKANMEEAPEQVIALLNKFAVFPSDYI 223 +WG++D+ S + + L + F + E+ + + I Sbjct: 38 YTKWGMKDATSQRSRLTWMLQEGERKEFARLHHFMTALSESGRKEYI---DSLESDQERI 94 Query: 224 SDC--ANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFE 281 + + A+ I A + +W +S G I +E A + + A+R+ + + Sbjct: 95 AKAKVVQFYMRRLPAEGIAAYDYTWASFLSRRKGDYGYISKEEARQFKLQATRQTQQAYN 154 Query: 282 SEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSK--KCRW 331 + ++ G+ + + D E + + + F +S K W Sbjct: 155 NWGEFFTGYIAGYQF----MTAQTSLDYLRENEWEFTRYFVSKHSSIVKTDW 202 >UniRef50_A6VV45 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MWYL1 RepID=A6VV45_MARMS Length = 242 Score = 40.9 bits (94), Expect = 0.065, Method: Composition-based stats. Identities = 13/59 (22%), Positives = 26/59 (44%) Query: 239 IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYW 297 I A ++ + A+ I E+ AW +++ + A E F S ++ + +G W Sbjct: 136 IQAFDIGRYAFLCRCAYTVSLITEDEAWAFLLRIGKIAQERFTSWYEFATSYTVGRCIW 194 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76228 Inner membrane protein ynjI n=42 Tax=Bacteria Re... 383 e-105 UniRef50_C7BNF7 Inner membrane protein ynji n=1 Tax=Photorhabdus... 329 9e-89 UniRef50_Q7MZC0 Similar to unknown protein YnjI of Escherichia c... 323 4e-87 UniRef50_D0LG06 Putative uncharacterized protein n=1 Tax=Haliang... 264 3e-69 UniRef50_A5CUH0 Putative uncharacterized protein n=1 Tax=Claviba... 248 2e-64 UniRef50_D1QRT4 Putative uncharacterized protein n=1 Tax=Prevote... 201 3e-50 UniRef50_Q5LDR8 Putative uncharacterized protein n=6 Tax=Bactero... 177 5e-43 UniRef50_C8UAU2 Putative uncharacterized protein n=3 Tax=Escheri... 175 2e-42 UniRef50_C9LQ50 Putative uncharacterized protein n=1 Tax=Dialist... 175 2e-42 UniRef50_C9Q0E0 Putative uncharacterized protein n=1 Tax=Prevote... 171 3e-41 UniRef50_Q2S9A8 Putative uncharacterized protein n=1 Tax=Hahella... 162 2e-38 UniRef50_C7M7R2 Putative uncharacterized protein n=1 Tax=Capnocy... 156 1e-36 UniRef50_B0MZC9 Putative uncharacterized protein n=1 Tax=Alistip... 153 9e-36 UniRef50_C6PMJ3 Putative uncharacterized protein n=1 Tax=Clostri... 150 1e-34 UniRef50_D1QRT3 Putative uncharacterized protein n=1 Tax=Prevote... 148 2e-34 UniRef50_D1PWQ7 Putative uncharacterized protein n=1 Tax=Prevote... 148 3e-34 UniRef50_A5FIQ3 Putative uncharacterized protein n=1 Tax=Flavoba... 145 3e-33 UniRef50_D0GJQ3 Putative liporotein n=1 Tax=Leptotrichia goodfel... 143 9e-33 UniRef50_C7PJR4 Putative uncharacterized protein n=1 Tax=Chitino... 141 3e-32 UniRef50_C6DJA3 Putative uncharacterized protein n=4 Tax=Pectoba... 141 3e-32 UniRef50_D1AQB9 Putative uncharacterized protein n=1 Tax=Sebalde... 141 5e-32 UniRef50_B1KGR1 Putative uncharacterized protein n=1 Tax=Shewane... 139 1e-31 UniRef50_Q6MI61 Putative uncharacterized protein n=1 Tax=Bdellov... 130 6e-29 UniRef50_A8RHY8 Putative uncharacterized protein n=1 Tax=Clostri... 130 7e-29 UniRef50_B9Y3D4 Putative uncharacterized protein n=1 Tax=Holdema... 130 8e-29 UniRef50_C5EQ99 Putative uncharacterized protein n=1 Tax=Clostri... 130 1e-28 UniRef50_A5FIQ4 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 129 1e-28 UniRef50_C4DP30 Putative uncharacterized protein n=1 Tax=Stackeb... 124 3e-27 UniRef50_A7BBF7 Putative uncharacterized protein n=1 Tax=Actinom... 123 8e-27 UniRef50_D0L2R2 Serine/threonine protein kinase-related protein ... 122 2e-26 UniRef50_Q5Z2V3 Putative uncharacterized protein n=1 Tax=Nocardi... 118 3e-25 UniRef50_B1EFD5 Putative uncharacterized protein n=1 Tax=Escheri... 110 1e-22 UniRef50_Q5LIZ2 Putative uncharacterized protein n=5 Tax=Bactero... 109 1e-22 UniRef50_B0NPN4 Putative uncharacterized protein n=1 Tax=Bactero... 104 4e-21 UniRef50_B8F949 Putative uncharacterized protein n=1 Tax=Desulfa... 101 5e-20 UniRef50_Q8A2P3 Putative uncharacterized protein n=10 Tax=Bacter... 82 3e-14 Sequences not found previously or not previously below threshold: UniRef50_C7PJ30 Putative uncharacterized protein n=1 Tax=Chitino... 116 1e-24 UniRef50_B7B9Y1 Putative uncharacterized protein n=2 Tax=Bactero... 106 2e-21 UniRef50_A7B4Z3 Putative uncharacterized protein n=1 Tax=Ruminoc... 103 8e-21 UniRef50_C9MPB3 Putative uncharacterized protein n=1 Tax=Prevote... 98 3e-19 UniRef50_A6VV45 Putative uncharacterized protein n=1 Tax=Marinom... 98 4e-19 UniRef50_P77427 Uncharacterized protein ybeU n=128 Tax=Enterobac... 96 1e-18 UniRef50_UPI00019694B1 hypothetical protein BACCELL_01818 n=1 Ta... 96 2e-18 UniRef50_Q3KF29 Putative uncharacterized protein n=1 Tax=Pseudom... 94 8e-18 UniRef50_C0FWW7 Putative uncharacterized protein n=1 Tax=Rosebur... 92 3e-17 UniRef50_UPI0001B4FA8B hypothetical protein ShygA5_09679 n=1 Tax... 88 4e-16 UniRef50_C7PJC7 Putative uncharacterized protein n=1 Tax=Chitino... 86 1e-15 UniRef50_C7PXG6 Putative uncharacterized protein n=1 Tax=Catenul... 86 2e-15 UniRef50_A8IK38 WosA n=4 Tax=Proteus RepID=A8IK38_PROMI 86 2e-15 UniRef50_Q8RI41 Putative uncharacterized protein FN1795 n=1 Tax=... 85 4e-15 UniRef50_C3BW33 Putative uncharacterized protein n=3 Tax=Bacillu... 82 3e-14 UniRef50_C5EQL7 Predicted protein n=1 Tax=Clostridiales bacteriu... 80 1e-13 UniRef50_Q1IDH3 Putative uncharacterized protein n=1 Tax=Pseudom... 80 1e-13 UniRef50_C0ZAT1 Putative uncharacterized protein n=1 Tax=Breviba... 79 2e-13 UniRef50_Q639C8 Group-specific protein n=16 Tax=Bacillus cereus ... 78 5e-13 UniRef50_A0AF53 Complete genome n=6 Tax=Listeria RepID=A0AF53_LISW6 77 9e-13 UniRef50_C3PJD0 Putative uncharacterized protein n=4 Tax=Coryneb... 75 3e-12 UniRef50_C5ETY0 Predicted protein n=1 Tax=Clostridiales bacteriu... 75 4e-12 UniRef50_C1YTB3 Putative uncharacterized protein n=1 Tax=Nocardi... 75 4e-12 UniRef50_C0Z9V0 Putative uncharacterized protein n=1 Tax=Breviba... 75 5e-12 UniRef50_UPI0001B4F4A1 hypothetical protein ShygA5_42835 n=1 Tax... 75 6e-12 UniRef50_UPI0001B58620 hypothetical protein StAA4_22524 n=1 Tax=... 74 8e-12 UniRef50_Q5WK87 Putative uncharacterized protein n=1 Tax=Bacillu... 74 8e-12 UniRef50_B1W156 Putative uncharacterized protein n=3 Tax=Strepto... 73 2e-11 UniRef50_C8NTP8 Putative uncharacterized protein n=1 Tax=Coryneb... 71 5e-11 UniRef50_C8NLE2 Putative uncharacterized protein n=2 Tax=Coryneb... 71 6e-11 UniRef50_C1YUR3 Putative uncharacterized protein n=1 Tax=Nocardi... 71 6e-11 UniRef50_C0XUK5 Putative uncharacterized protein n=1 Tax=Coryneb... 70 1e-10 UniRef50_Q1QY12 Putative uncharacterized protein n=1 Tax=Chromoh... 65 3e-09 UniRef50_UPI00003826C9 hypothetical protein Magn03000930 n=1 Tax... 65 6e-09 UniRef50_B2UWN6 Putative uncharacterized protein n=2 Tax=Clostri... 64 9e-09 UniRef50_B1W155 Putative uncharacterized protein n=3 Tax=Strepto... 64 9e-09 UniRef50_Q4K851 Putative uncharacterized protein n=1 Tax=Pseudom... 63 2e-08 UniRef50_Q39LT8 Putative uncharacterized protein n=9 Tax=Proteob... 63 2e-08 UniRef50_B5HCC5 Predicted protein n=4 Tax=Streptomyces RepID=B5H... 62 3e-08 UniRef50_A1ACT9 Putative uncharacterized protein n=36 Tax=Entero... 62 3e-08 UniRef50_A4QDU6 Putative uncharacterized protein n=2 Tax=Coryneb... 62 4e-08 UniRef50_C7Q7G6 Putative uncharacterized protein n=1 Tax=Catenul... 61 5e-08 UniRef50_Q1QY11 Putative uncharacterized protein n=1 Tax=Chromoh... 56 2e-06 UniRef50_C5EGQ2 Putative uncharacterized protein n=1 Tax=Clostri... 55 3e-06 UniRef50_D0KK37 Putative uncharacterized protein n=1 Tax=Pectoba... 53 1e-05 UniRef50_D0J788 Putative uncharacterized protein n=3 Tax=Comamon... 51 5e-05 UniRef50_B9CYP9 Putative uncharacterized protein n=1 Tax=Campylo... 49 2e-04 UniRef50_Q47QQ1 Putative uncharacterized protein n=1 Tax=Thermob... 48 5e-04 UniRef50_Q894P0 Putative uncharacterized protein n=1 Tax=Clostri... 48 6e-04 UniRef50_B5ZB98 Putative uncharacterized protein n=15 Tax=Ureapl... 46 0.002 UniRef50_A8RZQ8 Putative uncharacterized protein n=1 Tax=Clostri... 46 0.002 UniRef50_D0WQ83 Putative uncharacterized protein n=1 Tax=Actinom... 43 0.023 >UniRef50_P76228 Inner membrane protein ynjI n=42 Tax=Bacteria RepID=YNJI_ECOLI Length = 346 Score = 383 bits (983), Expect = e-105, Method: Composition-based stats. Identities = 346/346 (100%), Positives = 346/346 (100%) Query: 1 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT 60 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT Sbjct: 1 MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAIT 60 Query: 61 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI 120 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI Sbjct: 61 SNGLNERGLITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLI 120 Query: 121 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY 180 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY Sbjct: 121 APYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESY 180 Query: 181 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW 240 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW Sbjct: 181 CALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW 240 Query: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC Sbjct: 241 AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHIC 300 Query: 301 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS 346 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS Sbjct: 301 CYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVKYS 346 >UniRef50_C7BNF7 Inner membrane protein ynji n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BNF7_PHOAA Length = 381 Score = 329 bits (843), Expect = 9e-89, Method: Composition-based stats. Identities = 89/350 (25%), Positives = 150/350 (42%), Gaps = 27/350 (7%) Query: 1 MKKVLLQNHPGSEKYSFNGWEIFN--------SNFERMIKE--NKAMLLCKWGFYLTCVV 50 M K LLQN P ++ G++I S F R++ N ++ + V Sbjct: 1 MAKELLQNQPHYRQFPVKGYQIIKNILCEKRKSPFLRLLYTVINILFVIAVGMGIILSVA 60 Query: 51 AVMFVFAAITSNGLNERGLITAGCSFLYLLIMMG-LIVRAGFKAKKEQLHYYQAKGIEPL 109 V+ + + + L G L ++I++ +I R + + EQ YYQ G+ L Sbjct: 61 IVLSIMGDVYVPDKDYLLLTKVGSVALVVVILLFVIIYRIVKRPEWEQRRYYQQAGLSLL 120 Query: 110 SIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 EK Q L+L ++ WSETLE +P + D + Y++LP + + + L Sbjct: 121 PEEKRQVLRLNIVSDYWLGFWSETLEHYPLQSRVAHDDYCYYLLPLSA---AQEHQSQLY 177 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANH 229 WGI D E Y ++ G H + ++ + ++ +L K DYI C+ Sbjct: 178 SDWGILDEEGYMKMLTGLWHGVHSKHFA-VDVALSDGKMFEVLAKLVEVTPDYIRKCSRS 236 Query: 230 SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 +G A L+W +L I +S F G I EE+AW ++ + +E+F S +D+ N Sbjct: 237 VNGHPPA-LVWGFDLWLAIVLSRNCFCAGYISEEMAWENMLKTADYIYEIFGSFDDFYTN 295 Query: 290 SQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWG 339 ++G YW + K + + +Y C WPI +PW Sbjct: 296 FRLGNTYWSNDFDKTKGRLEQ-----------FNYYKLHCDWPIAKLPWP 334 >UniRef50_Q7MZC0 Similar to unknown protein YnjI of Escherichia coli n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZC0_PHOLL Length = 380 Score = 323 bits (828), Expect = 4e-87, Method: Composition-based stats. Identities = 88/350 (25%), Positives = 152/350 (43%), Gaps = 31/350 (8%) Query: 3 KVLLQNHPGSEKYSFNGWEI------------FNSNFERMIKENKAMLLCKWGFYLTCVV 50 K LLQN P ++ G+++ F S +I + +C + V Sbjct: 2 KELLQNQPHYRQFPVKGYQVMKHIISEKRKLPFLSALYALINILFVIAVCMG--IVLSVA 59 Query: 51 AVMFVFAAITSNGLNERGLITAGCSFLY-LLIMMGLIVRAGFKAKKEQLHYYQAKGIEPL 109 V+F+ ++ + L G L+ +++++ +I R + + EQ YYQ G+ L Sbjct: 60 IVLFIIGNVSVPDTDYLLLAQVGGVALFGVMLLLVIIYRIVKRPEWEQRRYYQQAGLSLL 119 Query: 110 SIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 +K Q L+L ++ WSETLE +P + D+++Y +LP + + L Sbjct: 120 PEDKRQVLRLNIVGDYWLGFWSETLEHYPLQSRVAHDSYRYCLLPLSP---TQEHQSQLY 176 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANH 229 WGI D E Y ++ G H + + + ++ +L K DYI CA Sbjct: 177 SDWGIIDEEGYMKMLTGLWEGVHSKHFA-IDAALSDGKMFKVLAKLVEVTPDYIHKCAKP 235 Query: 230 SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 + + A L+W +L I +S F G I EE+AW ++ + +E+F S +++ N Sbjct: 236 INKRPPA-LVWGFDLWLAIVLSRNCFCAGYISEEMAWKNMLKTADYIYEIFGSFDEFYTN 294 Query: 290 SQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWG 339 ++G YW R K LE + +Y C WPI ++PW Sbjct: 295 FRLGNAYWSNDFDRSK---ERLEQ--------FNYYKSHCDWPIASLPWP 333 >UniRef50_D0LG06 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LG06_HALO1 Length = 357 Score = 264 bits (674), Expect = 3e-69, Method: Composition-based stats. Identities = 86/339 (25%), Positives = 145/339 (42%), Gaps = 29/339 (8%) Query: 5 LLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAITSNGL 64 +LQNHPG +Y + + FE + K + F ++ + A+ NGL Sbjct: 1 MLQNHPGDSRYPVT--SNWLTPFEHLSKRRASAARAALTF------GIITIGVAVLGNGL 52 Query: 65 NERG---LITAGCSFLYLLIMMGLIVRAGFK---------AKKEQLHYYQAKGIEPLSIE 112 L + + +Y + +GL++ A F ++EQ YY+ + + E Sbjct: 53 LSEAVEPLPASAFAVVYAIGALGLLLFAVFAWLVSQGATARRREQQRYYELGRVPEFTEE 112 Query: 113 KLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRR--ESLED 170 + A QL A WSETLE WP GK F ++ I+ K ESL+ Sbjct: 113 QRSAFQLDAVNAV--GLWSETLETWPCAARLGKVASGSAAASFVTLPILPKEEALESLDG 170 Query: 171 QWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHS 230 WG+ +E + L+G H A + + ++ L + P + + Sbjct: 171 DWGVLSAEGCRRTIADLLAGMHSAGFAEVARGPDGDAMLTRLAELTGLPLERVR-ATLQP 229 Query: 231 SGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNS 290 + + A+LIWA +L+ ++ ++ AF G +E AW I+ ASR AH LF S ED+ +N Sbjct: 230 ANRRPARLIWAWDLARVVPLARKAFMAGLFDEAQAWDAILSASRPAHALFASVEDFYENY 289 Query: 291 QMGFLYWHIC----CYRRKLTDAELEACYRYDKQFWEHY 325 ++G +W R + DA L++ + W+ Sbjct: 290 RIGHAFWSNDYQGARTRAERIDAFLQSQLPVRRAVWQPL 328 >UniRef50_A5CUH0 Putative uncharacterized protein n=1 Tax=Clavibacter michiganensis subsp. michiganensis NCPPB 382 RepID=A5CUH0_CLAM3 Length = 348 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 61/362 (16%), Positives = 123/362 (33%), Gaps = 37/362 (10%) Query: 6 LQNHPGSEKYSFNGWEIFNSNFERMI------KENKAMLLCKWGFYLTCVVAVMFVFAAI 59 LQNHPG ++ + + K +A+L+ + + A Sbjct: 3 LQNHPGEVEFPVRARVRYARMLDAQRLRPGGGKGTRALLVTALALPFGAAGVALGILVAT 62 Query: 60 TSNGLNERGLITAGCSFLYLLIMMGL-IVRA---GFKAKKEQLHYYQAKGIEPLSIEKLQ 115 + + + + M+ IV +++QL Y I P+++E+ Q Sbjct: 63 SGEPEGGPAMPIVLFALGMGIGMLVASIVFQQIDARAPRRDQLDYVAQARIRPVTLEEQQ 122 Query: 116 ALQLIAPYRFYHKQWSETLEFWPRKPEP----------GKDTFQYHVLPFDSIDIISKRR 165 L L A + W+ +L F P E G + ++ LP ++ ++ R Sbjct: 123 LLALDAVSDYSFGGWNSSLAFQPTWAEMPAELRTTHADGANGHEWVGLPMTTL---AQHR 179 Query: 166 ESLEDQWGIEDSESYCALM-EHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYIS 224 +L+ Q+ I + + + G A + + E E++++ + I Sbjct: 180 AALDTQFRIASRDDIELFVADALTQGPQSARFAELAVSEEAERMVSRMAALTGRSEFEII 239 Query: 225 DCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEE 284 D G+ L+ A + I A+ G + + AW I + ++ + Sbjct: 240 DLTRPHDGRPPV-LLLAGDSERTIGAIRYAYMAGYLPADDAWALIRQIGARVFATYDGWD 298 Query: 285 DYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSVK 344 Y + + + R DA +++ R WP VPW ++ Sbjct: 299 AYWADVSLALAF------RTDSLDA-VQSQRRVRDAL-----VASAWPAATVPWPGAATP 346 Query: 345 YS 346 S Sbjct: 347 RS 348 >UniRef50_D1QRT4 Putative uncharacterized protein n=1 Tax=Prevotella oris F0302 RepID=D1QRT4_9BACT Length = 302 Score = 201 bits (511), Expect = 3e-50, Method: Composition-based stats. Identities = 55/321 (17%), Positives = 107/321 (33%), Gaps = 46/321 (14%) Query: 30 MIKENKAMLLCKWGFYLTCVVAVMFVFAAITSNGLNERGLITAGCSFLYLLIMMGLIVRA 89 M K+ L+ GF A+ F + + + G++T + +++ + R Sbjct: 1 MKKKIIFCLIAMLGFLSQQAFALRFRVRVPSGSSESSNGVVTWIVYAVIAVVVAVTLYRY 60 Query: 90 GFKAKKEQLHY---YQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKD 146 FK + + + LS E+ + + L A Y K + T++ K Sbjct: 61 FFKIRGFLRQFKGDFLMDESSSLSKEQQRKMLLGAVYAVIDKGYLNTIKTGLEK------ 114 Query: 147 TFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHF---LSGDHGANTFKANMEE 203 +R + LE W I +S + + D+ N +A + Sbjct: 115 ---------------EEREDRLEKDWNICTHDSAVDALNGLKIACTKDYSPNIGEA-FKL 158 Query: 204 APEQVIALLNKFAVFPSDYISDCANHSSG--KSSAKLI----------------WAAELS 245 ++ I + + CA K L+ A E + Sbjct: 159 KEQKAIEKYLRETFVNPNDAKACAKQIERAFKHIGNLVKEGIVRDEAEFSRIGGVAFEAT 218 Query: 246 WMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRK 305 +++I+ ++ I E+ W Y+ A +AH+ S EDY K+ +G W Y Sbjct: 219 RLVAIARMCAESKYISEQEMWEYVDFADEQAHKSLTSWEDYGKSYVIGDCLWGADSYDLG 278 Query: 306 LTDAELEACYRYDKQFWEHYS 326 + + K W+ + Sbjct: 279 QSSKIIRKLINDPKSPWKLFP 299 >UniRef50_Q5LDR8 Putative uncharacterized protein n=6 Tax=Bacteroides RepID=Q5LDR8_BACFN Length = 262 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. Identities = 42/277 (15%), Positives = 85/277 (30%), Gaps = 45/277 (16%) Query: 72 AGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEP---LSIEKLQALQLIAPYRFYHK 128 + +I++ + R + A ++ + +Q I P L+ + + L + + Y Sbjct: 6 WMLWIVTPIIIVYYLYRHVWPAVRKFIRLFQGIRINPRSHLTEAEYKKLSVGSLYALQQG 65 Query: 129 QWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFL 188 + +L ++DI K L D WGI +++ +E+ Sbjct: 66 AYLNSL----------------------TLDIKDKLPTILADWWGICNAQDAKQTLEYLG 103 Query: 189 SGD------HGANTFKANMEEAPEQVI--------------ALLNKFAVFPSDYISDCAN 228 H F + EEA +++ L+ + + Sbjct: 104 KKGFAYYFPHVYQAFLLDDEEAKDRIFQQHMDSQEDYDKAVEQLHNLEDCYDELLECGTI 163 Query: 229 HSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQK 288 + + + ++ + I E+ AWHYI A H F S D+ Sbjct: 164 TCREDLLRYGVTGWDAGRLNFMARACYDMKYISEDEAWHYINHAYEMVHSHFSSWHDFAM 223 Query: 289 NSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHY 325 + +G W E + +K W Sbjct: 224 SYVIGRALWGGKSASNSGMMYMAEDLLKSEKSPWTKI 260 >UniRef50_C8UAU2 Putative uncharacterized protein n=3 Tax=Escherichia RepID=C8UAU2_ECO10 Length = 520 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 46/280 (16%), Positives = 94/280 (33%), Gaps = 36/280 (12%) Query: 75 SFLYLLIMMGLIVRAGFKAKKEQLHYYQAK----GIEPLSIEKLQALQLIAPYRFYHKQW 130 + L++ + ++ G+ ++A I+ + + A+ + APY Sbjct: 224 VAIILILALVAVLWIGYGLVFAGHRLFKASCKDPSIQKIPAAERWAMAVGAPYAIAGN-- 281 Query: 131 SETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSG 190 W R D ++ + ESL D WG+ D +S + Sbjct: 282 ----NHWAR--RVVNDD------SVEAENDCKSEVESLADMWGVFDRDSLLEQLLALFVA 329 Query: 191 DHGANTFKA--NMEEAPE-------QVIALLNKFAVFPSDYISDCANHSSGKSSAKLI-- 239 H + + N E P+ Q +A K + + + + I Sbjct: 330 GHRSVYAEQIKNDSEMPDAEYRAFAQQLAQNAKSSSEAKERLWQLQMVRKNRRKICDIDF 389 Query: 240 WAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHI 299 A ++ + + ++ Q G I + + MLA+R+ + + S + + + YW Sbjct: 390 CAWDMVRFVMLCNSGAQVGYITQREMVDFSMLAARRVQQHYRSWRELAGHFLLARWYWKA 449 Query: 300 CCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWG 339 +R L L + K ++ P R +PW Sbjct: 450 -TDKRHLITHIL-----FKKAITSLLKEQGS-PWRTLPWD 482 >UniRef50_C9LQ50 Putative uncharacterized protein n=1 Tax=Dialister invisus DSM 15470 RepID=C9LQ50_9FIRM Length = 259 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 40/258 (15%), Positives = 77/258 (29%), Gaps = 41/258 (15%) Query: 86 IVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGK 145 +++ K ++ K + PL + L + A + ++LE K E Sbjct: 24 LLKIISHKKYQKNPL--GKKLSPL---QQSVLNIGAVNAEQTMFYCDSLETGSEKEEI-- 76 Query: 146 DTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKA------ 199 R SL + I D ES +E L H Sbjct: 77 -------------------RNSLAAYYDIIDEESALHTLEWLLERGHRVYFDAIKLFSAG 117 Query: 200 ------NMEEAPEQVIA---LLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISI 250 + P++ + + I S + + A ++ ++ I Sbjct: 118 ISPSITDEILTPDERLDTPRYMKNMKEMIESLIEKGYISSQADLQNQSVLAWDMGRLVLI 177 Query: 251 SSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAE 310 + F+ G I EE AW+Y+ A +K ++ +++ +G W Sbjct: 178 ARCCFECGYITEEKAWYYMEEAHKKCCAVYGDWKEFAAGYVIGRCMWGGMKQMPGGIMGI 237 Query: 311 LEACYRYDKQFWEHYSKK 328 E R + W+ Sbjct: 238 AEGLLRDPESPWQKARLH 255 >UniRef50_C9Q0E0 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9Q0E0_9BACT Length = 298 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 41/278 (14%), Positives = 77/278 (27%), Gaps = 48/278 (17%) Query: 76 FLYLLIMMGLIVRAGFKAKK------EQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQ 129 +++L + + ++ + K H Y+ PL+ ++ + + L A Y Sbjct: 39 YIWLGLTLFILYKIIRHRDKIVGLYKLMFHVYRLTPNNPLTTDQQRKILLSAIYS----- 93 Query: 130 WSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLS 189 + ++ + R LE WGI D S + Sbjct: 94 ----------------QQQCSKLDSLNTKLSTAYRTGMLEGGWGITDRASAIETLNFLKD 137 Query: 190 GDHGANT------FKANMEEAPEQVIALLNKFAVFPS---DYISDCANHSSGKSSAKLIW 240 H + +EA I + + ++I + K+I Sbjct: 138 EGHRYYFPYVVKALQQPTQEALNNYIKDIIQNEDRAERAVEFIQNVFYSLDELKKWKVIG 197 Query: 241 -----------AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKN 289 A + + I ++ I EE W Y+ A AH S DY + Sbjct: 198 SVDEFINIGVDAWDSGRLSFIVRLCYEAKLISEEETWQYLDAADDIAHNTLSSWNDYSTS 257 Query: 290 SQMGFLYWH-ICCYRRKLTDAELEACYRYDKQFWEHYS 326 +G W+ K + W Sbjct: 258 YILGRAMWNKTSADGSKFMFEVGQHLLTKKTSPWLKIP 295 >UniRef50_Q2S9A8 Putative uncharacterized protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S9A8_HAHCH Length = 394 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 47/239 (19%), Positives = 72/239 (30%), Gaps = 33/239 (13%) Query: 108 PLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRES 167 PLS +L AL + A D V + + I +E Sbjct: 171 PLSDSQLWALAVPAILT--------------EFNRARHDLLGSEV---TTPESIQSWKEG 213 Query: 168 LEDQWGIEDSESYCALMEHFLSGDHGANTFKAN---MEEAPEQVIALLNKFAVFPSDYIS 224 L+D W + D ES ++ L G H A K M+ EQ A Sbjct: 214 LKDWWSVTDRESLLETLDWLLEGGHRAGFNKLREQVMQLDSEQYQAAWEANEDEELRANM 273 Query: 225 DCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEE 284 S I + ++ IS+ + G + EE AW I+ A+R ++ S Sbjct: 274 KIIRRYSNALGEPGIASWDIGRYISLCRWGYLVGMLTEEEAWERILHAARAVQGMYHSWR 333 Query: 285 DYQKNSQMGFLYWHICCYRRKLTDAE--LEACYRYDKQFWEHYSKKCRWPIRNVPWGAS 341 +G +YW +T L +W +PW Sbjct: 334 QMGLGYVVGRMYWQSDASDEHMTKFFNLLRRQTVSPDSYW-----------VRLPWDLD 381 >UniRef50_C7M7R2 Putative uncharacterized protein n=1 Tax=Capnocytophaga ochracea DSM 7271 RepID=C7M7R2_CAPOD Length = 240 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 77/229 (33%), Gaps = 37/229 (16%) Query: 70 ITAGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQ 129 + G FL+ L + V +K+ H Q PL+ E+++ L A +Y + Sbjct: 7 VIIGAVFLFALKIYVNKVYTKKHLQKQIGHINQ----NPLTEEQIRLLTFGAILTYYRGE 62 Query: 130 WSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLS 189 ++L +I+ ++ L QW I D+ S + L+ Sbjct: 63 ---------------------NLLNLIPTEILETYQKGLRRQWEITDTASAKETISDLLA 101 Query: 190 GDHGANTFKANMEEAPE--QVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWM 247 + +PE ++ + K + + K +A +L Sbjct: 102 QKRSLQFRHLLTQTSPELSKIQKQIAKGLGIELAQVE----------AVKSAYAWDLCRA 151 Query: 248 ISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLY 296 S++ + I E W + S A E ++ ++Y + +G Sbjct: 152 ASLAKWCYWCQYITETEMWDILQKVSEIAKEQGKNWQEYTISFLLGRTI 200 >UniRef50_B0MZC9 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MZC9_9BACT Length = 342 Score = 153 bits (386), Expect = 9e-36, Method: Composition-based stats. Identities = 45/230 (19%), Positives = 81/230 (35%), Gaps = 32/230 (13%) Query: 104 KGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISK 163 KG L+ E+ + L AP Y+ ++LE S Sbjct: 131 KGDNSLTPEQNKLLAYGAPLFLYNDDNVDSLE---------------------STAGTDT 169 Query: 164 RRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSD-- 221 +E LE+ W + D +S + L+ A + E ++ I + + D Sbjct: 170 LKEMLEEWWEVTDRKSALETISWLLNEGQHAG-ADPALAEIRQRGIEAITEEEKADEDSK 228 Query: 222 -----YISDCANHSSGKSSAKL---IWAAELSWMISISSTAFQNGTIEEELAWHYIMLAS 273 I++ + + A L + A +L ++++ AF G I E+ W I + Sbjct: 229 IGDAFTIAEFVMGVNETTEADLPETVLAWDLVRAVNMARWAFICGYINEDEMWEAIRTTA 288 Query: 274 RKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWE 323 A E F S E+Y + +G W + D + A + W+ Sbjct: 289 GIAKESFSSWEEYGNSFAVGRGIWRGETDDYETADEVVGALLNKEDSPWK 338 >UniRef50_C6PMJ3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PMJ3_9CLOT Length = 222 Score = 150 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 68/223 (30%), Gaps = 23/223 (10%) Query: 109 LSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESL 168 LS +L + L + + DT + ++I R + Sbjct: 10 LSEAQLWMIALSSVLGEQNDYR--------------HDTLCGCE---KTPELIEDNRHMM 52 Query: 169 EDQWGIEDSESYCALMEHFLSGDHGANTFKAN---MEEAPEQVIALLNKFAVFPSDYIS- 224 + WGI + + L HG + + + + LN+ S YI Sbjct: 53 KRDWGISNKADLLGSLNWLLKDGHGIDFLQERYFFSTLSETEQNTYLNRLDKNSSKYIQY 112 Query: 225 DCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEE 284 + + + A + I + I E AW+ ++ ++ A + + S Sbjct: 113 SLIKNYDKITPNAGVIAWDYGRYIFLCRDGVFLNYISSEEAWNLMLKVAKLAQKAYSSWR 172 Query: 285 DYQKNSQMGFLYW--HICCYRRKLTDAELEACYRYDKQFWEHY 325 +Y G W ++ + ++++ + W Sbjct: 173 EYGLAYIAGRQTWLKNMSADSAEEQVSKIKNLIIDKESPWNTL 215 >UniRef50_D1QRT3 Putative uncharacterized protein n=1 Tax=Prevotella oris F0302 RepID=D1QRT3_9BACT Length = 301 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 46/321 (14%), Positives = 99/321 (30%), Gaps = 47/321 (14%) Query: 30 MIKENKAMLLCKWGFYLTCVVAVMFVFAAITSNGLNERGLITAGCSFLYLLIMMGLIVRA 89 MIK+ +LL F A F A + T + + + + + Sbjct: 1 MIKKTTTILLPLLCFTAQYAEA--FKITAKRAEAPVSSSYTTLWFILAFGGVFLLITLYN 58 Query: 90 GFKAKKEQLHYY----QAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGK 145 + K + + + L+ E+ + + L Y + ++ + Sbjct: 59 HWSKVKMLFRMFGGSFRMEKDTSLTEEQQRKMLLSGIYSVQKSSFMNVIKTGMGR----- 113 Query: 146 DTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANME--- 202 +RRE L +GI SE+ ++++F + + Sbjct: 114 ----------------EERRELLSRAYGITGSETAKDMLDYFKNTGSRRFFPQVAEALKL 157 Query: 203 ---EAPEQVIALLNKFAVFPS---DYISDCANHSSGKSSAKLIW-----------AAELS 245 A +Q + + + + + ++I A + Sbjct: 158 KNKPAIQQYLNDTFEDSEEARNCWEQVQFAFESVEPLMKEQIIRDENDFIRIGPDAWDAG 217 Query: 246 WMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRK 305 ++ ++ + G I EE W+Y+ A AH S ED+ K+ +G W + Sbjct: 218 RLVFVARMCAEEGWITEEELWNYVDAADEIAHRTLTSWEDFGKSYIIGRCLWCGTANYFE 277 Query: 306 LTDAELEACYRYDKQFWEHYS 326 + + Y K W+ + Sbjct: 278 VMAGYAKKMYTNPKSPWKTFP 298 >UniRef50_D1PWQ7 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PWQ7_9BACT Length = 235 Score = 148 bits (372), Expect = 3e-34, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 77/228 (33%), Gaps = 29/228 (12%) Query: 71 TAGCSFLYLLIMMGLIVRAGFK--AKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHK 128 + + +LI+ ++ + AKK + + PL+ EK + L A Sbjct: 4 KIIVAIVAVLIIAFIVFEIVSRLIAKKNLKAFLASHPQTPLTEEKKRLLVFGAILS---C 60 Query: 129 QWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFL 188 SE + L + D +++ + L++QW I E + L Sbjct: 61 YRSEDI------------------LSIITDDNMNEYKTGLQEQWSINGREDALETLNALL 102 Query: 189 SGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMI 248 + + + + A L + +D + + +A ++ ++ Sbjct: 103 NLEQST---EVDEVLAQRGSSEELIELQTLIADGLKTDLAQVR---TTTSTYAWDVCRLV 156 Query: 249 SISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLY 296 S++ + I E W Y+ + KA L + DY + MG Sbjct: 157 SLAKWCYWLQYISEAEMWKYLNEGAVKASSLGKDWNDYTVSFLMGRAI 204 >UniRef50_A5FIQ3 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FIQ3_FLAJ1 Length = 306 Score = 145 bits (364), Expect = 3e-33, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 70/225 (31%), Gaps = 37/225 (16%) Query: 106 IEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRR 165 IEP+ +K + L + + Q+ F+ S + Sbjct: 117 IEPVPDDKKELLSIGSI-----------------------ILHQHPTATFEINGYKSDLK 153 Query: 166 ESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISD 225 E LE+ W I D E ++E H + + + + + Sbjct: 154 EMLENAWNITDHEDAVEILEWLKDEGHRGEDAEIDEDTEDD--------IKEYEQALAPA 205 Query: 226 CANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEED 285 + + I + ++ + S++ + G I+++ Y+ A + A E + + + Sbjct: 206 LKLNIDEDGNPTDIDSWDIERIGSVARYCYAAGYIDQQTCLQYLETARKMAKERYNNWSE 265 Query: 286 YQKNSQMGFLYWHICC--YRRKLTDAELEACYRYDKQFWEHYSKK 328 Y + G + + + L + K W Y K Sbjct: 266 YAASFMTGRAFMYGGSPIDFATVILEMLSS----KKSIWNTYPLK 306 >UniRef50_D0GJQ3 Putative liporotein n=1 Tax=Leptotrichia goodfellowii F0264 RepID=D0GJQ3_9FUSO Length = 225 Score = 143 bits (360), Expect = 9e-33, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 75/231 (32%), Gaps = 31/231 (13%) Query: 105 GIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKR 164 G + L+ E+ + A ++ F Y LP D D Sbjct: 19 GKKKLTKEQQHKMVYGAVL-------------------LTRNDFPYDRLP-DKKDCDKCS 58 Query: 165 RESLEDQWGIEDSESYCALMEHFLSGD-HGANTFKANMEEAPEQVIALLNKFAVFPSDYI 223 E L W I D +S ++ L G ++P + + A + I Sbjct: 59 VEVLSRDWDITDKKSATETLDILLDEGTRGEVDLILPELKSPNALSGEYAEVAA-TYNNI 117 Query: 224 SDCANHSSGKSS-----AKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHE 278 D + G + K I A + +I+++ A+ G I EE W YI KA Sbjct: 118 RDALVNDYGYTKEEVDNVKTISAWDYDRLINVARFAYDAGYITEEEMWSYIDKTVVKARN 177 Query: 279 LFESEEDYQKNSQMGFLY-WHICCYRRKLTDAELEACYRYDKQFWEHYSKK 328 ++S + Y +G + K+ + + + +S K Sbjct: 178 DYDSWKSYFAGVMLGRGITFSNNFSENKIV---ADKLLKDKNSPYNKFSFK 225 >UniRef50_C7PJR4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJR4_CHIPD Length = 286 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 63/197 (31%), Gaps = 11/197 (5%) Query: 142 EPGKDTFQYHVLPFDSIDIISKR--RESLEDQWGIEDSESYCALMEHFLSGDHGANTFKA 199 E +Y + PF D ++ + +L + W I + + L H A + Sbjct: 91 EMQAAYSRYFIFPFKPGDAAGEKDAKATLAEYWDIHNVAGLEKSLTWLLDEGHQAQYAQY 150 Query: 200 NMEEAP-EQVIALLN-------KFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISIS 251 A LN K I +H + SSA I A +L+ I+ Sbjct: 151 RKVLDENGGASADLNTLDLNKYKLTKEDLAGIQFIKDHYTSFSSA-GIKAWDLARYINNI 209 Query: 252 STAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAEL 311 A+Q G A ++M A + A + + Y + +G +W E+ Sbjct: 210 CVAYQAGYFNRGEAMAWLMKAPQVAQARYSDWKAYFNDFLLGREFWGGGEADNARFKEEV 269 Query: 312 EACYRYDKQFWEHYSKK 328 + + K Sbjct: 270 TGMLEGKYSIYSYMPVK 286 >UniRef50_C6DJA3 Putative uncharacterized protein n=4 Tax=Pectobacterium RepID=C6DJA3_PECCP Length = 234 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 70/252 (27%), Gaps = 49/252 (19%) Query: 110 SIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 + + L AP +Y FD SL Sbjct: 3 PEYQRWLMALSAP--------------------MVALNIKYGARFFDPTFYEPGETISLS 42 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANH 229 + WGI E +++ G H L + S++ CAN Sbjct: 43 NSWGITSREGLISMINDMTDGGHAERLA---------YYYHLWHHLTA--SEWQQHCANQ 91 Query: 230 SSGKSSAKL-------------IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKA 276 S A + I A +L M +S NG I E+ + +A Sbjct: 92 SEEAQGALMLVTETAALCGEGGIRAWDLGRMSFLSRVGLLNGWISEKENLWIHTRLADRA 151 Query: 277 HELFESEEDYQKNSQMGFLYW----HICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWP 332 + S E+Y +G YW ++ + Y Q Y+ P Sbjct: 152 RYYYRSWENYYAAFLIGRTYWLSSDEEDPECQRYIFSNASQNPDYIDQIGTLYT-HPDCP 210 Query: 333 IRNVPWGASSVK 344 I ++ W ++ Sbjct: 211 IHDLDWDVDPIE 222 >UniRef50_D1AQB9 Putative uncharacterized protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AQB9_SEBTE Length = 251 Score = 141 bits (354), Expect = 5e-32, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 73/225 (32%), Gaps = 27/225 (12%) Query: 108 PLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRES 167 L+ E+ AL A + + LE + + + Sbjct: 50 KLTKEQENALVFGAVLTTRNDMSFDDLE---------------------AEEYKDASIQV 88 Query: 168 LEDQWGIEDSESYCALMEHFLSGDHGAN----TFKANMEEAPEQVIALLNKFAVFPSDYI 223 L W + D+ + +E L+ H A + EA E+ + + + Sbjct: 89 LRSAWKVTDTATAKETLESLLAEGHRAEGDPMLTELRTPEAAEKNTEEFQAYEDVKKNLV 148 Query: 224 SDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESE 283 + + K I A + +++++ + G I E+ W YI A +A + S Sbjct: 149 DNYGYTAEQVDGIKTISAWDYDRLVNVARFSHSAGYITEQEMWDYINKAVTQAKNDYNSW 208 Query: 284 EDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKK 328 E+Y +G + + A+ + + ++ + K Sbjct: 209 EEYFAGVMLGRTLVYGQPFADS--KAQADKLLKDADSVYKTHPFK 251 >UniRef50_B1KGR1 Putative uncharacterized protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KGR1_SHEWM Length = 233 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 53/193 (27%), Gaps = 9/193 (4%) Query: 139 RKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFK 198 E D +H S + I + + WGIE E Y +++ H + + Sbjct: 30 EMNELRHDVLHHHG---TSDEDIKDLKHMMMRDWGIETREEYLNMLKWLREEGHNRSYMQ 86 Query: 199 ANM---EEAPEQVIALL-NKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTA 254 + + A + + + I A + +SI Sbjct: 87 MQDHLNTLSESAIDAYIDAHSHNVDRQSCLQLVRNYRHTLNIGGIGAWDDGRYVSICRWG 146 Query: 255 FQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAE--LE 312 G E+ W I S + + ++S + + G +W D + Sbjct: 147 ASLGLFSEDECWEKIKQISLRVQQSYDSWHSFALSYIAGRQFWRNDATESFAKDEMDVVR 206 Query: 313 ACYRYDKQFWEHY 325 K W Sbjct: 207 RLTGDPKSPWNTL 219 >UniRef50_Q6MI61 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MI61_BDEBA Length = 256 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. Identities = 46/256 (17%), Positives = 80/256 (31%), Gaps = 34/256 (13%) Query: 103 AKGIEPLSIEKLQALQLIAPY-----------RFYHKQWSETLEFWPRKPEPGKDTFQYH 151 + I P + + + + L AP+ + + E P E D + Sbjct: 1 MQRILPETDLEKKLMSLGAPFIEENQVLDELFQVVGSDLVDGAELAPETREQILDEIGEY 60 Query: 152 VLPFDS---IDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQV 208 D +I LE+ WG+ D ES +E+ + H + Sbjct: 61 FFRLDMNYGPEIKLDCLGILEEFWGVSDRESCQKTLENIRTQGHRTKFNVLRSALPSDGS 120 Query: 209 IAL--LNKFAV-FPSDYISDCANHSSGKSSAKL---------------IWAAELSWMISI 250 I + KF F D D S +KL I A + S I + Sbjct: 121 IDAVSMEKFRQIFRFDLEEDQELQMSDADYSKLALWVQRTNKYLKEPGILAWDASRYIHL 180 Query: 251 SSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAE 310 +F G + + AW I+ + F++ D+ ++ +G +W A Sbjct: 181 VRLSFVAGHLSDIQAWSEILKLAPIVEGRFDNWMDFSQSFLIGRTFWSGADD--PRVKAI 238 Query: 311 LEACYRYDKQFWEHYS 326 E + W+ S Sbjct: 239 CEKLLGHPASPWQFIS 254 >UniRef50_A8RHY8 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RHY8_9CLOT Length = 409 Score = 130 bits (327), Expect = 7e-29, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 64/178 (35%), Gaps = 15/178 (8%) Query: 158 IDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLN--KF 215 D+++ ++SLE+ W + D + + ++ L+ H + +M E I + + Sbjct: 225 DDVMAMEQKSLEEWWSVTDRATADSTLDWILTEGHRDTFAE-DMAYLEEAGIRDIAPNER 283 Query: 216 AVFPSDYISDCANHSSGKS---------SAKLIWAAELSWMISISSTAFQNGTIEEELAW 266 + F D A+ + + I + +++ S + G E+ A Sbjct: 284 SAFLLDQFQMTADEAQNYADMFGFYEQYGPDAIAGWDYCRAMNLMSFYYLAGYYTEQEAL 343 Query: 267 HYIMLASRKAHELFESEEDYQKNSQMGFLYW---HICCYRRKLTDAELEACYRYDKQF 321 + +R LFES +D + G+ YW R D + Y F Sbjct: 344 DKSLEIARTMQPLFESWDDLMSSYMRGYEYWAEESADERRALYEDLKTREDNPYSVDF 401 >UniRef50_B9Y3D4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y3D4_9FIRM Length = 367 Score = 130 bits (326), Expect = 8e-29, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 64/208 (30%), Gaps = 2/208 (0%) Query: 133 TLEFWPRKPEPGKDTFQYHVLPFDS-IDIISKRRESLEDQWGIEDSESYCALMEHFLSGD 191 T E K+ + L + + I + ++ LE WGI + + + Sbjct: 154 TFEILLTAYMSLKNAHEMDGLAMEEDPEFIDQIKDMLESSWGITNPAELTEKLRELTTAG 213 Query: 192 HGANTFKANMEEAPEQVIALLNKFAVFPSDY-ISDCANHSSGKSSAKLIWAAELSWMISI 250 H A + P++++ S A + K I + ++ Sbjct: 214 HQAKYSRYQAAANPQELMDDPEDEEELESVLPCWKLAQYFKDKLPENYILGWDYGRAATV 273 Query: 251 SSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAE 310 + G I EE +W ++ + K ++F+S ++ + G L+W + Sbjct: 274 VRWGYTVGYINEEDSWAWLDQIAEKMIDVFDSWTEFGLSYVFGSLFWIAAFDGEEGISER 333 Query: 311 LEACYRYDKQFWEHYSKKCRWPIRNVPW 338 E + + W Sbjct: 334 FEEGIELLTELLDEDEDGQPGVWAQCAW 361 >UniRef50_C5EQ99 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ99_9FIRM Length = 329 Score = 130 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 69/206 (33%), Gaps = 16/206 (7%) Query: 142 EPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANM 201 G D + D + +E WGI D +S +++ + H K Sbjct: 123 RNGADLYTPGGGTPDEAFYALVIKSVMERDWGIRDRQSAQSMISWLENEGHNQALLKYYE 182 Query: 202 EEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW---------AAELSWMISISS 252 E Q ++ A + S ++ + +A + + + S + + Sbjct: 183 EHDLGQYETDIDLNASWDSGQGEISDGEAARQMAAYMGYRTYGAYAASGWDYSRALMLLG 242 Query: 253 TAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELE 312 + G E A + +K +F S ED+ ++ GF+YW R T+ + E Sbjct: 243 QCYVAGYYTYEEAMDKSLELGKKLQSMFPSWEDFMQSYMYGFVYWS----RSDPTEPQSE 298 Query: 313 ACYRYDKQFWEHYSKKCRWPIRNVPW 338 YR + + P + W Sbjct: 299 FQYRV--SIYHYLDSLEDGPF-KMDW 321 >UniRef50_A5FIQ4 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FIQ4_FLAJ1 Length = 256 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 61/183 (33%), Gaps = 16/183 (8%) Query: 165 RESLEDQWGIEDSESYCALMEHFL--SGDHGANTFKANMEEAPEQVIAL-----LNKFAV 217 ++ L+ W I D S ++ +G H E +++ L Sbjct: 74 KQMLQQYWSISDLNSGMKQVQELTSKNGMHSKEFVDQVKELGIDKMSKQEFETKLAAITD 133 Query: 218 FPSDYISDCANHSSGKSSA-KLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKA 276 P I + + I +L + ++ + G +E A + S++ Sbjct: 134 -PEQKIHLQLLYDAYTDLGYNAILGWDLGRANFLLTSFYVAGFNDENTALDKALEVSKRI 192 Query: 277 HELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNV 336 + F+S ++Y ++ G+LYW + + Y + F K + P V Sbjct: 193 QKTFKSWDEYNRSYMYGYLYWSNEDPKDS------SSKYAERQGFISELKKDTKSPF-QV 245 Query: 337 PWG 339 W Sbjct: 246 KWD 248 >UniRef50_C4DP30 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DP30_9ACTO Length = 359 Score = 124 bits (312), Expect = 3e-27, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 53/186 (28%), Gaps = 40/186 (21%) Query: 154 PFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLN 213 PF S+ R +LE WGI D E A +E H + Sbjct: 207 PFKSLGRTGISR-ALERDWGIRDREGMVAQIESLARDGHREQFAQ--------------- 250 Query: 214 KFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLAS 273 A P Y+ A + + + + F G +EE W ++ + Sbjct: 251 --AGIPGKYL-----------------AWDYARALWMQRMGFILGWFDEEYCWDTMLPLA 291 Query: 274 RKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPI 333 R + + G W K + + ++ + KC W Sbjct: 292 RDVQRHYSGWAEMNHWYLEGRRLWSAAVSDGKPDPVQAQRERTAERLAAD---PKCPWNF 348 Query: 334 RNVPWG 339 +PW Sbjct: 349 --LPWD 352 >UniRef50_A7BBF7 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BBF7_9ACTO Length = 353 Score = 123 bits (309), Expect = 8e-27, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 55/183 (30%), Gaps = 15/183 (8%) Query: 165 RESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANM----EEAPEQVIALLNKFAVFPS 220 + L+ WGI + ES + L H + + IA L+K A Sbjct: 120 KGMLDRDWGITNRESLIRQIYSLLRAGHREDFAALRERCARPSWADTEIARLSKTADSSM 179 Query: 221 D------YISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASR 274 + I ++ G S A +L +++ G + E+ AW + + +R Sbjct: 180 EDWERRWRIRRFLDNDRGIQSLDF-AAWDLIRAANLTRAGAGLGWLSEDEAWDTLAIINR 238 Query: 275 KAHELFESEEDYQKNSQMGFLYWHICCY----RRKLTDAELEACYRYDKQFWEHYSKKCR 330 + S E+ + ++ W L D W Sbjct: 239 ALQFSYSSWEETWEAFRITRWLWAAEGDAQTANNDLHDRNRGEFLVGKNGLWTAIPWNAP 298 Query: 331 WPI 333 +P Sbjct: 299 YPA 301 >UniRef50_D0L2R2 Serine/threonine protein kinase-related protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L2R2_GORB4 Length = 584 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 69/222 (31%), Gaps = 19/222 (8%) Query: 132 ETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGD 191 E L + V R+ L D WGI D+ES ++ G Sbjct: 363 EQLRGLSCGAYFALEATSDPVDDLFVTGSKRHLRKKLRDTWGIVDAESADETVDLLQLGM 422 Query: 192 HGANT--------------FKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSS-- 235 + + + ++++ D + A+ + Sbjct: 423 DAPDYDPTLRTIRNVAGGTPRGALVHERDRILRAAPGLIPSILDTVLTVASSTRDFPEEI 482 Query: 236 AKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFL 295 + A +LS ++ I G ++ ++AW ++ A R+A ++ Y ++G Sbjct: 483 PGSVAAWDLSRLVIIVRYCVFLGYLDPDVAWSIVVDAGRRAAGVYPHWGAYAAGFEVGRA 542 Query: 296 Y--WHICCYRRKLTDA-ELEACYRYDKQFWEHYSKKCRWPIR 334 + + D E+ + + S R P+R Sbjct: 543 LSRAEGDRHPARAADGVFAESRPIILRLLSDPTSPWIRLPLR 584 >UniRef50_Q5Z2V3 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5Z2V3_NOCFA Length = 278 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 46/249 (18%), Positives = 81/249 (32%), Gaps = 42/249 (16%) Query: 109 LSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESL 168 +S ++L+AL L Y + + L P + P D + + ++L Sbjct: 30 ISDDELRALALGGFYSTRWDAFHDALLLGPEREHPLGDRRELAI-------------DTL 76 Query: 169 EDQWGIEDSESYCALMEHFLSGDHGA--------NTFKANMEEAPE---------QVIAL 211 WGI D A ME L G H T N E + Sbjct: 77 TGAWGITDGTEAQASMEQLLDGMHAPLYALVHPLVTASINASERDRFGERADRHRAFLRQ 136 Query: 212 LNKFAVFP------------SDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGT 259 + F S I + I A +L+ +++++ +F G Sbjct: 137 VASFRGMDNPESLVRDYDIWSQAIKIGFTEHLARPLPSDIHAWDLARVVAVARMSFTAGY 196 Query: 260 IEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDK 319 IE ++AW Y+M A A + + + G+ YW C +L + ++ + Sbjct: 197 IEADVAWGYLMRALPLAQRKYRNWRQFGDAYLTGWTYWQACEDLAELKNGGVDRRMELLR 256 Query: 320 QFWEHYSKK 328 + S Sbjct: 257 LWTRPTSPW 265 >UniRef50_C7PJ30 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJ30_CHIPD Length = 280 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 29/276 (10%), Positives = 68/276 (24%), Gaps = 23/276 (8%) Query: 76 FLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQL-IAPYRFYHKQWSETL 134 + ++ IV K K + E+ + + P S Sbjct: 5 IIIGVVAAVYIVFVIIKLMKLNSRAKEIAAKAMKEREQQRDKAVEDEPLISEDGVLSLHD 64 Query: 135 EFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGA 194 + + ++ + R L W I + + ++ + H Sbjct: 65 RQYIACGANLIYLRGERLDTLETDTEQDEIRHMLRRDWHINTRDKLLSTIDGLATRGHRV 124 Query: 195 NT---------------------FKAN-MEEAPEQVIALLNKFAVFPSDYISDCANHSSG 232 + + + + I ++ + ++ G Sbjct: 125 YFKPIWQILTTLPVRERPEALDKLQQDFAAKGDDVPIEQYAANISECYKHLREISDCFEG 184 Query: 233 KSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQM 292 K +L I++ + G + E + YI ++ + S + +N + Sbjct: 185 KKCKLDALTWDLGRAINLCRWGYDAGFLSREESMRYIRKFGKELLHNYTSWANLGENYLI 244 Query: 293 GFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKK 328 GF W + D W K Sbjct: 245 GFAMWTGDIEQLDELHGGHCDLLSEDSSPWVLLESK 280 >UniRef50_B1EFD5 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1EFD5_9ESCH Length = 273 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 60/182 (32%), Gaps = 20/182 (10%) Query: 169 EDQWGIEDSESYCALMEHFLSGDHGANTFK-----------ANMEEAPEQVIALLN-KFA 216 + WGI+D ++ + G H + + E+ + + A ++ K Sbjct: 88 KRFWGIKDISMGMEMIRSLVDGRHNEQFLQEFYNITENVINLDNEQNWQTLFANISDKKL 147 Query: 217 VFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKA 276 + + D I A +LS + + + G I+E+ + + K Sbjct: 148 LIKMRVMHDAFLDFGN----NSILAWDLSRANHLLADYYLAGWIDEQRYMKEVFDVTLKI 203 Query: 277 HELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNV 336 + F S +++ K+ G+L+W + L CY P + Sbjct: 204 QKSFSSWDEFNKSYLYGYLWWSGEDQQHDLKKL---GCYHNRIMIINKQQYVANSPF-KL 259 Query: 337 PW 338 W Sbjct: 260 DW 261 >UniRef50_Q5LIZ2 Putative uncharacterized protein n=5 Tax=Bacteroides RepID=Q5LIZ2_BACFN Length = 235 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 60/215 (27%), Gaps = 18/215 (8%) Query: 132 ETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGD 191 T E ++V + + L GI++ + ++ Sbjct: 21 STKETDLLISLIPSLQEDFYVDSLTTGASKETLSKLLLRNPGIKNETAVIEMIHFLHDEG 80 Query: 192 HGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKS----------------- 234 + E L + I + Sbjct: 81 DRISFSILLPFLVAEYDPKELEEKIRERFFGIELFIRKCNNLHHFITCIKADQTFKIGEE 140 Query: 235 -SAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMG 293 + + A ++ M+ ++ A+ G I+E LAW+YI A ++ + F + K+ +G Sbjct: 141 ELKRGVLAWDMGRMVCLTRIAYDAGFIDESLAWNYICSAGQQCIQAFNDWTEVGKSFLLG 200 Query: 294 FLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKK 328 +++L W+ + K Sbjct: 201 QAMEATEKRKQELYIRLYRQATENPNSPWKKRTLK 235 >UniRef50_B7B9Y1 Putative uncharacterized protein n=2 Tax=Bacteroidales RepID=B7B9Y1_9PORP Length = 245 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 34/263 (12%), Positives = 62/263 (23%), Gaps = 52/263 (19%) Query: 91 FKAKKEQLHYYQ----AKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKD 146 + + ++Q L L+ + L Y + + T E Sbjct: 7 WGRLAKLQSFFQDGLNVDENSHLPEADLRKISLGNLYVYQQQGVLNTFETGVTPSV---- 62 Query: 147 TFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGD-----HGANTF---- 197 R+ L + +GI D +S + H A T Sbjct: 63 -----------------RKVILGEYFGITDRDSAIETLNWLSQAPSQTMFHYAYTAFLQG 105 Query: 198 ------------KANMEEAP--EQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAE 243 + E + L D S + + A + Sbjct: 106 GGNISRKWLNENEELKEHTDFRNDCLEKLETMEEKYPDIEQAGIVVSKEEMGKLGVLAWD 165 Query: 244 LSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYR 303 + IS + I +E I A E++ + +DY + +G Sbjct: 166 AGRLNFISRLCLEQEYIVKEECMQCINAAYEMTKEVYTNWKDYAYSYVLGRTLSMGTTN- 224 Query: 304 RKLTDAELEACYRYDKQFWEHYS 326 E K W + Sbjct: 225 ---MIGLAEDLLTDTKSPWTYIK 244 >UniRef50_B0NPN4 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NPN4_BACSE Length = 116 Score = 104 bits (259), Expect = 4e-21, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 40/88 (45%) Query: 236 AKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFL 295 + + A +L ++++ A+ I E+ WH + +A+ A E F S E+Y ++ MG Sbjct: 23 PQSVIAWDLVRLVNLGRWAYLCDYIREDEMWHIMQVAADTALEHFSSWEEYGRSFIMGRG 82 Query: 296 YWHICCYRRKLTDAELEACYRYDKQFWE 323 WH + +E + + W+ Sbjct: 83 VWHGDPTDSETAYEIVELLLKNGESPWK 110 >UniRef50_A7B4Z3 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B4Z3_RUMGN Length = 465 Score = 103 bits (257), Expect = 8e-21, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 58/219 (26%), Gaps = 13/219 (5%) Query: 125 FYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRES-------LEDQWGIEDS 177 F + + W + + L ++ S L + WGI Sbjct: 58 FVQAEINSDTVEWICAAYAVYTQYNHKTLGVIGGLSDQEKESSQDRIKLTLSEGWGINGR 117 Query: 178 ESYCALMEHFLSGDHGANTFKANMEEAPEQVIA--LLNKFAVFPSDYISDCANHSSGKSS 235 + ++ L H K + + ++ F D + + Sbjct: 118 DDVTEVINKLLIKGHRETYLKTVKKLEKKGLLELSTEEAMTNFSEDDEEFARYQDAHEMY 177 Query: 236 AK----LIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQ 291 + + + S + + + I E + ++K F S E+ + Sbjct: 178 TQYGEHGMDGWDYSRALQVLGDCYLADYINLEECLDLSLPIAKKLQSAFTSWEELADSYI 237 Query: 292 MGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCR 330 G+ +W T ++A + YS Sbjct: 238 YGYAFWQNETADDVETKFRIQAYAELVEMENSPYSVAYD 276 >UniRef50_B8F949 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F949_DESAA Length = 218 Score = 101 bits (250), Expect = 5e-20, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 73/225 (32%), Gaps = 20/225 (8%) Query: 122 PYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDI---ISKRRESLEDQWGIEDSE 178 R +W E G + + F ++ + E L + WG+ + Sbjct: 7 ISRMLGVEWDSLCEAQLWGLAAGGVLSRVNQESFSRLESRRPKEECIEILSEAWGVYEPH 66 Query: 179 SYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKL 238 L + H + + + A F A + Sbjct: 67 HAWGLKYWLENEGHSKECLDVLSGKELGEQPNCGDPEARF------GFAEANREILEKYG 120 Query: 239 IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWH 298 + A + +I + ++ G I + AW +IM +++ + + S E Y + ++GF YW+ Sbjct: 121 LMAWDHGNLIQAARWSYSAGYISSDDAWDWIMSSAKTIQDNYSSWEHYGFHWRLGFEYWN 180 Query: 299 ICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPWGASSV 343 + LT + EA W + W + +PW Sbjct: 181 ---DGQPLTSSFREA------GAWLLMNSASPW--KKLPWDEDLT 214 >UniRef50_C9MPB3 Putative uncharacterized protein n=1 Tax=Prevotella veroralis F0319 RepID=C9MPB3_9BACT Length = 242 Score = 98.4 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 40/257 (15%), Positives = 76/257 (29%), Gaps = 57/257 (22%) Query: 107 EPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRE 166 LS E+L+ + L A Y+ TL S I S+ + Sbjct: 8 SKLSDEQLRRISLSAQYQGQQGGDHFTL----------------------SSKIGSRAKV 45 Query: 167 SLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIA---------------- 210 LE WGI + + C +E L + E E Sbjct: 46 LLEQGWGITNRQELCETIEELLGRCRSLDIAVIKEEMMAEVQEDSGINTEVRRIWSMASI 105 Query: 211 ---LLNKFAVFPSDYISDCANHSSGKSS--------------AKLIWAAELSWMISISST 253 A SD ++ N+ + + S K + ++ + Sbjct: 106 VDKHYITRAGDLSDLLNMLTNYIAAQDSLLANELITSWDAITGKDVIGWDIGRAAYLVRV 165 Query: 254 AFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDA--EL 311 + + + AW Y+ A ++A F++ E+ + +G W R + + Sbjct: 166 GVEMKYLNADQAWDYLERAYQRAISTFDTWEELGHSYIIGRCCWTSHPEERDVLGFCNVV 225 Query: 312 EACYRYDKQFWEHYSKK 328 + ++ + W K Sbjct: 226 KWLLKHPESPWVKVKLK 242 >UniRef50_A6VV45 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MWYL1 RepID=A6VV45_MARMS Length = 242 Score = 98.0 bits (242), Expect = 4e-19, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 25/247 (10%) Query: 105 GIEPLSIEKLQALQLIAPYRFYHKQWS-ETLEFWPRKPEPGKDTFQYHVLPFDSIDIISK 163 EPL+ E+ L ++ + +S ++LE K + II+ Sbjct: 10 PSEPLTAEQKLWLISLSSFLSLPNSYSLDSLENKSEK--------------YTPEAIIAG 55 Query: 164 RRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANME-----EAPEQVIALLNKFAVF 218 + L++ +GI D + ++ F E +++ +K+ Sbjct: 56 NSKVLKNSYGINDKNEFIDTLDIFCGTARSQEFMYLMKEWAKPDLYAKRLFGNRHKYKGT 115 Query: 219 PSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHE 278 + S A S I A ++ + A+ I E+ AW +++ + A E Sbjct: 116 EVEIASVWAEKYRTPLSHCGIQAFDIGRYAFLCRCAYTVSLITEDEAWAFLLRIGKIAQE 175 Query: 279 LFESEEDYQKNSQMGFLYW----HICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIR 334 F S ++ + +G W + T LE + + + P Sbjct: 176 RFTSWYEFATSYTVGRCIWLDINIDGIESTENTFDVLEKIQTESVELEKILADH-EHPWS 234 Query: 335 NVPWGAS 341 + W S Sbjct: 235 ILGWEIS 241 >UniRef50_P77427 Uncharacterized protein ybeU n=128 Tax=Enterobacteriaceae RepID=YBEU_ECOLI Length = 235 Score = 96.1 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 57/189 (30%), Gaps = 7/189 (3%) Query: 156 DSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANME---EAPEQVIALL 212 + L WGI + + + H + + +PE+ AL+ Sbjct: 29 SPKMYTGIKEFELSSSWGINNRDDLIQTIYQMTDDGHANDLAGLYLTWHRSSPEEWKALI 88 Query: 213 NKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLA 272 + Y A ++ I A + M +S N + EE + Sbjct: 89 AGGSERGLIYTQFVA-QTAMCCGEGGIKAWDYVRMGFLSRVGVLNKWLTEEESLWLQSRV 147 Query: 273 SRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQ---FWEHYSKKC 329 +AH + S Y +G LYW + E Y+YD +E + Sbjct: 148 YVRAHHYYHSWMHYFSAYSLGRLYWQSSQCEDNTSLREALTLYKYDSAGSRMFEELAAGS 207 Query: 330 RWPIRNVPW 338 +PW Sbjct: 208 DRFYATLPW 216 >UniRef50_UPI00019694B1 hypothetical protein BACCELL_01818 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI00019694B1 Length = 291 Score = 95.7 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 81/227 (35%), Gaps = 11/227 (4%) Query: 100 YYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSID 159 +++ P++ + QAL + ++ +L + E + + Sbjct: 66 FHRNPQAPPVNRNREQALLIGLMSGEQEFFYTNSLTTGRSREELSR--HLSLAVRLKDKP 123 Query: 160 IISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEA---PEQVIALLNKFA 216 I L++ E S +M +H +A ++E ++ + + Sbjct: 124 NIQLVFNFLKE----EGERSAYNIMISLFLSEHNEKKREALIKERFLGIDRFVQYCRNLS 179 Query: 217 VFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKA 276 F + + + + I A +L ++S++ ++ G + EE AW YI A +K Sbjct: 180 DFLVYIKENNTIEITNEDLQRGILAWDLGHLVSLARVSYDYGLLAEEEAWKYIEFAGKKC 239 Query: 277 HELFESEEDYQKNSQMGFLY-WHICCYRRKLTDAELEACYRYDKQFW 322 E F ++ K+ +G + + ++ L + + W Sbjct: 240 RETFACWKEIGKSFLLGQIMSYPGEENFQEAIRYFLLS-TESLESPW 285 >UniRef50_Q3KF29 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KF29_PSEPF Length = 231 Score = 93.8 bits (231), Expect = 8e-18, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 64/198 (32%), Gaps = 8/198 (4%) Query: 155 FDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEE---APEQVIAL 211 +D R L+ WGI+D +++ H + A P + L Sbjct: 25 YDDPAFCDDRYIDLKGSWGIDDRRQLFDMLQWMTDDGHAKHLSGAYSAWQRCLPNEWQRL 84 Query: 212 LNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIML 271 L + + + + A+ + G I + + M + A +N I+ + Sbjct: 85 LEELSPR-ERVLHEFASRTFGGCGPGGILSWDYGRMGFLLRCAVRNQWIDLAESNWLHSR 143 Query: 272 ASRKAHELFESEEDYQKNSQMGFLYWH--ICCYRRKLTDAELEACYRYDKQFWEHYSKKC 329 + +A + S Y +G +W + E + ++ + ++ Sbjct: 144 LAVRAQFHYGSWMSYFNGFVVGRTFWCCLNTSDDELACELERQGDSVHNLRITRGLAQNI 203 Query: 330 RWPIRNVPW--GASSVKY 345 + ++PW S+ Sbjct: 204 PHFLADLPWHMEIDSLPR 221 >UniRef50_C0FWW7 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FWW7_9FIRM Length = 628 Score = 91.9 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 60/184 (32%), Gaps = 12/184 (6%) Query: 158 IDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVI----ALLN 213 D + RE++ D WGI D +S + L + + E + ++ Sbjct: 448 EDKVEAIRENISDYWGIHDRKSLMKTTDSLLQKGDKYTYAQTLEKLGEEALTLPEDSIYY 507 Query: 214 KFAVFPSDYISDC--ANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIML 271 + ++ D + ++ + A + I + + + G I + + Sbjct: 508 TYKLYAPDEMCKYLGTYYAYNNIGDAGVDAWDYCRCIRLFAFGYICGYISYDEYLIHAAP 567 Query: 272 ASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRW 331 + ++S E ++ G+L + + D +++E + K Sbjct: 568 LAVYLQNEYDSWETMYESYYYGYLIFAG------RNKNSSSSVIYSDYRYYEIMADKTEI 621 Query: 332 PIRN 335 P R Sbjct: 622 PFRT 625 >UniRef50_UPI0001B4FA8B hypothetical protein ShygA5_09679 n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4FA8B Length = 212 Score = 88.0 bits (216), Expect = 4e-16, Method: Composition-based stats. Identities = 35/232 (15%), Positives = 61/232 (26%), Gaps = 53/232 (22%) Query: 107 EPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRE 166 PL+ + + L A S TL + P +D+ S R Sbjct: 26 TPLTSHQRWMVSLAAILAERTPGHSHTL-----------------LYPLKRVDV-STSRG 67 Query: 167 SLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDC 226 L + WGI +E ++ + H A Sbjct: 68 RLSESWGITTTEDLHGVLHRLATTGHRTRMAAAI-------------------------- 101 Query: 227 ANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDY 286 G A +++ I+ +G I+E AW + + S ++Y Sbjct: 102 -----GHPP----LAWDIARYADITRYGLASGYIDEPTAWRLLREVVAPVARTYGSWKEY 152 Query: 287 QKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + G L W + + D + P + PW Sbjct: 153 ADDFMTGRLAWMRALHGTENEDWPVSQEDTARAVQRLVDPMNQDSPWQRTPW 204 >UniRef50_C7PJC7 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJC7_CHIPD Length = 218 Score = 86.5 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 50/169 (29%), Gaps = 5/169 (2%) Query: 129 QWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFL 188 W L + + L S + L+ WGI +S AL+ L Sbjct: 23 GWEPYLNIGAL---LTEGNLRRDSLTLQSRLASKQLTPLLQGAWGIHNSADTKALISDLL 79 Query: 189 SGD-HGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWM 247 + F + + + + + + + + + A ++ Sbjct: 80 TLPVTQKQAFISAEQLTDDGLYQRIKDNCEKAFAQ-HNLYFSKAYFDGVQDLAAWDIERA 138 Query: 248 ISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLY 296 I+ AF G + +E A + + A + + + DY G Sbjct: 139 GLITRYAFNTGWLTQEEALDALKALHKLAKQHYTNWLDYYLGYLKGRTI 187 >UniRef50_C7PXG6 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PXG6_CATAD Length = 247 Score = 85.7 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 70/259 (27%), Gaps = 51/259 (19%) Query: 107 EPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRE 166 L+ + + L L A Y L K Sbjct: 13 PALTARQERGLALGAVYAVGDDLPINALTAGSDPQSAAK--------------------- 51 Query: 167 SLEDQWGIEDSESYCALMEHFLS-GDHGANTF----------------KANMEEAPEQVI 209 LE W + D++S A L G H + E + I Sbjct: 52 VLEQAWDVYDAQSARATYRFLLEEGGHRDVYACVRGYLNAGWDLSRADERARVEQATREI 111 Query: 210 ALLNKFAVFPSDYISDCANH----------SSGKSSAKLIWAAELSWMISISSTAFQNGT 259 + D + + + I A + + ++ +S G Sbjct: 112 PAIAMQRGERPDVALNYFQSAWPSRAMMQGHYPRRIIESIAAWDAARVVHVSRFIVDAGY 171 Query: 260 IEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDK 319 + + AW I A+R + S E++Q G ++W E+ A YR Sbjct: 172 LPADEAWAAIDAAARMVLPEYPSWEEFQLGFLAGRVFWQCNGTFD---AQEVSADYRRYT 228 Query: 320 QFWEHYSKKCRWPIRNVPW 338 + K P + +PW Sbjct: 229 SSGKSLLSKADSPWQRLPW 247 >UniRef50_A8IK38 WosA n=4 Tax=Proteus RepID=A8IK38_PROMI Length = 321 Score = 85.7 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 27/244 (11%), Positives = 72/244 (29%), Gaps = 25/244 (10%) Query: 74 CSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQL----IAPYRFYHKQ 129 S ++ + L+V + + P + + AL + + + Sbjct: 11 SSVGVIIGVFFLVVLFKWLRNFAVQDNKLSTSAMPATTDPESALSTPTTEGSVNQLLPNE 70 Query: 130 WSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLS 189 W + P + + Y+ D L WG+ D + L+ Sbjct: 71 WG----LYVAAPYAVMNEWAYNEYNQGKDDG------GLSAAWGVNDRWDLIYQLFWLLT 120 Query: 190 GDHGANTFKANMEEAPEQVIA--------LLNKFAVFPS-DYISDCANHSSGKSSAKLI- 239 H + ++ + + LL+ + + ++ + + + + Sbjct: 121 QGHTNDFYQLRDQILNGKEEDIQSLKNDILLSDLTENDKNERLWQIDMMNTNRMNIQNVK 180 Query: 240 -WAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWH 298 +L + Q G I ++ A + ++++ +++ ED +N W Sbjct: 181 YLIWDLCRFNKLCLEGCQQGYITQQEAQTWSLMSASMLRRIYDGWEDMWQNFIATRWLWA 240 Query: 299 ICCY 302 Sbjct: 241 SGDQ 244 >UniRef50_Q8RI41 Putative uncharacterized protein FN1795 n=1 Tax=Fusobacterium nucleatum subsp. nucleatum RepID=Q8RI41_FUSNN Length = 279 Score = 84.6 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 75/264 (28%), Gaps = 42/264 (15%) Query: 114 LQALQLIAPYRFYHK----QWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLE 169 Q ++ Y+ Y + F +P ++Y DII K + SL Sbjct: 14 QQISRIDEGYKQYFGLLLSGVLSVINFGKLEPLQSSSDYEYD------EDIIKKMKHSLY 67 Query: 170 DQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPS--------- 220 + W I D +S + L+ H E + K Sbjct: 68 ESWEIYDPKSVFETISWLLNEGHSKKYESLKYTTLDEAIQEKYKKIKKDIETENYSDDVY 127 Query: 221 -----------------DYISDCANHSS-----GKSSAKLIWAAELSWMISISSTAFQNG 258 D I + ++ + K I A ++ + + Sbjct: 128 TTHGFRDKEHYIETLLKDEIKELQHNWDFILAFRGLNVKNIRAWDIGRAAYLVWECYFFD 187 Query: 259 TIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLT-DAELEACYRY 317 +++ A I + A + F + ++ + +G ++W+ ++ E Y Sbjct: 188 YLKKFEAEQLIDTLAEVAAKEFSNFTEFACSYALGRIFWYFSISKKNNINKEMTEIVYEL 247 Query: 318 DKQFWEHYSKKCRWPIRNVPWGAS 341 + F +S N W Sbjct: 248 LEAFEILFSSNDGLWAVNQWWNID 271 >UniRef50_C3BW33 Putative uncharacterized protein n=3 Tax=Bacillus RepID=C3BW33_9BACI Length = 226 Score = 81.9 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 45/131 (34%) Query: 172 WGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSS 231 W IE+S + FL + + P A P AN+ Sbjct: 59 WQIENSTELKEKIIWFLEEGTRQEFNRIRHQLTPLSEAARKQLSKDHPDHEKLYIANYGL 118 Query: 232 GKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQ 291 + I A + +W I + + G + ++ A ++++ A+R + + +Y + Sbjct: 119 HILTDSGIAAFDYAWCICLCRVGRRLGYLSKQEAKNFMIQAARLSQHSYSDWHEYFNAFR 178 Query: 292 MGFLYWHICCY 302 +G + Sbjct: 179 IGSHFNANDTE 189 >UniRef50_Q8A2P3 Putative uncharacterized protein n=10 Tax=Bacteroides RepID=Q8A2P3_BACTN Length = 226 Score = 81.9 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 57/215 (26%), Gaps = 39/215 (18%) Query: 109 LSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESL 168 L K Q L++ + +E + K L Sbjct: 19 LGTRKKQGLRIGYMEAALDGFYLNCMETGVHPEKLSK---------------------LL 57 Query: 169 EDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCAN 228 D++ D+ S C L ++ A+ + + + Sbjct: 58 SDKFHCTDAISSCQLFLFLINEGDRASYSIMVPYLLSTENLNQFENTIRERFYGVDRFIQ 117 Query: 229 HSSG------------------KSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIM 270 + + +++ ++ ++ A G I +E AW YI Sbjct: 118 QGRNLYKFKEYIEERGEPIVWINDLERGVIGWDMAQVVGLARAAKDCGYITKEQAWEYIE 177 Query: 271 LASRKAHELFESEEDYQKNSQMGFLYWHICCYRRK 305 AS E+ + E+ K+ +G + Sbjct: 178 QASTLCSEILRTPEEIDKSFLIGGAMKSNKIEDWE 212 >UniRef50_C5EQL7 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQL7_9FIRM Length = 316 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 55/234 (23%), Gaps = 49/234 (20%) Query: 98 LHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDS 157 Y K P + + A Y K S + + + + +S Sbjct: 129 RQQYLNKKANPYN-DNQVVQWFNATYAILTKHNSCNIRAYGGELLLAGVEGEDDGSSDNS 187 Query: 158 IDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAV 217 I + R+ L WG+ D S A + L Sbjct: 188 --IKERNRKMLSKSWGVTDRASADAALLRLLESGRATG---------------------- 223 Query: 218 FPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAH 277 A + S +S + G + + +R Sbjct: 224 ----------------------SAWDYSRAMSNLGFYYLAGYYPITESLDRSLEVARTIQ 261 Query: 278 ELFESEEDYQKNSQMGFLYWHICCYRR-KLTDAELEACYRYDKQF-WEHYSKKC 329 + + S +++ + G+ W L L+ W KK Sbjct: 262 QTYGSWDEFIASYLAGYHAWAGDEAENRDLIYEGLKGSAFNPYAVDWNLELKKT 315 >UniRef50_Q1IDH3 Putative uncharacterized protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1IDH3_PSEE4 Length = 230 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 46/187 (24%), Gaps = 5/187 (2%) Query: 157 SIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFA 216 + + LE WGI ++ H +A + Sbjct: 27 PDYFEGEDQADLERWWGISTRAQLLDML-SMADNGHATELSEAYWQYQRCLPSQWQALLE 85 Query: 217 VFPSDYISDCANHSSGKS--SAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASR 274 P + A +L M + + G + E + + + Sbjct: 86 TLPPRERIRHQYAARTFPDCGPGGTRAWDLGRMSYLLRAGVKKGLVSREESLYLHYRLAL 145 Query: 275 KAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEAC--YRYDKQFWEHYSKKCRWP 332 +A + + Y G W+ + A LE +++ + Sbjct: 146 RARHYYNRWDSYLAGYLFGKALWNASGSSDEALAANLERQGYEHWNRCILLNLRHGAHAL 205 Query: 333 IRNVPWG 339 +PW Sbjct: 206 FAELPWD 212 >UniRef50_C0ZAT1 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZAT1_BREBN Length = 239 Score = 78.8 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 52/188 (27%), Gaps = 17/188 (9%) Query: 164 RRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKAN---MEEAPEQVIALLNKFAVFPS 220 R ++++W D+ + FL + + + Sbjct: 49 LRLRIDNRWDTGDATKTKERLTWFLEHGRRTEFNQHRHVLSTLSDANRSNYIASLKKGDL 108 Query: 221 DYISDCANH-SSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHEL 279 + + + I A + +W + I Q + + A ++ +++ Sbjct: 109 ESARQLVVYAYMNRLPHAGIAAFDYAWYLIICKAGAQQSYLPKAEAVEGMLDVAKRIQLA 168 Query: 280 FESEEDYQKNSQMGFLYWHICCYRR--KLTDAELEACYRYDKQFWEHYSKKCRWPIRNVP 337 + S E+Y G LY + K T+A + PIR Sbjct: 169 YSSWEEYLFAYACGNLYDEAAASKNTRKATEAHILKLLTGKYS-----------PIREFD 217 Query: 338 WGASSVKY 345 W Y Sbjct: 218 WKFDLTPY 225 >UniRef50_Q639C8 Group-specific protein n=16 Tax=Bacillus cereus group RepID=Q639C8_BACCZ Length = 225 Score = 78.0 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 40/117 (34%) Query: 180 YCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLI 239 ++E L+ + P +D N+S I Sbjct: 64 LKEMIEWLLTEGSRQEFQTMYNQLTPLSEAQRKLLLPRQSNDEKMYVVNYSLHMLPDAGI 123 Query: 240 WAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLY 296 A + +W I +S + G + ++ A +Y++ A++ A + +Y +G + Sbjct: 124 AAFDYAWCICLSRIGKRLGYLSKKEAEYYMIQAAKLAQNSYSDWHEYFLAFHIGSHF 180 >UniRef50_A0AF53 Complete genome n=6 Tax=Listeria RepID=A0AF53_LISW6 Length = 220 Score = 76.8 bits (187), Expect = 9e-13, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 65/217 (29%), Gaps = 23/217 (10%) Query: 107 EPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRE 166 S +K + L + AP + L D + F K + Sbjct: 17 NTFSPDKEKLLCIGAPSTECKHGITTKL-----------DGSHKMLKLFYPKKSEGKTMK 65 Query: 167 SLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDC 226 +GI DS+S ++ ++ E P+ ++ + Y D Sbjct: 66 YWLPMFGITDSQSAVEVISSWIKA------NDFYEEVTPDTTREVIKELTKAAKKYDFDG 119 Query: 227 ANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDY 286 ++ S K A ++ + I F + EE AW ++ F++ +DY Sbjct: 120 SDLIDAASKVKTYGAFDIDRLGYIVRVCFSLNLLTEEQAWSFLKQLQEDTEAHFDNWDDY 179 Query: 287 QKNSQMGF------LYWHICCYRRKLTDAELEACYRY 317 + G Y C K+ +Y Sbjct: 180 MVSYLNGQEGLDTSWYSSACESYIKMKKDSTSLMNKY 216 >UniRef50_C3PJD0 Putative uncharacterized protein n=4 Tax=Corynebacterium RepID=C3PJD0_CORA7 Length = 293 Score = 75.3 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 48/201 (23%), Gaps = 33/201 (16%) Query: 161 ISKRRESLEDQWGIEDSESYCALMEHFLSGD-HGANTFKANMEEA-PEQVIALLNKFAVF 218 + SLE WG+ ++ +++ L G H + + A + L+ Sbjct: 78 AKYYKTSLEQNWGVTGAQEAYQVIDALLEGGQHVEDDLVLPLAYAVKDVPERELDAEVEE 137 Query: 219 PSDYISDCANHSSGKS--------------------------SAKLIWAAELSWMISISS 252 ++I D A ++ + ++ Sbjct: 138 KVEFIKDFFVAQGADPRRGEHKFRYLVRMLRSEGFAKAAAPALPTTTRAWDIIRIHNVGG 197 Query: 253 TAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRR-----KLT 307 A + G I E A F S D + G + W K Sbjct: 198 PATELGWISPEEFLQISDKAVAALQRYFVSWADVAASFWWGRMIWASDGEPDVAAAMKDQ 257 Query: 308 DAELEACYRYDKQFWEHYSKK 328 L + W Sbjct: 258 SQRLTELLAHSDSPWVRVPLH 278 >UniRef50_C5ETY0 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5ETY0_9FIRM Length = 464 Score = 74.9 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 54/214 (25%), Gaps = 48/214 (22%) Query: 119 LIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSE 178 A Y K S T D F Y+ K R+ + WG+ D Sbjct: 298 FNATYAILTK--SNTGNIRAVGGATKVDGF-YNDGSSTDQWYSDKIRQGQAESWGVTDRS 354 Query: 179 SYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKL 238 S ++E ++ + Sbjct: 355 SADQVLERLIASGNATG------------------------------------------- 371 Query: 239 IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWH 298 A + S +S + G E + ++ F S +D+ ++ G+ W Sbjct: 372 -SAWDYSRAMSNLGFYYIAGYYTIEETLDKSLETAKIIQTKFTSWDDFVESYLAGYASWS 430 Query: 299 I-CCYRRKLTDAELEACYRYDKQFWEHYSKKCRW 331 R+ +L+ + + + W Sbjct: 431 GTDASERRNIYEQLKTSAFNPFSLDWNMTLEKNW 464 >UniRef50_C1YTB3 Putative uncharacterized protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YTB3_NOCDA Length = 255 Score = 74.9 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 25/213 (11%), Positives = 56/213 (26%), Gaps = 44/213 (20%) Query: 160 IISKRRESLEDQWGIEDSESYCALMEHFLSGDHGA-------NTFKANMEEAP------- 205 R+ L D WGI D + + + M +AP Sbjct: 51 DAQDERDKLRDSWGITDHAGWDRALRRLTDDARSPTALSIVLDLRSLAMAQAPGWPFDAG 110 Query: 206 --------------------EQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELS 245 E+++ + + + D + + + A + Sbjct: 111 VWPDLIVRWCQERNAAPGLYEELVTAAAQVWEYEERMVRDGVLPPG--TPVRTVRAYDFG 168 Query: 246 WMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRK 305 ++++ G +E A +I+ A ++ S ++ +G Sbjct: 169 RAVNLARWGVNAGYADERAAHAHILRAGAQSMRYHGSWQEMSAGFVLGRAMAFDEGAFGP 228 Query: 306 LTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + A K + P ++PW Sbjct: 229 YYTDSVRAHAVLAKDP--------QSPWLHLPW 253 >UniRef50_C0Z9V0 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z9V0_BREBN Length = 209 Score = 74.5 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 17/135 (12%), Positives = 44/135 (32%), Gaps = 4/135 (2%) Query: 172 WGIEDSESYCALMEHFLSGDHGANTFKAN---MEEAPEQVIALLNKFAVFPSDYISDCAN 228 WG++D+ S + + L + + + ++ Sbjct: 41 WGMKDATSQRSRLTWMLQEGERKEFARLHHFMTALSESGRKEYIDSLESDQERIAKAKVV 100 Query: 229 HSS-GKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQ 287 + A+ I A + +W +S G I +E A + + A+R+ + + + ++ Sbjct: 101 QFYMRRLPAEGIAAYDYTWASFLSRRKGDYGYISKEEARQFKLQATRQTQQAYNNWGEFF 160 Query: 288 KNSQMGFLYWHICCY 302 G+ + Sbjct: 161 TGYIAGYQFMTAQTS 175 >UniRef50_UPI0001B4F4A1 hypothetical protein ShygA5_42835 n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4F4A1 Length = 426 Score = 74.5 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 54/227 (23%), Gaps = 60/227 (26%) Query: 164 RRESLEDQWGIEDSESYCALMEHFLSGDH---------------GANTFKANMEEAPEQV 208 + L++ WGI E + E L D + + Sbjct: 206 EKGRLKEWWGITSREEWRHYQEQLLEADQISGAWEFVLGVRRALSREFGGHVEVDQWRKA 265 Query: 209 IALLNKFAVFPSDY----------------------------ISDCANHSSGKSSAKL-- 238 + + ++Y I A + + L Sbjct: 266 AEKVLRRGAEGTEYRLSADGVTKVGPRDSADVTAQIAGVQRLIGRIARYEKRFRADGLLA 325 Query: 239 -------IWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQ 291 + A + + ++ ++ A I+ ASR + S E++ + Sbjct: 326 EGKFIPTVEAWDYGRAVGMARWGLGARYCDQREAEDAILRASRLGRANYRSWEEFSASYI 385 Query: 292 MGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 +G LEA + P +PW Sbjct: 386 LGRCLHFDEEEFGSWYQDMLEAHRI--------LTTDPTSPWLTIPW 424 >UniRef50_UPI0001B58620 hypothetical protein StAA4_22524 n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B58620 Length = 382 Score = 74.1 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 66/254 (25%), Gaps = 33/254 (12%) Query: 109 LSIEKLQALQL-IAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRES 167 ++ + Q +L A Y + L + + ++VL D +++ R Sbjct: 130 VTAVEQQVERLYTARYGILDGPVAHALACGAHQAVLSAE--PWNVLDARYHDYVAEVR-G 186 Query: 168 LEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIA----------------- 210 L D WG+ D+E + A M+ L + T + +A Sbjct: 187 LRDWWGVADAEGWRAAMDRLLGDSYQLTTGNLVLVLRARSGLALDVYGWVELVRQWCADN 246 Query: 211 LLNKFAVFPSDYISDCANHSSGKSS---------AKLIWAAELSWMISISSTAFQNGTIE 261 A D + + + I ++ + ++ G + Sbjct: 247 DAEDQAGPLVDAVRRIVRYEQRFRADGLLAPDGVVDSIIGWDVGRAVELARWGLAVGYCD 306 Query: 262 EELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELE---ACYRYD 318 A ++ A A S + +G L + Sbjct: 307 ALTAELMVLEAGAIARRYHGSWAELSTGYVLGRLLALDGEEFGPEYVSAARVRHRLLNDP 366 Query: 319 KQFWEHYSKKCRWP 332 W + P Sbjct: 367 ASPWSTLDFEAGGP 380 >UniRef50_Q5WK87 Putative uncharacterized protein n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WK87_BACSK Length = 648 Score = 73.8 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 27/279 (9%), Positives = 73/279 (26%), Gaps = 28/279 (10%) Query: 71 TAGCSFLYLLIMMGLIVRAGFKA-----KKEQLH--YYQAKGIEPLSIEKLQALQLIAPY 123 F++L++++ I E K + K + L Sbjct: 379 AWIYVFVFLIMLIPPIPFHYTALFLIPILWEFGRVALVFLKPAGMFTKIKKRRQAL---- 434 Query: 124 RFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCAL 183 ++ + +E F Y ++ LE W + + Sbjct: 435 -YFRCLSAACIESDFLNHYFTVYDFTYVRDRLI---GKKFLQKVLEK-WEVSSAAELKQR 489 Query: 184 MEHFLSGDHGANT---FKANMEEAPEQVIALLNKFAVFPSDYISDCANHSS-GKSSAKLI 239 ++ + + E + DY + + + Sbjct: 490 IQWLMDVGTRREFDYYLDQLTPLSEEARTRFVQSLTKDDPDYPKFFIANRGIHTLTEAGV 549 Query: 240 WAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHI 299 A + +W I + + G + + A ++ A++++ + + + +Y +G + Sbjct: 550 AAVDWAWSIYLCRVGRRLGWLSKTEANEIMLKAAQQSQQAYLNWNEYFTAFHLGSYFNAD 609 Query: 300 CCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + L + P+ + W Sbjct: 610 DSEHDQYAKHGL--------SIIFTLLIQGNSPLLKLDW 640 >UniRef50_B1W156 Putative uncharacterized protein n=3 Tax=Streptomyces RepID=B1W156_STRGG Length = 392 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 53/206 (25%), Gaps = 39/206 (18%) Query: 164 RRESLEDQWGIEDSESYCALMEHFLSGDHG--ANTFKANMEEAPEQVIALL--------- 212 R+ L D WGI D + ++ L + F E + L Sbjct: 194 ERDQLRDSWGITDHAKWRRQLDVLLEARNSPPEPDFVLRARERLAAGLGELPSADLWRET 253 Query: 213 -----------NKFAVFPSDYISDCANHSSGKSSAKLI---------WAAELSWMISISS 252 D + + S + L+ A + ++++ Sbjct: 254 AAGHAQDLGADADTVKVIEDLVRRITRYESRFRADGLLPPDGRVHTTVAYDYGRAVNLAR 313 Query: 253 TAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELE 312 A I+ A A S ED+ +G + + + Sbjct: 314 WGLSARYCAPADAEAAIVYAGALAKSAHRSWEDFSAGYALGRVL---RFDEEEYGRFYEQ 370 Query: 313 ACYRYDKQFWEHYSKKCRWPIRNVPW 338 + ++ P R++PW Sbjct: 371 NVLAHR-----LLAESEGSPWRHIPW 391 >UniRef50_C8NTP8 Putative uncharacterized protein n=1 Tax=Corynebacterium genitalium ATCC 33030 RepID=C8NTP8_9CORY Length = 276 Score = 71.5 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 54/212 (25%), Gaps = 48/212 (22%) Query: 163 KRRESLEDQWGIEDSESYCALMEHFLSGD-------------------------HGANTF 197 + + LE WGI +E + E ++GD + Sbjct: 66 QYKRMLEQWWGITSAEEARQMTERLIAGDVHTASSDAVLHTAEMLTGEFAPQEWRSQDFA 125 Query: 198 KANMEEAPEQVIALLNKFAVFPSDYISDCANH------------SSGKSSAKLIWAAELS 245 + E + L + +H + S+ A ++ Sbjct: 126 ERRR--TMEIFLEHLAISNWMDPKQLWTDFDHWLAVRTHPSFANVNAPSAPATTRAWDIM 183 Query: 246 WMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRR- 304 + ++TA G I E Y A + + F S D + G W R Sbjct: 184 RVEMTAATAALVGIITPEEYSSYAARAVAELQKHFMSWADAAASFWWGRAIWSADTIRDN 243 Query: 305 --------KLTDAELEACYRYDKQFWEHYSKK 328 ++ D L W + Sbjct: 244 ADDMQGELQVFDHILTEALTDPNSPWRRFPLH 275 >UniRef50_C8NLE2 Putative uncharacterized protein n=2 Tax=Corynebacterium efficiens RepID=C8NLE2_COREF Length = 274 Score = 71.1 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 50/166 (30%), Gaps = 13/166 (7%) Query: 166 ESLEDQWGIEDSESYCALMEHFLS--GDHGANTFKANMEEAPEQVIALLNKFAVFPSDYI 223 +L WG+ +++ + +H A A P + + Sbjct: 100 NALRADWGVRNADQARTRLSQAQQVITEHAAAVLAQRGLPEDAGEFRAKLVRAGAPGELV 159 Query: 224 SDCANHSSGKSSAKLI-----WAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHE 278 D H+ ++I A +++ + +++ + ++ + ++ A Sbjct: 160 DDFLAHTDAAPDPQVIIDADGLAFDIARVANLARWSGYVRYVDPDQCTEHLDALGIAAVA 219 Query: 279 LFESEEDYQKNSQMGFL--YWHICCYRRKLTDAELEACYRYDKQFW 322 +F S +D+ G + + + +E W Sbjct: 220 VFRSWDDFADAFLAGQATRFKGGAKHYTQA----VEWLRTDTDSPW 261 >UniRef50_C1YUR3 Putative uncharacterized protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YUR3_NOCDA Length = 329 Score = 71.1 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 21/198 (10%), Positives = 49/198 (24%), Gaps = 20/198 (10%) Query: 164 RRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAV--FPSD 221 + LE QWG+ D ES ++ L ++ + + L + Sbjct: 118 YQRMLERQWGVTDRESLLEAVDALLEELRTGPKLDLVVDLSAGVARSRLPRERGGQVTGA 177 Query: 222 YISDCANHSSGKSSAKLIWA------------AELSWMISISSTAFQNGTIEEELAWHYI 269 ++ + + + + +I + + + Sbjct: 178 HVRLSGEQVARLRAVTGVAEADETVLIGAYQWWKSVHVIRLVCGGASLDWLSPVETQTLL 237 Query: 270 MLASRKAHELFESEEDYQKNSQMGFLYW-----HICCYRRKLTDAELEACYRYDKQFWEH 324 + + S + G+L W A L ++ W Sbjct: 238 RRVASDLQRRYASWQQLSMAFHAGYLLWPEKGVEGDHGGTDRVWAALGLLTEDERSPWNL 297 Query: 325 YSKKCRWPIRNVPWGASS 342 R +P G ++ Sbjct: 298 LPWDMPLE-RVLPEGGAA 314 >UniRef50_C0XUK5 Putative uncharacterized protein n=1 Tax=Corynebacterium lipophiloflavum DSM 44291 RepID=C0XUK5_9CORY Length = 102 Score = 70.3 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 13/95 (13%), Positives = 24/95 (25%), Gaps = 1/95 (1%) Query: 236 AKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFL 295 + + + IS A G EE + A +A ++ + Y +G Sbjct: 8 PSTLTSWDYCRAAWISRMAHALGWFNEEECAQHHAAALERAQAMYPDWKSYASGWLLGRA 67 Query: 296 YWHICC-YRRKLTDAELEACYRYDKQFWEHYSKKC 329 W + A + W Sbjct: 68 AWSGMVGEDGEGLAALSATLLSHPTSPWLRMPLNP 102 >UniRef50_Q1QY12 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QY12_CHRSD Length = 573 Score = 65.3 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 39/158 (24%), Gaps = 4/158 (2%) Query: 168 LEDQWGIEDSESYCALMEHFLSGDHGA--NTFKANMEEAPEQVIALLNKFAVFPSDYISD 225 L + W I+D + L+ H + + E DY Sbjct: 200 LSEVWSIDDRDELIRLLLWLGGQGHRYTWDLDAQRLTVQGETARRRWQASLGEARDYGHV 259 Query: 226 CANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEED 285 S + A + + ++ + G +E A + A + Sbjct: 260 MLTFLSSGEPLEW-AAWDWLRLADLAYAGWNAGWLERHEAETFAAHAGDLLMRRYRDWTT 318 Query: 286 YQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWE 323 K Q G + R + A W+ Sbjct: 319 VAKAYQRGRSLFEGVDRRAEFAADW-SALLNAASSPWQ 355 >UniRef50_UPI00003826C9 hypothetical protein Magn03000930 n=1 Tax=Magnetospirillum magnetotacticum MS-1 RepID=UPI00003826C9 Length = 204 Score = 64.5 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 54/210 (25%), Gaps = 56/210 (26%) Query: 99 HYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSI 158 Y + L+ + + L L A Y VLP +++ Sbjct: 16 DLYAKEDAPALTPAQQRGLALGAVYAVEG------------------------VLPINAL 51 Query: 159 DIISKRRES---LEDQWGIEDSESYCALMEHFLSGDHGANTF------------------ 197 + + R + L W + ++ E L+ H Sbjct: 52 TVEADARTAAKPLAAGWDVRGADDVERTYEFLLTQGHRGYYAIAMPKVEELYSGRLSRRD 111 Query: 198 ------KANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGK-----SSAKLIWAAELSW 246 + + E L+ + G I A + + Sbjct: 112 AKGAADQHVAQARQEATARGLDPERAVAFYQGWSASAQMGGHGELADPLPPSIAAWDAAR 171 Query: 247 MISISSTAFQNGTIEEELAWHYIMLASRKA 276 ++ +S A G + E AW I SR A Sbjct: 172 VVHVSRLAVDAGFVTPERAWAAIEEPSRFA 201 >UniRef50_B2UWN6 Putative uncharacterized protein n=2 Tax=Clostridium botulinum E RepID=B2UWN6_CLOBA Length = 297 Score = 63.7 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 33/267 (12%), Positives = 75/267 (28%), Gaps = 18/267 (6%) Query: 40 CKWGFYLTCVVAVMFVFAAITSNGLNERGLITAGCSFLYLLIM---MGLIVRAGFKAK-- 94 + ++ + +I + LI + + I+ +VR +K Sbjct: 13 TLLSIIIVVILFYLNEVISIYIGDKIGKLLIGVFLVAVIVNIIKWLAVFVVRFIWKTSTI 72 Query: 95 KEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLP 154 + + + +++ A + + + V Sbjct: 73 SFEKAINKTIHFKCTDEASQFLVKIGALFSITEATLD---KHNKGEKNKKGFINCLEVRD 129 Query: 155 FDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNK 214 + S D SL W I + A +E + N + + + + K Sbjct: 130 YKSKDEKEMILASLGASWNIANKNQLYATLEQLMDK---NNFLDIDFDINKNKWFEKILK 186 Query: 215 FAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASR 274 + + K S+K I A + + + + I E + Sbjct: 187 KHNLQFE-------NLKNKKSSKHIAAFNIQRAVLLLRESLTCDLINLEEFEEFKPKVHN 239 Query: 275 KAHELFESEEDYQKNSQMGFLYWHICC 301 +E F S ED+ + +G Y++ Sbjct: 240 LINEEFASMEDFIIDYLIGVCYFYEEK 266 >UniRef50_B1W155 Putative uncharacterized protein n=3 Tax=Streptomyces RepID=B1W155_STRGG Length = 392 Score = 63.7 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 33/272 (12%), Positives = 65/272 (23%), Gaps = 40/272 (14%) Query: 98 LHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDS 157 + LS + + L H + L + ++ L Sbjct: 129 KRLKAWARADELSGGARRGVLLTDVGGPLHGPLAHGLALGAHLA--VTNGLIWNRLGAAY 186 Query: 158 IDIISKRRESLEDQWGIEDSESYCALMEHF-----------------LSGDHG------- 193 D + R L WGI Y + + H Sbjct: 187 EDYATD-RARLRSPWGIPHRAEYRDRLASLMKNQLVGRVQEAVLRTRHTLAHRLGRTPTH 245 Query: 194 -------ANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSW 246 A F A E L + + +D G+ + A +L Sbjct: 246 EEWSDAVARAFAARDAEDRAAADRALRHVTRYEERFQADGVLAPEGR--VDTLAAFDLGR 303 Query: 247 MISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKL 306 +++ A ++ + A ++ A + S D+ M + + Sbjct: 304 AVNVVRLALGARYVDPQEAEQDVLRLGGLARAAYSSWADFSLGYLMARVVHRAEDDGPEA 363 Query: 307 TDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + + + P RN+ W Sbjct: 364 AEPTYRQSLDEHRVLVQ----DPTSPYRNIAW 391 >UniRef50_Q4K851 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4K851_PSEF5 Length = 219 Score = 63.0 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 40/138 (28%), Gaps = 9/138 (6%) Query: 205 PEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEEL 264 ++V LN I + S A + +A A +E E Sbjct: 84 DDEVYVHLNGTLERQWFRIDLHGLNPSDDPRAAMAFA--CVRSAFFVRCAMLMSWLEPET 141 Query: 265 AWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEH 324 W ++L +++A + F ED+ K +G W L + EA Sbjct: 142 GWRIMLLNAQRAQDCFNDWEDFGKAFMLGRQQWIAAFRADSLGTSFNEAKLSQLLAPGSG 201 Query: 325 YSKKCRWPIRNVPWGASS 342 +V W Sbjct: 202 V-------WASVDWKGLP 212 >UniRef50_Q39LT8 Putative uncharacterized protein n=9 Tax=Proteobacteria RepID=Q39LT8_BURS3 Length = 260 Score = 62.6 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 31/274 (11%), Positives = 78/274 (28%), Gaps = 39/274 (14%) Query: 72 AGCSFLYLLIMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWS 131 G + L+L+ + R + +E A +++ + A++L Sbjct: 6 IGLAVLWLVRYLWRSFRIARRMVREVDADQAAGADAAVTVRQTAAVKLA----------- 54 Query: 132 ETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGD 191 + + ++ + D + I + R D + + Sbjct: 55 -----HTLADQAPPNRRRWSLALADILLIRNGLR---------CDCDDLVYTLPDAQRDK 100 Query: 192 HGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSS-------AKLIWAAEL 244 A + A ++ + + +I G + A + Sbjct: 101 LAAQLRRELDLPADLPEWQIVQRVPAILAGWIRGVGRSHEGFYEQLAAEGRVRDALAFDC 160 Query: 245 SWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRR 304 + + G E+ AW ++L +++A + F+S ED+ +W + Sbjct: 161 ARTAFLVRCIALLGWASEQHAWVVLLLNAQRAQDSFDSWEDFGLAYARARQHWLRGSGQD 220 Query: 305 KLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + R + + ++PW Sbjct: 221 GPASSRATQEVR-------EHLRDPSGNWLSLPW 247 >UniRef50_B5HCC5 Predicted protein n=4 Tax=Streptomyces RepID=B5HCC5_STRPR Length = 104 Score = 62.2 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 12/103 (11%), Positives = 29/103 (28%), Gaps = 8/103 (7%) Query: 236 AKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFL 295 + + + + ++ + A ++ A + ++ S ED+ +G Sbjct: 9 GRSVLSWDHGRAADMARWGLAARYCDPAKAERAVVRAGEVSARVYRSWEDFGAGYAIGRC 68 Query: 296 YWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + E Y + + P VPW Sbjct: 69 L--------HFDEEEFGPWYTEVLDIHKTLTTDPESPWLTVPW 103 >UniRef50_A1ACT9 Putative uncharacterized protein n=36 Tax=Enterobacteriaceae RepID=A1ACT9_ECOK1 Length = 238 Score = 61.8 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 35/115 (30%), Gaps = 7/115 (6%) Query: 224 SDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESE 283 D + + A + ++ G +E AW ++L +++A + F S Sbjct: 122 HDFYEQLAAQGQVLDGLAFDCMRTAFLTRCIAGLGWCDENQAWIVLLLNAQRAQDCFASW 181 Query: 284 EDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 EDY W + T + W K + +PW Sbjct: 182 EDYASAYVRARQKWLMIYD----TPVATANRDLKEVTAWL---KDPSSNWKKLPW 229 >UniRef50_A4QDU6 Putative uncharacterized protein n=2 Tax=Corynebacterium glutamicum RepID=A4QDU6_CORGB Length = 269 Score = 61.8 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 37/267 (13%), Positives = 69/267 (25%), Gaps = 20/267 (7%) Query: 70 ITAGCSFLYLL-IMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIE--KLQALQLIAPYRFY 126 I + + L+ I+M K + A P + A L A Y Sbjct: 9 IILLFAAVILISIVMITAAFKTRKKRFAARAEGMANPTIPAPTVPWQRFAGALAALYA-- 66 Query: 127 HKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALME- 185 +W +T Y F S + + L+ WG++ SE + Sbjct: 67 RPEWHKTRG----AKRVYSAEQTYFG--FVSAMPLGMVQNMLQTDWGVKKSEHAVDQLSK 120 Query: 186 ------HFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLI 239 +G+ N E Q +A + + + Sbjct: 121 GVEVIVGVAAGNWRKNGVSPAQVEEAGQRLAAEGLAHPHFVVFQKQLQHADPNAEYDLDV 180 Query: 240 WAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHI 299 A +++ + ++ A + A + A F S E+Y + G Sbjct: 181 LAFDIARVANLLRWAAYTDLLLPAEARWFQDQLGIAAAVSFGSWEEYGERYVRGLQ--KN 238 Query: 300 CCYRRKLTDAELEACYRYDKQFWEHYS 326 K + W+ Sbjct: 239 FKGGNKPYIEGERWLNTEAESPWKTQK 265 >UniRef50_C7Q7G6 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q7G6_CATAD Length = 419 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 47/198 (23%), Gaps = 38/198 (19%) Query: 166 ESLEDQWGIEDSESYCALMEHFLSGD--------------------------HGANTFKA 199 ESL D WGIE + + ++ L + H T Sbjct: 219 ESLRDWWGIEGAVEWQNQVDALLDSENPQPVDLVLGIRTERGAGVQPSGDPLHDTRTLTE 278 Query: 200 NMEEA--PEQVIALLNKFAVFPSDYISDCA--NHSSGKSSAKLIWA----AELSWMISIS 251 +E L + + + ++ C G I A + ++++ Sbjct: 279 AVEAWCRDRGAPDRLRQEMLDIAQWVVRCETWMRRDGVLPPGAIVATQDSWDWGRCVNMA 338 Query: 252 STAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYW----HICCYRRKLT 307 G + A I A + +G + + T Sbjct: 339 RWGLTCGFCDRTTAEQIIRHAGSLCARAYADWNQLSAAYILGRVVKMGRQGNPEESYRDT 398 Query: 308 DAELEACYRYDKQFWEHY 325 A + + Sbjct: 399 LQIHRALMQDPASPFLTL 416 >UniRef50_Q1QY11 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QY11_CHRSD Length = 237 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 18/193 (9%), Positives = 55/193 (28%), Gaps = 13/193 (6%) Query: 162 SKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFK--ANMEEAPEQVIALLNKF-AVF 218 + RE L+ +G+ +E+ ++ + A ++ + + Sbjct: 32 DELREWLDGHYGLLSAEALKTFLDFLIDAGDRQEYLINYAPYTLNAARLREEIAIIESDE 91 Query: 219 PSDYISDCANHSSGKSSAKLIW------AAELSWMISISSTAFQNGTIEEELAWHYIMLA 272 S+ + A +++ + ++ Q + ++ A Sbjct: 92 CSEDERNHLLRLRRVQENAAGCNDIDMTAWDVAQTVDLAIAGRQMEWLSAAEFDAFLERA 151 Query: 273 SRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYSKKCRWP 332 ++ A + S DY + GF ++ R W + Sbjct: 152 TQLARAHYASWRDYARGLYAGFSFFMGETEERDALLKSFGEAL----AAWLSGAPPLAGA 207 Query: 333 IRNVPWGASSVKY 345 ++ + + ++ Sbjct: 208 WASLDFPGAPARH 220 >UniRef50_C5EGQ2 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EGQ2_9FIRM Length = 342 Score = 55.3 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 48/149 (32%) Query: 160 IISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFP 219 + + L +GI D ES + F AP + L + Sbjct: 65 DAEELKAHLFRLYGIHDRESLEEACMKQYTSGKEYEQFMTFWCGAPLFDLKELGEEGRMA 124 Query: 220 SDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHEL 279 + D A+ + +A +++ I + AF G I EE + KA Sbjct: 125 FEERIDRASMFRPYLEERGFYAWDINERIGLGRKAFACGMITEEEFFGIFDYQIAKAQVF 184 Query: 280 FESEEDYQKNSQMGFLYWHICCYRRKLTD 308 + S ++Y + G +Y+ + Sbjct: 185 YHSFKEYAISCICGAVYFVPENNEEDMLS 213 >UniRef50_D0KK37 Putative uncharacterized protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KK37_PECWW Length = 303 Score = 53.0 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 28/296 (9%), Positives = 78/296 (26%), Gaps = 23/296 (7%) Query: 23 FNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAITS--NGLNERGLITAGCSFLYLL 80 F + ++++ + +G +++++ + N L L L Sbjct: 5 FYKDINQLLRGGLVVAAILFGGEWLKNASLIYLNVDFSLLYNVLFYSFLAIWVFVLLRSC 64 Query: 81 IMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRK 140 + ++ + + + + +++ + Y E+ ++ Sbjct: 65 AKVFFLLSTKKENNVLSVGINRNAKYPCRDEKDALLVRISSLYSITSVGNGESKKY---- 120 Query: 141 PEPGKDTFQYHVLPFDSIDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKAN 200 D + + +L + I S + L + Sbjct: 121 ---PNFINCIEECNLSEPDAVEGIKLALSASYDINTSGGLTRFIADLLDE-------ENY 170 Query: 201 MEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTI 260 ++ + L+ ++ S L A L + + G + Sbjct: 171 SKQRNDDYKKPLSHLYQLAEKTGANLKPVSE---INNLTAAFNLQRAALLMRSGVTCGFL 227 Query: 261 EEELAWHYI-MLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACY 315 E W + R + F S + + + + +H+ LE Y Sbjct: 228 SIEE-WDSLKDDVVRLMEQEFPSIDQFIHDYMLAVYLFHMDGSFSAFM--ILERLY 280 >UniRef50_D0J788 Putative uncharacterized protein n=3 Tax=Comamonadaceae RepID=D0J788_COMTE Length = 235 Score = 51.4 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 42/140 (30%), Gaps = 9/140 (6%) Query: 199 ANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISSTAFQNG 258 + + EQV L + + A + + +A Sbjct: 88 LRTDLSDEQVRQQLPD--ALRQRWFMLDLQRLQRSDDVRAAMAFACARVTFFVRSARLLE 145 Query: 259 TIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYD 318 +E L W + L +++A + F+S + + G W + +D +A + Sbjct: 146 WTDEALHWDILQLNAQRAQQCFDSWLAFGQAYAQGRAQWLA----QGRSDVLGKAFTTEE 201 Query: 319 KQFWEHYSKKCRWPIRNVPW 338 W + P + W Sbjct: 202 VAQWVTQEQH---PWHAMSW 218 >UniRef50_B9CYP9 Putative uncharacterized protein n=1 Tax=Campylobacter rectus RM3267 RepID=B9CYP9_WOLRE Length = 516 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 47/143 (32%), Gaps = 13/143 (9%) Query: 194 ANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIWAAELSWMISISST 253 A + + + + +F SD ++ C + I A + + +I+++ Sbjct: 134 AEFMQIAEDFYKNENLRKFIEFCADTSDILAVCEKQN--------IRAYDYASIIALAII 185 Query: 254 AFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHI-----CCYRRKLTD 308 F++ + ++ + K F +++ + +G LY + Sbjct: 186 GFEDLYLRQKKYEKIVNFYGEKILSEFNGWDEFIASFMLGELYKNSFEKKEDDEDDYEII 245 Query: 309 AELEACYRYDKQFWEHYSKKCRW 331 + L Y ++ + W Sbjct: 246 SNLRLTYNALTLPYDIFQMSGIW 268 >UniRef50_Q47QQ1 Putative uncharacterized protein n=1 Tax=Thermobifida fusca YX RepID=Q47QQ1_THEFY Length = 311 Score = 48.0 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 19/193 (9%), Positives = 46/193 (23%), Gaps = 20/193 (10%) Query: 158 IDIISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAV 217 + + R + + GI D A +E H A ++ + + Sbjct: 112 PQLRGRYRRTFARERGITDRAGLLAEIERLWRELHTAPDTDLLVDLRSGIARSRCADRSA 171 Query: 218 ------FPSDYISDCANHSSGKSSAKLIWA-----AELSWMISISSTAFQNGTIEEELAW 266 + ++ + + SA + +++ + + Sbjct: 172 PDRTVVLTPEQVARLRTVTGAQESADTVVVGAYQWWRAVYLVPLICGGATLNWLSPVETQ 231 Query: 267 HYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFWEHYS 326 + + F G+L + A E + + Sbjct: 232 GLLRRVAADLQPRFGGWSQLSAAFHAGWLL-----DQDGAAAAGAERMWDAL----GLLT 282 Query: 327 KKCRWPIRNVPWG 339 P R +PW Sbjct: 283 TDPASPWRLLPWD 295 >UniRef50_Q894P0 Putative uncharacterized protein n=1 Tax=Clostridium tetani RepID=Q894P0_CLOTE Length = 497 Score = 47.6 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 39/142 (27%), Gaps = 7/142 (4%) Query: 202 EEAPEQVIALLNKFAVFPSDYISDCANHSSGKSS---AKLIWAAELSWMISISSTAFQNG 258 E ++ K ++ A + + M+ I + G Sbjct: 146 VEDVRNHVSRFFKKRKKQLRQLNKFFQQFESLLPFAKNVGFLAFDSARMVDIIGKSVNVG 205 Query: 259 TIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYW----HICCYRRKLTDAELEAC 314 +E E A + +++ E + ++ +G + + K +D + Sbjct: 206 YLEVEDAVPLLDEIGSYIIHTYKNWEIFLASAILGKQFMLFDSGVKSPFIKGSDEYISDI 265 Query: 315 YRYDKQFWEHYSKKCRWPIRNV 336 Y + W N+ Sbjct: 266 YGLVTSPNKPLLISGIWENSNL 287 >UniRef50_B5ZB98 Putative uncharacterized protein n=15 Tax=Ureaplasma RepID=B5ZB98_UREU1 Length = 278 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 74/198 (37%), Gaps = 7/198 (3%) Query: 148 FQYHVLPFDSIDIISKR--RESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAP 205 + L S D ++ R + L + + + + E + A M++ + ++ N + ++ Sbjct: 79 HKSSSLYLTSDDELAHRFKIKVLHNYFNLYNIEEFYAFMDNQIRLENAKNIARLSLIYDL 138 Query: 206 EQVIALLNKFAVFPSDYIS---DCANHSSGKSSAKLIWAAELSWMISISSTAFQNGTIEE 262 ++ +L + I N K S +A ++ I + T +E Sbjct: 139 KKHENILKNTTDTDLENIDLEFAFINFLRAKISINFAYAYDICETILLLRTGHDLRFLEN 198 Query: 263 ELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQFW 322 E + I + + + F + +++ N M Y ++ + + + + Sbjct: 199 EHFSNLIPVFGLQVIKYFSNWKEFLINYVMAVAYHNLDINNPQNVFLATQKFIDHLDELI 258 Query: 323 EHY--SKKCRWPIRNVPW 338 Y +K+ +W + +P+ Sbjct: 259 VDYGLNKRDQWIKKIIPY 276 >UniRef50_A8RZQ8 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RZQ8_9CLOT Length = 342 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 47/143 (32%), Gaps = 8/143 (5%) Query: 158 IDIISKRRESLEDQWGIEDSESYCAL-MEHFLSGDHGANTFKANMEEAPEQVIALLNKFA 216 + + L +GI D ES + ME F SG M + + L + Sbjct: 63 EPASEELKAHLFRLYGIYDRESLEKVCMEQFTSGREYEQF----MTFWCDAPLFDLEELE 118 Query: 217 VFPSDYISDCANHSSGKSS---AKLIWAAELSWMISISSTAFQNGTIEEELAWHYIMLAS 273 +S + +A +++ I + A G I+ E Sbjct: 119 EKGRRAFETRFKRASLFRPYVGERGFYAWDINERIGLGRLACACGIIDRETFDELTDYQV 178 Query: 274 RKAHELFESEEDYQKNSQMGFLY 296 RKA + + +DY + G +Y Sbjct: 179 RKAQVFYHTFKDYAVSCICGAVY 201 >UniRef50_D0WQ83 Putative uncharacterized protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WQ83_9ACTO Length = 547 Score = 42.6 bits (98), Expect = 0.023, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 33/92 (35%) Query: 247 MISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKL 306 ++ + S A+ + I++E A + K L+ E + + +G + Sbjct: 210 VVFLLSAAYSSRLIDDEAAAEALEHYGGKIAALYTGWEQFLASCALGAILAGEPSNGDSN 269 Query: 307 TDAELEACYRYDKQFWEHYSKKCRWPIRNVPW 338 + L A Y + + WP N+ W Sbjct: 270 DGSLLSAVYGFAVSPAPVFEAAEFWPRPNLSW 301 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.116 0.301 Lambda K H 0.267 0.0348 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,558,154,571 Number of Sequences: 3077464 Number of extensions: 50345971 Number of successful extensions: 148071 Number of sequences better than 1.0e-01: 88 Number of HSP's better than 0.1 without gapping: 120 Number of HSP's successfully gapped in prelim test: 14 Number of HSP's that attempted gapping in prelim test: 147805 Number of HSP's gapped (non-prelim): 145 length of query: 346 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 217 effective length of database: 643,403,500 effective search space: 139618559500 effective search space used: 139618559500 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 93 (40.7 bits)