BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (164 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_P76510 Uncharacterized protein yfdN n=31 Tax=root RepID...   342   4e-93
UniRef50_UPI0001BCF2D2 hypothetical protein EscherichiacoliO157E...   156   2e-37
UniRef50_B5FL70 Sb43 n=2 Tax=Salmonella enterica subsp. enterica...   130   2e-29
UniRef50_C8TZ57 Predicted transcriptional regulator Pch-homolog ...    51   2e-05
UniRef50_C4X6V8 Putative uncharacterized protein n=1 Tax=Klebsie...    49   6e-05
UniRef50_C9Y1N4 Putative uncharacterized protein n=2 Tax=Cronoba...    48   1e-04
UniRef50_D0FT77 Phage-related protein n=2 Tax=Erwinia pyrifoliae...    46   4e-04
UniRef50_A4WB51 Putative uncharacterized protein n=1 Tax=Enterob...    42   0.006
UniRef50_C8T721 Transcriptional activator PerC family protein n=...    42   0.007
UniRef50_A1Z2R4 PerC protein n=11 Tax=Escherichia RepID=A1Z2R4_E...
42 0.008 UniRef50_A7FET3 Transcriptional regulator, PerC family n=2 Tax=Y... 40 0.020 UniRef50_B7MV60 Putative transcriptional activator (PerC family)... 39 0.046 >UniRef50_P76510 Uncharacterized protein yfdN n=31 Tax=root RepID=YFDN_ECOLI Length = 164 Score = 342 bits (876), Expect = 4e-93, Method: Compositional matrix adjust. Identities = 164/164 (100%), Positives = 164/164 (100%) Query: 1 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH 60 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH Sbjct: 1 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH 60 Query: 61 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA 120 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA Sbjct: 61 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA 120 Query: 121 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA Sbjct: 121 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 >UniRef50_UPI0001BCF2D2 hypothetical protein EscherichiacoliO157EcO_17952 n=1 Tax=Escherichia coli O157:H7 str. FRIK966 RepID=UPI0001BCF2D2 Length = 111 Score = 156 bits (395), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 74/79 (93%), Positives = 76/79 (96%) Query: 56 DTHRHFPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA 115 DT RHFPRLTERAQ+PEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA Sbjct: 1 DTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA 60 Query: 116 TVWMAAFRESHSQPERNNF 134 TVWMAAFRESHSQPE+ F Sbjct: 61 TVWMAAFRESHSQPEQXIF 79 >UniRef50_B5FL70 Sb43 n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FL70_SALDC Length = 157 Score = 130 bits (326), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 73/161 (45%), Positives = 96/161 (59%), Gaps = 7/161 (4%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 MSLL VQ FIE +PG TS +IADAF Y+R V +SASKL + RV R +GD R++ Sbjct: 1 MSLLAKVQAFIELNPGLTSNEIADAFPEYARFDVQRSASKLYRCKRVNRRMDGDVFRYYA 60 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 + + R R+ + G+ DP VI L +AEELESRGL+ RA+ VW+ AF Sbjct: 61 -------GKDEAVILTLRQKRSGHTGSGDPMVIAKLVSRAEELESRGLFNRASIVWLEAF 113 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVG 163 ES ER FL RR++CL + KR E+ YL+G +VG Sbjct: 114 SESQFIYEREEFLRRRQKCLNRIKKRIRPVEQVYLAGRFVG 154 >UniRef50_C8TZ57 Predicted transcriptional regulator Pch-homolog n=2 Tax=Escherichia coli RepID=C8TZ57_ECO10 Length = 124 Score = 50.8 bits (120), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 21/43 (48%), Positives = 32/43 (74%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCL 142 ++AE+LES+GL+RRAA W+ +E+H+ P+R + RRE CL Sbjct: 76 KKAEQLESQGLWRRAAARWLDVMKEAHTDPQREHIARRREICL 118 >UniRef50_C4X6V8 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4X6V8_KLEPN Length = 152 Score = 48.9 bits (115), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSG 159 + A++LE R LYRRAATVW S SQ ++ ++ CLRK ++ S E L+G Sbjct: 84 KAAKKLEERSLYRRAATVWHQLSTSSCSQKTLEYYIRQKNACLRK-ARMGKSHTECLLAG 142 Query: 160 NYVG 163 NY G Sbjct: 143 NYCG 146 >UniRef50_C9Y1N4 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=C9Y1N4_CROTZ Length = 78 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 25/60 (41%), Positives = 37/60 (61%) Query: 104 ELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVG 163 +LE + L+RRAA V++AAF S S +R +R +CL+ S++ YL+GNYVG Sbjct: 17 QLERQNLWRRAAHVYLAAFDASKSNRDRERLAKKRTQCLKMSNRVGYVEGRCYLAGNYVG 76 >UniRef50_D0FT77 Phage-related protein n=2 Tax=Erwinia pyrifoliae RepID=D0FT77_ERWPY Length = 150 Score = 46.2 bits (108), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 39/145 (26%), Positives = 66/145 (45%), Gaps = 20/145 (13%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 M++ + ++I +PGCTS IA+ AG ++ V ++L + V R +H F Sbjct: 1 MNIRENAVEYIRLNPGCTSTQIANG-AGIPKRMVQPLMTEL-YTQEVVLRYALKSHPFFY 58 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 RL + + + R + + Y + +AEELE + L+ RAA W+ A Sbjct: 59 RLPDES---------DKRRLNDAYEAHRE---------KAEELEKKHLWNRAAREWLLAM 100 Query: 123 RESHSQPERNNFLARRERCLRKSSK 147 E+ + R + RRE C+ K Sbjct: 101 DETRDEEAREKAIRRREYCISHGKK 125 >UniRef50_A4WB51 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WB51_ENT38 Length = 144 Score = 42.4 bits (98), Expect = 0.006, Method: Compositional matrix adjust. Identities = 40/143 (27%), Positives = 58/143 (40%), Gaps = 13/143 (9%) Query: 7 NDVQKFIEAHPGCTSGDIADAF-AGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFPRLT 65 + V F+ P C GD++DA G S L A L G + +R T + + Sbjct: 9 DQVAIFVRYQPHCAVGDVSDALDLGSSTAGKLLRA--LTDEG-IVNRSHNGTQFTYEAIE 65 Query: 66 ERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRES 125 A E P + DP L A+ELE++GL+RRAA V+ + Sbjct: 66 GAAIPDEFLPCMLHKA---------DPAKTLAAETLAKELEAKGLWRRAAAVYSGMLEIA 116 Query: 126 HSQPERNNFLARRERCLRKSSKR 148 + E RR+ CL+ S R Sbjct: 117 VNAVEVGRIAKRRDACLQMSRGR 139 >UniRef50_C8T721 Transcriptional activator PerC family protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T721_KLEPR Length = 89 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. 
Identities = 20/47 (42%), Positives = 30/47 (63%) Query: 101 QAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147 +AEELE++GLYRRAA W+ S+ + E++ R C+RKS + Sbjct: 5 KAEELEAKGLYRRAAQRWLEVMLMSNDRAEQDKARQRHNECVRKSKR 51 >UniRef50_A1Z2R4 PerC protein n=11 Tax=Escherichia RepID=A1Z2R4_ECOLX Length = 104 Score = 42.0 bits (97), Expect = 0.008, Method: Compositional matrix adjust. Identities = 21/54 (38%), Positives = 30/54 (55%) Query: 94 VILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147 V + R AEELE++G YRRAA W + + ER++ RR C RK+ + Sbjct: 13 VTMVHDRIAEELEAKGFYRRAAARWGEVMQLVETDKERHHITMRRLECSRKAQR 66 >UniRef50_A7FET3 Transcriptional regulator, PerC family n=2 Tax=Yersinia RepID=A7FET3_YERP3 Length = 85 Score = 40.4 bits (93), Expect = 0.020, Method: Compositional matrix adjust. Identities = 21/51 (41%), Positives = 28/51 (54%) Query: 95 ILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKS 145 I L ++A +LE G +RRAAT W+A E + RRE+CL KS Sbjct: 10 IETLIQKATDLELAGFWRRAATQWLAVMDHCPDDTEWEQIVRRREQCLLKS 60 >UniRef50_B7MV60 Putative transcriptional activator (PerC family) from phage origin n=4 Tax=Enterobacteriaceae RepID=B7MV60_ECO81 Length = 122 Score = 39.3 bits (90), Expect = 0.046, Method: Compositional matrix adjust. Identities = 20/48 (41%), Positives = 27/48 (56%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147 R AEELE++G YRRAA W + + ER+ RR C RK+ + Sbjct: 45 RIAEELEAKGFYRRAAARWGEVMQLVETDKERHQVTMRRLECSRKAQR 92 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76510 Uncharacterized protein yfdN n=31 Tax=root RepID... 230 2e-59 UniRef50_B5FL70 Sb43 n=2 Tax=Salmonella enterica subsp. enterica... 195 5e-49 UniRef50_D0FT77 Phage-related protein n=2 Tax=Erwinia pyrifoliae... 146 2e-34 UniRef50_UPI0001BCF2D2 hypothetical protein EscherichiacoliO157E... 
115 7e-25 UniRef50_C4X6V8 Putative uncharacterized protein n=1 Tax=Klebsie... 98 7e-20 UniRef50_C9Y1N4 Putative uncharacterized protein n=2 Tax=Cronoba... 79 4e-14 UniRef50_C8TZ57 Predicted transcriptional regulator Pch-homolog ... 64 2e-09 Sequences not found previously or not previously below threshold: UniRef50_A4WB51 Putative uncharacterized protein n=1 Tax=Enterob... 53 3e-06 UniRef50_A7FET3 Transcriptional regulator, PerC family n=2 Tax=Y... 49 4e-05 UniRef50_Q49JF3 LdaB n=22 Tax=Escherichia RepID=Q49JF3_ECOLX 49 6e-05 UniRef50_B5XSL7 Putative uncharacterized protein n=1 Tax=Klebsie... 46 3e-04 UniRef50_A1Z2R4 PerC protein n=11 Tax=Escherichia RepID=A1Z2R4_E... 46 5e-04 UniRef50_C8T721 Transcriptional activator PerC family protein n=... 45 8e-04 UniRef50_B5Y379 Putative uncharacterized protein n=6 Tax=Enterob... 45 8e-04 UniRef50_B7L407 Protein perC (Protein bfpW) n=3 Tax=Escherichia ... 45 9e-04 UniRef50_A9MY25 Putative uncharacterized protein n=6 Tax=Salmone... 44 0.002 UniRef50_B7MV60 Putative transcriptional activator (PerC family)... 44 0.002 UniRef50_C8TZ59 Predicted transcriptional regulator Pch-homolog ... 43 0.004 UniRef50_UPI00019F1C15 hypothetical protein CATC2_07755 n=1 Tax=... 42 0.005 UniRef50_UPI0001B532C3 hypothetical protein ShiD9_09914 n=1 Tax=... 42 0.009 UniRef50_B7UTE9 Protein perC n=8 Tax=Escherichia coli RepID=PERC... 42 0.009 UniRef50_C6UPG6 Transcriptional regulator n=13 Tax=Escherichia c... 41 0.016 UniRef50_B7UGU1 Predicted protein n=6 Tax=root RepID=B7UGU1_ECO27 40 0.023 UniRef50_C1MF26 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 39 0.062 >UniRef50_P76510 Uncharacterized protein yfdN n=31 Tax=root RepID=YFDN_ECOLI Length = 164 Score = 230 bits (585), Expect = 2e-59, Method: Composition-based stats. 
Identities = 164/164 (100%), Positives = 164/164 (100%) Query: 1 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH 60 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH Sbjct: 1 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH 60 Query: 61 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA 120 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA Sbjct: 61 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA 120 Query: 121 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA Sbjct: 121 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 >UniRef50_B5FL70 Sb43 n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FL70_SALDC Length = 157 Score = 195 bits (495), Expect = 5e-49, Method: Composition-based stats. Identities = 73/161 (45%), Positives = 96/161 (59%), Gaps = 7/161 (4%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 MSLL VQ FIE +PG TS +IADAF Y+R V +SASKL + RV R +GD R++ Sbjct: 1 MSLLAKVQAFIELNPGLTSNEIADAFPEYARFDVQRSASKLYRCKRVNRRMDGDVFRYYA 60 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 + + R R+ + G+ DP VI L +AEELESRGL+ RA+ VW+ AF Sbjct: 61 -------GKDEAVILTLRQKRSGHTGSGDPMVIAKLVSRAEELESRGLFNRASIVWLEAF 113 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVG 163 ES ER FL RR++CL + KR E+ YL+G +VG Sbjct: 114 SESQFIYEREEFLRRRQKCLNRIKKRIRPVEQVYLAGRFVG 154 >UniRef50_D0FT77 Phage-related protein n=2 Tax=Erwinia pyrifoliae RepID=D0FT77_ERWPY Length = 150 Score = 146 bits (369), Expect = 2e-34, Method: Composition-based stats. 
Identities = 39/151 (25%), Positives = 67/151 (44%), Gaps = 20/151 (13%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 M++ + ++I +PGCTS IA+ AG ++ V ++L + V R +H F Sbjct: 1 MNIRENAVEYIRLNPGCTSTQIANG-AGIPKRMVQPLMTEL-YTQEVVLRYALKSHPFFY 58 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 RL + + + R + + Y + +AEELE + L+ RAA W+ A Sbjct: 59 RLPDES---------DKRRLNDAYEAHRE---------KAEELEKKHLWNRAAREWLLAM 100 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGE 153 E+ + R + RRE C+ K + Sbjct: 101 DETRDEEAREKAIRRREYCISHGKKGGVAET 131 >UniRef50_UPI0001BCF2D2 hypothetical protein EscherichiacoliO157EcO_17952 n=1 Tax=Escherichia coli O157:H7 str. FRIK966 RepID=UPI0001BCF2D2 Length = 111 Score = 115 bits (287), Expect = 7e-25, Method: Composition-based stats. Identities = 74/81 (91%), Positives = 76/81 (93%) Query: 56 DTHRHFPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA 115 DT RHFPRLTERAQ+PEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA Sbjct: 1 DTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA 60 Query: 116 TVWMAAFRESHSQPERNNFLA 136 TVWMAAFRESHSQPE+ F Sbjct: 61 TVWMAAFRESHSQPEQXIFWR 81 >UniRef50_C4X6V8 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4X6V8_KLEPN Length = 152 Score = 98.4 bits (243), Expect = 7e-20, Method: Composition-based stats. 
Identities = 42/162 (25%), Positives = 68/162 (41%), Gaps = 15/162 (9%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 M++ +FI +P DI AF V + +L GR+ + + Sbjct: 1 MNITQMAFEFIAKNPDQKMRDIISAFPECKPVSVKSAVYRLYTEGRLETK--ATSCGFIY 58 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 R+ A + +++ + + L + A++LE R LYRRAATVW Sbjct: 59 RVINDASCCDD--------LQDDFKSRGN----LEQEKAAKKLEERSLYRRAATVWHQLS 106 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 S SQ ++ ++ CLRK ++ S E L+GNY G Sbjct: 107 TSSCSQKTLEYYIRQKNACLRK-ARMGKSHTECLLAGNYCGG 147 >UniRef50_C9Y1N4 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=C9Y1N4_CROTZ Length = 78 Score = 79.5 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 26/62 (41%), Positives = 38/62 (61%) Query: 102 AEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNY 161 A +LE + L+RRAA V++AAF S S +R +R +CL+ S++ YL+GNY Sbjct: 15 ACQLERQNLWRRAAHVYLAAFDASKSNRDRERLAKKRTQCLKMSNRVGYVEGRCYLAGNY 74 Query: 162 VG 163 VG Sbjct: 75 VG 76 >UniRef50_C8TZ57 Predicted transcriptional regulator Pch-homolog n=2 Tax=Escherichia coli RepID=C8TZ57_ECO10 Length = 124 Score = 64.1 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 21/43 (48%), Positives = 32/43 (74%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCL 142 ++AE+LES+GL+RRAA W+ +E+H+ P+R + RRE CL Sbjct: 76 KKAEQLESQGLWRRAAARWLDVMKEAHTDPQREHIARRREICL 118 >UniRef50_A4WB51 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WB51_ENT38 Length = 144 Score = 53.3 bits (126), Expect = 3e-06, Method: Composition-based stats. 
Identities = 36/152 (23%), Positives = 56/152 (36%), Gaps = 13/152 (8%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 ++ + V F+ P C GD++DA + L G V G + Sbjct: 5 LTQKDQVAIFVRYQPHCAVGDVSDAL-DLGSSTAGKLLRALTDEGIVNRSHNGT--QFTY 61 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 E A P+ DP L A+ELE++GL+RRAA V+ Sbjct: 62 EAIEGAAIPDEFLPCMLHKA--------DPAKTLAAETLAKELEAKGLWRRAAAVYSGML 113 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGEE 154 + + E RR+ CL+ S R + + Sbjct: 114 EIAVNAVEVGRIAKRRDACLQMS--RGRNHAQ 143 >UniRef50_A7FET3 Transcriptional regulator, PerC family n=2 Tax=Yersinia RepID=A7FET3_YERP3 Length = 85 Score = 49.4 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 21/52 (40%), Positives = 28/52 (53%) Query: 95 ILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSS 146 I L ++A +LE G +RRAAT W+A E + RRE+CL KS Sbjct: 10 IETLIQKATDLELAGFWRRAATQWLAVMDHCPDDTEWEQIVRRREQCLLKSQ 61 >UniRef50_Q49JF3 LdaB n=22 Tax=Escherichia RepID=Q49JF3_ECOLX Length = 108 Score = 49.0 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 1/52 (1%) Query: 102 AEELESRGLYRRAATVWMAAF-RESHSQPERNNFLARRERCLRKSSKRAASG 152 A++LE G +RRAAT W+ +++ +R RRE CL + + Sbjct: 35 AQKLEEAGFWRRAATRWLTVMGDVEYTEAQREWLRQRREYCLMQIPQLVLPE 86 >UniRef50_B5XSL7 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae 342 RepID=B5XSL7_KLEP3 Length = 148 Score = 46.4 bits (108), Expect = 3e-04, Method: Composition-based stats. 
Identities = 28/139 (20%), Positives = 53/139 (38%), Gaps = 8/139 (5%) Query: 9 VQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFPRLTERA 68 V + P C + DA +L +G++ G ++ R+ Sbjct: 17 VLAIVSRTPECVLQHVCDAL-DLQASTAGNLLRQLHAAGKLHRTYNG--CQYVYRVVAGV 73 Query: 69 QDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQ 128 + P+ + P+ D + + A+ LE + L+RRAATV+ + + + Sbjct: 74 EVPDVALPQTATPLSE-----EDVKKVQNALSLAKTLEDKKLWRRAATVYTSMLGMTTTA 128 Query: 129 PERNNFLARRERCLRKSSK 147 E R RCLR +++ Sbjct: 129 NELWLLAKMRNRCLRNATR 147 >UniRef50_A1Z2R4 PerC protein n=11 Tax=Escherichia RepID=A1Z2R4_ECOLX Length = 104 Score = 46.0 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 30/62 (48%) Query: 90 NDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRA 149 V + R AEELE++G YRRAA W + + ER++ RR C RK+ + Sbjct: 9 RSEGVTMVHDRIAEELEAKGFYRRAAARWGEVMQLVETDKERHHITMRRLECSRKAQRAP 68 Query: 150 AS 151 Sbjct: 69 EP 70 >UniRef50_C8T721 Transcriptional activator PerC family protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T721_KLEPR Length = 89 Score = 45.2 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 20/48 (41%), Positives = 30/48 (62%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147 +AEELE++GLYRRAA W+ S+ + E++ R C+RKS + Sbjct: 4 SKAEELEAKGLYRRAAQRWLEVMLMSNDRAEQDKARQRHNECVRKSKR 51 >UniRef50_B5Y379 Putative uncharacterized protein n=6 Tax=Enterobacteriaceae RepID=B5Y379_KLEP3 Length = 96 Score = 45.2 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 20/53 (37%), Positives = 30/53 (56%) Query: 102 AEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEE 154 AE LE+ GLYRRAA+ W+ + + +R R +CL+K+ + A EE Sbjct: 8 AERLEASGLYRRAASRWIEVMQRCLNDDDREWIRHHRNQCLKKAQRPPAPKEE 60 >UniRef50_B7L407 Protein perC (Protein bfpW) n=3 Tax=Escherichia RepID=B7L407_ECO55 Length = 95 Score = 45.2 bits (105), Expect = 9e-04, Method: Composition-based stats. 
Identities = 18/48 (37%), Positives = 26/48 (54%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147 +AE LE+RGLYRRAA W ++ R RR C+ K+++ Sbjct: 5 SKAEALEARGLYRRAAARWAEVIMLANDDKAREQAAKRRAECIHKAAR 52 >UniRef50_A9MY25 Putative uncharacterized protein n=6 Tax=Salmonella enterica subsp. enterica RepID=A9MY25_SALPB Length = 71 Score = 43.7 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%) Query: 102 AEELESRGLYRRAATVWMAAF-RESHSQPERNNFLARRERCLRKSSKRAAS 151 A+ LE+ G +RRA+T W+ ++ +R L RR CL + S AS Sbjct: 6 AQNLEAAGHWRRASTRWLLVMGDFECTEAQREWLLLRRNYCLAQISSPGAS 56 >UniRef50_B7MV60 Putative transcriptional activator (PerC family) from phage origin n=4 Tax=Enterobacteriaceae RepID=B7MV60_ECO81 Length = 122 Score = 43.7 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 22/57 (38%), Positives = 29/57 (50%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWY 156 R AEELE++G YRRAA W + + ER+ RR C RK+ + E Y Sbjct: 45 RIAEELEAKGFYRRAAARWGEVMQLVETDKERHQVTMRRLECSRKAQRPPEPPTENY 101 >UniRef50_C8TZ59 Predicted transcriptional regulator Pch-homolog n=4 Tax=Escherichia coli RepID=C8TZ59_ECO10 Length = 96 Score = 42.9 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 18/47 (38%), Positives = 26/47 (55%) Query: 102 AEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKR 148 A++LE RGL+RRAAT W + + ER RR C+ K+ + Sbjct: 6 AQKLEERGLWRRAATRWAEVLLHAETDSEREEAARRRAICISKTRRM 52 >UniRef50_UPI00019F1C15 hypothetical protein CATC2_07755 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F1C15 Length = 96 Score = 42.5 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 1/52 (1%) Query: 102 AEELESRGLYRRAATVWMAAF-RESHSQPERNNFLARRERCLRKSSKRAASG 152 AE+LE+ GL+RRAAT W+ R ++ ER RR C K + A Sbjct: 7 AEKLEAAGLWRRAATRWLNVMLRIEYTVAEREWIRQRRIYCQSKITPVAVPE 58 >UniRef50_UPI0001B532C3 hypothetical protein ShiD9_09914 n=1 Tax=Shigella sp. D9 RepID=UPI0001B532C3 Length = 94 Score = 41.7 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 1/52 (1%) Query: 102 AEELESRGLYRRAATVWMAAFRESH-SQPERNNFLARRERCLRKSSKRAASG 152 A +LE+ G +RRA+ W+ + + +R L RR+ CL + Sbjct: 14 ALKLEAAGCWRRASARWLVVMGAADITDAQREWLLLRRKYCLAQIKSLLGPQ 65 >UniRef50_B7UTE9 Protein perC n=8 Tax=Escherichia coli RepID=PERC_ECO27 Length = 89 Score = 41.7 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 17/48 (35%), Positives = 24/48 (50%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147 ++A+ LE +G YRRAA W S ER +R C+ KS + Sbjct: 6 KKAKYLEEKGFYRRAADRWAEIMVLLSSDAERKLAAQKRAFCINKSLR 53 >UniRef50_C6UPG6 Transcriptional regulator n=13 Tax=Escherichia coli RepID=C6UPG6_ECO5T Length = 104 Score = 41.0 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 18/46 (39%), Positives = 24/46 (52%) Query: 102 AEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147 AE LE +GLYRRAA W + +R +R CLRK+ + Sbjct: 7 AECLEKKGLYRRAAERWAKVMVQLSDDQKRKVAAQKRAECLRKARR 52 >UniRef50_B7UGU1 Predicted protein n=6 Tax=root RepID=B7UGU1_ECO27 Length = 99 Score = 40.2 bits (92), Expect = 0.023, Method: Composition-based stats. Identities = 16/47 (34%), Positives = 27/47 (57%) Query: 100 RQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSS 146 ++A E ES+G YR AA +W+ + + ER RR++C+ K + Sbjct: 35 KRAVERESKGQYRIAARLWLLCMDVAVGEVERARIAIRRDQCISKGN 81 >UniRef50_C1MF26 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MF26_9ENTR Length = 111 Score = 39.0 bits (89), Expect = 0.062, Method: Composition-based stats. 
Identities = 21/47 (44%), Positives = 27/47 (57%), Gaps = 2/47 (4%) Query: 102 AEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKR 148 A +LE RGLY RAA W E H+Q + +RERC+R S+ R Sbjct: 28 ARDLEERGLYLRAARQWGEVMFE-HTQCT-EYIVEQRERCIRLSNSR 72 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76510 Uncharacterized protein yfdN n=31 Tax=root RepID... 200 2e-50 UniRef50_B5FL70 Sb43 n=2 Tax=Salmonella enterica subsp. enterica... 173 2e-42 UniRef50_C4X6V8 Putative uncharacterized protein n=1 Tax=Klebsie... 157 1e-37 UniRef50_A4WB51 Putative uncharacterized protein n=1 Tax=Enterob... 137 2e-31 UniRef50_B5XSL7 Putative uncharacterized protein n=1 Tax=Klebsie... 133 2e-30 UniRef50_D0FT77 Phage-related protein n=2 Tax=Erwinia pyrifoliae... 126 3e-28 UniRef50_UPI0001BCF2D2 hypothetical protein EscherichiacoliO157E... 100 1e-20 UniRef50_C9Y1N4 Putative uncharacterized protein n=2 Tax=Cronoba... 81 1e-14 UniRef50_A1Z2R4 PerC protein n=11 Tax=Escherichia RepID=A1Z2R4_E... 78 1e-13 UniRef50_B5Y379 Putative uncharacterized protein n=6 Tax=Enterob... 73 3e-12 UniRef50_Q49JF3 LdaB n=22 Tax=Escherichia RepID=Q49JF3_ECOLX 72 7e-12 UniRef50_A7FET3 Transcriptional regulator, PerC family n=2 Tax=Y... 72 8e-12 UniRef50_C8TZ57 Predicted transcriptional regulator Pch-homolog ... 68 1e-10 UniRef50_C8T721 Transcriptional activator PerC family protein n=... 64 1e-09 UniRef50_B7L407 Protein perC (Protein bfpW) n=3 Tax=Escherichia ... 64 2e-09 Sequences not found previously or not previously below threshold: UniRef50_B7MV60 Putative transcriptional activator (PerC family)... 74 1e-12 UniRef50_UPI00019F1C15 hypothetical protein CATC2_07755 n=1 Tax=... 60 2e-08 UniRef50_A9MY25 Putative uncharacterized protein n=6 Tax=Salmone... 55 9e-07 UniRef50_B7UTE9 Protein perC n=8 Tax=Escherichia coli RepID=PERC... 
55 1e-06 UniRef50_C8TZ59 Predicted transcriptional regulator Pch-homolog ... 54 2e-06 UniRef50_C6UPG6 Transcriptional regulator n=13 Tax=Escherichia c... 53 4e-06 UniRef50_UPI0001B532C3 hypothetical protein ShiD9_09914 n=1 Tax=... 52 7e-06 UniRef50_B7UGU1 Predicted protein n=6 Tax=root RepID=B7UGU1_ECO27 45 0.001 UniRef50_C1MF26 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 43 0.005 UniRef50_Q12Z82 Transcriptional regulator, ArsR family protein n... 43 0.005 UniRef50_B7US58 Predicted protein n=17 Tax=Enterobacteriaceae Re... 41 0.017 UniRef50_C8Q8I9 Putative uncharacterized protein n=1 Tax=Pantoea... 40 0.027 >UniRef50_P76510 Uncharacterized protein yfdN n=31 Tax=root RepID=YFDN_ECOLI Length = 164 Score = 200 bits (507), Expect = 2e-50, Method: Composition-based stats. Identities = 164/164 (100%), Positives = 164/164 (100%) Query: 1 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH 60 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH Sbjct: 1 MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRH 60 Query: 61 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA 120 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA Sbjct: 61 FPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMA 120 Query: 121 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA Sbjct: 121 AFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 >UniRef50_B5FL70 Sb43 n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FL70_SALDC Length = 157 Score = 173 bits (438), Expect = 2e-42, Method: Composition-based stats. 
Identities = 73/161 (45%), Positives = 96/161 (59%), Gaps = 7/161 (4%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 MSLL VQ FIE +PG TS +IADAF Y+R V +SASKL + RV R +GD R++ Sbjct: 1 MSLLAKVQAFIELNPGLTSNEIADAFPEYARFDVQRSASKLYRCKRVNRRMDGDVFRYY- 59 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 + + R R+ + G+ DP VI L +AEELESRGL+ RA+ VW+ AF Sbjct: 60 ------AGKDEAVILTLRQKRSGHTGSGDPMVIAKLVSRAEELESRGLFNRASIVWLEAF 113 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVG 163 ES ER FL RR++CL + KR E+ YL+G +VG Sbjct: 114 SESQFIYEREEFLRRRQKCLNRIKKRIRPVEQVYLAGRFVG 154 >UniRef50_C4X6V8 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4X6V8_KLEPN Length = 152 Score = 157 bits (397), Expect = 1e-37, Method: Composition-based stats. Identities = 42/162 (25%), Positives = 68/162 (41%), Gaps = 15/162 (9%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 M++ +FI +P DI AF V + +L GR+ + + Sbjct: 1 MNITQMAFEFIAKNPDQKMRDIISAFPECKPVSVKSAVYRLYTEGRLETK--ATSCGFIY 58 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 R+ A + +++ + + L + A++LE R LYRRAATVW Sbjct: 59 RVINDASCCDD--------LQDDFKSRGN----LEQEKAAKKLEERSLYRRAATVWHQLS 106 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA 164 S SQ ++ ++ CLRK+ + S E L+GNY G Sbjct: 107 TSSCSQKTLEYYIRQKNACLRKA-RMGKSHTECLLAGNYCGG 147 >UniRef50_A4WB51 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WB51_ENT38 Length = 144 Score = 137 bits (344), Expect = 2e-31, Method: Composition-based stats. 
Identities = 36/152 (23%), Positives = 56/152 (36%), Gaps = 13/152 (8%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 ++ + V F+ P C GD++DA + L G V G + Sbjct: 5 LTQKDQVAIFVRYQPHCAVGDVSDAL-DLGSSTAGKLLRALTDEGIVNRSHNGT--QFTY 61 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 E A P+ DP L A+ELE++GL+RRAA V+ Sbjct: 62 EAIEGAAIPDEFLPCMLHKA--------DPAKTLAAETLAKELEAKGLWRRAAAVYSGML 113 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGEE 154 + + E RR+ CL+ S R + + Sbjct: 114 EIAVNAVEVGRIAKRRDACLQMS--RGRNHAQ 143 >UniRef50_B5XSL7 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae 342 RepID=B5XSL7_KLEP3 Length = 148 Score = 133 bits (333), Expect = 2e-30, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 55/145 (37%), Gaps = 8/145 (5%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 ++ V + P C + DA +L +G++ G ++ Sbjct: 11 VTKAQMVLAIVSRTPECVLQHVCDAL-DLQASTAGNLLRQLHAAGKLHRTYNG--CQYVY 67 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 R+ + P+ + P+ D + + A+ LE + L+RRAATV+ + Sbjct: 68 RVVAGVEVPDVALPQTATPLSE-----EDVKKVQNALSLAKTLEDKKLWRRAATVYTSML 122 Query: 123 RESHSQPERNNFLARRERCLRKSSK 147 + + E R RCLR +++ Sbjct: 123 GMTTTANELWLLAKMRNRCLRNATR 147 >UniRef50_D0FT77 Phage-related protein n=2 Tax=Erwinia pyrifoliae RepID=D0FT77_ERWPY Length = 150 Score = 126 bits (316), Expect = 3e-28, Method: Composition-based stats. 
Identities = 39/151 (25%), Positives = 67/151 (44%), Gaps = 20/151 (13%) Query: 3 MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62 M++ + ++I +PGCTS IA+ AG ++ V ++L + V R +H F Sbjct: 1 MNIRENAVEYIRLNPGCTSTQIANG-AGIPKRMVQPLMTELY-TQEVVLRYALKSHPFFY 58 Query: 63 RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122 RL + + + R + + Y + +AEELE + L+ RAA W+ A Sbjct: 59 RLPDES---------DKRRLNDAYEAHRE---------KAEELEKKHLWNRAAREWLLAM 100 Query: 123 RESHSQPERNNFLARRERCLRKSSKRAASGE 153 E+ + R + RRE C+ K + Sbjct: 101 DETRDEEAREKAIRRREYCISHGKKGGVAET 131 >UniRef50_UPI0001BCF2D2 hypothetical protein EscherichiacoliO157EcO_17952 n=1 Tax=Escherichia coli O157:H7 str. FRIK966 RepID=UPI0001BCF2D2 Length = 111 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 74/81 (91%), Positives = 76/81 (93%) Query: 56 DTHRHFPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA 115 DT RHFPRLTERAQ+PEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA Sbjct: 1 DTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAA 60 Query: 116 TVWMAAFRESHSQPERNNFLA 136 TVWMAAFRESHSQPE+ F Sbjct: 61 TVWMAAFRESHSQPEQXIFWR 81 >UniRef50_C9Y1N4 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=C9Y1N4_CROTZ Length = 78 Score = 81.0 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 39/75 (52%) Query: 89 TNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKR 148 + A +LE + L+RRAA V++AAF S S +R +R +CL+ S++ Sbjct: 2 KKEKLKHEVFEEMACQLERQNLWRRAAHVYLAAFDASKSNRDRERLAKKRTQCLKMSNRV 61 Query: 149 AASGEEWYLSGNYVG 163 YL+GNYVG Sbjct: 62 GYVEGRCYLAGNYVG 76 >UniRef50_A1Z2R4 PerC protein n=11 Tax=Escherichia RepID=A1Z2R4_ECOLX Length = 104 Score = 78.0 bits (190), Expect = 1e-13, Method: Composition-based stats. 
Identities = 21/62 (33%), Positives = 30/62 (48%) Query: 90 NDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRA 149 V + R AEELE++G YRRAA W + + ER++ RR C RK+ + Sbjct: 9 RSEGVTMVHDRIAEELEAKGFYRRAAARWGEVMQLVETDKERHHITMRRLECSRKAQRAP 68 Query: 150 AS 151 Sbjct: 69 EP 70 >UniRef50_B7MV60 Putative transcriptional activator (PerC family) from phage origin n=4 Tax=Enterobacteriaceae RepID=B7MV60_ECO81 Length = 122 Score = 74.1 bits (180), Expect = 1e-12, Method: Composition-based stats. Identities = 23/67 (34%), Positives = 31/67 (46%) Query: 90 NDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRA 149 V + R AEELE++G YRRAA W + + ER+ RR C RK+ + Sbjct: 35 RSEGVRMVHDRIAEELEAKGFYRRAAARWGEVMQLVETDKERHQVTMRRLECSRKAQRPP 94 Query: 150 ASGEEWY 156 E Y Sbjct: 95 EPPTENY 101 >UniRef50_B5Y379 Putative uncharacterized protein n=6 Tax=Enterobacteriaceae RepID=B5Y379_KLEP3 Length = 96 Score = 73.0 bits (177), Expect = 3e-12, Method: Composition-based stats. Identities = 20/56 (35%), Positives = 30/56 (53%) Query: 99 TRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEE 154 AE LE+ GLYRRAA+ W+ + + +R R +CL+K+ + A EE Sbjct: 5 DSIAERLEASGLYRRAASRWIEVMQRCLNDDDREWIRHHRNQCLKKAQRPPAPKEE 60 >UniRef50_Q49JF3 LdaB n=22 Tax=Escherichia RepID=Q49JF3_ECOLX Length = 108 Score = 71.8 bits (174), Expect = 7e-12, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 95 ILCLTRQAEELESRGLYRRAATVWMAAF-RESHSQPERNNFLARRERCLRKSSKRAASG 152 + A++LE G +RRAAT W+ +++ +R RRE CL + + Sbjct: 28 VQVQDPVAQKLEEAGFWRRAATRWLTVMGDVEYTEAQREWLRQRREYCLMQIPQLVLPE 86 >UniRef50_A7FET3 Transcriptional regulator, PerC family n=2 Tax=Yersinia RepID=A7FET3_YERP3 Length = 85 Score = 71.8 bits (174), Expect = 8e-12, Method: Composition-based stats. 
Identities = 21/56 (37%), Positives = 28/56 (50%)

Query: 95  ILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAA 150
           I L ++A +LE G +RRAAT W+A E + RRE+CL KS
Sbjct: 10  IETLIQKATDLELAGFWRRAATQWLAVMDHCPDDTEWEQIVRRREQCLLKSQGTPK 65

>UniRef50_C8TZ57 Predicted transcriptional regulator Pch-homolog n=2 Tax=Escherichia coli RepID=C8TZ57_ECO10
Length = 124

Score = 68.0 bits (164), Expect = 1e-10, Method: Composition-based stats.
Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 10/89 (11%)

Query: 64  LTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLT----------RQAEELESRGLYRR 113
           Q PE + + G V ++AE+LES+GL+RR
Sbjct: 30  APSGQQRPERLKIWGLTRKKKILYGARSTGVKQGEETEVNMDTITDKKAEQLESQGLWRR 89

Query: 114 AATVWMAAFRESHSQPERNNFLARRERCL 142
           AA W+ +E+H+ P+R + RRE CL
Sbjct: 90  AAARWLDVMKEAHTDPQREHIARRREICL 118

>UniRef50_C8T721 Transcriptional activator PerC family protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T721_KLEPR
Length = 89

Score = 64.5 bits (155), Expect = 1e-09, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 30/51 (58%)

Query: 99  TRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRA 149
           +AEELE++GLYRRAA W+ S+ + E++ R C+RKS +
Sbjct: 3   DSKAEELEAKGLYRRAAQRWLEVMLMSNDRAEQDKARQRHNECVRKSKRPP 53

>UniRef50_B7L407 Protein perC (Protein bfpW) n=3 Tax=Escherichia RepID=B7L407_ECO55
Length = 95

Score = 63.7 bits (153), Expect = 2e-09, Method: Composition-based stats.
Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 96  LCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAA 150
           + +AE LE+RGLYRRAA W ++ R RR C+ K+++ A
Sbjct: 1   MIHDSKAEALEARGLYRRAAARWAEVIMLANDDKAREQAAKRRAECIHKAARPPA 55

>UniRef50_UPI00019F1C15 hypothetical protein CATC2_07755 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F1C15
Length = 96

Score = 60.2 bits (144), Expect = 2e-08, Method: Composition-based stats.
Identities = 20/58 (34%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 96  LCLTRQAEELESRGLYRRAATVWMAAF-RESHSQPERNNFLARRERCLRKSSKRAASG 152
           + AE+LE+ GL+RRAAT W+ R ++ ER RR C K + A
Sbjct: 1   MIQDHIAEKLEAAGLWRRAATRWLNVMLRIEYTVAEREWIRQRRIYCQSKITPVAVPE 58

>UniRef50_A9MY25 Putative uncharacterized protein n=6 Tax=Salmonella enterica subsp. enterica RepID=A9MY25_SALPB
Length = 71

Score = 54.9 bits (130), Expect = 9e-07, Method: Composition-based stats.
Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 102 AEELESRGLYRRAATVWMAAF-RESHSQPERNNFLARRERCLRKSSKRAASGE 153
           A+ LE+ G +RRA+T W+ ++ +R L RR CL + S AS
Sbjct: 6   AQNLEAAGHWRRASTRWLLVMGDFECTEAQREWLLLRRNYCLAQISSPGASKA 58

>UniRef50_B7UTE9 Protein perC n=8 Tax=Escherichia coli RepID=PERC_ECO27
Length = 89

Score = 54.9 bits (130), Expect = 1e-06, Method: Composition-based stats.
Identities = 17/49 (34%), Positives = 24/49 (48%)

Query: 99  TRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSK 147
           ++A+ LE +G YRRAA W S ER +R C+ KS +
Sbjct: 5   DKKAKYLEEKGFYRRAADRWAEIMVLLSSDAERKLAAQKRAFCINKSLR 53

>UniRef50_C8TZ59 Predicted transcriptional regulator Pch-homolog n=4 Tax=Escherichia coli RepID=C8TZ59_ECO10
Length = 96

Score = 53.7 bits (127), Expect = 2e-06, Method: Composition-based stats.
Identities = 18/52 (34%), Positives = 26/52 (50%)

Query: 99  TRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAA 150
           A++LE RGL+RRAAT W + + ER RR C+ K+ +
Sbjct: 3   DSVAQKLEERGLWRRAATRWAEVLLHAETDSEREEAARRRAICISKTRRMPE 54

>UniRef50_C6UPG6 Transcriptional regulator n=13 Tax=Escherichia coli RepID=C6UPG6_ECO5T
Length = 104

Score = 52.9 bits (125), Expect = 4e-06, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 26/56 (46%)

Query: 96  LCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAAS 151
           + AE LE +GLYRRAA W + +R +R CLRK+ + S
Sbjct: 1   MLHDHVAECLEKKGLYRRAAERWAKVMVQLSDDQKRKVAAQKRAECLRKARRTPVS 56

>UniRef50_UPI0001B532C3 hypothetical protein ShiD9_09914 n=1 Tax=Shigella sp.
D9 RepID=UPI0001B532C3
Length = 94

Score = 51.8 bits (122), Expect = 7e-06, Method: Composition-based stats.
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 1/53 (1%)

Query: 101 QAEELESRGLYRRAATVWMAAFRESH-SQPERNNFLARRERCLRKSSKRAASG 152
           A +LE+ G +RRA+ W+ + + +R L RR+ CL +
Sbjct: 13  IALKLEAAGCWRRASARWLVVMGAADITDAQREWLLLRRKYCLAQIKSLLGPQ 65

>UniRef50_B7UGU1 Predicted protein n=6 Tax=root RepID=B7UGU1_ECO27
Length = 99

Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats.
Identities = 16/51 (31%), Positives = 29/51 (56%)

Query: 96  LCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSS 146
           + + ++A E ES+G YR AA +W+ + + ER RR++C+ K +
Sbjct: 31  IEVEKRAVERESKGQYRIAARLWLLCMDVAVGEVERARIAIRRDQCISKGN 81

>UniRef50_C1MF26 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MF26_9ENTR
Length = 111

Score = 42.5 bits (98), Expect = 0.005, Method: Composition-based stats.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 101 QAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKR 148
           A +LE RGLY RAA W E H+Q + +RERC+R S+ R
Sbjct: 27  LARDLEERGLYLRAARQWGEVMFE-HTQCT-EYIVEQRERCIRLSNSR 72

>UniRef50_Q12Z82 Transcriptional regulator, ArsR family protein n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12Z82_METBU
Length = 248

Score = 42.5 bits (98), Expect = 0.005, Method: Composition-based stats.
Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 5/99 (5%)

Query: 6   LNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFPR-- 63
           N+V ++I ++PGC+ DI+ +R KL ++ G + R F
Sbjct: 101 RNNVFEYIVSNPGCSVADISHGL-ELNRGTAKYHLKKLHNEHKITVYDHGSSPRFFQNNS 159

Query: 64  -LTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQ 101
           +E+AQ P ++ + + + P + Q
Sbjct: 160 VCSEQAQILAPFL-QDANQKQILLMIRDKPGMTNNEISQ 197

>UniRef50_B7US58 Predicted protein n=17 Tax=Enterobacteriaceae RepID=B7US58_ECO27
Length = 81

Score = 40.6 bits (93), Expect = 0.017, Method: Composition-based stats.
Identities = 14/74 (18%), Positives = 27/74 (36%), Gaps = 1/74 (1%)

Query: 9   VQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFPRLTERA 68
           + +I A+PGC+ G+IA A + +L +SG V + R ++
Sbjct: 3   ILDYIAANPGCSGGEIAAAL-NTPTTTINAELRRLWRSGSVIRKERKTGGRFSYQVNPMP 61

Query: 69  QDPEPQPVRETRPV 82
           + +
Sbjct: 62  FGCSNPLTQMFNQL 75

>UniRef50_C8Q8I9 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q8I9_9ENTR
Length = 137

Score = 40.2 bits (92), Expect = 0.027, Method: Composition-based stats.
Identities = 34/151 (22%), Positives = 48/151 (31%), Gaps = 27/151 (17%)

Query: 3   MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFP 62
           M+ + + A PG T IA+A G +R V S + G
Sbjct: 2   MNTETLILTHLMAFPGQTPAQIANAI-GRTRSTVGASLPVMVAVG--------------- 45

Query: 63  RLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAF 122
           + E R G D + I L +A L+ R L+ AA VW A
Sbjct: 46  ---------DIWSDAEARYYTAEPAGEGDEKYI-ALCDEAYRLQERNLWNPAAHVWHQAQ 95

Query: 123 RESHSQPERNNFLARRERCLRKSS-KRAASG 152
           + R R C+ K+ K G
Sbjct: 96  EATLKPGLREKARIRAIMCVEKAREKDPRPG 126

Database: uniref50.fasta
Posted date: Mar 8, 2010 10:38 AM
Number of letters in database: 1,040,396,356
Number of sequences in database: 3,077,464

Lambda     K        H
0.309      0.119    0.298

Lambda     K        H
0.267      0.0363   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 732,138,366
Number of Sequences: 3077464
Number of extensions: 22436050
Number of successful extensions: 85932
Number of sequences better than 1.0e-01: 30
Number of HSP's better than 0.1 without gapping: 57
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 85825
Number of HSP's gapped (non-prelim): 77
length of query: 164
length of database: 1,040,396,356
effective HSP length: 119
effective length of query: 45
effective length of database: 674,178,140
effective search space: 30338016300
effective search space used: 30338016300
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 87 (38.3 bits)