BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (120 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76515 Uncharacterized protein yfdS n=23 Tax=root RepID... 248 5e-65 UniRef50_D2TCG7 Uncharacterized protein yfdS n=2 Tax=Erwinia pyr... 100 3e-20 UniRef50_A4W7L9 Putative uncharacterized protein n=1 Tax=Enterob... 73 2e-12 UniRef50_Q76H44 ORF9 n=1 Tax=Enterobacteria phage ST104 RepID=Q7... 73 3e-12 UniRef50_Q9MCR3 Gp37 n=10 Tax=root RepID=Q9MCR3_BPHK7 72 7e-12 UniRef50_B3WIS8 Gp43 n=4 Tax=root RepID=B3WIS8_ECOLX 72 7e-12 UniRef50_B4T7R2 Gp37 n=3 Tax=Enterobacteriaceae RepID=B4T7R2_SALHS 70 2e-11 UniRef50_B5Z195 Eae protein n=14 Tax=root RepID=B5Z195_ECO5E 68 9e-11 UniRef50_B7UN77 Predicted protein n=2 Tax=root RepID=B7UN77_ECO27 60 2e-08 >UniRef50_P76515 Uncharacterized protein yfdS n=23 Tax=root RepID=YFDS_ECOLI Length = 120 Score = 248 bits (632), Expect = 5e-65, Method: Compositional matrix adjust. Identities = 120/120 (100%), Positives = 120/120 (100%) Query: 1 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPAN 60 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPAN Sbjct: 1 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPAN 60 Query: 61 ESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA 120 ESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA Sbjct: 61 ESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA 120 >UniRef50_D2TCG7 Uncharacterized protein yfdS n=2 Tax=Erwinia pyrifoliae RepID=D2TCG7_ERWPY Length = 129 Score = 99.8 bits (247), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 48/103 (46%), Positives = 68/103 (66%), Gaps = 4/103 (3%) Query: 17 VPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIG 76 + +D V ++ + + L + AA IPV+ +LP N +VLL+DA+GEGW+IG Sbjct: 25 IAKDFLVLSSHCHDMSLRVKELAQRDAA--TPTIPVTQQLPEPNNTVLLYDASGEGWVIG 82 Query: 77 WRSLWYTWGQKETGEWQWTFQVGDL--ENVNITHWAVMPKAPE 117 WRS+W T GQKETG W+W+FQ+ +L EN +ITHWA +P PE Sbjct: 83 WRSVWRTSGQKETGNWEWSFQITELNSENTHITHWAPIPDKPE 125 >UniRef50_A4W7L9 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4W7L9_ENT38 Length = 148 Score = 73.2 bits (178), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 33/55 (60%), Positives = 42/55 (76%) Query: 1 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAE 55 M M + EM+GFLRGKC+P D+KVNETNAEYLVRKF EA+ +L +++ V AE Sbjct: 1 MTMTLIEMDGFLRGKCLPGDMKVNETNAEYLVRKFAEAEAQLTSLTSQLESVVAE 55 >UniRef50_Q76H44 ORF9 n=1 Tax=Enterobacteria phage ST104 RepID=Q76H44_9CAUD Length = 169 Score = 72.8 bits (177), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 32/44 (72%), Positives = 38/44 (86%) Query: 2 RMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAAL 45 +M + EM+GFL+GKC+PRDLKVNETNAEYLVRKF EAK +AL Sbjct: 3 QMTLIEMDGFLKGKCIPRDLKVNETNAEYLVRKFAEAEAKISAL 46 >UniRef50_Q9MCR3 Gp37 n=10 Tax=root RepID=Q9MCR3_BPHK7 Length = 176 Score = 71.6 bits (174), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 44/98 (44%), Positives = 54/98 (55%), Gaps = 26/98 (26%) Query: 2 RMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKF---------------------DALEA 40 +M++ EM+GFL+GKC+PRDLKVNETNAEYLVRKF D LEA Sbjct: 3 QMSLIEMDGFLKGKCIPRDLKVNETNAEYLVRKFGELESKLETALRECRSAGITIDNLEA 62 Query: 41 KCAALENKIIPVSAELPPANESVLLFDANGEGWLIGWR 78 KCAAL + SA + E+V N G + GW Sbjct: 63 KCAAL----VAESAVMLETIEAVRSVADNSSG-IAGWH 95 >UniRef50_B3WIS8 Gp43 n=4 Tax=root RepID=B3WIS8_ECOLX Length = 196 Score = 71.6 bits (174), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 38/61 (62%), Positives = 42/61 (68%), Gaps = 7/61 (11%) Query: 8 MEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFD 67 MEGFLRGKCVPRDLKVNETNAEYLVRKF A E KI+ +S + A ES+ D Sbjct: 1 MEGFLRGKCVPRDLKVNETNAEYLVRKF-------AEAEAKILALSEDQQKAIESIKQAD 53 Query: 68 A 68 A Sbjct: 54 A 54 >UniRef50_B4T7R2 Gp37 n=3 Tax=Enterobacteriaceae RepID=B4T7R2_SALHS Length = 205 Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 30/39 (76%), Positives = 36/39 (92%) Query: 3 MNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAK 41 M++ EM+GFL+GKC+PRDLKVNETNAEYLVRKF LE+K Sbjct: 1 MSLIEMDGFLKGKCIPRDLKVNETNAEYLVRKFGELESK 39 >UniRef50_B5Z195 Eae protein n=14 Tax=root RepID=B5Z195_ECO5E Length = 109 Score = 67.8 bits (164), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 30/35 (85%), Positives = 32/35 (91%) Query: 8 MEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKC 42 MEGFLRGKCVPRDLKVNETNAEYLVRKFD + A+ Sbjct: 1 MEGFLRGKCVPRDLKVNETNAEYLVRKFDEVRAEA 35 >UniRef50_B7UN77 Predicted protein n=2 Tax=root RepID=B7UN77_ECO27 Length = 187 Score = 60.1 bits (144), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 25/34 (73%), Positives = 30/34 (88%) Query: 2 RMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKF 35 RM + EM+GFL+GKC+ DLKVNETNAEY+VRKF Sbjct: 3 RMTLIEMDGFLKGKCISCDLKVNETNAEYMVRKF 36 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76515 Uncharacterized protein yfdS n=23 Tax=root RepID... 203 1e-51 UniRef50_D2TCG7 Uncharacterized protein yfdS n=2 Tax=Erwinia pyr... 158 5e-38 UniRef50_Q9MCR3 Gp37 n=10 Tax=root RepID=Q9MCR3_BPHK7 93 2e-18 UniRef50_Q76H44 ORF9 n=1 Tax=Enterobacteria phage ST104 RepID=Q7... 86 5e-16 UniRef50_A4W7L9 Putative uncharacterized protein n=1 Tax=Enterob... 81 1e-14 UniRef50_B3WIS8 Gp43 n=4 Tax=root RepID=B3WIS8_ECOLX 77 1e-13 UniRef50_B4T7R2 Gp37 n=3 Tax=Enterobacteriaceae RepID=B4T7R2_SALHS 76 3e-13 UniRef50_B7UN77 Predicted protein n=2 Tax=root RepID=B7UN77_ECO27 68 1e-10 UniRef50_B5Z195 Eae protein n=14 Tax=root RepID=B5Z195_ECO5E 63 3e-09 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P76515 Uncharacterized protein yfdS n=23 Tax=root RepID=YFDS_ECOLI Length = 120 Score = 203 bits (517), Expect = 1e-51, Method: Composition-based stats. Identities = 120/120 (100%), Positives = 120/120 (100%) Query: 1 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPAN 60 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPAN Sbjct: 1 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPAN 60 Query: 61 ESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA 120 ESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA Sbjct: 61 ESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA 120 >UniRef50_D2TCG7 Uncharacterized protein yfdS n=2 Tax=Erwinia pyrifoliae RepID=D2TCG7_ERWPY Length = 129 Score = 158 bits (399), Expect = 5e-38, Method: Composition-based stats. Identities = 48/103 (46%), Positives = 68/103 (66%), Gaps = 4/103 (3%) Query: 17 VPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIG 76 + +D V ++ + + L + AA IPV+ +LP N +VLL+DA+GEGW+IG Sbjct: 25 IAKDFLVLSSHCHDMSLRVKELAQRDAA--TPTIPVTQQLPEPNNTVLLYDASGEGWVIG 82 Query: 77 WRSLWYTWGQKETGEWQWTFQVGDL--ENVNITHWAVMPKAPE 117 WRS+W T GQKETG W+W+FQ+ +L EN +ITHWA +P PE Sbjct: 83 WRSVWRTSGQKETGNWEWSFQITELNSENTHITHWAPIPDKPE 125 >UniRef50_Q9MCR3 Gp37 n=10 Tax=root RepID=Q9MCR3_BPHK7 Length = 176 Score = 93.3 bits (230), Expect = 2e-18, Method: Composition-based stats. Identities = 39/94 (41%), Positives = 51/94 (54%), Gaps = 18/94 (19%) Query: 2 RMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAK-----------------CAA 44 +M++ EM+GFL+GKC+PRDLKVNETNAEYLVRKF LE+K A Sbjct: 3 QMSLIEMDGFLKGKCIPRDLKVNETNAEYLVRKFGELESKLETALRECRSAGITIDNLEA 62 Query: 45 LENKIIPVSAELPPANESVLLFDANGEGWLIGWR 78 ++ SA + E+V N G + GW Sbjct: 63 KCAALVAESAVMLETIEAVRSVADNSSG-IAGWH 95 >UniRef50_Q76H44 ORF9 n=1 Tax=Enterobacteria phage ST104 RepID=Q76H44_9CAUD Length = 169 Score = 85.6 bits (210), Expect = 5e-16, Method: Composition-based stats. Identities = 36/67 (53%), Positives = 46/67 (68%), Gaps = 7/67 (10%) Query: 2 RMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANE 61 +M + EM+GFL+GKC+PRDLKVNETNAEYLVRKF EAK +AL + + A E Sbjct: 3 QMTLIEMDGFLKGKCIPRDLKVNETNAEYLVRKFAEAEAKISAL-------AEDHQRAIE 55 Query: 62 SVLLFDA 68 S+ D+ Sbjct: 56 SIKQADS 62 >UniRef50_A4W7L9 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4W7L9_ENT38 Length = 148 Score = 80.6 bits (197), Expect = 1e-14, Method: Composition-based stats. Identities = 33/55 (60%), Positives = 42/55 (76%) Query: 1 MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAE 55 M M + EM+GFLRGKC+P D+KVNETNAEYLVRKF EA+ +L +++ V AE Sbjct: 1 MTMTLIEMDGFLRGKCLPGDMKVNETNAEYLVRKFAEAEAQLTSLTSQLESVVAE 55 >UniRef50_B3WIS8 Gp43 n=4 Tax=root RepID=B3WIS8_ECOLX Length = 196 Score = 77.2 bits (188), Expect = 1e-13, Method: Composition-based stats. Identities = 38/61 (62%), Positives = 42/61 (68%), Gaps = 7/61 (11%) Query: 8 MEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFD 67 MEGFLRGKCVPRDLKVNETNAEYLVRKF EAK I+ +S + A ES+ D Sbjct: 1 MEGFLRGKCVPRDLKVNETNAEYLVRKFAEAEAK-------ILALSEDQQKAIESIKQAD 53 Query: 68 A 68 A Sbjct: 54 A 54 >UniRef50_B4T7R2 Gp37 n=3 Tax=Enterobacteriaceae RepID=B4T7R2_SALHS Length = 205 Score = 76.4 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 36/75 (48%), Positives = 49/75 (65%), Gaps = 2/75 (2%) Query: 3 MNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKC--AALENKIIPVSAELPPAN 60 M++ EM+GFL+GKC+PRDLKVNETNAEYLVRKF LE+K A E + ++ + A Sbjct: 1 MSLIEMDGFLKGKCIPRDLKVNETNAEYLVRKFGELESKLETALRECRSAGITIDNLEAK 60 Query: 61 ESVLLFDANGEGWLI 75 + L + G +I Sbjct: 61 CTALAAENAGMKSVI 75 >UniRef50_B7UN77 Predicted protein n=2 Tax=root RepID=B7UN77_ECO27 Length = 187 Score = 67.5 bits (163), Expect = 1e-10, Method: Composition-based stats. Identities = 25/34 (73%), Positives = 30/34 (88%) Query: 2 RMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKF 35 RM + EM+GFL+GKC+ DLKVNETNAEY+VRKF Sbjct: 3 RMTLIEMDGFLKGKCISCDLKVNETNAEYMVRKF 36 >UniRef50_B5Z195 Eae protein n=14 Tax=root RepID=B5Z195_ECO5E Length = 109 Score = 62.9 bits (151), Expect = 3e-09, Method: Composition-based stats. Identities = 30/35 (85%), Positives = 32/35 (91%) Query: 8 MEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKC 42 MEGFLRGKCVPRDLKVNETNAEYLVRKFD + A+ Sbjct: 1 MEGFLRGKCVPRDLKVNETNAEYLVRKFDEVRAEA 35 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.306 0.122 0.373 Lambda K H 0.267 0.0371 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 451,032,591 Number of Sequences: 3077464 Number of extensions: 15169816 Number of successful extensions: 38518 Number of sequences better than 1.0e-01: 9 Number of HSP's better than 0.1 without gapping: 18 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 38496 Number of HSP's gapped (non-prelim): 19 length of query: 120 length of database: 1,040,396,356 effective HSP length: 87 effective length of query: 33 effective length of database: 772,656,988 effective search space: 25497680604 effective search space used: 25497680604 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 87 (38.3 bits)