BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (81 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76544 Uncharacterized protein yffM n=3 Tax=Escherichia... 164 1e-39 UniRef50_C4TIH6 Predicted protein n=1 Tax=Escherichia coli O157:... 51 1e-05 UniRef50_B1IWL5 Putative uncharacterized protein n=3 Tax=Enterob... 47 2e-04 >UniRef50_P76544 Uncharacterized protein yffM n=3 Tax=Escherichia coli K-12 RepID=YFFM_ECOLI Length = 81 Score = 164 bits (414), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 81/81 (100%), Positives = 81/81 (100%) Query: 1 MMVIRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKR 60 MMVIRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKR Sbjct: 1 MMVIRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKR 60 Query: 61 LKLIKADIHAQLKACDCDISE 81 LKLIKADIHAQLKACDCDISE Sbjct: 61 LKLIKADIHAQLKACDCDISE 81 >UniRef50_C4TIH6 Predicted protein n=1 Tax=Escherichia coli O157:H7 RepID=C4TIH6_ECO57 Length = 93 Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 4 IRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKRLKL 63 IRH + KR LR FSELV GT LL+ A + EE+ F+KRL++ Sbjct: 21 IRHSNGD--LPMKRQQLRLFSELVFGTFHDLLKH-----IDAKDAPRNAEEREFIKRLRM 73 Query: 64 IKADIHAQLKACDCDISE 81 I+ D+H QL + CD+ + Sbjct: 74 IERDLHTQLSSVGCDVGD 91 >UniRef50_B1IWL5 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B1IWL5_ECOLC Length = 109 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 6/68 (8%) Query: 14 AQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKRLKLIKADIHAQLK 73 A KR LR ++E LGT++ +++ K+ + S EE F++RL LI+ DIH+QL Sbjct: 44 AMKRQTLRLWAEYSLGTINKIIDMKSGPS------NQSAEEMEFIRRLILIRRDIHSQLH 97 Query: 74 ACDCDISE 81 + DI++ Sbjct: 98 SVGIDIND 105 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76544 Uncharacterized protein yffM n=3 Tax=Escherichia... 115 4e-25 UniRef50_C4TIH6 Predicted protein n=1 Tax=Escherichia coli O157:... 99 5e-20 UniRef50_B1IWL5 Putative uncharacterized protein n=3 Tax=Enterob... 90 2e-17 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P76544 Uncharacterized protein yffM n=3 Tax=Escherichia coli K-12 RepID=YFFM_ECOLI Length = 81 Score = 115 bits (288), Expect = 4e-25, Method: Composition-based stats. Identities = 81/81 (100%), Positives = 81/81 (100%) Query: 1 MMVIRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKR 60 MMVIRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKR Sbjct: 1 MMVIRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKR 60 Query: 61 LKLIKADIHAQLKACDCDISE 81 LKLIKADIHAQLKACDCDISE Sbjct: 61 LKLIKADIHAQLKACDCDISE 81 >UniRef50_C4TIH6 Predicted protein n=1 Tax=Escherichia coli O157:H7 RepID=C4TIH6_ECO57 Length = 93 Score = 98.6 bits (244), Expect = 5e-20, Method: Composition-based stats. Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 4 IRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKRLKL 63 IRH + KR LR FSELV GT LL+ A + EE+ F+KRL++ Sbjct: 21 IRHSNGD--LPMKRQQLRLFSELVFGTFHDLLKH-----IDAKDAPRNAEEREFIKRLRM 73 Query: 64 IKADIHAQLKACDCDISE 81 I+ D+H QL + CD+ + Sbjct: 74 IERDLHTQLSSVGCDVGD 91 >UniRef50_B1IWL5 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B1IWL5_ECOLC Length = 109 Score = 90.2 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 7/78 (8%) Query: 4 IRHECPSYCIAQKRVALREFSELVLGTLSLLLEQKTNGKCSASLYDCSEEEKLFVKRLKL 63 ++H P A KR LR ++E LGT++ +++ K+ + S EE F++RL L Sbjct: 35 LKHS-PLSKQAMKRQTLRLWAEYSLGTINKIIDMKSGPS------NQSAEEMEFIRRLIL 87 Query: 64 IKADIHAQLKACDCDISE 81 I+ DIH+QL + DI++ Sbjct: 88 IRRDIHSQLHSVGIDIND 105 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.318 0.131 0.330 Lambda K H 0.267 0.0403 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 249,248,519 Number of Sequences: 3077464 Number of extensions: 6642433 Number of successful extensions: 24639 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 24632 Number of HSP's gapped (non-prelim): 6 length of query: 81 length of database: 1,040,396,356 effective HSP length: 52 effective length of query: 29 effective length of database: 880,368,228 effective search space: 25530678612 effective search space used: 25530678612 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.3 bits) S2: 87 (38.1 bits)