BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements",
Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (56 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P38394 Uncharacterized protein ydaE n=16 Tax=Enterobact...    114   8e-25
UniRef50_A9N3H4 Putative uncharacterized protein n=5 Tax=Salmone...     51   1e-05
UniRef50_A8AHS8 Putative uncharacterized protein n=2 Tax=Enterob...     45   5e-04
UniRef50_A9N4J4 Putative uncharacterized protein n=5 Tax=Salmone...     45   0.001
UniRef50_O30352 Gifsy-1 prophage protein n=18 Tax=Enterobacteria...     44   0.002

>UniRef50_P38394 Uncharacterized protein ydaE n=16 Tax=Enterobacteriaceae RepID=YDAE_ECOLI
          Length = 56

 Score = 114 bits (286), Expect = 8e-25, Method: Compositional matrix adjust.
 Identities = 56/56 (100%), Positives = 56/56 (100%)

Query: 1  MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL 56
          MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL
Sbjct: 1  MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL 56

>UniRef50_A9N3H4 Putative uncharacterized protein n=5 Tax=Salmonella enterica subsp. enterica RepID=A9N3H4_SALPB
          Length = 56

 Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 31/55 (56%)

Query: 1  MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHE 55
          M  +  CAYHLC K +E+ K ++  L  + G     E R YCS  CA  DQMAHE
Sbjct: 1  MHNQKTCAYHLCGKTIEQGKEVKNELTLIRGAQLTHEERDYCSVRCASYDQMAHE 55

>UniRef50_A8AHS8 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8AHS8_CITK8
          Length = 52

 Score = 45.4 bits (106), Expect = 5e-04, Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 33/49 (67%), Gaps = 2/49 (4%)

Query: 7  CAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHE 55
          C Y  C+K VEE K ++ +L + +G    ++ ++YCS+ CAE DQMAHE
Sbjct: 5  CGY--CRKPVEEGKEVKSILFYRNGNRLANKEKEYCSKQCAEYDQMAHE 51

>UniRef50_A9N4J4 Putative uncharacterized protein n=5 Tax=Salmonella enterica subsp. enterica RepID=A9N4J4_SALPB
          Length = 93

 Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
 Identities = 26/52 (50%), Positives = 32/52 (61%), Gaps = 3/52 (5%)

Query: 5  IKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL 56
          I CAY  C+K + E  A E  L +MH  L   + +KYCS+ CA  DQMAHEL
Sbjct: 45 INCAY--CQKAIPEETAYEYELIYMHETLISRK-KKYCSKRCASHDQMAHEL 93

>UniRef50_O30352 Gifsy-1 prophage protein n=18 Tax=Enterobacteriaceae RepID=O30352_SALTY
          Length = 52

 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 32/50 (64%), Gaps = 2/50 (4%)

Query: 6  KCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHE 55
          +C Y  C+K ++E K ++  L +++G     + ++YCS  CAE DQMAHE
Sbjct: 4  QCGY--CRKSIDEGKEVKNTLLYLNGSQLARKEKEYCSRQCAEYDQMAHE 51

Searching..................................................done

Results from round 2

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
Sequences used in model and found again:

UniRef50_A9N3H4 Putative uncharacterized protein n=5 Tax=Salmone...     72   7e-12
UniRef50_P38394 Uncharacterized protein ydaE n=16 Tax=Enterobact...     68   1e-10
UniRef50_A9N4J4 Putative uncharacterized protein n=5 Tax=Salmone...     67   2e-10
UniRef50_O30352 Gifsy-1 prophage protein n=18 Tax=Enterobacteria...     60   2e-08
UniRef50_A8AHS8 Putative uncharacterized protein n=2 Tax=Enterob...     59   5e-08

Sequences not found previously or not previously below threshold:

CONVERGED!

>UniRef50_A9N3H4 Putative uncharacterized protein n=5 Tax=Salmonella enterica subsp. enterica RepID=A9N3H4_SALPB
          Length = 56

 Score = 71.7 bits (174), Expect = 7e-12, Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 31/55 (56%)

Query: 1  MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHE 55
          M  +  CAYHLC K +E+ K ++  L  + G     E R YCS  CA  DQMAHE
Sbjct: 1  MHNQKTCAYHLCGKTIEQGKEVKNELTLIRGAQLTHEERDYCSVRCASYDQMAHE 55

>UniRef50_P38394 Uncharacterized protein ydaE n=16 Tax=Enterobacteriaceae RepID=YDAE_ECOLI
          Length = 56

 Score = 67.9 bits (164), Expect = 1e-10, Method: Composition-based stats.
 Identities = 56/56 (100%), Positives = 56/56 (100%)

Query: 1  MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL 56
          MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL
Sbjct: 1  MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL 56

>UniRef50_A9N4J4 Putative uncharacterized protein n=5 Tax=Salmonella enterica subsp. enterica RepID=A9N4J4_SALPB
          Length = 93

 Score = 67.1 bits (162), Expect = 2e-10, Method: Composition-based stats.
 Identities = 26/52 (50%), Positives = 32/52 (61%), Gaps = 3/52 (5%)

Query: 5  IKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL 56
          I CAY  C+K + E  A E  L +MH  L   + +KYCS+ CA  DQMAHEL
Sbjct: 45 INCAY--CQKAIPEETAYEYELIYMHETLISRK-KKYCSKRCASHDQMAHEL 93

>UniRef50_O30352 Gifsy-1 prophage protein n=18 Tax=Enterobacteriaceae RepID=O30352_SALTY
          Length = 52

 Score = 59.8 bits (143), Expect = 2e-08, Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 5  IKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHE 55
           +C Y  C+K ++E K ++  L +++G     + ++YCS  CAE DQMAHE
Sbjct: 3  KQCGY--CRKSIDEGKEVKNTLLYLNGSQLARKEKEYCSRQCAEYDQMAHE 51

>UniRef50_A8AHS8 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8AHS8_CITK8
          Length = 52

 Score = 58.6 bits (140), Expect = 5e-08, Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 35/52 (67%), Gaps = 2/52 (3%)

Query: 4  KIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHE 55
          + +C Y  C+K VEE K ++ +L + +G    ++ ++YCS+ CAE DQMAHE
Sbjct: 2  QKECGY--CRKPVEEGKEVKSILFYRNGNRLANKEKEYCSKQCAEYDQMAHE 51

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.309    0.124    0.368

Lambda     K      H
   0.267   0.0382    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 199,814,945
Number of Sequences: 3077464
Number of extensions: 4678356
Number of successful extensions: 12057
Number of sequences better than 1.0e-01: 5
Number of HSP's better than 0.1 without gapping: 9
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 12046
Number of HSP's gapped (non-prelim): 10
length of query: 56
length of database: 1,040,396,356
effective HSP length: 29
effective length of query: 27
effective length of database: 951,149,900
effective search space: 25681047300
effective search space used: 25681047300
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 87 (38.2 bits)
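
Note on the search-space figures reported above: they are consistent with subtracting the effective HSP length once from the query and once per database sequence. The sketch below reproduces that arithmetic from the reported values. The function name and its parameters are illustrative and not part of BLAST, and the formula is a simplified reading of the numbers in this report (no clamping or other edge-case handling), not a description of BLAST's internal code.

```python
def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Simplified assumption: the effective HSP length is subtracted once from
    the query and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


# Values copied from the statistics block of this report.
eff_query, eff_db, space = effective_search_space(
    query_len=56,
    db_letters=1_040_396_356,
    db_seqs=3_077_464,
    hsp_len=29,
)
print(eff_query, eff_db, space)  # 27 951149900 25681047300
```

The three printed values match the "effective length of query", "effective length of database", and "effective search space" lines of the report.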