BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (46 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q47272 Uncharacterized protein ylcG in lambdoid DLP12 p... 97 2e-19 UniRef50_B5Y0G4 Putative uncharacterized protein n=2 Tax=Klebsie... 42 0.008 UniRef50_C9XDJ8 Predicted prophage protein n=11 Tax=Enterobacter... 40 0.027 >UniRef50_Q47272 Uncharacterized protein ylcG in lambdoid DLP12 prophage region n=36 Tax=root RepID=YLCG_ECOLI Length = 46 Score = 96.7 bits (239), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 46/46 (100%), Positives = 46/46 (100%) Query: 1 MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV 46 MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV Sbjct: 1 MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV 46 >UniRef50_B5Y0G4 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=B5Y0G4_KLEP3 Length = 46 Score = 41.6 bits (96), Expect = 0.008, Method: Compositional matrix adjust. Identities = 19/40 (47%), Positives = 25/40 (62%) Query: 6 NMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAG 45 + E +R RW +LRL R G+VL DYRILKN+ + G Sbjct: 4 DAIERIRDRWQKLRLCRHRGTVLVDYRILKNFVRIYQTLG 43 >UniRef50_C9XDJ8 Predicted prophage protein n=11 Tax=Enterobacteriaceae RepID=C9XDJ8_SALTD Length = 48 Score = 39.7 bits (91), Expect = 0.027, Method: Compositional matrix adjust. Identities = 18/34 (52%), Positives = 24/34 (70%) Query: 6 NMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAK 39 + E+LR RW RLR+YR+ GS YRIL+NY + Sbjct: 4 ELIEILRMRWLRLRIYRYRGSFPVAYRILRNYVR 37 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q47272 Uncharacterized protein ylcG in lambdoid DLP12 p... 84 1e-15 Sequences not found previously or not previously below threshold: UniRef50_B5Y0G4 Putative uncharacterized protein n=2 Tax=Klebsie... 39 0.056 CONVERGED! >UniRef50_Q47272 Uncharacterized protein ylcG in lambdoid DLP12 prophage region n=36 Tax=root RepID=YLCG_ECOLI Length = 46 Score = 83.9 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 46/46 (100%), Positives = 46/46 (100%) Query: 1 MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV 46 MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV Sbjct: 1 MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV 46 >UniRef50_B5Y0G4 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=B5Y0G4_KLEP3 Length = 46 Score = 38.8 bits (89), Expect = 0.056, Method: Composition-based stats. Identities = 19/37 (51%), Positives = 24/37 (64%) Query: 9 ELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAG 45 E +R RW +LRL R G+VL DYRILKN+ + G Sbjct: 7 ERIRDRWQKLRLCRHRGTVLVDYRILKNFVRIYQTLG 43 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.331 0.144 0.453 Lambda K H 0.267 0.0467 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 230,481,436 Number of Sequences: 3077464 Number of extensions: 6771665 Number of successful extensions: 18984 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 18978 Number of HSP's gapped (non-prelim): 6 length of query: 46 length of database: 1,040,396,356 effective HSP length: 20 effective length of query: 26 effective length of database: 978,847,076 effective search space: 25450023976 effective search space used: 25450023976 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.9 bits) S2: 87 (37.9 bits)