BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (103 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75971 Putative uncharacterized protein b1142 n=1 Tax=E... 209 2e-53 UniRef50_B7LNL5 Putative uncharacterized protein n=1 Tax=Escheri... 71 1e-11 UniRef50_B7MW52 Putative uncharacterized protein n=2 Tax=Escheri... 64 2e-09 UniRef50_D2TUV0 Putative prophage membrane protein n=1 Tax=Citro... 51 1e-05 UniRef50_C9X8C8 Predicted phage protein n=10 Tax=root RepID=C9X8... 50 2e-05 >UniRef50_P75971 Putative uncharacterized protein b1142 n=1 Tax=Escherichia coli K-12 RepID=Y1142_ECOLI Length = 103 Score = 209 bits (532), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 103/103 (100%), Positives = 103/103 (100%) Query: 1 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA 60 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA Sbjct: 1 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA 60 Query: 61 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEKLTDRAGSIC 103 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEKLTDRAGSIC Sbjct: 61 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEKLTDRAGSIC 103 >UniRef50_B7LNL5 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LNL5_ESCF3 Length = 95 Score = 70.9 bits (172), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 40/62 (64%), Positives = 45/62 (72%), Gaps = 9/62 (14%) Query: 1 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA 60 +VNAAQRTRVKVEAD+RPSVDTHPPGVQPSPGTG RH L V + L GT+ Sbjct: 21 VVNAAQRTRVKVEADSRPSVDTHPPGVQPSPGTG--RHPAPQLYAVCS-------LGGTS 71 Query: 61 LF 62 L+ Sbjct: 72 LY 73 >UniRef50_B7MW52 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7MW52_ECO81 Length = 355 Score = 63.5 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 29/39 (74%), Positives = 31/39 (79%) Query: 57 SGTALFTKQRRVSPDDGLITRPILIAVATGALLCFVEKL 95 +GTALFTKQRR SPDDGLIT+ I A G LLCFVEKL Sbjct: 4 AGTALFTKQRRASPDDGLITQSIRAAATAGVLLCFVEKL 42 >UniRef50_D2TUV0 Putative prophage membrane protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TUV0_CITRO Length = 55 Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 25/37 (67%), Positives = 30/37 (81%) Query: 41 FMLCVVLAVPVFSLVLSGTALFTKQRRVSPDDGLITR 77 +LCVVLAVPV SL + GTALFT++ R SP DGLIT+ Sbjct: 7 ILLCVVLAVPVPSLEVPGTALFTQETRASPGDGLITQ 43 >UniRef50_C9X8C8 Predicted phage protein n=10 Tax=root RepID=C9X8C8_SALTD Length = 177 Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 23/34 (67%), Positives = 26/34 (76%) Query: 61 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEK 94 LFT+Q+RVSP DGLIT+ AV GAL CFVEK Sbjct: 8 LFTQQKRVSPRDGLITQSTRKAVTAGALFCFVEK 41 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75971 Putative uncharacterized protein b1142 n=1 Tax=E... 180 9e-45 UniRef50_B7LNL5 Putative uncharacterized protein n=1 Tax=Escheri... 88 1e-16 UniRef50_B7MW52 Putative uncharacterized protein n=2 Tax=Escheri... 76 4e-13 UniRef50_C9X8C8 Predicted phage protein n=10 Tax=root RepID=C9X8... 68 8e-11 UniRef50_D2TUV0 Putative prophage membrane protein n=1 Tax=Citro... 61 1e-08 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P75971 Putative uncharacterized protein b1142 n=1 Tax=Escherichia coli K-12 RepID=Y1142_ECOLI Length = 103 Score = 180 bits (457), Expect = 9e-45, Method: Composition-based stats. Identities = 103/103 (100%), Positives = 103/103 (100%) Query: 1 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA 60 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA Sbjct: 1 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA 60 Query: 61 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEKLTDRAGSIC 103 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEKLTDRAGSIC Sbjct: 61 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEKLTDRAGSIC 103 >UniRef50_B7LNL5 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LNL5_ESCF3 Length = 95 Score = 87.9 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 40/62 (64%), Positives = 45/62 (72%), Gaps = 9/62 (14%) Query: 1 MVNAAQRTRVKVEADNRPSVDTHPPGVQPSPGTGGTRHHNFMLCVVLAVPVFSLVLSGTA 60 +VNAAQRTRVKVEAD+RPSVDTHPPGVQPSPGTG RH L V + L GT+ Sbjct: 21 VVNAAQRTRVKVEADSRPSVDTHPPGVQPSPGTG--RHPAPQLYAVCS-------LGGTS 71 Query: 61 LF 62 L+ Sbjct: 72 LY 73 >UniRef50_B7MW52 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7MW52_ECO81 Length = 355 Score = 75.6 bits (184), Expect = 4e-13, Method: Composition-based stats. Identities = 29/39 (74%), Positives = 31/39 (79%) Query: 57 SGTALFTKQRRVSPDDGLITRPILIAVATGALLCFVEKL 95 +GTALFTKQRR SPDDGLIT+ I A G LLCFVEKL Sbjct: 4 AGTALFTKQRRASPDDGLITQSIRAAATAGVLLCFVEKL 42 >UniRef50_C9X8C8 Predicted phage protein n=10 Tax=root RepID=C9X8C8_SALTD Length = 177 Score = 68.2 bits (165), Expect = 8e-11, Method: Composition-based stats. Identities = 24/36 (66%), Positives = 27/36 (75%) Query: 61 LFTKQRRVSPDDGLITRPILIAVATGALLCFVEKLT 96 LFT+Q+RVSP DGLIT+ AV GAL CFVEK T Sbjct: 8 LFTQQKRVSPRDGLITQSTRKAVTAGALFCFVEKPT 43 >UniRef50_D2TUV0 Putative prophage membrane protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TUV0_CITRO Length = 55 Score = 60.9 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 25/41 (60%), Positives = 30/41 (73%) Query: 41 FMLCVVLAVPVFSLVLSGTALFTKQRRVSPDDGLITRPILI 81 +LCVVLAVPV SL + GTALFT++ R SP DGLIT+ Sbjct: 7 ILLCVVLAVPVPSLEVPGTALFTQETRASPGDGLITQSTRA 47 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.127 0.369 Lambda K H 0.267 0.0373 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 422,700,732 Number of Sequences: 3077464 Number of extensions: 15708489 Number of successful extensions: 65021 Number of sequences better than 1.0e-01: 6 Number of HSP's better than 0.1 without gapping: 10 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 65009 Number of HSP's gapped (non-prelim): 11 length of query: 103 length of database: 1,040,396,356 effective HSP length: 72 effective length of query: 31 effective length of database: 818,818,948 effective search space: 25383387388 effective search space used: 25383387388 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 87 (38.3 bits)