BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (83 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P07010 Uncharacterized protein rem n=50 Tax=Bacteria Re... 168 4e-41 UniRef50_B5MUF2 Putative uncharacterized protein n=1 Tax=Salmone... 44 0.002 UniRef50_C2DWS1 Putative uncharacterized protein n=2 Tax=Escheri... 42 0.007 >UniRef50_P07010 Uncharacterized protein rem n=50 Tax=Bacteria RepID=REM_ECOLI Length = 83 Score = 168 bits (426), Expect = 4e-41, Method: Compositional matrix adjust. Identities = 83/83 (100%), Positives = 83/83 (100%) Query: 1 MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGE 60 MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGE Sbjct: 1 MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGE 60 Query: 61 FVTSLQALKEIGIKAGFSAFSGE 83 FVTSLQALKEIGIKAGFSAFSGE Sbjct: 61 FVTSLQALKEIGIKAGFSAFSGE 83 >UniRef50_B5MUF2 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29 RepID=B5MUF2_SALET Length = 64 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. Identities = 21/45 (46%), Positives = 25/45 (55%) Query: 34 CLRKNGAALINFVDARVTDGFILREGEFVTSLQALKEIGIKAGFS 78 L G LI ++ GF+LRE EFVTS Q L+E G AGF Sbjct: 11 LLTHTGPVLIYLINGEAQRGFLLRENEFVTSWQELQEAGKLAGFP 55 >UniRef50_C2DWS1 Putative uncharacterized protein n=2 Tax=Escherichia RepID=C2DWS1_ECOLX Length = 73 Score = 41.6 bits (96), Expect = 0.007, Method: Compositional matrix adjust. Identities = 19/26 (73%), Positives = 22/26 (84%) Query: 1 MMNIEELRKIFCEDGLYAVCVENGNL 26 +M IEELR+IF EDGLYAV VENG + Sbjct: 16 LMKIEELREIFSEDGLYAVRVENGGI 41 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P07010 Uncharacterized protein rem n=50 Tax=Bacteria Re... 165 4e-40 Sequences not found previously or not previously below threshold: UniRef50_C2DWS1 Putative uncharacterized protein n=2 Tax=Escheri... 42 0.005 UniRef50_B5MUF2 Putative uncharacterized protein n=1 Tax=Salmone... 42 0.006 CONVERGED! >UniRef50_P07010 Uncharacterized protein rem n=50 Tax=Bacteria RepID=REM_ECOLI Length = 83 Score = 165 bits (418), Expect = 4e-40, Method: Composition-based stats. Identities = 83/83 (100%), Positives = 83/83 (100%) Query: 1 MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGE 60 MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGE Sbjct: 1 MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGE 60 Query: 61 FVTSLQALKEIGIKAGFSAFSGE 83 FVTSLQALKEIGIKAGFSAFSGE Sbjct: 61 FVTSLQALKEIGIKAGFSAFSGE 83 >UniRef50_C2DWS1 Putative uncharacterized protein n=2 Tax=Escherichia RepID=C2DWS1_ECOLX Length = 73 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 19/26 (73%), Positives = 22/26 (84%) Query: 1 MMNIEELRKIFCEDGLYAVCVENGNL 26 +M IEELR+IF EDGLYAV VENG + Sbjct: 16 LMKIEELREIFSEDGLYAVRVENGGI 41 >UniRef50_B5MUF2 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29 RepID=B5MUF2_SALET Length = 64 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 21/43 (48%), Positives = 25/43 (58%) Query: 35 LRKNGAALINFVDARVTDGFILREGEFVTSLQALKEIGIKAGF 77 L G LI ++ GF+LRE EFVTS Q L+E G AGF Sbjct: 12 LTHTGPVLIYLINGEAQRGFLLRENEFVTSWQELQEAGKLAGF 54 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.326 0.142 0.412 Lambda K H 0.267 0.0463 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 371,280,639 Number of Sequences: 3077464 Number of extensions: 14448588 Number of successful extensions: 33430 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 33424 Number of HSP's gapped (non-prelim): 6 length of query: 83 length of database: 1,040,396,356 effective HSP length: 53 effective length of query: 30 effective length of database: 877,290,764 effective search space: 26318722920 effective search space used: 26318722920 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 87 (37.9 bits)