BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (159 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_A5A605 Uncharacterized protein ykfM n=9 Tax=Enterobacte... 320 1e-86 UniRef50_B5F214 Putative uncharacterized protein n=1 Tax=Salmone... 59 6e-08 >UniRef50_A5A605 Uncharacterized protein ykfM n=9 Tax=Enterobacteriaceae RepID=YKFM_ECOLI Length = 159 Score = 320 bits (819), Expect = 1e-86, Method: Compositional matrix adjust. Identities = 159/159 (100%), Positives = 159/159 (100%) Query: 1 MRLHVKLKEFLSMFFMAILFFPAFNASLFFTGVKPLYSIIKCSTEIFYDWRMLILCFGFM 60 MRLHVKLKEFLSMFFMAILFFPAFNASLFFTGVKPLYSIIKCSTEIFYDWRMLILCFGFM Sbjct: 1 MRLHVKLKEFLSMFFMAILFFPAFNASLFFTGVKPLYSIIKCSTEIFYDWRMLILCFGFM 60 Query: 61 SFSFLNIHVILLTIIKSFLIKKTKVVNFATDITIQLTLIFLLIAIVIAPLIAPFVTGYVN 120 SFSFLNIHVILLTIIKSFLIKKTKVVNFATDITIQLTLIFLLIAIVIAPLIAPFVTGYVN Sbjct: 61 SFSFLNIHVILLTIIKSFLIKKTKVVNFATDITIQLTLIFLLIAIVIAPLIAPFVTGYVN 120 Query: 121 TNYHPCGNNTGIFPGAIYIKNGMKCNNGYISRKEDSAVK 159 TNYHPCGNNTGIFPGAIYIKNGMKCNNGYISRKEDSAVK Sbjct: 121 TNYHPCGNNTGIFPGAIYIKNGMKCNNGYISRKEDSAVK 159 >UniRef50_B5F214 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Agona str. SL483 RepID=B5F214_SALA4 Length = 148 Score = 58.5 bits (140), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 40/118 (33%), Positives = 63/118 (53%), Gaps = 13/118 (11%) Query: 37 YSIIKCSTEIFYDWRMLILCFGFMSFSFLNIHVILLTIIKSFLIKKTKV-VNFATDITIQ 95 YS+IK S I+YDW+ ++ G S FL I L I SF+I +T V V + Sbjct: 36 YSLIKGSDRIYYDWKGIV---GLYSIPFLFFTYIYLVI--SFIISETNVLVKKNKKKSHN 90 Query: 96 LTLIFL-------LIAIVIAPLIAPFVTGYVNTNYHPCGNNTGIFPGAIYIKNGMKCN 146 + + +L L+ I+I + F+T YV+++Y PC ++GI+ G Y+K G C+ Sbjct: 91 IIMKYLGPIIALSLLCIIIGFISTFFITSYVHSHYTPCDGSSGIYSGKNYVKGGATCH 148 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A5A605 Uncharacterized protein ykfM n=9 Tax=Enterobacte... 224 8e-58 UniRef50_B5F214 Putative uncharacterized protein n=1 Tax=Salmone... 120 2e-26 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_A5A605 Uncharacterized protein ykfM n=9 Tax=Enterobacteriaceae RepID=YKFM_ECOLI Length = 159 Score = 224 bits (571), Expect = 8e-58, Method: Composition-based stats. Identities = 159/159 (100%), Positives = 159/159 (100%) Query: 1 MRLHVKLKEFLSMFFMAILFFPAFNASLFFTGVKPLYSIIKCSTEIFYDWRMLILCFGFM 60 MRLHVKLKEFLSMFFMAILFFPAFNASLFFTGVKPLYSIIKCSTEIFYDWRMLILCFGFM Sbjct: 1 MRLHVKLKEFLSMFFMAILFFPAFNASLFFTGVKPLYSIIKCSTEIFYDWRMLILCFGFM 60 Query: 61 SFSFLNIHVILLTIIKSFLIKKTKVVNFATDITIQLTLIFLLIAIVIAPLIAPFVTGYVN 120 SFSFLNIHVILLTIIKSFLIKKTKVVNFATDITIQLTLIFLLIAIVIAPLIAPFVTGYVN Sbjct: 61 SFSFLNIHVILLTIIKSFLIKKTKVVNFATDITIQLTLIFLLIAIVIAPLIAPFVTGYVN 120 Query: 121 TNYHPCGNNTGIFPGAIYIKNGMKCNNGYISRKEDSAVK 159 TNYHPCGNNTGIFPGAIYIKNGMKCNNGYISRKEDSAVK Sbjct: 121 TNYHPCGNNTGIFPGAIYIKNGMKCNNGYISRKEDSAVK 159 >UniRef50_B5F214 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Agona str. SL483 RepID=B5F214_SALA4 Length = 148 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 40/118 (33%), Positives = 63/118 (53%), Gaps = 13/118 (11%) Query: 37 YSIIKCSTEIFYDWRMLILCFGFMSFSFLNIHVILLTIIKSFLIKKTKV-VNFATDITIQ 95 YS+IK S I+YDW+ ++ G S FL I L I SF+I +T V V + Sbjct: 36 YSLIKGSDRIYYDWKGIV---GLYSIPFLFFTYIYLVI--SFIISETNVLVKKNKKKSHN 90 Query: 96 LTLIFL-------LIAIVIAPLIAPFVTGYVNTNYHPCGNNTGIFPGAIYIKNGMKCN 146 + + +L L+ I+I + F+T YV+++Y PC ++GI+ G Y+K G C+ Sbjct: 91 IIMKYLGPIIALSLLCIIIGFISTFFITSYVHSHYTPCDGSSGIYSGKNYVKGGATCH 148 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.333 0.146 0.431 Lambda K H 0.267 0.0449 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 674,008,184 Number of Sequences: 3077464 Number of extensions: 29411344 Number of successful extensions: 161855 Number of sequences better than 1.0e-01: 26 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 26 Number of HSP's that attempted gapping in prelim test: 161827 Number of HSP's gapped (non-prelim): 36 length of query: 159 length of database: 1,040,396,356 effective HSP length: 118 effective length of query: 41 effective length of database: 677,255,604 effective search space: 27767479764 effective search space used: 27767479764 T: 11 A: 40 X1: 16 ( 7.7 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.5 bits) S2: 88 (38.4 bits)