BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (78 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75687 Uncharacterized protein ykgI n=76 Tax=Enterobact... 154 8e-37 UniRef50_B5PFA9 Putative uncharacterized protein n=1 Tax=Salmone... 42 0.007 UniRef50_B7NH61 Putative uncharacterized protein n=2 Tax=Enterob... 39 0.058 >UniRef50_P75687 Uncharacterized protein ykgI n=76 Tax=Enterobacteriaceae RepID=YKGI_ECOLI Length = 78 Score = 154 bits (389), Expect = 8e-37, Method: Compositional matrix adjust. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLEDTIADIARKYNASSWK 60 MFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLEDTIADIARKYNASSWK Sbjct: 1 MFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLEDTIADIARKYNASSWK 60 Query: 61 VTSMRIDNNSTATAVLYK 78 VTSMRIDNNSTATAVLYK Sbjct: 61 VTSMRIDNNSTATAVLYK 78 >UniRef50_B5PFA9 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537 RepID=B5PFA9_SALET Length = 87 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. Identities = 19/49 (38%), Positives = 30/49 (61%) Query: 30 HISVSSVSASPTVLEDTIADIARKYNASSWKVTSMRIDNNSTATAVLYK 78 H+SVSS++ +P + + + A +++TS IDN S ATA+LYK Sbjct: 39 HVSVSSINTNPESIIQLLQNKTEASGAHYYRITSFHIDNQSHATAILYK 87 >UniRef50_B7NH61 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B7NH61_ECOLU Length = 70 Score = 38.5 bits (88), Expect = 0.058, Method: Compositional matrix adjust. Identities = 23/50 (46%), Positives = 31/50 (62%), Gaps = 3/50 (6%) Query: 32 SVSSVSASPTVL---EDTIADIARKYNASSWKVTSMRIDNNSTATAVLYK 78 S SV+A+ T L E IA A++ NAS +K+TS R+ N T +A LYK Sbjct: 21 SAQSVTATGTTLDSAEAVIAQKAKEINASEYKITSARMGNTVTMSAELYK 70 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75687 Uncharacterized protein ykgI n=76 Tax=Enterobact... 122 2e-27 Sequences not found previously or not previously below threshold: UniRef50_B5PFA9 Putative uncharacterized protein n=1 Tax=Salmone... 39 0.039 CONVERGED! >UniRef50_P75687 Uncharacterized protein ykgI n=76 Tax=Enterobacteriaceae RepID=YKGI_ECOLI Length = 78 Score = 122 bits (307), Expect = 2e-27, Method: Composition-based stats. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLEDTIADIARKYNASSWK 60 MFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLEDTIADIARKYNASSWK Sbjct: 1 MFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLEDTIADIARKYNASSWK 60 Query: 61 VTSMRIDNNSTATAVLYK 78 VTSMRIDNNSTATAVLYK Sbjct: 61 VTSMRIDNNSTATAVLYK 78 >UniRef50_B5PFA9 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537 RepID=B5PFA9_SALET Length = 87 Score = 39.4 bits (90), Expect = 0.039, Method: Composition-based stats. Identities = 19/51 (37%), Positives = 31/51 (60%) Query: 28 LKHISVSSVSASPTVLEDTIADIARKYNASSWKVTSMRIDNNSTATAVLYK 78 + H+SVSS++ +P + + + A +++TS IDN S ATA+LYK Sbjct: 37 VAHVSVSSINTNPESIIQLLQNKTEASGAHYYRITSFHIDNQSHATAILYK 87 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.315 0.123 0.327 Lambda K H 0.267 0.0388 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 241,883,571 Number of Sequences: 3077464 Number of extensions: 6404930 Number of successful extensions: 21825 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 21821 Number of HSP's gapped (non-prelim): 6 length of query: 78 length of database: 1,040,396,356 effective HSP length: 49 effective length of query: 29 effective length of database: 889,600,620 effective search space: 25798417980 effective search space used: 25798417980 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 87 (38.2 bits)