BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (77 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q9JMS6 Uncharacterized protein yuaN n=1 Tax=Escherichia... 160 1e-38 UniRef50_UPI0001BCF39F hypothetical protein EscherichiacoliO157_... 45 7e-04 UniRef50_Q9F571 YaiB protein n=5 Tax=Escherichia coli RepID=Q9F5... 40 0.031 >UniRef50_Q9JMS6 Uncharacterized protein yuaN n=1 Tax=Escherichia coli K-12 RepID=YUAN_ECOLI Length = 77 Score = 160 bits (404), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 77/77 (100%), Positives = 77/77 (100%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL 60 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL Sbjct: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL 60 Query: 61 DANDKEIIRGMIDELKK 77 DANDKEIIRGMIDELKK Sbjct: 61 DANDKEIIRGMIDELKK 77 >UniRef50_UPI0001BCF39F hypothetical protein EscherichiacoliO157_17308 n=1 Tax=Escherichia coli O157:H7 str. FRIK2000 RepID=UPI0001BCF39F Length = 84 Score = 45.1 bits (105), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 4/80 (5%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKR----DTSNCNHLHIIRELSFNEDELELIDFS 56 M + G TYY+ +Y F KK++ D + EL FNE E+E IDF+ Sbjct: 1 MRKNKGRLTYYLEVIDKKYHFVKKISSYSKEFTDGKTKRTKRTLSELVFNESEVEAIDFT 60 Query: 57 TDGLDANDKEIIRGMIDELK 76 +GL DK I+ M+ E K Sbjct: 61 KNGLRPVDKNILLTMVKEYK 80 >UniRef50_Q9F571 YaiB protein n=5 Tax=Escherichia coli RepID=Q9F571_ECOLX Length = 86 Score = 39.7 bits (91), Expect = 0.031, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 4/74 (5%) Query: 6 GEFTYYVYQQSGRYFFCKKLTKKRDTSNCNH----LHIIRELSFNEDELELIDFSTDGLD 61 G+ YY+ ++ + K++ ++ + + +L F+E + IDF++DGL Sbjct: 6 GDRQYYLNKEGDTFHLVKRVKTFSKSATLGKTKATVKTVADLVFHEKAFDTIDFASDGLR 65 Query: 62 ANDKEIIRGMIDEL 75 NDKEII MI E+ Sbjct: 66 ENDKEIIFMMIQEM 79 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9JMS6 Uncharacterized protein yuaN n=1 Tax=Escherichia... 111 1e-23 UniRef50_UPI0001BCF39F hypothetical protein EscherichiacoliO157_... 89 3e-17 Sequences not found previously or not previously below threshold: UniRef50_Q9F571 YaiB protein n=5 Tax=Escherichia coli RepID=Q9F5... 49 4e-05 >UniRef50_Q9JMS6 Uncharacterized protein yuaN n=1 Tax=Escherichia coli K-12 RepID=YUAN_ECOLI Length = 77 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 77/77 (100%), Positives = 77/77 (100%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL 60 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL Sbjct: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL 60 Query: 61 DANDKEIIRGMIDELKK 77 DANDKEIIRGMIDELKK Sbjct: 61 DANDKEIIRGMIDELKK 77 >UniRef50_UPI0001BCF39F hypothetical protein EscherichiacoliO157_17308 n=1 Tax=Escherichia coli O157:H7 str. FRIK2000 RepID=UPI0001BCF39F Length = 84 Score = 89.3 bits (220), Expect = 3e-17, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 4/81 (4%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKR----DTSNCNHLHIIRELSFNEDELELIDFS 56 M + G TYY+ +Y F KK++ D + EL FNE E+E IDF+ Sbjct: 1 MRKNKGRLTYYLEVIDKKYHFVKKISSYSKEFTDGKTKRTKRTLSELVFNESEVEAIDFT 60 Query: 57 TDGLDANDKEIIRGMIDELKK 77 +GL DK I+ M+ E K+ Sbjct: 61 KNGLRPVDKNILLTMVKEYKE 81 >UniRef50_Q9F571 YaiB protein n=5 Tax=Escherichia coli RepID=Q9F571_ECOLX Length = 86 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 4/81 (4%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRD----TSNCNHLHIIRELSFNEDELELIDFS 56 M G+ YY+ ++ + K++ + + +L F+E + IDF+ Sbjct: 1 MKIRKGDRQYYLNKEGDTFHLVKRVKTFSKSATLGKTKATVKTVADLVFHEKAFDTIDFA 60 Query: 57 TDGLDANDKEIIRGMIDELKK 77 +DGL NDKEII MI E+ + Sbjct: 61 SDGLRENDKEIIFMMIQEMSE 81 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9JMS6 Uncharacterized protein yuaN n=1 Tax=Escherichia... 98 9e-20 UniRef50_Q9F571 YaiB protein n=5 Tax=Escherichia coli RepID=Q9F5... 85 4e-16 UniRef50_UPI0001BCF39F hypothetical protein EscherichiacoliO157_... 80 2e-14 Sequences not found previously or not previously below threshold: UniRef50_B1VJ90 Putative uncharacterized protein n=6 Tax=Enterob... 40 0.017 CONVERGED! >UniRef50_Q9JMS6 Uncharacterized protein yuaN n=1 Tax=Escherichia coli K-12 RepID=YUAN_ECOLI Length = 77 Score = 97.8 bits (242), Expect = 9e-20, Method: Composition-based stats. Identities = 77/77 (100%), Positives = 77/77 (100%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL 60 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL Sbjct: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRDTSNCNHLHIIRELSFNEDELELIDFSTDGL 60 Query: 61 DANDKEIIRGMIDELKK 77 DANDKEIIRGMIDELKK Sbjct: 61 DANDKEIIRGMIDELKK 77 >UniRef50_Q9F571 YaiB protein n=5 Tax=Escherichia coli RepID=Q9F571_ECOLX Length = 86 Score = 85.4 bits (210), Expect = 4e-16, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 4/81 (4%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKRD----TSNCNHLHIIRELSFNEDELELIDFS 56 M G+ YY+ ++ + K++ + + +L F+E + IDF+ Sbjct: 1 MKIRKGDRQYYLNKEGDTFHLVKRVKTFSKSATLGKTKATVKTVADLVFHEKAFDTIDFA 60 Query: 57 TDGLDANDKEIIRGMIDELKK 77 +DGL NDKEII MI E+ + Sbjct: 61 SDGLRENDKEIIFMMIQEMSE 81 >UniRef50_UPI0001BCF39F hypothetical protein EscherichiacoliO157_17308 n=1 Tax=Escherichia coli O157:H7 str. FRIK2000 RepID=UPI0001BCF39F Length = 84 Score = 80.4 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 4/81 (4%) Query: 1 MVRYHGEFTYYVYQQSGRYFFCKKLTKKR----DTSNCNHLHIIRELSFNEDELELIDFS 56 M + G TYY+ +Y F KK++ D + EL FNE E+E IDF+ Sbjct: 1 MRKNKGRLTYYLEVIDKKYHFVKKISSYSKEFTDGKTKRTKRTLSELVFNESEVEAIDFT 60 Query: 57 TDGLDANDKEIIRGMIDELKK 77 +GL DK I+ M+ E K+ Sbjct: 61 KNGLRPVDKNILLTMVKEYKE 81 >UniRef50_B1VJ90 Putative uncharacterized protein n=6 Tax=Enterobacteriaceae RepID=B1VJ90_PROMH Length = 88 Score = 40.4 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 5/80 (6%) Query: 1 MVRYHGEFTYYVYQQS-GRYFFCKKLTKKR----DTSNCNHLHIIRELSFNEDELELIDF 55 + + G+ TYY+ +++ Y KK+ + + + +L D+L +D+ Sbjct: 4 IRKNKGDVTYYLSRENNDSYRLIKKIKARATHLVKDGHKTTKVTLSDLLLTHDQLYNLDY 63 Query: 56 STDGLDANDKEIIRGMIDEL 75 S +GL A+DK I +I E Sbjct: 64 SLNGLRADDKATIELLIGEF 83 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.324 0.141 0.352 Lambda K H 0.267 0.0420 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 423,708,164 Number of Sequences: 3077464 Number of extensions: 13511039 Number of successful extensions: 45412 Number of sequences better than 1.0e-01: 4 Number of HSP's better than 0.1 without gapping: 8 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 45398 Number of HSP's gapped (non-prelim): 11 length of query: 77 length of database: 1,040,396,356 effective HSP length: 48 effective length of query: 29 effective length of database: 892,678,084 effective search space: 25887664436 effective search space used: 25887664436 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 87 (38.1 bits)