BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (57 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64454 Uncharacterized protein ydcX n=130 Tax=Enterobac... 78 9e-14 UniRef50_C9Y560 Uncharacterized protein ydcX n=6 Tax=Enterobacte... 62 4e-09 UniRef50_C4X918 Putative uncharacterized protein n=5 Tax=Klebsie... 62 8e-09 UniRef50_A8GFE6 Putative uncharacterized protein n=2 Tax=Serrati... 54 2e-06 UniRef50_B4EZ06 Putative membrane protein n=3 Tax=Enterobacteria... 53 2e-06 UniRef50_A1JLX1 Putative membrane protein n=1 Tax=Yersinia enter... 52 6e-06 UniRef50_C5BE08 Putative uncharacterized protein n=3 Tax=Enterob... 43 0.003 UniRef50_Q2TJ82 Z4199 n=13 Tax=Escherichia coli RepID=Q2TJ82_ECOLX 41 0.009 UniRef50_C6C3M0 Putative uncharacterized protein n=8 Tax=Enterob... 40 0.031 >UniRef50_P64454 Uncharacterized protein ydcX n=130 Tax=Enterobacteriaceae RepID=YDCX_ECOL6 Length = 57 Score = 77.8 bits (190), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 36/57 (63%), Positives = 45/57 (78%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 M+L+ +L+FY + I+F+I WF+SH+K IR LSAFLVG TWPMS PVALLFSLF Sbjct: 1 MSLYQHMLVFYAVMAAIAFLITWFLSHDKKRIRFLSAFLVGATWPMSFPVALLFSLF 57 >UniRef50_C9Y560 Uncharacterized protein ydcX n=6 Tax=Enterobacteriaceae RepID=C9Y560_CROTZ Length = 57 Score = 62.4 bits (150), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 29/57 (50%), Positives = 42/57 (73%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 M L+ +L+FY + +I+ +I +FI+ +KT I+LL A L+G TWPMS PVALL +LF Sbjct: 1 MTLYQHMLVFYAVMASIAALITFFIAKDKTSIKLLCATLIGATWPMSFPVALLVALF 57 >UniRef50_C4X918 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X918_KLEPN Length = 75 Score = 61.6 bits (148), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 31/57 (54%), Positives = 39/57 (68%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 M L+ K+L+FY I I VI WF++ E IRLL+A L+G TWP S P+ALL SLF Sbjct: 19 MTLYQKMLVFYAIMGVICAVISWFLTKESRMIRLLAALLIGATWPFSFPMALLVSLF 75 >UniRef50_A8GFE6 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GFE6_SERP5 Length = 58 Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 23/48 (47%), Positives = 35/48 (72%) Query: 10 FYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 FY+IG+ +S + + +S + IR +SA +G+TWP+SLPV L+FSLF Sbjct: 11 FYLIGLLVSMIFTYLVSRDTIKIRFISALTIGLTWPLSLPVVLMFSLF 58 >UniRef50_B4EZ06 Putative membrane protein n=3 Tax=Enterobacteriaceae RepID=B4EZ06_PROMH Length = 59 Score = 53.1 bits (126), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 27/58 (46%), Positives = 45/58 (77%), Gaps = 1/58 (1%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKT-HIRLLSAFLVGITWPMSLPVALLFSLF 57 M++ + I I Y+IG +SF+I +F++ + + +RLLSAFL+ ITWP+S P+A++FS+F Sbjct: 1 MSVIAIIGIGYLIGAVLSFIITFFLTKDPSLAMRLLSAFLIAITWPLSFPMAIIFSIF 58 >UniRef50_A1JLX1 Putative membrane protein n=1 Tax=Yersinia enterocolitica subsp. enterocolitica 8081 RepID=A1JLX1_YERE8 Length = 58 Score = 52.0 bits (123), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 27/52 (51%), Positives = 40/52 (76%), Gaps = 1/52 (1%) Query: 7 ILIFYVIGVNISFVIIWFISHEKTHI-RLLSAFLVGITWPMSLPVALLFSLF 57 +L YV+GV IS II+ ++ +++ + R LS+ L+GITWP+SLPVALL S+F Sbjct: 7 LLCIYVVGVIISTFIIFPLTRDQSLVVRCLSSLLIGITWPLSLPVALLLSMF 58 >UniRef50_C5BE08 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=C5BE08_EDWI9 Length = 61 Score = 42.7 bits (99), Expect = 0.003, Method: Compositional matrix adjust. Identities = 24/48 (50%), Positives = 32/48 (66%), Gaps = 1/48 (2%) Query: 11 YVIGVNISFVIIWFISHEKT-HIRLLSAFLVGITWPMSLPVALLFSLF 57 Y IGV + +F + + + IRLLSA +VG+TWPMS P+ LLF LF Sbjct: 13 YGIGVLFCAFVTFFATKDPSMKIRLLSAIMVGLTWPMSFPLVLLFWLF 60 >UniRef50_Q2TJ82 Z4199 n=13 Tax=Escherichia coli RepID=Q2TJ82_ECOLX Length = 59 Score = 41.2 bits (95), Expect = 0.009, Method: Compositional matrix adjust. Identities = 18/46 (39%), Positives = 32/46 (69%) Query: 3 LFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSL 48 + +K+L+ Y+IG+ ++ +I+ S EK RL +A + G+TWP+SL Sbjct: 1 MINKLLLAYLIGLVVTSTLIFIFSEEKVTYRLFAAVITGLTWPLSL 46 >UniRef50_C6C3M0 Putative uncharacterized protein n=8 Tax=Enterobacteriaceae RepID=C6C3M0_DICDC Length = 61 Score = 39.7 bits (91), Expect = 0.031, Method: Compositional matrix adjust. Identities = 20/56 (35%), Positives = 38/56 (67%), Gaps = 1/56 (1%) Query: 3 LFSKILIFYVIGVNISFVIIWFISHEKTH-IRLLSAFLVGITWPMSLPVALLFSLF 57 ++ I + Y+IG I+ + +F+S +++ IR L++ L+ +TWP+S PV ++FS F Sbjct: 6 MWETIKLIYLIGFCIAILFTFFMSKDRSLLIRFLASALIALTWPLSFPVVIVFSFF 61 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C4X918 Putative uncharacterized protein n=5 Tax=Klebsie... 63 2e-09 UniRef50_P64454 Uncharacterized protein ydcX n=130 Tax=Enterobac... 52 7e-06 UniRef50_C9Y560 Uncharacterized protein ydcX n=6 Tax=Enterobacte... 48 1e-04 Sequences not found previously or not previously below threshold: UniRef50_B4EZ06 Putative membrane protein n=3 Tax=Enterobacteria... 41 0.014 UniRef50_A8GFE6 Putative uncharacterized protein n=2 Tax=Serrati... 40 0.037 CONVERGED! >UniRef50_C4X918 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X918_KLEPN Length = 75 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 31/57 (54%), Positives = 39/57 (68%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 M L+ K+L+FY I I VI WF++ E IRLL+A L+G TWP S P+ALL SLF Sbjct: 19 MTLYQKMLVFYAIMGVICAVISWFLTKESRMIRLLAALLIGATWPFSFPMALLVSLF 75 >UniRef50_P64454 Uncharacterized protein ydcX n=130 Tax=Enterobacteriaceae RepID=YDCX_ECOL6 Length = 57 Score = 51.8 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 36/57 (63%), Positives = 45/57 (78%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 M+L+ +L+FY + I+F+I WF+SH+K IR LSAFLVG TWPMS PVALLFSLF Sbjct: 1 MSLYQHMLVFYAVMAAIAFLITWFLSHDKKRIRFLSAFLVGATWPMSFPVALLFSLF 57 >UniRef50_C9Y560 Uncharacterized protein ydcX n=6 Tax=Enterobacteriaceae RepID=C9Y560_CROTZ Length = 57 Score = 48.0 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 29/57 (50%), Positives = 42/57 (73%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 M L+ +L+FY + +I+ +I +FI+ +KT I+LL A L+G TWPMS PVALL +LF Sbjct: 1 MTLYQHMLVFYAVMASIAALITFFIAKDKTSIKLLCATLIGATWPMSFPVALLVALF 57 >UniRef50_B4EZ06 Putative membrane protein n=3 Tax=Enterobacteriaceae RepID=B4EZ06_PROMH Length = 59 Score = 40.7 bits (94), Expect = 0.014, Method: Composition-based stats. Identities = 27/58 (46%), Positives = 45/58 (77%), Gaps = 1/58 (1%) Query: 1 MALFSKILIFYVIGVNISFVIIWFISHEKT-HIRLLSAFLVGITWPMSLPVALLFSLF 57 M++ + I I Y+IG +SF+I +F++ + + +RLLSAFL+ ITWP+S P+A++FS+F Sbjct: 1 MSVIAIIGIGYLIGAVLSFIITFFLTKDPSLAMRLLSAFLIAITWPLSFPMAIIFSIF 58 >UniRef50_A8GFE6 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GFE6_SERP5 Length = 58 Score = 39.5 bits (91), Expect = 0.037, Method: Composition-based stats. Identities = 23/48 (47%), Positives = 35/48 (72%) Query: 10 FYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF 57 FY+IG+ +S + + +S + IR +SA +G+TWP+SLPV L+FSLF Sbjct: 11 FYLIGLLVSMIFTYLVSRDTIKIRFISALTIGLTWPLSLPVVLMFSLF 58 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.328 0.148 0.438 Lambda K H 0.267 0.0455 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 227,714,115 Number of Sequences: 3077464 Number of extensions: 6872110 Number of successful extensions: 45578 Number of sequences better than 1.0e-01: 9 Number of HSP's better than 0.1 without gapping: 17 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 45554 Number of HSP's gapped (non-prelim): 18 length of query: 57 length of database: 1,040,396,356 effective HSP length: 30 effective length of query: 27 effective length of database: 948,072,436 effective search space: 25597955772 effective search space used: 25597955772 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.2 bits) S2: 87 (38.0 bits)