BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (72 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75620 Uncharacterized protein yaaY n=34 Tax=Enterobact... 146 2e-34 UniRef50_C8UFZ6 Conserved predicted protein n=22 Tax=Enterobacte... 126 2e-28 UniRef50_D2TGJ8 Putative membrane protein n=1 Tax=Citrobacter ro... 89 4e-17 >UniRef50_P75620 Uncharacterized protein yaaY n=34 Tax=Enterobacteriaceae RepID=YAAY_ECOLI Length = 72 Score = 146 bits (369), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 72/72 (100%), Positives = 72/72 (100%) Query: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT 60 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT Sbjct: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT 60 Query: 61 AVQGILGRFSLF 72 AVQGILGRFSLF Sbjct: 61 AVQGILGRFSLF 72 >UniRef50_C8UFZ6 Conserved predicted protein n=22 Tax=Enterobacteriaceae RepID=C8UFZ6_ECO1A Length = 76 Score = 126 bits (317), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 61/62 (98%), Positives = 61/62 (98%) Query: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT 60 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGG FLPVICAMAMKSHFFLISVLNRRLTLT Sbjct: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGHFLPVICAMAMKSHFFLISVLNRRLTLT 60 Query: 61 AV 62 AV Sbjct: 61 AV 62 >UniRef50_D2TGJ8 Putative membrane protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TGJ8_CITRO Length = 71 Score = 89.0 bits (219), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 49/70 (70%), Positives = 51/70 (72%) Query: 3 RHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLTAV 62 R SLRSDGAGFYQLAG EYSFSA KI IC +AMKS FL +NRRLTLTAV Sbjct: 2 RRSLRSDGAGFYQLAGREYSFSARKIELRCLLTACICLIAMKSILFLFRGINRRLTLTAV 61 Query: 63 QGILGRFSLF 72 QGIL RFSLF Sbjct: 62 QGILWRFSLF 71 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75620 Uncharacterized protein yaaY n=34 Tax=Enterobact... 109 3e-23 UniRef50_C8UFZ6 Conserved predicted protein n=22 Tax=Enterobacte... 97 2e-19 UniRef50_D2TGJ8 Putative membrane protein n=1 Tax=Citrobacter ro... 97 2e-19 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P75620 Uncharacterized protein yaaY n=34 Tax=Enterobacteriaceae RepID=YAAY_ECOLI Length = 72 Score = 109 bits (272), Expect = 3e-23, Method: Composition-based stats. Identities = 72/72 (100%), Positives = 72/72 (100%) Query: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT 60 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT Sbjct: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT 60 Query: 61 AVQGILGRFSLF 72 AVQGILGRFSLF Sbjct: 61 AVQGILGRFSLF 72 >UniRef50_C8UFZ6 Conserved predicted protein n=22 Tax=Enterobacteriaceae RepID=C8UFZ6_ECO1A Length = 76 Score = 96.9 bits (240), Expect = 2e-19, Method: Composition-based stats. Identities = 61/62 (98%), Positives = 61/62 (98%) Query: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLT 60 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGG FLPVICAMAMKSHFFLISVLNRRLTLT Sbjct: 1 MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGHFLPVICAMAMKSHFFLISVLNRRLTLT 60 Query: 61 AV 62 AV Sbjct: 61 AV 62 >UniRef50_D2TGJ8 Putative membrane protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TGJ8_CITRO Length = 71 Score = 96.5 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 49/70 (70%), Positives = 51/70 (72%) Query: 3 RHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICAMAMKSHFFLISVLNRRLTLTAV 62 R SLRSDGAGFYQLAG EYSFSA KI IC +AMKS FL +NRRLTLTAV Sbjct: 2 RRSLRSDGAGFYQLAGREYSFSARKIELRCLLTACICLIAMKSILFLFRGINRRLTLTAV 61 Query: 63 QGILGRFSLF 72 QGIL RFSLF Sbjct: 62 QGILWRFSLF 71 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.330 0.148 0.450 Lambda K H 0.267 0.0455 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 318,726,874 Number of Sequences: 3077464 Number of extensions: 10680025 Number of successful extensions: 36753 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 36747 Number of HSP's gapped (non-prelim): 6 length of query: 72 length of database: 1,040,396,356 effective HSP length: 43 effective length of query: 29 effective length of database: 908,065,404 effective search space: 26333896716 effective search space used: 26333896716 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.3 bits) S2: 87 (38.0 bits)