BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (155 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_A8A6H8 Inner membrane protein cbrB n=40 Tax=Enterobacte... 290 1e-77 UniRef50_A4W5K2 Putative uncharacterized protein n=2 Tax=Enterob... 72 7e-12 UniRef50_B4SU82 Putative uncharacterized protein n=16 Tax=Salmon... 52 4e-06 >UniRef50_A8A6H8 Inner membrane protein cbrB n=40 Tax=Enterobacteriaceae RepID=CBRB_ECOHS Length = 157 Score = 290 bits (741), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 151/157 (96%), Positives = 152/157 (96%), Gaps = 2/157 (1%) Query: 1 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVLWVIIH--PIFLLLSITTGAIP 58 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVL VII P+FLLLSITTGAIP Sbjct: 1 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVLLVIIQVLPLFLLLSITTGAIP 60 Query: 59 ALLTGVMVACLPEKIGSQKRYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENILSGD 118 ALLTGVMVACLPEKIGSQK YRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENILSGD Sbjct: 61 ALLTGVMVACLPEKIGSQKNYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENILSGD 120 Query: 119 SLVVRIIPALLAGVVMSRIITRLPGLDISCPETDSLS 155 SLVVRIIPALLAGVVMSRIITRLPGLDISCPETDSLS Sbjct: 121 SLVVRIIPALLAGVVMSRIITRLPGLDISCPETDSLS 157 >UniRef50_A4W5K2 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A4W5K2_ENT38 Length = 198 Score = 71.6 bits (174), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 49/148 (33%), Positives = 78/148 (52%), Gaps = 17/148 (11%) Query: 1 MSVSRRVIHHGLYFAVLGPLIGVLFL----VLYIFFAKEPLVLWVIIH-PIFLLLSITTG 55 M+ +RR+ HH +FA+LGP +GV +L E L+L ++++ P F+ ++ G Sbjct: 38 MTKTRRIFHHACWFALLGPHLGVPVTFAGELLTNHRTAENLLLQIVLNFPFFMAVTWFLG 97 Query: 56 AIPALLTGVMVACLPEKIGSQKRYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENIL 115 + ALLTG+ ACLP +I +R LA G G VI +Y G+ S ++++ Sbjct: 98 GVSALLTGIAAACLPPQIYQHAWHRSLACGAMGSVIATLY--------GLIYSGSVDSVI 149 Query: 116 SGDSLVVRIIPALLAGVVMSRIITRLPG 143 + P LLAG++M R + LPG Sbjct: 150 ----FWMATGPGLLAGLIMGRFVPYLPG 173 >UniRef50_B4SU82 Putative uncharacterized protein n=16 Tax=Salmonella enterica RepID=B4SU82_SALNS Length = 154 Score = 52.4 bits (124), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 14/152 (9%) Query: 1 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKE----PLVLWVIIHPIFLLLSITTGA 56 MSV RR+I+H +F +LGPLIG + I ++ P + L I TG Sbjct: 1 MSVFRRIIYHSGWFILLGPLIGAATAIGVINYSPAIIGGPDSFLFCFYRTIPALVIITGW 60 Query: 57 I----PALLTGVMVACLPEKIGSQKRYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFE 112 I PA LTGV A +P K+ + R + I G +IT ++ V G S L Sbjct: 61 IYSLLPAWLTGVACALIPLKLYQKIINRMILCAIAGGLITTLFNLV---RYGFDISIL-- 115 Query: 113 NILSGDSLVVRIIPALLAGVVMSRIITRLPGL 144 ++ G+ L++ +IPA++AG +M +IT LPGL Sbjct: 116 PLIYGEVLIM-LIPAIVAGAIMGGVITYLPGL 146 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A8A6H8 Inner membrane protein cbrB n=40 Tax=Enterobacte... 184 7e-46 UniRef50_A4W5K2 Putative uncharacterized protein n=2 Tax=Enterob... 160 9e-39 UniRef50_B4SU82 Putative uncharacterized protein n=16 Tax=Salmon... 133 2e-30 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_A8A6H8 Inner membrane protein cbrB n=40 Tax=Enterobacteriaceae RepID=CBRB_ECOHS Length = 157 Score = 184 bits (467), Expect = 7e-46, Method: Composition-based stats. Identities = 151/157 (96%), Positives = 152/157 (96%), Gaps = 2/157 (1%) Query: 1 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVLWVIIH--PIFLLLSITTGAIP 58 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVL VII P+FLLLSITTGAIP Sbjct: 1 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVLLVIIQVLPLFLLLSITTGAIP 60 Query: 59 ALLTGVMVACLPEKIGSQKRYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENILSGD 118 ALLTGVMVACLPEKIGSQK YRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENILSGD Sbjct: 61 ALLTGVMVACLPEKIGSQKNYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENILSGD 120 Query: 119 SLVVRIIPALLAGVVMSRIITRLPGLDISCPETDSLS 155 SLVVRIIPALLAGVVMSRIITRLPGLDISCPETDSLS Sbjct: 121 SLVVRIIPALLAGVVMSRIITRLPGLDISCPETDSLS 157 >UniRef50_A4W5K2 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A4W5K2_ENT38 Length = 198 Score = 160 bits (406), Expect = 9e-39, Method: Composition-based stats. Identities = 49/148 (33%), Positives = 78/148 (52%), Gaps = 17/148 (11%) Query: 1 MSVSRRVIHHGLYFAVLGPLIGVLFL----VLYIFFAKEPLVLWVIIH-PIFLLLSITTG 55 M+ +RR+ HH +FA+LGP +GV +L E L+L ++++ P F+ ++ G Sbjct: 38 MTKTRRIFHHACWFALLGPHLGVPVTFAGELLTNHRTAENLLLQIVLNFPFFMAVTWFLG 97 Query: 56 AIPALLTGVMVACLPEKIGSQKRYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFENIL 115 + ALLTG+ ACLP +I +R LA G G VI +Y G+ S ++++ Sbjct: 98 GVSALLTGIAAACLPPQIYQHAWHRSLACGAMGSVIATLY--------GLIYSGSVDSVI 149 Query: 116 SGDSLVVRIIPALLAGVVMSRIITRLPG 143 + P LLAG++M R + LPG Sbjct: 150 ----FWMATGPGLLAGLIMGRFVPYLPG 173 >UniRef50_B4SU82 Putative uncharacterized protein n=16 Tax=Salmonella enterica RepID=B4SU82_SALNS Length = 154 Score = 133 bits (334), Expect = 2e-30, Method: Composition-based stats. Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 14/152 (9%) Query: 1 MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKE----PLVLWVIIHPIFLLLSITTGA 56 MSV RR+I+H +F +LGPLIG + I ++ P + L I TG Sbjct: 1 MSVFRRIIYHSGWFILLGPLIGAATAIGVINYSPAIIGGPDSFLFCFYRTIPALVIITGW 60 Query: 57 I----PALLTGVMVACLPEKIGSQKRYRCLAGGIGGVVITEIYCAVIVHIKGMASSELFE 112 I PA LTGV A +P K+ + R + I G +IT ++ V G S L Sbjct: 61 IYSLLPAWLTGVACALIPLKLYQKIINRMILCAIAGGLITTLFNLV---RYGFDISIL-- 115 Query: 113 NILSGDSLVVRIIPALLAGVVMSRIITRLPGL 144 ++ G+ L++ +IPA++AG +M +IT LPGL Sbjct: 116 PLIYGEVLIM-LIPAIVAGAIMGGVITYLPGL 146 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.323 0.138 0.388 Lambda K H 0.267 0.0422 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 592,945,031 Number of Sequences: 3077464 Number of extensions: 21408430 Number of successful extensions: 100088 Number of sequences better than 1.0e-01: 14 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 12 Number of HSP's that attempted gapping in prelim test: 100063 Number of HSP's gapped (non-prelim): 19 length of query: 155 length of database: 1,040,396,356 effective HSP length: 117 effective length of query: 38 effective length of database: 680,333,068 effective search space: 25852656584 effective search space used: 25852656584 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.5 bits) S2: 87 (38.1 bits)