BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (167 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P28247 Putative uncharacterized protein bicB n=1 Tax=Es... 350 1e-95 UniRef50_A7MLN2 Putative uncharacterized protein n=1 Tax=Cronoba... 145 6e-34 UniRef50_C4XBA3 Putative uncharacterized protein n=1 Tax=Klebsie... 138 8e-32 UniRef50_B1EJW4 Putative uncharacterized protein n=1 Tax=Escheri... 75 1e-12 >UniRef50_P28247 Putative uncharacterized protein bicB n=1 Tax=Escherichia coli K-12 RepID=BICB_ECOLI Length = 167 Score = 350 bits (897), Expect = 1e-95, Method: Compositional matrix adjust. Identities = 167/167 (100%), Positives = 167/167 (100%) Query: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR Sbjct: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 Query: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC Sbjct: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 Query: 121 DKFIKSHVCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY 167 DKFIKSHVCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY Sbjct: 121 DKFIKSHVCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY 167 >UniRef50_A7MLN2 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MLN2_ENTS8 Length = 250 Score = 145 bits (365), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 79/162 (48%), Positives = 109/162 (67%), Gaps = 11/162 (6%) Query: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 M+ARR +AVRPL ++SHQHQ GG++IQPP +Q R R ++KIEHR VIR+V GA++ALR Sbjct: 75 MVARRGDAVRPLAVVSHQHQPGGVDIQPPCGVQLMRDRLVQKIEHRRVIRIVGGADVALR 134 Query: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 ++H++ VLL +R+A++ D++ R+QF+ + + AI D A+FT NS A+A+LL Sbjct: 135 FVQHKIARAVLLDERIAVILDVVIRQQFKSAVFYNLAIHRDPAGANFTASNSAADAKLLG 194 Query: 121 DKFIKSH-----VCDFACKNGGRALTRKSELL----SAQYSG 153 DKFIKSH V D A GRA R S L S QY+G Sbjct: 195 DKFIKSHYVYKPVVD-AVPTRGRA-KRPSICLKRNQSRQYNG 234 >UniRef50_C4XBA3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4XBA3_KLEPN Length = 271 Score = 138 bits (347), Expect = 8e-32, Method: Compositional matrix adjust. Identities = 63/145 (43%), Positives = 99/145 (68%) Query: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 ++ARR NA+RPL +I HQHQ GG++IQP +Q R+R +++I+ +I +V G ++ LR Sbjct: 121 VVARRGNAMRPLAVIGHQHQPGGVDIQPSGGVQLMRHRLVEEIKDSRMIGIVGGTDVPLR 180 Query: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 L+EH+VT +LL QR++++ ++ R + +R I + A+ G+ AA+FTPGNS A+A+LL Sbjct: 181 LVEHKVTRAILLGQRISVILHLVLRLELKRGIFHNVAVHGNAAAANFTPGNSPADAQLLS 240 Query: 121 DKFIKSHVCDFACKNGGRALTRKSE 145 DK IKSH AC + G + + E Sbjct: 241 DKLIKSHEIFLACHSAGAGKSPEKE 265 >UniRef50_B1EJW4 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1EJW4_9ESCH Length = 40 Score = 74.7 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 35/40 (87%), Positives = 38/40 (95%) Query: 128 VCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY 167 +CDFACKNGGRALTRKSELL AQYSGLITSLK K+S+AYY Sbjct: 1 MCDFACKNGGRALTRKSELLFAQYSGLITSLKRKKSVAYY 40 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28247 Putative uncharacterized protein bicB n=1 Tax=Es... 272 2e-72 UniRef50_C4XBA3 Putative uncharacterized protein n=1 Tax=Klebsie... 223 2e-57 UniRef50_A7MLN2 Putative uncharacterized protein n=1 Tax=Cronoba... 211 5e-54 UniRef50_B1EJW4 Putative uncharacterized protein n=1 Tax=Escheri... 70 3e-11 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P28247 Putative uncharacterized protein bicB n=1 Tax=Escherichia coli K-12 RepID=BICB_ECOLI Length = 167 Score = 272 bits (697), Expect = 2e-72, Method: Composition-based stats. Identities = 167/167 (100%), Positives = 167/167 (100%) Query: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR Sbjct: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 Query: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC Sbjct: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 Query: 121 DKFIKSHVCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY 167 DKFIKSHVCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY Sbjct: 121 DKFIKSHVCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY 167 >UniRef50_C4XBA3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4XBA3_KLEPN Length = 271 Score = 223 bits (568), Expect = 2e-57, Method: Composition-based stats. Identities = 63/145 (43%), Positives = 99/145 (68%) Query: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 ++ARR NA+RPL +I HQHQ GG++IQP +Q R+R +++I+ +I +V G ++ LR Sbjct: 121 VVARRGNAMRPLAVIGHQHQPGGVDIQPSGGVQLMRHRLVEEIKDSRMIGIVGGTDVPLR 180 Query: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 L+EH+VT +LL QR++++ ++ R + +R I + A+ G+ AA+FTPGNS A+A+LL Sbjct: 181 LVEHKVTRAILLGQRISVILHLVLRLELKRGIFHNVAVHGNAAAANFTPGNSPADAQLLS 240 Query: 121 DKFIKSHVCDFACKNGGRALTRKSE 145 DK IKSH AC + G + + E Sbjct: 241 DKLIKSHEIFLACHSAGAGKSPEKE 265 >UniRef50_A7MLN2 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MLN2_ENTS8 Length = 250 Score = 211 bits (538), Expect = 5e-54, Method: Composition-based stats. Identities = 77/161 (47%), Positives = 107/161 (66%), Gaps = 9/161 (5%) Query: 1 MMARRSNAVRPLTIISHQHQTGGINIQPPRRMQFPRYRFIKKIEHRWVIRVVRGANIALR 60 M+ARR +AVRPL ++SHQHQ GG++IQPP +Q R R ++KIEHR VIR+V GA++ALR Sbjct: 75 MVARRGDAVRPLAVVSHQHQPGGVDIQPPCGVQLMRDRLVQKIEHRRVIRIVGGADVALR 134 Query: 61 LIEHEVTWTVLLRQRVAIVSDIMFRKQFERCITDDFAIDGDTIAADFTPGNSTANAELLC 120 ++H++ VLL +R+A++ D++ R+QF+ + + AI D A+FT NS A+A+LL Sbjct: 135 FVQHKIARAVLLDERIAVILDVVIRQQFKSAVFYNLAIHRDPAGANFTASNSAADAKLLG 194 Query: 121 DKFIKSHVCDF----ACKNGGRALTRKSELL----SAQYSG 153 DKFIKSH A GRA R S L S QY+G Sbjct: 195 DKFIKSHYVYKPVVDAVPTRGRAK-RPSICLKRNQSRQYNG 234 >UniRef50_B1EJW4 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1EJW4_9ESCH Length = 40 Score = 69.6 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 35/40 (87%), Positives = 38/40 (95%) Query: 128 VCDFACKNGGRALTRKSELLSAQYSGLITSLKGKRSMAYY 167 +CDFACKNGGRALTRKSELL AQYSGLITSLK K+S+AYY Sbjct: 1 MCDFACKNGGRALTRKSELLFAQYSGLITSLKRKKSVAYY 40 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.323 0.145 0.392 Lambda K H 0.267 0.0447 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 672,688,898 Number of Sequences: 3077464 Number of extensions: 26496025 Number of successful extensions: 70558 Number of sequences better than 1.0e-01: 4 Number of HSP's better than 0.1 without gapping: 8 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 70549 Number of HSP's gapped (non-prelim): 8 length of query: 167 length of database: 1,040,396,356 effective HSP length: 119 effective length of query: 48 effective length of database: 674,178,140 effective search space: 32360550720 effective search space used: 32360550720 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.4 bits) S2: 88 (38.4 bits)