BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (115 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AA96 Uncharacterized protein yacC n=152 Tax=Enterobac... 238 6e-62 UniRef50_D1P4K5 YacC protein n=7 Tax=Enterobacteriaceae RepID=D1... 124 9e-28 UniRef50_B4EU96 Putative exported protein n=2 Tax=Proteus mirabi... 113 1e-24 >UniRef50_P0AA96 Uncharacterized protein yacC n=152 Tax=Enterobacteriaceae RepID=YACC_ECOL6 Length = 115 Score = 238 bits (606), Expect = 6e-62, Method: Compositional matrix adjust. Identities = 115/115 (100%), Positives = 115/115 (100%) Query: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV Sbjct: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 Query: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK 115 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK Sbjct: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK 115 >UniRef50_D1P4K5 YacC protein n=7 Tax=Enterobacteriaceae RepID=D1P4K5_9ENTR Length = 115 Score = 124 bits (311), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 59/115 (51%), Positives = 82/115 (71%) Query: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 M+ F F L+ V ++ AL+ +EAED+ADLTAVF++LK DCGY +P+ +I RA+V Sbjct: 1 MQKFILISYFMMLLGVTHSAKALAPNEAEDLADLTAVFIYLKYDCGYTQIPDREIERAIV 60 Query: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK 115 +FA+ N+WDLSNY++ M L ++SY DL GI + KC++LARDSL L AYVK Sbjct: 61 YFAKSNKWDLSNYNSNKMSKLNKESYDDLKGIPLAQDFKCQSLARDSLGLFAYVK 115 >UniRef50_B4EU96 Putative exported protein n=2 Tax=Proteus mirabilis RepID=B4EU96_PROMH Length = 120 Score = 113 bits (283), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 54/113 (47%), Positives = 74/113 (65%) Query: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 M F T++ +++ AL+ ++AED+ADLTAV++FL NDCGY+ + +I RAL+ Sbjct: 8 MFALFCTIILSFATLFSSSAKALTYTQAEDLADLTAVYLFLNNDCGYEQISKQKIERALI 67 Query: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAY 113 F+Q QWD+SNY M L EDSY+DL I I KKC+ LA+ SLSLL Y Sbjct: 68 IFSQSQQWDVSNYSMLPMSKLNEDSYQDLKNIQIAHNKKCQLLAQKSLSLLNY 120 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AA96 Uncharacterized protein yacC n=152 Tax=Enterobac... 177 1e-43 UniRef50_D1P4K5 YacC protein n=7 Tax=Enterobacteriaceae RepID=D1... 174 8e-43 UniRef50_B4EU96 Putative exported protein n=2 Tax=Proteus mirabi... 157 7e-38 Sequences not found previously or not previously below threshold: UniRef50_Q01567 Lipoprotein outS n=2 Tax=Dickeya RepID=OUTS_DICD3 39 0.042 CONVERGED! >UniRef50_P0AA96 Uncharacterized protein yacC n=152 Tax=Enterobacteriaceae RepID=YACC_ECOL6 Length = 115 Score = 177 bits (448), Expect = 1e-43, Method: Composition-based stats. Identities = 115/115 (100%), Positives = 115/115 (100%) Query: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV Sbjct: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 Query: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK 115 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK Sbjct: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK 115 >UniRef50_D1P4K5 YacC protein n=7 Tax=Enterobacteriaceae RepID=D1P4K5_9ENTR Length = 115 Score = 174 bits (441), Expect = 8e-43, Method: Composition-based stats. Identities = 59/115 (51%), Positives = 82/115 (71%) Query: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 M+ F F L+ V ++ AL+ +EAED+ADLTAVF++LK DCGY +P+ +I RA+V Sbjct: 1 MQKFILISYFMMLLGVTHSAKALAPNEAEDLADLTAVFIYLKYDCGYTQIPDREIERAIV 60 Query: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAYVK 115 +FA+ N+WDLSNY++ M L ++SY DL GI + KC++LARDSL L AYVK Sbjct: 61 YFAKSNKWDLSNYNSNKMSKLNKESYDDLKGIPLAQDFKCQSLARDSLGLFAYVK 115 >UniRef50_B4EU96 Putative exported protein n=2 Tax=Proteus mirabilis RepID=B4EU96_PROMH Length = 120 Score = 157 bits (398), Expect = 7e-38, Method: Composition-based stats. Identities = 54/113 (47%), Positives = 74/113 (65%) Query: 1 MKTFFRTVLFGSLMAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLPNGQIRRALV 60 M F T++ +++ AL+ ++AED+ADLTAV++FL NDCGY+ + +I RAL+ Sbjct: 8 MFALFCTIILSFATLFSSSAKALTYTQAEDLADLTAVYLFLNNDCGYEQISKQKIERALI 67 Query: 61 FFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLSLLAY 113 F+Q QWD+SNY M L EDSY+DL I I KKC+ LA+ SLSLL Y Sbjct: 68 IFSQSQQWDVSNYSMLPMSKLNEDSYQDLKNIQIAHNKKCQLLAQKSLSLLNY 120 >UniRef50_Q01567 Lipoprotein outS n=2 Tax=Dickeya RepID=OUTS_DICD3 Length = 133 Score = 39.2 bits (90), Expect = 0.042, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 40/93 (43%), Gaps = 3/93 (3%) Query: 14 MAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLP-NGQIRRALVFFAQQNQWDLSN 72 A + + + + +A L A +L+ C +LP +G I + V A Q WD Sbjct: 30 TASRSAASVPANEQISQLASLVAASKYLRVQCERSDLPDDGTILKTAVNVAVQKGWDTGR 89 Query: 73 YDTFDMKALGEDSYRDLSGIGIPVAKKCKALAR 105 Y + L E+ Y+ L G P A +C + R Sbjct: 90 YQSLP--QLSENLYQGLLKDGTPKATQCSSFNR 120 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.319 0.137 0.367 Lambda K H 0.267 0.0421 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 425,785,063 Number of Sequences: 3077464 Number of extensions: 15746015 Number of successful extensions: 36921 Number of sequences better than 1.0e-01: 6 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 36914 Number of HSP's gapped (non-prelim): 9 length of query: 115 length of database: 1,040,396,356 effective HSP length: 82 effective length of query: 33 effective length of database: 788,044,308 effective search space: 26005462164 effective search space used: 26005462164 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 87 (38.1 bits)