BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res.
29:2994-3005.

Query= batch____
         (93 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_P33344 Uncharacterized protein yehE n=52 Tax=Enterobact...   196   1e-49
UniRef50_A9MKV2 Putative uncharacterized protein n=32 Tax=Entero...    91   1e-17
UniRef50_B7LV73 Putative uncharacterized protein n=1 Tax=Escheri...    86   3e-16
UniRef50_B5Y2J0 Putative uncharacterized protein n=1 Tax=Klebsie...    78   9e-14

>UniRef50_P33344 Uncharacterized protein yehE n=52 Tax=Enterobacteriaceae RepID=YEHE_ECOLI
          Length = 93

 Score = 196 bits (499), Expect = 1e-49, Method: Compositional matrix adjust.
 Identities = 93/93 (100%), Positives = 93/93 (100%)

Query: 1  MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD 60
          MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD
Sbjct: 1  MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD 60

Query: 61 TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
          TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD
Sbjct: 61 TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93

>UniRef50_A9MKV2 Putative uncharacterized protein n=32 Tax=Enterobacteriaceae RepID=A9MKV2_SALAR
          Length = 122

 Score = 90.5 bits (223), Expect = 1e-17, Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 62/93 (66%)

Query: 1  MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD 60
          M KY L GII  AYG++   F+S+TATL I+G+++ PTCS  +VN Q QQ CG   +
Sbjct: 30 MKKYLLMGIIVSAYGISVLVFASDTATLTISGKVTAPTCSTEVVNAQLQQRCGNTIHVST 89

Query: 61 TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
           +   ++P++GVTT++     DS R+IV+NRYD
Sbjct: 90 LQTPAATPMRGVTTQLYTVPGDSTRQIVVNRYD 122

>UniRef50_B7LV73 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LV73_ESCF3
          Length = 93

 Score = 86.3 bits (212), Expect = 3e-16, Method: Compositional matrix adjust.
 Identities = 40/86 (46%), Positives = 58/86 (67%)

Query: 8  GIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVDTRHLFSS 67
          GII L YG+A+P  +S+ ATL+I G+ISPPTCS+ +V+    Q CG++T +   +    +
Sbjct: 8  GIIVLVYGIAAPVSASQEATLSIQGKISPPTCSVDVVSSHFTQECGKMTRHFTLQKSVIT 67

Query: 68 PVKGVTTEVVVAGSDSKRRIVLNRYD 93
           V+GV TEVV    D KR+I+L+ YD
Sbjct: 68 AVRGVVTEVVAVPEDGKRKIILSSYD 93

>UniRef50_B5Y2J0 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae 342 RepID=B5Y2J0_KLEP3
          Length = 91

 Score = 77.8 bits (190), Expect = 9e-14, Method: Compositional matrix adjust.
 Identities = 38/91 (41%), Positives = 60/91 (65%), Gaps = 1/91 (1%)

Query: 3  KYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVDTR 62
          K  L GI+  AYGL+ P F+++TATL ++GR+   TCS  +VN QPQQ CG+ TY + ++
Sbjct: 2  KKPLIGILAFAYGLSLPLFAADTATLTLSGRVVSETCSTDIVNKQPQQRCGKNTYLIASQ 61

Query: 63 HLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
          +  ++  +GV T  V    D+ R+I+++ YD
Sbjct: 62 NSVTN-ARGVITRTVNLPDDASRKIIISSYD 91

Searching..................................................done

Results from round 2

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

UniRef50_P33344 Uncharacterized protein yehE n=52 Tax=Enterobact...   150   1e-35
UniRef50_A9MKV2 Putative uncharacterized protein n=32 Tax=Entero...   136   2e-31
UniRef50_B5Y2J0 Putative uncharacterized protein n=1 Tax=Klebsie...   129   3e-29
UniRef50_B7LV73 Putative uncharacterized protein n=1 Tax=Escheri...   126   2e-28

Sequences not found previously or not previously below threshold:

UniRef50_C4X2P8 Putative uncharacterized protein n=1 Tax=Klebsie...    42   0.008

CONVERGED!

>UniRef50_P33344 Uncharacterized protein yehE n=52 Tax=Enterobacteriaceae RepID=YEHE_ECOLI
          Length = 93

 Score = 150 bits (379), Expect = 1e-35, Method: Composition-based stats.
 Identities = 93/93 (100%), Positives = 93/93 (100%)

Query: 1  MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD 60
          MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD
Sbjct: 1  MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD 60

Query: 61 TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
          TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD
Sbjct: 61 TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93

>UniRef50_A9MKV2 Putative uncharacterized protein n=32 Tax=Enterobacteriaceae RepID=A9MKV2_SALAR
          Length = 122

 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats.
 Identities = 43/93 (46%), Positives = 62/93 (66%)

Query: 1  MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD 60
          M KY L GII  AYG++   F+S+TATL I+G+++ PTCS  +VN Q QQ CG   +
Sbjct: 30 MKKYLLMGIIVSAYGISVLVFASDTATLTISGKVTAPTCSTEVVNAQLQQRCGNTIHVST 89

Query: 61 TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
           +   ++P++GVTT++     DS R+IV+NRYD
Sbjct: 90 LQTPAATPMRGVTTQLYTVPGDSTRQIVVNRYD 122

>UniRef50_B5Y2J0 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae 342 RepID=B5Y2J0_KLEP3
          Length = 91

 Score = 129 bits (323), Expect = 3e-29, Method: Composition-based stats.
 Identities = 38/91 (41%), Positives = 60/91 (65%), Gaps = 1/91 (1%)

Query: 3  KYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVDTR 62
          K  L GI+  AYGL+ P F+++TATL ++GR+   TCS  +VN QPQQ CG+ TY + ++
Sbjct: 2  KKPLIGILAFAYGLSLPLFAADTATLTLSGRVVSETCSTDIVNKQPQQRCGKNTYLIASQ 61

Query: 63 HLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
          +  ++  +GV T  V    D+ R+I+++ YD
Sbjct: 62 NSVTN-ARGVITRTVNLPDDASRKIIISSYD 91

>UniRef50_B7LV73 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LV73_ESCF3
          Length = 93

 Score = 126 bits (316), Expect = 2e-28, Method: Composition-based stats.
 Identities = 41/93 (44%), Positives = 59/93 (63%)

Query: 1  MNKYWLSGIIFLAYGLASPAFSSETATLAINGRISPPTCSMAMVNGQPQQHCGQLTYNVD 60
          M      GII L YG+A+P  +S+ ATL+I G+ISPPTCS+ +V+    Q CG++T +
Sbjct: 1  MKTPLPFGIIVLVYGIAAPVSASQEATLSIQGKISPPTCSVDVVSSHFTQECGKMTRHFT 60

Query: 61 TRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
           +    + V+GV TEVV    D KR+I+L+ YD
Sbjct: 61 LQKSVITAVRGVVTEVVAVPEDGKRKIILSSYD 93

>UniRef50_C4X2P8 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4X2P8_KLEPN
          Length = 77

 Score = 41.6 bits (96), Expect = 0.008, Method: Composition-based stats.
 Identities = 8/40 (20%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 54 QLTYNVDTRHLFSSPVKGVTTEVVVAGSDSKRRIVLNRYD 93
          + T  + +++  ++   G+   +V    D+ R+I+++ YD
Sbjct: 39 RSTAFIASQNSVTNAC-GMVNRIVNLPDDTSRKIIISSYD 77

  Database: uniref50.fasta
    Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda     K      H
   0.309    0.129    0.338

Lambda     K      H
   0.267   0.0398    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 330,386,503
Number of Sequences: 3077464
Number of extensions: 10046157
Number of successful extensions: 27151
Number of sequences better than 1.0e-01: 5
Number of HSP's better than 0.1 without gapping: 8
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 27141
Number of HSP's gapped (non-prelim): 9
length of query: 93
length of database: 1,040,396,356
effective HSP length: 63
effective length of query: 30
effective length of database: 846,516,124
effective search space: 25395483720
effective search space used: 25395483720
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 87 (38.2 bits)
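
Note (not part of the BLAST output): the length and threshold figures in the statistics footer can be cross-checked with a little arithmetic. The sketch below is only an illustration under stated assumptions. It assumes the usual edge correction, in which the effective HSP length is subtracted once from the query and once per database sequence, and the standard Karlin-Altschul conversion from raw score to bits, (lambda * S - ln K) / ln 2. It also assumes that the first Lambda/K row is the ungapped set (applied to S1) and the second is the gapped set (applied to S2); the report does not label the rows. The function names are hypothetical and are not taken from any BLAST library.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LEN = 93
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 63  # "effective HSP length"


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumes the edge correction removes hsp_len from the query once and
    from every database sequence once.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def raw_to_bits(raw_score, lam, k):
    """Karlin-Altschul conversion of a raw score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_q, eff_db, space)  # 30 846516124 25395483720

# Assumption: row 1 (0.309, 0.129) is ungapped, row 2 (0.267, 0.0398) is gapped.
s1_bits = raw_to_bits(41, 0.309, 0.129)
s2_bits = raw_to_bits(87, 0.267, 0.0398)
print(round(s1_bits, 1), round(s2_bits, 1))
```

Under these assumptions the three computed lengths reproduce the footer values exactly, and the S1 and S2 conversions land within rounding of the reported 21.2 and 38.2 bits. Lambda and K are printed to only three significant figures, so small deviations are expected when the same conversion is applied to the per-hit scores.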