BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (79 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_E... 159 3e-38 UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae... 99 4e-20 UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsie... 89 5e-17 UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteria... 80 2e-14 UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. ... 71 1e-11 UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli R... 52 4e-06 UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae ... 49 4e-05 UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia... 49 6e-05 UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichi... 48 8e-05 UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 39 0.038 >UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_ECOLX Length = 79 Score = 159 bits (401), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 75/79 (94%), Positives = 76/79 (96%) Query: 1 MRMSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF 60 MRMSEYFRILQGLPDG FTR+ AEAVA QYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF Sbjct: 1 MRMSEYFRILQGLPDGSFTREQAEAVAVQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF 60 Query: 61 EDGAGYWMNHVIRDFGILK 79 EDGAGYWMNHVIRDFGILK Sbjct: 61 EDGAGYWMNHVIRDFGILK 79 >UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae RepID=YUBL_YERPA Length = 76 Score = 99.4 bits (246), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 49/77 (63%), Positives = 53/77 (68%), Gaps = 3/77 (3%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNFED 62 MSE +L PD FTR+ AE VAAQY NV IEDD G FRLVVR NG MVWRTWNFE Sbjct: 1 MSEALAVL---PDDTFTREQAEVVAAQYTNVAIEDDQGAHFRLVVRQNGEMVWRTWNFEP 57 Query: 63 GAGYWMNHVIRDFGILK 79 G YW+N I D+GI K Sbjct: 58 GGTYWLNRYIADYGIRK 74 >UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=A6TIE0_KLEP7 Length = 82 Score = 88.6 bits (218), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 43/72 (59%), Positives = 52/72 (72%), Gaps = 2/72 (2%) Query: 10 LQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVR--NNGAMVWRTWNFEDGAGYW 67 L LPDG FTR+ A+ VAAQY+NV IEDD G FRLVVR ++G+M+WR WNFE G Sbjct: 9 LVSLPDGTFTREQAQVVAAQYQNVAIEDDQGTHFRLVVRHKDDGSMIWRVWNFEPGGEDI 68 Query: 68 MNHVIRDFGILK 79 MN IRD+G+ K Sbjct: 69 MNRYIRDYGVRK 80 >UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteriaceae RepID=Y3853_ECOL5 Length = 77 Score = 80.1 bits (196), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 40/78 (51%), Positives = 50/78 (64%), Gaps = 3/78 (3%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFE 61 MS+ +L LP+GPF+R+ A AV YRNV IEDD G FRLV+RN G + WR WNFE Sbjct: 1 MSDCHPVL--LPEGPFSREQAVAVTTAYRNVLIEDDQGTHFRLVIRNAEGQLRWRCWNFE 58 Query: 62 DGAGYWMNHVIRDFGILK 79 AG +N + GIL+ Sbjct: 59 PDAGKQLNSYLASEGILR 76 >UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. enterica RepID=B5QEK8_SALVI Length = 91 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 35/76 (46%), Positives = 46/76 (60%), Gaps = 1/76 (1%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNN-GAMVWRTWNFE 61 M + +L +GPF+R A AV YRNVFIEDD G FRLV+R++ ++W WNFE Sbjct: 13 MPDCTSLLLPEGEGPFSRTQAIAVTTAYRNVFIEDDQGTHFRLVIRDSYNQLLWWAWNFE 72 Query: 62 DGAGYWMNHVIRDFGI 77 A YW+N + GI Sbjct: 73 ARAWYWLNRYLLSHGI 88 >UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli RepID=Y3666_ECOL6 Length = 77 Score = 52.4 bits (124), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 30/68 (44%), Positives = 38/68 (55%), Gaps = 1/68 (1%) Query: 13 LPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFEDGAGYWMNHV 71 L +G F+ A AV YRNVFI+DD G FR V+RN G WR N E AG +N Sbjct: 9 LAEGSFSYGQAVAVITAYRNVFIQDDPGMHFRRVIRNAEGQRRWRCRNSEADAGKQLNAW 68 Query: 72 IRDFGILK 79 + G+L+ Sbjct: 69 LASGGLLR 76 >UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae RepID=YUBL_SALTI Length = 78 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 10/80 (12%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIED-DHGEQFRLVVRN----NGAMVWRT 57 MS+ F + P T A VAA Y+NVFIE+ F++V+R+ + +VWR Sbjct: 1 MSDLFS-----SESPVTLAQARTVAAGYQNVFIENLQPAGHFQIVIRDHRDHDSQLVWRN 55 Query: 58 WNFEDGAGYWMNHVIRDFGI 77 WN+E GA +N ++ G+ Sbjct: 56 WNYESGANDALNSYLQSHGL 75 >UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB73_ERWT9 Length = 63 Score = 48.5 bits (114), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 22/41 (53%), Positives = 27/41 (65%) Query: 8 RILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVR 48 R + L +G FTR+ A V A + NVFI+DD G FRLVVR Sbjct: 16 RCVAALGNGTFTREKALNVTAAFSNVFIDDDQGSHFRLVVR 56 >UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichia coli K-12 RepID=YPJI_ECOLI Length = 90 Score = 48.1 bits (113), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 31/75 (41%), Positives = 38/75 (50%), Gaps = 2/75 (2%) Query: 5 EYFRILQGLPDGPFTRKHA-EAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFED 62 + + L GLP A V+A ++ IEDD G RL+VR+ G MVWR WNFE Sbjct: 9 SFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEP 68 Query: 63 GAGYWMNHVIRDFGI 77 AG N I GI Sbjct: 69 DAGEGFNRYIHRSGI 83 >UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M6X3_9ENTR Length = 90 Score = 39.3 bits (90), Expect = 0.038, Method: Compositional matrix adjust. Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 1/58 (1%) Query: 23 AEAVAAQYRNVFIEDDHGEQFRLVVRNNGA-MVWRTWNFEDGAGYWMNHVIRDFGILK 79 AE V+ N+ +ED F +R + + MVWR W FE AG +N I +GIL+ Sbjct: 29 AEVVSYLNNNMPLEDCKHIHFSPAIRCSESWMVWRAWCFEPDAGEGLNRYILQYGILR 86 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_E... 127 1e-28 UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteria... 120 2e-26 UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. ... 114 9e-25 UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae... 113 2e-24 UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsie... 110 1e-23 UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichi... 98 7e-20 UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli R... 92 4e-18 UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae ... 89 5e-17 UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia... 66 4e-10 Sequences not found previously or not previously below threshold: UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 62 8e-09 >UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_ECOLX Length = 79 Score = 127 bits (318), Expect = 1e-28, Method: Composition-based stats. Identities = 75/79 (94%), Positives = 76/79 (96%) Query: 1 MRMSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF 60 MRMSEYFRILQGLPDG FTR+ AEAVA QYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF Sbjct: 1 MRMSEYFRILQGLPDGSFTREQAEAVAVQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF 60 Query: 61 EDGAGYWMNHVIRDFGILK 79 EDGAGYWMNHVIRDFGILK Sbjct: 61 EDGAGYWMNHVIRDFGILK 79 >UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteriaceae RepID=Y3853_ECOL5 Length = 77 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 40/78 (51%), Positives = 50/78 (64%), Gaps = 3/78 (3%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFE 61 MS+ +L LP+GPF+R+ A AV YRNV IEDD G FRLV+RN G + WR WNFE Sbjct: 1 MSDCHPVL--LPEGPFSREQAVAVTTAYRNVLIEDDQGTHFRLVIRNAEGQLRWRCWNFE 58 Query: 62 DGAGYWMNHVIRDFGILK 79 AG +N + GIL+ Sbjct: 59 PDAGKQLNSYLASEGILR 76 >UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. enterica RepID=B5QEK8_SALVI Length = 91 Score = 114 bits (285), Expect = 9e-25, Method: Composition-based stats. Identities = 35/78 (44%), Positives = 47/78 (60%), Gaps = 1/78 (1%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNN-GAMVWRTWNFE 61 M + +L +GPF+R A AV YRNVFIEDD G FRLV+R++ ++W WNFE Sbjct: 13 MPDCTSLLLPEGEGPFSRTQAIAVTTAYRNVFIEDDQGTHFRLVIRDSYNQLLWWAWNFE 72 Query: 62 DGAGYWMNHVIRDFGILK 79 A YW+N + GI + Sbjct: 73 ARAWYWLNRYLLSHGIPR 90 >UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae RepID=YUBL_YERPA Length = 76 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 47/70 (67%), Positives = 50/70 (71%) Query: 10 LQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNFEDGAGYWMN 69 L LPD FTR+ AE VAAQY NV IEDD G FRLVVR NG MVWRTWNFE G YW+N Sbjct: 5 LAVLPDDTFTREQAEVVAAQYTNVAIEDDQGAHFRLVVRQNGEMVWRTWNFEPGGTYWLN 64 Query: 70 HVIRDFGILK 79 I D+GI K Sbjct: 65 RYIADYGIRK 74 >UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=A6TIE0_KLEP7 Length = 82 Score = 110 bits (275), Expect = 1e-23, Method: Composition-based stats. Identities = 43/77 (55%), Positives = 52/77 (67%), Gaps = 2/77 (2%) Query: 5 EYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVR--NNGAMVWRTWNFED 62 L LPDG FTR+ A+ VAAQY+NV IEDD G FRLVVR ++G+M+WR WNFE Sbjct: 4 NVLSPLVSLPDGTFTREQAQVVAAQYQNVAIEDDQGTHFRLVVRHKDDGSMIWRVWNFEP 63 Query: 63 GAGYWMNHVIRDFGILK 79 G MN IRD+G+ K Sbjct: 64 GGEDIMNRYIRDYGVRK 80 >UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichia coli K-12 RepID=YPJI_ECOLI Length = 90 Score = 98.2 bits (243), Expect = 7e-20, Method: Composition-based stats. Identities = 31/76 (40%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Query: 5 EYFRILQGLPDGPFTRKHA-EAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFED 62 + + L GLP A V+A ++ IEDD G RL+VR+ G MVWR WNFE Sbjct: 9 SFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEP 68 Query: 63 GAGYWMNHVIRDFGIL 78 AG N I GI Sbjct: 69 DAGEGFNRYIHRSGIR 84 >UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli RepID=Y3666_ECOL6 Length = 77 Score = 92.4 bits (228), Expect = 4e-18, Method: Composition-based stats. Identities = 30/68 (44%), Positives = 38/68 (55%), Gaps = 1/68 (1%) Query: 13 LPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFEDGAGYWMNHV 71 L +G F+ A AV YRNVFI+DD G FR V+RN G WR N E AG +N Sbjct: 9 LAEGSFSYGQAVAVITAYRNVFIQDDPGMHFRRVIRNAEGQRRWRCRNSEADAGKQLNAW 68 Query: 72 IRDFGILK 79 + G+L+ Sbjct: 69 LASGGLLR 76 >UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae RepID=YUBL_SALTI Length = 78 Score = 88.6 bits (218), Expect = 5e-17, Method: Composition-based stats. Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 10/80 (12%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIED-DHGEQFRLVVRN----NGAMVWRT 57 MS+ F + P T A VAA Y+NVFIE+ F++V+R+ + +VWR Sbjct: 1 MSDLFS-----SESPVTLAQARTVAAGYQNVFIENLQPAGHFQIVIRDHRDHDSQLVWRN 55 Query: 58 WNFEDGAGYWMNHVIRDFGI 77 WN+E GA +N ++ G+ Sbjct: 56 WNYESGANDALNSYLQSHGL 75 >UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB73_ERWT9 Length = 63 Score = 65.9 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 22/42 (52%), Positives = 27/42 (64%) Query: 8 RILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRN 49 R + L +G FTR+ A V A + NVFI+DD G FRLVVR Sbjct: 16 RCVAALGNGTFTREKALNVTAAFSNVFIDDDQGSHFRLVVRE 57 >UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M6X3_9ENTR Length = 90 Score = 61.6 bits (148), Expect = 8e-09, Method: Composition-based stats. Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 1/58 (1%) Query: 23 AEAVAAQYRNVFIEDDHGEQFRLVVR-NNGAMVWRTWNFEDGAGYWMNHVIRDFGILK 79 AE V+ N+ +ED F +R + MVWR W FE AG +N I +GIL+ Sbjct: 29 AEVVSYLNNNMPLEDCKHIHFSPAIRCSESWMVWRAWCFEPDAGEGLNRYILQYGILR 86 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_E... 124 1e-27 UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteria... 116 2e-25 UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae... 112 3e-24 UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. ... 112 4e-24 UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsie... 111 6e-24 UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichi... 96 3e-19 UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 94 2e-18 UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli R... 90 3e-17 UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae ... 85 8e-16 UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia... 64 1e-09 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_ECOLX Length = 79 Score = 124 bits (310), Expect = 1e-27, Method: Composition-based stats. Identities = 75/79 (94%), Positives = 76/79 (96%) Query: 1 MRMSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF 60 MRMSEYFRILQGLPDG FTR+ AEAVA QYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF Sbjct: 1 MRMSEYFRILQGLPDGSFTREQAEAVAVQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNF 60 Query: 61 EDGAGYWMNHVIRDFGILK 79 EDGAGYWMNHVIRDFGILK Sbjct: 61 EDGAGYWMNHVIRDFGILK 79 >UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteriaceae RepID=Y3853_ECOL5 Length = 77 Score = 116 bits (290), Expect = 2e-25, Method: Composition-based stats. Identities = 40/78 (51%), Positives = 50/78 (64%), Gaps = 3/78 (3%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFE 61 MS+ +L LP+GPF+R+ A AV YRNV IEDD G FRLV+RN G + WR WNFE Sbjct: 1 MSDCHPVL--LPEGPFSREQAVAVTTAYRNVLIEDDQGTHFRLVIRNAEGQLRWRCWNFE 58 Query: 62 DGAGYWMNHVIRDFGILK 79 AG +N + GIL+ Sbjct: 59 PDAGKQLNSYLASEGILR 76 >UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae RepID=YUBL_YERPA Length = 76 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 47/70 (67%), Positives = 50/70 (71%) Query: 10 LQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNNGAMVWRTWNFEDGAGYWMN 69 L LPD FTR+ AE VAAQY NV IEDD G FRLVVR NG MVWRTWNFE G YW+N Sbjct: 5 LAVLPDDTFTREQAEVVAAQYTNVAIEDDQGAHFRLVVRQNGEMVWRTWNFEPGGTYWLN 64 Query: 70 HVIRDFGILK 79 I D+GI K Sbjct: 65 RYIADYGIRK 74 >UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. enterica RepID=B5QEK8_SALVI Length = 91 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. Identities = 35/78 (44%), Positives = 47/78 (60%), Gaps = 1/78 (1%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNN-GAMVWRTWNFE 61 M + +L +GPF+R A AV YRNVFIEDD G FRLV+R++ ++W WNFE Sbjct: 13 MPDCTSLLLPEGEGPFSRTQAIAVTTAYRNVFIEDDQGTHFRLVIRDSYNQLLWWAWNFE 72 Query: 62 DGAGYWMNHVIRDFGILK 79 A YW+N + GI + Sbjct: 73 ARAWYWLNRYLLSHGIPR 90 >UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=A6TIE0_KLEP7 Length = 82 Score = 111 bits (278), Expect = 6e-24, Method: Composition-based stats. Identities = 43/77 (55%), Positives = 52/77 (67%), Gaps = 2/77 (2%) Query: 5 EYFRILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVR--NNGAMVWRTWNFED 62 L LPDG FTR+ A+ VAAQY+NV IEDD G FRLVVR ++G+M+WR WNFE Sbjct: 4 NVLSPLVSLPDGTFTREQAQVVAAQYQNVAIEDDQGTHFRLVVRHKDDGSMIWRVWNFEP 63 Query: 63 GAGYWMNHVIRDFGILK 79 G MN IRD+G+ K Sbjct: 64 GGEDIMNRYIRDYGVRK 80 >UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichia coli K-12 RepID=YPJI_ECOLI Length = 90 Score = 96.3 bits (238), Expect = 3e-19, Method: Composition-based stats. Identities = 31/76 (40%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Query: 5 EYFRILQGLPDGPFTRKHA-EAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFED 62 + + L GLP A V+A ++ IEDD G RL+VR+ G MVWR WNFE Sbjct: 9 SFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEP 68 Query: 63 GAGYWMNHVIRDFGIL 78 AG N I GI Sbjct: 69 DAGEGFNRYIHRSGIR 84 >UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M6X3_9ENTR Length = 90 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 1/58 (1%) Query: 23 AEAVAAQYRNVFIEDDHGEQFRLVVR-NNGAMVWRTWNFEDGAGYWMNHVIRDFGILK 79 AE V+ N+ +ED F +R + MVWR W FE AG +N I +GIL+ Sbjct: 29 AEVVSYLNNNMPLEDCKHIHFSPAIRCSESWMVWRAWCFEPDAGEGLNRYILQYGILR 86 >UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli RepID=Y3666_ECOL6 Length = 77 Score = 89.7 bits (221), Expect = 3e-17, Method: Composition-based stats. Identities = 30/68 (44%), Positives = 38/68 (55%), Gaps = 1/68 (1%) Query: 13 LPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFEDGAGYWMNHV 71 L +G F+ A AV YRNVFI+DD G FR V+RN G WR N E AG +N Sbjct: 9 LAEGSFSYGQAVAVITAYRNVFIQDDPGMHFRRVIRNAEGQRRWRCRNSEADAGKQLNAW 68 Query: 72 IRDFGILK 79 + G+L+ Sbjct: 69 LASGGLLR 76 >UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae RepID=YUBL_SALTI Length = 78 Score = 84.7 bits (208), Expect = 8e-16, Method: Composition-based stats. Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 10/80 (12%) Query: 3 MSEYFRILQGLPDGPFTRKHAEAVAAQYRNVFIED-DHGEQFRLVVRN----NGAMVWRT 57 MS+ F + P T A VAA Y+NVFIE+ F++V+R+ + +VWR Sbjct: 1 MSDLFS-----SESPVTLAQARTVAAGYQNVFIENLQPAGHFQIVIRDHRDHDSQLVWRN 55 Query: 58 WNFEDGAGYWMNHVIRDFGI 77 WN+E GA +N ++ G+ Sbjct: 56 WNYESGANDALNSYLQSHGL 75 >UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB73_ERWT9 Length = 63 Score = 64.3 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 22/44 (50%), Positives = 27/44 (61%) Query: 8 RILQGLPDGPFTRKHAEAVAAQYRNVFIEDDHGEQFRLVVRNNG 51 R + L +G FTR+ A V A + NVFI+DD G FRLVVR Sbjct: 16 RCVAALGNGTFTREKALNVTAAFSNVFIDDDQGSHFRLVVREPS 59 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.134 0.373 Lambda K H 0.267 0.0407 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 457,337,294 Number of Sequences: 3077464 Number of extensions: 14659743 Number of successful extensions: 46469 Number of sequences better than 1.0e-01: 10 Number of HSP's better than 0.1 without gapping: 27 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 46421 Number of HSP's gapped (non-prelim): 30 length of query: 79 length of database: 1,040,396,356 effective HSP length: 50 effective length of query: 29 effective length of database: 886,523,156 effective search space: 25709171524 effective search space used: 25709171524 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.8 bits) S2: 87 (38.1 bits)