BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (90 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichi...   182   2e-45
UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteria...    69   4e-11
UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae...    63   3e-09
UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R...    62   8e-09
UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsie...    60   2e-08
UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. ...    57   2e-07
UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_E...    49   7e-05
UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli R...    45   7e-04
UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae ...    42   0.005

>UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichia coli K-12 RepID=YPJI_ECOLI
          Length = 90

 Score = 182 bits (463), Expect = 2e-45, Method: Compositional matrix adjust.
 Identities = 90/90 (100%), Positives = 90/90 (100%)

Query: 1  MSNSEGWXSFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60
          MSNSEGW SF QTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV
Sbjct: 1  MSNSEGWXSFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60

Query: 61 WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR 90
          WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR
Sbjct: 61 WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR 90

>UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteriaceae RepID=Y3853_ECOL5
          Length = 77

 Score = 68.9 bits (167), Expect = 4e-11, Method: Compositional matrix adjust.
 Identities = 28/53 (52%), Positives = 38/53 (71%)

Query: 31 VSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHRSGI 83
          V+    ++ IEDDQG H RL++R+AEG++ WR WNFEPDAG+  N Y+   GI
Sbjct: 22 VTTAYRNVLIEDDQGTHFRLVIRNAEGQLRWRCWNFEPDAGKQLNSYLASEGI 74

>UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae RepID=YUBL_YERPA
          Length = 76

 Score = 62.8 bits (151), Expect = 3e-09, Method: Compositional matrix adjust.
 Identities = 31/55 (56%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 30 LVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHRSGIR 84
          +V+A  T++ IEDDQG H RL+VR   G MVWR WNFEP      NRYI   GIR
Sbjct: 20 VVAAQYTNVAIEDDQGAHFRLVVRQ-NGEMVWRTWNFEPGGTYWLNRYIADYGIR 73

>UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M6X3_9ENTR
          Length = 90

 Score = 61.6 bits (148), Expect = 8e-09, Method: Compositional matrix adjust.
 Identities = 30/58 (51%), Positives = 38/58 (65%)

Query: 26 VAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHRSGI 83
          VA  +VS    ++ +ED + IH    +R +E  MVWRAW FEPDAGEG NRYI + GI
Sbjct: 27 VAAEVVSYLNNNMPLEDCKHIHFSPAIRCSESWMVWRAWCFEPDAGEGLNRYILQYGI 84

>UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=A6TIE0_KLEP7
          Length = 82

 Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 28/56 (50%), Positives = 37/56 (66%), Gaps = 1/56 (1%)

Query: 30 LVSAGITDINIEDDQGIHVRLIVRDAE-GRMVWRAWNFEPDAGEGFNRYIHRSGIR 84
          +V+A   ++ IEDDQG H RL+VR  + G M+WR WNFEP   +  NRYI   G+R
Sbjct: 24 VVAAQYQNVAIEDDQGTHFRLVVRHKDDGSMIWRVWNFEPGGEDIMNRYIRDYGVR 79

>UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. enterica RepID=B5QEK8_SALVI
          Length = 91

 Score = 56.6 bits (135), Expect = 2e-07, Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 34/53 (64%)

Query: 31 VSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHRSGI 83
          V+    ++ IEDDQG H RL++RD+  +++W AWNFE  A    NRY+   GI
Sbjct: 36 VTTAYRNVFIEDDQGTHFRLVIRDSYNQLLWWAWNFEARAWYWLNRYLLSHGI 88

>UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_ECOLX
          Length = 79

 Score = 48.5 bits (114), Expect = 7e-05, Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 37/75 (49%), Gaps = 2/75 (2%)

Query: 9  SFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEP 68
           + + L GLP   S        V+    ++ IEDD G   RL+VR+  G MVWR WNFE
Sbjct: 5  EYFRILQGLPD-GSFTREQAEAVAVQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFED 62

Query: 69 DAGEGFNRYIHRSGI 83
           AG   N  I   GI
Sbjct: 63 GAGYWMNHVIRDFGI 77

>UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli RepID=Y3666_ECOL6
          Length = 77

 Score = 45.1 bits (105), Expect = 7e-04, Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 32/53 (60%)

Query: 31 VSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHRSGI 83
          V     ++ I+DD G+H R ++R+AEG+  WR  N E DAG+  N ++   G+
Sbjct: 22 VITAYRNVFIQDDPGMHFRRVIRNAEGQRRWRCRNSEADAGKQLNAWLASGGL 74

>UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae RepID=YUBL_SALTI
          Length = 78

 Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 35/59 (59%), Gaps = 4/59 (6%)

Query: 31 VSAGITDINIEDDQGI-HVRLIVRDA---EGRMVWRAWNFEPDAGEGFNRYIHRSGIRT 85
          V+AG  ++ IE+ Q   H ++++RD    + ++VWR WN+E  A +  N Y+   G++
Sbjct: 19 VAAGYQNVFIENLQPAGHFQIVIRDHRDHDSQLVWRNWNYESGANDALNSYLQSHGLKA 77

Searching..................................................done

Results from round 2

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value
Sequences used in model and found again:

UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichi...   147   1e-34
UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_E...   107   2e-22
UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteria...   104   8e-22
UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsie...   103   1e-21
UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R...   101   6e-21
UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae...   101   9e-21
UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. ...    97   2e-19
UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli R...    87   1e-16

Sequences not found previously or not previously below threshold:

UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae ...    62   7e-09
UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia...    52   8e-06

>UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichia coli K-12 RepID=YPJI_ECOLI
          Length = 90

 Score = 147 bits (370), Expect = 1e-34, Method: Composition-based stats.
 Identities = 90/90 (100%), Positives = 90/90 (100%)

Query: 1  MSNSEGWXSFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60
          MSNSEGW SF QTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV
Sbjct: 1  MSNSEGWXSFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60

Query: 61 WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR 90
          WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR
Sbjct: 61 WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR 90

>UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_ECOLX
          Length = 79

 Score = 107 bits (266), Expect = 2e-22, Method: Composition-based stats.
 Identities = 30/75 (40%), Positives = 37/75 (49%), Gaps = 2/75 (2%)

Query: 9  SFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEP 68
           + + L GLP   S        V+    ++ IEDD G   RL+VR+  G MVWR WNFE
Sbjct: 5  EYFRILQGLPD-GSFTREQAEAVAVQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFED 62

Query: 69 DAGEGFNRYIHRSGI 83
           AG   N  I   GI
Sbjct: 63 GAGYWMNHVIRDFGI 77

>UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteriaceae RepID=Y3853_ECOL5
          Length = 77

 Score = 104 bits (260), Expect = 8e-22, Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 17 LPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNR 76
          LP+           V+    ++ IEDDQG H RL++R+AEG++ WR WNFEPDAG+  N
Sbjct: 9  LPE-GPFSREQAVAVTTAYRNVLIEDDQGTHFRLVIRNAEGQLRWRCWNFEPDAGKQLNS 67

Query: 77 YIHRSGI 83
          Y+   GI
Sbjct: 68 YLASEGI 74

>UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=A6TIE0_KLEP7
          Length = 82

 Score = 103 bits (258), Expect = 1e-21, Method: Composition-based stats.
 Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 2/72 (2%)

Query: 14 LSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDA-EGRMVWRAWNFEPDAGE 72
          L  LP   +       +V+A   ++ IEDDQG H RL+VR   +G M+WR WNFEP   +
Sbjct: 9  LVSLPD-GTFTREQAQVVAAQYQNVAIEDDQGTHFRLVVRHKDDGSMIWRVWNFEPGGED 67

Query: 73 GFNRYIHRSGIR 84
            NRYI   G+R
Sbjct: 68 IMNRYIRDYGVR 79

>UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M6X3_9ENTR
          Length = 90

 Score = 101 bits (252), Expect = 6e-21, Method: Composition-based stats.
 Identities = 30/58 (51%), Positives = 38/58 (65%)

Query: 26 VAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHRSGI 83
          VA  +VS    ++ +ED + IH    +R +E  MVWRAW FEPDAGEG NRYI + GI
Sbjct: 27 VAAEVVSYLNNNMPLEDCKHIHFSPAIRCSESWMVWRAWCFEPDAGEGLNRYILQYGI 84

>UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae RepID=YUBL_YERPA
          Length = 76

 Score = 101 bits (251), Expect = 9e-21, Method: Composition-based stats.
 Identities = 34/73 (46%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 12 QTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAG 71
          + L+ LP   +       +V+A  T++ IEDDQG H RL+VR   G MVWR WNFEP
Sbjct: 3  EALAVLPDD-TFTREQAEVVAAQYTNVAIEDDQGAHFRLVVRQ-NGEMVWRTWNFEPGGT 60

Query: 72 EGFNRYIHRSGIR 84
             NRYI   GIR
Sbjct: 61 YWLNRYIADYGIR 73

>UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. enterica RepID=B5QEK8_SALVI
          Length = 91

 Score = 96.7 bits (239), Expect = 2e-19, Method: Composition-based stats.
 Identities = 24/63 (38%), Positives = 34/63 (53%)

Query: 21 ASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHR 80
                    V+    ++ IEDDQG H RL++RD+  +++W AWNFE  A    NRY+
Sbjct: 26 GPFSRTQAIAVTTAYRNVFIEDDQGTHFRLVIRDSYNQLLWWAWNFEARAWYWLNRYLLS 85

Query: 81 SGI 83
           GI
Sbjct: 86 HGI 88

>UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli RepID=Y3666_ECOL6
          Length = 77

 Score = 87.4 bits (215), Expect = 1e-16, Method: Composition-based stats.
 Identities = 21/63 (33%), Positives = 33/63 (52%)

Query: 21 ASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHR 80
           S        V     ++ I+DD G+H R ++R+AEG+  WR  N E DAG+  N ++
Sbjct: 12 GSFSYGQAVAVITAYRNVFIQDDPGMHFRRVIRNAEGQRRWRCRNSEADAGKQLNAWLAS 71

Query: 81 SGI 83
           G+
Sbjct: 72 GGL 74

>UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae RepID=YUBL_SALTI
          Length = 78

 Score = 61.6 bits (148), Expect = 7e-09, Method: Composition-based stats.
 Identities = 18/65 (27%), Positives = 34/65 (52%), Gaps = 4/65 (6%)

Query: 24 DCVAGPLVSAGITDINIED-DQGIHVRLIVRDA---EGRMVWRAWNFEPDAGEGFNRYIH 79
                 V+AG  ++ IE+     H ++++RD    + ++VWR WN+E  A +  N Y+
Sbjct: 12 TLAQARTVAAGYQNVFIENLQPAGHFQIVIRDHRDHDSQLVWRNWNYESGANDALNSYLQ 71

Query: 80 RSGIR 84
            G++
Sbjct: 72 SHGLK 76

>UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB73_ERWT9
          Length = 63

 Score = 51.6 bits (122), Expect = 8e-06, Method: Composition-based stats.
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 21 ASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60
           +        V+A  +++ I+DDQG H RL+VR+  G  +
Sbjct: 24 GTFTREKALNVTAAFSNVFIDDDQGSHFRLVVREPSGSFI 63

Searching..................................................done

Results from round 3

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value
Sequences used in model and found again:

UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichi...   141   6e-33
UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteria...   111   6e-24
UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsie...   109   3e-23
UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae...   108   7e-23
UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_E...   106   2e-22
UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. ...   105   6e-22
UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R...   100   2e-20
UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli R...    93   3e-18
UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae ...    83   3e-15
UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia...    64   1e-09

Sequences not found previously or not previously below threshold:

CONVERGED!

>UniRef50_P58095 Putative UPF0401 protein ypjI n=1 Tax=Escherichia coli K-12 RepID=YPJI_ECOLI
          Length = 90

 Score = 141 bits (356), Expect = 6e-33, Method: Composition-based stats.
 Identities = 90/90 (100%), Positives = 90/90 (100%)

Query: 1  MSNSEGWXSFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60
          MSNSEGW SF QTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV
Sbjct: 1  MSNSEGWXSFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60

Query: 61 WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR 90
          WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR
Sbjct: 61 WRAWNFEPDAGEGFNRYIHRSGIRTDTFPR 90

>UniRef50_Q0TB55 UPF0401 protein ECP_3853 n=28 Tax=Enterobacteriaceae RepID=Y3853_ECOL5
          Length = 77

 Score = 111 bits (278), Expect = 6e-24, Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 17 LPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNR 76
          LP+           V+    ++ IEDDQG H RL++R+AEG++ WR WNFEPDAG+  N
Sbjct: 9  LPE-GPFSREQAVAVTTAYRNVLIEDDQGTHFRLVIRNAEGQLRWRCWNFEPDAGKQLNS 67

Query: 77 YIHRSGI 83
          Y+   GI
Sbjct: 68 YLASEGI 74

>UniRef50_A6TIE0 Putative uncharacterized protein n=2 Tax=Klebsiella pneumoniae RepID=A6TIE0_KLEP7
          Length = 82

 Score = 109 bits (272), Expect = 3e-23, Method: Composition-based stats.
 Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 2/72 (2%)

Query: 14 LSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDA-EGRMVWRAWNFEPDAGE 72
          L  LP   +       +V+A   ++ IEDDQG H RL+VR   +G M+WR WNFEP   +
Sbjct: 9  LVSLPD-GTFTREQAQVVAAQYQNVAIEDDQGTHFRLVVRHKDDGSMIWRVWNFEPGGED 67

Query: 73 GFNRYIHRSGIR 84
            NRYI   G+R
Sbjct: 68 IMNRYIRDYGVR 79

>UniRef50_Q1BZY8 UPF0401 protein yubL n=29 Tax=Enterobacteriaceae RepID=YUBL_YERPA
          Length = 76

 Score = 108 bits (269), Expect = 7e-23, Method: Composition-based stats.
 Identities = 34/73 (46%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 12 QTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAG 71
          + L+ LP   +       +V+A  T++ IEDDQG H RL+VR   G MVWR WNFEP
Sbjct: 3  EALAVLPDD-TFTREQAEVVAAQYTNVAIEDDQGAHFRLVVRQ-NGEMVWRTWNFEPGGT 60

Query: 72 EGFNRYIHRSGIR 84
             NRYI   GIR
Sbjct: 61 YWLNRYIADYGIR 73

>UniRef50_Q5JBK6 UPF0401 protein yubL n=62 Tax=root RepID=YUBL2_ECOLX
          Length = 79

 Score = 106 bits (265), Expect = 2e-22, Method: Composition-based stats.
 Identities = 30/75 (40%), Positives = 37/75 (49%), Gaps = 2/75 (2%)

Query: 9  SFXQTLSGLPQWASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEP 68
           + + L GLP   S        V+    ++ IEDD G   RL+VR+  G MVWR WNFE
Sbjct: 5  EYFRILQGLPD-GSFTREQAEAVAVQYRNVFIEDDHGEQFRLVVRN-NGAMVWRTWNFED 62

Query: 69 DAGEGFNRYIHRSGI 83
           AG   N  I   GI
Sbjct: 63 GAGYWMNHVIRDFGI 77

>UniRef50_B5QEK8 YkfF protein n=3 Tax=Salmonella enterica subsp. enterica RepID=B5QEK8_SALVI
          Length = 91

 Score = 105 bits (261), Expect = 6e-22, Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 17 LPQ-WASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFN 75
          LP+            V+    ++ IEDDQG H RL++RD+  +++W AWNFE  A    N
Sbjct: 21 LPEGEGPFSRTQAIAVTTAYRNVFIEDDQGTHFRLVIRDSYNQLLWWAWNFEARAWYWLN 80

Query: 76 RYIHRSGI 83
          RY+   GI
Sbjct: 81 RYLLSHGI 88

>UniRef50_C1M6X3 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M6X3_9ENTR
          Length = 90

 Score = 100 bits (248), Expect = 2e-20, Method: Composition-based stats.
 Identities = 30/58 (51%), Positives = 38/58 (65%)

Query: 26 VAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHRSGI 83
          VA  +VS    ++ +ED + IH    +R +E  MVWRAW FEPDAGEG NRYI + GI
Sbjct: 27 VAAEVVSYLNNNMPLEDCKHIHFSPAIRCSESWMVWRAWCFEPDAGEGLNRYILQYGI 84

>UniRef50_Q8FDS2 UPF0401 protein c3666 n=6 Tax=Escherichia coli RepID=Y3666_ECOL6
          Length = 77

 Score = 92.8 bits (229), Expect = 3e-18, Method: Composition-based stats.
 Identities = 21/63 (33%), Positives = 33/63 (52%)

Query: 21 ASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMVWRAWNFEPDAGEGFNRYIHR 80
           S        V     ++ I+DD G+H R ++R+AEG+  WR  N E DAG+  N ++
Sbjct: 12 GSFSYGQAVAVITAYRNVFIQDDPGMHFRRVIRNAEGQRRWRCRNSEADAGKQLNAWLAS 71

Query: 81 SGI 83
           G+
Sbjct: 72 GGL 74

>UniRef50_Q9L5N1 UPF0401 protein yubL n=4 Tax=Enterobacteriaceae RepID=YUBL_SALTI
          Length = 78

 Score = 82.8 bits (203), Expect = 3e-15, Method: Composition-based stats.
 Identities = 18/67 (26%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 22 SADCVAGPLVSAGITDINIED-DQGIHVRLIVRDA---EGRMVWRAWNFEPDAGEGFNRY 77
                   V+AG  ++ IE+     H ++++RD    + ++VWR WN+E  A +  N Y
Sbjct: 10 PVTLAQARTVAAGYQNVFIENLQPAGHFQIVIRDHRDHDSQLVWRNWNYESGANDALNSY 69

Query: 78 IHRSGIR 84
          +   G++
Sbjct: 70 LQSHGLK 76

>UniRef50_B2VB73 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB73_ERWT9
          Length = 63

 Score = 64.3 bits (155), Expect = 1e-09, Method: Composition-based stats.
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 21 ASADCVAGPLVSAGITDINIEDDQGIHVRLIVRDAEGRMV 60
           +        V+A  +++ I+DDQG H RL+VR+  G  +
Sbjct: 24 GTFTREKALNVTAAFSNVFIDDDQGSHFRLVVREPSGSFI 63

  Database: uniref50.fasta
    Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda     K      H
   0.311    0.135    0.401

Lambda     K      H
   0.267   0.0415    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 536,296,418
Number of Sequences: 3077464
Number of extensions: 17937944
Number of successful extensions: 41119
Number of sequences better than 1.0e-01: 10
Number of HSP's better than 0.1 without gapping: 27
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 41079
Number of HSP's gapped (non-prelim): 29
length of query: 90
length of database: 1,040,396,356
effective HSP length: 60
effective length of query: 30
effective length of database: 855,748,516
effective search space: 25672455480
effective search space used: 25672455480
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 87 (38.1 bits)