BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul
(2001), "Improving the accuracy of PSI-BLAST protein database searches
with composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.

Query= batch____
         (96 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P76165 Uncharacterized protein ydfX n=44 Tax=Escherichi...  196   2e-49
UniRef50_B5R8N9 Predicted phage protein n=15 Tax=Salmonella ente...   83   2e-15
UniRef50_C8U4T3 Probable phage regulatory protein CII n=2 Tax=Es...   42   0.005

>UniRef50_P76165 Uncharacterized protein ydfX n=44 Tax=Escherichia RepID=YDFX_ECOLI
Length = 96

Score = 196 bits (497), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 96/96 (100%), Positives = 96/96 (100%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60
          MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL
Sbjct: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60

Query: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG 96
          GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG
Sbjct: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG 96

>UniRef50_B5R8N9 Predicted phage protein n=15 Tax=Salmonella enterica subsp. enterica RepID=B5R8N9_SALG2
Length = 164

Score = 83.2 bits (204), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 57/91 (62%)

Query: 3  ITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRALGV 62
          ITPE A +AL +W+ +TQE AT LIT AF RP I V R+ + G +D A
Sbjct: 2  ITPETASQALSSWLAYLQITQETATQLITRAFLEQPARPEIAVHRIERDDGTVDYDAWRR 61

Query: 63 NRVKIFERWKAIDTRDKREKFTALVPAIMEA 93
          NR+ IF+RW+ +T + EKF+AL PAI+EA
Sbjct: 62 NRINIFQRWRKRETAEHCEKFSALTPAILEA 92

>UniRef50_C8U4T3 Probable phage regulatory protein CII n=2 Tax=Escherichia coli RepID=C8U4T3_ECO10
Length = 145

Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 12/93 (12%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKER--PNIDVQRVTYEGGAIDQR 58
          MKI E R A++AW+ P + +++ I A++ L+ P D T EG
Sbjct: 1  MKIKHEHIRMAMNAWLLYPRVGRKKIADDIATAYFELEMTYPPMHDTS--TTEG------ 52

Query: 59 ALGVNRVKIFERWKAIDTRDKREKFTALVPAIM 91
          +G+N IF RW DT D EK AL+PAI+
Sbjct: 53 -IGLNIQNIF-RWLEKDTPDAVEKIQALIPAIL 83

Searching..................................................done

Results from round 2

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

Sequences used in model and found again:

UniRef50_P76165 Uncharacterized protein ydfX n=44 Tax=Escherichi...  154   1e-36
UniRef50_B5R8N9 Predicted phage protein n=15 Tax=Salmonella ente...  145   4e-34

Sequences not found previously or not previously below threshold:

UniRef50_C8U4T3 Probable phage regulatory protein CII n=2 Tax=Es...   47   2e-04
UniRef50_A7ZND8 Putative uncharacterized protein n=2 Tax=Escheri...   46   5e-04
UniRef50_B7UQF8 Probable phage regulatory protein n=95 Tax=Enter...   41   0.012
UniRef50_Q7N1J8 Similarities with unknown protein and with YdaT ...   38   0.099

>UniRef50_P76165 Uncharacterized protein ydfX n=44 Tax=Escherichia RepID=YDFX_ECOLI
Length = 96

Score = 154 bits (388), Expect = 1e-36, Method: Composition-based stats.
Identities = 96/96 (100%), Positives = 96/96 (100%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60
          MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL
Sbjct: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60

Query: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG 96
          GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG
Sbjct: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG 96

>UniRef50_B5R8N9 Predicted phage protein n=15 Tax=Salmonella enterica subsp. enterica RepID=B5R8N9_SALG2
Length = 164

Score = 145 bits (366), Expect = 4e-34, Method: Composition-based stats.
Identities = 43/91 (47%), Positives = 57/91 (62%)

Query: 3  ITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRALGV 62
          ITPE A +AL +W+ +TQE AT LIT AF RP I V R+ + G +D A
Sbjct: 2  ITPETASQALSSWLAYLQITQETATQLITRAFLEQPARPEIAVHRIERDDGTVDYDAWRR 61

Query: 63 NRVKIFERWKAIDTRDKREKFTALVPAIMEA 93
          NR+ IF+RW+ +T + EKF+AL PAI+EA
Sbjct: 62 NRINIFQRWRKRETAEHCEKFSALTPAILEA 92

>UniRef50_C8U4T3 Probable phage regulatory protein CII n=2 Tax=Escherichia coli RepID=C8U4T3_ECO10
Length = 145

Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 8/91 (8%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60
          MKI E R A++AW+ P + +++ I A++ L E + + G +
Sbjct: 1  MKIKHEHIRMAMNAWLLYPRVGRKKIADDIATAYFEL-EMTYPPMHDTSTTEG------I 53

Query: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIM 91
          G+N IF RW DT D EK AL+PAI+
Sbjct: 54 GLNIQNIF-RWLEKDTPDAVEKIQALIPAIL 83

>UniRef50_A7ZND8 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=A7ZND8_ECO24
Length = 158

Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats.
Identities = 28/89 (31%), Positives = 45/89 (50%), Gaps = 4/89 (4%)

Query: 5  PEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRALGVNR 64
          PE+ ++ + W R G QE TI I A+++ + + G +D RA+ NR
Sbjct: 4  PEELQKEILTWAARAG--QELVTIEICRAWFSQGRNDELRLHEFEDADGNVDWRAINNNR 61

Query: 65 VKIFERWKAIDTRDKREKFTALVPAIMEA 93
          KIF RW +T R K T ++ ++M+A
Sbjct: 62 QKIF-RWLRGETTAARRK-TQVLASVMKA 88

>UniRef50_B7UQF8 Probable phage regulatory protein n=95 Tax=Enterobacteriaceae RepID=B7UQF8_ECO27
Length = 141

Score = 40.9 bits (94), Expect = 0.012, Method: Composition-based stats.
Identities = 34/94 (36%), Positives = 44/94 (46%), Gaps = 12/94 (12%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKER-PNIDVQRVTYEGGAIDQRA 59
          MKI E R A++ W P + A IT+A++ L P + Y+ A
Sbjct: 1  MKIKHEHIRMAMNVW-AHPDGEKVPAAK-ITKAYFELGMTFPEL------YDDS--HPEA 50

Query: 60 LGVNRVKIFERWKAIDTRDKREKFTALVPAIMEA 93
          L N KIF RW DT D EK AL+PAI +A
Sbjct: 51 LARNTQKIF-RWLDKDTPDAVEKMQALLPAIEKA 83

>UniRef50_Q7N1J8 Similarities with unknown protein and with YdaT protein of Escherichia coli n=5 Tax=Enterobacteriaceae RepID=Q7N1J8_PHOLL
Length = 149

Score = 37.8 bits (86), Expect = 0.099, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 5/83 (6%)

Query: 12 LDAWICRPGMTQEQATILITEAFWALKERP-NIDVQRVTYEGGAIDQRALGVNRVKIFER 70
          ++AW G QE I I+ ++ L ER + + + G A + +A+ NR +IF R
Sbjct: 12 VEAWAAEKG--QEYVAIEISRMYFLLCERTVSAKLHPIEINGNA-NWKAINNNRQQIF-R 67

Query: 71 WKAIDTRDKREKFTALVPAIMEA 93
          W D+R R K + L+PAI A
Sbjct: 68 WLRSDSRAARRKVSELLPAIQSA 90

Searching..................................................done

Results from round 3

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

Sequences used in model and found again:

UniRef50_P76165 Uncharacterized protein ydfX n=44 Tax=Escherichi...  135   4e-31
UniRef50_B5R8N9 Predicted phage protein n=15 Tax=Salmonella ente...  126   2e-28
UniRef50_C8U4T3 Probable phage regulatory protein CII n=2 Tax=Es...  115   4e-25
UniRef50_A7ZND8 Putative uncharacterized protein n=2 Tax=Escheri...  104   6e-22

Sequences not found previously or not previously below threshold:

UniRef50_B7UQF8 Probable phage regulatory protein n=95 Tax=Enter...   77   2e-13
UniRef50_Q7N1J8 Similarities with unknown protein and with YdaT ...   68   9e-11
UniRef50_C0AW01 Putative uncharacterized protein n=2 Tax=Enterob...   66   4e-10
UniRef50_P76064 Uncharacterized protein ydaT n=20 Tax=Enterobact...   57   2e-07
UniRef50_C4SVT0 tRNA-(Guanine-N1)-methyltransferase n=1 Tax=Yers...   53   3e-06
UniRef50_B4T224 Putative uncharacterized protein n=2 Tax=Salmone...   51   2e-05

>UniRef50_P76165 Uncharacterized protein ydfX n=44 Tax=Escherichia RepID=YDFX_ECOLI
Length = 96

Score = 135 bits (339), Expect = 4e-31, Method: Composition-based stats.
Identities = 96/96 (100%), Positives = 96/96 (100%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60
          MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL
Sbjct: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60

Query: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG 96
          GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG
Sbjct: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG 96

>UniRef50_B5R8N9 Predicted phage protein n=15 Tax=Salmonella enterica subsp. enterica RepID=B5R8N9_SALG2
Length = 164

Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats.
Identities = 43/91 (47%), Positives = 57/91 (62%)

Query: 3  ITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRALGV 62
          ITPE A +AL +W+ +TQE AT LIT AF RP I V R+ + G +D A
Sbjct: 2  ITPETASQALSSWLAYLQITQETATQLITRAFLEQPARPEIAVHRIERDDGTVDYDAWRR 61

Query: 63 NRVKIFERWKAIDTRDKREKFTALVPAIMEA 93
          NR+ IF+RW+ +T + EKF+AL PAI+EA
Sbjct: 62 NRINIFQRWRKRETAEHCEKFSALTPAILEA 92

>UniRef50_C8U4T3 Probable phage regulatory protein CII n=2 Tax=Escherichia coli RepID=C8U4T3_ECO10
Length = 145

Score = 115 bits (287), Expect = 4e-25, Method: Composition-based stats.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 8/91 (8%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60
          MKI E R A++AW+ P + +++ I A++ L E + + G +
Sbjct: 1  MKIKHEHIRMAMNAWLLYPRVGRKKIADDIATAYFEL-EMTYPPMHDTSTTEG------I 53

Query: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIM 91
          G+N IF RW DT D EK AL+PAI+
Sbjct: 54 GLNIQNIF-RWLEKDTPDAVEKIQALIPAIL 83

>UniRef50_A7ZND8 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=A7ZND8_ECO24
Length = 158

Score = 104 bits (260), Expect = 6e-22, Method: Composition-based stats.
Identities = 28/89 (31%), Positives = 45/89 (50%), Gaps = 4/89 (4%)

Query: 5  PEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRALGVNR 64
          PE+ ++ + W R G QE TI I A+++ + + G +D RA+ NR
Sbjct: 4  PEELQKEILTWAARAG--QELVTIEICRAWFSQGRNDELRLHEFEDADGNVDWRAINNNR 61

Query: 65 VKIFERWKAIDTRDKREKFTALVPAIMEA 93
          KIF RW +T R K T ++ ++M+A
Sbjct: 62 QKIF-RWLRGETTAARRK-TQVLASVMKA 88

>UniRef50_B7UQF8 Probable phage regulatory protein n=95 Tax=Enterobacteriaceae RepID=B7UQF8_ECO27
Length = 141

Score = 76.8 bits (187), Expect = 2e-13, Method: Composition-based stats.
Identities = 31/93 (33%), Positives = 44/93 (47%), Gaps = 10/93 (10%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60
          MKI E R A++ W P ++ IT+A++ L ++ ++ AL
Sbjct: 1  MKIKHEHIRMAMNVW-AHPD-GEKVPAAKITKAYFELG-MTFPELYDDSHPE------AL 51

Query: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEA 93
          N KIF RW DT D EK AL+PAI +A
Sbjct: 52 ARNTQKIF-RWLDKDTPDAVEKMQALLPAIEKA 83

>UniRef50_Q7N1J8 Similarities with unknown protein and with YdaT protein of Escherichia coli n=5 Tax=Enterobacteriaceae RepID=Q7N1J8_PHOLL
Length = 149

Score = 68.0 bits (164), Expect = 9e-11, Method: Composition-based stats.
Identities = 27/87 (31%), Positives = 43/87 (49%), Gaps = 5/87 (5%)

Query: 8  AREALDAWICRPGMTQEQATILITEAFWALKERP-NIDVQRVTYEGGAIDQRALGVNRVK 66
          + ++AW G QE I I+ ++ L ER + + + G + +A+ NR +
Sbjct: 8  LKAEVEAWAAEKG--QEYVAIEISRMYFLLCERTVSAKLHPIE-INGNANWKAINNNRQQ 64

Query: 67 IFERWKAIDTRDKREKFTALVPAIMEA 93
          IF RW D+R R K + L+PAI A
Sbjct: 65 IF-RWLRSDSRAARRKVSELLPAIQSA 90

>UniRef50_C0AW01 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=C0AW01_9ENTR
Length = 151

Score = 65.7 bits (158), Expect = 4e-10, Method: Composition-based stats.
Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 4/84 (4%)

Query: 8  AREALDAWICRPGMTQEQATILITEAFWALKERPNI-DVQRVTYEGGAIDQRALGVNRVK 66
          R ++ W G QE I I+ A+ L + + + G D +A+ NR +
Sbjct: 8  IRAEIEDWAVEQG--QEHVAIEISRAYLRLVINQEHGRLHVIEDQTGRADWKAINNNRQQ 65

Query: 67 IFERWKAIDTRDKREKFTALVPAI 90
          IF RW D+R + K L+PAI
Sbjct: 66 IF-RWLRGDSRASQRKIAELMPAI 88

>UniRef50_P76064 Uncharacterized protein ydaT n=20 Tax=Enterobacteriaceae RepID=YDAT_ECOLI
Length = 140

Score = 56.8 bits (135), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/93 (27%), Positives = 35/93 (37%), Gaps = 17/93 (18%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALK--ERPNIDVQRVTYEGGAIDQR 58
          MKI E L A G Q ITE + E P + D
Sbjct: 1  MKIKHEHIESVLFALAAEKG--QAWVANAITEEYLRQGGGELPLVP---------GKDW- 48

Query: 59 ALGVNRVKIFERWKAIDTRDKREKFTALVPAIM 91
          N+ I+ RW +T+ +REK L+PAI+
Sbjct: 49 ---NNQQNIYHRWLKGETKTQREKIQKLIPAIL 78

>UniRef50_C4SVT0 tRNA-(Guanine-N1)-methyltransferase n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SVT0_YERFR
Length = 146

Score = 53.0 bits (125), Expect = 3e-06, Method: Composition-based stats.
Identities = 25/96 (26%), Positives = 38/96 (39%), Gaps = 10/96 (10%)

Query: 1  MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAIDQRAL 60
          MK+ + L W QE I +A++ L + + V D+ A
Sbjct: 1  MKLKHDAICAELRGWAA--ETKQEIVAAEIAQAYFVLGGGD-LPLTPVN------DEHAT 51

Query: 61 GVNRVKIFERWKAIDTRDKREKFTALVPAIMEATTG 96
          N+ ++F RW DT + K L PAI+ A G
Sbjct: 52 HNNKQRLF-RWIDSDTDRSKTKIAELTPAILRALPG 86

>UniRef50_B4T224 Putative uncharacterized protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B4T224_SALNS
Length = 115

Score = 50.6 bits (119), Expect = 2e-05, Method: Composition-based stats.
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 43 IDVQRVTYEGGAIDQRALGVNRVKIFERWKAIDTRDKREKFTALVPAIMEA 93
          + + ++ G D RA+ NR +IF RW +T+ R K AL A+ A
Sbjct: 8  VKLHQMEDSKGNADWRAINNNRQQIF-RWLRGETKAARTKTKALAKAMEAA 57

Database: uniref50.fasta
  Posted date: Mar 8, 2010 10:38 AM
Number of letters in database: 1,040,396,356
Number of sequences in database: 3,077,464

Lambda     K        H
0.301      0.117    0.299

Lambda     K        H
0.267      0.0359   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 464,119,236
Number of Sequences: 3077464
Number of extensions: 12681404
Number of successful extensions: 33574
Number of sequences better than 1.0e-01: 10
Number of HSP's better than 0.1 without gapping: 11
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 33551
Number of HSP's gapped (non-prelim): 21
length of query: 96
length of database: 1,040,396,356
effective HSP length: 65
effective length of query: 31
effective length of database: 840,361,196
effective search space: 26051197076
effective search space used: 26051197076
T: 11
A: 40
X1: 16 ( 6.9 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.9 bits)
S2: 86 (37.9 bits)