BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (258 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_O82899 Uncharacterized protein yubG n=80 Tax=root RepID... 514 e-145 UniRef50_B3X7A3 DNA methyltransferase n=2 Tax=Enterobacteriaceae... 184 2e-45 UniRef50_D2WFG1 Plasmid SOS inhibition protein B n=3 Tax=Escheri... 153 7e-36 UniRef50_A6TIQ6 Putative uncharacterized protein n=8 Tax=Enterob... 137 2e-31 UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escheri... 62 2e-08 UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escheri... 60 6e-08 UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escheri... 60 9e-08 >UniRef50_O82899 Uncharacterized protein yubG n=80 Tax=root RepID=YUBG_ECO57 Length = 258 Score = 514 bits (1325), Expect = e-145, Method: Compositional matrix adjust. Identities = 254/257 (98%), Positives = 255/257 (99%) Query: 1 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRH 60 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRH Sbjct: 1 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRH 60 Query: 61 GGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLTEDEIRAVL 120 GGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLTEDEIRAVL Sbjct: 61 GGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLTEDEIRAVL 120 Query: 121 AAVRDAVRSVSCRVAEDMRRIRAALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENL 180 AAVRDAVRSVSCRVAED RRIRAALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENL Sbjct: 121 AAVRDAVRSVSCRVAEDTRRIRAALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENL 180 Query: 181 PVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREVVR 240 PVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPR+WFDRGVLREVVR Sbjct: 181 PVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVR 240 Query: 241 EARAEIRSMADALAKIR 257 EAR EIRSMADALAKIR Sbjct: 241 EARNEIRSMADALAKIR 257 >UniRef50_B3X7A3 DNA methyltransferase n=2 Tax=Enterobacteriaceae RepID=B3X7A3_SHIDY Length = 287 Score = 184 bits (468), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 93/115 (80%), Positives = 97/115 (84%) Query: 143 AALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV 202 A +T L R + + E HPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV Sbjct: 172 AGSGSTCVAALQSGRRYIGIELLEQYHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV 231 Query: 203 SDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREVVREARAEIRSMADALAKIR 257 SDCIEHILSSGLACDVLRIPDEPPR+WFDRGVLREVVREARAEIRSMADALAKIR Sbjct: 232 SDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVREARAEIRSMADALAKIR 286 >UniRef50_D2WFG1 Plasmid SOS inhibition protein B n=3 Tax=Escherichia coli RepID=D2WFG1_ECOLX Length = 160 Score = 153 bits (386), Expect = 7e-36, Method: Compositional matrix adjust. Identities = 74/79 (93%), Positives = 75/79 (94%) Query: 4 NTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRHGGV 63 N TLNALICRHAR+LLLAQGWPEETDVDQRN KYPGWISIYV LDAPRLATLLVNRH GV Sbjct: 2 NKTLNALICRHARDLLLAQGWPEETDVDQRNLKYPGWISIYVRLDAPRLATLLVNRHDGV 61 Query: 64 LPPHLASAIQKLTGTGAEL 82 LPPHLASAIQKLTGTGAEL Sbjct: 62 LPPHLASAIQKLTGTGAEL 80 >UniRef50_A6TIQ6 Putative uncharacterized protein n=8 Tax=Enterobacteriaceae RepID=A6TIQ6_KLEP7 Length = 274 Score = 137 bits (346), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 14/254 (5%) Query: 1 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPK-YPGWIS-------IYVLLDAPRL 52 M + TLN LI RH +L GWP+ D+ P+ PGW++ + +L +L Sbjct: 20 MMKKETLNTLILRHGDRMLRQAGWPDTVDMAPVAPETVPGWLAACGSLSGVEILTLTEQL 79 Query: 53 ATLLVNRHGGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLT 112 L +L LASA ++L GT A L L Q P AD + PYA EWLT Sbjct: 80 CQPLTYGRAALL---LASA-RRLAGTPARLHLFPVQAYPHPERLADCQVIRLPYAQEWLT 135 Query: 113 EDEIRAVLAAVRDAVRSVSCRVAEDMRRIRAALT-TTGQTLLTRQTRRFRLVVKESDHPC 171 E +LA ++ ++ +S V D +RI AALT + L+ R+ +RL+ E DH Sbjct: 136 AAECDDLLAFLKASLTQISEIVHRDTKRIAAALTPSVTPRLMDRRIGDWRLLAVEYDHDN 195 Query: 172 WLDEDD-ENLPVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRQWF 230 LDED+ + L VLDAIL R ARF V + LV++ E I S+G+ D LR D P R+WF Sbjct: 196 CLDEDETDRLDQVLDAILIRDARFCPVLLTLVNEREETIRSAGVIADQLRFTDTPVRRWF 255 Query: 231 DRGVLREVVREARA 244 DR VLRE+VREARA Sbjct: 256 DRRVLREMVREARA 269 >UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=B9K6E8_ECOLX Length = 184 Score = 62.0 bits (149), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 27/46 (58%), Positives = 35/46 (76%) Query: 193 RFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREV 238 R + + + S+C+EHIL+SGL CDVLRIP EPPR+ FDR +LRE Sbjct: 9 RETTIPLMPASECVEHILASGLVCDVLRIPGEPPRRLFDRYILREF 54 >UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRQ2_ECOSM Length = 137 Score = 60.5 bits (145), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 26/37 (70%), Positives = 31/37 (83%) Query: 202 VSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREV 238 S+C+EHIL+SGL CDVLRIP EPPR+ FDR +LRE Sbjct: 3 ASECVEHILASGLVCDVLRIPGEPPRRLFDRYILREF 39 >UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escherichia coli APEC O1 RepID=Q27TL7_ECOK1 Length = 86 Score = 59.7 bits (143), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 26/37 (70%), Positives = 31/37 (83%) Query: 202 VSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREV 238 S+C+EHIL+SGL CDVLRIP EPPR+ FDR +LRE Sbjct: 3 ASECVEHILASGLVCDVLRIPGEPPRRLFDRYILREF 39 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_O82899 Uncharacterized protein yubG n=80 Tax=root RepID... 432 e-120 UniRef50_A6TIQ6 Putative uncharacterized protein n=8 Tax=Enterob... 317 2e-85 UniRef50_B3X7A3 DNA methyltransferase n=2 Tax=Enterobacteriaceae... 201 3e-50 UniRef50_D2WFG1 Plasmid SOS inhibition protein B n=3 Tax=Escheri... 146 8e-34 UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escheri... 80 7e-14 UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escheri... 72 2e-11 UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escheri... 71 3e-11 Sequences not found previously or not previously below threshold: UniRef50_C7Q7M8 Putative sensor with HAMP domain n=1 Tax=Catenul... 44 0.005 UniRef50_C7Q6K7 Histidine kinase n=1 Tax=Catenulispora acidiphil... 40 0.087 CONVERGED! >UniRef50_O82899 Uncharacterized protein yubG n=80 Tax=root RepID=YUBG_ECO57 Length = 258 Score = 432 bits (1112), Expect = e-120, Method: Composition-based stats. Identities = 254/257 (98%), Positives = 255/257 (99%) Query: 1 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRH 60 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRH Sbjct: 1 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRH 60 Query: 61 GGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLTEDEIRAVL 120 GGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLTEDEIRAVL Sbjct: 61 GGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLTEDEIRAVL 120 Query: 121 AAVRDAVRSVSCRVAEDMRRIRAALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENL 180 AAVRDAVRSVSCRVAED RRIRAALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENL Sbjct: 121 AAVRDAVRSVSCRVAEDTRRIRAALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENL 180 Query: 181 PVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREVVR 240 PVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPR+WFDRGVLREVVR Sbjct: 181 PVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVR 240 Query: 241 EARAEIRSMADALAKIR 257 EAR EIRSMADALAKIR Sbjct: 241 EARNEIRSMADALAKIR 257 >UniRef50_A6TIQ6 Putative uncharacterized protein n=8 Tax=Enterobacteriaceae RepID=A6TIQ6_KLEP7 Length = 274 Score = 317 bits (813), Expect = 2e-85, Method: Composition-based stats. Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 14/254 (5%) Query: 1 MNENTTLNALICRHARNLLLAQGWPEETDVDQRNPK-YPGWIS-------IYVLLDAPRL 52 M + TLN LI RH +L GWP+ D+ P+ PGW++ + +L +L Sbjct: 20 MMKKETLNTLILRHGDRMLRQAGWPDTVDMAPVAPETVPGWLAACGSLSGVEILTLTEQL 79 Query: 53 ATLLVNRHGGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFPYAGEWLT 112 L +L LASA ++L GT A L L Q P AD + PYA EWLT Sbjct: 80 CQPLTYGRAALL---LASA-RRLAGTPARLHLFPVQAYPHPERLADCQVIRLPYAQEWLT 135 Query: 113 EDEIRAVLAAVRDAVRSVSCRVAEDMRRIRAALT-TTGQTLLTRQTRRFRLVVKESDHPC 171 E +LA ++ ++ +S V D +RI AALT + L+ R+ +RL+ E DH Sbjct: 136 AAECDDLLAFLKASLTQISEIVHRDTKRIAAALTPSVTPRLMDRRIGDWRLLAVEYDHDN 195 Query: 172 WLDEDD-ENLPVVLDAILNRGARFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRQWF 230 LDED+ + L VLDAIL R ARF V + LV++ E I S+G+ D LR D P R+WF Sbjct: 196 CLDEDETDRLDQVLDAILIRDARFCPVLLTLVNEREETIRSAGVIADQLRFTDTPVRRWF 255 Query: 231 DRGVLREVVREARA 244 DR VLRE+VREARA Sbjct: 256 DRRVLREMVREARA 269 >UniRef50_B3X7A3 DNA methyltransferase n=2 Tax=Enterobacteriaceae RepID=B3X7A3_SHIDY Length = 287 Score = 201 bits (510), Expect = 3e-50, Method: Composition-based stats. Identities = 93/115 (80%), Positives = 97/115 (84%) Query: 143 AALTTTGQTLLTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV 202 A +T L R + + E HPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV Sbjct: 172 AGSGSTCVAALQSGRRYIGIELLEQYHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV 231 Query: 203 SDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREVVREARAEIRSMADALAKIR 257 SDCIEHILSSGLACDVLRIPDEPPR+WFDRGVLREVVREARAEIRSMADALAKIR Sbjct: 232 SDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVREARAEIRSMADALAKIR 286 >UniRef50_D2WFG1 Plasmid SOS inhibition protein B n=3 Tax=Escherichia coli RepID=D2WFG1_ECOLX Length = 160 Score = 146 bits (368), Expect = 8e-34, Method: Composition-based stats. Identities = 76/101 (75%), Positives = 81/101 (80%) Query: 4 NTTLNALICRHARNLLLAQGWPEETDVDQRNPKYPGWISIYVLLDAPRLATLLVNRHGGV 63 N TLNALICRHAR+LLLAQGWPEETDVDQRN KYPGWISIYV LDAPRLATLLVNRH GV Sbjct: 2 NKTLNALICRHARDLLLAQGWPEETDVDQRNLKYPGWISIYVRLDAPRLATLLVNRHDGV 61 Query: 64 LPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSF 104 LPPHLASAIQKLTGTGAEL + + + L + G Sbjct: 62 LPPHLASAIQKLTGTGAELGFTPAHERFHLALCSPGDVSQL 102 >UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=B9K6E8_ECOLX Length = 184 Score = 80.1 bits (196), Expect = 7e-14, Method: Composition-based stats. Identities = 27/46 (58%), Positives = 35/46 (76%) Query: 193 RFSAVEMYLVSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREV 238 R + + + S+C+EHIL+SGL CDVLRIP EPPR+ FDR +LRE Sbjct: 9 RETTIPLMPASECVEHILASGLVCDVLRIPGEPPRRLFDRYILREF 54 >UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRQ2_ECOSM Length = 137 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 26/39 (66%), Positives = 31/39 (79%) Query: 200 YLVSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREV 238 S+C+EHIL+SGL CDVLRIP EPPR+ FDR +LRE Sbjct: 1 MPASECVEHILASGLVCDVLRIPGEPPRRLFDRYILREF 39 >UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escherichia coli APEC O1 RepID=Q27TL7_ECOK1 Length = 86 Score = 71.3 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 26/39 (66%), Positives = 31/39 (79%) Query: 200 YLVSDCIEHILSSGLACDVLRIPDEPPRQWFDRGVLREV 238 S+C+EHIL+SGL CDVLRIP EPPR+ FDR +LRE Sbjct: 1 MPASECVEHILASGLVCDVLRIPGEPPRRLFDRYILREF 39 >UniRef50_C7Q7M8 Putative sensor with HAMP domain n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q7M8_CATAD Length = 1000 Score = 43.9 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 41/178 (23%), Positives = 67/178 (37%), Gaps = 8/178 (4%) Query: 46 LLDAPRLATLLVNRHGGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQVSFP 105 L+ + RL ++ G+ A A + L L +GS P AD P Sbjct: 318 LMVSLRLGRPIIKGLRGLRDGAHAVAFEDLPAAVQALRSTGSLTGQTPEEFADEAAGGLP 377 Query: 106 YAGEWLTEDEIRAVLAAVRDAVRSVSCRVAEDMRRIRAALTTTGQTLLTRQTRRFRLVVK 165 G+ RA A R+A+R+ + +V +RA + L R R + Sbjct: 378 VKGDDEIAAVARAFNAVRREAIRTAAEQVL-----LRAGVGAAFVALARRGERLTGALTL 432 Query: 166 ESDHPCWLDEDDENLPV--VLDAILNRGARFSAVEMYLVSDCIEHILSSGL-ACDVLR 220 E D ++D + L VLD + R R + + L + + G+ DV+R Sbjct: 433 ELDKAERDEQDPDRLARLFVLDHLAARMTRNNESLLVLGGEGTVRVRDEGVPLIDVIR 490 >UniRef50_C7Q6K7 Histidine kinase n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q6K7_CATAD Length = 863 Score = 39.7 bits (91), Expect = 0.087, Method: Composition-based stats. Identities = 44/188 (23%), Positives = 74/188 (39%), Gaps = 9/188 (4%) Query: 42 SIYVLLDAPRLATLLVNRHGGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQ 101 ++ V++ + RL ++ + A A + L EL + S P A+ Sbjct: 356 ALLVIMISLRLGRPMIRGLRQLRDAAHAVATEGLPAAVEELRTTDSLTGRTPEEFAEQAG 415 Query: 102 VSFPYA---GEWLTEDEIRAVLAAVRDAVRSV---SCRVAEDMRRIRAALTTTGQTLLTR 155 + P G+ L D+ + LAAV A SV + R A + +RA + T L R Sbjct: 416 DALPLGAGRGDALNADDPKDELAAVARAFNSVHREALRTAAEQVLLRAGVGATFVALARR 475 Query: 156 QTRRFRLVVKESDHPCWLDEDDENLPV--VLDAILNRGARFSAVEMYLVSDCIEHILSSG 213 R + E D ++D + L VLD + R AR + + L + + Sbjct: 476 GERLTGALTSELDRAERSEQDPDRLARLFVLDHLAARMARNNESLLVLGGEGSARVRERA 535 Query: 214 L-ACDVLR 220 + DV+R Sbjct: 536 VTLLDVVR 543 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.134 0.381 Lambda K H 0.267 0.0412 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 988,913,634 Number of Sequences: 3077464 Number of extensions: 38336467 Number of successful extensions: 116218 Number of sequences better than 1.0e-01: 12 Number of HSP's better than 0.1 without gapping: 14 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 116195 Number of HSP's gapped (non-prelim): 20 length of query: 258 length of database: 1,040,396,356 effective HSP length: 126 effective length of query: 132 effective length of database: 652,635,892 effective search space: 86147937744 effective search space used: 86147937744 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 91 (39.7 bits)