BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (297 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P11072 Bacteriophage T4 late gene expression-blocking p... 615 e-175 UniRef50_A8H3S3 Cell death peptidase, inhibitor of T4 late gene ... 145 2e-33 UniRef50_Q5HXY2 Putative uncharacterized protein n=1 Tax=Glucono... 60 9e-08 UniRef50_D0L0N2 Peptidase U49, Lit peptidase n=1 Tax=Halothiobac... 52 3e-05 UniRef50_Q31IR0 Putative uncharacterized protein n=1 Tax=Thiomic... 49 2e-04 UniRef50_A6GL96 Putative uncharacterized protein n=1 Tax=Limnoba... 48 4e-04 UniRef50_A5FFH2 Putative uncharacterized protein n=1 Tax=Flavoba... 45 0.003 UniRef50_Q858S4 Putative lysogenic conversion protein n=1 Tax=En... 42 0.019 >UniRef50_P11072 Bacteriophage T4 late gene expression-blocking protein n=2 Tax=Escherichia coli RepID=LIT_ECOLI Length = 297 Score = 615 bits (1586), Expect = e-175, Method: Compositional matrix adjust. Identities = 297/297 (100%), Positives = 297/297 (100%) Query: 1 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE 60 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE Sbjct: 1 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE 60 Query: 61 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT 120 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT Sbjct: 61 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT 120 Query: 121 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE Sbjct: 121 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 Query: 181 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY 240 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY Sbjct: 181 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY 240 Query: 241 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN 297 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN Sbjct: 241 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN 297 >UniRef50_A8H3S3 Cell death peptidase, inhibitor of T4 late gene expression n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H3S3_SHEPA Length = 276 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 96/278 (34%), Positives = 153/278 (55%), Gaps = 17/278 (6%) Query: 18 KIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNF 77 ++APE E L + + + + E GF RV + + I + LEY+W FS Sbjct: 7 RVAPENESKLVELSNKHDFSLLFLKESGFTFRV----NTKTKVIKIPYGGLEYLWTFSLK 62 Query: 78 FWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTGCESW-PKKCPKPEAY 136 W+ Q Y+++Q N +++ DL N + +L ++ +++L ES+ P K A Sbjct: 63 AWLLYQAYAQAQLNGEQNLDLETINGYAGTSQLTEYLKESLY---AESYDPNK--NFSAL 117 Query: 137 LQGSE-DSQVASEIFLCAIAWILHHEISHVVLQHP-LVTTAFSTQEEREADSHATKWILG 194 + E D +VA+EIFLCA+ WI+ HEI+H+ L H L S QEE++AD ++T WIL Sbjct: 118 ISVEEIDQEVATEIFLCALGWIIWHEIAHIELGHSSLEINTLSIQEEKDADLYSTNWILS 177 Query: 195 NLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPVGNEELI 254 + A E KKR +GI A+L IQSLE+++ C + THP A RI+ N+S + +E+ Sbjct: 178 S-STIAAESKKRIVGITIALLAIQSLELDSKSCFKGTHPDASNRIFDNLSQHSEIGDEIS 236 Query: 255 EALCTVMLQYLFHGKNINVN-LDGESFSSILGDLLCDI 291 +A+ V LQ + I++ + +FS +LG+ L I Sbjct: 237 QAMSIVTLQSM---TKIDIAPMSDLNFSDMLGEALYQI 271 >UniRef50_Q5HXY2 Putative uncharacterized protein n=1 Tax=Gluconobacter oxydans RepID=Q5HXY2_GLUOX Length = 297 Score = 60.1 bits (144), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 52/210 (24%), Positives = 92/210 (43%), Gaps = 13/210 (6%) Query: 61 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT 120 I + +LE +W + F + ++ + L + L+K + L A ++ Sbjct: 68 ITMKWWALEVLWLTAFVFQNLGARLWRHAEDGNTSSTLEDEVHLRKGRDALAHAANLIEN 127 Query: 121 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAF--ST 178 +WP+ P P + +DS+ +++FL A+ WI HE++H+ + + + A S Sbjct: 128 RESRNWPENIPHP---AENKKDSERTNKVFLKALGWIELHEVAHITVDESIFSGAVNPSI 184 Query: 179 QEEREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYER 238 EE DS+AT W+L + P ++ G+A A + EV THP + R Sbjct: 185 AEEEYCDSYATNWVLAGNGKKLPSQRE---GVAVATFFLVLREVLRGRT-GPTHPNSQSR 240 Query: 239 I-YSNISCYPVGNEELIEALCTVMLQYLFH 267 + S IS G + + +MLQ H Sbjct: 241 VAASQISDRHGG---WLAIIIDIMLQSGGH 267 >UniRef50_D0L0N2 Peptidase U49, Lit peptidase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0N2_HALNC Length = 309 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 62/299 (20%), Positives = 117/299 (39%), Gaps = 38/299 (12%) Query: 20 APEKEQDLKTIVDDKKIIISVVSEP-GFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFF 78 APE++Q++K + + + VV + G NI K H+ +L+ IW Sbjct: 20 APERKQEIKDLWNHYAPKVCVVEDARGVNISAGKGRIQFDHK------TLKAIWLLGFNG 73 Query: 79 WVFTQEYSKS----------------QKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTG 122 W + YS + + F++ ++R + +++ + ++ TT Sbjct: 74 WRSIETYSPAIILAGITDGAIEDILCADDELAAFEMDYRSRANSARSIIE-EKSSVHTT- 131 Query: 123 CESWPKKCPKPEAYLQ--GSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 WP+ P+PEA + GS VA ++ A A++ HE+ HV A +E Sbjct: 132 ---WPQDVPRPEADRETLGSHQEMVAFDLVCLATAYVFLHELRHVKFLCDGDCPADRREE 188 Query: 181 EREADSHATKWILGNLYESAPE--------LKKRALGIATAVLCIQSLEVENYFCLQNTH 232 E D A ++ + A L KR++GIA + + + E+ + Sbjct: 189 EIACDVWARSFLTDKVESYAQSVGQAFSRVLDKRSMGIALGAMILHEITPESARWGTAEY 248 Query: 233 PAAYERIYSNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDI 291 P RI + IS + E +L +F + + + S ++ L+ D+ Sbjct: 249 PPITTRIQAMISGSTLAKESHFWLFTACLLVGIFRQAHRQLPMYAPSTHELVEQLITDL 307 >UniRef50_Q31IR0 Putative uncharacterized protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31IR0_THICR Length = 307 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 31/122 (25%), Positives = 52/122 (42%), Gaps = 8/122 (6%) Query: 125 SWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQEEREA 184 +WP PKPE + ++ L A A++ HE+ HV+ + + +EE E Sbjct: 128 NWPAGLPKPEDGKPKDTEQAAVFDLVLMATAYVFLHELKHVIFESEGNSPEDRLREELEC 187 Query: 185 DSHATKWILGNL--------YESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAY 236 DS A + +L + Y L KR++ IA + + + THPA + Sbjct: 188 DSFALEMMLSKIQDYSAKSGYPEGQVLMKRSISIALGSVFLAVATPRHNLGGTTTHPAVH 247 Query: 237 ER 238 +R Sbjct: 248 DR 249 >UniRef50_A6GL96 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GL96_9BURK Length = 309 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 64/272 (23%), Positives = 119/272 (43%), Gaps = 43/272 (15%) Query: 44 PGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVFTQE----------YSKSQKNND 93 P N++ R EI + + L Y+WAF +V +E +S S + N+ Sbjct: 55 PHANVKTR--------EITIYESHLAYLWAFVYSSFVLYEEGIQKPMLAGTFSGSLEFNN 106 Query: 94 EHFDLTGKNRLKKSDELLKWARKNLQTTGCES-WP-KKCPKPEAYLQGSEDSQV--ASEI 149 + L+++ L WA K ++ C S W + P P +E V + + Sbjct: 107 --------SLLQRAAALQSWAIKFVR---CYSDWSIDELPNPAKVESEAEQFYVPKVNSL 155 Query: 150 FLCAIAWILHHEISHVVLQHPL-VTTAFSTQEEREADSHATKWILGNLYESAPELKKRAL 208 FL A+ ++L HE H+VL H + ++ +E++AD++AT + + E ++R + Sbjct: 156 FLQAVNFLLFHEYGHLVLGHVVNEDKDWTLDQEKDADNYATTFFIE---AGTNESERRFV 212 Query: 209 GIATAVLCIQSLEVENYF--CLQNTHPAAYERIYSNISCYPVGNEE---LIEALCTVMLQ 263 G++ +L + + + Q HP ++RI + IS + EE I L ++ LQ Sbjct: 213 GVSIVLLLVSCVFIPEKISGLWQVKHPHLHDRIRNGISSLNLEEEESKFYIYYLASIALQ 272 Query: 264 YLFHGKNIN-VNLDGESFSSILGDLLCDISRL 294 K ++ L+ E+ + + L I Sbjct: 273 KYLLEKGVDCAQLEIETAEELFFEYLARIDEF 304 >UniRef50_A5FFH2 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FFH2_FLAJ1 Length = 312 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 57/211 (27%), Positives = 90/211 (42%), Gaps = 34/211 (16%) Query: 68 LEYIWAFSNFFWVFTQE-YSKSQKNNDEHFDLTGKNRLKK---SDELLKWARKNLQ---- 119 L YIW +F V +E Y+ +L K ++ SDELL A Sbjct: 78 LSYIWINCYYFVVLHEEKYALP--------NLIDKGEMESRPYSDELLSEAEDLFSYALT 129 Query: 120 -TTGCESWPKKC-PKPEAYLQGSEDSQV---ASEIFLCAIAWILHHEISHVVLQH--PLV 172 TG W K+ P PE + + S +++F+ + +IL+HE +H QH + Sbjct: 130 LVTGFTDWDKETLPNPEYFDEESPQGNYILHVNDLFVEVLNFILYHETAHAEFQHIKKIK 189 Query: 173 TTAFSTQE----EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCL 228 S +E E EADS A + ++ N + +K + IA + I L ++N Sbjct: 190 EGKLSNEEIKDLEIEADSRAIELLIQN----SKNRQKAEIAIAMGLASI--LFIKNSLKG 243 Query: 229 QNTHPAAYERIYSNISCY-PVGNEELIEALC 258 +THP +RI + I P + E+ LC Sbjct: 244 GSTHPDVDQRIENAIEILQPSEDSEIWTTLC 274 >UniRef50_Q858S4 Putative lysogenic conversion protein n=1 Tax=Enterobacteria phage P2-EC46 RepID=Q858S4_BPP2 Length = 333 Score = 42.4 bits (98), Expect = 0.019, Method: Compositional matrix adjust. Identities = 32/135 (23%), Positives = 58/135 (42%), Gaps = 16/135 (11%) Query: 144 QVASEIFLCAIAWILHHEISHVVLQHPLV-------TTAFSTQEEREADSHATKWILGN- 195 + +E+ + A AW HE+ H++ Q T + EE D ATK+IL + Sbjct: 186 RATAELAIIAAAWAFLHEVRHIIHQQEGTSYDMNEFTQEQAHNEEFSCDEFATKFILDHI 245 Query: 196 -------LYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPV 248 +Y+ +KR L I A+ + L N+ +HP+ +RI + Sbjct: 246 DNYCEESMYDRVLVSRKRRLSIYCALFSVTMLGKNNW-GFSKSHPSLQDRINKVKALMKE 304 Query: 249 GNEELIEALCTVMLQ 263 ++E++E + M + Sbjct: 305 PDDEVLEYIVETMFK 319 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P11072 Bacteriophage T4 late gene expression-blocking p... 406 e-112 UniRef50_D0L0N2 Peptidase U49, Lit peptidase n=1 Tax=Halothiobac... 301 2e-80 UniRef50_A8H3S3 Cell death peptidase, inhibitor of T4 late gene ... 265 2e-69 UniRef50_Q5HXY2 Putative uncharacterized protein n=1 Tax=Glucono... 215 1e-54 UniRef50_A6GL96 Putative uncharacterized protein n=1 Tax=Limnoba... 213 4e-54 UniRef50_Q31IR0 Putative uncharacterized protein n=1 Tax=Thiomic... 170 7e-41 Sequences not found previously or not previously below threshold: UniRef50_A5FFH2 Putative uncharacterized protein n=1 Tax=Flavoba... 81 3e-14 UniRef50_Q858S4 Putative lysogenic conversion protein n=1 Tax=En... 67 8e-10 UniRef50_C3R9B2 Predicted protein n=2 Tax=Bacteroides RepID=C3R9... 66 1e-09 UniRef50_C5VJH7 Putative uncharacterized protein n=2 Tax=Prevote... 47 8e-04 UniRef50_A9A7F0 Putative uncharacterized protein n=1 Tax=Methano... 44 0.008 UniRef50_C4Z256 Putative uncharacterized protein n=3 Tax=Bacteri... 42 0.031 >UniRef50_P11072 Bacteriophage T4 late gene expression-blocking protein n=2 Tax=Escherichia coli RepID=LIT_ECOLI Length = 297 Score = 406 bits (1042), Expect = e-112, Method: Composition-based stats. Identities = 297/297 (100%), Positives = 297/297 (100%) Query: 1 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE 60 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE Sbjct: 1 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE 60 Query: 61 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT 120 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT Sbjct: 61 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT 120 Query: 121 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE Sbjct: 121 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 Query: 181 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY 240 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY Sbjct: 181 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY 240 Query: 241 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN 297 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN Sbjct: 241 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN 297 >UniRef50_D0L0N2 Peptidase U49, Lit peptidase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0N2_HALNC Length = 309 Score = 301 bits (770), Expect = 2e-80, Method: Composition-based stats. Identities = 62/299 (20%), Positives = 117/299 (39%), Gaps = 38/299 (12%) Query: 20 APEKEQDLKTIVDDKKIIISVVSEP-GFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFF 78 APE++Q++K + + + VV + G NI K H+ +L+ IW Sbjct: 20 APERKQEIKDLWNHYAPKVCVVEDARGVNISAGKGRIQFDHK------TLKAIWLLGFNG 73 Query: 79 WVFTQEYSKS----------------QKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTG 122 W + YS + + F++ ++R + +++ + ++ TT Sbjct: 74 WRSIETYSPAIILAGITDGAIEDILCADDELAAFEMDYRSRANSARSIIE-EKSSVHTT- 131 Query: 123 CESWPKKCPKPEAYLQ--GSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 WP+ P+PEA + GS VA ++ A A++ HE+ HV A +E Sbjct: 132 ---WPQDVPRPEADRETLGSHQEMVAFDLVCLATAYVFLHELRHVKFLCDGDCPADRREE 188 Query: 181 EREADSHATKWILGNLYESAPE--------LKKRALGIATAVLCIQSLEVENYFCLQNTH 232 E D A ++ + A L KR++GIA + + + E+ + Sbjct: 189 EIACDVWARSFLTDKVESYAQSVGQAFSRVLDKRSMGIALGAMILHEITPESARWGTAEY 248 Query: 233 PAAYERIYSNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDI 291 P RI + IS + E +L +F + + + S ++ L+ D+ Sbjct: 249 PPITTRIQAMISGSTLAKESHFWLFTACLLVGIFRQAHRQLPMYAPSTHELVEQLITDL 307 >UniRef50_A8H3S3 Cell death peptidase, inhibitor of T4 late gene expression n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H3S3_SHEPA Length = 276 Score = 265 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 96/279 (34%), Positives = 154/279 (55%), Gaps = 17/279 (6%) Query: 18 KIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNF 77 ++APE E L + + + + E GF RV + + I + LEY+W FS Sbjct: 7 RVAPENESKLVELSNKHDFSLLFLKESGFTFRV----NTKTKVIKIPYGGLEYLWTFSLK 62 Query: 78 FWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTGCESW-PKKCPKPEAY 136 W+ Q Y+++Q N +++ DL N + +L ++ +++L ES+ P K A Sbjct: 63 AWLLYQAYAQAQLNGEQNLDLETINGYAGTSQLTEYLKESLY---AESYDPNK--NFSAL 117 Query: 137 LQGSE-DSQVASEIFLCAIAWILHHEISHVVLQHP-LVTTAFSTQEEREADSHATKWILG 194 + E D +VA+EIFLCA+ WI+ HEI+H+ L H L S QEE++AD ++T WIL Sbjct: 118 ISVEEIDQEVATEIFLCALGWIIWHEIAHIELGHSSLEINTLSIQEEKDADLYSTNWILS 177 Query: 195 NLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPVGNEELI 254 + A E KKR +GI A+L IQSLE+++ C + THP A RI+ N+S + +E+ Sbjct: 178 S-STIAAESKKRIVGITIALLAIQSLELDSKSCFKGTHPDASNRIFDNLSQHSEIGDEIS 236 Query: 255 EALCTVMLQYLFHGKNINV-NLDGESFSSILGDLLCDIS 292 +A+ V LQ + I++ + +FS +LG+ L I+ Sbjct: 237 QAMSIVTLQSM---TKIDIAPMSDLNFSDMLGEALYQIN 272 >UniRef50_Q5HXY2 Putative uncharacterized protein n=1 Tax=Gluconobacter oxydans RepID=Q5HXY2_GLUOX Length = 297 Score = 215 bits (548), Expect = 1e-54, Method: Composition-based stats. Identities = 55/250 (22%), Positives = 105/250 (42%), Gaps = 16/250 (6%) Query: 23 KEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVFT 82 + L++ ++I + + + G + + I + +LE +W + F Sbjct: 33 NARALRSFHGRQEIKVGISEDSGGDAFACQPSLAT---ITMKWWALEVLWLTAFVFQNLG 89 Query: 83 QEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTGCESWPKKCPKPEAYLQGSED 142 + ++ + L + L+K + L A ++ +WP+ P P + +D Sbjct: 90 ARLWRHAEDGNTSSTLEDEVHLRKGRDALAHAANLIENRESRNWPENIPHPA---ENKKD 146 Query: 143 SQVASEIFLCAIAWILHHEISHVVLQHPLVTTAF--STQEEREADSHATKWILGNLYESA 200 S+ +++FL A+ WI HE++H+ + + + A S EE DS+AT W+L + Sbjct: 147 SERTNKVFLKALGWIELHEVAHITVDESIFSGAVNPSIAEEEYCDSYATNWVLAGNGKKL 206 Query: 201 PELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERI-YSNISCYPVGNEELIEALCT 259 P ++ G+A A + EV THP + R+ S IS G + + Sbjct: 207 PSQRE---GVAVATFFLVLREVLRGRTG-PTHPNSQSRVAASQISDRHGG---WLAIIID 259 Query: 260 VMLQYLFHGK 269 +MLQ H Sbjct: 260 IMLQSGGHST 269 >UniRef50_A6GL96 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GL96_9BURK Length = 309 Score = 213 bits (543), Expect = 4e-54, Method: Composition-based stats. Identities = 64/272 (23%), Positives = 119/272 (43%), Gaps = 43/272 (15%) Query: 44 PGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVFTQE----------YSKSQKNND 93 P N++ R EI + + L Y+WAF +V +E +S S + N+ Sbjct: 55 PHANVKTR--------EITIYESHLAYLWAFVYSSFVLYEEGIQKPMLAGTFSGSLEFNN 106 Query: 94 EHFDLTGKNRLKKSDELLKWARKNLQTTGCES-WP-KKCPKPEAYLQGSEDSQV--ASEI 149 + L+++ L WA K ++ C S W + P P +E V + + Sbjct: 107 --------SLLQRAAALQSWAIKFVR---CYSDWSIDELPNPAKVESEAEQFYVPKVNSL 155 Query: 150 FLCAIAWILHHEISHVVLQHPL-VTTAFSTQEEREADSHATKWILGNLYESAPELKKRAL 208 FL A+ ++L HE H+VL H + ++ +E++AD++AT + + E ++R + Sbjct: 156 FLQAVNFLLFHEYGHLVLGHVVNEDKDWTLDQEKDADNYATTFFIE---AGTNESERRFV 212 Query: 209 GIATAVLCIQSLEVENYF--CLQNTHPAAYERIYSNISCYPVGNEE---LIEALCTVMLQ 263 G++ +L + + + Q HP ++RI + IS + EE I L ++ LQ Sbjct: 213 GVSIVLLLVSCVFIPEKISGLWQVKHPHLHDRIRNGISSLNLEEEESKFYIYYLASIALQ 272 Query: 264 YLFHGKNIN-VNLDGESFSSILGDLLCDISRL 294 K ++ L+ E+ + + L I Sbjct: 273 KYLLEKGVDCAQLEIETAEELFFEYLARIDEF 304 >UniRef50_Q31IR0 Putative uncharacterized protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31IR0_THICR Length = 307 Score = 170 bits (429), Expect = 7e-41, Method: Composition-based stats. Identities = 56/297 (18%), Positives = 104/297 (35%), Gaps = 27/297 (9%) Query: 19 IAPEKEQDLKTIVDDKKIIISVVSE-PGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNF 77 + PE+ ++ ++D V + GFN+ + I T SLE +W F Sbjct: 14 VIPERLDEVLDLIDIHSAQFRRVGDKSGFNLNAGAYGA-----IQFTQRSLEQLWLFGFA 68 Query: 78 FWVFTQEYSKSQKN---NDEHFDLTGKNRLKKSDELLKWARKNLQTTGC---------ES 125 S DL + + K E + K ++T + Sbjct: 69 GLYALHSLSGIIFFVKCKGLKLDLEEIDGVPKQKEENERFSKIIETIKGLNSAESEYDFN 128 Query: 126 WPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQEEREAD 185 WP PKPE + ++ L A A++ HE+ HV+ + + +EE E D Sbjct: 129 WPAGLPKPEDGKPKDTEQAAVFDLVLMATAYVFLHELKHVIFESEGNSPEDRLREELECD 188 Query: 186 SHATKWILGNL--------YESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYE 237 S A + +L + Y L KR++ IA + + + THPA ++ Sbjct: 189 SFALEMMLSKIQDYSAKSGYPEGQVLMKRSISIALGSVFLAVATPRHNLGGTTTHPAVHD 248 Query: 238 RIYSNISCYPVGNEELIE-ALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISR 293 R + + + + ++ + L H K + S+ + + + Sbjct: 249 RWSATLGKIELEENDFYWLYFASLAIALLKHMKIVFPAQTVVSYKQVAMAAIKALEE 305 >UniRef50_A5FFH2 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FFH2_FLAJ1 Length = 312 Score = 81.4 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 49/249 (19%), Positives = 91/249 (36%), Gaps = 23/249 (9%) Query: 61 IVLTVASLEYIWAFSNFFWVFTQE---YSKSQKNNDEHFDLTGKNRLKKSDELLKWARKN 117 I + L YIW +F V +E + L ++++L +A Sbjct: 71 INIHETFLSYIWINCYYFVVLHEEKYALPNLIDKGEMESRPYSDELLSEAEDLFSYALTL 130 Query: 118 LQTTGCESWPKK-CPKPEAYLQGSEDSQV---ASEIFLCAIAWILHHEISHVVLQHPLVT 173 + TG W K+ P PE + + S +++F+ + +IL+HE +H QH Sbjct: 131 V--TGFTDWDKETLPNPEYFDEESPQGNYILHVNDLFVEVLNFILYHETAHAEFQHIKKI 188 Query: 174 TAFSTQEER------EADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFC 227 E EADS A + ++ ++ IA A+ L ++N Sbjct: 189 KEGKLSNEEIKDLEIEADSRAIELLIQ------NSKNRQKAEIAIAMGLASILFIKNSLK 242 Query: 228 LQNTHPAAYERIYSNISCYPVGNEELIEALCTVMLQYLFHGKNINV--NLDGESFSSILG 285 +THP +RI + I + I + ++ + + N + + Sbjct: 243 GGSTHPDVDQRIENAIEILQPSEDSEIWTTLCLFIKTWDRQYGLGLIENRICTTIKDVFY 302 Query: 286 DLLCDISRL 294 DLL ++ Sbjct: 303 DLLEQAKKI 311 >UniRef50_Q858S4 Putative lysogenic conversion protein n=1 Tax=Enterobacteria phage P2-EC46 RepID=Q858S4_BPP2 Length = 333 Score = 66.8 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 64/158 (40%), Gaps = 18/158 (11%) Query: 127 PKKCPKPEAYLQGSED--SQVASEIFLCAIAWILHHEISHVVLQHPL-------VTTAFS 177 P P+P + + +E+ + A AW HE+ H++ Q T + Sbjct: 167 PIGIPEPGTMPDSKIEPVKRATAELAIIAAAWAFLHEVRHIIHQQEGTSYDMNEFTQEQA 226 Query: 178 TQEEREADSHATKWILGNLYESAPE--------LKKRALGIATAVLCIQSLEVENYFCLQ 229 EE D ATK+IL ++ E +KR L I A+ + L +N + Sbjct: 227 HNEEFSCDEFATKFILDHIDNYCEESMYDRVLVSRKRRLSIYCALFSVTMLG-KNNWGFS 285 Query: 230 NTHPAAYERIYSNISCYPVGNEELIEALCTVMLQYLFH 267 +HP+ +RI + ++E++E + M + + Sbjct: 286 KSHPSLQDRINKVKALMKEPDDEVLEYIVETMFKSIAR 323 >UniRef50_C3R9B2 Predicted protein n=2 Tax=Bacteroides RepID=C3R9B2_9BACE Length = 327 Score = 66.4 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 48/285 (16%), Positives = 96/285 (33%), Gaps = 35/285 (12%) Query: 31 VDDKKIIISVVSEPGF--NIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVFTQEYSKS 88 ++ + + + ++ K ++N + I + +Y+W+ + + + Sbjct: 39 SNNFSRRFTFIRDEKAISDVAEIKKDNNKLNHIYINENFCQYLWSVCVYLIAYFE----- 93 Query: 89 QKNNDEHFDLTGKNRLKK------------SDELLKWARKNLQT--TGCESWPKKCPKPE 134 N H + + K ++ R+ L P+ Sbjct: 94 ---NVIHIPMMDAVGINKNGYKPNMIDVEYGNDCFFRGRQLLHNFNRDAYWVTPNICNPQ 150 Query: 135 AYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAF--STQEEREADSHATKWI 192 + A++++ AIA+I HE SH L H + S +E AD A +I Sbjct: 151 QFENIISH---ANDVYCAAIAFIYAHEFSHNYLGHTQIQQTLSRSINDEIAADDMAISFI 207 Query: 193 LGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPVGNEE 252 + K + A L + E+ THP RI + ++ + + Sbjct: 208 QTEYNSAWGRTYKAGIATTLAALLLMG---EDSISGGGTHPDMDVRIENLVTKLELHEMD 264 Query: 253 LIEALCTVMLQ---YLFHGKNINVNLDGESFSSILGDLLCDISRL 294 L+ V L+ +F G +I ++ F S L I +L Sbjct: 265 LLWGYLGVALRLWLLVFDGLSIKEDMQQPGFGSYKEIYLYYIEKL 309 >UniRef50_C5VJH7 Putative uncharacterized protein n=2 Tax=Prevotella RepID=C5VJH7_9BACT Length = 337 Score = 46.8 bits (109), Expect = 8e-04, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 55/154 (35%), Gaps = 5/154 (3%) Query: 129 KCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQEEREADSHA 188 A + + + ++ I +IL HE SH L H L + + Q+E EAD + Sbjct: 172 NLGDFSAIDVNTPYGEKINSVYCFGICFILLHEASHFALGH-LDKVSPAIQDEVEADFSS 230 Query: 189 TKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPV 248 G++Y E +K + L + N HP ERI+ C Sbjct: 231 ----FGSIYSDISENEKFSANCGVLCALFSLLYLNPTIEPDNIHPTEDERIFKVYECIKN 286 Query: 249 GNEELIEALCTVMLQYLFHGKNINVNLDGESFSS 282 N + I L + + I+ + ++ Sbjct: 287 DNPKYIVILVQFFKYWAEIYQIIDFPPNLQNTED 320 >UniRef50_A9A7F0 Putative uncharacterized protein n=1 Tax=Methanococcus maripaludis C6 RepID=A9A7F0_METM6 Length = 353 Score = 43.7 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 45/239 (18%), Positives = 71/239 (29%), Gaps = 45/239 (18%) Query: 22 EKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVF 81 + ++ L+ I + P N R R + + I F + Sbjct: 86 DTKKLLENIWVGHLPV------PQLNARCRYVPNTKTPVIAFYDL------LFGVLSFHA 133 Query: 82 TQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTGCESWPKKCPKPEAYLQGSE 141 Y Q N + ++ +T C + P+ + L Sbjct: 134 ESHYIAHQLNEIDPKLCDFFVDYHY-KVIIDIFNGKRRTIPCYNLPRHIKTSSSLL---- 188 Query: 142 DSQVASEIFLCAIAWILHHEISHVVLQH------------PLVTTA--FSTQEEREADSH 187 A EIFL A HE +H+ L H + FS Q E EAD Sbjct: 189 --ACAQEIFLLA------HEYAHIYLNHLDTVSSFNPVDGSINIKEYCFSKQREFEADLQ 240 Query: 188 ATKWILGNLYESAPELKKRALGIA------TAVLCIQSLEVENYFCLQNTHPAAYERIY 240 A +WI+ + K+ L I + + N THP + ER+ Sbjct: 241 AIRWIINFRNRISNTDKENILMITKNISLVVELFFLFYAIELNCNITSETHPKSKERLQ 299 >UniRef50_C4Z256 Putative uncharacterized protein n=3 Tax=Bacteria RepID=C4Z256_EUBE2 Length = 350 Score = 41.8 bits (96), Expect = 0.031, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 21/38 (55%) Query: 156 WILHHEISHVVLQHPLVTTAFSTQEEREADSHATKWIL 193 + L HE+ H++L H + Q+E++AD + ++ Sbjct: 254 FSLFHELGHIILGHVGKNGGTTEQDEQDADVWSRDELI 291 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P11072 Bacteriophage T4 late gene expression-blocking p... 359 6e-98 UniRef50_D0L0N2 Peptidase U49, Lit peptidase n=1 Tax=Halothiobac... 255 1e-66 UniRef50_Q31IR0 Putative uncharacterized protein n=1 Tax=Thiomic... 248 2e-64 UniRef50_A8H3S3 Cell death peptidase, inhibitor of T4 late gene ... 235 2e-60 UniRef50_C3R9B2 Predicted protein n=2 Tax=Bacteroides RepID=C3R9... 227 3e-58 UniRef50_Q5HXY2 Putative uncharacterized protein n=1 Tax=Glucono... 218 1e-55 UniRef50_A5FFH2 Putative uncharacterized protein n=1 Tax=Flavoba... 197 5e-49 UniRef50_A6GL96 Putative uncharacterized protein n=1 Tax=Limnoba... 181 2e-44 UniRef50_C5VJH7 Putative uncharacterized protein n=2 Tax=Prevote... 149 1e-34 UniRef50_Q858S4 Putative lysogenic conversion protein n=1 Tax=En... 135 2e-30 Sequences not found previously or not previously below threshold: UniRef50_A9A7F0 Putative uncharacterized protein n=1 Tax=Methano... 46 0.001 UniRef50_UPI0001C41EDC peptidase M48 family n=1 Tax=Methanobrevi... 45 0.003 UniRef50_B2IFU6 Putative uncharacterized protein n=1 Tax=Beijeri... 45 0.004 UniRef50_C7R141 Putative uncharacterized protein n=1 Tax=Jonesia... 43 0.015 UniRef50_A1ZNT8 Peptidase, M48 family n=1 Tax=Microscilla marina... 43 0.016 UniRef50_C4Z256 Putative uncharacterized protein n=3 Tax=Bacteri... 43 0.018 >UniRef50_P11072 Bacteriophage T4 late gene expression-blocking protein n=2 Tax=Escherichia coli RepID=LIT_ECOLI Length = 297 Score = 359 bits (921), Expect = 6e-98, Method: Composition-based stats. Identities = 297/297 (100%), Positives = 297/297 (100%) Query: 1 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE 60 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE Sbjct: 1 MRSPICHLFSAINSSPFKIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHE 60 Query: 61 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT 120 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT Sbjct: 61 IVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQT 120 Query: 121 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE Sbjct: 121 TGCESWPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 Query: 181 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY 240 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY Sbjct: 181 EREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIY 240 Query: 241 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN 297 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN Sbjct: 241 SNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISRLTSN 297 >UniRef50_D0L0N2 Peptidase U49, Lit peptidase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0N2_HALNC Length = 309 Score = 255 bits (651), Expect = 1e-66, Method: Composition-based stats. Identities = 62/300 (20%), Positives = 117/300 (39%), Gaps = 38/300 (12%) Query: 20 APEKEQDLKTIVDDKKIIISVVSEP-GFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFF 78 APE++Q++K + + + VV + G NI K H+ +L+ IW Sbjct: 20 APERKQEIKDLWNHYAPKVCVVEDARGVNISAGKGRIQFDHK------TLKAIWLLGFNG 73 Query: 79 WVFTQEYSKSQ----------------KNNDEHFDLTGKNRLKKSDELLKWARKNLQTTG 122 W + YS + + F++ ++R + +++ + ++ TT Sbjct: 74 WRSIETYSPAIILAGITDGAIEDILCADDELAAFEMDYRSRANSARSIIE-EKSSVHTT- 131 Query: 123 CESWPKKCPKPEAYLQ--GSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQE 180 WP+ P+PEA + GS VA ++ A A++ HE+ HV A +E Sbjct: 132 ---WPQDVPRPEADRETLGSHQEMVAFDLVCLATAYVFLHELRHVKFLCDGDCPADRREE 188 Query: 181 EREADSHATKWILGNLYESAPE--------LKKRALGIATAVLCIQSLEVENYFCLQNTH 232 E D A ++ + A L KR++GIA + + + E+ + Sbjct: 189 EIACDVWARSFLTDKVESYAQSVGQAFSRVLDKRSMGIALGAMILHEITPESARWGTAEY 248 Query: 233 PAAYERIYSNISCYPVGNEELIEALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDIS 292 P RI + IS + E +L +F + + + S ++ L+ D+ Sbjct: 249 PPITTRIQAMISGSTLAKESHFWLFTACLLVGIFRQAHRQLPMYAPSTHELVEQLITDLQ 308 >UniRef50_Q31IR0 Putative uncharacterized protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31IR0_THICR Length = 307 Score = 248 bits (632), Expect = 2e-64, Method: Composition-based stats. Identities = 56/297 (18%), Positives = 104/297 (35%), Gaps = 27/297 (9%) Query: 19 IAPEKEQDLKTIVDDKKIIISVVSE-PGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNF 77 + PE+ ++ ++D V + GFN+ + I T SLE +W F Sbjct: 14 VIPERLDEVLDLIDIHSAQFRRVGDKSGFNLNAGAYGA-----IQFTQRSLEQLWLFGFA 68 Query: 78 FWVFTQEYSKSQKN---NDEHFDLTGKNRLKKSDELLKWARKNLQTTGC---------ES 125 S DL + + K E + K ++T + Sbjct: 69 GLYALHSLSGIIFFVKCKGLKLDLEEIDGVPKQKEENERFSKIIETIKGLNSAESEYDFN 128 Query: 126 WPKKCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQEEREAD 185 WP PKPE + ++ L A A++ HE+ HV+ + + +EE E D Sbjct: 129 WPAGLPKPEDGKPKDTEQAAVFDLVLMATAYVFLHELKHVIFESEGNSPEDRLREELECD 188 Query: 186 SHATKWILGNL--------YESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYE 237 S A + +L + Y L KR++ IA + + + THPA ++ Sbjct: 189 SFALEMMLSKIQDYSAKSGYPEGQVLMKRSISIALGSVFLAVATPRHNLGGTTTHPAVHD 248 Query: 238 RIYSNISCYPVGNEELIE-ALCTVMLQYLFHGKNINVNLDGESFSSILGDLLCDISR 293 R + + + + ++ + L H K + S+ + + + Sbjct: 249 RWSATLGKIELEENDFYWLYFASLAIALLKHMKIVFPAQTVVSYKQVAMAAIKALEE 305 >UniRef50_A8H3S3 Cell death peptidase, inhibitor of T4 late gene expression n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H3S3_SHEPA Length = 276 Score = 235 bits (598), Expect = 2e-60, Method: Composition-based stats. Identities = 92/277 (33%), Positives = 149/277 (53%), Gaps = 13/277 (4%) Query: 18 KIAPEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNF 77 ++APE E L + + + + E GF RV + + I + LEY+W FS Sbjct: 7 RVAPENESKLVELSNKHDFSLLFLKESGFTFRV----NTKTKVIKIPYGGLEYLWTFSLK 62 Query: 78 FWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTGCESWPKKCPKPEAYL 137 W+ Q Y+++Q N +++ DL N + +L ++ +++L ES+ Sbjct: 63 AWLLYQAYAQAQLNGEQNLDLETINGYAGTSQLTEYLKESLY---AESYDPNKNFSALIS 119 Query: 138 QGSEDSQVASEIFLCAIAWILHHEISHVVLQHP-LVTTAFSTQEEREADSHATKWILGNL 196 D +VA+EIFLCA+ WI+ HEI+H+ L H L S QEE++AD ++T WIL + Sbjct: 120 VEEIDQEVATEIFLCALGWIIWHEIAHIELGHSSLEINTLSIQEEKDADLYSTNWILSS- 178 Query: 197 YESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPVGNEELIEA 256 A E KKR +GI A+L IQSLE+++ C + THP A RI+ N+S + +E+ +A Sbjct: 179 STIAAESKKRIVGITIALLAIQSLELDSKSCFKGTHPDASNRIFDNLSQHSEIGDEISQA 238 Query: 257 LCTVMLQYLFHGKNINV-NLDGESFSSILGDLLCDIS 292 + V LQ + I++ + +FS +LG+ L I+ Sbjct: 239 MSIVTLQSM---TKIDIAPMSDLNFSDMLGEALYQIN 272 >UniRef50_C3R9B2 Predicted protein n=2 Tax=Bacteroides RepID=C3R9B2_9BACE Length = 327 Score = 227 bits (578), Expect = 3e-58, Method: Composition-based stats. Identities = 48/285 (16%), Positives = 95/285 (33%), Gaps = 35/285 (12%) Query: 31 VDDKKIIISVVSEPGF--NIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVFTQEYSKS 88 ++ + + + ++ K ++N + I + +Y+W+ + + + Sbjct: 39 SNNFSRRFTFIRDEKAISDVAEIKKDNNKLNHIYINENFCQYLWSVCVYLIAYFE----- 93 Query: 89 QKNNDEHFDLTGKNRLKK------------SDELLKWARKNLQT--TGCESWPKKCPKPE 134 N H + + K ++ R+ L P+ Sbjct: 94 ---NVIHIPMMDAVGINKNGYKPNMIDVEYGNDCFFRGRQLLHNFNRDAYWVTPNICNPQ 150 Query: 135 AYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFS--TQEEREADSHATKWI 192 + A++++ AIA+I HE SH L H + S +E AD A +I Sbjct: 151 QFENIISH---ANDVYCAAIAFIYAHEFSHNYLGHTQIQQTLSRSINDEIAADDMAISFI 207 Query: 193 LGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPVGNEE 252 + K + A L + E+ THP RI + ++ + + Sbjct: 208 QTEYNSAWGRTYKAGIATTLAALLLMG---EDSISGGGTHPDMDVRIENLVTKLELHEMD 264 Query: 253 LIEALCTVMLQYL---FHGKNINVNLDGESFSSILGDLLCDISRL 294 L+ V L+ F G +I ++ F S L I +L Sbjct: 265 LLWGYLGVALRLWLLVFDGLSIKEDMQQPGFGSYKEIYLYYIEKL 309 >UniRef50_Q5HXY2 Putative uncharacterized protein n=1 Tax=Gluconobacter oxydans RepID=Q5HXY2_GLUOX Length = 297 Score = 218 bits (556), Expect = 1e-55, Method: Composition-based stats. Identities = 55/250 (22%), Positives = 105/250 (42%), Gaps = 16/250 (6%) Query: 23 KEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVFT 82 + L++ ++I + + + G + + I + +LE +W + F Sbjct: 33 NARALRSFHGRQEIKVGISEDSGGDAFACQPSLAT---ITMKWWALEVLWLTAFVFQNLG 89 Query: 83 QEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTGCESWPKKCPKPEAYLQGSED 142 + ++ + L + L+K + L A ++ +WP+ P P + +D Sbjct: 90 ARLWRHAEDGNTSSTLEDEVHLRKGRDALAHAANLIENRESRNWPENIPHPA---ENKKD 146 Query: 143 SQVASEIFLCAIAWILHHEISHVVLQHPLVTTA--FSTQEEREADSHATKWILGNLYESA 200 S+ +++FL A+ WI HE++H+ + + + A S EE DS+AT W+L + Sbjct: 147 SERTNKVFLKALGWIELHEVAHITVDESIFSGAVNPSIAEEEYCDSYATNWVLAGNGKKL 206 Query: 201 PELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERI-YSNISCYPVGNEELIEALCT 259 P ++ G+A A + EV THP + R+ S IS G + + Sbjct: 207 PSQRE---GVAVATFFLVLREVLRGRTG-PTHPNSQSRVAASQISDRHGG---WLAIIID 259 Query: 260 VMLQYLFHGK 269 +MLQ H Sbjct: 260 IMLQSGGHST 269 >UniRef50_A5FFH2 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FFH2_FLAJ1 Length = 312 Score = 197 bits (499), Expect = 5e-49, Method: Composition-based stats. Identities = 49/249 (19%), Positives = 91/249 (36%), Gaps = 23/249 (9%) Query: 61 IVLTVASLEYIWAFSNFFWVFTQE---YSKSQKNNDEHFDLTGKNRLKKSDELLKWARKN 117 I + L YIW +F V +E + L ++++L +A Sbjct: 71 INIHETFLSYIWINCYYFVVLHEEKYALPNLIDKGEMESRPYSDELLSEAEDLFSYALTL 130 Query: 118 LQTTGCESWPKK-CPKPEAYLQGSEDSQV---ASEIFLCAIAWILHHEISHVVLQHPLVT 173 + TG W K+ P PE + + S +++F+ + +IL+HE +H QH Sbjct: 131 V--TGFTDWDKETLPNPEYFDEESPQGNYILHVNDLFVEVLNFILYHETAHAEFQHIKKI 188 Query: 174 TAFSTQEER------EADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFC 227 E EADS A + ++ ++ IA A+ L ++N Sbjct: 189 KEGKLSNEEIKDLEIEADSRAIELLIQ------NSKNRQKAEIAIAMGLASILFIKNSLK 242 Query: 228 LQNTHPAAYERIYSNISCYPVGNEELIEALCTVMLQYLFHGKNINV--NLDGESFSSILG 285 +THP +RI + I + I + ++ + + N + + Sbjct: 243 GGSTHPDVDQRIENAIEILQPSEDSEIWTTLCLFIKTWDRQYGLGLIENRICTTIKDVFY 302 Query: 286 DLLCDISRL 294 DLL ++ Sbjct: 303 DLLEQAKKI 311 >UniRef50_A6GL96 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GL96_9BURK Length = 309 Score = 181 bits (460), Expect = 2e-44, Method: Composition-based stats. Identities = 59/260 (22%), Positives = 113/260 (43%), Gaps = 33/260 (12%) Query: 55 SNNSHEIVLTVASLEYIWAFSNFFWVFTQE----------YSKSQKNNDEHFDLTGKNRL 104 + + EI + + L Y+WAF +V +E +S S + N+ + L Sbjct: 58 NVKTREITIYESHLAYLWAFVYSSFVLYEEGIQKPMLAGTFSGSLEFNN--------SLL 109 Query: 105 KKSDELLKWARKNLQTTGCESWP-KKCPKPEAYLQGSEDSQV--ASEIFLCAIAWILHHE 161 +++ L WA K + W + P P +E V + +FL A+ ++L HE Sbjct: 110 QRAAALQSWAIKFV--RCYSDWSIDELPNPAKVESEAEQFYVPKVNSLFLQAVNFLLFHE 167 Query: 162 ISHVVLQHPL-VTTAFSTQEEREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSL 220 H+VL H + ++ +E++AD++AT + + E ++R +G++ +L + + Sbjct: 168 YGHLVLGHVVNEDKDWTLDQEKDADNYATTFF---IEAGTNESERRFVGVSIVLLLVSCV 224 Query: 221 EVENYFC--LQNTHPAAYERIYSNISCYPVGNEE---LIEALCTVMLQYLFHGKNIN-VN 274 + Q HP ++RI + IS + EE I L ++ LQ K ++ Sbjct: 225 FIPEKISGLWQVKHPHLHDRIRNGISSLNLEEEESKFYIYYLASIALQKYLLEKGVDCAQ 284 Query: 275 LDGESFSSILGDLLCDISRL 294 L+ E+ + + L I Sbjct: 285 LEIETAEELFFEYLARIDEF 304 >UniRef50_C5VJH7 Putative uncharacterized protein n=2 Tax=Prevotella RepID=C5VJH7_9BACT Length = 337 Score = 149 bits (375), Expect = 1e-34, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 55/154 (35%), Gaps = 5/154 (3%) Query: 129 KCPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQHPLVTTAFSTQEEREADSHA 188 A + + + ++ I +IL HE SH L H L + + Q+E EAD + Sbjct: 172 NLGDFSAIDVNTPYGEKINSVYCFGICFILLHEASHFALGH-LDKVSPAIQDEVEADFSS 230 Query: 189 TKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPV 248 G++Y E +K + L + N HP ERI+ C Sbjct: 231 ----FGSIYSDISENEKFSANCGVLCALFSLLYLNPTIEPDNIHPTEDERIFKVYECIKN 286 Query: 249 GNEELIEALCTVMLQYLFHGKNINVNLDGESFSS 282 N + I L + + I+ + ++ Sbjct: 287 DNPKYIVILVQFFKYWAEIYQIIDFPPNLQNTED 320 >UniRef50_Q858S4 Putative lysogenic conversion protein n=1 Tax=Enterobacteria phage P2-EC46 RepID=Q858S4_BPP2 Length = 333 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 49/303 (16%), Positives = 97/303 (32%), Gaps = 57/303 (18%) Query: 20 APEKEQDLKTIVDDKKIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFFW 79 PE++ +L + S+ + + ++ I L W + Sbjct: 23 IPERQNELDEFWSKFNMTFQNHSDNHTDGKFI-FDAGMYRFIRFNHRVLRTFWIGTFAAM 81 Query: 80 VFTQEYSKSQKNNDEHF--------DLTGKNRLK----------------------KSDE 109 + + Q E F DL ++ S + Sbjct: 82 AGYEAINTEQNKKIEDFIDKRDSILDLFDSKKISLVSTESELHVLLEELQEELADFSSAD 141 Query: 110 L--LKWARKNLQTTGCESWPKK------CPKPEAYLQGSED--SQVASEIFLCAIAWILH 159 + + T + P + P+P + + +E+ + A AW Sbjct: 142 FVNFEQLICAFENTAKDEVPDEKPLPIGIPEPGTMPDSKIEPVKRATAELAIIAAAWAFL 201 Query: 160 HEISHVVLQHPL-------VTTAFSTQEEREADSHATKWILGNLYESAPEL--------K 204 HE+ H++ Q T + EE D ATK+IL ++ E + Sbjct: 202 HEVRHIIHQQEGTSYDMNEFTQEQAHNEEFSCDEFATKFILDHIDNYCEESMYDRVLVSR 261 Query: 205 KRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPVGNEELIEALCTVMLQY 264 KR L I A+ + L +N + +HP+ +RI + ++E++E + M + Sbjct: 262 KRRLSIYCALFSVTMLG-KNNWGFSKSHPSLQDRINKVKALMKEPDDEVLEYIVETMFKS 320 Query: 265 LFH 267 + Sbjct: 321 IAR 323 >UniRef50_A9A7F0 Putative uncharacterized protein n=1 Tax=Methanococcus maripaludis C6 RepID=A9A7F0_METM6 Length = 353 Score = 46.4 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 43/126 (34%), Gaps = 22/126 (17%) Query: 149 IFLCAIAWILHHEISHVVLQH------------PLVTTA--FSTQEEREADSHATKWILG 194 + ++L HE +H+ L H + FS Q E EAD A +WI+ Sbjct: 188 LACAQEIFLLAHEYAHIYLNHLDTVSSFNPVDGSINIKEYCFSKQREFEADLQAIRWIIN 247 Query: 195 NLYESAPELKKRAL----GIATAV---LCIQSLEVENYFCLQNTHPAAYERIYSNISCYP 247 + K+ L I+ V ++E+ THP + ER+ Sbjct: 248 FRNRISNTDKENILMITKNISLVVELFFLFYAIELNCNIT-SETHPKSKERLQYIYENIK 306 Query: 248 VGNEEL 253 E Sbjct: 307 NELSEY 312 >UniRef50_UPI0001C41EDC peptidase M48 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C41EDC Length = 407 Score = 45.2 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 67/234 (28%), Gaps = 35/234 (14%) Query: 43 EPGFNIRVRKNESNNSHEIVLTVASLEYIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKN 102 + ++ KN + + +L+ + + + +N L Sbjct: 63 DSKNDLDSNKNINPFKQKDDDFDKALDKNYLEEYNTQYYYSISQCTSLDNSSQGRLVSNI 122 Query: 103 RLKKSDELLKWARKNL---QTTGCESWPKKCPKPE---AYLQGSEDSQVASEIFLCA--- 153 L + L + K W A+ V S I A Sbjct: 123 LLNLIESLENYLLKIARIDYINSYFDWDFHLLNSSFENAFCMPGGKVLVYSGILSIADNE 182 Query: 154 --IAWILHHEISHVVLQHPLVT--------------TAFSTQEEREADSHATKWILGNLY 197 IA+IL HE++H +L HP FS +E EAD A + Y Sbjct: 183 ERIAYILAHEMAHALLNHPRDYIIVKDKEYNNVLLFKPFSLNQEFEADRLAMMILKWADY 242 Query: 198 ESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYERIYSNISCYPVGNE 251 + C N F +THP + +R+ + S N Sbjct: 243 DIGN----------IPSFCQFLRGNYNKFNYFSTHPLSDDRLMNMESLIAEINN 286 >UniRef50_B2IFU6 Putative uncharacterized protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IFU6_BEII9 Length = 378 Score = 44.9 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 68/212 (32%), Gaps = 47/212 (22%) Query: 15 SPFKIAPEK-EQDLKTIVDDK----KIIISVVSEPGFNIRVRKNESNNSHEIVLTVASLE 69 S F++ ++ L ++ D+ + FN ++ + I + Sbjct: 34 SVFQVVSDRCYARLASMADEYEAPGSVAFGFAEHREFNAFAQRGRMD---VIGFYTTVVR 90 Query: 70 YIWAFSNFFWVFTQEYSKSQKNNDEHFDLTGKNRLKKSDELLKWARKNLQTTGCESWPKK 129 +W+ +N + + D G+ + ++ L + R P Sbjct: 91 VMWSVTNAMMGIREMFPWIDD-----VDQLGEEQAPHANGELFFVR-----------PPD 134 Query: 130 CPKPEAYLQGSEDSQVASEIFLCAIAWILHHEISHVVLQH--------PLVTTAFSTQE- 180 P P ++A+ +F A+ + L HE++H+ H P E Sbjct: 135 QPAPNFEPV---RGRLATALFDVAMDFTLMHELAHLWNGHVELLHRMSPKPIQEMHLNEG 191 Query: 181 -----------EREADSHATKWILGNLYESAP 201 E +ADS A + + ++ P Sbjct: 192 ECLDLPLMQALEFDADSFAIQKVFARVHRENP 223 >UniRef50_C7R141 Putative uncharacterized protein n=1 Tax=Jonesia denitrificans DSM 20603 RepID=C7R141_JONDD Length = 126 Score = 42.9 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 14/47 (29%), Positives = 24/47 (51%) Query: 157 ILHHEISHVVLQHPLVTTAFSTQEEREADSHATKWILGNLYESAPEL 203 +L HE+ HV H L T S ++E AD +A + ++ + + E Sbjct: 48 VLAHELGHVHYGHDLRTRHDSPRDETRADMYAARLLIEPIEYAIAES 94 >UniRef50_A1ZNT8 Peptidase, M48 family n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZNT8_9SPHI Length = 285 Score = 42.6 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 27/138 (19%) Query: 128 KKCPKPEAYLQGSEDSQVASEIFLCAIAW----------ILHHEISHVVLQHPLVTTAFS 177 A++Q + +F+ A+ + +L HE+ H H L + Sbjct: 65 NGINNAYAHIQNGKRFITYDNLFVEALDYQTGTKWASVSVLAHEVGHHYFDHVLDREGST 124 Query: 178 TQEEREADSHATKWILGNLYESAPELKKRALGIATAVLCIQSLEVENYFCLQNTHPAAYE 237 +E EAD + ++L + S + K +A ++HP + Sbjct: 125 HSKELEADYFS-GYVLAKMGASIAQAKAAMAKLA-------------NPYGSHSHPPRNQ 170 Query: 238 R---IYSNISCYPVGNEE 252 R I + + Sbjct: 171 RLTAIEKGYNTVKPRKKS 188 >UniRef50_C4Z256 Putative uncharacterized protein n=3 Tax=Bacteria RepID=C4Z256_EUBE2 Length = 350 Score = 42.6 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 21/38 (55%) Query: 156 WILHHEISHVVLQHPLVTTAFSTQEEREADSHATKWIL 193 + L HE+ H++L H + Q+E++AD + ++ Sbjct: 254 FSLFHELGHIILGHVGKNGGTTEQDEQDADVWSRDELI 291 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.117 0.274 Lambda K H 0.267 0.0362 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,205,060,966 Number of Sequences: 3077464 Number of extensions: 37227861 Number of successful extensions: 117228 Number of sequences better than 1.0e-01: 17 Number of HSP's better than 0.1 without gapping: 22 Number of HSP's successfully gapped in prelim test: 15 Number of HSP's that attempted gapping in prelim test: 117136 Number of HSP's gapped (non-prelim): 39 length of query: 297 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 169 effective length of database: 646,480,964 effective search space: 109255282916 effective search space used: 109255282916 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 92 (40.2 bits)