BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (357 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P52129 Uncharacterized protein yfjN n=5 Tax=Gammaproteo... 744 0.0 UniRef50_A3JE92 CP4-57 prophage; RNase LS n=1 Tax=Marinobacter s... 264 5e-69 UniRef50_Q6LR54 Hypothetical cell division protein n=1 Tax=Photo... 167 4e-40 UniRef50_O82881 Plasmid pOSAK1 DNA, complete sequence n=1 Tax=Es... 77 1e-12 UniRef50_O69417 Putative uncharacterized protein n=1 Tax=Escheri... 61 6e-08 UniRef50_C5VRV1 Ribonuclease HI n=1 Tax=Clostridium botulinum D ... 61 6e-08 UniRef50_B9K8R5 Ribonuclease H n=5 Tax=Thermotogaceae RepID=B9K8... 53 1e-05 UniRef50_B3PN28 Ribonuclease HI n=1 Tax=Mycoplasma arthritidis 1... 52 3e-05 UniRef50_Q6AIF4 Putative uncharacterized protein n=1 Tax=Desulfo... 49 3e-04 >UniRef50_P52129 Uncharacterized protein yfjN n=5 Tax=Gammaproteobacteria RepID=YFJN_ECOLI Length = 357 Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust. Identities = 357/357 (100%), Positives = 357/357 (100%) Query: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI Sbjct: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 Query: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA Sbjct: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 Query: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA Sbjct: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 Query: 181 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA 240 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA Sbjct: 181 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA 240 Query: 241 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR 300 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR Sbjct: 241 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR 300 Query: 301 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV Sbjct: 301 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 >UniRef50_A3JE92 CP4-57 prophage; RNase LS n=1 Tax=Marinobacter sp. ELB17 RepID=A3JE92_9ALTE Length = 358 Score = 264 bits (674), Expect = 5e-69, Method: Compositional matrix adjust. Identities = 146/362 (40%), Positives = 211/362 (58%), Gaps = 17/362 (4%) Query: 3 IRSYKNLNLVRANIETESRQFIENK----NYSIQSIGPMPGSRAGLRVVFTRPGVNLATV 58 + Y++LNL R ++ F+ + + IQ++ R RV F +PG A V Sbjct: 1 MSDYRDLNLNREVLDENIGSFLGSYGCVLDDKIQTL-----DRGKRRVAFGKPGAEFAMV 55 Query: 59 DIFYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLEL 118 D+ N G+TTIQ+ G N+ LG++LA +L TI+PAEFE VN L+G S P+L Sbjct: 56 DLHLNNTGTTTIQWKLGKNQPLGEKLAAYLKSTIDPAEFESVNYSLKGISTGSFDPILGC 115 Query: 119 SADESHIE---FREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFT 175 A+ IE R+ + + I+ +QD+LT++ H +T LQIQG+PLSCYR Sbjct: 116 IAELDDIEVVVLRDEVKCKQVTLKSIV---HQDKLTLTHHRSTRVLQIQGKPLSCYRRVV 172 Query: 176 FNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLC 235 F L LLDL+GL +VL R++D A IV++E+A YL+TV +Y HL + +KLL S C Sbjct: 173 FMLIDLLDLKGLTQVLYRKDDNSAEIVRKEMAEDYLKTVFTRSYDHLPDSVKKLLTSSCC 232 Query: 236 VKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFGTYFDKPAAHYILKPQ 294 +KLA+P LPDYC+LLYP+LR +EGVLK MSG M V+ GFG +FD LK + Sbjct: 233 IKLASPQLPDYCLLLYPDLRALEGVLKELMSGYNMSVEDAEYGFGNFFDNRGGVCSLKAE 292 Query: 295 FAATLRPEQI-NIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKD 353 ++A + + + + Y+F+ RH+LFHME D SRMI + + + + A+ I Sbjct: 293 YSAQVAHTHMEDAFNRGYSFYKKHRHTLFHMEEFADGSRMIDTLDKAISLSKDAYEAIDS 352 Query: 354 LY 355 LY Sbjct: 353 LY 354 >UniRef50_Q6LR54 Hypothetical cell division protein n=1 Tax=Photobacterium profundum RepID=Q6LR54_PHOPR Length = 362 Score = 167 bits (424), Expect = 4e-40, Method: Compositional matrix adjust. Identities = 103/315 (32%), Positives = 158/315 (50%), Gaps = 8/315 (2%) Query: 48 FTRPGVNLATVDIFYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGF 107 F + G ++ATV ++ G TI Y TG N +LG+ D L + A+ + N+VL+G Sbjct: 43 FIKTGQDVATVIVYIKQGGLVTITYKTGKNHALGKVFHDFLEQKCESADANKANLVLKGM 102 Query: 108 VETSVLPVLELSADE---SHIEFREHSRNAHTVVWK--IISTSYQDELTVSLHITTGKLQ 162 + VL L DE F + N + K I ++D + + H TT KLQ Sbjct: 103 SSDEIEFVLALMGDELVEGEKAFSITTANPTPICQKYTITCDKFKDNIVLLYHTTTYKLQ 162 Query: 163 IQGRPLSCYRVFTFNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHL 222 IQGR L CYR ++L+ALLD Q L V+ ++ ++ +EVA Y++ + +A+ L Sbjct: 163 IQGRALFCYRTLCYHLSALLDQQSLLAVVEKKSAEDKVVLHEEVASIYIKKALPNAFERL 222 Query: 223 HVTAEKLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFGTY 281 T LL S CVKLA+P L +Y ML+YP+LR +EGV+K M+ + G Y Sbjct: 223 DDTYRSLLSSSYCVKLASPSLSEYSMLIYPDLRVLEGVIKEAMAKNDLYTSSEGIDIGEY 282 Query: 282 FDKPAAHYILKPQFAATLRPE-QINIISTAYTFFNVERHSLFHMETVVDASRMISDMARL 340 F LK ++ + + E +I + Y +F RHSLFHM+ SR + + Sbjct: 283 F-THGRQTELKTEYNSNFQSESEIRCLEQCYAYFKAHRHSLFHMDESGYESRTTDTIGEV 341 Query: 341 MGKATRAWGIIKDLY 355 M + + +I +Y Sbjct: 342 MQMSEKIAELIDAMY 356 >UniRef50_O82881 Plasmid pOSAK1 DNA, complete sequence n=1 Tax=Escherichia coli O157:H7 RepID=O82881_ECO57 Length = 346 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 56/276 (20%), Positives = 126/276 (45%), Gaps = 7/276 (2%) Query: 66 GSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADESHI 125 G+TTI +G N + E+A + E ++ + + F + + E ++E Sbjct: 58 GNTTIGRASGQNNTYFDEIALIIKENCLYSDTKNFEYTIPKFSDDDRANLFEFLSEEGIT 117 Query: 126 EFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDLQ 185 +++ + + I++TS D + ++ G +Q QG+ L + + ++L++ Sbjct: 118 ITEDNNNDPNCKHQYIMTTSNGDRVRAKIY-KRGSIQFQGKYLQIASLINDFMCSILNM- 175 Query: 186 GLEKVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAAPDLP 244 K ++ Q++ + N+ +++E + L + + + +H +K L L +K ++ Sbjct: 176 ---KEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIKKQLSCSLIMKKIDVEME 232 Query: 245 DYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLRPEQI 304 DY + LR IEG + ++ + P G YF + YI++ T+ E Sbjct: 233 DYSTYCFSALRAIEGFIYQILNDVCNP-SSSKNLGEYFTENKPKYIIREIHQETINGEIA 291 Query: 305 NIISTAYTFFNVERHSLFHMETVVDASRMISDMARL 340 ++ YT+++ RH LFHM+ + ++ I+ + + Sbjct: 292 EVLCECYTYWHENRHGLFHMKPGIADTKTINKLESI 327 >UniRef50_O69417 Putative uncharacterized protein n=1 Tax=Escherichia coli RepID=O69417_ECOLX Length = 209 Score = 61.2 bits (147), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 38/171 (22%), Positives = 81/171 (47%), Gaps = 6/171 (3%) Query: 175 TFNLAALLDLQGLE----KVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAEKL 229 TF +A+L++ K ++ Q++ + N+ +++E + L + + + +H +K Sbjct: 21 TFQIASLINDFMCSILNMKEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIKKQ 80 Query: 230 LVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHY 289 L L +K ++ DY + LR IEG + ++ + P G YF + Y Sbjct: 81 LSCSLIMKKIDVEMEDYSTYCFSALRAIEGFIYQILNDVCNP-SSSKNLGEYFTENKPKY 139 Query: 290 ILKPQFAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARL 340 I++ T+ E ++ YT+++ RH LFHM+ + ++ I+ + + Sbjct: 140 IIREIHQETINGEIAEVLCECYTYWHENRHGLFHMKPGIADTKTINKLESI 190 >UniRef50_C5VRV1 Ribonuclease HI n=1 Tax=Clostridium botulinum D str. 1873 RepID=C5VRV1_CLOBO Length = 480 Score = 61.2 bits (147), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 53/225 (23%), Positives = 96/225 (42%), Gaps = 27/225 (12%) Query: 141 IISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDLQGLEKVLIRQEDGKAN 200 I + + L VS +I K+ I G + T + LL+++ + L D + Sbjct: 259 IFNNEKKQRLIVSSYIDKNKVYINGEKEELFNRLTSYIVELLEIEDIPNFLNTVHDLQ-- 316 Query: 201 IVQQEVARTYLQTVMADAYP--------HLHVTAEKLLVSGLCVKLAAPDLPDYCMLLYP 252 + ++V + + ++Y +LH L ++G + D L+ P Sbjct: 317 -IDKDVVESEFNSYFPNSYNLIPDELNNYLHQAVYNLHITG---NIYVADF-----LVEP 367 Query: 253 ELRTIEGVLKSKMSGLGMPVQQPA-GFGTYF--DKPAAHYILKPQFAATLRPEQI-NIIS 308 +R +EG+LK + +P+++ + ++F K YIL+ ++ E I N +S Sbjct: 368 AIRPLEGILKIALQENNIPIRKKQDNYDSFFVFKKNKDRYILRDKYVREDHSENILNYLS 427 Query: 309 TAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKD 353 YT+FN RH+L H D + D R++ A IIKD Sbjct: 428 ECYTYFNKNRHTLLHW----DNPKNELDTTRILTTVQEAHTIIKD 468 >UniRef50_B9K8R5 Ribonuclease H n=5 Tax=Thermotogaceae RepID=B9K8R5_THENN Length = 530 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 47/227 (20%), Positives = 102/227 (44%), Gaps = 13/227 (5%) Query: 137 VVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDLQGLEKVLIRQED 196 V+ K+ + +++ELT++ ++ TG LQIQG+ ++ L+ + + + R Sbjct: 298 VMVKVKNKYFKEELTLTYYMNTGNLQIQGKAHDVFKNVQLFLSEFESAERYKNFIKRI-- 355 Query: 197 GKANIVQQEVARTYLQTVMADAYPHLHVTAEKL---LVSGLCVKLAAPDLP--DYCMLLY 251 I + L V AY + +E L L++ L P +P D+ + L Sbjct: 356 --YGIEDERRLEETLNRVTKGAYGS-ELISEALKNELLTAYLAYLEEPTMPFRDFSLYLV 412 Query: 252 PELRTIEGVLKSKMSGLGMPVQQPAGFGTYF--DKPAAHYILKPQFAATLR-PEQINIIS 308 P +R +E ++ + + +++ G +F K Y ++P A + ++++ Sbjct: 413 PSVRVLEAFIEMGLQMITGKIEKINRIGDFFKWSKKDNSYKIRPDCIANYNLGDLLSVLE 472 Query: 309 TAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLY 355 Y F++ RH+ H ++ + +I + +++ RA +I DL+ Sbjct: 473 KCYNFYHDYRHAYVHASSLEGHTSIIPEKSQVDDLIKRALELIGDLF 519 >UniRef50_B3PN28 Ribonuclease HI n=1 Tax=Mycoplasma arthritidis 158L3-1 RepID=B3PN28_MYCA5 Length = 487 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 39/158 (24%), Positives = 68/158 (43%), Gaps = 12/158 (7%) Query: 211 LQTVMADAYPHLHVTA-----EKLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKM 265 LQT ++ P+ + + + +L+S +C L LPDY L+ P R++E L + Sbjct: 330 LQTKFSNFLPNFSINSNDFKIKNILLSAVCGTLMKGYLPDYTYLVMPLFRSMEYYLHIIL 389 Query: 266 -SGLGMPVQQPAGFGTY----FDKPAAHYILKPQFAATLRPEQINIISTAYTFFNVERHS 320 LG + G + F+K Y L EQ+ ++ Y +N RH Sbjct: 390 GDKLGRKTTRKNGANDFCHFSFNKKTNEYEYNHSTKERLNNEQLQYLNKLYNMYNKLRHP 449 Query: 321 LFHM-ETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 FH+ + ++DAS +IS + +I Y++ Sbjct: 450 YFHLPQNLIDAS-VISKLEEAQNILVEGLKLINKFYLI 486 >UniRef50_Q6AIF4 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6AIF4_DESPS Length = 231 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 38/129 (29%), Positives = 58/129 (44%), Gaps = 15/129 (11%) Query: 60 IFYNGD-GSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSV-----L 113 IFY G TTIQ+ G N+ L E A + + + + VN +G +E Sbjct: 52 IFYPKKFGITTIQFSCGKNKELSCEKAKQIINNFDVSSAKSVNCTFKGLLEEEFGIFEEY 111 Query: 114 PVLELSADESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRV 173 + EL S I+ + + T+ + S Y D +TV+ + TTG +QGRPL + Sbjct: 112 VIEELPDISSKIQ--KDDKTKKTISY---SGKYSDTVTVTFYKTTGTTLLQGRPLPAF-- 164 Query: 174 FTFNLAALL 182 F + AL Sbjct: 165 --FEIKALF 171 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52129 Uncharacterized protein yfjN n=5 Tax=Gammaproteo... 473 e-132 UniRef50_A3JE92 CP4-57 prophage; RNase LS n=1 Tax=Marinobacter s... 404 e-111 UniRef50_Q6LR54 Hypothetical cell division protein n=1 Tax=Photo... 320 3e-86 UniRef50_O82881 Plasmid pOSAK1 DNA, complete sequence n=1 Tax=Es... 293 8e-78 UniRef50_B9K8R5 Ribonuclease H n=5 Tax=Thermotogaceae RepID=B9K8... 209 2e-52 UniRef50_O69417 Putative uncharacterized protein n=1 Tax=Escheri... 181 3e-44 UniRef50_C5VRV1 Ribonuclease HI n=1 Tax=Clostridium botulinum D ... 174 4e-42 UniRef50_B3PN28 Ribonuclease HI n=1 Tax=Mycoplasma arthritidis 1... 145 3e-33 UniRef50_Q6AIF4 Putative uncharacterized protein n=1 Tax=Desulfo... 114 5e-24 Sequences not found previously or not previously below threshold: UniRef50_Q03453 Complete nucleotide sequence n=2 Tax=root RepID=... 158 3e-37 UniRef50_UPI000196AB20 hypothetical protein CATMIT_00854 n=1 Tax... 106 1e-21 UniRef50_Q5LEL8 Putative uncharacterized protein n=2 Tax=Bactero... 94 6e-18 UniRef50_C7IHA2 Putative uncharacterized protein n=1 Tax=Clostri... 74 7e-12 UniRef50_Q6AIF3 Putative uncharacterized protein n=1 Tax=Desulfo... 54 1e-05 UniRef50_A6TLN0 Putative uncharacterized protein n=1 Tax=Alkalip... 47 0.001 >UniRef50_P52129 Uncharacterized protein yfjN n=5 Tax=Gammaproteobacteria RepID=YFJN_ECOLI Length = 357 Score = 473 bits (1218), Expect = e-132, Method: Composition-based stats. Identities = 357/357 (100%), Positives = 357/357 (100%) Query: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI Sbjct: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 Query: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA Sbjct: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 Query: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA Sbjct: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 Query: 181 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA 240 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA Sbjct: 181 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA 240 Query: 241 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR 300 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR Sbjct: 241 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR 300 Query: 301 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV Sbjct: 301 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 >UniRef50_A3JE92 CP4-57 prophage; RNase LS n=1 Tax=Marinobacter sp. ELB17 RepID=A3JE92_9ALTE Length = 358 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 146/362 (40%), Positives = 210/362 (58%), Gaps = 17/362 (4%) Query: 3 IRSYKNLNLVRANIETESRQFIENK----NYSIQSIGPMPGSRAGLRVVFTRPGVNLATV 58 + Y++LNL R ++ F+ + + IQ++ R RV F +PG A V Sbjct: 1 MSDYRDLNLNREVLDENIGSFLGSYGCVLDDKIQTL-----DRGKRRVAFGKPGAEFAMV 55 Query: 59 DIFYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLEL 118 D+ N G+TTIQ+ G N+ LG++LA +L TI+PAEFE VN L+G S P+L Sbjct: 56 DLHLNNTGTTTIQWKLGKNQPLGEKLAAYLKSTIDPAEFESVNYSLKGISTGSFDPILGC 115 Query: 119 SADESHIE---FREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFT 175 A+ IE R+ + + I+ +QD+LT++ H +T LQIQG+PLSCYR Sbjct: 116 IAELDDIEVVVLRDEVKCKQVTLKSIV---HQDKLTLTHHRSTRVLQIQGKPLSCYRRVV 172 Query: 176 FNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLC 235 F L LLDL+GL +VL R++D A IV++E+A YL+TV +Y HL + +KLL S C Sbjct: 173 FMLIDLLDLKGLTQVLYRKDDNSAEIVRKEMAEDYLKTVFTRSYDHLPDSVKKLLTSSCC 232 Query: 236 VKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFGTYFDKPAAHYILKPQ 294 +KLA+P LPDYC+LLYP+LR +EGVLK MSG M V+ GFG +FD LK + Sbjct: 233 IKLASPQLPDYCLLLYPDLRALEGVLKELMSGYNMSVEDAEYGFGNFFDNRGGVCSLKAE 292 Query: 295 FAATLRPEQIN-IISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKD 353 ++A + + + Y+F+ RH+LFHME D SRMI + + + + A+ I Sbjct: 293 YSAQVAHTHMEDAFNRGYSFYKKHRHTLFHMEEFADGSRMIDTLDKAISLSKDAYEAIDS 352 Query: 354 LY 355 LY Sbjct: 353 LY 354 >UniRef50_Q6LR54 Hypothetical cell division protein n=1 Tax=Photobacterium profundum RepID=Q6LR54_PHOPR Length = 362 Score = 320 bits (821), Expect = 3e-86, Method: Composition-based stats. Identities = 108/354 (30%), Positives = 171/354 (48%), Gaps = 10/354 (2%) Query: 9 LNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDIFYNGDGST 68 L + + T +F+ N S+ +++ F + G ++ATV ++ G Sbjct: 6 LAMQTEKLSTYVDEFLTVHN--AISMEARDITQSKQTFKFIKTGQDVATVIVYIKQGGLV 63 Query: 69 TIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADE---SHI 125 TI Y TG N +LG+ D L + A+ + N+VL+G + VL L DE Sbjct: 64 TITYKTGKNHALGKVFHDFLEQKCESADANKANLVLKGMSSDEIEFVLALMGDELVEGEK 123 Query: 126 EFREHSRNAHTVVWK--IISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLD 183 F + N + K I ++D + + H TT KLQIQGR L CYR ++L+ALLD Sbjct: 124 AFSITTANPTPICQKYTITCDKFKDNIVLLYHTTTYKLQIQGRALFCYRTLCYHLSALLD 183 Query: 184 LQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAAPDL 243 Q L V+ ++ ++ +EVA Y++ + +A+ L T LL S CVKLA+P L Sbjct: 184 QQSLLAVVEKKSAEDKVVLHEEVASIYIKKALPNAFERLDDTYRSLLSSSYCVKLASPSL 243 Query: 244 PDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFGTYFDKPAAHYILKPQFAATLRPE 302 +Y ML+YP+LR +EGV+K M+ + G YF LK ++ + + E Sbjct: 244 SEYSMLIYPDLRVLEGVIKEAMAKNDLYTSSEGIDIGEYFT-HGRQTELKTEYNSNFQSE 302 Query: 303 -QINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLY 355 +I + Y +F RHSLFHM+ SR + +M + + +I +Y Sbjct: 303 SEIRCLEQCYAYFKAHRHSLFHMDESGYESRTTDTIGEVMQMSEKIAELIDAMY 356 >UniRef50_O82881 Plasmid pOSAK1 DNA, complete sequence n=1 Tax=Escherichia coli O157:H7 RepID=O82881_ECO57 Length = 346 Score = 293 bits (749), Expect = 8e-78, Method: Composition-based stats. Identities = 65/341 (19%), Positives = 145/341 (42%), Gaps = 15/341 (4%) Query: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 M +K LN+ IE+ Q + + + + T G++ + Sbjct: 1 MAQNPFKALNINIDKIESALTQ------NGVTNYSSNVKNERETHISGTYKGIDF--LIK 52 Query: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 G+TTI +G N + E+A + E ++ + + F + + E + Sbjct: 53 LMPSGGNTTIGRASGQNNTYFDEIALIIKENCLYSDTKNFEYTIPKFSDDDRANLFEFLS 112 Query: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 +E +++ + + I++TS D + ++ G +Q QG+ L + + + Sbjct: 113 EEGITITEDNNNDPNCKHQYIMTTSNGDRVRAKIY-KRGSIQFQGKYLQIASLINDFMCS 171 Query: 181 LLDLQGLEKVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLA 239 +L++ K ++ Q++ + N+ +++E + L + + + +H +K L L +K Sbjct: 172 ILNM----KEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIKKQLSCSLIMKKI 227 Query: 240 APDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATL 299 ++ DY + LR IEG + ++ + P G YF + YI++ T+ Sbjct: 228 DVEMEDYSTYCFSALRAIEGFIYQILNDVCNP-SSSKNLGEYFTENKPKYIIREIHQETI 286 Query: 300 RPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARL 340 E ++ YT+++ RH LFHM+ + ++ I+ + + Sbjct: 287 NGEIAEVLCECYTYWHENRHGLFHMKPGIADTKTINKLESI 327 >UniRef50_B9K8R5 Ribonuclease H n=5 Tax=Thermotogaceae RepID=B9K8R5_THENN Length = 530 Score = 209 bits (531), Expect = 2e-52, Method: Composition-based stats. Identities = 45/230 (19%), Positives = 101/230 (43%), Gaps = 11/230 (4%) Query: 133 NAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDLQGLEKVLI 192 + V+ K+ + +++ELT++ ++ TG LQIQG+ ++ L+ + + + Sbjct: 294 SQRIVMVKVKNKYFKEELTLTYYMNTGNLQIQGKAHDVFKNVQLFLSEFESAERYKNFIK 353 Query: 193 RQEDGKANIVQQEVARTYLQTVMADAYPH--LHVTAEKLLVSGLCVKLAAPDLP--DYCM 248 R I + L V AY + + L++ L P +P D+ + Sbjct: 354 RI----YGIEDERRLEETLNRVTKGAYGSELISEALKNELLTAYLAYLEEPTMPFRDFSL 409 Query: 249 LLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYF--DKPAAHYILKPQFAATLR-PEQIN 305 L P +R +E ++ + + +++ G +F K Y ++P A + ++ Sbjct: 410 YLVPSVRVLEAFIEMGLQMITGKIEKINRIGDFFKWSKKDNSYKIRPDCIANYNLGDLLS 469 Query: 306 IISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLY 355 ++ Y F++ RH+ H ++ + +I + +++ RA +I DL+ Sbjct: 470 VLEKCYNFYHDYRHAYVHASSLEGHTSIIPEKSQVDDLIKRALELIGDLF 519 >UniRef50_O69417 Putative uncharacterized protein n=1 Tax=Escherichia coli RepID=O69417_ECOLX Length = 209 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 80/173 (46%), Gaps = 6/173 (3%) Query: 169 SCYRVFTFNLAALLDLQGLEKVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAE 227 + + ++L++ K ++ Q++ + N+ +++E + L + + + +H + Sbjct: 23 QIASLINDFMCSILNM----KEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIK 78 Query: 228 KLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAA 287 K L L +K ++ DY + LR IEG + ++ + P G YF + Sbjct: 79 KQLSCSLIMKKIDVEMEDYSTYCFSALRAIEGFIYQILNDVCNP-SSSKNLGEYFTENKP 137 Query: 288 HYILKPQFAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARL 340 YI++ T+ E ++ YT+++ RH LFHM+ + ++ I+ + + Sbjct: 138 KYIIREIHQETINGEIAEVLCECYTYWHENRHGLFHMKPGIADTKTINKLESI 190 >UniRef50_C5VRV1 Ribonuclease HI n=1 Tax=Clostridium botulinum D str. 1873 RepID=C5VRV1_CLOBO Length = 480 Score = 174 bits (442), Expect = 4e-42, Method: Composition-based stats. Identities = 55/257 (21%), Positives = 105/257 (40%), Gaps = 29/257 (11%) Query: 111 SVLPVLELSADE--SHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPL 168 + +++L ++ F + I + + L VS +I K+ I G Sbjct: 227 DFVGIIDLLKEDFGDLKCFEKDIPYGKEYTLTIFNNEKKQRLIVSSYIDKNKVYINGEKE 286 Query: 169 SCYRVFTFNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYP-------- 220 + T + LL+++ + L D + + ++V + + ++Y Sbjct: 287 ELFNRLTSYIVELLEIEDIPNFLNTVHDLQ---IDKDVVESEFNSYFPNSYNLIPDELNN 343 Query: 221 HLHVTAEKLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFG 279 +LH L ++G + D L+ P +R +EG+LK + +P+++ + Sbjct: 344 YLHQAVYNLHITG---NIYVADF-----LVEPAIRPLEGILKIALQENNIPIRKKQDNYD 395 Query: 280 TYF--DKPAAHYILKPQFAATLRPEQI-NIISTAYTFFNVERHSLFHMETVVDASRMISD 336 ++F K YIL+ ++ E I N +S YT+FN RH+L H D + D Sbjct: 396 SFFVFKKNKDRYILRDKYVREDHSENILNYLSECYTYFNKNRHTLLHW----DNPKNELD 451 Query: 337 MARLMGKATRAWGIIKD 353 R++ A IIKD Sbjct: 452 TTRILTTVQEAHTIIKD 468 >UniRef50_Q03453 Complete nucleotide sequence n=2 Tax=root RepID=Q03453_9ZZZZ Length = 216 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 94/229 (41%), Gaps = 14/229 (6%) Query: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 M +K LN+ IE+ Q + + + + T G++ + Sbjct: 1 MAQNPFKALNINIDKIESALTQ------NGVTNYSSNVKNERETHISGTYKGIDF--LIK 52 Query: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 G+TTI +G N + E+A + E ++ + + F + + E + Sbjct: 53 LMPSGGNTTIGRASGQNNTYFDEIALIIKENCLYSDTKNFEYTIPKFSDDDRANLFEFLS 112 Query: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 +E +++ + + I++TS D + ++ G +Q QG+ L + + + Sbjct: 113 EEGITITEDNNNDPNCKHQYIMTTSNGDRVRAKIY-KRGSIQFQGKYLQIASLINDFMCS 171 Query: 181 LLDLQGLEKVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAEK 228 +L++ K ++ Q++ + N+ +++E + L + + + +H +K Sbjct: 172 ILNM----KEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIKK 216 >UniRef50_B3PN28 Ribonuclease HI n=1 Tax=Mycoplasma arthritidis 158L3-1 RepID=B3PN28_MYCA5 Length = 487 Score = 145 bits (365), Expect = 3e-33, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 95/239 (39%), Gaps = 9/239 (3%) Query: 125 IEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDL 184 IE N + + ++I +D++T++ + K +QG+ +++ +L Sbjct: 251 IEEGIVDENKKSYLNRLIFRLDKDKVTINCYSN-NKSYVQGKQSMLFQIIITAAIEMLPS 309 Query: 185 QGLEKVLIRQEDGKANIVQQEVARTYLQTVMAD-AYPHLHVTAEKLLVSGLCVKLAAPDL 243 + ++ + + +++E +T + + + + +L+S +C L L Sbjct: 310 E--KEAIGVLQSYYMLPIKEENLQTKFSNFLPNFSINSNDFKIKNILLSAVCGTLMKGYL 367 Query: 244 PDYCMLLYPELRTIEGVLKSKM-SGLGMPVQQPAGFGTY----FDKPAAHYILKPQFAAT 298 PDY L+ P R++E L + LG + G + F+K Y Sbjct: 368 PDYTYLVMPLFRSMEYYLHIILGDKLGRKTTRKNGANDFCHFSFNKKTNEYEYNHSTKER 427 Query: 299 LRPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 L EQ+ ++ Y +N RH FH+ + + +IS + +I Y++ Sbjct: 428 LNNEQLQYLNKLYNMYNKLRHPYFHLPQNLIDASVISKLEEAQNILVEGLKLINKFYLI 486 >UniRef50_Q6AIF4 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6AIF4_DESPS Length = 231 Score = 114 bits (285), Expect = 5e-24, Method: Composition-based stats. Identities = 48/182 (26%), Positives = 77/182 (42%), Gaps = 16/182 (8%) Query: 60 IFYNGD-GSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSV-----L 113 IFY G TTIQ+ G N+ L E A + + + + VN +G +E Sbjct: 52 IFYPKKFGITTIQFSCGKNKELSCEKAKQIINNFDVSSAKSVNCTFKGLLEEEFGIFEEY 111 Query: 114 PVLELSADESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRV 173 + EL S I+ + + T+ + S Y D +TV+ + TTG +QGRPL + Sbjct: 112 VIEELPDISSKIQ--KDDKTKKTISY---SGKYSDTVTVTFYKTTGTTLLQGRPLPAF-- 164 Query: 174 FTFNLAALLDLQGLEKVLIRQEDGKANIVQQEV-ARTYLQTVMADAYPHLHVTAEKLLVS 232 F + AL + LI + +I E L+ M +A+ L + ++V Sbjct: 165 --FEIKALFAGIVESEQLISSDKENFSIKVPETGFLPKLEGYMPNAFSFLDAKIKDIIVP 222 Query: 233 GL 234 L Sbjct: 223 SL 224 >UniRef50_UPI000196AB20 hypothetical protein CATMIT_00854 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AB20 Length = 496 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 37/263 (14%), Positives = 86/263 (32%), Gaps = 14/263 (5%) Query: 106 GFVETSVLPVLELSADESHI-------EFREHSRNAHTVVWKIISTSYQDELTVSLHITT 158 GF +++ +E+ ++ + ++I + ++ ++ ++L+ Sbjct: 236 GFSSDDWKAIVDYIDEENRKSLVNLNAIITIQTKEINETRKQLIISDFKSQVIINLYGNR 295 Query: 159 GKLQIQGRPLSCYRVFTFNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADA 218 +QG+ ++ L + V+ A + +E + ++ + Sbjct: 296 RS-YVQGKQSVLFQKIIATAIEFLSSDQI--VVETLNSYHALTISKEEVEEQFKMLLPNY 352 Query: 219 YPHLHVTAEKLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSG----LGMPVQQ 274 L+S + +PDY L+ P R E L ++ Sbjct: 353 RGSYDDKNFCNLLSATYNTMLTGYMPDYTCLVTPIFRAYEFYLHRILNEKMELDTARDNG 412 Query: 275 PAGFGTYFDKPAAHYILKPQFAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMI 334 FG + Y L +Q+ ++ YT +N RH H + + MI Sbjct: 413 TNNFGYFIKTENGSYECSSSSKNKLSNKQLTFLNDFYTNYNEVRHPYSHWSSEDYDTAMI 472 Query: 335 SDMARLMGKATRAWGIIKDLYIV 357 + + + +I YI+ Sbjct: 473 DSIEKARALLEKGLNLIDKYYIL 495 >UniRef50_Q5LEL8 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=Q5LEL8_BACFN Length = 349 Score = 94.4 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 51/352 (14%), Positives = 116/352 (32%), Gaps = 13/352 (3%) Query: 7 KNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDIFYNGDG 66 K +L NI+ + FI K SI L+ + ++ F NG Sbjct: 3 KKHSLSVENIDQVAIDFIATKPS--YSIKITDCQEGKLKKIAITHNKETGILNCFINGGQ 60 Query: 67 STTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADESHIE 126 + + + +E + + + + ++ + +G E +++ ++ IE Sbjct: 61 VSYSTQGKAHLKGICEECWNVILQNTSIPCPDKKSFTAKGISEEDFDAFIDVLSESDEIE 120 Query: 127 --FREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDL 184 N + Y ++++ + G L +QG + Y + + Sbjct: 121 ITTVNTDNNPAIRNQYHLKGKYDAKVSIIFY-NNGTLFLQGAVTAFYIELITEIMETIS- 178 Query: 185 QGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAAPDLP 244 + ++ + + V L E L + + + + + Sbjct: 179 -SVPTEVME-DFLAIQPLVGCVIEKDLNKHFTKTENIEGSILEDFLKTSIALANSGVVVD 236 Query: 245 DYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFD--KPAAHYILKPQFAATLRPE 302 DY + ++ ++G++ ++ +GTYF+ K ++ L+ P Sbjct: 237 DYGCYTFGIMKALDGLISKRLLED---APDFKDYGTYFERGKDGNYHFLENVGTYNGNPS 293 Query: 303 QINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDL 354 + AY F+N RH+ FH++ + + II DL Sbjct: 294 LKRALEKAYDFYNKNRHTTFHIDRRNLETSRTLYYDEAVNIIKDGLVIINDL 345 >UniRef50_C7IHA2 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IHA2_9CLOT Length = 369 Score = 74.0 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 53/326 (16%), Positives = 106/326 (32%), Gaps = 27/326 (8%) Query: 54 NLATVDIFYNGDGSTTIQYLTGANRSLGQELADHL---FETINPAEFEQVNM--VLQGFV 108 ++ + I+ DG +Y G G+EL + I + +L F Sbjct: 46 DVGIIAIYITKDGFMNPKYKFGKMHEQGRELLEIAVGDKSNIERVKTTYAKYENLLTKFS 105 Query: 109 ETSVLPVLELSADESHIEFREHSRNAHTVVWKIIS-TSYQDELTVSLHITTGKLQIQGRP 167 E E + H ++++ +D V++H L +QG+ Sbjct: 106 SNE-QFTEEF-KQYIENELGGTIQLKHATLYEVFYWDIIKDNEKVTVHCYRTNLLLQGKN 163 Query: 168 LSCYRVFTFNLAALLDLQGLEKVLIRQE-----DGKANIVQQEVARTYLQTVMADAYPHL 222 + + LD + ++L+R +GK A L+ + AY L Sbjct: 164 NYLWDDICIWIEQKLD-SPVSEMLVRITGDENINGKIATTAINEAENILKDRLKTAYDIL 222 Query: 223 HVTAEKLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMS--------GLGMPVQQ 274 K L S +C+ L L +Y ++ P + +EG L+ ++ + V Sbjct: 223 FPHDRKFLNSAICLILYNNPLQEYSAIINPAFKGLEGYLRKMIAEKIGSRIPEVMKKVYD 282 Query: 275 PAGFGTYFDKPA---AHYILKPQFAATLRPEQINIISTAYTFFNVERHSLFHMETVVDAS 331 P ++ + Y + + + P Y + +R+ H + A+ Sbjct: 283 PKLSLSWLVEKDKYMDTYFINRNYGDSRNPSNDRAFEGLYKIYKEDRNPYSH--STGLAT 340 Query: 332 RMISDMARLMGKATRAWGIIKDLYIV 357 R + + I Y + Sbjct: 341 RTCDSIEDAKDIVDQILSAISSTYSI 366 >UniRef50_Q6AIF3 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6AIF3_DESPS Length = 104 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 22/98 (22%), Positives = 38/98 (38%), Gaps = 3/98 (3%) Query: 261 LKSKMSGLGMPVQQPAGFGTYFDKPAA--HYILKPQFAATL-RPEQINIISTAYTFFNVE 317 +K + G F+ Y L +A + R E I + +Y F+NV Sbjct: 1 MKQLLFKYYQDDFCAKRIGKIFETKDGGSTYSLNYGISAGIGRKEVIVALEESYKFWNVY 60 Query: 318 RHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLY 355 RH FH++ V+ S +I + + +I+ Y Sbjct: 61 RHPYFHVDDVIRTSTIIPTKEEAISLNVEIFALIERTY 98 >UniRef50_A6TLN0 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TLN0_ALKMQ Length = 130 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 25/123 (20%), Positives = 47/123 (38%), Gaps = 10/123 (8%) Query: 243 LPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-----GFGTYFDKPAA---HYILKPQ 294 +PDY +L+ R EG K+ + + + + G FD Y K Sbjct: 7 IPDYGILIDSTCRAFEGYFKTLLKTIDISKNREMKKSDWNSGNIFDGNRNLLSQYHHKLS 66 Query: 295 FAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDL 354 F ++ EQ+NI+S Y+ R+ + H + + I + + + +I Sbjct: 67 FDDKIKNEQLNILSEMYSLMKDLRNPISH--SGPRPTIKIPNYNDGLDQYNEIIDLINRS 124 Query: 355 YIV 357 Y + Sbjct: 125 YTL 127 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52129 Uncharacterized protein yfjN n=5 Tax=Gammaproteo... 409 e-113 UniRef50_A3JE92 CP4-57 prophage; RNase LS n=1 Tax=Marinobacter s... 359 7e-98 UniRef50_Q6LR54 Hypothetical cell division protein n=1 Tax=Photo... 322 1e-86 UniRef50_O82881 Plasmid pOSAK1 DNA, complete sequence n=1 Tax=Es... 305 2e-81 UniRef50_Q5LEL8 Putative uncharacterized protein n=2 Tax=Bactero... 285 2e-75 UniRef50_C7IHA2 Putative uncharacterized protein n=1 Tax=Clostri... 228 3e-58 UniRef50_UPI000196AB20 hypothetical protein CATMIT_00854 n=1 Tax... 203 6e-51 UniRef50_B3PN28 Ribonuclease HI n=1 Tax=Mycoplasma arthritidis 1... 201 4e-50 UniRef50_B9K8R5 Ribonuclease H n=5 Tax=Thermotogaceae RepID=B9K8... 193 1e-47 UniRef50_Q03453 Complete nucleotide sequence n=2 Tax=root RepID=... 180 7e-44 UniRef50_C5VRV1 Ribonuclease HI n=1 Tax=Clostridium botulinum D ... 173 9e-42 UniRef50_O69417 Putative uncharacterized protein n=1 Tax=Escheri... 163 9e-39 UniRef50_Q6AIF4 Putative uncharacterized protein n=1 Tax=Desulfo... 131 3e-29 UniRef50_A6TLN0 Putative uncharacterized protein n=1 Tax=Alkalip... 99 2e-19 UniRef50_Q6AIF3 Putative uncharacterized protein n=1 Tax=Desulfo... 90 9e-17 Sequences not found previously or not previously below threshold: UniRef50_B0KVC8 Putative uncharacterized protein n=1 Tax=uncultu... 46 0.002 CONVERGED! >UniRef50_P52129 Uncharacterized protein yfjN n=5 Tax=Gammaproteobacteria RepID=YFJN_ECOLI Length = 357 Score = 409 bits (1051), Expect = e-113, Method: Composition-based stats. Identities = 357/357 (100%), Positives = 357/357 (100%) Query: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI Sbjct: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 Query: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA Sbjct: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 Query: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA Sbjct: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 Query: 181 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA 240 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA Sbjct: 181 LLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA 240 Query: 241 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR 300 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR Sbjct: 241 PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLR 300 Query: 301 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV Sbjct: 301 PEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 >UniRef50_A3JE92 CP4-57 prophage; RNase LS n=1 Tax=Marinobacter sp. ELB17 RepID=A3JE92_9ALTE Length = 358 Score = 359 bits (922), Expect = 7e-98, Method: Composition-based stats. Identities = 146/363 (40%), Positives = 210/363 (57%), Gaps = 17/363 (4%) Query: 3 IRSYKNLNLVRANIETESRQFIENK----NYSIQSIGPMPGSRAGLRVVFTRPGVNLATV 58 + Y++LNL R ++ F+ + + IQ++ R RV F +PG A V Sbjct: 1 MSDYRDLNLNREVLDENIGSFLGSYGCVLDDKIQTL-----DRGKRRVAFGKPGAEFAMV 55 Query: 59 DIFYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLEL 118 D+ N G+TTIQ+ G N+ LG++LA +L TI+PAEFE VN L+G S P+L Sbjct: 56 DLHLNNTGTTTIQWKLGKNQPLGEKLAAYLKSTIDPAEFESVNYSLKGISTGSFDPILGC 115 Query: 119 SADESHIE---FREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFT 175 A+ IE R+ + + I+ +QD+LT++ H +T LQIQG+PLSCYR Sbjct: 116 IAELDDIEVVVLRDEVKCKQVTLKSIV---HQDKLTLTHHRSTRVLQIQGKPLSCYRRVV 172 Query: 176 FNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLC 235 F L LLDL+GL +VL R++D A IV++E+A YL+TV +Y HL + +KLL S C Sbjct: 173 FMLIDLLDLKGLTQVLYRKDDNSAEIVRKEMAEDYLKTVFTRSYDHLPDSVKKLLTSSCC 232 Query: 236 VKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFGTYFDKPAAHYILKPQ 294 +KLA+P LPDYC+LLYP+LR +EGVLK MSG M V+ GFG +FD LK + Sbjct: 233 IKLASPQLPDYCLLLYPDLRALEGVLKELMSGYNMSVEDAEYGFGNFFDNRGGVCSLKAE 292 Query: 295 FAATLRPEQIN-IISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKD 353 ++A + + + Y+F+ RH+LFHME D SRMI + + + + A+ I Sbjct: 293 YSAQVAHTHMEDAFNRGYSFYKKHRHTLFHMEEFADGSRMIDTLDKAISLSKDAYEAIDS 352 Query: 354 LYI 356 LY Sbjct: 353 LYT 355 >UniRef50_Q6LR54 Hypothetical cell division protein n=1 Tax=Photobacterium profundum RepID=Q6LR54_PHOPR Length = 362 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 108/355 (30%), Positives = 171/355 (48%), Gaps = 10/355 (2%) Query: 9 LNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDIFYNGDGST 68 L + + T +F+ N S+ +++ F + G ++ATV ++ G Sbjct: 6 LAMQTEKLSTYVDEFLTVHN--AISMEARDITQSKQTFKFIKTGQDVATVIVYIKQGGLV 63 Query: 69 TIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADE---SHI 125 TI Y TG N +LG+ D L + A+ + N+VL+G + VL L DE Sbjct: 64 TITYKTGKNHALGKVFHDFLEQKCESADANKANLVLKGMSSDEIEFVLALMGDELVEGEK 123 Query: 126 EFREHSRNAHTVVWK--IISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLD 183 F + N + K I ++D + + H TT KLQIQGR L CYR ++L+ALLD Sbjct: 124 AFSITTANPTPICQKYTITCDKFKDNIVLLYHTTTYKLQIQGRALFCYRTLCYHLSALLD 183 Query: 184 LQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAAPDL 243 Q L V+ ++ ++ +EVA Y++ + +A+ L T LL S CVKLA+P L Sbjct: 184 QQSLLAVVEKKSAEDKVVLHEEVASIYIKKALPNAFERLDDTYRSLLSSSYCVKLASPSL 243 Query: 244 PDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFGTYFDKPAAHYILKPQFAATLRPE 302 +Y ML+YP+LR +EGV+K M+ + G YF LK ++ + + E Sbjct: 244 SEYSMLIYPDLRVLEGVIKEAMAKNDLYTSSEGIDIGEYFT-HGRQTELKTEYNSNFQSE 302 Query: 303 -QINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYI 356 +I + Y +F RHSLFHM+ SR + +M + + +I +Y Sbjct: 303 SEIRCLEQCYAYFKAHRHSLFHMDESGYESRTTDTIGEVMQMSEKIAELIDAMYS 357 >UniRef50_O82881 Plasmid pOSAK1 DNA, complete sequence n=1 Tax=Escherichia coli O157:H7 RepID=O82881_ECO57 Length = 346 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 66/353 (18%), Positives = 147/353 (41%), Gaps = 16/353 (4%) Query: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 M +K LN+ IE+ Q + + + + T G++ + Sbjct: 1 MAQNPFKALNINIDKIESALTQ------NGVTNYSSNVKNERETHISGTYKGIDF--LIK 52 Query: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 G+TTI +G N + E+A + E ++ + + F + + E + Sbjct: 53 LMPSGGNTTIGRASGQNNTYFDEIALIIKENCLYSDTKNFEYTIPKFSDDDRANLFEFLS 112 Query: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 +E +++ + + I++TS D + ++ G +Q QG+ L + + + Sbjct: 113 EEGITITEDNNNDPNCKHQYIMTTSNGDRVRAKIY-KRGSIQFQGKYLQIASLINDFMCS 171 Query: 181 LLDLQGLEKVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLA 239 +L++ K ++ Q++ + N+ +++E + L + + + +H +K L L +K Sbjct: 172 ILNM----KEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIKKQLSCSLIMKKI 227 Query: 240 APDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATL 299 ++ DY + LR IEG + ++ + P G YF + YI++ T+ Sbjct: 228 DVEMEDYSTYCFSALRAIEGFIYQILNDVCNP-SSSKNLGEYFTENKPKYIIREIHQETI 286 Query: 300 RPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIK 352 E ++ YT+++ RH LFHM+ + ++ I+ + + +I Sbjct: 287 NGEIAEVLCECYTYWHENRHGLFHMKPGIADTKTINKLESI-AIIDTVCQLID 338 >UniRef50_Q5LEL8 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=Q5LEL8_BACFN Length = 349 Score = 285 bits (728), Expect = 2e-75, Method: Composition-based stats. Identities = 51/352 (14%), Positives = 116/352 (32%), Gaps = 13/352 (3%) Query: 7 KNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDIFYNGDG 66 K +L NI+ + FI K SI L+ + ++ F NG Sbjct: 3 KKHSLSVENIDQVAIDFIATKPS--YSIKITDCQEGKLKKIAITHNKETGILNCFINGGQ 60 Query: 67 STTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADESHIE 126 + + + +E + + + + ++ + +G E +++ ++ IE Sbjct: 61 VSYSTQGKAHLKGICEECWNVILQNTSIPCPDKKSFTAKGISEEDFDAFIDVLSESDEIE 120 Query: 127 --FREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDL 184 N + Y ++++ + G L +QG + Y + + Sbjct: 121 ITTVNTDNNPAIRNQYHLKGKYDAKVSIIFY-NNGTLFLQGAVTAFYIELITEIMETIS- 178 Query: 185 QGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAAPDLP 244 + ++ + + V L E L + + + + + Sbjct: 179 -SVPTEVME-DFLAIQPLVGCVIEKDLNKHFTKTENIEGSILEDFLKTSIALANSGVVVD 236 Query: 245 DYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFD--KPAAHYILKPQFAATLRPE 302 DY + ++ ++G++ ++ +GTYF+ K ++ L+ P Sbjct: 237 DYGCYTFGIMKALDGLISKRLLED---APDFKDYGTYFERGKDGNYHFLENVGTYNGNPS 293 Query: 303 QINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDL 354 + AY F+N RH+ FH++ + + II DL Sbjct: 294 LKRALEKAYDFYNKNRHTTFHIDRRNLETSRTLYYDEAVNIIKDGLVIINDL 345 >UniRef50_C7IHA2 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IHA2_9CLOT Length = 369 Score = 228 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 61/377 (16%), Positives = 122/377 (32%), Gaps = 33/377 (8%) Query: 3 IRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDIFY 62 + ++N+ L + I T+ K I G + F+ ++ + I+ Sbjct: 1 MEVFENIKLTKEEIITKLTD--ACKGLGISVSGVEKKNEKLYHFKFS----DVGIIAIYI 54 Query: 63 NGDGSTTIQYLTGANRSLGQELADHL---FETINPAEFEQVNM--VLQGFVETSVLPVLE 117 DG +Y G G+EL + I + +L F E Sbjct: 55 TKDGFMNPKYKFGKMHEQGRELLEIAVGDKSNIERVKTTYAKYENLLTKFSSNE-QFTEE 113 Query: 118 LSADESHIEFREHSRNAHTVVWKIIS-TSYQDELTVSLHITTGKLQIQGRPLSCYRVFTF 176 E + H ++++ +D V++H L +QG+ + Sbjct: 114 F-KQYIENELGGTIQLKHATLYEVFYWDIIKDNEKVTVHCYRTNLLLQGKNNYLWDDICI 172 Query: 177 NLAALLDLQGLEKVLIRQE-----DGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLV 231 + LD + ++L+R +GK A L+ + AY L K L Sbjct: 173 WIEQKLDS-PVSEMLVRITGDENINGKIATTAINEAENILKDRLKTAYDILFPHDRKFLN 231 Query: 232 SGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMS--------GLGMPVQQPAGFGTYFD 283 S +C+ L L +Y ++ P + +EG L+ ++ + V P ++ Sbjct: 232 SAICLILYNNPLQEYSAIINPAFKGLEGYLRKMIAEKIGSRIPEVMKKVYDPKLSLSWLV 291 Query: 284 KPA---AHYILKPQFAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARL 340 + Y + + + P Y + +R+ H + A+R + Sbjct: 292 EKDKYMDTYFINRNYGDSRNPSNDRAFEGLYKIYKEDRNPYSH--STGLATRTCDSIEDA 349 Query: 341 MGKATRAWGIIKDLYIV 357 + I Y + Sbjct: 350 KDIVDQILSAISSTYSI 366 >UniRef50_UPI000196AB20 hypothetical protein CATMIT_00854 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AB20 Length = 496 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. Identities = 37/263 (14%), Positives = 86/263 (32%), Gaps = 14/263 (5%) Query: 106 GFVETSVLPVLELSADESHI-------EFREHSRNAHTVVWKIISTSYQDELTVSLHITT 158 GF +++ +E+ ++ + ++I + ++ ++ ++L+ Sbjct: 236 GFSSDDWKAIVDYIDEENRKSLVNLNAIITIQTKEINETRKQLIISDFKSQVIINLYGNR 295 Query: 159 GKLQIQGRPLSCYRVFTFNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADA 218 +QG+ ++ L + V+ A + +E + ++ + Sbjct: 296 RS-YVQGKQSVLFQKIIATAIEFLSSDQI--VVETLNSYHALTISKEEVEEQFKMLLPNY 352 Query: 219 YPHLHVTAEKLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSG----LGMPVQQ 274 L+S + +PDY L+ P R E L ++ Sbjct: 353 RGSYDDKNFCNLLSATYNTMLTGYMPDYTCLVTPIFRAYEFYLHRILNEKMELDTARDNG 412 Query: 275 PAGFGTYFDKPAAHYILKPQFAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMI 334 FG + Y L +Q+ ++ YT +N RH H + + MI Sbjct: 413 TNNFGYFIKTENGSYECSSSSKNKLSNKQLTFLNDFYTNYNEVRHPYSHWSSEDYDTAMI 472 Query: 335 SDMARLMGKATRAWGIIKDLYIV 357 + + + +I YI+ Sbjct: 473 DSIEKARALLEKGLNLIDKYYIL 495 >UniRef50_B3PN28 Ribonuclease HI n=1 Tax=Mycoplasma arthritidis 158L3-1 RepID=B3PN28_MYCA5 Length = 487 Score = 201 bits (510), Expect = 4e-50, Method: Composition-based stats. Identities = 48/290 (16%), Positives = 112/290 (38%), Gaps = 13/290 (4%) Query: 78 RSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADESH----IEFREHSRN 133 +LA ++ +++ F + L +++ +++ IE N Sbjct: 200 NEKADQLARRALLDQGYKTYDDGSILFTSFQKQDWLDIIQKLKEKNELNILIEEGIVDEN 259 Query: 134 AHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDLQGLEKVLIR 193 + + ++I +D++T++ + K +QG+ +++ +L + ++ + Sbjct: 260 KKSYLNRLIFRLDKDKVTINCYSN-NKSYVQGKQSMLFQIIITAAIEMLPSE--KEAIGV 316 Query: 194 QEDGKANIVQQEVARTYLQTVMAD-AYPHLHVTAEKLLVSGLCVKLAAPDLPDYCMLLYP 252 + +++E +T + + + + +L+S +C L LPDY L+ P Sbjct: 317 LQSYYMLPIKEENLQTKFSNFLPNFSINSNDFKIKNILLSAVCGTLMKGYLPDYTYLVMP 376 Query: 253 ELRTIEGVLKSKM-SGLGMPVQQPAGFGTY----FDKPAAHYILKPQFAATLRPEQINII 307 R++E L + LG + G + F+K Y L EQ+ + Sbjct: 377 LFRSMEYYLHIILGDKLGRKTTRKNGANDFCHFSFNKKTNEYEYNHSTKERLNNEQLQYL 436 Query: 308 STAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV 357 + Y +N RH FH+ + + +IS + +I Y++ Sbjct: 437 NKLYNMYNKLRHPYFHLPQNLIDASVISKLEEAQNILVEGLKLINKFYLI 486 >UniRef50_B9K8R5 Ribonuclease H n=5 Tax=Thermotogaceae RepID=B9K8R5_THENN Length = 530 Score = 193 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 45/231 (19%), Positives = 101/231 (43%), Gaps = 11/231 (4%) Query: 132 RNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLDLQGLEKVL 191 + V+ K+ + +++ELT++ ++ TG LQIQG+ ++ L+ + + + Sbjct: 293 PSQRIVMVKVKNKYFKEELTLTYYMNTGNLQIQGKAHDVFKNVQLFLSEFESAERYKNFI 352 Query: 192 IRQEDGKANIVQQEVARTYLQTVMADAYPH--LHVTAEKLLVSGLCVKLAAPDLP--DYC 247 R I + L V AY + + L++ L P +P D+ Sbjct: 353 KRI----YGIEDERRLEETLNRVTKGAYGSELISEALKNELLTAYLAYLEEPTMPFRDFS 408 Query: 248 MLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFD--KPAAHYILKPQFAATLR-PEQI 304 + L P +R +E ++ + + +++ G +F K Y ++P A + + Sbjct: 409 LYLVPSVRVLEAFIEMGLQMITGKIEKINRIGDFFKWSKKDNSYKIRPDCIANYNLGDLL 468 Query: 305 NIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLY 355 +++ Y F++ RH+ H ++ + +I + +++ RA +I DL+ Sbjct: 469 SVLEKCYNFYHDYRHAYVHASSLEGHTSIIPEKSQVDDLIKRALELIGDLF 519 >UniRef50_Q03453 Complete nucleotide sequence n=2 Tax=root RepID=Q03453_9ZZZZ Length = 216 Score = 180 bits (456), Expect = 7e-44, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 94/229 (41%), Gaps = 14/229 (6%) Query: 1 MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDI 60 M +K LN+ IE+ Q + + + + T G++ + Sbjct: 1 MAQNPFKALNINIDKIESALTQ------NGVTNYSSNVKNERETHISGTYKGIDF--LIK 52 Query: 61 FYNGDGSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSA 120 G+TTI +G N + E+A + E ++ + + F + + E + Sbjct: 53 LMPSGGNTTIGRASGQNNTYFDEIALIIKENCLYSDTKNFEYTIPKFSDDDRANLFEFLS 112 Query: 121 DESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAA 180 +E +++ + + I++TS D + ++ G +Q QG+ L + + + Sbjct: 113 EEGITITEDNNNDPNCKHQYIMTTSNGDRVRAKIY-KRGSIQFQGKYLQIASLINDFMCS 171 Query: 181 LLDLQGLEKVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAEK 228 +L++ K ++ Q++ + N+ +++E + L + + + +H +K Sbjct: 172 ILNM----KEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIKK 216 >UniRef50_C5VRV1 Ribonuclease HI n=1 Tax=Clostridium botulinum D str. 1873 RepID=C5VRV1_CLOBO Length = 480 Score = 173 bits (438), Expect = 9e-42, Method: Composition-based stats. Identities = 47/256 (18%), Positives = 103/256 (40%), Gaps = 12/256 (4%) Query: 111 SVLPVLELSADE--SHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPL 168 + +++L ++ F + I + + L VS +I K+ I G Sbjct: 227 DFVGIIDLLKEDFGDLKCFEKDIPYGKEYTLTIFNNEKKQRLIVSSYIDKNKVYINGEKE 286 Query: 169 SCYRVFTFNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEK 228 + T + LL+++ + L D + + ++V + + ++Y + Sbjct: 287 ELFNRLTSYIVELLEIEDIPNFLNTVHDLQ---IDKDVVESEFNSYFPNSYNLIPDELNN 343 Query: 229 LLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-GFGTY--FDKP 285 L + ++ L+ P +R +EG+LK + +P+++ + ++ F K Sbjct: 344 YLHQAVYNLHITGNIYVADFLVEPAIRPLEGILKIALQENNIPIRKKQDNYDSFFVFKKN 403 Query: 286 AAHYILKPQFAATLRPEQI-NIISTAYTFFNVERHSLFHME---TVVDASRMISDMARLM 341 YIL+ ++ E I N +S YT+FN RH+L H + +D +R+++ + Sbjct: 404 KDRYILRDKYVREDHSENILNYLSECYTYFNKNRHTLLHWDNPKNELDTTRILTTVQEAH 463 Query: 342 GKATRAWGIIKDLYIV 357 +I Y + Sbjct: 464 TIIKDTIKLIDKYYKL 479 >UniRef50_O69417 Putative uncharacterized protein n=1 Tax=Escherichia coli RepID=O69417_ECOLX Length = 209 Score = 163 bits (412), Expect = 9e-39, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 82/185 (44%), Gaps = 7/185 (3%) Query: 169 SCYRVFTFNLAALLDLQGLEKVLIRQEDGKANI-VQQEVARTYLQTVMADAYPHLHVTAE 227 + + ++L++ K ++ Q++ + N+ +++E + L + + + +H + Sbjct: 23 QIASLINDFMCSILNM----KEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIK 78 Query: 228 KLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAA 287 K L L +K ++ DY + LR IEG + ++ + P G YF + Sbjct: 79 KQLSCSLIMKKIDVEMEDYSTYCFSALRAIEGFIYQILNDVCNP-SSSKNLGEYFTENKP 137 Query: 288 HYILKPQFAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRA 347 YI++ T+ E ++ YT+++ RH LFHM+ + ++ I+ + + Sbjct: 138 KYIIREIHQETINGEIAEVLCECYTYWHENRHGLFHMKPGIADTKTINKLESI-AIIDTV 196 Query: 348 WGIIK 352 +I Sbjct: 197 CQLID 201 >UniRef50_Q6AIF4 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6AIF4_DESPS Length = 231 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 53/232 (22%), Positives = 91/232 (39%), Gaps = 11/232 (4%) Query: 5 SYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDIFYNG 64 S+++ N+ IE ++ + Q+ + R + F + IFY Sbjct: 2 SFRDQNITIGQIE----DHLQLLSKKGQTNKCVNKGR-NVHCSFLDENGYEKCLLIFYPK 56 Query: 65 D-GSTTIQYLTGANRSLGQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADES 123 G TTIQ+ G N+ L E A + + + + VN +G +E E +E Sbjct: 57 KFGITTIQFSCGKNKELSCEKAKQIINNFDVSSAKSVNCTFKGLLEEEFGIFEEYVIEEL 116 Query: 124 HIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGKLQIQGRPLSCYRVFTFNLAALLD 183 + ++ T S Y D +TV+ + TTG +QGRPL + F + AL Sbjct: 117 PDISSKIQKDDKTKKTISYSGKYSDTVTVTFYKTTGTTLLQGRPLPAF----FEIKALFA 172 Query: 184 LQGLEKVLIRQEDGKANIVQQEV-ARTYLQTVMADAYPHLHVTAEKLLVSGL 234 + LI + +I E L+ M +A+ L + ++V L Sbjct: 173 GIVESEQLISSDKENFSIKVPETGFLPKLEGYMPNAFSFLDAKIKDIIVPSL 224 >UniRef50_A6TLN0 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TLN0_ALKMQ Length = 130 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 25/123 (20%), Positives = 47/123 (38%), Gaps = 10/123 (8%) Query: 243 LPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPA-----GFGTYFDKPAA---HYILKPQ 294 +PDY +L+ R EG K+ + + + + G FD Y K Sbjct: 7 IPDYGILIDSTCRAFEGYFKTLLKTIDISKNREMKKSDWNSGNIFDGNRNLLSQYHHKLS 66 Query: 295 FAATLRPEQINIISTAYTFFNVERHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDL 354 F ++ EQ+NI+S Y+ R+ + H + + I + + + +I Sbjct: 67 FDDKIKNEQLNILSEMYSLMKDLRNPISH--SGPRPTIKIPNYNDGLDQYNEIIDLINRS 124 Query: 355 YIV 357 Y + Sbjct: 125 YTL 127 >UniRef50_Q6AIF3 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6AIF3_DESPS Length = 104 Score = 90.2 bits (222), Expect = 9e-17, Method: Composition-based stats. Identities = 22/99 (22%), Positives = 38/99 (38%), Gaps = 3/99 (3%) Query: 261 LKSKMSGLGMPVQQPAGFGTYFDKPAA--HYILKPQFAATL-RPEQINIISTAYTFFNVE 317 +K + G F+ Y L +A + R E I + +Y F+NV Sbjct: 1 MKQLLFKYYQDDFCAKRIGKIFETKDGGSTYSLNYGISAGIGRKEVIVALEESYKFWNVY 60 Query: 318 RHSLFHMETVVDASRMISDMARLMGKATRAWGIIKDLYI 356 RH FH++ V+ S +I + + +I+ Y Sbjct: 61 RHPYFHVDDVIRTSTIIPTKEEAISLNVEIFALIERTYS 99 >UniRef50_B0KVC8 Putative uncharacterized protein n=1 Tax=uncultured candidate division WWE3 bacterium EJ0ADIGA11YD11 RepID=B0KVC8_9BACT Length = 162 Score = 45.9 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 43/151 (28%), Gaps = 13/151 (8%) Query: 208 RTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAAPDLPDYCMLLYPELRTIEGVLKSKMSG 267 + L M+ L E +L + + D+ DY +++P + EG LK Sbjct: 7 DSDLWRYMSPEIKDLIEDGESIL-TFVYKNKDKADISDYSFIVFPFAKAYEGFLKKFFLD 65 Query: 268 LGMPVQQPA-----GFGTYFDKP--AAHYILKPQFAATLRPEQINIISTAYTFFNVERHS 320 + + G + + + + I + + R++ Sbjct: 66 TDLITEDEYYGDEIRIGRLLNPNYQDNTSVYNKVCNYSGGGKGIA--QRLWNTWKRGRNT 123 Query: 321 LFHMETVVDASRMISDMARLMGKATRAWGII 351 +FH + + ++ Sbjct: 124 VFHYFPHNF---RKLEYNEALDIINDIISVM 151 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.124 0.272 Lambda K H 0.267 0.0376 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,291,848,779 Number of Sequences: 3077464 Number of extensions: 40875224 Number of successful extensions: 129993 Number of sequences better than 1.0e-01: 26 Number of HSP's better than 0.1 without gapping: 32 Number of HSP's successfully gapped in prelim test: 22 Number of HSP's that attempted gapping in prelim test: 129889 Number of HSP's gapped (non-prelim): 62 length of query: 357 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 227 effective length of database: 640,326,036 effective search space: 145354010172 effective search space used: 145354010172 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 93 (40.6 bits)