BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (115 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value

UniRef50_P0A1S9 Uncharacterized protein yajD n=151 Tax=Bacteria ...   240   1e-62
UniRef50_A9R2B8 HNH endonuclease domain protein n=50 Tax=Proteob...   209   2e-53
UniRef50_A4VJW5 Restriction endonuclease n=1 Tax=Pseudomonas stu...   130   1e-29
UniRef50_Q48LB8 HNH endonuclease n=15 Tax=Proteobacteria RepID=Q...   127   1e-28
UniRef50_A8ZVU0 HNH endonuclease n=8 Tax=Proteobacteria RepID=A8...   119   3e-26
UniRef50_A1WVI6 HNH endonuclease n=14 Tax=Proteobacteria RepID=A...   106   2e-22
UniRef50_Q0EYP0 Putative uncharacterized protein n=1 Tax=Maripro...   101   6e-21
UniRef50_Q39X67 HNH endonuclease n=7 Tax=Desulfuromonadales RepI...    92   7e-18
UniRef50_B9M834 HNH endonuclease n=4 Tax=Deltaproteobacteria Rep...    90   2e-17
UniRef50_Q3IQ78 Probable HNH-type endonuclease n=1 Tax=Natronomo...
42 0.009 UniRef50_B5EW68 Putative uncharacterized protein n=1 Tax=Vibrio ... 41 0.011 UniRef50_Q76Z37 Packaging and recombination endonuclease VII n=3... 40 0.031 UniRef50_C7QYC6 HNH endonuclease n=1 Tax=Cyanothece sp. PCC 8802... 38 0.077 >UniRef50_P0A1S9 Uncharacterized protein yajD n=151 Tax=Bacteria RepID=YAJD_SALTI Length = 115 Score = 240 bits (612), Expect = 1e-62, Method: Compositional matrix adjust. Identities = 111/115 (96%), Positives = 114/115 (99%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 MAIIPKNYARLESGYREKALK++PWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN Sbjct: 1 MAIIPKNYARLESGYREKALKLFPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 WELLCLYCHDHEHSKYTEADQYG+TVIAGEDAQKDVGEA YNPFADLKAMMNKKK Sbjct: 61 WELLCLYCHDHEHSKYTEADQYGSTVIAGEDAQKDVGEATYNPFADLKAMMNKKK 115 >UniRef50_A9R2B8 HNH endonuclease domain protein n=50 Tax=Proteobacteria RepID=A9R2B8_YERPG Length = 117 Score = 209 bits (532), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 93/113 (82%), Positives = 105/113 (92%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 MA IPKNY+RLESGYREKALKI+PWVCG+CSREFVYSNLRELTVHHIDHDH NNPEDGSN Sbjct: 5 MAYIPKNYSRLESGYREKALKIFPWVCGKCSREFVYSNLRELTVHHIDHDHGNNPEDGSN 64 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNK 113 WE+LCL+CHDHEHSKYTE D YG+TV+AG+DAQKD G A +NPFA+LK++M K Sbjct: 65 WEMLCLFCHDHEHSKYTEVDLYGSTVVAGDDAQKDQGVATHNPFANLKSLMKK 117 >UniRef50_A4VJW5 Restriction endonuclease n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VJW5_PSEU5 Length = 114 Score = 130 bits (327), Expect = 1e-29, Method: Compositional matrix adjust. 
Identities = 62/108 (57%), Positives = 77/108 (71%), Gaps = 4/108 (3%) Query: 8 YARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLY 67 Y++ E GYREKALK+YPWVCGRC+REF L ELTVHH DH+H NNPEDGSNWELLCLY Sbjct: 10 YSQREQGYREKALKMYPWVCGRCAREFSGKRLSELTVHHKDHNHDNNPEDGSNWELLCLY 69 Query: 68 CHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 CHD+EHS+YT+ +QY G D + + FA+L ++ K+ Sbjct: 70 CHDNEHSRYTD-NQYQAEARPGSDLGP---KETFKAFANLADLLKGKQ 113 >UniRef50_Q48LB8 HNH endonuclease n=15 Tax=Proteobacteria RepID=Q48LB8_PSE14 Length = 124 Score = 127 bits (319), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 58/102 (56%), Positives = 72/102 (70%), Gaps = 4/102 (3%) Query: 12 ESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDH 71 E GYR+KAL++YP VCGRC+REF L ELTVHH DH+H NNP+DGSNWELLCLYCHD+ Sbjct: 25 EMGYRDKALRMYPHVCGRCAREFAGKRLSELTVHHRDHNHDNNPQDGSNWELLCLYCHDN 84 Query: 72 EHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNK 113 EHS+YT+ + G + +A +NPFA L +M K Sbjct: 85 EHSRYTDQQYFAD----GSLSTPKTAKATHNPFAALAGLMKK 122 >UniRef50_A8ZVU0 HNH endonuclease n=8 Tax=Proteobacteria RepID=A8ZVU0_DESOH Length = 128 Score = 119 bits (297), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 56/104 (53%), Positives = 70/104 (67%), Gaps = 3/104 (2%) Query: 12 ESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDH 71 E GYRE+ALK++P +CG C REF LRELTVHH DH+H NNP DGSNWELLCLYCH++ Sbjct: 28 EKGYRERALKLFPPICGHCGREFSGKRLRELTVHHKDHNHDNNPPDGSNWELLCLYCHEN 87 Query: 72 EHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 H + AD Y + G D + G + PFA L ++NKK+ Sbjct: 88 AHGRQAVADAYDPS--GGPDREPASGFG-HKPFAGLDTLLNKKE 128 >UniRef50_A1WVI6 HNH endonuclease n=14 Tax=Proteobacteria RepID=A1WVI6_HALHL Length = 130 Score = 106 bits (265), Expect = 2e-22, Method: Compositional matrix adjust. 
Identities = 61/116 (52%), Positives = 77/116 (66%), Gaps = 8/116 (6%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 +A + A+ E GYREKALK +PWVCGRC REF N+RELTVHHIDH+H +NPEDGSN Sbjct: 13 VARAQQERAQREHGYREKALKRFPWVCGRCGREFDRRNVRELTVHHIDHNHDHNPEDGSN 72 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAK---YNPFADLKAMMNK 113 WELLCLYCHDHEH + + G + +K G A ++PFA L ++ + Sbjct: 73 WELLCLYCHDHEHQR-----EIGPATGPEQPRRKGRGAAPQSTHSPFAGLGDLLGR 123 >UniRef50_Q0EYP0 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EYP0_9PROT Length = 107 Score = 101 bits (252), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 59/106 (55%), Positives = 73/106 (68%), Gaps = 10/106 (9%) Query: 12 ESG--YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCH 69 ESG YRE+ALK+YPWVC RC+REF LR LTVHH DH+H +NP DGSNWELLC+YCH Sbjct: 8 ESGDNYREQALKLYPWVCARCAREFGGKQLRMLTVHHKDHNHDHNPSDGSNWELLCIYCH 67 Query: 70 DHEHSKYTEADQYGTTVIAGEDAQKDVGEA-KYNPFADLKAMMNKK 114 D+EH +Y EAD G D Q++ A +NPFA L ++ + Sbjct: 68 DNEHQRYMEADAQG-------DIQREEPVAGTHNPFAGLDLLLKGR 106 >UniRef50_Q39X67 HNH endonuclease n=7 Tax=Desulfuromonadales RepID=Q39X67_GEOMG Length = 113 Score = 91.7 bits (226), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 39/66 (59%), Positives = 49/66 (74%) Query: 15 YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 YRE++LKI+ W+C +C REF +NL LTVHH D +H NNP DGSNWE LC++CHD EHS Sbjct: 38 YREQSLKIHGWICAKCGREFDLANLHLLTVHHRDGNHLNNPPDGSNWENLCVWCHDDEHS 97 Query: 75 KYTEAD 80 + D Sbjct: 98 RGVLGD 103 >UniRef50_B9M834 HNH endonuclease n=4 Tax=Deltaproteobacteria RepID=B9M834_GEOSF Length = 111 Score = 89.7 bits (221), Expect = 2e-17, Method: Compositional matrix adjust. 
Identities = 39/66 (59%), Positives = 48/66 (72%) Query: 15 YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 YRE++LK++ W+C +C REF NL LTVHH D +H NP DGSNWE LC+YCHD EHS Sbjct: 37 YRERSLKLHGWICAKCGREFDLDNLHLLTVHHKDGNHNYNPPDGSNWENLCVYCHDDEHS 96 Query: 75 KYTEAD 80 + AD Sbjct: 97 RSILAD 102 >UniRef50_Q3IQ78 Probable HNH-type endonuclease n=1 Tax=Natronomonas pharaonis DSM 2160 RepID=Q3IQ78_NATPD Length = 209 Score = 41.6 bits (96), Expect = 0.009, Method: Compositional matrix adjust. Identities = 28/72 (38%), Positives = 36/72 (50%), Gaps = 8/72 (11%) Query: 3 IIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWE 62 +IP A++ YR++ LK C C V N++ VHHID DH+NN D N Sbjct: 2 LIPNEKAKMCMEYRDRCLKTKGEYCHSCG---VRQNIQ---VHHIDGDHSNNGLD--NLV 53 Query: 63 LLCLYCHDHEHS 74 LC CH HS Sbjct: 54 PLCANCHSKVHS 65 >UniRef50_B5EW68 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW68_VIBFM Length = 253 Score = 41.2 bits (95), Expect = 0.011, Method: Compositional matrix adjust. Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 2/54 (3%) Query: 44 VHHIDHDHTNNPEDGSNWELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVG 97 VHHID DH NN E SN+ + C +CH EH + ++ G + A + +Q+++ Sbjct: 67 VHHIDGDHQNNKE--SNFSIRCPFCHLCEHIGWVGKNRKGVIIYAPDISQENLN 118 >UniRef50_Q76Z37 Packaging and recombination endonuclease VII n=3 Tax=T4-like viruses RepID=Q76Z37_9CAUD Length = 161 Score = 39.7 bits (91), Expect = 0.031, Method: Compositional matrix adjust. Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 9/73 (12%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDH-TNNPEDGS 59 M + PK Y +EK +C C RE +E+ +H+DHDH N P+ G Sbjct: 1 MILTPKTY----DAAKEKLYHAQNGICPLCKRELD----KEINKNHLDHDHELNGPQAGR 52 Query: 60 NWELLCLYCHDHE 72 LLC +C+ E Sbjct: 53 VRGLLCCFCNKFE 65 >UniRef50_C7QYC6 HNH endonuclease n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QYC6_CYAP0 Length = 101 Score = 38.1 bits (87), Expect = 0.077, Method: Compositional matrix adjust. 
Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 19/82 (23%) Query: 5 PKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRE-------------LTVHHIDHDH 51 PKN+ + +EKA W C +C E +N + LTVHH D+D Sbjct: 8 PKNWEEIALEVKEKA----EWTCAKCGLECFPTNYLKHTIKDKSEKAKLTLTVHHSDYDP 63 Query: 52 TNNPEDGSNWELLCLYCHDHEH 73 +NN E SN LC CH + H Sbjct: 64 SNNQE--SNLIPLCSACHLYAH 83 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0A1S9 Uncharacterized protein yajD n=151 Tax=Bacteria ... 209 3e-53 UniRef50_A9R2B8 HNH endonuclease domain protein n=50 Tax=Proteob... 202 3e-51 UniRef50_A4VJW5 Restriction endonuclease n=1 Tax=Pseudomonas stu... 168 8e-41 UniRef50_A8ZVU0 HNH endonuclease n=8 Tax=Proteobacteria RepID=A8... 166 3e-40 UniRef50_Q48LB8 HNH endonuclease n=15 Tax=Proteobacteria RepID=Q... 160 1e-38 UniRef50_A1WVI6 HNH endonuclease n=14 Tax=Proteobacteria RepID=A... 147 9e-35 UniRef50_Q0EYP0 Putative uncharacterized protein n=1 Tax=Maripro... 136 3e-31 UniRef50_B9M834 HNH endonuclease n=4 Tax=Deltaproteobacteria Rep... 124 1e-27 UniRef50_Q39X67 HNH endonuclease n=7 Tax=Desulfuromonadales RepI... 124 1e-27 Sequences not found previously or not previously below threshold: UniRef50_Q3IQ78 Probable HNH-type endonuclease n=1 Tax=Natronomo... 50 2e-05 UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 45 0.001 UniRef50_C6C5L2 HNH nuclease n=1 Tax=Dickeya dadantii Ech703 Rep... 43 0.002 UniRef50_C4I886 Putative uncharacterized protein n=1 Tax=Burkhol... 42 0.006 UniRef50_C1V6S3 HNH endonuclease n=1 Tax=Halogeometricum borinqu... 42 0.006 UniRef50_C7QYC6 HNH endonuclease n=1 Tax=Cyanothece sp. PCC 8802... 41 0.012 >UniRef50_P0A1S9 Uncharacterized protein yajD n=151 Tax=Bacteria RepID=YAJD_SALTI Length = 115 Score = 209 bits (531), Expect = 3e-53, Method: Composition-based stats. 
Identities = 111/115 (96%), Positives = 114/115 (99%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 MAIIPKNYARLESGYREKALK++PWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN Sbjct: 1 MAIIPKNYARLESGYREKALKLFPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 WELLCLYCHDHEHSKYTEADQYG+TVIAGEDAQKDVGEA YNPFADLKAMMNKKK Sbjct: 61 WELLCLYCHDHEHSKYTEADQYGSTVIAGEDAQKDVGEATYNPFADLKAMMNKKK 115 >UniRef50_A9R2B8 HNH endonuclease domain protein n=50 Tax=Proteobacteria RepID=A9R2B8_YERPG Length = 117 Score = 202 bits (513), Expect = 3e-51, Method: Composition-based stats. Identities = 93/113 (82%), Positives = 105/113 (92%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 MA IPKNY+RLESGYREKALKI+PWVCG+CSREFVYSNLRELTVHHIDHDH NNPEDGSN Sbjct: 5 MAYIPKNYSRLESGYREKALKIFPWVCGKCSREFVYSNLRELTVHHIDHDHGNNPEDGSN 64 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNK 113 WE+LCL+CHDHEHSKYTE D YG+TV+AG+DAQKD G A +NPFA+LK++M K Sbjct: 65 WEMLCLFCHDHEHSKYTEVDLYGSTVVAGDDAQKDQGVATHNPFANLKSLMKK 117 >UniRef50_A4VJW5 Restriction endonuclease n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VJW5_PSEU5 Length = 114 Score = 168 bits (424), Expect = 8e-41, Method: Composition-based stats. Identities = 62/108 (57%), Positives = 77/108 (71%), Gaps = 4/108 (3%) Query: 8 YARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLY 67 Y++ E GYREKALK+YPWVCGRC+REF L ELTVHH DH+H NNPEDGSNWELLCLY Sbjct: 10 YSQREQGYREKALKMYPWVCGRCAREFSGKRLSELTVHHKDHNHDNNPEDGSNWELLCLY 69 Query: 68 CHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 CHD+EHS+YT+ +QY G D + + FA+L ++ K+ Sbjct: 70 CHDNEHSRYTD-NQYQAEARPGSDLGP---KETFKAFANLADLLKGKQ 113 >UniRef50_A8ZVU0 HNH endonuclease n=8 Tax=Proteobacteria RepID=A8ZVU0_DESOH Length = 128 Score = 166 bits (419), Expect = 3e-40, Method: Composition-based stats. 
Identities = 57/115 (49%), Positives = 73/115 (63%), Gaps = 3/115 (2%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 +A + E GYRE+ALK++P +CG C REF LRELTVHH DH+H NNP DGSN Sbjct: 17 VADVLARRKEAEKGYRERALKLFPPICGHCGREFSGKRLRELTVHHKDHNHDNNPPDGSN 76 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 WELLCLYCH++ H + AD Y + G D + G + PFA L ++NKK+ Sbjct: 77 WELLCLYCHENAHGRQAVADAYDPS--GGPDREPASGFG-HKPFAGLDTLLNKKE 128 >UniRef50_Q48LB8 HNH endonuclease n=15 Tax=Proteobacteria RepID=Q48LB8_PSE14 Length = 124 Score = 160 bits (405), Expect = 1e-38, Method: Composition-based stats. Identities = 58/105 (55%), Positives = 73/105 (69%), Gaps = 4/105 (3%) Query: 11 LESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHD 70 E GYR+KAL++YP VCGRC+REF L ELTVHH DH+H NNP+DGSNWELLCLYCHD Sbjct: 24 REMGYRDKALRMYPHVCGRCAREFAGKRLSELTVHHRDHNHDNNPQDGSNWELLCLYCHD 83 Query: 71 HEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 +EHS+YT+ + G + +A +NPFA L +M K + Sbjct: 84 NEHSRYTDQQYFAD----GSLSTPKTAKATHNPFAALAGLMKKDE 124 >UniRef50_A1WVI6 HNH endonuclease n=14 Tax=Proteobacteria RepID=A1WVI6_HALHL Length = 130 Score = 147 bits (371), Expect = 9e-35, Method: Composition-based stats. Identities = 61/116 (52%), Positives = 77/116 (66%), Gaps = 8/116 (6%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 +A + A+ E GYREKALK +PWVCGRC REF N+RELTVHHIDH+H +NPEDGSN Sbjct: 13 VARAQQERAQREHGYREKALKRFPWVCGRCGREFDRRNVRELTVHHIDHNHDHNPEDGSN 72 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEA---KYNPFADLKAMMNK 113 WELLCLYCHDHEH + + G + +K G A ++PFA L ++ + Sbjct: 73 WELLCLYCHDHEHQR-----EIGPATGPEQPRRKGRGAAPQSTHSPFAGLGDLLGR 123 >UniRef50_Q0EYP0 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EYP0_9PROT Length = 107 Score = 136 bits (341), Expect = 3e-31, Method: Composition-based stats. 
Identities = 59/106 (55%), Positives = 73/106 (68%), Gaps = 10/106 (9%) Query: 12 ESG--YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCH 69 ESG YRE+ALK+YPWVC RC+REF LR LTVHH DH+H +NP DGSNWELLC+YCH Sbjct: 8 ESGDNYREQALKLYPWVCARCAREFGGKQLRMLTVHHKDHNHDHNPSDGSNWELLCIYCH 67 Query: 70 DHEHSKYTEADQYGTTVIAGEDAQKDVGEA-KYNPFADLKAMMNKK 114 D+EH +Y EAD G D Q++ A +NPFA L ++ + Sbjct: 68 DNEHQRYMEADAQG-------DIQREEPVAGTHNPFAGLDLLLKGR 106 >UniRef50_B9M834 HNH endonuclease n=4 Tax=Deltaproteobacteria RepID=B9M834_GEOSF Length = 111 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 39/69 (56%), Positives = 48/69 (69%) Query: 15 YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 YRE++LK++ W+C +C REF NL LTVHH D +H NP DGSNWE LC+YCHD EHS Sbjct: 37 YRERSLKLHGWICAKCGREFDLDNLHLLTVHHKDGNHNYNPPDGSNWENLCVYCHDDEHS 96 Query: 75 KYTEADQYG 83 + AD Sbjct: 97 RSILADYLQ 105 >UniRef50_Q39X67 HNH endonuclease n=7 Tax=Desulfuromonadales RepID=Q39X67_GEOMG Length = 113 Score = 124 bits (310), Expect = 1e-27, Method: Composition-based stats. Identities = 39/70 (55%), Positives = 49/70 (70%) Query: 15 YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 YRE++LKI+ W+C +C REF +NL LTVHH D +H NNP DGSNWE LC++CHD EHS Sbjct: 38 YREQSLKIHGWICAKCGREFDLANLHLLTVHHRDGNHLNNPPDGSNWENLCVWCHDDEHS 97 Query: 75 KYTEADQYGT 84 + D Sbjct: 98 RGVLGDYLND 107 >UniRef50_Q3IQ78 Probable HNH-type endonuclease n=1 Tax=Natronomonas pharaonis DSM 2160 RepID=Q3IQ78_NATPD Length = 209 Score = 50.2 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 28/76 (36%), Positives = 37/76 (48%), Gaps = 8/76 (10%) Query: 3 IIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWE 62 +IP A++ YR++ LK C C V N++ VHHID DH+NN D N Sbjct: 2 LIPNEKAKMCMEYRDRCLKTKGEYCHSCG---VRQNIQ---VHHIDGDHSNNGLD--NLV 53 Query: 63 LLCLYCHDHEHSKYTE 78 LC CH HS + Sbjct: 54 PLCANCHSKVHSGEID 69 >UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. 
PCC 7120 RepID=Q8YLU0_ANASP Length = 539 Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 21/69 (30%), Positives = 34/69 (49%), Gaps = 8/69 (11%) Query: 8 YARLES----GYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWEL 63 +++ E+ G + LK C C+ F+ ++ EL HHID +H N + N E+ Sbjct: 463 WSKRENVNYDGVTARLLKKQNHKCTECNLSFISGDIAEL--HHIDGNHDNWKPN--NLEV 518 Query: 64 LCLYCHDHE 72 L CH H+ Sbjct: 519 LHRECHQHQ 527 >UniRef50_C6C5L2 HNH nuclease n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5L2_DICDC Length = 287 Score = 43.2 bits (100), Expect = 0.002, Method: Composition-based stats. Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 7 NYARLESGYREKALKIYPWVCGRCSREFVYSNLRE-LTVHHIDHDHTNNPEDGSNWELLC 65 ++ + YRE W C +C NL + L VHH D NN SN + LC Sbjct: 205 DFNTISKQYRES----IHWHCEQCGINLKEKNLNKYLHVHHRDGQKNNNQR--SNLQALC 258 Query: 66 LYCHDHE 72 + CH + Sbjct: 259 IECHSKQ 265 >UniRef50_C4I886 Putative uncharacterized protein n=1 Tax=Burkholderia pseudomallei MSHR346 RepID=C4I886_BURPS Length = 282 Score = 42.1 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 18/69 (26%) Query: 7 NYARLESGYREKALKIYPWVCGRCS---REFVYSNLRELTVHHIDHDHTNNPEDGSNWEL 63 + + GYR C C+ R+F L VHH + +N + SN E+ Sbjct: 208 ERTKRQRGYR----------CATCNTVLRQFDSKFLH---VHHRNGQKYDNRD--SNLEV 252 Query: 64 LCLYCHDHE 72 LC+ CH E Sbjct: 253 LCIGCHAEE 261 >UniRef50_C1V6S3 HNH endonuclease n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V6S3_9EURY Length = 357 Score = 42.1 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 24/67 (35%), Positives = 28/67 (41%), Gaps = 2/67 (2%) Query: 16 REKALKIYPWVCGRCSREFVYSN-LRELTVHHIDHDHTNNPE-DGSNWELLCLYCHDHEH 73 R+ L+ Y C C R L L VHHI+ D E D N LLC CH H Sbjct: 40 RDDVLEKYKHRCQACGRRGPGKGGLATLHVHHIERDPDGMGEHDLENLTLLCRSCHSWFH 99 Query: 74 SKYTEAD 80 + T D Sbjct: 100 QQSTPED 106 >UniRef50_C7QYC6 HNH endonuclease n=1 Tax=Cyanothece sp. 
PCC 8802 RepID=C7QYC6_CYAP0 Length = 101 Score = 40.9 bits (94), Expect = 0.012, Method: Composition-based stats. Identities = 29/84 (34%), Positives = 38/84 (45%), Gaps = 19/84 (22%) Query: 3 IIPKNYARLESGYREKALKIYPWVCGRCSRE-FVYSNLRE------------LTVHHIDH 49 PKN+ + +EKA W C +C E F + L+ LTVHH D+ Sbjct: 6 RYPKNWEEIALEVKEKA----EWTCAKCGLECFPTNYLKHTIKDKSEKAKLTLTVHHSDY 61 Query: 50 DHTNNPEDGSNWELLCLYCHDHEH 73 D +NN E SN LC CH + H Sbjct: 62 DPSNNQE--SNLIPLCSACHLYAH 83 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0A1S9 Uncharacterized protein yajD n=151 Tax=Bacteria ... 186 2e-46 UniRef50_A9R2B8 HNH endonuclease domain protein n=50 Tax=Proteob... 179 3e-44 UniRef50_A8ZVU0 HNH endonuclease n=8 Tax=Proteobacteria RepID=A8... 153 2e-36 UniRef50_A4VJW5 Restriction endonuclease n=1 Tax=Pseudomonas stu... 147 9e-35 UniRef50_Q48LB8 HNH endonuclease n=15 Tax=Proteobacteria RepID=Q... 146 2e-34 UniRef50_A1WVI6 HNH endonuclease n=14 Tax=Proteobacteria RepID=A... 132 4e-30 UniRef50_Q0EYP0 Putative uncharacterized protein n=1 Tax=Maripro... 118 5e-26 UniRef50_Q39X67 HNH endonuclease n=7 Tax=Desulfuromonadales RepI... 112 3e-24 UniRef50_B9M834 HNH endonuclease n=4 Tax=Deltaproteobacteria Rep... 112 4e-24 UniRef50_Q3IQ78 Probable HNH-type endonuclease n=1 Tax=Natronomo... 90 3e-17 UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 87 2e-16 Sequences not found previously or not previously below threshold: UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8Y... 54 1e-06 UniRef50_C1V6S3 HNH endonuclease n=1 Tax=Halogeometricum borinqu... 49 4e-05 UniRef50_UPI00016600B2 HNH-type endonuclease n=1 Tax=Halobacteri... 48 1e-04 UniRef50_Q3AR05 HNH nuclease n=1 Tax=Chlorobium chlorochromatii ... 48 1e-04 UniRef50_A7I879 HNH endonuclease n=1 Tax=Candidatus Methanoregul... 
48 1e-04
UniRef50_Q6QGL2 H-N-H endonuclease TflIV n=3 Tax=T5-like viruses...    47   2e-04
UniRef50_A7FCK8 Putative type IV secretion system protein IcmJ/D...    47   2e-04
UniRef50_C6C5L2 HNH nuclease n=1 Tax=Dickeya dadantii Ech703 Rep...    47   2e-04
UniRef50_Q8DK67 Reverse transcriptase n=1 Tax=Thermosynechococcu...    46   3e-04
UniRef50_Q3BZU0 Putative IcmJ-like type IV secretion system prot...    46   4e-04
UniRef50_Q8U2D9 Putative uncharacterized protein n=1 Tax=Pyrococ...    46   5e-04
UniRef50_C7QYC6 HNH endonuclease n=1 Tax=Cyanothece sp. PCC 8802...    45   6e-04
UniRef50_C0N456 Putative uncharacterized protein n=1 Tax=Methylo...    45   7e-04
UniRef50_Q0W144 Putative uncharacterized protein n=1 Tax=uncultu...    45   9e-04
UniRef50_C4I886 Putative uncharacterized protein n=1 Tax=Burkhol...    45   0.001
UniRef50_Q73IV1 Reverse transcriptase, interruption-C n=3 Tax=Wo...    44   0.001
UniRef50_UPI0001BC7B91 HNH endonuclease n=1 Tax=Bacteroides sp. ...    44   0.002
UniRef50_Q3Z844 HNH endonuclease family protein n=1 Tax=Dehaloco...    43   0.002
UniRef50_B9LVY8 HNH endonuclease n=1 Tax=Halorubrum lacusprofund...    43   0.002
UniRef50_B4WW73 Group II intron, maturase-specific domain family...    43   0.002
UniRef50_D2S2Y4 HNH endonuclease n=1 Tax=Haloterrigena turkmenic...    43   0.002
UniRef50_B5EW68 Putative uncharacterized protein n=1 Tax=Vibrio ...    43   0.002
UniRef50_B9LX84 HNH endonuclease n=1 Tax=Halorubrum lacusprofund...    43   0.003
UniRef50_B9ZDV6 HNH endonuclease n=1 Tax=Natrialba magadii ATCC ...    43   0.003
UniRef50_A3CSG6 HNH endonuclease n=1 Tax=Methanoculleus marisnig...    42   0.005
UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular or...    42   0.005
UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU    42   0.006
UniRef50_C5B541 Putative uncharacterized protein n=1 Tax=Methylo...    42   0.007
UniRef50_C3K0B7 Putative 5-methylcytosine-specific restriction e...    42   0.008
UniRef50_D1RHL0 Putative uncharacterized protein n=1 Tax=Legione...
41 0.009
UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0...    41   0.010
UniRef50_Q2FQI6 HNH endonuclease n=1 Tax=Methanospirillum hungat...    41   0.013
UniRef50_Q6YRR0 Slr6094 protein n=6 Tax=Synechocystis sp. PCC 68...    41   0.013
UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcripta...    41   0.014
UniRef50_B1X354 Putative uncharacterized protein n=1 Tax=Cyanoth...    41   0.014
UniRef50_C8VXB8 Putative uncharacterized protein n=1 Tax=Desulfo...    41   0.016
UniRef50_A0Y4N4 Putative uncharacterized protein n=1 Tax=Alterom...    41   0.016
UniRef50_UPI0001BC350E HNH endonuclease n=1 Tax=Butyrivibrio cro...    41   0.017
UniRef50_B1QVB2 HNH endonuclease domain protein n=2 Tax=Clostrid...    40   0.018
UniRef50_Q0I2A0 Putative uncharacterized protein n=1 Tax=Haemoph...    40   0.019
UniRef50_B4SDY4 HNH nuclease n=4 Tax=Bacteria RepID=B4SDY4_PELPB       40   0.025
UniRef50_A3XZ24 Putative uncharacterized protein n=1 Tax=Vibrio ...    40   0.025
UniRef50_Q44225 ORF439 n=2 Tax=Anabaena RepID=Q44225_9NOST             40   0.029
UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=...    39   0.033
UniRef50_B7VNF7 Putative uncharacterized protein n=6 Tax=Vibrion...    39   0.035
UniRef50_B4XH92 Putative uncharacterized protein n=1 Tax=Actinob...    39   0.035
UniRef50_C1SP87 Predicted restriction endonuclease n=1 Tax=Denit...    39   0.053
UniRef50_B8IXC1 HNH endonuclease n=4 Tax=Methylobacterium RepID=...    39   0.056
UniRef50_C7LPI3 Restriction endonuclease n=1 Tax=Desulfomicrobiu...    39   0.059
UniRef50_A0YXC5 Putative uncharacterized protein n=1 Tax=Lyngbya...    39   0.060
UniRef50_D2QV00 HNH endonuclease n=1 Tax=Spirosoma linguale DSM ...    39   0.066
UniRef50_D1P9G2 5-methylcytosine-specific restriction enzyme A n...    38   0.070
UniRef50_C6X560 HNH nuclease n=1 Tax=Flavobacteriaceae bacterium...    38   0.074
UniRef50_C7N4P2 HNH endonuclease n=1 Tax=Slackia heliotrinireduc...    38   0.078
UniRef50_C6UP82 Predicted protein n=14 Tax=Escherichia coli RepI...    38   0.083
UniRef50_C9KJE0 H-N-H endonuclease F-TflIV n=1 Tax=Mitsuokella m...
38 0.093 >UniRef50_P0A1S9 Uncharacterized protein yajD n=151 Tax=Bacteria RepID=YAJD_SALTI Length = 115 Score = 186 bits (472), Expect = 2e-46, Method: Composition-based stats. Identities = 111/115 (96%), Positives = 114/115 (99%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 MAIIPKNYARLESGYREKALK++PWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN Sbjct: 1 MAIIPKNYARLESGYREKALKLFPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 WELLCLYCHDHEHSKYTEADQYG+TVIAGEDAQKDVGEA YNPFADLKAMMNKKK Sbjct: 61 WELLCLYCHDHEHSKYTEADQYGSTVIAGEDAQKDVGEATYNPFADLKAMMNKKK 115 >UniRef50_A9R2B8 HNH endonuclease domain protein n=50 Tax=Proteobacteria RepID=A9R2B8_YERPG Length = 117 Score = 179 bits (453), Expect = 3e-44, Method: Composition-based stats. Identities = 93/113 (82%), Positives = 105/113 (92%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 MA IPKNY+RLESGYREKALKI+PWVCG+CSREFVYSNLRELTVHHIDHDH NNPEDGSN Sbjct: 5 MAYIPKNYSRLESGYREKALKIFPWVCGKCSREFVYSNLRELTVHHIDHDHGNNPEDGSN 64 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNK 113 WE+LCL+CHDHEHSKYTE D YG+TV+AG+DAQKD G A +NPFA+LK++M K Sbjct: 65 WEMLCLFCHDHEHSKYTEVDLYGSTVVAGDDAQKDQGVATHNPFANLKSLMKK 117 >UniRef50_A8ZVU0 HNH endonuclease n=8 Tax=Proteobacteria RepID=A8ZVU0_DESOH Length = 128 Score = 153 bits (386), Expect = 2e-36, Method: Composition-based stats. 
Identities = 57/115 (49%), Positives = 73/115 (63%), Gaps = 3/115 (2%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 +A + E GYRE+ALK++P +CG C REF LRELTVHH DH+H NNP DGSN Sbjct: 17 VADVLARRKEAEKGYRERALKLFPPICGHCGREFSGKRLRELTVHHKDHNHDNNPPDGSN 76 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 WELLCLYCH++ H + AD Y + G D + G + PFA L ++NKK+ Sbjct: 77 WELLCLYCHENAHGRQAVADAYDPS--GGPDREPASGFG-HKPFAGLDTLLNKKE 128 >UniRef50_A4VJW5 Restriction endonuclease n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VJW5_PSEU5 Length = 114 Score = 147 bits (371), Expect = 9e-35, Method: Composition-based stats. Identities = 62/108 (57%), Positives = 77/108 (71%), Gaps = 4/108 (3%) Query: 8 YARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLY 67 Y++ E GYREKALK+YPWVCGRC+REF L ELTVHH DH+H NNPEDGSNWELLCLY Sbjct: 10 YSQREQGYREKALKMYPWVCGRCAREFSGKRLSELTVHHKDHNHDNNPEDGSNWELLCLY 69 Query: 68 CHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 CHD+EHS+YT+ +QY G D + + FA+L ++ K+ Sbjct: 70 CHDNEHSRYTD-NQYQAEARPGSDLGP---KETFKAFANLADLLKGKQ 113 >UniRef50_Q48LB8 HNH endonuclease n=15 Tax=Proteobacteria RepID=Q48LB8_PSE14 Length = 124 Score = 146 bits (369), Expect = 2e-34, Method: Composition-based stats. Identities = 58/105 (55%), Positives = 73/105 (69%), Gaps = 4/105 (3%) Query: 11 LESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHD 70 E GYR+KAL++YP VCGRC+REF L ELTVHH DH+H NNP+DGSNWELLCLYCHD Sbjct: 24 REMGYRDKALRMYPHVCGRCAREFAGKRLSELTVHHRDHNHDNNPQDGSNWELLCLYCHD 83 Query: 71 HEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 +EHS+YT+ + G + +A +NPFA L +M K + Sbjct: 84 NEHSRYTDQQYFAD----GSLSTPKTAKATHNPFAALAGLMKKDE 124 >UniRef50_A1WVI6 HNH endonuclease n=14 Tax=Proteobacteria RepID=A1WVI6_HALHL Length = 130 Score = 132 bits (331), Expect = 4e-30, Method: Composition-based stats. 
Identities = 61/117 (52%), Positives = 76/117 (64%), Gaps = 8/117 (6%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSN 60 +A + A+ E GYREKALK +PWVCGRC REF N+RELTVHHIDH+H +NPEDGSN Sbjct: 13 VARAQQERAQREHGYREKALKRFPWVCGRCGREFDRRNVRELTVHHIDHNHDHNPEDGSN 72 Query: 61 WELLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEA---KYNPFADLKAMMNKK 114 WELLCLYCHDHEH + G + +K G A ++PFA L ++ + Sbjct: 73 WELLCLYCHDHEHQREI-----GPATGPEQPRRKGRGAAPQSTHSPFAGLGDLLGRD 124 >UniRef50_Q0EYP0 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EYP0_9PROT Length = 107 Score = 118 bits (295), Expect = 5e-26, Method: Composition-based stats. Identities = 56/101 (55%), Positives = 70/101 (69%), Gaps = 8/101 (7%) Query: 15 YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 YRE+ALK+YPWVC RC+REF LR LTVHH DH+H +NP DGSNWELLC+YCHD+EH Sbjct: 13 YREQALKLYPWVCARCAREFGGKQLRMLTVHHKDHNHDHNPSDGSNWELLCIYCHDNEHQ 72 Query: 75 KYTEADQYGTTVIAGEDAQKDVGEA-KYNPFADLKAMMNKK 114 +Y EAD G D Q++ A +NPFA L ++ + Sbjct: 73 RYMEADAQG-------DIQREEPVAGTHNPFAGLDLLLKGR 106 >UniRef50_Q39X67 HNH endonuclease n=7 Tax=Desulfuromonadales RepID=Q39X67_GEOMG Length = 113 Score = 112 bits (280), Expect = 3e-24, Method: Composition-based stats. Identities = 39/70 (55%), Positives = 49/70 (70%) Query: 15 YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 YRE++LKI+ W+C +C REF +NL LTVHH D +H NNP DGSNWE LC++CHD EHS Sbjct: 38 YREQSLKIHGWICAKCGREFDLANLHLLTVHHRDGNHLNNPPDGSNWENLCVWCHDDEHS 97 Query: 75 KYTEADQYGT 84 + D Sbjct: 98 RGVLGDYLND 107 >UniRef50_B9M834 HNH endonuclease n=4 Tax=Deltaproteobacteria RepID=B9M834_GEOSF Length = 111 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. 
Identities = 39/74 (52%), Positives = 49/74 (66%) Query: 10 RLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCH 69 + YRE++LK++ W+C +C REF NL LTVHH D +H NP DGSNWE LC+YCH Sbjct: 32 KAPDNYRERSLKLHGWICAKCGREFDLDNLHLLTVHHKDGNHNYNPPDGSNWENLCVYCH 91 Query: 70 DHEHSKYTEADQYG 83 D EHS+ AD Sbjct: 92 DDEHSRSILADYLQ 105 >UniRef50_Q3IQ78 Probable HNH-type endonuclease n=1 Tax=Natronomonas pharaonis DSM 2160 RepID=Q3IQ78_NATPD Length = 209 Score = 89.5 bits (220), Expect = 3e-17, Method: Composition-based stats. Identities = 28/76 (36%), Positives = 37/76 (48%), Gaps = 8/76 (10%) Query: 3 IIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWE 62 +IP A++ YR++ LK C C V N++ VHHID DH+NN D N Sbjct: 2 LIPNEKAKMCMEYRDRCLKTKGEYCHSCG---VRQNIQ---VHHIDGDHSNNGLD--NLV 53 Query: 63 LLCLYCHDHEHSKYTE 78 LC CH HS + Sbjct: 54 PLCANCHSKVHSGEID 69 >UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YLU0_ANASP Length = 539 Score = 87.2 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 21/69 (30%), Positives = 34/69 (49%), Gaps = 8/69 (11%) Query: 8 YARLES----GYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWEL 63 +++ E+ G + LK C C+ F+ ++ EL HHID +H N + N E+ Sbjct: 463 WSKRENVNYDGVTARLLKKQNHKCTECNLSFISGDIAEL--HHIDGNHDNWKPN--NLEV 518 Query: 64 LCLYCHDHE 72 L CH H+ Sbjct: 519 LHRECHQHQ 527 >UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8YKQ2_ANASP Length = 562 Score = 54.4 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 29/70 (41%), Gaps = 4/70 (5%) Query: 6 KNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLC 65 + ++L G K +K CG C + + +HHID +H N N ++ Sbjct: 492 ERNSKLYDGETSKTIKKQNHTCGYCGLKCTSEE--RVHLHHIDGNHKNRKP--KNLIVVH 547 Query: 66 LYCHDHEHSK 75 CHD+ H Sbjct: 548 ESCHDYIHMG 557 >UniRef50_C1V6S3 HNH endonuclease n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V6S3_9EURY Length = 357 Score = 49.4 bits (116), Expect = 4e-05, Method: Composition-based stats. 
Identities = 24/67 (35%), Positives = 28/67 (41%), Gaps = 2/67 (2%) Query: 16 REKALKIYPWVCGRCSREFVYSN-LRELTVHHIDHDHTNNPE-DGSNWELLCLYCHDHEH 73 R+ L+ Y C C R L L VHHI+ D E D N LLC CH H Sbjct: 40 RDDVLEKYKHRCQACGRRGPGKGGLATLHVHHIERDPDGMGEHDLENLTLLCRSCHSWFH 99 Query: 74 SKYTEAD 80 + T D Sbjct: 100 QQSTPED 106 >UniRef50_UPI00016600B2 HNH-type endonuclease n=1 Tax=Halobacterium salinarum RepID=UPI00016600B2 Length = 145 Score = 47.5 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 18/65 (27%), Positives = 25/65 (38%), Gaps = 8/65 (12%) Query: 11 LESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHD 70 + + YR L C C E+ VHH+D D NN + N +C CH Sbjct: 1 MSATYRRVCLDTKGEECEICGTT------EEIVVHHVDGDRENNAIE--NLVPVCKSCHG 52 Query: 71 HEHSK 75 H+ Sbjct: 53 KIHTG 57 >UniRef50_Q3AR05 HNH nuclease n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3AR05_CHLCH Length = 224 Score = 47.5 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 29/70 (41%), Gaps = 6/70 (8%) Query: 11 LESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHD 70 +E+ R + K C C E ++ +HHID +H NN + N L+C CH Sbjct: 9 MENKVRAELQKQISSKCPFCENE----DVGHFQIHHIDENHDNN--EILNLILICPNCHS 62 Query: 71 HEHSKYTEAD 80 K + Sbjct: 63 KMTKKEIAQE 72 >UniRef50_A7I879 HNH endonuclease n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7I879_METB6 Length = 321 Score = 47.5 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 38/85 (44%), Gaps = 12/85 (14%) Query: 3 IIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWE 62 +I K + + K L C C ++ +++ +HH+D + +NN D N Sbjct: 1 MIKKKRTEIPADISAKVLFDSNRTCCVC-----RNDSKQVIIHHLDENPSNNKLD--NLA 53 Query: 63 LLCLYCHDHEHSKY-----TEADQY 82 +LCL CH H++ +ADQ Sbjct: 54 VLCLECHGKTHTRGGFDRKLDADQI 78 >UniRef50_Q6QGL2 H-N-H endonuclease TflIV n=3 Tax=T5-like viruses RepID=Q6QGL2_BPT5 Length = 227 Score = 47.1 bits (110), Expect = 2e-04, Method: Composition-based stats. 
Identities = 17/48 (35%), Positives = 20/48 (41%), Gaps = 2/48 (4%) Query: 22 IYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCH 69 + P C C E L + H D +H NN D N LLC CH Sbjct: 85 LKPHKCESCGLESWLDKPIPLELEHKDGNHYNNEWD--NLALLCPNCH 130 >UniRef50_A7FCK8 Putative type IV secretion system protein IcmJ/DotN n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FCK8_YERP3 Length = 239 Score = 47.1 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 19/61 (31%), Positives = 25/61 (40%), Gaps = 6/61 (9%) Query: 13 SGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHE 72 R LK C C + +L VHHI+ DH+NN +N +C CH Sbjct: 39 QEVRTSVLKRDNHTCKFC----FFKSLHYQEVHHINDDHSNNSP--TNLVTVCPLCHQVH 92 Query: 73 H 73 H Sbjct: 93 H 93 >UniRef50_C6C5L2 HNH nuclease n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5L2_DICDC Length = 287 Score = 46.7 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 21/68 (30%), Positives = 29/68 (42%), Gaps = 7/68 (10%) Query: 6 KNYARLESGYREKALKIYPWVCGRCSREFVYSNLRE-LTVHHIDHDHTNNPEDGSNWELL 64 ++ + YRE W C +C NL + L VHH D NN SN + L Sbjct: 204 NDFNTISKQYRESI----HWHCEQCGINLKEKNLNKYLHVHHRDGQKNNN--QRSNLQAL 257 Query: 65 CLYCHDHE 72 C+ CH + Sbjct: 258 CIECHSKQ 265 >UniRef50_Q8DK67 Reverse transcriptase n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DK67_THEEB Length = 317 Score = 46.4 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 27/72 (37%), Gaps = 6/72 (8%) Query: 4 IPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWEL 63 P Y R R + K +C C E L E +HHI H +D N L Sbjct: 239 APAQYRRT----RRELWKKQGGICPVCGGEIEQDMLTE--IHHILPKHKGGTDDLDNLVL 292 Query: 64 LCLYCHDHEHSK 75 + CH HS+ Sbjct: 293 IHTNCHKQVHSR 304 >UniRef50_Q3BZU0 Putative IcmJ-like type IV secretion system protein n=1 Tax=Xanthomonas campestris pv. vesicatoria str. 85-10 RepID=Q3BZU0_XANC5 Length = 250 Score = 46.0 bits (107), Expect = 4e-04, Method: Composition-based stats. 
Identities = 23/69 (33%), Positives = 29/69 (42%), Gaps = 5/69 (7%) Query: 7 NYARLESGYREKAL--KIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELL 64 + +G RE L + C C +F E VHH+D DH NN D N L Sbjct: 24 RKDKEHAGSREALLGQARFLRRCAFCDFKF-GGVADECEVHHLDGDHANNTAD--NLTLA 80 Query: 65 CLYCHDHEH 73 C+ CH H Sbjct: 81 CVLCHMPHH 89 >UniRef50_Q8U2D9 Putative uncharacterized protein n=1 Tax=Pyrococcus furiosus RepID=Q8U2D9_PYRFU Length = 326 Score = 45.6 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 24/100 (24%), Positives = 38/100 (38%), Gaps = 16/100 (16%) Query: 16 REKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHSK 75 R+ L+ + C C + L VHH+D + NN D N LC CH H Sbjct: 134 RKVVLERNNYRCSVCGYGY-------LEVHHVDGNILNNTLD--NLVTLCRRCHRKVH-- 182 Query: 76 YTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 + + ED K + + ++ + +M KK Sbjct: 183 -----YHPSFHTTPEDMDKCIRSFHHEFYSTIYEIMKNKK 217 >UniRef50_C7QYC6 HNH endonuclease n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QYC6_CYAP0 Length = 101 Score = 45.2 bits (105), Expect = 6e-04, Method: Composition-based stats. Identities = 28/84 (33%), Positives = 37/84 (44%), Gaps = 19/84 (22%) Query: 5 PKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRE-------------LTVHHIDHDH 51 PKN+ + +EKA W C +C E +N + LTVHH D+D Sbjct: 8 PKNWEEIALEVKEKA----EWTCAKCGLECFPTNYLKHTIKDKSEKAKLTLTVHHSDYDP 63 Query: 52 TNNPEDGSNWELLCLYCHDHEHSK 75 +NN E SN LC CH + H Sbjct: 64 SNNQE--SNLIPLCSACHLYAHRG 85 >UniRef50_C0N456 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N456_9GAMM Length = 230 Score = 45.2 bits (105), Expect = 7e-04, Method: Composition-based stats. 
Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 5/51 (9%) Query: 23 YPWVCGRCSREFVYSNLRELTVHHIDH-DHTNNPEDGSNWELLCLYCHDHE 72 + ++C +C + + +N R L HHI+ H N E N + LC+ CH + Sbjct: 65 FNYICQQCGLDLI-NNKRLLHTHHINGVKHDNRKE---NLKPLCVDCHSKQ 111 >UniRef50_Q0W144 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W144_UNCMA Length = 360 Score = 44.8 bits (104), Expect = 9e-04, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 24/78 (30%), Gaps = 11/78 (14%) Query: 24 PWVCGRCSREFVYSNLRELTVHHI------DHDHTNNPEDGSNWELLCLYCHDHEHSKYT 77 C +C ++ L VHHI D + N SN +LC CH S Sbjct: 26 GHKCEKCG---SICDIEVLEVHHIEPVRDADGRYDYNSP--SNLIVLCANCHKLAGSNKI 80 Query: 78 EADQYGTTVIAGEDAQKD 95 Q + K Sbjct: 81 PKIQLSDITYKRPEYLKG 98 >UniRef50_C4I886 Putative uncharacterized protein n=1 Tax=Burkholderia pseudomallei MSHR346 RepID=C4I886_BURPS Length = 282 Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 8/71 (11%) Query: 5 PKNYARLESGYREKALKIYPWVCGRCS---REFVYSNLRELTVHHIDHDHTNNPEDGSNW 61 Y++ + E+ + + C C+ R+F L VHH + +N + SN Sbjct: 196 LNVYSQDWNEISERTKRQRGYRCATCNTVLRQFDSKFLH---VHHRNGQKYDNRD--SNL 250 Query: 62 ELLCLYCHDHE 72 E+LC+ CH E Sbjct: 251 EVLCIGCHAEE 261 >UniRef50_Q73IV1 Reverse transcriptase, interruption-C n=3 Tax=Wolbachia RepID=Q73IV1_WOLPM Length = 228 Score = 44.4 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 19/58 (32%), Positives = 27/58 (46%), Gaps = 4/58 (6%) Query: 18 KALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHSK 75 K LK+ CG C F ++ E +HH D + +N N LL +CHD H + Sbjct: 173 KLLKLQQSKCGNCRLWFESDDIIE--IHHKDRNRRSNMI--KNLSLLHGHCHDELHRR 226 >UniRef50_UPI0001BC7B91 HNH endonuclease n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7B91 Length = 235 Score = 44.0 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 23/78 (29%), Positives = 30/78 (38%), Gaps = 12/78 (15%) Query: 8 YARLESGYREKALKIYPWVCGRCSREF--VYSNLRE--LTVHHI------DHDHTNNPED 57 R+ LK Y + C C F VY L + + VHHI D +H +P+ Sbjct: 142 KHERNQALRQLCLKHYGYTCQVCGMNFEAVYGKLGKNYIEVHHINPIAETDGEHVLDPKT 201 Query: 58 GSNWELLCLYCHDHEHSK 75 G LC CH H Sbjct: 202 G--LIPLCSNCHSMIHRG 217 >UniRef50_Q3Z844 HNH endonuclease family protein n=1 Tax=Dehalococcoides ethenogenes 195 RepID=Q3Z844_DEHE1 Length = 182 Score = 43.3 bits (100), Expect = 0.002, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 10/77 (12%) Query: 7 NYARLESGYREKALKIYPWVC--GRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELL 64 + + R++ L+ C C+ + +HHID +++NN D N L Sbjct: 92 RREPVPNKMRKRVLERAENRCENHDCNYTGIPE------IHHIDDNNSNN--DPRNLIAL 143 Query: 65 CLYCHDHEHSKYTEADQ 81 C CH H A Q Sbjct: 144 CRNCHGDAHHGNIIASQ 160 >UniRef50_B9LVY8 HNH endonuclease n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LVY8_HALLT Length = 355 Score = 43.3 bits (100), Expect = 0.002, Method: Composition-based stats. Identities = 23/67 (34%), Positives = 28/67 (41%), Gaps = 2/67 (2%) Query: 16 REKALKIYPWVCGRCSREF-VYSNLRELTVHHIDHD-HTNNPEDGSNWELLCLYCHDHEH 73 R+ L Y C C R L L VHHI+ D + D +N LLC CH H Sbjct: 40 RDDVLTEYWHRCQVCGRRGPEKGGLATLHVHHIERDPEGMDEHDMANLTLLCRSCHSWFH 99 Query: 74 SKYTEAD 80 + T D Sbjct: 100 QQSTPDD 106 >UniRef50_B4WW73 Group II intron, maturase-specific domain family n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WW73_9SYNE Length = 479 Score = 43.3 bits (100), Expect = 0.002, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 23/59 (38%), Gaps = 2/59 (3%) Query: 16 REKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 R+K K + C C + + L VHH + SNW LL CH H+ Sbjct: 413 RQKLAKRQKFKCPNCGESLLNGD--SLHVHHKTPRSKGGKDCFSNWLLLHKVCHQQRHA 469 >UniRef50_D2S2Y4 HNH endonuclease n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2S2Y4_9EURY Length = 221 Score = 43.3 bits (100), Expect = 0.002, Method: Composition-based stats. 
Identities = 23/84 (27%), Positives = 29/84 (34%), Gaps = 9/84 (10%) Query: 5 PKNYARLESGYREKALKIYPWVCGRCSRE---FVYSNLRELTVHHIDHDHTNNPEDGSNW 61 P AR ++ YR W C C R+ + L HHI SN Sbjct: 24 PDWEARRKTVYRHD-----NWTCQSCGRQSGPHAGNEGVRLHAHHIVPLSEGGSNRLSNL 78 Query: 62 ELLCLYCHDHEHSKYT-EADQYGT 84 E LC CH ++H D G Sbjct: 79 ETLCEPCHQNQHDHDIFTGDWVGD 102 >UniRef50_B5EW68 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW68_VIBFM Length = 253 Score = 43.3 bits (100), Expect = 0.002, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 9/90 (10%) Query: 9 ARLESGYREKALKIYPWV---CGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLC 65 A E + + +++ C +C+ VHHID DH NN E SN+ + C Sbjct: 33 AAREVFIQNRKARMHNDEELECEKCTIAMPKG----YHVHHIDGDHQNNKE--SNFSIRC 86 Query: 66 LYCHDHEHSKYTEADQYGTTVIAGEDAQKD 95 +CH EH + ++ G + A + +Q++ Sbjct: 87 PFCHLCEHIGWVGKNRKGVIIYAPDISQEN 116 >UniRef50_B9LX84 HNH endonuclease n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LX84_HALLT Length = 299 Score = 42.9 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 30/98 (30%), Gaps = 5/98 (5%) Query: 3 IIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWE 62 P ++ R+K K + C +C EL HH SN Sbjct: 4 DYPSDWNSR----RKKVYKRDNYRCQKCGSRGGSRGNTELHAHHKKPKSKGGSHRFSNLT 59 Query: 63 LLCLYCHDHEHSKYTEADQYGTTV-IAGEDAQKDVGEA 99 +C CH+ H ++ A E+ + A Sbjct: 60 TVCKSCHEDIHGHGVGGRSNSSSTNQATEELSPEELIA 97 >UniRef50_B9ZDV6 HNH endonuclease n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZDV6_NATMA Length = 282 Score = 42.9 bits (99), Expect = 0.003, Method: Composition-based stats. 
Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 5/86 (5%) Query: 14 GYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEH 73 R++ L+ + C RC ++ R L HHI P++ N LC CH H Sbjct: 61 ELRQQTLRRDNYACTRCG-----ADDRTLQAHHIVPRSAGGPDELENLLTLCRPCHGVIH 115 Query: 74 SKYTEADQYGTTVIAGEDAQKDVGEA 99 + D + A Sbjct: 116 QHNSAFDDVRDDAPLFPERTAPDPVA 141 >UniRef50_A3CSG6 HNH endonuclease n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CSG6_METMJ Length = 143 Score = 42.1 bits (97), Expect = 0.005, Method: Composition-based stats. Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 8/65 (12%) Query: 10 RLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCH 69 R R L+ C C E ++L +HHID D TN+ SN LC CH Sbjct: 64 RRWKETRSGVLERDGNRCTVCGGE------QDLHIHHIDRDPTNDVP--SNLVTLCDICH 115 Query: 70 DHEHS 74 H+ Sbjct: 116 ARVHT 120 >UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular organisms RepID=Q12UG1_METBU Length = 592 Score = 42.1 bits (97), Expect = 0.005, Method: Composition-based stats. Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 3/75 (4%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNL-RELTVHHIDHDHTNNPEDGS 59 + ++ G + K C C+ +N E +HH + +H +N + Sbjct: 505 VERKLNQGSKKLLGKFKTVWKNQKGKCPFCNLLIDINNGGEERPLHHKNGNHDDNGI--T 562 Query: 60 NWELLCLYCHDHEHS 74 N +YCH H+ Sbjct: 563 NLVYAHVYCHRQYHA 577 >UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU Length = 546 Score = 42.1 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 19/61 (31%), Positives = 27/61 (44%), Gaps = 2/61 (3%) Query: 17 EKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHSKY 76 ++ LK C C+ F+ S+ E+ H I H D N L+ +CHD HSK Sbjct: 486 KRLLKTKGPQCDMCNLYFIDSDRIEID-HIIPRSHGGT-SDWKNLRLMHGHCHDIRHSKA 543 Query: 77 T 77 Sbjct: 544 V 544 >UniRef50_C5B541 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens AM1 RepID=C5B541_METEA Length = 253 Score = 41.7 bits (96), Expect = 0.007, Method: Composition-based stats. 
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 6/54 (11%) Query: 16 REKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCH 69 R++ L + C C F +E +HH + DH +N + N + C+YCH Sbjct: 36 RQRVLDTQKYTCQYCG--FQSQKWQE--IHHRNDDHHDNRPE--NLAVACMYCH 83 >UniRef50_C3K0B7 Putative 5-methylcytosine-specific restriction enzyme n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3K0B7_PSEFS Length = 237 Score = 41.7 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 28/74 (37%), Gaps = 3/74 (4%) Query: 11 LESGYREKALKIYPWVCGRCSRE--FVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYC 68 + R LK +C C F L L VHH+ H + SN LC C Sbjct: 151 RDPKVRAWVLKEAKGICEGCGSNAPFEVDGLPFLEVHHVKHLAQKGSDRISNAVALCPNC 210 Query: 69 HDHEHSKYTEADQY 82 H H + ++ D + Sbjct: 211 HQRCH-RSSDRDAF 223 >UniRef50_D1RHL0 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RHL0_LEGLO Length = 270 Score = 41.4 bits (95), Expect = 0.009, Method: Composition-based stats. Identities = 17/70 (24%), Positives = 33/70 (47%), Gaps = 4/70 (5%) Query: 5 PKNYARLESGYREKALKIYPWVC--GRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWE 62 +Y+ + ++A + + C +C + + + L VHH+D NN + N + Sbjct: 177 LNDYSTNWTHISKEAKRKAGYKCQNSKCHIDLAGAYSQYLHVHHLDGQKNNNRK--HNLK 234 Query: 63 LLCLYCHDHE 72 +LC+ CH E Sbjct: 235 VLCVKCHADE 244 >UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0JX80_MICAN Length = 613 Score = 41.4 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 2/56 (3%) Query: 17 EKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHE 72 K LK C RC + F S+L E V HI ++ N +LL +CHD + Sbjct: 522 AKLLKKQQGKCSRCGQYFTPSDLIE--VDHILPLSLGGKDEYKNLQLLHRHCHDDK 575 >UniRef50_Q2FQI6 HNH endonuclease n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FQI6_METHJ Length = 163 Score = 41.0 bits (94), Expect = 0.013, Method: Composition-based stats. 
Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 6/79 (7%) Query: 2 AIIPKNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNW 61 A I + + R + L+ + C C + R+L+VHHI SN Sbjct: 49 ASIRGRRPQYWNVIRRQILERDGYRCQICGEQ------RDLSVHHIIPLSEGGDSTASNL 102 Query: 62 ELLCLYCHDHEHSKYTEAD 80 +LC CH H K D Sbjct: 103 RVLCHSCHQQAHGKRAVRD 121 >UniRef50_Q6YRR0 Slr6094 protein n=6 Tax=Synechocystis sp. PCC 6803 RepID=Q6YRR0_SYNY3 Length = 107 Score = 41.0 bits (94), Expect = 0.013, Method: Composition-based stats. Identities = 22/65 (33%), Positives = 27/65 (41%), Gaps = 15/65 (23%) Query: 24 PWVCGRCS------------REFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDH 71 WVC RC + LR + VHH D+D +NN SN LC CH H Sbjct: 22 GWVCERCGVQCLKPGEGKGLLKEERYRLR-MAVHHCDYDPSNNSP--SNLMALCSPCHLH 78 Query: 72 EHSKY 76 H + Sbjct: 79 YHQRQ 83 >UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcriptase) n=85 Tax=Bacteria RepID=B7K703_CYAP7 Length = 661 Score = 41.0 bits (94), Expect = 0.014, Method: Composition-based stats. Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 2/54 (3%) Query: 19 ALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHE 72 LK C C F + +L V HI ++ NW+LL +CHD + Sbjct: 526 LLKKQKGKCAHCGLFFKEGD--KLEVDHIIPKSLGGRDEYKNWQLLHRHCHDEK 577 >UniRef50_B1X354 Putative uncharacterized protein n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X354_CYAA5 Length = 100 Score = 41.0 bits (94), Expect = 0.014, Method: Composition-based stats. Identities = 19/64 (29%), Positives = 22/64 (34%), Gaps = 14/64 (21%) Query: 24 PWVCGRCSREFVY------------SNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDH 71 W C +C + + R L VHH D NN D SN LC CH Sbjct: 21 GWGCAKCGMQCIKPGEDVSKLTVKERKARTLQVHHSDFTPENN--DPSNLIPLCTACHLS 78 Query: 72 EHSK 75 H Sbjct: 79 YHQG 82 >UniRef50_C8VXB8 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VXB8_DESAS Length = 326 Score = 40.6 bits (93), Expect = 0.016, Method: Composition-based stats. 
Identities = 19/63 (30%), Positives = 26/63 (41%), Gaps = 5/63 (7%) Query: 27 CG-RCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHSKYTEADQYGTT 85 C RC +S + +HHID +H+NN D N + CH HSK+ Sbjct: 18 CHYRCCLCPEHSRIA--NIHHIDKNHSNNQYD--NLVAVREKCHSDLHSKFEMRRNITPE 73 Query: 86 VIA 88 I Sbjct: 74 QIG 76 >UniRef50_A0Y4N4 Putative uncharacterized protein n=1 Tax=Alteromonadales bacterium TW-7 RepID=A0Y4N4_9GAMM Length = 430 Score = 40.6 bits (93), Expect = 0.016, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 21/53 (39%), Gaps = 3/53 (5%) Query: 22 IYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 CG C +E E VHH+ + N LLC CHD+ HS Sbjct: 17 RQNGRCGSCGKEIYAK---EFAVHHVLNCKDGGKGHIDNGVLLCCECHDNVHS 66 >UniRef50_UPI0001BC350E HNH endonuclease n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC350E Length = 263 Score = 40.6 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 12/81 (14%) Query: 8 YARLESGYREKALKIYPWVCGRCSREFV--YSNLRE--LTVHHI------DHDHTNNPED 57 + YRE A+KI+ C C +F Y + E + VHH D + NPE Sbjct: 163 RYERVAKYREAAIKIHGTKCQICGFDFNKKYGYIGENYIEVHHKKPLFSLDEELIPNPE- 221 Query: 58 GSNWELLCLYCHDHEHSKYTE 78 ++ +C CH H K + Sbjct: 222 -TDMITICSNCHRMIHRKKND 241 >UniRef50_B1QVB2 HNH endonuclease domain protein n=2 Tax=Clostridium butyricum RepID=B1QVB2_CLOBU Length = 409 Score = 40.2 bits (92), Expect = 0.018, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 33/88 (37%), Gaps = 12/88 (13%) Query: 1 MAIIPKNYARLESGYREKALKIYPWVCGRCSREF------VYSNLRELTVHHIDH----D 50 M I N R+K L+ Y + C C +F + N+ E VHH Sbjct: 305 MTTIQVNKYERNPKARKKCLEYYGYKCCICGFDFEKFYGYIGKNIIE--VHHKKALNEIK 362 Query: 51 HTNNPEDGSNWELLCLYCHDHEHSKYTE 78 +T + + +C CH H++ + Sbjct: 363 NTYEVDPIKDLRPVCSNCHTIIHNRKPD 390 >UniRef50_Q0I2A0 Putative uncharacterized protein n=1 Tax=Haemophilus somnus 129PT RepID=Q0I2A0_HAES1 Length = 269 Score = 40.2 bits (92), Expect = 0.019, Method: Composition-based stats. 
Identities = 16/63 (25%), Positives = 23/63 (36%), Gaps = 7/63 (11%) Query: 7 NYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCL 66 ++ + YRE W C C L VHH + + +N N + LC Sbjct: 190 DWRLISQRYREG----QKWCCENCGLNMQSYP-HLLDVHHRNGNKRDN--SDQNLQALCR 242 Query: 67 YCH 69 CH Sbjct: 243 ECH 245 >UniRef50_B4SDY4 HNH nuclease n=4 Tax=Bacteria RepID=B4SDY4_PELPB Length = 309 Score = 39.8 bits (91), Expect = 0.025, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 26/63 (41%), Gaps = 3/63 (4%) Query: 16 REKALKIYPWVCGRCS---REFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHE 72 R + L+ + C +C + S+ R L HHI N LC CHD E Sbjct: 247 RREVLQRDDYRCQQCGWHQEMWNQSDPRHLEAHHIKQHVEGGENTKENLVTLCNICHDKE 306 Query: 73 HSK 75 H++ Sbjct: 307 HTR 309 >UniRef50_A3XZ24 Putative uncharacterized protein n=1 Tax=Vibrio sp. MED222 RepID=A3XZ24_9VIBR Length = 282 Score = 39.8 bits (91), Expect = 0.025, Method: Composition-based stats. Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 7/66 (10%) Query: 7 NYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCL 66 N+ + YR + C +C V L HHID +NN N LC Sbjct: 200 NWREMSISYRAS----QQYCCEQCRVSLVTRK-TLLHTHHIDGVKSNN--SVRNLMALCK 252 Query: 67 YCHDHE 72 CH + Sbjct: 253 ECHSKQ 258 >UniRef50_Q44225 ORF439 n=2 Tax=Anabaena RepID=Q44225_9NOST Length = 439 Score = 39.8 bits (91), Expect = 0.029, Method: Composition-based stats. Identities = 12/43 (27%), Positives = 19/43 (44%), Gaps = 2/43 (4%) Query: 6 KNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHID 48 K ++L G+ KAL C C +F+ + +HH D Sbjct: 339 KRNSKLYDGHTSKALTKQNHKCASCGLKFIGEE--RVHLHHRD 379 >UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=10 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N9_HAMD5 Length = 570 Score = 39.4 bits (90), Expect = 0.033, Method: Composition-based stats. 
Identities = 18/62 (29%), Positives = 22/62 (35%), Gaps = 2/62 (3%) Query: 14 GYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEH 73 G + K C C +E +L VHHI +N LL CH H Sbjct: 495 GVKVNLYKRQKGYCPLCDQELDNGE--QLHVHHIQPKAEGGDNKLANLRLLHANCHRQLH 552 Query: 74 SK 75 SK Sbjct: 553 SK 554 >UniRef50_B7VNF7 Putative uncharacterized protein n=6 Tax=Vibrionaceae RepID=B7VNF7_VIBSL Length = 362 Score = 39.4 bits (90), Expect = 0.035, Method: Composition-based stats. Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 7/66 (10%) Query: 7 NYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCL 66 +++R+ S YR + C C +N L VHHI+ ++N E+ N LC+ Sbjct: 181 DWSRVSSRYRVD----KNFKCEDCKVNL-RTNRSLLHVHHINGVKSDNNEE--NLRSLCI 233 Query: 67 YCHDHE 72 CH + Sbjct: 234 DCHSKQ 239 >UniRef50_B4XH92 Putative uncharacterized protein n=1 Tax=Actinobacillus pleuropneumoniae RepID=B4XH92_ACTPL Length = 269 Score = 39.4 bits (90), Expect = 0.035, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 29/89 (32%), Gaps = 7/89 (7%) Query: 6 KNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLC 65 +++ + YRE + + C +C L VHH + +N +D N LC Sbjct: 187 QDWENISRQYRES----HQYCCEQCGVNLTSHK-HLLDVHHKNGVKQDNRKD--NLIALC 239 Query: 66 LYCHDHEHSKYTEADQYGTTVIAGEDAQK 94 CH + S I Sbjct: 240 KICHSEQDSHGHYYVSLMDRQIIESLRSP 268 >UniRef50_C1SP87 Predicted restriction endonuclease n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SP87_9BACT Length = 244 Score = 39.0 bits (89), Expect = 0.053, Method: Composition-based stats. 
Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 8/75 (10%) Query: 10 RLESGYREKALKIYPWVCGRCSREFVYS----NLRELTVHHI----DHDHTNNPEDGSNW 61 + R K L+ C C F + + + HHI + D P+ + Sbjct: 153 ERDHDARTKCLESQGCTCSVCGFNFEQAYGLMGIDFIHTHHITPPSEIDKNYIPDPAKDL 212 Query: 62 ELLCLYCHDHEHSKY 76 LC CH H K Sbjct: 213 VPLCPNCHAMIHRKS 227 >UniRef50_B8IXC1 HNH endonuclease n=4 Tax=Methylobacterium RepID=B8IXC1_METNO Length = 276 Score = 38.7 bits (88), Expect = 0.056, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 4/83 (4%) Query: 15 YREKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 +R L+ W C C + +R H I+ D +N LC CH + Sbjct: 198 WRLAVLQRAGWCCEDCGAQGGRGGVRLFADHVIERQDGGALTDPNNGRCLCGSCH----T 253 Query: 75 KYTEADQYGTTVIAGEDAQKDVG 97 + T A++ + A+ G Sbjct: 254 RKTVAERARRMAVRSAAAEPGRG 276 >UniRef50_C7LPI3 Restriction endonuclease n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LPI3_DESBD Length = 313 Score = 38.7 bits (88), Expect = 0.059, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 39/113 (34%), Gaps = 7/113 (6%) Query: 7 NYARLESGYREKALKIYPWVCGRCSREFVYSNLREL--TVHHIDH--DHTNNPEDGSNWE 62 + +R +K+Y C C + + R + H D H ++P +G Sbjct: 185 EKPARDQAFRRVIVKVYDHRCALCGIRIISPDSRTVVDAAHIKDWAVSHDDSPTNGL--- 241 Query: 63 LLCLYCHDHEHSKYTEADQYGTTVIAGEDAQKDVGEAKYNPFADLKAMMNKKK 115 LC CH S D ++A + FA+ +M +++ Sbjct: 242 ALCKLCHWTFDSGLVGFDDDFKVIVARSITRDGNLPGHIQQFANRPMIMPERE 294 >UniRef50_A0YXC5 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YXC5_9CYAN Length = 299 Score = 38.7 bits (88), Expect = 0.060, Method: Composition-based stats. 
Identities = 15/59 (25%), Positives = 22/59 (37%), Gaps = 2/59 (3%) Query: 16 REKALKIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHS 74 + + LK C C + F + L HH+ N L+ L+CHD H Sbjct: 239 KSRLLKKQKGKCADCGQTFKPED--NLEKHHLKAKAKGGNNSDKNLILVHLHCHDQIHG 295 >UniRef50_D2QV00 HNH endonuclease n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QV00_9SPHI Length = 260 Score = 38.7 bits (88), Expect = 0.066, Method: Composition-based stats. Identities = 15/57 (26%), Positives = 25/57 (43%), Gaps = 3/57 (5%) Query: 23 YPWVCGRCSREFVY-SNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDHEHSKYTE 78 + C +CS + + L VHH + T+N N + LC+ CH ++ E Sbjct: 186 KAYKCEKCSIDLSAVMDRHFLHVHHRNGRKTDNRP--QNLQCLCIRCHASVDDRHKE 240 >UniRef50_D1P9G2 5-methylcytosine-specific restriction enzyme A n=1 Tax=Prevotella copri DSM 18205 RepID=D1P9G2_9BACT Length = 251 Score = 38.3 bits (87), Expect = 0.070, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 10/74 (13%) Query: 10 RLESGYREKALKIYPWVCGRCSREFVYSNLREL-----TVHHI----DHDHTNNPEDG-S 59 R R+ L Y + C C +F + +EL VHHI ++ PE+ Sbjct: 149 RRNPQLRQMCLDKYGYQCQCCGMDFEETYGKELGVNFMEVHHIRMISTYETDGVPENFLE 208 Query: 60 NWELLCLYCHDHEH 73 N LC CH H Sbjct: 209 NLVPLCSNCHSMIH 222 >UniRef50_C6X560 HNH nuclease n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X560_FLAB3 Length = 231 Score = 38.3 bits (87), Expect = 0.074, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 34/80 (42%), Gaps = 8/80 (10%) Query: 4 IPKNYARLESGYREKAL--KIYPWVCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNW 61 + ++ R Y+ +++ K VC C E ++ VHHID + NN D N Sbjct: 146 VSESKPRKTISYKIRSILQKEIKSVCPFCYNE----DVEHFHVHHIDENPANNKID--NL 199 Query: 62 ELLCLYCHDHEHSKYTEADQ 81 +LC CH + ++ Sbjct: 200 LMLCPNCHSKITKGDIKYEE 219 >UniRef50_C7N4P2 HNH endonuclease n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N4P2_SLAHD Length = 97 Score = 38.3 bits (87), Expect = 0.078, Method: Composition-based stats. 
Identities = 22/72 (30%), Positives = 30/72 (41%), Gaps = 13/72 (18%) Query: 8 YARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDH---DHTNNP---EDGSNW 61 +R R L+ +C RC R V VHH + D+ ++P D N Sbjct: 12 KSRAWKRTRAAYLETVNRICERCGRPAV-------IVHHKRYVTADNLHDPGVTLDFENL 64 Query: 62 ELLCLYCHDHEH 73 E LC CH+ EH Sbjct: 65 EALCRDCHNKEH 76 >UniRef50_C6UP82 Predicted protein n=14 Tax=Escherichia coli RepID=C6UP82_ECO5T Length = 367 Score = 38.3 bits (87), Expect = 0.083, Method: Composition-based stats. Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 9/68 (13%) Query: 6 KNYARLESGYREKALKIYPWVCGRCSREFVYSNLRELTVHHIDH-DHTNNPEDGSNWELL 64 +N+ + REKA +VC C + VHH + + N+ E N +L Sbjct: 191 ENWKEISKEIREKA----NYVCNDCGVNLSTAK-NLCHVHHKNGIKYDNHHE---NLLVL 242 Query: 65 CLYCHDHE 72 C CH + Sbjct: 243 CKDCHRKQ 250 >UniRef50_C9KJE0 H-N-H endonuclease F-TflIV n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KJE0_9FIRM Length = 203 Score = 37.9 bits (86), Expect = 0.093, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 20/46 (43%), Gaps = 2/46 (4%) Query: 26 VCGRCSREFVYSNLRELTVHHIDHDHTNNPEDGSNWELLCLYCHDH 71 VC C +HH D D+TNN D N ++LC CH Sbjct: 69 VCEICGISEWNGKPISCQLHHKDGDNTNNSLD--NLQMLCPNCHSQ 112 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.306 0.121 0.349 Lambda K H 0.267 0.0371 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 659,635,713 Number of Sequences: 3077464 Number of extensions: 23085346 Number of successful extensions: 77542 Number of sequences better than 1.0e-01: 87 Number of HSP's better than 0.1 without gapping: 30 Number of HSP's successfully gapped in prelim test: 89 Number of HSP's that attempted gapping in prelim test: 77426 Number of HSP's gapped (non-prelim): 124 length of query: 115 length of database: 1,040,396,356 effective HSP length: 82 effective length of query: 33 effective length of database: 
788,044,308
effective search space: 26005462164
effective search space used: 26005462164
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.1 bits)
S2: 87 (38.3 bits)
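
Note on the trailer statistics: the figures reported above can be cross-checked with the usual Karlin-Altschul relations. The sketch below is illustrative only and is not part of the BLAST output. The function and constant names are hypothetical, and it assumes that the second "Lambda K H" triple (0.267, 0.0371, 0.140) is the gapped one, since the flattened report no longer labels the two triples. Under those assumptions, the effective lengths are the raw lengths minus the effective HSP length (once for the query, once per database sequence), the search space is their product, a raw score S converts to bits as (lambda*S - ln K)/ln 2, and the expect value is the search space times 2^(-bits).

```python
import math

# Values copied from the report trailer.
QUERY_LEN = 115
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
EFF_HSP_LEN = 82

# Assumption: the second Lambda/K pair in the trailer is the gapped one.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0371


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    # The HSP length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print("effective length of query:   ", eff_q)
print("effective length of database:", eff_db)
print("effective search space:      ", space)

# S2 is reported as raw score 87 (38.3 bits).
s2_bits = bit_score(87, LAMBDA_GAPPED, K_GAPPED)
print("S2 in bits:", round(s2_bits, 1))
print("E-value at S2:", expect_value(s2_bits, space))
```

With the trailer values, the three integer results reproduce the reported effective query length (33), effective database length (788,044,308) and effective search space (26005462164). The bit score computed for S2 lands close to the reported 38.3 bits, and the corresponding E-value falls just under the 0.1 reporting threshold stated in "Number of sequences better than 1.0e-01".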