BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (243 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score   E
Sequences producing significant alignments:                        (bits) Value

UniRef50_P77169 Uncharacterized protein yagJ n=4 Tax=Enterobacte...   501   e-141
UniRef50_A3QCD3 Type III restriction enzyme, res subunit n=8 Tax...   337   2e-91
UniRef50_B8HHU8 Type III restriction protein res subunit n=1 Tax...   102   2e-20
UniRef50_A4G305 Putative DNA or RNA helicase of superfamily II n...    85   2e-15
UniRef50_D1PAZ7 Type III restriction enzyme, res subunit superfa...    76   1e-12
UniRef50_A4YMZ4 Putative uncharacterized protein n=1 Tax=Bradyrh...    75   2e-12
UniRef50_C5RJN1 Type III restriction protein res subunit n=1 Tax...    69   2e-10
UniRef50_C9KIN7 Type III restriction enzyme, res subunit superfa...    68   3e-10
UniRef50_B0TCY2 Type iii restriction enzyme, res subunit, putati...    66   9e-10
UniRef50_D1ZY02 Whole genome shotgun sequence assembly, contig_3...    40   0.056

>UniRef50_P77169 Uncharacterized protein yagJ n=4 Tax=Enterobacteriaceae RepID=YAGJ_ECOLI
          Length = 243

 Score = 501 bits (1291), Expect = e-141, Method: Compositional matrix adjust.
 Identities = 243/243 (100%), Positives = 243/243 (100%)

Query: 1   MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60
           MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP
Sbjct: 1   MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60

Query: 61  QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120
           QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE
Sbjct: 61  QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120

Query: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180
           GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL
Sbjct: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180

Query: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240
           PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD
Sbjct: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240

Query: 241 DYQ 243
           DYQ
Sbjct: 241 DYQ 243

>UniRef50_A3QCD3 Type III restriction enzyme, res subunit n=8 Tax=Bacteria RepID=A3QCD3_SHELP
          Length = 861

 Score = 337 bits (864), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 160/242 (66%), Positives = 190/242 (78%) Query: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 +EARVTVAG+GLV EVQ YFD EAD+LAK WL +Y QIK+L D+RKE+YRQIVEMSTEP Sbjct: 619 VEARVTVAGLGLVTEVQAYFDAEADKLAKEWLVKYASQIKALSDDRKESYRQIVEMSTEP 678 Query: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 Q DLV+P ++ E + RE +KE P W +HLL D+ G YP +N WE V E E+KR+ Sbjct: 679 QSFDLVKPESRCEAAKARESDKEIKFPTWNNHLLSDKDGKYPVEMNEWERTVVEAESKRD 738 Query: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 GF FWYRNPQ GQSSLGIAY+E EQ+KIVRPDF+FFAEQD K+VVDLVDPH +HLADAL Sbjct: 739 GFLFWYRNPQQPGQSSLGIAYLEDEQFKIVRPDFIFFAEQDDKIVVDLVDPHGVHLADAL 798 Query: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240 PKL+GLA YA H++AYRRIE+VAE GKLRVLDL R DV+ AV A +A++LF AD Sbjct: 799 PKLQGLAAYATKHANAYRRIEAVAEAHGKLRVLDLTRTDVRQAVLDASSAKSLFEGLSAD 858 Query: 241 DY 242 DY Sbjct: 859 DY 860 >UniRef50_B8HHU8 Type III restriction protein res subunit n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HHU8_ARTCA Length = 867 Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust. 
Identities = 71/246 (28%), Positives = 117/246 (47%), Gaps = 17/246 (6%) Query: 2 EARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQI-----VEM 56 EA V ++ + + EV + A L +W + P + L + +E I V + Sbjct: 614 EAVVRLSALSVHEEVIRTLESSASALIDSWRQQLNPAVSRLDTKDREELDAIWHPHGVPI 673 Query: 57 STEPQDVDLVRPANKFEMTRVREGEKEA-----DLPVWKHHLLCDESGNYPALLNHWETK 111 + E + + VR + ++ V++G ++ + + HL D G++P WE + Sbjct: 674 AGEFRLPEKVRTRTQ-KIAAVKDGAGKSVETIEAIEAFNGHLFADGQGDFPMAATGWERE 732 Query: 112 VFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDP 171 V E E + WYRNP G L ++Y E K + PDFLFF DG +VVD+VDP Sbjct: 733 VIEKELAQSSLIGWYRNP--AGAGGLSVSYTEGGVDKSLYPDFLFFHRVDGDIVVDIVDP 790 Query: 172 HSLHLADALPKLEGLALYAEHHSDAYRRIESVAEVK-GKLRVLDLKRQD---VQDAVATA 227 H+ L D K L+ +A H A+RR+ +V K G L+ L+L + +++ +A A Sbjct: 791 HNHSLGDTPGKWAALSRFARQHPGAFRRVTAVIRNKAGALKSLELTGRANTVLENKIAAA 850 Query: 228 ENAETL 233 E + Sbjct: 851 SGGEGI 856 >UniRef50_A4G305 Putative DNA or RNA helicase of superfamily II n=1 Tax=Herminiimonas arsenicoxydans RepID=A4G305_HERAR Length = 840 Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust. 
Identities = 56/185 (30%), Positives = 93/185 (50%), Gaps = 7/185 (3%) Query: 32 LAEYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKFEMTRVREGEKEADLPVWKH 91 L++ IK L +E Y ++ E++ EP+ + + P + K+++ + Sbjct: 635 LSQNKAAIKKLTTAEQEDYNKVQEVAKEPEALSFLPPVEMMLAVDI----KDSNFRNYDG 690 Query: 92 HLLCDESGNYPALLNHWETKVFEIETKREGFAFWYRN-PQYTGQSSLGIAYVEAEQYKIV 150 H+ D SG++ +LN+WE V E E R W RN P+ SL Y + + + Sbjct: 691 HMYVDASGHFVDVLNNWEHPVIEAEIVRADVVGWLRNVPRKPWAFSL--PYEFGGENRPM 748 Query: 151 RPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEHHSDAYRRIESVAEVKGKL 210 PDFL D +VD+++PHS LAD+ K +GLA +A H+ + RIE + V ++ Sbjct: 749 YPDFLVVRAVDDDHIVDILEPHSPALADSYAKAKGLAQFAAKHAMHFGRIELIRVVGKEI 808 Query: 211 RVLDL 215 + LDL Sbjct: 809 KRLDL 813 >UniRef50_D1PAZ7 Type III restriction enzyme, res subunit superfamily n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAZ7_9BACT Length = 839 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 60/201 (29%), Positives = 94/201 (46%), Gaps = 22/201 (10%) Query: 34 EYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKFEMTRVREGEKEADLPVWKHHL 93 +Y +++ +E K Y +IV+ DL P + ++ EG D HL Sbjct: 636 KYRQKLEDFGEEVKREYEKIVKAHVSTLPFDLTLP-DLMVTSKHPEGTAFTD------HL 688 Query: 94 LCDESGNYPALLNHWETKVFEIETKREGFAFWYRN-PQYTGQSSLGIAYVEAEQYKIVRP 152 D G LN WE V ++E K+EGF W RN P G L I Y+ ++ + P Sbjct: 689 YVDGEGKAVFKLNEWEQAVLDVEQKKEGFVCWVRNVPNKNG--FLCIQYLNGDELRPHFP 746 Query: 153 DFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEHHSDAYRRIESVAEVKGKLRV 212 DF+ D + L++PH AD++PKL+G+A Y+E S A +R E +R+ Sbjct: 747 DFIVVRRVDEQFEFVLLEPHYTGYADSVPKLKGMAAYSERCS-AIKRNEM-------MRI 798 Query: 213 LDL----KRQDVQDAVATAEN 229 +D+ K Q + A ++ N Sbjct: 799 VDIATGKKVQSLNAASSSVRN 819 >UniRef50_A4YMZ4 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMZ4_BRASO Length = 824 Score = 74.7 bits (182), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 57/209 (27%), Positives = 95/209 (45%), Gaps = 16/209 (7%) Query: 30 AWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKFEMTRVREGEKEADLPVW 89 +W + +I L K ++ +V+ S + +L P E R W Sbjct: 625 SWWLKNKSKIAVLPASEKARFQLLVQASGKAVRQELELPLTIVEKPGAR---------TW 675 Query: 90 KHHLLCDESGNYPALLNHWETKVFEIETKREGFAFWYRN-PQYTGQSSLGIAYVEAEQYK 148 K+HL D +GN+ A +N WE E + F W RN P+ + +L + Y E+ K Sbjct: 676 KNHLFVDLAGNFAANMNSWEEDCLEWAAQSPDFVCWLRNLPRR--EWALCVPY-ESGGEK 732 Query: 149 IVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEHHSDAYRRIESVAEVKG 208 PDFL + VVD+++PH D K++GLA +A+ H A+ R+ + G Sbjct: 733 PFYPDFLIVRKSGTSFVVDVMEPHDDSRTDTWAKVKGLASFADEHHLAFGRLMIGRKKNG 792 Query: 209 KLRVLDL---KRQDVQDAVATAENAETLF 234 L+ +D+ K + +A + + E+LF Sbjct: 793 ALQFIDVSEAKTRAKARKLAASADLESLF 821 >UniRef50_C5RJN1 Type III restriction protein res subunit n=1 Tax=Clostridium cellulovorans 743B RepID=C5RJN1_CLOCL Length = 961 Score = 68.6 bits (166), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 7/143 (4%) Query: 89 WKHHLLCDESGNYPAL-LNHWETKVFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQY 147 + +HL + + L LN WE V E E +R+ F W RNP +L I Y + Sbjct: 799 YMNHLFVNNTTGMAKLKLNTWEAGVIEEEERRDDFVCWIRNPSRASW-ALCIPYEIDGET 857 Query: 148 KIVRPDFLFFAEQDG-KMVVDLVDPHSLHLADALPKLEGLALYAEHHSDAYR----RIES 202 K PDF+ + D V+DL++PH+ D L K +G A YA + R R+ Sbjct: 858 KPTFPDFIVVRKDDRLGYVIDLLEPHNPDFKDNLGKAKGFAEYARQNPGVGRIQLIRMSK 917 Query: 203 VAEVKGKLRVLDLKRQDVQDAVA 225 A K KL+ LD+ + ++D V+ Sbjct: 918 DAAGKNKLKRLDMSKSSIRDKVS 940 >UniRef50_C9KIN7 Type III restriction enzyme, res subunit superfamily n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KIN7_9FIRM Length = 962 Score = 67.8 bits (164), Expect = 3e-10, Method: Compositional matrix adjust. 
Identities = 61/211 (28%), Positives = 92/211 (43%), Gaps = 15/211 (7%) Query: 30 AWLAEYTPQIKSLKDER-KEAYRQIVEMSTEPQDVDLVRPANKFEMTRVREGEKEADLPV 88 +W EY I + E+ ++ Y IV D D V N F + + + Sbjct: 748 SWNDEYRRYIARIDSEKIRKQYDSIV------SDGDKVSKHN-FRLPETIQVPHDIGGKP 800 Query: 89 WKHHLLCDESGNYPAL-LNHWETKVFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQY 147 ++ HL + +L LN WE+ V E E KR F W RNP G +L I Y E Sbjct: 801 YRDHLFVNGDTGVASLKLNSWESGVIEEEEKRHDFVCWIRNPS-RGSWALCIPYDEDGDT 859 Query: 148 KIVRPDFLFFAEQD-GKMVVDLVDPHSLHLADALPKLEGLALYAEHHSDAYR----RIES 202 K PDF+ + + V+D+++PH+ D L K +G A YA + R R+ Sbjct: 860 KPTYPDFIIVRKDPISEYVIDILEPHNPDFKDNLGKAKGFAEYARLNPGLGRIQLIRMSK 919 Query: 203 VAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 A K + LD+ + ++D V A + E L Sbjct: 920 DAAGHNKFKRLDMAKSAIRDKVLKAMSIEEL 950 >UniRef50_B0TCY2 Type iii restriction enzyme, res subunit, putative n=3 Tax=Firmicutes RepID=B0TCY2_HELMI Length = 835 Score = 66.2 bits (160), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 56/201 (27%), Positives = 85/201 (42%), Gaps = 16/201 (7%) Query: 39 IKSLKDERKEAYRQIVEMSTEPQDVDLVRPAN-KFEMTRVREGEKEADLPVWKHHLLCDE 97 I L + RK Y +++ S +P V V P + F ++ D + HL C E Sbjct: 633 IAKLNEARKIVYERLINASAQPIAVPWVLPDSIDFSVS--------DDSIKLEQHLFCSE 684 Query: 98 SGNYPALLNHWETKVFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFF 157 G + A LN WE+ V E G W RN + SL I Y + PD + Sbjct: 685 DGIFQATLNPWESGVVA-EELNNGAVCWLRNLDRK-KWSLEIPYEVGGITTSMFPDLVIV 742 Query: 158 AEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEHHSDAYRRIESVAEVKG-----KLRV 212 + D+++PH D PK GLA +AE H D + RI+ + + +G Sbjct: 743 RADAQGYIFDILEPHDPSRKDNYPKAVGLAKFAEKHWDVFGRIQLIRQKRGVDGRDHFYR 802 Query: 213 LDLKRQDVQDAVATAENAETL 233 LD+ + V++ V + E L Sbjct: 803 LDMSKTPVRNRVRGITSNEEL 823 >UniRef50_D1ZY02 Whole genome shotgun sequence assembly, contig_3552 n=1 Tax=Sordaria macrospora RepID=D1ZY02_SORMA Length = 168 Score = 40.4 bits (93), Expect = 0.056, Method: Compositional matrix adjust. 
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 39  IKSLKDERKEAYRQIVEMSTEPQDVDLVRP 68
           IK L D+R++AYR I EMS  P DVDL +P
Sbjct: 104 IKDLGDDRQDAYRLIREMSPTPMDVDLAKP 133

Searching..................................................done

Results from round 2

                                                                    Score   E
Sequences producing significant alignments:                        (bits) Value

Sequences used in model and found again:

UniRef50_P77169 Uncharacterized protein yagJ n=4 Tax=Enterobacte...   325   8e-88
UniRef50_A3QCD3 Type III restriction enzyme, res subunit n=8 Tax...   324   1e-87
UniRef50_B8HHU8 Type III restriction protein res subunit n=1 Tax...   255   9e-67
UniRef50_C9KIN7 Type III restriction enzyme, res subunit superfa...   238   9e-62
UniRef50_A4YMZ4 Putative uncharacterized protein n=1 Tax=Bradyrh...   236   4e-61
UniRef50_A4G305 Putative DNA or RNA helicase of superfamily II n...   231   2e-59
UniRef50_B0TCY2 Type iii restriction enzyme, res subunit, putati...   226   6e-58
UniRef50_C5RJN1 Type III restriction protein res subunit n=1 Tax...   225   9e-58
UniRef50_D1PAZ7 Type III restriction enzyme, res subunit superfa...   204   2e-51

Sequences not found previously or not previously below threshold:

UniRef50_A1U8N1 Type III restriction enzyme, res subunit n=2 Tax...    46   0.002
UniRef50_B1XQZ6 Type III restriction-modification enzyme, R/heli...    44   0.003
UniRef50_A7HKD2 Type III restriction protein res subunit n=3 Tax...    42   0.018
UniRef50_Q9ZJM1 Putative n=15 Tax=Helicobacter RepID=Q9ZJM1_HELPJ      41   0.032
UniRef50_A3YK84 Type III restriction enzyme R protein, putative ...    41   0.054

>UniRef50_P77169 Uncharacterized protein yagJ n=4 Tax=Enterobacteriaceae RepID=YAGJ_ECOLI
          Length = 243

 Score = 325 bits (833), Expect = 8e-88, Method: Composition-based stats.
Identities = 243/243 (100%), Positives = 243/243 (100%) Query: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP Sbjct: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 Query: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE Sbjct: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 Query: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL Sbjct: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 Query: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD Sbjct: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240 Query: 241 DYQ 243 DYQ Sbjct: 241 DYQ 243 >UniRef50_A3QCD3 Type III restriction enzyme, res subunit n=8 Tax=Bacteria RepID=A3QCD3_SHELP Length = 861 Score = 324 bits (831), Expect = 1e-87, Method: Composition-based stats. 
Identities = 160/242 (66%), Positives = 190/242 (78%) Query: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 +EARVTVAG+GLV EVQ YFD EAD+LAK WL +Y QIK+L D+RKE+YRQIVEMSTEP Sbjct: 619 VEARVTVAGLGLVTEVQAYFDAEADKLAKEWLVKYASQIKALSDDRKESYRQIVEMSTEP 678 Query: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 Q DLV+P ++ E + RE +KE P W +HLL D+ G YP +N WE V E E+KR+ Sbjct: 679 QSFDLVKPESRCEAAKARESDKEIKFPTWNNHLLSDKDGKYPVEMNEWERTVVEAESKRD 738 Query: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 GF FWYRNPQ GQSSLGIAY+E EQ+KIVRPDF+FFAEQD K+VVDLVDPH +HLADAL Sbjct: 739 GFLFWYRNPQQPGQSSLGIAYLEDEQFKIVRPDFIFFAEQDDKIVVDLVDPHGVHLADAL 798 Query: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240 PKL+GLA YA H++AYRRIE+VAE GKLRVLDL R DV+ AV A +A++LF AD Sbjct: 799 PKLQGLAAYATKHANAYRRIEAVAEAHGKLRVLDLTRTDVRQAVLDASSAKSLFEGLSAD 858 Query: 241 DY 242 DY Sbjct: 859 DY 860 >UniRef50_B8HHU8 Type III restriction protein res subunit n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HHU8_ARTCA Length = 867 Score = 255 bits (651), Expect = 9e-67, Method: Composition-based stats. 
Identities = 71/246 (28%), Positives = 117/246 (47%), Gaps = 17/246 (6%) Query: 2 EARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQI-----VEM 56 EA V ++ + + EV + A L +W + P + L + +E I V + Sbjct: 614 EAVVRLSALSVHEEVIRTLESSASALIDSWRQQLNPAVSRLDTKDREELDAIWHPHGVPI 673 Query: 57 STEPQDVDLVRPANKFEMTRVREGEKEA-----DLPVWKHHLLCDESGNYPALLNHWETK 111 + E + + VR + ++ V++G ++ + + HL D G++P WE + Sbjct: 674 AGEFRLPEKVRTRTQ-KIAAVKDGAGKSVETIEAIEAFNGHLFADGQGDFPMAATGWERE 732 Query: 112 VFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDP 171 V E E + WYRNP G L ++Y E K + PDFLFF DG +VVD+VDP Sbjct: 733 VIEKELAQSSLIGWYRNP--AGAGGLSVSYTEGGVDKSLYPDFLFFHRVDGDIVVDIVDP 790 Query: 172 HSLHLADALPKLEGLALYAEHHSDAYRRIESVAEVK-GKLRVLDLKRQD---VQDAVATA 227 H+ L D K L+ +A H A+RR+ +V K G L+ L+L + +++ +A A Sbjct: 791 HNHSLGDTPGKWAALSRFARQHPGAFRRVTAVIRNKAGALKSLELTGRANTVLENKIAAA 850 Query: 228 ENAETL 233 E + Sbjct: 851 SGGEGI 856 >UniRef50_C9KIN7 Type III restriction enzyme, res subunit superfamily n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KIN7_9FIRM Length = 962 Score = 238 bits (608), Expect = 9e-62, Method: Composition-based stats. 
Identities = 63/226 (27%), Positives = 95/226 (42%), Gaps = 15/226 (6%) Query: 15 EVQDYFDGEADRLAKAWLAEYTPQIKSLKDER-KEAYRQIVEMSTEPQDVDLVRPANKFE 73 E + A +W EY I + E+ ++ Y IV D D V N F Sbjct: 733 ECMNQLHNYAQDKFHSWNDEYRRYIARIDSEKIRKQYDSIV------SDGDKVSKHN-FR 785 Query: 74 MTRVREGEKEADLPVWKHHLLCDESGNYPAL-LNHWETKVFEIETKREGFAFWYRNPQYT 132 + + + ++ HL + +L LN WE+ V E E KR F W RNP Sbjct: 786 LPETIQVPHDIGGKPYRDHLFVNGDTGVASLKLNSWESGVIEEEEKRHDFVCWIRNPSR- 844 Query: 133 GQSSLGIAYVEAEQYKIVRPDFLFFAEQD-GKMVVDLVDPHSLHLADALPKLEGLALYAE 191 G +L I Y E K PDF+ + + V+D+++PH+ D L K +G A YA Sbjct: 845 GSWALCIPYDEDGDTKPTYPDFIIVRKDPISEYVIDILEPHNPDFKDNLGKAKGFAEYAR 904 Query: 192 HHSDAYR----RIESVAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 + R R+ A K + LD+ + ++D V A + E L Sbjct: 905 LNPGLGRIQLIRMSKDAAGHNKFKRLDMAKSAIRDKVLKAMSIEEL 950 >UniRef50_A4YMZ4 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMZ4_BRASO Length = 824 Score = 236 bits (603), Expect = 4e-61, Method: Composition-based stats. Identities = 58/236 (24%), Positives = 99/236 (41%), Gaps = 14/236 (5%) Query: 3 ARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQD 62 A+V + + + A +W + +I L K ++ +V+ S + Sbjct: 598 AKVELFALIRRAGTLAQVEDVARSCFDSWWLKNKSKIAVLPASEKARFQLLVQASGKAVR 657 Query: 63 VDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGF 122 +L P E R WK+HL D +GN+ A +N WE E + F Sbjct: 658 QELELPLTIVEKPGAR---------TWKNHLFVDLAGNFAANMNSWEEDCLEWAAQSPDF 708 Query: 123 AFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPK 182 W RN + +L + Y + K PDFL + VVD+++PH D K Sbjct: 709 VCWLRNLPRR-EWALCVPYESGGE-KPFYPDFLIVRKSGTSFVVDVMEPHDDSRTDTWAK 766 Query: 183 LEGLALYAEHHSDAYRRIESVAEVKGKLRVLDL---KRQDVQDAVATAENAETLFS 235 ++GLA +A+ H A+ R+ + G L+ +D+ K + +A + + E+LF Sbjct: 767 VKGLASFADEHHLAFGRLMIGRKKNGALQFIDVSEAKTRAKARKLAASADLESLFE 822 >UniRef50_A4G305 Putative DNA or RNA helicase of superfamily II n=1 Tax=Herminiimonas arsenicoxydans RepID=A4G305_HERAR Length = 840 Score = 231 bits (588), Expect = 2e-59, Method: 
Composition-based stats. Identities = 58/221 (26%), Positives = 101/221 (45%), Gaps = 7/221 (3%) Query: 14 MEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKFE 73 + + + L++ IK L +E Y ++ E++ EP+ + + P Sbjct: 617 EKTWKTLEAACGERCETLLSQNKAAIKKLTTAEQEDYNKVQEVAKEPEALSFLPPVEMML 676 Query: 74 MTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGFAFWYRN-PQYT 132 + K+++ + H+ D SG++ +LN+WE V E E R W RN P+ Sbjct: 677 AVDI----KDSNFRNYDGHMYVDASGHFVDVLNNWEHPVIEAEIVRADVVGWLRNVPRKP 732 Query: 133 GQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEH 192 SL Y + + + PDFL D +VD+++PHS LAD+ K +GLA +A Sbjct: 733 WAFSL--PYEFGGENRPMYPDFLVVRAVDDDHIVDILEPHSPALADSYAKAKGLAQFAAK 790 Query: 193 HSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 H+ + RIE + V +++ LDL + V ++ L Sbjct: 791 HAMHFGRIELIRVVGKEIKRLDLIDATNRKRVLAVDSNAGL 831 >UniRef50_B0TCY2 Type iii restriction enzyme, res subunit, putative n=3 Tax=Firmicutes RepID=B0TCY2_HELMI Length = 835 Score = 226 bits (575), Expect = 6e-58, Method: Composition-based stats. 
Identities = 57/225 (25%), Positives = 89/225 (39%), Gaps = 14/225 (6%) Query: 14 MEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKFE 73 + + D A++ I L + RK Y +++ S +P V V P + Sbjct: 608 TDAMERIDTYAEKEFINLYENNKRAIAKLNEARKIVYERLINASAQPIAVPWVLPDSI-- 665 Query: 74 MTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGFAFWYRNPQYTG 133 + D + HL C E G + A LN WE+ V E G W RN Sbjct: 666 -----DFSVSDDSIKLEQHLFCSEDGIFQATLNPWESGVV-AEELNNGAVCWLRNLDRK- 718 Query: 134 QSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEHH 193 + SL I Y + PD + + D+++PH D PK GLA +AE H Sbjct: 719 KWSLEIPYEVGGITTSMFPDLVIVRADAQGYIFDILEPHDPSRKDNYPKAVGLAKFAEKH 778 Query: 194 SDAYRRIESVAEVKG-----KLRVLDLKRQDVQDAVATAENAETL 233 D + RI+ + + +G LD+ + V++ V + E L Sbjct: 779 WDVFGRIQLIRQKRGVDGRDHFYRLDMSKTPVRNRVRGITSNEEL 823 >UniRef50_C5RJN1 Type III restriction protein res subunit n=1 Tax=Clostridium cellulovorans 743B RepID=C5RJN1_CLOCL Length = 961 Score = 225 bits (574), Expect = 9e-58, Method: Composition-based stats. Identities = 60/226 (26%), Positives = 95/226 (42%), Gaps = 15/226 (6%) Query: 15 EVQDYFDGEADRLAKAWLAEYTPQIKSLKDER-KEAYRQIVEMSTEPQDVDLVRPANKFE 73 E + A + +Y I ++ ++ + Y IV D D+V N F Sbjct: 731 ECMNRLHNYAQKRFHGLNDDYRRYIATVDSDKIRRQYDNIV------SDGDVVSKHN-FR 783 Query: 74 MTRVREGEKEADLPVWKHHLLCDESGNYPAL-LNHWETKVFEIETKREGFAFWYRNPQYT 132 + + E + +HL + + L LN WE V E E +R+ F W RNP Sbjct: 784 LPETIQVPHEDGGKKYMNHLFVNNTTGMAKLKLNTWEAGVIEEEERRDDFVCWIRNPSRA 843 Query: 133 GQSSLGIAYVEAEQYKIVRPDFLFFAEQDG-KMVVDLVDPHSLHLADALPKLEGLALYAE 191 +L I Y + K PDF+ + D V+DL++PH+ D L K +G A YA Sbjct: 844 -SWALCIPYEIDGETKPTFPDFIVVRKDDRLGYVIDLLEPHNPDFKDNLGKAKGFAEYAR 902 Query: 192 HHSDAYR----RIESVAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 + R R+ A K KL+ LD+ + ++D V+ + L Sbjct: 903 QNPGVGRIQLIRMSKDAAGKNKLKRLDMSKSSIRDKVSHTMTNDEL 948 >UniRef50_D1PAZ7 Type III restriction enzyme, res subunit superfamily n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAZ7_9BACT Length = 839 Score = 204 bits (519), Expect = 2e-51, Method: 
Composition-based stats. Identities = 58/209 (27%), Positives = 95/209 (45%), Gaps = 15/209 (7%) Query: 30 AWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKFEMTRVREGEKEADLPVW 89 + +Y +++ +E K Y +IV+ DL P + ++ EG D Sbjct: 632 EYYDKYRQKLEDFGEEVKREYEKIVKAHVSTLPFDLTLP-DLMVTSKHPEGTAFTD---- 686 Query: 90 KHHLLCDESGNYPALLNHWETKVFEIETKREGFAFWYRN-PQYTGQSSLGIAYVEAEQYK 148 HL D G LN WE V ++E K+EGF W RN P G L I Y+ ++ + Sbjct: 687 --HLYVDGEGKAVFKLNEWEQAVLDVEQKKEGFVCWVRNVPNKNG--FLCIQYLNGDELR 742 Query: 149 IVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEHHSDAYRRIESVA---- 204 PDF+ D + L++PH AD++PKL+G+A Y+E S A +R E + Sbjct: 743 PHFPDFIVVRRVDEQFEFVLLEPHYTGYADSVPKLKGMAAYSERCS-AIKRNEMMRIVDI 801 Query: 205 EVKGKLRVLDLKRQDVQDAVATAENAETL 233 K++ L+ V++ + + + L Sbjct: 802 ATGKKVQSLNAASSSVRNDIKHLMSQDDL 830 >UniRef50_A1U8N1 Type III restriction enzyme, res subunit n=2 Tax=Marinobacter aquaeolei VT8 RepID=A1U8N1_MARAV Length = 913 Score = 45.6 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 52/156 (33%), Gaps = 12/156 (7%) Query: 60 PQDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKR 119 P D + + + RE ++ V + LN E + + Sbjct: 743 PPSTDDLATVEQTLVIEDRELLRDTQWQVGNER-FMTGRFDSTFSLNSDERAFADALDQA 801 Query: 120 EGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLF-FAEQDGKMVVDLVDPHSLHLAD 178 A+W+RNP S + V E PDF+ DG+ + + + D Sbjct: 802 SFVAWWFRNPDRK---SYSVQLVRGEHRNYFFPDFVVCVEHVDGQTPMARLIETKHDVKD 858 Query: 179 ALPKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLD 214 A K + H Y ++ + GKL V++ Sbjct: 859 ARRKAK-------HIPQHYGKVLFLTRDSGKLHVVN 887 >UniRef50_B1XQZ6 Type III restriction-modification enzyme, R/helicase subunit n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XQZ6_SYNP2 Length = 1039 Score = 44.4 bits (103), Expect = 0.003, Method: Composition-based stats. 
Identities = 36/210 (17%), Positives = 69/210 (32%), Gaps = 31/210 (14%) Query: 15 EVQDYFDGEADRLAKAWLAE-------YTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVR 67 +VQ + + +++K WL E PQ+ L + +A I + + ++ Sbjct: 806 DVQSWLFPQVLQISKDWLGECLIQKSHTFPQMLLLTEFAYDASDCIYQAIAAGESEKFLK 865 Query: 68 PANK-FEMTRVREGEKEADLPVWKHHLLCDES-----GNYPALLNHWETKVFEIETKREG 121 P + +E +G + + + A + WE K+ ++ Sbjct: 866 PILRPYETIGSTDGVDFDTSRP----VYVSDPEKCHISHVVADTDSWEQKMAQVLESMAE 921 Query: 122 FAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVD-----LVDPHSLHL 176 + +N I Y A Q K PDF+ A D D +V+ Sbjct: 922 VVCYVKN----QGLGFFIPYTMAGQSKNYMPDFI--ARVDDGHGEDDLLNLIVEVSGEAR 975 Query: 177 ADALPKLEGLALY---AEHHSDAYRRIESV 203 D K++ + A + R E + Sbjct: 976 RDKAIKVQTARNFWLPAVNGHGGLGRWEFI 1005 >UniRef50_A7HKD2 Type III restriction protein res subunit n=3 Tax=Thermotogaceae RepID=A7HKD2_FERNB Length = 989 Score = 42.1 bits (97), Expect = 0.018, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 2/50 (4%) Query: 139 IAYVEA--EQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGL 186 I Y + PDF+F+ ++D K +V VDP S K++G Sbjct: 880 IPYTNDDNGALEKFYPDFVFWFKKDNKYLVAFVDPKSTEFTSGYRKIDGF 929 >UniRef50_Q9ZJM1 Putative n=15 Tax=Helicobacter RepID=Q9ZJM1_HELPJ Length = 972 Score = 41.3 bits (95), Expect = 0.032, Method: Composition-based stats. 
Identities = 36/178 (20%), Positives = 69/178 (38%), Gaps = 23/178 (12%) Query: 29 KAWLAEYTPQIKSLKDERKEAYRQ-IVEMSTEPQDVDLVRPANKFEMTRVREGEKEADLP 87 K L + ++K KE R+ I + +P D + + F++ +A+L Sbjct: 754 KEKLIQTIQEVKEHAPLDKETLRKKIAQGEIDPYDTEKHKQDRTFKV-------GDAELL 806 Query: 88 VWKHHLL--------CDESGNYPALLNHWETKVFEI-----ETKREGFAFWYRNPQYTGQ 134 K H CD + + + E+ E ET +E + FW + Sbjct: 807 KLKEHYYTPLIKAKNCDWLKHVVKVKS--ESDFLEELLKITETLQENYDFWAFSKIDEHL 864 Query: 135 SSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEGLALYAEH 192 +L I Y++ + PDF+F+ ++ G ++ +DP D K + L+ + Sbjct: 865 DNLFIPYIDNATERRFFPDFIFWLQKGGTQIICFIDPKGSKHTDYEHKADAYQLFEDK 922 >UniRef50_A3YK84 Type III restriction enzyme R protein, putative n=4 Tax=Campylobacter jejuni RepID=A3YK84_CAMJE Length = 947 Score = 40.5 bits (93), Expect = 0.054, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 77/241 (31%), Gaps = 30/241 (12%) Query: 13 VMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKF 72 +Q D E + + IK + + K Y + + ++D Sbjct: 717 HRNIQAKLDFET-------VQKINKTIKDVLNA-KSEYELKADFENKKINLD-ELMQGIK 767 Query: 73 EMTRVREGEKEADLPVWKHHLLC--------DESGNYPALL-NHWETKVFE--IETKREG 121 E + +E + H D+ + N E + E + Sbjct: 768 ESQKSKEVQNYIISAKLSKHYYSPLIIYNKNDKENKINFAISNKSEKEFLEDLESNLKSS 827 Query: 122 FAF---WYRNPQYTGQSSLGIAY--VEAEQYKIVRPDFLFF--AEQDGKMVVDLVDPHSL 174 F WY + Q + I Y E ++ + PDF+F+ +Q G + +DP L Sbjct: 828 FFEQYEWYFSKLVENQDEIYIPYFDEEQQKERKFYPDFIFWLKNKQSGDFSIYFIDPKGL 887 Query: 175 HLADALP-KLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 + D KL+G E+ + Y + +V ++ + N E + Sbjct: 888 KIEDNPRFKLKGFKAIFENKNLTYE--DKNIKVNLFFYNKNINNTSEEIKDFVRSNIEDI 945 Query: 234 F 234 F Sbjct: 946 F 946 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A3QCD3 Type III restriction enzyme, res subunit n=8 Tax... 
303   3e-81
UniRef50_P77169 Uncharacterized protein yagJ n=4 Tax=Enterobacte...   298   1e-79
UniRef50_A4G305 Putative DNA or RNA helicase of superfamily II n...   255   9e-67
UniRef50_A4YMZ4 Putative uncharacterized protein n=1 Tax=Bradyrh...   253   3e-66
UniRef50_C5RJN1 Type III restriction protein res subunit n=1 Tax...   253   3e-66
UniRef50_C9KIN7 Type III restriction enzyme, res subunit superfa...   252   6e-66
UniRef50_B0TCY2 Type iii restriction enzyme, res subunit, putati...   245   1e-63
UniRef50_B8HHU8 Type III restriction protein res subunit n=1 Tax...   242   1e-62
UniRef50_D1PAZ7 Type III restriction enzyme, res subunit superfa...   226   4e-58
UniRef50_A1U8N1 Type III restriction enzyme, res subunit n=2 Tax...   149   1e-34

Sequences not found previously or not previously below threshold:

UniRef50_C9Y7A9 Putative uncharacterized protein n=1 Tax=Curviba...    80   6e-14
UniRef50_B1G5L4 DEAD-like helicase n=3 Tax=Proteobacteria RepID=...    67   4e-10
UniRef50_B2SK93 Putative uncharacterized protein n=13 Tax=Xantho...    52   2e-05
UniRef50_B1XQZ6 Type III restriction-modification enzyme, R/heli...    46   9e-04
UniRef50_A7HKD2 Type III restriction protein res subunit n=3 Tax...    45   0.002
UniRef50_D1P9K2 Putative uncharacterized protein n=1 Tax=Prevote...    43   0.008
UniRef50_A8IHD0 Putative uncharacterized protein n=2 Tax=Alphapr...    42   0.021
UniRef50_B8J8N7 Putative uncharacterized protein n=1 Tax=Anaerom...    41   0.026
UniRef50_Q0HGG8 Type III restriction enzyme, res subunit n=2 Tax...    41   0.028

>UniRef50_A3QCD3 Type III restriction enzyme, res subunit n=8 Tax=Bacteria RepID=A3QCD3_SHELP
          Length = 861

 Score = 303 bits (776), Expect = 3e-81, Method: Composition-based stats.
Identities = 160/242 (66%), Positives = 190/242 (78%) Query: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 +EARVTVAG+GLV EVQ YFD EAD+LAK WL +Y QIK+L D+RKE+YRQIVEMSTEP Sbjct: 619 VEARVTVAGLGLVTEVQAYFDAEADKLAKEWLVKYASQIKALSDDRKESYRQIVEMSTEP 678 Query: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 Q DLV+P ++ E + RE +KE P W +HLL D+ G YP +N WE V E E+KR+ Sbjct: 679 QSFDLVKPESRCEAAKARESDKEIKFPTWNNHLLSDKDGKYPVEMNEWERTVVEAESKRD 738 Query: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 GF FWYRNPQ GQSSLGIAY+E EQ+KIVRPDF+FFAEQD K+VVDLVDPH +HLADAL Sbjct: 739 GFLFWYRNPQQPGQSSLGIAYLEDEQFKIVRPDFIFFAEQDDKIVVDLVDPHGVHLADAL 798 Query: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240 PKL+GLA YA H++AYRRIE+VAE GKLRVLDL R DV+ AV A +A++LF AD Sbjct: 799 PKLQGLAAYATKHANAYRRIEAVAEAHGKLRVLDLTRTDVRQAVLDASSAKSLFEGLSAD 858 Query: 241 DY 242 DY Sbjct: 859 DY 860 >UniRef50_P77169 Uncharacterized protein yagJ n=4 Tax=Enterobacteriaceae RepID=YAGJ_ECOLI Length = 243 Score = 298 bits (762), Expect = 1e-79, Method: Composition-based stats. 
Identities = 243/243 (100%), Positives = 243/243 (100%) Query: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP Sbjct: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 Query: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE Sbjct: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 Query: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL Sbjct: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 Query: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD Sbjct: 181 PKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETLFSSGLAD 240 Query: 241 DYQ 243 DYQ Sbjct: 241 DYQ 243 >UniRef50_A4G305 Putative DNA or RNA helicase of superfamily II n=1 Tax=Herminiimonas arsenicoxydans RepID=A4G305_HERAR Length = 840 Score = 255 bits (651), Expect = 9e-67, Method: Composition-based stats. 
Identities = 57/231 (24%), Positives = 104/231 (45%), Gaps = 5/231 (2%) Query: 3 ARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQD 62 +R+ + M + + + L++ IK L +E Y ++ E++ EP+ Sbjct: 606 SRLELFLMLQDEKTWKTLEAACGERCETLLSQNKAAIKKLTTAEQEDYNKVQEVAKEPEA 665 Query: 63 VDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGF 122 + + P + K+++ + H+ D SG++ +LN+WE V E E R Sbjct: 666 LSFLPPVEMMLAVDI----KDSNFRNYDGHMYVDASGHFVDVLNNWEHPVIEAEIVRADV 721 Query: 123 AFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPK 182 W RN + + Y + + + PDFL D +VD+++PHS LAD+ K Sbjct: 722 VGWLRNVPRK-PWAFSLPYEFGGENRPMYPDFLVVRAVDDDHIVDILEPHSPALADSYAK 780 Query: 183 LEGLALYAEHHSDAYRRIESVAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 +GLA +A H+ + RIE + V +++ LDL + V ++ L Sbjct: 781 AKGLAQFAAKHAMHFGRIELIRVVGKEIKRLDLIDATNRKRVLAVDSNAGL 831 >UniRef50_A4YMZ4 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMZ4_BRASO Length = 824 Score = 253 bits (647), Expect = 3e-66, Method: Composition-based stats. 
Identities = 58/236 (24%), Positives = 99/236 (41%), Gaps = 14/236 (5%) Query: 3 ARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQD 62 A+V + + + A +W + +I L K ++ +V+ S + Sbjct: 598 AKVELFALIRRAGTLAQVEDVARSCFDSWWLKNKSKIAVLPASEKARFQLLVQASGKAVR 657 Query: 63 VDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGF 122 +L P E R WK+HL D +GN+ A +N WE E + F Sbjct: 658 QELELPLTIVEKPGAR---------TWKNHLFVDLAGNFAANMNSWEEDCLEWAAQSPDF 708 Query: 123 AFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPK 182 W RN + +L + Y + K PDFL + VVD+++PH D K Sbjct: 709 VCWLRNLPRR-EWALCVPYESGGE-KPFYPDFLIVRKSGTSFVVDVMEPHDDSRTDTWAK 766 Query: 183 LEGLALYAEHHSDAYRRIESVAEVKGKLRVLDL---KRQDVQDAVATAENAETLFS 235 ++GLA +A+ H A+ R+ + G L+ +D+ K + +A + + E+LF Sbjct: 767 VKGLASFADEHHLAFGRLMIGRKKNGALQFIDVSEAKTRAKARKLAASADLESLFE 822 >UniRef50_C5RJN1 Type III restriction protein res subunit n=1 Tax=Clostridium cellulovorans 743B RepID=C5RJN1_CLOCL Length = 961 Score = 253 bits (647), Expect = 3e-66, Method: Composition-based stats. 
Identities = 62/237 (26%), Positives = 98/237 (41%), Gaps = 15/237 (6%) Query: 4 RVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDER-KEAYRQIVEMSTEPQD 62 +V V E + A + +Y I ++ ++ + Y IV D Sbjct: 720 KVDVILFVADDECMNRLHNYAQKRFHGLNDDYRRYIATVDSDKIRRQYDNIV------SD 773 Query: 63 VDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPAL-LNHWETKVFEIETKREG 121 D+V N F + + E + +HL + + L LN WE V E E +R+ Sbjct: 774 GDVVSKHN-FRLPETIQVPHEDGGKKYMNHLFVNNTTGMAKLKLNTWEAGVIEEEERRDD 832 Query: 122 FAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDG-KMVVDLVDPHSLHLADAL 180 F W RNP +L I Y + K PDF+ + D V+DL++PH+ D L Sbjct: 833 FVCWIRNPSRA-SWALCIPYEIDGETKPTFPDFIVVRKDDRLGYVIDLLEPHNPDFKDNL 891 Query: 181 PKLEGLALYAEHHSDAYR----RIESVAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 K +G A YA + R R+ A K KL+ LD+ + ++D V+ + L Sbjct: 892 GKAKGFAEYARQNPGVGRIQLIRMSKDAAGKNKLKRLDMSKSSIRDKVSHTMTNDEL 948 >UniRef50_C9KIN7 Type III restriction enzyme, res subunit superfamily n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KIN7_9FIRM Length = 962 Score = 252 bits (644), Expect = 6e-66, Method: Composition-based stats. 
Identities = 64/236 (27%), Positives = 97/236 (41%), Gaps = 15/236 (6%) Query: 5 VTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDER-KEAYRQIVEMSTEPQDV 63 + V E + A +W EY I + E+ ++ Y IV D Sbjct: 723 IEVILFVADDECMNQLHNYAQDKFHSWNDEYRRYIARIDSEKIRKQYDSIV------SDG 776 Query: 64 DLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPAL-LNHWETKVFEIETKREGF 122 D V N F + + + ++ HL + +L LN WE+ V E E KR F Sbjct: 777 DKVSKHN-FRLPETIQVPHDIGGKPYRDHLFVNGDTGVASLKLNSWESGVIEEEEKRHDF 835 Query: 123 AFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQD-GKMVVDLVDPHSLHLADALP 181 W RNP G +L I Y E K PDF+ + + V+D+++PH+ D L Sbjct: 836 VCWIRNPSR-GSWALCIPYDEDGDTKPTYPDFIIVRKDPISEYVIDILEPHNPDFKDNLG 894 Query: 182 KLEGLALYAEHHSDAYR----RIESVAEVKGKLRVLDLKRQDVQDAVATAENAETL 233 K +G A YA + R R+ A K + LD+ + ++D V A + E L Sbjct: 895 KAKGFAEYARLNPGLGRIQLIRMSKDAAGHNKFKRLDMAKSAIRDKVLKAMSIEEL 950 >UniRef50_B0TCY2 Type iii restriction enzyme, res subunit, putative n=3 Tax=Firmicutes RepID=B0TCY2_HELMI Length = 835 Score = 245 bits (624), Expect = 1e-63, Method: Composition-based stats. 
Identities = 57/238 (23%), Positives = 94/238 (39%), Gaps = 14/238 (5%) Query: 1 MEARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEP 60 ++ + + + + + D A++ I L + RK Y +++ S +P Sbjct: 595 IDIKKEIIVLTSDTDAMERIDTYAEKEFINLYENNKRAIAKLNEARKIVYERLINASAQP 654 Query: 61 QDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKRE 120 V V P + + D + HL C E G + A LN WE+ V E Sbjct: 655 IAVPWVLPDSI-------DFSVSDDSIKLEQHLFCSEDGIFQATLNPWESGVV-AEELNN 706 Query: 121 GFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 G W RN + SL I Y + PD + + D+++PH D Sbjct: 707 GAVCWLRNLDRK-KWSLEIPYEVGGITTSMFPDLVIVRADAQGYIFDILEPHDPSRKDNY 765 Query: 181 PKLEGLALYAEHHSDAYRRIESVAEVKG-----KLRVLDLKRQDVQDAVATAENAETL 233 PK GLA +AE H D + RI+ + + +G LD+ + V++ V + E L Sbjct: 766 PKAVGLAKFAEKHWDVFGRIQLIRQKRGVDGRDHFYRLDMSKTPVRNRVRGITSNEEL 823 >UniRef50_B8HHU8 Type III restriction protein res subunit n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HHU8_ARTCA Length = 867 Score = 242 bits (616), Expect = 1e-62, Method: Composition-based stats. 
Identities = 71/246 (28%), Positives = 117/246 (47%), Gaps = 17/246 (6%) Query: 2 EARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQI-----VEM 56 EA V ++ + + EV + A L +W + P + L + +E I V + Sbjct: 614 EAVVRLSALSVHEEVIRTLESSASALIDSWRQQLNPAVSRLDTKDREELDAIWHPHGVPI 673 Query: 57 STEPQDVDLVRPANKFEMTRVREGEKEA-----DLPVWKHHLLCDESGNYPALLNHWETK 111 + E + + VR + ++ V++G ++ + + HL D G++P WE + Sbjct: 674 AGEFRLPEKVRTRTQ-KIAAVKDGAGKSVETIEAIEAFNGHLFADGQGDFPMAATGWERE 732 Query: 112 VFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDP 171 V E E + WYRNP G L ++Y E K + PDFLFF DG +VVD+VDP Sbjct: 733 VIEKELAQSSLIGWYRNP--AGAGGLSVSYTEGGVDKSLYPDFLFFHRVDGDIVVDIVDP 790 Query: 172 HSLHLADALPKLEGLALYAEHHSDAYRRIESVAEVK-GKLRVLDLKRQD---VQDAVATA 227 H+ L D K L+ +A H A+RR+ +V K G L+ L+L + +++ +A A Sbjct: 791 HNHSLGDTPGKWAALSRFARQHPGAFRRVTAVIRNKAGALKSLELTGRANTVLENKIAAA 850 Query: 228 ENAETL 233 E + Sbjct: 851 SGGEGI 856 >UniRef50_D1PAZ7 Type III restriction enzyme, res subunit superfamily n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAZ7_9BACT Length = 839 Score = 226 bits (576), Expect = 4e-58, Method: Composition-based stats. 
Identities = 61/237 (25%), Positives = 100/237 (42%), Gaps = 15/237 (6%) Query: 2 EARVTVAGMGLVMEVQDYFDGEADRLAKAWLAEYTPQIKSLKDERKEAYRQIVEMSTEPQ 61 E R+ + D + +Y +++ +E K Y +IV+ Sbjct: 604 ELRLQFILYANNQSCMERLDKYCKEAFYEYYDKYRQKLEDFGEEVKREYEKIVKAHVSTL 663 Query: 62 DVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREG 121 DL P + ++ EG D HL D G LN WE V ++E K+EG Sbjct: 664 PFDLTLP-DLMVTSKHPEGTAFTD------HLYVDGEGKAVFKLNEWEQAVLDVEQKKEG 716 Query: 122 FAFWYRN-PQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADAL 180 F W RN P G L I Y+ ++ + PDF+ D + L++PH AD++ Sbjct: 717 FVCWVRNVPNKNG--FLCIQYLNGDELRPHFPDFIVVRRVDEQFEFVLLEPHYTGYADSV 774 Query: 181 PKLEGLALYAEHHSDAYRRIESVA----EVKGKLRVLDLKRQDVQDAVATAENAETL 233 PKL+G+A Y+E S A +R E + K++ L+ V++ + + + L Sbjct: 775 PKLKGMAAYSERCS-AIKRNEMMRIVDIATGKKVQSLNAASSSVRNDIKHLMSQDDL 830 >UniRef50_A1U8N1 Type III restriction enzyme, res subunit n=2 Tax=Marinobacter aquaeolei VT8 RepID=A1U8N1_MARAV Length = 913 Score = 149 bits (375), Expect = 1e-34, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 52/156 (33%), Gaps = 12/156 (7%) Query: 60 PQDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKR 119 P D + + + RE ++ V + LN E + + Sbjct: 743 PPSTDDLATVEQTLVIEDRELLRDTQWQVGNER-FMTGRFDSTFSLNSDERAFADALDQA 801 Query: 120 EGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLF-FAEQDGKMVVDLVDPHSLHLAD 178 A+W+RNP S + V E PDF+ DG+ + + + D Sbjct: 802 SFVAWWFRNPDRK---SYSVQLVRGEHRNYFFPDFVVCVEHVDGQTPMARLIETKHDVKD 858 Query: 179 ALPKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLD 214 A K + H Y ++ + GKL V++ Sbjct: 859 ARRKAK-------HIPQHYGKVLFLTRDSGKLHVVN 887 >UniRef50_C9Y7A9 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9Y7A9_9BURK Length = 520 Score = 80.2 bits (196), Expect = 6e-14, Method: Composition-based stats. 
Identities = 25/156 (16%), Positives = 46/156 (29%), Gaps = 11/156 (7%) Query: 60 PQDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKR 119 P + + R + + L + LN+ E Sbjct: 350 PPSKEDSQDVETVLFMDERHWWIDQVFSLEDGSQLSVGRYDGAVKLNNLERDFARALDSA 409 Query: 120 EGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLF-FAEQDGKMVVDLVDPHSLHLAD 178 + +W+RNP + V AE PDF+ G + + + D Sbjct: 410 DFVHWWHRNPDKKP---YAVRVVRAEHEHYFYPDFVVCVEHAPGDVPMQRLLETKESTKD 466 Query: 179 ALPKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLD 214 A K A H A+ ++ + +LR ++ Sbjct: 467 AARK-------ARHWPAAFGKVLFLTPDGNRLRWVN 495 >UniRef50_B1G5L4 DEAD-like helicase n=3 Tax=Proteobacteria RepID=B1G5L4_9BURK Length = 923 Score = 67.1 bits (162), Expect = 4e-10, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 45/156 (28%), Gaps = 12/156 (7%) Query: 60 PQDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKR 119 P + + + R+ + + + N E Sbjct: 746 PPTREAGERVPREVLMDDRQWLVDKTYALSDGE-FSQGHFDGTWFGNSLEDSFSRALDSA 804 Query: 120 EGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLF-FAEQDGKMVVDLVDPHSLHLAD 178 E +W+RNP+ + V AE PDF+ + + D Sbjct: 805 EYVVWWHRNPRNKR---FAVRVVRAEHDNYFYPDFVVCVKHNPADGPMPRLLETKDDTKD 861 Query: 179 ALPKLEGLALYAEHHSDAYRRIESVAEVKGKLRVLD 214 A K ++H D Y ++ + ++R ++ Sbjct: 862 AARK-------SQHSPDFYGKVLFLTPDGDRMRWVN 890 >UniRef50_B2SK93 Putative uncharacterized protein n=13 Tax=Xanthomonadaceae RepID=B2SK93_XANOP Length = 833 Score = 51.7 bits (122), Expect = 2e-05, Method: Composition-based stats. 
Identities = 33/204 (16%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 17 QDYFDGEADRLAKAWLAEYTPQI----KSLKDERKEAYRQI--VEMSTEPQDVDLVRPAN 70 + L + L + I + D+ + RQ+ + + D + Sbjct: 616 LRLVEANDRELYRRLLERFVRAIEGSGAEVPDDEELQMRQLDLLLVRRPGLLGDAFKSLR 675 Query: 71 KFEMTRV-----REGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGFAFW 125 + ++ V E + L L G +P LN E + E +W Sbjct: 676 QCQVLDVDVLLPAELLSDQPLRSANRGLY----GVFPPGLNQDELAIAERLDASTQVLWW 731 Query: 126 YRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVDLVDPHSLHLADALPKLEG 185 +RN +G + ++ PDF+ + + L++ HL K Sbjct: 732 HRNQPKSG-----VGLYRWDEGDGFYPDFVVSIAERSAPGIALLELKGDHL---WGKPSE 783 Query: 186 LALYAEHHSDAYRRIESVAEVKGK 209 + A +H + Y + V +G+ Sbjct: 784 VDKSAANHRE-YGAVFMVGRKRGE 806 >UniRef50_B1XQZ6 Type III restriction-modification enzyme, R/helicase subunit n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XQZ6_SYNP2 Length = 1039 Score = 46.3 bits (108), Expect = 9e-04, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 52/162 (32%), Gaps = 16/162 (9%) Query: 50 YRQIVEMSTEPQDVDLVRPANKFEMTRVREGEKEADLPVWKHHLLCDESGNYPALLNHWE 109 Y+ I +E ++RP T + + PV+ + A + WE Sbjct: 852 YQAIAAGESEKFLKPILRPYETIGSTDGVDF--DTSRPVYVSDPEKCHISHVVADTDSWE 909 Query: 110 TKVFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLFFAEQDGKMVVD-- 167 K+ ++ + +N I Y A Q K PDF+ A D D Sbjct: 910 QKMAQVLESMAEVVCYVKN----QGLGFFIPYTMAGQSKNYMPDFI--ARVDDGHGEDDL 963 Query: 168 ---LVDPHSLHLADALPKLEGLALY---AEHHSDAYRRIESV 203 +V+ D K++ + A + R E + Sbjct: 964 LNLIVEVSGEARRDKAIKVQTARNFWLPAVNGHGGLGRWEFI 1005 >UniRef50_A7HKD2 Type III restriction protein res subunit n=3 Tax=Thermotogaceae RepID=A7HKD2_FERNB Length = 989 Score = 45.2 bits (105), Expect = 0.002, Method: Composition-based stats. 
Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 8/99 (8%) Query: 109 ETKVFEIETKREG----FAFWYRNPQYTGQSSLGIAYVEA--EQYKIVRPDFLFFAEQDG 162 E + ++ F +W + + I Y + PDF+F+ ++D Sbjct: 846 EIDFIDELKRKSKHFERFDWWLFSKVIQKVDEIYIPYTNDDNGALEKFYPDFVFWFKKDN 905 Query: 163 KMVVDLVDPHSLHLADALPKLEGLAL--YAEHHSDAYRR 199 K +V VDP S K++G Y + + Sbjct: 906 KYLVAFVDPKSTEFTSGYRKIDGFIRLFYENDKPKIFEK 944 >UniRef50_D1P9K2 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1P9K2_9BACT Length = 962 Score = 43.2 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 37/98 (37%), Gaps = 4/98 (4%) Query: 109 ETKVFEIETKREGFAFWYRNPQYTGQSSLGIAY--VEAEQYKIVRPDFLFFAEQDGKMVV 166 E ++ ++G +W++N L I Y E + + PD++F + + Sbjct: 660 EESFYKFLEGQDGIEWWFKNGDSGKDW-LAIRYFSEERNEEALFYPDWIFKKKDGTIGIF 718 Query: 167 DLVDPHSLHLADALPKLEGL-ALYAEHHSDAYRRIESV 203 D + D K E L + + A RI+ V Sbjct: 719 DTKGGQTAASKDTKNKAEALQKRLSMLNRLAEGRIKYV 756 >UniRef50_A8IHD0 Putative uncharacterized protein n=2 Tax=Alphaproteobacteria RepID=A8IHD0_AZOC5 Length = 863 Score = 41.7 bits (96), Expect = 0.021, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 35/95 (36%), Gaps = 9/95 (9%) Query: 101 YPALLNHWETKVFEIETKREGFAFWYRNPQYTGQSSLGIAYVEAEQYKIVRPDFLF-FAE 159 Y LN E V + A+W+RN T GI + + PDF+F Sbjct: 736 YENELNSDERDVAVYLDGEKTLAWWHRNVARTQ---YGIQ---GWKKAKIYPDFIFAVQR 789 Query: 160 QDGKMVVDLVDPHSLHLA--DALPKLEGLALYAEH 192 + +++ L D K E LA ++H Sbjct: 790 DGEAKRITVLETKGDQLDNLDTAYKREALAFLSDH 824 >UniRef50_B8J8N7 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter dehalogenans 2CP-1 RepID=B8J8N7_ANAD2 Length = 767 Score = 41.3 bits (95), Expect = 0.026, Method: Composition-based stats. 
Identities = 24/139 (17%), Positives = 37/139 (26%), Gaps = 16/139 (11%) Query: 21 DGEADRLAKAW---LAEYTPQIKSLKDERKEAYRQIVEMSTEPQDVDLVRPANKFEMTRV 77 D A R + + +L AY Q VE + + Sbjct: 556 DSYAGRSFHQASCPRSAAQEALATLAGAVATAYEQSVEYQKNEIAGE----RSWHVAPHR 611 Query: 78 REGEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGFAFWYRNPQYTGQ-SS 136 G H +S N E + + G W RNP G+ + Sbjct: 612 PSGADLLTFRHATHASYSRKSF------NKDEREFADALD-SFGKGTWCRNPSTPGEGWA 664 Query: 137 LGIAYVEAEQYKIVRPDFL 155 L + + PD+L Sbjct: 665 LPLPSKVGDSLN-FFPDYL 682 >UniRef50_Q0HGG8 Type III restriction enzyme, res subunit n=2 Tax=Proteobacteria RepID=Q0HGG8_SHESM Length = 885 Score = 41.3 bits (95), Expect = 0.028, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 72/218 (33%), Gaps = 21/218 (9%) Query: 26 RLAKAWLAEYTPQIKSLKDER-KEAYRQ----IVEMSTEPQDVDLVRPANK-FEMTRVRE 79 L K + Q+ L ++ + + + ++++ + ++ ++ + Sbjct: 676 ELLKEMKRDIKQQVAQLSEQIFRSKLDKGDISLRLLASDNEKLNWELAQTLEVNVSEHDQ 735 Query: 80 GEKEADLPVWKHHLLCDESGNYPALLNHWETKVFEIETKREGFAFWYRNPQYTGQSSLGI 139 + D + L Y LN+ E K+E +W+R + SL Sbjct: 736 VLRRKDSSELEKSLF---EKVYQNGLNNLERDTAWYLDKQESVYWWHRIAVNQREYSL-- 790 Query: 140 AYVEAEQYKIVRPDFLFFAEQD--GKMVVDLVDPHSLHLA---DALPKLEGLALYAEHHS 194 + Q + V PD L E+ G +++ HL D K L+ EH Sbjct: 791 ---QGWQKQKVYPDLLVCVEKPNSGSYRFSVLETKGEHLKGNDDTEYKRRLFELFTEHVK 847 Query: 195 DAY--RRIESVAEVKGKLRVLDLKRQDVQDAVATAENA 230 A ++ A G + ++ Q+ V Sbjct: 848 TAVDAGELKLEAASGGMSFRMLMEDSWSQEIVPELVTN 885 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.123 0.305 Lambda K H 0.267 0.0382 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,074,014,264 Number of Sequences: 3077464 Number of extensions: 36015342 Number of successful extensions: 111218 Number of sequences better than 1.0e-01: 20 Number of HSP's better than 0.1 without gapping: 28 Number of HSP's successfully gapped in prelim test: 16 
Number of HSP's that attempted gapping in prelim test: 111123
Number of HSP's gapped (non-prelim): 45
length of query: 243
length of database: 1,040,396,356
effective HSP length: 125
effective length of query: 118
effective length of database: 655,713,356
effective search space: 77374176008
effective search space used: 77374176008
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 91 (39.8 bits)