BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (216 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P05050 Alpha-ketoglutarate-dependent dioxygenase alkB n... 451 e-126 UniRef50_Q8Z566 AlkB protein n=6 Tax=Salmonella enterica subsp. ... 363 1e-99 UniRef50_UPI000197C9F9 alpha-ketoglutarate-dependent dioxygenase... 228 1e-58 UniRef50_Q5QTX8 Alkylated DNA repair protein n=1 Tax=Idiomarina ... 224 2e-57 UniRef50_D0IWA8 2OG-Fe(II) oxygenase n=4 Tax=Proteobacteria RepI... 221 2e-56 UniRef50_C7JEB8 DNA repair protein for alkylated DNA n=8 Tax=Ace... 219 4e-56 UniRef50_A3WP94 Alkylated DNA repair protein n=1 Tax=Idiomarina ... 199 4e-50 UniRef50_Q1N5M7 2OG-Fe(II) oxygenase superfamily protein n=1 Tax... 176 6e-43 UniRef50_D0LXU5 2OG-Fe(II) oxygenase n=1 Tax=Haliangium ochraceu... 145 7e-34 UniRef50_B8GWW6 Alpha-ketoglutarate-dependent dioxygenase alkB h... 144 2e-33 UniRef50_B0T136 2OG-Fe(II) oxygenase n=1 Tax=Caulobacter sp. K31... 128 2e-28 UniRef50_Q28VY2 DNA-N1-methyladenine dioxygenase n=30 Tax=Bacter... 125 9e-28 UniRef50_C9CYS1 Putative uncharacterized protein n=2 Tax=Alphapr... 125 1e-27 UniRef50_A7HZ41 2OG-Fe(II) oxygenase n=1 Tax=Parvibaculum lavame... 120 3e-26 UniRef50_A3VR77 Alkylated DNA repair protein n=1 Tax=Parvularcul... 117 3e-25 UniRef50_D2B1L1 Alkylated DNA repair protein n=3 Tax=Actinomycet... 89 1e-16 UniRef50_C7QZR3 2OG-Fe(II) oxygenase n=29 Tax=Actinomycetales Re... 86 7e-16 UniRef50_Q7KUZ2 AlkB n=12 Tax=Drosophila RepID=Q7KUZ2_DROME 85 2e-15 UniRef50_C0Z2F3 AT5G01780 protein n=9 Tax=Magnoliophyta RepID=C0... 84 4e-15 UniRef50_Q9LZW8 Putative uncharacterized protein T20L15_50 n=3 T... 82 1e-14 UniRef50_Q9LJH2 Similarity to unknown protein n=7 Tax=Embryophyt... 82 1e-14 UniRef50_Q17GQ0 Putative uncharacterized protein (Fragment) n=2 ... 82 2e-14 UniRef50_Q7QEU7 AGAP000155-PA n=1 Tax=Anopheles gambiae RepID=Q7... 81 2e-14 UniRef50_C6TKW1 Putative uncharacterized protein n=1 Tax=Glycine... 81 3e-14 UniRef50_UPI0001983B96 PREDICTED: hypothetical protein n=1 Tax=V... 78 2e-13 UniRef50_UPI00005257D6 PREDICTED: similar to AlkB CG33250-PA n=1... 78 2e-13 UniRef50_D1HAA5 Whole genome shotgun sequence of line PN40024, s... 78 2e-13 UniRef50_B3RXF0 Putative uncharacterized protein (Fragment) n=1 ... 78 2e-13 UniRef50_UPI000051A07C PREDICTED: similar to AlkB CG33250-PA n=3... 77 5e-13 UniRef50_B9GZQ0 Predicted protein n=13 Tax=Magnoliophyta RepID=B... 76 6e-13 UniRef50_Q9SA98 Alkylated DNA repair protein alkB homolog n=4 Ta... 76 7e-13 UniRef50_C7NLG8 Alkylated DNA repair protein n=2 Tax=Actinomycet... 76 7e-13 UniRef50_A9TG90 Predicted protein n=1 Tax=Physcomitrella patens ... 75 1e-12 UniRef50_UPI0001BCD579 alkylated DNA repair protein AlkB n=1 Tax... 75 1e-12 UniRef50_C9YWY9 Putative DNA repair protein n=1 Tax=Streptomyces... 75 2e-12 UniRef50_D2A2Y9 Putative uncharacterized protein GLEAN_07602 n=1... 74 2e-12 UniRef50_B7PUG0 Putative uncharacterized protein n=1 Tax=Ixodes ... 74 3e-12 UniRef50_D2V2M0 Predicted protein n=1 Tax=Naegleria gruberi RepI... 74 3e-12 UniRef50_D1IRG0 Whole genome shotgun sequence of line PN40024, s... 73 8e-12 UniRef50_UPI0000E47318 PREDICTED: similar to LOC494680 protein n... 72 1e-11 UniRef50_C0WHI3 Alkylated DNA repair protein n=3 Tax=Corynebacte... 72 2e-11 UniRef50_C0PA85 Putative uncharacterized protein n=1 Tax=Zea may... 71 3e-11 UniRef50_Q13686 Alkylated DNA repair protein alkB homolog 1 n=27... 71 3e-11 UniRef50_C2BL13 Alkylated DNA repair protein n=2 Tax=Corynebacte... 70 3e-11 UniRef50_C8NRG2 DNA repair protein n=9 Tax=Actinomycetales RepID... 70 4e-11 UniRef50_B5Y3R7 Predicted protein n=1 Tax=Phaeodactylum tricornu... 70 4e-11 UniRef50_C3XQU3 Putative uncharacterized protein n=1 Tax=Branchi... 70 6e-11 UniRef50_B6JWW7 AlkB-like protein n=1 Tax=Schizosaccharomyces ja... 69 8e-11 UniRef50_A7RXQ8 Predicted protein (Fragment) n=1 Tax=Nematostell... 69 9e-11 UniRef50_A8J903 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 69 1e-10 UniRef50_Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB n... 69 2e-10 UniRef50_O60066 Alkylated DNA repair protein alkB homolog n=2 Ta... 69 2e-10 UniRef50_Q6C333 YALI0F03003p n=1 Tax=Yarrowia lipolytica RepID=Q... 67 4e-10 UniRef50_Q010W8 Oxidoreductase, 2OG-Fe (ISS) n=1 Tax=Ostreococcu... 67 4e-10 UniRef50_C1FFM9 Predicted protein (Fragment) n=2 Tax=Micromonas ... 65 2e-09 UniRef50_C1EB25 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 63 8e-09 UniRef50_A4H8G0 Putative uncharacterized protein n=3 Tax=Leishma... 63 8e-09 UniRef50_Q4D9X3 Alkylated DNA repair protein, putative n=3 Tax=T... 62 1e-08 UniRef50_D0NYX8 Alkylated DNA repair protein alkB-like protein n... 62 2e-08 UniRef50_UPI000186D7D6 conserved hypothetical protein n=1 Tax=Pe... 60 4e-08 UniRef50_D0N998 Putative uncharacterized protein n=1 Tax=Phytoph... 60 5e-08 UniRef50_Q5K7S3 Putative uncharacterized protein n=1 Tax=Filobas... 60 7e-08 UniRef50_B6KBE4 Putative uncharacterized protein n=3 Tax=Toxopla... 59 2e-07 UniRef50_A4S344 Predicted protein n=2 Tax=Mamiellales RepID=A4S3... 58 2e-07 UniRef50_A0C122 Chromosome undetermined scaffold_140, whole geno... 58 3e-07 UniRef50_UPI00019271E1 PREDICTED: similar to predicted protein n... 58 3e-07 UniRef50_A7AWB3 Putative uncharacterized protein n=1 Tax=Babesia... 57 6e-07 UniRef50_Q9LJH4 Emb|CAB82748.1 n=1 Tax=Arabidopsis thaliana RepI... 55 1e-06 UniRef50_Q22MH4 Putative uncharacterized protein n=1 Tax=Tetrahy... 55 2e-06 UniRef50_A9TLH2 Predicted protein n=1 Tax=Physcomitrella patens ... 55 2e-06 UniRef50_C4Q8H0 Expressed protein n=2 Tax=Schistosoma RepID=C4Q8... 52 1e-05 UniRef50_B1XR40 Oxidoreductase, 2OG-Fe(II) oxygenase family n=4 ... 52 1e-05 UniRef50_A8PV44 ALKBH protein, putative n=1 Tax=Brugia malayi Re... 52 1e-05 UniRef50_Q0U5B3 Putative uncharacterized protein n=1 Tax=Phaeosp... 51 3e-05 UniRef50_C5L3Y2 Putative uncharacterized protein n=1 Tax=Perkins... 50 4e-05 UniRef50_UPI000175883A PREDICTED: similar to alkB, alkylation re... 50 4e-05 UniRef50_B8ESE9 2OG-Fe(II) oxygenase n=1 Tax=Methylocella silves... 49 1e-04 UniRef50_B2SPH7 DNA repair system specific for alkylated DNA n=1... 49 1e-04 UniRef50_A1SVH4 DNA-N1-methyladenine dioxygenase n=12 Tax=Bacter... 49 1e-04 UniRef50_Q4UFZ4 Alkylated DNA repair protein, putative n=2 Tax=T... 49 1e-04 UniRef50_UPI000180CD20 PREDICTED: similar to alkB, alkylation re... 48 2e-04 UniRef50_UPI0000E484FA PREDICTED: hypothetical protein n=1 Tax=S... 48 2e-04 UniRef50_C5KBY7 Putative uncharacterized protein n=1 Tax=Perkins... 48 2e-04 UniRef50_Q09BP3 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 47 4e-04 UniRef50_D2UYR0 Predicted protein n=1 Tax=Naegleria gruberi RepI... 47 4e-04 UniRef50_B8J7A0 2OG-Fe(II) oxygenase n=3 Tax=Anaeromyxobacter Re... 46 0.001 UniRef50_Q8T9A3 SD10403p n=17 Tax=Coelomata RepID=Q8T9A3_DROME 46 0.001 UniRef50_A4SZF3 DNA-N1-methyladenine dioxygenase n=1 Tax=Polynuc... 46 0.001 UniRef50_D2A2C2 Putative uncharacterized protein GLEAN_07671 n=1... 45 0.001 UniRef50_A5WBM5 DNA-N1-methyladenine dioxygenase n=5 Tax=Moraxel... 45 0.001 UniRef50_C6XT27 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter heparinu... 45 0.002 UniRef50_Q26EI7 Alkylated DNA repair protein n=1 Tax=Flavobacter... 44 0.003 UniRef50_B0SGN3 Alkylated DNA repair protein n=2 Tax=Leptospira ... 44 0.003 UniRef50_A4CQ67 Alkylated DNA repair protein n=2 Tax=Flavobacter... 44 0.003 UniRef50_D2XAQ5 Alkylated DNA repair protein n=1 Tax=Marseillevi... 44 0.004 UniRef50_B6GZZ6 Pc12g09870 protein n=4 Tax=Eurotiomycetidae RepI... 44 0.005 UniRef50_UPI00006CC0FF hypothetical protein TTHERM_00219000 n=1 ... 44 0.005 UniRef50_Q6C9X6 YALI0D07546p n=1 Tax=Yarrowia lipolytica RepID=Q... 43 0.006 UniRef50_C7RA32 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis... 42 0.013 UniRef50_Q07GB6 Oxidoreductase, putative n=1 Tax=Roseobacter den... 42 0.014 UniRef50_Q2MF23 TobX protein n=2 Tax=Actinomycetales RepID=Q2MF2... 42 0.017 UniRef50_C5CMR5 2OG-Fe(II) oxygenase n=1 Tax=Variovorax paradoxu... 41 0.021 UniRef50_B2AYU6 Predicted CDS Pa_1_12280 (Fragment) n=5 Tax=Leot... 40 0.039 UniRef50_A4C6S7 Putative 2OG-Fe(II) oxygenase superfamily protei... 40 0.045 UniRef50_A5GWW3 Alkylated DNA repair protein n=2 Tax=Synechococc... 40 0.074 >UniRef50_P05050 Alpha-ketoglutarate-dependent dioxygenase alkB n=232 Tax=cellular organisms RepID=ALKB_ECOLI Length = 216 Score = 451 bits (1161), Expect = e-126, Method: Compositional matrix adjust. Identities = 216/216 (100%), Positives = 216/216 (100%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA Sbjct: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN Sbjct: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG Sbjct: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE Sbjct: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 >UniRef50_Q8Z566 AlkB protein n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=Q8Z566_SALTI Length = 216 Score = 363 bits (933), Expect = 1e-99, Method: Compositional matrix adjust. Identities = 172/216 (79%), Positives = 192/216 (88%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 MLDLFAD PWQEPLA GAV+LRRFAF AA+ L+ DI VASQSPFRQMVTPGGYTMSVA Sbjct: 1 MLDLFADEAPWQEPLAPGAVVLRRFAFRAAQSLLDDIGFVASQSPFRQMVTPGGYTMSVA 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 MTNCG LGWTT GY Y+ DP T+KPWPA+P SF ++C++AA AAGY FQPDACLIN Sbjct: 61 MTNCGALGWTTDGHGYCYAVRDPLTDKPWPALPLSFASVCRQAAIAAGYASFQPDACLIN 120 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RYAPGAKLSLHQDKDEPDLRAPIVSVSLG+PA+FQFGGL+R+DPL+R+LLEHGD+VVWGG Sbjct: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGVPAVFQFGGLRRSDPLQRILLEHGDIVVWGG 180 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 ESRLFYHGIQPLKAGFHP+T + RYNLTFRQA +KE Sbjct: 181 ESRLFYHGIQPLKAGFHPMTGEFRYNLTFRQAAEKE 216 >UniRef50_UPI000197C9F9 alpha-ketoglutarate-dependent dioxygenase alkB n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C9F9 Length = 214 Score = 228 bits (580), Expect = 1e-58, Method: Compositional matrix adjust. Identities = 109/198 (55%), Positives = 136/198 (68%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 +A A +L+ F ++ L++ +++V + +P R M TP GY MS AMTNCG GW T ++ Sbjct: 15 IAPEAFLLKGFLLGQSDALLQSLSNVITANPLRHMATPNGYQMSAAMTNCGDWGWVTDKK 74 Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 GY YS DP TN+PW MP SF L AA+ AG+ F PDACLINRYA GA +SLHQDK Sbjct: 75 GYRYSQRDPVTNQPWQPMPISFVQLATSAASTAGFEHFIPDACLINRYAVGAAMSLHQDK 134 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKA 194 DE D PIVS SLGLP IF FGG R+ P + LEHGDV+VWGG SRL YHG++ +K+ Sbjct: 135 DEADFTHPIVSFSLGLPTIFDFGGATRDAPKIAVYLEHGDVLVWGGRSRLNYHGVRRIKS 194 Query: 195 GFHPLTIDCRYNLTFRQA 212 G HPL RYNLTFR++ Sbjct: 195 GVHPLLGPYRYNLTFRRS 212 >UniRef50_Q5QTX8 Alkylated DNA repair protein n=1 Tax=Idiomarina loihiensis RepID=Q5QTX8_IDILO Length = 217 Score = 224 bits (570), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 105/214 (49%), Positives = 135/214 (63%), Gaps = 2/214 (0%) Query: 2 LDLFAD--AEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSV 59 LDLFA+ +EP ++ A + +F E L+ DI V QSP R + TP G+ MSV Sbjct: 4 LDLFANDGSEPLSTEISEQATLFHQFLLADDEALLNDIRGVLKQSPLRHLATPAGHKMSV 63 Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 ++CG GW + + GY Y IDP T +PWP +PQS + + AG+ +FQPD+CLI Sbjct: 64 KSSSCGSYGWLSDKHGYRYQNIDPVTGQPWPDIPQSILVKATQVSRLAGFQNFQPDSCLI 123 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y PGAK+ LHQDK+E D PIVS S GLP F +GG KR+D ++ L+H D +VWG Sbjct: 124 NVYTPGAKMGLHQDKNEADFSKPIVSFSFGLPITFMWGGFKRSDKYQKFSLQHADALVWG 183 Query: 180 GESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 G+ RL YHG+Q LK HPLT CR NLT RQAG Sbjct: 184 GKDRLRYHGVQQLKEAMHPLTGRCRVNLTIRQAG 217 >UniRef50_D0IWA8 2OG-Fe(II) oxygenase n=4 Tax=Proteobacteria RepID=D0IWA8_COMTE Length = 224 Score = 221 bits (562), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 116/215 (53%), Positives = 144/215 (66%), Gaps = 2/215 (0%) Query: 2 LDLF-ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 L LF AD+ P E + GAV+LR FA ++ + ++ + + + FR M PGG MSVA Sbjct: 3 LSLFPADSLP-AEIIDDGAVLLRGFAAAEEQRWVAEVTALQTGAAFRTMQVPGGKFMSVA 61 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 +TN G GW + QGY YS +DPQT KPWPA+P RAA AGYP F PDACLIN Sbjct: 62 ITNAGGWGWISDLQGYRYSAVDPQTGKPWPAIPAFLGEQAARAAALAGYPGFAPDACLIN 121 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RY PGA++ LH+D+DE D APIVSVSLGLP F +GGL R P +RL L HGDV+VWGG Sbjct: 122 RYQPGARMGLHRDQDEHDFAAPIVSVSLGLPCRFLWGGLTRQSPTRRLALTHGDVLVWGG 181 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 SRL +HG+ PL+ G HPL + R+NLTFR A + Sbjct: 182 PSRLVFHGVAPLREGQHPLLGNERWNLTFRMAKAR 216 >UniRef50_C7JEB8 DNA repair protein for alkylated DNA n=8 Tax=Acetobacter pasteurianus RepID=C7JEB8_ACEP3 Length = 222 Score = 219 bits (559), Expect = 4e-56, Method: Compositional matrix adjust. Identities = 106/207 (51%), Positives = 141/207 (68%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 L D P L AGAV+L FA + AE + I+ +A Q+PFR+M TPGG MSVAMT Sbjct: 12 LLPDTRPDYVQLDAGAVLLPGFALHDAEACMLAIHHIAQQAPFRKMHTPGGGQMSVAMTC 71 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 CG GW + QGY Y+ ++P T +PWP MP F L +AA AG+ FQP+ACLIN Y+ Sbjct: 72 CGTFGWISTAQGYSYTKVNPFTGQPWPDMPAIFQALAHKAAQKAGFAQFQPNACLINSYS 131 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 PGA++ LHQD+DE P+VS+S GL A F +GGLKR+DP +++LL+ GDV+VWGG R Sbjct: 132 PGARMGLHQDRDEGCTDQPVVSLSFGLEATFLWGGLKRSDPTRQILLKDGDVLVWGGPDR 191 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 L +HG++P+ +G H T + R N+TFR Sbjct: 192 LRFHGVKPIHSGAHIRTGETRLNITFR 218 >UniRef50_A3WP94 Alkylated DNA repair protein n=1 Tax=Idiomarina baltica OS145 RepID=A3WP94_9GAMM Length = 209 Score = 199 bits (507), Expect = 4e-50, Method: Compositional matrix adjust. Identities = 98/198 (49%), Positives = 121/198 (61%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 A GA +L A A ++ I + Q+P R +TPG MSV +NCG GW + + Sbjct: 12 FAPGAWLLPNHASEQAADILAAIRECVRQAPLRHFMTPGNKPMSVLSSNCGDFGWVSDSK 71 Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 GY Y DP+++KPWP +P N + A AGYP+F P+ACLIN Y PGAK+ LHQD+ Sbjct: 72 GYRYQATDPKSDKPWPDIPSILLNDATQVAEQAGYPEFLPNACLINVYKPGAKMGLHQDR 131 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKA 194 DE DL P+VS S GLPA F + G R +RL L HGDV+VWGG SRL YHGI L Sbjct: 132 DESDLNEPVVSYSFGLPARFIWAGQTRTGTKQRLPLNHGDVLVWGGPSRLNYHGIDKLVE 191 Query: 195 GFHPLTIDCRYNLTFRQA 212 G HPLT R NLT R+A Sbjct: 192 GTHPLTQQTRVNLTLRKA 209 >UniRef50_Q1N5M7 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=Bermanella marisrubri RepID=Q1N5M7_9GAMM Length = 212 Score = 176 bits (445), Expect = 6e-43, Method: Compositional matrix adjust. Identities = 91/212 (42%), Positives = 119/212 (56%), Gaps = 2/212 (0%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 M DLFA E E L G + + E + I++VA Q+PFR M+TP G+ M VA Sbjct: 1 MSDLFASNE--VEILDHGQGLFQIRNLVNTEATMAAIHEVAKQAPFRHMMTPMGHHMKVA 58 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 TNCG GW GY YS DP++ + WPAMP + + P + PDACLIN Sbjct: 59 TTNCGEYGWIAQPSGYGYSRNDPESGQSWPAMPDTIRTISDDVIAHLNLPKYSPDACLIN 118 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RY G + HQDKDE + PI+SVSLGLPAIFQ G KR + GDV + G Sbjct: 119 RYDIGTSMGRHQDKDEANFDYPIISVSLGLPAIFQVVGPKRQGKATYYSVSDGDVFILSG 178 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 ++RL+YHG+ +KA + + RYNLT R++ Sbjct: 179 QARLYYHGVNTVKANPNQPELQQRYNLTLRRS 210 >UniRef50_D0LXU5 2OG-Fe(II) oxygenase n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LXU5_HALO1 Length = 219 Score = 145 bits (367), Expect = 7e-34, Method: Compositional matrix adjust. Identities = 85/212 (40%), Positives = 123/212 (58%), Gaps = 10/212 (4%) Query: 3 DLFADAEPWQEPLAAGAV-ILRRFAFNAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVA 60 +LF + P PL G + I+ +A L+ + V +++P +R + G +SV Sbjct: 7 ELFPEQAP---PLPEGFLHIVAALDLDAQGALLEQVRAVLAEAPAYRPSMPRTGAPLSVR 63 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 M+NCG LGW + R GY Y P+ P T + WPA+P R A +P +P+ACL+N Sbjct: 64 MSNCGTLGWISDRAGYRYEPLHPHTARRWPAIPPLAMAQWNRFAD---WP-VRPEACLVN 119 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y G++L +H D+DE AP+VS+SLG A+++ GG RN P +RLLL GDVVV GG Sbjct: 120 LYQTGSRLGMHVDQDERAADAPVVSISLGCDAVYRLGGHTRNLPSQRLLLRSGDVVVLGG 179 Query: 181 ESRLFYHGIQPLKAGFHPL-TIDCRYNLTFRQ 211 +R YHG+ + AG PL ++ R NLT R+ Sbjct: 180 AARRCYHGVDRIVAGTSPLPELEARINLTLRR 211 >UniRef50_B8GWW6 Alpha-ketoglutarate-dependent dioxygenase alkB homolog n=43 Tax=Alphaproteobacteria RepID=ALKB_CAUCN Length = 220 Score = 144 bits (364), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 79/187 (42%), Positives = 107/187 (57%), Gaps = 5/187 (2%) Query: 27 FNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTN 86 +A L+ + A Q+PF T G MSVAMT G LGWT+ +GY Y P+T Sbjct: 35 ISAQRALVEAVLAGAEQAPFSNYRTAYGKPMSVAMTALGSLGWTSDARGYRYVDRHPETG 94 Query: 87 KPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSV 146 +PWP MP + +L T G P+ PD+CL+N Y GA++ LHQD+DE D R P++S+ Sbjct: 95 RPWPDMPPALLDLW----TVLGDPETPPDSCLVNLYRDGARMGLHQDRDEADPRFPVLSI 150 Query: 147 SLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI-DCRY 205 SLG A+F+ GG+ R DP + L L GDV G +RL +HG+ + G L R Sbjct: 151 SLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPGSSSLVPGGGRI 210 Query: 206 NLTFRQA 212 NLT R+A Sbjct: 211 NLTLRRA 217 >UniRef50_B0T136 2OG-Fe(II) oxygenase n=1 Tax=Caulobacter sp. K31 RepID=B0T136_CAUSK Length = 212 Score = 128 bits (321), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 70/160 (43%), Positives = 92/160 (57%), Gaps = 5/160 (3%) Query: 54 GYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 G MSVAM++ G LGWT+ + GY Y+ P T PWPAMPQ+ +L G P Sbjct: 47 GKAMSVAMSSFGPLGWTSDKTGYRYTGRHPGTGAPWPAMPQALLDLWADL----GDPQTP 102 Query: 114 PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHG 173 PDA LIN Y A++ LHQD+DE D R P++S+SLG A+F+ GG R P + L L G Sbjct: 103 PDAALINLYRGEARMGLHQDRDEADPRFPVLSISLGDTAVFRIGGTSRKGPTRSLKLSSG 162 Query: 174 DVVVWGGESRLFYHGIQPLKAGFHPLTI-DCRYNLTFRQA 212 DV G +RL +HG+ + G L R N+T R+A Sbjct: 163 DVCRLSGPARLAFHGVDRILPGSSSLVAGGGRINITLRRA 202 >UniRef50_Q28VY2 DNA-N1-methyladenine dioxygenase n=30 Tax=Bacteria RepID=Q28VY2_JANSC Length = 216 Score = 125 bits (314), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 74/207 (35%), Positives = 110/207 (53%), Gaps = 8/207 (3%) Query: 7 DAEPWQEPLAAGAVILRRFAFNAAEQ--LIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 ++E + + G +++ + +Q L+ + V +P + T G MSV MT+ Sbjct: 12 ESEEFALSVDVGGIVVHPEHLDGPDQAELVEQVRRVVRSAPLYRPETRTGRKMSVRMTSA 71 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 G GW + R+GY Y P + WP +P + + + A P +CLIN Y Sbjct: 72 GTYGWISDRRGYRYDRCHPD-GQDWPPIPPMALEIWRAVSGVA----QDPQSCLINYYDA 126 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 GAK+ +HQD+DE D P+VSVSLG A+F+ GG KR + + L+ GDV V GGE+RL Sbjct: 127 GAKMGMHQDRDEGDFDMPVVSVSLGDEALFRVGGPKRGGKTQSVWLKSGDVAVMGGEARL 186 Query: 185 FYHGIQPLKAGFHPLTID-CRYNLTFR 210 +HGI ++AG L + R NLT R Sbjct: 187 NFHGIDRIRAGSSTLLPNGGRINLTMR 213 >UniRef50_C9CYS1 Putative uncharacterized protein n=2 Tax=Alphaproteobacteria RepID=C9CYS1_9RHOB Length = 200 Score = 125 bits (314), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 71/187 (37%), Positives = 97/187 (51%), Gaps = 6/187 (3%) Query: 25 FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQ 84 A A +LI+ + V +P PGG MSV MT+ G GW + + GY Y+ P Sbjct: 16 LAAEAQRELIQALRPVLRAAPLFSPEVPGGGQMSVRMTSAGAFGWFSDKSGYRYADRHP- 74 Query: 85 TNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIV 144 + + WP +P + TA D PD CL N Y GA++ LHQDKDE D P+V Sbjct: 75 SGQAWPEIPAEVLKIW----TALIDRDRMPDCCLFNYYGEGARMGLHQDKDEADFSYPVV 130 Query: 145 SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLK-AGFHPLTIDC 203 S+SLG + + GG R + + + L GDVVV GG++RL YHG+ ++ L Sbjct: 131 SISLGDDGLLRVGGTSRKEKTESIWLNSGDVVVMGGDARLAYHGVDRIRFRSSRLLPKGG 190 Query: 204 RYNLTFR 210 R NLT R Sbjct: 191 RVNLTLR 197 >UniRef50_A7HZ41 2OG-Fe(II) oxygenase n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HZ41_PARL1 Length = 221 Score = 120 bits (301), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 73/193 (37%), Positives = 102/193 (52%), Gaps = 7/193 (3%) Query: 25 FAFNAAEQLIRDIN-DVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDP 83 F A L+ + + P+R + G S+ TN G LGW + GY YSP++ Sbjct: 31 FGETAQRALVERLQAGFGAAPPYRPRMPRTGRPWSILQTNFGQLGWVSRPGGYAYSPVND 90 Query: 84 QTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY-APGAKLSLHQDKDEPDLRAP 142 + PWPA+P + L A YP P+ CL+N Y AP +++ LH+D+DE L AP Sbjct: 91 VSKAPWPAIPAALLALWD---DLAAYPA-PPECCLVNLYDAPKSRMGLHRDEDEEALDAP 146 Query: 143 IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI- 201 ++S+SLG IF+ GG R D K L GDV+V GG SRL YHG+ + +G L Sbjct: 147 VLSLSLGDTCIFRVGGFARGDKSKSFRLASGDVLVLGGASRLRYHGVDRVISGSSRLIPG 206 Query: 202 DCRYNLTFRQAGK 214 R NLT R+ + Sbjct: 207 GGRINLTLRRVTR 219 >UniRef50_A3VR77 Alkylated DNA repair protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VR77_9PROT Length = 222 Score = 117 bits (293), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 62/140 (44%), Positives = 76/140 (54%), Gaps = 4/140 (2%) Query: 50 VTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGY 109 VTPGG TMS N G LGW T R+GY Y P P WP MP + + A Sbjct: 49 VTPGGQTMSARQMNLGPLGWVTDRRGYRYEPRHPVDGAAWPEMPPALIRIWNDLLPEAP- 107 Query: 110 PDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLL 169 P+A L+N Y P AK+ LH+D DE PI+SVS G P F+ GG R + ++ Sbjct: 108 ---SPEAGLVNLYGPTAKMGLHRDADEAAKDVPILSVSFGAPGRFRLGGATRKGSTRSIV 164 Query: 170 LEHGDVVVWGGESRLFYHGI 189 L HGDV++ G SR FYHGI Sbjct: 165 LGHGDVLILAGPSRHFYHGI 184 >UniRef50_D2B1L1 Alkylated DNA repair protein n=3 Tax=Actinomycetales RepID=D2B1L1_STRRD Length = 213 Score = 88.6 bits (218), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 61/165 (36%), Positives = 86/165 (52%), Gaps = 14/165 (8%) Query: 52 PGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD 111 PGG MSV T C W +R Y P++P +PQ L + A Sbjct: 54 PGGGLMSV-RTVCLGRRWRPYR--YTDEPVEP--------LPQWLAELGRAAVAQTLGGP 102 Query: 112 FQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGL-KRNDPLKRLLL 170 ++PD L+N Y A + +HQD+DE AP+VS+SLG +F+FG R P + L Sbjct: 103 YEPDVALVNFYDDAATMGMHQDRDE-RAAAPVVSLSLGDACVFRFGNTATRARPWSDVRL 161 Query: 171 EHGDVVVWGGESRLFYHGIQPLKAGFHPLT-IDCRYNLTFRQAGK 214 E GD+ V+GG SRL +HG++ + G P I R N+T RQ+G+ Sbjct: 162 ESGDLFVFGGPSRLAFHGVRRILPGTGPHDLIQGRLNITLRQSGQ 206 >UniRef50_C7QZR3 2OG-Fe(II) oxygenase n=29 Tax=Actinomycetales RepID=C7QZR3_JONDD Length = 231 Score = 86.3 bits (212), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 77/222 (34%), Positives = 108/222 (48%), Gaps = 17/222 (7%) Query: 4 LFADAEP--WQEPLAAGAVILRRF-AFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 LF+DA+ +E +A GAV L + L R A+ + T G+ MSV Sbjct: 2 LFSDADVPRVREEIAPGAVWLPGWLTIPQQAWLARQCAQWAAGPVPIRSATVRGHPMSVK 61 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQR-AATAAGYPD----FQPD 115 T C +GW Y +D + P+ L +R A A G D + PD Sbjct: 62 -TVC--VGWHWRPYAYSRDAVDVNGQRV-VEFPKWMVRLGRRIVADATGDEDRALAYTPD 117 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGD 174 LIN Y A++ +HQDKDE L AP+VS+S+G F+FG + RN P + + L GD Sbjct: 118 TALINFYDVQARMGMHQDKDEKSL-APVVSLSIGDTCTFRFGNTENRNRPYRDIALASGD 176 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTIDC---RYNLTFRQAG 213 V V+GG SRL +HG+Q + A P R+N+T R+ G Sbjct: 177 VFVFGGPSRLAFHGVQKIHAESAPDGCGVEHGRWNITMRETG 218 >UniRef50_Q7KUZ2 AlkB n=12 Tax=Drosophila RepID=Q7KUZ2_DROME Length = 332 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 46/125 (36%), Positives = 66/125 (52%), Gaps = 5/125 (4%) Query: 67 LGWTT--HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 + WTT + + D + P+P + +LC A A GY DF+P+A ++N Y Sbjct: 141 MRWTTFGYHHNWDTKIYDEEMQSPFP---EDLSSLCGLFAQALGYADFKPEAAIVNYYPV 197 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 G+ LS H D EP+ AP+ S S G AIF GG + + L+ GDV++ GESRL Sbjct: 198 GSTLSGHTDHSEPNKSAPLFSFSFGQTAIFLIGGRSLEEKPTAIYLQSGDVMIMSGESRL 257 Query: 185 FYHGI 189 YH + Sbjct: 258 CYHAV 262 >UniRef50_C0Z2F3 AT5G01780 protein n=9 Tax=Magnoliophyta RepID=C0Z2F3_ARATH Length = 217 Score = 83.6 bits (205), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 51/164 (31%), Positives = 81/164 (49%), Gaps = 34/164 (20%) Query: 82 DPQT--------NKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPDACL 118 DPQT + P +P +F+ L ++A A P PD C+ Sbjct: 53 DPQTKYRKNTDIDSKAPEIPVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICI 112 Query: 119 INRYAPGAKLSLHQDKDEPDLRA----PIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 +N Y+ +L LHQD+DE + PIVS S+G A F +G + + + ++LE GD Sbjct: 113 VNFYSETGRLGLHQDRDESEESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGD 172 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211 V+++GGESR+ +HG++ + P+++ R NLTFR Sbjct: 173 VLIFGGESRMIFHGVKSIIPNSAPMSLLNESKLRTGRLNLTFRH 216 >UniRef50_Q9LZW8 Putative uncharacterized protein T20L15_50 n=3 Tax=Arabidopsis thaliana RepID=Q9LZW8_ARATH Length = 449 Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 51/164 (31%), Positives = 81/164 (49%), Gaps = 34/164 (20%) Query: 82 DPQT--------NKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPDACL 118 DPQT + P +P +F+ L ++A A P PD C+ Sbjct: 285 DPQTKYRKNTDIDSKAPEIPVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICI 344 Query: 119 INRYAPGAKLSLHQDKDEPDLRA----PIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 +N Y+ +L LHQD+DE + PIVS S+G A F +G + + + ++LE GD Sbjct: 345 VNFYSETGRLGLHQDRDESEESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGD 404 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211 V+++GGESR+ +HG++ + P+++ R NLTFR Sbjct: 405 VLIFGGESRMIFHGVKSIIPNSAPMSLLNESKLRTGRLNLTFRH 448 >UniRef50_Q9LJH2 Similarity to unknown protein n=7 Tax=Embryophyta RepID=Q9LJH2_ARATH Length = 455 Score = 82.0 bits (201), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 52/160 (32%), Positives = 76/160 (47%), Gaps = 26/160 (16%) Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAA-------------TAAG--YPDFQPDACLINRY 122 Y P P +P F+ ++A T G P PD C++N Y Sbjct: 295 YGETRPFDGSTAPRIPAEFNQFVEKAVKESQSLAASNSKQTKGGDEIPFMLPDICIVNFY 354 Query: 123 APGAKLSLHQDKDEPD--LRA--PIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 + +L LHQDKDE + +R P+VS S+G A F +G + D + L LE GDV+++ Sbjct: 355 SSTGRLGLHQDKDESENSIRKGLPVVSFSIGDSAEFLYGDQRDEDKAETLTLESGDVLLF 414 Query: 179 GGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211 GG SR +HG++ ++ P + R NLTFRQ Sbjct: 415 GGRSRKVFHGVRSIRKDTAPKALLQETSLRPGRLNLTFRQ 454 >UniRef50_Q17GQ0 Putative uncharacterized protein (Fragment) n=2 Tax=Culicini RepID=Q17GQ0_AEDAE Length = 292 Score = 81.6 bits (200), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 49/128 (38%), Positives = 66/128 (51%), Gaps = 11/128 (8%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPA-----MPQSFHNLCQRAATAAGYPDFQPDACLINR 121 L WTT GY Y TNK + P LC+ A + G+ F+P+A ++N Sbjct: 90 LRWTT--LGYHYD----WTNKIYEEAARNEFPADLEELCRHFAESLGFRGFKPEAAIVNY 143 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y G+ L+ H D E +L AP+ S S G PA+F GG R++ LLL GDV+V Sbjct: 144 YPTGSTLAGHTDHSEKNLEAPLFSFSFGQPAVFLIGGPTRDEKPDALLLRSGDVIVMTRA 203 Query: 182 SRLFYHGI 189 SRL YH + Sbjct: 204 SRLCYHAV 211 >UniRef50_Q7QEU7 AGAP000155-PA n=1 Tax=Anopheles gambiae RepID=Q7QEU7_ANOGA Length = 293 Score = 81.3 bits (199), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 42/98 (42%), Positives = 52/98 (53%) Query: 92 MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLP 151 P L + AT GY F P+A ++N Y GA L+ H D E D AP+ S S G P Sbjct: 129 FPCELGALVRYVATTLGYDRFSPEAAIVNYYPAGATLAGHTDHSEDDQTAPLFSFSFGQP 188 Query: 152 AIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 A+F GG R + LLL GD+VV G SRL YH + Sbjct: 189 AVFLIGGTSREEHPDALLLRSGDIVVMTGASRLCYHAV 226 >UniRef50_C6TKW1 Putative uncharacterized protein n=1 Tax=Glycine max RepID=C6TKW1_SOYBN Length = 311 Score = 80.9 bits (198), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 53/139 (38%), Positives = 73/139 (52%), Gaps = 21/139 (15%) Query: 90 PAMPQSFHNLCQRAA--TAAGYPDFQPDACLINRYAPGAKLSLHQDKDE-PD---LRAPI 143 P +P FH+ A + A P PD C++N Y+ +L LHQDKDE PD L P+ Sbjct: 176 PQIPPEFHSHVHSALKDSNALLPSISPDICIVNFYSETGRLGLHQDKDESPDSLRLGLPV 235 Query: 144 VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT--- 200 +S S+G A F + + D K+LLL+ GDV+++GG SR +HG+ A HP T Sbjct: 236 ISFSIGDSADFLYADHRDLDQPKKLLLQSGDVLIFGGPSRNLFHGV----ASIHPNTAPN 291 Query: 201 --------IDCRYNLTFRQ 211 R NLTFR+ Sbjct: 292 LLLQHTNLCPGRLNLTFRR 310 >UniRef50_UPI0001983B96 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001983B96 Length = 456 Score = 78.2 bits (191), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 64/237 (27%), Positives = 104/237 (43%), Gaps = 49/237 (20%) Query: 12 QEPLAAGAVILRRF-AFNAAEQLIRDINDVASQSPFRQMVTPGGY---------TMSVAM 61 QE L G V+L+ + + ++++ D+ V PGG+ + + M Sbjct: 231 QEVLRPGMVLLKGYISLTEQIKMVKKCRDLG--------VGPGGFYRPGYQDGAKLRLQM 282 Query: 62 TNCGHLGWTTHRQGY-LYSPIDPQTNKPWPAMPQSFHNLCQRAATAA------------- 107 C + W + Y + P+D P +P F L +RA + Sbjct: 283 M-CLGMNWDPQTRKYEKWHPLD---GSETPDIPHEFSVLVERAIQDSQSLIKKNSGENNV 338 Query: 108 --GYPDFQPDACLINRYAPGAKLSLHQDKDEPD---LRA-PIVSVSLGLPAIFQFGGLKR 161 P P+ C++N Y +L LHQD+DE + L+ P+VS SLG A F +G + Sbjct: 339 EDTLPRMSPNICIVNFYTTSGRLGLHQDRDESEESLLKGLPVVSFSLGDSAEFLYGNQRN 398 Query: 162 NDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQ 211 D +++LE GDV+++GG SR +HG+ + P + + R NLT RQ Sbjct: 399 VDAAGKVVLESGDVLIFGGPSRHIFHGVSSIIPNSAPNSLLEETNLLPGRLNLTLRQ 455 >UniRef50_UPI00005257D6 PREDICTED: similar to AlkB CG33250-PA n=1 Tax=Ciona intestinalis RepID=UPI00005257D6 Length = 312 Score = 78.2 bits (191), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 46/139 (33%), Positives = 70/139 (50%), Gaps = 6/139 (4%) Query: 55 YTMSVAMTNCG---HLGWTTHRQGYLYSPIDPQ-TNKPWPAMPQSFHNLCQRAATAAGYP 110 Y S+++ C L W T GY ++ Q + +P +P + A+ G Sbjct: 137 YEDSLSLLKCCPIWKLRWAT--LGYHHNWNSKQYSEQPCSELPSELRKTSKLFASMIGTD 194 Query: 111 DFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLL 170 DF+ +A ++N Y G LS H D E L AP+VS+S GL A+F GG ++ + L + Sbjct: 195 DFKAEASIVNYYHVGNALSPHDDTSELYLEAPLVSLSFGLSAVFLIGGTSKDQKPEALFI 254 Query: 171 EHGDVVVWGGESRLFYHGI 189 GDV++ G SRL YH + Sbjct: 255 RSGDVIIMSGASRLAYHAV 273 >UniRef50_D1HAA5 Whole genome shotgun sequence of line PN40024, scaffold_58.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HAA5_VITVI Length = 554 Score = 78.2 bits (191), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 64/237 (27%), Positives = 104/237 (43%), Gaps = 49/237 (20%) Query: 12 QEPLAAGAVILRRF-AFNAAEQLIRDINDVASQSPFRQMVTPGGY---------TMSVAM 61 QE L G V+L+ + + ++++ D+ V PGG+ + + M Sbjct: 329 QEVLRPGMVLLKGYISLTEQIKMVKKCRDLG--------VGPGGFYRPGYQDGAKLRLQM 380 Query: 62 TNCGHLGWTTHRQGY-LYSPIDPQTNKPWPAMPQSFHNLCQRAATAA------------- 107 C + W + Y + P+D P +P F L +RA + Sbjct: 381 M-CLGMNWDPQTRKYEKWHPLD---GSETPDIPHEFSVLVERAIQDSQSLIKKNSGENNV 436 Query: 108 --GYPDFQPDACLINRYAPGAKLSLHQDKDEPD---LRA-PIVSVSLGLPAIFQFGGLKR 161 P P+ C++N Y +L LHQD+DE + L+ P+VS SLG A F +G + Sbjct: 437 EDTLPRMSPNICIVNFYTTSGRLGLHQDRDESEESLLKGLPVVSFSLGDSAEFLYGNQRN 496 Query: 162 NDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQ 211 D +++LE GDV+++GG SR +HG+ + P + + R NLT RQ Sbjct: 497 VDAAGKVVLESGDVLIFGGPSRHIFHGVSSIIPNSAPNSLLEETNLLPGRLNLTLRQ 553 >UniRef50_B3RXF0 Putative uncharacterized protein (Fragment) n=1 Tax=Trichoplax adhaerens RepID=B3RXF0_TRIAD Length = 271 Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 44/123 (35%), Positives = 60/123 (48%), Gaps = 1/123 (0%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 L WTT Y +S + N+ P L + A G+P F P+A +IN Y + Sbjct: 77 LRWTTLGYHYDWSTKEYYHNRK-SEFPTDLAELTKLLAATVGFPLFSPEAAIINYYKLDS 135 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY 186 LS H D E D AP+ S+S G AIF GG + + +E GD+ + GESRL Y Sbjct: 136 TLSGHTDHSEFDFTAPLFSISFGQKAIFLLGGRTTSVTPVAMYIESGDICIMSGESRLAY 195 Query: 187 HGI 189 H + Sbjct: 196 HAV 198 >UniRef50_UPI000051A07C PREDICTED: similar to AlkB CG33250-PA n=3 Tax=Neoptera RepID=UPI000051A07C Length = 310 Score = 76.6 bits (187), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 5/111 (4%) Query: 92 MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLP 151 +P L A G+ DF+ +A +IN Y + L+ H D E ++ AP+ S+S G Sbjct: 156 IPIELSLLTSFLAQTLGFKDFKAEAAIINYYRMNSTLAGHTDHSELNVEAPLFSISFGQT 215 Query: 152 AIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID 202 AIF GGL + D + L GD+++ G SRL YHGI + LTID Sbjct: 216 AIFLIGGLMQEDTTNAIFLRSGDIIIMSGMSRLRYHGIPKI-----LLTID 261 >UniRef50_B9GZQ0 Predicted protein n=13 Tax=Magnoliophyta RepID=B9GZQ0_POPTR Length = 353 Score = 76.3 bits (186), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 45/125 (36%), Positives = 64/125 (51%), Gaps = 2/125 (1%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--YPDFQPDACLINRYAP 124 L W+T + +S + + P +P L ++ A A +F P+A ++N +A Sbjct: 187 LRWSTLGLQFDWSKRNYNVSLPHNKIPDGLCQLAKKLAAPAMPVGEEFHPEAAIVNYFAS 246 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 G L H D E D PIVS+SLG AIF GG R DP + L GDVV+ GE+R Sbjct: 247 GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSREDPPLAMFLRSGDVVLMAGEARE 306 Query: 185 FYHGI 189 +HG+ Sbjct: 307 CFHGV 311 >UniRef50_Q9SA98 Alkylated DNA repair protein alkB homolog n=4 Tax=Magnoliophyta RepID=ALKBH_ARATH Length = 345 Score = 76.3 bits (186), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 49/134 (36%), Positives = 69/134 (51%), Gaps = 8/134 (5%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT--AAGYPD---FQPDACLINR 121 L W+T + +S + + P +P + LCQ A T A PD F+P+ ++N Sbjct: 177 LRWSTLGLQFDWSKRNYDVSLPHNNIPDA---LCQLAKTHAAIAMPDGEEFRPEGAIVNY 233 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 + G L H D E D PIVS+SLG AIF GG ++DP + L GDVV+ GE Sbjct: 234 FGIGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSKDDPPHAMYLRSGDVVLMAGE 293 Query: 182 SRLFYHGIQPLKAG 195 +R +HGI + G Sbjct: 294 ARECFHGIPRIFTG 307 >UniRef50_C7NLG8 Alkylated DNA repair protein n=2 Tax=Actinomycetales RepID=C7NLG8_KYTSD Length = 228 Score = 76.3 bits (186), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 60/173 (34%), Positives = 81/173 (46%), Gaps = 20/173 (11%) Query: 54 GYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 G MSV M GW GY + Q P P +P L +RA A G+ + Sbjct: 60 GGRMSVTMVP---FGWVWTSAGYART--GEQDAAPLP-VPDWMVRLYRRAVVATGFDGWA 113 Query: 114 ---PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKRLL 169 PD L+N Y P A + +H+D DE AP+VS+S+G F+FG + R P + Sbjct: 114 EAAPDVALVNHYRPDASMGMHRDADE-LTEAPVVSLSVGDACTFRFGSTETRTRPWTDIR 172 Query: 170 LEHGDVVVWGGESRLFYHG---IQPLKAG------FHPLTIDCRYNLTFRQAG 213 LE GD+VV+GG +R +HG I P AG + R N+T R G Sbjct: 173 LESGDLVVFGGPARRAFHGVPRIHPGTAGPQVAAAQAEAELPGRLNITLRVTG 225 >UniRef50_A9TG90 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TG90_PHYPA Length = 401 Score = 75.1 bits (183), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 1/103 (0%) Query: 88 PWPAMPQSFHNLCQRAAT-AAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSV 146 P+ +P +L +R A A DF+ +A ++N Y P L H D E D+ PIVS+ Sbjct: 253 PFQEIPPKLADLARRLAKPAMENEDFKAEAAIVNFYGPDDMLGGHVDDMEADMSKPIVSI 312 Query: 147 SLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 SLG AIF GG R++P + + GDVV+ G +R +HG+ Sbjct: 313 SLGCKAIFLLGGTTRDEPPAAMFVRSGDVVLMAGPARHCFHGV 355 >UniRef50_UPI0001BCD579 alkylated DNA repair protein AlkB n=1 Tax=Campylobacter fetus subsp. venerealis str. Azul-94 RepID=UPI0001BCD579 Length = 69 Score = 75.1 bits (183), Expect = 1e-12, Method: Composition-based stats. Identities = 34/55 (61%), Positives = 40/55 (72%) Query: 158 GLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G+KR DP+ + +L HGDVVVW G SRLFYHGI PLK+G H R NLTFR+A Sbjct: 14 GMKRTDPITKYILHHGDVVVWVGPSRLFYHGILPLKSGEHERLGPIRLNLTFRKA 68 >UniRef50_C9YWY9 Putative DNA repair protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YWY9_STRSW Length = 242 Score = 74.7 bits (182), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 63/195 (32%), Positives = 86/195 (44%), Gaps = 29/195 (14%) Query: 46 FRQMVTPGGYTMSV-------------AMTNCGHL--------GWTTHRQGYLYSPIDPQ 84 R + TPGG TM+ A CG G T+ Y + +D Sbjct: 46 LRTVRTPGGGTMTARQVCLGRHWGVVPACPACGRAVRDNPACPGRHTYPYAYSRTVVD-G 104 Query: 85 TNKPWPAMPQSFHNLCQRAATAAGYPDFQPD---ACLINRYAPGAKLSLHQDKDEPDLRA 141 P P L +RA P D LIN Y A++ +H+D DEP A Sbjct: 105 DGAPVKPFPAWLGELGRRAVADTLGPQRATDPYDIALINYYDADARMGMHRDSDEPS-DA 163 Query: 142 PIVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP-- 198 P+VS+SLG +F+FG + R P + L GD+ V+GGE+R YHG+ + AG P Sbjct: 164 PVVSLSLGDTCLFRFGNPRTRTRPYTDVELRSGDLFVFGGEARRAYHGVPRVYAGTAPPG 223 Query: 199 LTIDCRYNLTFRQAG 213 L + R N+T R G Sbjct: 224 LGLTGRLNITLRAGG 238 >UniRef50_D2A2Y9 Putative uncharacterized protein GLEAN_07602 n=1 Tax=Tribolium castaneum RepID=D2A2Y9_TRICA Length = 297 Score = 74.3 bits (181), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 43/133 (32%), Positives = 67/133 (50%), Gaps = 8/133 (6%) Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPD 138 S + + NK P+ L + A + + F +A ++N Y + LS H D E + Sbjct: 118 SKVYAEENKG--EFPKDLAELSRFIAESLNFLHFNAEAAIVNYYHMDSTLSGHTDHSEHN 175 Query: 139 LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP 198 L+AP++S+S G AIF GG ++D + L GD+VV ESRL YHG+ + Sbjct: 176 LKAPLISLSFGQTAIFLLGGKTKDDEPSAMFLRSGDIVVMSEESRLCYHGVPKI------ 229 Query: 199 LTIDCRYNLTFRQ 211 L +D R+ F + Sbjct: 230 LQMDSRFWNCFEE 242 >UniRef50_B7PUG0 Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7PUG0_IXOSC Length = 287 Score = 73.9 bits (180), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 40/98 (40%), Positives = 51/98 (52%) Query: 92 MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLP 151 P L A AG+ FQP+A ++N YA + L H D E L AP+VS S G Sbjct: 148 FPDCLRELATGLARLAGFSAFQPEAAIVNYYAMDSALGGHVDNSELALDAPVVSASFGQT 207 Query: 152 AIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 A+F GG R + LLL GDV+V G +RL YH + Sbjct: 208 AVFLVGGATRERRPRALLLRSGDVLVMSGPARLAYHAV 245 >UniRef50_D2V2M0 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V2M0_NAEGR Length = 314 Score = 73.9 bits (180), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 40/124 (32%), Positives = 62/124 (50%), Gaps = 3/124 (2%) Query: 67 LGWTTHRQGYLYSPIDPQTNK-PWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 L W T GY Y + +K + P N C A Y ++P+A ++N Y+ Sbjct: 131 LAWCT--LGYQYEWTTRKYHKDKFVQFPHDIGNFCDLIACQCNYGPYKPEAAIVNFYSKD 188 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLF 185 + H D E ++ PI+S+S+G +IF GG R+ K + LE GD ++ GG +R Sbjct: 189 RLMGGHVDDAEYEMTKPIISLSIGSKSIFLLGGETRDTEPKAIFLESGDCMIMGGRARYC 248 Query: 186 YHGI 189 +HGI Sbjct: 249 FHGI 252 >UniRef50_D1IRG0 Whole genome shotgun sequence of line PN40024, scaffold_2.assembly12x (Fragment) n=2 Tax=rosids RepID=D1IRG0_VITVI Length = 457 Score = 72.8 bits (177), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 42/109 (38%), Positives = 59/109 (54%), Gaps = 11/109 (10%) Query: 114 PDACLINRYAPGAKLSLHQDKDEPD--LRA--PIVSVSLGLPAIFQFGGLKRNDPLKRLL 169 PD C++N Y +L LHQD+DE + LR P+VS S+G A F + + +L Sbjct: 348 PDICIVNFYTTSGRLGLHQDRDETEETLRKGLPVVSFSIGDSAKFLYSNQRDVFNADEVL 407 Query: 170 LEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211 LE GDV+++GGESR +HG+ + P + R NLTFRQ Sbjct: 408 LESGDVLIFGGESRRIFHGVASILPNTSPQVLLKETNLRPGRLNLTFRQ 456 >UniRef50_UPI0000E47318 PREDICTED: similar to LOC494680 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47318 Length = 488 Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 46/124 (37%), Positives = 61/124 (49%), Gaps = 3/124 (2%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAM-PQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 L W T GY Y + N+ ++ P+ + A GYP FQ A ++N Y Sbjct: 151 LRWVT--LGYHYDWNNKVYNEDQHSLFPEDLGPMSALIAEVLGYPRFQSQAAIVNFYHMD 208 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLF 185 + L H D E DL AP++S SLG AI GG + L L GDV++ GGESRL Sbjct: 209 STLGGHTDHSEFDLTAPLISYSLGQSAILLVGGKTKATKPLALHLRSGDVIILGGESRLA 268 Query: 186 YHGI 189 YH + Sbjct: 269 YHAV 272 >UniRef50_C0WHI3 Alkylated DNA repair protein n=3 Tax=Corynebacterium RepID=C0WHI3_9CORY Length = 227 Score = 71.6 bits (174), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 61/197 (30%), Positives = 87/197 (44%), Gaps = 23/197 (11%) Query: 35 RDINDVASQSPFRQMVTP---GGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPA 91 R+I + +P MV P G MSV HLG H Y Y +D P Sbjct: 37 REIARAYAHTPM-AMVQPRLKSGGQMSVFQL---HLGRYWHYPSYRY--VDNMEGTRVPP 90 Query: 92 MPQSFHNLCQRAATAAG---------YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAP 142 +P S L A A +F P+ L+N Y PG+ + +H D E AP Sbjct: 91 VPDSLRELAPVALRQAAQVAPELEPWVDNFVPEMALVNYYPPGSAMGMHVDDSEGS-PAP 149 Query: 143 IVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI 201 ++S+S+G A+F+ G + R P + L GD+VV+GG R YHG+ + G P Sbjct: 150 VISLSIGDEALFRIGHTENRTKPWDDVTLCSGDLVVFGGPKRFAYHGVVRVNDGTLPEGC 209 Query: 202 ---DCRYNLTFRQAGKK 215 + R N+T RQ + Sbjct: 210 GLQEGRINITIRQVSAR 226 >UniRef50_C0PA85 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0PA85_MAIZE Length = 389 Score = 70.9 bits (172), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 37/104 (35%), Positives = 56/104 (53%), Gaps = 2/104 (1%) Query: 88 PWPAMPQSFHNLCQRAATAA--GYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVS 145 P +P + +L ++ A A +F+P+A ++N Y P L H D E D PIVS Sbjct: 242 PHNKIPGALASLAKKMAIPAMPSGEEFKPEAAIVNYYGPSDMLGGHVDDMEADWTKPIVS 301 Query: 146 VSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 +SLG IF GG R++ + L GD+V+ GE+R +HG+ Sbjct: 302 ISLGCKCIFLLGGKTRDEVPTAMFLRSGDIVLMAGEARERFHGV 345 >UniRef50_Q13686 Alkylated DNA repair protein alkB homolog 1 n=27 Tax=Euteleostomi RepID=ALKB1_HUMAN Length = 389 Score = 70.9 bits (172), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 33/98 (33%), Positives = 54/98 (55%) Query: 92 MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLP 151 P L ++ A A G+ DF+ +A ++N Y + L +H D+ E D P++S S G Sbjct: 192 FPSDLGFLSEQVAAACGFEDFRAEAGILNYYRLDSTLGIHVDRSELDHSKPLLSFSFGQS 251 Query: 152 AIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 AIF GGL+R++ + + GD+++ G SRL H + Sbjct: 252 AIFLLGGLQRDEAPTAMFMHSGDIMIMSGFSRLLNHAV 289 >UniRef50_C2BL13 Alkylated DNA repair protein n=2 Tax=Corynebacterium RepID=C2BL13_9CORY Length = 229 Score = 70.5 bits (171), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 50/163 (30%), Positives = 75/163 (46%), Gaps = 16/163 (9%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------YPDFQPDA 116 HLG H Y Y +D P +P+S + A AA F P+ Sbjct: 67 HLGRYWHYPSYRY--VDNMEGTRVPPVPESLRQIAPGALRAAAEVAPELEPWVDTFVPEM 124 Query: 117 CLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGDV 175 L+N Y PG+ + + D E + AP++S+S+G A+F+ G + R P + L GD+ Sbjct: 125 ALVNYYPPGSAMGMRVDDSE-ESPAPVISLSIGDEALFRMGHTEARTRPWDDITLCSGDL 183 Query: 176 VVWGGESRLFYHGIQPLKAGFHPLTI---DCRYNLTFRQAGKK 215 VV+GG R YHG+ + G P + R N+T RQ + Sbjct: 184 VVFGGPKRFAYHGVVRVNDGTLPEGCGLREGRINITIRQVSAR 226 >UniRef50_C8NRG2 DNA repair protein n=9 Tax=Actinomycetales RepID=C8NRG2_COREF Length = 237 Score = 70.5 bits (171), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 38/104 (36%), Positives = 61/104 (58%), Gaps = 5/104 (4%) Query: 112 FQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGL-KRNDPLKRLLL 170 ++ +A L+N YAPG+ + +HQD +E AP++S+S+G IF+ G RN P + L Sbjct: 132 YRAEAALVNYYAPGSAMGMHQDANELS-EAPVISLSIGDTGIFRLGNTDNRNRPWVDVPL 190 Query: 171 EHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC---RYNLTFRQ 211 GD++++GGE R +HG+ ++A P R N+T RQ Sbjct: 191 LSGDLIIFGGEHRRAFHGVPRIEADTAPEGCGLDRGRINITIRQ 234 >UniRef50_B5Y3R7 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y3R7_PHATR Length = 352 Score = 70.5 bits (171), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 1/84 (1%) Query: 110 PDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRL 168 P F A ++N Y P + + H+D E L PIVS+SLG PA+F GG ++D P+ + Sbjct: 206 PCFTASASIVNFYTPKSMMGGHRDDLEHALDKPIVSISLGRPAVFLLGGNTKDDQPVVAI 265 Query: 169 LLEHGDVVVWGGESRLFYHGIQPL 192 L+ GDV++ GG SRL YHG+ L Sbjct: 266 LVRPGDVMMMGGASRLRYHGMARL 289 >UniRef50_C3XQU3 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQU3_BRAFL Length = 365 Score = 69.7 bits (169), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 60/123 (48%), Gaps = 1/123 (0%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 L WTT Y + + Q + + P L A G+P ++ + ++N Y + Sbjct: 158 LRWTTLGYHYDWDKKEYQQER-YTEFPPDLSQLSTHVAQTLGFPRYRAQSAIVNYYGLDS 216 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY 186 +L H D E D PI+S S G A+F GG ++ + L +GD++V G++RL Y Sbjct: 217 QLGGHVDHQELDYSKPIISFSFGQTAVFLLGGKTKSVKPMAMFLRNGDIMVMSGDTRLAY 276 Query: 187 HGI 189 HG+ Sbjct: 277 HGV 279 >UniRef50_B6JWW7 AlkB-like protein n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JWW7_SCHJY Length = 296 Score = 69.3 bits (168), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 46/145 (31%), Positives = 70/145 (48%), Gaps = 7/145 (4%) Query: 50 VTPGGYTMSVAMTNC--GHLGWTTHRQGYLYSPI---DPQTNKPWPAMPQSFHNLCQRAA 104 + P G + V + N L W T + Y ++ DP T P+P + H + Sbjct: 116 IPPTGSSKPVTVKNLMEKKLRWITFGEQYNWTTRVYPDPATAPPFPE--KLGHLTEELVH 173 Query: 105 TAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP 164 A + D++ +A ++N Y+P LS H D E DL P++S+S+GL I+ G R D Sbjct: 174 KATEFKDWKAEAAIVNFYSPRDTLSGHVDDAEDDLTLPLLSMSIGLDCIYLLGTETRKDV 233 Query: 165 LKRLLLEHGDVVVWGGESRLFYHGI 189 K + L GD V+ G SR YH + Sbjct: 234 PKAIRLHSGDAVIMTGLSRKAYHAV 258 >UniRef50_A7RXQ8 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RXQ8_NEMVE Length = 323 Score = 69.3 bits (168), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 41/136 (30%), Positives = 66/136 (48%), Gaps = 7/136 (5%) Query: 60 AMTNCGH------LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 A++N H L W + Y+ +D + K + P+ L A A GY + Sbjct: 127 ALSNSEHVNLIDRLRWVHLGYQFDYNVVDYKPEKYY-GFPKDLGGLMHHLAEAIGYLGYT 185 Query: 114 PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHG 173 P+A ++N Y A + H D E DL P++SVS G A+F GG ++ L + G Sbjct: 186 PEAGIVNYYPLSASMGGHTDHYELDLSWPLISVSFGQSAVFLIGGKTKDVKPTALYIRSG 245 Query: 174 DVVVWGGESRLFYHGI 189 D+++ GE+RL +H + Sbjct: 246 DILIMSGEARLAFHAV 261 >UniRef50_A8J903 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8J903_CHLRE Length = 398 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 36/76 (47%), Positives = 45/76 (59%) Query: 112 FQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 ++PDA ++N Y G L H D E DL PIVSVSLG PA+F GG + LLL Sbjct: 323 YEPDAAIVNYYQIGDVLGGHVDDVESDLAQPIVSVSLGCPALFLMGGRTKATHPSALLLR 382 Query: 172 HGDVVVWGGESRLFYH 187 GDV+V G++R YH Sbjct: 383 GGDVLVLAGQARSCYH 398 >UniRef50_Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB n=1 Tax=Dictyostelium discoideum RepID=ALKB_DICDI Length = 393 Score = 68.6 bits (166), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 36/123 (29%), Positives = 60/123 (48%), Gaps = 1/123 (0%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 L W+T Y ++P + + + P L Q+ A A + + +A +N Y+ + Sbjct: 215 LAWSTLGYQYQWTP-RLYSEEFYEEFPDDLQELVQKIAIATKFDPYVAEAATVNFYSEDS 273 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY 186 + H D E ++ PI+S+S G A+F G R+ L + GD+V+ GG SR Y Sbjct: 274 IMGGHLDDAEQEMEKPIISISFGSTAVFLMGAETRDIAPVPLFIRSGDIVIMGGRSRYCY 333 Query: 187 HGI 189 HG+ Sbjct: 334 HGV 336 >UniRef50_O60066 Alkylated DNA repair protein alkB homolog n=2 Tax=Schizosaccharomyces pombe RepID=ALKBH_SCHPO Length = 297 Score = 68.6 bits (166), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 36/124 (29%), Positives = 63/124 (50%), Gaps = 1/124 (0%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT-AAGYPDFQPDACLINRYAPG 125 L W T + Y ++ + P P+ + ++ + + ++ +A ++N Y+PG Sbjct: 135 LRWVTLGEQYDWTTKEYPDPSKSPGFPKDLGDFVEKVVKESTDFLHWKAEAAIVNFYSPG 194 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLF 185 LS H D+ E DL P++S+S+GL I+ G R++ L L GDVV+ G SR Sbjct: 195 DTLSAHIDESEEDLTLPLISLSMGLDCIYLIGTESRSEKPSALRLHSGDVVIMTGTSRKA 254 Query: 186 YHGI 189 +H + Sbjct: 255 FHAV 258 >UniRef50_Q6C333 YALI0F03003p n=1 Tax=Yarrowia lipolytica RepID=Q6C333_YARLI Length = 330 Score = 67.0 bits (162), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 53/136 (38%), Positives = 69/136 (50%), Gaps = 19/136 (13%) Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 +T G WTT Y P T +P P++ + L R + P+A +IN Sbjct: 174 VTLGGQYNWTTKA----YPSFIPGTEG-FPYFPKNLYELLSRPLFS-----INPEAAIIN 223 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-------PLKRLLLEHG 173 Y+PG LS HQD E + +VSVS+GL AIF + GL R D P L+L G Sbjct: 224 FYSPGDILSPHQDVAELS-QDDLVSVSIGLDAIF-YVGLNRYDDSENSLAPPLCLMLRSG 281 Query: 174 DVVVWGGESRLFYHGI 189 DV+V GG+SR YHGI Sbjct: 282 DVIVMGGKSRHAYHGI 297 >UniRef50_Q010W8 Oxidoreductase, 2OG-Fe (ISS) n=1 Tax=Ostreococcus tauri RepID=Q010W8_OSTTA Length = 214 Score = 67.0 bits (162), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 45/136 (33%), Positives = 67/136 (49%), Gaps = 17/136 (12%) Query: 95 SFHNLCQRAATAA-----GYPDFQPDACLINRYAPGAKLSLHQDKDEPDL---RA--PIV 144 + +C A AA PD P CL+N Y GA+ H+D ++P L RA PIV Sbjct: 56 ALREMCAEAVRAAQKVDDAMPDMNPTTCLVNFYKDGAEFKWHKDSEDPKLVKSRAGPPIV 115 Query: 145 SVSLGLPAIFQFGGLKRNDPLKRLL-LEHGDVVVWGGESRLFYHGIQPLKAGFHP----- 198 S S+G+ A F + DP ++ L GDV+++GG SR+ H + + G P Sbjct: 116 SFSVGMSADFGY-KYSFEDPTHEVVRLNSGDVLLFGGPSRMIVHSVLNVHPGSMPGHLRG 174 Query: 199 LTIDCRYNLTFRQAGK 214 ++ R N+T R G+ Sbjct: 175 KMLNGRLNVTVRDIGE 190 >UniRef50_C1FFM9 Predicted protein (Fragment) n=2 Tax=Micromonas RepID=C1FFM9_9CHLO Length = 126 Score = 64.7 bits (156), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 33/74 (44%), Positives = 43/74 (58%) Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 A L+N Y G +L+ H D E D+ PIVSVSLG P +F GG R+ +L+ GD Sbjct: 1 AGLVNYYRSGDQLAGHVDDAEVDMSKPIVSVSLGCPCVFLLGGRSRDVAPTAVLMRSGDA 60 Query: 176 VVWGGESRLFYHGI 189 +V G SR YHG+ Sbjct: 61 IVLTGPSRRCYHGV 74 >UniRef50_C1EB25 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EB25_9CHLO Length = 334 Score = 62.8 bits (151), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 10/125 (8%) Query: 100 CQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEP-----DLRAPIVSVSLGLPAIF 154 Q A + PD P L+N Y GAK H+D ++P D PIVS ++GL A F Sbjct: 186 AQTADSCTNVPDMNPTTALVNFYKEGAKFKWHRDSEDPAHARHDTGPPIVSFTVGLSADF 245 Query: 155 QFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP-----LTIDCRYNLTF 209 + + + + L GDV+++GG SR+ H + + P + R N+T Sbjct: 246 SYKNRFEDATHRTVRLNSGDVLLFGGPSRMIVHSVTGVVPRTMPPMLRGRMLHGRLNVTV 305 Query: 210 RQAGK 214 R G+ Sbjct: 306 RDIGR 310 >UniRef50_A4H8G0 Putative uncharacterized protein n=3 Tax=Leishmania RepID=A4H8G0_LEIBR Length = 440 Score = 62.8 bits (151), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 31/83 (37%), Positives = 43/83 (51%), Gaps = 1/83 (1%) Query: 108 GYPD-FQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK 166 YPD ++P ++N Y G+ + HQD E L P++S+SLG A+F G R D Sbjct: 187 AYPDTYEPQTAIVNYYPVGSMMMCHQDVSEETLEQPLMSLSLGCSAVFLMGTQSREDAPH 246 Query: 167 RLLLEHGDVVVWGGESRLFYHGI 189 LL GDV + G SR +H Sbjct: 247 AFLLRSGDVAAFTGPSRAAFHST 269 >UniRef50_Q4D9X3 Alkylated DNA repair protein, putative n=3 Tax=Trypanosoma RepID=Q4D9X3_TRYCR Length = 323 Score = 62.0 bits (149), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 26/78 (33%), Positives = 44/78 (56%) Query: 112 FQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 ++P ++N ++ G+ + HQD E ++ P++S+SLG +F G R+D L Sbjct: 194 YEPQTAIVNYFSVGSMMMAHQDVSEESMQHPLISISLGCSCVFLMGTSSRDDAPYAFWLR 253 Query: 172 HGDVVVWGGESRLFYHGI 189 GDV V+ G SR+ +H I Sbjct: 254 SGDVAVFSGPSRVAFHSI 271 >UniRef50_D0NYX8 Alkylated DNA repair protein alkB-like protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NYX8_PHYIN Length = 309 Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 1/98 (1%) Query: 92 MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLP 151 +P+ L + A G + +A ++N Y + + H D E + P+VS+SLG Sbjct: 157 VPELLQQLGTKCAAVCGMT-LEAEAVIVNYYKTKSSMGGHLDDVEYTMDHPVVSLSLGSK 215 Query: 152 AIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 +F GG +++ +LL GD+ + GG SR YHG+ Sbjct: 216 CVFLMGGHTKDEAPLEILLRSGDIAIMGGASRTCYHGV 253 >UniRef50_UPI000186D7D6 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D7D6 Length = 278 Score = 60.5 bits (145), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 30/88 (34%), Positives = 45/88 (51%) Query: 90 PAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLG 149 + P+ + LC A + F +A ++N Y + LS H D E ++ AP+ S S G Sbjct: 154 SSFPEDLNLLCNHFANNFFFEGFNAEAAIVNMYHLNSTLSGHTDTSELNINAPLFSFSFG 213 Query: 150 LPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 AIF GG ND +L+E GDV++ Sbjct: 214 QSAIFLIGGKFINDSALPILVESGDVLI 241 >UniRef50_D0N998 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N998_PHYIN Length = 292 Score = 60.1 bits (144), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 45/161 (27%), Positives = 71/161 (44%), Gaps = 26/161 (16%) Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD---------FQPDACLI 119 + R Y Y+PI +P S+ QR+ AA D PD C++ Sbjct: 132 YEDQRSNYDYAPIR--------TLPDSWKTYAQRSLDAAKKIDPLVMGSCKKMTPDICVV 183 Query: 120 NRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGL--KRNDPLKRLLLEHG 173 N Y + +H DKDE D + +P++S S+G A F + ++ + + LE G Sbjct: 184 NFYKKAGRNGMHIDKDESDEAMSMGSPVISFSVGCAAEFAYIDHYPDPHEAVPIVRLESG 243 Query: 174 DVVVWGGESRLFYHGIQPLKAGFHPLTI---DCRYNLTFRQ 211 D +V+GG +R H + + P + R NLTFR+ Sbjct: 244 DALVFGGPARTVVHALTRVYNNTQPSWLRMRSGRLNLTFRE 284 >UniRef50_Q5K7S3 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5K7S3_CRYNE Length = 425 Score = 59.7 bits (143), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 43/155 (27%), Positives = 66/155 (42%), Gaps = 25/155 (16%) Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP-------------- 110 +LGW Y P+T P+PA +LC A + + Sbjct: 234 ANLGWVYQWSTKSYD-FAPETPIPFPA---PLADLCSEAVASVPWENVFSSVSDPDASTY 289 Query: 111 -------DFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND 163 D++PD ++N Y L H D+ E D P+VSVSLG AI G R++ Sbjct: 290 GWQSWPRDYKPDTGIVNFYQLNDTLMAHVDRAELDPARPLVSVSLGHAAILLLGSDSRDE 349 Query: 164 PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP 198 + ++L GD+++ G+ R YHG+ + G P Sbjct: 350 VPRPIILRSGDMLIMSGKGRQSYHGVPRILEGSLP 384 >UniRef50_B6KBE4 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KBE4_TOXGO Length = 927 Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%) Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 +A ++N Y G +L H+D D AP++S+SLG PAIF GG R K L+L GD Sbjct: 679 NAAILNVYRKGDRLRGHRD-DAERAEAPLISISLGQPAIFLLGGDSRRVAPKALVLRSGD 737 Query: 175 VVVWGGESRLFYHGIQPL 192 V+V G +R HG+ L Sbjct: 738 VLVLSGAARWAVHGVPKL 755 >UniRef50_A4S344 Predicted protein n=2 Tax=Mamiellales RepID=A4S344_OSTLU Length = 348 Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 12/114 (10%) Query: 111 DFQPDACLINRYAPGAKLSLHQDKDEPDLRA-----PIVSVSLGLPAIFQFGGLKRNDPL 165 + P CL+N Y GA+ H+D ++P L PIVS S+GL F + +DP Sbjct: 211 NMNPTTCLVNFYKDGAEFKWHKDSEDPKLVKSRTGPPIVSFSVGLSGDFGY-KYSFDDPE 269 Query: 166 KRLL-LEHGDVVVWGGESRLFYHGIQPLKAGFHP-----LTIDCRYNLTFRQAG 213 +++ L GDV+++GG SR+ H + + G P ++ R N+T R G Sbjct: 270 HKVVRLNSGDVLLFGGPSRMIVHSVLNVYPGSMPGHLRGKMLNGRLNVTVRDIG 323 >UniRef50_A0C122 Chromosome undetermined scaffold_140, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C122_PARTE Length = 312 Score = 57.8 bits (138), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 37/126 (29%), Positives = 62/126 (49%), Gaps = 2/126 (1%) Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD-FQPDACLINRYAPGAKLSLHQD 133 GY Y + Q + +P + QRA + +Q ++ +IN Y ++ H D Sbjct: 136 GYQYDWNNRQYPQEKTQVPDPIQEISQRANNFLQLQNQYQSESVIINFYQSHDYMTGHLD 195 Query: 134 KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI-QPL 192 E D +PI S S GL ++F GG +++ + L+ GD++V G +R YHG+ + L Sbjct: 196 DAELDQDSPIYSFSFGLSSVFVIGGPTKDEKPIAIKLDSGDLLVMSGHARKCYHGVPRVL 255 Query: 193 KAGFHP 198 F+P Sbjct: 256 ADSFNP 261 >UniRef50_UPI00019271E1 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI00019271E1 Length = 350 Score = 57.8 bits (138), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 1/132 (0%) Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC 117 SV L WT + Y+ +D + K + P+ L A + ++ + Sbjct: 137 SVENNFINQLRWTHMGYHFDYNIVDYKA-KEYYGFPKDLAELTVTIADVFKFQNYIAETG 195 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 +IN Y G+ + H D E +L P++S S G A+F GG R+ + + + GD+++ Sbjct: 196 IINYYPEGSSMGGHTDHYEEELSQPLISYSFGQAAVFLIGGPTRDIKPEGIWVRTGDIIL 255 Query: 178 WGGESRLFYHGI 189 G SR +H + Sbjct: 256 MTGPSRTAFHAV 267 >UniRef50_A7AWB3 Putative uncharacterized protein n=1 Tax=Babesia bovis RepID=A7AWB3_BABBO Length = 336 Score = 56.6 bits (135), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 34/94 (36%), Positives = 49/94 (52%), Gaps = 7/94 (7%) Query: 111 DFQPDAC--LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRL 168 D+ PD C +IN Y+ L LH+D D + ++++SLG PAIF GG + Sbjct: 183 DYIPDVCAAIINFYSKAYFLRLHKD-DAEETDDSVLNISLGAPAIFMLGGTDHSTIPVSF 241 Query: 169 LLEHGDVVVWGGESRLFYHGIQPL----KAGFHP 198 ++E G VV+ +SR HGI L K G+ P Sbjct: 242 VVESGSVVLMADKSRFCLHGIVKLLSYNKPGYQP 275 >UniRef50_Q9LJH4 Emb|CAB82748.1 n=1 Tax=Arabidopsis thaliana RepID=Q9LJH4_ARATH Length = 330 Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 52/184 (28%), Positives = 68/184 (36%), Gaps = 52/184 (28%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRA-------------ATAAG--YPD 111 LG Q Y I P P +P F L ++A T G P Sbjct: 134 LGKNWDCQTRRYGEIRPIDGSVPPRIPVEFSQLVEKAIKESKSLVATNSNETKGGDEIPL 193 Query: 112 FQPDACLINRYAPGAKLSLHQ---------------------------------DKDEP- 137 PD C++N Y KL LHQ DK E Sbjct: 194 LLPDICVVNFYTSTGKLGLHQVVTSIQYRKNCSSLYYCSFKFLHHQLIESFMAQDKGESK 253 Query: 138 -DLRA--PIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKA 194 LR PIVS S+G A F +G K D L+LE GDV+++G SR +HG++ ++ Sbjct: 254 KSLRKGLPIVSFSIGDSAEFLYGDQKDVDKADTLILESGDVLIFGERSRNVFHGVRSIRK 313 Query: 195 GFHP 198 P Sbjct: 314 ILPP 317 >UniRef50_Q22MH4 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22MH4_TETTH Length = 403 Score = 55.1 bits (131), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 9/122 (7%) Query: 86 NKPWPA----MPQSFHNLCQRAATAAGYP-----DFQPDACLINRYAPGAKLSLHQDKDE 136 N+ +P+ MP + L + A D++P+A ++N Y +S H D E Sbjct: 205 NRLYPSFTTPMPDIINELAEFAKNVVSDEITDVYDYEPEAVIVNYYDKKNYMSGHLDDGE 264 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGF 196 D ++PI S + G IF G ++ L L+ GD+++ G SR YHG+ + G Sbjct: 265 KDQKSPIFSFTFGCSCIFLMGDRTKDFTPLPLRLDAGDLMIMSGYSRNCYHGVPRIFPGS 324 Query: 197 HP 198 P Sbjct: 325 FP 326 >UniRef50_A9TLH2 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TLH2_PHYPA Length = 697 Score = 54.7 bits (130), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 43/137 (31%), Positives = 63/137 (45%), Gaps = 36/137 (26%) Query: 111 DFQPDACLINRY-APGAKLSL-----HQDKDEPDLRAPIVSVSLGLPAIFQF-------- 156 +F+PD L+N Y A +L + HQD D+ P+VSVS+G F + Sbjct: 557 NFEPDVALVNFYPAKDEELGVVGLGGHQDLDDY-CDMPVVSVSVGDSMTFFYRRFPPQSR 615 Query: 157 ------------------GGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP 198 G ++ K+++L GDV+V+GGESRL YHG + ++ G P Sbjct: 616 RKSGVQIIVDEYAAQCCKDGDTAHNSEKKIILASGDVLVFGGESRLVYHGTRCVQPGTRP 675 Query: 199 LTIDC---RYNLTFRQA 212 + R N TFRQ Sbjct: 676 PGLHMAPGRLNFTFRQC 692 >UniRef50_C4Q8H0 Expressed protein n=2 Tax=Schistosoma RepID=C4Q8H0_SCHMA Length = 334 Score = 52.4 bits (124), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 18/94 (19%) Query: 111 DFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR--- 167 ++ P+A ++N Y + H D E D AP+VS+S G A+F L+ ++ +K Sbjct: 180 NYTPEASIVNYYRTKTTMGFHSDDAEVDKEAPLVSISFGPTALFL---LETSEAIKHEFD 236 Query: 168 ------------LLLEHGDVVVWGGESRLFYHGI 189 + L HGDVV+ G+SRL H + Sbjct: 237 APLHGSFDHVLPIYLHHGDVVIMAGKSRLARHAV 270 >UniRef50_B1XR40 Oxidoreductase, 2OG-Fe(II) oxygenase family n=4 Tax=Bacteria RepID=B1XR40_SYNP2 Length = 204 Score = 52.4 bits (124), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 32/102 (31%), Positives = 50/102 (49%), Gaps = 4/102 (3%) Query: 114 PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHG 173 PD ++N Y PG ++ H D +P I+S+SL P I F + N+ +LL Sbjct: 102 PDQAIVNEYLPGQGITSHVDC-KPCFTDTIISLSLNAPCIMNFDSIVNNERQSKLLKPRS 160 Query: 174 DVVVWGGESRLFYHGIQPLKA---GFHPLTIDCRYNLTFRQA 212 V++ G L+ HGI P K+ + D R ++TFR+ Sbjct: 161 LVILQGESRYLWKHGIPPRKSDQWNGQKIMRDRRISITFRKV 202 >UniRef50_A8PV44 ALKBH protein, putative n=1 Tax=Brugia malayi RepID=A8PV44_BRUMA Length = 339 Score = 52.0 bits (123), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 33/98 (33%), Positives = 52/98 (53%) Query: 92 MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLP 151 +P+ +L + A G DA +IN Y+ + L+ H D+ E L +P++S+S G Sbjct: 204 LPEELVSLSDVLSQALGIGPMYADAAIINFYSRKSTLAPHVDRSERSLSSPLISLSFGQT 263 Query: 152 AIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 AI+ GG +DP+ + GDV+V G RL YH + Sbjct: 264 AIYLAGGTDLDDPVDAFYIRSGDVLVIYGPQRLIYHAV 301 >UniRef50_Q0U5B3 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0U5B3_PHANO Length = 298 Score = 51.2 bits (121), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 8/109 (7%) Query: 90 PAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLG 149 P P N+ + T + A ++N Y+PG LS+H+D E ++S+SLG Sbjct: 175 PPFPADTKNMLESIFTTT-----RAQAAIVNLYSPGDTLSVHRDVAETSSHG-LISLSLG 228 Query: 150 LPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP 198 A+F G +D + L L G V G SR +HG+ + AG P Sbjct: 229 CDAVFVIG--TDDDKVLTLRLRSGSAVYMSGASRFAWHGVPQIVAGSCP 275 >UniRef50_C5L3Y2 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5L3Y2_9ALVE Length = 325 Score = 50.4 bits (119), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 44/128 (34%), Positives = 56/128 (43%), Gaps = 10/128 (7%) Query: 92 MPQSFHNLCQR--AATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLG 149 MP L +R A + A DF+PD IN Y PG +S H D PIV +S+G Sbjct: 166 MPAYTEELVRRIRAESVAEARDFRPDQLTINEYIPGVGISFHVDTHSA-FEGPIVILSIG 224 Query: 150 LPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDC----- 203 + +F + L L L + V GGESR + HGI K D Sbjct: 225 GGIVLEFRKSEEGRALP-LWLPRRSLAVMGGESRFGWVHGIAGRKTDRVGPDGDLVERQR 283 Query: 204 RYNLTFRQ 211 R +LTFRQ Sbjct: 284 RISLTFRQ 291 >UniRef50_UPI000175883A PREDICTED: similar to alkB, alkylation repair homolog 2 n=2 Tax=Coelomata RepID=UPI000175883A Length = 197 Score = 50.4 bits (119), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 43/136 (31%), Positives = 61/136 (44%), Gaps = 22/136 (16%) Query: 87 KPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK-LSLHQDKD-EPDLRAPIV 144 KPW NL +R F + LINRY G + H+D + E D PI Sbjct: 67 KPWTETLIQVRNLIKRVT------GFDYNFVLINRYRDGNDHIGEHKDNESELDKNTPIA 120 Query: 145 SVSLGLPAIFQF--------GGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHGIQPLKAG 195 S+SLG +F F GG KR+ P ++ L+HG +++ + +YH + P K Sbjct: 121 SLSLGQQRLFVFKHQDCRKKGGAKRSVPPVKIQLQHGSLLLMNPPTNNYWYHALPPAKRA 180 Query: 196 FHPLTIDCRYNLTFRQ 211 R NLTFR+ Sbjct: 181 -----PGARINLTFRK 191 >UniRef50_B8ESE9 2OG-Fe(II) oxygenase n=1 Tax=Methylocella silvestris BL2 RepID=B8ESE9_METSB Length = 202 Score = 48.9 bits (115), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 51/193 (26%), Positives = 77/193 (39%), Gaps = 36/193 (18%) Query: 29 AAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKP 88 AAEQ + D A SPFR + + GW + + P +P Sbjct: 27 AAEQTLISAIDAARLSPFR-------FQGWLGKRVTASFGWRYDFETASFGPAEP----- 74 Query: 89 WPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSL 148 +P+ L + AA AG P L+ RY PGA + H+D+ L ++ +SL Sbjct: 75 ---IPEFLLPLRESAAGFAGLPTGALAQALLIRYDPGAGIGWHRDR---PLFEHVIGISL 128 Query: 149 GLPAIFQF-----GGLKR-NDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID 202 G PA+ +F G R N PL + H + G L+ H I + Sbjct: 129 GAPAVLRFRRRTAAGFDRANAPLAPRSIYH----LSGDARHLWEHSIAQVDV-------- 176 Query: 203 CRYNLTFRQAGKK 215 R+++TFR +K Sbjct: 177 ARWSITFRSLSEK 189 >UniRef50_B2SPH7 DNA repair system specific for alkylated DNA n=17 Tax=Xanthomonadaceae RepID=B2SPH7_XANOP Length = 202 Score = 48.9 bits (115), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 36/99 (36%), Positives = 48/99 (48%), Gaps = 6/99 (6%) Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAP--IVSVSLGLPAIFQFGGLKRNDPLKRLLLEH 172 ++ LINRY G+ DEP+L A I SVSLG F F + L L H Sbjct: 97 NSVLINRYRSGSDAMGWHSDDEPELGAQPLIASVSLGARRRFAFKHRDDASVKQALELGH 156 Query: 173 GDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 GD+++ GG+++ Y P A + R NLTFRQ Sbjct: 157 GDLLLMGGQTQRHYRHALPRTAK----PVGERINLTFRQ 191 >UniRef50_A1SVH4 DNA-N1-methyladenine dioxygenase n=12 Tax=Bacteria RepID=A1SVH4_PSYIN Length = 210 Score = 48.5 bits (114), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 42/116 (36%), Positives = 59/116 (50%), Gaps = 14/116 (12%) Query: 99 LCQRAATAAGYPDFQPDACLINRYAPGAK-LSLHQDKDEPDLR--APIVSVSLGLPAIFQ 155 L Q+ A G+ Q +ACL+N Y G + ++ H D E DL+ A I S+S G F Sbjct: 96 LKQKVEDATGH---QFNACLLNLYHSGQEGMAWHSDA-EKDLQKNAAIASLSFGAERKFS 151 Query: 156 FGGLKRNDPLKRLLLEHGDVVVWGGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 F K N + L+HG ++V GG++ R + H + P K P R NLTFR Sbjct: 152 FKH-KVNQKTISVSLQHGSLLVMGGDTQRHWLHRLPPTKKVTTP-----RINLTFR 201 >UniRef50_Q4UFZ4 Alkylated DNA repair protein, putative n=2 Tax=Theileria RepID=Q4UFZ4_THEAN Length = 350 Score = 48.5 bits (114), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 7/87 (8%) Query: 109 YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK-- 166 Y F D+ +IN Y+ L LH+D D + P++++SLG PAIF + + DP + Sbjct: 194 YQPFTADSAIINFYSNSYFLRLHRD-DAEETNDPVINISLGAPAIF---CICKEDPSQFP 249 Query: 167 -RLLLEHGDVVVWGGESRLFYHGIQPL 192 +++ G +++ SR HGI L Sbjct: 250 LSCVVDSGSIIIMSKNSRRCLHGISKL 276 >UniRef50_UPI000180CD20 PREDICTED: similar to alkB, alkylation repair homolog 8 (E. coli) (alkbh8) n=1 Tax=Ciona intestinalis RepID=UPI000180CD20 Length = 593 Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 47/148 (31%), Positives = 63/148 (42%), Gaps = 15/148 (10%) Query: 76 YLYSPIDPQTNKPWP-AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 + Y D N P +P NL R A GY +PD IN Y PG + H D Sbjct: 178 FRYGTNDVDINNPISEGLPNYIENLLDRIM-ATGYLPSRPDQLTINMYEPGDGIPPHTD- 235 Query: 135 DEPDLRAPIVSVSLGLPAIFQFG--GLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQP 191 + + +VSLG + F G +R D + +E + ++ GESR + HGIQ Sbjct: 236 NTRSFDGVLSTVSLGSHTVMNFSKEGAERID----VCVEPRTLFLFTGESRYEWRHGIQQ 291 Query: 192 -----LKAGFHPLTIDCRYNLTFRQAGK 214 L G T RY+LTFR K Sbjct: 292 RKFDILDQGKKITTRTIRYSLTFRTVVK 319 >UniRef50_UPI0000E484FA PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E484FA Length = 424 Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 58/219 (26%), Positives = 90/219 (41%), Gaps = 28/219 (12%) Query: 5 FADAEPWQEPLAA----GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 F ++ P +P + G VI+ F EQ I D + AS S +A Sbjct: 126 FVESVPTDKPQSNVPPPGLVIIPDFIDECLEQKIIDSIEWASPS-------------EIA 172 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLI 119 + H H + YS + +KP P MP+ + + R G+ F+PD I Sbjct: 173 NQSLKHRKVKHHGYEFNYSSNNIDRDKPLPGGMPELYGQVINRIM-ETGHVQFKPDQLTI 231 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N+Y PG + H D I+S+SL + +F + ++L ++V Sbjct: 232 NQYQPGQGIPPHVDTHSA-FEDAIISLSLESQIVMEFTHPAGHQ--VPVVLPRRSLLVMT 288 Query: 180 GESRL-FYHGIQPLKAGFHPLTIDCRY--NLTFRQAGKK 215 GE+R + HGI P K P D + NLT Q G++ Sbjct: 289 GEARYKWTHGITPKKTDVIP---DPTFPDNLTLHQRGQR 324 >UniRef50_C5KBY7 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KBY7_9ALVE Length = 332 Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 35/110 (31%), Positives = 53/110 (48%), Gaps = 12/110 (10%) Query: 92 MPQ----SFHNLCQRAATAAGY----PDFQPDACLIN---RYAPGAKLSLHQDKDEPDLR 140 MPQ +HN + A G +Q +A L+N + +L H+D E Sbjct: 157 MPQWVCDIYHNALKAADDICGSRLAEDGYQAEAALVNFFHSHRSSDRLGGHKDDVEARDH 216 Query: 141 APIVSVSLGLPAIFQFGGLKRNDPLKR-LLLEHGDVVVWGGESRLFYHGI 189 +P+V ++LGLP F GG R D +L GDV+V E+R ++HG+ Sbjct: 217 SPLVILALGLPCTFLLGGDSRVDVTPAPILFSSGDVLVLSREARQWFHGV 266 >UniRef50_Q09BP3 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09BP3_STIAU Length = 185 Score = 47.0 bits (110), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 50/157 (31%), Positives = 62/157 (39%), Gaps = 26/157 (16%) Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 VA H GW Y Y + + P PAMP L R A G Q L Sbjct: 45 VAKRRTAHFGWL-----YGYESLKVE---PGPAMPDFLLPLRNRCAELMGELPEQLVEAL 96 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF----GGLKRNDPLKRLLLEHGD 174 +N Y PGA + H +D P +V VSLG +F G +R L+ L Sbjct: 97 LNEYPPGAAIGWH--RDAPMFGHQVVGVSLGGACRMRFQRDQGEARRTYALE---LAPRS 151 Query: 175 VVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 V GGESR + H I P RY++TFR Sbjct: 152 AYVLGGESRSTWQHSI--------PAVKQERYSITFR 180 >UniRef50_D2UYR0 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2UYR0_NAEGR Length = 294 Score = 47.0 bits (110), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 31/112 (27%), Positives = 54/112 (48%), Gaps = 21/112 (18%) Query: 122 YAPGAKLSLHQDK-------DEPDLRAPIVSVSLGLPAIFQF---------GGLKRNDPL 165 Y KL H+D+ ++ + +P+VS+SLG +IF + G L+ Sbjct: 168 YTNKGKLGWHRDRISGLTPEEQHLIVSPVVSMSLGNDSIFSYKLNTINETTGKLEYGTEA 227 Query: 166 KRLLLEHGDVVVWGGESRLFYHGIQPL--KAGFHPLTID---CRYNLTFRQA 212 L L+ GD++++G R+FYH ++ + H L +D R N+T R+ Sbjct: 228 IDLQLKSGDILIFGATQRMFYHCVKRIIPSTNHHKLDMDGLSGRINITLREG 279 >UniRef50_B8J7A0 2OG-Fe(II) oxygenase n=3 Tax=Anaeromyxobacter RepID=B8J7A0_ANAD2 Length = 204 Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust. Identities = 48/156 (30%), Positives = 69/156 (44%), Gaps = 26/156 (16%) Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 +A H G + Y Y D + +P P P + L +RAA AG L Sbjct: 62 IARRRVAHFG-----RAYAY---DARAVQPGPPFPAALEPLRRRAAALAGVAPAALAEAL 113 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR---LLLEHGDV 175 + RY PGA + H +D P +V VSLG PA F+ ++ P R +LLE G Sbjct: 114 VTRYPPGAGIGWH--RDAPAF-GQVVGVSLGAPARFR---MREGGPGGRALEVLLEPGSA 167 Query: 176 VVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFR 210 + G +R + H I P+ A R+++TFR Sbjct: 168 YLLAGAARWRWQHAIPPVPA--------ERWSVTFR 195 >UniRef50_Q8T9A3 SD10403p n=17 Tax=Coelomata RepID=Q8T9A3_DROME Length = 615 Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust. Identities = 60/222 (27%), Positives = 90/222 (40%), Gaps = 30/222 (13%) Query: 5 FADAEPWQEPLAAGAVILRRFAFNAAEQ-LIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 A W +PL G I+ F E L+R I + S T S+ N Sbjct: 123 LAGKSEWNKPLPRGLHIIADFVTEEEESTLLRAIGEDGRTSEG---------TGSLKHRN 173 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWP-AMPQSFHNLCQRAATAAGYPDFQ-PDACLINR 121 H G+ +LY + +KP ++P + L R + A D+ PD +N Sbjct: 174 VKHFGFE-----FLYGTNNVDPSKPLEQSIPSACDILWPRLNSFASTWDWSSPDQLTVNE 228 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PG + H D L PI+S+SL + F +R D ++ L +++ GE Sbjct: 229 YEPGHGIPPHVDTHSAFL-DPILSLSLQSDVVMDF---RRGDDQVQVRLPRRSLLIMSGE 284 Query: 182 SRL-FYHGIQPLKAGFHP-----LTIDC---RYNLTFRQAGK 214 +R + HGI+P P LT R +LTFR+ K Sbjct: 285 ARYDWTHGIRPKHIDVVPSASGGLTTQARGKRTSLTFRRLRK 326 >UniRef50_A4SZF3 DNA-N1-methyladenine dioxygenase n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SZF3_POLSQ Length = 209 Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust. Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 9/105 (8%) Query: 110 PDFQPDACLINRYAPGAK-LSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR 167 P + ++CL+N Y GA + H D + E D ++PI S+SLG F F K++ Sbjct: 105 PQAEFNSCLLNFYHDGADGMGWHSDDEKELDAQSPIASLSLGSARKFSFKH-KKDKSTTS 163 Query: 168 LLLEHGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQ 211 L LE+G ++ ++ F+ H + K P R NLTFR+ Sbjct: 164 LFLENGSALIMHAPTQQFWQHALLKTKTIHTP-----RINLTFRR 203 >UniRef50_D2A2C2 Putative uncharacterized protein GLEAN_07671 n=1 Tax=Tribolium castaneum RepID=D2A2C2_TRICA Length = 582 Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust. Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 13/112 (11%) Query: 111 DFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLL 170 +F+P+ INRY PG + H D PI+S+SL + +F + D +LL Sbjct: 199 EFRPNQLTINRYNPGQGIPSHVDTHSA-FGDPILSLSLSSDVVMEF----KKDETICVLL 253 Query: 171 EHGDVVVWGGESRL-FYHGIQPL-------KAGFHPLTIDCRYNLTFRQAGK 214 ++V GESR + HGI P + G H R + TFR+ K Sbjct: 254 PRRSLLVMAGESRYEWTHGIVPRTFDFYNDEGGCHCFKRGVRVSFTFRKIRK 305 >UniRef50_A5WBM5 DNA-N1-methyladenine dioxygenase n=5 Tax=Moraxellaceae RepID=A5WBM5_PSYWF Length = 212 Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust. Identities = 52/174 (29%), Positives = 78/174 (44%), Gaps = 34/174 (19%) Query: 47 RQMV----TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPW-PAMPQSFHNLCQ 101 RQ+V TP T +++ T GH PI+P W PA+ H + Q Sbjct: 59 RQIVWMGDTPSASTQALSYTYSGHT-----------RPIEP-----WHPAVFHVKHMIEQ 102 Query: 102 RAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDL--RAPIVSVSLGLPAIFQFGGL 159 + F ++CL+N Y G + + DEP+L + I S+SLG F F Sbjct: 103 QLQPLKICTQF--NSCLLNYYPSGEEGMGYHADDEPELGYQPIIASLSLGATRKFVFKHK 160 Query: 160 KRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDC-RYNLTFRQ 211 K D ++ L LE G +VV G+++ ++ H I K +D R +LTFR Sbjct: 161 KTQDKVE-LYLESGQLVVMRGDTQQYWKHSITKTKK------VDTGRISLTFRH 207 >UniRef50_C6XT27 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT27_PEDHD Length = 201 Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust. Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 8/103 (7%) Query: 111 DFQPDACLINRYAPGA-KLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRL 168 D + + CL+N Y GA + H+D ++ + P I SVS G P IFQF P+ + Sbjct: 100 DVEFNTCLLNHYRSGADSIGWHRDNEKNLGQYPFIASVSFGAPRIFQFRHYTDKIPIISV 159 Query: 169 LLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 L HG +++ +++ L+ H + + P R NLTFR Sbjct: 160 ELTHGSLLIMKADTQHLWEHRLPKILRPVGP-----RINLTFR 197 >UniRef50_Q26EI7 Alkylated DNA repair protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26EI7_9BACT Length = 201 Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust. Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 14/127 (11%) Query: 88 PWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLS-LHQDKDEPDLRAPIV-S 145 PW Q ++A A + CLINRY G + H D ++ PI+ S Sbjct: 82 PWTETLQKIKQDVEKATGATF------NICLINRYRNGQDSNGWHADNEKELGINPIIAS 135 Query: 146 VSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCR 204 +SLG F D + L+HG ++V GE++ Y H I K I R Sbjct: 136 ISLGQERFFHLKHHHNKDWKFKFPLQHGSLLVMAGETQHTYKHQIAKTKR-----LIGER 190 Query: 205 YNLTFRQ 211 NLTFR+ Sbjct: 191 INLTFRK 197 >UniRef50_B0SGN3 Alkylated DNA repair protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SGN3_LEPBA Length = 202 Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust. Identities = 37/102 (36%), Positives = 55/102 (53%), Gaps = 9/102 (8%) Query: 115 DACLINRYAPGAK-LSLHQDKDEPDLR--APIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 ++CL+N Y G++ ++ H D DE L+ + I SVSLG IF+F K+N ++ L LE Sbjct: 104 NSCLLNLYHDGSEGMAWHSD-DETSLQKHSTIASVSLGAERIFRFKHKKKNSVVE-LPLE 161 Query: 172 HGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 G +++ GE + H + L R NLTFRQ G Sbjct: 162 PGSLLLMKGEIQ--EHWLHSLPKALK--VKRPRVNLTFRQFG 199 >UniRef50_A4CQ67 Alkylated DNA repair protein n=2 Tax=Flavobacteriaceae RepID=A4CQ67_9FLAO Length = 197 Score = 43.9 bits (102), Expect = 0.003, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 79/178 (44%), Gaps = 17/178 (9%) Query: 39 DVASQSPFRQ-MVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFH 97 ++ SQ+P+RQ + G T + + Q Y YS I +P P P Sbjct: 34 EIKSQTPWRQDTIRLFGKTFQQPRLTAL---YGKNGQAYTYSGI---LMEPLPFTPL-LE 86 Query: 98 NLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDL-RAPIV-SVSLGLPAIFQ 155 +L R + AAG + CL+N Y G+ + DEP+L P++ S+SLG F Sbjct: 87 DLLHRVSIAAGE---KFTTCLLNLYRDGSDSNGWHADDEPELGNNPVIASLSLGASRKFH 143 Query: 156 FGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 + R+ LE G +++ G ++ +H + + P + R NLTFR+ G Sbjct: 144 LKHRRIKSQRVRMNLESGSLLLMAGTTQ--HHWLHQVPKTKRP--VGPRINLTFRRLG 197 >UniRef50_D2XAQ5 Alkylated DNA repair protein n=1 Tax=Marseillevirus RepID=D2XAQ5_9VIRU Length = 198 Score = 43.9 bits (102), Expect = 0.004, Method: Compositional matrix adjust. Identities = 52/200 (26%), Positives = 82/200 (41%), Gaps = 20/200 (10%) Query: 24 RFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ-GYLYSPID 82 RF F + ++L R + D+ P + G + + G + H Y ++ + Sbjct: 10 RFLFGSRKKLQRQLADIEYLPPEDTAIKMHGKVIPIPRLQTG---FGKHESLSYSFTGVK 66 Query: 83 PQTNKPWPAMPQSFH-----NLCQRAATAAGYPDFQPDACLINRYAPGAK-LSLHQDKDE 136 K WP + +L ++ P P+ L+N+Y G + H DK E Sbjct: 67 IPA-KIWPPYIEKLSLKIHAHLVEQGVMGQDTPP--PNYVLVNKYLNGDHYIGWHSDK-E 122 Query: 137 PDLRA--PIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKA 194 DL PI+SVSLG F +K + K + L GDV+V + + P + Sbjct: 123 RDLMMGYPIISVSLGARRDFCLRLIKNHKHKKTISLGSGDVLVMLPGMQQVWQHCLPKRK 182 Query: 195 GFHPLTIDCRYNLTFRQAGK 214 G + RYNLTFR G+ Sbjct: 183 GLD----EPRYNLTFRWIGE 198 >UniRef50_B6GZZ6 Pc12g09870 protein n=4 Tax=Eurotiomycetidae RepID=B6GZZ6_PENCW Length = 360 Score = 43.5 bits (101), Expect = 0.005, Method: Compositional matrix adjust. Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 2/83 (2%) Query: 107 AGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK 166 A +P A ++N Y+ G LS+H+D E + ++SVS G +F + + Sbjct: 218 AAFPATSAQAAILNLYSAGDTLSVHRDVSE-ECDVGLISVSFGCDGLF-LASHDDGNGCE 275 Query: 167 RLLLEHGDVVVWGGESRLFYHGI 189 + L GD V G+SR +HG+ Sbjct: 276 IIRLRSGDTVYMSGKSRFAWHGV 298 >UniRef50_UPI00006CC0FF hypothetical protein TTHERM_00219000 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC0FF Length = 254 Score = 43.5 bits (101), Expect = 0.005, Method: Compositional matrix adjust. Identities = 41/159 (25%), Positives = 70/159 (44%), Gaps = 14/159 (8%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQS-FHNLCQRAATAAGYP-DFQPDACLINRYA 123 H G + + S I P++++ A+ S F L QR + + P+ CL+N Y Sbjct: 71 HYGVVYYHTRHNLSEIQPESSESEKALDLSVFDWLIQRLINDEVFDVSYPPNQCLVNEYD 130 Query: 124 PGAKLSLHQDKDEPDLRAPIVS-VSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES 182 KL H + E PI++ +SL P+ ++ + +L LE + V +S Sbjct: 131 NKDKLGCHVENIEA--FGPIIAGLSLHNPSYLALREVENKENKVQLYLEPRSLYVLTSDS 188 Query: 183 RLFY-HGIQPLKAGFHPLTIDC--------RYNLTFRQA 212 R + HG+ +K ++P+T R +LTFR Sbjct: 189 RYKWEHGVTKMKEIYNPITQQTIIKNETYRRVSLTFRHV 227 >UniRef50_Q6C9X6 YALI0D07546p n=1 Tax=Yarrowia lipolytica RepID=Q6C9X6_YARLI Length = 372 Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust. Identities = 51/214 (23%), Positives = 94/214 (43%), Gaps = 27/214 (12%) Query: 20 VILRRFAFNAAEQLIRDINDVASQSPF----RQMVTP---GGYTMSVAMTNCGHLGWTTH 72 V+ + A N +QL+ D N ++ F + TP YT +++ ++ H Sbjct: 113 VLEKAEAENLLDQLMEDHNSWKEKTKFYLFDKLCETPHKSAFYTKDMSV-------YSQH 165 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQ---RAATAAGYPDFQPDACLINRYAPGAK-L 128 Y P+D T++ + ++ + Q R+ G ++ DAC+ N YA ++ + Sbjct: 166 TYRYNGKPVD--TSRKYSKAMEACSDRIQELVRSTNKEGTAEWLSDACIANYYADESQSV 223 Query: 129 SLHQDK-DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK--RLLLEHGDVVVW-GGESRL 184 H D+ R I +++LG IF+ G + + +LL H +++ G Sbjct: 224 GFHSDQLTYIGPRPVIAALTLGSERIFRLKGACPDGDRRTYNILLPHNSLMIMHAGCQEA 283 Query: 185 FYHGIQPLKA---GFHPLTIDCRYNLTFRQAGKK 215 + H I P++A G H R++LTFR K+ Sbjct: 284 YKHSIIPVQAKQIGLHERAGKVRFSLTFRHYKKE 317 >UniRef50_C7RA32 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RA32_KANKD Length = 207 Score = 42.0 bits (97), Expect = 0.013, Method: Compositional matrix adjust. Identities = 39/142 (27%), Positives = 58/142 (40%), Gaps = 26/142 (18%) Query: 75 GYLYSPIDPQTNKPWP----AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSL 130 G ++ PI PW A+ + +CQ + +A L N Y G Sbjct: 79 GVIHHPI------PWSEQLLALKKRIEQVCQTSFNSA----------LFNLYRDGRDSVA 122 Query: 131 HQDKDEPDLRAP--IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHG 188 DEP+L A I S+SLG P Q K D +L L G ++V G+++ + Sbjct: 123 WHSDDEPELGAKPIIASLSLGAPRSLQLKHKKHKDLRHKLTLTSGSLLVMRGDTQRCWQH 182 Query: 189 IQPLKAGFHPLTIDCRYNLTFR 210 P + P + R N+TFR Sbjct: 183 QVPKE----PAITEPRINITFR 200 >UniRef50_Q07GB6 Oxidoreductase, putative n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GB6_ROSDO Length = 195 Score = 42.0 bits (97), Expect = 0.014, Method: Compositional matrix adjust. Identities = 52/190 (27%), Positives = 78/190 (41%), Gaps = 31/190 (16%) Query: 46 FRQMVTPGGYTM---SVAMTNCGHLG-------WTT------HRQGYLYSPIDPQTNKPW 89 FRQ V P G T ++ G L W T GY Y D + + W Sbjct: 8 FRQDVWPDGLTYLENYISEDEAGRLVQEIDAALWRTDLKRRVQHYGYRY---DYKARQAW 64 Query: 90 PA-----MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIV 144 +P+ F +L +R TA G+ PD ++N Y PG +S H D +P I Sbjct: 65 REDYLGPLPELFQSLAERL-TAEGHFQTVPDQVIVNEYQPGQGISAHIDC-QPCFGETIA 122 Query: 145 SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGF---HPLT 200 S+SL + +F + ++ L L+ ++V ++R L+ H I P K Sbjct: 123 SLSLLSACVMRFASRIYSQQME-LHLQPSSLLVLQSDARHLWTHAIPPRKTDVFEGQKYA 181 Query: 201 IDCRYNLTFR 210 R +LTFR Sbjct: 182 RARRISLTFR 191 >UniRef50_Q2MF23 TobX protein n=2 Tax=Actinomycetales RepID=Q2MF23_STRSD Length = 219 Score = 41.6 bits (96), Expect = 0.017, Method: Compositional matrix adjust. Identities = 43/156 (27%), Positives = 62/156 (39%), Gaps = 24/156 (15%) Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 VA H G+ + + +P DP +P+ F L R A AG L Sbjct: 77 VARRTVRHFGFDYGYESWRLTPTDP--------LPEEFWWLRDRCAHLAGLRPESLAQTL 128 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF----GGLKRNDPLKRLLLEHGD 174 I RY PGA + H +D P +V VSL + +F G +R L+ L Sbjct: 129 IARYPPGATIGWH--RDAPMFGPSVVGVSLLSSCLMRFQRRVGEERRVYELE--LAPRSA 184 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 V+ G + H I P+ + RY++TFR Sbjct: 185 YVLSGAARSAWQHSIPPVP--------ELRYSITFR 212 >UniRef50_C5CMR5 2OG-Fe(II) oxygenase n=1 Tax=Variovorax paradoxus S110 RepID=C5CMR5_VARPS Length = 202 Score = 41.2 bits (95), Expect = 0.021, Method: Compositional matrix adjust. Identities = 42/144 (29%), Positives = 60/144 (41%), Gaps = 11/144 (7%) Query: 69 WTTHRQGYLYS---PIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 +T R+G + D +P AMP + H L R A G LI+ Y PG Sbjct: 59 YTARRRGISFGGSYDFDKHRLRPGAAMPPALHPLRARVAAWMGMAPEDFAHMLISEYRPG 118 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF----GGLKRNDPLKRLLLEHGDVVVWGGE 181 L H +D PD IV VSL A+ Q +LL+E + + GE Sbjct: 119 TPLGWH--RDVPDF-EDIVGVSLQGDAVMQLRPYPPASASAPASLQLLIEPRSIYMLRGE 175 Query: 182 SRLFY-HGIQPLKAGFHPLTIDCR 204 +R + H I P +A + +T+ R Sbjct: 176 ARWAWQHSIAPTEALRYSITMRTR 199 >UniRef50_B2AYU6 Predicted CDS Pa_1_12280 (Fragment) n=5 Tax=Leotiomyceta RepID=B2AYU6_PODAN Length = 320 Score = 40.4 bits (93), Expect = 0.039, Method: Compositional matrix adjust. Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 19/111 (17%) Query: 117 CLINRYAPGA-KLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLK-------- 166 CL+N YA G+ +S H D + R P I S SLG F L ++ P+ Sbjct: 155 CLVNYYATGSDSISFHSDDERFLGREPAIASFSLGAARDF----LMKHKPVPPPPDGQTT 210 Query: 167 -----RLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 +LLL GD+++ G+++ + P +AG D R N+TFR+A Sbjct: 211 VFKQLKLLLASGDMILMKGKTQANWLHSIPKRAGKSSQYGDGRINITFRRA 261 >UniRef50_A4C6S7 Putative 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C6S7_9GAMM Length = 208 Score = 40.4 bits (93), Expect = 0.045, Method: Compositional matrix adjust. Identities = 42/127 (33%), Positives = 61/127 (48%), Gaps = 15/127 (11%) Query: 87 KPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSLHQDKDEPDL-RAP-I 143 +PW A+ + N R + G P +A L+N Y G + H D DEP+L R P I Sbjct: 86 EPWSAVLLAIKN---RLSHTFGVPF---NALLVNWYRDGQDSMGWHSD-DEPELGREPCI 138 Query: 144 VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC 203 S+SLG +F+ K+ + L L+ GD ++ G S+L + P + P Sbjct: 139 ASLSLGASRLFKMRQ-KQTLQVYNLQLQSGDCLLMSGRSQLDFQHSLPKQ----PSVKQG 193 Query: 204 RYNLTFR 210 R NLTFR Sbjct: 194 RINLTFR 200 >UniRef50_A5GWW3 Alkylated DNA repair protein n=2 Tax=Synechococcus RepID=A5GWW3_SYNR3 Length = 204 Score = 39.7 bits (91), Expect = 0.074, Method: Compositional matrix adjust. Identities = 46/143 (32%), Positives = 71/143 (49%), Gaps = 22/143 (15%) Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK-LSLHQD 133 GY YS +D +PW Q ++ +G+ + ++ L+N Y G + H D Sbjct: 72 GYRYSGLD-NVVEPWSPTAQRIR---EQLNELSGW---RFNSLLLNLYRDGRDAMGFHAD 124 Query: 134 KDEPDL--RAPIVSVSLGLPAIFQF---GGLKRNDPLKRLLLEHGDVVVWGGESRLFY-H 187 DEP+L API S+SLG+ F+F G + ND L L HG +++ ++L + H Sbjct: 125 -DEPELDPTAPIASLSLGVSRTFRFKPKKGHQGND--FDLELGHGALLLMDPPTQLHWLH 181 Query: 188 GIQPLKAGFHPLTIDCRYNLTFR 210 G+ P + + CR NLTFR Sbjct: 182 GL-PKRLRVN----QCRLNLTFR 199 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P05050 Alpha-ketoglutarate-dependent dioxygenase alkB n... 336 3e-91 UniRef50_Q8Z566 AlkB protein n=6 Tax=Salmonella enterica subsp. ... 306 4e-82 UniRef50_C7JEB8 DNA repair protein for alkylated DNA n=8 Tax=Ace... 277 2e-73 UniRef50_UPI000197C9F9 alpha-ketoglutarate-dependent dioxygenase... 269 4e-71 UniRef50_Q5QTX8 Alkylated DNA repair protein n=1 Tax=Idiomarina ... 267 1e-70 UniRef50_D0IWA8 2OG-Fe(II) oxygenase n=4 Tax=Proteobacteria RepI... 260 3e-68 UniRef50_A3WP94 Alkylated DNA repair protein n=1 Tax=Idiomarina ... 256 3e-67 UniRef50_Q1N5M7 2OG-Fe(II) oxygenase superfamily protein n=1 Tax... 246 3e-64 UniRef50_D0LXU5 2OG-Fe(II) oxygenase n=1 Tax=Haliangium ochraceu... 226 4e-58 UniRef50_Q28VY2 DNA-N1-methyladenine dioxygenase n=30 Tax=Bacter... 220 2e-56 UniRef50_B8GWW6 Alpha-ketoglutarate-dependent dioxygenase alkB h... 218 1e-55 UniRef50_C9CYS1 Putative uncharacterized protein n=2 Tax=Alphapr... 211 2e-53 UniRef50_A7HZ41 2OG-Fe(II) oxygenase n=1 Tax=Parvibaculum lavame... 201 1e-50 UniRef50_B0T136 2OG-Fe(II) oxygenase n=1 Tax=Caulobacter sp. K31... 199 4e-50 UniRef50_C7QZR3 2OG-Fe(II) oxygenase n=29 Tax=Actinomycetales Re... 199 7e-50 UniRef50_D2B1L1 Alkylated DNA repair protein n=3 Tax=Actinomycet... 188 1e-46 UniRef50_A3VR77 Alkylated DNA repair protein n=1 Tax=Parvularcul... 182 5e-45 UniRef50_UPI0001983B96 PREDICTED: hypothetical protein n=1 Tax=V... 177 2e-43 UniRef50_D2V2M0 Predicted protein n=1 Tax=Naegleria gruberi RepI... 177 3e-43 UniRef50_D2A2Y9 Putative uncharacterized protein GLEAN_07602 n=1... 177 3e-43 UniRef50_Q17GQ0 Putative uncharacterized protein (Fragment) n=2 ... 177 3e-43 UniRef50_A9TG90 Predicted protein n=1 Tax=Physcomitrella patens ... 176 4e-43 UniRef50_B7PUG0 Putative uncharacterized protein n=1 Tax=Ixodes ... 175 6e-43 UniRef50_D1HAA5 Whole genome shotgun sequence of line PN40024, s... 175 9e-43 UniRef50_C7NLG8 Alkylated DNA repair protein n=2 Tax=Actinomycet... 173 3e-42 UniRef50_C0WHI3 Alkylated DNA repair protein n=3 Tax=Corynebacte... 173 4e-42 UniRef50_C9YWY9 Putative DNA repair protein n=1 Tax=Streptomyces... 173 5e-42 UniRef50_B9GZQ0 Predicted protein n=13 Tax=Magnoliophyta RepID=B... 173 5e-42 UniRef50_C3XQU3 Putative uncharacterized protein n=1 Tax=Branchi... 173 5e-42 UniRef50_Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB n... 171 2e-41 UniRef50_B6JWW7 AlkB-like protein n=1 Tax=Schizosaccharomyces ja... 171 2e-41 UniRef50_Q9SA98 Alkylated DNA repair protein alkB homolog n=4 Ta... 170 3e-41 UniRef50_A7RXQ8 Predicted protein (Fragment) n=1 Tax=Nematostell... 170 3e-41 UniRef50_B3RXF0 Putative uncharacterized protein (Fragment) n=1 ... 169 6e-41 UniRef50_O60066 Alkylated DNA repair protein alkB homolog n=2 Ta... 168 8e-41 UniRef50_Q7KUZ2 AlkB n=12 Tax=Drosophila RepID=Q7KUZ2_DROME 168 9e-41 UniRef50_Q7QEU7 AGAP000155-PA n=1 Tax=Anopheles gambiae RepID=Q7... 168 2e-40 UniRef50_C2BL13 Alkylated DNA repair protein n=2 Tax=Corynebacte... 167 3e-40 UniRef50_C6TKW1 Putative uncharacterized protein n=1 Tax=Glycine... 165 9e-40 UniRef50_Q8T9A3 SD10403p n=17 Tax=Coelomata RepID=Q8T9A3_DROME 165 1e-39 UniRef50_UPI00019271E1 PREDICTED: similar to predicted protein n... 164 2e-39 UniRef50_Q9LJH2 Similarity to unknown protein n=7 Tax=Embryophyt... 164 2e-39 UniRef50_UPI00005257D6 PREDICTED: similar to AlkB CG33250-PA n=1... 164 2e-39 UniRef50_Q9LZW8 Putative uncharacterized protein T20L15_50 n=3 T... 163 5e-39 UniRef50_D1IRG0 Whole genome shotgun sequence of line PN40024, s... 162 6e-39 UniRef50_C0PA85 Putative uncharacterized protein n=1 Tax=Zea may... 162 9e-39 UniRef50_UPI0000E47318 PREDICTED: similar to LOC494680 protein n... 160 3e-38 UniRef50_UPI0000E484FA PREDICTED: hypothetical protein n=1 Tax=S... 160 3e-38 UniRef50_Q13686 Alkylated DNA repair protein alkB homolog 1 n=27... 159 5e-38 UniRef50_C0Z2F3 AT5G01780 protein n=9 Tax=Magnoliophyta RepID=C0... 157 3e-37 UniRef50_D0NYX8 Alkylated DNA repair protein alkB-like protein n... 156 6e-37 UniRef50_Q4D9X3 Alkylated DNA repair protein, putative n=3 Tax=T... 155 1e-36 UniRef50_UPI000051A07C PREDICTED: similar to AlkB CG33250-PA n=3... 152 1e-35 UniRef50_Q22MH4 Putative uncharacterized protein n=1 Tax=Tetrahy... 148 1e-34 UniRef50_A0C122 Chromosome undetermined scaffold_140, whole geno... 148 1e-34 UniRef50_B5Y3R7 Predicted protein n=1 Tax=Phaeodactylum tricornu... 147 3e-34 UniRef50_C8NRG2 DNA repair protein n=9 Tax=Actinomycetales RepID... 145 8e-34 UniRef50_Q0U5B3 Putative uncharacterized protein n=1 Tax=Phaeosp... 139 7e-32 UniRef50_A4H8G0 Putative uncharacterized protein n=3 Tax=Leishma... 138 2e-31 UniRef50_A8PV44 ALKBH protein, putative n=1 Tax=Brugia malayi Re... 136 6e-31 UniRef50_D0N998 Putative uncharacterized protein n=1 Tax=Phytoph... 135 8e-31 UniRef50_D2A2C2 Putative uncharacterized protein GLEAN_07671 n=1... 135 8e-31 UniRef50_Q6C333 YALI0F03003p n=1 Tax=Yarrowia lipolytica RepID=Q... 135 1e-30 UniRef50_B1XR40 Oxidoreductase, 2OG-Fe(II) oxygenase family n=4 ... 133 5e-30 UniRef50_C1EB25 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 132 8e-30 UniRef50_Q5K7S3 Putative uncharacterized protein n=1 Tax=Filobas... 132 1e-29 UniRef50_A9TLH2 Predicted protein n=1 Tax=Physcomitrella patens ... 130 2e-29 UniRef50_Q09BP3 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 128 1e-28 UniRef50_B8ESE9 2OG-Fe(II) oxygenase n=1 Tax=Methylocella silves... 128 2e-28 UniRef50_C1FFM9 Predicted protein (Fragment) n=2 Tax=Micromonas ... 126 6e-28 UniRef50_UPI000186D7D6 conserved hypothetical protein n=1 Tax=Pe... 125 1e-27 UniRef50_C5KBY7 Putative uncharacterized protein n=1 Tax=Perkins... 124 2e-27 UniRef50_Q010W8 Oxidoreductase, 2OG-Fe (ISS) n=1 Tax=Ostreococcu... 124 2e-27 UniRef50_UPI000180CD20 PREDICTED: similar to alkB, alkylation re... 123 4e-27 UniRef50_A8J903 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 123 4e-27 UniRef50_Q4UFZ4 Alkylated DNA repair protein, putative n=2 Tax=T... 122 1e-26 UniRef50_Q9LJH4 Emb|CAB82748.1 n=1 Tax=Arabidopsis thaliana RepI... 121 1e-26 UniRef50_A7AWB3 Putative uncharacterized protein n=1 Tax=Babesia... 119 5e-26 UniRef50_C5L3Y2 Putative uncharacterized protein n=1 Tax=Perkins... 117 3e-25 UniRef50_A4SZF3 DNA-N1-methyladenine dioxygenase n=1 Tax=Polynuc... 115 1e-24 UniRef50_A4S344 Predicted protein n=2 Tax=Mamiellales RepID=A4S3... 114 2e-24 UniRef50_C4Q8H0 Expressed protein n=2 Tax=Schistosoma RepID=C4Q8... 113 3e-24 UniRef50_B6KBE4 Putative uncharacterized protein n=3 Tax=Toxopla... 113 4e-24 UniRef50_A1SVH4 DNA-N1-methyladenine dioxygenase n=12 Tax=Bacter... 111 2e-23 UniRef50_A5WBM5 DNA-N1-methyladenine dioxygenase n=5 Tax=Moraxel... 110 5e-23 UniRef50_UPI000175883A PREDICTED: similar to alkB, alkylation re... 108 2e-22 UniRef50_C6XT27 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter heparinu... 104 2e-21 UniRef50_B2SPH7 DNA repair system specific for alkylated DNA n=1... 104 3e-21 UniRef50_B8J7A0 2OG-Fe(II) oxygenase n=3 Tax=Anaeromyxobacter Re... 97 3e-19 UniRef50_D2UYR0 Predicted protein n=1 Tax=Naegleria gruberi RepI... 89 1e-16 UniRef50_UPI0001BCD579 alkylated DNA repair protein AlkB n=1 Tax... 77 3e-13 Sequences not found previously or not previously below threshold: UniRef50_C3YRT0 Putative uncharacterized protein n=1 Tax=Branchi... 130 4e-29 UniRef50_UPI000186CFBD conserved hypothetical protein n=1 Tax=Pe... 129 7e-29 UniRef50_UPI0000DB70B1 PREDICTED: similar to CG17807-PA n=2 Tax=... 127 2e-28 UniRef50_UPI00017B2DD1 UPI00017B2DD1 related cluster n=1 Tax=Tet... 120 4e-26 UniRef50_B3RYI8 Putative uncharacterized protein n=2 Tax=Trichop... 120 4e-26 UniRef50_B2APW5 Predicted CDS Pa_4_6060 n=6 Tax=Sordariomycetes ... 120 5e-26 UniRef50_Q96BT7 Alkylated DNA repair protein alkB homolog 8 n=32... 118 1e-25 UniRef50_D1Z416 Whole genome shotgun sequence assembly, scaffold... 117 2e-25 UniRef50_A7SSH3 Predicted protein n=1 Tax=Nematostella vectensis... 117 3e-25 UniRef50_A2QVX5 Contig An11c0110, complete genome n=18 Tax=Eurot... 114 2e-24 UniRef50_A8P5P3 Putative uncharacterized protein n=1 Tax=Brugia ... 114 2e-24 UniRef50_B8HYX6 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC ... 112 8e-24 UniRef50_Q9U3P9 Protein C14B1.10, partially confirmed by transcr... 110 4e-23 UniRef50_B7QP17 Methyltransferase, putative n=1 Tax=Ixodes scapu... 107 3e-22 UniRef50_B6GZZ6 Pc12g09870 protein n=4 Tax=Eurotiomycetidae RepI... 106 5e-22 UniRef50_A7EDX1 Putative uncharacterized protein n=2 Tax=Sclerot... 106 6e-22 UniRef50_Q12QK9 DNA-N1-methyladenine dioxygenase n=5 Tax=Shewane... 105 1e-21 UniRef50_B8HQU9 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC ... 105 1e-21 UniRef50_D0MUQ0 Alkylated DNA repair protein alkB 8 n=1 Tax=Phyt... 104 2e-21 UniRef50_C3NYZ8 Alkylated DNA repair protein n=28 Tax=Bacteria R... 103 5e-21 UniRef50_B8M368 Oxidoreductase, 2OG-Fe(II) oxygenase family, put... 101 1e-20 UniRef50_Q3AYK8 DNA-N1-methyladenine dioxygenase n=12 Tax=Cyanob... 101 2e-20 UniRef50_Q2BPN4 Putative uncharacterized protein n=1 Tax=Neptuni... 101 2e-20 UniRef50_D1I753 Whole genome shotgun sequence of line PN40024, s... 101 2e-20 UniRef50_Q07GB6 Oxidoreductase, putative n=1 Tax=Roseobacter den... 100 3e-20 UniRef50_Q5UR03 Uncharacterized protein L905 n=1 Tax=Acanthamoeb... 100 6e-20 UniRef50_B6HH87 Pc20g14010 protein n=10 Tax=Leotiomyceta RepID=B... 99 7e-20 UniRef50_A9T8I6 Predicted protein n=1 Tax=Physcomitrella patens ... 99 7e-20 UniRef50_B0SGN3 Alkylated DNA repair protein n=2 Tax=Leptospira ... 99 1e-19 UniRef50_C6W3C2 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID... 99 1e-19 UniRef50_A4CQ67 Alkylated DNA repair protein n=2 Tax=Flavobacter... 99 1e-19 UniRef50_C8VDQ1 DNA repair family protein (AFU_orthologue; AFUA_... 98 2e-19 UniRef50_C3ZI75 Putative uncharacterized protein n=1 Tax=Branchi... 98 2e-19 UniRef50_C6X2N0 2OG-Fe(II) oxygenase n=4 Tax=Bacteria RepID=C6X2... 98 2e-19 UniRef50_Q26EI7 Alkylated DNA repair protein n=1 Tax=Flavobacter... 98 2e-19 UniRef50_C5KK00 Putative uncharacterized protein n=6 Tax=Perkins... 97 3e-19 UniRef50_Q6ZEA1 Slr7097 protein n=5 Tax=Bacteria RepID=Q6ZEA1_SYNY3 97 5e-19 UniRef50_C5BKX4 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 97 6e-19 UniRef50_A4C6S7 Putative 2OG-Fe(II) oxygenase superfamily protei... 96 6e-19 UniRef50_Q3IHQ7 Putative 2OG-Fe(II) oxygenase superfamily protei... 96 9e-19 UniRef50_A4S4F4 Predicted protein n=1 Tax=Ostreococcus lucimarin... 96 9e-19 UniRef50_B2AYU6 Predicted CDS Pa_1_12280 (Fragment) n=5 Tax=Leot... 95 1e-18 UniRef50_A6EJU4 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter sp. BAL3... 95 1e-18 UniRef50_C4WWX2 ACYPI004109 protein n=2 Tax=Acyrthosiphon pisum ... 95 2e-18 UniRef50_A6EGN8 Alkylated DNA repair protein n=1 Tax=Pedobacter ... 95 2e-18 UniRef50_UPI00006CCD66 hypothetical protein TTHERM_00483520 n=1 ... 95 2e-18 UniRef50_B2W6U5 Oxidoreductase domain containing protein n=2 Tax... 95 2e-18 UniRef50_Q2MF23 TobX protein n=2 Tax=Actinomycetales RepID=Q2MF2... 94 3e-18 UniRef50_Q609W8 2OG-Fe(II) oxygenase family domain protein n=1 T... 94 4e-18 UniRef50_B5JS77 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacteri... 93 4e-18 UniRef50_Q7D1B7 Putative uncharacterized protein n=1 Tax=Agrobac... 93 6e-18 UniRef50_C7RA32 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis... 93 7e-18 UniRef50_A2R3V2 Similarity to human sequence 203 from patent WO0... 93 8e-18 UniRef50_C1FJB5 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 92 9e-18 UniRef50_UPI0001927839 PREDICTED: similar to predicted protein n... 92 1e-17 UniRef50_Q00Z84 2-Oxoglutarate-and iron-dependent dioxygenase-re... 92 1e-17 UniRef50_A0D9E2 Chromosome undetermined scaffold_42, whole genom... 92 1e-17 UniRef50_B8CDM4 Predicted protein (Fragment) n=1 Tax=Thalassiosi... 92 2e-17 UniRef50_Q7MF65 Alkylated DNA repair protein n=15 Tax=Vibrionace... 92 2e-17 UniRef50_B8HW11 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=B8H... 91 2e-17 UniRef50_B8KWS1 2OG-Fe(II) oxygenase superfamily protein n=1 Tax... 91 2e-17 UniRef50_A9EAY0 2OG-Fe(II) oxygenase n=2 Tax=Flavobacteriales Re... 91 3e-17 UniRef50_Q21J14 DNA-N1-methyladenine dioxygenase n=1 Tax=Sacchar... 90 3e-17 UniRef50_B0T8K7 2OG-Fe(II) oxygenase n=3 Tax=Alphaproteobacteria... 90 4e-17 UniRef50_D2XAQ5 Alkylated DNA repair protein n=1 Tax=Marseillevi... 90 5e-17 UniRef50_Q54BK8 2-oxoglutarate and Fe-dependent oxygenase family... 90 5e-17 UniRef50_B2AC29 Predicted CDS Pa_2_14240 n=1 Tax=Podospora anser... 90 7e-17 UniRef50_A4BAI0 Putative uncharacterized protein n=1 Tax=Reineke... 90 7e-17 UniRef50_A3M1I5 DNA repair system n=12 Tax=Acinetobacter RepID=A... 89 8e-17 UniRef50_C9SM10 DNA repair family protein n=1 Tax=Verticillium a... 89 9e-17 UniRef50_A4AA20 Putative alkylated DNA repair protein n=1 Tax=Co... 89 1e-16 UniRef50_B4RAN1 DNA alkylation damage repair protein AlkB n=1 Ta... 89 1e-16 UniRef50_C5BTC2 Putative alkylated DNA repair protein n=1 Tax=Te... 88 2e-16 UniRef50_Q2UNX0 Predicted protein n=6 Tax=Trichocomaceae RepID=Q... 88 2e-16 UniRef50_B7RVL5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=2 ... 88 2e-16 UniRef50_A6SM11 Putative uncharacterized protein n=1 Tax=Botryot... 88 2e-16 UniRef50_A5GWW3 Alkylated DNA repair protein n=2 Tax=Synechococc... 88 2e-16 UniRef50_A1ZXT1 Alkylated DNA repair protein n=1 Tax=Microscilla... 88 2e-16 UniRef50_A3D131 DNA-N1-methyladenine dioxygenase n=15 Tax=Shewan... 88 2e-16 UniRef50_B4RZB3 Alkylated DNA repair protein n=4 Tax=Proteobacte... 88 2e-16 UniRef50_B4VHI5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 88 3e-16 UniRef50_A4SB59 Predicted protein (Fragment) n=2 Tax=Ostreococcu... 88 3e-16 UniRef50_Q1YTT7 Oxidoreductase, 2OG-Fe(II) oxygenase family prot... 87 5e-16 UniRef50_Q6NS38 Alpha-ketoglutarate-dependent dioxygenase alkB h... 87 6e-16 UniRef50_A5FII3 DNA-N1-methyladenine dioxygenase n=2 Tax=Flavoba... 86 6e-16 UniRef50_A9BA22 Alkylated DNA repair protein n=2 Tax=Prochloroco... 86 7e-16 UniRef50_C8XTB3 Putative uncharacterized protein n=1 Tax=Dunalie... 86 8e-16 UniRef50_A9UX55 Predicted protein (Fragment) n=1 Tax=Monosiga br... 86 9e-16 UniRef50_D2V5F7 Predicted protein (Fragment) n=1 Tax=Naegleria g... 86 9e-16 UniRef50_UPI00017458F4 Alkylated DNA repair protein n=1 Tax=Verr... 86 1e-15 UniRef50_A9AQD8 2OG-Fe(II) oxygenase n=43 Tax=Burkholderia RepID... 86 1e-15 UniRef50_Q1ECQ5 At4g02485 n=2 Tax=Arabidopsis thaliana RepID=Q1E... 85 2e-15 UniRef50_B0C4L3 Alkylated DNA repair protein n=3 Tax=Bacteria Re... 84 3e-15 UniRef50_Q15YR0 DNA-N1-methyladenine dioxygenase n=1 Tax=Pseudoa... 84 3e-15 UniRef50_UPI00006CC0FF hypothetical protein TTHERM_00219000 n=1 ... 84 4e-15 UniRef50_D2V4D7 Predicted protein n=1 Tax=Naegleria gruberi RepI... 83 4e-15 UniRef50_C7PPT9 2OG-Fe(II) oxygenase n=2 Tax=Bacteroidetes RepID... 83 5e-15 UniRef50_Q80Y20-2 Isoform 2 of Alkylated DNA repair protein alkB... 83 6e-15 UniRef50_B4EMC2 2OG-Fe(II) oxygenase superfamily protein n=14 Ta... 83 6e-15 UniRef50_C6Y338 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=C6Y... 83 6e-15 UniRef50_B8NIA9 Putative uncharacterized protein n=1 Tax=Aspergi... 83 6e-15 UniRef50_A0YHA0 Putative uncharacterized protein n=1 Tax=marine ... 82 1e-14 UniRef50_D0MWE8 Putative uncharacterized protein n=1 Tax=Phytoph... 82 1e-14 UniRef50_Q01EG7 SelMay undefined product (IC) n=1 Tax=Ostreococc... 82 1e-14 UniRef50_A6VZM6 Putative alkylated DNA repair protein n=1 Tax=Ma... 82 2e-14 UniRef50_UPI000180B7B0 PREDICTED: similar to LOC496071 protein n... 81 2e-14 UniRef50_C1N0U9 Predicted protein n=1 Tax=Micromonas pusilla CCM... 81 2e-14 UniRef50_A6GHE3 2OG-Fe(II) oxygenase n=1 Tax=Plesiocystis pacifi... 81 3e-14 UniRef50_A2Q3T7 2OG-Fe(II) oxygenase n=2 Tax=Medicago truncatula... 80 4e-14 UniRef50_A1S8P3 DNA-N1-methyladenine dioxygenase n=1 Tax=Shewane... 80 4e-14 UniRef50_A1ZYS3 Alkylated DNA repair protein n=1 Tax=Microscilla... 80 4e-14 UniRef50_D1ITZ2 Whole genome shotgun sequence of line PN40024, s... 80 6e-14 UniRef50_D2V609 Predicted protein n=2 Tax=Naegleria gruberi RepI... 80 7e-14 UniRef50_Q6C9X6 YALI0D07546p n=1 Tax=Yarrowia lipolytica RepID=Q... 79 1e-13 UniRef50_A7EEP6 Putative uncharacterized protein n=1 Tax=Sclerot... 79 1e-13 UniRef50_Q2SBS6 Alkylated DNA repair protein n=1 Tax=Hahella che... 79 1e-13 UniRef50_D2VTV7 Predicted protein n=1 Tax=Naegleria gruberi RepI... 78 2e-13 UniRef50_A1K994 DNA repair system specific for alkylated DNA n=1... 78 2e-13 UniRef50_C7PS76 2OG-Fe(II) oxygenase n=1 Tax=Chitinophaga pinens... 77 4e-13 UniRef50_A9G7L2 High confidence in function and specificity n=1 ... 77 5e-13 UniRef50_Q9SIE0 Expressed protein n=10 Tax=Magnoliophyta RepID=Q... 77 6e-13 UniRef50_B9SA55 Oxidoreductase, putative n=1 Tax=Ricinus communi... 77 6e-13 UniRef50_C5CMR5 2OG-Fe(II) oxygenase n=1 Tax=Variovorax paradoxu... 76 7e-13 UniRef50_D2VF66 Predicted protein n=1 Tax=Naegleria gruberi RepI... 76 7e-13 UniRef50_A6DH75 Putative uncharacterized protein n=1 Tax=Lentisp... 76 7e-13 UniRef50_B8C4G3 Predicted protein n=1 Tax=Thalassiosira pseudona... 75 1e-12 UniRef50_B5IQB5 DNA repair system specific for alkylated DNA n=1... 75 1e-12 UniRef50_Q3M1V0 DNA-N1-methyladenine dioxygenase n=3 Tax=Nostoca... 75 1e-12 UniRef50_Q7S1J6 Predicted protein n=3 Tax=Sordariales RepID=Q7S1... 75 2e-12 UniRef50_Q8YKL5 All7279 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 75 2e-12 UniRef50_C1FG54 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 75 2e-12 UniRef50_B6AFB9 Oxidoreductase, 2og-Fe(II) oxygenase family prot... 75 2e-12 UniRef50_UPI000179247A PREDICTED: similar to alkB, alkylation re... 75 2e-12 UniRef50_Q4SEM2 Chromosome 10 SCAF14616, whole genome shotgun se... 74 4e-12 UniRef50_Q5CYU2 F27M3_19 plant like RRM plus AlkB domain contain... 74 4e-12 UniRef50_B6EQD5 Putative uncharacterized protein n=1 Tax=Aliivib... 74 4e-12 UniRef50_A4RVX0 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 73 5e-12 UniRef50_C1MXD4 Predicted protein n=1 Tax=Micromonas pusilla CCM... 73 5e-12 UniRef50_Q96Q83 Alpha-ketoglutarate-dependent dioxygenase alkB h... 73 6e-12 UniRef50_Q4A2D1 Putative uncharacterized protein n=2 Tax=dsDNA v... 73 7e-12 UniRef50_D2V317 Predicted protein n=1 Tax=Naegleria gruberi RepI... 72 1e-11 UniRef50_Q95XY7 Putative uncharacterized protein n=3 Tax=Caenorh... 72 2e-11 UniRef50_A6ESW5 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID... 72 2e-11 UniRef50_C5DVB8 ZYRO0D05434p n=1 Tax=Zygosaccharomyces rouxii Re... 71 2e-11 UniRef50_C9NTT0 Alkylated DNA repair protein n=5 Tax=cellular or... 71 2e-11 UniRef50_B0CZJ7 Predicted protein n=2 Tax=Agaricales RepID=B0CZJ... 71 2e-11 UniRef50_C1BKL6 Alkylated repair protein alkB homolog 3 n=5 Tax=... 71 3e-11 UniRef50_Q17527 Protein B0564.2, partially confirmed by transcri... 71 3e-11 UniRef50_D1HRN8 Whole genome shotgun sequence of line PN40024, s... 71 3e-11 UniRef50_D2VCK4 Predicted protein n=2 Tax=Naegleria gruberi RepI... 71 3e-11 UniRef50_B6JV21 2 OG-Fe(II) oxygenase n=1 Tax=Schizosaccharomyce... 70 4e-11 UniRef50_D2VNR1 Putative uncharacterized protein n=1 Tax=Naegler... 70 4e-11 UniRef50_Q987X8 Msr6861 protein n=10 Tax=Proteobacteria RepID=Q9... 70 5e-11 UniRef50_Q1G659 Polyprotein n=17 Tax=root RepID=Q1G659_9SECO 70 6e-11 UniRef50_C4Y8Y4 Putative uncharacterized protein n=1 Tax=Clavisp... 70 6e-11 UniRef50_B4JDW7 GH11262 n=4 Tax=Neoptera RepID=B4JDW7_DROGR 70 8e-11 UniRef50_A4I1X6 Putative uncharacterized protein n=3 Tax=Leishma... 70 8e-11 UniRef50_Q95XY6 Putative uncharacterized protein n=1 Tax=Caenorh... 69 8e-11 UniRef50_Q4D8T3 Putative uncharacterized protein n=2 Tax=Trypano... 69 8e-11 >UniRef50_P05050 Alpha-ketoglutarate-dependent dioxygenase alkB n=232 Tax=cellular organisms RepID=ALKB_ECOLI Length = 216 Score = 336 bits (862), Expect = 3e-91, Method: Composition-based stats. Identities = 216/216 (100%), Positives = 216/216 (100%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA Sbjct: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN Sbjct: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG Sbjct: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE Sbjct: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 >UniRef50_Q8Z566 AlkB protein n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=Q8Z566_SALTI Length = 216 Score = 306 bits (784), Expect = 4e-82, Method: Composition-based stats. Identities = 172/216 (79%), Positives = 192/216 (88%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 MLDLFAD PWQEPLA GAV+LRRFAF AA+ L+ DI VASQSPFRQMVTPGGYTMSVA Sbjct: 1 MLDLFADEAPWQEPLAPGAVVLRRFAFRAAQSLLDDIGFVASQSPFRQMVTPGGYTMSVA 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 MTNCG LGWTT GY Y+ DP T+KPWPA+P SF ++C++AA AAGY FQPDACLIN Sbjct: 61 MTNCGALGWTTDGHGYCYAVRDPLTDKPWPALPLSFASVCRQAAIAAGYASFQPDACLIN 120 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RYAPGAKLSLHQDKDEPDLRAPIVSVSLG+PA+FQFGGL+R+DPL+R+LLEHGD+VVWGG Sbjct: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGVPAVFQFGGLRRSDPLQRILLEHGDIVVWGG 180 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 ESRLFYHGIQPLKAGFHP+T + RYNLTFRQA +KE Sbjct: 181 ESRLFYHGIQPLKAGFHPMTGEFRYNLTFRQAAEKE 216 >UniRef50_C7JEB8 DNA repair protein for alkylated DNA n=8 Tax=Acetobacter pasteurianus RepID=C7JEB8_ACEP3 Length = 222 Score = 277 bits (709), Expect = 2e-73, Method: Composition-based stats. Identities = 106/207 (51%), Positives = 141/207 (68%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 L D P L AGAV+L FA + AE + I+ +A Q+PFR+M TPGG MSVAMT Sbjct: 12 LLPDTRPDYVQLDAGAVLLPGFALHDAEACMLAIHHIAQQAPFRKMHTPGGGQMSVAMTC 71 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 CG GW + QGY Y+ ++P T +PWP MP F L +AA AG+ FQP+ACLIN Y+ Sbjct: 72 CGTFGWISTAQGYSYTKVNPFTGQPWPDMPAIFQALAHKAAQKAGFAQFQPNACLINSYS 131 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 PGA++ LHQD+DE P+VS+S GL A F +GGLKR+DP +++LL+ GDV+VWGG R Sbjct: 132 PGARMGLHQDRDEGCTDQPVVSLSFGLEATFLWGGLKRSDPTRQILLKDGDVLVWGGPDR 191 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 L +HG++P+ +G H T + R N+TFR Sbjct: 192 LRFHGVKPIHSGAHIRTGETRLNITFR 218 >UniRef50_UPI000197C9F9 alpha-ketoglutarate-dependent dioxygenase alkB n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C9F9 Length = 214 Score = 269 bits (689), Expect = 4e-71, Method: Composition-based stats. Identities = 112/209 (53%), Positives = 141/209 (67%), Gaps = 1/209 (0%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 LF D + + +A A +L+ F ++ L++ +++V + +P R M TP GY MS AMTN Sbjct: 5 LFPDEDNIVQ-IAPEAFLLKGFLLGQSDALLQSLSNVITANPLRHMATPNGYQMSAAMTN 63 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 CG GW T ++GY YS DP TN+PW MP SF L AA+ AG+ F PDACLINRYA Sbjct: 64 CGDWGWVTDKKGYRYSQRDPVTNQPWQPMPISFVQLATSAASTAGFEHFIPDACLINRYA 123 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 GA +SLHQDKDE D PIVS SLGLP IF FGG R+ P + LEHGDV+VWGG SR Sbjct: 124 VGAAMSLHQDKDEADFTHPIVSFSLGLPTIFDFGGATRDAPKIAVYLEHGDVLVWGGRSR 183 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 L YHG++ +K+G HPL RYNLTFR++ Sbjct: 184 LNYHGVRRIKSGVHPLLGPYRYNLTFRRS 212 >UniRef50_Q5QTX8 Alkylated DNA repair protein n=1 Tax=Idiomarina loihiensis RepID=Q5QTX8_IDILO Length = 217 Score = 267 bits (684), Expect = 1e-70, Method: Composition-based stats. Identities = 105/214 (49%), Positives = 135/214 (63%), Gaps = 2/214 (0%) Query: 2 LDLFAD--AEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSV 59 LDLFA+ +EP ++ A + +F E L+ DI V QSP R + TP G+ MSV Sbjct: 4 LDLFANDGSEPLSTEISEQATLFHQFLLADDEALLNDIRGVLKQSPLRHLATPAGHKMSV 63 Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 ++CG GW + + GY Y IDP T +PWP +PQS + + AG+ +FQPD+CLI Sbjct: 64 KSSSCGSYGWLSDKHGYRYQNIDPVTGQPWPDIPQSILVKATQVSRLAGFQNFQPDSCLI 123 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y PGAK+ LHQDK+E D PIVS S GLP F +GG KR+D ++ L+H D +VWG Sbjct: 124 NVYTPGAKMGLHQDKNEADFSKPIVSFSFGLPITFMWGGFKRSDKYQKFSLQHADALVWG 183 Query: 180 GESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 G+ RL YHG+Q LK HPLT CR NLT RQAG Sbjct: 184 GKDRLRYHGVQQLKEAMHPLTGRCRVNLTIRQAG 217 >UniRef50_D0IWA8 2OG-Fe(II) oxygenase n=4 Tax=Proteobacteria RepID=D0IWA8_COMTE Length = 224 Score = 260 bits (664), Expect = 3e-68, Method: Composition-based stats. Identities = 113/214 (52%), Positives = 140/214 (65%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L LF E + GAV+LR FA ++ + ++ + + + FR M PGG MSVA+ Sbjct: 3 LSLFPADSLPAEIIDDGAVLLRGFAAAEEQRWVAEVTALQTGAAFRTMQVPGGKFMSVAI 62 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 TN G GW + QGY YS +DPQT KPWPA+P RAA AGYP F PDACLINR Sbjct: 63 TNAGGWGWISDLQGYRYSAVDPQTGKPWPAIPAFLGEQAARAAALAGYPGFAPDACLINR 122 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PGA++ LH+D+DE D APIVSVSLGLP F +GGL R P +RL L HGDV+VWGG Sbjct: 123 YQPGARMGLHRDQDEHDFAAPIVSVSLGLPCRFLWGGLTRQSPTRRLALTHGDVLVWGGP 182 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 SRL +HG+ PL+ G HPL + R+NLTFR A + Sbjct: 183 SRLVFHGVAPLREGQHPLLGNERWNLTFRMAKAR 216 >UniRef50_A3WP94 Alkylated DNA repair protein n=1 Tax=Idiomarina baltica OS145 RepID=A3WP94_9GAMM Length = 209 Score = 256 bits (655), Expect = 3e-67, Method: Composition-based stats. Identities = 100/211 (47%), Positives = 125/211 (59%), Gaps = 2/211 (0%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 + L + P A GA +L A A ++ I + Q+P R +TPG MSV Sbjct: 1 MSLLDASGPI--EFAPGAWLLPNHASEQAADILAAIRECVRQAPLRHFMTPGNKPMSVLS 58 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 +NCG GW + +GY Y DP+++KPWP +P N + A AGYP+F P+ACLIN Sbjct: 59 SNCGDFGWVSDSKGYRYQATDPKSDKPWPDIPSILLNDATQVAEQAGYPEFLPNACLINV 118 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PGAK+ LHQD+DE DL P+VS S GLPA F + G R +RL L HGDV+VWGG Sbjct: 119 YKPGAKMGLHQDRDESDLNEPVVSYSFGLPARFIWAGQTRTGTKQRLPLNHGDVLVWGGP 178 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 SRL YHGI L G HPLT R NLT R+A Sbjct: 179 SRLNYHGIDKLVEGTHPLTQQTRVNLTLRKA 209 >UniRef50_Q1N5M7 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=Bermanella marisrubri RepID=Q1N5M7_9GAMM Length = 212 Score = 246 bits (629), Expect = 3e-64, Method: Composition-based stats. Identities = 91/212 (42%), Positives = 119/212 (56%), Gaps = 2/212 (0%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 M DLFA E E L G + + E + I++VA Q+PFR M+TP G+ M VA Sbjct: 1 MSDLFASNEV--EILDHGQGLFQIRNLVNTEATMAAIHEVAKQAPFRHMMTPMGHHMKVA 58 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 TNCG GW GY YS DP++ + WPAMP + + P + PDACLIN Sbjct: 59 TTNCGEYGWIAQPSGYGYSRNDPESGQSWPAMPDTIRTISDDVIAHLNLPKYSPDACLIN 118 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RY G + HQDKDE + PI+SVSLGLPAIFQ G KR + GDV + G Sbjct: 119 RYDIGTSMGRHQDKDEANFDYPIISVSLGLPAIFQVVGPKRQGKATYYSVSDGDVFILSG 178 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 ++RL+YHG+ +KA + + RYNLT R++ Sbjct: 179 QARLYYHGVNTVKANPNQPELQQRYNLTLRRS 210 >UniRef50_D0LXU5 2OG-Fe(II) oxygenase n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LXU5_HALO1 Length = 219 Score = 226 bits (577), Expect = 4e-58, Method: Composition-based stats. Identities = 84/213 (39%), Positives = 120/213 (56%), Gaps = 10/213 (4%) Query: 3 DLFADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVA 60 +LF + P PL G I+ +A L+ + V +++P +R + G +SV Sbjct: 7 ELFPEQAP---PLPEGFLHIVAALDLDAQGALLEQVRAVLAEAPAYRPSMPRTGAPLSVR 63 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 M+NCG LGW + R GY Y P+ P T + WPA+P R A +P+ACL+N Sbjct: 64 MSNCGTLGWISDRAGYRYEPLHPHTARRWPAIPPLAMAQWNRFAD----WPVRPEACLVN 119 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y G++L +H D+DE AP+VS+SLG A+++ GG RN P +RLLL GDVVV GG Sbjct: 120 LYQTGSRLGMHVDQDERAADAPVVSISLGCDAVYRLGGHTRNLPSQRLLLRSGDVVVLGG 179 Query: 181 ESRLFYHGIQPLKAGFHPLT-IDCRYNLTFRQA 212 +R YHG+ + AG PL ++ R NLT R+ Sbjct: 180 AARRCYHGVDRIVAGTSPLPELEARINLTLRRV 212 >UniRef50_Q28VY2 DNA-N1-methyladenine dioxygenase n=30 Tax=Bacteria RepID=Q28VY2_JANSC Length = 216 Score = 220 bits (562), Expect = 2e-56, Method: Composition-based stats. Identities = 73/207 (35%), Positives = 108/207 (52%), Gaps = 8/207 (3%) Query: 7 DAEPWQEPLAAGAVILRRFAFN--AAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 ++E + + G +++ + +L+ + V +P + T G MSV MT+ Sbjct: 12 ESEEFALSVDVGGIVVHPEHLDGPDQAELVEQVRRVVRSAPLYRPETRTGRKMSVRMTSA 71 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 G GW + R+GY Y P + WP +P + + + A P +CLIN Y Sbjct: 72 GTYGWISDRRGYRYDRCHPD-GQDWPPIPPMALEIWRAVSGVAQD----PQSCLINYYDA 126 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 GAK+ +HQD+DE D P+VSVSLG A+F+ GG KR + + L+ GDV V GGE+RL Sbjct: 127 GAKMGMHQDRDEGDFDMPVVSVSLGDEALFRVGGPKRGGKTQSVWLKSGDVAVMGGEARL 186 Query: 185 FYHGIQPLKAGFHPLTI-DCRYNLTFR 210 +HGI ++AG L R NLT R Sbjct: 187 NFHGIDRIRAGSSTLLPNGGRINLTMR 213 >UniRef50_B8GWW6 Alpha-ketoglutarate-dependent dioxygenase alkB homolog n=43 Tax=Alphaproteobacteria RepID=ALKB_CAUCN Length = 220 Score = 218 bits (555), Expect = 1e-55, Method: Composition-based stats. Identities = 80/199 (40%), Positives = 109/199 (54%), Gaps = 6/199 (3%) Query: 17 AGAVILRRFA-FNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G + +A L+ + A Q+PF T G MSVAMT G LGWT+ +G Sbjct: 24 PGFDVWPGLLDISAQRALVEAVLAGAEQAPFSNYRTAYGKPMSVAMTALGSLGWTSDARG 83 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y Y P+T +PWP MP + +L T G P+ PD+CL+N Y GA++ LHQD+D Sbjct: 84 YRYVDRHPETGRPWPDMPPALLDLW----TVLGDPETPPDSCLVNLYRDGARMGLHQDRD 139 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAG 195 E D R P++S+SLG A+F+ GG+ R DP + L L GDV G +RL +HG+ + G Sbjct: 140 EADPRFPVLSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPG 199 Query: 196 FHPL-TIDCRYNLTFRQAG 213 L R NLT R+A Sbjct: 200 SSSLVPGGGRINLTLRRAR 218 >UniRef50_C9CYS1 Putative uncharacterized protein n=2 Tax=Alphaproteobacteria RepID=C9CYS1_9RHOB Length = 200 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 74/196 (37%), Positives = 101/196 (51%), Gaps = 7/196 (3%) Query: 17 AGAVILRRF-AFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G I + F A A +LI+ + V +P PGG MSV MT+ G GW + + G Sbjct: 7 RGFEIHKGFLAAEAQRELIQALRPVLRAAPLFSPEVPGGGQMSVRMTSAGAFGWFSDKSG 66 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y Y+ P + + WP +P + TA D PD CL N Y GA++ LHQDKD Sbjct: 67 YRYADRHP-SGQAWPEIPAEVLKIW----TALIDRDRMPDCCLFNYYGEGARMGLHQDKD 121 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLK-A 194 E D P+VS+SLG + + GG R + + + L GDVVV GG++RL YHG+ ++ Sbjct: 122 EADFSYPVVSISLGDDGLLRVGGTSRKEKTESIWLNSGDVVVMGGDARLAYHGVDRIRFR 181 Query: 195 GFHPLTIDCRYNLTFR 210 L R NLT R Sbjct: 182 SSRLLPKGGRVNLTLR 197 >UniRef50_A7HZ41 2OG-Fe(II) oxygenase n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HZ41_PARL1 Length = 221 Score = 201 bits (512), Expect = 1e-50, Method: Composition-based stats. Identities = 71/202 (35%), Positives = 101/202 (50%), Gaps = 8/202 (3%) Query: 17 AGAVILRRF-AFNAAEQLIRDI-NDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 G + F A L+ + + P+R + G S+ TN G LGW + Sbjct: 22 DGIGLYPGFFGETAQRALVERLQAGFGAAPPYRPRMPRTGRPWSILQTNFGQLGWVSRPG 81 Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA-PGAKLSLHQD 133 GY YSP++ + PWPA+P + L A P+ CL+N Y P +++ LH+D Sbjct: 82 GYAYSPVNDVSKAPWPAIPAALLALWDDLAAY----PAPPECCLVNLYDAPKSRMGLHRD 137 Query: 134 KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLK 193 +DE L AP++S+SLG IF+ GG R D K L GDV+V GG SRL YHG+ + Sbjct: 138 EDEEALDAPVLSLSLGDTCIFRVGGFARGDKSKSFRLASGDVLVLGGASRLRYHGVDRVI 197 Query: 194 AGFHPL-TIDCRYNLTFRQAGK 214 +G L R NLT R+ + Sbjct: 198 SGSSRLIPGGGRINLTLRRVTR 219 >UniRef50_B0T136 2OG-Fe(II) oxygenase n=1 Tax=Caulobacter sp. K31 RepID=B0T136_CAUSK Length = 212 Score = 199 bits (507), Expect = 4e-50, Method: Composition-based stats. Identities = 78/212 (36%), Positives = 106/212 (50%), Gaps = 15/212 (7%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 +++ + W + L GA L R +P T G MSVAM Sbjct: 5 INVLPGFDLWPQLLDPGA----------QADLARLTLAALEVAPPAHYETAYGKAMSVAM 54 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 ++ G LGWT+ + GY Y+ P T PWPAMPQ+ +L G P PDA LIN Sbjct: 55 SSFGPLGWTSDKTGYRYTGRHPGTGAPWPAMPQALLDLW----ADLGDPQTPPDAALINL 110 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y A++ LHQD+DE D R P++S+SLG A+F+ GG R P + L L GDV G Sbjct: 111 YRGEARMGLHQDRDEADPRFPVLSISLGDTAVFRIGGTSRKGPTRSLKLSSGDVCRLSGP 170 Query: 182 SRLFYHGIQPLKAGFHPLT-IDCRYNLTFRQA 212 +RL +HG+ + G L R N+T R+A Sbjct: 171 ARLAFHGVDRILPGSSSLVAGGGRINITLRRA 202 >UniRef50_C7QZR3 2OG-Fe(II) oxygenase n=29 Tax=Actinomycetales RepID=C7QZR3_JONDD Length = 231 Score = 199 bits (505), Expect = 7e-50, Method: Composition-based stats. Identities = 77/222 (34%), Positives = 108/222 (48%), Gaps = 17/222 (7%) Query: 4 LFADAEP--WQEPLAAGAVILRRF-AFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 LF+DA+ +E +A GAV L + L R A+ + T G+ MSV Sbjct: 2 LFSDADVPRVREEIAPGAVWLPGWLTIPQQAWLARQCAQWAAGPVPIRSATVRGHPMSVK 61 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRA-ATAAGYPD----FQPD 115 T C +GW Y +D + P+ L +R A A G D + PD Sbjct: 62 -TVC--VGWHWRPYAYSRDAVDVN-GQRVVEFPKWMVRLGRRIVADATGDEDRALAYTPD 117 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGD 174 LIN Y A++ +HQDKDE L AP+VS+S+G F+FG + RN P + + L GD Sbjct: 118 TALINFYDVQARMGMHQDKDEKSL-APVVSLSIGDTCTFRFGNTENRNRPYRDIALASGD 176 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQAG 213 V V+GG SRL +HG+Q + A P R+N+T R+ G Sbjct: 177 VFVFGGPSRLAFHGVQKIHAESAPDGCGVEHGRWNITMRETG 218 >UniRef50_D2B1L1 Alkylated DNA repair protein n=3 Tax=Actinomycetales RepID=D2B1L1_STRRD Length = 213 Score = 188 bits (477), Expect = 1e-46, Method: Composition-based stats. Identities = 69/207 (33%), Positives = 101/207 (48%), Gaps = 16/207 (7%) Query: 12 QEPLAAGAVILRRFAFNA-AEQLIRDINDVASQS-PFRQMVTPGGYTMSVAMTNCGHLGW 69 + +A GAV + + A QL+R A ++ PGG MSV T C W Sbjct: 12 RAEIAPGAVHVPDWLSPARQRQLVRACRAWARPPLGMERIRLPGGGLMSVR-TVCLGRRW 70 Query: 70 TTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLS 129 Y Y T++P +PQ L + A ++PD L+N Y A + Sbjct: 71 RP----YRY------TDEPVEPLPQWLAELGRAAVAQTLGGPYEPDVALVNFYDDAATMG 120 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGL-KRNDPLKRLLLEHGDVVVWGGESRLFYHG 188 +HQD+DE AP+VS+SLG +F+FG R P + LE GD+ V+GG SRL +HG Sbjct: 121 MHQDRDERA-AAPVVSLSLGDACVFRFGNTATRARPWSDVRLESGDLFVFGGPSRLAFHG 179 Query: 189 IQPLKAGFHPLTI-DCRYNLTFRQAGK 214 ++ + G P + R N+T RQ+G+ Sbjct: 180 VRRILPGTGPHDLIQGRLNITLRQSGQ 206 >UniRef50_A3VR77 Alkylated DNA repair protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VR77_9PROT Length = 222 Score = 182 bits (463), Expect = 5e-45, Method: Composition-based stats. Identities = 73/208 (35%), Positives = 97/208 (46%), Gaps = 7/208 (3%) Query: 9 EPWQEPLA--AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH 66 +P +E + GA L + A ++ Q + VTPGG TMS N G Sbjct: 6 QPVREVVDLGEGACHLPGYLPAKAASDLQQHLIHLCQDRWIVPVTPGGQTMSARQMNLGP 65 Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 LGW T R+GY Y P P WP MP + + A P+A L+N Y P A Sbjct: 66 LGWVTDRRGYRYEPRHPVDGAAWPEMPPALIRIWNDLLPEAP----SPEAGLVNLYGPTA 121 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY 186 K+ LH+D DE PI+SVS G P F+ GG R + ++L HGDV++ G SR FY Sbjct: 122 KMGLHRDADEAAKDVPILSVSFGAPGRFRLGGATRKGSTRSIVLGHGDVLILAGPSRHFY 181 Query: 187 HGIQPL-KAGFHPLTIDCRYNLTFRQAG 213 HGI + R +LT R+ Sbjct: 182 HGIDRILLPSPLFAEDPHRLSLTLRRVT 209 >UniRef50_UPI0001983B96 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001983B96 Length = 456 Score = 177 bits (449), Expect = 2e-43, Method: Composition-based stats. Identities = 61/230 (26%), Positives = 96/230 (41%), Gaps = 31/230 (13%) Query: 11 WQEPLAAGAVILRRF-AFNAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVAMTNCGHLG 68 QE L G V+L+ + + ++++ D+ F + G + + M LG Sbjct: 230 TQEVLRPGMVLLKGYISLTEQIKMVKKCRDLGVGPGGFYRPGYQDGAKLRLQMMC---LG 286 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YPDFQ 113 Q Y P P +P F L +RA + P Sbjct: 287 MNWDPQTRKYEKWHPLDGSETPDIPHEFSVLVERAIQDSQSLIKKNSGENNVEDTLPRMS 346 Query: 114 PDACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLL 169 P+ C++N Y +L LHQD+DE + P+VS SLG A F +G + D +++ Sbjct: 347 PNICIVNFYTTSGRLGLHQDRDESEESLLKGLPVVSFSLGDSAEFLYGNQRNVDAAGKVV 406 Query: 170 LEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQA 212 LE GDV+++GG SR +HG+ + P + + R NLT RQ Sbjct: 407 LESGDVLIFGGPSRHIFHGVSSIIPNSAPNSLLEETNLLPGRLNLTLRQL 456 >UniRef50_D2V2M0 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V2M0_NAEGR Length = 314 Score = 177 bits (448), Expect = 3e-43, Method: Composition-based stats. Identities = 50/227 (22%), Positives = 82/227 (36%), Gaps = 32/227 (14%) Query: 17 AGAVILRRFAFNA-AEQLIRDINDVASQSPFRQMVTPGGYTM------SV-----AMTNC 64 G +++ I+ + P T + SV A Sbjct: 69 PGFYVIKDLMTPEKQIYWIKQALETYPNPPNITNHTMKNGEILDIFKRSVEGDESAKNYL 128 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 L W T Y ++ +K + P N C A Y ++P+A ++N Y+ Sbjct: 129 KKLAWCTLGYQYEWTTRKYHKDK-FVQFPHDIGNFCDLIACQCNYGPYKPEAAIVNFYSK 187 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 + H D E ++ PI+S+S+G +IF GG R+ K + LE GD ++ GG +R Sbjct: 188 DRLMGGHVDDAEYEMTKPIISLSIGSKSIFLLGGETRDTEPKAIFLESGDCMIMGGRARY 247 Query: 185 FYHGIQPLKAGFHPLTIDC-------------------RYNLTFRQA 212 +HGI + P + R N+ RQ Sbjct: 248 CFHGIARILKDTIPEYLQTKFVDPKYKIYAEYMERDMMRININARQV 294 >UniRef50_D2A2Y9 Putative uncharacterized protein GLEAN_07602 n=1 Tax=Tribolium castaneum RepID=D2A2Y9_TRICA Length = 297 Score = 177 bits (448), Expect = 3e-43, Method: Composition-based stats. Identities = 50/207 (24%), Positives = 81/207 (39%), Gaps = 19/207 (9%) Query: 17 AGAVILR-RFAFNAAEQLIRDINDVASQSPFRQMV-----TPGGYTMSVAMTN------C 64 G + ++ F + S+ P + + P G N Sbjct: 43 PGLIFIKNPFTSIGQRYWVVRCLQDYSKRPNKTNLDALNLVPEGKEWWEVCQNNNNKILM 102 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 L W T + + P+ L + A + + F +A ++N Y Sbjct: 103 NKLRWVTLGYHHDWESKVYAEENKG-EFPKDLAELSRFIAESLNFLHFNAEAAIVNYYHM 161 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 + LS H D E +L+AP++S+S G AIF GG ++D + L GD+VV ESRL Sbjct: 162 DSTLSGHTDHSEHNLKAPLISLSFGQTAIFLLGGKTKDDEPSAMFLRSGDIVVMSEESRL 221 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 YHG+ + L +D R+ F + Sbjct: 222 CYHGVPKI------LQMDSRFWNCFEE 242 >UniRef50_Q17GQ0 Putative uncharacterized protein (Fragment) n=2 Tax=Culicini RepID=Q17GQ0_AEDAE Length = 292 Score = 177 bits (448), Expect = 3e-43, Method: Composition-based stats. Identities = 54/200 (27%), Positives = 81/200 (40%), Gaps = 22/200 (11%) Query: 16 AAGAVILR-RFAFNAAEQLIRDINDVASQSPFRQ---------MVTPGGYT--MSVAMT- 62 G +++ F A + + P R T G+ S+A T Sbjct: 18 RPGLILIANPFTKPAQRYWMARCLQDYPKHPNRTNLPDTIMDKFGTYSGHFDWWSIAKTI 77 Query: 63 --------NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP 114 L WTT Y ++ + P LC+ A + G+ F+P Sbjct: 78 EDPQERAKLWKALRWTTLGYHYDWTNKIYEEAAR-NEFPADLEELCRHFAESLGFRGFKP 136 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 +A ++N Y G+ L+ H D E +L AP+ S S G PA+F GG R++ LLL GD Sbjct: 137 EAAIVNYYPTGSTLAGHTDHSEKNLEAPLFSFSFGQPAVFLIGGPTRDEKPDALLLRSGD 196 Query: 175 VVVWGGESRLFYHGIQPLKA 194 V+V SRL YH + + Sbjct: 197 VIVMTRASRLCYHAVPKVFP 216 >UniRef50_A9TG90 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TG90_PHYPA Length = 401 Score = 176 bits (447), Expect = 4e-43, Method: Composition-based stats. Identities = 51/185 (27%), Positives = 77/185 (41%), Gaps = 24/185 (12%) Query: 51 TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAA-TAAGY 109 T T+S + T L W T + +S P+ +P +L +R A A Sbjct: 217 TSSAKTVS-SETLVRKLRWATVGIQFDWSKRAYNEALPFQEIPPKLADLARRLAKPAMEN 275 Query: 110 PDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLL 169 DF+ +A ++N Y P L H D E D+ PIVS+SLG AIF GG R++P + Sbjct: 276 EDFKAEAAIVNFYGPDDMLGGHVDDMEADMSKPIVSISLGCKAIFLLGGTTRDEPPAAMF 335 Query: 170 LEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID----------------------CRYNL 207 + GDVV+ G +R +HG+ + + + R N+ Sbjct: 336 VRSGDVVLMAGPARHCFHGVPRIFSEAKESELPDFTSMSDVDGIQPRSIVKYLESSRINV 395 Query: 208 TFRQA 212 RQ Sbjct: 396 NIRQV 400 >UniRef50_B7PUG0 Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7PUG0_IXOSC Length = 287 Score = 175 bits (445), Expect = 6e-43, Method: Composition-based stats. Identities = 60/229 (26%), Positives = 86/229 (37%), Gaps = 19/229 (8%) Query: 2 LDLFADAEPWQEPL--AAGAVILR-RFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMS 58 L L A+ P L A G ++L F +R + ++ P V + Sbjct: 56 LGLCAERLPAAYELVDAPGLLLLPNPFTAEGQRLWVRRCLEEYTRPPHVTNVKAPATVLR 115 Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 G L W T + + + P P L A AG+ FQP+A + Sbjct: 116 AGDPFYGALRWATVGLHHDWDTKVYDKTRRSP-FPDCLRELATGLARLAGFSAFQPEAAI 174 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 +N YA + L H D E L AP+VS S G A+F GG R + LLL GDV+V Sbjct: 175 VNYYAMDSALGGHVDNSELALDAPVVSASFGQTAVFLVGGATRERRPRALLLRSGDVLVM 234 Query: 179 GGESRLFYHGIQPLKAGFHPLTIDC---------------RYNLTFRQA 212 G +RL YH + + R +++ RQ Sbjct: 235 SGPARLAYHAVPRVLPAGDDRPWAGGDAQWAPLEAYLDTHRISISVRQV 283 >UniRef50_D1HAA5 Whole genome shotgun sequence of line PN40024, scaffold_58.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HAA5_VITVI Length = 554 Score = 175 bits (444), Expect = 9e-43, Method: Composition-based stats. Identities = 61/230 (26%), Positives = 95/230 (41%), Gaps = 31/230 (13%) Query: 11 WQEPLAAGAVILRRFA-FNAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVAMTNCGHLG 68 QE L G V+L+ + ++++ D+ F + G + + M LG Sbjct: 328 TQEVLRPGMVLLKGYISLTEQIKMVKKCRDLGVGPGGFYRPGYQDGAKLRLQMMC---LG 384 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YPDFQ 113 Q Y P P +P F L +RA + P Sbjct: 385 MNWDPQTRKYEKWHPLDGSETPDIPHEFSVLVERAIQDSQSLIKKNSGENNVEDTLPRMS 444 Query: 114 PDACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLL 169 P+ C++N Y +L LHQD+DE + P+VS SLG A F +G + D +++ Sbjct: 445 PNICIVNFYTTSGRLGLHQDRDESEESLLKGLPVVSFSLGDSAEFLYGNQRNVDAAGKVV 504 Query: 170 LEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQA 212 LE GDV+++GG SR +HG+ + P + + R NLT RQ Sbjct: 505 LESGDVLIFGGPSRHIFHGVSSIIPNSAPNSLLEETNLLPGRLNLTLRQL 554 >UniRef50_C7NLG8 Alkylated DNA repair protein n=2 Tax=Actinomycetales RepID=C7NLG8_KYTSD Length = 228 Score = 173 bits (439), Expect = 3e-42, Method: Composition-based stats. Identities = 61/216 (28%), Positives = 90/216 (41%), Gaps = 21/216 (9%) Query: 12 QEPLAAGAVILRRFAFNAAEQLI-RDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 + +A G + + + + R A+ G MSV M GW Sbjct: 17 AQEVAPGCHWVPGWLDAGQQAWVVRQYRRWAAGPVPAHRPAVRGGRMSVTMV---PFGWV 73 Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ---PDACLINRYAPGAK 127 GY + Q P P +P L +RA A G+ + PD L+N Y P A Sbjct: 74 WTSAGYARTG--EQDAAPLP-VPDWMVRLYRRAVVATGFDGWAEAAPDVALVNHYRPDAS 130 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGL-KRNDPLKRLLLEHGDVVVWGGESRLFY 186 + +H+D DE AP+VS+S+G F+FG R P + LE GD+VV+GG +R + Sbjct: 131 MGMHRDADEL-TEAPVVSLSVGDACTFRFGSTETRTRPWTDIRLESGDLVVFGGPARRAF 189 Query: 187 HGIQPLKAGFHPL---------TIDCRYNLTFRQAG 213 HG+ + G + R N+T R G Sbjct: 190 HGVPRIHPGTAGPQVAAAQAEAELPGRLNITLRVTG 225 >UniRef50_C0WHI3 Alkylated DNA repair protein n=3 Tax=Corynebacterium RepID=C0WHI3_9CORY Length = 227 Score = 173 bits (439), Expect = 4e-42, Method: Composition-based stats. Identities = 62/231 (26%), Positives = 95/231 (41%), Gaps = 25/231 (10%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLI----RDINDVASQSPFRQMVTP--GGYTM 57 LF +A G + + ++++ R+I + +P + G M Sbjct: 2 LFDSLPRPNAYVAPGVGHVPGWVGIGKQKVLVEETREIARAYAHTPMAMVQPRLKSGGQM 61 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--------- 108 SV HLG H Y Y +D P +P S L A A Sbjct: 62 SVFQL---HLGRYWHYPSYRY--VDNMEGTRVPPVPDSLRELAPVALRQAAQVAPELEPW 116 Query: 109 YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKR 167 +F P+ L+N Y PG+ + +H D E AP++S+S+G A+F+ G + R P Sbjct: 117 VDNFVPEMALVNYYPPGSAMGMHVDDSEGSP-APVISLSIGDEALFRIGHTENRTKPWDD 175 Query: 168 LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQAGKK 215 + L GD+VV+GG R YHG+ + G P R N+T RQ + Sbjct: 176 VTLCSGDLVVFGGPKRFAYHGVVRVNDGTLPEGCGLQEGRINITIRQVSAR 226 >UniRef50_C9YWY9 Putative DNA repair protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YWY9_STRSW Length = 242 Score = 173 bits (438), Expect = 5e-42, Method: Composition-based stats. Identities = 68/240 (28%), Positives = 96/240 (40%), Gaps = 34/240 (14%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFN-AAEQLIRDINDVASQSP-FRQMVTPGGYTMSVA 60 +LF + + GAV + +L+ A R + TPGG TM+ Sbjct: 4 ELFPR---DRREIVPGAVHVPDRLDAGQQRRLLDACRAWARPPAGLRTVRTPGGGTMTAR 60 Query: 61 MTNCG-HLG--------------------WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNL 99 G H G T+ Y + +D P P L Sbjct: 61 QVCLGRHWGVVPACPACGRAVRDNPACPGRHTYPYAYSRTVVD-GDGAPVKPFPAWLGEL 119 Query: 100 CQRAATAAGYPDFQ---PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF 156 +RA P D LIN Y A++ +H+D DEP AP+VS+SLG +F+F Sbjct: 120 GRRAVADTLGPQRATDPYDIALINYYDADARMGMHRDSDEPS-DAPVVSLSLGDTCLFRF 178 Query: 157 GGL-KRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID--CRYNLTFRQAG 213 G R P + L GD+ V+GGE+R YHG+ + AG P + R N+T R G Sbjct: 179 GNPRTRTRPYTDVELRSGDLFVFGGEARRAYHGVPRVYAGTAPPGLGLTGRLNITLRAGG 238 >UniRef50_B9GZQ0 Predicted protein n=13 Tax=Magnoliophyta RepID=B9GZQ0_POPTR Length = 353 Score = 173 bits (438), Expect = 5e-42, Method: Composition-based stats. Identities = 49/167 (29%), Positives = 70/167 (41%), Gaps = 20/167 (11%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--YPDFQPDACLINRYA 123 L W+T + +S + + P +P L ++ A A +F P+A ++N +A Sbjct: 186 KLRWSTLGLQFDWSKRNYNVSLPHNKIPDGLCQLAKKLAAPAMPVGEEFHPEAAIVNYFA 245 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 G L H D E D PIVS+SLG AIF GG R DP + L GDVV+ GE+R Sbjct: 246 SGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSREDPPLAMFLRSGDVVLMAGEAR 305 Query: 184 LFYHGIQPLKAGF------------------HPLTIDCRYNLTFRQA 212 +HG+ + R N+ RQ Sbjct: 306 ECFHGVPRIFTDKENAEITALELHFCDENDILEYIRTSRININIRQV 352 >UniRef50_C3XQU3 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQU3_BRAFL Length = 365 Score = 173 bits (438), Expect = 5e-42, Method: Composition-based stats. Identities = 44/189 (23%), Positives = 71/189 (37%), Gaps = 35/189 (18%) Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 T L WTT Y + + Q + + P L A G+P ++ + ++N Sbjct: 152 KTLLHKLRWTTLGYHYDWDKKEYQQER-YTEFPPDLSQLSTHVAQTLGFPRYRAQSAIVN 210 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y ++L H D E D PI+S S G A+F GG ++ + L +GD++V G Sbjct: 211 YYGLDSQLGGHVDHQELDYSKPIISFSFGQTAVFLLGGKTKSVKPMAMFLRNGDIMVMSG 270 Query: 181 ESRLFYHGIQPLKAG----------------------------------FHPLTIDCRYN 206 ++RL YHG+ + +CR N Sbjct: 271 DTRLAYHGVPKILKPPIAELLPEGLCEGDREGELHESMLPSTVEKSWEETALFMEECRIN 330 Query: 207 LTFRQAGKK 215 +T RQ + Sbjct: 331 ITVRQVVAE 339 >UniRef50_Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB n=1 Tax=Dictyostelium discoideum RepID=ALKB_DICDI Length = 393 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 20/184 (10%) Query: 48 QMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA 107 + + G + L W+T Y ++P + + + P L Q+ A A Sbjct: 196 RPLDKNGEPLPTYRQLLDKLAWSTLGYQYQWTPRLY-SEEFYEEFPDDLQELVQKIAIAT 254 Query: 108 GYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR 167 + + +A +N Y+ + + H D E ++ PI+S+S G A+F G R+ Sbjct: 255 KFDPYVAEAATVNFYSEDSIMGGHLDDAEQEMEKPIISISFGSTAVFLMGAETRDIAPVP 314 Query: 168 LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI-------------------DCRYNLT 208 L + GD+V+ GG SR YHG+ + L + + R N+ Sbjct: 315 LFIRSGDIVIMGGRSRYCYHGVAKIVENSFDLGLIDENDDQDLKYKIQWLKEKNRRVNIN 374 Query: 209 FRQA 212 RQ Sbjct: 375 TRQV 378 >UniRef50_B6JWW7 AlkB-like protein n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JWW7_SCHJY Length = 296 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. Identities = 48/180 (26%), Positives = 74/180 (41%), Gaps = 16/180 (8%) Query: 50 VTPGGYTMSVAMTNC--GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAA-TA 106 + P G + V + N L W T + Y ++ P P+ +L + A Sbjct: 116 IPPTGSSKPVTVKNLMEKKLRWITFGEQYNWTTRVYPDPATAPPFPEKLGHLTEELVHKA 175 Query: 107 AGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK 166 + D++ +A ++N Y+P LS H D E DL P++S+S+GL I+ G R D K Sbjct: 176 TEFKDWKAEAAIVNFYSPRDTLSGHVDDAEDDLTLPLLSMSIGLDCIYLLGTETRKDVPK 235 Query: 167 RLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC-------------RYNLTFRQAG 213 + L GD V+ G SR YH + + P + R N RQ Sbjct: 236 AIRLHSGDAVIMTGLSRKAYHAVPKIIPNTAPSYLQLKDEAVWNQWIQTKRVNFNIRQVR 295 >UniRef50_Q9SA98 Alkylated DNA repair protein alkB homolog n=4 Tax=Magnoliophyta RepID=ALKBH_ARATH Length = 345 Score = 170 bits (431), Expect = 3e-41, Method: Composition-based stats. Identities = 49/169 (28%), Positives = 72/169 (42%), Gaps = 22/169 (13%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA--GYPDFQPDACLINRYA 123 L W+T + +S + + P +P + L + A A +F+P+ ++N + Sbjct: 176 KLRWSTLGLQFDWSKRNYDVSLPHNNIPDALCQLAKTHAAIAMPDGEEFRPEGAIVNYFG 235 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 G L H D E D PIVS+SLG AIF GG ++DP + L GDVV+ GE+R Sbjct: 236 IGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSKDDPPHAMYLRSGDVVLMAGEAR 295 Query: 184 LFYHGIQPLKAGFHPLTIDC--------------------RYNLTFRQA 212 +HGI + G I R N+ RQ Sbjct: 296 ECFHGIPRIFTGEENADIGALESELSHESGHFFAEYIKTSRININIRQV 344 >UniRef50_A7RXQ8 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RXQ8_NEMVE Length = 323 Score = 170 bits (430), Expect = 3e-41, Method: Composition-based stats. Identities = 49/191 (25%), Positives = 82/191 (42%), Gaps = 18/191 (9%) Query: 16 AAGAV-ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSV-------AMTNCGH- 66 G + I+ F A R + P + ++ MS+ A++N H Sbjct: 78 RPGLIFIVNPFVSGAQHYWARRCLEDF---PRKPNISNLDAHMSIGDDDCIWALSNSEHV 134 Query: 67 -----LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 L W + Y+ +D + K + P+ L A A GY + P+A ++N Sbjct: 135 NLIDRLRWVHLGYQFDYNVVDYKPEK-YYGFPKDLGGLMHHLAEAIGYLGYTPEAGIVNY 193 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y A + H D E DL P++SVS G A+F GG ++ L + GD+++ GE Sbjct: 194 YPLSASMGGHTDHYELDLSWPLISVSFGQSAVFLIGGKTKDVKPTALYIRSGDILIMSGE 253 Query: 182 SRLFYHGIQPL 192 +RL +H + + Sbjct: 254 ARLAFHAVPRI 264 >UniRef50_B3RXF0 Putative uncharacterized protein (Fragment) n=1 Tax=Trichoplax adhaerens RepID=B3RXF0_TRIAD Length = 271 Score = 169 bits (428), Expect = 6e-41, Method: Composition-based stats. Identities = 51/184 (27%), Positives = 74/184 (40%), Gaps = 9/184 (4%) Query: 17 AGAVILR-RFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH-------LG 68 G + +R F I+ P + + G + H L Sbjct: 19 PGFIYIRNPFLNCGQRYWIKRCLKNFHTYPSKTNLDAHGNSTKGKEVVTKHRNDLMDKLR 78 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKL 128 WTT Y +S + N+ P L + A G+P F P+A +IN Y + L Sbjct: 79 WTTLGYHYDWSTKEYYHNRK-SEFPTDLAELTKLLAATVGFPLFSPEAAIINYYKLDSTL 137 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHG 188 S H D E D AP+ S+S G AIF GG + + +E GD+ + GESRL YH Sbjct: 138 SGHTDHSEFDFTAPLFSISFGQKAIFLLGGRTTSVTPVAMYIESGDICIMSGESRLAYHA 197 Query: 189 IQPL 192 + + Sbjct: 198 VPRI 201 >UniRef50_O60066 Alkylated DNA repair protein alkB homolog n=2 Tax=Schizosaccharomyces pombe RepID=ALKBH_SCHPO Length = 297 Score = 168 bits (427), Expect = 8e-41, Method: Composition-based stats. Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 13/172 (7%) Query: 55 YTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAA-TAAGYPDFQ 113 ++V L W T + Y ++ + P P+ + ++ + + ++ Sbjct: 123 KPLTVDRLVHKKLRWVTLGEQYDWTTKEYPDPSKSPGFPKDLGDFVEKVVKESTDFLHWK 182 Query: 114 PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHG 173 +A ++N Y+PG LS H D+ E DL P++S+S+GL I+ G R++ L L G Sbjct: 183 AEAAIVNFYSPGDTLSAHIDESEEDLTLPLISLSMGLDCIYLIGTESRSEKPSALRLHSG 242 Query: 174 DVVVWGGESRLFYHGIQPLKAGFHPLTI------------DCRYNLTFRQAG 213 DVV+ G SR +H + + P + R N RQ Sbjct: 243 DVVIMTGTSRKAFHAVPKIIPNSTPNYLLTGNKAWDGWISRKRVNFNVRQVR 294 >UniRef50_Q7KUZ2 AlkB n=12 Tax=Drosophila RepID=Q7KUZ2_DROME Length = 332 Score = 168 bits (427), Expect = 9e-41, Method: Composition-based stats. Identities = 46/135 (34%), Positives = 66/135 (48%), Gaps = 1/135 (0%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 + WTT + + P P+ +LC A A GY DF+P+A ++N Y G+ Sbjct: 141 MRWTTFGYHHNWDTKIYDEEMQSP-FPEDLSSLCGLFAQALGYADFKPEAAIVNYYPVGS 199 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY 186 LS H D EP+ AP+ S S G AIF GG + + L+ GDV++ GESRL Y Sbjct: 200 TLSGHTDHSEPNKSAPLFSFSFGQTAIFLIGGRSLEEKPTAIYLQSGDVMIMSGESRLCY 259 Query: 187 HGIQPLKAGFHPLTI 201 H + + T+ Sbjct: 260 HAVPRIIKTQASATL 274 >UniRef50_Q7QEU7 AGAP000155-PA n=1 Tax=Anopheles gambiae RepID=Q7QEU7_ANOGA Length = 293 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 54/195 (27%), Positives = 73/195 (37%), Gaps = 19/195 (9%) Query: 16 AAGAVILR-RFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA-------------- 60 G +++ F A Q + P + G A Sbjct: 36 RPGLLVVANPFTAEAQRQWMTRSLADYPIPPNATNQSGVGQQARDAVGSWWEQLQTIPTP 95 Query: 61 ---MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC 117 L W T Y ++ + P P L + AT GY F P+A Sbjct: 96 AERRKFAKSLRWATLGYQYDWTNKLYDEARREP-FPCELGALVRYVATTLGYDRFSPEAA 154 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 ++N Y GA L+ H D E D AP+ S S G PA+F GG R + LLL GD+VV Sbjct: 155 IVNYYPAGATLAGHTDHSEDDQTAPLFSFSFGQPAVFLIGGTSREEHPDALLLRSGDIVV 214 Query: 178 WGGESRLFYHGIQPL 192 G SRL YH + + Sbjct: 215 MTGASRLCYHAVPRV 229 >UniRef50_C2BL13 Alkylated DNA repair protein n=2 Tax=Corynebacterium RepID=C2BL13_9CORY Length = 229 Score = 167 bits (422), Expect = 3e-40, Method: Composition-based stats. Identities = 61/231 (26%), Positives = 96/231 (41%), Gaps = 25/231 (10%) Query: 4 LFADAEPWQEPLAAGAVILRRFA-FNAAEQLIRDINDVASQ---SPFRQMVTP--GGYTM 57 LF +A G + + + + L+ ++ +A + +P + G M Sbjct: 2 LFDSLPRPSVRVAPGVGHVPAWVGVDKQKALVEEMRGIAREYANTPMAMVRPRLKSGGQM 61 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--------- 108 SV HLG H Y Y +D P +P+S + A AA Sbjct: 62 SVFQL---HLGRYWHYPSYRY--VDNMEGTRVPPVPESLRQIAPGALRAAAEVAPELEPW 116 Query: 109 YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKR 167 F P+ L+N Y PG+ + + D E AP++S+S+G A+F+ G + R P Sbjct: 117 VDTFVPEMALVNYYPPGSAMGMRVDDSEESP-APVISLSIGDEALFRMGHTEARTRPWDD 175 Query: 168 LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQAGKK 215 + L GD+VV+GG R YHG+ + G P R N+T RQ + Sbjct: 176 ITLCSGDLVVFGGPKRFAYHGVVRVNDGTLPEGCGLREGRINITIRQVSAR 226 >UniRef50_C6TKW1 Putative uncharacterized protein n=1 Tax=Glycine max RepID=C6TKW1_SOYBN Length = 311 Score = 165 bits (418), Expect = 9e-40, Method: Composition-based stats. Identities = 60/211 (28%), Positives = 90/211 (42%), Gaps = 16/211 (7%) Query: 15 LAAGAVILRRF-AFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHR 73 L G V L+ + + + E +++ ++ S G T C W Sbjct: 102 LRPGMVFLKGYLSLSDQEMIVKRCRELGVGSGGFYQHGYGEDTKMHLKMMCLEKNWDPQF 161 Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAA--TAAGYPDFQPDACLINRYAPGAKLSLH 131 Y P P +P FH+ A + A P PD C++N Y+ +L LH Sbjct: 162 GQ--YGDRRPFDGAKPPQIPPEFHSHVHSALKDSNALLPSISPDICIVNFYSETGRLGLH 219 Query: 132 QDKDE----PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYH 187 QDKDE L P++S S+G A F + + D K+LLL+ GDV+++GG SR +H Sbjct: 220 QDKDESPDSLRLGLPVISFSIGDSADFLYADHRDLDQPKKLLLQSGDVLIFGGPSRNLFH 279 Query: 188 GIQPLKAGFHP-------LTIDCRYNLTFRQ 211 G+ + P R NLTFR+ Sbjct: 280 GVASIHPNTAPNLLLQHTNLCPGRLNLTFRR 310 >UniRef50_Q8T9A3 SD10403p n=17 Tax=Coelomata RepID=Q8T9A3_DROME Length = 615 Score = 165 bits (417), Expect = 1e-39, Method: Composition-based stats. Identities = 59/225 (26%), Positives = 88/225 (39%), Gaps = 30/225 (13%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQ-LIRDINDVASQSPFRQMVTPGGYTMSVA 60 L A W +PL G I+ F E L+R I + S T S+ Sbjct: 120 LPALAGKSEWNKPLPRGLHIIADFVTEEEESTLLRAIGEDGRTSE---------GTGSLK 170 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQ-PDACL 118 N H G+ +LY + +KP +P + L R + A D+ PD Sbjct: 171 HRNVKHFGFE-----FLYGTNNVDPSKPLEQSIPSACDILWPRLNSFASTWDWSSPDQLT 225 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 +N Y PG + H D L PI+S+SL + F +R D ++ L +++ Sbjct: 226 VNEYEPGHGIPPHVDTHSAFLD-PILSLSLQSDVVMDF---RRGDDQVQVRLPRRSLLIM 281 Query: 179 GGESRL-FYHGIQPLKAGFHPLTIDC--------RYNLTFRQAGK 214 GE+R + HGI+P P R +LTFR+ K Sbjct: 282 SGEARYDWTHGIRPKHIDVVPSASGGLTTQARGKRTSLTFRRLRK 326 >UniRef50_UPI00019271E1 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI00019271E1 Length = 350 Score = 164 bits (416), Expect = 2e-39, Method: Composition-based stats. Identities = 34/148 (22%), Positives = 61/148 (41%), Gaps = 1/148 (0%) Query: 55 YTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP 114 SV L WT + Y+ +D + K + P+ L A + ++ Sbjct: 134 QEKSVENNFINQLRWTHMGYHFDYNIVDYK-AKEYYGFPKDLAELTVTIADVFKFQNYIA 192 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 + +IN Y G+ + H D E +L P++S S G A+F GG R+ + + + GD Sbjct: 193 ETGIINYYPEGSSMGGHTDHYEEELSQPLISYSFGQAAVFLIGGPTRDIKPEGIWVRTGD 252 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTID 202 +++ G SR +H + + Sbjct: 253 IILMTGPSRTAFHAVPCIITKNQKTIPH 280 >UniRef50_Q9LJH2 Similarity to unknown protein n=7 Tax=Embryophyta RepID=Q9LJH2_ARATH Length = 455 Score = 164 bits (416), Expect = 2e-39, Method: Composition-based stats. Identities = 58/227 (25%), Positives = 93/227 (40%), Gaps = 31/227 (13%) Query: 13 EPLAAGAVILRRF-AFNAAEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 + G V+L+ + + N ++ + + F Q + + M LG Sbjct: 231 TVIRPGMVLLKNYLSINDQVMIVNKCRRLGLGEGGFYQPGYRDEAKLHLKMMC---LGKN 287 Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA---------------GYPDFQPD 115 + Y P P +P F+ ++A + P PD Sbjct: 288 WDPETSRYGETRPFDGSTAPRIPAEFNQFVEKAVKESQSLAASNSKQTKGGDEIPFMLPD 347 Query: 116 ACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 C++N Y+ +L LHQDKDE + P+VS S+G A F +G + D + L LE Sbjct: 348 ICIVNFYSSTGRLGLHQDKDESENSIRKGLPVVSFSIGDSAEFLYGDQRDEDKAETLTLE 407 Query: 172 HGDVVVWGGESRLFYHGIQPLKAGFHPL-------TIDCRYNLTFRQ 211 GDV+++GG SR +HG++ ++ P R NLTFRQ Sbjct: 408 SGDVLLFGGRSRKVFHGVRSIRKDTAPKALLQETSLRPGRLNLTFRQ 454 >UniRef50_UPI00005257D6 PREDICTED: similar to AlkB CG33250-PA n=1 Tax=Ciona intestinalis RepID=UPI00005257D6 Length = 312 Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats. Identities = 47/175 (26%), Positives = 74/175 (42%), Gaps = 18/175 (10%) Query: 55 YTMSVAMTNCG---HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD 111 Y S+++ C L W T + ++ + +P +P + A+ G D Sbjct: 137 YEDSLSLLKCCPIWKLRWATLGYHHNWNSKQY-SEQPCSELPSELRKTSKLFASMIGTDD 195 Query: 112 FQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 F+ +A ++N Y G LS H D E L AP+VS+S GL A+F GG ++ + L + Sbjct: 196 FKAEASIVNYYHVGNALSPHDDTSELYLEAPLVSLSFGLSAVFLIGGTSKDQKPEALFIR 255 Query: 172 HGDVVVWGGESRLFYHGIQPLKAGFH--------------PLTIDCRYNLTFRQA 212 GDV++ G SRL YH + + R N+ RQ Sbjct: 256 SGDVIIMSGASRLAYHAVPRILKPTSDAQLKSSSVADNLDLFMSHSRLNINIRQV 310 >UniRef50_Q9LZW8 Putative uncharacterized protein T20L15_50 n=3 Tax=Arabidopsis thaliana RepID=Q9LZW8_ARATH Length = 449 Score = 163 bits (412), Expect = 5e-39, Method: Composition-based stats. Identities = 59/228 (25%), Positives = 102/228 (44%), Gaps = 35/228 (15%) Query: 13 EPLAAGAVILRRFAFNA-AEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 + + G V+L+ F +++ ++ + F Q G + + M LG Sbjct: 227 KVIRPGMVLLKDFLTPDIQVDIVKTCRELGVKPTGFYQPGYSVGSKLHLQMMC---LGRN 283 Query: 71 THRQG-YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA---------------GYPDFQP 114 Q Y + + P +P +F+ L ++A A P P Sbjct: 284 WDPQTKYR---KNTDIDSKAPEIPVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSP 340 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLR----APIVSVSLGLPAIFQFGGLKRNDPLKRLLL 170 D C++N Y+ +L LHQD+DE + PIVS S+G A F +G + + + ++L Sbjct: 341 DICIVNFYSETGRLGLHQDRDESEESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVIL 400 Query: 171 EHGDVVVWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211 E GDV+++GGESR+ +HG++ + P+++ R NLTFR Sbjct: 401 ESGDVLIFGGESRMIFHGVKSIIPNSAPMSLLNESKLRTGRLNLTFRH 448 >UniRef50_D1IRG0 Whole genome shotgun sequence of line PN40024, scaffold_2.assembly12x (Fragment) n=2 Tax=rosids RepID=D1IRG0_VITVI Length = 457 Score = 162 bits (411), Expect = 6e-39, Method: Composition-based stats. Identities = 60/235 (25%), Positives = 99/235 (42%), Gaps = 32/235 (13%) Query: 6 ADAEPWQEPLAAGAVILRRF-AFNAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVAMTN 63 A+ + + +G V+L+ + + + ++++ ++ S F Q G +++ M Sbjct: 225 AEEGLKGDVIRSGMVLLKGYISSSDQVKIVKKCQELGLGSGGFYQPGYRDGGKLNLQMMC 284 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT----------------AA 107 LG + Y P N P +P F +L + A Sbjct: 285 ---LGKNWDPETGKYEDERPVDNAKPPPIPDEFFHLVKEAIQDSQALLSKEKIEASKVEK 341 Query: 108 GYPDFQPDACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRND 163 P PD C++N Y +L LHQD+DE + P+VS S+G A F + + Sbjct: 342 ELPWMIPDICIVNFYTTSGRLGLHQDRDETEETLRKGLPVVSFSIGDSAKFLYSNQRDVF 401 Query: 164 PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211 +LLE GDV+++GGESR +HG+ + P + R NLTFRQ Sbjct: 402 NADEVLLESGDVLIFGGESRRIFHGVASILPNTSPQVLLKETNLRPGRLNLTFRQ 456 >UniRef50_C0PA85 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0PA85_MAIZE Length = 389 Score = 162 bits (409), Expect = 9e-39, Method: Composition-based stats. Identities = 48/199 (24%), Positives = 74/199 (37%), Gaps = 45/199 (22%) Query: 59 VAMTNCGHLGWTTHRQGYLYSPI-----------------------DPQTNKPWPAMPQS 95 A T L W+T + +S + + P +P + Sbjct: 190 AATTLVRKLRWSTLGLQFDWSKRTIRSKSPRNSQEKPLSSSRLVERNYDVSLPHNKIPGA 249 Query: 96 FHNLCQRAATAA--GYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAI 153 +L ++ A A +F+P+A ++N Y P L H D E D PIVS+SLG I Sbjct: 250 LASLAKKMAIPAMPSGEEFKPEAAIVNYYGPSDMLGGHVDDMEADWTKPIVSISLGCKCI 309 Query: 154 FQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC---------- 203 F GG R++ + L GD+V+ GE+R +HG+ + I Sbjct: 310 FLLGGKTRDEVPTAMFLRSGDIVLMAGEARERFHGVPRIFTESDQQEIPALISQLSSGDD 369 Query: 204 ----------RYNLTFRQA 212 R N+ RQ Sbjct: 370 VFILEYIKNSRININIRQV 388 >UniRef50_UPI0000E47318 PREDICTED: similar to LOC494680 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47318 Length = 488 Score = 160 bits (405), Expect = 3e-38, Method: Composition-based stats. Identities = 43/137 (31%), Positives = 60/137 (43%), Gaps = 1/137 (0%) Query: 56 TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD 115 + L W T Y ++ ++ P+ + A GYP FQ Sbjct: 140 QQTQEKRLMDQLRWVTLGYHYDWNNKVYNEDQ-HSLFPEDLGPMSALIAEVLGYPRFQSQ 198 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 A ++N Y + L H D E DL AP++S SLG AI GG + L L GDV Sbjct: 199 AAIVNFYHMDSTLGGHTDHSEFDLTAPLISYSLGQSAILLVGGKTKATKPLALHLRSGDV 258 Query: 176 VVWGGESRLFYHGIQPL 192 ++ GGESRL YH + + Sbjct: 259 IILGGESRLAYHAVPKI 275 >UniRef50_UPI0000E484FA PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E484FA Length = 424 Score = 160 bits (405), Expect = 3e-38, Method: Composition-based stats. Identities = 57/217 (26%), Positives = 86/217 (39%), Gaps = 24/217 (11%) Query: 5 FADAEPWQEPL----AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 F ++ P +P G VI+ F EQ I D + AS S +A Sbjct: 126 FVESVPTDKPQSNVPPPGLVIIPDFIDECLEQKIIDSIEWASPSE-------------IA 172 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLI 119 + H H + YS + +KP P MP+ + + R G+ F+PD I Sbjct: 173 NQSLKHRKVKHHGYEFNYSSNNIDRDKPLPGGMPELYGQVINRIME-TGHVQFKPDQLTI 231 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N+Y PG + H D I+S+SL + +F ++L ++V Sbjct: 232 NQYQPGQGIPPHVDTHSA-FEDAIISLSLESQIVMEFTHP--AGHQVPVVLPRRSLLVMT 288 Query: 180 GESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 GE+R + HGI P K P NLT Q G++ Sbjct: 289 GEARYKWTHGITPKKTDVIPDPTFPD-NLTLHQRGQR 324 >UniRef50_Q13686 Alkylated DNA repair protein alkB homolog 1 n=27 Tax=Euteleostomi RepID=ALKB1_HUMAN Length = 389 Score = 159 bits (403), Expect = 5e-38, Method: Composition-based stats. Identities = 37/142 (26%), Positives = 64/142 (45%), Gaps = 1/142 (0%) Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 + L W T Y + + + P L ++ A A G+ DF+ +A ++N Sbjct: 162 RSLLEKLRWVTVGYHYNWDSKKYSADH-YTPFPSDLGFLSEQVAAACGFEDFRAEAGILN 220 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y + L +H D+ E D P++S S G AIF GGL+R++ + + GD+++ G Sbjct: 221 YYRLDSTLGIHVDRSELDHSKPLLSFSFGQSAIFLLGGLQRDEAPTAMFMHSGDIMIMSG 280 Query: 181 ESRLFYHGIQPLKAGFHPLTID 202 SRL H + + + Sbjct: 281 FSRLLNHAVPRVLPNPEGEGLP 302 >UniRef50_C0Z2F3 AT5G01780 protein n=9 Tax=Magnoliophyta RepID=C0Z2F3_ARATH Length = 217 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 58/222 (26%), Positives = 99/222 (44%), Gaps = 35/222 (15%) Query: 19 AVILRRFAFNA-AEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG- 75 V+L+ F +++ ++ + F Q G + + M LG Q Sbjct: 1 MVLLKDFLTPDIQVDIVKTCRELGVKPTGFYQPGYSVGSKLHLQMMC---LGRNWDPQTK 57 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA---------------GYPDFQPDACLIN 120 Y + + P +P +F+ L ++A A P PD C++N Sbjct: 58 YR---KNTDIDSKAPEIPVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICIVN 114 Query: 121 RYAPGAKLSLHQDKDEPDLR----APIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVV 176 Y+ +L LHQD+DE + PIVS S+G A F +G + + + ++LE GDV+ Sbjct: 115 FYSETGRLGLHQDRDESEESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGDVL 174 Query: 177 VWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211 ++GGESR+ +HG++ + P+++ R NLTFR Sbjct: 175 IFGGESRMIFHGVKSIIPNSAPMSLLNESKLRTGRLNLTFRH 216 >UniRef50_D0NYX8 Alkylated DNA repair protein alkB-like protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NYX8_PHYIN Length = 309 Score = 156 bits (394), Expect = 6e-37, Method: Composition-based stats. Identities = 40/181 (22%), Positives = 66/181 (36%), Gaps = 26/181 (14%) Query: 56 TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD 115 L W Y ++ + +P+ L + A G + + Sbjct: 122 QDPAESPLLAKLCWAASGYHYDWTARKYHKGS-FSPVPELLQQLGTKCAAVCGM-TLEAE 179 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 A ++N Y + + H D E + P+VS+SLG +F GG +++ +LL GD+ Sbjct: 180 AVIVNYYKTKSSMGGHLDDVEYTMDHPVVSLSLGSKCVFLMGGHTKDEAPLEILLRSGDI 239 Query: 176 VVWGGESRLFYHGIQPLKA----------GFHPLTID--------------CRYNLTFRQ 211 + GG SR YHG+ + PL+ D R N+ RQ Sbjct: 240 AIMGGASRTCYHGVARVLPTPFSMKSKELDALPLSNDDREQYEAVRTYLGSQRININVRQ 299 Query: 212 A 212 Sbjct: 300 V 300 >UniRef50_Q4D9X3 Alkylated DNA repair protein, putative n=3 Tax=Trypanosoma RepID=Q4D9X3_TRYCR Length = 323 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 70/225 (31%), Gaps = 30/225 (13%) Query: 17 AGAVILRRFAFNAAEQ--LIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 G + +Q I S ++ + W T Sbjct: 89 PGLLFFPDAISEEEQQAFCRDAILRYGDSSQHPNHLSTHASKPKCTKRYEAPMRWATLGF 148 Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD---------FQPDACLINRYAPG 125 Y ++ + A P + + ++P ++N ++ G Sbjct: 149 SYDWTSKTYTREN-YSAFPPALKRRIEEILHLCSSTPDLKDVNPSIYEPQTAIVNYFSVG 207 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLF 185 + + HQD E ++ P++S+SLG +F G R+D L GDV V+ G SR+ Sbjct: 208 SMMMAHQDVSEESMQHPLISISLGCSCVFLMGTSSRDDAPYAFWLRSGDVAVFSGPSRVA 267 Query: 186 YHGIQPLKAGFHP------------------LTIDCRYNLTFRQA 212 +H I + P R N+ RQ Sbjct: 268 FHSIPRIMDDCPPHLCTISGENNEDEVYWRTQMRHMRININVRQV 312 >UniRef50_UPI000051A07C PREDICTED: similar to AlkB CG33250-PA n=3 Tax=Neoptera RepID=UPI000051A07C Length = 310 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 41/151 (27%), Positives = 63/151 (41%), Gaps = 1/151 (0%) Query: 56 TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD 115 + L W T + + + +P L A G+ DF+ + Sbjct: 121 KGEINTKLISKLRWATFGYHHNWDTKLY-SETCKTKIPIELSLLTSFLAQTLGFKDFKAE 179 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 A +IN Y + L+ H D E ++ AP+ S+S G AIF GGL + D + L GD+ Sbjct: 180 AAIINYYRMNSTLAGHTDHSELNVEAPLFSISFGQTAIFLIGGLMQEDTTNAIFLRSGDI 239 Query: 176 VVWGGESRLFYHGIQPLKAGFHPLTIDCRYN 206 ++ G SRL YHGI + + N Sbjct: 240 IIMSGMSRLRYHGIPKILLTIDKPWDNEELN 270 >UniRef50_Q22MH4 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22MH4_TETTH Length = 403 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 67/195 (34%), Gaps = 46/195 (23%) Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP-----DFQPDACLI 119 + W+ Y + + MP + L + A D++P+A ++ Sbjct: 190 KKIRWSNVGAQYDWD--NRLYPSFTTPMPDIINELAEFAKNVVSDEITDVYDYEPEAVIV 247 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y +S H D E D ++PI S + G IF G ++ L L+ GD+++ Sbjct: 248 NYYDKKNYMSGHLDDGEKDQKSPIFSFTFGCSCIFLMGDRTKDFTPLPLRLDAGDLMIMS 307 Query: 180 GESRLFYHGIQPLKAGFHPLT--------------------------------------- 200 G SR YHG+ + G P Sbjct: 308 GYSRNCYHGVPRIFPGSFPKEEFEKYVRELYPHLYDNEEINKNKDLKFFENNYRHAINYL 367 Query: 201 IDCRYNLTFRQAGKK 215 D R NL FRQ KK Sbjct: 368 QDSRINLNFRQVEKK 382 >UniRef50_A0C122 Chromosome undetermined scaffold_140, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C122_PARTE Length = 312 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 1/128 (0%) Query: 70 TTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD-FQPDACLINRYAPGAKL 128 GY Y + Q + +P + QRA + +Q ++ +IN Y + Sbjct: 131 RWANVGYQYDWNNRQYPQEKTQVPDPIQEISQRANNFLQLQNQYQSESVIINFYQSHDYM 190 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHG 188 + H D E D +PI S S GL ++F GG +++ + L+ GD++V G +R YHG Sbjct: 191 TGHLDDAELDQDSPIYSFSFGLSSVFVIGGPTKDEKPIAIKLDSGDLLVMSGHARKCYHG 250 Query: 189 IQPLKAGF 196 + + A Sbjct: 251 VPRVLADS 258 >UniRef50_B5Y3R7 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y3R7_PHATR Length = 352 Score = 147 bits (371), Expect = 3e-34, Method: Composition-based stats. Identities = 45/149 (30%), Positives = 70/149 (46%), Gaps = 7/149 (4%) Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA-----GYPDFQP 114 + L W T Y ++ P MP+ + + A + P F Sbjct: 152 KYRSFRKLSWATMGYHYDWNTRSYNEKAKSP-MPKLLERIAEIFAATSLLVDGQDPCFTA 210 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHG 173 A ++N Y P + + H+D E L PIVS+SLG PA+F GG ++D P+ +L+ G Sbjct: 211 SASIVNFYTPKSMMGGHRDDLEHALDKPIVSISLGRPAVFLLGGNTKDDQPVVAILVRPG 270 Query: 174 DVVVWGGESRLFYHGIQPLKAGFHPLTID 202 DV++ GG SRL YHG+ L +++ Sbjct: 271 DVMMMGGASRLRYHGMARLLPTTGLPSVE 299 >UniRef50_C8NRG2 DNA repair protein n=9 Tax=Actinomycetales RepID=C8NRG2_COREF Length = 237 Score = 145 bits (367), Expect = 8e-34, Method: Composition-based stats. Identities = 60/220 (27%), Positives = 97/220 (44%), Gaps = 24/220 (10%) Query: 12 QEPLAAGAVILRRFA-FNAAEQLIRDINDVAS---QSPFRQMVTP-GGYTMSVAMTNCGH 66 +A+G V L + ++ + +A +P MSV + + G Sbjct: 23 SREVASGVVHLPDWLPLGEQAAVVEEARGIARSVAGTPLAMTRPQLRSGQMSVHILSLGQ 82 Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD---------FQPDAC 117 W T+ Y + P +P SFH+L RA A ++ +A Sbjct: 83 -HWATNPYRY----VTSVGGVAVPPIPASFHDLAARALADAAALSPPLAAWSGKYRAEAA 137 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGDVV 176 L+N YAPG+ + +HQD +E AP++S+S+G IF+ G RN P + L GD++ Sbjct: 138 LVNYYAPGSAMGMHQDANELS-EAPVISLSIGDTGIFRLGNTDNRNRPWVDVPLLSGDLI 196 Query: 177 VWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQAG 213 ++GGE R +HG+ ++A P R N+T RQ Sbjct: 197 IFGGEHRRAFHGVPRIEADTAPEGCGLDRGRINITIRQVA 236 >UniRef50_Q0U5B3 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0U5B3_PHANO Length = 298 Score = 139 bits (350), Expect = 7e-32, Method: Composition-based stats. Identities = 46/172 (26%), Positives = 71/172 (41%), Gaps = 18/172 (10%) Query: 45 PFRQMVTPGGY-TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRA 103 P + P + +S+ L WTT Y ++ P P P N+ + Sbjct: 130 PIAHPLDPSIHKPLSITQLLNKKLRWTTLGGQYDWTAKKYPDATP-PPFPADTKNMLESI 188 Query: 104 ATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND 163 T + A ++N Y+PG LS+H+D E ++S+SLG A+F G +D Sbjct: 189 FTTT-----RAQAAIVNLYSPGDTLSVHRDVAETSSHG-LISLSLGCDAVFVIG--TDDD 240 Query: 164 PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 + L L G V G SR +HG+ + AG P FR G++ Sbjct: 241 KVLTLRLRSGSAVYMSGASRFAWHGVPQIVAGSCPGV--------FRGVGQE 284 >UniRef50_A4H8G0 Putative uncharacterized protein n=3 Tax=Leishmania RepID=A4H8G0_LEIBR Length = 440 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 67/195 (34%), Gaps = 18/195 (9%) Query: 17 AGAVILRRFA--FNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH--LGWTTH 72 G ++ + D + + T+C + W T Sbjct: 85 PGLLLFPGVLSEAEQQRWCREAVLDYGDSE--HHPNILSTHARAPQSTSCYQPPMRWATL 142 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHN----LCQRAATA------AGYPD-FQPDACLINR 121 Y ++ ++ + P + L A YPD ++P ++N Sbjct: 143 GYSYEWTQKVYHRDR-YSTFPSALRQRMCDLVSLVAEVRQDGFCCAYPDTYEPQTAIVNY 201 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y G+ + HQD E L P++S+SLG A+F G R D LL GDV + G Sbjct: 202 YPVGSMMMCHQDVSEETLEQPLMSLSLGCSAVFLMGTQSREDAPHAFLLRSGDVAAFTGP 261 Query: 182 SRLFYHGIQPLKAGF 196 SR +H + Sbjct: 262 SRAAFHSTPRILDDC 276 >UniRef50_A8PV44 ALKBH protein, putative n=1 Tax=Brugia malayi RepID=A8PV44_BRUMA Length = 339 Score = 136 bits (342), Expect = 6e-31, Method: Composition-based stats. Identities = 54/216 (25%), Positives = 89/216 (41%), Gaps = 19/216 (8%) Query: 16 AAGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMV---TPGGYTMSVAMTNCGHLGWTT 71 G V+L F ++ Q I+ + ++SP V P V + L W+T Sbjct: 128 RPGMVMLNDIFKSSSHLQWIKRSLFIYAESPGFTNVGLQVPNVRN--VFKEHGRQLRWST 185 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLH 131 Y ++ +P+ +L + A G DA +IN Y+ + L+ H Sbjct: 186 LGLHYDWATKIYPFEGEL--LPEELVSLSDVLSQALGIGPMYADAAIINFYSRKSTLAPH 243 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQP 191 D+ E L +P++S+S G AI+ GG +DP+ + GDV+V G RL YH + Sbjct: 244 VDRSERSLSSPLISLSFGQTAIYLAGGTDLDDPVDAFYIRSGDVLVIYGPQRLIYHAVPR 303 Query: 192 LKAGFHPLTIDC-----------RYNLTFRQAGKKE 216 + + D R N+T RQ + + Sbjct: 304 ILQDTYFEDKDQPEEIVKYANTNRINITLRQVDEHK 339 >UniRef50_D0N998 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N998_PHYIN Length = 292 Score = 135 bits (341), Expect = 8e-31, Method: Composition-based stats. Identities = 53/222 (23%), Positives = 87/222 (39%), Gaps = 22/222 (9%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSP--FRQMVTPGGYTMSVAMTNCGHLGWTTH 72 L G +I+++F +Q + D + F + G + C W Sbjct: 70 LLPGLLIIKQFLTPQEQQELVDDSRCMGLGEGGFYKPTYASGAKCRLHQM-CLGRHWNVK 128 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD---------FQPDACLINRYA 123 + Y + P +P S+ QR+ AA D PD C++N Y Sbjct: 129 TEKYEDQRSNYDY-APIRTLPDSWKTYAQRSLDAAKKIDPLVMGSCKKMTPDICVVNFYK 187 Query: 124 PGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGG--LKRNDPLKRLLLEHGDVVV 177 + +H DKDE D + +P++S S+G A F + ++ + + LE GD +V Sbjct: 188 KAGRNGMHIDKDESDEAMSMGSPVISFSVGCAAEFAYIDHYPDPHEAVPIVRLESGDALV 247 Query: 178 WGGESRLFYHGIQPLKAGFHP---LTIDCRYNLTFRQAGKKE 216 +GG +R H + + P R NLTFR+ E Sbjct: 248 FGGPARTVVHALTRVYNNTQPSWLRMRSGRLNLTFREYKPSE 289 >UniRef50_D2A2C2 Putative uncharacterized protein GLEAN_07671 n=1 Tax=Tribolium castaneum RepID=D2A2C2_TRICA Length = 582 Score = 135 bits (341), Expect = 8e-31, Method: Composition-based stats. Identities = 50/209 (23%), Positives = 73/209 (34%), Gaps = 37/209 (17%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 L G I+ F E + + S+ H G Sbjct: 125 LPPGLRIITNFVSEEEEARLLALCQFEDG-------------GSMKHRLVKHYG-----Y 166 Query: 75 GYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQD 133 + Y + KP +PQ L +R +F+P+ INRY PG + H D Sbjct: 167 EFRYDINNVDKEKPLSEGIPQECDFLWRRLPF-----EFRPNQLTINRYNPGQGIPSHVD 221 Query: 134 KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPL 192 PI+S+SL + +F + D +LL ++V GESR + HGI P Sbjct: 222 THSA-FGDPILSLSLSSDVVMEF----KKDETICVLLPRRSLLVMAGESRYEWTHGIVPR 276 Query: 193 K-------AGFHPLTIDCRYNLTFRQAGK 214 G H R + TFR+ K Sbjct: 277 TFDFYNDEGGCHCFKRGVRVSFTFRKIRK 305 >UniRef50_Q6C333 YALI0F03003p n=1 Tax=Yarrowia lipolytica RepID=Q6C333_YARLI Length = 330 Score = 135 bits (340), Expect = 1e-30, Method: Composition-based stats. Identities = 56/177 (31%), Positives = 77/177 (43%), Gaps = 24/177 (13%) Query: 53 GGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKP----WPAMPQSFHNLCQRAATAAG 108 GG S L W T Y ++ + P +P P++ + L R + Sbjct: 157 GGKPTSRDKIKNKQLRWVTLGGQYNWTTKAYPSFIPGTEGFPYFPKNLYELLSRPLFSI- 215 Query: 109 YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND----- 163 P+A +IN Y+PG LS HQD E +VSVS+GL AIF G + +D Sbjct: 216 ----NPEAAIINFYSPGDILSPHQDVAELSQDD-LVSVSIGLDAIFYVGLNRYDDSENSL 270 Query: 164 -PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC--------RYNLTFRQ 211 P L+L GDV+V GG+SR YHGI + + P R N+ RQ Sbjct: 271 APPLCLMLRSGDVIVMGGKSRHAYHGIGKVFSNTSPDLNSPYQDWLDTKRVNINVRQ 327 >UniRef50_B1XR40 Oxidoreductase, 2OG-Fe(II) oxygenase family n=4 Tax=Bacteria RepID=B1XR40_SYNP2 Length = 204 Score = 133 bits (334), Expect = 5e-30, Method: Composition-based stats. Identities = 46/214 (21%), Positives = 78/214 (36%), Gaps = 24/214 (11%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L+LFA + EP G + F EQ + ++ D + Sbjct: 10 LELFA-SVTINEPQIPGLQYIEEFIDKQTEQELLNLID---------------QQQWLMD 53 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 + Y Y + +P + ++ + PD ++N Sbjct: 54 L---KRRVQHYGYKYDYRTKKIDYSMYLGILPDWLFPIIEQMVS-LNLISELPDQAIVNE 109 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PG ++ H D +P I+S+SL P I F + N+ +LL V++ G Sbjct: 110 YLPGQGITSHVDC-KPCFTDTIISLSLNAPCIMNFDSIVNNERQSKLLKPRSLVILQGES 168 Query: 182 SRLFYHGIQPLKA---GFHPLTIDCRYNLTFRQA 212 L+ HGI P K+ + D R ++TFR+ Sbjct: 169 RYLWKHGIPPRKSDQWNGQKIMRDRRISITFRKV 202 >UniRef50_C1EB25 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EB25_9CHLO Length = 334 Score = 132 bits (332), Expect = 8e-30, Method: Composition-based stats. Identities = 58/231 (25%), Positives = 82/231 (35%), Gaps = 38/231 (16%) Query: 7 DAEPWQEPLAAGAVILRR-FAFNAAEQLIRDINDVASQSPFRQ---MVTPGGYTM--SVA 60 + E LA G V LRR L +V RQ PG V Sbjct: 95 EKGSVAEVLAPGLVCLRRAIDLETQAWLAERAFEVGEGKDGRQGFYNTVPGDAPGDAPVL 154 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNL-------CQRAATAAGYPDFQ 113 N G G P P+ + Q A + PD Sbjct: 155 RLNQGTRGRVIL---------------PVSDFPERLGRIVRGCVRCAQTADSCTNVPDMN 199 Query: 114 PDACLINRYAPGAKLSLHQDKDEP-----DLRAPIVSVSLGLPAIFQFGGLKRNDPLKRL 168 P L+N Y GAK H+D ++P D PIVS ++GL A F + + + + Sbjct: 200 PTTALVNFYKEGAKFKWHRDSEDPAHARHDTGPPIVSFTVGLSADFSYKNRFEDATHRTV 259 Query: 169 LLEHGDVVVWGGESRLFYHGIQPLKAGFHPL-----TIDCRYNLTFRQAGK 214 L GDV+++GG SR+ H + + P + R N+T R G+ Sbjct: 260 RLNSGDVLLFGGPSRMIVHSVTGVVPRTMPPMLRGRMLHGRLNVTVRDIGR 310 >UniRef50_Q5K7S3 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5K7S3_CRYNE Length = 425 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 46/187 (24%), Positives = 69/187 (36%), Gaps = 43/187 (22%) Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP-------------- 110 +LGW Y P+T P P +LC A + + Sbjct: 234 ANLGWVYQWSTKSYD-FAPETPIP---FPAPLADLCSEAVASVPWENVFSSVSDPDASTY 289 Query: 111 -------DFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND 163 D++PD ++N Y L H D+ E D P+VSVSLG AI G R++ Sbjct: 290 GWQSWPRDYKPDTGIVNFYQLNDTLMAHVDRAELDPARPLVSVSLGHAAILLLGSDSRDE 349 Query: 164 PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPL------------------TIDCRY 205 + ++L GD+++ G+ R YHG+ + G P R Sbjct: 350 VPRPIILRSGDMLIMSGKGRQSYHGVPRILEGSLPSHFLVQESDSEEMKAAKNWISTARI 409 Query: 206 NLTFRQA 212 N+ RQ Sbjct: 410 NINARQV 416 >UniRef50_A9TLH2 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TLH2_PHYPA Length = 697 Score = 130 bits (328), Expect = 2e-29, Method: Composition-based stats. Identities = 59/252 (23%), Positives = 97/252 (38%), Gaps = 55/252 (21%) Query: 7 DAEPWQEPLAAGAVILRRF-AFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 + + L G V+LR + + + ++L+ + A F++ T G + Sbjct: 454 ERKSTVTILQPGMVLLRSWLSLDIQQRLVNESQSAAHL--FKRPTTASGGKYHLWQMA-- 509 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--------YPDFQPDAC 117 G + + Y+ P+ P ++L + A A +F+PD Sbjct: 510 -FGCSWDSKTRRYAA--PERGLR---FPVWMYDLGRELAFDAQKHTPVYAQGSNFEPDVA 563 Query: 118 LINRYAPGA------KLSLHQDKDEPDLRAPIVSVSLGLPAIFQF--------------- 156 L+N Y L HQD D+ P+VSVS+G F + Sbjct: 564 LVNFYPAKDEELGVVGLGGHQDLDDYC-DMPVVSVSVGDSMTFFYRRFPPQSRRKSGVQI 622 Query: 157 -----------GGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI---D 202 G ++ K+++L GDV+V+GGESRL YHG + ++ G P + Sbjct: 623 IVDEYAAQCCKDGDTAHNSEKKIILASGDVLVFGGESRLVYHGTRCVQPGTRPPGLHMAP 682 Query: 203 CRYNLTFRQAGK 214 R N TFRQ Sbjct: 683 GRLNFTFRQCNA 694 >UniRef50_C3YRT0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YRT0_BRAFL Length = 641 Score = 130 bits (326), Expect = 4e-29, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 76/224 (33%), Gaps = 26/224 (11%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 +D L G ++ F A + + +++ ++ Sbjct: 124 IDEVPSQRATGLDLPPGLRLVEDFVSPACADRLLEGLGWSNE-----QQQHMDAEQALKH 178 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLIN 120 H G + Y + +KP P +P + R + G+ +PD +N Sbjct: 179 RRVKHFG-----YEFRYDNNNVDKDKPLPGGLPDWCSQVIDRMMS-GGHIKHRPDQITVN 232 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 +Y PG + H D I S+SLG + F ++L ++V G Sbjct: 233 QYQPGQGIPPHVDTHSA-FEDEISSLSLGGQTVMDFKHPS--GKRVAVVLPARSLLVMSG 289 Query: 181 ESR-LFYHGIQPLKAGFHPLTIDC----------RYNLTFRQAG 213 E+R L+ HGI P K P+ R + TFR+ Sbjct: 290 EARYLWTHGIIPRKMDPVPVKGQEDSITLARREVRTSFTFRKIR 333 >UniRef50_UPI000186CFBD conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186CFBD Length = 602 Score = 129 bits (324), Expect = 7e-29, Method: Composition-based stats. Identities = 53/226 (23%), Positives = 81/226 (35%), Gaps = 31/226 (13%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 + L A+ +Q+P G V+L F E I + G S Sbjct: 113 ISLLNQAKIFQKP--PGLVLLEDFISEEEETEILKLLKF----------NDSGEEYS--- 157 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRA----ATAAGYPDFQPDAC 117 + H + + Y + N+P +P + L R DF PD Sbjct: 158 SELKHRKVKHYGYEFKYGSNNVNLNEPIKKIPSKLNYLWDRLKKYSDNFESDFDFTPDQL 217 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 +N Y PG + H D I+S+SL + +F K D +LL + + Sbjct: 218 TVNCYEPGQGIPPHVDTHSA-FEDGILSLSLESSVVMEFKNDK--DLTFSVLLPRRSLCL 274 Query: 178 WGGESRL-FYHGIQPLKAGFHPLT--------IDCRYNLTFRQAGK 214 GESR + HGI P K+ P + R +LTFR+ + Sbjct: 275 MLGESRYNWVHGITPRKSDLIPNKDGSLTVQNRERRTSLTFRKTRR 320 >UniRef50_Q09BP3 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09BP3_STIAU Length = 185 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 54/205 (26%), Positives = 73/205 (35%), Gaps = 27/205 (13%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 +E G + + F ++ E + + + S R VA H GW Sbjct: 5 EEERPEGLLYVPDFLTDSEEARLLEHLRGLTFSEIRM-------RGQVAKRRTAHFGWL- 56 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLH 131 Y Y + + P PAMP L R A G Q L+N Y PGA + H Sbjct: 57 ----YGYESLKVE---PGPAMPDFLLPLRNRCAELMGELPEQLVEALLNEYPPGAAIGWH 109 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQF-GGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGI 189 +D P +V VSLG +F L L V GGESR + H I Sbjct: 110 RD--APMFGHQVVGVSLGGACRMRFQRDQGEARRTYALELAPRSAYVLGGESRSTWQHSI 167 Query: 190 QPLKAGFHPLTIDCRYNLTFRQAGK 214 +K RY++TFR Sbjct: 168 PAVK--------QERYSITFRTLKA 184 >UniRef50_B8ESE9 2OG-Fe(II) oxygenase n=1 Tax=Methylocella silvestris BL2 RepID=B8ESE9_METSB Length = 202 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 51/211 (24%), Positives = 82/211 (38%), Gaps = 36/211 (17%) Query: 11 WQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 ++ L G + AAEQ + D A SPFR + + GW Sbjct: 9 FERSLLPGLRLGENIISAAAEQTLISAIDAARLSPFR-------FQGWLGKRVTASFGWR 61 Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSL 130 + + P +P +P+ L + AA AG P L+ RY PGA + Sbjct: 62 YDFETASFGPAEP--------IPEFLLPLRESAAGFAGLPTGALAQALLIRYDPGAGIGW 113 Query: 131 HQDKDEPDLRAPIVSVSLGLPAIFQF-----GGLKRNDPLKRLLLEHGDVVVWGGESR-L 184 H+D+ L ++ +SLG PA+ +F G R + L + G++R L Sbjct: 114 HRDR---PLFEHVIGISLGAPAVLRFRRRTAAGFDRANAP----LAPRSIYHLSGDARHL 166 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 + H I + R+++TFR +K Sbjct: 167 WEHSIAQVDV--------ARWSITFRSLSEK 189 >UniRef50_UPI0000DB70B1 PREDICTED: similar to CG17807-PA n=2 Tax=Apocrita RepID=UPI0000DB70B1 Length = 558 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 77/221 (34%), Gaps = 31/221 (14%) Query: 4 LFADAE--PWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 +F D W L +G ++ F E+++ ++ Sbjct: 85 IFPDLNYCEWSLNLPSGIKLIEDFITEEEEKMLLSTITWNNE----------------ES 128 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 ++ H + Y +KP +P+++ L Q ++ D IN Sbjct: 129 SDLKHRKVKHFGYEFQYDTNKVDLDKPIVPIPKNYQFL-QVLFKQYHNVSYEYDQLTINH 187 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PG + H D I+S+SLG I F K+ + L L +++ GE Sbjct: 188 YLPGQGIPPHIDTHSV-FEDSILSLSLGSACIMNF---KKENKKASLFLPPRSLLIMSGE 243 Query: 182 SRLFY-HGIQPLK-------AGFHPLTIDCRYNLTFRQAGK 214 +R + HGI P G + R + TFR+ + Sbjct: 244 ARYAWSHGICPRHNDIVQTSNGITTQSRGTRVSFTFRKVHR 284 >UniRef50_C1FFM9 Predicted protein (Fragment) n=2 Tax=Micromonas RepID=C1FFM9_9CHLO Length = 126 Score = 126 bits (316), Expect = 6e-28, Method: Composition-based stats. Identities = 33/77 (42%), Positives = 44/77 (57%) Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 A L+N Y G +L+ H D E D+ PIVSVSLG P +F GG R+ +L+ GD Sbjct: 1 AGLVNYYRSGDQLAGHVDDAEVDMSKPIVSVSLGCPCVFLLGGRSRDVAPTAVLMRSGDA 60 Query: 176 VVWGGESRLFYHGIQPL 192 +V G SR YHG+ + Sbjct: 61 IVLTGPSRRCYHGVPRI 77 >UniRef50_UPI000186D7D6 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D7D6 Length = 278 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 41/179 (22%), Positives = 66/179 (36%), Gaps = 17/179 (9%) Query: 17 AGAVILR-RFAFNAAEQLIRDINDVASQSP------FRQMVTPGGYTMSVAMTNCG---- 65 G ++ F F I + ++ P ++T S N Sbjct: 66 PGLFFIKNPFKFIGQRYWIIRCLEYYTRKPNKLNIDIHNILTEEDDWWSYCKKNFNTANG 125 Query: 66 -----HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 L W T + + + + P+ + LC A + F +A ++N Sbjct: 126 KLVLDKLRWVTFGYHHNWDTK-VYSESSKSSFPEDLNLLCNHFANNFFFEGFNAEAAIVN 184 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 Y + LS H D E ++ AP+ S S G AIF GG ND +L+E GDV++ Sbjct: 185 MYHLNSTLSGHTDTSELNINAPLFSFSFGQSAIFLIGGKFINDSALPILVESGDVLIMS 243 >UniRef50_C5KBY7 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KBY7_9ALVE Length = 332 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 42/174 (24%), Positives = 70/174 (40%), Gaps = 24/174 (13%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA--------GYPDFQPDAC 117 L W++ Y ++ + MPQ ++ A AA +Q +A Sbjct: 132 KLRWSSLGVHYDWTRRSYR-GTSSSDMPQWVCDIYHNALKAADDICGSRLAEDGYQAEAA 190 Query: 118 LINR---YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHG 173 L+N + +L H+D E +P+V ++LGLP F GG R D +L G Sbjct: 191 LVNFFHSHRSSDRLGGHKDDVEARDHSPLVILALGLPCTFLLGGDSRVDVTPAPILFSSG 250 Query: 174 DVVVWGGESRLFYHGIQPLKAGF--HPLTIDC---------RYNLTFRQAGKKE 216 DV+V E+R ++HG+ + P D R +++ R+ E Sbjct: 251 DVLVLSREARQWFHGVPTVLKNSVDRPHHADGSVEDFLKRTRLSVSIREVSGDE 304 >UniRef50_Q010W8 Oxidoreductase, 2OG-Fe (ISS) n=1 Tax=Ostreococcus tauri RepID=Q010W8_OSTTA Length = 214 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 43/141 (30%), Positives = 66/141 (46%), Gaps = 16/141 (11%) Query: 90 PAMP-QSFHNLCQRAATAA-----GYPDFQPDACLINRYAPGAKLSLHQDKDEPDL---- 139 A P + +C A AA PD P CL+N Y GA+ H+D ++P L Sbjct: 50 EAFPGDALREMCAEAVRAAQKVDDAMPDMNPTTCLVNFYKDGAEFKWHKDSEDPKLVKSR 109 Query: 140 -RAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP 198 PIVS S+G+ A F + + + + L GDV+++GG SR+ H + + G P Sbjct: 110 AGPPIVSFSVGMSADFGYKYSFEDPTHEVVRLNSGDVLLFGGPSRMIVHSVLNVHPGSMP 169 Query: 199 -----LTIDCRYNLTFRQAGK 214 ++ R N+T R G+ Sbjct: 170 GHLRGKMLNGRLNVTVRDIGE 190 >UniRef50_UPI000180CD20 PREDICTED: similar to alkB, alkylation repair homolog 8 (E. coli) (alkbh8) n=1 Tax=Ciona intestinalis RepID=UPI000180CD20 Length = 593 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 56/220 (25%), Positives = 82/220 (37%), Gaps = 28/220 (12%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 +FA+ + L G + + F EQ + + + +S Sbjct: 119 IFAEEKSD---LPNGLIKIENFLNKEEEQALINCIQ-------HDISILSNDHVS---EK 165 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 H + + Y D N P +P NL R A GY +PD IN Y Sbjct: 166 LKHRTVLHYGYKFRYGTNDVDINNPISEGLPNYIENLLDRIM-ATGYLPSRPDQLTINMY 224 Query: 123 APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFG--GLKRNDPLKRLLLEHGDVVVWGG 180 PG + H D + + +VSLG + F G +R D + +E + ++ G Sbjct: 225 EPGDGIPPHTD-NTRSFDGVLSTVSLGSHTVMNFSKEGAERID----VCVEPRTLFLFTG 279 Query: 181 ESRL-FYHGIQPLK-----AGFHPLTIDCRYNLTFRQAGK 214 ESR + HGIQ K G T RY+LTFR K Sbjct: 280 ESRYEWRHGIQQRKFDILDQGKKITTRTIRYSLTFRTVVK 319 >UniRef50_A8J903 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8J903_CHLRE Length = 398 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 47/187 (25%), Positives = 64/187 (34%), Gaps = 54/187 (28%) Query: 55 YTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQR------------ 102 A L W T + ++ + +P S +L ++ Sbjct: 212 GPGPAAEQLLRKLRWATLGPQFDWTERQYDFTGAYRQLPPSLSDLARQMAAVVDALQAAG 271 Query: 103 -----AATAAGYPD-------------------------------------FQPDACLIN 120 AA A P ++PDA ++N Sbjct: 272 MQLMPAAAEAEVPQQPAAPRVQATEPSQAAAGAVSAGLAPPTGAAPAAPRGYEPDAAIVN 331 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y G L H D E DL PIVSVSLG PA+F GG + LLL GDV+V G Sbjct: 332 YYQIGDVLGGHVDDVESDLAQPIVSVSLGCPALFLMGGRTKATHPSALLLRGGDVLVLAG 391 Query: 181 ESRLFYH 187 ++R YH Sbjct: 392 QARSCYH 398 >UniRef50_Q4UFZ4 Alkylated DNA repair protein, putative n=2 Tax=Theileria RepID=Q4UFZ4_THEAN Length = 350 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 71/189 (37%), Gaps = 12/189 (6%) Query: 17 AGAVILRRFAFNAAE-----QLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 G +LR F + +R + + S M + + + +L W+T Sbjct: 99 PGVFVLRNFLTQDQSLLLACETLRSYINPPNNSNLLLMDPNISSPIWPSNS-FKNLRWST 157 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--YPDFQPDACLINRYAPGAKLS 129 Y + + P+ + + T Y F D+ +IN Y+ L Sbjct: 158 IGHLYDWGKRQYIG---FTQFPEIIAKIVNQINTLLSGFYQPFTADSAIINFYSNSYFLR 214 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 LH+D E + P++++SLG PAIF + +++ G +++ SR HGI Sbjct: 215 LHRDDAE-ETNDPVINISLGAPAIFCICKEDPSQFPLSCVVDSGSIIIMSKNSRRCLHGI 273 Query: 190 QPLKAGFHP 198 L P Sbjct: 274 SKLYHYVKP 282 >UniRef50_Q9LJH4 Emb|CAB82748.1 n=1 Tax=Arabidopsis thaliana RepID=Q9LJH4_ARATH Length = 330 Score = 121 bits (304), Expect = 1e-26, Method: Composition-based stats. Identities = 56/240 (23%), Positives = 84/240 (35%), Gaps = 57/240 (23%) Query: 14 PLAAGAVILRRF-AFNAAEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 + G V+L+ + + N ++ + + F Q G + + M LG Sbjct: 82 VIRPGMVLLKNYLSINNQVMIVNKCRQLGLGEGGFYQPGFQDGGLLHLKMMC---LGKNW 138 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPDA 116 Q Y I P P +P F L ++A + P PD Sbjct: 139 DCQTRRYGEIRPIDGSVPPRIPVEFSQLVEKAIKESKSLVATNSNETKGGDEIPLLLPDI 198 Query: 117 CLINRYAPGAKLSLHQ---------------------------------DKDE----PDL 139 C++N Y KL LHQ DK E Sbjct: 199 CVVNFYTSTGKLGLHQVVTSIQYRKNCSSLYYCSFKFLHHQLIESFMAQDKGESKKSLRK 258 Query: 140 RAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPL 199 PIVS S+G A F +G K D L+LE GDV+++G SR +HG++ ++ P Sbjct: 259 GLPIVSFSIGDSAEFLYGDQKDVDKADTLILESGDVLIFGERSRNVFHGVRSIRKILPPR 318 >UniRef50_UPI00017B2DD1 UPI00017B2DD1 related cluster n=1 Tax=Tetraodon nigroviridis RepID=UPI00017B2DD1 Length = 643 Score = 120 bits (301), Expect = 4e-26, Method: Composition-based stats. Identities = 48/232 (20%), Positives = 71/232 (30%), Gaps = 39/232 (16%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 E G +L F E + D +S + A Sbjct: 122 PCEEDVSVSFPMGLALLDNFVSPEEEASLLSAVDWSSSNDGVT-----------AQKAMK 170 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 H + + Y + +KP PA +P +R T PD +N+Y Sbjct: 171 HRRVKHYGFEFRYDNNNVDKDKPLPAGIPAECLPFLERCLTNKIIDVM-PDQLTVNQYES 229 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR- 183 G + H D I+S+SL + F + L L+L ++V GESR Sbjct: 230 GQGIPPHVDTHSA-FEDAILSLSLRAQTVMDFRHP--DGSLVALVLPGRSLLVMKGESRY 286 Query: 184 LFYHGIQPLKAGFHP----------------------LTIDCRYNLTFRQAG 213 L+ HGI P K P R + TFR+ Sbjct: 287 LWTHGITPRKFDVVPSCDSQPSAPTSHDSQSQSNLTLSRRATRTSFTFRKIR 338 >UniRef50_B3RYI8 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3RYI8_TRIAD Length = 653 Score = 120 bits (300), Expect = 4e-26, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 70/216 (32%), Gaps = 22/216 (10%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDV----ASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 G ++++ + E + S H Sbjct: 150 PEGLLLIQNYVSEQEEDELLQSIGWYTNHGQASHDTHPCQTVQSDRESMQRRLKHRHVKH 209 Query: 72 HRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLS 129 + + Y +KP A +P +CQR GY QPD +N Y PG A + Sbjct: 210 YGYEFRYDTNTVDKDKPLHATIPSKCRYICQRMTDD-GYIQHQPDQLTVNEYMPGQAGIP 268 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHG 188 H D + IVS+SL + F + + L ++V GE R L+ HG Sbjct: 269 PHIDTHSA-FQDQIVSLSLLSQIVMDFRHP--DGTRISINLPRRSLLVMSGECRYLWSHG 325 Query: 189 IQPLK-----------AGFHPLTIDCRYNLTFRQAG 213 I P K + L R + TFR+ Sbjct: 326 ITPRKYDVVCDDNDNNSNITLLERSRRVSFTFRKIR 361 >UniRef50_B2APW5 Predicted CDS Pa_4_6060 n=6 Tax=Sordariomycetes RepID=B2APW5_PODAN Length = 356 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 66/206 (32%), Gaps = 41/206 (19%) Query: 42 SQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQ 101 S + F +++ L W T Y ++ P P P + Sbjct: 157 SPTTFTPKDPSVHKPLTIKQVLQRRLSWVTLGGQYDWTNRIYPGELP-PQFPPDIAGFLE 215 Query: 102 RAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF----- 156 +P+ A ++N Y PG + +H+D E + ++S+S G ++F Sbjct: 216 TL-----FPETLAQAAIVNFYTPGDTMMMHRDVSE-ETDKGLISLSFGCDSLFMIAPNDV 269 Query: 157 GGLKRNDPLKR-----------LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPL------ 199 G + + L L GD + +SR +HG+ + G P Sbjct: 270 GKMSDEEKKAAGFGDGQKEYLLLRLRSGDAIYMTKDSRFAWHGVPKVLKGTCPDYLEDWP 329 Query: 200 ------------TIDCRYNLTFRQAG 213 + R NL RQ Sbjct: 330 AEDGKYEEWRGWMKNKRINLNVRQMR 355 >UniRef50_A7AWB3 Putative uncharacterized protein n=1 Tax=Babesia bovis RepID=A7AWB3_BABBO Length = 336 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 48/202 (23%), Positives = 74/202 (36%), Gaps = 18/202 (8%) Query: 17 AGAVILRRFAFNAA-EQLIRDINDVASQSP----FRQMVTPGGYTMSVAMTNCGHLGWTT 71 G ++R F + L+ + P Q L W T Sbjct: 89 PGLYLVRDFFTKEQCDALLLETLVDYINPPNNSNLYQNDPNVATPFW-PSPVFSKLRWAT 147 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD--ACLINRYAPGAKLS 129 Y + K + P ++ + + D+ PD A +IN Y+ L Sbjct: 148 IGHMYDWGTRTY---KGYTKFPGLLVDVTRDLLSHFN-EDYIPDVCAAIINFYSKAYFLR 203 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 LH+D E + ++++SLG PAIF GG + ++E G VV+ +SR HGI Sbjct: 204 LHKDDAE-ETDDSVLNISLGAPAIFMLGGTDHSTIPVSFVVESGSVVLMADKSRFCLHGI 262 Query: 190 QPL----KAGFHPLTIDCRYNL 207 L K G+ P NL Sbjct: 263 VKLLSYNKPGYQPSGGLP-INL 283 >UniRef50_Q96BT7 Alkylated DNA repair protein alkB homolog 8 n=32 Tax=Euteleostomi RepID=ALKB8_HUMAN Length = 664 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 47/235 (20%), Positives = 82/235 (34%), Gaps = 40/235 (17%) Query: 5 FADAEPWQE----PLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 F + W+E L G +++ + E+++ + D + + S+ Sbjct: 119 FVEKVQWKELRPQALPPGLMVVEEIISSEEEKMLLESVDWTEDTDNQN------SQKSLK 172 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLI 119 H G + Y + +KP +P + ++ GY +PD I Sbjct: 173 HRRVKHFG-----YEFHYENNNVDKDKPLSGGLPDICESFLEKWLRK-GYIKHKPDQMTI 226 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N+Y PG + H D IVS+SLG + F + ++L ++V Sbjct: 227 NQYEPGQGIPAHIDTHSA-FEDEIVSLSLGSEIVMDFKHP--DGIAVPVMLPRRSLLVMT 283 Query: 180 GESR-LFYHGIQPLKAGFHPLT-------------------IDCRYNLTFRQAGK 214 GESR L+ HGI K + R + TFR+ + Sbjct: 284 GESRYLWTHGITCRKFDTVQASESLKSGIITSDVGDLTLSKRGLRTSFTFRKVRQ 338 >UniRef50_D1Z416 Whole genome shotgun sequence assembly, scaffold_2 n=1 Tax=Sordaria macrospora RepID=D1Z416_SORMA Length = 358 Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 68/206 (33%), Gaps = 45/206 (21%) Query: 46 FRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT 105 F ++ L W T Y + + P P+ + Sbjct: 159 FEPKDPSVHKPLTFQQVFNRKLHWVTLGGQYDW-TNRVYPGELPPEFPKDISGFLETL-- 215 Query: 106 AAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQ-----FGGLK 160 +P+ A ++N Y PG + +H+D E + ++S+S+G ++F +G + Sbjct: 216 ---FPETLAQAAIVNFYTPGDTMMMHRDVSE-ETDKGLISLSIGCDSLFMICPEDWGKVS 271 Query: 161 RNDPLKR---------------LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI---- 201 + K L L GDV+ ESR +HG+ + G P + Sbjct: 272 AEEKQKETKESETGKSEKKFLLLRLRSGDVIYMTKESRFAWHGVPKIFKGTCPEWLEDWP 331 Query: 202 --------------DCRYNLTFRQAG 213 + R N+ RQ Sbjct: 332 AEDGKYEAWRGWMKNKRININVRQMR 357 >UniRef50_A7SSH3 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SSH3_NEMVE Length = 648 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 49/227 (21%), Positives = 71/227 (31%), Gaps = 51/227 (22%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G I F E+ + D + + H G Sbjct: 139 PPGLQIYEEFINEEEEKTLLDALGWDAP------------QKELRHRRVKHYG-----YE 181 Query: 76 YLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 +LY D KP P MP +++ R + + PD +N Y PG + H D Sbjct: 182 FLYGTNDIDRAKPLPGGMPAVCNDILTRMVSQGAVQN-TPDQLTVNEYLPGQGIPPHVDT 240 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLK 193 I S+SLG F + +LL ++V GESR L+ HGI P K Sbjct: 241 HSA-FEDGICSLSLGAKISMDFRHP--DSRHVSVLLPRRSLLVMSGESRYLWTHGITPRK 297 Query: 194 ----------------------------AGFHPLTIDCRYNLTFRQA 212 +G + R +LTFR+ Sbjct: 298 FDIIGSGLDTSIHEDQESIAADASNVSTSGVTQYERERRISLTFRKI 344 >UniRef50_C5L3Y2 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5L3Y2_9ALVE Length = 325 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 50/211 (23%), Positives = 73/211 (34%), Gaps = 30/211 (14%) Query: 11 WQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 + L G ++ F E+ + + D + S+ Sbjct: 101 TTDELPPGLTLIPDFITEEEEEKLLGLVDAGE------------WDHSIRRRV------Q 142 Query: 71 THRQGYLYSPIDPQTN--KPWPAMPQSFHNLCQR--AATAAGYPDFQPDACLINRYAPGA 126 + Y+ + + MP L +R A + A DF+PD IN Y PG Sbjct: 143 HFGHAFDYTSLRAKDAFLDGEARMPAYTEELVRRIRAESVAEARDFRPDQLTINEYIPGV 202 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-F 185 +S H D PIV +S+G + +F L L + V GGESR + Sbjct: 203 GISFHVDTHSA-FEGPIVILSIGGGIVLEFR-KSEEGRALPLWLPRRSLAVMGGESRFGW 260 Query: 186 YHGIQPLKAGFHPLTID-----CRYNLTFRQ 211 HGI K D R +LTFRQ Sbjct: 261 VHGIAGRKTDRVGPDGDLVERQRRISLTFRQ 291 >UniRef50_A4SZF3 DNA-N1-methyladenine dioxygenase n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SZF3_POLSQ Length = 209 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 39/149 (26%), Positives = 62/149 (41%), Gaps = 17/149 (11%) Query: 69 WTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W + Y YS + Q P + L + P + ++CL+N Y G Sbjct: 70 WIGDPKCTYTYSGVKKQPQSWTPELLIIKRQLEE-------LPQAEFNSCLLNFYHDGAD 122 Query: 127 KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES-RL 184 + H D + E D ++PI S+SLG F F K++ L LE+G ++ + + Sbjct: 123 GMGWHSDDEKELDAQSPIASLSLGSARKFSFKH-KKDKSTTSLFLENGSALIMHAPTQQF 181 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 + H + K P R NLTFR+ Sbjct: 182 WQHALLKTKTIHTP-----RINLTFRRIS 205 >UniRef50_A2QVX5 Contig An11c0110, complete genome n=18 Tax=Eurotiomycetidae RepID=A2QVX5_ASPNC Length = 360 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 35/147 (23%), Positives = 58/147 (39%), Gaps = 8/147 (5%) Query: 55 YTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP 114 ++V L W T Y ++ + +P P P+ L A +P + Sbjct: 181 KPLTVQSILNRKLRWVTLGGQYDWTAKVYPSERP-PEFPRDIAKLLH-----AMFPATEA 234 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 A ++N Y+ G LS H+D E D ++SVS G +F + + + L GD Sbjct: 235 QAAILNVYSAGDHLSPHRDVSE-DCDVGLISVSFGCDGLFLISH-DDGEHCEIIRLRSGD 292 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTI 201 V G SR +H + + P + Sbjct: 293 AVYMDGTSRFAWHAVPKIVPNTCPKWL 319 >UniRef50_A4S344 Predicted protein n=2 Tax=Mamiellales RepID=A4S344_OSTLU Length = 348 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 10/113 (8%) Query: 111 DFQPDACLINRYAPGAKLSLHQDKDEPDL-----RAPIVSVSLGLPAIFQFGGLKRNDPL 165 + P CL+N Y GA+ H+D ++P L PIVS S+GL F + + Sbjct: 211 NMNPTTCLVNFYKDGAEFKWHKDSEDPKLVKSRTGPPIVSFSVGLSGDFGYKYSFDDPEH 270 Query: 166 KRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP-----LTIDCRYNLTFRQAG 213 K + L GDV+++GG SR+ H + + G P ++ R N+T R G Sbjct: 271 KVVRLNSGDVLLFGGPSRMIVHSVLNVYPGSMPGHLRGKMLNGRLNVTVRDIG 323 >UniRef50_A8P5P3 Putative uncharacterized protein n=1 Tax=Brugia malayi RepID=A8P5P3_BRUMA Length = 576 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 47/215 (21%), Positives = 77/215 (35%), Gaps = 28/215 (13%) Query: 9 EPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLG 68 EP +P +L F E + + P G T + H G Sbjct: 120 EPLYKP--NNLWVLPDFINPDEEAALITVIQDY---------LPRGKT--LKNRKVIHFG 166 Query: 69 WTTHRQGYLYSPIDPQTNKPWP-AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK 127 + + Y + + +P P +P + R A + + +PD +N Y PG Sbjct: 167 FE-----FNYD-NNMASEQPSPDPIPSVCQPVIDRMLGAGIFKE-KPDQVTVNIYEPGNG 219 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FY 186 + H D I S+SL + +F + +LL + V GESR + Sbjct: 220 IPSHVDTHSA-FSDTIASLSLLSDLVMEFRDFANTSTIYDVLLPRFSLTVMRGESRYRWK 278 Query: 187 HGIQPLKAGFHPLT-----IDCRYNLTFRQAGKKE 216 HGI K +P+T R + TFR +++ Sbjct: 279 HGIAKRKYDINPVTNKLMARQLRVSFTFRNVIREK 313 >UniRef50_C4Q8H0 Expressed protein n=2 Tax=Schistosoma RepID=C4Q8H0_SCHMA Length = 334 Score = 113 bits (284), Expect = 3e-24, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 69/211 (32%), Gaps = 39/211 (18%) Query: 19 AVILRRFA--FNAAEQLIRDINDVASQSPFRQ----MVTPGGYTMSVAMTNCGHLGWTTH 72 IL+ F + + + + SP Q P ++ L W T Sbjct: 65 FFILKNFFSPIEIEDLWLAALTEWC-HSPTAQCNLGTNVPPNSRIACTDPWYSKLRWITL 123 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGY-------------------PDFQ 113 Y +S +K P + + ++ Sbjct: 124 GYHYQWSERVYNESKVG-EFPSLLYGTTVNIINFLKHLIEGREISSSNSSRLLEQCQNYT 182 Query: 114 PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRN----------- 162 P+A ++N Y + H D E D AP+VS+S G A+F + Sbjct: 183 PEASIVNYYRTKTTMGFHSDDAEVDKEAPLVSISFGPTALFLLETSEAIKHEFDAPLHGS 242 Query: 163 -DPLKRLLLEHGDVVVWGGESRLFYHGIQPL 192 D + + L HGDVV+ G+SRL H + + Sbjct: 243 FDHVLPIYLHHGDVVIMAGKSRLARHAVPVI 273 >UniRef50_B6KBE4 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KBE4_TOXGO Length = 927 Score = 113 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 9/110 (8%) Query: 91 AMPQSFHNLCQRAATAAG------YPDFQP--DACLINRYAPGAKLSLHQDKDEPDLRAP 142 +P + LC +P +A ++N Y G +L H+D E AP Sbjct: 647 GLPLALEKLCDDILQFTEPFLQGPKEGRRPRMNAAILNVYRKGDRLRGHRDDAERA-EAP 705 Query: 143 IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPL 192 ++S+SLG PAIF GG R K L+L GDV+V G +R HG+ L Sbjct: 706 LISISLGQPAIFLLGGDSRRVAPKALVLRSGDVLVLSGAARWAVHGVPKL 755 >UniRef50_B8HYX6 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HYX6_CYAP4 Length = 204 Score = 112 bits (280), Expect = 8e-24, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 13 EPLA--AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 +PL+ G + F A E+ + + D Q P+ + Sbjct: 19 KPLSSVPGLRYIPNFINPAVEKTLLEEID---QQPWITDL---------------KRRVQ 60 Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSL 130 + Y Y +P+ L R GY PD ++N Y PG ++ Sbjct: 61 HYGYRYDYKARAISPEAYLGTLPEWLKPLTNRLWQE-GYIPDLPDQVIVNEYIPGQGITA 119 Query: 131 HQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGI 189 H D + I+S+SLG I +F + L+LE +VV G++R + H I Sbjct: 120 HIDCID-CFSDTILSLSLGSDCIMRFTAPSHT--TEDLVLERRSLVVLQGDARYQWQHSI 176 Query: 190 QPLKAG---FHPLTIDCRYNLTFRQA 212 K+ R +LTFR+ Sbjct: 177 PARKSDLIKGQKQARSRRISLTFRKV 202 >UniRef50_A1SVH4 DNA-N1-methyladenine dioxygenase n=12 Tax=Bacteria RepID=A1SVH4_PSYIN Length = 210 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 55/225 (24%), Positives = 83/225 (36%), Gaps = 35/225 (15%) Query: 2 LDLFA--DAEPWQEPLAAGAVILRRFAFNAAEQ-----LIRDINDVASQSPFRQMVTPGG 54 +DLFA + P G V + ++ + + Sbjct: 4 MDLFAALEDSPINIINCDGVVEYHGLLIPFDQANHYFGVLLETIQW------KHDQANIL 57 Query: 55 YTMSVAMTNCGHLGWTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 + V W + Y YS + + PW L Q+ A G+ Q Sbjct: 58 GQIIVTQRKVA---WHADKPFHYTYSNMT-KVALPWT---LELLQLKQKVEDATGH---Q 107 Query: 114 PDACLINRYAPGA-KLSLHQDKDEPDLR--APIVSVSLGLPAIFQFGGLKRNDPLKRLLL 170 +ACL+N Y G ++ H D E DL+ A I S+S G F F N + L Sbjct: 108 FNACLLNLYHSGQEGMAWHSD-AEKDLQKNAAIASLSFGAERKFSFKHKV-NQKTISVSL 165 Query: 171 EHGDVVVWGGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 +HG ++V GG++ R + H + P K P R NLTFR + Sbjct: 166 QHGSLLVMGGDTQRHWLHRLPPTKKVTTP-----RINLTFRMINE 205 >UniRef50_Q9U3P9 Protein C14B1.10, partially confirmed by transcript evidence n=2 Tax=Caenorhabditis RepID=Q9U3P9_CAEEL Length = 591 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 39/162 (24%), Positives = 62/162 (38%), Gaps = 8/162 (4%) Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 ++ + H + YS K +P ++L R + Y +PD Sbjct: 165 SVQSLKHRAVVHFGHVFDYSTNSASEWKEADPIPPVINSLIDRLISD-KYITERPDQVTA 223 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y G + H D PIVS+SL + +F + + +LL+ + + Sbjct: 224 NVYESGHGIPSHYDTHSA-FDDPIVSISLLSDVVMEFKDGANSARIAPVLLKARSLCLIQ 282 Query: 180 GESRL-FYHGIQPLKAGFHPLT-----IDCRYNLTFRQAGKK 215 GESR + HGI K P T R +LT R+ +K Sbjct: 283 GESRYRWKHGIVNRKYDVDPRTNRVVPRQTRVSLTLRKIRRK 324 >UniRef50_A5WBM5 DNA-N1-methyladenine dioxygenase n=5 Tax=Moraxellaceae RepID=A5WBM5_PSYWF Length = 212 Score = 110 bits (274), Expect = 5e-23, Method: Composition-based stats. Identities = 48/164 (29%), Positives = 71/164 (43%), Gaps = 26/164 (15%) Query: 51 TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP 110 TP T +++ T GH PI+P PA+ H + Q+ Sbjct: 67 TPSASTQALSYTYSGHTR-----------PIEPW----HPAVFHVKHMIEQQLQPLKICT 111 Query: 111 DFQPDACLINRYAPGAKLSLHQDKDEPDLR-APIV-SVSLGLPAIFQFGGLKRNDPLKRL 168 F ++CL+N Y G + + DEP+L PI+ S+SLG F F K D ++ L Sbjct: 112 QF--NSCLLNYYPSGEEGMGYHADDEPELGYQPIIASLSLGATRKFVFKHKKTQDKVE-L 168 Query: 169 LLEHGDVVVWGGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 LE G +VV G++ + + H I K R +LTFR Sbjct: 169 YLESGQLVVMRGDTQQYWKHSITKTKK-----VDTGRISLTFRH 207 >UniRef50_UPI000175883A PREDICTED: similar to alkB, alkylation repair homolog 2 n=2 Tax=Coelomata RepID=UPI000175883A Length = 197 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 43/137 (31%), Positives = 60/137 (43%), Gaps = 22/137 (16%) Query: 87 KPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK-DEPDLRAPIV 144 KPW NL +R F + LINRY G + H+D E D PI Sbjct: 67 KPWTETLIQVRNLIKRV------TGFDYNFVLINRYRDGNDHIGEHKDNESELDKNTPIA 120 Query: 145 SVSLGLPAIFQF--------GGLKRNDPLKRLLLEHGDVVVWGGES-RLFYHGIQPLKAG 195 S+SLG +F F GG KR+ P ++ L+HG +++ + +YH + P K Sbjct: 121 SLSLGQQRLFVFKHQDCRKKGGAKRSVPPVKIQLQHGSLLLMNPPTNNYWYHALPPAKRA 180 Query: 196 FHPLTIDCRYNLTFRQA 212 R NLTFR+ Sbjct: 181 -----PGARINLTFRKI 192 >UniRef50_B7QP17 Methyltransferase, putative n=1 Tax=Ixodes scapularis RepID=B7QP17_IXOSC Length = 602 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 64/212 (30%), Gaps = 34/212 (16%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G ++R A E L+ + Sbjct: 108 PPGLRLVREAVDEAEEALLWRLVSWDRD-----------------CRALKQREVRHFGYA 150 Query: 76 YLYSPIDPQTNKPW-PAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 + Y + + P +P+ R A+G+ PD + RY PG + H D Sbjct: 151 FDYELQGVRKDAPLAEPIPEECAPFLGRLV-ASGHLSGLPDQLTVTRYLPGQGIPPHVDS 209 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 IV +SLG P + F + +LL ++ G SR + HG K Sbjct: 210 H-GSFEDGIVCLSLGSPVVMDFRHPDGDRA--AVLLPPRSALLLHGPSRYIWTHGTASRK 266 Query: 194 AGFHPLT-----------IDCRYNLTFRQAGK 214 + P T R + TFR+ + Sbjct: 267 SDVVPRTEVPGQGLTLSPRGVRISFTFRRIRR 298 >UniRef50_B6GZZ6 Pc12g09870 protein n=4 Tax=Eurotiomycetidae RepID=B6GZZ6_PENCW Length = 360 Score = 106 bits (265), Expect = 5e-22, Method: Composition-based stats. Identities = 36/156 (23%), Positives = 58/156 (37%), Gaps = 8/156 (5%) Query: 46 FRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT 105 F+ ++V L W T Y ++ + P P L Sbjct: 163 FQPKEPDVHKPLTVQNFLEKKLRWVTLGGQYDWTAKVYPSGPPPEFPPDIAKVLR----- 217 Query: 106 AAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL 165 A +P A ++N Y+ G LS+H+D E ++SVS G +F N Sbjct: 218 -AAFPATSAQAAILNLYSAGDTLSVHRDVSEECDVG-LISVSFGCDGLFLASHDDGNG-C 274 Query: 166 KRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI 201 + + L GD V G+SR +HG+ + P + Sbjct: 275 EIIRLRSGDTVYMSGKSRFAWHGVPKILPSTCPKWL 310 >UniRef50_A7EDX1 Putative uncharacterized protein n=2 Tax=Sclerotiniaceae RepID=A7EDX1_SCLS1 Length = 359 Score = 106 bits (264), Expect = 6e-22, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 59/163 (36%), Gaps = 17/163 (10%) Query: 52 PGGYTMSV--AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGY 109 P S+ A W T Y ++ ++ P P P L A Sbjct: 163 PNSKHKSLNTAQALQKKFRWLTLGAQYNWNTRAYPSSSPTP-FPADVSRLVTTLFQNA-- 219 Query: 110 PDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGL---------K 160 F P++ ++ Y+ + +H+D E R + S +LG +F Sbjct: 220 --FTPESGVVLMYSTKDFMPVHRDVSEECERG-LASFTLGCDGLFLISRDVKKGEEHVSD 276 Query: 161 RNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC 203 R L + + GDV+ GGE+R +H + + AG P + Sbjct: 277 REQDLVCIRVRSGDVIQMGGETRWAWHAMPKIIAGTCPPCLSE 319 >UniRef50_Q12QK9 DNA-N1-methyladenine dioxygenase n=5 Tax=Shewanella RepID=Q12QK9_SHEDO Length = 255 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 53/203 (26%), Positives = 77/203 (37%), Gaps = 27/203 (13%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPF-RQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 + L F +Q + D A PF R + G + W Sbjct: 65 SPPVTWLTGFLSVQEQQ---ALLDDAKSYPFERPQIEVYGKLHPIPRQQV----WFADED 117 Query: 75 -GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 GY Y+ + + PWPA+ L QR G + L+N YA G + H Sbjct: 118 CGYRYASL-FISPTPWPAL---LMQLRQRLQAELGL---VFNGVLVNFYADGQDTVGWHS 170 Query: 133 DKDEPDLRAP--IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHGI 189 D DE ++R P I S+S+G FQ KR+ L L GD+++ G + + H + Sbjct: 171 D-DEAEIRKPSSIASISIGATRDFQIRH-KRSQETFTLPLVSGDLLIMQPGMQQTWQHAV 228 Query: 190 QPLKAGFHPLTIDCRYNLTFRQA 212 P R NLTFR+ Sbjct: 229 PRRAKVKAP-----RINLTFREL 246 >UniRef50_B8HQU9 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HQU9_CYAP4 Length = 207 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 36/149 (24%), Positives = 57/149 (38%), Gaps = 8/149 (5%) Query: 68 GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK 127 + Y Y + ++P L + Y PD ++N Y PG Sbjct: 59 RVQHYGYKYDYKSRGVDKSMYIASLPIWAKELAHKIRK--KYTTDLPDQVIVNEYMPGQG 116 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FY 186 ++ H D IVS+SL + F ++ K L+LE +VV G++R + Sbjct: 117 IANHIDCV-NCFTDTIVSLSLCSSCVMDFVHIETGAR-KSLMLEPRSLVVLSGDARYKWL 174 Query: 187 HGIQPLKAG---FHPLTIDCRYNLTFRQA 212 HGI K+ R +LTFR+ Sbjct: 175 HGIAKRKSDMYKGEKYIRKRRVSLTFRKV 203 >UniRef50_D0MUQ0 Alkylated DNA repair protein alkB 8 n=1 Tax=Phytophthora infestans T30-4 RepID=D0MUQ0_PHYIN Length = 640 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 61/201 (30%), Gaps = 21/201 (10%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G F A E F + + ++ H G + Sbjct: 143 PGLKFGAEFVTEAQEAACLA---------FFERENGAHWANTIRARQVQHFG-----YEF 188 Query: 77 LYSPIDPQTNKPW-PAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y ++P +P+ + + +PD +N Y PG ++ H D Sbjct: 189 NYDTRRCDPDQPMKEPIPEVLQPVIDKIVECGIMDGDRPDQITVNEYLPGQGIAFHLDTH 248 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK- 193 I S+S+ + F + +LL + V G SR + H I P Sbjct: 249 SA-FTTTIASLSICSEVVMDFRHPD-GVRNEGVLLPARSLAVMSGASRYKWEHAIVPRTF 306 Query: 194 --AGFHPLTIDCRYNLTFRQA 212 + R ++TFR+ Sbjct: 307 DVIDGKQIPRQRRVSITFRKI 327 >UniRef50_C6XT27 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT27_PEDHD Length = 201 Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats. Identities = 46/200 (23%), Positives = 70/200 (35%), Gaps = 19/200 (9%) Query: 14 PLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHR 73 P+ A F A + ++ Q ++Q + G Sbjct: 14 PIPGEAFFYPGFFTEAESD--QYFQELTHQVTWKQEPIKVFGKDILQPRFTAFYG--DEA 69 Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 Y YS AMP L + D + + CL+N Y G + H+ Sbjct: 70 TSYSYS------GITLNAMP-WIDTLTRIKENIETKFDVEFNTCLLNHYRSGADSIGWHR 122 Query: 133 DKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW-GGESRLFYHGIQ 190 D ++ + P I SVS G P IFQF P+ + L HG +++ L+ H + Sbjct: 123 DNEKNLGQYPFIASVSFGAPRIFQFRHYTDKIPIISVELTHGSLLIMKADTQHLWEHRLP 182 Query: 191 PLKAGFHPLTIDCRYNLTFR 210 + P R NLTFR Sbjct: 183 KILRPVGP-----RINLTFR 197 >UniRef50_B2SPH7 DNA repair system specific for alkylated DNA n=17 Tax=Xanthomonadaceae RepID=B2SPH7_XANOP Length = 202 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 50/198 (25%), Positives = 79/198 (39%), Gaps = 23/198 (11%) Query: 21 ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG-YLYS 79 R + +A + V + ++ G S +++ W + Y YS Sbjct: 14 WWRGWLPHAQADALMQALLVQAHWQLHRIRMFGRMVDSPRLSS-----WIGDPEASYRYS 68 Query: 80 PIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPD 138 + +PW + + R G+ + ++ LINRY G + H D DEP+ Sbjct: 69 GTR-FSPQPWLEV---LQPVRLRLEDETGH---RFNSVLINRYRSGSDAMGWHSD-DEPE 120 Query: 139 LRAP--IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQPLKAG 195 L A I SVSLG F F + L L HGD+++ GG+++ Y H + Sbjct: 121 LGAQPLIASVSLGARRRFAFKHRDDASVKQALELGHGDLLLMGGQTQRHYRHALPRTAK- 179 Query: 196 FHPLTIDCRYNLTFRQAG 213 + R NLTFRQ Sbjct: 180 ----PVGERINLTFRQVA 193 >UniRef50_C3NYZ8 Alkylated DNA repair protein n=28 Tax=Bacteria RepID=C3NYZ8_VIBCJ Length = 202 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 59/143 (41%), Gaps = 17/143 (11%) Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 +GY YS + P L + AA P ++ L N Y G + HQ Sbjct: 72 KGYRYSGLSLSAQ----PFPPPLLTLKTQCEQAAQAP---FNSVLANLYRDGQDSMGWHQ 124 Query: 133 DKDEPDLRA-PIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES-RLFYHGI 189 D +EP+L + P++ S+SLG F K + L HGD+++ G + + H I Sbjct: 125 D-NEPELGSNPVIASLSLGESRRFLLRHHKDHALQVECELNHGDLLIMAGNTQHFWQHAI 183 Query: 190 QPLKAGFHPLTIDCRYNLTFRQA 212 + T R NLTFR Sbjct: 184 PKTRQ-----TKQTRINLTFRNI 201 >UniRef50_B8M368 Oxidoreductase, 2OG-Fe(II) oxygenase family, putative n=1 Tax=Talaromyces stipitatus ATCC 10500 RepID=B8M368_TALSN Length = 332 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 57/176 (32%), Gaps = 42/176 (23%) Query: 55 YTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP 114 M++ L W T Y ++ +P P P+ +L + A +P+ + Sbjct: 181 KPMTIQNMLDKKLRWITFGGQYNWTTKVYPEGQP-PPFPEDIAHLLR-----ASFPETEA 234 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 A ++N Y+ +G A+F + + + L GD Sbjct: 235 QAAIVNFYSANDTF-------------------IGCDALFMISH-DDGEGCEVIRLRSGD 274 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTI----------------DCRYNLTFRQAGK 214 V G+SR +HG+ + P + R NL RQ + Sbjct: 275 AVYMSGQSRFAWHGVPKIIPDTCPKWLCDWPGPGYPYWQGWMGRKRVNLNVRQMME 330 >UniRef50_Q3AYK8 DNA-N1-methyladenine dioxygenase n=12 Tax=Cyanobacteria RepID=Q3AYK8_SYNS9 Length = 211 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 36/143 (25%), Positives = 50/143 (34%), Gaps = 15/143 (10%) Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSL 130 Y YS + WP A A Q + CL+N Y G ++ Sbjct: 79 QGLQYRYSGA-IHVGEGWPEWFHPLVEQVNHIAQA------QFNGCLLNLYRDGDDRMGW 131 Query: 131 HQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHG 188 H D + E D PI S+SLG F F + L GD+++ G + H Sbjct: 132 HADDEPEIDQTQPIASLSLGSTRDFLFRHRGDQPKRAAIPLADGDLLIMHPGCQGHWMHS 191 Query: 189 IQPLKAGFHPLTIDCRYNLTFRQ 211 + + R NLTFR Sbjct: 192 VPQRRK-----VKTMRINLTFRH 209 >UniRef50_Q2BPN4 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BPN4_9GAMM Length = 195 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 40/142 (28%), Positives = 58/142 (40%), Gaps = 17/142 (11%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y YS + T W + + L + A+ + +A LIN Y G + H+D Sbjct: 68 YRYSGLT-LTASGWHPVVKKIKELAEAASNT------EFNAVLINLYRDGQDSMGWHKDD 120 Query: 135 D-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLFYHGIQPL 192 + E IVSVSLG F + LLL G ++V G E + + H I Sbjct: 121 EPELGPEPTIVSVSLGATRRFLLRAADKT--QHELLLNSGSLLVMGPELQKHWQHSIPKT 178 Query: 193 KAGFHPLTIDCRYNLTFRQAGK 214 + P R NLTFR+ + Sbjct: 179 RKQIGP-----RINLTFRKIVQ 195 >UniRef50_D1I753 Whole genome shotgun sequence of line PN40024, scaffold_87.assembly12x (Fragment) n=18 Tax=Embryophyta RepID=D1I753_VITVI Length = 912 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 68/240 (28%), Gaps = 48/240 (20%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 D + E G +L F E+ + D S S++ Sbjct: 676 DSVPVSLVDSELNIPGIYLLHDFVSAKEEEELLAAVDKMSW-------------KSLSKR 722 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP---DFQPDACLI 119 H G + Y + T + +P + +R ++ D D + Sbjct: 723 RVQHYG-----YEFCYETRNVNTKQYLGKLPSFVSAIVERISSFPNLESAADIVLDQLTV 777 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL-------------- 165 N Y PG LS H D I S+SL P I F Sbjct: 778 NEYPPGVGLSPHIDTHSA-FEGFIFSLSLAGPCIMDFRRYTEGVWPKSASSSDMSVEYPD 836 Query: 166 -------KRLLLEHGDVVVWGGESRLFYHGIQP-----LKAGFHPLTIDCRYNLTFRQAG 213 + + L +++ GE+R +H P + R + TFR+ Sbjct: 837 KSSSFLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRKVR 896 >UniRef50_Q07GB6 Oxidoreductase, putative n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GB6_ROSDO Length = 195 Score = 100 bits (249), Expect = 3e-20, Method: Composition-based stats. Identities = 36/148 (24%), Positives = 56/148 (37%), Gaps = 5/148 (3%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 + Y Y +P+ F +L +R TA G+ PD ++N Y PG Sbjct: 46 KRRVQHYGYRYDYKARQAWREDYLGPLPELFQSLAERL-TAEGHFQTVPDQVIVNEYQPG 104 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLF 185 +S H D +P I S+SL + +F + ++ L +V+ L+ Sbjct: 105 QGISAHIDC-QPCFGETIASLSLLSACVMRFASRIYSQQMELHLQPSSLLVLQSDARHLW 163 Query: 186 YHGIQPLKAGF---HPLTIDCRYNLTFR 210 H I P K R +LTFR Sbjct: 164 THAIPPRKTDVFEGQKYARARRISLTFR 191 >UniRef50_Q5UR03 Uncharacterized protein L905 n=1 Tax=Acanthamoeba polyphaga mimivirus RepID=YL905_MIMIV Length = 210 Score = 99.7 bits (247), Expect = 6e-20, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 75/206 (36%), Gaps = 29/206 (14%) Query: 17 AGAVILRRFAFNAAEQ-LIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G I+ + E+ L++ IN+ + V Sbjct: 14 NGFSIIHDYVTPDQEKKLLKKINE----------------SEWVV-----DYQRRLQYYN 52 Query: 76 YLYSPIDPQTNKPWP-AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 Y +P P P +P+ L + +PD ++N Y PG L H D+ Sbjct: 53 YRNELFEPYDLIPIPNKIPKYLDQLINQMILDKIIDQ-KPDQIIVNEYKPGEGLKPHFDR 111 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 + + I+ +SLG I +F K K++ + + + ++R + HGI P K Sbjct: 112 KDY-YQNVIIGLSLGSGTIMEFYKNKPIPEKKKIYIPPRSLYIIKDDARYIWKHGIPPRK 170 Query: 194 ---AGFHPLTIDCRYNLTFRQAGKKE 216 + + R ++TFR K++ Sbjct: 171 YDEINGKKIPRETRISITFRNVIKEK 196 >UniRef50_B6HH87 Pc20g14010 protein n=10 Tax=Leotiomyceta RepID=B6HH87_PENCW Length = 230 Score = 99.3 bits (246), Expect = 7e-20, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 66/206 (32%), Gaps = 33/206 (16%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G F EQ + + + P R + Sbjct: 14 PHGIFWQDNFIDAEHEQRLISVFTNELEWPDRPGRVS-----------------LHYGYS 56 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 + Y + P+ P L D PD + Y PGA + H D Sbjct: 57 FDYKTFGIDPDIPYKEFPGWLQPLIPT------TEDRPPDQVCLQYYPPGAGIPPHVDAH 110 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKA 194 +P + ++S+G PA F +R + + L ++ G+SRL + HGI+ K Sbjct: 111 KPY--DQLYALSIGAPATMIF---RRGEERIEVDLTPRSMMQMSGDSRLHWTHGIRKRKN 165 Query: 195 GFHP----LTIDCRYNLTFRQAGKKE 216 P R+++T+R E Sbjct: 166 DTLPDGTVRPRGERWSITYRWLRDGE 191 >UniRef50_A9T8I6 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T8I6_PHYPA Length = 320 Score = 99.3 bits (246), Expect = 7e-20, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 55/135 (40%), Gaps = 9/135 (6%) Query: 82 DPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLR 140 + +KP +P+ L + A D + L+N YA G +S H D + Sbjct: 157 ENVYHKPPRPIPRCLQELKRCVEQAT---DEYYNFVLVNFYADGTHSISPHSDDESFLGT 213 Query: 141 AP-IVSVSLGLPAIFQFGGLKRNDP-LKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFH 197 P I S+SLG F R D ++ L GD+VV G ++ ++H I Sbjct: 214 NPCIASLSLGGTRDFVMKHKTRKDVNSEKFALRSGDMVVMRGTTQANWFHSIPKRTGKTQ 273 Query: 198 PLTIDCRYNLTFRQA 212 R N+TFR+ Sbjct: 274 ATAP--RINVTFRKC 286 >UniRef50_B0SGN3 Alkylated DNA repair protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SGN3_LEPBA Length = 202 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 18/145 (12%) Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSLH 131 Y YS +T PW +L + + + ++CL+N Y G+ ++ H Sbjct: 69 GYSYRYSGTT-KTAIPWT---NELLDLKKEVESET---NEIFNSCLLNLYHDGSEGMAWH 121 Query: 132 QDKDEPDLRAP--IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHG 188 D DE L+ I SVSLG IF+F K+N + L LE G +++ GE + + H Sbjct: 122 SD-DETSLQKHSTIASVSLGAERIFRFKHKKKN-SVVELPLEPGSLLLMKGEIQEHWLHS 179 Query: 189 IQPLKAGFHPLTIDCRYNLTFRQAG 213 + P R NLTFRQ G Sbjct: 180 LPKALKVKRP-----RVNLTFRQFG 199 >UniRef50_C6W3C2 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID=C6W3C2_DYAFD Length = 202 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 50/216 (23%), Positives = 77/216 (35%), Gaps = 20/216 (9%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L LF E P F + I ++ P+RQ V A Sbjct: 4 LSLFGSEETLLFP-ENLLEYYPGFVPPDESAAL--IGKWITEVPWRQQVMQMYGKQVTAP 60 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 G T + + +P W + L +R G F ++ L+N Sbjct: 61 RLMAWYGDTEKSYTFSGTRFEPYG---WT---KELAALKKRIEEKTG---FTFNSVLLNY 111 Query: 122 YAPG-AKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 Y G ++ H D ++ R P++ SVSLG F+F + L LE+G +++ Sbjct: 112 YRDGNDSVAWHGDNEQELGRNPVIASVSLGQERRFEFRYRADHSRKYGLPLENGSLLIMK 171 Query: 180 GE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 G+ + H I K P R NLTFR + Sbjct: 172 GDLQHTWEHRIPKSKTQNAP-----RINLTFRTIQR 202 >UniRef50_A4CQ67 Alkylated DNA repair protein n=2 Tax=Flavobacteriaceae RepID=A4CQ67_9FLAO Length = 197 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 43/194 (22%), Positives = 68/194 (35%), Gaps = 19/194 (9%) Query: 23 RRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPID 82 F + + ++ SQ+P+RQ G Y ++ Sbjct: 20 PGFLLPKEAESL--FGEIKSQTPWRQDTIRLFGKTFQQPRLTALYGKNGQAYTYSGILME 77 Query: 83 PQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRA 141 P P +L R + AAG + CL+N Y G H D + Sbjct: 78 PLPFTPL------LEDLLHRVSIAAGE---KFTTCLLNLYRDGSDSNGWHADDEPELGNN 128 Query: 142 PIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES-RLFYHGIQPLKAGFHPL 199 P++ S+SLG F + R+ LE G +++ G + + H + K P Sbjct: 129 PVIASLSLGASRKFHLKHRRIKSQRVRMNLESGSLLLMAGTTQHHWLHQVPKTKRPVGP- 187 Query: 200 TIDCRYNLTFRQAG 213 R NLTFR+ G Sbjct: 188 ----RINLTFRRLG 197 >UniRef50_C8VDQ1 DNA repair family protein (AFU_orthologue; AFUA_5G14250) n=5 Tax=Leotiomyceta RepID=C8VDQ1_EMENI Length = 335 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 44/150 (29%), Positives = 61/150 (40%), Gaps = 21/150 (14%) Query: 80 PIDPQTNKPWP---------AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLS 129 P+D +T +P P +P+ L Q A G + CL+N YA G +S Sbjct: 148 PVDVKTRRPIPDNKYQYTPRPIPKCLDQLRQAVEAAVG-DGSSYNFCLVNYYATGDDSIS 206 Query: 130 LHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDP-----LKRLLLEHGDVVVWGGESR 183 H D + P I S+SLG F P + L GD+VV GE++ Sbjct: 207 YHSDDERFLGPNPSIASISLGAQRDFLMRHKPSQAPGVSNQPLKFSLASGDMVVMRGETQ 266 Query: 184 -LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 + H I K G R N+TFR+A Sbjct: 267 SNWLHSIPKRKGGESQK---GRINITFRKA 293 >UniRef50_C3ZI75 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZI75_BRAFL Length = 844 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 43/149 (28%), Positives = 65/149 (43%), Gaps = 24/149 (16%) Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLH 131 Y +S ++ +PW + + R A G+ + + L+NRY G + H Sbjct: 703 GLSYRFSGVEV-PARPWTPL---MEGIRDRVQEATGH---KFNFVLVNRYKDGNDHMGEH 755 Query: 132 QDKDEPDL--RAPIVSVSLGLPAIFQFGG-------LKRNDPLKRLLLEHGDVVVWGGES 182 +D DE DL API S+SLG F F KR +L LEHG +++ + Sbjct: 756 RD-DEKDLVREAPIASLSLGQKRDFIFKHCDARGKSAKRAMDPVKLELEHGSLLMMNYPT 814 Query: 183 -RLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 R +YH + K + R N+TFR Sbjct: 815 NRYWYHSLPVRKKA-----LGVRINMTFR 838 >UniRef50_C6X2N0 2OG-Fe(II) oxygenase n=4 Tax=Bacteria RepID=C6X2N0_FLAB3 Length = 204 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 46/165 (27%), Positives = 66/165 (40%), Gaps = 21/165 (12%) Query: 54 GYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 G TM + G Y S +T PW A + L + A G Sbjct: 56 GKTM-ITKRKVAWYGDREFSYTYSKST---KTAIPWTA---TLLKLKKMVENATGE---A 105 Query: 114 PDACLINRYAPGA-KLSLHQDKDEPDL--RAPIVSVSLGLPAIFQFGGLKRNDPLKRLLL 170 ++CL+N Y G + H D E DL I S+SLG F F D ++ + L Sbjct: 106 FNSCLLNLYHSGEEGMGWHSD-AEKDLKKNGAIASLSLGAERRFLFKHKHTADKVETV-L 163 Query: 171 EHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 EHG ++V E++ + H + P + P R NLTFR + Sbjct: 164 EHGSLLVMKNETQSFWQHRLPPARKILTP-----RINLTFRSIDE 203 >UniRef50_Q26EI7 Alkylated DNA repair protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26EI7_9BACT Length = 201 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 56/217 (25%), Positives = 80/217 (36%), Gaps = 25/217 (11%) Query: 3 DLFADAEPWQEPLAAGAVILRR--FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 +LF P + V +AF A+QL+ + ++P+RQ Sbjct: 4 NLFPSDFPD---IPDAQVQYDGNFYAFAEAQQLLSKL---LKKTPWRQNKITVYGKEHDE 57 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 G Y YS I PW Q ++A A + CLIN Sbjct: 58 PRLTQLYG--DPGIKYGYSNISYD-ALPWTETLQKIKQDVEKATGAT------FNICLIN 108 Query: 121 RYAPG-AKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 RY G H D ++ PI+ S+SLG F D + L+HG ++V Sbjct: 109 RYRNGQDSNGWHADNEKELGINPIIASISLGQERFFHLKHHHNKDWKFKFPLQHGSLLVM 168 Query: 179 GGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 GE++ Y H I K I R NLTFR+ + Sbjct: 169 AGETQHTYKHQIAKTKR-----LIGERINLTFRKIVQ 200 >UniRef50_C5KK00 Putative uncharacterized protein n=6 Tax=Perkinsus marinus ATCC 50983 RepID=C5KK00_9ALVE Length = 477 Score = 97.3 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 40/145 (27%), Positives = 58/145 (40%), Gaps = 12/145 (8%) Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AK 127 + Q Y YS + PW P L + A G + + C++N Y G Sbjct: 335 YADDGQQYRYSGGPLRVPSPWRRGPIVIDRLRKAVGEACGQ---EFNCCVLNYYRDGSDS 391 Query: 128 LSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LF 185 + LH D ++ P I VSLG F KR+ L G ++V GG ++ L+ Sbjct: 392 IGLHSDDEKVLGVNPSIACVSLGAERDFVL-DAKRDKKKVELTPRSGSLLVMGGSTQKLW 450 Query: 186 YHGIQPLKAGFHPLTIDCRYNLTFR 210 H + K P R +LTFR Sbjct: 451 KHSVPSRKREHRP-----RVSLTFR 470 >UniRef50_B8J7A0 2OG-Fe(II) oxygenase n=3 Tax=Anaeromyxobacter RepID=B8J7A0_ANAD2 Length = 204 Score = 97.3 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 20/156 (12%) Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 +A H G + Y Y +P P P + L +RAA AG L Sbjct: 62 IARRRVAHFG-----RAYAYDARAV---QPGPPFPAALEPLRRRAAALAGVAPAALAEAL 113 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 + RY PGA + H+D +V VSLG PA F+ +LLE G + Sbjct: 114 VTRYPPGAGIGWHRD---APAFGQVVGVSLGAPARFRMREGGPGGRALEVLLEPGSAYLL 170 Query: 179 GGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 G +R + H I P+ A R+++TFR Sbjct: 171 AGAARWRWQHAIPPVPA--------ERWSVTFRTLR 198 >UniRef50_Q6ZEA1 Slr7097 protein n=5 Tax=Bacteria RepID=Q6ZEA1_SYNY3 Length = 208 Score = 96.6 bits (239), Expect = 5e-19, Method: Composition-based stats. Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 17/147 (11%) Query: 69 WTTHR-QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W + Y YS I PW + Q+ + A A ++ L+N Y G Sbjct: 71 WYGDPERSYTYSGI-AMEPTPWIPLLQTIKTKAETLAKAT------FNSVLLNFYRTGTD 123 Query: 127 KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKR-NDPLKRLLLEHGDVVVWGGESR- 183 +S H D + E PI SVS G F L+L G +++ G ++ Sbjct: 124 GVSWHADDEPELKKNYPIASVSFGGTRRFLLKHKTDPTIEKVELILTSGSILLMLGTTQE 183 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 + H + K P R NLTFR Sbjct: 184 YWLHQVPKTKKFVEP-----RINLTFR 205 >UniRef50_C5BKX4 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BKX4_TERTT Length = 207 Score = 96.6 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 48/224 (21%), Positives = 81/224 (36%), Gaps = 36/224 (16%) Query: 3 DLFADAEPWQEPLAAG----AVILRRFAFNAAEQ----LIRDINDVASQSPFRQMVTPGG 54 D+FAD E + G I ++ A + L+ + + + G Sbjct: 4 DIFADTS-RSEIVDLGDNAWLDIFPQWIATAQTRVLFNLLLQECEWEQPA-----IRIAG 57 Query: 55 YTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP 114 + + C W + + + K +P P L + A + Sbjct: 58 RELPIPRLQC----WYGDK-----GAVLRYSGKSFPPHP-WLKALAELNLQLATVCKRRF 107 Query: 115 DACLINRYAPG-AKLSLHQDKDEPDLRA-PIV-SVSLGLPAIFQFGGLKRNDPLKR--LL 169 ++ L+N Y G + H D DEP+L A P++ S+SLG F + Sbjct: 108 NSVLVNCYRDGSDSVGWHAD-DEPELGAKPVIASISLGATRRFSLKHKFDQQQKSSRHIQ 166 Query: 170 LEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 L GD+++ G ++ + H IQ + P R NLTFR Sbjct: 167 LRDGDLLIMRGNTQANWVHAIQKTTSSVGP-----RINLTFRNI 205 >UniRef50_A4C6S7 Putative 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C6S7_9GAMM Length = 208 Score = 96.2 bits (238), Expect = 6e-19, Method: Composition-based stats. Identities = 40/141 (28%), Positives = 60/141 (42%), Gaps = 16/141 (11%) Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLH 131 Y YS + +PW A+ + R + G P +A L+N Y G + H Sbjct: 73 GLEYQYSGLT-MAPEPWSAV---LLAIKNRLSHTFGVP---FNALLVNWYRDGQDSMGWH 125 Query: 132 QDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGI 189 D + R P I S+SLG +F+ K+ + L L+ GD ++ G S+L + H + Sbjct: 126 SDDEPELGREPCIASLSLGASRLFKMR-QKQTLQVYNLQLQSGDCLLMSGRSQLDFQHSL 184 Query: 190 QPLKAGFHPLTIDCRYNLTFR 210 P R NLTFR Sbjct: 185 PK-----QPSVKQGRINLTFR 200 >UniRef50_Q3IHQ7 Putative 2OG-Fe(II) oxygenase superfamily protein n=2 Tax=Alteromonadales RepID=Q3IHQ7_PSEHT Length = 196 Score = 95.8 bits (237), Expect = 9e-19, Method: Composition-based stats. Identities = 35/144 (24%), Positives = 56/144 (38%), Gaps = 18/144 (12%) Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 GY +S + + PW P + +R P ++ L+N Y G + H Sbjct: 68 YGYSHSKLIVE---PW---PDVLLAMRKRLERHLNQP---LNSLLVNYYRDGNDTMGWHS 118 Query: 133 DKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQ 190 D + P IV +SLG + + ++ + L L G ++ G S+ Y H I Sbjct: 119 DDEAELGHQPTIVCISLGAERVLKLKHKA-SNKVTNLKLHSGSCLIMSGNSQRDYQHAIA 177 Query: 191 PLKAGFHPLTIDCRYNLTFRQAGK 214 HP R +LTFR + Sbjct: 178 KQTTLAHP-----RISLTFRLIKR 196 >UniRef50_A4S4F4 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4S4F4_OSTLU Length = 343 Score = 95.8 bits (237), Expect = 9e-19, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 67/206 (32%), Gaps = 30/206 (14%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G ++ F E+ + + G +A H G + Sbjct: 115 EGLTLIENFVTVDEERALATLAAT------------SGDETRLARRRVKHFG-----YAF 157 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAA-GYPD-FQPDACLINRYAPGAKLSLHQDK 134 Y D K +P+ + +R GY + D +N Y G L+ H D Sbjct: 158 DYGTRDANL-KVVDEIPELAMEVLRRLPRETPGYEGAMRCDQVTVNEYPRGVGLAPHVDT 216 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 I+S+SL + +F + + + L ++V GESR + H I K Sbjct: 217 HSA-FGDTILSLSLLGGTVMEF--RTSGEAHRAIYLPPRSLLVMHGESRYRWQHYIPHRK 273 Query: 194 AGF-----HPLTI-DCRYNLTFRQAG 213 P D R + TFR+ Sbjct: 274 FDTLEGEAAPTPRDDVRLSYTFRERR 299 >UniRef50_B2AYU6 Predicted CDS Pa_1_12280 (Fragment) n=5 Tax=Leotiomyceta RepID=B2AYU6_PODAN Length = 320 Score = 95.4 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 39/134 (29%), Positives = 56/134 (41%), Gaps = 16/134 (11%) Query: 91 AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSL 148 +PQ L + A + + CL+N YA G +S H D + R P I S SL Sbjct: 132 PIPQCLDALRKSTEAATNC---KFNFCLVNYYATGSDSISFHSDDERFLGREPAIASFSL 188 Query: 149 GLPAIFQFG---------GLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHP 198 G F G +LLL GD+++ G+++ + H I AG Sbjct: 189 GAARDFLMKHKPVPPPPDGQTTVFKQLKLLLASGDMILMKGKTQANWLHSIPKR-AGKSS 247 Query: 199 LTIDCRYNLTFRQA 212 D R N+TFR+A Sbjct: 248 QYGDGRINITFRRA 261 >UniRef50_A6EJU4 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter sp. BAL39 RepID=A6EJU4_9SPHI Length = 202 Score = 95.4 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 56/145 (38%), Gaps = 14/145 (9%) Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AK 127 W + Y Y+ + W + ++ AG + ++ L+N Y G Sbjct: 66 WYADEETYDYTSLRRAAPNIWTP---ELLMIREKVQAIAGL---RFNSVLLNYYRDGNDS 119 Query: 128 LSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLF 185 ++ H D ++ P++ SVS G F + + LE G +++ G+ + + Sbjct: 120 VAWHSDNEKALGTHPLIASVSFGQVRCFDIRRKSDHSDKYSIRLESGALMIMKGDLQQHW 179 Query: 186 YHGIQPLKAGFHPLTIDCRYNLTFR 210 H + ++ R NLTFR Sbjct: 180 EHRVAK-----STKSMRARVNLTFR 199 >UniRef50_C4WWX2 ACYPI004109 protein n=2 Tax=Acyrthosiphon pisum RepID=C4WWX2_ACYPI Length = 220 Score = 95.0 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 38/138 (27%), Positives = 58/138 (42%), Gaps = 19/138 (13%) Query: 83 PQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPDLR 140 +PW PQ ++L ++ T G + L+NRY G + H+D + E D Sbjct: 93 VVPAQPW---PQPLYDLKRKICTTRGVD---YNFVLVNRYKNGEDHMGEHRDDEVELDKT 146 Query: 141 APIVSVSLGLPAIFQFGGLK-----RNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKA 194 PI S+SLG F F R L +L L +G +++ + +YH I K Sbjct: 147 VPIASISLGQTRKFVFKHTDVRKKIRQVELVKLDLHNGSLLMMNQPTNEYWYHSIPKEK- 205 Query: 195 GFHPLTIDCRYNLTFRQA 212 + R N TFR+ Sbjct: 206 ----NAKNIRLNFTFRKI 219 >UniRef50_A6EGN8 Alkylated DNA repair protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EGN8_9SPHI Length = 197 Score = 94.6 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 39/141 (27%), Positives = 54/141 (38%), Gaps = 15/141 (10%) Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLH 131 Y YS I PW + + Q +ACL+N Y G + H Sbjct: 65 GVSYSYSGIT-MNALPWTP---ELAEIRSAIQQKTAH---QFNACLLNFYRDGSDSMGWH 117 Query: 132 QDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES-RLFYHGI 189 +D + P I SVS G FQF P+ L L G +++ GE+ L+ H + Sbjct: 118 RDNERNLGPYPTIASVSFGAHRTFQFRRYVEKLPVVSLDLTSGSLLLMKGETQHLWEHRL 177 Query: 190 QPLKAGFHPLTIDCRYNLTFR 210 + I R NLTFR Sbjct: 178 PKTT-----MPIGPRINLTFR 193 >UniRef50_UPI00006CCD66 hypothetical protein TTHERM_00483520 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CCD66 Length = 199 Score = 94.6 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 38/146 (26%), Positives = 58/146 (39%), Gaps = 9/146 (6%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSL 130 + Y YS N +P+ N CQR PD +IN Y PG ++ Sbjct: 55 HYGYKYDYSIKSIDKNMFLGVLPKYAINFCQRLIDDKVIKVM-PDQMIINEYLPGQGINP 113 Query: 131 HQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGI 189 H DK + I SVSLG I + + L L+ +++ ++R + H I Sbjct: 114 HIDKTDI-FGETIFSVSLGSGCIMKL---TYGETEIDLYLKRRSILILEDKARYLFKHSI 169 Query: 190 QPLKA---GFHPLTIDCRYNLTFRQA 212 K+ + R +LTFR+A Sbjct: 170 PSRKSDKIDGKTIQRSTRVSLTFRKA 195 >UniRef50_B2W6U5 Oxidoreductase domain containing protein n=2 Tax=Pleosporineae RepID=B2W6U5_PYRTR Length = 366 Score = 94.6 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 52/139 (37%), Gaps = 14/139 (10%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 L W T + Y + K P+ L +P +P++ ++ Y+ Sbjct: 194 KLRWLTLGEQYDWPTRSY--AKHATPFPEDLSTLVTGL-----FPHIRPESGVVLMYSAK 246 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRN----DPLK--RLLLEHGDVVVWG 179 + +H+D E RA + S S+G IF + + D + + + GDVV Sbjct: 247 DFMPVHRDVSEQCQRA-LASFSVGCDGIFLMARGEDDGEGEDAPRSVAIRVHSGDVVHLT 305 Query: 180 GESRLFYHGIQPLKAGFHP 198 G +R +H + P Sbjct: 306 GNARWAWHAMARSIPSTCP 324 >UniRef50_Q2MF23 TobX protein n=2 Tax=Actinomycetales RepID=Q2MF23_STRSD Length = 219 Score = 93.9 bits (232), Expect = 3e-18, Method: Composition-based stats. Identities = 48/202 (23%), Positives = 71/202 (35%), Gaps = 31/202 (15%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G V A E+ + + R VA H G+ + Sbjct: 41 PEGLVHQPDLLDEAEERSLLTAVEAMPLHEVRM-------HGQVARRTVRHFGFDYGYES 93 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 + +P DP +P+ F L R A AG LI RY PGA + H+D Sbjct: 94 WRLTPTDP--------LPEEFWWLRDRCAHLAGLRPESLAQTLIARYPPGATIGWHRD-- 143 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLL---LEHGDVVVWGGESRLFY-HGIQP 191 P +V VSL + +F +R +R+ L V G +R + H I P Sbjct: 144 APMFGPSVVGVSLLSSCLMRF--QRRVGEERRVYELELAPRSAYVLSGAARSAWQHSIPP 201 Query: 192 LKAGFHPLTIDCRYNLTFRQAG 213 + + RY++TFR Sbjct: 202 V--------PELRYSITFRTLR 215 >UniRef50_Q609W8 2OG-Fe(II) oxygenase family domain protein n=1 Tax=Methylococcus capsulatus RepID=Q609W8_METCA Length = 141 Score = 93.9 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 58/151 (38%), Gaps = 17/151 (11%) Query: 69 WTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W Y YS + PW + +L R +G+ +A L NRY G Sbjct: 3 WYGDPGATYRYSGVS-HQPSPWHEV---LADLRTRIEAFSGH---VFNAVLCNRYRSGRD 55 Query: 127 KLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-L 184 + H D + P I S+SLG +F+ + + L GD+++ GGE + Sbjct: 56 SMGWHADDEPELGERPFIASLSLGAERLFRIRH-RGTGRTLDVPLRDGDLLLMGGELQSH 114 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 + H + R NLTFR+ + Sbjct: 115 WRHCVPRTAR-----PCGERINLTFRRVVPR 140 >UniRef50_B5JS77 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JS77_9GAMM Length = 199 Score = 93.5 bits (231), Expect = 4e-18, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 54/144 (37%), Gaps = 15/144 (10%) Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSL 130 Y YS P A P + L ++ P + L N Y G + Sbjct: 66 DGVNYTYS----GDTAPRQAWPIALLRLRRQLEVFCQVP---FNGVLANYYRDGDDSMGW 118 Query: 131 HQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHG 188 H D + P I S+SLG P F F L + L+HG +++ GE++ + H Sbjct: 119 HSDDERSLGPRPCIASISLGAPRDFAFRPLNGGKQRHNICLDHGSLLIMQGETQKHWQHA 178 Query: 189 IQPLKAGFHPLTIDCRYNLTFRQA 212 + + P R NLTFR Sbjct: 179 LPRRRRVNQP-----RLNLTFRHI 197 >UniRef50_Q7D1B7 Putative uncharacterized protein n=1 Tax=Agrobacterium tumefaciens str. C58 RepID=Q7D1B7_AGRT5 Length = 195 Score = 93.1 bits (230), Expect = 6e-18, Method: Composition-based stats. Identities = 45/202 (22%), Positives = 63/202 (31%), Gaps = 26/202 (12%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 L + F + E + D D S + H G+ Sbjct: 10 LPHDIMYFDGFLSSEDEAFVADRLDAGEWS-------------TELKRRVQHFGYR---- 52 Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 Y Y + +P +R GY PD + N Y G +S H D Sbjct: 53 -YDYKVRAVTPDAYLGPLPPWLGLFAERLVAD-GYCRTVPDQVIANEYLLGQGISAHVDC 110 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 P IVS+SL F L+ P R +L V+ G SR + H I K Sbjct: 111 V-PCFDDTIVSISLLSACEMVFRDLR--GPGIRSVLHPRSGVLLRGSSRYDWTHEIPARK 167 Query: 194 A---GFHPLTIDCRYNLTFRQA 212 + R +LTFR+ Sbjct: 168 SDIVNGVKTARSRRISLTFRKV 189 >UniRef50_C7RA32 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RA32_KANKD Length = 207 Score = 92.7 bits (229), Expect = 7e-18, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 60/149 (40%), Gaps = 18/149 (12%) Query: 69 WTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W Y YS + PW L +R ++ L N Y G Sbjct: 67 WYGDKGASYTYSGV-IHHPIPWSE---QLLALKKRIEQVC---QTSFNSALFNLYRDGRD 119 Query: 127 KLSLHQDKDEPDLRA-PIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 ++ H D DEP+L A PI+ S+SLG P Q K D +L L G ++V G+++ Sbjct: 120 SVAWHSD-DEPELGAKPIIASLSLGAPRSLQLKHKKHKDLRHKLTLTSGSLLVMRGDTQR 178 Query: 185 FY-HGIQPLKAGFHPLTIDCRYNLTFRQA 212 + H + P + R N+TFR Sbjct: 179 CWQHQVPK-----EPAITEPRINITFRNI 202 >UniRef50_A2R3V2 Similarity to human sequence 203 from patent WO0129221-A/203 n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2R3V2_ASPNC Length = 356 Score = 92.7 bits (229), Expect = 8e-18, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 55/125 (44%), Gaps = 5/125 (4%) Query: 91 AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSLHQDKDEPDLRAP-IVSVSL 148 +P ++ ++A A + + L+N YA G +S H D + + P I S+SL Sbjct: 200 PIPPCL-DILRQAVEKATDDGTRYNFVLVNYYATGDDSISYHSDDERFLGQNPTIASLSL 258 Query: 149 GLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNL 207 G F K + L+ GD+++ GE++ + H + K R N+ Sbjct: 259 GAGRDFLLKH-KPAAKPLKFPLKSGDMLIMRGETQSNWLHSVPKRKGLQGSAGALGRINI 317 Query: 208 TFRQA 212 TFR+A Sbjct: 318 TFRRA 322 >UniRef50_C1FJB5 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FJB5_9CHLO Length = 418 Score = 92.3 bits (228), Expect = 9e-18, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 64/212 (30%), Gaps = 34/212 (16%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G ++ F E+ + D + + Sbjct: 196 PGVTLITDFVTEEEEREMLACVDSDE-----------------RWQGLAKRRVLHYGYAF 238 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---YPDFQPDACLINRYAPGAKLSLHQD 133 Y D + MP L RAA+ D +N Y G ++ H D Sbjct: 239 DYGTRDARDKT--SPMPAFVAGLLGRAASCGAPGACESVHCDQLTVNEYVAGVGIAPHVD 296 Query: 134 KDEPDLRAPIVSVSLGLPAIFQF----GGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHG 188 I+S+SL A+ +F GG K + + + ++V GE+R + H Sbjct: 297 THSA-FGPTILSLSLAGRAVMEFRLHEGGEKEPRERRAISMPPRSLLVLHGEARYRWLHY 355 Query: 189 IQPLKAGF------HPLTIDCRYNLTFRQAGK 214 I K + R + TFR+ + Sbjct: 356 IPHRKRDAIVGEDECEAREERRVSFTFRRRRE 387 >UniRef50_UPI0001927839 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001927839 Length = 235 Score = 92.3 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 35/147 (23%), Positives = 54/147 (36%), Gaps = 17/147 (11%) Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSL 130 Y +S + + W Q + ++ + L+NRY G + Sbjct: 94 QGLSYTFSGVTVF-AQSWLPFMQKLKEIAEQLTMT------SFNFVLVNRYDNGNDYMGF 146 Query: 131 HQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPL----KRLLLEHGDVVVWGGESR-L 184 HQD + + D API S S G F F K L HG +++ + L Sbjct: 147 HQDNEKDLDAHAPIASFSFGQDRDFIFKYKKNKSNKSYENVTFHLGHGSLLIMHPPTNDL 206 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 +YH + P + R NLTFR+ Sbjct: 207 WYHSLPKRSVKTCP---NPRINLTFRK 230 >UniRef50_Q00Z84 2-Oxoglutarate-and iron-dependent dioxygenase-related proteins (ISS) n=1 Tax=Ostreococcus tauri RepID=Q00Z84_OSTTA Length = 232 Score = 92.3 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 72/204 (35%), Gaps = 29/204 (14%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G ++ F E R + +A +S G +A H G + Sbjct: 12 PGLTLIENFVSVDEE---RALVTLARES---------GEETRLARRRVKHFG-----YAF 54 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRA-ATAAGYPD-FQPDACLINRYAPGAKLSLHQDK 134 Y D N+ A+P + +R + GY + D +N Y G L+ H D Sbjct: 55 DYGTRD--ANERCEAIPSLALEILKRLRSDMIGYQSAIRCDQVTVNEYPRGTGLAPHVDT 112 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 I+S++L A+ +F + + L L ++V ++R + H I K Sbjct: 113 HSA-FGETILSLTLEGCAVMEF--RTSAEENRALFLPRRSMLVLSADARYRWQHYIPHRK 169 Query: 194 ----AGFHPLTIDCRYNLTFRQAG 213 G D R + TFR+ Sbjct: 170 FDNVEGETIARDDVRLSYTFRERR 193 >UniRef50_A0D9E2 Chromosome undetermined scaffold_42, whole genome shotgun sequence n=3 Tax=Oligohymenophorea RepID=A0D9E2_PARTE Length = 636 Score = 92.0 bits (227), Expect = 1e-17, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 63/202 (31%), Gaps = 27/202 (13%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G ++ F E+ I D+ D S + + Sbjct: 155 PGLYLIHDFITPEYEKYIMDLIDKQEWS------------------KLKQRRVQHYGYEF 196 Query: 77 LYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 +Y ++P +P ++ + + P + + IN Y PG + H D Sbjct: 197 IYGDNTVNVDQPAEKKIPAFLEDVRAKVSDLVK-PQAEINQLTINEYLPGMGIPPHFDVH 255 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKA 194 P VS+SL + F K + + L L + GE R ++H I K Sbjct: 256 -PPFHEKFVSISLLSGLVMSFKSYKGEE--QHLYLPPRSCAFFTGEVRFAWFHSIASRKI 312 Query: 195 GFHPLT---IDCRYNLTFRQAG 213 R +LTFR Sbjct: 313 DKIEGETHFRSRRLSLTFRTIR 334 >UniRef50_B8CDM4 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8CDM4_THAPS Length = 222 Score = 91.6 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 37/149 (24%), Positives = 54/149 (36%), Gaps = 16/149 (10%) Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA--------CLINRYAP 124 GY Y +T +P L + + D +A CL+N Y Sbjct: 76 GTGYRYRDAPGETIIGFPPTVYKLKLLAEEWYNSKQTNDGTANAEAKVEFNVCLLNYYQD 135 Query: 125 G-AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR-LLLEHGDVVVWGGE- 181 G ++ H D++E PI S+SLG F + L LE+G +VV Sbjct: 136 GTQRIGWHSDREELGRTTPIASISLGATRSFLIRSQTDGVHDRASLDLENGSIVVMENVC 195 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 R + H + + R NLTFR Sbjct: 196 QREYVHSVPK-----EGEVVGGRINLTFR 219 >UniRef50_Q7MF65 Alkylated DNA repair protein n=15 Tax=Vibrionaceae RepID=Q7MF65_VIBVY Length = 203 Score = 91.6 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 47/215 (21%), Positives = 76/215 (35%), Gaps = 23/215 (10%) Query: 2 LDLFA-DAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 LF D+ W + F + + + P+ Q + Sbjct: 3 SSLFLFDSPDWLTITDGQLLWWPTFLSQDQAETY--FTQLKHELPWEQKAIQMFGRQVLQ 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 W Y YS + T +P P P + +L R A+G+ ++ L N Sbjct: 61 PRLQA---WCGDA-AYTYSGL---TMQPLPWTP-TLLDLKTRCENASGH---IFNSVLAN 109 Query: 121 RYAPG-AKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 Y G + HQD + R P++ SV+LG F L N+ ++ L G +++ Sbjct: 110 LYRDGQDSMGWHQDDEPELGRNPVIASVNLGESRRFVLQHLITNEKIE-FELTSGSLLIM 168 Query: 179 GGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G + + H + T R NLTFRQ Sbjct: 169 AGSTQHYWRHCVPKTAK-----TKSERINLTFRQI 198 >UniRef50_B8HW11 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=B8HW11_CYAP4 Length = 217 Score = 91.2 bits (225), Expect = 2e-17, Method: Composition-based stats. Identities = 40/147 (27%), Positives = 56/147 (38%), Gaps = 17/147 (11%) Query: 69 WTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W + Y YS I+ +PW A + + Q T AG ++ L+ Y G Sbjct: 79 WYGDAGKSYTYSGIN-MQPQPWTA---ALLTIKQEIETIAGV---IFNSVLLTLYRDGQD 131 Query: 127 KLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGG-ESR 183 + H D + PI+ SVS G FQ R D + L HG ++ G Sbjct: 132 SMGWHSDDEPELGTNPIIASVSFGATRKFQLRHKSRKDLDKVVINLSHGSFLLMAGITQH 191 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 + H I +P R NLTFR Sbjct: 192 HWQHQIPKTTKVTNP-----RINLTFR 213 >UniRef50_B8KWS1 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KWS1_9GAMM Length = 209 Score = 91.2 bits (225), Expect = 2e-17, Method: Composition-based stats. Identities = 35/146 (23%), Positives = 57/146 (39%), Gaps = 17/146 (11%) Query: 69 WTTHR-QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W Y YS + +PW +P + AG + + L+N+Y G Sbjct: 73 WYGDGGASYTYSGLK-LRPRPWV-VP--LMEIKTACEAVAGA---RFNGVLLNQYRDGND 125 Query: 127 KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL- 184 + H D + E I SVS G F + + ++ L +G ++V G+++ Sbjct: 126 AMGWHSDNETELGTNPTIASVSFGASRRFDLRHKRTKETIRS-WLPNGSILVMSGQTQTD 184 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFR 210 + H + K D R NLTFR Sbjct: 185 WVHQVPRTKK-----VGDARINLTFR 205 >UniRef50_A9EAY0 2OG-Fe(II) oxygenase n=2 Tax=Flavobacteriales RepID=A9EAY0_9FLAO Length = 200 Score = 90.8 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 55/143 (38%), Gaps = 15/143 (10%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLS 129 T+++ Y YS I T P + +L + L+N Y G Sbjct: 67 TNQKSYSYSNIK-MTPLPLTE---TLKSLKNKVDIVC---QTDFTTLLLNYYRDGKDSNG 119 Query: 130 LHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYH 187 H D ++ + PI+ S+S G F ++ L+HG +++ GE++ + H Sbjct: 120 WHADNEKELGKNPIIASLSFGQERFFHLKHRTDKTLKHKIALQHGSLLLMKGETQHKWLH 179 Query: 188 GIQPLKAGFHPLTIDCRYNLTFR 210 I + R N+TFR Sbjct: 180 QIPKTAK-----QLHGRINITFR 197 >UniRef50_Q21J14 DNA-N1-methyladenine dioxygenase n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21J14_SACD2 Length = 204 Score = 90.4 bits (223), Expect = 3e-17, Method: Composition-based stats. Identities = 32/129 (24%), Positives = 52/129 (40%), Gaps = 15/129 (11%) Query: 85 TNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAPI 143 PW + +L + ++ L+N Y G + H D ++ P+ Sbjct: 84 YPTPWS---KELISLKDLIENKT---ESSYNSVLVNLYRNGADGVGWHADDEKELGGCPV 137 Query: 144 V-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES-RLFYHGIQPLKAGFHPLTI 201 + S+SLG F KR +L L +GD++V G++ R + H + P Sbjct: 138 IASLSLGASRSFSLK-PKRGGKSIKLELNNGDLIVMKGDTQRNWLHAVAKTSKKIGP--- 193 Query: 202 DCRYNLTFR 210 R NLTFR Sbjct: 194 --RINLTFR 200 >UniRef50_B0T8K7 2OG-Fe(II) oxygenase n=3 Tax=Alphaproteobacteria RepID=B0T8K7_CAUSK Length = 215 Score = 90.4 bits (223), Expect = 4e-17, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 66/215 (30%), Gaps = 28/215 (13%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 LF + PL G + A EQ + P+ + Sbjct: 14 SLFDLPIAPRTPLPEGFRHQTKLITPAEEQALVAQFTDLDFQPYEH-------KGYLGHR 66 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 GW G L +P +P L + A G+ L+ Y Sbjct: 67 RVAGFGWRRGPDGALVETGEP--------LPDFLAPLLDKVAAFTGFARNTFAHALVTEY 118 Query: 123 APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGGE 181 APGA + H+D+ I VSL P F+ + E + G Sbjct: 119 APGAGIGWHRDR---PPAIAIAGVSLLSPCTFRLRRRSGQAWERASIEAEPRSAYLMSGP 175 Query: 182 SR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 SR + H I P+ A RY++TFR + Sbjct: 176 SRSQWQHSIPPVDA--------LRYSVTFRTVPTR 202 >UniRef50_D2XAQ5 Alkylated DNA repair protein n=1 Tax=Marseillevirus RepID=D2XAQ5_9VIRU Length = 198 Score = 90.0 bits (222), Expect = 5e-17, Method: Composition-based stats. Identities = 51/200 (25%), Positives = 80/200 (40%), Gaps = 20/200 (10%) Query: 24 RFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDP 83 RF F + ++L R + D+ P + G + + G G Y ++ + Sbjct: 10 RFLFGSRKKLQRQLADIEYLPPEDTAIKMHGKVIPIPRLQTG-FG-KHESLSYSFTGVKI 67 Query: 84 QTNKPWPAMPQSFHNLCQRAAT------AAGYPDFQPDACLINRYAPGA-KLSLHQDKD- 135 K W P L + G P+ L+N+Y G + H DK+ Sbjct: 68 -PAKIW---PPYIEKLSLKIHAHLVEQGVMGQDTPPPNYVLVNKYLNGDHYIGWHSDKER 123 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHGIQPLKA 194 + + PI+SVSLG F +K + K + L GDV+V G +++ H + K Sbjct: 124 DLMMGYPIISVSLGARRDFCLRLIKNHKHKKTISLGSGDVLVMLPGMQQVWQHCLPKRKG 183 Query: 195 GFHPLTIDCRYNLTFRQAGK 214 P RYNLTFR G+ Sbjct: 184 LDEP-----RYNLTFRWIGE 198 >UniRef50_Q54BK8 2-oxoglutarate and Fe-dependent oxygenase family protein n=1 Tax=Dictyostelium discoideum RepID=Q54BK8_DICDI Length = 247 Score = 90.0 bits (222), Expect = 5e-17, Method: Composition-based stats. Identities = 37/160 (23%), Positives = 53/160 (33%), Gaps = 25/160 (15%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSL 130 + Y Y ++ P PQ +LC DF P ++N Y G +S Sbjct: 50 HYGYKYNYKSRSLKSEDIAPPFPQWASDLCCHLMKEGLINDF-PQQLIVNEYKDGQGISA 108 Query: 131 HQDKDEPDLRAPIVSVSLGLPAIFQFGG-------------LKRNDPLKRL--LLEHGDV 175 H D I S+SLG F + ++ L Sbjct: 109 HID--SKIFDNIIFSISLGSTCKMIFKKSIQPTTTTKTTTTTSEKAEVLKVEKQLAPRAF 166 Query: 176 VVWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 ++ E+R + H I LK G H R +LTFR K Sbjct: 167 LLIKDEARFNWTHEIPKLKKGQH------RISLTFRFVSK 200 >UniRef50_B2AC29 Predicted CDS Pa_2_14240 n=1 Tax=Podospora anserina RepID=B2AC29_PODAN Length = 298 Score = 89.6 bits (221), Expect = 7e-17, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 73/242 (30%), Gaps = 60/242 (24%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G ++ F A EQ + S ++P ++ + Sbjct: 39 GLTLVHEFISPAEEQEMISAFHAISP------LSPADSKRRISQ---------HFGHHFD 83 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQR--AATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y+ +K +P N R T +PD + Y PGA + H D Sbjct: 84 YTTFGIDESK-HSPVPAYITNFLDRLPVDTDGKEAGRKPDQFTVQYYPPGAGIPPHVDTH 142 Query: 136 EPDLRAPIVSVSLGL--PAIFQFGGLKRNDPLK--------------------------- 166 + S+S G P IF+ G L+ Sbjct: 143 S-MFGEALYSLSFGSGVPMIFRMSGENEARKLRLPKRSLQESSDGNVNGKVGGEILDKAE 201 Query: 167 --------RLLLEHGDVVVWGGESRLFY-HGIQPLKAGFH---PLTIDCRYNLTFRQAGK 214 L+L ++V G SR Y HGI+P K + + RY++T R + Sbjct: 202 GVVVHPAWELMLPARSLLVMRGASRYGYTHGIRPRKTDAVDGITVKREGRYSITMRSVRR 261 Query: 215 KE 216 E Sbjct: 262 GE 263 >UniRef50_A4BAI0 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BAI0_9GAMM Length = 194 Score = 89.6 bits (221), Expect = 7e-17, Method: Composition-based stats. Identities = 35/150 (23%), Positives = 54/150 (36%), Gaps = 17/150 (11%) Query: 69 WTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W Y YS + + P+ F +L R + DF P+ L+N Y G Sbjct: 58 WLGDPGLRYGYSGQEYVAS----GWPEGFKSLLDRFQSQ---HDFAPNGALMNYYRSGAD 110 Query: 127 KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRL 184 + H D + E L I +SLG F F K + +L L G +++ G Sbjct: 111 TMGWHADDEPELGLNPTIAILSLGGARDFHFRQHKDHSQKLKLRLPEGSLLLMSGAVQHH 170 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + H + R + TFR+ Sbjct: 171 WQHALPKRAQAR------PRISCTFRRIVA 194 >UniRef50_A3M1I5 DNA repair system n=12 Tax=Acinetobacter RepID=A3M1I5_ACIBT Length = 203 Score = 89.3 bits (220), Expect = 8e-17, Method: Composition-based stats. Identities = 48/214 (22%), Positives = 78/214 (36%), Gaps = 21/214 (9%) Query: 2 LDLFADAEPWQEPLA-AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 LDLF+ EP L G V A E + + + +R + Sbjct: 3 LDLFS-PEPCSNLLPYDGEVQDYGCILTAEEAE-QYFHYLYHHLAWRHDEAKLYGKHFIT 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 G +R Y D + PW ++ L Q+ + ++CL N Sbjct: 61 PRKVAWYGDEHYRYKYSGVFRD---SLPWD---KALAQLKQQVEQILSE---KFNSCLAN 111 Query: 121 RYAPG-AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 Y G ++ H D D I S+S G F F ++ + ++ + L+ G ++V Sbjct: 112 LYEDGTQGMAWHSDSDVSLARTTTIASLSFGATRKFSFRHIQTKEKVE-MWLQPGQLIVM 170 Query: 179 GGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 GE+ + + H + P R NLTFRQ Sbjct: 171 RGETQQYWQHRLNRSTKILQP-----RINLTFRQ 199 >UniRef50_C9SM10 DNA repair family protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SM10_VERA1 Length = 287 Score = 89.3 bits (220), Expect = 9e-17, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 53/151 (35%), Gaps = 19/151 (12%) Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEP 137 P P +P L + A A G + CL+N YA G ++ H D + Sbjct: 112 GPDRRYDRIPPRPLPACLDALRRSAEAATGC---AFNVCLVNYYATGADSIAFHSDDERF 168 Query: 138 DLRAP-IVSVSLGLPAIFQFGG---LKRNDPLKR--------LLLEHGDVVVWGGESR-L 184 AP I S SLG F R D L GD+++ G ++ Sbjct: 169 LGPAPAIASFSLGARRDFLLKHKPCPPRGDAPSPRPALGTLRFPLGSGDMLLMRGATQAN 228 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 + H + R N+TFR+A K Sbjct: 229 WLHSVPKRSGRHAED--GGRINITFRRAVVK 257 >UniRef50_A4AA20 Putative alkylated DNA repair protein n=1 Tax=Congregibacter litoralis KT71 RepID=A4AA20_9GAMM Length = 206 Score = 88.9 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 79/218 (36%), Gaps = 27/218 (12%) Query: 1 MLDLFADAEPWQEPLAAGAVIL---RRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTM 57 M +LF +++P + L G ++L + E ++A + Q+ Sbjct: 1 MTELFPNSDPEEIDLPGGELLLYRAADLGADPQELFENLERELAWREEPIQLFGKRYLQP 60 Query: 58 SVAMTNCGHLGWTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA 116 + L W Y YS I PW L +R + D + ++ Sbjct: 61 RL-------LAWYADAGVSYKYSGIQ-HDPLPWTP---QLAVLRERVEALS---DARFNS 106 Query: 117 CLINRYAPG-AKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRND-PLKRLLLEHG 173 L N Y + LH D + P++ S+SLG +F+ R D RL L G Sbjct: 107 VLANLYRHHRDSMGLHADDERELGAQPVIASLSLGEERMFRLKHRHRKDLKPIRLPLASG 166 Query: 174 DVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 +++ G ++ + H + R NLTFR Sbjct: 167 MLLIMRGATQENWRHEVPK-----QSRPCGPRINLTFR 199 >UniRef50_D2UYR0 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2UYR0_NAEGR Length = 294 Score = 88.9 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 21/114 (18%) Query: 121 RYAPGAKLSLHQDK-------DEPDLRAPIVSVSLGLPAIFQF---------GGLKRNDP 164 Y KL H+D+ ++ + +P+VS+SLG +IF + G L+ Sbjct: 167 YYTNKGKLGWHRDRISGLTPEEQHLIVSPVVSMSLGNDSIFSYKLNTINETTGKLEYGTE 226 Query: 165 LKRLLLEHGDVVVWGGESRLFYHGIQPLKA--GFHPLTID---CRYNLTFRQAG 213 L L+ GD++++G R+FYH ++ + H L +D R N+T R+ Sbjct: 227 AIDLQLKSGDILIFGATQRMFYHCVKRIIPSTNHHKLDMDGLSGRINITLREGT 280 >UniRef50_B4RAN1 DNA alkylation damage repair protein AlkB n=1 Tax=Phenylobacterium zucineum HLK1 RepID=B4RAN1_PHEZH Length = 238 Score = 88.9 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 47/201 (23%), Positives = 67/201 (33%), Gaps = 28/201 (13%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G F A E D+ D S+ PF G GW G Sbjct: 64 EGLAYRPDFLTAAEEA---DLLDRLSRLPFEPFQFRGYE----GRRRVVSFGWRYDFNGP 116 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 +P MP + RAA AG P LIN Y GA + H+D+ Sbjct: 117 GLVEAEP--------MPGWLRPVRDRAADFAGLPPEAFGHVLINEYREGAPIGWHKDR-- 166 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGGESRL-FYHGIQPLKA 194 P + +SLG P + +F + L + + G +R + H + KA Sbjct: 167 PVFEK-VAGISLGAPCVMRFRRRAGERFERLNVPLAPRSIYLLDGPARTEWEHSLPEAKA 225 Query: 195 GFHPLTIDCRYNLTFRQAGKK 215 RY++TFR + Sbjct: 226 --------LRYSITFRNLRAR 238 >UniRef50_C5BTC2 Putative alkylated DNA repair protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTC2_TERTT Length = 211 Score = 88.5 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 61/144 (42%), Gaps = 16/144 (11%) Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSL 130 Y Y+ + +PW +P+ L Q A + +A L Y G ++ Sbjct: 75 DGLQYRYAD-NLMHTQPW--LPE-LLQLRQIINNATQC---EFNAVLATLYRHGNDHVTW 127 Query: 131 HQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLFYHG 188 H D + AP++ S+SLG FQF + + + + L HGD++V + H Sbjct: 128 HSDDERELGYAPVIASLSLGATRCFQFRHKENDTKGE-ISLHHGDLIVMEPAFQHYWEHQ 186 Query: 189 IQPLKAGFHPLTIDCRYNLTFRQA 212 + P P ++ R NLTFR+ Sbjct: 187 VPP-----QPDVLEPRINLTFRRV 205 >UniRef50_Q2UNX0 Predicted protein n=6 Tax=Trichocomaceae RepID=Q2UNX0_ASPOR Length = 335 Score = 88.1 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 40/146 (27%), Positives = 54/146 (36%), Gaps = 20/146 (13%) Query: 82 DPQTNKPWPA----------MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSL 130 D +TN P P +P +L Q T P + LIN YA +S Sbjct: 159 DAKTNTPIPPRTKYKHTPRPIPPCLTHLLQTIQTTTNTPPDYYNFILINYYATNTDSISY 218 Query: 131 HQDKDEPDLRAP-IVSVSLGLPAIFQFGGL--KRNDPLKRLLLEHGDVVVWGGESR-LFY 186 H D + P I S+SLG F + L GD+VV GE++ + Sbjct: 219 HSDDERFLGPNPSIASLSLGAKRDFLLKHKPGVEAGKPLKFPLASGDMVVMRGETQGNWL 278 Query: 187 HGIQPLKAGFHPLTIDCRYNLTFRQA 212 H I R N+TFR+A Sbjct: 279 HSIPKRAG-----EGGGRINVTFRRA 299 >UniRef50_B7RVL5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=2 Tax=unclassified Gammaproteobacteria (miscellaneous) RepID=B7RVL5_9GAMM Length = 218 Score = 88.1 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 36/153 (23%), Positives = 57/153 (37%), Gaps = 17/153 (11%) Query: 69 WTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W + Y YS Q PW + L T ++ L+N Y G Sbjct: 68 WYGDPEAQYAYSGKQYQ-PIPWTPL---LTTLKASVETLCAS---SFNSVLLNFYRDGAD 120 Query: 127 KLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGG-ESR 183 + LH D + P I S+SLG F +R + ++L + V+ G + Sbjct: 121 SMGLHADDEPELGTEPCIASLSLGEERTLYFKHKQRKELKPLNVVLPNASVLRMQGVTQQ 180 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + HGI+ + R NLTFR+ ++ Sbjct: 181 YWKHGIRKISR-----PCGPRVNLTFRRIYPRK 208 >UniRef50_A6SM11 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6SM11_BOTFB Length = 216 Score = 88.1 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 36/138 (26%), Positives = 53/138 (38%), Gaps = 18/138 (13%) Query: 88 PWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVS 145 P +P+ +L A G + + CL+N YA G +S H D + P I S Sbjct: 49 PPRPIPKCLDDLRLSTEMATGC---KFNFCLVNYYASGSDSISYHSDDERFLGPLPAIAS 105 Query: 146 VSLGLPAIFQFGG----LKRNDP------LKRLLLEHGDVVVWGGESR-LFYHGIQPLKA 194 SLG F N P +L L GD+++ G ++ + H I Sbjct: 106 YSLGARRDFLMKHKPIPPNDNAPLPPETKPIKLPLASGDMILMRGRTQANWLHSIPKRTG 165 Query: 195 GFHPLTIDCRYNLTFRQA 212 R N+TFR+A Sbjct: 166 KNA--DDGGRINITFRRA 181 >UniRef50_A5GWW3 Alkylated DNA repair protein n=2 Tax=Synechococcus RepID=A5GWW3_SYNR3 Length = 204 Score = 88.1 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 43/153 (28%), Positives = 64/153 (41%), Gaps = 21/153 (13%) Query: 69 WTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W GY YS +D PW Q + ++ ++ L+N Y G Sbjct: 65 WMADPGCGYRYSGLDNVVE-PWSPTAQRIREQLNELS------GWRFNSLLLNLYRDGRD 117 Query: 127 KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQF---GGLKRNDPLKRLLLEHGDVVVWGGES 182 + H D + E D API S+SLG+ F+F G + ND L L HG +++ + Sbjct: 118 AMGFHADDEPELDPTAPIASLSLGVSRTFRFKPKKGHQGND--FDLELGHGALLLMDPPT 175 Query: 183 RL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 +L + HG+ CR NLTFR + Sbjct: 176 QLHWLHGLPKRLR-----VNQCRLNLTFRVVQQ 203 >UniRef50_A1ZXT1 Alkylated DNA repair protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZXT1_9SPHI Length = 189 Score = 88.1 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 35/142 (24%), Positives = 57/142 (40%), Gaps = 15/142 (10%) Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 + Y YS + +PW + + A+ Q +A ++ Y G +++ H Sbjct: 58 KSYNYSGL-VFNPEPWTDFLLELKTVAENLASV------QFNALVLQYYRDGNDRVNWHS 110 Query: 133 DKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQ 190 D D P IVS+S G F +D + L GD+V+ G+ + Y H + Sbjct: 111 DDDSCVGTNPVIVSMSFGESRDFWVRHKTHHDDRHKFTLHSGDIVIMQGDMQHVYVHKVP 170 Query: 191 PLKAGFHPLTIDCRYNLTFRQA 212 K + R NLTFR+ Sbjct: 171 IEKDKT-----EARLNLTFRKV 187 >UniRef50_A3D131 DNA-N1-methyladenine dioxygenase n=15 Tax=Shewanella RepID=A3D131_SHEB5 Length = 246 Score = 87.7 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 74/205 (36%), Gaps = 23/205 (11%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 E L ++R + + + + + + R + G ++ W Sbjct: 55 AEQLTPPITLVRGYLNAEQQAAL--MKEAQTYPLSRPEIQVFGQFHAIPRQQV----WFG 108 Query: 72 H-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLS 129 YLYS + PW P+ H L ++ A G + L+NRYA G + Sbjct: 109 DSGCDYLYSGL-FIRALPW---PKYAHKLREKLARDYGLAS---NGVLVNRYADGKDCMG 161 Query: 130 LHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYH 187 H D + E + I S++LG F K + + L GD+++ + + H Sbjct: 162 AHSDDEPEIAHGSHIASITLGATRDFVLKH-KHSQTKYCISLHSGDLLIMHWPMQNDWLH 220 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQA 212 + P R+N TFRQ Sbjct: 221 SLPKRLKIKEP-----RWNYTFRQL 240 >UniRef50_B4RZB3 Alkylated DNA repair protein n=4 Tax=Proteobacteria RepID=B4RZB3_ALTMD Length = 213 Score = 87.7 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 40/153 (26%), Positives = 62/153 (40%), Gaps = 17/153 (11%) Query: 69 WTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDF--QPDACLINRYAPG 125 W + Y YS + PW + S + R P++ + ++ L N Y G Sbjct: 70 WHGDPECTYTYSNLT-MPPNPWTS---SLALIKARCEALCS-PNYGTKFNSVLANWYRDG 124 Query: 126 -AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 +S H D + E I SV+LG F + + ++ LEHG V++ G ++ Sbjct: 125 QDSMSFHSDNEPELGTNPVIASVTLGEARPFVLKHKETKEKYTQI-LEHGSVLIMAGATQ 183 Query: 184 LFY-HGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 Y HGI I R NLTFR ++ Sbjct: 184 SHYVHGIAKTAK-----PIGGRINLTFRHLIQR 211 >UniRef50_B4VHI5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VHI5_9CYAN Length = 207 Score = 87.7 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 65/208 (31%), Gaps = 23/208 (11%) Query: 8 AEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHL 67 + P G ++ + + + +I D S + Sbjct: 15 KSDFIVPEIPGLNLIHDYINTQEQNQLLEIID--------------QQEWSTQLKR---- 56 Query: 68 GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK 127 + Y Y + +P + L QR P PD +IN Y PG Sbjct: 57 RVQHYGYRYEYQKRTLTSASYLGELPNWANQLGQRLVRDRVTPT-PPDQLIINEYLPGQG 115 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYH 187 ++ H D P I+S+SLG + L + LLL +++ + H Sbjct: 116 ITNHVDCV-PCFGNTIISLSLGSCCVMNLTHLPTQTQIPVLLLPGSLLILQRVARYQWQH 174 Query: 188 GIQPLKAG---FHPLTIDCRYNLTFRQA 212 GI K R +LTFR+ Sbjct: 175 GIPARKNDKYQGREFGRSRRVSLTFREV 202 >UniRef50_A4SB59 Predicted protein (Fragment) n=2 Tax=Ostreococcus RepID=A4SB59_OSTLU Length = 134 Score = 87.7 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 36/146 (24%), Positives = 51/146 (34%), Gaps = 16/146 (10%) Query: 69 WTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA- 126 W Y +DP +P+ L G + L+NRY G Sbjct: 1 WAGDLPYKYSGQTLDPV------PVPEVLRRLQTAVEAKCGA---TFNHILLNRYRDGDD 51 Query: 127 KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRL 184 ++ H D + E A I +VS+G F R + LEHG ++V G Sbjct: 52 SMAFHADDEPELGKNACIAAVSVGHTRKFDVQVKSRAKKKTSIFLEHGSLMVMDGSLQHT 111 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFR 210 YH + P R N+TFR Sbjct: 112 HYHAVPK---NRVPTNGKERINITFR 134 >UniRef50_Q1YTT7 Oxidoreductase, 2OG-Fe(II) oxygenase family protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YTT7_9GAMM Length = 202 Score = 86.6 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 38/139 (27%), Positives = 55/139 (39%), Gaps = 16/139 (11%) Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQD 133 Y YS I Q A P L ++ +G + + L+N Y G + H D Sbjct: 73 SYTYSNIRLQAV----AFPCWIDQLREQIEIQSGE---RFNRALVNYYRDGSDSVDWHAD 125 Query: 134 KDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHGIQP 191 + P+V S+SLG +FQ + L + L HG +++ G G + H I Sbjct: 126 DEAELGFEPLVASLSLGAERVFQLRHNLTKERL-DIALPHGSLLLMGAGIQTYWQHRIAK 184 Query: 192 LKAGFHPLTIDCRYNLTFR 210 K P R N TFR Sbjct: 185 TKKVDKP-----RVNFTFR 198 >UniRef50_Q6NS38 Alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 n=20 Tax=Eumetazoa RepID=ALKB2_HUMAN Length = 261 Score = 86.6 bits (213), Expect = 6e-16, Method: Composition-based stats. Identities = 37/136 (27%), Positives = 53/136 (38%), Gaps = 19/136 (13%) Query: 87 KPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPDLRAPIV 144 P P +P + + G + LINRY G + H+D + E +PI Sbjct: 130 SPKPWIP-VLERIRDHVSGVTGQ---TFNFVLINRYKDGCDHIGEHRDDERELAPGSPIA 185 Query: 145 SVSLGLPAIFQFGG-------LKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGF 196 SVS G F F R + RL L HG +++ + +YH + K Sbjct: 186 SVSFGACRDFVFRHKDSRGKSPSRRVAVVRLPLAHGSLLMMNHPTNTHWYHSLPVRKKVL 245 Query: 197 HPLTIDCRYNLTFRQA 212 P R NLTFR+ Sbjct: 246 AP-----RVNLTFRKI 256 >UniRef50_A5FII3 DNA-N1-methyladenine dioxygenase n=2 Tax=Flavobacterium johnsoniae UW101 RepID=A5FII3_FLAJ1 Length = 208 Score = 86.2 bits (212), Expect = 6e-16, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 49/140 (35%), Gaps = 12/140 (8%) Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEP 137 +P + P + R + L+N Y G + H DK+ Sbjct: 75 DKDNPGADLKGPDWNYELLTIRGRVEKET---QQDFNTVLLNLYRDGNDGVGWHSDKEHN 131 Query: 138 DLRAPIV-SVSLGLPAIFQFGGL-KRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKA 194 PI+ SV+ G +F+ + P + L HG ++ G + + H + Sbjct: 132 TGPNPIIASVTFGETRMFRLRHKYSKEIPQIEIPLHHGSFLLMAGTTNSFWQHQVPKTAR 191 Query: 195 GFHPLTIDCRYNLTFRQAGK 214 P R NLTFRQ + Sbjct: 192 NVLP-----RINLTFRQTHR 206 >UniRef50_A9BA22 Alkylated DNA repair protein n=2 Tax=Prochlorococcus marinus RepID=A9BA22_PROM4 Length = 189 Score = 86.2 bits (212), Expect = 7e-16, Method: Composition-based stats. Identities = 37/143 (25%), Positives = 55/143 (38%), Gaps = 16/143 (11%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK-LS 129 + Y YS + W P+ F L + + CLIN Y G + Sbjct: 57 SKGISYKYSGA-IHYAEDW---PKWFFPLLDYIRD---FSRTNYNGCLINLYRDGNDCMG 109 Query: 130 LHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYH 187 H D + E D + I S+SLG F F L + + L GD+++ E + + H Sbjct: 110 WHSDNEKELDPKKSIASLSLGATRDFFFRSLI-DSSSNNIELRDGDLLLMHPECQFNWKH 168 Query: 188 GIQPLKAGFHPLTIDCRYNLTFR 210 + K + R NLTFR Sbjct: 169 CLPKRKKVS-----EVRINLTFR 186 >UniRef50_C8XTB3 Putative uncharacterized protein n=1 Tax=Dunaliella viridis RepID=C8XTB3_9CHLO Length = 2229 Score = 85.8 bits (211), Expect = 8e-16, Method: Composition-based stats. Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 7/122 (5%) Query: 92 MPQSFHNLCQRAATAA-GYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSL 148 +P F + ++ + D+ L+N Y G + H D ++ P I S+S Sbjct: 1984 IPSPFTPFLEHLKSSVQECVKEEFDSVLLNYYRDGSDTVGWHADNEKLYGDTPTIASLSF 2043 Query: 149 GLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES-RLFYHGIQPLKAGFHPLTIDCRYNL 207 G F ++ N + L GD++V G++ + + H + P I R NL Sbjct: 2044 GSARDFILRKIEDNSDKYKFTLGPGDLLVMKGKTQQQWQHTVPRRSP---PQAIGPRINL 2100 Query: 208 TF 209 TF Sbjct: 2101 TF 2102 >UniRef50_A9UX55 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UX55_MONBE Length = 180 Score = 85.8 bits (211), Expect = 9e-16, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 45/131 (34%), Gaps = 17/131 (12%) Query: 94 QSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK-DEPDLRAPIVSVSLGLP 151 L + G ++ + LINRYA G + HQD E D PI S++LG Sbjct: 36 PQLRQLKEYVEQTTG---YEYNFVLINRYADGRDTIGEHQDNESELDPDVPIASLTLGAT 92 Query: 152 AIFQFGGLKRNDP--------LKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTID 202 F L L G ++ + +YH + P Sbjct: 93 RDFVLRHRDVRRKCGAHSKLNPHTLPLPSGLLLTMEAPTNKCWYHSVPRRSLARCP---G 149 Query: 203 CRYNLTFRQAG 213 R NLTFR+ Sbjct: 150 PRINLTFRRIR 160 >UniRef50_D2V5F7 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V5F7_NAEGR Length = 460 Score = 85.8 bits (211), Expect = 9e-16, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 56/176 (31%), Gaps = 27/176 (15%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQR--------------------AAT 105 + + Y N +P+ N+ R + Sbjct: 30 QRRVQHYGFKFDYDIRSIDFNTQVEPIPEYTTNIMNRMKEAMKKKKEENDSTIMSDEFIS 89 Query: 106 AAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL 165 + + PD IN Y PG + H D P + VS+ A+ F + + Sbjct: 90 TFDFETYNPDQLTINEYQPGQGIRPHIDVHTP-FNDGLFIVSMLGSAVMYFSKCVGEEVV 148 Query: 166 KR--LLLEHGDVVVWGGESR-LFYHGIQPL---KAGFHPLTIDCRYNLTFRQAGKK 215 ++ + L +++ GE+R L+ H I + R +LT R K+ Sbjct: 149 EKKYVDLPRRSLLILVGEARYLWRHAIMCRELDRVNGKIRKRQRRVSLTIRSVRKE 204 >UniRef50_UPI00017458F4 Alkylated DNA repair protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017458F4 Length = 187 Score = 85.8 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 37/162 (22%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 56 TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD 115 M T C G T G Y + M LCQ+ G F P Sbjct: 34 RMKSRKTAC--FGQTYDDSGIAYEEV---------PMHALLAPLCQKLTATLG---FAPT 79 Query: 116 ACLINRYAPG-AKLSLHQD-KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHG 173 CLIN Y G + + H D + +SLG + F + L G Sbjct: 80 NCLINYYENGRSSMGFHSDATYNLADDTGVAIISLGAERVLTFRSKSTPNLEHAFALPSG 139 Query: 174 DVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 ++ ++ + H I+ D R +LTFR K Sbjct: 140 SLLYMTQATQAHWMHAIKKTDTD------DARISLTFRHILK 175 >UniRef50_A9AQD8 2OG-Fe(II) oxygenase n=43 Tax=Burkholderia RepID=A9AQD8_BURM1 Length = 226 Score = 85.8 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 44/204 (21%), Positives = 69/204 (33%), Gaps = 22/204 (10%) Query: 11 WQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 + E A + + + + TP G +T W Sbjct: 30 FDETPAPDVDWYPDWLVPSEADRLLAALIDEVAWRQDTIRTPRGRIPLPRLTA-----WQ 84 Query: 71 THRQG-YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKL 128 Y+YS I + PA+ L + A G + ++ L+NRY G + Sbjct: 85 GEPDAVYVYSGIRNVPAQWTPAV----LELKRAVEAACGA---RFNSVLLNRYRNGQDGM 137 Query: 129 SLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFY 186 H D + AP++ SVSLG +F L HG ++V G ++ + Sbjct: 138 GWHADNEPELGDAPVIASVSLGAMRVFDLRHRA-TGATHAYRLTHGSLLVMRGRTQAEWQ 196 Query: 187 HGIQPLKAGFHPLTIDCRYNLTFR 210 H + P R NLTFR Sbjct: 197 HRVPK-----APSVHGERVNLTFR 215 >UniRef50_Q1ECQ5 At4g02485 n=2 Tax=Arabidopsis thaliana RepID=Q1ECQ5_ARATH Length = 226 Score = 84.6 bits (208), Expect = 2e-15, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 9/107 (8%) Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHG 173 D ++N Y PG + H D I VSL P + +F ++N+ +LL G Sbjct: 121 DQLIVNLYQPGEGICAHVDL--LRFEDGIAIVSLESPCVMRFSPAEKNEYEAVDVLLNPG 178 Query: 174 DVVVWGGESRL-FYHGIQPLKAGFHPLTIDC-----RYNLTFRQAGK 214 +++ GE+R + H I + GF + R ++T R+ + Sbjct: 179 SLILMSGEARYRWKHEINRKQNGFQLWEGEEIDQKRRISITLRKLCQ 225 >UniRef50_B0C4L3 Alkylated DNA repair protein n=3 Tax=Bacteria RepID=B0C4L3_ACAM1 Length = 175 Score = 84.2 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 32/145 (22%), Positives = 53/145 (36%), Gaps = 16/145 (11%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLS 129 + Y YS I + P LC + G F P+ CL+N Y G + + Sbjct: 42 SFGVAYNYSQITYLKTEMHPE----LLPLCAAVLESLG---FTPNNCLLNFYTDGSSSMG 94 Query: 130 LHQDK-DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYH 187 H D +E + ++SL + + + LE GD++ E ++ + H Sbjct: 95 FHSDTAEELSPGTGVATLSLRATRTITYKHKQARETQYSYSLESGDLLYMSNEVQIDWLH 154 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQA 212 GI R ++TFR Sbjct: 155 GILK------EAQAGPRISVTFRSI 173 >UniRef50_Q15YR0 DNA-N1-methyladenine dioxygenase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15YR0_PSEA6 Length = 210 Score = 83.9 bits (206), Expect = 3e-15, Method: Composition-based stats. Identities = 35/138 (25%), Positives = 55/138 (39%), Gaps = 16/138 (11%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y YS ++ PW + C++A+ + ++ L N Y G ++ H D Sbjct: 78 YQYSGLN-LPPIPWTEELHALKVQCEKASESV------FNSVLANCYRDGQDSMAWHSDD 130 Query: 135 DEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPL 192 + P++ S+SLG F RL LEHG + + G S+ + H + Sbjct: 131 EPELGTRPVIASLSLGQVRNFDLKHRTSGQR-HRLPLEHGSLFIMAGNSQTHWLHSLAKT 189 Query: 193 KAGFHPLTIDCRYNLTFR 210 P R NLTFR Sbjct: 190 TKSLAP-----RINLTFR 202 >UniRef50_UPI00006CC0FF hypothetical protein TTHERM_00219000 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC0FF Length = 254 Score = 83.9 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 62/160 (38%), Gaps = 16/160 (10%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPA----MPQSFHNLCQRAATAAGYPDFQPDACLINR 121 H G + + S I P++++ A + + + P+ CL+N Sbjct: 71 HYGVVYYHTRHNLSEIQPESSESEKALDLSVFDWLIQ--RLINDEVFDVSYPPNQCLVNE 128 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y KL H + E I +SL P+ ++ + +L LE + V + Sbjct: 129 YDNKDKLGCHVENIEA-FGPIIAGLSLHNPSYLALREVENKENKVQLYLEPRSLYVLTSD 187 Query: 182 SRL-FYHGIQPLKAGFHPLTIDC--------RYNLTFRQA 212 SR + HG+ +K ++P+T R +LTFR Sbjct: 188 SRYKWEHGVTKMKEIYNPITQQTIIKNETYRRVSLTFRHV 227 >UniRef50_D2V4D7 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V4D7_NAEGR Length = 292 Score = 83.5 bits (205), Expect = 4e-15, Method: Composition-based stats. Identities = 35/137 (25%), Positives = 52/137 (37%), Gaps = 27/137 (19%) Query: 93 PQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK-LSLHQDKDEPDLRAPIVSVSLGLP 151 P+ NL +R G + +A LINRY G + H D++ I SVSLG Sbjct: 160 PEPLLNLKKRLEELIGE---KYEAALINRYDDGDDLIGWHADREASGFS--IASVSLGAS 214 Query: 152 AIFQFG------GLKRNDPLKRL---------LLEHGDVVVWGGESRLFY-HGIQPLKAG 195 FQ N + LE+G +++ G ++ Y H + K Sbjct: 215 RDFQLRPMPKQNTNSSNTTQSPIQKKGEIITKSLENGCLLIMNGATQKHYQHCVPKRKG- 273 Query: 196 FHPLTIDCRYNLTFRQA 212 + R N+TFR Sbjct: 274 ----VLSARLNITFRDI 286 >UniRef50_C7PPT9 2OG-Fe(II) oxygenase n=2 Tax=Bacteroidetes RepID=C7PPT9_CHIPD Length = 170 Score = 83.5 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 35/145 (24%), Positives = 51/145 (35%), Gaps = 16/145 (11%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLS 129 + Y YS I P+ A + G F P+ CLIN Y G +K+ Sbjct: 38 SFGVAYNYSQISY----PFQAFTPELQEIVTAITATLG---FTPNNCLINYYPDGKSKMG 90 Query: 130 LHQD-KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYH 187 H D D + I VS+G +F ++ L L G ++ + + H Sbjct: 91 YHADQTDILEAGTGIAIVSVGETRTLRFKNIQDPTELVDFPLNAGSLIYMTQAVQDEWLH 150 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQA 212 I G R +LTFR Sbjct: 151 AIPAADTG------QGRMSLTFRSI 169 >UniRef50_Q80Y20-2 Isoform 2 of Alkylated DNA repair protein alkB homolog 8 n=4 Tax=Euteleostomi RepID=Q80Y20-2 Length = 629 Score = 83.1 bits (204), Expect = 6e-15, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 67/234 (28%), Gaps = 73/234 (31%) Query: 5 FADAEPWQ----EPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 F + W+ E L G +++ + E+ + + + + + S+ Sbjct: 119 FVEKAQWKNMGLEALPPGLLVVEEIISSEEEKKLLESVNWTEDT------GNQNFQRSLK 172 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 H G + Y +KP P Sbjct: 173 HRRVKHFG-----YEFHYESNTVDKDKPLPG----------------------------- 198 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 + H D I+S+SLG + F + +++L ++V G Sbjct: 199 ------GIPAHIDTHSA-FEDEIISLSLGSAIVMDFKHPE--GVTVQVMLPRRSLLVMTG 249 Query: 181 ESR-LFYHGIQPLKAGFHPLT-------------------IDCRYNLTFRQAGK 214 ESR L+ HGI P K + R + TFR+ + Sbjct: 250 ESRYLWTHGITPRKFDTVQASEQFKGGIITSDIGDLTLSKRGMRTSFTFRKVRR 303 >UniRef50_B4EMC2 2OG-Fe(II) oxygenase superfamily protein n=14 Tax=Proteobacteria RepID=B4EMC2_BURCJ Length = 240 Score = 83.1 bits (204), Expect = 6e-15, Method: Composition-based stats. Identities = 53/215 (24%), Positives = 74/215 (34%), Gaps = 28/215 (13%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 DLFAD + A A LI ++ Q R TP G +T Sbjct: 46 DLFADTPAPDVDWCPD-WLAPPEADRALATLIDEV--AWRQDTIR---TPRGRIPLPRLT 99 Query: 63 NCGHLGWTTHRQG-YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 W Y+YS I +PW L G + ++ L+NR Sbjct: 100 A-----WQGEPDAVYVYSGIR-NVPQPWTP---GVLALKHAVEATCGV---RFNSVLLNR 147 Query: 122 YAPG-AKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 Y G L H D + AP++ SVSLG +F L HG ++V Sbjct: 148 YRNGLDSLGWHADNEPELGDAPVIASVSLGAMRMFDLRHRT-TGATHTYRLVHGSLLVMR 206 Query: 180 GESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 G ++ + H + P R NLTFR+ Sbjct: 207 GRTQAEWQHRVPK-----APGVQGERINLTFRRVS 236 >UniRef50_C6Y338 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=C6Y338_PEDHD Length = 202 Score = 83.1 bits (204), Expect = 6e-15, Method: Composition-based stats. Identities = 42/150 (28%), Positives = 60/150 (40%), Gaps = 19/150 (12%) Query: 69 WTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W R+ Y YS + W A L A G ++CL+N Y G Sbjct: 66 WYGDREFEYTYS-NTTKKALAWTA---ELLELKAMAEQKTGE---TFNSCLLNLYHSGEE 118 Query: 127 KLSLHQDKDEPDL--RAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR- 183 ++ H D E DL I S+S G F F K++ L+LEHG ++V ++ Sbjct: 119 GMAWHSD-GEKDLKKNGAIGSMSFGAERKFSFKH-KQSKETVSLILEHGSLLVMKDTTQS 176 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 + H + P K + R NLTFR Sbjct: 177 NWLHRLPPTK-----MVHKARVNLTFRTIT 201 >UniRef50_B8NIA9 Putative uncharacterized protein n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8NIA9_ASPFN Length = 316 Score = 83.1 bits (204), Expect = 6e-15, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 45/116 (38%), Gaps = 10/116 (8%) Query: 102 RAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGL 159 RA Y D + LIN YA +S H D + P I S+SLG F Sbjct: 170 RAEDVVQYKDNYYNFILINYYATNTDSISYHSDDERFLGPNPSIASLSLGAKRDFLLKHK 229 Query: 160 --KRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 + L GD+VV GE++ + H I R N+TFR+A Sbjct: 230 PGVEAGKPLKFPLASGDMVVMRGETQGNWLHSIPKRAG-----EGGGRINVTFRRA 280 >UniRef50_A0YHA0 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YHA0_9GAMM Length = 206 Score = 82.3 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 35/152 (23%), Positives = 57/152 (37%), Gaps = 19/152 (12%) Query: 69 WTTHR-QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP-GA 126 W Y YS + +PW L + + + ++ L N Y Sbjct: 63 WYGDANAHYGYSGLK-LAPQPWTP---GLLLLKTKIEK---FLQTEFNSVLANYYRDAND 115 Query: 127 KLSLHQDKDEPDLRAP--IVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGGESR 183 ++ H D DEP+L A I S+S G F N + L G ++V G+++ Sbjct: 116 SVAWHAD-DEPELGAQPVIASLSFGATRRFSLRRKSANGIAPFHIELASGSLLVMAGDTQ 174 Query: 184 LFYHG-IQPLKAGFHPLTIDCRYNLTFRQAGK 214 F+H + +K R NLT+R + Sbjct: 175 KFWHHQVAKIKQPVA-----GRINLTYRFITE 201 >UniRef50_D0MWE8 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MWE8_PHYIN Length = 261 Score = 81.9 bits (201), Expect = 1e-14, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 59/161 (36%), Gaps = 22/161 (13%) Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 CG +G Y YS + + +P + QR + L+N Sbjct: 109 QQLCGEMG------SYRYSGKTFEAQQKFPPGLRHAVQQMQRFVEDPSTQHTRLTGGLVN 162 Query: 121 RYAPGA-KLSLHQDKDEPDLR-APIVSVSLGLPAIFQFGGLKRNDPLK--------RLLL 170 Y G + H D ++ + +PI+++SLG F F + L + Sbjct: 163 WYENGDHYIGPHADDEKDMMACSPIIALSLGAARRFVFTKKTSKSAPQGDEAVARMELQM 222 Query: 171 EHGDVVVWGGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 E GD+++ GG + R H + + P R ++T R Sbjct: 223 EDGDLMIMGGTTQRTHKHAVPKMARCREP-----RISVTLR 258 >UniRef50_Q01EG7 SelMay undefined product (IC) n=1 Tax=Ostreococcus tauri RepID=Q01EG7_OSTTA Length = 494 Score = 81.9 bits (201), Expect = 1e-14, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 60/148 (40%), Gaps = 18/148 (12%) Query: 79 SPIDPQTNKPWPAMPQSFHNLCQR---AATAAGY------PDFQPDACLINRYAPGAKLS 129 + + + +P + QR A G+ P F C+IN+Y P L+ Sbjct: 335 GWLKKEQAMQFSPLPSWLVVVGQRLYQIAVEVGFVMDDERPLFNFSQCIINQYTPPGGLT 394 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR---LLLEHGDVVVWGGESRL-F 185 H D I S+SL F ++ N ++ L L+HGDV+++ G++R + Sbjct: 395 PHVDL--RAFGDLIASISLCSTVAMDFAPVEPNANMQSNLTLRLDHGDVLIFKGDARWRW 452 Query: 186 YHGIQPLKA---GFHPLTIDCRYNLTFR 210 H I + G + R ++T R Sbjct: 453 THAIPSRQVDIFGAERVERAHRISITLR 480 >UniRef50_A6VZM6 Putative alkylated DNA repair protein n=1 Tax=Marinomonas sp. MWYL1 RepID=A6VZM6_MARMS Length = 185 Score = 81.6 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 34/138 (24%), Positives = 55/138 (39%), Gaps = 15/138 (10%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK-LSLHQDK 134 Y Y+ D W P + + A A +A L+N Y G + + H D Sbjct: 57 YRYTGKD-HYGIGW---PDWLLAIKEEAEILAKQ---SFNAVLLNWYQDGEEYMGWHADD 109 Query: 135 DEPDLRAPIVS-VSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPL 192 ++ AP+V+ +SLG F F + + LE G +V ++ L+ H + Sbjct: 110 EKSLGPAPVVAMLSLGASRPFIFRLKGNHQIKHSVELEDGSWLVMSASTQVLWQHSLPVR 169 Query: 193 KAGFHPLTIDCRYNLTFR 210 K + R +LTFR Sbjct: 170 KR-----IKEERISLTFR 182 >UniRef50_UPI000180B7B0 PREDICTED: similar to LOC496071 protein n=1 Tax=Ciona intestinalis RepID=UPI000180B7B0 Length = 288 Score = 81.2 bits (199), Expect = 2e-14, Method: Composition-based stats. Identities = 42/201 (20%), Positives = 68/201 (33%), Gaps = 30/201 (14%) Query: 22 LRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPI 81 F + + + Q R+ + G +M +T W + Y YS + Sbjct: 100 FPNFLEKSDADWMLETLKNEVQWEHRRNLKYGPNSMEPRLTA-----WFSEFS-YSYSGV 153 Query: 82 DPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDL- 139 N W + L R G ++ ++ L N Y G + H D EP L Sbjct: 154 VQPPNPHWHPL---LAALRDRLNDLYG---YKFNSLLANLYRDGHDSVDWHTD-AEPALG 206 Query: 140 -RAPIVSVSLGLPAIFQFGGLKRND--------PLKRLLLEHGDVVVWGGESRL-FYHGI 189 PI S+S G F+ + R+ L HG +++ G ++ + H + Sbjct: 207 NSPPIASISFGDTRNFELREITDIKTDEDLTYCKRIRVPLTHGSLLLMTGATQHDWQHRV 266 Query: 190 QPLKAGFHPLTIDCRYNLTFR 210 R NLTFR Sbjct: 267 PKEY-----HDRSARVNLTFR 282 >UniRef50_C1N0U9 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1N0U9_9CHLO Length = 408 Score = 81.2 bits (199), Expect = 2e-14, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 68/229 (29%), Gaps = 52/229 (22%) Query: 17 AGAVILRRFAFNAAE-QLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 GA ++ F E +++ + +A H G Sbjct: 144 PGATLILDFVTEDEEVAMLKSAEEDPRW-------------QRLAKRRVLHYG-----YA 185 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD----FQPDACLINRYAPGAKLSLH 131 + Y D + AMP L RAA P + D +N Y PG L+ H Sbjct: 186 FDYGTRDAKAPA-GAAMPSYAAALLDRAAALTDVPGVERALRCDQLTVNEYEPGIGLAPH 244 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND------------PLKRLLLEHGDVVVWG 179 D I++ S G A+ +F +R+ + L ++V Sbjct: 245 VDTHSA-FGGTILAASCGGGAVIEFRLHERDGDGDDDASRRVPSRRAAIYLPPRSLLVMA 303 Query: 180 GESRLFY-HGIQPLKAGFHPLTI--------------DCRYNLTFRQAG 213 GE+R + H + K R + TFR+ Sbjct: 304 GEARYRWAHYVPHRKRDAVKRFGGDGGGAATEIARREGKRVSFTFRETR 352 >UniRef50_A6GHE3 2OG-Fe(II) oxygenase n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE3_9DELT Length = 228 Score = 80.8 bits (198), Expect = 3e-14, Method: Composition-based stats. Identities = 34/137 (24%), Positives = 50/137 (36%), Gaps = 19/137 (13%) Query: 88 PWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVS 145 P P P L +R A + ++ LIN Y G + H D + AP I S Sbjct: 91 PLPWTPP-LDALRRRVEAAT---RRRFNSALINYYRDGRDTVGWHADDEVELGPAPFIAS 146 Query: 146 VSLGLPAIFQF-----GGLKRNDPLK--RLLLEHGDVVVWGGESR-LFYHGIQPLKAGFH 197 VSLG F + + + L HG ++V S+ + H + Sbjct: 147 VSLGAERDFLLRRVANADTDTDTEPRHLSVALPHGSLLVMAEGSQARWQHTLPRRTRVT- 205 Query: 198 PLTIDCRYNLTFRQAGK 214 + R NLTFR + Sbjct: 206 ----EGRINLTFRHVSR 218 >UniRef50_A2Q3T7 2OG-Fe(II) oxygenase n=2 Tax=Medicago truncatula RepID=A2Q3T7_MEDTR Length = 497 Score = 80.4 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 68/215 (31%), Gaps = 43/215 (20%) Query: 25 FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA-MTNCGHLGW------TTHRQG-- 75 F ++++ I + G + T W T + G Sbjct: 217 FNATEQDEIVEYIYGLQR----------RGQQGRLRDRTYSKPRKWMRGKGRETLQFGCC 266 Query: 76 YLYSPIDPQTN------KPWPAMPQSFHNLCQRAATAAGYPDF-QPDACLINRYAPGAKL 128 Y Y+ + +P F + +R P PD+C++N Y G + Sbjct: 267 YNYAVDKYGNPPGICRTEEVDPLPDVFKQMIKRMVRWNIIPPTCVPDSCIVNIYDVGDCI 326 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP------LKRLLLEHGDVVVWGGE- 181 H D D P SVS A FG + + L G V V G Sbjct: 327 PPHIDHH--DFVRPFYSVSFLNEAKILFGSNLKEIQPGEFSGPASISLPLGSVFVLNGNG 384 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + + H I + + R ++TFR+ +++ Sbjct: 385 ADIAKHCIPSVSSK--------RISITFRKMDERK 411 >UniRef50_A1S8P3 DNA-N1-methyladenine dioxygenase n=1 Tax=Shewanella amazonensis SB2B RepID=A1S8P3_SHEAM Length = 206 Score = 80.4 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 70/195 (35%), Gaps = 27/195 (13%) Query: 22 LRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTH-RQGYLYSP 80 + F + L+ + + A M+ G + W Y YS Sbjct: 24 VPAFLSPRQQALL--MTEAADYPFESPMIKVYGKWHPIPRQQV----WFADEGCSYRYSS 77 Query: 81 IDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDL 139 + + PW P L Q G + CL+N Y G + H D DEP+L Sbjct: 78 L-LISPTPW---PHYLLRLKQALEAHCGAG---FNGCLVNHYRGGEDTMGFHAD-DEPEL 129 Query: 140 --RAPIVSVSLGLPAIFQFGGLKRNDPLKR-LLLEHGDVVVWGGESR-LFYHGIQPLKAG 195 + I VSLG +R D L+ +LL+ GD+++ + + H I Sbjct: 130 VEESLIAIVSLGASRPLVM--RRREDGLRCRVLLQSGDLLLMHPPMQSTWEHAIPR---- 183 Query: 196 FHPLTIDCRYNLTFR 210 ++ R + TFR Sbjct: 184 -SQKSLPARISFTFR 197 >UniRef50_A1ZYS3 Alkylated DNA repair protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZYS3_9SPHI Length = 185 Score = 80.4 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 33/145 (22%), Positives = 49/145 (33%), Gaps = 14/145 (9%) Query: 72 HRQGYLYS--PIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLS 129 + GY Y +P L A Y + PD ++N Y G + Sbjct: 45 QQYGYRYHFLKRTMDHVSTHTPLPGWAAQL-THAFLIKQYLNTLPDLLIVNEYKVGEGIK 103 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGG-LKRNDPLKRLLLEHGDVVVWGGESR-LFYH 187 H D I+ VSLG I + + + L L ++V GE R + H Sbjct: 104 PHID-SPLLFGETILIVSLGADCIMELEPMPEAGQGKQTLSLAARSLLVMQGEVRHHWQH 162 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQA 212 I ++ R +LTFR Sbjct: 163 SIVNVQK--------RRVSLTFRTV 179 >UniRef50_D1ITZ2 Whole genome shotgun sequence of line PN40024, scaffold_5.assembly12x (Fragment) n=6 Tax=Embryophyta RepID=D1ITZ2_VITVI Length = 260 Score = 80.0 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 73/243 (30%), Gaps = 43/243 (17%) Query: 2 LDLFADAEPWQEPLA--AGAVILRRFAFNAAEQLIRDIND---VASQSPFRQMVTPGGYT 56 + ++ P EP++ G + R F + + + S++ Q + G Sbjct: 33 SSIHSEKNPSWEPISEINGLWLCRDFLSPQEQSSLLSAIEKEGWFSEASHNQAMRFGNLP 92 Query: 57 MSVAMTNCGHLGWTTHRQGY---LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 A + Y + ++ +P L + Sbjct: 93 EW-ATELSHSIREVVLFSDYVSEHMDSVTCDGDEKGCLLPSEI--LWREPL--------- 140 Query: 114 PDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP--------- 164 D ++N Y PG + H D I +SL I F + + Sbjct: 141 FDQLILNVYQPGEGICPHVDLM--RFEDGIAIISLESSCIMHFTHVDDTEACDSGREGRN 198 Query: 165 -----LKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDC-----RYNLTFRQAG 213 + L G +V+ GE+R + H I K GF R ++T R+ Sbjct: 199 YSPMTKIPVYLTPGSLVLMSGEARYFWKHEINR-KPGFQIWEGQEIDQKSRTSITLRKLC 257 Query: 214 KKE 216 K E Sbjct: 258 KIE 260 >UniRef50_D2V609 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2V609_NAEGR Length = 279 Score = 79.6 bits (195), Expect = 7e-14, Method: Composition-based stats. Identities = 49/230 (21%), Positives = 80/230 (34%), Gaps = 46/230 (20%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 +LF+ + + GA+ ++ + E+ I + D + Sbjct: 53 ELFSTN---PKVVVPGAIYIKNYISEEEEERIMKLID--------------------SKA 89 Query: 63 NCGHLGWTTHRQGYLY----------SPIDPQTNKPWPAMPQSFHNLCQRAATAAGY--P 110 C + T GY Y P++ ++ + F L +R G Sbjct: 90 WCHEICRRTQMYGYTYYHTRHNLPTMQPVNESSSNYQHLDLKEFDWLIERLVERDGLYKT 149 Query: 111 DF-QPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLL 169 D+ P CL+N Y +S H D P I VSL P + ++L Sbjct: 150 DYGNPTQCLVNEYIGTQGISSHVDNPGP-FGDIITLVSLNKPIYMVLKLASNENIQTKIL 208 Query: 170 LEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDC--------RYNLTFR 210 LE + V +SR + HGI +K + P T + R +LTFR Sbjct: 209 LEPRSLFVMKDDSRFKWKHGITHMKQVYVPSTGETLIRDENYRRVSLTFR 258 >UniRef50_Q6C9X6 YALI0D07546p n=1 Tax=Yarrowia lipolytica RepID=Q6C9X6_YARLI Length = 372 Score = 78.9 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 65/157 (41%), Gaps = 11/157 (7%) Query: 70 TTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA---GYPDFQPDACLINRY-APG 125 + Y Y+ T++ + ++ + Q + G ++ DAC+ N Y Sbjct: 161 VYSQHTYRYNGKPVDTSRKYSKAMEACSDRIQELVRSTNKEGTAEWLSDACIANYYADES 220 Query: 126 AKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGG--LKRNDPLKRLLLEHGDVVVWGGES 182 + H D+ P++ +++LG IF+ G + +LL H +++ Sbjct: 221 QSVGFHSDQLTYIGPRPVIAALTLGSERIFRLKGACPDGDRRTYNILLPHNSLMIMHAGC 280 Query: 183 RLFY-HGIQPLKA---GFHPLTIDCRYNLTFRQAGKK 215 + Y H I P++A G H R++LTFR K+ Sbjct: 281 QEAYKHSIIPVQAKQIGLHERAGKVRFSLTFRHYKKE 317 >UniRef50_A7EEP6 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EEP6_SCLS1 Length = 319 Score = 78.9 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 60/183 (32%), Gaps = 42/183 (22%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSL 130 + + Y+ ++ W +P+ +L R + +PD + Y PG + Sbjct: 59 HYGAHFDYTTF--GASEMWTPVPRYLEDLVDRLPWRKEGKEERPDQFTVQYYPPGTGIPP 116 Query: 131 HQDKDEPDLRAPIVSVSLGLPAIFQFG------------------GLKRNDPLKR----- 167 H D + S+S+G F G R++ + Sbjct: 117 HVDTHSV-FGEYLYSLSIGSSVPMVFKKCGENEARKMRKPKRSLLGDSRDEVNRTRVTIK 175 Query: 168 ----------LLLEHGDVVVWGGESRLFY-HGIQPLKAGFHPLTID-----CRYNLTFRQ 211 + L +++ GE+R + H I+ K F + R+++T R+ Sbjct: 176 AEDDGEEKWEVWLRERSLLLMRGEARFGFTHMIRGRKFDFDERKGERVRRVGRWSITMRR 235 Query: 212 AGK 214 + Sbjct: 236 VRR 238 >UniRef50_Q2SBS6 Alkylated DNA repair protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SBS6_HAHCH Length = 203 Score = 78.9 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 38/146 (26%), Positives = 56/146 (38%), Gaps = 17/146 (11%) Query: 69 WTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W Y YS + P+ + Q ++ A A + + L N Y G Sbjct: 66 WYGEPHCHYAYSGLRLN-PTPFSPLLQQLRHIASEHAAA------KFNCALCNLYRNGQD 118 Query: 127 KLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRL 184 +S H D + API+ S S G FQ KR + L H +++ G+ R Sbjct: 119 SVSWHADDEPELGPAPIIASFSFGATRTFQIK-PKRGGQTLAIELLHNSLLIMSGDMQRH 177 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFR 210 + H + KA P R NLT+R Sbjct: 178 WRHQLPKTKAPVGP-----RVNLTYR 198 >UniRef50_D2VTV7 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VTV7_NAEGR Length = 251 Score = 78.1 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 39/160 (24%), Positives = 63/160 (39%), Gaps = 28/160 (17%) Query: 70 TTHRQGYLYSPIDPQTNKP---------WPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 T + Y Y D + + W +P+ +L +R + C IN Sbjct: 98 TWKTKTYNYGGKDVISPREFCGISSQSEWSKIPE-LISLKERIEKFTNH---TFTYCFIN 153 Query: 121 RYAPG-AKLSLHQDKDE-PDLRAPIVSVSLGLPAIFQF-------GGLKRNDPLKRLLLE 171 +Y G + H DK++ PIVS+SLG FQF +++ + L Sbjct: 154 KYKDGNDSIYWHSDKEKGLKKGCPIVSISLGQERDFQFRPKISKNSKQQKDGNIIEKNLP 213 Query: 172 HGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFR 210 G +VV E++ +Y H + + + R NLTFR Sbjct: 214 DGSMVVMNYETQEYYEHSLPKRRNIH-----NIRLNLTFR 248 >UniRef50_A1K994 DNA repair system specific for alkylated DNA n=1 Tax=Azoarcus sp. BH72 RepID=A1K994_AZOSB Length = 194 Score = 77.7 bits (190), Expect = 2e-13, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 65/203 (32%), Gaps = 23/203 (11%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTH-RQG 75 + + AA+ L + A T G + C W + Sbjct: 8 PLLALTPLYDATAAQALFDTL--CAEIPWNDGDYTAAGRRFRLPRLQC----WFSDPGAT 61 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y Y+ + + PW + L R +G + +A L N Y G + H D Sbjct: 62 YRYAD-NLMNSHPWTP---TLAALRARVEAVSGV---RFNAVLANLYRDGEDAVGWHADD 114 Query: 135 DEPDLRAP-IVSVSLGLPAIFQFG-GLKRNDPLKRLLLEHGDVVVWGGE-SRLFYHGIQP 191 ++ AP I S+SLG F + L L G +++ + + H + Sbjct: 115 EDDLGPAPHIASLSLGATRRFHWRPKPGVVGEADALPLPAGTLLLMRAPFQQQWEHAVP- 173 Query: 192 LKAGFHPLTIDCRYNLTFRQAGK 214 P R NLTFR Sbjct: 174 ----AEPAVRGARLNLTFRNVVA 192 >UniRef50_UPI0001BCD579 alkylated DNA repair protein AlkB n=1 Tax=Campylobacter fetus subsp. venerealis str. Azul-94 RepID=UPI0001BCD579 Length = 69 Score = 77.3 bits (189), Expect = 3e-13, Method: Composition-based stats. Identities = 34/55 (61%), Positives = 40/55 (72%) Query: 158 GLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G+KR DP+ + +L HGDVVVW G SRLFYHGI PLK+G H R NLTFR+A Sbjct: 14 GMKRTDPITKYILHHGDVVVWVGPSRLFYHGILPLKSGEHERLGPIRLNLTFRKA 68 >UniRef50_C7PS76 2OG-Fe(II) oxygenase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PS76_CHIPD Length = 211 Score = 77.3 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 30/121 (24%), Positives = 44/121 (36%), Gaps = 12/121 (9%) Query: 94 QSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK-DEPDLRAPIVSVSLGLP 151 ++ TA G + L+N Y G +S H D I SV+ G Sbjct: 83 PVLLDIKAAVETACGI---TFNRVLLNYYRDGQDSVSWHSDHPSSSGKHYAIASVTFGET 139 Query: 152 AIFQFGGLKRND-PLKRLLLEHGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTF 209 +F+ +R D + L HG ++ G + Y H + I R NLTF Sbjct: 140 RLFKVRHKERKDIAPLDIPLTHGSFLLMGPTMQEHYEHHVPKTSRN-----IGARINLTF 194 Query: 210 R 210 R Sbjct: 195 R 195 >UniRef50_A9G7L2 High confidence in function and specificity n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G7L2_SORC5 Length = 174 Score = 76.9 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 38/145 (26%), Positives = 57/145 (39%), Gaps = 16/145 (11%) Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLH 131 Y YS I P MP +L R A G+P + CL N YA G AK+ H Sbjct: 43 GVAYNYSGISY----PDCEMPPFVQDLAARLAGVVGHPI---NNCLANFYADGTAKMGFH 95 Query: 132 QDKDEPDLRAPIVS-VSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGI 189 D + S +SLG P + F R + + L G +++ + + H + Sbjct: 96 SDSSAGVVAGTTTSILSLGAPRVLTFRRSLRRNETHDMALAPGSLLIMRPSVQEGWQHAV 155 Query: 190 QPLKAGFHPLTIDCRYNLTFRQAGK 214 ++A R +LTFR + Sbjct: 156 LAVEAA------GPRISLTFRFLSR 174 >UniRef50_Q9SIE0 Expressed protein n=10 Tax=Magnoliophyta RepID=Q9SIE0_ARATH Length = 314 Score = 76.5 bits (187), Expect = 6e-13, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 53/158 (33%), Gaps = 34/158 (21%) Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA-PGAKLSLHQDKDE 136 YS P T+ W P + P + ++ L+NRY ++ H D ++ Sbjct: 163 YSGYRP-TSYSWDDFPP-LKEILDAIYKVL--PGSRFNSLLLNRYKGASDYVAWHADDEK 218 Query: 137 PDLRAP-IVSVSLGLPAIFQFGGLKRNDP----------------------LKRLLLEHG 173 P I SVS G F K + + L L+HG Sbjct: 219 IYGPTPEIASVSFGCERDFVLKKKKDEESSQGKTGDSGPAKKRLKRSSREDQQSLTLKHG 278 Query: 174 DVVVWGGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 ++V G + R + H + R NLTFR Sbjct: 279 SLLVMRGYTQRDWIHSVPKRAKAE-----GTRINLTFR 311 >UniRef50_B9SA55 Oxidoreductase, putative n=1 Tax=Ricinus communis RepID=B9SA55_RICCO Length = 253 Score = 76.5 bits (187), Expect = 6e-13, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 53/160 (33%), Gaps = 34/160 (21%) Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDE 136 YS P W P ++ + A P + ++ L+NRY G + H D ++ Sbjct: 102 YSGYKPHVYS-WDDYPP-LKDILEAVHRAL--PGSRFNSLLLNRYKGGNDNVGWHADDEK 157 Query: 137 PDLRAP-IVSVSLGLPAIFQFGG----------LKRNDP------------LKRLLLEHG 173 P I SVS G F ++P L+HG Sbjct: 158 LYGPTPEIASVSFGCEREFLLKKRQSKSKAAERRCDDEPDRKRLKKSSHVDQHSFTLKHG 217 Query: 174 DVVVWGGES-RLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 ++V G + R + H + R NLTFR Sbjct: 218 SLLVMKGNTQRDWLHSLPKRAKAEA-----TRINLTFRHV 252 >UniRef50_C5CMR5 2OG-Fe(II) oxygenase n=1 Tax=Variovorax paradoxus S110 RepID=C5CMR5_VARPS Length = 202 Score = 76.2 bits (186), Expect = 7e-13, Method: Composition-based stats. Identities = 52/217 (23%), Positives = 72/217 (33%), Gaps = 34/217 (15%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 DLF +A G R F A E + + R Sbjct: 14 DLFGEAPAAAIE---GLRYEREFLTRAEEAELLRLVQGFELREMRY------------KE 58 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 + Y + D +P AMP + H L R A G LI+ Y Sbjct: 59 YTARRRGISFGGSYDF---DKHRLRPGAAMPPALHPLRARVAAWMGMAPEDFAHMLISEY 115 Query: 123 APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFG----GLKRNDPLKRLLLEHGDVVVW 178 PG L H+D PD IV VSL A+ Q +LL+E + + Sbjct: 116 RPGTPLGWHRDV--PDFED-IVGVSLQGDAVMQLRPYPPASASAPASLQLLIEPRSIYML 172 Query: 179 GGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 GE+R + H I P +A RY++T R + Sbjct: 173 RGEARWAWQHSIAPTEA--------LRYSITMRTRRR 201 >UniRef50_D2VF66 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VF66_NAEGR Length = 271 Score = 76.2 bits (186), Expect = 7e-13, Method: Composition-based stats. Identities = 38/166 (22%), Positives = 58/166 (34%), Gaps = 36/166 (21%) Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 + Y P + P+ +L +R G LINRY G + + H Sbjct: 118 ENYSKHHRRPVHD----EWPKELTDLKERIEKYTGD---TFTFALINRYDTGESSIGWHS 170 Query: 133 DKDEPDL--RAPIVSVSLGLPAIFQF----------GGLKRNDP---------LKRLLLE 171 D E D+ + IVS+SLG F+F + D LE Sbjct: 171 D-MEQDIKKDSSIVSISLGAARDFKFRPTPKKENSKKSPTKKDEESEEEEKVQTITQKLE 229 Query: 172 HGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 +G +V+ ++ Y H I K R N+TFR + + Sbjct: 230 NGSMVIMNYATQRHYQHSIPKRK-----NLNSVRLNITFRHVVRNK 270 >UniRef50_A6DH75 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DH75_9BACT Length = 196 Score = 76.2 bits (186), Expect = 7e-13, Method: Composition-based stats. Identities = 32/146 (21%), Positives = 51/146 (34%), Gaps = 16/146 (10%) Query: 69 WTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W Y YS ID + N PW L + + ++ L N Y G Sbjct: 58 WLADPNIHYNYSGIDLKIN-PWTQQVLKLKTLAED------KSHWTFNSMLANYYRDGKD 110 Query: 127 KLSLHQDKDEPDLRAPIVS-VSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRL 184 H D ++ R P+++ S G F + + L +G +++ G Sbjct: 111 SNGWHADNEKELGRNPLIAMFSFGQIRRFSIRSNENHKNKLDFDLNNGSLIIMKGPLQHT 170 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFR 210 H ++ K D R +LTFR Sbjct: 171 SQHCLRKTKKK-----CDARISLTFR 191 >UniRef50_B8C4G3 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4G3_THAPS Length = 318 Score = 75.4 bits (184), Expect = 1e-12, Method: Composition-based stats. Identities = 36/136 (26%), Positives = 53/136 (38%), Gaps = 17/136 (12%) Query: 92 MPQSFHNLCQRAATAAGYPDFQPDA---CLINRYAPGAK-LSLHQDK-DEPDLRAPIVSV 146 + F NL + + + CL+N Y G + +S H D D +PI SV Sbjct: 188 IAPEFVNLLKTINNEKKNGQRRTEIFNYCLLNHYRSGEEYMSYHTDDESSLDPHSPIASV 247 Query: 147 SLGLPAIFQF------GGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQPLKAGFHPL 199 SLG+ F G R L R+ L GD+++ + Y H + Sbjct: 248 SLGVARNFDIRQRKMKGSDGRRPRLARISLGDGDLLLMFPPMQDHYEHAVP-----IEKR 302 Query: 200 TIDCRYNLTFRQAGKK 215 + R NLTFR+ K Sbjct: 303 VVGDRINLTFRRIVTK 318 >UniRef50_B5IQB5 DNA repair system specific for alkylated DNA n=1 Tax=Cyanobium sp. PCC 7001 RepID=B5IQB5_9CHRO Length = 147 Score = 75.4 bits (184), Expect = 1e-12, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 99 LCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP--IVSVSLGLPAIFQ 155 + QR A+G P + L N Y G ++ H D DEP+L A I S+SLG F Sbjct: 1 MLQRLREASGVP---FNTALANLYRDGRDSVAWHSD-DEPELGAHPVIASLSLGATRRFL 56 Query: 156 FGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 + + L HG ++ G ++ + H + R NLTFR G Sbjct: 57 MRRKADHRHRRAFQLSHGSLLWMAGSTQEHWQHCLPKTARPVA-----ARINLTFRAIG 110 >UniRef50_Q3M1V0 DNA-N1-methyladenine dioxygenase n=3 Tax=Nostocaceae RepID=Q3M1V0_ANAVT Length = 199 Score = 75.4 bits (184), Expect = 1e-12, Method: Composition-based stats. Identities = 39/160 (24%), Positives = 52/160 (32%), Gaps = 19/160 (11%) Query: 54 GYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 G TM V C + YLYS W + L G + Sbjct: 49 GKTMPVPRLECI---YGDEGCDYLYSNSVLLKPLAWT---DALSKLRDSITAFTG---YS 99 Query: 114 PDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 + N+Y G + H DK+ P I S+SLG FQ + LE Sbjct: 100 FRIVIGNQYRSGQDSIGWHADKESSMGVEPTITSISLGAVRKFQIKPI--GGKPTDFWLE 157 Query: 172 HGDVVVW-GGESRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 HG ++V G H + + R NLTFR Sbjct: 158 HGSLLVMLPGCQTTHLHQVPKTNK-----FVTTRINLTFR 192 >UniRef50_Q7S1J6 Predicted protein n=3 Tax=Sordariales RepID=Q7S1J6_NEUCR Length = 290 Score = 75.0 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 35/267 (13%), Positives = 68/267 (25%), Gaps = 81/267 (30%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 + Q G +++ F E + F + ++ Sbjct: 11 PENPKIQRWDDIGLLLIHDFITEDEEAAMIAA--------FHAVDPRLDGKRRISQ---- 58 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 + Y+ + + +P + R D+ PD Y PG Sbjct: 59 -----HFGYHFDYTTFGA-SETSFTPVPSYITDFLPRL----PVQDYLPDQFTAQYYPPG 108 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGL--PAIFQFGG------------------------- 158 A + H D + S+S G P +F+ Sbjct: 109 AGIPPHVDTHS-MFGEALYSLSYGSAVPMVFRLSDANDARKMRLPRRSLQSSVSESKTEG 167 Query: 159 ---------------------LKRNDPLKR------LLLEHGDVVVWGGESRLFY-HGIQ 190 +++ L+L +++ G +R Y HGI+ Sbjct: 168 ASGTNPESAATAPDSQTTMTVQSKSEEPSPENPSWELVLPPRSLLLMTGPARYGYTHGIK 227 Query: 191 PLKA---GFHPLTIDCRYNLTFRQAGK 214 K + RY++T R + Sbjct: 228 SRKTDIINGEMVHRQGRYSITMRTIRR 254 >UniRef50_Q8YKL5 All7279 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YKL5_ANASP Length = 141 Score = 75.0 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 35/138 (25%), Positives = 47/138 (34%), Gaps = 16/138 (11%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 YLYS W L + A G + + N+Y G + H DK Sbjct: 10 YLYSNSVLLKPLAWTE---PLAKLRDKITAATG---YSFRIVIGNQYRSGQDSIGWHADK 63 Query: 135 DEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW-GGESRLFYHGIQPL 192 + P I S+SLG FQ + LEHG ++V G H + Sbjct: 64 ESSMGIEPAIASISLGSARKFQLKPI--GGKPTDFWLEHGSLLVMLPGCQTTHVHQVPKT 121 Query: 193 KAGFHPLTIDCRYNLTFR 210 + R NLTFR Sbjct: 122 TK-----FVTTRINLTFR 134 >UniRef50_C1FG54 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FG54_9CHLO Length = 684 Score = 75.0 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 40/151 (26%), Positives = 56/151 (37%), Gaps = 26/151 (17%) Query: 80 PIDPQTNKPWPAMPQSFHNLCQR-AATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPD 138 +T+ P +P + A A +PD+C IN Y PG + H D P Sbjct: 117 SNRVETHVPVAPLPPELDAVVDALIARGALTELQRPDSCTINLYGPGQWIPPHIDN--PA 174 Query: 139 LRAPIVSVSLGLPAIFQF--------------GGLKRNDPLKRLLLEHGDVVVWGGESRL 184 P V+VSL G +R + L L G VV GE+ Sbjct: 175 FDRPFVTVSLCSEQPMVLGRGMVWPEGGRGPCGDDERLNEEHALSLPVGSAVVVEGEAAD 234 Query: 185 FY-HGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 Y H + P+ A R +LTFR+ G+ Sbjct: 235 EYEHAVPPVTA--------ERISLTFRRRGR 257 >UniRef50_B6AFB9 Oxidoreductase, 2og-Fe(II) oxygenase family protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6AFB9_9CRYT Length = 332 Score = 75.0 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 46/157 (29%), Gaps = 14/157 (8%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 + G+ Y + K +P + R +PD IN Y G Sbjct: 147 SRRVQHYGFGFDY-KNKIISPKWVRDIPIKIEMIINRLL-LHNIVTSRPDQITINEYIAG 204 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF-------GGLKRNDPLKRLLLEHGDVVVW 178 + H D + I VSLG F + + + V Sbjct: 205 QGIGPHIDSH-HTIGNYIAVVSLGSGVGMDFYELQLSDSKSFKKQKKHSIYIPKNSVYTM 263 Query: 179 GGESRLFY-HGIQPLKA---GFHPLTIDCRYNLTFRQ 211 R + HGI+ + + R +LTFR+ Sbjct: 264 SSNIRYCWQHGIKKRYTDNIDGNIIKRHRRVSLTFRR 300 >UniRef50_UPI000179247A PREDICTED: similar to alkB, alkylation repair homolog 8 (E. coli) (alkbh8) n=1 Tax=Acyrthosiphon pisum RepID=UPI000179247A Length = 382 Score = 74.6 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 65/213 (30%), Gaps = 50/213 (23%) Query: 10 PWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGW 69 +G I+ F E + ++ H Sbjct: 65 ETSTSSPSGLEIIDNFITEEEE----------------HFMLQYLKKHWSESSSMKHRQV 108 Query: 70 TTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLS 129 + + Y + + P +P+ F + Y ++ Sbjct: 109 KHYGYEFDYDNNGVRYDSCDP-IPKEFEFILNAI------------------YL---RIP 146 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHG 188 H D I+S+SL I +F ++ + +LL+ +++ GESR + HG Sbjct: 147 SHIDTH-GVFDEYILSLSLNSDIIMEF---RKGNYHNSILLKAKSLLIMSGESRFEWSHG 202 Query: 189 IQPLK-------AGFHPLTIDCRYNLTFRQAGK 214 I P K G + R ++TFR+ + Sbjct: 203 ITPRKFDMINTADGPDIICRGTRISVTFRRVVQ 235 >UniRef50_Q4SEM2 Chromosome 10 SCAF14616, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4SEM2_TETNG Length = 604 Score = 73.8 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 60/213 (28%), Gaps = 64/213 (30%) Query: 24 RFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDP 83 + ++++I ++ + P +T A H + + Y + Sbjct: 133 GISRAELSAVLKEIGEIEMLVMPPRNHMPLSHT---AQKAMKHRRVKHYGFEFRYDNNNV 189 Query: 84 QTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPI 143 +KP P A + H D I Sbjct: 190 DKDKPLP-----------------------------------AGIPPHVDTHSA-FEDAI 213 Query: 144 VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHP---- 198 +S+SL + F + L L+L ++V GESR L+ HGI P K P Sbjct: 214 LSLSLRAQTVMDFRHP--DGSLVALVLPGRSLLVMKGESRYLWTHGITPRKFDVVPSCDS 271 Query: 199 ------------------LTIDCRYNLTFRQAG 213 R + TFR+ Sbjct: 272 QPSAPTSHDSQSQSNLTLSRRATRTSFTFRKIR 304 >UniRef50_Q5CYU2 F27M3_19 plant like RRM plus AlkB domain containing protein n=2 Tax=Cryptosporidium RepID=Q5CYU2_CRYPV Length = 350 Score = 73.8 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 25/169 (14%) Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWP-AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 L GY + + + W +P + L +R + + PD IN Y Sbjct: 152 KLNRKVQHYGYSFDYNNKTISSVWERDIPPILNRLIERMLSLKIITE-VPDQITINEYEV 210 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL------------------K 166 G + H D + I +SLG +F+F L + + Sbjct: 211 GKGIGPHIDSH-HTIGENISVISLGSGILFEFNELSKRKNPDCSSKEGSGSRKYDRISKR 269 Query: 167 RLLLEHGDVVVWGGESRL-FYHGIQPL---KAGFHPLTIDCRYNLTFRQ 211 + + + + E R + HGI+ K R ++T R+ Sbjct: 270 TVYIPENSLYIMKNEIRYAWEHGIKSRMYDKIQGKFQQRKRRVSITIRK 318 >UniRef50_B6EQD5 Putative uncharacterized protein n=1 Tax=Aliivibrio salmonicida LFI1238 RepID=B6EQD5_ALISL Length = 208 Score = 73.8 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 45/129 (34%), Gaps = 11/129 (8%) Query: 91 AMPQSFHNLCQRAATAAGYPDFQPDACLINRYA-PGAKLSLHQDKDEPDLRAP-IVSVSL 148 L + G ++ + L N Y + H D + + P I S SL Sbjct: 86 PFTSQLMVLKNKIEKETG---YKFNCVLANLYRNENDGVGYHADDEAILGKNPAIASYSL 142 Query: 149 GLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNL 207 G F + + L++ +V+ G + + H I K + R NL Sbjct: 143 GETRRFLVKHNQHKYKNISIDLKNNSLVLMDGCLQDHWKHAIPKTKRA-----MSARINL 197 Query: 208 TFRQAGKKE 216 TFR GK + Sbjct: 198 TFRFLGKND 206 >UniRef50_A4RVX0 Predicted protein n=2 Tax=Ostreococcus RepID=A4RVX0_OSTLU Length = 347 Score = 73.5 bits (179), Expect = 5e-12, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 70/203 (34%), Gaps = 26/203 (12%) Query: 17 AGAVILRRFAFNAAEQLIRD-INDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G IL F E+ I D ++D + P+R G + G + Sbjct: 150 PGHYILENFITEDEERRIVDWLDDDIAAGPWRDSSFNGAHQG-------KKYGVEPNLLK 202 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNL--CQRAATAAGYPDFQPDAC-LINRYAP-GAKLSLH 131 P MP+ +L + AA F P+ C IN G+ L+ H Sbjct: 203 RCVEPARV-------PMPKILRDLVVAKFAAAHETLKHFTPNECNAINYRKDLGSVLTPH 255 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQ 190 D + +V++SL + K + L + + G +R Y H I Sbjct: 256 CDDRQLS-SDILVNLSLCSDCTMTYSHEKFASKRVDVRLPRRSLQIQSGSTRYDYMHSIA 314 Query: 191 PLKAGFHPLTIDCRYNLTFRQAG 213 L + R ++TFR++G Sbjct: 315 N-----ENLHGNRRVSVTFRESG 332 >UniRef50_C1MXD4 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MXD4_9CHLO Length = 377 Score = 73.1 bits (178), Expect = 5e-12, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 65/218 (29%), Gaps = 30/218 (13%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 A P G +L F E + D G + + N Sbjct: 165 AATSTSNTPSLPGHHLLLDFITEDEENALVAFLDDGE---------RGIHDWKPSTFNGA 215 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATA-AGYPDFQPDACLINRYAP 124 H G G + P MP + ++ A A F P+ Y Sbjct: 216 HRGKAW---GVRVDLKRRTVSPPTREMPPRLLAVAEKMRGAHALLARFSPNEANAISYDK 272 Query: 125 --GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF--------GGLKRNDPLKRLLLEHGD 174 G +L H D + +V++SL + + GG R + L Sbjct: 273 RLGDRLLSHVDDRQLS-SDVLVNLSLCGECVMTYERTTTRSSGGGTRGSDRVDVRLPRRS 331 Query: 175 VVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQ 211 + + G++R + H I L R ++TFR+ Sbjct: 332 LQIQSGDARYAFAHSIAN-----ENLLDPRRVSITFRE 364 >UniRef50_Q96Q83 Alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 n=17 Tax=Chordata RepID=ALKB3_HUMAN Length = 286 Score = 73.1 bits (178), Expect = 6e-12, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 65/208 (31%), Gaps = 38/208 (18%) Query: 21 ILRRFAF-NAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYS 79 + F A+ ++ + G + W Y YS Sbjct: 92 LYPGFVDVKEADWILEQLCQDV------PWKQRTGIREDITYQQPRLTAW-YGELPYTYS 144 Query: 80 PIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA-PGAKLSLHQDKDEPD 138 I + N W + L R G+ ++ L N Y + H D + Sbjct: 145 RITMEPNPHWHPV---LRTLKNRIEENTGH---TFNSLLCNLYRNEKDSVDWHSDDEPSL 198 Query: 139 LRAPIV-SVSLGLPAIFQFGGLKRNDPL------------KRLLLEHGDVVVWGGESRL- 184 R PI+ S+S G F+ R P ++ L+HG +++ G ++ Sbjct: 199 GRCPIIASLSFGATRTFEM----RKKPPPEENGDYTYVERVKIPLDHGTLLIMEGATQAD 254 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 + H + P R NLTFR Sbjct: 255 WQHRVPKEYHSREP-----RVNLTFRTV 277 >UniRef50_Q4A2D1 Putative uncharacterized protein n=2 Tax=dsDNA viruses, no RNA stage RepID=Q4A2D1_EHV86 Length = 210 Score = 72.7 bits (177), Expect = 7e-12, Method: Composition-based stats. Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 5/105 (4%) Query: 110 PDFQPDACLINRYAPG-AKLSLHQDKDEPDLRA-PIVSVSLGLPAIFQFGGLKRNDPLKR 167 + ++ L+N Y G +SLH D++ +++ PIVS+SLG FQ + + + Sbjct: 103 DKYNYNSVLVNWYMTGNDYVSLHGDRERGLVQSTPIVSISLGGSRTFQVKKNDTKELVYQ 162 Query: 168 LLLEHGDVVVWGGE--SRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 +L GD ++ G + H I + A H + R NLT R Sbjct: 163 EMLNDGDCIIMKGTDFQHKYKHCIPKMIAKKHGTVLP-RINLTIR 206 >UniRef50_D2V317 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V317_NAEGR Length = 236 Score = 71.9 bits (175), Expect = 1e-11, Method: Composition-based stats. Identities = 36/145 (24%), Positives = 59/145 (40%), Gaps = 20/145 (13%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y YS + +++ PW P+ L G + L+N Y G + H DK Sbjct: 102 YSYSGMQ-RSSIPW--FPE-LLKLKTLIQEKTGE---VFNYALVNVYDNGNDYIGWHSDK 154 Query: 135 DEPDLR-APIVSVSLGLPAIFQF----GGLKRNDPLK-RLLLEHGDVVVWGGESRLFY-H 187 + + + IVS++LG FQF G K +D L +G +++ ++ +Y H Sbjct: 155 TKDLVENSSIVSLTLGETRPFQFKPSEGKQKNSDKSIITKYLPNGSMIIMNWNTQFYYKH 214 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQA 212 + K R N+TFR Sbjct: 215 CLPKRKNISK-----QRINITFRHV 234 >UniRef50_Q95XY7 Putative uncharacterized protein n=3 Tax=Caenorhabditis RepID=Q95XY7_CAEEL Length = 248 Score = 71.9 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 45/122 (36%), Gaps = 3/122 (2%) Query: 14 PLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYT-MSVAMTNCGHLGWTTH 72 P G +L + ++ P +T G ++ L WTT Sbjct: 129 PDRIGLYLLPSLLRKEKSTMWMKRAFKYAEPPNITNLTLHGKDVLTDPTLLTKGLRWTTL 188 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQ 132 Y ++ + N +P+ + L + + D +PDA ++N Y P + LS H Sbjct: 189 GVEYDWNSKEYPPNGR--PVPEELYQLGNLISRSLKLGDMKPDATILNYYPPKSALSPHV 246 Query: 133 DK 134 DK Sbjct: 247 DK 248 >UniRef50_A6ESW5 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID=A6ESW5_9BACT Length = 203 Score = 71.5 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 34/140 (24%), Positives = 54/140 (38%), Gaps = 15/140 (10%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y YS I T + + + H++ G + L N Y G H D Sbjct: 75 YSYSGI-VMTPRKFSR---TLHHIKTAIENHTGA---TFNTVLCNLYRDGKDSNGWHSDN 127 Query: 135 DEPDLRAPI-VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHGIQPL 192 ++ PI VS+SLG +F + +L L +G ++ G G + + H + Sbjct: 128 EKELGPDPIIVSISLGETRMFHLKNKQAPTERIKLALTNGSLLYMGKGTQKNYKHQLAKT 187 Query: 193 KAGFHPLTIDCRYNLTFRQA 212 + P R NLTFR+ Sbjct: 188 QKQITP-----RINLTFRRL 202 >UniRef50_C5DVB8 ZYRO0D05434p n=1 Tax=Zygosaccharomyces rouxii RepID=C5DVB8_ZYGRO Length = 428 Score = 71.2 bits (173), Expect = 2e-11, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 48/111 (43%), Gaps = 13/111 (11%) Query: 110 PDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKR- 167 P+++ D L+NRY + L H DK PI+ S+SLG F+ ++N P Sbjct: 233 PNWKGDVVLVNRYYKESNLDWHSDKMTSIGPQPIIASLSLGCSREFRV---RKNYPSNSQ 289 Query: 168 ---LLLEHGDVVVWG----GESRLFYHGIQPLKA-GFHPLTIDCRYNLTFR 210 + H +V+ E R HG HP++ R NLT+R Sbjct: 290 IYIIRPPHNTLVIMHAGFQEEYRHCVHGHSKNSPLKPHPISGHVRINLTYR 340 >UniRef50_C9NTT0 Alkylated DNA repair protein n=5 Tax=cellular organisms RepID=C9NTT0_9VIBR Length = 82 Score = 71.2 bits (173), Expect = 2e-11, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 33/87 (37%), Gaps = 8/87 (9%) Query: 128 LSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLF 185 + HQD + E I S+SLG F L + L HG +++ GE + Sbjct: 1 MGAHQDNEPELGQNPTIASLSLGATRRFTLKHL-HTGQNHDIELSHGSLLIMAGEMQHHW 59 Query: 186 YHGIQPLKAGFHPLTIDCRYNLTFRQA 212 H + I R NLTFR Sbjct: 60 KHSLPKTNQS-----IGERINLTFRTI 81 >UniRef50_B0CZJ7 Predicted protein n=2 Tax=Agaricales RepID=B0CZJ7_LACBS Length = 364 Score = 71.2 bits (173), Expect = 2e-11, Method: Composition-based stats. Identities = 30/126 (23%), Positives = 43/126 (34%), Gaps = 29/126 (23%) Query: 113 QPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF------------GGLK 160 Q ++N Y PG ++ H D IV VS G + +F G Sbjct: 225 QARQAILNLYQPGEGITPHVDLL-GRFGDGIVGVSFGSGCVMRFDKVPSETETRERGAEG 283 Query: 161 RNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHP---------------LTIDCR 204 +D L L V+V E+R + HGI K F + R Sbjct: 284 EDDSRWELYLPERSVIVLSEEARYEWTHGIDERKEDFVSCGNGEKDSALSQGRWIGRGVR 343 Query: 205 YNLTFR 210 ++TFR Sbjct: 344 LSVTFR 349 >UniRef50_C1BKL6 Alkylated repair protein alkB homolog 3 n=5 Tax=Euteleostomi RepID=C1BKL6_OSMMO Length = 304 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 35/152 (23%), Positives = 54/152 (35%), Gaps = 30/152 (19%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA-PGAKLSLHQDK 134 Y Y+ + N W + L Q +A+G ++ L N Y + H D Sbjct: 137 YTYAHSTMEANTQWHPL---LLTLRQAVDSASGS---SFNSLLCNLYRNESDSIGWHSDD 190 Query: 135 DEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLK------------RLLLEHGDVVVWGGE 181 + P I S+SLG +F R P R+ L HG +++ G Sbjct: 191 EASLGIKPTIASLSLGDTRVFSL----RKKPPPEENGDYTYMERLRVPLAHGTLLLMEGA 246 Query: 182 SR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 ++ + H + P R NLTFR Sbjct: 247 TQDDWQHQVAKEYHSRGP-----RINLTFRTI 273 >UniRef50_Q17527 Protein B0564.2, partially confirmed by transcript evidence n=4 Tax=Caenorhabditis RepID=Q17527_CAEEL Length = 231 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 63/226 (27%), Gaps = 47/226 (20%) Query: 13 EPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTH 72 + A + + + E L + + A Q +R +A + G Sbjct: 17 KSAPATMIYIPNWIDEEEENLYKSCIENAPQPKWRV----------LANRRLQNYG---- 62 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQ 132 + P P L + + + + L+N Y G + H Sbjct: 63 ------GVVGKTALIPTDDFPVELKYLMTKINDLGIFKN-PVNHVLVNEYEAGQGIMPHT 115 Query: 133 DKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK--------RLLLEHGDVVVWGGESRL 184 D P + +V+LG + + K +LLE + + ++ Sbjct: 116 DG--PAFHRIVTTVTLGSHCLLDMYDPVDQEIAKSEEERYVGSMLLEPRSLFIMTDDAYT 173 Query: 185 F-YHGIQPLKAG---------------FHPLTIDCRYNLTFRQAGK 214 HGI + L D R ++T R K Sbjct: 174 RMLHGIAERETDLIEPGKVFNCTEELANKRLDRDTRISITVRNVEK 219 >UniRef50_D1HRN8 Whole genome shotgun sequence of line PN40024, scaffold_34.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1HRN8_VITVI Length = 439 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 27/132 (20%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 91 AMPQSFHNLCQRAATAAGYPDF-QPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLG 149 +P F + +R P P++C++N Y G + H D D P +VS Sbjct: 239 PLPPLFKQMIKRMVRWHILPPTCVPNSCIVNIYDEGDCIPPHIDHH--DFLRPFCTVSFL 296 Query: 150 LPAIFQFGGLKRN------DPLKRLLLEHGDVVVWGGE-SRLFYHGIQPLKAGFHPLTID 202 FG + + L G V++ G + + H + + A Sbjct: 297 TECNILFGSSLKILDAGEFSGPVSISLPKGSVLILNGNGADVAKHCVPAVPAK------- 349 Query: 203 CRYNLTFRQAGK 214 R ++TFR+ + Sbjct: 350 -RISITFRKMDE 360 >UniRef50_D2VCK4 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2VCK4_NAEGR Length = 226 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 53/154 (34%), Gaps = 28/154 (18%) Query: 78 YSPIDPQTNKPWP----AMPQSFHNLCQRAATAAG--------------YPDFQPDACLI 119 Y P + P +PQ +LC D + + Sbjct: 45 YGPKHDKKYNIIPDEITPLPQFLKDLCTTILERTATHVPKIDLSQYESYLGDDKFTEIFV 104 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y P L H D + + I +SL + F F + D ++ L + + Sbjct: 105 NEYKPDDSLDQHFDHRKT-YKEIIFGLSLECDSTFTF---TKKDIKHQVKLPARSLYLMT 160 Query: 180 GESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQA 212 G SR + HGI+P + L D R +LTFR Sbjct: 161 GSSRKSFKHGIEP-----NLLEGDRRISLTFRTV 189 >UniRef50_B6JV21 2 OG-Fe(II) oxygenase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JV21_SCHJY Length = 251 Score = 70.4 bits (171), Expect = 4e-11, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 58/170 (34%), Gaps = 14/170 (8%) Query: 48 QMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA 107 ++ P + G L +T RQ +LY P P N A Sbjct: 86 NVLPPSVQDELIQCLPDGILDRSTGRQAHLYRPFAPVLQN-------LMQNFIPSAFKKQ 138 Query: 108 GYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR 167 + +A ++ Y PG + H D P IV SL +F Sbjct: 139 VWEGKDAEAIIMQVYNPGDGIIPHVDL--PMFDDGIVIFSLLSDITMEFTQPSSKRKA-S 195 Query: 168 LLLEHGDVVVWGGESRL-FYHGIQPLK---AGFHPLTIDCRYNLTFRQAG 213 +LLE G + + GE+R + HGI + R ++T R+ G Sbjct: 196 VLLEKGSLTIMEGEARYQWLHGIPFRTGDWTNGTWIPRAQRCSITMRRIG 245 >UniRef50_D2VNR1 Putative uncharacterized protein n=1 Tax=Naegleria gruberi RepID=D2VNR1_NAEGR Length = 259 Score = 70.4 bits (171), Expect = 4e-11, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 55/176 (31%), Gaps = 34/176 (19%) Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ-------PDACLINRYA 123 + Y Y + P +P F +L + G + +IN Y Sbjct: 83 HYGVSYNYGARGVKEALKVPPVPSEFSDLLEEIKNKEGLDSIRNLMEGIDFKQVIINEYK 142 Query: 124 -PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF-------------GGLKRNDPLKR-- 167 +S H D + D I+ +SLG + +F +KR + Sbjct: 143 GAKQGISKHVDHCQ-DFGPLILILSLGDECVMKFHKLEQVKEEDLKKKKVKRTEVSPSEC 201 Query: 168 --LLLEHGDVVVWGGESRLFY-HGIQPL----KAGFHPLTIDC---RYNLTFRQAG 213 + +++ G++R Y H I G L R ++T+R Sbjct: 202 YDRRMPRRSLIILSGDARYQYQHEIPKTMVFKIDGKQFLKRSESYRRVSITYRSLT 257 >UniRef50_Q987X8 Msr6861 protein n=10 Tax=Proteobacteria RepID=Q987X8_RHILO Length = 92 Score = 70.0 bits (170), Expect = 5e-11, Method: Composition-based stats. Identities = 27/100 (27%), Positives = 43/100 (43%), Gaps = 13/100 (13%) Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVV 176 LIN Y PGA + H+DK P + +SL P F+ + + +++E Sbjct: 2 LINEYRPGAGIGWHRDK--PHFED-VAGISLLAPCSFRLRRKSGDRWERRTIVVEPRSAY 58 Query: 177 VWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 + G SR + H I PL + RY++T R + Sbjct: 59 LMTGPSRTEWEHSIPPL--------AEHRYSITLRTLRSQ 90 >UniRef50_Q1G659 Polyprotein n=17 Tax=root RepID=Q1G659_9SECO Length = 2163 Score = 69.6 bits (169), Expect = 6e-11, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 74/209 (35%), Gaps = 40/209 (19%) Query: 16 AAGAVILRRFAFNAAEQL-------IRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLG 68 + +F AE L + ++ + + PF + M++ ++ G Sbjct: 1978 RPSVSLSPKFTTPPAELLKTIVGVELYEVTEAVKKMPFGSVGPSLRGRMALFYSD-GAFD 2036 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKL 128 + + Y T++ WP + A GY ++CL+ +Y GA + Sbjct: 2037 YAHDKYHY--------TSQGWP------REVDDLAKKLGGY-----NSCLVQKYDKGAYI 2077 Query: 129 SLHQDKDEPDLR--APIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LF 185 H D DEP +++V+L A F +R L HG ++ + L Sbjct: 2078 PFHAD-DEPCYDDNDSVITVNLNGRATFIVRNKTTGAETRR-ELHHGSILEMLPSCQKLC 2135 Query: 186 YHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 H + R +LTFR+ + Sbjct: 2136 KHSVNVRD--------QGRVSLTFRRQRR 2156 >UniRef50_C4Y8Y4 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y8Y4_CLAL4 Length = 416 Score = 69.6 bits (169), Expect = 6e-11, Method: Composition-based stats. Identities = 29/110 (26%), Positives = 45/110 (40%), Gaps = 10/110 (9%) Query: 110 PDFQPDACLINRYAP-GAKLSLHQDKDEPDLRAP---IVSVSLGLPAIFQF-GGLKRNDP 164 + + CL+N Y L H D+ P I SVSLG +F+ K+N P Sbjct: 221 EKWAAEYCLVNYYEKLSNNLDWHSDR--LSHIGPHNYIASVSLGCTRLFRLRSNHKKNAP 278 Query: 165 LKRLLLEHGDVVVWG-GESRLFYHGIQPLKAGF--HPLTIDCRYNLTFRQ 211 + + L H +++ G + H + + HP R+ LTFR Sbjct: 279 IFEIPLPHNSLLIMRPGCQEEYKHCVNSMSKSVALHPEVGSTRFGLTFRH 328 >UniRef50_B4JDW7 GH11262 n=4 Tax=Neoptera RepID=B4JDW7_DROGR Length = 221 Score = 69.6 bits (169), Expect = 8e-11, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 67/219 (30%), Gaps = 50/219 (22%) Query: 22 LRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPI 81 + F + EQ I + TP + + G H G Sbjct: 18 IPNFITSDQEQCILSQIE----------RTPKPRWTQLLNRRLINYGGVPHPNG------ 61 Query: 82 DPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRA 141 MP+ + + + + + L+N Y PG + H D L Sbjct: 62 -----MIAEEMPEWLQSYVDKVNNLGVFESQKANHVLVNEYLPGQGILPHTD---GPLFY 113 Query: 142 PIVS-VSLGLPAIFQF-----GGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQPL-- 192 PI+S +S G + +F G + L +LLLE +++ Y H I + Sbjct: 114 PIISTISCGAHTVLEFTKRETTGDAAGEVLFKLLLEPRSLLILKDSLYSDYMHAISEINE 173 Query: 193 -----------------KAGFHPLTIDCRYNLTFRQAGK 214 K G H + R +LT R K Sbjct: 174 DTLCDRICNYNLCENTYKIGDHLVRRAPRISLTIRNVPK 212 >UniRef50_A4I1X6 Putative uncharacterized protein n=3 Tax=Leishmania RepID=A4I1X6_LEIIN Length = 563 Score = 69.6 bits (169), Expect = 8e-11, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 66/225 (29%), Gaps = 34/225 (15%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 P G ++ F E+ I +++ P QM ++ H Sbjct: 332 AVPDVPGLFLVEDFVTADEEKTI--WHELHHGRPRLQMEY-------LSRRRVAHFNRRF 382 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP-----DFQP-----DACLINR 121 G + P+ Q A G F+P D +N Sbjct: 383 L-YGVNALTAEGDVTNARPSFYVWMRARLQNDMAAGGVRIDGDYPFRPGDHECDQLTVNY 441 Query: 122 YA---PGA-KLSLHQDKDEPDLRAPIVSVSLGLPAIF---QFGGLKRNDPLKRLLLEHGD 174 Y GA ++ H D + VSLG + ++ + L Sbjct: 442 YDYSEVGACGIAAHVDAHNA-FDDAVFIVSLGSYTVMEFSRWDAPAEVAAPVGVYLAPRS 500 Query: 175 VVVWGGESRL-FYHGIQPLKAGFHPLTIDC-----RYNLTFRQAG 213 +VV GE+R + H I + + R +LT+R+ Sbjct: 501 LVVIAGEARYGWTHCIAEKRTDTLSELLPTFSRGDRMSLTWRRGR 545 >UniRef50_Q95XY6 Putative uncharacterized protein n=1 Tax=Caenorhabditis elegans RepID=Q95XY6_CAEEL Length = 169 Score = 69.2 bits (168), Expect = 8e-11, Method: Composition-based stats. Identities = 22/65 (33%), Positives = 35/65 (53%), Gaps = 3/65 (4%) Query: 141 APIVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPL 199 AP++S+SLG AI+ G + ++P + L +GD ++ G+ RL YH I + G P Sbjct: 64 APLISMSLGQTAIYLSGTIDLSSEPPIPIWLRNGDFLIMHGDQRLVYHAIPCI--GSIPK 121 Query: 200 TIDCR 204 R Sbjct: 122 RRGNR 126 >UniRef50_Q4D8T3 Putative uncharacterized protein n=2 Tax=Trypanosoma cruzi RepID=Q4D8T3_TRYCR Length = 426 Score = 69.2 bits (168), Expect = 8e-11, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 68/224 (30%), Gaps = 34/224 (15%) Query: 9 EPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLG 68 EP +E G I++ F I + A S + ++A N H Sbjct: 194 EPIEEV--PGLYIVKEF--------ITQMEHDAVWSELKGPKAAALELETLAHRNVAHFN 243 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP-------DACLINR 121 G ++ + PA + D+ D +N Sbjct: 244 RR-FYYGVNRIGVEGDSVNAKPAFYDWMQRRLKNEDPRRKLQDYPSVAQTFSCDQLTVNF 302 Query: 122 Y------APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF---GGLKRNDPLKRLLLEH 172 Y + ++ H D I VSLG + ++ G + + Sbjct: 303 YNYKDGDTAASGIAHHVDSH-ASFMDCIYIVSLGSHTVLEYNRHGVPPDVAETFGVFVAP 361 Query: 173 GDVVVWGGESRL-FYHGIQPLKAG-----FHPLTIDCRYNLTFR 210 +++ GESR + HGI + P+ R +LT+R Sbjct: 362 RSLLLMTGESRYSWTHGIAGKRVDILSDRIPPVWRGDRVSLTWR 405 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P05050 Alpha-ketoglutarate-dependent dioxygenase alkB n... 250 2e-65 UniRef50_Q8Z566 AlkB protein n=6 Tax=Salmonella enterica subsp. ... 239 6e-62 UniRef50_C7JEB8 DNA repair protein for alkylated DNA n=8 Tax=Ace... 218 1e-55 UniRef50_UPI000197C9F9 alpha-ketoglutarate-dependent dioxygenase... 212 5e-54 UniRef50_D0IWA8 2OG-Fe(II) oxygenase n=4 Tax=Proteobacteria RepI... 211 1e-53 UniRef50_A3WP94 Alkylated DNA repair protein n=1 Tax=Idiomarina ... 207 2e-52 UniRef50_Q5QTX8 Alkylated DNA repair protein n=1 Tax=Idiomarina ... 204 1e-51 UniRef50_Q1N5M7 2OG-Fe(II) oxygenase superfamily protein n=1 Tax... 190 2e-47 UniRef50_B8GWW6 Alpha-ketoglutarate-dependent dioxygenase alkB h... 178 1e-43 UniRef50_Q8T9A3 SD10403p n=17 Tax=Coelomata RepID=Q8T9A3_DROME 175 8e-43 UniRef50_D0LXU5 2OG-Fe(II) oxygenase n=1 Tax=Haliangium ochraceu... 174 1e-42 UniRef50_C3YRT0 Putative uncharacterized protein n=1 Tax=Branchi... 174 2e-42 UniRef50_Q28VY2 DNA-N1-methyladenine dioxygenase n=30 Tax=Bacter... 171 1e-41 UniRef50_UPI0000E484FA PREDICTED: hypothetical protein n=1 Tax=S... 170 3e-41 UniRef50_C9CYS1 Putative uncharacterized protein n=2 Tax=Alphapr... 170 3e-41 UniRef50_B7PUG0 Putative uncharacterized protein n=1 Tax=Ixodes ... 170 4e-41 UniRef50_A7HZ41 2OG-Fe(II) oxygenase n=1 Tax=Parvibaculum lavame... 167 2e-40 UniRef50_B3RYI8 Putative uncharacterized protein n=2 Tax=Trichop... 167 2e-40 UniRef50_B0T136 2OG-Fe(II) oxygenase n=1 Tax=Caulobacter sp. K31... 167 2e-40 UniRef50_D2V2M0 Predicted protein n=1 Tax=Naegleria gruberi RepI... 167 3e-40 UniRef50_UPI0000DB70B1 PREDICTED: similar to CG17807-PA n=2 Tax=... 165 9e-40 UniRef50_D2A2C2 Putative uncharacterized protein GLEAN_07671 n=1... 164 2e-39 UniRef50_B1XR40 Oxidoreductase, 2OG-Fe(II) oxygenase family n=4 ... 164 2e-39 UniRef50_UPI000186CFBD conserved hypothetical protein n=1 Tax=Pe... 164 2e-39 UniRef50_C7QZR3 2OG-Fe(II) oxygenase n=29 Tax=Actinomycetales Re... 162 9e-39 UniRef50_D2A2Y9 Putative uncharacterized protein GLEAN_07602 n=1... 159 8e-38 UniRef50_A3VR77 Alkylated DNA repair protein n=1 Tax=Parvularcul... 158 1e-37 UniRef50_B3RXF0 Putative uncharacterized protein (Fragment) n=1 ... 158 1e-37 UniRef50_B9GZQ0 Predicted protein n=13 Tax=Magnoliophyta RepID=B... 158 1e-37 UniRef50_UPI00017B2DD1 UPI00017B2DD1 related cluster n=1 Tax=Tet... 157 2e-37 UniRef50_Q17GQ0 Putative uncharacterized protein (Fragment) n=2 ... 157 2e-37 UniRef50_D2B1L1 Alkylated DNA repair protein n=3 Tax=Actinomycet... 157 2e-37 UniRef50_Q7QEU7 AGAP000155-PA n=1 Tax=Anopheles gambiae RepID=Q7... 157 3e-37 UniRef50_B8HYX6 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC ... 156 4e-37 UniRef50_A8P5P3 Putative uncharacterized protein n=1 Tax=Brugia ... 156 6e-37 UniRef50_Q9SA98 Alkylated DNA repair protein alkB homolog n=4 Ta... 156 8e-37 UniRef50_B6JWW7 AlkB-like protein n=1 Tax=Schizosaccharomyces ja... 154 2e-36 UniRef50_O60066 Alkylated DNA repair protein alkB homolog n=2 Ta... 154 2e-36 UniRef50_C3XQU3 Putative uncharacterized protein n=1 Tax=Branchi... 151 1e-35 UniRef50_C9YWY9 Putative DNA repair protein n=1 Tax=Streptomyces... 151 2e-35 UniRef50_C0WHI3 Alkylated DNA repair protein n=3 Tax=Corynebacte... 149 5e-35 UniRef50_UPI000180CD20 PREDICTED: similar to alkB, alkylation re... 149 6e-35 UniRef50_A9TG90 Predicted protein n=1 Tax=Physcomitrella patens ... 149 8e-35 UniRef50_D0MUQ0 Alkylated DNA repair protein alkB 8 n=1 Tax=Phyt... 149 8e-35 UniRef50_UPI0000E47318 PREDICTED: similar to LOC494680 protein n... 148 1e-34 UniRef50_Q4D9X3 Alkylated DNA repair protein, putative n=3 Tax=T... 148 1e-34 UniRef50_Q7KUZ2 AlkB n=12 Tax=Drosophila RepID=Q7KUZ2_DROME 147 2e-34 UniRef50_C7NLG8 Alkylated DNA repair protein n=2 Tax=Actinomycet... 147 3e-34 UniRef50_Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB n... 146 5e-34 UniRef50_C6W3C2 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID... 146 6e-34 UniRef50_UPI0001983B96 PREDICTED: hypothetical protein n=1 Tax=V... 145 7e-34 UniRef50_UPI000051A07C PREDICTED: similar to AlkB CG33250-PA n=3... 145 1e-33 UniRef50_Q96BT7 Alkylated DNA repair protein alkB homolog 8 n=32... 145 1e-33 UniRef50_Q9LJH2 Similarity to unknown protein n=7 Tax=Embryophyt... 144 1e-33 UniRef50_Q9LZW8 Putative uncharacterized protein T20L15_50 n=3 T... 144 1e-33 UniRef50_D1I753 Whole genome shotgun sequence of line PN40024, s... 144 2e-33 UniRef50_A7SSH3 Predicted protein n=1 Tax=Nematostella vectensis... 144 2e-33 UniRef50_C2BL13 Alkylated DNA repair protein n=2 Tax=Corynebacte... 144 3e-33 UniRef50_B7QP17 Methyltransferase, putative n=1 Tax=Ixodes scapu... 144 3e-33 UniRef50_A7RXQ8 Predicted protein (Fragment) n=1 Tax=Nematostell... 143 3e-33 UniRef50_B8HQU9 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC ... 143 3e-33 UniRef50_D1HAA5 Whole genome shotgun sequence of line PN40024, s... 143 4e-33 UniRef50_Q09BP3 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 143 4e-33 UniRef50_UPI00005257D6 PREDICTED: similar to AlkB CG33250-PA n=1... 142 5e-33 UniRef50_D0NYX8 Alkylated DNA repair protein alkB-like protein n... 142 1e-32 UniRef50_C6TKW1 Putative uncharacterized protein n=1 Tax=Glycine... 142 1e-32 UniRef50_Q13686 Alkylated DNA repair protein alkB homolog 1 n=27... 141 1e-32 UniRef50_D1IRG0 Whole genome shotgun sequence of line PN40024, s... 141 1e-32 UniRef50_C3NYZ8 Alkylated DNA repair protein n=28 Tax=Bacteria R... 141 1e-32 UniRef50_A1SVH4 DNA-N1-methyladenine dioxygenase n=12 Tax=Bacter... 141 2e-32 UniRef50_C1FJB5 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 141 2e-32 UniRef50_Q07GB6 Oxidoreductase, putative n=1 Tax=Roseobacter den... 140 3e-32 UniRef50_C5L3Y2 Putative uncharacterized protein n=1 Tax=Perkins... 140 3e-32 UniRef50_C0PA85 Putative uncharacterized protein n=1 Tax=Zea may... 140 3e-32 UniRef50_A0D9E2 Chromosome undetermined scaffold_42, whole genom... 139 4e-32 UniRef50_Q9U3P9 Protein C14B1.10, partially confirmed by transcr... 139 5e-32 UniRef50_Q7MF65 Alkylated DNA repair protein n=15 Tax=Vibrionace... 137 2e-31 UniRef50_A4CQ67 Alkylated DNA repair protein n=2 Tax=Flavobacter... 137 3e-31 UniRef50_C0Z2F3 AT5G01780 protein n=9 Tax=Magnoliophyta RepID=C0... 136 5e-31 UniRef50_Q7D1B7 Putative uncharacterized protein n=1 Tax=Agrobac... 136 5e-31 UniRef50_Q26EI7 Alkylated DNA repair protein n=1 Tax=Flavobacter... 135 1e-30 UniRef50_C6XT27 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter heparinu... 135 1e-30 UniRef50_UPI00019271E1 PREDICTED: similar to predicted protein n... 135 1e-30 UniRef50_A4S4F4 Predicted protein n=1 Tax=Ostreococcus lucimarin... 135 1e-30 UniRef50_C6X2N0 2OG-Fe(II) oxygenase n=4 Tax=Bacteria RepID=C6X2... 134 3e-30 UniRef50_A4SZF3 DNA-N1-methyladenine dioxygenase n=1 Tax=Polynuc... 133 4e-30 UniRef50_B0SGN3 Alkylated DNA repair protein n=2 Tax=Leptospira ... 133 5e-30 UniRef50_Q12QK9 DNA-N1-methyladenine dioxygenase n=5 Tax=Shewane... 132 7e-30 UniRef50_A0C122 Chromosome undetermined scaffold_140, whole geno... 132 1e-29 UniRef50_Q2BPN4 Putative uncharacterized protein n=1 Tax=Neptuni... 131 1e-29 UniRef50_C5BKX4 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 130 2e-29 UniRef50_A8PV44 ALKBH protein, putative n=1 Tax=Brugia malayi Re... 130 3e-29 UniRef50_B6HH87 Pc20g14010 protein n=10 Tax=Leotiomyceta RepID=B... 129 5e-29 UniRef50_C5KK00 Putative uncharacterized protein n=6 Tax=Perkins... 129 5e-29 UniRef50_A6EJU4 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter sp. BAL3... 129 8e-29 UniRef50_Q5UR03 Uncharacterized protein L905 n=1 Tax=Acanthamoeb... 129 8e-29 UniRef50_UPI00006CCD66 hypothetical protein TTHERM_00483520 n=1 ... 129 9e-29 UniRef50_B4VHI5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 129 9e-29 UniRef50_C1N0U9 Predicted protein n=1 Tax=Micromonas pusilla CCM... 128 1e-28 UniRef50_C7RA32 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis... 128 1e-28 UniRef50_D2V609 Predicted protein n=2 Tax=Naegleria gruberi RepI... 128 1e-28 UniRef50_B8HW11 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=B8H... 127 2e-28 UniRef50_A3M1I5 DNA repair system n=12 Tax=Acinetobacter RepID=A... 127 2e-28 UniRef50_Q22MH4 Putative uncharacterized protein n=1 Tax=Tetrahy... 127 2e-28 UniRef50_B5Y3R7 Predicted protein n=1 Tax=Phaeodactylum tricornu... 127 3e-28 UniRef50_D2V5F7 Predicted protein (Fragment) n=1 Tax=Naegleria g... 127 3e-28 UniRef50_A9EAY0 2OG-Fe(II) oxygenase n=2 Tax=Flavobacteriales Re... 126 5e-28 UniRef50_A1ZYS3 Alkylated DNA repair protein n=1 Tax=Microscilla... 125 7e-28 UniRef50_Q0U5B3 Putative uncharacterized protein n=1 Tax=Phaeosp... 125 8e-28 UniRef50_C8NRG2 DNA repair protein n=9 Tax=Actinomycetales RepID... 125 9e-28 UniRef50_B7RVL5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=2 ... 125 9e-28 UniRef50_A9AQD8 2OG-Fe(II) oxygenase n=43 Tax=Burkholderia RepID... 125 9e-28 UniRef50_B4EMC2 2OG-Fe(II) oxygenase superfamily protein n=14 Ta... 125 1e-27 UniRef50_A5WBM5 DNA-N1-methyladenine dioxygenase n=5 Tax=Moraxel... 125 1e-27 UniRef50_Q3AYK8 DNA-N1-methyladenine dioxygenase n=12 Tax=Cyanob... 124 2e-27 UniRef50_A6EGN8 Alkylated DNA repair protein n=1 Tax=Pedobacter ... 124 2e-27 UniRef50_B8KWS1 2OG-Fe(II) oxygenase superfamily protein n=1 Tax... 124 2e-27 UniRef50_B8ESE9 2OG-Fe(II) oxygenase n=1 Tax=Methylocella silves... 124 2e-27 UniRef50_Q00Z84 2-Oxoglutarate-and iron-dependent dioxygenase-re... 124 2e-27 UniRef50_B0T8K7 2OG-Fe(II) oxygenase n=3 Tax=Alphaproteobacteria... 124 3e-27 UniRef50_B2SPH7 DNA repair system specific for alkylated DNA n=1... 123 4e-27 UniRef50_UPI000186D7D6 conserved hypothetical protein n=1 Tax=Pe... 123 5e-27 UniRef50_D0N998 Putative uncharacterized protein n=1 Tax=Phytoph... 122 6e-27 UniRef50_B5JS77 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacteri... 122 1e-26 UniRef50_B2AC29 Predicted CDS Pa_2_14240 n=1 Tax=Podospora anser... 121 2e-26 UniRef50_UPI00006CC0FF hypothetical protein TTHERM_00219000 n=1 ... 121 2e-26 UniRef50_A4H8G0 Putative uncharacterized protein n=3 Tax=Leishma... 121 2e-26 UniRef50_A4AA20 Putative alkylated DNA repair protein n=1 Tax=Co... 121 2e-26 UniRef50_Q15YR0 DNA-N1-methyladenine dioxygenase n=1 Tax=Pseudoa... 120 3e-26 UniRef50_A7AWB3 Putative uncharacterized protein n=1 Tax=Babesia... 120 3e-26 UniRef50_UPI000180B7B0 PREDICTED: similar to LOC496071 protein n... 120 3e-26 UniRef50_Q6C333 YALI0F03003p n=1 Tax=Yarrowia lipolytica RepID=Q... 120 3e-26 UniRef50_UPI000179247A PREDICTED: similar to alkB, alkylation re... 120 3e-26 UniRef50_B4RZB3 Alkylated DNA repair protein n=4 Tax=Proteobacte... 120 4e-26 UniRef50_Q54BK8 2-oxoglutarate and Fe-dependent oxygenase family... 120 5e-26 UniRef50_B4JDW7 GH11262 n=4 Tax=Neoptera RepID=B4JDW7_DROGR 119 5e-26 UniRef50_Q6ZEA1 Slr7097 protein n=5 Tax=Bacteria RepID=Q6ZEA1_SYNY3 119 5e-26 UniRef50_A3D131 DNA-N1-methyladenine dioxygenase n=15 Tax=Shewan... 119 5e-26 UniRef50_B4RAN1 DNA alkylation damage repair protein AlkB n=1 Ta... 119 6e-26 UniRef50_A0YHA0 Putative uncharacterized protein n=1 Tax=marine ... 119 6e-26 UniRef50_Q17527 Protein B0564.2, partially confirmed by transcri... 118 1e-25 UniRef50_A7SAR6 Predicted protein n=1 Tax=Nematostella vectensis... 118 1e-25 UniRef50_Q4TCN8 Chromosome undetermined SCAF6790, whole genome s... 118 1e-25 UniRef50_Q9VKU5 CG6144, isoform A n=11 Tax=Diptera RepID=Q9VKU5_... 118 2e-25 UniRef50_C4WWX2 ACYPI004109 protein n=2 Tax=Acyrthosiphon pisum ... 117 2e-25 UniRef50_Q609W8 2OG-Fe(II) oxygenase family domain protein n=1 T... 117 3e-25 UniRef50_C8XTB3 Putative uncharacterized protein n=1 Tax=Dunalie... 117 3e-25 UniRef50_Q2MF23 TobX protein n=2 Tax=Actinomycetales RepID=Q2MF2... 117 3e-25 UniRef50_A6ESW5 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID... 117 4e-25 UniRef50_Q4UFZ4 Alkylated DNA repair protein, putative n=2 Tax=T... 116 4e-25 UniRef50_C8VDQ1 DNA repair family protein (AFU_orthologue; AFUA_... 116 4e-25 UniRef50_Q5K7S3 Putative uncharacterized protein n=1 Tax=Filobas... 116 6e-25 UniRef50_A9BA22 Alkylated DNA repair protein n=2 Tax=Prochloroco... 115 9e-25 UniRef50_UPI000175883A PREDICTED: similar to alkB, alkylation re... 115 9e-25 UniRef50_Q2SBS6 Alkylated DNA repair protein n=1 Tax=Hahella che... 115 1e-24 UniRef50_A4C6S7 Putative 2OG-Fe(II) oxygenase superfamily protei... 115 1e-24 UniRef50_A6GHE3 2OG-Fe(II) oxygenase n=1 Tax=Plesiocystis pacifi... 115 1e-24 UniRef50_B6EQD5 Putative uncharacterized protein n=1 Tax=Aliivib... 115 1e-24 UniRef50_A1S8P3 DNA-N1-methyladenine dioxygenase n=1 Tax=Shewane... 115 1e-24 UniRef50_D2XAQ5 Alkylated DNA repair protein n=1 Tax=Marseillevi... 115 1e-24 UniRef50_A4BAI0 Putative uncharacterized protein n=1 Tax=Reineke... 115 1e-24 UniRef50_Q1YTT7 Oxidoreductase, 2OG-Fe(II) oxygenase family prot... 114 2e-24 UniRef50_Q21J14 DNA-N1-methyladenine dioxygenase n=1 Tax=Sacchar... 114 2e-24 UniRef50_A9T8I6 Predicted protein n=1 Tax=Physcomitrella patens ... 114 2e-24 UniRef50_Q3IHQ7 Putative 2OG-Fe(II) oxygenase superfamily protei... 114 2e-24 UniRef50_A2QVX5 Contig An11c0110, complete genome n=18 Tax=Eurot... 114 2e-24 UniRef50_C1EB25 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 113 3e-24 UniRef50_B2APW5 Predicted CDS Pa_4_6060 n=6 Tax=Sordariomycetes ... 113 4e-24 UniRef50_Q3KRA9 Alkylated DNA repair protein alkB homolog 6 n=23... 113 4e-24 UniRef50_C3ZI75 Putative uncharacterized protein n=1 Tax=Branchi... 113 5e-24 UniRef50_C4Q8H0 Expressed protein n=2 Tax=Schistosoma RepID=C4Q8... 113 5e-24 UniRef50_C6Y338 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=C6Y... 112 6e-24 UniRef50_D2VNR1 Putative uncharacterized protein n=1 Tax=Naegler... 112 8e-24 UniRef50_A2R3V2 Similarity to human sequence 203 from patent WO0... 112 8e-24 UniRef50_A4SB59 Predicted protein (Fragment) n=2 Tax=Ostreococcu... 112 8e-24 UniRef50_A7EEP6 Putative uncharacterized protein n=1 Tax=Sclerot... 112 9e-24 UniRef50_UPI0001927839 PREDICTED: similar to predicted protein n... 112 1e-23 UniRef50_B0C4L3 Alkylated DNA repair protein n=3 Tax=Bacteria Re... 112 1e-23 UniRef50_Q96Q83 Alpha-ketoglutarate-dependent dioxygenase alkB h... 112 1e-23 UniRef50_A6VZM6 Putative alkylated DNA repair protein n=1 Tax=Ma... 112 1e-23 UniRef50_Q3M1V0 DNA-N1-methyladenine dioxygenase n=3 Tax=Nostoca... 112 1e-23 UniRef50_C5KBY7 Putative uncharacterized protein n=1 Tax=Perkins... 112 1e-23 UniRef50_Q8K2U2 Alkylated DNA repair protein alkB homolog 6 n=5 ... 111 2e-23 UniRef50_Q80Y20-2 Isoform 2 of Alkylated DNA repair protein alkB... 111 2e-23 UniRef50_B6AFB9 Oxidoreductase, 2og-Fe(II) oxygenase family prot... 111 2e-23 UniRef50_A1ZXT1 Alkylated DNA repair protein n=1 Tax=Microscilla... 111 2e-23 UniRef50_Q7S1J6 Predicted protein n=3 Tax=Sordariales RepID=Q7S1... 110 2e-23 UniRef50_A5GWW3 Alkylated DNA repair protein n=2 Tax=Synechococc... 110 2e-23 UniRef50_D1ITZ2 Whole genome shotgun sequence of line PN40024, s... 110 3e-23 UniRef50_Q2UNX0 Predicted protein n=6 Tax=Trichocomaceae RepID=Q... 110 3e-23 UniRef50_A5FII3 DNA-N1-methyladenine dioxygenase n=2 Tax=Flavoba... 110 4e-23 UniRef50_D2VF66 Predicted protein n=1 Tax=Naegleria gruberi RepI... 110 4e-23 UniRef50_Q9SIE0 Expressed protein n=10 Tax=Magnoliophyta RepID=Q... 110 4e-23 UniRef50_A4I1X6 Putative uncharacterized protein n=3 Tax=Leishma... 110 4e-23 UniRef50_B8CDM4 Predicted protein (Fragment) n=1 Tax=Thalassiosi... 110 5e-23 UniRef50_A1K994 DNA repair system specific for alkylated DNA n=1... 109 5e-23 UniRef50_D1Z416 Whole genome shotgun sequence assembly, scaffold... 109 5e-23 UniRef50_Q6NS38 Alpha-ketoglutarate-dependent dioxygenase alkB h... 109 6e-23 UniRef50_D1HRN8 Whole genome shotgun sequence of line PN40024, s... 109 6e-23 UniRef50_A2Q3T7 2OG-Fe(II) oxygenase n=2 Tax=Medicago truncatula... 109 7e-23 UniRef50_A9G7L2 High confidence in function and specificity n=1 ... 109 8e-23 UniRef50_C5BTC2 Putative alkylated DNA repair protein n=1 Tax=Te... 109 9e-23 UniRef50_C1MXD4 Predicted protein n=1 Tax=Micromonas pusilla CCM... 109 1e-22 UniRef50_D2V4D7 Predicted protein n=1 Tax=Naegleria gruberi RepI... 108 2e-22 UniRef50_Q5CYU2 F27M3_19 plant like RRM plus AlkB domain contain... 108 2e-22 UniRef50_A9TLH2 Predicted protein n=1 Tax=Physcomitrella patens ... 107 2e-22 UniRef50_UPI00017458F4 Alkylated DNA repair protein n=1 Tax=Verr... 107 2e-22 UniRef50_C9SM10 DNA repair family protein n=1 Tax=Verticillium a... 107 3e-22 UniRef50_C7PPT9 2OG-Fe(II) oxygenase n=2 Tax=Bacteroidetes RepID... 107 3e-22 UniRef50_B8J7A0 2OG-Fe(II) oxygenase n=3 Tax=Anaeromyxobacter Re... 107 3e-22 UniRef50_Q5RJC7 At4g36090 n=6 Tax=Magnoliophyta RepID=Q5RJC7_ARATH 107 3e-22 UniRef50_C5CMR5 2OG-Fe(II) oxygenase n=1 Tax=Variovorax paradoxu... 107 3e-22 UniRef50_A8J903 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 107 4e-22 UniRef50_A6DH75 Putative uncharacterized protein n=1 Tax=Lentisp... 106 4e-22 UniRef50_C5KHM6 Putative uncharacterized protein n=1 Tax=Perkins... 106 5e-22 UniRef50_A4RVX0 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 106 5e-22 UniRef50_A6SM11 Putative uncharacterized protein n=1 Tax=Botryot... 106 6e-22 UniRef50_A8MRW3 Uncharacterized protein At2g17970.2 n=14 Tax=ros... 105 8e-22 UniRef50_C1BKL6 Alkylated repair protein alkB homolog 3 n=5 Tax=... 105 8e-22 UniRef50_Q57WP7 Putative uncharacterized protein n=2 Tax=Trypano... 105 8e-22 UniRef50_B8M368 Oxidoreductase, 2OG-Fe(II) oxygenase family, put... 105 8e-22 UniRef50_B8LLT0 Putative uncharacterized protein n=1 Tax=Picea s... 105 9e-22 UniRef50_B2AYU6 Predicted CDS Pa_1_12280 (Fragment) n=5 Tax=Leot... 105 9e-22 UniRef50_B6GZZ6 Pc12g09870 protein n=4 Tax=Eurotiomycetidae RepI... 105 1e-21 UniRef50_Q4D8T3 Putative uncharacterized protein n=2 Tax=Trypano... 105 1e-21 UniRef50_Q6C9X6 YALI0D07546p n=1 Tax=Yarrowia lipolytica RepID=Q... 104 2e-21 UniRef50_C9SQ13 Isochorismatase family protein family n=1 Tax=Ve... 104 2e-21 UniRef50_D2VV84 Predicted protein n=1 Tax=Naegleria gruberi RepI... 104 2e-21 UniRef50_A9TPJ1 Predicted protein (Fragment) n=1 Tax=Physcomitre... 104 2e-21 UniRef50_C7YT81 Putative uncharacterized protein n=1 Tax=Nectria... 104 3e-21 UniRef50_A4VNR2 DNA repair system protein n=11 Tax=Gammaproteoba... 104 3e-21 UniRef50_Q5VPG7 Os06g0138200 protein n=5 Tax=BEP clade RepID=Q5V... 104 3e-21 UniRef50_B8NIA9 Putative uncharacterized protein n=1 Tax=Aspergi... 103 4e-21 UniRef50_B2ADC0 Predicted CDS Pa_4_670 n=1 Tax=Podospora anserin... 103 4e-21 UniRef50_UPI00016C4217 Alkylated DNA repair protein n=1 Tax=Gemm... 103 5e-21 UniRef50_Q9LJH4 Emb|CAB82748.1 n=1 Tax=Arabidopsis thaliana RepI... 102 8e-21 UniRef50_A9UX55 Predicted protein (Fragment) n=1 Tax=Monosiga br... 102 1e-20 UniRef50_D0NI62 Alkylated DNA repair protein alkB n=2 Tax=Phytop... 102 1e-20 UniRef50_Q1ECQ5 At4g02485 n=2 Tax=Arabidopsis thaliana RepID=Q1E... 102 1e-20 UniRef50_A9UYL7 Predicted protein n=1 Tax=Monosiga brevicollis R... 102 1e-20 UniRef50_Q8YKL5 All7279 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 102 1e-20 UniRef50_D2VTV7 Predicted protein n=1 Tax=Naegleria gruberi RepI... 101 1e-20 UniRef50_D2VCK4 Predicted protein n=2 Tax=Naegleria gruberi RepI... 101 2e-20 UniRef50_Q4PIA7 Putative uncharacterized protein n=1 Tax=Ustilag... 101 2e-20 UniRef50_C1FG54 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 101 2e-20 UniRef50_C5FMW8 Isochorismatase family protein n=1 Tax=Microspor... 101 2e-20 UniRef50_B2AVQ7 Predicted CDS Pa_7_2120 n=8 Tax=Leotiomyceta Rep... 100 2e-20 UniRef50_B9SA55 Oxidoreductase, putative n=1 Tax=Ricinus communi... 100 2e-20 UniRef50_B6JV21 2 OG-Fe(II) oxygenase n=1 Tax=Schizosaccharomyce... 100 3e-20 UniRef50_UPI000023F682 hypothetical protein FG09872.1 n=1 Tax=Gi... 100 3e-20 Sequences not found previously or not previously below threshold: >UniRef50_P05050 Alpha-ketoglutarate-dependent dioxygenase alkB n=232 Tax=cellular organisms RepID=ALKB_ECOLI Length = 216 Score = 250 bits (639), Expect = 2e-65, Method: Composition-based stats. Identities = 216/216 (100%), Positives = 216/216 (100%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA Sbjct: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN Sbjct: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG Sbjct: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE Sbjct: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 >UniRef50_Q8Z566 AlkB protein n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=Q8Z566_SALTI Length = 216 Score = 239 bits (609), Expect = 6e-62, Method: Composition-based stats. Identities = 172/216 (79%), Positives = 192/216 (88%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 MLDLFAD PWQEPLA GAV+LRRFAF AA+ L+ DI VASQSPFRQMVTPGGYTMSVA Sbjct: 1 MLDLFADEAPWQEPLAPGAVVLRRFAFRAAQSLLDDIGFVASQSPFRQMVTPGGYTMSVA 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 MTNCG LGWTT GY Y+ DP T+KPWPA+P SF ++C++AA AAGY FQPDACLIN Sbjct: 61 MTNCGALGWTTDGHGYCYAVRDPLTDKPWPALPLSFASVCRQAAIAAGYASFQPDACLIN 120 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 RYAPGAKLSLHQDKDEPDLRAPIVSVSLG+PA+FQFGGL+R+DPL+R+LLEHGD+VVWGG Sbjct: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGVPAVFQFGGLRRSDPLQRILLEHGDIVVWGG 180 Query: 181 ESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 ESRLFYHGIQPLKAGFHP+T + RYNLTFRQA +KE Sbjct: 181 ESRLFYHGIQPLKAGFHPMTGEFRYNLTFRQAAEKE 216 >UniRef50_C7JEB8 DNA repair protein for alkylated DNA n=8 Tax=Acetobacter pasteurianus RepID=C7JEB8_ACEP3 Length = 222 Score = 218 bits (555), Expect = 1e-55, Method: Composition-based stats. Identities = 106/209 (50%), Positives = 141/209 (67%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 L D P L AGAV+L FA + AE + I+ +A Q+PFR+M TPGG MSVAMT Sbjct: 12 LLPDTRPDYVQLDAGAVLLPGFALHDAEACMLAIHHIAQQAPFRKMHTPGGGQMSVAMTC 71 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 CG GW + QGY Y+ ++P T +PWP MP F L +AA AG+ FQP+ACLIN Y+ Sbjct: 72 CGTFGWISTAQGYSYTKVNPFTGQPWPDMPAIFQALAHKAAQKAGFAQFQPNACLINSYS 131 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 PGA++ LHQD+DE P+VS+S GL A F +GGLKR+DP +++LL+ GDV+VWGG R Sbjct: 132 PGARMGLHQDRDEGCTDQPVVSLSFGLEATFLWGGLKRSDPTRQILLKDGDVLVWGGPDR 191 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 L +HG++P+ +G H T + R N+TFR Sbjct: 192 LRFHGVKPIHSGAHIRTGETRLNITFRFV 220 >UniRef50_UPI000197C9F9 alpha-ketoglutarate-dependent dioxygenase alkB n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C9F9 Length = 214 Score = 212 bits (540), Expect = 5e-54, Method: Composition-based stats. Identities = 112/208 (53%), Positives = 140/208 (67%), Gaps = 1/208 (0%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 LF D + + +A A +L+ F ++ L++ +++V + +P R M TP GY MS AMTN Sbjct: 5 LFPDEDNIVQ-IAPEAFLLKGFLLGQSDALLQSLSNVITANPLRHMATPNGYQMSAAMTN 63 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 CG GW T ++GY YS DP TN+PW MP SF L AA+ AG+ F PDACLINRYA Sbjct: 64 CGDWGWVTDKKGYRYSQRDPVTNQPWQPMPISFVQLATSAASTAGFEHFIPDACLINRYA 123 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 GA +SLHQDKDE D PIVS SLGLP IF FGG R+ P + LEHGDV+VWGG SR Sbjct: 124 VGAAMSLHQDKDEADFTHPIVSFSLGLPTIFDFGGATRDAPKIAVYLEHGDVLVWGGRSR 183 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 L YHG++ +K+G HPL RYNLTFR+ Sbjct: 184 LNYHGVRRIKSGVHPLLGPYRYNLTFRR 211 >UniRef50_D0IWA8 2OG-Fe(II) oxygenase n=4 Tax=Proteobacteria RepID=D0IWA8_COMTE Length = 224 Score = 211 bits (537), Expect = 1e-53, Method: Composition-based stats. Identities = 113/214 (52%), Positives = 140/214 (65%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L LF E + GAV+LR FA ++ + ++ + + + FR M PGG MSVA+ Sbjct: 3 LSLFPADSLPAEIIDDGAVLLRGFAAAEEQRWVAEVTALQTGAAFRTMQVPGGKFMSVAI 62 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 TN G GW + QGY YS +DPQT KPWPA+P RAA AGYP F PDACLINR Sbjct: 63 TNAGGWGWISDLQGYRYSAVDPQTGKPWPAIPAFLGEQAARAAALAGYPGFAPDACLINR 122 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PGA++ LH+D+DE D APIVSVSLGLP F +GGL R P +RL L HGDV+VWGG Sbjct: 123 YQPGARMGLHRDQDEHDFAAPIVSVSLGLPCRFLWGGLTRQSPTRRLALTHGDVLVWGGP 182 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 SRL +HG+ PL+ G HPL + R+NLTFR A + Sbjct: 183 SRLVFHGVAPLREGQHPLLGNERWNLTFRMAKAR 216 >UniRef50_A3WP94 Alkylated DNA repair protein n=1 Tax=Idiomarina baltica OS145 RepID=A3WP94_9GAMM Length = 209 Score = 207 bits (526), Expect = 2e-52, Method: Composition-based stats. Identities = 99/210 (47%), Positives = 124/210 (59%), Gaps = 2/210 (0%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 + L + P A GA +L A A ++ I + Q+P R +TPG MSV Sbjct: 1 MSLLDASGPI--EFAPGAWLLPNHASEQAADILAAIRECVRQAPLRHFMTPGNKPMSVLS 58 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 +NCG GW + +GY Y DP+++KPWP +P N + A AGYP+F P+ACLIN Sbjct: 59 SNCGDFGWVSDSKGYRYQATDPKSDKPWPDIPSILLNDATQVAEQAGYPEFLPNACLINV 118 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PGAK+ LHQD+DE DL P+VS S GLPA F + G R +RL L HGDV+VWGG Sbjct: 119 YKPGAKMGLHQDRDESDLNEPVVSYSFGLPARFIWAGQTRTGTKQRLPLNHGDVLVWGGP 178 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 SRL YHGI L G HPLT R NLT R+ Sbjct: 179 SRLNYHGIDKLVEGTHPLTQQTRVNLTLRK 208 >UniRef50_Q5QTX8 Alkylated DNA repair protein n=1 Tax=Idiomarina loihiensis RepID=Q5QTX8_IDILO Length = 217 Score = 204 bits (520), Expect = 1e-51, Method: Composition-based stats. Identities = 105/214 (49%), Positives = 134/214 (62%), Gaps = 2/214 (0%) Query: 2 LDLFA--DAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSV 59 LDLFA +EP ++ A + +F E L+ DI V QSP R + TP G+ MSV Sbjct: 4 LDLFANDGSEPLSTEISEQATLFHQFLLADDEALLNDIRGVLKQSPLRHLATPAGHKMSV 63 Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 ++CG GW + + GY Y IDP T +PWP +PQS + + AG+ +FQPD+CLI Sbjct: 64 KSSSCGSYGWLSDKHGYRYQNIDPVTGQPWPDIPQSILVKATQVSRLAGFQNFQPDSCLI 123 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y PGAK+ LHQDK+E D PIVS S GLP F +GG KR+D ++ L+H D +VWG Sbjct: 124 NVYTPGAKMGLHQDKNEADFSKPIVSFSFGLPITFMWGGFKRSDKYQKFSLQHADALVWG 183 Query: 180 GESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 G+ RL YHG+Q LK HPLT CR NLT RQAG Sbjct: 184 GKDRLRYHGVQQLKEAMHPLTGRCRVNLTIRQAG 217 >UniRef50_Q1N5M7 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=Bermanella marisrubri RepID=Q1N5M7_9GAMM Length = 212 Score = 190 bits (483), Expect = 2e-47, Method: Composition-based stats. Identities = 92/213 (43%), Positives = 118/213 (55%), Gaps = 6/213 (2%) Query: 1 MLDLFADAEPWQEPLAAG--AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMS 58 M DLFA E E L G +R E + I++VA Q+PFR M+TP G+ M Sbjct: 1 MSDLFASNE--VEILDHGQGLFQIRNLVNT--EATMAAIHEVAKQAPFRHMMTPMGHHMK 56 Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 VA TNCG GW GY YS DP++ + WPAMP + + P + PDACL Sbjct: 57 VATTNCGEYGWIAQPSGYGYSRNDPESGQSWPAMPDTIRTISDDVIAHLNLPKYSPDACL 116 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 INRY G + HQDKDE + PI+SVSLGLPAIFQ G KR + GDV + Sbjct: 117 INRYDIGTSMGRHQDKDEANFDYPIISVSLGLPAIFQVVGPKRQGKATYYSVSDGDVFIL 176 Query: 179 GGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 G++RL+YHG+ +KA + + RYNLT R+ Sbjct: 177 SGQARLYYHGVNTVKANPNQPELQQRYNLTLRR 209 >UniRef50_B8GWW6 Alpha-ketoglutarate-dependent dioxygenase alkB homolog n=43 Tax=Alphaproteobacteria RepID=ALKB_CAUCN Length = 220 Score = 178 bits (451), Expect = 1e-43, Method: Composition-based stats. Identities = 79/201 (39%), Positives = 109/201 (54%), Gaps = 6/201 (2%) Query: 15 LAAGAVILRRFAF-NAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHR 73 + G + +A L+ + A Q+PF T G MSVAMT G LGWT+ Sbjct: 22 VVPGFDVWPGLLDISAQRALVEAVLAGAEQAPFSNYRTAYGKPMSVAMTALGSLGWTSDA 81 Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQD 133 +GY Y P+T +PWP MP + +L G P+ PD+CL+N Y GA++ LHQD Sbjct: 82 RGYRYVDRHPETGRPWPDMPPALLDLWT----VLGDPETPPDSCLVNLYRDGARMGLHQD 137 Query: 134 KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLK 193 +DE D R P++S+SLG A+F+ GG+ R DP + L L GDV G +RL +HG+ + Sbjct: 138 RDEADPRFPVLSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRIL 197 Query: 194 AGFHPL-TIDCRYNLTFRQAG 213 G L R NLT R+A Sbjct: 198 PGSSSLVPGGGRINLTLRRAR 218 >UniRef50_Q8T9A3 SD10403p n=17 Tax=Coelomata RepID=Q8T9A3_DROME Length = 615 Score = 175 bits (444), Expect = 8e-43, Method: Composition-based stats. Identities = 53/224 (23%), Positives = 79/224 (35%), Gaps = 28/224 (12%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L A W +PL G I+ F E + T S+ Sbjct: 120 LPALAGKSEWNKPLPRGLHIIADFVTEEEESTLLRAIGE--------DGRTSEGTGSLKH 171 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDF-QPDACLI 119 N H +LY + +KP +P + L R + A D+ PD + Sbjct: 172 RNVKHF-----GFEFLYGTNNVDPSKPLEQSIPSACDILWPRLNSFASTWDWSSPDQLTV 226 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y PG + H D PI+S+SL + F R D ++ L +++ Sbjct: 227 NEYEPGHGIPPHVDTHSA-FLDPILSLSLQSDVVMDFR---RGDDQVQVRLPRRSLLIMS 282 Query: 180 GESRL-FYHGIQPLKAGFHP--------LTIDCRYNLTFRQAGK 214 GE+R + HGI+P P R +LTFR+ K Sbjct: 283 GEARYDWTHGIRPKHIDVVPSASGGLTTQARGKRTSLTFRRLRK 326 >UniRef50_D0LXU5 2OG-Fe(II) oxygenase n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LXU5_HALO1 Length = 219 Score = 174 bits (442), Expect = 1e-42, Method: Composition-based stats. Identities = 83/213 (38%), Positives = 119/213 (55%), Gaps = 10/213 (4%) Query: 3 DLFADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVA 60 +LF + P PL G I+ +A L+ + V +++P +R + G +SV Sbjct: 7 ELFPEQAP---PLPEGFLHIVAALDLDAQGALLEQVRAVLAEAPAYRPSMPRTGAPLSVR 63 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 M+NCG LGW + R GY Y P+ P T + WPA+P R A +P+ACL+N Sbjct: 64 MSNCGTLGWISDRAGYRYEPLHPHTARRWPAIPPLAMAQWNRFADW----PVRPEACLVN 119 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y G++L +H D+DE AP+VS+SLG A+++ GG RN P +RLLL GDVVV GG Sbjct: 120 LYQTGSRLGMHVDQDERAADAPVVSISLGCDAVYRLGGHTRNLPSQRLLLRSGDVVVLGG 179 Query: 181 ESRLFYHGIQPLKAGFHP-LTIDCRYNLTFRQA 212 +R YHG+ + AG P ++ R NLT R+ Sbjct: 180 AARRCYHGVDRIVAGTSPLPELEARINLTLRRV 212 >UniRef50_C3YRT0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YRT0_BRAFL Length = 641 Score = 174 bits (441), Expect = 2e-42, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 74/224 (33%), Gaps = 26/224 (11%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 +D L G ++ F A + + +++ A Sbjct: 124 IDEVPSQRATGLDLPPGLRLVEDFVSPACADRLLEGLGWSNEQ----------QQHMDAE 173 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLIN 120 H + Y + +KP P +P + R + G+ +PD +N Sbjct: 174 QALKHRRVKHFGYEFRYDNNNVDKDKPLPGGLPDWCSQVIDRMMS-GGHIKHRPDQITVN 232 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 +Y PG + H D I S+SLG + F ++L ++V G Sbjct: 233 QYQPGQGIPPHVDTHSA-FEDEISSLSLGGQTVMDFKHPS--GKRVAVVLPARSLLVMSG 289 Query: 181 ESRL-FYHGIQPLKAGFHPLT----------IDCRYNLTFRQAG 213 E+R + HGI P K P+ + R + TFR+ Sbjct: 290 EARYLWTHGIIPRKMDPVPVKGQEDSITLARREVRTSFTFRKIR 333 >UniRef50_Q28VY2 DNA-N1-methyladenine dioxygenase n=30 Tax=Bacteria RepID=Q28VY2_JANSC Length = 216 Score = 171 bits (434), Expect = 1e-41, Method: Composition-based stats. Identities = 73/197 (37%), Positives = 101/197 (51%), Gaps = 7/197 (3%) Query: 18 GAVILRRFAF-NAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G V+ +L+ + V +P + T G MSV MT+ G GW + R+GY Sbjct: 24 GIVVHPEHLDGPDQAELVEQVRRVVRSAPLYRPETRTGRKMSVRMTSAGTYGWISDRRGY 83 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 Y P + WP +P + + + A P +CLIN Y GAK+ +HQD+DE Sbjct: 84 RYDRCHPD-GQDWPPIPPMALEIWRAVSGVAQD----PQSCLINYYDAGAKMGMHQDRDE 138 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGF 196 D P+VSVSLG A+F+ GG KR + + L+ GDV V GGE+RL +HGI ++AG Sbjct: 139 GDFDMPVVSVSLGDEALFRVGGPKRGGKTQSVWLKSGDVAVMGGEARLNFHGIDRIRAGS 198 Query: 197 HP-LTIDCRYNLTFRQA 212 L R NLT R Sbjct: 199 STLLPNGGRINLTMRVV 215 >UniRef50_UPI0000E484FA PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E484FA Length = 424 Score = 170 bits (431), Expect = 3e-41, Method: Composition-based stats. Identities = 56/225 (24%), Positives = 85/225 (37%), Gaps = 33/225 (14%) Query: 5 FADAEPWQEP----LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 F ++ P +P G VI+ F EQ I D + AS S +A Sbjct: 126 FVESVPTDKPQSNVPPPGLVIIPDFIDECLEQKIIDSIEWASPSE-------------IA 172 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLI 119 + H H + YS + +KP P MP+ + + R G+ F+PD I Sbjct: 173 NQSLKHRKVKHHGYEFNYSSNNIDRDKPLPGGMPELYGQVINRIME-TGHVQFKPDQLTI 231 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N+Y PG + H D I+S+SL + +F + ++L ++V Sbjct: 232 NQYQPGQGIPPHVDTHSA-FEDAIISLSLESQIVMEFTHPAGHQVP--VVLPRRSLLVMT 288 Query: 180 GESRL-FYHGIQPLKAGFHPLT----------IDCRYNLTFRQAG 213 GE+R + HGI P K P R + TFR Sbjct: 289 GEARYKWTHGITPKKTDVIPDPTFPDNLTLHQRGQRTSFTFRAVR 333 >UniRef50_C9CYS1 Putative uncharacterized protein n=2 Tax=Alphaproteobacteria RepID=C9CYS1_9RHOB Length = 200 Score = 170 bits (430), Expect = 3e-41, Method: Composition-based stats. Identities = 71/197 (36%), Positives = 98/197 (49%), Gaps = 7/197 (3%) Query: 18 GAVILRRFAF-NAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G I + F A +LI+ + V +P PGG MSV MT+ G GW + + GY Sbjct: 8 GFEIHKGFLAAEAQRELIQALRPVLRAAPLFSPEVPGGGQMSVRMTSAGAFGWFSDKSGY 67 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 Y+ P + + WP +P + D PD CL N Y GA++ LHQDKDE Sbjct: 68 RYADRHP-SGQAWPEIPAEVLKIWTALID----RDRMPDCCLFNYYGEGARMGLHQDKDE 122 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLK-AG 195 D P+VS+SLG + + GG R + + + L GDVVV GG++RL YHG+ ++ Sbjct: 123 ADFSYPVVSISLGDDGLLRVGGTSRKEKTESIWLNSGDVVVMGGDARLAYHGVDRIRFRS 182 Query: 196 FHPLTIDCRYNLTFRQA 212 L R NLT R Sbjct: 183 SRLLPKGGRVNLTLRVV 199 >UniRef50_B7PUG0 Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7PUG0_IXOSC Length = 287 Score = 170 bits (430), Expect = 4e-41, Method: Composition-based stats. Identities = 60/229 (26%), Positives = 86/229 (37%), Gaps = 19/229 (8%) Query: 2 LDLFADAEPWQEPL--AAGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMS 58 L L A+ P L A G ++L F +R + ++ P V + Sbjct: 56 LGLCAERLPAAYELVDAPGLLLLPNPFTAEGQRLWVRRCLEEYTRPPHVTNVKAPATVLR 115 Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 G L W T + + + P L A AG+ FQP+A + Sbjct: 116 AGDPFYGALRWATVGLHHDWDTKVYDKTRR-SPFPDCLRELATGLARLAGFSAFQPEAAI 174 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 +N YA + L H D E L AP+VS S G A+F GG R + LLL GDV+V Sbjct: 175 VNYYAMDSALGGHVDNSELALDAPVVSASFGQTAVFLVGGATRERRPRALLLRSGDVLVM 234 Query: 179 GGESRLFYHGIQPLKA--GFHPLTID-------------CRYNLTFRQA 212 G +RL YH + + P R +++ RQ Sbjct: 235 SGPARLAYHAVPRVLPAGDDRPWAGGDAQWAPLEAYLDTHRISISVRQV 283 >UniRef50_A7HZ41 2OG-Fe(II) oxygenase n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HZ41_PARL1 Length = 221 Score = 167 bits (424), Expect = 2e-40, Method: Composition-based stats. Identities = 70/215 (32%), Positives = 107/215 (49%), Gaps = 11/215 (5%) Query: 7 DAEPWQEPLA---AGAVILRRFAFNAAEQLIRDINDVA--SQSPFRQMVTPGGYTMSVAM 61 ++ +P+ G + F A++ + + + P+R + G S+ Sbjct: 9 ESPLASKPVETGFDGIGLYPGFFGETAQRALVERLQAGFGAAPPYRPRMPRTGRPWSILQ 68 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 TN G LGW + GY YSP++ + PWPA+P + L A P+ CL+N Sbjct: 69 TNFGQLGWVSRPGGYAYSPVNDVSKAPWPAIPAALLALWDDLAA----YPAPPECCLVNL 124 Query: 122 YA-PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y P +++ LH+D+DE L AP++S+SLG IF+ GG R D K L GDV+V GG Sbjct: 125 YDAPKSRMGLHRDEDEEALDAPVLSLSLGDTCIFRVGGFARGDKSKSFRLASGDVLVLGG 184 Query: 181 ESRLFYHGIQPLKAG-FHPLTIDCRYNLTFRQAGK 214 SRL YHG+ + +G + R NLT R+ + Sbjct: 185 ASRLRYHGVDRVISGSSRLIPGGGRINLTLRRVTR 219 >UniRef50_B3RYI8 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3RYI8_TRIAD Length = 653 Score = 167 bits (424), Expect = 2e-40, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 67/216 (31%), Gaps = 22/216 (10%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVA----SQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 G ++++ + E + S H Sbjct: 150 PEGLLLIQNYVSEQEEDELLQSIGWYTNHGQASHDTHPCQTVQSDRESMQRRLKHRHVKH 209 Query: 72 HRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLS 129 + + Y +KP A +P +CQR GY QPD +N Y PG + Sbjct: 210 YGYEFRYDTNTVDKDKPLHATIPSKCRYICQRMTDD-GYIQHQPDQLTVNEYMPGQAGIP 268 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHG 188 H D + IVS+SL + F + + L ++V GE R + HG Sbjct: 269 PHIDTHSA-FQDQIVSLSLLSQIVMDFRHP--DGTRISINLPRRSLLVMSGECRYLWSHG 325 Query: 189 IQPLKAGFH-----------PLTIDCRYNLTFRQAG 213 I P K L R + TFR+ Sbjct: 326 ITPRKYDVVCDDNDNNSNITLLERSRRVSFTFRKIR 361 >UniRef50_B0T136 2OG-Fe(II) oxygenase n=1 Tax=Caulobacter sp. K31 RepID=B0T136_CAUSK Length = 212 Score = 167 bits (424), Expect = 2e-40, Method: Composition-based stats. Identities = 73/197 (37%), Positives = 101/197 (51%), Gaps = 6/197 (3%) Query: 17 AGAVILRRFAFNAAEQLIRDI-NDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G + + A+ + + +P T G MSVAM++ G LGWT+ + G Sbjct: 9 PGFDLWPQLLDPGAQADLARLTLAALEVAPPAHYETAYGKAMSVAMSSFGPLGWTSDKTG 68 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y Y+ P T PWPAMPQ+ +L G P PDA LIN Y A++ LHQD+D Sbjct: 69 YRYTGRHPGTGAPWPAMPQALLDLW----ADLGDPQTPPDAALINLYRGEARMGLHQDRD 124 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAG 195 E D R P++S+SLG A+F+ GG R P + L L GDV G +RL +HG+ + G Sbjct: 125 EADPRFPVLSISLGDTAVFRIGGTSRKGPTRSLKLSSGDVCRLSGPARLAFHGVDRILPG 184 Query: 196 FHPLT-IDCRYNLTFRQ 211 L R N+T R+ Sbjct: 185 SSSLVAGGGRINITLRR 201 >UniRef50_D2V2M0 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V2M0_NAEGR Length = 314 Score = 167 bits (422), Expect = 3e-40, Method: Composition-based stats. Identities = 48/227 (21%), Positives = 79/227 (34%), Gaps = 32/227 (14%) Query: 17 AGAVILRRFAFNA-AEQLIRDINDVASQSPFRQMVTPGGYT-----------MSVAMTNC 64 G +++ I+ + P T A Sbjct: 69 PGFYVIKDLMTPEKQIYWIKQALETYPNPPNITNHTMKNGEILDIFKRSVEGDESAKNYL 128 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 L W T Y ++ +K + P N C A Y ++P+A ++N Y+ Sbjct: 129 KKLAWCTLGYQYEWTTRKYHKDK-FVQFPHDIGNFCDLIACQCNYGPYKPEAAIVNFYSK 187 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 + H D E ++ PI+S+S+G +IF GG R+ K + LE GD ++ GG +R Sbjct: 188 DRLMGGHVDDAEYEMTKPIISLSIGSKSIFLLGGETRDTEPKAIFLESGDCMIMGGRARY 247 Query: 185 FYHGIQPLKAGFHPLTIDC-------------------RYNLTFRQA 212 +HGI + P + R N+ RQ Sbjct: 248 CFHGIARILKDTIPEYLQTKFVDPKYKIYAEYMERDMMRININARQV 294 >UniRef50_UPI0000DB70B1 PREDICTED: similar to CG17807-PA n=2 Tax=Apocrita RepID=UPI0000DB70B1 Length = 558 Score = 165 bits (418), Expect = 9e-40, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 75/223 (33%), Gaps = 31/223 (13%) Query: 4 LFADAE--PWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 +F D W L +G ++ F E+++ ++ Sbjct: 85 IFPDLNYCEWSLNLPSGIKLIEDFITEEEEKMLLSTITWNNE----------------ES 128 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 ++ H + Y +KP +P++ + Q ++ D IN Sbjct: 129 SDLKHRKVKHFGYEFQYDTNKVDLDKPIVPIPKN-YQFLQVLFKQYHNVSYEYDQLTINH 187 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PG + H D I+S+SLG I F + + L L +++ GE Sbjct: 188 YLPGQGIPPHIDTHS-VFEDSILSLSLGSACIMNFK---KENKKASLFLPPRSLLIMSGE 243 Query: 182 SRL-FYHGIQPLKAGFHP-------LTIDCRYNLTFRQAGKKE 216 +R + HGI P + R + TFR+ + + Sbjct: 244 ARYAWSHGICPRHNDIVQTSNGITTQSRGTRVSFTFRKVHRGD 286 >UniRef50_D2A2C2 Putative uncharacterized protein GLEAN_07671 n=1 Tax=Tribolium castaneum RepID=D2A2C2_TRICA Length = 582 Score = 164 bits (416), Expect = 2e-39, Method: Composition-based stats. Identities = 47/209 (22%), Positives = 70/209 (33%), Gaps = 37/209 (17%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 L G I+ F E + + + H + Sbjct: 125 LPPGLRIITNFVSEEEEARLLALCQFEDGG------------------SMKHRLVKHYGY 166 Query: 75 GYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQD 133 + Y + KP +PQ L +R +F+P+ INRY PG + H D Sbjct: 167 EFRYDINNVDKEKPLSEGIPQECDFLWRRLPF-----EFRPNQLTINRYNPGQGIPSHVD 221 Query: 134 KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPL 192 PI+S+SL + +F D +LL ++V GESR + HGI P Sbjct: 222 THSA-FGDPILSLSLSSDVVMEFK----KDETICVLLPRRSLLVMAGESRYEWTHGIVPR 276 Query: 193 KAG-------FHPLTIDCRYNLTFRQAGK 214 H R + TFR+ K Sbjct: 277 TFDFYNDEGGCHCFKRGVRVSFTFRKIRK 305 >UniRef50_B1XR40 Oxidoreductase, 2OG-Fe(II) oxygenase family n=4 Tax=Bacteria RepID=B1XR40_SYNP2 Length = 204 Score = 164 bits (416), Expect = 2e-39, Method: Composition-based stats. Identities = 48/215 (22%), Positives = 81/215 (37%), Gaps = 26/215 (12%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L+LFA + EP G + F EQ + ++ D + Sbjct: 10 LELFA-SVTINEPQIPGLQYIEEFIDKQTEQELLNLID---------------QQQWLMD 53 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 + Y Y + +P + ++ + PD ++N Sbjct: 54 L---KRRVQHYGYKYDYRTKKIDYSMYLGILPDWLFPIIEQMVS-LNLISELPDQAIVNE 109 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PG ++ H D +P I+S+SL P I F + N+ +L L+ +V+ GE Sbjct: 110 YLPGQGITSHVDC-KPCFTDTIISLSLNAPCIMNFDSIVNNERQSKL-LKPRSLVILQGE 167 Query: 182 SRL-FYHGIQPLKAG---FHPLTIDCRYNLTFRQA 212 SR + HGI P K+ + D R ++TFR+ Sbjct: 168 SRYLWKHGIPPRKSDQWNGQKIMRDRRISITFRKV 202 >UniRef50_UPI000186CFBD conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186CFBD Length = 602 Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats. Identities = 49/226 (21%), Positives = 76/226 (33%), Gaps = 31/226 (13%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 + L A+ +Q+ G V+L F E I + Sbjct: 113 ISLLNQAKIFQK--PPGLVLLEDFISEEEETEILKLLKFNDSGEEYSSE----------- 159 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT----AAGYPDFQPDAC 117 H + + Y + N+P +P + L R DF PD Sbjct: 160 --LKHRKVKHYGYEFKYGSNNVNLNEPIKKIPSKLNYLWDRLKKYSDNFESDFDFTPDQL 217 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 +N Y PG + H D I+S+SL + +F D +LL + + Sbjct: 218 TVNCYEPGQGIPPHVDTHSA-FEDGILSLSLESSVVMEFKN--DKDLTFSVLLPRRSLCL 274 Query: 178 WGGESRL-FYHGIQPLKAGFHPLT--------IDCRYNLTFRQAGK 214 GESR + HGI P K+ P + R +LTFR+ + Sbjct: 275 MLGESRYNWVHGITPRKSDLIPNKDGSLTVQNRERRTSLTFRKTRR 320 >UniRef50_C7QZR3 2OG-Fe(II) oxygenase n=29 Tax=Actinomycetales RepID=C7QZR3_JONDD Length = 231 Score = 162 bits (409), Expect = 9e-39, Method: Composition-based stats. Identities = 74/222 (33%), Positives = 105/222 (47%), Gaps = 17/222 (7%) Query: 4 LFADA--EPWQEPLAAGAVILRRFAFNAAEQLI-RDINDVASQSPFRQMVTPGGYTMSVA 60 LF+DA +E +A GAV L + + + R A+ + T G+ MSV Sbjct: 2 LFSDADVPRVREEIAPGAVWLPGWLTIPQQAWLARQCAQWAAGPVPIRSATVRGHPMSVK 61 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRA-ATAAGYPD----FQPD 115 GW Y +D + P+ L +R A A G D + PD Sbjct: 62 TVCV---GWHWRPYAYSRDAVDVN-GQRVVEFPKWMVRLGRRIVADATGDEDRALAYTPD 117 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKRLLLEHGD 174 LIN Y A++ +HQDKDE L AP+VS+S+G F+FG + RN P + + L GD Sbjct: 118 TALINFYDVQARMGMHQDKDEKSL-APVVSLSIGDTCTFRFGNTENRNRPYRDIALASGD 176 Query: 175 VVVWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQAG 213 V V+GG SRL +HG+Q + A P R+N+T R+ G Sbjct: 177 VFVFGGPSRLAFHGVQKIHAESAPDGCGVEHGRWNITMRETG 218 >UniRef50_D2A2Y9 Putative uncharacterized protein GLEAN_07602 n=1 Tax=Tribolium castaneum RepID=D2A2Y9_TRICA Length = 297 Score = 159 bits (401), Expect = 8e-38, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 83/243 (34%), Gaps = 45/243 (18%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMV-----TPGGYTMSVAMTN------C 64 G + ++ F + S+ P + + P G N Sbjct: 43 PGLIFIKNPFTSIGQRYWVVRCLQDYSKRPNKTNLDALNLVPEGKEWWEVCQNNNNKILM 102 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 L W T + + P+ L + A + + F +A ++N Y Sbjct: 103 NKLRWVTLGYHHDWESKVYAEENKG-EFPKDLAELSRFIAESLNFLHFNAEAAIVNYYHM 161 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 + LS H D E +L+AP++S+S G AIF GG ++D + L GD+VV ESRL Sbjct: 162 DSTLSGHTDHSEHNLKAPLISLSFGQTAIFLLGGKTKDDEPSAMFLRSGDIVVMSEESRL 221 Query: 185 FYHGIQPLK--------------------------------AGFHPLTIDCRYNLTFRQA 212 YHG+ + F + R N+ RQ Sbjct: 222 CYHGVPKILQMDSRFWNCFEENDFTNDCKNVVNICKEETLWKPFGDYLNNSRININVRQV 281 Query: 213 GKK 215 ++ Sbjct: 282 LQR 284 >UniRef50_A3VR77 Alkylated DNA repair protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VR77_9PROT Length = 222 Score = 158 bits (400), Expect = 1e-37, Method: Composition-based stats. Identities = 72/208 (34%), Positives = 93/208 (44%), Gaps = 5/208 (2%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 L GA L + A ++ Q + VTPGG TMS N G Sbjct: 5 PQPVREVVDLGEGACHLPGYLPAKAASDLQQHLIHLCQDRWIVPVTPGGQTMSARQMNLG 64 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 LGW T R+GY Y P P WP MP + + A P+A L+N Y P Sbjct: 65 PLGWVTDRRGYRYEPRHPVDGAAWPEMPPALIRIWNDLLPEAP----SPEAGLVNLYGPT 120 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLF 185 AK+ LH+D DE PI+SVS G P F+ GG R + ++L HGDV++ G SR F Sbjct: 121 AKMGLHRDADEAAKDVPILSVSFGAPGRFRLGGATRKGSTRSIVLGHGDVLILAGPSRHF 180 Query: 186 YHGIQPL-KAGFHPLTIDCRYNLTFRQA 212 YHGI + R +LT R+ Sbjct: 181 YHGIDRILLPSPLFAEDPHRLSLTLRRV 208 >UniRef50_B3RXF0 Putative uncharacterized protein (Fragment) n=1 Tax=Trichoplax adhaerens RepID=B3RXF0_TRIAD Length = 271 Score = 158 bits (400), Expect = 1e-37, Method: Composition-based stats. Identities = 51/184 (27%), Positives = 74/184 (40%), Gaps = 9/184 (4%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH-------LG 68 G + +R F I+ P + + G + H L Sbjct: 19 PGFIYIRNPFLNCGQRYWIKRCLKNFHTYPSKTNLDAHGNSTKGKEVVTKHRNDLMDKLR 78 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKL 128 WTT Y +S + N+ P L + A G+P F P+A +IN Y + L Sbjct: 79 WTTLGYHYDWSTKEYYHNRK-SEFPTDLAELTKLLAATVGFPLFSPEAAIINYYKLDSTL 137 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHG 188 S H D E D AP+ S+S G AIF GG + + +E GD+ + GESRL YH Sbjct: 138 SGHTDHSEFDFTAPLFSISFGQKAIFLLGGRTTSVTPVAMYIESGDICIMSGESRLAYHA 197 Query: 189 IQPL 192 + + Sbjct: 198 VPRI 201 >UniRef50_B9GZQ0 Predicted protein n=13 Tax=Magnoliophyta RepID=B9GZQ0_POPTR Length = 353 Score = 158 bits (399), Expect = 1e-37, Method: Composition-based stats. Identities = 58/270 (21%), Positives = 85/270 (31%), Gaps = 64/270 (23%) Query: 7 DAEPWQEPLAAGAVILRRFAFNAAE-QLIRDINDVASQSPFRQMVTPGGYTMS------- 58 D + G + + + IR+ Q P R +S Sbjct: 83 DRPVFGLESRPGFYFIPGALSVDEQCRWIRESLMSFPQPPNRTNHNAIYGPISDLFIAAK 142 Query: 59 ------------------------------------VAMTNCGHLGWTTHRQGYLYSPID 82 A L W+T + +S + Sbjct: 143 ERKVLVEDENMPANTRRWSFCEEDSVLLRGKSCKPVSASVLLRKLRWSTLGLQFDWSKRN 202 Query: 83 PQTNKPWPAMPQSFHNLCQRAATAAG--YPDFQPDACLINRYAPGAKLSLHQDKDEPDLR 140 + P +P L ++ A A +F P+A ++N +A G L H D E D Sbjct: 203 YNVSLPHNKIPDGLCQLAKKLAAPAMPVGEEFHPEAAIVNYFASGDTLGGHLDDMEADWS 262 Query: 141 APIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFH--- 197 PIVS+SLG AIF GG R DP + L GDVV+ GE+R +HG+ + Sbjct: 263 KPIVSMSLGCKAIFLLGGKSREDPPLAMFLRSGDVVLMAGEARECFHGVPRIFTDKENAE 322 Query: 198 ---------------PLTIDCRYNLTFRQA 212 R N+ RQ Sbjct: 323 ITALELHFCDENDILEYIRTSRININIRQV 352 >UniRef50_UPI00017B2DD1 UPI00017B2DD1 related cluster n=1 Tax=Tetraodon nigroviridis RepID=UPI00017B2DD1 Length = 643 Score = 157 bits (397), Expect = 2e-37, Method: Composition-based stats. Identities = 48/232 (20%), Positives = 71/232 (30%), Gaps = 39/232 (16%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 E G +L F E + D +S + A Sbjct: 122 PCEEDVSVSFPMGLALLDNFVSPEEEASLLSAVDWSSSNDGVT-----------AQKAMK 170 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 H + + Y + +KP PA +P +R T D PD +N+Y Sbjct: 171 HRRVKHYGFEFRYDNNNVDKDKPLPAGIPAECLPFLERCLTNKII-DVMPDQLTVNQYES 229 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 G + H D I+S+SL + F + L L+L ++V GESR Sbjct: 230 GQGIPPHVDTHSA-FEDAILSLSLRAQTVMDFRHP--DGSLVALVLPGRSLLVMKGESRY 286 Query: 185 -FYHGIQPLKAGFHPLT----------------------IDCRYNLTFRQAG 213 + HGI P K P R + TFR+ Sbjct: 287 LWTHGITPRKFDVVPSCDSQPSAPTSHDSQSQSNLTLSRRATRTSFTFRKIR 338 >UniRef50_Q17GQ0 Putative uncharacterized protein (Fragment) n=2 Tax=Culicini RepID=Q17GQ0_AEDAE Length = 292 Score = 157 bits (397), Expect = 2e-37, Method: Composition-based stats. Identities = 52/200 (26%), Positives = 79/200 (39%), Gaps = 22/200 (11%) Query: 16 AAGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTP-----------GGYTMSVAMT- 62 G +++ F A + + P R + S+A T Sbjct: 18 RPGLILIANPFTKPAQRYWMARCLQDYPKHPNRTNLPDTIMDKFGTYSGHFDWWSIAKTI 77 Query: 63 --------NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP 114 L WTT Y ++ + P LC+ A + G+ F+P Sbjct: 78 EDPQERAKLWKALRWTTLGYHYDWTNKIYEEAAR-NEFPADLEELCRHFAESLGFRGFKP 136 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 +A ++N Y G+ L+ H D E +L AP+ S S G PA+F GG R++ LLL GD Sbjct: 137 EAAIVNYYPTGSTLAGHTDHSEKNLEAPLFSFSFGQPAVFLIGGPTRDEKPDALLLRSGD 196 Query: 175 VVVWGGESRLFYHGIQPLKA 194 V+V SRL YH + + Sbjct: 197 VIVMTRASRLCYHAVPKVFP 216 >UniRef50_D2B1L1 Alkylated DNA repair protein n=3 Tax=Actinomycetales RepID=D2B1L1_STRRD Length = 213 Score = 157 bits (397), Expect = 2e-37, Method: Composition-based stats. Identities = 68/205 (33%), Positives = 100/205 (48%), Gaps = 16/205 (7%) Query: 14 PLAAGAVILRRFAFNA-AEQLIRDINDVASQS-PFRQMVTPGGYTMSVAMTNCGHLGWTT 71 +A GAV + + A QL+R A ++ PGG MSV T C W Sbjct: 14 EIAPGAVHVPDWLSPARQRQLVRACRAWARPPLGMERIRLPGGGLMSV-RTVCLGRRWRP 72 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLH 131 Y Y+ ++P +PQ L + A ++PD L+N Y A + +H Sbjct: 73 ----YRYT------DEPVEPLPQWLAELGRAAVAQTLGGPYEPDVALVNFYDDAATMGMH 122 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQFGGL-KRNDPLKRLLLEHGDVVVWGGESRLFYHGIQ 190 QD+DE AP+VS+SLG +F+FG R P + LE GD+ V+GG SRL +HG++ Sbjct: 123 QDRDERA-AAPVVSLSLGDACVFRFGNTATRARPWSDVRLESGDLFVFGGPSRLAFHGVR 181 Query: 191 PLKAGFHPLTI-DCRYNLTFRQAGK 214 + G P + R N+T RQ+G+ Sbjct: 182 RILPGTGPHDLIQGRLNITLRQSGQ 206 >UniRef50_Q7QEU7 AGAP000155-PA n=1 Tax=Anopheles gambiae RepID=Q7QEU7_ANOGA Length = 293 Score = 157 bits (396), Expect = 3e-37, Method: Composition-based stats. Identities = 56/248 (22%), Positives = 78/248 (31%), Gaps = 49/248 (19%) Query: 16 AAGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMS---------------- 58 G +++ F A Q + P + G Sbjct: 36 RPGLLVVANPFTAEAQRQWMTRSLADYPIPPNATNQSGVGQQARDAVGSWWEQLQTIPTP 95 Query: 59 -VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC 117 L W T Y ++ + P L + AT GY F P+A Sbjct: 96 AERRKFAKSLRWATLGYQYDWTNKLYDEARR-EPFPCELGALVRYVATTLGYDRFSPEAA 154 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 ++N Y GA L+ H D E D AP+ S S G PA+F GG R + LLL GD+VV Sbjct: 155 IVNYYPAGATLAGHTDHSEDDQTAPLFSFSFGQPAVFLIGGTSREEHPDALLLRSGDIVV 214 Query: 178 WGGESRLFYHGIQPLKAGFHPLTI------------------------------DCRYNL 207 G SRL YH + + R N+ Sbjct: 215 MTGASRLCYHAVPRVCIDAELPEGLGCSAARWAVLDAERPGAVQWGAAVEEYMQHSRINI 274 Query: 208 TFRQAGKK 215 RQ ++ Sbjct: 275 NVRQVLRE 282 >UniRef50_B8HYX6 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HYX6_CYAP4 Length = 204 Score = 156 bits (395), Expect = 4e-37, Method: Composition-based stats. Identities = 44/200 (22%), Positives = 70/200 (35%), Gaps = 26/200 (13%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G + F A E+ + + D Q P+ + + Y Sbjct: 25 PGLRYIPNFINPAVEKTLLEEID---QQPWITDL---------------KRRVQHYGYRY 66 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 Y +P+ L R GY PD ++N Y PG ++ H D + Sbjct: 67 DYKARAISPEAYLGTLPEWLKPLTNRLWQE-GYIPDLPDQVIVNEYIPGQGITAHIDCID 125 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAG 195 I+S+SLG I +F + L+LE +VV G++R + H I K+ Sbjct: 126 -CFSDTILSLSLGSDCIMRFTAPSHT--TEDLVLERRSLVVLQGDARYQWQHSIPARKSD 182 Query: 196 ---FHPLTIDCRYNLTFRQA 212 R +LTFR+ Sbjct: 183 LIKGQKQARSRRISLTFRKV 202 >UniRef50_A8P5P3 Putative uncharacterized protein n=1 Tax=Brugia malayi RepID=A8P5P3_BRUMA Length = 576 Score = 156 bits (393), Expect = 6e-37, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 64/207 (30%), Gaps = 24/207 (11%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 +L F E + + P G T + Sbjct: 125 PNNLWVLPDFINPDEEAALITVIQDY---------LPRGKT-------LKNRKVIHFGFE 168 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 + Y +P + R AG +PD +N Y PG + H D Sbjct: 169 FNYDNNMASEQPSPDPIPSVCQPVIDRML-GAGIFKEKPDQVTVNIYEPGNGIPSHVDTH 227 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKA 194 I S+SL + +F + +LL + V GESR + HGI K Sbjct: 228 SA-FSDTIASLSLLSDLVMEFRDFANTSTIYDVLLPRFSLTVMRGESRYRWKHGIAKRKY 286 Query: 195 GFHPLT-----IDCRYNLTFRQAGKKE 216 +P+T R + TFR +++ Sbjct: 287 DINPVTNKLMARQLRVSFTFRNVIREK 313 >UniRef50_Q9SA98 Alkylated DNA repair protein alkB homolog n=4 Tax=Magnoliophyta RepID=ALKBH_ARATH Length = 345 Score = 156 bits (393), Expect = 8e-37, Method: Composition-based stats. Identities = 54/271 (19%), Positives = 85/271 (31%), Gaps = 64/271 (23%) Query: 6 ADAEPWQEPLAAGAVILRRFAF-NAAEQLIRDINDVASQSPFRQMVTPGGYT-------- 56 + + G + + I++ Q P R Sbjct: 74 DSSPVFCIDNRPGFYFIPDALSLKEQCKWIKESLTSFPQPPNRTNHNAIYGPIDDLFDSA 133 Query: 57 ------------------------MSVAMTNCG---------HLGWTTHRQGYLYSPIDP 83 ++C L W+T + +S + Sbjct: 134 KENKVLVQDDLTNNKWKFYEEVDIEKATRSSCKSVSASVLLRKLRWSTLGLQFDWSKRNY 193 Query: 84 QTNKPWPAMPQSFHNLCQ--RAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRA 141 + P +P + L + A +F+P+ ++N + G L H D E D Sbjct: 194 DVSLPHNNIPDALCQLAKTHAAIAMPDGEEFRPEGAIVNYFGIGDTLGGHLDDMEADWSK 253 Query: 142 PIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI 201 PIVS+SLG AIF GG ++DP + L GDVV+ GE+R +HGI + G I Sbjct: 254 PIVSMSLGCKAIFLLGGKSKDDPPHAMYLRSGDVVLMAGEARECFHGIPRIFTGEENADI 313 Query: 202 DC--------------------RYNLTFRQA 212 R N+ RQ Sbjct: 314 GALESELSHESGHFFAEYIKTSRININIRQV 344 >UniRef50_B6JWW7 AlkB-like protein n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JWW7_SCHJY Length = 296 Score = 154 bits (390), Expect = 2e-36, Method: Composition-based stats. Identities = 46/179 (25%), Positives = 71/179 (39%), Gaps = 14/179 (7%) Query: 49 MVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQR-AATAA 107 T ++V L W T + Y ++ P P+ +L + A Sbjct: 117 PPTGSSKPVTVKNLMEKKLRWITFGEQYNWTTRVYPDPATAPPFPEKLGHLTEELVHKAT 176 Query: 108 GYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR 167 + D++ +A ++N Y+P LS H D E DL P++S+S+GL I+ G R D K Sbjct: 177 EFKDWKAEAAIVNFYSPRDTLSGHVDDAEDDLTLPLLSMSIGLDCIYLLGTETRKDVPKA 236 Query: 168 LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID-------------CRYNLTFRQAG 213 + L GD V+ G SR YH + + P + R N RQ Sbjct: 237 IRLHSGDAVIMTGLSRKAYHAVPKIIPNTAPSYLQLKDEAVWNQWIQTKRVNFNIRQVR 295 >UniRef50_O60066 Alkylated DNA repair protein alkB homolog n=2 Tax=Schizosaccharomyces pombe RepID=ALKBH_SCHPO Length = 297 Score = 154 bits (389), Expect = 2e-36, Method: Composition-based stats. Identities = 50/242 (20%), Positives = 89/242 (36%), Gaps = 43/242 (17%) Query: 16 AAGAVILRRFAFNAAEQLIRDIN---------DVASQSPFRQMVTPGG------------ 54 A G +IL+ + + + + + + SPF Q+ Sbjct: 54 APGLLILKNYVSSELQMQLLKSIMFTQIQDPENKTNLSPFYQLPLGNDSIWRRYYNGDGE 113 Query: 55 ---------YTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT 105 ++V L W T + Y ++ + P P+ + ++ Sbjct: 114 SIIDGLGETKPLTVDRLVHKKLRWVTLGEQYDWTTKEYPDPSKSPGFPKDLGDFVEKVVK 173 Query: 106 A-AGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP 164 + ++ +A ++N Y+PG LS H D+ E DL P++S+S+GL I+ G R++ Sbjct: 174 ESTDFLHWKAEAAIVNFYSPGDTLSAHIDESEEDLTLPLISLSMGLDCIYLIGTESRSEK 233 Query: 165 LKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI------------DCRYNLTFRQA 212 L L GDVV+ G SR +H + + P + R N RQ Sbjct: 234 PSALRLHSGDVVIMTGTSRKAFHAVPKIIPNSTPNYLLTGNKAWDGWISRKRVNFNVRQV 293 Query: 213 GK 214 Sbjct: 294 RP 295 >UniRef50_C3XQU3 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQU3_BRAFL Length = 365 Score = 151 bits (382), Expect = 1e-35, Method: Composition-based stats. Identities = 47/252 (18%), Positives = 78/252 (30%), Gaps = 56/252 (22%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTP---------------------GGYTM 57 IL + + P + Sbjct: 89 LFILNPWRRGCQRYWVTRCLRDYPCKPNVTNLDKLQHGVDNVWKEGYSEYRKYRLHEGKK 148 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC 117 + T L WTT Y + + Q + + P L A G+P ++ + Sbjct: 149 TQPKTLLHKLRWTTLGYHYDWDKKEYQQER-YTEFPPDLSQLSTHVAQTLGFPRYRAQSA 207 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 ++N Y ++L H D E D PI+S S G A+F GG ++ + L +GD++V Sbjct: 208 IVNYYGLDSQLGGHVDHQELDYSKPIISFSFGQTAVFLLGGKTKSVKPMAMFLRNGDIMV 267 Query: 178 WGGESRLFYHGIQPLKAG----------------------------------FHPLTIDC 203 G++RL YHG+ + +C Sbjct: 268 MSGDTRLAYHGVPKILKPPIAELLPEGLCEGDREGELHESMLPSTVEKSWEETALFMEEC 327 Query: 204 RYNLTFRQAGKK 215 R N+T RQ + Sbjct: 328 RINITVRQVVAE 339 >UniRef50_C9YWY9 Putative DNA repair protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YWY9_STRSW Length = 242 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 66/237 (27%), Positives = 97/237 (40%), Gaps = 34/237 (14%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSP--FRQMVTPGGYTMSVA 60 +LF + + GAV + ++ + D ++ P R + TPGG TM+ Sbjct: 4 ELFP---RDRREIVPGAVHVPDRLDAGQQRRLLDACRAWARPPAGLRTVRTPGGGTMTAR 60 Query: 61 MTNCGHL---------------------GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNL 99 G G T+ Y + +D P P L Sbjct: 61 QVCLGRHWGVVPACPACGRAVRDNPACPGRHTYPYAYSRTVVDGD-GAPVKPFPAWLGEL 119 Query: 100 CQRAATAAGYPDF---QPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF 156 +RA P D LIN Y A++ +H+D DEP AP+VS+SLG +F+F Sbjct: 120 GRRAVADTLGPQRATDPYDIALINYYDADARMGMHRDSDEPS-DAPVVSLSLGDTCLFRF 178 Query: 157 GGL-KRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID--CRYNLTFR 210 G R P + L GD+ V+GGE+R YHG+ + AG P + R N+T R Sbjct: 179 GNPRTRTRPYTDVELRSGDLFVFGGEARRAYHGVPRVYAGTAPPGLGLTGRLNITLR 235 >UniRef50_C0WHI3 Alkylated DNA repair protein n=3 Tax=Corynebacterium RepID=C0WHI3_9CORY Length = 227 Score = 149 bits (377), Expect = 5e-35, Method: Composition-based stats. Identities = 63/231 (27%), Positives = 95/231 (41%), Gaps = 25/231 (10%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLI----RDINDVASQSPFR--QMVTPGGYTM 57 LF +A G + + ++++ R+I + +P Q G M Sbjct: 2 LFDSLPRPNAYVAPGVGHVPGWVGIGKQKVLVEETREIARAYAHTPMAMVQPRLKSGGQM 61 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--------- 108 SV HLG H Y Y +D P +P S L A A Sbjct: 62 SVFQL---HLGRYWHYPSYRY--VDNMEGTRVPPVPDSLRELAPVALRQAAQVAPELEPW 116 Query: 109 YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKR 167 +F P+ L+N Y PG+ + +H D E AP++S+S+G A+F+ G + R P Sbjct: 117 VDNFVPEMALVNYYPPGSAMGMHVDDSEGS-PAPVISLSIGDEALFRIGHTENRTKPWDD 175 Query: 168 LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQAGKK 215 + L GD+VV+GG R YHG+ + G P R N+T RQ + Sbjct: 176 VTLCSGDLVVFGGPKRFAYHGVVRVNDGTLPEGCGLQEGRINITIRQVSAR 226 >UniRef50_UPI000180CD20 PREDICTED: similar to alkB, alkylation repair homolog 8 (E. coli) (alkbh8) n=1 Tax=Ciona intestinalis RepID=UPI000180CD20 Length = 593 Score = 149 bits (376), Expect = 6e-35, Method: Composition-based stats. Identities = 53/219 (24%), Positives = 77/219 (35%), Gaps = 24/219 (10%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 +FA+ + L G + + F EQ + + + +S Sbjct: 119 IFAEEKSD---LPNGLIKIENFLNKEEEQALINCIQ-------HDISILSNDHVSEK--- 165 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 H + + Y D N P +P NL R GY +PD IN Y Sbjct: 166 LKHRTVLHYGYKFRYGTNDVDINNPISEGLPNYIENLLDRIMA-TGYLPSRPDQLTINMY 224 Query: 123 APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES 182 PG + H D + +VSLG + F K + +E + ++ GES Sbjct: 225 EPGDGIPPHTDNTR-SFDGVLSTVSLGSHTVMNFS--KEGAERIDVCVEPRTLFLFTGES 281 Query: 183 RL-FYHGIQPLK-----AGFHPLTIDCRYNLTFRQAGKK 215 R + HGIQ K G T RY+LTFR K Sbjct: 282 RYEWRHGIQQRKFDILDQGKKITTRTIRYSLTFRTVVKD 320 >UniRef50_A9TG90 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TG90_PHYPA Length = 401 Score = 149 bits (375), Expect = 8e-35, Method: Composition-based stats. Identities = 48/174 (27%), Positives = 72/174 (41%), Gaps = 23/174 (13%) Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGY-PDFQPDACLIN 120 T L W T + +S P+ +P +L +R A A DF+ +A ++N Sbjct: 227 TLVRKLRWATVGIQFDWSKRAYNEALPFQEIPPKLADLARRLAKPAMENEDFKAEAAIVN 286 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y P L H D E D+ PIVS+SLG AIF GG R++P + + GDVV+ G Sbjct: 287 FYGPDDMLGGHVDDMEADMSKPIVSISLGCKAIFLLGGTTRDEPPAAMFVRSGDVVLMAG 346 Query: 181 ESRLFYHGIQPLKAGFHPLTID----------------------CRYNLTFRQA 212 +R +HG+ + + + R N+ RQ Sbjct: 347 PARHCFHGVPRIFSEAKESELPDFTSMSDVDGIQPRSIVKYLESSRINVNIRQV 400 >UniRef50_D0MUQ0 Alkylated DNA repair protein alkB 8 n=1 Tax=Phytophthora infestans T30-4 RepID=D0MUQ0_PHYIN Length = 640 Score = 149 bits (375), Expect = 8e-35, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 59/201 (29%), Gaps = 21/201 (10%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G F A E + + + + + + Sbjct: 143 PGLKFGAEFVTEAQEAACLAFFERENGAHWANTI--------------RARQVQHFGYEF 188 Query: 77 LYSPIDPQTNKPW-PAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y ++P +P+ + + +PD +N Y PG ++ H D Sbjct: 189 NYDTRRCDPDQPMKEPIPEVLQPVIDKIVECGIMDGDRPDQITVNEYLPGQGIAFHLDTH 248 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK- 193 I S+S+ + F + +LL + V G SR + H I P Sbjct: 249 SA-FTTTIASLSICSEVVMDFRHP-DGVRNEGVLLPARSLAVMSGASRYKWEHAIVPRTF 306 Query: 194 --AGFHPLTIDCRYNLTFRQA 212 + R ++TFR+ Sbjct: 307 DVIDGKQIPRQRRVSITFRKI 327 >UniRef50_UPI0000E47318 PREDICTED: similar to LOC494680 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47318 Length = 488 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 49/196 (25%), Positives = 69/196 (35%), Gaps = 21/196 (10%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM-------------- 61 G +R F A +R P + + Sbjct: 81 PGFQFIRNPFLPGAQRYWVRRCLADYPCKPNVTNLDTQYQKDQLTNPWQASRKNFSSPNQ 140 Query: 62 -----TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA 116 L W T Y ++ ++ P+ + A GYP FQ A Sbjct: 141 QTQEKRLMDQLRWVTLGYHYDWNNKVYNEDQ-HSLFPEDLGPMSALIAEVLGYPRFQSQA 199 Query: 117 CLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVV 176 ++N Y + L H D E DL AP++S SLG AI GG + L L GDV+ Sbjct: 200 AIVNFYHMDSTLGGHTDHSEFDLTAPLISYSLGQSAILLVGGKTKATKPLALHLRSGDVI 259 Query: 177 VWGGESRLFYHGIQPL 192 + GGESRL YH + + Sbjct: 260 ILGGESRLAYHAVPKI 275 >UniRef50_Q4D9X3 Alkylated DNA repair protein, putative n=3 Tax=Trypanosoma RepID=Q4D9X3_TRYCR Length = 323 Score = 148 bits (373), Expect = 1e-34, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 70/225 (31%), Gaps = 30/225 (13%) Query: 17 AGAVILRRFAFNAAEQ--LIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 G + +Q I S ++ + W T Sbjct: 89 PGLLFFPDAISEEEQQAFCRDAILRYGDSSQHPNHLSTHASKPKCTKRYEAPMRWATLGF 148 Query: 75 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD---------FQPDACLINRYAPG 125 Y ++ + A P + + ++P ++N ++ G Sbjct: 149 SYDWTSKTYTREN-YSAFPPALKRRIEEILHLCSSTPDLKDVNPSIYEPQTAIVNYFSVG 207 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLF 185 + + HQD E ++ P++S+SLG +F G R+D L GDV V+ G SR+ Sbjct: 208 SMMMAHQDVSEESMQHPLISISLGCSCVFLMGTSSRDDAPYAFWLRSGDVAVFSGPSRVA 267 Query: 186 YHGIQPLKAGFHP------------------LTIDCRYNLTFRQA 212 +H I + P R N+ RQ Sbjct: 268 FHSIPRIMDDCPPHLCTISGENNEDEVYWRTQMRHMRININVRQV 312 >UniRef50_Q7KUZ2 AlkB n=12 Tax=Drosophila RepID=Q7KUZ2_DROME Length = 332 Score = 147 bits (371), Expect = 2e-34, Method: Composition-based stats. Identities = 51/195 (26%), Positives = 77/195 (39%), Gaps = 18/195 (9%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHL-------- 67 G +++R F+ ++P + + SV L Sbjct: 74 PGIIVIRNPFSERGRRYWSARCLRDFPRTPNIVNLNERLFDESVRSDWWKQLNLCSDGVE 133 Query: 68 --------GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 WTT + + P+ +LC A A GY DF+P+A ++ Sbjct: 134 FQRIKSAMRWTTFGYHHNWDTKIYDEEMQ-SPFPEDLSSLCGLFAQALGYADFKPEAAIV 192 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y G+ LS H D EP+ AP+ S S G AIF GG + + L+ GDV++ Sbjct: 193 NYYPVGSTLSGHTDHSEPNKSAPLFSFSFGQTAIFLIGGRSLEEKPTAIYLQSGDVMIMS 252 Query: 180 GESRLFYHGIQPLKA 194 GESRL YH + + Sbjct: 253 GESRLCYHAVPRIIK 267 >UniRef50_C7NLG8 Alkylated DNA repair protein n=2 Tax=Actinomycetales RepID=C7NLG8_KYTSD Length = 228 Score = 147 bits (371), Expect = 3e-34, Method: Composition-based stats. Identities = 60/223 (26%), Positives = 91/223 (40%), Gaps = 23/223 (10%) Query: 4 LFADA--EPWQEPLAAGAVILRRFAFNAAEQLI-RDINDVASQSPFRQMVTPGGYTMSVA 60 LF D + +A G + + + + R A+ G MSV Sbjct: 7 LFGDETLPRPAQEVAPGCHWVPGWLDAGQQAWVVRQYRRWAAGPVPAHRPAVRGGRMSVT 66 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ---PDAC 117 M GW GY + + + +P L +RA A G+ + PD Sbjct: 67 MVP---FGWVWTSAGYARTG---EQDAAPLPVPDWMVRLYRRAVVATGFDGWAEAAPDVA 120 Query: 118 LINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGG-LKRNDPLKRLLLEHGDVV 176 L+N Y P A + +H+D DE AP+VS+S+G F+FG R P + LE GD+V Sbjct: 121 LVNHYRPDASMGMHRDADELT-EAPVVSLSVGDACTFRFGSTETRTRPWTDIRLESGDLV 179 Query: 177 VWGGESRLFYHGIQPLKAGFHPL---------TIDCRYNLTFR 210 V+GG +R +HG+ + G + R N+T R Sbjct: 180 VFGGPARRAFHGVPRIHPGTAGPQVAAAQAEAELPGRLNITLR 222 >UniRef50_Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB n=1 Tax=Dictyostelium discoideum RepID=ALKB_DICDI Length = 393 Score = 146 bits (369), Expect = 5e-34, Method: Composition-based stats. Identities = 46/257 (17%), Positives = 83/257 (32%), Gaps = 62/257 (24%) Query: 17 AGAVILR-RFAFNAAEQLIRDINDVASQSPFRQMVT------------------------ 51 G ++ F + ++ I+ + + P +T Sbjct: 123 PGFYFIKSPFTASQQKKWIKHALEDYADPPNNNNITLFHGPIKNLWKNGEKELINEELKS 182 Query: 52 -----------------PGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQ 94 G + L W+T Y ++P + + + P Sbjct: 183 QGKHDDDEIEQPTRPLDKNGEPLPTYRQLLDKLAWSTLGYQYQWTPRLY-SEEFYEEFPD 241 Query: 95 SFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIF 154 L Q+ A A + + +A +N Y+ + + H D E ++ PI+S+S G A+F Sbjct: 242 DLQELVQKIAIATKFDPYVAEAATVNFYSEDSIMGGHLDDAEQEMEKPIISISFGSTAVF 301 Query: 155 QFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP---------------- 198 G R+ L + GD+V+ GG SR YHG+ + Sbjct: 302 LMGAETRDIAPVPLFIRSGDIVIMGGRSRYCYHGVAKIVENSFDLGLIDENDDQDLKYKI 361 Query: 199 ---LTIDCRYNLTFRQA 212 + R N+ RQ Sbjct: 362 QWLKEKNRRVNINTRQV 378 >UniRef50_C6W3C2 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID=C6W3C2_DYAFD Length = 202 Score = 146 bits (368), Expect = 6e-34, Method: Composition-based stats. Identities = 50/216 (23%), Positives = 75/216 (34%), Gaps = 20/216 (9%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L LF E P F + I ++ P+RQ V A Sbjct: 4 LSLFGSEETLLFP-ENLLEYYPGFVPPDESAAL--IGKWITEVPWRQQVMQMYGKQVTAP 60 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 G T + + +P + L +R G F ++ L+N Sbjct: 61 RLMAWYGDTEKSYTFSGTRFEPY------GWTKELAALKKRIEEKTG---FTFNSVLLNY 111 Query: 122 YAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 Y G ++ H D ++ R P I SVSLG F+F + L LE+G +++ Sbjct: 112 YRDGNDSVAWHGDNEQELGRNPVIASVSLGQERRFEFRYRADHSRKYGLPLENGSLLIMK 171 Query: 180 GE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 G+ + H I K P R NLTFR + Sbjct: 172 GDLQHTWEHRIPKSKTQNAP-----RINLTFRTIQR 202 >UniRef50_UPI0001983B96 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001983B96 Length = 456 Score = 145 bits (367), Expect = 7e-34, Method: Composition-based stats. Identities = 61/233 (26%), Positives = 94/233 (40%), Gaps = 31/233 (13%) Query: 8 AEPWQEPLAAGAVILRRFAF-NAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVAMTNCG 65 QE L G V+L+ + ++++ D+ F + G + + M Sbjct: 227 EGTTQEVLRPGMVLLKGYISLTEQIKMVKKCRDLGVGPGGFYRPGYQDGAKLRLQMMC-- 284 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YP 110 LG Q Y P P +P F L +RA + P Sbjct: 285 -LGMNWDPQTRKYEKWHPLDGSETPDIPHEFSVLVERAIQDSQSLIKKNSGENNVEDTLP 343 Query: 111 DFQPDACLINRYAPGAKLSLHQDKDE----PDLRAPIVSVSLGLPAIFQFGGLKRNDPLK 166 P+ C++N Y +L LHQD+DE P+VS SLG A F +G + D Sbjct: 344 RMSPNICIVNFYTTSGRLGLHQDRDESEESLLKGLPVVSFSLGDSAEFLYGNQRNVDAAG 403 Query: 167 RLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQA 212 +++LE GDV+++GG SR +HG+ + P + + R NLT RQ Sbjct: 404 KVVLESGDVLIFGGPSRHIFHGVSSIIPNSAPNSLLEETNLLPGRLNLTLRQL 456 >UniRef50_UPI000051A07C PREDICTED: similar to AlkB CG33250-PA n=3 Tax=Neoptera RepID=UPI000051A07C Length = 310 Score = 145 bits (365), Expect = 1e-33, Method: Composition-based stats. Identities = 51/240 (21%), Positives = 79/240 (32%), Gaps = 43/240 (17%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSP----------------FRQMVTPGGYTMSV 59 G + ++ F I S+ P + + + Sbjct: 65 PGLIFIKNPFTTYGQRYWIIKCLKEYSKKPHKLNLHAHDILNDDENWWDICFKNFDKGEI 124 Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 L W T + + + +P L A G+ DF+ +A +I Sbjct: 125 NTKLISKLRWATFGYHHNWDTKLY-SETCKTKIPIELSLLTSFLAQTLGFKDFKAEAAII 183 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y + L+ H D E ++ AP+ S+S G AIF GGL + D + L GD+++ Sbjct: 184 NYYRMNSTLAGHTDHSELNVEAPLFSISFGQTAIFLIGGLMQEDTTNAIFLRSGDIIIMS 243 Query: 180 GESRLFYHGIQPLK-------------------------AGFHPLTIDCRYNLTFRQAGK 214 G SRL YHGI + D R N+ RQ K Sbjct: 244 GMSRLRYHGIPKILLTIDKPWDNEELNDDQCSRLNQNDWKKAKIYISDARINMNVRQVLK 303 >UniRef50_Q96BT7 Alkylated DNA repair protein alkB homolog 8 n=32 Tax=Euteleostomi RepID=ALKB8_HUMAN Length = 664 Score = 145 bits (365), Expect = 1e-33, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 80/235 (34%), Gaps = 40/235 (17%) Query: 5 FADAEPWQE----PLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 F + W+E L G +++ + E+++ + D + + + Sbjct: 119 FVEKVQWKELRPQALPPGLMVVEEIISSEEEKMLLESVDWTEDTDNQN-----------S 167 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLI 119 + H + Y + +KP +P + ++ GY +PD I Sbjct: 168 QKSLKHRRVKHFGYEFHYENNNVDKDKPLSGGLPDICESFLEKWLRK-GYIKHKPDQMTI 226 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N+Y PG + H D IVS+SLG + F + ++L ++V Sbjct: 227 NQYEPGQGIPAHIDTHSA-FEDEIVSLSLGSEIVMDFKHP--DGIAVPVMLPRRSLLVMT 283 Query: 180 GESRL-FYHGIQPLKAGFHPLT-------------------IDCRYNLTFRQAGK 214 GESR + HGI K + R + TFR+ + Sbjct: 284 GESRYLWTHGITCRKFDTVQASESLKSGIITSDVGDLTLSKRGLRTSFTFRKVRQ 338 >UniRef50_Q9LJH2 Similarity to unknown protein n=7 Tax=Embryophyta RepID=Q9LJH2_ARATH Length = 455 Score = 144 bits (364), Expect = 1e-33, Method: Composition-based stats. Identities = 58/227 (25%), Positives = 92/227 (40%), Gaps = 31/227 (13%) Query: 13 EPLAAGAVILRRFAF-NAAEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 + G V+L+ + N ++ + + F Q + + M LG Sbjct: 231 TVIRPGMVLLKNYLSINDQVMIVNKCRRLGLGEGGFYQPGYRDEAKLHLKMMC---LGKN 287 Query: 71 THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPD 115 + Y P P +P F+ ++A + P PD Sbjct: 288 WDPETSRYGETRPFDGSTAPRIPAEFNQFVEKAVKESQSLAASNSKQTKGGDEIPFMLPD 347 Query: 116 ACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 C++N Y+ +L LHQDKDE + P+VS S+G A F +G + D + L LE Sbjct: 348 ICIVNFYSSTGRLGLHQDKDESENSIRKGLPVVSFSIGDSAEFLYGDQRDEDKAETLTLE 407 Query: 172 HGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQ 211 GDV+++GG SR +HG++ ++ P R NLTFRQ Sbjct: 408 SGDVLLFGGRSRKVFHGVRSIRKDTAPKALLQETSLRPGRLNLTFRQ 454 >UniRef50_Q9LZW8 Putative uncharacterized protein T20L15_50 n=3 Tax=Arabidopsis thaliana RepID=Q9LZW8_ARATH Length = 449 Score = 144 bits (364), Expect = 1e-33, Method: Composition-based stats. Identities = 59/228 (25%), Positives = 101/228 (44%), Gaps = 35/228 (15%) Query: 13 EPLAAGAVILRRFAFNA-AEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWT 70 + + G V+L+ F +++ ++ + F Q G + + M LG Sbjct: 227 KVIRPGMVLLKDFLTPDIQVDIVKTCRELGVKPTGFYQPGYSVGSKLHLQMMC---LGRN 283 Query: 71 THRQG-YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA---------------GYPDFQP 114 Q Y + + P +P +F+ L ++A A P P Sbjct: 284 WDPQTKYR---KNTDIDSKAPEIPVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSP 340 Query: 115 DACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLL 170 D C++N Y+ +L LHQD+DE + PIVS S+G A F +G + + + ++L Sbjct: 341 DICIVNFYSETGRLGLHQDRDESEESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVIL 400 Query: 171 EHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQ 211 E GDV+++GGESR+ +HG++ + P++ R NLTFR Sbjct: 401 ESGDVLIFGGESRMIFHGVKSIIPNSAPMSLLNESKLRTGRLNLTFRH 448 >UniRef50_D1I753 Whole genome shotgun sequence of line PN40024, scaffold_87.assembly12x (Fragment) n=18 Tax=Embryophyta RepID=D1I753_VITVI Length = 912 Score = 144 bits (363), Expect = 2e-33, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 66/240 (27%), Gaps = 48/240 (20%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 D + E G +L F E+ + D S Sbjct: 676 DSVPVSLVDSELNIPGIYLLHDFVSAKEEEELLAAVDKMSW------------------K 717 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP---DFQPDACLI 119 + + + Y + T + +P + +R ++ D D + Sbjct: 718 SLSKRRVQHYGYEFCYETRNVNTKQYLGKLPSFVSAIVERISSFPNLESAADIVLDQLTV 777 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR------------ 167 N Y PG LS H D I S+SL P I F K Sbjct: 778 NEYPPGVGLSPHIDTHSA-FEGFIFSLSLAGPCIMDFRRYTEGVWPKSASSSDMSVEYPD 836 Query: 168 ---------LLLEHGDVVVWGGESRL-FYHGIQPLK----AGFHPLTIDCRYNLTFRQAG 213 + L +++ GE+R ++H I K R + TFR+ Sbjct: 837 KSSSFLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRKVR 896 >UniRef50_A7SSH3 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SSH3_NEMVE Length = 648 Score = 144 bits (363), Expect = 2e-33, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 69/227 (30%), Gaps = 51/227 (22%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G I F E+ + D + H + Sbjct: 139 PPGLQIYEEFINEEEEKTLLDALGWDA-----------------PQKELRHRRVKHYGYE 181 Query: 76 YLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 +LY D KP P MP +++ R + + PD +N Y PG + H D Sbjct: 182 FLYGTNDIDRAKPLPGGMPAVCNDILTRMVSQGAVQN-TPDQLTVNEYLPGQGIPPHVDT 240 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 I S+SLG F + +LL ++V GESR + HGI P K Sbjct: 241 HSA-FEDGICSLSLGAKISMDFRHP--DSRHVSVLLPRRSLLVMSGESRYLWTHGITPRK 297 Query: 194 ----------------------------AGFHPLTIDCRYNLTFRQA 212 +G + R +LTFR+ Sbjct: 298 FDIIGSGLDTSIHEDQESIAADASNVSTSGVTQYERERRISLTFRKI 344 >UniRef50_C2BL13 Alkylated DNA repair protein n=2 Tax=Corynebacterium RepID=C2BL13_9CORY Length = 229 Score = 144 bits (362), Expect = 3e-33, Method: Composition-based stats. Identities = 60/231 (25%), Positives = 93/231 (40%), Gaps = 25/231 (10%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRD----INDVASQSPFRQMVTP--GGYTM 57 LF +A G + + ++ + + I + +P + G M Sbjct: 2 LFDSLPRPSVRVAPGVGHVPAWVGVDKQKALVEEMRGIAREYANTPMAMVRPRLKSGGQM 61 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--------- 108 SV HLG H Y Y +D P +P+S + A AA Sbjct: 62 SVFQL---HLGRYWHYPSYRY--VDNMEGTRVPPVPESLRQIAPGALRAAAEVAPELEPW 116 Query: 109 YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKR 167 F P+ L+N Y PG+ + + D E AP++S+S+G A+F+ G + R P Sbjct: 117 VDTFVPEMALVNYYPPGSAMGMRVDDSEES-PAPVISLSIGDEALFRMGHTEARTRPWDD 175 Query: 168 LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQAGKK 215 + L GD+VV+GG R YHG+ + G P R N+T RQ + Sbjct: 176 ITLCSGDLVVFGGPKRFAYHGVVRVNDGTLPEGCGLREGRINITIRQVSAR 226 >UniRef50_B7QP17 Methyltransferase, putative n=1 Tax=Ixodes scapularis RepID=B7QP17_IXOSC Length = 602 Score = 144 bits (362), Expect = 3e-33, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 63/214 (29%), Gaps = 34/214 (15%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G ++R A E L+ + Sbjct: 108 PPGLRLVREAVDEAEEALLWRLVSWDRD-----------------CRALKQREVRHFGYA 150 Query: 76 YLYSPIDPQTNKPW-PAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 134 + Y + + P +P+ R + G+ PD + RY PG + H D Sbjct: 151 FDYELQGVRKDAPLAEPIPEECAPFLGRLVAS-GHLSGLPDQLTVTRYLPGQGIPPHVDS 209 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 IV +SLG P + F + +LL ++ G SR + HG K Sbjct: 210 H-GSFEDGIVCLSLGSPVVMDFRHPDGDRA--AVLLPPRSALLLHGPSRYIWTHGTASRK 266 Query: 194 AGFHPL-----------TIDCRYNLTFRQAGKKE 216 + P R + TFR+ + + Sbjct: 267 SDVVPRTEVPGQGLTLSPRGVRISFTFRRIRRGD 300 >UniRef50_A7RXQ8 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RXQ8_NEMVE Length = 323 Score = 143 bits (361), Expect = 3e-33, Method: Composition-based stats. Identities = 45/188 (23%), Positives = 75/188 (39%), Gaps = 12/188 (6%) Query: 16 AAGA-VILRRFAFNAAEQLIRDINDVASQSPFRQMVTPG----------GYTMSVAMTNC 64 G I+ F A R + + P + + S + Sbjct: 78 RPGLIFIVNPFVSGAQHYWARRCLEDFPRKPNISNLDAHMSIGDDDCIWALSNSEHVNLI 137 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 L W + Y+ +D + K + P+ L A A GY + P+A ++N Y Sbjct: 138 DRLRWVHLGYQFDYNVVDYKPEKYY-GFPKDLGGLMHHLAEAIGYLGYTPEAGIVNYYPL 196 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 A + H D E DL P++SVS G A+F GG ++ L + GD+++ GE+RL Sbjct: 197 SASMGGHTDHYELDLSWPLISVSFGQSAVFLIGGKTKDVKPTALYIRSGDILIMSGEARL 256 Query: 185 FYHGIQPL 192 +H + + Sbjct: 257 AFHAVPRI 264 >UniRef50_B8HQU9 2OG-Fe(II) oxygenase n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HQU9_CYAP4 Length = 207 Score = 143 bits (361), Expect = 3e-33, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 64/199 (32%), Gaps = 26/199 (13%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G V ++ + + D N + Y Sbjct: 27 GLVYIKDYIDQTTHNYLISQIDSFPW------------------LNDLARRVQHYGYKYD 68 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEP 137 Y + ++P L + Y PD ++N Y PG ++ H D Sbjct: 69 YKSRGVDKSMYIASLPIWAKELAHKIRK--KYTTDLPDQVIVNEYMPGQGIANHIDCVN- 125 Query: 138 DLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAG- 195 IVS+SL + F ++ L+LE +VV G++R + HGI K+ Sbjct: 126 CFTDTIVSLSLCSSCVMDFVHIETGARK-SLMLEPRSLVVLSGDARYKWLHGIAKRKSDM 184 Query: 196 --FHPLTIDCRYNLTFRQA 212 R +LTFR+ Sbjct: 185 YKGEKYIRKRRVSLTFRKV 203 >UniRef50_D1HAA5 Whole genome shotgun sequence of line PN40024, scaffold_58.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HAA5_VITVI Length = 554 Score = 143 bits (361), Expect = 4e-33, Method: Composition-based stats. Identities = 61/233 (26%), Positives = 94/233 (40%), Gaps = 31/233 (13%) Query: 8 AEPWQEPLAAGAVILRRFAF-NAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVAMTNCG 65 QE L G V+L+ + ++++ D+ F + G + + M Sbjct: 325 EGTTQEVLRPGMVLLKGYISLTEQIKMVKKCRDLGVGPGGFYRPGYQDGAKLRLQMMC-- 382 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YP 110 LG Q Y P P +P F L +RA + P Sbjct: 383 -LGMNWDPQTRKYEKWHPLDGSETPDIPHEFSVLVERAIQDSQSLIKKNSGENNVEDTLP 441 Query: 111 DFQPDACLINRYAPGAKLSLHQDKDE----PDLRAPIVSVSLGLPAIFQFGGLKRNDPLK 166 P+ C++N Y +L LHQD+DE P+VS SLG A F +G + D Sbjct: 442 RMSPNICIVNFYTTSGRLGLHQDRDESEESLLKGLPVVSFSLGDSAEFLYGNQRNVDAAG 501 Query: 167 RLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQA 212 +++LE GDV+++GG SR +HG+ + P + + R NLT RQ Sbjct: 502 KVVLESGDVLIFGGPSRHIFHGVSSIIPNSAPNSLLEETNLLPGRLNLTLRQL 554 >UniRef50_Q09BP3 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09BP3_STIAU Length = 185 Score = 143 bits (361), Expect = 4e-33, Method: Composition-based stats. Identities = 54/205 (26%), Positives = 73/205 (35%), Gaps = 27/205 (13%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 +E G + + F ++ E + + + S R VA H GW Sbjct: 5 EEERPEGLLYVPDFLTDSEEARLLEHLRGLTFSEIRM-------RGQVAKRRTAHFGWL- 56 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLH 131 Y Y + +P PAMP L R A G Q L+N Y PGA + H Sbjct: 57 ----YGYESLKV---EPGPAMPDFLLPLRNRCAELMGELPEQLVEALLNEYPPGAAIGWH 109 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQF-GGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGI 189 +D P +V VSLG +F L L V GGESR + H I Sbjct: 110 RD--APMFGHQVVGVSLGGACRMRFQRDQGEARRTYALELAPRSAYVLGGESRSTWQHSI 167 Query: 190 QPLKAGFHPLTIDCRYNLTFRQAGK 214 +K RY++TFR Sbjct: 168 PAVK--------QERYSITFRTLKA 184 >UniRef50_UPI00005257D6 PREDICTED: similar to AlkB CG33250-PA n=1 Tax=Ciona intestinalis RepID=UPI00005257D6 Length = 312 Score = 142 bits (359), Expect = 5e-33, Method: Composition-based stats. Identities = 53/238 (22%), Positives = 83/238 (34%), Gaps = 43/238 (18%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGG--------------------- 54 G I+ F + P V P Sbjct: 74 PGLFIIPNSFTDKGRRMWANKCVTEYFKKPHETNVDPFDDNPDKMWESSCEYLQSNAGKG 133 Query: 55 ---YTMSVAMTNCG---HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG 108 Y S+++ C L W T + ++ + +P +P + A+ G Sbjct: 134 SCTYEDSLSLLKCCPIWKLRWATLGYHHNWNSKQY-SEQPCSELPSELRKTSKLFASMIG 192 Query: 109 YPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRL 168 DF+ +A ++N Y G LS H D E L AP+VS+S GL A+F GG ++ + L Sbjct: 193 TDDFKAEASIVNYYHVGNALSPHDDTSELYLEAPLVSLSFGLSAVFLIGGTSKDQKPEAL 252 Query: 169 LLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI--------------DCRYNLTFRQA 212 + GDV++ G SRL YH + + + R N+ RQ Sbjct: 253 FIRSGDVIIMSGASRLAYHAVPRILKPTSDAQLKSSSVADNLDLFMSHSRLNINIRQV 310 >UniRef50_D0NYX8 Alkylated DNA repair protein alkB-like protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NYX8_PHYIN Length = 309 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 63/199 (31%), Gaps = 26/199 (13%) Query: 38 NDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFH 97 + L W Y ++ + +P+ Sbjct: 104 LQNQQVPNIWRKACDSHPQDPAESPLLAKLCWAASGYHYDWTARKYHKGS-FSPVPELLQ 162 Query: 98 NLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFG 157 L + A G + +A ++N Y + + H D E + P+VS+SLG +F G Sbjct: 163 QLGTKCAAVCGM-TLEAEAVIVNYYKTKSSMGGHLDDVEYTMDHPVVSLSLGSKCVFLMG 221 Query: 158 GLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPL------------------------K 193 G +++ +LL GD+ + GG SR YHG+ + Sbjct: 222 GHTKDEAPLEILLRSGDIAIMGGASRTCYHGVARVLPTPFSMKSKELDALPLSNDDREQY 281 Query: 194 AGFHPLTIDCRYNLTFRQA 212 R N+ RQ Sbjct: 282 EAVRTYLGSQRININVRQV 300 >UniRef50_C6TKW1 Putative uncharacterized protein n=1 Tax=Glycine max RepID=C6TKW1_SOYBN Length = 311 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 60/219 (27%), Positives = 88/219 (40%), Gaps = 16/219 (7%) Query: 7 DAEPWQEPLAAGAVILRRFAF-NAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 L G V L+ + + E +++ ++ S G T C Sbjct: 94 SRSNVVVSLRPGMVFLKGYLSLSDQEMIVKRCRELGVGSGGFYQHGYGEDTKMHLKMMCL 153 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATA--AGYPDFQPDACLINRYA 123 W Y P P +P FH+ A A P PD C++N Y+ Sbjct: 154 EKNWDPQFGQY--GDRRPFDGAKPPQIPPEFHSHVHSALKDSNALLPSISPDICIVNFYS 211 Query: 124 PGAKLSLHQDKDE----PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 +L LHQDKDE L P++S S+G A F + + D K+LLL+ GDV+++G Sbjct: 212 ETGRLGLHQDKDESPDSLRLGLPVISFSIGDSADFLYADHRDLDQPKKLLLQSGDVLIFG 271 Query: 180 GESRLFYHGIQPLKAGFHP-------LTIDCRYNLTFRQ 211 G SR +HG+ + P R NLTFR+ Sbjct: 272 GPSRNLFHGVASIHPNTAPNLLLQHTNLCPGRLNLTFRR 310 >UniRef50_Q13686 Alkylated DNA repair protein alkB homolog 1 n=27 Tax=Euteleostomi RepID=ALKB1_HUMAN Length = 389 Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 78/207 (37%), Gaps = 22/207 (10%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSPF--------------------RQMVTPGGY 55 G + + F ++ + SQ P ++ + Sbjct: 97 PGFIFIPNPFLPGYQWHWVKQCLKLYSQKPNVCNLDKHMSKEETQDLWEQSKEFLRYKEA 156 Query: 56 TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD 115 T + L W T Y + + + P L ++ A A G+ DF+ + Sbjct: 157 TKRRPRSLLEKLRWVTVGYHYNWDSKKYSADH-YTPFPSDLGFLSEQVAAACGFEDFRAE 215 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 A ++N Y + L +H D+ E D P++S S G AIF GGL+R++ + + GD+ Sbjct: 216 AGILNYYRLDSTLGIHVDRSELDHSKPLLSFSFGQSAIFLLGGLQRDEAPTAMFMHSGDI 275 Query: 176 VVWGGESRLFYHGIQPLKAGFHPLTID 202 ++ G SRL H + + + Sbjct: 276 MIMSGFSRLLNHAVPRVLPNPEGEGLP 302 >UniRef50_D1IRG0 Whole genome shotgun sequence of line PN40024, scaffold_2.assembly12x (Fragment) n=2 Tax=rosids RepID=D1IRG0_VITVI Length = 457 Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats. Identities = 60/235 (25%), Positives = 97/235 (41%), Gaps = 32/235 (13%) Query: 6 ADAEPWQEPLAAGAVILRRFAFN-AAEQLIRDINDVASQSP-FRQMVTPGGYTMSVAMTN 63 A+ + + +G V+L+ + + ++++ ++ S F Q G +++ M Sbjct: 225 AEEGLKGDVIRSGMVLLKGYISSSDQVKIVKKCQELGLGSGGFYQPGYRDGGKLNLQMMC 284 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--------------- 108 LG + Y P N P +P F +L + A + Sbjct: 285 ---LGKNWDPETGKYEDERPVDNAKPPPIPDEFFHLVKEAIQDSQALLSKEKIEASKVEK 341 Query: 109 -YPDFQPDACLINRYAPGAKLSLHQDKDE----PDLRAPIVSVSLGLPAIFQFGGLKRND 163 P PD C++N Y +L LHQD+DE P+VS S+G A F + + Sbjct: 342 ELPWMIPDICIVNFYTTSGRLGLHQDRDETEETLRKGLPVVSFSIGDSAKFLYSNQRDVF 401 Query: 164 PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQ 211 +LLE GDV+++GGESR +HG+ + P R NLTFRQ Sbjct: 402 NADEVLLESGDVLIFGGESRRIFHGVASILPNTSPQVLLKETNLRPGRLNLTFRQ 456 >UniRef50_C3NYZ8 Alkylated DNA repair protein n=28 Tax=Bacteria RepID=C3NYZ8_VIBCJ Length = 202 Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 68/218 (31%), Gaps = 24/218 (11%) Query: 1 MLDLFADAEPWQEPL---AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTM 57 M LF D + +F + + + ++Q Sbjct: 2 MKKLFLDTHSASGEIKLTDGLLYWFPQFLTPIQAD--QAFQQMLTHLDWQQKSIRLFGKS 59 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC 117 + G GY YS + P L + AA P ++ Sbjct: 60 VLQPRLIAWYGEK----GYRYSGLSLSA----QPFPPPLLTLKTQCEQAAQAP---FNSV 108 Query: 118 LINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 L N Y G + HQD + P I S+SLG F K + L HGD+ Sbjct: 109 LANLYRDGQDSMGWHQDNEPELGSNPVIASLSLGESRRFLLRHHKDHALQVECELNHGDL 168 Query: 176 VVWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 ++ G ++ + H I + T R NLTFR Sbjct: 169 LIMAGNTQHFWQHAIPKTR-----QTKQTRINLTFRNI 201 >UniRef50_A1SVH4 DNA-N1-methyladenine dioxygenase n=12 Tax=Bacteria RepID=A1SVH4_PSYIN Length = 210 Score = 141 bits (355), Expect = 2e-32, Method: Composition-based stats. Identities = 52/224 (23%), Positives = 79/224 (35%), Gaps = 33/224 (14%) Query: 2 LDLFA--DAEPWQEPLAAGAVILRRFAFNAAEQ-----LIRDINDVASQSPFRQMVTPGG 54 +DLFA + P G V + ++ + + Sbjct: 4 MDLFAALEDSPINIINCDGVVEYHGLLIPFDQANHYFGVLLETIQW------KHDQANIL 57 Query: 55 YTMSVAMTNCGHLGWTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ 113 + V W Y YS + + PW L Q+ A G+ Q Sbjct: 58 GQIIVTQRKVA---WHADKPFHYTYSNMT-KVALPWTL---ELLQLKQKVEDATGH---Q 107 Query: 114 PDACLINRYAPGA-KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 171 +ACL+N Y G ++ H D + + A I S+S G F F N + L+ Sbjct: 108 FNACLLNLYHSGQEGMAWHSDAEKDLQKNAAIASLSFGAERKFSFKHKV-NQKTISVSLQ 166 Query: 172 HGDVVVWGGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 HG ++V GG+ R + H + P K P R NLTFR + Sbjct: 167 HGSLLVMGGDTQRHWLHRLPPTKKVTTP-----RINLTFRMINE 205 >UniRef50_C1FJB5 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FJB5_9CHLO Length = 418 Score = 141 bits (355), Expect = 2e-32, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 65/224 (29%), Gaps = 34/224 (15%) Query: 5 FADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 A G ++ F E+ + D + Sbjct: 184 CAATRDSATLGVPGVTLITDFVTEEEEREMLACVDSDERWQG-----------------L 226 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---YPDFQPDACLINR 121 + + Y D + MP L RAA+ D +N Sbjct: 227 AKRRVLHYGYAFDYGTRDARDKT--SPMPAFVAGLLGRAASCGAPGACESVHCDQLTVNE 284 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP----LKRLLLEHGDVVV 177 Y G ++ H D I+S+SL A+ +F + + + + + ++V Sbjct: 285 YVAGVGIAPHVDTHSA-FGPTILSLSLAGRAVMEFRLHEGGEKEPRERRAISMPPRSLLV 343 Query: 178 WGGESRL-FYHGIQPLKAGF------HPLTIDCRYNLTFRQAGK 214 GE+R + H I K + R + TFR+ + Sbjct: 344 LHGEARYRWLHYIPHRKRDAIVGEDECEAREERRVSFTFRRRRE 387 >UniRef50_Q07GB6 Oxidoreductase, putative n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GB6_ROSDO Length = 195 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 62/205 (30%), Gaps = 23/205 (11%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 Q+ G L + + D A Sbjct: 10 QDVWPDGLTYLENYISEDEAGRLVQEIDAALW------------------RTDLKRRVQH 51 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLH 131 + Y Y +P+ F +L +R G+ PD ++N Y PG +S H Sbjct: 52 YGYRYDYKARQAWREDYLGPLPELFQSLAERLTAE-GHFQTVPDQVIVNEYQPGQGISAH 110 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQP 191 D +P I S+SL + +F + ++ L +V+ L+ H I P Sbjct: 111 IDC-QPCFGETIASLSLLSACVMRFASRIYSQQMELHLQPSSLLVLQSDARHLWTHAIPP 169 Query: 192 LKAGF---HPLTIDCRYNLTFRQAG 213 K R +LTFR Sbjct: 170 RKTDVFEGQKYARARRISLTFRTMK 194 >UniRef50_C5L3Y2 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5L3Y2_9ALVE Length = 325 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 47/215 (21%), Positives = 70/215 (32%), Gaps = 30/215 (13%) Query: 9 EPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLG 68 + L G ++ F E+ + + D + Sbjct: 99 PTTTDELPPGLTLIPDFITEEEEEKLLGLVDAGEWDHSIR------------------RR 140 Query: 69 WTTHRQGYLYSPIDPQTNKPWPA--MPQSFHNLCQRA--ATAAGYPDFQPDACLINRYAP 124 + Y+ + + MP L +R + A DF+PD IN Y P Sbjct: 141 VQHFGHAFDYTSLRAKDAFLDGEARMPAYTEELVRRIRAESVAEARDFRPDQLTINEYIP 200 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 G +S H D PIV +S+G + +F L L + V GGESR Sbjct: 201 GVGISFHVDTHSA-FEGPIVILSIGGGIVLEFR-KSEEGRALPLWLPRRSLAVMGGESRF 258 Query: 185 -FYHGIQPLK-----AGFHPLTIDCRYNLTFRQAG 213 + HGI K + R +LTFRQ Sbjct: 259 GWVHGIAGRKTDRVGPDGDLVERQRRISLTFRQMK 293 >UniRef50_C0PA85 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0PA85_MAIZE Length = 389 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 59/306 (19%), Positives = 90/306 (29%), Gaps = 100/306 (32%) Query: 7 DAEPWQEPLAAGAVILRRFAFNAAEQ-LIRDINDVASQSPFRQMVTP------------- 52 D + G + + IR+ Q P R +T Sbjct: 83 DRPVFCFLDRPGFYFIPGALSTEEQCCWIRESLKTFPQPPNRTNLTAIYGSISDLLIAAE 142 Query: 53 ----------------------GGYTMS-------------------VAMTNCGHLGWTT 71 GG T S A T L W+T Sbjct: 143 NQQILVEAENPDIQERNKQNNCGGKTESKYFKFVDSESQKGEEHRSNAATTLVRKLRWST 202 Query: 72 HRQGYLYSPI-----------------------DPQTNKPWPAMPQSFHNLCQRAA--TA 106 + +S + + P +P + +L ++ A Sbjct: 203 LGLQFDWSKRTIRSKSPRNSQEKPLSSSRLVERNYDVSLPHNKIPGALASLAKKMAIPAM 262 Query: 107 AGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK 166 +F+P+A ++N Y P L H D E D PIVS+SLG IF GG R++ Sbjct: 263 PSGEEFKPEAAIVNYYGPSDMLGGHVDDMEADWTKPIVSISLGCKCIFLLGGKTRDEVPT 322 Query: 167 RLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC--------------------RYN 206 + L GD+V+ GE+R +HG+ + I R N Sbjct: 323 AMFLRSGDIVLMAGEARERFHGVPRIFTESDQQEIPALISQLSSGDDVFILEYIKNSRIN 382 Query: 207 LTFRQA 212 + RQ Sbjct: 383 INIRQV 388 >UniRef50_A0D9E2 Chromosome undetermined scaffold_42, whole genome shotgun sequence n=3 Tax=Oligohymenophorea RepID=A0D9E2_PARTE Length = 636 Score = 139 bits (351), Expect = 4e-32, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 64/207 (30%), Gaps = 27/207 (13%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 ++ G ++ F E+ I D+ D S Sbjct: 150 RQVNVPGLYLIHDFITPEYEKYIMDLIDKQEWS------------------KLKQRRVQH 191 Query: 72 HRQGYLYSPIDPQTNKPWPA-MPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSL 130 + ++Y ++P +P ++ + + P + + IN Y PG + Sbjct: 192 YGYEFIYGDNTVNVDQPAEKKIPAFLEDVRAKVSDLVK-PQAEINQLTINEYLPGMGIPP 250 Query: 131 HQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGI 189 H D P VS+SL + F K + L L + GE R ++H I Sbjct: 251 HFDVH-PPFHEKFVSISLLSGLVMSFKSYKGEEQ--HLYLPPRSCAFFTGEVRFAWFHSI 307 Query: 190 QPLKAGFHPLT---IDCRYNLTFRQAG 213 K R +LTFR Sbjct: 308 ASRKIDKIEGETHFRSRRLSLTFRTIR 334 >UniRef50_Q9U3P9 Protein C14B1.10, partially confirmed by transcript evidence n=2 Tax=Caenorhabditis RepID=Q9U3P9_CAEEL Length = 591 Score = 139 bits (351), Expect = 5e-32, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 77/217 (35%), Gaps = 25/217 (11%) Query: 5 FADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 +A ++ A I+ + + E+ + D+ T ++ + Sbjct: 127 LPEATKCEDFRPANLKIIEEYVSSDLEKELVDLV-----------------TNHPSVQSL 169 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 H + YS K +P ++L R + Y +PD N Y Sbjct: 170 KHRAVVHFGHVFDYSTNSASEWKEADPIPPVINSLIDRLISD-KYITERPDQVTANVYES 228 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 G + H D PIVS+SL + +F + + +LL+ + + GESR Sbjct: 229 GHGIPSHYDTHSA-FDDPIVSISLLSDVVMEFKDGANSARIAPVLLKARSLCLIQGESRY 287 Query: 185 -FYHGIQPLKAGFHPLT-----IDCRYNLTFRQAGKK 215 + HGI K P T R +LT R+ +K Sbjct: 288 RWKHGIVNRKYDVDPRTNRVVPRQTRVSLTLRKIRRK 324 >UniRef50_Q7MF65 Alkylated DNA repair protein n=15 Tax=Vibrionaceae RepID=Q7MF65_VIBVY Length = 203 Score = 137 bits (346), Expect = 2e-31, Method: Composition-based stats. Identities = 46/215 (21%), Positives = 73/215 (33%), Gaps = 23/215 (10%) Query: 2 LDLFADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 LF P + G + F + + + P+ Q + Sbjct: 3 SSLFLFDSPDWLTITDGQLLWWPTFLSQDQAET--YFTQLKHELPWEQKAIQMFGRQVLQ 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 W Y YS + PW + +L R A+G+ ++ L N Sbjct: 61 PRLQA---WCGD-AAYTYSGLT-MQPLPWTP---TLLDLKTRCENASGH---IFNSVLAN 109 Query: 121 RYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 Y G + HQD + R P I SV+LG F L N+ ++ L G +++ Sbjct: 110 LYRDGQDSMGWHQDDEPELGRNPVIASVNLGESRRFVLQHLITNEKIE-FELTSGSLLIM 168 Query: 179 GGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G ++ + H + T R NLTFRQ Sbjct: 169 AGSTQHYWRHCVPKTAK-----TKSERINLTFRQI 198 >UniRef50_A4CQ67 Alkylated DNA repair protein n=2 Tax=Flavobacteriaceae RepID=A4CQ67_9FLAO Length = 197 Score = 137 bits (344), Expect = 3e-31, Method: Composition-based stats. Identities = 49/211 (23%), Positives = 74/211 (35%), Gaps = 19/211 (9%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 E + A F + + ++ SQ+P+RQ Sbjct: 3 PSREDLENLPDATLRYQPGFLLPKEAESL--FGEIKSQTPWRQDTIRLFGKTFQQPRLTA 60 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 G Q Y YS I P+ + +L R + AAG + CL+N Y G Sbjct: 61 LYGKN--GQAYTYSGI-LMEPLPFTPL---LEDLLHRVSIAAGE---KFTTCLLNLYRDG 111 Query: 126 -AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 H D + P I S+SLG F + R+ LE G +++ G ++ Sbjct: 112 SDSNGWHADDEPELGNNPVIASLSLGASRKFHLKHRRIKSQRVRMNLESGSLLLMAGTTQ 171 Query: 184 -LFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 + H + K P R NLTFR+ G Sbjct: 172 HHWLHQVPKTKRPVGP-----RINLTFRRLG 197 >UniRef50_C0Z2F3 AT5G01780 protein n=9 Tax=Magnoliophyta RepID=C0Z2F3_ARATH Length = 217 Score = 136 bits (343), Expect = 5e-31, Method: Composition-based stats. Identities = 58/222 (26%), Positives = 98/222 (44%), Gaps = 35/222 (15%) Query: 19 AVILRRFAFNA-AEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG- 75 V+L+ F +++ ++ + F Q G + + M LG Q Sbjct: 1 MVLLKDFLTPDIQVDIVKTCRELGVKPTGFYQPGYSVGSKLHLQMMC---LGRNWDPQTK 57 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA---------------GYPDFQPDACLIN 120 Y + + P +P +F+ L ++A A P PD C++N Sbjct: 58 YR---KNTDIDSKAPEIPVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICIVN 114 Query: 121 RYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVV 176 Y+ +L LHQD+DE + PIVS S+G A F +G + + + ++LE GDV+ Sbjct: 115 FYSETGRLGLHQDRDESEESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGDVL 174 Query: 177 VWGGESRLFYHGIQPLKAGFHPLT-------IDCRYNLTFRQ 211 ++GGESR+ +HG++ + P++ R NLTFR Sbjct: 175 IFGGESRMIFHGVKSIIPNSAPMSLLNESKLRTGRLNLTFRH 216 >UniRef50_Q7D1B7 Putative uncharacterized protein n=1 Tax=Agrobacterium tumefaciens str. C58 RepID=Q7D1B7_AGRT5 Length = 195 Score = 136 bits (342), Expect = 5e-31, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 59/205 (28%), Gaps = 26/205 (12%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 L + F + E + D D S Sbjct: 7 PTLLPHDIMYFDGFLSSEDEAFVADRLDAGEWSTE------------------LKRRVQH 48 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLH 131 Y Y + +P +R GY PD + N Y G +S H Sbjct: 49 FGYRYDYKVRAVTPDAYLGPLPPWLGLFAERLVAD-GYCRTVPDQVIANEYLLGQGISAH 107 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQ 190 D P IVS+SL F L+ P R +L V+ G SR + H I Sbjct: 108 VDCV-PCFDDTIVSISLLSACEMVFRDLR--GPGIRSVLHPRSGVLLRGSSRYDWTHEIP 164 Query: 191 PLKAG---FHPLTIDCRYNLTFRQA 212 K+ R +LTFR+ Sbjct: 165 ARKSDIVNGVKTARSRRISLTFRKV 189 >UniRef50_Q26EI7 Alkylated DNA repair protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26EI7_9BACT Length = 201 Score = 135 bits (339), Expect = 1e-30, Method: Composition-based stats. Identities = 55/217 (25%), Positives = 79/217 (36%), Gaps = 25/217 (11%) Query: 3 DLFADAEPWQEPLAAGAVILRR--FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 +LF P + V +AF A+QL+ + ++P+RQ Sbjct: 4 NLFPSDFPD---IPDAQVQYDGNFYAFAEAQQLLSKLL---KKTPWRQNKITVYGKEHDE 57 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 G Y YS I PW + + Q A G + CLIN Sbjct: 58 PRLTQLYG--DPGIKYGYSNISYD-ALPWTE---TLQKIKQDVEKATG---ATFNICLIN 108 Query: 121 RYAPG-AKLSLHQDKDEPDLRAPI-VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 RY G H D ++ PI S+SLG F D + L+HG ++V Sbjct: 109 RYRNGQDSNGWHADNEKELGINPIIASISLGQERFFHLKHHHNKDWKFKFPLQHGSLLVM 168 Query: 179 GGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 GE++ + H I K I R NLTFR+ + Sbjct: 169 AGETQHTYKHQIAKTKR-----LIGERINLTFRKIVQ 200 >UniRef50_C6XT27 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT27_PEDHD Length = 201 Score = 135 bits (339), Expect = 1e-30, Method: Composition-based stats. Identities = 47/204 (23%), Positives = 74/204 (36%), Gaps = 19/204 (9%) Query: 14 PLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHR 73 P+ A F A + ++ Q ++Q + G Sbjct: 14 PIPGEAFFYPGFFTEAESD--QYFQELTHQVTWKQEPIKVFGKDILQPRFTAFYGDEA-- 69 Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 Y YS I AMP L + D + + CL+N Y G + H+ Sbjct: 70 TSYSYSGITLN------AMP-WIDTLTRIKENIETKFDVEFNTCLLNHYRSGADSIGWHR 122 Query: 133 DKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQ 190 D ++ + P I SVS G P IFQF P+ + L HG +++ +++ + H + Sbjct: 123 DNEKNLGQYPFIASVSFGAPRIFQFRHYTDKIPIISVELTHGSLLIMKADTQHLWEHRLP 182 Query: 191 PLKAGFHPLTIDCRYNLTFRQAGK 214 + P R NLTFR K Sbjct: 183 KILRPVGP-----RINLTFRLILK 201 >UniRef50_UPI00019271E1 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI00019271E1 Length = 350 Score = 135 bits (339), Expect = 1e-30, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 76/201 (37%), Gaps = 15/201 (7%) Query: 16 AAGAVILRR-FA-FNAAEQLIRDINDVASQSPFR------------QMVTPGGYTMSVAM 61 G ++++ F + A I+ ++ P+ + V SV Sbjct: 81 CPGFIVIQNPFICYGAQSHWIKQALTEYTKKPYPCNLDALMALDKDKTVWDISQEKSVEN 140 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 L WT + Y+ +D K + P+ L A + ++ + +IN Sbjct: 141 NFINQLRWTHMGYHFDYNIVDY-KAKEYYGFPKDLAELTVTIADVFKFQNYIAETGIINY 199 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y G+ + H D E +L P++S S G A+F GG R+ + + + GD+++ G Sbjct: 200 YPEGSSMGGHTDHYEEELSQPLISYSFGQAAVFLIGGPTRDIKPEGIWVRTGDIILMTGP 259 Query: 182 SRLFYHGIQPLKAGFHPLTID 202 SR +H + + Sbjct: 260 SRTAFHAVPCIITKNQKTIPH 280 >UniRef50_A4S4F4 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4S4F4_OSTLU Length = 343 Score = 135 bits (339), Expect = 1e-30, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 66/215 (30%), Gaps = 30/215 (13%) Query: 8 AEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHL 67 + + G ++ F E+ + + G +A H Sbjct: 106 TKASRRSSVEGLTLIENFVTVDEERALATL------------AATSGDETRLARRRVKHF 153 Query: 68 GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD--FQPDACLINRYAPG 125 + Y D K +P+ + +R + + D +N Y G Sbjct: 154 -----GYAFDYGTRDANL-KVVDEIPELAMEVLRRLPRETPGYEGAMRCDQVTVNEYPRG 207 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL- 184 L+ H D I+S+SL + +F + + + L ++V GESR Sbjct: 208 VGLAPHVDTHSA-FGDTILSLSLLGGTVMEFR--TSGEAHRAIYLPPRSLLVMHGESRYR 264 Query: 185 FYHGIQPLKAGFHP------LTIDCRYNLTFRQAG 213 + H I K D R + TFR+ Sbjct: 265 WQHYIPHRKFDTLEGEAAPTPRDDVRLSYTFRERR 299 >UniRef50_C6X2N0 2OG-Fe(II) oxygenase n=4 Tax=Bacteria RepID=C6X2N0_FLAB3 Length = 204 Score = 134 bits (336), Expect = 3e-30, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 73/220 (33%), Gaps = 23/220 (10%) Query: 1 MLDLFADAEPWQEPLAA---GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTM 57 M+ LF + L + I D+ ++ Sbjct: 1 MMSLFDNISDPNLNLLPKDGTVNYYGKILSEDESSSIYQ--DLLDNIEWKNDEAVIFGKT 58 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC 117 + G Y S T PW A + L + A G ++C Sbjct: 59 MITKRKVAWYGDREFSYTYSKSTK---TAIPWTA---TLLKLKKMVENATGE---AFNSC 109 Query: 118 LINRYAPGA-KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 L+N Y G + H D + + I S+SLG F F D ++ + LEHG + Sbjct: 110 LLNLYHSGEEGMGWHSDAEKDLKKNGAIASLSLGAERRFLFKHKHTADKVETV-LEHGSL 168 Query: 176 VVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 +V E++ + H + P + P R NLTFR + Sbjct: 169 LVMKNETQSFWQHRLPPARKILTP-----RINLTFRSIDE 203 >UniRef50_A4SZF3 DNA-N1-methyladenine dioxygenase n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SZF3_POLSQ Length = 209 Score = 133 bits (334), Expect = 4e-30, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 77/220 (35%), Gaps = 26/220 (11%) Query: 2 LDLFA-DAEPWQEPLAA--GAV-ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTM 57 LF+ D++ L G V F + + +N + + + Sbjct: 4 SSLFSIDSDLCATNLLPKEGLVNYYPEFL--GEVESLNLLNQLQKSLQWEADQLIIFGRL 61 Query: 58 SVAMTNCGHLGWTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA 116 W + Y YS + + + W + ++ P + ++ Sbjct: 62 ISTRRKVA---WIGDPKCTYTYSGVK-KQPQSWTP---ELLIIKRQLEE---LPQAEFNS 111 Query: 117 CLINRYAPG-AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 CL+N Y G + H D + E D ++PI S+SLG F F K L LE+G Sbjct: 112 CLLNFYHDGADGMGWHSDDEKELDAQSPIASLSLGSARKFSFKHKKDKS-TTSLFLENGS 170 Query: 175 VVVWGGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 ++ + + H + K P R NLTFR+ Sbjct: 171 ALIMHAPTQQFWQHALLKTKTIHTP-----RINLTFRRIS 205 >UniRef50_B0SGN3 Alkylated DNA repair protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SGN3_LEPBA Length = 202 Score = 133 bits (334), Expect = 5e-30, Method: Composition-based stats. Identities = 47/216 (21%), Positives = 78/216 (36%), Gaps = 21/216 (9%) Query: 2 LDLFADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 ++LF G V + F ++ + + ++Q Sbjct: 1 MNLFQRTPHSNLLPYDGTLVYIPEFLNG--KKSLEYFETFLTTILWKQDEAILYGKHITT 58 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 + Y YS +T PW +L + + + ++CL+N Sbjct: 59 KRSVAWY--AEKGYSYRYSGTT-KTAIPWT---NELLDLKKEVESET---NEIFNSCLLN 109 Query: 121 RYAPGA-KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 Y G+ ++ H D + + I SVSLG IF+F K+N + L LE G +++ Sbjct: 110 LYHDGSEGMAWHSDDETSLQKHSTIASVSLGAERIFRFKHKKKNS-VVELPLEPGSLLLM 168 Query: 179 GGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 GE + H + P R NLTFRQ G Sbjct: 169 KGEIQEHWLHSLPKALKVKRP-----RVNLTFRQFG 199 >UniRef50_Q12QK9 DNA-N1-methyladenine dioxygenase n=5 Tax=Shewanella RepID=Q12QK9_SHEDO Length = 255 Score = 132 bits (333), Expect = 7e-30, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 71/202 (35%), Gaps = 21/202 (10%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 L F +Q + D A PF + + GY Sbjct: 66 PPVTWLTGFLSVQEQQAL---LDDAKSYPFERPQIEVYGKLHPIPRQQVWFADED--CGY 120 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD 135 Y+ + + PWPA+ L QR G + L+N YA G + H D + Sbjct: 121 RYASL-FISPTPWPAL---LMQLRQRLQAELGL---VFNGVLVNFYADGQDTVGWHSDDE 173 Query: 136 -EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHGIQPLK 193 E + I S+S+G FQ KR+ L L GD+++ G + + H + Sbjct: 174 AEIRKPSSIASISIGATRDFQIRH-KRSQETFTLPLVSGDLLIMQPGMQQTWQHAVPRRA 232 Query: 194 AGFHPLTIDCRYNLTFRQAGKK 215 P R NLTFR+ + Sbjct: 233 KVKAP-----RINLTFRELVPQ 249 >UniRef50_A0C122 Chromosome undetermined scaffold_140, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C122_PARTE Length = 312 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 74/192 (38%), Gaps = 13/192 (6%) Query: 16 AAGAVILRRFAF-NAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG--------- 65 G + ++ F + ++ + + P+R + + Sbjct: 69 PQGVIQVKGFLNLDDQIRISKLCMNEYINQPYRTNLFIYKEDENFDKFIVHDDKRYHFNN 128 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD-FQPDACLINRYAP 124 + W Y ++ K +P + QRA + +Q ++ +IN Y Sbjct: 129 KIRWANVGYQYDWNNRQYPQEK--TQVPDPIQEISQRANNFLQLQNQYQSESVIINFYQS 186 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL 184 ++ H D E D +PI S S GL ++F GG +++ + L+ GD++V G +R Sbjct: 187 HDYMTGHLDDAELDQDSPIYSFSFGLSSVFVIGGPTKDEKPIAIKLDSGDLLVMSGHARK 246 Query: 185 FYHGIQPLKAGF 196 YHG+ + A Sbjct: 247 CYHGVPRVLADS 258 >UniRef50_Q2BPN4 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BPN4_9GAMM Length = 195 Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats. Identities = 47/212 (22%), Positives = 81/212 (38%), Gaps = 21/212 (9%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 ++ P ++ L ++ F ++ N ++++ +RQ + Sbjct: 2 SEKNPKRDDLTPDYTLITDFLSPDTAD--QNFNTLSNELEWRQDQIKMFGKLVAIPRLQN 59 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 +G Y YS + T W + + L + A+ + + +A LIN Y G Sbjct: 60 FMG--DPGIRYRYSGLTL-TASGWHPVVKKIKELAEAAS------NTEFNAVLINLYRDG 110 Query: 126 -AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-S 182 + H+D + P IVSVSLG F + LLL G ++V G E Sbjct: 111 QDSMGWHKDDEPELGPEPTIVSVSLGATRRFLLRAADKT--QHELLLNSGSLLVMGPELQ 168 Query: 183 RLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + + H I + P R NLTFR+ + Sbjct: 169 KHWQHSIPKTRKQIGP-----RINLTFRKIVQ 195 >UniRef50_C5BKX4 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BKX4_TERTT Length = 207 Score = 130 bits (328), Expect = 2e-29, Method: Composition-based stats. Identities = 45/221 (20%), Positives = 75/221 (33%), Gaps = 30/221 (13%) Query: 3 DLFADAEPWQEPLAAG----AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTP-GGYTM 57 D+FAD E + G I ++ A +++ ++ + + Q G + Sbjct: 4 DIFADTSR-SEIVDLGDNAWLDIFPQWIATAQTRVLFNLL--LQECEWEQPAIRIAGREL 60 Query: 58 SVAMTNCGHLGWTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA 116 + C W YS +P P L + A + ++ Sbjct: 61 PIPRLQC----WYGDKGAVLRYSGKS------FPPHP-WLKALAELNLQLATVCKRRFNS 109 Query: 117 CLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKR--LLLEH 172 L+N Y G + H D + P I S+SLG F + L Sbjct: 110 VLVNCYRDGSDSVGWHADDEPELGAKPVIASISLGATRRFSLKHKFDQQQKSSRHIQLRD 169 Query: 173 GDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 GD+++ G ++ + H IQ + P R NLTFR Sbjct: 170 GDLLIMRGNTQANWVHAIQKTTSSVGP-----RINLTFRNI 205 >UniRef50_A8PV44 ALKBH protein, putative n=1 Tax=Brugia malayi RepID=A8PV44_BRUMA Length = 339 Score = 130 bits (327), Expect = 3e-29, Method: Composition-based stats. Identities = 53/214 (24%), Positives = 90/214 (42%), Gaps = 15/214 (7%) Query: 16 AAGAVILRR-FAFNAAEQLIRDINDVASQSP-FRQMVTPGGYTMSVAMTNCGHLGWTTHR 73 G V+L F ++ Q I+ + ++SP F + +V + L W+T Sbjct: 128 RPGMVMLNDIFKSSSHLQWIKRSLFIYAESPGFTNVGLQVPNVRNVFKEHGRQLRWSTLG 187 Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQD 133 Y ++ +P+ +L + A G DA +IN Y+ + L+ H D Sbjct: 188 LHYDWATKIYPFEGEL--LPEELVSLSDVLSQALGIGPMYADAAIINFYSRKSTLAPHVD 245 Query: 134 KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLK 193 + E L +P++S+S G AI+ GG +DP+ + GDV+V G RL YH + + Sbjct: 246 RSERSLSSPLISLSFGQTAIYLAGGTDLDDPVDAFYIRSGDVLVIYGPQRLIYHAVPRIL 305 Query: 194 AGFHPLTIDC-----------RYNLTFRQAGKKE 216 + D R N+T RQ + + Sbjct: 306 QDTYFEDKDQPEEIVKYANTNRINITLRQVDEHK 339 >UniRef50_B6HH87 Pc20g14010 protein n=10 Tax=Leotiomyceta RepID=B6HH87_PENCW Length = 230 Score = 129 bits (325), Expect = 5e-29, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 64/206 (31%), Gaps = 33/206 (16%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G F EQ + + + P R + Sbjct: 14 PHGIFWQDNFIDAEHEQRLISVFTNELEWPDRPGRVS-----------------LHYGYS 56 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 + Y + P+ P L D PD + Y PGA + H D Sbjct: 57 FDYKTFGIDPDIPYKEFPGWLQPLIPT------TEDRPPDQVCLQYYPPGAGIPPHVDAH 110 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKA 194 +P + ++S+G PA F R + + L ++ G+SR + HGI+ K Sbjct: 111 KP--YDQLYALSIGAPATMIFR---RGEERIEVDLTPRSMMQMSGDSRLHWTHGIRKRKN 165 Query: 195 GFHP----LTIDCRYNLTFRQAGKKE 216 P R+++T+R E Sbjct: 166 DTLPDGTVRPRGERWSITYRWLRDGE 191 >UniRef50_C5KK00 Putative uncharacterized protein n=6 Tax=Perkinsus marinus ATCC 50983 RepID=C5KK00_9ALVE Length = 477 Score = 129 bits (325), Expect = 5e-29, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 68/195 (34%), Gaps = 18/195 (9%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLY 78 L +F E + ++ ++ + Q + Q Y Y Sbjct: 291 LTYLPKFV----ENPADALKELINEVLWEQGKVKIFGKEHLERRLTAFY--ADDGQQYRY 344 Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEP 137 S + PW P L + A G + C++N Y G + LH D ++ Sbjct: 345 SGGPLRVPSPWRRGPIVIDRLRKAVGEACGQE---FNCCVLNYYRDGSDSIGLHSDDEKV 401 Query: 138 DLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAG 195 P I VSLG F KR+ L G ++V GG ++ L+ H + K Sbjct: 402 LGVNPSIACVSLGAERDFVL-DAKRDKKKVELTPRSGSLLVMGGSTQKLWKHSVPSRKRE 460 Query: 196 FHPLTIDCRYNLTFR 210 P R +LTFR Sbjct: 461 HRP-----RVSLTFR 470 >UniRef50_A6EJU4 2OG-Fe(II) oxygenase n=1 Tax=Pedobacter sp. BAL39 RepID=A6EJU4_9SPHI Length = 202 Score = 129 bits (323), Expect = 8e-29, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 70/215 (32%), Gaps = 21/215 (9%) Query: 2 LDLFADAEPWQEPLAAGAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 L FAD + F ++ L+R P++Q + V Sbjct: 4 LSFFADRGQSPGLPKSLLEYHPGVFDDKESDMLLRKFIADC---PWQQKIVKMYDKEVVT 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 W + Y Y+ + W + ++ AG + ++ L+N Sbjct: 61 PRLTS---WYADEETYDYTSLRRAAPNIWTP---ELLMIREKVQAIAGL---RFNSVLLN 111 Query: 121 RYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 Y G ++ H D ++ P I SVS G F + + LE G +++ Sbjct: 112 YYRDGNDSVAWHSDNEKALGTHPLIASVSFGQVRCFDIRRKSDHSDKYSIRLESGALMIM 171 Query: 179 GGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G+ + + H + R NLTFR Sbjct: 172 KGDLQQHWEHRVAKSTKSMR-----ARVNLTFRVV 201 >UniRef50_Q5UR03 Uncharacterized protein L905 n=1 Tax=Acanthamoeba polyphaga mimivirus RepID=YL905_MIMIV Length = 210 Score = 129 bits (323), Expect = 8e-29, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 72/204 (35%), Gaps = 27/204 (13%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G I+ + E+ + + + Q Y Sbjct: 15 GFSIIHDYVTPDQEKKLLKKINESEWVVDYQ--------------------RRLQYYNYR 54 Query: 78 YSPIDPQTNKPWP-AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 +P P P +P+ L + D +PD ++N Y PG L H D+ + Sbjct: 55 NELFEPYDLIPIPNKIPKYLDQLINQMI-LDKIIDQKPDQIIVNEYKPGEGLKPHFDRKD 113 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAG 195 + I+ +SLG I +F K K++ + + + ++R + HGI P K Sbjct: 114 Y-YQNVIIGLSLGSGTIMEFYKNKPIPEKKKIYIPPRSLYIIKDDARYIWKHGIPPRKYD 172 Query: 196 ---FHPLTIDCRYNLTFRQAGKKE 216 + + R ++TFR K++ Sbjct: 173 EINGKKIPRETRISITFRNVIKEK 196 >UniRef50_UPI00006CCD66 hypothetical protein TTHERM_00483520 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CCD66 Length = 199 Score = 129 bits (323), Expect = 9e-29, Method: Composition-based stats. Identities = 45/212 (21%), Positives = 71/212 (33%), Gaps = 29/212 (13%) Query: 4 LFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 LF A+ + + G + E I ++ Q+ Sbjct: 8 LFDSAQTFDQVQ--GLRYIDSILTEEEEVFI--FKEIYQNEWNTQL-------------- 49 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 + Y YS N +P+ N CQR PD +IN Y Sbjct: 50 --KRRTQHYGYKYDYSIKSIDKNMFLGVLPKYAINFCQRLIDDKVIKVM-PDQMIINEYL 106 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 PG ++ H DK + I SVSLG I + + L L+ +++ ++R Sbjct: 107 PGQGINPHIDKTDI-FGETIFSVSLGSGCIMKL---TYGETEIDLYLKRRSILILEDKAR 162 Query: 184 L-FYHGIQPLKA---GFHPLTIDCRYNLTFRQ 211 F H I K+ + R +LTFR+ Sbjct: 163 YLFKHSIPSRKSDKIDGKTIQRSTRVSLTFRK 194 >UniRef50_B4VHI5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VHI5_9CYAN Length = 207 Score = 129 bits (323), Expect = 9e-29, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 64/208 (30%), Gaps = 23/208 (11%) Query: 8 AEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHL 67 + P G ++ + + + +I D S Sbjct: 15 KSDFIVPEIPGLNLIHDYINTQEQNQLLEIIDQQEWST------------------QLKR 56 Query: 68 GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK 127 + Y Y + +P + L QR P PD +IN Y PG Sbjct: 57 RVQHYGYRYEYQKRTLTSASYLGELPNWANQLGQRLVRDRVTPT-PPDQLIINEYLPGQG 115 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYH 187 ++ H D P I+S+SLG + L + LLL +++ + H Sbjct: 116 ITNHVDCV-PCFGNTIISLSLGSCCVMNLTHLPTQTQIPVLLLPGSLLILQRVARYQWQH 174 Query: 188 GIQPLKAG---FHPLTIDCRYNLTFRQA 212 GI K R +LTFR+ Sbjct: 175 GIPARKNDKYQGREFGRSRRVSLTFREV 202 >UniRef50_C1N0U9 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1N0U9_9CHLO Length = 408 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 64/228 (28%), Gaps = 50/228 (21%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 GA ++ F E + + + + + Sbjct: 144 PGATLILDFVTEDEEVAMLKSAEEDPRW-----------------QRLAKRRVLHYGYAF 186 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD----FQPDACLINRYAPGAKLSLHQ 132 Y D + AMP L RAA P + D +N Y PG L+ H Sbjct: 187 DYGTRDAKAPAGA-AMPSYAAALLDRAAALTDVPGVERALRCDQLTVNEYEPGIGLAPHV 245 Query: 133 DKDEPDLRAPIVSVSLGLPAIFQFGGLKRND------------PLKRLLLEHGDVVVWGG 180 D I++ S G A+ +F +R+ + L ++V G Sbjct: 246 DTHSA-FGGTILAASCGGGAVIEFRLHERDGDGDDDASRRVPSRRAAIYLPPRSLLVMAG 304 Query: 181 ESRL-FYHGIQPLKAGFHPLTI--------------DCRYNLTFRQAG 213 E+R + H + K R + TFR+ Sbjct: 305 EARYRWAHYVPHRKRDAVKRFGGDGGGAATEIARREGKRVSFTFRETR 352 >UniRef50_C7RA32 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RA32_KANKD Length = 207 Score = 128 bits (321), Expect = 1e-28, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 68/211 (32%), Gaps = 19/211 (9%) Query: 5 FADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 F + + A +L F + + ++ ++ + Sbjct: 8 FGQSSEIIQLKDAEIELLPHFLPAEEGGNLFE--NLLEAVDWQSETIRIAGVERLVPRLT 65 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 G Y YS + PW L +R ++ L N Y Sbjct: 66 AWYGDK--GASYTYSGV-IHHPIPWSE---QLLALKKRIEQVC---QTSFNSALFNLYRD 116 Query: 125 G-AKLSLHQDKDEPDLRAPI-VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES 182 G ++ H D + PI S+SLG P Q K D +L L G ++V G++ Sbjct: 117 GRDSVAWHSDDEPELGAKPIIASLSLGAPRSLQLKHKKHKDLRHKLTLTSGSLLVMRGDT 176 Query: 183 RL-FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 + + H + P + R N+TFR Sbjct: 177 QRCWQHQVPK-----EPAITEPRINITFRNI 202 >UniRef50_D2V609 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2V609_NAEGR Length = 279 Score = 128 bits (321), Expect = 1e-28, Method: Composition-based stats. Identities = 48/233 (20%), Positives = 77/233 (33%), Gaps = 46/233 (19%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 +LF+ + + GA+ ++ + E+ I + D + Sbjct: 53 ELFSTNP---KVVVPGAIYIKNYISEEEEERIMKLID--------------------SKA 89 Query: 63 NCGHLGWTTHRQGYLY--SPIDPQTNKPWPAM--------PQSFHNLCQRAATAAGYPDF 112 C + T GY Y + + T +P + F L +R G Sbjct: 90 WCHEICRRTQMYGYTYYHTRHNLPTMQPVNESSSNYQHLDLKEFDWLIERLVERDGLYKT 149 Query: 113 ---QPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLL 169 P CL+N Y +S H D I VSL P + ++L Sbjct: 150 DYGNPTQCLVNEYIGTQGISSHVDN-PGPFGDIITLVSLNKPIYMVLKLASNENIQTKIL 208 Query: 170 LEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTID--------CRYNLTFRQAG 213 LE + V +SR + HGI +K + P T + R +LTFR Sbjct: 209 LEPRSLFVMKDDSRFKWKHGITHMKQVYVPSTGETLIRDENYRRVSLTFRFIK 261 >UniRef50_B8HW11 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=B8HW11_CYAP4 Length = 217 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 65/198 (32%), Gaps = 20/198 (10%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLY 78 V F A + + S+ +R + G + Y Y Sbjct: 34 LVYYPHFFSLAESDRYLE--QLTSEIDWRHEPIKVYGREILQPRLTAWYGDA--GKSYTY 89 Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEP 137 S I+ +PW A + + Q T AG ++ L+ Y G + H D + Sbjct: 90 SGIN-MQPQPWTA---ALLTIKQEIETIAGV---IFNSVLLTLYRDGQDSMGWHSDDEPE 142 Query: 138 DLRAPI-VSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGG-ESRLFYHGIQPLKA 194 PI SVS G FQ R D + L HG ++ G + H I Sbjct: 143 LGTNPIIASVSFGATRKFQLRHKSRKDLDKVVINLSHGSFLLMAGITQHHWQHQIPKTTK 202 Query: 195 GFHPLTIDCRYNLTFRQA 212 +P R NLTFR Sbjct: 203 VTNP-----RINLTFRIV 215 >UniRef50_A3M1I5 DNA repair system n=12 Tax=Acinetobacter RepID=A3M1I5_ACIBT Length = 203 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 47/214 (21%), Positives = 76/214 (35%), Gaps = 21/214 (9%) Query: 2 LDLFADAEPWQEPLA-AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 LDLF EP L G V A E + + + +R + Sbjct: 3 LDLF-SPEPCSNLLPYDGEVQDYGCILTAEEAE-QYFHYLYHHLAWRHDEAKLYGKHFIT 60 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 G Y YS + + + PW ++ L Q+ + ++CL N Sbjct: 61 PRKVAWYGDEH--YRYKYSGV-FRDSLPWD---KALAQLKQQVEQIL---SEKFNSCLAN 111 Query: 121 RYAPG-AKLSLHQD-KDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 Y G ++ H D I S+S G F F ++ + ++ + L+ G ++V Sbjct: 112 LYEDGTQGMAWHSDSDVSLARTTTIASLSFGATRKFSFRHIQTKEKVE-MWLQPGQLIVM 170 Query: 179 GGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 GE + + H + P R NLTFRQ Sbjct: 171 RGETQQYWQHRLNRSTKILQP-----RINLTFRQ 199 >UniRef50_Q22MH4 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22MH4_TETTH Length = 403 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 66/195 (33%), Gaps = 46/195 (23%) Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAA-----TAAGYPDFQPDACLI 119 + W+ Y + MP + L + A D++P+A ++ Sbjct: 190 KKIRWSNVGAQYDWDNRLY--PSFTTPMPDIINELAEFAKNVVSDEITDVYDYEPEAVIV 247 Query: 120 NRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 N Y +S H D E D ++PI S + G IF G ++ L L+ GD+++ Sbjct: 248 NYYDKKNYMSGHLDDGEKDQKSPIFSFTFGCSCIFLMGDRTKDFTPLPLRLDAGDLMIMS 307 Query: 180 GESRLFYHGIQPLKAGFHPLT--------------------------------------- 200 G SR YHG+ + G P Sbjct: 308 GYSRNCYHGVPRIFPGSFPKEEFEKYVRELYPHLYDNEEINKNKDLKFFENNYRHAINYL 367 Query: 201 IDCRYNLTFRQAGKK 215 D R NL FRQ KK Sbjct: 368 QDSRINLNFRQVEKK 382 >UniRef50_B5Y3R7 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y3R7_PHATR Length = 352 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 47/183 (25%), Positives = 70/183 (38%), Gaps = 13/183 (7%) Query: 30 AEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPW 89 E + + S + + L W T Y ++ Sbjct: 127 EEWKLEQMETYTEASQMTSKSSSRPK-----YRSFRKLSWATMGYHYDWNTRSYNEKAK- 180 Query: 90 PAMPQSFHNLCQRAATA-----AGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIV 144 MP+ + + A P F A ++N Y P + + H+D E L PIV Sbjct: 181 SPMPKLLERIAEIFAATSLLVDGQDPCFTASASIVNFYTPKSMMGGHRDDLEHALDKPIV 240 Query: 145 SVSLGLPAIFQFGGLKRNDPL-KRLLLEHGDVVVWGGESRLFYHGIQPLKAGFH-PLTID 202 S+SLG PA+F GG ++D +L+ GDV++ GG SRL YHG+ L P Sbjct: 241 SISLGRPAVFLLGGNTKDDQPVVAILVRPGDVMMMGGASRLRYHGMARLLPTTGLPSVEK 300 Query: 203 CRY 205 R Sbjct: 301 DRV 303 >UniRef50_D2V5F7 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V5F7_NAEGR Length = 460 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 64/223 (28%), Gaps = 45/223 (20%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLY 78 ++ F E I + + + Q + + Y Sbjct: 1 LYVIENFITEEEEAKIMEQVEPKNWVIELQ------------------RRVQHYGFKFDY 42 Query: 79 SPIDPQTNKPWPAMPQSFHNLCQR--------------------AATAAGYPDFQPDACL 118 N +P+ N+ R + + + PD Sbjct: 43 DIRSIDFNTQVEPIPEYTTNIMNRMKEAMKKKKEENDSTIMSDEFISTFDFETYNPDQLT 102 Query: 119 INRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKR--LLLEHGDVV 176 IN Y PG + H D P + VS+ A+ F + +++ + L ++ Sbjct: 103 INEYQPGQGIRPHIDVHTP-FNDGLFIVSMLGSAVMYFSKCVGEEVVEKKYVDLPRRSLL 161 Query: 177 VWGGESRL-FYHGIQPL---KAGFHPLTIDCRYNLTFRQAGKK 215 + GE+R + H I + R +LT R K+ Sbjct: 162 ILVGEARYLWRHAIMCRELDRVNGKIRKRQRRVSLTIRSVRKE 204 >UniRef50_A9EAY0 2OG-Fe(II) oxygenase n=2 Tax=Flavobacteriales RepID=A9EAY0_9FLAO Length = 200 Score = 126 bits (316), Expect = 5e-28, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 68/195 (34%), Gaps = 19/195 (9%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLY 78 +F + I + +++P++Q + G T+++ Y Y Sbjct: 19 ITYFPKFIKASEATCIFETL--LNETPWQQDDIKVFGKVYAQPRLTALYG--TNQKSYSY 74 Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEP 137 S I + ++ +L + L+N Y G H D ++ Sbjct: 75 SNIKMTPL----PLTETLKSLKNKVDIVCQTD---FTTLLLNYYRDGKDSNGWHADNEKE 127 Query: 138 DLRAPI-VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAG 195 + PI S+S G F ++ L+HG +++ GE++ + H I Sbjct: 128 LGKNPIIASLSFGQERFFHLKHRTDKTLKHKIALQHGSLLLMKGETQHKWLHQIPKTAK- 186 Query: 196 FHPLTIDCRYNLTFR 210 + R N+TFR Sbjct: 187 ----QLHGRINITFR 197 >UniRef50_A1ZYS3 Alkylated DNA repair protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZYS3_9SPHI Length = 185 Score = 125 bits (315), Expect = 7e-28, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 57/208 (27%), Gaps = 32/208 (15%) Query: 12 QEPLAA--GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGW 69 L G + F E+ + A+ Sbjct: 3 NSVLPPIEGLEYVPDFVNKKEEKQLLKEIASATWEDLYV------------------RRV 44 Query: 70 TTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLS 129 + Y + +P L Y + PD ++N Y G + Sbjct: 45 QQYGYRYHFLKRTMDHVSTHTPLPGWAAQLTHAFL-IKQYLNTLPDLLIVNEYKVGEGIK 103 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP-LKRLLLEHGDVVVWGGESR-LFYH 187 H D I+ VSLG I + + + L L ++V GE R + H Sbjct: 104 PHID-SPLLFGETILIVSLGADCIMELEPMPEAGQGKQTLSLAARSLLVMQGEVRHHWQH 162 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 I ++ R +LTFR + Sbjct: 163 SIVNVQK--------RRVSLTFRTVKDE 182 >UniRef50_Q0U5B3 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0U5B3_PHANO Length = 298 Score = 125 bits (315), Expect = 8e-28, Method: Composition-based stats. Identities = 43/174 (24%), Positives = 68/174 (39%), Gaps = 17/174 (9%) Query: 42 SQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQ 101 + +S+ L WTT Y ++ P P P N+ + Sbjct: 128 RDPIAHPLDPSIHKPLSITQLLNKKLRWTTLGGQYDWTAKKYPDATP-PPFPADTKNMLE 186 Query: 102 RAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKR 161 + + A ++N Y+PG LS+H+D E ++S+SLG A+F G Sbjct: 187 SI-----FTTTRAQAAIVNLYSPGDTLSVHRDVAETS-SHGLISLSLGCDAVFVIG--TD 238 Query: 162 NDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 +D + L L G V G SR +HG+ + AG P FR G++ Sbjct: 239 DDKVLTLRLRSGSAVYMSGASRFAWHGVPQIVAGSCPGV--------FRGVGQE 284 >UniRef50_C8NRG2 DNA repair protein n=9 Tax=Actinomycetales RepID=C8NRG2_COREF Length = 237 Score = 125 bits (314), Expect = 9e-28, Method: Composition-based stats. Identities = 60/225 (26%), Positives = 97/225 (43%), Gaps = 24/225 (10%) Query: 6 ADAEPWQEPLAAGAVILRRFA-FNAAEQLIRDINDVASQSPFRQMVTPGGY----TMSVA 60 A +A+G V L + ++ + +A + MSV Sbjct: 17 ARFPRPSREVASGVVHLPDWLPLGEQAAVVEEARGIARSVAGTPLAMTRPQLRSGQMSVH 76 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP---------D 111 + + G W T+ Y + P +P SFH+L RA A Sbjct: 77 ILSLGQ-HWATNPYRY----VTSVGGVAVPPIPASFHDLAARALADAAALSPPLAAWSGK 131 Query: 112 FQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-RNDPLKRLLL 170 ++ +A L+N YAPG+ + +HQD +E AP++S+S+G IF+ G RN P + L Sbjct: 132 YRAEAALVNYYAPGSAMGMHQDANELS-EAPVISLSIGDTGIFRLGNTDNRNRPWVDVPL 190 Query: 171 EHGDVVVWGGESRLFYHGIQPLKAGFHPLTID---CRYNLTFRQA 212 GD++++GGE R +HG+ ++A P R N+T RQ Sbjct: 191 LSGDLIIFGGEHRRAFHGVPRIEADTAPEGCGLDRGRINITIRQV 235 >UniRef50_B7RVL5 Oxidoreductase, 2OG-Fe(II) oxygenase family n=2 Tax=unclassified Gammaproteobacteria (miscellaneous) RepID=B7RVL5_9GAMM Length = 218 Score = 125 bits (314), Expect = 9e-28, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 71/207 (34%), Gaps = 30/207 (14%) Query: 19 AVILR----RFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 V+ F L+ + D ++P+ Q + W + Sbjct: 23 LVLFPKVALGFDSAG---LLARLID---ETPWSQETIRLYGKTHLQPRLIA---WYGDPE 73 Query: 75 -GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 Y YS Q PW + + + + ++ L+N Y G + LH Sbjct: 74 AQYAYSGKQYQ-PIPWTPLLTTLKASVETLCAS------SFNSVLLNFYRDGADSMGLHA 126 Query: 133 DKDEPDLRAP-IVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGG-ESRLFYHGI 189 D + P I S+SLG F +R + ++L + V+ G + + HGI Sbjct: 127 DDEPELGTEPCIASLSLGEERTLYFKHKQRKELKPLNVVLPNASVLRMQGVTQQYWKHGI 186 Query: 190 QPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + + R NLTFR+ ++ Sbjct: 187 RKISR-----PCGPRVNLTFRRIYPRK 208 >UniRef50_A9AQD8 2OG-Fe(II) oxygenase n=43 Tax=Burkholderia RepID=A9AQD8_BURM1 Length = 226 Score = 125 bits (314), Expect = 9e-28, Method: Composition-based stats. Identities = 47/214 (21%), Positives = 69/214 (32%), Gaps = 28/214 (13%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 DLF + A + + + + TP G +T Sbjct: 28 DLFDETP------APDVDWYPDWLVPSEADRLLAALIDEVAWRQDTIRTPRGRIPLPRLT 81 Query: 63 NCGHLGWTTHRQG-YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 W Y+YS I + PA L + A G + ++ L+NR Sbjct: 82 A-----WQGEPDAVYVYSGIRNVPAQWTPA----VLELKRAVEAACG---ARFNSVLLNR 129 Query: 122 YAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 Y G + H D + AP I SVSLG +F L HG ++V Sbjct: 130 YRNGQDGMGWHADNEPELGDAPVIASVSLGAMRVFDLRHRA-TGATHAYRLTHGSLLVMR 188 Query: 180 GESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G ++ + H + P R NLTFR Sbjct: 189 GRTQAEWQHRVPK-----APSVHGERVNLTFRYI 217 >UniRef50_B4EMC2 2OG-Fe(II) oxygenase superfamily protein n=14 Tax=Proteobacteria RepID=B4EMC2_BURCJ Length = 240 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 49/215 (22%), Positives = 67/215 (31%), Gaps = 28/215 (13%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 DLFAD A + + TP G +T Sbjct: 46 DLFADTP------APDVDWCPDWLAPPEADRALATLIDEVAWRQDTIRTPRGRIPLPRLT 99 Query: 63 NCGHLGWTTHRQG-YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 W Y+YS I +PW L G + ++ L+NR Sbjct: 100 A-----WQGEPDAVYVYSGIR-NVPQPWTP---GVLALKHAVEATCGV---RFNSVLLNR 147 Query: 122 YAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 Y G L H D + AP I SVSLG +F L HG ++V Sbjct: 148 YRNGLDSLGWHADNEPELGDAPVIASVSLGAMRMFDLRHRT-TGATHTYRLVHGSLLVMR 206 Query: 180 GESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 G ++ + H + P R NLTFR+ Sbjct: 207 GRTQAEWQHRVPK-----APGVQGERINLTFRRVS 236 >UniRef50_A5WBM5 DNA-N1-methyladenine dioxygenase n=5 Tax=Moraxellaceae RepID=A5WBM5_PSYWF Length = 212 Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats. Identities = 52/219 (23%), Positives = 77/219 (35%), Gaps = 18/219 (8%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 M LFA A G V + L + + ++ P++ + Sbjct: 1 MSTLFAPAPTDNLLPYDGIVNDLGRLITDDKALYQQLL---AELPWQSDKVTLFGKTHIT 57 Query: 61 MTNCGHLGWT----THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA 116 +G T T Y YS PA+ H + Q+ Q ++ Sbjct: 58 TRQIVWMGDTPSASTQALSYTYSGHTRPIEPWHPAVFHVKHMIEQQLQPLKIC--TQFNS 115 Query: 117 CLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 CL+N Y G + H D + P I S+SLG F F K D ++ L LE G Sbjct: 116 CLLNYYPSGEEGMGYHADDEPELGYQPIIASLSLGATRKFVFKHKKTQDKVE-LYLESGQ 174 Query: 175 VVVWGGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 +VV G+ + + H I K R +LTFR Sbjct: 175 LVVMRGDTQQYWKHSITKTKK-----VDTGRISLTFRHM 208 >UniRef50_Q3AYK8 DNA-N1-methyladenine dioxygenase n=12 Tax=Cyanobacteria RepID=Q3AYK8_SYNS9 Length = 211 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 66/194 (34%), Gaps = 19/194 (9%) Query: 21 ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSP 80 +L + Q + + + + Q + L Y YS Sbjct: 32 LLPGWLSTDDAQRWQLLLE--HNISWEQPLVQVFGKYHRVPRKTVFL--AEQGLQYRYSG 87 Query: 81 IDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSLHQDKD-EPD 138 + W P+ FH L ++ A Q + CL+N Y G ++ H D + E D Sbjct: 88 A-IHVGEGW---PEWFHPLVEQVNHIA---QAQFNGCLLNLYRDGDDRMGWHADDEPEID 140 Query: 139 LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG-GESRLFYHGIQPLKAGFH 197 PI S+SLG F F + L GD+++ G + H + + Sbjct: 141 QTQPIASLSLGSTRDFLFRHRGDQPKRAAIPLADGDLLIMHPGCQGHWMHSVPQRRK--- 197 Query: 198 PLTIDCRYNLTFRQ 211 R NLTFR Sbjct: 198 --VKTMRINLTFRH 209 >UniRef50_A6EGN8 Alkylated DNA repair protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EGN8_9SPHI Length = 197 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 41/175 (23%), Positives = 62/175 (35%), Gaps = 17/175 (9%) Query: 39 DVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHN 98 +++ P++Q + G Y YS I PW Sbjct: 33 QLSANVPWKQEPIKIFGKTVLQPRFTAFYGEE--GVSYSYSGITMN-ALPWTP---ELAE 86 Query: 99 LCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQF 156 + + Q +ACL+N Y G + H+D + P I SVS G FQF Sbjct: 87 IRSAIQQKTAH---QFNACLLNFYRDGSDSMGWHRDNERNLGPYPTIASVSFGAHRTFQF 143 Query: 157 GGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFR 210 P+ L L G +++ GE++ + H + P R NLTFR Sbjct: 144 RRYVEKLPVVSLDLTSGSLLLMKGETQHLWEHRLPKTTMPIGP-----RINLTFR 193 >UniRef50_B8KWS1 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KWS1_9GAMM Length = 209 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 70/224 (31%), Gaps = 30/224 (13%) Query: 2 LDLF--ADAEPWQEPLAAGAVILRRFAFN--------AAEQLIRDINDVASQSPFRQMVT 51 +DLF E L FA + EQ N++A + Q Sbjct: 1 MDLFVKPSDGFSVEELIPDDK--SDFASSAKLYRSVFDDEQCRTLFNNLAHSIAWEQREI 58 Query: 52 PGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPD 111 + G Y YS + + L + Sbjct: 59 TLFGKRHLQPRLIAWYG--DGGASYTYSGLKLRPR-------PWVVPLMEIKTACEAVAG 109 Query: 112 FQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLL 169 + + L+N+Y G + H D + P I SVS G F + + ++ Sbjct: 110 ARFNGVLLNQYRDGNDAMGWHSDNETELGTNPTIASVSFGASRRFDLRHKRTKETIRS-W 168 Query: 170 LEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 L +G ++V G+++ + H + K D R NLTFR Sbjct: 169 LPNGSILVMSGQTQTDWVHQVPRTKK-----VGDARINLTFRWV 207 >UniRef50_B8ESE9 2OG-Fe(II) oxygenase n=1 Tax=Methylocella silvestris BL2 RepID=B8ESE9_METSB Length = 202 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 50/216 (23%), Positives = 79/216 (36%), Gaps = 34/216 (15%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 +DLF + L G + AAEQ + D A SPFR + + Sbjct: 6 MDLF------ERSLLPGLRLGENIISAAAEQTLISAIDAARLSPFR-------FQGWLGK 52 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 GW + + P +P+ L + AA AG P L+ R Sbjct: 53 RVTASFGWRYDFETASFG--------PAEPIPEFLLPLRESAAGFAGLPTGALAQALLIR 104 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGG 180 Y PGA + H+D+ L ++ +SLG PA+ +F L + G Sbjct: 105 YDPGAGIGWHRDR---PLFEHVIGISLGAPAVLRFRRRTAAGFDRANAPLAPRSIYHLSG 161 Query: 181 ESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 ++R + H I + R+++TFR +K Sbjct: 162 DARHLWEHSIAQVDV--------ARWSITFRSLSEK 189 >UniRef50_Q00Z84 2-Oxoglutarate-and iron-dependent dioxygenase-related proteins (ISS) n=1 Tax=Ostreococcus tauri RepID=Q00Z84_OSTTA Length = 232 Score = 124 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 64/204 (31%), Gaps = 29/204 (14%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G ++ F E+ + + G +A H + Sbjct: 12 PGLTLIENFVSVDEERALVTLARE------------SGEETRLARRRVKHF-----GYAF 54 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDF--QPDACLINRYAPGAKLSLHQDK 134 Y N+ A+P + +R + + D +N Y G L+ H D Sbjct: 55 DYGTR--DANERCEAIPSLALEILKRLRSDMIGYQSAIRCDQVTVNEYPRGTGLAPHVDT 112 Query: 135 DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 I+S++L A+ +F + L L ++V ++R + H I K Sbjct: 113 HSA-FGETILSLTLEGCAVMEFRTSAEENR--ALFLPRRSMLVLSADARYRWQHYIPHRK 169 Query: 194 ----AGFHPLTIDCRYNLTFRQAG 213 G D R + TFR+ Sbjct: 170 FDNVEGETIARDDVRLSYTFRERR 193 >UniRef50_B0T8K7 2OG-Fe(II) oxygenase n=3 Tax=Alphaproteobacteria RepID=B0T8K7_CAUSK Length = 215 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 44/215 (20%), Positives = 66/215 (30%), Gaps = 28/215 (13%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 LF + PL G + A EQ + P+ + Sbjct: 14 SLFDLPIAPRTPLPEGFRHQTKLITPAEEQALVAQFTDLDFQPYEH-------KGYLGHR 66 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 GW +T +P +P L + A G+ L+ Y Sbjct: 67 RVAGFGWRRGP-----DGALVETGEP---LPDFLAPLLDKVAAFTGFARNTFAHALVTEY 118 Query: 123 APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGGE 181 APGA + H+D+ I VSL P F+ + E + G Sbjct: 119 APGAGIGWHRDR---PPAIAIAGVSLLSPCTFRLRRRSGQAWERASIEAEPRSAYLMSGP 175 Query: 182 SR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 SR + H I P+ A RY++TFR + Sbjct: 176 SRSQWQHSIPPVDA--------LRYSVTFRTVPTR 202 >UniRef50_B2SPH7 DNA repair system specific for alkylated DNA n=17 Tax=Xanthomonadaceae RepID=B2SPH7_XANOP Length = 202 Score = 123 bits (308), Expect = 4e-27, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 73/202 (36%), Gaps = 23/202 (11%) Query: 17 AGAV--ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 GA R + +A + Q+ ++ M + W + Sbjct: 8 PGAEIDWWRGWLPHAQADALMQAL--LVQAHWQLHRIRMFGRMVDSPRLSS---WIGDPE 62 Query: 75 -GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQ 132 Y YS + +PW + + R G+ + ++ LINRY G + H Sbjct: 63 ASYRYSGTRF-SPQPWLEV---LQPVRLRLEDETGH---RFNSVLINRYRSGSDAMGWHS 115 Query: 133 DKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQ 190 D + P I SVSLG F F + L L HGD+++ GG+++ Y H + Sbjct: 116 DDEPELGAQPLIASVSLGARRRFAFKHRDDASVKQALELGHGDLLLMGGQTQRHYRHALP 175 Query: 191 PLKAGFHPLTIDCRYNLTFRQA 212 R NLTFRQ Sbjct: 176 RTAKPV-----GERINLTFRQV 192 >UniRef50_UPI000186D7D6 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D7D6 Length = 278 Score = 123 bits (308), Expect = 5e-27, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 73/216 (33%), Gaps = 29/216 (13%) Query: 17 AGAVILRR-FAFNAAEQLIRDINDVASQSPF------RQMVTPGGYTMSVAMTN------ 63 G ++ F F I + ++ P ++T S N Sbjct: 66 PGLFFIKNPFKFIGQRYWIIRCLEYYTRKPNKLNIDIHNILTEEDDWWSYCKKNFNTANG 125 Query: 64 ---CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 L W T + + + + P+ + LC A + F +A ++N Sbjct: 126 KLVLDKLRWVTFGYHHNWDTKVYSESSK-SSFPEDLNLLCNHFANNFFFEGFNAEAAIVN 184 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 Y + LS H D E ++ AP+ S S G AIF GG ND +L+E GDV++ Sbjct: 185 MYHLNSTLSGHTDTSELNINAPLFSFSFGQSAIFLIGGKFINDSALPILVESGDVLIMSE 244 Query: 181 ESRLFYHGIQPLKAGFHPLT-----IDCRYNLTFRQ 211 + L R N+ RQ Sbjct: 245 I-------VDKLSDDVEWEPFNSYINRARINMNVRQ 273 >UniRef50_D0N998 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N998_PHYIN Length = 292 Score = 122 bits (307), Expect = 6e-27, Method: Composition-based stats. Identities = 52/222 (23%), Positives = 87/222 (39%), Gaps = 22/222 (9%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVAS--QSPFRQMVTPGGYTMSVAMTNCGHLGWTTH 72 L G +I+++F +Q + D + + F + G + G W Sbjct: 70 LLPGLLIIKQFLTPQEQQELVDDSRCMGLGEGGFYKPTYASGAKCRLHQMCLG-RHWNVK 128 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------YPDFQPDACLINRYA 123 + Y + P +P S+ QR+ AA PD C++N Y Sbjct: 129 TEKYEDQRSNYDYA-PIRTLPDSWKTYAQRSLDAAKKIDPLVMGSCKKMTPDICVVNFYK 187 Query: 124 PGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQF--GGLKRNDPLKRLLLEHGDVVV 177 + +H DKDE D + +P++S S+G A F + ++ + + LE GD +V Sbjct: 188 KAGRNGMHIDKDESDEAMSMGSPVISFSVGCAAEFAYIDHYPDPHEAVPIVRLESGDALV 247 Query: 178 WGGESRLFYHGIQPLKAGFHP---LTIDCRYNLTFRQAGKKE 216 +GG +R H + + P R NLTFR+ E Sbjct: 248 FGGPARTVVHALTRVYNNTQPSWLRMRSGRLNLTFREYKPSE 289 >UniRef50_B5JS77 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JS77_9GAMM Length = 199 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 38/179 (21%), Positives = 60/179 (33%), Gaps = 17/179 (9%) Query: 37 INDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSF 96 ++ ++Q + Y YS P A P + Sbjct: 33 FCELVGAFNWQQRSLSIYGRTCLTPRLVAW--CADDGVNYTYSG----DTAPRQAWPIAL 86 Query: 97 HNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSLHQDKDEPDLRAP-IVSVSLGLPAIF 154 L ++ P + L N Y G + H D + P I S+SLG P F Sbjct: 87 LRLRRQLEVFCQVP---FNGVLANYYRDGDDSMGWHSDDERSLGPRPCIASISLGAPRDF 143 Query: 155 QFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 F L + L+HG +++ GE++ + H + + P R NLTFR Sbjct: 144 AFRPLNGGKQRHNICLDHGSLLIMQGETQKHWQHALPRRRRVNQP-----RLNLTFRHI 197 >UniRef50_B2AC29 Predicted CDS Pa_2_14240 n=1 Tax=Podospora anserina RepID=B2AC29_PODAN Length = 298 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 70/242 (28%), Gaps = 60/242 (24%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G ++ F A EQ + S ++P ++ + Sbjct: 39 GLTLVHEFISPAEEQEMISAFHAIS------PLSPADSKRRISQ---------HFGHHFD 83 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQR--AATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y+ +K +P N R T +PD + Y PGA + H D Sbjct: 84 YTTFGIDESK-HSPVPAYITNFLDRLPVDTDGKEAGRKPDQFTVQYYPPGAGIPPHVDTH 142 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRND-------------------------------- 163 + S+S G F N+ Sbjct: 143 S-MFGEALYSLSFGSGVPMIFRMSGENEARKLRLPKRSLQESSDGNVNGKVGGEILDKAE 201 Query: 164 -----PLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHP---LTIDCRYNLTFRQAGK 214 P L+L ++V G SR + HGI+P K + + RY++T R + Sbjct: 202 GVVVHPAWELMLPARSLLVMRGASRYGYTHGIRPRKTDAVDGITVKREGRYSITMRSVRR 261 Query: 215 KE 216 E Sbjct: 262 GE 263 >UniRef50_UPI00006CC0FF hypothetical protein TTHERM_00219000 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC0FF Length = 254 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 75/206 (36%), Gaps = 25/206 (12%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G + + +++ + D + H G + + Sbjct: 36 GLMYFPNYVNEKEAEILSNTVD-------------SNQWLVNIRRRQQHYGVVYYHTRHN 82 Query: 78 YSPIDPQTNKPWPAM-PQSFHNLCQRAATAAGYP-DFQPDACLINRYAPGAKLSLHQDKD 135 S I P++++ A+ F L QR + + P+ CL+N Y KL H + Sbjct: 83 LSEIQPESSESEKALDLSVFDWLIQRLINDEVFDVSYPPNQCLVNEYDNKDKLGCHVENI 142 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKA 194 E I +SL P+ ++ + +L LE + V +SR + HG+ +K Sbjct: 143 EA-FGPIIAGLSLHNPSYLALREVENKENKVQLYLEPRSLYVLTSDSRYKWEHGVTKMKE 201 Query: 195 GFHPLTID--------CRYNLTFRQA 212 ++P+T R +LTFR Sbjct: 202 IYNPITQQTIIKNETYRRVSLTFRHV 227 >UniRef50_A4H8G0 Putative uncharacterized protein n=3 Tax=Leishmania RepID=A4H8G0_LEIBR Length = 440 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 65/193 (33%), Gaps = 14/193 (7%) Query: 17 AGAVILRRFAFN-AAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH-LGWTTHRQ 74 G ++ ++ R+ S + + + + W T Sbjct: 85 PGLLLFPGVLSEAEQQRWCREAVLDYGDSEHHPNILSTHARAPQSTSCYQPPMRWATLGY 144 Query: 75 GYLYSPIDPQTNKPWPAMPQSFHN----LCQRAATAAGY-------PDFQPDACLINRYA 123 Y ++ ++ + P + L A ++P ++N Y Sbjct: 145 SYEWTQKVYHRDR-YSTFPSALRQRMCDLVSLVAEVRQDGFCCAYPDTYEPQTAIVNYYP 203 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 G+ + HQD E L P++S+SLG A+F G R D LL GDV + G SR Sbjct: 204 VGSMMMCHQDVSEETLEQPLMSLSLGCSAVFLMGTQSREDAPHAFLLRSGDVAAFTGPSR 263 Query: 184 LFYHGIQPLKAGF 196 +H + Sbjct: 264 AAFHSTPRILDDC 276 >UniRef50_A4AA20 Putative alkylated DNA repair protein n=1 Tax=Congregibacter litoralis KT71 RepID=A4AA20_9GAMM Length = 206 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 77/217 (35%), Gaps = 21/217 (9%) Query: 1 MLDLFADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSV 59 M +LF +++P + L G ++ R A Q + + + + +R+ + Sbjct: 1 MTELFPNSDPEEIDLPGGELLLYRAADLGADPQELFENLE--RELAWREEPIQLFGKRYL 58 Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 Y YS I PW L +R + D + ++ L Sbjct: 59 QPRLLAWY--ADAGVSYKYSGIQ-HDPLPWTP---QLAVLRERVEALS---DARFNSVLA 109 Query: 120 NRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVV 176 N Y + LH D + P I S+SLG +F+ R D RL L G ++ Sbjct: 110 NLYRHHRDSMGLHADDERELGAQPVIASLSLGEERMFRLKHRHRKDLKPIRLPLASGMLL 169 Query: 177 VWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 + G ++ + H + R NLTFR Sbjct: 170 IMRGATQENWRHEVPK-----QSRPCGPRINLTFRYV 201 >UniRef50_Q15YR0 DNA-N1-methyladenine dioxygenase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15YR0_PSEA6 Length = 210 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 67/218 (30%), Gaps = 28/218 (12%) Query: 2 LDLFADAEPWQ---EPLAAG-AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTM 57 LF + PL G + F + + +RQ Sbjct: 4 SSLFPSGSSDEAEILPLPDGDFRYFQHFLSSQEAD--NYYKRLLESLAWRQDDIKMYGKQ 61 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMP--QSFHNLCQRAATAAGYPDFQPD 115 G Y + P +P + H L + A+ + + Sbjct: 62 VKIPRLQAWYGDEDALYQY--------SGLNLPPIPWTEELHALKVQCEKAS---ESVFN 110 Query: 116 ACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHG 173 + L N Y G ++ H D + P I S+SLG F + RL LEHG Sbjct: 111 SVLANCYRDGQDSMAWHSDDEPELGTRPVIASLSLGQVRNFDLKHRT-SGQRHRLPLEHG 169 Query: 174 DVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 + + G S+ + H + P R NLTFR Sbjct: 170 SLFIMAGNSQTHWLHSLAKTTKSLAP-----RINLTFR 202 >UniRef50_A7AWB3 Putative uncharacterized protein n=1 Tax=Babesia bovis RepID=A7AWB3_BABBO Length = 336 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 46/201 (22%), Positives = 73/201 (36%), Gaps = 16/201 (7%) Query: 17 AGAVILRRFAFNAA-EQLIRDINDVASQSP----FRQMVTPGGYTMSVAMTNCGHLGWTT 71 G ++R F + L+ + P Q + L W T Sbjct: 89 PGLYLVRDFFTKEQCDALLLETLVDYINPPNNSNLYQNDPNVATPFWPSP-VFSKLRWAT 147 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC--LINRYAPGAKLS 129 Y + K + P ++ + + D+ PD C +IN Y+ L Sbjct: 148 IGHMYDWGTRTY---KGYTKFPGLLVDVTRDLLSHFN-EDYIPDVCAAIINFYSKAYFLR 203 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 LH+D E ++++SLG PAIF GG + ++E G VV+ +SR HGI Sbjct: 204 LHKDDAEET-DDSVLNISLGAPAIFMLGGTDHSTIPVSFVVESGSVVLMADKSRFCLHGI 262 Query: 190 QPLKAGFHP---LTIDCRYNL 207 L + P + NL Sbjct: 263 VKLLSYNKPGYQPSGGLPINL 283 >UniRef50_UPI000180B7B0 PREDICTED: similar to LOC496071 protein n=1 Tax=Ciona intestinalis RepID=UPI000180B7B0 Length = 288 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 66/201 (32%), Gaps = 28/201 (13%) Query: 21 ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSP 80 F + + + Q R+ + G +M +T W + Y YS Sbjct: 99 FFPNFLEKSDADWMLETLKNEVQWEHRRNLKYGPNSMEPRLTA-----WFSE-FSYSYSG 152 Query: 81 IDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDL 139 + N W + L R G ++ ++ L N Y G + H D + Sbjct: 153 VVQPPNPHWHPL---LAALRDRLNDLYG---YKFNSLLANLYRDGHDSVDWHTDAEPALG 206 Query: 140 R-APIVSVSLGLPAIFQFGGLKRN--------DPLKRLLLEHGDVVVWGGESRL-FYHGI 189 PI S+S G F+ + R+ L HG +++ G ++ + H + Sbjct: 207 NSPPIASISFGDTRNFELREITDIKTDEDLTYCKRIRVPLTHGSLLLMTGATQHDWQHRV 266 Query: 190 QPLKAGFHPLTIDCRYNLTFR 210 R NLTFR Sbjct: 267 PKEYHD-----RSARVNLTFR 282 >UniRef50_Q6C333 YALI0F03003p n=1 Tax=Yarrowia lipolytica RepID=Q6C333_YARLI Length = 330 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 60/242 (24%), Positives = 85/242 (35%), Gaps = 50/242 (20%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQM--------------------------V 50 G +I A + + P + Sbjct: 95 PGLLIYPNLMPPAVQSRLVTETIEQYLPPKEHLNNLDLFYDIPRPFNLFNNENPHAKILH 154 Query: 51 TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKP----WPAMPQSFHNLCQRAATA 106 GG S L W T Y ++ + P +P P++ + L R Sbjct: 155 REGGKPTSRDKIKNKQLRWVTLGGQYNWTTKAYPSFIPGTEGFPYFPKNLYELLSR---- 210 Query: 107 AGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND--- 163 P+A +IN Y+PG LS HQD E +VSVS+GL AIF G + +D Sbjct: 211 -PLFSINPEAAIINFYSPGDILSPHQDVAELSQDD-LVSVSIGLDAIFYVGLNRYDDSEN 268 Query: 164 ---PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDC--------RYNLTFRQA 212 P L+L GDV+V GG+SR YHGI + + P R N+ RQ Sbjct: 269 SLAPPLCLMLRSGDVIVMGGKSRHAYHGIGKVFSNTSPDLNSPYQDWLDTKRVNINVRQM 328 Query: 213 GK 214 + Sbjct: 329 LQ 330 >UniRef50_UPI000179247A PREDICTED: similar to alkB, alkylation repair homolog 8 (E. coli) (alkbh8) n=1 Tax=Acyrthosiphon pisum RepID=UPI000179247A Length = 382 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 63/218 (28%), Gaps = 50/218 (22%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 +G I+ F E + ++ Sbjct: 61 PKLIETSTSSPSGLEIIDNFITEEEEHFMLQYL----------------KKHWSESSSMK 104 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 H + + Y + + P +P+ F + Sbjct: 105 HRQVKHYGYEFDYDNNGVRYDSCDP-IPKEFEFILNAIYL-------------------- 143 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL- 184 ++ H D I+S+SL I +F ++ +LL+ +++ GESR Sbjct: 144 -RIPSHIDTH-GVFDEYILSLSLNSDIIMEFRKGNYHN---SILLKAKSLLIMSGESRFE 198 Query: 185 FYHGIQPLK-------AGFHPLTIDCRYNLTFRQAGKK 215 + HGI P K G + R ++TFR+ + Sbjct: 199 WSHGITPRKFDMINTADGPDIICRGTRISVTFRRVVQN 236 >UniRef50_B4RZB3 Alkylated DNA repair protein n=4 Tax=Proteobacteria RepID=B4RZB3_ALTMD Length = 213 Score = 120 bits (301), Expect = 4e-26, Method: Composition-based stats. Identities = 48/224 (21%), Positives = 74/224 (33%), Gaps = 27/224 (12%) Query: 3 DLF--ADAEPWQEPLA---AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPG-GYT 56 LF D E L A + + ++ P+RQ G Sbjct: 4 SLFETPDEESVPYQLPLTEADVRYFPNALSKNDADAFFE--RLKTELPWRQDTLRLFGKQ 61 Query: 57 MSVAMTNCGHLGWTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG-YPDFQP 114 + + W + Y YS + P S + R + Sbjct: 62 VKIPRLQS----WHGDPECTYTYSNLT----MPPNPWTSSLALIKARCEALCSPNYGTKF 113 Query: 115 DACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEH 172 ++ L N Y G +S H D + P I SV+LG F + + ++ LEH Sbjct: 114 NSVLANWYRDGQDSMSFHSDNEPELGTNPVIASVTLGEARPFVLKHKETKEKYTQI-LEH 172 Query: 173 GDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 G V++ G ++ Y HGI I R NLTFR ++ Sbjct: 173 GSVLIMAGATQSHYVHGIAKTAK-----PIGGRINLTFRHLIQR 211 >UniRef50_Q54BK8 2-oxoglutarate and Fe-dependent oxygenase family protein n=1 Tax=Dictyostelium discoideum RepID=Q54BK8_DICDI Length = 247 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 66/230 (28%), Gaps = 46/230 (20%) Query: 1 MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 M D F EP + G I+ + + + Sbjct: 1 MEDFFKKKEP---IIIEGLTIIENAIDKEMHDKLWKEVN---------------KEEWLT 42 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 + + Y Y ++ P PQ +LC DF P ++N Sbjct: 43 DLS---RRTQHYGYKYNYKSRSLKSEDIAPPFPQWASDLCCHLMKEGLINDF-PQQLIVN 98 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGG-------------LKRNDPLKR 167 Y G +S H D I S+SLG F + + Sbjct: 99 EYKDGQGISAHID--SKIFDNIIFSISLGSTCKMIFKKSIQPTTTTKTTTTTSEKAEVLK 156 Query: 168 LL--LEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + L ++ E+R + H I LK G H R +LTFR K Sbjct: 157 VEKQLAPRAFLLIKDEARFNWTHEIPKLKKGQH------RISLTFRFVSK 200 >UniRef50_B4JDW7 GH11262 n=4 Tax=Neoptera RepID=B4JDW7_DROGR Length = 221 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 62/224 (27%), Gaps = 48/224 (21%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 A + + F + EQ I + + + Q + Sbjct: 12 PATVMYIPNFITSDQEQCILSQIERTPKPRWTQ----------LLNRRLI---------- 51 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y + MP+ + + + + + L+N Y PG + H D Sbjct: 52 -NYGGVPHPNGMIAEEMPEWLQSYVDKVNNLGVFESQKANHVLVNEYLPGQGILPHTDG- 109 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPL-----KRLLLEHGDVVVWGGESRLFY-HGI 189 P I ++S G + +F + +LLLE +++ Y H I Sbjct: 110 -PLFYPIISTISCGAHTVLEFTKRETTGDAAGEVLFKLLLEPRSLLILKDSLYSDYMHAI 168 Query: 190 QPLKAGF-------------------HPLTIDCRYNLTFRQAGK 214 + H + R +LT R K Sbjct: 169 SEINEDTLCDRICNYNLCENTYKIGDHLVRRAPRISLTIRNVPK 212 >UniRef50_Q6ZEA1 Slr7097 protein n=5 Tax=Bacteria RepID=Q6ZEA1_SYNY3 Length = 208 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 73/218 (33%), Gaps = 23/218 (10%) Query: 1 MLDLFADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSV 59 + D++ + P + ++ G + + D + + ++Q Sbjct: 7 LFDIYPNGSPEEIIISDGHLQLYHSIFSDVEASRYYDRLE--KEICWQQDSIILFGKSQP 64 Query: 60 AMTNCGHLGWTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 W + Y YS I PW + Q+ + A A ++ L Sbjct: 65 LPRLTA---WYGDPERSYTYSGI-AMEPTPWIPLLQTIKTKAETLAKAT------FNSVL 114 Query: 119 INRYAPG-AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDV 175 +N Y G +S H D + E PI SVS G F L+L G + Sbjct: 115 LNFYRTGTDGVSWHADDEPELKKNYPIASVSFGGTRRFLLKHKTDPTIEKVELILTSGSI 174 Query: 176 VVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 ++ G ++ + H + K P R NLTFR Sbjct: 175 LLMLGTTQEYWLHQVPKTKKFVEP-----RINLTFRFI 207 >UniRef50_A3D131 DNA-N1-methyladenine dioxygenase n=15 Tax=Shewanella RepID=A3D131_SHEB5 Length = 246 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 44/205 (21%), Positives = 72/205 (35%), Gaps = 23/205 (11%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 E L ++R + + + S R + G ++ W Sbjct: 55 AEQLTPPITLVRGYLNAEQQAALMKEAQTYPLS--RPEIQVFGQFHAIPRQQV----WFG 108 Query: 72 H-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLS 129 YLYS + PW P+ H L ++ A G + L+NRYA G + Sbjct: 109 DSGCDYLYSGL-FIRALPW---PKYAHKLREKLARDYGLAS---NGVLVNRYADGKDCMG 161 Query: 130 LHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYH 187 H D + E + I S++LG F K + + L GD+++ + + H Sbjct: 162 AHSDDEPEIAHGSHIASITLGATRDFVLKH-KHSQTKYCISLHSGDLLIMHWPMQNDWLH 220 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQA 212 + P R+N TFRQ Sbjct: 221 SLPKRLKIKEP-----RWNYTFRQL 240 >UniRef50_B4RAN1 DNA alkylation damage repair protein AlkB n=1 Tax=Phenylobacterium zucineum HLK1 RepID=B4RAN1_PHEZH Length = 238 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 46/201 (22%), Positives = 69/201 (34%), Gaps = 28/201 (13%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G F A E D+ D S+ PF G GW Y Sbjct: 64 EGLAYRPDFLTAAEEA---DLLDRLSRLPFEPFQFRGYE----GRRRVVSFGWR-----Y 111 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 ++ +P MP + RAA AG P LIN Y GA + H+D+ Sbjct: 112 DFNGPGLVEAEP---MPGWLRPVRDRAADFAGLPPEAFGHVLINEYREGAPIGWHKDRPV 168 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGGESRL-FYHGIQPLKA 194 + + +SLG P + +F + L + + G +R + H + KA Sbjct: 169 FEK---VAGISLGAPCVMRFRRRAGERFERLNVPLAPRSIYLLDGPARTEWEHSLPEAKA 225 Query: 195 GFHPLTIDCRYNLTFRQAGKK 215 RY++TFR + Sbjct: 226 --------LRYSITFRNLRAR 238 >UniRef50_A0YHA0 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YHA0_9GAMM Length = 206 Score = 119 bits (298), Expect = 6e-26, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 67/217 (30%), Gaps = 22/217 (10%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 +LF + + A F N+A + + ++Q + Sbjct: 4 NLFTEQPLILDLPDADIRYYPEFIDNSAAN----YRALVDEINWQQDTINMYGRPVLIPR 59 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 G Y YS + +PW L + ++ L N Y Sbjct: 60 MNAWYGDA--NAHYGYSGLKL-APQPWTP---GLLLLKTKIEKFLQTE---FNSVLANYY 110 Query: 123 AP-GAKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWG 179 ++ H D + P I S+S G F N + L G ++V Sbjct: 111 RDANDSVAWHADDEPELGAQPVIASLSFGATRRFSLRRKSANGIAPFHIELASGSLLVMA 170 Query: 180 GESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 G+++ ++H + +K R NLT+R + Sbjct: 171 GDTQKFWHHQVAKIKQPVA-----GRINLTYRFITEN 202 >UniRef50_Q17527 Protein B0564.2, partially confirmed by transcript evidence n=4 Tax=Caenorhabditis RepID=Q17527_CAEEL Length = 231 Score = 118 bits (296), Expect = 1e-25, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 64/236 (27%), Gaps = 49/236 (20%) Query: 5 FADAEP--WQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 F + + A + + + E L + + A Q +R +A Sbjct: 7 FPENIKKFIVKSAPATMIYIPNWIDEEEENLYKSCIENAPQPKWRV----------LANR 56 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 + G + P P L + + + + L+N Y Sbjct: 57 RLQNYG----------GVVGKTALIPTDDFPVELKYLMTKINDLGIFKNPV-NHVLVNEY 105 Query: 123 APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK--------RLLLEHGD 174 G + H D P + +V+LG + + K +LLE Sbjct: 106 EAGQGIMPHTDG--PAFHRIVTTVTLGSHCLLDMYDPVDQEIAKSEEERYVGSMLLEPRS 163 Query: 175 VVVWGGESRLF-YHGIQPLKAGFHPL---------------TIDCRYNLTFRQAGK 214 + + ++ HGI + D R ++T R K Sbjct: 164 LFIMTDDAYTRMLHGIAERETDLIEPGKVFNCTEELANKRLDRDTRISITVRNVEK 219 >UniRef50_A7SAR6 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SAR6_NEMVE Length = 269 Score = 118 bits (295), Expect = 1e-25, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 69/214 (32%), Gaps = 31/214 (14%) Query: 16 AAGAVIL---RRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTH 72 +G I+ F + + + + ++ P+ + G Sbjct: 74 PSGLAIIDLYPSFLDGDETEWMFE--QLQAEIPWEEKDIKIKGEFHKQPRLTAWFGEFP- 130 Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLH 131 Y YS + + + W + + L ++ A A G ++ L N Y + H Sbjct: 131 ---YTYSGLTLRPFQ-WSPI---LNILREKIAKATGE---TFNSMLANLYRHNKDSVDWH 180 Query: 132 QDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRND-------PLKRLLLEHGDVVVWGGESR 183 D + P I S+S G +F+ N+ ++ L G ++V G + Sbjct: 181 ADDEPSLGVNPTIASLSFGDSRVFELRKNPLNEGDDYSLMQHIKVPLNCGSLLVMRGSVQ 240 Query: 184 L-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + H + R NLTFR + Sbjct: 241 EDWQHRVPKEYHD-----RGPRINLTFRNISPVK 269 >UniRef50_Q4TCN8 Chromosome undetermined SCAF6790, whole genome shotgun sequence. (Fragment) n=9 Tax=Eumetazoa RepID=Q4TCN8_TETNG Length = 234 Score = 118 bits (295), Expect = 1e-25, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 71/227 (31%), Gaps = 51/227 (22%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 A + F E ++ + +P ++ Sbjct: 20 PPTAYYIPDFLSEQEESHLQQ----------QVYKSPKPKWTQLSGRRLQ---------- 59 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 + + +P+ CQR ++ + + L+N Y PG + H+D Sbjct: 60 -NWGGLPHPKGMLAETIPEWLQTYCQRISSLGAFGGKVANHVLVNEYKPGEGIMPHEDG- 117 Query: 136 EPDLRAPIVSVSLGLPAIFQF----GGLKRNDPLK-------RLLLEHGDVVVWGGES-R 183 P + ++SLG + F GG++ + P LL+E +++ E + Sbjct: 118 -PLYHPTVTTLSLGSHTLLDFYTPVGGVQGDAPQTEENRFLFSLLVEPRSLLILQDEMYQ 176 Query: 184 LFYHGIQPLKAGFHP----------------LTIDCRYNLTFRQAGK 214 HGI+P + LT R +LT R K Sbjct: 177 KLLHGIRPCEQDALSQKVLNLSAAGARAGDVLTRGTRVSLTVRHVPK 223 >UniRef50_Q9VKU5 CG6144, isoform A n=11 Tax=Diptera RepID=Q9VKU5_DROME Length = 228 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 61/231 (26%), Gaps = 55/231 (23%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 + + F + EQ I + + + Q+ + Sbjct: 12 PPTVMYIPNFITSEEEQRILSHIERTPKPRWTQL---------LNRRLV----------- 51 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y + +P+ + + + L+N Y PG + H D Sbjct: 52 -NYGGVPHPNGMIAEEIPEWLQTYVDKVNNLGVFESQNANHVLVNEYLPGQGILPHTDG- 109 Query: 136 EPDLRAPIVSVSLGLPAIFQF------------GGLKRNDPLKRLLLEHGDVVVWGGESR 183 P I ++S G + +F G + L +LLLE +++ Sbjct: 110 -PLFHPIISTISTGAHTVLEFVKREDTTTETEAGDQTTREVLFKLLLEPRSLLILKDTLY 168 Query: 184 LFY-HGIQPLKAGF-------------------HPLTIDCRYNLTFRQAGK 214 Y H I H + R +LT R K Sbjct: 169 TDYLHAISETSEDVLCDRISNYDLCENTYKIGDHLVRRSPRISLTIRNVPK 219 >UniRef50_C4WWX2 ACYPI004109 protein n=2 Tax=Acyrthosiphon pisum RepID=C4WWX2_ACYPI Length = 220 Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 74/219 (33%), Gaps = 25/219 (11%) Query: 6 ADAEPWQEPLAAGAV--ILRRFAFNAAEQLIRDINDV-ASQSPFRQMVTPGGYTMSVAMT 62 A W++ +A RF A + + + S R Sbjct: 16 AQPSQWRKIVAEDLDLDYCERFLTAAESATLLNYMENNVSYFDGRLSQVKVFGQYYPIPR 75 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 G + + + P PQ ++L ++ T G + L+NRY Sbjct: 76 QQVAFGDAGLLYKFSGTVV------PAQPWPQPLYDLKRKICTTRGVD---YNFVLVNRY 126 Query: 123 APG-AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLK-----RNDPLKRLLLEHGDV 175 G + H+D + E D PI S+SLG F F R L +L L +G + Sbjct: 127 KNGEDHMGEHRDDEVELDKTVPIASISLGQTRKFVFKHTDVRKKIRQVELVKLDLHNGSL 186 Query: 176 VVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 ++ + +YH I K + R N TFR+ Sbjct: 187 LMMNQPTNEYWYHSIPKEK-----NAKNIRLNFTFRKIK 220 >UniRef50_Q609W8 2OG-Fe(II) oxygenase family domain protein n=1 Tax=Methylococcus capsulatus RepID=Q609W8_METCA Length = 141 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 58/151 (38%), Gaps = 17/151 (11%) Query: 69 WTTH-RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 W Y YS + PW + +L R +G+ +A L NRY G Sbjct: 3 WYGDPGATYRYSGVS-HQPSPWHEV---LADLRTRIEAFSGH---VFNAVLCNRYRSGRD 55 Query: 127 KLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-L 184 + H D + P I S+SLG +F+ + + L GD+++ GGE + Sbjct: 56 SMGWHADDEPELGERPFIASLSLGAERLFRIRH-RGTGRTLDVPLRDGDLLLMGGELQSH 114 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 + H + R NLTFR+ + Sbjct: 115 WRHCVPRTAR-----PCGERINLTFRRVVPR 140 >UniRef50_C8XTB3 Putative uncharacterized protein n=1 Tax=Dunaliella viridis RepID=C8XTB3_9CHLO Length = 2229 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 64/191 (33%), Gaps = 14/191 (7%) Query: 23 RRFAFNAAEQLIRDIN-DVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPI 81 +RF D+ + +S F Q + R + YS + Sbjct: 1922 KRFLPPQQSSETGDLFKRLMRESSFEQRDIFVMGKRHKQPRLTAYYATDLERGTFTYSGL 1981 Query: 82 DPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLR 140 P+ + + Q + D+ L+N Y G + H D ++ Sbjct: 1982 -LNIPSPFTPFLEHLKSSVQECVKE------EFDSVLLNYYRDGSDTVGWHADNEKLYGD 2034 Query: 141 AP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLFYHGIQPLKAGFHP 198 P I S+S G F ++ N + L GD++V G+ + + H + P Sbjct: 2035 TPTIASLSFGSARDFILRKIEDNSDKYKFTLGPGDLLVMKGKTQQQWQHTVPRRSP---P 2091 Query: 199 LTIDCRYNLTF 209 I R NLTF Sbjct: 2092 QAIGPRINLTF 2102 >UniRef50_Q2MF23 TobX protein n=2 Tax=Actinomycetales RepID=Q2MF23_STRSD Length = 219 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 43/200 (21%), Positives = 61/200 (30%), Gaps = 27/200 (13%) Query: 16 AAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G V A E+ + + R Sbjct: 41 PEGLVHQPDLLDEAEERSLLTAVEAMPLHEVRMHG------------QVARRTVRHFGFD 88 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 Y Y P +P+ F L R A AG LI RY PGA + H+D Sbjct: 89 YGYESWRL---TPTDPLPEEFWWLRDRCAHLAGLRPESLAQTLIARYPPGATIGWHRD-- 143 Query: 136 EPDLRAPIVSVSLGLPAIFQF-GGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 P +V VSL + +F + + L L V G +R + H I P+ Sbjct: 144 APMFGPSVVGVSLLSSCLMRFQRRVGEERRVYELELAPRSAYVLSGAARSAWQHSIPPV- 202 Query: 194 AGFHPLTIDCRYNLTFRQAG 213 + RY++TFR Sbjct: 203 -------PELRYSITFRTLR 215 >UniRef50_A6ESW5 2OG-Fe(II) oxygenase n=5 Tax=Bacteroidetes RepID=A6ESW5_9BACT Length = 203 Score = 117 bits (292), Expect = 4e-25, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 68/218 (31%), Gaps = 23/218 (10%) Query: 2 LDLFADAEPWQEPLAAG----AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTM 57 + LF+ P + + F + ++ ++Q Sbjct: 1 MSLFSSEIPRESQIVPLKDAVVTYTPHFYSEQEAVQLYKTL--LTEINWQQDKITLFGKT 58 Query: 58 SVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDAC 117 + G Y + P+ ++ H++ G + Sbjct: 59 HLQPRLTALYGDEEIPYSYSGIVMTPRK------FSRTLHHIKTAIENHTG---ATFNTV 109 Query: 118 LINRYAPG-AKLSLHQDKDEPDLRAPI-VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDV 175 L N Y G H D ++ PI VS+SLG +F + +L L +G + Sbjct: 110 LCNLYRDGKDSNGWHSDNEKELGPDPIIVSISLGETRMFHLKNKQAPTERIKLALTNGSL 169 Query: 176 VVWG-GESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 + G G + + H + + P R NLTFR+ Sbjct: 170 LYMGKGTQKNYKHQLAKTQKQITP-----RINLTFRRL 202 >UniRef50_Q4UFZ4 Alkylated DNA repair protein, putative n=2 Tax=Theileria RepID=Q4UFZ4_THEAN Length = 350 Score = 116 bits (291), Expect = 4e-25, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 70/193 (36%), Gaps = 12/193 (6%) Query: 17 AGAVILRRFAFNAAEQLIR-DINDVASQSPFRQMV----TPGGYTMSVAMTNCGHLGWTT 71 G +LR F L+ + P + + + + +L W+T Sbjct: 99 PGVFVLRNFLTQDQSLLLACETLRSYINPPNNSNLLLMDPNISSPIWPSNS-FKNLRWST 157 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--YPDFQPDACLINRYAPGAKLS 129 Y + + P+ + + T Y F D+ +IN Y+ L Sbjct: 158 IGHLYDWGKRQY---IGFTQFPEIIAKIVNQINTLLSGFYQPFTADSAIINFYSNSYFLR 214 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGI 189 LH+D E P++++SLG PAIF + +++ G +++ SR HGI Sbjct: 215 LHRDDAEET-NDPVINISLGAPAIFCICKEDPSQFPLSCVVDSGSIIIMSKNSRRCLHGI 273 Query: 190 QPLKAGFHPLTID 202 L P + + Sbjct: 274 SKLYHYVKPDSSN 286 >UniRef50_C8VDQ1 DNA repair family protein (AFU_orthologue; AFUA_5G14250) n=5 Tax=Leotiomyceta RepID=C8VDQ1_EMENI Length = 335 Score = 116 bits (291), Expect = 4e-25, Method: Composition-based stats. Identities = 40/144 (27%), Positives = 57/144 (39%), Gaps = 21/144 (14%) Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y Y+P +P+ L Q A G + CL+N YA G +S H D Sbjct: 162 YQYTPR---------PIPKCLDQLRQAVEAAVG-DGSSYNFCLVNYYATGDDSISYHSDD 211 Query: 135 DEPDLRAP-IVSVSLGLPAIFQFGGLKR-----NDPLKRLLLEHGDVVVWGGESR-LFYH 187 + P I S+SLG F ++ + L GD+VV GE++ + H Sbjct: 212 ERFLGPNPSIASISLGAQRDFLMRHKPSQAPGVSNQPLKFSLASGDMVVMRGETQSNWLH 271 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQ 211 I K G R N+TFR+ Sbjct: 272 SIPKRKGGESQK---GRINITFRK 292 >UniRef50_Q5K7S3 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5K7S3_CRYNE Length = 425 Score = 116 bits (290), Expect = 6e-25, Method: Composition-based stats. Identities = 43/192 (22%), Positives = 66/192 (34%), Gaps = 40/192 (20%) Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWP-AMPQSFHNLCQRAATAAGYP--------- 110 + W Y +S P P +LC A + + Sbjct: 225 RKLWKEIRWANLGWVYQWSTKSYDFAPETPIPFPAPLADLCSEAVASVPWENVFSSVSDP 284 Query: 111 ------------DFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGG 158 D++PD ++N Y L H D+ E D P+VSVSLG AI G Sbjct: 285 DASTYGWQSWPRDYKPDTGIVNFYQLNDTLMAHVDRAELDPARPLVSVSLGHAAILLLGS 344 Query: 159 LKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHP------------------LT 200 R++ + ++L GD+++ G+ R YHG+ + G P Sbjct: 345 DSRDEVPRPIILRSGDMLIMSGKGRQSYHGVPRILEGSLPSHFLVQESDSEEMKAAKNWI 404 Query: 201 IDCRYNLTFRQA 212 R N+ RQ Sbjct: 405 STARININARQV 416 >UniRef50_A9BA22 Alkylated DNA repair protein n=2 Tax=Prochlorococcus marinus RepID=A9BA22_PROM4 Length = 189 Score = 115 bits (288), Expect = 9e-25, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 64/193 (33%), Gaps = 20/193 (10%) Query: 21 ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSP 80 + ++I + Q V LG + Y YS Sbjct: 11 YFPALISHNQTGYWKNII--LENLEWTQPVVKVYSKRYSVPRLTAFLG--SKGISYKYSG 66 Query: 81 IDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPD 138 + W P+ F L + + CLIN Y G + H D + E D Sbjct: 67 A-IHYAEDW---PKWFFPLLDYIRD---FSRTNYNGCLINLYRDGNDCMGWHSDNEKELD 119 Query: 139 LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFH 197 + I S+SLG F F L + + L GD+++ E + + H + K Sbjct: 120 PKKSIASLSLGATRDFFFRSLIDSS-SNNIELRDGDLLLMHPECQFNWKHCLPKRKKVS- 177 Query: 198 PLTIDCRYNLTFR 210 + R NLTFR Sbjct: 178 ----EVRINLTFR 186 >UniRef50_UPI000175883A PREDICTED: similar to alkB, alkylation repair homolog 2 n=2 Tax=Coelomata RepID=UPI000175883A Length = 197 Score = 115 bits (288), Expect = 9e-25, Method: Composition-based stats. Identities = 46/198 (23%), Positives = 70/198 (35%), Gaps = 25/198 (12%) Query: 26 AFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQT 85 + +L++ + D G Y +S Sbjct: 9 IDETSSELMQQLEDSVEYLDGDLSKVRVFGKWHQIPRQQAAYG--DQGTVYKFSGTSIPC 66 Query: 86 NKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPDLRAPI 143 KPW NL +R F + LINRY G + H+D + E D PI Sbjct: 67 -KPWTETLIQVRNLIKRV------TGFDYNFVLINRYRDGNDHIGEHKDNESELDKNTPI 119 Query: 144 VSVSLGLPAIFQFGGL--------KRNDPLKRLLLEHGDVVVWGGES-RLFYHGIQPLKA 194 S+SLG +F F KR+ P ++ L+HG +++ + +YH + P K Sbjct: 120 ASLSLGQQRLFVFKHQDCRKKGGAKRSVPPVKIQLQHGSLLLMNPPTNNYWYHALPPAKR 179 Query: 195 GFHPLTIDCRYNLTFRQA 212 R NLTFR+ Sbjct: 180 -----APGARINLTFRKI 192 >UniRef50_Q2SBS6 Alkylated DNA repair protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SBS6_HAHCH Length = 203 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 67/214 (31%), Gaps = 21/214 (9%) Query: 5 FADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 D + +A G ++ +A + + D+ +RQ Sbjct: 6 HPDLQIEPITIANGALTLIHPLLADADAAFVLE--DLTQHLDWRQDSLRIQGRTIPIPRL 63 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 G Y YS + L A+ A + + L N Y Sbjct: 64 QAWYGEPH--CHYAYSGLRLNP----TPFSPLLQQLRHIASEHAA---AKFNCALCNLYR 114 Query: 124 PG-AKLSLHQDKDEPDLRAPI-VSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 G +S H D + API S S G FQ KR + L H +++ G+ Sbjct: 115 NGQDSVSWHADDEPELGPAPIIASFSFGATRTFQIK-PKRGGQTLAIELLHNSLLIMSGD 173 Query: 182 -SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 R + H + KA P R NLT+R Sbjct: 174 MQRHWRHQLPKTKAPVGP-----RVNLTYRYIPA 202 >UniRef50_A4C6S7 Putative 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C6S7_9GAMM Length = 208 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 52/215 (24%), Positives = 83/215 (38%), Gaps = 25/215 (11%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFN-AAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 + LF + L G + + + A+ L + D + + + G T+S+ Sbjct: 9 MPLFNTK---VKSLPNGFICHQNTISSVKADALYHYLLDECAWQ--QPKIVIYGKTVSIP 63 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 C Y YS + +PW A+ + R + G P +A L+N Sbjct: 64 RLQCYI---ADEGLEYQYSGLT-MAPEPWSAV---LLAIKNRLSHTFGVP---FNALLVN 113 Query: 121 RYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 178 Y G + H D + R P I S+SLG +F+ K+ + L L+ GD ++ Sbjct: 114 WYRDGQDSMGWHSDDEPELGREPCIASLSLGASRLFKMR-QKQTLQVYNLQLQSGDCLLM 172 Query: 179 GGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G S+L F H + P R NLTFR Sbjct: 173 SGRSQLDFQHSLPK-----QPSVKQGRINLTFRYV 202 >UniRef50_A6GHE3 2OG-Fe(II) oxygenase n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE3_9DELT Length = 228 Score = 115 bits (287), Expect = 1e-24, Method: Composition-based stats. Identities = 47/229 (20%), Positives = 70/229 (30%), Gaps = 32/229 (13%) Query: 3 DLFADAEPWQ-EPLAAGA------VILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGY 55 +LF P + P+ A + F A + A P+RQ Sbjct: 5 ELFPAPSPERWRPIDHDAAAEARIFLREAFLDPEAATTLYAQLRDAV--PWRQDELRAYG 62 Query: 56 TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD 115 Y++S + PW L +R A + + Sbjct: 63 KTHPIPRLHQWYADDDSG-TYVWSGLTMH-PLPWTP---PLDALRRRVEAAT---RRRFN 114 Query: 116 ACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRND-------PLK 166 + LIN Y G + H D + AP I SVSLG F + D Sbjct: 115 SALINYYRDGRDTVGWHADDEVELGPAPFIASVSLGAERDFLLRRVANADTDTDTEPRHL 174 Query: 167 RLLLEHGDVVVWG-GESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + L HG ++V G + H + + R NLTFR + Sbjct: 175 SVALPHGSLLVMAEGSQARWQHTLPRRTRVT-----EGRINLTFRHVSR 218 >UniRef50_B6EQD5 Putative uncharacterized protein n=1 Tax=Aliivibrio salmonicida LFI1238 RepID=B6EQD5_ALISL Length = 208 Score = 115 bits (287), Expect = 1e-24, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 59/217 (27%), Gaps = 22/217 (10%) Query: 5 FADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 F + F + + + S +T G C Sbjct: 7 FENTPLTLLNNDGNITYWDNFLSEDEATNLFNELQINSDWKEET-ITLFGKEYKQPRLTC 65 Query: 65 --GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 G G + GY L + G ++ + L N Y Sbjct: 66 WYGEYGVVANG-GYQVLTKAV-------PFTSQLMVLKNKIEKETG---YKFNCVLANLY 114 Query: 123 A-PGAKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 + H D + + P I S SLG F + + L++ +V+ G Sbjct: 115 RNENDGVGYHADDEAILGKNPAIASYSLGETRRFLVKHNQHKYKNISIDLKNNSLVLMDG 174 Query: 181 E-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + H I K R NLTFR GK + Sbjct: 175 CLQDHWKHAIPKTKRAM-----SARINLTFRFLGKND 206 >UniRef50_A1S8P3 DNA-N1-methyladenine dioxygenase n=1 Tax=Shewanella amazonensis SB2B RepID=A1S8P3_SHEAM Length = 206 Score = 115 bits (287), Expect = 1e-24, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 61/202 (30%), Gaps = 23/202 (11%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTH-RQG 75 + F + L+ + A M+ G + W Sbjct: 19 PPVSHVPAFLSPRQQALLMT--EAADYPFESPMIKVYGKWHPIPRQQV----WFADEGCS 72 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y YS + P L Q G + CL+N Y G + H D Sbjct: 73 YRYSSLLISP----TPWPHYLLRLKQALEAHCGAG---FNGCLVNHYRGGEDTMGFHADD 125 Query: 135 D-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPL 192 + E + I VSLG + R+LL+ GD+++ + + H I Sbjct: 126 EPELVEESLIAIVSLGASRPLVMRRREDGLR-CRVLLQSGDLLLMHPPMQSTWEHAIPRS 184 Query: 193 KAGFHPLTIDCRYNLTFRQAGK 214 + ++ R + TFR Sbjct: 185 QK-----SLPARISFTFRNLKP 201 >UniRef50_D2XAQ5 Alkylated DNA repair protein n=1 Tax=Marseillevirus RepID=D2XAQ5_9VIRU Length = 198 Score = 115 bits (287), Expect = 1e-24, Method: Composition-based stats. Identities = 50/207 (24%), Positives = 79/207 (38%), Gaps = 20/207 (9%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 RF F + ++L R + D+ P + G + + G G Y Sbjct: 3 PPIFYSERFLFGSRKKLQRQLADIEYLPPEDTAIKMHGKVIPIPRLQTG-FGK-HESLSY 60 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAA------GYPDFQPDACLINRYAPGA-KLS 129 ++ + P P L + G P+ L+N+Y G + Sbjct: 61 SFTGVKI----PAKIWPPYIEKLSLKIHAHLVEQGVMGQDTPPPNYVLVNKYLNGDHYIG 116 Query: 130 LHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW-GGESRLFYH 187 H DK+ + + PI+SVSLG F +K + K + L GDV+V G +++ H Sbjct: 117 WHSDKERDLMMGYPIISVSLGARRDFCLRLIKNHKHKKTISLGSGDVLVMLPGMQQVWQH 176 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + K P RYNLTFR G+ Sbjct: 177 CLPKRKGLDEP-----RYNLTFRWIGE 198 >UniRef50_A4BAI0 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BAI0_9GAMM Length = 194 Score = 115 bits (287), Expect = 1e-24, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 66/211 (31%), Gaps = 23/211 (10%) Query: 8 AEPWQEPLAAGAV-ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH 66 +G V + + +E + ++ V Sbjct: 3 TPVTLLDDDSGLVKLYPEWL-RNSEAFYQYCRQNLD---WQSRTIRLFGKAHVIPRLECW 58 Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG- 125 LG Y YS + + P+ F +L R + DF P+ L+N Y G Sbjct: 59 LG--DPGLRYGYSGQEYVAS----GWPEGFKSLLDRFQSQH---DFAPNGALMNYYRSGA 109 Query: 126 AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SR 183 + H D + E L I +SLG F F K + +L L G +++ G Sbjct: 110 DTMGWHADDEPELGLNPTIAILSLGGARDFHFRQHKDHSQKLKLRLPEGSLLLMSGAVQH 169 Query: 184 LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + H + R + TFR+ Sbjct: 170 HWQHALPKRAQAR------PRISCTFRRIVA 194 >UniRef50_Q1YTT7 Oxidoreductase, 2OG-Fe(II) oxygenase family protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YTT7_9GAMM Length = 202 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 75/217 (34%), Gaps = 27/217 (12%) Query: 2 LDLFAD---AEPWQEPLAAGAVILRRFAFNA-AEQLIRDINDVASQSPFRQMVTPGGYTM 57 +DLF E + + A++L+ + ++ G T+ Sbjct: 1 MDLFDHQLIPVRLTEERQSQITFWPNWLDGERADRLVSQSINDIDWRS--DVIRIVGKTI 58 Query: 58 SVAMTNCGHLGWTTHRQ-GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA 116 + W + + Y YS I Q A P L ++ +G + + Sbjct: 59 PIPRLQQ----WFGNPETSYTYSNIRLQA----VAFPCWIDQLREQIEIQSGE---RFNR 107 Query: 117 CLINRYAPG-AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGD 174 L+N Y G + H D + E + S+SLG +FQ + + L HG Sbjct: 108 ALVNYYRDGSDSVDWHADDEAELGFEPLVASLSLGAERVFQLRHNLTKER-LDIALPHGS 166 Query: 175 VVVWG-GESRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 +++ G G + H I K P R N TFR Sbjct: 167 LLLMGAGIQTYWQHRIAKTKKVDKP-----RVNFTFR 198 >UniRef50_Q21J14 DNA-N1-methyladenine dioxygenase n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21J14_SACD2 Length = 204 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 71/216 (32%), Gaps = 23/216 (10%) Query: 3 DLF--ADAEPWQEPLAAG-AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSV 59 +LF +P L G + R + + N +A++ ++Q Sbjct: 4 NLFEQDADKPQSIDLKGGELELHRGWLESDLAT--EYFNKLAAEVDWQQPEIWVAGQRHK 61 Query: 60 AMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 G Y PW + +L + ++ L+ Sbjct: 62 IPRLQAWYGDENSVMEYS---ATRFYPTPWS---KELISLKDLIENKT---ESSYNSVLV 112 Query: 120 NRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVV 177 N Y G + H D ++ P I S+SLG F KR +L L +GD++V Sbjct: 113 NLYRNGADGVGWHADDEKELGGCPVIASLSLGASRSFSLK-PKRGGKSIKLELNNGDLIV 171 Query: 178 WGGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 G+ R + H + P R NLTFR Sbjct: 172 MKGDTQRNWLHAVAKTSKKIGP-----RINLTFRYI 202 >UniRef50_A9T8I6 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T8I6_PHYPA Length = 320 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 36/129 (27%), Positives = 53/129 (41%), Gaps = 9/129 (6%) Query: 87 KPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IV 144 KP +P+ L + A D + L+N YA G +S H D + P I Sbjct: 162 KPPRPIPRCLQELKRCVEQAT---DEYYNFVLVNFYADGTHSISPHSDDESFLGTNPCIA 218 Query: 145 SVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTID 202 S+SLG F R D ++ L GD+VV G ++ ++H I Sbjct: 219 SLSLGGTRDFVMKHKTRKDVNSEKFALRSGDMVVMRGTTQANWFHSIPKRTGKTQ--ATA 276 Query: 203 CRYNLTFRQ 211 R N+TFR+ Sbjct: 277 PRINVTFRK 285 >UniRef50_Q3IHQ7 Putative 2OG-Fe(II) oxygenase superfamily protein n=2 Tax=Alteromonadales RepID=Q3IHQ7_PSEHT Length = 196 Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 66/206 (32%), Gaps = 20/206 (9%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 + L G R ++ + + ++Q + Sbjct: 8 PQLLPNGFSYQSRALSA--QKSLDLFYYLQQNLCWQQPNVTVYNKTGPIPRLQCFISENN 65 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSL 130 GY +S + + P + +R P ++ L+N Y G + Sbjct: 66 IEYGYSHSKLIVE------PWPDVLLAMRKRLERHLNQP---LNSLLVNYYRDGNDTMGW 116 Query: 131 HQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHG 188 H D + P IV +SLG + + ++ + L L G ++ G S+ + H Sbjct: 117 HSDDEAELGHQPTIVCISLGAERVLKLKHKA-SNKVTNLKLHSGSCLIMSGNSQRDYQHA 175 Query: 189 IQPLKAGFHPLTIDCRYNLTFRQAGK 214 I HP R +LTFR + Sbjct: 176 IAKQTTLAHP-----RISLTFRLIKR 196 >UniRef50_A2QVX5 Contig An11c0110, complete genome n=18 Tax=Eurotiomycetidae RepID=A2QVX5_ASPNC Length = 360 Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 62/200 (31%), Gaps = 31/200 (15%) Query: 36 DINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQS 95 ++V L W T Y ++ + +P P P+ Sbjct: 162 SFFGDDPARVIEPKDPNVHKPLTVQSILNRKLRWVTLGGQYDWTAKVYPSERP-PEFPRD 220 Query: 96 FHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQ 155 L +P + A ++N Y+ G LS H+D E D ++SVS G +F Sbjct: 221 IAKLLHAM-----FPATEAQAAILNVYSAGDHLSPHRDVSE-DCDVGLISVSFGCDGLFL 274 Query: 156 FGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI-------------- 201 + + + L GD V G SR +H + + P + Sbjct: 275 ISH-DDGEHCEIIRLRSGDAVYMDGTSRFAWHAVPKIVPNTCPKWLANWPSSPHDGAASQ 333 Query: 202 ---------DCRYNLTFRQA 212 R NL RQ Sbjct: 334 YDAWRGWMSGKRVNLNVRQM 353 >UniRef50_C1EB25 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EB25_9CHLO Length = 334 Score = 113 bits (283), Expect = 3e-24, Method: Composition-based stats. Identities = 53/231 (22%), Positives = 81/231 (35%), Gaps = 38/231 (16%) Query: 7 DAEPWQEPLAAGAVILRRFAFNAAEQLIRD------INDVASQSPFRQMVTPGGYTMSVA 60 + E LA G V LRR + + + Q + + V Sbjct: 95 EKGSVAEVLAPGLVCLRRAIDLETQAWLAERAFEVGEGKDGRQGFYNTVPGDAPGDAPVL 154 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLC-------QRAATAAGYPDFQ 113 N G G P P+ + Q A + PD Sbjct: 155 RLNQGTRGRVIL---------------PVSDFPERLGRIVRGCVRCAQTADSCTNVPDMN 199 Query: 114 PDACLINRYAPGAKLSLHQDKDEP-----DLRAPIVSVSLGLPAIFQFGGLKRNDPLKRL 168 P L+N Y GAK H+D ++P D PIVS ++GL A F + + + + Sbjct: 200 PTTALVNFYKEGAKFKWHRDSEDPAHARHDTGPPIVSFTVGLSADFSYKNRFEDATHRTV 259 Query: 169 LLEHGDVVVWGGESRLFYHGIQPLKAGFHPL-----TIDCRYNLTFRQAGK 214 L GDV+++GG SR+ H + + P + R N+T R G+ Sbjct: 260 RLNSGDVLLFGGPSRMIVHSVTGVVPRTMPPMLRGRMLHGRLNVTVRDIGR 310 >UniRef50_B2APW5 Predicted CDS Pa_4_6060 n=6 Tax=Sordariomycetes RepID=B2APW5_PODAN Length = 356 Score = 113 bits (283), Expect = 4e-24, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 67/228 (29%), Gaps = 43/228 (18%) Query: 22 LRRFAFNAAEQLIRDINDVASQSP--FRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYS 79 E + SP F +++ L W T Y ++ Sbjct: 135 YPDQGSEGPEHKHASFFTLPQDSPTTFTPKDPSVHKPLTIKQVLQRRLSWVTLGGQYDWT 194 Query: 80 PIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDL 139 P P P + +P+ A ++N Y PG + +H+D E Sbjct: 195 NRIYPGELP-PQFPPDIAGFLETL-----FPETLAQAAIVNFYTPGDTMMMHRDVSEET- 247 Query: 140 RAPIVSVSLGLPAIF-----QFGGLKRNDPLKR-----------LLLEHGDVVVWGGESR 183 ++S+S G ++F G + + L L GD + +SR Sbjct: 248 DKGLISLSFGCDSLFMIAPNDVGKMSDEEKKAAGFGDGQKEYLLLRLRSGDAIYMTKDSR 307 Query: 184 LFYHGIQPLKAGFHP------------------LTIDCRYNLTFRQAG 213 +HG+ + G P + R NL RQ Sbjct: 308 FAWHGVPKVLKGTCPDYLEDWPAEDGKYEEWRGWMKNKRINLNVRQMR 355 >UniRef50_Q3KRA9 Alkylated DNA repair protein alkB homolog 6 n=23 Tax=Metazoa RepID=ALKB6_HUMAN Length = 238 Score = 113 bits (283), Expect = 4e-24, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 70/243 (28%), Gaps = 57/243 (23%) Query: 6 ADAEPWQ-EPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 EP++ E + F E+ + A + + Q+ Sbjct: 9 PALEPFRVEQAPPVIYYVPDFISKEEEEYLLRQVFNAPKPKWTQLSGRKLQ--------- 59 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 + + +P + + + + + L+N+Y P Sbjct: 60 ------------NWGGLPHPRGMVPERLPPWLQRYVDKVSNLSLFGGLPANHVLVNQYLP 107 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF---GGLKRNDPLK----------RLLLE 171 G + H+D P + ++SLG + F + +DP + LLLE Sbjct: 108 GEGIMPHEDG--PLYYPTVSTISLGSHTVLDFYEPRRPEDDDPTEQPRPPPRPTTSLLLE 165 Query: 172 HGDVVVWGGESRLF-YHGIQ-------------------PLKAGFHPLTIDCRYNLTFRQ 211 ++V G + HGI P L R +LT R+ Sbjct: 166 PRSLLVLRGPAYTRLLHGIAAARVDALDAASSPPNAAACPSARPGACLVRGTRVSLTIRR 225 Query: 212 AGK 214 + Sbjct: 226 VPR 228 >UniRef50_C3ZI75 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZI75_BRAFL Length = 844 Score = 113 bits (282), Expect = 5e-24, Method: Composition-based stats. Identities = 37/154 (24%), Positives = 62/154 (40%), Gaps = 22/154 (14%) Query: 73 RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLH 131 Y +S ++ +PW + + R A G+ + + L+NRY G + H Sbjct: 703 GLSYRFSGVEV-PARPWTPL---MEGIRDRVQEATGH---KFNFVLVNRYKDGNDHMGEH 755 Query: 132 QDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDP-------LKRLLLEHGDVVVWGGES- 182 +D + + API S+SLG F F +L LEHG +++ + Sbjct: 756 RDDEKDLVREAPIASLSLGQKRDFIFKHCDARGKSAKRAMDPVKLELEHGSLLMMNYPTN 815 Query: 183 RLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 R +YH + K + R N+TFR + Sbjct: 816 RYWYHSLPVRKK-----ALGVRINMTFRSMVTSK 844 >UniRef50_C4Q8H0 Expressed protein n=2 Tax=Schistosoma RepID=C4Q8H0_SCHMA Length = 334 Score = 113 bits (282), Expect = 5e-24, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 69/227 (30%), Gaps = 40/227 (17%) Query: 19 AVILRRFAF--NAAEQLIRDINDVASQSPFRQMV---TPGGYTMSVAMTNCGHLGWTTHR 73 IL+ F + + + + + + P ++ L W T Sbjct: 65 FFILKNFFSPIEIEDLWLAALTEWCHSPTAQCNLGTNVPPNSRIACTDPWYSKLRWITLG 124 Query: 74 QGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGY-------------------PDFQP 114 Y +S +K P + + ++ P Sbjct: 125 YHYQWSERVYNESK-VGEFPSLLYGTTVNIINFLKHLIEGREISSSNSSRLLEQCQNYTP 183 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF------GGLKRN------ 162 +A ++N Y + H D E D AP+VS+S G A+F Sbjct: 184 EASIVNYYRTKTTMGFHSDDAEVDKEAPLVSISFGPTALFLLETSEAIKHEFDAPLHGSF 243 Query: 163 DPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT---IDCRYN 206 D + + L HGDVV+ G+SRL H + + R + Sbjct: 244 DHVLPIYLHHGDVVIMAGKSRLARHAVPVIFFDDDTEVVSKGALRVS 290 >UniRef50_C6Y338 2OG-Fe(II) oxygenase n=10 Tax=Bacteria RepID=C6Y338_PEDHD Length = 202 Score = 112 bits (281), Expect = 6e-24, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 66/217 (30%), Gaps = 23/217 (10%) Query: 2 LDLFA-DAEPWQEPLAAG--AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMS 58 +DLF L G + A + + ++ Sbjct: 1 MDLFNTGTGADLNLLPHGGIVNYYGKLMSPATANHYLQVL--LNTIEWKSDEAIILGKHI 58 Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 G Y + W A L A G ++CL Sbjct: 59 FTKRKVAWYGDREFEYTYS---NTTKKALAWTA---ELLELKAMAEQKTGE---TFNSCL 109 Query: 119 INRYAPGA-KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVV 176 +N Y G ++ H D + + I S+S G F F + + L+LEHG ++ Sbjct: 110 LNLYHSGEEGMAWHSDGEKDLKKNGAIGSMSFGAERKFSFKHKQSKE-TVSLILEHGSLL 168 Query: 177 VWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 V ++ + H + P K + R NLTFR Sbjct: 169 VMKDTTQSNWLHRLPPTK-----MVHKARVNLTFRTI 200 >UniRef50_D2VNR1 Putative uncharacterized protein n=1 Tax=Naegleria gruberi RepID=D2VNR1_NAEGR Length = 259 Score = 112 bits (280), Expect = 8e-24, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 59/228 (25%), Gaps = 52/228 (22%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G I+ A E+ + D + + Y Sbjct: 48 GLYIIENIIDVAEERKLVKFIDSQKWNDEI------------------SRRTQHYGVSYN 89 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ-------PDACLINRYA-PGAKLS 129 Y + P +P F +L + G + +IN Y +S Sbjct: 90 YGARGVKEALKVPPVPSEFSDLLEEIKNKEGLDSIRNLMEGIDFKQVIINEYKGAKQGIS 149 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQF-----------------GGLKRNDPLKRLLLEH 172 H D + D I+ +SLG + +F + Sbjct: 150 KHVDHCQ-DFGPLILILSLGDECVMKFHKLEQVKEEDLKKKKVKRTEVSPSECYDRRMPR 208 Query: 173 GDVVVWGGESRL-FYHGIQPL-------KAGFHPLTIDCRYNLTFRQA 212 +++ G++R + H I K R ++T+R Sbjct: 209 RSLIILSGDARYQYQHEIPKTMVFKIDGKQFLKRSESYRRVSITYRSL 256 >UniRef50_A2R3V2 Similarity to human sequence 203 from patent WO0129221-A/203 n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2R3V2_ASPNC Length = 356 Score = 112 bits (280), Expect = 8e-24, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 51/124 (41%), Gaps = 5/124 (4%) Query: 91 AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-KLSLHQDKDEPDLRAP-IVSVSL 148 +P L Q A + + L+N YA G +S H D + + P I S+SL Sbjct: 200 PIPPCLDILRQAVEKAT-DDGTRYNFVLVNYYATGDDSISYHSDDERFLGQNPTIASLSL 258 Query: 149 GLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNL 207 G F K + L+ GD+++ GE++ + H + K R N+ Sbjct: 259 GAGRDFLLKH-KPAAKPLKFPLKSGDMLIMRGETQSNWLHSVPKRKGLQGSAGALGRINI 317 Query: 208 TFRQ 211 TFR+ Sbjct: 318 TFRR 321 >UniRef50_A4SB59 Predicted protein (Fragment) n=2 Tax=Ostreococcus RepID=A4SB59_OSTLU Length = 134 Score = 112 bits (280), Expect = 8e-24, Method: Composition-based stats. Identities = 36/145 (24%), Positives = 50/145 (34%), Gaps = 14/145 (9%) Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA-K 127 W Y YS +P+ L G + L+NRY G Sbjct: 1 WAGD-LPYKYSGQTLDP----VPVPEVLRRLQTAVEAKCG---ATFNHILLNRYRDGDDS 52 Query: 128 LSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLF 185 ++ H D + E A I +VS+G F R + LEHG ++V G Sbjct: 53 MAFHADDEPELGKNACIAAVSVGHTRKFDVQVKSRAKKKTSIFLEHGSLMVMDGSLQHTH 112 Query: 186 YHGIQPLKAGFHPLTIDCRYNLTFR 210 YH + P R N+TFR Sbjct: 113 YHAVPK---NRVPTNGKERINITFR 134 >UniRef50_A7EEP6 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EEP6_SCLS1 Length = 319 Score = 112 bits (280), Expect = 9e-24, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 64/237 (27%), Gaps = 61/237 (25%) Query: 18 GAVILRRFAFNAAEQLIRD-INDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G + F E I I S G + + Sbjct: 23 GLALYSNFITPTEEAEIISSILSDDRWSGI------------------GKRQTLHYGAHF 64 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 Y+ ++ W +P+ +L R + +PD + Y PG + H D Sbjct: 65 DYTTFG--ASEMWTPVPRYLEDLVDRLPWRKEGKEERPDQFTVQYYPPGTGIPPHVDTHS 122 Query: 137 PDLRAPIVSVSLGLPAIFQFGGL---------------------------------KRND 163 + S+S+G F + Sbjct: 123 -VFGEYLYSLSIGSSVPMVFKKCGENEARKMRKPKRSLLGDSRDEVNRTRVTIKAEDDGE 181 Query: 164 PLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTID-----CRYNLTFRQAGK 214 + L +++ GE+R F H I+ K F + R+++T R+ + Sbjct: 182 EKWEVWLRERSLLLMRGEARFGFTHMIRGRKFDFDERKGERVRRVGRWSITMRRVRR 238 >UniRef50_UPI0001927839 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001927839 Length = 235 Score = 112 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 34/148 (22%), Positives = 55/148 (37%), Gaps = 17/148 (11%) Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSL 130 Y +S + + W Q + ++ + L+NRY G + Sbjct: 94 QGLSYTFSGVTVF-AQSWLPFMQKLKEIAEQL------TMTSFNFVLVNRYDNGNDYMGF 146 Query: 131 HQDKD-EPDLRAPIVSVSLGLPAIFQF----GGLKRNDPLKRLLLEHGDVVVWGGESR-L 184 HQD + + D API S S G F F ++ L HG +++ + L Sbjct: 147 HQDNEKDLDAHAPIASFSFGQDRDFIFKYKKNKSNKSYENVTFHLGHGSLLIMHPPTNDL 206 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 +YH + P + R NLTFR+ Sbjct: 207 WYHSLPKRSVKTCP---NPRINLTFRKM 231 >UniRef50_B0C4L3 Alkylated DNA repair protein n=3 Tax=Bacteria RepID=B0C4L3_ACAM1 Length = 175 Score = 112 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 56/172 (32%), Gaps = 16/172 (9%) Query: 44 SPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRA 103 P Q + + V + Y YS I + P + + + Sbjct: 15 WPNHQDLFFKLCSTVVWDVRMKARRTASFGVAYNYSQITYLKTEMHPELLPLCAAVLESL 74 Query: 104 ATAAGYPDFQPDACLINRYAPGAK-LSLHQDK-DEPDLRAPIVSVSLGLPAIFQFGGLKR 161 F P+ CL+N Y G+ + H D +E + ++SL + + Sbjct: 75 -------GFTPNNCLLNFYTDGSSSMGFHSDTAEELSPGTGVATLSLRATRTITYKHKQA 127 Query: 162 NDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 + LE GD++ E + + HGI R ++TFR Sbjct: 128 RETQYSYSLESGDLLYMSNEVQIDWLHGILK------EAQAGPRISVTFRSI 173 >UniRef50_Q96Q83 Alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 n=17 Tax=Chordata RepID=ALKB3_HUMAN Length = 286 Score = 112 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 64/203 (31%), Gaps = 28/203 (13%) Query: 21 ILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSP 80 + F I + + P++Q G Y YS Sbjct: 92 LYPGFVDVKEADWILE--QLCQDVPWKQRTGIREDITYQQPRLTAWYGELP----YTYSR 145 Query: 81 IDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA-PGAKLSLHQDKDEPDL 139 I + N W + L R G+ ++ L N Y + H D + Sbjct: 146 ITMEPNPHWHPV---LRTLKNRIEENTGH---TFNSLLCNLYRNEKDSVDWHSDDEPSLG 199 Query: 140 RAP-IVSVSLGLPAIFQFGGLKRND--------PLKRLLLEHGDVVVWGGESR-LFYHGI 189 R P I S+S G F+ + ++ L+HG +++ G ++ + H + Sbjct: 200 RCPIIASLSFGATRTFEMRKKPPPEENGDYTYVERVKIPLDHGTLLIMEGATQADWQHRV 259 Query: 190 QPLKAGFHPLTIDCRYNLTFRQA 212 + + R NLTFR Sbjct: 260 PKEY-----HSREPRVNLTFRTV 277 >UniRef50_A6VZM6 Putative alkylated DNA repair protein n=1 Tax=Marinomonas sp. MWYL1 RepID=A6VZM6_MARMS Length = 185 Score = 112 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 57/169 (33%), Gaps = 17/169 (10%) Query: 45 PFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAA 104 P+ + + Y Y+ D W P + + A Sbjct: 28 PWERESLTMFGRDVLVPRRVAF--VADTGICYRYTGKD-HYGIGW---PDWLLAIKEEAE 81 Query: 105 TAAGYPDFQPDACLINRYAPGA-KLSLHQDKDEPDLRAPIVS-VSLGLPAIFQFGGLKRN 162 +A L+N Y G + H D ++ AP+V+ +SLG F F + Sbjct: 82 ---ILAKQSFNAVLLNWYQDGEEYMGWHADDEKSLGPAPVVAMLSLGASRPFIFRLKGNH 138 Query: 163 DPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFR 210 + LE G +V ++ L+ H + K + R +LTFR Sbjct: 139 QIKHSVELEDGSWLVMSASTQVLWQHSLPVRKR-----IKEERISLTFR 182 >UniRef50_Q3M1V0 DNA-N1-methyladenine dioxygenase n=3 Tax=Nostocaceae RepID=Q3M1V0_ANAVT Length = 199 Score = 112 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 60/212 (28%), Gaps = 26/212 (12%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L LF + L F + + + G TM V Sbjct: 4 LQLFDE---PTTVLP--VSYHPDFLSKQEADEL--YQHCQQLQWQQNQIRMLGKTMPVPR 56 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 C + YLYS W + L G + + N+ Sbjct: 57 LECI---YGDEGCDYLYSNSVLLKPLAWT---DALSKLRDSITAFTG---YSFRIVIGNQ 107 Query: 122 YAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW- 178 Y G + H DK+ P I S+SLG FQ + LEHG ++V Sbjct: 108 YRSGQDSIGWHADKESSMGVEPTITSISLGAVRKFQIKPI--GGKPTDFWLEHGSLLVML 165 Query: 179 GGESRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 G H + R NLTFR Sbjct: 166 PGCQTTHLHQVPKTNKFVT-----TRINLTFR 192 >UniRef50_C5KBY7 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KBY7_9ALVE Length = 332 Score = 112 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 50/245 (20%), Positives = 87/245 (35%), Gaps = 43/245 (17%) Query: 14 PLAAGAVILRRFAFNAA-EQLIRDINDVASQSPFRQMVTPGGYTMSVAMT---------- 62 P + ++LRR+ A E++ +I SP + P + Sbjct: 61 PSRSAVILLRRYLSEDAVERITSEILQHCISSPHSTSLDPITTPEERSDMFEEYQQMCGS 120 Query: 63 --------NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAA------- 107 L W++ Y ++ + MPQ ++ A AA Sbjct: 121 NRTELNTCLLRKLRWSSLGVHYDWTRRSYR-GTSSSDMPQWVCDIYHNALKAADDICGSR 179 Query: 108 -GYPDFQPDACLINR---YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND 163 +Q +A L+N + +L H+D E +P+V ++LGLP F GG R D Sbjct: 180 LAEDGYQAEAALVNFFHSHRSSDRLGGHKDDVEARDHSPLVILALGLPCTFLLGGDSRVD 239 Query: 164 -PLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID-----------CRYNLTFRQ 211 +L GDV+V E+R ++HG+ + R +++ R+ Sbjct: 240 VTPAPILFSSGDVLVLSREARQWFHGVPTVLKNSVDRPHHADGSVEDFLKRTRLSVSIRE 299 Query: 212 AGKKE 216 E Sbjct: 300 VSGDE 304 >UniRef50_Q8K2U2 Alkylated DNA repair protein alkB homolog 6 n=5 Tax=Eutheria RepID=ALKB6_MOUSE Length = 235 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 70/243 (28%), Gaps = 57/243 (23%) Query: 6 ADAEPWQ-EPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 EP++ E + F E+ + A + + Q+ Sbjct: 9 PALEPFRVEQAPPLIYYVPDFISKEEEEYLLRQVFNAPKPKWTQLSGRKLQ--------- 59 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 + + +P + + + + + L+N+Y P Sbjct: 60 ------------NWGGLPHPRGMVPERLPPWLQRYVDKVSDLSLFGGLPANHVLVNQYLP 107 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRND-------------PLKRLLLE 171 G + H+D P + ++SLG + F ++ D P+ LL+E Sbjct: 108 GEGIMPHEDG--PLYYPTVSTISLGSHTVLDFYEPRQPDDDVPMEQPRPPQRPITSLLVE 165 Query: 172 HGDVVVWGGESRLF-YHGIQPLKAGFHPLT-------------------IDCRYNLTFRQ 211 ++V G + HGI + T R +LT R+ Sbjct: 166 PRSLLVLRGTAYTRLLHGISATRVDELDATSLPPNATACKSALPGAHLVRGTRVSLTIRR 225 Query: 212 AGK 214 + Sbjct: 226 VPR 228 >UniRef50_Q80Y20-2 Isoform 2 of Alkylated DNA repair protein alkB homolog 8 n=4 Tax=Euteleostomi RepID=Q80Y20-2 Length = 629 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 34/234 (14%), Positives = 63/234 (26%), Gaps = 73/234 (31%) Query: 5 FADAEPWQ----EPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVA 60 F + W+ E L G +++ + E+ + + + + + Sbjct: 119 FVEKAQWKNMGLEALPPGLLVVEEIISSEEEKKLLESVNWTEDTGNQNF----------- 167 Query: 61 MTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 + H + Y +KP P Sbjct: 168 QRSLKHRRVKHFGYEFHYESNTVDKDKPLP------------------------------ 197 Query: 121 RYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 + H D I+S+SLG + F +++L ++V G Sbjct: 198 -----GGIPAHIDTHSA-FEDEIISLSLGSAIVMDFKHP--EGVTVQVMLPRRSLLVMTG 249 Query: 181 ESRL-FYHGIQPLKAGFHPLT-------------------IDCRYNLTFRQAGK 214 ESR + HGI P K + R + TFR+ + Sbjct: 250 ESRYLWTHGITPRKFDTVQASEQFKGGIITSDIGDLTLSKRGMRTSFTFRKVRR 303 >UniRef50_B6AFB9 Oxidoreductase, 2og-Fe(II) oxygenase family protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6AFB9_9CRYT Length = 332 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 46/156 (29%), Gaps = 14/156 (8%) Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 + G+ Y + K +P + R +PD IN Y G Sbjct: 148 RRVQHYGFGFDY-KNKIISPKWVRDIPIKIEMIINRLL-LHNIVTSRPDQITINEYIAGQ 205 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLPAIFQF-------GGLKRNDPLKRLLLEHGDVVVWG 179 + H D + I VSLG F + + + V Sbjct: 206 GIGPHIDSH-HTIGNYIAVVSLGSGVGMDFYELQLSDSKSFKKQKKHSIYIPKNSVYTMS 264 Query: 180 GESRL-FYHGIQPLKA---GFHPLTIDCRYNLTFRQ 211 R + HGI+ + + R +LTFR+ Sbjct: 265 SNIRYCWQHGIKKRYTDNIDGNIIKRHRRVSLTFRR 300 >UniRef50_A1ZXT1 Alkylated DNA repair protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZXT1_9SPHI Length = 189 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 70/197 (35%), Gaps = 21/197 (10%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLY 78 VI + + + + + + + + Q + + G Y Y Sbjct: 9 LVIYQNYF--DLKYCQQTFDKLLKEIKWLQKSHNNEGKIVDLPRLTANYGEK----SYNY 62 Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEP 137 S + +PW + + A+ Q +A ++ Y G +++ H D D Sbjct: 63 SGL-VFNPEPWTDFLLELKTVAENLASV------QFNALVLQYYRDGNDRVNWHSDDDSC 115 Query: 138 DLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQPLKAG 195 P IVS+S G F +D + L GD+V+ G+ + Y H + K Sbjct: 116 VGTNPVIVSMSFGESRDFWVRHKTHHDDRHKFTLHSGDIVIMQGDMQHVYVHKVPIEKDK 175 Query: 196 FHPLTIDCRYNLTFRQA 212 + R NLTFR+ Sbjct: 176 T-----EARLNLTFRKV 187 >UniRef50_Q7S1J6 Predicted protein n=3 Tax=Sordariales RepID=Q7S1J6_NEUCR Length = 290 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 35/269 (13%), Positives = 68/269 (25%), Gaps = 81/269 (30%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 + Q G +++ F E + F + ++ Sbjct: 11 PENPKIQRWDDIGLLLIHDFITEDEEAAMIAA--------FHAVDPRLDGKRRISQ---- 58 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG 125 + Y+ + + +P + R D+ PD Y PG Sbjct: 59 -----HFGYHFDYTTFGA-SETSFTPVPSYITDFLPRL----PVQDYLPDQFTAQYYPPG 108 Query: 126 AKLSLHQDKDEPDLRAPIVSVSLGL--PAIFQFGG------------------------- 158 A + H D + S+S G P +F+ Sbjct: 109 AGIPPHVDTHS-MFGEALYSLSYGSAVPMVFRLSDANDARKMRLPRRSLQSSVSESKTEG 167 Query: 159 ---------------------------LKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQ 190 +P L+L +++ G +R + HGI+ Sbjct: 168 ASGTNPESAATAPDSQTTMTVQSKSEEPSPENPSWELVLPPRSLLLMTGPARYGYTHGIK 227 Query: 191 PLKAG---FHPLTIDCRYNLTFRQAGKKE 216 K + RY++T R + + Sbjct: 228 SRKTDIINGEMVHRQGRYSITMRTIRRGD 256 >UniRef50_A5GWW3 Alkylated DNA repair protein n=2 Tax=Synechococcus RepID=A5GWW3_SYNR3 Length = 204 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 77/202 (38%), Gaps = 24/202 (11%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTP-GGYTMSVAMTNCGHLGWTTH-RQGY 76 + A L + + P++Q G + C W GY Sbjct: 20 LRHAPAWVDPATATLWLE--QLQQDVPWKQESIQLYGKRHPLPRLTC----WMADPGCGY 73 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD 135 YS +D PW + + ++ +G ++ ++ L+N Y G + H D + Sbjct: 74 RYSGLDNVVE-PWSP---TAQRIREQLNELSG---WRFNSLLLNLYRDGRDAMGFHADDE 126 Query: 136 -EPDLRAPIVSVSLGLPAIFQFGGLKRNDPL-KRLLLEHGDVVVWGGESR-LFYHGIQPL 192 E D API S+SLG+ F+F K + L L HG +++ ++ + HG+ Sbjct: 127 PELDPTAPIASLSLGVSRTFRFKPKKGHQGNDFDLELGHGALLLMDPPTQLHWLHGLPKR 186 Query: 193 KAGFHPLTIDCRYNLTFRQAGK 214 CR NLTFR + Sbjct: 187 LR-----VNQCRLNLTFRVVQQ 203 >UniRef50_D1ITZ2 Whole genome shotgun sequence of line PN40024, scaffold_5.assembly12x (Fragment) n=6 Tax=Embryophyta RepID=D1ITZ2_VITVI Length = 260 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 72/242 (29%), Gaps = 41/242 (16%) Query: 2 LDLFADAEPWQEPLAA--GAVILRRFAFNAAEQLIRDIND---VASQSPFRQMVTPGGYT 56 + ++ P EP++ G + R F + + + S++ Q + G Sbjct: 33 SSIHSEKNPSWEPISEINGLWLCRDFLSPQEQSSLLSAIEKEGWFSEASHNQAMRFGNLP 92 Query: 57 MSVAMTNCGHLGWT--THRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQP 114 + + + ++ +P + + Sbjct: 93 EWATELSHSIREVVLFSDYVSEHMDSVTCDGDEKGCLLPSEIL-----------WREPLF 141 Query: 115 DACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP---------- 164 D ++N Y PG + H D I +SL I F + + Sbjct: 142 DQLILNVYQPGEGICPHVDLMR--FEDGIAIISLESSCIMHFTHVDDTEACDSGREGRNY 199 Query: 165 ----LKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDC-----RYNLTFRQAGK 214 + L G +V+ GE+R + H I K GF R ++T R+ K Sbjct: 200 SPMTKIPVYLTPGSLVLMSGEARYFWKHEI-NRKPGFQIWEGQEIDQKSRTSITLRKLCK 258 Query: 215 KE 216 E Sbjct: 259 IE 260 >UniRef50_Q2UNX0 Predicted protein n=6 Tax=Trichocomaceae RepID=Q2UNX0_ASPOR Length = 335 Score = 110 bits (275), Expect = 3e-23, Method: Composition-based stats. Identities = 34/126 (26%), Positives = 47/126 (37%), Gaps = 10/126 (7%) Query: 91 AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSL 148 +P +L Q T P + LIN YA +S H D + P I S+SL Sbjct: 178 PIPPCLTHLLQTIQTTTNTPPDYYNFILINYYATNTDSISYHSDDERFLGPNPSIASLSL 237 Query: 149 GLPAIFQFGGLK--RNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRY 205 G F + L GD+VV GE++ + H I R Sbjct: 238 GAKRDFLLKHKPGVEAGKPLKFPLASGDMVVMRGETQGNWLHSIPKRA-----GEGGGRI 292 Query: 206 NLTFRQ 211 N+TFR+ Sbjct: 293 NVTFRR 298 >UniRef50_A5FII3 DNA-N1-methyladenine dioxygenase n=2 Tax=Flavobacterium johnsoniae UW101 RepID=A5FII3_FLAJ1 Length = 208 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 67/216 (31%), Gaps = 27/216 (12%) Query: 5 FADAEPWQEPLAAGAVILRRFAFNAA-EQLIRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 A + + P + +++ F ++ + + + T +V Sbjct: 14 LAGKKVFDIPDSE-LILIDNFFTKEESDRFYERLLRKTKWREYEMEIY--DKTYTVPRM- 69 Query: 64 CGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA 123 + W +P + P + R + L+N Y Sbjct: 70 ---IAWYED-------KDNPGADLKGPDWNYELLTIRGRVEKETQQD---FNTVLLNLYR 116 Query: 124 PG-AKLSLHQDKDEPDLRAPI-VSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGG 180 G + H DK+ PI SV+ G +F+ + P + L HG ++ G Sbjct: 117 DGNDGVGWHSDKEHNTGPNPIIASVTFGETRMFRLRHKYSKEIPQIEIPLHHGSFLLMAG 176 Query: 181 ESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKK 215 + + H + P R NLTFRQ + Sbjct: 177 TTNSFWQHQVPKTARNVLP-----RINLTFRQTHRN 207 >UniRef50_D2VF66 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VF66_NAEGR Length = 271 Score = 110 bits (274), Expect = 4e-23, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 68/223 (30%), Gaps = 37/223 (16%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLG---WTTHRQG 75 + F + ++ F + ++ G + + Sbjct: 60 VRYIPNFLSRQESTKLFNVL--LQTCEFEKGKFKIFGKEIISNRQISAFGERDYEPLLKE 117 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 YS + P+ +L +R G LINRY G + + H D Sbjct: 118 ENYSKHHRRP--VHDEWPKELTDLKERIEKYTGD---TFTFALINRYDTGESSIGWHSDM 172 Query: 135 D-EPDLRAPIVSVSLGLPAIFQFGGLKRND-------------------PLKRLLLEHGD 174 + + + IVS+SLG F+F + + LE+G Sbjct: 173 EQDIKKDSSIVSISLGAARDFKFRPTPKKENSKKSPTKKDEESEEEEKVQTITQKLENGS 232 Query: 175 VVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 +V+ ++ Y H I K R N+TFR + + Sbjct: 233 MVIMNYATQRHYQHSIPKRK-----NLNSVRLNITFRHVVRNK 270 >UniRef50_Q9SIE0 Expressed protein n=10 Tax=Magnoliophyta RepID=Q9SIE0_ARATH Length = 314 Score = 110 bits (274), Expect = 4e-23, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 73/235 (31%), Gaps = 43/235 (18%) Query: 6 ADAEPWQEPLAAG----AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 + P ++ + G + ++RF D D P+ + + Sbjct: 90 SSNAPSRKTIDLGHGSDLIYIQRFLPFQQSWTFFDYLD--KHIPWTRPTIRVFGRSCLQP 147 Query: 62 TNCGHLGWTTHRQGYL-YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLIN 120 + + + L YS P T+ W P + P + ++ L+N Sbjct: 148 RDTCY--VASSGLTALVYSGYRP-TSYSWDDFPP-LKEILDAIYKV--LPGSRFNSLLLN 201 Query: 121 RYA-PGAKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRND--------------- 163 RY ++ H D ++ P I SVS G F K + Sbjct: 202 RYKGASDYVAWHADDEKIYGPTPEIASVSFGCERDFVLKKKKDEESSQGKTGDSGPAKKR 261 Query: 164 -------PLKRLLLEHGDVVVWGG-ESRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 + L L+HG ++V G R + H + R NLTFR Sbjct: 262 LKRSSREDQQSLTLKHGSLLVMRGYTQRDWIHSVPKRAKAE-----GTRINLTFR 311 >UniRef50_A4I1X6 Putative uncharacterized protein n=3 Tax=Leishmania RepID=A4I1X6_LEIIN Length = 563 Score = 110 bits (274), Expect = 4e-23, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 63/225 (28%), Gaps = 34/225 (15%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 P G ++ F E+ I +++ P QM ++ H Sbjct: 332 AVPDVPGLFLVEDFVTADEEKTIW--HELHHGRPRLQMEY-------LSRRRVAHFNRRF 382 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP----------DFQPDACLINR 121 G + P+ Q A G D + D +N Sbjct: 383 L-YGVNALTAEGDVTNARPSFYVWMRARLQNDMAAGGVRIDGDYPFRPGDHECDQLTVNY 441 Query: 122 YAPGA----KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGG---LKRNDPLKRLLLEHGD 174 Y ++ H D + VSLG + +F + L Sbjct: 442 YDYSEVGACGIAAHVDAHNA-FDDAVFIVSLGSYTVMEFSRWDAPAEVAAPVGVYLAPRS 500 Query: 175 VVVWGGESRL-FYHGIQPLKAGFHPL-----TIDCRYNLTFRQAG 213 +VV GE+R + H I + + R +LT+R+ Sbjct: 501 LVVIAGEARYGWTHCIAEKRTDTLSELLPTFSRGDRMSLTWRRGR 545 >UniRef50_B8CDM4 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8CDM4_THAPS Length = 222 Score = 110 bits (274), Expect = 5e-23, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 69/223 (30%), Gaps = 26/223 (11%) Query: 9 EPWQEPLA-AGAVILRRFAFNAAEQL----IRDINDVASQSPFRQMVTPGGYTMSVAMTN 63 + + PL F + + + + VT + + + Sbjct: 2 DEFLIPLRTPCVYHDPNFLPTDEATAAYQDLLENTPWEKTAKINRWVTLMELPKNASNVD 61 Query: 64 CGHL--GWTTH---RQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDA-- 116 G T GY Y +T +P L + + D +A Sbjct: 62 EKEEADGDDTDEKKGTGYRYRDAPGETIIGFPPTVYKLKLLAEEWYNSKQTNDGTANAEA 121 Query: 117 ------CLINRYAPG-AKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP-LKRL 168 CL+N Y G ++ H D++E PI S+SLG F L Sbjct: 122 KVEFNVCLLNYYQDGTQRIGWHSDREELGRTTPIASISLGATRSFLIRSQTDGVHDRASL 181 Query: 169 LLEHGDVVVWGG-ESRLFYHGIQPLKAGFHPLTIDCRYNLTFR 210 LE+G +VV R + H + + R NLTFR Sbjct: 182 DLENGSIVVMENVCQREYVHSVPK-----EGEVVGGRINLTFR 219 >UniRef50_A1K994 DNA repair system specific for alkylated DNA n=1 Tax=Azoarcus sp. BH72 RepID=A1K994_AZOSB Length = 194 Score = 109 bits (273), Expect = 5e-23, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 64/203 (31%), Gaps = 23/203 (11%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTH-RQG 75 + + AA+ L + + T G + C W + Sbjct: 8 PLLALTPLYDATAAQALFDTLCAEIPWNDGDY--TAAGRRFRLPRLQC----WFSDPGAT 61 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDK 134 Y Y+ + PW + L R +G + +A L N Y G + H D Sbjct: 62 YRYADN-LMNSHPWTP---TLAALRARVEAVSGV---RFNAVLANLYRDGEDAVGWHADD 114 Query: 135 DEPDLRAP-IVSVSLGLPAIFQFGGLKR-NDPLKRLLLEHGDVVVWGGE-SRLFYHGIQP 191 ++ AP I S+SLG F + L L G +++ + + H + Sbjct: 115 EDDLGPAPHIASLSLGATRRFHWRPKPGVVGEADALPLPAGTLLLMRAPFQQQWEHAVP- 173 Query: 192 LKAGFHPLTIDCRYNLTFRQAGK 214 P R NLTFR Sbjct: 174 ----AEPAVRGARLNLTFRNVVA 192 >UniRef50_D1Z416 Whole genome shotgun sequence assembly, scaffold_2 n=1 Tax=Sordaria macrospora RepID=D1Z416_SORMA Length = 358 Score = 109 bits (273), Expect = 5e-23, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 68/206 (33%), Gaps = 45/206 (21%) Query: 46 FRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAAT 105 F ++ L W T Y ++ P P P+ + Sbjct: 159 FEPKDPSVHKPLTFQQVFNRKLHWVTLGGQYDWTNRVYPGELP-PEFPKDISGFLETL-- 215 Query: 106 AAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIF-----QFGGLK 160 +P+ A ++N Y PG + +H+D E ++S+S+G ++F +G + Sbjct: 216 ---FPETLAQAAIVNFYTPGDTMMMHRDVSEET-DKGLISLSIGCDSLFMICPEDWGKVS 271 Query: 161 RNDPLKR---------------LLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTID--- 202 + K L L GDV+ ESR +HG+ + G P ++ Sbjct: 272 AEEKQKETKESETGKSEKKFLLLRLRSGDVIYMTKESRFAWHGVPKIFKGTCPEWLEDWP 331 Query: 203 ---------------CRYNLTFRQAG 213 R N+ RQ Sbjct: 332 AEDGKYEAWRGWMKNKRININVRQMR 357 >UniRef50_Q6NS38 Alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 n=20 Tax=Eumetazoa RepID=ALKB2_HUMAN Length = 261 Score = 109 bits (273), Expect = 6e-23, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 66/198 (33%), Gaps = 24/198 (12%) Query: 25 FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQ 84 F A+++ +++ G Y +S + Sbjct: 73 FGKAEADEIFQELEKEVEYFTGALARVQVFGKWHSVPRKQATYGDA--GLTYTFSGLTL- 129 Query: 85 TNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPDLRAP 142 + KPW + + + G + LINRY G + H+D + E +P Sbjct: 130 SPKPWIPV---LERIRDHVS---GVTGQTFNFVLINRYKDGCDHIGEHRDDERELAPGSP 183 Query: 143 IVSVSLGLPAIFQFGGLKRNDP-------LKRLLLEHGDVVVWGGESR-LFYHGIQPLKA 194 I SVS G F F + RL L HG +++ + +YH + K Sbjct: 184 IASVSFGACRDFVFRHKDSRGKSPSRRVAVVRLPLAHGSLLMMNHPTNTHWYHSLPVRKK 243 Query: 195 GFHPLTIDCRYNLTFRQA 212 P R NLTFR+ Sbjct: 244 VLAP-----RVNLTFRKI 256 >UniRef50_D1HRN8 Whole genome shotgun sequence of line PN40024, scaffold_34.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1HRN8_VITVI Length = 439 Score = 109 bits (272), Expect = 6e-23, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 67/213 (31%), Gaps = 28/213 (13%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G + R + ++ I + + Q T S G T + G Sbjct: 160 GLELHTRVFNSEEQKKIVECV--YNLQRMGQKGMLRERTYSEPKKWMRGKGRVTIQFGCC 217 Query: 78 YSPIDPQTNKP--------WPAMPQSFHNLCQRAATAAGYPDF-QPDACLINRYAPGAKL 128 Y+ + P +P F + +R P P++C++N Y G + Sbjct: 218 YNYAVDKNGNPPGIIREEEVDPLPPLFKQMIKRMVRWHILPPTCVPNSCIVNIYDEGDCI 277 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRN------DPLKRLLLEHGDVVVWGGE- 181 H D D P +VS FG + + L G V++ G Sbjct: 278 PPHIDHH--DFLRPFCTVSFLTECNILFGSSLKILDAGEFSGPVSISLPKGSVLILNGNG 335 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + + H + + R ++TFR+ + Sbjct: 336 ADVAKHCVPAV--------PAKRISITFRKMDE 360 >UniRef50_A2Q3T7 2OG-Fe(II) oxygenase n=2 Tax=Medicago truncatula RepID=A2Q3T7_MEDTR Length = 497 Score = 109 bits (272), Expect = 7e-23, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 71/216 (32%), Gaps = 30/216 (13%) Query: 18 GAVILRR-FAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG- 75 G + F ++++ I + + Q T S G T + G Sbjct: 209 GLELHTDVFNATEQDEIVEYIYGLQRRG---QQGRLRDRTYSKPRKWMRGKGRETLQFGC 265 Query: 76 -YLYSPIDPQTN------KPWPAMPQSFHNLCQRAATAAGYPDF-QPDACLINRYAPGAK 127 Y Y+ + +P F + +R P PD+C++N Y G Sbjct: 266 CYNYAVDKYGNPPGICRTEEVDPLPDVFKQMIKRMVRWNIIPPTCVPDSCIVNIYDVGDC 325 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP------LKRLLLEHGDVVVWGGE 181 + H D D P SVS A FG + + L G V V G Sbjct: 326 IPPHIDHH--DFVRPFYSVSFLNEAKILFGSNLKEIQPGEFSGPASISLPLGSVFVLNGN 383 Query: 182 -SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + + H I + + R ++TFR+ +++ Sbjct: 384 GADIAKHCIPSVSSK--------RISITFRKMDERK 411 >UniRef50_A9G7L2 High confidence in function and specificity n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G7L2_SORC5 Length = 174 Score = 109 bits (272), Expect = 8e-23, Method: Composition-based stats. Identities = 38/155 (24%), Positives = 58/155 (37%), Gaps = 16/155 (10%) Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 + Y YS I P MP +L R A G+P + CL N Y Sbjct: 33 RMHSRRTASCGVAYNYSGISY----PDCEMPPFVQDLAARLAGVVGHP---INNCLANFY 85 Query: 123 APG-AKLSLHQDKDEPDLRAPIVS-VSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGG 180 A G AK+ H D + S +SLG P + F R + + L G +++ Sbjct: 86 ADGTAKMGFHSDSSAGVVAGTTTSILSLGAPRVLTFRRSLRRNETHDMALAPGSLLIMRP 145 Query: 181 ESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + + H + ++A R +LTFR + Sbjct: 146 SVQEGWQHAVLAVEAA------GPRISLTFRFLSR 174 >UniRef50_C5BTC2 Putative alkylated DNA repair protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTC2_TERTT Length = 211 Score = 109 bits (271), Expect = 9e-23, Method: Composition-based stats. Identities = 41/179 (22%), Positives = 60/179 (33%), Gaps = 18/179 (10%) Query: 37 INDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSF 96 ++ + P++Q Y Y+ T Sbjct: 42 LSQLEQSIPWQQDSFVSFDRRFTIPRMQAWF--ADDGLQYRYADNLMHT----QPWLPEL 95 Query: 97 HNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIF 154 L Q A +A L Y G ++ H D + AP I S+SLG F Sbjct: 96 LQLRQIINNATQCE---FNAVLATLYRHGNDHVTWHSDDERELGYAPVIASLSLGATRCF 152 Query: 155 QFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 QF K ND + L HGD++V + H + P P ++ R NLTFR+ Sbjct: 153 QFRH-KENDTKGEISLHHGDLIVMEPAFQHYWEHQVPP-----QPDVLEPRINLTFRRV 205 >UniRef50_C1MXD4 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MXD4_9CHLO Length = 377 Score = 109 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 64/218 (29%), Gaps = 30/218 (13%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCG 65 A P G +L F E + D G + + N Sbjct: 165 AATSTSNTPSLPGHHLLLDFITEDEENALVAFLDDGE---------RGIHDWKPSTFNGA 215 Query: 66 HLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAA-TAAGYPDFQPDACLINRYAP 124 H G G + P MP + ++ A F P+ Y Sbjct: 216 HRGKAW---GVRVDLKRRTVSPPTREMPPRLLAVAEKMRGAHALLARFSPNEANAISYDK 272 Query: 125 --GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF--------GGLKRNDPLKRLLLEHGD 174 G +L H D + +V++SL + + GG R + L Sbjct: 273 RLGDRLLSHVDDRQLS-SDVLVNLSLCGECVMTYERTTTRSSGGGTRGSDRVDVRLPRRS 331 Query: 175 VVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQ 211 + + G++R + H I L R ++TFR+ Sbjct: 332 LQIQSGDARYAFAHSIA-----NENLLDPRRVSITFRE 364 >UniRef50_D2V4D7 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V4D7_NAEGR Length = 292 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 42/220 (19%), Positives = 71/220 (32%), Gaps = 38/220 (17%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL- 77 + RF + + D + + V+ G R Sbjct: 79 VRYMERFLSQRESKELFDCL--MEKCEWTSPKYNMYGKDVVSKRKVAFFGKPITRNETDS 136 Query: 78 -YSPIDPQTNKPWPAM-------PQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAK-L 128 D ++ + M P+ NL +R G + +A LINRY G + Sbjct: 137 VVDTNDGSVSRVYNDMVKLQRDWPEPLLNLKKRLEELIGE---KYEAALINRYDDGDDLI 193 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGL---------------KRNDPLKRLLLEHG 173 H D++ I SVSLG FQ + ++ + LE+G Sbjct: 194 GWHADREASGFS--IASVSLGASRDFQLRPMPKQNTNSSNTTQSPIQKKGEIITKSLENG 251 Query: 174 DVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQA 212 +++ G ++ Y H + K + R N+TFR Sbjct: 252 CLLIMNGATQKHYQHCVPKRK-----GVLSARLNITFRDI 286 >UniRef50_Q5CYU2 F27M3_19 plant like RRM plus AlkB domain containing protein n=2 Tax=Cryptosporidium RepID=Q5CYU2_CRYPV Length = 350 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 64/216 (29%), Gaps = 42/216 (19%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G V++ F + D D G + H + + Sbjct: 123 GLVLVEDFINKLEAIELLDWID------------NNGQWETKLNRKVQH-----YGYSFD 165 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEP 137 Y+ ++ +P + L +R + + PD IN Y G + H D Sbjct: 166 YNNKTI-SSVWERDIPPILNRLIERMLSLKIITE-VPDQITINEYEVGKGIGPHIDSH-H 222 Query: 138 DLRAPIVSVSLGLPAIFQFGGLKRNDPL------------------KRLLLEHGDVVVWG 179 + I +SLG +F+F L + + + + + + Sbjct: 223 TIGENISVISLGSGILFEFNELSKRKNPDCSSKEGSGSRKYDRISKRTVYIPENSLYIMK 282 Query: 180 GESRL-FYHGIQPLKAGFHP---LTIDCRYNLTFRQ 211 E R + HGI+ R ++T R+ Sbjct: 283 NEIRYAWEHGIKSRMYDKIQGKFQQRKRRVSITIRK 318 >UniRef50_A9TLH2 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TLH2_PHYPA Length = 697 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 56/248 (22%), Positives = 91/248 (36%), Gaps = 53/248 (21%) Query: 7 DAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH 66 + + L G V+LR + +Q + + + A+ F++ T G + Sbjct: 454 ERKSTVTILQPGMVLLRSWLSLDIQQRLVNESQSAAHL-FKRPTTASGGKYHLWQM---- 508 Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYP--------DFQPDACL 118 + + P ++L + A A +F+PD L Sbjct: 509 ----AFGCSWDSKTRRYAAPERGLRFPVWMYDLGRELAFDAQKHTPVYAQGSNFEPDVAL 564 Query: 119 INRYAPGA------KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFG--------------- 157 +N Y L HQD D+ P+VSVS+G F + Sbjct: 565 VNFYPAKDEELGVVGLGGHQDLDDYC-DMPVVSVSVGDSMTFFYRRFPPQSRRKSGVQII 623 Query: 158 -----------GLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI---DC 203 G ++ K+++L GDV+V+GGESRL YHG + ++ G P + Sbjct: 624 VDEYAAQCCKDGDTAHNSEKKIILASGDVLVFGGESRLVYHGTRCVQPGTRPPGLHMAPG 683 Query: 204 RYNLTFRQ 211 R N TFRQ Sbjct: 684 RLNFTFRQ 691 >UniRef50_UPI00017458F4 Alkylated DNA repair protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017458F4 Length = 187 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 34/159 (21%), Positives = 50/159 (31%), Gaps = 16/159 (10%) Query: 59 VAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACL 118 + Q Y S I + M LCQ+ G F P CL Sbjct: 30 IWDERMKSRKTACFGQTYDDSGIAYEE----VPMHALLAPLCQKLTATLG---FAPTNCL 82 Query: 119 INRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVV 176 IN Y G + + H D + +SLG + F + L G ++ Sbjct: 83 INYYENGRSSMGFHSDATYNLADDTGVAIISLGAERVLTFRSKSTPNLEHAFALPSGSLL 142 Query: 177 VWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 ++ + H I+ D R +LTFR K Sbjct: 143 YMTQATQAHWMHAIKKTDTD------DARISLTFRHILK 175 >UniRef50_C9SM10 DNA repair family protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SM10_VERA1 Length = 287 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 36/147 (24%), Positives = 52/147 (35%), Gaps = 19/147 (12%) Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEP 137 P P +P L + A A G + CL+N YA G ++ H D + Sbjct: 112 GPDRRYDRIPPRPLPACLDALRRSAEAATGC---AFNVCLVNYYATGADSIAFHSDDERF 168 Query: 138 DLRAP-IVSVSLGLPAIFQFGG---LKRNDPL--------KRLLLEHGDVVVWGGESR-L 184 AP I S SLG F R D R L GD+++ G ++ Sbjct: 169 LGPAPAIASFSLGARRDFLLKHKPCPPRGDAPSPRPALGTLRFPLGSGDMLLMRGATQAN 228 Query: 185 FYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 + H + R N+TFR+ Sbjct: 229 WLHSVPKRSGRHAED--GGRINITFRR 253 >UniRef50_C7PPT9 2OG-Fe(II) oxygenase n=2 Tax=Bacteroidetes RepID=C7PPT9_CHIPD Length = 170 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 57/199 (28%), Gaps = 34/199 (17%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G + F I + + Y Sbjct: 3 GITFISDFVAAPEALFI------------------SLKDNVLWDERMTARKTASFGVAYN 44 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDE 136 YS I P+ A + G F P+ CLIN Y G +K+ H D+ + Sbjct: 45 YSQISY----PFQAFTPELQEIVTAITATLG---FTPNNCLINYYPDGKSKMGYHADQTD 97 Query: 137 -PDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKA 194 + I VS+G +F ++ L L G ++ + + H I Sbjct: 98 ILEAGTGIAIVSVGETRTLRFKNIQDPTELVDFPLNAGSLIYMTQAVQDEWLHAIP---- 153 Query: 195 GFHPLTIDCRYNLTFRQAG 213 T R +LTFR Sbjct: 154 --AADTGQGRMSLTFRSIK 170 >UniRef50_B8J7A0 2OG-Fe(II) oxygenase n=3 Tax=Anaeromyxobacter RepID=B8J7A0_ANAD2 Length = 204 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 49/213 (23%), Positives = 70/213 (32%), Gaps = 30/213 (14%) Query: 2 LDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAM 61 L+L P L G + R + A Q + Sbjct: 15 LELVPSPVPA---LPPGMRLWR------------ALLPAAEQQALLAALARLELGEVRMH 59 Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 + Y Y +P P P + L +RAA AG L+ R Sbjct: 60 GVIARRRVAHFGRAYAYDARAV---QPGPPFPAALEPLRRRAAALAGVAPAALAEALVTR 116 Query: 122 YAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE 181 Y PGA + H+D P +V VSLG PA F+ +LLE G + G Sbjct: 117 YPPGAGIGWHRD--APAFGQ-VVGVSLGAPARFRMREGGPGGRALEVLLEPGSAYLLAGA 173 Query: 182 SRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 +R + H I P+ R+++TFR Sbjct: 174 ARWRWQHAIPPV--------PAERWSVTFRTLR 198 >UniRef50_Q5RJC7 At4g36090 n=6 Tax=Magnoliophyta RepID=Q5RJC7_ARATH Length = 520 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 63/216 (29%), Gaps = 24/216 (11%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 + G + ++ I D + R + +T Sbjct: 216 ILEGLELHTGVFSAVEQKKIVDFVYELQEKGRRGELRERTFTAPHKWMRGKGRVTIQFGC 275 Query: 75 GYLYSPIDPQTNKP------WPAMPQSFHNLCQRAATAAGYPDF-QPDACLINRYAPGAK 127 Y Y+P MP F + +R P PD+C++N Y Sbjct: 276 CYNYAPDKAGNPPGILQRGDVDPMPSIFKVIIKRLVGWHVLPPTCVPDSCIVNIYEEDDC 335 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL------KRLLLEHGDVVVWGGE 181 + H D D P +VS FG + + L G V+V G Sbjct: 336 IPPHIDNH--DFLRPFCTVSFLSECNILFGSNLKVLGPGEFSGSYSIPLPVGSVLVLKGN 393 Query: 182 -SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + + H + + R ++TFR+ + + Sbjct: 394 GADVAKHCVPAV--------PTKRISITFRKMDESK 421 >UniRef50_C5CMR5 2OG-Fe(II) oxygenase n=1 Tax=Variovorax paradoxus S110 RepID=C5CMR5_VARPS Length = 202 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 51/217 (23%), Positives = 71/217 (32%), Gaps = 34/217 (15%) Query: 3 DLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 DLF +A G R F A E + + R Sbjct: 14 DLFGEAPAAAIE---GLRYEREFLTRAEEAELLRLVQGFELREMRY------------KE 58 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 + Y + +P AMP + H L R A G LI+ Y Sbjct: 59 YTARRRGISFGGSYDFDK---HRLRPGAAMPPALHPLRARVAAWMGMAPEDFAHMLISEY 115 Query: 123 APGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGL----KRNDPLKRLLLEHGDVVVW 178 PG L H+D PD IV VSL A+ Q +LL+E + + Sbjct: 116 RPGTPLGWHRDV--PDFED-IVGVSLQGDAVMQLRPYPPASASAPASLQLLIEPRSIYML 172 Query: 179 GGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 GE+R + H I P +A RY++T R + Sbjct: 173 RGEARWAWQHSIAPTEA--------LRYSITMRTRRR 201 >UniRef50_A8J903 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8J903_CHLRE Length = 398 Score = 107 bits (266), Expect = 4e-22, Method: Composition-based stats. Identities = 48/197 (24%), Positives = 66/197 (33%), Gaps = 54/197 (27%) Query: 45 PFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQR-- 102 P + A L W T + ++ + +P S +L ++ Sbjct: 202 PSYEQCWSRDGPGPAAEQLLRKLRWATLGPQFDWTERQYDFTGAYRQLPPSLSDLARQMA 261 Query: 103 ---------------AATAAGYP------------------------------------- 110 AA A P Sbjct: 262 AVVDALQAAGMQLMPAAAEAEVPQQPAAPRVQATEPSQAAAGAVSAGLAPPTGAAPAAPR 321 Query: 111 DFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLL 170 ++PDA ++N Y G L H D E DL PIVSVSLG PA+F GG + LLL Sbjct: 322 GYEPDAAIVNYYQIGDVLGGHVDDVESDLAQPIVSVSLGCPALFLMGGRTKATHPSALLL 381 Query: 171 EHGDVVVWGGESRLFYH 187 GDV+V G++R YH Sbjct: 382 RGGDVLVLAGQARSCYH 398 >UniRef50_A6DH75 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DH75_9BACT Length = 196 Score = 106 bits (265), Expect = 4e-22, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 60/196 (30%), Gaps = 21/196 (10%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ-GYL 77 + F ++ + + P++ W Y Sbjct: 13 IIYHPHFFTDSEASQLFSELE--KDLPWQCDKIRIMGKEHFIPRLHA---WLADPNIHYN 67 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDE 136 YS ID + N PW Q L A + + ++ L N Y G H D ++ Sbjct: 68 YSGIDLKIN-PWT---QQVLKLKTLAEDKS---HWTFNSMLANYYRDGKDSNGWHADNEK 120 Query: 137 PDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGE-SRLFYHGIQPLKA 194 R P I S G F + + L +G +++ G H ++ K Sbjct: 121 ELGRNPLIAMFSFGQIRRFSIRSNENHKNKLDFDLNNGSLIIMKGPLQHTSQHCLRKTKK 180 Query: 195 GFHPLTIDCRYNLTFR 210 D R +LTFR Sbjct: 181 -----KCDARISLTFR 191 >UniRef50_C5KHM6 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KHM6_9ALVE Length = 252 Score = 106 bits (265), Expect = 5e-22, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 60/203 (29%), Gaps = 26/203 (12%) Query: 12 QEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 + + G V+L A E + ++ + Sbjct: 49 SDIVPPGLVVLADAITEAEEATLLGDI--------------YARPWKLSQS---GRRKQD 91 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG-YPDFQPDACLINRY--APGAKL 128 + + + + +P S + R T G D ++ Y + G+ + Sbjct: 92 YGPQVNFKKRKLKCPDNFQGLPHSIDLVLPRIHTGLGLLLDHAWHEMVVQEYAVSRGSSI 151 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYH 187 LH D I+ +SL I F K + L + G S+ + H Sbjct: 152 DLHVD-HSWVWADGILDLSLAADCIMAFANPK-EGVYYDVGLPRRSACLIAGLSQTQWMH 209 Query: 188 GIQPLKAGFHPLTIDCRYNLTFR 210 GI K L D R ++T R Sbjct: 210 GI---KRDNACLGGDTRVSITLR 229 >UniRef50_A4RVX0 Predicted protein n=2 Tax=Ostreococcus RepID=A4RVX0_OSTLU Length = 347 Score = 106 bits (265), Expect = 5e-22, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 67/203 (33%), Gaps = 26/203 (12%) Query: 17 AGAVILRRFAFNAAEQLIRDINDV-ASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQG 75 G IL F E+ I D D + P+R G + G + Sbjct: 150 PGHYILENFITEDEERRIVDWLDDDIAAGPWRDSSFNGAHQG-------KKYGVEPNLLK 202 Query: 76 YLYSPIDPQTNKPWPAMPQSFHNLC--QRAATAAGYPDFQPDACLINRYAP--GAKLSLH 131 P MP+ +L + AA F P+ C Y G+ L+ H Sbjct: 203 RCVEPARV-------PMPKILRDLVVAKFAAAHETLKHFTPNECNAINYRKDLGSVLTPH 255 Query: 132 QDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY-HGIQ 190 D + +V++SL + K + L + + G +R Y H I Sbjct: 256 CDDRQLS-SDILVNLSLCSDCTMTYSHEKFASKRVDVRLPRRSLQIQSGSTRYDYMHSIA 314 Query: 191 PLKAGFHPLTIDCRYNLTFRQAG 213 L + R ++TFR++G Sbjct: 315 -----NENLHGNRRVSVTFRESG 332 >UniRef50_A6SM11 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6SM11_BOTFB Length = 216 Score = 106 bits (264), Expect = 6e-22, Method: Composition-based stats. Identities = 33/137 (24%), Positives = 50/137 (36%), Gaps = 18/137 (13%) Query: 88 PWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVS 145 P +P+ +L A G + + CL+N YA G +S H D + P I S Sbjct: 49 PPRPIPKCLDDLRLSTEMATGC---KFNFCLVNYYASGSDSISYHSDDERFLGPLPAIAS 105 Query: 146 VSLGLPAIFQFGGL----------KRNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKA 194 SLG F +L L GD+++ G ++ + H I Sbjct: 106 YSLGARRDFLMKHKPIPPNDNAPLPPETKPIKLPLASGDMILMRGRTQANWLHSIPKRT- 164 Query: 195 GFHPLTIDCRYNLTFRQ 211 R N+TFR+ Sbjct: 165 -GKNADDGGRINITFRR 180 >UniRef50_A8MRW3 Uncharacterized protein At2g17970.2 n=14 Tax=rosids RepID=A8MRW3_ARATH Length = 433 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 65/214 (30%), Gaps = 24/214 (11%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G + ++ I D + R + +T Y Sbjct: 140 DGLELHTGVFSAVEQKRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQFGCCY 199 Query: 77 LYSPIDPQTN------KPWPAMPQSFHNLCQRAATAAGYPDF-QPDACLINRYAPGAKLS 129 Y+P + +P F + ++ P PD+C++N Y G + Sbjct: 200 NYAPDRAGNPPGILQREEVDPLPHLFKVIIRKLIKWHVLPPTCVPDSCIVNIYDEGDCIP 259 Query: 130 LHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL------KRLLLEHGDVVVWGGE-S 182 H D D P ++S FG + + + L G V+V G + Sbjct: 260 PHIDNH--DFLRPFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNGA 317 Query: 183 RLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + H + + R ++TFR+ + + Sbjct: 318 DVAKHCVPAV--------PTKRISITFRKMDESK 343 >UniRef50_C1BKL6 Alkylated repair protein alkB homolog 3 n=5 Tax=Euteleostomi RepID=C1BKL6_OSMMO Length = 304 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 64/205 (31%), Gaps = 28/205 (13%) Query: 19 AVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLY 78 + F + + + ++ P+ Q G Y Y Sbjct: 86 LRLFTEFLPVEEADWM--FSKLLAELPWSQKTNYRQGESYGEPRLTCWYGELP----YTY 139 Query: 79 SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYA-PGAKLSLHQDKDEP 137 + + N W + L Q +A+G ++ L N Y + H D + Sbjct: 140 AHSTMEANTQWHPL---LLTLRQAVDSASG---SSFNSLLCNLYRNESDSIGWHSDDEAS 193 Query: 138 DLRAP-IVSVSLGLPAIFQFGGLKRNDP--------LKRLLLEHGDVVVWGGESRL-FYH 187 P I S+SLG +F + R+ L HG +++ G ++ + H Sbjct: 194 LGIKPTIASLSLGDTRVFSLRKKPPPEENGDYTYMERLRVPLAHGTLLLMEGATQDDWQH 253 Query: 188 GIQPLKAGFHPLTIDCRYNLTFRQA 212 + + R NLTFR Sbjct: 254 QVAKEY-----HSRGPRINLTFRTI 273 >UniRef50_Q57WP7 Putative uncharacterized protein n=2 Tax=Trypanosoma brucei RepID=Q57WP7_9TRYP Length = 454 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 64/217 (29%), Gaps = 33/217 (15%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G I+ F ++ I + + + S F +A + H + G Sbjct: 227 PGLYIVSDFITHSEHNAIWNELNGDAASHFEVEH--------LARRDVAHFNRRFY-YGI 277 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ-------PDACLINRYAPGA--- 126 + PA + ++ D +N Y Sbjct: 278 NRVGAEGVQVNSRPAFYDWMAQRLCNTDSEVKIHNYPMKQHPEYFDQLTVNYYNYDDPKS 337 Query: 127 ----KLSLHQDKDEPDLRAPIVSVSLGLPAIFQFG---GLKRNDPLKRLLLEHGDVVVWG 179 ++ H D + I VSLG + +F +L++ +++ Sbjct: 338 KLVPGIARHVDSHDA-FGDYIAIVSLGSHTVIEFSRYNRPPDVFAPLGVLVKPCSLLLLT 396 Query: 180 GESRL-FYHGIQPLKAGFH-----PLTIDCRYNLTFR 210 GE+R + H I + PL R +LT+R Sbjct: 397 GEARYCWTHCIVEKREDVLNDQLPPLQRGNRLSLTWR 433 >UniRef50_B8M368 Oxidoreductase, 2OG-Fe(II) oxygenase family, putative n=1 Tax=Talaromyces stipitatus ATCC 10500 RepID=B8M368_TALSN Length = 332 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 58/196 (29%), Gaps = 42/196 (21%) Query: 35 RDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQ 94 + F M++ L W T Y ++ +P P P+ Sbjct: 161 ISFFGADPTTTFAPKDPHVHKPMTIQNMLDKKLRWITFGGQYNWTTKVYPEGQP-PPFPE 219 Query: 95 SFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIF 154 +L + A +P+ + A ++N Y+ +G A+F Sbjct: 220 DIAHLLR-----ASFPETEAQAAIVNFYSANDTF-------------------IGCDALF 255 Query: 155 QFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT-------------- 200 + + + L GD V G+SR +HG+ + P Sbjct: 256 MISH-DDGEGCEVIRLRSGDAVYMSGQSRFAWHGVPKIIPDTCPKWLCDWPGPGYPYWQG 314 Query: 201 --IDCRYNLTFRQAGK 214 R NL RQ + Sbjct: 315 WMGRKRVNLNVRQMME 330 >UniRef50_B8LLT0 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=B8LLT0_PICSI Length = 496 Score = 105 bits (263), Expect = 9e-22, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 66/216 (30%), Gaps = 24/216 (11%) Query: 15 LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ 74 + G + A ++ + + ++ + Y+ Sbjct: 211 ILEGLELHTGVFNAAEQRRLVAFIYQLQEQGRKKQLRERTYSEPRKWMRGKGRITIQFGC 270 Query: 75 GYLY------SPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQ-PDACLINRYAPGAK 127 Y Y +P ++ +P F +R P PD+C++N Y G Sbjct: 271 CYNYAVDKNGNPPGIVRDEEVDPLPPLFKAAIRRMVRWHVLPPSCIPDSCIVNIYDEGDC 330 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL------KRLLLEHGDVVVWGGE 181 + H D D P +VSL FG + + L G V++ G Sbjct: 331 IPPHIDHH--DFVRPFCTVSLLSECNIIFGSNLKILGPGEFAGSTAIPLPMGSVLILNGN 388 Query: 182 -SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + + H + + R ++TFR+ + Sbjct: 389 GADVAKHSVPAV--------PCKRISITFRKMDHSK 416 >UniRef50_B2AYU6 Predicted CDS Pa_1_12280 (Fragment) n=5 Tax=Leotiomyceta RepID=B2AYU6_PODAN Length = 320 Score = 105 bits (263), Expect = 9e-22, Method: Composition-based stats. Identities = 41/152 (26%), Positives = 59/152 (38%), Gaps = 26/152 (17%) Query: 73 RQGY-LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSL 130 + Y YSP +PQ L + A + + CL+N YA G +S Sbjct: 122 GEAYPRYSPR---------PIPQCLDALRKSTEAATNC---KFNFCLVNYYATGSDSISF 169 Query: 131 HQDKDEPDLRAP-IVSVSLGLPAIFQFGGL---------KRNDPLKRLLLEHGDVVVWGG 180 H D + R P I S SLG F +LLL GD+++ G Sbjct: 170 HSDDERFLGREPAIASFSLGAARDFLMKHKPVPPPPDGQTTVFKQLKLLLASGDMILMKG 229 Query: 181 ESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 +++ + H I AG D R N+TFR+ Sbjct: 230 KTQANWLHSIPKR-AGKSSQYGDGRINITFRR 260 >UniRef50_B6GZZ6 Pc12g09870 protein n=4 Tax=Eurotiomycetidae RepID=B6GZZ6_PENCW Length = 360 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 64/206 (31%), Gaps = 34/206 (16%) Query: 35 RDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQ 94 + F+ ++V L W T Y ++ + P P Sbjct: 152 KSFFADDPTRTFQPKEPDVHKPLTVQNFLEKKLRWVTLGGQYDWTAKVYPSGPPPEFPPD 211 Query: 95 SFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIF 154 L A +P A ++N Y+ G LS+H+D E ++SVS G +F Sbjct: 212 IAKVLR------AAFPATSAQAAILNLYSAGDTLSVHRDVSEEC-DVGLISVSFGCDGLF 264 Query: 155 QFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTI------------- 201 + + + L GD V G+SR +HG+ + P + Sbjct: 265 -LASHDDGNGCEIIRLRSGDTVYMSGKSRFAWHGVPKILPSTCPKWLANWPSFGEPAPGM 323 Query: 202 -------------DCRYNLTFRQAGK 214 R NL RQ Sbjct: 324 APAPYEMWKGWMSSKRINLNVRQMTA 349 >UniRef50_Q4D8T3 Putative uncharacterized protein n=2 Tax=Trypanosoma cruzi RepID=Q4D8T3_TRYCR Length = 426 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 67/224 (29%), Gaps = 34/224 (15%) Query: 9 EPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLG 68 EP +E G I++ F + A S + ++A N H Sbjct: 194 EPIEEV--PGLYIVKEFIT--------QMEHDAVWSELKGPKAAALELETLAHRNVAHFN 243 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDF-------QPDACLINR 121 + G ++ + PA + D+ D +N Sbjct: 244 RRFY-YGVNRIGVEGDSVNAKPAFYDWMQRRLKNEDPRRKLQDYPSVAQTFSCDQLTVNF 302 Query: 122 YAPGAK------LSLHQDKDEPDLRAPIVSVSLGLPAIFQF---GGLKRNDPLKRLLLEH 172 Y ++ H D I VSLG + ++ G + + Sbjct: 303 YNYKDGDTAASGIAHHVDSH-ASFMDCIYIVSLGSHTVLEYNRHGVPPDVAETFGVFVAP 361 Query: 173 GDVVVWGGESRL-FYHGIQPLKAGFH-----PLTIDCRYNLTFR 210 +++ GESR + HGI + P+ R +LT+R Sbjct: 362 RSLLLMTGESRYSWTHGIAGKRVDILSDRIPPVWRGDRVSLTWR 405 >UniRef50_Q6C9X6 YALI0D07546p n=1 Tax=Yarrowia lipolytica RepID=Q6C9X6_YARLI Length = 372 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 76/202 (37%), Gaps = 15/202 (7%) Query: 28 NAAEQLIRDINDVASQSPFRQMVTPGGY---TMSVAMTNCGHLGWTTHRQGYLYSPIDPQ 84 AE L+ + + + + T + + + Y Y+ Sbjct: 117 AEAENLLDQLMEDHNSWKEKTKFYLFDKLCETPHKSAFYTKDMSVYSQ-HTYRYNGKPVD 175 Query: 85 TNKPWPAMPQSFHNLCQRAATAA---GYPDFQPDACLINRYAP-GAKLSLHQDKDEPDLR 140 T++ + ++ + Q + G ++ DAC+ N YA + H D+ Sbjct: 176 TSRKYSKAMEACSDRIQELVRSTNKEGTAEWLSDACIANYYADESQSVGFHSDQLTYIGP 235 Query: 141 AP-IVSVSLGLPAIFQFGG--LKRNDPLKRLLLEHGDVVVWG-GESRLFYHGIQPLKA-- 194 P I +++LG IF+ G + +LL H +++ G + H I P++A Sbjct: 236 RPVIAALTLGSERIFRLKGACPDGDRRTYNILLPHNSLMIMHAGCQEAYKHSIIPVQAKQ 295 Query: 195 -GFHPLTIDCRYNLTFRQAGKK 215 G H R++LTFR K+ Sbjct: 296 IGLHERAGKVRFSLTFRHYKKE 317 >UniRef50_C9SQ13 Isochorismatase family protein family n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SQ13_VERA1 Length = 979 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 72/230 (31%), Gaps = 34/230 (14%) Query: 9 EPWQEPLAAGA-VILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHL 67 EPL G VI+ +L+ + + Q ++ G + + G + Sbjct: 436 PETSEPLCEGDTVIIHNVLPP---KLLDGVFEKLKDEVGWQTMSHQGGEVPRKVAVQGLV 492 Query: 68 GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-A 126 Y + + P ++ + G+P + LI Y G Sbjct: 493 AEDGSMPVYRHPA---DESPPLFPFTKTVLEIKAVVEEKLGHP---LNHVLIQFYRDGND 546 Query: 127 KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKR---------------NDPLKRLLL 170 +S H DK + + IV+VSLG F + R L Sbjct: 547 YISEHSDKTLDIVKGSYIVNVSLGAERTMIFRTKRDAKDPSKKTDNIPEGAKRKTTRKQL 606 Query: 171 EHGDVVVWGGESRL-FYHGIQPLKAGFHPLTI------DCRYNLTFRQAG 213 H + G + + + H I+ K T R +LTFRQ G Sbjct: 607 PHNSLCRMGLVTNMRWLHAIRQDKRAERDKTAPELAFAGGRISLTFRQIG 656 >UniRef50_D2VV84 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VV84_NAEGR Length = 288 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 66/232 (28%), Gaps = 41/232 (17%) Query: 8 AEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSP---------------FRQMVTP 52 P G + + + + + D S F ++ P Sbjct: 54 KNPITGEEIEGLIYVPNYLSHDEGSKLLSTLDNHSWYDSLLSRRIQCYGIHYYFTKLFHP 113 Query: 53 GGYTMSVAMTNCGHLGWTTHRQGYLYSPI-----------DPQTNKPWPAMPQSFHNLCQ 101 G M + L + + + + N P+ + + Sbjct: 114 GLQPMRKSPLLLDELSFVSKPVERDFGINFMKDEEIKGLDEYYENSNDPSSFEWEAPSLE 173 Query: 102 RAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKR 161 + + + CL+ Y ++ H D + IV +SL F Sbjct: 174 QVIDD------RINQCLVQEYYE-QSIAKHVDNVK-VFGKEIVCLSLVNECQMDFESANN 225 Query: 162 NDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 D +L LE +++ E R + HGI+ K R +LTFR Sbjct: 226 PDMKFKLTLEPNSLLIMSKEGRYDWKHGIKRKKR------FPRRISLTFRHL 271 >UniRef50_A9TPJ1 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TPJ1_PHYPA Length = 258 Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats. Identities = 35/268 (13%), Positives = 60/268 (22%), Gaps = 92/268 (34%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 + F A E + VA P ++ + Sbjct: 5 PTIYYVPDFVSAAEEASLVQEVQVA----------PVAKWKTLKNRRLQNWA-------- 46 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 + + +P ++ ++ A + L+N Y PG ++ HQD Sbjct: 47 --GGVVHEKGLISQPIPAWLSSITEKIAKETNLFPAPINHVLVNEYLPGQGITSHQDG-- 102 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKR---------------------------------ND 163 P + +SLG P + F R Sbjct: 103 PVYYPVVAILSLGAPTLMHFTPHVRLTEANNDVDDLFSENPDGLLEPSQKTQGGRDHPVQ 162 Query: 164 PLKRLLLEHGDVVVWGGESRLFY-HGIQPLKAG--------------------------- 195 L+L ++V+ + Y HGI Sbjct: 163 ATCSLVLMPRSLLVFKDSAYTEYLHGIDEAYEDLLNNKVVNLDAYLSHVSSRNDGIAEDS 222 Query: 196 ---------FHPLTIDCRYNLTFRQAGK 214 R +LT R K Sbjct: 223 RLGDVAEEKDMLRRTGTRVSLTCRHVPK 250 >UniRef50_C7YT81 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YT81_NECH7 Length = 905 Score = 104 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 69/206 (33%), Gaps = 29/206 (14%) Query: 31 EQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWP 90 + L+ I + Q ++ G + + G +G Y + + P Sbjct: 440 DTLVDGIFEKLRNEVQWQRMSHQGGEVPRLVAVQGEVGEDGTIPVYRHPS---DESPPLL 496 Query: 91 AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPDLRAPIVSVSL 148 + L G+P + LI Y G +S H DK + + I +VSL Sbjct: 497 PFSPTVQALKAETEKHLGHP---LNHVLIQYYRDGTDYISEHSDKTLDIVQGSYIANVSL 553 Query: 149 GLPAIFQFGGLKRN--------------DPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK 193 G F +R+ ++R L H + G +S + + H I+ K Sbjct: 554 GAERTMVFRTKRRDKDPSRDEASPADLKRKIQRARLPHNSLCRLGLKSNMKWLHAIRQDK 613 Query: 194 AGFHPLTI------DCRYNLTFRQAG 213 T R +LTFRQ G Sbjct: 614 RPKSQKTPAELAYEGGRISLTFRQIG 639 >UniRef50_A4VNR2 DNA repair system protein n=11 Tax=Gammaproteobacteria RepID=A4VNR2_PSEU5 Length = 212 Score = 104 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 44/215 (20%), Positives = 73/215 (33%), Gaps = 21/215 (9%) Query: 6 ADAEPWQEPLAAGAVILRRFAFNA-AEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNC 64 P E A + + A+ ++++ +P+ Q Sbjct: 11 PRHVPSIELPDAALDYSADWLDSDTADAWLQELI---LATPWTQPEIRLYGRQVAVPRLV 67 Query: 65 GHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAP 124 G + + Y YS + W + + QR G+ + + L+N Y Sbjct: 68 AWYGDS--QAAYRYSGLQ-HEPLAWTPL---LQGVRQRLERETGH---RFNGVLLNLYRD 118 Query: 125 G-AKLSLHQDKDEPDLRAPIV-SVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGES 182 G + H D + R P+V S+SLG F LLL HG ++V G + Sbjct: 119 GGDAMGWHSDDEVELGRNPVVASLSLGAERRFDLRRKGSGRIQHSLLLGHGSLLVMSGAT 178 Query: 183 R-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + + H I P R NLTFR + + Sbjct: 179 QHHWQHQIARTSKVSQP-----RLNLTFRLIHRDQ 208 >UniRef50_Q5VPG7 Os06g0138200 protein n=5 Tax=BEP clade RepID=Q5VPG7_ORYSJ Length = 616 Score = 104 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 68/215 (31%), Gaps = 28/215 (13%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G + A ++ I D + + G T + G T + G Sbjct: 204 GLELHCGVFSAAEQKRIVDYVYDLQEMGKHGEL--GDRTYTEPQRWMRGKGRVTIQFGCC 261 Query: 78 YSPIDPQTNKPW--------PAMPQSFHNLCQRAATAAGYPDFQ-PDACLINRYAPGAKL 128 Y+ + P MP F + +R P PD+C++N Y PG + Sbjct: 262 YNYATDKNGNPPGIIRTIASDPMPSLFKIMIKRLVRWHVLPKTCIPDSCIVNIYDPGDCI 321 Query: 129 SLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPL------KRLLLEHGDVVVWGGE- 181 H D D P +VS FG + + L G V++ G Sbjct: 322 PPHIDSH--DFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSLPIPLPVGSVLILNGNG 379 Query: 182 SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 + + H + + R ++TFR+ + Sbjct: 380 ADVAKHCVPAV--------PTKRISITFRKMDPAK 406 >UniRef50_B8NIA9 Putative uncharacterized protein n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8NIA9_ASPFN Length = 316 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 33/115 (28%), Positives = 44/115 (38%), Gaps = 10/115 (8%) Query: 102 RAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGL 159 RA Y D + LIN YA +S H D + P I S+SLG F Sbjct: 170 RAEDVVQYKDNYYNFILINYYATNTDSISYHSDDERFLGPNPSIASLSLGAKRDFLLKHK 229 Query: 160 K--RNDPLKRLLLEHGDVVVWGGESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQ 211 + L GD+VV GE++ + H I R N+TFR+ Sbjct: 230 PGVEAGKPLKFPLASGDMVVMRGETQGNWLHSIPKRA-----GEGGGRINVTFRR 279 >UniRef50_B2ADC0 Predicted CDS Pa_4_670 n=1 Tax=Podospora anserina RepID=B2ADC0_PODAN Length = 1033 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 71/231 (30%), Gaps = 34/231 (14%) Query: 8 AEPWQEPLAAGA-VILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGH 66 EP G ++ L D + + ++ G + + G Sbjct: 519 TSAVSEPSCEGDTHVITNVLSP---TLAADAFERLLEEVSWAGMSHMGGEVPRRIAVQGE 575 Query: 67 LGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 + + Y + + P + + G+P + LI Y G Sbjct: 576 VDKDGNMPVYRHPA---DESPPLLPFSPTVLQIKTAIEKHLGHP---LNHVLIQHYRGGD 629 Query: 127 -KLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLK---------------RLL 169 +S H DK + + I ++SLG F +R R+ Sbjct: 630 DYISEHSDKTLDIVPNSFIANLSLGAERTMVFRTKRRPSKHHQEETSPAEKARRQIQRVP 689 Query: 170 LEHGDVVVWGGES-RLFYHGIQPLKAGFHPLTI------DCRYNLTFRQAG 213 L H ++ G + R + H I+P K + R +LTFRQ G Sbjct: 690 LPHNSLLRMGLSTNRHWLHAIRPDKRPPLSKSPSELSHSGHRISLTFRQIG 740 >UniRef50_UPI00016C4217 Alkylated DNA repair protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4217 Length = 179 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 35/155 (22%), Positives = 55/155 (35%), Gaps = 15/155 (9%) Query: 62 TNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINR 121 T + + Y YS I+ P PQ+ L A G+ P+ CL N Sbjct: 36 TRIKARKSASFGRPYNYSGIE----WPVAPFPQAVAALLAPIAAVIGFE---PNNCLANF 88 Query: 122 YAPG-AKLSLHQDK-DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWG 179 Y G + + H D + + I +SLG + F + + L G ++ Sbjct: 89 YPNGNSSMGFHSDSIVDLEPNTGIAVLSLGTERVITFHRIDNKAVRESYPLSPGTLLWMC 148 Query: 180 GESR-LFYHGIQPLKAGFHPLTIDCRYNLTFRQAG 213 E + + H I D R +LTFR+ Sbjct: 149 PEMQADWRHAILADTGVT-----DGRISLTFRRVK 178 >UniRef50_Q9LJH4 Emb|CAB82748.1 n=1 Tax=Arabidopsis thaliana RepID=Q9LJH4_ARATH Length = 330 Score = 102 bits (254), Expect = 8e-21, Method: Composition-based stats. Identities = 59/252 (23%), Positives = 87/252 (34%), Gaps = 64/252 (25%) Query: 14 PLAAGAVILRRFAF-NAAEQLIRDINDVA-SQSPFRQMVTPGGYTMSVAMTNCGHLGWTT 71 + G V+L+ + N ++ + + F Q G + + M LG Sbjct: 82 VIRPGMVLLKNYLSINNQVMIVNKCRQLGLGEGGFYQPGFQDGGLLHLKMMC---LGKNW 138 Query: 72 HRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPDA 116 Q Y I P P +P F L ++A + P PD Sbjct: 139 DCQTRRYGEIRPIDGSVPPRIPVEFSQLVEKAIKESKSLVATNSNETKGGDEIPLLLPDI 198 Query: 117 CLINRYAPGAKLSLH---------------------------------QDKDE----PDL 139 C++N Y KL LH QDK E Sbjct: 199 CVVNFYTSTGKLGLHQVVTSIQYRKNCSSLYYCSFKFLHHQLIESFMAQDKGESKKSLRK 258 Query: 140 RAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPL 199 PIVS S+G A F +G K D L+LE GDV+++G SR +HG++ ++ P Sbjct: 259 GLPIVSFSIGDSAEFLYGDQKDVDKADTLILESGDVLIFGERSRNVFHGVRSIRKILPPR 318 Query: 200 TIDCRYNLTFRQ 211 L FR+ Sbjct: 319 -------LFFRK 323 >UniRef50_A9UX55 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UX55_MONBE Length = 180 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 51/170 (30%), Gaps = 23/170 (13%) Query: 56 TMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPD 115 V G Y + W L + G ++ + Sbjct: 4 KEHVPPRRIAAHGDAGVNYTYSHKT---MLAAEWTP---QLRQLKEYVEQTTG---YEYN 54 Query: 116 ACLINRYAPG-AKLSLHQDKD-EPDLRAPIVSVSLGLPAIFQFGGLKRNDP--------L 165 LINRYA G + HQD + E D PI S++LG F Sbjct: 55 FVLINRYADGRDTIGEHQDNESELDPDVPIASLTLGATRDFVLRHRDVRRKCGAHSKLNP 114 Query: 166 KRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 L L G ++ + +YH + P R NLTFR+ Sbjct: 115 HTLPLPSGLLLTMEAPTNKCWYHSVPRRSLARCP---GPRINLTFRRIRP 161 >UniRef50_D0NI62 Alkylated DNA repair protein alkB n=2 Tax=Phytophthora infestans T30-4 RepID=D0NI62_PHYIN Length = 231 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 63/223 (28%), Gaps = 48/223 (21%) Query: 9 EPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLG 68 E +++ +G + + E I + R P + V H Sbjct: 35 EEFRKGPISGVYYIPNWITQDEEAAIVE----------RVYAVPHDNDLWVK---LKHRR 81 Query: 69 WTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGY-PDFQPDACLINRYAPGAK 127 + + +P+ + Q + + +P+ LIN Y G Sbjct: 82 LQMWGG-------EVKVPFEPNPLPEWLQQISQTLLDTGIFSEEKKPNHALINEYGVGDC 134 Query: 128 LSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK-----------------RNDPLKRLLL 170 + H+D P + +S G F + L Sbjct: 135 ILPHEDG--PAYFPLVSIISTGAECRVTFEPHRALASVDNQSVSEAAPTNEIVQNFDFQL 192 Query: 171 EHGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQA 212 E ++++ GE+ Y H + ++ R +LT R Sbjct: 193 ERRSLLLFTGEAYTRYLHSVDNIEV-------GTRISLTIRHV 228 >UniRef50_Q1ECQ5 At4g02485 n=2 Tax=Arabidopsis thaliana RepID=Q1ECQ5_ARATH Length = 226 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 65/205 (31%), Gaps = 33/205 (16%) Query: 18 GAVILRRFAFNA-AEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G + R F A L+ I + + T L T Sbjct: 46 GLWLYRNFLSIAHQSHLLSAILNEGWFVEESINQAMRFGDLPSWATELSDLIRETL---- 101 Query: 77 LYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDE 136 + P + + + D ++N Y PG + H D Sbjct: 102 --------ESVDLPVLSADLL-----------WREPLFDQLIVNLYQPGEGICAHVDL-- 140 Query: 137 PDLRAPIVSVSLGLPAIFQFGGLKRND-PLKRLLLEHGDVVVWGGESRL-FYHGIQPLKA 194 I VSL P + +F ++N+ +LL G +++ GE+R + H I + Sbjct: 141 LRFEDGIAIVSLESPCVMRFSPAEKNEYEAVDVLLNPGSLILMSGEARYRWKHEINRKQN 200 Query: 195 GFHPLTID-----CRYNLTFRQAGK 214 GF + R ++T R+ + Sbjct: 201 GFQLWEGEEIDQKRRISITLRKLCQ 225 >UniRef50_A9UYL7 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UYL7_MONBE Length = 333 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 64/251 (25%), Gaps = 62/251 (24%) Query: 8 AEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHL 67 P + L G + L A E + + + + + Sbjct: 99 TMPTEPQLPPGLLYLEDAIDAATEDQLVALLNEDTTP-----------WQVLRQRQVAQW 147 Query: 68 GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLC---QRAATAAGYPDFQPDACLINRYAP 124 G Y Y MP QR A G +P+ NRY P Sbjct: 148 GV-----AYDYDAFTAHPITA--NMPALLLQQAGEMQRLAQQHGLATARPNQLTANRYQP 200 Query: 125 GAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDP-------------------- 164 G ++ H D + +SL F L +P Sbjct: 201 GDGIAAHVD-SPVAFGPIVWILSLEGGLAMDFAPLTPAEPAPDPSRTEGEGDSKVPAAFG 259 Query: 165 ----------------LKRLLLEHGDVVVWGGESRL-FYHGIQPLKAGFHP---LTIDCR 204 + L L V++ E+R + H I P K L R Sbjct: 260 RNEGKALQDRHRDLTATQTLYLRPRSVLILADEARYNWSHQIAPRKTDLVQGIRLPRRQR 319 Query: 205 YNLTFRQAGKK 215 +LT+R K Sbjct: 320 TSLTYRYVPPK 330 >UniRef50_Q8YKL5 All7279 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YKL5_ANASP Length = 141 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 35/144 (24%), Positives = 46/144 (31%), Gaps = 16/144 (11%) Query: 70 TTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKL 128 YLYS W L + A G + + N+Y G + Sbjct: 4 WGEGCDYLYSNSVLLKPLAWTE---PLAKLRDKITAATG---YSFRIVIGNQYRSGQDSI 57 Query: 129 SLHQDKDEPDLRAP-IVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW-GGESRLFY 186 H DK+ P I S+SLG FQ + LEHG ++V G Sbjct: 58 GWHADKESSMGIEPAIASISLGSARKFQLKPI--GGKPTDFWLEHGSLLVMLPGCQTTHV 115 Query: 187 HGIQPLKAGFHPLTIDCRYNLTFR 210 H + R NLTFR Sbjct: 116 HQVPKTTKFVT-----TRINLTFR 134 >UniRef50_D2VTV7 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VTV7_NAEGR Length = 251 Score = 101 bits (252), Expect = 1e-20, Method: Composition-based stats. Identities = 40/162 (24%), Positives = 63/162 (38%), Gaps = 30/162 (18%) Query: 69 WTTHRQGYLYSPIDPQTN---------KPWPAMPQSFHNLCQRAATAAGYPDFQPDACLI 119 W T + Y Y D + W +P+ +L +R + C I Sbjct: 99 WKT--KTYNYGGKDVISPREFCGISSQSEWSKIPE-LISLKERIEKFTNH---TFTYCFI 152 Query: 120 NRYAPG-AKLSLHQDKDEPDL-RAPIVSVSLGLPAIFQFGGL-------KRNDPLKRLLL 170 N+Y G + H DK++ PIVS+SLG FQF +++ + L Sbjct: 153 NKYKDGNDSIYWHSDKEKGLKKGCPIVSISLGQERDFQFRPKISKNSKQQKDGNIIEKNL 212 Query: 171 EHGDVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQ 211 G +VV E++ +Y H + + + R NLTFR Sbjct: 213 PDGSMVVMNYETQEYYEHSLPKRR-----NIHNIRLNLTFRN 249 >UniRef50_D2VCK4 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2VCK4_NAEGR Length = 226 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 59/210 (28%), Gaps = 42/210 (20%) Query: 18 GAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYL 77 G +++ + + + + + + + Sbjct: 7 GLYLIKDILTPEEASSLVENLE---------------KQLWFLNRS-KKRRIQIYGPKHD 50 Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAG--------------YPDFQPDACLINRYA 123 +PQ +LC D + +N Y Sbjct: 51 KKYNIIPDE--ITPLPQFLKDLCTTILERTATHVPKIDLSQYESYLGDDKFTEIFVNEYK 108 Query: 124 PGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESR 183 P L H D + I +SL + F F + D ++ L + + G SR Sbjct: 109 PDDSLDQHFD-HRKTYKEIIFGLSLECDSTFTF---TKKDIKHQVKLPARSLYLMTGSSR 164 Query: 184 L-FYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 F HGI+P + L D R +LTFR Sbjct: 165 KSFKHGIEP-----NLLEGDRRISLTFRTV 189 >UniRef50_Q4PIA7 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PIA7_USTMA Length = 421 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 48/224 (21%), Positives = 74/224 (33%), Gaps = 27/224 (12%) Query: 5 FADAEPWQEPLAAGAVILRRFAFN--AAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMT 62 F A E A F A E + + + + Sbjct: 213 FCSAFRLDELPDAEVYYQPDFIDRSLAEEWR----SQLDRLPEWYRPKLKVYGREITQSR 268 Query: 63 NCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRY 122 T YS + + P+P + +L + + + + C++NRY Sbjct: 269 EIAAYA-TAPGLHLKYSGHPVELHSPFPPLLDHIASLIS--SDECLGKEVRFNHCMLNRY 325 Query: 123 APGA-KLSLHQDKDEPDLRAPIVSVSLGLPA--IFQFGGLKRNDPL------KRLLLEHG 173 G+ + H D E IV+VSLG I Q KR + KR L G Sbjct: 326 DDGSVYIGRHSDNIE---NKVIVTVSLGADRSWIMQRKATKRAEAHGKEQTRKRWTLAGG 382 Query: 174 DVVVWGGESRLFY-HGIQPLKAGFHPLTIDCRYNLTFRQAGKKE 216 ++V G+++ FY H I P R ++TFRQ E Sbjct: 383 SLLVMQGQTQKFYTHEIPKELKVKGP-----RISITFRQLVYDE 421 >UniRef50_C1FG54 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FG54_9CHLO Length = 684 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 45/234 (19%), Positives = 69/234 (29%), Gaps = 44/234 (18%) Query: 8 AEPWQEPLAAGAVILRRFA----FNAAEQLIRDIND-------VASQSPFRQMVTPGGYT 56 L +G ++ + ++ P + Sbjct: 41 TTEDSPTLVSGLRLVEGIVDTSGSPSEHDILVAWIRGTLERGRAGELPGNTYAPIPEKWR 100 Query: 57 MSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQR-AATAAGYPDFQPD 115 G TH +T+ P +P + A A +PD Sbjct: 101 KRNQSREMLQFGTYTH-------SNRVETHVPVAPLPPELDAVVDALIARGALTELQRPD 153 Query: 116 ACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQF--------------GGLKR 161 +C IN Y PG + H D P P V+VSL G +R Sbjct: 154 SCTINLYGPGQWIPPHIDN--PAFDRPFVTVSLCSEQPMVLGRGMVWPEGGRGPCGDDER 211 Query: 162 NDPLKRLLLEHGDVVVWGGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGK 214 + L L G VV GE + + H + P+ A R +LTFR+ G+ Sbjct: 212 LNEEHALSLPVGSAVVVEGEAADEYEHAVPPVTA--------ERISLTFRRRGR 257 >UniRef50_C5FMW8 Isochorismatase family protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FMW8_NANOT Length = 825 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 32/150 (21%), Positives = 53/150 (35%), Gaps = 23/150 (15%) Query: 84 QTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPDLRA 141 + P + + + G+P + LI Y G ++S H DK + + Sbjct: 407 DESPPLSPFSATVNEIRAVVEKQLGHP---LNHVLIQLYRDGEDRISEHSDKTLDIVRGS 463 Query: 142 PIVSVSLGLPAIFQFGGLKRN-----------DPLKRLLLEHGDVVVWGGESRL-FYHGI 189 I +VSLG +R+ + H + + G +S + + HGI Sbjct: 464 SICNVSLGALRSMTLRTKADTKGSTGTLGENARQTQRVPMPHNSLFILGEKSNMEWLHGI 523 Query: 190 QPLKAGFHPLTI------DCRYNLTFRQAG 213 +P K T R +LTFR G Sbjct: 524 RPDKRQEKEKTRDERAFNGERISLTFRHIG 553 >UniRef50_B2AVQ7 Predicted CDS Pa_7_2120 n=8 Tax=Leotiomyceta RepID=B2AVQ7_PODAN Length = 260 Score = 100 bits (250), Expect = 2e-20, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 72/235 (30%), Gaps = 49/235 (20%) Query: 13 EPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMS---VAMTNCGHL-- 67 + L + F E+ I D + A ++ +RQ+ T V T Sbjct: 27 KALPPACYYISDFITEEEEKAILDKVNTAPKARWRQLTHRRLQTWPSDLVKNTLLDARPL 86 Query: 68 -GWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGA 126 W I ++P S P +P+ LIN Y P Sbjct: 87 PDWLEQPVVARLLSIPLSDSQPENMFHDS--------------PHQRPNHVLINEYPPNT 132 Query: 127 KLSLHQDKDEPDLRAPIVSVSLGLP-AIFQFGGLKRN----DPLKRLLLEHGDVVVWGGE 181 + H+D + +VSLG + + + +P+ R+L E +++ + Sbjct: 133 GIMPHKDG--GAYHPVVCTVSLGSALCLNLYKSKEDGALDLEPVWRILQEPRSLLITTAD 190 Query: 182 SRLFY-HGIQPLKAGFHPLT---------------------IDCRYNLTFRQAGK 214 Y HGI+P+ R +LT+R K Sbjct: 191 LYTEYLHGIEPIATDVELSEKTIVNWDLLRSPELYKDGLNQRRTRISLTYRDVLK 245 >UniRef50_B9SA55 Oxidoreductase, putative n=1 Tax=Ricinus communis RepID=B9SA55_RICCO Length = 253 Score = 100 bits (250), Expect = 2e-20, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 50/160 (31%), Gaps = 34/160 (21%) Query: 78 YSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKDE 136 YS P W P ++ + A P + ++ L+NRY G + H D ++ Sbjct: 102 YSGYKPHVYS-WDDYPP-LKDILEAVHRA--LPGSRFNSLLLNRYKGGNDNVGWHADDEK 157 Query: 137 PDLRAP-IVSVSLGLPAIFQFGGLKRNDP----------------------LKRLLLEHG 173 P I SVS G F + L+HG Sbjct: 158 LYGPTPEIASVSFGCEREFLLKKRQSKSKAAERRCDDEPDRKRLKKSSHVDQHSFTLKHG 217 Query: 174 DVVVWGGE-SRLFYHGIQPLKAGFHPLTIDCRYNLTFRQA 212 ++V G R + H + R NLTFR Sbjct: 218 SLLVMKGNTQRDWLHSLPKRAKAEA-----TRINLTFRHV 252 >UniRef50_B6JV21 2 OG-Fe(II) oxygenase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JV21_SCHJY Length = 251 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 55/205 (26%), Gaps = 40/205 (19%) Query: 17 AGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGY 76 G + + + + Sbjct: 79 PGLTYIPNVLPPSVQDELIQCLPDGILDRS------------------------------ 108 Query: 77 LYSPIDPQTNKPWPAMPQSFHN-LCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKD 135 + +P+ + Q+ A + +A ++ Y PG + H D Sbjct: 109 --TGRQAHLYRPFAPVLQNLMQNFIPSAFKKQVWEGKDAEAIIMQVYNPGDGIIPHVDL- 165 Query: 136 EPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRL-FYHGIQPLK- 193 P IV SL +F +LLE G + + GE+R + HGI Sbjct: 166 -PMFDDGIVIFSLLSDITMEFTQPSSKRKA-SVLLEKGSLTIMEGEARYQWLHGIPFRTG 223 Query: 194 --AGFHPLTIDCRYNLTFRQAGKKE 216 + R ++T R+ G + Sbjct: 224 DWTNGTWIPRAQRCSITMRRIGLNK 248 >UniRef50_UPI000023F682 hypothetical protein FG09872.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023F682 Length = 927 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 67/207 (32%), Gaps = 30/207 (14%) Query: 31 EQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWP 90 + L I D Q ++ G + + G +G Y + + P Sbjct: 474 DSLADGIFDKLRGEIQWQHMSHQGGQVPRLVAVQGEVGSDGSIPVYRHPS---DESPPLL 530 Query: 91 AMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPG-AKLSLHQDKD-EPDLRAPIVSVSL 148 + G+P + LI Y G +S H DK + + I +VSL Sbjct: 531 PFSPIVQAIRTETEKHLGHP---LNHVLIQYYRDGTDYISEHSDKTLDIVKGSYIANVSL 587 Query: 149 GLPAIFQFGGLKRN---------------DPLKRLLLEHGDVVVWGGESRL-FYHGIQPL 192 G +F F +++ ++R L H + G S + + H I+ Sbjct: 588 GAERVFTFRTKRKDKDAAQDEASSPIDSKRTIQRAKLPHNSLCRLGLCSNMKWLHAIRQD 647 Query: 193 KAGFHPLTI------DCRYNLTFRQAG 213 K + R +LTFR G Sbjct: 648 KRPKKEKSEAELAFEGGRISLTFRHIG 674 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.132 0.387 Lambda K H 0.267 0.0409 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,320,801,455 Number of Sequences: 3077464 Number of extensions: 54628062 Number of successful extensions: 133506 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 433 Number of HSP's successfully gapped in prelim test: 636 Number of HSP's that attempted gapping in prelim test: 130755 Number of HSP's gapped (non-prelim): 1211 length of query: 216 length of database: 1,040,396,356 effective HSP length: 123 effective length of query: 93 effective length of database: 661,868,284 effective search space: 61553750412 effective search space used: 61553750412 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 90 (39.3 bits)