BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (177 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

UniRef50_P0A8D8 UPF0189 protein ymdB n=66 Tax=Proteobacteria Rep...  363  1e-99
UniRef50_P67342 UPF0189 protein ymdB n=62 Tax=Bacteria RepID=YMD...  284  1e-75
UniRef50_A4W960 Appr-1-p processing domain protein n=5 Tax=Bacte...  283  2e-75
UniRef50_Q8EYT0 UPF0189 protein LA_4133 n=9 Tax=cellular organis...  214  1e-54
UniRef50_C7NE27 Appr-1-p processing domain protein n=2 Tax=Lepto...  194  1e-48
UniRef50_Q8Q0F9 UPF0189 protein MM_0177 n=18 Tax=cellular organi...  188  6e-47
UniRef50_D1N530 Appr-1-p processing domain protein n=4 Tax=cellu...  186  2e-46
UniRef50_D1AKA5 Appr-1-p processing domain protein n=2 Tax=Bacte...  186  3e-46
UniRef50_Q8RB30 UPF0189 protein TTE0995 n=12 Tax=Bacteria RepID=...  184  2e-45
UniRef50_C1PFE0 Appr-1-p processing domain protein n=5 Tax=Bacte...
180 1e-44 UniRef50_Q8TQD0 UPF0189 protein MA_1614 n=2 Tax=cellular organis... 177 2e-43 UniRef50_Q93SX7 UPF0189 protein n=2 Tax=Acinetobacter RepID=Y189... 177 2e-43 UniRef50_C6RT62 Appr-1-p processing n=2 Tax=Acinetobacter radior... 174 8e-43 UniRef50_Q71W03 UPF0189 protein LMOf2365_2748 n=23 Tax=Bacteria ... 174 2e-42 UniRef50_Q8KAE4 UPF0189 protein CT2219 n=7 Tax=Bacteria RepID=Y2... 173 2e-42 UniRef50_C7RS37 Appr-1-p processing domain protein n=15 Tax=cell... 171 7e-42 UniRef50_Q8PHB6 UPF0189 protein XAC3343 n=14 Tax=Proteobacteria ... 171 8e-42 UniRef50_Q3AEI4 Putative uncharacterized protein n=6 Tax=Bacteri... 171 1e-41 UniRef50_C8NAC1 RNase III regulator YmdB n=3 Tax=cellular organi... 168 6e-41 UniRef50_Q9HXU7 UPF0189 protein PA3693 n=16 Tax=Bacteria RepID=Y... 167 1e-40 UniRef50_Q985D2 UPF0189 protein mll7730 n=12 Tax=Bacteria RepID=... 163 3e-39 UniRef50_A5GC80 Appr-1-p processing domain protein n=2 Tax=Desul... 161 9e-39 UniRef50_Q8Y2K1 UPF0189 protein RSc0334 n=39 Tax=cellular organi... 159 4e-38 UniRef50_Q6PHJ5 Zgc:65960 n=11 Tax=cellular organisms RepID=Q6PH... 159 5e-38 UniRef50_A1Z1Q3 MACRO domain-containing protein 2 n=55 Tax=cellu... 158 6e-38 UniRef50_A8ZUR5 Appr-1-p processing domain protein n=3 Tax=cellu... 158 6e-38 UniRef50_C6BB95 Appr-1-p processing domain protein n=4 Tax=cellu... 158 8e-38 UniRef50_B7C8M6 Putative uncharacterized protein n=3 Tax=Bacteri... 157 2e-37 UniRef50_C1H4Y3 MACRO domain-containing protein n=4 Tax=cellular... 155 8e-37 UniRef50_Q8EP31 Hypothetical conserved protein n=1 Tax=Oceanobac... 154 2e-36 UniRef50_Q047N9 Predicted phosphatase, histone macroH2A1 family ... 153 2e-36 UniRef50_Q9EYI6 UPF0189 protein in sno 5'region n=22 Tax=Bacteri... 150 1e-35 UniRef50_Q9WYX8 UPF0189 protein TM_0508 n=15 Tax=cellular organi... 149 3e-35 UniRef50_A7IGI6 Appr-1-p processing domain protein n=53 Tax=cell... 149 3e-35 UniRef50_A2SS36 Appr-1-p processing domain protein n=26 Tax=cell... 
149 3e-35 UniRef50_Q0UQZ6 Putative uncharacterized protein n=2 Tax=Leotiom... 148 9e-35 UniRef50_Q0CQJ0 Protein LRP16 n=10 Tax=cellular organisms RepID=... 145 5e-34 UniRef50_D1U7C0 Appr-1-p processing domain protein n=1 Tax=Desul... 145 7e-34 UniRef50_D1ZDH8 Whole genome shotgun sequence assembly, scaffold... 145 8e-34 UniRef50_Q9BQ69 MACRO domain-containing protein 1 n=11 Tax=Tetra... 144 1e-33 UniRef50_A0L536 Appr-1-p processing domain protein n=1 Tax=Magne... 143 2e-33 UniRef50_C8WYT5 Appr-1-p processing domain protein n=1 Tax=Desul... 143 3e-33 UniRef50_B2JCA0 Appr-1-p processing domain protein n=13 Tax=Prot... 143 3e-33 UniRef50_Q8K4G6 MACRO domain-containing protein 1 (Fragment) n=5... 142 4e-33 UniRef50_Q30ZH6 Appr-1-p processing n=1 Tax=Desulfovibrio desulf... 142 4e-33 UniRef50_Q87JZ5 UPF0189 protein VPA0103 n=5 Tax=Proteobacteria R... 142 4e-33 UniRef50_A4R3Q9 Putative uncharacterized protein n=1 Tax=Magnapo... 142 5e-33 UniRef50_B8HYS5 Appr-1-p processing domain protein n=2 Tax=Cyano... 142 6e-33 UniRef50_Q66HV6 Zgc:92353 n=1 Tax=Danio rerio RepID=Q66HV6_DANRE 140 1e-32 UniRef50_Q1R0S7 Appr-1-p processing n=12 Tax=Proteobacteria RepI... 140 2e-32 UniRef50_Q9HJ67 UPF0189 protein Ta1105 n=1 Tax=Thermoplasma acid... 140 2e-32 UniRef50_A7RJ44 Predicted protein (Fragment) n=4 Tax=Eukaryota R... 140 2e-32 UniRef50_C1BR35 MACRO domain-containing protein 1 n=2 Tax=Caligu... 139 3e-32 UniRef50_B9S4E3 Protein LRP16, putative n=2 Tax=cellular organis... 139 4e-32 UniRef50_Q2LUU1 Appr-1-p histone processing protein n=5 Tax=Bact... 139 5e-32 UniRef50_B6KFB3 Appr-1-p processing enzyme family domain-contain... 139 5e-32 UniRef50_Q47EQ7 Appr-1-p processing n=1 Tax=Dechloromonas aromat... 138 7e-32 UniRef50_B8DKL2 Appr-1-p processing domain protein n=3 Tax=Desul... 137 1e-31 UniRef50_B6Q324 LRP16 family protein n=3 Tax=Trichocomaceae RepI... 136 2e-31 UniRef50_Q03IQ8 Predicted phosphatase homologous to the C-termin... 
136 3e-31 UniRef50_Q1HPZ5 LRP16 protein n=1 Tax=Bombyx mori RepID=Q1HPZ5_B... 136 3e-31 UniRef50_UPI000186F16D conserved hypothetical protein n=1 Tax=Pe... 136 4e-31 UniRef50_A8M6L5 Appr-1-p processing domain protein n=2 Tax=Micro... 135 6e-31 UniRef50_B9XAD9 Appr-1-p processing domain protein n=1 Tax=bacte... 134 1e-30 UniRef50_B7PF53 MACRO domain-containing protein, putative n=2 Ta... 134 1e-30 UniRef50_Q4P1I0 Putative uncharacterized protein n=1 Tax=Ustilag... 134 1e-30 UniRef50_Q0B030 Phosphatase n=1 Tax=Syntrophomonas wolfei subsp.... 134 1e-30 UniRef50_B9YC00 Putative uncharacterized protein n=1 Tax=Holdema... 134 2e-30 UniRef50_Q97AU0 UPF0189 protein TV0719 n=2 Tax=cellular organism... 133 2e-30 UniRef50_A6GJ81 Putative uncharacterized protein n=1 Tax=Plesioc... 133 3e-30 UniRef50_B6SKT6 Protein LRP16 n=12 Tax=cellular organisms RepID=... 132 4e-30 UniRef50_A6BCW6 Putative uncharacterized protein n=5 Tax=Bacteri... 132 4e-30 UniRef50_C2DZH9 Appr-1-p processing protein n=4 Tax=Lactobacillu... 132 6e-30 UniRef50_C4V1Q4 Appr-1-p processing domain protein n=3 Tax=Bacte... 131 8e-30 UniRef50_C8VIG2 LRP16 family protein (AFU_orthologue; AFUA_3G138... 131 9e-30 UniRef50_C2KRZ5 Appr-1-p processing domain protein n=2 Tax=Mobil... 131 1e-29 UniRef50_B8LP86 Putative uncharacterized protein n=1 Tax=Picea s... 131 1e-29 UniRef50_C4V152 Appr-1-p processing protein n=2 Tax=Clostridiale... 131 1e-29 UniRef50_A0LGZ1 Appr-1-p processing domain protein n=1 Tax=Syntr... 130 2e-29 UniRef50_C4Q6S1 Expressed protein n=1 Tax=Schistosoma mansoni Re... 130 2e-29 UniRef50_B9MLL8 Appr-1-p processing domain protein n=6 Tax=Clost... 130 2e-29 UniRef50_B2ACK5 Predicted CDS Pa_3_1270 n=5 Tax=Eukaryota RepID=... 130 2e-29 UniRef50_Q5KCD7 Putative uncharacterized protein n=1 Tax=Filobas... 128 8e-29 UniRef50_C4DDL7 Predicted phosphatase similar to C-terminal doma... 128 8e-29 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 
127 1e-28 UniRef50_C0PSL1 Putative uncharacterized protein n=1 Tax=Picea s... 127 2e-28 UniRef50_D2S4L6 Appr-1-p processing domain protein n=4 Tax=Actin... 126 2e-28 UniRef50_D1VVA5 Putative uncharacterized protein n=1 Tax=Peptoni... 126 3e-28 UniRef50_UPI000050FFC7 predicted phosphatase, C-terminal domain ... 126 3e-28 UniRef50_C4FEN5 Putative uncharacterized protein n=1 Tax=Bifidob... 125 8e-28 UniRef50_A8JCH3 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 124 1e-27 UniRef50_A8FSV2 Putative uncharacterized protein n=1 Tax=Shewane... 123 2e-27 UniRef50_A9SRF5 Predicted protein n=1 Tax=Physcomitrella patens ... 123 3e-27 UniRef50_A1D5K4 Appr-1-p processing enzyme family protein n=1 Ta... 123 3e-27 UniRef50_A7BY23 Putative uncharacterized protein n=3 Tax=Beggiat... 123 3e-27 UniRef50_B8I4Z8 Appr-1-p processing domain protein n=7 Tax=Bacte... 122 5e-27 UniRef50_Q93RG0 UPF0189 protein in tap1-dppD intergenic region n... 122 8e-27 UniRef50_C7GZB8 Appr-1-p processing enzyme family domain protein... 121 9e-27 UniRef50_C7N880 Predicted phosphatase, C-terminal domain of hist... 121 1e-26 UniRef50_Q17432 Protein B0035.3, confirmed by transcript evidenc... 121 1e-26 UniRef50_B0EF86 MACRO domain-containing protein, putative n=2 Ta... 120 1e-26 UniRef50_C7H575 RNase III regulator YmdB n=2 Tax=Faecalibacteriu... 119 4e-26 UniRef50_C5C222 Appr-1-p processing domain protein n=2 Tax=Actin... 119 5e-26 UniRef50_B7C850 Putative uncharacterized protein n=1 Tax=Eubacte... 117 2e-25 UniRef50_A7B8S3 Putative uncharacterized protein n=1 Tax=Actinom... 116 4e-25 UniRef50_B5YAF3 Conserved protein n=2 Tax=Dictyoglomus RepID=B5Y... 115 7e-25 UniRef50_A5ZAB5 Putative uncharacterized protein n=4 Tax=Clostri... 114 1e-24 UniRef50_C2L199 Putative uncharacterized protein n=1 Tax=Oribact... 114 1e-24 UniRef50_A4TAV6 Appr-1-p processing domain protein n=6 Tax=Actin... 114 2e-24 UniRef50_C9LYS3 Appr-1-p processing enzyme family domain protein... 
113 3e-24 UniRef50_A9WK70 Appr-1-p processing domain protein n=3 Tax=Chlor... 112 5e-24 UniRef50_A7EET2 Putative uncharacterized protein n=1 Tax=Sclerot... 112 5e-24 UniRef50_Q4DSL4 Putative uncharacterized protein n=4 Tax=Trypano... 112 8e-24 UniRef50_C4M8N0 Putative uncharacterized protein n=2 Tax=Entamoe... 111 8e-24 UniRef50_UPI000196AD9C hypothetical protein CATMIT_00588 n=1 Tax... 111 9e-24 UniRef50_C9KLM2 Appr-1-p processing enzyme family domain protein... 111 1e-23 UniRef50_D2V337 Predicted protein (Fragment) n=1 Tax=Naegleria g... 110 2e-23 UniRef50_C7HUZ2 RNase III regulator YmdB n=2 Tax=Anaerococcus Re... 110 2e-23 UniRef50_C2LSS3 Protein in Tap1-dppD intergenic region n=1 Tax=S... 110 2e-23 UniRef50_P67344 UPF0189 protein SA0314 n=54 Tax=Staphylococcus R... 110 3e-23 UniRef50_A6LTB5 Appr-1-p processing domain protein n=3 Tax=Clost... 109 3e-23 UniRef50_A2FMC7 Appr-1-p processing enzyme family protein n=1 Ta... 109 4e-23 UniRef50_B0EH33 Putative uncharacterized protein n=2 Tax=Entamoe... 109 5e-23 UniRef50_C4G1S1 Putative uncharacterized protein n=3 Tax=Abiotro... 108 9e-23 UniRef50_C8NG26 Appr-1-p processing enzyme family domain protein... 108 1e-22 UniRef50_B0A8R6 Putative uncharacterized protein n=3 Tax=Bacteri... 107 1e-22 UniRef50_A8STD9 Putative uncharacterized protein n=1 Tax=Coproco... 107 2e-22 UniRef50_C1QBX0 Predicted phosphatase similar to C-terminal doma... 107 2e-22 UniRef50_Q5XC09 UPF0189 protein M6_Spy0919 n=20 Tax=Streptococcu... 107 3e-22 UniRef50_C9XM94 Putative uncharacterized protein n=6 Tax=Clostri... 106 3e-22 UniRef50_UPI0001B4DEB3 hypothetical protein ShygA5_39675 n=1 Tax... 105 7e-22 UniRef50_D1BM15 Appr-1-p processing domain protein n=15 Tax=Bact... 105 8e-22 UniRef50_Q460N5 Poly [ADP-ribose] polymerase 14 n=19 Tax=Eutheri... 105 9e-22 UniRef50_A1WVH3 Appr-1-p processing domain protein n=14 Tax=Bact... 104 1e-21 UniRef50_A4YFR3 Appr-1-p processing domain protein n=9 Tax=Therm... 
104 1e-21 UniRef50_A7T167 Protein GDAP2 homolog n=1 Tax=Nematostella vecte... 104 1e-21 UniRef50_A0Q2I9 Appr-1-p processing enzyme family protein n=3 Ta... 103 2e-21 UniRef50_UPI0000E80997 PREDICTED: similar to Poly [ADP-ribose] p... 103 3e-21 UniRef50_A7C4X9 Putative uncharacterized protein n=1 Tax=Beggiat... 103 4e-21 UniRef50_A6SR30 Putative uncharacterized protein n=1 Tax=Botryot... 102 6e-21 UniRef50_UPI00006A2284 UPI00006A2284 related cluster n=1 Tax=Xen... 102 7e-21 UniRef50_UPI0000ECB76F Poly [ADP-ribose] polymerase 14 (EC 2.4.2... 101 8e-21 UniRef50_A7S3X0 Predicted protein (Fragment) n=1 Tax=Nematostell... 101 9e-21 UniRef50_B9WC14 Putative uncharacterized protein n=5 Tax=Candida... 101 9e-21 UniRef50_B7CC50 Putative uncharacterized protein n=1 Tax=Eubacte... 101 1e-20 UniRef50_C2D2Z2 Appr-1-p processing enzyme family domain protein... 101 1e-20 UniRef50_C9RQW9 Appr-1-p processing domain protein n=5 Tax=Bacte... 101 1e-20 UniRef50_D2V113 Appr-1-p domain-containing protein n=1 Tax=Naegl... 100 2e-20 UniRef50_C3Y6H9 Putative uncharacterized protein n=1 Tax=Branchi... 100 3e-20 UniRef50_Q2ITR2 Appr-1-p processing n=1 Tax=Rhodopseudomonas pal... 100 4e-20 UniRef50_Q22CT8 Appr-1-p processing enzyme family protein n=1 Ta... 100 4e-20 UniRef50_C4FT52 Putative uncharacterized protein n=1 Tax=Catonel... 100 4e-20 UniRef50_A8H4N3 Appr-1-p processing domain protein n=1 Tax=Shewa... 99 6e-20 UniRef50_C5CIT5 Appr-1-p processing domain protein n=1 Tax=Kosmo... 99 8e-20 UniRef50_B1KG04 Appr-1-p processing domain protein n=1 Tax=Shewa... 99 8e-20 UniRef50_Q8B4N1 ORF-1 n=7 Tax=Infectious spleen and kidney necro... 99 9e-20 UniRef50_C3Y5X0 Putative uncharacterized protein n=3 Tax=Branchi... 98 1e-19 UniRef50_C5VD03 Appr-1-p processing enzyme family protein n=2 Ta... 97 2e-19 UniRef50_A0CX10 Chromosome undetermined scaffold_3, whole genome... 97 2e-19 UniRef50_A8FQZ3 Putative uncharacterized protein n=1 Tax=Shewane... 
97 2e-19 UniRef50_C1SPD7 Predicted phosphatase similar to C-terminal doma... 97 2e-19 UniRef50_UPI000194CBCB PREDICTED: poly (ADP-ribose) polymerase f... 97 3e-19 UniRef50_UPI0000E4815A PREDICTED: similar to LRP16 protein n=1 T... 96 4e-19 UniRef50_D0MWM6 Putative uncharacterized protein n=1 Tax=Phytoph... 95 1e-18 UniRef50_UPI000194CBC9 PREDICTED: similar to B aggressive lympho... 95 1e-18 UniRef50_A3LYE6 Putative uncharacterized protein n=1 Tax=Pichia ... 95 1e-18 UniRef50_UPI0001C38755 appr-1-p processing domain-containing pro... 95 1e-18 UniRef50_UPI0000E4D641 UPI0000E4D641 related cluster n=2 Tax=Dan... 94 1e-18 UniRef50_C9YUB3 Putative uncharacterized protein n=1 Tax=Strepto... 93 4e-18 UniRef50_D1R847 Putative uncharacterized protein n=1 Tax=Parachl... 93 4e-18 UniRef50_UPI000196CD43 hypothetical protein CATMIT_02190 n=1 Tax... 93 5e-18 UniRef50_A1L291 LOC799852 protein (Fragment) n=5 Tax=Danio rerio... 92 5e-18 UniRef50_Q4SK43 Chromosome 2 SCAF14570, whole genome shotgun seq... 92 9e-18 UniRef50_Q0CEI7 Putative uncharacterized protein n=1 Tax=Aspergi... 92 1e-17 UniRef50_Q8ZXT3 UPF0189 protein PAE1111 n=10 Tax=Thermoprotei Re... 91 2e-17 UniRef50_C3Y5Q2 Putative uncharacterized protein n=1 Tax=Branchi... 89 5e-17 UniRef50_C3Y5X5 Putative uncharacterized protein n=3 Tax=Branchi... 89 5e-17 UniRef50_Q6NRC6 MGC83934 protein n=3 Tax=Xenopus RepID=Q6NRC6_XENLA 89 7e-17 UniRef50_UPI0000F2CC13 PREDICTED: similar to B aggressive lympho... 88 1e-16 UniRef50_C8WJT1 Appr-1-p processing domain protein n=1 Tax=Egger... 88 1e-16 UniRef50_UPI00005A247A PREDICTED: similar to H2A histone family,... 88 1e-16 UniRef50_C7Z089 Putative uncharacterized protein n=2 Tax=Nectria... 88 1e-16 UniRef50_A7HJC7 Appr-1-p processing domain protein n=1 Tax=Fervi... 88 1e-16 UniRef50_A2DTG7 Appr-1-p processing enzyme family protein n=2 Ta... 88 1e-16 UniRef50_Q94JV1 At1g69340/F10D13.28 n=23 Tax=Embryophyta RepID=Q... 
88 1e-16 UniRef50_B0P6L4 Putative uncharacterized protein n=1 Tax=Anaerot... 87 2e-16 UniRef50_C0W547 Appr-1-p processing domain protein n=1 Tax=Actin... 87 2e-16 UniRef50_D0NNH8 Putative uncharacterized protein n=3 Tax=Phytoph... 87 2e-16 UniRef50_Q4RS18 Histone H2A (Fragment) n=2 Tax=Tetraodontidae Re... 87 3e-16 UniRef50_B2VUH2 MACRO domain containing protein 1 n=1 Tax=Pyreno... 87 3e-16 UniRef50_C3Y406 Putative uncharacterized protein n=2 Tax=Branchi... 87 3e-16 UniRef50_Q4RG95 Chromosome 12 SCAF15104, whole genome shotgun se... 86 5e-16 UniRef50_Q55AK6 U box domain-containing protein n=2 Tax=Eukaryot... 86 5e-16 UniRef50_UPI00006A1CA6 poly (ADP-ribose) polymerase family, memb... 86 6e-16 UniRef50_UPI0000E8099B PREDICTED: similar to PARP9 protein n=2 T... 84 1e-15 UniRef50_D0WKT6 Appr-1-p processing enzyme family domain protein... 84 2e-15 UniRef50_C3Y6H4 Putative uncharacterized protein n=1 Tax=Branchi... 84 2e-15 UniRef50_A5D049 Predicted phosphatase n=4 Tax=Bacteria RepID=A5D... 84 3e-15 UniRef50_A1R2V6 Putative uncharacterized protein n=1 Tax=Arthrob... 84 3e-15 UniRef50_Q0UG78 Putative uncharacterized protein n=1 Tax=Phaeosp... 83 3e-15 UniRef50_UPI000180B1B4 PREDICTED: similar to Poly [ADP-ribose] p... 83 3e-15 UniRef50_C3YS04 Putative uncharacterized protein (Fragment) n=1 ... 83 5e-15 UniRef50_Q2SM57 Predicted phosphatase n=1 Tax=Hahella chejuensis... 83 5e-15 UniRef50_B3RYC4 Putative uncharacterized protein n=1 Tax=Trichop... 82 1e-14 UniRef50_UPI00016E2DD3 UPI00016E2DD3 related cluster n=3 Tax=Tak... 81 1e-14 UniRef50_Q9NXN4 Ganglioside-induced differentiation-associated p... 81 1e-14 UniRef50_C3YS03 Putative uncharacterized protein n=2 Tax=Branchi... 81 1e-14 UniRef50_A2QSI2 Contig An08c0280, complete genome n=1 Tax=Asperg... 80 3e-14 UniRef50_C3YH95 Putative uncharacterized protein n=2 Tax=Eumetaz... 79 4e-14 UniRef50_B0QWK9 Putative uncharacterized protein n=1 Tax=Haemoph... 
79 6e-14 UniRef50_D0NR00 Putative uncharacterized protein n=1 Tax=Phytoph... 79 8e-14 UniRef50_B9L2D9 Appr-1-p processing enzyme family protein n=2 Ta... 78 2e-13 UniRef50_Q460N3 Poly [ADP-ribose] polymerase 15 n=12 Tax=Eutheri... 78 2e-13 UniRef50_Q2TX23 Predicted phosphatase homologous to the C-termin... 78 2e-13 UniRef50_O67112 UPF0189 protein aq_987 n=4 Tax=cellular organism... 77 2e-13 UniRef50_C3Y417 Putative uncharacterized protein (Fragment) n=1 ... 77 2e-13 UniRef50_UPI000180BD0C PREDICTED: similar to Ci-Rhysin2/Deltex3-... 77 4e-13 UniRef50_Q8IXQ6 Poly [ADP-ribose] polymerase 9 n=27 Tax=Eutheria... 76 4e-13 UniRef50_Q4T065 Chromosome undetermined SCAF11328, whole genome ... 76 5e-13 UniRef50_B1H1M8 LOC100148704 protein (Fragment) n=5 Tax=Danio re... 75 1e-12 UniRef50_A7T7L3 Predicted protein (Fragment) n=1 Tax=Nematostell... 74 2e-12 UniRef50_B1L625 Appr-1-p processing domain protein n=1 Tax=Candi... 74 2e-12 UniRef50_O28751 UPF0189 protein AF_1521 n=32 Tax=Euryarchaeota R... 74 2e-12 UniRef50_UPI0001BC8416 Appr-1-p processing domain protein n=1 Ta... 74 3e-12 UniRef50_UPI00005A5611 PREDICTED: similar to poly (ADP-ribose) p... 72 6e-12 UniRef50_D1B7G8 Appr-1-p processing domain protein n=1 Tax=Therm... 72 6e-12 UniRef50_A7BVQ6 Appr-1-p processing enzyme family n=1 Tax=Beggia... 71 1e-11 UniRef50_UPI0001556316 PREDICTED: similar to LRP16 protein n=1 T... 71 1e-11 UniRef50_Q9YBE9 UPF0189 protein APE_1648.1 n=1 Tax=Aeropyrum per... 71 1e-11 UniRef50_UPI000180B63C PREDICTED: similar to Ci-Rhysin2/Deltex3-... 71 2e-11 UniRef50_UPI0001927649 PREDICTED: similar to predicted protein n... 71 2e-11 UniRef50_UPI00006CE511 hypothetical protein TTHERM_00141050 n=1 ... 70 3e-11 UniRef50_UPI0001C3795F Appr-1-p processing domain protein n=1 Ta... 69 7e-11 UniRef50_C2QLJ2 Appr-1-p processing enzyme n=8 Tax=Bacillus cere... 69 7e-11 UniRef50_UPI000180BD0B PREDICTED: similar to Poly [ADP-ribose] p... 
69 8e-11 UniRef50_Q9P0M6 Core histone macro-H2A.2 n=118 Tax=Eukaryota Rep... 69 8e-11 UniRef50_C1XFR0 Predicted phosphatase similar to C-terminal doma... 69 1e-10 UniRef50_C3ZVW0 Putative uncharacterized protein n=1 Tax=Branchi... 69 1e-10 UniRef50_B5Y5Y4 Appr-1-p processing enzyme family protein n=2 Ta... 68 1e-10 UniRef50_Q1DG64 Appr-1-p processing enzyme family domain protein... 68 1e-10 UniRef50_Q5V4P3 Putative uncharacterized protein n=1 Tax=Haloarc... 68 2e-10 UniRef50_A2BJA7 A1pp, Appr-1-p processing enzyme n=1 Tax=Hyperth... 67 2e-10 UniRef50_Q1YRE7 Putative uncharacterized protein n=1 Tax=gamma p... 67 2e-10 UniRef50_UPI0001698AE7 Appr-1-p processing domain protein n=1 Ta... 67 3e-10 UniRef50_D2VX30 Predicted protein n=1 Tax=Naegleria gruberi RepI... 67 4e-10 UniRef50_O07733 UPF0189 protein Rv1899c/MT1950 n=16 Tax=Mycobact... 67 4e-10 UniRef50_Q54PT1 Protein GDAP2 homolog n=1 Tax=Dictyostelium disc... 66 5e-10 UniRef50_Q7JUR6 Protein GDAP2 homolog n=19 Tax=Neoptera RepID=GD... 66 5e-10 UniRef50_A3DLM0 Appr-1-p processing domain protein n=1 Tax=Staph... 66 5e-10 UniRef50_UPI00006A2286 UPI00006A2286 related cluster n=1 Tax=Xen... 66 6e-10 >UniRef50_P0A8D8 UPF0189 protein ymdB n=66 Tax=Proteobacteria RepID=YMDB_ECO57 Length = 177 Score = 363 bits (933), Expect = 1e-99, Method: Compositional matrix adjust. 
Identities = 177/177 (100%), Positives = 177/177 (100%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC Sbjct: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA Sbjct: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE Sbjct: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 >UniRef50_P67342 UPF0189 protein ymdB n=62 Tax=Bacteria RepID=YMDB_SALTI Length = 179 Score = 284 bits (726), Expect = 1e-75, Method: Compositional matrix adjust. Identities = 135/177 (76%), Positives = 153/177 (86%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +R+ V+QGDIT+L+VD IVNAAN SLMGGGGVDGAIHRAAGPALLDAC +RQQQG+C Sbjct: 1 MTSRLQVIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TGHAVIT AG L AKAV+HTVGPVWRGGE E +LL++AY N L L AN + S+AFPA Sbjct: 61 QTGHAVITPAGKLSAKAVIHTVGPVWRGGEHQEAELLEEAYRNCLLLAEANHFRSIAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTGVYGYPRA AAE+AV+TVS+FITR+ALPEQVYFVCYDEE A LY RLLTQQGD+ Sbjct: 121 ISTGVYGYPRAQAAEVAVRTVSDFITRYALPEQVYFVCYDEETARLYARLLTQQGDD 177 >UniRef50_A4W960 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=A4W960_ENT38 Length = 180 Score = 283 bits (724), Expect = 2e-75, Method: Compositional matrix adjust. 
Identities = 136/175 (77%), Positives = 152/175 (86%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I VV GDIT + VDVIVNAANPSLMGGGGVDGAIHRAAGP LL+AC VRQQQG+C Sbjct: 1 MKPQIEVVVGDITTMEVDVIVNAANPSLMGGGGVDGAIHRAAGPQLLEACKTVRQQQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 GHAVIT+AGDLPAKAV+H VGPVW+GGE +E + LQDAYLN LRL AAN Y ++AFPA Sbjct: 61 APGHAVITIAGDLPAKAVIHAVGPVWQGGENHEARTLQDAYLNCLRLAAANGYKTLAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 ISTGVYGYP+AAAAEIAV TVSEF+TR LPE+VYFVCYDEENA LY+RLL Q+G Sbjct: 121 ISTGVYGYPKAAAAEIAVDTVSEFLTRKPLPERVYFVCYDEENAQLYQRLLIQRG 175 >UniRef50_Q8EYT0 UPF0189 protein LA_4133 n=9 Tax=cellular organisms RepID=Y4133_LEPIN Length = 175 Score = 214 bits (544), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 93/171 (54%), Positives = 129/171 (75%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +I +++ DIT+L VD IVNAAN SL+GGGGVDGAIHRA GP +L+ C K+R++QG+C Sbjct: 1 MNNKIKLIKEDITQLEVDAIVNAANSSLLGGGGVDGAIHRAGGPEILEECYKIREKQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AVIT AG L AK ++HTVGP+W GG +NED+LL +AY NSL L +S ++AFP Sbjct: 61 KVGEAVITTAGRLNAKFIIHTVGPIWSGGNKNEDELLSNAYKNSLLLAKNHSLKTIAFPN 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 ISTG+Y +P+ AA+IA+++V+EF+ + + V+FVC+D EN +Y +LL Sbjct: 121 ISTGIYHFPKERAAKIAIQSVTEFLKQDNQIQTVFFVCFDFENLEIYNKLL 171 >UniRef50_C7NE27 Appr-1-p processing domain protein n=2 Tax=Leptotrichia RepID=C7NE27_LEPBD Length = 187 Score = 194 bits (493), Expect = 1e-48, Method: Compositional matrix adjust. 
Identities = 87/180 (48%), Positives = 125/180 (69%), Gaps = 5/180 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +K RI +V+GDIT+ DVIVNAAN SL+GG GVDGAIHR G + + C+K+R QG C Sbjct: 6 LKNRIVLVKGDITEYPADVIVNAANSSLLGGSGVDGAIHRKGGKEITEDCMKIRASQGKC 65 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-----DQLLQDAYLNSLRLVAANSYTS 115 G AVIT AG++ K V+HTVGPVW+ G+ NE ++LL++AY++SL L N + Sbjct: 66 NIGEAVITRAGNMSFKNVIHTVGPVWQSGKNNEAKLFAEKLLKNAYISSLELAEKNKLKN 125 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 ++FP ISTGVY +P+ AA+ A+ V E++ ++ E+V FVC++ EN +Y +LL ++G Sbjct: 126 ISFPNISTGVYRFPKDLAAKTAINAVIEYLEKNDFIEKVNFVCFENENFEIYRKLLEEKG 185 >UniRef50_Q8Q0F9 UPF0189 protein MM_0177 n=18 Tax=cellular organisms RepID=Y177_METMA Length = 187 Score = 188 bits (478), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 90/170 (52%), Positives = 117/170 (68%), Gaps = 4/170 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 RI + +GDI K+ VD IVNAAN +L+GGGGVDGAIHRAAGPALL+ C + CPTG Sbjct: 20 RIRIFEGDIVKMRVDAIVNAANNTLLGGGGVDGAIHRAAGPALLEEC----KTLNGCPTG 75 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT LPAK ++HTVGPVW+GGE+ ED+LL Y SL L ++AFPAIST Sbjct: 76 EAKITSGYLLPAKYIIHTVGPVWQGGEKGEDELLASCYRKSLELARDYKIKTIAFPAIST 135 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 G YG+P AA IAV V EF+ ++ +PE VY VCY++++ ++ L++ Sbjct: 136 GAYGFPSERAAGIAVSQVKEFLQKNEIPETVYLVCYNKDSCKSIKKALSK 185 >UniRef50_D1N530 Appr-1-p processing domain protein n=4 Tax=cellular organisms RepID=D1N530_9BACT Length = 164 Score = 186 bits (473), Expect = 2e-46, Method: Compositional matrix adjust. 
Identities = 96/168 (57%), Positives = 117/168 (69%), Gaps = 6/168 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +VQ DIT+L D IVNAAN SL+GGGGVDGAIHRAAGP LL+AC K CPTG Sbjct: 2 KIQIVQDDITRLRADAIVNAANSSLLGGGGVDGAIHRAAGPELLEACRKF----NGCPTG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT L A+ V+HT GPVW GG E +LL+ Y NSLRL AAN S+AFPAIST Sbjct: 58 EARITPGFRLAARFVIHTPGPVWHGGTHGEAELLEACYRNSLRLAAANGCRSIAFPAIST 117 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 GVY YP+A AA+IA++TV ++ R LPE+V F C+ + +Y+ LL Sbjct: 118 GVYRYPKAEAAQIALRTVRQW--REPLPEEVIFCCFSAADLDVYQELL 163 >UniRef50_D1AKA5 Appr-1-p processing domain protein n=2 Tax=Bacteria RepID=D1AKA5_SEBTE Length = 180 Score = 186 bits (472), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 88/174 (50%), Positives = 111/174 (63%), Gaps = 2/174 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + GDITK+ DVIVNAAN SL+GGGGVDGAIHR GP +LD C K+ +QG CP Sbjct: 6 TELRCENGDITKVKTDVIVNAANSSLLGGGGVDGAIHRTGGPLILDECRKIVDRQGSCPV 65 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AVIT G LPAK V+HTVGPVW G+ NE++ L+ Y NSL++ S+AF IS Sbjct: 66 GEAVITTGGKLPAKFVIHTVGPVWSYGKNNEEEKLRKCYRNSLKIAEDKQLESIAFSNIS 125 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERLLTQQ 174 TG YG+P+ A A+ V ++ T +V FVC D+EN +YE LL + Sbjct: 126 TGTYGFPKETAGRAALDEVKKYFIQTPDTTIREVVFVCLDDENFEIYEELLESE 179 >UniRef50_Q8RB30 UPF0189 protein TTE0995 n=12 Tax=Bacteria RepID=Y995_THETN Length = 175 Score = 184 bits (466), Expect = 2e-45, Method: Compositional matrix adjust. 
Identities = 85/169 (50%), Positives = 120/169 (71%), Gaps = 1/169 (0%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I +++G+I VD IVNAAN SL+GGGGVDGAIH+A GPA+ + +R++QG C Sbjct: 1 MKEKIKLIKGNIVDQEVDAIVNAANSSLIGGGGVDGAIHKAGGPAIAEELKVIREKQGGC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTGHAVIT AG+L AK V+H VGP+W+GG NED LL AY+ SL+L + ++AFP+ Sbjct: 61 PTGHAVITGAGNLKAKYVIHAVGPIWKGGNHNEDNLLASAYIESLKLADEYNVKTIAFPS 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYER 169 ISTG YG+P AA IA++ VS+++ ++ E V FV + + + +Y + Sbjct: 121 ISTGAYGFPVERAARIALRVVSDYLEGSSIKE-VRFVLFSDRDYEVYSK 168 >UniRef50_C1PFE0 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=C1PFE0_BACCO Length = 188 Score = 180 bits (457), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 90/173 (52%), Positives = 118/173 (68%), Gaps = 5/173 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK+ +V GDITK+ D IVNAAN +L+GGGGVDGAIHRAAGP LL+ C K+ C Sbjct: 1 MKS-FKIVLGDITKVKTDAIVNAANTTLLGGGGVDGAIHRAAGPELLEECRKLN----GC 55 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T LPAK V+HT GPVW+GG +E +LL+++Y NSLRL + +VAFP+ Sbjct: 56 PTGEAKMTKGYRLPAKYVIHTPGPVWQGGGHHEAELLENSYQNSLRLAESKGLRTVAFPS 115 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTGVY +P AAA IAV+T+ F+ ++V+ VC+DE YE+ T+ Sbjct: 116 ISTGVYHFPLDAAARIAVRTICTFLETSDSVQEVWMVCFDERTKQAYEKAATE 168 >UniRef50_Q8TQD0 UPF0189 protein MA_1614 n=2 Tax=cellular organisms RepID=Y1614_METAC Length = 195 Score = 177 bits (448), Expect = 2e-43, Method: Compositional matrix adjust. 
Identities = 87/159 (54%), Positives = 107/159 (67%), Gaps = 4/159 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 RI +++ DIT+L VD IVNAAN +L+GGGGVDGAIHRAAGP LL+ C R G CPTG Sbjct: 28 RIRIIERDITELKVDAIVNAANNTLLGGGGVDGAIHRAAGPGLLEEC---RTLNG-CPTG 83 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT LPAK V+HTVGP+W+ G + ED+ L Y SL L ++AFP IST Sbjct: 84 EAKITKGYLLPAKYVIHTVGPIWQEGTKGEDEFLASCYRKSLELARKYDVKTIAFPTIST 143 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEE 162 G YG+P AA IAV V EF+ + LPE V+ VCY++E Sbjct: 144 GAYGFPSERAARIAVSQVKEFLKVNELPEIVFLVCYNKE 182 >UniRef50_Q93SX7 UPF0189 protein n=2 Tax=Acinetobacter RepID=Y189_ACISE Length = 183 Score = 177 bits (448), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 81/173 (46%), Positives = 118/173 (68%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++H++Q DIT AV IVN+AN SL+GGGG+D IH+ AGP + + C+++ Q++G CPTG Sbjct: 3 KVHLIQADITAFAVHAIVNSANKSLLGGGGLDYVIHKKAGPLMKEECVRLNQEKGGCPTG 62 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A +T AG+LPAK ++H VGP W GE NE QLL DAY N+L +V+FP IST Sbjct: 63 QAEVTTAGNLPAKYLIHAVGPRWLDGEHNEPQLLCDAYSNALFKANEIHALTVSFPCIST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 GVYG+P AAEIA+ T+ + ++ +V+F+C ++EN +Y+ +L+ D Sbjct: 123 GVYGFPPQKAAEIAIGTILSMLPQYDHVAEVFFICREDENYLIYKNILSNIDD 175 >UniRef50_C6RT62 Appr-1-p processing n=2 Tax=Acinetobacter radioresistens RepID=C6RT62_ACIRA Length = 186 Score = 174 bits (442), Expect = 8e-43, Method: Compositional matrix adjust. 
Identities = 81/171 (47%), Positives = 112/171 (65%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + ++ GDIT + +D IVNAAN +L+GG GVDGAIH+A GP +++ C ++R +QG C G Sbjct: 3 QFRLIHGDITGIRIDAIVNAANSTLLGGHGVDGAIHQAGGPDIIEECRQIRARQGSCTVG 62 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T G LPA+ V+HTVGP+W G+ NE LL AY NS L + T +A+P IST Sbjct: 63 EAVMTTGGRLPAQYVIHTVGPIWEEGKANERTLLSQAYQNSFALAEQHYLTGIAYPNIST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 GVY +P+ AA IA+ T+ + ++V VC+D EN LYE LL Q+ Sbjct: 123 GVYRFPKVEAAAIAIDTLIPLLKNSETVQEVALVCFDLENFELYEELLKQR 173 >UniRef50_Q71W03 UPF0189 protein LMOf2365_2748 n=23 Tax=Bacteria RepID=Y2748_LISMF Length = 176 Score = 174 bits (440), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 92/173 (53%), Positives = 116/173 (67%), Gaps = 2/173 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I VV+GDIT+ VDVIVNAANP L+GGGGVDGAIH+AAGP LL C +V + G CP G Sbjct: 2 EITVVKGDITEQDVDVIVNAANPGLLGGGGVDGAIHQAAGPDLLKECQEVINRIGTCPAG 61 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT AGDL A ++H VGP+W+ GE E L Y +L L A TS+AFP IST Sbjct: 62 EAVITSAGDLQASYIIHAVGPIWKDGEHQEANKLASCYWKALDLAAGKELTSIAFPNIST 121 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQ 174 GVYG+P+ AAE+A+ TV ++ E++ FVC+DEEN LY +L+ + Sbjct: 122 GVYGFPKKLAAEVALYTVRKWAEEEYDTSIEEIRFVCFDEENLKLYNKLINSE 174 >UniRef50_Q8KAE4 UPF0189 protein CT2219 n=7 Tax=Bacteria RepID=Y2219_CHLTE Length = 172 Score = 173 bits (439), Expect = 2e-42, Method: Compositional matrix adjust. 
Identities = 87/167 (52%), Positives = 110/167 (65%), Gaps = 4/167 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 IH ++ DIT L VD IVNAAN SL+GGGGVDGAIHRAAGP LL+AC ++ G C TG Sbjct: 7 IHAIKADITSLTVDAIVNAANTSLLGGGGVDGAIHRAAGPKLLEAC----RELGGCLTGE 62 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A IT LPA V+HTVGPVW GG E +LL Y NSL+L + ++AFP+ISTG Sbjct: 63 AKITKGYRLPATFVIHTVGPVWHGGNHGEAELLASCYRNSLKLAIEHHCRTIAFPSISTG 122 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +YGYP AA IA+ TV E + E+V F C+ + + +Y++ L Sbjct: 123 IYGYPVEQAAAIAITTVREMLADERGIEKVIFCCFSDRDLDVYQKAL 169 >UniRef50_C7RS37 Appr-1-p processing domain protein n=15 Tax=cellular organisms RepID=C7RS37_9PROT Length = 197 Score = 171 bits (434), Expect = 7e-42, Method: Compositional matrix adjust. Identities = 89/173 (51%), Positives = 108/173 (62%), Gaps = 4/173 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M T + + DIT LAVD IVNAAN SL+GGGGVDGAIHRAAGP LL C + G C Sbjct: 26 MSTMLRAICADITTLAVDAIVNAANSSLLGGGGVDGAIHRAAGPGLLAEC----RLLGGC 81 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T A LPA+ ++HTVGPVW GG E Q L Y SL L AN ++A P+ Sbjct: 82 PTGEARLTHAHRLPARYIIHTVGPVWHGGGSGEAQRLASCYRCSLELAVANDLVTLAIPS 141 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTG+YGYP AAE+AV TV + +V F C+ + +YERLL + Sbjct: 142 ISTGIYGYPIEQAAEVAVSTVRASVRELGRLREVVFCCFSPGDLRVYERLLGE 194 >UniRef50_Q8PHB6 UPF0189 protein XAC3343 n=14 Tax=Proteobacteria RepID=Y3343_XANAC Length = 179 Score = 171 bits (434), Expect = 8e-42, Method: Compositional matrix adjust. 
Identities = 90/173 (52%), Positives = 110/173 (63%), Gaps = 2/173 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG--DCP 61 RI V QGDIT+L VDVIVNAAN SL+GGGGVDGAIHRAAGP LL+AC + Q + CP Sbjct: 2 RIEVWQGDITELDVDVIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPQVRPGVRCP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG IT DL A+ + HTVGPVWR G NE + L + Y SL+L S+AFPAI Sbjct: 62 TGEIRITDGFDLKARHIFHTVGPVWRDGRHNEPEQLANCYWQSLKLAEQMMLHSIAFPAI 121 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 S G+YGYP AA IAV ++ H +P+ + V Y+E Y++ L Q Sbjct: 122 SCGIYGYPLHQAARIAVTETRDWQRSHKVPKHIVLVAYNEATYKAYQQALATQ 174 >UniRef50_Q3AEI4 Putative uncharacterized protein n=6 Tax=Bacteria RepID=Q3AEI4_CARHZ Length = 181 Score = 171 bits (432), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 85/169 (50%), Positives = 113/169 (66%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++I + GDITK VD IVNAAN L GGGGVDGAIHRA GP +++ C ++ + G P Sbjct: 8 SKIILKLGDITKEKVDAIVNAANSRLAGGGGVDGAIHRAGGPKIMEECREIINKIGVLPP 67 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV T AG+LPAK V+HTVGP++RGG++ E+ L++AYLNSL+L + ++AFP+IS Sbjct: 68 GEAVATTAGNLPAKYVIHTVGPIYRGGQKGEENTLRNAYLNSLKLAKQLNVKTIAFPSIS 127 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 TG YGYP AA +A+K V EF+ V FV +DE Y+ L Sbjct: 128 TGAYGYPVKDAARVALKAVIEFLEGEPEDFTVVFVLFDEITYAAYQEAL 176 >UniRef50_C8NAC1 RNase III regulator YmdB n=3 Tax=cellular organisms RepID=C8NAC1_9GAMM Length = 165 Score = 168 bits (426), Expect = 6e-41, Method: Compositional matrix adjust. 
Identities = 89/167 (53%), Positives = 108/167 (64%), Gaps = 4/167 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V DIT LAVD IVNAAN SL+GG GVDGAIHRAAG L+ C + G C G Sbjct: 3 LEVQVADITTLAVDAIVNAANESLLGGSGVDGAIHRAAGKELVAEC----RTLGGCKVGE 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T LPA+ V+HTVGPVW GG+ E + L +AY NSLRL A+ TS+AFPAISTG Sbjct: 59 AKLTRGYRLPARFVIHTVGPVWYGGDDGEAEALANAYANSLRLAEAHELTSIAFPAISTG 118 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 V+GYP+ AA IA+ TV + +V F C+ E +A LY RLL Sbjct: 119 VFGYPKEDAARIAIDTVRATLKECPHMARVIFCCFSERDAALYRRLL 165 >UniRef50_Q9HXU7 UPF0189 protein PA3693 n=16 Tax=Bacteria RepID=Y3693_PSEAE Length = 173 Score = 167 bits (424), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 93/172 (54%), Positives = 109/172 (63%), Gaps = 4/172 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + V QGDIT+LAVD IVNAAN SL+GGGGVDGAIHRAAG L+ AC R G C T Sbjct: 2 TEVRVWQGDITRLAVDAIVNAANSSLLGGGGVDGAIHRAAGAELVAAC---RLLHG-CKT 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPA V+HTVGPVWRGG+ E +LL Y SL L SVAFPAIS Sbjct: 58 GEAKITRGFRLPAAHVIHTVGPVWRGGDNGEAELLASCYRRSLALAEQAGAASVAFPAIS 117 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 G+YGYP AA IAV+ V H+ E++ V +D A Y+RLL ++ Sbjct: 118 CGIYGYPLEQAAAIAVEEVCRQRPAHSSLEEIVLVAFDSSMAERYQRLLGER 169 >UniRef50_Q985D2 UPF0189 protein mll7730 n=12 Tax=Bacteria RepID=Y7730_RHILO Length = 176 Score = 163 bits (412), Expect = 3e-39, Method: Compositional matrix adjust. 
Identities = 96/166 (57%), Positives = 109/166 (65%), Gaps = 4/166 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 RI + GDITKL VD IVNAAN L+GGGGVDGAIHRAAG L C R G C G Sbjct: 7 RIRIHTGDITKLDVDAIVNAANTLLLGGGGVDGAIHRAAGRELEVEC---RMLNG-CKVG 62 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT LPA+ ++HTVGPVW+GG + E +LL Y +SL L AAN SVAFPAIST Sbjct: 63 DAKITKGYKLPARHIIHTVGPVWQGGGKGEAELLASCYRSSLELAAANDCRSVAFPAIST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYER 169 GVY YP+ A IAV TVS I A+PE V F C+DE+ A LY R Sbjct: 123 GVYRYPKDEATGIAVGTVSMVIEEKAMPETVIFCCFDEQTAQLYLR 168 >UniRef50_A5GC80 Appr-1-p processing domain protein n=2 Tax=Desulfuromonadales RepID=A5GC80_GEOUR Length = 172 Score = 161 bits (407), Expect = 9e-39, Method: Compositional matrix adjust. Identities = 85/168 (50%), Positives = 106/168 (63%), Gaps = 4/168 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I ++QGDIT+LAVD IVNAAN +L+GGGGVDGAIHRAAGP L+ C G C Sbjct: 1 MKGKIEIIQGDITRLAVDAIVNAANNTLLGGGGVDGAIHRAAGPDLVAEC----STLGGC 56 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT LPAK V+HTVGPVW GG + E +LL+ AY + A+ S+AFPA Sbjct: 57 ETGDAKITKGYKLPAKHVIHTVGPVWHGGSKGEPELLRKAYRRCFEVAHASKLKSIAFPA 116 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYE 168 IS GVYGYP A EIA+ + + E+V FV + +Y+ Sbjct: 117 ISAGVYGYPMDQACEIAMVEAKAALEKFPELERVIFVPFSPGALAIYQ 164 >UniRef50_Q8Y2K1 UPF0189 protein RSc0334 n=39 Tax=cellular organisms RepID=Y334_RALSO Length = 171 Score = 159 bits (402), Expect = 4e-38, Method: Compositional matrix adjust. 
Identities = 87/169 (51%), Positives = 108/169 (63%), Gaps = 7/169 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++ DIT LA D IVNAAN +L+GGGGVDGAIHRAAGP LL+AC R G C TG Sbjct: 8 LRALRADITTLACDAIVNAANSALLGGGGVDGAIHRAAGPELLEAC---RALHG-CRTGQ 63 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A IT LPA+ ++HTVGP+WRGG Q+E LL Y NSL L + ++AFP ISTG Sbjct: 64 AKITPGFLLPARYIIHTVGPIWRGGRQDEAALLAACYRNSLALAKQHDVRTIAFPCISTG 123 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 VYG+P AA IAV+TV E A + + F C+ + LYE L + Sbjct: 124 VYGFPPQLAAPIAVRTVRE---HGADLDDIVFCCFSAADLALYETALNE 169 >UniRef50_Q6PHJ5 Zgc:65960 n=11 Tax=cellular organisms RepID=Q6PHJ5_DANRE Length = 452 Score = 159 bits (401), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 84/172 (48%), Positives = 113/172 (65%), Gaps = 8/172 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ + +GDIT L +D IVNAAN SL+GGGGVDG IHRAAG L + C + C TG Sbjct: 62 KVSLYKGDITILEIDAIVNAANSSLLGGGGVDGCIHRAAGHLLYEECHSL----NGCDTG 117 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A IT DLPAK V+HTVGP+ RG G+ D L + Y +SL+L+ N+ SVAFP I Sbjct: 118 KAKITCGYDLPAKYVIHTVGPIARGNVGQSQRDDL-ESCYYSSLKLMKDNNLRSVAFPCI 176 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLLT 172 STG+YG+P AAEIA+KTV E+I +H ++V F + E + +Y+R ++ Sbjct: 177 STGIYGFPNEPAAEIALKTVQEWIEKHQDEIDRVIFCVFLETDYEIYKRKMS 228 >UniRef50_A1Z1Q3 MACRO domain-containing protein 2 n=55 Tax=cellular organisms RepID=MACD2_HUMAN Length = 448 Score = 158 bits (400), Expect = 6e-38, Method: Compositional matrix adjust. 
Identities = 86/177 (48%), Positives = 115/177 (64%), Gaps = 10/177 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +GDIT L VD IVNAAN SL+GGGGVDG IHRAAGP LL C R G C Sbjct: 68 LTEKVSLYRGDITLLEVDAIVNAANASLLGGGGVDGCIHRAAGPCLLAEC---RNLNG-C 123 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRG---GEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 TGHA IT DLPAK V+HTVGP+ RG G ED L + Y +SL+LV N+ SVA Sbjct: 124 DTGHAKITCGYDLPAKYVIHTVGPIARGHINGSHKED--LANCYKSSLKLVKENNIRSVA 181 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITR-HALPEQVYFVCYDEENAHLYERLLTQ 173 FP ISTG+YG+P AA IA+ T+ E++ + H +++ F + E + +Y++ + + Sbjct: 182 FPCISTGIYGFPNEPAAVIALNTIKEWLAKNHHEVDRIIFCVFLEVDFKIYKKKMNE 238 >UniRef50_A8ZUR5 Appr-1-p processing domain protein n=3 Tax=cellular organisms RepID=A8ZUR5_DESOH Length = 195 Score = 158 bits (400), Expect = 6e-38, Method: Compositional matrix adjust. Identities = 86/170 (50%), Positives = 103/170 (60%), Gaps = 4/170 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +R+ V QGDIT L VD IVNAAN +L+GGGGVDGAIHRAAGP LL C + G C T Sbjct: 27 SRLKVWQGDITTLEVDAIVNAANKTLLGGGGVDGAIHRAAGPELLAEC----KTLGGCDT 82 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPAK V+HTVGPV+ +LL Y NSL+L SVAFPA+S Sbjct: 83 GQAKITRGYRLPAKFVIHTVGPVYSRSNPGVAKLLAGCYTNSLKLAKDQGLASVAFPAVS 142 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVYGYP A IA+ TV +F+ EQV F + + +YE L+ Sbjct: 143 CGVYGYPMKEACRIALDTVCDFLETDRTIEQVIFALFSADAVRVYEGYLS 192 >UniRef50_C6BB95 Appr-1-p processing domain protein n=4 Tax=cellular organisms RepID=C6BB95_RALP1 Length = 171 Score = 158 bits (399), Expect = 8e-38, Method: Compositional matrix adjust. 
Identities = 86/170 (50%), Positives = 105/170 (61%), Gaps = 7/170 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++GDIT L D IVNAAN SL+GGGGVDGAIHRAAGP LL+AC R G C TG Sbjct: 8 LRALRGDITTLDCDAIVNAANSSLLGGGGVDGAIHRAAGPELLEAC---RALHG-CRTGE 63 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T L A+ V+HTVGP+WRGG Q+E LL Y NSL L S+AFP ISTG Sbjct: 64 AKLTPGFQLTARYVIHTVGPIWRGGRQDEAALLAACYRNSLELACKYEVRSIAFPCISTG 123 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 +YG+P AA IAV+ E +R E + F C+ + LYE L + Sbjct: 124 IYGFPPQLAAPIAVRAAREHGSRF---ETITFCCFSAADLILYEAALGNR 170 >UniRef50_B7C8M6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B7C8M6_9FIRM Length = 296 Score = 157 bits (396), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 81/177 (45%), Positives = 113/177 (63%), Gaps = 6/177 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +++ I VV+GDIT D IVNAAN SL+GGGGVDGAIHRAAGP LL+ C + C Sbjct: 125 LESEIKVVKGDITTFDGDCIVNAANESLLGGGGVDGAIHRAAGPMLLEEC----KLLNGC 180 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT DL AK V+HTVGP++ G ++E +L+D Y NSL L ++AFPA Sbjct: 181 QTGQAKITKGYDLKAKYVIHTVGPMYSGKHEDE-HMLRDCYWNSLTLARKYDIHTIAFPA 239 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQGD 176 IS GVYGYP A + +KT+++++ ++ ++ C+DEE Y++ + QG+ Sbjct: 240 ISCGVYGYPVEKAVPLVLKTIADWLDANSDYTMKISLYCFDEETTKEYQKYTSYQGE 296 >UniRef50_C1H4Y3 MACRO domain-containing protein n=4 Tax=cellular organisms RepID=C1H4Y3_PARBA Length = 334 Score = 155 bits (391), Expect = 8e-37, Method: Compositional matrix adjust. 
Identities = 82/172 (47%), Positives = 109/172 (63%), Gaps = 6/172 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + I ++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAG L C + G C Sbjct: 38 LNNSICLITSDITKLEVDCIVNAANKSLLGGGGVDGAIHRAAGRGLWQEC----RSLGGC 93 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT A +LP + V+H VGP++ E E LL+ Y+ SL + A N S+AF + Sbjct: 94 MTGDAKITNAYNLPCRKVIHAVGPMYWADEDRE-SLLRSCYMRSLTIAAENGLKSIAFSS 152 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYERLL 171 ISTGVYGYP + AAE+A++ V F+ R + PE+V F ++ ++ + Y LL Sbjct: 153 ISTGVYGYPSSKAAEVAIRAVKHFLEARSSPPERVIFCTFEPKDVNAYRALL 204 >UniRef50_Q8EP31 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8EP31_OCEIH Length = 185 Score = 154 bits (388), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 77/170 (45%), Positives = 111/170 (65%), Gaps = 4/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ---GDCP 61 + +V GDITK +VIVNAAN SL+GGGGVDGAIH AAGP LL AC ++R + + P Sbjct: 10 LEIVVGDITKETTNVIVNAANGSLLGGGGVDGAIHHAAGPELLKACQEMRNNELNGEELP 69 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG +IT LP++ ++HTVGP+W +++LL + Y N+L LV +S++FP+I Sbjct: 70 TGEVIITSGFQLPSRFIIHTVGPIWNQTPDLQEELLANCYRNALELVKVKKLSSISFPSI 129 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 STGVYGYP AA IA++T+ +F+ + + V V + E + +Y+ L Sbjct: 130 STGVYGYPIHEAAAIALQTIIQFLQENDVG-LVKVVLFSERDYSIYQEKL 178 >UniRef50_Q047N9 Predicted phosphatase, histone macroH2A1 family n=3 Tax=Bacteria RepID=Q047N9_LACDB Length = 166 Score = 153 bits (387), Expect = 2e-36, Method: Compositional matrix adjust. 
Identities = 87/168 (51%), Positives = 107/168 (63%), Gaps = 6/168 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + QGDIT L VD IVNAAN L GGGGVDGAIHRAAGP L +AC + G C TG Sbjct: 3 LEIWQGDITTLKVDAIVNAANRELRGGGGVDGAIHRAAGPKLNEAC----RALGSCETGE 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A IT +LPAK ++HTVGPV+ G ++ LL Y NSLR+ N SVAF AISTG Sbjct: 59 AKITPGFNLPAKYIIHTVGPVY-SGSHSDPLLLAACYRNSLRVAKENGLHSVAFSAISTG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLL 171 VYGYP AA+++A V +++ H E +V V YD LY++LL Sbjct: 118 VYGYPLDAASKVAFGEVRKWLREHKDYEMRVIMVAYDARTYALYQKLL 165 >UniRef50_Q9EYI6 UPF0189 protein in sno 5'region n=22 Tax=Bacteria RepID=Y189_STRNO Length = 181 Score = 150 bits (380), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 90/176 (51%), Positives = 108/176 (61%), Gaps = 9/176 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ-GD-C 60 T I +VQGDIT+ D +VNAAN SL+GGGGVDGAIHR GPA+L C +R + G+ Sbjct: 2 TTITLVQGDITRQHADALVNAANSSLLGGGGVDGAIHRRGGPAILAECRALRASRYGEGL 61 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG AV T AGDL A+ V+HTVGPVW E D LL Y SLRL +VAFPA Sbjct: 62 PTGRAVATTAGDLDARWVIHTVGPVWSSTEDRSD-LLASCYRESLRLAGELGARTVAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 +STGVY +P AA IAV+TV T E+V FV +D H Y+ + GD Sbjct: 121 LSTGVYRWPMGDAARIAVETVR---TTPTAVEEVRFVLFD---THAYDTFARELGD 170 >UniRef50_Q9WYX8 UPF0189 protein TM_0508 n=15 Tax=cellular organisms RepID=Y508_THEMA Length = 599 Score = 149 bits (377), Expect = 3e-35, Method: Composition-based stats. 
Identities = 76/167 (45%), Positives = 103/167 (61%), Gaps = 2/167 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +V+GDIT+ VD IVNAAN L GGGV GAI RA G + + ++ Q++G PTG Sbjct: 428 KIRIVKGDITREEVDAIVNAANEYLKHGGGVAGAIVRAGGSVIQEESDRIVQERGRVPTG 487 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T AG L AK V+HTVGPVWRGG ED+LL A N+L S++ PAIST Sbjct: 488 EAVVTSAGKLKAKYVIHTVGPVWRGGSHGEDELLYKAVYNALLRAHELKLKSISMPAIST 547 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYE 168 G++G+P+ A I K + +FI +H E++ DEE ++E Sbjct: 548 GIFGFPKERAVGIFSKAIRDFIDQHPDTTLEEIRICNIDEETTKIFE 594 >UniRef50_A7IGI6 Appr-1-p processing domain protein n=53 Tax=cellular organisms RepID=A7IGI6_XANP2 Length = 193 Score = 149 bits (377), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 87/174 (50%), Positives = 107/174 (61%), Gaps = 4/174 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + R+ +V GDIT+LA+D IVNAAN SL+GGGGVDGAIHRAAGP LL C + G CP Sbjct: 19 QARLDIVVGDITRLALDAIVNAANSSLLGGGGVDGAIHRAAGPELLAYC----RTLGGCP 74 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG A +T LPA V+HTVGPVW GG E+ LL Y SL+L S+AFPAI Sbjct: 75 TGEARLTPGFRLPAAHVIHTVGPVWHGGGAGEEGLLGSCYRESLKLADGAGLASIAFPAI 134 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 STG+YG+P AA +AV TV + +V F C+ +E A L+ G Sbjct: 135 STGIYGFPADRAAPLAVGTVLAHLGAPGSVTRVVFCCFSQEAADLHHDAFRAHG 188 >UniRef50_A2SS36 Appr-1-p processing domain protein n=26 Tax=cellular organisms RepID=A2SS36_METLZ Length = 183 Score = 149 bits (377), Expect = 3e-35, Method: Compositional matrix adjust. 
Identities = 78/139 (56%), Positives = 96/139 (69%), Gaps = 4/139 (2%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 VV+ DIT L+VDVIVNAAN +L+GGGGVDGAIH AAGP LL C + G C G A Sbjct: 14 VVKTDITTLSVDVIVNAANTTLLGGGGVDGAIHHAAGPGLLAEC----RTLGGCRIGEAK 69 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 IT LPAK ++HTVGPVW GG + E + L+ Y +SL L + ++AFPA+STGVY Sbjct: 70 ITKGYALPAKYIIHTVGPVWWGGNEGEPEQLRACYFHSLTLAGEHGLRTIAFPAVSTGVY 129 Query: 127 GYPRAAAAEIAVKTVSEFI 145 GYP+ AA IAV+TV F+ Sbjct: 130 GYPKDKAAVIAVETVLSFL 148 >UniRef50_Q0UQZ6 Putative uncharacterized protein n=2 Tax=Leotiomyceta RepID=Q0UQZ6_PHANO Length = 291 Score = 148 bits (373), Expect = 9e-35, Method: Compositional matrix adjust. Identities = 79/173 (45%), Positives = 108/173 (62%), Gaps = 8/173 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I +++ DIT LA+D IVNAAN SL+GGGGVDGAIHRAAGP L D C + C Sbjct: 37 LNDKISIIRRDITTLAIDAIVNAANTSLLGGGGVDGAIHRAAGPKLYDEC----ETLDGC 92 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG+A +T +LP+K V+H VGP+ W+ G +LL Y SL+L N S+AF Sbjct: 93 ETGNAKMTRGYELPSKKVIHAVGPIYWKEGRSASAKLLSMCYRTSLQLAVDNECRSIAFS 152 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ---VYFVCYDEENAHLYER 169 A+STGVYGYP AA +A++TV +F+ E+ V F + E++ + Y R Sbjct: 153 ALSTGVYGYPSDEAAVVALQTVRQFLDEDGKAEKLDRVIFCNFLEKDENAYYR 205 >UniRef50_Q0CQJ0 Protein LRP16 n=10 Tax=cellular organisms RepID=Q0CQJ0_ASPTN Length = 344 Score = 145 bits (367), Expect = 5e-34, Method: Compositional matrix adjust. 
Identities = 83/177 (46%), Positives = 105/177 (59%), Gaps = 13/177 (7%) Query: 1 MKTRIHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + RI +++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAGP L+ C + G Sbjct: 37 LNDRISLIRHDITKLLDVDCIVNAANSSLLGGGGVDGAIHRAAGPGLVRECRTL----GG 92 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVW----RGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 C TG A T A DLP + V+HTVGP++ + G +QLL+ Y L L N S Sbjct: 93 CATGDAKTTAAYDLPCRWVIHTVGPIYPVERQKGAARPEQLLRSCYRRCLELAVRNKARS 152 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALP----EQVYFVCYDEENAHLYE 168 +AFPAISTGVY YP+ AA IA+ F+ E+V F ++EE+ YE Sbjct: 153 IAFPAISTGVYAYPKRRAARIALDETRAFLESEGTDIVTLEKVVFCNFEEEDQRAYE 209 >UniRef50_D1U7C0 Appr-1-p processing domain protein n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U7C0_9DELT Length = 186 Score = 145 bits (365), Expect = 7e-34, Method: Compositional matrix adjust. Identities = 79/161 (49%), Positives = 103/161 (63%), Gaps = 7/161 (4%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPA-LLDACLKVRQQQGDCPTGH- 64 + QGDIT L VD +VNAANP L GGGGVDGAIHRAAG A L AC + G PTG Sbjct: 13 IRQGDITTLDVDCVVNAANPQLAGGGGVDGAIHRAAGIAQLRQACQAIIDDPGQLPTGQL 72 Query: 65 ----AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 AV+TL DLPA+ ++HTVGP+WRGG E + L+ +Y +SL+L ++ ++AFPA Sbjct: 73 PVGQAVLTLGFDLPARYIIHTVGPIWRGGVHGESEQLRSSYQSSLKLAHQHALATIAFPA 132 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE 161 +S G YGYP AA IA+ + + + L QV+ V +D Sbjct: 133 LSCGAYGYPIPQAARIALDAIRQGLLD-GLAAQVHMVLHDH 172 >UniRef50_D1ZDH8 Whole genome shotgun sequence assembly, scaffold_20 n=4 Tax=cellular organisms RepID=D1ZDH8_SORMA Length = 261 Score = 145 bits (365), Expect = 8e-34, Method: Compositional matrix adjust. 
Identities = 85/176 (48%), Positives = 112/176 (63%), Gaps = 10/176 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + RI + GDITKL +D IVNAAN SL+GGGGVDGAIHRAAGP LL C R C Sbjct: 90 LNKRIAIHTGDITKLHIDAIVNAANNSLLGGGGVDGAIHRAAGPQLLRECRTKRT----C 145 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFP 119 TG AV+T A +LP V+HTVGPV+ G +E ++LL YL SL++ A T++AFP Sbjct: 146 DTGDAVMTEAYNLPCAKVIHTVGPVYSGVNHDECEKLLISCYLRSLQIAAETGLTTIAFP 205 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFI----TRHALPEQVYFVCYDEENAHLYERLL 171 +ISTGVYGYP AA+ A+ + F+ TR+A+ +V V + +++ Y L Sbjct: 206 SISTGVYGYPSKEAAQAALAAIRHFLTDPKTRNAI-TKVIIVTFVDKDTRAYTEWL 260 >UniRef50_Q9BQ69 MACRO domain-containing protein 1 n=11 Tax=Tetrapoda RepID=MACD1_HUMAN Length = 325 Score = 144 bits (364), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 83/174 (47%), Positives = 109/174 (62%), Gaps = 8/174 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I +++ DITKL VD IVNAAN SL+GGGGVDG IHRAAGP L D C ++ C Sbjct: 150 LNEKISLLRSDITKLEVDAIVNAANSSLLGGGGVDGCIHRAAGPLLTDECRTLQ----SC 205 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL--LQDAYLNSLRLVAANSYTSVAF 118 TG A IT LPAK V+HTVGP+ GE + Q L+ YL+SL L+ + SVAF Sbjct: 206 KTGKAKITGGYRLPAKYVIHTVGPI-AYGEPSASQAAELRSCYLSSLDLLLEHRLRSVAF 264 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVC-YDEENAHLYERLL 171 P ISTGV+GYP AAAEI + T+ E++ +H +C + E++ +Y L Sbjct: 265 PCISTGVFGYPCEAAAEIVLATLREWLEQHKDKVDRLIICVFLEKDEDIYRSRL 318 >UniRef50_A0L536 Appr-1-p processing domain protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L536_MAGSM Length = 180 Score = 143 bits (361), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 81/173 (46%), Positives = 104/173 (60%), Gaps = 3/173 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDC 60 T + ++ DIT+L +D +VNAAN SL+GG GVDGAIHR G AL AC +R Sbjct: 2 TTLEIILTDITQLPIDGVVNAANNSLLGGMGVDGAIHRVGGTALTQACQALRHTHYPDGL 61 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG AV T AG+LPAK V+HTVGPV+ + + L D Y NSLR S+AFPA Sbjct: 62 ATGAAVATCAGELPAKRVIHTVGPVY-AKDPDPQARLADCYRNSLRCAQEEGLRSIAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTGVYG+P+ AA IAV T+ + + E+V V + EE+A + L Q Sbjct: 121 ISTGVYGFPKQQAANIAVATLLQALREGVALERVVLVAFSEEDAQILRHALNQ 173 >UniRef50_C8WYT5 Appr-1-p processing domain protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8WYT5_DESRD Length = 188 Score = 143 bits (360), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 78/177 (44%), Positives = 105/177 (59%), Gaps = 3/177 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 R+ + QGDIT V +VNAAN L GGGGVDGA+ RAAGP LL A + ++ G G Sbjct: 12 RLEIRQGDITAAEVGAVVNAANSRLAGGGGVDGALQRAAGPQLLQAGQEYVREHGALSVG 71 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T LPA V+HTVGP+WRGG NE+ LL+ AY N L++ S+AFPAIS Sbjct: 72 DAVVTPGFALPASQVIHTVGPIWRGGGHNEEALLERAYANCLQVAKDQGIQSIAFPAISC 131 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY---ERLLTQQGDE 177 GVYG+P AA IA+ + + R A+ ++ + A Y +RL+ + +E Sbjct: 132 GVYGFPEKRAAAIAIPVIVAALERDAVSSVALYLYSNPSYAVWYNEAQRLIGAEHEE 188 >UniRef50_B2JCA0 Appr-1-p processing domain protein n=13 Tax=Proteobacteria RepID=B2JCA0_BURP8 Length = 183 Score = 143 bits (360), Expect = 3e-33, Method: Compositional matrix adjust. 
Identities = 78/151 (51%), Positives = 94/151 (62%), Gaps = 4/151 (2%) Query: 11 DITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLA 70 DIT L VD +VNAAN SL+GGGGVDGA+HRAAG LL C Q G C TG A IT Sbjct: 15 DITTLDVDAVVNAANTSLLGGGGVDGALHRAAGADLLREC----QTLGGCVTGDAKITGG 70 Query: 71 GDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPR 130 L A+ V+H VGPVW GG + E +LL Y SL L S+AFPAIS GVY +P Sbjct: 71 HRLKARHVIHAVGPVWHGGGRGEAELLASCYRRSLELARDAKAKSIAFPAISCGVYRFPA 130 Query: 131 AAAAEIAVKTVSEFITRHALPEQVYFVCYDE 161 A IA++TV + + R + E+V F C+DE Sbjct: 131 DEAVRIAMQTVIDTLPRVSTVERVIFACFDE 161 >UniRef50_Q8K4G6 MACRO domain-containing protein 1 (Fragment) n=5 Tax=cellular organisms RepID=MACD1_RAT Length = 258 Score = 142 bits (359), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 79/173 (45%), Positives = 108/173 (62%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I + +GDITKL VD IVNAAN SL+GGGGVDG IHRAAG L D C ++ +C Sbjct: 83 LNEKISLFRGDITKLEVDAIVNAANNSLLGGGGVDGCIHRAAGSLLTDECRTLQ----NC 138 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT LPAK V+HTVGP+ G ++ L+ YL+SL L+ + SVAFP Sbjct: 139 ETGKAKITCGYRLPAKHVIHTVGPIAVGQPTASQAAELRSCYLSSLDLLLEHRLRSVAFP 198 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVC-YDEENAHLYERLL 171 ISTGV+GYP AAE+ + T+ E++ +H +C + E++ +Y+ L Sbjct: 199 CISTGVFGYPNEEAAEVVLATLREWLEQHKDKVDRLIICVFLEKDEGIYQERL 251 >UniRef50_Q30ZH6 Appr-1-p processing n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30ZH6_DESDG Length = 183 Score = 142 bits (359), Expect = 4e-33, Method: Compositional matrix adjust. 
Identities = 77/172 (44%), Positives = 105/172 (61%), Gaps = 3/172 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++QGD+T D +VNAAN L GGGGVDGA+H AAGPALL C + + G P G Sbjct: 10 LEILQGDLTLFKADAVVNAANSRLAGGGGVDGALHAAAGPALLADCSRWVARHGLLPAGK 69 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A++T A LPA+ V+HTVGPVWRGG+ NE+ L+ AY + L +N + VAFPAIS G Sbjct: 70 AMVTPAHRLPARHVIHTVGPVWRGGKNNEETTLRQAYESCFTLCRSNGFAHVAFPAISCG 129 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 YGYP + AA +A+ ++ + P ++ FV + A +Y L D Sbjct: 130 TYGYPASPAARVALACAAQALACQGAPAKITFVLH---TAQMYTIWLKAAQD 178 >UniRef50_Q87JZ5 UPF0189 protein VPA0103 n=5 Tax=Proteobacteria RepID=Y4103_VIBPA Length = 170 Score = 142 bits (359), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 86/171 (50%), Positives = 107/171 (62%), Gaps = 5/171 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-DCPTG 63 I +VQGDIT VD IVNAANP ++GGGGVDGAIHRAAGPAL++AC V G CP G Sbjct: 4 ISLVQGDITTAHVDAIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDVDGIRCPFG 63 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT AG+L A+ V+H VGP++ + +L+ AY SL L AN SVA PAIS Sbjct: 64 DARITEAGNLNARYVIHAVGPIY-DKFADPKTVLESAYQRSLDLALANHCQSVALPAISC 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 GVYGYP AAE+A+ V + AL + Y + EE +++ LTQ Sbjct: 123 GVYGYPPQEAAEVAM-AVCQRPEYAALDMRFYL--FSEEMLSIWQHALTQH 170 >UniRef50_A4R3Q9 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4R3Q9_MAGGR Length = 263 Score = 142 bits (358), Expect = 5e-33, Method: Compositional matrix adjust. 
Identities = 81/167 (48%), Positives = 101/167 (60%), Gaps = 7/167 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 RI + GDITKL VD IVNAAN +L+GGGGVDG+IHRAAG LL C R G C TG Sbjct: 63 RIALYHGDITKLMVDAIVNAANETLLGGGGVDGSIHRAAGGGLLREC---RTLDG-CDTG 118 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A +T A DLP K V+H VGPV+ + E + LL Y SL L N S+AFPAIS Sbjct: 119 DAKVTDAYDLPCKKVIHAVGPVYNERHREECEMLLSSCYTRSLELAVENGCRSIAFPAIS 178 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLY 167 TG+YGYP AA A+ V +F+ + V F C+ +++ +Y Sbjct: 179 TGIYGYPSRRAANAAITAVRKFLESDQGDKISLVVFCCFLQKDMEIY 225 >UniRef50_B8HYS5 Appr-1-p processing domain protein n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HYS5_CYAP4 Length = 187 Score = 142 bits (357), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 83/177 (46%), Positives = 111/177 (62%), Gaps = 11/177 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPT 62 R V+QGDIT L V+ IVNAAN L GGGV GAI RAAG L AC +Q G CPT Sbjct: 13 RFQVIQGDITTLEVEAIVNAANNELKPGGGVCGAIFRAAGYKQLQQAC----EQIGYCPT 68 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A+IT +LPA+ +VHTVGPV+ G ++LL Y N L+ S +S+AFP IS Sbjct: 69 GEALITPGFNLPAQWIVHTVGPVY-GVTWASEELLARCYRNCLQFAGEESLSSIAFPLIS 127 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEEN----AHLYERLLTQQG 175 TG+YG+P AAEIA++ + ++ ++ +QVY VCY E+ +Y+R + Q+G Sbjct: 128 TGIYGFPLEPAAEIAIREILTGLSCYSEIKQVYLVCYTPESYAAVLQIYDR-ICQKG 183 >UniRef50_Q66HV6 Zgc:92353 n=1 Tax=Danio rerio RepID=Q66HV6_DANRE Length = 248 Score = 140 bits (354), Expect = 1e-32, Method: Compositional matrix adjust. 
Identities = 79/176 (44%), Positives = 109/176 (61%), Gaps = 12/176 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDITKL +D + NAAN +L+GGGGVDGAIHR AGP L C + C Sbjct: 66 LNMKVSLFGGDITKLEIDAVANAANKTLLGGGGVDGAIHRGAGPLLRKECATL----NGC 121 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 TG A IT A LPA+ V+HTVGP+ GE+ E++ L++ Y N L + +VAF Sbjct: 122 ETGEAKITGAYGLPARYVIHTVGPIVHDSVGER-EEEALRNCYYNCLHTATKHHLRTVAF 180 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ---VYFVCYDEENAHLYERLL 171 P ISTGVYGYP A E+A+KTV +++ ++ PE+ V F + + + LYE LL Sbjct: 181 PCISTGVYGYPPDQAVEVALKTVRDYLEQN--PEKLDRVIFCVFLKSDKQLYENLL 234 >UniRef50_Q1R0S7 Appr-1-p processing n=12 Tax=Proteobacteria RepID=Q1R0S7_CHRSD Length = 183 Score = 140 bits (354), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 79/174 (45%), Positives = 107/174 (61%), Gaps = 5/174 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCP 61 R+ VV GDIT+L VD IVNAAN SLMGGGGVDGAI+RAAGPAL AC +R+ P Sbjct: 9 RVDVVSGDITRLDVDAIVNAANHSLMGGGGVDGAIYRAAGPALKRACRALRETHWPDGLP 68 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G +T +LPA+ V+HTVGPV+ +++ LL + Y N++ L A +AFPAI Sbjct: 69 DGEVALTEGFELPARYVIHTVGPVY-AKTRDKSHLLANCYRNAVALAAETGCRRIAFPAI 127 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 STGVYGYP AA I + T+ + + H L +V + E + + + ++G Sbjct: 128 STGVYGYPFDDAAHIVIDTLHDALAIHDL--RVTLCFFSERDYQAFAEIAMRRG 179 >UniRef50_Q9HJ67 UPF0189 protein Ta1105 n=1 Tax=Thermoplasma acidophilum RepID=Y1105_THEAC Length = 196 Score = 140 bits (353), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 75/168 (44%), Positives = 100/168 (59%), Gaps = 2/168 (1%) Query: 10 GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCPTGHAVI 67 GDIT+ + IVNAAN SLMGGGGVDGAIH AAGP L +K+R+++ P G AVI Sbjct: 16 GDITESDAEAIVNAANSSLMGGGGVDGAIHSAAGPELNGELVKIRRERYPNGLPPGEAVI 75 Query: 68 TLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYG 127 T L A ++HTVGPVW GG ED +L +Y + L L +AFPA+STG YG Sbjct: 76 TRGYRLKASHIIHTVGPVWMGGRNGEDDVLYRSYRSCLDLAREFGIHDIAFPALSTGAYG 135 Query: 128 YPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 +P A IA+++V +F+ + V FV Y E+ + +L+ G Sbjct: 136 FPFDRAERIAIRSVIDFLKDESAGYTVRFVFYTEDQGKRFLFILSDLG 183 >UniRef50_A7RJ44 Predicted protein (Fragment) n=4 Tax=Eukaryota RepID=A7RJ44_NEMVE Length = 183 Score = 140 bits (353), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 75/174 (43%), Positives = 106/174 (60%), Gaps = 12/174 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L +D IVNAAN +L+GGGGVDG IHRAAG L C K+R C Sbjct: 5 LNDKVSLWTGDITALEIDAIVNAANTTLLGGGGVDGCIHRAAGDNLFKECRKLR----GC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A ITL LPAK V+HT GP+ + ++ LQD Y N L+L + ++AF Sbjct: 61 QTGEAKITLGHRLPAKYVIHTAGPMGKNRKK-----LQDCYKNCLQLAKQHGVKTLAFCC 115 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLL 171 ISTG+YGYP AA +A++TV +++ + E++ F + ++ +YERLL Sbjct: 116 ISTGIYGYPNKDAAHVALETVRQWLETDDNNDSVERIVFCTFLPKDTEIYERLL 169 >UniRef50_C1BR35 MACRO domain-containing protein 1 n=2 Tax=Caligus rogercresseyi RepID=C1BR35_9MAXI Length = 242 Score = 139 bits (351), Expect = 3e-32, Method: Compositional matrix adjust. 
Identities = 77/172 (44%), Positives = 102/172 (59%), Gaps = 10/172 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I + QGDITKL VD IVNAAN L GGGV GAIHRAAG L C + G CP G Sbjct: 80 KIGMWQGDITKLEVDAIVNAANSGLKAGGGVCGAIHRAAGSQLQKECDSI----GGCPVG 135 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 + IT LPAK V+HTVGP + E L+ Y S+ L+ A S+AFP IST Sbjct: 136 DSRITAGYKLPAKHVIHTVGPQDKNSEH-----LKSCYRKSMELLIAKGLRSIAFPCIST 190 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLLTQQ 174 G+YGYP AAE+A++T+ FI ++ + V F + +++ Y LL+++ Sbjct: 191 GIYGYPSDKAAEVALQTIRSFIQDNSESVDSVIFCVFLDKDMQYYSELLSKK 242 >UniRef50_B9S4E3 Protein LRP16, putative n=2 Tax=cellular organisms RepID=B9S4E3_RICCO Length = 269 Score = 139 bits (350), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 76/164 (46%), Positives = 103/164 (62%), Gaps = 10/164 (6%) Query: 5 IHVVQGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-- 58 + + +GDITK VD IVN AN ++GGGG DGAIHRAAGP L+DAC KV + + Sbjct: 95 LKINKGDITKWFVDGSSDAIVNPANEKMLGGGGADGAIHRAAGPELVDACYKVPEVRPGI 154 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 CPTG A IT LPA V+HTVGP++ +N +L++AY NSL + N+ +AF Sbjct: 155 RCPTGEARITPGFKLPASHVIHTVGPIY-DANRNSAAILKNAYRNSLSVAKDNNIKFIAF 213 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEE 162 PAIS GVY YP AA +++ T+ EF ++V+FV + +E Sbjct: 214 PAISCGVYLYPFEEAASVSISTIKEFADDI---KEVHFVLFSDE 254 >UniRef50_Q2LUU1 Appr-1-p histone processing protein n=5 Tax=Bacteria RepID=Q2LUU1_SYNAS Length = 214 Score = 139 bits (349), Expect = 5e-32, Method: Compositional matrix adjust. 
Identities = 79/165 (47%), Positives = 108/165 (65%), Gaps = 4/165 (2%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 ++QGDIT+ D IVNAAN L GGGGVDGAIHRA GP+++ C ++ G CPTG AV Sbjct: 45 LIQGDITQEDTDAIVNAANTGLRGGGGVDGAIHRAGGPSIMAECRRI----GGCPTGQAV 100 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 IT G + A+ V+HTVGPV+R G E +LL AY SL++ +A S++FPAIS GVY Sbjct: 101 ITTGGKMKARYVIHTVGPVYRDGSHGEAELLASAYRESLKMASARHLKSLSFPAISAGVY 160 Query: 127 GYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 GYP AA IA++TV +++ ++ E V FV +++ + L Sbjct: 161 GYPLEEAARIALQTVIDYLKKNRDIELVRFVLFNQSTYDAFSNAL 205 >UniRef50_B6KFB3 Appr-1-p processing enzyme family domain-containing protein n=3 Tax=Toxoplasma gondii RepID=B6KFB3_TOXGO Length = 817 Score = 139 bits (349), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 80/173 (46%), Positives = 111/173 (64%), Gaps = 12/173 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ + +GDIT+L VDVIVNAANPSL+GGGGVDGAIHR AGP L Q G C TG Sbjct: 48 KVVLYRGDITELDVDVIVNAANPSLLGGGGVDGAIHRKAGPQL----RVFNQTLGGCKTG 103 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 + A L K + HTVGP GEQ+ Q L+ YLN+L L+ + Y ++AFP IST Sbjct: 104 EVKASPAFQLVCKQIFHTVGPR---GEQS--QALRACYLNALELLKRSKYRTIAFPCIST 158 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFV--C-YDEENAHLYERLLTQ 173 G+YGYP+ AA++ K V++++ A E V F+ C ++ ++ YE+LL++ Sbjct: 159 GIYGYPQLNAAQVVTKCVTKWLKIPANYEAVDFIVFCVFERQDFLFYEQLLSK 211 >UniRef50_Q47EQ7 Appr-1-p processing n=1 Tax=Dechloromonas aromatica RCB RepID=Q47EQ7_DECAR Length = 186 Score = 138 bits (348), Expect = 7e-32, Method: Compositional matrix adjust. 
Identities = 76/176 (43%), Positives = 105/176 (59%), Gaps = 5/176 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--G 58 M R+ + GD+T AVD IVNAAN +L+GGGGVDGAIHR GPA+LDAC ++R+ Q Sbjct: 10 MNGRVRLYVGDLTDQAVDAIVNAANRTLLGGGGVDGAIHRRGGPAILDACRELRRSQWPD 69 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 PTG +T G LPA V+HTVGP++ E +LL Y N++ L A S+AF Sbjct: 70 GLPTGQVALTNGGKLPAPYVIHTVGPIYGQHRGKEAELLAACYRNAIELAAHLELKSLAF 129 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 P+ISTG +GYP AA I +++ + + A +++ V + NA E + Q Sbjct: 130 PSISTGAFGYPPDKAALIVSRSMHKVLDEIAAIDEIRLVFF---NASQMETFIAHQ 182 >UniRef50_B8DKL2 Appr-1-p processing domain protein n=3 Tax=Desulfovibrio RepID=B8DKL2_DESVM Length = 202 Score = 137 bits (346), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 76/155 (49%), Positives = 93/155 (60%), Gaps = 1/155 (0%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 V GD+ A D +VNAAN L GGGGVDGA+HRAAGP LL A + ++G G AV Sbjct: 19 VSTGDLAATATDAVVNAANAELRGGGGVDGALHRAAGPMLLPAGRDIVARRGPLAAGEAV 78 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 IT +LPA+ V+H VGP+WRGG E Q L + NSLRL A + VAFPAIS G Y Sbjct: 79 ITPGFNLPARHVIHAVGPIWRGGTHGEPQALAAVHANSLRLAAEHGLARVAFPAISCGSY 138 Query: 127 GYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE 161 GYP AA IA+ + R L +V FV + + Sbjct: 139 GYPPELAAPIALAEAVRGL-RAGLVREVRFVLHGQ 172 >UniRef50_B6Q324 LRP16 family protein n=3 Tax=Trichocomaceae RepID=B6Q324_PENMQ Length = 308 Score = 136 bits (343), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 78/168 (46%), Positives = 100/168 (59%), Gaps = 8/168 (4%) Query: 8 VQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVI 67 ++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAG LLD C + G C TG A I Sbjct: 45 IRHDITKLQVDCIVNAANRSLLGGGGVDGAIHRAAGHRLLDECRAL----GGCRTGDAKI 100 Query: 68 TLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 T +LPA ++HTVGP++ + + LL+ Y SL L + S+AF A+STGVY Sbjct: 101 TNGYNLPATKIIHTVGPIYDEDNHELSETLLRSCYRRSLELAVEHDQRSIAFSAVSTGVY 160 Query: 127 GYPRAAAAEIAVKTVSEFITRH---ALPEQVYFVCYDEENAHLYERLL 171 GYP AAA + V +F+ + E+V F + + YER L Sbjct: 161 GYPNEAAARAVLDEVDKFLREGDNVSKLERVIFCSFMPADVRAYERYL 208 >UniRef50_Q03IQ8 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 n=5 Tax=Streptococcus RepID=Q03IQ8_STRTD Length = 260 Score = 136 bits (343), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 78/180 (43%), Positives = 113/180 (62%), Gaps = 13/180 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 RI++ +GDIT+L +D IVNAAN +L+G VD AIH AG L AC ++ +QG Sbjct: 84 RIYLWKGDITRLEIDAIVNAANKTLLGCMKPLHNCVDNAIHTYAGVQLRQACFELILEQG 143 Query: 59 -DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ----NEDQLLQDAYLNSLRLVAANSY 113 + P G A IT A +LP+ V+HTVGP + G Q +ED L++ +YL+ L L N Sbjct: 144 YEEPVGMAKITPAYNLPSAFVIHTVGP--KIGNQVTPIDEDLLIK-SYLSVLALAEKNKI 200 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S+A P ISTG + +P+ AAEIA+KTV FI + ++V F +D+EN ++Y++LL + Sbjct: 201 ESIAIPCISTGDFNFPKQKAAEIAIKTVKSFIDHSEIVKKVIFNVFDDENLNIYQKLLAE 260 >UniRef50_Q1HPZ5 LRP16 protein n=1 Tax=Bombyx mori RepID=Q1HPZ5_BOMMO Length = 275 Score = 136 bits (343), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 71/168 (42%), Positives = 97/168 (57%), Gaps = 9/168 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 R+ + +GDITKL +D +VNAAN L GGGVDGAIHRAAGP L C + G CPTG Sbjct: 110 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTG 165 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A +T +LPAK ++HTVGP E+ L+ Y L S+AFP IST Sbjct: 166 DAKVTGGYNLPAKYIIHTVGPQDGSAEK-----LESCYEKCLSFQQEYQIKSIAFPCIST 220 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 G+YG+P AA IA++T +F+ + ++ F + + +YE L+ Sbjct: 221 GIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLM 268 >UniRef50_UPI000186F16D conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186F16D Length = 367 Score = 136 bits (342), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 73/145 (50%), Positives = 93/145 (64%), Gaps = 9/145 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M RI + +GDIT L VD IVNAAN SL+GGGGVDGAIHR AG LL+ C + C Sbjct: 56 MNDRISLWKGDITTLGVDAIVNAANSSLLGGGGVDGAIHRKAGKFLLEEC----KTLNGC 111 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A IT +LP+K V+HTVGP GE+ + LL+ Y + L+ N+ S+AFP Sbjct: 112 PTGSAKITGGYNLPSKYVIHTVGPQ---GEKPD--LLESCYKSCFHLMLDNNLESIAFPC 166 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI 145 ISTG+YGYP+ AA +A+ F+ Sbjct: 167 ISTGIYGYPQGPAAVVALTCARNFL 191 >UniRef50_A8M6L5 Appr-1-p processing domain protein n=2 Tax=Micromonosporaceae RepID=A8M6L5_SALAI Length = 170 Score = 135 bits (340), Expect = 6e-31, Method: Compositional matrix adjust. 
Identities = 80/161 (49%), Positives = 99/161 (61%), Gaps = 9/161 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I VV GDIT+ VD IV AAN SL+GGGGVDGA+HRAAGP L A + G C G Sbjct: 4 IEVVLGDITQQNVDAIVTAANESLLGGGGVDGAVHRAAGPRLAQAGGAI----GPCAPGD 59 Query: 65 AVITLAGDL--PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A+ T A DL P + ++HTVGPVWRGG E ++L Y SLR+ +VAFP I+ Sbjct: 60 AMPTPAFDLDPPVRHIIHTVGPVWRGGGHGEARVLASCYRRSLRIADDLDALTVAFPTIA 119 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEEN 163 TGVYG+P AA IAV T+ T +QV V +DE++ Sbjct: 120 TGVYGFPADQAARIAVATIRSTPTNV---QQVRLVAFDEDS 157 >UniRef50_B9XAD9 Appr-1-p processing domain protein n=1 Tax=bacterium Ellin514 RepID=B9XAD9_9BACT Length = 184 Score = 134 bits (338), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 73/177 (41%), Positives = 103/177 (58%), Gaps = 10/177 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 KT ++ GDI D +V AA+ L G G DG IH GP + + C ++ G CP Sbjct: 7 KTLFELITGDIADQETDAVVTAAHWKLNKGSGTDGVIHTRGGPQIYEECRRI----GGCP 62 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT G+L AK V+H VGPVWRGG+++E +LL AY SL + + S++FP+I Sbjct: 63 IGDAVITTGGNLKAKHVIHAVGPVWRGGDEHEPELLASAYRRSLEVATEHKLKSISFPSI 122 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAH---LYERLLTQ 173 STG + YP AA IA+KT+ +++ + H L E V V Y E+ +YE+ L + Sbjct: 123 STGAFVYPIKLAAPIALKTICDYLQKEQHTL-EFVRLVLYTREDDKAFLVYEKALQE 178 >UniRef50_B7PF53 MACRO domain-containing protein, putative n=2 Tax=cellular organisms RepID=B7PF53_IXOSC Length = 304 Score = 134 bits (337), Expect = 1e-30, Method: Compositional matrix adjust. 
Identities = 79/176 (44%), Positives = 106/176 (60%), Gaps = 12/176 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L +D IVNAAN L+GGGGVDGAIH AAGP L + C + C Sbjct: 134 LNNKVSIFVGDITALEIDAIVNAANNRLLGGGGVDGAIHSAAGPKLKEECATL----NGC 189 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A IT LPAK V+HTVGPV GE NE + L Y+ SL A+ ++AFP Sbjct: 190 PTGEAKITGGYKLPAKYVIHTVGPV---GE-NEAK-LHGCYVTSLETAKAHKIRTLAFPC 244 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI--TRHALP-EQVYFVCYDEENAHLYERLLTQ 173 ISTG+YGYP AA +A+ E++ +AL +++ F + + LYE+LL + Sbjct: 245 ISTGIYGYPNEKAAHVALSAAREWLDSEENALKVDRIIFCLFLPIDVRLYEKLLPE 300 >UniRef50_Q4P1I0 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P1I0_USTMA Length = 220 Score = 134 bits (337), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 75/170 (44%), Positives = 103/170 (60%), Gaps = 8/170 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + GDIT L++D IVNAAN SL+GGGGVDGAIHRAAG L+ C K+ C TG Sbjct: 38 LSIFTGDITTLSIDAIVNAANNSLLGGGGVDGAIHRAAGRELVVECGKL----NGCETGS 93 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A TL LP+K V+HTVGPV+ E ++LL+ AY +SL + S+AFP+IST Sbjct: 94 AKTTLGYALPSKHVIHTVGPVYNSSRHEECERLLRSAYRSSLEELRKIGAKSIAFPSIST 153 Query: 124 GVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERL 170 GVYGYP AA A+ + ++ H E++ C+ +++ + Y L Sbjct: 154 GVYGYPFDTAATAALDEIGSWLESNENHKHIERIVLCCFSQKDYNKYLEL 203 >UniRef50_Q0B030 Phosphatase n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0B030_SYNWW Length = 176 Score = 134 bits (337), Expect = 1e-30, Method: Compositional matrix adjust. 
Identities = 82/170 (48%), Positives = 102/170 (60%), Gaps = 10/170 (5%) Query: 5 IHVVQGDITKLA-VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I VVQGDIT+ + VIVNAAN SL GGGGVDGAIHRAAGP L ++ P G Sbjct: 8 IQVVQGDITRQEDMAVIVNAANSSLRGGGGVDGAIHRAAGPELK------KESSALAPIG 61 Query: 64 --HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 AVIT A LP + V+H VGPV+ G + ED+LL Y N+LRL S+AFPAI Sbjct: 62 PGQAVITGAYRLPNRYVIHCVGPVY-GVHKPEDELLASCYRNALRLAEKQQLDSIAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 STGVYGYP AA++ KT+ E I +++ V +D L+ + L Sbjct: 121 STGVYGYPMREAAQVMFKTIIEVIPELKHIKKIRIVLFDHPAYELHRQAL 170 >UniRef50_B9YC00 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YC00_9FIRM Length = 182 Score = 134 bits (336), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 84/181 (46%), Positives = 108/181 (59%), Gaps = 12/181 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + GDIT + ++IVNAAN SL+GGGGVDG IHR AGP LL C R G C TG Sbjct: 2 ITFIHGDITSVPAEIIVNAANRSLLGGGGVDGVIHRKAGPQLLAEC---RTLHG-CETGQ 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAA------NSYTSVAF 118 A +T A DL + ++HTVGPVW GG E LL Y SLRL S ++ F Sbjct: 58 AKVTKAYDLSCRWIIHTVGPVWSGGRHQEVDLLASCYQQSLRLARQLQKEHRLSSLTIVF 117 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ--VYFVCYDEENAHLYERLLTQQGD 176 P ISTG+Y +P+A A IAV TV + ++ ++ V F CY+ E+A LY+R L +GD Sbjct: 118 PCISTGIYHFPKALACSIAVDTVRDTLSELQAEKEIDVIFCCYESEDAQLYKRQLDNKGD 177 Query: 177 E 177 + Sbjct: 178 Q 178 >UniRef50_Q97AU0 UPF0189 protein TV0719 n=2 Tax=cellular organisms RepID=Y719_THEVO Length = 186 Score = 133 bits (335), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 68/145 (46%), Positives = 95/145 (65%), Gaps = 3/145 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCPT 62 I +++GDIT + + IVNAANPSLMGGGGVDGAIH G + C ++R+ + P Sbjct: 11 IEIIEGDITDVNCEAIVNAANPSLMGGGGVDGAIHLKGGKTIDLECAELRRTKWPKGLPP 70 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT G L AK V+HTVGP++RG E++ + L +Y SL + + +AFPAIS Sbjct: 71 GEADITSGGKLKAKYVIHTVGPIYRGQEEDAETLYS-SYYRSLEIAKIHGIKCIAFPAIS 129 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITR 147 TG+YGYP A+ IA+K V++F++ Sbjct: 130 TGIYGYPFEEASVIALKAVTDFLSN 154 >UniRef50_A6GJ81 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GJ81_9DELT Length = 173 Score = 133 bits (334), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 78/176 (44%), Positives = 106/176 (60%), Gaps = 4/176 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-D 59 M I + +GDIT+++ D IVNAANP ++GGGGVDGAIHRAAGP LL AC +V + G Sbjct: 1 MAPSITLERGDITRVSCDAIVNAANPKMLGGGGVDGAIHRAAGPELLAACRRVPKVNGIR 60 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 CP G A IT A L A+ V+H VGP++ E + +L AY ++L L AA+ T +A P Sbjct: 61 CPFGEARITPAFGLDARWVIHAVGPIYARSE-DPKGVLARAYASALELAAAHDVTELACP 119 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 A+STG YG+P AA IA++TV+ +V FV + E + + G Sbjct: 120 ALSTGAYGFPLDPAARIALETVAS--RDWGCVARVRFVLFTAEVMAAFAKFRDLSG 173 >UniRef50_B6SKT6 Protein LRP16 n=12 Tax=cellular organisms RepID=B6SKT6_MAIZE Length = 239 Score = 132 bits (333), Expect = 4e-30, Method: Compositional matrix adjust. 
Identities = 79/175 (45%), Positives = 113/175 (64%), Gaps = 14/175 (8%) Query: 9 QGDITKLAVDV----IVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG--DCPT 62 +GDIT +VD IVNAAN ++GGGGVDGAIH+AAGP L+ AC KV + + CPT Sbjct: 66 KGDITLWSVDCATDAIVNAANERMLGGGGVDGAIHQAAGPELVQACRKVPEVKPGVRCPT 125 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT A +LPA V+HTVGP++ +++ + L+ AY NSL+L N +AFPAIS Sbjct: 126 GEARITPAFELPASRVIHTVGPIY-DLDKHPEVSLKKAYENSLKLAKDNGIQYIAFPAIS 184 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY----ERLLTQ 173 GVY YP A++IAV T +F ++V+FV + ++ +++ ++LL+Q Sbjct: 185 CGVYRYPPKEASKIAVSTAQKFSEDI---KEVHFVLFSDDLYNIWRETAQQLLSQ 236 >UniRef50_A6BCW6 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A6BCW6_9FIRM Length = 267 Score = 132 bits (333), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 74/176 (42%), Positives = 112/176 (63%), Gaps = 10/176 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 +I + +GDIT+L+VD IVNAAN ++G G +D AIH AAG L + C ++ + QG Sbjct: 93 KISLWRGDITRLSVDAIVNAANSQMLGCFVPCHGCIDNAIHSAAGIQLRNECAQIMEAQG 152 Query: 59 -DCPTGHAVITLAGDLPAKAVVHTVGPV--WRGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 + PTG A IT +LPAK V+HTVGP+ + E+ E++L + YLN ++L S Sbjct: 153 HEEPTGKAKITKGYNLPAKHVIHTVGPIVGMQVTEKQEEEL-KSCYLNCMKLAEKEGLKS 211 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +AF ISTG + +P AAEIAVKTV ++++ L E+V F + EE+ ++Y+++ Sbjct: 212 IAFCCISTGEFHFPNKLAAEIAVKTVDKYLSSSKL-ERVIFNVFKEEDYNIYKKIF 266 >UniRef50_C2DZH9 Appr-1-p processing protein n=4 Tax=Lactobacillus jensenii RepID=C2DZH9_9LACO Length = 218 Score = 132 bits (331), Expect = 6e-30, Method: Compositional matrix adjust. 
Identities = 79/172 (45%), Positives = 101/172 (58%), Gaps = 6/172 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + I VV+ + D IVNAAN +L+GGGGVDGAIH+AAGP LL+AC K+ C Sbjct: 48 LSKNIFVVKASVVNFPADAIVNAANKTLLGGGGVDGAIHQAAGPNLLEACKKL----NGC 103 Query: 61 PTGHAVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT + DL K ++HTVGPV++ QN Q LQ Y SL L SVAF Sbjct: 104 DTGEAKITPSFDLKTCKYIIHTVGPVFKLS-QNPQQQLQSCYKKSLDLALEYKCNSVAFS 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 ISTGVY YP AA +A + V+E++ RH +V+ CY E Y +L+ Sbjct: 163 GISTGVYEYPVKQAASVASEAVAEWLKRHNFAIKVFLCCYKESEFEAYAQLV 214 >UniRef50_C4V1Q4 Appr-1-p processing domain protein n=3 Tax=Bacteria RepID=C4V1Q4_9FIRM Length = 289 Score = 131 bits (330), Expect = 8e-30, Method: Compositional matrix adjust. Identities = 79/177 (44%), Positives = 108/177 (61%), Gaps = 9/177 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 RI + QGDIT+L D IVNAAN +L+G +D AIH AAG L AC + ++QG Sbjct: 114 RIALWQGDITRLNADAIVNAANSALLGCFIPCHRCIDNAIHSAAGLQLRAACAALMEEQG 173 Query: 59 DCP--TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSYTS 115 P TG A IT +L ++ V+HTVGP+ G + + L Y + L L A + S Sbjct: 174 H-PEETGTAQITEGYNLSSRHVIHTVGPIVSGALTDRHRAQLASCYRSCLSLAAEHGLRS 232 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 +AF ISTG + +PRAAAAEIAV+ V +F+TR E+V F + +E+ H+YERLL+ Sbjct: 233 IAFCCISTGEFHFPRAAAAEIAVREVRDFLTRDTSIERVVFNVFKDEDRHIYERLLS 289 >UniRef50_C8VIG2 LRP16 family protein (AFU_orthologue; AFUA_3G13850) n=7 Tax=Trichocomaceae RepID=C8VIG2_EMENI Length = 374 Score = 131 bits (330), Expect = 9e-30, Method: Compositional matrix adjust. 
Identities = 78/175 (44%), Positives = 106/175 (60%), Gaps = 12/175 (6%) Query: 5 IHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + +V+ DITKL VD IVNAA SL+GGGGVD AIH+AAGP LL C R G C TG Sbjct: 40 VAMVRHDITKLQGVDCIVNAAKRSLLGGGGVDYAIHKAAGPDLLKEC---RTLNG-CDTG 95 Query: 64 HAVITLAGDLPAKAVVHTVGPVW----RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 A IT A +LP K ++HTVGP++ R G+ ++LL+ Y L + N S+AF Sbjct: 96 DAKITNAYNLPNKRIIHTVGPIYSDAMRRGKDEPERLLRSCYRRCLEVAVENEMKSIAFN 155 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERLL 171 AISTG+YGYP AA+ A+ +F+ L E+V F ++ ++ YE+L+ Sbjct: 156 AISTGIYGYPSRDAAKAALDETRKFLETDKNTGLLERVIFCNFELKDVEAYEQLI 210 >UniRef50_C2KRZ5 Appr-1-p processing domain protein n=2 Tax=Mobiluncus mulieris RepID=C2KRZ5_9ACTO Length = 275 Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 76/143 (53%), Positives = 92/143 (64%), Gaps = 3/143 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCP 61 ++H + GDIT++ VD IVNAAN +L+GGGGVDGAIHRAAG LL AC +R + P Sbjct: 2 QLHAIGGDITRVHVDAIVNAANSTLLGGGGVDGAIHRAAGTELLAACRVIRATRYPDGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T LPAK V+HTVGP G Q + LL+ A++NSLR A SVAFPAI Sbjct: 62 VGQAVATKGFKLPAKWVIHTVGPNRHAG-QTDPGLLRAAFVNSLREAARVGAHSVAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEF 144 S GVYG+ A A I V V E+ Sbjct: 121 SGGVYGWDMAEVARIGVSAVHEW 143 >UniRef50_B8LP86 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=B8LP86_PICSI Length = 231 Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust. 
Identities = 75/158 (47%), Positives = 100/158 (63%), Gaps = 10/158 (6%) Query: 9 QGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKV-RQQQG-DCPT 62 +GDITK VD IVNAAN L+GGGGVDGAIHRAAGP LL AC + + +G CP Sbjct: 60 RGDITKWTVDGHTDAIVNAANERLLGGGGVDGAIHRAAGPDLLKACRQFPKVSRGIRCPV 119 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT +LP ++HTVGPV+ E++ + L DAY +SL + N +AFPAIS Sbjct: 120 GSARITRGFNLPVSRIIHTVGPVY-DMEEDPESKLADAYRSSLNITRENEVKYIAFPAIS 178 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYD 160 G+YGYP AA +++ TV + I ++V+FV ++ Sbjct: 179 CGIYGYPYEEAAAVSLTTVRDSIKDL---KEVHFVLFE 213 >UniRef50_C4V152 Appr-1-p processing protein n=2 Tax=Clostridiales RepID=C4V152_9FIRM Length = 346 Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 74/165 (44%), Positives = 102/165 (61%), Gaps = 6/165 (3%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 +V+ DITK+ VD IVN+ANP + GGGVD AIH+AAG LL A R++ G+ G A Sbjct: 5 IVRNDITKMQVDAIVNSANPRAIVGGGVDRAIHQAAGAELLTA----RRKIGNIAAGTAA 60 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 +T A L A+ V+HTVGPVW+ G E +LL AY NSLRL A +S+AFP +S GV+ Sbjct: 61 VTPAYRLHARYVIHTVGPVWQDGSHGERELLSRAYQNSLRLAAERDCSSIAFPLLSAGVF 120 Query: 127 GYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 G P A AV+ + +F+ H + VY V +D ++ + + L Sbjct: 121 GCPSEIAIAAAVQAIRDFLQEHDM--DVYLVVFDRKSFKISDTLF 163 >UniRef50_A0LGZ1 Appr-1-p processing domain protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LGZ1_SYNFM Length = 175 Score = 130 bits (328), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 72/173 (41%), Positives = 101/173 (58%), Gaps = 7/173 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +VQGD+T+L VD IVNAAN L GGGV GAI GP + + C + G G Sbjct: 9 KISLVQGDLTELRVDAIVNAANRHLALGGGVAGAIRMKGGPTIQEECDAI----GGTVVG 64 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT G+L A V+H VGP R GE +ED+ L++A LNSL+ S S+AFPA+ST Sbjct: 65 QAVITGGGNLKAAHVIHAVGP--RYGEGDEDEKLRNATLNSLKRATEKSLASIAFPAVST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLLTQQG 175 G++G+P+ A+I + F+ R V F + +E+ ++E+ L G Sbjct: 123 GIFGFPKDRCAKIMLDAAVAFLDRETTSLRDVIFCLWSKEDLEIFEKTLQSMG 175 >UniRef50_C4Q6S1 Expressed protein n=1 Tax=Schistosoma mansoni RepID=C4Q6S1_SCHMA Length = 224 Score = 130 bits (328), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 78/206 (37%), Positives = 109/206 (52%), Gaps = 39/206 (18%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI + +GDIT L +D I NAAN L GGGGVDGAIHRAAG LL+AC Q+ C Sbjct: 25 LGSRISLWRGDITHLQIDAIANAANSQLRGGGGVDGAIHRAAGSQLLEAC----QKLSGC 80 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T +LP+K V+H VGPV R D L+ Y +L L + ++ S+AFP Sbjct: 81 PTGDAKLTPGFNLPSKYVIHCVGPVGR-----NDVALESTYRKALELCSEHNIQSIAFPC 135 Query: 121 ISTGVY------------------------------GYPRAAAAEIAVKTVSEFITRHAL 150 ISTGVY +P AAA++A+ TV ++ H Sbjct: 136 ISTGVYEVQKTRENKKRIDLIKGLDDQIFKPDFPDDCFPNEAAAKVALHTVLSYLKSHQE 195 Query: 151 PEQVYFVCYDEENAHLYERLLTQQGD 176 ++V F + + + +YE L+ + D Sbjct: 196 IQRVIFCIFMDVDYKIYENLIPEMLD 221 >UniRef50_B9MLL8 Appr-1-p processing domain protein n=6 Tax=Clostridiales RepID=B9MLL8_ANATD Length = 181 Score = 130 bits (327), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 73/167 (43%), Positives = 103/167 (61%), Gaps = 3/167 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I + +GDITK VDVIVNAAN L GGGV AI +A G + ++ ++ G PTG Sbjct: 9 KIAIKKGDITKENVDVIVNAANSHLRHGGGVALAIVKAGGIEIQKESDEIIKKIGMLPTG 68 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 HAVIT A LP K V+HTVGP++ GE NED+ L A NSL L + S+AFPA+S+ Sbjct: 69 HAVITNAYRLPCKFVIHTVGPIY--GEGNEDEKLSMAIYNSLYLAHLYNLKSIAFPAVSS 126 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYER 169 G++G+P+ A+I + T +F++ E+V F +D+E +E Sbjct: 127 GIFGFPKDRCAKILIDTAVDFLSSIKTSIEKVVFCLFDDETYGYFEE 173 >UniRef50_B2ACK5 Predicted CDS Pa_3_1270 n=5 Tax=Eukaryota RepID=B2ACK5_PODAN Length = 253 Score = 130 bits (327), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 74/174 (42%), Positives = 103/174 (59%), Gaps = 7/174 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ V + DIT LAVD IVNAAN SL+GGGGVDGAIHRAAG L + C K+ C Sbjct: 47 LNDRVAVYRADITSLAVDAIVNAANRSLLGGGGVDGAIHRAAGRGLYEECKKLNG----C 102 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT A DLP V+H VGPV+ + + ++LL Y SL L + ++AF Sbjct: 103 KTGSAKITDAYDLPCNRVIHAVGPVYDPADHDTSEKLLVGCYTTSLELAVEHECRTIAFS 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 A+STG+YGYP AA A+ + +F+T ++V V +++++ Y + Sbjct: 163 ALSTGIYGYPSREAAPAALSAIRKFLTGKDGDKIDKVILVTFEKKDVDAYTEFV 216 >UniRef50_Q5KCD7 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5KCD7_CRYNE Length = 252 Score = 128 bits (322), Expect = 8e-29, Method: Compositional matrix adjust. 
Identities = 71/174 (40%), Positives = 102/174 (58%), Gaps = 5/174 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ + +GDIT+L D+IVNAAN SL+GGGGVDGAIHRAAG LL+ C K+ G Sbjct: 70 LNDRVSIWRGDITELEADMIVNAANSSLLGGGGVDGAIHRAAGKHLLEECKKL----GGA 125 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG T +L +K + HTVGPV+ Q QLL+ Y +SL + + F Sbjct: 126 QTGETKFTAGYNLSSKKIAHTVGPVYHSHPPQRAAQLLKSCYQSSLEGCRDSGGGVIGFS 185 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 +ISTGVYGYP A IA++T +F+ + +V +V + + + +Y ++ Q Sbjct: 186 SISTGVYGYPIKDATHIALETTRQFLEQDDSITRVIYVVFSKRDEDVYREIIPQ 239 >UniRef50_C4DDL7 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DDL7_9ACTO Length = 224 Score = 128 bits (322), Expect = 8e-29, Method: Compositional matrix adjust. Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 3/155 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCP 61 RI +V+GDIT VD ++NAAN SLMGGGGVDGAIHR GP +LD C K+R P Sbjct: 2 RIELVKGDITTQDVDALINAANSSLMGGGGVDGAIHRKGGPTILDECRKLRDSHYPKGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G A+ T AG+LPA+ ++HTVGPV+ + + L+ Y NSL + T++A P I Sbjct: 62 EGQAIATTAGNLPAQWIIHTVGPVY-SRHDDRTETLRACYRNSLTIADTLGATTLAVPLI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYF 156 S+G+YG+P+ A AV + T L + F Sbjct: 121 SSGIYGWPKDDAIRQAVDVLQTTPTSVTLARIMLF 155 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 127 bits (320), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 69/159 (43%), Positives = 94/159 (59%), Gaps = 4/159 (2%) Query: 10 GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 GDITK + IVN+ + +L G + AIH+AAGP LL AC + QG C G A +T Sbjct: 425 GDITKEKAEAIVNSTDRNLSNSGALSRAIHQAAGPELLQACQDL---QG-CTVGGAKLTP 480 Query: 70 AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYP 129 +L A V+HTV P W+GG Q E++LL Y N L+L + S S+AFPAI+ G G+P Sbjct: 481 GFNLRANWVIHTVAPKWKGGNQGEEELLVSCYQNCLQLAVSQSIRSLAFPAIACGAMGFP 540 Query: 130 RAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYE 168 AA IA++TVS F+ + V F+C D+E Y+ Sbjct: 541 PEIAARIALETVSNFLLSNMAIGSVAFICADKETLQYYQ 579 >UniRef50_C0PSL1 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PSL1_PICSI Length = 204 Score = 127 bits (318), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 73/139 (52%), Positives = 89/139 (64%), Gaps = 7/139 (5%) Query: 9 QGDITKLAV----DVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CPT 62 QGDITK + D IVNAAN ++GGGGVDGAIH AAGP LL ACL V + Q CP Sbjct: 26 QGDITKWFINGENDAIVNAANELMLGGGGVDGAIHSAAGPELLRACLNVPEIQPGVRCPA 85 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT A +LP ++HTVGP++ E + +L AY +SL + N VAFPAIS Sbjct: 86 GSARITEAFNLPVSHIIHTVGPIYD-EEGDSASVLSSAYKSSLEVAEENHIKYVAFPAIS 144 Query: 123 TGVYGYPRAAAAEIAVKTV 141 GVYGYP AAE+A+ T+ Sbjct: 145 CGVYGYPLEKAAEVALLTL 163 >UniRef50_D2S4L6 Appr-1-p processing domain protein n=4 Tax=Actinomycetales RepID=D2S4L6_9ACTO Length = 170 Score = 126 bits (317), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 82/170 (48%), Positives = 102/170 (60%), Gaps = 5/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDAC--LKVRQQQGDCPT 62 + V+GDIT+ VDV+VNAANP L+GGGGVDGAIH A GP +L C LK G P Sbjct: 3 LRAVRGDITEADVDVVVNAANPGLLGGGGVDGAIHAAGGPEILAECRALKAGLPGGRLPR 62 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV T AG LPA+ VVHT GP+W +Q+ +L+ SLR+ SVAFPAIS Sbjct: 63 GRAVATTAGRLPARWVVHTAGPIW-SADQDRSAVLRSCCTESLRVADGLGARSVAFPAIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVYG+P A AA AV V +H ++V FV +D+ +E LT Sbjct: 122 AGVYGWPLADAAVQAVAGVRAVEVQHV--QEVRFVLFDDRALAAFEAALT 169 >UniRef50_D1VVA5 Putative uncharacterized protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVA5_9FIRM Length = 163 Score = 126 bits (317), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 65/140 (46%), Positives = 90/140 (64%), Gaps = 3/140 (2%) Query: 11 DITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLA 70 ++ K+ VD IVNAAN L+ GGGV GAI + A L+ K + G TG AVIT A Sbjct: 9 NLVKMDVDAIVNAANKELLPGGGVCGAIFQVAKSKSLEMDCK---KLGPIKTGQAVITSA 65 Query: 71 GDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPR 130 +LP+K ++H VGP++R G E++LL++AYLNSL+L +S S+AFP IS G+Y YP Sbjct: 66 YNLPSKYIIHAVGPIYRDGLSGEEELLRNAYLNSLKLAKKHSIKSIAFPLISAGIYAYPL 125 Query: 131 AAAAEIAVKTVSEFITRHAL 150 A +IAV T+ EF+ + Sbjct: 126 KEACKIAVDTIREFLKNEDM 145 >UniRef50_UPI000050FFC7 predicted phosphatase, C-terminal domain of histone macro H2A1 like protein n=1 Tax=Brevibacterium linens BL2 RepID=UPI000050FFC7 Length = 177 Score = 126 bits (317), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 76/168 (45%), Positives = 102/168 (60%), Gaps = 5/168 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCP 61 +I V++GDIT+ +VD IVNAAN SL+GGGGVDGAIH+AAGP LL+AC ++RQ P Sbjct: 2 KITVLEGDITEASVDAIVNAANSSLLGGGGVDGAIHKAAGPELLEACREIRQTSHPRGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T AG L A V+HTVGP GE + ++L+ + SL + A TSVAFPAI Sbjct: 62 AGQAVATSAGALKATWVIHTVGPNRTQGEADP-EVLESCFEASLNVAAELGATSVAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLY 167 GVYG+ AE A + + R + +V FV + + ++ Sbjct: 121 GGGVYGWSARDVAEAAHSVIVDGRERGHWEQVAEVVFVLFSDSMTSVF 168 >UniRef50_C4FEN5 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FEN5_9BIFI Length = 173 Score = 125 bits (313), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 66/140 (47%), Positives = 91/140 (65%), Gaps = 5/140 (3%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALL-DACLKVRQQQGDCPTGHA 65 VV DIT + VD I NAAN L+ G GV GAI RAAG + + +AC ++ + TG A Sbjct: 23 VVHHDITDMQVDAIANAANTDLLMGSGVCGAIFRAAGASRMQEACDRLSPIR----TGEA 78 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 VIT DLPA+ V+HT GP+WRGG+ NE+ LL+ Y + L + + + TS+AFP IS G+ Sbjct: 79 VITPGFDLPARYVIHTAGPLWRGGDHNEEALLRSCYRSCLAIASVHGCTSMAFPLISAGI 138 Query: 126 YGYPRAAAAEIAVKTVSEFI 145 YGYPRA A ++A + ++ Sbjct: 139 YGYPRAEALDVAEDEIRYWL 158 >UniRef50_A8JCH3 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8JCH3_CHLRE Length = 160 Score = 124 bits (312), Expect = 1e-27, Method: Compositional matrix adjust. 
Identities = 70/140 (50%), Positives = 91/140 (65%), Gaps = 3/140 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG--DC 60 T++ + QGDIT VD IVNAAN ++GGGGVDGAIHRAAGP L+ AC +V + C Sbjct: 12 TKLVIKQGDITVEDVDAIVNAANERMLGGGGVDGAIHRAAGPQLVRACAEVPEVYPGVRC 71 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A IT L A+ V+HTVGP++ ++ LL AY +S+ L A S++FP Sbjct: 72 PTGEARITPGFHLKARHVIHTVGPIYH-NDRVSAPLLASAYRSSVELAAQQGLASLSFPG 130 Query: 121 ISTGVYGYPRAAAAEIAVKT 140 ISTGV+GYP AA++ V T Sbjct: 131 ISTGVFGYPWDKAAQVRVHT 150 >UniRef50_A8FSV2 Putative uncharacterized protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FSV2_SHESH Length = 293 Score = 123 bits (309), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 78/174 (44%), Positives = 106/174 (60%), Gaps = 13/174 (7%) Query: 10 GDITKLAVDVIVNAANPSLMGGGG-----VDGAIHRAAGPALLDACLKVRQQQGDC-PTG 63 GDIT+L VD I+NAAN L+G +D IH AAG L D C + +QQG PTG Sbjct: 117 GDITQLKVDAIINAANVYLLGCRQPNHRCIDNVIHSAAGSRLRDDCATIIEQQGGLEPTG 176 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGG----EQNEDQLLQDAYLNSLRLVAA-NSYTSVAF 118 A IT LPAK V+HTVGP G E++E QL + AY + L L + N ++AF Sbjct: 177 SAKITRGYALPAKYVIHTVGPCLHSGYLPDEEDEKQL-KSAYQSCLTLASEINDLKTLAF 235 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLL 171 AISTGV+ YP+ AA +A++TVS++++ H E+V F Y + +A +YERL+ Sbjct: 236 CAISTGVFSYPKIDAASVALETVSDWLSEHPQHFEKVVFNLYTQADAAIYERLI 289 >UniRef50_A9SRF5 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9SRF5_PHYPA Length = 207 Score = 123 bits (309), Expect = 3e-27, Method: Compositional matrix adjust. 
Identities = 69/140 (49%), Positives = 89/140 (63%), Gaps = 6/140 (4%) Query: 9 QGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-DCPTG 63 +GDITK +D IVNAAN ++GGGGVDGAIH AAG LL+A K+ +G CP G Sbjct: 34 RGDITKWHIDGKTDAIVNAANERMVGGGGVDGAIHAAAGKQLLEATKKIPISEGVRCPVG 93 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T LP ++HTVGP++ E N LL A+ S+RL N +AFPAIS Sbjct: 94 SAVLTPGFKLPVSKIIHTVGPIYY-IEGNPASLLAKAHKESVRLATENGLKYIAFPAISC 152 Query: 124 GVYGYPRAAAAEIAVKTVSE 143 GVYGYP AAEI+++++ E Sbjct: 153 GVYGYPIEEAAEISIQSLRE 172 >UniRef50_A1D5K4 Appr-1-p processing enzyme family protein n=1 Tax=Neosartorya fischeri NRRL 181 RepID=A1D5K4_NEOFI Length = 257 Score = 123 bits (308), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 68/155 (43%), Positives = 89/155 (57%), Gaps = 2/155 (1%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T + ++ DI +L VD IVNAA SL GGGGVD A+H AAGP L AC+K + Q C Sbjct: 88 LNTLVSFIEHDIARLQVDCIVNAAKESLQGGGGVDRAMHLAAGPKLNQACIK-KLQDRQC 146 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G +T L K+V+HTVGP R +Q + Q+L+ Y NSL + S+ FP Sbjct: 147 SPGRVFMTPGFHLRCKSVIHTVGPDCRQKQQIDYAQVLRQCYRNSLNKAVSKGLRSIVFP 206 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQV 154 AIS GVY P A +EIA+ TV F+ H P + Sbjct: 207 AISVGVYACPAEATSEIALNTVRGFLDEHGRPSSL 241 >UniRef50_A7BY23 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BY23_9GAMM Length = 708 Score = 123 bits (308), Expect = 3e-27, Method: Composition-based stats. 
Identities = 66/168 (39%), Positives = 93/168 (55%), Gaps = 8/168 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +IH++QG+IT+ VD IVN + SL G G +D AI A G L +AC +Q G C Sbjct: 532 KIHIIQGNITQQKVDAIVNTTDRSLSGSGAIDYAIQNAGGIELKEAC----RQLGTCSVA 587 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT +LPA+ V+HTVGP W GG Q E + L Y N L L + +AFP I Sbjct: 588 EAKITEGYNLPAQFVIHTVGPNWEGGNQKEAEKLAQCYRNCLALAEQQGFKIIAFPTIGV 647 Query: 124 GVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYERL 170 G G+ AA++A+ +S F+ +++ E+V VC+ N +YE Sbjct: 648 GGLGFSHELAAKVAIYEISSFLQQKNSSLEKVILVCF---NQRVYEHF 692 >UniRef50_B8I4Z8 Appr-1-p processing domain protein n=7 Tax=Bacteria RepID=B8I4Z8_CLOCE Length = 341 Score = 122 bits (306), Expect = 5e-27, Method: Compositional matrix adjust. Identities = 76/167 (45%), Positives = 96/167 (57%), Gaps = 8/167 (4%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 +V+ DITKL VD IVNAAN L GGGV GAI +AAG A L A V + TG V Sbjct: 5 IVRQDITKLKVDAIVNAANTDLRMGGGVCGAIFKAAGAAQLQA---VCDKLAPIKTGEVV 61 Query: 67 ITLAGDLPAKAVVHTVGPVWR--GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 IT +L AK V+H GPV+R EQ E Q L+ AY NSL+ N S+AFP IS+G Sbjct: 62 ITPGFNLSAKFVIHAAGPVYRHWNREQGE-QYLRAAYTNSLKCAVENKCESIAFPLISSG 120 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +YGYP+ A +A + FIT H + V V +D+ + +LL Sbjct: 121 IYGYPKDEALRVATSEIHNFITDHDI--DVTLVVFDKSAFTVSRKLL 165 >UniRef50_Q93RG0 UPF0189 protein in tap1-dppD intergenic region n=14 Tax=Bacteria RepID=Y189_TREMD Length = 261 Score = 122 bits (305), Expect = 8e-27, Method: Compositional matrix adjust. 
Identities = 72/174 (41%), Positives = 101/174 (58%), Gaps = 7/174 (4%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG-D 59 +V +GDIT L VD IVNAAN + G +D IH AG L C + Q+QG + Sbjct: 88 YVWRGDITTLKVDAIVNAANSGMTGCWQPCHACIDNCIHTFAGVQLRTVCAGIMQEQGHE 147 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAF 118 PTG A IT A +LP K V+HTVGP+ G + D LL ++Y + L L A N S+AF Sbjct: 148 EPTGTAKITPAFNLPCKYVLHTVGPIISGQLTDRDCTLLANSYTSCLNLAAENGVKSIAF 207 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTGV+ +P AAEIAV TV ++ ++ ++ F + E++ LY +L++ Sbjct: 208 CCISTGVFRFPAQKAAEIAVATVEDWKAKNNSAMKIVFNVFSEKDEALYNKLMS 261 >UniRef50_C7GZB8 Appr-1-p processing enzyme family domain protein n=3 Tax=Bacteria RepID=C7GZB8_9FIRM Length = 268 Score = 121 bits (304), Expect = 9e-27, Method: Compositional matrix adjust. Identities = 74/185 (40%), Positives = 106/185 (57%), Gaps = 14/185 (7%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDAC----L 51 +K + V QGDIT+L VD IVNAAN ++G +D IH AG L + C Sbjct: 83 IKDNLSVWQGDITRLKVDAIVNAANSQMLGCFIPLHTCIDNQIHTFAGIQLREECDQKME 142 Query: 52 KVRQQQG---DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRL 107 K+R++ G + PT ++T +LPAK VVH VGP+ GG ++ ++ L D Y N+L + Sbjct: 143 KLREKYGRDYEQPTAIPMLTEGYNLPAKKVVHIVGPIVSGGLTSDLEKDLADCYTNTLDM 202 Query: 108 VAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHL 166 N+ SV F ISTGV+ +P AAEIAVKTV ++ H+ E++ F + +E+ Sbjct: 203 CMENNLKSVVFCCISTGVFHFPNKRAAEIAVKTVGKWCEAHSYSLERIIFNVFKDEDKKY 262 Query: 167 YERLL 171 YE LL Sbjct: 263 YEELL 267 >UniRef50_C7N880 Predicted phosphatase, C-terminal domain of histone macro H2A1 like protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N880_SLAHD Length = 263 Score = 121 bits (303), Expect = 1e-26, Method: Compositional matrix adjust. 
Identities = 73/179 (40%), Positives = 102/179 (56%), Gaps = 8/179 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGG-----VDGAIHRAAGPALLDACLKVRQ 55 + R+ V QGDIT+L D IVNAAN ++G +D IH AG L + C ++ + Sbjct: 83 LDQRLSVWQGDITRLRADAIVNAANSQMLGCWAKCHSCIDNVIHTYAGVQLREECDRIMR 142 Query: 56 QQGDC-PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSY 113 QG+ PTGHA +T A +LP+K V+HTVGP+ +G +L L Y + L AA Sbjct: 143 AQGENEPTGHAKVTGAYNLPSKHVIHTVGPIAQGHPTARHRLQLAQCYTSCLDAAAATGC 202 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 S+AF ISTGVYG+P AA IAV TV +++ RH +P V F + +Y+ +L Sbjct: 203 ESIAFCGISTGVYGFPAEQAAPIAVDTVRDWLDRHPDVPMHVVFNVFGNRQLSIYQDIL 261 >UniRef50_Q17432 Protein B0035.3, confirmed by transcript evidence n=3 Tax=Chromadorea RepID=Q17432_CAEEL Length = 203 Score = 121 bits (303), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 75/166 (45%), Positives = 96/166 (57%), Gaps = 8/166 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPT 62 RI V GDITKL+VD IVNAAN L GGGGVDGAIHRAAG L + C QQ C Sbjct: 25 RISVWDGDITKLSVDAIVNAANSRLAGGGGVDGAIHRAAGRKQLQEEC----QQYNGCAV 80 Query: 63 GHAVITLAGDLP-AKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AVIT ++ K ++HTVGP G +E + L Y SL + N S+AF Sbjct: 81 GDAVITSGCNINHIKKIIHTVGPQVYGNVTDERRENLVACYRTSLDIAIENGMKSIAFCC 140 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY-DEENAH 165 ISTGVYGYP AA+ ++E++ ++ E++ V + D +N H Sbjct: 141 ISTGVYGYPNDDAAKTVTNFLTEYLEKNDTIERIVLVTFLDIDNEH 186 >UniRef50_B0EF86 MACRO domain-containing protein, putative n=2 Tax=Entamoeba RepID=B0EF86_ENTDI Length = 316 Score = 120 bits (302), Expect = 1e-26, Method: Compositional matrix adjust. 
Identities = 66/172 (38%), Positives = 94/172 (54%), Gaps = 9/172 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +I VV GDITK+ DV+VNAAN L GG GVDGAIH AAG L D +R C Sbjct: 47 MNKKIIVVTGDITKIQADVVVNAANSYLRGGAGVDGAIHSAAGYELYDY---LRSHYKHC 103 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG + +P K ++H VGP+ Q LQ Y+ L V Y S+AFP Sbjct: 104 DTGDFKPSPGFKMPCKEILHGVGPIGENAIQ-----LQRVYVRCLEYVRLKEYKSIAFPC 158 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLL 171 ISTG++GY A + ++ V +++ + L + ++ F CY+ + ++Y + L Sbjct: 159 ISTGIFGYSNEKACPVVLEVVRDWLEVNPLWDGKIIFCCYNLTDLNIYSKFL 210 >UniRef50_C7H575 RNase III regulator YmdB n=2 Tax=Faecalibacterium prausnitzii RepID=C7H575_9FIRM Length = 343 Score = 119 bits (298), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 69/155 (44%), Positives = 91/155 (58%), Gaps = 7/155 (4%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGP-ALLDACLKVRQQQGDCPTGHA 65 +++ DITK+A D IVN AN +L+ G G AI++AAG L AC + G C G A Sbjct: 5 MIRNDITKVAADAIVNPANRNLLQGSGTSRAIYQAAGEQELTAACEAI----GRCDLGRA 60 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 V T A LPAK + H V P W GG E + L AY ++L+L A SVAFP +S+G Sbjct: 61 VCTPAFGLPAKYIFHAVCPAWHGGGFGEAEQLAGAYHSALKLAAKYHCESVAFPLLSSGN 120 Query: 126 YGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYD 160 YGYP+ A IAV T+++++ H L VY V YD Sbjct: 121 YGYPKEQAFRIAVDTITQYVMEHDLT--VYLVLYD 153 >UniRef50_C5C222 Appr-1-p processing domain protein n=2 Tax=Actinomycetales RepID=C5C222_BEUC1 Length = 193 Score = 119 bits (298), Expect = 5e-26, Method: Compositional matrix adjust. 
Identities = 74/145 (51%), Positives = 90/145 (62%), Gaps = 5/145 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQ---QGDC 60 R V GDIT VDV+VNAANPSL+GGGGVDGAIHRAAGP+LL C +R+ +G Sbjct: 8 RREAVLGDITAQDVDVVVNAANPSLLGGGGVDGAIHRAAGPSLLAECQDLRRTVLPRG-L 66 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AV T AG+LPA VVHTVGP G Q + LL + SL + SVAFPA Sbjct: 67 SVGDAVATGAGNLPALWVVHTVGPNAHVG-QRDPALLASCFTRSLDVAGGLGARSVAFPA 125 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI 145 +S GV+G+ A IAV +V ++ Sbjct: 126 VSAGVFGWDVDVVARIAVDSVDTWL 150 >UniRef50_B7C850 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C850_9FIRM Length = 310 Score = 117 bits (293), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 62/167 (37%), Positives = 91/167 (54%), Gaps = 7/167 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +++ DITKL VD IVN NPSL GG+D IH+ AG L C ++ G+ G Sbjct: 3 IKIIRQDITKLKVDAIVNTTNPSLDAKGGLDHYIHQFAGKELDVECRRI----GNLKVGQ 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T +G K ++HT PVW +N + LL+ YL+SL L S+AFP IS+G Sbjct: 59 ACLT-SGYKLCKYIIHTASPVWNIQNKNNEALLKSCYLSSLMLANEYKLKSIAFPLISSG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +P+ A ++A+ ++ F+T H + VY V YD + + L Sbjct: 118 TNQFPKELALQVAMNSIVSFLTDHEM--MVYLVVYDRNSYKISSELF 162 >UniRef50_A7B8S3 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7B8S3_9ACTO Length = 270 Score = 116 bits (290), Expect = 4e-25, Method: Compositional matrix adjust. 
Identities = 77/183 (42%), Positives = 107/183 (58%), Gaps = 15/183 (8%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-----VDGAIHRAAGPALLDACLKV--RQQ 56 R+ + +GDIT+L VD IVNAAN +L+G +D AIH AAG L AC +V + Sbjct: 86 RMALWRGDITRLEVDAIVNAANSALLGCRAPGHTCIDNAIHSAAGLELRQACAEVMAERT 145 Query: 57 QGD----CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAAN 111 +GD PTG AV+T LP++ V+HTVGP+ G +E + L +Y L AA+ Sbjct: 146 RGDGPSGFPTGEAVLTPGFHLPSRFVIHTVGPIVNGELTDEHREALACSYQRCLEEAAAH 205 Query: 112 SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYE 168 +VAF ISTGV+G+P+ AA IAV TV++F+ TR A +V F + + + LY Sbjct: 206 GLNTVAFCCISTGVFGFPQEEAARIAVSTVADFLESDTRGASEVRVIFDVFGDHDEALYR 265 Query: 169 RLL 171 LL Sbjct: 266 ALL 268 >UniRef50_B5YAF3 Conserved protein n=2 Tax=Dictyoglomus RepID=B5YAF3_DICT6 Length = 182 Score = 115 bits (287), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 65/165 (39%), Positives = 97/165 (58%), Gaps = 3/165 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ VV+GDIT+ V+ IVNAAN L GGGV GAI RA G + + ++ G P G Sbjct: 13 KLKVVKGDITQEEVEAIVNAANSYLKHGGGVAGAIVRAGGEVIQKESDEYVEKYGPLPVG 72 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT AG L AK V+HTVGP W GE +E++ L+ A + L L + S++ PA+S Sbjct: 73 SATITSAGKLKAKYVIHTVGPRW--GEGDEEKKLEKAIESVLTLAKEKNIKSLSIPAVSC 130 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLY 167 G++G+P +I V V EF+ + + E+++F+ +E L+ Sbjct: 131 GIFGFPPQLGTKIIVNKVVEFLKDNPGVFEEIHFIGIGDEIPTLF 175 >UniRef50_A5ZAB5 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A5ZAB5_9FIRM Length = 274 Score = 114 bits (286), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 70/181 (38%), Positives = 99/181 (54%), Gaps = 14/181 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 +I + QGD+T+L VD IVNAAN +L+G +D AIH AG L + C K+ Q+ Sbjct: 92 KISIWQGDMTRLKVDAIVNAANSALLGCFVPCHRCIDNAIHSGAGMELREECNKIMNQRK 151 Query: 59 -------DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAA 110 + PTG A IT A +LP K V+HTVGP+ G +E L++ Y + L A Sbjct: 152 IKYGTNYEEPTGTATITEAYNLPCKKVIHTVGPICYFGLNDELCNDLKNCYESVLNCCAE 211 Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYER 169 N +VAF ISTG + +P AA IA TV F+ + E+V F Y + + +Y++ Sbjct: 212 NGLKTVAFCCISTGEFRFPNKEAAVIAKDTVERFLMKKENNIERVIFCVYKDLDREIYDK 271 Query: 170 L 170 L Sbjct: 272 L 272 >UniRef50_C2L199 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2L199_9FIRM Length = 344 Score = 114 bits (286), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 57/166 (34%), Positives = 97/166 (58%), Gaps = 5/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +++ DITK+ VD IVN ANP G+D A+++AAG L L+ RQ+ G G Sbjct: 3 FQIIRNDITKMQVDAIVNPANPIPGYAAGIDSAVYKAAGEEKL---LRRRQEIGAIAPGS 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 + IT +LPAK ++HTVG W+GG +E+ +++ Y + +L + S+A P +++G Sbjct: 60 SFITDGYNLPAKYIIHTVGTAWQGGNSDEEIIIRKCYRSIFKLALEHHILSLAIPLLASG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 YG+P+ A IA+ + F++ + + ++Y V +DE++ L L Sbjct: 120 SYGFPKGIALRIALSEIESFMSENDI--ELYLVVFDEKSYSLSTEL 163 >UniRef50_A4TAV6 Appr-1-p processing domain protein n=6 Tax=Actinomycetales RepID=A4TAV6_MYCGI Length = 577 Score = 114 bits (284), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 64/126 (50%), Positives = 82/126 (65%), Gaps = 3/126 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V+GDIT+ VD +VN AN ++ GGGG DGAIHRA GPA+L C+K R G TG Sbjct: 12 ITAVRGDITEQEVDAVVNPANTAMRGGGGADGAIHRAGGPAILRDCVK-RFPDG-LATGD 69 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A T AGDLPA+ V+HTVGP + G++N LL+ Y +L++ VAFP ISTG Sbjct: 70 AGWTTAGDLPAQWVIHTVGPNYDTGQRNR-SLLESCYRRALKVADELGARIVAFPLISTG 128 Query: 125 VYGYPR 130 +G+PR Sbjct: 129 SFGWPR 134 >UniRef50_C9LYS3 Appr-1-p processing enzyme family domain protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LYS3_9FIRM Length = 302 Score = 113 bits (283), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 74/182 (40%), Positives = 95/182 (52%), Gaps = 19/182 (10%) Query: 9 QGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGDC-PT 62 QGDIT+LAVD IVNAAN +L G +D AIH AAG AL AC ++ ++QG P Sbjct: 119 QGDITRLAVDAIVNAANSALRGCFVPLHRCIDNAIHSAAGLALRAACDEIMREQGHPEPA 178 Query: 63 GHAVITLAGDLPAKAVVHTVGP-------------VWRGGEQNEDQLLQDAYLNSLRLVA 109 G A IT +LPA+ V+HTVGP V+ G Q L Y L L A Sbjct: 179 GRAKITPGFNLPARHVLHTVGPIIAPAGSPVHEPGVFAGVTHEAQQCLVSCYRACLDLAA 238 Query: 110 ANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYER 169 SVAF ISTG + YP AAE AV T ++ H P ++ F + +E+ +Y R Sbjct: 239 ERRLASVAFCCISTGEFHYPPQEAAETAVATCRAWLQAHDTPMRIVFNVFKDEDLAIYRR 298 Query: 170 LL 171 + Sbjct: 299 IF 300 >UniRef50_A9WK70 Appr-1-p processing domain protein n=3 Tax=Chloroflexus RepID=A9WK70_CHLAA Length = 190 Score = 112 bits (280), Expect = 5e-24, Method: Compositional matrix adjust. 
Identities = 73/175 (41%), Positives = 98/175 (56%), Gaps = 7/175 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLD-ACLKVRQQQGDCPTG 63 + VV+GDI VD IVNAAN L+ GGGV GAI RAAG A L AC V CPTG Sbjct: 15 LEVVEGDIVSQQVDAIVNAANEQLLQGGGVCGAIFRAAGAAELQRACDAV----APCPTG 70 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A IT LPA+ ++H VGP++ +E D+LL AY SL L S+AFP+I+ Sbjct: 71 EARITPGFALPARYIIHAVGPIFDHYAPSEADRLLISAYRASLALARQYGLQSIAFPSIA 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 TG+YG+P AA + ++T+ + + H P V V + + +Y + T E Sbjct: 131 TGIYGFPVTRAAPLVLQTLIDDLHTHQAPGLVRMVLW-RDTFPVYRDVFTHMQSE 184 >UniRef50_A7EET2 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EET2_SCLS1 Length = 506 Score = 112 bits (280), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 65/147 (44%), Positives = 85/147 (57%), Gaps = 2/147 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + VV GD+ K VDVIVNAAN SL+ G G+DG IHR AGP L A +K + Sbjct: 19 TTVEVVDGDLLKYPVDVIVNAANASLVRGDGIDGEIHRQAGPELA-AEMKTQFPHPGKQG 77 Query: 63 GHAVITLAGDLPA-KAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G T + D+ + + ++H VGP WR Q LL +AY NSL L A N+ S+AFPAI Sbjct: 78 GAYGTTHSWDITSCQYIIHAVGPDWRQPNQRATGLLANAYHNSLSLAAKNNLRSIAFPAI 137 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH 148 S G++ PR A +KT+ +I H Sbjct: 138 SVGIFQMPRGMAGVTVMKTIRSWIDSH 164 >UniRef50_Q4DSL4 Putative uncharacterized protein n=4 Tax=Trypanosoma RepID=Q4DSL4_TRYCR Length = 297 Score = 112 bits (279), Expect = 8e-24, Method: Compositional matrix adjust. 
Identities = 69/168 (41%), Positives = 91/168 (54%), Gaps = 10/168 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + G +T L +D IVNAAN + +GG GVDGAIH AAGP L+ C C TG Sbjct: 125 IALHNGPVTDLQLDAIVNAANKTCLGGKGVDGAIHAAAGPLLVRECATF----NGCDTGQ 180 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 IT +LPA+ V+HTVGP+ GE+ E L+ Y + L L N S+ F +STG Sbjct: 181 CRITKGYNLPARYVLHTVGPI---GERPE--ALRSCYRSILSLAHRNRLRSIGFCCVSTG 235 Query: 125 VYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLL 171 VYGYP A IAV E++ +H + + F C+ E + Y L Sbjct: 236 VYGYPLIPATRIAVDETIEYLKQHFSAFDLCCFACFKLEEYNAYTDCL 283 >UniRef50_C4M8N0 Putative uncharacterized protein n=2 Tax=Entamoeba RepID=C4M8N0_ENTHI Length = 627 Score = 111 bits (278), Expect = 8e-24, Method: Composition-based stats. Identities = 65/174 (37%), Positives = 104/174 (59%), Gaps = 9/174 (5%) Query: 9 QGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQG-DCPT 62 +GDITKL VD IVNAAN L+G +D AIH AGP L C + +QG + PT Sbjct: 138 KGDITKLCVDAIVNAANNQLLGCFVPHHLCIDNAIHTFAGPQLRRDCSIIMNKQGFEEPT 197 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G+A +T A +LP+K V+HTVGP+ +++ LL+ +Y+N L + S+AF I Sbjct: 198 GYAKVTRAYNLPSKYVIHTVGPIVESQLKESHCNLLRSSYINCLNIADDLHLESIAFSCI 257 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLYERLLTQ 173 STG++G+P+ A+ IA++TV ++ + ++V F + + + +Y + +T+ Sbjct: 258 STGLFGFPQNVASVIAIETVINWLYENPFTSIKKVIFDVFSDNDLQIYTKNVTE 311 >UniRef50_UPI000196AD9C hypothetical protein CATMIT_00588 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AD9C Length = 334 Score = 111 bits (278), Expect = 9e-24, Method: Compositional matrix adjust. 
Identities = 63/165 (38%), Positives = 93/165 (56%), Gaps = 5/165 (3%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 +V+ DITK+ D+IVN ANP G D AI+ AAG +A L R+ G G Sbjct: 5 IVRNDITKVEADIIVNTANPQPKCVSGTDLAIYEAAGK---EALLAERKTIGPIERGEIA 61 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 +T A +L AK ++HTVGPVW G +E ++L+ Y L+ S+AFP ISTGVY Sbjct: 62 VTGAYNLNAKYIIHTVGPVWIDGNHHELEILERCYRLPLQKAIELGCQSIAFPLISTGVY 121 Query: 127 GYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +P+ A IAV S+F+T H + ++ V +D+ + L +++ Sbjct: 122 EFPKNKALHIAVSVFSQFLTEHEI--EIILVVFDKTSFQLSSQIV 164 >UniRef50_C9KLM2 Appr-1-p processing enzyme family domain protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KLM2_9FIRM Length = 262 Score = 111 bits (277), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 71/181 (39%), Positives = 102/181 (56%), Gaps = 9/181 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + R+ + QGDIT+L +D IVNAAN ++G +D AG + C K+ Q Sbjct: 81 LDPRLVLWQGDITRLRIDAIVNAANRQMLGCFLPNHNCIDNIEQTMAGVEMRYNCYKLMQ 140 Query: 56 QQG-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSY 113 QG D PTG IT LPA+ V+HTVGP+ +G +E + LL Y + L L A + Sbjct: 141 AQGHDEPTGKVKITSGYHLPARFVLHTVGPIVQGSLTDEHRRLLASCYESCLTLAAEHGL 200 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 VAF ISTGV+ +P+ AAA IAV+TV ++ H A ++V F +++ + +YE LL Sbjct: 201 KGVAFCCISTGVFRFPKDAAAHIAVRTVQHWLDVHPAASIKRVIFDVFEDADRRIYENLL 260 Query: 172 T 172 Sbjct: 261 N 261 >UniRef50_D2V337 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V337_NAEGR Length = 177 Score = 110 bits (276), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 66/177 (37%), Positives = 93/177 (52%), Gaps = 22/177 (12%) Query: 17 VDVIVNAANPSLMGGGGVDGAIHRAAGPALL--------DACLKVR--------QQQGDC 60 +D IVNAAN SLMGGGG+D IH AG L +CLK++ + + C Sbjct: 1 IDTIVNAANESLMGGGGIDQIIHARAGDELKLECKTKYSPSCLKMKGSITYGNDELEYRC 60 Query: 61 PTGHAVITLAGDLPAKA--VVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 TG AVIT A +L K ++HTVGP + +LL + Y + L+L N+ S+AF Sbjct: 61 ATGEAVITQAHNLSEKCQYIIHTVGPYLDENGNTQPELLSNCYNSCLQLAMENNLKSIAF 120 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE----QVYFVCYDEENAHLYERLL 171 P ISTG YGYP A +A+K V F+ H + + FV +++ +Y+ L Sbjct: 121 PCISTGYYGYPIEEACRLALKIVKNFLHSHLNKQSSLRHIIFVIFNDLEFEIYKILF 177 >UniRef50_C7HUZ2 RNase III regulator YmdB n=2 Tax=Anaerococcus RepID=C7HUZ2_9FIRM Length = 163 Score = 110 bits (275), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 67/157 (42%), Positives = 89/157 (56%), Gaps = 8/157 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPTG 63 + V+ DI KL VD IVNAAN L+ GGG+ G I AG L ACLK+ + G Sbjct: 3 LKVIDIDILKLNVDAIVNAANVDLIEGGGICGQIFEKAGREKLKKACLKLSPIK----PG 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPAIS 122 AVIT +L K ++H VGPV+ + Q +LQDAY NSL++ S+AFP IS Sbjct: 59 EAVITDGFNLYQKYIIHAVGPVYNEMYKEACQKILQDAYKNSLKIAKKKGIKSIAFPLIS 118 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY 159 +G+YGYP A IA T+ EF+ + + +VY Y Sbjct: 119 SGIYGYPDKDAFMIAKNTIDEFLKNYEM--EVYLSTY 153 >UniRef50_C2LSS3 Protein in Tap1-dppD intergenic region n=1 Tax=Streptococcus salivarius SK126 RepID=C2LSS3_STRSL Length = 254 Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 70/178 (39%), Positives = 102/178 (57%), Gaps = 8/178 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 ++ +++ QGDIT+LA D IVNAAN L+G +D AIH AAG L AC ++ Q Sbjct: 76 IRPNLYLWQGDITRLAADAIVNAANSKLLGCFVPNHSCIDNAIHTAAGVELRLACQELMQ 135 Query: 56 QQG-DCPTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSY 113 +QG D TG A +T A +LP++ V+HTVGP+ + E Q L +Y L L Sbjct: 136 EQGEDETTGQAKMTKAYNLPSRYVLHTVGPIIYDEVTDLERQQLASSYEECLNLAYEKGL 195 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 S+AF ISTG + +P AA+IA++TV +F H+ V F + + + +Y+ LL Sbjct: 196 RSLAFCCISTGEFRFPNEEAAKIAIETVLQFQKEHS-DMVVIFNVFKDLDYAIYQSLL 252 >UniRef50_P67344 UPF0189 protein SA0314 n=54 Tax=Staphylococcus RepID=Y314_STAAN Length = 266 Score = 110 bits (274), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 70/181 (38%), Positives = 98/181 (54%), Gaps = 10/181 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPAL-LDACLKVRQQQG 58 I V QGDIT L +D IVNAAN +G +D IH AG + LD +RQQ Sbjct: 87 IFVWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQGR 146 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRG---GEQNEDQLLQDAYLNSLRLVAANSYTS 115 + G A T +LPAK ++HTVGP R + N+D LL YL+ L+L +S Sbjct: 147 NEGVGKAKKTRGYNLPAKYIIHTVGPQIRRLPVSKMNQD-LLAKCYLSCLKLADQHSLNH 205 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 VAF ISTGV+ +P+ AAEIAV+TV ++ +V F + +++ LY+ L + Sbjct: 206 VAFCCISTGVFAFPQDEAAEIAVRTVESYLKETNSTLKVVFNVFTDKDLQLYKEALNRDA 265 Query: 176 D 176 + Sbjct: 266 E 266 >UniRef50_A6LTB5 Appr-1-p processing domain protein n=3 Tax=Clostridium RepID=A6LTB5_CLOB8 Length = 214 Score = 109 bits (273), Expect = 3e-23, Method: Compositional matrix adjust. 
Identities = 72/215 (33%), Positives = 105/215 (48%), Gaps = 48/215 (22%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T ++ DITK+ D IVNAAN SL+GGGGVDGAIH+A G LLD C RQ G C T Sbjct: 2 TNFKILFDDITKIKFDAIVNAANASLLGGGGVDGAIHKACGEKLLDEC---RQLNG-CLT 57 Query: 63 GHAVITLAGDLPAKA---VVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVA---------- 109 G + +T + +L V+HTVGP++R +E++ L++AY + + A Sbjct: 58 GRSKLTRSYNLSDHGVHWVIHTVGPIYRNN-GSEEKYLRNAYRSVFDIAANYSEFYSKQC 116 Query: 110 -----ANSY------------------------TSVAFPAISTGVYGYPRAAAAEIAVKT 140 N Y ++A P+ISTG Y YP A IA+ Sbjct: 117 NEILNKNLYRFNTDKQRDFILKELDDYINDHPIKTIALPSISTGAYSYPLNEACNIALDE 176 Query: 141 VSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQ 174 + FI +++ VC DE+ ++Y+ L ++ Sbjct: 177 ILSFINNSPDTFDEIAMVCLDEKTYNMYKSLYEER 211 >UniRef50_A2FMC7 Appr-1-p processing enzyme family protein n=1 Tax=Trichomonas vaginalis RepID=A2FMC7_TRIVA Length = 361 Score = 109 bits (272), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 63/166 (37%), Positives = 94/166 (56%), Gaps = 12/166 (7%) Query: 8 VQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVI 67 ++G+ KL D +VNAAN L GGG+ G +H AAG A+ C ++ G PTG + Sbjct: 122 MRGNSVKLECDAVVNAANSHLYPGGGICGVLHSAAGEAMERECSEI----GYTPTGKCAV 177 Query: 68 TLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYG 127 TL +LPAK +HTVGP+ GEQ + LQ+AY ++L + SV ISTG+YG Sbjct: 178 TLGYNLPAKYCIHTVGPI---GEQPDK--LQEAYESTLSCIDGKKIRSVGLCCISTGIYG 232 Query: 128 YPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERL 170 YP A IA+K V +F+ +++ FV ++ + +Y+R+ Sbjct: 233 YPIENATPIALKVVRKFLEDPNNREKTDRIIFVVFERRDVVVYDRM 278 >UniRef50_B0EH33 Putative uncharacterized protein n=2 Tax=Entamoeba RepID=B0EH33_ENTDI Length = 348 Score = 109 bits (272), Expect = 5e-23, Method: Compositional matrix adjust. 
Identities = 67/173 (38%), Positives = 94/173 (54%), Gaps = 8/173 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGD 59 I V +GDITKL +D IVNAAN +L+G VD IH AG L C +++ Sbjct: 93 IRVWKGDITKLKIDSIVNAANNTLVGCFIPLHSCVDSIIHERAGVQLRYECSQLKTAYKA 152 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 T IT +LPAK V+H VGP+ + + LLQ YLN L + TS+ F Sbjct: 153 -TTTTTEITKGYNLPAKYVIHVVGPIVDTLKPKDSYLLQQCYLNCLNKAIESGCTSIGFC 211 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTG++G+P AA+IA++TV+ F+ H + V F + E + ++Y LL Sbjct: 212 CISTGMFGFPNEEAAQIAIQTVNNFLKDHQI--DVVFCVFKEIDYNIYTSLLN 262 >UniRef50_C4G1S1 Putative uncharacterized protein n=3 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1S1_ABIDE Length = 359 Score = 108 bits (269), Expect = 9e-23, Method: Compositional matrix adjust. Identities = 56/169 (33%), Positives = 92/169 (54%), Gaps = 5/169 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +++ DITK+ D IVN ANP + GGGV+ AI+ AAG L L R++ G G Sbjct: 3 FRIIRNDITKVKADAIVNTANPEVAIGGGVETAIYSAAGKKKL---LDERKKIGILQPGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T A DL AK ++H P W+GG + E + L+D Y L+ S+AFP ++TG Sbjct: 60 VGVTEAFDLAAKYIIHVSSPRWKGGNKGEIKCLRDCYEKVLKTAKDYGCESIAFPLLATG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 YG+P+ ++AV + F+ + + ++ V ++ E + +L+ + Sbjct: 120 TYGFPKEVGVQVAVDAFTAFLEENEM--EITLVVFESEAVSISGKLVEE 166 >UniRef50_C8NG26 Appr-1-p processing enzyme family domain protein n=2 Tax=Granulicatella RepID=C8NG26_9LACT Length = 264 Score = 108 bits (269), Expect = 1e-22, Method: Compositional matrix adjust. 
Identities = 66/178 (37%), Positives = 99/178 (55%), Gaps = 9/178 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 +I + GD+ +L VD IVNAAN ++G +D AIH +G L C + ++QG Sbjct: 84 QIKLYYGDLCELKVDAIVNAANSEMLGCFIPNHRCIDNAIHTFSGIELRTFCHHLMKKQG 143 Query: 59 DC-PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRLVAANSYT 114 P G A IT A +LP+K ++HTVGP G++ +QLL Y + L T Sbjct: 144 KKEPVGKAKITPAFNLPSKYIIHTVGPFLSPGQKVTPLREQLLASCYKSCLEAAREAGLT 203 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 S+AF ISTG +G+P+ AA IA TV++++ A V F Y +E+ +Y++LL+ Sbjct: 204 SIAFCGISTGEFGFPKEPAALIAEDTVNKWLQDTASTITVVFSTYTKEDQSIYQKLLS 261 >UniRef50_B0A8R6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B0A8R6_9CLOT Length = 361 Score = 107 bits (268), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 60/177 (33%), Positives = 93/177 (52%), Gaps = 10/177 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDA-CLKVRQQQGDCPTG 63 +++ IT + D IVN N L GGV G+I AG +L+ C K+ G T Sbjct: 3 FEIIRQYITNMKTDAIVNPTNNELKPTGGVCGSIFEKAGYEILEKKCKKI----GYLETT 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT +L K ++HTVGP+W + + LL + Y N L+L + S+AFP IS+ Sbjct: 59 EAVITKGYNLDCKYIIHTVGPIWDNAKSDNATLLYNTYTNCLKLAKSKKCNSIAFPLISS 118 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL---LTQQGDE 177 G +GYP+ A +IA + F+ + + +Y V +D E+ + + L +TQ D+ Sbjct: 119 GNFGYPKDKALDIATNAIKNFLLENDM--LIYLVVFDRESFKINKDLFDSITQYIDD 173 >UniRef50_A8STD9 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8STD9_9FIRM Length = 348 Score = 107 bits (267), Expect = 2e-22, Method: Compositional matrix adjust. 
Identities = 60/167 (35%), Positives = 93/167 (55%), Gaps = 6/167 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ-GDCPTG 63 + +V+ DI K+ D IVN AN ++ G G DGA++RAAG D L R++ G G Sbjct: 3 LRIVRNDIVKMTTDAIVNTANDHVVVGTGCDGAVYRAAG---YDELLNYRREYIGFVEEG 59 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT L A+ ++H V P + G+ E+ L+ Y SL+L N S+AFP IST Sbjct: 60 GAFITPGFGLNARYIIHAVSPRFIDGDHGEEGKLRSCYRKSLQLAKENGVRSIAFPLIST 119 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 G +GYP+ IAV ++ F+ + + ++ V +DE++ L E++ Sbjct: 120 GGFGYPKEEGLRIAVDEINAFLFENEV--DIFLVVFDEKSTRLGEKI 164 >UniRef50_C1QBX0 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBX0_9SPIR Length = 257 Score = 107 bits (266), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 57/177 (32%), Positives = 98/177 (55%), Gaps = 7/177 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 ++ +++ QGDIT L +D +VNAAN S++G +D AIH A+G L CL Sbjct: 81 IRDNLYLWQGDITTLNIDAVVNAANSSMLGCFIPLHKCIDNAIHSASGTRL-RLCLNNIM 139 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYT 114 + +G +IT A +LP++ ++HTVGP+ + + +++LL + Y + L N+ Sbjct: 140 KGKTEDSGQCIITKAFNLPSRYILHTVGPIIQNSVSKKDEELLYNCYKSCLETAKENNIK 199 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 S+AF ISTG + +P A++IAV V +F+ ++ F + + + LY +L Sbjct: 200 SIAFCCISTGEFKFPNKEASQIAVNAVKDFLNNSKYDIKIVFNVFKDLDYELYYDIL 256 >UniRef50_Q5XC09 UPF0189 protein M6_Spy0919 n=20 Tax=Streptococcus RepID=Y919_STRP6 Length = 270 Score = 107 bits (266), Expect = 3e-22, Method: Compositional matrix adjust. 
Identities = 73/187 (39%), Positives = 98/187 (52%), Gaps = 15/187 (8%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 T + + GDI LAVD IVNAAN L+G G +D AIH AG L AC + +Q Sbjct: 84 TSLFLYHGDIRYLAVDAIVNAANSELLGCFIPNHGCIDNAIHTFAGSRLRLACQAIMTEQ 143 Query: 58 GDCPT-GHAVITLAGDLPAKAVVHTVGPVWRGGEQNED---QLLQDAYLNSLRLVAANSY 113 G G A +T A LPA ++HTVGP G LL Y +SL L Sbjct: 144 GRKEAIGQAKLTSAYHLPASYIIHTVGPRITKGRHVSPIRADLLARCYRSSLDLAVKAGL 203 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ----VYFVCYDEENAHLYER 169 TS+AF +ISTG +G+P+ AA+IA+KTV ++ H PE V F + E+ LY+ Sbjct: 204 TSLAFCSISTGEFGFPKKEAAQIAIKTVLKWQAEH--PESKTLTVIFNTFTSEDKALYDT 261 Query: 170 LLTQQGD 176 L ++ + Sbjct: 262 YLQKENN 268 >UniRef50_C9XM94 Putative uncharacterized protein n=6 Tax=Clostridium RepID=C9XM94_CLODC Length = 286 Score = 106 bits (265), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 72/181 (39%), Positives = 100/181 (55%), Gaps = 18/181 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGG-----VDGAIHRAAGPALLDACLKVRQQQGD 59 I + +G+IT L D IVNAAN L+G VD IH AGP L + C K+ ++QG Sbjct: 109 IAIWRGNITNLRADAIVNAANNKLLGCLQPLHLCVDNEIHSCAGPRLREDCDKIIKKQGH 168 Query: 60 CP-TGHAVITLAGDLPAKAVVHTVGPVWRGGE---QNEDQLLQ--DAYLNSLRLVAANSY 113 TG A IT LPAK VVHTVGP+ GG+ + E QLL + LN+++ + + Sbjct: 169 LEYTGDAKITRGYCLPAKFVVHTVGPIVSGGQPSKEQEKQLLHCYKSCLNTIKEI--DEI 226 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE---QVYFVCYDEENAHLYERL 170 ++ F ISTGV+GYP+ AA +AV V ++ + PE +V F + EE Y R+ Sbjct: 227 KNIVFCGISTGVFGYPKKEAANLAVSRVRLWLKEN--PEKNLKVVFNVFTEEEEEKYRRI 284 Query: 171 L 171 Sbjct: 285 F 285 >UniRef50_UPI0001B4DEB3 hypothetical protein ShygA5_39675 n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4DEB3 Length = 311 Score = 105 bits (262), Expect = 7e-22, Method: Compositional matrix adjust. 
Identities = 73/179 (40%), Positives = 100/179 (55%), Gaps = 9/179 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-----VDGAIHRAAGPALLDACLKVRQQQG 58 R + QGDIT L D +VNAAN +L+G +D AIH AAGP L C + +QG Sbjct: 128 RTVLWQGDITTLGADAVVNAANSALLGCFAPMHPCIDNAIHTAAGPRLRADCHTIMTRQG 187 Query: 59 DC-PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVA-ANSYTS 115 PTG A IT LPA+ V+HTVGP+ G + D+ L +Y L L A + + Sbjct: 188 HPEPTGTAKITRGYHLPARYVLHTVGPIVDGPLRPVHDRALAASYRACLDLAAEVDGLRT 247 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQ 173 VAF ISTGV+GYPR AA A+ TV++++ H ++V F Y +++ Y LT+ Sbjct: 248 VAFCGISTGVFGYPRKPAARAALDTVADWLGTHPGRLDRVIFNVYADDDHAAYTHALTE 306 >UniRef50_D1BM15 Appr-1-p processing domain protein n=15 Tax=Bacteria RepID=D1BM15_VEIPT Length = 259 Score = 105 bits (261), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 69/176 (39%), Positives = 96/176 (54%), Gaps = 9/176 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 +I++ QGDIT+LAV IVNAAN L+G +D AIH AG L AC ++ + Sbjct: 84 QIYLWQGDITRLAVKAIVNAANEQLLGCFLPNHKCIDNAIHTFAGIELRMACARMTEYM- 142 Query: 59 DCP--TGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 D P TG A +T +LPA V+HTVGP V+ E + L Y + L L A S S Sbjct: 143 DMPEKTGVARMTYGFNLPASHVIHTVGPIVYDTVTDLEKEQLSSCYRSCLELANAYSLKS 202 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +AF ISTG + +P AA+IA+ TV ++ QV F + + + +Y +LL Sbjct: 203 IAFCCISTGEFRFPNELAAQIAIDTVRRYLKETNSKIQVVFNVFKDIDYDIYNKLL 258 >UniRef50_Q460N5 Poly [ADP-ribose] polymerase 14 n=19 Tax=Eutheria RepID=PAR14_HUMAN Length = 1720 Score = 105 bits (261), Expect = 9e-22, Method: Composition-based stats. 
Identities = 58/138 (42%), Positives = 79/138 (57%), Gaps = 1/138 (0%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 V QGD+ +L VDV+VNA+N L GG+ A+ +AAGP L C ++ +++G G+A Sbjct: 725 VQQGDLARLPVDVVVNASNEDLKHYGGLAAALSKAAGPELQADCDQIVKREGRLLPGNAT 784 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 I+ AG LP V+H VGP W G E LL+ A SL L Y S+A PAIS+GV Sbjct: 785 ISKAGKLPYHHVIHAVGPRWSGYEAPRCVYLLRRAVQLSLCLAEKYKYRSIAIPAISSGV 844 Query: 126 YGYPRAAAAEIAVKTVSE 143 +G+P E V + E Sbjct: 845 FGFPLGRCVETIVSAIKE 862 Score = 68.2 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 45/166 (27%), Positives = 83/166 (50%), Gaps = 6/166 (3%) Query: 7 VVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 +V+ + DV+VN+ L + G + ++ AGP L + V Q G Sbjct: 937 LVKEGVQNAKTDVVVNSVPLDLVLSRGPLSKSLLEKAGPELQEELDTVGQGVA-VSMGTV 995 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 + T + +L + V+H V P WR G + ++++D + + + S S+AFPAI TG Sbjct: 996 LKTSSWNLDCRYVLHVVAPEWRNGSTSSLKIMEDIIRECMEITESLSLKSIAFPAIGTGN 1055 Query: 126 YGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCY--DEENAHLY 167 G+P+ AE+ + V +F +++ L ++V+F+ + D EN + Sbjct: 1056 LGFPKNIFAELIISEVFKFSSKNQLKTLQEVHFLLHPSDHENIQAF 1101 Score = 52.8 bits (125), Expect = 6e-06, Method: Composition-based stats. 
Identities = 40/142 (28%), Positives = 64/142 (45%), Gaps = 14/142 (9%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 V GDITK DVIVN+ + S GV AI AG + C + QQ+ + Sbjct: 1149 QVASGDITKEEADVIVNSTSNSFNLKAGVSKAILECAGQNVERECSQQAQQRKN----DY 1204 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 +IT G L K ++H +G G + ++ + + L+ +Y+S+ PAI TG Sbjct: 1205 IITGGGFLRCKNIIHVIG----GND------VKSSVSSVLQECEKKNYSSICLPAIGTGN 1254 Query: 126 YGYPRAAAAEIAVKTVSEFITR 147 AE + + +F+ + Sbjct: 1255 AKQHPDKVAEAIIDAIEDFVQK 1276 >UniRef50_A1WVH3 Appr-1-p processing domain protein n=14 Tax=Bacteria RepID=A1WVH3_HALHL Length = 181 Score = 104 bits (260), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 66/174 (37%), Positives = 95/174 (54%), Gaps = 10/174 (5%) Query: 1 MKTRIHVVQGDITKLA-VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 ++TR+ GDI D +VNAAN LM GGGV GA+HRAAGP L +AC + Q Sbjct: 10 VETRV----GDIAAQGDCDAVVNAANAQLMPGGGVAGALHRAAGPELAEACRPLAPIQ-- 63 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G AVIT LP + V+H +GPV+ G ++ +QLL Y N+L + T VA P Sbjct: 64 --PGQAVITAGFGLPNRHVIHCLGPVY-GVDEPGEQLLAACYRNALHRAEEHELTRVAMP 120 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 A+STG +G+P AA +A+ T+ + V FV D +++ ++ + Sbjct: 121 ALSTGAFGFPMERAARVAIGTLQRTAAQLRYVRHVRFVLADAAAQQIHDHVIQE 174 >UniRef50_A4YFR3 Appr-1-p processing domain protein n=9 Tax=Thermoprotei RepID=A4YFR3_METS5 Length = 220 Score = 104 bits (260), Expect = 1e-21, Method: Compositional matrix adjust. 
Identities = 65/173 (37%), Positives = 92/173 (53%), Gaps = 5/173 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + +++GDITK+ D IVNAAN L GGGV AI R G A+ + ++ G P G Sbjct: 51 VDLMKGDITKIEADAIVNAANSYLSHGGGVAWAIVRRGGEAIQRESDQYVREHGPVPVGE 110 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T AG L AK V+H VGP R G + ED+ L A SL S+A PAISTG Sbjct: 111 VAVTGAGSLRAKYVIHAVGP--RYGLEGEDK-LHSAIRRSLEKAEELGLRSLALPAISTG 167 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 +YGYP A + + + + + E+V V YD+ +E++ T++ E Sbjct: 168 IYGYPMEVCARVMASVLRSY--KPKILEKVIVVLYDDMAYSTFEKVFTRELQE 218 >UniRef50_A7T167 Protein GDAP2 homolog n=1 Tax=Nematostella vectensis RepID=GDAP2_NEMVE Length = 502 Score = 104 bits (259), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 57/149 (38%), Positives = 83/149 (55%), Gaps = 4/149 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDITKLA D IVN N SL G + +HRAAGP L+ C RQQ C Sbjct: 49 INAKVVLWNGDITKLAADAIVNTTNESLSDRGALSERVHRAAGPELMQEC---RQQLLGC 105 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A I+ +LPA+ V+HTVGP + + + L Y N++RLV N +++ Sbjct: 106 RTGEAKISEGYNLPARYVIHTVGPRYNTKYKTAAESALFSCYRNTMRLVRENKISTIGVC 165 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH 148 ++T GYP A IA++TV F+ ++ Sbjct: 166 VVNTTKRGYPPEDGAHIALRTVRRFLEKY 194 >UniRef50_A0Q2I9 Appr-1-p processing enzyme family protein n=3 Tax=Clostridia RepID=A0Q2I9_CLONN Length = 183 Score = 103 bits (257), Expect = 2e-21, Method: Compositional matrix adjust. 
Identities = 60/172 (34%), Positives = 98/172 (56%), Gaps = 4/172 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + +GDIT + D IVN AN L GGGV AI + G + + K+ +++G PTG Sbjct: 9 IIIKKGDITNESSDAIVNPANGMLKHGGGVAAAIVKKGGREVQEESNKIVRKEGIIPTGG 68 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AVIT +LP K ++H VGP R GE +E L++A L++L L ++ S++ PAIS+G Sbjct: 69 AVITKGYNLPCKYIIHAVGP--RMGEGDEKLKLKNAVLSALCLAEQHNLKSISIPAISSG 126 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 ++ +P+ A+I + T +F+ A + +C ++ YE L ++ + Sbjct: 127 IFRFPKDECAKILINTSIKFLQTSAKSLKTIVMCNLDDKT--YEIFLQEEKE 176 >UniRef50_UPI0000E80997 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=3 Tax=Gallus gallus RepID=UPI0000E80997 Length = 1655 Score = 103 bits (256), Expect = 3e-21, Method: Composition-based stats. Identities = 63/179 (35%), Positives = 96/179 (53%), Gaps = 10/179 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + V +G++ VDV+VNAA+ L G A+ +AAGP L C +V + G Sbjct: 642 TELLVYKGNLCNYPVDVVVNAASEDLRHTDGFAWALLQAAGPELQAECDEVVRMTGSLQA 701 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ---LLQDAYLNSLRLVAANSYTSVAFP 119 G AVIT AG LP K V+H +GP W+ E+N + LL +A SL+L ++ S+AFP Sbjct: 702 GDAVITGAGKLPCKQVIHAIGPQWK--EKNSGKCMYLLMEAIKKSLQLAETYNHRSIAFP 759 Query: 120 AISTGVYGYPRAAAAEIAV----KTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 ++S G++G+P V KT+ EF +L E ++ V DEE + + ++ Sbjct: 760 SVSGGIFGFPPHKCVNAIVSAIKKTLEEFKRDSSLKE-IHLVAVDEETVRVLRETVQKE 817 Score = 56.6 bits (135), Expect = 3e-07, Method: Composition-based stats. 
Identities = 44/147 (29%), Positives = 68/147 (46%), Gaps = 14/147 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GDITK +VIVN AN + GV AI AAG + + C Q G Sbjct: 1075 LKVTSGDITKEDTEVIVNIANQTFDATSGVFKAIMDAAGFDVKEEC----NQYGGLLQSG 1130 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 + T G L + ++H + + ++Q+ + + LR +Y SVAFPAI TG Sbjct: 1131 FITTKGGALLCRRIIHLIHSM-----NVKNQVSEVLHECQLR-----TYKSVAFPAIGTG 1180 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALP 151 A A+ + + EF++ ++P Sbjct: 1181 AAQQSPAKVADDMLDAIVEFVSSRSVP 1207 Score = 56.6 bits (135), Expect = 4e-07, Method: Composition-based stats. Identities = 51/170 (30%), Positives = 74/170 (43%), Gaps = 8/170 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI V + DI DVIVN+ L G G + A+ + AGP L K + QQ Sbjct: 866 RIQVEKKDIIDATTDVIVNSVGTDLKFGVGPLCRALLKEAGPELQMEFDKEKGQQV-AGN 924 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G V T L V H V P W G + L++ L S+AFPAI Sbjct: 925 GSVVCTKGYILDCTFVFHAVLPQWDRGSGQALKTLENTVHKCLMKAEEFGLKSIAFPAIG 984 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCY--DEENAHLY 167 TG + +P +++ V +F +R L ++V+FV + D +N + Sbjct: 985 TGGFSFPHTVVSKLMFDEVFKFSRCQSRKTL-QEVHFVLHPNDRQNIQAF 1033 >UniRef50_A7C4X9 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C4X9_9GAMM Length = 220 Score = 103 bits (256), Expect = 4e-21, Method: Compositional matrix adjust. 
Identities = 58/158 (36%), Positives = 86/158 (54%), Gaps = 3/158 (1%) Query: 17 VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLPAK 76 VD IVN AN L GGG+ I AG L +AC K+ QQQG AV+T AG LP + Sbjct: 28 VDTIVNPANSGLSHGGGLAEQILLEAGSKLEEACHKIIQQQGKISVTKAVVTTAGQLPYQ 87 Query: 77 AVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEI 136 V+H VGP R G+ E ++ +N L++ + S+AFPAISTG++ P+ A+ Sbjct: 88 GVIHAVGP--RMGDGKEQSKIETTIINCLQIAEKYQWKSIAFPAISTGLFCVPKTVCAKA 145 Query: 137 AVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLLTQ 173 K +S + H + ++C E+ ++E++L Q Sbjct: 146 FDKAISYYWENHPNSAIKNIWLCLLTEDYPIFEKILNQ 183 >UniRef50_A6SR30 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6SR30_BOTFB Length = 474 Score = 102 bits (254), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 54/148 (36%), Positives = 79/148 (53%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T + V+ GD+ K VDVIVNAAN L GGG+DGAIH AAGP L ++ Q G Sbjct: 17 LDTTVEVLIGDMLKYPVDVIVNAANVKLKKGGGIDGAIHAAAGPELQGEMNELFQHPGQV 76 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 + + + ++H VGP W EQ + + L A NSL L N S+AFP Sbjct: 77 GGAYGTTSSWDIQSCRYIIHAVGPNWNIPEQQDGKFLFTAIQNSLDLAMKNKLRSIAFPG 136 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH 148 IS G++ P++ A + + + +I ++ Sbjct: 137 ISMGIFAMPKSLAGLVIISALRTWIIKY 164 >UniRef50_UPI00006A2284 UPI00006A2284 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2284 Length = 694 Score = 102 bits (253), Expect = 7e-21, Method: Compositional matrix adjust. 
Identities = 60/177 (33%), Positives = 97/177 (54%), Gaps = 4/177 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V + D+ + +VDV+VNAAN L GG+ GA+ RAAGP L C ++ + +G G Sbjct: 3 VAVYKDDLARHSVDVVVNAANEDLKHIGGLAGALLRAAGPKLQTDCDQIIKIRGRLSAGD 62 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT AG+LP K V+H VGPVW + D+ L A + L L A + S+ PA+S+ Sbjct: 63 AVITDAGNLPCKQVIHAVGPVWNAFFPGKCDRQLHKAITSCLDLAARKGHRSIGIPAVSS 122 Query: 124 GVYGYP-RAAAAEI--AVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 G++G+P + I ++K E + H+ +Q++ V + + L + ++ Sbjct: 123 GIFGFPLKRCVTHILGSIKAYVEDNSAHSTIKQIHLVALESATVQAFTDALRAESEQ 179 Score = 65.1 bits (157), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 49/170 (28%), Positives = 73/170 (42%), Gaps = 8/170 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V+Q I DVIVN L + + A+ AGP L L Q P G Sbjct: 193 IKVIQQAIEDSTTDVIVNNVGQKLQLNEWQISRALAARAGPQL-QQLLSNSSQGASAPNG 251 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 T +L V+H V P W Q+L+ + + L+L S S++ PAI T Sbjct: 252 SVFSTDGCNLNCAKVLHVVMPQW----DRRTQVLRKSIKSCLKLTEQQSLQSISIPAIGT 307 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHLYERLL 171 G GYP+ A + K + F ++ ++V V + D EN ++ + L Sbjct: 308 GKLGYPKDLVAAVTFKEILHFSSKAQSLQEVNIVLHPRDTENIQVFSKEL 357 >UniRef50_UPI0000ECB76F Poly [ADP-ribose] polymerase 14 (EC 2.4.2.30) (PARP-14) (B aggressive lymphoma protein 2). n=2 Tax=Gallus gallus RepID=UPI0000ECB76F Length = 1636 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. 
Identities = 64/176 (36%), Positives = 95/176 (53%), Gaps = 10/176 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V + D+ VDV+VNA+N L GG+ A+ +AAGP L C V + G G Sbjct: 637 IAVYKADLCTHHVDVVVNASNEDLKHIGGLAWALLQAAGPELQAECDGVVRMSGSLQAGD 696 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ---LLQDAYLNSLRLVAANSYTSVAFPAI 121 AVIT AG LP K V+H VGP W+ EQ+ ++ LL+ SL+L ++ S+AFP++ Sbjct: 697 AVITGAGKLPCKQVIHAVGPRWK--EQDAEKCVYLLKKTIKKSLQLAETYNHRSIAFPSV 754 Query: 122 STGVYGYPRAAAAEIAV----KTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S G++G+P V KT+ EF +L E ++ V E+N + + L + Sbjct: 755 SGGIFGFPLHKCVNAIVSAIKKTLEEFKRDSSLKE-IHLVDITEDNVQAFIKALKE 809 Score = 61.6 bits (148), Expect = 1e-08, Method: Composition-based stats. Identities = 50/153 (32%), Positives = 70/153 (45%), Gaps = 28/153 (18%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 V GDITK DVIVN +N + GV AI AG + + C ++ Q P Sbjct: 1061 QVAAGDITKETGDVIVNISNQAFNLKTGVSKAILEGAGKEVENECAELALQ----PNDGY 1116 Query: 66 VITLAGDLPAKAVVHTVG------PVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 + T AG LP K ++H V PV ++LQ+ L YTSV FP Sbjct: 1117 ITTEAGSLPCKKIIHFVARDDIKVPV--------SKVLQECEL--------QQYTSVTFP 1160 Query: 120 AISTGVYG-YPRAAAAEIAVKTVSEFITRHALP 151 AI TG G +P A E+ + +++F ++ P Sbjct: 1161 AIGTGQAGRFPDLVADEM-MDAITDFARSNSTP 1192 Score = 50.4 bits (119), Expect = 3e-05, Method: Composition-based stats. 
Identities = 39/145 (26%), Positives = 64/145 (44%), Gaps = 4/145 (2%) Query: 5 IHVVQGDITKLAVD-VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I + +G+I + D V+++ + G + A+ AGP L + G P Sbjct: 848 IMLKKGNIEDASTDGVVISVGGDLQLEKGQLAKALLSKAGPRLQSDLND--EGLGKSPVE 905 Query: 64 HAVITLAG-DLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 +V T G +L V H V P W G ++ ++L L+ S S+ FPAI Sbjct: 906 GSVFTTRGYNLSCCYVFHAVTPGWSQGSESAVKILGKIVTKCLQTAEELSLKSITFPAIG 965 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITR 147 TG+ G+P + A+ V EF ++ Sbjct: 966 TGILGFPSSVVAKSLFDKVYEFSSK 990 >UniRef50_A7S3X0 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7S3X0_NEMVE Length = 143 Score = 101 bits (252), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 60/142 (42%), Positives = 82/142 (57%), Gaps = 3/142 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V QGDIT D +VNAAN L+ GGGV GAI G ++ + C ++ + G G Sbjct: 1 VTVYQGDITNERADAVVNAANCDLIHGGGVAGAILAKGGWSIQEECYQIVGRFGRLEVGD 60 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGG--EQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 AV T AG L KAV+H VGP W G EQ ++QL + A L SL + S+AFPAIS Sbjct: 61 AVQTNAGKLLCKAVIHAVGPTWLGATPEQVKNQLFR-ACLESLYTADNINLCSIAFPAIS 119 Query: 123 TGVYGYPRAAAAEIAVKTVSEF 144 +G+YG P+ A++ + V + Sbjct: 120 SGIYGVPKEICAQVMLDVVEHY 141 >UniRef50_B9WC14 Putative uncharacterized protein n=5 Tax=Candida RepID=B9WC14_CANDC Length = 564 Score = 101 bits (252), Expect = 9e-21, Method: Composition-based stats. 
Identities = 64/176 (36%), Positives = 96/176 (54%), Gaps = 14/176 (7%) Query: 9 QGDITKLA-VDVIVNAANPSLMG-----GGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +GDIT L V IVNAAN +L+G +D IH AAGP L AC + Q + + PT Sbjct: 97 KGDITTLTDVTAIVNAANSTLLGCFQPRHKCIDNVIHIAAGPDLRQACYNLMQSKSE-PT 155 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGE--QNEDQLLQDAY---LNSLRLVAANSYTSVA 117 G A IT +LPAK V+HTVGP+ + E + L Y L +L ++ S+A Sbjct: 156 GSAKITPGFNLPAKYVIHTVGPIIHNESVTKREQEQLASCYQSSLEALEMLNDEKDKSIA 215 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 F +STG++ +P+ A+ IA+ TV +++ H + + + F + E+ +YE L Sbjct: 216 FCCVSTGLFAFPKELASTIAINTVHDYLKTHPNSTIKHIVFNVFSNEDKEVYENNL 271 >UniRef50_B7CC50 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CC50_9FIRM Length = 175 Score = 101 bits (251), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 64/177 (36%), Positives = 89/177 (50%), Gaps = 16/177 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I ++G+I L D+IV+ N ++ GV I AG ++ AC Q+ G Sbjct: 2 ISTLKGNIALLDFDLIVDPTNKQVLPMQGVSAQIFHQAGSEMMKAC----QELNGLEVGK 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRL----VAANSYTS--VAF 118 A +T A +LP KAV+HT GP + G NED+ L Y NS+ L + N S +AF Sbjct: 58 AKMTKAFNLPCKAVIHTCGPRYMDGTHNEDEYLAACYWNSMALAYDYMRKNDMESINIAF 117 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE----QVYFVCYDEENAHLYERLL 171 P ISTG+ YP A IA++TV + + PE V FVC E+ LY+ L Sbjct: 118 PCISTGINAYPNHEACVIAIQTVKRLMNK--FPETKAIHVCFVCDKTEDYMLYKEAL 172 >UniRef50_C2D2Z2 Appr-1-p processing enzyme family domain protein n=1 Tax=Lactobacillus brevis subsp. gravesensis ATCC 27305 RepID=C2D2Z2_LACBR Length = 274 Score = 101 bits (251), Expect = 1e-20, Method: Compositional matrix adjust. 
Identities = 66/173 (38%), Positives = 94/173 (54%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMG-----GGGVDGAIHRAAGPALLDACLKVRQ 55 ++ +I++ QGDIT+LAVD IVN AN ++G G +D IH AG L A K Sbjct: 95 IRPKIYLWQGDITQLAVDAIVNPANSRMLGCFIPNHGCLDNQIHTKAGIQLRLADQKAMA 154 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYT 114 + TG A +T +LPAK V+HTVGPV QLL D+Y + L+L + Sbjct: 155 GERLEATGKAKLTPGFNLPAKFVIHTVGPVIIHQVTPLRRQLLADSYQSCLKLAEQKDLS 214 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY 167 +AF ISTG + +P AA+IAV TV+++++ H V F + + LY Sbjct: 215 ELAFCCISTGEFRFPHDLAAQIAVNTVNDYLSSHINAPDVIFAVNSDLDKALY 267 >UniRef50_C9RQW9 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=C9RQW9_FIBSS Length = 347 Score = 101 bits (251), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 56/166 (33%), Positives = 90/166 (54%), Gaps = 5/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + +V+ DI+++ D IVN+AN + + GGG + I+ AAG D L R++ G Sbjct: 3 LRIVRNDISRVRADAIVNSANKNPVCGGGAEYHIYEAAG---YDKLLAAREKIGVLDVAE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 ++ A L AK ++H VGP W GGE E L Y +L S+AFP IS+G Sbjct: 60 VAVSSAFALKAKYLIHVVGPKWNGGESGETSALASCYRRALEKALELGCESIAFPLISSG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 V+ +P+ +A +IA++ + EF+ H + Q+ V +D + + E L Sbjct: 120 VFRFPKDSALKIALQAIGEFLQSHEMDVQL--VVFDRKAFDVSEEL 163 >UniRef50_D2V113 Appr-1-p domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2V113_NAEGR Length = 220 Score = 100 bits (249), Expect = 2e-20, Method: Compositional matrix adjust. 
Identities = 58/180 (32%), Positives = 97/180 (53%), Gaps = 15/180 (8%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-DCPTG 63 + V +GD+T VDVIVNAAN L G+ GAI + G + K+ + G + G Sbjct: 34 LQVRKGDLTMEKVDVIVNAANCRLQHMSGLAGAIVKNGGQIIQKESNKLIKDLGRELENG 93 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLL----QDAYLN-----SLRLVAANSYT 114 V T++GDLP K + H VGP+W + N+ + L +D L SL + + + Sbjct: 94 EVVETISGDLPCKTLYHAVGPIWSSRKANDFKTLGAEQEDFELGMCVEASLNMAVESGLS 153 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH-----ALPEQVYFVCYDEENAHLYER 169 S++ PAIS+G++G+P+ A++ TV+EF+ + A +V F +D+E +++ + Sbjct: 154 SISLPAISSGIFGFPKDRCAKVLFNTVTEFLKSNKDNIKADRFEVRFTNFDDETCNIFSK 213 >UniRef50_C3Y6H9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y6H9_BRAFL Length = 2209 Score = 100 bits (248), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 64/176 (36%), Positives = 93/176 (52%), Gaps = 5/176 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V + D+T+ VDVIVNAAN L GG+ +I AGP L C K+ +++ G Sbjct: 1144 VTVRKDDLTRHVVDVIVNAANRDLKHIGGLAKSISDVAGPVLQSECDKITRRRS-LLDGQ 1202 Query: 65 AVITLAGDLPA-KAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 V+T AG + K ++H VGP+W+GG + E L DA SL +TS+A PAIS+ Sbjct: 1203 VVVTSAGAMTTCKEIIHAVGPLWQGGFRREADALYDAAYGSLEEAGRRGHTSIAIPAISS 1262 Query: 124 GVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLY-ERLLTQQGD 176 G+Y +P A + V+ V EF R + V V D+ + E L ++ GD Sbjct: 1263 GIYSFPVDQCANLIVEAVDEFWKNNRSSTLSLVELVNNDDRTVDAFVEALTSRHGD 1318 Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 38/176 (21%), Positives = 82/176 (46%), Gaps = 7/176 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I +Q +I VDV+VN+ + +L + G + +I GP L + Q+G Sbjct: 1393 ITAMQANIASQRVDVMVNSTSHNLNLNSGQLSKSILDRGGPELQTLVNNAKAQKGIQSLA 1452 Query: 64 HAVITLAGD--LPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 I +G L + V+H+ W GG+ + +++L++ L++ + ++A PA+ Sbjct: 1453 DGDILESGPAGLNVQTVIHSALCRWDGGQGDSEKVLRELVRKCLKVAEEGGHKTIAIPAM 1512 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCY--DEENAHLYERLLTQ 173 TG +P AE ++ ++ E++ F+ + D ++ ++ ++T+ Sbjct: 1513 GTGGLHFPHEVVAEALFGEAVDYFKQNPQSSIEEIRFIVWEGDPKSMVAFDEIMTK 1568 >UniRef50_Q2ITR2 Appr-1-p processing n=1 Tax=Rhodopseudomonas palustris HaA2 RepID=Q2ITR2_RHOP2 Length = 127 Score = 99.8 bits (247), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 53/118 (44%), Positives = 69/118 (58%), Gaps = 4/118 (3%) Query: 46 LLDACLKVRQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSL 105 +L AC K+ G C TG A ITL DLPA+ V+H VGPVW GG ED+ L Y +L Sbjct: 1 MLAACRKL----GGCATGDAKITLGYDLPARHVIHAVGPVWHGGRSGEDEALASCYRRAL 56 Query: 106 RLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEEN 163 +L + S+AF AISTGVYG+P AA IAV + + A ++V F C+ E + Sbjct: 57 QLCRQHGLASIAFSAISTGVYGFPPERAAPIAVAACIDALRTAAPVDRVVFCCFSEPS 114 >UniRef50_Q22CT8 Appr-1-p processing enzyme family protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22CT8_TETTH Length = 472 Score = 99.8 bits (247), Expect = 4e-20, Method: Compositional matrix adjust. 
Identities = 54/160 (33%), Positives = 82/160 (51%), Gaps = 2/160 (1%) Query: 17 VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLPAK 76 VD IVNAAN L GGGV GAI R G + + + + + G +V T AG LP K Sbjct: 4 VDAIVNAANNFLAHGGGVAGAICRKGGRIIQNQSYDIIKIRNRIENGESVTTEAGQLPCK 63 Query: 77 AVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEI 136 V+HTVGP+W G+ NE + L LR S++ PAIS+G++G+P+ A+I Sbjct: 64 KVIHTVGPIWEDGDSNEKEELAKCMETILREAKFYKLKSISIPAISSGIFGFPKYLCAKI 123 Query: 137 AVKTVSEFITRHALP--EQVYFVCYDEENAHLYERLLTQQ 174 ++ + + E++ F +D E ++ +Q Sbjct: 124 LLEETQKLLKYDYSNQFEEIRFCNFDNETVQVFAEEFQKQ 163 >UniRef50_C4FT52 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FT52_9FIRM Length = 263 Score = 99.8 bits (247), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 66/183 (36%), Positives = 96/183 (52%), Gaps = 11/183 (6%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQ 56 + +++ QGDIT+LAVD IVNAAN +++G +D IH AG AL AC +++ Sbjct: 73 RPSLYLWQGDITRLAVDAIVNAANSAMLGCFEPNHYCIDNQIHTFAGVALRLACADLKKA 132 Query: 57 QG--DCPTGHAVITLAGDLPAKAVVHTVGPVWRG---GEQNEDQLLQDAYLNSLRLVAAN 111 +G P G A++T +LPAK V+HTVGP +D LL+ AY L Sbjct: 133 RGGKPLPVGQALMTSGFNLPAKQVIHTVGPRIHHLPVSPMMQD-LLKKAYRACLACADQA 191 Query: 112 SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 ++AF ISTG + YP A IA++TVS ++ +V F + + LY LL Sbjct: 192 GLATIAFCCISTGEFSYPIEEATPIAIETVSAYLAETGSKLKVIFNVWTDSQYQLYHDLL 251 Query: 172 TQQ 174 + Sbjct: 252 NSK 254 >UniRef50_A8H4N3 Appr-1-p processing domain protein n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H4N3_SHEPA Length = 304 Score = 99.0 bits (245), Expect = 6e-20, Method: Compositional matrix adjust. 
Identities = 69/181 (38%), Positives = 103/181 (56%), Gaps = 15/181 (8%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMG-----GGGVDGAIHRAAGPALLDACLKVRQQQ 57 T+I + +GDIT LAVD IVNAAN ++G +D AIH AG L C + + Q Sbjct: 120 TKIILWKGDITTLAVDAIVNAANNQMLGCFQPQHKCIDNAIHNRAGAQLRADCEVIMELQ 179 Query: 58 GDC-PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ--NEDQLLQDAYLNSLRLVA-ANSY 113 G+ TG A IT A +LP+K V+HTVGP+ + Q + Q L +Y + L L Sbjct: 180 GNIEETGIAKITRAYNLPSKFVIHTVGPIVQNMIQPIHAGQ-LASSYRSILTLAKQTERI 238 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ---VYFVCYDEENAHLYERL 170 S+AF +ISTG++GYP A +A+ TV++++ + P+Q + F + E + H+Y+ Sbjct: 239 RSLAFCSISTGIFGYPIEQATRVALDTVTQWLMEN--PDQFDTIVFNVFSEYDHHVYQSA 296 Query: 171 L 171 L Sbjct: 297 L 297 >UniRef50_C5CIT5 Appr-1-p processing domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIT5_KOSOT Length = 187 Score = 98.6 bits (244), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 56/168 (33%), Positives = 91/168 (54%), Gaps = 4/168 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +VQGDITK VD IVNAAN L GGGV GAI RA G + + ++ ++ G Sbjct: 13 IQIVQGDITKEEVDAIVNAANGYLRHGGGVAGAILRAGGKIIQEESDRIIRKNGPLEVSE 72 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T AG L K ++H GP R G++N ++LL +++LN+ + +++ PA+S+G Sbjct: 73 VAVTGAGSLHPKYIIHVHGP--RYGQENVEELLYESFLNAFKTAGKLGVKTLSVPAVSSG 130 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVC-YDEENAHLYERL 170 ++G P+ A + V + + P + VC D ++E++ Sbjct: 131 IFGVPKDLCARCFFRAVEYYFENYKDTPLSLIRVCNIDRATTEVFEKV 178 >UniRef50_B1KG04 Appr-1-p processing domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KG04_SHEWM Length = 296 Score = 98.6 bits (244), Expect = 8e-20, Method: Compositional matrix adjust. 
Identities = 65/179 (36%), Positives = 96/179 (53%), Gaps = 10/179 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 ++I + GDIT+L +D + NAAN ++G +D AI+ AAGP L + C ++ Q Q Sbjct: 110 SKISIWNGDITRLKIDAVTNAANAQMLGCFQPFHSCIDNAINCAAGPQLREDCNQLMQLQ 169 Query: 58 G-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGG---EQNEDQLLQDAYLNSLRLVAANSY 113 G D TG A IT A +LP+K V+HTVGP+ + G + L Y L L A Sbjct: 170 GSDETTGSAKITRAYNLPSKFVLHTVGPIIQHGAVPSPRQIDELASCYDACLSLAAEAGA 229 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSE-FITRHALPEQVYFVCYDEENAHLYERLL 171 SVA ISTGV+GYP AA +A++ V+ F+ + + F + + +Y R + Sbjct: 230 QSVAVCGISTGVFGYPAEKAANVALQAVANWFLVNPDKLDHLVFNTFGDNATEIYHRAI 288 >UniRef50_Q8B4N1 ORF-1 n=7 Tax=Infectious spleen and kidney necrosis virus RepID=Q8B4N1_ISKNV Length = 566 Score = 98.6 bits (244), Expect = 9e-20, Method: Compositional matrix adjust. Identities = 70/174 (40%), Positives = 95/174 (54%), Gaps = 8/174 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +T + VV DIT L VD IVNAAN +GGGGVDG IHR AG L C + G Sbjct: 389 QTNVSVVLDDITSLRVDAIVNAANTVGLGGGGVDGRIHRVAGRELKREC----RTLGGIG 444 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGE---QNEDQLLQDAYLNSLRLVAANSYTSVAF 118 G A IT LPA V+HTVGP+ G+ Q + ++L Y+ SL + AN ++AF Sbjct: 445 FGEAKITGGYRLPATYVIHTVGPIINAGQRPTQADKRVLTSCYIQSLHVAQANGVRTIAF 504 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLL 171 P+ISTGVY YP A +A+ +V ++ +H + + F Y + +Y L Sbjct: 505 PSISTGVYNYPIEDAVHVAMSSVRAYVIQHPGAFDHIVFCTYSNADFDVYNSQL 558 >UniRef50_C3Y5X0 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y5X0_BRAFL Length = 970 Score = 98.2 bits (243), Expect = 1e-19, Method: Compositional matrix adjust. 
Identities = 61/145 (42%), Positives = 79/145 (54%), Gaps = 4/145 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I V +GDIT+ VDVI NAAN L G GV GAI RA GP++ + G Sbjct: 571 QIVVARGDITQQPVDVIANAANEYLSHGSGVAGAISRAGGPSVQQESSYHVKTFGRVRVT 630 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNED--QLLQDAYLNSLRLVAAN-SYTSVAFPA 120 V+T G LP K ++H VGP W G +NE+ QL Q Y N L +A SVA PA Sbjct: 631 ETVVTRGGQLPCKHIIHAVGPRWERGHENENERQLRQTCY-NILTAASATLRARSVAIPA 689 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI 145 IS+G++G P+ AE V + F+ Sbjct: 690 ISSGIFGMPKQKCAESLVSGLERFL 714 >UniRef50_C5VD03 Appr-1-p processing enzyme family protein n=2 Tax=Corynebacterium matruchotii RepID=C5VD03_9CORY Length = 274 Score = 97.4 bits (241), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 62/135 (45%), Positives = 81/135 (60%), Gaps = 8/135 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLK-VRQQ 56 +RI + +GDIT+L VD IVNAAN L+G VD AIH AAG L AC V Sbjct: 87 SRIRLWRGDITRLDVDGIVNAANNKLLGCFRPGHTCVDNAIHSAAGLQLRQACADLVPSP 146 Query: 57 QGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL--LQDAYLNSLRLVAANSYT 114 + PTG A IT +LPA+ V+HTVGP+ G E N Q+ L +Y++ L L ++ Sbjct: 147 DYEEPTGSARITPGFNLPARYVLHTVGPIVAGREANRQQVAELSASYISCLNLAHSSGLE 206 Query: 115 SVAFPAISTGVYGYP 129 S+AF ISTGV+G+P Sbjct: 207 SLAFCCISTGVFGFP 221 >UniRef50_A0CX10 Chromosome undetermined scaffold_3, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CX10_PARTE Length = 183 Score = 97.4 bits (241), Expect = 2e-19, Method: Compositional matrix adjust. 
Identities = 61/186 (32%), Positives = 95/186 (51%), Gaps = 17/186 (9%) Query: 2 KTRIHVVQGDITKLA-VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + + +++ +I KL VD IVNAAN L+ GGGV GAI +AAG L C + QQ G Sbjct: 3 RFSVKIIKENIVKLVDVDAIVNAANQELLPGGGVCGAIFQAAGRELERECQQYIQQYGIV 62 Query: 61 PTGHAVITLAGDLPA---KAVVHTVGPVWRGGEQNEDQL------LQDAYLNSLRLVAAN 111 PT +T + L K ++H VGP + ED+L + + N L L Sbjct: 63 PTSKLAVTSSCQLKKNNIKYIIHAVGPKYFQSSSPEDELQICVNNILNQSFNVLEL---- 118 Query: 112 SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVC-YDEENAHLYERL 170 SVA PAIS+G+YG+P+ A+I + E+ + + +C +D+E +++++ Sbjct: 119 --KSVAIPAISSGIYGFPKGLCAQIFKLVIEEYQKDTSNKQGEIILCNFDQETTTIFQKV 176 Query: 171 LTQQGD 176 QQ Sbjct: 177 FQQQNS 182 >UniRef50_A8FQZ3 Putative uncharacterized protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FQZ3_SHESH Length = 268 Score = 97.1 bits (240), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 72/180 (40%), Positives = 99/180 (55%), Gaps = 10/180 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 K + + QGDIT+LA D IVNAAN L G +D AIH A+G L D C + + Sbjct: 88 KADVKLWQGDITRLAADAIVNAANKELQGCFQPLHSCIDNAIHSASGVRLRDDCAVIIKA 147 Query: 57 QGDCP-TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAAN-SY 113 QG T A IT +LP + V+HTVGP+ +G E Q LLQ Y N L L Sbjct: 148 QGQFEETAKAKITSGYNLPCQYVLHTVGPIVQGNVTGEHQKLLQLCYENCLALADQTLGI 207 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLL 171 S+AF ISTGV+GYP+ AA+ AV+ V +++ ++ + V F + E+ LY++ L Sbjct: 208 NSIAFCCISTGVFGYPQKPAAQAAVRAVQQWLLNNPNSNIDTVIFNTFKPEDTRLYQQFL 267 >UniRef50_C1SPD7 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SPD7_9BACT Length = 177 Score = 97.1 bits (240), Expect = 2e-19, Method: Compositional matrix adjust. 
Identities = 56/165 (33%), Positives = 85/165 (51%), Gaps = 5/165 (3%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + + DITK D IVN AN L GGV GAI G ++ + C + G CP Sbjct: 10 TVLEIALRDITKQTTDAIVNPANRQLKMTGGVAGAIAAKGGRSIQEEC----DEIGSCPL 65 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV+T AG L ++H VGP + G + ++ L+ A + S+ L N+ + +A PAIS Sbjct: 66 GEAVMTGAGFLKTTYIIHAVGPRY-GVDPEPEKYLKSAVMKSIELADKNNLSDIAIPAIS 124 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY 167 G++GYP AAE+ + V E I ++ + E + ++ Sbjct: 125 AGIFGYPLEDAAEVIISAVIEKILSGTKLNKILLCLFTENDYMVF 169 >UniRef50_UPI000194CBCB PREDICTED: poly (ADP-ribose) polymerase family, member 14 n=1 Tax=Taeniopygia guttata RepID=UPI000194CBCB Length = 1883 Score = 97.1 bits (240), Expect = 3e-19, Method: Composition-based stats. Identities = 57/178 (32%), Positives = 95/178 (53%), Gaps = 8/178 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +T I + D+ VDV+VNA+N L GG+ A+ RAAGP L + C ++ ++ G+ Sbjct: 875 ETVIALYNADLCTHPVDVVVNASNEKLKHIGGLADALSRAAGPVLQEECDELVRKLGNLQ 934 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ---LLQDAYLNSLRLVAANSYTSVAF 118 G AVIT AG LP K V+H VGP W +N LL+ L+L A+ + S+A Sbjct: 935 PGCAVITHAGKLPCKNVIHAVGPRWSA--ENSVMCVWLLRKTVKKCLQLAEAHKHCSIAL 992 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQ 173 PAIS G++G+P + ++ E + ++ ++V+ V + ++N + + + Sbjct: 993 PAISGGIFGFPMELCTYSIISSIKETLEESKGNSTLKEVHLVGFAQDNIQAFSKAFKE 1050 Score = 72.0 bits (175), Expect = 9e-12, Method: Composition-based stats. 
Identities = 56/146 (38%), Positives = 73/146 (50%), Gaps = 20/146 (13%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 V +GDITK D IVN N + GV AI AG A+ D C + QQ G + + Sbjct: 1309 VAEGDITKEEGDAIVNITNQAFNLKTGVSRAILNGAGKAVEDECGVLAQQTGK----NYI 1364 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 IT AG+LP K ++H V QN+ + L L L YTSVAFPAI T Sbjct: 1365 ITQAGNLPCKKIMHFV-------YQNDIRSLVSQVLQECEL---QQYTSVAFPAIGT--- 1411 Query: 127 GYPRAAAAEIA---VKTVSEFITRHA 149 G R AAE+A + V++F R++ Sbjct: 1412 GEARRNAAEVADNMIDAVTDFAKRNS 1437 Score = 54.7 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 44/163 (26%), Positives = 75/163 (46%), Gaps = 7/163 (4%) Query: 10 GDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVIT 68 G I A ++V + L + G + A+ AGP L K + G P +V+ Sbjct: 1099 GSIEDAATSIVVVSVGKDLQLDKGPLGKALLSKAGPMLQTGLNK--EGGGRMPEEGSVLK 1156 Query: 69 LAG-DLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYG 127 G +L V+H V P+W + ++L D L + S S+ FPAI TG Sbjct: 1157 TKGYNLACSVVLHAVVPMW-SQKNTPSKVLGDIITKCLEIAEELSLKSITFPAIGTGNLE 1215 Query: 128 YPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLYE 168 +PR+ A++ V EF + + E+V+F+ + ++ A++ E Sbjct: 1216 FPRSVVAKLLFDKVFEFSSEKRVNSLEEVHFLLHTKDTANIQE 1258 >UniRef50_UPI0000E4815A PREDICTED: similar to LRP16 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4815A Length = 415 Score = 96.3 bits (238), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 51/88 (57%), Positives = 60/88 (68%), Gaps = 4/88 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ V QGDITKL VD IVNAAN SL+GGGGVDGAIHRAAG LL C K+ C Sbjct: 159 LNNRVSVWQGDITKLDVDCIVNAANRSLLGGGGVDGAIHRAAGSNLLQECKKL----AGC 214 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRG 88 TG A +T LP++ V+HTVGP+ G Sbjct: 215 ETGDAKLTAGYLLPSRYVLHTVGPMVYG 242 Score = 53.9 bits (128), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 28/60 (46%), Positives = 40/60 (66%), Gaps = 5/60 (8%) Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQV---YFVCYDEENAHLYERLL 171 SVAFP ISTGVYGYP+ A+ +A+ TV E++ + PE+V F + + + +YERLL Sbjct: 337 SVAFPCISTGVYGYPQEEASRVALGTVREWLEEN--PEEVDRIVFCIFLDRDLKVYERLL 394 >UniRef50_D0MWM6 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MWM6_PHYIN Length = 579 Score = 95.1 bits (235), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 17/188 (9%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 +I + +GDIT L IVNAAN +L+G +D IH AGP L AC ++ ++ Sbjct: 105 QIALWKGDITTLRATAIVNAANSALLGCFQPSHKCIDNVIHSMAGPRLRAACHEIMSRKA 164 Query: 59 -DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRLVAAN--- 111 + P G+A IT LP+ V+HTVGP R GEQ E LQ Y SL L+ Sbjct: 165 HEEPGGNAQITQGFALPSSFVIHTVGPQLRHGEQPTAAECDQLQSCYTKSLDLLLKKVGD 224 Query: 112 --SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE---QVYFVCYDEENAHL 166 + S+AF ISTG++ +P A +AV +V E++ +H ++ F + + + L Sbjct: 225 TEQHVSIAFSCISTGLFAFPSDVAVPLAVNSVLEWLNQHQEETRGWKIIFNTFLKRDYDL 284 Query: 167 YERLLTQQ 174 Y+ + + Sbjct: 285 YKSFIESK 292 >UniRef50_UPI000194CBC9 PREDICTED: similar to B aggressive lymphoma n=1 Tax=Taeniopygia guttata RepID=UPI000194CBC9 Length = 718 Score = 94.7 bits (234), Expect = 1e-18, Method: Composition-based stats. 
Identities = 54/144 (37%), Positives = 79/144 (54%), Gaps = 3/144 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V + D+T+ VD++VNAAN L G G+ A+ +A GP + + Q+ G G Sbjct: 110 ICVYKDDLTRHKVDIVVNAANEYLEHGAGLALALVKAGGPEIKEESKLYVQRFGKVKVGD 169 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAA--NSYTSVAFPAI 121 +T G LP K ++H VGP W E+ LLQ A LN L V+A + SVA PA+ Sbjct: 170 IAVTGGGKLPCKGIIHVVGPRWYALEKERCCYLLQKAILNVLHYVSAPGKALKSVAIPAV 229 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI 145 S+G+Y +P +++ V V EF+ Sbjct: 230 SSGIYAFPIDLCSQVIVMAVKEFV 253 >UniRef50_A3LYE6 Putative uncharacterized protein n=1 Tax=Pichia stipitis RepID=A3LYE6_PICST Length = 583 Score = 94.7 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 65/186 (34%), Positives = 101/186 (54%), Gaps = 17/186 (9%) Query: 1 MKTRIHVVQGDITKLA-VDVIVNAANPSLMG-----GGGVDGAIHRAAGPALLDACLKVR 54 + ++ + +GDIT ++ V IVNAAN +L+G +D IH AAGP L AC + Sbjct: 91 LSPKLSIWKGDITTISDVTAIVNAANSALLGCFQPSHRCIDNIIHAAAGPDLRRACYNLV 150 Query: 55 QQQG--DCPTGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDA--YLNSLRLVA 109 +Q+ P G A IT +LPAK V+HTVGP + G E N++++ Q A Y +SL + Sbjct: 151 EQRDFTQEPVGSAQITPGFNLPAKMVIHTVGPSLLPGSEPNQEEISQLAACYTSSLAKLE 210 Query: 110 ANSY----TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEEN 163 S+ F ISTG++ +P A+ IA+++V + + H+ +V F + E N Sbjct: 211 EQEEDGNDKSIVFCCISTGLFSFPNDIASNIAIESVRNYFSEHPHSSISEVIFNVFTETN 270 Query: 164 AHLYER 169 LY + Sbjct: 271 LKLYRQ 276 >UniRef50_UPI0001C38755 appr-1-p processing domain-containing protein n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38755 Length = 575 Score = 94.7 bits (234), Expect = 1e-18, Method: Composition-based stats. 
Identities = 56/147 (38%), Positives = 75/147 (51%), Gaps = 16/147 (10%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGG------------VDGAIHRAAGPALLD 48 + RI V+QGDIT+ VD IV + NP L+ VD IH++AG L Sbjct: 433 LSDRITVIQGDITQQPVDAIVCSTNPHLLPNKKWGSFFMSSDHPEVDIMIHKSAGVELKQ 492 Query: 49 ACLKVRQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV 108 C Q+ C G A IT +LPA+ V+HTV P W+ GE ++LL Y N L LV Sbjct: 493 EC----QKLNGCKVGEAKITPGYNLPAEWVIHTVSPTWQNGEVQAEKLLAKCYQNCLNLV 548 Query: 109 AANSYTSVAFPAISTGVYGYPRAAAAE 135 + S+AFPA+ TG + AA+ Sbjct: 549 NSQEIESIAFPALGTGTGKFTLEKAAK 575 >UniRef50_UPI0000E4D641 UPI0000E4D641 related cluster n=2 Tax=Danio rerio RepID=UPI0000E4D641 Length = 692 Score = 94.4 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 56/143 (39%), Positives = 79/143 (55%), Gaps = 2/143 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V + DI L+VD +VNAAN L GGGV A+ +AAG L + C + G G Sbjct: 3 VTVRKADICTLSVDAVVNAANEDLQHGGGVAYALLQAAGRCLQEYCDLHIKVNGPLTPGD 62 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNE--DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A+IT AG LP K VVH VGP +R +++ Q L+ A SL ++ +S+A P IS Sbjct: 63 AIITDAGRLPCKYVVHAVGPRFRASDRHTAVQQCLRRAVRESLNQASSKKCSSIAIPVIS 122 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI 145 +G++G P E K V ++I Sbjct: 123 SGIFGCPLDLCTESITKEVRQYI 145 Score = 40.8 bits (94), Expect = 0.019, Method: Composition-based stats. 
Identities = 37/133 (27%), Positives = 58/133 (43%), Gaps = 11/133 (8%) Query: 15 LAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDL 73 L DVIVN + + + G V A+ +AAG L + G VIT +L Sbjct: 212 LQADVIVNTISEDMDLRKGAVSNALLQAAGHQLQSEIKRASNH------GEIVITDGYNL 265 Query: 74 PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAA 133 V H + ++ +Q+++ N L+ +SV FPAI TG G+P+ Sbjct: 266 KCSRVFHVMIIYLFTLQKVLNQIIR----NCLKNAETQGLSSVVFPAIGTGNLGFPKDLV 321 Query: 134 AEIAVKTVSEFIT 146 A+ + V +F T Sbjct: 322 AKNMLTEVQQFNT 334 >UniRef50_C9YUB3 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YUB3_STRSW Length = 333 Score = 93.2 bits (230), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 67/146 (45%), Positives = 90/146 (61%), Gaps = 8/146 (5%) Query: 9 QGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGDC-PT 62 +GD+T LA D +VNAAN L+G +D A+H AAGP L D C + QG PT Sbjct: 156 RGDLTTLAADAVVNAANSRLLGCFRPRHPCIDNALHNAAGPRLRDDCHTIVTAQGTREPT 215 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVA-ANSYTSVAFPA 120 G A IT LPA+ V+HTVGP+ +G +D Q L +Y + L L A S +VAF A Sbjct: 216 GTAKITRGYHLPARHVLHTVGPLVQGRPHTDDAQALASSYRSCLDLAAQVESVRTVAFCA 275 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT 146 +STGV+GYP+ AA +A++TV ++IT Sbjct: 276 VSTGVFGYPKDEAASVALRTVEDWIT 301 >UniRef50_D1R847 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R847_9CHLA Length = 411 Score = 93.2 bits (230), Expect = 4e-18, Method: Compositional matrix adjust. 
Identities = 58/154 (37%), Positives = 83/154 (53%), Gaps = 16/154 (10%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGP--------ALLDA---C 50 KT++ +V+G D IVNAAN L+GGGG+DG I +G L A Sbjct: 203 KTKVVLVKGSTLDQNTDAIVNAANERLLGGGGIDGQIWSRSGALSGAKDSGEFLKAEIMP 262 Query: 51 LKVRQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAA 110 +K G+ P G AVIT A L ++ ++H VGP RG + ++L++AYLNSL L+ A Sbjct: 263 IKANLPSGNLPNGEAVITRALGLNSRYIIHAVGP--RGAQP---KVLRNAYLNSLELLDA 317 Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEF 144 N S++F IS ++GY AA I V + + Sbjct: 318 NQLKSISFCCISQSIFGYSPKDAAPIVVDLIRRY 351 >UniRef50_UPI000196CD43 hypothetical protein CATMIT_02190 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CD43 Length = 239 Score = 92.8 bits (229), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 56/157 (35%), Positives = 83/157 (52%), Gaps = 7/157 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPT 62 + +++ +I +A D IV AN +L G G AI AAG L AC ++ G C T Sbjct: 2 KFKIIKANIVDVASDAIVLPANEALKEGSGTSKAIFTAAGRKELTKAC----KELGHCST 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A+ TLA +L +K ++H V P W GE +E LL AYL SL + SVAFP ++ Sbjct: 58 GSAIPTLAYNLSSKYIIHAVVPKWIDGEHSEYDLLSSAYLASLNIAEVMGCESVAFPLLA 117 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY 159 +G G+ + A IA +++ F ++V+ V Y Sbjct: 118 SGNNGFDKQLAVRIAEESIKSF--EGVNLKKVFLVVY 152 >UniRef50_A1L291 LOC799852 protein (Fragment) n=5 Tax=Danio rerio RepID=A1L291_DANRE Length = 458 Score = 92.4 bits (228), Expect = 5e-18, Method: Compositional matrix adjust. 
Identities = 56/145 (38%), Positives = 83/145 (57%), Gaps = 5/145 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V + D+T+ V+ +VNAAN L GGG+ A+ A GP + + ++ G TG Sbjct: 72 ISVWKDDLTQHKVEAVVNAANEKLQHGGGLAQALSMAGGPQIQRWSDDIIKRYGYVKTGE 131 Query: 65 AVITLAGDLPAKAVVHTVGP-VWRGGEQNE----DQLLQDAYLNSLRLVAANSYTSVAFP 119 AV+T AG+LP K ++H VGP V + Q E LL +A + L+ V + TSVA P Sbjct: 132 AVLTPAGNLPFKYIIHAVGPKVPQNPTQKEIGDATPLLYNAITSILQTVLRENITSVAIP 191 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEF 144 A+S+G++ +PR A+I VK + F Sbjct: 192 ALSSGLFNFPRDRCADIIVKAIKTF 216 Score = 60.1 bits (144), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 47/161 (29%), Positives = 78/161 (48%), Gaps = 5/161 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +++ +G I VDV+VN P + G + AI + AG + + K + + Sbjct: 285 LYLKRGAIEDEMVDVLVNTIAPDCKLHQGVISRAILKKAGDEIQNEIYKKKSNTSFYSSK 344 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 T +L K+V HTV + NE +L + L SL+ AA Y S++FPAI T Sbjct: 345 VLYKTKGYNLYCKSVFHTVCAHRSDSKSNE--ILFNVVLESLK-KAAEDYESISFPAIGT 401 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEEN 163 G + + A+I + V+EF ++ + VYFV + ++N Sbjct: 402 GNLDFKKWEVAKIMMDAVAEFAKQNKRKKLDVYFVVFPKDN 442 >UniRef50_Q4SK43 Chromosome 2 SCAF14570, whole genome shotgun sequence. (Fragment) n=4 Tax=Tetraodontidae RepID=Q4SK43_TETNG Length = 418 Score = 92.0 bits (227), Expect = 9e-18, Method: Compositional matrix adjust. 
Identities = 54/151 (35%), Positives = 79/151 (52%), Gaps = 2/151 (1%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + V + D+T VD +VNAAN L GG+ A+ +A G + + ++ G Sbjct: 54 RVTVSVHKADLTNFPVDAVVNAANERLQHVGGIALALSKAGGSQIQQDSDEYIRKNGVLR 113 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED--QLLQDAYLNSLRLVAANSYTSVAFP 119 TG +V AG LP K ++HTVGP G LL+ A LNSL+ SVA P Sbjct: 114 TGESVAMDAGSLPCKKIIHTVGPHVTGHSLTASAANLLEKAVLNSLKKADECRLRSVALP 173 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHAL 150 AIS+G++GYP A+ VK V +F ++ + Sbjct: 174 AISSGIFGYPLKECADTIVKAVRDFCEKYQI 204 Score = 55.5 bits (132), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 47/150 (31%), Positives = 68/150 (45%), Gaps = 8/150 (5%) Query: 10 GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 G I + +VIVN G + AI + AG +L A LK + + ++T Sbjct: 268 GRIDEEQTNVIVNTTQKD-SWDGQISTAILKKAGTKMLKA-LKC----ANVGNRNVIVTE 321 Query: 70 AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYP 129 +L V HT+ G Q+L DA L+L A +S S+AFPAI TG G Sbjct: 322 PYNLRCAEVYHTLFTA--GSTDKAYQILTDAVSECLQLAANHSRQSIAFPAIGTGGRGLE 379 Query: 130 RAAAAEIAVKTVSEFITRHALPEQVYFVCY 159 + A I + V +F + + +VYFV Y Sbjct: 380 KEKVASIMSEAVFKFANQSSKQMEVYFVIY 409 >UniRef50_Q0CEI7 Putative uncharacterized protein n=1 Tax=Aspergillus terreus NIH2624 RepID=Q0CEI7_ASPTN Length = 524 Score = 91.7 bits (226), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 55/141 (39%), Positives = 74/141 (52%), Gaps = 5/141 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + DIT L VD IV + G GG+DGA+H AAGP LLDAC + G C Sbjct: 319 ISLAHTDITTLEVDCIVTGIS-EPRGQGGLDGAVHAAAGPRLLDACNDL----GKCWVEE 373 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T A +LP K V+HTV P + G + LL+ Y L + ++AFPA+STG Sbjct: 374 VQVTDAYNLPCKKVIHTVSPPYADGSADSKWLLRACYRRCLEIAIEGGMRTIAFPALSTG 433 Query: 125 VYGYPRAAAAEIAVKTVSEFI 145 G+ AA A++ V F+ Sbjct: 434 SKGFKSYEAATAALEEVRCFL 454 >UniRef50_Q8ZXT3 UPF0189 protein PAE1111 n=10 Tax=Thermoprotei RepID=Y1111_PYRAE Length = 182 Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 62/165 (37%), Positives = 87/165 (52%), Gaps = 6/165 (3%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 +++GDIT++ D IVNAAN L GGGV GAI R G + + + ++ G P G Sbjct: 12 LMRGDITEVEADAIVNAANSYLEHGGGVAGAIVRKGGQVIQEESREWVRKHGPVPVGDVA 71 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 +T AG L AK V+H VGP R G + ++ L +A N+L S+A PAISTG++ Sbjct: 72 VTSAGRLKAKYVIHAVGP--RCGVEPIEK-LAEAVKNALLKAEELGLVSIALPAISTGIF 128 Query: 127 GYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 G P AAAE + E ++ V Y EE Y++ L Sbjct: 129 GCPYDAAAEQMATAIREVAPALRSIRRILVVLYGEEA---YQKFL 170 >UniRef50_C3Y5Q2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y5Q2_BRAFL Length = 1122 Score = 89.4 bits (220), Expect = 5e-17, Method: Compositional matrix adjust. 
Identities = 54/174 (31%), Positives = 91/174 (52%), Gaps = 5/174 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ + QGDIT+ DVIV+ N SL GG+ AI A GP + AC+ ++ G G Sbjct: 708 KVFIYQGDITQEVADVIVSCNNESLDSAGGIARAISDAGGPEIRRACVDYIRRHGRLSAG 767 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAAN-SYTSVAFPAIS 122 ++ T G L + VVHTV P +Q + Q L +L+ L + + S+A PAI Sbjct: 768 QSIWTPGGRLRCQHVVHTVSP-QSSRDQTDHQQLFSTFLDLLNIAEFDLKVNSIAIPAIG 826 Query: 123 TGVYGYPRAAAAEIAVKTVSEF---ITRHALPEQVYFVCYDEENAHLYERLLTQ 173 +G+ G+P+A A++ + +S F T +L +++ V D + + ++ +Q Sbjct: 827 SGIAGFPKAVCADVMFRVISAFEDYQTPDSLLKEIRLVNIDAKTTAAFVQVFSQ 880 >UniRef50_C3Y5X5 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y5X5_BRAFL Length = 1925 Score = 89.4 bits (220), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 57/162 (35%), Positives = 79/162 (48%), Gaps = 5/162 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ V QGD+T L VDVIVNAAN L GG+ A+ +A G + C + G G Sbjct: 910 KLFVCQGDLTALQVDVIVNAANSRLSHVGGLAAALVKAGGKEIQRDCESYIRTSGQLSDG 969 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSL---RLVAANSYTSVAFPA 120 + T LP K VVH VGP W+ G +++ ++A L L A + S+ PA Sbjct: 970 DVMTTKPYRLPCKMVVHAVGPQWKSGLSEDEKGGKEANLYRAAFSSLQEAKDFHSIGIPA 1029 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYD 160 IS+GVYG+P ++ V F H +VYF D Sbjct: 1030 ISSGVYGFPIDLCVSAILEGVMSFFNIHPNCKLSEVYFTEMD 1071 Score = 65.9 bits (159), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 48/153 (31%), Positives = 70/153 (45%), Gaps = 9/153 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V QGDIT +VD I+ N L GV + R AG +L CL V QQ G+ G Sbjct: 1313 LQVQQGDITTESVDAIIVPTNNKLRLDAGVAQVVSRKAGGSLQAECLAVVQQYGELQNGA 1372 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 T AG LP + V+H P + L+D + L+ SVA PAI TG Sbjct: 1373 VATTGAGSLPCRHVLHLANP--------QPNHLKDNIKHCLQTADQKKLKSVALPAIGTG 1424 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFV 157 +A+ + ++EF+ + + P+ + V Sbjct: 1425 GINISPDQSAKGMLDGIAEFV-QQSNPQNLALV 1456 Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 49/177 (27%), Positives = 66/177 (37%), Gaps = 55/177 (31%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + + QG IT DV+VN L +G GGV A +A GP L QQ Sbjct: 1143 LQLKQGGITAEQADVLVNTVGTDLDLGQGGVASAFLKAGGPEL---------QQ------ 1187 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 P + ++ T L + N S+AFPA+ T Sbjct: 1188 ----------PLRTIIQTC----------------------LTMAHKNGLPSIAFPALGT 1215 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP----EQVYFVCYDEENAHLYE-RLLTQQG 175 G GYPR+ AA V F A P + V V YD+ ++ L T+QG Sbjct: 1216 GNLGYPRSVAASAMFDEVVSF--SQANPSTSLKHVSIVVYDQPTVQAFQAELRTRQG 1270 >UniRef50_Q6NRC6 MGC83934 protein n=3 Tax=Xenopus RepID=Q6NRC6_XENLA Length = 914 Score = 89.0 bits (219), Expect = 7e-17, Method: Composition-based stats. 
Identities = 57/162 (35%), Positives = 85/162 (52%), Gaps = 6/162 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 R+ V +GD+T+ VD +VNAAN L GG+ A+ +A G + D + ++ +G Sbjct: 81 RVSVWKGDMTRQNVDAVVNAANEDLKHFGGLALALVKAGGAVIQDESRRHIEKYKKVKSG 140 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYT-SVAFPAI 121 +T AG+LP K ++H VGP W G + +Q L++ N L V S SVA PA+ Sbjct: 141 SIAVTSAGNLPCKMIIHAVGPEWSPGINAKCEQELKEVIRNVLMQVMNESNVRSVAIPAV 200 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYD 160 S+G++ +P EI T +F T H L E + FV D Sbjct: 201 SSGIFRFPLQRCTEIIASTTKKFCDTETYHKLAE-IRFVNID 241 Score = 47.8 bits (112), Expect = 2e-04, Method: Composition-based stats. Identities = 41/157 (26%), Positives = 74/157 (47%), Gaps = 8/157 (5%) Query: 5 IHVVQGDITKLAVDVIVNA--ANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +++ +G I + VIVN+ AN +L G + AI R AG +L L + + PT Sbjct: 358 LYLTKGYIEEQKTAVIVNSLGANRNL-NEGNISKAILRKAGNSLSQEVLD--KSKYVSPT 414 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + T LP V H + + R G ++ ++L+D L + +S++FPA+ Sbjct: 415 DIMIPTRGYYLPCDFVYHVI--LQRSG-SDQKKILKDGINACLNTALRYNTSSISFPALG 471 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY 159 TG+ +P+ A++ V F + ++FV + Sbjct: 472 TGMLCFPKPVVAKVMTDEVLSFAKENPCNMDIFFVIH 508 >UniRef50_UPI0000F2CC13 PREDICTED: similar to B aggressive lymphoma long n=1 Tax=Monodelphis domestica RepID=UPI0000F2CC13 Length = 1624 Score = 88.2 bits (217), Expect = 1e-16, Method: Composition-based stats. 
Identities = 48/145 (33%), Positives = 78/145 (53%), Gaps = 7/145 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + V + D+T+ D +VNAAN L+ GG+ A+ RA GP + + Q+G+ P Sbjct: 98 QIELSVWKDDLTRHPADAVVNAANERLLHAGGLALALVRAGGPLIEKESEAIIMQRGEVP 157 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED---QLLQDAYLNSLRLVAANSY--TSV 116 T +T G LP ++H VGP W + N + Q L+ A N L V +S+ +V Sbjct: 158 TSEIAVTTGGQLPCSCIIHAVGPRW--SDWNAERCCQELERATANILNYVTNDSHGIKTV 215 Query: 117 AFPAISTGVYGYPRAAAAEIAVKTV 141 A PA+S+G++G+P +I + T+ Sbjct: 216 AIPALSSGIFGFPLELCVQIIILTI 240 Score = 65.5 bits (158), Expect = 8e-10, Method: Composition-based stats. Identities = 56/183 (30%), Positives = 88/183 (48%), Gaps = 19/183 (10%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPS-LMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 T + +++G I K VDVIVN+ + S G V AI AGP + + K + Sbjct: 294 TNLQIIEGFIEKQQVDVIVNSISASNSFDLGKVSNAILIHAGPEIEEEFSKTYSGMSE-S 352 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 + V+T +L K V H V W Q + ++L++A + L + S++FPA+ Sbjct: 353 SKLVVVTEGFNLACKHVYHVV---WPSSYQTK-KVLKEAVMRCLEKTCQENMNSISFPAL 408 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQ---VYFVCYDEENAHLYE-------RLL 171 TG G P+ A I +K + +F H P++ V FV Y +N LYE +++ Sbjct: 409 GTGNIGLPKREAISIMLKEIFQFSKNH--PQKRLLVNFVVYPNDN-ELYEVMKSELDKMI 465 Query: 172 TQQ 174 TQQ Sbjct: 466 TQQ 468 >UniRef50_C8WJT1 Appr-1-p processing domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WJT1_EGGLE Length = 255 Score = 88.2 bits (217), Expect = 1e-16, Method: Compositional matrix adjust. 
Identities = 65/146 (44%), Positives = 83/146 (56%), Gaps = 8/146 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 R+ + +GDIT LAVD IVNAAN L+G +D AIH AG L C ++ + QG Sbjct: 85 RLALWRGDITTLAVDAIVNAANSKLLGCFIPGHHCIDNAIHTFAGMQLRLVCDELMRAQG 144 Query: 59 -DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN--EDQLLQDAYLNSLRLVAANSYTS 115 D P G A +T A +LP++ VVHTVGP GE +++ L Y SL AA S Sbjct: 145 HDEPVGRAQVTSAFNLPSRFVVHTVGPQVPTGEPTAAQEEQLASCYRASLDAAAAAGVAS 204 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTV 141 +AF ISTG + +PR AA IAV V Sbjct: 205 LAFCCISTGEFRFPRERAARIAVGEV 230 >UniRef50_UPI00005A247A PREDICTED: similar to H2A histone family, member Y isoform 3 n=1 Tax=Canis lupus familiaris RepID=UPI00005A247A Length = 412 Score = 88.2 bits (217), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 48/169 (28%), Positives = 88/169 (52%), Gaps = 7/169 (4%) Query: 4 RIHVVQGDITKLA---VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +++++ +I+ LA V+ I+N N + + + + G ++A L++R++ G Sbjct: 236 KLNLIHSEISNLAGFEVEAIINPTNADIDLKDDLGNTLEKKGGKEFVEAVLELRKKNGPL 295 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 A ++ LPAK V+H PVW G ++LL+ N L L S+AFP+ Sbjct: 296 EVAGAAVSAGHGLPAKFVIHCNSPVW--GADKCEELLEKTVKNCLALADDKKLKSIAFPS 353 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLY 167 I +G G+P+ AA++ +K +S + T + + VYFV +D E+ +Y Sbjct: 354 IGSGRNGFPKQTAAQLILKAISSYFVSTMSSSIKTVYFVLFDSESIGIY 402 >UniRef50_C7Z089 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7Z089_NECH7 Length = 592 Score = 88.2 bits (217), Expect = 1e-16, Method: Compositional matrix adjust. 
Identities = 62/186 (33%), Positives = 95/186 (51%), Gaps = 17/186 (9%) Query: 3 TRIHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 T + V +GDIT L + I NAAN ++G +D IH AGP L D C ++ Q Sbjct: 99 TNLVVWRGDITTLTGITAITNAANGQMLGCFQPTHRCIDNIIHSRAGPRLRDECFQLMQD 158 Query: 57 Q-GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGG----EQNEDQLLQ--DAYLNSLRLVA 109 + D G ++T DLP+ V+HTVGP R G E QL + ++ L++L L+ Sbjct: 159 RDKDLGAGETLVTRGYDLPSPYVIHTVGPQLRRGASPTEVERRQLARCYESTLDALELLP 218 Query: 110 A--NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAH 165 A + ++A ISTG++ +P AAEIA+ TV ++ H + F + E + Sbjct: 219 AEEDGRKAIALCCISTGLFAFPAKEAAEIAILTVLSWLDNHPSTTITDIIFNTFTESDTE 278 Query: 166 LYERLL 171 +Y +L Sbjct: 279 IYSKLF 284 >UniRef50_A7HJC7 Appr-1-p processing domain protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJC7_FERNB Length = 184 Score = 88.2 bits (217), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 51/155 (32%), Positives = 76/155 (49%), Gaps = 2/155 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V GDIT +D IVNAAN L GGGV G I R GP + + ++ G G Sbjct: 10 EIEFVVGDITTQNIDAIVNAANSYLSHGGGVAGVISRKGGPTIQKESDEYVKKYGPVEPG 69 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T AG+L AK V+HTVGP+ G + D ++ ++N ++ ++A P + T Sbjct: 70 GVAVTGAGNLSAKYVLHTVGPI--GDKPQNDDIIVKCFINIIKKSDELGIKTIAIPFVGT 127 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVC 158 G++GYP E K + ++ + Q C Sbjct: 128 GIFGYPLERFIENVTKVLINYLKDYEGTLQKIIFC 162 >UniRef50_A2DTG7 Appr-1-p processing enzyme family protein n=2 Tax=Trichomonas vaginalis RepID=A2DTG7_TRIVA Length = 316 Score = 87.8 bits (216), Expect = 1e-16, Method: Compositional matrix adjust. 
Identities = 64/165 (38%), Positives = 87/165 (52%), Gaps = 13/165 (7%) Query: 10 GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPTGHAVIT 68 GD TKL D IVNAAN L GGG+ GAI AAG L AC +QG TG A +T Sbjct: 61 GDSTKLKCDAIVNAANSYLAAGGGICGAIFSAAGYEELQKAC----DEQGYTETGGAKMT 116 Query: 69 LAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGY 128 LP+K V+H VGPV G E L+ AY +L + + S+AF ISTG+YGY Sbjct: 117 PGFRLPSKYVIHAVGPV---GVHPE--ALRSAYNLTLGFMDNDKVKSIAFCCISTGIYGY 171 Query: 129 PRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERL 170 A +A+ TV +++ A +++ FV + ++ +Y Sbjct: 172 SIEKATPVALDTVRKWLEVPENLAKTDRLVFVVFMPKDQQVYSHF 216 >UniRef50_Q94JV1 At1g69340/F10D13.28 n=23 Tax=Embryophyta RepID=Q94JV1_ARATH Length = 562 Score = 87.8 bits (216), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 58/173 (33%), Positives = 87/173 (50%), Gaps = 7/173 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI++ +G+ L VD +VN+ N +L G +H AAGP L + C + G C Sbjct: 83 INSRIYLWRGEPWNLEVDAVVNSTNENLDEAHSSPG-LHVAAGPGLAEQCATL----GGC 137 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T A DLPA+ V+HTVGP + + L Y + L L+ + S+A Sbjct: 138 RTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYRSCLELLIDSGLQSIALG 197 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLL 171 I T YPR AA +A++TV F+ + V F + +Y+RLL Sbjct: 198 CIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKISAVVFCTTTSSDTEIYKRLL 250 >UniRef50_B0P6L4 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P6L4_9FIRM Length = 168 Score = 87.0 bits (214), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 60/170 (35%), Positives = 84/170 (49%), Gaps = 6/170 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGP-ALLDACLKVRQQQGDCPTG 63 + +V GDITK+ D IVNAA+ L G+ AI AA LL AC K+ G C G Sbjct: 1 MRLVLGDITKMDTDAIVNAASSDLRPCPGICSAIFAAADTEKLLAACKKI----GRCRIG 56 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT + L K ++H G W G N+ L D Y ++L+ AA SVA P + + Sbjct: 57 KAVITPSFGLACKYIIHVAGVGWYSGRYNDRMLFADCYRSALQKAAAYHCKSVAIPLMFS 116 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 G + PRA A +I V F H E + V Y + L +++++ Sbjct: 117 GDFHIPRAQALQIVADVVGGFEKSHPSLE-ISLVLYKQSIYDLAKKIISN 165 >UniRef50_C0W547 Appr-1-p processing domain protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W547_9ACTO Length = 285 Score = 87.0 bits (214), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 58/177 (32%), Positives = 88/177 (49%), Gaps = 9/177 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + T++ + +GD+T L +VNAAN +++G +D +H AAGP L C + Sbjct: 103 LGTQVALWRGDLTTLRAGGVVNAANSAMLGCFVPGHRCIDNVLHAAAGPGLRAECARYMD 162 Query: 56 QQGDCP--TGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDAYLNSLRLVAANS 112 + P TG A++T LPA V+HTVGP V G Q LL Y + L Sbjct: 163 SREGRPEETGRALVTGGYHLPAAHVIHTVGPIVTHGVTQEHRDLLASCYRSVLDAAEGAG 222 Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVC-YDEENAHLYE 168 SV ++STGV+GYP+ AA + + T+ ++ RH +C + E + YE Sbjct: 223 LDSVGLCSVSTGVFGYPKQEAAPLVLDTIGRWLDRHPDSTLRIVICAFAEVDVRAYE 279 >UniRef50_D0NNH8 Putative uncharacterized protein n=3 Tax=Phytophthora infestans T30-4 RepID=D0NNH8_PHYIN Length = 287 Score = 87.0 bits (214), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 64/181 (35%), Positives = 92/181 (50%), Gaps = 15/181 (8%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 V+QGD+T D IVNAAN LM GGG+ GAI R+ G ++ K + G G AV Sbjct: 53 VMQGDLTCCKADAIVNAANTRLMHGGGLAGAIVRSGGSSIQQESSKWVKDHGPLTVGDAV 112 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGE----QNEDQL---LQDAYLNSLRLVAANSYTSVAFP 119 T AG L + V+HTVGP G E ++ QL + A L + RL SVA P Sbjct: 113 TTAAGKLTCQHVIHTVGPN-VGSETLTSEHATQLRHAVWSALLEADRL----KVKSVAVP 167 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALP---EQVYFVCYDEENAHLYERLLTQQGD 176 ISTG++GYPR A+ V ++F A +++ + D+ + + +T + + Sbjct: 168 GISTGIFGYPRDLGAKEIVTEAAKFCKEKAGSTALKRIALMNIDDPTVKSFVKAVTDEME 227 Query: 177 E 177 E Sbjct: 228 E 228 >UniRef50_Q4RS18 Histone H2A (Fragment) n=2 Tax=Tetraodontidae RepID=Q4RS18_TETNG Length = 415 Score = 86.7 bits (213), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 56/181 (30%), Positives = 88/181 (48%), Gaps = 22/181 (12%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-------D 59 VVQ DI+ + D +V+ + S GG V A+ + G +A +++++ G Sbjct: 227 VVQADISIVESDAVVHPTSSSFYTGGEVGTALEKKGGKEFTEALQELKKKNGPLEVAGGK 286 Query: 60 CP---TGH--------AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV 108 CP TG AV+T LPAK V+H P W G +++L N L L Sbjct: 287 CPDWKTGFLLLSQLLIAVLTAGFGLPAKYVIHCNSPGW--GSDKCEEMLDKTVKNCLALA 344 Query: 109 AANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHL 166 SVAFP+I +G G+P+ AA++ +K +S + T + + VYFV +D E+ + Sbjct: 345 DEKKLKSVAFPSIGSGRNGFPKQTAAQLILKAISSYFVATMSSTIKTVYFVLFDSESIGI 404 Query: 167 Y 167 Y Sbjct: 405 Y 405 >UniRef50_B2VUH2 MACRO domain containing protein 1 n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2VUH2_PYRTR Length = 1599 Score = 86.7 bits (213), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 53/169 (31%), Positives = 86/169 (50%), Gaps = 11/169 (6%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 +V+ DI KL VD++VN+ + S +G G +D ++ + GP L++ K G C G Sbjct: 960 LVREDIMKLEVDIMVNSTDSSFLGMGVLDRSVFKKGGPELMEQIKKF----GTCNEGDVK 1015 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVY 126 +T LPAK ++H + P ++ +L++ Y L TS+A P+I TG Sbjct: 1016 VTPGYLLPAKHILHAIPP--EQFSKSNKGILRNIYREILHTAVLLKATSIAIPSIGTGRL 1073 Query: 127 GYPRAAAAEIAVKTVSEFITRHALP----EQVYFVCYDEENAHLYERLL 171 YPR A +A++ V F+ A P E++ FV Y + +Y+ LL Sbjct: 1074 NYPRRDCASLAMEEVKRFL-ESADPNNTLEKIIFVVYSSNDEFVYKSLL 1121 Score = 68.9 bits (167), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 43/149 (28%), Positives = 73/149 (48%), Gaps = 10/149 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLM---GGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I + D+T+L VD IVN A L + AI +AAGP L + + + D Sbjct: 540 ISFIHHDLTRLKVDAIVNNAPTDLSLSPANNTLHSAIFKAAGPGLTEEA----KLKADIK 595 Query: 62 TGHAVITLAGDLPAKAVVHTVGPV--WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G +T DLP+ ++H G W G ++ ++L Y ++L + + ++AFP Sbjct: 596 VGQVGLTQGHDLPSSWIIHAAGLKYNWSKG-YDQFKVLSSCYQSALEMATYHGIKTIAFP 654 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH 148 + TG G+P AA IA++ + +++ H Sbjct: 655 CLGTGGCGFPARVAARIALQEIRDYLDSH 683 >UniRef50_C3Y406 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3Y406_BRAFL Length = 2514 Score = 86.7 bits (213), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 57/183 (31%), Positives = 97/183 (53%), Gaps = 13/183 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ V + D+TK VD NAAN +L GGG+ AI +A G + D C ++ + D P G Sbjct: 1068 KLVVFKDDLTKHHVDATTNAANKNLKNGGGLAEAIIKAGGKEIQDHCDQIMK---DEPAG 1124 Query: 64 HAV----ITLAGDLPAKAVVHTVGPVW---RGGEQNEDQLLQDAYLNSLRLVAANSYTSV 116 V +T G LP KAV+H VGP + + +++ D+L + N L + + ++SV Sbjct: 1125 LMVGAVRVTGPGKLPCKAVIHAVGPNFHEIKDDKRSRDELFK-TVTNVLEMASRYGFSSV 1183 Query: 117 AFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLLTQQ 174 A PAIS+G++G P + V+ + ++ + +V+FV D + A + + L + Sbjct: 1184 AIPAISSGIFGGPLDLCTKTVVRATGLYFKKNKESKVNEVHFVGIDLDIAQSFNKALLET 1243 Query: 175 GDE 177 +E Sbjct: 1244 FNE 1246 Score = 70.1 bits (170), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 51/178 (28%), Positives = 84/178 (47%), Gaps = 10/178 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +I +++G I+ DVIVN P L + G V A+ GP L C K+++ G P Sbjct: 1300 KITLIRGSISDQQADVIVNTIGPDLNLRTGAVSKALLDKGGPTLQVECDKIKRDLGRLPA 1359 Query: 63 -GHAVITLAGDLPAKAVVHTVGPVWRGGE--QNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G V T G+L V H V W + ++ED +L+ L+ +S ++AFP Sbjct: 1360 HGEVVYTSGGNLGCNLVYHAVCSFWNSQDTAKSED-VLRKIVTACLKSADKDSKRTIAFP 1418 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLYERLLTQQG 175 A+ TG GYP+ A + + ++ E+ FV +D+ + +E L++ G Sbjct: 1419 AVGTGGLGYPKDVVARLMFEETLSHSNKNPAGDLEEAKFVIFDQPS---FEAFLSELG 1473 Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 38/141 (26%), Positives = 65/141 (46%), Gaps = 9/141 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V QGDIT+ VD IVN + + G V + + GP + C K + + Sbjct: 1522 VEVEQGDITREKVDAIVNPTRGDMDLSLGKVSQVLKKKGGPVVQTECEKYDKNK--LKRD 1579 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 IT AG L ++ ++H V P + E + + A +N L + S+AFPA+ T Sbjct: 1580 GVGITAAGGLASRYILHLVAPGF------ETERWKKAVMNCLAYAECHQLKSLAFPALGT 1633 Query: 124 GVYGYPRAAAAEIAVKTVSEF 144 G +A + ++ +++F Sbjct: 1634 GQMAKDPTESATMIIEAIADF 1654 >UniRef50_Q4RG95 Chromosome 12 SCAF15104, whole genome shotgun sequence n=10 Tax=Clupeocephala RepID=Q4RG95_TETNG Length = 1433 Score = 86.3 bits (212), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 56/176 (31%), Positives = 87/176 (49%), Gaps = 14/176 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 V+ GDIT+ DVI+N++N GV AI AG A+ C + + QG P GH Sbjct: 940 FEVLSGDITRETCDVIINSSNRDFTLKSGVSKAILDGAGWAVQVECAQQARAQGH-PPGH 998 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 ++T AG LP+KA+VH N ++ +L+L ++ S AFPA+ TG Sbjct: 999 MIVTSAGRLPSKAIVHV-------SISNNPADIKSTVYAALKLCEEKTFRSAAFPALGTG 1051 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYF----VCYDEENAHLYERLLTQQGD 176 V G P AA A+ V V++F + P+ ++ + E H + ++ QG+ Sbjct: 1052 VGGVPPAAVADAMVGAVADFAKKQ--PKSIHLAKIVIFQPEMLTHFHNSMMKMQGE 1105 Score = 63.9 bits (154), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 52/141 (36%), Positives = 71/141 (50%), Gaps = 1/141 (0%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ V Q D+ L VD +VN AN +L GG+ A+ AAGP L + G G Sbjct: 501 QLSVSQADLCALQVDAVVNPANENLQHTGGLALALLEAAGPELQNTSNLYVAVNGALCAG 560 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + T A LP K V+H VGP + + E LL+ SLR TSVA PAIS Sbjct: 561 QVIATDACRLPCKHVIHAVGPRFSDHSREESVLLLRRVVTQSLREAERLGCTSVAVPAIS 620 Query: 123 TGVYGYPRAAAAEIAVKTVSE 143 +GV+G+P + A+ + V E Sbjct: 621 SGVFGFPLSLCADTIAQAVWE 641 Score = 56.6 bits (135), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 51/177 (28%), Positives = 82/177 (46%), Gaps = 14/177 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLK----VRQQQG 58 R+ + +G+I VIVN + ++ + G V A+ RAAG L A LK R Q Sbjct: 733 RVVLCKGNIEDQRSCVIVNTISETMNLDQGAVSRALLRAAGKGLQAAVLKEARLARLDQL 792 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 D G ++T L + V H V P W Q E + L L+ S++F Sbjct: 793 D--PGSLLVTDGFKLRCQKVFHAVCPQWSASYQAE-KTLTSIISRCLKEAERLKMRSLSF 849 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE---QVYFVCYDEENAHLYERLLT 172 PAI TG+ +P+ A + ++ V F +R P+ +V+ V + ++ ER++T Sbjct: 850 PAIGTGLLSFPKDLVARVLLEEVRTF-SRKKTPQHLLKVFVVVHPSDSG--TERVIT 903 >UniRef50_Q55AK6 U box domain-containing protein n=2 Tax=Eukaryota RepID=Q55AK6_DICDI Length = 1618 Score = 85.9 bits (211), Expect = 5e-16, Method: Composition-based stats. 
Identities = 56/173 (32%), Positives = 88/173 (50%), Gaps = 3/173 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +++GDITK IVN AN L GG +I AAG + C ++ G TG Sbjct: 918 IRIIKGDITKQKTHAIVNPANEKLKNLGGAAFSIQEAAGATFKEFCESYYEKNGPIGTGC 977 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +V + V++TVGP + N+ ++L + +SLR A + S++ PAISTG Sbjct: 978 SVYGSKFKMGNIFVINTVGP--KNDNPNKARILHMSIHSSLRSATALNCQSISIPAISTG 1035 Query: 125 VYGYPRAAAAEIAVKTVSEF-ITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 ++GY A I +K+ EF +T +V FV ++ A+++E L + D Sbjct: 1036 IFGYDPKEAVPIIIKSAIEFLLTNETTLNEVNFVDLNQSTANIFENSLIKFSD 1088 >UniRef50_UPI00006A1CA6 poly (ADP-ribose) polymerase family, member 14 n=11 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1CA6 Length = 1527 Score = 85.5 bits (210), Expect = 6e-16, Method: Composition-based stats. Identities = 50/145 (34%), Positives = 79/145 (54%), Gaps = 1/145 (0%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + I V + D+T+ VDV+VNAA L G+ A+ AAGP L C + +++G Sbjct: 523 RVTIAVYKDDLTRHRVDVVVNAAREDLKHTEGLALALLNAAGPKLQTECDHIIKREGKYS 582 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPA 120 G +VIT AG+LP K V+HTV P W Q +LL+ L L A N +S+ PA Sbjct: 583 VGDSVITGAGNLPCKQVIHTVSPKWDPNSQTRCTRLLRRGISRCLELAAENGLSSIGIPA 642 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI 145 + + + G+P + + V++V +++ Sbjct: 643 VGSQMSGFPVTVSVQNIVESVRQYV 667 Score = 67.0 bits (162), Expect = 3e-10, Method: Composition-based stats. 
Identities = 43/156 (27%), Positives = 71/156 (45%), Gaps = 2/156 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I ++QG+I DVIVN+ L + G V A++ AG L L+ + G Sbjct: 737 IKIIQGNIQDATTDVIVNSVGKDLDLNTGAVSKALNAKAGTKLQQQ-LREMSRGTQVEEG 795 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T L K V+H V P W G+++ +++L+ N L S+ FPAI T Sbjct: 796 SVFVTNGFGLNCKKVIHVVTPGWDQGKRSAEKILRTIMTNCLSTTEKEKLRSITFPAIGT 855 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY 159 G G+P+ A + V + + ++V F+ + Sbjct: 856 GALGFPKDLVASLMFDEVLKSSCKGGQLQEVNFLLH 891 Score = 54.3 bits (129), Expect = 2e-06, Method: Composition-based stats. Identities = 50/154 (32%), Positives = 75/154 (48%), Gaps = 12/154 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V GDITK + DVIVN++N S GV AI AAG ++ D C + Q G Sbjct: 947 KYQVRTGDITKESTDVIVNSSNSSFTQKIGVSKAILEAAGKSIEDECATLGAQAN---KG 1003 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 + ++T G+LP + ++H + ++ + L+ L+ TSVA PA+ T Sbjct: 1004 Y-IVTQKGNLPCRHIIHVY-------TISTPDRIKASVLDVLQECENLKATSVALPAVGT 1055 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFV 157 G G AA A + V EF+T + P+ V V Sbjct: 1056 GAGGATSAAVAAAMLDAVEEFVTMKS-PKSVQTV 1088 >UniRef50_UPI0000E8099B PREDICTED: similar to PARP9 protein n=2 Tax=Gallus gallus RepID=UPI0000E8099B Length = 796 Score = 84.3 bits (207), Expect = 1e-15, Method: Composition-based stats. 
Identities = 51/155 (32%), Positives = 88/155 (56%), Gaps = 4/155 (2%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAV 66 V + D+T D +VNAAN SL G + A+ A GP + + ++ G PTG Sbjct: 80 VYKDDLTSHKADAVVNAANESLEHSGALALALLNAGGPEIAEESRNFIRKHGKVPTGKIA 139 Query: 67 ITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVA--ANSYTSVAFPAIST 123 +T G LP K ++H +GP+W E+ + LL++A +N L+ + N+ SVA PA+S+ Sbjct: 140 VTGGGKLPCKKIIHAIGPIWYPSEKEKCCVLLEEAVVNVLKYASDPKNNIKSVAIPAVSS 199 Query: 124 GVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFV 157 GV+G+P A++ V ++ F+ T+ + ++++ V Sbjct: 200 GVFGFPVNLCAQVIVMSIKLFVETQPSCLKEIHLV 234 >UniRef50_D0WKT6 Appr-1-p processing enzyme family domain protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WKT6_9ACTO Length = 302 Score = 84.0 bits (206), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 61/174 (35%), Positives = 95/174 (54%), Gaps = 11/174 (6%) Query: 9 QGDITKLAVDVIVNAANPSLMGGGG-----VDGAIHRAAGPALLDACLKVRQQQG-DCPT 62 +GD+ +LA D +VNAA P+L+G +D I GP + + C +R+ QG D Sbjct: 123 RGDVRELAADAVVNAAMPNLLGCKDPLHPCIDNYIQGQGGPWIRNDCSVIREIQGKDQEV 182 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGE-QNED-QLLQDAYLNSLRL-VAANSYTSVAFP 119 G AV+T LPA+ V+HT+GP GGE +ED + L Y + L L + +V+F Sbjct: 183 GDAVLTRGYRLPARYVLHTLGPHLNGGEITDEDREKLAACYTSCLDLALEKGDIHNVSFC 242 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 A+STG +P A IA+ TV++++ H + E V F +++ +A Y + L Sbjct: 243 ALSTGRNNFPFEEATHIALDTVNQWLQYHGTDVIELVVFNIFEDADAEGYMQAL 296 >UniRef50_C3Y6H4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y6H4_BRAFL Length = 2120 Score = 83.6 bits (205), Expect = 2e-15, Method: Compositional matrix adjust. 
Identities = 56/143 (39%), Positives = 76/143 (53%), Gaps = 1/143 (0%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ V + DITK DVIVNAAN L GG+ AI A G + C Q G G Sbjct: 1025 KLVVWRDDITKHKADVIVNAANVRLEHVGGLAKAIVDAGGDIIQKFCNDYIQANGKLIPG 1084 Query: 64 HAVITLAGDL-PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 V + G + + ++H VGP+W GG E+ L DA SL A +++ S+A PAIS Sbjct: 1085 QVVSSPPGRINTCQRILHAVGPIWNGGGLGEEGHLADAVYGSLEEAAKSNFRSIAIPAIS 1144 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI 145 +G+YGYP AEI V E++ Sbjct: 1145 SGIYGYPLKKCAEIIVAKTVEYL 1167 Score = 60.5 bits (145), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 45/152 (29%), Positives = 72/152 (47%), Gaps = 7/152 (4%) Query: 17 VDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCPTGHAVITLAGDL 73 VDV+VN +L + G V AI + AG L + QQ P G ++T + DL Sbjct: 1278 VDVLVNTTAGNLNLNTGAVSRAILQLAGNDLQTLVNRAMQQARITSLPDGQILVTDSADL 1337 Query: 74 PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAA 133 K V+H V W GG+ N +++L+ L+ +Y S+A PA+ TG +P Sbjct: 1338 LCKQVIHCVLCSWDGGQGNSEKVLRKIVQQCLQQAEKGNYASIAIPAMGTGGLHFPHDVV 1397 Query: 134 AEIAVKTVSEFITRH---ALPEQVYFVCYDEE 162 AE E ++ +L E + F+ ++E+ Sbjct: 1398 AEAMFDEAVEHCRKNPSGSLRE-IRFIVWEED 1428 Score = 50.1 bits (118), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 44/148 (29%), Positives = 68/148 (45%), Gaps = 20/148 (13%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +++ +GD+TK D IVN+ N L + G V AI +A GP + C + +G G Sbjct: 1510 LNIRKGDLTKETTDCIVNSTNEQLDLTRGAVSNAICKAGGPDIEQECKNI-AARGGMRDG 1568 Query: 64 HAVITLAGDLPAKAVVHTVGPV------WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 AV T +G L ++H P W+ N LQ A ++LRL S+A Sbjct: 1569 IAV-TGSGQLKCGKIIHAAAPAPGQSTGWKKVITN---CLQTA--DTLRL------RSIA 1616 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFI 145 FPA+ TG + A + + +F+ Sbjct: 1617 FPALGTGTLQGSAESTATTMLDALQDFV 1644 >UniRef50_A5D049 Predicted phosphatase n=4 Tax=Bacteria RepID=A5D049_PELTS Length = 359 Score = 83.6 bits (205), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 60/167 (35%), Positives = 85/167 (50%), Gaps = 8/167 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V++GDIT+L VD IVNAAN L G GV GAI R G A+ + + +G P G Sbjct: 2 IKVLKGDITELQVDAIVNAANNHLWMGAGVAGAIKRKGGAAIEEEAV----AKGPIPVGE 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T AG L A+ VVH G + D + ++ A N+L ++AFPA+ T Sbjct: 58 AVVTGAGLLKARYVVHAAA---MGQDLVTDAEKVRAATRNALLRAGELGLKTIAFPALGT 114 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 GV G AA + V V + P +V F +D++ + R+ Sbjct: 115 GVGGLEFDTAARVMVGEVRRHLALGLEPGEVIFALFDDKGYDAFSRI 161 >UniRef50_A1R2V6 Putative uncharacterized protein n=1 Tax=Arthrobacter aurescens TC1 RepID=A1R2V6_ARTAT Length = 152 Score = 83.6 bits (205), Expect = 3e-15, Method: Compositional matrix adjust. 
Identities = 58/147 (39%), Positives = 79/147 (53%), Gaps = 10/147 (6%) Query: 34 VDGAIHRAAGPALLDACLKVRQQQ--GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ 91 +DGAIHRAAG LL+AC ++R+ + P G AV T A LPA V+HTVGP G Q Sbjct: 1 MDGAIHRAAGSELLEACRELRRTELPEGLPVGAAVATPAFRLPAHWVIHTVGPNRHAG-Q 59 Query: 92 NEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALP 151 + LL + SL++ A S+AFPAIS G+YG+ AE+A V F + ++ Sbjct: 60 TDPALLASCFRESLKVAAGLGARSLAFPAISAGIYGWDSRQVAEVAFDAVGSFSSSSSVS 119 Query: 152 -------EQVYFVCYDEENAHLYERLL 171 E V FV + EE ++ L Sbjct: 120 AASERGFELVEFVLFSEETTAVFRAAL 146 >UniRef50_Q0UG78 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0UG78_PHANO Length = 2240 Score = 83.2 bits (204), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 54/148 (36%), Positives = 80/148 (54%), Gaps = 10/148 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL--MGGGGVDGAIHRAAGPAL-LDACLKVRQQQGDCP 61 I D+TKL VD IVN+AN SL G ++ AIH+AAGP L ++A L R + Sbjct: 603 ISFCHHDLTKLKVDAIVNSANKSLKMTRGDTLNNAIHKAAGPGLSVEARLTGRLE----- 657 Query: 62 TGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G A+IT +LP++ V+H + P +R E L D Y L++ N ++AFP Sbjct: 658 -GQALITGGHNLPSEHVIHVLRPGYFRHKGMGEFNQLIDCYREVLKVAIENKIKTIAFPC 716 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH 148 + TG G+P AA I ++ + E++ H Sbjct: 717 LGTGGVGFPARVAARITLQEMREYLDAH 744 Score = 78.6 bits (192), Expect = 8e-14, Method: Compositional matrix adjust. 
Identities = 52/172 (30%), Positives = 86/172 (50%), Gaps = 11/172 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I++V+ DITKL VDV+VN+ + S G G +D + + G + A G C G Sbjct: 1021 KIYLVREDITKLEVDVMVNSTDVSFRGMGTLDRTVLQKGGEQMRAAVTAF----GQCKIG 1076 Query: 64 HAVITLAGDLPAKAVVHTV-GPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 T LPAK V+H + + GG + +L+ Y L+ + TS+A P+I Sbjct: 1077 EVRHTEGYMLPAKHVLHIIPADRYNGGTK---IVLKKLYREVLQEAVSMRATSIALPSIG 1133 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERLL 171 TG+ YPR A +A++ F+ R+ E++ FV + + +Y+ L+ Sbjct: 1134 TGMLNYPRRDVASVALEEAKRFLESAERNNPVEKIIFVVFSSNDEFVYKSLM 1185 >UniRef50_UPI000180B1B4 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2), partial n=1 Tax=Ciona intestinalis RepID=UPI000180B1B4 Length = 1271 Score = 83.2 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 55/161 (34%), Positives = 87/161 (54%), Gaps = 11/161 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPAL-LDACLKVRQQQGDCPT 62 + +++GDIT++ D IVNA+N L + G+ G+I + GP + + + G Sbjct: 523 VKILRGDITEVNCDAIVNASNDKLELRDAGISGSIKKKCGPTVQAEMNQHIASVGGTMLP 582 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNS-----LRLVAANSYTSVA 117 G AV T AG + + ++H VGPVW+G +E + +AYL S L+ + TSVA Sbjct: 583 GSAVSTSAGRMNCRRIIHVVGPVWKGDISDE---VCEAYLKSCVSETLKEAERYNLTSVA 639 Query: 118 FPAISTGVYGYPRAAAAEIAVKT-VSEFITRHALPEQVYFV 157 PAIS GV+G + + V+T V F+ + +Q+YFV Sbjct: 640 MPAISCGVFGGSVSVCPRLMVETLVDHFMKPSSCIKQIYFV 680 >UniRef50_C3YS04 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YS04_BRAFL Length = 178 Score = 82.8 bits (203), Expect = 5e-15, Method: Compositional matrix adjust. 
Identities = 55/144 (38%), Positives = 73/144 (50%), Gaps = 3/144 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +++GDIT VD IVNAAN SL GV GAI RA G A+ C + + G T Sbjct: 15 QIDIIKGDITSQKVDTIVNAANSSLSLAVGVSGAISRAGGRAIQTECDNI-IKHGSLRTT 73 Query: 64 HAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAAN-SYTSVAFPAI 121 V T G L ++H VGP + G E Q L D L + A+ + S+A PAI Sbjct: 74 DCVWTTPGRLSCTYIIHAVGPNFVPGCESRCKQELYDTCQKVLNIAASRLNAKSIAMPAI 133 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI 145 S+G G PR AE + +F+ Sbjct: 134 SSGASGMPRRLCAEAMCSAIMDFV 157 >UniRef50_Q2SM57 Predicted phosphatase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SM57_HAHCH Length = 180 Score = 82.8 bits (203), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 54/147 (36%), Positives = 74/147 (50%), Gaps = 10/147 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + GDIT+L VD IV A+ L G G+ I AG L+A Q G C G Sbjct: 2 IEFLCGDITELEVDAIVCPAHKYLSKGRGLSAQIFEQAGEEALEAACS---QAGGCKVGG 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQ---NEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A +T LPAK ++HTV P W GG+Q ++ LL + Y + +RL ++AFPA+ Sbjct: 59 ACLTPGFKLPAKHIIHTVTPQWTGGDQWGGSDLHLLANCYDSVVRLALEQGVKTIAFPAL 118 Query: 122 STGVYGYPRAAAA----EIAVKTVSEF 144 G P++ AA E+ VK F Sbjct: 119 GAGTNKTPQSMAAHEGLEVLVKYADSF 145 >UniRef50_B3RYC4 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RYC4_TRIAD Length = 491 Score = 81.6 bits (200), Expect = 1e-14, Method: Compositional matrix adjust. 
Identities = 50/163 (30%), Positives = 83/163 (50%), Gaps = 6/163 (3%) Query: 10 GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 GDIT L VD IVN N +L ++ I AGP+L +R + G C TG + ++ Sbjct: 57 GDITTLKVDAIVNPTNENLSVMSPINQKIFEIAGPSL---HRDIRDEIGKCATGESKLSK 113 Query: 70 AGDLPAKAVVHTVGPVW--RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYG 127 +LP++ V+HTVGP + R E+ L + +Y +SL + S+A P + G Sbjct: 114 GYNLPSRYVIHTVGPKYNPRYLSAVENALYR-SYRSSLLIAGEYKVRSIAIPTVHLHQRG 172 Query: 128 YPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 +P + A IA++TV ++ + + + D+ +Y+RL Sbjct: 173 FPVSEGAHIALRTVRRYLEHQSCTLETVILILDDTEMEIYKRL 215 >UniRef50_UPI00016E2DD3 UPI00016E2DD3 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E2DD3 Length = 1673 Score = 81.3 bits (199), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 49/120 (40%), Positives = 65/120 (54%), Gaps = 11/120 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V GDITK DVIVN++N + GV AI AAG A+ D C Q+ P Sbjct: 1099 IQAVTGDITKETTDVIVNSSNNTFSLKKGVSKAILEAAGQAVEDEC----QKLAASPNAG 1154 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 ++T G+L K +VH G QN+ L+ ++L++ ANSYTSV+FPAI TG Sbjct: 1155 IIMTQPGNLQCKKIVHVTG-------QNKAFLISKVVKSALQMCVANSYTSVSFPAIGTG 1207 Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 40/145 (27%), Positives = 67/145 (46%), Gaps = 6/145 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I +V G+I DV VN+ L + G + A+ AAGP L D ++ Q G Sbjct: 901 ITLVVGNIEDATTDVTVNSVFNDLDLNRGALSRALLHAAGPQLQDF---LKAQNSSGTLG 957 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 ++T L + V H V P + Q L + + L+ + TS++FP+I T Sbjct: 958 EIIMTEGCQLKSMFVYHAVTPASYNAQ--AVQALGGIFRDCLKKAEDSGMTSISFPSIGT 1015 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH 148 G G+P+ AA++ + +F ++ Sbjct: 1016 GGLGFPKDLAAQMLYDEILKFSSKR 1040 Score = 53.1 bits (126), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 31/89 (34%), Positives = 45/89 (50%), Gaps = 1/89 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T I V + DI V +V+ ANP G+ A+ +AAGP L + C ++ +G Sbjct: 684 TEIFVCKADICSYPVHAVVSYANPDFRFTSGLQRALLKAAGPQLQEDCDRLIHLKGRLKP 743 Query: 63 GHAVITLA-GDLPAKAVVHTVGPVWRGGE 90 G VIT A G L + ++H V P GG+ Sbjct: 744 GDNVITAAGGQLCCRNIIHAVAPKLDGGQ 772 >UniRef50_Q9NXN4 Ganglioside-induced differentiation-associated protein 2 n=36 Tax=Euteleostomi RepID=GDAP2_HUMAN Length = 497 Score = 81.3 bits (199), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 56/170 (32%), Positives = 85/170 (50%), Gaps = 7/170 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ + +GD+ L IVN +N SL V +I AGP L + K++ C TG Sbjct: 55 KVVLWKGDVALLNCTAIVNTSNESLTDKNPVSESIFMLAGPDLKEDLQKLK----GCRTG 110 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A +T +L A+ ++HTVGP ++ + + L Y N L+L S +SV F I+ Sbjct: 111 EAKLTKGFNLAARFIIHTVGPKYKSRYRTAAESSLYSCYRNVLQLAKEQSMSSVGFCVIN 170 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLL 171 + GYP A IA++TV F+ H E+V F D E Y++LL Sbjct: 171 SAKRGYPLEDATHIALRTVRRFLEIHGETIEKVVFAVSDLEEG-TYQKLL 219 >UniRef50_C3YS03 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3YS03_BRAFL Length = 2671 Score = 81.3 bits (199), Expect = 1e-14, Method: Composition-based stats. 
Identities = 51/136 (37%), Positives = 75/136 (55%), Gaps = 10/136 (7%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + ++ VVQG++T VDV+VN AN SL GGG+ AI +A G + C + G Sbjct: 2200 RRKLVVVQGNLTSHRVDVMVNTANGSLSHGGGLAAAIVKAGGQEIQRDCTNYIKDNGKLT 2259 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G + T LP K VVH VGP+W +++ +++ L+ A N+ L+ A Y S+A PA Sbjct: 2260 EGQVMSTKGYKLPCKMVVHAVGPLWIADQKDSKEKALKMAVENA--LLEARDYHSIAIPA 2317 Query: 121 ISTG-------VYGYP 129 IS+G + GYP Sbjct: 2318 ISSGEELILLCISGYP 2333 Score = 68.2 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 57/176 (32%), Positives = 77/176 (43%), Gaps = 13/176 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I + +G+IT DV+VN + L + GGV A +A G L C G G Sbjct: 2428 IQLKKGNITAEKADVLVNTTSGDLDLSQGGVARAFGQAGGQELQQLC----NNHGKANAG 2483 Query: 64 HAVITL-AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 VITL AG L K V H V P W Q DQ L+ + L YTS++FPA+ Sbjct: 2484 DIVITLRAGTLRCKQVYHAVLPNW----QESDQPLRTMVQDCLESADQGGYTSISFPAMG 2539 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYE-RLLTQQG 175 TG YPR AA + F + + V + +D+ +E L +QG Sbjct: 2540 TGNLKYPRDVAASCMYDEILSFSQSNPGTTLQDVGIIVFDQPTVQAFETELRVRQG 2595 >UniRef50_A2QSI2 Contig An08c0280, complete genome n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QSI2_ASPNC Length = 603 Score = 80.1 bits (196), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 60/187 (32%), Positives = 92/187 (49%), Gaps = 18/187 (9%) Query: 5 IHVVQGDITKL-AVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLK-VRQQQ 57 +H+ QGDIT L V I NAAN ++G +D IH AGP L + C + Q Q Sbjct: 109 LHLWQGDITTLDGVTAITNAANEQMLGCFQPAHRCLDNVIHARAGPRLREECFHHMDQGQ 168 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ----NEDQLLQDAYLNSLRLVAANSY 113 P GHA T LPA V+HTVGP G+ ++ Q L+ Y L + A Sbjct: 169 RTLPVGHACATKGYCLPAPYVIHTVGPQLDAGQPVPTAHQRQQLRQCYEAVLDVAEALPA 228 Query: 114 T-----SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHL 166 + S+A ISTG++ +P AA IA+++V +++ H + F + + + + Sbjct: 229 SDPRGKSIALCGISTGLFAFPVEEAASIAIQSVLDWLRHHLHTSITNIIFNTFTDTDTAV 288 Query: 167 YERLLTQ 173 Y++ L + Sbjct: 289 YQQTLKK 295 >UniRef50_C3YH95 Putative uncharacterized protein n=2 Tax=Eumetazoa RepID=C3YH95_BRAFL Length = 437 Score = 79.3 bits (194), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 8/171 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ + +GDIT L IVN N +L + I +AAGP L C C TG Sbjct: 47 KVVLWEGDITTLNCTAIVNTTNETLTDRNLISERIFQAAGPDLRAEC---SNHLKTCRTG 103 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A +T +LPA+ ++HTVGP + + + L + Y NSL++ N+ S+ ++ Sbjct: 104 EAKMTKGYNLPARYIIHTVGPRYNVKYRTAAESALFNCYRNSLQIARENNLQSIGLCVVN 163 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 GYP A IA++TV F+ ++ +L V+ V ++E+ +Y R++ Sbjct: 164 QPKRGYPPDEGAHIALRTVRRFLEKYDSSLETIVFAVTDNDED--IYRRVM 212 >UniRef50_B0QWK9 Putative uncharacterized protein n=1 Tax=Haemophilus parasuis 29755 RepID=B0QWK9_HAEPR Length = 156 Score = 79.0 bits (193), Expect = 6e-14, Method: Compositional matrix adjust. 
Identities = 50/126 (39%), Positives = 73/126 (57%), Gaps = 5/126 (3%) Query: 49 ACLKVRQQQGDC-PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLR 106 AC ++ ++QG PTG A IT A +LP+ V+HTVGP+ G +D +LL Y + L Sbjct: 6 ACAELMEKQGHLGPTGQAKITPAFNLPSAYVLHTVGPIISGALSAKDCELLASCYRSCLE 65 Query: 107 LVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAH 165 L + SVAF ISTG + +P AAEIAV+TV F+ + P+ +V F + + + Sbjct: 66 LAKQHGIESVAFCCISTGEFRFPNQEAAEIAVQTVKAFLADN--PQMKVVFNVFKDVDLE 123 Query: 166 LYERLL 171 +Y LL Sbjct: 124 IYRGLL 129 >UniRef50_D0NR00 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NR00_PHYIN Length = 492 Score = 78.6 bits (192), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 52/173 (30%), Positives = 83/173 (47%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +G + L VD +VN+ S+ G + ++AGP + C + G C Sbjct: 44 INAKLSLWRGPLYCLRVDAVVNSTCESMRQSDGDFDKLLKSAGPEIAVEC----KAAGAC 99 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG V+T LPAK ++HTVGP ++ N + L Y + L + N SVA Sbjct: 100 RTGDTVLTRGCKLPAKFILHTVGPRYQAKYHNAAEHSLHSCYRSVLAVTKENGLRSVATG 159 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE-ENAHLYERLL 171 I T GYPR A IA +TV ++ + +C D ++ +YER+L Sbjct: 160 CIYTIRKGYPREEGAHIAARTVRRYLEHYGDDFDRVILCMDSVQDMDVYERVL 212 >UniRef50_B9L2D9 Appr-1-p processing enzyme family protein n=2 Tax=Thermomicrobia (class) RepID=B9L2D9_THERP Length = 176 Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 64/173 (36%), Positives = 81/173 (46%), Gaps = 8/173 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GDIT + + IVNAAN L G GV GAI RA G + + QG G Sbjct: 6 LEVQVGDITAVDTEAIVNAANSQLWMGSGVAGAIKRAGGEEIEREAVA----QGPISVGE 61 Query: 65 AVITLAGDLPAKAVVH--TVGPVWRGGE-QNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 AV+T AG LP AV+H +G RG + + A +L A SVAFPA+ Sbjct: 62 AVVTTAGRLPFAAVIHAAAMGYDERGAMIPATSETVYAATRAALERCAERPLRSVAFPAL 121 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITR-HALPEQVYFVCYDEENAHLYERLLTQ 173 TGV G A V+ V + ALPE+V FV EE A + R + Sbjct: 122 GTGVGGLDLVTCAAAMVRAVRDHAASGAALPERVVFVVRSEEAADAFLRAIAS 174 >UniRef50_Q460N3 Poly [ADP-ribose] polymerase 15 n=12 Tax=Eutheria RepID=PAR15_HUMAN Length = 656 Score = 77.8 bits (190), Expect = 2e-13, Method: Composition-based stats. Identities = 48/165 (29%), Positives = 87/165 (52%), Gaps = 8/165 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + ++ GD+ + DVIVN+ +L +GGG + A + AGP +L L R+++ + G Sbjct: 69 LKLISGDVLYIWADVIVNSVPMNLQLGGGPLSRAFLQKAGP-MLQKELDDRRRETEEKVG 127 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 + +T +L KAV+H V P W G + Q++ + L V S++S+ FP I T Sbjct: 128 NIFMTSGCNLDCKAVLHAVAPYWNNGAETSWQIMANIIKKCLTTVEVLSFSSITFPMIGT 187 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP-----EQVYFVCYDEEN 163 G +P+A A++ + V E+ + P ++V+F+ Y ++ Sbjct: 188 GSLQFPKAVFAKLILSEVFEY-SSSTRPITSPLQEVHFLVYTNDD 231 Score = 54.3 bits (129), Expect = 2e-06, Method: Composition-based stats. 
Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 14/146 (9%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 V GDI VDVIVN+ + GV AI AG A+ C + Q P Sbjct: 285 QVATGDIATEQVDVIVNSTARTFNRKSGVSRAILEGAGQAVESECAVLAAQ----PHRDF 340 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 +IT G L K ++H G + ++ + L YTSV+ PAI TG Sbjct: 341 IITPGGCLKCKIIIHVPG----------GKDVRKTVTSVLEECEQRKYTSVSLPAIGTGN 390 Query: 126 YGYPRAAAAEIAVKTVSEFITRHALP 151 G A+ + + +F ++H+ P Sbjct: 391 AGKNPITVADNIIDAIVDFSSQHSTP 416 >UniRef50_Q2TX23 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 n=8 Tax=Fungi/Metazoa group RepID=Q2TX23_ASPOR Length = 615 Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 62/182 (34%), Positives = 90/182 (49%), Gaps = 19/182 (10%) Query: 5 IHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDAC--LKVRQQ 56 I + +GDIT L V IVNAAN L+G +D IH AAGP L DAC L ++Q Sbjct: 114 ISLWKGDITSLTDVTAIVNAANSQLLGCFRPDHRCIDNIIHSAAGPRLRDACNSLMLKQC 173 Query: 57 QGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRLVAA--- 110 + G +T +LPA+ V+HTVGP + + Q L Y + L + Sbjct: 174 HPES-VGSVKVTSGFNLPAQWVLHTVGPQVNSRKSPGTLQQQQLASCYSSCLDATESLPA 232 Query: 111 --NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHL 166 + VAF ISTG++ +P AA+IA++TV ++ H + F + E + L Sbjct: 233 LPDGRKVVAFCCISTGLFAFPPDMAAKIALETVVQWCMNHPATSVTDIIFDTFLERDYEL 292 Query: 167 YE 168 Y+ Sbjct: 293 YQ 294 >UniRef50_O67112 UPF0189 protein aq_987 n=4 Tax=cellular organisms RepID=Y987_AQUAE Length = 165 Score = 77.4 bits (189), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 54/167 (32%), Positives = 82/167 (49%), Gaps = 7/167 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I VV+G IT++ DVIVN AN + GGGV I R G + + ++ P G Sbjct: 3 IKVVKGSITEVDADVIVNPANSRGLMGGGVAVVIKRLGGEEIEREAV----EKAPIPVGS 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AV+T AG L K V+H + + ++ ++ A +L L + VA P + TG Sbjct: 59 AVLTTAGKLKFKGVIHA-PTMEEPAMPSSEEKVRKATRAALELADKECFKIVAIPGMGTG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 V G P+ AA V+ + +F + E+V V DEE +E++L Sbjct: 118 VGGVPKEVAARAMVEEIRKFEPKCL--EKVILVDIDEEMVEAWEKVL 162 >UniRef50_C3Y417 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3Y417_BRAFL Length = 1060 Score = 77.0 bits (188), Expect = 2e-13, Method: Composition-based stats. Identities = 52/143 (36%), Positives = 72/143 (50%), Gaps = 3/143 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + QGD+T+ V IVNAAN L G+ AI AGP+L + C K + G Sbjct: 460 VSMYQGDLTQEKVTAIVNAANGYLAHAAGIAAAIQEQAGPSLEEECRKYISKHGPLYETQ 519 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNED--QLLQDAYLNSLRLVAANSY-TSVAFPAI 121 + T AG+LP V+H VGP WR + L+ +LN L + T+VA PAI Sbjct: 520 VMHTSAGNLPCHYVIHAVGPKWRDYSNKTECASALRVTFLNCLDYANEKLHATTVALPAI 579 Query: 122 STGVYGYPRAAAAEIAVKTVSEF 144 STG++G P A+ V +F Sbjct: 580 STGIFGVPNDVCAKAVYDAVRDF 602 >UniRef50_UPI000180BD0C PREDICTED: similar to Ci-Rhysin2/Deltex3-a n=1 Tax=Ciona intestinalis RepID=UPI000180BD0C Length = 578 Score = 76.6 bits (187), Expect = 4e-13, Method: Composition-based stats. 
Identities = 51/153 (33%), Positives = 74/153 (48%), Gaps = 10/153 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDAC---LKVRQQQGDCP 61 + V G+I VD IVNAAN + G GV GAI + G C +K RQ + Sbjct: 384 VSVAHGNIALQDVDAIVNAANKYIQNGSGVTGAIFKQGGSKFEQLCKEAMKHRQNR-SLK 442 Query: 62 TGHAV-ITLAGDLPAKAVVHTVGPVWRGGEQNED--QLLQDAYLNSLRLVAANSYTSVAF 118 G V + AG+L K V+H VGP W+ ++ LL+D L+ L+ +++A Sbjct: 443 VGEVVSVKAAGNLQCKRVLHLVGPQWKNYSHKDEAYHLLEDGLLSVLKESNYCKASTLAL 502 Query: 119 PAISTGVYGYPR---AAAAEIAVKTVSEFITRH 148 P ++TG+YG P A A+ I+RH Sbjct: 503 PPVATGIYGTPLKLFVRAMNTALTCFETNISRH 535 >UniRef50_Q8IXQ6 Poly [ADP-ribose] polymerase 9 n=27 Tax=Eutheria RepID=PARP9_HUMAN Length = 854 Score = 76.3 bits (186), Expect = 4e-13, Method: Composition-based stats. Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 3/143 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + V + D+T AVD +VNAAN L+ GGG+ A+ +A G + + + + G Sbjct: 117 RIELSVWKDDLTTHAVDAVVNAANEDLLHGGGLALALVKAGGFEIQEESKQFVARYGKVS 176 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLR-LVAANSY-TSVAF 118 G +T AG LP K ++H VGP W +Q LQ A ++ L ++ N++ +VA Sbjct: 177 AGEIAVTGAGRLPCKQIIHAVGPRWMEWDKQGCTGKLQRAIVSILNYVIYKNTHIKTVAI 236 Query: 119 PAISTGVYGYPRAAAAEIAVKTV 141 PA+S+G++ +P + V+T+ Sbjct: 237 PALSSGIFQFPLNLCTKTIVETI 259 Score = 50.8 bits (120), Expect = 2e-05, Method: Composition-based stats. 
Identities = 42/144 (29%), Positives = 65/144 (45%), Gaps = 5/144 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + +VQG I DVIVN+ NP + G V +I + AG + L + +Q + Sbjct: 319 LQIVQGHIEWQTADVIVNSVNPHDITVGPVAKSILQQAGVEMKSEFLATKAKQFQ-RSQL 377 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 ++T +L K + H + W E + Q+L+ A L + TS++FPA+ TG Sbjct: 378 VLVTKGFNLFCKYIYHVL---WHS-EFPKPQILKHAMKECLEKCIEQNITSISFPALGTG 433 Query: 125 VYGYPRAAAAEIAVKTVSEFITRH 148 + AAEI V F H Sbjct: 434 NMEIKKETAAEILFDEVLTFAKDH 457 >UniRef50_Q4T065 Chromosome undetermined SCAF11328, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4T065_TETNG Length = 566 Score = 76.3 bits (186), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 47/140 (33%), Positives = 71/140 (50%), Gaps = 5/140 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I + +GD+ L IVN ++ SL V +IHR AGP L D LK++ C Sbjct: 50 INAKIVLFKGDVALLNCTSIVNTSSESLNDKNPVSDSIHRLAGPELRDELLKLK----GC 105 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T L A+ ++HTVGP ++ + + L Y + L+LV S SV Sbjct: 106 RTGEAKLTKGFGLAARFIIHTVGPKYKTKYRTAAESSLYSCYRSVLQLVVEQSMASVGLC 165 Query: 120 AISTGVYGYPRAAAAEIAVK 139 I+T GYP A +A++ Sbjct: 166 TITTSKRGYPLEEATHMALR 185 >UniRef50_B1H1M8 LOC100148704 protein (Fragment) n=5 Tax=Danio rerio RepID=B1H1M8_DANRE Length = 858 Score = 74.7 bits (182), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 50/142 (35%), Positives = 73/142 (51%), Gaps = 12/142 (8%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V GDITK+ V+ +VN+ N SL GV GAI +A+GP ++ C K + Q P Sbjct: 281 IRVSSGDITKVKVEAVVNSTNTSLNLSSGVSGAILKASGPTVVKEC-KAKAPQ---PEDG 336 Query: 65 AVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 V+T AG+L +VH VG R G ++ + L+ N SV+FPA+ T Sbjct: 337 VVLTRAGNLTNCTHIVHMVGQTSRTG-------IRSSMAKVLKTCEENHIRSVSFPALGT 389 Query: 124 GVYGYPRAAAAEIAVKTVSEFI 145 G P AA A+ +++F+ Sbjct: 390 GAGHLPAAAVADAMTTALADFV 411 Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 48/162 (29%), Positives = 76/162 (46%), Gaps = 12/162 (7%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T I V +G IT +V IVN N + GGV GAI +AAG ++ C K QGD Sbjct: 79 TTIEVRKGSITTESVRGIVNTTNRDMSRRGGVSGAIFKAAGASVEQECRKHGPLQGD--- 135 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A +T AG L ++H +GP ++ ++ + L T+V+FPAI Sbjct: 136 -DAAVTAAGLLHCDLILHMLGP--HSAAESRTRVRR-----VLERCEEKQITTVSFPAIG 187 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEEN 163 TG G AA ++ ++ +T+ ++ ++ D +N Sbjct: 188 TGGGGVQAVDAATAMLQGFADHLTKSTSSVVKLIYIVIDRDN 229 >UniRef50_A7T7L3 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7T7L3_NEMVE Length = 177 Score = 74.3 bits (181), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 53/177 (29%), Positives = 87/177 (49%), Gaps = 21/177 (11%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-- 58 + ++ + GDIT L +D IVNA N ++ G+D + KV +G Sbjct: 12 LNDKVSLWTGDITALEIDAIVNAGNTIMLMFIGIDVDSYPN----------KVYSGRGIF 61 Query: 59 DCPTGHAVITLAGD-LPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 C + + L G V+HT GP+ + + LQD Y N L+L + ++A Sbjct: 62 KCFFFNLSVLLKGSPYFGLDVIHTAGPMGKNRIK-----LQDCYKNCLQLAKQHGVKTLA 116 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLL 171 F ISTG+YGYP AA +A++TV +++ + E++ F + ++ +YERLL Sbjct: 117 FCCISTGIYGYPNKDAAHVALETVRQWLETDDNNDSVERIIFCTFLPKDTEIYERLL 173 >UniRef50_B1L625 Appr-1-p processing domain protein n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L625_KORCO Length = 175 Score = 73.9 bits (180), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 56/172 (32%), Positives = 86/172 (50%), Gaps = 10/172 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 R+ +V GDIT++ D IVN AN LM GGGV GAI R G + ++ + G Sbjct: 6 RLILVLGDITEVESDAIVNPANVFLMMGGGVAGAIKRKGGEEIEREAMR----KAPLKIG 61 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A+ T AG L AK V+H V G + + ++ A SL+ S+AFPA+ Sbjct: 62 EAIETSAGKLKAKYVIHAP-TVESPGGSSSPEYIRAAVKASLKKGEELGIRSIAFPAMGA 120 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 GV G P E +V+ + E I + E+V V ++++ +++R+ G Sbjct: 121 GVGGVP----VEESVRIILEEIKASPI-EEVLLVTRNKQDLEVFKRVSEYMG 167 >UniRef50_O28751 UPF0189 protein AF_1521 n=32 Tax=Euryarchaeota RepID=Y1521_ARCFU Length = 192 Score = 73.9 bits (180), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 64/181 (35%), Positives = 89/181 (49%), Gaps = 16/181 (8%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRA-AGPALLDACLK---VRQQQGD- 59 + + QGDIT+ IVNAAN L GGGV AI +A AG A L + +R+Q G Sbjct: 14 LKLAQGDITQYPAKAIVNAANKRLEHGGGVAYAIAKACAGDAGLYTEISKKAMREQFGRD 73 Query: 60 -CPTGHAVITLAGDLPA---KAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYT 114 G V+T A +L K V HTVGP+ G E + L A+L L Sbjct: 74 YIDHGEVVVTPAMNLEERGIKYVFHTVGPICSGMWSEELKEKLYKAFLGPLEKAEEMGVE 133 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAH----LYERL 170 S+AFPA+S G+YG E ++ V F + + ++V V YD ++A ++ER Sbjct: 134 SIAFPAVSAGIYGCDLEKVVETFLEAVKNF--KGSAVKEVALVIYDRKSAEVALKVFERS 191 Query: 171 L 171 L Sbjct: 192 L 192 >UniRef50_UPI0001BC8416 Appr-1-p processing domain protein n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC8416 Length = 430 Score = 73.6 bits (179), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 48/160 (30%), Positives = 75/160 (46%), Gaps = 5/160 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 K+R+ + GD+T DVIV++ + L GGGV +I RA G D + ++ C Sbjct: 11 KSRLIIKFGDLTSAVTDVIVSSDDAYLSMGGGVSASILRAGG----DVIARDARKNVPCQ 66 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G ++T AG L AK V H + W + E ++ + SL +++ S+AFPA Sbjct: 67 MGDVIVTSAGKLEAKYVFHAITIDWSQKDEFTVEKSINSIIKKSLNVLSVLGLKSIAFPA 126 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYD 160 I TG Y A +SEF++ ++Y D Sbjct: 127 IGTGAARYSLEDVAHFMSMAISEFLSNSDEELEIYIYLMD 166 >UniRef50_UPI00005A5611 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 14 n=1 Tax=Canis lupus familiaris RepID=UPI00005A5611 Length = 575 Score = 72.4 bits (176), Expect = 6e-12, Method: Composition-based stats. 
Identities = 49/162 (30%), Positives = 78/162 (48%), Gaps = 6/162 (3%) Query: 11 DITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 D ++ DVIVN +L +GGG + A+ + AGP L RQ + G +T Sbjct: 110 DDIRVVADVIVNTVPMNLQLGGGQLSQALLQKAGPELQKELYATRQGTEE-EVGSIFMTS 168 Query: 70 AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYP 129 +L KAV+H V P W G + Q++ + L V S++S+ FP I TG +P Sbjct: 169 GCNLNCKAVLHVVAPHWDNGAGSSQQIMANIIKKCLTTVEEFSFSSITFPMIGTGSLRFP 228 Query: 130 RAAAAEIAVKTVSEFITR--HALPEQVYFVCY--DEENAHLY 167 +A AE+ + V F + ++V+F+ Y D+E + Sbjct: 229 KAIFAELILSEVFRFSSSLWQKSLQEVHFLVYPGDDETLQAF 270 Score = 68.9 bits (167), Expect = 7e-11, Method: Composition-based stats. Identities = 48/146 (32%), Positives = 65/146 (44%), Gaps = 14/146 (9%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 + GDITK DVIVN+ + GV A+ AGPA+ + C VR Q P G Sbjct: 318 QIATGDITKEKADVIVNSTTRTFNLKSGVSKAVLEGAGPAVENEC-AVRAAQ---PHGEF 373 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 +IT G L K ++H +G D ++ L YTSVA PAI TG Sbjct: 374 IITQGGYLMCKIIIHVLG----------DNDVRKTVSAVLEECEQRKYTSVALPAIGTGS 423 Query: 126 YGYPRAAAAEIAVKTVSEFITRHALP 151 G A+ + V +F +H+ P Sbjct: 424 AGKNPTIVADDMISAVVDFSWKHSTP 449 >UniRef50_D1B7G8 Appr-1-p processing domain protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B7G8_THEAS Length = 179 Score = 72.4 bits (176), Expect = 6e-12, Method: Compositional matrix adjust. 
Identities = 49/137 (35%), Positives = 68/137 (49%), Gaps = 6/137 (4%) Query: 9 QGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVIT 68 +GDI D IVNAAN L G GV GAI R+AG + + +G G AV T Sbjct: 16 EGDICSYRGDAIVNAANDRLWMGSGVAGAIRRSAGEEVEAEAI----SKGPIRVGSAVAT 71 Query: 69 LAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGY 128 AG LP KAV+H V + + ++ + +LRL A +AFPA+ TGV G+ Sbjct: 72 GAGRLPLKAVIHCA--VMGQDLKTSREAIRSSTGEALRLAAEMELRRIAFPALGTGVGGF 129 Query: 129 PRAAAAEIAVKTVSEFI 145 P + + + EF+ Sbjct: 130 PVEECGHVMGEELKEFL 146 >UniRef50_A7BVQ6 Appr-1-p processing enzyme family n=1 Tax=Beggiatoa sp. PS RepID=A7BVQ6_9GAMM Length = 252 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 50/168 (29%), Positives = 75/168 (44%), Gaps = 18/168 (10%) Query: 5 IHVVQGDITKLAVD--VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I +++GDIT VD V+ A NP + R A+ A ++ Q Sbjct: 60 IEILRGDITTFTVDARVMTTAPNPE------IGSETSRYQLKAIFSALRRLNIYQ----- 108 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A I+ +LPA+ ++H V W+ G Q E L + Y + L S +AFP I Sbjct: 109 --AKISRTSNLPARYIIHIVESTWQQGTQQEIASLANNYRSCLTSATRKSLKVIAFPDII 166 Query: 123 TGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLY 167 + YP A A A K V EF+ + ++VYF+C +EE +Y Sbjct: 167 CSMSQYPIAQAVYTAFKEVLEFLMDKPNKSRFKKVYFICQNEEIYQIY 214 >UniRef50_UPI0001556316 PREDICTED: similar to LRP16 protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI0001556316 Length = 169 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 35/72 (48%), Positives = 49/72 (68%), Gaps = 1/72 (1%) Query: 78 VVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEI 136 V+HTVGP+ +G ++ Q L+ YLNSL+LV N SVAFP ISTGV+GYP AAA++ Sbjct: 80 VIHTVGPIAQGEPSPSQAQELRSCYLNSLQLVLENRLRSVAFPCISTGVFGYPNEAAAKV 139 Query: 137 AVKTVSEFITRH 148 + + E++ H Sbjct: 140 VLTALREWLEEH 151 >UniRef50_Q9YBE9 UPF0189 protein APE_1648.1 n=1 Tax=Aeropyrum pernix RepID=Y1648_AERPE Length = 189 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 56/168 (33%), Positives = 77/168 (45%), Gaps = 13/168 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GD+TK+ + +VN AN ++ GGG GA+ RA G + + ++ + P G Sbjct: 11 LAVSMGDLTKVRAEAVVNPANSLMIMGGGAAGALKRAGGSVIEEEAMR----KAPVPVGE 66 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNS---LRLVAANSYTSVAFPAI 121 AVIT G LPA+ V+H P E L +A+ S LRL + SVA PA+ Sbjct: 67 AVITSGGSLPARFVIHA--PTME--EPGMRIPLVNAFKASYAALRLASEAGIESVAMPAM 122 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYER 169 GV G A A A S I R P + V EE E+ Sbjct: 123 GAGVGGLSVAEVAREAAMAAS--ILRGKWPRYIILVARGEEAYRGMEK 168 >UniRef50_UPI000180B63C PREDICTED: similar to Ci-Rhysin2/Deltex3-a n=1 Tax=Ciona intestinalis RepID=UPI000180B63C Length = 897 Score = 71.2 bits (173), Expect = 2e-11, Method: Composition-based stats. 
Identities = 54/164 (32%), Positives = 84/164 (51%), Gaps = 6/164 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLK-VRQQQGD-CP 61 R+ V G++ DVIVNAAN L G GV GAI + G A C +R ++G Sbjct: 563 RVSVGMGNVAIQDTDVIVNAANNRLENGVGVTGAIFKQGGHAFQIECQNAMRARRGQLLA 622 Query: 62 TGHAVITLA-GDLPAKAVVHTVGPVWRGG-EQNE-DQLLQDAYLNSLRLVAANSYTSVAF 118 G AV+ A G+L + V+H VGP W ++N+ +L ++ L ++ + ++A Sbjct: 623 VGEAVMVNATGNLKCRKVIHLVGPQWHSYIDKNKCCSVLIQGIMSVLVEASSVNAKTIAI 682 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDE 161 P +STGVYG P A E+ K + R+ + +Q+ + DE Sbjct: 683 PPVSTGVYGVPVAVFVEMVKKCLGILKQRNDITLKQIRILSIDE 726 >UniRef50_UPI0001927649 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001927649 Length = 175 Score = 70.9 bits (172), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 40/107 (37%), Positives = 60/107 (56%), Gaps = 16/107 (14%) Query: 73 LPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAA 132 LPAK ++HTVGPV + + LL+ Y N L+LV ++AF ISTG+YGYP Sbjct: 75 LPAKYIIHTVGPVGKNPD-----LLESCYKNCLQLVLDFEIKTIAFCCISTGIYGYPNKD 129 Query: 133 AAEIAVKTVSEFITRHALPE------QVYFVCYDEENAHLYERLLTQ 173 AA +A+K V RH L E ++ F Y ++ +Y++L++ Sbjct: 130 AAHVALKYV-----RHWLQENYDKIDRIIFCTYISKDFEIYKKLMSN 171 >UniRef50_UPI00006CE511 hypothetical protein TTHERM_00141050 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CE511 Length = 267 Score = 70.1 bits (170), Expect = 3e-11, Method: Compositional matrix adjust. 
Identities = 43/171 (25%), Positives = 79/171 (46%), Gaps = 4/171 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +++G+I +D IVN + LM + +A L V+ +G Sbjct: 28 IIILKGNICNENIDCIVNWVDCFLMNERTY--ILKQALNDKLKKELDSVKHSKGILTLND 85 Query: 65 AVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 IT G L K ++H+ P+WRGG + E Q +++ ++L + +S+ F S+ Sbjct: 86 CFITSPGKLQNTKKIIHSTLPLWRGGHEKELQYFEESITQCIQLAINQNMSSIGFTQDSS 145 Query: 124 GVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLLTQ 173 ++G P AEI +++ F T + ++VYF+ D +Y+ L + Sbjct: 146 DIFGIPLQDCAEILIQSFYRFATFKDTSIKRVYFIHQDSSAIQVYKNKLLK 196 >UniRef50_UPI0001C3795F Appr-1-p processing domain protein n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C3795F Length = 232 Score = 68.9 bits (167), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 55/159 (34%), Positives = 78/159 (49%), Gaps = 6/159 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPT 62 ++++++ DIT L VD IV ANP L G G AI AG L C + Sbjct: 2 KMYIIKADITTLNVDAIVLPANPQLKKGAGASQAIFEKAGEEELRKKCKSI----APIDV 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A+ T +LP++ ++H P W G NE LL AYL+SL++ +SVAFP +S Sbjct: 58 GSAIPTGGYNLPSEFIIHAAVPRWVDGGHNEYVLLSSAYLSSLKVADRIGVSSVAFPLLS 117 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE 161 G+ A +A KT+ F L + VY YD+ Sbjct: 118 ASNNGFDPRVAFYVAQKTIESFKADKTL-KDVYLTIYDK 155 >UniRef50_C2QLJ2 Appr-1-p processing enzyme n=8 Tax=Bacillus cereus group RepID=C2QLJ2_BACCE Length = 159 Score = 68.9 bits (167), Expect = 7e-11, Method: Compositional matrix adjust. 
Identities = 49/168 (29%), Positives = 84/168 (50%), Gaps = 20/168 (11%) Query: 4 RIHVVQGDITKLA--VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 R+ + GDIT VDVIVNA N + + G GV I + AG ++ + ++ Sbjct: 2 RVFIENGDITNYCDKVDVIVNAWNRNFIPGFLLIASGVSRTIFKKAGKSVYEEV----RR 57 Query: 57 QGDCPTGHAVITLAGDLPAKAVVHTVG--PVWRGGEQNEDQLLQDAYLNSLRLVAANSYT 114 +G G AVIT +G L KA++H G W+ E + ++++ N+L+L+ +Y Sbjct: 58 KGPLKIGEAVITSSGFLDCKAIIHVAGINAFWKASEYS----IRESTRNALQLLIKENYK 113 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEE 162 S+A P I +G GY + +K E ++++ V+ V Y+ + Sbjct: 114 SIAIPLIGSGSGGYKKEKCIAF-IKEECEKFQKYSV--NVFIVNYESD 158 >UniRef50_UPI000180BD0B PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=2 Tax=Ciona intestinalis RepID=UPI000180BD0B Length = 1729 Score = 68.9 bits (167), Expect = 8e-11, Method: Composition-based stats. Identities = 46/148 (31%), Positives = 76/148 (51%), Gaps = 7/148 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I+V++ DIT+ D I+NA+NP L + GG+ GAI + G + + V ++G G Sbjct: 905 INVLKTDITQHECDAILNASNPELDLLPGGISGAIQKTGGDKIQEEMHAVISKRGKLFPG 964 Query: 64 HAVITLAGDLPA-KAVVHTVGPVWRGGEQNED---QLLQDAYLNSLRLVAANSYTSVAFP 119 A IT AG L + ++H VGP W E + + LQ +++ + S++ P Sbjct: 965 DAAITGAGKLKTCRFIIHAVGPRW--AEHSHSTCCKYLQSCINYAMQEAESKRLRSISIP 1022 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITR 147 AIS GV+G + + V TV ++ + Sbjct: 1023 AISCGVFGGVPSVCIPLIVDTVLDYFKQ 1050 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. 
Identities = 38/142 (26%), Positives = 59/142 (41%), Gaps = 11/142 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V QGD+T D IVN+ NP + G + AI + G +L+ C + QQG + Sbjct: 1122 ISVSQGDLTLDNSDAIVNSTNPQFDLTQGMISQAILKKGGRTVLNEC---KNQQGQWNSP 1178 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T G L + V H V P N + + L + ++A PA+ T Sbjct: 1179 RIRVTSGGKLQCRYVFHIVTP-------NNTKQITSVLLEVFTIADKLGLATLALPALGT 1231 Query: 124 GVYGYPRAAAAEIAVKTVSEFI 145 G G A+ + E++ Sbjct: 1232 GNLGIESLRIAQCIRGAIKEYV 1253 >UniRef50_Q9P0M6 Core histone macro-H2A.2 n=118 Tax=Eukaryota RepID=H2AW_HUMAN Length = 372 Score = 68.6 bits (166), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 44/169 (26%), Positives = 81/169 (47%), Gaps = 7/169 (4%) Query: 4 RIHVVQGDIT---KLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 ++ + Q DI+ + V+ IV+ + + A+ +A G L+ ++R+ QG Sbjct: 196 KLSLTQSDISHIGSMRVEGIVHPTTAEIDLKEDIGKALEKAGGKEFLETVKELRKSQGPL 255 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 A ++ + L AK V+H P W G ++ E+Q L++ N L SVAFP Sbjct: 256 EVAEAAVSQSSGLAAKFVIHCHIPQW-GSDKCEEQ-LEETIKNCLSAAEDKKLKSVAFPP 313 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLY 167 +G +P+ AA++ +K +S + + VYF+ +D E+ +Y Sbjct: 314 FPSGRNCFPKQTAAQVTLKAISAHFDDSSASSLKNVYFLLFDSESIGIY 362 >UniRef50_C1XFR0 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=2 Tax=Meiothermus RepID=C1XFR0_MEIRU Length = 163 Score = 68.6 bits (166), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 45/116 (38%), Positives = 59/116 (50%), Gaps = 7/116 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 RI V QGDIT+ A D IVNAAN L+ G GV GAI R GP++ C + G G Sbjct: 3 RIQVAQGDITEFAGDAIVNAANNHLILGSGVAGAIRRRGGPSIQGEC----DRHGPIRVG 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 A +T AG LP + V+H G + ++ A +LRL + +AFP Sbjct: 59 EAALTGAGQLPVRKVIHA---AVLGDQPATLDTVRSATQAALRLALEHRLYRLAFP 111 >UniRef50_C3ZVW0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZVW0_BRAFL Length = 731 Score = 68.6 bits (166), Expect = 1e-10, Method: Composition-based stats. Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 2/151 (1%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T+I + QGD+ K VDVIVN N L G + A+ + G + C + G Sbjct: 538 LDTKISIYQGDVIKECVDVIVNETNDRLKLSGELSWALAQYGGHDIEADCRRYVATHGRL 597 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAAN-SYTSVAF 118 V T AG LP+K ++H V P W E + LL Y N + TS+A Sbjct: 598 AATQVVPTSAGQLPSKHILHAVVPHWVSAHPRESKMLLYKTYENIFKCAGIKMRVTSIAL 657 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHA 149 ++G G P+ AE + V F+ + Sbjct: 658 SLQTSGSTGIPKDVYAETMFQAVVSFLKTYG 688 >UniRef50_B5Y5Y4 Appr-1-p processing enzyme family protein n=2 Tax=Firmicutes RepID=B5Y5Y4_COPPD Length = 172 Score = 68.2 bits (165), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 47/133 (35%), Positives = 69/133 (51%), Gaps = 2/133 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ +V GDITK DVIVNAAN GGGV AI +A G + D ++V Q P G Sbjct: 8 KVKLVMGDITKAEADVIVNAANGIGPMGGGVALAIKKAGGKVIEDEAIRVCSQLDPRP-G 66 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T AG L AK + H V + R E + ++++ + L S+ PA++T Sbjct: 67 DVYVTTAGGLKAKYIFHAV-TMKRPAEPSSVEIVRKCLQSLLEKAREMKVKSMVLPALAT 125 Query: 124 GVYGYPRAAAAEI 136 GV G P+ A++ Sbjct: 126 GVGGVPKKDVAKV 138 >UniRef50_Q1DG64 Appr-1-p processing enzyme family domain protein n=2 Tax=Bacteria RepID=Q1DG64_MYXXD Length = 154 Score = 68.2 bits (165), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 49/139 (35%), Positives = 68/139 (48%), Gaps = 15/139 (10%) Query: 5 IHVVQGDITKLAVDVIVNAANPS-----LMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + VV+GD+ V+ IVNA N + L+ GV GAI R G + G Sbjct: 3 VSVVEGDLLDQPVEAIVNAWNRNIIPWWLLLPQGVSGAIKRRGGVGPFREVARA----GP 58 Query: 60 CPTGHAVITLAGDLPAKAVVHTVG--PVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 P G AV+T AG LP KA++H G +WR E++ +QD+ N+L + SVA Sbjct: 59 MPLGSAVVTSAGRLPFKAIIHVAGIDMLWRASERS----IQDSVRNALAKAREQGFRSVA 114 Query: 118 FPAISTGVYGYPRAAAAEI 136 FP I G + A A E+ Sbjct: 115 FPVIGAGSGSFDEARALEL 133 >UniRef50_Q5V4P3 Putative uncharacterized protein n=1 Tax=Haloarcula marismortui RepID=Q5V4P3_HALMA Length = 166 Score = 67.8 bits (164), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 54/170 (31%), Positives = 79/170 (46%), Gaps = 14/170 (8%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V+QGDI + D +VNAAN SL G GV GA+ RAAG L D + +G G Sbjct: 2 EFEVIQGDIAAQSADALVNAANTSLRMGSGVAGALKRAAGSGLNDEAVA----KGPVDLG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 T A DL A+ V+H G Q+ + +++A N+L A + SV FPAI Sbjct: 58 GVATTDAYDLDAEYVIHAA--AMPPGGQSTAESIRNATRNALAEADALNCESVVFPAIGC 115 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQ---VYFVCYDEENAHLYERL 170 G+ G+ I + E+ PE V + Y +++ +R+ Sbjct: 116 GIAGFDFEEGIRIICAVIEEY-----QPESLTDVRLIAYSDDDFEGMQRV 160 >UniRef50_A2BJA7 A1pp, Appr-1-p processing enzyme n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BJA7_HYPBU Length = 199 Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 48/166 (28%), Positives = 82/166 (49%), Gaps = 8/166 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + +GDIT+ + +VN AN ++ GGGV GA+ RAAGP + + +++ P G Sbjct: 16 VEIARGDITEAECEAVVNPANSLMIMGGGVAGALRRAAGPEVEEEA----RRKAPVPVGE 71 Query: 65 AVITLAGDLPA--KAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A+ T AG L K ++H + R + + A L +LR + +A PA+ Sbjct: 72 AIHTGAGRLEPRIKYIIHAP-TMERPAMRTTQGKVVKAVLAALREAEKLNVGCLALPAMG 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLY 167 GV G + E ++ + EF+ + LP ++ V Y E +A + Sbjct: 131 AGVGGLTARESLEAIMEALDEFLGSGGKLPPRIILVAYSERDAKQF 176 >UniRef50_Q1YRE7 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YRE7_9GAMM Length = 167 Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 50/171 (29%), Positives = 83/171 (48%), Gaps = 21/171 (12%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHR---AAGPALLDACLKVRQQQGDC 60 RI + QG I L V+ +V+ + S GA+ R A+G L+ L++ GD Sbjct: 13 RIKIHQGKIATLNVEAVVSCYSQS--------GALERLAVASGDGLVP--LRI----GD- 57 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 V+ A ++ ++ ++ +GP WRGG+ E+Q L Y ++ + + S+AF Sbjct: 58 ---VHVVAEAVEVTSRILIEAIGPRWRGGDYQEEQQLASCYSKAMDVAKQYNVRSIAFTP 114 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 IS G G+P A +A++ + + R+ L E V F C+D LY L Sbjct: 115 ISCGPLGFPANRATNVAIQQIKLGLGRNPLIESVIFCCFDPVTTALYRSRL 165 >UniRef50_UPI0001698AE7 Appr-1-p processing domain protein n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI0001698AE7 Length = 79 Score = 67.0 bits (162), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 33/40 (82%), Positives = 36/40 (90%) Query: 8 VQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALL 47 VQGDIT+L VD IVNAAN SL+GGGGVDGAIHRAAGP L+ Sbjct: 13 VQGDITQLEVDAIVNAANSSLLGGGGVDGAIHRAAGPELV 52 >UniRef50_D2VX30 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VX30_NAEGR Length = 249 Score = 66.6 bits (161), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 47/154 (30%), Positives = 74/154 (48%), Gaps = 25/154 (16%) Query: 11 DITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACL-----------KVRQQQGD 59 + TK V +VN +N +L+ GGG+ I + G ++ L +V +++ Sbjct: 97 NYTKEMVVSLVNPSNEALVIGGGLASVIFQCCGRCEMENSLADPSQWKYKSFEVNEKEIH 156 Query: 60 CPTGHAVITLAGDLPAKA----VVHTVGPVWRG----GEQNEDQLLQDAYLNSLRLVAAN 111 CP G ++T + + ++HT GP RG G+ LL + Y NSLR N Sbjct: 157 CPVGEILVTSSFKMQESNGFNYIIHTSGP--RGSQKYGDATNSLLLANCYRNSLRFFIDN 214 Query: 112 SYTSVA----FPAISTGVYGYPRAAAAEIAVKTV 141 + + A P ISTG +GYP+ A+ IA+ TV Sbjct: 215 ARNTKASTLILPCISTGKFGYPKIEASAIALGTV 248 >UniRef50_O07733 UPF0189 protein Rv1899c/MT1950 n=16 Tax=Mycobacterium RepID=Y1899_MYCTU Length = 359 Score = 66.6 bits (161), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 52/139 (37%), Positives = 66/139 (47%), Gaps = 10/139 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT-- 62 + V Q D+TKL +D I NAAN L GGV AI RA GP L R+ P Sbjct: 192 LEVHQADVTKLELDAITNAANTRLRHAGGVAAAIARAGGPELQ------RESTEKAPIGL 245 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV T AGD+PA+ V+H G +++ A +LR S+A A Sbjct: 246 GEAVETTAGDMPARYVIHAA--TMELGGPTSGEIITAATAATLRKADELGCRSLALVAFG 303 Query: 123 TGVYGYPRAAAAEIAVKTV 141 TGV G+P AA + V V Sbjct: 304 TGVGGFPLDDAARLMVGAV 322 >UniRef50_Q54PT1 Protein GDAP2 homolog n=1 Tax=Dictyostelium discoideum RepID=GDAP2_DICDI Length = 568 Score = 66.2 bits (160), Expect = 5e-10, Method: Composition-based stats. Identities = 44/174 (25%), Positives = 81/174 (46%), Gaps = 7/174 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI + GDI L D IV + + +L + I + G +++ Q+ G+C Sbjct: 55 INSRICLWMGDICNLNTDTIVYSNSKTLTESDTISDKIFKYGGSEMMNDI----QKNGEC 110 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 G ++IT G+LP++ VVHTV P + + + L Y ++ L S++F Sbjct: 111 RYGESIITSGGNLPSRFVVHTVCPTYNPKYLSAAENALNSCYRSAFHLSMDVKSKSISFS 170 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLL 171 + + +P IA++T+ F+ + E+V E+ LYE++L Sbjct: 171 TLHSEKRQFPSVGGCHIALRTIRRFLEKPFSKSFEKVILAINTFEDLRLYEQML 224 >UniRef50_Q7JUR6 Protein GDAP2 homolog n=19 Tax=Neoptera RepID=GDAP2_DROME Length = 540 Score = 66.2 bits (160), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 49/171 (28%), Positives = 76/171 (44%), Gaps = 6/171 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R + GD+T L VD I N ++ +L + I AG L + ++ +C Sbjct: 65 VNNRFVIWDGDMTTLEVDAITNTSDETLTESNSISERIFAVAGNQLRE---ELSTTVKEC 121 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG IT +LPAK V+HTV P +R + + L Y N L + ++A Sbjct: 122 RTGDVRITRGYNLPAKYVLHTVAPAYREKFKTAAENTLHCCYRNVLCKAKELNLHTIALC 181 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 IS +P AA IA++T+ ++ + L QV +C YE L Sbjct: 182 NISAHQKSFPADVAAHIALRTIRRYLDKCTL--QVVILCVGSSERGTYEVL 230 >UniRef50_A3DLM0 Appr-1-p processing domain protein n=1 Tax=Staphylothermus marinus F1 RepID=A3DLM0_STAMF Length = 192 Score = 65.9 bits (159), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 47/144 (32%), Positives = 69/144 (47%), Gaps = 11/144 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V+GDIT+L V+ IVN AN ++ GGG+ G + R G + + K P G Sbjct: 17 IKGVKGDITELDVEAIVNPANSFMLMGGGLAGVLKRKGGEIIENEAKKF----APVPVGK 72 Query: 65 AVITLAGDLPAKAVVHTV---GPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 AV+T+AG L AK ++H P R +N + A + L S +A P + Sbjct: 73 AVVTIAGVLKAKYIIHAPTMEKPAMRINPENAYKATFAALTKAFDL----SLNRIAVPGM 128 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI 145 TGV G + A + K + EF+ Sbjct: 129 GTGVGGLSPSDAGKAMAKAIKEFL 152 >UniRef50_UPI00006A2286 UPI00006A2286 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2286 Length = 566 Score = 65.9 bits (159), Expect = 6e-10, Method: Composition-based stats. 
Identities = 51/177 (28%), Positives = 77/177 (43%), Gaps = 6/177 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 K + V+Q I DVIVN L + + A+ AGP L L Q Sbjct: 11 KELLKVIQQAIEDSTTDVIVNNVGQKLQLNEWQISRALAARAGPQL-QQLLSNSSQGASA 69 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVW--RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 P G T +L V+H V P W RG Q+L+ + + L+L S S++ Sbjct: 70 PNGSVFSTDGCNLNCAKVLHVVMPQWDRRGFSLTHTQVLRKSIKSCLKLTEQQSLQSISI 129 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHLYERLLTQ 173 PAI TG GYP+ A + K + F ++ ++V V + D EN ++ + L + Sbjct: 130 PAIGTGKLGYPKDLVAAVTFKEILHFSSKAQSLQEVNIVLHPRDTENIQVFSKELQR 186 Score = 58.9 bits (141), Expect = 7e-08, Method: Composition-based stats. Identities = 49/152 (32%), Positives = 69/152 (45%), Gaps = 19/152 (12%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 V GDITK D+IVN+ N S GV AI AAGP++ C QQ G P G Sbjct: 228 QVKTGDITKENTDIIVNSTNNSFTLQSGVSKAILDAAGPSVTLEC----QQLG--PQGQT 281 Query: 66 --VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 ++T +G+L ++H VG Q + + +Q L+ L+ SVA PA+ T Sbjct: 282 SFILTQSGNLQCTNILHVVG-------QTDPKCIQRCVLDILQECNRLQMASVALPAMGT 334 Query: 124 GVYGYP----RAAAAEIAVKTVSEFITRHALP 151 G G + A + V +F+ A P Sbjct: 335 GETGAQVRLGHSIVAGAMLDGVEDFVKSQAAP 366 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0A8D8 UPF0189 protein ymdB n=66 Tax=Proteobacteria Rep... 267 2e-70 UniRef50_P67342 UPF0189 protein ymdB n=62 Tax=Bacteria RepID=YMD... 255 5e-67 UniRef50_A4W960 Appr-1-p processing domain protein n=5 Tax=Bacte... 250 2e-65 UniRef50_Q8EYT0 UPF0189 protein LA_4133 n=9 Tax=cellular organis... 243 2e-63 UniRef50_D1N530 Appr-1-p processing domain protein n=4 Tax=cellu... 232 6e-60 UniRef50_Q8PHB6 UPF0189 protein XAC3343 n=14 Tax=Proteobacteria ... 
229 3e-59 UniRef50_C1PFE0 Appr-1-p processing domain protein n=5 Tax=Bacte... 228 4e-59 UniRef50_Q8KAE4 UPF0189 protein CT2219 n=7 Tax=Bacteria RepID=Y2... 228 5e-59 UniRef50_Q8RB30 UPF0189 protein TTE0995 n=12 Tax=Bacteria RepID=... 228 7e-59 UniRef50_C7NE27 Appr-1-p processing domain protein n=2 Tax=Lepto... 227 9e-59 UniRef50_Q8Q0F9 UPF0189 protein MM_0177 n=18 Tax=cellular organi... 227 1e-58 UniRef50_D1AKA5 Appr-1-p processing domain protein n=2 Tax=Bacte... 226 2e-58 UniRef50_Q3AEI4 Putative uncharacterized protein n=6 Tax=Bacteri... 226 2e-58 UniRef50_Q9WYX8 UPF0189 protein TM_0508 n=15 Tax=cellular organi... 220 2e-56 UniRef50_C1H4Y3 MACRO domain-containing protein n=4 Tax=cellular... 220 2e-56 UniRef50_A8ZUR5 Appr-1-p processing domain protein n=3 Tax=cellu... 220 2e-56 UniRef50_Q8TQD0 UPF0189 protein MA_1614 n=2 Tax=cellular organis... 219 3e-56 UniRef50_Q047N9 Predicted phosphatase, histone macroH2A1 family ... 218 7e-56 UniRef50_Q0CQJ0 Protein LRP16 n=10 Tax=cellular organisms RepID=... 218 8e-56 UniRef50_A7RJ44 Predicted protein (Fragment) n=4 Tax=Eukaryota R... 217 2e-55 UniRef50_Q9HXU7 UPF0189 protein PA3693 n=16 Tax=Bacteria RepID=Y... 217 2e-55 UniRef50_C8NAC1 RNase III regulator YmdB n=3 Tax=cellular organi... 217 2e-55 UniRef50_C7RS37 Appr-1-p processing domain protein n=15 Tax=cell... 216 2e-55 UniRef50_Q6PHJ5 Zgc:65960 n=11 Tax=cellular organisms RepID=Q6PH... 216 2e-55 UniRef50_A1Z1Q3 MACRO domain-containing protein 2 n=55 Tax=cellu... 216 3e-55 UniRef50_C1BR35 MACRO domain-containing protein 1 n=2 Tax=Caligu... 214 1e-54 UniRef50_A5GC80 Appr-1-p processing domain protein n=2 Tax=Desul... 214 1e-54 UniRef50_Q1HPZ5 LRP16 protein n=1 Tax=Bombyx mori RepID=Q1HPZ5_B... 213 1e-54 UniRef50_C6RT62 Appr-1-p processing n=2 Tax=Acinetobacter radior... 213 2e-54 UniRef50_A4R3Q9 Putative uncharacterized protein n=1 Tax=Magnapo... 
213 2e-54 UniRef50_Q66HV6 Zgc:92353 n=1 Tax=Danio rerio RepID=Q66HV6_DANRE 212 3e-54 UniRef50_Q8Y2K1 UPF0189 protein RSc0334 n=39 Tax=cellular organi... 211 8e-54 UniRef50_Q9BQ69 MACRO domain-containing protein 1 n=11 Tax=Tetra... 210 2e-53 UniRef50_C6BB95 Appr-1-p processing domain protein n=4 Tax=cellu... 210 2e-53 UniRef50_Q71W03 UPF0189 protein LMOf2365_2748 n=23 Tax=Bacteria ... 210 2e-53 UniRef50_A6BCW6 Putative uncharacterized protein n=5 Tax=Bacteri... 209 3e-53 UniRef50_Q0UQZ6 Putative uncharacterized protein n=2 Tax=Leotiom... 209 3e-53 UniRef50_Q8K4G6 MACRO domain-containing protein 1 (Fragment) n=5... 209 4e-53 UniRef50_B7PF53 MACRO domain-containing protein, putative n=2 Ta... 208 5e-53 UniRef50_B7C8M6 Putative uncharacterized protein n=3 Tax=Bacteri... 208 8e-53 UniRef50_C4V1Q4 Appr-1-p processing domain protein n=3 Tax=Bacte... 207 1e-52 UniRef50_Q93SX7 UPF0189 protein n=2 Tax=Acinetobacter RepID=Y189... 207 1e-52 UniRef50_B6Q324 LRP16 family protein n=3 Tax=Trichocomaceae RepI... 206 3e-52 UniRef50_B2ACK5 Predicted CDS Pa_3_1270 n=5 Tax=Eukaryota RepID=... 205 8e-52 UniRef50_Q5KCD7 Putative uncharacterized protein n=1 Tax=Filobas... 204 1e-51 UniRef50_Q985D2 UPF0189 protein mll7730 n=12 Tax=Bacteria RepID=... 204 1e-51 UniRef50_B8HYS5 Appr-1-p processing domain protein n=2 Tax=Cyano... 203 2e-51 UniRef50_Q8EP31 Hypothetical conserved protein n=1 Tax=Oceanobac... 203 2e-51 UniRef50_D1ZDH8 Whole genome shotgun sequence assembly, scaffold... 203 2e-51 UniRef50_Q2LUU1 Appr-1-p histone processing protein n=5 Tax=Bact... 202 6e-51 UniRef50_B9MLL8 Appr-1-p processing domain protein n=6 Tax=Clost... 201 7e-51 UniRef50_A2SS36 Appr-1-p processing domain protein n=26 Tax=cell... 201 7e-51 UniRef50_Q03IQ8 Predicted phosphatase homologous to the C-termin... 201 9e-51 UniRef50_C8VIG2 LRP16 family protein (AFU_orthologue; AFUA_3G138... 201 1e-50 UniRef50_A7IGI6 Appr-1-p processing domain protein n=53 Tax=cell... 
200 2e-50 UniRef50_C7N880 Predicted phosphatase, C-terminal domain of hist... 200 2e-50 UniRef50_Q4P1I0 Putative uncharacterized protein n=1 Tax=Ustilag... 199 3e-50 UniRef50_Q47EQ7 Appr-1-p processing n=1 Tax=Dechloromonas aromat... 198 4e-50 UniRef50_C9KLM2 Appr-1-p processing enzyme family domain protein... 198 5e-50 UniRef50_B2JCA0 Appr-1-p processing domain protein n=13 Tax=Prot... 198 5e-50 UniRef50_C4Q6S1 Expressed protein n=1 Tax=Schistosoma mansoni Re... 198 6e-50 UniRef50_Q9EYI6 UPF0189 protein in sno 5'region n=22 Tax=Bacteri... 198 6e-50 UniRef50_C2LSS3 Protein in Tap1-dppD intergenic region n=1 Tax=S... 197 1e-49 UniRef50_Q9HJ67 UPF0189 protein Ta1105 n=1 Tax=Thermoplasma acid... 197 1e-49 UniRef50_P67344 UPF0189 protein SA0314 n=54 Tax=Staphylococcus R... 197 2e-49 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 197 2e-49 UniRef50_Q93RG0 UPF0189 protein in tap1-dppD intergenic region n... 195 6e-49 UniRef50_A0LGZ1 Appr-1-p processing domain protein n=1 Tax=Syntr... 194 1e-48 UniRef50_Q1R0S7 Appr-1-p processing n=12 Tax=Proteobacteria RepI... 193 2e-48 UniRef50_C9LYS3 Appr-1-p processing enzyme family domain protein... 193 2e-48 UniRef50_A7BY23 Putative uncharacterized protein n=3 Tax=Beggiat... 193 2e-48 UniRef50_B6KFB3 Appr-1-p processing enzyme family domain-contain... 193 2e-48 UniRef50_A4YFR3 Appr-1-p processing domain protein n=9 Tax=Therm... 193 2e-48 UniRef50_B6SKT6 Protein LRP16 n=12 Tax=cellular organisms RepID=... 193 3e-48 UniRef50_C4M8N0 Putative uncharacterized protein n=2 Tax=Entamoe... 192 3e-48 UniRef50_A5ZAB5 Putative uncharacterized protein n=4 Tax=Clostri... 192 4e-48 UniRef50_B9S4E3 Protein LRP16, putative n=2 Tax=cellular organis... 192 5e-48 UniRef50_C8NG26 Appr-1-p processing enzyme family domain protein... 192 6e-48 UniRef50_UPI000186F16D conserved hypothetical protein n=1 Tax=Pe... 192 6e-48 UniRef50_C4FEN5 Putative uncharacterized protein n=1 Tax=Bifidob... 
191 1e-47 UniRef50_B9XAD9 Appr-1-p processing domain protein n=1 Tax=bacte... 191 1e-47 UniRef50_D1BM15 Appr-1-p processing domain protein n=15 Tax=Bact... 191 1e-47 UniRef50_Q0B030 Phosphatase n=1 Tax=Syntrophomonas wolfei subsp.... 188 5e-47 UniRef50_A8FSV2 Putative uncharacterized protein n=1 Tax=Shewane... 188 8e-47 UniRef50_A7B8S3 Putative uncharacterized protein n=1 Tax=Actinom... 188 8e-47 UniRef50_B5YAF3 Conserved protein n=2 Tax=Dictyoglomus RepID=B5Y... 188 9e-47 UniRef50_D1U7C0 Appr-1-p processing domain protein n=1 Tax=Desul... 186 2e-46 UniRef50_B0EH33 Putative uncharacterized protein n=2 Tax=Entamoe... 186 2e-46 UniRef50_Q5XC09 UPF0189 protein M6_Spy0919 n=20 Tax=Streptococcu... 186 2e-46 UniRef50_A0L536 Appr-1-p processing domain protein n=1 Tax=Magne... 186 2e-46 UniRef50_UPI0001B4DEB3 hypothetical protein ShygA5_39675 n=1 Tax... 186 3e-46 UniRef50_A6GJ81 Putative uncharacterized protein n=1 Tax=Plesioc... 186 3e-46 UniRef50_C7GZB8 Appr-1-p processing enzyme family domain protein... 186 3e-46 UniRef50_B8LP86 Putative uncharacterized protein n=1 Tax=Picea s... 184 1e-45 UniRef50_B8I4Z8 Appr-1-p processing domain protein n=7 Tax=Bacte... 184 1e-45 UniRef50_C2D2Z2 Appr-1-p processing enzyme family domain protein... 184 1e-45 UniRef50_A0Q2I9 Appr-1-p processing enzyme family protein n=3 Ta... 183 2e-45 UniRef50_C8WYT5 Appr-1-p processing domain protein n=1 Tax=Desul... 183 2e-45 UniRef50_A8H4N3 Appr-1-p processing domain protein n=1 Tax=Shewa... 183 2e-45 UniRef50_C1QBX0 Predicted phosphatase similar to C-terminal doma... 183 2e-45 UniRef50_B7C850 Putative uncharacterized protein n=1 Tax=Eubacte... 183 2e-45 UniRef50_B9WC14 Putative uncharacterized protein n=5 Tax=Candida... 181 6e-45 UniRef50_UPI00006A2284 UPI00006A2284 related cluster n=1 Tax=Xen... 181 8e-45 UniRef50_Q97AU0 UPF0189 protein TV0719 n=2 Tax=cellular organism... 181 9e-45 UniRef50_B8DKL2 Appr-1-p processing domain protein n=3 Tax=Desul... 
181  1e-44
UniRef50_C4FT52 Putative uncharacterized protein n=1 Tax=Catonel...  180  1e-44
UniRef50_Q17432 Protein B0035.3, confirmed by transcript evidenc...  180  1e-44
UniRef50_A8JCH3 Predicted protein (Fragment) n=1 Tax=Chlamydomon...  180  1e-44
UniRef50_B0A8R6 Putative uncharacterized protein n=3 Tax=Bacteri...  180  1e-44
UniRef50_A9WK70 Appr-1-p processing domain protein n=3 Tax=Chlor...  180  2e-44
UniRef50_Q87JZ5 UPF0189 protein VPA0103 n=5 Tax=Proteobacteria R...  180  2e-44
UniRef50_C7H575 RNase III regulator YmdB n=2 Tax=Faecalibacteriu...  179  3e-44
UniRef50_B9YC00 Putative uncharacterized protein n=1 Tax=Holdema...  179  4e-44
UniRef50_C9XM94 Putative uncharacterized protein n=6 Tax=Clostri...  179  4e-44
UniRef50_A8M6L5 Appr-1-p processing domain protein n=2 Tax=Micro...  178  6e-44
UniRef50_C4DDL7 Predicted phosphatase similar to C-terminal doma...  178  6e-44
UniRef50_UPI000050FFC7 predicted phosphatase, C-terminal domain ...  178  7e-44
UniRef50_UPI000194CBCB PREDICTED: poly (ADP-ribose) polymerase f...  178  1e-43
UniRef50_C4V152 Appr-1-p processing protein n=2 Tax=Clostridiale...  177  2e-43
UniRef50_D1VVA5 Putative uncharacterized protein n=1 Tax=Peptoni...  177  2e-43
UniRef50_Q30ZH6 Appr-1-p processing n=1 Tax=Desulfovibrio desulf...  176  3e-43
UniRef50_B1KG04 Appr-1-p processing domain protein n=1 Tax=Shewa...  176  3e-43
UniRef50_UPI0000ECB76F Poly [ADP-ribose] polymerase 14 (EC 2.4.2...  176  4e-43
UniRef50_C5CIT5 Appr-1-p processing domain protein n=1 Tax=Kosmo...  176  4e-43
UniRef50_B0EF86 MACRO domain-containing protein, putative n=2 Ta...  175  5e-43
UniRef50_A7T167 Protein GDAP2 homolog n=1 Tax=Nematostella vecte...  175  5e-43
UniRef50_Q22CT8 Appr-1-p processing enzyme family protein n=1 Ta...  175  8e-43
UniRef50_A3LYE6 Putative uncharacterized protein n=1 Tax=Pichia ...  174  1e-42
UniRef50_A8FQZ3 Putative uncharacterized protein n=1 Tax=Shewane...  173  2e-42
UniRef50_A2FMC7 Appr-1-p processing enzyme family protein n=1 Ta...  173  2e-42
UniRef50_C3YH95 Putative uncharacterized protein n=2 Tax=Eumetaz...  173  2e-42
UniRef50_Q460N5 Poly [ADP-ribose] polymerase 14 n=19 Tax=Eutheri...  173  2e-42
UniRef50_A9SRF5 Predicted protein n=1 Tax=Physcomitrella patens ...  173  3e-42
UniRef50_C3Y5X0 Putative uncharacterized protein n=3 Tax=Branchi...  173  3e-42
UniRef50_A8STD9 Putative uncharacterized protein n=1 Tax=Coproco...  172  5e-42
UniRef50_C2DZH9 Appr-1-p processing protein n=4 Tax=Lactobacillu...  171  6e-42
UniRef50_C0W547 Appr-1-p processing domain protein n=1 Tax=Actin...  171  6e-42
UniRef50_Q8ZXT3 UPF0189 protein PAE1111 n=10 Tax=Thermoprotei Re...  171  9e-42
UniRef50_UPI000196AD9C hypothetical protein CATMIT_00588 n=1 Tax...  170  1e-41
UniRef50_UPI0000E80997 PREDICTED: similar to Poly [ADP-ribose] p...  170  2e-41
UniRef50_A1D5K4 Appr-1-p processing enzyme family protein n=1 Ta...  170  2e-41
UniRef50_UPI0000E4D641 UPI0000E4D641 related cluster n=2 Tax=Dan...  170  3e-41
UniRef50_Q2TX23 Predicted phosphatase homologous to the C-termin...  169  4e-41
UniRef50_A1WVH3 Appr-1-p processing domain protein n=14 Tax=Bact...  169  5e-41
UniRef50_Q4DSL4 Putative uncharacterized protein n=4 Tax=Trypano...  168  5e-41
UniRef50_B3RYC4 Putative uncharacterized protein n=1 Tax=Trichop...  168  5e-41
UniRef50_Q94JV1 At1g69340/F10D13.28 n=23 Tax=Embryophyta RepID=Q...  168  7e-41
UniRef50_C0PSL1 Putative uncharacterized protein n=1 Tax=Picea s...  168  7e-41
UniRef50_C4G1S1 Putative uncharacterized protein n=3 Tax=Abiotro...  167  1e-40
UniRef50_C9RQW9 Appr-1-p processing domain protein n=5 Tax=Bacte...  167  1e-40
UniRef50_A2DTG7 Appr-1-p processing enzyme family protein n=2 Ta...  167  2e-40
UniRef50_C5C222 Appr-1-p processing domain protein n=2 Tax=Actin...  166  2e-40
UniRef50_C2L199 Putative uncharacterized protein n=1 Tax=Oribact...  166  2e-40
UniRef50_UPI0000F2CC13 PREDICTED: similar to B aggressive lympho...  166  3e-40
UniRef50_D2V113 Appr-1-p domain-containing protein n=1 Tax=Naegl...  166  3e-40
UniRef50_A2QSI2 Contig An08c0280, complete genome n=1 Tax=Asperg...  166  3e-40
UniRef50_C2KRZ5 Appr-1-p processing domain protein n=2 Tax=Mobil...  166  3e-40
UniRef50_D0MWM6 Putative uncharacterized protein n=1 Tax=Phytoph...  165  4e-40
UniRef50_C5VD03 Appr-1-p processing enzyme family protein n=2 Ta...  165  4e-40
UniRef50_Q6NRC6 MGC83934 protein n=3 Tax=Xenopus RepID=Q6NRC6_XENLA  165  7e-40
UniRef50_A0CX10 Chromosome undetermined scaffold_3, whole genome...  165  8e-40
UniRef50_D2V337 Predicted protein (Fragment) n=1 Tax=Naegleria g...  165  9e-40
UniRef50_Q8B4N1 ORF-1 n=7 Tax=Infectious spleen and kidney necro...  164  1e-39
UniRef50_D2S4L6 Appr-1-p processing domain protein n=4 Tax=Actin...  164  1e-39
UniRef50_A1L291 LOC799852 protein (Fragment) n=5 Tax=Danio rerio...  164  1e-39
UniRef50_C1SPD7 Predicted phosphatase similar to C-terminal doma...  164  1e-39
UniRef50_C7Z089 Putative uncharacterized protein n=2 Tax=Nectria...  163  2e-39
UniRef50_A4TAV6 Appr-1-p processing domain protein n=6 Tax=Actin...  163  3e-39
UniRef50_Q0CEI7 Putative uncharacterized protein n=1 Tax=Aspergi...  162  4e-39
UniRef50_A5D049 Predicted phosphatase n=4 Tax=Bacteria RepID=A5D...  162  4e-39
UniRef50_D0NNH8 Putative uncharacterized protein n=3 Tax=Phytoph...  162  4e-39
UniRef50_D0NR00 Putative uncharacterized protein n=1 Tax=Phytoph...  162  5e-39
UniRef50_Q4SK43 Chromosome 2 SCAF14570, whole genome shotgun seq...  162  6e-39
UniRef50_Q9NXN4 Ganglioside-induced differentiation-associated p...  161  8e-39
UniRef50_UPI000194CBC9 PREDICTED: similar to B aggressive lympho...  161  1e-38
UniRef50_A7HJC7 Appr-1-p processing domain protein n=1 Tax=Fervi...  161  1e-38
UniRef50_D0WKT6 Appr-1-p processing enzyme family domain protein...  160  2e-38
UniRef50_C3Y6H9 Putative uncharacterized protein n=1 Tax=Branchi...  160  2e-38
UniRef50_C3Y6H4 Putative uncharacterized protein n=1 Tax=Branchi...  160  3e-38
UniRef50_UPI00005A247A PREDICTED: similar to H2A histone family,...  159  4e-38
UniRef50_C3Y5X5 Putative uncharacterized protein n=3 Tax=Branchi...  158  6e-38
UniRef50_C7HUZ2 RNase III regulator YmdB n=2 Tax=Anaerococcus Re...  157  2e-37
UniRef50_Q0UG78 Putative uncharacterized protein n=1 Tax=Phaeosp...  156  3e-37
UniRef50_C3Y5Q2 Putative uncharacterized protein n=1 Tax=Branchi...  156  3e-37
UniRef50_Q8IXQ6 Poly [ADP-ribose] polymerase 9 n=27 Tax=Eutheria...  156  3e-37
UniRef50_A7S3X0 Predicted protein (Fragment) n=1 Tax=Nematostell...  156  3e-37
UniRef50_C3Y406 Putative uncharacterized protein n=2 Tax=Branchi...  156  4e-37
UniRef50_C9YUB3 Putative uncharacterized protein n=1 Tax=Strepto...  155  8e-37
UniRef50_A6LTB5 Appr-1-p processing domain protein n=3 Tax=Clost...  154  1e-36
UniRef50_A7T7L3 Predicted protein (Fragment) n=1 Tax=Nematostell...  154  1e-36
UniRef50_C8WJT1 Appr-1-p processing domain protein n=1 Tax=Egger...  153  2e-36
UniRef50_B7CC50 Putative uncharacterized protein n=1 Tax=Eubacte...  153  3e-36
UniRef50_B0P6L4 Putative uncharacterized protein n=1 Tax=Anaerot...  153  3e-36
UniRef50_A7EET2 Putative uncharacterized protein n=1 Tax=Sclerot...  153  3e-36
UniRef50_UPI000180B1B4 PREDICTED: similar to Poly [ADP-ribose] p...  153  4e-36
UniRef50_A6SR30 Putative uncharacterized protein n=1 Tax=Botryot...  152  5e-36
UniRef50_Q4RS18 Histone H2A (Fragment) n=2 Tax=Tetraodontidae Re...  152  5e-36
UniRef50_UPI0001C38755 appr-1-p processing domain-containing pro...  151  7e-36
UniRef50_B7PR73 Ganglioside induced differentiation associated p...  151  1e-35
UniRef50_Q55AK6 U box domain-containing protein n=2 Tax=Eukaryot...  150  2e-35
UniRef50_B2VUH2 MACRO domain containing protein 1 n=1 Tax=Pyreno...  150  3e-35
UniRef50_Q7JUR6 Protein GDAP2 homolog n=19 Tax=Neoptera RepID=GD...  149  3e-35
UniRef50_O67112 UPF0189 protein aq_987 n=4 Tax=cellular organism...  149  4e-35
UniRef50_B1L625 Appr-1-p processing domain protein n=1 Tax=Candi...  148  6e-35
UniRef50_A7C4X9 Putative uncharacterized protein n=1 Tax=Beggiat...  148  6e-35
UniRef50_UPI000196CD43 hypothetical protein CATMIT_02190 n=1 Tax...  148  8e-35
UniRef50_UPI000180BD0B PREDICTED: similar to Poly [ADP-ribose] p...  145  4e-34
UniRef50_UPI00006A1CA6 poly (ADP-ribose) polymerase family, memb...  145  5e-34
UniRef50_C3YS04 Putative uncharacterized protein (Fragment) n=1 ...  145  6e-34
UniRef50_Q54PT1 Protein GDAP2 homolog n=1 Tax=Dictyostelium disc...  145  7e-34
UniRef50_C3Y417 Putative uncharacterized protein (Fragment) n=1 ...  145  9e-34
UniRef50_C3YS03 Putative uncharacterized protein n=2 Tax=Branchi...  144  1e-33
UniRef50_UPI0000E8099B PREDICTED: similar to PARP9 protein n=2 T...  143  2e-33
UniRef50_UPI0000E4815A PREDICTED: similar to LRP16 protein n=1 T...  143  3e-33
UniRef50_C3Y5X1 Putative uncharacterized protein n=1 Tax=Branchi...  143  3e-33
UniRef50_Q2SM57 Predicted phosphatase n=1 Tax=Hahella chejuensis...  143  4e-33
UniRef50_O07733 UPF0189 protein Rv1899c/MT1950 n=16 Tax=Mycobact...  140  2e-32
UniRef50_Q9P0M6 Core histone macro-H2A.2 n=118 Tax=Eukaryota Rep...  140  2e-32
UniRef50_B0QWK9 Putative uncharacterized protein n=1 Tax=Haemoph...  139  5e-32
UniRef50_C3ZVW0 Putative uncharacterized protein n=1 Tax=Branchi...  138  7e-32
UniRef50_B7P925 Histone H2A n=1 Tax=Ixodes scapularis RepID=B7P9...  138  8e-32
UniRef50_B9L2D9 Appr-1-p processing enzyme family protein n=2 Ta...  138  9e-32
UniRef50_D1R847 Putative uncharacterized protein n=1 Tax=Parachl...  136  3e-31
UniRef50_D1B7G8 Appr-1-p processing domain protein n=1 Tax=Therm...  136  4e-31
UniRef50_Q4RG95 Chromosome 12 SCAF15104, whole genome shotgun se...  133  3e-30
UniRef50_Q5V4P3 Putative uncharacterized protein n=1 Tax=Haloarc...  133  3e-30
UniRef50_O28751 UPF0189 protein AF_1521 n=32 Tax=Euryarchaeota R...  133  3e-30
UniRef50_Q4T065 Chromosome undetermined SCAF11328, whole genome ...  132  5e-30
UniRef50_Q2ITR2 Appr-1-p processing n=1 Tax=Rhodopseudomonas pal...  132  6e-30
UniRef50_A0CX06 Chromosome undetermined scaffold_3, whole genome...  131  7e-30
UniRef50_B8HYS6 Appr-1-p processing domain protein n=1 Tax=Cyano...  129  3e-29
UniRef50_UPI000180BD0C PREDICTED: similar to Ci-Rhysin2/Deltex3-...  129  4e-29
UniRef50_Q460N3 Poly [ADP-ribose] polymerase 15 n=12 Tax=Eutheri...  129  4e-29
UniRef50_UPI00005A5611 PREDICTED: similar to poly (ADP-ribose) p...  129  5e-29
UniRef50_D2VM45 Poly ADP-ribose polymerase family, member 14-lik...  129  5e-29
UniRef50_B5Y5Y4 Appr-1-p processing enzyme family protein n=2 Ta...  128  6e-29
UniRef50_A2BJA7 A1pp, Appr-1-p processing enzyme n=1 Tax=Hyperth...  128  9e-29
UniRef50_A1R2V6 Putative uncharacterized protein n=1 Tax=Arthrob...  128  1e-28
UniRef50_B1H1M8 LOC100148704 protein (Fragment) n=5 Tax=Danio re...  126  2e-28
UniRef50_UPI0001BC8416 Appr-1-p processing domain protein n=1 Ta...  126  2e-28
UniRef50_UPI000180C4AC PREDICTED: similar to Poly [ADP-ribose] p...  126  3e-28
UniRef50_UPI0001C3795F Appr-1-p processing domain protein n=1 Ta...  125  6e-28
UniRef50_D2MH71 Metallo-beta-lactamase family protein n=1 Tax=Ca...  125  8e-28
UniRef50_UPI00016E2DD3 UPI00016E2DD3 related cluster n=3 Tax=Tak...  124  8e-28
UniRef50_A2DE53 Appr-1-p processing enzyme family protein n=1 Ta...  124  9e-28
UniRef50_C1XFR0 Predicted phosphatase similar to C-terminal doma...  123  2e-27
UniRef50_UPI000180B63C PREDICTED: similar to Ci-Rhysin2/Deltex3-...  123  2e-27
UniRef50_UPI00006CE511 hypothetical protein TTHERM_00141050 n=1 ...  121  8e-27
UniRef50_UPI00006A2286 UPI00006A2286 related cluster n=1 Tax=Xen...  121  1e-26
UniRef50_UPI000180D216 PREDICTED: similar to Poly [ADP-ribose] p...  121  1e-26

Sequences not found previously or not previously below threshold:

>UniRef50_P0A8D8 UPF0189 protein ymdB n=66 Tax=Proteobacteria RepID=YMDB_ECO57
Length = 177

Score = 267 bits (682), Expect = 2e-70, Method: Composition-based stats.
Identities = 177/177 (100%), Positives = 177/177 (100%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC Sbjct: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA Sbjct: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE Sbjct: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 >UniRef50_P67342 UPF0189 protein ymdB n=62 Tax=Bacteria RepID=YMDB_SALTI Length = 179 Score = 255 bits (652), Expect = 5e-67, Method: Composition-based stats. Identities = 135/177 (76%), Positives = 153/177 (86%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +R+ V+QGDIT+L+VD IVNAAN SLMGGGGVDGAIHRAAGPALLDAC +RQQQG+C Sbjct: 1 MTSRLQVIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TGHAVIT AG L AKAV+HTVGPVWRGGE E +LL++AY N L L AN + S+AFPA Sbjct: 61 QTGHAVITPAGKLSAKAVIHTVGPVWRGGEHQEAELLEEAYRNCLLLAEANHFRSIAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTGVYGYPRA AAE+AV+TVS+FITR+ALPEQVYFVCYDEE A LY RLLTQQGD+ Sbjct: 121 ISTGVYGYPRAQAAEVAVRTVSDFITRYALPEQVYFVCYDEETARLYARLLTQQGDD 177 >UniRef50_A4W960 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=A4W960_ENT38 Length = 180 Score = 250 bits (638), Expect = 2e-65, Method: Composition-based stats. 
Identities = 136/175 (77%), Positives = 152/175 (86%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I VV GDIT + VDVIVNAANPSLMGGGGVDGAIHRAAGP LL+AC VRQQQG+C Sbjct: 1 MKPQIEVVVGDITTMEVDVIVNAANPSLMGGGGVDGAIHRAAGPQLLEACKTVRQQQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 GHAVIT+AGDLPAKAV+H VGPVW+GGE +E + LQDAYLN LRL AAN Y ++AFPA Sbjct: 61 APGHAVITIAGDLPAKAVIHAVGPVWQGGENHEARTLQDAYLNCLRLAAANGYKTLAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 ISTGVYGYP+AAAAEIAV TVSEF+TR LPE+VYFVCYDEENA LY+RLL Q+G Sbjct: 121 ISTGVYGYPKAAAAEIAVDTVSEFLTRKPLPERVYFVCYDEENAQLYQRLLIQRG 175 >UniRef50_Q8EYT0 UPF0189 protein LA_4133 n=9 Tax=cellular organisms RepID=Y4133_LEPIN Length = 175 Score = 243 bits (621), Expect = 2e-63, Method: Composition-based stats. Identities = 93/172 (54%), Positives = 129/172 (75%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +I +++ DIT+L VD IVNAAN SL+GGGGVDGAIHRA GP +L+ C K+R++QG+C Sbjct: 1 MNNKIKLIKEDITQLEVDAIVNAANSSLLGGGGVDGAIHRAGGPEILEECYKIREKQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AVIT AG L AK ++HTVGP+W GG +NED+LL +AY NSL L +S ++AFP Sbjct: 61 KVGEAVITTAGRLNAKFIIHTVGPIWSGGNKNEDELLSNAYKNSLLLAKNHSLKTIAFPN 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTG+Y +P+ AA+IA+++V+EF+ + + V+FVC+D EN +Y +LL Sbjct: 121 ISTGIYHFPKERAAKIAIQSVTEFLKQDNQIQTVFFVCFDFENLEIYNKLLQ 172 >UniRef50_D1N530 Appr-1-p processing domain protein n=4 Tax=cellular organisms RepID=D1N530_9BACT Length = 164 Score = 232 bits (591), Expect = 6e-60, Method: Composition-based stats. 
Identities = 96/169 (56%), Positives = 117/169 (69%), Gaps = 6/169 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +VQ DIT+L D IVNAAN SL+GGGGVDGAIHRAAGP LL+AC K CPTG Sbjct: 2 KIQIVQDDITRLRADAIVNAANSSLLGGGGVDGAIHRAAGPELLEACRKFN----GCPTG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT L A+ V+HT GPVW GG E +LL+ Y NSLRL AAN S+AFPAIST Sbjct: 58 EARITPGFRLAARFVIHTPGPVWHGGTHGEAELLEACYRNSLRLAAANGCRSIAFPAIST 117 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVY YP+A AA+IA++TV ++ R LPE+V F C+ + +Y+ LL Sbjct: 118 GVYRYPKAEAAQIALRTVRQW--REPLPEEVIFCCFSAADLDVYQELLK 164 >UniRef50_Q8PHB6 UPF0189 protein XAC3343 n=14 Tax=Proteobacteria RepID=Y3343_XANAC Length = 179 Score = 229 bits (584), Expect = 3e-59, Method: Composition-based stats. Identities = 90/173 (52%), Positives = 110/173 (63%), Gaps = 2/173 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 RI V QGDIT+L VDVIVNAAN SL+GGGGVDGAIHRAAGP LL+AC + Q + CP Sbjct: 2 RIEVWQGDITELDVDVIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPQVRPGVRCP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG IT DL A+ + HTVGPVWR G NE + L + Y SL+L S+AFPAI Sbjct: 62 TGEIRITDGFDLKARHIFHTVGPVWRDGRHNEPEQLANCYWQSLKLAEQMMLHSIAFPAI 121 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 S G+YGYP AA IAV ++ H +P+ + V Y+E Y++ L Q Sbjct: 122 SCGIYGYPLHQAARIAVTETRDWQRSHKVPKHIVLVAYNEATYKAYQQALATQ 174 >UniRef50_C1PFE0 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=C1PFE0_BACCO Length = 188 Score = 228 bits (583), Expect = 4e-59, Method: Composition-based stats. 
Identities = 88/169 (52%), Positives = 115/169 (68%), Gaps = 4/169 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +V GDITK+ D IVNAAN +L+GGGGVDGAIHRAAGP LL+ C K+ CPTG Sbjct: 4 FKIVLGDITKVKTDAIVNAANTTLLGGGGVDGAIHRAAGPELLEECRKLN----GCPTGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T LPAK V+HT GPVW+GG +E +LL+++Y NSLRL + +VAFP+ISTG Sbjct: 60 AKMTKGYRLPAKYVIHTPGPVWQGGGHHEAELLENSYQNSLRLAESKGLRTVAFPSISTG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 VY +P AAA IAV+T+ F+ ++V+ VC+DE YE+ T+ Sbjct: 120 VYHFPLDAAARIAVRTICTFLETSDSVQEVWMVCFDERTKQAYEKAATE 168 >UniRef50_Q8KAE4 UPF0189 protein CT2219 n=7 Tax=Bacteria RepID=Y2219_CHLTE Length = 172 Score = 228 bits (582), Expect = 5e-59, Method: Composition-based stats. Identities = 87/171 (50%), Positives = 110/171 (64%), Gaps = 4/171 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 IH ++ DIT L VD IVNAAN SL+GGGGVDGAIHRAAGP LL+AC ++ G C Sbjct: 4 NVLIHAIKADITSLTVDAIVNAANTSLLGGGGVDGAIHRAAGPKLLEACREL----GGCL 59 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG A IT LPA V+HTVGPVW GG E +LL Y NSL+L + ++AFP+I Sbjct: 60 TGEAKITKGYRLPATFVIHTVGPVWHGGNHGEAELLASCYRNSLKLAIEHHCRTIAFPSI 119 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 STG+YGYP AA IA+ TV E + E+V F C+ + + +Y++ L Sbjct: 120 STGIYGYPVEQAAAIAITTVREMLADERGIEKVIFCCFSDRDLDVYQKALA 170 >UniRef50_Q8RB30 UPF0189 protein TTE0995 n=12 Tax=Bacteria RepID=Y995_THETN Length = 175 Score = 228 bits (582), Expect = 7e-59, Method: Composition-based stats. 
Identities = 84/173 (48%), Positives = 121/173 (69%), Gaps = 1/173 (0%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I +++G+I VD IVNAAN SL+GGGGVDGAIH+A GPA+ + +R++QG C Sbjct: 1 MKEKIKLIKGNIVDQEVDAIVNAANSSLIGGGGVDGAIHKAGGPAIAEELKVIREKQGGC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTGHAVIT AG+L AK V+H VGP+W+GG NED LL AY+ SL+L + ++AFP+ Sbjct: 61 PTGHAVITGAGNLKAKYVIHAVGPIWKGGNHNEDNLLASAYIESLKLADEYNVKTIAFPS 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTG YG+P AA IA++ VS+++ + ++V FV + + + +Y + + Sbjct: 121 ISTGAYGFPVERAARIALRVVSDYLE-GSSIKEVRFVLFSDRDYEVYSKAYEE 172 >UniRef50_C7NE27 Appr-1-p processing domain protein n=2 Tax=Leptotrichia RepID=C7NE27_LEPBD Length = 187 Score = 227 bits (580), Expect = 9e-59, Method: Composition-based stats. Identities = 87/180 (48%), Positives = 125/180 (69%), Gaps = 5/180 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +K RI +V+GDIT+ DVIVNAAN SL+GG GVDGAIHR G + + C+K+R QG C Sbjct: 6 LKNRIVLVKGDITEYPADVIVNAANSSLLGGSGVDGAIHRKGGKEITEDCMKIRASQGKC 65 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-----DQLLQDAYLNSLRLVAANSYTS 115 G AVIT AG++ K V+HTVGPVW+ G+ NE ++LL++AY++SL L N + Sbjct: 66 NIGEAVITRAGNMSFKNVIHTVGPVWQSGKNNEAKLFAEKLLKNAYISSLELAEKNKLKN 125 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 ++FP ISTGVY +P+ AA+ A+ V E++ ++ E+V FVC++ EN +Y +LL ++G Sbjct: 126 ISFPNISTGVYRFPKDLAAKTAINAVIEYLEKNDFIEKVNFVCFENENFEIYRKLLEEKG 185 >UniRef50_Q8Q0F9 UPF0189 protein MM_0177 n=18 Tax=cellular organisms RepID=Y177_METMA Length = 187 Score = 227 bits (579), Expect = 1e-58, Method: Composition-based stats. 
Identities = 90/171 (52%), Positives = 117/171 (68%), Gaps = 4/171 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI + +GDI K+ VD IVNAAN +L+GGGGVDGAIHRAAGPALL+ C + CPT Sbjct: 19 DRIRIFEGDIVKMRVDAIVNAANNTLLGGGGVDGAIHRAAGPALLEECKTLN----GCPT 74 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPAK ++HTVGPVW+GGE+ ED+LL Y SL L ++AFPAIS Sbjct: 75 GEAKITSGYLLPAKYIIHTVGPVWQGGEKGEDELLASCYRKSLELARDYKIKTIAFPAIS 134 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 TG YG+P AA IAV V EF+ ++ +PE VY VCY++++ ++ L++ Sbjct: 135 TGAYGFPSERAAGIAVSQVKEFLQKNEIPETVYLVCYNKDSCKSIKKALSK 185 >UniRef50_D1AKA5 Appr-1-p processing domain protein n=2 Tax=Bacteria RepID=D1AKA5_SEBTE Length = 180 Score = 226 bits (577), Expect = 2e-58, Method: Composition-based stats. Identities = 88/175 (50%), Positives = 111/175 (63%), Gaps = 2/175 (1%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 T + GDITK+ DVIVNAAN SL+GGGGVDGAIHR GP +LD C K+ +QG CP Sbjct: 5 NTELRCENGDITKVKTDVIVNAANSSLLGGGGVDGAIHRTGGPLILDECRKIVDRQGSCP 64 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT G LPAK V+HTVGPVW G+ NE++ L+ Y NSL++ S+AF I Sbjct: 65 VGEAVITTGGKLPAKFVIHTVGPVWSYGKNNEEEKLRKCYRNSLKIAEDKQLESIAFSNI 124 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERLLTQQ 174 STG YG+P+ A A+ V ++ T +V FVC D+EN +YE LL + Sbjct: 125 STGTYGFPKETAGRAALDEVKKYFIQTPDTTIREVVFVCLDDENFEIYEELLESE 179 >UniRef50_Q3AEI4 Putative uncharacterized protein n=6 Tax=Bacteria RepID=Q3AEI4_CARHZ Length = 181 Score = 226 bits (577), Expect = 2e-58, Method: Composition-based stats. 
Identities = 85/171 (49%), Positives = 113/171 (66%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 ++I + GDITK VD IVNAAN L GGGGVDGAIHRA GP +++ C ++ + G P Sbjct: 7 NSKIILKLGDITKEKVDAIVNAANSRLAGGGGVDGAIHRAGGPKIMEECREIINKIGVLP 66 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T AG+LPAK V+HTVGP++RGG++ E+ L++AYLNSL+L + ++AFP+I Sbjct: 67 PGEAVATTAGNLPAKYVIHTVGPIYRGGQKGEENTLRNAYLNSLKLAKQLNVKTIAFPSI 126 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 STG YGYP AA +A+K V EF+ V FV +DE Y+ L Sbjct: 127 STGAYGYPVKDAARVALKAVIEFLEGEPEDFTVVFVLFDEITYAAYQEALE 177 >UniRef50_Q9WYX8 UPF0189 protein TM_0508 n=15 Tax=cellular organisms RepID=Y508_THEMA Length = 599 Score = 220 bits (561), Expect = 2e-56, Method: Composition-based stats. Identities = 76/171 (44%), Positives = 104/171 (60%), Gaps = 2/171 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +V+GDIT+ VD IVNAAN L GGGV GAI RA G + + ++ Q++G PTG Sbjct: 428 KIRIVKGDITREEVDAIVNAANEYLKHGGGVAGAIVRAGGSVIQEESDRIVQERGRVPTG 487 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T AG L AK V+HTVGPVWRGG ED+LL A N+L S++ PAIST Sbjct: 488 EAVVTSAGKLKAKYVIHTVGPVWRGGSHGEDELLYKAVYNALLRAHELKLKSISMPAIST 547 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLT 172 G++G+P+ A I K + +FI +H E++ DEE ++E + Sbjct: 548 GIFGFPKERAVGIFSKAIRDFIDQHPDTTLEEIRICNIDEETTKIFEEKFS 598 >UniRef50_C1H4Y3 MACRO domain-containing protein n=4 Tax=cellular organisms RepID=C1H4Y3_PARBA Length = 334 Score = 220 bits (560), Expect = 2e-56, Method: Composition-based stats. 
Identities = 80/172 (46%), Positives = 111/172 (64%), Gaps = 6/172 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + I ++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAG L C + G C Sbjct: 38 LNNSICLITSDITKLEVDCIVNAANKSLLGGGGVDGAIHRAAGRGLWQECRSL----GGC 93 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT A +LP + V+H VGP++ +++ + LL+ Y+ SL + A N S+AF + Sbjct: 94 MTGDAKITNAYNLPCRKVIHAVGPMYW-ADEDRESLLRSCYMRSLTIAAENGLKSIAFSS 152 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLL 171 ISTGVYGYP + AAE+A++ V F+ R + PE+V F ++ ++ + Y LL Sbjct: 153 ISTGVYGYPSSKAAEVAIRAVKHFLEARSSPPERVIFCTFEPKDVNAYRALL 204 >UniRef50_A8ZUR5 Appr-1-p processing domain protein n=3 Tax=cellular organisms RepID=A8ZUR5_DESOH Length = 195 Score = 220 bits (560), Expect = 2e-56, Method: Composition-based stats. Identities = 86/170 (50%), Positives = 103/170 (60%), Gaps = 4/170 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +R+ V QGDIT L VD IVNAAN +L+GGGGVDGAIHRAAGP LL C + G C T Sbjct: 27 SRLKVWQGDITTLEVDAIVNAANKTLLGGGGVDGAIHRAAGPELLAECKTL----GGCDT 82 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPAK V+HTVGPV+ +LL Y NSL+L SVAFPA+S Sbjct: 83 GQAKITRGYRLPAKFVIHTVGPVYSRSNPGVAKLLAGCYTNSLKLAKDQGLASVAFPAVS 142 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVYGYP A IA+ TV +F+ EQV F + + +YE L+ Sbjct: 143 CGVYGYPMKEACRIALDTVCDFLETDRTIEQVIFALFSADAVRVYEGYLS 192 >UniRef50_Q8TQD0 UPF0189 protein MA_1614 n=2 Tax=cellular organisms RepID=Y1614_METAC Length = 195 Score = 219 bits (559), Expect = 3e-56, Method: Composition-based stats. 
Identities = 86/172 (50%), Positives = 110/172 (63%), Gaps = 4/172 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 RI +++ DIT+L VD IVNAAN +L+GGGGVDGAIHRAAGP LL+ C + CP Sbjct: 26 SERIRIIERDITELKVDAIVNAANNTLLGGGGVDGAIHRAAGPGLLEECRTLN----GCP 81 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG A IT LPAK V+HTVGP+W+ G + ED+ L Y SL L ++AFP I Sbjct: 82 TGEAKITKGYLLPAKYVIHTVGPIWQEGTKGEDEFLASCYRKSLELARKYDVKTIAFPTI 141 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 STG YG+P AA IAV V EF+ + LPE V+ VCY++E ++ L + Sbjct: 142 STGAYGFPSERAARIAVSQVKEFLKVNELPEIVFLVCYNKEACKNIKKALEE 193 >UniRef50_Q047N9 Predicted phosphatase, histone macroH2A1 family n=3 Tax=Bacteria RepID=Q047N9_LACDB Length = 166 Score = 218 bits (555), Expect = 7e-56, Method: Composition-based stats. Identities = 87/168 (51%), Positives = 107/168 (63%), Gaps = 6/168 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + QGDIT L VD IVNAAN L GGGGVDGAIHRAAGP L +AC + G C TG Sbjct: 3 LEIWQGDITTLKVDAIVNAANRELRGGGGVDGAIHRAAGPKLNEACRAL----GSCETGE 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A IT +LPAK ++HTVGPV+ G ++ LL Y NSLR+ N SVAF AISTG Sbjct: 59 AKITPGFNLPAKYIIHTVGPVY-SGSHSDPLLLAACYRNSLRVAKENGLHSVAFSAISTG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLL 171 VYGYP AA+++A V +++ H E +V V YD LY++LL Sbjct: 118 VYGYPLDAASKVAFGEVRKWLREHKDYEMRVIMVAYDARTYALYQKLL 165 >UniRef50_Q0CQJ0 Protein LRP16 n=10 Tax=cellular organisms RepID=Q0CQJ0_ASPTN Length = 344 Score = 218 bits (555), Expect = 8e-56, Method: Composition-based stats. 
Identities = 83/179 (46%), Positives = 105/179 (58%), Gaps = 13/179 (7%) Query: 1 MKTRIHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + RI +++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAGP L+ C + G Sbjct: 37 LNDRISLIRHDITKLLDVDCIVNAANSSLLGGGGVDGAIHRAAGPGLVRECRTL----GG 92 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVW----RGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 C TG A T A DLP + V+HTVGP++ + G +QLL+ Y L L N S Sbjct: 93 CATGDAKTTAAYDLPCRWVIHTVGPIYPVERQKGAARPEQLLRSCYRRCLELAVRNKARS 152 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA----LPEQVYFVCYDEENAHLYERL 170 +AFPAISTGVY YP+ AA IA+ F+ E+V F ++EE+ YE Sbjct: 153 IAFPAISTGVYAYPKRRAARIALDETRAFLESEGTDIVTLEKVVFCNFEEEDQRAYEEA 211 >UniRef50_A7RJ44 Predicted protein (Fragment) n=4 Tax=Eukaryota RepID=A7RJ44_NEMVE Length = 183 Score = 217 bits (553), Expect = 2e-55, Method: Composition-based stats. Identities = 75/174 (43%), Positives = 104/174 (59%), Gaps = 12/174 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L +D IVNAAN +L+GGGGVDG IHRAAG L C K+R C Sbjct: 5 LNDKVSLWTGDITALEIDAIVNAANTTLLGGGGVDGCIHRAAGDNLFKECRKLR----GC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A ITL LPAK V+HT GP+ + LQD Y N L+L + ++AF Sbjct: 61 QTGEAKITLGHRLPAKYVIHTAGPM-----GKNRKKLQDCYKNCLQLAKQHGVKTLAFCC 115 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLL 171 ISTG+YGYP AA +A++TV +++ + E++ F + ++ +YERLL Sbjct: 116 ISTGIYGYPNKDAAHVALETVRQWLETDDNNDSVERIVFCTFLPKDTEIYERLL 169 >UniRef50_Q9HXU7 UPF0189 protein PA3693 n=16 Tax=Bacteria RepID=Y3693_PSEAE Length = 173 Score = 217 bits (552), Expect = 2e-55, Method: Composition-based stats. 
Identities = 91/172 (52%), Positives = 108/172 (62%), Gaps = 4/172 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + V QGDIT+LAVD IVNAAN SL+GGGGVDGAIHRAAG L+ AC + C T Sbjct: 2 TEVRVWQGDITRLAVDAIVNAANSSLLGGGGVDGAIHRAAGAELVAACRLLH----GCKT 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPA V+HTVGPVWRGG+ E +LL Y SL L SVAFPAIS Sbjct: 58 GEAKITRGFRLPAAHVIHTVGPVWRGGDNGEAELLASCYRRSLALAEQAGAASVAFPAIS 117 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 G+YGYP AA IAV+ V H+ E++ V +D A Y+RLL ++ Sbjct: 118 CGIYGYPLEQAAAIAVEEVCRQRPAHSSLEEIVLVAFDSSMAERYQRLLGER 169 >UniRef50_C8NAC1 RNase III regulator YmdB n=3 Tax=cellular organisms RepID=C8NAC1_9GAMM Length = 165 Score = 217 bits (552), Expect = 2e-55, Method: Composition-based stats. Identities = 89/167 (53%), Positives = 108/167 (64%), Gaps = 4/167 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V DIT LAVD IVNAAN SL+GG GVDGAIHRAAG L+ C + G C G Sbjct: 3 LEVQVADITTLAVDAIVNAANESLLGGSGVDGAIHRAAGKELVAECRTL----GGCKVGE 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T LPA+ V+HTVGPVW GG+ E + L +AY NSLRL A+ TS+AFPAISTG Sbjct: 59 AKLTRGYRLPARFVIHTVGPVWYGGDDGEAEALANAYANSLRLAEAHELTSIAFPAISTG 118 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 V+GYP+ AA IA+ TV + +V F C+ E +A LY RLL Sbjct: 119 VFGYPKEDAARIAIDTVRATLKECPHMARVIFCCFSERDAALYRRLL 165 >UniRef50_C7RS37 Appr-1-p processing domain protein n=15 Tax=cellular organisms RepID=C7RS37_9PROT Length = 197 Score = 216 bits (551), Expect = 2e-55, Method: Composition-based stats. 
Identities = 89/173 (51%), Positives = 108/173 (62%), Gaps = 4/173 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M T + + DIT LAVD IVNAAN SL+GGGGVDGAIHRAAGP LL C + G C Sbjct: 26 MSTMLRAICADITTLAVDAIVNAANSSLLGGGGVDGAIHRAAGPGLLAECRLL----GGC 81 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T A LPA+ ++HTVGPVW GG E Q L Y SL L AN ++A P+ Sbjct: 82 PTGEARLTHAHRLPARYIIHTVGPVWHGGGSGEAQRLASCYRCSLELAVANDLVTLAIPS 141 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTG+YGYP AAE+AV TV + +V F C+ + +YERLL + Sbjct: 142 ISTGIYGYPIEQAAEVAVSTVRASVRELGRLREVVFCCFSPGDLRVYERLLGE 194 >UniRef50_Q6PHJ5 Zgc:65960 n=11 Tax=cellular organisms RepID=Q6PHJ5_DANRE Length = 452 Score = 216 bits (551), Expect = 2e-55, Method: Composition-based stats. Identities = 83/174 (47%), Positives = 114/174 (65%), Gaps = 6/174 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +GDIT L +D IVNAAN SL+GGGGVDG IHRAAG L + C + C Sbjct: 59 LADKVSLYKGDITILEIDAIVNAANSSLLGGGGVDGCIHRAAGHLLYEECHSLN----GC 114 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT DLPAK V+HTVGP+ RG Q++ L+ Y +SL+L+ N+ SVAFP Sbjct: 115 DTGKAKITCGYDLPAKYVIHTVGPIARGNVGQSQRDDLESCYYSSLKLMKDNNLRSVAFP 174 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLT 172 ISTG+YG+P AAEIA+KTV E+I +H ++V F + E + +Y+R ++ Sbjct: 175 CISTGIYGFPNEPAAEIALKTVQEWIEKHQDEIDRVIFCVFLETDYEIYKRKMS 228 >UniRef50_A1Z1Q3 MACRO domain-containing protein 2 n=55 Tax=cellular organisms RepID=MACD2_HUMAN Length = 448 Score = 216 bits (550), Expect = 3e-55, Method: Composition-based stats. 
Identities = 81/175 (46%), Positives = 111/175 (63%), Gaps = 6/175 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +GDIT L VD IVNAAN SL+GGGGVDG IHRAAGP LL C + C Sbjct: 68 LTEKVSLYRGDITLLEVDAIVNAANASLLGGGGVDGCIHRAAGPCLLAECRNLN----GC 123 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFP 119 TGHA IT DLPAK V+HTVGP+ RG + L + Y +SL+LV N+ SVAFP Sbjct: 124 DTGHAKITCGYDLPAKYVIHTVGPIARGHINGSHKEDLANCYKSSLKLVKENNIRSVAFP 183 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTG+YG+P AA IA+ T+ E++ H +++ F + E + +Y++ + + Sbjct: 184 CISTGIYGFPNEPAAVIALNTIKEWLAKNHHEVDRIIFCVFLEVDFKIYKKKMNE 238 >UniRef50_C1BR35 MACRO domain-containing protein 1 n=2 Tax=Caligus rogercresseyi RepID=C1BR35_9MAXI Length = 242 Score = 214 bits (545), Expect = 1e-54, Method: Composition-based stats. Identities = 77/174 (44%), Positives = 102/174 (58%), Gaps = 10/174 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +I + QGDITKL VD IVNAAN L GGGV GAIHRAAG L C + G CP Sbjct: 78 SPKIGMWQGDITKLEVDAIVNAANSGLKAGGGVCGAIHRAAGSQLQKECDSI----GGCP 133 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G + IT LPAK V+HTVGP + E L+ Y S+ L+ A S+AFP I Sbjct: 134 VGDSRITAGYKLPAKHVIHTVGPQDKNSEH-----LKSCYRKSMELLIAKGLRSIAFPCI 188 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQ 174 STG+YGYP AAE+A++T+ FI ++ + V F + +++ Y LL+++ Sbjct: 189 STGIYGYPSDKAAEVALQTIRSFIQDNSESVDSVIFCVFLDKDMQYYSELLSKK 242 >UniRef50_A5GC80 Appr-1-p processing domain protein n=2 Tax=Desulfuromonadales RepID=A5GC80_GEOUR Length = 172 Score = 214 bits (545), Expect = 1e-54, Method: Composition-based stats. 
Identities = 85/168 (50%), Positives = 107/168 (63%), Gaps = 4/168 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I ++QGDIT+LAVD IVNAAN +L+GGGGVDGAIHRAAGP L+ C + G C Sbjct: 1 MKGKIEIIQGDITRLAVDAIVNAANNTLLGGGGVDGAIHRAAGPDLVAECSTL----GGC 56 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT LPAK V+HTVGPVW GG + E +LL+ AY + A+ S+AFPA Sbjct: 57 ETGDAKITKGYKLPAKHVIHTVGPVWHGGSKGEPELLRKAYRRCFEVAHASKLKSIAFPA 116 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYE 168 IS GVYGYP A EIA+ + + E+V FV + +Y+ Sbjct: 117 ISAGVYGYPMDQACEIAMVEAKAALEKFPELERVIFVPFSPGALAIYQ 164 >UniRef50_Q1HPZ5 LRP16 protein n=1 Tax=Bombyx mori RepID=Q1HPZ5_BOMMO Length = 275 Score = 213 bits (544), Expect = 1e-54, Method: Composition-based stats. Identities = 70/172 (40%), Positives = 98/172 (56%), Gaps = 9/172 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ + +GDITKL +D +VNAAN L GGGVDGAIHRAAGP L C + G C Sbjct: 107 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGC 162 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T +LPAK ++HTVGP + + L+ Y L S+AFP Sbjct: 163 PTGDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPC 217 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTG+YG+P AA IA++T +F+ + ++ F + + +YE L+ Sbjct: 218 ISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQ 269 >UniRef50_C6RT62 Appr-1-p processing n=2 Tax=Acinetobacter radioresistens RepID=C6RT62_ACIRA Length = 186 Score = 213 bits (543), Expect = 2e-54, Method: Composition-based stats. 
Identities = 81/171 (47%), Positives = 112/171 (65%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + ++ GDIT + +D IVNAAN +L+GG GVDGAIH+A GP +++ C ++R +QG C G Sbjct: 3 QFRLIHGDITGIRIDAIVNAANSTLLGGHGVDGAIHQAGGPDIIEECRQIRARQGSCTVG 62 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T G LPA+ V+HTVGP+W G+ NE LL AY NS L + T +A+P IST Sbjct: 63 EAVMTTGGRLPAQYVIHTVGPIWEEGKANERTLLSQAYQNSFALAEQHYLTGIAYPNIST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 GVY +P+ AA IA+ T+ + ++V VC+D EN LYE LL Q+ Sbjct: 123 GVYRFPKVEAAAIAIDTLIPLLKNSETVQEVALVCFDLENFELYEELLKQR 173 >UniRef50_A4R3Q9 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4R3Q9_MAGGR Length = 263 Score = 213 bits (543), Expect = 2e-54, Method: Composition-based stats. Identities = 80/173 (46%), Positives = 100/173 (57%), Gaps = 7/173 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 RI + GDITKL VD IVNAAN +L+GGGGVDG+IHRAAG LL C + C Sbjct: 61 NDRIALYHGDITKLMVDAIVNAANETLLGGGGVDGSIHRAAGGGLLRECRTL----DGCD 116 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPA 120 TG A +T A DLP K V+H VGPV+ + E + LL Y SL L N S+AFPA Sbjct: 117 TGDAKVTDAYDLPCKKVIHAVGPVYNERHREECEMLLSSCYTRSLELAVENGCRSIAFPA 176 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 ISTG+YGYP AA A+ V +F+ V F C+ +++ +Y L Sbjct: 177 ISTGIYGYPSRRAANAAITAVRKFLESDQGDKISLVVFCCFLQKDMEIYTDKL 229 >UniRef50_Q66HV6 Zgc:92353 n=1 Tax=Danio rerio RepID=Q66HV6_DANRE Length = 248 Score = 212 bits (541), Expect = 3e-54, Method: Composition-based stats. 
Identities = 75/173 (43%), Positives = 106/173 (61%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDITKL +D + NAAN +L+GGGGVDGAIHR AGP L C + C Sbjct: 66 LNMKVSLFGGDITKLEIDAVANAANKTLLGGGGVDGAIHRGAGPLLRKECATLN----GC 121 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT A LPA+ V+HTVGP+ + E++ L++ Y N L + +VAFP Sbjct: 122 ETGEAKITGAYGLPARYVIHTVGPIVHDSVGEREEEALRNCYYNCLHTATKHHLRTVAFP 181 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 ISTGVYGYP A E+A+KTV +++ ++ ++V F + + + LYE LL Sbjct: 182 CISTGVYGYPPDQAVEVALKTVRDYLEQNPEKLDRVIFCVFLKSDKQLYENLL 234 >UniRef50_Q8Y2K1 UPF0189 protein RSc0334 n=39 Tax=cellular organisms RepID=Y334_RALSO Length = 171 Score = 211 bits (538), Expect = 8e-54, Method: Composition-based stats. Identities = 85/170 (50%), Positives = 107/170 (62%), Gaps = 7/170 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + ++ DIT LA D IVNAAN +L+GGGGVDGAIHRAAGP LL+AC + C TG Sbjct: 7 TLRALRADITTLACDAIVNAANSALLGGGGVDGAIHRAAGPELLEACRALH----GCRTG 62 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT LPA+ ++HTVGP+WRGG Q+E LL Y NSL L + ++AFP IST Sbjct: 63 QAKITPGFLLPARYIIHTVGPIWRGGRQDEAALLAACYRNSLALAKQHDVRTIAFPCIST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 GVYG+P AA IAV+TV E A + + F C+ + LYE L + Sbjct: 123 GVYGFPPQLAAPIAVRTVRE---HGADLDDIVFCCFSAADLALYETALNE 169 >UniRef50_Q9BQ69 MACRO domain-containing protein 1 n=11 Tax=Tetrapoda RepID=MACD1_HUMAN Length = 325 Score = 210 bits (535), Expect = 2e-53, Method: Composition-based stats. 
Identities = 80/173 (46%), Positives = 107/173 (61%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I +++ DITKL VD IVNAAN SL+GGGGVDG IHRAAGP L D C ++ C Sbjct: 150 LNEKISLLRSDITKLEVDAIVNAANSSLLGGGGVDGCIHRAAGPLLTDECRTLQS----C 205 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSYTSVAFP 119 TG A IT LPAK V+HTVGP+ G L+ YL+SL L+ + SVAFP Sbjct: 206 KTGKAKITGGYRLPAKYVIHTVGPIAYGEPSASQAAELRSCYLSSLDLLLEHRLRSVAFP 265 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 ISTGV+GYP AAAEI + T+ E++ +H +++ + E++ +Y L Sbjct: 266 CISTGVFGYPCEAAAEIVLATLREWLEQHKDKVDRLIICVFLEKDEDIYRSRL 318 >UniRef50_C6BB95 Appr-1-p processing domain protein n=4 Tax=cellular organisms RepID=C6BB95_RALP1 Length = 171 Score = 210 bits (535), Expect = 2e-53, Method: Composition-based stats. Identities = 83/170 (48%), Positives = 103/170 (60%), Gaps = 7/170 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++GDIT L D IVNAAN SL+GGGGVDGAIHRAAGP LL+AC + C TG Sbjct: 8 LRALRGDITTLDCDAIVNAANSSLLGGGGVDGAIHRAAGPELLEACRALH----GCRTGE 63 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T L A+ V+HTVGP+WRGG Q+E LL Y NSL L S+AFP ISTG Sbjct: 64 AKLTPGFQLTARYVIHTVGPIWRGGRQDEAALLAACYRNSLELACKYEVRSIAFPCISTG 123 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 +YG+P AA IAV+ E + E + F C+ + LYE L + Sbjct: 124 IYGFPPQLAAPIAVRAARE---HGSRFETITFCCFSAADLILYEAALGNR 170 >UniRef50_Q71W03 UPF0189 protein LMOf2365_2748 n=23 Tax=Bacteria RepID=Y2748_LISMF Length = 176 Score = 210 bits (535), Expect = 2e-53, Method: Composition-based stats. 
Identities = 92/173 (53%), Positives = 116/173 (67%), Gaps = 2/173 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I VV+GDIT+ VDVIVNAANP L+GGGGVDGAIH+AAGP LL C +V + G CP G Sbjct: 2 EITVVKGDITEQDVDVIVNAANPGLLGGGGVDGAIHQAAGPDLLKECQEVINRIGTCPAG 61 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT AGDL A ++H VGP+W+ GE E L Y +L L A TS+AFP IST Sbjct: 62 EAVITSAGDLQASYIIHAVGPIWKDGEHQEANKLASCYWKALDLAAGKELTSIAFPNIST 121 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQ 174 GVYG+P+ AAE+A+ TV ++ E++ FVC+DEEN LY +L+ + Sbjct: 122 GVYGFPKKLAAEVALYTVRKWAEEEYDTSIEEIRFVCFDEENLKLYNKLINSE 174 >UniRef50_A6BCW6 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A6BCW6_9FIRM Length = 267 Score = 209 bits (533), Expect = 3e-53, Method: Composition-based stats. Identities = 71/177 (40%), Positives = 109/177 (61%), Gaps = 8/177 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 +I + +GDIT+L+VD IVNAAN ++G G +D AIH AAG L + C ++ + Q Sbjct: 92 DKISLWRGDITRLSVDAIVNAANSQMLGCFVPCHGCIDNAIHSAAGIQLRNECAQIMEAQ 151 Query: 58 GDC-PTGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 G PTG A IT +LPAK V+HTVGP+ + +++ L+ YLN ++L S Sbjct: 152 GHEEPTGKAKITKGYNLPAKHVIHTVGPIVGMQVTEKQEEELKSCYLNCMKLAEKEGLKS 211 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 +AF ISTG + +P AAEIAVKTV ++++ + E+V F + EE+ ++Y+++ Sbjct: 212 IAFCCISTGEFHFPNKLAAEIAVKTVDKYLS-SSKLERVIFNVFKEEDYNIYKKIFA 267 >UniRef50_Q0UQZ6 Putative uncharacterized protein n=2 Tax=Leotiomyceta RepID=Q0UQZ6_PHANO Length = 291 Score = 209 bits (533), Expect = 3e-53, Method: Composition-based stats. 
Identities = 79/177 (44%), Positives = 110/177 (62%), Gaps = 8/177 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I +++ DIT LA+D IVNAAN SL+GGGGVDGAIHRAAGP L D C + C Sbjct: 37 LNDKISIIRRDITTLAIDAIVNAANTSLLGGGGVDGAIHRAAGPKLYDECETL----DGC 92 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG+A +T +LP+K V+H VGP+ W+ G +LL Y SL+L N S+AF Sbjct: 93 ETGNAKMTRGYELPSKKVIHAVGPIYWKEGRSASAKLLSMCYRTSLQLAVDNECRSIAFS 152 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPE---QVYFVCYDEENAHLYERLLTQ 173 A+STGVYGYP AA +A++TV +F+ E +V F + E++ + Y R + + Sbjct: 153 ALSTGVYGYPSDEAAVVALQTVRQFLDEDGKAEKLDRVIFCNFLEKDENAYYREIQK 209 >UniRef50_Q8K4G6 MACRO domain-containing protein 1 (Fragment) n=5 Tax=cellular organisms RepID=MACD1_RAT Length = 258 Score = 209 bits (532), Expect = 4e-53, Method: Composition-based stats. Identities = 78/173 (45%), Positives = 109/173 (63%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I + +GDITKL VD IVNAAN SL+GGGGVDG IHRAAG L D C ++ +C Sbjct: 83 LNEKISLFRGDITKLEVDAIVNAANNSLLGGGGVDGCIHRAAGSLLTDECRTLQ----NC 138 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGE-QNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT LPAK V+HTVGP+ G ++ L+ YL+SL L+ + SVAFP Sbjct: 139 ETGKAKITCGYRLPAKHVIHTVGPIAVGQPTASQAAELRSCYLSSLDLLLEHRLRSVAFP 198 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 ISTGV+GYP AAE+ + T+ E++ +H +++ + E++ +Y+ L Sbjct: 199 CISTGVFGYPNEEAAEVVLATLREWLEQHKDKVDRLIICVFLEKDEGIYQERL 251 >UniRef50_B7PF53 MACRO domain-containing protein, putative n=2 Tax=cellular organisms RepID=B7PF53_IXOSC Length = 304 Score = 208 bits (531), Expect = 5e-53, Method: Composition-based stats. 
Identities = 73/176 (41%), Positives = 99/176 (56%), Gaps = 12/176 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L +D IVNAAN L+GGGGVDGAIH AAGP L + C + C Sbjct: 134 LNNKVSIFVGDITALEIDAIVNAANNRLLGGGGVDGAIHSAAGPKLKEECATLN----GC 189 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A IT LPAK V+HTVGPV + L Y+ SL A+ ++AFP Sbjct: 190 PTGEAKITGGYKLPAKYVIHTVGPV-----GENEAKLHGCYVTSLETAKAHKIRTLAFPC 244 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA---LPEQVYFVCYDEENAHLYERLLTQ 173 ISTG+YGYP AA +A+ E++ +++ F + + LYE+LL + Sbjct: 245 ISTGIYGYPNEKAAHVALSAAREWLDSEENALKVDRIIFCLFLPIDVRLYEKLLPE 300 >UniRef50_B7C8M6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B7C8M6_9FIRM Length = 296 Score = 208 bits (529), Expect = 8e-53, Method: Composition-based stats. Identities = 80/177 (45%), Positives = 113/177 (63%), Gaps = 6/177 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +++ I VV+GDIT D IVNAAN SL+GGGGVDGAIHRAAGP LL+ C + C Sbjct: 125 LESEIKVVKGDITTFDGDCIVNAANESLLGGGGVDGAIHRAAGPMLLEECKLLN----GC 180 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT DL AK V+HTVGP++ G+ ++ +L+D Y NSL L ++AFPA Sbjct: 181 QTGQAKITKGYDLKAKYVIHTVGPMY-SGKHEDEHMLRDCYWNSLTLARKYDIHTIAFPA 239 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQGD 176 IS GVYGYP A + +KT+++++ ++ ++ C+DEE Y++ + QG+ Sbjct: 240 ISCGVYGYPVEKAVPLVLKTIADWLDANSDYTMKISLYCFDEETTKEYQKYTSYQGE 296 >UniRef50_C4V1Q4 Appr-1-p processing domain protein n=3 Tax=Bacteria RepID=C4V1Q4_9FIRM Length = 289 Score = 207 bits (528), Expect = 1e-52, Method: Composition-based stats. 
Identities = 78/178 (43%), Positives = 107/178 (60%), Gaps = 7/178 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 RI + QGDIT+L D IVNAAN +L+G +D AIH AAG L AC + ++ Sbjct: 112 DARIALWQGDITRLNADAIVNAANSALLGCFIPCHRCIDNAIHSAAGLQLRAACAALMEE 171 Query: 57 QG-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYT 114 QG TG A IT +L ++ V+HTVGP+ G + + L Y + L L A + Sbjct: 172 QGHPEETGTAQITEGYNLSSRHVIHTVGPIVSGALTDRHRAQLASCYRSCLSLAAEHGLR 231 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 S+AF ISTG + +PRAAAAEIAV+ V +F+TR E+V F + +E+ H+YERLL+ Sbjct: 232 SIAFCCISTGEFHFPRAAAAEIAVREVRDFLTRDTSIERVVFNVFKDEDRHIYERLLS 289 >UniRef50_Q93SX7 UPF0189 protein n=2 Tax=Acinetobacter RepID=Y189_ACISE Length = 183 Score = 207 bits (527), Expect = 1e-52, Method: Composition-based stats. Identities = 81/173 (46%), Positives = 118/173 (68%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++H++Q DIT AV IVN+AN SL+GGGG+D IH+ AGP + + C+++ Q++G CPTG Sbjct: 3 KVHLIQADITAFAVHAIVNSANKSLLGGGGLDYVIHKKAGPLMKEECVRLNQEKGGCPTG 62 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A +T AG+LPAK ++H VGP W GE NE QLL DAY N+L +V+FP IST Sbjct: 63 QAEVTTAGNLPAKYLIHAVGPRWLDGEHNEPQLLCDAYSNALFKANEIHALTVSFPCIST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 GVYG+P AAEIA+ T+ + ++ +V+F+C ++EN +Y+ +L+ D Sbjct: 123 GVYGFPPQKAAEIAIGTILSMLPQYDHVAEVFFICREDENYLIYKNILSNIDD 175 >UniRef50_B6Q324 LRP16 family protein n=3 Tax=Trichocomaceae RepID=B6Q324_PENMQ Length = 308 Score = 206 bits (525), Expect = 3e-52, Method: Composition-based stats. 
Identities = 78/175 (44%), Positives = 101/175 (57%), Gaps = 8/175 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + + ++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAG LLD C + G C Sbjct: 38 LNDTLSHIRHDITKLQVDCIVNAANRSLLGGGGVDGAIHRAAGHRLLDECRAL----GGC 93 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT +LPA ++HTVGP++ + LL+ Y SL L + S+AF Sbjct: 94 RTGDAKITNGYNLPATKIIHTVGPIYDEDNHELSETLLRSCYRRSLELAVEHDQRSIAFS 153 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH---ALPEQVYFVCYDEENAHLYERLL 171 A+STGVYGYP AAA + V +F+ + E+V F + + YER L Sbjct: 154 AVSTGVYGYPNEAAARAVLDEVDKFLREGDNVSKLERVIFCSFMPADVRAYERYL 208 >UniRef50_B2ACK5 Predicted CDS Pa_3_1270 n=5 Tax=Eukaryota RepID=B2ACK5_PODAN Length = 253 Score = 205 bits (521), Expect = 8e-52, Method: Composition-based stats. Identities = 73/173 (42%), Positives = 101/173 (58%), Gaps = 7/173 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ V + DIT LAVD IVNAAN SL+GGGGVDGAIHRAAG L + C K+ C Sbjct: 47 LNDRVAVYRADITSLAVDAIVNAANRSLLGGGGVDGAIHRAAGRGLYEECKKLN----GC 102 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT A DLP V+H VGPV+ + + ++LL Y SL L + ++AF Sbjct: 103 KTGSAKITDAYDLPCNRVIHAVGPVYDPADHDTSEKLLVGCYTTSLELAVEHECRTIAFS 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERL 170 A+STG+YGYP AA A+ + +F+ ++V V +++++ Y Sbjct: 163 ALSTGIYGYPSREAAPAALSAIRKFLTGKDGDKIDKVILVTFEKKDVDAYTEF 215 >UniRef50_Q5KCD7 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5KCD7_CRYNE Length = 252 Score = 204 bits (519), Expect = 1e-51, Method: Composition-based stats. 
Identities = 71/174 (40%), Positives = 102/174 (58%), Gaps = 5/174 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ + +GDIT+L D+IVNAAN SL+GGGGVDGAIHRAAG LL+ C K+ G Sbjct: 70 LNDRVSIWRGDITELEADMIVNAANSSLLGGGGVDGAIHRAAGKHLLEECKKL----GGA 125 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG T +L +K + HTVGPV+ Q QLL+ Y +SL + + F Sbjct: 126 QTGETKFTAGYNLSSKKIAHTVGPVYHSHPPQRAAQLLKSCYQSSLEGCRDSGGGVIGFS 185 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 +ISTGVYGYP A IA++T +F+ + +V +V + + + +Y ++ Q Sbjct: 186 SISTGVYGYPIKDATHIALETTRQFLEQDDSITRVIYVVFSKRDEDVYREIIPQ 239 >UniRef50_Q985D2 UPF0189 protein mll7730 n=12 Tax=Bacteria RepID=Y7730_RHILO Length = 176 Score = 204 bits (519), Expect = 1e-51, Method: Composition-based stats. Identities = 94/170 (55%), Positives = 109/170 (64%), Gaps = 4/170 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI + GDITKL VD IVNAAN L+GGGGVDGAIHRAAG L C + C Sbjct: 6 DRIRIHTGDITKLDVDAIVNAANTLLLGGGGVDGAIHRAAGRELEVECRMLN----GCKV 61 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPA+ ++HTVGPVW+GG + E +LL Y +SL L AAN SVAFPAIS Sbjct: 62 GDAKITKGYKLPARHIIHTVGPVWQGGGKGEAELLASCYRSSLELAAANDCRSVAFPAIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 TGVY YP+ A IAV TVS I A+PE V F C+DE+ A LY R + Sbjct: 122 TGVYRYPKDEATGIAVGTVSMVIEEKAMPETVIFCCFDEQTAQLYLRAVA 171 >UniRef50_B8HYS5 Appr-1-p processing domain protein n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HYS5_CYAP4 Length = 187 Score = 203 bits (518), Expect = 2e-51, Method: Composition-based stats. 
Identities = 80/177 (45%), Positives = 109/177 (61%), Gaps = 10/177 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDC 60 R V+QGDIT L V+ IVNAAN L GGGV GAI RAAG L AC ++ G C Sbjct: 11 DPRFQVIQGDITTLEVEAIVNAANNELKPGGGVCGAIFRAAGYKQLQQACEQI----GYC 66 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A+IT +LPA+ +VHTVGPV+ G ++LL Y N L+ S +S+AFP Sbjct: 67 PTGEALITPGFNLPAQWIVHTVGPVY-GVTWASEELLARCYRNCLQFAGEESLSSIAFPL 125 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENA----HLYERLLTQ 173 ISTG+YG+P AAEIA++ + ++ ++ +QVY VCY E+ +Y+R+ + Sbjct: 126 ISTGIYGFPLEPAAEIAIREILTGLSCYSEIKQVYLVCYTPESYAAVLQIYDRICQK 182 >UniRef50_Q8EP31 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8EP31_OCEIH Length = 185 Score = 203 bits (518), Expect = 2e-51, Method: Composition-based stats. Identities = 77/179 (43%), Positives = 113/179 (63%), Gaps = 4/179 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ---G 58 + +V GDITK +VIVNAAN SL+GGGGVDGAIH AAGP LL AC ++R + Sbjct: 7 DNTLEIVVGDITKETTNVIVNAANGSLLGGGGVDGAIHHAAGPELLKACQEMRNNELNGE 66 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 + PTG +IT LP++ ++HTVGP+W +++LL + Y N+L LV +S++F Sbjct: 67 ELPTGEVIITSGFQLPSRFIIHTVGPIWNQTPDLQEELLANCYRNALELVKVKKLSSISF 126 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 P+ISTGVYGYP AA IA++T+ +F+ + + V V + E + +Y+ L ++ Sbjct: 127 PSISTGVYGYPIHEAAAIALQTIIQFLQENDVGL-VKVVLFSERDYSIYQEKLKYLIEK 184 >UniRef50_D1ZDH8 Whole genome shotgun sequence assembly, scaffold_20 n=4 Tax=cellular organisms RepID=D1ZDH8_SORMA Length = 261 Score = 203 bits (517), Expect = 2e-51, Method: Composition-based stats. 
Identities = 82/175 (46%), Positives = 109/175 (62%), Gaps = 8/175 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + RI + GDITKL +D IVNAAN SL+GGGGVDGAIHRAAGP LL C + + C Sbjct: 90 LNKRIAIHTGDITKLHIDAIVNAANNSLLGGGGVDGAIHRAAGPQLLREC----RTKRTC 145 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFP 119 TG AV+T A +LP V+HTVGPV+ G +E ++LL YL SL++ A T++AFP Sbjct: 146 DTGDAVMTEAYNLPCAKVIHTVGPVYSGVNHDECEKLLISCYLRSLQIAAETGLTTIAFP 205 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHAL---PEQVYFVCYDEENAHLYERLL 171 +ISTGVYGYP AA+ A+ + F+T +V V + +++ Y L Sbjct: 206 SISTGVYGYPSKEAAQAALAAIRHFLTDPKTRNAITKVIIVTFVDKDTRAYTEWL 260 >UniRef50_Q2LUU1 Appr-1-p histone processing protein n=5 Tax=Bacteria RepID=Q2LUU1_SYNAS Length = 214 Score = 202 bits (513), Expect = 6e-51, Method: Composition-based stats. Identities = 79/172 (45%), Positives = 111/172 (64%), Gaps = 4/172 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + ++QGDIT+ D IVNAAN L GGGGVDGAIHRA GP+++ C ++ G CP Sbjct: 40 NSVLALIQGDITQEDTDAIVNAANTGLRGGGGVDGAIHRAGGPSIMAECRRI----GGCP 95 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG AVIT G + A+ V+HTVGPV+R G E +LL AY SL++ +A S++FPAI Sbjct: 96 TGQAVITTGGKMKARYVIHTVGPVYRDGSHGEAELLASAYRESLKMASARHLKSLSFPAI 155 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S GVYGYP AA IA++TV +++ ++ E V FV +++ + L + Sbjct: 156 SAGVYGYPLEEAARIALQTVIDYLKKNRDIELVRFVLFNQSTYDAFSNALGK 207 >UniRef50_B9MLL8 Appr-1-p processing domain protein n=6 Tax=Clostridiales RepID=B9MLL8_ANATD Length = 181 Score = 201 bits (512), Expect = 7e-51, Method: Composition-based stats. 
Identities = 73/171 (42%), Positives = 103/171 (60%), Gaps = 3/171 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I + +GDITK VDVIVNAAN L GGGV AI +A G + ++ ++ G PTG Sbjct: 9 KIAIKKGDITKENVDVIVNAANSHLRHGGGVALAIVKAGGIEIQKESDEIIKKIGMLPTG 68 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 HAVIT A LP K V+HTVGP++ GE NED+ L A NSL L + S+AFPA+S+ Sbjct: 69 HAVITNAYRLPCKFVIHTVGPIY--GEGNEDEKLSMAIYNSLYLAHLYNLKSIAFPAVSS 126 Query: 124 GVYGYPRAAAAEIAVKTVSEFITR-HALPEQVYFVCYDEENAHLYERLLTQ 173 G++G+P+ A+I + T +F++ E+V F +D+E +E Sbjct: 127 GIFGFPKDRCAKILIDTAVDFLSSIKTSIEKVVFCLFDDETYGYFEEYYKN 177 >UniRef50_A2SS36 Appr-1-p processing domain protein n=26 Tax=cellular organisms RepID=A2SS36_METLZ Length = 183 Score = 201 bits (512), Expect = 7e-51, Method: Composition-based stats. Identities = 80/166 (48%), Positives = 104/166 (62%), Gaps = 5/166 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + + VV+ DIT L+VDVIVNAAN +L+GGGGVDGAIH AAGP LL C + G C Sbjct: 8 ITDHLGVVKTDITTLSVDVIVNAANTTLLGGGGVDGAIHHAAGPGLLAECRTL----GGC 63 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G A IT LPAK ++HTVGPVW GG + E + L+ Y +SL L + ++AFPA Sbjct: 64 RIGEAKITKGYALPAKYIIHTVGPVWWGGNEGEPEQLRACYFHSLTLAGEHGLRTIAFPA 123 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAH 165 +STGVYGYP+ AA IAV+TV F+ ++V V + + Sbjct: 124 VSTGVYGYPKDKAAVIAVETVLSFLRDDPDAFDRVILVAHSNADFQ 169 >UniRef50_Q03IQ8 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 n=5 Tax=Streptococcus RepID=Q03IQ8_STRTD Length = 260 Score = 201 bits (512), Expect = 9e-51, Method: Composition-based stats. 
Identities = 75/179 (41%), Positives = 109/179 (60%), Gaps = 7/179 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 RI++ +GDIT+L +D IVNAAN +L+G VD AIH AG L AC ++ + Sbjct: 82 DKRIYLWKGDITRLEIDAIVNAANKTLLGCMKPLHNCVDNAIHTYAGVQLRQACFELILE 141 Query: 57 QG-DCPTGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDAYLNSLRLVAANSYT 114 QG + P G A IT A +LP+ V+HTVGP + ++ LL +YL+ L L N Sbjct: 142 QGYEEPVGMAKITPAYNLPSAFVIHTVGPKIGNQVTPIDEDLLIKSYLSVLALAEKNKIE 201 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S+A P ISTG + +P+ AAEIA+KTV FI + ++V F +D+EN ++Y++LL + Sbjct: 202 SIAIPCISTGDFNFPKQKAAEIAIKTVKSFIDHSEIVKKVIFNVFDDENLNIYQKLLAE 260 >UniRef50_C8VIG2 LRP16 family protein (AFU_orthologue; AFUA_3G13850) n=7 Tax=Trichocomaceae RepID=C8VIG2_EMENI Length = 374 Score = 201 bits (511), Expect = 1e-50, Method: Composition-based stats. Identities = 75/179 (41%), Positives = 105/179 (58%), Gaps = 12/179 (6%) Query: 1 MKTRIHVVQGDITKLA-VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + + +V+ DITKL VD IVNAA SL+GGGGVD AIH+AAGP LL C + Sbjct: 36 LNDTVAMVRHDITKLQGVDCIVNAAKRSLLGGGGVDYAIHKAAGPDLLKECRTLN----G 91 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRG----GEQNEDQLLQDAYLNSLRLVAANSYTS 115 C TG A IT A +LP K ++HTVGP++ G+ ++LL+ Y L + N S Sbjct: 92 CDTGDAKITNAYNLPNKRIIHTVGPIYSDAMRRGKDEPERLLRSCYRRCLEVAVENEMKS 151 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLL 171 +AF AISTG+YGYP AA+ A+ +F+ L E+V F ++ ++ YE+L+ Sbjct: 152 IAFNAISTGIYGYPSRDAAKAALDETRKFLETDKNTGLLERVIFCNFELKDVEAYEQLI 210 >UniRef50_A7IGI6 Appr-1-p processing domain protein n=53 Tax=cellular organisms RepID=A7IGI6_XANP2 Length = 193 Score = 200 bits (509), Expect = 2e-50, Method: Composition-based stats. 
Identities = 87/174 (50%), Positives = 107/174 (61%), Gaps = 4/174 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + R+ +V GDIT+LA+D IVNAAN SL+GGGGVDGAIHRAAGP LL C + G CP Sbjct: 19 QARLDIVVGDITRLALDAIVNAANSSLLGGGGVDGAIHRAAGPELLAYCRTL----GGCP 74 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG A +T LPA V+HTVGPVW GG E+ LL Y SL+L S+AFPAI Sbjct: 75 TGEARLTPGFRLPAAHVIHTVGPVWHGGGAGEEGLLGSCYRESLKLADGAGLASIAFPAI 134 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 STG+YG+P AA +AV TV + +V F C+ +E A L+ G Sbjct: 135 STGIYGFPADRAAPLAVGTVLAHLGAPGSVTRVVFCCFSQEAADLHHDAFRAHG 188 >UniRef50_C7N880 Predicted phosphatase, C-terminal domain of histone macro H2A1 like protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N880_SLAHD Length = 263 Score = 200 bits (508), Expect = 2e-50, Method: Composition-based stats. Identities = 73/179 (40%), Positives = 102/179 (56%), Gaps = 8/179 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + R+ V QGDIT+L D IVNAAN ++G +D IH AG L + C ++ + Sbjct: 83 LDQRLSVWQGDITRLRADAIVNAANSQMLGCWAKCHSCIDNVIHTYAGVQLREECDRIMR 142 Query: 56 QQGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSY 113 QG+ PTGHA +T A +LP+K V+HTVGP+ +G +L L Y + L AA Sbjct: 143 AQGENEPTGHAKVTGAYNLPSKHVIHTVGPIAQGHPTARHRLQLAQCYTSCLDAAAATGC 202 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 S+AF ISTGVYG+P AA IAV TV +++ RH +P V F + +Y+ +L Sbjct: 203 ESIAFCGISTGVYGFPAEQAAPIAVDTVRDWLDRHPDVPMHVVFNVFGNRQLSIYQDIL 261 >UniRef50_Q4P1I0 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P1I0_USTMA Length = 220 Score = 199 bits (507), Expect = 3e-50, Method: Composition-based stats. 
Identities = 75/170 (44%), Positives = 102/170 (60%), Gaps = 8/170 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + GDIT L++D IVNAAN SL+GGGGVDGAIHRAAG L+ C K+ C TG Sbjct: 38 LSIFTGDITTLSIDAIVNAANNSLLGGGGVDGAIHRAAGRELVVECGKLN----GCETGS 93 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPAIST 123 A TL LP+K V+HTVGPV+ E + LL+ AY +SL + S+AFP+IST Sbjct: 94 AKTTLGYALPSKHVIHTVGPVYNSSRHEECERLLRSAYRSSLEELRKIGAKSIAFPSIST 153 Query: 124 GVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERL 170 GVYGYP AA A+ + ++ H E++ C+ +++ + Y L Sbjct: 154 GVYGYPFDTAATAALDEIGSWLESNENHKHIERIVLCCFSQKDYNKYLEL 203 >UniRef50_Q47EQ7 Appr-1-p processing n=1 Tax=Dechloromonas aromatica RCB RepID=Q47EQ7_DECAR Length = 186 Score = 198 bits (505), Expect = 4e-50, Method: Composition-based stats. Identities = 72/169 (42%), Positives = 102/169 (60%), Gaps = 2/169 (1%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD- 59 M R+ + GD+T AVD IVNAAN +L+GGGGVDGAIHR GPA+LDAC ++R+ Q Sbjct: 10 MNGRVRLYVGDLTDQAVDAIVNAANRTLLGGGGVDGAIHRRGGPAILDACRELRRSQWPD 69 Query: 60 -CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 PTG +T G LPA V+HTVGP++ E +LL Y N++ L A S+AF Sbjct: 70 GLPTGQVALTNGGKLPAPYVIHTVGPIYGQHRGKEAELLAACYRNAIELAAHLELKSLAF 129 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY 167 P+ISTG +GYP AA I +++ + + A +++ V ++ + Sbjct: 130 PSISTGAFGYPPDKAALIVSRSMHKVLDEIAAIDEIRLVFFNASQMETF 178 >UniRef50_C9KLM2 Appr-1-p processing enzyme family domain protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KLM2_9FIRM Length = 262 Score = 198 bits (505), Expect = 5e-50, Method: Composition-based stats. 
Identities = 69/181 (38%), Positives = 100/181 (55%), Gaps = 9/181 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + R+ + QGDIT+L +D IVNAAN ++G +D AG + C K+ Q Sbjct: 81 LDPRLVLWQGDITRLRIDAIVNAANRQMLGCFLPNHNCIDNIEQTMAGVEMRYNCYKLMQ 140 Query: 56 QQGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSY 113 QG PTG IT LPA+ V+HTVGP+ +G +E +LL Y + L L A + Sbjct: 141 AQGHDEPTGKVKITSGYHLPARFVLHTVGPIVQGSLTDEHRRLLASCYESCLTLAAEHGL 200 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 VAF ISTGV+ +P+ AAA IAV+TV ++ H ++V F +++ + +YE LL Sbjct: 201 KGVAFCCISTGVFRFPKDAAAHIAVRTVQHWLDVHPAASIKRVIFDVFEDADRRIYENLL 260 Query: 172 T 172 Sbjct: 261 N 261 >UniRef50_B2JCA0 Appr-1-p processing domain protein n=13 Tax=Proteobacteria RepID=B2JCA0_BURP8 Length = 183 Score = 198 bits (505), Expect = 5e-50, Method: Composition-based stats. Identities = 79/164 (48%), Positives = 98/164 (59%), Gaps = 4/164 (2%) Query: 11 DITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLA 70 DIT L VD +VNAAN SL+GGGGVDGA+HRAAG LL C Q G C TG A IT Sbjct: 15 DITTLDVDAVVNAANTSLLGGGGVDGALHRAAGADLLREC----QTLGGCVTGDAKITGG 70 Query: 71 GDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPR 130 L A+ V+H VGPVW GG + E +LL Y SL L S+AFPAIS GVY +P Sbjct: 71 HRLKARHVIHAVGPVWHGGGRGEAELLASCYRRSLELARDAKAKSIAFPAISCGVYRFPA 130 Query: 131 AAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 A IA++TV + + R + E+V F C+DE Y+ ++ Sbjct: 131 DEAVRIAMQTVIDTLPRVSTVERVIFACFDEAMHARYKAEFGRR 174 >UniRef50_C4Q6S1 Expressed protein n=1 Tax=Schistosoma mansoni RepID=C4Q6S1_SCHMA Length = 224 Score = 198 bits (505), Expect = 6e-50, Method: Composition-based stats. 
Identities = 77/206 (37%), Positives = 108/206 (52%), Gaps = 39/206 (18%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI + +GDIT L +D I NAAN L GGGGVDGAIHRAAG LL+AC K+ C Sbjct: 25 LGSRISLWRGDITHLQIDAIANAANSQLRGGGGVDGAIHRAAGSQLLEACQKLS----GC 80 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T +LP+K V+H VGPV D L+ Y +L L + ++ S+AFP Sbjct: 81 PTGDAKLTPGFNLPSKYVIHCVGPV-----GRNDVALESTYRKALELCSEHNIQSIAFPC 135 Query: 121 ISTGVY------------------------------GYPRAAAAEIAVKTVSEFITRHAL 150 ISTGVY +P AAA++A+ TV ++ H Sbjct: 136 ISTGVYEVQKTRENKKRIDLIKGLDDQIFKPDFPDDCFPNEAAAKVALHTVLSYLKSHQE 195 Query: 151 PEQVYFVCYDEENAHLYERLLTQQGD 176 ++V F + + + +YE L+ + D Sbjct: 196 IQRVIFCIFMDVDYKIYENLIPEMLD 221 >UniRef50_Q9EYI6 UPF0189 protein in sno 5'region n=22 Tax=Bacteria RepID=Y189_STRNO Length = 181 Score = 198 bits (504), Expect = 6e-50, Method: Composition-based stats. Identities = 86/175 (49%), Positives = 104/175 (59%), Gaps = 6/175 (3%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG--DC 60 T I +VQGDIT+ D +VNAAN SL+GGGGVDGAIHR GPA+L C +R + Sbjct: 2 TTITLVQGDITRQHADALVNAANSSLLGGGGVDGAIHRRGGPAILAECRALRASRYGEGL 61 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG AV T AGDL A+ V+HTVGPVW ++ LL Y SLRL +VAFPA Sbjct: 62 PTGRAVATTAGDLDARWVIHTVGPVW-SSTEDRSDLLASCYRESLRLAGELGARTVAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 +STGVY +P AA IAV+TV T E+V FV +D + R L G Sbjct: 121 LSTGVYRWPMGDAARIAVETVR---TTPTAVEEVRFVLFDTHAYDTFARELGDAG 172 >UniRef50_C2LSS3 Protein in Tap1-dppD intergenic region n=1 Tax=Streptococcus salivarius SK126 RepID=C2LSS3_STRSL Length = 254 Score = 197 bits (502), Expect = 1e-49, Method: Composition-based stats. 
Identities = 69/180 (38%), Positives = 102/180 (56%), Gaps = 8/180 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 ++ +++ QGDIT+LA D IVNAAN L+G +D AIH AAG L AC ++ Q Sbjct: 76 IRPNLYLWQGDITRLAADAIVNAANSKLLGCFVPNHSCIDNAIHTAAGVELRLACQELMQ 135 Query: 56 QQGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSY 113 +QG+ TG A +T A +LP++ V+HTVGP+ + E Q L +Y L L Sbjct: 136 EQGEDETTGQAKMTKAYNLPSRYVLHTVGPIIYDEVTDLERQQLASSYEECLNLAYEKGL 195 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S+AF ISTG + +P AA+IA++TV +F H+ V F + + + +Y+ LL Sbjct: 196 RSLAFCCISTGEFRFPNEEAAKIAIETVLQFQKEHSDMV-VIFNVFKDLDYAIYQSLLKN 254 >UniRef50_Q9HJ67 UPF0189 protein Ta1105 n=1 Tax=Thermoplasma acidophilum RepID=Y1105_THEAC Length = 196 Score = 197 bits (502), Expect = 1e-49, Method: Composition-based stats. Identities = 76/173 (43%), Positives = 102/173 (58%), Gaps = 2/173 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CPT 62 + V GDIT+ + IVNAAN SLMGGGGVDGAIH AAGP L +K+R+++ P Sbjct: 11 LAVEVGDITESDAEAIVNAANSSLMGGGGVDGAIHSAAGPELNGELVKIRRERYPNGLPP 70 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AVIT L A ++HTVGPVW GG ED +L +Y + L L +AFPA+S Sbjct: 71 GEAVITRGYRLKASHIIHTVGPVWMGGRNGEDDVLYRSYRSCLDLAREFGIHDIAFPALS 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 TG YG+P A IA+++V +F+ + V FV Y E+ + +L+ G Sbjct: 131 TGAYGFPFDRAERIAIRSVIDFLKDESAGYTVRFVFYTEDQGKRFLFILSDLG 183 >UniRef50_P67344 UPF0189 protein SA0314 n=54 Tax=Staphylococcus RepID=Y314_STAAN Length = 266 Score = 197 bits (501), Expect = 2e-49, Method: Composition-based stats. 
Identities = 67/182 (36%), Positives = 95/182 (52%), Gaps = 8/182 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 I V QGDIT L +D IVNAAN +G +D IH AG + C ++ +QQ Sbjct: 85 DNIFVWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQ 144 Query: 58 GD-CPTGHAVITLAGDLPAKAVVHTVGPVWR--GGEQNEDQLLQDAYLNSLRLVAANSYT 114 G G A T +LPAK ++HTVGP R + LL YL+ L+L +S Sbjct: 145 GRNEGVGKAKKTRGYNLPAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLN 204 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 VAF ISTGV+ +P+ AAEIAV+TV ++ +V F + +++ LY+ L + Sbjct: 205 HVAFCCISTGVFAFPQDEAAEIAVRTVESYLKETNSTLKVVFNVFTDKDLQLYKEALNRD 264 Query: 175 GD 176 + Sbjct: 265 AE 266 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 197 bits (500), Expect = 2e-49, Method: Composition-based stats. Identities = 67/164 (40%), Positives = 94/164 (57%), Gaps = 4/164 (2%) Query: 10 GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 GDITK + IVN+ + +L G + AIH+AAGP LL AC ++ C G A +T Sbjct: 425 GDITKEKAEAIVNSTDRNLSNSGALSRAIHQAAGPELLQACQDLQ----GCTVGGAKLTP 480 Query: 70 AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYP 129 +L A V+HTV P W+GG Q E++LL Y N L+L + S S+AFPAI+ G G+P Sbjct: 481 GFNLRANWVIHTVAPKWKGGNQGEEELLVSCYQNCLQLAVSQSIRSLAFPAIACGAMGFP 540 Query: 130 RAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 AA IA++TVS F+ + V F+C D+E Y+ + Sbjct: 541 PEIAARIALETVSNFLLSNMAIGSVAFICADKETLQYYQEAFQR 584 >UniRef50_Q93RG0 UPF0189 protein in tap1-dppD intergenic region n=14 Tax=Bacteria RepID=Y189_TREMD Length = 261 Score = 195 bits (496), Expect = 6e-49, Method: Composition-based stats. 
Identities = 71/174 (40%), Positives = 100/174 (57%), Gaps = 7/174 (4%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +V +GDIT L VD IVNAAN + G +D IH AG L C + Q+QG Sbjct: 88 YVWRGDITTLKVDAIVNAANSGMTGCWQPCHACIDNCIHTFAGVQLRTVCAGIMQEQGHE 147 Query: 61 -PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAF 118 PTG A IT A +LP K V+HTVGP+ G + + LL ++Y + L L A N S+AF Sbjct: 148 EPTGTAKITPAFNLPCKYVLHTVGPIISGQLTDRDCTLLANSYTSCLNLAAENGVKSIAF 207 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTGV+ +P AAEIAV TV ++ ++ ++ F + E++ LY +L++ Sbjct: 208 CCISTGVFRFPAQKAAEIAVATVEDWKAKNNSAMKIVFNVFSEKDEALYNKLMS 261 >UniRef50_A0LGZ1 Appr-1-p processing domain protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LGZ1_SYNFM Length = 175 Score = 194 bits (493), Expect = 1e-48, Method: Composition-based stats. Identities = 71/173 (41%), Positives = 102/173 (58%), Gaps = 7/173 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +VQGD+T+L VD IVNAAN L GGGV GAI GP + + C + G G Sbjct: 9 KISLVQGDLTELRVDAIVNAANRHLALGGGVAGAIRMKGGPTIQEECDAI----GGTVVG 64 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT G+L A V+H VGP + GE +ED+ L++A LNSL+ S S+AFPA+ST Sbjct: 65 QAVITGGGNLKAAHVIHAVGPRY--GEGDEDEKLRNATLNSLKRATEKSLASIAFPAVST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQ-VYFVCYDEENAHLYERLLTQQG 175 G++G+P+ A+I + F+ R + V F + +E+ ++E+ L G Sbjct: 123 GIFGFPKDRCAKIMLDAAVAFLDRETTSLRDVIFCLWSKEDLEIFEKTLQSMG 175 >UniRef50_Q1R0S7 Appr-1-p processing n=12 Tax=Proteobacteria RepID=Q1R0S7_CHRSD Length = 183 Score = 193 bits (492), Expect = 2e-48, Method: Composition-based stats. 
Identities = 79/174 (45%), Positives = 107/174 (61%), Gaps = 5/174 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 R+ VV GDIT+L VD IVNAAN SLMGGGGVDGAI+RAAGPAL AC +R+ P Sbjct: 9 RVDVVSGDITRLDVDAIVNAANHSLMGGGGVDGAIYRAAGPALKRACRALRETHWPDGLP 68 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G +T +LPA+ V+HTVGPV+ +++ LL + Y N++ L A +AFPAI Sbjct: 69 DGEVALTEGFELPARYVIHTVGPVY-AKTRDKSHLLANCYRNAVALAAETGCRRIAFPAI 127 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 STGVYGYP AA I + T+ + + H L +V + E + + + ++G Sbjct: 128 STGVYGYPFDDAAHIVIDTLHDALAIHDL--RVTLCFFSERDYQAFAEIAMRRG 179 >UniRef50_C9LYS3 Appr-1-p processing enzyme family domain protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LYS3_9FIRM Length = 302 Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats. Identities = 73/190 (38%), Positives = 96/190 (50%), Gaps = 19/190 (10%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 + QGDIT+LAVD IVNAAN +L G +D AIH AAG AL AC ++ ++ Sbjct: 112 DENFVLWQGDITRLAVDAIVNAANSALRGCFVPLHRCIDNAIHSAAGLALRAACDEIMRE 171 Query: 57 QG-DCPTGHAVITLAGDLPAKAVVHTVGPV-------------WRGGEQNEDQLLQDAYL 102 QG P G A IT +LPA+ V+HTVGP+ + G Q L Y Sbjct: 172 QGHPEPAGRAKITPGFNLPARHVLHTVGPIIAPAGSPVHEPGVFAGVTHEAQQCLVSCYR 231 Query: 103 NSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEE 162 L L A SVAF ISTG + YP AAE AV T ++ H P ++ F + +E Sbjct: 232 ACLDLAAERRLASVAFCCISTGEFHYPPQEAAETAVATCRAWLQAHDTPMRIVFNVFKDE 291 Query: 163 NAHLYERLLT 172 + +Y R+ Sbjct: 292 DLAIYRRIFQ 301 >UniRef50_A7BY23 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BY23_9GAMM Length = 708 Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats. 
Identities = 65/175 (37%), Positives = 98/175 (56%), Gaps = 7/175 (4%)

Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63
+IH++QG+IT+ VD IVN + SL G G +D AI A G L +AC +Q G C
Sbjct: 532 KIHIIQGNITQQKVDAIVNTTDRSLSGSGAIDYAIQNAGGIELKEAC----RQLGTCSVA 587

Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123
A IT +LPA+ V+HTVGP W GG Q E + L Y N L L + +AFP I
Sbjct: 588 EAKITEGYNLPAQFVIHTVGPNWEGGNQKEAEKLAQCYRNCLALAEQQGFKIIAFPTIGV 647

Query: 124 GVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYE--RLLTQQG 175
G G+ AA++A+ +S F+ +++ E+V VC+++ ++ +LL ++
Sbjct: 648 GGLGFSHELAAKVAIYEISSFLQQKNSSLEKVILVCFNQRVYEHFQETKLLLERS 702

>UniRef50_B6KFB3 Appr-1-p processing enzyme family domain-containing protein n=3 Tax=Toxoplasma gondii RepID=B6KFB3_TOXGO
Length = 817

Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats.
Identities = 73/173 (42%), Positives = 105/173 (60%), Gaps = 12/173 (6%)

Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63
++ + +GDIT+L VDVIVNAANPSL+GGGGVDGAIHR AGP L Q G C TG
Sbjct: 48 KVVLYRGDITELDVDVIVNAANPSLLGGGGVDGAIHRKAGPQL----RVFNQTLGGCKTG 103

Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123
+ A L K + HTVGP Q L+ YLN+L L+ + Y ++AFP IST
Sbjct: 104 EVKASPAFQLVCKQIFHTVGPRGEQS-----QALRACYLNALELLKRSKYRTIAFPCIST 158

Query: 124 GVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLLTQ 173
G+YGYP+ AA++ K V++++ + + + F ++ ++ YE+LL++
Sbjct: 159 GIYGYPQLNAAQVVTKCVTKWLKIPANYEAVDFIVFCVFERQDFLFYEQLLSK 211

>UniRef50_A4YFR3 Appr-1-p processing domain protein n=9 Tax=Thermoprotei RepID=A4YFR3_METS5
Length = 220

Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats.
Identities = 64/177 (36%), Positives = 93/177 (52%), Gaps = 5/177 (2%)

Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60
+ + +++GDITK+ D IVNAAN L GGGV AI R G A+ + ++ G
Sbjct: 47 LGFEVDLMKGDITKIEADAIVNAANSYLSHGGGVAWAIVRRGGEAIQRESDQYVREHGPV 106

Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120
P G +T AG L AK V+H VGP + G + ED+ L A SL S+A PA
Sbjct: 107 PVGEVAVTGAGSLRAKYVIHAVGPRY--GLEGEDK-LHSAIRRSLEKAEELGLRSLALPA 163

Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177
ISTG+YGYP A + + + + + E+V V YD+ +E++ T++ E
Sbjct: 164 ISTGIYGYPMEVCARVMASVLRSY--KPKILEKVIVVLYDDMAYSTFEKVFTRELQE 218

>UniRef50_B6SKT6 Protein LRP16 n=12 Tax=cellular organisms RepID=B6SKT6_MAIZE
Length = 239

Score = 193 bits (490), Expect = 3e-48, Method: Composition-based stats.
Identities = 78/179 (43%), Positives = 109/179 (60%), Gaps = 10/179 (5%)

Query: 5 IHVVQGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD- 59
+ + +GDIT +VD IVNAAN ++GGGGVDGAIH+AAGP L+ AC KV + +
Sbjct: 62 LKLHKGDITLWSVDCATDAIVNAANERMLGGGGVDGAIHQAAGPELVQACRKVPEVKPGV 121

Query: 60 -CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118
CPTG A IT A +LPA V+HTVGP++ + E L+ AY NSL+L N +AF
Sbjct: 122 RCPTGEARITPAFELPASRVIHTVGPIYDLDKHPEVS-LKKAYENSLKLAKDNGIQYIAF 180

Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177
PAIS GVY YP A++IAV T +F ++V+FV + ++ +++ Q +
Sbjct: 181 PAISCGVYRYPPKEASKIAVSTAQKFSED---IKEVHFVLFSDDLYNIWRETAQQLLSQ 236

>UniRef50_C4M8N0 Putative uncharacterized protein n=2 Tax=Entamoeba RepID=C4M8N0_ENTHI
Length = 627

Score = 192 bits (489), Expect = 3e-48, Method: Composition-based stats.
Identities = 65/181 (35%), Positives = 107/181 (59%), Gaps = 9/181 (4%)

Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQ 56
++ + +GDITKL VD IVNAAN L+G +D AIH AGP L C + +
Sbjct: 131 SNKLALWKGDITKLCVDAIVNAANNQLLGCFVPHHLCIDNAIHTFAGPQLRRDCSIIMNK 190

Query: 57 QG-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYT 114
QG + PTG+A +T A +LP+K V+HTVGP+ +++ LL+ +Y+N L +
Sbjct: 191 QGFEEPTGYAKVTRAYNLPSKYVIHTVGPIVESQLKESHCNLLRSSYINCLNIADDLHLE 250

Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLT 172
S+AF ISTG++G+P+ A+ IA++TV ++ + ++V F + + + +Y + +T
Sbjct: 251 SIAFSCISTGLFGFPQNVASVIAIETVINWLYENPFTSIKKVIFDVFSDNDLQIYTKNVT 310

Query: 173 Q 173
+
Sbjct: 311 E 311

>UniRef50_A5ZAB5 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A5ZAB5_9FIRM
Length = 274

Score = 192 bits (488), Expect = 4e-48, Method: Composition-based stats.
Identities = 70/186 (37%), Positives = 100/186 (53%), Gaps = 14/186 (7%)

Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55
+ +I + QGD+T+L VD IVNAAN +L+G +D AIH AG L + C K+
Sbjct: 89 LADKISIWQGDMTRLKVDAIVNAANSALLGCFVPCHRCIDNAIHSGAGMELREECNKIMN 148

Query: 56 QQG-------DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRL 107
Q+ + PTG A IT A +LP K V+HTVGP+ G +E L++ Y + L
Sbjct: 149 QRKIKYGTNYEEPTGTATITEAYNLPCKKVIHTVGPICYFGLNDELCNDLKNCYESVLNC 208

Query: 108 VAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHL 166
A N +VAF ISTG + +P AA IA TV F+ + E+V F Y + + +
Sbjct: 209 CAENGLKTVAFCCISTGEFRFPNKEAAVIAKDTVERFLMKKENNIERVIFCVYKDLDREI 268

Query: 167 YERLLT 172
Y++L
Sbjct: 269 YDKLYK 274

>UniRef50_B9S4E3 Protein LRP16, putative n=2 Tax=cellular organisms RepID=B9S4E3_RICCO
Length = 269

Score = 192 bits (488), Expect = 5e-48, Method: Composition-based stats.
Identities = 76/171 (44%), Positives = 107/171 (62%), Gaps = 10/171 (5%)

Query: 5 IHVVQGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD- 59
+ + +GDITK VD IVN AN ++GGGG DGAIHRAAGP L+DAC KV + +
Sbjct: 95 LKINKGDITKWFVDGSSDAIVNPANEKMLGGGGADGAIHRAAGPELVDACYKVPEVRPGI 154

Query: 60 -CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118
CPTG A IT LPA V+HTVGP++ +N +L++AY NSL + N+ +AF
Sbjct: 155 RCPTGEARITPGFKLPASHVIHTVGPIY-DANRNSAAILKNAYRNSLSVAKDNNIKFIAF 213

Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYER 169
PAIS GVY YP AA +++ T+ EF ++V+FV + +E +++ +
Sbjct: 214 PAISCGVYLYPFEEAASVSISTIKEFADD---IKEVHFVLFSDEIFNVWVK 261

>UniRef50_C8NG26 Appr-1-p processing enzyme family domain protein n=2 Tax=Granulicatella RepID=C8NG26_9LACT
Length = 264

Score = 192 bits (487), Expect = 6e-48, Method: Composition-based stats.
Identities = 66/180 (36%), Positives = 99/180 (55%), Gaps = 9/180 (5%)

Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56
+I + GD+ +L VD IVNAAN ++G +D AIH +G L C + ++
Sbjct: 82 NDQIKLYYGDLCELKVDAIVNAANSEMLGCFIPNHRCIDNAIHTFSGIELRTFCHHLMKK 141

Query: 57 QGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRLVAANS 112
QG P G A IT A +LP+K ++HTVGP G++ +QLL Y + L
Sbjct: 142 QGKKEPVGKAKITPAFNLPSKYIIHTVGPFLSPGQKVTPLREQLLASCYKSCLEAAREAG 201

Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172
TS+AF ISTG +G+P+ AA IA TV++++ A V F Y +E+ +Y++LL+
Sbjct: 202 LTSIAFCGISTGEFGFPKEPAALIAEDTVNKWLQDTASTITVVFSTYTKEDQSIYQKLLS 261

>UniRef50_UPI000186F16D conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186F16D
Length = 367

Score = 192 bits (487), Expect = 6e-48, Method: Composition-based stats.
Identities = 73/148 (49%), Positives = 94/148 (63%), Gaps = 9/148 (6%)

Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60
M RI + +GDIT L VD IVNAAN SL+GGGGVDGAIHR AG LL+ C + C
Sbjct: 56 MNDRISLWKGDITTLGVDAIVNAANSSLLGGGGVDGAIHRKAGKFLLEECKTLN----GC 111

Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120
PTG A IT +LP+K V+HTVGP GE+ + LL+ Y + L+ N+ S+AFP
Sbjct: 112 PTGSAKITGGYNLPSKYVIHTVGP---QGEKPD--LLESCYKSCFHLMLDNNLESIAFPC 166

Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH 148
ISTG+YGYP+ AA +A+ F+ +
Sbjct: 167 ISTGIYGYPQGPAAVVALTCARNFLESN 194

>UniRef50_C4FEN5 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FEN5_9BIFI
Length = 173

Score = 191 bits (485), Expect = 1e-47, Method: Composition-based stats.
Identities = 65/155 (41%), Positives = 89/155 (57%), Gaps = 3/155 (1%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64
VV DIT + VD I NAAN L+ G GV GAI RAAG + + + TG
Sbjct: 21 FSVVHHDITDMQVDAIANAANTDLLMGSGVCGAIFRAAGASRMQEACD---RLSPIRTGE 77

Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124
AVIT DLPA+ V+HT GP+WRGG+ NE+ LL+ Y + L + + + TS+AFP IS G
Sbjct: 78 AVITPGFDLPARYVIHTAGPLWRGGDHNEEALLRSCYRSCLAIASVHGCTSMAFPLISAG 137

Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY 159
+YGYPRA A ++A + ++ + V +
Sbjct: 138 IYGYPRAEALDVAEDEIRYWLKENDSTMDVKLALW 172

>UniRef50_B9XAD9 Appr-1-p processing domain protein n=1 Tax=bacterium Ellin514 RepID=B9XAD9_9BACT
Length = 184

Score = 191 bits (485), Expect = 1e-47, Method: Composition-based stats.
Identities = 70/176 (39%), Positives = 101/176 (57%), Gaps = 8/176 (4%)

Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61
KT ++ GDI D +V AA+ L G G DG IH GP + + C ++ G CP
Sbjct: 7 KTLFELITGDIADQETDAVVTAAHWKLNKGSGTDGVIHTRGGPQIYEECRRI----GGCP 62

Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121
G AVIT G+L AK V+H VGPVWRGG+++E +LL AY SL + + S++FP+I
Sbjct: 63 IGDAVITTGGNLKAKHVIHAVGPVWRGGDEHEPELLASAYRRSLEVATEHKLKSISFPSI 122

Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQ-VYFVCYDEEN---AHLYERLLTQ 173
STG + YP AA IA+KT+ +++ + + V V Y E+ +YE+ L +
Sbjct: 123 STGAFVYPIKLAAPIALKTICDYLQKEQHTLEFVRLVLYTREDDKAFLVYEKALQE 178

>UniRef50_D1BM15 Appr-1-p processing domain protein n=15 Tax=Bacteria RepID=D1BM15_VEIPT
Length = 259

Score = 191 bits (485), Expect = 1e-47, Method: Composition-based stats.
Identities = 66/177 (37%), Positives = 95/177 (53%), Gaps = 7/177 (3%)

Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56
+ +I++ QGDIT+LAV IVNAAN L+G +D AIH AG L AC ++ +
Sbjct: 82 EPQIYLWQGDITRLAVKAIVNAANEQLLGCFLPNHKCIDNAIHTFAGIELRMACARMTEY 141

Query: 57 QG-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYT 114
TG A +T +LPA V+HTVGP+ + E + L Y + L L A S
Sbjct: 142 MDMPEKTGVARMTYGFNLPASHVIHTVGPIVYDTVTDLEKEQLSSCYRSCLELANAYSLK 201

Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171
S+AF ISTG + +P AA+IA+ TV ++ QV F + + + +Y +LL
Sbjct: 202 SIAFCCISTGEFRFPNELAAQIAIDTVRRYLKETNSKIQVVFNVFKDIDYDIYNKLL 258

>UniRef50_Q0B030 Phosphatase n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0B030_SYNWW
Length = 176

Score = 188 bits (479), Expect = 5e-47, Method: Composition-based stats.
Identities = 82/171 (47%), Positives = 100/171 (58%), Gaps = 8/171 (4%)

Query: 4 RIHVVQGDITKLAVD--VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61
I VVQGDIT+ D VIVNAAN SL GGGGVDGAIHRAAGP L +
Sbjct: 7 EIQVVQGDITRQE-DMAVIVNAANSSLRGGGGVDGAIHRAAGPELKKESSALA----PIG 61

Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121
G AVIT A LP + V+H VGPV+ G + ED+LL Y N+LRL S+AFPAI
Sbjct: 62 PGQAVITGAYRLPNRYVIHCVGPVY-GVHKPEDELLASCYRNALRLAEKQQLDSIAFPAI 120

Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172
STGVYGYP AA++ KT+ E I +++ V +D L+ + L
Sbjct: 121 STGVYGYPMREAAQVMFKTIIEVIPELKHIKKIRIVLFDHPAYELHRQALE 171

>UniRef50_A8FSV2 Putative uncharacterized protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FSV2_SHESH
Length = 293

Score = 188 bits (478), Expect = 8e-47, Method: Composition-based stats.
Identities = 75/177 (42%), Positives = 107/177 (60%), Gaps = 11/177 (6%)

Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQGDC 60
+ GDIT+L VD I+NAAN L+G +D IH AAG L D C + +QQG
Sbjct: 113 SIWVGDITQLKVDAIINAANVYLLGCRQPNHRCIDNVIHSAAGSRLRDDCATIIEQQGGL 172

Query: 61 -PTGHAVITLAGDLPAKAVVHTVGPVWRGG---EQNEDQLLQDAYLNSLRLVAA-NSYTS 115
PTG A IT LPAK V+HTVGP G ++ +++ L+ AY + L L + N +
Sbjct: 173 EPTGSAKITRGYALPAKYVIHTVGPCLHSGYLPDEEDEKQLKSAYQSCLTLASEINDLKT 232

Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLL 171
+AF AISTGV+ YP+ AA +A++TVS++++ H E+V F Y + +A +YERL+
Sbjct: 233 LAFCAISTGVFSYPKIDAASVALETVSDWLSEHPQHFEKVVFNLYTQADAAIYERLI 289

>UniRef50_A7B8S3 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7B8S3_9ACTO
Length = 270

Score = 188 bits (477), Expect = 8e-47, Method: Composition-based stats.
Identities = 74/185 (40%), Positives = 104/185 (56%), Gaps = 15/185 (8%)

Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQ 57
R+ + +GDIT+L VD IVNAAN +L+G +D AIH AAG L AC +V ++
Sbjct: 85 PRMALWRGDITRLEVDAIVNAANSALLGCRAPGHTCIDNAIHSAAGLELRQACAEVMAER 144

Query: 58 G------DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAA 110
PTG AV+T LP++ V+HTVGP+ G +E + L +Y L AA
Sbjct: 145 TRGDGPSGFPTGEAVLTPGFHLPSRFVIHTVGPIVNGELTDEHREALACSYQRCLEEAAA 204

Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLY 167
+ +VAF ISTGV+G+P+ AA IAV TV++F+ R A +V F + + + LY
Sbjct: 205 HGLNTVAFCCISTGVFGFPQEEAARIAVSTVADFLESDTRGASEVRVIFDVFGDHDEALY 264

Query: 168 ERLLT 172
LL
Sbjct: 265 RALLR 269

>UniRef50_B5YAF3 Conserved protein n=2 Tax=Dictyoglomus RepID=B5YAF3_DICT6
Length = 182

Score = 188 bits (477), Expect = 9e-47, Method: Composition-based stats.
Identities = 66/170 (38%), Positives = 98/170 (57%), Gaps = 3/170 (1%)

Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63
++ VV+GDIT+ V+ IVNAAN L GGGV GAI RA G + + ++ G P G
Sbjct: 13 KLKVVKGDITQEEVEAIVNAANSYLKHGGGVAGAIVRAGGEVIQKESDEYVEKYGPLPVG 72

Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123
A IT AG L AK V+HTVGP W GE +E++ L+ A + L L + S++ PA+S
Sbjct: 73 SATITSAGKLKAKYVIHTVGPRW--GEGDEEKKLEKAIESVLTLAKEKNIKSLSIPAVSC 130

Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLT 172
G++G+P +I V V EF+ + + E+++F+ +E L+ L
Sbjct: 131 GIFGFPPQLGTKIIVNKVVEFLKDNPGVFEEIHFIGIGDEIPTLFVDALK 180

>UniRef50_D1U7C0 Appr-1-p processing domain protein n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U7C0_9DELT
Length = 186

Score = 186 bits (474), Expect = 2e-46, Method: Composition-based stats.
Identities = 77/170 (45%), Positives = 104/170 (61%), Gaps = 7/170 (4%)

Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQ-----QQ 57
++ + QGDIT L VD +VNAANP L GGGGVDGAIHRAAG L AC +
Sbjct: 10 QLVIRQGDITTLDVDCVVNAANPQLAGGGGVDGAIHRAAGIAQLRQACQAIIDDPGQLPT 69

Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117
G P G AV+TL DLPA+ ++HTVGP+WRGG E + L+ +Y +SL+L ++ ++A
Sbjct: 70 GQLPVGQAVLTLGFDLPARYIIHTVGPIWRGGVHGESEQLRSSYQSSLKLAHQHALATIA 129

Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY 167
FPA+S G YGYP AA IA+ + + + L QV+ V +D +
Sbjct: 130 FPALSCGAYGYPIPQAARIALDAIRQGLLD-GLAAQVHMVLHDHAACETW 178

>UniRef50_B0EH33 Putative uncharacterized protein n=2 Tax=Entamoeba RepID=B0EH33_ENTDI
Length = 348

Score = 186 bits (474), Expect = 2e-46, Method: Composition-based stats.
Identities = 67/178 (37%), Positives = 96/178 (53%), Gaps = 8/178 (4%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGD 59
I V +GDITKL +D IVNAAN +L+G VD IH AG L C +++
Sbjct: 93 IRVWKGDITKLKIDSIVNAANNTLVGCFIPLHSCVDSIIHERAGVQLRYECSQLKTAYKA 152

Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119
T IT +LPAK V+H VGP+ + + LLQ YLN L + TS+ F
Sbjct: 153 TTT-TTEITKGYNLPAKYVIHVVGPIVDTLKPKDSYLLQQCYLNCLNKAIESGCTSIGFC 211

Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177
ISTG++G+P AA+IA++TV+ F+ H + V F + E + ++Y LL ++
Sbjct: 212 CISTGMFGFPNEEAAQIAIQTVNNFLKDHQI--DVVFCVFKEIDYNIYTSLLNDGFNQ 267

>UniRef50_Q5XC09 UPF0189 protein M6_Spy0919 n=20 Tax=Streptococcus RepID=Y919_STRP6
Length = 270

Score = 186 bits (473), Expect = 2e-46, Method: Composition-based stats.
Identities = 71/185 (38%), Positives = 97/185 (52%), Gaps = 11/185 (5%)

Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57
T + + GDI LAVD IVNAAN L+G G +D AIH AG L AC + +Q
Sbjct: 84 TSLFLYHGDIRYLAVDAIVNAANSELLGCFIPNHGCIDNAIHTFAGSRLRLACQAIMTEQ 143

Query: 58 GDCP-TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE---DQLLQDAYLNSLRLVAANSY 113
G G A +T A LPA ++HTVGP G LL Y +SL L
Sbjct: 144 GRKEAIGQAKLTSAYHLPASYIIHTVGPRITKGRHVSPIRADLLARCYRSSLDLAVKAGL 203

Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLL 171
TS+AF +ISTG +G+P+ AA+IA+KTV ++ H + V F + E+ LY+ L
Sbjct: 204 TSLAFCSISTGEFGFPKKEAAQIAIKTVLKWQAEHPESKTLTVIFNTFTSEDKALYDTYL 263

Query: 172 TQQGD 176
++ +
Sbjct: 264 QKENN 268

>UniRef50_A0L536 Appr-1-p processing domain protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L536_MAGSM
Length = 180

Score = 186 bits (473), Expect = 2e-46, Method: Composition-based stats.
Identities = 81/173 (46%), Positives = 104/173 (60%), Gaps = 3/173 (1%)

Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--C 60
T + ++ DIT+L +D +VNAAN SL+GG GVDGAIHR G AL AC +R
Sbjct: 2 TTLEIILTDITQLPIDGVVNAANNSLLGGMGVDGAIHRVGGTALTQACQALRHTHYPDGL 61

Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120
TG AV T AG+LPAK V+HTVGPV+ + + L D Y NSLR S+AFPA
Sbjct: 62 ATGAAVATCAGELPAKRVIHTVGPVY-AKDPDPQARLADCYRNSLRCAQEEGLRSIAFPA 120

Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173
ISTGVYG+P+ AA IAV T+ + + E+V V + EE+A + L Q
Sbjct: 121 ISTGVYGFPKQQAANIAVATLLQALREGVALERVVLVAFSEEDAQILRHALNQ 173

>UniRef50_UPI0001B4DEB3 hypothetical protein ShygA5_39675 n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4DEB3
Length = 311

Score = 186 bits (473), Expect = 3e-46, Method: Composition-based stats.
Identities = 73/180 (40%), Positives = 100/180 (55%), Gaps = 9/180 (5%)

Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57
R + QGDIT L D +VNAAN +L+G +D AIH AAGP L C + +Q
Sbjct: 127 DRTVLWQGDITTLGADAVVNAANSALLGCFAPMHPCIDNAIHTAAGPRLRADCHTIMTRQ 186

Query: 58 G-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAA-NSYT 114
G PTG A IT LPA+ V+HTVGP+ G + D+ L +Y L L A +
Sbjct: 187 GHPEPTGTAKITRGYHLPARYVLHTVGPIVDGPLRPVHDRALAASYRACLDLAAEVDGLR 246

Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQ 173
+VAF ISTGV+GYPR AA A+ TV++++ H ++V F Y +++ Y LT+
Sbjct: 247 TVAFCGISTGVFGYPRKPAARAALDTVADWLGTHPGRLDRVIFNVYADDDHAAYTHALTE 306

>UniRef50_A6GJ81 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GJ81_9DELT
Length = 173

Score = 186 bits (472), Expect = 3e-46, Method: Composition-based stats.
Identities = 77/171 (45%), Positives = 105/171 (61%), Gaps = 4/171 (2%)

Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-D 59
M I + +GDIT+++ D IVNAANP ++GGGGVDGAIHRAAGP LL AC +V + G
Sbjct: 1 MAPSITLERGDITRVSCDAIVNAANPKMLGGGGVDGAIHRAAGPELLAACRRVPKVNGIR 60

Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119
CP G A IT A L A+ V+H VGP++ E + +L AY ++L L AA+ T +A P
Sbjct: 61 CPFGEARITPAFGLDARWVIHAVGPIYARSE-DPKGVLARAYASALELAAAHDVTELACP 119

Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170
A+STG YG+P AA IA++TV+ +V FV + E + +
Sbjct: 120 ALSTGAYGFPLDPAARIALETVAS--RDWGCVARVRFVLFTAEVMAAFAKF 168

>UniRef50_C7GZB8 Appr-1-p processing enzyme family domain protein n=3 Tax=Bacteria RepID=C7GZB8_9FIRM
Length = 268

Score = 186 bits (472), Expect = 3e-46, Method: Composition-based stats.
Identities = 74/185 (40%), Positives = 105/185 (56%), Gaps = 14/185 (7%)

Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACL---- 51
+K + V QGDIT+L VD IVNAAN ++G +D IH AG L + C
Sbjct: 83 IKDNLSVWQGDITRLKVDAIVNAANSQMLGCFIPLHTCIDNQIHTFAGIQLREECDQKME 142

Query: 52 KVRQQQGD---CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRL 107
K+R++ G PT ++T +LPAK VVH VGP+ GG ++ ++ L D Y N+L +
Sbjct: 143 KLREKYGRDYEQPTAIPMLTEGYNLPAKKVVHIVGPIVSGGLTSDLEKDLADCYTNTLDM 202

Query: 108 VAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHL 166
N+ SV F ISTGV+ +P AAEIAVKTV ++ H+ E++ F + +E+
Sbjct: 203 CMENNLKSVVFCCISTGVFHFPNKRAAEIAVKTVGKWCEAHSYSLERIIFNVFKDEDKKY 262

Query: 167 YERLL 171
YE LL
Sbjct: 263 YEELL 267

>UniRef50_B8LP86 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=B8LP86_PICSI
Length = 231

Score = 184 bits (468), Expect = 1e-45, Method: Composition-based stats.
Identities = 75/179 (41%), Positives = 103/179 (57%), Gaps = 10/179 (5%)

Query: 5 IHVVQGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD- 59
+ + +GDITK VD IVNAAN L+GGGGVDGAIHRAAGP LL AC + +
Sbjct: 56 LLLHRGDITKWTVDGHTDAIVNAANERLLGGGGVDGAIHRAAGPDLLKACRQFPKVSRGI 115

Query: 60 -CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118
CP G A IT +LP ++HTVGPV+ E E + L DAY +SL + N +AF
Sbjct: 116 RCPVGSARITRGFNLPVSRIIHTVGPVYDMEEDPESK-LADAYRSSLNITRENEVKYIAF 174

Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177
PAIS G+YGYP AA +++ TV + I ++V+FV ++ + + ++
Sbjct: 175 PAISCGIYGYPYEEAAAVSLTTVRDSIKD---LKEVHFVLFEMPAWEAWLEKANELFEQ 230

>UniRef50_B8I4Z8 Appr-1-p processing domain protein n=7 Tax=Bacteria RepID=B8I4Z8_CLOCE
Length = 341

Score = 184 bits (468), Expect = 1e-45, Method: Composition-based stats.
Identities = 73/169 (43%), Positives = 95/169 (56%), Gaps = 8/169 (4%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDA-CLKVRQQQGDCPTG 63
+V+ DITKL VD IVNAAN L GGGV GAI +AAG A L A C K+ TG
Sbjct: 3 FIIVRQDITKLKVDAIVNAANTDLRMGGGVCGAIFKAAGAAQLQAVCDKLA----PIKTG 58

Query: 64 HAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122
VIT +L AK V+H GPV+ + +Q L+ AY NSL+ N S+AFP IS
Sbjct: 59 EVVITPGFNLSAKFVIHAAGPVYRHWNREQGEQYLRAAYTNSLKCAVENKCESIAFPLIS 118

Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171
+G+YGYP+ A +A + FIT H + V V +D+ + +LL
Sbjct: 119 SGIYGYPKDEALRVATSEIHNFITDHDI--DVTLVVFDKSAFTVSRKLL 165

>UniRef50_C2D2Z2 Appr-1-p processing enzyme family domain protein n=1 Tax=Lactobacillus brevis subsp. gravesensis ATCC 27305 RepID=C2D2Z2_LACBR
Length = 274

Score = 184 bits (467), Expect = 1e-45, Method: Composition-based stats.
Identities = 66/178 (37%), Positives = 95/178 (53%), Gaps = 6/178 (3%)

Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55
++ +I++ QGDIT+LAVD IVN AN ++G G +D IH AG L A K
Sbjct: 95 IRPKIYLWQGDITQLAVDAIVNPANSRMLGCFIPNHGCLDNQIHTKAGIQLRLADQKAMA 154

Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDAYLNSLRLVAANSYT 114
+ TG A +T +LPAK V+HTVGP + QLL D+Y + L+L +
Sbjct: 155 GERLEATGKAKLTPGFNLPAKFVIHTVGPVIIHQVTPLRRQLLADSYQSCLKLAEQKDLS 214

Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172
+AF ISTG + +P AA+IAV TV+++++ H V F + + LY L
Sbjct: 215 ELAFCCISTGEFRFPHDLAAQIAVNTVNDYLSSHINAPDVIFAVNSDLDKALYLHELE 272

>UniRef50_A0Q2I9 Appr-1-p processing enzyme family protein n=3 Tax=Clostridia RepID=A0Q2I9_CLONN
Length = 183

Score = 183 bits (466), Expect = 2e-45, Method: Composition-based stats.
Identities = 56/176 (31%), Positives = 96/176 (54%), Gaps = 6/176 (3%)

Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61
I + +GDIT + D IVN AN L GGGV AI + G + + K+ +++G P
Sbjct: 6 NKEIIIKKGDITNESSDAIVNPANGMLKHGGGVAAAIVKKGGREVQEESNKIVRKEGIIP 65

Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121
TG AVIT +LP K ++H VGP GE +E L++A L++L L ++ S++ PAI
Sbjct: 66 TGGAVITKGYNLPCKYIIHAVGPRM--GEGDEKLKLKNAVLSALCLAEQHNLKSISIPAI 123

Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQGD 176
S+G++ +P+ A+I + T +F+ + + D++ ++ L ++ +
Sbjct: 124 SSGIFRFPKDECAKILINTSIKFLQTSAKSLKTIVMCNLDDKTYEIF---LQEEKE 176

>UniRef50_C8WYT5 Appr-1-p processing domain protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8WYT5_DESRD
Length = 188

Score = 183 bits (465), Expect = 2e-45, Method: Composition-based stats.
Identities = 78/178 (43%), Positives = 105/178 (58%), Gaps = 5/178 (2%)

Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63
R+ + QGDIT V +VNAAN L GGGGVDGA+ RAAGP LL A + ++ G G
Sbjct: 12 RLEIRQGDITAAEVGAVVNAANSRLAGGGGVDGALQRAAGPQLLQAGQEYVREHGALSVG 71

Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123
AV+T LPA V+HTVGP+WRGG NE+ LL+ AY N L++ S+AFPAIS
Sbjct: 72 DAVVTPGFALPASQVIHTVGPIWRGGGHNEEALLERAYANCLQVAKDQGIQSIAFPAISC 131

Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY----ERLLTQQGDE 177
GVYG+P AA IA+ + + R A+ V Y + ++ +RL+ + +E
Sbjct: 132 GVYGFPEKRAAAIAIPVIVAALERDAVS-SVALYLYSNPSYAVWYNEAQRLIGAEHEE 188

>UniRef50_A8H4N3 Appr-1-p processing domain protein n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H4N3_SHEPA
Length = 304

Score = 183 bits (465), Expect = 2e-45, Method: Composition-based stats.
Identities = 66/180 (36%), Positives = 99/180 (55%), Gaps = 9/180 (5%)

Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56
T+I + +GDIT LAVD IVNAAN ++G +D AIH AG L C + +
Sbjct: 119 DTKIILWKGDITTLAVDAIVNAANNQMLGCFQPQHKCIDNAIHNRAGAQLRADCEVIMEL 178

Query: 57 QGDC-PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAA-NSY 113
QG+ TG A IT A +LP+K V+HTVGP+ + Q L +Y + L L
Sbjct: 179 QGNIEETGIAKITRAYNLPSKFVIHTVGPIVQNMIQPIHAGQLASSYRSILTLAKQTERI 238

Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLT 172
S+AF +ISTG++GYP A +A+ TV++++ + + + F + E + H+Y+ L
Sbjct: 239 RSLAFCSISTGIFGYPIEQATRVALDTVTQWLMENPDQFDTIVFNVFSEYDHHVYQSALE 298

>UniRef50_C1QBX0 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBX0_9SPIR
Length = 257

Score = 183 bits (465), Expect = 2e-45, Method: Composition-based stats.
Identities = 57/178 (32%), Positives = 98/178 (55%), Gaps = 7/178 (3%)

Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55
++ +++ QGDIT L +D +VNAAN S++G +D AIH A+G L CL
Sbjct: 81 IRDNLYLWQGDITTLNIDAVVNAANSSMLGCFIPLHKCIDNAIHSASGTRL-RLCLNNIM 139

Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYT 114
+ +G +IT A +LP++ ++HTVGP+ + + +++LL + Y + L N+
Sbjct: 140 KGKTEDSGQCIITKAFNLPSRYILHTVGPIIQNSVSKKDEELLYNCYKSCLETAKENNIK 199

Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172
S+AF ISTG + +P A++IAV V +F+ ++ F + + + LY +L
Sbjct: 200 SIAFCCISTGEFKFPNKEASQIAVNAVKDFLNNSKYDIKIVFNVFKDLDYELYYDILK 257

>UniRef50_B7C850 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C850_9FIRM
Length = 310

Score = 183 bits (465), Expect = 2e-45, Method: Composition-based stats.
Identities = 62/167 (37%), Positives = 90/167 (53%), Gaps = 7/167 (4%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64
I +++ DITKL VD IVN NPSL GG+D IH+ AG L C ++ G+ G
Sbjct: 3 IKIIRQDITKLKVDAIVNTTNPSLDAKGGLDHYIHQFAGKELDVECRRI----GNLKVGQ 58

Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124
A +T L K ++HT PVW +N + LL+ YL+SL L S+AFP IS+G
Sbjct: 59 ACLTSGYKL-CKYIIHTASPVWNIQNKNNEALLKSCYLSSLMLANEYKLKSIAFPLISSG 117

Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171
+P+ A ++A+ ++ F+T H + VY V YD + + L
Sbjct: 118 TNQFPKELALQVAMNSIVSFLTDHEMM--VYLVVYDRNSYKISSELF 162

>UniRef50_B9WC14 Putative uncharacterized protein n=5 Tax=Candida RepID=B9WC14_CANDC
Length = 564

Score = 181 bits (461), Expect = 6e-45, Method: Composition-based stats.
Identities = 64/184 (34%), Positives = 97/184 (52%), Gaps = 14/184 (7%)

Query: 2 KTRIHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55
+ + +GDIT L V IVNAAN +L+G +D IH AAGP L AC + Q
Sbjct: 90 NATVSLWKGDITTLTDVTAIVNAANSTLLGCFQPRHKCIDNVIHIAAGPDLRQACYNLMQ 149

Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGE--QNEDQLLQDAYLNSLR---LVAA 110
+ PTG A IT +LPAK V+HTVGP+ + E + L Y +SL ++
Sbjct: 150 SK-SEPTGSAKITPGFNLPAKYVIHTVGPIIHNESVTKREQEQLASCYQSSLEALEMLND 208

Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYE 168
S+AF +STG++ +P+ A+ IA+ TV +++ H + + + F + E+ +YE
Sbjct: 209 EKDKSIAFCCVSTGLFAFPKELASTIAINTVHDYLKTHPNSTIKHIVFNVFSNEDKEVYE 268

Query: 169 RLLT 172
L
Sbjct: 269 NNLQ 272

>UniRef50_UPI00006A2284 UPI00006A2284 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2284
Length = 694

Score = 181 bits (460), Expect = 8e-45, Method: Composition-based stats.
Identities = 57/178 (32%), Positives = 95/178 (53%), Gaps = 4/178 (2%)

Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63
+ V + D+ + +VDV+VNAAN L GG+ GA+ RAAGP L C ++ + +G G
Sbjct: 2 TVAVYKDDLARHSVDVVVNAANEDLKHIGGLAGALLRAAGPKLQTDCDQIIKIRGRLSAG 61

Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122
AVIT AG+LP K V+H VGPVW + D+ L A + L L A + S+ PA+S
Sbjct: 62 DAVITDAGNLPCKQVIHAVGPVWNAFFPGKCDRQLHKAITSCLDLAARKGHRSIGIPAVS 121

Query: 123 TGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQGDE 177
+G++G+P + ++ ++ H+ +Q++ V + + L + ++
Sbjct: 122 SGIFGFPLKRCVTHILGSIKAYVEDNSAHSTIKQIHLVALESATVQAFTDALRAESEQ 179

Score = 121 bits (305), Expect = 7e-27, Method: Composition-based stats.
Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 8/172 (4%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63
I V+Q I DVIVN L + + A+ AGP L L Q P G
Sbjct: 193 IKVIQQAIEDSTTDVIVNNVGQKLQLNEWQISRALAARAGPQLQQ-LLSNSSQGASAPNG 251

Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123
T +L V+H V P W Q+L+ + + L+L S S++ PAI T
Sbjct: 252 SVFSTDGCNLNCAKVLHVVMPQWD----RRTQVLRKSIKSCLKLTEQQSLQSISIPAIGT 307

Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHLYERLLTQ 173
G GYP+ A + K + F ++ ++V V + D EN ++ + L +
Sbjct: 308 GKLGYPKDLVAAVTFKEILHFSSKAQSLQEVNIVLHPRDTENIQVFSKELQR 359

>UniRef50_Q97AU0 UPF0189 protein TV0719 n=2 Tax=cellular organisms RepID=Y719_THEVO
Length = 186

Score = 181 bits (459), Expect = 9e-45, Method: Composition-based stats.
Identities = 71/168 (42%), Positives = 103/168 (61%), Gaps = 4/168 (2%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CPT 62
I +++GDIT + + IVNAANPSLMGGGGVDGAIH G + C ++R+ + P
Sbjct: 11 IEIIEGDITDVNCEAIVNAANPSLMGGGGVDGAIHLKGGKTIDLECAELRRTKWPKGLPP 70

Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122
G A IT G L AK V+HTVGP++R G++ + + L +Y SL + + +AFPAIS
Sbjct: 71 GEADITSGGKLKAKYVIHTVGPIYR-GQEEDAETLYSSYYRSLEIAKIHGIKCIAFPAIS 129

Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170
TG+YGYP A+ IA+K V++F++ + + FV Y + + L
Sbjct: 130 TGIYGYPFEEASVIALKAVTDFLS-NKEGYIIKFVLYGQARYQTFVSL 176

>UniRef50_B8DKL2 Appr-1-p processing domain protein n=3 Tax=Desulfovibrio RepID=B8DKL2_DESVM
Length = 202

Score = 181 bits (459), Expect = 1e-44, Method: Composition-based stats.
Identities = 76/167 (45%), Positives = 96/167 (57%), Gaps = 1/167 (0%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64
+ V GD+ A D +VNAAN L GGGGVDGA+HRAAGP LL A + ++G G
Sbjct: 17 LAVSTGDLAATATDAVVNAANAELRGGGGVDGALHRAAGPMLLPAGRDIVARRGPLAAGE 76

Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124
AVIT +LPA+ V+H VGP+WRGG E Q L + NSLRL A + VAFPAIS G
Sbjct: 77 AVITPGFNLPARHVIHAVGPIWRGGTHGEPQALAAVHANSLRLAAEHGLARVAFPAISCG 136

Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171
YGYP AA IA+ + R L +V FV + + ++
Sbjct: 137 SYGYPPELAAPIALAEAVRGL-RAGLVREVRFVLHGQAMLAVWRTAF 182

>UniRef50_C4FT52 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FT52_9FIRM
Length = 263

Score = 180 bits (458), Expect = 1e-44, Method: Composition-based stats.
Identities = 65/182 (35%), Positives = 94/182 (51%), Gaps = 9/182 (4%)

Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQ 56
+ +++ QGDIT+LAVD IVNAAN +++G +D IH AG AL AC +++
Sbjct: 73 RPSLYLWQGDITRLAVDAIVNAANSAMLGCFEPNHYCIDNQIHTFAGVALRLACADLKKA 132

Query: 57 QG--DCPTGHAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAANS 112
+G P G A++T +LPAK V+HTVGP LL+ AY L
Sbjct: 133 RGGKPLPVGQALMTSGFNLPAKQVIHTVGPRIHHLPVSPMMQDLLKKAYRACLACADQAG 192

Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172
++AF ISTG + YP A IA++TVS ++ +V F + + LY LL
Sbjct: 193 LATIAFCCISTGEFSYPIEEATPIAIETVSAYLAETGSKLKVIFNVWTDSQYQLYHDLLN 252

Query: 173 QQ 174
+
Sbjct: 253 SK 254

>UniRef50_Q17432 Protein B0035.3, confirmed by transcript evidence n=3 Tax=Chromadorea RepID=Q17432_CAEEL
Length = 203

Score = 180 bits (458), Expect = 1e-44, Method: Composition-based stats.
Identities = 73/171 (42%), Positives = 96/171 (56%), Gaps = 7/171 (4%)

Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPT 62
RI V GDITKL+VD IVNAAN L GGGGVDGAIHRAAG L + C QQ C
Sbjct: 25 RISVWDGDITKLSVDAIVNAANSRLAGGGGVDGAIHRAAGRKQLQEEC----QQYNGCAV 80

Query: 63 GHAVITLAGDLP-AKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPA 120
G AVIT ++ K ++HTVGP G +E + L Y SL + N S+AF
Sbjct: 81 GDAVITSGCNINHIKKIIHTVGPQVYGNVTDERRENLVACYRTSLDIAIENGMKSIAFCC 140

Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171
ISTGVYGYP AA+ ++E++ ++ E++ V + + + Y +
Sbjct: 141 ISTGVYGYPNDDAAKTVTNFLTEYLEKNDTIERIVLVTFLDIDNEHYNKYF 191

>UniRef50_A8JCH3 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8JCH3_CHLRE
Length = 160

Score = 180 bits (458), Expect = 1e-44, Method: Composition-based stats.
Identities = 71/148 (47%), Positives = 92/148 (62%), Gaps = 3/148 (2%)

Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--C 60
T++ + QGDIT VD IVNAAN ++GGGGVDGAIHRAAGP L+ AC +V + C
Sbjct: 12 TKLVIKQGDITVEDVDAIVNAANERMLGGGGVDGAIHRAAGPQLVRACAEVPEVYPGVRC 71

Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120
PTG A IT L A+ V+HTVGP++ ++ LL AY +S+ L A S++FP
Sbjct: 72 PTGEARITPGFHLKARHVIHTVGPIYHN-DRVSAPLLASAYRSSVELAAQQGLASLSFPG 130

Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH 148
ISTGV+GYP AA++ V T H
Sbjct: 131 ISTGVFGYPWDKAAQVRVHTTHGHPRSH 158

>UniRef50_B0A8R6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B0A8R6_9CLOT
Length = 361

Score = 180 bits (458), Expect = 1e-44, Method: Composition-based stats.
Identities = 56/167 (33%), Positives = 88/167 (52%), Gaps = 5/167 (2%)

Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64
+++ IT + D IVN N L GGV G+I AG +L+ K ++ G T
Sbjct: 3 FEIIRQYITNMKTDAIVNPTNNELKPTGGVCGSIFEKAGYEILE---KKCKKIGYLETTE 59

Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124
AVIT +L K ++HTVGP+W + + LL + Y N L+L + S+AFP IS+G
Sbjct: 60 AVITKGYNLDCKYIIHTVGPIWDNAKSDNATLLYNTYTNCLKLAKSKKCNSIAFPLISSG 119

Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171
+GYP+ A +IA + F+ + + +Y V +D E+ + + L
Sbjct: 120 NFGYPKDKALDIATNAIKNFLLENDML--IYLVVFDRESFKINKDLF 164

>UniRef50_A9WK70 Appr-1-p processing domain protein n=3 Tax=Chloroflexus RepID=A9WK70_CHLAA
Length = 190

Score = 180 bits (457), Expect = 2e-44, Method: Composition-based stats.
Identities = 72/175 (41%), Positives = 97/175 (55%), Gaps = 7/175 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPTG 63 + VV+GDI VD IVNAAN L+ GGGV GAI RAAG L AC V CPTG Sbjct: 15 LEVVEGDIVSQQVDAIVNAANEQLLQGGGVCGAIFRAAGAAELQRACDAVA----PCPTG 70 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A IT LPA+ ++H VGP++ +E D+LL AY SL L S+AFP+I+ Sbjct: 71 EARITPGFALPARYIIHAVGPIFDHYAPSEADRLLISAYRASLALARQYGLQSIAFPSIA 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 TG+YG+P AA + ++T+ + + H P V V + + +Y + T E Sbjct: 131 TGIYGFPVTRAAPLVLQTLIDDLHTHQAPGLVRMVLW-RDTFPVYRDVFTHMQSE 184 >UniRef50_Q87JZ5 UPF0189 protein VPA0103 n=5 Tax=Proteobacteria RepID=Y4103_VIBPA Length = 170 Score = 180 bits (457), Expect = 2e-44, Method: Composition-based stats. Identities = 86/171 (50%), Positives = 107/171 (62%), Gaps = 5/171 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-DCPTG 63 I +VQGDIT VD IVNAANP ++GGGGVDGAIHRAAGPAL++AC V G CP G Sbjct: 4 ISLVQGDITTAHVDAIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDVDGIRCPFG 63 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT AG+L A+ V+H VGP++ + +L+ AY SL L AN SVA PAIS Sbjct: 64 DARITEAGNLNARYVIHAVGPIY-DKFADPKTVLESAYQRSLDLALANHCQSVALPAISC 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 GVYGYP AAE+A+ V + AL + F + EE +++ LTQ Sbjct: 123 GVYGYPPQEAAEVAM-AVCQRPEYAALDMR--FYLFSEEMLSIWQHALTQH 170 >UniRef50_C7H575 RNase III regulator YmdB n=2 Tax=Faecalibacterium prausnitzii RepID=C7H575_9FIRM Length = 343 Score = 179 bits (455), Expect = 3e-44, Method: Composition-based stats. 
Identities = 69/169 (40%), Positives = 94/169 (55%), Gaps = 5/169 (2%) Query: 8 VQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVI 67 ++ DITK+A D IVN AN +L+ G G AI++AAG L A + G C G AV Sbjct: 6 IRNDITKVAADAIVNPANRNLLQGSGTSRAIYQAAGEQELTAACEAI---GRCDLGRAVC 62 Query: 68 TLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYG 127 T A LPAK + H V P W GG E + L AY ++L+L A SVAFP +S+G YG Sbjct: 63 TPAFGLPAKYIFHAVCPAWHGGGFGEAEQLAGAYHSALKLAAKYHCESVAFPLLSSGNYG 122 Query: 128 YPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 YP+ A IAV T+++++ H L VY V YD + + +L + Sbjct: 123 YPKEQAFRIAVDTITQYVMEHDLT--VYLVLYDRGSLAVSRKLFASVEE 169 >UniRef50_B9YC00 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YC00_9FIRM Length = 182 Score = 179 bits (454), Expect = 4e-44, Method: Composition-based stats. Identities = 82/181 (45%), Positives = 108/181 (59%), Gaps = 12/181 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + GDIT + ++IVNAAN SL+GGGGVDG IHR AGP LL C + C TG Sbjct: 2 ITFIHGDITSVPAEIIVNAANRSLLGGGGVDGVIHRKAGPQLLAECRTLH----GCETGQ 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV----AANSYTS--VAF 118 A +T A DL + ++HTVGPVW GG E LL Y SLRL + +S + F Sbjct: 58 AKVTKAYDLSCRWIIHTVGPVWSGGRHQEVDLLASCYQQSLRLARQLQKEHRLSSLTIVF 117 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ--VYFVCYDEENAHLYERLLTQQGD 176 P ISTG+Y +P+A A IAV TV + ++ ++ V F CY+ E+A LY+R L +GD Sbjct: 118 PCISTGIYHFPKALACSIAVDTVRDTLSELQAEKEIDVIFCCYESEDAQLYKRQLDNKGD 177 Query: 177 E 177 + Sbjct: 178 Q 178 >UniRef50_C9XM94 Putative uncharacterized protein n=6 Tax=Clostridium RepID=C9XM94_CLODC Length = 286 Score = 179 bits (454), Expect = 4e-44, Method: Composition-based stats. 
Identities = 68/178 (38%), Positives = 95/178 (53%), Gaps = 10/178 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGD 59 I + +G+IT L D IVNAAN L+G VD IH AGP L + C K+ ++QG Sbjct: 109 IAIWRGNITNLRADAIVNAANNKLLGCLQPLHLCVDNEIHSCAGPRLREDCDKIIKKQGH 168 Query: 60 CP-TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ--LLQDAYLNSLRLVAA-NSYTS 115 TG A IT LPAK VVHTVGP+ GG+ +++Q L Y + L + + + Sbjct: 169 LEYTGDAKITRGYCLPAKFVVHTVGPIVSGGQPSKEQEKQLLHCYKSCLNTIKEIDEIKN 228 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLLT 172 + F ISTGV+GYP+ AA +AV V ++ + +V F + EE Y R+ Sbjct: 229 IVFCGISTGVFGYPKKEAANLAVSRVRLWLKENPEKNLKVVFNVFTEEEEEKYRRIFK 286 >UniRef50_A8M6L5 Appr-1-p processing domain protein n=2 Tax=Micromonosporaceae RepID=A8M6L5_SALAI Length = 170 Score = 178 bits (453), Expect = 6e-44, Method: Composition-based stats. Identities = 79/163 (48%), Positives = 99/163 (60%), Gaps = 9/163 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I VV GDIT+ VD IV AAN SL+GGGGVDGA+HRAAGP L A + G C G Sbjct: 4 IEVVLGDITQQNVDAIVTAANESLLGGGGVDGAVHRAAGPRLAQAGGAI----GPCAPGD 59 Query: 65 AVITLAGDL--PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A+ T A DL P + ++HTVGPVWRGG E ++L Y SLR+ +VAFP I+ Sbjct: 60 AMPTPAFDLDPPVRHIIHTVGPVWRGGGHGEARVLASCYRRSLRIADDLDALTVAFPTIA 119 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAH 165 TGVYG+P AA IAV T+ + +QV V +DE++ Sbjct: 120 TGVYGFPADQAARIAVATIR---STPTNVQQVRLVAFDEDSRQ 159 >UniRef50_C4DDL7 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DDL7_9ACTO Length = 224 Score = 178 bits (452), Expect = 6e-44, Method: Composition-based stats. 
Identities = 70/169 (41%), Positives = 97/169 (57%), Gaps = 6/169 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 RI +V+GDIT VD ++NAAN SLMGGGGVDGAIHR GP +LD C K+R P Sbjct: 2 RIELVKGDITTQDVDALINAANSSLMGGGGVDGAIHRKGGPTILDECRKLRDSHYPKGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G A+ T AG+LPA+ ++HTVGPV+ + + + L+ Y NSL + T++A P I Sbjct: 62 EGQAIATTAGNLPAQWIIHTVGPVYSRHD-DRTETLRACYRNSLTIADTLGATTLAVPLI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 S+G+YG+P+ A AV + T + + E + RL Sbjct: 121 SSGIYGWPKDDAIRQAVDVLQ---TTPTSVTLARIMLFSSEEVSVATRL 166 >UniRef50_UPI000050FFC7 predicted phosphatase, C-terminal domain of histone macro H2A1 like protein n=1 Tax=Brevibacterium linens BL2 RepID=UPI000050FFC7 Length = 177 Score = 178 bits (452), Expect = 7e-44, Method: Composition-based stats. Identities = 76/172 (44%), Positives = 104/172 (60%), Gaps = 5/172 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCP 61 +I V++GDIT+ +VD IVNAAN SL+GGGGVDGAIH+AAGP LL+AC ++RQ P Sbjct: 2 KITVLEGDITEASVDAIVNAANSSLLGGGGVDGAIHKAAGPELLEACREIRQTSHPRGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T AG L A V+HTVGP GE + ++L+ + SL + A TSVAFPAI Sbjct: 62 AGQAVATSAGALKATWVIHTVGPNRTQGEA-DPEVLESCFEASLNVAAELGATSVAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLL 171 GVYG+ AE A + + R + +V FV + + ++ ++ Sbjct: 121 GGGVYGWSARDVAEAAHSVIVDGRERGHWEQVAEVVFVLFSDSMTSVFCQVF 172 >UniRef50_UPI000194CBCB PREDICTED: poly (ADP-ribose) polymerase family, member 14 n=1 Tax=Taeniopygia guttata RepID=UPI000194CBCB Length = 1883 Score = 178 bits (451), Expect = 1e-43, Method: Composition-based stats. 
Identities = 57/180 (31%), Positives = 94/180 (52%), Gaps = 4/180 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +T I + D+ VDV+VNA+N L GG+ A+ RAAGP L + C ++ ++ G+ Sbjct: 875 ETVIALYNADLCTHPVDVVVNASNEKLKHIGGLADALSRAAGPVLQEECDELVRKLGNLQ 934 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AVIT AG LP K V+H VGP W LL+ L+L A+ + S+A PA Sbjct: 935 PGCAVITHAGKLPCKNVIHAVGPRWSAENSVMCVWLLRKTVKKCLQLAEAHKHCSIALPA 994 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQGDE 177 IS G++G+P + ++ E + ++ ++V+ V + ++N + + + E Sbjct: 995 ISGGIFGFPMELCTYSIISSIKETLEESKGNSTLKEVHLVGFAQDNIQAFSKAFKEVFSE 1054 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 50/172 (29%), Positives = 77/172 (44%), Gaps = 16/172 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 V +GDITK D IVN N + GV AI AG A+ D C + QQ G + Sbjct: 1307 FRVAEGDITKEEGDAIVNITNQAFNLKTGVSRAILNGAGKAVEDECGVLAQQTGK----N 1362 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +IT AG+LP K ++H V ++ L+ YTSVAFPAI TG Sbjct: 1363 YIITQAGNLPCKKIMHFV----------YQNDIRSLVSQVLQECELQQYTSVAFPAIGTG 1412 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQ 174 A A+ + V++F R++ + + V + +++ + ++ Sbjct: 1413 EARRNAAEVADNMIDAVTDFAKRNSATSVKTIKVVIFQPHLMSVFQASMQKR 1464 Score = 100 bits (249), Expect = 3e-20, Method: Composition-based stats. 
Identities = 45/177 (25%), Positives = 77/177 (43%), Gaps = 9/177 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT- 62 I + G I A ++V + L + G + A+ AGP L K + G P Sbjct: 1094 IVLQTGSIEDAATSIVVVSVGKDLQLDKGPLGKALLSKAGPMLQTGLNK--EGGGRMPEE 1151 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G + T +L V+H V P+W ++L D L + S S+ FPAI Sbjct: 1152 GSVLKTKGYNLACSVVLHAVVPMWSQKNTPS-KVLGDIITKCLEIAEELSLKSITFPAIG 1210 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHL--YERLLTQQG 175 TG +PR+ A++ V EF + E+V+F+ + ++ A++ + L ++ Sbjct: 1211 TGNLEFPRSVVAKLLFDKVFEFSSEKRVNSLEEVHFLLHTKDTANIQEFSDELEKRS 1267 >UniRef50_C4V152 Appr-1-p processing protein n=2 Tax=Clostridiales RepID=C4V152_9FIRM Length = 346 Score = 177 bits (449), Expect = 2e-43, Method: Composition-based stats. Identities = 74/167 (44%), Positives = 102/167 (61%), Gaps = 6/167 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +V+ DITK+ VD IVN+ANP + GGGVD AIH+AAG LL A R++ G+ G Sbjct: 3 FAIVRNDITKMQVDAIVNSANPRAIVGGGVDRAIHQAAGAELLTA----RRKIGNIAAGT 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T A L A+ V+HTVGPVW+ G E +LL AY NSLRL A +S+AFP +S G Sbjct: 59 AAVTPAYRLHARYVIHTVGPVWQDGSHGERELLSRAYQNSLRLAAERDCSSIAFPLLSAG 118 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 V+G P A AV+ + +F+ H + VY V +D ++ + + L Sbjct: 119 VFGCPSEIAIAAAVQAIRDFLQEHDM--DVYLVVFDRKSFKISDTLF 163 >UniRef50_D1VVA5 Putative uncharacterized protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVA5_9FIRM Length = 163 Score = 177 bits (449), Expect = 2e-43, Method: Composition-based stats. 
Identities = 68/170 (40%), Positives = 98/170 (57%), Gaps = 9/170 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ ++ K+ VD IVNAAN L+ GGGV GAI + A L+ K + G Sbjct: 3 LNIKLE----NLVKMDVDAIVNAANKELLPGGGVCGAIFQVAKSKSLEMDCK---KLGPI 55 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG AVIT A +LP+K ++H VGP++R G E++LL++AYLNSL+L +S S+AFP Sbjct: 56 KTGQAVITSAYNLPSKYIIHAVGPIYRDGLSGEEELLRNAYLNSLKLAKKHSIKSIAFPL 115 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 IS G+Y YP A +IAV T+ EF+ + V D + +L Sbjct: 116 ISAGIYAYPLKEACKIAVDTIREFLKNEDM--DVTIAVLDPNIYDILTKL 163 >UniRef50_Q30ZH6 Appr-1-p processing n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30ZH6_DESDG Length = 183 Score = 176 bits (447), Expect = 3e-43, Method: Composition-based stats. Identities = 73/166 (43%), Positives = 104/166 (62%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++QGD+T D +VNAAN L GGGGVDGA+H AAGPALL C + + G P G Sbjct: 10 LEILQGDLTLFKADAVVNAANSRLAGGGGVDGALHAAAGPALLADCSRWVARHGLLPAGK 69 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A++T A LPA+ V+HTVGPVWRGG+ NE+ L+ AY + L +N + VAFPAIS G Sbjct: 70 AMVTPAHRLPARHVIHTVGPVWRGGKNNEETTLRQAYESCFTLCRSNGFAHVAFPAISCG 129 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 YGYP + AA +A+ ++ + P ++ FV + + ++ + Sbjct: 130 TYGYPASPAARVALACAAQALACQGAPAKITFVLHTAQMYTIWLKA 175 >UniRef50_B1KG04 Appr-1-p processing domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KG04_SHEWM Length = 296 Score = 176 bits (447), Expect = 3e-43, Method: Composition-based stats. 
Identities = 65/181 (35%), Positives = 97/181 (53%), Gaps = 10/181 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 ++I + GDIT+L +D + NAAN ++G +D AI+ AAGP L + C ++ Q Q Sbjct: 110 SKISIWNGDITRLKIDAVTNAANAQMLGCFQPFHSCIDNAINCAAGPQLREDCNQLMQLQ 169 Query: 58 G-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGG---EQNEDQLLQDAYLNSLRLVAANSY 113 G D TG A IT A +LP+K V+HTVGP+ + G + L Y L L A Sbjct: 170 GSDETTGSAKITRAYNLPSKFVLHTVGPIIQHGAVPSPRQIDELASCYDACLSLAAEAGA 229 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSE-FITRHALPEQVYFVCYDEENAHLYERLLT 172 SVA ISTGV+GYP AA +A++ V+ F+ + + F + + +Y R + Sbjct: 230 QSVAVCGISTGVFGYPAEKAANVALQAVANWFLVNPDKLDHLVFNTFGDNATEIYHRAIG 289 Query: 173 Q 173 + Sbjct: 290 E 290 >UniRef50_UPI0000ECB76F Poly [ADP-ribose] polymerase 14 (EC 2.4.2.30) (PARP-14) (B aggressive lymphoma protein 2). n=2 Tax=Gallus gallus RepID=UPI0000ECB76F Length = 1636 Score = 176 bits (446), Expect = 4e-43, Method: Composition-based stats. Identities = 57/177 (32%), Positives = 92/177 (51%), Gaps = 4/177 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V + D+ VDV+VNA+N L GG+ A+ +AAGP L C V + G G Sbjct: 637 IAVYKADLCTHHVDVVVNASNEDLKHIGGLAWALLQAAGPELQAECDGVVRMSGSLQAGD 696 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT AG LP K V+H VGP W+ + + LL+ SL+L ++ S+AFP++S Sbjct: 697 AVITGAGKLPCKQVIHAVGPRWKEQDAEKCVYLLKKTIKKSLQLAETYNHRSIAFPSVSG 756 Query: 124 GVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 G++G+P V + + + R + ++++ V E+N + + L + + Sbjct: 757 GIFGFPLHKCVNAIVSAIKKTLEEFKRDSSLKEIHLVDITEDNVQAFIKALKEVFSD 813 Score = 116 bits (290), Expect = 5e-25, Method: Composition-based stats. 
Identities = 42/176 (23%), Positives = 73/176 (41%), Gaps = 8/176 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT- 62 I + +G+I + D +V + L + G + A+ AGP L + G P Sbjct: 848 IMLKKGNIEDASTDGVVISVGGDLQLEKGQLAKALLSKAGPRLQSDLND--EGLGKSPVE 905 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G T +L V H V P W G ++ ++L L+ S S+ FPAI Sbjct: 906 GSVFTTRGYNLSCCYVFHAVTPGWSQGSESAVKILGKIVTKCLQTAEELSLKSITFPAIG 965 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEE--NAHLYERLLTQQ 174 TG+ G+P + A+ V EF + +V+F+ + ++ N + ++ Sbjct: 966 TGILGFPSSVVAKSLFDKVYEFSSKKKTNSLREVHFLLHPKDVNNIQAFSNEFERR 1021 Score = 103 bits (256), Expect = 4e-21, Method: Composition-based stats. Identities = 45/176 (25%), Positives = 75/176 (42%), Gaps = 16/176 (9%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V GDITK DVIVN +N + GV AI AG + + C ++ Q P Sbjct: 1059 TFQVAAGDITKETGDVIVNISNQAFNLKTGVSKAILEGAGKEVENECAELALQ----PND 1114 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 + T AG LP K ++H V ++ L+ YTSV FPAI T Sbjct: 1115 GYITTEAGSLPCKKIIHFV----------ARDDIKVPVSKVLQECELQQYTSVTFPAIGT 1164 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLYERLLTQQGDE 177 G G A+ + +++F ++ P + + V + +++ + ++ ++ Sbjct: 1165 GQAGRFPDLVADEMMDAITDFARSNSTPSVKTIKIVIFQPHLLNVFHTSMKKREND 1220 >UniRef50_C5CIT5 Appr-1-p processing domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIT5_KOSOT Length = 187 Score = 176 bits (446), Expect = 4e-43, Method: Composition-based stats. 
Identities = 52/171 (30%), Positives = 89/171 (52%), Gaps = 4/171 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +VQGDITK VD IVNAAN L GGGV GAI RA G + + ++ ++ G Sbjct: 13 IQIVQGDITKEEVDAIVNAANGYLRHGGGVAGAILRAGGKIIQEESDRIIRKNGPLEVSE 72 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T AG L K ++H GP + G++N ++LL +++LN+ + +++ PA+S+G Sbjct: 73 VAVTGAGSLHPKYIIHVHGPRY--GQENVEELLYESFLNAFKTAGKLGVKTLSVPAVSSG 130 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQ 173 ++G P+ A + V + + + D ++E++ + Sbjct: 131 IFGVPKDLCARCFFRAVEYYFENYKDTPLSLIRVCNIDRATTEVFEKVSEE 181 >UniRef50_B0EF86 MACRO domain-containing protein, putative n=2 Tax=Entamoeba RepID=B0EF86_ENTDI Length = 316 Score = 175 bits (445), Expect = 5e-43, Method: Composition-based stats. Identities = 66/172 (38%), Positives = 94/172 (54%), Gaps = 9/172 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +I VV GDITK+ DV+VNAAN L GG GVDGAIH AAG L D +R C Sbjct: 47 MNKKIIVVTGDITKIQADVVVNAANSYLRGGAGVDGAIHSAAGYELYDY---LRSHYKHC 103 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG + +P K ++H VGP+ Q LQ Y+ L V Y S+AFP Sbjct: 104 DTGDFKPSPGFKMPCKEILHGVGPIGENAIQ-----LQRVYVRCLEYVRLKEYKSIAFPC 158 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLL 171 ISTG++GY A + ++ V +++ + L + ++ F CY+ + ++Y + L Sbjct: 159 ISTGIFGYSNEKACPVVLEVVRDWLEVNPLWDGKIIFCCYNLTDLNIYSKFL 210 >UniRef50_A7T167 Protein GDAP2 homolog n=1 Tax=Nematostella vectensis RepID=GDAP2_NEMVE Length = 502 Score = 175 bits (444), Expect = 5e-43, Method: Composition-based stats. 
Identities = 62/173 (35%), Positives = 94/173 (54%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDITKLA D IVN N SL G + +HRAAGP L+ C RQQ C Sbjct: 49 INAKVVLWNGDITKLAADAIVNTTNESLSDRGALSERVHRAAGPELMQEC---RQQLLGC 105 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A I+ +LPA+ V+HTVGP + + + L Y N++RLV N +++ Sbjct: 106 RTGEAKISEGYNLPARYVIHTVGPRYNTKYKTAAESALFSCYRNTMRLVRENKISTIGVC 165 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLL 171 ++T GYP A IA++TV F+ ++ + + V FV E +Y +++ Sbjct: 166 VVNTTKRGYPPEDGAHIALRTVRRFLEKYGSAVDTVAFVVEGAEAV-VYAKVM 217 >UniRef50_Q22CT8 Appr-1-p processing enzyme family protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22CT8_TETTH Length = 472 Score = 175 bits (443), Expect = 8e-43, Method: Composition-based stats. Identities = 54/164 (32%), Positives = 82/164 (50%), Gaps = 2/164 (1%) Query: 15 LAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLP 74 VD IVNAAN L GGGV GAI R G + + + + + G +V T AG LP Sbjct: 2 ENVDAIVNAANNFLAHGGGVAGAICRKGGRIIQNQSYDIIKIRNRIENGESVTTEAGQLP 61 Query: 75 AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAA 134 K V+HTVGP+W G+ NE + L LR S++ PAIS+G++G+P+ A Sbjct: 62 CKKVIHTVGPIWEDGDSNEKEELAKCMETILREAKFYKLKSISIPAISSGIFGFPKYLCA 121 Query: 135 EIAVKTVSEFIT--RHALPEQVYFVCYDEENAHLYERLLTQQGD 176 +I ++ + + E++ F +D E ++ +Q Sbjct: 122 KILLEETQKLLKYDYSNQFEEIRFCNFDNETVQVFAEEFQKQFQ 165 >UniRef50_A3LYE6 Putative uncharacterized protein n=1 Tax=Pichia stipitis RepID=A3LYE6_PICST Length = 583 Score = 174 bits (441), Expect = 1e-42, Method: Composition-based stats. 
Identities = 62/193 (32%), Positives = 97/193 (50%), Gaps = 17/193 (8%) Query: 1 MKTRIHVVQGDITKL-AVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR 54 + ++ + +GDIT + V IVNAAN +L+G +D IH AAGP L AC + Sbjct: 91 LSPKLSIWKGDITTISDVTAIVNAANSALLGCFQPSHRCIDNIIHAAAGPDLRRACYNLV 150 Query: 55 QQQG--DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ---LLQDAYLNSLRLVA 109 +Q+ P G A IT +LPAK V+HTVGP G + + L Y +SL + Sbjct: 151 EQRDFTQEPVGSAQITPGFNLPAKMVIHTVGPSLLPGSEPNQEEISQLAACYTSSLAKLE 210 Query: 110 AN----SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEEN 163 + S+ F ISTG++ +P A+ IA+++V + + H+ +V F + E N Sbjct: 211 EQEEDGNDKSIVFCCISTGLFSFPNDIASNIAIESVRNYFSEHPHSSISEVIFNVFTETN 270 Query: 164 AHLYERLLTQQGD 176 LY + + + Sbjct: 271 LKLYRQNFAEYQE 283 >UniRef50_A8FQZ3 Putative uncharacterized protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FQZ3_SHESH Length = 268 Score = 173 bits (440), Expect = 2e-42, Method: Composition-based stats. Identities = 71/181 (39%), Positives = 99/181 (54%), Gaps = 10/181 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 K + + QGDIT+LA D IVNAAN L G +D AIH A+G L D C + + Sbjct: 88 KADVKLWQGDITRLAADAIVNAANKELQGCFQPLHSCIDNAIHSASGVRLRDDCAVIIKA 147 Query: 57 QGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAA-NSY 113 QG T A IT +LP + V+HTVGP+ +G E +LLQ Y N L L Sbjct: 148 QGQFEETAKAKITSGYNLPCQYVLHTVGPIVQGNVTGEHQKLLQLCYENCLALADQTLGI 207 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLL 171 S+AF ISTGV+GYP+ AA+ AV+ V +++ ++ + V F + E+ LY++ L Sbjct: 208 NSIAFCCISTGVFGYPQKPAAQAAVRAVQQWLLNNPNSNIDTVIFNTFKPEDTRLYQQFL 267 Query: 172 T 172 Sbjct: 268 Q 268 >UniRef50_A2FMC7 Appr-1-p processing enzyme family protein n=1 Tax=Trichomonas vaginalis RepID=A2FMC7_TRIVA Length = 361 Score = 173 bits (439), Expect = 2e-42, Method: Composition-based stats. 
Identities = 64/174 (36%), Positives = 96/174 (55%), Gaps = 13/174 (7%) Query: 1 MKTRIHVV-QGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + +I +G+ KL D +VNAAN L GGG+ G +H AAG A+ C ++ G Sbjct: 114 INEKISFWMRGNSVKLECDAVVNAANSHLYPGGGICGVLHSAAGEAMERECSEI----GY 169 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 PTG +TL +LPAK +HTVGP+ GEQ + LQ+AY ++L + SV Sbjct: 170 TPTGKCAVTLGYNLPAKYCIHTVGPI---GEQPDK--LQEAYESTLSCIDGKKIRSVGLC 224 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERL 170 ISTG+YGYP A IA+K V +F+ +++ FV ++ + +Y+R+ Sbjct: 225 CISTGIYGYPIENATPIALKVVRKFLEDPNNREKTDRIIFVVFERRDVVVYDRM 278 >UniRef50_C3YH95 Putative uncharacterized protein n=2 Tax=Eumetazoa RepID=C3YH95_BRAFL Length = 437 Score = 173 bits (439), Expect = 2e-42, Method: Composition-based stats. Identities = 50/172 (29%), Positives = 85/172 (49%), Gaps = 6/172 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 ++ + +GDIT L IVN N +L + I +AAGP L C + C Sbjct: 45 NRKVVLWEGDITTLNCTAIVNTTNETLTDRNLISERIFQAAGPDLRAECSNHLKT---CR 101 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPA 120 TG A +T +LPA+ ++HTVGP + + + L + Y NSL++ N+ S+ Sbjct: 102 TGEAKMTKGYNLPARYIIHTVGPRYNVKYRTAAESALFNCYRNSLQIARENNLQSIGLCV 161 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLL 171 ++ GYP A IA++TV F+ ++ + E + F + + +Y R++ Sbjct: 162 VNQPKRGYPPDEGAHIALRTVRRFLEKYDSSLETIVFAV-TDNDEDIYRRVM 212 >UniRef50_Q460N5 Poly [ADP-ribose] polymerase 14 n=19 Tax=Eutheria RepID=PAR14_HUMAN Length = 1720 Score = 173 bits (439), Expect = 2e-42, Method: Composition-based stats. 
Identities = 62/172 (36%), Positives = 90/172 (52%), Gaps = 4/172 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V QGD+ +L VDV+VNA+N L GG+ A+ +AAGP L C ++ +++G G+ Sbjct: 723 LIVQQGDLARLPVDVVVNASNEDLKHYGGLAAALSKAAGPELQADCDQIVKREGRLLPGN 782 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A I+ AG LP V+H VGP W G E LL+ A SL L Y S+A PAIS+ Sbjct: 783 ATISKAGKLPYHHVIHAVGPRWSGYEAPRCVYLLRRAVQLSLCLAEKYKYRSIAIPAISS 842 Query: 124 GVYGYPRAAAAEIAVKTVSE---FITRHALPEQVYFVCYDEENAHLYERLLT 172 GV+G+P E V + E F +++Y V E+ + + Sbjct: 843 GVFGFPLGRCVETIVSAIKENFQFKKDGHCLKEIYLVDVSEKTVEAFAEAVK 894 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 43/166 (25%), Positives = 81/166 (48%), Gaps = 6/166 (3%) Query: 16 AVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLP 74 DV+VN+ L+ G + ++ AGP L + V Q G + T + +L Sbjct: 946 KTDVVVNSVPLDLVLSRGPLSKSLLEKAGPELQEELDTVGQGVA-VSMGTVLKTSSWNLD 1004 Query: 75 AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAA 134 + V+H V P WR G + ++++D + + + S S+AFPAI TG G+P+ A Sbjct: 1005 CRYVLHVVAPEWRNGSTSSLKIMEDIIRECMEITESLSLKSIAFPAIGTGNLGFPKNIFA 1064 Query: 135 EIAVKTVSEFITRH--ALPEQVYFVCY--DEENAHLYERLLTQQGD 176 E+ + V +F +++ ++V+F+ + D EN + ++ + Sbjct: 1065 ELIISEVFKFSSKNQLKTLQEVHFLLHPSDHENIQAFSDEFARRAN 1110 Score = 113 bits (282), Expect = 4e-24, Method: Composition-based stats. 
Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 16/172 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 V GDITK DVIVN+ + S GV AI AG + C + QQ+ + Sbjct: 1148 FQVASGDITKEEADVIVNSTSNSFNLKAGVSKAILECAGQNVERECSQQAQQRKN----D 1203 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +IT G L K ++H +G ++ + + L+ +Y+S+ PAI TG Sbjct: 1204 YIITGGGFLRCKNIIHVIG----------GNDVKSSVSSVLQECEKKNYSSICLPAIGTG 1253 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQ 174 AE + + +F+ + + ++V V + + ++ + ++ Sbjct: 1254 NAKQHPDKVAEAIIDAIEDFVQKGSAQSVKKVKVVIFLPQVLDVFYANMKKR 1305 >UniRef50_A9SRF5 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9SRF5_PHYPA Length = 207 Score = 173 bits (438), Expect = 3e-42, Method: Composition-based stats. Identities = 72/168 (42%), Positives = 97/168 (57%), Gaps = 9/168 (5%) Query: 5 IHVVQGDITKL----AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-D 59 + + +GDITK D IVNAAN ++GGGGVDGAIH AAG LL+A K+ +G Sbjct: 30 LVLQRGDITKWHIDGKTDAIVNAANERMVGGGGVDGAIHAAAGKQLLEATKKIPISEGVR 89 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 CP G AV+T LP ++HTVGP++ E N LL A+ S+RL N +AFP Sbjct: 90 CPVGSAVLTPGFKLPVSKIIHTVGPIYYI-EGNPASLLAKAHKESVRLATENGLKYIAFP 148 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY 167 AIS GVYGYP AAEI+++++ E +V+FV + + Sbjct: 149 AISCGVYGYPIEEAAEISIQSLRE---SAGELLEVHFVHFQAATYRAW 193 >UniRef50_C3Y5X0 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y5X0_BRAFL Length = 970 Score = 173 bits (438), Expect = 3e-42, Method: Composition-based stats. 
Identities = 60/177 (33%), Positives = 88/177 (49%), Gaps = 6/177 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I V +GDIT+ VDVI NAAN L G GV GAI RA GP++ + G Sbjct: 571 QIVVARGDITQQPVDVIANAANEYLSHGSGVAGAISRAGGPSVQQESSYHVKTFGRVRVT 630 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAA-NSYTSVAFPAI 121 V+T G LP K ++H VGP W G +NE ++ L+ N L +A SVA PAI Sbjct: 631 ETVVTRGGQLPCKHIIHAVGPRWERGHENENERQLRQTCYNILTAASATLRARSVAIPAI 690 Query: 122 STGVYGYPRAAAAEIAVKTVSEFIT----RHALPEQVYFVCYDEENAHLYERLLTQQ 174 S+G++G P+ AE V + F+ ++ F+ D+ ++ ++ Sbjct: 691 SSGIFGMPKQKCAESLVSGLERFLQTAKVSSCTLRRIIFIDMDQATVNILADTFGKK 747 >UniRef50_A8STD9 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8STD9_9FIRM Length = 348 Score = 172 bits (436), Expect = 5e-42, Method: Composition-based stats. Identities = 60/167 (35%), Positives = 93/167 (55%), Gaps = 6/167 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ-GDCPTG 63 + +V+ DI K+ D IVN AN ++ G G DGA++RAAG D L R++ G G Sbjct: 3 LRIVRNDIVKMTTDAIVNTANDHVVVGTGCDGAVYRAAG---YDELLNYRREYIGFVEEG 59 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT L A+ ++H V P + G+ E+ L+ Y SL+L N S+AFP IST Sbjct: 60 GAFITPGFGLNARYIIHAVSPRFIDGDHGEEGKLRSCYRKSLQLAKENGVRSIAFPLIST 119 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 G +GYP+ IAV ++ F+ + + ++ V +DE++ L E++ Sbjct: 120 GGFGYPKEEGLRIAVDEINAFLFENEV--DIFLVVFDEKSTRLGEKI 164 >UniRef50_C2DZH9 Appr-1-p processing protein n=4 Tax=Lactobacillus jensenii RepID=C2DZH9_9LACO Length = 218 Score = 171 bits (435), Expect = 6e-42, Method: Composition-based stats. 
Identities = 79/173 (45%), Positives = 101/173 (58%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + I VV+ + D IVNAAN +L+GGGGVDGAIH+AAGP LL+AC K+ C Sbjct: 48 LSKNIFVVKASVVNFPADAIVNAANKTLLGGGGVDGAIHQAAGPNLLEACKKLN----GC 103 Query: 61 PTGHAVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT + DL K ++HTVGPV++ QN Q LQ Y SL L SVAF Sbjct: 104 DTGEAKITPSFDLKTCKYIIHTVGPVFKLS-QNPQQQLQSCYKKSLDLALEYKCNSVAFS 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTGVY YP AA +A + V+E++ RH +V+ CY E Y +L+ Sbjct: 163 GISTGVYEYPVKQAASVASEAVAEWLKRHNFAIKVFLCCYKESEFEAYAQLVR 215 >UniRef50_C0W547 Appr-1-p processing domain protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W547_9ACTO Length = 285 Score = 171 bits (435), Expect = 6e-42, Method: Composition-based stats. Identities = 56/181 (30%), Positives = 89/181 (49%), Gaps = 9/181 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR- 54 + T++ + +GD+T L +VNAAN +++G +D +H AAGP L C + Sbjct: 103 LGTQVALWRGDLTTLRAGGVVNAANSAMLGCFVPGHRCIDNVLHAAAGPGLRAECARYMD 162 Query: 55 -QQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANS 112 ++ TG A++T LPA V+HTVGP+ G Q LL Y + L Sbjct: 163 SREGRPEETGRALVTGGYHLPAAHVIHTVGPIVTHGVTQEHRDLLASCYRSVLDAAEGAG 222 Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 SV ++STGV+GYP+ AA + + T+ ++ RH ++ + E + YE L Sbjct: 223 LDSVGLCSVSTGVFGYPKQEAAPLVLDTIGRWLDRHPDSTLRIVICAFAEVDVRAYEAAL 282 Query: 172 T 172 Sbjct: 283 A 283 >UniRef50_Q8ZXT3 UPF0189 protein PAE1111 n=10 Tax=Thermoprotei RepID=Y1111_PYRAE Length = 182 Score = 171 bits (434), Expect = 9e-42, Method: Composition-based stats. 
Identities = 58/171 (33%), Positives = 83/171 (48%), Gaps = 3/171 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + +++GDIT++ D IVNAAN L GGGV GAI R G + + + ++ G P G Sbjct: 9 EVVLMRGDITEVEADAIVNAANSYLEHGGGVAGAIVRKGGQVIQEESREWVRKHGPVPVG 68 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T AG L AK V+H VGP + L +A N+L S+A PAIST Sbjct: 69 DVAVTSAGRLKAKYVIHAVGPRCGV---EPIEKLAEAVKNALLKAEELGLVSIALPAIST 125 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 G++G P AAAE + E ++ V Y EE + + + Sbjct: 126 GIFGCPYDAAAEQMATAIREVAPALRSIRRILVVLYGEEAYQKFLEVFKKH 176 >UniRef50_UPI000196AD9C hypothetical protein CATMIT_00588 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AD9C Length = 334 Score = 170 bits (432), Expect = 1e-41, Method: Composition-based stats. Identities = 63/169 (37%), Positives = 94/169 (55%), Gaps = 5/169 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +V+ DITK+ D+IVN ANP G D AI+ AAG +A L R+ G G Sbjct: 3 FKIVRNDITKVEADIIVNTANPQPKCVSGTDLAIYEAAGK---EALLAERKTIGPIERGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T A +L AK ++HTVGPVW G +E ++L+ Y L+ S+AFP ISTG Sbjct: 60 IAVTGAYNLNAKYIIHTVGPVWIDGNHHELEILERCYRLPLQKAIELGCQSIAFPLISTG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 VY +P+ A IAV S+F+T H + ++ V +D+ + L +++ + Sbjct: 120 VYEFPKNKALHIAVSVFSQFLTEHEI--EIILVVFDKTSFQLSSQIVGE 166 >UniRef50_UPI0000E80997 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=3 Tax=Gallus gallus RepID=UPI0000E80997 Length = 1655 Score = 170 bits (431), Expect = 2e-41, Method: Composition-based stats. 
Identities = 56/177 (31%), Positives = 92/177 (51%), Gaps = 4/177 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + V +G++ VDV+VNAA+ L G A+ +AAGP L C +V + G Sbjct: 642 TELLVYKGNLCNYPVDVVVNAASEDLRHTDGFAWALLQAAGPELQAECDEVVRMTGSLQA 701 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT AG LP K V+H +GP W+ + LL +A SL+L ++ S+AFP++ Sbjct: 702 GDAVITGAGKLPCKQVIHAIGPQWKEKNSGKCMYLLMEAIKKSLQLAETYNHRSIAFPSV 761 Query: 122 STGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLLTQQG 175 S G++G+P V + + + R + ++++ V DEE + + ++ Sbjct: 762 SGGIFGFPPHKCVNAIVSAIKKTLEEFKRDSSLKEIHLVAVDEETVRVLRETVQKEF 818 Score = 133 bits (334), Expect = 4e-30, Method: Composition-based stats. Identities = 50/174 (28%), Positives = 72/174 (41%), Gaps = 6/174 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-GGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI V + DI DVIVN+ L G G + A+ + AGP L K + QQ Sbjct: 866 RIQVEKKDIIDATTDVIVNSVGTDLKFGVGPLCRALLKEAGPELQMEFDKEKGQQVA-GN 924 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G V T L V H V P W G + L++ L S+AFPAI Sbjct: 925 GSVVCTKGYILDCTFVFHAVLPQWDRGSGQALKTLENTVHKCLMKAEEFGLKSIAFPAIG 984 Query: 123 TGVYGYPRAAAAEIAVKTVSEF--ITRHALPEQVYFVCY--DEENAHLYERLLT 172 TG + +P +++ V +F ++V+FV + D +N + L Sbjct: 985 TGGFSFPHTVVSKLMFDEVFKFSRCQSRKTLQEVHFVLHPNDRQNIQAFTSELK 1038 Score = 99.2 bits (246), Expect = 5e-20, Method: Composition-based stats. 
Identities = 43/177 (24%), Positives = 76/177 (42%), Gaps = 16/177 (9%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + V GDITK +VIVN AN + GV AI AAG + + C Q G Sbjct: 1072 SVTLKVTSGDITKEDTEVIVNIANQTFDATSGVFKAIMDAAGFDVKEEC----NQYGGLL 1127 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 + T G L + ++H + + +++ L +Y SVAFPAI Sbjct: 1128 QSGFITTKGGALLCRRIIHLIHSM----------NVKNQVSEVLHECQLRTYKSVAFPAI 1177 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLYERLLTQQGD 176 TG A A+ + + EF++ ++P +++ + + + + + + ++ D Sbjct: 1178 GTGAAQQSPAKVADDMLDAIVEFVSSRSVPHLKEIRIIIFQKHMLRDFLQSMKKRED 1234 >UniRef50_A1D5K4 Appr-1-p processing enzyme family protein n=1 Tax=Neosartorya fischeri NRRL 181 RepID=A1D5K4_NEOFI Length = 257 Score = 170 bits (430), Expect = 2e-41, Method: Composition-based stats. Identities = 69/171 (40%), Positives = 94/171 (54%), Gaps = 5/171 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T + ++ DI +L VD IVNAA SL GGGGVD A+H AAGP L AC+K + Q C Sbjct: 88 LNTLVSFIEHDIARLQVDCIVNAAKESLQGGGGVDRAMHLAAGPKLNQACIK-KLQDRQC 146 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G +T L K+V+HTVGP R +Q + Q+L+ Y NSL + S+ FP Sbjct: 147 SPGRVFMTPGFHLRCKSVIHTVGPDCRQKQQIDYAQVLRQCYRNSLNKAVSKGLRSIVFP 206 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH---ALPEQVYFVCYDEENAHLY 167 AIS GVY P A +EIA+ TV F+ H + +++ F +Y Sbjct: 207 AISVGVYACPAEATSEIALNTVRGFLDEHGRPSSLDRIGFCNLGPNIHAIY 257 >UniRef50_UPI0000E4D641 UPI0000E4D641 related cluster n=2 Tax=Danio rerio RepID=UPI0000E4D641 Length = 692 Score = 170 bits (430), Expect = 3e-41, Method: Composition-based stats. 
Identities = 58/176 (32%), Positives = 91/176 (51%), Gaps = 4/176 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V + DI L+VD +VNAAN L GGGV A+ +AAG L + C + G G Sbjct: 2 TVTVRKADICTLSVDAVVNAANEDLQHGGGVAYALLQAAGRCLQEYCDLHIKVNGPLTPG 61 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE--DQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A+IT AG LP K VVH VGP +R +++ Q L+ A SL ++ +S+A P I Sbjct: 62 DAIITDAGRLPCKYVVHAVGPRFRASDRHTAVQQCLRRAVRESLNQASSKKCSSIAIPVI 121 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLTQQG 175 S+G++G P E K V ++I + ++ V +++N + + + + Sbjct: 122 SSGIFGCPLDLCTESITKEVRQYIENWPSSTLTEIQLVDNNDKNVNAMAQAVRNEF 177 Score = 108 bits (269), Expect = 1e-22, Method: Composition-based stats. Identities = 38/155 (24%), Positives = 60/155 (38%), Gaps = 14/155 (9%) Query: 15 LAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDL 73 L DVIVN + + + G V A+ +AAG L + G VIT +L Sbjct: 212 LQADVIVNTISEDMDLRKGAVSNALLQAAGHQLQSEIKRASNH------GEIVITDGYNL 265 Query: 74 PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAA 133 V H + ++L N L+ +SV FPAI TG G+P+ Sbjct: 266 KCSRVFHVMIIYLFTL----QKVLNQIIRNCLKNAETQGLSSVVFPAIGTGNLGFPKDLV 321 Query: 134 AEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYE 168 A+ + V +F +V V + + ++ Sbjct: 322 AKNMLTEVQQF--NTTNLRKVTVVVH-PSDKEIFR 353 Score = 43.3 bits (101), Expect = 0.004, Method: Composition-based stats. Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 7/58 (12%) Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 ++T +G LP ++H G + +++ L+ L+L + +TS+AFPA+ T Sbjct: 406 IVTSSGRLPCGNIIHISG-------FDSLSTVKEVVLSVLKLCESRQFTSIAFPALGT 456 >UniRef50_Q2TX23 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 n=8 Tax=Fungi/Metazoa group RepID=Q2TX23_ASPOR Length = 615 Score = 169 bits (428), Expect = 4e-41, Method: Composition-based stats. 
Identities = 61/186 (32%), Positives = 90/186 (48%), Gaps = 17/186 (9%) Query: 5 IHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ- 57 I + +GDIT L V IVNAAN L+G +D IH AAGP L DAC + +Q Sbjct: 114 ISLWKGDITSLTDVTAIVNAANSQLLGCFRPDHRCIDNIIHSAAGPRLRDACNSLMLKQC 173 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRL-----VA 109 G +T +LPA+ V+HTVGP + + Q L Y + L Sbjct: 174 HPESVGSVKVTSGFNLPAQWVLHTVGPQVNSRKSPGTLQQQQLASCYSSCLDATESLPAL 233 Query: 110 ANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLY 167 + VAF ISTG++ +P AA+IA++TV ++ H + F + E + LY Sbjct: 234 PDGRKVVAFCCISTGLFAFPPDMAAKIALETVVQWCMNHPATSVTDIIFDTFLERDYELY 293 Query: 168 ERLLTQ 173 + +++ Sbjct: 294 QANISE 299 >UniRef50_A1WVH3 Appr-1-p processing domain protein n=14 Tax=Bacteria RepID=A1WVH3_HALHL Length = 181 Score = 169 bits (428), Expect = 5e-41, Method: Composition-based stats. Identities = 64/174 (36%), Positives = 89/174 (51%), Gaps = 6/174 (3%) Query: 4 RIHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + GDI D +VNAAN LM GGGV GA+HRAAGP L +AC + Sbjct: 9 TVETRVGDIAAQGDCDAVVNAANAQLMPGGGVAGALHRAAGPELAEAC----RPLAPIQP 64 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AVIT LP + V+H +GPV+ E E QLL Y N+L + T VA PA+S Sbjct: 65 GQAVITAGFGLPNRHVIHCLGPVYGVDEPGE-QLLAACYRNALHRAEEHELTRVAMPALS 123 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 TG +G+P AA +A+ T+ + V FV D +++ ++ + + Sbjct: 124 TGAFGFPMERAARVAIGTLQRTAAQLRYVRHVRFVLADAAAQQIHDHVIQELAE 177 >UniRef50_Q4DSL4 Putative uncharacterized protein n=4 Tax=Trypanosoma RepID=Q4DSL4_TRYCR Length = 297 Score = 168 bits (427), Expect = 5e-41, Method: Composition-based stats. 
Identities = 66/169 (39%), Positives = 88/169 (52%), Gaps = 10/169 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + G +T L +D IVNAAN + +GG GVDGAIH AAGP L+ C C TG Sbjct: 125 IALHNGPVTDLQLDAIVNAANKTCLGGKGVDGAIHAAAGPLLVRECATFN----GCDTGQ 180 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 IT +LPA+ V+HTVGP+ + L+ Y + L L N S+ F +STG Sbjct: 181 CRITKGYNLPARYVLHTVGPI-----GERPEALRSCYRSILSLAHRNRLRSIGFCCVSTG 235 Query: 125 VYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLT 172 VYGYP A IAV E++ +H + + F C+ E + Y L Sbjct: 236 VYGYPLIPATRIAVDETIEYLKQHFSAFDLCCFACFKLEEYNAYTDCLR 284 >UniRef50_B3RYC4 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RYC4_TRIAD Length = 491 Score = 168 bits (427), Expect = 5e-41, Method: Composition-based stats. Identities = 48/171 (28%), Positives = 85/171 (49%), Gaps = 4/171 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L VD IVN N +L ++ I AGP+L +R + G C Sbjct: 48 INQKLVLWTGDITTLKVDAIVNPTNENLSVMSPINQKIFEIAGPSL---HRDIRDEIGKC 104 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFP 119 TG + ++ +LP++ V+HTVGP + + + L +Y +SL + S+A P Sbjct: 105 ATGESKLSKGYNLPSRYVIHTVGPKYNPRYLSAVENALYRSYRSSLLIAGEYKVRSIAIP 164 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 + G+P + A IA++TV ++ + + + D+ +Y+RL Sbjct: 165 TVHLHQRGFPVSEGAHIALRTVRRYLEHQSCTLETVILILDDTEMEIYKRL 215 >UniRef50_Q94JV1 At1g69340/F10D13.28 n=23 Tax=Embryophyta RepID=Q94JV1_ARATH Length = 562 Score = 168 bits (426), Expect = 7e-41, Method: Composition-based stats. 
Identities = 58/173 (33%), Positives = 87/173 (50%), Gaps = 7/173 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI++ +G+ L VD +VN+ N +L G +H AAGP L + C + G C Sbjct: 83 INSRIYLWRGEPWNLEVDAVVNSTNENLDEAHSSPG-LHVAAGPGLAEQCATL----GGC 137 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T A DLPA+ V+HTVGP + + L Y + L L+ + S+A Sbjct: 138 RTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYRSCLELLIDSGLQSIALG 197 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 I T YPR AA +A++TV F+ + V F + +Y+RLL Sbjct: 198 CIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKISAVVFCTTTSSDTEIYKRLL 250 >UniRef50_C0PSL1 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PSL1_PICSI Length = 204 Score = 168 bits (426), Expect = 7e-41, Method: Composition-based stats. Identities = 74/157 (47%), Positives = 93/157 (59%), Gaps = 7/157 (4%) Query: 4 RIHVVQGDITKLAV----DVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + + QGDITK + D IVNAAN ++GGGGVDGAIH AAGP LL ACL V + Q Sbjct: 21 TLVIHQGDITKWFINGENDAIVNAANELMLGGGGVDGAIHSAAGPELLRACLNVPEIQPG 80 Query: 60 --CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 CP G A IT A +LP ++HTVGP++ E + +L AY +SL + N VA Sbjct: 81 VRCPAGSARITEAFNLPVSHIIHTVGPIY-DEEGDSASVLSSAYKSSLEVAEENHIKYVA 139 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQV 154 FPAIS GVYGYP AAE+A+ T+ +V Sbjct: 140 FPAISCGVYGYPLEKAAEVALLTLKNHAGDLEEILEV 176 >UniRef50_C4G1S1 Putative uncharacterized protein n=3 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1S1_ABIDE Length = 359 Score = 167 bits (424), Expect = 1e-40, Method: Composition-based stats. 
Identities = 56/169 (33%), Positives = 92/169 (54%), Gaps = 5/169 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +++ DITK+ D IVN ANP + GGGV+ AI+ AAG L L R++ G G Sbjct: 3 FRIIRNDITKVKADAIVNTANPEVAIGGGVETAIYSAAGKKKL---LDERKKIGILQPGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T A DL AK ++H P W+GG + E + L+D Y L+ S+AFP ++TG Sbjct: 60 VGVTEAFDLAAKYIIHVSSPRWKGGNKGEIKCLRDCYEKVLKTAKDYGCESIAFPLLATG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 YG+P+ ++AV + F+ + + ++ V ++ E + +L+ + Sbjct: 120 TYGFPKEVGVQVAVDAFTAFLEENEM--EITLVVFESEAVSISGKLVEE 166 >UniRef50_C9RQW9 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=C9RQW9_FIBSS Length = 347 Score = 167 bits (424), Expect = 1e-40, Method: Composition-based stats. Identities = 56/166 (33%), Positives = 89/166 (53%), Gaps = 5/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + +V+ DI+++ D IVN+AN + + GGG + I+ AAG D L R++ G Sbjct: 3 LRIVRNDISRVRADAIVNSANKNPVCGGGAEYHIYEAAG---YDKLLAAREKIGVLDVAE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 ++ A L AK ++H VGP W GGE E L Y +L S+AFP IS+G Sbjct: 60 VAVSSAFALKAKYLIHVVGPKWNGGESGETSALASCYRRALEKALELGCESIAFPLISSG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 V+ +P+ +A +IA++ + EF+ H + V V +D + + E L Sbjct: 120 VFRFPKDSALKIALQAIGEFLQSHEM--DVQLVVFDRKAFDVSEEL 163 >UniRef50_A2DTG7 Appr-1-p processing enzyme family protein n=2 Tax=Trichomonas vaginalis RepID=A2DTG7_TRIVA Length = 316 Score = 167 bits (423), Expect = 2e-40, Method: Composition-based stats. 
Identities = 61/174 (35%), Positives = 88/174 (50%), Gaps = 12/174 (6%) Query: 1 MKTRIHVVQG-DITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + +I G D TKL D IVNAAN L GGG+ GAI AAG + K +QG Sbjct: 51 INKKISFWMGGDSTKLKCDAIVNAANSYLAAGGGICGAIFSAAG---YEELQKACDEQGY 107 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T LP+K V+H VGPV + L+ AY +L + + S+AF Sbjct: 108 TETGGAKMTPGFRLPSKYVIHAVGPVGVH-----PEALRSAYNLTLGFMDNDKVKSIAFC 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERL 170 ISTG+YGY A +A+ TV +++ A +++ FV + ++ +Y Sbjct: 163 CISTGIYGYSIEKATPVALDTVRKWLEVPENLAKTDRLVFVVFMPKDQQVYSHF 216 >UniRef50_C5C222 Appr-1-p processing domain protein n=2 Tax=Actinomycetales RepID=C5C222_BEUC1 Length = 193 Score = 166 bits (422), Expect = 2e-40, Method: Composition-based stats. Identities = 76/156 (48%), Positives = 92/156 (58%), Gaps = 7/156 (4%) Query: 8 VQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQ--QGDCPTGHA 65 V GDIT VDV+VNAANPSL+GGGGVDGAIHRAAGP+LL C +R+ G A Sbjct: 12 VLGDITAQDVDVVVNAANPSLLGGGGVDGAIHRAAGPSLLAECQDLRRTVLPRGLSVGDA 71 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 V T AG+LPA VVHTVGP G Q + LL + SL + SVAFPA+S GV Sbjct: 72 VATGAGNLPALWVVHTVGPNAHVG-QRDPALLASCFTRSLDVAGGLGARSVAFPAVSAGV 130 Query: 126 YGYPRAAAAEIAVKTVSEFIT----RHALPEQVYFV 157 +G+ A IAV +V ++ + E V FV Sbjct: 131 FGWDVDVVARIAVDSVDTWLDGADPAASALELVRFV 166 >UniRef50_C2L199 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2L199_9FIRM Length = 344 Score = 166 bits (422), Expect = 2e-40, Method: Composition-based stats. 
Identities = 57/166 (34%), Positives = 97/166 (58%), Gaps = 5/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +++ DITK+ VD IVN ANP G+D A+++AAG L L+ RQ+ G G Sbjct: 3 FQIIRNDITKMQVDAIVNPANPIPGYAAGIDSAVYKAAGEEKL---LRRRQEIGAIAPGS 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 + IT +LPAK ++HTVG W+GG +E+ +++ Y + +L + S+A P +++G Sbjct: 60 SFITDGYNLPAKYIIHTVGTAWQGGNSDEEIIIRKCYRSIFKLALEHHILSLAIPLLASG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 YG+P+ A IA+ + F++ + + ++Y V +DE++ L L Sbjct: 120 SYGFPKGIALRIALSEIESFMSENDI--ELYLVVFDEKSYSLSTEL 163 >UniRef50_UPI0000F2CC13 PREDICTED: similar to B aggressive lymphoma long n=1 Tax=Monodelphis domestica RepID=UPI0000F2CC13 Length = 1624 Score = 166 bits (421), Expect = 3e-40, Method: Composition-based stats. Identities = 48/175 (27%), Positives = 86/175 (49%), Gaps = 5/175 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V + D+T+ D +VNAAN L+ GG+ A+ RA GP + + Q+G+ PT Sbjct: 100 ELSVWKDDLTRHPADAVVNAANERLLHAGGLALALVRAGGPLIEKESEAIIMQRGEVPTS 159 Query: 64 HAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLV--AANSYTSVAFPA 120 +T G LP ++H VGP W + Q L+ A N L V ++ +VA PA Sbjct: 160 EIAVTTGGQLPCSCIIHAVGPRWSDWNAERCCQELERATANILNYVTNDSHGIKTVAIPA 219 Query: 121 ISTGVYGYPRAAAAEIAVKTVSE--FITRHALPEQVYFVCYDEENAHLYERLLTQ 173 +S+G++G+P +I + T+ + + ++++ V +E +++ Sbjct: 220 LSSGIFGFPLELCVQIIILTIVRCPLLQSSKVLKEIHLVSNEEPTVAAFKKACEN 274 Score = 118 bits (296), Expect = 9e-26, Method: Composition-based stats. 
Identities = 51/181 (28%), Positives = 78/181 (43%), Gaps = 17/181 (9%) Query: 2 KTRIHVVQGDITKLAVDVIVNA--ANPSLMGGGGVDGAIHRAAGPALLDACLKV---RQQ 56 T + +++G I K VDVIVN+ A+ S G V AI AGP + + K + Sbjct: 293 NTNLQIIEGFIEKQQVDVIVNSISASNSFDLGK-VSNAILIHAGPEIEEEFSKTYSGMSE 351 Query: 57 QGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSV 116 V+T +L K V H V P ++L++A + L + S+ Sbjct: 352 SSKL----VVVTEGFNLACKHVYHVVWP----SSYQTKKVLKEAVMRCLEKTCQENMNSI 403 Query: 117 AFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCY--DEENAHLYERLLTQ 173 +FPA+ TG G P+ A I +K + +F H V FV Y D E + + L + Sbjct: 404 SFPALGTGNIGLPKREAISIMLKEIFQFSKNHPQKRLLVNFVVYPNDNELYEVMKSELDK 463 Query: 174 Q 174 Sbjct: 464 M 464 >UniRef50_D2V113 Appr-1-p domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2V113_NAEGR Length = 220 Score = 166 bits (421), Expect = 3e-40, Method: Composition-based stats. Identities = 57/187 (30%), Positives = 94/187 (50%), Gaps = 15/187 (8%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD-CPTG 63 + V +GD+T VDVIVNAAN L G+ GAI + G + K+ + G G Sbjct: 34 LQVRKGDLTMEKVDVIVNAANCRLQHMSGLAGAIVKNGGQIIQKESNKLIKDLGRELENG 93 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRG---------GEQNEDQLLQDAYLNSLRLVAANSYT 114 V T++GDLP K + H VGP+W G + ED L SL + + + Sbjct: 94 EVVETISGDLPCKTLYHAVGPIWSSRKANDFKTLGAEQEDFELGMCVEASLNMAVESGLS 153 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-----LPEQVYFVCYDEENAHLYER 169 S++ PAIS+G++G+P+ A++ TV+EF+ + +V F +D+E +++ + Sbjct: 154 SISLPAISSGIFGFPKDRCAKVLFNTVTEFLKSNKDNIKADRFEVRFTNFDDETCNIFSK 213 Query: 170 LLTQQGD 176 + + Sbjct: 214 EFKSRFN 220 >UniRef50_A2QSI2 Contig An08c0280, complete genome n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QSI2_ASPNC Length = 603 Score = 166 bits (421), Expect = 3e-40, Method: Composition-based stats. 
Identities = 60/190 (31%), Positives = 90/190 (47%), Gaps = 18/190 (9%) Query: 4 RIHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR-QQ 56 +H+ QGDIT L V I NAAN ++G +D IH AGP L + C Q Sbjct: 108 TLHLWQGDITTLDGVTAITNAANEQMLGCFQPAHRCLDNVIHARAGPRLREECFHHMDQG 167 Query: 57 QGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ----NEDQLLQDAYLNSLRLVAANS 112 Q P GHA T LPA V+HTVGP G+ ++ Q L+ Y L + A Sbjct: 168 QRTLPVGHACATKGYCLPAPYVIHTVGPQLDAGQPVPTAHQRQQLRQCYEAVLDVAEALP 227 Query: 113 Y-----TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAH 165 S+A ISTG++ +P AA IA+++V +++ H + F + + + Sbjct: 228 ASDPRGKSIALCGISTGLFAFPVEEAASIAIQSVLDWLRHHLHTSITNIIFNTFTDTDTA 287 Query: 166 LYERLLTQQG 175 +Y++ L + Sbjct: 288 VYQQTLKKMH 297 >UniRef50_C2KRZ5 Appr-1-p processing domain protein n=2 Tax=Mobiluncus mulieris RepID=C2KRZ5_9ACTO Length = 275 Score = 166 bits (420), Expect = 3e-40, Method: Composition-based stats. Identities = 76/143 (53%), Positives = 92/143 (64%), Gaps = 3/143 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 ++H + GDIT++ VD IVNAAN +L+GGGGVDGAIHRAAG LL AC +R + P Sbjct: 2 QLHAIGGDITRVHVDAIVNAANSTLLGGGGVDGAIHRAAGTELLAACRVIRATRYPDGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T LPAK V+HTVGP G Q + LL+ A++NSLR A SVAFPAI Sbjct: 62 VGQAVATKGFKLPAKWVIHTVGPNRHAG-QTDPGLLRAAFVNSLREAARVGAHSVAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEF 144 S GVYG+ A A I V V E+ Sbjct: 121 SGGVYGWDMAEVARIGVSAVHEW 143 >UniRef50_D0MWM6 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MWM6_PHYIN Length = 579 Score = 165 bits (419), Expect = 4e-40, Method: Composition-based stats. 
Identities = 64/188 (34%), Positives = 96/188 (51%), Gaps = 17/188 (9%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG 58 +I + +GDIT L IVNAAN +L+G +D IH AGP L AC ++ ++ Sbjct: 105 QIALWKGDITTLRATAIVNAANSALLGCFQPSHKCIDNVIHSMAGPRLRAACHEIMSRKA 164 Query: 59 D-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRLVAAN--- 111 P G+A IT LP+ V+HTVGP R GEQ E LQ Y SL L+ Sbjct: 165 HEEPGGNAQITQGFALPSSFVIHTVGPQLRHGEQPTAAECDQLQSCYTKSLDLLLKKVGD 224 Query: 112 --SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ---VYFVCYDEENAHL 166 + S+AF ISTG++ +P A +AV +V E++ +H + + F + + + L Sbjct: 225 TEQHVSIAFSCISTGLFAFPSDVAVPLAVNSVLEWLNQHQEETRGWKIIFNTFLKRDYDL 284 Query: 167 YERLLTQQ 174 Y+ + + Sbjct: 285 YKSFIESK 292 >UniRef50_C5VD03 Appr-1-p processing enzyme family protein n=2 Tax=Corynebacterium matruchotii RepID=C5VD03_9CORY Length = 274 Score = 165 bits (419), Expect = 4e-40, Method: Composition-based stats. Identities = 72/183 (39%), Positives = 99/183 (54%), Gaps = 10/183 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKV-RQ 55 +RI + +GDIT+L VD IVNAAN L+G VD AIH AAG L AC + Sbjct: 86 DSRIRLWRGDITRLDVDGIVNAANNKLLGCFRPGHTCVDNAIHSAAGLQLRQACADLVPS 145 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL--LQDAYLNSLRLVAANSY 113 + PTG A IT +LPA+ V+HTVGP+ G E N Q+ L +Y++ L L ++ Sbjct: 146 PDYEEPTGSARITPGFNLPARYVLHTVGPIVAGREANRQQVAELSASYISCLNLAHSSGL 205 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFIT--RHALPEQVYFVCYDEENAHLYERLL 171 S+AF ISTGV+G+P + AA IAV F+ + F + + + Y +LL Sbjct: 206 ESLAFCCISTGVFGFPPSHAARIAVAAARAFLAGLPKDSDFTIIFTVFTQNDYDRYAQLL 265 Query: 172 TQQ 174 Q Sbjct: 266 NPQ 268 >UniRef50_Q6NRC6 MGC83934 protein n=3 Tax=Xenopus RepID=Q6NRC6_XENLA Length = 914 Score = 165 bits (417), Expect = 7e-40, Method: Composition-based stats. 
Identities = 51/171 (29%), Positives = 86/171 (50%), Gaps = 4/171 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 R+ V +GD+T+ VD +VNAAN L GG+ A+ +A G + D + ++ +G Sbjct: 81 RVSVWKGDMTRQNVDAVVNAANEDLKHFGGLALALVKAGGAVIQDESRRHIEKYKKVKSG 140 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSL-RLVAANSYTSVAFPAI 121 +T AG+LP K ++H VGP W G +Q L++ N L +++ ++ SVA PA+ Sbjct: 141 SIAVTSAGNLPCKMIIHAVGPEWSPGINAKCEQELKEVIRNVLMQVMNESNVRSVAIPAV 200 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERL 170 S+G++ +P EI T +F + ++ FV D + Sbjct: 201 SSGIFRFPLQRCTEIIASTTKKFCDTETYHKLAEIRFVNIDTITVDAMKAA 251 Score = 108 bits (271), Expect = 7e-23, Method: Composition-based stats. Identities = 40/176 (22%), Positives = 78/176 (44%), Gaps = 10/176 (5%) Query: 5 IHVVQGDITKLAVDVIVNA--ANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +++ +G I + VIVN+ AN +L G + AI R AG +L + + + PT Sbjct: 358 LYLTKGYIEEQKTAVIVNSLGANRNLNEGN-ISKAILRKAGNSLSQE--VLDKSKYVSPT 414 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + T LP V H + + ++ ++L+D L + +S++FPA+ Sbjct: 415 DIMIPTRGYYLPCDFVYHVI---LQRSGSDQKKILKDGINACLNTALRYNTSSISFPALG 471 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHLYERLLTQQGD 176 TG+ +P+ A++ V F + ++FV + D + +++ Q Sbjct: 472 TGMLCFPKPVVAKVMTDEVLSFAKENPCNMDIFFVIHPNDTDTYSEFKKAFQAQQQ 527 >UniRef50_A0CX10 Chromosome undetermined scaffold_3, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CX10_PARTE Length = 183 Score = 165 bits (417), Expect = 8e-40, Method: Composition-based stats. 
Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 7/178 (3%) Query: 5 IHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + +++ +I KL VD IVNAAN L+ GGGV GAI +AAG L C + QQ G PT Sbjct: 6 VKIIKENIVKLVDVDAIVNAANQELLPGGGVCGAIFQAAGRELERECQQYIQQYGIVPTS 65 Query: 64 HAVITLAGDLP---AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV-AANSYTSVAFP 119 +T + L K ++H VGP + ED+ LQ N L SVA P Sbjct: 66 KLAVTSSCQLKKNNIKYIIHAVGPKYFQSSSPEDE-LQICVNNILNQSFNVLELKSVAIP 124 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLLTQQGD 176 AIS+G+YG+P+ A+I + E+ + + ++ +D+E +++++ QQ Sbjct: 125 AISSGIYGFPKGLCAQIFKLVIEEYQKDTSNKQGEIILCNFDQETTTIFQKVFQQQNS 182 >UniRef50_D2V337 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V337_NAEGR Length = 177 Score = 165 bits (417), Expect = 9e-40, Method: Composition-based stats. Identities = 63/177 (35%), Positives = 88/177 (49%), Gaps = 22/177 (12%) Query: 17 VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLK----------------VRQQQGDC 60 +D IVNAAN SLMGGGG+D IH AG L C + + C Sbjct: 1 IDTIVNAANESLMGGGGIDQIIHARAGDELKLECKTKYSPSCLKMKGSITYGNDELEYRC 60 Query: 61 PTGHAVITLAGDL--PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 TG AVIT A +L + ++HTVGP + +LL + Y + L+L N+ S+AF Sbjct: 61 ATGEAVITQAHNLSEKCQYIIHTVGPYLDENGNTQPELLSNCYNSCLQLAMENNLKSIAF 120 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH----ALPEQVYFVCYDEENAHLYERLL 171 P ISTG YGYP A +A+K V F+ H + + FV +++ +Y+ L Sbjct: 121 PCISTGYYGYPIEEACRLALKIVKNFLHSHLNKQSSLRHIIFVIFNDLEFEIYKILF 177 >UniRef50_Q8B4N1 ORF-1 n=7 Tax=Infectious spleen and kidney necrosis virus RepID=Q8B4N1_ISKNV Length = 566 Score = 164 bits (416), Expect = 1e-39, Method: Composition-based stats. 
Identities = 69/179 (38%), Positives = 96/179 (53%), Gaps = 8/179 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +T + VV DIT L VD IVNAAN +GGGGVDG IHR AG L C + G Sbjct: 389 QTNVSVVLDDITSLRVDAIVNAANTVGLGGGGVDGRIHRVAGRELKRECRTL----GGIG 444 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRLVAANSYTSVAF 118 G A IT LPA V+HTVGP+ G++ + ++L Y+ SL + AN ++AF Sbjct: 445 FGEAKITGGYRLPATYVIHTVGPIINAGQRPTQADKRVLTSCYIQSLHVAQANGVRTIAF 504 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQGD 176 P+ISTGVY YP A +A+ +V ++ +H + + F Y + +Y L + Sbjct: 505 PSISTGVYNYPIEDAVHVAMSSVRAYVIQHPGAFDHIVFCTYSNADFDVYNSQLPTYFN 563 >UniRef50_D2S4L6 Appr-1-p processing domain protein n=4 Tax=Actinomycetales RepID=D2S4L6_9ACTO Length = 170 Score = 164 bits (416), Expect = 1e-39, Method: Composition-based stats. Identities = 80/170 (47%), Positives = 102/170 (60%), Gaps = 5/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQ--QQGDCPT 62 + V+GDIT+ VDV+VNAANP L+GGGGVDGAIH A GP +L C ++ G P Sbjct: 3 LRAVRGDITEADVDVVVNAANPGLLGGGGVDGAIHAAGGPEILAECRALKAGLPGGRLPR 62 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV T AG LPA+ VVHT GP+W +Q+ +L+ SLR+ SVAFPAIS Sbjct: 63 GRAVATTAGRLPARWVVHTAGPIW-SADQDRSAVLRSCCTESLRVADGLGARSVAFPAIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVYG+P A AA AV V +H ++V FV +D+ +E LT Sbjct: 122 AGVYGWPLADAAVQAVAGVRAVEVQH--VQEVRFVLFDDRALAAFEAALT 169 >UniRef50_A1L291 LOC799852 protein (Fragment) n=5 Tax=Danio rerio RepID=A1L291_DANRE Length = 458 Score = 164 bits (415), Expect = 1e-39, Method: Composition-based stats. 
Identities = 59/181 (32%), Positives = 91/181 (50%), Gaps = 8/181 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V + D+T+ V+ +VNAAN L GGG+ A+ A GP + + ++ G TG Sbjct: 71 EISVWKDDLTQHKVEAVVNAANEKLQHGGGLAQALSMAGGPQIQRWSDDIIKRYGYVKTG 130 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNED-----QLLQDAYLNSLRLVAANSYTSVAF 118 AV+T AG+LP K ++H VGP ++ LL +A + L+ V + TSVA Sbjct: 131 EAVLTPAGNLPFKYIIHAVGPKVPQNPTQKEIGDATPLLYNAITSILQTVLRENITSVAI 190 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQG 175 PA+S+G++ +PR A+I VK + F +++ V DE + ER Sbjct: 191 PALSSGLFNFPRDRCADIIVKAIKTFHDHGGFQGRNLEIHLVNNDEPSVQEMERATRAIF 250 Query: 176 D 176 D Sbjct: 251 D 251 Score = 103 bits (257), Expect = 3e-21, Method: Composition-based stats. Identities = 48/172 (27%), Positives = 80/172 (46%), Gaps = 9/172 (5%) Query: 5 IHVVQGDITKLAVDVIVN--AANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +++ +G I VDV+VN A + L G + AI + AG + + K + + Sbjct: 285 LYLKRGAIEDEMVDVLVNTIAPDCKL-HQGVISRAILKKAGDEIQNEIYKKKSNTSFYSS 343 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 T +L K+V HTV + NE +L + L SL+ A Y S++FPAI Sbjct: 344 KVLYKTKGYNLYCKSVFHTVCAHRSDSKSNE--ILFNVVLESLKKAAE-DYESISFPAIG 400 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCY--DEENAHLYERLL 171 TG + + A+I + V+EF ++ + VYFV + D + +E + Sbjct: 401 TGNLDFKKWEVAKIMMDAVAEFAKQNKRKKLDVYFVVFPKDNDMMKAFENEM 452 >UniRef50_C1SPD7 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SPD7_9BACT Length = 177 Score = 164 bits (415), Expect = 1e-39, Method: Composition-based stats. 
Identities = 57/171 (33%), Positives = 88/171 (51%), Gaps = 5/171 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + + DITK D IVN AN L GGV GAI G ++ + C ++ G CP Sbjct: 10 TVLEIALRDITKQTTDAIVNPANRQLKMTGGVAGAIAAKGGRSIQEECDEI----GSCPL 65 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV+T AG L ++H VGP + G + ++ L+ A + S+ L N+ + +A PAIS Sbjct: 66 GEAVMTGAGFLKTTYIIHAVGPRY-GVDPEPEKYLKSAVMKSIELADKNNLSDIAIPAIS 124 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 G++GYP AAE+ + V E I ++ + E + ++ L + Sbjct: 125 AGIFGYPLEDAAEVIISAVIEKILSGTKLNKILLCLFTENDYMVFINTLDR 175 >UniRef50_C7Z089 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7Z089_NECH7 Length = 592 Score = 163 bits (414), Expect = 2e-39, Method: Composition-based stats. Identities = 59/187 (31%), Positives = 89/187 (47%), Gaps = 17/187 (9%) Query: 3 TRIHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 T + V +GDIT L + I NAAN ++G +D IH AGP L D C ++ Q Sbjct: 99 TNLVVWRGDITTLTGITAITNAANGQMLGCFQPTHRCIDNIIHSRAGPRLRDECFQLMQD 158 Query: 57 Q-GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRL----- 107 + D G ++T DLP+ V+HTVGP R G E + L Y ++L Sbjct: 159 RDKDLGAGETLVTRGYDLPSPYVIHTVGPQLRRGASPTEVERRQLARCYESTLDALELLP 218 Query: 108 VAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAH 165 + ++A ISTG++ +P AAEIA+ TV ++ H + F + E + Sbjct: 219 AEEDGRKAIALCCISTGLFAFPAKEAAEIAILTVLSWLDNHPSTTITDIIFNTFTESDTE 278 Query: 166 LYERLLT 172 +Y +L Sbjct: 279 IYSKLFE 285 >UniRef50_A4TAV6 Appr-1-p processing domain protein n=6 Tax=Actinomycetales RepID=A4TAV6_MYCGI Length = 577 Score = 163 bits (412), Expect = 3e-39, Method: Composition-based stats. 
Identities = 68/169 (40%), Positives = 95/169 (56%), Gaps = 6/169 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V+GDIT+ VD +VN AN ++ GGGG DGAIHRA GPA+L C V++ TG Sbjct: 11 TITAVRGDITEQEVDAVVNPANTAMRGGGGADGAIHRAGGPAILRDC--VKRFPDGLATG 68 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A T AGDLPA+ V+HTVGP + G++N LL+ Y +L++ VAFP IST Sbjct: 69 DAGWTTAGDLPAQWVIHTVGPNYDTGQRN-RSLLESCYRRALKVADELGARIVAFPLIST 127 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 G +G+PR A A++T++ ++ V +D + L Sbjct: 128 GSFGWPRQDAIAAAIETIA---AADTRVDEARLVAFDPKTHEEIRSALA 173 >UniRef50_Q0CEI7 Putative uncharacterized protein n=1 Tax=Aspergillus terreus NIH2624 RepID=Q0CEI7_ASPTN Length = 524 Score = 162 bits (411), Expect = 4e-39, Method: Composition-based stats. Identities = 58/175 (33%), Positives = 84/175 (48%), Gaps = 8/175 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I + DIT L VD IV + G GG+DGA+H AAGP LLDAC + G C Sbjct: 316 NDIISLAHTDITTLEVDCIVTGISE-PRGQGGLDGAVHAAAGPRLLDACNDL----GKCW 370 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 +T A +LP K V+HTV P + G + LL+ Y L + ++AFPA+ Sbjct: 371 VEEVQVTDAYNLPCKKVIHTVSPPYADGSADSKWLLRACYRRCLEIAIEGGMRTIAFPAL 430 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALP---EQVYFVCYDEENAHLYERLLTQ 173 STG G+ AA A++ V F+ +++ F +++ +Y Q Sbjct: 431 STGSKGFKSYEAATAALEEVRCFLDEPGHLLRFDKIIFCNIHQQDMEVYVAFTGQ 485 >UniRef50_A5D049 Predicted phosphatase n=4 Tax=Bacteria RepID=A5D049_PELTS Length = 359 Score = 162 bits (411), Expect = 4e-39, Method: Composition-based stats. 
Identities = 58/166 (34%), Positives = 83/166 (50%), Gaps = 6/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V++GDIT+L VD IVNAAN L G GV GAI R G A+ + + +G P G Sbjct: 2 IKVLKGDITELQVDAIVNAANNHLWMGAGVAGAIKRKGGAAIEEEAVA----KGPIPVGE 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AV+T AG L A+ VVH + + ++ A N+L ++AFPA+ TG Sbjct: 58 AVVTGAGLLKARYVVHAAA--MGQDLVTDAEKVRAATRNALLRAGELGLKTIAFPALGTG 115 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 V G AA + V V + P +V F +D++ + R+ Sbjct: 116 VGGLEFDTAARVMVGEVRRHLALGLEPGEVIFALFDDKGYDAFSRI 161 >UniRef50_D0NNH8 Putative uncharacterized protein n=3 Tax=Phytophthora infestans T30-4 RepID=D0NNH8_PHYIN Length = 287 Score = 162 bits (411), Expect = 4e-39, Method: Composition-based stats. Identities = 58/180 (32%), Positives = 86/180 (47%), Gaps = 5/180 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + V+QGD+T D IVNAAN LM GGG+ GAI R+ G ++ K + G Sbjct: 49 PELLVMQGDLTCCKADAIVNAANTRLMHGGGLAGAIVRSGGSSIQQESSKWVKDHGPLTV 108 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGG--EQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AV T AG L + V+HTVGP L+ A ++L SVA P Sbjct: 109 GDAVTTAAGKLTCQHVIHTVGPNVGSETLTSEHATQLRHAVWSALLEADRLKVKSVAVPG 168 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTG++GYPR A+ V ++F +++ + D+ + + +T + +E Sbjct: 169 ISTGIFGYPRDLGAKEIVTEAAKFCKEKAGSTALKRIALMNIDDPTVKSFVKAVTDEMEE 228 >UniRef50_D0NR00 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NR00_PHYIN Length = 492 Score = 162 bits (410), Expect = 5e-39, Method: Composition-based stats. 
Identities = 51/173 (29%), Positives = 83/173 (47%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +G + L VD +VN+ S+ G + ++AGP + C + G C Sbjct: 44 INAKLSLWRGPLYCLRVDAVVNSTCESMRQSDGDFDKLLKSAGPEIAVEC----KAAGAC 99 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG V+T LPAK ++HTVGP ++ N + L Y + L + N SVA Sbjct: 100 RTGDTVLTRGCKLPAKFILHTVGPRYQAKYHNAAEHSLHSCYRSVLAVTKENGLRSVATG 159 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 I T GYPR A IA +TV ++ + ++V ++ +YER+L Sbjct: 160 CIYTIRKGYPREEGAHIAARTVRRYLEHYGDDFDRVILCMDSVQDMDVYERVL 212 >UniRef50_Q4SK43 Chromosome 2 SCAF14570, whole genome shotgun sequence. (Fragment) n=4 Tax=Tetraodontidae RepID=Q4SK43_TETNG Length = 418 Score = 162 bits (410), Expect = 6e-39, Method: Composition-based stats. Identities = 57/173 (32%), Positives = 86/173 (49%), Gaps = 4/173 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + V + D+T VD +VNAAN L GG+ A+ +A G + + ++ G Sbjct: 54 RVTVSVHKADLTNFPVDAVVNAANERLQHVGGIALALSKAGGSQIQQDSDEYIRKNGVLR 113 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGE--QNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG +V AG LP K ++HTVGP G + LL+ A LNSL+ SVA P Sbjct: 114 TGESVAMDAGSLPCKKIIHTVGPHVTGHSLTASAANLLEKAVLNSLKKADECRLRSVALP 173 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERL 170 AIS+G++GYP A+ VK V +F ++ + + V + + ER Sbjct: 174 AISSGIFGYPLKECADTIVKAVRDFCEKYQIMSLKDILLVDKVDLTVNEMERA 226 Score = 94.9 bits (235), Expect = 1e-18, Method: Composition-based stats. 
Identities = 48/166 (28%), Positives = 72/166 (43%), Gaps = 14/166 (8%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDA--CLKVRQQQGDCPT 62 + + G I + +VIVN G + AI + AG +L A C V + Sbjct: 263 LTLKWGRIDEEQTNVIVNTTQKDSWDGQ-ISTAILKKAGTKMLKALKCANVGNR------ 315 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + ++T +L V HT+ G Q+L DA L+L A +S S+AFPAI Sbjct: 316 -NVIVTEPYNLRCAEVYHTL--FTAGSTDKAYQILTDAVSECLQLAANHSRQSIAFPAIG 372 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHL 166 TG G + A I + V +F + + +VYFV Y D + Sbjct: 373 TGGRGLEKEKVASIMSEAVFKFANQSSKQMEVYFVIYPGDHSTFQV 418 >UniRef50_Q9NXN4 Ganglioside-induced differentiation-associated protein 2 n=36 Tax=Euteleostomi RepID=GDAP2_HUMAN Length = 497 Score = 161 bits (408), Expect = 8e-39, Method: Composition-based stats. Identities = 56/173 (32%), Positives = 86/173 (49%), Gaps = 7/173 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +GD+ L IVN +N SL V +I AGP L + K++ C Sbjct: 52 VNGKVVLWKGDVALLNCTAIVNTSNESLTDKNPVSESIFMLAGPDLKEDLQKLK----GC 107 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSYTSVAFP 119 TG A +T +L A+ ++HTVGP ++ + + L Y N L+L S +SV F Sbjct: 108 RTGEAKLTKGFNLAARFIIHTVGPKYKSRYRTAAESSLYSCYRNVLQLAKEQSMSSVGFC 167 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 I++ GYP A IA++TV F+ H E+V F D E Y++LL Sbjct: 168 VINSAKRGYPLEDATHIALRTVRRFLEIHGETIEKVVFAVSDLEE-GTYQKLL 219 >UniRef50_UPI000194CBC9 PREDICTED: similar to B aggressive lymphoma n=1 Tax=Taeniopygia guttata RepID=UPI000194CBC9 Length = 718 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. 
Identities = 55/174 (31%), Positives = 84/174 (48%), Gaps = 5/174 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V + D+T+ VD++VNAAN L G G+ A+ +A GP + + Q+ G G Sbjct: 110 ICVYKDDLTRHKVDIVVNAANEYLEHGAGLALALVKAGGPEIKEESKLYVQRFGKVKVGD 169 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRL--VAANSYTSVAFPAI 121 +T G LP K ++H VGP W E+ LLQ A LN L + SVA PA+ Sbjct: 170 IAVTGGGKLPCKGIIHVVGPRWYALEKERCCYLLQKAILNVLHYVSAPGKALKSVAIPAV 229 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQ 173 S+G+Y +P +++ V V EF+ ++ V DE ++ + Sbjct: 230 SSGIYAFPIDLCSQVIVMAVKEFVEASPPGCLREIRLVNIDESTVAEIKKACEK 283 >UniRef50_A7HJC7 Appr-1-p processing domain protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJC7_FERNB Length = 184 Score = 161 bits (407), Expect = 1e-38, Method: Composition-based stats. Identities = 52/175 (29%), Positives = 84/175 (48%), Gaps = 3/175 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V GDIT +D IVNAAN L GGGV G I R GP + + ++ G G Sbjct: 10 EIEFVVGDITTQNIDAIVNAANSYLSHGGGVAGVISRKGGPTIQKESDEYVKKYGPVEPG 69 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T AG+L AK V+HTVGP+ G + D ++ ++N ++ ++A P + T Sbjct: 70 GVAVTGAGNLSAKYVLHTVGPI--GDKPQNDDIIVKCFINIIKKSDELGIKTIAIPFVGT 127 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQGDE 177 G++GYP E K + ++ + +++ F D + +E + + Sbjct: 128 GIFGYPLERFIENVTKVLINYLKDYEGTLQKIIFCDIDGYKVNKFEEYFLAKFKD 182 >UniRef50_D0WKT6 Appr-1-p processing enzyme family domain protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WKT6_9ACTO Length = 302 Score = 160 bits (406), Expect = 2e-38, Method: Composition-based stats. 
Identities = 58/184 (31%), Positives = 94/184 (51%), Gaps = 11/184 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQGD 59 + + +GD+ +LA D +VNAA P+L+G +D I GP + + C +R+ QG Sbjct: 119 VAMWRGDVRELAADAVVNAAMPNLLGCKDPLHPCIDNYIQGQGGPWIRNDCSVIREIQGK 178 Query: 60 -CPTGHAVITLAGDLPAKAVVHTVGPVWRGGE--QNEDQLLQDAYLNSLRLVAANS-YTS 115 G AV+T LPA+ V+HT+GP GGE + + L Y + L L + Sbjct: 179 DQEVGDAVLTRGYRLPARYVLHTLGPHLNGGEITDEDREKLAACYTSCLDLALEKGDIHN 238 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQ 173 V+F A+STG +P A IA+ TV++++ H E V F +++ +A Y + L Sbjct: 239 VSFCALSTGRNNFPFEEATHIALDTVNQWLQYHGTDVIELVVFNIFEDADAEGYMQALES 298 Query: 174 QGDE 177 ++ Sbjct: 299 WVED 302 >UniRef50_C3Y6H9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y6H9_BRAFL Length = 2209 Score = 160 bits (406), Expect = 2e-38, Method: Composition-based stats. Identities = 63/177 (35%), Positives = 92/177 (51%), Gaps = 5/177 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V + D+T+ VDVIVNAAN L GG+ +I AGP L C K+ +++ G Sbjct: 1143 TVTVRKDDLTRHVVDVIVNAANRDLKHIGGLAKSISDVAGPVLQSECDKITRRR-SLLDG 1201 Query: 64 HAVITLAGDL-PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 V+T AG + K ++H VGP+W+GG + E L DA SL +TS+A PAIS Sbjct: 1202 QVVVTSAGAMTTCKEIIHAVGPLWQGGFRREADALYDAAYGSLEEAGRRGHTSIAIPAIS 1261 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLT-QQGD 176 +G+Y +P A + V+ V EF + + V V D+ + LT + GD Sbjct: 1262 SGIYSFPVDQCANLIVEAVDEFWKNNRSSTLSLVELVNNDDRTVDAFVEALTSRHGD 1318 Score = 114 bits (286), Expect = 1e-24, Method: Composition-based stats. 
Identities = 37/176 (21%), Positives = 83/176 (47%), Gaps = 7/176 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQG--DCP 61 I +Q +I VDV+VN+ + +L + G + +I GP L + Q+G Sbjct: 1393 ITAMQANIASQRVDVMVNSTSHNLNLNSGQLSKSILDRGGPELQTLVNNAKAQKGIQSLA 1452 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G + + L + V+H+ W GG+ + +++L++ L++ + ++A PA+ Sbjct: 1453 DGDILESGPAGLNVQTVIHSALCRWDGGQGDSEKVLRELVRKCLKVAEEGGHKTIAIPAM 1512 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCY--DEENAHLYERLLTQ 173 TG +P AE ++ ++ + E++ F+ + D ++ ++ ++T+ Sbjct: 1513 GTGGLHFPHEVVAEALFGEAVDYFKQNPQSSIEEIRFIVWEGDPKSMVAFDEIMTK 1568 Score = 60.3 bits (145), Expect = 3e-08, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 1/57 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + V GDIT+ IVN++N L + GGV AI GP++ C + G Sbjct: 1657 LQVQPGDITEETTVAIVNSSNEQLDLTKGGVSNAIRNKGGPSIERECGNIGMLGGKV 1713 >UniRef50_C3Y6H4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y6H4_BRAFL Length = 2120 Score = 160 bits (404), Expect = 3e-38, Method: Composition-based stats. Identities = 60/175 (34%), Positives = 87/175 (49%), Gaps = 3/175 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ V + DITK DVIVNAAN L GG+ AI A G + C Q G G Sbjct: 1025 KLVVWRDDITKHKADVIVNAANVRLEHVGGLAKAIVDAGGDIIQKFCNDYIQANGKLIPG 1084 Query: 64 HAVITLAGDL-PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 V + G + + ++H VGP+W GG E+ L DA SL A +++ S+A PAIS Sbjct: 1085 QVVSSPPGRINTCQRILHAVGPIWNGGGLGEEGHLADAVYGSLEEAAKSNFRSIAIPAIS 1144 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQG 175 +G+YGYP AEI V E++ + + + FV ++ A + L + Sbjct: 1145 SGIYGYPLKKCAEIIVAKTVEYLEDNPTTSLQVIKFVNIVDQTAEAFVDALVSEF 1199 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. 
Identities = 40/174 (22%), Positives = 73/174 (41%), Gaps = 10/174 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +++ +GD+TK D IVN+ N L + G V AI +A GP + C + +G Sbjct: 1509 TLNIRKGDLTKETTDCIVNSTNEQLDLTRGAVSNAICKAGGPDIEQECKNIA-ARGGMRD 1567 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV T +G L ++H P + + N L+ S+AFPA+ Sbjct: 1568 GIAV-TGSGQLKCGKIIHAAAPA-----PGQSTGWKKVITNCLQTADTLRLRSIAFPALG 1621 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLLTQQ 174 TG + A + + +F+ ++ +V + +E + + ++ Sbjct: 1622 TGTLQGSAESTATTMLDALQDFVLQNKATRLNEVRITIFQQEMVRAFHEEMQKK 1675 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 43/166 (25%), Positives = 74/166 (44%), Gaps = 7/166 (4%) Query: 15 LAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQG--DCPTGHAVITLAG 71 VDV+VN +L + G V AI + AG L + QQ P G ++T + Sbjct: 1276 QNVDVLVNTTAGNLNLNTGAVSRAILQLAGNDLQTLVNRAMQQARITSLPDGQILVTDSA 1335 Query: 72 DLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRA 131 DL K V+H V W GG+ N +++L+ L+ +Y S+A PA+ TG +P Sbjct: 1336 DLLCKQVIHCVLCSWDGGQGNSEKVLRKIVQQCLQQAEKGNYASIAIPAMGTGGLHFPHD 1395 Query: 132 AAAEIAVKTVSEFITRH--ALPEQVYFVCY--DEENAHLYERLLTQ 173 AE E ++ ++ F+ + D ++ + ++ + Sbjct: 1396 VVAEAMFDEAVEHCRKNPSGSLREIRFIVWEEDPKSIPAFNEVMMK 1441 >UniRef50_UPI00005A247A PREDICTED: similar to H2A histone family, member Y isoform 3 n=1 Tax=Canis lupus familiaris RepID=UPI00005A247A Length = 412 Score = 159 bits (402), Expect = 4e-38, Method: Composition-based stats. 
Identities = 46/178 (25%), Positives = 90/178 (50%), Gaps = 7/178 (3%) Query: 1 MKTRIHVVQGDITKL---AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ 57 + +++++ +I+ L V+ I+N N + + + + G ++A L++R++ Sbjct: 233 LGQKLNLIHSEISNLAGFEVEAIINPTNADIDLKDDLGNTLEKKGGKEFVEAVLELRKKN 292 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 G A ++ LPAK V+H PVW G ++LL+ N L L S+A Sbjct: 293 GPLEVAGAAVSAGHGLPAKFVIHCNSPVW--GADKCEELLEKTVKNCLALADDKKLKSIA 350 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLTQ 173 FP+I +G G+P+ AA++ +K +S + + + VYFV +D E+ +Y + + + Sbjct: 351 FPSIGSGRNGFPKQTAAQLILKAISSYFVSTMSSSIKTVYFVLFDSESIGIYVQEMAK 408 >UniRef50_C3Y5X5 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y5X5_BRAFL Length = 1925 Score = 158 bits (401), Expect = 6e-38, Method: Composition-based stats. Identities = 57/179 (31%), Positives = 86/179 (48%), Gaps = 9/179 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ V QGD+T L VDVIVNAAN L GG+ A+ +A G + C + G G Sbjct: 910 KLFVCQGDLTALQVDVIVNAANSRLSHVGGLAAALVKAGGKEIQRDCESYIRTSGQLSDG 969 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-----LQDAYLNSLRLVAANSYTSVAF 118 + T LP K VVH VGP W+ G +++ L A +SL+ + S+ Sbjct: 970 DVMTTKPYRLPCKMVVHAVGPQWKSGLSEDEKGGKEANLYRAAFSSLQEAKD--FHSIGI 1027 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLLTQQG 175 PAIS+GVYG+P ++ V F H + +VYF D + ++ + ++ Sbjct: 1028 PAISSGVYGFPIDLCVSAILEGVMSFFNIHPNCKLSEVYFTEMDAKKTGAFKAEMVKRF 1086 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. 
Identities = 47/176 (26%), Positives = 70/176 (39%), Gaps = 10/176 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + V QGDIT +VD I+ N L GV + R AG +L CL V QQ G+ Sbjct: 1310 NVTLQVQQGDITTESVDAIIVPTNNKLRLDAGVAQVVSRKAGGSLQAECLAVVQQYGELQ 1369 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G T AG LP + V+H P L+D + L+ SVA PAI Sbjct: 1370 NGAVATTGAGSLPCRHVLHLANPQPNH--------LKDNIKHCLQTADQKKLKSVALPAI 1421 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQG 175 TG +A+ + ++EF+ + V + + + + + Sbjct: 1422 GTGGINISPDQSAKGMLDGIAEFVQQSNPQNLALVRITIFQPQMLQTFHTEMDNRA 1477 Score = 93.0 bits (230), Expect = 4e-18, Method: Composition-based stats. Identities = 41/175 (23%), Positives = 57/175 (32%), Gaps = 50/175 (28%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + + QG IT DV+VN L +G GGV A +A GP L Sbjct: 1142 TLQLKQGGITAEQADVLVNTVGTDLDLGQGGVASAFLKAGGPELQQP------------- 1188 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 L+ L + N S+AFPA+ Sbjct: 1189 ----------------------------------LRTIIQTCLTMAHKNGLPSIAFPALG 1214 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQG 175 TG GYPR+ AA V F + + V V YD+ ++ L + Sbjct: 1215 TGNLGYPRSVAASAMFDEVVSFSQANPSTSLKHVSIVVYDQPTVQAFQAELRTRQ 1269 >UniRef50_C7HUZ2 RNase III regulator YmdB n=2 Tax=Anaerococcus RepID=C7HUZ2_9FIRM Length = 163 Score = 157 bits (397), Expect = 2e-37, Method: Composition-based stats. 
Identities = 67/165 (40%), Positives = 91/165 (55%), Gaps = 8/165 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPA-LLDACLKVRQQQGDCPT 62 + V+ DI KL VD IVNAAN L+ GGG+ G I AG L ACLK+ Sbjct: 2 TLKVIDIDILKLNVDAIVNAANVDLIEGGGICGQIFEKAGREKLKKACLKLS----PIKP 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWR-GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT +L K ++H VGPV+ ++ ++LQDAY NSL++ S+AFP I Sbjct: 58 GEAVITDGFNLYQKYIIHAVGPVYNEMYKEACQKILQDAYKNSLKIAKKKGIKSIAFPLI 117 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHL 166 S+G+YGYP A IA T+ EF+ + + +VY Y + L Sbjct: 118 SSGIYGYPDKDAFMIAKNTIDEFLKNYEM--EVYLSTYGKNILSL 160 >UniRef50_Q0UG78 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0UG78_PHANO Length = 2240 Score = 156 bits (395), Expect = 3e-37, Method: Composition-based stats. Identities = 55/172 (31%), Positives = 82/172 (47%), Gaps = 10/172 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMG--GGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I D+TKL VD IVN+AN SL G ++ AIH+AAGP L + G Sbjct: 603 ISFCHHDLTKLKVDAIVNSANKSLKMTRGDTLNNAIHKAAGPGLSVEA----RLTGRLE- 657 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWR-GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G A+IT +LP++ V+H + P + E L D Y L++ N ++AFP + Sbjct: 658 GQALITGGHNLPSEHVIHVLRPGYFRHKGMGEFNQLIDCYREVLKVAIENKIKTIAFPCL 717 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 TG G+P AA I ++ + E++ H E++ F + Y L Sbjct: 718 GTGGVGFPARVAARITLQEMREYLDAHPEHNLERIIFCVNTAADEKAYIDFL 769 Score = 141 bits (355), Expect = 1e-32, Method: Composition-based stats. 
Identities = 51/173 (29%), Positives = 83/173 (47%), Gaps = 9/173 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +I++V+ DITKL VDV+VN+ + S G G +D + + G + A G C Sbjct: 1019 NDKIYLVREDITKLEVDVMVNSTDVSFRGMGTLDRTVLQKGGEQMRAA----VTAFGQCK 1074 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G T LPAK V+H + G +L+ Y L+ + TS+A P+I Sbjct: 1075 IGEVRHTEGYMLPAKHVLHIIPADRYNG--GTKIVLKKLYREVLQEAVSMRATSIALPSI 1132 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERLL 171 TG+ YPR A +A++ F+ R+ E++ FV + + +Y+ L+ Sbjct: 1133 GTGMLNYPRRDVASVALEEAKRFLESAERNNPVEKIIFVVFSSNDEFVYKSLM 1185 >UniRef50_C3Y5Q2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y5Q2_BRAFL Length = 1122 Score = 156 bits (395), Expect = 3e-37, Method: Composition-based stats. Identities = 53/175 (30%), Positives = 91/175 (52%), Gaps = 5/175 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ + QGDIT+ DVIV+ N SL GG+ AI A GP + AC+ ++ G G Sbjct: 708 KVFIYQGDITQEVADVIVSCNNESLDSAGGIARAISDAGGPEIRRACVDYIRRHGRLSAG 767 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVA-ANSYTSVAFPAIS 122 ++ T G L + VVHTV P +Q + Q L +L+ L + S+A PAI Sbjct: 768 QSIWTPGGRLRCQHVVHTVSPQ-SSRDQTDHQQLFSTFLDLLNIAEFDLKVNSIAIPAIG 826 Query: 123 TGVYGYPRAAAAEIAVKTVS---EFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 +G+ G+P+A A++ + +S ++ T +L +++ V D + + ++ +Q Sbjct: 827 SGIAGFPKAVCADVMFRVISAFEDYQTPDSLLKEIRLVNIDAKTTAAFVQVFSQH 881 >UniRef50_Q8IXQ6 Poly [ADP-ribose] polymerase 9 n=27 Tax=Eutheria RepID=PARP9_HUMAN Length = 854 Score = 156 bits (395), Expect = 3e-37, Method: Composition-based stats. 
Identities = 47/175 (26%), Positives = 86/175 (49%), Gaps = 6/175 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + V + D+T AVD +VNAAN L+ GGG+ A+ +A G + + + + G Sbjct: 117 RIELSVWKDDLTTHAVDAVVNAANEDLLHGGGLALALVKAGGFEIQEESKQFVARYGKVS 176 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWR-GGEQNEDQLLQDAYLNSLRLVAANS--YTSVAF 118 G +T AG LP K ++H VGP W +Q LQ A ++ L V + +VA Sbjct: 177 AGEIAVTGAGRLPCKQIIHAVGPRWMEWDKQGCTGKLQRAIVSILNYVIYKNTHIKTVAI 236 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH---ALPEQVYFVCYDEENAHLYERL 170 PA+S+G++ +P + V+T+ + + ++++ V ++ ++ Sbjct: 237 PALSSGIFQFPLNLCTKTIVETIRVSLQGKPMMSNLKEIHLVSNEDPTVAAFKAA 291 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 45/175 (25%), Positives = 74/175 (42%), Gaps = 7/175 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + +VQG I DVIVN+ NP + G V +I + AG + L + +Q Sbjct: 316 NLTLQIVQGHIEWQTADVIVNSVNPHDITVGPVAKSILQQAGVEMKSEFLATKAKQ-FQR 374 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 + ++T +L K + H + E + Q+L+ A L + TS++FPA+ Sbjct: 375 SQLVLVTKGFNLFCKYIYHVL----WHSEFPKPQILKHAMKECLEKCIEQNITSISFPAL 430 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQG 175 TG + AAEI V F H V FV + + +Y+ ++ Sbjct: 431 GTGNMEIKKETAAEILFDEVLTFAKDHVKHQLTVKFVIF-PTDLEIYKAFSSEMA 484 >UniRef50_A7S3X0 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7S3X0_NEMVE Length = 143 Score = 156 bits (395), Expect = 3e-37, Method: Composition-based stats. 
Identities = 57/143 (39%), Positives = 78/143 (54%), Gaps = 1/143 (0%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V QGDIT D +VNAAN L+ GGGV GAI G ++ + C ++ + G G Sbjct: 1 VTVYQGDITNERADAVVNAANCDLIHGGGVAGAILAKGGWSIQEECYQIVGRFGRLEVGD 60 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV T AG L KAV+H VGP W G + + L A L SL + S+AFPAIS+ Sbjct: 61 AVQTNAGKLLCKAVIHAVGPTWLGATPEQVKNQLFRACLESLYTADNINLCSIAFPAISS 120 Query: 124 GVYGYPRAAAAEIAVKTVSEFIT 146 G+YG P+ A++ + V + Sbjct: 121 GIYGVPKEICAQVMLDVVEHYAE 143 >UniRef50_C3Y406 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3Y406_BRAFL Length = 2514 Score = 156 bits (394), Expect = 4e-37, Method: Composition-based stats. Identities = 53/179 (29%), Positives = 90/179 (50%), Gaps = 5/179 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-DCPT 62 ++ V + D+TK VD NAAN +L GGG+ AI +A G + D C ++ + + Sbjct: 1068 KLVVFKDDLTKHHVDATTNAANKNLKNGGGLAEAIIKAGGKEIQDHCDQIMKDEPAGLMV 1127 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWR--GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G +T G LP KAV+H VGP + ++ L N L + + ++SVA PA Sbjct: 1128 GAVRVTGPGKLPCKAVIHAVGPNFHEIKDDKRSRDELFKTVTNVLEMASRYGFSSVAIPA 1187 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLLTQQGDE 177 IS+G++G P + V+ + ++ + +V+FV D + A + + L + +E Sbjct: 1188 ISSGIFGGPLDLCTKTVVRATGLYFKKNKESKVNEVHFVGIDLDIAQSFNKALLETFNE 1246 Score = 141 bits (357), Expect = 7e-33, Method: Composition-based stats. 
Identities = 47/178 (26%), Positives = 79/178 (44%), Gaps = 5/178 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP- 61 +I +++G I+ DVIVN P L + G V A+ GP L C K+++ G P Sbjct: 1300 KITLIRGSISDQQADVIVNTIGPDLNLRTGAVSKALLDKGGPTLQVECDKIKRDLGRLPA 1359 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWR-GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G V T G+L V H V W + +L+ L+ +S ++AFPA Sbjct: 1360 HGEVVYTSGGNLGCNLVYHAVCSFWNSQDTAKSEDVLRKIVTACLKSADKDSKRTIAFPA 1419 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQQGD 176 + TG GYP+ A + + ++ E+ FV +D+ + + L ++ + Sbjct: 1420 VGTGGLGYPKDVVARLMFEETLSHSNKNPAGDLEEAKFVIFDQPSFEAFLSELGKRTE 1477 Score = 112 bits (281), Expect = 5e-24, Method: Composition-based stats. Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 11/172 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-GGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V QGDIT+ VD IVN + G V + + GP + C K + Sbjct: 1522 VEVEQGDITREKVDAIVNPTRGDMDLSLGKVSQVLKKKGGPVVQTECEKYDK--NKLKRD 1579 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 IT AG L ++ ++H V P + E + + A +N L + S+AFPA+ T Sbjct: 1580 GVGITAAGGLASRYILHLVAPGF------ETERWKKAVMNCLAYAECHQLKSLAFPALGT 1633 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQ 173 G +A + ++ +++F + + V V ++ + L + Sbjct: 1634 GQMAKDPTESATMIIEAIADFAQKKNPKHLKHVRIVIFEAGMMKPFHDKLGK 1685 >UniRef50_C9YUB3 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YUB3_STRSW Length = 333 Score = 155 bits (391), Expect = 8e-37, Method: Composition-based stats. 
Identities = 71/177 (40%), Positives = 101/177 (57%), Gaps = 9/177 (5%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG-D 59 + +GD+T LA D +VNAAN L+G +D A+H AAGP L D C + QG Sbjct: 153 TLWRGDLTTLAADAVVNAANSRLLGCFRPRHPCIDNALHNAAGPRLRDDCHTIVTAQGTR 212 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAA-NSYTSVA 117 PTG A IT LPA+ V+HTVGP+ +G +D Q L +Y + L L A S +VA Sbjct: 213 EPTGTAKITRGYHLPARHVLHTVGPLVQGRPHTDDAQALASSYRSCLDLAAQVESVRTVA 272 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLLTQ 173 F A+STGV+GYP+ AA +A++TV ++IT R ++V + ++ Y L + Sbjct: 273 FCAVSTGVFGYPKDEAASVALRTVEDWITARPHRFDRVVLTVFTADDERAYRHALGE 329 >UniRef50_A6LTB5 Appr-1-p processing domain protein n=3 Tax=Clostridium RepID=A6LTB5_CLOB8 Length = 214 Score = 154 bits (390), Expect = 1e-36, Method: Composition-based stats. Identities = 67/215 (31%), Positives = 103/215 (47%), Gaps = 48/215 (22%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T ++ DITK+ D IVNAAN SL+GGGGVDGAIH+A G LLD C ++ C T Sbjct: 2 TNFKILFDDITKIKFDAIVNAANASLLGGGGVDGAIHKACGEKLLDECRQLN----GCLT 57 Query: 63 GHAVITLAGDL---PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVA---------- 109 G + +T + +L V+HTVGP++R +E++ L++AY + + A Sbjct: 58 GRSKLTRSYNLSDHGVHWVIHTVGPIYRN-NGSEEKYLRNAYRSVFDIAANYSEFYSKQC 116 Query: 110 -----------------------------ANSYTSVAFPAISTGVYGYPRAAAAEIAVKT 140 + ++A P+ISTG Y YP A IA+ Sbjct: 117 NEILNKNLYRFNTDKQRDFILKELDDYINDHPIKTIALPSISTGAYSYPLNEACNIALDE 176 Query: 141 VSEFITRHALP-EQVYFVCYDEENAHLYERLLTQQ 174 + FI +++ VC DE+ ++Y+ L ++ Sbjct: 177 ILSFINNSPDTFDEIAMVCLDEKTYNMYKSLYEER 211 >UniRef50_A7T7L3 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7T7L3_NEMVE Length = 177 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. 
Identities = 53/177 (29%), Positives = 87/177 (49%), Gaps = 21/177 (11%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-- 58 + ++ + GDIT L +D IVNA N ++ G+D + KV +G Sbjct: 12 LNDKVSLWTGDITALEIDAIVNAGNTIMLMFIGIDVDSYPN----------KVYSGRGIF 61 Query: 59 DCPTGHAVITLAGD-LPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 C + + L G V+HT GP+ + + LQD Y N L+L + ++A Sbjct: 62 KCFFFNLSVLLKGSPYFGLDVIHTAGPMGKNRIK-----LQDCYKNCLQLAKQHGVKTLA 116 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLL 171 F ISTG+YGYP AA +A++TV +++ + E++ F + ++ +YERLL Sbjct: 117 FCCISTGIYGYPNKDAAHVALETVRQWLETDDNNDSVERIIFCTFLPKDTEIYERLL 173 >UniRef50_C8WJT1 Appr-1-p processing domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WJT1_EGGLE Length = 255 Score = 153 bits (388), Expect = 2e-36, Method: Composition-based stats. Identities = 64/150 (42%), Positives = 82/150 (54%), Gaps = 8/150 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + R+ + +GDIT LAVD IVNAAN L+G +D AIH AG L C ++ + Sbjct: 82 VDGRLALWRGDITTLAVDAIVNAANSKLLGCFIPGHHCIDNAIHTFAGMQLRLVCDELMR 141 Query: 56 QQGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE--DQLLQDAYLNSLRLVAANS 112 QG P G A +T A +LP++ VVHTVGP GE ++ L Y SL AA Sbjct: 142 AQGHDEPVGRAQVTSAFNLPSRFVVHTVGPQVPTGEPTAAQEEQLASCYRASLDAAAAAG 201 Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVS 142 S+AF ISTG + +PR AA IAV V Sbjct: 202 VASLAFCCISTGEFRFPRERAARIAVGEVR 231 >UniRef50_B7CC50 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CC50_9FIRM Length = 175 Score = 153 bits (387), Expect = 3e-36, Method: Composition-based stats. 
Identities = 59/176 (33%), Positives = 86/176 (48%), Gaps = 12/176 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I ++G+I L D+IV+ N ++ GV I AG ++ AC ++ G Sbjct: 2 ISTLKGNIALLDFDLIVDPTNKQVLPMQGVSAQIFHQAGSEMMKACQELN----GLEVGK 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYT------SVAF 118 A +T A +LP KAV+HT GP + G NED+ L Y NS+ L ++AF Sbjct: 58 AKMTKAFNLPCKAVIHTCGPRYMDGTHNEDEYLAACYWNSMALAYDYMRKNDMESINIAF 117 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLLT 172 P ISTG+ YP A IA++TV + + + V FVC E+ LY+ L Sbjct: 118 PCISTGINAYPNHEACVIAIQTVKRLMNKFPETKAIHVCFVCDKTEDYMLYKEALR 173 >UniRef50_B0P6L4 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P6L4_9FIRM Length = 168 Score = 153 bits (387), Expect = 3e-36, Method: Composition-based stats. Identities = 60/170 (35%), Positives = 84/170 (49%), Gaps = 6/170 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPTG 63 + +V GDITK+ D IVNAA+ L G+ AI AA LL AC K+ G C G Sbjct: 1 MRLVLGDITKMDTDAIVNAASSDLRPCPGICSAIFAAADTEKLLAACKKI----GRCRIG 56 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT + L K ++H G W G N+ L D Y ++L+ AA SVA P + + Sbjct: 57 KAVITPSFGLACKYIIHVAGVGWYSGRYNDRMLFADCYRSALQKAAAYHCKSVAIPLMFS 116 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 G + PRA A +I V F H E + V Y + L +++++ Sbjct: 117 GDFHIPRAQALQIVADVVGGFEKSHPSLE-ISLVLYKQSIYDLAKKIISN 165 >UniRef50_A7EET2 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EET2_SCLS1 Length = 506 Score = 153 bits (386), Expect = 3e-36, Method: Composition-based stats. 
Identities = 61/158 (38%), Positives = 82/158 (51%), Gaps = 1/158 (0%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + VV GD+ K VDVIVNAAN SL+ G G+DG IHR AGP L G Sbjct: 19 TTVEVVDGDLLKYPVDVIVNAANASLVRGDGIDGEIHRQAGPELAAEMKTQFPHPGKQGG 78 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + + ++H VGP WR Q LL +AY NSL L A N+ S+AFPAIS Sbjct: 79 AYGTTHSWDITSCQYIIHAVGPDWRQPNQRATGLLANAYHNSLSLAAKNNLRSIAFPAIS 138 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCY 159 G++ PR A +KT+ +I H +++ + + Sbjct: 139 VGIFQMPRGMAGVTVMKTIRSWIDSHQGEMDRIGILLF 176 >UniRef50_UPI000180B1B4 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2), partial n=1 Tax=Ciona intestinalis RepID=UPI000180B1B4 Length = 1271 Score = 153 bits (386), Expect = 4e-36, Method: Composition-based stats. Identities = 55/177 (31%), Positives = 90/177 (50%), Gaps = 6/177 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALL-DACLKVRQQQGD 59 + +++GDIT++ D IVNA+N L + G+ G+I + GP + + + G Sbjct: 520 NVEVKILRGDITEVNCDAIVNASNDKLELRDAGISGSIKKKCGPTVQAEMNQHIASVGGT 579 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE--DQLLQDAYLNSLRLVAANSYTSVA 117 G AV T AG + + ++H VGPVW+G +E + L+ +L+ + TSVA Sbjct: 580 MLPGSAVSTSAGRMNCRRIIHVVGPVWKGDISDEVCEAYLKSCVSETLKEAERYNLTSVA 639 Query: 118 FPAISTGVYGYPRAAAAEIAVKT-VSEFITRHALPEQVYFV-CYDEENAHLYERLLT 172 PAIS GV+G + + V+T V F+ + +Q+YFV + E + R L Sbjct: 640 MPAISCGVFGGSVSVCPRLMVETLVDHFMKPSSCIKQIYFVENSNNEVIQSFSRSLQ 696 Score = 84.9 bits (209), Expect = 1e-15, Method: Composition-based stats. 
Identities = 36/170 (21%), Positives = 62/170 (36%), Gaps = 15/170 (8%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V QGDIT D I+ S G V AI + G ++ + Q + Sbjct: 1018 VIVKQGDITIENSDAIICPTAQSYDLSGQVGQAILQRGGQSIQTELQQQLFTQK-----N 1072 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T AG L K V H V N+ +++ + + S++ PAI TG Sbjct: 1073 YSVTGAGQLACKHVFHIVT-------GNDGTQMENVLMEVFEQADSLRIHSLSIPAIGTG 1125 Query: 125 VYGYPRAAAAEIAVKTVSEF---ITRHALPEQVYFVCYDEENAHLYERLL 171 +AAA + + F + Q+ V + + +++ Sbjct: 1126 NSSLTSSAAARHINRAIHIFEGNVRTSPTLHQINIVVFQHQMMADFQQEF 1175 Score = 74.5 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 26/114 (22%), Positives = 46/114 (40%), Gaps = 2/114 (1%) Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSYTSVAFPA 120 G V T L V H + ++ N + L A L+ Y ++AFP Sbjct: 886 VGDVVHTSGYRLQCTEVYHVIVANYQPNSHNNSKHNLVKAIKTCLQNADQAGYATIAFPT 945 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 ++TG +GYP + A + + F +H L + V + + A +++ Q Sbjct: 946 LATGGFGYPARSVARWMKRELDSFKPKH-LKQVVIAMLPTDSGAVVFQNEFNSQ 998 Score = 46.4 bits (109), Expect = 4e-04, Method: Composition-based stats. Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 3/52 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANP-SLMGGGG--VDGAIHRAAGPALLDACLKV 53 + +VQGDI+ VDVIV +P + G G + A+ R AGP L K+ Sbjct: 733 VRLVQGDISTQNVDVIVTTGSPQNFAKGSGSAITQALIRIAGPQLQREMQKI 784 >UniRef50_A6SR30 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6SR30_BOTFB Length = 474 Score = 152 bits (385), Expect = 5e-36, Method: Composition-based stats. 
Identities = 56/174 (32%), Positives = 87/174 (50%), Gaps = 1/174 (0%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T + V+ GD+ K VDVIVNAAN L GGG+DGAIH AAGP L ++ Q G Sbjct: 17 LDTTVEVLIGDMLKYPVDVIVNAANVKLKKGGGIDGAIHAAAGPELQGEMNELFQHPGQV 76 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 + + + ++H VGP W EQ + + L A NSL L N S+AFP Sbjct: 77 GGAYGTTSSWDIQSCRYIIHAVGPNWNIPEQQDGKFLFTAIQNSLDLAMKNKLRSIAFPG 136 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQ 173 IS G++ P++ A + + + +I ++ +++ + + E L + Sbjct: 137 ISMGIFAMPKSLAGLVIISALRTWIIKYRGEMDRISILLLGYSEDEITETRLRE 190 >UniRef50_Q4RS18 Histone H2A (Fragment) n=2 Tax=Tetraodontidae RepID=Q4RS18_TETNG Length = 415 Score = 152 bits (384), Expect = 5e-36, Method: Composition-based stats. Identities = 52/187 (27%), Positives = 87/187 (46%), Gaps = 22/187 (11%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH-- 64 VVQ DI+ + D +V+ + S GG V A+ + G +A +++++ G Sbjct: 227 VVQADISIVESDAVVHPTSSSFYTGGEVGTALEKKGGKEFTEALQELKKKNGPLEVAGGK 286 Query: 65 ----------------AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV 108 AV+T LPAK V+H P W G +++L N L L Sbjct: 287 CPDWKTGFLLLSQLLIAVLTAGFGLPAKYVIHCNSPGW--GSDKCEEMLDKTVKNCLALA 344 Query: 109 AANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHL 166 SVAFP+I +G G+P+ AA++ +K +S + T + + VYFV +D E+ + Sbjct: 345 DEKKLKSVAFPSIGSGRNGFPKQTAAQLILKAISSYFVATMSSTIKTVYFVLFDSESIGI 404 Query: 167 YERLLTQ 173 Y + + + Sbjct: 405 YVQEMAK 411 >UniRef50_UPI0001C38755 appr-1-p processing domain-containing protein n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38755 Length = 575 Score = 151 bits (383), Expect = 7e-36, Method: Composition-based stats. 
Identities = 56/147 (38%), Positives = 75/147 (51%), Gaps = 16/147 (10%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGG------------GVDGAIHRAAGPALLD 48 + RI V+QGDIT+ VD IV + NP L+ VD IH++AG L Sbjct: 433 LSDRITVIQGDITQQPVDAIVCSTNPHLLPNKKWGSFFMSSDHPEVDIMIHKSAGVELKQ 492 Query: 49 ACLKVRQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV 108 C K+ C G A IT +LPA+ V+HTV P W+ GE ++LL Y N L LV Sbjct: 493 ECQKLN----GCKVGEAKITPGYNLPAEWVIHTVSPTWQNGEVQAEKLLAKCYQNCLNLV 548 Query: 109 AANSYTSVAFPAISTGVYGYPRAAAAE 135 + S+AFPA+ TG + AA+ Sbjct: 549 NSQEIESIAFPALGTGTGKFTLEKAAK 575 >UniRef50_B7PR73 Ganglioside induced differentiation associated protein, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PR73_IXOSC Length = 437 Score = 151 bits (381), Expect = 1e-35, Method: Composition-based stats. Identities = 43/172 (25%), Positives = 79/172 (45%), Gaps = 4/172 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GD+T L IVN+ N +L + I AG L L + C Sbjct: 54 VNRKVALWVGDLTSLNTHAIVNSTNENLTDKSPLSQRIVERAGEQLRRDMLNEIRT---C 110 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A ++ +LPA+ V+HTVGP + + + L +Y L+++ + ++ Sbjct: 111 RTGEAKLSKGYNLPARFVIHTVGPKYNAKFRTAAESALHSSYWRVLQMLPEHGLATLGLC 170 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 I + GYP A +A++T+ F+ H ++ + + +YE+LL Sbjct: 171 PIHSARRGYPLQDGAHLALRTLRRFLELHGDCVELVVLVMEGTELGMYEQLL 222 >UniRef50_Q55AK6 U box domain-containing protein n=2 Tax=Eukaryota RepID=Q55AK6_DICDI Length = 1618 Score = 150 bits (380), Expect = 2e-35, Method: Composition-based stats. 
Identities = 56/173 (32%), Positives = 88/173 (50%), Gaps = 3/173 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +++GDITK IVN AN L GG +I AAG + C ++ G TG Sbjct: 918 IRIIKGDITKQKTHAIVNPANEKLKNLGGAAFSIQEAAGATFKEFCESYYEKNGPIGTGC 977 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +V + V++TVGP + N+ ++L + +SLR A + S++ PAISTG Sbjct: 978 SVYGSKFKMGNIFVINTVGP--KNDNPNKARILHMSIHSSLRSATALNCQSISIPAISTG 1035 Query: 125 VYGYPRAAAAEIAVKTVSEF-ITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 ++GY A I +K+ EF +T +V FV ++ A+++E L + D Sbjct: 1036 IFGYDPKEAVPIIIKSAIEFLLTNETTLNEVNFVDLNQSTANIFENSLIKFSD 1088 >UniRef50_B2VUH2 MACRO domain containing protein 1 n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2VUH2_PYRTR Length = 1599 Score = 150 bits (378), Expect = 3e-35, Method: Composition-based stats. Identities = 50/170 (29%), Positives = 87/170 (51%), Gaps = 9/170 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + +V+ DI KL VD++VN+ + S +G G +D ++ + GP L++ ++ G C G Sbjct: 958 VCLVREDIMKLEVDIMVNSTDSSFLGMGVLDRSVFKKGGPELMEQ----IKKFGTCNEGD 1013 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T LPAK ++H + P ++ +L++ Y L TS+A P+I TG Sbjct: 1014 VKVTPGYLLPAKHILHAIPP--EQFSKSNKGILRNIYREILHTAVLLKATSIAIPSIGTG 1071 Query: 125 VYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLL 171 YPR A +A++ V F+ + E++ FV Y + +Y+ LL Sbjct: 1072 RLNYPRRDCASLAMEEVKRFLESADPNNTLEKIIFVVYSSNDEFVYKSLL 1121 Score = 148 bits (375), Expect = 6e-35, Method: Composition-based stats. 
Identities = 43/176 (24%), Positives = 79/176 (44%), Gaps = 10/176 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG---GGVDGAIHRAAGPALLDACLKVRQQQG 58 I + D+T+L VD IVN A L + AI +AAGP L + + + Sbjct: 537 NQLISFIHHDLTRLKVDAIVNNAPTDLSLSPANNTLHSAIFKAAGPGLTEEA----KLKA 592 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVA 117 D G +T DLP+ ++H G + + ++ ++L Y ++L + + ++A Sbjct: 593 DIKVGQVGLTQGHDLPSSWIIHAAGLKYNWSKGYDQFKVLSSCYQSALEMATYHGIKTIA 652 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLL 171 FP + TG G+P AA IA++ + +++ H E++ + + Y Sbjct: 653 FPCLGTGGCGFPARVAARIALQEIRDYLDSHPKHGLERIVICVKTDFDKKAYMSFF 708 >UniRef50_Q7JUR6 Protein GDAP2 homolog n=19 Tax=Neoptera RepID=GDAP2_DROME Length = 540 Score = 149 bits (377), Expect = 3e-35, Method: Composition-based stats. Identities = 47/169 (27%), Positives = 73/169 (43%), Gaps = 6/169 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R + GD+T L VD I N ++ +L + I AG L + + +C Sbjct: 65 VNNRFVIWDGDMTTLEVDAITNTSDETLTESNSISERIFAVAGNQLREE---LSTTVKEC 121 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG IT +LPAK V+HTV P +R + + L Y N L + ++A Sbjct: 122 RTGDVRITRGYNLPAKYVLHTVAPAYREKFKTAAENTLHCCYRNVLCKAKELNLHTIALC 181 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYE 168 IS +P AA IA++T+ ++ + QV +C YE Sbjct: 182 NISAHQKSFPADVAAHIALRTIRRYLDK--CTLQVVILCVGSSERGTYE 228 >UniRef50_O67112 UPF0189 protein aq_987 n=4 Tax=cellular organisms RepID=Y987_AQUAE Length = 165 Score = 149 bits (376), Expect = 4e-35, Method: Composition-based stats. 
Identities = 54/167 (32%), Positives = 81/167 (48%), Gaps = 7/167 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I VV+G IT++ DVIVN AN + GGGV I R G + ++ + P G Sbjct: 3 IKVVKGSITEVDADVIVNPANSRGLMGGGVAVVIKRLGGEEIEREAVE----KAPIPVGS 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AV+T AG L K V+H + + ++ ++ A +L L + VA P + TG Sbjct: 59 AVLTTAGKLKFKGVIHA-PTMEEPAMPSSEEKVRKATRAALELADKECFKIVAIPGMGTG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 V G P+ AA V+ + +F E+V V DEE +E++L Sbjct: 118 VGGVPKEVAARAMVEEIRKF--EPKCLEKVILVDIDEEMVEAWEKVL 162 >UniRef50_B1L625 Appr-1-p processing domain protein n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L625_KORCO Length = 175 Score = 148 bits (375), Expect = 6e-35, Method: Composition-based stats. Identities = 53/175 (30%), Positives = 86/175 (49%), Gaps = 10/175 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ +V GDIT++ D IVN AN LM GGGV GAI R G + + ++ Sbjct: 3 IMPRLILVLGDITEVESDAIVNPANVFLMMGGGVAGAIKRKGGEEIEREAM----RKAPL 58 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G A+ T AG L AK V+H V G + + ++ A SL+ S+AFPA Sbjct: 59 KIGEAIETSAGKLKAKYVIHA-PTVESPGGSSSPEYIRAAVKASLKKGEELGIRSIAFPA 117 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 + GV G P + I ++ + + + E+V V ++++ +++R+ G Sbjct: 118 MGAGVGGVPVEESVRIILEEI-----KASPIEEVLLVTRNKQDLEVFKRVSEYMG 167 >UniRef50_A7C4X9 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C4X9_9GAMM Length = 220 Score = 148 bits (375), Expect = 6e-35, Method: Composition-based stats. 
Identities = 57/159 (35%), Positives = 85/159 (53%), Gaps = 3/159 (1%) Query: 16 AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLPA 75 VD IVN AN L GGG+ I AG L +AC K+ QQQG AV+T AG LP Sbjct: 27 PVDTIVNPANSGLSHGGGLAEQILLEAGSKLEEACHKIIQQQGKISVTKAVVTTAGQLPY 86 Query: 76 KAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAE 135 + V+H VGP G+ E ++ +N L++ + S+AFPAISTG++ P+ A+ Sbjct: 87 QGVIHAVGPRMGDGK--EQSKIETTIINCLQIAEKYQWKSIAFPAISTGLFCVPKTVCAK 144 Query: 136 IAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLLTQ 173 K +S + H + ++C E+ ++E++L Q Sbjct: 145 AFDKAISYYWENHPNSAIKNIWLCLLTEDYPIFEKILNQ 183 >UniRef50_UPI000196CD43 hypothetical protein CATMIT_02190 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CD43 Length = 239 Score = 148 bits (374), Expect = 8e-35, Method: Composition-based stats. Identities = 58/169 (34%), Positives = 87/169 (51%), Gaps = 8/169 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPT 62 + +++ +I +A D IV AN +L G G AI AAG L AC ++ G C T Sbjct: 2 KFKIIKANIVDVASDAIVLPANEALKEGSGTSKAIFTAAGRKELTKACKEL----GHCST 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A+ TLA +L +K ++H V P W GE +E LL AYL SL + SVAFP ++ Sbjct: 58 GSAIPTLAYNLSSKYIIHAVVPKWIDGEHSEYDLLSSAYLASLNIAEVMGCESVAFPLLA 117 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +G G+ + A IA +++ F ++V+ V Y + Y + L Sbjct: 118 SGNNGFDKQLAVRIAEESIKSF--EGVNLKKVFLVVYGD-TMETYMKSL 163 >UniRef50_UPI000180BD0B PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=2 Tax=Ciona intestinalis RepID=UPI000180BD0B Length = 1729 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. 
Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 5/158 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I+V++ DIT+ D I+NA+NP L + GG+ GAI + G + + V ++G G Sbjct: 905 INVLKTDITQHECDAILNASNPELDLLPGGISGAIQKTGGDKIQEEMHAVISKRGKLFPG 964 Query: 64 HAVITLAGDLP-AKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A IT AG L + ++H VGP W + LQ +++ + S++ PAI Sbjct: 965 DAAITGAGKLKTCRFIIHAVGPRWAEHSHSTCCKYLQSCINYAMQEAESKRLRSISIPAI 1024 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFV 157 S GV+G + + V TV ++ R++ +V FV Sbjct: 1025 SCGVFGGVPSVCIPLIVDTVLDYFKQKRNSSITRVDFV 1062 Score = 106 bits (265), Expect = 4e-22, Method: Composition-based stats. Identities = 40/170 (23%), Positives = 67/170 (39%), Gaps = 13/170 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V QGD+T D IVN+ NP G + AI + G +L+ C + QQG + Sbjct: 1122 ISVSQGDLTLDNSDAIVNSTNPQFDLTQGMISQAILKKGGRTVLNEC---KNQQGQWNSP 1178 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T G L + V H V P N + + L + ++A PA+ T Sbjct: 1179 RIRVTSGGKLQCRYVFHIVTP-------NNTKQITSVLLEVFTIADKLGLATLALPALGT 1231 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 G G A+ + E++ + + V +++ + + L Sbjct: 1232 GNLGIESLRIAQCIRGAIKEYVDSNTPANLNTIKVVIFEQSMVAEFRQGL 1281 >UniRef50_UPI00006A1CA6 poly (ADP-ribose) polymerase family, member 14 n=11 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1CA6 Length = 1527 Score = 145 bits (367), Expect = 5e-34, Method: Composition-based stats. 
Identities = 52/178 (29%), Positives = 89/178 (50%), Gaps = 4/178 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + I V + D+T+ VDV+VNAA L G+ A+ AAGP L C + +++G Sbjct: 523 RVTIAVYKDDLTRHRVDVVVNAAREDLKHTEGLALALLNAAGPKLQTECDHIIKREGKYS 582 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPA 120 G +VIT AG+LP K V+HTV P W Q +LL+ L L A N +S+ PA Sbjct: 583 VGDSVITGAGNLPCKQVIHTVSPKWDPNSQTRCTRLLRRGISRCLELAAENGLSSIGIPA 642 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLLTQQG 175 + + + G+P + + V++V +++ R +++ V + + + + + Sbjct: 643 VGSQMSGFPVTVSVQNIVESVRQYVESPQRSRKVTRIHLVDSADGTVAAFAKAVRAEF 700 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 42/160 (26%), Positives = 72/160 (45%), Gaps = 2/160 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I ++QG+I DVIVN+ L + G V A++ AG L ++ + G Sbjct: 737 IKIIQGNIQDATTDVIVNSVGKDLDLNTGAVSKALNAKAGTKLQQQLREMSRGT-QVEEG 795 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T L K V+H V P W G+++ +++L+ N L S+ FPAI T Sbjct: 796 SVFVTNGFGLNCKKVIHVVTPGWDQGKRSAEKILRTIMTNCLSTTEKEKLRSITFPAIGT 855 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEEN 163 G G+P+ A + V + + ++V F+ + + Sbjct: 856 GALGFPKDLVASLMFDEVLKSSCKGGQLQEVNFLLHPSDM 895 Score = 105 bits (263), Expect = 5e-22, Method: Composition-based stats. 
Identities = 48/175 (27%), Positives = 82/175 (46%), Gaps = 14/175 (8%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V GDITK + DVIVN++N S GV AI AAG ++ D C + Q Sbjct: 947 KYQVRTGDITKESTDVIVNSSNSSFTQKIGVSKAILEAAGKSIEDECATLGAQANK---- 1002 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 ++T G+LP + ++H + + ++ + L+ L+ TSVA PA+ T Sbjct: 1003 GYIVTQKGNLPCRHIIHV----YTISTPDR---IKASVLDVLQECENLKATSVALPAVGT 1055 Query: 124 GVYGYPRAAAAEIAVKTVSEF--ITRHALPEQVYFVCYDEENA-HLYERLLTQQG 175 G G AA A + V EF + + V + + ++ Y+ +++++G Sbjct: 1056 GAGGATSAAVAAAMLDAVEEFVTMKSPKSVQTVKVIVFQQKMLDDFYKSMMSKEG 1110 >UniRef50_C3YS04 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YS04_BRAFL Length = 178 Score = 145 bits (367), Expect = 6e-34, Method: Composition-based stats. Identities = 59/165 (35%), Positives = 78/165 (47%), Gaps = 7/165 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +++GDIT VD IVNAAN SL GV GAI RA G A+ C + + G T Sbjct: 15 QIDIIKGDITSQKVDTIVNAANSSLSLAVGVSGAISRAGGRAIQTECDNIIK-HGSLRTT 73 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVA-ANSYTSVAFPAI 121 V T G L ++H VGP + G E Q L D L + A + S+A PAI Sbjct: 74 DCVWTTPGRLSCTYIIHAVGPNFVPGCESRCKQELYDTCQKVLNIAASRLNAKSIAMPAI 133 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITR----HALPEQVYFVCYDEE 162 S+G G PR AE + +F+ + + V YD E Sbjct: 134 SSGASGMPRRLCAEAMCSAIMDFVENGQGIGSSLLDIRIVDYDRE 178 >UniRef50_Q54PT1 Protein GDAP2 homolog n=1 Tax=Dictyostelium discoideum RepID=GDAP2_DICDI Length = 568 Score = 145 bits (366), Expect = 7e-34, Method: Composition-based stats. 
Identities = 44/174 (25%), Positives = 80/174 (45%), Gaps = 7/174 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI + GDI L D IV + + +L + I + G +++ Q+ G+C Sbjct: 55 INSRICLWMGDICNLNTDTIVYSNSKTLTESDTISDKIFKYGGSEMMND----IQKNGEC 110 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGE-QNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G ++IT G+LP++ VVHTV P + + L Y ++ L S++F Sbjct: 111 RYGESIITSGGNLPSRFVVHTVCPTYNPKYLSAAENALNSCYRSAFHLSMDVKSKSISFS 170 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLL 171 + + +P IA++T+ F+ + E+V E+ LYE++L Sbjct: 171 TLHSEKRQFPSVGGCHIALRTIRRFLEKPFSKSFEKVILAINTFEDLRLYEQML 224 >UniRef50_C3Y417 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3Y417_BRAFL Length = 1060 Score = 145 bits (365), Expect = 9e-34, Method: Composition-based stats. Identities = 55/180 (30%), Positives = 82/180 (45%), Gaps = 6/180 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + + QGD+T+ V IVNAAN L G+ AI AGP+L + C K + G Sbjct: 459 TVSMYQGDLTQEKVTAIVNAANGYLAHAAGIAAAIQEQAGPSLEEECRKYISKHGPLYET 518 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAAN-SYTSVAFPA 120 + T AG+LP V+H VGP WR + L+ +LN L T+VA PA Sbjct: 519 QVMHTSAGNLPCHYVIHAVGPKWRDYSNKTECASALRVTFLNCLDYANEKLHATTVALPA 578 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA---LPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTG++G P A+ V +F + +V V + + H+ ++ + Sbjct: 579 ISTGIFGVPNDVCAKAVYDAVRDFSKSQSQLGSLGEVRLVNAELDMVHVLRQMFEVSMSQ 638 >UniRef50_C3YS03 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3YS03_BRAFL Length = 2671 Score = 144 bits (364), Expect = 1e-33, Method: Composition-based stats. 
Identities = 56/184 (30%), Positives = 85/184 (46%), Gaps = 12/184 (6%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + ++ VVQG++T VDV+VN AN SL GGG+ AI +A G + C + G Sbjct: 2200 RRKLVVVQGNLTSHRVDVMVNTANGSLSHGGGLAAAIVKAGGQEIQRDCTNYIKDNGKLT 2259 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPA 120 G + T LP K VVH VGP+W +++ ++ L+ A N+L S+A PA Sbjct: 2260 EGQVMSTKGYKLPCKMVVHAVGPLWIADQKDSKEKALKMAVENALLEARDY--HSIAIPA 2317 Query: 121 ISTG-------VYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 IS+G + GYP V V+ F + +V+F D + + L Sbjct: 2318 ISSGEELILLCISGYPIKPCVAAIVAAVTAFFNTNPDCALSEVHFAEMDPQKTDAFRDEL 2377 Query: 172 TQQG 175 + Sbjct: 2378 LNRF 2381 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 54/172 (31%), Positives = 73/172 (42%), Gaps = 12/172 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I + +G+IT DV+VN + L + GGV A +A G L C G G Sbjct: 2428 IQLKKGNITAEKADVLVNTTSGDLDLSQGGVARAFGQAGGQELQQLCNN----HGKANAG 2483 Query: 64 HAVIT-LAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 VIT AG L K V H V P W Q DQ L+ + L YTS++FPA+ Sbjct: 2484 DIVITLRAGTLRCKQVYHAVLPNW----QESDQPLRTMVQDCLESADQGGYTSISFPAMG 2539 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLT 172 TG YPR AA + F + + V + +D+ +E L Sbjct: 2540 TGNLKYPRDVAASCMYDEILSFSQSNPGTTLQDVGIIVFDQPTVQAFETELR 2591 >UniRef50_UPI0000E8099B PREDICTED: similar to PARP9 protein n=2 Tax=Gallus gallus RepID=UPI0000E8099B Length = 796 Score = 143 bits (362), Expect = 2e-33, Method: Composition-based stats. 
Identities = 52/170 (30%), Positives = 91/170 (53%), Gaps = 4/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V + D+T D +VNAAN SL G + A+ A GP + + ++ G PTG Sbjct: 78 LLVYKDDLTSHKADAVVNAANESLEHSGALALALLNAGGPEIAEESRNFIRKHGKVPTGK 137 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVA--ANSYTSVAFPAI 121 +T G LP K ++H +GP+W E+ + LL++A +N L+ + N+ SVA PA+ Sbjct: 138 IAVTGGGKLPCKKIIHAIGPIWYPSEKEKCCVLLEEAVVNVLKYASDPKNNIKSVAIPAV 197 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERL 170 S+GV+G+P A++ V ++ F+ + ++++ V E+ +R Sbjct: 198 SSGVFGFPVNLCAQVIVMSIKLFVETQPSCLKEIHLVNICEQTVAEIKRA 247 Score = 89.9 bits (222), Expect = 3e-17, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 74/174 (42%), Gaps = 7/174 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 R+ +++G + K+ IV++ + + A+ + AGP L L + Sbjct: 277 NIRLRIIKGYLEKIRTTAIVSSVSSDGEFCSQISTAMLQKAGPTLQAEILSQLKHLDSSK 336 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 ++T +LP+ V+H + P + +Q L++ L V S+AFP Sbjct: 337 --ELIVTSGYNLPSDFVLHVLWPCFNHVVLLCEQ-LKEIVNRCLYFVRNYPLPSIAFPEK 393 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCY--DEENAHLYERLLT 172 + + P A AEI ++ V +F ++ + V FV + D+ +++ + Sbjct: 394 NWSLK-LPVAIVAEIMIEEVLDFARKYPETKIDVQFVLHPDDDTTYQVFQEKMN 446 >UniRef50_UPI0000E4815A PREDICTED: similar to LRP16 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4815A Length = 415 Score = 143 bits (360), Expect = 3e-33, Method: Composition-based stats. 
Identities = 54/113 (47%), Positives = 68/113 (60%), Gaps = 5/113 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ V QGDITKL VD IVNAAN SL+GGGGVDGAIHRAAG LL C K+ C Sbjct: 159 LNNRVSVWQGDITKLDVDCIVNAANRSLLGGGGVDGAIHRAAGSNLLQECKKLA----GC 214 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANS 112 TG A +T LP++ V+HTVGP+ + N + L Y L + ++ Sbjct: 215 ETGDAKLTAGYLLPSRYVLHTVGPMVYGQPMTNHREDLTSCYATCLHQILEHN 267 Score = 77.2 bits (189), Expect = 2e-13, Method: Composition-based stats. Identities = 25/60 (41%), Positives = 39/60 (65%), Gaps = 1/60 (1%) Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 SVAFP ISTGVYGYP+ A+ +A+ TV E++ + +++ F + + + +YERLL Sbjct: 335 IRSVAFPCISTGVYGYPQEEASRVALGTVREWLEENPEEVDRIVFCIFLDRDLKVYERLL 394 >UniRef50_C3Y5X1 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y5X1_BRAFL Length = 592 Score = 143 bits (360), Expect = 3e-33, Method: Composition-based stats. Identities = 45/175 (25%), Positives = 75/175 (42%), Gaps = 10/175 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GDIT V IVN + +L GV AI AAGP++ C + + G Sbjct: 252 VQVQMGDITMEQVSAIVNPSQNNLDLDKGVSRAISMAAGPSVQKECRQYIRDNWYPKAGD 311 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 V T AG+LP +++H V P + L+ N L + + S+AFPA+ TG Sbjct: 312 VVATGAGNLPCASILHLVQPT--------AKYLRSDVKNCLLVAHQMNLRSLAFPAVGTG 363 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQGDE 177 + +A + ++EF+ + + V + E + + ++ E Sbjct: 364 RFHIKPERSARCMIDGIAEFVQDWSPTTLSIIRIVIFQENMLQAFHTAVHRKASE 418 >UniRef50_Q2SM57 Predicted phosphatase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SM57_HAHCH Length = 180 Score = 143 bits (360), Expect = 4e-33, Method: Composition-based stats. 
Identities = 54/180 (30%), Positives = 85/180 (47%), Gaps = 13/180 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + GDIT+L VD IV A+ L G G+ I AG L+A Q G C G Sbjct: 2 IEFLCGDITELEVDAIVCPAHKYLSKGRGLSAQIFEQAGEEALEAAC---SQAGGCKVGG 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQ---NEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A +T LPAK ++HTV P W GG+Q ++ LL + Y + +RL ++AFPA+ Sbjct: 59 ACLTPGFKLPAKHIIHTVTPQWTGGDQWGGSDLHLLANCYDSVVRLALEQGVKTIAFPAL 118 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENA----HLYERLLTQQGDE 177 G P++ AA ++ + ++ E++ + E YE ++ ++ Sbjct: 119 GAGTNKTPQSMAAHEGLEVLVKYADS---FERLIICLHWEAGLDTWRRTYEDFFARRVEQ 175 >UniRef50_O07733 UPF0189 protein Rv1899c/MT1950 n=16 Tax=Mycobacterium RepID=Y1899_MYCTU Length = 359 Score = 140 bits (354), Expect = 2e-32, Method: Composition-based stats. Identities = 53/169 (31%), Positives = 74/169 (43%), Gaps = 8/169 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V Q D+TKL +D I NAAN L GGV AI RA GP L + + G Sbjct: 191 ELEVHQADVTKLELDAITNAANTRLRHAGGVAAAIARAGGPELQRESTE----KAPIGLG 246 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV T AGD+PA+ V+H G +++ A +LR S+A A T Sbjct: 247 EAVETTAGDMPARYVIHAA--TMELGGPTSGEIITAATAATLRKADELGCRSLALVAFGT 304 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GV G+P AA + V V R ++V F + + + + Sbjct: 305 GVGGFPLDDAARLMVGAVRRH--RPGSLQRVVFAVHGDAAERAFSAAIQ 351 >UniRef50_Q9P0M6 Core histone macro-H2A.2 n=118 Tax=Eukaryota RepID=H2AW_HUMAN Length = 372 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. 
Identities = 42/178 (23%), Positives = 82/178 (46%), Gaps = 7/178 (3%) Query: 1 MKTRIHVVQGDIT---KLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ 57 + ++ + Q DI+ + V+ IV+ + + A+ +A G L+ ++R+ Q Sbjct: 193 LGQKLSLTQSDISHIGSMRVEGIVHPTTAEIDLKEDIGKALEKAGGKEFLETVKELRKSQ 252 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 G A ++ + L AK V+H P W G ++ L++ N L SVA Sbjct: 253 GPLEVAEAAVSQSSGLAAKFVIHCHIPQW--GSDKCEEQLEETIKNCLSAAEDKKLKSVA 310 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQ 173 FP +G +P+ AA++ +K +S + + VYF+ +D E+ +Y + + + Sbjct: 311 FPPFPSGRNCFPKQTAAQVTLKAISAHFDDSSASSLKNVYFLLFDSESIGIYVQEMAK 368 >UniRef50_B0QWK9 Putative uncharacterized protein n=1 Tax=Haemophilus parasuis 29755 RepID=B0QWK9_HAEPR Length = 156 Score = 139 bits (350), Expect = 5e-32, Method: Composition-based stats. Identities = 49/131 (37%), Positives = 72/131 (54%), Gaps = 3/131 (2%) Query: 45 ALLDACLKVRQQQGDC-PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYL 102 L AC ++ ++QG PTG A IT A +LP+ V+HTVGP+ G + +LL Y Sbjct: 2 QLRLACAELMEKQGHLGPTGQAKITPAFNLPSAYVLHTVGPIISGALSAKDCELLASCYR 61 Query: 103 NSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEE 162 + L L + SVAF ISTG + +P AAEIAV+TV F+ + +V F + + Sbjct: 62 SCLELAKQHGIESVAFCCISTGEFRFPNQEAAEIAVQTVKAFLADNPQ-MKVVFNVFKDV 120 Query: 163 NAHLYERLLTQ 173 + +Y LL + Sbjct: 121 DLEIYRGLLGE 131 >UniRef50_C3ZVW0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZVW0_BRAFL Length = 731 Score = 138 bits (348), Expect = 7e-32, Method: Composition-based stats. 
Identities = 47/178 (26%), Positives = 71/178 (39%), Gaps = 3/178 (1%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T+I + QGD+ K VDVIVN N L G + A+ + G + C + G Sbjct: 538 LDTKISIYQGDVIKECVDVIVNETNDRLKLSGELSWALAQYGGHDIEADCRRYVATHGRL 597 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLV-AANSYTSVAF 118 V T AG LP+K ++H V P W E + LL Y N + TS+A Sbjct: 598 AATQVVPTSAGQLPSKHILHAVVPHWVSAHPRESKMLLYKTYENIFKCAGIKMRVTSIAL 657 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQG 175 ++G G P+ AE + V F+ + L + V + + Sbjct: 658 SLQTSGSTGIPKDVYAETMFQAVVSFLKTYGPLLRDIRMVNPSHRTVSTFIDAFKTKM 715 >UniRef50_B7P925 Histone H2A n=1 Tax=Ixodes scapularis RepID=B7P925_IXOSC Length = 366 Score = 138 bits (348), Expect = 8e-32, Method: Composition-based stats. Identities = 41/175 (23%), Positives = 72/175 (41%), Gaps = 8/175 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ V+QGD+ + D ++ N SL G V + +A G + + G Sbjct: 195 LSVQLTVIQGDMASVTADAAIHPTNASLSLSGEVGQVLEKAGGKEFVQEVKDLFSAHGPL 254 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 + AVI PAK V+H P + L+ N L L + +A P Sbjct: 255 ESAGAVICPGHQFPAKFVIHCNVP------SGSSEPLEKCVRNCLALADEKNIRVLAVPP 308 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLTQ 173 ++T + AA+ +K +S + + +Q+YFV D E+ +Y L + Sbjct: 309 LATHSVASQKQQAAQTILKAISNYFVNVMSSSLKQIYFVLSDMESIGIYTSELAK 363 >UniRef50_B9L2D9 Appr-1-p processing enzyme family protein n=2 Tax=Thermomicrobia (class) RepID=B9L2D9_THERP Length = 176 Score = 138 bits (348), Expect = 9e-32, Method: Composition-based stats. 
Identities = 61/172 (35%), Positives = 78/172 (45%), Gaps = 8/172 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GDIT + + IVNAAN L G GV GAI RA G + + QG G Sbjct: 6 LEVQVGDITAVDTEAIVNAANSQLWMGSGVAGAIKRAGGEEIEREAVA----QGPISVGE 61 Query: 65 AVITLAGDLPAKAVVHTVGPVWR---GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 AV+T AG LP AV+H + + + A +L A SVAFPA+ Sbjct: 62 AVVTTAGRLPFAAVIHAAAMGYDERGAMIPATSETVYAATRAALERCAERPLRSVAFPAL 121 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLT 172 TGV G A V+ V + ALPE+V FV EE A + R + Sbjct: 122 GTGVGGLDLVTCAAAMVRAVRDHAASGAALPERVVFVVRSEEAADAFLRAIA 173 >UniRef50_D1R847 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R847_9CHLA Length = 411 Score = 136 bits (343), Expect = 3e-31, Method: Composition-based stats. Identities = 56/156 (35%), Positives = 80/156 (51%), Gaps = 16/156 (10%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGP--------ALLDACL-- 51 KT++ +V+G D IVNAAN L+GGGG+DG I +G L A + Sbjct: 203 KTKVVLVKGSTLDQNTDAIVNAANERLLGGGGIDGQIWSRSGALSGAKDSGEFLKAEIMP 262 Query: 52 -KVRQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAA 110 K G+ P G AVIT A L ++ ++H VGP ++L++AYLNSL L+ A Sbjct: 263 IKANLPSGNLPNGEAVITRALGLNSRYIIHAVGPRGAQ-----PKVLRNAYLNSLELLDA 317 Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFIT 146 N S++F IS ++GY AA I V + + Sbjct: 318 NQLKSISFCCISQSIFGYSPKDAAPIVVDLIRRYCE 353 >UniRef50_D1B7G8 Appr-1-p processing domain protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B7G8_THEAS Length = 179 Score = 136 bits (342), Expect = 4e-31, Method: Composition-based stats. 
Identities = 53/162 (32%), Positives = 76/162 (46%), Gaps = 7/162 (4%) Query: 9 QGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVIT 68 +GDI D IVNAAN L G GV GAI R+AG + + +G G AV T Sbjct: 16 EGDICSYRGDAIVNAANDRLWMGSGVAGAIRRSAGEEVEAEAI----SKGPIRVGSAVAT 71 Query: 69 LAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGY 128 AG LP KAV+H V + + ++ + +LRL A +AFPA+ TGV G+ Sbjct: 72 GAGRLPLKAVIHCA--VMGQDLKTSREAIRSSTGEALRLAAEMELRRIAFPALGTGVGGF 129 Query: 129 PRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYER 169 P + + + EF+ ++V F + E + R Sbjct: 130 PVEECGHVMGEELKEFLLICPDGLDEVAFYLFGAEAFRQFVR 171 >UniRef50_Q4RG95 Chromosome 12 SCAF15104, whole genome shotgun sequence n=10 Tax=Clupeocephala RepID=Q4RG95_TETNG Length = 1433 Score = 133 bits (335), Expect = 3e-30, Method: Composition-based stats. Identities = 56/175 (32%), Positives = 85/175 (48%), Gaps = 10/175 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V+ GDIT+ DVI+N++N GV AI AG A+ C + + QG P G Sbjct: 939 TFEVLSGDITRETCDVIINSSNRDFTLKSGVSKAILDGAGWAVQVECAQQARAQGH-PPG 997 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 H ++T AG LP+KA+VH N ++ +L+L ++ S AFPA+ T Sbjct: 998 HMIVTSAGRLPSKAIVHV-------SISNNPADIKSTVYAALKLCEEKTFRSAAFPALGT 1050 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQV-YFVCYDEENA-HLYERLLTQQGD 176 GV G P AA A+ V V++F + + V + E H + ++ QG+ Sbjct: 1051 GVGGVPPAAVADAMVGAVADFAKKQPKSIHLAKIVIFQPEMLTHFHNSMMKMQGE 1105 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. 
Identities = 56/167 (33%), Positives = 77/167 (46%), Gaps = 4/167 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ V Q D+ L VD +VN AN +L GG+ A+ AAGP L + G G Sbjct: 501 QLSVSQADLCALQVDAVVNPANENLQHTGGLALALLEAAGPELQNTSNLYVAVNGALCAG 560 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + T A LP K V+H VGP + + E LL+ SLR TSVA PAIS Sbjct: 561 QVIATDACRLPCKHVIHAVGPRFSDHSREESVLLLRRVVTQSLREAERLGCTSVAVPAIS 620 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHL 166 +GV+G+P + A+ + V E +V V E+ A Sbjct: 621 SGVFGFPLSLCADTIAQAVWEHCGAAGGRGALREVQLVANTEQTAGA 667 Score = 119 bits (298), Expect = 5e-26, Method: Composition-based stats. Identities = 43/165 (26%), Positives = 73/165 (44%), Gaps = 6/165 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQ--QQGDC 60 R+ + +G+I VIVN + ++ + G V A+ RAAG L A LK + + Sbjct: 733 RVVLCKGNIEDQRSCVIVNTISETMNLDQGAVSRALLRAAGKGLQAAVLKEARLARLDQL 792 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G ++T L + V H V P W Q E + L L+ S++FPA Sbjct: 793 DPGSLLVTDGFKLRCQKVFHAVCPQWSASYQAE-KTLTSIISRCLKEAERLKMRSLSFPA 851 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEEN 163 I TG+ +P+ A + ++ V F + +V+ V + ++ Sbjct: 852 IGTGLLSFPKDLVARVLLEEVRTFSRKKTPQHLLKVFVVVHPSDS 896 >UniRef50_Q5V4P3 Putative uncharacterized protein n=1 Tax=Haloarcula marismortui RepID=Q5V4P3_HALMA Length = 166 Score = 133 bits (335), Expect = 3e-30, Method: Composition-based stats. 
Identities = 52/167 (31%), Positives = 77/167 (46%), Gaps = 8/167 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V+QGDI + D +VNAAN SL G GV GA+ RAAG L D + +G G Sbjct: 2 EFEVIQGDIAAQSADALVNAANTSLRMGSGVAGALKRAAGSGLNDEAVA----KGPVDLG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 T A DL A+ V+H G Q+ + +++A N+L A + SV FPAI Sbjct: 58 GVATTDAYDLDAEYVIHAAA--MPPGGQSTAESIRNATRNALAEADALNCESVVFPAIGC 115 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 G+ G+ I + E+ V + Y +++ +R+ Sbjct: 116 GIAGFDFEEGIRIICAVIEEYQPES--LTDVRLIAYSDDDFEGMQRV 160 >UniRef50_O28751 UPF0189 protein AF_1521 n=32 Tax=Euryarchaeota RepID=Y1521_ARCFU Length = 192 Score = 133 bits (334), Expect = 3e-30, Method: Composition-based stats. Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 12/180 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRA-AGPALLDAC---LKVRQQQGD 59 + + QGDIT+ IVNAAN L GGGV AI +A AG A L +R+Q G Sbjct: 13 TLKLAQGDITQYPAKAIVNAANKRLEHGGGVAYAIAKACAGDAGLYTEISKKAMREQFGR 72 Query: 60 --CPTGHAVITLAGDL---PAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSY 113 G V+T A +L K V HTVGP+ G E + L A+L L Sbjct: 73 DYIDHGEVVVTPAMNLEERGIKYVFHTVGPICSGMWSEELKEKLYKAFLGPLEKAEEMGV 132 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S+AFPA+S G+YG E ++ V F + + ++V V YD ++A + ++ + Sbjct: 133 ESIAFPAVSAGIYGCDLEKVVETFLEAVKNF--KGSAVKEVALVIYDRKSAEVALKVFER 190 >UniRef50_Q4T065 Chromosome undetermined SCAF11328, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4T065_TETNG Length = 566 Score = 132 bits (333), Expect = 5e-30, Method: Composition-based stats. 
Identities = 47/141 (33%), Positives = 71/141 (50%), Gaps = 5/141 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I + +GD+ L IVN ++ SL V +IHR AGP L D LK++ C Sbjct: 50 INAKIVLFKGDVALLNCTSIVNTSSESLNDKNPVSDSIHRLAGPELRDELLKLK----GC 105 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T L A+ ++HTVGP ++ + + L Y + L+LV S SV Sbjct: 106 RTGEAKLTKGFGLAARFIIHTVGPKYKTKYRTAAESSLYSCYRSVLQLVVEQSMASVGLC 165 Query: 120 AISTGVYGYPRAAAAEIAVKT 140 I+T GYP A +A++ Sbjct: 166 TITTSKRGYPLEEATHMALRE 186 >UniRef50_Q2ITR2 Appr-1-p processing n=1 Tax=Rhodopseudomonas palustris HaA2 RepID=Q2ITR2_RHOP2 Length = 127 Score = 132 bits (332), Expect = 6e-30, Method: Composition-based stats. Identities = 53/118 (44%), Positives = 69/118 (58%), Gaps = 4/118 (3%) Query: 46 LLDACLKVRQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSL 105 +L AC K+ G C TG A ITL DLPA+ V+H VGPVW GG ED+ L Y +L Sbjct: 1 MLAACRKL----GGCATGDAKITLGYDLPARHVIHAVGPVWHGGRSGEDEALASCYRRAL 56 Query: 106 RLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEEN 163 +L + S+AF AISTGVYG+P AA IAV + + A ++V F C+ E + Sbjct: 57 QLCRQHGLASIAFSAISTGVYGFPPERAAPIAVAACIDALRTAAPVDRVVFCCFSEPS 114 >UniRef50_A0CX06 Chromosome undetermined scaffold_3, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CX06_PARTE Length = 1064 Score = 131 bits (331), Expect = 7e-30, Method: Composition-based stats. 
Identities = 46/181 (25%), Positives = 85/181 (46%), Gaps = 11/181 (6%) Query: 1 MKTRIHVVQGDITKLA-VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKV---RQQ 56 ++ I + DIT++ VD IVN A+P+L GG+ GA+ RAAG LL+ + + + Sbjct: 700 LEQSIIIHNQDITQIKGVDAIVNVADPNLKNRGGICGAVFRAAGENLLEEEINMLFNKLG 759 Query: 57 QGDCPTGHAVITLAGDLP----AKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAAN 111 + T ++T + L K ++H VGP + Q + L +N L+ Sbjct: 760 RKQPETSEVIVTKSYRLGQENGPKYIIHAVGPKYNPQDPQKSKEQLNTCIVNILQKCQEY 819 Query: 112 SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 TSVA P IS + +P+ A+I + +F + P ++ + ++ +++ + Sbjct: 820 KITSVAIPPISEKNFDFPKQICAQIFHAALLQF--QFQNPMSIHIIDVRDKVVDIFKIIF 877 Query: 172 T 172 Sbjct: 878 K 878 Score = 54.5 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 59/170 (34%), Gaps = 5/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++QGDI + D I ++ +L+ G+ +H+ G + + ++ Sbjct: 2 LSLIQGDIIQQKADAIALPSDIALLKAPGL-KQLHQN-GQQQYNDSITNQKPFSQIQQTS 59 Query: 65 AVITLAGDLPAKAVVHTVGP--VWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + + K +++ + P E LL+ + N + S+ P + Sbjct: 60 VITLPLQNNQFKYIIYCIVPKSDLNNQELQLSLLLELLFENIFDEITFLKLQSILIPVLG 119 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 + +A +++ + + FV ++ R L+ Sbjct: 120 CDNADFTIQEFL-MAFQSIYAKQRDNLKDVNLIFVSQSQQEYEPVRRFLS 168 >UniRef50_B8HYS6 Appr-1-p processing domain protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HYS6_CYAP4 Length = 694 Score = 129 bits (326), Expect = 3e-29, Method: Composition-based stats. 
Identities = 46/177 (25%), Positives = 74/177 (41%), Gaps = 6/177 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 T + V DIT+L DVIV++ + L GGGV AI AAG + +++ Sbjct: 397 NTTVRVQYCDITQLEADVIVSSDDIHLSMGGGVSEAILLAAGEVAWEEA----RRRVPLK 452 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G IT AG L A+ + H ++ Q L++ L + A + S+A PA+ Sbjct: 453 LGEIAITTAGHLKARQLFHAAVLDYQQQTQTTVDLIRSVTRKCLEICHAQGFQSIALPAL 512 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENA--HLYERLLTQQGD 176 +TG G +A + + + + +V Y + H+ R Q D Sbjct: 513 ATGTAGLSPEQSAIAMMLEILPHLNQETSLRRVTIALYSRQGLPHHILTRFYLQVSD 569 >UniRef50_UPI000180BD0C PREDICTED: similar to Ci-Rhysin2/Deltex3-a n=1 Tax=Ciona intestinalis RepID=UPI000180BD0C Length = 578 Score = 129 bits (325), Expect = 4e-29, Method: Composition-based stats. Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 9/170 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVR--QQQGDCPT 62 + V G+I VD IVNAAN + G GV GAI + G C + +Q Sbjct: 384 VSVAHGNIALQDVDAIVNAANKYIQNGSGVTGAIFKQGGSKFEQLCKEAMKHRQNRSLKV 443 Query: 63 GHAV-ITLAGDLPAKAVVHTVGPVWRGGEQNED--QLLQDAYLNSLRLVAANSYTSVAFP 119 G V + AG+L K V+H VGP W+ ++ LL+D L+ L+ +++A P Sbjct: 444 GEVVSVKAAGNLQCKRVLHLVGPQWKNYSHKDEAYHLLEDGLLSVLKESNYCKASTLALP 503 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ----VYFVCYDEENAH 165 ++TG+YG P ++ F T + ++ + + D++ + Sbjct: 504 PVATGIYGTPLKLFVRAMNTALTCFETNISRHQRSLHYIRILSIDQDTVN 553 >UniRef50_Q460N3 Poly [ADP-ribose] polymerase 15 n=12 Tax=Eutheria RepID=PAR15_HUMAN Length = 656 Score = 129 bits (325), Expect = 4e-29, Method: Composition-based stats. 
Identities = 50/176 (28%), Positives = 88/176 (50%), Gaps = 8/176 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + ++ GD+ + DVIVN+ +L GGG + A + AGP L L R+++ + G Sbjct: 69 LKLISGDVLYIWADVIVNSVPMNLQLGGGPLSRAFLQKAGPMLQKE-LDDRRRETEEKVG 127 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 + +T +L KAV+H V P W G + Q++ + L V S++S+ FP I T Sbjct: 128 NIFMTSGCNLDCKAVLHAVAPYWNNGAETSWQIMANIIKKCLTTVEVLSFSSITFPMIGT 187 Query: 124 GVYGYPRAAAAEIAVKTVSEFITR----HALPEQVYFVCY--DEENAHLYERLLTQ 173 G +P+A A++ + V E+ + + ++V+F+ Y D+E + T Sbjct: 188 GSLQFPKAVFAKLILSEVFEYSSSTRPITSPLQEVHFLVYTNDDEGCQAFLDEFTN 243 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. Identities = 45/173 (26%), Positives = 72/173 (41%), Gaps = 16/173 (9%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V GDI VDVIVN+ + GV AI AG A+ C + Q P Sbjct: 283 TFQVATGDIATEQVDVIVNSTARTFNRKSGVSRAILEGAGQAVESECAVLAAQ----PHR 338 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +IT G L K ++H G + ++ + L YTSV+ PAI T Sbjct: 339 DFIITPGGCLKCKIIIHVPG----------GKDVRKTVTSVLEECEQRKYTSVSLPAIGT 388 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLYERLLTQQ 174 G G A+ + + +F ++H+ P + V V + E +++ + ++ Sbjct: 389 GNAGKNPITVADNIIDAIVDFSSQHSTPSLKTVKVVIFQPELLNIFYDSMKKR 441 >UniRef50_UPI00005A5611 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 14 n=1 Tax=Canis lupus familiaris RepID=UPI00005A5611 Length = 575 Score = 129 bits (324), Expect = 5e-29, Method: Composition-based stats. 
Identities = 51/167 (30%), Positives = 79/167 (47%), Gaps = 7/167 (4%) Query: 11 DITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 DI ++ DVIVN +L GGG + A+ + AGP L RQ + G +T Sbjct: 111 DI-RVVADVIVNTVPMNLQLGGGQLSQALLQKAGPELQKELYATRQGT-EEEVGSIFMTS 168 Query: 70 AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYP 129 +L KAV+H V P W G + Q++ + L V S++S+ FP I TG +P Sbjct: 169 GCNLNCKAVLHVVAPHWDNGAGSSQQIMANIIKKCLTTVEEFSFSSITFPMIGTGSLRFP 228 Query: 130 RAAAAEIAVKTVSEFITR--HALPEQVYFVCY--DEENAHLYERLLT 172 +A AE+ + V F + ++V+F+ Y D+E + T Sbjct: 229 KAIFAELILSEVFRFSSSLWQKSLQEVHFLVYPGDDETLQAFLDKFT 275 Score = 115 bits (288), Expect = 6e-25, Method: Composition-based stats. Identities = 48/173 (27%), Positives = 75/173 (43%), Gaps = 16/173 (9%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + GDITK DVIVN+ + GV A+ AGPA+ + C Q P G Sbjct: 316 TFQIATGDITKEKADVIVNSTTRTFNLKSGVSKAVLEGAGPAVENECAVRAAQ----PHG 371 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +IT G L K ++H +G D ++ L YTSVA PAI T Sbjct: 372 EFIITQGGYLMCKIIIHVLG----------DNDVRKTVSAVLEECEQRKYTSVALPAIGT 421 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALP--EQVYFVCYDEENAHLYERLLTQQ 174 G G A+ + V +F +H+ P ++V V + + +++ + ++ Sbjct: 422 GSAGKNPTIVADDMISAVVDFSWKHSTPSLKKVKVVIFLSDLLNVFHDNMKKR 474 >UniRef50_D2VM45 Poly ADP-ribose polymerase family, member 14-like protein n=1 Tax=Naegleria gruberi RepID=D2VM45_NAEGR Length = 1557 Score = 129 bits (324), Expect = 5e-29, Method: Composition-based stats. 
Identities = 39/160 (24%), Positives = 73/160 (45%), Gaps = 5/160 (3%) Query: 2 KTRIHVVQGDI--TKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + I++ QGD+ K V+VN+AN L GGG+ G C + Sbjct: 831 NSFIYICQGDMFDKKWKAQVLVNSANDQLAHGGGIAAQCLEKCGKQFDQECKNITTSL-K 889 Query: 60 CPTGHAVITLAGDLPA-KAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVA 117 G V T AG+L + + + P++ N LL+ +N L + Y S+ Sbjct: 890 LKPGDVVPTTAGNLSYLSKIYNAIPPMYDSNNHLNSCSLLEQTVVNILTQAEKDGYCSII 949 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFV 157 PA+S+G++G+P + +I V+T+ ++ + + ++ + Sbjct: 950 IPALSSGIFGFPLDQSTDIIVRTIYKYAPQLSCLREIILI 989 >UniRef50_B5Y5Y4 Appr-1-p processing enzyme family protein n=2 Tax=Firmicutes RepID=B5Y5Y4_COPPD Length = 172 Score = 128 bits (323), Expect = 6e-29, Method: Composition-based stats. Identities = 52/165 (31%), Positives = 78/165 (47%), Gaps = 4/165 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++ +V GDITK DVIVNAAN GGGV AI +A G + D ++V Q P Sbjct: 7 DKVKLVMGDITKAEADVIVNAANGIGPMGGGVALAIKKAGGKVIEDEAIRVCSQLDPRP- 65 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G +T AG L AK + H V + R E + ++++ + L S+ PA++ Sbjct: 66 GDVYVTTAGGLKAKYIFHAVT-MKRPAEPSSVEIVRKCLQSLLEKAREMKVKSMVLPALA 124 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVY--FVCYDEENAH 165 TGV G P+ A++ + + + V F+ Y EE Sbjct: 125 TGVGGVPKKDVAKVYKEVLGDVKDIDITVMDVSGEFIKYLEEELK 169 >UniRef50_A2BJA7 A1pp, Appr-1-p processing enzyme n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BJA7_HYPBU Length = 199 Score = 128 bits (322), Expect = 9e-29, Method: Composition-based stats. 
Identities = 48/172 (27%), Positives = 84/172 (48%), Gaps = 8/172 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + +GDIT+ + +VN AN ++ GGGV GA+ RAAGP + + +++ P G Sbjct: 16 VEIARGDITEAECEAVVNPANSLMIMGGGVAGALRRAAGPEVEEEA----RRKAPVPVGE 71 Query: 65 AVITLAGDLPA--KAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A+ T AG L K ++H + R + + A L +LR + +A PA+ Sbjct: 72 AIHTGAGRLEPRIKYIIHA-PTMERPAMRTTQGKVVKAVLAALREAEKLNVGCLALPAMG 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYERLLTQ 173 GV G + E ++ + EF+ + LP ++ V Y E +A + + + Sbjct: 131 AGVGGLTARESLEAIMEALDEFLGSGGKLPPRIILVAYSERDAKQFLDEIKR 182 >UniRef50_A1R2V6 Putative uncharacterized protein n=1 Tax=Arthrobacter aurescens TC1 RepID=A1R2V6_ARTAT Length = 152 Score = 128 bits (321), Expect = 1e-28, Method: Composition-based stats. Identities = 58/146 (39%), Positives = 78/146 (53%), Gaps = 10/146 (6%) Query: 35 DGAIHRAAGPALLDACLKVRQQQ--GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN 92 DGAIHRAAG LL+AC ++R+ + P G AV T A LPA V+HTVGP G Q Sbjct: 2 DGAIHRAAGSELLEACRELRRTELPEGLPVGAAVATPAFRLPAHWVIHTVGPNRHAG-QT 60 Query: 93 EDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL-- 150 + LL + SL++ A S+AFPAIS G+YG+ AE+A V F + ++ Sbjct: 61 DPALLASCFRESLKVAAGLGARSLAFPAISAGIYGWDSRQVAEVAFDAVGSFSSSSSVSA 120 Query: 151 -----PEQVYFVCYDEENAHLYERLL 171 E V FV + EE ++ L Sbjct: 121 ASERGFELVEFVLFSEETTAVFRAAL 146 >UniRef50_B1H1M8 LOC100148704 protein (Fragment) n=5 Tax=Danio rerio RepID=B1H1M8_DANRE Length = 858 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. 
Identities = 50/171 (29%), Positives = 83/171 (48%), Gaps = 13/171 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V GDITK+ V+ +VN+ N SL GV GAI +A+GP ++ C + + P Sbjct: 280 TIRVSSGDITKVKVEAVVNSTNTSLNLSSGVSGAILKASGPTVVKEC----KAKAPQPED 335 Query: 64 HAVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 V+T AG+L +VH VG R G ++ + L+ N SV+FPA+ Sbjct: 336 GVVLTRAGNLTNCTHIVHMVGQTSRTG-------IRSSMAKVLKTCEENHIRSVSFPALG 388 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLT 172 TG P AA A+ +++F+ ++V+ V + + +++ + Sbjct: 389 TGAGHLPAAAVADAMTTALADFVKDSPKHLKRVHIVIFQPKLLPDFQKAVR 439 Score = 121 bits (304), Expect = 9e-27, Method: Composition-based stats. Identities = 46/172 (26%), Positives = 75/172 (43%), Gaps = 14/172 (8%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 T I V +G IT +V IVN N + GGV GAI +AAG ++ C ++ G Sbjct: 78 NTTIEVRKGSITTESVRGIVNTTNRDMSRRGGVSGAIFKAAGASVEQEC----RKHGPLQ 133 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A +T AG L ++H +GP + L T+V+FPAI Sbjct: 134 GDDAAVTAAGLLHCDLILHMLGPHSAAESRTR-------VRRVLERCEEKQITTVSFPAI 186 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLT 172 TG G AA ++ ++ +T+ + ++ ++ D +N + + L Sbjct: 187 GTGGGGVQAVDAATAMLQGFADHLTKSTSSVVKLIYIVIDRDN--ILQEFLN 236 Score = 41.0 bits (95), Expect = 0.015, Method: Composition-based stats. Identities = 8/60 (13%), Positives = 22/60 (36%), Gaps = 2/60 (3%) Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFIT--RHALPEQVYFVCYDEENAHLYERLLTQ 173 +A PAI TG G+ + + + +T + ++ + ++ + + Sbjct: 1 LAIPAIGTGRGGFSPRDSMRAMLTALQTHLTEPNSSTLSRITVLALQQDTFQAFRHCFKE 60 >UniRef50_UPI0001BC8416 Appr-1-p processing domain protein n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC8416 Length = 430 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. 
Identities = 47/161 (29%), Positives = 73/161 (45%), Gaps = 5/161 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 K+R+ + GD+T DVIV++ + L GGGV +I RA G + K C Sbjct: 11 KSRLIIKFGDLTSAVTDVIVSSDDAYLSMGGGVSASILRAGGDVIARDARKNV----PCQ 66 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G ++T AG L AK V H + W ++ ++ + SL +++ S+AFPA Sbjct: 67 MGDVIVTSAGKLEAKYVFHAITIDWSQKDEFTVEKSINSIIKKSLNVLSVLGLKSIAFPA 126 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE 161 I TG Y A +SEF++ ++Y D Sbjct: 127 IGTGAARYSLEDVAHFMSMAISEFLSNSDEELEIYIYLMDR 167 >UniRef50_UPI000180C4AC PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=1 Tax=Ciona intestinalis RepID=UPI000180C4AC Length = 1679 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 49/178 (27%), Positives = 77/178 (43%), Gaps = 9/178 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANP---SLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +I V+ DIT+ DVIVN+ + + G + AI AAG L C + C Sbjct: 950 QITVIVKDITQQKADVIVNSTTSFAVNSVDGSALSRAIRDAAGKKLQVECNAIPPAS-KC 1008 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 G V T DLP K + H + W ++N L+ SL + + +TS+AFP Sbjct: 1009 TWG-VVATKGYDLPCKNIYHAMIAGWDFSDENTSRSHLKRVVAKSLEMATKDGHTSIAFP 1067 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE--ENAHLYERLLTQQG 175 I G + P AE+ + V I + + +Y V + E ++E+ L + Sbjct: 1068 PIGCGGFNIPPYVLAEVIQEEVYA-IGSSSSLQNIYIVVHPSQPELTKVFEQKLNKSS 1124 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. 
Identities = 40/166 (24%), Positives = 73/166 (43%), Gaps = 6/166 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 I ++ D+T+ DVIV + SL + GV GAI + G ++ + G Sbjct: 757 NVTISSIKADLTQHQCDVIVVPISSSLSLSTQGVAGAIAKKGGKGIVKTMQVGKNFLG-- 814 Query: 61 PTGHAVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G VI+ G + K+++H V PV Q +L+ ++++ S+AFP Sbjct: 815 TGGEVVISGPGSITNCKSIIHAVVPV-SWALQGSQDVLKCCVRKAMQMANQQQAKSIAFP 873 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENA 164 A+ G G I V ++ ++ ++ E+V+ V + +N Sbjct: 874 ALCCGQGGGKPEDCISIMVHEITSYLRLNNSSIEEVHLVELNRDNV 919 Score = 70.7 bits (172), Expect = 2e-11, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 62/150 (41%), Gaps = 14/150 (9%) Query: 2 KTRIHVVQGDITKLAVDVIVNA--ANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + V G I + DVI+N+ + LM G + ++ G + ++++ Sbjct: 1133 NLNVEVCCGSIEEQHDDVIINSVTTSMDLMKSGKLSQSMAIKGGFVIKRQFHDMKRKYAS 1192 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 PT +T G+LP ++H +Q + SL++ ++VA P Sbjct: 1193 -PT--IRVTDGGNLPCDVIIHGATSKLG---------IQQLVIESLKIAEEMRKSTVAIP 1240 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA 149 A+ TG G P A++ + + +F ++ Sbjct: 1241 ALGTGAMGKPPLECAKLIKQGIIKFARKNP 1270 >UniRef50_UPI0001C3795F Appr-1-p processing domain protein n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C3795F Length = 232 Score = 125 bits (314), Expect = 6e-28, Method: Composition-based stats. 
Identities = 54/163 (33%), Positives = 78/163 (47%), Gaps = 4/163 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++++++ DIT L VD IV ANP L G G AI AG + K + G Sbjct: 2 KMYIIKADITTLNVDAIVLPANPQLKKGAGASQAIFEKAGE---EELRKKCKSIAPIDVG 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A+ T +LP++ ++H P W G NE LL AYL+SL++ +SVAFP +S Sbjct: 59 SAIPTGGYNLPSEFIIHAAVPRWVDGGHNEYVLLSSAYLSSLKVADRIGVSSVAFPLLSA 118 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHL 166 G+ A +A KT+ F + VY YD+ L Sbjct: 119 SNNGFDPRVAFYVAQKTIESF-KADKTLKDVYLTIYDKTAEAL 160 >UniRef50_D2MH71 Metallo-beta-lactamase family protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MH71_9BACT Length = 434 Score = 125 bits (314), Expect = 8e-28, Method: Composition-based stats. Identities = 45/159 (28%), Positives = 72/159 (45%), Gaps = 9/159 (5%) Query: 15 LAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLP 74 + VIVNAAN + GGGV G I RAAG + + ++Q P G AV+T G Sbjct: 1 MEAQVIVNAANSHGLMGGGVAGIIRRAAGSQVEEEA----RRQAPIPVGQAVLTSGGRTR 56 Query: 75 AKAVVHT-VGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAA 133 ++H P E+ ++ A +L+L + + ++A P + TGV A Sbjct: 57 FAGIIHAPTMPEPAMRIPVEN--VRLATRAALQLADEHGFVTLAIPGMGTGVGRVRPEDA 114 Query: 134 AEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 A+ V+ + +F R + V V D E ++ L+ Sbjct: 115 AQGMVEEIRQFQPRS--LQSVMLVDIDPEMVRAWQAALS 151 >UniRef50_UPI00016E2DD3 UPI00016E2DD3 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E2DD3 Length = 1673 Score = 124 bits (313), Expect = 8e-28, Method: Composition-based stats. 
Identities = 54/175 (30%), Positives = 81/175 (46%), Gaps = 13/175 (7%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I V GDITK DVIVN++N + GV AI AAG A+ D C K+ P Sbjct: 1096 SVTIQAVTGDITKETTDVIVNSSNNTFSLKKGVSKAILEAAGQAVEDECQKLAAS----P 1151 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 ++T G+L K +VH G QN+ L+ ++L++ ANSYTSV+FPAI Sbjct: 1152 NAGIIMTQPGNLQCKKIVHVTG-------QNKAFLISKVVKSALQMCVANSYTSVSFPAI 1204 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQQ 174 TG A+ V + + +++ V V + + + + Q+ Sbjct: 1205 GTGQGNIKATEVADAMFDAVIDELRQNSSTTLNTVRIVVFQPPMLNDFYTSMQQR 1259 Score = 115 bits (289), Expect = 6e-25, Method: Composition-based stats. Identities = 47/175 (26%), Positives = 78/175 (44%), Gaps = 10/175 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I +V G+I DV VN+ L + G + A+ AAGP L D LK + G G Sbjct: 901 ITLVVGNIEDATTDVTVNSVFNDLDLNRGALSRALLHAAGPQLQDF-LKAQNSSGTL--G 957 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 ++T L + V H V P + Q L + + L+ + TS++FP+I T Sbjct: 958 EIIMTEGCQLKSMFVYHAVTPASYNAQ--AVQALGGIFRDCLKKAEDSGMTSISFPSIGT 1015 Query: 124 GVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCY--DEENAHLYERLLTQQ 174 G G+P+ AA++ + +F ++ +V + Y D E + L ++ Sbjct: 1016 GGLGFPKDLAAQMLYDEILKFSSKRQTKRLAEVTIILYSGDTETQQAFTAELKKK 1070 Score = 97.6 bits (242), Expect = 1e-19, Method: Composition-based stats. 
Identities = 43/150 (28%), Positives = 62/150 (41%), Gaps = 7/150 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T I V + DI V +V+ ANP G+ A+ +AAGP L + C ++ +G Sbjct: 684 TEIFVCKADICSYPVHAVVSYANPDFRFTSGLQRALLKAAGPQLQEDCDRLIHLKGRLKP 743 Query: 63 GHAVIT-LAGDLPAKAVVHTVGPVWRGGE---QNEDQLLQDAYLNSLRLVAANSY---TS 115 G VIT G L + ++H V P GG+ L+ A SL L Sbjct: 744 GDNVITAAGGQLCCRNIIHAVAPKLDGGQIIFVKRVAQLKKAIKGSLELAEKKGCQLVRQ 803 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFI 145 + A S + +P AAAE + E Sbjct: 804 LKMNAKSLVLVAHPLEAAAESIRSALKEHF 833 >UniRef50_A2DE53 Appr-1-p processing enzyme family protein n=1 Tax=Trichomonas vaginalis RepID=A2DE53_TRIVA Length = 270 Score = 124 bits (313), Expect = 9e-28, Method: Composition-based stats. Identities = 44/175 (25%), Positives = 77/175 (44%), Gaps = 13/175 (7%) Query: 1 MKTRIHVVQ-GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + I + + GD T+L D ++N + + GG + +I+ AAGP L AC ++ G Sbjct: 41 INNLISIWKCGDSTRLKCDAVINRTDNNFSSGGALFTSINNAAGPQLAQACRQI----GH 96 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 C + V+T LPAK V+HTVGP ++D L+ + + S S+ Sbjct: 97 CDDCNTVVTPGFSLPAKYVIHTVGPT-----GDDDPELESTMDSVFSHIDGESIRSIGMA 151 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA---LPEQVYFVCYDEENAHLYERLL 171 G+ A +IA +F+ +++ F+ + ++ RLL Sbjct: 152 PFFIENNGFSLGHATQIAFSKTRKFLENPENRQKVDRIVFIVTQPHSIPIFVRLL 206 >UniRef50_C1XFR0 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=2 Tax=Meiothermus RepID=C1XFR0_MEIRU Length = 163 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. 
Identities = 55/172 (31%), Positives = 80/172 (46%), Gaps = 11/172 (6%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI V QGDIT+ A D IVNAAN L+ G GV GAI R GP++ C + G Sbjct: 2 ARIQVAQGDITEFAGDAIVNAANNHLILGSGVAGAIRRRGGPSIQGECD----RHGPIRV 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A +T AG LP + V+H G + ++ A +LRL + +AFP + Sbjct: 58 GEAALTGAGQLPVRKVIHAAV---LGDQPATLDTVRSATQAALRLALEHRLYRLAFPLLG 114 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 TGV G AE+ + + P ++ Y + +A + L ++ Sbjct: 115 TGVGGLGVPQVAEVMLDE----LEAAPDPLEITLYGYSQADAEAIRQALARR 162 >UniRef50_UPI000180B63C PREDICTED: similar to Ci-Rhysin2/Deltex3-a n=1 Tax=Ciona intestinalis RepID=UPI000180B63C Length = 897 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 51/168 (30%), Positives = 77/168 (45%), Gaps = 6/168 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 R+ V G++ DVIVNAAN L G GV GAI + G A C + + Sbjct: 563 RVSVGMGNVAIQDTDVIVNAANNRLENGVGVTGAIFKQGGHAFQIECQNAMRARRGQLLA 622 Query: 62 TGHAVITLA-GDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 G AV+ A G+L + V+H VGP W + +L ++ L ++ + ++A Sbjct: 623 VGEAVMVNATGNLKCRKVIHLVGPQWHSYIDKNKCCSVLIQGIMSVLVEASSVNAKTIAI 682 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAH 165 P +STGVYG P A E+ K + R+ +Q+ + DE Sbjct: 683 PPVSTGVYGVPVAVFVEMVKKCLGILKQRNDITLKQIRILSIDEPTVR 730 >UniRef50_UPI00006CE511 hypothetical protein TTHERM_00141050 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CE511 Length = 267 Score = 121 bits (305), Expect = 8e-27, Method: Composition-based stats. 
Identities = 43/172 (25%), Positives = 79/172 (45%), Gaps = 4/172 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I +++G+I +D IVN + LM + +A L V+ +G Sbjct: 27 EIIILKGNICNENIDCIVNWVDCFLMNERT--YILKQALNDKLKKELDSVKHSKGILTLN 84 Query: 64 HAVITLAGDL-PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 IT G L K ++H+ P+WRGG + E Q +++ ++L + +S+ F S Sbjct: 85 DCFITSPGKLQNTKKIIHSTLPLWRGGHEKELQYFEESITQCIQLAINQNMSSIGFTQDS 144 Query: 123 TGVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLLTQ 173 + ++G P AEI +++ F T + ++VYF+ D +Y+ L + Sbjct: 145 SDIFGIPLQDCAEILIQSFYRFATFKDTSIKRVYFIHQDSSAIQVYKNKLLK 196 >UniRef50_UPI00006A2286 UPI00006A2286 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2286 Length = 566 Score = 121 bits (304), Expect = 1e-26, Method: Composition-based stats. Identities = 51/177 (28%), Positives = 77/177 (43%), Gaps = 6/177 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 K + V+Q I DVIVN L + + A+ AGP L L Q Sbjct: 11 KELLKVIQQAIEDSTTDVIVNNVGQKLQLNEWQISRALAARAGPQLQQ-LLSNSSQGASA 69 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVW--RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 P G T +L V+H V P W RG Q+L+ + + L+L S S++ Sbjct: 70 PNGSVFSTDGCNLNCAKVLHVVMPQWDRRGFSLTHTQVLRKSIKSCLKLTEQQSLQSISI 129 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHLYERLLTQ 173 PAI TG GYP+ A + K + F ++ ++V V + D EN ++ + L + Sbjct: 130 PAIGTGKLGYPKDLVAAVTFKEILHFSSKAQSLQEVNIVLHPRDTENIQVFSKELQR 186 Score = 109 bits (272), Expect = 5e-23, Method: Composition-based stats. 
Identities = 51/177 (28%), Positives = 75/177 (42%), Gaps = 18/177 (10%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 V GDITK D+IVN+ N S GV AI AAGP++ C QQ G Sbjct: 228 QVKTGDITKENTDIIVNSTNNSFTLQSGVSKAILDAAGPSVTLEC----QQLGPQGQTSF 283 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 ++T +G+L ++H VG Q + + +Q L+ L+ SVA PA+ TG Sbjct: 284 ILTQSGNLQCTNILHVVG-------QTDPKCIQRCVLDILQECNRLQMASVALPAMGTGE 336 Query: 126 YGYPRAAA----AEIAVKTVSEFITRHALPE--QVYFVCYDEENA-HLYERLLTQQG 175 G A + V +F+ A P V V + + Y + ++G Sbjct: 337 TGAQVRLGHSIVAGAMLDGVEDFVKSQAAPSVTTVRIVIFQQPMLSDFYTSMKGKEG 393 >UniRef50_UPI000180D216 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=1 Tax=Ciona intestinalis RepID=UPI000180D216 Length = 1716 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 48/175 (27%), Positives = 74/175 (42%), Gaps = 14/175 (8%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQ-GD 59 I V + DIT+ +NA+N L + GGV G I G ++ K+RQQ G Sbjct: 905 NVNIRVFKADITEHKCGAFINASNDMLELREGGVSGNILHKGGASIKGELDKLRQQNIGM 964 Query: 60 CPTGHAVITLAGDL-PAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVA 117 G T +G L K ++H VGP W+ N L+ +L + SVA Sbjct: 965 FLPGDVRSTTSGSLRNCKRIIHVVGPDWKKSSHSNNCNYLKACVHGALVEADKHKLASVA 1024 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL----------PEQVYFVCYDEE 162 PA+S G+YG + + V+T+ ++ T + + C+ EE Sbjct: 1025 IPAVSCGIYGGVPSVCIRLIVETIQQYFTGNKSKVTMVDLIENSKDDVINCFMEE 1079 Score = 89.9 bits (222), Expect = 4e-17, Method: Composition-based stats. 
Identities = 42/178 (23%), Positives = 65/178 (36%), Gaps = 18/178 (10%) Query: 2 KTRI-----HVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQ 55 TRI + GDIT+ + DVIVN+ N + G V I + G + C + Sbjct: 1119 NTRIGSLNVELCNGDITQDSSDVIVNSTNSNFDLRNGKVSPQILKKGGNVISQQCTQNNH 1178 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 + IT G L K ++H V PV ++ L V A + Sbjct: 1179 PLNKP---NMRITDGGKLKCKQIIHVVVPV-------NQMQIEQVVSLILETVDALHKSV 1228 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 VA PAI TG + A+ + + H + + V V + + + L Sbjct: 1229 VALPAIGTGNLNISPSKVAQYIRTGIVYYTANHNPSHLKTVKVVVFQQNMMQDFHTEL 1286 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0A8D8 UPF0189 protein ymdB n=66 Tax=Proteobacteria Rep... 225 5e-58 UniRef50_P67342 UPF0189 protein ymdB n=62 Tax=Bacteria RepID=YMD... 223 2e-57 UniRef50_Q8EYT0 UPF0189 protein LA_4133 n=9 Tax=cellular organis... 219 2e-56 UniRef50_A4W960 Appr-1-p processing domain protein n=5 Tax=Bacte... 219 4e-56 UniRef50_Q3AEI4 Putative uncharacterized protein n=6 Tax=Bacteri... 213 3e-54 UniRef50_Q8RB30 UPF0189 protein TTE0995 n=12 Tax=Bacteria RepID=... 212 6e-54 UniRef50_D1N530 Appr-1-p processing domain protein n=4 Tax=cellu... 207 9e-53 UniRef50_Q9WYX8 UPF0189 protein TM_0508 n=15 Tax=cellular organi... 206 2e-52 UniRef50_Q8KAE4 UPF0189 protein CT2219 n=7 Tax=Bacteria RepID=Y2... 206 4e-52 UniRef50_C1PFE0 Appr-1-p processing domain protein n=5 Tax=Bacte... 205 5e-52 UniRef50_D1AKA5 Appr-1-p processing domain protein n=2 Tax=Bacte... 205 5e-52 UniRef50_Q8PHB6 UPF0189 protein XAC3343 n=14 Tax=Proteobacteria ... 204 8e-52 UniRef50_Q8Q0F9 UPF0189 protein MM_0177 n=18 Tax=cellular organi... 202 6e-51 UniRef50_C7NE27 Appr-1-p processing domain protein n=2 Tax=Lepto... 201 8e-51 UniRef50_C7RS37 Appr-1-p processing domain protein n=15 Tax=cell... 
199 2e-50 UniRef50_A8ZUR5 Appr-1-p processing domain protein n=3 Tax=cellu... 199 2e-50 UniRef50_C1H4Y3 MACRO domain-containing protein n=4 Tax=cellular... 199 4e-50 UniRef50_Q047N9 Predicted phosphatase, histone macroH2A1 family ... 197 2e-49 UniRef50_Q1HPZ5 LRP16 protein n=1 Tax=Bombyx mori RepID=Q1HPZ5_B... 197 2e-49 UniRef50_C8NAC1 RNase III regulator YmdB n=3 Tax=cellular organi... 197 2e-49 UniRef50_A1Z1Q3 MACRO domain-containing protein 2 n=55 Tax=cellu... 196 3e-49 UniRef50_Q8TQD0 UPF0189 protein MA_1614 n=2 Tax=cellular organis... 196 3e-49 UniRef50_Q0CQJ0 Protein LRP16 n=10 Tax=cellular organisms RepID=... 196 3e-49 UniRef50_A7RJ44 Predicted protein (Fragment) n=4 Tax=Eukaryota R... 196 3e-49 UniRef50_Q66HV6 Zgc:92353 n=1 Tax=Danio rerio RepID=Q66HV6_DANRE 196 3e-49 UniRef50_Q9HXU7 UPF0189 protein PA3693 n=16 Tax=Bacteria RepID=Y... 196 4e-49 UniRef50_Q8Y2K1 UPF0189 protein RSc0334 n=39 Tax=cellular organi... 195 5e-49 UniRef50_A5GC80 Appr-1-p processing domain protein n=2 Tax=Desul... 195 6e-49 UniRef50_Q6PHJ5 Zgc:65960 n=11 Tax=cellular organisms RepID=Q6PH... 195 7e-49 UniRef50_Q71W03 UPF0189 protein LMOf2365_2748 n=23 Tax=Bacteria ... 193 2e-48 UniRef50_A4R3Q9 Putative uncharacterized protein n=1 Tax=Magnapo... 193 2e-48 UniRef50_C1BR35 MACRO domain-containing protein 1 n=2 Tax=Caligu... 193 2e-48 UniRef50_B7PF53 MACRO domain-containing protein, putative n=2 Ta... 193 2e-48 UniRef50_C6BB95 Appr-1-p processing domain protein n=4 Tax=cellu... 193 2e-48 UniRef50_C6RT62 Appr-1-p processing n=2 Tax=Acinetobacter radior... 193 2e-48 UniRef50_B7C8M6 Putative uncharacterized protein n=3 Tax=Bacteri... 191 8e-48 UniRef50_Q0UQZ6 Putative uncharacterized protein n=2 Tax=Leotiom... 190 1e-47 UniRef50_Q9BQ69 MACRO domain-containing protein 1 n=11 Tax=Tetra... 189 4e-47 UniRef50_Q2LUU1 Appr-1-p histone processing protein n=5 Tax=Bact... 189 5e-47 UniRef50_Q8K4G6 MACRO domain-containing protein 1 (Fragment) n=5... 
188 6e-47 UniRef50_A6BCW6 Putative uncharacterized protein n=5 Tax=Bacteri... 188 8e-47 UniRef50_Q8EP31 Hypothetical conserved protein n=1 Tax=Oceanobac... 188 8e-47 UniRef50_B6Q324 LRP16 family protein n=3 Tax=Trichocomaceae RepI... 188 8e-47 UniRef50_B9MLL8 Appr-1-p processing domain protein n=6 Tax=Clost... 187 1e-46 UniRef50_Q47EQ7 Appr-1-p processing n=1 Tax=Dechloromonas aromat... 187 1e-46 UniRef50_B2ACK5 Predicted CDS Pa_3_1270 n=5 Tax=Eukaryota RepID=... 187 1e-46 UniRef50_A4YFR3 Appr-1-p processing domain protein n=9 Tax=Therm... 186 3e-46 UniRef50_A0LGZ1 Appr-1-p processing domain protein n=1 Tax=Syntr... 185 4e-46 UniRef50_A2SS36 Appr-1-p processing domain protein n=26 Tax=cell... 185 5e-46 UniRef50_C4V1Q4 Appr-1-p processing domain protein n=3 Tax=Bacte... 185 6e-46 UniRef50_Q93SX7 UPF0189 protein n=2 Tax=Acinetobacter RepID=Y189... 185 6e-46 UniRef50_Q5KCD7 Putative uncharacterized protein n=1 Tax=Filobas... 185 7e-46 UniRef50_B2JCA0 Appr-1-p processing domain protein n=13 Tax=Prot... 184 1e-45 UniRef50_Q9EYI6 UPF0189 protein in sno 5'region n=22 Tax=Bacteri... 184 1e-45 UniRef50_C8VIG2 LRP16 family protein (AFU_orthologue; AFUA_3G138... 184 2e-45 UniRef50_A7IGI6 Appr-1-p processing domain protein n=53 Tax=cell... 183 2e-45 UniRef50_Q985D2 UPF0189 protein mll7730 n=12 Tax=Bacteria RepID=... 183 3e-45 UniRef50_P67344 UPF0189 protein SA0314 n=54 Tax=Staphylococcus R... 182 3e-45 UniRef50_B8HYS5 Appr-1-p processing domain protein n=2 Tax=Cyano... 182 3e-45 UniRef50_Q1R0S7 Appr-1-p processing n=12 Tax=Proteobacteria RepI... 182 4e-45 UniRef50_D1ZDH8 Whole genome shotgun sequence assembly, scaffold... 182 5e-45 UniRef50_B6SKT6 Protein LRP16 n=12 Tax=cellular organisms RepID=... 180 2e-44 UniRef50_Q03IQ8 Predicted phosphatase homologous to the C-termin... 179 3e-44 UniRef50_A7BY23 Putative uncharacterized protein n=3 Tax=Beggiat... 179 3e-44 UniRef50_C7N880 Predicted phosphatase, C-terminal domain of hist... 
179 5e-44 UniRef50_C4Q6S1 Expressed protein n=1 Tax=Schistosoma mansoni Re... 178 6e-44 UniRef50_B5YAF3 Conserved protein n=2 Tax=Dictyoglomus RepID=B5Y... 178 9e-44 UniRef50_B9XAD9 Appr-1-p processing domain protein n=1 Tax=bacte... 178 9e-44 UniRef50_Q4P1I0 Putative uncharacterized protein n=1 Tax=Ustilag... 177 1e-43 UniRef50_B9S4E3 Protein LRP16, putative n=2 Tax=cellular organis... 177 1e-43 UniRef50_Q9HJ67 UPF0189 protein Ta1105 n=1 Tax=Thermoplasma acid... 177 1e-43 UniRef50_A0Q2I9 Appr-1-p processing enzyme family protein n=3 Ta... 177 2e-43 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 176 2e-43 UniRef50_UPI000194CBCB PREDICTED: poly (ADP-ribose) polymerase f... 176 2e-43 UniRef50_C2LSS3 Protein in Tap1-dppD intergenic region n=1 Tax=S... 176 3e-43 UniRef50_A6GJ81 Putative uncharacterized protein n=1 Tax=Plesioc... 175 4e-43 UniRef50_B8LP86 Putative uncharacterized protein n=1 Tax=Picea s... 175 4e-43 UniRef50_C8WYT5 Appr-1-p processing domain protein n=1 Tax=Desul... 175 5e-43 UniRef50_C9LYS3 Appr-1-p processing enzyme family domain protein... 175 7e-43 UniRef50_C9KLM2 Appr-1-p processing enzyme family domain protein... 174 9e-43 UniRef50_B8DKL2 Appr-1-p processing domain protein n=3 Tax=Desul... 173 3e-42 UniRef50_Q93RG0 UPF0189 protein in tap1-dppD intergenic region n... 173 3e-42 UniRef50_Q0B030 Phosphatase n=1 Tax=Syntrophomonas wolfei subsp.... 172 3e-42 UniRef50_UPI00006A2284 UPI00006A2284 related cluster n=1 Tax=Xen... 172 3e-42 UniRef50_UPI0000ECB76F Poly [ADP-ribose] polymerase 14 (EC 2.4.2... 172 5e-42 UniRef50_B8I4Z8 Appr-1-p processing domain protein n=7 Tax=Bacte... 172 5e-42 UniRef50_D1U7C0 Appr-1-p processing domain protein n=1 Tax=Desul... 172 5e-42 UniRef50_Q8ZXT3 UPF0189 protein PAE1111 n=10 Tax=Thermoprotei Re... 172 5e-42 UniRef50_UPI000186F16D conserved hypothetical protein n=1 Tax=Pe... 172 6e-42 UniRef50_C4FEN5 Putative uncharacterized protein n=1 Tax=Bifidob... 
172 7e-42 UniRef50_C4FT52 Putative uncharacterized protein n=1 Tax=Catonel... 171 7e-42 UniRef50_Q97AU0 UPF0189 protein TV0719 n=2 Tax=cellular organism... 171 8e-42 UniRef50_C8NG26 Appr-1-p processing enzyme family domain protein... 171 1e-41 UniRef50_C4M8N0 Putative uncharacterized protein n=2 Tax=Entamoe... 171 1e-41 UniRef50_B6KFB3 Appr-1-p processing enzyme family domain-contain... 170 2e-41 UniRef50_A7B8S3 Putative uncharacterized protein n=1 Tax=Actinom... 170 2e-41 UniRef50_A0L536 Appr-1-p processing domain protein n=1 Tax=Magne... 170 2e-41 UniRef50_C2D2Z2 Appr-1-p processing enzyme family domain protein... 169 4e-41 UniRef50_A5ZAB5 Putative uncharacterized protein n=4 Tax=Clostri... 168 7e-41 UniRef50_A8FSV2 Putative uncharacterized protein n=1 Tax=Shewane... 168 8e-41 UniRef50_UPI0001B4DEB3 hypothetical protein ShygA5_39675 n=1 Tax... 168 9e-41 UniRef50_Q30ZH6 Appr-1-p processing n=1 Tax=Desulfovibrio desulf... 167 1e-40 UniRef50_UPI000050FFC7 predicted phosphatase, C-terminal domain ... 167 1e-40 UniRef50_Q460N5 Poly [ADP-ribose] polymerase 14 n=19 Tax=Eutheri... 167 1e-40 UniRef50_B0A8R6 Putative uncharacterized protein n=3 Tax=Bacteri... 167 1e-40 UniRef50_C5CIT5 Appr-1-p processing domain protein n=1 Tax=Kosmo... 166 2e-40 UniRef50_Q87JZ5 UPF0189 protein VPA0103 n=5 Tax=Proteobacteria R... 166 3e-40 UniRef50_Q17432 Protein B0035.3, confirmed by transcript evidenc... 166 3e-40 UniRef50_Q22CT8 Appr-1-p processing enzyme family protein n=1 Ta... 166 3e-40 UniRef50_B9WC14 Putative uncharacterized protein n=5 Tax=Candida... 165 4e-40 UniRef50_Q5XC09 UPF0189 protein M6_Spy0919 n=20 Tax=Streptococcu... 165 4e-40 UniRef50_A8H4N3 Appr-1-p processing domain protein n=1 Tax=Shewa... 165 4e-40 UniRef50_B0EH33 Putative uncharacterized protein n=2 Tax=Entamoe... 165 6e-40 UniRef50_UPI0000E4D641 UPI0000E4D641 related cluster n=2 Tax=Dan... 165 7e-40 UniRef50_UPI0000E80997 PREDICTED: similar to Poly [ADP-ribose] p... 
165 8e-40 UniRef50_C1QBX0 Predicted phosphatase similar to C-terminal doma... 164 1e-39 UniRef50_B1KG04 Appr-1-p processing domain protein n=1 Tax=Shewa... 164 1e-39 UniRef50_A8JCH3 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 163 3e-39 UniRef50_C3Y5X0 Putative uncharacterized protein n=3 Tax=Branchi... 163 3e-39 UniRef50_C7GZB8 Appr-1-p processing enzyme family domain protein... 163 3e-39 UniRef50_A1L291 LOC799852 protein (Fragment) n=5 Tax=Danio rerio... 162 3e-39 UniRef50_C9XM94 Putative uncharacterized protein n=6 Tax=Clostri... 162 3e-39 UniRef50_D1BM15 Appr-1-p processing domain protein n=15 Tax=Bact... 162 4e-39 UniRef50_C3YH95 Putative uncharacterized protein n=2 Tax=Eumetaz... 162 4e-39 UniRef50_A7T167 Protein GDAP2 homolog n=1 Tax=Nematostella vecte... 162 5e-39 UniRef50_A2FMC7 Appr-1-p processing enzyme family protein n=1 Ta... 162 5e-39 UniRef50_A9SRF5 Predicted protein n=1 Tax=Physcomitrella patens ... 162 6e-39 UniRef50_C1SPD7 Predicted phosphatase similar to C-terminal doma... 162 7e-39 UniRef50_C4DDL7 Predicted phosphatase similar to C-terminal doma... 161 8e-39 UniRef50_A9WK70 Appr-1-p processing domain protein n=3 Tax=Chlor... 161 9e-39 UniRef50_UPI0000F2CC13 PREDICTED: similar to B aggressive lympho... 161 9e-39 UniRef50_A1D5K4 Appr-1-p processing enzyme family protein n=1 Ta... 161 1e-38 UniRef50_D0NNH8 Putative uncharacterized protein n=3 Tax=Phytoph... 161 1e-38 UniRef50_B9YC00 Putative uncharacterized protein n=1 Tax=Holdema... 160 1e-38 UniRef50_D1VVA5 Putative uncharacterized protein n=1 Tax=Peptoni... 160 1e-38 UniRef50_A8M6L5 Appr-1-p processing domain protein n=2 Tax=Micro... 160 2e-38 UniRef50_Q4SK43 Chromosome 2 SCAF14570, whole genome shotgun seq... 159 3e-38 UniRef50_C7H575 RNase III regulator YmdB n=2 Tax=Faecalibacteriu... 159 3e-38 UniRef50_C4V152 Appr-1-p processing protein n=2 Tax=Clostridiale... 159 4e-38 UniRef50_B7C850 Putative uncharacterized protein n=1 Tax=Eubacte... 
159 5e-38 UniRef50_A7HJC7 Appr-1-p processing domain protein n=1 Tax=Fervi... 159 5e-38 UniRef50_D2V113 Appr-1-p domain-containing protein n=1 Tax=Naegl... 159 5e-38 UniRef50_Q6NRC6 MGC83934 protein n=3 Tax=Xenopus RepID=Q6NRC6_XENLA 158 9e-38 UniRef50_A2DTG7 Appr-1-p processing enzyme family protein n=2 Ta... 158 9e-38 UniRef50_C0W547 Appr-1-p processing domain protein n=1 Tax=Actin... 157 1e-37 UniRef50_B0EF86 MACRO domain-containing protein, putative n=2 Ta... 157 2e-37 UniRef50_UPI000196AD9C hypothetical protein CATMIT_00588 n=1 Tax... 157 2e-37 UniRef50_A3LYE6 Putative uncharacterized protein n=1 Tax=Pichia ... 157 2e-37 UniRef50_C3Y5Q2 Putative uncharacterized protein n=1 Tax=Branchi... 156 2e-37 UniRef50_B3RYC4 Putative uncharacterized protein n=1 Tax=Trichop... 156 2e-37 UniRef50_D2S4L6 Appr-1-p processing domain protein n=4 Tax=Actin... 156 3e-37 UniRef50_UPI000194CBC9 PREDICTED: similar to B aggressive lympho... 156 3e-37 UniRef50_A1WVH3 Appr-1-p processing domain protein n=14 Tax=Bact... 156 4e-37 UniRef50_C0PSL1 Putative uncharacterized protein n=1 Tax=Picea s... 156 4e-37 UniRef50_A8STD9 Putative uncharacterized protein n=1 Tax=Coproco... 155 5e-37 UniRef50_Q2TX23 Predicted phosphatase homologous to the C-termin... 155 6e-37 UniRef50_C3Y6H9 Putative uncharacterized protein n=1 Tax=Branchi... 155 6e-37 UniRef50_A5D049 Predicted phosphatase n=4 Tax=Bacteria RepID=A5D... 155 6e-37 UniRef50_C3Y6H4 Putative uncharacterized protein n=1 Tax=Branchi... 154 1e-36 UniRef50_A2QSI2 Contig An08c0280, complete genome n=1 Tax=Asperg... 154 1e-36 UniRef50_C2L199 Putative uncharacterized protein n=1 Tax=Oribact... 154 1e-36 UniRef50_A4TAV6 Appr-1-p processing domain protein n=6 Tax=Actin... 154 1e-36 UniRef50_A8FQZ3 Putative uncharacterized protein n=1 Tax=Shewane... 154 2e-36 UniRef50_Q4DSL4 Putative uncharacterized protein n=4 Tax=Trypano... 153 2e-36 UniRef50_C4G1S1 Putative uncharacterized protein n=3 Tax=Abiotro... 
153 2e-36
UniRef50_Q0CEI7 Putative uncharacterized protein n=1 Tax=Aspergi...   153   3e-36
UniRef50_O67112 UPF0189 protein aq_987 n=4 Tax=cellular organism...   152   3e-36
UniRef50_C7Z089 Putative uncharacterized protein n=2 Tax=Nectria...   152   4e-36
UniRef50_Q8IXQ6 Poly [ADP-ribose] polymerase 9 n=27 Tax=Eutheria...   152   6e-36
UniRef50_A0CX10 Chromosome undetermined scaffold_3, whole genome...   152   6e-36
UniRef50_Q8B4N1 ORF-1 n=7 Tax=Infectious spleen and kidney necro...   152   7e-36
UniRef50_C9RQW9 Appr-1-p processing domain protein n=5 Tax=Bacte...   151   7e-36
UniRef50_C3Y5X5 Putative uncharacterized protein n=3 Tax=Branchi...   151   8e-36
UniRef50_C2DZH9 Appr-1-p processing protein n=4 Tax=Lactobacillu...   150   1e-35
UniRef50_A7S3X0 Predicted protein (Fragment) n=1 Tax=Nematostell...   150   2e-35
UniRef50_C3Y406 Putative uncharacterized protein n=2 Tax=Branchi...   150   2e-35
UniRef50_D2V337 Predicted protein (Fragment) n=1 Tax=Naegleria g...   149   3e-35
UniRef50_C5C222 Appr-1-p processing domain protein n=2 Tax=Actin...   149   4e-35
UniRef50_C2KRZ5 Appr-1-p processing domain protein n=2 Tax=Mobil...   148   9e-35
UniRef50_UPI000180B1B4 PREDICTED: similar to Poly [ADP-ribose] p...   148   9e-35
UniRef50_D0WKT6 Appr-1-p processing enzyme family domain protein...   148   9e-35
UniRef50_C3Y5X1 Putative uncharacterized protein n=1 Tax=Branchi...   147   2e-34
UniRef50_Q9NXN4 Ganglioside-induced differentiation-associated p...   147   2e-34
UniRef50_C5VD03 Appr-1-p processing enzyme family protein n=2 Ta...   147   2e-34
UniRef50_B1L625 Appr-1-p processing domain protein n=1 Tax=Candi...   147   2e-34
UniRef50_Q0UG78 Putative uncharacterized protein n=1 Tax=Phaeosp...   145   5e-34
UniRef50_A6LTB5 Appr-1-p processing domain protein n=3 Tax=Clost...   144   1e-33
UniRef50_C7HUZ2 RNase III regulator YmdB n=2 Tax=Anaerococcus Re...   144   1e-33
UniRef50_Q2SM57 Predicted phosphatase n=1 Tax=Hahella chejuensis...   144   2e-33
UniRef50_A7C4X9 Putative uncharacterized protein n=1 Tax=Beggiat...   144   2e-33
UniRef50_C3YS04 Putative uncharacterized protein (Fragment) n=1 ...   143   2e-33
UniRef50_Q94JV1 At1g69340/F10D13.28 n=23 Tax=Embryophyta RepID=Q...   143   2e-33
UniRef50_C3Y417 Putative uncharacterized protein (Fragment) n=1 ...   143   3e-33
UniRef50_D0NR00 Putative uncharacterized protein n=1 Tax=Phytoph...   142   3e-33
UniRef50_B9L2D9 Appr-1-p processing enzyme family protein n=2 Ta...   142   4e-33
UniRef50_D0MWM6 Putative uncharacterized protein n=1 Tax=Phytoph...   142   6e-33
UniRef50_UPI00005A247A PREDICTED: similar to H2A histone family,...   142   6e-33
UniRef50_A6SR30 Putative uncharacterized protein n=1 Tax=Botryot...   141   1e-32
UniRef50_Q4RS18 Histone H2A (Fragment) n=2 Tax=Tetraodontidae Re...   140   2e-32
UniRef50_C9YUB3 Putative uncharacterized protein n=1 Tax=Strepto...   140   2e-32
UniRef50_B7CC50 Putative uncharacterized protein n=1 Tax=Eubacte...   140   3e-32
UniRef50_B7PR73 Ganglioside induced differentiation associated p...   139   3e-32
UniRef50_C3YS03 Putative uncharacterized protein n=2 Tax=Branchi...   139   4e-32
UniRef50_UPI000180BD0B PREDICTED: similar to Poly [ADP-ribose] p...   138   6e-32
UniRef50_UPI00006A1CA6 poly (ADP-ribose) polymerase family, memb...   138   6e-32
UniRef50_UPI000196CD43 hypothetical protein CATMIT_02190 n=1 Tax...   138   6e-32
UniRef50_Q55AK6 U box domain-containing protein n=2 Tax=Eukaryot...   138   6e-32
UniRef50_B0P6L4 Putative uncharacterized protein n=1 Tax=Anaerot...   138   7e-32
UniRef50_UPI0000E8099B PREDICTED: similar to PARP9 protein n=2 T...   138   8e-32
UniRef50_B2VUH2 MACRO domain containing protein 1 n=1 Tax=Pyreno...   138   8e-32
UniRef50_A7EET2 Putative uncharacterized protein n=1 Tax=Sclerot...   138   8e-32
UniRef50_O07733 UPF0189 protein Rv1899c/MT1950 n=16 Tax=Mycobact...   138   1e-31
UniRef50_C3ZVW0 Putative uncharacterized protein n=1 Tax=Branchi...   137   1e-31
UniRef50_C8WJT1 Appr-1-p processing domain protein n=1 Tax=Egger...   137   2e-31
UniRef50_Q7JUR6 Protein GDAP2 homolog n=19 Tax=Neoptera RepID=GD...   137   2e-31
UniRef50_UPI0001C38755 appr-1-p processing domain-containing pro...   137   2e-31
UniRef50_A2BJA7 A1pp, Appr-1-p processing enzyme n=1 Tax=Hyperth...   136   3e-31
UniRef50_D1B7G8 Appr-1-p processing domain protein n=1 Tax=Therm...   135   6e-31
UniRef50_Q54PT1 Protein GDAP2 homolog n=1 Tax=Dictyostelium disc...   134   9e-31
UniRef50_A7T7L3 Predicted protein (Fragment) n=1 Tax=Nematostell...   133   2e-30
UniRef50_Q4RG95 Chromosome 12 SCAF15104, whole genome shotgun se...   132   8e-30
UniRef50_B5Y5Y4 Appr-1-p processing enzyme family protein n=2 Ta...   131   8e-30
UniRef50_A3DLM0 Appr-1-p processing domain protein n=1 Tax=Staph...   131   9e-30
UniRef50_O28751 UPF0189 protein AF_1521 n=32 Tax=Euryarchaeota R...   130   2e-29
UniRef50_Q9P0M6 Core histone macro-H2A.2 n=118 Tax=Eukaryota Rep...   129   4e-29
UniRef50_UPI0000E4815A PREDICTED: similar to LRP16 protein n=1 T...   129   5e-29
UniRef50_B0QWK9 Putative uncharacterized protein n=1 Tax=Haemoph...   129   5e-29
UniRef50_D2MH71 Metallo-beta-lactamase family protein n=1 Tax=Ca...   128   6e-29
UniRef50_B7P925 Histone H2A n=1 Tax=Ixodes scapularis RepID=B7P9...   128   8e-29
UniRef50_Q5V4P3 Putative uncharacterized protein n=1 Tax=Haloarc...   127   2e-28
UniRef50_A0CX06 Chromosome undetermined scaffold_3, whole genome...   126   3e-28
UniRef50_Q9YBE9 UPF0189 protein APE_1648.1 n=1 Tax=Aeropyrum per...   126   3e-28
UniRef50_UPI000180B63C PREDICTED: similar to Ci-Rhysin2/Deltex3-...   126   4e-28
UniRef50_B1H1M8 LOC100148704 protein (Fragment) n=5 Tax=Danio re...   126   4e-28
UniRef50_C1XFR0 Predicted phosphatase similar to C-terminal doma...   125   5e-28
UniRef50_Q460N3 Poly [ADP-ribose] polymerase 15 n=12 Tax=Eutheri...   125   6e-28
UniRef50_Q2ITR2 Appr-1-p processing n=1 Tax=Rhodopseudomonas pal...   125   9e-28
UniRef50_UPI00016E2DD3 UPI00016E2DD3 related cluster n=3 Tax=Tak...   124   1e-27
UniRef50_UPI000180BD0C PREDICTED: similar to Ci-Rhysin2/Deltex3-...   124   1e-27
UniRef50_D2VM45 Poly ADP-ribose polymerase family, member 14-lik...   124   1e-27
UniRef50_B8HYS6 Appr-1-p processing domain protein n=1 Tax=Cyano...   123   3e-27
UniRef50_D1R847 Putative uncharacterized protein n=1 Tax=Parachl...   121   1e-26
UniRef50_UPI000180D216 PREDICTED: similar to Poly [ADP-ribose] p...   120   2e-26
UniRef50_UPI0001C3795F Appr-1-p processing domain protein n=1 Ta...   120   2e-26
UniRef50_UPI00005A5611 PREDICTED: similar to poly (ADP-ribose) p...   120   2e-26
UniRef50_A9JRH9 Si:ch211-219a4.3 protein n=5 Tax=Clupeocephala R...   119   3e-26
UniRef50_A1R2V6 Putative uncharacterized protein n=1 Tax=Arthrob...   119   3e-26
UniRef50_Q4T065 Chromosome undetermined SCAF11328, whole genome ...   119   5e-26
UniRef50_Q4SK44 Chromosome 2 SCAF14570, whole genome shotgun seq...   118   8e-26
UniRef50_UPI0001BC8416 Appr-1-p processing domain protein n=1 Ta...   118   1e-25

Sequences not found previously or not previously below threshold:

>UniRef50_P0A8D8 UPF0189 protein ymdB n=66 Tax=Proteobacteria RepID=YMDB_ECO57
Length = 177

Score = 225 bits (574), Expect = 5e-58, Method: Composition-based stats.
Identities = 177/177 (100%), Positives = 177/177 (100%)

Query: 1   MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60
           MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC
Sbjct: 1   MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60

Query: 61  PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120
           PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA
Sbjct: 61  PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120

Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177
           ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE
Sbjct: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177

>UniRef50_P67342 UPF0189 protein ymdB n=62 Tax=Bacteria RepID=YMDB_SALTI
Length = 179

Score = 223 bits (569), Expect = 2e-57, Method: Composition-based stats.
Identities = 135/177 (76%), Positives = 153/177 (86%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +R+ V+QGDIT+L+VD IVNAAN SLMGGGGVDGAIHRAAGPALLDAC +RQQQG+C Sbjct: 1 MTSRLQVIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TGHAVIT AG L AKAV+HTVGPVWRGGE E +LL++AY N L L AN + S+AFPA Sbjct: 61 QTGHAVITPAGKLSAKAVIHTVGPVWRGGEHQEAELLEEAYRNCLLLAEANHFRSIAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTGVYGYPRA AAE+AV+TVS+FITR+ALPEQVYFVCYDEE A LY RLLTQQGD+ Sbjct: 121 ISTGVYGYPRAQAAEVAVRTVSDFITRYALPEQVYFVCYDEETARLYARLLTQQGDD 177 >UniRef50_Q8EYT0 UPF0189 protein LA_4133 n=9 Tax=cellular organisms RepID=Y4133_LEPIN Length = 175 Score = 219 bits (559), Expect = 2e-56, Method: Composition-based stats. Identities = 93/172 (54%), Positives = 129/172 (75%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +I +++ DIT+L VD IVNAAN SL+GGGGVDGAIHRA GP +L+ C K+R++QG+C Sbjct: 1 MNNKIKLIKEDITQLEVDAIVNAANSSLLGGGGVDGAIHRAGGPEILEECYKIREKQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AVIT AG L AK ++HTVGP+W GG +NED+LL +AY NSL L +S ++AFP Sbjct: 61 KVGEAVITTAGRLNAKFIIHTVGPIWSGGNKNEDELLSNAYKNSLLLAKNHSLKTIAFPN 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTG+Y +P+ AA+IA+++V+EF+ + + V+FVC+D EN +Y +LL Sbjct: 121 ISTGIYHFPKERAAKIAIQSVTEFLKQDNQIQTVFFVCFDFENLEIYNKLLQ 172 >UniRef50_A4W960 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=A4W960_ENT38 Length = 180 Score = 219 bits (557), Expect = 4e-56, Method: Composition-based stats. 
Identities = 136/175 (77%), Positives = 152/175 (86%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I VV GDIT + VDVIVNAANPSLMGGGGVDGAIHRAAGP LL+AC VRQQQG+C Sbjct: 1 MKPQIEVVVGDITTMEVDVIVNAANPSLMGGGGVDGAIHRAAGPQLLEACKTVRQQQGEC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 GHAVIT+AGDLPAKAV+H VGPVW+GGE +E + LQDAYLN LRL AAN Y ++AFPA Sbjct: 61 APGHAVITIAGDLPAKAVIHAVGPVWQGGENHEARTLQDAYLNCLRLAAANGYKTLAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 ISTGVYGYP+AAAAEIAV TVSEF+TR LPE+VYFVCYDEENA LY+RLL Q+G Sbjct: 121 ISTGVYGYPKAAAAEIAVDTVSEFLTRKPLPERVYFVCYDEENAQLYQRLLIQRG 175 >UniRef50_Q3AEI4 Putative uncharacterized protein n=6 Tax=Bacteria RepID=Q3AEI4_CARHZ Length = 181 Score = 213 bits (542), Expect = 3e-54, Method: Composition-based stats. Identities = 85/171 (49%), Positives = 113/171 (66%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 ++I + GDITK VD IVNAAN L GGGGVDGAIHRA GP +++ C ++ + G P Sbjct: 7 NSKIILKLGDITKEKVDAIVNAANSRLAGGGGVDGAIHRAGGPKIMEECREIINKIGVLP 66 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T AG+LPAK V+HTVGP++RGG++ E+ L++AYLNSL+L + ++AFP+I Sbjct: 67 PGEAVATTAGNLPAKYVIHTVGPIYRGGQKGEENTLRNAYLNSLKLAKQLNVKTIAFPSI 126 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 STG YGYP AA +A+K V EF+ V FV +DE Y+ L Sbjct: 127 STGAYGYPVKDAARVALKAVIEFLEGEPEDFTVVFVLFDEITYAAYQEALE 177 >UniRef50_Q8RB30 UPF0189 protein TTE0995 n=12 Tax=Bacteria RepID=Y995_THETN Length = 175 Score = 212 bits (539), Expect = 6e-54, Method: Composition-based stats. 
Identities = 84/174 (48%), Positives = 121/174 (69%), Gaps = 1/174 (0%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I +++G+I VD IVNAAN SL+GGGGVDGAIH+A GPA+ + +R++QG C Sbjct: 1 MKEKIKLIKGNIVDQEVDAIVNAANSSLIGGGGVDGAIHKAGGPAIAEELKVIREKQGGC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTGHAVIT AG+L AK V+H VGP+W+GG NED LL AY+ SL+L + ++AFP+ Sbjct: 61 PTGHAVITGAGNLKAKYVIHAVGPIWKGGNHNEDNLLASAYIESLKLADEYNVKTIAFPS 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 ISTG YG+P AA IA++ VS+++ + ++V FV + + + +Y + + Sbjct: 121 ISTGAYGFPVERAARIALRVVSDYLE-GSSIKEVRFVLFSDRDYEVYSKAYEEL 173 >UniRef50_D1N530 Appr-1-p processing domain protein n=4 Tax=cellular organisms RepID=D1N530_9BACT Length = 164 Score = 207 bits (528), Expect = 9e-53, Method: Composition-based stats. Identities = 96/169 (56%), Positives = 117/169 (69%), Gaps = 6/169 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +VQ DIT+L D IVNAAN SL+GGGGVDGAIHRAAGP LL+AC K CPTG Sbjct: 2 KIQIVQDDITRLRADAIVNAANSSLLGGGGVDGAIHRAAGPELLEACRKFN----GCPTG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT L A+ V+HT GPVW GG E +LL+ Y NSLRL AAN S+AFPAIST Sbjct: 58 EARITPGFRLAARFVIHTPGPVWHGGTHGEAELLEACYRNSLRLAAANGCRSIAFPAIST 117 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVY YP+A AA+IA++TV ++ R LPE+V F C+ + +Y+ LL Sbjct: 118 GVYRYPKAEAAQIALRTVRQW--REPLPEEVIFCCFSAADLDVYQELLK 164 >UniRef50_Q9WYX8 UPF0189 protein TM_0508 n=15 Tax=cellular organisms RepID=Y508_THEMA Length = 599 Score = 206 bits (525), Expect = 2e-52, Method: Composition-based stats. 
Identities = 76/172 (44%), Positives = 104/172 (60%), Gaps = 2/172 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +I +V+GDIT+ VD IVNAAN L GGGV GAI RA G + + ++ Q++G PT Sbjct: 427 KKIRIVKGDITREEVDAIVNAANEYLKHGGGVAGAIVRAGGSVIQEESDRIVQERGRVPT 486 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV+T AG L AK V+HTVGPVWRGG ED+LL A N+L S++ PAIS Sbjct: 487 GEAVVTSAGKLKAKYVIHTVGPVWRGGSHGEDELLYKAVYNALLRAHELKLKSISMPAIS 546 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLT 172 TG++G+P+ A I K + +FI +H E++ DEE ++E + Sbjct: 547 TGIFGFPKERAVGIFSKAIRDFIDQHPDTTLEEIRICNIDEETTKIFEEKFS 598 >UniRef50_Q8KAE4 UPF0189 protein CT2219 n=7 Tax=Bacteria RepID=Y2219_CHLTE Length = 172 Score = 206 bits (523), Expect = 4e-52, Method: Composition-based stats. Identities = 87/171 (50%), Positives = 110/171 (64%), Gaps = 4/171 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 IH ++ DIT L VD IVNAAN SL+GGGGVDGAIHRAAGP LL+AC ++ G C Sbjct: 4 NVLIHAIKADITSLTVDAIVNAANTSLLGGGGVDGAIHRAAGPKLLEACREL----GGCL 59 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG A IT LPA V+HTVGPVW GG E +LL Y NSL+L + ++AFP+I Sbjct: 60 TGEAKITKGYRLPATFVIHTVGPVWHGGNHGEAELLASCYRNSLKLAIEHHCRTIAFPSI 119 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 STG+YGYP AA IA+ TV E + E+V F C+ + + +Y++ L Sbjct: 120 STGIYGYPVEQAAAIAITTVREMLADERGIEKVIFCCFSDRDLDVYQKALA 170 >UniRef50_C1PFE0 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=C1PFE0_BACCO Length = 188 Score = 205 bits (522), Expect = 5e-52, Method: Composition-based stats. 
Identities = 88/169 (52%), Positives = 115/169 (68%), Gaps = 4/169 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +V GDITK+ D IVNAAN +L+GGGGVDGAIHRAAGP LL+ C K+ CPTG Sbjct: 4 FKIVLGDITKVKTDAIVNAANTTLLGGGGVDGAIHRAAGPELLEECRKLN----GCPTGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T LPAK V+HT GPVW+GG +E +LL+++Y NSLRL + +VAFP+ISTG Sbjct: 60 AKMTKGYRLPAKYVIHTPGPVWQGGGHHEAELLENSYQNSLRLAESKGLRTVAFPSISTG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 VY +P AAA IAV+T+ F+ ++V+ VC+DE YE+ T+ Sbjct: 120 VYHFPLDAAARIAVRTICTFLETSDSVQEVWMVCFDERTKQAYEKAATE 168 >UniRef50_D1AKA5 Appr-1-p processing domain protein n=2 Tax=Bacteria RepID=D1AKA5_SEBTE Length = 180 Score = 205 bits (522), Expect = 5e-52, Method: Composition-based stats. Identities = 88/175 (50%), Positives = 111/175 (63%), Gaps = 2/175 (1%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 T + GDITK+ DVIVNAAN SL+GGGGVDGAIHR GP +LD C K+ +QG CP Sbjct: 5 NTELRCENGDITKVKTDVIVNAANSSLLGGGGVDGAIHRTGGPLILDECRKIVDRQGSCP 64 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT G LPAK V+HTVGPVW G+ NE++ L+ Y NSL++ S+AF I Sbjct: 65 VGEAVITTGGKLPAKFVIHTVGPVWSYGKNNEEEKLRKCYRNSLKIAEDKQLESIAFSNI 124 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERLLTQQ 174 STG YG+P+ A A+ V ++ T +V FVC D+EN +YE LL + Sbjct: 125 STGTYGFPKETAGRAALDEVKKYFIQTPDTTIREVVFVCLDDENFEIYEELLESE 179 >UniRef50_Q8PHB6 UPF0189 protein XAC3343 n=14 Tax=Proteobacteria RepID=Y3343_XANAC Length = 179 Score = 204 bits (520), Expect = 8e-52, Method: Composition-based stats. 
Identities = 90/173 (52%), Positives = 110/173 (63%), Gaps = 2/173 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG--DCP 61 RI V QGDIT+L VDVIVNAAN SL+GGGGVDGAIHRAAGP LL+AC + Q + CP Sbjct: 2 RIEVWQGDITELDVDVIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPQVRPGVRCP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG IT DL A+ + HTVGPVWR G NE + L + Y SL+L S+AFPAI Sbjct: 62 TGEIRITDGFDLKARHIFHTVGPVWRDGRHNEPEQLANCYWQSLKLAEQMMLHSIAFPAI 121 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 S G+YGYP AA IAV ++ H +P+ + V Y+E Y++ L Q Sbjct: 122 SCGIYGYPLHQAARIAVTETRDWQRSHKVPKHIVLVAYNEATYKAYQQALATQ 174 >UniRef50_Q8Q0F9 UPF0189 protein MM_0177 n=18 Tax=cellular organisms RepID=Y177_METMA Length = 187 Score = 202 bits (513), Expect = 6e-51, Method: Composition-based stats. Identities = 90/172 (52%), Positives = 117/172 (68%), Gaps = 4/172 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI + +GDI K+ VD IVNAAN +L+GGGGVDGAIHRAAGPALL+ C + CPT Sbjct: 19 DRIRIFEGDIVKMRVDAIVNAANNTLLGGGGVDGAIHRAAGPALLEECKTLN----GCPT 74 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPAK ++HTVGPVW+GGE+ ED+LL Y SL L ++AFPAIS Sbjct: 75 GEAKITSGYLLPAKYIIHTVGPVWQGGEKGEDELLASCYRKSLELARDYKIKTIAFPAIS 134 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 TG YG+P AA IAV V EF+ ++ +PE VY VCY++++ ++ L++ Sbjct: 135 TGAYGFPSERAAGIAVSQVKEFLQKNEIPETVYLVCYNKDSCKSIKKALSKI 186 >UniRef50_C7NE27 Appr-1-p processing domain protein n=2 Tax=Leptotrichia RepID=C7NE27_LEPBD Length = 187 Score = 201 bits (511), Expect = 8e-51, Method: Composition-based stats. 
Identities = 87/180 (48%), Positives = 125/180 (69%), Gaps = 5/180 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +K RI +V+GDIT+ DVIVNAAN SL+GG GVDGAIHR G + + C+K+R QG C Sbjct: 6 LKNRIVLVKGDITEYPADVIVNAANSSLLGGSGVDGAIHRKGGKEITEDCMKIRASQGKC 65 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-----DQLLQDAYLNSLRLVAANSYTS 115 G AVIT AG++ K V+HTVGPVW+ G+ NE ++LL++AY++SL L N + Sbjct: 66 NIGEAVITRAGNMSFKNVIHTVGPVWQSGKNNEAKLFAEKLLKNAYISSLELAEKNKLKN 125 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 ++FP ISTGVY +P+ AA+ A+ V E++ ++ E+V FVC++ EN +Y +LL ++G Sbjct: 126 ISFPNISTGVYRFPKDLAAKTAINAVIEYLEKNDFIEKVNFVCFENENFEIYRKLLEEKG 185 >UniRef50_C7RS37 Appr-1-p processing domain protein n=15 Tax=cellular organisms RepID=C7RS37_9PROT Length = 197 Score = 199 bits (507), Expect = 2e-50, Method: Composition-based stats. Identities = 89/173 (51%), Positives = 108/173 (62%), Gaps = 4/173 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M T + + DIT LAVD IVNAAN SL+GGGGVDGAIHRAAGP LL C + G C Sbjct: 26 MSTMLRAICADITTLAVDAIVNAANSSLLGGGGVDGAIHRAAGPGLLAECRLL----GGC 81 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T A LPA+ ++HTVGPVW GG E Q L Y SL L AN ++A P+ Sbjct: 82 PTGEARLTHAHRLPARYIIHTVGPVWHGGGSGEAQRLASCYRCSLELAVANDLVTLAIPS 141 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTG+YGYP AAE+AV TV + +V F C+ + +YERLL + Sbjct: 142 ISTGIYGYPIEQAAEVAVSTVRASVRELGRLREVVFCCFSPGDLRVYERLLGE 194 >UniRef50_A8ZUR5 Appr-1-p processing domain protein n=3 Tax=cellular organisms RepID=A8ZUR5_DESOH Length = 195 Score = 199 bits (507), Expect = 2e-50, Method: Composition-based stats. 
Identities = 86/170 (50%), Positives = 103/170 (60%), Gaps = 4/170 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +R+ V QGDIT L VD IVNAAN +L+GGGGVDGAIHRAAGP LL C + G C T Sbjct: 27 SRLKVWQGDITTLEVDAIVNAANKTLLGGGGVDGAIHRAAGPELLAECKTL----GGCDT 82 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPAK V+HTVGPV+ +LL Y NSL+L SVAFPA+S Sbjct: 83 GQAKITRGYRLPAKFVIHTVGPVYSRSNPGVAKLLAGCYTNSLKLAKDQGLASVAFPAVS 142 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVYGYP A IA+ TV +F+ EQV F + + +YE L+ Sbjct: 143 CGVYGYPMKEACRIALDTVCDFLETDRTIEQVIFALFSADAVRVYEGYLS 192 >UniRef50_C1H4Y3 MACRO domain-containing protein n=4 Tax=cellular organisms RepID=C1H4Y3_PARBA Length = 334 Score = 199 bits (505), Expect = 4e-50, Method: Composition-based stats. Identities = 80/176 (45%), Positives = 111/176 (63%), Gaps = 6/176 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + I ++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAG L C + G C Sbjct: 38 LNNSICLITSDITKLEVDCIVNAANKSLLGGGGVDGAIHRAAGRGLWQECRSL----GGC 93 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT A +LP + V+H VGP++ +++ + LL+ Y+ SL + A N S+AF + Sbjct: 94 MTGDAKITNAYNLPCRKVIHAVGPMY-WADEDRESLLRSCYMRSLTIAAENGLKSIAFSS 152 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLLTQQG 175 ISTGVYGYP + AAE+A++ V F+ R + PE+V F ++ ++ + Y LL Sbjct: 153 ISTGVYGYPSSKAAEVAIRAVKHFLEARSSPPERVIFCTFEPKDVNAYRALLPAYF 208 >UniRef50_Q047N9 Predicted phosphatase, histone macroH2A1 family n=3 Tax=Bacteria RepID=Q047N9_LACDB Length = 166 Score = 197 bits (500), Expect = 2e-49, Method: Composition-based stats. 
Identities = 87/169 (51%), Positives = 107/169 (63%), Gaps = 6/169 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + + QGDIT L VD IVNAAN L GGGGVDGAIHRAAGP L +AC + G C TG Sbjct: 2 NLEIWQGDITTLKVDAIVNAANRELRGGGGVDGAIHRAAGPKLNEACRAL----GSCETG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT +LPAK ++HTVGPV+ G ++ LL Y NSLR+ N SVAF AIST Sbjct: 58 EAKITPGFNLPAKYIIHTVGPVYS-GSHSDPLLLAACYRNSLRVAKENGLHSVAFSAIST 116 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLL 171 GVYGYP AA+++A V +++ H E +V V YD LY++LL Sbjct: 117 GVYGYPLDAASKVAFGEVRKWLREHKDYEMRVIMVAYDARTYALYQKLL 165 >UniRef50_Q1HPZ5 LRP16 protein n=1 Tax=Bombyx mori RepID=Q1HPZ5_BOMMO Length = 275 Score = 197 bits (500), Expect = 2e-49, Method: Composition-based stats. Identities = 70/172 (40%), Positives = 98/172 (56%), Gaps = 9/172 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ + +GDITKL +D +VNAAN L GGGVDGAIHRAAGP L C + G C Sbjct: 107 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGC 162 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T +LPAK ++HTVGP + + L+ Y L S+AFP Sbjct: 163 PTGDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPC 217 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTG+YG+P AA IA++T +F+ + ++ F + + +YE L+ Sbjct: 218 ISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQ 269 >UniRef50_C8NAC1 RNase III regulator YmdB n=3 Tax=cellular organisms RepID=C8NAC1_9GAMM Length = 165 Score = 197 bits (500), Expect = 2e-49, Method: Composition-based stats. 
Identities = 89/168 (52%), Positives = 108/168 (64%), Gaps = 4/168 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V DIT LAVD IVNAAN SL+GG GVDGAIHRAAG L+ C + G C G Sbjct: 2 NLEVQVADITTLAVDAIVNAANESLLGGSGVDGAIHRAAGKELVAECRTL----GGCKVG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A +T LPA+ V+HTVGPVW GG+ E + L +AY NSLRL A+ TS+AFPAIST Sbjct: 58 EAKLTRGYRLPARFVIHTVGPVWYGGDDGEAEALANAYANSLRLAEAHELTSIAFPAIST 117 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 GV+GYP+ AA IA+ TV + +V F C+ E +A LY RLL Sbjct: 118 GVFGYPKEDAARIAIDTVRATLKECPHMARVIFCCFSERDAALYRRLL 165 >UniRef50_A1Z1Q3 MACRO domain-containing protein 2 n=55 Tax=cellular organisms RepID=MACD2_HUMAN Length = 448 Score = 196 bits (498), Expect = 3e-49, Method: Composition-based stats. Identities = 81/178 (45%), Positives = 112/178 (62%), Gaps = 6/178 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +GDIT L VD IVNAAN SL+GGGGVDG IHRAAGP LL C + C Sbjct: 68 LTEKVSLYRGDITLLEVDAIVNAANASLLGGGGVDGCIHRAAGPCLLAECRNLN----GC 123 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TGHA IT DLPAK V+HTVGP+ RG + + L + Y +SL+LV N+ SVAFP Sbjct: 124 DTGHAKITCGYDLPAKYVIHTVGPIARGHINGSHKEDLANCYKSSLKLVKENNIRSVAFP 183 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 ISTG+YG+P AA IA+ T+ E++ H +++ F + E + +Y++ + + Sbjct: 184 CISTGIYGFPNEPAAVIALNTIKEWLAKNHHEVDRIIFCVFLEVDFKIYKKKMNEFFS 241 >UniRef50_Q8TQD0 UPF0189 protein MA_1614 n=2 Tax=cellular organisms RepID=Y1614_METAC Length = 195 Score = 196 bits (498), Expect = 3e-49, Method: Composition-based stats. 
Identities = 86/172 (50%), Positives = 110/172 (63%), Gaps = 4/172 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 RI +++ DIT+L VD IVNAAN +L+GGGGVDGAIHRAAGP LL+ C + CP Sbjct: 26 SERIRIIERDITELKVDAIVNAANNTLLGGGGVDGAIHRAAGPGLLEECRTLN----GCP 81 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG A IT LPAK V+HTVGP+W+ G + ED+ L Y SL L ++AFP I Sbjct: 82 TGEAKITKGYLLPAKYVIHTVGPIWQEGTKGEDEFLASCYRKSLELARKYDVKTIAFPTI 141 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 STG YG+P AA IAV V EF+ + LPE V+ VCY++E ++ L + Sbjct: 142 STGAYGFPSERAARIAVSQVKEFLKVNELPEIVFLVCYNKEACKNIKKALEE 193 >UniRef50_Q0CQJ0 Protein LRP16 n=10 Tax=cellular organisms RepID=Q0CQJ0_ASPTN Length = 344 Score = 196 bits (498), Expect = 3e-49, Method: Composition-based stats. Identities = 83/184 (45%), Positives = 105/184 (57%), Gaps = 13/184 (7%) Query: 1 MKTRIHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + RI +++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAGP L+ C + G Sbjct: 37 LNDRISLIRHDITKLLDVDCIVNAANSSLLGGGGVDGAIHRAAGPGLVRECRTL----GG 92 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWR----GGEQNEDQLLQDAYLNSLRLVAANSYTS 115 C TG A T A DLP + V+HTVGP++ G +QLL+ Y L L N S Sbjct: 93 CATGDAKTTAAYDLPCRWVIHTVGPIYPVERQKGAARPEQLLRSCYRRCLELAVRNKARS 152 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA----LPEQVYFVCYDEENAHLYERLL 171 +AFPAISTGVY YP+ AA IA+ F+ E+V F ++EE+ YE + Sbjct: 153 IAFPAISTGVYAYPKRRAARIALDETRAFLESEGTDIVTLEKVVFCNFEEEDQRAYEEAV 212 Query: 172 TQQG 175 Sbjct: 213 PDVF 216 >UniRef50_A7RJ44 Predicted protein (Fragment) n=4 Tax=Eukaryota RepID=A7RJ44_NEMVE Length = 183 Score = 196 bits (498), Expect = 3e-49, Method: Composition-based stats. 
Identities = 75/174 (43%), Positives = 104/174 (59%), Gaps = 12/174 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L +D IVNAAN +L+GGGGVDG IHRAAG L C K+R C Sbjct: 5 LNDKVSLWTGDITALEIDAIVNAANTTLLGGGGVDGCIHRAAGDNLFKECRKLR----GC 60 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A ITL LPAK V+HT GP+ + LQD Y N L+L + ++AF Sbjct: 61 QTGEAKITLGHRLPAKYVIHTAGPM-----GKNRKKLQDCYKNCLQLAKQHGVKTLAFCC 115 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLL 171 ISTG+YGYP AA +A++TV +++ + E++ F + ++ +YERLL Sbjct: 116 ISTGIYGYPNKDAAHVALETVRQWLETDDNNDSVERIVFCTFLPKDTEIYERLL 169 >UniRef50_Q66HV6 Zgc:92353 n=1 Tax=Danio rerio RepID=Q66HV6_DANRE Length = 248 Score = 196 bits (498), Expect = 3e-49, Method: Composition-based stats. Identities = 75/177 (42%), Positives = 106/177 (59%), Gaps = 6/177 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDITKL +D + NAAN +L+GGGGVDGAIHR AGP L C + C Sbjct: 66 LNMKVSLFGGDITKLEIDAVANAANKTLLGGGGVDGAIHRGAGPLLRKECATLN----GC 121 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT A LPA+ V+HTVGP+ + E++ L++ Y N L + +VAFP Sbjct: 122 ETGEAKITGAYGLPARYVIHTVGPIVHDSVGEREEEALRNCYYNCLHTATKHHLRTVAFP 181 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQG 175 ISTGVYGYP A E+A+KTV +++ ++ ++V F + + + LYE LL Sbjct: 182 CISTGVYGYPPDQAVEVALKTVRDYLEQNPEKLDRVIFCVFLKSDKQLYENLLPAYF 238 >UniRef50_Q9HXU7 UPF0189 protein PA3693 n=16 Tax=Bacteria RepID=Y3693_PSEAE Length = 173 Score = 196 bits (497), Expect = 4e-49, Method: Composition-based stats. 
Identities = 91/172 (52%), Positives = 108/172 (62%), Gaps = 4/172 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + V QGDIT+LAVD IVNAAN SL+GGGGVDGAIHRAAG L+ AC + C T Sbjct: 2 TEVRVWQGDITRLAVDAIVNAANSSLLGGGGVDGAIHRAAGAELVAACRLL----HGCKT 57 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPA V+HTVGPVWRGG+ E +LL Y SL L SVAFPAIS Sbjct: 58 GEAKITRGFRLPAAHVIHTVGPVWRGGDNGEAELLASCYRRSLALAEQAGAASVAFPAIS 117 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 G+YGYP AA IAV+ V H+ E++ V +D A Y+RLL ++ Sbjct: 118 CGIYGYPLEQAAAIAVEEVCRQRPAHSSLEEIVLVAFDSSMAERYQRLLGER 169 >UniRef50_Q8Y2K1 UPF0189 protein RSc0334 n=39 Tax=cellular organisms RepID=Y334_RALSO Length = 171 Score = 195 bits (496), Expect = 5e-49, Method: Composition-based stats. Identities = 85/171 (49%), Positives = 107/171 (62%), Gaps = 7/171 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + ++ DIT LA D IVNAAN +L+GGGGVDGAIHRAAGP LL+AC + C T Sbjct: 6 VTLRALRADITTLACDAIVNAANSALLGGGGVDGAIHRAAGPELLEACRAL----HGCRT 61 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPA+ ++HTVGP+WRGG Q+E LL Y NSL L + ++AFP IS Sbjct: 62 GQAKITPGFLLPARYIIHTVGPIWRGGRQDEAALLAACYRNSLALAKQHDVRTIAFPCIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 TGVYG+P AA IAV+TV E A + + F C+ + LYE L + Sbjct: 122 TGVYGFPPQLAAPIAVRTVREH---GADLDDIVFCCFSAADLALYETALNE 169 >UniRef50_A5GC80 Appr-1-p processing domain protein n=2 Tax=Desulfuromonadales RepID=A5GC80_GEOUR Length = 172 Score = 195 bits (495), Expect = 6e-49, Method: Composition-based stats. 
Identities = 85/175 (48%), Positives = 108/175 (61%), Gaps = 4/175 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 MK +I ++QGDIT+LAVD IVNAAN +L+GGGGVDGAIHRAAGP L+ C + G C Sbjct: 1 MKGKIEIIQGDITRLAVDAIVNAANNTLLGGGGVDGAIHRAAGPDLVAECSTL----GGC 56 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT LPAK V+HTVGPVW GG + E +LL+ AY + A+ S+AFPA Sbjct: 57 ETGDAKITKGYKLPAKHVIHTVGPVWHGGSKGEPELLRKAYRRCFEVAHASKLKSIAFPA 116 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 IS GVYGYP A EIA+ + + E+V FV + +Y+ + Sbjct: 117 ISAGVYGYPMDQACEIAMVEAKAALEKFPELERVIFVPFSPGALAIYQATYGKIF 171 >UniRef50_Q6PHJ5 Zgc:65960 n=11 Tax=cellular organisms RepID=Q6PHJ5_DANRE Length = 452 Score = 195 bits (495), Expect = 7e-49, Method: Composition-based stats. Identities = 83/178 (46%), Positives = 114/178 (64%), Gaps = 6/178 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +GDIT L +D IVNAAN SL+GGGGVDG IHRAAG L + C + C Sbjct: 59 LADKVSLYKGDITILEIDAIVNAANSSLLGGGGVDGCIHRAAGHLLYEECHSLN----GC 114 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT DLPAK V+HTVGP+ RG Q++ L+ Y +SL+L+ N+ SVAFP Sbjct: 115 DTGKAKITCGYDLPAKYVIHTVGPIARGNVGQSQRDDLESCYYSSLKLMKDNNLRSVAFP 174 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQGD 176 ISTG+YG+P AAEIA+KTV E+I +H ++V F + E + +Y+R ++ Sbjct: 175 CISTGIYGFPNEPAAEIALKTVQEWIEKHQDEIDRVIFCVFLETDYEIYKRKMSDFFS 232 >UniRef50_Q71W03 UPF0189 protein LMOf2365_2748 n=23 Tax=Bacteria RepID=Y2748_LISMF Length = 176 Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats. 
Identities = 92/173 (53%), Positives = 116/173 (67%), Gaps = 2/173 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I VV+GDIT+ VDVIVNAANP L+GGGGVDGAIH+AAGP LL C +V + G CP G Sbjct: 2 EITVVKGDITEQDVDVIVNAANPGLLGGGGVDGAIHQAAGPDLLKECQEVINRIGTCPAG 61 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT AGDL A ++H VGP+W+ GE E L Y +L L A TS+AFP IST Sbjct: 62 EAVITSAGDLQASYIIHAVGPIWKDGEHQEANKLASCYWKALDLAAGKELTSIAFPNIST 121 Query: 124 GVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLTQQ 174 GVYG+P+ AAE+A+ TV ++ E++ FVC+DEEN LY +L+ + Sbjct: 122 GVYGFPKKLAAEVALYTVRKWAEEEYDTSIEEIRFVCFDEENLKLYNKLINSE 174 >UniRef50_A4R3Q9 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4R3Q9_MAGGR Length = 263 Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats. Identities = 80/173 (46%), Positives = 100/173 (57%), Gaps = 7/173 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 RI + GDITKL VD IVNAAN +L+GGGGVDG+IHRAAG LL C + C Sbjct: 61 NDRIALYHGDITKLMVDAIVNAANETLLGGGGVDGSIHRAAGGGLLRECRTL----DGCD 116 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPA 120 TG A +T A DLP K V+H VGPV+ + E + LL Y SL L N S+AFPA Sbjct: 117 TGDAKVTDAYDLPCKKVIHAVGPVYNERHREECEMLLSSCYTRSLELAVENGCRSIAFPA 176 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 ISTG+YGYP AA A+ V +F+ V F C+ +++ +Y L Sbjct: 177 ISTGIYGYPSRRAANAAITAVRKFLESDQGDKISLVVFCCFLQKDMEIYTDKL 229 >UniRef50_C1BR35 MACRO domain-containing protein 1 n=2 Tax=Caligus rogercresseyi RepID=C1BR35_9MAXI Length = 242 Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats. 
Identities = 76/174 (43%), Positives = 102/174 (58%), Gaps = 10/174 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +I + QGDITKL VD IVNAAN L GGGV GAIHRAAG L C + G CP Sbjct: 78 SPKIGMWQGDITKLEVDAIVNAANSGLKAGGGVCGAIHRAAGSQLQKECDSI----GGCP 133 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G + IT LPAK V+HTVGP + + L+ Y S+ L+ A S+AFP I Sbjct: 134 VGDSRITAGYKLPAKHVIHTVGP-----QDKNSEHLKSCYRKSMELLIAKGLRSIAFPCI 188 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQ 174 STG+YGYP AAE+A++T+ FI ++ + V F + +++ Y LL+++ Sbjct: 189 STGIYGYPSDKAAEVALQTIRSFIQDNSESVDSVIFCVFLDKDMQYYSELLSKK 242 >UniRef50_B7PF53 MACRO domain-containing protein, putative n=2 Tax=cellular organisms RepID=B7PF53_IXOSC Length = 304 Score = 193 bits (491), Expect = 2e-48, Method: Composition-based stats. Identities = 73/178 (41%), Positives = 99/178 (55%), Gaps = 12/178 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L +D IVNAAN L+GGGGVDGAIH AAGP L + C + C Sbjct: 134 LNNKVSIFVGDITALEIDAIVNAANNRLLGGGGVDGAIHSAAGPKLKEECATLN----GC 189 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A IT LPAK V+HTVGPV + L Y+ SL A+ ++AFP Sbjct: 190 PTGEAKITGGYKLPAKYVIHTVGPV-----GENEAKLHGCYVTSLETAKAHKIRTLAFPC 244 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHAL---PEQVYFVCYDEENAHLYERLLTQQG 175 ISTG+YGYP AA +A+ E++ +++ F + + LYE+LL + Sbjct: 245 ISTGIYGYPNEKAAHVALSAAREWLDSEENALKVDRIIFCLFLPIDVRLYEKLLPEYF 302 >UniRef50_C6BB95 Appr-1-p processing domain protein n=4 Tax=cellular organisms RepID=C6BB95_RALP1 Length = 171 Score = 193 bits (490), Expect = 2e-48, Method: Composition-based stats. 
Identities = 83/170 (48%), Positives = 103/170 (60%), Gaps = 7/170 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++GDIT L D IVNAAN SL+GGGGVDGAIHRAAGP LL+AC + C TG Sbjct: 8 LRALRGDITTLDCDAIVNAANSSLLGGGGVDGAIHRAAGPELLEACRAL----HGCRTGE 63 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T L A+ V+HTVGP+WRGG Q+E LL Y NSL L S+AFP ISTG Sbjct: 64 AKLTPGFQLTARYVIHTVGPIWRGGRQDEAALLAACYRNSLELACKYEVRSIAFPCISTG 123 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 +YG+P AA IAV+ E + E + F C+ + LYE L + Sbjct: 124 IYGFPPQLAAPIAVRAAREH---GSRFETITFCCFSAADLILYEAALGNR 170 >UniRef50_C6RT62 Appr-1-p processing n=2 Tax=Acinetobacter radioresistens RepID=C6RT62_ACIRA Length = 186 Score = 193 bits (490), Expect = 2e-48, Method: Composition-based stats. Identities = 82/175 (46%), Positives = 113/175 (64%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + ++ GDIT + +D IVNAAN +L+GG GVDGAIH+A GP +++ C ++R +QG C Sbjct: 2 KQFRLIHGDITGIRIDAIVNAANSTLLGGHGVDGAIHQAGGPDIIEECRQIRARQGSCTV 61 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV+T G LPA+ V+HTVGP+W G+ NE LL AY NS L + T +A+P IS Sbjct: 62 GEAVMTTGGRLPAQYVIHTVGPIWEEGKANERTLLSQAYQNSFALAEQHYLTGIAYPNIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 TGVY +P+ AA IA+ T+ + ++V VC+D EN LYE LL Q+ E Sbjct: 122 TGVYRFPKVEAAAIAIDTLIPLLKNSETVQEVALVCFDLENFELYEELLKQRLLE 176 >UniRef50_B7C8M6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B7C8M6_9FIRM Length = 296 Score = 191 bits (486), Expect = 8e-48, Method: Composition-based stats. 
Identities = 80/177 (45%), Positives = 113/177 (63%), Gaps = 6/177 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 +++ I VV+GDIT D IVNAAN SL+GGGGVDGAIHRAAGP LL+ C + C Sbjct: 125 LESEIKVVKGDITTFDGDCIVNAANESLLGGGGVDGAIHRAAGPMLLEECKLLN----GC 180 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG A IT DL AK V+HTVGP++ G+ ++ +L+D Y NSL L ++AFPA Sbjct: 181 QTGQAKITKGYDLKAKYVIHTVGPMYS-GKHEDEHMLRDCYWNSLTLARKYDIHTIAFPA 239 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLTQQGD 176 IS GVYGYP A + +KT+++++ ++ ++ C+DEE Y++ + QG+ Sbjct: 240 ISCGVYGYPVEKAVPLVLKTIADWLDANSDYTMKISLYCFDEETTKEYQKYTSYQGE 296 >UniRef50_Q0UQZ6 Putative uncharacterized protein n=2 Tax=Leotiomyceta RepID=Q0UQZ6_PHANO Length = 291 Score = 190 bits (484), Expect = 1e-47, Method: Composition-based stats. Identities = 78/179 (43%), Positives = 110/179 (61%), Gaps = 8/179 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I +++ DIT LA+D IVNAAN SL+GGGGVDGAIHRAAGP L D C + C Sbjct: 37 LNDKISIIRRDITTLAIDAIVNAANTSLLGGGGVDGAIHRAAGPKLYDECETL----DGC 92 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG+A +T +LP+K V+H VGP+ W+ G +LL Y SL+L N S+AF Sbjct: 93 ETGNAKMTRGYELPSKKVIHAVGPIYWKEGRSASAKLLSMCYRTSLQLAVDNECRSIAFS 152 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA---LPEQVYFVCYDEENAHLYERLLTQQG 175 A+STGVYGYP AA +A++TV +F+ ++V F + E++ + Y R + + Sbjct: 153 ALSTGVYGYPSDEAAVVALQTVRQFLDEDGKAEKLDRVIFCNFLEKDENAYYREIQKYF 211 >UniRef50_Q9BQ69 MACRO domain-containing protein 1 n=11 Tax=Tetrapoda RepID=MACD1_HUMAN Length = 325 Score = 189 bits (479), Expect = 4e-47, Method: Composition-based stats. 
Identities = 79/177 (44%), Positives = 109/177 (61%), Gaps = 6/177 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I +++ DITKL VD IVNAAN SL+GGGGVDG IHRAAGP L D C ++ C Sbjct: 150 LNEKISLLRSDITKLEVDAIVNAANSSLLGGGGVDGCIHRAAGPLLTDECRTLQS----C 205 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT LPAK V+HTVGP+ + ++ L+ YL+SL L+ + SVAFP Sbjct: 206 KTGKAKITGGYRLPAKYVIHTVGPIAYGEPSASQAAELRSCYLSSLDLLLEHRLRSVAFP 265 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLTQQG 175 ISTGV+GYP AAAEI + T+ E++ +H +++ + E++ +Y L Sbjct: 266 CISTGVFGYPCEAAAEIVLATLREWLEQHKDKVDRLIICVFLEKDEDIYRSRLPHYF 322 >UniRef50_Q2LUU1 Appr-1-p histone processing protein n=5 Tax=Bacteria RepID=Q2LUU1_SYNAS Length = 214 Score = 189 bits (479), Expect = 5e-47, Method: Composition-based stats. Identities = 79/173 (45%), Positives = 111/173 (64%), Gaps = 4/173 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + ++QGDIT+ D IVNAAN L GGGGVDGAIHRA GP+++ C ++ G CP Sbjct: 40 NSVLALIQGDITQEDTDAIVNAANTGLRGGGGVDGAIHRAGGPSIMAECRRI----GGCP 95 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG AVIT G + A+ V+HTVGPV+R G E +LL AY SL++ +A S++FPAI Sbjct: 96 TGQAVITTGGKMKARYVIHTVGPVYRDGSHGEAELLASAYRESLKMASARHLKSLSFPAI 155 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 S GVYGYP AA IA++TV +++ ++ E V FV +++ + L + Sbjct: 156 SAGVYGYPLEEAARIALQTVIDYLKKNRDIELVRFVLFNQSTYDAFSNALGKL 208 >UniRef50_Q8K4G6 MACRO domain-containing protein 1 (Fragment) n=5 Tax=cellular organisms RepID=MACD1_RAT Length = 258 Score = 188 bits (478), Expect = 6e-47, Method: Composition-based stats. 
Identities = 77/177 (43%), Positives = 108/177 (61%), Gaps = 6/177 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I + +GDITKL VD IVNAAN SL+GGGGVDG IHRAAG L D C ++ +C Sbjct: 83 LNEKISLFRGDITKLEVDAIVNAANNSLLGGGGVDGCIHRAAGSLLTDECRTLQ----NC 138 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT LPAK V+HTVGP+ ++ L+ YL+SL L+ + SVAFP Sbjct: 139 ETGKAKITCGYRLPAKHVIHTVGPIAVGQPTASQAAELRSCYLSSLDLLLEHRLRSVAFP 198 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLTQQG 175 ISTGV+GYP AAE+ + T+ E++ +H +++ + E++ +Y+ L Sbjct: 199 CISTGVFGYPNEEAAEVVLATLREWLEQHKDKVDRLIICVFLEKDEGIYQERLPHYF 255 >UniRef50_A6BCW6 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A6BCW6_9FIRM Length = 267 Score = 188 bits (477), Expect = 8e-47, Method: Composition-based stats. Identities = 71/177 (40%), Positives = 108/177 (61%), Gaps = 8/177 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 +I + +GDIT+L+VD IVNAAN ++G G +D AIH AAG L + C ++ + Q Sbjct: 92 DKISLWRGDITRLSVDAIVNAANSQMLGCFVPCHGCIDNAIHSAAGIQLRNECAQIMEAQ 151 Query: 58 GDC-PTGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 G PTG A IT +LPAK V+HTVGP+ + +++ L+ YLN ++L S Sbjct: 152 GHEEPTGKAKITKGYNLPAKHVIHTVGPIVGMQVTEKQEEELKSCYLNCMKLAEKEGLKS 211 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 +AF ISTG + +P AAEIAVKTV +++ + E+V F + EE+ ++Y+++ Sbjct: 212 IAFCCISTGEFHFPNKLAAEIAVKTVDKYL-SSSKLERVIFNVFKEEDYNIYKKIFA 267 >UniRef50_Q8EP31 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8EP31_OCEIH Length = 185 Score = 188 bits (477), Expect = 8e-47, Method: Composition-based stats. 
Identities = 77/179 (43%), Positives = 112/179 (62%), Gaps = 4/179 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ---G 58 + +V GDITK +VIVNAAN SL+GGGGVDGAIH AAGP LL AC ++R + Sbjct: 7 DNTLEIVVGDITKETTNVIVNAANGSLLGGGGVDGAIHHAAGPELLKACQEMRNNELNGE 66 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 + PTG +IT LP++ ++HTVGP+W +++LL + Y N+L LV +S++F Sbjct: 67 ELPTGEVIITSGFQLPSRFIIHTVGPIWNQTPDLQEELLANCYRNALELVKVKKLSSISF 126 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 P+ISTGVYGYP AA IA++T+ +F+ + V V + E + +Y+ L ++ Sbjct: 127 PSISTGVYGYPIHEAAAIALQTIIQFLQEND-VGLVKVVLFSERDYSIYQEKLKYLIEK 184 >UniRef50_B6Q324 LRP16 family protein n=3 Tax=Trichocomaceae RepID=B6Q324_PENMQ Length = 308 Score = 188 bits (477), Expect = 8e-47, Method: Composition-based stats. Identities = 78/179 (43%), Positives = 101/179 (56%), Gaps = 8/179 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + + ++ DITKL VD IVNAAN SL+GGGGVDGAIHRAAG LLD C + G C Sbjct: 38 LNDTLSHIRHDITKLQVDCIVNAANRSLLGGGGVDGAIHRAAGHRLLDECRAL----GGC 93 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT +LPA ++HTVGP++ + LL+ Y SL L + S+AF Sbjct: 94 RTGDAKITNGYNLPATKIIHTVGPIYDEDNHELSETLLRSCYRRSLELAVEHDQRSIAFS 153 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH---ALPEQVYFVCYDEENAHLYERLLTQQG 175 A+STGVYGYP AAA + V +F+ + E+V F + + YER L Sbjct: 154 AVSTGVYGYPNEAAARAVLDEVDKFLREGDNVSKLERVIFCSFMPADVRAYERYLPAYF 212 >UniRef50_B9MLL8 Appr-1-p processing domain protein n=6 Tax=Clostridiales RepID=B9MLL8_ANATD Length = 181 Score = 187 bits (476), Expect = 1e-46, Method: Composition-based stats. 
Identities = 73/172 (42%), Positives = 103/172 (59%), Gaps = 3/172 (1%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I + +GDITK VDVIVNAAN L GGGV AI +A G + ++ ++ G PTG Sbjct: 9 KIAIKKGDITKENVDVIVNAANSHLRHGGGVALAIVKAGGIEIQKESDEIIKKIGMLPTG 68 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 HAVIT A LP K V+HTVGP++ GE NED+ L A NSL L + S+AFPA+S+ Sbjct: 69 HAVITNAYRLPCKFVIHTVGPIY--GEGNEDEKLSMAIYNSLYLAHLYNLKSIAFPAVSS 126 Query: 124 GVYGYPRAAAAEIAVKTVSEFITR-HALPEQVYFVCYDEENAHLYERLLTQQ 174 G++G+P+ A+I + T +F++ E+V F +D+E +E Sbjct: 127 GIFGFPKDRCAKILIDTAVDFLSSIKTSIEKVVFCLFDDETYGYFEEYYKNL 178 >UniRef50_Q47EQ7 Appr-1-p processing n=1 Tax=Dechloromonas aromatica RCB RepID=Q47EQ7_DECAR Length = 186 Score = 187 bits (476), Expect = 1e-46, Method: Composition-based stats. Identities = 72/169 (42%), Positives = 102/169 (60%), Gaps = 2/169 (1%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD- 59 M R+ + GD+T AVD IVNAAN +L+GGGGVDGAIHR GPA+LDAC ++R+ Q Sbjct: 10 MNGRVRLYVGDLTDQAVDAIVNAANRTLLGGGGVDGAIHRRGGPAILDACRELRRSQWPD 69 Query: 60 -CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 PTG +T G LPA V+HTVGP++ E +LL Y N++ L A S+AF Sbjct: 70 GLPTGQVALTNGGKLPAPYVIHTVGPIYGQHRGKEAELLAACYRNAIELAAHLELKSLAF 129 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY 167 P+ISTG +GYP AA I +++ + + A +++ V ++ + Sbjct: 130 PSISTGAFGYPPDKAALIVSRSMHKVLDEIAAIDEIRLVFFNASQMETF 178 >UniRef50_B2ACK5 Predicted CDS Pa_3_1270 n=5 Tax=Eukaryota RepID=B2ACK5_PODAN Length = 253 Score = 187 bits (475), Expect = 1e-46, Method: Composition-based stats. 
Identities = 73/178 (41%), Positives = 102/178 (57%), Gaps = 7/178 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ V + DIT LAVD IVNAAN SL+GGGGVDGAIHRAAG L + C K+ C Sbjct: 47 LNDRVAVYRADITSLAVDAIVNAANRSLLGGGGVDGAIHRAAGRGLYEECKKLN----GC 102 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT A DLP V+H VGPV+ + + ++LL Y SL L + ++AF Sbjct: 103 KTGSAKITDAYDLPCNRVIHAVGPVYDPADHDTSEKLLVGCYTTSLELAVEHECRTIAFS 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERLLTQQG 175 A+STG+YGYP AA A+ + +F+ ++V V +++++ Y + Sbjct: 163 ALSTGIYGYPSREAAPAALSAIRKFLTGKDGDKIDKVILVTFEKKDVDAYTEFVPHYF 220 >UniRef50_A4YFR3 Appr-1-p processing domain protein n=9 Tax=Thermoprotei RepID=A4YFR3_METS5 Length = 220 Score = 186 bits (472), Expect = 3e-46, Method: Composition-based stats. Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 5/177 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + + +++GDITK+ D IVNAAN L GGGV AI R G A+ + ++ G Sbjct: 47 LGFEVDLMKGDITKIEADAIVNAANSYLSHGGGVAWAIVRRGGEAIQRESDQYVREHGPV 106 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 P G +T AG L AK V+H VGP + + L A SL S+A PA Sbjct: 107 PVGEVAVTGAGSLRAKYVIHAVGPRY---GLEGEDKLHSAIRRSLEKAEELGLRSLALPA 163 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTG+YGYP A + + + + + E+V V YD+ +E++ T++ E Sbjct: 164 ISTGIYGYPMEVCARVMASVLRSY--KPKILEKVIVVLYDDMAYSTFEKVFTRELQE 218 >UniRef50_A0LGZ1 Appr-1-p processing domain protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LGZ1_SYNFM Length = 175 Score = 185 bits (471), Expect = 4e-46, Method: Composition-based stats. 
Identities = 70/173 (40%), Positives = 100/173 (57%), Gaps = 7/173 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I +VQGD+T+L VD IVNAAN L GGGV GAI GP + + C + G G Sbjct: 9 KISLVQGDLTELRVDAIVNAANRHLALGGGVAGAIRMKGGPTIQEECDAI----GGTVVG 64 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT G+L A V+H VGP + GE +ED+ L++A LNSL+ S S+AFPA+ST Sbjct: 65 QAVITGGGNLKAAHVIHAVGPRY--GEGDEDEKLRNATLNSLKRATEKSLASIAFPAVST 122 Query: 124 GVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLLTQQG 175 G++G+P+ A+I + F+ V F + +E+ ++E+ L G Sbjct: 123 GIFGFPKDRCAKIMLDAAVAFLDRETTSLRDVIFCLWSKEDLEIFEKTLQSMG 175 >UniRef50_A2SS36 Appr-1-p processing domain protein n=26 Tax=cellular organisms RepID=A2SS36_METLZ Length = 183 Score = 185 bits (470), Expect = 5e-46, Method: Composition-based stats. Identities = 80/166 (48%), Positives = 104/166 (62%), Gaps = 5/166 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + + VV+ DIT L+VDVIVNAAN +L+GGGGVDGAIH AAGP LL C + G C Sbjct: 8 ITDHLGVVKTDITTLSVDVIVNAANTTLLGGGGVDGAIHHAAGPGLLAECRTL----GGC 63 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G A IT LPAK ++HTVGPVW GG + E + L+ Y +SL L + ++AFPA Sbjct: 64 RIGEAKITKGYALPAKYIIHTVGPVWWGGNEGEPEQLRACYFHSLTLAGEHGLRTIAFPA 123 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAH 165 +STGVYGYP+ AA IAV+TV F+ ++V V + + Sbjct: 124 VSTGVYGYPKDKAAVIAVETVLSFLRDDPDAFDRVILVAHSNADFQ 169 >UniRef50_C4V1Q4 Appr-1-p processing domain protein n=3 Tax=Bacteria RepID=C4V1Q4_9FIRM Length = 289 Score = 185 bits (470), Expect = 6e-46, Method: Composition-based stats. 
Identities = 78/178 (43%), Positives = 105/178 (58%), Gaps = 7/178 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 RI + QGDIT+L D IVNAAN +L+G +D AIH AAG L AC + ++ Sbjct: 112 DARIALWQGDITRLNADAIVNAANSALLGCFIPCHRCIDNAIHSAAGLQLRAACAALMEE 171 Query: 57 QGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYT 114 QG TG A IT +L ++ V+HTVGP+ G L Y + L L A + Sbjct: 172 QGHPEETGTAQITEGYNLSSRHVIHTVGPIVSGALTDRHRAQLASCYRSCLSLAAEHGLR 231 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 S+AF ISTG + +PRAAAAEIAV+ V +F+TR E+V F + +E+ H+YERLL+ Sbjct: 232 SIAFCCISTGEFHFPRAAAAEIAVREVRDFLTRDTSIERVVFNVFKDEDRHIYERLLS 289 >UniRef50_Q93SX7 UPF0189 protein n=2 Tax=Acinetobacter RepID=Y189_ACISE Length = 183 Score = 185 bits (469), Expect = 6e-46, Method: Composition-based stats. Identities = 81/174 (46%), Positives = 118/174 (67%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++H++Q DIT AV IVN+AN SL+GGGG+D IH+ AGP + + C+++ Q++G CPT Sbjct: 2 KKVHLIQADITAFAVHAIVNSANKSLLGGGGLDYVIHKKAGPLMKEECVRLNQEKGGCPT 61 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A +T AG+LPAK ++H VGP W GE NE QLL DAY N+L +V+FP IS Sbjct: 62 GQAEVTTAGNLPAKYLIHAVGPRWLDGEHNEPQLLCDAYSNALFKANEIHALTVSFPCIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 TGVYG+P AAEIA+ T+ + ++ +V+F+C ++EN +Y+ +L+ D Sbjct: 122 TGVYGFPPQKAAEIAIGTILSMLPQYDHVAEVFFICREDENYLIYKNILSNIDD 175 >UniRef50_Q5KCD7 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5KCD7_CRYNE Length = 252 Score = 185 bits (469), Expect = 7e-46, Method: Composition-based stats. 
Identities = 70/176 (39%), Positives = 101/176 (57%), Gaps = 5/176 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ + +GDIT+L D+IVNAAN SL+GGGGVDGAIHRAAG LL+ C K+ G Sbjct: 70 LNDRVSIWRGDITELEADMIVNAANSSLLGGGGVDGAIHRAAGKHLLEECKKL----GGA 125 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFP 119 TG T +L +K + HTVGPV+ QLL+ Y +SL + + F Sbjct: 126 QTGETKFTAGYNLSSKKIAHTVGPVYHSHPPQRAAQLLKSCYQSSLEGCRDSGGGVIGFS 185 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 +ISTGVYGYP A IA++T +F+ + +V +V + + + +Y ++ Q Sbjct: 186 SISTGVYGYPIKDATHIALETTRQFLEQDDSITRVIYVVFSKRDEDVYREIIPQYF 241 >UniRef50_B2JCA0 Appr-1-p processing domain protein n=13 Tax=Proteobacteria RepID=B2JCA0_BURP8 Length = 183 Score = 184 bits (467), Expect = 1e-45, Method: Composition-based stats. Identities = 78/164 (47%), Positives = 98/164 (59%), Gaps = 4/164 (2%) Query: 11 DITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLA 70 DIT L VD +VNAAN SL+GGGGVDGA+HRAAG LL C + G C TG A IT Sbjct: 15 DITTLDVDAVVNAANTSLLGGGGVDGALHRAAGADLLRECQTL----GGCVTGDAKITGG 70 Query: 71 GDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPR 130 L A+ V+H VGPVW GG + E +LL Y SL L S+AFPAIS GVY +P Sbjct: 71 HRLKARHVIHAVGPVWHGGGRGEAELLASCYRRSLELARDAKAKSIAFPAISCGVYRFPA 130 Query: 131 AAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 A IA++TV + + R + E+V F C+DE Y+ ++ Sbjct: 131 DEAVRIAMQTVIDTLPRVSTVERVIFACFDEAMHARYKAEFGRR 174 >UniRef50_Q9EYI6 UPF0189 protein in sno 5'region n=22 Tax=Bacteria RepID=Y189_STRNO Length = 181 Score = 184 bits (466), Expect = 1e-45, Method: Composition-based stats. 
Identities = 86/175 (49%), Positives = 104/175 (59%), Gaps = 6/175 (3%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVR--QQQGDC 60 T I +VQGDIT+ D +VNAAN SL+GGGGVDGAIHR GPA+L C +R + Sbjct: 2 TTITLVQGDITRQHADALVNAANSSLLGGGGVDGAIHRRGGPAILAECRALRASRYGEGL 61 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG AV T AGDL A+ V+HTVGPVW ++ LL Y SLRL +VAFPA Sbjct: 62 PTGRAVATTAGDLDARWVIHTVGPVWS-STEDRSDLLASCYRESLRLAGELGARTVAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 +STGVY +P AA IAV+TV T E+V FV +D + R L G Sbjct: 121 LSTGVYRWPMGDAARIAVETVR---TTPTAVEEVRFVLFDTHAYDTFARELGDAG 172 >UniRef50_C8VIG2 LRP16 family protein (AFU_orthologue; AFUA_3G13850) n=7 Tax=Trichocomaceae RepID=C8VIG2_EMENI Length = 374 Score = 184 bits (466), Expect = 2e-45, Method: Composition-based stats. Identities = 75/179 (41%), Positives = 105/179 (58%), Gaps = 12/179 (6%) Query: 1 MKTRIHVVQGDITKLA-VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + + +V+ DITKL VD IVNAA SL+GGGGVD AIH+AAGP LL C + Sbjct: 36 LNDTVAMVRHDITKLQGVDCIVNAAKRSLLGGGGVDYAIHKAAGPDLLKECRTLN----G 91 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVW----RGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 C TG A IT A +LP K ++HTVGP++ R G+ ++LL+ Y L + N S Sbjct: 92 CDTGDAKITNAYNLPNKRIIHTVGPIYSDAMRRGKDEPERLLRSCYRRCLEVAVENEMKS 151 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL---PEQVYFVCYDEENAHLYERLL 171 +AF AISTG+YGYP AA+ A+ +F+ E+V F ++ ++ YE+L+ Sbjct: 152 IAFNAISTGIYGYPSRDAAKAALDETRKFLETDKNTGLLERVIFCNFELKDVEAYEQLI 210 >UniRef50_A7IGI6 Appr-1-p processing domain protein n=53 Tax=cellular organisms RepID=A7IGI6_XANP2 Length = 193 Score = 183 bits (465), Expect = 2e-45, Method: Composition-based stats. 
Identities = 87/174 (50%), Positives = 107/174 (61%), Gaps = 4/174 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + R+ +V GDIT+LA+D IVNAAN SL+GGGGVDGAIHRAAGP LL C + G CP Sbjct: 19 QARLDIVVGDITRLALDAIVNAANSSLLGGGGVDGAIHRAAGPELLAYCRTL----GGCP 74 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG A +T LPA V+HTVGPVW GG E+ LL Y SL+L S+AFPAI Sbjct: 75 TGEARLTPGFRLPAAHVIHTVGPVWHGGGAGEEGLLGSCYRESLKLADGAGLASIAFPAI 134 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 STG+YG+P AA +AV TV + +V F C+ +E A L+ G Sbjct: 135 STGIYGFPADRAAPLAVGTVLAHLGAPGSVTRVVFCCFSQEAADLHHDAFRAHG 188 >UniRef50_Q985D2 UPF0189 protein mll7730 n=12 Tax=Bacteria RepID=Y7730_RHILO Length = 176 Score = 183 bits (464), Expect = 3e-45, Method: Composition-based stats. Identities = 94/170 (55%), Positives = 109/170 (64%), Gaps = 4/170 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI + GDITKL VD IVNAAN L+GGGGVDGAIHRAAG L C + C Sbjct: 6 DRIRIHTGDITKLDVDAIVNAANTLLLGGGGVDGAIHRAAGRELEVECRMLN----GCKV 61 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT LPA+ ++HTVGPVW+GG + E +LL Y +SL L AAN SVAFPAIS Sbjct: 62 GDAKITKGYKLPARHIIHTVGPVWQGGGKGEAELLASCYRSSLELAAANDCRSVAFPAIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 TGVY YP+ A IAV TVS I A+PE V F C+DE+ A LY R + Sbjct: 122 TGVYRYPKDEATGIAVGTVSMVIEEKAMPETVIFCCFDEQTAQLYLRAVA 171 >UniRef50_P67344 UPF0189 protein SA0314 n=54 Tax=Staphylococcus RepID=Y314_STAAN Length = 266 Score = 182 bits (463), Expect = 3e-45, Method: Composition-based stats. 
Identities = 67/182 (36%), Positives = 96/182 (52%), Gaps = 8/182 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 I V QGDIT L +D IVNAAN +G +D IH AG + C ++ +QQ Sbjct: 85 DNIFVWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQ 144 Query: 58 G-DCPTGHAVITLAGDLPAKAVVHTVGPVWR--GGEQNEDQLLQDAYLNSLRLVAANSYT 114 G + G A T +LPAK ++HTVGP R + LL YL+ L+L +S Sbjct: 145 GRNEGVGKAKKTRGYNLPAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLN 204 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 VAF ISTGV+ +P+ AAEIAV+TV ++ +V F + +++ LY+ L + Sbjct: 205 HVAFCCISTGVFAFPQDEAAEIAVRTVESYLKETNSTLKVVFNVFTDKDLQLYKEALNRD 264 Query: 175 GD 176 + Sbjct: 265 AE 266 >UniRef50_B8HYS5 Appr-1-p processing domain protein n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HYS5_CYAP4 Length = 187 Score = 182 bits (463), Expect = 3e-45, Method: Composition-based stats. Identities = 80/177 (45%), Positives = 109/177 (61%), Gaps = 10/177 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDC 60 R V+QGDIT L V+ IVNAAN L GGGV GAI RAAG L AC ++ G C Sbjct: 11 DPRFQVIQGDITTLEVEAIVNAANNELKPGGGVCGAIFRAAGYKQLQQACEQI----GYC 66 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A+IT +LPA+ +VHTVGPV+ G ++LL Y N L+ S +S+AFP Sbjct: 67 PTGEALITPGFNLPAQWIVHTVGPVY-GVTWASEELLARCYRNCLQFAGEESLSSIAFPL 125 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENA----HLYERLLTQ 173 ISTG+YG+P AAEIA++ + ++ ++ +QVY VCY E+ +Y+R+ + Sbjct: 126 ISTGIYGFPLEPAAEIAIREILTGLSCYSEIKQVYLVCYTPESYAAVLQIYDRICQK 182 >UniRef50_Q1R0S7 Appr-1-p processing n=12 Tax=Proteobacteria RepID=Q1R0S7_CHRSD Length = 183 Score = 182 bits (462), Expect = 4e-45, Method: Composition-based stats. 
Identities = 78/174 (44%), Positives = 106/174 (60%), Gaps = 5/174 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 R+ VV GDIT+L VD IVNAAN SLMGGGGVDGAI+RAAGPAL AC +R+ P Sbjct: 9 RVDVVSGDITRLDVDAIVNAANHSLMGGGGVDGAIYRAAGPALKRACRALRETHWPDGLP 68 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G +T +LPA+ V+HTVGPV+ +++ LL + Y N++ L A +AFPAI Sbjct: 69 DGEVALTEGFELPARYVIHTVGPVY-AKTRDKSHLLANCYRNAVALAAETGCRRIAFPAI 127 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 STGVYGYP AA I + T+ + + H +V + E + + + ++G Sbjct: 128 STGVYGYPFDDAAHIVIDTLHDALAIHD--LRVTLCFFSERDYQAFAEIAMRRG 179 >UniRef50_D1ZDH8 Whole genome shotgun sequence assembly, scaffold_20 n=4 Tax=cellular organisms RepID=D1ZDH8_SORMA Length = 261 Score = 182 bits (462), Expect = 5e-45, Method: Composition-based stats. Identities = 82/175 (46%), Positives = 108/175 (61%), Gaps = 8/175 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + RI + GDITKL +D IVNAAN SL+GGGGVDGAIHRAAGP LL C + + C Sbjct: 90 LNKRIAIHTGDITKLHIDAIVNAANNSLLGGGGVDGAIHRAAGPQLLREC----RTKRTC 145 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG AV+T A +LP V+HTVGPV+ G +E + LL YL SL++ A T++AFP Sbjct: 146 DTGDAVMTEAYNLPCAKVIHTVGPVYSGVNHDECEKLLISCYLRSLQIAAETGLTTIAFP 205 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHAL---PEQVYFVCYDEENAHLYERLL 171 +ISTGVYGYP AA+ A+ + F+T +V V + +++ Y L Sbjct: 206 SISTGVYGYPSKEAAQAALAAIRHFLTDPKTRNAITKVIIVTFVDKDTRAYTEWL 260 >UniRef50_B6SKT6 Protein LRP16 n=12 Tax=cellular organisms RepID=B6SKT6_MAIZE Length = 239 Score = 180 bits (456), Expect = 2e-44, Method: Composition-based stats. 
Identities = 78/179 (43%), Positives = 109/179 (60%), Gaps = 10/179 (5%) Query: 5 IHVVQGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-- 58 + + +GDIT +VD IVNAAN ++GGGGVDGAIH+AAGP L+ AC KV + + Sbjct: 62 LKLHKGDITLWSVDCATDAIVNAANERMLGGGGVDGAIHQAAGPELVQACRKVPEVKPGV 121 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 CPTG A IT A +LPA V+HTVGP++ + E L+ AY NSL+L N +AF Sbjct: 122 RCPTGEARITPAFELPASRVIHTVGPIYDLDKHPEVS-LKKAYENSLKLAKDNGIQYIAF 180 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 PAIS GVY YP A++IAV T +F ++V+FV + ++ +++ Q + Sbjct: 181 PAISCGVYRYPPKEASKIAVSTAQKFSED---IKEVHFVLFSDDLYNIWRETAQQLLSQ 236 >UniRef50_Q03IQ8 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 n=5 Tax=Streptococcus RepID=Q03IQ8_STRTD Length = 260 Score = 179 bits (455), Expect = 3e-44, Method: Composition-based stats. Identities = 75/179 (41%), Positives = 107/179 (59%), Gaps = 7/179 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 RI++ +GDIT+L +D IVNAAN +L+G VD AIH AG L AC ++ + Sbjct: 82 DKRIYLWKGDITRLEIDAIVNAANKTLLGCMKPLHNCVDNAIHTYAGVQLRQACFELILE 141 Query: 57 QGDC-PTGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYT 114 QG P G A IT A +LP+ V+HTVGP ++ LL +YL+ L L N Sbjct: 142 QGYEEPVGMAKITPAYNLPSAFVIHTVGPKIGNQVTPIDEDLLIKSYLSVLALAEKNKIE 201 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S+A P ISTG + +P+ AAEIA+KTV FI + ++V F +D+EN ++Y++LL + Sbjct: 202 SIAIPCISTGDFNFPKQKAAEIAIKTVKSFIDHSEIVKKVIFNVFDDENLNIYQKLLAE 260 >UniRef50_A7BY23 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BY23_9GAMM Length = 708 Score = 179 bits (455), Expect = 3e-44, Method: Composition-based stats. 
Identities = 64/174 (36%), Positives = 98/174 (56%), Gaps = 7/174 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +IH++QG+IT+ VD IVN + SL G G +D AI A G L +AC ++ G C Sbjct: 532 KIHIIQGNITQQKVDAIVNTTDRSLSGSGAIDYAIQNAGGIELKEACRQL----GTCSVA 587 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT +LPA+ V+HTVGP W GG Q E + L Y N L L + +AFP I Sbjct: 588 EAKITEGYNLPAQFVIHTVGPNWEGGNQKEAEKLAQCYRNCLALAEQQGFKIIAFPTIGV 647 Query: 124 GVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYE--RLLTQQ 174 G G+ AA++A+ +S F+ +++ E+V VC+++ ++ +LL ++ Sbjct: 648 GGLGFSHELAAKVAIYEISSFLQQKNSSLEKVILVCFNQRVYEHFQETKLLLER 701 >UniRef50_C7N880 Predicted phosphatase, C-terminal domain of histone macro H2A1 like protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N880_SLAHD Length = 263 Score = 179 bits (453), Expect = 5e-44, Method: Composition-based stats. Identities = 73/179 (40%), Positives = 101/179 (56%), Gaps = 8/179 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + R+ V QGDIT+L D IVNAAN ++G +D IH AG L + C ++ + Sbjct: 83 LDQRLSVWQGDITRLRADAIVNAANSQMLGCWAKCHSCIDNVIHTYAGVQLREECDRIMR 142 Query: 56 QQG-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSY 113 QG + PTGHA +T A +LP+K V+HTVGP+ +G +L L Y + L AA Sbjct: 143 AQGENEPTGHAKVTGAYNLPSKHVIHTVGPIAQGHPTARHRLQLAQCYTSCLDAAAATGC 202 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLL 171 S+AF ISTGVYG+P AA IAV TV +++ RH P V F + +Y+ +L Sbjct: 203 ESIAFCGISTGVYGFPAEQAAPIAVDTVRDWLDRHPDVPMHVVFNVFGNRQLSIYQDIL 261 >UniRef50_C4Q6S1 Expressed protein n=1 Tax=Schistosoma mansoni RepID=C4Q6S1_SCHMA Length = 224 Score = 178 bits (452), Expect = 6e-44, Method: Composition-based stats. 
Identities = 77/206 (37%), Positives = 108/206 (52%), Gaps = 39/206 (18%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI + +GDIT L +D I NAAN L GGGGVDGAIHRAAG LL+AC K+ C Sbjct: 25 LGSRISLWRGDITHLQIDAIANAANSQLRGGGGVDGAIHRAAGSQLLEACQKLS----GC 80 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A +T +LP+K V+H VGPV D L+ Y +L L + ++ S+AFP Sbjct: 81 PTGDAKLTPGFNLPSKYVIHCVGPV-----GRNDVALESTYRKALELCSEHNIQSIAFPC 135 Query: 121 ISTGVY------------------------------GYPRAAAAEIAVKTVSEFITRHAL 150 ISTGVY +P AAA++A+ TV ++ H Sbjct: 136 ISTGVYEVQKTRENKKRIDLIKGLDDQIFKPDFPDDCFPNEAAAKVALHTVLSYLKSHQE 195 Query: 151 PEQVYFVCYDEENAHLYERLLTQQGD 176 ++V F + + + +YE L+ + D Sbjct: 196 IQRVIFCIFMDVDYKIYENLIPEMLD 221 >UniRef50_B5YAF3 Conserved protein n=2 Tax=Dictyoglomus RepID=B5YAF3_DICT6 Length = 182 Score = 178 bits (451), Expect = 9e-44, Method: Composition-based stats. Identities = 66/171 (38%), Positives = 97/171 (56%), Gaps = 3/171 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++ VV+GDIT+ V+ IVNAAN L GGGV GAI RA G + + ++ G P Sbjct: 12 VKLKVVKGDITQEEVEAIVNAANSYLKHGGGVAGAIVRAGGEVIQKESDEYVEKYGPLPV 71 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT AG L AK V+HTVGP W GE +E++ L+ A + L L + S++ PA+S Sbjct: 72 GSATITSAGKLKAKYVIHTVGPRW--GEGDEEKKLEKAIESVLTLAKEKNIKSLSIPAVS 129 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLT 172 G++G+P +I V V EF+ + E+++F+ +E L+ L Sbjct: 130 CGIFGFPPQLGTKIIVNKVVEFLKDNPGVFEEIHFIGIGDEIPTLFVDALK 180 >UniRef50_B9XAD9 Appr-1-p processing domain protein n=1 Tax=bacterium Ellin514 RepID=B9XAD9_9BACT Length = 184 Score = 178 bits (451), Expect = 9e-44, Method: Composition-based stats. 
Identities = 71/177 (40%), Positives = 101/177 (57%), Gaps = 8/177 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 KT ++ GDI D +V AA+ L G G DG IH GP + + C ++ G CP Sbjct: 7 KTLFELITGDIADQETDAVVTAAHWKLNKGSGTDGVIHTRGGPQIYEECRRI----GGCP 62 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT G+L AK V+H VGPVWRGG+++E +LL AY SL + + S++FP+I Sbjct: 63 IGDAVITTGGNLKAKHVIHAVGPVWRGGDEHEPELLASAYRRSLEVATEHKLKSISFPSI 122 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCY---DEENAHLYERLLTQQ 174 STG + YP AA IA+KT+ +++ E V V Y D++ +YE+ L + Sbjct: 123 STGAFVYPIKLAAPIALKTICDYLQKEQHTLEFVRLVLYTREDDKAFLVYEKALQEL 179 >UniRef50_Q4P1I0 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P1I0_USTMA Length = 220 Score = 177 bits (450), Expect = 1e-43, Method: Composition-based stats. Identities = 75/170 (44%), Positives = 102/170 (60%), Gaps = 8/170 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + GDIT L++D IVNAAN SL+GGGGVDGAIHRAAG L+ C K+ C TG Sbjct: 38 LSIFTGDITTLSIDAIVNAANNSLLGGGGVDGAIHRAAGRELVVECGKLN----GCETGS 93 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPAIST 123 A TL LP+K V+HTVGPV+ E + LL+ AY +SL + S+AFP+IST Sbjct: 94 AKTTLGYALPSKHVIHTVGPVYNSSRHEECERLLRSAYRSSLEELRKIGAKSIAFPSIST 153 Query: 124 GVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERL 170 GVYGYP AA A+ + ++ H E++ C+ +++ + Y L Sbjct: 154 GVYGYPFDTAATAALDEIGSWLESNENHKHIERIVLCCFSQKDYNKYLEL 203 >UniRef50_B9S4E3 Protein LRP16, putative n=2 Tax=cellular organisms RepID=B9S4E3_RICCO Length = 269 Score = 177 bits (450), Expect = 1e-43, Method: Composition-based stats. 
Identities = 76/176 (43%), Positives = 107/176 (60%), Gaps = 10/176 (5%) Query: 5 IHVVQGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-- 58 + + +GDITK VD IVN AN ++GGGG DGAIHRAAGP L+DAC KV + + Sbjct: 95 LKINKGDITKWFVDGSSDAIVNPANEKMLGGGGADGAIHRAAGPELVDACYKVPEVRPGI 154 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 CPTG A IT LPA V+HTVGP++ N +L++AY NSL + N+ +AF Sbjct: 155 RCPTGEARITPGFKLPASHVIHTVGPIYDANR-NSAAILKNAYRNSLSVAKDNNIKFIAF 213 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 PAIS GVY YP AA +++ T+ EF ++V+FV + +E +++ + + Sbjct: 214 PAISCGVYLYPFEEAASVSISTIKEFADD---IKEVHFVLFSDEIFNVWVKKAKEL 266 >UniRef50_Q9HJ67 UPF0189 protein Ta1105 n=1 Tax=Thermoplasma acidophilum RepID=Y1105_THEAC Length = 196 Score = 177 bits (449), Expect = 1e-43, Method: Composition-based stats. Identities = 76/173 (43%), Positives = 102/173 (58%), Gaps = 2/173 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CPT 62 + V GDIT+ + IVNAAN SLMGGGGVDGAIH AAGP L +K+R+++ P Sbjct: 11 LAVEVGDITESDAEAIVNAANSSLMGGGGVDGAIHSAAGPELNGELVKIRRERYPNGLPP 70 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AVIT L A ++HTVGPVW GG ED +L +Y + L L +AFPA+S Sbjct: 71 GEAVITRGYRLKASHIIHTVGPVWMGGRNGEDDVLYRSYRSCLDLAREFGIHDIAFPALS 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 TG YG+P A IA+++V +F+ + V FV Y E+ + +L+ G Sbjct: 131 TGAYGFPFDRAERIAIRSVIDFLKDESAGYTVRFVFYTEDQGKRFLFILSDLG 183 >UniRef50_A0Q2I9 Appr-1-p processing enzyme family protein n=3 Tax=Clostridia RepID=A0Q2I9_CLONN Length = 183 Score = 177 bits (448), Expect = 2e-43, Method: Composition-based stats. 
Identities = 56/177 (31%), Positives = 95/177 (53%), Gaps = 3/177 (1%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I + +GDIT + D IVN AN L GGGV AI + G + + K+ +++G P Sbjct: 6 NKEIIIKKGDITNESSDAIVNPANGMLKHGGGVAAAIVKKGGREVQEESNKIVRKEGIIP 65 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 TG AVIT +LP K ++H VGP GE +E L++A L++L L ++ S++ PAI Sbjct: 66 TGGAVITKGYNLPCKYIIHAVGPRM--GEGDEKLKLKNAVLSALCLAEQHNLKSISIPAI 123 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQGDE 177 S+G++ +P+ A+I + T +F+ + + D++ ++ + + E Sbjct: 124 SSGIFRFPKDECAKILINTSIKFLQTSAKSLKTIVMCNLDDKTYEIFLQEEKEILKE 180 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 176 bits (447), Expect = 2e-43, Method: Composition-based stats. Identities = 67/165 (40%), Positives = 94/165 (56%), Gaps = 4/165 (2%) Query: 10 GDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 GDITK + IVN+ + +L G + AIH+AAGP LL AC ++ C G A +T Sbjct: 425 GDITKEKAEAIVNSTDRNLSNSGALSRAIHQAAGPELLQACQDLQ----GCTVGGAKLTP 480 Query: 70 AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYP 129 +L A V+HTV P W+GG Q E++LL Y N L+L + S S+AFPAI+ G G+P Sbjct: 481 GFNLRANWVIHTVAPKWKGGNQGEEELLVSCYQNCLQLAVSQSIRSLAFPAIACGAMGFP 540 Query: 130 RAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 AA IA++TVS F+ + V F+C D+E Y+ + Sbjct: 541 PEIAARIALETVSNFLLSNMAIGSVAFICADKETLQYYQEAFQRV 585 >UniRef50_UPI000194CBCB PREDICTED: poly (ADP-ribose) polymerase family, member 14 n=1 Tax=Taeniopygia guttata RepID=UPI000194CBCB Length = 1883 Score = 176 bits (447), Expect = 2e-43, Method: Composition-based stats. 
Identities = 56/177 (31%), Positives = 92/177 (51%), Gaps = 4/177 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + D+ VDV+VNA+N L GG+ A+ RAAGP L + C ++ ++ G+ G Sbjct: 878 IALYNADLCTHPVDVVVNASNEKLKHIGGLADALSRAAGPVLQEECDELVRKLGNLQPGC 937 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT AG LP K V+H VGP W LL+ L+L A+ + S+A PAIS Sbjct: 938 AVITHAGKLPCKNVIHAVGPRWSAENSVMCVWLLRKTVKKCLQLAEAHKHCSIALPAISG 997 Query: 124 GVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQGDE 177 G++G+P + ++ E + ++ ++V+ V + ++N + + + E Sbjct: 998 GIFGFPMELCTYSIISSIKETLEESKGNSTLKEVHLVGFAQDNIQAFSKAFKEVFSE 1054 Score = 114 bits (285), Expect = 1e-24, Method: Composition-based stats. Identities = 50/172 (29%), Positives = 77/172 (44%), Gaps = 16/172 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 V +GDITK D IVN N + GV AI AG A+ D C + QQ G + Sbjct: 1307 FRVAEGDITKEEGDAIVNITNQAFNLKTGVSRAILNGAGKAVEDECGVLAQQTGK----N 1362 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +IT AG+LP K ++H V ++ L+ YTSVAFPAI TG Sbjct: 1363 YIITQAGNLPCKKIMHFV----------YQNDIRSLVSQVLQECELQQYTSVAFPAIGTG 1412 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQ 174 A A+ + V++F R++ + + V + +++ + ++ Sbjct: 1413 EARRNAAEVADNMIDAVTDFAKRNSATSVKTIKVVIFQPHLMSVFQASMQKR 1464 Score = 101 bits (252), Expect = 9e-21, Method: Composition-based stats. 
Identities = 45/177 (25%), Positives = 76/177 (42%), Gaps = 9/177 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I + G I A ++V + L G + A+ AGP L K + G P Sbjct: 1093 NIVLQTGSIEDAATSIVVVSVGKDLQLDKGPLGKALLSKAGPMLQTGLNK--EGGGRMPE 1150 Query: 63 -GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G + T +L V+H V P+W ++L D L + S S+ FPAI Sbjct: 1151 EGSVLKTKGYNLACSVVLHAVVPMWSQKNTPS-KVLGDIITKCLEIAEELSLKSITFPAI 1209 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHL--YERLLTQQ 174 TG +PR+ A++ V EF + E+V+F+ + ++ A++ + L ++ Sbjct: 1210 GTGNLEFPRSVVAKLLFDKVFEFSSEKRVNSLEEVHFLLHTKDTANIQEFSDELEKR 1266 >UniRef50_C2LSS3 Protein in Tap1-dppD intergenic region n=1 Tax=Streptococcus salivarius SK126 RepID=C2LSS3_STRSL Length = 254 Score = 176 bits (446), Expect = 3e-43, Method: Composition-based stats. Identities = 70/180 (38%), Positives = 102/180 (56%), Gaps = 8/180 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 ++ +++ QGDIT+LA D IVNAAN L+G +D AIH AAG L AC ++ Q Sbjct: 76 IRPNLYLWQGDITRLAADAIVNAANSKLLGCFVPNHSCIDNAIHTAAGVELRLACQELMQ 135 Query: 56 QQG-DCPTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSY 113 +QG D TG A +T A +LP++ V+HTVGP+ + E Q L +Y L L Sbjct: 136 EQGEDETTGQAKMTKAYNLPSRYVLHTVGPIIYDEVTDLERQQLASSYEECLNLAYEKGL 195 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S+AF ISTG + +P AA+IA++TV +F H+ V F + + + +Y+ LL Sbjct: 196 RSLAFCCISTGEFRFPNEEAAKIAIETVLQFQKEHSDMV-VIFNVFKDLDYAIYQSLLKN 254 >UniRef50_A6GJ81 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GJ81_9DELT Length = 173 Score = 175 bits (445), Expect = 4e-43, Method: Composition-based stats. 
Identities = 76/171 (44%), Positives = 105/171 (61%), Gaps = 4/171 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-D 59 M I + +GDIT+++ D IVNAANP ++GGGGVDGAIHRAAGP LL AC +V + G Sbjct: 1 MAPSITLERGDITRVSCDAIVNAANPKMLGGGGVDGAIHRAAGPELLAACRRVPKVNGIR 60 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 CP G A IT A L A+ V+H VGP++ ++ +L AY ++L L AA+ T +A P Sbjct: 61 CPFGEARITPAFGLDARWVIHAVGPIY-ARSEDPKGVLARAYASALELAAAHDVTELACP 119 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 A+STG YG+P AA IA++TV+ +V FV + E + + Sbjct: 120 ALSTGAYGFPLDPAARIALETVAS--RDWGCVARVRFVLFTAEVMAAFAKF 168 >UniRef50_B8LP86 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=B8LP86_PICSI Length = 231 Score = 175 bits (445), Expect = 4e-43, Method: Composition-based stats. Identities = 74/179 (41%), Positives = 104/179 (58%), Gaps = 10/179 (5%) Query: 5 IHVVQGDITKLAVD----VIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-- 58 + + +GDITK VD IVNAAN L+GGGGVDGAIHRAAGP LL AC + + Sbjct: 56 LLLHRGDITKWTVDGHTDAIVNAANERLLGGGGVDGAIHRAAGPDLLKACRQFPKVSRGI 115 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 CP G A IT +LP ++HTVGPV+ E++ + L DAY +SL + N +AF Sbjct: 116 RCPVGSARITRGFNLPVSRIIHTVGPVYD-MEEDPESKLADAYRSSLNITRENEVKYIAF 174 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 PAIS G+YGYP AA +++ TV + I ++V+FV ++ + + ++ Sbjct: 175 PAISCGIYGYPYEEAAAVSLTTVRDSIKD---LKEVHFVLFEMPAWEAWLEKANELFEQ 230 >UniRef50_C8WYT5 Appr-1-p processing domain protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8WYT5_DESRD Length = 188 Score = 175 bits (445), Expect = 5e-43, Method: Composition-based stats. 
Identities = 77/178 (43%), Positives = 103/178 (57%), Gaps = 5/178 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 R+ + QGDIT V +VNAAN L GGGGVDGA+ RAAGP LL A + ++ G G Sbjct: 12 RLEIRQGDITAAEVGAVVNAANSRLAGGGGVDGALQRAAGPQLLQAGQEYVREHGALSVG 71 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV+T LPA V+HTVGP+WRGG NE+ LL+ AY N L++ S+AFPAIS Sbjct: 72 DAVVTPGFALPASQVIHTVGPIWRGGGHNEEALLERAYANCLQVAKDQGIQSIAFPAISC 131 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY----ERLLTQQGDE 177 GVYG+P AA IA+ + + R V Y + ++ +RL+ + +E Sbjct: 132 GVYGFPEKRAAAIAIPVIVAALERD-AVSSVALYLYSNPSYAVWYNEAQRLIGAEHEE 188 >UniRef50_C9LYS3 Appr-1-p processing enzyme family domain protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LYS3_9FIRM Length = 302 Score = 175 bits (443), Expect = 7e-43, Method: Composition-based stats. Identities = 73/190 (38%), Positives = 96/190 (50%), Gaps = 19/190 (10%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 + QGDIT+LAVD IVNAAN +L G +D AIH AAG AL AC ++ ++ Sbjct: 112 DENFVLWQGDITRLAVDAIVNAANSALRGCFVPLHRCIDNAIHSAAGLALRAACDEIMRE 171 Query: 57 QGD-CPTGHAVITLAGDLPAKAVVHTVGPV-------------WRGGEQNEDQLLQDAYL 102 QG P G A IT +LPA+ V+HTVGP+ + G Q L Y Sbjct: 172 QGHPEPAGRAKITPGFNLPARHVLHTVGPIIAPAGSPVHEPGVFAGVTHEAQQCLVSCYR 231 Query: 103 NSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEE 162 L L A SVAF ISTG + YP AAE AV T ++ H P ++ F + +E Sbjct: 232 ACLDLAAERRLASVAFCCISTGEFHYPPQEAAETAVATCRAWLQAHDTPMRIVFNVFKDE 291 Query: 163 NAHLYERLLT 172 + +Y R+ Sbjct: 292 DLAIYRRIFQ 301 >UniRef50_C9KLM2 Appr-1-p processing enzyme family domain protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KLM2_9FIRM Length = 262 Score = 174 bits (442), Expect = 9e-43, Method: Composition-based stats. 
Identities = 68/181 (37%), Positives = 98/181 (54%), Gaps = 9/181 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + R+ + QGDIT+L +D IVNAAN ++G +D AG + C K+ Q Sbjct: 81 LDPRLVLWQGDITRLRIDAIVNAANRQMLGCFLPNHNCIDNIEQTMAGVEMRYNCYKLMQ 140 Query: 56 QQGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSY 113 QG PTG IT LPA+ V+HTVGP+ +G +LL Y + L L A + Sbjct: 141 AQGHDEPTGKVKITSGYHLPARFVLHTVGPIVQGSLTDEHRRLLASCYESCLTLAAEHGL 200 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 VAF ISTGV+ +P+ AAA IAV+TV ++ H ++V F +++ + +YE LL Sbjct: 201 KGVAFCCISTGVFRFPKDAAAHIAVRTVQHWLDVHPAASIKRVIFDVFEDADRRIYENLL 260 Query: 172 T 172 Sbjct: 261 N 261 >UniRef50_B8DKL2 Appr-1-p processing domain protein n=3 Tax=Desulfovibrio RepID=B8DKL2_DESVM Length = 202 Score = 173 bits (438), Expect = 3e-42, Method: Composition-based stats. Identities = 75/167 (44%), Positives = 95/167 (56%), Gaps = 1/167 (0%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GD+ A D +VNAAN L GGGGVDGA+HRAAGP LL A + ++G G Sbjct: 17 LAVSTGDLAATATDAVVNAANAELRGGGGVDGALHRAAGPMLLPAGRDIVARRGPLAAGE 76 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AVIT +LPA+ V+H VGP+WRGG E Q L + NSLRL A + VAFPAIS G Sbjct: 77 AVITPGFNLPARHVIHAVGPIWRGGTHGEPQALAAVHANSLRLAAEHGLARVAFPAISCG 136 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 YGYP AA IA+ + L +V FV + + ++ Sbjct: 137 SYGYPPELAAPIALAEAVRGLRA-GLVREVRFVLHGQAMLAVWRTAF 182 >UniRef50_Q93RG0 UPF0189 protein in tap1-dppD intergenic region n=14 Tax=Bacteria RepID=Y189_TREMD Length = 261 Score = 173 bits (438), Expect = 3e-42, Method: Composition-based stats. 
Identities = 71/174 (40%), Positives = 99/174 (56%), Gaps = 7/174 (4%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQGDC 60 +V +GDIT L VD IVNAAN + G +D IH AG L C + Q+QG Sbjct: 88 YVWRGDITTLKVDAIVNAANSGMTGCWQPCHACIDNCIHTFAGVQLRTVCAGIMQEQGHE 147 Query: 61 -PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 PTG A IT A +LP K V+HTVGP+ G + LL ++Y + L L A N S+AF Sbjct: 148 EPTGTAKITPAFNLPCKYVLHTVGPIISGQLTDRDCTLLANSYTSCLNLAAENGVKSIAF 207 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTGV+ +P AAEIAV TV ++ ++ ++ F + E++ LY +L++ Sbjct: 208 CCISTGVFRFPAQKAAEIAVATVEDWKAKNNSAMKIVFNVFSEKDEALYNKLMS 261 >UniRef50_Q0B030 Phosphatase n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0B030_SYNWW Length = 176 Score = 172 bits (437), Expect = 3e-42, Method: Composition-based stats. Identities = 82/176 (46%), Positives = 101/176 (57%), Gaps = 6/176 (3%) Query: 3 TRIHVVQGDIT-KLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I VVQGDIT + + VIVNAAN SL GGGGVDGAIHRAAGP L + Sbjct: 6 VEIQVVQGDITRQEDMAVIVNAANSSLRGGGGVDGAIHRAAGPELKKESSALA----PIG 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT A LP + V+H VGPV+ G + ED+LL Y N+LRL S+AFPAI Sbjct: 62 PGQAVITGAYRLPNRYVIHCVGPVY-GVHKPEDELLASCYRNALRLAEKQQLDSIAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 STGVYGYP AA++ KT+ E I +++ V +D L+ + L E Sbjct: 121 STGVYGYPMREAAQVMFKTIIEVIPELKHIKKIRIVLFDHPAYELHRQALEACHTE 176 >UniRef50_UPI00006A2284 UPI00006A2284 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2284 Length = 694 Score = 172 bits (437), Expect = 3e-42, Method: Composition-based stats. 
Identities = 57/179 (31%), Positives = 94/179 (52%), Gaps = 4/179 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + V + D+ + +VDV+VNAAN L GG+ GA+ RAAGP L C ++ + +G Sbjct: 1 VTVAVYKDDLARHSVDVVVNAANEDLKHIGGLAGALLRAAGPKLQTDCDQIIKIRGRLSA 60 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT AG+LP K V+H VGPVW D+ L A + L L A + S+ PA+ Sbjct: 61 GDAVITDAGNLPCKQVIHAVGPVWNAFFPGKCDRQLHKAITSCLDLAARKGHRSIGIPAV 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQGDE 177 S+G++G+P + ++ ++ H+ +Q++ V + + L + ++ Sbjct: 121 SSGIFGFPLKRCVTHILGSIKAYVEDNSAHSTIKQIHLVALESATVQAFTDALRAESEQ 179 Score = 116 bits (291), Expect = 3e-25, Method: Composition-based stats. Identities = 49/176 (27%), Positives = 74/176 (42%), Gaps = 8/176 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V+Q I DVIVN L + + A+ AGP L Q P G Sbjct: 193 IKVIQQAIEDSTTDVIVNNVGQKLQLNEWQISRALAARAGPQLQQLLSNSSQGASA-PNG 251 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 T +L V+H V P W Q+L+ + + L+L S S++ PAI T Sbjct: 252 SVFSTDGCNLNCAKVLHVVMPQWDRRT----QVLRKSIKSCLKLTEQQSLQSISIPAIGT 307 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHLYERLLTQQGDE 177 G GYP+ A + K + F ++ ++V V + D EN ++ + L + E Sbjct: 308 GKLGYPKDLVAAVTFKEILHFSSKAQSLQEVNIVLHPRDTENIQVFSKELQRLCRE 363 >UniRef50_UPI0000ECB76F Poly [ADP-ribose] polymerase 14 (EC 2.4.2.30) (PARP-14) (B aggressive lymphoma protein 2). n=2 Tax=Gallus gallus RepID=UPI0000ECB76F Length = 1636 Score = 172 bits (436), Expect = 5e-42, Method: Composition-based stats. 
Identities = 56/177 (31%), Positives = 90/177 (50%), Gaps = 4/177 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V + D+ VDV+VNA+N L GG+ A+ +AAGP L C V + G G Sbjct: 637 IAVYKADLCTHHVDVVVNASNEDLKHIGGLAWALLQAAGPELQAECDGVVRMSGSLQAGD 696 Query: 65 AVITLAGDLPAKAVVHTVGPVWR-GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT AG LP K V+H VGP W+ + LL+ SL+L ++ S+AFP++S Sbjct: 697 AVITGAGKLPCKQVIHAVGPRWKEQDAEKCVYLLKKTIKKSLQLAETYNHRSIAFPSVSG 756 Query: 124 GVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQGDE 177 G++G+P V + + + + ++++ V E+N + + L + + Sbjct: 757 GIFGFPLHKCVNAIVSAIKKTLEEFKRDSSLKEIHLVDITEDNVQAFIKALKEVFSD 813 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 43/177 (24%), Positives = 71/177 (40%), Gaps = 8/177 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I + +G+I + D +V + L G + A+ AGP L + G P Sbjct: 847 NIMLKKGNIEDASTDGVVISVGGDLQLEKGQLAKALLSKAGPRLQSDLND--EGLGKSPV 904 Query: 63 -GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G T +L V H V P W G ++ ++L L+ S S+ FPAI Sbjct: 905 EGSVFTTRGYNLSCCYVFHAVTPGWSQGSESAVKILGKIVTKCLQTAEELSLKSITFPAI 964 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCY--DEENAHLYERLLTQQ 174 TG+ G+P + A+ V EF + +V+F+ + D N + ++ Sbjct: 965 GTGILGFPSSVVAKSLFDKVYEFSSKKKTNSLREVHFLLHPKDVNNIQAFSNEFERR 1021 Score = 108 bits (269), Expect = 1e-22, Method: Composition-based stats. 
Identities = 44/178 (24%), Positives = 74/178 (41%), Gaps = 16/178 (8%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 V GDITK DVIVN +N + GV AI AG + + C ++ Q P Sbjct: 1057 SITFQVAAGDITKETGDVIVNISNQAFNLKTGVSKAILEGAGKEVENECAELALQ----P 1112 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 + T AG LP K ++H V ++ L+ YTSV FPAI Sbjct: 1113 NDGYITTEAGSLPCKKIIHFVA----------RDDIKVPVSKVLQECELQQYTSVTFPAI 1162 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQGDE 177 TG G A+ + +++F ++ + + V + +++ + ++ ++ Sbjct: 1163 GTGQAGRFPDLVADEMMDAITDFARSNSTPSVKTIKIVIFQPHLLNVFHTSMKKREND 1220 >UniRef50_B8I4Z8 Appr-1-p processing domain protein n=7 Tax=Bacteria RepID=B8I4Z8_CLOCE Length = 341 Score = 172 bits (436), Expect = 5e-42, Method: Composition-based stats. Identities = 72/168 (42%), Positives = 94/168 (55%), Gaps = 6/168 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +V+ DITKL VD IVNAAN L GGGV GAI +AAG A L A V + TG Sbjct: 3 FIIVRQDITKLKVDAIVNAANTDLRMGGGVCGAIFKAAGAAQLQA---VCDKLAPIKTGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 VIT +L AK V+H GPV+ + +Q L+ AY NSL+ N S+AFP IS+ Sbjct: 60 VVITPGFNLSAKFVIHAAGPVYRHWNREQGEQYLRAAYTNSLKCAVENKCESIAFPLISS 119 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 G+YGYP+ A +A + FIT H + V V +D+ + +LL Sbjct: 120 GIYGYPKDEALRVATSEIHNFITDHDI--DVTLVVFDKSAFTVSRKLL 165 >UniRef50_D1U7C0 Appr-1-p processing domain protein n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U7C0_9DELT Length = 186 Score = 172 bits (436), Expect = 5e-42, Method: Composition-based stats. 
Identities = 77/170 (45%), Positives = 104/170 (61%), Gaps = 7/170 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQ-----QQ 57 ++ + QGDIT L VD +VNAANP L GGGGVDGAIHRAAG L AC + Sbjct: 10 QLVIRQGDITTLDVDCVVNAANPQLAGGGGVDGAIHRAAGIAQLRQACQAIIDDPGQLPT 69 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 G P G AV+TL DLPA+ ++HTVGP+WRGG E + L+ +Y +SL+L ++ ++A Sbjct: 70 GQLPVGQAVLTLGFDLPARYIIHTVGPIWRGGVHGESEQLRSSYQSSLKLAHQHALATIA 129 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLY 167 FPA+S G YGYP AA IA+ + + + L QV+ V +D + Sbjct: 130 FPALSCGAYGYPIPQAARIALDAIRQGLLD-GLAAQVHMVLHDHAACETW 178 >UniRef50_Q8ZXT3 UPF0189 protein PAE1111 n=10 Tax=Thermoprotei RepID=Y1111_PYRAE Length = 182 Score = 172 bits (435), Expect = 5e-42, Method: Composition-based stats. Identities = 58/172 (33%), Positives = 83/172 (48%), Gaps = 3/172 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + +++GDIT++ D IVNAAN L GGGV GAI R G + + + ++ G P Sbjct: 8 VEVVLMRGDITEVEADAIVNAANSYLEHGGGVAGAIVRKGGQVIQEESREWVRKHGPVPV 67 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G +T AG L AK V+H VGP + L +A N+L S+A PAIS Sbjct: 68 GDVAVTSAGRLKAKYVIHAVGPRC---GVEPIEKLAEAVKNALLKAEELGLVSIALPAIS 124 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 TG++G P AAAE + E ++ V Y EE + + + Sbjct: 125 TGIFGCPYDAAAEQMATAIREVAPALRSIRRILVVLYGEEAYQKFLEVFKKH 176 >UniRef50_UPI000186F16D conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186F16D Length = 367 Score = 172 bits (435), Expect = 6e-42, Method: Composition-based stats. 
Identities = 71/148 (47%), Positives = 92/148 (62%), Gaps = 9/148 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M RI + +GDIT L VD IVNAAN SL+GGGGVDGAIHR AG LL+ C + C Sbjct: 56 MNDRISLWKGDITTLGVDAIVNAANSSLLGGGGVDGAIHRKAGKFLLEECKTLN----GC 111 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A IT +LP+K V+HTVGP + + LL+ Y + L+ N+ S+AFP Sbjct: 112 PTGSAKITGGYNLPSKYVIHTVGP-----QGEKPDLLESCYKSCFHLMLDNNLESIAFPC 166 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH 148 ISTG+YGYP+ AA +A+ F+ + Sbjct: 167 ISTGIYGYPQGPAAVVALTCARNFLESN 194 >UniRef50_C4FEN5 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FEN5_9BIFI Length = 173 Score = 172 bits (435), Expect = 7e-42, Method: Composition-based stats. Identities = 67/156 (42%), Positives = 93/156 (59%), Gaps = 5/156 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPA-LLDACLKVRQQQGDCPTG 63 VV DIT + VD I NAAN L+ G GV GAI RAAG + + +AC ++ TG Sbjct: 21 FSVVHHDITDMQVDAIANAANTDLLMGSGVCGAIFRAAGASRMQEACDRLS----PIRTG 76 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AVIT DLPA+ V+HT GP+WRGG+ NE+ LL+ Y + L + + + TS+AFP IS Sbjct: 77 EAVITPGFDLPARYVIHTAGPLWRGGDHNEEALLRSCYRSCLAIASVHGCTSMAFPLISA 136 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY 159 G+YGYPRA A ++A + ++ + V + Sbjct: 137 GIYGYPRAEALDVAEDEIRYWLKENDSTMDVKLALW 172 >UniRef50_C4FT52 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FT52_9FIRM Length = 263 Score = 171 bits (434), Expect = 7e-42, Method: Composition-based stats. 
Identities = 65/182 (35%), Positives = 94/182 (51%), Gaps = 9/182 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 + +++ QGDIT+LAVD IVNAAN +++G +D IH AG AL AC +++ Sbjct: 73 RPSLYLWQGDITRLAVDAIVNAANSAMLGCFEPNHYCIDNQIHTFAGVALRLACADLKKA 132 Query: 57 QG--DCPTGHAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAANS 112 +G P G A++T +LPAK V+HTVGP LL+ AY L Sbjct: 133 RGGKPLPVGQALMTSGFNLPAKQVIHTVGPRIHHLPVSPMMQDLLKKAYRACLACADQAG 192 Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ++AF ISTG + YP A IA++TVS ++ +V F + + LY LL Sbjct: 193 LATIAFCCISTGEFSYPIEEATPIAIETVSAYLAETGSKLKVIFNVWTDSQYQLYHDLLN 252 Query: 173 QQ 174 + Sbjct: 253 SK 254 >UniRef50_Q97AU0 UPF0189 protein TV0719 n=2 Tax=cellular organisms RepID=Y719_THEVO Length = 186 Score = 171 bits (434), Expect = 8e-42, Method: Composition-based stats. Identities = 71/168 (42%), Positives = 102/168 (60%), Gaps = 4/168 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CPT 62 I +++GDIT + + IVNAANPSLMGGGGVDGAIH G + C ++R+ + P Sbjct: 11 IEIIEGDITDVNCEAIVNAANPSLMGGGGVDGAIHLKGGKTIDLECAELRRTKWPKGLPP 70 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G A IT G L AK V+HTVGP++R G++ + + L +Y SL + + +AFPAIS Sbjct: 71 GEADITSGGKLKAKYVIHTVGPIYR-GQEEDAETLYSSYYRSLEIAKIHGIKCIAFPAIS 129 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 TG+YGYP A+ IA+K V++F++ + FV Y + + L Sbjct: 130 TGIYGYPFEEASVIALKAVTDFLSNKEGYI-IKFVLYGQARYQTFVSL 176 >UniRef50_C8NG26 Appr-1-p processing enzyme family domain protein n=2 Tax=Granulicatella RepID=C8NG26_9LACT Length = 264 Score = 171 bits (433), Expect = 1e-41, Method: Composition-based stats. 
Identities = 66/180 (36%), Positives = 98/180 (54%), Gaps = 9/180 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 +I + GD+ +L VD IVNAAN ++G +D AIH +G L C + ++ Sbjct: 82 NDQIKLYYGDLCELKVDAIVNAANSEMLGCFIPNHRCIDNAIHTFSGIELRTFCHHLMKK 141 Query: 57 QGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGE---QNEDQLLQDAYLNSLRLVAANS 112 QG P G A IT A +LP+K ++HTVGP G+ +QLL Y + L Sbjct: 142 QGKKEPVGKAKITPAFNLPSKYIIHTVGPFLSPGQKVTPLREQLLASCYKSCLEAAREAG 201 Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 TS+AF ISTG +G+P+ AA IA TV++++ A V F Y +E+ +Y++LL+ Sbjct: 202 LTSIAFCGISTGEFGFPKEPAALIAEDTVNKWLQDTASTITVVFSTYTKEDQSIYQKLLS 261 >UniRef50_C4M8N0 Putative uncharacterized protein n=2 Tax=Entamoeba RepID=C4M8N0_ENTHI Length = 627 Score = 171 bits (433), Expect = 1e-41, Method: Composition-based stats. Identities = 65/181 (35%), Positives = 106/181 (58%), Gaps = 9/181 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 ++ + +GDITKL VD IVNAAN L+G +D AIH AGP L C + + Sbjct: 131 SNKLALWKGDITKLCVDAIVNAANNQLLGCFVPHHLCIDNAIHTFAGPQLRRDCSIIMNK 190 Query: 57 QGDC-PTGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYT 114 QG PTG+A +T A +LP+K V+HTVGP+ +++ LL+ +Y+N L + Sbjct: 191 QGFEEPTGYAKVTRAYNLPSKYVIHTVGPIVESQLKESHCNLLRSSYINCLNIADDLHLE 250 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLT 172 S+AF ISTG++G+P+ A+ IA++TV ++ + ++V F + + + +Y + +T Sbjct: 251 SIAFSCISTGLFGFPQNVASVIAIETVINWLYENPFTSIKKVIFDVFSDNDLQIYTKNVT 310 Query: 173 Q 173 + Sbjct: 311 E 311 >UniRef50_B6KFB3 Appr-1-p processing enzyme family domain-containing protein n=3 Tax=Toxoplasma gondii RepID=B6KFB3_TOXGO Length = 817 Score = 170 bits (431), Expect = 2e-41, Method: Composition-based stats. 
Identities = 73/174 (41%), Positives = 106/174 (60%), Gaps = 12/174 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ + +GDIT+L VDVIVNAANPSL+GGGGVDGAIHR AGP L Q G C TG Sbjct: 48 KVVLYRGDITELDVDVIVNAANPSLLGGGGVDGAIHRKAGPQL----RVFNQTLGGCKTG 103 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 + A L K + HTVGP + Q L+ YLN+L L+ + Y ++AFP IST Sbjct: 104 EVKASPAFQLVCKQIFHTVGPR-----GEQSQALRACYLNALELLKRSKYRTIAFPCIST 158 Query: 124 GVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLLTQQ 174 G+YGYP+ AA++ K V++++ + + + F ++ ++ YE+LL++ Sbjct: 159 GIYGYPQLNAAQVVTKCVTKWLKIPANYEAVDFIVFCVFERQDFLFYEQLLSKI 212 >UniRef50_A7B8S3 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7B8S3_9ACTO Length = 270 Score = 170 bits (430), Expect = 2e-41, Method: Composition-based stats. Identities = 71/185 (38%), Positives = 100/185 (54%), Gaps = 15/185 (8%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQ 57 R+ + +GDIT+L VD IVNAAN +L+G +D AIH AAG L AC +V ++ Sbjct: 85 PRMALWRGDITRLEVDAIVNAANSALLGCRAPGHTCIDNAIHSAAGLELRQACAEVMAER 144 Query: 58 GD------CPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAA 110 PTG AV+T LP++ V+HTVGP+ G + L +Y L AA Sbjct: 145 TRGDGPSGFPTGEAVLTPGFHLPSRFVIHTVGPIVNGELTDEHREALACSYQRCLEEAAA 204 Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA---LPEQVYFVCYDEENAHLY 167 + +VAF ISTGV+G+P+ AA IAV TV++F+ +V F + + + LY Sbjct: 205 HGLNTVAFCCISTGVFGFPQEEAARIAVSTVADFLESDTRGASEVRVIFDVFGDHDEALY 264 Query: 168 ERLLT 172 LL Sbjct: 265 RALLR 269 >UniRef50_A0L536 Appr-1-p processing domain protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L536_MAGSM Length = 180 Score = 170 bits (430), Expect = 2e-41, Method: Composition-based stats. 
Identities = 81/173 (46%), Positives = 104/173 (60%), Gaps = 3/173 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--C 60 T + ++ DIT+L +D +VNAAN SL+GG GVDGAIHR G AL AC +R Sbjct: 2 TTLEIILTDITQLPIDGVVNAANNSLLGGMGVDGAIHRVGGTALTQACQALRHTHYPDGL 61 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG AV T AG+LPAK V+HTVGPV+ + + L D Y NSLR S+AFPA Sbjct: 62 ATGAAVATCAGELPAKRVIHTVGPVY-AKDPDPQARLADCYRNSLRCAQEEGLRSIAFPA 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 ISTGVYG+P+ AA IAV T+ + + E+V V + EE+A + L Q Sbjct: 121 ISTGVYGFPKQQAANIAVATLLQALREGVALERVVLVAFSEEDAQILRHALNQ 173 >UniRef50_C2D2Z2 Appr-1-p processing enzyme family domain protein n=1 Tax=Lactobacillus brevis subsp. gravesensis ATCC 27305 RepID=C2D2Z2_LACBR Length = 274 Score = 169 bits (428), Expect = 4e-41, Method: Composition-based stats. Identities = 66/178 (37%), Positives = 95/178 (53%), Gaps = 6/178 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 ++ +I++ QGDIT+LAVD IVN AN ++G G +D IH AG L A K Sbjct: 95 IRPKIYLWQGDITQLAVDAIVNPANSRMLGCFIPNHGCLDNQIHTKAGIQLRLADQKAMA 154 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDAYLNSLRLVAANSYT 114 + TG A +T +LPAK V+HTVGP + QLL D+Y + L+L + Sbjct: 155 GERLEATGKAKLTPGFNLPAKFVIHTVGPVIIHQVTPLRRQLLADSYQSCLKLAEQKDLS 214 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 +AF ISTG + +P AA+IAV TV+++++ H V F + + LY L Sbjct: 215 ELAFCCISTGEFRFPHDLAAQIAVNTVNDYLSSHINAPDVIFAVNSDLDKALYLHELE 272 >UniRef50_A5ZAB5 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A5ZAB5_9FIRM Length = 274 Score = 168 bits (426), Expect = 7e-41, Method: Composition-based stats. 
Identities = 69/186 (37%), Positives = 99/186 (53%), Gaps = 14/186 (7%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + +I + QGD+T+L VD IVNAAN +L+G +D AIH AG L + C K+ Sbjct: 89 LADKISIWQGDMTRLKVDAIVNAANSALLGCFVPCHRCIDNAIHSGAGMELREECNKIMN 148 Query: 56 QQG-------DCPTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRL 107 Q+ + PTG A IT A +LP K V+HTVGP+ + G L++ Y + L Sbjct: 149 QRKIKYGTNYEEPTGTATITEAYNLPCKKVIHTVGPICYFGLNDELCNDLKNCYESVLNC 208 Query: 108 VAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHL 166 A N +VAF ISTG + +P AA IA TV F+ + E+V F Y + + + Sbjct: 209 CAENGLKTVAFCCISTGEFRFPNKEAAVIAKDTVERFLMKKENNIERVIFCVYKDLDREI 268 Query: 167 YERLLT 172 Y++L Sbjct: 269 YDKLYK 274 >UniRef50_A8FSV2 Putative uncharacterized protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FSV2_SHESH Length = 293 Score = 168 bits (425), Expect = 8e-41, Method: Composition-based stats. Identities = 75/179 (41%), Positives = 108/179 (60%), Gaps = 11/179 (6%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQGDC 60 + GDIT+L VD I+NAAN L+G +D IH AAG L D C + +QQG Sbjct: 113 SIWVGDITQLKVDAIINAANVYLLGCRQPNHRCIDNVIHSAAGSRLRDDCATIIEQQGGL 172 Query: 61 -PTGHAVITLAGDLPAKAVVHTVGPVWRGG---EQNEDQLLQDAYLNSLRLVAA-NSYTS 115 PTG A IT LPAK V+HTVGP G ++ +++ L+ AY + L L + N + Sbjct: 173 EPTGSAKITRGYALPAKYVIHTVGPCLHSGYLPDEEDEKQLKSAYQSCLTLASEINDLKT 232 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQ 173 +AF AISTGV+ YP+ AA +A++TVS++++ H E+V F Y + +A +YERL+ + Sbjct: 233 LAFCAISTGVFSYPKIDAASVALETVSDWLSEHPQHFEKVVFNLYTQADAAIYERLIYE 291 >UniRef50_UPI0001B4DEB3 hypothetical protein ShygA5_39675 n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4DEB3 Length = 311 Score = 168 bits (425), Expect = 9e-41, Method: Composition-based stats. 
Identities = 73/180 (40%), Positives = 99/180 (55%), Gaps = 9/180 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 R + QGDIT L D +VNAAN +L+G +D AIH AAGP L C + +Q Sbjct: 127 DRTVLWQGDITTLGADAVVNAANSALLGCFAPMHPCIDNAIHTAAGPRLRADCHTIMTRQ 186 Query: 58 GD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAA-NSYT 114 G PTG A IT LPA+ V+HTVGP+ G D+ L +Y L L A + Sbjct: 187 GHPEPTGTAKITRGYHLPARYVLHTVGPIVDGPLRPVHDRALAASYRACLDLAAEVDGLR 246 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQ 173 +VAF ISTGV+GYPR AA A+ TV++++ H ++V F Y +++ Y LT+ Sbjct: 247 TVAFCGISTGVFGYPRKPAARAALDTVADWLGTHPGRLDRVIFNVYADDDHAAYTHALTE 306 >UniRef50_Q30ZH6 Appr-1-p processing n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30ZH6_DESDG Length = 183 Score = 167 bits (424), Expect = 1e-40, Method: Composition-based stats. Identities = 73/168 (43%), Positives = 104/168 (61%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++QGD+T D +VNAAN L GGGGVDGA+H AAGPALL C + + G P G Sbjct: 10 LEILQGDLTLFKADAVVNAANSRLAGGGGVDGALHAAAGPALLADCSRWVARHGLLPAGK 69 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A++T A LPA+ V+HTVGPVWRGG+ NE+ L+ AY + L +N + VAFPAIS G Sbjct: 70 AMVTPAHRLPARHVIHTVGPVWRGGKNNEETTLRQAYESCFTLCRSNGFAHVAFPAISCG 129 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 YGYP + AA +A+ ++ + P ++ FV + + ++ + Sbjct: 130 TYGYPASPAARVALACAAQALACQGAPAKITFVLHTAQMYTIWLKAAQ 177 >UniRef50_UPI000050FFC7 predicted phosphatase, C-terminal domain of histone macro H2A1 like protein n=1 Tax=Brevibacterium linens BL2 RepID=UPI000050FFC7 Length = 177 Score = 167 bits (424), Expect = 1e-40, Method: Composition-based stats. 
Identities = 76/172 (44%), Positives = 103/172 (59%), Gaps = 5/172 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQ--QQGDCP 61 +I V++GDIT+ +VD IVNAAN SL+GGGGVDGAIH+AAGP LL+AC ++RQ P Sbjct: 2 KITVLEGDITEASVDAIVNAANSSLLGGGGVDGAIHKAAGPELLEACREIRQTSHPRGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T AG L A V+HTVGP GE + ++L+ + SL + A TSVAFPAI Sbjct: 62 AGQAVATSAGALKATWVIHTVGPNRTQGEA-DPEVLESCFEASLNVAAELGATSVAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLL 171 GVYG+ AE A + + R +V FV + + ++ ++ Sbjct: 121 GGGVYGWSARDVAEAAHSVIVDGRERGHWEQVAEVVFVLFSDSMTSVFCQVF 172 >UniRef50_Q460N5 Poly [ADP-ribose] polymerase 14 n=19 Tax=Eutheria RepID=PAR14_HUMAN Length = 1720 Score = 167 bits (424), Expect = 1e-40, Method: Composition-based stats. Identities = 62/175 (35%), Positives = 90/175 (51%), Gaps = 4/175 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V QGD+ +L VDV+VNA+N L GG+ A+ +AAGP L C ++ +++G G+ Sbjct: 723 LIVQQGDLARLPVDVVVNASNEDLKHYGGLAAALSKAAGPELQADCDQIVKREGRLLPGN 782 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGE-QNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A I+ AG LP V+H VGP W G E LL+ A SL L Y S+A PAIS+ Sbjct: 783 ATISKAGKLPYHHVIHAVGPRWSGYEAPRCVYLLRRAVQLSLCLAEKYKYRSIAIPAISS 842 Query: 124 GVYGYPRAAAAEIAVKTVSE---FITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 GV+G+P E V + E F +++Y V E+ + + Sbjct: 843 GVFGFPLGRCVETIVSAIKENFQFKKDGHCLKEIYLVDVSEKTVEAFAEAVKTVF 897 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. 
Identities = 44/175 (25%), Positives = 85/175 (48%), Gaps = 6/175 (3%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPTGHA 65 +V+ + DV+VN+ L+ G + ++ AGP L + V Q G Sbjct: 937 LVKEGVQNAKTDVVVNSVPLDLVLSRGPLSKSLLEKAGPELQEELDTVGQGV-AVSMGTV 995 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 + T + +L + V+H V P WR G + ++++D + + + S S+AFPAI TG Sbjct: 996 LKTSSWNLDCRYVLHVVAPEWRNGSTSSLKIMEDIIRECMEITESLSLKSIAFPAIGTGN 1055 Query: 126 YGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCY--DEENAHLYERLLTQQGD 176 G+P+ AE+ + V +F +++ ++V+F+ + D EN + ++ + Sbjct: 1056 LGFPKNIFAELIISEVFKFSSKNQLKTLQEVHFLLHPSDHENIQAFSDEFARRAN 1110 Score = 115 bits (288), Expect = 7e-25, Method: Composition-based stats. Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 16/172 (9%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 V GDITK DVIVN+ + S GV AI AG + C + QQ+ + Sbjct: 1148 FQVASGDITKEEADVIVNSTSNSFNLKAGVSKAILECAGQNVERECSQQAQQRKN----D 1203 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +IT G L K ++H +G ++ + + L+ +Y+S+ PAI TG Sbjct: 1204 YIITGGGFLRCKNIIHVIG----------GNDVKSSVSSVLQECEKKNYSSICLPAIGTG 1253 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQ 174 AE + + +F+ + + ++V V + + ++ + ++ Sbjct: 1254 NAKQHPDKVAEAIIDAIEDFVQKGSAQSVKKVKVVIFLPQVLDVFYANMKKR 1305 >UniRef50_B0A8R6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B0A8R6_9CLOT Length = 361 Score = 167 bits (424), Expect = 1e-40, Method: Composition-based stats. 
Identities = 56/172 (32%), Positives = 88/172 (51%), Gaps = 5/172 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +++ IT + D IVN N L GGV G+I AG +L+ K ++ G T Sbjct: 3 FEIIRQYITNMKTDAIVNPTNNELKPTGGVCGSIFEKAGYEILE---KKCKKIGYLETTE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AVIT +L K ++HTVGP+W + + LL + Y N L+L + S+AFP IS+G Sbjct: 60 AVITKGYNLDCKYIIHTVGPIWDNAKSDNATLLYNTYTNCLKLAKSKKCNSIAFPLISSG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 +GYP+ A +IA + F+ + + +Y V +D E+ + + L Sbjct: 120 NFGYPKDKALDIATNAIKNFLLENDML--IYLVVFDRESFKINKDLFDSITQ 169 >UniRef50_C5CIT5 Appr-1-p processing domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIT5_KOSOT Length = 187 Score = 166 bits (421), Expect = 2e-40, Method: Composition-based stats. Identities = 52/173 (30%), Positives = 89/173 (51%), Gaps = 4/173 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +VQGDITK VD IVNAAN L GGGV GAI RA G + + ++ ++ G Sbjct: 13 IQIVQGDITKEEVDAIVNAANGYLRHGGGVAGAILRAGGKIIQEESDRIIRKNGPLEVSE 72 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T AG L K ++H GP + G++N ++LL +++LN+ + +++ PA+S+G Sbjct: 73 VAVTGAGSLHPKYIIHVHGPRY--GQENVEELLYESFLNAFKTAGKLGVKTLSVPAVSSG 130 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQQG 175 ++G P+ A + V + + + D ++E++ + Sbjct: 131 IFGVPKDLCARCFFRAVEYYFENYKDTPLSLIRVCNIDRATTEVFEKVSEEFF 183 >UniRef50_Q87JZ5 UPF0189 protein VPA0103 n=5 Tax=Proteobacteria RepID=Y4103_VIBPA Length = 170 Score = 166 bits (421), Expect = 3e-40, Method: Composition-based stats. 
Identities = 85/175 (48%), Positives = 106/175 (60%), Gaps = 6/175 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-D 59 M I +VQGDIT VD IVNAANP ++GGGGVDGAIHRAAGPAL++AC V G Sbjct: 1 MNA-ISLVQGDITTAHVDAIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDVDGIR 59 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 CP G A IT AG+L A+ V+H VGP++ + +L+ AY SL L AN SVA P Sbjct: 60 CPFGDARITEAGNLNARYVIHAVGPIYD-KFADPKTVLESAYQRSLDLALANHCQSVALP 118 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 AIS GVYGYP AAE+A+ +A + F + EE +++ LTQ Sbjct: 119 AISCGVYGYPPQEAAEVAMAVCQR--PEYAALDM-RFYLFSEEMLSIWQHALTQH 170 >UniRef50_Q17432 Protein B0035.3, confirmed by transcript evidence n=3 Tax=Chromadorea RepID=Q17432_CAEEL Length = 203 Score = 166 bits (420), Expect = 3e-40, Method: Composition-based stats. Identities = 71/171 (41%), Positives = 95/171 (55%), Gaps = 7/171 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPT 62 RI V GDITKL+VD IVNAAN L GGGGVDGAIHRAAG L + C + C Sbjct: 25 RISVWDGDITKLSVDAIVNAANSRLAGGGGVDGAIHRAAGRKQLQEECQQYN----GCAV 80 Query: 63 GHAVITLAGDLPA-KAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPA 120 G AVIT ++ K ++HTVGP G +E + L Y SL + N S+AF Sbjct: 81 GDAVITSGCNINHIKKIIHTVGPQVYGNVTDERRENLVACYRTSLDIAIENGMKSIAFCC 140 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 ISTGVYGYP AA+ ++E++ ++ E++ V + + + Y + Sbjct: 141 ISTGVYGYPNDDAAKTVTNFLTEYLEKNDTIERIVLVTFLDIDNEHYNKYF 191 >UniRef50_Q22CT8 Appr-1-p processing enzyme family protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22CT8_TETTH Length = 472 Score = 166 bits (420), Expect = 3e-40, Method: Composition-based stats. 
Identities = 54/164 (32%), Positives = 82/164 (50%), Gaps = 2/164 (1%) Query: 15 LAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLP 74 VD IVNAAN L GGGV GAI R G + + + + + G +V T AG LP Sbjct: 2 ENVDAIVNAANNFLAHGGGVAGAICRKGGRIIQNQSYDIIKIRNRIENGESVTTEAGQLP 61 Query: 75 AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAA 134 K V+HTVGP+W G+ NE + L LR S++ PAIS+G++G+P+ A Sbjct: 62 CKKVIHTVGPIWEDGDSNEKEELAKCMETILREAKFYKLKSISIPAISSGIFGFPKYLCA 121 Query: 135 EIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQGD 176 +I ++ + + E++ F +D E ++ +Q Sbjct: 122 KILLEETQKLLKYDYSNQFEEIRFCNFDNETVQVFAEEFQKQFQ 165 >UniRef50_B9WC14 Putative uncharacterized protein n=5 Tax=Candida RepID=B9WC14_CANDC Length = 564 Score = 165 bits (419), Expect = 4e-40, Method: Composition-based stats. Identities = 64/184 (34%), Positives = 97/184 (52%), Gaps = 14/184 (7%) Query: 2 KTRIHVVQGDITKL-AVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + + +GDIT L V IVNAAN +L+G +D IH AAGP L AC + Q Sbjct: 90 NATVSLWKGDITTLTDVTAIVNAANSTLLGCFQPRHKCIDNVIHIAAGPDLRQACYNLMQ 149 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVA---A 110 + + PTG A IT +LPAK V+HTVGP+ + E + L Y +SL + Sbjct: 150 SKSE-PTGSAKITPGFNLPAKYVIHTVGPIIHNESVTKREQEQLASCYQSSLEALEMLND 208 Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYE 168 S+AF +STG++ +P+ A+ IA+ TV +++ H + + + F + E+ +YE Sbjct: 209 EKDKSIAFCCVSTGLFAFPKELASTIAINTVHDYLKTHPNSTIKHIVFNVFSNEDKEVYE 268 Query: 169 RLLT 172 L Sbjct: 269 NNLQ 272 >UniRef50_Q5XC09 UPF0189 protein M6_Spy0919 n=20 Tax=Streptococcus RepID=Y919_STRP6 Length = 270 Score = 165 bits (419), Expect = 4e-40, Method: Composition-based stats. 
Identities = 71/185 (38%), Positives = 97/185 (52%), Gaps = 11/185 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 T + + GDI LAVD IVNAAN L+G G +D AIH AG L AC + +Q Sbjct: 84 TSLFLYHGDIRYLAVDAIVNAANSELLGCFIPNHGCIDNAIHTFAGSRLRLACQAIMTEQ 143 Query: 58 GDCP-TGHAVITLAGDLPAKAVVHTVGPVW---RGGEQNEDQLLQDAYLNSLRLVAANSY 113 G G A +T A LPA ++HTVGP R LL Y +SL L Sbjct: 144 GRKEAIGQAKLTSAYHLPASYIIHTVGPRITKGRHVSPIRADLLARCYRSSLDLAVKAGL 203 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLL 171 TS+AF +ISTG +G+P+ AA+IA+KTV ++ H + V F + E+ LY+ L Sbjct: 204 TSLAFCSISTGEFGFPKKEAAQIAIKTVLKWQAEHPESKTLTVIFNTFTSEDKALYDTYL 263 Query: 172 TQQGD 176 ++ + Sbjct: 264 QKENN 268 >UniRef50_A8H4N3 Appr-1-p processing domain protein n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H4N3_SHEPA Length = 304 Score = 165 bits (419), Expect = 4e-40, Method: Composition-based stats. Identities = 66/180 (36%), Positives = 99/180 (55%), Gaps = 9/180 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQ 56 T+I + +GDIT LAVD IVNAAN ++G +D AIH AG L C + + Sbjct: 119 DTKIILWKGDITTLAVDAIVNAANNQMLGCFQPQHKCIDNAIHNRAGAQLRADCEVIMEL 178 Query: 57 QGDCP-TGHAVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAA-NSY 113 QG+ TG A IT A +LP+K V+HTVGP+ + Q L +Y + L L Sbjct: 179 QGNIEETGIAKITRAYNLPSKFVIHTVGPIVQNMIQPIHAGQLASSYRSILTLAKQTERI 238 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLT 172 S+AF +ISTG++GYP A +A+ TV++++ + + + F + E + H+Y+ L Sbjct: 239 RSLAFCSISTGIFGYPIEQATRVALDTVTQWLMENPDQFDTIVFNVFSEYDHHVYQSALE 298 >UniRef50_B0EH33 Putative uncharacterized protein n=2 Tax=Entamoeba RepID=B0EH33_ENTDI Length = 348 Score = 165 bits (418), Expect = 6e-40, Method: Composition-based stats. 
Identities = 67/178 (37%), Positives = 96/178 (53%), Gaps = 8/178 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGD 59 I V +GDITKL +D IVNAAN +L+G VD IH AG L C +++ Sbjct: 93 IRVWKGDITKLKIDSIVNAANNTLVGCFIPLHSCVDSIIHERAGVQLRYECSQLKTAYKA 152 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 T IT +LPAK V+H VGP+ + + LLQ YLN L + TS+ F Sbjct: 153 TTT-TTEITKGYNLPAKYVIHVVGPIVDTLKPKDSYLLQQCYLNCLNKAIESGCTSIGFC 211 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTG++G+P AA+IA++TV+ F+ H + V F + E + ++Y LL ++ Sbjct: 212 CISTGMFGFPNEEAAQIAIQTVNNFLKDHQI--DVVFCVFKEIDYNIYTSLLNDGFNQ 267 >UniRef50_UPI0000E4D641 UPI0000E4D641 related cluster n=2 Tax=Danio rerio RepID=UPI0000E4D641 Length = 692 Score = 165 bits (417), Expect = 7e-40, Method: Composition-based stats. Identities = 58/177 (32%), Positives = 91/177 (51%), Gaps = 4/177 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + V + DI L+VD +VNAAN L GGGV A+ +AAG L + C + G Sbjct: 1 VTVTVRKADICTLSVDAVVNAANEDLQHGGGVAYALLQAAGRCLQEYCDLHIKVNGPLTP 60 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNED--QLLQDAYLNSLRLVAANSYTSVAFPA 120 G A+IT AG LP K VVH VGP +R +++ Q L+ A SL ++ +S+A P Sbjct: 61 GDAIITDAGRLPCKYVVHAVGPRFRASDRHTAVQQCLRRAVRESLNQASSKKCSSIAIPV 120 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLTQQG 175 IS+G++G P E K V ++I + ++ V +++N + + + + Sbjct: 121 ISSGIFGCPLDLCTESITKEVRQYIENWPSSTLTEIQLVDNNDKNVNAMAQAVRNEF 177 Score = 99.7 bits (247), Expect = 3e-20, Method: Composition-based stats. 
Identities = 38/160 (23%), Positives = 62/160 (38%), Gaps = 14/160 (8%) Query: 12 ITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLA 70 + L DVIVN + + + G V A+ +AAG L + G VIT Sbjct: 209 LYHLQADVIVNTISEDMDLRKGAVSNALLQAAGHQLQSEIKRASNH------GEIVITDG 262 Query: 71 GDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPR 130 +L V H + ++L N L+ +SV FPAI TG G+P+ Sbjct: 263 YNLKCSRVFHVMIIYLF----TLQKVLNQIIRNCLKNAETQGLSSVVFPAIGTGNLGFPK 318 Query: 131 AAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 A+ + V +F +V V + + ++ + Sbjct: 319 DLVAKNMLTEVQQF--NTTNLRKVTVVVH-PSDKEIFRAV 355 >UniRef50_UPI0000E80997 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=3 Tax=Gallus gallus RepID=UPI0000E80997 Length = 1655 Score = 165 bits (417), Expect = 8e-40, Method: Composition-based stats. Identities = 55/177 (31%), Positives = 90/177 (50%), Gaps = 4/177 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + V +G++ VDV+VNAA+ L G A+ +AAGP L C +V + G Sbjct: 642 TELLVYKGNLCNYPVDVVVNAASEDLRHTDGFAWALLQAAGPELQAECDEVVRMTGSLQA 701 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AVIT AG LP K V+H +GP W+ LL +A SL+L ++ S+AFP++ Sbjct: 702 GDAVITGAGKLPCKQVIHAIGPQWKEKNSGKCMYLLMEAIKKSLQLAETYNHRSIAFPSV 761 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQG 175 S G++G+P V + + + + ++++ V DEE + + ++ Sbjct: 762 SGGIFGFPPHKCVNAIVSAIKKTLEEFKRDSSLKEIHLVAVDEETVRVLRETVQKEF 818 Score = 131 bits (330), Expect = 9e-30, Method: Composition-based stats. 
Identities = 49/174 (28%), Positives = 71/174 (40%), Gaps = 6/174 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-GGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 RI V + DI DVIVN+ L G G + A+ + AGP L K + Q Sbjct: 866 RIQVEKKDIIDATTDVIVNSVGTDLKFGVGPLCRALLKEAGPELQMEFDKE-KGQQVAGN 924 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G V T L V H V P W G + L++ L S+AFPAI Sbjct: 925 GSVVCTKGYILDCTFVFHAVLPQWDRGSGQALKTLENTVHKCLMKAEEFGLKSIAFPAIG 984 Query: 123 TGVYGYPRAAAAEIAVKTVSEF--ITRHALPEQVYFVCY--DEENAHLYERLLT 172 TG + +P +++ V +F ++V+FV + D +N + L Sbjct: 985 TGGFSFPHTVVSKLMFDEVFKFSRCQSRKTLQEVHFVLHPNDRQNIQAFTSELK 1038 Score = 106 bits (265), Expect = 3e-22, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 75/177 (42%), Gaps = 16/177 (9%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + V GDITK +VIVN AN + GV AI AAG + + C + G Sbjct: 1072 SVTLKVTSGDITKEDTEVIVNIANQTFDATSGVFKAIMDAAGFDVKEECNQY---GGLLQ 1128 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 +G + T G L + ++H + +++ L +Y SVAFPAI Sbjct: 1129 SG-FITTKGGALLCRRIIHLI----------HSMNVKNQVSEVLHECQLRTYKSVAFPAI 1177 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQGD 176 TG A A+ + + EF++ + +++ + + + + + + ++ D Sbjct: 1178 GTGAAQQSPAKVADDMLDAIVEFVSSRSVPHLKEIRIIIFQKHMLRDFLQSMKKRED 1234 >UniRef50_C1QBX0 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBX0_9SPIR Length = 257 Score = 164 bits (416), Expect = 1e-39, Method: Composition-based stats. 
Identities = 57/178 (32%), Positives = 98/178 (55%), Gaps = 7/178 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 ++ +++ QGDIT L +D +VNAAN S++G +D AIH A+G L CL Sbjct: 81 IRDNLYLWQGDITTLNIDAVVNAANSSMLGCFIPLHKCIDNAIHSASG-TRLRLCLNNIM 139 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAANSYT 114 + +G +IT A +LP++ ++HTVGP+ + + +++LL + Y + L N+ Sbjct: 140 KGKTEDSGQCIITKAFNLPSRYILHTVGPIIQNSVSKKDEELLYNCYKSCLETAKENNIK 199 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 S+AF ISTG + +P A++IAV V +F+ ++ F + + + LY +L Sbjct: 200 SIAFCCISTGEFKFPNKEASQIAVNAVKDFLNNSKYDIKIVFNVFKDLDYELYYDILK 257 >UniRef50_B1KG04 Appr-1-p processing domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KG04_SHEWM Length = 296 Score = 164 bits (415), Expect = 1e-39, Method: Composition-based stats. Identities = 65/181 (35%), Positives = 97/181 (53%), Gaps = 10/181 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 ++I + GDIT+L +D + NAAN ++G +D AI+ AAGP L + C ++ Q Q Sbjct: 110 SKISIWNGDITRLKIDAVTNAANAQMLGCFQPFHSCIDNAINCAAGPQLREDCNQLMQLQ 169 Query: 58 G-DCPTGHAVITLAGDLPAKAVVHTVGPVWRGG---EQNEDQLLQDAYLNSLRLVAANSY 113 G D TG A IT A +LP+K V+HTVGP+ + G + L Y L L A Sbjct: 170 GSDETTGSAKITRAYNLPSKFVLHTVGPIIQHGAVPSPRQIDELASCYDACLSLAAEAGA 229 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSE-FITRHALPEQVYFVCYDEENAHLYERLLT 172 SVA ISTGV+GYP AA +A++ V+ F+ + + F + + +Y R + Sbjct: 230 QSVAVCGISTGVFGYPAEKAANVALQAVANWFLVNPDKLDHLVFNTFGDNATEIYHRAIG 289 Query: 173 Q 173 + Sbjct: 290 E 290 >UniRef50_A8JCH3 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8JCH3_CHLRE Length = 160 Score = 163 bits (412), Expect = 3e-39, Method: Composition-based stats. 
Identities = 71/148 (47%), Positives = 92/148 (62%), Gaps = 3/148 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG--DC 60 T++ + QGDIT VD IVNAAN ++GGGGVDGAIHRAAGP L+ AC +V + C Sbjct: 12 TKLVIKQGDITVEDVDAIVNAANERMLGGGGVDGAIHRAAGPQLVRACAEVPEVYPGVRC 71 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 PTG A IT L A+ V+HTVGP++ ++ LL AY +S+ L A S++FP Sbjct: 72 PTGEARITPGFHLKARHVIHTVGPIYH-NDRVSAPLLASAYRSSVELAAQQGLASLSFPG 130 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH 148 ISTGV+GYP AA++ V T H Sbjct: 131 ISTGVFGYPWDKAAQVRVHTTHGHPRSH 158 >UniRef50_C3Y5X0 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y5X0_BRAFL Length = 970 Score = 163 bits (412), Expect = 3e-39, Method: Composition-based stats. Identities = 60/177 (33%), Positives = 87/177 (49%), Gaps = 6/177 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 +I V +GDIT+ VDVI NAAN L G GV GAI RA GP++ + G Sbjct: 571 QIVVARGDITQQPVDVIANAANEYLSHGSGVAGAISRAGGPSVQQESSYHVKTFGRVRVT 630 Query: 64 HAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAA-NSYTSVAFPAI 121 V+T G LP K ++H VGP W RG E ++ L+ N L +A SVA PAI Sbjct: 631 ETVVTRGGQLPCKHIIHAVGPRWERGHENENERQLRQTCYNILTAASATLRARSVAIPAI 690 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA----LPEQVYFVCYDEENAHLYERLLTQQ 174 S+G++G P+ AE V + F+ ++ F+ D+ ++ ++ Sbjct: 691 SSGIFGMPKQKCAESLVSGLERFLQTAKVSSCTLRRIIFIDMDQATVNILADTFGKK 747 >UniRef50_C7GZB8 Appr-1-p processing enzyme family domain protein n=3 Tax=Bacteria RepID=C7GZB8_9FIRM Length = 268 Score = 163 bits (412), Expect = 3e-39, Method: Composition-based stats. 
Identities = 71/185 (38%), Positives = 102/185 (55%), Gaps = 14/185 (7%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 +K + V QGDIT+L VD IVNAAN ++G +D IH AG L + C + + Sbjct: 83 IKDNLSVWQGDITRLKVDAIVNAANSQMLGCFIPLHTCIDNQIHTFAGIQLREECDQKME 142 Query: 56 QQGD-------CPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRL 107 + + PT ++T +LPAK VVH VGP+ GG + ++ L D Y N+L + Sbjct: 143 KLREKYGRDYEQPTAIPMLTEGYNLPAKKVVHIVGPIVSGGLTSDLEKDLADCYTNTLDM 202 Query: 108 VAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHL 166 N+ SV F ISTGV+ +P AAEIAVKTV ++ H+ E++ F + +E+ Sbjct: 203 CMENNLKSVVFCCISTGVFHFPNKRAAEIAVKTVGKWCEAHSYSLERIIFNVFKDEDKKY 262 Query: 167 YERLL 171 YE LL Sbjct: 263 YEELL 267 >UniRef50_A1L291 LOC799852 protein (Fragment) n=5 Tax=Danio rerio RepID=A1L291_DANRE Length = 458 Score = 162 bits (411), Expect = 3e-39, Method: Composition-based stats. Identities = 60/183 (32%), Positives = 93/183 (50%), Gaps = 10/183 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I V + D+T+ V+ +VNAAN L GGG+ A+ A GP + + ++ G T Sbjct: 70 VEISVWKDDLTQHKVEAVVNAANEKLQHGGGLAQALSMAGGPQIQRWSDDIIKRYGYVKT 129 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-----QLLQDAYLNSLRLVAANSYTSVA 117 G AV+T AG+LP K ++H VGP ++ LL +A + L+ V + TSVA Sbjct: 130 GEAVLTPAGNLPFKYIIHAVGPKVPQNPTQKEIGDATPLLYNAITSILQTVLRENITSVA 189 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE----QVYFVCYDEENAHLYERLLTQ 173 PA+S+G++ +PR A+I VK + F H + +++ V DE + ER Sbjct: 190 IPALSSGLFNFPRDRCADIIVKAIKTF-HDHGGFQGRNLEIHLVNNDEPSVQEMERATRA 248 Query: 174 QGD 176 D Sbjct: 249 IFD 251 Score = 103 bits (258), Expect = 2e-21, Method: Composition-based stats. 
Identities = 45/174 (25%), Positives = 78/174 (44%), Gaps = 7/174 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDC 60 +++ +G I VDV+VN P G + AI + AG + + K + Sbjct: 282 NIILYLKRGAIEDEMVDVLVNTIAPDCKLHQGVISRAILKKAGDEIQNEIYKKKSNTSFY 341 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 + T +L K+V HTV + +++L + L SL+ A Y S++FPA Sbjct: 342 SSKVLYKTKGYNLYCKSVFHTVC--AHRSDSKSNEILFNVVLESLKKAAE-DYESISFPA 398 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCY--DEENAHLYERLL 171 I TG + + A+I + V+EF ++ + VYFV + D + +E + Sbjct: 399 IGTGNLDFKKWEVAKIMMDAVAEFAKQNKRKKLDVYFVVFPKDNDMMKAFENEM 452 >UniRef50_C9XM94 Putative uncharacterized protein n=6 Tax=Clostridium RepID=C9XM94_CLODC Length = 286 Score = 162 bits (411), Expect = 3e-39, Method: Composition-based stats. Identities = 68/178 (38%), Positives = 95/178 (53%), Gaps = 10/178 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQGD 59 I + +G+IT L D IVNAAN L+G VD IH AGP L + C K+ ++QG Sbjct: 109 IAIWRGNITNLRADAIVNAANNKLLGCLQPLHLCVDNEIHSCAGPRLREDCDKIIKKQGH 168 Query: 60 CP-TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ--LLQDAYLNSLRLVAA-NSYTS 115 TG A IT LPAK VVHTVGP+ GG+ +++Q L Y + L + + + Sbjct: 169 LEYTGDAKITRGYCLPAKFVVHTVGPIVSGGQPSKEQEKQLLHCYKSCLNTIKEIDEIKN 228 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLLT 172 + F ISTGV+GYP+ AA +AV V ++ + +V F + EE Y R+ Sbjct: 229 IVFCGISTGVFGYPKKEAANLAVSRVRLWLKENPEKNLKVVFNVFTEEEEEKYRRIFK 286 >UniRef50_D1BM15 Appr-1-p processing domain protein n=15 Tax=Bacteria RepID=D1BM15_VEIPT Length = 259 Score = 162 bits (411), Expect = 4e-39, Method: Composition-based stats. 
Identities = 66/176 (37%), Positives = 94/176 (53%), Gaps = 7/176 (3%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 +I++ QGDIT+LAV IVNAAN L+G +D AIH AG L AC ++ + Sbjct: 83 PQIYLWQGDITRLAVKAIVNAANEQLLGCFLPNHKCIDNAIHTFAGIELRMACARMTEYM 142 Query: 58 GD-CPTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANSYTS 115 TG A +T +LPA V+HTVGP+ + E + L Y + L L A S S Sbjct: 143 DMPEKTGVARMTYGFNLPASHVIHTVGPIVYDTVTDLEKEQLSSCYRSCLELANAYSLKS 202 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +AF ISTG + +P AA+IA+ TV ++ QV F + + + +Y +LL Sbjct: 203 IAFCCISTGEFRFPNELAAQIAIDTVRRYLKETNSKIQVVFNVFKDIDYDIYNKLL 258 >UniRef50_C3YH95 Putative uncharacterized protein n=2 Tax=Eumetazoa RepID=C3YH95_BRAFL Length = 437 Score = 162 bits (410), Expect = 4e-39, Method: Composition-based stats. Identities = 50/172 (29%), Positives = 85/172 (49%), Gaps = 6/172 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 ++ + +GDIT L IVN N +L + I +AAGP L C + C Sbjct: 45 NRKVVLWEGDITTLNCTAIVNTTNETLTDRNLISERIFQAAGPDLRAECSNHLKT---CR 101 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPA 120 TG A +T +LPA+ ++HTVGP + + + L + Y NSL++ N+ S+ Sbjct: 102 TGEAKMTKGYNLPARYIIHTVGPRYNVKYRTAAESALFNCYRNSLQIARENNLQSIGLCV 161 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLL 171 ++ GYP A IA++TV F+ ++ + E + F + + +Y R++ Sbjct: 162 VNQPKRGYPPDEGAHIALRTVRRFLEKYDSSLETIVFAV-TDNDEDIYRRVM 212 >UniRef50_A7T167 Protein GDAP2 homolog n=1 Tax=Nematostella vectensis RepID=GDAP2_NEMVE Length = 502 Score = 162 bits (410), Expect = 5e-39, Method: Composition-based stats. 
Identities = 62/173 (35%), Positives = 94/173 (54%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDITKLA D IVN N SL G + +HRAAGP L+ C RQQ C Sbjct: 49 INAKVVLWNGDITKLAADAIVNTTNESLSDRGALSERVHRAAGPELMQEC---RQQLLGC 105 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A I+ +LPA+ V+HTVGP + + + L Y N++RLV N +++ Sbjct: 106 RTGEAKISEGYNLPARYVIHTVGPRYNTKYKTAAESALFSCYRNTMRLVRENKISTIGVC 165 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLL 171 ++T GYP A IA++TV F+ ++ + + V FV E +Y +++ Sbjct: 166 VVNTTKRGYPPEDGAHIALRTVRRFLEKYGSAVDTVAFVVEGAEAV-VYAKVM 217 >UniRef50_A2FMC7 Appr-1-p processing enzyme family protein n=1 Tax=Trichomonas vaginalis RepID=A2FMC7_TRIVA Length = 361 Score = 162 bits (410), Expect = 5e-39, Method: Composition-based stats. Identities = 61/173 (35%), Positives = 92/173 (53%), Gaps = 13/173 (7%) Query: 1 MKTRIHVV-QGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + +I +G+ KL D +VNAAN L GGG+ G +H AAG A+ C ++ G Sbjct: 114 INEKISFWMRGNSVKLECDAVVNAANSHLYPGGGICGVLHSAAGEAMERECSEI----GY 169 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 PTG +TL +LPAK +HTVGP+ + LQ+AY ++L + SV Sbjct: 170 TPTGKCAVTLGYNLPAKYCIHTVGPI-----GEQPDKLQEAYESTLSCIDGKKIRSVGLC 224 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHAL---PEQVYFVCYDEENAHLYER 169 ISTG+YGYP A IA+K V +F+ +++ FV ++ + +Y+R Sbjct: 225 CISTGIYGYPIENATPIALKVVRKFLEDPNNREKTDRIIFVVFERRDVVVYDR 277 >UniRef50_A9SRF5 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9SRF5_PHYPA Length = 207 Score = 162 bits (409), Expect = 6e-39, Method: Composition-based stats. 
Identities = 71/178 (39%), Positives = 99/178 (55%), Gaps = 9/178 (5%) Query: 5 IHVVQGDITKL----AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG-D 59 + + +GDITK D IVNAAN ++GGGGVDGAIH AAG LL+A K+ +G Sbjct: 30 LVLQRGDITKWHIDGKTDAIVNAANERMVGGGGVDGAIHAAAGKQLLEATKKIPISEGVR 89 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 CP G AV+T LP ++HTVGP++ E N LL A+ S+RL N +AFP Sbjct: 90 CPVGSAVLTPGFKLPVSKIIHTVGPIYY-IEGNPASLLAKAHKESVRLATENGLKYIAFP 148 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 AIS GVYGYP AAEI+++++ +V+FV + + + ++ Sbjct: 149 AISCGVYGYPIEEAAEISIQSLR---ESAGELLEVHFVHFQAATYRAWLAEAKVKLEK 203 >UniRef50_C1SPD7 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SPD7_9BACT Length = 177 Score = 162 bits (409), Expect = 7e-39, Method: Composition-based stats. Identities = 56/175 (32%), Positives = 88/175 (50%), Gaps = 6/175 (3%) Query: 1 MKTRI-HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + + + DITK D IVN AN L GGV GAI G ++ + C ++ G Sbjct: 7 INGTVLEIALRDITKQTTDAIVNPANRQLKMTGGVAGAIAAKGGRSIQEECDEI----GS 62 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 CP G AV+T AG L ++H VGP + G + ++ L+ A + S+ L N+ + +A P Sbjct: 63 CPLGEAVMTGAGFLKTTYIIHAVGPRY-GVDPEPEKYLKSAVMKSIELADKNNLSDIAIP 121 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 AIS G++GYP AAE+ + V E I ++ + E + ++ L + Sbjct: 122 AISAGIFGYPLEDAAEVIISAVIEKILSGTKLNKILLCLFTENDYMVFINTLDRL 176 >UniRef50_C4DDL7 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DDL7_9ACTO Length = 224 Score = 161 bits (408), Expect = 8e-39, Method: Composition-based stats. 
Identities = 70/169 (41%), Positives = 97/169 (57%), Gaps = 6/169 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 RI +V+GDIT VD ++NAAN SLMGGGGVDGAIHR GP +LD C K+R P Sbjct: 2 RIELVKGDITTQDVDALINAANSSLMGGGGVDGAIHRKGGPTILDECRKLRDSHYPKGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G A+ T AG+LPA+ ++HTVGPV+ + + + L+ Y NSL + T++A P I Sbjct: 62 EGQAIATTAGNLPAQWIIHTVGPVYSRHD-DRTETLRACYRNSLTIADTLGATTLAVPLI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 S+G+YG+P+ A AV + T + + E + RL Sbjct: 121 SSGIYGWPKDDAIRQAVDVLQ---TTPTSVTLARIMLFSSEEVSVATRL 166 >UniRef50_A9WK70 Appr-1-p processing domain protein n=3 Tax=Chloroflexus RepID=A9WK70_CHLAA Length = 190 Score = 161 bits (408), Expect = 9e-39, Method: Composition-based stats. Identities = 71/175 (40%), Positives = 95/175 (54%), Gaps = 7/175 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPTG 63 + VV+GDI VD IVNAAN L+ GGGV GAI RAAG L AC V CPTG Sbjct: 15 LEVVEGDIVSQQVDAIVNAANEQLLQGGGVCGAIFRAAGAAELQRACDAVA----PCPTG 70 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPAIS 122 A IT LPA+ ++H VGP++ +E LL AY SL L S+AFP+I+ Sbjct: 71 EARITPGFALPARYIIHAVGPIFDHYAPSEADRLLISAYRASLALARQYGLQSIAFPSIA 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 TG+YG+P AA + ++T+ + + H P V V + + +Y + T E Sbjct: 131 TGIYGFPVTRAAPLVLQTLIDDLHTHQAPGLVRMVLW-RDTFPVYRDVFTHMQSE 184 >UniRef50_UPI0000F2CC13 PREDICTED: similar to B aggressive lymphoma long n=1 Tax=Monodelphis domestica RepID=UPI0000F2CC13 Length = 1624 Score = 161 bits (407), Expect = 9e-39, Method: Composition-based stats. 
Identities = 48/176 (27%), Positives = 86/176 (48%), Gaps = 5/176 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V + D+T+ D +VNAAN L+ GG+ A+ RA GP + + Q+G+ PT Sbjct: 100 ELSVWKDDLTRHPADAVVNAANERLLHAGGLALALVRAGGPLIEKESEAIIMQRGEVPTS 159 Query: 64 HAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLV--AANSYTSVAFPA 120 +T G LP ++H VGP W + Q L+ A N L V ++ +VA PA Sbjct: 160 EIAVTTGGQLPCSCIIHAVGPRWSDWNAERCCQELERATANILNYVTNDSHGIKTVAIPA 219 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEF--ITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 +S+G++G+P +I + T+ + + ++++ V +E +++ Sbjct: 220 LSSGIFGFPLELCVQIIILTIVRCPLLQSSKVLKEIHLVSNEEPTVAAFKKACENI 275 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 9/180 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAAN-PSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 T + +++G I K VDVIVN+ + + G V AI AGP + + K + Sbjct: 293 NTNLQIIEGFIEKQQVDVIVNSISASNSFDLGKVSNAILIHAGPEIEEEFSKTYSGMSE- 351 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 + V+T +L K V H V ++L++A + L + S++FPA Sbjct: 352 SSKLVVVTEGFNLACKHVYHVV----WPSSYQTKKVLKEAVMRCLEKTCQENMNSISFPA 407 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCY--DEENAHLYERLLTQQGDE 177 + TG G P+ A I +K + +F H V FV Y D E + + L + + Sbjct: 408 LGTGNIGLPKREAISIMLKEIFQFSKNHPQKRLLVNFVVYPNDNELYEVMKSELDKMITQ 467 >UniRef50_A1D5K4 Appr-1-p processing enzyme family protein n=1 Tax=Neosartorya fischeri NRRL 181 RepID=A1D5K4_NEOFI Length = 257 Score = 161 bits (407), Expect = 1e-38, Method: Composition-based stats. 
Identities = 69/171 (40%), Positives = 94/171 (54%), Gaps = 5/171 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T + ++ DI +L VD IVNAA SL GGGGVD A+H AAGP L AC+K Q + C Sbjct: 88 LNTLVSFIEHDIARLQVDCIVNAAKESLQGGGGVDRAMHLAAGPKLNQACIKKLQDR-QC 146 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G +T L K+V+HTVGP R +Q + Q+L+ Y NSL + S+ FP Sbjct: 147 SPGRVFMTPGFHLRCKSVIHTVGPDCRQKQQIDYAQVLRQCYRNSLNKAVSKGLRSIVFP 206 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH---ALPEQVYFVCYDEENAHLY 167 AIS GVY P A +EIA+ TV F+ H + +++ F +Y Sbjct: 207 AISVGVYACPAEATSEIALNTVRGFLDEHGRPSSLDRIGFCNLGPNIHAIY 257 >UniRef50_D0NNH8 Putative uncharacterized protein n=3 Tax=Phytophthora infestans T30-4 RepID=D0NNH8_PHYIN Length = 287 Score = 161 bits (407), Expect = 1e-38, Method: Composition-based stats. Identities = 58/180 (32%), Positives = 86/180 (47%), Gaps = 5/180 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + V+QGD+T D IVNAAN LM GGG+ GAI R+ G ++ K + G Sbjct: 49 PELLVMQGDLTCCKADAIVNAANTRLMHGGGLAGAIVRSGGSSIQQESSKWVKDHGPLTV 108 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGG--EQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G AV T AG L + V+HTVGP L+ A ++L SVA P Sbjct: 109 GDAVTTAAGKLTCQHVIHTVGPNVGSETLTSEHATQLRHAVWSALLEADRLKVKSVAVPG 168 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTG++GYPR A+ V ++F +++ + D+ + + +T + +E Sbjct: 169 ISTGIFGYPRDLGAKEIVTEAAKFCKEKAGSTALKRIALMNIDDPTVKSFVKAVTDEMEE 228 >UniRef50_B9YC00 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YC00_9FIRM Length = 182 Score = 160 bits (406), Expect = 1e-38, Method: Composition-based stats. 
Identities = 81/181 (44%), Positives = 105/181 (58%), Gaps = 12/181 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + GDIT + ++IVNAAN SL+GGGGVDG IHR AGP LL C + C TG Sbjct: 2 ITFIHGDITSVPAEIIVNAANRSLLGGGGVDGVIHRKAGPQLLAECRTL----HGCETGQ 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTS------VAF 118 A +T A DL + ++HTVGPVW GG E LL Y SLRL + F Sbjct: 58 AKVTKAYDLSCRWIIHTVGPVWSGGRHQEVDLLASCYQQSLRLARQLQKEHRLSSLTIVF 117 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ--VYFVCYDEENAHLYERLLTQQGD 176 P ISTG+Y +P+A A IAV TV + ++ ++ V F CY+ E+A LY+R L +GD Sbjct: 118 PCISTGIYHFPKALACSIAVDTVRDTLSELQAEKEIDVIFCCYESEDAQLYKRQLDNKGD 177 Query: 177 E 177 + Sbjct: 178 Q 178 >UniRef50_D1VVA5 Putative uncharacterized protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVA5_9FIRM Length = 163 Score = 160 bits (406), Expect = 1e-38, Method: Composition-based stats. Identities = 69/171 (40%), Positives = 98/171 (57%), Gaps = 11/171 (6%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLD-ACLKVRQQQGD 59 + ++ ++ K+ VD IVNAAN L+ GGGV GAI + A L+ C K+ G Sbjct: 3 LNIKLE----NLVKMDVDAIVNAANKELLPGGGVCGAIFQVAKSKSLEMDCKKL----GP 54 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG AVIT A +LP+K ++H VGP++R G E++LL++AYLNSL+L +S S+AFP Sbjct: 55 IKTGQAVITSAYNLPSKYIIHAVGPIYRDGLSGEEELLRNAYLNSLKLAKKHSIKSIAFP 114 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 IS G+Y YP A +IAV T+ EF+ V D + +L Sbjct: 115 LISAGIYAYPLKEACKIAVDTIREFLKNED--MDVTIAVLDPNIYDILTKL 163 >UniRef50_A8M6L5 Appr-1-p processing domain protein n=2 Tax=Micromonosporaceae RepID=A8M6L5_SALAI Length = 170 Score = 160 bits (405), Expect = 2e-38, Method: Composition-based stats. 
Identities = 79/163 (48%), Positives = 98/163 (60%), Gaps = 9/163 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I VV GDIT+ VD IV AAN SL+GGGGVDGA+HRAAGP L A + G C G Sbjct: 4 IEVVLGDITQQNVDAIVTAANESLLGGGGVDGAVHRAAGPRLAQAGGAI----GPCAPGD 59 Query: 65 AVITLAGDL--PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A+ T A DL P + ++HTVGPVWRGG E ++L Y SLR+ +VAFP I+ Sbjct: 60 AMPTPAFDLDPPVRHIIHTVGPVWRGGGHGEARVLASCYRRSLRIADDLDALTVAFPTIA 119 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAH 165 TGVYG+P AA IAV T+ +QV V +DE++ Sbjct: 120 TGVYGFPADQAARIAVATIRS---TPTNVQQVRLVAFDEDSRQ 159 >UniRef50_Q4SK43 Chromosome 2 SCAF14570, whole genome shotgun sequence. (Fragment) n=4 Tax=Tetraodontidae RepID=Q4SK43_TETNG Length = 418 Score = 159 bits (403), Expect = 3e-38, Method: Composition-based stats. Identities = 57/179 (31%), Positives = 86/179 (48%), Gaps = 4/179 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + V + D+T VD +VNAAN L GG+ A+ +A G + + ++ G Sbjct: 54 RVTVSVHKADLTNFPVDAVVNAANERLQHVGGIALALSKAGGSQIQQDSDEYIRKNGVLR 113 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGE--QNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG +V AG LP K ++HTVGP G + LL+ A LNSL+ SVA P Sbjct: 114 TGESVAMDAGSLPCKKIIHTVGPHVTGHSLTASAANLLEKAVLNSLKKADECRLRSVALP 173 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQGD 176 AIS+G++GYP A+ VK V +F ++ + + V + + ER Sbjct: 174 AISSGIFGYPLKECADTIVKAVRDFCEKYQIMSLKDILLVDKVDLTVNEMERACRTCFS 232 Score = 91.6 bits (226), Expect = 1e-17, Method: Composition-based stats. 
Identities = 46/164 (28%), Positives = 69/164 (42%), Gaps = 10/164 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + G I + +VIVN G + AI + AG +L A + Sbjct: 263 LTLKWGRIDEEQTNVIVNTTQKDSW-DGQISTAILKKAGTKMLKALKCANVGNR-----N 316 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 ++T +L V HT+ G Q+L DA L+L A +S S+AFPAI TG Sbjct: 317 VIVTEPYNLRCAEVYHTL--FTAGSTDKAYQILTDAVSECLQLAANHSRQSIAFPAIGTG 374 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHL 166 G + A I + V +F + + +VYFV Y D + Sbjct: 375 GRGLEKEKVASIMSEAVFKFANQSSKQMEVYFVIYPGDHSTFQV 418 >UniRef50_C7H575 RNase III regulator YmdB n=2 Tax=Faecalibacterium prausnitzii RepID=C7H575_9FIRM Length = 343 Score = 159 bits (403), Expect = 3e-38, Method: Composition-based stats. Identities = 68/169 (40%), Positives = 93/169 (55%), Gaps = 5/169 (2%) Query: 8 VQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVI 67 ++ DITK+A D IVN AN +L+ G G AI++AAG L A + G C G AV Sbjct: 6 IRNDITKVAADAIVNPANRNLLQGSGTSRAIYQAAGEQELTAACEAI---GRCDLGRAVC 62 Query: 68 TLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYG 127 T A LPAK + H V P W GG E + L AY ++L+L A SVAFP +S+G YG Sbjct: 63 TPAFGLPAKYIFHAVCPAWHGGGFGEAEQLAGAYHSALKLAAKYHCESVAFPLLSSGNYG 122 Query: 128 YPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 YP+ A IAV T+++++ H VY V YD + + +L + Sbjct: 123 YPKEQAFRIAVDTITQYVMEHD--LTVYLVLYDRGSLAVSRKLFASVEE 169 >UniRef50_C4V152 Appr-1-p processing protein n=2 Tax=Clostridiales RepID=C4V152_9FIRM Length = 346 Score = 159 bits (402), Expect = 4e-38, Method: Composition-based stats. 
Identities = 74/172 (43%), Positives = 101/172 (58%), Gaps = 6/172 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +V+ DITK+ VD IVN+ANP + GGGVD AIH+AAG LL A R++ G+ G Sbjct: 3 FAIVRNDITKMQVDAIVNSANPRAIVGGGVDRAIHQAAGAELLTA----RRKIGNIAAGT 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T A L A+ V+HTVGPVW+ G E +LL AY NSLRL A +S+AFP +S G Sbjct: 59 AAVTPAYRLHARYVIHTVGPVWQDGSHGERELLSRAYQNSLRLAAERDCSSIAFPLLSAG 118 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 V+G P A AV+ + +F+ H VY V +D ++ + + L Sbjct: 119 VFGCPSEIAIAAAVQAIRDFLQEHD--MDVYLVVFDRKSFKISDTLFDDVQS 168 >UniRef50_B7C850 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C850_9FIRM Length = 310 Score = 159 bits (401), Expect = 5e-38, Method: Composition-based stats. Identities = 62/167 (37%), Positives = 90/167 (53%), Gaps = 7/167 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +++ DITKL VD IVN NPSL GG+D IH+ AG L C ++ G+ G Sbjct: 3 IKIIRQDITKLKVDAIVNTTNPSLDAKGGLDHYIHQFAGKELDVECRRI----GNLKVGQ 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 A +T L K ++HT PVW +N + LL+ YL+SL L S+AFP IS+G Sbjct: 59 ACLTSGYKL-CKYIIHTASPVWNIQNKNNEALLKSCYLSSLMLANEYKLKSIAFPLISSG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 +P+ A ++A+ ++ F+T H + VY V YD + + L Sbjct: 118 TNQFPKELALQVAMNSIVSFLTDHEMM--VYLVVYDRNSYKISSELF 162 >UniRef50_A7HJC7 Appr-1-p processing domain protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJC7_FERNB Length = 184 Score = 159 bits (401), Expect = 5e-38, Method: Composition-based stats. 
Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 3/176 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I V GDIT +D IVNAAN L GGGV G I R GP + + ++ G Sbjct: 9 VEIEFVVGDITTQNIDAIVNAANSYLSHGGGVAGVISRKGGPTIQKESDEYVKKYGPVEP 68 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G +T AG+L AK V+HTVGP+ G + D ++ ++N ++ ++A P + Sbjct: 69 GGVAVTGAGNLSAKYVLHTVGPI--GDKPQNDDIIVKCFINIIKKSDELGIKTIAIPFVG 126 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQGDE 177 TG++GYP E K + ++ + +++ F D + +E + + Sbjct: 127 TGIFGYPLERFIENVTKVLINYLKDYEGTLQKIIFCDIDGYKVNKFEEYFLAKFKD 182 >UniRef50_D2V113 Appr-1-p domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2V113_NAEGR Length = 220 Score = 159 bits (401), Expect = 5e-38, Method: Composition-based stats. Identities = 55/187 (29%), Positives = 95/187 (50%), Gaps = 15/187 (8%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD-CPTG 63 + V +GD+T VDVIVNAAN L G+ GAI + G + K+ + G G Sbjct: 34 LQVRKGDLTMEKVDVIVNAANCRLQHMSGLAGAIVKNGGQIIQKESNKLIKDLGRELENG 93 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL---------LQDAYLNSLRLVAANSYT 114 V T++GDLP K + H VGP+W + N+ + L SL + + + Sbjct: 94 EVVETISGDLPCKTLYHAVGPIWSSRKANDFKTLGAEQEDFELGMCVEASLNMAVESGLS 153 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPE----QVYFVCYDEENAHLYER 169 S++ PAIS+G++G+P+ A++ TV+EF+ + + +V F +D+E +++ + Sbjct: 154 SISLPAISSGIFGFPKDRCAKVLFNTVTEFLKSNKDNIKADRFEVRFTNFDDETCNIFSK 213 Query: 170 LLTQQGD 176 + + Sbjct: 214 EFKSRFN 220 >UniRef50_Q6NRC6 MGC83934 protein n=3 Tax=Xenopus RepID=Q6NRC6_XENLA Length = 914 Score = 158 bits (399), Expect = 9e-38, Method: Composition-based stats. 
Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 4/176 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 R+ V +GD+T+ VD +VNAAN L GG+ A+ +A G + D + ++ +G Sbjct: 81 RVSVWKGDMTRQNVDAVVNAANEDLKHFGGLALALVKAGGAVIQDESRRHIEKYKKVKSG 140 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLV-AANSYTSVAFPAI 121 +T AG+LP K ++H VGP W G +Q L++ N L V ++ SVA PA+ Sbjct: 141 SIAVTSAGNLPCKMIIHAVGPEWSPGINAKCEQELKEVIRNVLMQVMNESNVRSVAIPAV 200 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERLLTQQG 175 S+G++ +P EI T +F + ++ FV D + Sbjct: 201 SSGIFRFPLQRCTEIIASTTKKFCDTETYHKLAEIRFVNIDTITVDAMKAACEGVF 256 Score = 108 bits (269), Expect = 9e-23, Method: Composition-based stats. Identities = 41/177 (23%), Positives = 78/177 (44%), Gaps = 10/177 (5%) Query: 4 RIHVVQGDITKLAVDVIVNA--ANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +++ +G I + VIVN+ AN +L G + AI R AG +L L + + P Sbjct: 357 NLYLTKGYIEEQKTAVIVNSLGANRNL-NEGNISKAILRKAGNSLSQEVLD--KSKYVSP 413 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 T + T LP V H + + ++ ++L+D L + +S++FPA+ Sbjct: 414 TDIMIPTRGYYLPCDFVYHVIL---QRSGSDQKKILKDGINACLNTALRYNTSSISFPAL 470 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCY--DEENAHLYERLLTQQGD 176 TG+ +P+ A++ V F + ++FV + D + +++ Q Sbjct: 471 GTGMLCFPKPVVAKVMTDEVLSFAKENPCNMDIFFVIHPNDTDTYSEFKKAFQAQQQ 527 >UniRef50_A2DTG7 Appr-1-p processing enzyme family protein n=2 Tax=Trichomonas vaginalis RepID=A2DTG7_TRIVA Length = 316 Score = 158 bits (399), Expect = 9e-38, Method: Composition-based stats. 
Identities = 60/174 (34%), Positives = 87/174 (50%), Gaps = 12/174 (6%) Query: 1 MKTRIHVVQG-DITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + +I G D TKL D IVNAAN L GGG+ GAI AAG + K +QG Sbjct: 51 INKKISFWMGGDSTKLKCDAIVNAANSYLAAGGGICGAIFSAAG---YEELQKACDEQGY 107 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T LP+K V+H VGPV + L+ AY +L + + S+AF Sbjct: 108 TETGGAKMTPGFRLPSKYVIHAVGPV-----GVHPEALRSAYNLTLGFMDNDKVKSIAFC 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALP---EQVYFVCYDEENAHLYERL 170 ISTG+YGY A +A+ TV +++ +++ FV + ++ +Y Sbjct: 163 CISTGIYGYSIEKATPVALDTVRKWLEVPENLAKTDRLVFVVFMPKDQQVYSHF 216 >UniRef50_C0W547 Appr-1-p processing domain protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W547_9ACTO Length = 285 Score = 157 bits (398), Expect = 1e-37, Method: Composition-based stats. Identities = 56/181 (30%), Positives = 89/181 (49%), Gaps = 9/181 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR- 54 + T++ + +GD+T L +VNAAN +++G +D +H AAGP L C + Sbjct: 103 LGTQVALWRGDLTTLRAGGVVNAANSAMLGCFVPGHRCIDNVLHAAAGPGLRAECARYMD 162 Query: 55 -QQQGDCPTGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANS 112 ++ TG A++T LPA V+HTVGP+ G Q LL Y + L Sbjct: 163 SREGRPEETGRALVTGGYHLPAAHVIHTVGPIVTHGVTQEHRDLLASCYRSVLDAAEGAG 222 Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 SV ++STGV+GYP+ AA + + T+ ++ RH ++ + E + YE L Sbjct: 223 LDSVGLCSVSTGVFGYPKQEAAPLVLDTIGRWLDRHPDSTLRIVICAFAEVDVRAYEAAL 282 Query: 172 T 172 Sbjct: 283 A 283 >UniRef50_B0EF86 MACRO domain-containing protein, putative n=2 Tax=Entamoeba RepID=B0EF86_ENTDI Length = 316 Score = 157 bits (397), Expect = 2e-37, Method: Composition-based stats. 
Identities = 65/172 (37%), Positives = 93/172 (54%), Gaps = 9/172 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 M +I VV GDITK+ DV+VNAAN L GG GVDGAIH AAG L D +R C Sbjct: 47 MNKKIIVVTGDITKIQADVVVNAANSYLRGGAGVDGAIHSAAGYELYDY---LRSHYKHC 103 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 TG + +P K ++H VGP+ LQ Y+ L V Y S+AFP Sbjct: 104 DTGDFKPSPGFKMPCKEILHGVGPI-----GENAIQLQRVYVRCLEYVRLKEYKSIAFPC 158 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLL 171 ISTG++GY A + ++ V +++ + L + ++ F CY+ + ++Y + L Sbjct: 159 ISTGIFGYSNEKACPVVLEVVRDWLEVNPLWDGKIIFCCYNLTDLNIYSKFL 210 >UniRef50_UPI000196AD9C hypothetical protein CATMIT_00588 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AD9C Length = 334 Score = 157 bits (397), Expect = 2e-37, Method: Composition-based stats. Identities = 63/170 (37%), Positives = 94/170 (55%), Gaps = 5/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +V+ DITK+ D+IVN ANP G D AI+ AAG +A L R+ G G Sbjct: 3 FKIVRNDITKVEADIIVNTANPQPKCVSGTDLAIYEAAGK---EALLAERKTIGPIERGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T A +L AK ++HTVGPVW G +E ++L+ Y L+ S+AFP ISTG Sbjct: 60 IAVTGAYNLNAKYIIHTVGPVWIDGNHHELEILERCYRLPLQKAIELGCQSIAFPLISTG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 VY +P+ A IAV S+F+T H + ++ V +D+ + L +++ + Sbjct: 120 VYEFPKNKALHIAVSVFSQFLTEHEI--EIILVVFDKTSFQLSSQIVGEI 167 >UniRef50_A3LYE6 Putative uncharacterized protein n=1 Tax=Pichia stipitis RepID=A3LYE6_PICST Length = 583 Score = 157 bits (396), Expect = 2e-37, Method: Composition-based stats. 
Identities = 62/193 (32%), Positives = 97/193 (50%), Gaps = 17/193 (8%) Query: 1 MKTRIHVVQGDITKL-AVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR 54 + ++ + +GDIT + V IVNAAN +L+G +D IH AAGP L AC + Sbjct: 91 LSPKLSIWKGDITTISDVTAIVNAANSALLGCFQPSHRCIDNIIHAAAGPDLRRACYNLV 150 Query: 55 QQQGD--CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ---LLQDAYLNSLRLVA 109 +Q+ P G A IT +LPAK V+HTVGP G + + L Y +SL + Sbjct: 151 EQRDFTQEPVGSAQITPGFNLPAKMVIHTVGPSLLPGSEPNQEEISQLAACYTSSLAKLE 210 Query: 110 AN----SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEEN 163 + S+ F ISTG++ +P A+ IA+++V + + H+ +V F + E N Sbjct: 211 EQEEDGNDKSIVFCCISTGLFSFPNDIASNIAIESVRNYFSEHPHSSISEVIFNVFTETN 270 Query: 164 AHLYERLLTQQGD 176 LY + + + Sbjct: 271 LKLYRQNFAEYQE 283 >UniRef50_C3Y5Q2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y5Q2_BRAFL Length = 1122 Score = 156 bits (395), Expect = 2e-37, Method: Composition-based stats. Identities = 53/176 (30%), Positives = 89/176 (50%), Gaps = 5/176 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++ + QGDIT+ DVIV+ N SL GG+ AI A GP + AC+ ++ G Sbjct: 707 VKVFIYQGDITQEVADVIVSCNNESLDSAGGIARAISDAGGPEIRRACVDYIRRHGRLSA 766 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVA-ANSYTSVAFPAI 121 G ++ T G L + VVHTV P +Q + Q L +L+ L + S+A PAI Sbjct: 767 GQSIWTPGGRLRCQHVVHTVSPQ-SSRDQTDHQQLFSTFLDLLNIAEFDLKVNSIAIPAI 825 Query: 122 STGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLLTQQ 174 +G+ G+P+A A++ + +S F +L +++ V D + + ++ +Q Sbjct: 826 GSGIAGFPKAVCADVMFRVISAFEDYQTPDSLLKEIRLVNIDAKTTAAFVQVFSQH 881 >UniRef50_B3RYC4 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RYC4_TRIAD Length = 491 Score = 156 bits (395), Expect = 2e-37, Method: Composition-based stats. 
Identities = 49/182 (26%), Positives = 87/182 (47%), Gaps = 8/182 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GDIT L VD IVN N +L ++ I AGP+L +R + G C Sbjct: 48 INQKLVLWTGDITTLKVDAIVNPTNENLSVMSPINQKIFEIAGPSLH---RDIRDEIGKC 104 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGE-QNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG + ++ +LP++ V+HTVGP + + L +Y +SL + S+A P Sbjct: 105 ATGESKLSKGYNLPSRYVIHTVGPKYNPRYLSAVENALYRSYRSSLLIAGEYKVRSIAIP 164 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL----LTQQG 175 + G+P + A IA++TV ++ + + + D+ +Y+RL + Sbjct: 165 TVHLHQRGFPVSEGAHIALRTVRRYLEHQSCTLETVILILDDTEMEIYKRLAVLYFPRSN 224 Query: 176 DE 177 +E Sbjct: 225 EE 226 >UniRef50_D2S4L6 Appr-1-p processing domain protein n=4 Tax=Actinomycetales RepID=D2S4L6_9ACTO Length = 170 Score = 156 bits (394), Expect = 3e-37, Method: Composition-based stats. Identities = 80/170 (47%), Positives = 102/170 (60%), Gaps = 5/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ--GDCPT 62 + V+GDIT+ VDV+VNAANP L+GGGGVDGAIH A GP +L C ++ G P Sbjct: 3 LRAVRGDITEADVDVVVNAANPGLLGGGGVDGAIHAAGGPEILAECRALKAGLPGGRLPR 62 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV T AG LPA+ VVHT GP+W +Q+ +L+ SLR+ SVAFPAIS Sbjct: 63 GRAVATTAGRLPARWVVHTAGPIWS-ADQDRSAVLRSCCTESLRVADGLGARSVAFPAIS 121 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GVYG+P A AA AV V +H ++V FV +D+ +E LT Sbjct: 122 AGVYGWPLADAAVQAVAGVRAVEVQH--VQEVRFVLFDDRALAAFEAALT 169 >UniRef50_UPI000194CBC9 PREDICTED: similar to B aggressive lymphoma n=1 Tax=Taeniopygia guttata RepID=UPI000194CBC9 Length = 718 Score = 156 bits (394), Expect = 3e-37, Method: Composition-based stats. 
Identities = 54/174 (31%), Positives = 84/174 (48%), Gaps = 5/174 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V + D+T+ VD++VNAAN L G G+ A+ +A GP + + Q+ G G Sbjct: 110 ICVYKDDLTRHKVDIVVNAANEYLEHGAGLALALVKAGGPEIKEESKLYVQRFGKVKVGD 169 Query: 65 AVITLAGDLPAKAVVHTVGPVWRG-GEQNEDQLLQDAYLNSLRL--VAANSYTSVAFPAI 121 +T G LP K ++H VGP W ++ LLQ A LN L + SVA PA+ Sbjct: 170 IAVTGGGKLPCKGIIHVVGPRWYALEKERCCYLLQKAILNVLHYVSAPGKALKSVAIPAV 229 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQ 173 S+G+Y +P +++ V V EF+ ++ V DE ++ + Sbjct: 230 SSGIYAFPIDLCSQVIVMAVKEFVEASPPGCLREIRLVNIDESTVAEIKKACEK 283 >UniRef50_A1WVH3 Appr-1-p processing domain protein n=14 Tax=Bacteria RepID=A1WVH3_HALHL Length = 181 Score = 156 bits (394), Expect = 4e-37, Method: Composition-based stats. Identities = 63/174 (36%), Positives = 91/174 (52%), Gaps = 6/174 (3%) Query: 4 RIHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + GDI D +VNAAN LM GGGV GA+HRAAGP L +AC + Sbjct: 9 TVETRVGDIAAQGDCDAVVNAANAQLMPGGGVAGALHRAAGPELAEACRPLA----PIQP 64 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AVIT LP + V+H +GPV+ G ++ +QLL Y N+L + T VA PA+S Sbjct: 65 GQAVITAGFGLPNRHVIHCLGPVY-GVDEPGEQLLAACYRNALHRAEEHELTRVAMPALS 123 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 TG +G+P AA +A+ T+ + V FV D +++ ++ + + Sbjct: 124 TGAFGFPMERAARVAIGTLQRTAAQLRYVRHVRFVLADAAAQQIHDHVIQELAE 177 >UniRef50_C0PSL1 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PSL1_PICSI Length = 204 Score = 156 bits (394), Expect = 4e-37, Method: Composition-based stats. 
Identities = 74/157 (47%), Positives = 92/157 (58%), Gaps = 7/157 (4%) Query: 4 RIHVVQGDITKL----AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQG- 58 + + QGDITK D IVNAAN ++GGGGVDGAIH AAGP LL ACL V + Q Sbjct: 21 TLVIHQGDITKWFINGENDAIVNAANELMLGGGGVDGAIHSAAGPELLRACLNVPEIQPG 80 Query: 59 -DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 CP G A IT A +LP ++HTVGP++ E + +L AY +SL + N VA Sbjct: 81 VRCPAGSARITEAFNLPVSHIIHTVGPIYDE-EGDSASVLSSAYKSSLEVAEENHIKYVA 139 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQV 154 FPAIS GVYGYP AAE+A+ T+ +V Sbjct: 140 FPAISCGVYGYPLEKAAEVALLTLKNHAGDLEEILEV 176 >UniRef50_A8STD9 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8STD9_9FIRM Length = 348 Score = 155 bits (392), Expect = 5e-37, Method: Composition-based stats. Identities = 60/167 (35%), Positives = 93/167 (55%), Gaps = 6/167 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ-GDCPTG 63 + +V+ DI K+ D IVN AN ++ G G DGA++RAAG D L R++ G G Sbjct: 3 LRIVRNDIVKMTTDAIVNTANDHVVVGTGCDGAVYRAAG---YDELLNYRREYIGFVEEG 59 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A IT L A+ ++H V P + G+ E+ L+ Y SL+L N S+AFP IST Sbjct: 60 GAFITPGFGLNARYIIHAVSPRFIDGDHGEEGKLRSCYRKSLQLAKENGVRSIAFPLIST 119 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 G +GYP+ IAV ++ F+ + + ++ V +DE++ L E++ Sbjct: 120 GGFGYPKEEGLRIAVDEINAFLFENEV--DIFLVVFDEKSTRLGEKI 164 >UniRef50_Q2TX23 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 n=8 Tax=Fungi/Metazoa group RepID=Q2TX23_ASPOR Length = 615 Score = 155 bits (392), Expect = 6e-37, Method: Composition-based stats. 
Identities = 61/188 (32%), Positives = 90/188 (47%), Gaps = 17/188 (9%) Query: 4 RIHVVQGDITKL-AVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQ 57 I + +GDIT L V IVNAAN L+G +D IH AAGP L DAC + +Q Sbjct: 113 NISLWKGDITSLTDVTAIVNAANSQLLGCFRPDHRCIDNIIHSAAGPRLRDACNSLMLKQ 172 Query: 58 GD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRL-----V 108 G +T +LPA+ V+HTVGP + + Q L Y + L Sbjct: 173 CHPESVGSVKVTSGFNLPAQWVLHTVGPQVNSRKSPGTLQQQQLASCYSSCLDATESLPA 232 Query: 109 AANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHL 166 + VAF ISTG++ +P AA+IA++TV ++ H + F + E + L Sbjct: 233 LPDGRKVVAFCCISTGLFAFPPDMAAKIALETVVQWCMNHPATSVTDIIFDTFLERDYEL 292 Query: 167 YERLLTQQ 174 Y+ +++ Sbjct: 293 YQANISEL 300 >UniRef50_C3Y6H9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y6H9_BRAFL Length = 2209 Score = 155 bits (392), Expect = 6e-37, Method: Composition-based stats. Identities = 61/176 (34%), Positives = 90/176 (51%), Gaps = 4/176 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + V + D+T+ VDVIVNAAN L GG+ +I AGP L C K+ +++ Sbjct: 1142 KTVTVRKDDLTRHVVDVIVNAANRDLKHIGGLAKSISDVAGPVLQSECDKITRRRSLLD- 1200 Query: 63 GHAVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G V+T AG + K ++H VGP+W+GG + E L DA SL +TS+A PAI Sbjct: 1201 GQVVVTSAGAMTTCKEIIHAVGPLWQGGFRREADALYDAAYGSLEEAGRRGHTSIAIPAI 1260 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQG 175 S+G+Y +P A + V+ V EF + + V V D+ + LT + Sbjct: 1261 SSGIYSFPVDQCANLIVEAVDEFWKNNRSSTLSLVELVNNDDRTVDAFVEALTSRH 1316 Score = 112 bits (281), Expect = 5e-24, Method: Composition-based stats. 
Identities = 37/176 (21%), Positives = 83/176 (47%), Gaps = 7/176 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CP 61 I +Q +I VDV+VN+ + +L + G + +I GP L + Q+G Sbjct: 1393 ITAMQANIASQRVDVMVNSTSHNLNLNSGQLSKSILDRGGPELQTLVNNAKAQKGIQSLA 1452 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G + + L + V+H+ W GG+ + +++L++ L++ + ++A PA+ Sbjct: 1453 DGDILESGPAGLNVQTVIHSALCRWDGGQGDSEKVLRELVRKCLKVAEEGGHKTIAIPAM 1512 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCY--DEENAHLYERLLTQ 173 TG +P AE ++ ++ + E++ F+ + D ++ ++ ++T+ Sbjct: 1513 GTGGLHFPHEVVAEALFGEAVDYFKQNPQSSIEEIRFIVWEGDPKSMVAFDEIMTK 1568 Score = 53.5 bits (127), Expect = 3e-06, Method: Composition-based stats. Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 1/50 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKV 53 + V GDIT+ IVN++N L + GGV AI GP++ C + Sbjct: 1657 LQVQPGDITEETTVAIVNSSNEQLDLTKGGVSNAIRNKGGPSIERECGNI 1706 >UniRef50_A5D049 Predicted phosphatase n=4 Tax=Bacteria RepID=A5D049_PELTS Length = 359 Score = 155 bits (392), Expect = 6e-37, Method: Composition-based stats. Identities = 58/166 (34%), Positives = 83/166 (50%), Gaps = 6/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V++GDIT+L VD IVNAAN L G GV GAI R G A+ + + +G P G Sbjct: 2 IKVLKGDITELQVDAIVNAANNHLWMGAGVAGAIKRKGGAAIEEEAVA----KGPIPVGE 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AV+T AG L A+ VVH + + ++ A N+L ++AFPA+ TG Sbjct: 58 AVVTGAGLLKARYVVHAAAM--GQDLVTDAEKVRAATRNALLRAGELGLKTIAFPALGTG 115 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 V G AA + V V + P +V F +D++ + R+ Sbjct: 116 VGGLEFDTAARVMVGEVRRHLALGLEPGEVIFALFDDKGYDAFSRI 161 >UniRef50_C3Y6H4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y6H4_BRAFL Length = 2120 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. 
Identities = 60/176 (34%), Positives = 87/176 (49%), Gaps = 3/176 (1%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++ V + DITK DVIVNAAN L GG+ AI A G + C Q G Sbjct: 1024 KKLVVWRDDITKHKADVIVNAANVRLEHVGGLAKAIVDAGGDIIQKFCNDYIQANGKLIP 1083 Query: 63 GHAVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G V + G + + ++H VGP+W GG E+ L DA SL A +++ S+A PAI Sbjct: 1084 GQVVSSPPGRINTCQRILHAVGPIWNGGGLGEEGHLADAVYGSLEEAAKSNFRSIAIPAI 1143 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQG 175 S+G+YGYP AEI V E++ + + + FV ++ A + L + Sbjct: 1144 SSGIYGYPLKKCAEIIVAKTVEYLEDNPTTSLQVIKFVNIVDQTAEAFVDALVSEF 1199 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 40/174 (22%), Positives = 72/174 (41%), Gaps = 10/174 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +++ +GD+TK D IVN+ N L G V AI +A GP + C + +G Sbjct: 1509 TLNIRKGDLTKETTDCIVNSTNEQLDLTRGAVSNAICKAGGPDIEQECKNIA-ARGGMRD 1567 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G AV T +G L ++H P + + N L+ S+AFPA+ Sbjct: 1568 GIAV-TGSGQLKCGKIIHAAAP-----APGQSTGWKKVITNCLQTADTLRLRSIAFPALG 1621 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQ 174 TG + A + + +F+ ++ +V + +E + + ++ Sbjct: 1622 TGTLQGSAESTATTMLDALQDFVLQNKATRLNEVRITIFQQEMVRAFHEEMQKK 1675 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. 
Identities = 43/166 (25%), Positives = 74/166 (44%), Gaps = 7/166 (4%) Query: 15 LAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGD--CPTGHAVITLAG 71 VDV+VN +L + G V AI + AG L + QQ P G ++T + Sbjct: 1276 QNVDVLVNTTAGNLNLNTGAVSRAILQLAGNDLQTLVNRAMQQARITSLPDGQILVTDSA 1335 Query: 72 DLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRA 131 DL K V+H V W GG+ N +++L+ L+ +Y S+A PA+ TG +P Sbjct: 1336 DLLCKQVIHCVLCSWDGGQGNSEKVLRKIVQQCLQQAEKGNYASIAIPAMGTGGLHFPHD 1395 Query: 132 AAAEIAVKTVSEFITRHA--LPEQVYFVCY--DEENAHLYERLLTQ 173 AE E ++ ++ F+ + D ++ + ++ + Sbjct: 1396 VVAEAMFDEAVEHCRKNPSGSLREIRFIVWEEDPKSIPAFNEVMMK 1441 >UniRef50_A2QSI2 Contig An08c0280, complete genome n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QSI2_ASPNC Length = 603 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. Identities = 60/190 (31%), Positives = 90/190 (47%), Gaps = 18/190 (9%) Query: 4 RIHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR-QQ 56 +H+ QGDIT L V I NAAN ++G +D IH AGP L + C Q Sbjct: 108 TLHLWQGDITTLDGVTAITNAANEQMLGCFQPAHRCLDNVIHARAGPRLREECFHHMDQG 167 Query: 57 QGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ----NEDQLLQDAYLNSLRLVAANS 112 Q P GHA T LPA V+HTVGP G+ ++ Q L+ Y L + A Sbjct: 168 QRTLPVGHACATKGYCLPAPYVIHTVGPQLDAGQPVPTAHQRQQLRQCYEAVLDVAEALP 227 Query: 113 Y-----TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAH 165 S+A ISTG++ +P AA IA+++V +++ H + F + + + Sbjct: 228 ASDPRGKSIALCGISTGLFAFPVEEAASIAIQSVLDWLRHHLHTSITNIIFNTFTDTDTA 287 Query: 166 LYERLLTQQG 175 +Y++ L + Sbjct: 288 VYQQTLKKMH 297 >UniRef50_C2L199 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2L199_9FIRM Length = 344 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. 
Identities = 57/166 (34%), Positives = 97/166 (58%), Gaps = 5/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +++ DITK+ VD IVN ANP G+D A+++AAG L L+ RQ+ G G Sbjct: 3 FQIIRNDITKMQVDAIVNPANPIPGYAAGIDSAVYKAAGEEKL---LRRRQEIGAIAPGS 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 + IT +LPAK ++HTVG W+GG +E+ +++ Y + +L + S+A P +++G Sbjct: 60 SFITDGYNLPAKYIIHTVGTAWQGGNSDEEIIIRKCYRSIFKLALEHHILSLAIPLLASG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 YG+P+ A IA+ + F++ + + ++Y V +DE++ L L Sbjct: 120 SYGFPKGIALRIALSEIESFMSENDI--ELYLVVFDEKSYSLSTEL 163 >UniRef50_A4TAV6 Appr-1-p processing domain protein n=6 Tax=Actinomycetales RepID=A4TAV6_MYCGI Length = 577 Score = 154 bits (388), Expect = 1e-36, Method: Composition-based stats. Identities = 68/169 (40%), Positives = 93/169 (55%), Gaps = 6/169 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V+GDIT+ VD +VN AN ++ GGGG DGAIHRA GPA+L C+K + TG Sbjct: 11 TITAVRGDITEQEVDAVVNPANTAMRGGGGADGAIHRAGGPAILRDCVK--RFPDGLATG 68 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A T AGDLPA+ V+HTVGP + G Q LL+ Y +L++ VAFP IST Sbjct: 69 DAGWTTAGDLPAQWVIHTVGPNYDTG-QRNRSLLESCYRRALKVADELGARIVAFPLIST 127 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 G +G+PR A A++T++ ++ V +D + L Sbjct: 128 GSFGWPRQDAIAAAIETIA---AADTRVDEARLVAFDPKTHEEIRSALA 173 >UniRef50_A8FQZ3 Putative uncharacterized protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FQZ3_SHESH Length = 268 Score = 154 bits (388), Expect = 2e-36, Method: Composition-based stats. 
Identities = 69/178 (38%), Positives = 97/178 (54%), Gaps = 10/178 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQGD 59 + + QGDIT+LA D IVNAAN L G +D AIH A+G L D C + + QG Sbjct: 91 VKLWQGDITRLAADAIVNAANKELQGCFQPLHSCIDNAIHSASGVRLRDDCAVIIKAQGQ 150 Query: 60 -CPTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAYLNSLRLVAA-NSYTSV 116 T A IT +LP + V+HTVGP+ +G +LLQ Y N L L S+ Sbjct: 151 FEETAKAKITSGYNLPCQYVLHTVGPIVQGNVTGEHQKLLQLCYENCLALADQTLGINSI 210 Query: 117 AFPAISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLT 172 AF ISTGV+GYP+ AA+ AV+ V +++ ++ + V F + E+ LY++ L Sbjct: 211 AFCCISTGVFGYPQKPAAQAAVRAVQQWLLNNPNSNIDTVIFNTFKPEDTRLYQQFLQ 268 >UniRef50_Q4DSL4 Putative uncharacterized protein n=4 Tax=Trypanosoma RepID=Q4DSL4_TRYCR Length = 297 Score = 153 bits (387), Expect = 2e-36, Method: Composition-based stats. Identities = 66/169 (39%), Positives = 88/169 (52%), Gaps = 10/169 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + G +T L +D IVNAAN + +GG GVDGAIH AAGP L+ C C TG Sbjct: 125 IALHNGPVTDLQLDAIVNAANKTCLGGKGVDGAIHAAAGPLLVRECATFN----GCDTGQ 180 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 IT +LPA+ V+HTVGP+ + L+ Y + L L N S+ F +STG Sbjct: 181 CRITKGYNLPARYVLHTVGPI-----GERPEALRSCYRSILSLAHRNRLRSIGFCCVSTG 235 Query: 125 VYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLT 172 VYGYP A IAV E++ +H + + F C+ E + Y L Sbjct: 236 VYGYPLIPATRIAVDETIEYLKQHFSAFDLCCFACFKLEEYNAYTDCLR 284 >UniRef50_C4G1S1 Putative uncharacterized protein n=3 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1S1_ABIDE Length = 359 Score = 153 bits (387), Expect = 2e-36, Method: Composition-based stats. 
Identities = 56/170 (32%), Positives = 91/170 (53%), Gaps = 5/170 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 +++ DITK+ D IVN ANP + GGGV+ AI+ AAG L L R++ G G Sbjct: 3 FRIIRNDITKVKADAIVNTANPEVAIGGGVETAIYSAAGKKKL---LDERKKIGILQPGE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +T A DL AK ++H P W+GG + E + L+D Y L+ S+AFP ++TG Sbjct: 60 VGVTEAFDLAAKYIIHVSSPRWKGGNKGEIKCLRDCYEKVLKTAKDYGCESIAFPLLATG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 YG+P+ ++AV + F+ + ++ V ++ E + +L+ + Sbjct: 120 TYGFPKEVGVQVAVDAFTAFLEENE--MEITLVVFESEAVSISGKLVEEV 167 >UniRef50_Q0CEI7 Putative uncharacterized protein n=1 Tax=Aspergillus terreus NIH2624 RepID=Q0CEI7_ASPTN Length = 524 Score = 153 bits (386), Expect = 3e-36, Method: Composition-based stats. Identities = 58/177 (32%), Positives = 84/177 (47%), Gaps = 8/177 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I + DIT L VD IV + G GG+DGA+H AAGP LLDAC + G C Sbjct: 316 NDIISLAHTDITTLEVDCIVTGISE-PRGQGGLDGAVHAAAGPRLLDACNDL----GKCW 370 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 +T A +LP K V+HTV P + G + LL+ Y L + ++AFPA+ Sbjct: 371 VEEVQVTDAYNLPCKKVIHTVSPPYADGSADSKWLLRACYRRCLEIAIEGGMRTIAFPAL 430 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALP---EQVYFVCYDEENAHLYERLLTQQG 175 STG G+ AA A++ V F+ +++ F +++ +Y Q Sbjct: 431 STGSKGFKSYEAATAALEEVRCFLDEPGHLLRFDKIIFCNIHQQDMEVYVAFTGQFF 487 >UniRef50_O67112 UPF0189 protein aq_987 n=4 Tax=cellular organisms RepID=Y987_AQUAE Length = 165 Score = 152 bits (385), Expect = 3e-36, Method: Composition-based stats. 
Identities = 54/167 (32%), Positives = 80/167 (47%), Gaps = 7/167 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I VV+G IT++ DVIVN AN + GGGV I R G + ++ + P G Sbjct: 3 IKVVKGSITEVDADVIVNPANSRGLMGGGVAVVIKRLGGEEIEREAVE----KAPIPVGS 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AV+T AG L K V+H + ++ ++ A +L L + VA P + TG Sbjct: 59 AVLTTAGKLKFKGVIHAPTMEEPAM-PSSEEKVRKATRAALELADKECFKIVAIPGMGTG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 V G P+ AA V+ + +F E+V V DEE +E++L Sbjct: 118 VGGVPKEVAARAMVEEIRKF--EPKCLEKVILVDIDEEMVEAWEKVL 162 >UniRef50_C7Z089 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7Z089_NECH7 Length = 592 Score = 152 bits (385), Expect = 4e-36, Method: Composition-based stats. Identities = 59/187 (31%), Positives = 89/187 (47%), Gaps = 17/187 (9%) Query: 3 TRIHVVQGDITKLA-VDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ- 55 T + V +GDIT L + I NAAN ++G +D IH AGP L D C ++ Q Sbjct: 99 TNLVVWRGDITTLTGITAITNAANGQMLGCFQPTHRCIDNIIHSRAGPRLRDECFQLMQD 158 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRL----- 107 + D G ++T DLP+ V+HTVGP R G E + L Y ++L Sbjct: 159 RDKDLGAGETLVTRGYDLPSPYVIHTVGPQLRRGASPTEVERRQLARCYESTLDALELLP 218 Query: 108 VAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAH 165 + ++A ISTG++ +P AAEIA+ TV ++ H + F + E + Sbjct: 219 AEEDGRKAIALCCISTGLFAFPAKEAAEIAILTVLSWLDNHPSTTITDIIFNTFTESDTE 278 Query: 166 LYERLLT 172 +Y +L Sbjct: 279 IYSKLFE 285 >UniRef50_Q8IXQ6 Poly [ADP-ribose] polymerase 9 n=27 Tax=Eutheria RepID=PARP9_HUMAN Length = 854 Score = 152 bits (383), Expect = 6e-36, Method: Composition-based stats. 
Identities = 47/173 (27%), Positives = 85/173 (49%), Gaps = 6/173 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V + D+T AVD +VNAAN L+ GGG+ A+ +A G + + + + G G Sbjct: 119 ELSVWKDDLTTHAVDAVVNAANEDLLHGGGLALALVKAGGFEIQEESKQFVARYGKVSAG 178 Query: 64 HAVITLAGDLPAKAVVHTVGPVWR-GGEQNEDQLLQDAYLNSLRLVAANS--YTSVAFPA 120 +T AG LP K ++H VGP W +Q LQ A ++ L V + +VA PA Sbjct: 179 EIAVTGAGRLPCKQIIHAVGPRWMEWDKQGCTGKLQRAIVSILNYVIYKNTHIKTVAIPA 238 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERL 170 +S+G++ +P + V+T+ + + ++++ V ++ ++ Sbjct: 239 LSSGIFQFPLNLCTKTIVETIRVSLQGKPMMSNLKEIHLVSNEDPTVAAFKAA 291 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 46/176 (26%), Positives = 75/176 (42%), Gaps = 8/176 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + +VQG I DVIVN+ NP + G V +I + AG + L + +Q Sbjct: 316 NLTLQIVQGHIEWQTADVIVNSVNPHDITVGPVAKSILQQAGVEMKSEFLATKAKQ-FQR 374 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 + ++T +L K + H + E + Q+L+ A L + TS++FPA+ Sbjct: 375 SQLVLVTKGFNLFCKYIYHVL----WHSEFPKPQILKHAMKECLEKCIEQNITSISFPAL 430 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCY--DEENAHLYERLLTQQ 174 TG + AAEI V F H + V FV + D E + + ++ Sbjct: 431 GTGNMEIKKETAAEILFDEVLTFAKDHVKHQLTVKFVIFPTDLEIYKAFSSEMAKR 486 >UniRef50_A0CX10 Chromosome undetermined scaffold_3, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CX10_PARTE Length = 183 Score = 152 bits (383), Expect = 6e-36, Method: Composition-based stats. 
Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 7/178 (3%) Query: 5 IHVVQGDITKL-AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + +++ +I KL VD IVNAAN L+ GGGV GAI +AAG L C + QQ G PT Sbjct: 6 VKIIKENIVKLVDVDAIVNAANQELLPGGGVCGAIFQAAGRELERECQQYIQQYGIVPTS 65 Query: 64 HAVITLAGDLP---AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV-AANSYTSVAFP 119 +T + L K ++H VGP + ED+ LQ N L SVA P Sbjct: 66 KLAVTSSCQLKKNNIKYIIHAVGPKYFQSSSPEDE-LQICVNNILNQSFNVLELKSVAIP 124 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCYDEENAHLYERLLTQQGD 176 AIS+G+YG+P+ A+I + E+ + + ++ +D+E +++++ QQ Sbjct: 125 AISSGIYGFPKGLCAQIFKLVIEEYQKDTSNKQGEIILCNFDQETTTIFQKVFQQQNS 182 >UniRef50_Q8B4N1 ORF-1 n=7 Tax=Infectious spleen and kidney necrosis virus RepID=Q8B4N1_ISKNV Length = 566 Score = 152 bits (383), Expect = 7e-36, Method: Composition-based stats. Identities = 70/179 (39%), Positives = 96/179 (53%), Gaps = 8/179 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +T + VV DIT L VD IVNAAN +GGGGVDG IHR AG L C + G Sbjct: 389 QTNVSVVLDDITSLRVDAIVNAANTVGLGGGGVDGRIHRVAGRELKRECRTL----GGIG 444 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGE---QNEDQLLQDAYLNSLRLVAANSYTSVAF 118 G A IT LPA V+HTVGP+ G+ Q + ++L Y+ SL + AN ++AF Sbjct: 445 FGEAKITGGYRLPATYVIHTVGPIINAGQRPTQADKRVLTSCYIQSLHVAQANGVRTIAF 504 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQGD 176 P+ISTGVY YP A +A+ +V ++ +H + + F Y + +Y L + Sbjct: 505 PSISTGVYNYPIEDAVHVAMSSVRAYVIQHPGAFDHIVFCTYSNADFDVYNSQLPTYFN 563 >UniRef50_C9RQW9 Appr-1-p processing domain protein n=5 Tax=Bacteria RepID=C9RQW9_FIBSS Length = 347 Score = 151 bits (382), Expect = 7e-36, Method: Composition-based stats. 
Identities = 56/166 (33%), Positives = 88/166 (53%), Gaps = 5/166 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + +V+ DI+++ D IVN+AN + + GGG + I+ AAG L A R++ G Sbjct: 3 LRIVRNDISRVRADAIVNSANKNPVCGGGAEYHIYEAAGYDKLLAA---REKIGVLDVAE 59 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 ++ A L AK ++H VGP W GGE E L Y +L S+AFP IS+G Sbjct: 60 VAVSSAFALKAKYLIHVVGPKWNGGESGETSALASCYRRALEKALELGCESIAFPLISSG 119 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 V+ +P+ +A +IA++ + EF+ H V V +D + + E L Sbjct: 120 VFRFPKDSALKIALQAIGEFLQSHE--MDVQLVVFDRKAFDVSEEL 163 >UniRef50_C3Y5X5 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y5X5_BRAFL Length = 1925 Score = 151 bits (382), Expect = 8e-36, Method: Composition-based stats. Identities = 57/180 (31%), Positives = 85/180 (47%), Gaps = 9/180 (5%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++ V QGD+T L VDVIVNAAN L GG+ A+ +A G + C + G Sbjct: 909 KKLFVCQGDLTALQVDVIVNAANSRLSHVGGLAAALVKAGGKEIQRDCESYIRTSGQLSD 968 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-----LLQDAYLNSLRLVAANSYTSVA 117 G + T LP K VVH VGP W+ G +++ L A +SL+ + S+ Sbjct: 969 GDVMTTKPYRLPCKMVVHAVGPQWKSGLSEDEKGGKEANLYRAAFSSLQEAKD--FHSIG 1026 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQQG 175 PAIS+GVYG+P ++ V F H +VYF D + ++ + ++ Sbjct: 1027 IPAISSGVYGFPIDLCVSAILEGVMSFFNIHPNCKLSEVYFTEMDAKKTGAFKAEMVKRF 1086 Score = 136 bits (343), Expect = 3e-31, Method: Composition-based stats. 
Identities = 47/175 (26%), Positives = 71/175 (40%), Gaps = 10/175 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + V QGDIT +VD I+ N L GV + R AG +L CL V QQ G+ Sbjct: 1310 NVTLQVQQGDITTESVDAIIVPTNNKLRLDAGVAQVVSRKAGGSLQAECLAVVQQYGELQ 1369 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G T AG LP + V+H P + L+D + L+ SVA PAI Sbjct: 1370 NGAVATTGAGSLPCRHVLHLANP--------QPNHLKDNIKHCLQTADQKKLKSVALPAI 1421 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQ 174 TG +A+ + ++EF+ + V + + + + + Sbjct: 1422 GTGGINISPDQSAKGMLDGIAEFVQQSNPQNLALVRITIFQPQMLQTFHTEMDNR 1476 Score = 90.1 bits (222), Expect = 3e-17, Method: Composition-based stats. Identities = 41/174 (23%), Positives = 56/174 (32%), Gaps = 50/174 (28%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-GGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + + QG IT DV+VN L G GGV A +A GP L Sbjct: 1142 TLQLKQGGITAEQADVLVNTVGTDLDLGQGGVASAFLKAGGPELQQP------------- 1188 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 L+ L + N S+AFPA+ Sbjct: 1189 ----------------------------------LRTIIQTCLTMAHKNGLPSIAFPALG 1214 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQ 174 TG GYPR+ AA V F + + V V YD+ ++ L + Sbjct: 1215 TGNLGYPRSVAASAMFDEVVSFSQANPSTSLKHVSIVVYDQPTVQAFQAELRTR 1268 >UniRef50_C2DZH9 Appr-1-p processing protein n=4 Tax=Lactobacillus jensenii RepID=C2DZH9_9LACO Length = 218 Score = 150 bits (380), Expect = 1e-35, Method: Composition-based stats. 
Identities = 77/173 (44%), Positives = 101/173 (58%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + I VV+ + D IVNAAN +L+GGGGVDGAIH+AAGP LL+AC K+ C Sbjct: 48 LSKNIFVVKASVVNFPADAIVNAANKTLLGGGGVDGAIHQAAGPNLLEACKKLN----GC 103 Query: 61 PTGHAVITLAGDLP-AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 TG A IT + DL K ++HTVGPV++ + + Q LQ Y SL L SVAF Sbjct: 104 DTGEAKITPSFDLKTCKYIIHTVGPVFKLSQNPQQQ-LQSCYKKSLDLALEYKCNSVAFS 162 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 ISTGVY YP AA +A + V+E++ RH +V+ CY E Y +L+ Sbjct: 163 GISTGVYEYPVKQAASVASEAVAEWLKRHNFAIKVFLCCYKESEFEAYAQLVR 215 >UniRef50_A7S3X0 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7S3X0_NEMVE Length = 143 Score = 150 bits (380), Expect = 2e-35, Method: Composition-based stats. Identities = 57/143 (39%), Positives = 77/143 (53%), Gaps = 1/143 (0%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V QGDIT D +VNAAN L+ GGGV GAI G ++ + C ++ + G G Sbjct: 1 VTVYQGDITNERADAVVNAANCDLIHGGGVAGAILAKGGWSIQEECYQIVGRFGRLEVGD 60 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV T AG L KAV+H VGP W G + L A L SL + S+AFPAIS+ Sbjct: 61 AVQTNAGKLLCKAVIHAVGPTWLGATPEQVKNQLFRACLESLYTADNINLCSIAFPAISS 120 Query: 124 GVYGYPRAAAAEIAVKTVSEFIT 146 G+YG P+ A++ + V + Sbjct: 121 GIYGVPKEICAQVMLDVVEHYAE 143 >UniRef50_C3Y406 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3Y406_BRAFL Length = 2514 Score = 150 bits (378), Expect = 2e-35, Method: Composition-based stats. 
Identities = 53/181 (29%), Positives = 90/181 (49%), Gaps = 5/181 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD-C 60 ++ V + D+TK VD NAAN +L GGG+ AI +A G + D C ++ + + Sbjct: 1066 SRKLVVFKDDLTKHHVDATTNAANKNLKNGGGLAEAIIKAGGKEIQDHCDQIMKDEPAGL 1125 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 G +T G LP KAV+H VGP + ++ L N L + + ++SVA Sbjct: 1126 MVGAVRVTGPGKLPCKAVIHAVGPNFHEIKDDKRSRDELFKTVTNVLEMASRYGFSSVAI 1185 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQGD 176 PAIS+G++G P + V+ + ++ + +V+FV D + A + + L + + Sbjct: 1186 PAISSGIFGGPLDLCTKTVVRATGLYFKKNKESKVNEVHFVGIDLDIAQSFNKALLETFN 1245 Query: 177 E 177 E Sbjct: 1246 E 1246 Score = 138 bits (347), Expect = 9e-32, Method: Composition-based stats. Identities = 47/178 (26%), Positives = 79/178 (44%), Gaps = 5/178 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP- 61 +I +++G I+ DVIVN P L + G V A+ GP L C K+++ G P Sbjct: 1300 KITLIRGSISDQQADVIVNTIGPDLNLRTGAVSKALLDKGGPTLQVECDKIKRDLGRLPA 1359 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWR-GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G V T G+L V H V W + +L+ L+ +S ++AFPA Sbjct: 1360 HGEVVYTSGGNLGCNLVYHAVCSFWNSQDTAKSEDVLRKIVTACLKSADKDSKRTIAFPA 1419 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQGD 176 + TG GYP+ A + + ++ E+ FV +D+ + + L ++ + Sbjct: 1420 VGTGGLGYPKDVVARLMFEETLSHSNKNPAGDLEEAKFVIFDQPSFEAFLSELGKRTE 1477 Score = 116 bits (290), Expect = 4e-25, Method: Composition-based stats. 
Identities = 40/172 (23%), Positives = 70/172 (40%), Gaps = 11/172 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-GGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V QGDIT+ VD IVN + G V + + GP + C K + + Sbjct: 1522 VEVEQGDITREKVDAIVNPTRGDMDLSLGKVSQVLKKKGGPVVQTECEKYDKNK--LKRD 1579 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 IT AG L ++ ++H V P + + A +N L + S+AFPA+ T Sbjct: 1580 GVGITAAGGLASRYILHLVAPGFETERW------KKAVMNCLAYAECHQLKSLAFPALGT 1633 Query: 124 GVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERLLTQ 173 G +A + ++ +++F + V V ++ + L + Sbjct: 1634 GQMAKDPTESATMIIEAIADFAQKKNPKHLKHVRIVIFEAGMMKPFHDKLGK 1685 >UniRef50_D2V337 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V337_NAEGR Length = 177 Score = 149 bits (377), Expect = 3e-35, Method: Composition-based stats. Identities = 63/177 (35%), Positives = 88/177 (49%), Gaps = 22/177 (12%) Query: 17 VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVR----------------QQQGDC 60 +D IVNAAN SLMGGGG+D IH AG L C + + C Sbjct: 1 IDTIVNAANESLMGGGGIDQIIHARAGDELKLECKTKYSPSCLKMKGSITYGNDELEYRC 60 Query: 61 PTGHAVITLAGDL--PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 TG AVIT A +L + ++HTVGP + +LL + Y + L+L N+ S+AF Sbjct: 61 ATGEAVITQAHNLSEKCQYIIHTVGPYLDENGNTQPELLSNCYNSCLQLAMENNLKSIAF 120 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH----ALPEQVYFVCYDEENAHLYERLL 171 P ISTG YGYP A +A+K V F+ H + + FV +++ +Y+ L Sbjct: 121 PCISTGYYGYPIEEACRLALKIVKNFLHSHLNKQSSLRHIIFVIFNDLEFEIYKILF 177 >UniRef50_C5C222 Appr-1-p processing domain protein n=2 Tax=Actinomycetales RepID=C5C222_BEUC1 Length = 193 Score = 149 bits (376), Expect = 4e-35, Method: Composition-based stats. 
Identities = 76/156 (48%), Positives = 92/156 (58%), Gaps = 7/156 (4%) Query: 8 VQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQ--QGDCPTGHA 65 V GDIT VDV+VNAANPSL+GGGGVDGAIHRAAGP+LL C +R+ G A Sbjct: 12 VLGDITAQDVDVVVNAANPSLLGGGGVDGAIHRAAGPSLLAECQDLRRTVLPRGLSVGDA 71 Query: 66 VITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGV 125 V T AG+LPA VVHTVGP G Q + LL + SL + SVAFPA+S GV Sbjct: 72 VATGAGNLPALWVVHTVGPNAHVG-QRDPALLASCFTRSLDVAGGLGARSVAFPAVSAGV 130 Query: 126 YGYPRAAAAEIAVKTVSEFIT----RHALPEQVYFV 157 +G+ A IAV +V ++ + E V FV Sbjct: 131 FGWDVDVVARIAVDSVDTWLDGADPAASALELVRFV 166 >UniRef50_C2KRZ5 Appr-1-p processing domain protein n=2 Tax=Mobiluncus mulieris RepID=C2KRZ5_9ACTO Length = 275 Score = 148 bits (373), Expect = 9e-35, Method: Composition-based stats. Identities = 76/143 (53%), Positives = 92/143 (64%), Gaps = 3/143 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVR--QQQGDCP 61 ++H + GDIT++ VD IVNAAN +L+GGGGVDGAIHRAAG LL AC +R + P Sbjct: 2 QLHAIGGDITRVHVDAIVNAANSTLLGGGGVDGAIHRAAGTELLAACRVIRATRYPDGLP 61 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G AV T LPAK V+HTVGP G Q + LL+ A++NSLR A SVAFPAI Sbjct: 62 VGQAVATKGFKLPAKWVIHTVGPNRHAG-QTDPGLLRAAFVNSLREAARVGAHSVAFPAI 120 Query: 122 STGVYGYPRAAAAEIAVKTVSEF 144 S GVYG+ A A I V V E+ Sbjct: 121 SGGVYGWDMAEVARIGVSAVHEW 143 >UniRef50_UPI000180B1B4 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2), partial n=1 Tax=Ciona intestinalis RepID=UPI000180B1B4 Length = 1271 Score = 148 bits (373), Expect = 9e-35, Method: Composition-based stats. 
Identities = 54/177 (30%), Positives = 87/177 (49%), Gaps = 6/177 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDAC-LKVRQQQGD 59 + +++GDIT++ D IVNA+N L + G+ G+I + GP + + G Sbjct: 520 NVEVKILRGDITEVNCDAIVNASNDKLELRDAGISGSIKKKCGPTVQAEMNQHIASVGGT 579 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNE--DQLLQDAYLNSLRLVAANSYTSVA 117 G AV T AG + + ++H VGPVW+G +E + L+ +L+ + TSVA Sbjct: 580 MLPGSAVSTSAGRMNCRRIIHVVGPVWKGDISDEVCEAYLKSCVSETLKEAERYNLTSVA 639 Query: 118 FPAISTGVYGYPRAAAAEIAVKT-VSEFITRHALPEQVYFVCYDEENA-HLYERLLT 172 PAIS GV+G + + V+T V F+ + +Q+YFV + R L Sbjct: 640 MPAISCGVFGGSVSVCPRLMVETLVDHFMKPSSCIKQIYFVENSNNEVIQSFSRSLQ 696 Score = 98.2 bits (243), Expect = 1e-19, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 63/179 (35%), Gaps = 15/179 (8%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + V QGDIT D I+ S G V AI + G ++ + Q Sbjct: 1015 SINVIVKQGDITIENSDAIICPTAQSYDLSGQVGQAILQRGGQSIQTELQQQLFTQK--- 1071 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 + +T AG L K V H V N+ +++ + + S++ PAI Sbjct: 1072 --NYSVTGAGQLACKHVFHIVT-------GNDGTQMENVLMEVFEQADSLRIHSLSIPAI 1122 Query: 122 STGVYGYPRAAAAEIAVKTVSEF---ITRHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 TG +AAA + + F + Q+ V + + +++ + Sbjct: 1123 GTGNSSLTSSAAARHINRAIHIFEGNVRTSPTLHQINIVVFQHQMMADFQQEFGVSASQ 1181 Score = 77.7 bits (190), Expect = 2e-13, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 46/118 (38%), Gaps = 4/118 (3%) Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSYTSVAFP 119 G V T L V H + ++ N + L A L+ Y ++AFP Sbjct: 885 QVGDVVHTSGYRLQCTEVYHVIVANYQPNSHNNSKHNLVKAIKTCLQNADQAGYATIAFP 944 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYF-VCYDEENAHLYERLLTQQGD 176 ++TG +GYP + A + + F +H +QV + + A +++ Q Sbjct: 945 TLATGGFGYPARSVARWMKRELDSFKPKH--LKQVVIAMLPTDSGAVVFQNEFNSQTQ 1000 Score = 45.0 bits (105), Expect = 0.001, Method: Composition-based stats. 
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%) Query: 4 RIHVVQGDITKLAVDVIV---NAANPSLMGGGGVDGAIHRAAGPALLDACLKV 53 + +VQGDI+ VDVIV + N + G + A+ R AGP L K+ Sbjct: 732 NVRLVQGDISTQNVDVIVTTGSPQNFAKGSGSAITQALIRIAGPQLQREMQKI 784 >UniRef50_D0WKT6 Appr-1-p processing enzyme family domain protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WKT6_9ACTO Length = 302 Score = 148 bits (373), Expect = 9e-35, Method: Composition-based stats. Identities = 59/185 (31%), Positives = 95/185 (51%), Gaps = 11/185 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGG-----GVDGAIHRAAGPALLDACLKVRQQQG 58 + + +GD+ +LA D +VNAA P+L+G +D I GP + + C +R+ QG Sbjct: 118 NVAMWRGDVRELAADAVVNAAMPNLLGCKDPLHPCIDNYIQGQGGPWIRNDCSVIREIQG 177 Query: 59 -DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGE--QNEDQLLQDAYLNSLRLVAANS-YT 114 D G AV+T LPA+ V+HT+GP GGE + + L Y + L L Sbjct: 178 KDQEVGDAVLTRGYRLPARYVLHTLGPHLNGGEITDEDREKLAACYTSCLDLALEKGDIH 237 Query: 115 SVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLT 172 +V+F A+STG +P A IA+ TV++++ H E V F +++ +A Y + L Sbjct: 238 NVSFCALSTGRNNFPFEEATHIALDTVNQWLQYHGTDVIELVVFNIFEDADAEGYMQALE 297 Query: 173 QQGDE 177 ++ Sbjct: 298 SWVED 302 >UniRef50_C3Y5X1 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y5X1_BRAFL Length = 592 Score = 147 bits (370), Expect = 2e-34, Method: Composition-based stats. 
Identities = 45/175 (25%), Positives = 75/175 (42%), Gaps = 10/175 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GDIT V IVN + +L GV AI AAGP++ C + + G Sbjct: 252 VQVQMGDITMEQVSAIVNPSQNNLDLDKGVSRAISMAAGPSVQKECRQYIRDNWYPKAGD 311 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 V T AG+LP +++H V P + L+ N L + + S+AFPA+ TG Sbjct: 312 VVATGAGNLPCASILHLVQPT--------AKYLRSDVKNCLLVAHQMNLRSLAFPAVGTG 363 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQGDE 177 + +A + ++EF+ + + V + E + + ++ E Sbjct: 364 RFHIKPERSARCMIDGIAEFVQDWSPTTLSIIRIVIFQENMLQAFHTAVHRKASE 418 >UniRef50_Q9NXN4 Ganglioside-induced differentiation-associated protein 2 n=36 Tax=Euteleostomi RepID=GDAP2_HUMAN Length = 497 Score = 147 bits (370), Expect = 2e-34, Method: Composition-based stats. Identities = 56/173 (32%), Positives = 86/173 (49%), Gaps = 7/173 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +GD+ L IVN +N SL V +I AGP L + K++ C Sbjct: 52 VNGKVVLWKGDVALLNCTAIVNTSNESLTDKNPVSESIFMLAGPDLKEDLQKLK----GC 107 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSYTSVAFP 119 TG A +T +L A+ ++HTVGP ++ + + L Y N L+L S +SV F Sbjct: 108 RTGEAKLTKGFNLAARFIIHTVGPKYKSRYRTAAESSLYSCYRNVLQLAKEQSMSSVGFC 167 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLL 171 I++ GYP A IA++TV F+ H E+V F D E Y++LL Sbjct: 168 VINSAKRGYPLEDATHIALRTVRRFLEIHGETIEKVVFAVSDLEE-GTYQKLL 219 >UniRef50_C5VD03 Appr-1-p processing enzyme family protein n=2 Tax=Corynebacterium matruchotii RepID=C5VD03_9CORY Length = 274 Score = 147 bits (370), Expect = 2e-34, Method: Composition-based stats. 
Identities = 71/181 (39%), Positives = 98/181 (54%), Gaps = 10/181 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR-Q 55 +RI + +GDIT+L VD IVNAAN L+G VD AIH AAG L AC + Sbjct: 86 DSRIRLWRGDITRLDVDGIVNAANNKLLGCFRPGHTCVDNAIHSAAGLQLRQACADLVPS 145 Query: 56 QQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL--LQDAYLNSLRLVAANSY 113 + PTG A IT +LPA+ V+HTVGP+ G E N Q+ L +Y++ L L ++ Sbjct: 146 PDYEEPTGSARITPGFNLPARYVLHTVGPIVAGREANRQQVAELSASYISCLNLAHSSGL 205 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ--VYFVCYDEENAHLYERLL 171 S+AF ISTGV+G+P + AA IAV F+ + F + + + Y +LL Sbjct: 206 ESLAFCCISTGVFGFPPSHAARIAVAAARAFLAGLPKDSDFTIIFTVFTQNDYDRYAQLL 265 Query: 172 T 172 Sbjct: 266 N 266 >UniRef50_B1L625 Appr-1-p processing domain protein n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L625_KORCO Length = 175 Score = 147 bits (370), Expect = 2e-34, Method: Composition-based stats. Identities = 52/170 (30%), Positives = 84/170 (49%), Gaps = 10/170 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ +V GDIT++ D IVN AN LM GGGV GAI R G + ++ + Sbjct: 3 IMPRLILVLGDITEVESDAIVNPANVFLMMGGGVAGAIKRKGGEEIEREAMR----KAPL 58 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G A+ T AG L AK V+H V G + + ++ A SL+ S+AFPA Sbjct: 59 KIGEAIETSAGKLKAKYVIHAPT-VESPGGSSSPEYIRAAVKASLKKGEELGIRSIAFPA 117 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 + GV G P + I ++ + + E+V V ++++ +++R+ Sbjct: 118 MGAGVGGVPVEESVRIILEEIKA-----SPIEEVLLVTRNKQDLEVFKRV 162 >UniRef50_Q0UG78 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0UG78_PHANO Length = 2240 Score = 145 bits (367), Expect = 5e-34, Method: Composition-based stats. 
Identities = 56/175 (32%), Positives = 83/175 (47%), Gaps = 10/175 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGG--VDGAIHRAAGPALLDACLKVRQQQGD 59 I D+TKL VD IVN+AN SL G ++ AIH+AAGP L + G Sbjct: 600 NRIISFCHHDLTKLKVDAIVNSANKSLKMTRGDTLNNAIHKAAGPGLSVEA----RLTGR 655 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAF 118 G A+IT +LP++ V+H + P + R E L D Y L++ N ++AF Sbjct: 656 LE-GQALITGGHNLPSEHVIHVLRPGYFRHKGMGEFNQLIDCYREVLKVAIENKIKTIAF 714 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 P + TG G+P AA I ++ + E++ H E++ F + Y L Sbjct: 715 PCLGTGGVGFPARVAARITLQEMREYLDAHPEHNLERIIFCVNTAADEKAYIDFL 769 Score = 132 bits (331), Expect = 7e-30, Method: Composition-based stats. Identities = 52/174 (29%), Positives = 85/174 (48%), Gaps = 11/174 (6%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 +I++V+ DITKL VDV+VN+ + S G G +D + + G + A G C Sbjct: 1019 NDKIYLVREDITKLEVDVMVNSTDVSFRGMGTLDRTVLQKGGEQMRAAVTAF----GQCK 1074 Query: 62 TGHAVITLAGDLPAKAVVHTVGP-VWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G T LPAK V+H + + GG + + L Y L+ + TS+A P+ Sbjct: 1075 IGEVRHTEGYMLPAKHVLHIIPADRYNGGTKIVLKKL---YREVLQEAVSMRATSIALPS 1131 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFI---TRHALPEQVYFVCYDEENAHLYERLL 171 I TG+ YPR A +A++ F+ R+ E++ FV + + +Y+ L+ Sbjct: 1132 IGTGMLNYPRRDVASVALEEAKRFLESAERNNPVEKIIFVVFSSNDEFVYKSLM 1185 >UniRef50_A6LTB5 Appr-1-p processing domain protein n=3 Tax=Clostridium RepID=A6LTB5_CLOB8 Length = 214 Score = 144 bits (364), Expect = 1e-33, Method: Composition-based stats. 
Identities = 66/218 (30%), Positives = 102/218 (46%), Gaps = 48/218 (22%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T ++ DITK+ D IVNAAN SL+GGGGVDGAIH+A G LLD C ++ C T Sbjct: 2 TNFKILFDDITKIKFDAIVNAANASLLGGGGVDGAIHKACGEKLLDECRQLN----GCLT 57 Query: 63 GHAVITLAGDL---PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV----------- 108 G + +T + +L V+HTVGP++R E++ L++AY + + Sbjct: 58 GRSKLTRSYNLSDHGVHWVIHTVGPIYRNNGS-EEKYLRNAYRSVFDIAANYSEFYSKQC 116 Query: 109 ----------------------------AANSYTSVAFPAISTGVYGYPRAAAAEIAVKT 140 + ++A P+ISTG Y YP A IA+ Sbjct: 117 NEILNKNLYRFNTDKQRDFILKELDDYINDHPIKTIALPSISTGAYSYPLNEACNIALDE 176 Query: 141 VSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQQGDE 177 + FI +++ VC DE+ ++Y+ L ++ + Sbjct: 177 ILSFINNSPDTFDEIAMVCLDEKTYNMYKSLYEERLQK 214 >UniRef50_C7HUZ2 RNase III regulator YmdB n=2 Tax=Anaerococcus RepID=C7HUZ2_9FIRM Length = 163 Score = 144 bits (364), Expect = 1e-33, Method: Composition-based stats. Identities = 64/164 (39%), Positives = 86/164 (52%), Gaps = 6/164 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V+ DI KL VD IVNAAN L+ GGG+ G I AG + K + G Sbjct: 2 TLKVIDIDILKLNVDAIVNAANVDLIEGGGICGQIFEKAG---REKLKKACLKLSPIKPG 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPAIS 122 AVIT +L K ++H VGPV+ + Q +LQDAY NSL++ S+AFP IS Sbjct: 59 EAVITDGFNLYQKYIIHAVGPVYNEMYKEACQKILQDAYKNSLKIAKKKGIKSIAFPLIS 118 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHL 166 +G+YGYP A IA T+ EF+ + +VY Y + L Sbjct: 119 SGIYGYPDKDAFMIAKNTIDEFLKNYE--MEVYLSTYGKNILSL 160 >UniRef50_Q2SM57 Predicted phosphatase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SM57_HAHCH Length = 180 Score = 144 bits (362), Expect = 2e-33, Method: Composition-based stats. 
Identities = 54/180 (30%), Positives = 85/180 (47%), Gaps = 13/180 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I + GDIT+L VD IV A+ L G G+ I AG L+A Q G C G Sbjct: 2 IEFLCGDITELEVDAIVCPAHKYLSKGRGLSAQIFEQAGEEALEAA---CSQAGGCKVGG 58 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQ---NEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A +T LPAK ++HTV P W GG+Q ++ LL + Y + +RL ++AFPA+ Sbjct: 59 ACLTPGFKLPAKHIIHTVTPQWTGGDQWGGSDLHLLANCYDSVVRLALEQGVKTIAFPAL 118 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAH----LYERLLTQQGDE 177 G P++ AA ++ + ++ E++ + E YE ++ ++ Sbjct: 119 GAGTNKTPQSMAAHEGLEVLVKYAD---SFERLIICLHWEAGLDTWRRTYEDFFARRVEQ 175 >UniRef50_A7C4X9 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C4X9_9GAMM Length = 220 Score = 144 bits (362), Expect = 2e-33, Method: Composition-based stats. Identities = 56/174 (32%), Positives = 87/174 (50%), Gaps = 5/174 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + + + VD IVN AN L GGG+ I AG L +AC K+ QQQG Sbjct: 13 NSVFIISDKSLLSAPVDTIVNPANSGLSHGGGLAEQILLEAGSKLEEACHKIIQQQGKIS 72 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 AV+T AG LP + V+H VGP G+ E ++ +N L++ + S+AFPAI Sbjct: 73 VTKAVVTTAGQLPYQGVIHAVGPRM--GDGKEQSKIETTIINCLQIAEKYQWKSIAFPAI 130 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQ 173 STG++ P+ A+ K +S + H + ++ E+ ++E++L Q Sbjct: 131 STGLFCVPKTVCAKAFDKAISYYWENHPNSAIKNIWLCLLT-EDYPIFEKILNQ 183 >UniRef50_C3YS04 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YS04_BRAFL Length = 178 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. 
Identities = 59/166 (35%), Positives = 78/166 (46%), Gaps = 7/166 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 +I +++GDIT VD IVNAAN SL GV GAI RA G A+ C + + G T Sbjct: 14 VQIDIIKGDITSQKVDTIVNAANSSLSLAVGVSGAISRAGGRAIQTECDNIIK-HGSLRT 72 Query: 63 GHAVITLAGDLPAKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVA-ANSYTSVAFPA 120 V T G L ++H VGP + G E Q L D L + A + S+A PA Sbjct: 73 TDCVWTTPGRLSCTYIIHAVGPNFVPGCESRCKQELYDTCQKVLNIAASRLNAKSIAMPA 132 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITR----HALPEQVYFVCYDEE 162 IS+G G PR AE + +F+ + + V YD E Sbjct: 133 ISSGASGMPRRLCAEAMCSAIMDFVENGQGIGSSLLDIRIVDYDRE 178 >UniRef50_Q94JV1 At1g69340/F10D13.28 n=23 Tax=Embryophyta RepID=Q94JV1_ARATH Length = 562 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 58/173 (33%), Positives = 87/173 (50%), Gaps = 7/173 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI++ +G+ L VD +VN+ N +L G +H AAGP L + C + G C Sbjct: 83 INSRIYLWRGEPWNLEVDAVVNSTNENLDEAHSSPG-LHVAAGPGLAEQCATL----GGC 137 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T A DLPA+ V+HTVGP + + L Y + L L+ + S+A Sbjct: 138 RTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYRSCLELLIDSGLQSIALG 197 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLL 171 I T YPR AA +A++TV F+ + V F + +Y+RLL Sbjct: 198 CIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKISAVVFCTTTSSDTEIYKRLL 250 >UniRef50_C3Y417 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3Y417_BRAFL Length = 1060 Score = 143 bits (360), Expect = 3e-33, Method: Composition-based stats. 
Identities = 55/180 (30%), Positives = 82/180 (45%), Gaps = 6/180 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + + QGD+T+ V IVNAAN L G+ AI AGP+L + C K + G Sbjct: 459 TVSMYQGDLTQEKVTAIVNAANGYLAHAAGIAAAIQEQAGPSLEEECRKYISKHGPLYET 518 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAAN-SYTSVAFPA 120 + T AG+LP V+H VGP WR + L+ +LN L T+VA PA Sbjct: 519 QVMHTSAGNLPCHYVIHAVGPKWRDYSNKTECASALRVTFLNCLDYANEKLHATTVALPA 578 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA---LPEQVYFVCYDEENAHLYERLLTQQGDE 177 ISTG++G P A+ V +F + +V V + + H+ ++ + Sbjct: 579 ISTGIFGVPNDVCAKAVYDAVRDFSKSQSQLGSLGEVRLVNAELDMVHVLRQMFEVSMSQ 638 >UniRef50_D0NR00 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NR00_PHYIN Length = 492 Score = 142 bits (359), Expect = 3e-33, Method: Composition-based stats. Identities = 51/173 (29%), Positives = 82/173 (47%), Gaps = 6/173 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + +G + L VD +VN+ S+ G + ++AGP + C G C Sbjct: 44 INAKLSLWRGPLYCLRVDAVVNSTCESMRQSDGDFDKLLKSAGPEIAVECKAA----GAC 99 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQL-LQDAYLNSLRLVAANSYTSVAFP 119 TG V+T LPAK ++HTVGP ++ N + L Y + L + N SVA Sbjct: 100 RTGDTVLTRGCKLPAKFILHTVGPRYQAKYHNAAEHSLHSCYRSVLAVTKENGLRSVATG 159 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLL 171 I T GYPR A IA +TV ++ + ++V ++ +YER+L Sbjct: 160 CIYTIRKGYPREEGAHIAARTVRRYLEHYGDDFDRVILCMDSVQDMDVYERVL 212 >UniRef50_B9L2D9 Appr-1-p processing enzyme family protein n=2 Tax=Thermomicrobia (class) RepID=B9L2D9_THERP Length = 176 Score = 142 bits (359), Expect = 4e-33, Method: Composition-based stats. 
Identities = 59/172 (34%), Positives = 76/172 (44%), Gaps = 8/172 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GDIT + + IVNAAN L G GV GAI RA G + + QG G Sbjct: 6 LEVQVGDITAVDTEAIVNAANSQLWMGSGVAGAIKRAGGEEIEREAVA----QGPISVGE 61 Query: 65 AVITLAGDLPAKAVVHTVGPVWR---GGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 AV+T AG LP AV+H + + + A +L A SVAFPA+ Sbjct: 62 AVVTTAGRLPFAAVIHAAAMGYDERGAMIPATSETVYAATRAALERCAERPLRSVAFPAL 121 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLLT 172 TGV G A V+ V + A E+V FV EE A + R + Sbjct: 122 GTGVGGLDLVTCAAAMVRAVRDHAASGAALPERVVFVVRSEEAADAFLRAIA 173 >UniRef50_D0MWM6 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MWM6_PHYIN Length = 579 Score = 142 bits (358), Expect = 6e-33, Method: Composition-based stats. Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 17/188 (9%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVR-QQQ 57 +I + +GDIT L IVNAAN +L+G +D IH AGP L AC ++ ++ Sbjct: 105 QIALWKGDITTLRATAIVNAANSALLGCFQPSHKCIDNVIHSMAGPRLRAACHEIMSRKA 164 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN---EDQLLQDAYLNSLRLVAAN--- 111 + P G+A IT LP+ V+HTVGP R GEQ E LQ Y SL L+ Sbjct: 165 HEEPGGNAQITQGFALPSSFVIHTVGPQLRHGEQPTAAECDQLQSCYTKSLDLLLKKVGD 224 Query: 112 --SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ---VYFVCYDEENAHL 166 + S+AF ISTG++ +P A +AV +V E++ +H + + F + + + L Sbjct: 225 TEQHVSIAFSCISTGLFAFPSDVAVPLAVNSVLEWLNQHQEETRGWKIIFNTFLKRDYDL 284 Query: 167 YERLLTQQ 174 Y+ + + Sbjct: 285 YKSFIESK 292 >UniRef50_UPI00005A247A PREDICTED: similar to H2A histone family, member Y isoform 3 n=1 Tax=Canis lupus familiaris RepID=UPI00005A247A Length = 412 Score = 142 bits (357), Expect = 6e-33, Method: Composition-based stats. 
Identities = 47/179 (26%), Positives = 91/179 (50%), Gaps = 7/179 (3%) Query: 1 MKTRIHVVQGDITKL---AVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ 57 + +++++ +I+ L V+ I+N N + + + + G ++A L++R++ Sbjct: 233 LGQKLNLIHSEISNLAGFEVEAIINPTNADIDLKDDLGNTLEKKGGKEFVEAVLELRKKN 292 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 G A ++ LPAK V+H PVW G ++LL+ N L L S+A Sbjct: 293 GPLEVAGAAVSAGHGLPAKFVIHCNSPVW--GADKCEELLEKTVKNCLALADDKKLKSIA 350 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHLYERLLTQQ 174 FP+I +G G+P+ AA++ +K +S + T + + VYFV +D E+ +Y + + + Sbjct: 351 FPSIGSGRNGFPKQTAAQLILKAISSYFVSTMSSSIKTVYFVLFDSESIGIYVQEMAKL 409 >UniRef50_A6SR30 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6SR30_BOTFB Length = 474 Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats. Identities = 56/175 (32%), Positives = 87/175 (49%), Gaps = 1/175 (0%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T + V+ GD+ K VDVIVNAAN L GGG+DGAIH AAGP L ++ Q G Sbjct: 17 LDTTVEVLIGDMLKYPVDVIVNAANVKLKKGGGIDGAIHAAAGPELQGEMNELFQHPGQV 76 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 + + + ++H VGP W EQ + + L A NSL L N S+AFP Sbjct: 77 GGAYGTTSSWDIQSCRYIIHAVGPNWNIPEQQDGKFLFTAIQNSLDLAMKNKLRSIAFPG 136 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQ 174 IS G++ P++ A + + + +I ++ +++ + + E L + Sbjct: 137 ISMGIFAMPKSLAGLVIISALRTWIIKYRGEMDRISILLLGYSEDEITETRLREI 191 >UniRef50_Q4RS18 Histone H2A (Fragment) n=2 Tax=Tetraodontidae RepID=Q4RS18_TETNG Length = 415 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. 
Identities = 52/188 (27%), Positives = 87/188 (46%), Gaps = 22/188 (11%) Query: 7 VVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH-- 64 VVQ DI+ + D +V+ + S GG V A+ + G +A +++++ G Sbjct: 227 VVQADISIVESDAVVHPTSSSFYTGGEVGTALEKKGGKEFTEALQELKKKNGPLEVAGGK 286 Query: 65 ----------------AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV 108 AV+T LPAK V+H P W G +++L N L L Sbjct: 287 CPDWKTGFLLLSQLLIAVLTAGFGLPAKYVIHCNSPGW--GSDKCEEMLDKTVKNCLALA 344 Query: 109 AANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCYDEENAHL 166 SVAFP+I +G G+P+ AA++ +K +S + T + + VYFV +D E+ + Sbjct: 345 DEKKLKSVAFPSIGSGRNGFPKQTAAQLILKAISSYFVATMSSTIKTVYFVLFDSESIGI 404 Query: 167 YERLLTQQ 174 Y + + + Sbjct: 405 YVQEMAKL 412 >UniRef50_C9YUB3 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YUB3_STRSW Length = 333 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. Identities = 69/177 (38%), Positives = 100/177 (56%), Gaps = 9/177 (5%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQQQG-D 59 + +GD+T LA D +VNAAN L+G +D A+H AAGP L D C + QG Sbjct: 153 TLWRGDLTTLAADAVVNAANSRLLGCFRPRHPCIDNALHNAAGPRLRDDCHTIVTAQGTR 212 Query: 60 CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAA-NSYTSVA 117 PTG A IT LPA+ V+HTVGP+ +G ++ Q L +Y + L L A S +VA Sbjct: 213 EPTGTAKITRGYHLPARHVLHTVGPLVQGRPHTDDAQALASSYRSCLDLAAQVESVRTVA 272 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFI-TRHALPEQVYFVCYDEENAHLYERLLTQ 173 F A+STGV+GYP+ AA +A++TV ++I R ++V + ++ Y L + Sbjct: 273 FCAVSTGVFGYPKDEAASVALRTVEDWITARPHRFDRVVLTVFTADDERAYRHALGE 329 >UniRef50_B7CC50 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CC50_9FIRM Length = 175 Score = 140 bits (352), Expect = 3e-32, Method: Composition-based stats. 
Identities = 59/176 (33%), Positives = 86/176 (48%), Gaps = 12/176 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I ++G+I L D+IV+ N ++ GV I AG ++ AC ++ G Sbjct: 2 ISTLKGNIALLDFDLIVDPTNKQVLPMQGVSAQIFHQAGSEMMKACQELN----GLEVGK 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYT------SVAF 118 A +T A +LP KAV+HT GP + G NED+ L Y NS+ L ++AF Sbjct: 58 AKMTKAFNLPCKAVIHTCGPRYMDGTHNEDEYLAACYWNSMALAYDYMRKNDMESINIAF 117 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPE--QVYFVCYDEENAHLYERLLT 172 P ISTG+ YP A IA++TV + + + V FVC E+ LY+ L Sbjct: 118 PCISTGINAYPNHEACVIAIQTVKRLMNKFPETKAIHVCFVCDKTEDYMLYKEALR 173 >UniRef50_B7PR73 Ganglioside induced differentiation associated protein, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PR73_IXOSC Length = 437 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 43/172 (25%), Positives = 79/172 (45%), Gaps = 4/172 (2%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ + GD+T L IVN+ N +L + I AG L L + C Sbjct: 54 VNRKVALWVGDLTSLNTHAIVNSTNENLTDKSPLSQRIVERAGEQLRRDMLNEIRT---C 110 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A ++ +LPA+ V+HTVGP + + + L +Y L+++ + ++ Sbjct: 111 RTGEAKLSKGYNLPARFVIHTVGPKYNAKFRTAAESALHSSYWRVLQMLPEHGLATLGLC 170 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 I + GYP A +A++T+ F+ H ++ + + +YE+LL Sbjct: 171 PIHSARRGYPLQDGAHLALRTLRRFLELHGDCVELVVLVMEGTELGMYEQLL 222 >UniRef50_C3YS03 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3YS03_BRAFL Length = 2671 Score = 139 bits (351), Expect = 4e-32, Method: Composition-based stats. 
Identities = 56/182 (30%), Positives = 84/182 (46%), Gaps = 12/182 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++ VVQG++T VDV+VN AN SL GGG+ AI +A G + C + G G Sbjct: 2202 KLVVVQGNLTSHRVDVMVNTANGSLSHGGGLAAAIVKAGGQEIQRDCTNYIKDNGKLTEG 2261 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + T LP K VVH VGP+W +++ ++ L+ A N+L S+A PAIS Sbjct: 2262 QVMSTKGYKLPCKMVVHAVGPLWIADQKDSKEKALKMAVENALLEARDYH--SIAIPAIS 2319 Query: 123 TG-------VYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQ 173 +G + GYP V V+ F + +V+F D + + L Sbjct: 2320 SGEELILLCISGYPIKPCVAAIVAAVTAFFNTNPDCALSEVHFAEMDPQKTDAFRDELLN 2379 Query: 174 QG 175 + Sbjct: 2380 RF 2381 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 54/172 (31%), Positives = 72/172 (41%), Gaps = 12/172 (6%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGG-GGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I + +G+IT DV+VN + L GGV A +A G L C G G Sbjct: 2428 IQLKKGNITAEKADVLVNTTSGDLDLSQGGVARAFGQAGGQELQQLCNN----HGKANAG 2483 Query: 64 HAVIT-LAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 VIT AG L K V H V P W Q DQ L+ + L YTS++FPA+ Sbjct: 2484 DIVITLRAGTLRCKQVYHAVLPNW----QESDQPLRTMVQDCLESADQGGYTSISFPAMG 2539 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLT 172 TG YPR AA + F + + V + +D+ +E L Sbjct: 2540 TGNLKYPRDVAASCMYDEILSFSQSNPGTTLQDVGIIVFDQPTVQAFETELR 2591 >UniRef50_UPI000180BD0B PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=2 Tax=Ciona intestinalis RepID=UPI000180BD0B Length = 1729 Score = 138 bits (349), Expect = 6e-32, Method: Composition-based stats. 
Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 5/158 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I+V++ DIT+ D I+NA+NP L + GG+ GAI + G + + V ++G G Sbjct: 905 INVLKTDITQHECDAILNASNPELDLLPGGISGAIQKTGGDKIQEEMHAVISKRGKLFPG 964 Query: 64 HAVITLAGDLP-AKAVVHTVGPVW-RGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A IT AG L + ++H VGP W + LQ +++ + S++ PAI Sbjct: 965 DAAITGAGKLKTCRFIIHAVGPRWAEHSHSTCCKYLQSCINYAMQEAESKRLRSISIPAI 1024 Query: 122 STGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFV 157 S GV+G + + V TV ++ R++ +V FV Sbjct: 1025 SCGVFGGVPSVCIPLIVDTVLDYFKQKRNSSITRVDFV 1062 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 40/170 (23%), Positives = 66/170 (38%), Gaps = 13/170 (7%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V QGD+T D IVN+ NP G + AI + G +L+ C QQG + Sbjct: 1122 ISVSQGDLTLDNSDAIVNSTNPQFDLTQGMISQAILKKGGRTVLNECKN---QQGQWNSP 1178 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +T G L + V H V P N + + L + ++A PA+ T Sbjct: 1179 RIRVTSGGKLQCRYVFHIVTP-------NNTKQITSVLLEVFTIADKLGLATLALPALGT 1231 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLL 171 G G A+ + E++ + + V +++ + + L Sbjct: 1232 GNLGIESLRIAQCIRGAIKEYVDSNTPANLNTIKVVIFEQSMVAEFRQGL 1281 >UniRef50_UPI00006A1CA6 poly (ADP-ribose) polymerase family, member 14 n=11 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1CA6 Length = 1527 Score = 138 bits (349), Expect = 6e-32, Method: Composition-based stats. 
Identities = 52/178 (29%), Positives = 89/178 (50%), Gaps = 4/178 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + I V + D+T+ VDV+VNAA L G+ A+ AAGP L C + +++G Sbjct: 523 RVTIAVYKDDLTRHRVDVVVNAAREDLKHTEGLALALLNAAGPKLQTECDHIIKREGKYS 582 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVAFPA 120 G +VIT AG+LP K V+HTV P W Q +LL+ L L A N +S+ PA Sbjct: 583 VGDSVITGAGNLPCKQVIHTVSPKWDPNSQTRCTRLLRRGISRCLELAAENGLSSIGIPA 642 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLLTQQG 175 + + + G+P + + V++V +++ R +++ V + + + + + Sbjct: 643 VGSQMSGFPVTVSVQNIVESVRQYVESPQRSRKVTRIHLVDSADGTVAAFAKAVRAEF 700 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 43/174 (24%), Positives = 77/174 (44%), Gaps = 4/174 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I ++QG+I DVIVN+ L + G V A++ AG L ++ + Sbjct: 735 VNIKIIQGNIQDATTDVIVNSVGKDLDLNTGAVSKALNAKAGTKLQQQLREMSRGT-QVE 793 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G +T L K V+H V P W G+++ +++L+ N L S+ FPAI Sbjct: 794 EGSVFVTNGFGLNCKKVIHVVTPGWDQGKRSAEKILRTIMTNCLSTTEKEKLRSITFPAI 853 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQG 175 TG G+P+ A + V + + ++V F+ + + + ++L + Sbjct: 854 GTGALGFPKDLVASLMFDEVLKSSCKGGQLQEVNFLLHPSDMNTI--KVLNMKF 905 Score = 100 bits (248), Expect = 3e-20, Method: Composition-based stats. 
Identities = 48/175 (27%), Positives = 81/175 (46%), Gaps = 14/175 (8%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V GDITK + DVIVN++N S GV AI AAG ++ D C + G Sbjct: 947 KYQVRTGDITKESTDVIVNSSNSSFTQKIGVSKAILEAAGKSIEDECATL----GAQANK 1002 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 ++T G+LP + ++H + ++ + L+ L+ TSVA PA+ T Sbjct: 1003 GYIVTQKGNLPCRHIIHVYTI-------STPDRIKASVLDVLQECENLKATSVALPAVGT 1055 Query: 124 GVYGYPRAAAAEIAVKTVSEF--ITRHALPEQVYFVCYDEENA-HLYERLLTQQG 175 G G AA A + V EF + + V + + ++ Y+ +++++G Sbjct: 1056 GAGGATSAAVAAAMLDAVEEFVTMKSPKSVQTVKVIVFQQKMLDDFYKSMMSKEG 1110 >UniRef50_UPI000196CD43 hypothetical protein CATMIT_02190 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CD43 Length = 239 Score = 138 bits (349), Expect = 6e-32, Method: Composition-based stats. Identities = 56/168 (33%), Positives = 85/168 (50%), Gaps = 6/168 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + +++ +I +A D IV AN +L G G AI AAG K ++ G C TG Sbjct: 2 KFKIIKANIVDVASDAIVLPANEALKEGSGTSKAIFTAAG---RKELTKACKELGHCSTG 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A+ TLA +L +K ++H V P W GE +E LL AYL SL + SVAFP +++ Sbjct: 59 SAIPTLAYNLSSKYIIHAVVPKWIDGEHSEYDLLSSAYLASLNIAEVMGCESVAFPLLAS 118 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 G G+ + A IA +++ F ++V+ V Y + Y + L Sbjct: 119 GNNGFDKQLAVRIAEESIKSF--EGVNLKKVFLVVYGD-TMETYMKSL 163 >UniRef50_Q55AK6 U box domain-containing protein n=2 Tax=Eukaryota RepID=Q55AK6_DICDI Length = 1618 Score = 138 bits (348), Expect = 6e-32, Method: Composition-based stats. 
Identities = 56/173 (32%), Positives = 87/173 (50%), Gaps = 3/173 (1%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I +++GDITK IVN AN L GG +I AAG + C ++ G TG Sbjct: 918 IRIIKGDITKQKTHAIVNPANEKLKNLGGAAFSIQEAAGATFKEFCESYYEKNGPIGTGC 977 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 +V + V++TVGP N+ ++L + +SLR A + S++ PAISTG Sbjct: 978 SVYGSKFKMGNIFVINTVGPK--NDNPNKARILHMSIHSSLRSATALNCQSISIPAISTG 1035 Query: 125 VYGYPRAAAAEIAVKTVSEF-ITRHALPEQVYFVCYDEENAHLYERLLTQQGD 176 ++GY A I +K+ EF +T +V FV ++ A+++E L + D Sbjct: 1036 IFGYDPKEAVPIIIKSAIEFLLTNETTLNEVNFVDLNQSTANIFENSLIKFSD 1088 >UniRef50_B0P6L4 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P6L4_9FIRM Length = 168 Score = 138 bits (348), Expect = 7e-32, Method: Composition-based stats. Identities = 59/169 (34%), Positives = 83/169 (49%), Gaps = 6/169 (3%) Query: 6 HVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAG-PALLDACLKVRQQQGDCPTGH 64 +V GDITK+ D IVNAA+ L G+ AI AA LL AC K+ G C G Sbjct: 2 RLVLGDITKMDTDAIVNAASSDLRPCPGICSAIFAAADTEKLLAACKKI----GRCRIGK 57 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AVIT + L K ++H G W G N+ L D Y ++L+ AA SVA P + +G Sbjct: 58 AVITPSFGLACKYIIHVAGVGWYSGRYNDRMLFADCYRSALQKAAAYHCKSVAIPLMFSG 117 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 + PRA A +I V F H ++ V Y + L +++++ Sbjct: 118 DFHIPRAQALQIVADVVGGFEKSHPSL-EISLVLYKQSIYDLAKKIISN 165 >UniRef50_UPI0000E8099B PREDICTED: similar to PARP9 protein n=2 Tax=Gallus gallus RepID=UPI0000E8099B Length = 796 Score = 138 bits (348), Expect = 8e-32, Method: Composition-based stats. 
Identities = 53/177 (29%), Positives = 92/177 (51%), Gaps = 4/177 (2%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V + D+T D +VNAAN SL G + A+ A GP + + ++ G PTG Sbjct: 78 LLVYKDDLTSHKADAVVNAANESLEHSGALALALLNAGGPEIAEESRNFIRKHGKVPTGK 137 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAA--NSYTSVAFPAI 121 +T G LP K ++H +GP+W E+ LL++A +N L+ + N+ SVA PA+ Sbjct: 138 IAVTGGGKLPCKKIIHAIGPIWYPSEKEKCCVLLEEAVVNVLKYASDPKNNIKSVAIPAV 197 Query: 122 STGVYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 S+GV+G+P A++ V ++ F+ + + ++++ V E+ +R E Sbjct: 198 SSGVFGFPVNLCAQVIVMSIKLFVETQPSCLKEIHLVNICEQTVAEIKRACEMILGE 254 Score = 95.9 bits (237), Expect = 5e-19, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 74/174 (42%), Gaps = 7/174 (4%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 R+ +++G + K+ IV++ + + A+ + AGP L L + Sbjct: 277 NIRLRIIKGYLEKIRTTAIVSSVSSDGEFCSQISTAMLQKAGPTLQAEILSQLKHLDSSK 336 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 ++T +LP+ V+H + P + +Q L++ L V S+AFP Sbjct: 337 --ELIVTSGYNLPSDFVLHVLWPCFNHVVLLCEQ-LKEIVNRCLYFVRNYPLPSIAFPEK 393 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPE-QVYFVCY--DEENAHLYERLLT 172 + + P A AEI ++ V +F ++ + V FV + D+ +++ + Sbjct: 394 NWSLK-LPVAIVAEIMIEEVLDFARKYPETKIDVQFVLHPDDDTTYQVFQEKMN 446 >UniRef50_B2VUH2 MACRO domain containing protein 1 n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2VUH2_PYRTR Length = 1599 Score = 138 bits (348), Expect = 8e-32, Method: Composition-based stats. 
Identities = 49/173 (28%), Positives = 86/173 (49%), Gaps = 9/173 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 + +V+ DI KL VD++VN+ + S +G G +D ++ + GP ++ ++ G C Sbjct: 955 NHMVCLVREDIMKLEVDIMVNSTDSSFLGMGVLDRSVFKKGGP----ELMEQIKKFGTCN 1010 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G +T LPAK ++H + P ++ +L++ Y L TS+A P+I Sbjct: 1011 EGDVKVTPGYLLPAKHILHAIPP--EQFSKSNKGILRNIYREILHTAVLLKATSIAIPSI 1068 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITR---HALPEQVYFVCYDEENAHLYERLL 171 TG YPR A +A++ V F+ + E++ FV Y + +Y+ LL Sbjct: 1069 GTGRLNYPRRDCASLAMEEVKRFLESADPNNTLEKIIFVVYSSNDEFVYKSLL 1121 Score = 135 bits (341), Expect = 4e-31, Method: Composition-based stats. Identities = 43/176 (24%), Positives = 78/176 (44%), Gaps = 10/176 (5%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGG---GGVDGAIHRAAGPALLDACLKVRQQQG 58 I + D+T+L VD IVN A L + AI +AAGP L + + + Sbjct: 537 NQLISFIHHDLTRLKVDAIVNNAPTDLSLSPANNTLHSAIFKAAGPGLTEEA----KLKA 592 Query: 59 DCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNED-QLLQDAYLNSLRLVAANSYTSVA 117 D G +T DLP+ ++H G + + + ++L Y ++L + + ++A Sbjct: 593 DIKVGQVGLTQGHDLPSSWIIHAAGLKYNWSKGYDQFKVLSSCYQSALEMATYHGIKTIA 652 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLL 171 FP + TG G+P AA IA++ + +++ H E++ + + Y Sbjct: 653 FPCLGTGGCGFPARVAARIALQEIRDYLDSHPKHGLERIVICVKTDFDKKAYMSFF 708 >UniRef50_A7EET2 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EET2_SCLS1 Length = 506 Score = 138 bits (348), Expect = 8e-32, Method: Composition-based stats. 
Identities = 61/158 (38%), Positives = 82/158 (51%), Gaps = 1/158 (0%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T + VV GD+ K VDVIVNAAN SL+ G G+DG IHR AGP L G Sbjct: 19 TTVEVVDGDLLKYPVDVIVNAANASLVRGDGIDGEIHRQAGPELAAEMKTQFPHPGKQGG 78 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + + ++H VGP WR Q LL +AY NSL L A N+ S+AFPAIS Sbjct: 79 AYGTTHSWDITSCQYIIHAVGPDWRQPNQRATGLLANAYHNSLSLAAKNNLRSIAFPAIS 138 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCY 159 G++ PR A +KT+ +I H +++ + + Sbjct: 139 VGIFQMPRGMAGVTVMKTIRSWIDSHQGEMDRIGILLF 176 >UniRef50_O07733 UPF0189 protein Rv1899c/MT1950 n=16 Tax=Mycobacterium RepID=Y1899_MYCTU Length = 359 Score = 138 bits (347), Expect = 1e-31, Method: Composition-based stats. Identities = 53/169 (31%), Positives = 74/169 (43%), Gaps = 8/169 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 + V Q D+TKL +D I NAAN L GGV AI RA GP L + + G Sbjct: 191 ELEVHQADVTKLELDAITNAANTRLRHAGGVAAAIARAGGPELQRESTE----KAPIGLG 246 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 AV T AGD+PA+ V+H G +++ A +LR S+A A T Sbjct: 247 EAVETTAGDMPARYVIHAATM--ELGGPTSGEIITAATAATLRKADELGCRSLALVAFGT 304 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 GV G+P AA + V V R ++V F + + + + Sbjct: 305 GVGGFPLDDAARLMVGAVRRH--RPGSLQRVVFAVHGDAAERAFSAAIQ 351 >UniRef50_C3ZVW0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZVW0_BRAFL Length = 731 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. 
Identities = 47/178 (26%), Positives = 71/178 (39%), Gaps = 3/178 (1%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + T+I + QGD+ K VDVIVN N L G + A+ + G + C + G Sbjct: 538 LDTKISIYQGDVIKECVDVIVNETNDRLKLSGELSWALAQYGGHDIEADCRRYVATHGRL 597 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLV-AANSYTSVAF 118 V T AG LP+K ++H V P W E + LL Y N + TS+A Sbjct: 598 AATQVVPTSAGQLPSKHILHAVVPHWVSAHPRESKMLLYKTYENIFKCAGIKMRVTSIAL 657 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRH-ALPEQVYFVCYDEENAHLYERLLTQQG 175 ++G G P+ AE + V F+ + L + V + + Sbjct: 658 SLQTSGSTGIPKDVYAETMFQAVVSFLKTYGPLLRDIRMVNPSHRTVSTFIDAFKTKM 715 >UniRef50_C8WJT1 Appr-1-p processing domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WJT1_EGGLE Length = 255 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. Identities = 64/151 (42%), Positives = 83/151 (54%), Gaps = 8/151 (5%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGG-----GGVDGAIHRAAGPALLDACLKVRQ 55 + R+ + +GDIT LAVD IVNAAN L+G +D AIH AG L C ++ + Sbjct: 82 VDGRLALWRGDITTLAVDAIVNAANSKLLGCFIPGHHCIDNAIHTFAGMQLRLVCDELMR 141 Query: 56 QQGD-CPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN--EDQLLQDAYLNSLRLVAANS 112 QG P G A +T A +LP++ VVHTVGP GE +++ L Y SL AA Sbjct: 142 AQGHDEPVGRAQVTSAFNLPSRFVVHTVGPQVPTGEPTAAQEEQLASCYRASLDAAAAAG 201 Query: 113 YTSVAFPAISTGVYGYPRAAAAEIAVKTVSE 143 S+AF ISTG + +PR AA IAV V Sbjct: 202 VASLAFCCISTGEFRFPRERAARIAVGEVRA 232 >UniRef50_Q7JUR6 Protein GDAP2 homolog n=19 Tax=Neoptera RepID=GDAP2_DROME Length = 540 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. 
Identities = 43/159 (27%), Positives = 69/159 (43%), Gaps = 5/159 (3%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R + GD+T L VD I N ++ +L + I AG L + ++ C Sbjct: 65 VNNRFVIWDGDMTTLEVDAITNTSDETLTESNSISERIFAVAGNQLREELSTTVKE---C 121 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG IT +LPAK V+HTV P +R + + L Y N L + ++A Sbjct: 122 RTGDVRITRGYNLPAKYVLHTVAPAYREKFKTAAENTLHCCYRNVLCKAKELNLHTIALC 181 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVC 158 IS +P AA IA++T+ ++ + + V Sbjct: 182 NISAHQKSFPADVAAHIALRTIRRYLDKC-TLQVVILCV 219 >UniRef50_UPI0001C38755 appr-1-p processing domain-containing protein n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38755 Length = 575 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. Identities = 56/147 (38%), Positives = 75/147 (51%), Gaps = 16/147 (10%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGG------------GVDGAIHRAAGPALLD 48 + RI V+QGDIT+ VD IV + NP L+ VD IH++AG L Sbjct: 433 LSDRITVIQGDITQQPVDAIVCSTNPHLLPNKKWGSFFMSSDHPEVDIMIHKSAGVELKQ 492 Query: 49 ACLKVRQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLV 108 C K+ C G A IT +LPA+ V+HTV P W+ GE ++LL Y N L LV Sbjct: 493 ECQKLN----GCKVGEAKITPGYNLPAEWVIHTVSPTWQNGEVQAEKLLAKCYQNCLNLV 548 Query: 109 AANSYTSVAFPAISTGVYGYPRAAAAE 135 + S+AFPA+ TG + AA+ Sbjct: 549 NSQEIESIAFPALGTGTGKFTLEKAAK 575 >UniRef50_A2BJA7 A1pp, Appr-1-p processing enzyme n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BJA7_HYPBU Length = 199 Score = 136 bits (343), Expect = 3e-31, Method: Composition-based stats. 
Identities = 45/173 (26%), Positives = 82/173 (47%), Gaps = 8/173 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + + +GDIT+ + +VN AN ++ GGGV GA+ RAAGP + + +++ P G Sbjct: 16 VEIARGDITEAECEAVVNPANSLMIMGGGVAGALRRAAGPEVEEEA----RRKAPVPVGE 71 Query: 65 AVITLAGDL--PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 A+ T AG L K ++H + ++++ A L +LR + +A PA+ Sbjct: 72 AIHTGAGRLEPRIKYIIHAPTMERPAMRTTQGKVVK-AVLAALREAEKLNVGCLALPAMG 130 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALP-EQVYFVCYDEENAHLYERLLTQQ 174 GV G + E ++ + EF+ ++ V Y E +A + + + Sbjct: 131 AGVGGLTARESLEAIMEALDEFLGSGGKLPPRIILVAYSERDAKQFLDEIKRV 183 >UniRef50_D1B7G8 Appr-1-p processing domain protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B7G8_THEAS Length = 179 Score = 135 bits (340), Expect = 6e-31, Method: Composition-based stats. Identities = 53/173 (30%), Positives = 78/173 (45%), Gaps = 7/173 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + +GDI D IVNAAN L G GV GAI R+AG + +G G Sbjct: 12 VIFREGDICSYRGDAIVNAANDRLWMGSGVAGAIRRSAGEEVEAEA----ISKGPIRVGS 67 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AV T AG LP KAV+H + + ++ + +LRL A +AFPA+ TG Sbjct: 68 AVATGAGRLPLKAVIHCAVM--GQDLKTSREAIRSSTGEALRLAAEMELRRIAFPALGTG 125 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLTQQGD 176 V G+P + + + EF+ ++V F + E + R T+ + Sbjct: 126 VGGFPVEECGHVMGEELKEFLLICPDGLDEVAFYLFGAEAFRQFVRGATRALE 178 >UniRef50_Q54PT1 Protein GDAP2 homolog n=1 Tax=Dictyostelium discoideum RepID=GDAP2_DICDI Length = 568 Score = 134 bits (338), Expect = 9e-31, Method: Composition-based stats. 
Identities = 44/174 (25%), Positives = 80/174 (45%), Gaps = 7/174 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +RI + GDI L D IV + + +L + I + G +++ Q+ G+C Sbjct: 55 INSRICLWMGDICNLNTDTIVYSNSKTLTESDTISDKIFKYGGSEMMND----IQKNGEC 110 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGE-QNEDQLLQDAYLNSLRLVAANSYTSVAFP 119 G ++IT G+LP++ VVHTV P + + L Y ++ L S++F Sbjct: 111 RYGESIITSGGNLPSRFVVHTVCPTYNPKYLSAAENALNSCYRSAFHLSMDVKSKSISFS 170 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLL 171 + + +P IA++T+ F+ + E+V E+ LYE++L Sbjct: 171 TLHSEKRQFPSVGGCHIALRTIRRFLEKPFSKSFEKVILAINTFEDLRLYEQML 224 >UniRef50_A7T7L3 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7T7L3_NEMVE Length = 177 Score = 133 bits (335), Expect = 2e-30, Method: Composition-based stats. Identities = 50/177 (28%), Positives = 82/177 (46%), Gaps = 21/177 (11%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHR---AAGPALLDACLKVRQQQ 57 + ++ + GDIT L +D IVNA N ++ G+D + +G + C Sbjct: 12 LNDKVSLWTGDITALEIDAIVNAGNTIMLMFIGIDVDSYPNKVYSGRGIFK-CFFFNLS- 69 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 V+ V+HT GP+ LQD Y N L+L + ++A Sbjct: 70 --------VLLKGSPYFGLDVIHTAGPM-----GKNRIKLQDCYKNCLQLAKQHGVKTLA 116 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFIT---RHALPEQVYFVCYDEENAHLYERLL 171 F ISTG+YGYP AA +A++TV +++ + E++ F + ++ +YERLL Sbjct: 117 FCCISTGIYGYPNKDAAHVALETVRQWLETDDNNDSVERIIFCTFLPKDTEIYERLL 173 >UniRef50_Q4RG95 Chromosome 12 SCAF15104, whole genome shotgun sequence n=10 Tax=Clupeocephala RepID=Q4RG95_TETNG Length = 1433 Score = 132 bits (331), Expect = 8e-30, Method: Composition-based stats. 
Identities = 54/175 (30%), Positives = 82/175 (46%), Gaps = 9/175 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V+ GDIT+ DVI+N++N GV AI AG A+ C + + QG P G Sbjct: 939 TFEVLSGDITRETCDVIINSSNRDFTLKSGVSKAILDGAGWAVQVECAQQARAQGH-PPG 997 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 H ++T AG LP+KA+VH N ++ +L+L ++ S AFPA+ T Sbjct: 998 HMIVTSAGRLPSKAIVHV-------SISNNPADIKSTVYAALKLCEEKTFRSAAFPALGT 1050 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQV-YFVCYDEENAHLYERLLTQQGDE 177 GV G P AA A+ V V++F + + V + E + + + E Sbjct: 1051 GVGGVPPAAVADAMVGAVADFAKKQPKSIHLAKIVIFQPEMLTHFHNSMMKMQGE 1105 Score = 115 bits (289), Expect = 5e-25, Method: Composition-based stats. Identities = 56/174 (32%), Positives = 78/174 (44%), Gaps = 4/174 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++ V Q D+ L VD +VN AN +L GG+ A+ AAGP L + G Sbjct: 500 VQLSVSQADLCALQVDAVVNPANENLQHTGGLALALLEAAGPELQNTSNLYVAVNGALCA 559 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFPAI 121 G + T A LP K V+H VGP + + E LL+ SLR TSVA PAI Sbjct: 560 GQVIATDACRLPCKHVIHAVGPRFSDHSREESVLLLRRVVTQSLREAERLGCTSVAVPAI 619 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRH---ALPEQVYFVCYDEENAHLYERLLT 172 S+GV+G+P + A+ + V E +V V E+ A + Sbjct: 620 SSGVFGFPLSLCADTIAQAVWEHCGAAGGRGALREVQLVANTEQTAGALATAVQ 673 Score = 111 bits (278), Expect = 1e-23, Method: Composition-based stats. 
Identities = 42/161 (26%), Positives = 70/161 (43%), Gaps = 6/161 (3%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQ--QQGDC 60 R+ + +G+I VIVN + ++ G V A+ RAAG L A LK + + Sbjct: 733 RVVLCKGNIEDQRSCVIVNTISETMNLDQGAVSRALLRAAGKGLQAAVLKEARLARLDQL 792 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G ++T L + V H V P W Q ++ L L+ S++FPA Sbjct: 793 DPGSLLVTDGFKLRCQKVFHAVCPQWSASYQ-AEKTLTSIISRCLKEAERLKMRSLSFPA 851 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCY 159 I TG+ +P+ A + ++ V F + +V+ V + Sbjct: 852 IGTGLLSFPKDLVARVLLEEVRTFSRKKTPQHLLKVFVVVH 892 >UniRef50_B5Y5Y4 Appr-1-p processing enzyme family protein n=2 Tax=Firmicutes RepID=B5Y5Y4_COPPD Length = 172 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 51/165 (30%), Positives = 76/165 (46%), Gaps = 4/165 (2%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 ++ +V GDITK DVIVNAAN GGGV AI +A G + D ++V Q Sbjct: 7 DKVKLVMGDITKAEADVIVNAANGIGPMGGGVALAIKKAGGKVIEDEAIRVCSQLDP-RP 65 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G +T AG L AK + H V R E + ++++ + L S+ PA++ Sbjct: 66 GDVYVTTAGGLKAKYIFHAVTMK-RPAEPSSVEIVRKCLQSLLEKAREMKVKSMVLPALA 124 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQV--YFVCYDEENAH 165 TGV G P+ A++ + + + V F+ Y EE Sbjct: 125 TGVGGVPKKDVAKVYKEVLGDVKDIDITVMDVSGEFIKYLEEELK 169 >UniRef50_A3DLM0 Appr-1-p processing domain protein n=1 Tax=Staphylothermus marinus F1 RepID=A3DLM0_STAMF Length = 192 Score = 131 bits (330), Expect = 9e-30, Method: Composition-based stats. 
Identities = 49/174 (28%), Positives = 78/174 (44%), Gaps = 6/174 (3%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 I V+GDIT+L V+ IVN AN ++ GGG+ G + R G + + K P G Sbjct: 17 IKGVKGDITELDVEAIVNPANSFMLMGGGLAGVLKRKGGEIIENEAKKFA----PVPVGK 72 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AV+T+AG L AK ++H N + A +L S +A P + TG Sbjct: 73 AVVTIAGVLKAKYIIHAPTMEKPAMRINPENA-YKATFAALTKAFDLSLNRIAVPGMGTG 131 Query: 125 VYGYPRAAAAEIAVKTVSEFIT-RHALPEQVYFVCYDEENAHLYERLLTQQGDE 177 V G + A + K + EF+ + +++ V + E + + L + E Sbjct: 132 VGGLSPSDAGKAMAKAIKEFLDIIPSGIKEILVVDLNPEIPRMVCKALQEMIRE 185 >UniRef50_O28751 UPF0189 protein AF_1521 n=32 Tax=Euryarchaeota RepID=Y1521_ARCFU Length = 192 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 60/180 (33%), Positives = 87/180 (48%), Gaps = 12/180 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRA-AGPALL--DACLKVRQQQ--- 57 + + QGDIT+ IVNAAN L GGGV AI +A AG A L + K ++Q Sbjct: 13 TLKLAQGDITQYPAKAIVNAANKRLEHGGGVAYAIAKACAGDAGLYTEISKKAMREQFGR 72 Query: 58 GDCPTGHAVITLAGDL---PAKAVVHTVGPVWRGGEQNE-DQLLQDAYLNSLRLVAANSY 113 G V+T A +L K V HTVGP+ G E + L A+L L Sbjct: 73 DYIDHGEVVVTPAMNLEERGIKYVFHTVGPICSGMWSEELKEKLYKAFLGPLEKAEEMGV 132 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 S+AFPA+S G+YG E ++ V F + + ++V V YD ++A + ++ + Sbjct: 133 ESIAFPAVSAGIYGCDLEKVVETFLEAVKNF--KGSAVKEVALVIYDRKSAEVALKVFER 190 >UniRef50_Q9P0M6 Core histone macro-H2A.2 n=118 Tax=Eukaryota RepID=H2AW_HUMAN Length = 372 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. 
Identities = 42/179 (23%), Positives = 82/179 (45%), Gaps = 7/179 (3%) Query: 1 MKTRIHVVQGDIT---KLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQ 57 + ++ + Q DI+ + V+ IV+ + + A+ +A G L+ ++R+ Q Sbjct: 193 LGQKLSLTQSDISHIGSMRVEGIVHPTTAEIDLKEDIGKALEKAGGKEFLETVKELRKSQ 252 Query: 58 GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 G A ++ + L AK V+H P W G ++ L++ N L SVA Sbjct: 253 GPLEVAEAAVSQSSGLAAKFVIHCHIPQW--GSDKCEEQLEETIKNCLSAAEDKKLKSVA 310 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLLTQQ 174 FP +G +P+ AA++ +K +S + + VYF+ +D E+ +Y + + + Sbjct: 311 FPPFPSGRNCFPKQTAAQVTLKAISAHFDDSSASSLKNVYFLLFDSESIGIYVQEMAKL 369 >UniRef50_UPI0000E4815A PREDICTED: similar to LRP16 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4815A Length = 415 Score = 129 bits (324), Expect = 5e-29, Method: Composition-based stats. Identities = 54/113 (47%), Positives = 68/113 (60%), Gaps = 5/113 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + R+ V QGDITKL VD IVNAAN SL+GGGGVDGAIHRAAG LL C K+ C Sbjct: 159 LNNRVSVWQGDITKLDVDCIVNAANRSLLGGGGVDGAIHRAAGSNLLQECKKLA----GC 214 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPV-WRGGEQNEDQLLQDAYLNSLRLVAANS 112 TG A +T LP++ V+HTVGP+ + N + L Y L + ++ Sbjct: 215 ETGDAKLTAGYLLPSRYVLHTVGPMVYGQPMTNHREDLTSCYATCLHQILEHN 267 >UniRef50_B0QWK9 Putative uncharacterized protein n=1 Tax=Haemophilus parasuis 29755 RepID=B0QWK9_HAEPR Length = 156 Score = 129 bits (324), Expect = 5e-29, Method: Composition-based stats. 
Identities = 49/133 (36%), Positives = 72/133 (54%), Gaps = 3/133 (2%) Query: 44 PALLDACLKVRQQQGDC-PTGHAVITLAGDLPAKAVVHTVGPVWRGG-EQNEDQLLQDAY 101 L AC ++ ++QG PTG A IT A +LP+ V+HTVGP+ G + +LL Y Sbjct: 1 MQLRLACAELMEKQGHLGPTGQAKITPAFNLPSAYVLHTVGPIISGALSAKDCELLASCY 60 Query: 102 LNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE 161 + L L + SVAF ISTG + +P AAEIAV+TV F+ + + V F + + Sbjct: 61 RSCLELAKQHGIESVAFCCISTGEFRFPNQEAAEIAVQTVKAFLADNPQMK-VVFNVFKD 119 Query: 162 ENAHLYERLLTQQ 174 + +Y LL + Sbjct: 120 VDLEIYRGLLGEL 132 >UniRef50_D2MH71 Metallo-beta-lactamase family protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MH71_9BACT Length = 434 Score = 128 bits (323), Expect = 6e-29, Method: Composition-based stats. Identities = 42/158 (26%), Positives = 70/158 (44%), Gaps = 7/158 (4%) Query: 15 LAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLP 74 + VIVNAAN + GGGV G I RAAG + + ++Q P G AV+T G Sbjct: 1 MEAQVIVNAANSHGLMGGGVAGIIRRAAGSQVEEEA----RRQAPIPVGQAVLTSGGRTR 56 Query: 75 AKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAA 134 ++H + ++ A +L+L + + ++A P + TGV AA Sbjct: 57 FAGIIHAPTMPEPAMRIPVEN-VRLATRAALQLADEHGFVTLAIPGMGTGVGRVRPEDAA 115 Query: 135 EIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 + V+ + +F + + V V D E ++ L+ Sbjct: 116 QGMVEEIRQF--QPRSLQSVMLVDIDPEMVRAWQAALS 151 >UniRef50_B7P925 Histone H2A n=1 Tax=Ixodes scapularis RepID=B7P925_IXOSC Length = 366 Score = 128 bits (322), Expect = 8e-29, Method: Composition-based stats. 
Identities = 41/176 (23%), Positives = 72/176 (40%), Gaps = 8/176 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + ++ V+QGD+ + D ++ N SL G V + +A G + + G Sbjct: 195 LSVQLTVIQGDMASVTADAAIHPTNASLSLSGEVGQVLEKAGGKEFVQEVKDLFSAHGPL 254 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 + AVI PAK V+H P + L+ N L L + +A P Sbjct: 255 ESAGAVICPGHQFPAKFVIHCNVP------SGSSEPLEKCVRNCLALADEKNIRVLAVPP 308 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLTQQ 174 ++T + AA+ +K +S + + +Q+YFV D E+ +Y L + Sbjct: 309 LATHSVASQKQQAAQTILKAISNYFVNVMSSSLKQIYFVLSDMESIGIYTSELAKL 364 >UniRef50_Q5V4P3 Putative uncharacterized protein n=1 Tax=Haloarcula marismortui RepID=Q5V4P3_HALMA Length = 166 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 52/167 (31%), Positives = 78/167 (46%), Gaps = 8/167 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V+QGDI + D +VNAAN SL G GV GA+ RAAG L D + +G G Sbjct: 2 EFEVIQGDIAAQSADALVNAANTSLRMGSGVAGALKRAAGSGLNDEAVA----KGPVDLG 57 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 T A DL A+ V+H G Q+ + +++A N+L A + SV FPAI Sbjct: 58 GVATTDAYDLDAEYVIHAAAM--PPGGQSTAESIRNATRNALAEADALNCESVVFPAIGC 115 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 G+ G+ I + E+ + V + Y +++ +R+ Sbjct: 116 GIAGFDFEEGIRIICAVIEEY--QPESLTDVRLIAYSDDDFEGMQRV 160 >UniRef50_A0CX06 Chromosome undetermined scaffold_3, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CX06_PARTE Length = 1064 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. 
Identities = 44/181 (24%), Positives = 85/181 (46%), Gaps = 11/181 (6%) Query: 1 MKTRIHVVQGDITKLA-VDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKV---RQQ 56 ++ I + DIT++ VD IVN A+P+L GG+ GA+ RAAG LL+ + + + Sbjct: 700 LEQSIIIHNQDITQIKGVDAIVNVADPNLKNRGGICGAVFRAAGENLLEEEINMLFNKLG 759 Query: 57 QGDCPTGHAVITLAGDLP----AKAVVHTVGPVWRGGEQN-EDQLLQDAYLNSLRLVAAN 111 + T ++T + L K ++H VGP + + + L +N L+ Sbjct: 760 RKQPETSEVIVTKSYRLGQENGPKYIIHAVGPKYNPQDPQKSKEQLNTCIVNILQKCQEY 819 Query: 112 SYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLL 171 TSVA P IS + +P+ A+I + +F ++ ++ + ++ +++ + Sbjct: 820 KITSVAIPPISEKNFDFPKQICAQIFHAALLQFQFQNP--MSIHIIDVRDKVVDIFKIIF 877 Query: 172 T 172 Sbjct: 878 K 878 Score = 63.1 bits (152), Expect = 4e-09, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 56/172 (32%), Gaps = 9/172 (5%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + ++QGDI + D I ++ +L+ G+ +H+ G + + ++ Sbjct: 2 LSLIQGDIIQQKADAIALPSDIALLKAPGL-KQLHQN-GQQQYNDSITNQKPFSQIQQTS 59 Query: 65 AVITLAGDLPAKAVVHTVGPV--WRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + + K +++ + P E LL+ + N + S+ P + Sbjct: 60 VITLPLQNNQFKYIIYCIVPKSDLNNQELQLSLLLELLFENIFDEITFLKLQSILIPVLG 119 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHALPEQV--YFVCYDEENAHLYERLLT 172 + + + + V FV ++ R L+ Sbjct: 120 CDNADFTIQEFLMAFQSI---YAKQRDNLKDVNLIFVSQSQQEYEPVRRFLS 168 >UniRef50_Q9YBE9 UPF0189 protein APE_1648.1 n=1 Tax=Aeropyrum pernix RepID=Y1648_AERPE Length = 189 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. 
Identities = 53/169 (31%), Positives = 77/169 (45%), Gaps = 7/169 (4%) Query: 5 IHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGH 64 + V GD+TK+ + +VN AN ++ GGG GA+ RA G + + ++ + P G Sbjct: 11 LAVSMGDLTKVRAEAVVNPANSLMIMGGGAAGALKRAGGSVIEEEAMR----KAPVPVGE 66 Query: 65 AVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTG 124 AVIT G LPA+ V+H G + +Y +LRL + SVA PA+ G Sbjct: 67 AVITSGGSLPARFVIHAPTMEEPGMRIPLVNAFKASY-AALRLASEAGIESVAMPAMGAG 125 Query: 125 VYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQ 173 V G A A A S I R P + V EE E+ + + Sbjct: 126 VGGLSVAEVAREAAMAAS--ILRGKWPRYIILVARGEEAYRGMEKGVRE 172 >UniRef50_UPI000180B63C PREDICTED: similar to Ci-Rhysin2/Deltex3-a n=1 Tax=Ciona intestinalis RepID=UPI000180B63C Length = 897 Score = 126 bits (316), Expect = 4e-28, Method: Composition-based stats. Identities = 51/177 (28%), Positives = 78/177 (44%), Gaps = 6/177 (3%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD--C 60 R+ V G++ DVIVNAAN L G GV GAI + G A C + + Sbjct: 562 VRVSVGMGNVAIQDTDVIVNAANNRLENGVGVTGAIFKQGGHAFQIECQNAMRARRGQLL 621 Query: 61 PTGHAVITLA-GDLPAKAVVHTVGPVWRG--GEQNEDQLLQDAYLNSLRLVAANSYTSVA 117 G AV+ A G+L + V+H VGP W + +L ++ L ++ + ++A Sbjct: 622 AVGEAVMVNATGNLKCRKVIHLVGPQWHSYIDKNKCCSVLIQGIMSVLVEASSVNAKTIA 681 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHA-LPEQVYFVCYDEENAHLYERLLTQ 173 P +STGVYG P A E+ K + R+ +Q+ + DE + Sbjct: 682 IPPVSTGVYGVPVAVFVEMVKKCLGILKQRNDITLKQIRILSIDEPTVRQLVEGFKK 738 >UniRef50_B1H1M8 LOC100148704 protein (Fragment) n=5 Tax=Danio rerio RepID=B1H1M8_DANRE Length = 858 Score = 126 bits (316), Expect = 4e-28, Method: Composition-based stats. 
Identities = 49/171 (28%), Positives = 82/171 (47%), Gaps = 13/171 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 I V GDITK+ V+ +VN+ N SL GV GAI +A+GP ++ C + + P Sbjct: 280 TIRVSSGDITKVKVEAVVNSTNTSLNLSSGVSGAILKASGPTVVKEC----KAKAPQPED 335 Query: 64 HAVITLAGDL-PAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 V+T AG+L +VH VG Q ++ + L+ N SV+FPA+ Sbjct: 336 GVVLTRAGNLTNCTHIVHMVG-------QTSRTGIRSSMAKVLKTCEENHIRSVSFPALG 388 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHAL-PEQVYFVCYDEENAHLYERLLT 172 TG P AA A+ +++F+ ++V+ V + + +++ + Sbjct: 389 TGAGHLPAAAVADAMTTALADFVKDSPKHLKRVHIVIFQPKLLPDFQKAVR 439 Score = 117 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 46/173 (26%), Positives = 71/173 (41%), Gaps = 13/173 (7%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 T I V +G IT +V IVN N + GGV GAI +AAG ++ C ++ G Sbjct: 78 NTTIEVRKGSITTESVRGIVNTTNRDMSRRGGVSGAIFKAAGASVEQEC----RKHGPLQ 133 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 A +T AG L ++H +GP + L T+V+FPAI Sbjct: 134 GDDAAVTAAGLLHCDLILHMLGPHSAAESRTR-------VRRVLERCEEKQITTVSFPAI 186 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLT 172 TG G AA ++ ++ +T+ + +Y V + + L Sbjct: 187 GTGGGGVQAVDAATAMLQGFADHLTKSTSSVVKLIYIVIDRDNILQEFLNGLK 239 Score = 48.5 bits (114), Expect = 9e-05, Method: Composition-based stats. Identities = 8/63 (12%), Positives = 22/63 (34%), Gaps = 2/63 (3%) Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFIT--RHALPEQVYFVCYDEENAHLYERLLTQ 173 +A PAI TG G+ + + + +T + ++ + ++ + + Sbjct: 1 LAIPAIGTGRGGFSPRDSMRAMLTALQTHLTEPNSSTLSRITVLALQQDTFQAFRHCFKE 60 Query: 174 QGD 176 Sbjct: 61 WNQ 63 >UniRef50_C1XFR0 Predicted phosphatase similar to C-terminal domain of histone macro H2A1 n=2 Tax=Meiothermus RepID=C1XFR0_MEIRU Length = 163 Score = 125 bits (315), Expect = 5e-28, Method: Composition-based stats. 
Identities = 55/171 (32%), Positives = 80/171 (46%), Gaps = 11/171 (6%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 RI V QGDIT+ A D IVNAAN L+ G GV GAI R GP++ C + G G Sbjct: 3 RIQVAQGDITEFAGDAIVNAANNHLILGSGVAGAIRRRGGPSIQGECDR----HGPIRVG 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A +T AG LP + V+H G + ++ A +LRL + +AFP + T Sbjct: 59 EAALTGAGQLPVRKVIHAA---VLGDQPATLDTVRSATQAALRLALEHRLYRLAFPLLGT 115 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 174 GV G AE+ + + P ++ Y + +A + L ++ Sbjct: 116 GVGGLGVPQVAEVMLDE----LEAAPDPLEITLYGYSQADAEAIRQALARR 162 >UniRef50_Q460N3 Poly [ADP-ribose] polymerase 15 n=12 Tax=Eutheria RepID=PAR15_HUMAN Length = 656 Score = 125 bits (314), Expect = 6e-28, Method: Composition-based stats. Identities = 49/177 (27%), Positives = 87/177 (49%), Gaps = 8/177 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + ++ GD+ + DVIVN+ +L GGG + A + AGP L R+++ + Sbjct: 68 NLKLISGDVLYIWADVIVNSVPMNLQLGGGPLSRAFLQKAGPMLQKELDD-RRRETEEKV 126 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G+ +T +L KAV+H V P W G + Q++ + L V S++S+ FP I Sbjct: 127 GNIFMTSGCNLDCKAVLHAVAPYWNNGAETSWQIMANIIKKCLTTVEVLSFSSITFPMIG 186 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITR----HALPEQVYFVCY--DEENAHLYERLLTQ 173 TG +P+A A++ + V E+ + + ++V+F+ Y D+E + T Sbjct: 187 TGSLQFPKAVFAKLILSEVFEYSSSTRPITSPLQEVHFLVYTNDDEGCQAFLDEFTN 243 Score = 114 bits (285), Expect = 1e-24, Method: Composition-based stats. 
Identities = 44/173 (25%), Positives = 71/173 (41%), Gaps = 16/173 (9%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 V GDI VDVIVN+ + GV AI AG A+ C + Q P Sbjct: 283 TFQVATGDIATEQVDVIVNSTARTFNRKSGVSRAILEGAGQAVESECAVLAAQ----PHR 338 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 +IT G L K ++H G + ++ + L YTSV+ PAI T Sbjct: 339 DFIITPGGCLKCKIIIHVPG----------GKDVRKTVTSVLEECEQRKYTSVSLPAIGT 388 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQ 174 G G A+ + + +F ++H+ + V V + E +++ + ++ Sbjct: 389 GNAGKNPITVADNIIDAIVDFSSQHSTPSLKTVKVVIFQPELLNIFYDSMKKR 441 >UniRef50_Q2ITR2 Appr-1-p processing n=1 Tax=Rhodopseudomonas palustris HaA2 RepID=Q2ITR2_RHOP2 Length = 127 Score = 125 bits (313), Expect = 9e-28, Method: Composition-based stats. Identities = 49/110 (44%), Positives = 65/110 (59%) Query: 54 RQQQGDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSY 113 ++ G C TG A ITL DLPA+ V+H VGPVW GG ED+ L Y +L+L + Sbjct: 5 CRKLGGCATGDAKITLGYDLPARHVIHAVGPVWHGGRSGEDEALASCYRRALQLCRQHGL 64 Query: 114 TSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEEN 163 S+AF AISTGVYG+P AA IAV + + A ++V F C+ E + Sbjct: 65 ASIAFSAISTGVYGFPPERAAPIAVAACIDALRTAAPVDRVVFCCFSEPS 114 >UniRef50_UPI00016E2DD3 UPI00016E2DD3 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E2DD3 Length = 1673 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. 
Identities = 54/175 (30%), Positives = 81/175 (46%), Gaps = 13/175 (7%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I V GDITK DVIVN++N + GV AI AAG A+ D C K+ P Sbjct: 1096 SVTIQAVTGDITKETTDVIVNSSNNTFSLKKGVSKAILEAAGQAVEDECQKLAAS----P 1151 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 ++T G+L K +VH G QN+ L+ ++L++ ANSYTSV+FPAI Sbjct: 1152 NAGIIMTQPGNLQCKKIVHVTG-------QNKAFLISKVVKSALQMCVANSYTSVSFPAI 1204 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVCYDEENAHLYERLLTQQ 174 TG A+ V + + +++ V V + + + + Q+ Sbjct: 1205 GTGQGNIKATEVADAMFDAVIDELRQNSSTTLNTVRIVVFQPPMLNDFYTSMQQR 1259 Score = 112 bits (279), Expect = 7e-24, Method: Composition-based stats. Identities = 44/179 (24%), Positives = 73/179 (40%), Gaps = 10/179 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I +V G+I DV VN+ L + G + A+ AAGP L D Sbjct: 900 NITLVVGNIEDATTDVTVNSVFNDLDLNRGALSRALLHAAGPQLQDFLKAQNSSG---TL 956 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G ++T L + V H V P Q L + + L+ + TS++FP+I Sbjct: 957 GEIIMTEGCQLKSMFVYHAVTP--ASYNAQAVQALGGIFRDCLKKAEDSGMTSISFPSIG 1014 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCY--DEENAHLYERLLTQQGDE 177 TG G+P+ AA++ + +F + +V + Y D E + L ++ + Sbjct: 1015 TGGLGFPKDLAAQMLYDEILKFSSKRQTKRLAEVTIILYSGDTETQQAFTAELKKKISK 1073 Score = 97.8 bits (242), Expect = 1e-19, Method: Composition-based stats. 
Identities = 43/150 (28%), Positives = 62/150 (41%), Gaps = 7/150 (4%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 T I V + DI V +V+ ANP G+ A+ +AAGP L + C ++ +G Sbjct: 684 TEIFVCKADICSYPVHAVVSYANPDFRFTSGLQRALLKAAGPQLQEDCDRLIHLKGRLKP 743 Query: 63 GHAVIT-LAGDLPAKAVVHTVGPVWRGGEQ---NEDQLLQDAYLNSLRLVAANSY---TS 115 G VIT G L + ++H V P GG+ L+ A SL L Sbjct: 744 GDNVITAAGGQLCCRNIIHAVAPKLDGGQIIFVKRVAQLKKAIKGSLELAEKKGCQLVRQ 803 Query: 116 VAFPAISTGVYGYPRAAAAEIAVKTVSEFI 145 + A S + +P AAAE + E Sbjct: 804 LKMNAKSLVLVAHPLEAAAESIRSALKEHF 833 >UniRef50_UPI000180BD0C PREDICTED: similar to Ci-Rhysin2/Deltex3-a n=1 Tax=Ciona intestinalis RepID=UPI000180BD0C Length = 578 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 47/180 (26%), Positives = 80/180 (44%), Gaps = 9/180 (5%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVR--QQQGDCP 61 + V G+I VD IVNAAN + G GV GAI + G C + +Q Sbjct: 383 NVSVAHGNIALQDVDAIVNAANKYIQNGSGVTGAIFKQGGSKFEQLCKEAMKHRQNRSLK 442 Query: 62 TGHAVITL-AGDLPAKAVVHTVGPVWRGGEQNEDQ--LLQDAYLNSLRLVAANSYTSVAF 118 G V AG+L K V+H VGP W+ ++ LL+D L+ L+ +++A Sbjct: 443 VGEVVSVKAAGNLQCKRVLHLVGPQWKNYSHKDEAYHLLEDGLLSVLKESNYCKASTLAL 502 Query: 119 PAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQ----VYFVCYDEENAHLYERLLTQQ 174 P ++TG+YG P ++ F T + ++ + + D++ + + + + Sbjct: 503 PPVATGIYGTPLKLFVRAMNTALTCFETNISRHQRSLHYIRILSIDQDTVNDLKTMFLSR 562 >UniRef50_D2VM45 Poly ADP-ribose polymerase family, member 14-like protein n=1 Tax=Naegleria gruberi RepID=D2VM45_NAEGR Length = 1557 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. 
Identities = 39/175 (22%), Positives = 76/175 (43%), Gaps = 5/175 (2%) Query: 2 KTRIHVVQGDIT--KLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGD 59 + I++ QGD+ K V+VN+AN L GGG+ G C + Sbjct: 831 NSFIYICQGDMFDKKWKAQVLVNSANDQLAHGGGIAAQCLEKCGKQFDQECKNITTSLK- 889 Query: 60 CPTGHAVITLAGDLPA-KAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVA 117 G V T AG+L + + + P++ N LL+ +N L + Y S+ Sbjct: 890 LKPGDVVPTTAGNLSYLSKIYNAIPPMYDSNNHLNSCSLLEQTVVNILTQAEKDGYCSII 949 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLT 172 PA+S+G++G+P + +I V+T+ ++ + + ++ + + + + Sbjct: 950 IPALSSGIFGFPLDQSTDIIVRTIYKYAPQLSCLREIILIGDINTVTESFSKSIN 1004 >UniRef50_B8HYS6 Appr-1-p processing domain protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HYS6_CYAP4 Length = 694 Score = 123 bits (309), Expect = 3e-27, Method: Composition-based stats. Identities = 42/161 (26%), Positives = 69/161 (42%), Gaps = 4/161 (2%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 T + V DIT+L DVIV++ + L GGGV AI AAG + +++ Sbjct: 397 NTTVRVQYCDITQLEADVIVSSDDIHLSMGGGVSEAILLAAGEVAWEEA----RRRVPLK 452 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAI 121 G IT AG L A+ + H ++ Q L++ L + A + S+A PA+ Sbjct: 453 LGEIAITTAGHLKARQLFHAAVLDYQQQTQTTVDLIRSVTRKCLEICHAQGFQSIALPAL 512 Query: 122 STGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEE 162 +TG G +A + + + + +V Y + Sbjct: 513 ATGTAGLSPEQSAIAMMLEILPHLNQETSLRRVTIALYSRQ 553 >UniRef50_D1R847 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R847_9CHLA Length = 411 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. 
Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 44/208 (21%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGP--------ALLDACLKV 53 KT++ +V+G D IVNAAN L+GGGG+DG I +G L A + Sbjct: 203 KTKVVLVKGSTLDQNTDAIVNAANERLLGGGGIDGQIWSRSGALSGAKDSGEFLKAEIMP 262 Query: 54 RQQQ---GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAA 110 + G+ P G AVIT A L ++ ++H VGP + ++L++AYLNSL L+ A Sbjct: 263 IKANLPSGNLPNGEAVITRALGLNSRYIIHAVGPR-----GAQPKVLRNAYLNSLELLDA 317 Query: 111 NSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALP------------------- 151 N S++F IS ++GY AA I V + + P Sbjct: 318 NQLKSISFCCISQSIFGYSPKDAAPIVVDLIRRYCEYKDGPTTDLQSDQLQKMVEEDESF 377 Query: 152 ---------EQVYFVCYDEENAHLYERL 170 ++ V Y + Y +L Sbjct: 378 DASKLEPFEREIRLVMYSDNEWDEYSKL 405 >UniRef50_UPI000180D216 PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) n=1 Tax=Ciona intestinalis RepID=UPI000180D216 Length = 1716 Score = 120 bits (302), Expect = 2e-26, Method: Composition-based stats. Identities = 48/175 (27%), Positives = 74/175 (42%), Gaps = 14/175 (8%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQ-GD 59 I V + DIT+ +NA+N L + GGV G I G ++ K+RQQ G Sbjct: 905 NVNIRVFKADITEHKCGAFINASNDMLELREGGVSGNILHKGGASIKGELDKLRQQNIGM 964 Query: 60 CPTGHAVITLAGDL-PAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVA 117 G T +G L K ++H VGP W+ N L+ +L + SVA Sbjct: 965 FLPGDVRSTTSGSLRNCKRIIHVVGPDWKKSSHSNNCNYLKACVHGALVEADKHKLASVA 1024 Query: 118 FPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL----------PEQVYFVCYDEE 162 PA+S G+YG + + V+T+ ++ T + + C+ EE Sbjct: 1025 IPAVSCGIYGGVPSVCIRLIVETIQQYFTGNKSKVTMVDLIENSKDDVINCFMEE 1079 Score = 102 bits (254), Expect = 5e-21, Method: Composition-based stats. 
Identities = 40/171 (23%), Positives = 63/171 (36%), Gaps = 13/171 (7%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + + GDIT+ + DVIVN+ N + G V I + G + C Q Sbjct: 1126 NVELCNGDITQDSSDVIVNSTNSNFDLRNGKVSPQILKKGGNVISQQCT---QNNHPLNK 1182 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 + IT G L K ++H V PV ++ L V A + VA PAI Sbjct: 1183 PNMRITDGGKLKCKQIIHVVVPV-------NQMQIEQVVSLILETVDALHKSVVALPAIG 1235 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRH--ALPEQVYFVCYDEENAHLYERLL 171 TG + A+ + + H + + V V + + + L Sbjct: 1236 TGNLNISPSKVAQYIRTGIVYYTANHNPSHLKTVKVVVFQQNMMQDFHTEL 1286 >UniRef50_UPI0001C3795F Appr-1-p processing domain protein n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C3795F Length = 232 Score = 120 bits (302), Expect = 2e-26, Method: Composition-based stats. Identities = 55/167 (32%), Positives = 79/167 (47%), Gaps = 4/167 (2%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG 63 ++++++ DIT L VD IV ANP L G G AI AG + K + G Sbjct: 2 KMYIIKADITTLNVDAIVLPANPQLKKGAGASQAIFEKAG---EEELRKKCKSIAPIDVG 58 Query: 64 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 123 A+ T +LP++ ++H P W G NE LL AYL+SL++ +SVAFP +S Sbjct: 59 SAIPTGGYNLPSEFIIHAAVPRWVDGGHNEYVLLSSAYLSSLKVADRIGVSSVAFPLLSA 118 Query: 124 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERL 170 G+ A +A KT+ F + VY YD+ L L Sbjct: 119 SNNGFDPRVAFYVAQKTIESF-KADKTLKDVYLTIYDKTAEALIMNL 164 >UniRef50_UPI00005A5611 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 14 n=1 Tax=Canis lupus familiaris RepID=UPI00005A5611 Length = 575 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. 
Identities = 51/167 (30%), Positives = 79/167 (47%), Gaps = 7/167 (4%) Query: 11 DITKLAVDVIVNAANPSLMGGGG-VDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITL 69 DI ++ DVIVN +L GGG + A+ + AGP L RQ + G +T Sbjct: 111 DI-RVVADVIVNTVPMNLQLGGGQLSQALLQKAGPELQKELYATRQGTEE-EVGSIFMTS 168 Query: 70 AGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYP 129 +L KAV+H V P W G + Q++ + L V S++S+ FP I TG +P Sbjct: 169 GCNLNCKAVLHVVAPHWDNGAGSSQQIMANIIKKCLTTVEEFSFSSITFPMIGTGSLRFP 228 Query: 130 RAAAAEIAVKTVSEFITR--HALPEQVYFVCY--DEENAHLYERLLT 172 +A AE+ + V F + ++V+F+ Y D+E + T Sbjct: 229 KAIFAELILSEVFRFSSSLWQKSLQEVHFLVYPGDDETLQAFLDKFT 275 Score = 117 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 46/176 (26%), Positives = 75/176 (42%), Gaps = 16/176 (9%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + GDITK DVIVN+ + GV A+ AGPA+ + C + P Sbjct: 315 VTFQIATGDITKEKADVIVNSTTRTFNLKSGVSKAVLEGAGPAVENECA----VRAAQPH 370 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G +IT G L K ++H +G D ++ L YTSVA PAI Sbjct: 371 GEFIITQGGYLMCKIIIHVLG----------DNDVRKTVSAVLEECEQRKYTSVALPAIG 420 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITRHA--LPEQVYFVCYDEENAHLYERLLTQQGD 176 TG G A+ + V +F +H+ ++V V + + +++ + ++ + Sbjct: 421 TGSAGKNPTIVADDMISAVVDFSWKHSTPSLKKVKVVIFLSDLLNVFHDNMKKREN 476 >UniRef50_A9JRH9 Si:ch211-219a4.3 protein n=5 Tax=Clupeocephala RepID=A9JRH9_DANRE Length = 714 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. 
Identities = 42/174 (24%), Positives = 72/174 (41%), Gaps = 13/174 (7%) Query: 3 TRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 + V+ GDIT DVIVN+ GV AI AGP + C ++ G Sbjct: 115 VTLQVLNGDITTEQTDVIVNSTTKEFTLKAGVSKAILDKAGPNVEAECQQL----GAKTD 170 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 ++T AG+L K ++H Q+ ++Q +L + A TS+AFPAI Sbjct: 171 SGLIMTQAGNLQCKKIIHIAA-------QSNAIMIQKHVRKALEMCAKEKLTSIAFPAIG 223 Query: 123 TGVYGYPRAAAAEIAVKTVSEFITR--HALPEQVYFVCYDEENAHLYERLLTQQ 174 TG G A+ + + E + + + + + V + + + + + Sbjct: 224 TGQAGLSPGQVADSMLDGMLEMLRKTPQSSLKLIRLVVFQAHMLPEFLKSMQNR 277 Score = 55.0 bits (131), Expect = 9e-07, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 35/79 (44%), Gaps = 4/79 (5%) Query: 101 YLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL--PEQVYFVC 158 L +S+ F AI TG G+P++ + +V +F T+ + ++V FV Sbjct: 1 MDTCLCKAEQRQQSSIVFSAIGTGNLGFPKSLVVSTMLDSVFKFSTKRSSKHIQEVAFVL 60 Query: 159 Y--DEENAHLYERLLTQQG 175 + D + ++ +++ Sbjct: 61 HPKDTQTIQVFTDEFSKRF 79 >UniRef50_A1R2V6 Putative uncharacterized protein n=1 Tax=Arthrobacter aurescens TC1 RepID=A1R2V6_ARTAT Length = 152 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 58/146 (39%), Positives = 78/146 (53%), Gaps = 10/146 (6%) Query: 35 DGAIHRAAGPALLDACLKVRQQQ--GDCPTGHAVITLAGDLPAKAVVHTVGPVWRGGEQN 92 DGAIHRAAG LL+AC ++R+ + P G AV T A LPA V+HTVGP G Q Sbjct: 2 DGAIHRAAGSELLEACRELRRTELPEGLPVGAAVATPAFRLPAHWVIHTVGPNRHAG-QT 60 Query: 93 EDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHAL-- 150 + LL + SL++ A S+AFPAIS G+YG+ AE+A V F + ++ Sbjct: 61 DPALLASCFRESLKVAAGLGARSLAFPAISAGIYGWDSRQVAEVAFDAVGSFSSSSSVSA 120 Query: 151 -----PEQVYFVCYDEENAHLYERLL 171 E V FV + EE ++ L Sbjct: 121 ASERGFELVEFVLFSEETTAVFRAAL 146 >UniRef50_Q4T065 Chromosome undetermined SCAF11328, whole genome shotgun sequence. 
(Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4T065_TETNG Length = 566 Score = 119 bits (298), Expect = 5e-26, Method: Composition-based stats. Identities = 49/157 (31%), Positives = 74/157 (47%), Gaps = 7/157 (4%) Query: 1 MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDC 60 + +I + +GD+ L IVN ++ SL V +IHR AGP L D LK++ C Sbjct: 50 INAKIVLFKGDVALLNCTSIVNTSSESLNDKNPVSDSIHRLAGPELRDELLKLK----GC 105 Query: 61 PTGHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQ-LLQDAYLNSLRLVAANSYTSVAFP 119 TG A +T L A+ ++HTVGP ++ + + L Y + L+LV S SV Sbjct: 106 RTGEAKLTKGFGLAARFIIHTVGPKYKTKYRTAAESSLYSCYRSVLQLVVEQSMASVGLC 165 Query: 120 AISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYF 156 I+T GYP A +A++ A +V Sbjct: 166 TITTSKRGYPLEEATHMALRE--SHTHTPATSARVRL 200 >UniRef50_Q4SK44 Chromosome 2 SCAF14570, whole genome shotgun sequence. (Fragment) n=2 Tax=Tetraodon nigroviridis RepID=Q4SK44_TETNG Length = 865 Score = 118 bits (296), Expect = 8e-26, Method: Composition-based stats. Identities = 42/179 (23%), Positives = 74/179 (41%), Gaps = 8/179 (4%) Query: 4 RIHVVQGDITKLAVDVIVNAANPSL-MGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPT 62 I + G I DVIVN+ +L + G + AI +AAGP L + + ++ Sbjct: 102 NIALATGKIEDATTDVIVNSVFKALNLKEGALSNAIFQAAGPQLQ---VLLNAKKSSGTV 158 Query: 63 GHAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIS 122 G ++T L + V H V P + + L + + L TS++FP I Sbjct: 159 GDVIVTEGCQLKSMFVYHAVTPAKGTAQDQAMKALSGIFRDCLNKAEDRGMTSISFPTIG 218 Query: 123 TGVYGYPRAAAAEIAVKTVSEFI--TRHALPEQVYFVCY--DEENAHLYERLLTQQGDE 177 TG G+ + A++ +S+F + +V + Y D E ++ L + + Sbjct: 219 TGQLGFSKDHVAQVLYGEISKFSHKRQTKSLTEVTVILYSGDTETQQVFTEELKKHFSK 277 Score = 63.1 bits (152), Expect = 4e-09, Method: Composition-based stats. Identities = 26/60 (43%), Positives = 31/60 (51%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 I V GDITK DVIVN++N + GV AI AAG A+ C K+ QQ C Sbjct: 326 SVTIQAVTGDITKETTDVIVNSSNENFTLKRGVSKAILEAAGQAVEAECQKLEWQQIVCQ 385 >UniRef50_UPI0001BC8416 Appr-1-p processing domain protein n=1 Tax=Bacteroides sp. 
D2 RepID=UPI0001BC8416 Length = 430 Score = 118 bits (295), Expect = 1e-25, Method: Composition-based stats. Identities = 47/161 (29%), Positives = 73/161 (45%), Gaps = 5/161 (3%) Query: 2 KTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCP 61 K+R+ + GD+T DVIV++ + L GGGV +I RA G + K C Sbjct: 11 KSRLIIKFGDLTSAVTDVIVSSDDAYLSMGGGVSASILRAGGDVIARDARKNV----PCQ 66 Query: 62 TGHAVITLAGDLPAKAVVHTVGPVWRGGEQ-NEDQLLQDAYLNSLRLVAANSYTSVAFPA 120 G ++T AG L AK V H + W ++ ++ + SL +++ S+AFPA Sbjct: 67 MGDVIVTSAGKLEAKYVFHAITIDWSQKDEFTVEKSINSIIKKSLNVLSVLGLKSIAFPA 126 Query: 121 ISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDE 161 I TG Y A +SEF++ ++Y D Sbjct: 127 IGTGAARYSLEDVAHFMSMAISEFLSNSDEELEIYIYLMDR 167

Database: uniref50.fasta
Posted date: Mar 8, 2010 10:38 AM
Number of letters in database: 1,040,396,356
Number of sequences in database: 3,077,464

Lambda K H
0.308 0.138 0.343

Lambda K H
0.267 0.0428 0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,003,189,444
Number of Sequences: 3077464
Number of extensions: 37415931
Number of successful extensions: 113051
Number of sequences better than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 1053
Number of HSP's successfully gapped in prelim test: 336
Number of HSP's that attempted gapping in prelim test: 109351
Number of HSP's gapped (non-prelim): 1661
length of query: 177
length of database: 1,040,396,356
effective HSP length: 120
effective length of query: 57
effective length of database: 671,100,676
effective search space: 38252738532
effective search space used: 38252738532
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.1 bits)
S2: 89 (38.8 bits)