BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (337 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q7MNZ9 Probable O-sialoglycoprotein endopeptidase n=19 ... 546 e-154 UniRef50_P36175 O-sialoglycoprotein endopeptidase n=366 Tax=cell... 523 e-147 UniRef50_B0TIN7 Probable O-sialoglycoprotein endopeptidase n=130... 510 e-143 UniRef50_A1AXM9 Probable O-sialoglycoprotein endopeptidase n=36 ... 418 e-115 UniRef50_Q8D283 Probable O-sialoglycoprotein endopeptidase n=11 ... 407 e-112 UniRef50_C6P1W3 Metalloendopeptidase, glycoprotease family n=1 T... 401 e-110 UniRef50_B0TX13 Probable O-sialoglycoprotein endopeptidase n=19 ... 347 3e-94 UniRef50_Q058D1 Probable O-sialoglycoprotein endopeptidase n=1 T... 325 2e-87 UniRef50_Q127W3 Probable O-sialoglycoprotein endopeptidase n=4 T... 319 7e-86 UniRef50_A0L5L8 Probable O-sialoglycoprotein endopeptidase n=24 ... 310 4e-83 UniRef50_C0QTG9 Probable O-sialoglycoprotein endopeptidase n=3 T... 293 8e-78 UniRef50_C4Z311 O-sialoglycoprotein endopeptidase n=13 Tax=Bacte... 292 9e-78 UniRef50_A5G3X1 Probable O-sialoglycoprotein endopeptidase n=20 ... 292 1e-77 UniRef50_B3WUZ1 Probable O-sialoglycoprotein endopeptidase n=1 T... 292 1e-77 UniRef50_C1A601 Probable O-sialoglycoprotein endopeptidase n=1 T... 291 2e-77 UniRef50_B9KXJ0 Probable O-sialoglycoprotein endopeptidase n=4 T... 283 9e-75 UniRef50_C1SJZ8 Metalloendopeptidase, putative, glycoprotease fa... 282 1e-74 UniRef50_Q3YS67 Probable O-sialoglycoprotein endopeptidase n=24 ... 282 1e-74 UniRef50_B2V910 Probable O-sialoglycoprotein endopeptidase n=4 T... 279 1e-73 UniRef50_B6BRQ7 O-sialoglycoprotein endopeptidase n=1 Tax=Candid... 279 1e-73 UniRef50_Q11TP2 Probable O-sialoglycoprotein endopeptidase n=87 ... 279 1e-73 UniRef50_A9FDL0 Probable O-sialoglycoprotein endopeptidase n=5 T... 276 1e-72 UniRef50_Q4FNV6 Probable O-sialoglycoprotein endopeptidase n=15 ... 275 1e-72 UniRef50_Q18CP0 Probable O-sialoglycoprotein endopeptidase n=22 ... 274 4e-72 UniRef50_D0RQS5 Putative glycoprotease GCP n=1 Tax=alpha proteob... 272 1e-71 UniRef50_D0ME01 Metalloendopeptidase, glycoprotease family n=4 T... 272 1e-71 UniRef50_D1B623 Metalloendopeptidase, glycoprotease family n=3 T... 271 3e-71 UniRef50_Q6AL73 Probable O-sialoglycoprotein endopeptidase n=3 T... 271 3e-71 UniRef50_A5GMV4 Probable O-sialoglycoprotein endopeptidase n=17 ... 270 5e-71 UniRef50_Q0AVU0 Probable O-sialoglycoprotein endopeptidase n=27 ... 270 6e-71 UniRef50_B1GZV6 Probable O-sialoglycoprotein endopeptidase n=1 T... 268 2e-70 UniRef50_C0Q8X7 Probable O-sialoglycoprotein endopeptidase n=2 T... 267 5e-70 UniRef50_Q8RC98 Probable O-sialoglycoprotein endopeptidase n=12 ... 266 5e-70 UniRef50_D1AVQ5 Metalloendopeptidase, glycoprotease family n=1 T... 266 9e-70 UniRef50_C7ND80 Metalloendopeptidase, glycoprotease family n=3 T... 266 9e-70 UniRef50_A5CE49 Probable O-sialoglycoprotein endopeptidase n=2 T... 265 2e-69 UniRef50_B4U8B7 Metalloendopeptidase, glycoprotease family n=1 T... 264 4e-69 UniRef50_Q2GEG6 Probable O-sialoglycoprotein endopeptidase n=2 T... 263 5e-69 UniRef50_B2GAG0 Probable O-sialoglycoprotein endopeptidase n=56 ... 263 5e-69 UniRef50_B2UQZ0 Metalloendopeptidase, glycoprotease family n=3 T... 262 1e-68 UniRef50_B9XP92 Metalloendopeptidase, glycoprotease family n=1 T... 262 1e-68 UniRef50_Q0ATQ2 Probable O-sialoglycoprotein endopeptidase n=44 ... 261 2e-68 UniRef50_A6DFV1 Metalloendopeptidase, putative, glycoprotease fa... 261 2e-68 UniRef50_B3R0M3 Probable O-sialoglycoprotein endopeptidase n=2 T... 260 5e-68 UniRef50_C8W929 Metalloendopeptidase, glycoprotease family n=2 T... 259 9e-68 UniRef50_A7HLB0 Probable O-sialoglycoprotein endopeptidase n=3 T... 259 1e-67 UniRef50_C7H0S4 Putative glycoprotease GCP n=1 Tax=Eubacterium s... 259 1e-67 UniRef50_C7N1K1 Ribosomal-protein-alanine acetyltransferase n=1 ... 258 2e-67 UniRef50_Q2RGJ3 Probable O-sialoglycoprotein endopeptidase n=10 ... 258 3e-67 UniRef50_A4EBV8 Putative uncharacterized protein n=5 Tax=Bacteri... 257 5e-67 UniRef50_C9RIN4 Metalloendopeptidase, glycoprotease family n=1 T... 256 7e-67 UniRef50_C1TLM6 O-sialoglycoprotein endopeptidase n=1 Tax=Dethio... 256 1e-66 UniRef50_Q3SVF4 Probable O-sialoglycoprotein endopeptidase n=10 ... 255 2e-66 UniRef50_Q6MQ48 Probable O-sialoglycoprotein endopeptidase n=1 T... 255 2e-66 UniRef50_C7MKR9 Ribosomal-protein-alanine acetyltransferase n=10... 254 3e-66 UniRef50_Q8DLI9 Probable O-sialoglycoprotein endopeptidase n=12 ... 253 7e-66 UniRef50_B9JCG8 Probable O-sialoglycoprotein endopeptidase n=86 ... 253 7e-66 UniRef50_Q2JXG9 Probable O-sialoglycoprotein endopeptidase n=31 ... 252 1e-65 UniRef50_B7CBT6 Putative uncharacterized protein n=1 Tax=Eubacte... 250 5e-65 UniRef50_Q47LN7 Probable O-sialoglycoprotein endopeptidase n=58 ... 250 6e-65 UniRef50_B3DVR7 Metal-dependent protease with possible chaperone... 249 7e-65 UniRef50_D1IZQ0 Whole genome shotgun sequence of line PN40024, s... 249 9e-65 UniRef50_B1V8Z6 Probable O-sialoglycoprotein endopeptidase n=6 T... 248 2e-64 UniRef50_B2KE20 Metalloendopeptidase, glycoprotease family n=1 T... 248 2e-64 UniRef50_A8GM49 Probable O-sialoglycoprotein endopeptidase n=15 ... 248 2e-64 UniRef50_Q7UM42 Probable O-sialoglycoprotein endopeptidase n=5 T... 247 5e-64 UniRef50_B6JAE9 Probable O-sialoglycoprotein endopeptidase n=5 T... 246 8e-64 UniRef50_B5ZLG0 Metalloendopeptidase, glycoprotease family n=11 ... 246 1e-63 UniRef50_B0VHD4 Putative metalloendopeptidase, , glycoprotease f... 246 1e-63 UniRef50_B1ZYF9 Metalloendopeptidase, glycoprotease family n=3 T... 246 1e-63 UniRef50_A1BJ68 Probable O-sialoglycoprotein endopeptidase n=12 ... 244 4e-63 UniRef50_D0WGH2 O-sialoglycoprotein endopeptidase n=1 Tax=Slacki... 244 4e-63 UniRef50_Q6MD07 Probable O-sialoglycoprotein endopeptidase n=2 T... 243 6e-63 UniRef50_Q5FLZ3 Probable O-sialoglycoprotein endopeptidase n=10 ... 243 9e-63 UniRef50_A0JZ01 Probable O-sialoglycoprotein endopeptidase n=98 ... 243 9e-63 UniRef50_D1N4S8 Metalloendopeptidase, glycoprotease family n=1 T... 242 2e-62 UniRef50_C7LR95 Metalloendopeptidase, glycoprotease family n=1 T... 240 5e-62 UniRef50_D0N6Q4 O-sialoglycoprotein endopeptidase, putative n=1 ... 240 6e-62 UniRef50_B2S3R9 Probable O-sialoglycoprotein endopeptidase n=4 T... 239 1e-61 UniRef50_B8BPP0 Putative uncharacterized protein n=1 Tax=Oryza s... 238 2e-61 UniRef50_Q30ZN1 Probable O-sialoglycoprotein endopeptidase n=12 ... 236 7e-61 UniRef50_Q045T6 Probable O-sialoglycoprotein endopeptidase n=433... 236 8e-61 UniRef50_Q2SR45 Probable O-sialoglycoprotein endopeptidase n=5 T... 234 5e-60 UniRef50_Q04RH4 Probable O-sialoglycoprotein endopeptidase n=6 T... 233 1e-59 UniRef50_C7M316 Metalloendopeptidase, glycoprotease family n=1 T... 232 2e-59 UniRef50_B1XJF0 Probable O-sialoglycoprotein endopeptidase n=1 T... 230 4e-59 UniRef50_D2L1E2 Metalloendopeptidase, glycoprotease family n=1 T... 230 7e-59 UniRef50_Q0SM86 Probable O-sialoglycoprotein endopeptidase n=18 ... 228 2e-58 UniRef50_Q1IZH8 Probable O-sialoglycoprotein endopeptidase n=4 T... 228 3e-58 UniRef50_A4RXP4 Predicted protein n=6 Tax=Eukaryota RepID=A4RXP4... 228 3e-58 UniRef50_B5RQA5 Probable O-sialoglycoprotein endopeptidase n=4 T... 227 4e-58 UniRef50_A1R8N0 Probable O-sialoglycoprotein endopeptidase n=12 ... 226 1e-57 UniRef50_C8WN77 Metalloendopeptidase, glycoprotease family n=3 T... 224 4e-57 UniRef50_C0QY51 Probable O-sialoglycoprotein endopeptidase n=2 T... 223 5e-57 UniRef50_D0JBS4 Glycoprotease M22 family domain-containing prote... 223 6e-57 UniRef50_Q254Q0 Probable O-sialoglycoprotein endopeptidase n=6 T... 220 5e-56 UniRef50_A3EUW9 O-sialoglycoprotein endopeptidase n=3 Tax=Leptos... 220 6e-56 UniRef50_A9WHP1 Metalloendopeptidase, glycoprotease family n=4 T... 218 2e-55 UniRef50_C4XSD3 Probable O-sialoglycoprotein endopeptidase n=2 T... 216 1e-54 UniRef50_Q1IUF1 Probable O-sialoglycoprotein endopeptidase n=2 T... 216 1e-54 UniRef50_Q0BPC9 Probable O-sialoglycoprotein endopeptidase n=14 ... 216 1e-54 UniRef50_A6Q6J3 Probable O-sialoglycoprotein endopeptidase n=2 T... 214 3e-54 UniRef50_Q4A734 Probable O-sialoglycoprotein endopeptidase n=1 T... 214 4e-54 UniRef50_C2KP25 O-sialoglycoprotein endopeptidase n=3 Tax=Mobilu... 213 5e-54 UniRef50_A7H0K1 Probable O-sialoglycoprotein endopeptidase n=26 ... 213 1e-53 UniRef50_Q54EW4 Putative uncharacterized protein n=1 Tax=Dictyos... 212 1e-53 UniRef50_B0D096 Predicted protein n=2 Tax=Agaricales RepID=B0D09... 211 2e-53 UniRef50_Q5ZZQ1 Probable O-sialoglycoprotein endopeptidase n=8 T... 210 5e-53 UniRef50_D1B582 Metalloendopeptidase, glycoprotease family n=5 T... 210 6e-53 UniRef50_B8LEI0 Predicted protein (Fragment) n=1 Tax=Thalassiosi... 208 2e-52 UniRef50_C5ZWF6 Metal-dependent protease n=2 Tax=Helicobacter ca... 208 3e-52 UniRef50_C1F9R2 Metalloendopeptidase, glycoprotease family n=1 T... 202 1e-50 UniRef50_B0B9U7 Probable O-sialoglycoprotein endopeptidase n=6 T... 202 2e-50 UniRef50_UPI000058820F PREDICTED: hypothetical protein n=2 Tax=S... 202 2e-50 UniRef50_B3MQN2 GF20469 n=4 Tax=Drosophila RepID=B3MQN2_DROAN 194 3e-48 UniRef50_P75055 Probable O-sialoglycoprotein endopeptidase n=2 T... 192 1e-47 UniRef50_B8PI87 Predicted protein n=2 Tax=Postia placenta Mad-69... 192 2e-47 UniRef50_Q29HY2 GA12844 n=3 Tax=Sophophora RepID=Q29HY2_DROPS 190 5e-47 UniRef50_C3XEQ4 O-sialoglycoprotein endopeptidase n=1 Tax=Helico... 190 5e-47 UniRef50_B3PND6 Probable O-sialoglycoprotein endopeptidase n=2 T... 190 5e-47 UniRef50_B1AJ51 Probable O-sialoglycoprotein endopeptidase n=15 ... 189 1e-46 UniRef50_B6JWU0 Glycoprotease pgp1 n=1 Tax=Schizosaccharomyces j... 188 2e-46 UniRef50_B3RQR7 Putative uncharacterized protein n=1 Tax=Trichop... 187 4e-46 UniRef50_Q9H4B0 Probable O-sialoglycoprotein endopeptidase 2 n=3... 186 7e-46 UniRef50_Q6C9V8 YALI0D07920p n=1 Tax=Yarrowia lipolytica RepID=Q... 186 1e-45 UniRef50_Q8EUQ9 Probable O-sialoglycoprotein endopeptidase n=1 T... 186 1e-45 UniRef50_Q9VWD6 Probable O-sialoglycoprotein endopeptidase 2 n=6... 183 6e-45 UniRef50_UPI0000D561DB PREDICTED: similar to AGAP005215-PA n=1 T... 182 1e-44 UniRef50_Q17Z01 Probable O-sialoglycoprotein endopeptidase n=13 ... 181 3e-44 UniRef50_Q17CG3 O-sialoglycoprotein endopeptidase n=2 Tax=Culici... 176 1e-42 UniRef50_Q7NB15 Probable O-sialoglycoprotein endopeptidase n=1 T... 176 2e-42 UniRef50_UPI0001979AA5 putative DNA-binding/iron metalloprotein/... 175 2e-42 UniRef50_Q4PGZ6 Putative uncharacterized protein n=2 Tax=Ustilag... 175 2e-42 UniRef50_UPI000186D055 conserved hypothetical protein n=1 Tax=Pe... 172 2e-41 UniRef50_O94710 Glycoprotease pgp1, mitochondrial n=1 Tax=Schizo... 172 2e-41 UniRef50_C4PYC5 Mername-AA018 peptidase (M22 family) n=1 Tax=Sch... 169 1e-40 UniRef50_P43122 Putative protease QRI7 n=12 Tax=Saccharomycetace... 168 3e-40 UniRef50_C4QZU9 Putative metalloprotease, similar to O-sialoglyc... 168 3e-40 UniRef50_B5Y892 O-sialoglycoprotein endopeptidase n=1 Tax=Coprot... 167 6e-40 UniRef50_D2LQ34 Metalloendopeptidase, glycoprotease family n=1 T... 166 1e-39 UniRef50_UPI000180B634 PREDICTED: similar to Probable O-sialogly... 166 1e-39 UniRef50_Q93170 Protein C01G10.10, confirmed by transcript evide... 165 2e-39 UniRef50_A5UMH5 Putative O-sialoglycoprotein endopeptidase n=5 T... 162 1e-38 UniRef50_B7XIP4 O-sialoglycoprotein endopeptidase n=2 Tax=Eukary... 161 3e-38 UniRef50_UPI0001C42124 glycoprotease M22 family n=1 Tax=Methanob... 160 5e-38 UniRef50_D2VC41 Predicted protein n=1 Tax=Naegleria gruberi RepI... 159 1e-37 UniRef50_UPI0000DB7930 PREDICTED: similar to O-sialoglycoprotein... 159 1e-37 UniRef50_Q9NPF4 Probable O-sialoglycoprotein endopeptidase n=81 ... 159 2e-37 UniRef50_Q4U8J6 Glycoprotease, putative n=2 Tax=Theileria RepID=... 159 2e-37 UniRef50_Q74M58 Putative O-sialoglycoprotein endopeptidase n=1 T... 158 3e-37 UniRef50_Q6L243 Putative O-sialoglycoprotein endopeptidase n=3 T... 157 4e-37 UniRef50_UPI0000E8089C PREDICTED: similar to Osgepl1 protein n=1... 155 2e-36 UniRef50_A2QMR2 Function: O-sialoglycoprotein endopeptidase is a... 153 7e-36 UniRef50_B6GZQ3 Pc12g05880 protein n=9 Tax=Trichocomaceae RepID=... 152 2e-35 UniRef50_C5KYH6 Glycoprotein endopeptidase, putative n=4 Tax=Per... 152 2e-35 UniRef50_UPI000023E24C hypothetical protein FG06887.1 n=1 Tax=Gi... 151 3e-35 UniRef50_Q46FS9 Putative O-sialoglycoprotein endopeptidase n=17 ... 149 2e-34 UniRef50_A5DGU9 Putative uncharacterized protein n=2 Tax=Pichia ... 149 2e-34 UniRef50_UPI0000F51796 O-sialoglycoprotein endopeptidase/protein... 148 2e-34 UniRef50_B7QJD9 O-sialoglycoprotein endopeptidase, putative n=3 ... 145 1e-33 UniRef50_A4VEZ5 O-sialoglycoprotein endopeptidase n=1 Tax=Tetrah... 145 2e-33 UniRef50_C4Y0N8 Putative uncharacterized protein n=1 Tax=Clavisp... 142 1e-32 UniRef50_A3MSX6 Putative O-sialoglycoprotein endopeptidase n=2 T... 141 3e-32 UniRef50_A2BJY9 Putative O-sialoglycoprotein endopeptidase n=22 ... 139 2e-31 UniRef50_D2RYV2 Metalloendopeptidase, glycoprotease family n=1 T... 137 8e-31 UniRef50_C7DHT9 Metalloendopeptidase, glycoprotease family n=1 T... 136 9e-31 UniRef50_B9WFF4 Metalloprotease, putative n=8 Tax=Saccharomyceta... 136 1e-30 UniRef50_P36174 Putative O-sialoglycoprotein endopeptidase n=1 T... 135 3e-30 UniRef50_A6VJ51 Putative O-sialoglycoprotein endopeptidase n=26 ... 134 4e-30 UniRef50_Q4UA14 Glycoprotein endopeptidase, putative n=3 Tax=Pir... 134 6e-30 UniRef50_Q2GXN6 Putative glycoprotein endopeptidase KAE1 n=18 Ta... 134 6e-30 UniRef50_Q18KI0 Putative O-sialoglycoprotein endopeptidase n=14 ... 133 7e-30 UniRef50_Q6L4N8 Os05g0194600 protein n=21 Tax=Eukaryota RepID=Q6... 129 2e-28 UniRef50_A8WMS3 Putative uncharacterized protein n=1 Tax=Caenorh... 127 4e-28 UniRef50_B8MFK9 Glycoprotease family protein, putative n=5 Tax=L... 127 7e-28 UniRef50_Q83I95 Probable O-sialoglycoprotein endopeptidase n=2 T... 127 8e-28 UniRef50_P36132 Putative glycoprotein endopeptidase KAE1 n=40 Ta... 123 8e-27 UniRef50_A8QDL6 Glycoprotease family protein n=1 Tax=Brugia mala... 122 2e-26 UniRef50_Q5KFY5 Mitochondrion protein, putative n=2 Tax=Filobasi... 119 1e-25 UniRef50_C1GKA7 Glycoprotease pgp1 n=11 Tax=Onygenales RepID=C1G... 118 2e-25 UniRef50_A3CXS0 Putative O-sialoglycoprotein endopeptidase n=5 T... 117 7e-25 UniRef50_C8V9Q8 PROBABLE O-SIALOGLYCOPROTEIN ENDOPEPTIDASE (AFU_... 114 5e-24 UniRef50_A8BDD4 O-sialoglycoprotein endopeptidase n=2 Tax=Giardi... 111 5e-23 UniRef50_Q97ZY8 Putative O-sialoglycoprotein endopeptidase n=1 T... 110 1e-22 UniRef50_Q2HG58 Putative uncharacterized protein n=1 Tax=Chaetom... 108 2e-22 UniRef50_A7APL5 Glycoprotease family protein n=1 Tax=Babesia bov... 106 1e-21 UniRef50_C0GE31 O-sialoglycoprotein endopeptidase n=1 Tax=Dethio... 105 3e-21 UniRef50_B2A533 O-sialoglycoprotein endopeptidase n=1 Tax=Natran... 104 5e-21 UniRef50_A6S1G0 Putative uncharacterized protein n=1 Tax=Botryot... 103 8e-21 UniRef50_C5FT24 Glycoprotease family protein n=2 Tax=Onygenales ... 99 2e-19 UniRef50_D1BMJ2 Metal-dependent protease with possible chaperone... 99 2e-19 UniRef50_C0ZC04 Peptidase M22 family protein n=1 Tax=Brevibacill... 96 1e-18 UniRef50_Q3AAM2 Glycoprotease family protein n=1 Tax=Carboxydoth... 96 2e-18 UniRef50_A8MFJ2 O-sialoglycoprotein endopeptidase n=1 Tax=Alkali... 95 3e-18 UniRef50_A6NUZ4 Putative uncharacterized protein n=1 Tax=Bactero... 94 5e-18 UniRef50_Q7SD85 Predicted protein n=2 Tax=Sordariaceae RepID=Q7S... 94 5e-18 UniRef50_C0ATA9 Putative uncharacterized protein n=1 Tax=Proteus... 93 2e-17 UniRef50_B2WBX5 Glycoprotease pgp1, mitochondrial n=1 Tax=Pyreno... 92 3e-17 UniRef50_A4RG35 Putative uncharacterized protein n=1 Tax=Magnapo... 92 3e-17 UniRef50_A0RY43 O-sialoglycoprotein endopeptidase n=4 Tax=Thauma... 92 3e-17 UniRef50_D2RJI3 Peptidase M22 glycoprotease n=2 Tax=Acidaminococ... 92 4e-17 UniRef50_Q0V4Z5 Putative uncharacterized protein n=1 Tax=Phaeosp... 91 4e-17 UniRef50_C9LLA9 Glycoprotease family protein n=1 Tax=Dialister i... 91 4e-17 UniRef50_D2EF31 O-sialoglycoprotein endopeptidase (Fragment) n=1... 91 4e-17 UniRef50_Q8IJ99 Glycoprotease, putative n=5 Tax=Plasmodium RepID... 91 5e-17 UniRef50_A7VX43 Putative uncharacterized protein n=4 Tax=Clostri... 87 8e-16 UniRef50_A6TR37 O-sialoglycoprotein endopeptidase n=1 Tax=Alkali... 86 2e-15 UniRef50_Q0AZF6 Putative uncharacterized protein n=1 Tax=Syntrop... 83 1e-14 UniRef50_C8WXH0 Peptidase M22 glycoprotease n=2 Tax=Alicyclobaci... 83 2e-14 UniRef50_Q2RIB0 O-sialoglycoprotein endopeptidase n=5 Tax=Clostr... 83 2e-14 UniRef50_B0TEI7 O-sialoglycoprotein endopeptidase, putative n=1 ... 82 2e-14 UniRef50_UPI0000DD8AA6 Os01g0295900 n=1 Tax=Oryza sativa Japonic... 82 3e-14 UniRef50_B9PG42 O-sialoglycoprotein endopeptidase, putative n=3 ... 82 3e-14 UniRef50_C5KJ57 Putative uncharacterized protein (Fragment) n=1 ... 82 3e-14 UniRef50_D1AUR9 Putative endopeptidase n=1 Tax=Anaplasma central... 80 1e-13 UniRef50_B0AAV1 Putative uncharacterized protein n=2 Tax=Clostri... 80 1e-13 UniRef50_A9UYP5 Predicted protein n=1 Tax=Monosiga brevicollis R... 77 8e-13 UniRef50_B2AYU1 Predicted CDS Pa_1_12230 (Fragment) n=1 Tax=Podo... 77 9e-13 UniRef50_A5KDZ1 O-sialoglycoprotein endopeptidase, putative n=1 ... 75 2e-12 UniRef50_Q18B67 Probable O-sialoglycoprotein endopeptidase n=6 T... 73 1e-11 UniRef50_B9HH45 Predicted protein n=7 Tax=Eukaryota RepID=B9HH45... 70 9e-11 UniRef50_D1IQV9 Whole genome shotgun sequence of line PN40024, s... 69 3e-10 UniRef50_UPI000187E9E4 hypothetical protein MPER_08009 n=1 Tax=M... 65 5e-09 UniRef50_C7H6X1 Glycoprotease family protein n=2 Tax=Faecalibact... 63 1e-08 UniRef50_B8I821 Peptidase M22 glycoprotease n=1 Tax=Clostridium ... 62 2e-08 UniRef50_P43990 Probable M22 peptidase homolog HI0388 n=24 Tax=P... 62 4e-08 UniRef50_Q0JNG2 Os01g0295900 protein n=5 Tax=Oryza sativa RepID=... 61 6e-08 UniRef50_D1PKV9 Glycoprotease family protein n=1 Tax=Subdoligran... 61 7e-08 UniRef50_UPI00019087BD O-sialoglycoprotein endopeptidase n=1 Tax... 59 2e-07 UniRef50_Q1Q3G6 Putative uncharacterized protein n=1 Tax=Candida... 57 7e-07 UniRef50_B9Z6Q4 Peptidase M22 glycoprotease n=1 Tax=Lutiella nit... 57 8e-07 UniRef50_B7GZH2 Glycoprotease family protein n=17 Tax=Acinetobac... 57 9e-07 UniRef50_Q2SL20 Inactive metal-dependent protease-like protein n... 56 2e-06 UniRef50_P76256 M22 peptidase homolog yeaZ n=236 Tax=Gammaproteo... 55 3e-06 UniRef50_Q31G60 Peptidase M22 glycoprotease family protein n=1 T... 55 3e-06 UniRef50_C0GCU7 Peptidase M22 glycoprotease n=1 Tax=Dethiobacter... 55 3e-06 UniRef50_Q1MXN6 Putative uncharacterized protein n=1 Tax=Bermane... 55 4e-06 UniRef50_B3ERC8 Putative uncharacterized protein n=1 Tax=Candida... 54 6e-06 UniRef50_P57409 Uncharacterized protein BU324 n=4 Tax=Buchnera a... 54 6e-06 UniRef50_C9SIA9 Glycoprotease pgp1 n=2 Tax=Sordariomycetes RepID... 54 7e-06 UniRef50_C3WNF1 Glycoprotease n=10 Tax=Fusobacterium RepID=C3WNF... 54 1e-05 UniRef50_Q5E439 Predicted peptidase n=5 Tax=Vibrionaceae RepID=Q... 54 1e-05 UniRef50_D0SL00 Glycoprotease n=1 Tax=Acinetobacter junii SH205 ... 53 1e-05 UniRef50_B8GRE1 Peptidase M22 glycoprotease n=2 Tax=Chromatiales... 53 1e-05 UniRef50_A0LXU5 Peptidase, family M22 n=5 Tax=Bacteroidetes RepI... 53 1e-05 UniRef50_Q8KG29 Protease, putative n=1 Tax=Chlorobaculum tepidum... 53 2e-05 UniRef50_C1BYL4 Probable O-sialoglycoprotein endopeptidase 2 n=1... 53 2e-05 UniRef50_B2AYU2 Predicted CDS Pa_1_12240 (Fragment) n=1 Tax=Podo... 53 2e-05 UniRef50_A1WXT3 Peptidase M22, glycoprotease n=1 Tax=Halorhodosp... 52 2e-05 UniRef50_Q18CP2 Putative glycoprotease n=9 Tax=Clostridium RepID... 52 3e-05 >UniRef50_Q7MNZ9 Probable O-sialoglycoprotein endopeptidase n=19 Tax=Gammaproteobacteria RepID=GCP_VIBVY Length = 339 Score = 546 bits (1406), Expect = e-154, Method: Compositional matrix adjust. Identities = 259/334 (77%), Positives = 294/334 (88%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIETSCDETGIAIYDDEKGLLA++LYSQ+KLHADYGGVVPELASRDHV+KT+PLI+ Sbjct: 1 MRILGIETSCDETGIAIYDDEKGLLAHKLYSQIKLHADYGGVVPELASRDHVKKTIPLIK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ LTAKDID VAYTAGPGLVGALLVGAT+GRSLA+AW VPA+PVHHMEGHLLAPM Sbjct: 61 EALKEANLTAKDIDGVAYTAGPGLVGALLVGATIGRSLAYAWGVPAVPVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LEDNPP FPFVA+LVSGGH+ ++ V GIG+Y++LGESIDDAAGEAFDKTAKL+GLDYPGG Sbjct: 121 LEDNPPPFPFVAVLVSGGHSMMVEVKGIGEYKILGESIDDAAGEAFDKTAKLMGLDYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 PLLSK+A +GT GRF FPRPMT+ PGLD SFSGLKTF ANTI NG D+QTRADIA AFE Sbjct: 181 PLLSKLAEKGTPGRFKFPRPMTNVPGLDMSFSGLKTFTANTIAANGDDEQTRADIAYAFE 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +AV TL IKCKRAL+QTG KR+V+AGGVSANR LRA+L ++ K G+V+Y R EFCTD Sbjct: 241 EAVCATLAIKCKRALEQTGMKRIVIAGGVSANRRLRAELEKLAHKVGGDVYYPRTEFCTD 300 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 NGAMIAYAGM R K +DL V RPRWP+ +L Sbjct: 301 NGAMIAYAGMQRLKNNEVSDLAVEARPRWPIDQL 334 >UniRef50_P36175 O-sialoglycoprotein endopeptidase n=366 Tax=cellular organisms RepID=GCP_PASHA Length = 325 Score = 523 bits (1347), Expect = e-147, Method: Compositional matrix adjust. Identities = 247/319 (77%), Positives = 282/319 (88%), Gaps = 5/319 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIETSCDETG+AIYD++KGL+ANQLYSQ+ +HADYGGVVPELASRDH+RKT+PLIQ Sbjct: 1 MRILGIETSCDETGVAIYDEDKGLVANQLYSQIDMHADYGGVVPELASRDHIRKTLPLIQ 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ L DID +AYTAGPGLVGALLVG+T+ RSLA+AW+VPA+ VHHMEGHLLAPM Sbjct: 61 EALKEANLQPSDIDGIAYTAGPGLVGALLVGSTIARSLAYAWNVPALGVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE+N PEFPFVALL+SGGHTQL+ V G+GQYELLGESIDDAAGEAFDKT KLLGLDYP G Sbjct: 121 LEENAPEFPFVALLISGGHTQLVKVDGVGQYELLGESIDDAAGEAFDKTGKLLGLDYPAG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR----DNGT-DDQTRADI 235 +SK+A GT RF FPRPMTDRPGLDFSFSGLKTFAANTI+ +NG D+QT+ DI Sbjct: 181 VAMSKLAESGTPNRFKFPRPMTDRPGLDFSFSGLKTFAANTIKANLNENGELDEQTKCDI 240 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A AF+ AVVDT++IKCKRAL+QTG+KRLVMAGGVSAN+ LRA LAEMMKK +GEVFY RP Sbjct: 241 AHAFQQAVVDTILIKCKRALEQTGYKRLVMAGGVSANKQLRADLAEMMKKLKGEVFYPRP 300 Query: 296 EFCTDNGAMIAYAGMVRFK 314 +FCTDNGAMIAY G +R K Sbjct: 301 QFCTDNGAMIAYTGFLRLK 319 >UniRef50_B0TIN7 Probable O-sialoglycoprotein endopeptidase n=130 Tax=Gammaproteobacteria RepID=GCP_SHEHH Length = 338 Score = 510 bits (1314), Expect = e-143, Method: Compositional matrix adjust. Identities = 247/334 (73%), Positives = 284/334 (85%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGIETSCDETGIA+YDDEKGLL++ LYSQVKLHADYGGVVPELASRDHVRK VPLI+ Sbjct: 1 MRVLGIETSCDETGIAVYDDEKGLLSHALYSQVKLHADYGGVVPELASRDHVRKIVPLIR 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ +T +D+D +AYT GPGL+GALLVGA VGR+LAF+WD PAI VHHMEGHLLAPM Sbjct: 61 QALADADMTIEDLDGIAYTKGPGLIGALLVGACVGRALAFSWDKPAIGVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LED+ PEFPF+ALLVSGGH+ L+ V GIG+Y +LGES+DDAAGEAFDKTAKL+GLDYPGG Sbjct: 121 LEDDVPEFPFLALLVSGGHSMLVGVEGIGRYTVLGESVDDAAGEAFDKTAKLMGLDYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P LSK+AA+G + FPRPMTD+PGL+ SFSGLKTFAANTI D+QTRA+IA AFE Sbjct: 181 PRLSKLAAKGVPNSYRFPRPMTDKPGLNMSFSGLKTFAANTIAAEPKDEQTRANIACAFE 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +AVVDTL IKCKRAL QTG+K LV+AGGVSAN LRA L+EMM+ G+V+Y R EFCTD Sbjct: 241 EAVVDTLGIKCKRALKQTGYKNLVIAGGVSANTRLRASLSEMMQGLGGKVYYPRGEFCTD 300 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 NGAMIAYAG+ R KAG DL V +PRWPL L Sbjct: 301 NGAMIAYAGLQRLKAGQVEDLAVKGQPRWPLDTL 334 >UniRef50_A1AXM9 Probable O-sialoglycoprotein endopeptidase n=36 Tax=Proteobacteria RepID=GCP_RUTMC Length = 356 Score = 418 bits (1075), Expect = e-115, Method: Compositional matrix adjust. Identities = 202/332 (60%), Positives = 248/332 (74%), Gaps = 4/332 (1%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIE+SCDETGI +Y E GL+ ++L+S VK+HA+YGGVVPELASRDH+++ +PLI+A L Sbjct: 24 LGIESSCDETGIGLYHSELGLIGHELFSSVKIHAEYGGVVPELASRDHIQRVLPLIKAVL 83 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 + T +D+ +AYTAGPGL GALLVG V +SLA++ D+P++ VHHMEGHLL P+LE+ Sbjct: 84 ADVKFTLQDLSGIAYTAGPGLAGALLVGCAVAKSLAWSLDIPSLAVHHMEGHLLTPLLEE 143 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 + PEFPFVALLVSGGHT LI V IGQY++LGES+DDA GEAFDKTAK+LGL YPGGP L Sbjct: 144 SQPEFPFVALLVSGGHTMLIDVKAIGQYKILGESLDDAVGEAFDKTAKILGLGYPGGPAL 203 Query: 184 SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAV 243 + +A QG G F FP PM RPGLDFSFSGLKTF NT + + DIA+AFE A Sbjct: 204 AMLAEQGNYGAFKFPCPMVGRPGLDFSFSGLKTFVRNTFAKYPSK---KEDIAKAFEVAT 260 Query: 244 VDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGA 303 TLMIKC+RAL+QT + LV+AGGVSAN +LR KL +M +K VFY R EFCTDNGA Sbjct: 261 TQTLMIKCRRALEQTKYATLVVAGGVSANLSLRKKLNQMGQKLDVNVFYPRQEFCTDNGA 320 Query: 304 MIAYAGMVRFKAGAT-ADLGVSVRPRWPLAEL 334 MIA G R G ++++PRW L EL Sbjct: 321 MIALVGYFRLSHGQHDTHHEINIKPRWSLEEL 352 >UniRef50_Q8D283 Probable O-sialoglycoprotein endopeptidase n=11 Tax=Gammaproteobacteria RepID=GCP_WIGBR Length = 340 Score = 407 bits (1045), Expect = e-112, Method: Compositional matrix adjust. Identities = 179/335 (53%), Positives = 248/335 (74%), Gaps = 1/335 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETSCD+TG AIYD EKGL+ +++ SQ +H+ YGGVVPE +S+ H++ PL++ Sbjct: 1 MLILGIETSCDDTGAAIYDLEKGLIIHKVISQNNIHSKYGGVVPEKSSKYHLKNIQPLVE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 K S ++ ID +AYTAGPGLVG+L++GAT SLA+ +P+I ++H+EGHLL PM Sbjct: 61 NIFKNSNISLSKIDGIAYTAGPGLVGSLIIGATFACSLAYTLQIPSIAINHLEGHLLTPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 ++ P+FPF+ L++SG HTQ + IG+Y+++G+ +DDA GEAFDK AKLLG+ YPGG Sbjct: 121 IKYKRPKFPFLGLIISGAHTQFVLAEDIGKYKIIGDCLDDALGEAFDKVAKLLGIKYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADIARAF 239 LS +A QG + RF FPRPMT +PG++FSFSGLKT+A N + + D+QT+ DIARAF Sbjct: 181 KKLSIIAKQGNSKRFFFPRPMTKKPGINFSFSGLKTYAKNLVSSFSKIDNQTKCDIARAF 240 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ED+++DT++IKCKRALD T K L+++GGVSAN LR L +MK R G++F+++ CT Sbjct: 241 EDSIIDTVIIKCKRALDITNSKILLISGGVSANEPLRKNLRNLMKSRNGKLFFSKKSLCT 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 DN AMIAY G +RFK T DL V + P+W L ++ Sbjct: 301 DNAAMIAYVGSIRFKKNKTKDLSVLINPKWSLEDI 335 >UniRef50_C6P1W3 Metalloendopeptidase, glycoprotease family n=1 Tax=Sideroxydans lithotrophicus ES-1 RepID=C6P1W3_9PROT Length = 383 Score = 401 bits (1030), Expect = e-110, Method: Compositional matrix adjust. Identities = 207/373 (55%), Positives = 256/373 (68%), Gaps = 41/373 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDETGIA+Y E+GLLA+ L+SQ+ LH +YGGVVPELASRDHVR +PLI++A Sbjct: 7 ILGIESSCDETGIALYHTERGLLAHTLHSQIALHNEYGGVVPELASRDHVRHALPLIRSA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+++G DIDA+AYT GPGL GALLVG+++ +LA+ DVP I VHH+EGHLL+P+L Sbjct: 67 LQKAGCALSDIDAIAYTQGPGLSGALLVGSSIACALAYTLDVPTIGVHHLEGHLLSPLLS 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 PEFPFVALLVSGGHTQL+ V G+G Y LLGES+DDAAGEAFDK+AKLLGLDYPGG L Sbjct: 127 RPAPEFPFVALLVSGGHTQLMRVDGVGHYTLLGESVDDAAGEAFDKSAKLLGLDYPGGAL 186 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTF------------------------- 217 LSK+A +GT GRF PRPM LDFSFSGLKT Sbjct: 187 LSKLAQRGTPGRFKLPRPMLHSGNLDFSFSGLKTAVLTLVNQQIDIPHPNPDGTTSHSTK 246 Query: 218 ---------------AANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKR 262 A ++R+ T +QTRADIA A ++A+VD L+ K AL QTG + Sbjct: 247 PASGQVAGYLPEGEGANESLREFPTPEQTRADIAHAAQEAIVDVLVNKALAALKQTGLNQ 306 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG-ATADL 321 LV+AGGV AN+ LR++L + K G VFY EFCTDNGAMIA+AG +R + A D Sbjct: 307 LVVAGGVGANQLLRSRLNASVGKHDGNVFYPELEFCTDNGAMIAFAGAMRLQQQVAQRDY 366 Query: 322 GVSVRPRWPLAEL 334 +V+PRW L E+ Sbjct: 367 RFNVKPRWDLREM 379 >UniRef50_B0TX13 Probable O-sialoglycoprotein endopeptidase n=19 Tax=Francisella RepID=GCP_FRAP2 Length = 336 Score = 347 bits (891), Expect = 3e-94, Method: Compositional matrix adjust. Identities = 172/333 (51%), Positives = 237/333 (71%), Gaps = 5/333 (1%) Query: 1 MRVLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M VLGIE+SCDETG+AIYD K L+A+ LYSQ+ LH YGGVVPELASR+H+ K L Sbjct: 1 MLVLGIESSCDETGLAIYDYTSKTLVADVLYSQIDLHKKYGGVVPELASREHIAKLNILT 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + L + + D+ +AYTA PGL+GAL+VGAT ++L ++ + VHH+EGHLL+P Sbjct: 61 KELLSNANINFNDLSCIAYTAMPGLIGALMVGATFAKTLGLIHNIDTVAVHHLEGHLLSP 120 Query: 120 MLEDNPP-EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 +L+ + ++PFVALLVSGGHTQL V G+Y LLGESIDDAAGEAFDKTAKLLG+ YP Sbjct: 121 LLDQSSDIKYPFVALLVSGGHTQLFEVREFGEYSLLGESIDDAAGEAFDKTAKLLGMSYP 180 Query: 179 GGPLLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLKTFAANT-IRDNGTDDQTRADIA 236 GG ++ +A + T ++ PRPM ++P LDFSFSGLKT NT + + +A++ Sbjct: 181 GGVEVANLAEKATDKKKYDLPRPMKNKPNLDFSFSGLKTAVLNTWYSETDQSYENKANLC 240 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 AF++A +D L+ KC++AL +TG KRLV++GGVSAN+ LR+KL + K + E+F+ + Sbjct: 241 YAFQEAAIDVLVTKCEKALQKTGNKRLVISGGVSANKLLRSKLDILSKNKGYEIFFPPMK 300 Query: 297 FCTDNGAMIAYAGMVRF-KAGATADLGVSVRPR 328 +CTDNGAMIA AG R+ + ++L ++V+ R Sbjct: 301 YCTDNGAMIALAGAYRYANSFRDSNLEINVKAR 333 >UniRef50_Q058D1 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Buchnera aphidicola str. Cc (Cinara cedri) RepID=GCP_BUCCC Length = 343 Score = 325 bits (832), Expect = 2e-87, Method: Compositional matrix adjust. Identities = 150/340 (44%), Positives = 228/340 (67%), Gaps = 9/340 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCD+T +AIYD + GL+ +Q +Q +H+ Y G+VPELA+R H+ + LI+ Sbjct: 1 MKILGIETSCDDTSVAIYDKKLGLIDHQTLNQNSVHSKYHGIVPELAARSHLNQLNFLIK 60 Query: 61 AALKE------SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + S K AVAYT GPGL G+++V + RS+A + D+P I ++H+EG Sbjct: 61 NIFSKYFLYNSSNFKKKFFKAVAYTVGPGLSGSIVVHSC--RSIALSLDIPYILINHLEG 118 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HLL+ ML FPF+ALLVSG +TQLI +G+Y +LG+++DDA G FD AK+LG Sbjct: 119 HLLSVMLSYKKNLFPFLALLVSGANTQLIYAKYLGKYIILGQTLDDAVGNVFDYIAKILG 178 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 L +PGG LS +A G +G++ FPRPMT L+FSFSGLKT N I ++ Q +++ Sbjct: 179 LGFPGGKNLSDLAKYGISGKYFFPRPMTKYSNLNFSFSGLKTHVKNVILNSSDSFQEKSN 238 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 IA++FE+A+VDTL+IKCK A+ + K ++ GGVS+NR LR KL +++ K + ++++++ Sbjct: 239 IAKSFEEAIVDTLIIKCKLAIKKIKVKNFLVCGGVSSNRLLRIKLKKLIYKNQRKLYFSK 298 Query: 295 PEFCTDNGAMIAYAGMVRFKAGA-TADLGVSVRPRWPLAE 333 +FCTDN MIAY G ++++ G + + S+ P +++ Sbjct: 299 KKFCTDNAGMIAYLGFLKYQQGMYSYNKSFSIYPNLLISD 338 >UniRef50_Q127W3 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Proteobacteria RepID=GCP_POLSJ Length = 347 Score = 319 bits (818), Expect = 7e-86, Method: Compositional matrix adjust. Identities = 182/339 (53%), Positives = 232/339 (68%), Gaps = 8/339 (2%) Query: 1 MRVLGIETSCDETGIAIYD----DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIE+SCDETG+A+ D + LL++ L+SQ+++H YGGVVPELASRDH+R+ + Sbjct: 1 MLVLGIESSCDETGVALVDAGGSEVPRLLSHALFSQIQMHQAYGGVVPELASRDHIRRVL 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 PL + + ++G + +D VAYT GPGL GALLVGA V +LA A P + VHH+EGHL Sbjct: 61 PLTRQVMAQAGRSLAQVDVVAYTRGPGLAGALLVGAGVACALAAALGKPVMGVHHLEGHL 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L+P L +PP FPFVALLVSGGHTQL+ V +G YELLGE+IDDAAGEAFDK+AKL+GL Sbjct: 121 LSPFLSADPPVFPFVALLVSGGHTQLMRVDRVGSYELLGETIDDAAGEAFDKSAKLMGLP 180 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-ADI 235 YPGGP L+ +A QG F PRP+ LDFSF+GLKT + G + + R AD+ Sbjct: 181 YPGGPHLADLARQGDGTAFKLPRPLLHSGDLDFSFAGLKTAVLTQAKKLGPELENRKADL 240 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A A + A+VD L+ K A+ QTG KRLV+AGGV AN LR++L ++R V Y Sbjct: 241 AAATQAAIVDVLVKKSLAAMAQTGLKRLVVAGGVGANALLRSQLNAACQQRGIRVHYPEL 300 Query: 296 EFCTDNGAMIAYAGMVRFKAG-ATADLGVS--VRPRWPL 331 CTDNGAMIA A +R +AG T G + V+PRW L Sbjct: 301 HLCTDNGAMIALAAGMRLQAGLETLQRGYTFDVKPRWSL 339 >UniRef50_A0L5L8 Probable O-sialoglycoprotein endopeptidase n=24 Tax=Bacteria RepID=GCP_MAGSM Length = 353 Score = 310 bits (795), Expect = 4e-83, Method: Compositional matrix adjust. Identities = 174/347 (50%), Positives = 223/347 (64%), Gaps = 14/347 (4%) Query: 1 MRVLGIETSCDETGIAIYDD-EKG------LLANQLYSQVKLHADYGGVVPELASRDHVR 53 +RVLGIE+SCDET A+ + E G + +N ++SQ+++HA YGGVVPELASR H+R Sbjct: 2 LRVLGIESSCDETAAAVVEGAEHGHPHGVVVRSNVVWSQLEVHALYGGVVPELASRAHIR 61 Query: 54 KTVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 P+I+ AL E+G+ + +DA+A T PGLVGALLVG + LA A D P +PVHHME Sbjct: 62 HIQPVIEQALAEAGVRPQQLDAIAVTVAPGLVGALLVGVAAAQGLAVALDKPLVPVHHME 121 Query: 114 GHLLAPMLEDN---PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTA 170 GHL++P L EFPFVALLVSGGHT L+ G Y+LLG++ DDA GEAFDK A Sbjct: 122 GHLMSPFLMAGVVPAMEFPFVALLVSGGHTLLLHARDFGDYQLLGQTRDDAVGEAFDKGA 181 Query: 171 KLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD- 229 ++LGL YPGGP ++ +A G FPR + DR DFSFSGLKT + + Sbjct: 182 RMLGLGYPGGPEVAALAQSGDRQAVAFPRVLLDRSQFDFSFSGLKTALRTHLLKFPPESG 241 Query: 230 -QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 + AD+A ++++A+VDTL+IK A G RLV+AGGV ANR LR KLA+ K +G Sbjct: 242 GPSLADVAASYQEAIVDTLVIKSLSACRHVGVSRLVIAGGVGANRRLREKLAKQALK-QG 300 Query: 289 EVFYARP-EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 YA P CTDNGAMIA AG+ R G A V+ PR P+ EL Sbjct: 301 VQLYAPPIHLCTDNGAMIASAGVCRLARGDQARGVVNAVPRLPIHEL 347 >UniRef50_C0QTG9 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Bacteria RepID=GCP_PERMH Length = 344 Score = 293 bits (749), Expect = 8e-78, Method: Compositional matrix adjust. Identities = 141/332 (42%), Positives = 217/332 (65%), Gaps = 9/332 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCD+T +++YD E+GLL+N + SQ+K+H ++GGV P+LA+R+H + +P++ Sbjct: 1 MKILGIETSCDDTAVSVYDSEEGLLSNVVSSQIKMHEEWGGVYPDLAAREHTKNIIPVLD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ + KDID +A T PGL+ +L++G +V ++L++ + P IPVHH+E H+ A Sbjct: 61 RALKEASVNIKDIDGIAVTVAPGLIVSLVIGISVAKTLSWIYRKPLIPVHHIEAHIFASF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + ++PF+AL+VSGGHT+L + G Y LG ++DDA GEA+DK A++LGL YPGG Sbjct: 121 ITEK-IDYPFIALVVSGGHTELYLIKGFEDYRYLGGTLDDAVGEAYDKVARMLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG---LDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 P++ +++ +G PRP+ + G +FSFSGLKT +R+ + DIAR Sbjct: 180 PVIDRLSKEG-EDTVKLPRPLINDRGKNRFNFSFSGLKT---AVLREIQKGVYRKEDIAR 235 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 +F++A D L+ K A+ + K +V+AGGVSAN LR K E + + ++ Sbjct: 236 SFQEAATDVLLAKTIDAMKEFNIKNVVIAGGVSANSRLREKFKEAEENHGIKAYFPPLYL 295 Query: 298 CTDNGAMIAYAGMVRFK-AGATADLGVSVRPR 328 CTDNGAM+A+ G RFK +G T D + R Sbjct: 296 CTDNGAMVAFTGYKRFKESGTTVDYSFEGKAR 327 >UniRef50_C4Z311 O-sialoglycoprotein endopeptidase n=13 Tax=Bacteria RepID=C4Z311_EUBE2 Length = 352 Score = 292 bits (748), Expect = 9e-78, Method: Compositional matrix adjust. Identities = 146/333 (43%), Positives = 205/333 (61%), Gaps = 2/333 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCDET A+ + + +L+N + +Q+ +H +YGGVVPE+ASR H+ P+I+ A Sbjct: 18 ILAIESSCDETAAAVVKNGREVLSNVINTQIAIHTEYGGVVPEIASRKHIENINPVIRKA 77 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+++G+T DIDA+ T GPGLVGALLVG +++AFA + P + VHH+EGH+ A +E Sbjct: 78 LEDAGVTLDDIDAIGVTYGPGLVGALLVGVAEAKAIAFAKNKPLVGVHHIEGHISANYVE 137 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + E PFVAL+VSGGHT L+ V G+YE++G + DDAAGEAFDK A+ +GL YPGGP Sbjct: 138 NKELEPPFVALVVSGGHTHLVKVNDYGEYEIVGRTRDDAAGEAFDKVARAIGLGYPGGPK 197 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARAFE 240 + K+A +G FPR D DFSFSG+K+ N I + RAD+A +F+ Sbjct: 198 IDKLAKEGNPDAIEFPRAHVDDAPYDFSFSGIKSAVLNYINSANMQGKEINRADVAASFQ 257 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 AVVD L+ + R + G +L +AGGV++N LRA + E K + P CTD Sbjct: 258 KAVVDALVSRAVRLAKECGMDKLAIAGGVASNSALRAAIQEACAKNNIGFYSPSPILCTD 317 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 N AMI A + G ++ P L E Sbjct: 318 NAAMIGAAAYYEYIKGVRHGYDLNAVPNLKLGE 350 >UniRef50_A5G3X1 Probable O-sialoglycoprotein endopeptidase n=20 Tax=Bacteria RepID=GCP_GEOUR Length = 343 Score = 292 bits (748), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 150/333 (45%), Positives = 211/333 (63%), Gaps = 3/333 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IE+SCDET A+ D + +L+N + SQ+ +HA YGGVVPE+ASR H+ +I+ Sbjct: 1 MLLLAIESSCDETAAAVVRDGRIILSNIVASQISVHAGYGGVVPEIASRKHLETISTVIE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ +G++ D+D +A T GPGL GALLVG + +++A+A VP V+H+E H+LA Sbjct: 61 EALQAAGVSLTDVDGIAVTQGPGLAGALLVGISTAKAMAYALGVPIAGVNHIESHILAIF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE + EFPFVAL VSGGHT L V +G+Y+ LG+++DDAAGEAFDK AKLLGL YPGG Sbjct: 121 LERS-IEFPFVALAVSGGHTHLYLVEAVGRYKTLGQTLDDAAGEAFDKVAKLLGLPYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN--GTDDQTRADIARA 238 L+ ++AA+G FPRP+ +FSFSGLKT N ++ N D + D+ + Sbjct: 180 ALIDRLAAEGDPEAIRFPRPLMRDESFNFSFSGLKTSVLNYLQKNPAAADGRALNDLCAS 239 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+ AV D L+ K A+ TG KR+V+AGGV+ N LR +++ + + + E+ P C Sbjct: 240 FQAAVCDVLVSKTAAAVSATGIKRVVVAGGVACNNGLRREMSRLAELKGIELHIPSPLLC 299 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +DN AMIA G + + P WPL Sbjct: 300 SDNAAMIAVPGDYYLSNNILSGFDIDALPVWPL 332 >UniRef50_B3WUZ1 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Escherichia coli B171 RepID=B3WUZ1_ECOLX Length = 332 Score = 292 bits (747), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 145/326 (44%), Positives = 210/326 (64%), Gaps = 4/326 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IETSCDETG+A++ ++ L+++ LYSQV +H+ +GG+VPE+ASR + PLI+ Sbjct: 2 LLAIETSCDETGVALFSEDGKLISHLLYSQVAIHSPFGGIVPEIASRKQLEVLYPLIKEL 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 LK++ + + AVA T GPGL+G+LLVG ++ ++++FA +P I V H++ HLLA LE Sbjct: 62 LKQNNIEISQLKAVAATFGPGLIGSLLVGVSLAKAISFALKIPLIAVDHLQAHLLAVFLE 121 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 EFPF+ LLVSGGHT L + ++ ++G + DDAAGEAFDK AKLLGL YPGGP+ Sbjct: 122 KE-IEFPFIGLLVSGGHTALFLINSFFEFYVIGHTKDDAAGEAFDKVAKLLGLPYPGGPI 180 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 +S++A +G PRP+ + LDFSFSGLKT N I+++ + D+ FE+A Sbjct: 181 ISQLAEKGDPKAINLPRPLLEDKSLDFSFSGLKTAVLNYIKNHSYRVE---DLCAGFEEA 237 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 V D L+ K RA+D R+V+AGGV+AN+ LR + E E+++ EFCTDN Sbjct: 238 VCDVLVYKTFRAVDLFKVPRVVVAGGVAANKRLRQRFREKAFNTGVEIYFPSLEFCTDNA 297 Query: 303 AMIAYAGMVRFKAGATADLGVSVRPR 328 AM+ G +++ ADL R Sbjct: 298 AMVGLLGYKQWQEKKYADLNTEAYAR 323 >UniRef50_C1A601 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=GCP_GEMAT Length = 357 Score = 291 bits (745), Expect = 2e-77, Method: Compositional matrix adjust. Identities = 165/332 (49%), Positives = 206/332 (62%), Gaps = 16/332 (4%) Query: 1 MRVLGIETSCDETGIAIYD--DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 MRVLGIETSCDET A+ E L + + +H +GGVVPE+ASR H+ VP Sbjct: 1 MRVLGIETSCDETSAAVVSGTPEAMTLESCVILSQDVHRLFGGVVPEIASRQHLIGIVPA 60 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + AAL+E+ ++ DIDAVA T PGLVGALLVG + +SLA ++D P +PVHH+EGHL A Sbjct: 61 VAAALQEAQVSLSDIDAVAVTHAPGLVGALLVGTSFAKSLALSYDKPLVPVHHLEGHLFA 120 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 +LE PF ALLVSGGHT L+ V G+Y LLG++ DDA GEAFDK AKLLGL YP Sbjct: 121 TLLEHPDAAPPFTALLVSGGHTLLLDVPAWGEYRLLGQTRDDAVGEAFDKVAKLLGLPYP 180 Query: 179 GGPLLSKMAAQGTAGRFVFP----RPM-------TDRPGLDFSFSGLKTFAANTIRD--- 224 GG + ++AA A P RPM D D SFSGLKT +RD Sbjct: 181 GGRPIEQLAATAEAPVHKHPHRFARPMLRKSSTPADEDYYDCSFSGLKTAVLYAVRDAER 240 Query: 225 NGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 GT D RA IAR F+DAV+DTL+ K RA Q R+V+ GGV+ N+ L+A + M+ Sbjct: 241 TGTLDDARASIARGFQDAVIDTLVEKVVRAARQHRRSRVVLGGGVACNQALQAAMRNAME 300 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 +R+G VF P TDN AMIA AG+ R + G Sbjct: 301 QRKGHVFAPSPRLATDNAAMIAAAGIFRLQRG 332 >UniRef50_B9KXJ0 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Chloroflexi RepID=GCP_THERP Length = 365 Score = 283 bits (723), Expect = 9e-75, Method: Compositional matrix adjust. Identities = 166/358 (46%), Positives = 209/358 (58%), Gaps = 31/358 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETSCDET A+ D + +L+N + SQV LH YGGVVPELASR HV VP++ Sbjct: 1 MIILGIETSCDETAAAVVRDGRFVLSNIIRSQVDLHQRYGGVVPELASRRHVTSIVPVLD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++G+ IDA+A T GPGL G+LLVG V ++LAF W+ P IPV+H+EGH+ A Sbjct: 61 LALEQAGIGPSAIDAIAVTEGPGLAGSLLVGINVAKTLAFVWEKPLIPVNHLEGHIYANW 120 Query: 121 L----EDNPPE--FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 L +D PE FP V L+VSGGHT+L+ + G G Y LLG ++DDAAGEAFDK A+LLG Sbjct: 121 LTLPGQDEVPEPTFPLVCLIVSGGHTELVLMRGHGDYVLLGRTLDDAAGEAFDKAARLLG 180 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-- 232 L +PGGP + K A QG GRF PR DFSFSGLKT + R Sbjct: 181 LGFPGGPAIQKAAEQGRPGRFSLPRAWLGE-SYDFSFSGLKTALLRVLEQYQRRPARRVA 239 Query: 233 -------------------ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANR 273 AD+A F+ AVV+ L K RA + G +++AGGV+AN Sbjct: 240 AGQPFPEYVAPEYGPSVPIADLAAEFQAAVVEVLAEKTARAAREFGATMVLLAGGVAANA 299 Query: 274 TLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 LR +L E+ V Y P CTDN AMIA A + G ADL + V PL Sbjct: 300 ALRQRLREISPV---PVRYPPPILCTDNAAMIAGAAYYLAQRGVRADLDLDVHAHLPL 354 >UniRef50_C1SJZ8 Metalloendopeptidase, putative, glycoprotease family n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJZ8_9BACT Length = 327 Score = 282 bits (722), Expect = 1e-74, Method: Compositional matrix adjust. Identities = 145/330 (43%), Positives = 202/330 (61%), Gaps = 9/330 (2%) Query: 1 MRVLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M +LGIE+SCDET +A+YD + + A SQ +LH+ +GGVVPE+ASR+H+ K L Sbjct: 1 MIILGIESSCDETSLAVYDSVNRSVKATFTSSQAELHSKFGGVVPEVASRNHILKIESLF 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + + E+G+T +DIDA+ T PGL+GAL VG + ++L +A +P IPV+H+ H+LA Sbjct: 61 EQCMTEAGITPQDIDAIGVTNAPGLIGALFVGVSFAKALGYALKIPVIPVNHLSAHILAS 120 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L + + P++AL++SGGHT + V +ELL +IDDAAGE+FDK AK+LGL YPG Sbjct: 121 ELTNQELKAPYLALIISGGHTHIYDVDEAYNFELLARTIDDAAGESFDKVAKMLGLGYPG 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 GP + K+A G + P + +P DFSFSGLKT N I D D ADIA +F Sbjct: 181 GPAIEKLAESGDENKVTLPIAIKKKP--DFSFSGLKTAVLNKINDKSESD---ADIAASF 235 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + V +TL +K R + G ++V+AGGV+ N +R M+K+ EVF+ P CT Sbjct: 236 QKTVAETLTLKTLRMAESLGRNKIVVAGGVACNGYIRRAF---MEKQGYEVFFPSPRLCT 292 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 DNG MIAYA F A L + R Sbjct: 293 DNGDMIAYAASKFFGQRKFASLDETAHDRM 322 >UniRef50_Q3YS67 Probable O-sialoglycoprotein endopeptidase n=24 Tax=Rickettsiales RepID=GCP_EHRCJ Length = 350 Score = 282 bits (721), Expect = 1e-74, Method: Compositional matrix adjust. Identities = 145/340 (42%), Positives = 210/340 (61%), Gaps = 12/340 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCDET +AI + K +L++++ SQ K HA+YGGVVPE+ASR H+ L + Sbjct: 8 VLGIETSCDETAVAIVNSNKEVLSHKILSQ-KEHAEYGGVVPEIASRAHINYLYDLTVSC 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH-LLAPML 121 ++ES L+ +IDAVA T+GPGL+G L+VG + + +A P I ++H+E H L+ M Sbjct: 67 IEESQLSLNNIDAVAVTSGPGLIGGLIVGVMIAKGIASVTGKPIIEINHLEAHALIVRMF 126 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + FPF+ L++SGGH Q + V +G Y LG S+DD+ GE FDK AK+L L YPGGP Sbjct: 127 YE--INFPFLLLIISGGHCQFLIVYNVGCYHKLGSSLDDSLGEVFDKVAKMLNLGYPGGP 184 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG-TDDQTRADIARAFE 240 ++ K + G + FV PR +T R G DFSFSGLKT N I ++ D++ DI+ +F+ Sbjct: 185 VIEKKSLSGDSKSFVLPRALTGRCGCDFSFSGLKTAVRNIIMNHEYIDNKLICDISASFQ 244 Query: 241 DAVVDTLM------IKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + V D L+ I +A+D+ +LV+ GGV+AN+ LR ++ E+FY Sbjct: 245 ECVGDILVNRINNAIAMSKAIDKR-IDKLVVTGGVAANKLLRERMLRCASDNNFEIFYPP 303 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 + CTDNG MI +AG+ ++L + + RWPL L Sbjct: 304 SKLCTDNGIMIGWAGIENLVKDYVSNLDFAPKARWPLESL 343 >UniRef50_B2V910 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Aquificales RepID=GCP_SULSY Length = 337 Score = 279 bits (713), Expect = 1e-73, Method: Compositional matrix adjust. Identities = 148/338 (43%), Positives = 213/338 (63%), Gaps = 12/338 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCD+T IA+YD EKG+ +N + SQ+ +HA +GGV PE+A+R+H + +P++ Sbjct: 1 MVVLGIETSCDDTSIAVYDSEKGIPSNVVTSQL-IHAQFGGVYPEIAAREHTKNFLPVLD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ +T DIDA+A T PGL+ +L+ G + ++L+F+ P IPVHH+E H+ A Sbjct: 60 KALRDASITLSDIDAIATTFMPGLIVSLVAGVSGAKTLSFSLKKPLIPVHHIEAHIFANF 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + E+PF+AL+VSGGHT+LI V Y LG ++DDA GE +DK A+ LGL +PGG Sbjct: 120 ITKE-IEYPFLALVVSGGHTELILVKEFEDYIYLGGTLDDAVGEVYDKVARALGLGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTD--RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 PL+ K+A +G FPRP+ + +FSFSGLK+ IR+ + DI ++ Sbjct: 179 PLIDKLAKEGKEA-IKFPRPLLNDEENKYNFSFSGLKS---AVIREINKGIYKKEDITKS 234 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++AVVD L+ K A + G R+V+AGGVSAN LR E + + EV + C Sbjct: 235 FQNAVVDVLVKKTVLACKEFGINRVVVAGGVSANSQLR---EEFLNIKDLEVHFPPMHLC 291 Query: 299 TDNGAMIAYAGMVRFK-AGATADLGVSVRPRWPLAELP 335 TDNGAM+AY G RFK G + L + R + + P Sbjct: 292 TDNGAMVAYTGYKRFKEKGISVSLDFEAKARCRIDKFP 329 >UniRef50_B6BRQ7 O-sialoglycoprotein endopeptidase n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BRQ7_9RICK Length = 357 Score = 279 bits (713), Expect = 1e-73, Method: Compositional matrix adjust. Identities = 146/341 (42%), Positives = 221/341 (64%), Gaps = 12/341 (3%) Query: 3 VLGIETSCDETGIA-IYDDEKGL---LANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +LGIE+SCDET + I ++E+G+ L+N + SQV++H ++GGVVPELA+R H+ K + Sbjct: 7 ILGIESSCDETAASLITENEQGIPIVLSNIISSQVEVHKEFGGVVPELAARSHMEKIDWI 66 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 ++ A+ +SG ++IDAVA TAGPGL+ L VG + G++ A A + P I V+H+EGH L+ Sbjct: 67 VEKAINDSGRKIEEIDAVASTAGPGLIVCLSVGLSFGKAFASALNKPFIAVNHLEGHALS 126 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 P L ++ +P++ LL+SGGH+Q ++V +G+Y+ LG +IDDA GEAFDKTAKLLG+++P Sbjct: 127 PKL-NSKLNYPYLVLLISGGHSQFLNVQDLGKYKRLGTTIDDALGEAFDKTAKLLGVEFP 185 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP + MA +G + ++ P+P+ ++ G + SF+GLKT A I N DQ + D+A + Sbjct: 186 GGPQIEIMAEKGDSNKYDLPKPIFNKGGCNLSFAGLKT-AILKITKNIKTDQEKFDLAAS 244 Query: 239 FEDAVVDTLMIKCKRALD----QTGFKR--LVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 F+ V + L K K A + Q K V+AGGV+AN+ +R L + + + + Sbjct: 245 FQKTVEEILYKKTKIAFNEFEKQNKLKDKIFVVAGGVAANKKIRTMLINLCNENNYKGIF 304 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 E C DN AMIA G+ +FK + L +PRWPL E Sbjct: 305 PPIELCGDNAAMIAMVGLEKFKLKQFSALDHPAKPRWPLDE 345 >UniRef50_Q11TP2 Probable O-sialoglycoprotein endopeptidase n=87 Tax=Bacteria RepID=GCP_CYTH3 Length = 343 Score = 279 bits (713), Expect = 1e-73, Method: Compositional matrix adjust. Identities = 148/331 (44%), Positives = 204/331 (61%), Gaps = 9/331 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCDET A+ D +L N + SQ ++H YGG+VPELASR H + +P++ A Sbjct: 10 LLAIESSCDETAAAVIQD-GNILCNIVASQ-RIHEKYGGIVPELASRAHQQHIIPVVAQA 67 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ + D++AVA T+GPGL+GALLVG + ++ A A +P I V+HM+ H+LA + Sbjct: 68 LLEANIQKSDLNAVACTSGPGLLGALLVGVSFSKAFASALHIPVIKVNHMKAHILAHFIG 127 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 D P FPF+ + VSGGHTQL+ V + E++GE+ DDA GEAFDKTAKL+GL YPGGPL Sbjct: 128 DVKPSFPFICMTVSGGHTQLVIVRNYLEMEVVGETQDDAVGEAFDKTAKLMGLPYPGGPL 187 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT-----FAANTIRDNGTDDQTRADIAR 237 + A QG FP P D PG ++SFSG+KT NT D + DI Sbjct: 188 IDSYAKQGNP--LAFPFPTVDMPGYNYSFSGIKTAFMYFLKKNTAVDPDFIQKNLPDICA 245 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + + A++D LM K KR + TG R+ +AGGVSAN LR + + ++ +V+ E+ Sbjct: 246 SVQHALIDVLMRKLKRLVVDTGINRVAIAGGVSANSGLRKAMEQKREQEGWDVYIPAFEY 305 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 CTDN AMIA AG ++ A +S PR Sbjct: 306 CTDNAAMIAVAGYHQYLENDFAGWDLSPEPR 336 >UniRef50_A9FDL0 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Deltaproteobacteria RepID=GCP_SORC5 Length = 356 Score = 276 bits (705), Expect = 1e-72, Method: Compositional matrix adjust. Identities = 165/346 (47%), Positives = 212/346 (61%), Gaps = 16/346 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGIETSCDET A+ + +L++ + SQV LHA YGGVVPE+A+RDH R VP+++ Sbjct: 1 MRVLGIETSCDETAAAVVTEGGDVLSDVVRSQVALHAPYGGVVPEVAARDHARAVVPVVR 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL +G++A D+D +A T+ PGL GALLVG + LA+A P + V H+ GHLLA Sbjct: 61 EALSRAGVSAADLDGIAVTSRPGLAGALLVGLQAAKGLAWAAGKPLVGVDHLVGHLLAVF 120 Query: 121 L---------EDNPPEFPFVALLVSGGHTQLISVTG--IGQYELLGESIDDAAGEAFDKT 169 L E P FP+VALL SGGHT + V G +G LG + DDAAGEAFDK Sbjct: 121 LRRGGAPLSDERERPSFPYVALLASGGHTAIYRVDGPALGAIRELGATRDDAAGEAFDKV 180 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFV--FPRPMTDRPGLDFSFSGLKTFAANTIRDNGT 227 AKLLGL YPGGP++ ++AA G A P M + L+FSFSG+K+ A + G Sbjct: 181 AKLLGLGYPGGPVVDRLAAGGDAAAAADAVPALMARKESLEFSFSGIKSSVARHVAKRGR 240 Query: 228 -DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 + Q D+ AF+ AVVD L+ K RA G R+V+ GGV+AN+ LRAK+A ++R Sbjct: 241 PEGQALRDLCAAFQGAVVDALVQKTVRAARAEGIGRVVLGGGVAANQGLRAKMAAACERR 300 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGA--TADLGVSVRPRWP 330 +F CTDNGAMIAYAG +R AG T DL R P Sbjct: 301 GLALFVPPLASCTDNGAMIAYAGALRLAAGERDTLDLAPETRTALP 346 >UniRef50_Q4FNV6 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Alphaproteobacteria RepID=GCP_PELUB Length = 357 Score = 275 bits (704), Expect = 1e-72, Method: Compositional matrix adjust. Identities = 142/341 (41%), Positives = 219/341 (64%), Gaps = 16/341 (4%) Query: 3 VLGIETSCDETGIAIY-DDEKGL---LANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +LGIE+SCDET +I ++E+G+ L++ + SQV +H ++GGVVPELA+R H+ K + Sbjct: 7 ILGIESSCDETAASIITENEQGMPTILSSIVSSQVDVHKEFGGVVPELAARSHMEKIDLI 66 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + A +SG+ +D+DA+A TAGPGL+ L VG + G+++A + + P I V+H+EGH L+ Sbjct: 67 TKKAFDKSGVKMEDLDAIAATAGPGLMVCLSVGLSFGKAMASSLNKPFIAVNHLEGHALS 126 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 P L ++ +P++ LL+SGGHTQ +SV G+G Y+ LG +IDDA GEAFDKTAKLLG+++P Sbjct: 127 PKL-NSELNYPYLLLLISGGHTQFLSVQGLGNYKRLGTTIDDAVGEAFDKTAKLLGIEFP 185 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP + A +G ++ P+P+ + G + SF+GLKT A I +Q + D+A + Sbjct: 186 GGPQIEVYAKKGDPNKYELPKPIFHKGGCNLSFAGLKT-AVLKISKQIKTEQEKYDLAAS 244 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRL--------VMAGGVSANRTLRAKLAEMMKKRRGEV 290 F+ + + L K K A ++ FK++ V+AGGV+AN+ +R L + K+ E Sbjct: 245 FQKTIEEILYKKSKIAFEE--FKKMNTINKNKFVVAGGVAANKRIREVLTNLCKEEEFEA 302 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 + C DN AMIA G+ +FK ++L +PRWPL Sbjct: 303 IFPPINLCGDNAAMIAMVGLEKFKLKQFSELDSPAKPRWPL 343 >UniRef50_Q18CP0 Probable O-sialoglycoprotein endopeptidase n=22 Tax=Bacteria RepID=GCP_CLOD6 Length = 338 Score = 274 bits (700), Expect = 4e-72, Method: Compositional matrix adjust. Identities = 136/334 (40%), Positives = 205/334 (61%), Gaps = 6/334 (1%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 L IE+SCDET ++ + + +L+N + +Q++ H +GGVVPE+ASR HV ++Q AL Sbjct: 7 LAIESSCDETAASVLKNGREVLSNIISTQIETHKKFGGVVPEVASRKHVENIDIVVQEAL 66 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 ++ + DID +A T GPGLVGALLVG + ++LA+ ++P + V+H+EGHL A +E Sbjct: 67 DKANIGFNDIDHIAVTYGPGLVGALLVGLSYAKALAYTLNIPLVGVNHIEGHLSANYIEH 126 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 + PF+ L+VSGGHT L+ V G+YE+LG++ DDA+GEAFDK ++ + L YPGGP++ Sbjct: 127 KDLKPPFITLIVSGGHTHLVEVKDYGKYEILGKTRDDASGEAFDKISRAMNLGYPGGPII 186 Query: 184 SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG----TDDQTRADIARAF 239 +A G FPR + DFSFSGLK+ N + NG ++ D+A +F Sbjct: 187 DNLAKNGNKHAIEFPRAYLEEDSYDFSFSGLKSSVLNYL--NGKRMKNEEIVVEDVAASF 244 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ++AVV+ L K +A+ G+ + ++GGV++N LRAK+ E+ K V Y CT Sbjct: 245 QEAVVEVLSTKALKAVKDKGYNIITLSGGVASNSGLRAKITELAKDNGITVKYPPLILCT 304 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 DN AMI AG F G T D+ ++ P + + Sbjct: 305 DNAAMIGCAGYYNFINGKTHDMSLNAVPNLKINQ 338 >UniRef50_D0RQS5 Putative glycoprotease GCP n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RQS5_9RICK Length = 358 Score = 272 bits (696), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 138/339 (40%), Positives = 210/339 (61%), Gaps = 13/339 (3%) Query: 4 LGIETSCDETGIAIYDDEKG----LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 LGIETSCDET A+ K +L+N + SQ +H +GGVVPELA+R H K +I Sbjct: 8 LGIETSCDETAAALVKKSKNGKVKILSNVVSSQEIVHKKFGGVVPELAARAHSEKIDLII 67 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + A+K+S ++ ID VA TAGPGL+ L+VG T G+++A A P +H+EGH L Sbjct: 68 KEAIKKSKVSIHQIDGVACTAGPGLLICLMVGMTAGKTIASALKKPFFGTNHLEGHALT- 126 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 M P +FP++ LL+SGGH+Q +SV G+G+Y+ LG +IDDA GEAFDKTAK+LG+++PG Sbjct: 127 MGLIRPVKFPYLLLLISGGHSQFLSVEGVGKYKRLGTTIDDALGEAFDKTAKILGIEFPG 186 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 GP + A G F P+P+ + G + S++GLKT + + N Q + D+A +F Sbjct: 187 GPKIETFAKFGNENSFDLPKPILHKSGCNMSYAGLKTAVLHASK-NIKSKQDKYDLAASF 245 Query: 240 EDAVVDTLMIKCKRALD-------QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 + + + L +KC +A++ + K V+AGGV++N+++R + ++ + + Sbjct: 246 QKTINEILKVKCAKAIEMFLEKHKKIKNKNFVVAGGVASNQSIRKTIKQVSSTLKFNTHF 305 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 CTDN AMIA+AG+ ++AG +L + +PRWPL Sbjct: 306 PPLNLCTDNAAMIAWAGLQNYEAGKKPNLKIISQPRWPL 344 >UniRef50_D0ME01 Metalloendopeptidase, glycoprotease family n=4 Tax=Bacteria RepID=D0ME01_RHOM4 Length = 339 Score = 272 bits (695), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 153/332 (46%), Positives = 211/332 (63%), Gaps = 11/332 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T A+ + K L +N + SQ H YGGVVPELASRDH R+ VP+++ A Sbjct: 8 ILGIETSCDDTAAAVVVEGK-LRSNVVASQQATHLRYGGVVPELASRDHQRRIVPVVRQA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+GLT +D+DAVA T GPGLVG+LLVG + ++ A P I V+H+EGH+ + +E Sbjct: 67 LQEAGLTPRDLDAVAVTYGPGLVGSLLVGLSFAKAFALGLGRPLIGVNHLEGHIYSVFIE 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P FP++ L+VSGGHTQL+ V ++ LLG + DDAAGEAFDK A+LLGL YPGGP Sbjct: 127 PPSPPFPYLCLIVSGGHTQLMRVDEGFRHTLLGRTRDDAAGEAFDKVARLLGLGYPGGPE 186 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-------DQTRADI 235 + ++A QG FPRP + G DFSFSGLKT A D ++ +Q RAD+ Sbjct: 187 IDRLARQGDPNFVAFPRPRLE--GYDFSFSGLKT-AVRYYLDQFSEAERARLLEQHRADL 243 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F+ AVVD L+ +RA+ TG + + + GGVSAN LRA + ++ ++ Sbjct: 244 CASFQQAVVDVLIDSLRRAIQDTGLRHVAIVGGVSANSALRAAAQALAEELDVRLYIPPL 303 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +C DN AMIA G + +AG + L ++ P Sbjct: 304 AYCMDNAAMIAITGYFKARAGLESPLTLAAVP 335 >UniRef50_D1B623 Metalloendopeptidase, glycoprotease family n=3 Tax=Synergistaceae RepID=D1B623_THEAS Length = 342 Score = 271 bits (692), Expect = 3e-71, Method: Compositional matrix adjust. Identities = 142/335 (42%), Positives = 212/335 (63%), Gaps = 5/335 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE+SCD+T +A+ ++ + + A+ + SQV+ HA +GGVVPELASR H + L++ Sbjct: 9 VLGIESSCDDTAVAVLEEPRRIRASLVMSQVEDHAPHGGVVPELASRRHQEAIMGLVRRC 68 Query: 63 LKESGLT--AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L ++G++ + + +A TAGPGL+G+LLVG + L+ W+VP + V+HMEGHL A + Sbjct: 69 LWQAGVSNPMRQLSLIAVTAGPGLMGSLLVGVMAAKGLSQGWEVPIMGVNHMEGHLFANV 128 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + PF+ L+VSGGHT++ V G Y LLG + DDA GEA+DK AK+LGL YPGG Sbjct: 129 LAHPDLKPPFLCLIVSGGHTEVHLVRSFGDYRLLGATRDDAVGEAYDKVAKMLGLGYPGG 188 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ ++A +G R+ P P ++FSFSGLKT +R G + + D+ +F+ Sbjct: 189 PVIDRLAREGDPDRYQLPVPFKGSSQVEFSFSGLKTAVLWLVRREG-EALSVPDLCASFQ 247 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP--EFC 298 A V++L+ K K A++QTG + + ++GGV+ANR LR +L ++ G V P E C Sbjct: 248 RAAVESLVSKVKLAMNQTGVRTVAVSGGVAANRELRRRLEDLAGSSGGRVRVYLPPLELC 307 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 TDN AM+A AG+ ++ G DL P W L+ Sbjct: 308 TDNAAMVAAAGLWAYRRGVRDDLSFRADPSWELSR 342 >UniRef50_Q6AL73 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Deltaproteobacteria RepID=GCP_DESPS Length = 344 Score = 271 bits (692), Expect = 3e-71, Method: Compositional matrix adjust. Identities = 144/328 (43%), Positives = 198/328 (60%), Gaps = 8/328 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIE+SCD+T A+ D + +N + Q ++H +GGVVPELASR H+ P+++ Sbjct: 8 MIILGIESSCDDTSAAVVIDGTAIQSNVISGQEEIHNCFGGVVPELASRSHLSAIQPVVE 67 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ ++ DID +A T GPGL G+LLVG + +SL+ +P + V HM GH LA + Sbjct: 68 KALSDAKISLDDIDLIATTQGPGLSGSLLVGYSYAKSLSLVKKIPFVGVDHMAGHALAIL 127 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE+ P+FPF+AL SGG + + V +ELLG + DDAAGEAFDK AK+LGL YPGG Sbjct: 128 LEEETPDFPFIALTASGGTSSIFLVKSSTDFELLGRTRDDAAGEAFDKVAKVLGLPYPGG 187 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAAN----TIRDNGT-DDQTRADI 235 P ++ A G FPR D+ G DFSFSGLKT N ++ NG+ + RADI Sbjct: 188 PHIAAHAETGDEKSIKFPRAWLDKDGFDFSFSGLKTAVLNYHNKIVQKNGSITKEERADI 247 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F+ AV+D L+ K A G +V+ GGVS+NR LR + K + + F Sbjct: 248 CASFQQAVIDVLVTKTINAARTHGISTVVLGGGVSSNRALRLAFSHECDKCKLQFFVPAA 307 Query: 296 EFCTDNGAMIAYAG---MVRFKAGATAD 320 + CTDN AMIA AG +RF G +D Sbjct: 308 KLCTDNAAMIAVAGYHKYLRFGPGNLSD 335 >UniRef50_A5GMV4 Probable O-sialoglycoprotein endopeptidase n=17 Tax=cellular organisms RepID=GCP_SYNPW Length = 356 Score = 270 bits (690), Expect = 5e-71, Method: Compositional matrix adjust. Identities = 155/341 (45%), Positives = 202/341 (59%), Gaps = 11/341 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKG---LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +VL +ETSCDE+ A+ G +LA+++ SQV+ HA +GGVVPE+ASR HV L Sbjct: 3 KVLALETSCDESAAAVVQHSAGGLEVLAHRIASQVEEHAQWGGVVPEIASRRHVEALPHL 62 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 I A L E+GL ++DAVA T PGLVGAL+VG+ GR+LA P + VHH+E HL + Sbjct: 63 ISAVLDEAGLAVGEMDAVAATVTPGLVGALMVGSLTGRTLAALHHKPFLGVHHLEAHLAS 122 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 L +PPE P+V LLVSGGHT+LI V + LG S DDAAGEAFDK A+LLGL YP Sbjct: 123 VRLASSPPEAPYVVLLVSGGHTELILVDSDSGLQRLGRSHDDAAGEAFDKVARLLGLAYP 182 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPG-----LDFSFSGLKTFAANTIRD--NGTDDQT 231 GGP + A G RF P+ RP DFSFSGLKT + +D Sbjct: 183 GGPAIQAAAKAGDPKRFSLPKGRVSRPEGGFYPYDFSFSGLKTAMLRQVESLKAQSDALP 242 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 D+A +FE VVD L+ + R G LVM GGV+AN LR ++ + ++R V Sbjct: 243 LEDLAASFEQIVVDVLVERSLRCCLDRGLSTLVMVGGVAANVRLRVQMEQQGRERGVSVH 302 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAG-ATADLGVSVRPRWPL 331 A +CTDN AM+ A + R +AG ++ + + V RWPL Sbjct: 303 LAPLAYCTDNAAMVGAAALGRLQAGWGSSSIRLGVSARWPL 343 >UniRef50_Q0AVU0 Probable O-sialoglycoprotein endopeptidase n=27 Tax=Bacteria RepID=GCP_SYNWW Length = 339 Score = 270 bits (690), Expect = 6e-71, Method: Compositional matrix adjust. Identities = 145/330 (43%), Positives = 200/330 (60%), Gaps = 9/330 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCDET AI + K +L+N + SQ+ +H +GGVVPE+ASR H+ ++ A Sbjct: 9 ILGIETSCDETAAAIVRNGKEILSNIVNSQIDIHQQFGGVVPEVASRKHIENIAGVVHRA 68 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 E+ L IDAVA T PGLVGALLVG + ++ A+A + P I V+H+ GH+ A LE Sbjct: 69 FSEAQLAYSAIDAVAVTNRPGLVGALLVGVSFAKAFAYALEKPLIAVNHLHGHIYANFLE 128 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 EFP + L+VSGGHT L+ ++ + E+LGE+ DDAAGEAFDK A+ LGL YPGGP Sbjct: 129 HRDIEFPAICLVVSGGHTSLLLMSNPNKMEVLGETRDDAAGEAFDKVARFLGLGYPGGPA 188 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA-DIARAFED 241 + + A +G AG+ PR DR +FSFSGLKT A N Q D+A F+ Sbjct: 189 IQEAATKGKAGQLQLPRVFLDRNDFEFSFSGLKTAAMNQWNKLQRRGQANVFDMAAEFQA 248 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE----VFYARPEF 297 A+V+ L+ K +A + + ++MAGGV+AN+ LR +MKKR E +FY + Sbjct: 249 ALVEVLVEKSIKAAAKYQVRTIMMAGGVAANQELR----NLMKKRTKEAGLKLFYPSLKL 304 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 CTDN AM+A + + A L ++ P Sbjct: 305 CTDNAAMVAANAHYHYGNRSFAPLSLNAYP 334 >UniRef50_B1GZV6 Probable O-sialoglycoprotein endopeptidase n=1 Tax=uncultured Termite group 1 bacterium phylotype Rs-D17 RepID=GCP_UNCTG Length = 342 Score = 268 bits (686), Expect = 2e-70, Method: Compositional matrix adjust. Identities = 140/333 (42%), Positives = 202/333 (60%), Gaps = 6/333 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M + IETSCDET ++ + + + +YSQ+K+HA + GVVPELASR H+ +I Sbjct: 1 MNIFAIETSCDETSASVVLNGLKVKSVVIYSQIKIHAGFFGVVPELASRSHIENINLVIW 60 Query: 61 AALKESGLTAKD----IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 AL ++G+ D IDA+A+T+GPGL GALLVGA +SLA + P IPV+H++GHL Sbjct: 61 RALSDAGINFTDFSQKIDALAFTSGPGLAGALLVGAIAAKSLACVYKKPLIPVNHLDGHL 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 + ++E+ + PF++L++SGGHT+L+ V G+Y++LG + DDAAGEAFDK AK+LGL Sbjct: 121 YSSLIENRSVKLPFLSLIISGGHTELVVVEDFGKYKVLGSTRDDAAGEAFDKAAKMLGLS 180 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-ADI 235 YPGGP++ K+A G F RP + DFSFSG+KT N ++ N ++ + DI Sbjct: 181 YPGGPIIDKIAESGNPEAVRFTRPYL-KGSWDFSFSGIKTALLNYLKTNPVRNEKQLNDI 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F AV +TL K A + KR+V+ GGVSAN +R E +K +VF Sbjct: 240 CASFRQAVAETLCFKSFEAAKKFNLKRIVLGGGVSANSLIRKIFLETGQKNNTKVFIPSL 299 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + TDN AMI A + K + ++P Sbjct: 300 IYSTDNAAMIGCAAYFKQKKCGLKYDNIQLKPN 332 >UniRef50_C0Q8X7 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Desulfobacteraceae RepID=GCP_DESAH Length = 333 Score = 267 bits (682), Expect = 5e-70, Method: Compositional matrix adjust. Identities = 146/329 (44%), Positives = 201/329 (61%), Gaps = 1/329 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIE+SCD+T A+ D +L++ + SQV +H YGGVVPELASR H+ P++ Sbjct: 1 MIILGIESSCDDTAAAVVSDHNTVLSSVVSSQVDVHHRYGGVVPELASRMHIEAISPVVA 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ ++G++ I+ VA T GPGL+GALLVG + ++ A+A ++P V+H+EGH+ + + Sbjct: 61 QAVDQAGISPDQIEGVAVTRGPGLIGALLVGFSFAKAFAWAKNIPWAGVNHLEGHIYSLL 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L D+PP FPF ALL SGGHT + V ++ELLG++ DDAAGEAFDK AK+LGL YPGG Sbjct: 121 LSDDPPAFPFTALLASGGHTSIFHVVSQDRFELLGQTRDDAAGEAFDKVAKMLGLGYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-DQTRADIARAF 239 ++ +AA+G FPR D+ G DFSFSGLK+ A ++ N + + IA F Sbjct: 181 AVVEALAAKGDPCLIPFPRSFLDKDGFDFSFSGLKSAVARYVQLNRENLGEMMPHIAAGF 240 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + AV D L K A TG R+ +AGGVSANR L +++ K ++ P FC Sbjct: 241 QSAVTDVLAFKLIHAARATGCSRIAIAGGVSANRFLASRMKIEAAKHNMALYLPPPSFCG 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPR 328 DN AMIA G G L V R Sbjct: 301 DNAAMIAARGHRLISQGDLCQLDSDVFSR 329 >UniRef50_Q8RC98 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Bacteria RepID=GCP_THETN Length = 341 Score = 266 bits (681), Expect = 5e-70, Method: Compositional matrix adjust. Identities = 139/333 (41%), Positives = 203/333 (60%), Gaps = 3/333 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCDET + + K +L+N +YSQ+ +H YGGVVPE+ASR H+ +++ A Sbjct: 7 ILGIETSCDETAAGVVKNGKEVLSNVIYSQINVHKKYGGVVPEIASRKHIEAISFVVEEA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ L+ ++DA+A T GPGLVG LLVG + G++LA+A P I V+H++GH+ A + Sbjct: 67 LNEAKLSLDEVDAIAATYGPGLVGPLLVGLSYGKALAYAKGKPFIGVNHIDGHIAANYIG 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 N PFV L+ SGGH+ ++ V G+YE++G+++DDAAGEAFDK A+ LGL YPGGP Sbjct: 127 GNLTP-PFVCLVASGGHSHIVYVKDYGEYEVMGKTLDDAAGEAFDKVARALGLGYPGGPA 185 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARAFE 240 + K A G FP+ + DFSFSG+KT N + + ++ D+A +F+ Sbjct: 186 IEKAAKLGNMEAIEFPKSFMEEGNFDFSFSGVKTAVLNYLNRQKQKGEEVNIYDVAASFQ 245 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+ L+ K A ++ +AGGV++N LR KL E KK V+Y +CTD Sbjct: 246 RNIVEVLVKKLVEAARFKNVSKVSIAGGVASNGFLRQKLEEDAKKFGLSVYYPEKIYCTD 305 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 NGAMIA A F G + + ++ P + E Sbjct: 306 NGAMIAAAAYYDFVKGKFSGMDLNAIPYLKIGE 338 >UniRef50_D1AVQ5 Metalloendopeptidase, glycoprotease family n=1 Tax=Streptobacillus moniliformis DSM 12112 RepID=D1AVQ5_STRM9 Length = 332 Score = 266 bits (679), Expect = 9e-70, Method: Compositional matrix adjust. Identities = 130/310 (41%), Positives = 191/310 (61%), Gaps = 6/310 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IE+SCDET +AI D K +L+N + +Q+ +H +YGGVVPE+ASR H+ + + Sbjct: 1 MLILAIESSCDETSVAILKDGKNVLSNVIATQIDIHKEYGGVVPEIASRHHIENILTVYD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ DI +A T PGL+G+LLVG + L+ + ++P IPV+H+EGH+ + Sbjct: 61 KALKEANCKISDISYIAVTNTPGLIGSLLVGLMFAKGLSLSNNIPLIPVNHIEGHIFSTF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + D P+ P + L+ SGGHT L + LLGE++DDA GEA+DK A++LGL+YPGG Sbjct: 121 I-DYEPKLPMLTLVASGGHTSLYLIDENKDLTLLGETLDDAIGEAYDKVARILGLEYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTRADIARA 238 PLL KMA G F P P G DFSFSG+KTF N + +D + D+A+ Sbjct: 180 PLLEKMAIMG-HNSFDIPTPKV--SGYDFSFSGIKTFITNYVNRKKMKGEDFNKEDLAKT 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+D +++ L+ K +A + K + + GGVSAN+ +R + ++ + + E+C Sbjct: 237 FQDKIIEVLIDKLSKASRKNNIKTISVVGGVSANKAIREAIINSEYFENVDILFPKFEYC 296 Query: 299 TDNGAMIAYA 308 TDN AMIA A Sbjct: 297 TDNAAMIASA 306 >UniRef50_C7ND80 Metalloendopeptidase, glycoprotease family n=3 Tax=Leptotrichia RepID=C7ND80_LEPBD Length = 339 Score = 266 bits (679), Expect = 9e-70, Method: Compositional matrix adjust. Identities = 132/332 (39%), Positives = 207/332 (62%), Gaps = 14/332 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L ETSCDET +A+ +D K +L+N + +Q+ +H ++GGVVPE+ASR H+ +P+ Sbjct: 1 MKILAFETSCDETSVAVVEDGKKILSNIISTQIDIHKEFGGVVPEIASRHHIENILPVFT 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ DID +A T PGL+G+LLVG +SL++A ++P +PV+H+ GH+ + Sbjct: 61 EALEKANCELSDIDYIAVTNTPGLIGSLLVGLMFAKSLSYANNIPLLPVNHINGHIFSSF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISV---TGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 + DN + P ++L+VSGGHT L + G +LLGE++DDA GE +DK A++LGLDY Sbjct: 121 I-DNDVKLPAISLVVSGGHTNLYYIYEENGKIITDLLGETLDDAVGETYDKIARILGLDY 179 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADI 235 PGGP + K++ G +P D G +FSFSG+KTF N + + ++ DI Sbjct: 180 PGGPHIDKLSING-EDILKIKKPKVD--GYNFSFSGIKTFITNYVNNQKMKGNAISKEDI 236 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEM--MKKRRGE---V 290 A++ ++ +V+ L K A+ + K +++AGGVSAN+ LR K +E +K + E V Sbjct: 237 AKSLQEIIVNVLYDKILMAVKEKDVKTILVAGGVSANKRLREKFSEFTNIKTDKNEQIAV 296 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLG 322 + + E+CTDN AMI A K + +LG Sbjct: 297 HFPKMEYCTDNAAMIGVAAYYDLKNNSQVELG 328 >UniRef50_A5CE49 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Orientia tsutsugamushi RepID=GCP_ORITB Length = 344 Score = 265 bits (677), Expect = 2e-69, Method: Compositional matrix adjust. Identities = 142/347 (40%), Positives = 209/347 (60%), Gaps = 16/347 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M V+GIE+SCD+T IAI + + ++AN + SQ H Y GVVPE+A+R H++ ++ Sbjct: 1 MNVIGIESSCDDTAIAIVNSNREIIANVVISQYTEHLPYSGVVPEIAARAHLKNLQYAMK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L ++ + DID +A T+GPGL+G ++VG+ G+++A A I V+H+EGH+LA Sbjct: 61 ETLNQAKINFTDIDVIAATSGPGLIGGIIVGSVFGQAIACALGKDFIAVNHLEGHILAVR 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L +N FP++ LLVSGGH Q I+V G+G+Y++LG++IDDA GEAFDKTA+LL L YPGG Sbjct: 121 LNENI-SFPYLVLLVSGGHCQFIAVLGVGKYKILGQTIDDAVGEAFDKTARLLKLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADIARAF 239 P++ K+A++G ++ P MT + G D SFSGLKT I ++ DI +F Sbjct: 180 PIIEKLASKGDPHKYSLPLSMTKKSGCDLSFSGLKTAVKQLIFSIESLSEKVICDICASF 239 Query: 240 EDAVVDTLMIKCKRALD------QTGFK-----RLVMAGGVSANRTLRAKLAEMMKKRRG 288 + VV L+ + A+ FK V++GGV+AN+ LR ++ + G Sbjct: 240 QYTVVQILLCRSINAIKLFESYCSNNFKINRKNYFVISGGVAANQYLRQEIFN-LANTYG 298 Query: 289 EVFYARP-EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 A P CTDN AMIA+AG+ R A + V R +W + EL Sbjct: 299 YCGVAPPSNLCTDNAAMIAWAGIERLNANLFSSNFVP-RAKWSVEEL 344 >UniRef50_B4U8B7 Metalloendopeptidase, glycoprotease family n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U8B7_HYDS0 Length = 343 Score = 264 bits (674), Expect = 4e-69, Method: Compositional matrix adjust. Identities = 137/314 (43%), Positives = 201/314 (64%), Gaps = 12/314 (3%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIETSCD+T +A+Y ++GL+ N L SQV H Y G+VPEL SR+H + L L Sbjct: 10 LGIETSCDDTALALYSSKRGLIDNLLSSQVNAHKIYNGIVPELCSREHTKNLYILFYELL 69 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 ++ + DID +A T PGL+ +LLVGA+ L++A D+P +PVHH+E H+ + LE Sbjct: 70 EKHKIKPSDIDFLAVTIAPGLILSLLVGASFASGLSYALDIPIVPVHHIEAHIYSVFLEY 129 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 N E+PF+AL+VSGGHT++ V G YEL+G+++DDAAGEAFDK A LLGL YPGGP + Sbjct: 130 N-VEYPFLALVVSGGHTEIYLVKGFEHYELIGKTLDDAAGEAFDKGAVLLGLQYPGGPAI 188 Query: 184 SK-MAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 K +++ FP P+ D + FSFSGLKTF +R+N D + + ++++A Sbjct: 189 EKFLSSYENPETIDFPIPIKD-DRIAFSFSGLKTF----LREN-KDKYPKDALVFSYQEA 242 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 +V+ ++ ++A+ +T RLV+ GGV+AN+ LR KL + E + ++CTDN Sbjct: 243 IVNHIIRTLQKAIKKTAVNRLVVVGGVAANKRLREKLNAL----DIECYIPSIKYCTDNA 298 Query: 303 AMIAYAGMVRFKAG 316 AM++ G +RF G Sbjct: 299 AMVSLVGNMRFLKG 312 >UniRef50_Q2GEG6 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Neorickettsia RepID=GCP_NEOSM Length = 329 Score = 263 bits (673), Expect = 5e-69, Method: Compositional matrix adjust. Identities = 139/327 (42%), Positives = 203/327 (62%), Gaps = 7/327 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LG+ETSCDET +AI +E + +++++Q H+ Y GV PE ASR+H++ +++ A Sbjct: 6 ILGVETSCDETSVAIVSEEGEVCFHEIFTQD--HSKYNGVYPEFASREHLKILPQILRRA 63 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ L + + A+A T GPGLVG+L+VG + R LAF+ P V+H+EGHLLA L Sbjct: 64 VQAHDL--EKLTAIACTVGPGLVGSLIVGVMMARGLAFSLKKPVFGVNHLEGHLLAVRLV 121 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + FPFV L++SGGH+QLI GIG Y LLGE++DDA GEAFDK A +LG YPGG Sbjct: 122 EKI-NFPFVCLVISGGHSQLIDARGIGDYVLLGETLDDAFGEAFDKLATMLGFTYPGGKT 180 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADIARAFED 241 + K+A +G + RF P M ++ G +FS SG+KT I ++ +ADI +F+ Sbjct: 181 VEKLAIKGDSERFRLPAAMINQSGCNFSLSGIKTALKKIITSLPQITEKDKADICASFQA 240 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 V ++ K ++A+ G R+V+AGGV +NR +R L E K + + CTDN Sbjct: 241 CVARIMVNKLEQAVKICGHSRIVLAGGVGSNRYIRETLEEFAKNHNLSLHFPEGILCTDN 300 Query: 302 GAMIAYAGMVRFKAGATADLGVSVRPR 328 AMIA+A + R KAG T +L + +PR Sbjct: 301 AAMIAWAAIERLKAGCT-ELSLEPQPR 326 >UniRef50_B2GAG0 Probable O-sialoglycoprotein endopeptidase n=56 Tax=Lactobacillales RepID=GCP_LACF3 Length = 344 Score = 263 bits (673), Expect = 5e-69, Method: Compositional matrix adjust. Identities = 137/336 (40%), Positives = 213/336 (63%), Gaps = 6/336 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L E+SCDET +++ +D +L+N + +Q+ H +GGVVPE+ASR H+ + + A Sbjct: 7 ILAFESSCDETSVSVIEDGHRVLSNIVATQIASHQRFGGVVPEVASRHHIEQITKCTKEA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+++G++ +D+ AVA T GPGLVG+LL+G T +++A+A +P +PV+HM GHL A Sbjct: 67 LEQAGVSYQDLTAVAVTYGPGLVGSLLIGVTAAKTIAWAHQLPLVPVNHMAGHLYAARFV 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + +P + LLVSGGHT+L+ + Y+++GE+ DDAAGEA+DK +++G++YP G Sbjct: 127 SDFT-YPMLGLLVSGGHTELVYMKEEHDYQIIGETRDDAAGEAYDKVGRVMGINYPAGKT 185 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAFE 240 + + AA+G F FPR M DFSFSGLK+ NT+ D + + D+A +F+ Sbjct: 186 VDQWAAKGH-DTFHFPRAMEKEDNFDFSFSGLKSAFINTVHNADQRGEVLDKYDLAASFQ 244 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV--FYARPEFC 298 +VVD L+ K RALD+ K+L++AGGV+AN+ LR +L+ ++ + EV A ++C Sbjct: 245 QSVVDVLVAKTIRALDEFPVKQLILAGGVAANQGLRKQLSAGLQAKHPEVQLLQAPLKYC 304 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 DN AMI AG V + G AD ++ P A L Sbjct: 305 GDNAAMIGAAGYVNYLHGDRADGSLNAVPGLSFAHL 340 >UniRef50_B2UQZ0 Metalloendopeptidase, glycoprotease family n=3 Tax=Verrucomicrobia RepID=B2UQZ0_AKKM8 Length = 360 Score = 262 bits (670), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 147/345 (42%), Positives = 204/345 (59%), Gaps = 12/345 (3%) Query: 1 MRVLGIETSCDETGIAIYD---DEKG--LLANQLYSQVKLHADYGGVVPELASRDHVRKT 55 + VLGIE+SCDET +AI +EK +L++ + SQ+ +H +GGVVPELASR+H Sbjct: 5 LTVLGIESSCDETAVAILRSAGEEKAPEILSSVISSQIAIHRQHGGVVPELASRNHSADL 64 Query: 56 VPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 +I+ A +E+G DID T GPGLV ALLVG + ++LA A P + V+H+EGH Sbjct: 65 PGIIRTACREAGTAPADIDVFGATGGPGLVAALLVGNSTAKALALAAGRPFVSVNHLEGH 124 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 LL+P L+ P + ++VSGGHT + V G+G Y LLG S+DDAAGEAFDK K+LGL Sbjct: 125 LLSPFLKRPGGPVPHLGMVVSGGHTLFVDVRGVGNYRLLGRSLDDAAGEAFDKVGKMLGL 184 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT---FAANTIRDNGTDD--- 229 YPGGP + ++AA+G F FPR + + SFSGLKT + I NG Sbjct: 185 PYPGGPEIDRLAAEGDPEAFSFPRALMKEHTANVSFSGLKTAVLYTLPKITKNGDPHGLP 244 Query: 230 -QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 QT D+ +F+ AV D L+ K +AL +G + L ++GGVS NR LR++L + + Sbjct: 245 RQTLRDLCASFQRAVTDVLIHKALKALRASGHRTLSISGGVSCNRELRSRLKTACDREKV 304 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 ++ + TDN AMIAY ++ + G L V P L E Sbjct: 305 KLVLPDFDLTTDNAAMIAYVTCLKARRGLFHSLDEDVDPNLKLTE 349 >UniRef50_B9XP92 Metalloendopeptidase, glycoprotease family n=1 Tax=bacterium Ellin514 RepID=B9XP92_9BACT Length = 341 Score = 262 bits (669), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 143/342 (41%), Positives = 205/342 (59%), Gaps = 11/342 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L +ETSCDET +AI + K +L+ + SQ+KLHA+YGGVVPELA+R+H+ +P+ Sbjct: 1 MILLAVETSCDETSVAIIRNGK-VLSTIVSSQIKLHAEYGGVVPELAAREHLANLIPVAN 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AA+ + + + +DA+A T GPGL GAL+VG + +AFA + P ++H E HL +P Sbjct: 60 AAMTAAEVQSDQVDAIAATQGPGLPGALVVGLKAAQGMAFALNKPFFGINHHEAHLYSPW 119 Query: 121 LEDNPP--EF----PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 + +PP +F P ++L+VSGGHT LI V ++ +LG +IDDAAGE FDK AKL+G Sbjct: 120 ITGSPPVADFDSFQPNISLIVSGGHTMLIHVESELKHHVLGSTIDDAAGECFDKVAKLIG 179 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT---DDQT 231 L YPGGP + ++A+ G + FPRPM DFSFSGLKT IRDN Q Sbjct: 180 LPYPGGPEIDRLASAGNPKAYDFPRPMLRDASDDFSFSGLKTSVRYFIRDNPAVLDSLQK 239 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 D+ + ++A+V+ L+ K RA ++ K + +GGV+ NR LR+ L K++ + Sbjct: 240 LQDLCASVQEAIVEVLVTKTVRAANRLQVKCVTASGGVTCNRALRSALETACKRKHLTLR 299 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPRWPLA 332 A CTDN AMI + +T L + P W LA Sbjct: 300 LAEKSLCTDNAAMIGVLAERKLLHSSTPTSLDSEIMPGWALA 341 >UniRef50_Q0ATQ2 Probable O-sialoglycoprotein endopeptidase n=44 Tax=Proteobacteria RepID=GCP_MARMM Length = 377 Score = 261 bits (668), Expect = 2e-68, Method: Compositional matrix adjust. Identities = 149/345 (43%), Positives = 204/345 (59%), Gaps = 16/345 (4%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 + VLG+E+SCDET AI D +LA+++ Q HA +GGVVPE+A+R H Sbjct: 15 LTVLGLESSCDETAAAILRREVDGSVTVLADRVLGQNDAHAPFGGVVPEIAARAHAEAMD 74 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ AL E+GL D+D +A T+GPGL+G ++ + LA P I V+H+EGH Sbjct: 75 GLVSQALAEAGLAVADLDGIAATSGPGLIGGVMAALMTAKGLALGAGKPLIAVNHLEGHA 134 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L+P + + P FP++ LLVSGGHTQL+ G+G Y LG ++DDAAGEAFDKTAK++GL Sbjct: 135 LSPRISE-PLAFPYLLLLVSGGHTQLLIAEGVGVYHRLGSTMDDAAGEAFDKTAKVMGLG 193 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD--NGTDDQTRAD 234 +PGGP L + A G A RF P P+ +PG DFSF+GLKT AA I D + DQ RAD Sbjct: 194 FPGGPALERCAQSGDATRFALPVPLKGKPGCDFSFAGLKT-AARQIWDGLDAPSDQDRAD 252 Query: 235 IARAFEDAVVDTLMIKCKRAL--------DQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 ++ + A+ L + +RAL D + LV+AGGV+AN+ +RA L + Sbjct: 253 LSACVQAAIARALSSRTRRALAMFVDRFPDASRPMALVVAGGVAANKAVRAALEDEAAAA 312 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 + ++CTDN AMIA G+ + G L R RWPL Sbjct: 313 GFRLVAPPMKWCTDNAAMIALVGLEKLARGQIDGLDAPARARWPL 357 >UniRef50_A6DFV1 Metalloendopeptidase, putative, glycoprotease family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFV1_9BACT Length = 355 Score = 261 bits (668), Expect = 2e-68, Method: Compositional matrix adjust. Identities = 141/350 (40%), Positives = 206/350 (58%), Gaps = 18/350 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LG+E+SCDET +++ + +LAN + SQ+K HA+YGGV+PELA+R+H+ P + Sbjct: 1 MIILGVESSCDETAVSLVRNGHEVLANAISSQIKDHANYGGVIPELAAREHLNNVRPTLN 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ L DID +A TA PGL+ ALLVGA LA + ++H+ H+ + Sbjct: 61 EALEKAALKLDDIDGIAVTAQPGLLPALLVGAGFANGLALSLGKKVCGINHLAAHIYGGL 120 Query: 121 LE-----DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 +E NP FP ALL+SGG+TQL + G EL+G +IDDAAGEAFDK AK+LGL Sbjct: 121 IERQDILSNPNAFPLCALLISGGNTQLFIIKKTGDCELVGSTIDDAAGEAFDKAAKILGL 180 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPM-------TDRPGLDFSFSGLKTFAANTIRDNGTD 228 YPGGP++ ++A G ++ FPR ++ L+FSFSG+KT N ++ N D Sbjct: 181 PYPGGPIIDRLAKSGDKNKYKFPRSFLPKTRSYSEEHKLNFSFSGVKTSLLNLVKKNWKD 240 Query: 229 ----DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 D D+ +++DA+VD L K K A + G + L++ GGV+ N +R ++ +M Sbjct: 241 GMVPDGDLPDLLASYQDAIVDVLSTKLKMAAESYGARTLLLCGGVACNSAIRERVQKMAI 300 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFK-AGATADLGVSVRPRWPLAE 333 + E+ P++CTDN AMIA G K T D V R P+ E Sbjct: 301 QTAKELVLTPPKYCTDNAAMIAGLGYHYLKDPNFTGDF-VEASGRAPIIE 349 >UniRef50_B3R0M3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Candidatus Phytoplasma mali RepID=GCP_PHYMT Length = 329 Score = 260 bits (665), Expect = 5e-68, Method: Compositional matrix adjust. Identities = 124/314 (39%), Positives = 197/314 (62%), Gaps = 4/314 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCDET ++ D K +++N ++SQ+K H+ GGV+PELASR+H++ +++ Sbjct: 1 MIILSIETSCDETSASVTQDGKKVISNIVFSQIKEHSLNGGVIPELASREHLKNITLVLE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +LKE+ + ++ID VA+T GPGL+G+LLVG + + P + V+H+ GH+ A Sbjct: 61 KSLKEANIQPQEIDLVAFTQGPGLIGSLLVGINCALVFGYIYKKPVLGVNHLLGHIYAAQ 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E N EFP + L++SGGHT+L+++ Q + LG + DDA GEA+DK +++LG YPGG Sbjct: 121 IE-NEIEFPSLVLIISGGHTELLALENYLQIKKLGFTCDDAVGEAYDKVSRILGFGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ ++A +G F F RP +FSFSGLK+ N + N + Q + +I +F+ Sbjct: 180 PIIDELAQKG-KDIFNFVRPYLKNDNFNFSFSGLKSSIFNLVSKNNFNLQEKINICSSFQ 238 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+D L+ K KR L + FK+L++ GGV+AN +LR + + + V ++C D Sbjct: 239 SSVIDVLVEKTKRVLKKYSFKQLIITGGVAANYSLRKRFLSEFSQLK--VIIPSLKYCGD 296 Query: 301 NGAMIAYAGMVRFK 314 AMI A +FK Sbjct: 297 QAAMIGIAAYYQFK 310 >UniRef50_C8W929 Metalloendopeptidase, glycoprotease family n=2 Tax=Atopobium RepID=C8W929_ATOPD Length = 832 Score = 259 bits (662), Expect = 9e-68, Method: Compositional matrix adjust. Identities = 148/337 (43%), Positives = 189/337 (56%), Gaps = 13/337 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +E+SCDET + I D + AN + +Q+ HA +GGVVPE+ASR H V L + Sbjct: 480 ILSLESSCDETAMCIMDSHGVVCANVVATQIDFHARFGGVVPEIASRKHTEAIVGLFEET 539 Query: 63 LKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 + +G L D+ AV TAGPGLVGAL+VG + A D+P IPVHH+EGH Sbjct: 540 MARAGAHFGCDTLVPSDLAAVGVTAGPGLVGALVVGVAFAKGFCVATDLPLIPVHHLEGH 599 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 LLA + E E PFVA LVSGG+T L+ V G Y +LG +IDDA GEAFDK AK LGL Sbjct: 600 LLANLFETPDLEPPFVASLVSGGNTMLVHVRAWGDYVVLGSTIDDAVGEAFDKVAKALGL 659 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA-- 233 YPGGP++SK+AAQG FPR M FS SGLKT I G + RA Sbjct: 660 GYPGGPVISKLAAQGNPKAIHFPRAMMHSGDYSFSLSGLKTAVITYI--EGENRAGRAIN 717 Query: 234 --DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 D+A +FE AV+D + K K A+++TG + GGV+AN LRA E K V Sbjct: 718 LPDLAASFEQAVIDVQVAKAKTAVEETGVSDFCVGGGVAANPALRAAYKETFGKMGVRVT 777 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 C DN AMIA + ++ + L + P Sbjct: 778 VPPMSVCGDNAAMIAVGALRSYRTQGFSPLTLDANPN 814 >UniRef50_A7HLB0 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Thermotogaceae RepID=GCP_FERNB Length = 337 Score = 259 bits (661), Expect = 1e-67, Method: Compositional matrix adjust. Identities = 133/335 (39%), Positives = 203/335 (60%), Gaps = 10/335 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCDET +A+ +D ++AN +YSQ+++H +GGVVPE+A+R+H+++ L Sbjct: 1 MIVLGIETSCDETSVALVEDNT-VIANLVYSQIQIHKKFGGVVPEIAAREHLKRLPILFS 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + ++ + + ID +A T GPGL+GALLVG + + LA + P + ++H+ GH+ + Sbjct: 60 ELISQTNINIERIDGIAVTKGPGLIGALLVGVSFAKGLALRYKKPLVGINHIIGHVYSNY 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + P++ L+VSGGHT ++ V +LG S+DDA GEAFDK A+LLGL YPGG Sbjct: 120 LAYPDLKPPYIVLMVSGGHTLILKVEENNNVTILGRSVDDAVGEAFDKIARLLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT---FAANTIRDNGTDDQTR--ADI 235 P + K++ G F FP+P P +FSFSGLKT + + +G + D+ Sbjct: 180 PEIDKISKNGNPNAFNFPKPKMYDPDYNFSFSGLKTAVLYEIKRLTKSGYSENNLPIPDL 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A + ++ ++D L+ K +A K +V+AGGV+AN LR K+ + ++ FY P Sbjct: 240 AASAQEVMIDVLLHKVTKAARDNNLKNIVLAGGVAANSRLREKIRALSEEFN---FYIPP 296 Query: 296 -EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 E+C+DN AMIA AG+ R K+G L P + Sbjct: 297 LEYCSDNAAMIARAGLERIKSGENDGLNFEPVPNF 331 >UniRef50_C7H0S4 Putative glycoprotease GCP n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7H0S4_9FIRM Length = 371 Score = 259 bits (661), Expect = 1e-67, Method: Compositional matrix adjust. Identities = 132/323 (40%), Positives = 199/323 (61%), Gaps = 3/323 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + VL IETSCDET +I + + +L+N +++Q+ +H +YGGVVPE+ASR+H+ K ++ Sbjct: 21 VNVLAIETSCDETACSIVRNGREVLSNAIFTQMHIHREYGGVVPEIASRNHLEKINDVVD 80 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ ++GL +DID +A T+ PGL+GAL+VG +++A+A P + VHH+ GH+ A Sbjct: 81 KAILDAGLHKEDIDVIAVTSTPGLIGALVVGVATAKTMAYALSKPLVGVHHIAGHIAANY 140 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+ E PF++L++SGGHT +I V ++E++G+++DDAAGEAFDK LLGL YP G Sbjct: 141 LDHGELEPPFISLVISGGHTSVIDVKDYNEHEVIGQTLDDAAGEAFDKVGILLGLTYPAG 200 Query: 181 P---LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 L++ A + F R ++ FSFSG+KT N IR N D + IA Sbjct: 201 KDMDELARSAIKNNVSPVYFKRTYLEKGSPHFSFSGIKTRVMNYIRANKDDPIDKEAIAL 260 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 F +AV D L+ K + K++V+AGGV+AN +R K E + + EV+ Sbjct: 261 GFHEAVTDVLVKKTMDMAKRRNRKKIVLAGGVAANSLIRNKFKEEGEAQGFEVYLPGLGM 320 Query: 298 CTDNGAMIAYAGMVRFKAGATAD 320 CTDN AMIA AG ++ +G +D Sbjct: 321 CTDNAAMIASAGYYKYISGGISD 343 >UniRef50_C7N1K1 Ribosomal-protein-alanine acetyltransferase n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1K1_SLAHD Length = 781 Score = 258 bits (659), Expect = 2e-67, Method: Compositional matrix adjust. Identities = 154/341 (45%), Positives = 192/341 (56%), Gaps = 11/341 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVR-------KT 55 +L IE+SCDET AI D E +L++ + SQ+ HA +GGVVPE+ASR H+ + Sbjct: 440 ILAIESSCDETAAAIIDGEGSMLSDVVASQIDFHARFGGVVPEIASRKHIEAICGVTDEC 499 Query: 56 VPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 + + AL S L +D+DAVA T PGLVGAL+VG + A+ D+P I V+H+EGH Sbjct: 500 LDVAARALGRSRLRWRDLDAVAVTYAPGLVGALVVGVAFAKGAAWGADLPIIAVNHLEGH 559 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 L A L + + P V LVSGGHT L+ V G YE LG +IDDA GEAFDK +K LGL Sbjct: 560 LYANRLAEPDIQPPMVVSLVSGGHTMLVHVKDWGDYETLGSTIDDAVGEAFDKVSKALGL 619 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI---RDNGTDDQTR 232 YPGGP++SK AAQG A FPR + L FS SGLKT I R+ G + Sbjct: 620 GYPGGPIISKYAAQGDAKAIAFPRALMHSGDLRFSLSGLKTAVTTYINKEREAGRELNI- 678 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +FE AVVD + K AL TG + + GGV+AN LR M KK + Sbjct: 679 PDIAASFEAAVVDVQVSKAHTALKDTGARTFCLGGGVAANPALRGAYEAMCKKHGYRLVM 738 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN AMIA RF G AD + V PL E Sbjct: 739 PPLSACGDNAAMIAEVARDRFAQGKFADWSLDVTAHAPLDE 779 >UniRef50_Q2RGJ3 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Bacteria RepID=GCP_MOOTA Length = 342 Score = 258 bits (658), Expect = 3e-67, Method: Compositional matrix adjust. Identities = 146/331 (44%), Positives = 194/331 (58%), Gaps = 2/331 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCDET AI D + AN + SQ+ +H +GGVVPE+ASR H+ VP++ A Sbjct: 11 ILAIESSCDETAAAIVSDGTRVRANIIASQIAVHRRFGGVVPEIASRHHMENIVPVVSEA 70 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L +GL D+DAVA T GPGLVGALLVG +SLA+A P I VHH+ GH+ A L Sbjct: 71 LATAGLAFSDVDAVAVTYGPGLVGALLVGVAYAKSLAYALGKPLIGVHHLLGHIYAGFLA 130 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P V+L+VSGGHT L+ + +LG + DDAAGEAFDK A++LGL YPGGP Sbjct: 131 YPGLPLPAVSLVVSGGHTNLVYLEDHTTRRILGSTRDDAAGEAFDKVARVLGLPYPGGPE 190 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARAFE 240 L K+A +G FPR + LDFSFSGLK+ N + Q RAD+A +F+ Sbjct: 191 LEKLAREGNPRAIPFPRAWLEENSLDFSFSGLKSAVINYLHHARQVGQEVNRADVAASFQ 250 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 AV + L+ K A + +++AGGV+AN LR +L ++ VF+ E CTD Sbjct: 251 AAVAEVLVTKTLLAATSYRARSILLAGGVAANSVLRRELRSAGEQAGLPVFFPPRELCTD 310 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 N AMI A ++ A L ++ P PL Sbjct: 311 NAAMIGCAAYYQYLRRDFAPLSLNAIPDLPL 341 >UniRef50_A4EBV8 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A4EBV8_9ACTN Length = 794 Score = 257 bits (656), Expect = 5e-67, Method: Compositional matrix adjust. Identities = 155/344 (45%), Positives = 196/344 (56%), Gaps = 19/344 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IE+SCDET +AI D + +LANQ+ +Q+ HA +GGVVPE+ASR HV V ++ AA Sbjct: 455 VLAIESSCDETAVAIIDADGNMLANQVSTQIDFHARFGGVVPEIASRKHVEVIVSVVDAA 514 Query: 63 LKES----GLTA-----KDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 L+++ GLT ++ AV T GPGLVGAL+VG + A+A P + V+H+E Sbjct: 515 LEDAAASLGLTGGAIAPSELAAVGVTQGPGLVGALVVGVAFAKGFAYAAGKPLVCVNHLE 574 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 GHL A +L + PF+ LVSGGHT L+ V G YE+LGE++DDA GEAFDK AK L Sbjct: 575 GHLFANLLAQPDLKPPFIFTLVSGGHTMLVHVKAWGDYEVLGETLDDAVGEAFDKVAKAL 634 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT-- 231 GL YPGGP++SK+A G FPR + R FS SGLKT I +T Sbjct: 635 GLGYPGGPIISKLAETGNPKAIDFPRALNSRGDYRFSLSGLKTAVTLYIEQETKAGRTIH 694 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE-- 289 D+A +FE AV D K K AL TG K + GGVSAN LR EMM K+ G Sbjct: 695 LPDLAASFEAAVFDVQYKKAKNALHATGCKEYCIGGGVSANPHLR----EMMIKKLGRQG 750 Query: 290 VFYARPEF--CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 + P CTDN AMIA +F G + V P L Sbjct: 751 IRVTVPPLSACTDNAAMIAEVARRKFDRGEISPFDVDADPNMTL 794 >UniRef50_C9RIN4 Metalloendopeptidase, glycoprotease family n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RIN4_FIBSS Length = 335 Score = 256 bits (655), Expect = 7e-67, Method: Compositional matrix adjust. Identities = 156/338 (46%), Positives = 204/338 (60%), Gaps = 10/338 (2%) Query: 1 MRVLGIETSCDETGIAIY-DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M LGIE+SCDET A+ DD +L+N LYSQ+ HA YGGVVPE+A+R H++K P+ Sbjct: 1 MIWLGIESSCDETACAVLQDDPLKVLSNPLYSQIDEHALYGGVVPEIAARAHLQKIAPIA 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 +AA+KE+G+ KDIDA+AYT GPGL+G LLVGA+ + LA ++PA ++H+EGHL A Sbjct: 61 EAAVKEAGVELKDIDAIAYTTGPGLMGPLLVGASFAKGLARDLNIPAYGMNHLEGHLAAA 120 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L + E PF+ L VSGGHT+L+ +Y +G + DDAAGEAFDK KL+GL YP Sbjct: 121 WLSNPDIEPPFLTLTVSGGHTELVMEEPGFKYTSIGRTRDDAAGEAFDKCGKLIGLKYPA 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-----QTRAD 234 G +S++ FPR + +FSFSGLKT +R T D Q D Sbjct: 181 GATISRLGKDHNRKFVEFPRALHTHDSCEFSFSGLKT---AVLRYTETHDPEFIQQNLGD 237 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 I + EDA+VD+L+ K AL +T K LVM GGVSAN LR +L + K+ Sbjct: 238 ICASLEDAIVDSLVTKTINALKKTKMKTLVMGGGVSANSWLRTRLQDYCDKKGIRFCVPD 297 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 TDNGAMIA A + R G + V V+P PLA Sbjct: 298 RSLSTDNGAMIAAAAIRRKLQGKLESIDV-VKPWMPLA 334 >UniRef50_C1TLM6 O-sialoglycoprotein endopeptidase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TLM6_9BACT Length = 336 Score = 256 bits (653), Expect = 1e-66, Method: Compositional matrix adjust. Identities = 138/335 (41%), Positives = 206/335 (61%), Gaps = 11/335 (3%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 L IE+SCD+T +A+ D ++ +L++ + SQV+ HA +GGVVPE ASR H+ +PL+ AL Sbjct: 6 LAIESSCDDTAVAVIDGQRNVLSSTMSSQVESHAPFGGVVPEYASRMHLEAILPLVDRAL 65 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 E+ D+D +A TAGPGL+G+LLVG + LA AW P + V+H+EGH+ A ++ Sbjct: 66 AEADAKPSDLDLIAVTAGPGLMGSLLVGVMTAKGLAQAWGKPILGVNHLEGHVFANVVNH 125 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 + PF+A++VSGGHT+++ V +G Y +LG + DDAAGEA+DK AKLLGL YPGGP++ Sbjct: 126 PDLDPPFIAMIVSGGHTEVVLVEDLGFYRILGGTKDDAAGEAYDKVAKLLGLAYPGGPIV 185 Query: 184 SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT---FAANTIRDNGTDDQTRADIARAFE 240 ++A G F FP P+ + FSFSGLKT + I+ G DI +F+ Sbjct: 186 DELAKDGDPQAFDFPVPLKRSDEISFSFSGLKTAVLWQVERIKKEGASLPVE-DICASFQ 244 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE--FC 298 A V+ L+ K A+ +TG +++V++GGV+AN LR + RG+ P+ +C Sbjct: 245 RAAVEALICKLDLAVQKTGVEKVVLSGGVAANSCLRD-----LVLNRGDWKGYVPDMFYC 299 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 TDN MI AG + G + L ++ P W + + Sbjct: 300 TDNAVMIGAAGYHGWMRGRRSGLDLAPSPSWSIMD 334 >UniRef50_Q3SVF4 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Rhizobiales RepID=GCP_NITWN Length = 357 Score = 255 bits (651), Expect = 2e-66, Method: Compositional matrix adjust. Identities = 158/349 (45%), Positives = 207/349 (59%), Gaps = 14/349 (4%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIET+CDET A+ D +L+N + SQ + HA YGGVVPE+A+R HV Sbjct: 1 MLVLGIETTCDETAAAVVERLPDGSARILSNIVRSQTEEHAPYGGVVPEIAARAHVELLD 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 LI A+ ESG+ + + VA AGPGL+G ++VG T +++A P V+H+E H Sbjct: 61 GLIARAMTESGVGFRQLSGVAAAAGPGLIGGVIVGLTTAKAIALVHGTPLTAVNHLEAHA 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L P L EFP+ L SGGHTQ+++V G+G Y LG ++DDA GEAFDK AK+LGL Sbjct: 121 LTPRLTSRL-EFPYCLFLASGGHTQIVAVLGVGNYVRLGTTVDDAMGEAFDKVAKMLGLP 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAAN-TIRDNGTDDQTRADI 235 YPGGP + + AA G A RF FPRPM RP +FS SGLKT N R + + + +D+ Sbjct: 180 YPGGPEVERAAASGDATRFNFPRPMLGRPDANFSLSGLKTAVRNEAARIDPLEPRDISDL 239 Query: 236 ARAFEDAVV----DTLMIKCKRALDQTGFKR-LVMAGGVSANRTLRAKLAEMMKKRRGEV 290 F+ AV+ D L + + ++ G R LV AGGV+AN+ +RA L + K R + Sbjct: 240 CAGFQAAVLEATADRLGVGLRLFEERFGRPRALVAAGGVAANQAIRASLEGVAAKARTSL 299 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL---AELPA 336 P CTDNGAMIA+AG R AG T L R RW L A+ PA Sbjct: 300 IIPPPALCTDNGAMIAWAGAERLAAGLTDSLETPPRARWLLDANAQAPA 348 >UniRef50_Q6MQ48 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Bdellovibrio bacteriovorus RepID=GCP_BDEBA Length = 345 Score = 255 bits (651), Expect = 2e-66, Method: Compositional matrix adjust. Identities = 139/335 (41%), Positives = 195/335 (58%), Gaps = 8/335 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 RVL IETSCD+T +AI D + + SQ H YGG+VPE+A+R+H +PLI+ Sbjct: 4 RVLAIETSCDDTSVAIVDRTGWVHSVVAASQDLDHEIYGGIVPEIAARNHSIALIPLIEE 63 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A K++ + D+ +A T PGL+GAL+VG +SL+ A +P + V+H+EGHLLAP L Sbjct: 64 AFKKANMNWSDVQGIAVTNRPGLIGALIVGLVTAKSLSQAKHLPFLGVNHLEGHLLAPFL 123 Query: 122 EDN---PPE---FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 D+ PPE +P+V L +SGGHT L + G+G Y +LG + DDAAGE FDK AK+ GL Sbjct: 124 RDDKYAPPEDFGYPYVGLAISGGHTSLYQIKGLGDYRILGATKDDAAGECFDKFAKMAGL 183 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRA 233 +PGG + +MA G F FPR M D SFSGLK+ + G + + Sbjct: 184 GFPGGVRVDQMAKAGNPQAFEFPRSMIHDDTFDMSFSGLKSSGQRMLEQLGPELVQERLP 243 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 D+ +F++A+VD L+ K RA KR+++ GGVSAN LR + E K+ + Sbjct: 244 DLCASFQEAIVDVLIAKLDRAAKVFRSKRVILTGGVSANSRLRQRAQEWADKKGYTLVIP 303 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +CTDN AMI Y G +R G + L + P+ Sbjct: 304 PLRYCTDNAAMIGYVGALRMARGEVSALDLGPSPQ 338 >UniRef50_C7MKR9 Ribosomal-protein-alanine acetyltransferase n=10 Tax=Bacteria RepID=C7MKR9_CRYCD Length = 860 Score = 254 bits (649), Expect = 3e-66, Method: Compositional matrix adjust. Identities = 146/328 (44%), Positives = 191/328 (58%), Gaps = 9/328 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCDET AI D ++A+ + SQ+ HA +GGVVPE+ASR HV ++QA Sbjct: 519 ILAIESSCDETAAAIVDGHGRIIADVVASQIDFHARFGGVVPEIASRKHVEAICGVVQAC 578 Query: 63 LKES-------GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L E+ L+ +DAVA T PGLVGAL+VG ++ A+A +P I V+H+EGH Sbjct: 579 LDEAAEHLGTANLSWNSLDAVAVTYAPGLVGALVVGVAYAKAAAWAAGIPFIKVNHLEGH 638 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 L A L + + P V LVSGGHT L+ V G Y ++G +IDDA GEAFDK AK LGL Sbjct: 639 LYANKLARSDIKPPLVVSLVSGGHTMLVHVRDWGDYCVMGSTIDDAVGEAFDKVAKALGL 698 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRA 233 YPGGP++S++A QG FPR M L FS SGLKT I + D Sbjct: 699 GYPGGPVISRLAQQGNPAAIHFPRAMMHSGDLRFSLSGLKTAVVTYIHNQQQQKADLNVP 758 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 DIA +F+ AV+D + K ALD+TG K + GGV+AN LRA +R + Sbjct: 759 DIAASFQAAVIDVQVAKATAALDETGAKEFCLGGGVAANPALRAAYESACAQRGVRLTMP 818 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGATADL 321 CTDN AMIA + R++AG T+ L Sbjct: 819 PARACTDNAAMIALVALDRYQAGKTSGL 846 >UniRef50_Q8DLI9 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Cyanobacteria RepID=GCP_THEEB Length = 353 Score = 253 bits (646), Expect = 7e-66, Method: Compositional matrix adjust. Identities = 147/343 (42%), Positives = 195/343 (56%), Gaps = 8/343 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R+L IETSCDET A+ D + + +N + SQV H +GGVVPE+ASR H+ +I Sbjct: 2 VRILAIETSCDETAAAVVRD-RAIESNVIASQVCAHQPFGGVVPEVASRAHLENINGVIT 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AA+ E+G IDA+A T PGLVG+LL+G T ++LA P + +HH+EGHL A Sbjct: 61 AAISEAGCDWSAIDAIAVTCAPGLVGSLLIGVTAAKTLALVHQKPLLGIHHLEGHLYASY 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + E PF+ LLVSGGHT LI V G G+Y+L G++ DDAAGEA+DK A+L+GL YPGG Sbjct: 121 LAEPTLEPPFLCLLVSGGHTSLIGVYGCGEYQLFGQTRDDAAGEAYDKVARLMGLGYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG-----LDFSFSGLKTFAANTIRD--NGTDDQTRA 233 PLL + A QG F P P D SFSGLKT A + + + A Sbjct: 181 PLLDRWAQQGNPEAFDLPEGNIRLPDGKVHPYDASFSGLKTAVARLVAELRQTHPELPVA 240 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 D+A +F+ AV L + A GFK L + GGV+AN LR L + + + Sbjct: 241 DLAASFQKAVAQALTKRAIAAAVDHGFKTLAIGGGVAANSGLRQHLTAAAEPLGLRLIFP 300 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 CTDN AMI A F+ G + L ++ R R L E+ A Sbjct: 301 PLRLCTDNAAMIGCAAADHFQRGDRSPLDLTARSRLSLLEISA 343 >UniRef50_B9JCG8 Probable O-sialoglycoprotein endopeptidase n=86 Tax=Alphaproteobacteria RepID=GCP_AGRRK Length = 365 Score = 253 bits (646), Expect = 7e-66, Method: Compositional matrix adjust. Identities = 169/348 (48%), Positives = 213/348 (61%), Gaps = 21/348 (6%) Query: 1 MRVLGIETSCDETGIAIY---DDEKGLL-ANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 +R+LGIETSCDET AI DD ++ ++ + SQ+ H+ YGGVVPE+A+R HV Sbjct: 5 LRILGIETSCDETAAAIVERQDDGTAIVRSDVVLSQLDEHSAYGGVVPEIAARAHVEALD 64 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 LI ALK + ++ D+DA+A T+GPGL+G LLVG G++++ A P ++H+EGH Sbjct: 65 TLIDEALKRANVSLADVDAIAATSGPGLIGGLLVGLMTGKAISKATGKPLYAINHLEGHA 124 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L L D FP++ LLVSGGHTQLI V G+GQYE G +IDDA GEAFDKTAKLLGL Sbjct: 125 LTARLTDGL-AFPYLMLLVSGGHTQLILVRGVGQYERWGTTIDDALGEAFDKTAKLLGLP 183 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT---FAANTIRDNGTDDQTRA 233 YPGGP + A +G RF PRP+ LDFSFSGLKT AA TI +Q A Sbjct: 184 YPGGPAVEAAAKKGNPDRFDLPRPLVGETRLDFSFSGLKTAVRLAATTIAP--VSEQDIA 241 Query: 234 DIARAFEDAVVDTLMIKCKRALD--QTGFKR------LVMAGGVSANRTLRAKLAEMMKK 285 DI +F+ AV TL + R L ++ F + LV+AGGV+AN LR L E+ Sbjct: 242 DICASFQKAVSRTLKDRIGRGLQRFKSEFPKTAEKPALVVAGGVAANLELRRTLQELC-D 300 Query: 286 RRGEVFYARP-EFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPL 331 G F A P CTDN MIA+AG+ R GA D L V R RWPL Sbjct: 301 LNGFRFIAPPLSLCTDNAVMIAWAGLERMATGAAPDGLDVQPRSRWPL 348 >UniRef50_Q2JXG9 Probable O-sialoglycoprotein endopeptidase n=31 Tax=Bacteria RepID=GCP_SYNJA Length = 366 Score = 252 bits (644), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 149/345 (43%), Positives = 205/345 (59%), Gaps = 10/345 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGL-------LANQLYSQVKLHADYGGVVPELASRDHVR 53 +R+L IETSCDET +A+ + + L++ + SQ+ LHA YGGVVPE+A+R HV Sbjct: 2 LRLLAIETSCDETAVAVVEADAAWPTFAPRQLSSVVASQIDLHAAYGGVVPEVAARRHVE 61 Query: 54 KTVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 ++++AL+++GL ++DAVA T PGLVG+LLVG ++LA ++ P I VHH+E Sbjct: 62 TLPFVLESALQQAGLGMAEVDAVAVTCAPGLVGSLLVGLMAAKTLALLYNKPLIGVHHLE 121 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 GHL + L P + LLVSGGHT LI + G+Y+ +G + DDAAGEAFDK A+LL Sbjct: 122 GHLFSGFLAAADLRPPCLGLLVSGGHTSLIWMKDYGEYQTMGRTRDDAAGEAFDKVARLL 181 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR- 232 GL YPGGP + + A QG RF P D P D SFSGLKT ++ + Q Sbjct: 182 GLGYPGGPQIDRWAQQGDPDRFPLPEGKLDHP-YDTSFSGLKTAVLRLVQQLQQEGQELP 240 Query: 233 -ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 ADIA +F+ + L K + G L++ GGV+ANR LRA+L E +++ V Sbjct: 241 VADIAASFQACLTRVLTEKAVACAEALGLSTLLVTGGVAANRELRARLLEAGRQKGLRVV 300 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 P CTDN AMI AG+ + G T+ L + V R L E+PA Sbjct: 301 IPPPNLCTDNAAMIGAAGLCHWLRGETSPLELGVASRLTLEEIPA 345 >UniRef50_B7CBT6 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CBT6_9FIRM Length = 333 Score = 250 bits (638), Expect = 5e-65, Method: Compositional matrix adjust. Identities = 136/322 (42%), Positives = 203/322 (63%), Gaps = 11/322 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M ++GIE+SCDET +A+ D+K +L++ + SQ+ +H ++GGVVPE+ASR HV I+ Sbjct: 1 MIIIGIESSCDETAVAVVKDKKEVLSSVVASQIDVHTEFGGVVPEVASRIHVENISYCIE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ +T +D+DAVA T GPGL+G L VG ++LAFA+ P +PVHH+ GH+ A Sbjct: 61 KALKDANITMEDVDAVAVTQGPGLIGCLHVGVQAAKTLAFAYHKPLVPVHHLAGHIYANE 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + ++P +AL+VSGG+T+L+ + +E+LGE+ DDA GEAFDK A++LGL YPGG Sbjct: 121 LVVD-MKYPVLALVVSGGNTELVYMKDETSFEILGETQDDAIGEAFDKVARVLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT----FAANTIRDNGTDDQTRADIA 236 P + K+A +G + +P T + DFSFSGLK+ F R T D AD+A Sbjct: 180 PKIDKLAKEGKP-VYELAKPKT-QGRYDFSFSGLKSSVLQFTKRMERQGKTFDM--ADLA 235 Query: 237 RAFEDAVVDTLMIKCKRAL-DQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F++ +D + + + L D + V+ GGVSAN LR K+ E+ + F P Sbjct: 236 CSFQECALDEIFSRVRAVLDDHKDIRHFVVGGGVSANSRLREKVEELRNEYPEVEFTVPP 295 Query: 296 EF-CTDNGAMIAYAGMVRFKAG 316 + CTDN +MI AG + + +G Sbjct: 296 MYCCTDNASMIGVAGTIAYLSG 317 >UniRef50_Q47LN7 Probable O-sialoglycoprotein endopeptidase n=58 Tax=Bacteria RepID=GCP_THEFY Length = 347 Score = 250 bits (638), Expect = 6e-65, Method: Compositional matrix adjust. Identities = 147/319 (46%), Positives = 190/319 (59%), Gaps = 8/319 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 ++GIE+SCDETG+A + LLA+++ S V HA +GGVVPE+ASR H+ P ++ A Sbjct: 10 IMGIESSCDETGVAFVRGCE-LLADEVASSVDEHARFGGVVPEVASRAHLEAMTPTVRRA 68 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + +G+ D+DA+A T GPGL GALLVG + ++ A A D P V+H+ GH+ LE Sbjct: 69 AERAGVRLSDVDAIAVTVGPGLAGALLVGLSAAKAYALALDKPLYGVNHLVGHVAVDQLE 128 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIG-QYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 P P VALLVSGGHT L+ V + +LLGE++DDAAGEA+DK A+LL L YPGGP Sbjct: 129 HGPLPKPVVALLVSGGHTSLLLVRDLATDVQLLGETVDDAAGEAYDKVARLLNLPYPGGP 188 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR----ADIAR 237 + + A G FPR DFSFSGLKT A + D + Q R D+A Sbjct: 189 PIDRAARDGDGTAIHFPRGKWGDGTYDFSFSGLKTAVARWVED--AERQGRPVSVPDVAA 246 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 AF++AV D L K A + G + LV++GGV+AN LRA E EV RP Sbjct: 247 AFQEAVADVLTRKAVDACREHGVRHLVISGGVAANSRLRALAEERCAAAGIEVRVPRPRL 306 Query: 298 CTDNGAMIAYAGMVRFKAG 316 CTDNGAMIA G AG Sbjct: 307 CTDNGAMIAALGAEVVAAG 325 >UniRef50_B3DVR7 Metal-dependent protease with possible chaperone activity n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVR7_METI4 Length = 353 Score = 249 bits (637), Expect = 7e-65, Method: Compositional matrix adjust. Identities = 131/342 (38%), Positives = 195/342 (57%), Gaps = 8/342 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKG---LLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 M LGIE+SCDET IA+ G ++A++ +Q LH +GG+VPE A R+H + Sbjct: 3 MLWLGIESSCDETAIALVKTIAGKNVVMADRCITQAPLHKPFGGIVPEYAVREHSKNLPL 62 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L+Q+ ++ L K++ A+A T GPGL+ +LLVG R LA +P V+H+EGHL Sbjct: 63 LLQSMIRSKSLNLKEVQAIAVTEGPGLMASLLVGNAFARGLALGLGIPVFGVNHLEGHLF 122 Query: 118 APML-EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 +P + + +FPF+ L+VSGGHT L V G QY ++G +IDDAAGEAFDK A+LLGL Sbjct: 123 SPFIGREEKLKFPFLGLVVSGGHTLLARVEGPRQYSMIGSTIDDAAGEAFDKVARLLGLS 182 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN----GTDDQTR 232 YPGGP + K A +G FP + ++ +FSFSGLKT + N + + Sbjct: 183 YPGGPEIEKQAERGNPHSHNFPISLIEKNNYNFSFSGLKTAVKYFLEKNKESLSKNKEFL 242 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 AD+ +F+++V + K A + +GGV AN+ +R L + + EV + Sbjct: 243 ADVCASFQESVARVIQEKTIAAAKSFSLSLIAASGGVLANKRIRELLEKKALEEGIEVLF 302 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 A+ +FCTDN MIA+AG + + G + P + L++ Sbjct: 303 AKRQFCTDNAVMIAFAGALFYALGLPITKSFELNPNFSLSDF 344 >UniRef50_D1IZQ0 Whole genome shotgun sequence of line PN40024, scaffold_48.assembly12x (Fragment) n=15 Tax=Magnoliophyta RepID=D1IZQ0_VITVI Length = 468 Score = 249 bits (636), Expect = 9e-65, Method: Compositional matrix adjust. Identities = 142/359 (39%), Positives = 203/359 (56%), Gaps = 28/359 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+T AI +L+ + SQ L A YGGV P++A H++ ++Q A Sbjct: 77 VLGIETSCDDTAAAIVRSNGDILSQVVSSQADLLARYGGVAPKMAEGAHMQVIDRVVQDA 136 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ + LT +D+ AVA T GPGL L VG R +A + ++P + VHHME H L L Sbjct: 137 LENANLTERDLSAVAVTIGPGLSLCLRVGVQKARKIAGSHNLPIVGVHHMEAHALVARLI 196 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--PGG 180 + +FPF+ALL+SGGH LI +G Y LG +IDDA GEA+DKTAK LGLD GG Sbjct: 197 EKDLQFPFMALLISGGHNLLILARDLGHYIQLGTTIDDAIGEAYDKTAKWLGLDLRRSGG 256 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ---------- 230 P + ++A +G A F PM +FS++GLKT I + + Sbjct: 257 PAIEELAREGDAKAVKFSTPMKQHKDCNFSYAGLKTQVRLAIESRNINAEIPISSASSED 316 Query: 231 --TRADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEMM 283 +RADIA +F+ V L +C+RA++ + K LV++GGV++N+ +RA+L +++ Sbjct: 317 RSSRADIAASFQRVAVLHLEERCERAIEWALKIEPSIKHLVVSGGVASNQYVRAQLDQVV 376 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG---------ATADLGVSVRPRWPLAE 333 KK+ ++ P CTDNG M+A+ G+ F+ G D +RPRWPL E Sbjct: 377 KKKSLQLVCPPPSLCTDNGVMVAWTGLEHFRMGRYDPPPPANEPEDYVYDLRPRWPLGE 435 >UniRef50_B1V8Z6 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Candidatus Phytoplasma RepID=GCP_PHYAS Length = 328 Score = 248 bits (633), Expect = 2e-64, Method: Compositional matrix adjust. Identities = 127/312 (40%), Positives = 184/312 (58%), Gaps = 4/312 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCDET +AI D K +L+N ++SQ+K H +GGVVPE+ASR HV +++ Sbjct: 1 MNILSIETSCDETSVAITQDGKKVLSNIVFSQIKDHQMFGGVVPEIASRKHVELITLILE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +++ LT ++ID VA T GPGLVG+LLVG A+ + P + ++H+ GHL A Sbjct: 61 KAFQKACLTPQEIDLVAVTQGPGLVGSLLVGINAANVFAYTYQKPLLGINHLLGHLYAAQ 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E + LLVSGGHT+L+ Q E+LG ++DDA GE +DK AK L L YPGG Sbjct: 121 IEHQIKPNALI-LLVSGGHTELLHFKNHDQIEVLGTTLDDALGEVYDKIAKALHLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 PL+ ++A G + RP +FSFSGLK+ N + D +I +F+ Sbjct: 180 PLIDQLAQTG-KDTYHLVRPYLKNNNFNFSFSGLKSHLVNLLLKQNIQDLNIPNICASFQ 238 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+D L+ K KR L + ++L++ GGV++N LR K+ E EV + ++CTD Sbjct: 239 ASVIDVLLTKTKRVLKKLPIQQLIVTGGVASNSALRKKMKETFLDL--EVIFPSVQYCTD 296 Query: 301 NGAMIAYAGMVR 312 AMI A + Sbjct: 297 QAAMIGIAAFYQ 308 >UniRef50_B2KE20 Metalloendopeptidase, glycoprotease family n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KE20_ELUMP Length = 342 Score = 248 bits (633), Expect = 2e-64, Method: Compositional matrix adjust. Identities = 135/336 (40%), Positives = 200/336 (59%), Gaps = 16/336 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +LGIET+CDET AI + L++N +++Q+ +H Y GVVPELASR H K +++ Sbjct: 7 ITILGIETTCDETSAAILKSGRDLVSNVVHTQIDIHKKYCGVVPELASRAHAVKVAEVVK 66 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ID VA+ +GPGL G L+VG +++ +VP I V+H+EGHL A Sbjct: 67 EALGNH-----KIDLVAFASGPGLPGGLMVGRVAAEAVSALKNVPIIGVNHLEGHLFACE 121 Query: 121 LE--------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL 172 + D +FP +AL+VSGGHT+L V G Y++LG + DDAAGEAFDK AKL Sbjct: 122 FDAKEGKIAADKQLKFPLIALIVSGGHTELWYVKNYGDYKMLGRTRDDAAGEAFDKVAKL 181 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL YPGGP+++K A +G FPRPM + +FSFSG+KT + +RD+ D + Sbjct: 182 LGLGYPGGPVVAKEALKGNPEAIKFPRPMM-KGTFEFSFSGIKTAVSYYLRDH--KDIKK 238 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 D+ +F+ A+V+TL+ K +A+ + K + + GGV+AN L+ + + +K +V + Sbjct: 239 EDVCASFQAAMVETLVAKTFQAVKKYKVKNVAVGGGVAANELLKESMVKRGQKEGVDVSF 298 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +DNGAMIA AG +F + + + P Sbjct: 299 VPRALSSDNGAMIALAGYKKFMFAGKFNANIRINPN 334 >UniRef50_A8GM49 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Rickettsia RepID=GCP_RICAH Length = 386 Score = 248 bits (633), Expect = 2e-64, Method: Compositional matrix adjust. Identities = 137/384 (35%), Positives = 209/384 (54%), Gaps = 51/384 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +++LGIE+SCD+T ++I + + +L+N + SQ HA +GGVVPE+A+R H+ + Sbjct: 2 IKILGIESSCDDTAVSIITENREILSNIIISQNTEHAVFGGVVPEIAARSHLSNLDKALT 61 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 LKES +I A+A T+GPGL+G ++VG+ RSL+ P I ++H+EGH L Sbjct: 62 NVLKESNTKLIEISAIAATSGPGLIGGVIVGSMFARSLSSTLKKPFIAINHLEGHALTAR 121 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L DN P +P++ LL SGGH Q ++V G+G+Y++LG +IDDA GE FDK AK+L L +PGG Sbjct: 122 LTDNIP-YPYLLLLASGGHCQFVAVLGLGKYKILGSTIDDAVGETFDKVAKMLNLAFPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT------------------------ 216 P + + A G ++ FP+P+ + + SFSGLKT Sbjct: 181 PEIEQKAKLGDPHKYKFPKPIINSGNCNMSFSGLKTAVRTLIMNLQEINYNECNHLESVR 240 Query: 217 -------FA------------ANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCK---RA 254 FA N++ + +D DIA +F+ + + L K + RA Sbjct: 241 QDEVQEEFAQRTKVHEHRRKLQNSLVSSFLNDSVINDIAASFQFTIGEILSSKVQDAIRA 300 Query: 255 LDQ--TGF--KRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGM 310 +Q F K +++AGGV+AN+ L+ L+ K ++ Y CTDN AMIAYAG+ Sbjct: 301 YEQITNNFDKKNIIIAGGVAANKYLQEILSNCAKTYGYQLIYPPIHLCTDNAAMIAYAGL 360 Query: 311 VRFKAGATADLGVSVRPRWPLAEL 334 R+ L + RW L ++ Sbjct: 361 ERYNNKLFTPLNFCPKARWSLEDI 384 >UniRef50_Q7UM42 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Planctomycetaceae RepID=GCP_RHOBA Length = 358 Score = 247 bits (630), Expect = 5e-64, Method: Compositional matrix adjust. Identities = 139/344 (40%), Positives = 195/344 (56%), Gaps = 20/344 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE++CDET A+ + +L + +Q LH +GGVVPE+A+R H+ + +P+I A Sbjct: 10 LLSIESTCDETAAAVIRRDGTVLGQCIATQETLHEQFGGVVPEIAARAHLERILPVIDTA 69 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ++ + +D+ A+A PGL G+LLVG ++LA AW+ P I ++H+ HL A L Sbjct: 70 LTQAKVRGEDLTAIAVADRPGLAGSLLVGVVAAKTLALAWNKPLISLNHLHAHLYACQLI 129 Query: 123 DNPPE--FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + P +P + L+VSGGHT L E LG +IDDAAGEAFDK A +L L +PGG Sbjct: 130 EGAPANIYPAIGLIVSGGHTSLYVCRTAIDLEYLGGTIDDAAGEAFDKVAAMLSLPFPGG 189 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG--------TDDQTR 232 ++K+A+QG + FPR M PG DFSFSGLKT I G DQ + Sbjct: 190 IEVAKLASQGNDKAYSFPRSMIHDPGDDFSFSGLKTAVRYAIVGPGRQDFASLDISDQVK 249 Query: 233 ADIARAFEDAVVDTLMIKCKRALD---------QTGFKRLVMAGGVSANRTLRAKLAEMM 283 D+ +FE AVVD L+ KC+RA+ Q RL++ GGV+AN+ LR L Sbjct: 250 RDVCASFEAAVVDVLVSKCRRAIKRHRNRNNDPQNSINRLIVGGGVAANQRLRRDLQAAA 309 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 K E++ A P CTDN M A A +F+A A L + + P Sbjct: 310 DKDGFELWIAPPHLCTDNAVMGAIA-WKKFEAEQFASLDLDITP 352 >UniRef50_B6JAE9 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Alphaproteobacteria RepID=GCP_OLICO Length = 357 Score = 246 bits (628), Expect = 8e-64, Method: Compositional matrix adjust. Identities = 147/341 (43%), Positives = 205/341 (60%), Gaps = 11/341 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKG----LLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIET+CDET A+ + + +L+N + SQ+ HA +GGVVPE+A+R HV Sbjct: 1 MLVLGIETTCDETAAAVIERQADGSGRILSNIVRSQIAEHAPFGGVVPEIAARAHVEMLD 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ A++E+G+ +D +A AGPGL+G ++VG T +++A D P I V+H+E H Sbjct: 61 VLVDRAMREAGVDFAQLDGIAAAAGPGLIGGVIVGLTTAKAIALVHDTPLIAVNHLEAHA 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L P L P FP+ L SGGHTQ+++V G+G+Y +G ++DDA GEAFDK AK+L L Sbjct: 121 LTPRLT-VPLAFPYCLFLASGGHTQIVAVLGVGEYVRIGTTVDDALGEAFDKVAKMLDLP 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI-RDNGTDDQTRADI 235 YPGGP + + A +G RF FPRPM R +FS SGLKT N R + Q AD+ Sbjct: 180 YPGGPQVERAAREGDPTRFDFPRPMLGRKDANFSLSGLKTAVRNEASRLMPLELQDIADL 239 Query: 236 ARAFEDAVVDTLMIKCKRAL----DQTGFKR-LVMAGGVSANRTLRAKLAEMMKKRRGEV 290 +F+ AV+D++ + + L +Q G R LV AGGV+AN +R L E+ + Sbjct: 240 CASFQAAVLDSIADRIRSGLRLFREQFGTPRALVAAGGVAANVAIRNALQEIAADDEITM 299 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 P+ CTDNGAMIA+AG R G T + + R RW L Sbjct: 300 IVPPPQLCTDNGAMIAWAGAERLALGLTDTMEAAPRARWKL 340 >UniRef50_B5ZLG0 Metalloendopeptidase, glycoprotease family n=11 Tax=Rhodospirillales RepID=B5ZLG0_GLUDA Length = 382 Score = 246 bits (627), Expect = 1e-63, Method: Compositional matrix adjust. Identities = 147/346 (42%), Positives = 198/346 (57%), Gaps = 16/346 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCD+T AI + +LA + SQ H +GGVVPE+A+R H+ L++ Sbjct: 30 ILAIESSCDDTACAILAPDGTILAETVLSQAG-HVPFGGVVPEIAARAHLAALPALVRHT 88 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L + L A+ + A+A + GPGL+G L+VGA + + LA A P + V+H+E H L L Sbjct: 89 LDVAALPAEALGAIAASTGPGLIGGLIVGAGMAKGLAVALGRPFVAVNHIEAHALTARLP 148 Query: 123 DNPP---EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 P FP++ LLVSGGH Q I+V G+G+Y LG +IDDAAGEAFDK AK+LGL +PG Sbjct: 149 GLVPGGASFPYLLLLVSGGHCQCIAVEGVGRYRKLGGTIDDAAGEAFDKVAKMLGLGWPG 208 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR---ADIA 236 GP + +A +G + PRP+ RPG DFSFSGLKT A + R A IA Sbjct: 209 GPAVEALAREGDPAPWPLPRPLRGRPGCDFSFSGLKTAVAQKLAPFAAGALPRTAAAGIA 268 Query: 237 RAFEDAVVDTLMIKCKRALD-QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F+DAV D + + ALD LV AGGV+AN LR +L + R F A P Sbjct: 269 ASFQDAVADIVADRVAHALDMMPQATLLVAAGGVAANTALRTRL-TTLATSRALPFAAPP 327 Query: 296 -EFCTDNGAMIAYAGMV------RFKAGATADLGVSVRPRWPLAEL 334 CTDN M+ +A + R T DL + RPRWPL ++ Sbjct: 328 LRLCTDNAVMVGWAAIETLRERRRLGLPPTDDLDLLPRPRWPLEQM 373 >UniRef50_B0VHD4 Putative metalloendopeptidase, , glycoprotease family n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VHD4_9BACT Length = 338 Score = 246 bits (627), Expect = 1e-63, Method: Compositional matrix adjust. Identities = 128/326 (39%), Positives = 197/326 (60%), Gaps = 5/326 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L E+SCD+T +AI D + ++ N + SQ + H ++GG++PELASR H++ V L +AA Sbjct: 6 ILAFESSCDDTSVAIVDTDYNVIVNLISSQPE-HLEFGGILPELASRLHLKNIVTLTKAA 64 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L S L +DI A+A + PGL+G+L+VG + LA++ +P I V+H+ H+ A +E Sbjct: 65 LNASKLNLQDISAIAVSINPGLIGSLIVGLAFAKGLAWSLSLPLITVNHILSHIFANFIE 124 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 E PF+AL+VSGGHT+L+ + + ++G+++DDAAGE+FDK AKLLGL +PGGP Sbjct: 125 HKAVEPPFLALVVSGGHTELVHFDTLTTFTVVGKTLDDAAGESFDKAAKLLGLGFPGGPA 184 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA---DIARAF 239 + ++A +G FPR + + +FS+SGLKT A T N + +A DIA + Sbjct: 185 IDELAQKGNPNFIKFPRALPQKNNFNFSYSGLKT-AIRTWLVNQNPETLQAELPDIAASV 243 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + A++D L+ K Q +++AGGV+AN LR +L K +VFY C Sbjct: 244 QQAIIDPLVHKTVLWARQHKIPYILLAGGVAANSALRQQLTTTSAKYGIKVFYPSNALCM 303 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSV 325 DN AM+ A + +F A L ++V Sbjct: 304 DNAAMVGAAAIPKFLTKNYAPLSINV 329 >UniRef50_B1ZYF9 Metalloendopeptidase, glycoprotease family n=3 Tax=Verrucomicrobia RepID=B1ZYF9_OPITP Length = 349 Score = 246 bits (627), Expect = 1e-63, Method: Compositional matrix adjust. Identities = 139/346 (40%), Positives = 203/346 (58%), Gaps = 17/346 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +E+SCDET +A++D +GL+ ++SQ+ LH +GGVVP+LA+R+H+R PL++ A Sbjct: 2 ILALESSCDETAVAVFDPARGLVGEWVHSQIALHERHGGVVPDLATREHLRHFAPLLERA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ + I VA T GPGL L +G ++LA W VP + V+H+ GH+ +P + Sbjct: 62 --QAAVPFDAITQVAVTNGPGLAACLAIGVAAAKALALQWRVPLVGVNHLRGHVWSPFIR 119 Query: 123 ---DNPPEF--------PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 D P EF P + L+VSGG+T L +V Q +L + DDAAGEA DK AK Sbjct: 120 LHADAPAEFGDRLAALLPHLGLIVSGGNTLLFAVDRARQVTVLSTTRDDAAGEALDKGAK 179 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-- 229 LLGL YPGGPL+ K+AA G A + FPR + R LDFSFSGLKT I ++ Sbjct: 180 LLGLSYPGGPLIEKLAATGRADAYDFPRGIGRRDELDFSFSGLKTSLRYLIEKLSPEEVV 239 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQ--TGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 R+D+ +++ AVVD L+ K + AL Q ++ L ++GGV+ NRTLRA L ++ + Sbjct: 240 ARRSDLCASYQQAVVDALVRKTRAALRQGEGDYRSLGLSGGVANNRTLRAALEREAQRSQ 299 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 F A+P+ DN MIA+A A + ++V P + E Sbjct: 300 IPFFAAQPQHTGDNAGMIAFAAWADSAGTDAAGMKLTVEPSATIGE 345 >UniRef50_A1BJ68 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Chlorobiaceae RepID=GCP_CHLPD Length = 353 Score = 244 bits (622), Expect = 4e-63, Method: Compositional matrix adjust. Identities = 131/320 (40%), Positives = 191/320 (59%), Gaps = 16/320 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCDET A+ + G + + + S H +GGVVPELASR+H R V ++ Sbjct: 1 MKILGIETSCDETSAAVLSN--GSVCSNIVSSQLCHTSFGGVVPELASREHERLIVSIVD 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +AL E+ +T D+D +A TAGPGL+GA++VG G+++A+A +P +PV+H+E H+ + Sbjct: 59 SALSEANITKNDLDVIAATAGPGLIGAVMVGLCFGQAMAYALAIPFVPVNHIEAHIFSAF 118 Query: 121 LEDNP----PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 +++ P PE F++L VSGGHT L V YE++G ++DDAAGEAFDKT K+LGL Sbjct: 119 IQETPHHQAPEGDFISLTVSGGHTLLSHVHKDFTYEVIGRTLDDAAGEAFDKTGKMLGLP 178 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTD--------RPGLDFSFSGLKTFAANTIRDNGTD 228 YP GP++ ++A G FPR +T R DFSFSGLKT ++ + Sbjct: 179 YPAGPVIDRLAKNGDPFFHEFPRALTAHSQTSKNYRGNSDFSFSGLKTSVLTFLKKQSPE 238 Query: 229 --DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 ++ DIA + + A+V L+ K A K + +AGGVSAN LR + + ++ Sbjct: 239 FIEKHLPDIAASVQKAIVSVLVEKTVSAALAGNVKAISIAGGVSANSALRTSMKKACEQH 298 Query: 287 RGEVFYARPEFCTDNGAMIA 306 E+ TDN AMIA Sbjct: 299 GIAFHVPNAEYSTDNAAMIA 318 >UniRef50_D0WGH2 O-sialoglycoprotein endopeptidase n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WGH2_9ACTN Length = 807 Score = 244 bits (622), Expect = 4e-63, Method: Compositional matrix adjust. Identities = 141/340 (41%), Positives = 188/340 (55%), Gaps = 9/340 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 + E+SCDET +I + +L++ + SQV HA +GGVVPE+ASR H+ + Sbjct: 466 ICAFESSCDETASSIIAGDGTILSDVVASQVDFHARFGGVVPEIASRKHIEAICGVADEC 525 Query: 63 LKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L+ + L +D+DA+A T PGLVGAL+VG + + LA+ +VP + V+H+EGH Sbjct: 526 LERAAVALGRPSLRWRDLDAIAVTYAPGLVGALVVGVSFAKGLAWGSEVPLVAVNHLEGH 585 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 L A + D P V LVSGGHT L+ V YE LG +IDDAAGEAFDK +K LGL Sbjct: 586 LYANKIADPAIAPPMVVSLVSGGHTMLVHVKDWANYETLGSTIDDAAGEAFDKVSKALGL 645 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRA 233 YPGGP++S+ AA+G FPR + L FS SGLKT I Sbjct: 646 GYPGGPIISRYAAKGNPRAIDFPRALMHSGDLRFSLSGLKTAVITYIHKQQEAGMPLNIP 705 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 DIA +F+ AVVD + K + AL +TG + + GGV+AN LRA +M K + Sbjct: 706 DIAASFQQAVVDVQVAKARTALIETGSRTFCLGGGVAANPALRAAYEKMCAKNGFRLVMP 765 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN AMIA + RF G AD + V+ PL E Sbjct: 766 PLSACGDNAAMIAEVALDRFAQGKLADFTLDVKAHAPLDE 805 >UniRef50_Q6MD07 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Parachlamydiaceae RepID=GCP_PARUW Length = 343 Score = 243 bits (620), Expect = 6e-63, Method: Compositional matrix adjust. Identities = 136/331 (41%), Positives = 191/331 (57%), Gaps = 16/331 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIE++CDET AI D K +L+N + SQ+ LH +YGGVVPELA R H+ +P+I Sbjct: 1 MLVLGIESTCDETACAIVRDGKDILSNIVASQIDLHKEYGGVVPELACRRHIDLIIPVID 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ LT + ID +A GPGL+GALL+G ++LA A P I ++H+E HL A + Sbjct: 61 QALNQAKLTLEQIDLIAVANGPGLIGALLIGLNTAKALALALRKPFIGINHVEAHLYAAI 120 Query: 121 LEDNPP--EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 + +P +FP + +++SGGHT L+ + IGQYEL+G+++DDA GEAFDK AK+L L YP Sbjct: 121 MS-HPQDFQFPCLGVVLSGGHTALVLIKQIGQYELIGQTVDDAVGEAFDKVAKMLNLPYP 179 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-------DDQT 231 GGP + +A G + +F F LDFSFSGLKT I+D + Sbjct: 180 GGPEIENLARHGRSVKFNFKAGQVKGRPLDFSFSGLKTAVLYAIKDPKALKEMVLLSSEM 239 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 DIA +F++A ++ K A Q G L+ GGV+ N LR ++ + Sbjct: 240 TQDIAASFQEAACSDIVKKSLLAAKQYGVNTLLFGGGVTNNCYLR----KLFSVANSNLN 295 Query: 292 YARPE--FCTDNGAMIAYAGMVRFKAGATAD 320 Y P DN AMIA G R++ +D Sbjct: 296 YIWPSAGLSLDNAAMIAGLGYYRYQLQNKSD 326 >UniRef50_Q5FLZ3 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Lactobacillus RepID=GCP_LACAC Length = 349 Score = 243 bits (619), Expect = 9e-63, Method: Compositional matrix adjust. Identities = 135/343 (39%), Positives = 205/343 (59%), Gaps = 11/343 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R+L E+SCDET A+ + + + + + +Q+K H +GGVVPE+ASR H+ + + Sbjct: 7 VRILAYESSCDETSTAVIKNGREIESLIVATQIKSHQRFGGVVPEVASRHHIEVVSQITK 66 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL E+ + KDIDA+A T GPGLVGALL+G + ++++ A +P I V H+ GH++A Sbjct: 67 EALNEANCSWKDIDAIAVTYGPGLVGALLIGVSAAKAVSMATGIPLIGVDHIMGHIMAAQ 126 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+D E+P +AL VSGGHT+++ + +E++G++ DDAAGEA+DK ++LG++YP G Sbjct: 127 LKDE-IEYPAIALQVSGGHTEIVLLKDPTHFEIIGDTRDDAAGEAYDKIGRVLGVNYPAG 185 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARA 238 + A QG F FPR M + DFSFSGLK+ NT D + + D+A + Sbjct: 186 KTIDAWAHQGK-DTFNFPRAMLEDDDYDFSFSGLKSAFINTCHHADQIHEKLNKYDLAAS 244 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM----KKRRGEVFYAR 294 F+ AV+D L K RA+ + K +M GGV+AN+ LR +++E + K + +V Sbjct: 245 FQAAVIDVLAHKTIRAIKEYKPKTFIMGGGVAANQGLRDRMSEEIAKLPKADQPKVILPD 304 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 + C DN AMI A + G ADL ++ P ELP A Sbjct: 305 LKLCGDNAAMIGAAAYNLYNGGQFADLTLNADPSL---ELPYA 344 >UniRef50_A0JZ01 Probable O-sialoglycoprotein endopeptidase n=98 Tax=Bacteria RepID=GCP_ARTS2 Length = 356 Score = 243 bits (619), Expect = 9e-63, Method: Compositional matrix adjust. Identities = 151/342 (44%), Positives = 202/342 (59%), Gaps = 21/342 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE+SCDETG+ I LL+N + S ++ H +GGV+PE+ASR H+ VP +Q A Sbjct: 8 VLGIESSCDETGVGIVRG-TALLSNTVSSSMEEHVRFGGVIPEIASRAHLDAFVPTLQEA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L ++G+ D+DA+A T+GPGL GAL+VG ++LA A P ++H+ H+ +L Sbjct: 67 LADAGVQLDDVDAIAVTSGPGLAGALMVGVCAAKALAVATGKPLYAINHLVAHVGVGLLQ 126 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGI-GQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 E+ PE ALLVSGGHT+++ + I ELLG +IDDAAGEA+DK A+LLGL YPG Sbjct: 127 EENTLPEH-LGALLVSGGHTEILRIRSITDDVELLGSTIDDAAGEAYDKVARLLGLGYPG 185 Query: 180 GPLLSKMAAQGTAGRFVFPRPMT--------DRPG---LDFSFSGLKTFAANTIR--DNG 226 GP + K+A G A FPR +T D PG D+SFSGLKT A + + Sbjct: 186 GPAIDKLARTGNAKAIRFPRGLTQPKYMGTADEPGPHRYDWSFSGLKTAVARCVEQFEAR 245 Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 D+ ADIA AF++AVVD + K A + G L++ GGV+AN LR +L E + Sbjct: 246 GDEVPVADIAAAFQEAVVDVITSKAVLACTENGITELLLGGGVAANSRLR-QLTEQRCRA 304 Query: 287 RGEVFYARP-EFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 G P E CTDNGAM+A G AG G+S P Sbjct: 305 AGIRLTVPPLELCTDNGAMVAALGAQLVMAGIEPS-GISFAP 345 >UniRef50_D1N4S8 Metalloendopeptidase, glycoprotease family n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N4S8_9BACT Length = 359 Score = 242 bits (617), Expect = 2e-62, Method: Compositional matrix adjust. Identities = 140/347 (40%), Positives = 206/347 (59%), Gaps = 20/347 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDET A+ D +L++ + SQ+ HA +GGVVPELA+R+H+ P+++ A Sbjct: 4 ILGIESSCDETAAAVVRDGYQVLSSCVASQIAKHAVHGGVVPELAAREHLVALNPVVEGA 63 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+G+T K+IDA+A T GPGL+ ALLVG + + LA P I V+H H+ L+ Sbjct: 64 LREAGVTMKEIDAIAVTQGPGLIPALLVGLSFAKGLAMGNGKPLIGVNHFIAHIYGAFLD 123 Query: 123 ------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 +NP +P +AL+VSGGHT L+ + G+ LG +IDDAAGEA DK AKLLGL Sbjct: 124 EAHGVLENPATYPLLALVVSGGHTSLMLIERDGKARQLGCTIDDAAGEALDKGAKLLGLG 183 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPG--------LDFSFSGLKTFAANTIRDN-GT 227 YPGGP++ K A G ++ FPRP+T G +FSFSG+KT ++ + G Sbjct: 184 YPGGPIMQKTAEGGDPHKYEFPRPLTGGAGKPLAPENLYNFSFSGIKTALLYHVKHHAGA 243 Query: 228 DDQTRADIAR----AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM 283 D + A++ + ++++AVVD L K A G K +V+AGGV+ N LR + E + Sbjct: 244 DGKLPAELLQDTVASYQEAVVDVLTRKTLLAAKNFGAKTIVVAGGVACNSVLRERF-EAL 302 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWP 330 + ++ A ++CTDN AM+ G + A + L + R P Sbjct: 303 TPKHVQLRLAARKYCTDNAAMVGGLGWHYHRKQAYSPLNIDSFARLP 349 >UniRef50_C7LR95 Metalloendopeptidase, glycoprotease family n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LR95_DESBD Length = 356 Score = 240 bits (613), Expect = 5e-62, Method: Compositional matrix adjust. Identities = 141/342 (41%), Positives = 200/342 (58%), Gaps = 16/342 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A++DD L+ + +++Q+ +H+ +GGVVPELASR+H+R L+ Sbjct: 1 MICLGIETSCDETSVALWDDGH-LVTDLVHTQIPMHSVFGGVVPELASREHLRLLDGLVS 59 Query: 61 AALKESGLTA-KDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + L+ + A + ID +A T GPGL+GALLVG + +SL+ + VP I V+H+ HLLA Sbjct: 60 SVLQSAERPAGQGIDLIAVTRGPGLLGALLVGISYAKSLSLSLGVPVIGVNHLYAHLLAC 119 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + P E+P + +LVSGGHT + + ++ LLG+++DDAAGEAFDK AKLL L YPG Sbjct: 120 DFTE-PIEYPALGVLVSGGHTHIYEMPAPCEFNLLGKTLDDAAGEAFDKIAKLLNLPYPG 178 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR-------DNGTDD--- 229 G + +A GTA +F +P DFSFSGLKT A + D D Sbjct: 179 GKYIDILARLGTADPRLFSKPYLQNDNCDFSFSGLKTAVAQYVHKKSFAAIDYAAFDVEL 238 Query: 230 --QTRADIARAFEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEMMKKR 286 Q D+ + +V+TL+ K +RA+ + K L +AGGV+AN LR K + R Sbjct: 239 IPQEIKDLCATVNETIVETLLEKTRRAVARCHDVKTLCLAGGVAANSHLRHKFSAFAHAR 298 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +C DN AMIAYAG+ K G + + PR Sbjct: 299 GFKFLAPAQNYCGDNAAMIAYAGVQWAKKGLMSSMDFEAVPR 340 >UniRef50_D0N6Q4 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0N6Q4_PHYIN Length = 374 Score = 240 bits (612), Expect = 6e-62, Method: Compositional matrix adjust. Identities = 136/351 (38%), Positives = 201/351 (57%), Gaps = 19/351 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGIETSCD+T A+ D + +L+N + SQ +L+A + G+VP LA+R H +I AA Sbjct: 21 TLGIETSCDDTAAAVLDQDGRVLSNVISSQWELNAKWRGIVPALAARAHENNLPHVINAA 80 Query: 63 LKESGLTA-KDIDAVAYTAGPGLVGALLVGATVGRSLAF-AWDVPAIPVHHMEGHLLA-- 118 L++SGL + + + AVA T+GPGL L VG R + D+ + ++H+E H+L Sbjct: 81 LEQSGLESLQQLSAVAVTSGPGLAPCLDVGLRTARQICLDNPDIAFLQINHLEAHVLVSR 140 Query: 119 -PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL---- 173 P LE PEFPFV LLVSGGH L+ G+G YELLG ++DD+ GEA+DK A++L Sbjct: 141 LPQLETPRPEFPFVVLLVSGGHCCLVLAKGLGDYELLGNTLDDSIGEAYDKVARMLDITA 200 Query: 174 --GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-DDQ 230 G GG L+ MAA+G F F PM R DFS+SG+KT ++ G D++ Sbjct: 201 SSGKGVHGGKLIEDMAARGNDRAFPFTEPMKHRKDCDFSYSGIKTAMLREVKKLGELDEK 260 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQT------GFKRLVMAGGVSANRTLRAKLAEMMK 284 + D+ +F+ VD L+ + +RA + LV+ GGV++N+ LR ++ Sbjct: 261 MKEDLCASFQRKAVDQLITRTRRACQWSKDRLGDNITSLVVCGGVASNQYLRDRMQAAAA 320 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAEL 334 + + ++CTDNG M+A+AG+ R+ G +D +PRWPL L Sbjct: 321 EEEVAAVFPPAKYCTDNGVMVAWAGLERYAKGMRSDPEPARYQPRWPLETL 371 >UniRef50_B2S3R9 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Treponema RepID=GCP_TREPS Length = 352 Score = 239 bits (609), Expect = 1e-61, Method: Compositional matrix adjust. Identities = 139/332 (41%), Positives = 188/332 (56%), Gaps = 8/332 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCDET +AI D + +N + +Q+ HA Y G+VPELASR H+ +P ++ Sbjct: 1 MNVLGIETSCDETAVAIVKDGTHVCSNVVATQIPFHAPYRGIVPELASRKHIEWILPTVK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + LT DID +A T PGL G+LLVG T ++LA++ +P I V+H+ H A Sbjct: 61 EALARAQLTLADIDGIAVTHAPGLTGSLLVGLTFAKTLAWSMHLPFIAVNHLHAHFCAAH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E + +P+V LL SGGH + V Q E LG +IDDA GEAFDK A G YPGG Sbjct: 121 VEHD-LAYPYVGLLASGGHALVCVVHDFDQVEALGATIDDAPGEAFDKVAAFYGFGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG--LDFSFSGLKTFAANTIRD--NGTDDQTRADIA 236 ++ +A QG A FP P G D S+SGLKT + + N ++T +IA Sbjct: 180 KVIETLAEQGDARAARFPLPHFHGKGHRYDVSYSGLKTAVIHQLDHFWNKEYERTAQNIA 239 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 AF+ ++ L+ RAL TG V+ GGV+AN LR +A+ R VF +R E Sbjct: 240 AAFQACAINILLRPLARALQDTGLPTAVVCGGVAANSLLRKSVADWKHARC--VFPSR-E 296 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +CTDN M+A G G + GV+ R R Sbjct: 297 YCTDNAVMVAALGYRYLIRGDRSFYGVTERSR 328 >UniRef50_B8BPP0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BPP0_ORYSI Length = 401 Score = 238 bits (607), Expect = 2e-61, Method: Compositional matrix adjust. Identities = 138/360 (38%), Positives = 201/360 (55%), Gaps = 29/360 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T A+ + +L+ + SQ L +GGV P++A H ++Q A Sbjct: 12 MLGIETSCDDTAAAVVRGDGEILSQVVSSQEDLLVRWGGVAPKMAEEAHSLAIDQVVQKA 71 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ++ ++ D+ AVA T GPGL L VG R +A ++ +P + VHHME H L L Sbjct: 72 LDDANVSENDLSAVAVTVGPGLSLCLRVGVHKARKIAKSFRLPIVGVHHMEAHALVSRLV 131 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP--GG 180 + +FPF+ALL+SGGH L+ G+GQY LG +IDDA GEA+DK+A+ LGLD GG Sbjct: 132 NKDLDFPFLALLISGGHNLLVLAHGLGQYVQLGTTIDDAIGEAYDKSARWLGLDMRKGGG 191 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDD--------- 229 P L ++A +G F PM +FS++GLKT I R+ TDD Sbjct: 192 PALEQLALEGDPNAVKFSVPMRQHKDCNFSYAGLKTQVRLAIESRNISTDDIPISSATKD 251 Query: 230 --QTRADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEM 282 Q RA+IA +F+ V L +C+RA++ + K V++GGV++N+ +R L ++ Sbjct: 252 DRQIRANIAASFQRVAVLHLEERCQRAVEWALKMEPSIKYFVVSGGVASNQYVRTHLNQI 311 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG---------ATADLGVSVRPRWPLAE 333 +K ++ P+ CTDNG MIA+ G+ F AG D+ +RPRWPL E Sbjct: 312 AEKNGLQLVCPPPKLCTDNGVMIAWTGIEHFIAGRFDDPPAVDEPDDMQYDLRPRWPLGE 371 >UniRef50_Q30ZN1 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Proteobacteria RepID=GCP_DESDG Length = 367 Score = 236 bits (603), Expect = 7e-61, Method: Compositional matrix adjust. Identities = 141/356 (39%), Positives = 195/356 (54%), Gaps = 30/356 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR LGIE+SCDET +AI DD + L+ + +Q +LHA +GGVVPELASR+H R + Sbjct: 1 MRCLGIESSCDETALAIVDDGR-LVDAVMSTQAELHALFGGVVPELASREHYRLIGRMFD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + + GL +DID ++ GPGL+G+LLVG + LA A + V+H+ HLLA Sbjct: 60 SLMLRCGLGVQDIDVISVARGPGLLGSLLVGVGFAKGLALAGGQRLVGVNHLHAHLLAAG 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE FP + +LVSGGHT L + + L+G ++DDAAGEAFDK AK+L L YPGG Sbjct: 120 LEHRL-VFPALGVLVSGGHTHLYRIDSPRNFTLVGRTLDDAAGEAFDKVAKMLNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG--------TDDQTR 232 + + +FPRP TD LDFSFSGLKT + ++ +G + + + Sbjct: 179 RFIDVLGHMADPDDSMFPRPYTDNDNLDFSFSGLKTAVSTWLKAHGGTALAAPPAESELQ 238 Query: 233 AD----------------IARAFEDAVVDTLMIKCKRALDQTG----FKRLVMAGGVSAN 272 A + +F AV DTL IK +RAL + G + +V+AGGV+AN Sbjct: 239 AMLQNNVLPSGMPADMPLVCASFNAAVADTLYIKARRALQRLGGRGQIRSVVVAGGVAAN 298 Query: 273 RTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +R + + + + P CTDNGAMIAY G + G L + PR Sbjct: 299 SRVRTSMQRLAAEEGLHLHLPSPALCTDNGAMIAYTGWLLASEGLHHSLELETMPR 354 >UniRef50_Q045T6 Probable O-sialoglycoprotein endopeptidase n=433 Tax=cellular organisms RepID=GCP_LACGA Length = 348 Score = 236 bits (602), Expect = 8e-61, Method: Compositional matrix adjust. Identities = 134/343 (39%), Positives = 204/343 (59%), Gaps = 11/343 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R+L E+SCDET A+ + + + + + +Q+K H +GGVVPE+ASR H+ + + Sbjct: 6 IRILAFESSCDETSTAVIKNGREIESLIVATQIKSHQRFGGVVPEVASRHHIEVITQITK 65 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL E+ T DIDA+A T GPGLVGALL+G + ++ + A +P I V H+ GH++A Sbjct: 66 EALAEANATWDDIDAIAVTYGPGLVGALLIGVSAAKAASMATGIPLIGVDHIMGHIMAAQ 125 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+D E+P +AL VSGGHT+++ + +E++G++ DDAAGEA+DK ++LG++YP G Sbjct: 126 LKDE-IEYPALALQVSGGHTEIVLMKDPIHFEIVGDTRDDAAGEAYDKIGRVLGVNYPAG 184 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARA 238 + + A +G F FPR M + DFS SGLK+ NT D + + D+A + Sbjct: 185 KTIDEWAHKGK-DTFHFPRAMMEDDDYDFSLSGLKSAFINTCHHADQIHEKLDKYDLAAS 243 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR----RGEVFYAR 294 F+ +VVD L K RA+ + K ++ GGV+AN LR +LAE ++K + +V Sbjct: 244 FQASVVDVLSHKTIRAIKEYKPKTFILGGGVAANHGLRDRLAEEIEKLPADIKPKVILPD 303 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 + C DN AMI A +KAG +D ++ P ELP A Sbjct: 304 LKLCGDNAAMIGAAAYNLYKAGKFSDENLNADPSL---ELPYA 343 >UniRef50_Q2SR45 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Mollicutes RepID=GCP_MYCCT Length = 319 Score = 234 bits (596), Expect = 5e-60, Method: Compositional matrix adjust. Identities = 119/309 (38%), Positives = 189/309 (61%), Gaps = 6/309 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L IE+SCDE I+I D+ K +L N + SQ+K H +GGVVPELA+R HV+ +++ Sbjct: 1 MKILAIESSCDEFSISIIDNNK-ILTNIISSQIKDHQVFGGVVPELAARLHVQNFNWVLK 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AAL +S L ++ID +AYT PGL+G+L++G V +++ + P + + H++GH+ Sbjct: 60 AALSQSNLNIEEIDYIAYTKSPGLIGSLIIGKLVAETISLYINKPILALDHIQGHIFGAS 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E N +P +A++VSGGHTQ+ + ++++G + DDA GE +DK A++LGL YPGG Sbjct: 120 IE-NEFIYPVLAMVVSGGHTQIEIINSANDFQIIGSTRDDAIGECYDKVARVLGLSYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARA 238 P+L K+A +G + P + D DFS+SGLKT N I + Q + A + Sbjct: 179 PILDKLALKGNKDFYSLP-VLKDDNTYDFSYSGLKTACINLIHNLNQKKQEINLENFAAS 237 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR-GEVFYARPEF 297 F+ + + K ++A+ + K L +AGGVSAN +R + ++ +K F + + Sbjct: 238 FQYTATNIIEKKLEKAIKEFKPKTLTVAGGVSANSEIRKIILKLGQKYNIKNTFVPKMSY 297 Query: 298 CTDNGAMIA 306 CTDN AMIA Sbjct: 298 CTDNAAMIA 306 >UniRef50_Q04RH4 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Leptospira RepID=GCP_LEPBJ Length = 338 Score = 233 bits (593), Expect = 1e-59, Method: Compositional matrix adjust. Identities = 128/330 (38%), Positives = 195/330 (59%), Gaps = 8/330 (2%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 +GIETSCDET I I D K LL+ ++SQ+ LH YGG+VPE+ASR H+ K L++ + Sbjct: 4 MGIETSCDETSIGIIRDGKELLSLGIFSQIDLHKPYGGIVPEIASRAHLEKINLLLEETM 63 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 +E+ + +D+ VA T+ PGL G+L+VGA + R + ++ P +PV H++ H LE Sbjct: 64 EEAKIRFEDLSYVAVTSSPGLTGSLMVGAQMARCINMVYETPILPVCHLQSHFAVLHLEG 123 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 P EFP + LL+SGG++ + + G+ EL+G+++DDA GEAFDK A LL L YPGGP + Sbjct: 124 VPTEFPVLGLLLSGGNSAVYILQEFGRMELVGDTMDDALGEAFDKVAGLLDLPYPGGPHI 183 Query: 184 SKMAAQGTAG---RFVFPRPMTDRPG--LDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 A + + + P + + P + FSFSGLKT A + + ++ I Sbjct: 184 EAKANEYIPTPDEKPILPLLLRNLPQGEVSFSFSGLKT--AVMVLLEKQKEVSKEQICWN 241 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE-F 297 F+++ D + KRA+ +TG +++ AGGV AN TL+ +L K E+F + + + Sbjct: 242 FQNSAFDLVERNLKRAVAKTGIRKVFAAGGVLANTTLQKRLEVWAGKNSVELFTPKKKIY 301 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 CTDNGAM+A G F+ G + +V P Sbjct: 302 CTDNGAMVASLGYHLFRKGYKKGVDFTVNP 331 >UniRef50_C7M316 Metalloendopeptidase, glycoprotease family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M316_ACIFD Length = 347 Score = 232 bits (591), Expect = 2e-59, Method: Compositional matrix adjust. Identities = 144/329 (43%), Positives = 198/329 (60%), Gaps = 5/329 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IETSCD+T +A+ + + AN + SQ LHA +GGVVPE+A+R H V +++ A Sbjct: 18 VLAIETSCDDTAVAVVAGGR-VAANVVRSQAALHAPFGGVVPEVAARAHDAAMVEVVEEA 76 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ESG+ A +++A+A T GPGL G+L+VG LA D P I V HMEGHL A +E Sbjct: 77 LAESGIDAHEVEAIAVTKGPGLPGSLVVGVGAALGLAVGLDRPLIGVDHMEGHLYAATIE 136 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P P ++LLVSGGH++L+ + +Y LLG + DDAAGEAFDK A++LGL +PGGP Sbjct: 137 -GPVALPALSLLVSGGHSELVVIEAPFRYRLLGRTRDDAAGEAFDKVARILGLGFPGGPA 195 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 + A G F R + ++ G D SFSG+KT A + G AD+A +F++A Sbjct: 196 IEAAARDGRPDAIRFARALRNQ-GFDLSFSGIKTEVARYL--EGARAAEVADVAASFQEA 252 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 VVD L+ K +RAL+ + +V+ GGV+AN LR ++AE+ + R C DN Sbjct: 253 VVDVLVAKLERALESERVETVVIGGGVAANGPLRERVAELARARGVGAHIPARSLCADNA 312 Query: 303 AMIAYAGMVRFKAGATADLGVSVRPRWPL 331 AMIA AG R AG A G+ + P L Sbjct: 313 AMIAAAGAARLVAGEHAVDGLDIEPTRSL 341 >UniRef50_B1XJF0 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Synechococcus sp. PCC 7002 RepID=GCP_SYNP2 Length = 355 Score = 230 bits (587), Expect = 4e-59, Method: Compositional matrix adjust. Identities = 147/339 (43%), Positives = 201/339 (59%), Gaps = 8/339 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IETSCDET +AI ++ K +L N + SQ+ +H ++GGVVPE+ASR H+ I A Sbjct: 4 VLAIETSCDETAVAIVNNRK-VLGNVVASQIDIHREFGGVVPEVASRHHLESINACIDTA 62 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++SGL+ +I+A+A T PGLVGALL+GA G++LA + P I VHH+EGH+ A L Sbjct: 63 FEQSGLSWSEIEAIATTCAPGLVGALLLGAAAGKTLAMIHNKPFIGVHHLEGHIYASYLS 122 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 E PF+ LLVSGGHT I V G G+Y+LLGE+ DDAAGEAFDK A+LL + YPGGP+ Sbjct: 123 QPELEPPFLCLLVSGGHTSFIEVRGCGEYKLLGETRDDAAGEAFDKVARLLRVGYPGGPV 182 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPG-----LDFSFSGLKTFAANTIRDNGTDDQT--RADI 235 + ++A G F P PG D SFSGLKT ++ T + ADI Sbjct: 183 IDRLAKTGDPQAFKLPEGRISLPGGGYHPYDCSFSGLKTAVLRLVQQFETQGKAVPVADI 242 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A +F+ V L + R + +V+ GGV+AN LR L + +V++ Sbjct: 243 AASFQYTVAQALTKRAVRCAGDRQLQTIVVGGGVAANSGLRQILTAAAAEAGIQVYFPPL 302 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 +FCTDN AMIA A F+ G + L + V R P+ ++ Sbjct: 303 KFCTDNAAMIACAAAEHFQKGDRSRLDLPVASRLPITQV 341 >UniRef50_D2L1E2 Metalloendopeptidase, glycoprotease family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L1E2_9DELT Length = 371 Score = 230 bits (586), Expect = 7e-59, Method: Compositional matrix adjust. Identities = 155/349 (44%), Positives = 200/349 (57%), Gaps = 25/349 (7%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A+ DD + LL +L SQ+KLHA +GGVVPELASR+H+R+ PL+ Sbjct: 1 MLCLGIETSCDETAVALCDDGRPLL-EKLASQIKLHALFGGVVPELASREHLRRMGPLLD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +E+GL D+DAVA GPGL+G+LL+G V + LA A P I V H+ HLLA Sbjct: 60 ALFREAGLGLADVDAVAVARGPGLLGSLLIGLAVAKGLALAAGKPLIGVDHLHAHLLAAT 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L +P + LLVSGGHTQ++ + +LG ++DDAAGEAFDK AK L L YPGG Sbjct: 120 LGRE-VAYPALGLLVSGGHTQIVLLRSPLDLTVLGRTVDDAAGEAFDKAAKSLNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR-------------DNGT 227 + ++ A R +FPRP + LDFSFSGLKT A + D Sbjct: 179 VFVDRLGAGIEPDRALFPRPNLENTHLDFSFSGLKTAVATHVARHPGLRLAVMPAPDGPV 238 Query: 228 D------DQTRADIARAFEDAVVDTLMIKCKRALDQTGFK--RLVMAGGVSANRTLRAKL 279 D D R + + AV DTL +K +RALD ++ AGGV+AN +RA L Sbjct: 239 DAAAWPLDLRR--VCSSLNFAVADTLRVKMERALDGLDVPAVSILAAGGVAANSRIRAML 296 Query: 280 AEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +R F P C DN MIA AG + +AG T DL + PR Sbjct: 297 EALGARRGLPCFLPEPALCADNATMIAAAGCLLGRAGLTHDLALDAVPR 345 >UniRef50_Q0SM86 Probable O-sialoglycoprotein endopeptidase n=18 Tax=Borrelia burgdorferi group RepID=GCP_BORAP Length = 346 Score = 228 bits (582), Expect = 2e-58, Method: Compositional matrix adjust. Identities = 123/314 (39%), Positives = 187/314 (59%), Gaps = 10/314 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIETSCD+ +A+ ++ +L+N SQ K H Y GVVPE+ASR H + + Sbjct: 1 MKVLGIETSCDDCCVAVVENGIHILSNIKLSQ-KEHEKYYGVVPEIASRLHTEAIMSVCI 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ +ID +A T+ PGL+G+L+VG + LA + P I + H+ GHL AP+ Sbjct: 60 KALKKANTKISEIDLIAVTSRPGLIGSLIVGLNFAKGLAISLKKPIICIDHILGHLYAPL 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + E+PF++LL+SGGHT + E+LG ++DD+ GEAFDK AK + +PGG Sbjct: 120 MH-SKIEYPFISLLLSGGHTLIAKQKNFDDVEILGRTLDDSCGEAFDKVAKHYDIGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG--LDFSFSGLKTFAANTIR--DNGTDDQTRADIA 236 P + +++ G F FP + DFS+SGLKT + + N + T+ +IA Sbjct: 179 PNIEQISKNGDENTFKFPVTTFRKKENWYDFSYSGLKTACIHQLEKFKNKDNPTTKNNIA 238 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ A + L+ KRA+ T K+LV+AGGV++N LR K+ ++ + + +Y + Sbjct: 239 ASFQKAAFENLITPLKRAIKDTQIKKLVIAGGVASNLYLREKIDKL----KIQTYYPPLD 294 Query: 297 FCTDNGAMIAYAGM 310 CTDNGAMIA G Sbjct: 295 LCTDNGAMIAGLGF 308 >UniRef50_Q1IZH8 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Deinococci RepID=GCP_DEIGD Length = 333 Score = 228 bits (580), Expect = 3e-58, Method: Compositional matrix adjust. Identities = 134/282 (47%), Positives = 177/282 (62%), Gaps = 10/282 (3%) Query: 3 VLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +LGI+TSCD+TG+ + D + AN+++SQ +HA YGGV+PELASR+HV + + Sbjct: 7 ILGIDTSCDDTGVGVVELAPDGSVQVRANRVWSQT-VHAQYGGVLPELASREHVERIDTV 65 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 AL E+GLT D+ AVA T+GPGLVGALLVG G+ LA A +VP HH+EGH+ A Sbjct: 66 TGDALAEAGLTVGDLAAVAATSGPGLVGALLVGLMYGKGLAQALNVPFYAAHHLEGHIFA 125 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 E + + P++AL+VSGGHT L V G+Y L+G + DDAAGEAFDK A+L GL YP Sbjct: 126 AASEADL-QAPYLALVVSGGHTHLFDVPREGEYVLVGATRDDAAGEAFDKVARLAGLGYP 184 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP +S+ A +G F P+ + G DFSFSGLKT A R + D+A Sbjct: 185 GGPAISEAARRGDPEAVPFKEPLQGQKGFDFSFSGLKTAALLAHRAGAKPE----DLAAG 240 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLA 280 FE A V L+ RA G + +V++GGV+ANR LR A Sbjct: 241 FERAAVRFLVGTTLRAARAYGRETVVVSGGVAANRALREAFA 282 >UniRef50_A4RXP4 Predicted protein n=6 Tax=Eukaryota RepID=A4RXP4_OSTLU Length = 492 Score = 228 bits (580), Expect = 3e-58, Method: Compositional matrix adjust. Identities = 144/367 (39%), Positives = 203/367 (55%), Gaps = 36/367 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+T A+ + +L + SQ +H +GGVVP LA H +++ A Sbjct: 82 VLGIETSCDDTAAAVVRGDGVVLGEAIASQAAIHGPWGGVVPNLARAAHEEVIDDVVRRA 141 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L E+G++A D+ AVA T GPGL L VG + ++ + +P PVHH+E H L L Sbjct: 142 LTEAGVSAADLSAVAVTCGPGLSMCLRVGVRKAQRMSAEYGIPIAPVHHVEAHALVSRLC 201 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK--LLGLDYP 178 +FPF+ALLVSGGH LI G+G Y +LG ++DDA GEA+DKTA+ L + Sbjct: 202 AGTETVKFPFLALLVSGGHNLLIKARGVGDYTILGTTLDDALGEAYDKTARLLGLPVGGG 261 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFA---------ANTIRDNGTDD 229 GGP L K+A +G RF FP P+ R DFS++GLKT A + +G D Sbjct: 262 GGPALEKLALEGDEKRFKFPVPLRQRKNCDFSYAGLKTAARMAIDAEIGGEDVEWDGVDK 321 Query: 230 -QTRADIARAFEDAVVDTLMIKCKRAL-----DQTGFKRLVMAGGVSANRTLRAKLAEMM 283 QTRADIA +F+ V L + +RAL D +V+AGGV+AN T+R+ L +++ Sbjct: 322 RQTRADIAASFQAKAVKHLEERMRRALTWALEDTPDLSCVVVAGGVAANATVRSTLVKVV 381 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATA-----------------DLGVSVR 326 ++ + + P++CTDNG M+A+ G R G D+ V++ Sbjct: 382 EETGLPLVFPPPKWCTDNGVMVAWTGCERLALGLAEAPVDAELEAKHAMMDPRDVHVNLL 441 Query: 327 PRWPLAE 333 PRWPL E Sbjct: 442 PRWPLGE 448 >UniRef50_B5RQA5 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Borrelia RepID=GCP_BORRA Length = 338 Score = 227 bits (579), Expect = 4e-58, Method: Compositional matrix adjust. Identities = 129/324 (39%), Positives = 192/324 (59%), Gaps = 11/324 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIE+SCD+ AI ++ +L+N SQ K H Y G+VPE+ASR H + + Q Sbjct: 1 MKVLGIESSCDDCCAAIVENGNTILSNIKLSQ-KEHKKYYGIVPEIASRLHTEFIMYVCQ 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ + + +ID +A T+ PGL+G+L+VG + L+ A P I + H+ GHL AP+ Sbjct: 60 QAIISAKINISEIDLIAVTSQPGLIGSLIVGVNFAKGLSIALKKPLICIDHILGHLYAPL 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L ++ E+PF++L++SGGHT L E+LG ++DDA GEAFDK AK + +PGG Sbjct: 120 L-NHTIEYPFLSLVLSGGHTILAKQNNFDDIEILGRTLDDACGEAFDKIAKHYKMGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG--LDFSFSGLKTFAANTIR--DNGTDDQTRADIA 236 P + K+A G F FP + D+ DFS+SGLKT + + N T +IA Sbjct: 179 PNIEKLAIDGNQYAFNFPITIFDKKENRYDFSYSGLKTACIHQLEKFKNNNAQITNNNIA 238 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ A + L+I KRA+ T K+L+++GGV++N LR K+ K E +Y + Sbjct: 239 ASFQRAAFENLIIPIKRAIKDTNIKKLIISGGVASNLYLREKI----KNLEIETYYPPID 294 Query: 297 FCTDNGAMIAYAGMVRF-KAGATA 319 CTDN AMIA G + + K GA++ Sbjct: 295 LCTDNAAMIAGIGYLMYLKYGASS 318 >UniRef50_A1R8N0 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Bacteria RepID=GCP_ARTAT Length = 368 Score = 226 bits (575), Expect = 1e-57, Method: Compositional matrix adjust. Identities = 145/359 (40%), Positives = 200/359 (55%), Gaps = 38/359 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDETG+ I LL N + S + H +GGV+PE+ASR H+ VP +Q + Sbjct: 1 MLGIESSCDETGVGIVRGTT-LLTNTVSSSMDEHVRFGGVIPEIASRAHLDAFVPTLQES 59 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+G+T +DIDA+A T+GPGL GAL+VG ++LA A P ++H+ H+ +L+ Sbjct: 60 LQEAGVTLEDIDAIAVTSGPGLAGALMVGVCAAKALAVATGKPLYAINHLVAHVGVGLLD 119 Query: 123 DNP-------------------PEFPFVALLVSGGHTQLISVTGI-GQYELLGESIDDAA 162 N PE ALLVSGGHT+++ + I ELLG +IDDAA Sbjct: 120 GNRVSEGKHDAVAAAGLGAGKLPEN-LGALLVSGGHTEILRIRSITDDVELLGSTIDDAA 178 Query: 163 GEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGL-----------DFSF 211 GEA+DK A++LGL YPGGP + K+A QG FPR +T + D+SF Sbjct: 179 GEAYDKVARILGLGYPGGPAIDKLAHQGNPKSIRFPRGLTQPKYMGTAEEKGPHRYDWSF 238 Query: 212 SGLKTFAANTIR--DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGV 269 SGLKT A + + ++ ADIA AF++AVVD + K A + G +++ GGV Sbjct: 239 SGLKTAVARCVEQFEARGEEVPVADIAAAFQEAVVDVISSKAVLACKEHGITDVLLGGGV 298 Query: 270 SANRTLRAKLAEMMKKRRGEVFYARP-EFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +AN LR +L G + P CTDNGAM+A G AG + GVS P Sbjct: 299 AANSRLR-ELTGQRCASAGITLHVPPLGLCTDNGAMVAALGAQLIMAGISPS-GVSFAP 355 >UniRef50_C8WN77 Metalloendopeptidase, glycoprotease family n=3 Tax=Bacteria RepID=C8WN77_EGGLE Length = 891 Score = 224 bits (571), Expect = 4e-57, Method: Compositional matrix adjust. Identities = 136/325 (41%), Positives = 184/325 (56%), Gaps = 13/325 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVR-------KT 55 +L IE+SCDET AI D L+A+ + SQ+ HA +GGVVPE+ASR H+ + Sbjct: 550 ILAIESSCDETAAAIVDGNGTLIADVVASQIDFHARFGGVVPEIASRKHIEAICGVCDEC 609 Query: 56 VPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 + +AL LT +D+D++A T PGLVGAL+VG + A+A P I V+H+EGH Sbjct: 610 FDVAASALGIERLTWRDLDSIAVTYAPGLVGALVVGVAFAKGAAWAAGKPFIGVNHLEGH 669 Query: 116 LLAPMLEDNPPEF--PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 L A + P+F P V LVSGG+T L+ + G G YE LG +IDDA GEAFDK AK L Sbjct: 670 LYANKI--GAPDFQPPAVVSLVSGGNTLLVHMKGWGDYETLGATIDDAVGEAFDKVAKAL 727 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN--GTDDQT 231 GL YPGGP++S+ AA+G FPR M L FS SGLKT I + + Sbjct: 728 GLGYPGGPVISREAAKGDPNAIPFPRAMMHSGDLRFSLSGLKTAVVTYINNERAAGRELN 787 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 +I +F+ AVVD + K + AL+QTG + + GGV+AN LR ++ ++ + Sbjct: 788 VPNICASFQQAVVDVQVKKAEMALEQTGARTFCLGGGVAANPALRDAYEQLCERLHVRLT 847 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAG 316 C DN MIA + R G Sbjct: 848 LPPLSACGDNAGMIALVALDRHNQG 872 >UniRef50_C0QY51 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Brachyspira RepID=GCP_BRAHW Length = 340 Score = 223 bits (569), Expect = 5e-57, Method: Compositional matrix adjust. Identities = 120/311 (38%), Positives = 193/311 (62%), Gaps = 11/311 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGI+TSCD+T AI +D K +L++ L S + H ++ GVVPE+A+R H+ + +I Sbjct: 1 MKILGIDTSCDDTSAAIVEDGKNVLSSVLSSSIDAHKEFQGVVPEIAARKHLEAILYVID 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ T DID A T PGL+G+LLVG +SLAF+ + P + + H+ H+ +P Sbjct: 61 KALKDANTTLDDIDLFAVTNRPGLLGSLLVGVASAKSLAFSLNKPLLALDHIAAHIYSPH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L N EFP++AL+VSGGHT + V G+Y+++G ++DDA GEA+DK +K L L YPGG Sbjct: 121 LT-NDIEFPYIALVVSGGHTIITEVHDYGEYKVVGTTLDDAVGEAYDKVSKFLNLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLD---FSFSGLKTFAANTIRDNGTD--DQTRADI 235 P++ ++A +G +P + + G+D FS+SGLKT + + + + T +I Sbjct: 180 PIIDRLAKEGNKEAIKYPIVLLN--GIDEFNFSYSGLKTACVYSTKKYLKEGYEATNENI 237 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A AF+ + ++ L IK + +++G KR+ ++GGV+ N LR + + E + Sbjct: 238 AAAFQISAIEPLYIKTLKYAEKSGIKRVTLSGGVACNSYLRDRFGN---SKDFECYLPAL 294 Query: 296 EFCTDNGAMIA 306 ++ TDN AM+A Sbjct: 295 KYTTDNAAMVA 305 >UniRef50_D0JBS4 Glycoprotease M22 family domain-containing protein n=2 Tax=Blattabacterium RepID=D0JBS4_BLASB Length = 327 Score = 223 bits (569), Expect = 6e-57, Method: Compositional matrix adjust. Identities = 109/303 (35%), Positives = 184/303 (60%), Gaps = 15/303 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCD+T ++I + + +L+N + Q ++H YGGVVPELASR H + P + A Sbjct: 20 ILGIESSCDDTAVSIIKN-RDVLSNIIIHQ-EIHKQYGGVVPELASRLHDQNMTPAVNQA 77 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + + + +IDAV++T GPGL+G+LLVGA+ +S + ++P + V+H++ H+L ++ Sbjct: 78 IHSAKIKKNEIDAVSFTLGPGLIGSLLVGASFAKSFSMGLEIPLLTVNHVQAHILTHFIK 137 Query: 123 D-----NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 + + P+FPF+ L++SGGHTQ++ V + E+LG ++DD+ G+ FDK A+LLG Y Sbjct: 138 NANMNNSYPKFPFLGLVISGGHTQIVKVNDFFKMEILGSTLDDSIGDTFDKIARLLGFHY 197 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-----QTR 232 PGGP++ + G +F F +P + L+FSFSG K+ I+ + Q Sbjct: 198 PGGPMIELFSKNGNCKKFGFSKPSVN--DLNFSFSGFKSHVLQFIKKKSKKNPLFIKQNL 255 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK-RRGEVF 291 +DI + + + + L+ K ++A T R+ +AGGVSAN +R K+ ++ E+F Sbjct: 256 SDICASIQRIIAEILLEKVEKATLITDIFRVALAGGVSANCEIRRMFISFAKRNKKWEIF 315 Query: 292 YAR 294 + Sbjct: 316 IPK 318 >UniRef50_Q254Q0 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Chlamydiaceae RepID=GCP_CHLFF Length = 344 Score = 220 bits (561), Expect = 5e-56, Method: Compositional matrix adjust. Identities = 125/339 (36%), Positives = 187/339 (55%), Gaps = 16/339 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG+E+SCDET A+ D + ++AN + SQ + HA YGGVVPELASR H++ ++ Sbjct: 1 MLTLGLESSCDETACALVDADAQIVANVVSSQ-QYHASYGGVVPELASRAHLQMLPSVVN 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +AL++SG++ DID +A T PGL+G+L VG + LA P I V+H+E HL A Sbjct: 60 SALEKSGVSLDDIDLIAVTHTPGLIGSLAVGVNFAKGLAIGSQKPMIGVNHVEAHLYAAY 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E EFP + L++SG HT + + Y+L+G + DDA GE FDK + LGL YPGG Sbjct: 120 MEAKNVEFPALGLVMSGAHTSMFLMEDPLSYKLIGNTRDDAIGETFDKVGRFLGLPYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT--------- 231 L+ MA+QG +P PG D SFSGLKT I+ N ++ ++ Sbjct: 180 ALIEMMASQGCEES--YPFSAAKVPGYDLSFSGLKTAVLYAIKGNNSNSRSPLPDLSQKE 237 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + +IA +F+ A T+ K + + + +++ GGV+ N+ + L + ++ Sbjct: 238 KNNIAASFQKAAFMTIAQKLPKIIKNFSCRSILVGGGVANNKYFQTLLQNTLNL---PLY 294 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRW 329 + + CTDN AMIA G F + T + R RW Sbjct: 295 FPSSKLCTDNAAMIAGLGRELFLSRKTTQGITPCARYRW 333 >UniRef50_A3EUW9 O-sialoglycoprotein endopeptidase n=3 Tax=Leptospirillum RepID=A3EUW9_9BACT Length = 345 Score = 220 bits (560), Expect = 6e-56, Method: Compositional matrix adjust. Identities = 125/317 (39%), Positives = 188/317 (59%), Gaps = 5/317 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T +A+ D +L +Q++SQ LH YGGVVPE+ASR HV L+++A Sbjct: 2 ILGIETSCDDTSVALVDMTGAILFHQIHSQESLHGTYGGVVPEVASRAHVEVLPSLVRSA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++GL+ + +A T GPGL+G+LL G + + + A+ +P I V H++ HL A + Sbjct: 62 FLDTGLSPSQLQGIAVTRGPGLLGSLLTGISFAKGIGSAFRLPLIGVDHVQAHLRACVDS 121 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + L++SGGHT L + EL+ +++DDAAGEAFDK AKLLGL YPGGP Sbjct: 122 MESLRGKTIGLVISGGHTHLFRIENWPTMELVSQTVDDAAGEAFDKGAKLLGLPYPGGPS 181 Query: 183 LSKMAAQGTAGRFVF--PRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 + K A + T R T+ P LDFSFSGLKT + +R +++TR +A + + Sbjct: 182 IQKEAEKNTLPLLPLTKKRIRTENP-LDFSFSGLKTAFSLLVRKTELNERTRPLLAASLQ 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP-EFCT 299 A+V+ ++ + ++ + Q L++ GGVSAN LR KL ++ +++G + P Sbjct: 241 HAIVEHVLDRIEQTVIQESPSHLLVGGGVSANALLRKKL-QVFSEQQGMTLHLSPLSLAR 299 Query: 300 DNGAMIAYAGMVRFKAG 316 DN MIA G F +G Sbjct: 300 DNALMIARHGRELFLSG 316 >UniRef50_A9WHP1 Metalloendopeptidase, glycoprotease family n=4 Tax=Chloroflexi (class) RepID=A9WHP1_CHLAA Length = 355 Score = 218 bits (555), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 149/352 (42%), Positives = 197/352 (55%), Gaps = 29/352 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +ETSCDET A+ + +L+N + SQ+ H YGGVVPE+ASR H+ P+++AA Sbjct: 10 ILALETSCDETAAAVVRGGRTVLSNVVASQMATHERYGGVVPEIASRQHILSLAPVVRAA 69 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L D+ AVA T GPGL GALL G +++A+ +P + V+H+E HL A L Sbjct: 70 LAVLPNGWADVHAVAATHGPGLSGALLTGLNAAKAMAWRRGLPFVAVNHLEAHLYAGWLG 129 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 D PP FP VALLVSGGHT L+ + G Y+LLG++ DDAAGEAFDK A++LGL YPGGP Sbjct: 130 SDPPPPFPLVALLVSGGHTLLVLLRDHGNYQLLGQTRDDAAGEAFDKVARILGLGYPGGP 189 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR--------- 232 + AA T G V PR R DFSFSGLKT + ++D Q+R Sbjct: 190 AIQAAAANATPGG-VLPRAWL-RDSYDFSFSGLKTAVLHRVQDR-LAQQSRLSGRKGAGE 246 Query: 233 ---------ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM 283 A +A AF+++VVD L+ K A + + +++AGGV+ANR Sbjct: 247 TPQLDAPFVAQMAYAFQESVVDVLVTKTVDAARRYQAQAILLAGGVAANRR-----LREE 301 Query: 284 KKRRGEVFYARPEF--CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 RR V P F CTDN AM+A A RF +G V V PL + Sbjct: 302 LIRRASVPVHLPAFDLCTDNAAMVAAAAFYRFHSGVQYGWDVDVTANLPLEQ 353 >UniRef50_C4XSD3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Desulfovibrio RepID=GCP_DESMR Length = 371 Score = 216 bits (550), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 147/347 (42%), Positives = 198/347 (57%), Gaps = 21/347 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A++++ + +L +L SQ LHA +GGVVPELASR+H+R+ PL+Q Sbjct: 1 MLCLGIETSCDETAVALFENGRPVL-EKLASQADLHAVFGGVVPELASREHLRRLGPLLQ 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A SG + D+DA+A GPGL+G+LLVG + L+ A P I V H+ HLLA Sbjct: 60 ALFAASGRSLADVDAIAVARGPGLLGSLLVGLAAAKGLSLATGKPLIGVDHLHAHLLAAT 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + FP + LLVSGGHTQ++ + E+LG ++DDAAGEAFDK AK L YPGG Sbjct: 120 IGRD-VAFPALGLLVSGGHTQIVRLESALSLEVLGRTLDDAAGEAFDKAAKSFNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTF------------AANTIRDNGTD 228 + + + +FPRP D DFSFSGLKT A + G Sbjct: 179 VYIDVLGRGIAPDKTLFPRPFLDNDHFDFSFSGLKTAVASYAAAHPELRAGSLAEAGGAI 238 Query: 229 DQTRADIA-----RAFEDAVVDTLMIKCKRALD-QTG-FKRLVMAGGVSANRTLRAKLAE 281 D +A + A+ +TL IK +RALD Q G L+ AGGV+AN +RA LA+ Sbjct: 239 DPEAWPMALRRACSSLNFAIAETLRIKFERALDRQPGPPASLIAAGGVAANGPIRAMLAD 298 Query: 282 MMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +R ++ P C DN MIA AG +AG DL ++ PR Sbjct: 299 LAARRGLPLYLPEPALCADNAVMIAAAGSRLAEAGYAHDLALTAVPR 345 >UniRef50_Q1IUF1 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Acidobacteria RepID=GCP_ACIBL Length = 381 Score = 216 bits (549), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 145/368 (39%), Positives = 200/368 (54%), Gaps = 56/368 (15%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDET A+ + +L++ ++SQ+ H YGGVVPELASR+H++ VP+++ A Sbjct: 6 ILGIESSCDETAAAVIRNGAEILSSVVFSQIYTHMRYGGVVPELASREHLKAIVPVVRQA 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++++G + IDA+A T GPGL GALLVG + ++L+FA D P I V+H+EGH+ +LE Sbjct: 66 VEDAGQSYDKIDAIAVTRGPGLAGALLVGVSYAKALSFALDKPLIGVNHLEGHIHVVLLE 125 Query: 123 DNPP-----EFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKTAKLLG 174 +FP +AL+VSGGHT L Y +G + DDAAGEA+DK AKLLG Sbjct: 126 QKQQGVGEIQFPVLALVVSGGHTHLYLAEKKDAGWTYRDVGHTRDDAAGEAYDKVAKLLG 185 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFP-------------RPMTDRPGLDFSFSGLKT----- 216 L YPGGP+L +A G FP R D +DFS+SG+KT Sbjct: 186 LGYPGGPILDGLAKHGDPRAVRFPFAQIKHRDRNPQNRHEDDDARVDFSYSGIKTAVLRY 245 Query: 217 --------------FAANTIRDNGTDDQTRA------DIARAFEDAVVDTLMIKCKRALD 256 A I DD R D+ +F+ AVV+ L+ K A Sbjct: 246 VETHEMKAAIEARRTALKEIEKPSQDDYLRVCDRQTLDLIASFQRAVVNDLVSKALHAAA 305 Query: 257 QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV-----FYARPEFCTDNGAMIAYAGMV 311 + L++ GGV+AN LR E ++R GE+ F +RP TDN AMIA A Sbjct: 306 ENNAATLLVTGGVAANSELR----ETFERRAGELGLPVYFPSRP-LSTDNAAMIAAAAYP 360 Query: 312 RFKAGATA 319 RF +G A Sbjct: 361 RFLSGEFA 368 >UniRef50_Q0BPC9 Probable O-sialoglycoprotein endopeptidase n=14 Tax=Alphaproteobacteria RepID=GCP_GRABC Length = 370 Score = 216 bits (549), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 154/344 (44%), Positives = 205/344 (59%), Gaps = 15/344 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCDET A+ D +LA + SQ HA +GGVVPE+A+R H+ ++ Sbjct: 14 VLGIETSCDETAAAVLDGSGRILAEIVLSQYDDHARFGGVVPEIAARAHLAYLPGMVTEV 73 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + ++GL +D+ A+A T+GPGL+G LLVGA +G+ LA A P I ++H+E H LA +L Sbjct: 74 MDKAGLRFQDLAAIAATSGPGLIGGLLVGAGLGKGLALAAKRPFIAINHLEAHALAALLP 133 Query: 123 --------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 + FPF+ +L+SGGH Q I V G+G+Y LG +IDDA GEAFDK KLLG Sbjct: 134 ALGGVAEITSGEHFPFLLMLLSGGHCQCILVEGVGRYRRLGGTIDDAVGEAFDKVGKLLG 193 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR---DNGTDDQT 231 L +PGGP L ++A QG FPRPM R G DFSFSGLKT A + D Sbjct: 194 LGWPGGPALERLALQGNPHALAFPRPMKGRVGCDFSFSGLKTAVAQYVARFPDGPLPLSD 253 Query: 232 RADIARAFEDAVVDTLMIKCKRAL----DQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 ADIA +F+ AV D + + AL + K LV++GGV+AN +RA L+ + R Sbjct: 254 AADIAASFQAAVADVMADRATAALAMADEIAPAKMLVVSGGVAANAAIRAALSTAAEHRG 313 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 + P CTDN M+A+AG+ R K GA + L + PRWPL Sbjct: 314 IAMLAPPPRLCTDNAVMVAWAGLHRLKYGAVSGLDHAPLPRWPL 357 >UniRef50_A6Q6J3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Epsilonproteobacteria RepID=GCP_SULNB Length = 337 Score = 214 bits (545), Expect = 3e-54, Method: Compositional matrix adjust. Identities = 120/308 (38%), Positives = 180/308 (58%), Gaps = 10/308 (3%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ IA+ + K +L ++ SQ H+ YGGVVPELASR H +P I Sbjct: 2 ILSIESSCDDSSIAVTETSTKKILYHKKISQEAEHSCYGGVVPELASRLHAV-ALPKI-- 58 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E+ + AVA T PGL LL G + +++A ++P IPVHH++GH+ + + Sbjct: 59 -LEETKPWFDKLKAVAVTNQPGLGVTLLEGIAMAKTVAVLQNIPLIPVHHLKGHIYSLFI 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E FP + LL+SGGHTQ+I V E+L S+DD+ GE+FDK AK++ L YPGGP Sbjct: 118 EKKTL-FPLLVLLISGGHTQIIRVKDFEHMEILATSMDDSVGESFDKCAKMMHLGYPGGP 176 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG----TDDQTRADIAR 237 L+ +A +G RF P P+ + P + FS SGLK T+ G +Q AD++ Sbjct: 177 LIEALALKGDENRFDLPVPLRNSPLIAFSLSGLKNAVRLTVEKLGGAEKMTEQDEADLSA 236 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 +F+ AV L+ K K+ + + + GG SAN+ LR A++ ++ R + A ++ Sbjct: 237 SFQKAVKLHLLQKSKKIFAKEPIRDFAIVGGASANQYLRGAYADLCREFRKTMHVAPLQY 296 Query: 298 CTDNGAMI 305 C+DN AMI Sbjct: 297 CSDNAAMI 304 >UniRef50_Q4A734 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma synoviae 53 RepID=GCP_MYCS5 Length = 307 Score = 214 bits (545), Expect = 4e-54, Method: Compositional matrix adjust. Identities = 115/317 (36%), Positives = 186/317 (58%), Gaps = 10/317 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETS D++ IAI +D K +L SQ+ + YGG +PE+ASR+HV K + ++Q Sbjct: 1 MIILGIETSHDDSSIAILEDGK-VLNMWSISQIDIFKKYGGTIPEIASREHV-KNIAILQ 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L+E + ID +AYT+ PGL+G L VG +L+ A + P I ++H++GH + Sbjct: 59 NFLQEF-IDLNKIDHIAYTSEPGLIGCLQVGFLFASALSIALNKPLIKINHLDGHFFSGA 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +++ ++P + L+VSGGH+Q+I ++++GE++DDA GE +DK + L L +PGG Sbjct: 118 IDNKEIKYPALGLIVSGGHSQIIYAKNKFDFQIVGETLDDAIGECYDKVSSRLNLGFPGG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ K+ A +P T DFSFSG+KT N N ++ IA +F+ Sbjct: 178 PIIDKIHASYKGKYLKLTKPKTSGE-FDFSFSGIKTQVLNAF--NNKKYESIEQIAASFQ 234 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + ++ L+ K K A+D+ + +++ GGVSAN+ LR K ++ K + ++ TD Sbjct: 235 EVAINYLIEKFKLAIDKFKPESILLGGGVSANKYLREKFKDLHK----NTIFPEIKYATD 290 Query: 301 NGAMIAYAGMVRFKAGA 317 NGAMIA +R K + Sbjct: 291 NGAMIAMCAYLRMKKNS 307 >UniRef50_C2KP25 O-sialoglycoprotein endopeptidase n=3 Tax=Mobiluncus RepID=C2KP25_9ACTO Length = 375 Score = 213 bits (543), Expect = 5e-54, Method: Compositional matrix adjust. Identities = 132/334 (39%), Positives = 190/334 (56%), Gaps = 32/334 (9%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIE++CDETG A+ + L+AN + + + +A YGG++PE+ASR H+ +P++ +AL Sbjct: 13 LGIESTCDETGAALVAGKTKLIANVVATSMDQYARYGGIIPEIASRAHLESFLPVVTSAL 72 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML-E 122 +++G+ +DID + + GPGL+G+L VG ++LA A P V+H+ GHL L Sbjct: 73 EQAGVKLEDIDRIGVSGGPGLIGSLAVGIAGAKALALALGKPLYGVNHVIGHLAVDQLAS 132 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGI---GQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + + P V L+VSGGHT L+ + G LG ++DDA+GEAFDK ++LGL YPG Sbjct: 133 EEMLKLPAVGLVVSGGHTNLLYIEDFAAPGGIRELGGTLDDASGEAFDKVGRILGLPYPG 192 Query: 180 GPLLSKMAAQGTAGRFVFPRPMT------DRPGLDFSFSGLKTFAANTIRDNGTDDQTR- 232 GP + +M+ QGT G FPR ++ P DFSFSGLKT A I + R Sbjct: 193 GPNVDRMSQQGTLGAIDFPRGLSGAKYAKSHP-YDFSFSGLKTAVARYIASLEASPEARS 251 Query: 233 --------------------ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSAN 272 ADI + F +++ D+L+ K +AL TG K LV+ GG SAN Sbjct: 252 HPEFTEDYQATREGKPWLPVADICKGFSESINDSLVSKTLKALQDTGAKTLVVGGGYSAN 311 Query: 273 RTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIA 306 LR+ LAE + + +FCTDNGA IA Sbjct: 312 SRLRSWLAEACPEIGVTLRIPPLKFCTDNGAQIA 345 >UniRef50_A7H0K1 Probable O-sialoglycoprotein endopeptidase n=26 Tax=Epsilonproteobacteria RepID=GCP_CAMC5 Length = 339 Score = 213 bits (541), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 131/342 (38%), Positives = 190/342 (55%), Gaps = 23/342 (6%) Query: 3 VLGIETSCDETGIAIYDDEK-GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCD++ +A+ D + LL ++ SQ H+ +GGVVPELA+R H R + A Sbjct: 2 ILGIESSCDDSSVALLDIKNLKLLYHKKISQESEHSPFGGVVPELAARLHTRA----LPA 57 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E KDI A+A T PGL +L+ G ++ ++L+ A +VP I V+H+ GH+ + L Sbjct: 58 LLEEIKPKFKDIKAIAVTNEPGLSVSLIGGVSMAKALSVALNVPLIAVNHLVGHIYSLFL 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 D FP LLVSGGHT ++ + G+ LL + DD+ GE+FDK AK++ L YPGG Sbjct: 118 -DCEARFPLGVLLVSGGHTMVLDIDAAGKISLLAGTSDDSFGESFDKVAKMMQLGYPGGA 176 Query: 182 LLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLK-------------TFAANTIRDNGT 227 + +A Q RF F P L++SFSGLK A T R+ Sbjct: 177 AVQNLAWQCKDKRRFKFTIPFLHDKRLEYSFSGLKNQVRLEIEKIKGQNLAGATDRELSN 236 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 DD ADI AFE+A + +M K + + FKR + GG SAN LR+++ + + Sbjct: 237 DDM--ADICYAFENAACEHIMDKLTKIFKERSFKRFGIVGGASANLNLRSRIERLCLENG 294 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRF-KAGATADLGVSVRPR 328 E+ A EFC+DN AMIA AG ++ K G +++ PR Sbjct: 295 CELLLAPLEFCSDNAAMIARAGREKYLKGGFVKHNELNINPR 336 >UniRef50_Q54EW4 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54EW4_DICDI Length = 468 Score = 212 bits (540), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 128/416 (30%), Positives = 200/416 (48%), Gaps = 87/416 (20%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 V+GIETSCD+T I I + E ++A Q LH + G+VP +A H + I+ Sbjct: 19 VIGIETSCDDTSIGIVNSEGKIMAEYSKPQWSLHKVHNGIVPSIAFEAHQNEIDNAIEKT 78 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ++G+T +DID +A T GPG+ +L VG + L + P V+HMEGH L +E Sbjct: 79 LDKAGMTMEDIDVIAVTTGPGMGKSLEVGLNKAKQLYREFKKPFCSVNHMEGHSLVVRME 138 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY----- 177 ++ EFPF+ +LVSGGH+Q++ + +Y+L+G ++DD+ GEA DK A++LG Y Sbjct: 139 NHSIEFPFLIVLVSGGHSQILICNDVSKYQLIGNTLDDSIGEALDKAARILGCPYGQVWD 198 Query: 178 --------PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG--- 226 GG + +A++G F PM D DFSFSG+K+ A +++ Sbjct: 199 GQSLIENIHGGQAIEILASKGDPNSHHFTLPMKDSNNCDFSFSGIKSSLARLVKEIKSKS 258 Query: 227 --------------------------------TDDQT-----RADIARAFEDAVVDTLMI 249 TD+ + ++A +F++ + L Sbjct: 259 SSSSSITNNTTTKTTTTTTTTTIITTETNNLITDENELSFVDKCNLAASFQNVAFNHLEH 318 Query: 250 KCKRALD--------------------QTG------------FKRLVMAGGVSANRTLRA 277 + K++LD ++G K +V++GGVS N LR Sbjct: 319 RIKKSLDWYYNFKTPKQKKNELLASKTKSGKPPAIEIIKREPLKGIVVSGGVSKNNNLRK 378 Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATAD--LGVSVRPRWPL 331 ++ ++ K+ +++ RPE C DNG MIA+AG+ FK G T D V P WPL Sbjct: 379 RIDDIGKRYNLPIYFPRPELCNDNGTMIAWAGVEMFKKGMTVDDPEKVIYLPVWPL 434 >UniRef50_B0D096 Predicted protein n=2 Tax=Agaricales RepID=B0D096_LACBS Length = 379 Score = 211 bits (538), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 127/355 (35%), Positives = 189/355 (53%), Gaps = 20/355 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL E+S D+T A+ K +L+N + Q LH YGG+ P A H R ++ A Sbjct: 19 VLAFESSADDTCAAVVHSSKSILSNVVIKQNNLHEQYGGIYPITAIDAHQRNMPYAVRRA 78 Query: 63 LKESGL-TAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 LKE+ + KDI+ +A+T GPG+ G L VG ++LA A + P + VHHM+GH L P+L Sbjct: 79 LKEANVDLVKDINGIAFTRGPGMPGCLSVGMNAAKTLAAALNKPIVGVHHMQGHALTPLL 138 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--- 177 NPP+FPF++LLVSGGHT L+ T + +++L ++D++ G A D+ +KLL L + Sbjct: 139 TSSNPPKFPFLSLLVSGGHTLLLLATSLDSFQILATTVDESIGRAIDQVSKLLDLKWTSL 198 Query: 178 -PGGPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRD----NGTDDQT 231 PG L A + V P P G L FS+SGL + I N D T Sbjct: 199 GPGDALEKFCAQKVDTDSIVIPLPRVTMAGKLSFSYSGLHSRVERYIETLGGINNIDLPT 258 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFK-----RLVMAGGVSANRTLRAKLAEMMKKR 286 R IARAF+ + + L K L K +V++GGV++N+ LR +L + + K Sbjct: 259 RMAIARAFQKSAMAQLEDKLLLGLQWCQQKDIPVRHVVLSGGVASNQYLRERLHQCILKA 318 Query: 287 ----RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 ++ + P CTDN MI +A M RF A + + RP+W + +L ++ Sbjct: 319 DLALSIDLVFPPPPLCTDNAVMIGWASMHRFLANDFDEYDIESRPKWSIDQLASS 373 >UniRef50_Q5ZZQ1 Probable O-sialoglycoprotein endopeptidase n=8 Tax=Mycoplasma RepID=GCP_MYCH2 Length = 322 Score = 210 bits (535), Expect = 5e-53, Method: Compositional matrix adjust. Identities = 123/326 (37%), Positives = 180/326 (55%), Gaps = 21/326 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETS D+ +A++ + K + + SQ +LH +GG VPELASR+H R +++ Sbjct: 1 MKILGIETSHDDASVALFSENKVEILLTI-SQFELHEQFGGTVPELASREHSRNLAIILE 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L ++ + IDA+AYT PGL+G L +G +L+ ++ P IP+ H+ GH + Sbjct: 60 KLLGKN-IDFSTIDAIAYTKNPGLIGPLKIGFLFASALSLFFNKPLIPIDHLLGHFWSAA 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E N EFP ++LL+SGGHTQLI E++G ++DDA GE +DK + LG YPGG Sbjct: 119 IE-NDLEFPVLSLLISGGHTQLIFAENKNNLEIIGSTVDDALGEIYDKIGRSLGCGYPGG 177 Query: 181 PLLSKMAAQGTAGRFV---FPRPMTDRPGLDFSFSGLKTFA---ANTIRDNGTDDQTR-A 233 P + + Q F P LDFSFSGLKT N +++N Q + Sbjct: 178 PKIDLIWQQNNVRNMELIDFSLPKVLENPLDFSFSGLKTQVINYTNNLKENYLFSQKKVV 237 Query: 234 DIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 +IA +F+ V+ L KR LD + K + + GGV+AN +R + K + Sbjct: 238 EIAVSFQKTVIKYL----KRQLDLALKTKKNVKTITLVGGVAANSEIRKLIKTYENKYK- 292 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFK 314 V + EFCTDNGAMIA A + K Sbjct: 293 -VVIPKKEFCTDNGAMIAKAAQIFLK 317 >UniRef50_D1B582 Metalloendopeptidase, glycoprotease family n=5 Tax=Campylobacterales RepID=D1B582_SULD5 Length = 334 Score = 210 bits (535), Expect = 6e-53, Method: Compositional matrix adjust. Identities = 123/323 (38%), Positives = 185/323 (57%), Gaps = 9/323 (2%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ IAI ++K LL ++ SQ + HA YGGVVPELA+R H T+P I Sbjct: 2 ILSIESSCDDSSIAITRIEDKKLLFHKKISQDEEHAKYGGVVPELAARLHAI-TLPKI-- 58 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E+ + + A+A T PGL +L+ G ++ ++L+ A +P + ++H++GH+ + + Sbjct: 59 -LEETQPYFEALKAIAVTNEPGLSVSLVEGVSMAKALSVALHLPLLGINHLKGHICSLFI 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E+ FP LLVSGGHTQL+ V + Q ELL ++DD+ GE+FDK K+LGL YP G Sbjct: 118 EEET-RFPMDVLLVSGGHTQLLHVKSLEQIELLATTMDDSFGESFDKVGKMLGLPYPAGA 176 Query: 182 LLSKMAAQGTAGRFVFPRPM--TDRPGLDFSFSGLKTFAANTIR-DNGTDDQTRADIARA 238 ++ A +G A F F P+ T L FS+SGLK I D+Q DI + Sbjct: 177 IIETYAQKGDAKCFDFTIPLQGTSSSMLAFSYSGLKNQVRLCIEAQERMDEQILCDICAS 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+ V LM K K+A + + GG SAN LR +L ++ +++ A+ FC Sbjct: 237 FQRVAVAHLMQKIKKAYQARKVEHFGVVGGASANLYLRGELERFCASKKAQLYTAKMAFC 296 Query: 299 TDNGAMIAYAGMVRFKAGATADL 321 +DN AMI G+ ++ G L Sbjct: 297 SDNAAMIGRCGVEAYQKGVFVSL 319 >UniRef50_B8LEI0 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8LEI0_THAPS Length = 342 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 129/337 (38%), Positives = 189/337 (56%), Gaps = 21/337 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE+SCD+TG A+ + +L L SQ +H +GGV P LA H + +I A Sbjct: 5 VLGIESSCDDTGAAVLRSDGLILGESLASQHAIHEQFGGVFPGLAKAAHEQNIQTVISTA 64 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA---P 119 L+ + +T +D+DAV T GPGL L VG GR LA + P + +HH+E H+L P Sbjct: 65 LQNANMTMEDVDAVGVTVGPGLEICLRVGCNWGRELAMEYGKPFVGIHHLEAHILMARIP 124 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + + EFPF+ALLVSGGH Q++ GIGQY ++G ++DD+ GEAFDKTA+LLGL G Sbjct: 125 SEKYDTMEFPFLALLVSGGHCQILKCLGIGQYSIVGGTLDDSLGEAFDKTARLLGLPVGG 184 Query: 180 GPL--LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT---FAANTIR-DNGTDD---- 229 G + ++A G P P+ R DFS++GLKT A I + G + Sbjct: 185 GGGPAIEQLAKDGDPKSVKLPIPLQKRKDCDFSYAGLKTAVRLATEKICVERGVESAEEL 244 Query: 230 --QTRADIARAFEDAVVDTLMIKCKRALDQT----GFKRLVMAGGVSANRTLRAKLAEMM 283 Q +A++A +F+ + I+ RA+++ G L + GGV+AN+ LR++L + Sbjct: 245 PHQDKANVAASFQHTAFRHVEIRLGRAMERVEKEDGISTLAVVGGVAANKELRSRLNALC 304 Query: 284 KKRR--GEVFYARPEFCTDNGAMIAYAGMVRFKAGAT 318 R ++ P CTD GAM A+A + R G++ Sbjct: 305 SDRAEPWKMMVPPPRLCTDQGAMSAWAAVERLMVGSS 341 >UniRef50_C5ZWF6 Metal-dependent protease n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZWF6_9HELI Length = 351 Score = 208 bits (529), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 122/335 (36%), Positives = 185/335 (55%), Gaps = 16/335 (4%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ IAI +K ++ +Q SQ + H+ YGGVVPE+ASR H + +P I Sbjct: 20 ILSIESSCDDSSIAITQIKDKKIVFHQKISQEREHSSYGGVVPEIASRLHA-EILPQI-- 76 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+ + KD+ A+A T PGL L+ G + ++L+FA ++P I V+H++GHL + L Sbjct: 77 -LEHTKPYFKDLKAIAVTTEPGLNITLMEGLMMAKTLSFALEIPLISVNHLKGHLYSLFL 135 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E FP ALLVSGGHT L+ + ++ ++IDD+ GE+FDK +K+LGL YPGGP Sbjct: 136 EQEAI-FPLGALLVSGGHTMLLEARSFNEINIIAQTIDDSFGESFDKVSKMLGLGYPGGP 194 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA---DIARA 238 ++ A +G F P P+ R FSFSGLK I+ + Q++A DI + Sbjct: 195 IVEFQAQKGNDRAFELPLPLKSRKDFAFSFSGLKNAVRLVIQKQ--EIQSKAFVEDICAS 252 Query: 239 FEDAVVDTLMIKC-----KRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 F+ ++ L K K + +K + GG SAN LR ++ + + A Sbjct: 253 FQRVAIEHLSKKTQIFFEKNSKSMDSWKYFGVIGGASANLVLRNEIQRICDYYGVTLLLA 312 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 E+C+DN AMI + + G D + V+PR Sbjct: 313 PLEYCSDNAAMIGRVALESYLRGEFGDFNLQVKPR 347 >UniRef50_C1F9R2 Metalloendopeptidase, glycoprotease family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F9R2_ACIC5 Length = 401 Score = 202 bits (515), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 131/355 (36%), Positives = 184/355 (51%), Gaps = 56/355 (15%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDET A+ + L+N + SQ+ +HA +GGVVPELASR+H+R VP+++ A Sbjct: 15 ILGIESSCDETSAAVVRGGREALSNVIASQIAVHAPFGGVVPELASREHLRAIVPVVEQA 74 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + +G+ D+DAVA T GPGL GALLVG + ++LA A P I V+H+EGH+ A +LE Sbjct: 75 MAGAGVAFDDLDAVAVTEGPGLPGALLVGVSYAKALALALGKPLIAVNHLEGHIHAVLLE 134 Query: 123 --------DNPPEF--PFVALLVSGGHTQLISVTGIGQ---YELLGESIDDAAGEAFDKT 169 PE P +AL+VSGGHT L Y +G ++DDAAGEAFDK Sbjct: 135 RVLQPAETQATPEHGQPKLALVVSGGHTHLYLAQETHHAWTYRNVGRTVDDAAGEAFDKV 194 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVF--------------PRPMTDRPGLDFSFSGLK 215 AKLLGL YPGGP + +A G A F P + FSFSG+K Sbjct: 195 AKLLGLGYPGGPWVDALAPFGDARAVPFSFAQVKAKAHRRADPVALHPEEATYFSFSGIK 254 Query: 216 TFAANTIRDNGTD-----------------------------DQTRADIARAFEDAVVDT 246 T ++ + + DQ D+ +F+ AVV Sbjct: 255 TAVLRYVQTHDMEARIAARRQAMATMPDASPRRDLEAVRALCDQESLDLLASFQRAVVGD 314 Query: 247 LMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 L+ K RA ++ ++++GGV+ANR LR + + V + + TDN Sbjct: 315 LVRKTFRAAERYDVAEILVSGGVAANRELRERFTAEAAAQGLPVAFPSLKLATDN 369 >UniRef50_B0B9U7 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Chlamydia trachomatis RepID=GCP_CHLT2 Length = 338 Score = 202 bits (513), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 120/340 (35%), Positives = 185/340 (54%), Gaps = 18/340 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG+E+SCDET ++ + K +LAN++ SQ +HA YGGV+PELASR H++ L+ Sbjct: 1 MLTLGLESSCDETSCSLVQNGK-ILANKIASQ-DIHASYGGVIPELASRAHLQTFPELLT 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AA + +G++ +DI+ ++ PGL+GAL +G + LA P I V+H+E HL A Sbjct: 59 AATQSAGVSLEDIELISVANTPGLIGALSIGVNFAKGLASGLKRPLIGVNHVEAHLYAAC 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E +FP + L +SG HT L + + L+G++ DDA GE FDK A+ LGL YPGG Sbjct: 119 MEAPATQFPALGLAISGAHTSLFLMPDATTFLLIGKTRDDAIGETFDKVARFLGLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT---------DDQT 231 L ++A +G A F F G DFSFSGLKT ++ N + + Sbjct: 179 QKLEELAREGDADAFAFSPARVS--GYDFSFSGLKTAVLYALKGNNSSAKAPFPEVSETQ 236 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + +IA +F+ AV T+ K + + L++ GGV+ N R L ++ ++ Sbjct: 237 KRNIAASFQKAVFMTIAQKLPDIVKAFSCESLIVGGGVANNSYFRRLLNQICSL---PIY 293 Query: 292 YARPEFCTDNGAMIAYAGMVRF--KAGATADLGVSVRPRW 329 + + C+DN AMIA G F + + ++ R +W Sbjct: 294 FPSSQLCSDNAAMIAGLGERLFCNRTHVSKEVIPCARYQW 333 >UniRef50_UPI000058820F PREDICTED: hypothetical protein n=2 Tax=Strongylocentrotus purpuratus RepID=UPI000058820F Length = 400 Score = 202 bits (513), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 129/354 (36%), Positives = 197/354 (55%), Gaps = 34/354 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIET+CD+TG A+ D+ +LA +L++Q ++HA GG++P LA H + P++Q Sbjct: 46 VLGIETTCDDTGAAVMDETGRVLAERLHTQKRIHAKNGGIIPPLAQALHRQFIDPVVQGT 105 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAW-DVPAIPVHHMEGHLLAPML 121 +K++G+ KD+ AVA + PG+ +L VG + + +P IP+HHME H L + Sbjct: 106 IKDAGIEMKDLSAVALSTMPGMPLSLRVGLDYTKDMLLRHPHLPLIPIHHMEAHALTVRM 165 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP--- 178 + +FPF+ LLVSGG+ L G+G +++LG + DDA GEAFDK A+ L L + Sbjct: 166 VER-VDFPFLVLLVSGGNCILAVARGVGDFKVLGVTWDDAPGEAFDKVARRLKLQHHPDC 224 Query: 179 ----GGPLLSKMAAQGTAGRFVFPR--PMTDRPGLDFSFSGLKTFAANTIRDN------- 225 GG + KMA G R + R PM+ +FSF+GLK A I+ + Sbjct: 225 LGLCGGQAIEKMAENGNF-RLLIERGVPMSRHRDCNFSFAGLKNMANWLIQHHEVRQGLT 283 Query: 226 GTDDQ---TRADIARAFEDAVVDTLMIKCKRAL---DQTGF-----KRLVMAGGVSANRT 274 +DD T +DIA +F+ V L+I+ RA+ QTG + LV++GGV++N Sbjct: 284 ASDDHHLATISDIAASFQHKVTQHLVIRIARAMLYCQQTGLIPEGNQTLVVSGGVASNDY 343 Query: 275 LRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +R L + ++ P CTDNG MIA+AG+ R + D+G + P+ Sbjct: 344 IRKALDFTTSLFKYKLICPPPYLCTDNGVMIAWAGVERLR----LDMGFAEDPQ 393 >UniRef50_B3MQN2 GF20469 n=4 Tax=Drosophila RepID=B3MQN2_DROAN Length = 416 Score = 194 bits (494), Expect = 3e-48, Method: Compositional matrix adjust. Identities = 127/336 (37%), Positives = 174/336 (51%), Gaps = 29/336 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D + AN L SQ + H YGG++P A H + + Sbjct: 34 VLGIETSCDDTGIAIVDTSGNVKANVLDSQQEFHTRYGGIIPPRAQDLHRARIHSAYERC 93 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+ L + + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 94 LEEANLQPEQLAAIAVTTRPGLPLSLLVGVRFARHLARRLKKPLLPVHHMEAHALQARME 153 Query: 123 --DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD---- 176 D P FPF+ LLVSGGH QL V G G+ LLG+++DDA GEAFDK A+ L L Sbjct: 154 HPDAIP-FPFLCLLVSGGHCQLAMVHGPGRLTLLGQTLDDAPGEAFDKIARRLRLYILPE 212 Query: 177 ---YPGGPLLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT- 231 + GG + A T + FP P++ + +FSF+G+K + IR ++T Sbjct: 213 YRLWNGGRAIEHAARLATDPSAYDFPLPLSQQRNCNFSFAGIKNNSFRAIRKKERMERTP 272 Query: 232 -------RADIARAFEDAVVDTLMIKCKRALDQT----------GFKRLVMAGGVSANRT 274 AD AV LM + +RAL+ G LV++GGV+ N T Sbjct: 273 PDGIISNYADFCAGLLRAVSRHLMHRTQRALEYCLQPQVRFFGDGQPTLVVSGGVANNDT 332 Query: 275 LRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGM 310 + A + + + F +C+DNG MIA+ G+ Sbjct: 333 IFANIQHLAAQYGCRSFRPSKRYCSDNGVMIAWHGV 368 >UniRef50_P75055 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Mycoplasma RepID=GCP_MYCPN Length = 319 Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 112/313 (35%), Positives = 174/313 (55%), Gaps = 19/313 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIET+CD+T I + + K + A+ + S KLHA GGVVPE+A+R H + + A Sbjct: 7 ILGIETTCDDTSIGVITESK-VQAHIVLSSAKLHAQTGGVVPEVAARSHEQNLL----KA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L++SG+ + I +AY A PGL G L VGAT RSL+F D P +P++H+ H+ + +++ Sbjct: 62 LQQSGVVLEQITHIAYAANPGLPGCLHVGATFARSLSFLLDKPLLPINHLYAHIFSALID 121 Query: 123 D--NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 N + P + L+VSGGHT + + + EL+ E+ DDA GE +DK + +G YP G Sbjct: 122 QDINQLKLPALGLVVSGGHTAIYLIKSLFDLELIAETSDDAIGEVYDKVGRAMGFPYPAG 181 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD----NGTDDQTR--AD 234 P L + F RP T FS+SGLK+ I+ G + QT + Sbjct: 182 PQLDSLFQPELVKSHYFFRPSTKWT--KFSYSGLKSQCFTKIKQLRERKGFNPQTHDWNE 239 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 A F+ ++D + K A+ Q + L++ GGVSAN+ LR ++ ++ + A Sbjct: 240 FASNFQATIIDHYINHVKDAIQQHQPQMLLLGGGVSANKYLREQVTQL----QLPYLIAP 295 Query: 295 PEFCTDNGAMIAY 307 ++ +DNGAMI + Sbjct: 296 LKYTSDNGAMIGF 308 >UniRef50_B8PI87 Predicted protein n=2 Tax=Postia placenta Mad-698-R RepID=B8PI87_POSPM Length = 691 Score = 192 bits (487), Expect = 2e-47, Method: Compositional matrix adjust. Identities = 123/368 (33%), Positives = 188/368 (51%), Gaps = 40/368 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IE+S D+T A+ ++ +L+N + Q H YGG+ P +A H + +Q A Sbjct: 318 VLAIESSADDTCAAVVTSDRQILSNVVVRQDSFHESYGGIHPYIAIEAHQQNMPGAVQKA 377 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ +G++A D+D +A+T GPG+ G L VG+ ++LA A + P + VHHM+ H L P L Sbjct: 378 LQVAGMSATDVDGIAFTRGPGIGGCLSVGSNAAKTLAAALNKPLVGVHHMQAHALTPFLT 437 Query: 123 ---DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY-- 177 ++ P +PF+ LLVSGGHT L+ T + +L ++D++ G AFDK +++L L + Sbjct: 438 TPANSLPTYPFLTLLVSGGHTLLLLATSPRAFRVLATTLDESIGRAFDKVSRMLALPWSA 497 Query: 178 --PGGPL----------------LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAA 219 PG L ++ + A P PM R L FS++GL + Sbjct: 498 HGPGAALEQFCRDGPAGGTGAPGGEEIGSGEPAEAPHIPLPMRGR--LAFSYTGLHSSVE 555 Query: 220 NTIRDNG--TDDQTRADIARAFEDAVVDTLMIK-------CKRALDQTGFKRLVMAGGVS 270 + G D +T+ IA F+ V L K C+R Q + +V++GGV+ Sbjct: 556 RFLHARGGVVDARTKHAIATTFQKNAVGQLEEKLALGLQLCRRKGIQ--IRHVVVSGGVA 613 Query: 271 ANRTLRAKLA----EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVR 326 +N LR +L E + + P CTDN MIA+A M RF AG T D V +R Sbjct: 614 SNSYLRERLRICLDEASPDEHIALIFPPPSLCTDNAVMIAWASMHRFLAGDTDDYTVELR 673 Query: 327 PRWPLAEL 334 +W + EL Sbjct: 674 RKWSIEEL 681 >UniRef50_Q29HY2 GA12844 n=3 Tax=Sophophora RepID=Q29HY2_DROPS Length = 427 Score = 190 bits (483), Expect = 5e-47, Method: Compositional matrix adjust. Identities = 124/346 (35%), Positives = 177/346 (51%), Gaps = 27/346 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D + + +N LYSQ + H YGG++P A H + Sbjct: 30 VLGIETSCDDTGIAIVDTDGRVHSNVLYSQQEFHTRYGGIIPPRAQDLHRARIEDAYNRC 89 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ L + + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 90 LVEADLRPEQLTAIAVTNRPGLPLSLLVGLRFARHLARRLQKPLLPVHHMEAHALQARME 149 Query: 123 D-NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 + + FPF+ LL+SGGH QL V G G+ LLG+++DDA GEAFDK A+ L L Sbjct: 150 NISAISFPFLCLLISGGHCQLALVRGPGRLTLLGQTLDDAPGEAFDKIARRLRLYVLPQY 209 Query: 176 -DYPGGPLLSKMAAQGTA-GRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + GG + A + FP P+ + +FSF+G+K + IR +QT Sbjct: 210 RAWNGGQAIEHAAQSAVCPDAYDFPLPLAQQRNCNFSFAGIKNNSFRAIRARERLEQTPP 269 Query: 234 D-IARAFED-------AVVDTLMIKCKRALD-----QTGF-----KRLVMAGGVSANRTL 275 D I + D AV LM + +RAL+ + G LV++GGV+ N + Sbjct: 270 DGIISNYSDFCAGLLQAVSRHLMHRTQRALEYCLRPENGLFGDASPTLVVSGGVANNDVI 329 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL 321 + + + + +C+DNG MIA+ G+ + A + L Sbjct: 330 YRNIEHLAGQYNCRSYRPFKRYCSDNGVMIAWHGIEQLLANSAQHL 375 >UniRef50_C3XEQ4 O-sialoglycoprotein endopeptidase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XEQ4_9HELI Length = 500 Score = 190 bits (483), Expect = 5e-47, Method: Compositional matrix adjust. Identities = 110/339 (32%), Positives = 177/339 (52%), Gaps = 21/339 (6%) Query: 1 MRVLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 MR+L IE+SCD++ +A D LL ++ SQ H+ YGGVVPELASR R V L+ Sbjct: 1 MRILSIESSCDDSALAYTDGTNTKLLWHEKISQEASHSHYGGVVPELASRLFARDLVQLL 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + + + KDI +A T PGL +LL G + ++LA + ++P + ++H++GH+ + Sbjct: 61 ENF--KQNFSLKDITHIAVTNEPGLSTSLLEGVMMAKALALSLNIPLLGINHLKGHIYSL 118 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 +E P LLVSGGHT L+ ++ ++DD+ GE +DK AK+LGL YPG Sbjct: 119 FIESEAI-LPLCVLLVSGGHTMLLECYSYNDMRVIANTLDDSFGECYDKAAKMLGLGYPG 177 Query: 180 GPL---LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTF---------AANTIRDNGT 227 G + +++MA + P P+ ++ FSFSGLK I+D+ T Sbjct: 178 GMIIDSMAQMALKENIAPIALPIPLVNQNIQSFSFSGLKNAFRLQLEKMELKTLIQDSKT 237 Query: 228 DDQTRADIARA----FEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEM 282 D + A+A +++ L+ KC+ + Q K + GG SAN LR K + Sbjct: 238 QDIKNSTQAKALALGLQESATTHLIQKCRSYMKQNSHIKHFAIVGGASANSMLREKAQSL 297 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL 321 + ++ + ++C+DN AMI A + + + D+ Sbjct: 298 AAQFDNKLLMSELKYCSDNAAMIGRAAIAKIRHENMIDI 336 >UniRef50_B3PND6 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Mycoplasma RepID=GCP_MYCA5 Length = 311 Score = 190 bits (483), Expect = 5e-47, Method: Compositional matrix adjust. Identities = 113/319 (35%), Positives = 173/319 (54%), Gaps = 16/319 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M + IE+S D+T A+ DD K + + +Q ++H YGG VPE+ASR HV+ LI+ Sbjct: 1 MIIFAIESSHDDTSFALLDDNKPIWMKTI-TQTEIHKQYGGTVPEIASRLHVKNIGILIE 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +S + ID +AYT PGLVG+L VG V +SLA + + ++H+EGH + Sbjct: 60 DI--KSQININKIDLIAYTKEPGLVGSLHVGYVVAQSLALILNKKIVGLNHLEGHFYSAF 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + +P + LLVSGGH+QL+ ++++G++ DDA GE +DK A+ L L +PGG Sbjct: 118 IGKEVI-YPALGLLVSGGHSQLVLYNSKDDFKIIGQTQDDAVGEVYDKVARKLNLGFPGG 176 Query: 181 PLLSKMAAQG---TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-DDQTRAD-I 235 PL+ ++ P+ DFSFSG+KT N I + + ++Q + I Sbjct: 177 PLIDQIWKNNHKLYTAHLTIPKT---EGFFDFSFSGIKTNVINLINNCASRNEQINVNQI 233 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A F++ +V+ L + A+ + K +V+AGGVSAN +R EM VF Sbjct: 234 ATEFQNTIVEYLKEHMETAIKKFSPKCIVLAGGVSANFAIR----EMFYSLHKNVFLPDL 289 Query: 296 EFCTDNGAMIAYAGMVRFK 314 E+ TDN MIA +F+ Sbjct: 290 EYTTDNAMMIARLAYEKFR 308 >UniRef50_B1AJ51 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Ureaplasma RepID=GCP_UREP2 Length = 320 Score = 189 bits (480), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 114/313 (36%), Positives = 171/313 (54%), Gaps = 20/313 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCDET +A++++ K L+A+++ S + + +GGVVPELASR H + L Sbjct: 7 ILSIESSCDETSLALFENNK-LIAHKISSSASIQSLHGGVVPELASRYHEQNINHLFNEI 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ + I VAYTA PGL G L VG + LA + +P++H+ H+ + + Sbjct: 66 LNETKINPLTITHVAYTAMPGLPGCLHVGKVFAKQLAVLINAELVPINHLHAHVFSASIN 125 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 N FPF+ L+VSGG + + V + ++L ++ DDA GE +DK A++LG YPGGP+ Sbjct: 126 QNLT-FPFLGLVVSGGESCIYLVNDYDEIKVLNQTHDDAIGECYDKIARVLGWKYPGGPI 184 Query: 183 LSKMAAQGTAG-RFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD---IARA 238 + K + A F+ +P DFSFSGLKT N I N + D +A + Sbjct: 185 IDKNYQENLATLEFIKSQPAAK----DFSFSGLKTAVINYIH-NAKQKKISFDPVVVASS 239 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE-- 296 F+ ++ ++ K K L+ L + GGVSAN LR K+ + +V PE Sbjct: 240 FQKFAINEIIKKIKYYLNLYKLNHLAIGGGVSANSLLRKKIQSL------DVISYIPEMI 293 Query: 297 FCTDNGAMI-AYA 308 + DN AMI AYA Sbjct: 294 YTGDNAAMIGAYA 306 >UniRef50_B6JWU0 Glycoprotease pgp1 n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JWU0_SCHJY Length = 412 Score = 188 bits (478), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 122/359 (33%), Positives = 180/359 (50%), Gaps = 26/359 (7%) Query: 1 MRVLGIETSCDETGIAI--YDDEKG----LLANQLYSQVKLHADYGGVVPELASRDHVRK 54 + VLGIETSCD+ +A+ YD + +L + + L+ YGG+ P + +H R+ Sbjct: 33 INVLGIETSCDDCSVAVCQYDQSRNEPSKVLLQKTRRTIHLYEKYGGIHPNIVMHEHQRQ 92 Query: 55 TVPLIQAALKES-GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 PLIQ+ L E+ L A ID V+ T GPG++G L VG + LA VP I VHHM Sbjct: 93 LAPLIQSVLTEAEKLDASIIDIVSVTRGPGMLGPLAVGLNTAKGLAVGLKVPLIGVHHML 152 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 GHLLAP LE N +FPF++LLVSGGHT L+ + +E+L ++D A G+ DK A+LL Sbjct: 153 GHLLAPKLERN-IDFPFLSLLVSGGHTMLVYSKSLFDHEILATTLDIAVGDYLDKCARLL 211 Query: 174 GLDYPG---GPLLSKMAAQGTAGRFVFPRPMTDRPGLD---FSFSGLKTFAANTIRDNGT 227 + + G L + + F P++ FSF+GL+T + G Sbjct: 212 RIPWNGEMPAAALERYSVVSDVTEFPLHVPLSKNAKTRLHCFSFAGLQTQVEKVLTCLGG 271 Query: 228 D---DQTRADIARAFEDAVVDTLMIK---CKRALDQTGFKRLVMAGGVSANRTLRAKLAE 281 + + + IA A + D + K C L V +GGV+ NR LR L Sbjct: 272 ETAPENVKRRIAYAVQSIAFDHICRKVRLCMNDLVDKPISAFVCSGGVARNRYLRNMLVV 331 Query: 282 MMKKRRGEVFYARP------EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 M+ + ++ P + C+DN +MIA A + +K G T+ L + +W L L Sbjct: 332 MLSNFETDTSHSIPLVCPSADLCSDNASMIANAAIEMYKHGITSPLTIEPTSKWSLDAL 390 >UniRef50_B3RQR7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RQR7_TRIAD Length = 405 Score = 187 bits (476), Expect = 4e-46, Method: Compositional matrix adjust. Identities = 127/355 (35%), Positives = 182/355 (51%), Gaps = 28/355 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 V+GIETSCD+TG+AI DD+ LL + L SQ +H G G+ P A++ H R ++Q+ Sbjct: 38 VMGIETSCDDTGVAIVDDQGRLLGDALQSQSSIHKPLGWGIHPVTAAQLHERNIHAVVQS 97 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL +S L +DI +A T GPGL +L VG + L + I VHHM H L + Sbjct: 98 ALHKSNLKIEDIHTIATTVGPGLAFSLNVGLDYSKKLLQQHNKRFIAVHHMAAHALTVRM 157 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 NP EFP++ LLVSGGH L V G ++ LG ++DDA GE FDK A+ L L Sbjct: 158 L-NPIEFPYLVLLVSGGHCILAVVNGPCEFYRLGSTLDDAPGEVFDKVARTLELHTHPEV 216 Query: 176 -DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAAN---------TIRDN 225 D GG + +A G F P M +FSF+G K+ A N ++ D Sbjct: 217 GDIAGGRAIEIVAKLGDEKAFKLPHIMAGVRNCNFSFAGFKS-AVNAHLKRVSFASLSDW 275 Query: 226 GTDDQT-RADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKL 279 T A++A +F+ + + + +RAL + LV++GGV+ N +R +L Sbjct: 276 DQQKMTIAANMAASFQYYLTWHIAKRVRRALVFCKTFNPKCRTLVISGGVACNNYIRNEL 335 Query: 280 AEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG---VSVRPRWPL 331 + ++ P CTDNG MIA+AG+ K+ L V +P+WPL Sbjct: 336 DKCATAFGFQLACPPPYLCTDNGIMIAWAGVEHLKSNTATILNPQSVIYQPKWPL 390 >UniRef50_Q9H4B0 Probable O-sialoglycoprotein endopeptidase 2 n=31 Tax=Bilateria RepID=OSGP2_HUMAN Length = 414 Score = 186 bits (473), Expect = 7e-46, Method: Compositional matrix adjust. Identities = 124/356 (34%), Positives = 177/356 (49%), Gaps = 29/356 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+T A+ D+ +L ++SQ ++H GG+VP A + H ++Q A Sbjct: 39 VLGIETSCDDTAAAVVDETGNVLGEAIHSQTEVHLKTGGIVPPAAQQLHRENIQRIVQEA 98 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L SG++ D+ A+A T PGL +L VG + L P IP+HHME H L L Sbjct: 99 LSASGVSPSDLSAIATTIKPGLALSLGVGLSFSLQLVGQLKKPFIPIHHMEAHALTIRL- 157 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYP--- 178 N EFPF+ LL+SGGH L V G+ + LLG+S+D A G+ DK A+ L L +P Sbjct: 158 TNKVEFPFLVLLISGGHCLLALVQGVSDFLLLGKSLDIAPGDMLDKVARRLSLIKHPECS 217 Query: 179 ---GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD------ 229 GG + +A QG F P+ DFSF+GL+ I ++ Sbjct: 218 TMSGGKAIEHLAKQGNRFHFDIKPPLHHAKNCDFSFTGLQHVTDKIIMKKEKEEGIEKGQ 277 Query: 230 --QTRADIARAFEDAVVDTLMIKCKRA---------LDQTGFKRLVMAGGVSANRTLRAK 278 + ADIA + + L+ + RA L Q LV +GGV++N +R Sbjct: 278 ILSSAADIAATVQHTMACHLVKRTHRAILFCKQRDLLPQNN-AVLVASGGVASNFYIRRA 336 Query: 279 LAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA--GATADL-GVSVRPRWPL 331 L + + + P CTDNG MIA+ G+ R +A G D+ G+ P+ PL Sbjct: 337 LEILTNATQCTLLCPPPRLCTDNGIMIAWNGIERLRAGLGILHDIEGIRYEPKCPL 392 >UniRef50_Q6C9V8 YALI0D07920p n=1 Tax=Yarrowia lipolytica RepID=Q6C9V8_YARLI Length = 376 Score = 186 bits (472), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 124/356 (34%), Positives = 183/356 (51%), Gaps = 32/356 (8%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVPL 58 VL IETSCD+T AI ++ L VK+ D GG+ P LA+ H + PL Sbjct: 26 NVLAIETSCDDTCAAIISRDREKNTAALIDHVKITLDSSLQGGINPALATAHHHQSVGPL 85 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 I+ LK+ T ID V T GPGL G L G T + L+ VP + VHHM HLL Sbjct: 86 IRDVLKKHADTT--IDLVCATRGPGLPGCLSSGVTFAKGLSLGLGVPYLGVHHMLAHLLT 143 Query: 119 PMLED-------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 P L + + EFPF++LLVSGGHT L+ + + +L + D A G+A DK A+ Sbjct: 144 PRLFEAAEGYSGHKTEFPFLSLLVSGGHTMLVLSKSLYDHTVLCNTADVAIGDALDKCAR 203 Query: 172 LLGLDYPGGPLLSKM------AAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN 225 LG G +L K+ +A + ++ P P+ ++ + +SF+ ++ ++ Sbjct: 204 TLGFQ---GNMLGKVMDQYCRSADTPSSQWSIPMPVDNKNDIRYSFAAFHSYIG--MKKK 258 Query: 226 GTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKR-------LVMAGGVSANRTLRAK 278 T +T ++A + A+ + LM K K A + +K+ LV +GGV+AN LR Sbjct: 259 ETQAETTPELALEVQTAIFNHLMKKTKAAFNI--YKKEIASATTLVCSGGVAANPRLREA 316 Query: 279 LAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 L E+ K + E + P +CTDN AMI +AG+ + G +DL P+WPLAE Sbjct: 317 LQELCAKYKLEAVFPDPYWCTDNAAMIGWAGIELHEDGYRSDLEGFQIPKWPLAEF 372 >UniRef50_Q8EUQ9 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma penetrans RepID=GCP_MYCPE Length = 306 Score = 186 bits (471), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 108/311 (34%), Positives = 170/311 (54%), Gaps = 14/311 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCD+T +AI +D K +L+ + + K +GG+VPE+ +R H + + Sbjct: 1 MYILSIETSCDDTSVAILEDNK-VLSCIIKNDSKQLNPFGGIVPEIVARYHEENIIKALD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ES ++ ID VAYT PGL G+L VG +++A+A DV +P++H+ GH+L+P Sbjct: 60 LALQESNISLNQIDKVAYTNQPGLPGSLFVGEIFAKTMAYALDVECVPINHIHGHILSPF 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + ++ P++PF++L+ SG T + V + L ++ DDA GE FDK K LG DYP G Sbjct: 120 I-NSVPKYPFMSLIASGKTTSIFLVKSANEIIELTKTRDDAIGEIFDKVGKALGYDYPAG 178 Query: 181 PLLSKMAAQGTAGRF-VFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT--RADIAR 237 P L K A FP P+ + DFSFSG+K + I + ++ I Sbjct: 179 PKLDKYFDISKATITPSFP-PVKN----DFSFSGIKNKFLSIINSSKMKNEEIDTITIGS 233 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 +F +D ++ K K D+ + + GGV+ N + ++ KK + F ++ Sbjct: 234 SFLKYSIDLIIKKLKYYKDEYSVDCVCIGGGVANNNYFKQEI----KKLFSDSFVPESKY 289 Query: 298 CTDNGAMIAYA 308 TDN AMI +A Sbjct: 290 STDNAAMIGFA 300 >UniRef50_Q9VWD6 Probable O-sialoglycoprotein endopeptidase 2 n=6 Tax=Diptera RepID=OSGP2_DROME Length = 409 Score = 183 bits (465), Expect = 6e-45, Method: Compositional matrix adjust. Identities = 118/339 (34%), Positives = 171/339 (50%), Gaps = 35/339 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D ++AN L SQ + H YGG++P A H + Q Sbjct: 27 VLGIETSCDDTGIAIVDTTGRVIANVLESQQEFHTRYGGIIPPRAQDLHRARIESAYQRC 86 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ + L + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 87 MEAAQLKPDQLTAIAVTTRPGLPLSLLVGVRFARHLARRLQKPLLPVHHMEAHALQARME 146 Query: 123 DNPPE---FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD--- 176 PE +PF+ LL SGGH QL+ G G+ LLG+++DDA GEAFDK + L L Sbjct: 147 H--PEQIGYPFLCLLASGGHCQLVVANGPGRLTLLGQTLDDAPGEAFDKIGRRLRLHILP 204 Query: 177 ----YPGGPLL---SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 + GG + +++A+ A + FP P+ + +FSF+G+K + IR + Sbjct: 205 EYRLWNGGRAIEHAAQLASDPLA--YEFPLPLAQQRNCNFSFAGIKNNSFRAIRARERAE 262 Query: 230 QT--------RADIARAFEDAVVDTLMIKCKRALDQTGFKR----------LVMAGGVSA 271 +T D +V LM + +RA++ LVM+GGV+ Sbjct: 263 RTPPDGVISNYGDFCAGLLRSVSRHLMHRTQRAIEYCLLPHRQLFGDTPPTLVMSGGVAN 322 Query: 272 NRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGM 310 N + A + + + F +C+DNG MIA+ G+ Sbjct: 323 NDAIYANIEHLAAQYGCRSFRPSKRYCSDNGVMIAWHGV 361 >UniRef50_UPI0000D561DB PREDICTED: similar to AGAP005215-PA n=1 Tax=Tribolium castaneum RepID=UPI0000D561DB Length = 406 Score = 182 bits (462), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 116/360 (32%), Positives = 179/360 (49%), Gaps = 35/360 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+TG A+ D E +L L+SQ +H GG++P +A H ++ A Sbjct: 22 ILGIETSCDDTGCAVVDTEGNILGEALHSQHLIHLANGGIIPPIAQNLHRENIESVVNTA 81 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 +K S + +D+ AVA T PGL +L +G G+ L ++ P IP+HHME H L + Sbjct: 82 VKNSNYSFRDLSAVAVTVKPGLPLSLTIGMKYGKYLCRLYNKPFIPIHHMEAHALTARMH 141 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------- 175 D EFPF+ LL+SGGH L +G++ LLG + DDA GEAFDK A+ + L Sbjct: 142 DKTVEFPFLVLLISGGHCLLAVAQDVGRFFLLGSTRDDAPGEAFDKVARRMKLTNLSEFS 201 Query: 176 DYPGGPLLSKMAAQGTAG-RFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 GG + A++ +F F P+T FS +GLKT +R + +++ + + Sbjct: 202 KLSGGQAIELAASRAKNPLQFKFTIPLTQYRDCKFSLAGLKT----QVRRHLLEEEKKHN 257 Query: 235 I------------ARAFEDAVVDTLMIKCKRALDQTGFKR--------LVMAGGVSANRT 274 + F+ AV + + +RA+ K LV++GG + N Sbjct: 258 VPPDGLIPDVFNLCAGFQLAVTRHICQRVQRAMVYARRKEMIPENSQTLVVSGGAACNNF 317 Query: 275 LRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA--GATADL-GVSVRPRWPL 331 + L + + + P+ C DNG MIA+ G+ R++A G D V ++ PL Sbjct: 318 IARGLQLVCDEMAYKFVRPPPKLCLDNGVMIAWNGVERWRAKLGVLHDYASVEIQKSCPL 377 >UniRef50_Q17Z01 Probable O-sialoglycoprotein endopeptidase n=13 Tax=Helicobacter RepID=GCP_HELAH Length = 342 Score = 181 bits (460), Expect = 3e-44, Method: Compositional matrix adjust. Identities = 112/340 (32%), Positives = 177/340 (52%), Gaps = 12/340 (3%) Query: 3 VLGIETSCDETGIAIYDDEKG-LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ +A+ E L+A+ SQ K H+ YGGVVPELASR H + +PL+ Sbjct: 2 ILSIESSCDDSSLALTRIEDAKLIAHFKISQEKHHSSYGGVVPELASRLHA-ENLPLLLE 60 Query: 62 ALKESGLTAKD---IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 +K S KD + A+A T PGL L+ G + ++L+ + ++P I H+ GH+ + Sbjct: 61 RIKIS--LNKDFSKLKAIAITNQPGLSVTLIEGLMMAKALSLSLNLPLILEDHLRGHVYS 118 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 + + P LLVSGGH+ ++ +++ S+DD+ GE+FDK +K+L L YP Sbjct: 119 LFINEKKTCMPLSVLLVSGGHSLILEARNYEDIKIMATSLDDSFGESFDKVSKMLNLGYP 178 Query: 179 GGPLLSKMAAQGTAGR--FVFPRPMTDRPGLDFSFSGLKTFAANTIRDN--GTDDQTRAD 234 GGP++ K+A +FP P+ + L FSFSGLK I N ++ T+ Sbjct: 179 GGPVIEKLALDYAHKNEPLMFPIPLKNSLNLAFSFSGLKNAVRLEIEKNAPNLNEITKQK 238 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 I F+ ++ L+ + KR K + GG S N LR + + ++ A Sbjct: 239 IGYHFQSVAIEHLIQQTKRYFKTKRPKIFGIVGGASQNLVLRKAFENLCDEFDCKLVLAP 298 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAE 333 EFC+DN AMI + + ++ L S+ PR L + Sbjct: 299 LEFCSDNAAMIGRSSLEAYQKKHFVPLEKASISPRTLLKK 338 >UniRef50_Q17CG3 O-sialoglycoprotein endopeptidase n=2 Tax=Culicini RepID=Q17CG3_AEDAE Length = 400 Score = 176 bits (445), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 115/341 (33%), Positives = 171/341 (50%), Gaps = 30/341 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD++G AI +L + ++SQ H +GG++P +A H ++Q Sbjct: 28 ILGIETSCDDSGAAIVSGNGTVLGDCIHSQQNSHLKFGGIIPPVAQDFHRLNIDNVVQET 87 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + S + +DA+A T PGL +L+VG + LA + P IP+HHME H L + Sbjct: 88 FRRSDIDCSQLDAIAVTNRPGLPLSLIVGLRYAKYLARKYRKPIIPIHHMEAHALMARMT 147 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL----DY- 177 + P FPF+ +L+SGGH+ L V Q+ LLGE++DDA GEAFDK A+ L L +Y Sbjct: 148 NKVP-FPFLCILISGGHSLLTLVKSTSQFYLLGETLDDAPGEAFDKIARRLKLRNLPEYA 206 Query: 178 --PGGPLLSKMAAQGTAGR-FVFPRPMTDRPGLDFSFSGLKTFAANTI----RDNGTDDQ 230 GG + + A R + FP P++ FSF+GLK A I R+ D Sbjct: 207 WLSGGRSIEQAAMSSDNPRKYDFPLPLSHYRDCQFSFAGLKNTATRHILQQERELDLDPD 266 Query: 231 T----RADIARAFEDAVVDTLMIKCKRAL-----------DQTGFKRLVMAGGVSANRTL 275 D+ F +A + + +RA+ D F LV++GGV+ N + Sbjct: 267 AVLPDYQDLCAGFLNAAARHISQRTQRAIRFCEKEKLIGSDDAKF--LVISGGVACNDAI 324 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 ++ M K + CTDNG MIA+ G+ +F G Sbjct: 325 FNTVSNMAKGFGYTTVRPERQHCTDNGIMIAWNGVEKFLVG 365 >UniRef50_Q7NB15 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma gallisepticum RepID=GCP_MYCGA Length = 321 Score = 176 bits (445), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 103/313 (32%), Positives = 166/313 (53%), Gaps = 18/313 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCD+ IAI D K ++ + S +HA+YGGVVPE+A+R H + A Sbjct: 7 ILGIESSCDDLSIAIAIDNK-IVTTKTKSSSSVHANYGGVVPEIAARYHEEILHQTLNEA 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ LT ID + YT PGL+ L V +L + +PA ++H+ GH+ +PM++ Sbjct: 66 LTEANLTINKIDLITYTENPGLLNCLHVAKVFANTLGYLLKIPAQGINHLYGHIFSPMID 125 Query: 123 D-------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 D + +P + ++VSGGHT + V + LL E++DDA GE +DK + LGL Sbjct: 126 DGDCLYQKSDLIYPALGIVVSGGHTAIYDVQSPSKITLLDETLDDAIGEVYDKVGRALGL 185 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-DQTRAD 234 YP G + ++ A F + T++ FS+SG K+ I N D Sbjct: 186 QYPAGAKIDQLYNPEQAETVEFLK--TNKLSA-FSYSGFKSAVLRYIELNKNQPDFNLVQ 242 Query: 235 IARAFEDAVVDTLMIKCKRALDQ--TGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 +F+ ++D + + K +++ + ++ +++ GGVSAN LR++L E+ K Sbjct: 243 AVSSFQKFIIDDFIDRIKNVINKADSKYQTILLGGGVSANSYLRSELKELAIK----TLV 298 Query: 293 ARPEFCTDNGAMI 305 +P + DN AMI Sbjct: 299 PKPIYSGDNAAMI 311 >UniRef50_UPI0001979AA5 putative DNA-binding/iron metalloprotein/AP endonuclease n=1 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI0001979AA5 Length = 380 Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 113/354 (31%), Positives = 178/354 (50%), Gaps = 46/354 (12%) Query: 3 VLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ +A+ +K L+ + SQ H+ YGG+VPE+ASR H ++ +P I Sbjct: 2 ILSIESSCDDSSLALTRIIDKKLIYHIKISQDSEHSTYGGIVPEIASRLHAKR-LPEILK 60 Query: 62 ALK---ESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 LK + L+ I AVA T PGL L+ G + ++L +P I V+H++GH+ + Sbjct: 61 KLKMFLNNDLSL--IKAVAVTTRPGLSVTLIEGLMMAKTLCLGLQIPLICVNHLKGHIYS 118 Query: 119 ---------------------PMLEDN-------------PPEFPFVALLVSGGHTQLIS 144 P+LE + + LLVSGGHTQ++ Sbjct: 119 LCISKDFATDSAKDSRKNAPPPLLESHLKSHTESLLESRQNKQDSLGVLLVSGGHTQILQ 178 Query: 145 VTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFV---FPRPM 201 V ++ +S+DD+ GE+FDK AK L L YPGGP + + A ++ FP P+ Sbjct: 179 VNDFHHISIIAQSLDDSFGESFDKVAKHLNLGYPGGPQVERYAKNCEINQYKPYEFPIPL 238 Query: 202 TDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG 259 L+FSFSGLK I+ + Q A I++ F++A + ++ K + Sbjct: 239 LHNKKLEFSFSGLKNAVRLAIQEMEQPLSLQDIASISKGFQNAACEHIVRKTRLFFQHFE 298 Query: 260 FKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRF 313 K + GG SAN LR +++E+ + E++ A EFC+DN AMI G+ + Sbjct: 299 GKYFAIVGGASANTYLRERMSELCNEFDKELYLADLEFCSDNAAMIGRVGVEHY 352 >UniRef50_Q4PGZ6 Putative uncharacterized protein n=2 Tax=Ustilaginomycotina RepID=Q4PGZ6_USTMA Length = 414 Score = 175 bits (443), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 121/367 (32%), Positives = 188/367 (51%), Gaps = 39/367 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD++ +I ++ +L++ + Q H+ GG+ P A+ H I AA Sbjct: 52 ILGIETSCDDSCASIVSSDRTILSSIVTKQD--HSSTGGIHPLSAALGHHSNLASTIAAA 109 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 ++++ +TA D+ A+A T GPG+ +L VG + ++L+ +P I VHHM+ H L P+L Sbjct: 110 IEQARITASDLHAIAVTQGPGMASSLGVGLSAAKTLSAVLHIPLIYVHHMQAHALTPLLT 169 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY---P 178 E +PP+ PF+ LLVSGGHT L+ + + +L + DD+ G+AFDK A+ LG+ + P Sbjct: 170 EPDPPKLPFLVLLVSGGHTMLVLARSVTHFRILATTSDDSIGDAFDKVARDLGIPWTSAP 229 Query: 179 GGPLLSKMAAQGTAGR-FVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD---DQTRAD 234 G L + A G VFP P +P FS+SGLK I D + ++ Sbjct: 230 GAALEALAARAEAHGDGLVFPTPCKGQP--TFSYSGLKAAVQRHIASCSPDAMAESAKSS 287 Query: 235 IARA--------FEDAVVDTL---------------MIKCKRALDQTGFKRLVMAGGVSA 271 IA A ED + L I+ + K +V +GGV++ Sbjct: 288 IAAAFQRAACAQLEDKLSMVLRPSHVSQDSRHRPFARIELLDGVSSDDVKTVVCSGGVAS 347 Query: 272 NRTLRAKLAEMMKKR-RGEVFYARP--EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 N +R++L E + + R +V P CTDN AMIA+ G + + T D RP+ Sbjct: 348 NAFIRSRLREHLDRLGRTDVDLQFPPLSLCTDNAAMIAWVGHLIYHQ-RTRDYTRHARPK 406 Query: 329 WPLAELP 335 W L ++P Sbjct: 407 WSLQDIP 413 >UniRef50_UPI000186D055 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D055 Length = 419 Score = 172 bits (436), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 108/338 (31%), Positives = 172/338 (50%), Gaps = 25/338 (7%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 VLGIE+SCD+TG ++ +D +L SQ +H + GG++P +AS H ++ Sbjct: 25 FHVLGIESSCDDTGASVVNDSGKVLGESHCSQSVIHVEAGGILPHVASALHKNNLKHVVN 84 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +A+ +S L +++D +A T PGL+ +L G ++L ++ P IP+HHME H L Sbjct: 85 SAMLQSKLKFENLDVIAVTVKPGLILSLTEGVNYAKNLCTLYNKPLIPIHHMEAHALTVR 144 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKT---AKLLGLD- 176 + D +FPF+ L+SGGH L + ++ LG+S D++ G+ FDK AKL+ L+ Sbjct: 145 IID-EVKFPFLVFLLSGGHCILALANSVRKFYKLGDSNDNSPGQVFDKIARRAKLINLNE 203 Query: 177 ---YPGGPLLSKMAAQGTAGRFVFPR-PMTDRPGLDFSFSGLKTFAANTIRD-----NGT 227 GG + K A G F + + + +FSFSG T A N I+ N + Sbjct: 204 LKGLVGGAAIEKAAKTGNPTAIPFSQTTLKSQKNCNFSFSGYITSAYNYIQSQEINLNLS 263 Query: 228 DDQTRADI---ARAFEDAVVDTLMIKC--------KRALDQTGFKRLVMAGGVSANRTLR 276 D DI +F+ ++ L + +R L KRLV++GGV++N ++ Sbjct: 264 PDAVIPDINDFCASFQWSLTTHLCQRLEMAIKYVEERKLLNEDEKRLVVSGGVASNSLIK 323 Query: 277 AKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFK 314 L + ++F P CTDNG MIA+ G+ K Sbjct: 324 NALKFVCNHYNYKIFIPPPRLCTDNGVMIAWNGVELLK 361 >UniRef50_O94710 Glycoprotease pgp1, mitochondrial n=1 Tax=Schizosaccharomyces pombe RepID=PGP1_SCHPO Length = 412 Score = 172 bits (435), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 111/364 (30%), Positives = 178/364 (48%), Gaps = 34/364 (9%) Query: 1 MRVLGIETSCDETGIAIY--DDEKGLLANQL-----YSQVKLHADYGGVVPELASRDHVR 53 + L IETSCD+T +++ D N++ + + + YGG+ P + +H + Sbjct: 39 LTALAIETSCDDTSVSVVRTSDSSSHCQNEIICLNTHRTISKYEAYGGIHPTIVIHEHQK 98 Query: 54 KTVPLIQAALKE---SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVH 110 +IQ + + SG+T D D +A T GPG++G L VG + LA P + VH Sbjct: 99 NLAKVIQRTISDAARSGIT--DFDLIAVTRGPGMIGPLAVGLNTAKGLAVGLQKPLLAVH 156 Query: 111 HMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTA 170 HM+ H LA LE + +FP++ +LVSGGHT L+ + +E++ + D A G+ DK A Sbjct: 157 HMQAHALAVQLEKS-IDFPYLNILVSGGHTMLVYSNSLLNHEIIVTTSDIAVGDYLDKCA 215 Query: 171 KLLGLDY----PGGPL--LSKMAAQGTAGRFVFPRPMTDRPGL---DFSFSGLKTFAANT 221 K LG+ + P L + T+ P P+ R + FSFSGL+++A Sbjct: 216 KYLGIPWDNEMPAAALEQFASPEINSTSYSLKPPIPLNTREKVHSASFSFSGLESYACRI 275 Query: 222 IRDNGTDDQTRADIARAFEDA----VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRA 277 IR + + A + A + ++ KR LD + K LV +GGV+ N L+ Sbjct: 276 IRKTPLNLSEKKFFAYQLQYAAFQHICQKTLLALKR-LDLSKVKYLVCSGGVARNELLKK 334 Query: 278 KLAEMMKKRRGE-------VFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWP 330 L + + + E + Y P+ C+DN AMI Y + FKAG T+ V +WP Sbjct: 335 MLNDTLMVLQFEHQPTDIKLVYPSPDICSDNAAMIGYTAIQMFKAGYTSSFDVEPIRKWP 394 Query: 331 LAEL 334 + ++ Sbjct: 395 INQI 398 >UniRef50_C4PYC5 Mername-AA018 peptidase (M22 family) n=1 Tax=Schistosoma mansoni RepID=C4PYC5_SCHMA Length = 388 Score = 169 bits (429), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 106/327 (32%), Positives = 162/327 (49%), Gaps = 29/327 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TG A+ + LL + L SQ ++ GGV+P +A+ H ++ A Sbjct: 36 VLGIETSCDDTGAAVIETSGKLLGDCLSSQSRISVMLGGVLPSVAAELHKENIESVVNTA 95 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + +S + +D++ VA T PG+ +L +G + +SLA +P IP+ HME H L + Sbjct: 96 MAKSNIGLRDLNFVAVTVKPGMPLSLKIGVSFAKSLASRLKIPIIPIDHMEAHALTALFT 155 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD------ 176 D +FP++ LL+SGGH L V G+ Y LLG ++D + G+ DK ++ L L+ Sbjct: 156 DPQLKFPYMILLISGGHGILGIVQGLEDYVLLGTALDASPGDVLDKLSRRLKLNRLSDEC 215 Query: 177 ---YPGGPLLSKMAA--QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 GG + +A G RF P P + DFSF+G+ A I N + + Sbjct: 216 LKGVAGGKAIEIIAKTYNGDHQRFNLPLPRSQSKDCDFSFTGIHAAAEQLI--NKLESEN 273 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 R T + C + Q V++GGV +N +RA L E+ Sbjct: 274 RG------------TFYLPCSIFISQMK----VVSGGVGSNCVIRAGLTEVANHYNLRFV 317 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT 318 P CTDNG MIA+ G++ K ++ Sbjct: 318 APPPSLCTDNGIMIAWNGVLLQKENSS 344 >UniRef50_P43122 Putative protease QRI7 n=12 Tax=Saccharomycetaceae RepID=QRI7_YEAST Length = 407 Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 121/366 (33%), Positives = 180/366 (49%), Gaps = 36/366 (9%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKL---HADYGGVVPELASRDHVRKTVPL 58 +VL IETSCD+T +++ D A + + +K D GG++P A H + PL Sbjct: 34 KVLAIETSCDDTCVSVLDRFSKSAAPNVLANLKDTLDSIDEGGIIPTKAHIHHQARIGPL 93 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + AL ES + ID + T GPG+ G+L G + LA AW+ P I VHHM GHLL Sbjct: 94 TERALIESN-AREGIDLICVTRGPGMPGSLSGGLDFAKGLAVAWNKPLIGVHHMLGHLLI 152 Query: 119 PMLEDN--PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 P + N P+FPFV+LLVSGGHT + I +E+L ++ID A G++ DK + LG Sbjct: 153 PRMGTNGKVPQFPFVSLLVSGGHTTFVLSRAIDDHEILCDTIDIAVGDSLDKCGRELG-- 210 Query: 177 YPGGPLLSKMAA--------QGTAGRFVFPRPMTD----RPGLDFSFSGLKT-FAANTIR 223 + G + +M Q A + P P+ + R L FSFS T N + Sbjct: 211 FKGTMIAREMEKFINQDINDQDFALKLEMPSPLKNSASKRNMLSFSFSAFITALRTNLTK 270 Query: 224 DNGTDDQTRAD-----IARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANR 273 T+ Q + IA +++V D ++ K K L + V +GGVS+N+ Sbjct: 271 LGKTEIQELPEREIRSIAYQVQESVFDHIINKLKHVLKSQPEKFKNVREFVCSGGVSSNQ 330 Query: 274 TLRAKLAEMMKKRRGEVF----YARPEFCTDNGAMIAYAGMVRFKA-GATADLGVSVRPR 328 LR KL + F Y + C+DN MI +AG+ +++ +DL + + Sbjct: 331 RLRTKLETELGTLNSTSFFNFYYPPMDLCSDNSIMIGWAGIEIWESLRLVSDLDICPIRQ 390 Query: 329 WPLAEL 334 WPL +L Sbjct: 391 WPLNDL 396 >UniRef50_C4QZU9 Putative metalloprotease, similar to O-sialoglycoprotein metallopeptidase from P. haemolytica n=1 Tax=Pichia pastoris GS115 RepID=C4QZU9_PICPG Length = 373 Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 115/358 (32%), Positives = 185/358 (51%), Gaps = 27/358 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKG---LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +VL IE+SCD++ +++ D G ++ + + S + GGV+P A H + L Sbjct: 13 KVLAIESSCDDSCVSLIDRSAGAKPIVLDHVKSTLN-SVKAGGVIPTSAHLHHQKSIAGL 71 Query: 59 IQAALKESGLTAKDI-DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 ++ L++ ++ + + V T GPG+ G+L +G + L+ AW + VHHM GHLL Sbjct: 72 VKQVLQKHNISGVNCPELVCVTRGPGMPGSLSIGVDTAKGLSVAWGSQFLGVHHMLGHLL 131 Query: 118 APMLEDN--PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 P LE N P+FPF++LL SGGHT L+ + +E+L +ID AAG+A DK A+ +G+ Sbjct: 132 IPRLESNGEEPQFPFLSLLASGGHTMLVLSRSLLDHEILVNTIDIAAGDALDKCAREIGI 191 Query: 176 --DYPGGPL---LSKMAAQGTAG-RFVFPRPMTDRPG----LDFSF----SGLKTFAANT 221 + G L L+K + P+P+ ++P L FSF SG+K + Sbjct: 192 RGNMIGKELELFLNKNPQLSLKDIPWEMPQPLKNKPKRVDTLGFSFTPFISGVK-LSLER 250 Query: 222 IRDNGTDDQTRA----DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRA 277 +N D+ I A D ++D +++ K + K V +GGV AN+ LR Sbjct: 251 YHNNEVKDELMPAMGFRIQEAIFDHIIDRVLVAYKVRPELNQIKTFVGSGGVVANQRLRV 310 Query: 278 KLAEMMKKRRGEVF-YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 KL +K E F + P CTDN MI +AG+ ++ G T++L V+ +W + L Sbjct: 311 KLQAALKSHGVENFHFPPPALCTDNAIMIGWAGIELYENGVTSELDVTPLRKWSVEGL 368 >UniRef50_B5Y892 O-sialoglycoprotein endopeptidase n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y892_COPPD Length = 316 Score = 167 bits (422), Expect = 6e-40, Method: Compositional matrix adjust. Identities = 111/311 (35%), Positives = 175/311 (56%), Gaps = 21/311 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L IETSCDET +A + +K ++ ++++SQ+ LH +GGV+PE A+R H+ L++ Sbjct: 4 RILAIETSCDETAVACLNGDK-VVQSKVFSQIDLHEAFGGVLPEAAARRHLEVLPVLLKD 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 K D +A TAGPGL+ ALL G +V L+ W VP + ++H+ H+ A L Sbjct: 63 VAKP--------DLIAVTAGPGLLPALLTGVSVALGLSRGWQVPVMGINHVVAHVAAAAL 114 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E E P + L+VSGGHT + ++LG + DDAAGE DK + LG+ YP G Sbjct: 115 ERRI-ELPVLGLVVSGGHTSFYLIEKWSDPKVLGWTYDDAAGECLDKVGRALGMKYPAGA 173 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 + +A R P P+ + +FSFSGLKT AA + +++ +A + + Sbjct: 174 EIDNLALT-IKERVTMPLPLKNEDSFNFSFSGLKT-AALKYKGKISNEV----LAASLME 227 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 AVV+ L+ + ++ L + + LV+ GGVSA++ LR ++ E +R V + + TDN Sbjct: 228 AVVNHLLDRIEKVLKKYPYP-LVVGGGVSASKFLRQRMHEHFGER---VIFPSAQLSTDN 283 Query: 302 GAMIA-YAGMV 311 M+A YA ++ Sbjct: 284 ADMVAVYAALL 294 >UniRef50_D2LQ34 Metalloendopeptidase, glycoprotease family n=1 Tax=Aciduliprofundum boonei T469 RepID=D2LQ34_9EURY Length = 530 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 111/325 (34%), Positives = 177/325 (54%), Gaps = 22/325 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLAN--QLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 M VLGIE + G+ I EK +LAN +Y + GG+ P A+ HV+ L Sbjct: 1 MLVLGIEGTAHTVGVGIVT-EKEVLANVSHMYRPPE-----GGIHPREAANHHVQYLPKL 54 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + A + + + +++D ++++ GPGL L AT R L+ ++P + V+H HL Sbjct: 55 LNEAFRIANVKPEELDGISFSQGPGLGPCLRTVATAARVLSVKLNIPIVGVNHCIAHLEI 114 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 E P V L VSGG+TQ+IS G+Y + GE++D G DK A+ +G+ +P Sbjct: 115 GRFSTGA-EDP-VMLYVSGGNTQIISFAS-GRYRVFGETLDIGVGNMLDKLAREMGIPFP 171 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP + K+A +G P P + + G+D +FSG+ T A N + +++++ DIA + Sbjct: 172 GGPRIEKLALEGKK---YIPLPYSIK-GMDMAFSGILTAAINKL-----NNESKEDIAYS 222 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP-EF 297 ++ V L+ +RAL +++AGGV+ N+ L+ L E+M + RG FY P + Sbjct: 223 VQETVFAMLVEATERALTHLRKDEVLLAGGVARNKRLQEML-EIMAEERGARFYVPPADL 281 Query: 298 CTDNGAMIAYAGMVRFKAGATADLG 322 C DNGAMIAY G++ K G ++G Sbjct: 282 CVDNGAMIAYLGLLFLKNGKRMEIG 306 >UniRef50_UPI000180B634 PREDICTED: similar to Probable O-sialoglycoprotein endopeptidase 2 (O-sialoglycoprotein endopeptidase-like protein 1) n=1 Tax=Ciona intestinalis RepID=UPI000180B634 Length = 386 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 116/342 (33%), Positives = 170/342 (49%), Gaps = 24/342 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP-LIQA 61 VLGIE++ D+TG AI D + + + +Q K H GGV P +A H R +P +++A Sbjct: 20 VLGIESTFDDTGAAIVDCDATIHGEAIATQTKAHVKAGGVDPRIAELLH-RDNLPRVVEA 78 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+++G+ +D+DAVA PG L G + + + IPVHHME HLL + Sbjct: 79 VLQQAGIRYQDLDAVATATRPGNPFCLKRGLEFTKMIVERHSLRFIPVHHMEAHLLTARM 138 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--GLDYP- 178 +N FPF+ LL +GGH + +G +++LGE+ID+ G FDK A+ L LD P Sbjct: 139 -NNEVNFPFLGLLATGGHCIITITHDLGNHQILGEAIDEPPGAVFDKVARALQVKLDRPD 197 Query: 179 ------GGPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRDNGTDDQT 231 G + ++A +G + P+ P LDFSFSGL+T I D Sbjct: 198 THERLWNGGDVERLACEGDRSKVKLTTPLRQSPRVLDFSFSGLQTQTLRVI-DQPEPGVK 256 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFK------RLVMAGGVSANRTLRAKLAEMMKK 285 ADIA +F+ + ++ + RA+ + K LV+AGGV N LR L+ + Sbjct: 257 YADIAASFQHTMTQHILSRVHRAILMSRDKLNQESPTLVVAGGVVCNSYLRNALSRLCDI 316 Query: 286 RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 + C DNG MIA+ GM K G G+S P Sbjct: 317 TNITIVCPPLPLCVDNGVMIAWTGMEYLKRGK----GISPHP 354 >UniRef50_Q93170 Protein C01G10.10, confirmed by transcript evidence n=3 Tax=Caenorhabditis RepID=Q93170_CAEEL Length = 421 Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 103/327 (31%), Positives = 177/327 (54%), Gaps = 17/327 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP-LI 59 ++VLGIETSCD+T +AI ++++ +L+++ Y++ + GG+ P + + H R+ +P LI Sbjct: 23 VKVLGIETSCDDTAVAIVNEKREILSSERYTERAIQRQQGGINPSVCALQH-RENLPRLI 81 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + L ++G + KD+DAVA T PGLV AL G + A +P IPVHHM H L+ Sbjct: 82 EKCLNDAGTSPKDLDAVAVTVTPGLVIALKEGISAAIGFAKKHRLPLIPVHHMRAHALSI 141 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL---LGLD 176 +L D+ FPF A+L+SGGH + + +++L G+S+ + GE DK A+ LG + Sbjct: 142 LLVDDSVRFPFSAVLLSGGHALISVAEDVEKFKLYGQSVSGSPGECIDKVARQLGDLGSE 201 Query: 177 YPG---GPLLSKMAAQGTA-GRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 + G G + +A++ +A G +P + + P + +F +K N + + +T Sbjct: 202 FDGIHVGAAVEILASRASADGHLRYPIFLPNVPKANMNFDQIKGSYLNLLERLRKNSETS 261 Query: 233 ADI---ARAFEDAVV----DTLMIKCKRALDQTGF-KRLVMAGGVSANRTLRAKLAEMMK 284 DI + ++ V L I + +Q K+LV+ GGV+AN+ + ++++ Sbjct: 262 IDIPDFCASLQNTVARHISSKLHIFFESLSEQEKLPKQLVIGGGVAANQYIFGAISKLSA 321 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMV 311 CTDN MIAY+G++ Sbjct: 322 AHNVTTIKVLLSLCTDNAEMIAYSGLL 348 >UniRef50_A5UMH5 Putative O-sialoglycoprotein endopeptidase n=5 Tax=Methanobacteriaceae RepID=GCP_METS3 Length = 538 Score = 162 bits (411), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 114/342 (33%), Positives = 179/342 (52%), Gaps = 32/342 (9%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIE + ++TG+ I D + +LA + +L + GG+ P +A+ H LI A+ Sbjct: 7 LGIEGTAEKTGVGIVDSDGNILA---MAGEQLFPEKGGIHPRIAAEHHGYWIPKLIPKAI 63 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 E+G++ D+D ++++ GPGL AL + AT R+LA + + P I V+H GH+ L+ Sbjct: 64 DEAGISYDDLDLISFSQGPGLGPALRIVATSARTLALSLNKPIIGVNHCIGHVEVGKLDT 123 Query: 124 ---NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 NP V L VSGG++Q+IS G+Y + GE++D AAG D + GL +PGG Sbjct: 124 GAVNP-----VTLYVSGGNSQVISHES-GRYRIFGETLDIAAGNCLDHFGRETGLGHPGG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRP----GLDFSFSGLKTFAANTIRDNGTDDQTRADIA 236 P++ K+A +G+ D P G+DFSFSGL + A ++ GT + D+ Sbjct: 178 PVIEKLAKKGS---------YVDLPYVVKGMDFSFSGLLSAALREVK-KGTPIE---DVC 224 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 + ++ L+ +RAL T +++ GGVSAN LR L M ++ + + Sbjct: 225 FSLQETAFSMLVEVTERALSHTQKDEVMLCGGVSANSRLREMLKVMAEEHGAKFCMPEMK 284 Query: 297 FCTDNGAMIAYAGMV---RFKAGATADLGVSVRPRWPLAELP 335 C DNG MIA+ G++ +F D G+ R R E P Sbjct: 285 LCGDNGVMIAWLGLIMHNQFGPLDIKDTGIIQRFRTDEVEAP 326 >UniRef50_B7XIP4 O-sialoglycoprotein endopeptidase n=2 Tax=Eukaryota RepID=B7XIP4_ENTBH Length = 360 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 107/329 (32%), Positives = 172/329 (52%), Gaps = 26/329 (7%) Query: 3 VLGIETSCDETGIAIY---DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 VLGIE+S ++ G+ I ++ LLAN+ + A GV+P A++ H + LI Sbjct: 13 VLGIESSANKIGVGILKIMNENVELLANE--RKTYTPAPGAGVIPIDAAKHHRDVILELI 70 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 +L++S L +DID AYT GPG+ L+VG V R+LA + P +PV+H H+ Sbjct: 71 DVSLQKSNLVIQDIDLYAYTKGPGMYQLLVVGCVVARTLALYHNKPLVPVNHCVAHIEMG 130 Query: 120 ML---EDNPPEFPFVALLVSGGHTQLIS-VTG-IGQYELLGESIDDAAGEAFDKTAKLLG 174 NP + L SGG+TQ+I+ ++G +Y++ GE+ID A G FDK A+ LG Sbjct: 131 RFITGAKNP-----IVLYASGGNTQIINRISGKTNKYKIFGETIDVAVGNCFDKVARALG 185 Query: 175 LDYPGGP--LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LD P + + A +++ P P T + G+D SFSG+ + I+D + + + Sbjct: 186 LDNAPSPGFNIERQAELNHEKKYI-PLPYTIK-GMDMSFSGILSTCLKLIKDFKSTNPSS 243 Query: 233 A-------DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK 285 A +I + ++ + L+ +R +++ GGV N L+ + +M+ + Sbjct: 244 AQFKKFISEICFSLQETMFSILVEATERCCSFVESNEVLIVGGVGCNLRLQEMIHKMITQ 303 Query: 286 RRGEVFYARPEFCTDNGAMIAYAGMVRFK 314 R G V+ +C DNGAMIAY G + FK Sbjct: 304 RGGTVYSMNEAYCIDNGAMIAYTGYLIFK 332 >UniRef50_UPI0001C42124 glycoprotease M22 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C42124 Length = 565 Score = 160 bits (406), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 111/323 (34%), Positives = 171/323 (52%), Gaps = 35/323 (10%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLA---NQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 M LGIE + ++TGI I D + +LA QLY +V GG+ P A+ H + Sbjct: 1 MISLGIEGTAEKTGIGIVDSDGNVLAMAGKQLYPEV------GGIHPREAAEHHAKWIPQ 54 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 LI A++E+GL KDID ++++ GPGL AL + A+ RSLA + +P + V+H GH+ Sbjct: 55 LIPQAMEEAGLDYKDIDLISFSQGPGLGPALRIVASSARSLALSLGIPIVGVNHCIGHVE 114 Query: 118 APMLE---DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 L+ NP V L VSGG++Q+I+ G+Y + GE++D A G D + G Sbjct: 115 IGKLDTGAKNP-----VTLYVSGGNSQVIAYES-GRYRIFGETLDIAIGNCLDHFGRETG 168 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRP----GLDFSFSGLKTFAANTIRDNGTDDQ 230 L +PGGP++ K+A G+ D P G+DFSFSGL + A +NG + Sbjct: 169 LGHPGGPVVEKLAKDGS---------YIDLPYVVKGMDFSFSGLLSSALRA-HENG---E 215 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 DI + ++ L+ +RAL T +++ GGVSAN LR + M ++ + Sbjct: 216 RIEDICFSLQETAFAMLVEVTERALAHTEKDEVLLCGGVSANSRLRDMMKIMAEEHYAKF 275 Query: 291 FYARPEFCTDNGAMIAYAGMVRF 313 + ++ DNG MIA+ G + + Sbjct: 276 YMPEMKYSGDNGVMIAWLGQLMY 298 >UniRef50_D2VC41 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VC41_NAEGR Length = 415 Score = 159 bits (403), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 120/402 (29%), Positives = 176/402 (43%), Gaps = 91/402 (22%) Query: 24 LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL-KESGLTA---KDIDAVAYT 79 +L Q+ + +L YGGV P + H +I+ AL K S L + + +D VA T Sbjct: 6 ILHEQVITHHELVNQYGGVHPTEMAHMHRATLDGMIENALEKVSNLDSNRERVVDYVAVT 65 Query: 80 AGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGH 139 GPGL L G +VP IPVHH+E HLL P++ FP++ LL SGGH Sbjct: 66 VGPGLPPCLSAGLDTAMKYCEKLNVPVIPVHHLEAHLLVPLMFSENTNFPYLVLLASGGH 125 Query: 140 TQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG------------------LDYPGGP 181 ++ GIGQYE++G + DD+ GEAFDKTA+LL +Y GG Sbjct: 126 CLVVFSRGIGQYEIVGGTEDDSIGEAFDKTARLLQESIDFNLNDYVNEKFGTRENYSGGA 185 Query: 182 LLSKMAAQGTAGRFVFPRPM---TDRPGLDFSFSGLKT--------------------FA 218 L+ K+A G + + FP P+ R + FSFSG+KT Sbjct: 186 LVEKLALLGDSSSYNFPIPLRKGNRRNDITFSFSGIKTDVLRTVRKEQNQGISKRDLHHL 245 Query: 219 ANTIRDNG---------------TDDQTR---------------------ADIARAFEDA 242 N +R+NG TD +R +I+ +F+ Sbjct: 246 LNRLRNNGSVKNVQDINANLQASTDPYSREGSQSLSTIELKNEKLSEEVVCNISASFQKC 305 Query: 243 VVDTLMIKCKRALDQTGF------KRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 L+ K + A+ + L+++GGVSAN+ R +L ++ K ++ A + Sbjct: 306 AFTHLIDKLEMAMHRYRANVDEYPNSLIVSGGVSANQYFRHELTKLSDKYEYDLKVAPMK 365 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVR----PRWPLAEL 334 +CTDN MI YA R + V + PRWP+ L Sbjct: 366 YCTDNAVMIGYAAFQRLFNECHKPVEVCDKERYIPRWPITTL 407 >UniRef50_UPI0000DB7930 PREDICTED: similar to O-sialoglycoprotein endopeptidase-like 1 n=1 Tax=Apis mellifera RepID=UPI0000DB7930 Length = 385 Score = 159 bits (402), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 105/328 (32%), Positives = 165/328 (50%), Gaps = 39/328 (11%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCD+T I D +L + SQ H ++GG++P A HV + A Sbjct: 31 ILGIESSCDDTAFGIVDSNGNILGESINSQYLTHLNFGGIIPTFARSLHVNNITKTCEDA 90 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ + L +DIDA+A T G+ LA P IP+HHME H L + Sbjct: 91 LRAANLRIRDIDAIA--------------TTFGKYLAKIGGKPFIPIHHMEAHALTARI- 135 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYP--- 178 + +FP++ALL+SGGH L V + ++ LLG S+ + G+ F+K A+ L L + P Sbjct: 136 NKKIDFPYLALLISGGHCLLAIVENVNKFYLLGTSLSNTPGDVFNKVARRLKLRNIPEFS 195 Query: 179 ---GGPLLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 GG + A++ + +F+FP M +FSFSGL F + I + + Sbjct: 196 TLNGGQAIELAASKASNVNQFLFPLIMMQFRNCNFSFSGLLNFFGDMIIPD------VYN 249 Query: 235 IARAFEDAVVDTLMIKCKRALD----QTGF----KRLVMAGGVSANRTLRAKLAEMMKKR 286 AF+ A+ + + +RA++ + F + LV++GGV+ N L AK ++ Sbjct: 250 FCAAFQLALTTHICQRTQRAMEFINKMSLFPENKQTLVISGGVACNNFL-AKALNIVSTE 308 Query: 287 RGEVFYARP-EFCTDNGAMIAYAGMVRF 313 G F P + CTDNG MIA+ G+ ++ Sbjct: 309 LGYTFVRTPSKLCTDNGIMIAWNGVEKW 336 >UniRef50_Q9NPF4 Probable O-sialoglycoprotein endopeptidase n=81 Tax=Eukaryota RepID=OSGEP_HUMAN Length = 335 Score = 159 bits (402), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 117/332 (35%), Positives = 178/332 (53%), Gaps = 16/332 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG E S ++ G+ + D K +LAN + V G +P +R H + L+Q A Sbjct: 4 VLGFEGSANKIGVGVVRDGK-VLANPRRTYVTPPGT--GFLPGDTARHHRAVILDLLQEA 60 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ESGLT++DID +AYT GPG+ L+ A V R++A W+ P + V+H GH+ L Sbjct: 61 LTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNKPLVGVNHCIGHIEMGRLI 120 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYPGG 180 P V L VSGG+TQ+I+ + +Y + GE+ID A G D+ A++L + D G Sbjct: 121 TGATS-PTV-LYVSGGNTQVIAYSE-HRYRIFGETIDIAVGNCLDRFARVLKISNDPSPG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI-RDNGTDDQTRADIARAF 239 + +MA + G+ + P T + G+D SFSG+ +F + R T + T D+ + Sbjct: 178 YNIEQMAKR---GKKLVELPYTVK-GMDVSFSGILSFIEDVAHRMLATGECTPEDLCFSL 233 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ++ V L+ +RA+ G + ++ GGV N L+ +A M ++R +F FC Sbjct: 234 QETVFAMLVEITERAMAHCGSQEALIVGGVGCNVRLQEMMATMCQERGARLFATDERFCI 293 Query: 300 DNGAMIAYAGMVRFKAGAT---ADLGVSVRPR 328 DNGAMIA AG F+AG +D GV+ R R Sbjct: 294 DNGAMIAQAGWEMFRAGHRTPLSDSGVTQRYR 325 >UniRef50_Q4U8J6 Glycoprotease, putative n=2 Tax=Theileria RepID=Q4U8J6_THEAN Length = 630 Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 96/334 (28%), Positives = 175/334 (52%), Gaps = 31/334 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IETS D+T IA+ + +L+++ SQ ++ +YGG+ P A +H++K L Sbjct: 98 ILSIETSFDDTCIAVVRSDGKILSDKKLSQEEVVKEYGGIKPVCAKLEHIKKIESLTDKV 157 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ESGL +DID +A T GPG L VG + L+ + +P + +H+ GH L+P+++ Sbjct: 158 IEESGLKIQDIDEIAVTRGPGTELCLRVGYNYAKELSEKYKIPLVSENHIAGHCLSPLID 217 Query: 123 D--------------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDK 168 + N +FP++ LL+SGGH+Q+ V ++ L+ E+ D+ G DK Sbjct: 218 EHQFKYTVEGTPIKSNDLKFPYLCLLLSGGHSQIYLVENPSKFHLMCETQDEFVGNVLDK 277 Query: 169 TAKLLGLDYP--GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT----FAANTI 222 AKLLGLD GG L K+A + + ++ P ++F FSG+++ + Sbjct: 278 CAKLLGLDLSKGGGAELEKIADEVSDSKYKLTIPNKYNHYMEFCFSGVQSQLGLKTEQLV 337 Query: 223 RDNGTDDQTR------ADIARAFEDAVVDTLMIKCKRALD--QTGF--KRLVMAGGVSAN 272 + + +D R +++A + V + ++I+ + +L+ +T F +L + GGV++N Sbjct: 338 KSHNVEDAKRLPRKILSELAYGLQSTVFEGILIQLEMSLNAVETLFPINQLALVGGVASN 397 Query: 273 RTLRAKLAEMMKKRRGEVFYARPE-FCTDNGAMI 305 L+ + ++ R V ++ E F T M+ Sbjct: 398 DKLKKMILDLFYLRDESVRFSEQEMFLTRTKNMV 431 >UniRef50_Q74M58 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Nanoarchaeum equitans RepID=GCP_NANEQ Length = 314 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 104/332 (31%), Positives = 173/332 (52%), Gaps = 25/332 (7%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIE + G+ I+D EKG+LAN+ K+ G+ P A+ H+++ ++ Sbjct: 1 MKVLGIECTAHTFGVGIFDSEKGVLANE-----KVTYKGYGIHPREAAELHLKEFDKVLL 55 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH--LLA 118 AL+++ ++ KDID +A ++GPGL+ L +G + L + P I V+H+ H Sbjct: 56 KALEKANISLKDIDLIAVSSGPGLLPTLKLGNYIAVYLGKKLNKPVIGVNHIVAHNEFAR 115 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 + + P F +V SG +TQ +++ + L+GE++D G DK A+ LGL++P Sbjct: 116 YLAKAKDPLFVYV----SGANTQFLAIVN-NSWFLVGETLDMGVGNLIDKVARDLGLEFP 170 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP + ++A +G + + P T + GL+ G+ T+ D ++ DIA + Sbjct: 171 GGPKIEELAKKG---KNLIELPYTIK-GLNLQLGGIYTYIKRI-----KDQYSKEDIAYS 221 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP-EF 297 ++ V ++ +RA+ K L++ GGV+ N L +AE M K FY P ++ Sbjct: 222 LQEWVFALILEIAERAMHMLDKKELILTGGVACNNRLN-DMAEQMAKENNFKFYRLPCQY 280 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 TDNGAMIAY G + G + RP W Sbjct: 281 LTDNGAMIAYLGYYWYSQGIYYE--PKPRPYW 310 >UniRef50_Q6L243 Putative O-sialoglycoprotein endopeptidase n=3 Tax=Thermoplasmatales RepID=GCP_PICTO Length = 529 Score = 157 bits (398), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 108/340 (31%), Positives = 183/340 (53%), Gaps = 22/340 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLG+E + I D EK +L+N + V H GG+ P A+ H K +I+ Sbjct: 1 MIVLGLEGTAHTISAGIVD-EKSILSNVSSTYVPEH---GGIHPREAAVHHADKIYDVIK 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL-LAP 119 + +GL +D+D +A++ GPGL L V +T R+L+ + P + V+H GH+ + Sbjct: 57 RSFDNAGLKPEDLDLIAFSMGPGLGPCLRVVSTAARALSIKYSKPLLGVNHPLGHVEIGR 116 Query: 120 MLE--DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 L +P + L +SGG+TQ+I+ G+Y +LGE++D G DK A+ LG+ + Sbjct: 117 KLSGARDP-----IMLYISGGNTQVIAHLN-GRYRVLGETMDIGLGNMLDKFARDLGIPF 170 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 PGGP++ +MA G + + P + + G+D SFSG+ T A + + + + DI Sbjct: 171 PGGPVIERMALDG---KDLLELPYSVK-GMDTSFSGIYTAAKRYL----SLGKNKNDICY 222 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + ++ ++ +RA+ T +++AGGV+ N LR+ + +M + + + E+ Sbjct: 223 SLQETSFSMVVEVLERAMYYTNKNEILLAGGVARNDRLRSMVNDMARDSGYKAYLTDKEY 282 Query: 298 CTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAELPA 336 C DNGAMIA AGM+ + GA D + + R+ + E+PA Sbjct: 283 CMDNGAMIAQAGMLMYMHGARQDIMETRINQRFRIDEVPA 322 >UniRef50_UPI0000E8089C PREDICTED: similar to Osgepl1 protein n=1 Tax=Gallus gallus RepID=UPI0000E8089C Length = 513 Score = 155 bits (393), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 89/236 (37%), Positives = 127/236 (53%), Gaps = 8/236 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TG A+ D+ +L L SQ ++H GG++P +A + H +++ A Sbjct: 111 VLGIETSCDDTGAAVLDEAGTVLGEALQSQKEVHLKAGGIIPHVAQQLHRESIQQVVKEA 170 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L SG++ ++ A+A T PGL +L VG L + P IP+HHME H L L Sbjct: 171 LSASGVSVNELAAIATTVKPGLALSLEVGLQYSLQLVDRYQKPFIPIHHMEAHALTIRLT 230 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYP--- 178 + EFPF+ LL+SGGH L G+ + LLG+SID A G+ DK A+ L L +P Sbjct: 231 EQ-VEFPFLVLLLSGGHCILAVARGVSDFLLLGQSIDIAPGDMLDKVARRLSLVKHPECH 289 Query: 179 ---GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 GG + +A G ++ F PM DFSFSGL++ I ++ T Sbjct: 290 GMAGGKAIEHLAQTGDWQQYTFRLPMQQYRNCDFSFSGLQSLVNKAILQKEKEEGT 345 >UniRef50_A2QMR2 Function: O-sialoglycoprotein endopeptidase is a neutral metalloprotease n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QMR2_ASPNC Length = 430 Score = 153 bits (387), Expect = 7e-36, Method: Compositional matrix adjust. Identities = 126/381 (33%), Positives = 173/381 (45%), Gaps = 58/381 (15%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVPLI 59 L IETSCD+T +AI + E A Q++ K+ D Y G+ P +A H L Sbjct: 32 TLAIETSCDDTSVAIVEKESN--AVQIHFLDKVTCDTSAYQGIHPVVALESHQENIASLQ 89 Query: 60 QA--ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 Q +S L K D V T GPG L VG G++L+ AW VP + VHHM+ HLL Sbjct: 90 QTINVSSDSQLRRKP-DFVCSTRGPGFRSNLFVGLDTGKALSVAWQVPFVGVHHMQAHLL 148 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK------ 171 P L P EFPF+++L+SGGHT L+ + I +E++ ++D A GEA DK A+ Sbjct: 149 TPRLPITP-EFPFLSILISGGHTMLVKSSSITDHEIMASTVDRALGEALDKAAREIIPPF 207 Query: 172 LLGLDYPG--GPLLSKMA----------AQGTAGR------------FVFPRPMTDRPGL 207 LL G LL + A Q R + F P L Sbjct: 208 LLQTSKSTMYGKLLEEFAFPNGKADYADYQAPKSRHDELIPRENPWGWSFTEPWAHSRQL 267 Query: 208 DFSFSGL-----KTFAANTIRDNGTDDQTRADIAR-----AFEDAVVDTLMIKCKRALDQ 257 +SF + + F+A + R +AR +FE T+M +L + Sbjct: 268 QYSFCFIGSTLARIFSAREAAGQTISHEERIALAREAMRTSFEHLASRTIM--ALESLAK 325 Query: 258 TG----FKRLVMAGGVSANRTLRAKLAEMMKKR-RGEV--FYARPEFCTDNGAMIAYAGM 310 G K LV++GGV+AN+ L L + R G V P CTDN AMIA+AGM Sbjct: 326 QGPEKEVKTLVVSGGVAANQYLMTVLRSWLDARGFGHVGLVAPPPYLCTDNAAMIAWAGM 385 Query: 311 VRFKAGATADLGVSVRPRWPL 331 F+AG +L +W L Sbjct: 386 EMFEAGWRTNLTSRAIRKWSL 406 >UniRef50_B6GZQ3 Pc12g05880 protein n=9 Tax=Trichocomaceae RepID=B6GZQ3_PENCW Length = 457 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 125/409 (30%), Positives = 181/409 (44%), Gaps = 85/409 (20%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLH------AD---YGGVVPELASRDHVR 53 L IETSCD+T +AI + K + S K+H AD + G+ P +A H Sbjct: 36 TLAIETSCDDTSVAIVEKTK----KESGSAAKIHFLENVTADTRAHRGIHPIIALESHQD 91 Query: 54 KTVPLIQAALK-------ESGLTAKD------IDAVAYTAGPGLVGALLVGATVGRSLAF 100 L+Q AL GL D D ++ T GPG+ L VG G++L+ Sbjct: 92 NLATLVQKALNYLPESKTSDGLKLADGTRRRLPDFISATRGPGMRSNLSVGLDTGKALSV 151 Query: 101 AWDVPAIPVHHMEGHLLAP----MLEDNP--------PEFPFVALLVSGGHTQLISVTGI 148 AW +P + VHHM+ HLL P LE+ PEFPF+++LVSGGHT L+ GI Sbjct: 152 AWQIPMVGVHHMQAHLLTPGLVTCLENASKAGPPAIAPEFPFLSILVSGGHTTLVQSKGI 211 Query: 149 GQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPR--------- 199 +++L S D A GEA DK+A+ + D S M + +FVFP Sbjct: 212 TDHKILATSEDIAIGEALDKSARDILPDSLLQEAKSTMYGKNLE-QFVFPNGKADFADYS 270 Query: 200 ----------------------PMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT----RA 233 P + + FSFS + + ++ +GT+ + R Sbjct: 271 PPDTRGQEITKRVSDWGWSLTTPFANTRMMQFSFSSISSMVGKIVQRSGTNIKMSHAERV 330 Query: 234 DIARAFEDAVVDTLMIKCKRALD-----QTG---FKRLVMAGGVSANRTLRAKLAEMMKK 285 D+ R + L + AL+ TG K LV++GGV+AN+ L L ++ Sbjct: 331 DLGREAMRVCFEHLASRTVIALETLRPHNTGKDEIKTLVVSGGVAANQFLMKVLTSFLEV 390 Query: 286 R---RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 R + P CTDN AMI +AG+ F+AG +DL +W L Sbjct: 391 RGFGNINIVAPPPYLCTDNAAMIGWAGIEMFEAGFRSDLSCRPLRKWTL 439 >UniRef50_C5KYH6 Glycoprotein endopeptidase, putative n=4 Tax=Perkinsus marinus ATCC 50983 RepID=C5KYH6_9ALVE Length = 298 Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 90/230 (39%), Positives = 131/230 (56%), Gaps = 28/230 (12%) Query: 126 PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYPGGPLLS 184 PEFPFV LLVSGGH + G+G + +LG ++DD+ GE FDK A+LL + D PGGP+L Sbjct: 27 PEFPFVTLLVSGGHNMAVLTRGMGDHIILGSTLDDSVGECFDKVARLLDIHDVPGGPVLE 86 Query: 185 KMAAQGT--------AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIA 236 K+A++G A R + + G DFSF+GLKT + I ++ D+A Sbjct: 87 KLASEGNPRACLRELAKPLAKTRDLELKNGCDFSFAGLKTSMRHLIEGG---KYSKPDMA 143 Query: 237 RAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 +F+ VD L+ + RA+D K LV+AGGV+AN+++R+ + E+ K+ +G + Sbjct: 144 ASFQKRCVDHLVERAGRAIDWALEIDGSIKDLVVAGGVAANKSVRSNMQELAKE-KGLML 202 Query: 292 YARP-EFCTDNGAMIAYAGMVRFKAG---------ATADLGVSVRPRWPL 331 Y P CTDNG M+A+ + K G +A+ V VRPRWPL Sbjct: 203 YCPPTRLCTDNGTMVAWNAIEHLKEGLYERAPCTAESAEKFVEVRPRWPL 252 >UniRef50_UPI000023E24C hypothetical protein FG06887.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023E24C Length = 1434 Score = 151 bits (382), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 124/404 (30%), Positives = 180/404 (44%), Gaps = 75/404 (18%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVP 57 + L IETSCD+TG+A+ + +L ++ +D + G+ P +A++ H P Sbjct: 1015 LTTLAIETSCDDTGVAVLRHTS--QSTELLFNERISSDNRAFKGIHPIVAAKGHSVSLAP 1072 Query: 58 LIQAALKE---------------SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAW 102 L++ AL SG+ + D V+ T GPG+ L +G + + LA AW Sbjct: 1073 LVRRALNALPAAEDGDNKRICYASGVRKQVPDFVSVTRGPGMRSNLGIGLDMAKGLAVAW 1132 Query: 103 DVPAIPVHHMEGHLLAPML-------------EDNPPEFPFVALLVSGGHTQLISVTGIG 149 DVP + VHHM+ H L P L PEFPF++LLVSGGHTQL+ TG+ Sbjct: 1133 DVPLVGVHHMQAHALTPRLARALGMSMGEAEESRKGPEFPFLSLLVSGGHTQLVHSTGLT 1192 Query: 150 QYELLGESIDDAAGEAFDKTAK-------------------LLGLDYPGG---------- 180 + ++ S D A G D+TA+ L +P G Sbjct: 1193 DHSIIATSGDIAIGNLLDQTARDILPSEVFDASEHVMYGRLLEAFAFPTGADTTSAYEAV 1252 Query: 181 --PLLSK---MAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAAN--TIRDNGTDDQTRA 233 P S+ M T + P P L FSFS + T + T R + + + RA Sbjct: 1253 FTPPASRSEEMTPVSTGYDWNIPTPFRQSRKLAFSFSSIYTHVHDIATARPSMSTSERRA 1312 Query: 234 DIARAFEDAVVD---TLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR--RG 288 A V L I + K LVMAGGV++N+ L L M+ R G Sbjct: 1313 LAQHTMMAAFVHLAGRLCIALDDKPELQAAKTLVMAGGVASNKFLMHVLRSMLAIRGYEG 1372 Query: 289 EVFYARP-EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 A P E CTDN AMIA+ G+ F+AG ++L ++ +WP+ Sbjct: 1373 IEIVAPPVELCTDNAAMIAWTGIEMFQAGYESELSITGIGKWPM 1416 >UniRef50_Q46FS9 Putative O-sialoglycoprotein endopeptidase n=17 Tax=root RepID=GCP_METBF Length = 545 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 99/295 (33%), Positives = 154/295 (52%), Gaps = 18/295 (6%) Query: 40 GGVVPELASRDHVRKTVPLIQAAL---KESGLTAKDIDAVAYTAGPGLVGALLVGATVGR 96 GG+ P A++ H + +I+ L KE G+ DID +A++ GPGL L AT R Sbjct: 39 GGIHPREAAQHHAKYAASVIKRLLAEAKEKGVKPSDIDGIAFSQGPGLGPCLRTVATAAR 98 Query: 97 SLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGE 156 L+ + +P I V+H H+ + P V L VSG ++Q+IS G G+Y + GE Sbjct: 99 MLSISLGIPLIGVNHCIAHIEIGIW--RTPAMDPVVLYVSGANSQVISYMG-GRYRVFGE 155 Query: 157 SIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFV-FPRPMTDRPGLDFSFSGLK 215 ++D G A DK A+ L +PGGP + A T +++ P + G+D SFSGL Sbjct: 156 TLDIGLGNALDKFARGANLPHPGGPKIEAYAKNAT--KYIHLPYVIK---GMDLSFSGLS 210 Query: 216 TFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTL 275 T A+ ++ +D + ++++ ++ +RAL TG K +++AGGV AN L Sbjct: 211 TAASEALKKAPLED-----VCYSYQETAFAMVVEVAERALAHTGKKEVLLAGGVGANTRL 265 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS-VRPRW 329 R L +M + R + + F DNG MIAY G++ +K+G T L S V P + Sbjct: 266 REMLNDMCEARGAKFYVPEKRFMGDNGTMIAYTGLLMYKSGNTLSLEDSRVNPSY 320 >UniRef50_A5DGU9 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DGU9_PICGU Length = 408 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 112/376 (29%), Positives = 182/376 (48%), Gaps = 44/376 (11%) Query: 2 RVLGIETSCDETGIAIYD--DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 RVL IE+SCD+ IA+ D D K + +Q+ S + A GGV+P A H + Sbjct: 24 RVLAIESSCDDACIALLDRKDGKTTVIDQVKSTLNSVA-AGGVIPTEAHGFHQYQIASQA 82 Query: 60 QAALKESGLTAKDI-DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 ++ +++++ D + T GPG+VG+L G + L+ AWD P + VHHM GHL+ Sbjct: 83 SQFFQKHKISSQNSPDLICCTRGPGMVGSLSAGLQFAKGLSVAWDKPLVGVHHMLGHLMI 142 Query: 119 PMLEDN-----PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 L PP FPF++LL SGGHT L+ + ++++L ++D A G+A DK A+ L Sbjct: 143 ASLTSESQTNPPPRFPFLSLLCSGGHTMLVLSESLAKHQVLVNTVDIACGDALDKCARKL 202 Query: 174 GL--DYPGGPL--------------LSKMAAQGTAGRFVF--------PRPMTDRPGLDF 209 GL + G L +K+ F F P+ + + F Sbjct: 203 GLKGNMLGKELETFVNSFSKEELDEFTKIKTHTRDNPFNFQLKLPMRSPKHPRNAESVQF 262 Query: 210 SF-SGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQT-----GFKRL 263 SF S L T A + + +A + + + ++ + K A+D+ + Sbjct: 263 SFASFLSTLDAYSPPPGMEKSKVTKFLAFKVQQKIFEHIVDRIKLAVDKNETLFANVNDI 322 Query: 264 VMAGGVSANRTLRAKLAEMM--KKRRGEVFYARPE--FCTDNGAMIAYAGMVRFKA-GAT 318 V++GGV++N TLR L + + K +R + + PE CTDN MI AG+ ++ Sbjct: 323 VLSGGVASNSTLRRMLKDGLNDKMKRPNLNFHFPEIALCTDNAIMIGVAGIEIYENLNVV 382 Query: 319 ADLGVSVRPRWPLAEL 334 +DL ++ +WPL +L Sbjct: 383 SDLSITPIRKWPLDQL 398 >UniRef50_UPI0000F51796 O-sialoglycoprotein endopeptidase/protein kinase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0000F51796 Length = 531 Score = 148 bits (374), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 102/319 (31%), Positives = 174/319 (54%), Gaps = 22/319 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLG+E + I DD + +++N +S + + GG+ P A+ H +P+++ Sbjct: 1 MKVLGLEGTAHTISAGIVDDNR-IISN--FSSTYIPKN-GGIHPREAAIHHADNILPVMK 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL-LAP 119 A +ESGL+ I+ VA++ GPGL L V AT R+ + + +P I V+H GH+ + Sbjct: 57 KAFEESGLSPGQINLVAFSMGPGLGPCLRVVATAARAFSIKYGIPLIGVNHPLGHVEIGR 116 Query: 120 MLE--DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 L +P + L +SGG+TQ+I+ Y++LGE++D G DK A+ +G+ + Sbjct: 117 KLSGAKDP-----IMLYISGGNTQIIAHEE-NSYKVLGETMDIGLGNLLDKLARDVGIPF 170 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 PGGP + + A +G + P + + G+D SFSG+ T A N I ++ +I Sbjct: 171 PGGPKIEEFALKGDK---LLDLPYSVK-GMDTSFSGIYTAARNYI-----GRESIENICY 221 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + ++ L+ +RAL T + +++AGGV+ N LR+ ++ M K + ++ Sbjct: 222 SVQETTFSMLVEVLERALYYTDKREILLAGGVARNDRLRSMVSHMAKSSGYVAYLTDKKY 281 Query: 298 CTDNGAMIAYAGMVRFKAG 316 C DNGAMIA AGM+ + +G Sbjct: 282 CMDNGAMIAQAGMLMYLSG 300 >UniRef50_B7QJD9 O-sialoglycoprotein endopeptidase, putative n=3 Tax=Arthropoda RepID=B7QJD9_IXOSC Length = 309 Score = 145 bits (367), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 106/288 (36%), Positives = 148/288 (51%), Gaps = 30/288 (10%) Query: 73 IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVA 132 + A+A T PG+ +LLVG R LA P IP+HHME H LA L +FP++ Sbjct: 1 MSAIAVTVRPGMSLSLLVGLNFARRLAAKHGKPLIPIHHMEAHALAVRLVQRV-DFPYLV 59 Query: 133 LLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD-------YPGGPLLSK 185 LLVSGGH QL V I + LLG+++DDA GE FDK A+ L L GG L Sbjct: 60 LLVSGGHCQLAVVRDIDDFLLLGQTMDDAPGETFDKVARRLKLSNLPECRGLSGGRALEF 119 Query: 186 MAAQ--GTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI----RDNGTDDQTR----ADI 235 +A + G + FP P+T +FSFSGLK I +++G + AD+ Sbjct: 120 LAERDSGNPLAYRFPEPLTSYRTCNFSFSGLKNSVYRKIEALEKEHGLEADALLPEIADL 179 Query: 236 ARAFEDAVVDTLMIKCKRAL---DQTGF-----KRLVMAGGVSANRTLRAKLAEMMKKRR 287 + + AV L + +RAL DQ G LV+AGGV+AN L L+++ +K Sbjct: 180 CASTQHAVAYHLTRRTQRALAFCDQQGLLPEGKPTLVVAGGVAANAYLGRLLSQLCEKLD 239 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGA---TADL-GVSVRPRWPL 331 P+ C+DNG MIA+ G+ R++A + T + + PR PL Sbjct: 240 VAYVPTPPKLCSDNGLMIAWNGVERWRAASGIVTESFDSLDITPRCPL 287 >UniRef50_A4VEZ5 O-sialoglycoprotein endopeptidase n=1 Tax=Tetrahymena thermophila SB210 RepID=A4VEZ5_TETTH Length = 377 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 108/365 (29%), Positives = 176/365 (48%), Gaps = 53/365 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIE S ++ G+ I + +LAN + + G +P + H K + ++ Sbjct: 1 MIALGIEGSANKIGVGIVKSDGTILANPKTTFITPPGT--GFLPNETAVHHRSKILDIVD 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ LT KDI + YT GPG+ L +GA V R+L+ ++P I V+H GH+ Sbjct: 59 QALKEANLTFKDIGLICYTKGPGMGPPLSIGAIVSRTLSLLHNIPLIGVNHCIGHIEMGR 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYP 178 L P V L VSGG+TQ+I+ + +Y + GE++D A G D+ A+++ L D Sbjct: 119 LATGITH-PAV-LYVSGGNTQVIAYSN-QRYRIFGEALDIAVGNCLDRFARIINLSNDPA 175 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI---------------- 222 G + ++A QG +F+ P T + G+D SFSG+ ++ + + Sbjct: 176 PGYNIEQLAKQGK--QFI-QVPYTVK-GMDMSFSGILSYFEDIVAQNPHLQYEDGVVPEK 231 Query: 223 ------RDNGTD--------------------DQTRADIARAFEDAVVDTLMIKCKRALD 256 D+ D D TRAD+ + ++ + L +RA+ Sbjct: 232 DAKQQDEDDSLDNRKRKKNKKVVNKKILDLPKDITRADLCYSLQETIFAMLTEVTERAMA 291 Query: 257 QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 +++ GGV N L+ + +M+ +R G+V +C DNGAMIAYAG++ ++AG Sbjct: 292 HCNSNEVIIVGGVGCNVRLQEMIGQMVSERGGKVGAMDHRYCIDNGAMIAYAGILEYEAG 351 Query: 317 ATADL 321 D Sbjct: 352 GRMDF 356 >UniRef50_C4Y0N8 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y0N8_CLAL4 Length = 443 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 112/376 (29%), Positives = 179/376 (47%), Gaps = 45/376 (11%) Query: 2 RVLGIETSCDETGIAIYDDEKG---LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 RVL IE+SCD++ +++ + + +LA A GG++P A H + L Sbjct: 43 RVLAIESSCDDSCVSLLEKKSPNGPVLAIDEIKATLSSAKVGGIIPTAAHEFHSAQISQL 102 Query: 59 IQAALKESGLTAKDI-DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 + ++ +++ + D + T GPG+VG+L + LA AW P + VHHM GHLL Sbjct: 103 VGEFCRKHEISSSNPPDLLCVTRGPGMVGSLSASIQFAKGLAVAWQRPLVGVHHMLGHLL 162 Query: 118 APML----EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 P L P++PF++LL SGGHT L+ + +E++ ++ D AAG++ DK A+ L Sbjct: 163 TPNLTVEGSSCGPQYPFLSLLCSGGHTMLVLSKSLTNHEIIIDTSDIAAGDSLDKCAREL 222 Query: 174 GLD-YPGGPLLSKMAAQ---GTAGRFVFPRPMTDRPGLDFSFS----------------- 212 G + GP L K A T RF TD+ F Sbjct: 223 GFEGNMLGPELEKYVANIDPVTKERFAGINTNTDQNEFGFRLRMPMRTAKHKKIPDVIQF 282 Query: 213 GLKTFAANT------IRDNGTDDQTRADIARAFEDAVVDTLM--IKCKRALDQTGF---K 261 G +F ++ RD+ ++QTR +A ++ + D ++ I A D F + Sbjct: 283 GFASFLSSVEGFKMKSRDSW-NEQTRQFVAFKLQEVLFDHIINRINVAFAKDPQKFALVR 341 Query: 262 RLVMAGGVSANRTLRAKLAEMMKKRRGEVF-YARPEFCTDNGAMIAYAGMVRFKAGATAD 320 V +GGV+AN+ LRAKL ++ F + P+ CTDN MI AG+ F+ Sbjct: 342 DFVCSGGVAANKVLRAKLMHNIRSASTLKFHFPAPKLCTDNATMIGNAGIDIFE-NLRLK 400 Query: 321 LGVSVRP--RWPLAEL 334 +S+ P +WPL ++ Sbjct: 401 SRLSMLPIRKWPLHDI 416 >UniRef50_A3MSX6 Putative O-sialoglycoprotein endopeptidase n=2 Tax=Pyrobaculum RepID=GCP_PYRCJ Length = 339 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 99/275 (36%), Positives = 145/275 (52%), Gaps = 12/275 (4%) Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L + ++E ++ D++AVAY+AGPGL AL VGA R+LA VP +PVHH H+ Sbjct: 62 LFRKLIEEFNVSLGDVEAVAYSAGPGLGPALRVGAVFARALAIKLGVPLVPVHHGVAHVE 121 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 + P V LL+SGGHT + + G+Y + GE++D A G A D A+ +GL + Sbjct: 122 IARYATGSCD-PLV-LLISGGHTVVAGFSD-GRYRVFGETLDVAIGNAIDMFAREVGLGF 178 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 PG P + K A+ FP P+ G D S++GL T+A ++ + R Sbjct: 179 PGVPAVEK-CAEAAEELVAFPMPIV---GQDLSYAGLTTYALQLVKRG----IPLPVVCR 230 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + + L +RAL T + LV+AGGV+ +R LR L E+ ++ EV + E+ Sbjct: 231 SLVETAYYMLAEVTERALAFTKKRELVVAGGVARSRRLREILYEVGREHGAEVKFVPDEY 290 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVS-VRPRWPL 331 DNGAMIA G ++ G + G S VR RW L Sbjct: 291 AGDNGAMIALTGYYAYRRGIAVEPGESFVRQRWRL 325 >UniRef50_A2BJY9 Putative O-sialoglycoprotein endopeptidase n=22 Tax=Thermoprotei RepID=GCP_HYPBU Length = 363 Score = 139 bits (350), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 110/344 (31%), Positives = 169/344 (49%), Gaps = 21/344 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + VLGIE++ G+ I + + + H GG+ P A+ H R +I Sbjct: 30 IYVLGIESTAHTFGVGIASTKPPYILVSVRDT--YHPPKGGIHPREAASHHARVASEVIL 87 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ GL+ +DIDAVA GPGL AL VGAT+ R LA + P +PV+H H+ Sbjct: 88 DALRTVGLSIRDIDAVAVALGPGLGPALRVGATIARGLAAYYGKPLVPVNHAVAHIEIAR 147 Query: 121 LED---NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 L +P V L VSGG+T +++ +Y + GE++D A G D A+ G+ Sbjct: 148 LYTGLGDP-----VVLYVSGGNT-VVAAYAKARYRVFGETLDIALGNLLDTFARDAGIAP 201 Query: 178 P---GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 P G + A+ + P + G+D SFSGL T A G++D+ +A Sbjct: 202 PYIVSGLHIVDRCAEAASKPADLPYVVK---GMDVSFSGLLTAALRLWTKAGSEDE-KAA 257 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + + +++ +RAL T K +++ GGV+A+ LR K+ M + Sbjct: 258 VCLGLREVAYGSVVEVTERALAHTRKKSVMLTGGVAASPILRNKVRSMASYHGAVADWPP 317 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVS-VRPRWPL--AELP 335 P+ DNGAMIA+ G++ + AG T D+ S V+ RW L E+P Sbjct: 318 PQLAGDNGAMIAWTGLLNYLAGITVDVEESVVKQRWRLDVVEIP 361 >UniRef50_D2RYV2 Metalloendopeptidase, glycoprotease family n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RYV2_9EURY Length = 578 Score = 137 bits (344), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 114/363 (31%), Positives = 170/363 (46%), Gaps = 49/363 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP-LI 59 +R+LGIE + A+YD + D GG+ P A+ +H+ +P ++ Sbjct: 5 IRILGIEGTAWAASAAVYD---SATDDVFIESDAYQPDSGGIHPREAA-EHMHDAIPRVV 60 Query: 60 QAALKE---------------------SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSL 98 + AL+ SG A +DA+A++ GPGL L + T R+L Sbjct: 61 ETALEHARETHDGPAGEAPVDVDERSSSGQQAAPVDAIAFSRGPGLGPCLRIVGTAARAL 120 Query: 99 AFAWDVPAIPVHHMEGHLLAPMLE---DNPPEFPFVALLVSGGHTQLISVTGIGQYELLG 155 + A +VP + V+HM HL D+P V L SG + L++ G+Y +LG Sbjct: 121 SQALEVPLVGVNHMVAHLEIGRHTADFDSP-----VCLNASGANAHLLAYRN-GRYRVLG 174 Query: 156 ESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFV-FPRPMTDRPGLDFSFSGL 214 E++D G A DK + +G +PGGP K+ A G +V P + G+DFSFSG+ Sbjct: 175 ETMDTGVGNAIDKFTRHVGWSHPGGP---KVEAAAEDGEYVDLPYVVK---GMDFSFSGI 228 Query: 215 KTFAANTIRDNGTDDQTRA-DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANR 273 + A DD+T DI + ++ + L +RAL TG LV+ GGV N Sbjct: 229 MSAAKQAY-----DDETPVEDICFSLQENIFGMLTEVAERALSLTGSDELVLGGGVGQNE 283 Query: 274 TLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS-VRPRWPLA 332 LR LAEM +R E P F DN MIA G ++AG T ++ S V P + Sbjct: 284 RLREMLAEMCAQRGAEFHAPEPRFLRDNAGMIAVLGAKMYEAGDTLEIEDSQVDPNYRPD 343 Query: 333 ELP 335 ++P Sbjct: 344 QVP 346 >UniRef50_C7DHT9 Metalloendopeptidase, glycoprotease family n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHT9_9EURY Length = 324 Score = 136 bits (343), Expect = 9e-31, Method: Compositional matrix adjust. Identities = 105/340 (30%), Positives = 174/340 (51%), Gaps = 24/340 (7%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M V+GIE+S G+ I + K +LAN+ ++ G++P + H + +I+ Sbjct: 1 MAVIGIESSAHTFGVGIVEKGK-ILANE---KMMYPISDKGIIPAKVAEYHAKNASAVIR 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + +DI+AV YT GPGL L +G ++L +P P++H GH+ Sbjct: 57 RALSVAHAALEDIEAVGYTKGPGLGPCLEIGMLAAKTLHEKLGIPIYPINHAVGHI---E 113 Query: 121 LEDNPPEF--PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 + + F P V L VSGG++Q++S+ G G Y + GE++D G D A+ G+ Sbjct: 114 ITKHLSGFADPIV-LYVSGGNSQILSLAG-GHYHVHGETLDIGVGNMLDNFARAAGMKPA 171 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 G ++K A T G++V P T + G+DF+F+GL T A T+ + AD++ + Sbjct: 172 WGSTVAKFA---TGGKYV-RLPYTVK-GMDFTFTGLLTAAIKTL-----PSSSIADVSFS 221 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 ++ L+ +RAL +G +++ GGV+ + LR LA M + + A +F Sbjct: 222 IQETAFSMLVEATERALLLSGKDSVILCGGVAQSLRLREMLATMSASHKKRFYVADNQFN 281 Query: 299 TDNGAMIAYAGMVRFKAG---ATADLGVSVRPRWPLAELP 335 DNGAMIAY ++G A +DL ++ + R A +P Sbjct: 282 ADNGAMIAYVAEKMDESGYAPARSDLTINQKFRIEKAGVP 321 >UniRef50_B9WFF4 Metalloprotease, putative n=8 Tax=Saccharomycetales RepID=B9WFF4_CANDC Length = 440 Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 111/396 (28%), Positives = 180/396 (45%), Gaps = 63/396 (15%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVK--LH-ADYGGVVPELASRDHVRKTVPL 58 RV+ IE+SCD++ +A+ + ++ Q K LH AD GG++P A H+ + Sbjct: 25 RVMAIESSCDDSCVALLEKSHPDTPPKIIDQFKRTLHSADIGGILPTAAYNYHMATIANM 84 Query: 59 IQAALKESGLTAKDI-DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +Q + ++A + D + T GPG+ G+L + L+ AW VP I VHHM GHLL Sbjct: 85 VQEFCSKHQISALNPPDLLCVTRGPGMAGSLSTSTEFAKGLSVAWGVPLIGVHHMLGHLL 144 Query: 118 APML------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 L + PP++PF++LL SGGHT L+ + ++E++ D A G++ DK A+ Sbjct: 145 TANLPKSEQPDSPPPKYPFLSLLCSGGHTMLVLSKSLTEHEIVVNVGDIAVGDSLDKCAR 204 Query: 172 LLGL--DYPGGPLLSKMAA--QGTAGRFV-------FPRPMTDRPGLDFS---------- 210 LG+ + G L + + + T R+ P R L +S Sbjct: 205 ELGMYGNMLGKELEKYINSIPEETRNRYEKLSVNTRIANPYNFRLTLPYSAPKYGIPEDV 264 Query: 211 -------FSGLKTFAANTIRDNG-------TDDQTRADIARAFEDAVVDTLMIKCKRALD 256 S ++ + A +G D++T+ IA ++ + D ++ + A Sbjct: 265 KFAFSHFLSNIQEYKAMHYNKSGGGEIDVALDEETKQFIAYKTQEFIFDHIVDRINIAFK 324 Query: 257 QTGFKR------------LVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPE--FCT 299 + G K + +GGV+AN+ LR KL E + + V + P+ CT Sbjct: 325 KHGIKNRNSDGTFIGVKDFICSGGVAANKRLREKLRENLDFQEIGADNVNFHFPDLSLCT 384 Query: 300 DNGAMIAYAGMVRF-KAGATADLGVSVRPRWPLAEL 334 DN MI AG+ F K DL +WPL +L Sbjct: 385 DNAIMIGAAGIEIFEKLRLRTDLSFLPIRKWPLNKL 420 >UniRef50_P36174 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Haloarcula marismortui RepID=GCP_HALMA Length = 548 Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 104/338 (30%), Positives = 163/338 (48%), Gaps = 41/338 (12%) Query: 1 MRVLGIETSCDETGIAIY---------DDEKGLLANQLYSQVKLHADYGGVVPELASRDH 51 MR+LGIE + +++ DD+ + Y+ D GG+ P A+ +H Sbjct: 1 MRILGIEGTAWAASASVFETPDPARVTDDDHVFIETDAYA-----PDSGGIHPREAA-EH 54 Query: 52 VRKTVP-LIQAALKES-GLTAKD------IDAVAYTAGPGLVGALLVGATVGRSLAFAWD 103 + + +P +++ A++ + G +D IDAVA+ GPGL L + AT R++A +D Sbjct: 55 MGEAIPTVVETAIEHTHGRAGRDGDDSAPIDAVAFARGPGLGPCLRIVATAARAVAQRFD 114 Query: 104 VPAIPVHHMEGHLLAPMLE---DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDD 160 VP + V+HM HL D+P V L SG + ++ G+Y +LGE++D Sbjct: 115 VPLVGVNHMVAHLEVGRHRSGFDSP-----VCLNASGANAHILGYRN-GRYRVLGETMDT 168 Query: 161 AAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAAN 220 G A DK + +G +PGGP + + A G + G+DFSFSG+ + A Sbjct: 169 GVGNAIDKFTRHIGWSHPGGPKVEQHARDGEYHELPYVVK-----GMDFSFSGIMSAAKQ 223 Query: 221 TIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLA 280 + D+G + ++ R E+ + L +RAL TG LV+ GGV N L+ L Sbjct: 224 AV-DDGVPVE---NVCRGMEETIFAMLTEVSERALSLTGADELVLGGGVGQNARLQRMLG 279 Query: 281 EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT 318 EM ++R E + F DN MIA G + AG T Sbjct: 280 EMCEQREAEFYAPENRFLRDNAGMIAMLGAKMYAAGDT 317 >UniRef50_A6VJ51 Putative O-sialoglycoprotein endopeptidase n=26 Tax=cellular organisms RepID=GCP_METM7 Length = 547 Score = 134 bits (337), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 99/319 (31%), Positives = 167/319 (52%), Gaps = 18/319 (5%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 +G E + ++TG+ I + +L N+ + G+ P A+ H V L++ AL Sbjct: 10 IGFEGTAEKTGVGIITSKGEVLFNK---TIIYTPPVQGIHPREAADHHAETFVKLLKEAL 66 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 + + + ID V+++ GPGL +L V AT R+L+ + + P I V+H H+ L+ Sbjct: 67 --TVVPIEKIDLVSFSLGPGLGPSLRVTATTARALSLSINKPIIGVNHCISHVEIGKLKT 124 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 + + P + L VSGG+TQ+++ TG +Y ++GE++D A G D+ A+ + +PGG + Sbjct: 125 DAVD-P-LTLYVSGGNTQVLAYTG-KKYRVIGETLDIAIGNCLDQFARHCNMPHPGGVYV 181 Query: 184 SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-ADIARAFEDA 242 K A G +F+ P T + G+D S SGL T A D + R D+ + ++ Sbjct: 182 EKYAKDGN--KFM-KLPYTVK-GMDISLSGLLTAAMKKY-----DSKERIEDVCYSLQET 232 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 L +RAL T +++ GGV+AN L+ L M ++ + + EFC DNG Sbjct: 233 SFSMLTEITERALAHTNKAEVMLVGGVAANNRLKEMLDVMCSEQNVDFYVPEREFCGDNG 292 Query: 303 AMIAYAGMVRFKAGATADL 321 AMIA+ G++++ G DL Sbjct: 293 AMIAWLGILQYLNGKRMDL 311 >UniRef50_Q4UA14 Glycoprotein endopeptidase, putative n=3 Tax=Piroplasmida RepID=Q4UA14_THEAN Length = 363 Score = 134 bits (336), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 98/327 (29%), Positives = 157/327 (48%), Gaps = 21/327 (6%) Query: 4 LGIETSCDETGIAIYDDEKGLLAN--QLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 LGIE S ++ GIA+ + +L+N + YS D G +P S+ H L+ Sbjct: 15 LGIEGSANKLGIAVIRGDGEILSNVRRTYSP----PDGEGFLPRQVSKHHRENMASLLME 70 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G+T D+ + YT GPG+ L VGA +++ F P + V+H H+ Sbjct: 71 ALEKAGITLSDLSLICYTKGPGIGSGLHVGALAAKTIHFITGKPIVGVNHCVAHVEMGRF 130 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQ-YELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + P + L VSGG+TQ++S + Y +LGE++D A G D+ A+LL L Sbjct: 131 LSGYKK-PAI-LYVSGGNTQVLSYDEKRKVYSVLGETLDIAIGNVLDRIARLLHLPNKPA 188 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-----------D 229 P LS + + + P P + G+D S SGL T + I T + Sbjct: 189 PGLSIELQARKSSKNLIPLPFVVK-GMDCSLSGLLTKCEDLIEHFKTKLIMSEDSAFEYE 247 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 Q + D+ + ++ L+ +RA+ T +++ GGV N L+ M K+R + Sbjct: 248 QFKVDLCFSVQEHTFAMLIEMLERAMSFTDSDEILLVGGVGCNLRLQEMANLMAKERNAK 307 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAG 316 +F +C DNGAMI Y GM+ + G Sbjct: 308 LFPMDERYCIDNGAMIGYTGMIDYLYG 334 >UniRef50_Q2GXN6 Putative glycoprotein endopeptidase KAE1 n=18 Tax=Eukaryota RepID=KAE1_CHAGB Length = 356 Score = 134 bits (336), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 101/327 (30%), Positives = 161/327 (49%), Gaps = 23/327 (7%) Query: 4 LGIETSCDETGIAIY---DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 LG E S ++ GI + D +L+N ++ V G +P+ ++ H V + + Sbjct: 14 LGCEGSANKLGIGVILHEGDTSTVLSNVRHTFVSPAGT--GFLPKDTAQHHRAFFVRVAK 71 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++G+ DID + YT GPG+ G L A R+LA W + V+H GH+ Sbjct: 72 QALSDAGIRIADIDCICYTRGPGMGGPLASVAVAARTLALLWGKELVGVNHCVGHIEMGR 131 Query: 121 L---EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 D+P V L VSGG+TQ+I+ +Y + GE++D A G D+ A+ L + Sbjct: 132 TITGADHP-----VVLYVSGGNTQVIAYAE-QRYRIFGETLDIAVGNCLDRFARALNISN 185 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFA---ANTIRDN---GTDDQ- 230 P + GR + P + G+D SFSG+ T A A ++ N GTD + Sbjct: 186 DPAPGYNIEVLARKGGRVLLDLPYAVK-GMDCSFSGILTRAEELAAQMKANEGKGTDGEP 244 Query: 231 -TRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 T AD+ + ++ V L+ +RA+ G ++++ GGV N L+ + M R G Sbjct: 245 FTGADLCFSLQETVFAMLVEITERAMAHVGSNQVLIVGGVGCNERLQEMMGLMAADRGGS 304 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAG 316 V+ FC DNG MIA+AG++ ++ G Sbjct: 305 VYATDERFCIDNGIMIAHAGLLAYETG 331 >UniRef50_Q18KI0 Putative O-sialoglycoprotein endopeptidase n=14 Tax=Euryarchaeota RepID=GCP_HALWD Length = 533 Score = 133 bits (335), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 115/340 (33%), Positives = 167/340 (49%), Gaps = 43/340 (12%) Query: 1 MRVLGIETSCDETGIAIYD--DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 MR+LGIE + A+Y+ DE ++ + Y D GG+ P A+ +H+ +P Sbjct: 1 MRILGIEGTAWAASAALYNTHDETIVIESDPY-----QPDSGGLHPREAA-EHMSTALPE 54 Query: 59 IQAALKESGLTAKD-----IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 + + + E +++ + IDA+A++ GPGL L V T R+L A VP I V+HM Sbjct: 55 VISTILERAVSSGNTDAIGIDAIAFSRGPGLGPCLRVVGTAARTLTQALSVPLIGVNHMI 114 Query: 114 GHLLAPMLEDNPPEFPF---VALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTA 170 HL E + F V L SG + L+ QY++LGE++D G A DK Sbjct: 115 AHL-----EIGRHQSGFTTPVCLNASGANAHLLGYHR-RQYQVLGETMDTGVGNAIDKFT 168 Query: 171 KLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRP----GLDFSFSGLKTFAANTIRDNG 226 + LG ++PGGP K+ A T G + D P G+DFSFSG+ + A + + DN Sbjct: 169 RHLGWNHPGGP---KVEAAATDGSY------HDLPYVVKGMDFSFSGIMSAAKDAV-DN- 217 Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 + D+ ++ + L +RAL TG LV+ GGV N LR L+ M Sbjct: 218 --EVPVVDVCTGLQETIFAMLTEVAERALSLTGSNELVLGGGVGQNDRLREMLSTMCTA- 274 Query: 287 RGEVFYARPE--FCTDNGAMIAYAGMVRFKAGATADLGVS 324 RG FYA PE F DN MIA G ++AG T + S Sbjct: 275 RGASFYA-PESRFLRDNAGMIAVLGAAMYEAGQTISVNDS 313 >UniRef50_Q6L4N8 Os05g0194600 protein n=21 Tax=Eukaryota RepID=Q6L4N8_ORYSJ Length = 380 Score = 129 bits (323), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 104/325 (32%), Positives = 168/325 (51%), Gaps = 20/325 (6%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LG+E+S ++ GI + +L+N ++ V G +P + H+ +PL++AAL Sbjct: 17 LGLESSANKIGIGVVSLSGEILSNPRHTYVTPPGH--GFLPRETAHHHLAHLLPLLRAAL 74 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 E+G+T D+ V YT GPG+ L V A R+L+ W P + V+H H+ M Sbjct: 75 GEAGVTPADLACVCYTKGPGMGAPLQVAAAAARALSLLWGKPLVGVNHCVAHV--EMGRA 132 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYPGGP 181 V L VSGG+TQ+I+ + G+Y + GE+ID A G D+ A++L L D G Sbjct: 133 VTGAVDPVVLYVSGGNTQVIAYSE-GRYRIFGETIDIAVGNCLDRFARVLELSNDPSPGY 191 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRP----GLDFSFSGLKTF-AANTIRDNGTDDQTRADIA 236 + ++A +G +F+ D P G+D SFSG+ +F A I ++ T AD+ Sbjct: 192 NIEQLAKKGE--KFI------DLPYVVKGMDVSFSGILSFIEATAIEKLKNNECTPADLC 243 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 + ++ + L+ +RA+ K +++ GGV N L+ + M +R G +F Sbjct: 244 YSLQETLFAMLVEITERAMAHCDSKDVLIVGGVGCNERLQEMMRIMCSERGGRLFATDDR 303 Query: 297 FCTDNGAMIAYAGMVRFKAGATADL 321 +C DNGAMIAY G++ + G T L Sbjct: 304 YCIDNGAMIAYTGLLAYAHGMTTPL 328 >UniRef50_A8WMS3 Putative uncharacterized protein n=1 Tax=Caenorhabditis briggsae RepID=A8WMS3_CAEBR Length = 386 Score = 127 bits (320), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 100/305 (32%), Positives = 157/305 (51%), Gaps = 23/305 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 VLGIE S ++ G+ I D +L+N + HA G G P ++ H ++ V L+ Sbjct: 4 VLGIEGSANKIGVGIIRDGV-VLSN---PRATFHAPPGEGFRPTETAQHHRQQIVRLVGE 59 Query: 62 ALKESGLT--AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 A++E+G+ K+ID +A+T GPG+ L VGA V R+L+ W P IPV+H GH+ Sbjct: 60 AIREAGIQDPEKEIDGIAFTKGPGMGAPLQVGAIVARTLSLRWQKPIIPVNHCVGHIEMG 119 Query: 120 ML---EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L DNP V L VSGG+TQ+ +Y + GE+ID A G D+ A++L L Sbjct: 120 RLITGADNP-----VVLYVSGGNTQVFLPN--KRYRIFGETIDIAVGNCLDRFARVLKL- 171 Query: 177 YPGGPLLSKMAAQ-GTAGRFVFPRPMTDRPGLDFSFSG-LKTFAANTIRDNGTDDQTRAD 234 P P Q +G +F P T + +D S SG L + + + + T AD Sbjct: 172 -PNAPSPGYNIEQLAKSGAKLFELPYTVKARMDVSLSGILSCIESRAPQLLESREYTPAD 230 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRA-KLAEMMKKR-RGEVFY 292 + + ++ V L+ +RA+ TG + L++ GGV N L+ ++ ++K R + +F+ Sbjct: 231 LCFSLQETVFAMLIEITERAMAHTGSRELLIVGGVGCNLRLQVLEIVFLVKIRLKKLIFF 290 Query: 293 ARPEF 297 EF Sbjct: 291 NLSEF 295 >UniRef50_B8MFK9 Glycoprotease family protein, putative n=5 Tax=Leotiomyceta RepID=B8MFK9_TALSN Length = 883 Score = 127 bits (318), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 117/433 (27%), Positives = 175/433 (40%), Gaps = 106/433 (24%) Query: 4 LGIETSCDETGIAIYDDEK-----------GLLANQLYSQVKLHAD---YGGVVPELASR 49 L IE+SCD+T +AI + + G A +++ + AD Y G+ P A + Sbjct: 429 LAIESSCDDTSVAIVEKDSFHKSFETPRHTGHAAAEVHFLENITADTRKYRGIHPIEALQ 488 Query: 50 DHVRKTVPLIQAALKESGLTAKDI--------------------------DAVAYTAGPG 83 H L+Q A++ A+D + ++ T GPG Sbjct: 489 SHQENLAKLVQKAVRSLPPVAEDYSPEDGAVISHIIPKNKNGKSTRHRLPNFISVTRGPG 548 Query: 84 LVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML----------EDNPPEFPFVAL 133 + L VG + LA AW +P + VHHM+ HLL P L +D P FPF+++ Sbjct: 549 MRSNLSVGLDTAKGLAVAWQIPLVGVHHMQAHLLTPRLVSALNRSVLTDDLQPNFPFLSI 608 Query: 134 LVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL-------------------G 174 LVSGGH+ L+ + ++E+L + D A GE DK+A+L+ Sbjct: 609 LVSGGHSMLVHSKSLLEHEILATTADIAIGETLDKSARLILPESVLESANTTMYGKLLEK 668 Query: 175 LDYPGGPL-LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT----------------- 216 +PGGP + A T G V R D G F+ T Sbjct: 669 FAFPGGPADYADYQALKTRGEEVVKRD-NDTWGWSFTTPYANTRDLKFSFSSVSSTVSRI 727 Query: 217 FAANTIRDNGTDDQTRADIAR-----AFEDAVVDTLMI------KCKRALDQTG----FK 261 A D R +AR FE TL+ + ++ + +G Sbjct: 728 MANKEKADVRVTRDERVALARESMRVCFEHLASRTLIALELLRKQLRKQYNTSGSGQEID 787 Query: 262 RLVMAGGVSANRTLRAKLAEMMKKR---RGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT 318 LV++GGV+AN+ L L + R +V P CTDN AMI +AG+ F+AG + Sbjct: 788 TLVVSGGVAANQFLMTVLRAFLDVRGFSHIKVIAPPPYLCTDNAAMIGWAGIEMFEAGYS 847 Query: 319 ADLGVSVRPRWPL 331 DL +W L Sbjct: 848 TDLSCRAIRKWTL 860 >UniRef50_Q83I95 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Tropheryma whipplei RepID=GCP_TROW8 Length = 401 Score = 127 bits (318), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 79/187 (42%), Positives = 102/187 (54%), Gaps = 9/187 (4%) Query: 131 VALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQG 190 V LL SGGH+ L+ + + LLGE++DDAAGEAFDK A+L+GL YPGGP + +A+ G Sbjct: 188 VVLLASGGHSCLLKIHN-NKISLLGETLDDAAGEAFDKIARLMGLQYPGGPAIEMLASSG 246 Query: 191 TAGRFVFPRPM----TDRPGLDFSFSGLKTFAANT---IRDNGTDDQTR-ADIARAFEDA 242 FPR + + FSFSGLKT I+ N + DIA +F++A Sbjct: 247 NPNAVEFPRALLTHFEEHNRYSFSFSGLKTAVGRVVERIKSNPAHSIPKIEDIAASFQEA 306 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 V D L K A + +VM GGV+AN +R L E K +V CTDNG Sbjct: 307 VADVLTAKTVAAALASDVDLIVMGGGVAANNRIREMLCERAKIHGLDVKIPPIALCTDNG 366 Query: 303 AMIAYAG 309 AMIA AG Sbjct: 367 AMIAAAG 373 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 52/121 (42%), Positives = 77/121 (63%), Gaps = 1/121 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCDETG+ I +LAN++ S H +GGV+PE+A+R H+ L++ A Sbjct: 4 ILGIETSCDETGVGIVSGST-VLANEVASSSLRHKPFGGVIPEIAARAHLEYLPNLLELA 62 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ + L KDID +A TAGPGLV +L VG + ++L + P V+H+ GH ++ L+ Sbjct: 63 LETAQLCIKDIDGIAVTAGPGLVTSLSVGVSAAKALGLSTGTPVYGVNHLVGHAVSAFLD 122 Query: 123 D 123 D Sbjct: 123 D 123 >UniRef50_P36132 Putative glycoprotein endopeptidase KAE1 n=40 Tax=Eukaryota RepID=KAE1_YEAST Length = 386 Score = 123 bits (309), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 98/348 (28%), Positives = 162/348 (46%), Gaps = 41/348 (11%) Query: 4 LGIETSCDETGIAI---------------YDDEKGLLANQLYSQVKLHADYGGVVPELAS 48 LG+E S ++ G+ I YD E +L+N + V + G +P + Sbjct: 19 LGLEGSANKLGVGIVKHPLLPKHANSDLSYDCEAEMLSNIRDTYVTPPGE--GFLPRDTA 76 Query: 49 RDHVRKTVPLIQAALKESGLTAK--DIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPA 106 R H + LI+ AL E+ + + DID + +T GPG+ L R+ + WDVP Sbjct: 77 RHHRNWCIRLIKQALAEADIKSPTLDIDVICFTKGPGMGAPLHSVVIAARTCSLLWDVPL 136 Query: 107 IPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAF 166 + V+H GH+ E + P V L VSGG+TQ+I+ + +Y + GE++D A G Sbjct: 137 VGVNHCIGHIEMGR-EITKAQNP-VVLYVSGGNTQVIAYSE-KRYRIFGETLDIAIGNCL 193 Query: 167 DKTAKLLGLDYPGGP--LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGL---------- 214 D+ A+ L + P + ++A + + P T + G+D S SG+ Sbjct: 194 DRFARTLKIPNEPSPGYNIEQLAKKAPHKENLVELPYTVK-GMDLSMSGILASIDLLAKD 252 Query: 215 --KTFAANTI---RDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGV 269 K N I + G T D+ + ++ + L+ +RA+ ++++ GGV Sbjct: 253 LFKGNKKNKILFDKTTGEQKVTVEDLCYSLQENLFAMLVEITERAMAHVNSNQVLIVGGV 312 Query: 270 SANRTLRAKLAEMMKKR-RGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 N L+ +A+M K R G+V FC DNG MIA AG++ ++ G Sbjct: 313 GCNVRLQEMMAQMCKDRANGQVHATDNRFCIDNGVMIAQAGLLEYRMG 360 >UniRef50_A8QDL6 Glycoprotease family protein n=1 Tax=Brugia malayi RepID=A8QDL6_BRUMA Length = 415 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 91/324 (28%), Positives = 154/324 (47%), Gaps = 23/324 (7%) Query: 3 VLGIET-SCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 V+GIET CD+T + I + ++ +L+++ Y+ ++ GG+ P + H + Sbjct: 33 VMGIETRHCDDTAVCILNSDRKILSSRRYADREVQKRLGGICPAAVADQHRSYIDLFVDE 92 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L ES + ++D +A T PGLV L VG SLA +P IPVHHM+ H L Sbjct: 93 CLDESRVRLCNLDGIAVTTQPGLVICLRVGTEKAISLARKGCIPLIPVHHMQAHATVATL 152 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK--------LL 173 +P+V++L+SGGH+ + G +E+L S+ + GE DK ++ LL Sbjct: 153 MTE-IXYPYVSVLISGGHSIIAVTNGPDDFEVLLTSMCGSPGECMDKISRALHFEEPELL 211 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTD--RPGLDFSFSGLKTFAANTIRDNGTDDQT 231 GL +PG L + G +P + + L F+FS +KT I + Sbjct: 212 GL-HPGAALEVIASRSSVDGYKRYPIDVNKFMKMALHFNFSWIKTTYLAMISRQSI--LS 268 Query: 232 RADIARAFEDAVVDTLMIK---CKRALDQTG----FKRLV-MAGGVSANRTLRAKLAEMM 283 D + + ++ + L K C + L+ + RLV ++GGV++N+ + A+ + Sbjct: 269 VPDFCASVQHSIANYLAEKLSCCLQYLNDSNKIPSRNRLVFVSGGVASNKYILARFNNVC 328 Query: 284 KKRRGEVFYARPEFCTDNGAMIAY 307 + V+ +C DN MIA+ Sbjct: 329 EPLGYSVYAPSQFYCCDNAEMIAW 352 >UniRef50_Q5KFY5 Mitochondrion protein, putative n=2 Tax=Filobasidiella neoformans RepID=Q5KFY5_CRYNE Length = 307 Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 81/284 (28%), Positives = 136/284 (47%), Gaps = 35/284 (12%) Query: 86 GALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNP-PEFPFVALLVSGGHTQLIS 144 G L VG R+LA A + VHHM+ H L P+L PEFPF+ LL+SGGHTQL+ Sbjct: 3 GCLSVGQGTARALAAALGKRLVGVHHMQAHALTPLLTSAAAPEFPFLILLLSGGHTQLVL 62 Query: 145 VTGIGQYELLGESIDDAAGEAFDKTAKLLGL------------DYPGGPLLSKMAAQGTA 192 G+ ++++L +++D G+ F+K+A+LL L Y P L Sbjct: 63 AKGLFKFKILLDTLDSKIGDVFEKSARLLALPSGPKAPGAILEHYASLPALPPYDTHPLP 122 Query: 193 GRFVFPRPMTD---RPGLDFSFSGLKTFAANTIRDN-----GTDDQTRADIARAFEDAVV 244 + P P+T + L +SF+G+ + D D+ R A + A+ Sbjct: 123 ASQLIPIPLTTLHAKNTLAWSFAGMLAALQRAVHDRRQRQPAWDEPDRRAFANLVQTALT 182 Query: 245 DTLMIKCKRAL------DQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG-------EVF 291 L+ K + + + +V++GGV++N +R++L ++K G ++ Sbjct: 183 THLLTKLAQRIALLPPDTRAQLGGIVVSGGVASNAYIRSQLDRLVKTENGLFPPAGRNLY 242 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAEL 334 Y CTDN AMIA+ ++R + G +D + +R +W L ++ Sbjct: 243 YPPLHLCTDNAAMIAHTALIRLQTGLRSDPDDLKLRAKWSLEDM 286 >UniRef50_C1GKA7 Glycoprotease pgp1 n=11 Tax=Onygenales RepID=C1GKA7_PARBD Length = 642 Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 120/454 (26%), Positives = 176/454 (38%), Gaps = 129/454 (28%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQV----KLHAD---YGGVVPELASRDHVRK 54 ++ G D+T +AI EK +++ S + + AD Y G+ P +A H Sbjct: 169 KIFGANKPFDDTSVAII--EKHGVSSPSRSSILFLENITADSRKYQGIHPAVALDSHQAN 226 Query: 55 TVPLIQAALKESGL----TAKDI----------------------DAVAYTAGPGLVGAL 88 T L+ AL L +A D+ D ++ T GPG+ L Sbjct: 227 TAKLVNKALAHLPLAQFPSANDVGRVICLPSSATDGITPHLRRKPDFISVTRGPGMRSNL 286 Query: 89 LVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED-----------------NPPEFPFV 131 VG + L+ AW VP + VHHM+ HLL P L N P FPF+ Sbjct: 287 SVGLDTAKGLSVAWQVPIVGVHHMQAHLLTPRLAASLQQQQLQSSENSSAFRNSPSFPFM 346 Query: 132 ALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL------------------ 173 ++LVSGGHT L+ I +E+L + D A G+A DKTA++L Sbjct: 347 SILVSGGHTLLVHSKSIVDHEILASTSDSAIGDALDKTARMLLPQSFLAKSTTTMYGKML 406 Query: 174 -GLDYPGGPL--------------LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTF- 217 +P GP L K+ ++ F P R ++FSFSG+ T Sbjct: 407 EEFAFPNGPSDYADYRPPATRGEELVKLKSERWGWSFGMPFAENRR--MEFSFSGVTTRA 464 Query: 218 ------------AANTIRDNGTDDQTRADIARAFEDAVVDTLMIKC-------------- 251 AA + + R + ARAF L + Sbjct: 465 RDIYLNRRKQWEAAGNSGEGFMSNDERIEFARAFMTVCFGHLASRTIIALQELRRQQQQQ 524 Query: 252 ----------KRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG----EVFYARPEF 297 ++ + L+++GGV AN+ L+ KL RG +V P Sbjct: 525 QQQQQQQERENQSPPAEDIQSLIISGGVGANQFLK-KLFRSYLDIRGFPHVDVIAPPPYL 583 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 CTDN AMI +AG+ F+AG +DL +W L Sbjct: 584 CTDNAAMIGWAGIEMFEAGWRSDLRCRPLRKWTL 617 >UniRef50_A3CXS0 Putative O-sialoglycoprotein endopeptidase n=5 Tax=Euryarchaeota RepID=GCP_METMJ Length = 527 Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 106/342 (30%), Positives = 160/342 (46%), Gaps = 37/342 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG+E + A++ D+ L + Y K GG+ P A++ H ++ Sbjct: 11 VLGLEGTAWNLSAALFGDDLVALHSSPYVPPK-----GGIHPREAAQHHASAMKEVVSRV 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL----LA 118 L E + I AVA++ GPGL +L AT R+L+ A DVP + V+H H+ A Sbjct: 66 LTEP----ERIRAVAFSQGPGLGPSLRTVATAARALSIALDVPLVGVNHCVAHVEIGRWA 121 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 D P V L SG +TQ++ G+Y + GE++D G DK A+ L +P Sbjct: 122 TGFSD-----PIV-LYASGANTQVLGYLN-GRYRIFGETLDIGLGNGLDKFARSHDLPHP 174 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP + ++A +G P T + G+D +FSGL + A + D+ Sbjct: 175 GGPAIERLAREGN----YIELPYTVK-GMDLAFSGLVSAAQES-------SAPLEDVCFG 222 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE-- 296 ++ + +RAL G +++ GGV AN L+ L +M + RG F A PE Sbjct: 223 LQETAFAMCVEVTERALAHAGKDEVLLVGGVGANGRLQEML-RVMCEERGAAF-AVPERT 280 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVS-VRPRWPLAELPAA 337 F DNGAMIAY G + + G L S +RP + E+ A Sbjct: 281 FLGDNGAMIAYTGKIMLEHGVVLPLDQSQIRPGYRADEVEVA 322 >UniRef50_C8V9Q8 PROBABLE O-SIALOGLYCOPROTEIN ENDOPEPTIDASE (AFU_orthologue; AFUA_7G05240) n=2 Tax=Emericella nidulans RepID=C8V9Q8_EMENI Length = 497 Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 114/438 (26%), Positives = 170/438 (38%), Gaps = 107/438 (24%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVPLI 59 L IETSCD+T +AI A +++ + D Y G+ P A H + L+ Sbjct: 34 TLAIETSCDDTSVAIVHKNDKSGAAKIHFLENITPDLTAYQGIHPVRALESHQQNVAKLV 93 Query: 60 QAALKESGLTAKDI------------------DAVAYTAGPGLVGALLVGATVGRSLAFA 101 AL ++ + D ++ T GPG+ L G + LA A Sbjct: 94 NKALSHLPYSSAESQNDPTKIVSLGDGNRQKPDFISVTRGPGMRSNLFAGLDTAKGLAVA 153 Query: 102 WDVPAIPVHHMEGHLLAPML-----------EDN----------PPEFPFVALLVSGGHT 140 W VP + VHHM+ HLL P L +N P FPF+++L SGGHT Sbjct: 154 WQVPFVGVHHMQAHLLTPRLVSALALSPGSSPNNTDRQNEKGELQPAFPFLSILASGGHT 213 Query: 141 QLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--------GLDYPGGPLLSKMA---AQ 189 L++ + + + +L + D A GEA DK A+ + + G LL + A + Sbjct: 214 LLVNSSSLTDHRILATTTDVALGEALDKAAREILPSSLLSTSKNTMYGKLLEQYAFPNGR 273 Query: 190 GTAGRFVFPR---------------------PMTDRPGLDFSFSGLKTFAANTIR----- 223 +V P+ P L FSF+ L T +T+ Sbjct: 274 ADYADYVAPKSRGDEIAVSKVVSKYGWSLTTPYAQTRELAFSFAFLATAVNHTLAKARKR 333 Query: 224 --DNGTDDQTRADIARAFEDAVVDTLMIKCKRAL-----------DQTGFKR-------- 262 + G D+ R +AR + L + AL + KR Sbjct: 334 AGETGLSDEERVFLAREVMRVTFEHLASRTIIALESLCQWVPLVPNNPNDKRQKPLPSSV 393 Query: 263 ----LVMAGGVSANRTLRAKLAEMMKKR-RGEVFYARP--EFCTDNGAMIAYAGMVRFKA 315 LV++GGV+AN+ L L + R G V P CTDN AM+ +AG+ F+A Sbjct: 394 PVSTLVVSGGVAANKFLMHVLRTWLDGRGFGHVGVVAPPISLCTDNAAMVGWAGIEMFEA 453 Query: 316 GATADLGVSVRPRWPLAE 333 G + +W L E Sbjct: 454 GWRSAFEARALRKWGLEE 471 >UniRef50_A8BDD4 O-sialoglycoprotein endopeptidase n=2 Tax=Giardia intestinalis RepID=A8BDD4_GIALA Length = 396 Score = 111 bits (277), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 102/374 (27%), Positives = 161/374 (43%), Gaps = 67/374 (17%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-----GVVPELASRDHVRKTVP 57 +LG+E S ++ G+ I D + AN L + Y G P + H + + Sbjct: 2 ILGLEGSANKLGVGIVDASGVVHAN-------LRSTYNAPPGQGFQPNDVAAHHRQHIIG 54 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 LI+ AL E+ +++ I +AYT GPGL L A V R+L+ W VP + V+H H+ Sbjct: 55 LIERALLEAEISSDKITHIAYTRGPGLGAPLAAVAVVARTLSQLWKVPLLAVNHCVAHIE 114 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 L P V L SGG+TQ+I+ + G+Y + GE++D A G A D+ A+ L + Sbjct: 115 MGRLVTQLPN--PVVLYASGGNTQVIAYSQ-GRYRVFGEALDIAVGNALDRIARYLLISN 171 Query: 178 PGGPLLS--KMAAQGTA----------GRFVFPRPMT----------------------- 202 P L+ ++AA+ A + PR T Sbjct: 172 TPAPGLNIERLAAEWAAIFREEDCVHLDPDIVPRYTTLPRSKELLKEQLELYSANHPEAG 231 Query: 203 -----DRP----------GLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTL 247 D P G+D S SG+ T+ + + + D I + ++ + +L Sbjct: 232 IDTSYDIPIITTIPVPIKGMDISCSGISTYLKTYVETHTSLDPRL--ICYSLQETLFGSL 289 Query: 248 MIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAY 307 + +RA G ++ GGV N L+ L M +R G + +C DNGAMIA+ Sbjct: 290 VEITERAAAHVGAADILAVGGVGCNLRLQEMLQIMAAERNGRLGAMDDSYCVDNGAMIAW 349 Query: 308 AGMVRFKAGATADL 321 G +A + DL Sbjct: 350 CGACMLQAPLSMDL 363 >UniRef50_Q97ZY8 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Sulfolobus solfataricus RepID=GCP_SULSO Length = 246 Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 83/249 (33%), Positives = 129/249 (51%), Gaps = 15/249 (6%) Query: 90 VGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIG 149 VGAT+ R++A ++ +PV+H GH+ L + P + L +SGG+T +I+ G Sbjct: 3 VGATLARAIALKYNKKLVPVNHGIGHIEIGYLTTEARD-PLI-LYLSGGNT-IITTFYKG 59 Query: 150 QYELLGESIDDAAGEAFDKTAKLLGLDYP---GGPLLSKMAAQGTAGRFVFPRPMTDRPG 206 ++ + GE++D A G D + + L P G + + A+ G + P + G Sbjct: 60 RFRVFGETLDIALGNMMDVFVREVSLAPPYIINGIHVIDICAE--KGNKLLKLPYVVK-G 116 Query: 207 LDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMA 266 D SFSGL T A + + DI + + D L+ +RAL T K L++ Sbjct: 117 QDMSFSGLLTAALRVV-----GKEKLEDICYSVREIAFDMLLEATERALALTSKKELMIV 171 Query: 267 GGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS-V 325 GGV+A+ +LR KL E+ K+ ++ PEF DNGAMIAYAGM+ G D+ S + Sbjct: 172 GGVAASVSLRKKLEELGKEWNVQIKIVPPEFAGDNGAMIAYAGMLAASKGVFIDVDKSYI 231 Query: 326 RPRWPLAEL 334 RPRW + E+ Sbjct: 232 RPRWRVDEV 240 >UniRef50_Q2HG58 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2HG58_CHAGB Length = 1550 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 64/188 (34%), Positives = 100/188 (53%), Gaps = 19/188 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVPLI 59 L IETSCD+T + + EK A ++ K+ +D +GG+ P+ A + H ++ Sbjct: 1068 TLAIETSCDDTCVTVL--EKSGDAARVLFNAKVTSDNRRFGGIKPDEAVQGHSSSLPGIV 1125 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 QAA+++ D ++ T GPG+ AL +G T+ + LA AWD P + VHHM+ H L P Sbjct: 1126 QAAIQKLPADRPKPDFISVTRGPGITSALSIGLTMAKGLAVAWDRPLVAVHHMQAHALTP 1185 Query: 120 ML-------EDNPPE-------FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEA 165 L + PP +PF++LLVSGGH+QL+ + L E+ + A G+ Sbjct: 1186 RLVEALANGQQQPPHQGGARPAYPFLSLLVSGGHSQLLLTRSAVSHATLAEAANVAIGDM 1245 Query: 166 FDKTAKLL 173 DK A+ + Sbjct: 1246 LDKCARAI 1253 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 46/143 (32%), Positives = 68/143 (47%), Gaps = 13/143 (9%) Query: 200 PMTDRPGLDFSFSGLK-TFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALD-- 256 P+ +R + F FSGL A R+ D RA++AR + L + ALD Sbjct: 1325 PLHERRDMAFDFSGLGGQVQAIMQRNPSMDPPQRAELARETMRVAFEHLASRVIFALDGM 1384 Query: 257 -----QTGFKRLVMAGGVSANRTLRAKLAEMMKKR---RGEVFYARPE--FCTDNGAMIA 306 + LV++GGV+AN L L ++ R +V RP CTDN M+A Sbjct: 1385 RTQAAALPVRTLVVSGGVAANGFLMHVLGRVLAVRGYGPEKVAVVRPPRGLCTDNAVMVA 1444 Query: 307 YAGMVRFKAGATADLGVSVRPRW 329 +AG+ ++AG ++L V R RW Sbjct: 1445 WAGVEMWEAGWESELSVLPRRRW 1467 >UniRef50_A7APL5 Glycoprotease family protein n=1 Tax=Babesia bovis RepID=A7APL5_BABBO Length = 406 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 79/299 (26%), Positives = 139/299 (46%), Gaps = 32/299 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IETSCD+ A+ +++ + S +GG+ P+ + R H+ ++ Sbjct: 101 ILAIETSCDDCCAAVVSSNGDVVSEERASNPDSLIKFGGIKPDESYRFHLDNIDRIMNEV 160 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + ++ L +DI + T GPG+ L G ++ + +P I +H+ GH L+P ++ Sbjct: 161 VSKAKLKFEDIGYIVATRGPGMRICLNAGYDAAERISKTYSIPLIGENHLAGHCLSPFIK 220 Query: 123 --------------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDK 168 ++P+++LL+SGGH+Q+ V QY +L +++D AG K Sbjct: 221 GHQLRMTHDRGSVASEELKYPYLSLLLSGGHSQIYVVESPYQYHMLVDTMDHYAGNVLYK 280 Query: 169 TAKLLGL--DYPGGPLLSKMAAQGTAGR--FVFPRPMTDRPGLDFSFSGLKT---FAANT 221 AK LGL D GGP + + AA+ GR F P F FSG++T + Sbjct: 281 CAKELGLPIDTGGGPSIEE-AARKRQGRPMFRMTEPCKGMSFTSFCFSGIQTQLRSMVSK 339 Query: 222 IRDNGTDDQTRAD------IARAFEDAVVDTLMIKCKRALDQT----GFKRLVMAGGVS 270 IR + +D D +A ++ + ++ + +ALD G ++V+ GG S Sbjct: 340 IRQDLGEDALSEDPKLVNHLAYTCQEVTFNQVIRQLDKALDICETLFGISQIVVVGGRS 398 >UniRef50_C0GE31 O-sialoglycoprotein endopeptidase n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GE31_9FIRM Length = 307 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 96/308 (31%), Positives = 144/308 (46%), Gaps = 22/308 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TSC T +A+ D + LL + + + + G+ H++ L + Sbjct: 3 LGIDTSCYTTSLAVMDTQGRLLCEK-RTLLTVPKGERGLRQSDGVFQHLQNLPRLAEEVA 61 Query: 64 KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 E G + AVA + P + VG + GRSLA A+ VP + + H EGH+LA Sbjct: 62 GEVG--PLKLQAVAASVCPRPVEGSYMPVFTVGTSFGRSLAAAFGVPFLSLSHQEGHILA 119 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISV---TGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 M F AL VSGG T+L+ V G+ E LG S D AG+ D+ LGL Sbjct: 120 GMWSAGVDWPEFYALQVSGGTTELLFVRQNNGLKVAE-LGGSADLHAGQFIDRVGVALGL 178 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADI 235 +P GP + K+ G V P P++ + G + SFSG ++ I + + A + Sbjct: 179 SFPAGPAVEKL---GNDALEVLPVPVSVQ-GSNLSFSGPESHVQRVI---ASGEYAPAAV 231 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 AR E V ++L + + G K ++ GGV AN+ +R LAE + E +A+ Sbjct: 232 ARGVEKCVAESLWRVLRTVRKEHGAKPVLFVGGVMANQFIRGFLAEKLGD---EAAFAQI 288 Query: 296 EFCTDNGA 303 F DN A Sbjct: 289 RFAGDNAA 296 >UniRef50_B2A533 O-sialoglycoprotein endopeptidase n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A533_NATTJ Length = 322 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 81/328 (24%), Positives = 150/328 (45%), Gaps = 26/328 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 +G++TSC T +A+ + + ++A + +++ GG+ A H+ +P + Sbjct: 1 MGLDTSCYTTSMAVINKQGKIIA-KTERPLEVAMGKGGLRQSEAVFQHIN-NLPQGLTEI 58 Query: 64 KESGLTAKDID---AVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 K+ A+A ++ P + VG + ++L+ + +P + H EGH Sbjct: 59 KKQLNVNNLDLNLAAIAVSSRPRPIEGSYMPVFKVGDSYAKALSLSSGIPLLEYTHQEGH 118 Query: 116 LLAPMLE-DNPPEF----PFVALLVSGGHTQLISVTGIGQY-----ELLGESIDDAAGEA 165 + + + E N F+ VSGG T+L+ G++ E++G + D AAG+ Sbjct: 119 IASIVYEKSNNIRLEDMDKFLVFHVSGGTTELLICHTKGKFSSFDIEIIGGTKDIAAGQL 178 Query: 166 FDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN 225 D+TAKL+ L +PGGP L K+ Q P + D +FSG +T I + Sbjct: 179 IDRTAKLMNLPFPGGPHLEKLGDQSGQTDISVPFSVEDT---KINFSGPETHIKRLIHN- 234 Query: 226 GTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK 285 +D + +AR E V +L+ + AL + K ++ GGV +N ++ L++ + Sbjct: 235 --EDYPKPAVARGIEQCVAKSLLTVLENALKKHQVKNILFVGGVMSNSYIKNYLSKNISN 292 Query: 286 RRGEVFYARPEFCTDNGAMIAYAGMVRF 313 + + + PE DN +A+ G F Sbjct: 293 EKYNLIFGSPELSKDNAVGVAWLGYNNF 320 >UniRef50_A6S1G0 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6S1G0_BOTFB Length = 323 Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 85/307 (27%), Positives = 138/307 (44%), Gaps = 70/307 (22%) Query: 88 LLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML--------------EDNPPEFPFVAL 133 L+ G + LA AW +P + V+HM+ H L P + +N P +PF++L Sbjct: 5 LITGIDTAKGLAVAWQIPLLGVNHMQAHALTPRMVSALEAGNNSKTEKHENDPAYPFLSL 64 Query: 134 LVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAG 193 LVSGGHT L+ + +E+L + D A G+ DKTA+ + P + S A+ G Sbjct: 65 LVSGGHTMLVHSRQLCDHEILATTSDLAVGDMVDKTARDI---LPASVIES--ASDVMYG 119 Query: 194 R----FVFPR--------------PMTDRP----------------------GLDFSFSG 213 R F FP T RP +FS+SG Sbjct: 120 RVMEEFAFPDANSSYDYEPSHKSIAQTSRPTKYEWTLTPPYMSTGHRPLKSYNSEFSYSG 179 Query: 214 LKTFAANTI-RDNGTDDQTRADIAR-----AFEDAVVDTLMIKCKRALDQTGFKRLVMAG 267 + + + R+ D R +A+ AFE + +++ +R D K LV++G Sbjct: 180 VGSQIKRIMNRNPEMDIAERRLLAQETMRVAFEH-LASRVILNLERP-DLKDTKTLVVSG 237 Query: 268 GVSANRTLRAKLAEMMK---KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS 324 GV+AN+ L+ L ++ + + + P+FCTDN AMI + G+ ++AG +DL + Sbjct: 238 GVAANQYLKYILRSLLDAWGHKTMRLIFPPPKFCTDNAAMIGWTGIEMWEAGWRSDLDIL 297 Query: 325 VRPRWPL 331 +WP+ Sbjct: 298 AARKWPI 304 >UniRef50_C5FT24 Glycoprotease family protein n=2 Tax=Onygenales RepID=C5FT24_NANOT Length = 492 Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 93/347 (26%), Positives = 136/347 (39%), Gaps = 91/347 (26%) Query: 74 DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML---------EDN 124 D ++ T GPG+ L VG + + LA AW VP + VHHM+ HLL P L E+N Sbjct: 120 DFISVTRGPGMRSNLSVGLELAKGLAVAWQVPMVGVHHMQAHLLTPRLADALDIPSVEEN 179 Query: 125 ------PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL-LGLDY 177 P+FPF+++L+SGGHT L + ++ L ++D A G+ DK A++ L Y Sbjct: 180 DSIRALKPDFPFISVLISGGHTFLAHSKSLTDHKTLASTVDVAIGDVLDKFARMALPRSY 239 Query: 178 PGGPLLSKMAAQGTAGRFVFPR--------------------------------PMTDRP 205 + Q A + FP P D Sbjct: 240 IDQSKTTMYGKQLEA--YAFPNGYSDYADYEPPATRGQETKPIINAKYGWSLTLPYPDSK 297 Query: 206 GLDFSFSGLKTFAANT--IRDNGTDDQT-------------------RADIARAFEDAVV 244 + F+F+GL + A I NG +Q R + R F Sbjct: 298 KMAFTFAGLFSAAQRQVDIMVNGKVEQRKKTKEEMDSLNLDFLPHDGRVEFCRDFMRVCF 357 Query: 245 DTLMIKCKRALDQT-----------------GFKRLVMAGGVSANRTLRAKLAEMMKKR- 286 + L + AL+ K +V++GGV+AN+ LR L + R Sbjct: 358 EHLASRIVLALENALSSVPNTARKEQIEPGPSVKTIVVSGGVAANQYLRHILRAFLDIRG 417 Query: 287 RGEVFYARPE--FCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +V P CTDN AMI +AG+ F+AG +W L Sbjct: 418 FSDVDIVAPPLYLCTDNAAMIGWAGIEMFEAGWRTSRKSQAIRKWNL 464 >UniRef50_D1BMJ2 Metal-dependent protease with possible chaperone activity n=3 Tax=Veillonella RepID=D1BMJ2_VEIPT Length = 317 Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 84/304 (27%), Positives = 141/304 (46%), Gaps = 14/304 (4%) Query: 4 LGIETSCDETGIAIYDDEKGLLAN-QLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGI+TSC T AI D++ ++ + +VKL G+ H K +P + + Sbjct: 6 LGIDTSCYTTSCAIIDNDFHIVGEARKILEVKLGER--GLQQSNMVFQHT-KALPKLMSE 62 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L + ++ + + A +VG G++L+ +VP H E H+LA + + Sbjct: 63 LPQVPISGIGVSGFPRREERSYMPAFMVGLGQGQTLSHLMNVPLHIFAHQENHILAALRD 122 Query: 123 -DNPPEFPFVALLVSGGHTQLISV----TGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 N P PF+AL +SGG T+L+ GI + ++G S D G+ D+ LGL + Sbjct: 123 LKNIPNEPFLALHLSGGTTELVYCHYQGNGIFESHIVGGSKDLQGGQYVDRIGVALGLPF 182 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 P G L +A Q T P P + + G SF+G + A I +N D ++ +AR Sbjct: 183 PAGKHLEALALQTTEYE---PLPSSVKDGW-ISFAGPCSAAMRRI-NNAMSDIDKSKLAR 237 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 A ++ + L + + L+ GGV +N LR ++ K+ ++ A+P+F Sbjct: 238 AVFTSIGNALEKMITYHTKEKSVRALIAVGGVISNSLLRKRMETYCKRNHLQLHVAQPQF 297 Query: 298 CTDN 301 DN Sbjct: 298 SVDN 301 >UniRef50_C0ZC04 Peptidase M22 family protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZC04_BREBN Length = 320 Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 91/325 (28%), Positives = 155/325 (47%), Gaps = 24/325 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGI+TS T + + +E G + + +K+ G+ A HV +P + Sbjct: 5 MLGIDTSNYRTSLCL-AEEDGRIVAEAKRLLKVKEGKRGLQQSEAVFQHVM-NLPELSDE 62 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +K +I A+ + P + VG + +SLA VP H EGH+ Sbjct: 63 MKWKDY---EIAAICVSEKPRPQDGSYMPVFKVGEGLAKSLATYLRVPLHLTTHQEGHIA 119 Query: 118 APML--EDNPPEFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKTAKL 172 A E P E F+A+ +SGG ++L+ E +G +ID AG+ D+ Sbjct: 120 AGEYTAEVRPTEDRFLAVHLSGGTSELLLCERHAAGYTIEKIGGTIDLHAGQLVDRIGVA 179 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL +P GP L ++A + T G F R + GL FSFSG + A+ +R+ + Sbjct: 180 LGLSFPAGPALEQLAKEAT-GEF---RVSSAVDGLSFSFSGPE---ASLLREVEKGSTSP 232 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK--RRGEV 290 A+IARA E + + L + A++Q K +++ GGV+AN +R +L + ++ + ++ Sbjct: 233 AEIARATEQCIANALEKSLRHAVEQGYPKDILIVGGVAANYYIRERLIKRLEHPAVKAKL 292 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKA 315 ++ P + DN +A G ++ KA Sbjct: 293 YFCDPVYSGDNAYGVAMLGWMKQKA 317 >UniRef50_Q3AAM2 Glycoprotease family protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AAM2_CARHZ Length = 319 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 92/323 (28%), Positives = 150/323 (46%), Gaps = 44/323 (13%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPE----LASRD----HVRKT 55 LG +TS T A D E G L L + VPE L RD H+R Sbjct: 6 LGFDTSNYTTSFAAVDGE-GRLIFDLRKILP--------VPEGEVGLRQRDVVFLHLRHL 56 Query: 56 VPLIQAALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVH 110 ++Q ++ + + + P + + L G + +L+ A DVP + Sbjct: 57 KEMVQEGFNR--ISRDQVRGIGVSVKPRPLPESYMPSFLAGEVIASTLSLALDVPLVKTT 114 Query: 111 HMEGHLLAPMLEDNPPEFP-FVALLVSGGHTQLISVTGIGQ---YELLGESIDDAAGEAF 166 H EGHL+A + +FP F+A+ SGG ++++ V Q ++LG+S+D +AG+ Sbjct: 115 HQEGHLVAALWSLKK-DFPRFLAIHFSGGTSEILEVEKEPQGYKVKVLGKSLDISAGQLV 173 Query: 167 DKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG 226 D+ LLGL +P G L ++A + + P T G ++ FSG + + ++D Sbjct: 174 DRIGVLLGLPFPSGKFLEELAQKAVG---ILKVPATFVNG-NWHFSGAEAYLKRKLKDFP 229 Query: 227 TDDQTRADIARAFEDAVVDTLM-IKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK 285 +IARA E+ + TL I A D +V+ GGV+AN ++ L E +KK Sbjct: 230 A-----FEIARAVEEVIARTLFKIIQYHAKDNLP---VVLMGGVAANNYIKNFLLEKLKK 281 Query: 286 RRGEV--FYARPEFCTDNGAMIA 306 RR V ++A ++ +DN +A Sbjct: 282 RRVAVDLYFAEVQYASDNAVGVA 304 >UniRef50_A8MFJ2 O-sialoglycoprotein endopeptidase n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MFJ2_ALKOO Length = 328 Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 62/231 (26%), Positives = 116/231 (50%), Gaps = 10/231 (4%) Query: 87 ALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPP-EFPFVALLVSGGHTQLISV 145 L + G + + +P H EGH+ A + +N + F+A+ +SGG T+++ V Sbjct: 92 VFLAAKSYGEITSNLFHIPFYEFSHQEGHIEAALWSENIHMKEEFIAIHISGGTTEVLVV 151 Query: 146 T--GIG-QYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMT 202 IG E++G + D +AG+ D+ +GL++P G L +++ + P +T Sbjct: 152 KPRDIGYDIEIIGGTSDLSAGQFIDRVGVAMGLEFPSGKSLEEISRGCSELSLNVPVSVT 211 Query: 203 DRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKR 262 SFSG +T + I+++ + ++ADIA V +L + K Q K Sbjct: 212 KN---KISFSGPETHFSRLIKES---NASKADIAYGVFHCVARSLELLVKNIGKQYPIKN 265 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRF 313 L++ GGV++N +R+ L E + +++A P++CTDN I+ G+ ++ Sbjct: 266 LLIVGGVASNNQIRSYLLEKLAPENIHIYFAAPKYCTDNAVGISSLGVSKY 316 >UniRef50_A6NUZ4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NUZ4_9BACE Length = 313 Score = 94.4 bits (233), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 92/329 (27%), Positives = 151/329 (45%), Gaps = 32/329 (9%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R LG++TS T +A++D G +L + + + G+ A HV++ +P + Sbjct: 4 LRCLGLDTSNYTTSVAVFDGTTGENIGRL---LDVPSGTLGLRQSDALFQHVKR-LPGLF 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVG-----ALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L E L ++ AV + P V L G GR+L+ +VP PV H +GH Sbjct: 60 EQLHEKDLLG-ELRAVGASTRPRAVDGSYMPCFLAGEGQGRALSATLNVPFFPVSHQQGH 118 Query: 116 LLAPMLEDNPPEF---PFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKT 169 + A P +A +SGG T+L+ V G + + +G + D +AG+ D+T Sbjct: 119 IAAAAWSAGRLGLLDEPMLAWHLSGGTTELLYVEPEGVNVRAQAIGGTSDISAGQLIDRT 178 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 KLLGLD+P G L +A + + + R G FS SG++ N ++ Sbjct: 179 GKLLGLDFPAGKALDALARESQSEK----RFKVKLNGCSFSLSGVE----NQVKAMAERG 230 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 + ADIAR + + D + AL++ ++ +GGV++N LR KL + Sbjct: 231 EAPADIARFALNTIADAVARATAAALEERPGLNVLCSGGVASNSLLREKLKNAV------ 284 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGAT 318 +A P + TDN +A +AG T Sbjct: 285 --FAEPRYSTDNAMGVAILAWRSLQAGET 311 >UniRef50_Q7SD85 Predicted protein n=2 Tax=Sordariaceae RepID=Q7SD85_NEUCR Length = 538 Score = 94.4 bits (233), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 75/258 (29%), Positives = 113/258 (43%), Gaps = 65/258 (25%) Query: 3 VLGIETSCDETGIAIYDDEKG--------LLANQLYSQVKLHAD---YGGVVPELASRDH 51 L IETSCD+T +A+ + ++A L+++ K+ +D +GGV P +A H Sbjct: 40 TLAIETSCDDTCVALLQSYESTVRTETPEMVARLLFNK-KITSDQRQFGGVHPAVAVEWH 98 Query: 52 VRKTVPLIQAAL-----------KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAF 100 R L++ A+ K + L + D +A T GPG+ +L G V + LA Sbjct: 99 QRHLATLVEEAIRSLPEGKTPAYKNTRLPYRAPDLIAVTRGPGMPTSLATGMEVAKGLAL 158 Query: 101 AWDVPAIPVHHMEGHLLAPMLE---DNPP------------------------------- 126 AW +P + VHHM+ H L P L D PP Sbjct: 159 AWGIPIVGVHHMQAHALTPQLVEALDRPPAPSVASSPWEERQQVDAEVKTASRQQEEAQH 218 Query: 127 ---EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 ++P++ LLVSGGHTQL+ + + +L + + A G+ DK A+ + P L Sbjct: 219 PNLDYPYLNLLVSGGHTQLVYSASLTSHLILCTTDNIALGDMLDKAARKI---LPPSMLN 275 Query: 184 SKMAAQGTAG--RFVFPR 199 S A RF FPR Sbjct: 276 SGQNVMYAAALERFAFPR 293 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 6/107 (5%) Query: 232 RADIARAFE---DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR-R 287 RA + AFE +V L + K + +Q K LV++GGV++N+ LR L +++ R Sbjct: 412 RATMQLAFEHLASRIVMVLQQQAKTSCEQQKVKTLVVSGGVASNQFLRHVLRRVLEVRGF 471 Query: 288 GEVFYARP--EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 G + P CTDN AMIA+ G ++AG + L + +W ++ Sbjct: 472 GHIRIMAPPVNLCTDNAAMIAWTGSEMYRAGWVSKLDMLPIKKWSMS 518 >UniRef50_C0ATA9 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0ATA9_9ENTR Length = 71 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 42/59 (71%), Positives = 47/59 (79%) Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 K+ +MK+ GEVFYARPE CTDNGAMIA AGM+RFK G L V+VRPRWPLAELPA Sbjct: 9 KMEAVMKQIGGEVFYARPELCTDNGAMIALAGMIRFKGGTEGPLSVTVRPRWPLAELPA 67 >UniRef50_B2WBX5 Glycoprotease pgp1, mitochondrial n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2WBX5_PYRTR Length = 417 Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 66/227 (29%), Positives = 99/227 (43%), Gaps = 58/227 (25%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLH-------ADYGGVVPELASRDHVRKT 55 L IETSCD+T +A+ +KG ++ +Q+ H ++Y GV P ++ + H Sbjct: 2 TLAIETSCDDTSVAVV--KKGCKNDRTTAQILFHKKVTSNNSEYQGVHPIVSLQSHQESL 59 Query: 56 VPLIQAALK--------------ESGLTAKDI------DAVAYTAGPGLVGALLVGATVG 95 L+ A++ +G DI D V+ T GPG+ L G Sbjct: 60 ATLVGEAIRCLPMQDGELPSEDDRTGPIPVDITTRTLPDFVSVTRGPGMRSNLFTGLDTA 119 Query: 96 RSLAFAWDVPAIPVHHMEGHLLAPMLED-----------------------NP------P 126 + LA AW P + VHHM+ H L L NP P Sbjct: 120 KGLAVAWQKPLVGVHHMQAHALTSRLVSALDAYKELNEPEAECLPNGTIGRNPTQAHVSP 179 Query: 127 EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 +FPF+++L SGGHT LI + + +LG + D A GE DK A+++ Sbjct: 180 DFPFLSVLASGGHTLLIHSASLTDHRVLGSTNDIAIGECLDKIARVV 226 >UniRef50_A4RG35 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4RG35_MAGGR Length = 596 Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 80/284 (28%), Positives = 115/284 (40%), Gaps = 93/284 (32%) Query: 3 VLGIETSCDETGIAIYDDEKGL--LANQLYSQVKLHAD---YGGVVPELASRDHVRKTVP 57 L IETSCD+T +A+ + E+G A L+ Q + AD +GG+ P H Sbjct: 68 TLAIETSCDDTCVALVEKERGPGGAARVLFHQ-RATADNSMFGGINPLPTLESHTALLAK 126 Query: 58 LIQAALK----------------------ESGLTAKDIDAVAYTAGPGLVGALLVGATVG 95 ++++A+ +S + + D V+ T GPG+ AL VG + Sbjct: 127 MVRSAVNALPQDAATGNSSFSTAFTRSKPDSSIPRRLPDFVSVTRGPGMAAALSVGLSTA 186 Query: 96 RSLAFAWDVPAIPVHHMEGHLLAPML---------------------------------- 121 + LA AW VP + VHHM+ HLL P L Sbjct: 187 KGLAVAWKVPLVGVHHMQAHLLTPRLMSAMRKPFYEWEKERAALTREAFVSEKEEKSGSL 246 Query: 122 ----------------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEA 165 E + P +PF LLVSGGHT L+ + Q+ + E AAG+A Sbjct: 247 KKARSSQSDPKAQDPKEYDWPRYPFFTLLVSGGHTMLMRSKNLVQHSTVAEVEGFAAGDA 306 Query: 166 FDKTAK-LLGLDYPG-----GPLLSKMAAQGTAGRFVFPRPMTD 203 DK A+ +L Y G G LL + FVFP+ + D Sbjct: 307 LDKCARAILPPKYQGKTSSFGQLLEE---------FVFPKNLKD 341 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 3/73 (4%) Query: 262 RLVMAGGVSANRTLRAKLAEMMKKRRG---EVFYARPEFCTDNGAMIAYAGMVRFKAGAT 318 RL+M+GGV++N+ LR + M++ +V P C DN AMI +AG+ F+ G T Sbjct: 475 RLLMSGGVASNKFLRYVVRSMLEAYHFNPVQVIGPPPHLCVDNAAMIGWAGLEMFEEGFT 534 Query: 319 ADLGVSVRPRWPL 331 DLGV + +W L Sbjct: 535 TDLGVLPKKKWSL 547 >UniRef50_A0RY43 O-sialoglycoprotein endopeptidase n=4 Tax=Thaumarchaeota RepID=A0RY43_CENSY Length = 237 Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 82/248 (33%), Positives = 123/248 (49%), Gaps = 24/248 (9%) Query: 90 VGATVGRSLAFAWDVPAIPVHHMEGHL-LAPMLEDNPPEFPFVALLVSGGHTQLISVTGI 148 +GA V R+L+ +P PV+H GH+ L +L + P V LLVSGGHT L++ G Sbjct: 1 MGAVVARALSSYHGIPIYPVNHAIGHIELGKLL--TGAQDPLV-LLVSGGHTMLLAFVG- 56 Query: 149 GQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRP--- 205 G++ + GE++D G+ D+ + LG P G + ++AA+ + TD P Sbjct: 57 GRWRVFGETLDITLGQLLDQFGRSLGFPSPCGRQVEELAAESS--------EYTDLPYSV 108 Query: 206 -GLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLV 264 G D SFSGL + AA T G + + + AF A+V + +RAL T + L+ Sbjct: 109 KGNDVSFSGLLS-AAKTAARRGKETASYSLQETAF--AMVAEAV---ERALSFTRKRELM 162 Query: 265 MAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG-V 323 + GGV+AN+ L L ++R +F P + D GA IA G++ A L Sbjct: 163 VVGGVAANKRLAGMLEGACGRQRCRLFVVPPVYSGDCGAQIACTGLLEASIKDGAPLADT 222 Query: 324 SVRPRWPL 331 VR W L Sbjct: 223 FVRQSWRL 230 >UniRef50_D2RJI3 Peptidase M22 glycoprotease n=2 Tax=Acidaminococcus RepID=D2RJI3_ACIFE Length = 319 Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 91/320 (28%), Positives = 142/320 (44%), Gaps = 23/320 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG++TSC T A+ D LL +Q +++ + G+V H R +P + A Sbjct: 7 VLGLDTSCYTTSAALMDLHGHLLGDQ-RRLLRVKPGHRGLVQSEMVFQHTR-NLPDLLEA 64 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L SG+ K A+ +A P + A LVG + RSL +P H H+ Sbjct: 65 LDLSGVQVK---AIGVSAKPRPREESYMPAFLVGLGMARSLGKLMGLPVHRFTHQHNHMF 121 Query: 118 APMLE-DNPPEFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTAKL 172 A + P F+ + +SGG T L+ G E G SID AG+ D+ Sbjct: 122 AGLWSVGKPAPDRFLLVHISGGTTDLLLCERQPDGNFSLEPRGTSIDLHAGQFIDRVGVA 181 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL +P G L K+A + P + R G + S SG T I + G D Sbjct: 182 LGLPFPAGAPLEKLAETASEAH---PLKVWSREG-ELSLSGPCTQTLRAI-EKGEDP--- 233 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 A +A E A+ L ++ ++++AGGVSANR +R +L + + +R+ ++ Sbjct: 234 AALALGVEQAIGKALARTISWVCEKEQLSQVLLAGGVSANREIRRQLEDFLGQRQIGLWA 293 Query: 293 ARPEFCTDNGAMIAYAGMVR 312 P + D A+A ++R Sbjct: 294 PDPRYSVDGAVGNAWAALLR 313 >UniRef50_Q0V4Z5 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0V4Z5_PHANO Length = 497 Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 77/238 (32%), Positives = 108/238 (45%), Gaps = 46/238 (19%) Query: 1 MRVLGIETSCDETGIAIYDD--EKGLLANQLYSQVKL---HADYGGVVPELASRDHVRKT 55 + L IETSCD+T +AI + E G QL+ K+ +A+Y GV P ++ R H Sbjct: 28 LMTLAIETSCDDTSVAIVEKKVENGRAVAQLHFHKKVTANNAEYQGVHPLVSLRSHQENL 87 Query: 56 VPLIQAAL--------------KESGLTAK------DI------DAVAYTAGPGLVGALL 89 L+ A+ + GL A+ D+ D V+ T GPG+ L Sbjct: 88 ADLVSEAISHLPPKTASRDHDFEHGGLEAQRPEAVLDVTKKRLPDFVSVTRGPGMRSNLF 147 Query: 90 VGATVGRSLAFAWDVPAIP---VHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVT 146 G + LA AW A+ V +E P LE P+FPF+++L SGGHT LI Sbjct: 148 TGLDTAKGLAVAWQAHALTPRLVSALEPSA-TPTLE---PDFPFLSVLASGGHTLLIQSA 203 Query: 147 GIGQYELLGESIDDAAGEAFDKTAKLL--------GLDYPGGPLLSKMAAQGTAGRFV 196 + + LLG + D A GE DK A++L G LL K A +G A + V Sbjct: 204 SLNDHHLLGTTNDIAVGEYLDKVARILLPTELLQSTRSTMYGALLEKFAFEGNASQTV 261 Score = 64.3 bits (155), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 3/77 (3%) Query: 261 KRLVMAGGVSANRTLRAKLAEMMKKR---RGEVFYARPEFCTDNGAMIAYAGMVRFKAGA 317 + +V+AGGV+AN LR LA + R +++ P FCTDN AMIA+ G+ F+AG Sbjct: 414 RSVVLAGGVAANSFLRHILASTLCARGFSHINLYFPPPSFCTDNAAMIAWTGIEMFEAGH 473 Query: 318 TADLGVSVRPRWPLAEL 334 T L + +WPL +L Sbjct: 474 TDTLSIRAIRKWPLNQL 490 >UniRef50_C9LLA9 Glycoprotease family protein n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLA9_9FIRM Length = 319 Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 90/329 (27%), Positives = 145/329 (44%), Gaps = 37/329 (11%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG--GVVPELASRDHVRKTVPLI 59 + LGI+TSC T A+YD +G++ S++ L G G+ HVR +P+I Sbjct: 3 KFLGIDTSCYTTSAAVYDSTEGIVGE---SRIILSVKAGKRGLSQSEMVFQHVR-NLPVI 58 Query: 60 QAALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L+ I+ + + P + A LVG + SL+ VP H E Sbjct: 59 LGQLEP---WIDQINGIGVSVFPRRRADSYMPAFLVGKGMAESLSHVLRVPVFEFSHQEN 115 Query: 115 HLLAPMLEDNPPEF---PFVALLVSGGHTQLISV---TGIGQYELLGESIDDAAGEAFDK 168 H LA + N PE PF + +SGG ++SV I Q L S D AG+ D+ Sbjct: 116 HALAAI--QNMPEIWGTPFYMMHLSGGTQDVLSVEWEKDIMQIVDLIHSADITAGQFIDR 173 Query: 169 TAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 LG+ +P GP + ++A + ++ P+ + FSF+G + A RD T Sbjct: 174 VGVSLGMPFPAGPSMERLAMKHQQ---LYKVPVANVKN-GFSFAGPE---AQVQRDIQTK 226 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQ-TGF---KRLVMAGGVSANRTLRAKLAEMMK 284 T DIA ++ +L + LD GF + + GGV +N LR + E+ + Sbjct: 227 RYTPEDIAYGVFSSIGKSL----HKVLDSYNGFIEGRTFIAVGGVMSNGYLRKSITEICR 282 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRF 313 + +A ++ +DN A+ +R+ Sbjct: 283 HKSLHPCFAEVKYSSDNATGNAFGAFMRY 311 >UniRef50_D2EF31 O-sialoglycoprotein endopeptidase (Fragment) n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EF31_9EURY Length = 242 Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 63/190 (33%), Positives = 100/190 (52%), Gaps = 15/190 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIE++ G+ I +++K ++AN+ + L GG++P A+ H + +I+ AL Sbjct: 27 LGIESTAHTFGVGISENDK-IIANE---RDTLKPTSGGIIPREAAMHHFKLAPEIIKRAL 82 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL----LAP 119 +SGL KDID A++ GPG++ AL VGA V L+ + I V+H HL L Sbjct: 83 DKSGLKLKDIDLFAFSQGPGIIPALKVGAQVSTFLSNKYKKKLIGVNHCIAHLEIARLYT 142 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L+D V L VSGG+TQ+I+ G Y + GE+ D G DK + + + +P Sbjct: 143 KLKDP------VMLYVSGGNTQIITYYN-GTYIVFGETQDIGIGNLIDKIGRRMDIPFPD 195 Query: 180 GPLLSKMAAQ 189 G + + + Sbjct: 196 GTKIEETCHE 205 >UniRef50_Q8IJ99 Glycoprotease, putative n=5 Tax=Plasmodium RepID=Q8IJ99_PLAF7 Length = 598 Score = 91.3 bits (225), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 53/184 (28%), Positives = 97/184 (52%), Gaps = 13/184 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 +LGIE S ++ GI+I +++ +L N + + ++ G G +P S H + +I++ Sbjct: 17 ILGIEGSANKLGISIINEDMNILVNMRRTYI---SEIGCGFIPREISAHHKYYIIDMIKS 73 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 LK+ + DI + YT GPG+ AL +G + + L +++P + V+H H+ + Sbjct: 74 CLKKVNIKISDITLICYTKGPGIGSALYIGYNIAKILYSYFNIPVVGVNHCIAHIEMGIF 133 Query: 122 ED---NPPEFPFVALLVSGGHTQLISVTG-IGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 NP + L VSG +TQ+I +YE++GE++D A G D++A++L + Sbjct: 134 ITKLYNP-----IVLYVSGSNTQIIYYNDHKKKYEIIGETLDIAIGNVIDRSARILKISN 188 Query: 178 PGGP 181 P Sbjct: 189 APSP 192 Score = 54.7 bits (130), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 25/106 (23%), Positives = 54/106 (50%), Gaps = 4/106 (3%) Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 +++ + I + + + L+ +RA+ T K +++ GGV N L+ + +M K++ Sbjct: 483 EEKRKIQICYSLQHHIFSMLIEITERAIAFTNSKEVIIVGGVGCNLFLQNMMKKMAKQKN 542 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL----GVSVRPRW 329 ++ + +C DNGAMIAY G + + D+ +++ R+ Sbjct: 543 IKIGFMDHSYCVDNGAMIAYTGYLEYLHAKNKDIYNFNNITIHQRY 588 >UniRef50_A7VX43 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A7VX43_9CLOT Length = 315 Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 88/310 (28%), Positives = 141/310 (45%), Gaps = 26/310 (8%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TS T +A+YD + + Q+ + + G+ A HV++ +P + L Sbjct: 5 LGIDTSNYTTSLALYDAQAHEIC-QVKRLLPVKEGEKGLRQSDAVFHHVQQ-LPELMDKL 62 Query: 64 KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + G K + AV +A P + VG T R L+ A +VP H GH+ A Sbjct: 63 WKPG-CGKALSAVGVSARPRDAEGSYMPCFTVGLTYARLLSTALEVPFYTFSHQAGHIAA 121 Query: 119 PMLEDNPP---EFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTAK 171 + + PF+A VSGG T+ + V+ I +L +++D AG+ D+ Sbjct: 122 ALYSSGSLSLLKQPFLAFHVSGGTTEALLVSPDDQRILSCQLAAKTLDLNAGQLIDRVGV 181 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 +LGL +P GP L ++A + RP G D SG + +R+ G + Sbjct: 182 MLGLGFPAGPALERLALTCESKGLRGARPAMK--GNDCCLSGGENLCIKLLRE-GKEPAY 238 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 A + A +D + C+ L++ G +V AGGV +N LR E K+ G +F Sbjct: 239 IAAFCLEYVKAALDQM---CRGLLERYGRLPVVFAGGVMSNSILR----EYFSKQYGAMF 291 Query: 292 YARPEFCTDN 301 A P+F +DN Sbjct: 292 -AEPQFSSDN 300 >UniRef50_A6TR37 O-sialoglycoprotein endopeptidase n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TR37_ALKMQ Length = 330 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 77/320 (24%), Positives = 149/320 (46%), Gaps = 19/320 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGI+TS T +AI + + LL+ + S + + G+ A H+ K +P++ Sbjct: 8 ILGIDTSNYMTSLAIMNLQGALLSEE-RSLLPVKTGNLGLRQSDALFHHI-KNLPVLCKK 65 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L + + + +I ++ + P + L + S+A +VP H EGH+ Sbjct: 66 LMQQ-VDSINIVGISASVKPRPLADSYMPVFLASQSFATSMASLMNVPFYSFSHQEGHIE 124 Query: 118 APMLED-NPPEFPFVALLVSGGHTQLISVTGI-GQY--ELLGESIDDAAGEAFDKTAKLL 173 A F+ L +SGG T+++ V +Y E++G S D +AG+ D+ L Sbjct: 125 AGFWSQARTCTQEFLVLHISGGTTEMLKVVPYDNRYDIEIVGGSKDISAGQLIDRIGVRL 184 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + +P GP L ++ + + P ++ + G +FSGL+T + + Q Sbjct: 185 DMPFPAGPHLESLSLEWQGPKIKLP--ISVKEGW-VNFSGLETHITRLLNQEYSSQQ--- 238 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 IA + + +L++ K A Q+ K ++ GGV++N+ +R + + + EV + Sbjct: 239 -IASSLFHTIGQSLVLMIKTAKFQSLIKTALVVGGVASNQQIRTLIEKELSSENIEVLFG 297 Query: 294 RPEFCTDNGAMIAYAGMVRF 313 + ++C+DN IA G+ + Sbjct: 298 QTQYCSDNAVGIAALGVKSY 317 >UniRef50_Q0AZF6 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AZF6_SYNWW Length = 326 Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 91/320 (28%), Positives = 153/320 (47%), Gaps = 27/320 (8%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TS + +A+ D+E+ ++A++ +++ A G+ A H+ K +P + A L Sbjct: 5 LGIDTSAYTSSLALVDEEQNIIADERMI-LQVGAGKRGLRQSEAFFQHI-KNLPFLFARL 62 Query: 64 KESGLTAKDIDAVAYTAGPGLV-GALLVGATVGRSLAFAWD----VPAIPVHHMEGHLLA 118 S + A+ +A P V G+ + + G S A +P H EGH++A Sbjct: 63 --SSYFDAPVKAIGASAWPRRVEGSYMPVFSAGFSQAVVLSSFTGIPLYSFSHQEGHIIA 120 Query: 119 PMLEDNP--PEFPFVALLVSGGHTQLISVTGIGQYELLGES-----IDDAAGEAFDKTAK 171 + + F+A+ SGG ++L+ V Q LL S +D AG+ D+ Sbjct: 121 GIKGNEALLGRAEFLAVHFSGGTSELLHVRQ-QQGGLLDISPALAGLDLHAGQLVDRVGV 179 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRF-VFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ 230 +GLD+P G L KMA Q + + P ++D+ FSFSG +T A R + Sbjct: 180 AMGLDFPCGSELEKMARQSSGENLPLMPSSVSDK---GFSFSGAETRA----RKLMAEGI 232 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKL-AEMMKKRRG- 288 + DIA A + +TL + D+ G K +++ GGV AN ++ +L A + G Sbjct: 233 SYPDIALASLRCIANTLEKSILQESDKKGIKDVLLVGGVMANSIIKERLQARLEHPAVGL 292 Query: 289 EVFYARPEFCTDNGAMIAYA 308 ++F+A P +DN +A A Sbjct: 293 KLFFASPRLSSDNAVGVALA 312 >UniRef50_C8WXH0 Peptidase M22 glycoprotease n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WXH0_ALIAD Length = 329 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 91/327 (27%), Positives = 147/327 (44%), Gaps = 26/327 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG++TS T + D G + + +++ G+ A+ HV + P + A Sbjct: 8 VLGVDTSNYTTSVCAVDAVHGRMVAEARRPLRVPRGERGLRQSEAAFQHV-QNFPTVMAE 66 Query: 63 LKESGLTAKDID------AVAYTAGPGLVGALLV---GATVGRSLAFAWDVPAIPVHHME 113 L + L A+ + AV+ P + V G V SLA + VP H E Sbjct: 67 LLDR-LMAEGVRPAWRRVAVSVRPRPWASSYMPVFQSGFAVAASLAHSLGVPLTRTSHQE 125 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQ-LISVTGIGQYEL--LGESIDDAAGEAFDKTA 170 GHL A P PFVA+ +SGG +I+ Y + +GE++D G+ D+ Sbjct: 126 GHLAAAEYFAPMPGAPFVAVHMSGGTCDVVIARRTPSGYAITRVGEALDLHPGQLVDRVG 185 Query: 171 KLLGLDYPGGPLLSKMAAQ-GTA-GRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 LGL +P GP L ++A + GT+ G + P+ G SFSG T A ++ Sbjct: 186 VALGLPFPAGPHLEQLARRCGTSPGELLLKAPVR---GASMSFSGPLTAALRAVQAGAPA 242 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVM-AGGVSANRTLRAKLAEMMKKRR 287 ++ARA E + ++ + A+ R V+ AGGV++N+ ++ + +++R Sbjct: 243 H----EVARAVEACIARSVAKAVEYAVRHAQTARHVLIAGGVASNQFIQCTIRSRLERRV 298 Query: 288 G--EVFYARPEFCTDNGAMIAYAGMVR 312 V +A PEF DN +A G R Sbjct: 299 PGIHVAFAPPEFARDNALGVATIGYWR 325 >UniRef50_Q2RIB0 O-sialoglycoprotein endopeptidase n=5 Tax=Clostridia RepID=Q2RIB0_MOOTA Length = 321 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 72/228 (31%), Positives = 116/228 (50%), Gaps = 13/228 (5%) Query: 90 VGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDN-PPEFPFVALLVSGGHTQLISVT-- 146 V A GR LA A VP H EGH+ A + P F+A+ +SGG ++++ V+ Sbjct: 92 VAAGQGRILAAALGVPFRATTHQEGHIQAGLWSSGWQPSDSFLAVHLSGGTSEVLLVSRK 151 Query: 147 -GIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDR- 204 G E LG ++D AG+ D+ L+GL++P GP L ++A + AG + +T Sbjct: 152 PGGFTIEKLGGTLDLHAGQLVDRAGVLMGLEFPAGPALERLARE--AGPEMEKVHLTSAV 209 Query: 205 PGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLV 264 G +FSFSG + A + A +ARA E + +TL + A++ TG + ++ Sbjct: 210 RGYNFSFSGPASQAERLL----AAGAPPAAVARAVEQCIANTLERVLRPAVEATGLRDIL 265 Query: 265 MAGGVSANRTLRAKLAEMMKKR--RGEVFYARPEFCTDNGAMIAYAGM 310 + GGV+AN LR +L ++ + +A PE +DN +A G+ Sbjct: 266 IVGGVAANNYLRQRLRHRLEHPAVAARLHFAAPEHSSDNAIGVALLGL 313 >UniRef50_B0TEI7 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TEI7_HELMI Length = 385 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 93/330 (28%), Positives = 142/330 (43%), Gaps = 63/330 (19%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGI+TSC T +A + LLA Q + + G+ A H R+ +P + A Sbjct: 9 VLGIDTSCYTTSVAFASLDGRLLA-QKRQLLPVKPGERGLRQGDAFFLHGRQ-LPHVMEA 66 Query: 63 L---------KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIP 108 L ++G ++AVA + P + L G VGRS+A A VP Sbjct: 67 LFADLRCSGEAKAGREGLRVEAVAASTRPRPEEGAYLPVFLAGEAVGRSVAAAQGVPFFA 126 Query: 109 VHHMEGHLLAPM--LEDNPP-----EFPFVALLVSGGHTQLISV-----TGIGQYELLGE 156 H EGH++A + LED + F+++ +SGG T+L+ V + + E LG Sbjct: 127 TTHQEGHIMAGIASLEDREQAEALLKKGFLSVHLSGGTTELLRVRFDGASAVFSIEKLGA 186 Query: 157 SIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRF--------------VFPRPMT 202 + D AG+ D+ LGL +P GP L +AAQ GR P P + Sbjct: 187 TTDLHAGQLVDRVGVALGLPFPAGPHLEALAAQCDGGRCAAEGAAEGSTEAIEAIPFPAS 246 Query: 203 DRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD--------------------IARAFEDA 242 + G + SFSG + A I ++ + IAR E Sbjct: 247 VK-GYNVSFSGAEAQALRLIEKWRKANEAASPAAIATLPGDPAHPGIPALPAIARGIEGC 305 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSAN 272 + TL +RA+ +TG + +++ GGV+AN Sbjct: 306 LASTLEKILRRAIAETGCRDVLIVGGVAAN 335 >UniRef50_UPI0000DD8AA6 Os01g0295900 n=1 Tax=Oryza sativa Japonica Group RepID=UPI0000DD8AA6 Length = 288 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 82/327 (25%), Positives = 131/327 (40%), Gaps = 102/327 (31%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T A+ + +L+ + SQ L +GGV P++A H+ ++Q A Sbjct: 18 MLGIETSCDDTAAAVVRGDGEILSQVVSSQEDLLVRWGGVAPKMAEEAHLLAIDRVVQKA 77 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L + ++ D+ AVA TVG Sbjct: 78 LDNANVSESDLSAVA--------------VTVG--------------------------- 96 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P ++L + G T I+ + Q K +K++ P Sbjct: 97 ------PGLSLCLRGYLTNHINCSWCSQSS---------------KNSKIIS------PA 129 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDD----------- 229 ++ G G F M +FS++GLKT I R+ TDD Sbjct: 130 YCWSSSYGGTG-ISFQVSMRQHKDCNFSYAGLKTQVRLAIESRNISTDDIPISSATKDDR 188 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 Q RA+IA +F+ ++K V++GGV++N+ +R L ++ +K + Sbjct: 189 QIRANIAASFQ-------LLK-------------VVSGGVASNQYVRTHLNQIAEKNGLQ 228 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAG 316 + P CTDNG MIA+ G+ F AG Sbjct: 229 LVCPPPRLCTDNGVMIAWTGIEHFIAG 255 >UniRef50_B9PG42 O-sialoglycoprotein endopeptidase, putative n=3 Tax=Toxoplasma gondii RepID=B9PG42_TOXGO Length = 1323 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 45/98 (45%), Positives = 63/98 (64%), Gaps = 1/98 (1%) Query: 3 VLGIETSCDETGIAIYDDEKG-LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCD+T + I D E G +LAN Q +L YGGV P A+ H R+ +++ Sbjct: 147 ILGIETSCDDTCVGIVDWESGRILANICTPQPELLIKYGGVHPSEAAAAHDRRMQSVVRN 206 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLA 99 AL+E+G++ DID +A+T GPGLV L VGA+ +A Sbjct: 207 ALQEAGVSLLDIDVIAFTRGPGLVPCLSVGASAALEIA 244 Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 40/126 (31%), Positives = 66/126 (52%), Gaps = 8/126 (6%) Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKT-FAANTIRDNGTDDQTRADIA 236 GG ++ +MAA G P ++ +P L+FSFSG+K+ FAA + D++ + D A Sbjct: 745 GGAMMEQMAASGNDKAVPLPNMLSLKPKTLNFSFSGMKSAFAAAVSKMGRQDEKAKCDFA 804 Query: 237 RAFEDAVVDTLMIKCKRALDQTGF-----KRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + + AV L + ++ + F +RL + GGVS N TLR +L ++ + RG+ Sbjct: 805 ASLQAAVFKHLEDQLRKTMWLYEFLEDFPRRLAVVGGVSCNETLRRRLRKLCES-RGDTS 863 Query: 292 YARPEF 297 EF Sbjct: 864 VHEQEF 869 >UniRef50_C5KJ57 Putative uncharacterized protein (Fragment) n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KJ57_9ALVE Length = 203 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 52/141 (36%), Positives = 79/141 (56%), Gaps = 19/141 (13%) Query: 206 GLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALD-----QTGF 260 G DFSF+GLKT + I + G ++ D+A +F+ VD L+ + RA+D Sbjct: 21 GCDFSFAGLKTSMRHLI-EGGK--YSKPDMAASFQKRCVDHLVERAGRAIDWALEIDDSI 77 Query: 261 KRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP-EFCTDNGAMIAYAGMVRFKAG--- 316 K LV+AGGV+AN+++R+ + E+ K++ G + P CTDNG M+A+ + K G Sbjct: 78 KDLVVAGGVAANKSVRSNMQELAKEK-GLTLHCPPTRLCTDNGTMVAWNAIEHLKEGLYE 136 Query: 317 ------ATADLGVSVRPRWPL 331 +A+ V VRPRWPL Sbjct: 137 RAPCTAESAEKFVEVRPRWPL 157 >UniRef50_D1AUR9 Putative endopeptidase n=1 Tax=Anaplasma centrale str. Israel RepID=D1AUR9_ANACI Length = 142 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 41/96 (42%), Positives = 65/96 (67%), Gaps = 4/96 (4%) Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH-L 116 L+ A+ +GL D+ A+A T+GPGLVG+L+VG + +++++ P I V+H+E H L Sbjct: 27 LVSRAMDSAGLGFSDLSAIAVTSGPGLVGSLIVGVMLAKAISYVAGKPIIAVNHLEAHAL 86 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQ--LISVTGIGQ 150 +A M+ D+ EFPF+ L++SGGH Q ++ V GIG Sbjct: 87 VARMVRDD-LEFPFLVLIISGGHCQFLVVCVCGIGS 121 >UniRef50_B0AAV1 Putative uncharacterized protein n=2 Tax=Clostridium RepID=B0AAV1_9CLOT Length = 326 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 77/333 (23%), Positives = 142/333 (42%), Gaps = 35/333 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 V+GI+TSC T IA K ++ N+ ++ L+ D L + V K V I Sbjct: 9 VIGIDTSCYTTSIAAISLNKEIIFNE---KIMLNVDTNS--KGLRQSEAVFKHVSNIGQI 63 Query: 63 LKESGLTAKDIDAVAYTAGPG---LVGALL----VGATVGRSLAFAWDVPAIPVHHMEGH 115 + +D + V A + G+ + VG +G+ L+ + P H E H Sbjct: 64 SENIAEKLRDYNIVGVCASEKPRPIKGSYMPVFTVGLNIGKLLSSTHNCPFFKTSHQENH 123 Query: 116 LLAPMLEDNP-PEFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTA 170 + + +L N + F+A+ +SGG T+++ V G ++E++G + D + G+ D+ Sbjct: 124 IESSLLGKNLLDKNRFIAVHMSGGTTEIVLVNKGKCGKYEFEIIGGTKDVSFGQLIDRLG 183 Query: 171 KLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFS-------FSGLKTFAANTIR 223 L ++P G + K A + T GL S SG++ + Sbjct: 184 VKLSYNFPCGKYIDKNALE---------YEKTIENGLKTSVKEGYMNLSGIENQLDKIMS 234 Query: 224 DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM 283 + D ++ +++ DA++ + ++ G +V AGGVSA++ + L + + Sbjct: 235 NQKEID--KSFLSKLLMDAIIRNMFKSLSYLCEKHGVYEVVFAGGVSASKYISKNLTQKL 292 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 KK R + + + TDN A G+ G Sbjct: 293 KKYRIKTHFTHADLATDNAVGCALIGIQNLNLG 325 >UniRef50_A9UYP5 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UYP5_MONBE Length = 230 Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 49/142 (34%), Positives = 72/142 (50%), Gaps = 10/142 (7%) Query: 199 RPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRAL--- 255 R D DFSFSGLKT A N + D+ +A +F+ + D L+++ +RAL Sbjct: 66 RRTQDHSNCDFSFSGLKTRAINLSSEYAKRDEL-PLLAASFQRTIADHLLVRLERALRFC 124 Query: 256 DQTGFK--RLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRF 313 DQ G + R V AGGV N +R +L + V + P C DNG MIA+AG++ F Sbjct: 125 DQQGRRPRRFVAAGGVLCNAYIRQRLHAFARFHDLPVEFPAPPLCVDNGVMIAWAGLLHF 184 Query: 314 KAGATA----DLGVSVRPRWPL 331 G ++ + P+WP+ Sbjct: 185 LRGTSSVARDPQALRYHPKWPI 206 >UniRef50_B2AYU1 Predicted CDS Pa_1_12230 (Fragment) n=1 Tax=Podospora anserina RepID=B2AYU1_PODAN Length = 290 Score = 77.0 bits (188), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 65/210 (30%), Positives = 99/210 (47%), Gaps = 43/210 (20%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKL---HADYGGVVPELASRDHVRKTVPLI 59 + IETSCD+T +AI EK A +L ++ H ++ G+ P +AS+ H + L+ Sbjct: 42 TIAIETSCDDTCVAIL--EKAGPAARLQFNKRIPSNHVEFKGIHPTIASKSHEIQLAKLV 99 Query: 60 QAALKE-----------SGLTAKDI-----------DAVAYTAGPGLVGALLVGATVGRS 97 A++ ++ +D D V+ T GPG L VG V + Sbjct: 100 NEAVQSLPKHTNHSPEVKTISIRDPQTGKSTPRRLPDFVSVTRGPGFPRCLDVGLGVAKG 159 Query: 98 LAFAWDVPAIPVHHMEGHLLAPMLED----------------NPPEFPFVALLVSGGHTQ 141 L+ AW VP + VHHM+GH L P L+ P+FPF+ LL SGGHTQ Sbjct: 160 LSVAWQVPFLGVHHMQGHALTPRLDHALQQPFPPSSSTPSSKLSPKFPFLTLLASGGHTQ 219 Query: 142 LISVTGIGQYELLGESIDDAAGEAFDKTAK 171 L+ T + + +L + + G+ DK A+ Sbjct: 220 LLLSTTLTTHTILATVTNISLGDMLDKAAR 249 >UniRef50_A5KDZ1 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Plasmodium vivax RepID=A5KDZ1_PLAVI Length = 574 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 45/181 (24%), Positives = 91/181 (50%), Gaps = 7/181 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 +LG+E S ++ G++I + +L N + + ++ G G +P + H + +I+ Sbjct: 20 ILGLEGSANKLGVSIINSNFEILVNMRRTYI---SEIGCGFIPRQINAHHKYYIIEMIKD 76 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L + + D+ + YT GPG+ AL + + + + +++P I V+H H+ + Sbjct: 77 CLTKLKIKITDVHLICYTKGPGIGSALYIAYNISKFFSLLFNIPVIGVNHCIAHIEMGIF 136 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQ-YELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + L VSG +TQ+I + YE++GE++D A G D++A++L + Sbjct: 137 ITKL--YHPIILYVSGSNTQIIYFNDHKKRYEIIGETLDIAIGNVIDRSARILRISNSPS 194 Query: 181 P 181 P Sbjct: 195 P 195 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 29/108 (26%), Positives = 55/108 (50%), Gaps = 5/108 (4%) Query: 227 TDDQTRA-DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK 285 TD++ R I + + + L+ +RA+ T K +++ GGV N L+ + +M K+ Sbjct: 457 TDEEKRKIQICYSLQHHIFSMLIEITERAIAFTNSKEVIIVGGVGCNVFLQNMMKKMAKQ 516 Query: 286 RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL----GVSVRPRW 329 + ++ + +C DNGAMIAY G + F ++ +S+ R+ Sbjct: 517 KNIKIGFMDHSYCVDNGAMIAYTGYLEFANTKNREIYGFDNISIHQRY 564 >UniRef50_Q18B67 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Clostridium difficile RepID=Q18B67_CLOD6 Length = 356 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 75/350 (21%), Positives = 145/350 (41%), Gaps = 45/350 (12%) Query: 3 VLGIETSCDETGI-AIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++GI+TSC T I AI D+K + ++ +V+ ++ G+ A H+ + ++ Sbjct: 8 IIGIDTSCYTTSIAAISLDKKVIFNEKIMLEVRDNSK--GLRQSEAVFQHIN-NLGILSD 64 Query: 62 ALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 +K S +++ V + P + VG G+ L+ + H E H+ Sbjct: 65 RIK-SFKDKFNVEGVCSSKKPRPVENSYMPVFNVGHNFGKLLSSIYGCRFYETTHQENHI 123 Query: 117 LAPMLEDN-PPEFPFVALLVSGGHTQL-----------ISVTGIGQ-------------- 150 A +L F+++ +SGG T++ + T +G+ Sbjct: 124 EASLLNSKLKNNNKFISVHMSGGTTEILLTSKQDSHHNVCDTNLGKIAKISIKKDDKSKL 183 Query: 151 -------YELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTD 203 +++G S D + G+ D+ LG +P G L + A + + Sbjct: 184 YNNFGYNIDIIGGSKDISFGQLIDRVGIKLGYKFPSGKYLDENALNCNL-KIESGLKTSV 242 Query: 204 RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRL 263 R G + SGL+ I DNG + + I++ D+VV + + + Sbjct: 243 RDGY-MNLSGLENQVNKIINDNGDNTNQKEYISKLVLDSVVRNMFKSLVYLCETYNVNEV 301 Query: 264 VMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRF 313 + AGGVSA++ + +L+ ++K+ E ++ P++ TDN A G+ F Sbjct: 302 IFAGGVSASKYILRELSMKLRKKHIEAYFTEPQYSTDNAVGCAIIGLNNF 351 >UniRef50_B9HH45 Predicted protein n=7 Tax=Eukaryota RepID=B9HH45_POPTR Length = 139 Score = 70.5 bits (171), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 36/116 (31%), Positives = 65/116 (56%), Gaps = 2/116 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG E S ++ G+ + + +L+N ++ + G +P ++ H++ +PLI+ Sbjct: 4 MTALGFEGSANKIGVGVDTLDGTILSNPRHTYITPAGQ--GFLPRETAQHHLQHVLPLIK 61 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 +AL+ +G+T+ +ID + YT GPG+ L V A V R L+ W P + V+H H+ Sbjct: 62 SALETAGITSDEIDCLCYTKGPGMGAPLQVSAVVVRVLSQLWKKPIVAVNHCVAHI 117 >UniRef50_D1IQV9 Whole genome shotgun sequence of line PN40024, scaffold_2082.assembly12x (Fragment) n=4 Tax=Eukaryota RepID=D1IQV9_VITVI Length = 151 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 39/120 (32%), Positives = 63/120 (52%), Gaps = 1/120 (0%) Query: 207 LDFSFSGLKTF-AANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVM 265 +D SFSGL ++ A + ++ T AD+ + ++ V L+ +RA+ K +++ Sbjct: 1 MDVSFSGLLSYIEATAVEKLQNNECTPADLCYSLQETVFAMLVEITERAMAHCDKKDVLI 60 Query: 266 AGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSV 325 GGV N L+ + M +R G +F +C DNGAMIAY G++ + GAT L S Sbjct: 61 VGGVGCNERLQEMMRVMCSERSGRLFATDDRYCIDNGAMIAYTGLLAYAHGATTPLEEST 120 >UniRef50_UPI000187E9E4 hypothetical protein MPER_08009 n=1 Tax=Moniliophthora perniciosa FA553 RepID=UPI000187E9E4 Length = 276 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 64/282 (22%), Positives = 113/282 (40%), Gaps = 43/282 (15%) Query: 4 LGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 LG+E S ++ G I D +L+N ++ + + G P + H + +I Sbjct: 21 LGLEGSANKLGAGIIKHSEDGSATVLSNIRHTYITPPGE--GFQPRDTALHHREWAMKVI 78 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 L ++ ++ D+D + YT GPG+ L A V R+L+ +D P + V+H GH+ Sbjct: 79 DECLTKAEVSMHDLDCICYTKGPGMGAPLQSVALVARTLSMLFDKPIVGVNHCVGHI--- 135 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + ++G ++ G+Y + F L D Sbjct: 136 ----------EMGREITGAQNPVVLYVSRGEY---------PSDSVFAAMLSYLWRD--T 174 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGL----------KTFAANTIRDNGTDD 229 G + + GR + P P + G+D S SG+ K F D D+ Sbjct: 175 GHCWYNIEQESKKGRRLLPLPYATK-GMDISLSGVLSSVEAYTNDKMFRQTPTSDEEKDE 233 Query: 230 Q--TRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGV 269 T AD+ + ++ V L+ +RA+ G K +++ GGV Sbjct: 234 SVITPADLCFSLQETVFAMLVEITERAMAHIGSKEVLIVGGV 275 >UniRef50_C7H6X1 Glycoprotease family protein n=2 Tax=Faecalibacterium prausnitzii RepID=C7H6X1_9FIRM Length = 312 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 57/219 (26%), Positives = 98/219 (44%), Gaps = 17/219 (7%) Query: 87 ALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEF---PFVALLVSGGHTQLI 143 L G + + A A +P I H +GH A + E + +SGG T L+ Sbjct: 91 CFLAGVSAATAFAQARGIPLIHTTHQQGHAAAALFAAKGEELFRQKVLLFHISGGTTDLL 150 Query: 144 SVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFP-RPMT 202 + + LG S D AG+A D+ LG +P G +S++AA P +P + Sbjct: 151 LCNEVKEITTLGTSTDLYAGQAVDRVGVKLGFGFPAGVEVSRLAALCEE-----PIKPRS 205 Query: 203 DRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKR 262 G+ S SGL+ N + + G +T + + V DT++ K A + Sbjct: 206 SVKGMQCSLSGLEN-QCNALLNEG---KTPEYVCKYCLLCVADTVVKMTKAAQKEYPGLP 261 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 +V AGGV ++ +RA +++R +V++ ++ +DN Sbjct: 262 VVCAGGVMSSDIIRA----WVQQRLPQVYFVPGQYSSDN 296 >UniRef50_B8I821 Peptidase M22 glycoprotease n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I821_CLOCE Length = 236 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 44/163 (26%), Positives = 78/163 (47%), Gaps = 26/163 (15%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+L ++TS + AI +DE + + + + G + H ++ +P++Q Sbjct: 1 MRILAVDTSTNVASAAILEDEVII--------GEYNCNRG--------KTHSQRLMPMVQ 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG------ 114 ++ +GLT DIDA + + GPG L +G T +++AFA + P I VH ++ Sbjct: 45 HLMETAGLTVSDIDAFSASIGPGSFTGLRIGVTTIKAMAFAAEKPVISVHTLDALAYNIP 104 Query: 115 ---HLLAPMLE-DNPPEFPFVALLVSGGHTQLISVTGIGQYEL 153 +L+ PM++ N F + + G +L GI EL Sbjct: 105 FAENLVCPMIDARNNQVFTAIYRFIGGKLERLTEYLGIPVTEL 147 >UniRef50_P43990 Probable M22 peptidase homolog HI0388 n=24 Tax=Pasteurellaceae RepID=Y388_HAEIN Length = 236 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 17/112 (15%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +L ++TS + +A+ LY K H + ELA R H ++ +P+I Sbjct: 4 LTLLALDTSTEACSVAL-----------LYRGEKTH------INELAQRTHTKRILPMID 46 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHM 112 L SGL +DA+A+ GPG + VGA + + LAF D+P IP+ ++ Sbjct: 47 EILANSGLGLNQVDALAFGRGPGSFTGVRVGAGIAQGLAFGADLPVIPISNL 98 >UniRef50_Q0JNG2 Os01g0295900 protein n=5 Tax=Oryza sativa RepID=Q0JNG2_ORYSJ Length = 133 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 30/79 (37%), Positives = 44/79 (55%), Gaps = 9/79 (11%) Query: 264 VMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG------- 316 V++GGV++N+ +R L ++ +K ++ P CTDNG MIA+ G+ F AG Sbjct: 25 VVSGGVASNQYVRTHLNQIAEKNGLQLVCPPPRLCTDNGVMIAWTGIEHFIAGRFDDPPA 84 Query: 317 --ATADLGVSVRPRWPLAE 333 DL +RPRWPL E Sbjct: 85 VDEPDDLQYDLRPRWPLGE 103 >UniRef50_D1PKV9 Glycoprotease family protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PKV9_9FIRM Length = 315 Score = 60.8 bits (146), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 83/314 (26%), Positives = 132/314 (42%), Gaps = 22/314 (7%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGI+TS T +A++D G + + + A G+ A H +P + Sbjct: 1 MLTLGIDTSNYATSLAVFDTNAGEVVCDCKKFLPVKAGQMGLRQSDALFHHT-SALPQML 59 Query: 61 AALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L E ++ I AV +A P + L G + A A +P H +GH Sbjct: 60 LELGEKTDLSR-IGAVGVSAKPRPVEGSYMPCFLAGVNTATAFALARKIPMFKTTHQQGH 118 Query: 116 LLAPMLEDNPPE-FPFVALL--VSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL 172 + A + F AL+ VSGG T L+ G LG S D AG+A D+ Sbjct: 119 IAAALFATGVHSLFMQEALVFHVSGGTTDLLLCHGADTVVPLGTSSDLYAGQAVDRLGVK 178 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LG +P G +S+ AA A + RP G++ S SGL+ N + + G + Sbjct: 179 LGYPFPAGVYVSEQAAL-CAEKI---RPKVSVRGMECSLSGLEN-QCNRMLEEG---KNA 230 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 + + + + +TL+ AL + ++ AGGV ++ +R + M + G F Sbjct: 231 SYVCKYCLLCIGETLVRMAGTALQEHPGLPVIFAGGVMSSDLIRTYV---MHRVPGAHFV 287 Query: 293 ARPEFCTDNGAMIA 306 +F +DN IA Sbjct: 288 PG-KFASDNAIGIA 300 >UniRef50_UPI00019087BD O-sialoglycoprotein endopeptidase n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI00019087BD Length = 120 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 37/81 (45%), Positives = 45/81 (55%), Gaps = 3/81 (3%) Query: 253 RALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP-EFCTDNGAMIAYAGMV 311 R +D+ +R + GGV+AN LRA L + K G F A P CTDN MIA+AG+ Sbjct: 24 RPIDRRESRRWSLPGGVAANLELRATLQALCDKN-GFRFIAPPLSLCTDNAVMIAWAGLE 82 Query: 312 RFKAGATAD-LGVSVRPRWPL 331 R GA D L V R RWPL Sbjct: 83 RMATGAAPDPLDVQPRSRWPL 103 >UniRef50_Q1Q3G6 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3G6_9BACT Length = 225 Score = 57.4 bits (137), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 33/109 (30%), Positives = 57/109 (52%), Gaps = 16/109 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIETS + GI++ ++++ ++ + G+V H R+ VP I+ Sbjct: 10 MKVLGIETSGNIGGISLCENQQCIITK----------TFSGIV------QHERELVPAIK 53 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 AL+E+ DI+ +A GPG L +G T ++L +A + P + V Sbjct: 54 DALEEAHWQINDIEVIAVNVGPGSYTGLRIGVTCAKTLGYALNRPVVDV 102 >UniRef50_B9Z6Q4 Peptidase M22 glycoprotease n=1 Tax=Lutiella nitroferrum 2002 RepID=B9Z6Q4_9NEIS Length = 227 Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 32/109 (29%), Positives = 54/109 (49%), Gaps = 17/109 (15%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L I+TS D +A+ +D+ + V E + H + +P +Q Sbjct: 1 MKLLAIDTSTDFLSLAVLNDDNTV-----------------VFHERVGQKHAEQALPHVQ 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 + L ++GLT + +D V Y GPG L +G + + LAFA +P IP+ Sbjct: 44 SLLCDAGLTLQQLDGVVYGQGPGSFTGLRIGCGLAQGLAFAAGLPVIPI 92 >UniRef50_B7GZH2 Glycoprotease family protein n=17 Tax=Acinetobacter RepID=B7GZH2_ACIB3 Length = 221 Score = 57.0 bits (136), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 29/114 (25%), Positives = 63/114 (55%), Gaps = 16/114 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L +ET+ ++ I++ D+ + +L+ Q+ A + + +P+I+ Sbjct: 1 MKLLALETANEQCSISLIDETQ-----ELFFQLDTRA-----------KAQTQTILPMIE 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L+++GL +DA+A++ GPG + + A V ++LA++ D+P IPV ++ Sbjct: 45 QGLQQTGLDVAGLDAIAFSRGPGSFSGVRINAAVAQALAWSQDLPVIPVSTLQA 98 >UniRef50_Q2SL20 Inactive metal-dependent protease-like protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SL20_HAHCH Length = 230 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 34/117 (29%), Positives = 57/117 (48%), Gaps = 17/117 (14%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++L ++TS D +A+++D G L L E R H ++ +P++ + Sbjct: 3 KILALDTSSDACSVALWND--GELTELL---------------ETTPRAHAKRCLPMVDS 45 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 L +SGL +DA+A+ GPG L + A + + LAF D+P PV +E A Sbjct: 46 LLGDSGLRVGQLDALAFGRGPGSFTGLRIAAGIVQGLAFGADLPVAPVSTLEAMAFA 102 >UniRef50_P76256 M22 peptidase homolog yeaZ n=236 Tax=Gammaproteobacteria RepID=YEAZ_ECOLI Length = 231 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 32/109 (29%), Positives = 55/109 (50%), Gaps = 17/109 (15%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+L I+T+ + +A+++D V H EL R+H ++ +P++Q Sbjct: 1 MRILAIDTATEACSVALWND----------GTVNAHF-------ELCPREHTQRILPMVQ 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 L SG + DI+A+AY GPG + +G + + LA ++P I V Sbjct: 44 DILTTSGTSLTDINALAYGRGPGSFTGVRIGIGIAQGLALGAELPMIGV 92 >UniRef50_Q31G60 Peptidase M22 glycoprotease family protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31G60_THICR Length = 223 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 32/114 (28%), Positives = 52/114 (45%), Gaps = 17/114 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VL +E+S + + DEK V E+A + H +P+++ Sbjct: 1 MNVLAVESSTKACSVCLKVDEKAY-----------------VEFEMAPQRHANLMLPMVE 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L +SG+T DI A+A++ GPG + + A V + LA W P + V +E Sbjct: 44 KVLNQSGITPDDIHALAFSEGPGAFTGIRIAAGVTQGLALGWGKPVLAVSTLEA 97 >UniRef50_C0GCU7 Peptidase M22 glycoprotease n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GCU7_9FIRM Length = 242 Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 22/131 (16%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGI+++ +A+ +EK L L QVK + H + +PLI Sbjct: 1 MRVLGIDSATLVCSVALVSEEKTLAEYNL--QVK--------------KTHSERLLPLIA 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A L+++GL D+D VA AGPG + +G +SL A VP V ++ Sbjct: 45 AMLRDTGLKPADLDGVAVAAGPGSFTGVRIGMVTAKSLGQALAVPLAGVSTLQA------ 98 Query: 121 LEDNPPEFPFV 131 L P FP V Sbjct: 99 LAAQHPHFPGV 109 >UniRef50_Q1MXN6 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1MXN6_9GAMM Length = 233 Score = 54.7 bits (130), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 32/112 (28%), Positives = 53/112 (47%), Gaps = 17/112 (15%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE + D IA+ D + V+P A+R H + P++ Sbjct: 4 ILAIEAASDFCSIALDDGTDC---------------FQEVLP--AARSHSKLLYPMLNRL 46 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 LKE+G + K +DA+A+ GPG L + A + + FA D+P +PV ++ Sbjct: 47 LKEAGYSPKQLDAIAFAKGPGSFTGLRIAAATAQGIGFANDIPLLPVSTLQA 98 >UniRef50_B3ERC8 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ERC8_AMOA5 Length = 228 Score = 54.3 bits (129), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 33/116 (28%), Positives = 55/116 (47%), Gaps = 16/116 (13%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IETS +A++ + K L L+ +R H + +I+ Sbjct: 4 ILSIETSTSVCSVALHREGKLLAYQSLF----------------IARSHAESLLTIIEHI 47 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 ++ S T KD+ A+A + GPG L +GAT L +A ++P I V+ +E +LA Sbjct: 48 VQLSQYTLKDLQAIAISKGPGSYTGLRIGATTATGLCYALNIPLISVNTLEAMVLA 103 >UniRef50_P57409 Uncharacterized protein BU324 n=4 Tax=Buchnera aphidicola RepID=Y324_BUCAI Length = 221 Score = 54.3 bits (129), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 26/107 (24%), Positives = 52/107 (48%), Gaps = 17/107 (15%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+S D +AIY +E Y + E + H +P+I+ Sbjct: 5 ILSIESSLDCCSVAIYKNE-----------------YIHSLSEKCKKKHTTHILPMIKEI 47 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 L ++ K+++ V+++ GPG ++ + A++ +SL+ + +P I V Sbjct: 48 LSQTKTEFKELNYVSFSKGPGNFTSIRIAASIAQSLSISLKIPIISV 94 >UniRef50_C9SIA9 Glycoprotease pgp1 n=2 Tax=Sordariomycetes RepID=C9SIA9_VERA1 Length = 208 Score = 53.9 bits (128), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 49/142 (34%), Positives = 68/142 (47%), Gaps = 9/142 (6%) Query: 198 PRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRAL-- 255 P T R DFS G + R ++ + R D+ R + + L + AL Sbjct: 48 PFATTRRMRYDFSGFGSQVQRIAEARPAMSEAERR-DLGRDTMRILFEHLASRVVLALGN 106 Query: 256 DQTGFK---RLVMAGGVSANRTLRAKLAEMMKKR--RGEVFYARP-EFCTDNGAMIAYAG 309 ++ G K LV+AGGV++NR L L + R G A P E CTDN AMIA+ G Sbjct: 107 EEMGLKDVRTLVVAGGVASNRYLMHVLRAFLDVRGYDGIEITAPPVELCTDNAAMIAWTG 166 Query: 310 MVRFKAGATADLGVSVRPRWPL 331 M F+AG ++L V +WPL Sbjct: 167 MEMFEAGYESELSVHSIKKWPL 188 >UniRef50_C3WNF1 Glycoprotease n=10 Tax=Fusobacterium RepID=C3WNF1_9FUSO Length = 214 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 32/113 (28%), Positives = 51/113 (45%), Gaps = 15/113 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGI+TS +I+D E G++A S ++H +P+I Sbjct: 1 MLILGIDTSTKICTCSIFDSENGVIAETSLS---------------VKKNHSNIVMPIID 45 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 K S LT DID +A GPG + + + + LA A + P I V+ ++ Sbjct: 46 NLFKISDLTINDIDKIAVAIGPGSFTGVRIALGIAKGLAMALNKPLIAVNELD 98 >UniRef50_Q5E439 Predicted peptidase n=5 Tax=Vibrionaceae RepID=Q5E439_VIBF1 Length = 233 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 58/112 (51%), Gaps = 17/112 (15%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++L ++T+ + +A+ D K +YS+ + A R+H K +P + Sbjct: 4 KILAVDTATENCSVALIVDGK------VYSRRAV-----------APREHTIKILPFVDE 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 LKE+G+ +D+DA+A+ GPG + +G + + LAF D+P + + +E Sbjct: 47 VLKEAGVRLQDLDALAFGQGPGSFTGVRIGIGIAQGLAFGADLPMVGISTLE 98 >UniRef50_D0SL00 Glycoprotease n=1 Tax=Acinetobacter junii SH205 RepID=D0SL00_ACIJU Length = 230 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 28/114 (24%), Positives = 61/114 (53%), Gaps = 16/114 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L +ET+ ++ +++ DD + +LY Q+ + ++ + +PL + Sbjct: 1 MKLLALETANEQCSVSLIDDTQ-----ELYFQL-----------DERTKAQTQTILPLTE 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 AL ++ D+ A+A++ GPG + + A V ++LA++ D+P IPV ++ Sbjct: 45 QALIQTQTQLSDLTAIAFSRGPGSFSGVRINAAVAQALAWSHDLPVIPVSTLQA 98 >UniRef50_B8GRE1 Peptidase M22 glycoprotease n=2 Tax=Chromatiales RepID=B8GRE1_THISH Length = 266 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 17/112 (15%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L IET+ + A++ D G L + E+A R+H R +P++ Sbjct: 1 MKLLSIETATEACSAALWLD--GALTTRF---------------EMAPREHTRLILPMMD 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHM 112 A L E+ + D+DA+A+ GPG + + A V + AF D+P +PV + Sbjct: 44 ALLAEASVRLADLDALAFGRGPGAFTGVRIAAAVIQGAAFGADLPVVPVSTL 95 >UniRef50_A0LXU5 Peptidase, family M22 n=5 Tax=Bacteroidetes RepID=A0LXU5_GRAFK Length = 219 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 2/73 (2%) Query: 51 HVRKTVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVH 110 H K I+ LKE+GL D+DA+A + GPG L +G + + L F+ D+P I V Sbjct: 37 HAEKLHVFIENILKETGLKVDDLDAIAVSKGPGSYTGLRIGVSAAKGLCFSLDIPLISVP 96 Query: 111 HMEGHLLAPMLED 123 ++ LLA L+D Sbjct: 97 TLD--LLAYKLKD 107 >UniRef50_Q8KG29 Protease, putative n=1 Tax=Chlorobaculum tepidum RepID=Q8KG29_CHLTE Length = 224 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 24/54 (44%), Positives = 37/54 (68%) Query: 56 VPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 VPL+ + E+GLTA ++D VA ++GPG AL +G +V + +AF D+P +PV Sbjct: 39 VPLVMQVMDEAGLTAAELDGVAVSSGPGSFTALRIGLSVAKGIAFGADLPLVPV 92 >UniRef50_C1BYL4 Probable O-sialoglycoprotein endopeptidase 2 n=1 Tax=Esox lucius RepID=C1BYL4_ESOLU Length = 235 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 47/172 (27%), Positives = 73/172 (42%), Gaps = 19/172 (11%) Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN--------GTDDQ 230 GG + +A G +F F PM +FSF+GL+ TI+ GT Sbjct: 40 GGQAIELLAQDGDRLKFHFRPPMGAHYDCNFSFAGLRNQVKMTIQKKEAEEGVEPGTLLS 99 Query: 231 TRADIARAFEDAVVDTLMIKCKRAL---DQTGF-----KRLVMAGGVSANRTLRAKLAEM 282 DIA A + V + + RA+ G LV++GGV++N+ +R L + Sbjct: 100 CVNDIAAAMQHTVAFHIAKRTHRAILFCKAQGLLPSFNPTLVVSGGVASNQYIRKTLKIV 159 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSV---RPRWPL 331 ++ + C DNG MIA+ G+ R + G + V P+ PL Sbjct: 160 TDATGLDLLCPPSKLCNDNGVMIAWNGVERLREGKGILFYIDVVRYEPKAPL 211 >UniRef50_B2AYU2 Predicted CDS Pa_1_12240 (Fragment) n=1 Tax=Podospora anserina RepID=B2AYU2_PODAN Length = 205 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 4/73 (5%) Query: 263 LVMAGGVSANRTLRAKLAEMMKKRRG-EVFYARP--EFCTDNGAMIAYAGMVRFKA-GAT 318 LV++GGV++N+ LR L ++ +R ++ A P CTDN AMIA+AGM F+ G Sbjct: 109 LVLSGGVASNKFLRHVLRSVLDQRGWPDIKLAAPPVSLCTDNAAMIAWAGMEMFETEGVE 168 Query: 319 ADLGVSVRPRWPL 331 DLGV RW L Sbjct: 169 TDLGVRSIQRWSL 181 >UniRef50_A1WXT3 Peptidase M22, glycoprotease n=1 Tax=Halorhodospira halophila SL1 RepID=A1WXT3_HALHL Length = 220 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 29/107 (27%), Positives = 53/107 (49%), Gaps = 17/107 (15%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 ++ +ET+ + +A+Y D V H E A R H + +P+++A Sbjct: 6 IVALETATEGCSVAVYCD----------GDVFHHC-------EEAPRRHTARLLPMLEAV 48 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 + E+G+ + + A+A+ GPG + + A+ + L AW VPA+PV Sbjct: 49 MAEAGVCGEQVSALAFGQGPGAFAGVRLAASAAQGLCTAWGVPALPV 95 >UniRef50_Q18CP2 Putative glycoprotease n=9 Tax=Clostridium RepID=Q18CP2_CLOD6 Length = 238 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 29/114 (25%), Positives = 57/114 (50%), Gaps = 16/114 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LG++TS +A+ +D+K + + ++ + H +K +P+I+ Sbjct: 1 MKILGMDTSSMAASVAVVEDDKLICEFTVNNK----------------KTHSQKLMPMIE 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L S L+ KD+D +A GPG L +G +++A ++P I V+ +E Sbjct: 45 NMLSMSDLSIKDMDLLAVCIGPGSFTGLRIGMATVKAMAHVNNIPIIAVNSLES 98 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q7MNZ9 Probable O-sialoglycoprotein endopeptidase n=19 ... 485 e-136 UniRef50_B0TIN7 Probable O-sialoglycoprotein endopeptidase n=130... 472 e-132 UniRef50_P36175 O-sialoglycoprotein endopeptidase n=366 Tax=cell... 462 e-129 UniRef50_Q8D283 Probable O-sialoglycoprotein endopeptidase n=11 ... 436 e-121 UniRef50_C4Z311 O-sialoglycoprotein endopeptidase n=13 Tax=Bacte... 429 e-119 UniRef50_C0QTG9 Probable O-sialoglycoprotein endopeptidase n=3 T... 427 e-118 UniRef50_A1AXM9 Probable O-sialoglycoprotein endopeptidase n=36 ... 424 e-117 UniRef50_Q18CP0 Probable O-sialoglycoprotein endopeptidase n=22 ... 422 e-117 UniRef50_A5G3X1 Probable O-sialoglycoprotein endopeptidase n=20 ... 417 e-115 UniRef50_C6P1W3 Metalloendopeptidase, glycoprotease family n=1 T... 416 e-115 UniRef50_Q2RGJ3 Probable O-sialoglycoprotein endopeptidase n=10 ... 415 e-114 UniRef50_B2V910 Probable O-sialoglycoprotein endopeptidase n=4 T... 415 e-114 UniRef50_D1B623 Metalloendopeptidase, glycoprotease family n=3 T... 414 e-114 UniRef50_Q8RC98 Probable O-sialoglycoprotein endopeptidase n=12 ... 414 e-114 UniRef50_Q4FNV6 Probable O-sialoglycoprotein endopeptidase n=15 ... 412 e-114 UniRef50_C1TLM6 O-sialoglycoprotein endopeptidase n=1 Tax=Dethio... 409 e-113 UniRef50_B0TX13 Probable O-sialoglycoprotein endopeptidase n=19 ... 408 e-112 UniRef50_A7HLB0 Probable O-sialoglycoprotein endopeptidase n=3 T... 407 e-112 UniRef50_B6BRQ7 O-sialoglycoprotein endopeptidase n=1 Tax=Candid... 406 e-112 UniRef50_B9KXJ0 Probable O-sialoglycoprotein endopeptidase n=4 T... 404 e-111 UniRef50_Q6AL73 Probable O-sialoglycoprotein endopeptidase n=3 T... 403 e-111 UniRef50_A4EBV8 Putative uncharacterized protein n=5 Tax=Bacteri... 403 e-111 UniRef50_C7N1K1 Ribosomal-protein-alanine acetyltransferase n=1 ... 402 e-111 UniRef50_B3WUZ1 Probable O-sialoglycoprotein endopeptidase n=1 T... 401 e-110 UniRef50_Q0AVU0 Probable O-sialoglycoprotein endopeptidase n=27 ... 401 e-110 UniRef50_D0ME01 Metalloendopeptidase, glycoprotease family n=4 T... 401 e-110 UniRef50_A0L5L8 Probable O-sialoglycoprotein endopeptidase n=24 ... 400 e-110 UniRef50_Q11TP2 Probable O-sialoglycoprotein endopeptidase n=87 ... 398 e-109 UniRef50_Q6MQ48 Probable O-sialoglycoprotein endopeptidase n=1 T... 397 e-109 UniRef50_A5CE49 Probable O-sialoglycoprotein endopeptidase n=2 T... 397 e-109 UniRef50_Q8DLI9 Probable O-sialoglycoprotein endopeptidase n=12 ... 396 e-109 UniRef50_D1IZQ0 Whole genome shotgun sequence of line PN40024, s... 396 e-109 UniRef50_Q3YS67 Probable O-sialoglycoprotein endopeptidase n=24 ... 396 e-109 UniRef50_B8BPP0 Putative uncharacterized protein n=1 Tax=Oryza s... 394 e-108 UniRef50_A8GM49 Probable O-sialoglycoprotein endopeptidase n=15 ... 394 e-108 UniRef50_D0RQS5 Putative glycoprotease GCP n=1 Tax=alpha proteob... 393 e-108 UniRef50_B2GAG0 Probable O-sialoglycoprotein endopeptidase n=56 ... 391 e-107 UniRef50_B1GZV6 Probable O-sialoglycoprotein endopeptidase n=1 T... 388 e-106 UniRef50_D0N6Q4 O-sialoglycoprotein endopeptidase, putative n=1 ... 388 e-106 UniRef50_Q0SM86 Probable O-sialoglycoprotein endopeptidase n=18 ... 387 e-106 UniRef50_C1SJZ8 Metalloendopeptidase, putative, glycoprotease fa... 387 e-106 UniRef50_C7ND80 Metalloendopeptidase, glycoprotease family n=3 T... 387 e-106 UniRef50_C8W929 Metalloendopeptidase, glycoprotease family n=2 T... 386 e-106 UniRef50_D0WGH2 O-sialoglycoprotein endopeptidase n=1 Tax=Slacki... 385 e-105 UniRef50_C7MKR9 Ribosomal-protein-alanine acetyltransferase n=10... 385 e-105 UniRef50_Q2GEG6 Probable O-sialoglycoprotein endopeptidase n=2 T... 385 e-105 UniRef50_C1A601 Probable O-sialoglycoprotein endopeptidase n=1 T... 384 e-105 UniRef50_D1AVQ5 Metalloendopeptidase, glycoprotease family n=1 T... 384 e-105 UniRef50_B2UQZ0 Metalloendopeptidase, glycoprotease family n=3 T... 383 e-105 UniRef50_Q058D1 Probable O-sialoglycoprotein endopeptidase n=1 T... 382 e-104 UniRef50_Q2JXG9 Probable O-sialoglycoprotein endopeptidase n=31 ... 381 e-104 UniRef50_B3DVR7 Metal-dependent protease with possible chaperone... 381 e-104 UniRef50_B9JCG8 Probable O-sialoglycoprotein endopeptidase n=86 ... 381 e-104 UniRef50_A6DFV1 Metalloendopeptidase, putative, glycoprotease fa... 381 e-104 UniRef50_B9XP92 Metalloendopeptidase, glycoprotease family n=1 T... 379 e-104 UniRef50_Q6MD07 Probable O-sialoglycoprotein endopeptidase n=2 T... 379 e-104 UniRef50_C0Q8X7 Probable O-sialoglycoprotein endopeptidase n=2 T... 378 e-103 UniRef50_B3R0M3 Probable O-sialoglycoprotein endopeptidase n=2 T... 378 e-103 UniRef50_B7CBT6 Putative uncharacterized protein n=1 Tax=Eubacte... 378 e-103 UniRef50_Q5FLZ3 Probable O-sialoglycoprotein endopeptidase n=10 ... 377 e-103 UniRef50_B1V8Z6 Probable O-sialoglycoprotein endopeptidase n=6 T... 377 e-103 UniRef50_C7H0S4 Putative glycoprotease GCP n=1 Tax=Eubacterium s... 376 e-103 UniRef50_Q47LN7 Probable O-sialoglycoprotein endopeptidase n=58 ... 376 e-103 UniRef50_Q127W3 Probable O-sialoglycoprotein endopeptidase n=4 T... 374 e-102 UniRef50_B0VHD4 Putative metalloendopeptidase, , glycoprotease f... 373 e-102 UniRef50_Q045T6 Probable O-sialoglycoprotein endopeptidase n=433... 373 e-102 UniRef50_B6JAE9 Probable O-sialoglycoprotein endopeptidase n=5 T... 373 e-102 UniRef50_A9FDL0 Probable O-sialoglycoprotein endopeptidase n=5 T... 371 e-101 UniRef50_B5RQA5 Probable O-sialoglycoprotein endopeptidase n=4 T... 371 e-101 UniRef50_C7LR95 Metalloendopeptidase, glycoprotease family n=1 T... 370 e-101 UniRef50_Q7UM42 Probable O-sialoglycoprotein endopeptidase n=5 T... 369 e-101 UniRef50_B1XJF0 Probable O-sialoglycoprotein endopeptidase n=1 T... 368 e-100 UniRef50_B2KE20 Metalloendopeptidase, glycoprotease family n=1 T... 367 e-100 UniRef50_C8WN77 Metalloendopeptidase, glycoprotease family n=3 T... 367 e-100 UniRef50_C9RIN4 Metalloendopeptidase, glycoprotease family n=1 T... 366 e-100 UniRef50_Q0ATQ2 Probable O-sialoglycoprotein endopeptidase n=44 ... 366 e-99 UniRef50_Q254Q0 Probable O-sialoglycoprotein endopeptidase n=6 T... 365 e-99 UniRef50_B2S3R9 Probable O-sialoglycoprotein endopeptidase n=4 T... 365 1e-99 UniRef50_Q3SVF4 Probable O-sialoglycoprotein endopeptidase n=10 ... 364 3e-99 UniRef50_B4U8B7 Metalloendopeptidase, glycoprotease family n=1 T... 364 3e-99 UniRef50_C5ZWF6 Metal-dependent protease n=2 Tax=Helicobacter ca... 363 4e-99 UniRef50_B5ZLG0 Metalloendopeptidase, glycoprotease family n=11 ... 363 4e-99 UniRef50_A5GMV4 Probable O-sialoglycoprotein endopeptidase n=17 ... 363 4e-99 UniRef50_C0QY51 Probable O-sialoglycoprotein endopeptidase n=2 T... 363 5e-99 UniRef50_D1N4S8 Metalloendopeptidase, glycoprotease family n=1 T... 363 5e-99 UniRef50_Q2SR45 Probable O-sialoglycoprotein endopeptidase n=5 T... 362 1e-98 UniRef50_A0JZ01 Probable O-sialoglycoprotein endopeptidase n=98 ... 362 1e-98 UniRef50_Q30ZN1 Probable O-sialoglycoprotein endopeptidase n=12 ... 354 3e-96 UniRef50_A1R8N0 Probable O-sialoglycoprotein endopeptidase n=12 ... 353 4e-96 UniRef50_UPI0000D561DB PREDICTED: similar to AGAP005215-PA n=1 T... 352 7e-96 UniRef50_A1BJ68 Probable O-sialoglycoprotein endopeptidase n=12 ... 352 9e-96 UniRef50_A4RXP4 Predicted protein n=6 Tax=Eukaryota RepID=A4RXP4... 352 9e-96 UniRef50_Q54EW4 Putative uncharacterized protein n=1 Tax=Dictyos... 350 4e-95 UniRef50_D1B582 Metalloendopeptidase, glycoprotease family n=5 T... 349 6e-95 UniRef50_B8LEI0 Predicted protein (Fragment) n=1 Tax=Thalassiosi... 349 6e-95 UniRef50_B0D096 Predicted protein n=2 Tax=Agaricales RepID=B0D09... 348 2e-94 UniRef50_B3MQN2 GF20469 n=4 Tax=Drosophila RepID=B3MQN2_DROAN 347 2e-94 UniRef50_B8PI87 Predicted protein n=2 Tax=Postia placenta Mad-69... 346 5e-94 UniRef50_B3RQR7 Putative uncharacterized protein n=1 Tax=Trichop... 346 8e-94 UniRef50_Q1IUF1 Probable O-sialoglycoprotein endopeptidase n=2 T... 346 9e-94 UniRef50_Q04RH4 Probable O-sialoglycoprotein endopeptidase n=6 T... 344 2e-93 UniRef50_Q29HY2 GA12844 n=3 Tax=Sophophora RepID=Q29HY2_DROPS 343 4e-93 UniRef50_B0B9U7 Probable O-sialoglycoprotein endopeptidase n=6 T... 340 4e-92 UniRef50_A3EUW9 O-sialoglycoprotein endopeptidase n=3 Tax=Leptos... 340 5e-92 UniRef50_Q9H4B0 Probable O-sialoglycoprotein endopeptidase 2 n=3... 339 7e-92 UniRef50_A7H0K1 Probable O-sialoglycoprotein endopeptidase n=26 ... 339 8e-92 UniRef50_Q17CG3 O-sialoglycoprotein endopeptidase n=2 Tax=Culici... 337 3e-91 UniRef50_Q4A734 Probable O-sialoglycoprotein endopeptidase n=1 T... 336 5e-91 UniRef50_A6Q6J3 Probable O-sialoglycoprotein endopeptidase n=2 T... 335 1e-90 UniRef50_Q9VWD6 Probable O-sialoglycoprotein endopeptidase 2 n=6... 335 1e-90 UniRef50_UPI000058820F PREDICTED: hypothetical protein n=2 Tax=S... 334 3e-90 UniRef50_UPI000180B634 PREDICTED: similar to Probable O-sialogly... 334 3e-90 UniRef50_Q0BPC9 Probable O-sialoglycoprotein endopeptidase n=14 ... 334 4e-90 UniRef50_Q6L243 Putative O-sialoglycoprotein endopeptidase n=3 T... 333 5e-90 UniRef50_A5UMH5 Putative O-sialoglycoprotein endopeptidase n=5 T... 330 3e-89 UniRef50_B1ZYF9 Metalloendopeptidase, glycoprotease family n=3 T... 330 5e-89 UniRef50_C2KP25 O-sialoglycoprotein endopeptidase n=3 Tax=Mobilu... 329 1e-88 UniRef50_UPI0001C42124 glycoprotease M22 family n=1 Tax=Methanob... 329 1e-88 UniRef50_B6JWU0 Glycoprotease pgp1 n=1 Tax=Schizosaccharomyces j... 328 2e-88 UniRef50_C7M316 Metalloendopeptidase, glycoprotease family n=1 T... 327 3e-88 UniRef50_UPI0000F51796 O-sialoglycoprotein endopeptidase/protein... 326 7e-88 UniRef50_B3PND6 Probable O-sialoglycoprotein endopeptidase n=2 T... 325 1e-87 UniRef50_D2LQ34 Metalloendopeptidase, glycoprotease family n=1 T... 325 1e-87 UniRef50_O94710 Glycoprotease pgp1, mitochondrial n=1 Tax=Schizo... 325 1e-87 UniRef50_D2L1E2 Metalloendopeptidase, glycoprotease family n=1 T... 325 2e-87 UniRef50_UPI000186D055 conserved hypothetical protein n=1 Tax=Pe... 324 2e-87 UniRef50_A9WHP1 Metalloendopeptidase, glycoprotease family n=4 T... 324 2e-87 UniRef50_C1F9R2 Metalloendopeptidase, glycoprotease family n=1 T... 323 4e-87 UniRef50_Q5ZZQ1 Probable O-sialoglycoprotein endopeptidase n=8 T... 323 6e-87 UniRef50_Q6C9V8 YALI0D07920p n=1 Tax=Yarrowia lipolytica RepID=Q... 322 8e-87 UniRef50_Q17Z01 Probable O-sialoglycoprotein endopeptidase n=13 ... 320 5e-86 UniRef50_Q46FS9 Putative O-sialoglycoprotein endopeptidase n=17 ... 320 5e-86 UniRef50_UPI0001979AA5 putative DNA-binding/iron metalloprotein/... 318 2e-85 UniRef50_D0JBS4 Glycoprotease M22 family domain-containing prote... 316 9e-85 UniRef50_P43122 Putative protease QRI7 n=12 Tax=Saccharomycetace... 314 3e-84 UniRef50_C3XEQ4 O-sialoglycoprotein endopeptidase n=1 Tax=Helico... 314 3e-84 UniRef50_Q9NPF4 Probable O-sialoglycoprotein endopeptidase n=81 ... 314 4e-84 UniRef50_C4XSD3 Probable O-sialoglycoprotein endopeptidase n=2 T... 311 2e-83 UniRef50_C4QZU9 Putative metalloprotease, similar to O-sialoglyc... 310 3e-83 UniRef50_Q1IZH8 Probable O-sialoglycoprotein endopeptidase n=4 T... 310 6e-83 UniRef50_A6VJ51 Putative O-sialoglycoprotein endopeptidase n=26 ... 306 5e-82 UniRef50_A4VEZ5 O-sialoglycoprotein endopeptidase n=1 Tax=Tetrah... 301 2e-80 UniRef50_Q8EUQ9 Probable O-sialoglycoprotein endopeptidase n=1 T... 301 2e-80 UniRef50_Q4PGZ6 Putative uncharacterized protein n=2 Tax=Ustilag... 301 2e-80 UniRef50_P75055 Probable O-sialoglycoprotein endopeptidase n=2 T... 300 5e-80 UniRef50_C4PYC5 Mername-AA018 peptidase (M22 family) n=1 Tax=Sch... 299 1e-79 UniRef50_Q74M58 Putative O-sialoglycoprotein endopeptidase n=1 T... 298 2e-79 UniRef50_B7XIP4 O-sialoglycoprotein endopeptidase n=2 Tax=Eukary... 295 2e-78 UniRef50_Q4UA14 Glycoprotein endopeptidase, putative n=3 Tax=Pir... 294 2e-78 UniRef50_B1AJ51 Probable O-sialoglycoprotein endopeptidase n=15 ... 294 4e-78 UniRef50_UPI0000DB7930 PREDICTED: similar to O-sialoglycoprotein... 293 5e-78 UniRef50_A2BJY9 Putative O-sialoglycoprotein endopeptidase n=22 ... 292 1e-77 UniRef50_D2RYV2 Metalloendopeptidase, glycoprotease family n=1 T... 292 1e-77 UniRef50_A2QMR2 Function: O-sialoglycoprotein endopeptidase is a... 292 1e-77 UniRef50_A5DGU9 Putative uncharacterized protein n=2 Tax=Pichia ... 292 2e-77 UniRef50_Q93170 Protein C01G10.10, confirmed by transcript evide... 290 6e-77 UniRef50_Q7NB15 Probable O-sialoglycoprotein endopeptidase n=1 T... 289 1e-76 UniRef50_B6GZQ3 Pc12g05880 protein n=9 Tax=Trichocomaceae RepID=... 287 3e-76 UniRef50_UPI000023E24C hypothetical protein FG06887.1 n=1 Tax=Gi... 287 3e-76 UniRef50_B5Y892 O-sialoglycoprotein endopeptidase n=1 Tax=Coprot... 287 3e-76 UniRef50_Q6L4N8 Os05g0194600 protein n=21 Tax=Eukaryota RepID=Q6... 280 4e-74 UniRef50_B9WFF4 Metalloprotease, putative n=8 Tax=Saccharomyceta... 277 4e-73 UniRef50_Q18KI0 Putative O-sialoglycoprotein endopeptidase n=14 ... 275 2e-72 UniRef50_Q2GXN6 Putative glycoprotein endopeptidase KAE1 n=18 Ta... 275 2e-72 UniRef50_Q83I95 Probable O-sialoglycoprotein endopeptidase n=2 T... 274 3e-72 UniRef50_B7QJD9 O-sialoglycoprotein endopeptidase, putative n=3 ... 273 4e-72 UniRef50_A3CXS0 Putative O-sialoglycoprotein endopeptidase n=5 T... 273 5e-72 UniRef50_Q2HG58 Putative uncharacterized protein n=1 Tax=Chaetom... 272 1e-71 UniRef50_C8V9Q8 PROBABLE O-SIALOGLYCOPROTEIN ENDOPEPTIDASE (AFU_... 272 2e-71 UniRef50_B8MFK9 Glycoprotease family protein, putative n=5 Tax=L... 271 3e-71 UniRef50_Q4U8J6 Glycoprotease, putative n=2 Tax=Theileria RepID=... 271 3e-71 UniRef50_P36174 Putative O-sialoglycoprotein endopeptidase n=1 T... 270 4e-71 UniRef50_C4Y0N8 Putative uncharacterized protein n=1 Tax=Clavisp... 268 2e-70 UniRef50_A3MSX6 Putative O-sialoglycoprotein endopeptidase n=2 T... 268 3e-70 UniRef50_P36132 Putative glycoprotein endopeptidase KAE1 n=40 Ta... 267 4e-70 UniRef50_C1GKA7 Glycoprotease pgp1 n=11 Tax=Onygenales RepID=C1G... 258 2e-67 UniRef50_A8QDL6 Glycoprotease family protein n=1 Tax=Brugia mala... 255 2e-66 UniRef50_B2A533 O-sialoglycoprotein endopeptidase n=1 Tax=Natran... 253 9e-66 UniRef50_UPI0000E8089C PREDICTED: similar to Osgepl1 protein n=1... 250 6e-65 UniRef50_A8BDD4 O-sialoglycoprotein endopeptidase n=2 Tax=Giardi... 250 6e-65 UniRef50_A8WMS3 Putative uncharacterized protein n=1 Tax=Caenorh... 249 8e-65 UniRef50_C5FT24 Glycoprotease family protein n=2 Tax=Onygenales ... 249 1e-64 UniRef50_Q7SD85 Predicted protein n=2 Tax=Sordariaceae RepID=Q7S... 249 1e-64 UniRef50_C7DHT9 Metalloendopeptidase, glycoprotease family n=1 T... 247 6e-64 UniRef50_A6TR37 O-sialoglycoprotein endopeptidase n=1 Tax=Alkali... 242 1e-62 UniRef50_A8MFJ2 O-sialoglycoprotein endopeptidase n=1 Tax=Alkali... 237 5e-61 UniRef50_C0ZC04 Peptidase M22 family protein n=1 Tax=Brevibacill... 234 4e-60 UniRef50_A4RG35 Putative uncharacterized protein n=1 Tax=Magnapo... 233 6e-60 UniRef50_C0GE31 O-sialoglycoprotein endopeptidase n=1 Tax=Dethio... 233 7e-60 UniRef50_D1BMJ2 Metal-dependent protease with possible chaperone... 231 3e-59 UniRef50_A7APL5 Glycoprotease family protein n=1 Tax=Babesia bov... 230 7e-59 UniRef50_C5KYH6 Glycoprotein endopeptidase, putative n=4 Tax=Per... 229 1e-58 UniRef50_D2RJI3 Peptidase M22 glycoprotease n=2 Tax=Acidaminococ... 227 4e-58 UniRef50_A6NUZ4 Putative uncharacterized protein n=1 Tax=Bactero... 224 3e-57 UniRef50_Q2RIB0 O-sialoglycoprotein endopeptidase n=5 Tax=Clostr... 221 2e-56 UniRef50_A6S1G0 Putative uncharacterized protein n=1 Tax=Botryot... 220 5e-56 UniRef50_Q3AAM2 Glycoprotease family protein n=1 Tax=Carboxydoth... 216 8e-55 UniRef50_C9LLA9 Glycoprotease family protein n=1 Tax=Dialister i... 216 8e-55 UniRef50_C8WXH0 Peptidase M22 glycoprotease n=2 Tax=Alicyclobaci... 216 1e-54 UniRef50_B0AAV1 Putative uncharacterized protein n=2 Tax=Clostri... 215 1e-54 UniRef50_Q97ZY8 Putative O-sialoglycoprotein endopeptidase n=1 T... 214 3e-54 UniRef50_Q0AZF6 Putative uncharacterized protein n=1 Tax=Syntrop... 213 9e-54 UniRef50_D2VC41 Predicted protein n=1 Tax=Naegleria gruberi RepI... 212 2e-53 UniRef50_Q5KFY5 Mitochondrion protein, putative n=2 Tax=Filobasi... 211 4e-53 UniRef50_C7H6X1 Glycoprotease family protein n=2 Tax=Faecalibact... 210 6e-53 UniRef50_D1PKV9 Glycoprotease family protein n=1 Tax=Subdoligran... 206 8e-52 UniRef50_Q18B67 Probable O-sialoglycoprotein endopeptidase n=6 T... 206 1e-51 UniRef50_UPI0000DD8AA6 Os01g0295900 n=1 Tax=Oryza sativa Japonic... 204 3e-51 UniRef50_A7VX43 Putative uncharacterized protein n=4 Tax=Clostri... 204 4e-51 UniRef50_A0RY43 O-sialoglycoprotein endopeptidase n=4 Tax=Thauma... 198 2e-49 UniRef50_B0TEI7 O-sialoglycoprotein endopeptidase, putative n=1 ... 197 5e-49 UniRef50_UPI000187E9E4 hypothetical protein MPER_08009 n=1 Tax=M... 178 4e-43 UniRef50_D2EF31 O-sialoglycoprotein endopeptidase (Fragment) n=1... 176 1e-42 UniRef50_Q8IJ99 Glycoprotease, putative n=5 Tax=Plasmodium RepID... 167 7e-40 UniRef50_B2WBX5 Glycoprotease pgp1, mitochondrial n=1 Tax=Pyreno... 163 7e-39 UniRef50_A5KDZ1 O-sialoglycoprotein endopeptidase, putative n=1 ... 156 2e-36 UniRef50_C5KJ57 Putative uncharacterized protein (Fragment) n=1 ... 149 2e-34 UniRef50_Q0V4Z5 Putative uncharacterized protein n=1 Tax=Phaeosp... 146 9e-34 UniRef50_A9UYP5 Predicted protein n=1 Tax=Monosiga brevicollis R... 145 3e-33 UniRef50_C1BYL4 Probable O-sialoglycoprotein endopeptidase 2 n=1... 144 3e-33 UniRef50_D1IQV9 Whole genome shotgun sequence of line PN40024, s... 143 9e-33 UniRef50_B2AYU1 Predicted CDS Pa_1_12230 (Fragment) n=1 Tax=Podo... 141 3e-32 UniRef50_C9SIA9 Glycoprotease pgp1 n=2 Tax=Sordariomycetes RepID... 132 2e-29 UniRef50_B9PG42 O-sialoglycoprotein endopeptidase, putative n=3 ... 124 5e-27 UniRef50_B9HH45 Predicted protein n=7 Tax=Eukaryota RepID=B9HH45... 116 1e-24 UniRef50_C0YUE5 Possible M22 family non-peptidase n=1 Tax=Chryse... 115 2e-24 UniRef50_Q11BE2 Peptidase M22, glycoprotease n=2 Tax=Phyllobacte... 112 1e-23 UniRef50_B3PIE3 Glycoprotease family protein n=1 Tax=Cellvibrio ... 112 2e-23 UniRef50_C6WZF1 Putative glycoprotease family exported protein n... 112 3e-23 UniRef50_C8PCP6 Putative uncharacterized protein n=1 Tax=Lactoba... 110 6e-23 UniRef50_D1AUR9 Putative endopeptidase n=1 Tax=Anaplasma central... 109 1e-22 UniRef50_Q5E439 Predicted peptidase n=5 Tax=Vibrionaceae RepID=Q... 108 3e-22 UniRef50_Q5ZU86 Glycoprotease (O-sialoglycoprotein endopeptidase... 106 9e-22 UniRef50_A0YCJ3 Inactive metal-dependent protease-like protein n... 106 2e-21 UniRef50_P76256 M22 peptidase homolog yeaZ n=236 Tax=Gammaproteo... 106 2e-21 UniRef50_A3HX68 Putative uncharacterized protein n=1 Tax=Algorip... 105 2e-21 UniRef50_UPI00019087BD O-sialoglycoprotein endopeptidase n=1 Tax... 105 2e-21 UniRef50_B8I821 Peptidase M22 glycoprotease n=1 Tax=Clostridium ... 105 2e-21 UniRef50_P43990 Probable M22 peptidase homolog HI0388 n=24 Tax=P... 105 3e-21 UniRef50_A5FJB4 Peptidase family M22-like protein n=10 Tax=Flavo... 105 3e-21 UniRef50_Q31G60 Peptidase M22 glycoprotease family protein n=1 T... 104 7e-21 UniRef50_Q0JNG2 Os01g0295900 protein n=5 Tax=Oryza sativa RepID=... 103 1e-20 UniRef50_Q3SMU0 Peptidase M22, glycoprotease n=14 Tax=Bradyrhizo... 103 1e-20 UniRef50_Q18CP2 Putative glycoprotease n=9 Tax=Clostridium RepID... 103 1e-20 UniRef50_A8U9X9 Glycoprotein endopeptidase n=1 Tax=Carnobacteriu... 103 1e-20 Sequences not found previously or not previously below threshold: UniRef50_Q9U0J7 Peptidase, M22 family, putative n=3 Tax=Plasmodi... 109 2e-22 UniRef50_B2A5P8 Peptidase M22 glycoprotease n=1 Tax=Natranaerobi... 108 3e-22 UniRef50_A8SIF9 Putative uncharacterized protein n=1 Tax=Parvimo... 104 5e-21 UniRef50_B0KBT6 Peptidase M22, glycoprotease n=11 Tax=Thermoanae... 103 8e-21 UniRef50_A0NUI5 Putative uncharacterized protein n=1 Tax=Labrenz... 103 9e-21 >UniRef50_Q7MNZ9 Probable O-sialoglycoprotein endopeptidase n=19 Tax=Gammaproteobacteria RepID=GCP_VIBVY Length = 339 Score = 485 bits (1250), Expect = e-136, Method: Composition-based stats. Identities = 259/334 (77%), Positives = 294/334 (88%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIETSCDETGIAIYDDEKGLLA++LYSQ+KLHADYGGVVPELASRDHV+KT+PLI+ Sbjct: 1 MRILGIETSCDETGIAIYDDEKGLLAHKLYSQIKLHADYGGVVPELASRDHVKKTIPLIK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ LTAKDID VAYTAGPGLVGALLVGAT+GRSLA+AW VPA+PVHHMEGHLLAPM Sbjct: 61 EALKEANLTAKDIDGVAYTAGPGLVGALLVGATIGRSLAYAWGVPAVPVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LEDNPP FPFVA+LVSGGH+ ++ V GIG+Y++LGESIDDAAGEAFDKTAKL+GLDYPGG Sbjct: 121 LEDNPPPFPFVAVLVSGGHSMMVEVKGIGEYKILGESIDDAAGEAFDKTAKLMGLDYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 PLLSK+A +GT GRF FPRPMT+ PGLD SFSGLKTF ANTI NG D+QTRADIA AFE Sbjct: 181 PLLSKLAEKGTPGRFKFPRPMTNVPGLDMSFSGLKTFTANTIAANGDDEQTRADIAYAFE 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +AV TL IKCKRAL+QTG KR+V+AGGVSANR LRA+L ++ K G+V+Y R EFCTD Sbjct: 241 EAVCATLAIKCKRALEQTGMKRIVIAGGVSANRRLRAELEKLAHKVGGDVYYPRTEFCTD 300 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 NGAMIAYAGM R K +DL V RPRWP+ +L Sbjct: 301 NGAMIAYAGMQRLKNNEVSDLAVEARPRWPIDQL 334 >UniRef50_B0TIN7 Probable O-sialoglycoprotein endopeptidase n=130 Tax=Gammaproteobacteria RepID=GCP_SHEHH Length = 338 Score = 472 bits (1215), Expect = e-132, Method: Composition-based stats. Identities = 247/335 (73%), Positives = 284/335 (84%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGIETSCDETGIA+YDDEKGLL++ LYSQVKLHADYGGVVPELASRDHVRK VPLI+ Sbjct: 1 MRVLGIETSCDETGIAVYDDEKGLLSHALYSQVKLHADYGGVVPELASRDHVRKIVPLIR 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ +T +D+D +AYT GPGL+GALLVGA VGR+LAF+WD PAI VHHMEGHLLAPM Sbjct: 61 QALADADMTIEDLDGIAYTKGPGLIGALLVGACVGRALAFSWDKPAIGVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LED+ PEFPF+ALLVSGGH+ L+ V GIG+Y +LGES+DDAAGEAFDKTAKL+GLDYPGG Sbjct: 121 LEDDVPEFPFLALLVSGGHSMLVGVEGIGRYTVLGESVDDAAGEAFDKTAKLMGLDYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P LSK+AA+G + FPRPMTD+PGL+ SFSGLKTFAANTI D+QTRA+IA AFE Sbjct: 181 PRLSKLAAKGVPNSYRFPRPMTDKPGLNMSFSGLKTFAANTIAAEPKDEQTRANIACAFE 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +AVVDTL IKCKRAL QTG+K LV+AGGVSAN LRA L+EMM+ G+V+Y R EFCTD Sbjct: 241 EAVVDTLGIKCKRALKQTGYKNLVIAGGVSANTRLRASLSEMMQGLGGKVYYPRGEFCTD 300 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 NGAMIAYAG+ R KAG DL V +PRWPL L Sbjct: 301 NGAMIAYAGLQRLKAGQVEDLAVKGQPRWPLDTLE 335 >UniRef50_P36175 O-sialoglycoprotein endopeptidase n=366 Tax=cellular organisms RepID=GCP_PASHA Length = 325 Score = 462 bits (1190), Expect = e-129, Method: Composition-based stats. Identities = 246/319 (77%), Positives = 280/319 (87%), Gaps = 5/319 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIETSCDETG+AIYD++KGL+ANQLYSQ+ +HADYGGVVPELASRDH+RKT+PLIQ Sbjct: 1 MRILGIETSCDETGVAIYDEDKGLVANQLYSQIDMHADYGGVVPELASRDHIRKTLPLIQ 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ L DID +AYTAGPGLVGALLVG+T+ RSLA+AW+VPA+ VHHMEGHLLAPM Sbjct: 61 EALKEANLQPSDIDGIAYTAGPGLVGALLVGSTIARSLAYAWNVPALGVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE+N PEFPFVALL+SGGHTQL+ V G+GQYELLGESIDDAAGEAFDKT KLLGLDYP G Sbjct: 121 LEENAPEFPFVALLISGGHTQLVKVDGVGQYELLGESIDDAAGEAFDKTGKLLGLDYPAG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN-----GTDDQTRADI 235 +SK+A GT RF FPRPMTDRPGLDFSFSGLKTFAANTI+ N D+QT+ DI Sbjct: 181 VAMSKLAESGTPNRFKFPRPMTDRPGLDFSFSGLKTFAANTIKANLNENGELDEQTKCDI 240 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A AF+ AVVDT++IKCKRAL+QTG+KRLVMAGGVSAN+ LRA LAEMMKK +GEVFY RP Sbjct: 241 AHAFQQAVVDTILIKCKRALEQTGYKRLVMAGGVSANKQLRADLAEMMKKLKGEVFYPRP 300 Query: 296 EFCTDNGAMIAYAGMVRFK 314 +FCTDNGAMIAY G +R K Sbjct: 301 QFCTDNGAMIAYTGFLRLK 319 >UniRef50_Q8D283 Probable O-sialoglycoprotein endopeptidase n=11 Tax=Gammaproteobacteria RepID=GCP_WIGBR Length = 340 Score = 436 bits (1121), Expect = e-121, Method: Composition-based stats. Identities = 179/335 (53%), Positives = 247/335 (73%), Gaps = 1/335 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETSCD+TG AIYD EKGL+ +++ SQ +H+ YGGVVPE +S+ H++ PL++ Sbjct: 1 MLILGIETSCDDTGAAIYDLEKGLIIHKVISQNNIHSKYGGVVPEKSSKYHLKNIQPLVE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 K S ++ ID +AYTAGPGLVG+L++GAT SLA+ +P+I ++H+EGHLL PM Sbjct: 61 NIFKNSNISLSKIDGIAYTAGPGLVGSLIIGATFACSLAYTLQIPSIAINHLEGHLLTPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 ++ P+FPF+ L++SG HTQ + IG+Y+++G+ +DDA GEAFDK AKLLG+ YPGG Sbjct: 121 IKYKRPKFPFLGLIISGAHTQFVLAEDIGKYKIIGDCLDDALGEAFDKVAKLLGIKYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-DDQTRADIARAF 239 LS +A QG + RF FPRPMT +PG++FSFSGLKT+A N + D+QT+ DIARAF Sbjct: 181 KKLSIIAKQGNSKRFFFPRPMTKKPGINFSFSGLKTYAKNLVSSFSKIDNQTKCDIARAF 240 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ED+++DT++IKCKRALD T K L+++GGVSAN LR L +MK R G++F+++ CT Sbjct: 241 EDSIIDTVIIKCKRALDITNSKILLISGGVSANEPLRKNLRNLMKSRNGKLFFSKKSLCT 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 DN AMIAY G +RFK T DL V + P+W L ++ Sbjct: 301 DNAAMIAYVGSIRFKKNKTKDLSVLINPKWSLEDI 335 >UniRef50_C4Z311 O-sialoglycoprotein endopeptidase n=13 Tax=Bacteria RepID=C4Z311_EUBE2 Length = 352 Score = 429 bits (1103), Expect = e-119, Method: Composition-based stats. Identities = 146/334 (43%), Positives = 205/334 (61%), Gaps = 2/334 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET A+ + + +L+N + +Q+ +H +YGGVVPE+ASR H+ P+I+ Sbjct: 17 LILAIESSCDETAAAVVKNGREVLSNVINTQIAIHTEYGGVVPEIASRKHIENINPVIRK 76 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G+T DIDA+ T GPGLVGALLVG +++AFA + P + VHH+EGH+ A + Sbjct: 77 ALEDAGVTLDDIDAIGVTYGPGLVGALLVGVAEAKAIAFAKNKPLVGVHHIEGHISANYV 136 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E+ E PFVAL+VSGGHT L+ V G+YE++G + DDAAGEAFDK A+ +GL YPGGP Sbjct: 137 ENKELEPPFVALVVSGGHTHLVKVNDYGEYEIVGRTRDDAAGEAFDKVARAIGLGYPGGP 196 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRADIARAF 239 + K+A +G FPR D DFSFSG+K+ N I + RAD+A +F Sbjct: 197 KIDKLAKEGNPDAIEFPRAHVDDAPYDFSFSGIKSAVLNYINSANMQGKEINRADVAASF 256 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + AVVD L+ + R + G +L +AGGV++N LRA + E K + P CT Sbjct: 257 QKAVVDALVSRAVRLAKECGMDKLAIAGGVASNSALRAAIQEACAKNNIGFYSPSPILCT 316 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 DN AMI A + G ++ P L E Sbjct: 317 DNAAMIGAAAYYEYIKGVRHGYDLNAVPNLKLGE 350 >UniRef50_C0QTG9 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Bacteria RepID=GCP_PERMH Length = 344 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. Identities = 141/338 (41%), Positives = 218/338 (64%), Gaps = 9/338 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCD+T +++YD E+GLL+N + SQ+K+H ++GGV P+LA+R+H + +P++ Sbjct: 1 MKILGIETSCDDTAVSVYDSEEGLLSNVVSSQIKMHEEWGGVYPDLAAREHTKNIIPVLD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ + KDID +A T PGL+ +L++G +V ++L++ + P IPVHH+E H+ A Sbjct: 61 RALKEASVNIKDIDGIAVTVAPGLIVSLVIGISVAKTLSWIYRKPLIPVHHIEAHIFASF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + ++PF+AL+VSGGHT+L + G Y LG ++DDA GEA+DK A++LGL YPGG Sbjct: 121 ITEK-IDYPFIALVVSGGHTELYLIKGFEDYRYLGGTLDDAVGEAYDKVARMLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG---LDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 P++ +++ +G PRP+ + G +FSFSGLKT I+ + DIAR Sbjct: 180 PVIDRLSKEG-EDTVKLPRPLINDRGKNRFNFSFSGLKTAVLREIQKG---VYRKEDIAR 235 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 +F++A D L+ K A+ + K +V+AGGVSAN LR K E + + ++ Sbjct: 236 SFQEAATDVLLAKTIDAMKEFNIKNVVIAGGVSANSRLREKFKEAEENHGIKAYFPPLYL 295 Query: 298 CTDNGAMIAYAGMVRFK-AGATADLGVSVRPRWPLAEL 334 CTDNGAM+A+ G RFK +G T D + R + + Sbjct: 296 CTDNGAMVAFTGYKRFKESGTTVDYSFEGKARLRMDKF 333 >UniRef50_A1AXM9 Probable O-sialoglycoprotein endopeptidase n=36 Tax=Proteobacteria RepID=GCP_RUTMC Length = 356 Score = 424 bits (1092), Expect = e-117, Method: Composition-based stats. Identities = 202/334 (60%), Positives = 248/334 (74%), Gaps = 4/334 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 LGIE+SCDETGI +Y E GL+ ++L+S VK+HA+YGGVVPELASRDH+++ +PLI+A Sbjct: 22 ITLGIESSCDETGIGLYHSELGLIGHELFSSVKIHAEYGGVVPELASRDHIQRVLPLIKA 81 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L + T +D+ +AYTAGPGL GALLVG V +SLA++ D+P++ VHHMEGHLL P+L Sbjct: 82 VLADVKFTLQDLSGIAYTAGPGLAGALLVGCAVAKSLAWSLDIPSLAVHHMEGHLLTPLL 141 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E++ PEFPFVALLVSGGHT LI V IGQY++LGES+DDA GEAFDKTAK+LGL YPGGP Sbjct: 142 EESQPEFPFVALLVSGGHTMLIDVKAIGQYKILGESLDDAVGEAFDKTAKILGLGYPGGP 201 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 L+ +A QG G F FP PM RPGLDFSFSGLKTF NT + + DIA+AFE Sbjct: 202 ALAMLAEQGNYGAFKFPCPMVGRPGLDFSFSGLKTFVRNTFAKYPSK---KEDIAKAFEV 258 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 A TLMIKC+RAL+QT + LV+AGGVSAN +LR KL +M +K VFY R EFCTDN Sbjct: 259 ATTQTLMIKCRRALEQTKYATLVVAGGVSANLSLRKKLNQMGQKLDVNVFYPRQEFCTDN 318 Query: 302 GAMIAYAGMVRFKAGAT-ADLGVSVRPRWPLAEL 334 GAMIA G R G ++++PRW L EL Sbjct: 319 GAMIALVGYFRLSHGQHDTHHEINIKPRWSLEEL 352 >UniRef50_Q18CP0 Probable O-sialoglycoprotein endopeptidase n=22 Tax=Bacteria RepID=GCP_CLOD6 Length = 338 Score = 422 bits (1086), Expect = e-117, Method: Composition-based stats. Identities = 134/335 (40%), Positives = 205/335 (61%), Gaps = 2/335 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + L IE+SCDET ++ + + +L+N + +Q++ H +GGVVPE+ASR HV ++Q Sbjct: 4 IITLAIESSCDETAASVLKNGREVLSNIISTQIETHKKFGGVVPEVASRKHVENIDIVVQ 63 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ + DID +A T GPGLVGALLVG + ++LA+ ++P + V+H+EGHL A Sbjct: 64 EALDKANIGFNDIDHIAVTYGPGLVGALLVGLSYAKALAYTLNIPLVGVNHIEGHLSANY 123 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E + PF+ L+VSGGHT L+ V G+YE+LG++ DDA+GEAFDK ++ + L YPGG Sbjct: 124 IEHKDLKPPFITLIVSGGHTHLVEVKDYGKYEILGKTRDDASGEAFDKISRAMNLGYPGG 183 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARA 238 P++ +A G FPR + DFSFSGLK+ N + + ++ D+A + Sbjct: 184 PIIDNLAKNGNKHAIEFPRAYLEEDSYDFSFSGLKSSVLNYLNGKRMKNEEIVVEDVAAS 243 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++AVV+ L K +A+ G+ + ++GGV++N LRAK+ E+ K V Y C Sbjct: 244 FQEAVVEVLSTKALKAVKDKGYNIITLSGGVASNSGLRAKITELAKDNGITVKYPPLILC 303 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 TDN AMI AG F G T D+ ++ P + + Sbjct: 304 TDNAAMIGCAGYYNFINGKTHDMSLNAVPNLKINQ 338 >UniRef50_A5G3X1 Probable O-sialoglycoprotein endopeptidase n=20 Tax=Bacteria RepID=GCP_GEOUR Length = 343 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. Identities = 150/338 (44%), Positives = 212/338 (62%), Gaps = 3/338 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IE+SCDET A+ D + +L+N + SQ+ +HA YGGVVPE+ASR H+ +I+ Sbjct: 1 MLLLAIESSCDETAAAVVRDGRIILSNIVASQISVHAGYGGVVPEIASRKHLETISTVIE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ +G++ D+D +A T GPGL GALLVG + +++A+A VP V+H+E H+LA Sbjct: 61 EALQAAGVSLTDVDGIAVTQGPGLAGALLVGISTAKAMAYALGVPIAGVNHIESHILAIF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE EFPFVAL VSGGHT L V +G+Y+ LG+++DDAAGEAFDK AKLLGL YPGG Sbjct: 121 LE-RSIEFPFVALAVSGGHTHLYLVEAVGRYKTLGQTLDDAAGEAFDKVAKLLGLPYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG--TDDQTRADIARA 238 L+ ++AA+G FPRP+ +FSFSGLKT N ++ N D + D+ + Sbjct: 180 ALIDRLAAEGDPEAIRFPRPLMRDESFNFSFSGLKTSVLNYLQKNPAAADGRALNDLCAS 239 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+ AV D L+ K A+ TG KR+V+AGGV+ N LR +++ + + + E+ P C Sbjct: 240 FQAAVCDVLVSKTAAAVSATGIKRVVVAGGVACNNGLRREMSRLAELKGIELHIPSPLLC 299 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 +DN AMIA G + + P WPL + + Sbjct: 300 SDNAAMIAVPGDYYLSNNILSGFDIDALPVWPLDSIAS 337 >UniRef50_C6P1W3 Metalloendopeptidase, glycoprotease family n=1 Tax=Sideroxydans lithotrophicus ES-1 RepID=C6P1W3_9PROT Length = 383 Score = 416 bits (1071), Expect = e-115, Method: Composition-based stats. Identities = 206/377 (54%), Positives = 254/377 (67%), Gaps = 41/377 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCDETGIA+Y E+GLLA+ L+SQ+ LH +YGGVVPELASRDHVR +PLI++ Sbjct: 6 LILGIESSCDETGIALYHTERGLLAHTLHSQIALHNEYGGVVPELASRDHVRHALPLIRS 65 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G DIDA+AYT GPGL GALLVG+++ +LA+ DVP I VHH+EGHLL+P+L Sbjct: 66 ALQKAGCALSDIDAIAYTQGPGLSGALLVGSSIACALAYTLDVPTIGVHHLEGHLLSPLL 125 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 PEFPFVALLVSGGHTQL+ V G+G Y LLGES+DDAAGEAFDK+AKLLGLDYPGG Sbjct: 126 SRPAPEFPFVALLVSGGHTQLMRVDGVGHYTLLGESVDDAAGEAFDKSAKLLGLDYPGGA 185 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI------------------- 222 LLSK+A +GT GRF PRPM LDFSFSGLKT + Sbjct: 186 LLSKLAQRGTPGRFKLPRPMLHSGNLDFSFSGLKTAVLTLVNQQIDIPHPNPDGTTSHST 245 Query: 223 ---------------------RDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFK 261 R+ T +QTRADIA A ++A+VD L+ K AL QTG Sbjct: 246 KPASGQVAGYLPEGEGANESLREFPTPEQTRADIAHAAQEAIVDVLVNKALAALKQTGLN 305 Query: 262 RLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATA-D 320 +LV+AGGV AN+ LR++L + K G VFY EFCTDNGAMIA+AG +R + D Sbjct: 306 QLVVAGGVGANQLLRSRLNASVGKHDGNVFYPELEFCTDNGAMIAFAGAMRLQQQVAQRD 365 Query: 321 LGVSVRPRWPLAELPAA 337 +V+PRW L E+ A Sbjct: 366 YRFNVKPRWDLREMNYA 382 >UniRef50_Q2RGJ3 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Bacteria RepID=GCP_MOOTA Length = 342 Score = 415 bits (1068), Expect = e-114, Method: Composition-based stats. Identities = 145/333 (43%), Positives = 194/333 (58%), Gaps = 2/333 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D + AN + SQ+ +H +GGVVPE+ASR H+ VP++ Sbjct: 10 NILAIESSCDETAAAIVSDGTRVRANIIASQIAVHRRFGGVVPEIASRHHMENIVPVVSE 69 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL +GL D+DAVA T GPGLVGALLVG +SLA+A P I VHH+ GH+ A L Sbjct: 70 ALATAGLAFSDVDAVAVTYGPGLVGALLVGVAYAKSLAYALGKPLIGVHHLLGHIYAGFL 129 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 P V+L+VSGGHT L+ + +LG + DDAAGEAFDK A++LGL YPGGP Sbjct: 130 AYPGLPLPAVSLVVSGGHTNLVYLEDHTTRRILGSTRDDAAGEAFDKVARVLGLPYPGGP 189 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTRADIARAF 239 L K+A +G FPR + LDFSFSGLK+ N + + RAD+A +F Sbjct: 190 ELEKLAREGNPRAIPFPRAWLEENSLDFSFSGLKSAVINYLHHARQVGQEVNRADVAASF 249 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + AV + L+ K A + +++AGGV+AN LR +L ++ VF+ E CT Sbjct: 250 QAAVAEVLVTKTLLAATSYRARSILLAGGVAANSVLRRELRSAGEQAGLPVFFPPRELCT 309 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 DN AMI A ++ A L ++ P PL Sbjct: 310 DNAAMIGCAAYYQYLRRDFAPLSLNAIPDLPLN 342 >UniRef50_B2V910 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Aquificales RepID=GCP_SULSY Length = 337 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 147/338 (43%), Positives = 211/338 (62%), Gaps = 12/338 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCD+T IA+YD EKG+ +N + SQ+ +HA +GGV PE+A+R+H + +P++ Sbjct: 1 MVVLGIETSCDDTSIAVYDSEKGIPSNVVTSQL-IHAQFGGVYPEIAAREHTKNFLPVLD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ +T DIDA+A T PGL+ +L+ G + ++L+F+ P IPVHH+E H+ A Sbjct: 60 KALRDASITLSDIDAIATTFMPGLIVSLVAGVSGAKTLSFSLKKPLIPVHHIEAHIFANF 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + E+PF+AL+VSGGHT+LI V Y LG ++DDA GE +DK A+ LGL +PGG Sbjct: 120 I-TKEIEYPFLALVVSGGHTELILVKEFEDYIYLGGTLDDAVGEVYDKVARALGLGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTD--RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 PL+ K+A +G FPRP+ + +FSFSGLK+ I + DI ++ Sbjct: 179 PLIDKLAKEGK-EAIKFPRPLLNDEENKYNFSFSGLKSAVIREINKG---IYKKEDITKS 234 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++AVVD L+ K A + G R+V+AGGVSAN LR E + + EV + C Sbjct: 235 FQNAVVDVLVKKTVLACKEFGINRVVVAGGVSANSQLRE---EFLNIKDLEVHFPPMHLC 291 Query: 299 TDNGAMIAYAGMVRFK-AGATADLGVSVRPRWPLAELP 335 TDNGAM+AY G RFK G + L + R + + P Sbjct: 292 TDNGAMVAYTGYKRFKEKGISVSLDFEAKARCRIDKFP 329 >UniRef50_D1B623 Metalloendopeptidase, glycoprotease family n=3 Tax=Synergistaceae RepID=D1B623_THEAS Length = 342 Score = 414 bits (1066), Expect = e-114, Method: Composition-based stats. Identities = 141/335 (42%), Positives = 212/335 (63%), Gaps = 5/335 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIE+SCD+T +A+ ++ + + A+ + SQV+ HA +GGVVPELASR H + L++ Sbjct: 8 LVLGIESSCDDTAVAVLEEPRRIRASLVMSQVEDHAPHGGVVPELASRRHQEAIMGLVRR 67 Query: 62 ALKESGLT--AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 L ++G++ + + +A TAGPGL+G+LLVG + L+ W+VP + V+HMEGHL A Sbjct: 68 CLWQAGVSNPMRQLSLIAVTAGPGLMGSLLVGVMAAKGLSQGWEVPIMGVNHMEGHLFAN 127 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 +L + PF+ L+VSGGHT++ V G Y LLG + DDA GEA+DK AK+LGL YPG Sbjct: 128 VLAHPDLKPPFLCLIVSGGHTEVHLVRSFGDYRLLGATRDDAVGEAYDKVAKMLGLGYPG 187 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 GP++ ++A +G R+ P P ++FSFSGLKT +R G + + D+ +F Sbjct: 188 GPVIDRLAREGDPDRYQLPVPFKGSSQVEFSFSGLKTAVLWLVRREG-EALSVPDLCASF 246 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG--EVFYARPEF 297 + A V++L+ K K A++QTG + + ++GGV+ANR LR +L ++ G V+ E Sbjct: 247 QRAAVESLVSKVKLAMNQTGVRTVAVSGGVAANRELRRRLEDLAGSSGGRVRVYLPPLEL 306 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 CTDN AM+A AG+ ++ G DL P W L+ Sbjct: 307 CTDNAAMVAAAGLWAYRRGVRDDLSFRADPSWELS 341 >UniRef50_Q8RC98 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Bacteria RepID=GCP_THETN Length = 341 Score = 414 bits (1066), Expect = e-114, Method: Composition-based stats. Identities = 139/333 (41%), Positives = 203/333 (60%), Gaps = 3/333 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCDET + + K +L+N +YSQ+ +H YGGVVPE+ASR H+ +++ A Sbjct: 7 ILGIETSCDETAAGVVKNGKEVLSNVIYSQINVHKKYGGVVPEIASRKHIEAISFVVEEA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ L+ ++DA+A T GPGLVG LLVG + G++LA+A P I V+H++GH+ A + Sbjct: 67 LNEAKLSLDEVDAIAATYGPGLVGPLLVGLSYGKALAYAKGKPFIGVNHIDGHIAANYIG 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 N PFV L+ SGGH+ ++ V G+YE++G+++DDAAGEAFDK A+ LGL YPGGP Sbjct: 127 GNLTP-PFVCLVASGGHSHIVYVKDYGEYEVMGKTLDDAAGEAFDKVARALGLGYPGGPA 185 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARAFE 240 + K A G FP+ + DFSFSG+KT N + + ++ D+A +F+ Sbjct: 186 IEKAAKLGNMEAIEFPKSFMEEGNFDFSFSGVKTAVLNYLNRQKQKGEEVNIYDVAASFQ 245 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+ L+ K A ++ +AGGV++N LR KL E KK V+Y +CTD Sbjct: 246 RNIVEVLVKKLVEAARFKNVSKVSIAGGVASNGFLRQKLEEDAKKFGLSVYYPEKIYCTD 305 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 NGAMIA A F G + + ++ P + E Sbjct: 306 NGAMIAAAAYYDFVKGKFSGMDLNAIPYLKIGE 338 >UniRef50_Q4FNV6 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Alphaproteobacteria RepID=GCP_PELUB Length = 357 Score = 412 bits (1059), Expect = e-114, Method: Composition-based stats. Identities = 137/341 (40%), Positives = 212/341 (62%), Gaps = 12/341 (3%) Query: 2 RVLGIETSCDETGIAIYDDEK----GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 +LGIE+SCDET +I + + +L++ + SQV +H ++GGVVPELA+R H+ K Sbjct: 6 IILGIESSCDETAASIITENEQGMPTILSSIVSSQVDVHKEFGGVVPELAARSHMEKIDL 65 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 + + A +SG+ +D+DA+A TAGPGL+ L VG + G+++A + + P I V+H+EGH L Sbjct: 66 ITKKAFDKSGVKMEDLDAIAATAGPGLMVCLSVGLSFGKAMASSLNKPFIAVNHLEGHAL 125 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 +P L ++ +P++ LL+SGGHTQ +SV G+G Y+ LG +IDDA GEAFDKTAKLLG+++ Sbjct: 126 SPKL-NSELNYPYLLLLISGGHTQFLSVQGLGNYKRLGTTIDDAVGEAFDKTAKLLGIEF 184 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 PGGP + A +G ++ P+P+ + G + SF+GLKT I +Q + D+A Sbjct: 185 PGGPQIEVYAKKGDPNKYELPKPIFHKGGCNLSFAGLKTAVLK-ISKQIKTEQEKYDLAA 243 Query: 238 AFEDAVVDTLMIKCKRALDQT------GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 +F+ + + L K K A ++ + V+AGGV+AN+ +R L + K+ E Sbjct: 244 SFQKTIEEILYKKSKIAFEEFKKMNTINKNKFVVAGGVAANKRIREVLTNLCKEEEFEAI 303 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + C DN AMIA G+ +FK ++L +PRWPL Sbjct: 304 FPPINLCGDNAAMIAMVGLEKFKLKQFSELDSPAKPRWPLD 344 >UniRef50_C1TLM6 O-sialoglycoprotein endopeptidase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TLM6_9BACT Length = 336 Score = 409 bits (1053), Expect = e-113, Method: Composition-based stats. Identities = 133/334 (39%), Positives = 202/334 (60%), Gaps = 5/334 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 L IE+SCD+T +A+ D ++ +L++ + SQV+ HA +GGVVPE ASR H+ +PL+ Sbjct: 4 LTLAIESSCDDTAVAVIDGQRNVLSSTMSSQVESHAPFGGVVPEYASRMHLEAILPLVDR 63 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL E+ D+D +A TAGPGL+G+LLVG + LA AW P + V+H+EGH+ A ++ Sbjct: 64 ALAEADAKPSDLDLIAVTAGPGLMGSLLVGVMTAKGLAQAWGKPILGVNHLEGHVFANVV 123 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + PF+A++VSGGHT+++ V +G Y +LG + DDAAGEA+DK AKLLGL YPGGP Sbjct: 124 NHPDLDPPFIAMIVSGGHTEVVLVEDLGFYRILGGTKDDAAGEAYDKVAKLLGLAYPGGP 183 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA--DIARAF 239 ++ ++A G F FP P+ + FSFSGLKT + + + DI +F Sbjct: 184 IVDELAKDGDPQAFDFPVPLKRSDEISFSFSGLKTAVLWQVERIKKEGASLPVEDICASF 243 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + A V+ L+ K A+ +TG +++V++GGV+AN LR + + + +CT Sbjct: 244 QRAAVEALICKLDLAVQKTGVEKVVLSGGVAANSCLRDLVLNRGDWKG---YVPDMFYCT 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 DN MI AG + G + L ++ P W + + Sbjct: 301 DNAVMIGAAGYHGWMRGRRSGLDLAPSPSWSIMD 334 >UniRef50_B0TX13 Probable O-sialoglycoprotein endopeptidase n=19 Tax=Francisella RepID=GCP_FRAP2 Length = 336 Score = 408 bits (1048), Expect = e-112, Method: Composition-based stats. Identities = 171/333 (51%), Positives = 235/333 (70%), Gaps = 5/333 (1%) Query: 1 MRVLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M VLGIE+SCDETG+AIYD K L+A+ LYSQ+ LH YGGVVPELASR+H+ K L Sbjct: 1 MLVLGIESSCDETGLAIYDYTSKTLVADVLYSQIDLHKKYGGVVPELASREHIAKLNILT 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + L + + D+ +AYTA PGL+GAL+VGAT ++L ++ + VHH+EGHLL+P Sbjct: 61 KELLSNANINFNDLSCIAYTAMPGLIGALMVGATFAKTLGLIHNIDTVAVHHLEGHLLSP 120 Query: 120 MLE-DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 +L+ + ++PFVALLVSGGHTQL V G+Y LLGESIDDAAGEAFDKTAKLLG+ YP Sbjct: 121 LLDQSSDIKYPFVALLVSGGHTQLFEVREFGEYSLLGESIDDAAGEAFDKTAKLLGMSYP 180 Query: 179 GGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANT-IRDNGTDDQTRADIA 236 GG ++ +A + ++ PRPM ++P LDFSFSGLKT NT + + +A++ Sbjct: 181 GGVEVANLAEKATDKKKYDLPRPMKNKPNLDFSFSGLKTAVLNTWYSETDQSYENKANLC 240 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 AF++A +D L+ KC++AL +TG KRLV++GGVSAN+ LR+KL + K + E+F+ + Sbjct: 241 YAFQEAAIDVLVTKCEKALQKTGNKRLVISGGVSANKLLRSKLDILSKNKGYEIFFPPMK 300 Query: 297 FCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPR 328 +CTDNGAMIA AG R+ ++L ++V+ R Sbjct: 301 YCTDNGAMIALAGAYRYANSFRDSNLEINVKAR 333 >UniRef50_A7HLB0 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Thermotogaceae RepID=GCP_FERNB Length = 337 Score = 407 bits (1046), Expect = e-112, Method: Composition-based stats. Identities = 130/334 (38%), Positives = 199/334 (59%), Gaps = 8/334 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCDET +A+ +D ++AN +YSQ+++H +GGVVPE+A+R+H+++ L Sbjct: 1 MIVLGIETSCDETSVALVEDN-TVIANLVYSQIQIHKKFGGVVPEIAAREHLKRLPILFS 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + ++ + + ID +A T GPGL+GALLVG + + LA + P + ++H+ GH+ + Sbjct: 60 ELISQTNINIERIDGIAVTKGPGLIGALLVGVSFAKGLALRYKKPLVGINHIIGHVYSNY 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + P++ L+VSGGHT ++ V +LG S+DDA GEAFDK A+LLGL YPGG Sbjct: 120 LAYPDLKPPYIVLMVSGGHTLILKVEENNNVTILGRSVDDAVGEAFDKIARLLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT-----RADI 235 P + K++ G F FP+P P +FSFSGLKT I+ + D+ Sbjct: 180 PEIDKISKNGNPNAFNFPKPKMYDPDYNFSFSGLKTAVLYEIKRLTKSGYSENNLPIPDL 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A + ++ ++D L+ K +A K +V+AGGV+AN LR K+ + ++ + Sbjct: 240 AASAQEVMIDVLLHKVTKAARDNNLKNIVLAGGVAANSRLREKIRALSEEFN--FYIPPL 297 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 E+C+DN AMIA AG+ R K+G L P + Sbjct: 298 EYCSDNAAMIARAGLERIKSGENDGLNFEPVPNF 331 >UniRef50_B6BRQ7 O-sialoglycoprotein endopeptidase n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BRQ7_9RICK Length = 357 Score = 406 bits (1044), Expect = e-112, Method: Composition-based stats. Identities = 141/342 (41%), Positives = 217/342 (63%), Gaps = 12/342 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKG----LLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 +LGIE+SCDET ++ + + +L+N + SQV++H ++GGVVPELA+R H+ K Sbjct: 6 LILGIESSCDETAASLITENEQGIPIVLSNIISSQVEVHKEFGGVVPELAARSHMEKIDW 65 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +++ A+ +SG ++IDAVA TAGPGL+ L VG + G++ A A + P I V+H+EGH L Sbjct: 66 IVEKAINDSGRKIEEIDAVASTAGPGLIVCLSVGLSFGKAFASALNKPFIAVNHLEGHAL 125 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 +P L ++ +P++ LL+SGGH+Q ++V +G+Y+ LG +IDDA GEAFDKTAKLLG+++ Sbjct: 126 SPKL-NSKLNYPYLVLLISGGHSQFLNVQDLGKYKRLGTTIDDALGEAFDKTAKLLGVEF 184 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 PGGP + MA +G + ++ P+P+ ++ G + SF+GLKT I N DQ + D+A Sbjct: 185 PGGPQIEIMAEKGDSNKYDLPKPIFNKGGCNLSFAGLKTAILK-ITKNIKTDQEKFDLAA 243 Query: 238 AFEDAVVDTLMIKCKRALDQTGF------KRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 +F+ V + L K K A ++ K V+AGGV+AN+ +R L + + + Sbjct: 244 SFQKTVEEILYKKTKIAFNEFEKQNKLKDKIFVVAGGVAANKKIRTMLINLCNENNYKGI 303 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 + E C DN AMIA G+ +FK + L +PRWPL E Sbjct: 304 FPPIELCGDNAAMIAMVGLEKFKLKQFSALDHPAKPRWPLDE 345 >UniRef50_B9KXJ0 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Chloroflexi RepID=GCP_THERP Length = 365 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 165/358 (46%), Positives = 207/358 (57%), Gaps = 31/358 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETSCDET A+ D + +L+N + SQV LH YGGVVPELASR HV VP++ Sbjct: 1 MIILGIETSCDETAAAVVRDGRFVLSNIIRSQVDLHQRYGGVVPELASRRHVTSIVPVLD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++G+ IDA+A T GPGL G+LLVG V ++LAF W+ P IPV+H+EGH+ A Sbjct: 61 LALEQAGIGPSAIDAIAVTEGPGLAGSLLVGINVAKTLAFVWEKPLIPVNHLEGHIYANW 120 Query: 121 L------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 L E P FP V L+VSGGHT+L+ + G G Y LLG ++DDAAGEAFDK A+LLG Sbjct: 121 LTLPGQDEVPEPTFPLVCLIVSGGHTELVLMRGHGDYVLLGRTLDDAAGEAFDKAARLLG 180 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-- 232 L +PGGP + K A QG GRF PR DFSFSGLKT + R Sbjct: 181 LGFPGGPAIQKAAEQGRPGRFSLPRAWLGE-SYDFSFSGLKTALLRVLEQYQRRPARRVA 239 Query: 233 -------------------ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANR 273 AD+A F+ AVV+ L K RA + G +++AGGV+AN Sbjct: 240 AGQPFPEYVAPEYGPSVPIADLAAEFQAAVVEVLAEKTARAAREFGATMVLLAGGVAANA 299 Query: 274 TLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 LR +L E+ V Y P CTDN AMIA A + G ADL + V PL Sbjct: 300 ALRQRLREI---SPVPVRYPPPILCTDNAAMIAGAAYYLAQRGVRADLDLDVHAHLPL 354 >UniRef50_Q6AL73 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Deltaproteobacteria RepID=GCP_DESPS Length = 344 Score = 403 bits (1036), Expect = e-111, Method: Composition-based stats. Identities = 141/333 (42%), Positives = 195/333 (58%), Gaps = 5/333 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIE+SCD+T A+ D + +N + Q ++H +GGVVPELASR H+ P+++ Sbjct: 8 MIILGIESSCDDTSAAVVIDGTAIQSNVISGQEEIHNCFGGVVPELASRSHLSAIQPVVE 67 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ ++ DID +A T GPGL G+LLVG + +SL+ +P + V HM GH LA + Sbjct: 68 KALSDAKISLDDIDLIATTQGPGLSGSLLVGYSYAKSLSLVKKIPFVGVDHMAGHALAIL 127 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE+ P+FPF+AL SGG + + V +ELLG + DDAAGEAFDK AK+LGL YPGG Sbjct: 128 LEEETPDFPFIALTASGGTSSIFLVKSSTDFELLGRTRDDAAGEAFDKVAKVLGLPYPGG 187 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANT-----IRDNGTDDQTRADI 235 P ++ A G FPR D+ G DFSFSGLKT N ++ + RADI Sbjct: 188 PHIAAHAETGDEKSIKFPRAWLDKDGFDFSFSGLKTAVLNYHNKIVQKNGSITKEERADI 247 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F+ AV+D L+ K A G +V+ GGVS+NR LR + K + + F Sbjct: 248 CASFQQAVIDVLVTKTINAARTHGISTVVLGGGVSSNRALRLAFSHECDKCKLQFFVPAA 307 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + CTDN AMIA AG ++ +L V R Sbjct: 308 KLCTDNAAMIAVAGYHKYLRFGPGNLSDDVYSR 340 >UniRef50_A4EBV8 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A4EBV8_9ACTN Length = 794 Score = 403 bits (1035), Expect = e-111, Method: Composition-based stats. Identities = 147/341 (43%), Positives = 192/341 (56%), Gaps = 11/341 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VL IE+SCDET +AI D + +LANQ+ +Q+ HA +GGVVPE+ASR HV V ++ A Sbjct: 454 LVLAIESSCDETAVAIIDADGNMLANQVSTQIDFHARFGGVVPEIASRKHVEVIVSVVDA 513 Query: 62 ALKESG---------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHM 112 AL+++ + ++ AV T GPGLVGAL+VG + A+A P + V+H+ Sbjct: 514 ALEDAAASLGLTGGAIAPSELAAVGVTQGPGLVGALVVGVAFAKGFAYAAGKPLVCVNHL 573 Query: 113 EGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL 172 EGHL A +L + PF+ LVSGGHT L+ V G YE+LGE++DDA GEAFDK AK Sbjct: 574 EGHLFANLLAQPDLKPPFIFTLVSGGHTMLVHVKAWGDYEVLGETLDDAVGEAFDKVAKA 633 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL YPGGP++SK+A G FPR + R FS SGLKT I +T Sbjct: 634 LGLGYPGGPIISKLAETGNPKAIDFPRALNSRGDYRFSLSGLKTAVTLYIEQETKAGRTI 693 Query: 233 --ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 D+A +FE AV D K K AL TG K + GGVSAN LR + + + ++ V Sbjct: 694 HLPDLAASFEAAVFDVQYKKAKNALHATGCKEYCIGGGVSANPHLREMMIKKLGRQGIRV 753 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 CTDN AMIA +F G + V P L Sbjct: 754 TVPPLSACTDNAAMIAEVARRKFDRGEISPFDVDADPNMTL 794 Score = 71.8 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 42/120 (35%), Gaps = 13/120 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVR-KTVPLIQ 60 V+ ++TS D +A+ + Q G R H + V + Sbjct: 9 LVVALDTSTDMLAC---------VASWIDGQTGETKLVSGDH---MCRRHANVELVNTVD 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L ++GL D+ GPG + +G + + LA +VP + V ++ Sbjct: 57 GLLAQAGLDRSDVGCYVVGRGPGSFTGVRIGISTAKGLARGANVPLLGVSTLDACAWTAW 116 >UniRef50_C7N1K1 Ribosomal-protein-alanine acetyltransferase n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1K1_SLAHD Length = 781 Score = 402 bits (1034), Expect = e-111, Method: Composition-based stats. Identities = 152/341 (44%), Positives = 187/341 (54%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D E +L++ + SQ+ HA +GGVVPE+ASR H+ + Sbjct: 439 LILAIESSCDETAAAIIDGEGSMLSDVVASQIDFHARFGGVVPEIASRKHIEAICGVTDE 498 Query: 62 -------ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 AL S L +D+DAVA T PGLVGAL+VG + A+ D+P I V+H+EG Sbjct: 499 CLDVAARALGRSRLRWRDLDAVAVTYAPGLVGALVVGVAFAKGAAWGADLPIIAVNHLEG 558 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A L + + P V LVSGGHT L+ V G YE LG +IDDA GEAFDK +K LG Sbjct: 559 HLYANRLAEPDIQPPMVVSLVSGGHTMLVHVKDWGDYETLGSTIDDAVGEAFDKVSKALG 618 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTR 232 L YPGGP++SK AAQG A FPR + L FS SGLKT I + Sbjct: 619 LGYPGGPIISKYAAQGDAKAIAFPRALMHSGDLRFSLSGLKTAVTTYINKEREAGRELNI 678 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +FE AVVD + K AL TG + + GGV+AN LR M KK + Sbjct: 679 PDIAASFEAAVVDVQVSKAHTALKDTGARTFCLGGGVAANPALRGAYEAMCKKHGYRLVM 738 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN AMIA RF G AD + V PL E Sbjct: 739 PPLSACGDNAAMIAEVARDRFAQGKFADWSLDVTAHAPLDE 779 Score = 65.2 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 42/112 (37%), Gaps = 5/112 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +T+ + + + G L + A+ A R + I A Sbjct: 5 ILAFDTANEVVAVGV-----GRLPDDAVDITAAQAECVASASVSARRASNTTLIARIDEA 59 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L +G+T + AV GPG + + + +A A +VP + V ++ Sbjct: 60 LASAGVTKDQVAAVVCGRGPGSFTGVRICMATAKGMASALEVPLLGVSTLDA 111 >UniRef50_B3WUZ1 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Escherichia coli B171 RepID=B3WUZ1_ECOLX Length = 332 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. Identities = 145/326 (44%), Positives = 209/326 (64%), Gaps = 4/326 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IETSCDETG+A++ ++ L+++ LYSQV +H+ +GG+VPE+ASR + PLI+ Sbjct: 2 LLAIETSCDETGVALFSEDGKLISHLLYSQVAIHSPFGGIVPEIASRKQLEVLYPLIKEL 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 LK++ + + AVA T GPGL+G+LLVG ++ ++++FA +P I V H++ HLLA LE Sbjct: 62 LKQNNIEISQLKAVAATFGPGLIGSLLVGVSLAKAISFALKIPLIAVDHLQAHLLAVFLE 121 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 EFPF+ LLVSGGHT L + ++ ++G + DDAAGEAFDK AKLLGL YPGGP+ Sbjct: 122 -KEIEFPFIGLLVSGGHTALFLINSFFEFYVIGHTKDDAAGEAFDKVAKLLGLPYPGGPI 180 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 +S++A +G PRP+ + LDFSFSGLKT N I+++ D+ FE+A Sbjct: 181 ISQLAEKGDPKAINLPRPLLEDKSLDFSFSGLKTAVLNYIKNHS---YRVEDLCAGFEEA 237 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 V D L+ K RA+D R+V+AGGV+AN+ LR + E E+++ EFCTDN Sbjct: 238 VCDVLVYKTFRAVDLFKVPRVVVAGGVAANKRLRQRFREKAFNTGVEIYFPSLEFCTDNA 297 Query: 303 AMIAYAGMVRFKAGATADLGVSVRPR 328 AM+ G +++ ADL R Sbjct: 298 AMVGLLGYKQWQEKKYADLNTEAYAR 323 >UniRef50_Q0AVU0 Probable O-sialoglycoprotein endopeptidase n=27 Tax=Bacteria RepID=GCP_SYNWW Length = 339 Score = 401 bits (1030), Expect = e-110, Method: Composition-based stats. Identities = 141/329 (42%), Positives = 199/329 (60%), Gaps = 1/329 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCDET AI + K +L+N + SQ+ +H +GGVVPE+ASR H+ ++ Sbjct: 8 LILGIETSCDETAAAIVRNGKEILSNIVNSQIDIHQQFGGVVPEVASRKHIENIAGVVHR 67 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A E+ L IDAVA T PGLVGALLVG + ++ A+A + P I V+H+ GH+ A L Sbjct: 68 AFSEAQLAYSAIDAVAVTNRPGLVGALLVGVSFAKAFAYALEKPLIAVNHLHGHIYANFL 127 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E EFP + L+VSGGHT L+ ++ + E+LGE+ DDAAGEAFDK A+ LGL YPGGP Sbjct: 128 EHRDIEFPAICLVVSGGHTSLLLMSNPNKMEVLGETRDDAAGEAFDKVARFLGLGYPGGP 187 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ-TRADIARAFE 240 + + A +G AG+ PR DR +FSFSGLKT A N Q D+A F+ Sbjct: 188 AIQEAATKGKAGQLQLPRVFLDRNDFEFSFSGLKTAAMNQWNKLQRRGQANVFDMAAEFQ 247 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 A+V+ L+ K +A + + ++MAGGV+AN+ LR + + K+ ++FY + CTD Sbjct: 248 AALVEVLVEKSIKAAAKYQVRTIMMAGGVAANQELRNLMKKRTKEAGLKLFYPSLKLCTD 307 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRW 329 N AM+A + + A L ++ P Sbjct: 308 NAAMVAANAHYHYGNRSFAPLSLNAYPSL 336 >UniRef50_D0ME01 Metalloendopeptidase, glycoprotease family n=4 Tax=Bacteria RepID=D0ME01_RHOM4 Length = 339 Score = 401 bits (1030), Expect = e-110, Method: Composition-based stats. Identities = 151/331 (45%), Positives = 207/331 (62%), Gaps = 9/331 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T A+ E L +N + SQ H YGGVVPELASRDH R+ VP+++ A Sbjct: 8 ILGIETSCDDTAAAVVV-EGKLRSNVVASQQATHLRYGGVVPELASRDHQRRIVPVVRQA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+GLT +D+DAVA T GPGLVG+LLVG + ++ A P I V+H+EGH+ + +E Sbjct: 67 LQEAGLTPRDLDAVAVTYGPGLVGSLLVGLSFAKAFALGLGRPLIGVNHLEGHIYSVFIE 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P FP++ L+VSGGHTQL+ V ++ LLG + DDAAGEAFDK A+LLGL YPGGP Sbjct: 127 PPSPPFPYLCLIVSGGHTQLMRVDEGFRHTLLGRTRDDAAGEAFDKVARLLGLGYPGGPE 186 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI------RDNGTDDQTRADIA 236 + ++A QG FPRP + G DFSFSGLKT + +Q RAD+ Sbjct: 187 IDRLARQGDPNFVAFPRPRLE--GYDFSFSGLKTAVRYYLDQFSEAERARLLEQHRADLC 244 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ AVVD L+ +RA+ TG + + + GGVSAN LRA + ++ ++ Sbjct: 245 ASFQQAVVDVLIDSLRRAIQDTGLRHVAIVGGVSANSALRAAAQALAEELDVRLYIPPLA 304 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +C DN AMIA G + +AG + L ++ P Sbjct: 305 YCMDNAAMIAITGYFKARAGLESPLTLAAVP 335 >UniRef50_A0L5L8 Probable O-sialoglycoprotein endopeptidase n=24 Tax=Bacteria RepID=GCP_MAGSM Length = 353 Score = 400 bits (1029), Expect = e-110, Method: Composition-based stats. Identities = 168/347 (48%), Positives = 220/347 (63%), Gaps = 12/347 (3%) Query: 1 MRVLGIETSCDETGIAIYDD-------EKGLLANQLYSQVKLHADYGGVVPELASRDHVR 53 +RVLGIE+SCDET A+ + + +N ++SQ+++HA YGGVVPELASR H+R Sbjct: 2 LRVLGIESSCDETAAAVVEGAEHGHPHGVVVRSNVVWSQLEVHALYGGVVPELASRAHIR 61 Query: 54 KTVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 P+I+ AL E+G+ + +DA+A T PGLVGALLVG + LA A D P +PVHHME Sbjct: 62 HIQPVIEQALAEAGVRPQQLDAIAVTVAPGLVGALLVGVAAAQGLAVALDKPLVPVHHME 121 Query: 114 GHLLAPMLED---NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTA 170 GHL++P L EFPFVALLVSGGHT L+ G Y+LLG++ DDA GEAFDK A Sbjct: 122 GHLMSPFLMAGVVPAMEFPFVALLVSGGHTLLLHARDFGDYQLLGQTRDDAVGEAFDKGA 181 Query: 171 KLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-- 228 ++LGL YPGGP ++ +A G FPR + DR DFSFSGLKT + + Sbjct: 182 RMLGLGYPGGPEVAALAQSGDRQAVAFPRVLLDRSQFDFSFSGLKTALRTHLLKFPPESG 241 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 + AD+A ++++A+VDTL+IK A G RLV+AGGV ANR LR KLA+ K+ Sbjct: 242 GPSLADVAASYQEAIVDTLVIKSLSACRHVGVSRLVIAGGVGANRRLREKLAKQALKQGV 301 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 +++ CTDNGAMIA AG+ R G A V+ PR P+ EL Sbjct: 302 QLYAPPIHLCTDNGAMIASAGVCRLARGDQARGVVNAVPRLPIHELE 348 >UniRef50_Q11TP2 Probable O-sialoglycoprotein endopeptidase n=87 Tax=Bacteria RepID=GCP_CYTH3 Length = 343 Score = 398 bits (1024), Expect = e-109, Method: Composition-based stats. Identities = 147/335 (43%), Positives = 206/335 (61%), Gaps = 9/335 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCDET A+ D +L N + SQ ++H YGG+VPELASR H + +P++ A Sbjct: 10 LLAIESSCDETAAAVIQD-GNILCNIVASQ-RIHEKYGGIVPELASRAHQQHIIPVVAQA 67 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ + D++AVA T+GPGL+GALLVG + ++ A A +P I V+HM+ H+LA + Sbjct: 68 LLEANIQKSDLNAVACTSGPGLLGALLVGVSFSKAFASALHIPVIKVNHMKAHILAHFIG 127 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 D P FPF+ + VSGGHTQL+ V + E++GE+ DDA GEAFDKTAKL+GL YPGGPL Sbjct: 128 DVKPSFPFICMTVSGGHTQLVIVRNYLEMEVVGETQDDAVGEAFDKTAKLMGLPYPGGPL 187 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-----QTRADIAR 237 + A QG FP P D PG ++SFSG+KT ++ N D + DI Sbjct: 188 IDSYAKQGNP--LAFPFPTVDMPGYNYSFSGIKTAFMYFLKKNTAVDPDFIQKNLPDICA 245 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + + A++D LM K KR + TG R+ +AGGVSAN LR + + ++ +V+ E+ Sbjct: 246 SVQHALIDVLMRKLKRLVVDTGINRVAIAGGVSANSGLRKAMEQKREQEGWDVYIPAFEY 305 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 CTDN AMIA AG ++ A +S PR + Sbjct: 306 CTDNAAMIAVAGYHQYLENDFAGWDLSPEPRLRIG 340 >UniRef50_Q6MQ48 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Bdellovibrio bacteriovorus RepID=GCP_BDEBA Length = 345 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 136/335 (40%), Positives = 191/335 (57%), Gaps = 8/335 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 RVL IETSCD+T +AI D + + SQ H YGG+VPE+A+R+H +PLI+ Sbjct: 4 RVLAIETSCDDTSVAIVDRTGWVHSVVAASQDLDHEIYGGIVPEIAARNHSIALIPLIEE 63 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A K++ + D+ +A T PGL+GAL+VG +SL+ A +P + V+H+EGHLLAP L Sbjct: 64 AFKKANMNWSDVQGIAVTNRPGLIGALIVGLVTAKSLSQAKHLPFLGVNHLEGHLLAPFL 123 Query: 122 ED------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 D +P+V L +SGGHT L + G+G Y +LG + DDAAGE FDK AK+ GL Sbjct: 124 RDDKYAPPEDFGYPYVGLAISGGHTSLYQIKGLGDYRILGATKDDAAGECFDKFAKMAGL 183 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRA 233 +PGG + +MA G F FPR M D SFSGLK+ + G + + Sbjct: 184 GFPGGVRVDQMAKAGNPQAFEFPRSMIHDDTFDMSFSGLKSSGQRMLEQLGPELVQERLP 243 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 D+ +F++A+VD L+ K RA KR+++ GGVSAN LR + E K+ + Sbjct: 244 DLCASFQEAIVDVLIAKLDRAAKVFRSKRVILTGGVSANSRLRQRAQEWADKKGYTLVIP 303 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +CTDN AMI Y G +R G + L + P+ Sbjct: 304 PLRYCTDNAAMIGYVGALRMARGEVSALDLGPSPQ 338 >UniRef50_A5CE49 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Orientia tsutsugamushi RepID=GCP_ORITB Length = 344 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 136/346 (39%), Positives = 203/346 (58%), Gaps = 14/346 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M V+GIE+SCD+T IAI + + ++AN + SQ H Y GVVPE+A+R H++ ++ Sbjct: 1 MNVIGIESSCDDTAIAIVNSNREIIANVVISQYTEHLPYSGVVPEIAARAHLKNLQYAMK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L ++ + DID +A T+GPGL+G ++VG+ G+++A A I V+H+EGH+LA Sbjct: 61 ETLNQAKINFTDIDVIAATSGPGLIGGIIVGSVFGQAIACALGKDFIAVNHLEGHILAVR 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L +N FP++ LLVSGGH Q I+V G+G+Y++LG++IDDA GEAFDKTA+LL L YPGG Sbjct: 121 LNEN-ISFPYLVLLVSGGHCQFIAVLGVGKYKILGQTIDDAVGEAFDKTARLLKLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADIARAF 239 P++ K+A++G ++ P MT + G D SFSGLKT I ++ DI +F Sbjct: 180 PIIEKLASKGDPHKYSLPLSMTKKSGCDLSFSGLKTAVKQLIFSIESLSEKVICDICASF 239 Query: 240 EDAVVDTLMIKCKRALDQT-----------GFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 + VV L+ + A+ V++GGV+AN+ LR ++ + Sbjct: 240 QYTVVQILLCRSINAIKLFESYCSNNFKINRKNYFVISGGVAANQYLRQEIFNLANTYGY 299 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 CTDN AMIA+AG+ R A + R +W + EL Sbjct: 300 CGVAPPSNLCTDNAAMIAWAGIERLNANLFSS-NFVPRAKWSVEEL 344 >UniRef50_Q8DLI9 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Cyanobacteria RepID=GCP_THEEB Length = 353 Score = 396 bits (1018), Expect = e-109, Method: Composition-based stats. Identities = 147/342 (42%), Positives = 194/342 (56%), Gaps = 8/342 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L IETSCDET A+ D + + +N + SQV H +GGVVPE+ASR H+ +I A Sbjct: 3 RILAIETSCDETAAAVVRD-RAIESNVIASQVCAHQPFGGVVPEVASRAHLENINGVITA 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ E+G IDA+A T PGLVG+LL+G T ++LA P + +HH+EGHL A L Sbjct: 62 AISEAGCDWSAIDAIAVTCAPGLVGSLLIGVTAAKTLALVHQKPLLGIHHLEGHLYASYL 121 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + E PF+ LLVSGGHT LI V G G+Y+L G++ DDAAGEA+DK A+L+GL YPGGP Sbjct: 122 AEPTLEPPFLCLLVSGGHTSLIGVYGCGEYQLFGQTRDDAAGEAYDKVARLMGLGYPGGP 181 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPG-----LDFSFSGLKTFAANTIRDNGT--DDQTRAD 234 LL + A QG F P P D SFSGLKT A + + + AD Sbjct: 182 LLDRWAQQGNPEAFDLPEGNIRLPDGKVHPYDASFSGLKTAVARLVAELRQTHPELPVAD 241 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 +A +F+ AV L + A GFK L + GGV+AN LR L + + + Sbjct: 242 LAASFQKAVAQALTKRAIAAAVDHGFKTLAIGGGVAANSGLRQHLTAAAEPLGLRLIFPP 301 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 CTDN AMI A F+ G + L ++ R R L E+ A Sbjct: 302 LRLCTDNAAMIGCAAADHFQRGDRSPLDLTARSRLSLLEISA 343 >UniRef50_D1IZQ0 Whole genome shotgun sequence of line PN40024, scaffold_48.assembly12x (Fragment) n=15 Tax=Magnoliophyta RepID=D1IZQ0_VITVI Length = 468 Score = 396 bits (1017), Expect = e-109, Method: Composition-based stats. Identities = 142/359 (39%), Positives = 203/359 (56%), Gaps = 28/359 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+T AI +L+ + SQ L A YGGV P++A H++ ++Q A Sbjct: 77 VLGIETSCDDTAAAIVRSNGDILSQVVSSQADLLARYGGVAPKMAEGAHMQVIDRVVQDA 136 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ + LT +D+ AVA T GPGL L VG R +A + ++P + VHHME H L L Sbjct: 137 LENANLTERDLSAVAVTIGPGLSLCLRVGVQKARKIAGSHNLPIVGVHHMEAHALVARLI 196 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--PGG 180 + +FPF+ALL+SGGH LI +G Y LG +IDDA GEA+DKTAK LGLD GG Sbjct: 197 EKDLQFPFMALLISGGHNLLILARDLGHYIQLGTTIDDAIGEAYDKTAKWLGLDLRRSGG 256 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ---------- 230 P + ++A +G A F PM +FS++GLKT I + + Sbjct: 257 PAIEELAREGDAKAVKFSTPMKQHKDCNFSYAGLKTQVRLAIESRNINAEIPISSASSED 316 Query: 231 --TRADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEMM 283 +RADIA +F+ V L +C+RA++ + K LV++GGV++N+ +RA+L +++ Sbjct: 317 RSSRADIAASFQRVAVLHLEERCERAIEWALKIEPSIKHLVVSGGVASNQYVRAQLDQVV 376 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATA---------DLGVSVRPRWPLAE 333 KK+ ++ P CTDNG M+A+ G+ F+ G D +RPRWPL E Sbjct: 377 KKKSLQLVCPPPSLCTDNGVMVAWTGLEHFRMGRYDPPPPANEPEDYVYDLRPRWPLGE 435 >UniRef50_Q3YS67 Probable O-sialoglycoprotein endopeptidase n=24 Tax=Rickettsiales RepID=GCP_EHRCJ Length = 350 Score = 396 bits (1017), Expect = e-109, Method: Composition-based stats. Identities = 142/338 (42%), Positives = 206/338 (60%), Gaps = 8/338 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCDET +AI + K +L++++ SQ K HA+YGGVVPE+ASR H+ L + Sbjct: 8 VLGIETSCDETAVAIVNSNKEVLSHKILSQ-KEHAEYGGVVPEIASRAHINYLYDLTVSC 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ES L+ +IDAVA T+GPGL+G L+VG + + +A P I ++H+E H L + Sbjct: 67 IEESQLSLNNIDAVAVTSGPGLIGGLIVGVMIAKGIASVTGKPIIEINHLEAHALIVRMF 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 FPF+ L++SGGH Q + V +G Y LG S+DD+ GE FDK AK+L L YPGGP+ Sbjct: 127 YE-INFPFLLLIISGGHCQFLIVYNVGCYHKLGSSLDDSLGEVFDKVAKMLNLGYPGGPV 185 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG-TDDQTRADIARAFED 241 + K + G + FV PR +T R G DFSFSGLKT N I ++ D++ DI+ +F++ Sbjct: 186 IEKKSLSGDSKSFVLPRALTGRCGCDFSFSGLKTAVRNIIMNHEYIDNKLICDISASFQE 245 Query: 242 AVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 V D L+ + A+ + +LV+ GGV+AN+ LR ++ E+FY + Sbjct: 246 CVGDILVNRINNAIAMSKAIDKRIDKLVVTGGVAANKLLRERMLRCASDNNFEIFYPPSK 305 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 CTDNG MI +AG+ ++L + + RWPL L Sbjct: 306 LCTDNGIMIGWAGIENLVKDYVSNLDFAPKARWPLESL 343 >UniRef50_B8BPP0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BPP0_ORYSI Length = 401 Score = 394 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 135/362 (37%), Positives = 198/362 (54%), Gaps = 29/362 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +LGIETSCD+T A+ + +L+ + SQ L +GGV P++A H ++Q Sbjct: 10 LLMLGIETSCDDTAAAVVRGDGEILSQVVSSQEDLLVRWGGVAPKMAEEAHSLAIDQVVQ 69 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ ++ D+ AVA T GPGL L VG R +A ++ +P + VHHME H L Sbjct: 70 KALDDANVSENDLSAVAVTVGPGLSLCLRVGVHKARKIAKSFRLPIVGVHHMEAHALVSR 129 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--P 178 L + +FPF+ALL+SGGH L+ G+GQY LG +IDDA GEA+DK+A+ LGLD Sbjct: 130 LVNKDLDFPFLALLISGGHNLLVLAHGLGQYVQLGTTIDDAIGEAYDKSARWLGLDMRKG 189 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN------------- 225 GGP L ++A +G F PM +FS++GLKT I Sbjct: 190 GGPALEQLALEGDPNAVKFSVPMRQHKDCNFSYAGLKTQVRLAIESRNISTDDIPISSAT 249 Query: 226 GTDDQTRADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLA 280 D Q RA+IA +F+ V L +C+RA++ + K V++GGV++N+ +R L Sbjct: 250 KDDRQIRANIAASFQRVAVLHLEERCQRAVEWALKMEPSIKYFVVSGGVASNQYVRTHLN 309 Query: 281 EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT---------ADLGVSVRPRWPL 331 ++ +K ++ P+ CTDNG MIA+ G+ F AG D+ +RPRWPL Sbjct: 310 QIAEKNGLQLVCPPPKLCTDNGVMIAWTGIEHFIAGRFDDPPAVDEPDDMQYDLRPRWPL 369 Query: 332 AE 333 E Sbjct: 370 GE 371 >UniRef50_A8GM49 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Rickettsia RepID=GCP_RICAH Length = 386 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 131/384 (34%), Positives = 201/384 (52%), Gaps = 51/384 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +++LGIE+SCD+T ++I + + +L+N + SQ HA +GGVVPE+A+R H+ + Sbjct: 2 IKILGIESSCDDTAVSIITENREILSNIIISQNTEHAVFGGVVPEIAARSHLSNLDKALT 61 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 LKES +I A+A T+GPGL+G ++VG+ RSL+ P I ++H+EGH L Sbjct: 62 NVLKESNTKLIEISAIAATSGPGLIGGVIVGSMFARSLSSTLKKPFIAINHLEGHALTAR 121 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L DN +P++ LL SGGH Q ++V G+G+Y++LG +IDDA GE FDK AK+L L +PGG Sbjct: 122 LTDN-IPYPYLLLLASGGHCQFVAVLGLGKYKILGSTIDDAVGETFDKVAKMLNLAFPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT------------- 227 P + + A G ++ FP+P+ + + SFSGLKT I + Sbjct: 181 PEIEQKAKLGDPHKYKFPKPIINSGNCNMSFSGLKTAVRTLIMNLQEINYNECNHLESVR 240 Query: 228 ------------------------------DDQTRADIARAFEDAVVDTLMIKCKRALDQ 257 +D DIA +F+ + + L K + A+ Sbjct: 241 QDEVQEEFAQRTKVHEHRRKLQNSLVSSFLNDSVINDIAASFQFTIGEILSSKVQDAIRA 300 Query: 258 T-------GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGM 310 K +++AGGV+AN+ L+ L+ K ++ Y CTDN AMIAYAG+ Sbjct: 301 YEQITNNFDKKNIIIAGGVAANKYLQEILSNCAKTYGYQLIYPPIHLCTDNAAMIAYAGL 360 Query: 311 VRFKAGATADLGVSVRPRWPLAEL 334 R+ L + RW L ++ Sbjct: 361 ERYNNKLFTPLNFCPKARWSLEDI 384 >UniRef50_D0RQS5 Putative glycoprotease GCP n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RQS5_9RICK Length = 358 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. Identities = 138/341 (40%), Positives = 209/341 (61%), Gaps = 13/341 (3%) Query: 4 LGIETSCDETGIAIYDDEK----GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 LGIETSCDET A+ K +L+N + SQ +H +GGVVPELA+R H K +I Sbjct: 8 LGIETSCDETAAALVKKSKNGKVKILSNVVSSQEIVHKKFGGVVPELAARAHSEKIDLII 67 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + A+K+S ++ ID VA TAGPGL+ L+VG T G+++A A P +H+EGH L Sbjct: 68 KEAIKKSKVSIHQIDGVACTAGPGLLICLMVGMTAGKTIASALKKPFFGTNHLEGHALTM 127 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L P +FP++ LL+SGGH+Q +SV G+G+Y+ LG +IDDA GEAFDKTAK+LG+++PG Sbjct: 128 GLI-RPVKFPYLLLLISGGHSQFLSVEGVGKYKRLGTTIDDALGEAFDKTAKILGIEFPG 186 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 GP + A G F P+P+ + G + S++GLKT + N Q + D+A +F Sbjct: 187 GPKIETFAKFGNENSFDLPKPILHKSGCNMSYAGLKTAVLHA-SKNIKSKQDKYDLAASF 245 Query: 240 EDAVVDTLMIKCKRALDQT-------GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 + + + L +KC +A++ K V+AGGV++N+++R + ++ + + Sbjct: 246 QKTINEILKVKCAKAIEMFLEKHKKIKNKNFVVAGGVASNQSIRKTIKQVSSTLKFNTHF 305 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 CTDN AMIA+AG+ ++AG +L + +PRWPL + Sbjct: 306 PPLNLCTDNAAMIAWAGLQNYEAGKKPNLKIISQPRWPLDQ 346 >UniRef50_B2GAG0 Probable O-sialoglycoprotein endopeptidase n=56 Tax=Lactobacillales RepID=GCP_LACF3 Length = 344 Score = 391 bits (1005), Expect = e-107, Method: Composition-based stats. Identities = 137/337 (40%), Positives = 213/337 (63%), Gaps = 6/337 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L E+SCDET +++ +D +L+N + +Q+ H +GGVVPE+ASR H+ + + Sbjct: 6 LILAFESSCDETSVSVIEDGHRVLSNIVATQIASHQRFGGVVPEVASRHHIEQITKCTKE 65 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G++ +D+ AVA T GPGLVG+LL+G T +++A+A +P +PV+HM GHL A Sbjct: 66 ALEQAGVSYQDLTAVAVTYGPGLVGSLLIGVTAAKTIAWAHQLPLVPVNHMAGHLYAARF 125 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + +P + LLVSGGHT+L+ + Y+++GE+ DDAAGEA+DK +++G++YP G Sbjct: 126 VSD-FTYPMLGLLVSGGHTELVYMKEEHDYQIIGETRDDAAGEAYDKVGRVMGINYPAGK 184 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAF 239 + + AA+G F FPR M DFSFSGLK+ NT+ D + + D+A +F Sbjct: 185 TVDQWAAKG-HDTFHFPRAMEKEDNFDFSFSGLKSAFINTVHNADQRGEVLDKYDLAASF 243 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV--FYARPEF 297 + +VVD L+ K RALD+ K+L++AGGV+AN+ LR +L+ ++ + EV A ++ Sbjct: 244 QQSVVDVLVAKTIRALDEFPVKQLILAGGVAANQGLRKQLSAGLQAKHPEVQLLQAPLKY 303 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 C DN AMI AG V + G AD ++ P A L Sbjct: 304 CGDNAAMIGAAGYVNYLHGDRADGSLNAVPGLSFAHL 340 >UniRef50_B1GZV6 Probable O-sialoglycoprotein endopeptidase n=1 Tax=uncultured Termite group 1 bacterium phylotype Rs-D17 RepID=GCP_UNCTG Length = 342 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 144/339 (42%), Positives = 205/339 (60%), Gaps = 8/339 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M + IETSCDET ++ + + + +YSQ+K+HA + GVVPELASR H+ +I Sbjct: 1 MNIFAIETSCDETSASVVLNGLKVKSVVIYSQIKIHAGFFGVVPELASRSHIENINLVIW 60 Query: 61 AALKESGLTAKD----IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 AL ++G+ D IDA+A+T+GPGL GALLVGA +SLA + P IPV+H++GHL Sbjct: 61 RALSDAGINFTDFSQKIDALAFTSGPGLAGALLVGAIAAKSLACVYKKPLIPVNHLDGHL 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 + ++E+ + PF++L++SGGHT+L+ V G+Y++LG + DDAAGEAFDK AK+LGL Sbjct: 121 YSSLIENRSVKLPFLSLIISGGHTELVVVEDFGKYKVLGSTRDDAAGEAFDKAAKMLGLS 180 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG-TDDQTRADI 235 YPGGP++ K+A G F RP + DFSFSG+KT N ++ N +++ DI Sbjct: 181 YPGGPIIDKIAESGNPEAVRFTRPYL-KGSWDFSFSGIKTALLNYLKTNPVRNEKQLNDI 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F AV +TL K A + KR+V+ GGVSAN +R E +K +VF Sbjct: 240 CASFRQAVAETLCFKSFEAAKKFNLKRIVLGGGVSANSLIRKIFLETGQKNNTKVFIPSL 299 Query: 296 EFCTDNGAMIAYAGMVRFKA-GATAD-LGVSVRPRWPLA 332 + TDN AMI A + K G D + + P PL Sbjct: 300 IYSTDNAAMIGCAAYFKQKKCGLKYDNIQLKPNPSLPLE 338 >UniRef50_D0N6Q4 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0N6Q4_PHYIN Length = 374 Score = 388 bits (996), Expect = e-106, Method: Composition-based stats. Identities = 136/351 (38%), Positives = 201/351 (57%), Gaps = 19/351 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGIETSCD+T A+ D + +L+N + SQ +L+A + G+VP LA+R H +I AA Sbjct: 21 TLGIETSCDDTAAAVLDQDGRVLSNVISSQWELNAKWRGIVPALAARAHENNLPHVINAA 80 Query: 63 LKESGL-TAKDIDAVAYTAGPGLVGALLVGATVGRSLAF-AWDVPAIPVHHMEGHLLA-- 118 L++SGL + + + AVA T+GPGL L VG R + D+ + ++H+E H+L Sbjct: 81 LEQSGLESLQQLSAVAVTSGPGLAPCLDVGLRTARQICLDNPDIAFLQINHLEAHVLVSR 140 Query: 119 -PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL---- 173 P LE PEFPFV LLVSGGH L+ G+G YELLG ++DD+ GEA+DK A++L Sbjct: 141 LPQLETPRPEFPFVVLLVSGGHCCLVLAKGLGDYELLGNTLDDSIGEAYDKVARMLDITA 200 Query: 174 --GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-DDQ 230 G GG L+ MAA+G F F PM R DFS+SG+KT ++ G D++ Sbjct: 201 SSGKGVHGGKLIEDMAARGNDRAFPFTEPMKHRKDCDFSYSGIKTAMLREVKKLGELDEK 260 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQT------GFKRLVMAGGVSANRTLRAKLAEMMK 284 + D+ +F+ VD L+ + +RA + LV+ GGV++N+ LR ++ Sbjct: 261 MKEDLCASFQRKAVDQLITRTRRACQWSKDRLGDNITSLVVCGGVASNQYLRDRMQAAAA 320 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGV-SVRPRWPLAEL 334 + + ++CTDNG M+A+AG+ R+ G +D +PRWPL L Sbjct: 321 EEEVAAVFPPAKYCTDNGVMVAWAGLERYAKGMRSDPEPARYQPRWPLETL 371 >UniRef50_Q0SM86 Probable O-sialoglycoprotein endopeptidase n=18 Tax=Borrelia burgdorferi group RepID=GCP_BORAP Length = 346 Score = 387 bits (995), Expect = e-106, Method: Composition-based stats. Identities = 125/332 (37%), Positives = 191/332 (57%), Gaps = 10/332 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIETSCD+ +A+ ++ +L+N SQ K H Y GVVPE+ASR H + + Sbjct: 1 MKVLGIETSCDDCCVAVVENGIHILSNIKLSQ-KEHEKYYGVVPEIASRLHTEAIMSVCI 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ +ID +A T+ PGL+G+L+VG + LA + P I + H+ GHL AP+ Sbjct: 60 KALKKANTKISEIDLIAVTSRPGLIGSLIVGLNFAKGLAISLKKPIICIDHILGHLYAPL 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + E+PF++LL+SGGHT + E+LG ++DD+ GEAFDK AK + +PGG Sbjct: 120 M-HSKIEYPFISLLLSGGHTLIAKQKNFDDVEILGRTLDDSCGEAFDKVAKHYDIGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG--LDFSFSGLKTFAANTIR--DNGTDDQTRADIA 236 P + +++ G F FP + DFS+SGLKT + + N + T+ +IA Sbjct: 179 PNIEQISKNGDENTFKFPVTTFRKKENWYDFSYSGLKTACIHQLEKFKNKDNPTTKNNIA 238 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ A + L+ KRA+ T K+LV+AGGV++N LR K+ K + + +Y + Sbjct: 239 ASFQKAAFENLITPLKRAIKDTQIKKLVIAGGVASNLYLREKI----DKLKIQTYYPPLD 294 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 CTDNGAMIA G + + + + R Sbjct: 295 LCTDNGAMIAGLGFNMYLKYGESPIEIEANSR 326 >UniRef50_C1SJZ8 Metalloendopeptidase, putative, glycoprotease family n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJZ8_9BACT Length = 327 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 145/330 (43%), Positives = 202/330 (61%), Gaps = 9/330 (2%) Query: 1 MRVLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M +LGIE+SCDET +A+YD + + A SQ +LH+ +GGVVPE+ASR+H+ K L Sbjct: 1 MIILGIESSCDETSLAVYDSVNRSVKATFTSSQAELHSKFGGVVPEVASRNHILKIESLF 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + + E+G+T +DIDA+ T PGL+GAL VG + ++L +A +P IPV+H+ H+LA Sbjct: 61 EQCMTEAGITPQDIDAIGVTNAPGLIGALFVGVSFAKALGYALKIPVIPVNHLSAHILAS 120 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L + + P++AL++SGGHT + V +ELL +IDDAAGE+FDK AK+LGL YPG Sbjct: 121 ELTNQELKAPYLALIISGGHTHIYDVDEAYNFELLARTIDDAAGESFDKVAKMLGLGYPG 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 GP + K+A G + P + +P DFSFSGLKT N I D D ADIA +F Sbjct: 181 GPAIEKLAESGDENKVTLPIAIKKKP--DFSFSGLKTAVLNKINDKSESD---ADIAASF 235 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + V +TL +K R + G ++V+AGGV+ N +R M+K+ EVF+ P CT Sbjct: 236 QKTVAETLTLKTLRMAESLGRNKIVVAGGVACNGYIRRAF---MEKQGYEVFFPSPRLCT 292 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 DNG MIAYA F A L + R Sbjct: 293 DNGDMIAYAASKFFGQRKFASLDETAHDRM 322 >UniRef50_C7ND80 Metalloendopeptidase, glycoprotease family n=3 Tax=Leptotrichia RepID=C7ND80_LEPBD Length = 339 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 131/332 (39%), Positives = 206/332 (62%), Gaps = 14/332 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L ETSCDET +A+ +D K +L+N + +Q+ +H ++GGVVPE+ASR H+ +P+ Sbjct: 1 MKILAFETSCDETSVAVVEDGKKILSNIISTQIDIHKEFGGVVPEIASRHHIENILPVFT 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ DID +A T PGL+G+LLVG +SL++A ++P +PV+H+ GH+ + Sbjct: 61 EALEKANCELSDIDYIAVTNTPGLIGSLLVGLMFAKSLSYANNIPLLPVNHINGHIFSSF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISV---TGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 + DN + P ++L+VSGGHT L + G +LLGE++DDA GE +DK A++LGLDY Sbjct: 121 I-DNDVKLPAISLVVSGGHTNLYYIYEENGKIITDLLGETLDDAVGETYDKIARILGLDY 179 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTRADI 235 PGGP + K++ G +P D G +FSFSG+KTF N + + + ++ DI Sbjct: 180 PGGPHIDKLSING-EDILKIKKPKVD--GYNFSFSGIKTFITNYVNNQKMKGNAISKEDI 236 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM-----KKRRGEV 290 A++ ++ +V+ L K A+ + K +++AGGVSAN+ LR K +E K + V Sbjct: 237 AKSLQEIIVNVLYDKILMAVKEKDVKTILVAGGVSANKRLREKFSEFTNIKTDKNEQIAV 296 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLG 322 + + E+CTDN AMI A K + +LG Sbjct: 297 HFPKMEYCTDNAAMIGVAAYYDLKNNSQVELG 328 >UniRef50_C8W929 Metalloendopeptidase, glycoprotease family n=2 Tax=Atopobium RepID=C8W929_ATOPD Length = 832 Score = 386 bits (991), Expect = e-106, Method: Composition-based stats. Identities = 146/340 (42%), Positives = 187/340 (55%), Gaps = 9/340 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L +E+SCDET + I D + AN + +Q+ HA +GGVVPE+ASR H V L + Sbjct: 479 LILSLESSCDETAMCIMDSHGVVCANVVATQIDFHARFGGVVPEIASRKHTEAIVGLFEE 538 Query: 62 ALKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + +G L D+ AV TAGPGLVGAL+VG + A D+P IPVHH+EG Sbjct: 539 TMARAGAHFGCDTLVPSDLAAVGVTAGPGLVGALVVGVAFAKGFCVATDLPLIPVHHLEG 598 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HLLA + E E PFVA LVSGG+T L+ V G Y +LG +IDDA GEAFDK AK LG Sbjct: 599 HLLANLFETPDLEPPFVASLVSGGNTMLVHVRAWGDYVVLGSTIDDAVGEAFDKVAKALG 658 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TR 232 L YPGGP++SK+AAQG FPR M FS SGLKT I + Sbjct: 659 LGYPGGPVISKLAAQGNPKAIHFPRAMMHSGDYSFSLSGLKTAVITYIEGENRAGRAINL 718 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 D+A +FE AV+D + K K A+++TG + GGV+AN LRA E K V Sbjct: 719 PDLAASFEQAVIDVQVAKAKTAVEETGVSDFCVGGGVAANPALRAAYKETFGKMGVRVTV 778 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 C DN AMIA + ++ + L + P L Sbjct: 779 PPMSVCGDNAAMIAVGALRSYRTQGFSPLTLDANPNAQLG 818 Score = 71.0 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 31/126 (24%), Positives = 49/126 (38%), Gaps = 8/126 (6%) Query: 3 VLGIETSCD--ETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHV------RK 54 VL ++TS D +A A + AD G V LAS DH+ + Sbjct: 11 VLAVDTSTDMLACTVARLTKRNADAAVAAAADGASAADGGFNVEVLASTDHLCRRQANVE 70 Query: 55 TVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 V +Q AL + LT D+DAV GPG + +G + ++ +P ++ Sbjct: 71 LVSSVQEALVAADLTMADVDAVIAGRGPGSFTGVRIGVATAKGISCGSGLPLYGASALDA 130 Query: 115 HLLAPM 120 + Sbjct: 131 MAFSAW 136 >UniRef50_D0WGH2 O-sialoglycoprotein endopeptidase n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WGH2_9ACTN Length = 807 Score = 385 bits (990), Expect = e-105, Method: Composition-based stats. Identities = 141/341 (41%), Positives = 188/341 (55%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 + E+SCDET +I + +L++ + SQV HA +GGVVPE+ASR H+ + Sbjct: 465 LICAFESSCDETASSIIAGDGTILSDVVASQVDFHARFGGVVPEIASRKHIEAICGVADE 524 Query: 62 ALKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L+ + L +D+DA+A T PGLVGAL+VG + + LA+ +VP + V+H+EG Sbjct: 525 CLERAAVALGRPSLRWRDLDAIAVTYAPGLVGALVVGVSFAKGLAWGSEVPLVAVNHLEG 584 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A + D P V LVSGGHT L+ V YE LG +IDDAAGEAFDK +K LG Sbjct: 585 HLYANKIADPAIAPPMVVSLVSGGHTMLVHVKDWANYETLGSTIDDAAGEAFDKVSKALG 644 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TR 232 L YPGGP++S+ AA+G FPR + L FS SGLKT I Sbjct: 645 LGYPGGPIISRYAAKGNPRAIDFPRALMHSGDLRFSLSGLKTAVITYIHKQQEAGMPLNI 704 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +F+ AVVD + K + AL +TG + + GGV+AN LRA +M K + Sbjct: 705 PDIAASFQQAVVDVQVAKARTALIETGSRTFCLGGGVAANPALRAAYEKMCAKNGFRLVM 764 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN AMIA + RF G AD + V+ PL E Sbjct: 765 PPLSACGDNAAMIAEVALDRFAQGKLADFTLDVKAHAPLDE 805 Score = 68.3 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 39/117 (33%), Gaps = 10/117 (8%) Query: 3 VLGIETSCDETGIAI--YDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRK---TVP 57 VL +T+ + + I D + + + G A H +P Sbjct: 4 VLAFDTANEAVVVGIGSVDADGAEAGRIVLKEAPARLVAG-----EARAAHRASNTVLIP 58 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 +I + + DI AV GPG + + + +A +VP V ++ Sbjct: 59 MIDELMAGENIEKDDIAAVVCGRGPGSFTGVRICMAAAKGIASGLEVPLFGVSTLDA 115 >UniRef50_C7MKR9 Ribosomal-protein-alanine acetyltransferase n=10 Tax=Bacteria RepID=C7MKR9_CRYCD Length = 860 Score = 385 bits (990), Expect = e-105, Method: Composition-based stats. Identities = 148/341 (43%), Positives = 193/341 (56%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D ++A+ + SQ+ HA +GGVVPE+ASR HV ++QA Sbjct: 518 LILAIESSCDETAAAIVDGHGRIIADVVASQIDFHARFGGVVPEIASRKHVEAICGVVQA 577 Query: 62 ALKES-------GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L E+ L+ +DAVA T PGLVGAL+VG ++ A+A +P I V+H+EG Sbjct: 578 CLDEAAEHLGTANLSWNSLDAVAVTYAPGLVGALVVGVAYAKAAAWAAGIPFIKVNHLEG 637 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A L + + P V LVSGGHT L+ V G Y ++G +IDDA GEAFDK AK LG Sbjct: 638 HLYANKLARSDIKPPLVVSLVSGGHTMLVHVRDWGDYCVMGSTIDDAVGEAFDKVAKALG 697 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTR 232 L YPGGP++S++A QG FPR M L FS SGLKT I + D Sbjct: 698 LGYPGGPVISRLAQQGNPAAIHFPRAMMHSGDLRFSLSGLKTAVVTYIHNQQQQKADLNV 757 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +F+ AV+D + K ALD+TG K + GGV+AN LRA +R + Sbjct: 758 PDIAASFQAAVIDVQVAKATAALDETGAKEFCLGGGVAANPALRAAYESACAQRGVRLTM 817 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 CTDN AMIA + R++AG T+ L L E Sbjct: 818 PPARACTDNAAMIALVALDRYQAGKTSGLDTDAAAHSNLEE 858 Score = 69.4 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 24/114 (21%), Positives = 41/114 (35%), Gaps = 14/114 (12%) Query: 3 VLGIETSCD--ETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 VL +T+ + G+ + DD + ++ H R + +P I Sbjct: 11 VLAFDTANEVIALGLGVLDDTTQTVRCVASKRIPAH------------RSSNTRLLPEID 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 A L + DI V GPG + + + +A A VP I V ++ Sbjct: 59 ALLTAEKMERADIATVCCGRGPGSFTGVRICIATAKGIAQALGVPLIGVSTLDA 112 >UniRef50_Q2GEG6 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Neorickettsia RepID=GCP_NEOSM Length = 329 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. Identities = 139/329 (42%), Positives = 203/329 (61%), Gaps = 7/329 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LG+ETSCDET +AI +E + +++++Q H+ Y GV PE ASR+H++ +++ Sbjct: 5 LILGVETSCDETSVAIVSEEGEVCFHEIFTQD--HSKYNGVYPEFASREHLKILPQILRR 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A++ L + + A+A T GPGLVG+L+VG + R LAF+ P V+H+EGHLLA L Sbjct: 63 AVQAHDL--EKLTAIACTVGPGLVGSLIVGVMMARGLAFSLKKPVFGVNHLEGHLLAVRL 120 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + FPFV L++SGGH+QLI GIG Y LLGE++DDA GEAFDK A +LG YPGG Sbjct: 121 VEK-INFPFVCLVISGGHSQLIDARGIGDYVLLGETLDDAFGEAFDKLATMLGFTYPGGK 179 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-DDQTRADIARAFE 240 + K+A +G + RF P M ++ G +FS SG+KT I ++ +ADI +F+ Sbjct: 180 TVEKLAIKGDSERFRLPAAMINQSGCNFSLSGIKTALKKIITSLPQITEKDKADICASFQ 239 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 V ++ K ++A+ G R+V+AGGV +NR +R L E K + + CTD Sbjct: 240 ACVARIMVNKLEQAVKICGHSRIVLAGGVGSNRYIRETLEEFAKNHNLSLHFPEGILCTD 299 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRW 329 N AMIA+A + R KAG T +L + +PR Sbjct: 300 NAAMIAWAAIERLKAGCT-ELSLEPQPRL 327 >UniRef50_C1A601 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=GCP_GEMAT Length = 357 Score = 384 bits (987), Expect = e-105, Method: Composition-based stats. Identities = 166/347 (47%), Positives = 211/347 (60%), Gaps = 16/347 (4%) Query: 1 MRVLGIETSCDETGIAIYDD--EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 MRVLGIETSCDET A+ E L + + +H +GGVVPE+ASR H+ VP Sbjct: 1 MRVLGIETSCDETSAAVVSGTPEAMTLESCVILSQDVHRLFGGVVPEIASRQHLIGIVPA 60 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + AAL+E+ ++ DIDAVA T PGLVGALLVG + +SLA ++D P +PVHH+EGHL A Sbjct: 61 VAAALQEAQVSLSDIDAVAVTHAPGLVGALLVGTSFAKSLALSYDKPLVPVHHLEGHLFA 120 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 +LE PF ALLVSGGHT L+ V G+Y LLG++ DDA GEAFDK AKLLGL YP Sbjct: 121 TLLEHPDAAPPFTALLVSGGHTLLLDVPAWGEYRLLGQTRDDAVGEAFDKVAKLLGLPYP 180 Query: 179 GGPLLSKMAAQGTA----GRFVFPRPMTDRPG-------LDFSFSGLKTFAANTIRD--- 224 GG + ++AA A F RPM + D SFSGLKT +RD Sbjct: 181 GGRPIEQLAATAEAPVHKHPHRFARPMLRKSSTPADEDYYDCSFSGLKTAVLYAVRDAER 240 Query: 225 NGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 GT D RA IAR F+DAV+DTL+ K RA Q R+V+ GGV+ N+ L+A + M+ Sbjct: 241 TGTLDDARASIARGFQDAVIDTLVEKVVRAARQHRRSRVVLGGGVACNQALQAAMRNAME 300 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +R+G VF P TDN AMIA AG+ R + G A ++ P+ Sbjct: 301 QRKGHVFAPSPRLATDNAAMIAAAGIFRLQRGEFAAPDMTATASLPI 347 >UniRef50_D1AVQ5 Metalloendopeptidase, glycoprotease family n=1 Tax=Streptobacillus moniliformis DSM 12112 RepID=D1AVQ5_STRM9 Length = 332 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 129/314 (41%), Positives = 193/314 (61%), Gaps = 6/314 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IE+SCDET +AI D K +L+N + +Q+ +H +YGGVVPE+ASR H+ + + Sbjct: 1 MLILAIESSCDETSVAILKDGKNVLSNVIATQIDIHKEYGGVVPEIASRHHIENILTVYD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ DI +A T PGL+G+LLVG + L+ + ++P IPV+H+EGH+ + Sbjct: 61 KALKEANCKISDISYIAVTNTPGLIGSLLVGLMFAKGLSLSNNIPLIPVNHIEGHIFSTF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 ++ P + P + L+ SGGHT L + LLGE++DDA GEA+DK A++LGL+YPGG Sbjct: 121 IDYEP-KLPMLTLVASGGHTSLYLIDENKDLTLLGETLDDAIGEAYDKVARILGLEYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARA 238 PLL KMA G F P P G DFSFSG+KTF N + + +D + D+A+ Sbjct: 180 PLLEKMAIMG-HNSFDIPTPKVS--GYDFSFSGIKTFITNYVNRKKMKGEDFNKEDLAKT 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+D +++ L+ K +A + K + + GGVSAN+ +R + ++ + + E+C Sbjct: 237 FQDKIIEVLIDKLSKASRKNNIKTISVVGGVSANKAIREAIINSEYFENVDILFPKFEYC 296 Query: 299 TDNGAMIAYAGMVR 312 TDN AMIA A + Sbjct: 297 TDNAAMIASACYHK 310 >UniRef50_B2UQZ0 Metalloendopeptidase, glycoprotease family n=3 Tax=Verrucomicrobia RepID=B2UQZ0_AKKM8 Length = 360 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 144/345 (41%), Positives = 201/345 (58%), Gaps = 12/345 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKG-----LLANQLYSQVKLHADYGGVVPELASRDHVRKT 55 + VLGIE+SCDET +AI +L++ + SQ+ +H +GGVVPELASR+H Sbjct: 5 LTVLGIESSCDETAVAILRSAGEEKAPEILSSVISSQIAIHRQHGGVVPELASRNHSADL 64 Query: 56 VPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 +I+ A +E+G DID T GPGLV ALLVG + ++LA A P + V+H+EGH Sbjct: 65 PGIIRTACREAGTAPADIDVFGATGGPGLVAALLVGNSTAKALALAAGRPFVSVNHLEGH 124 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 LL+P L+ P + ++VSGGHT + V G+G Y LLG S+DDAAGEAFDK K+LGL Sbjct: 125 LLSPFLKRPGGPVPHLGMVVSGGHTLFVDVRGVGNYRLLGRSLDDAAGEAFDKVGKMLGL 184 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-------NGTD 228 YPGGP + ++AA+G F FPR + + SFSGLKT T+ +G Sbjct: 185 PYPGGPEIDRLAAEGDPEAFSFPRALMKEHTANVSFSGLKTAVLYTLPKITKNGDPHGLP 244 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 QT D+ +F+ AV D L+ K +AL +G + L ++GGVS NR LR++L + + Sbjct: 245 RQTLRDLCASFQRAVTDVLIHKALKALRASGHRTLSISGGVSCNRELRSRLKTACDREKV 304 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 ++ + TDN AMIAY ++ + G L V P L E Sbjct: 305 KLVLPDFDLTTDNAAMIAYVTCLKARRGLFHSLDEDVDPNLKLTE 349 >UniRef50_Q058D1 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Buchnera aphidicola str. Cc (Cinara cedri) RepID=GCP_BUCCC Length = 343 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 150/340 (44%), Positives = 228/340 (67%), Gaps = 9/340 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCD+T +AIYD + GL+ +Q +Q +H+ Y G+VPELA+R H+ + LI+ Sbjct: 1 MKILGIETSCDDTSVAIYDKKLGLIDHQTLNQNSVHSKYHGIVPELAARSHLNQLNFLIK 60 Query: 61 AALKE------SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + S K AVAYT GPGL G+++V + RS+A + D+P I ++H+EG Sbjct: 61 NIFSKYFLYNSSNFKKKFFKAVAYTVGPGLSGSIVVHSC--RSIALSLDIPYILINHLEG 118 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HLL+ ML FPF+ALLVSG +TQLI +G+Y +LG+++DDA G FD AK+LG Sbjct: 119 HLLSVMLSYKKNLFPFLALLVSGANTQLIYAKYLGKYIILGQTLDDAVGNVFDYIAKILG 178 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 L +PGG LS +A G +G++ FPRPMT L+FSFSGLKT N I ++ Q +++ Sbjct: 179 LGFPGGKNLSDLAKYGISGKYFFPRPMTKYSNLNFSFSGLKTHVKNVILNSSDSFQEKSN 238 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 IA++FE+A+VDTL+IKCK A+ + K ++ GGVS+NR LR KL +++ K + ++++++ Sbjct: 239 IAKSFEEAIVDTLIIKCKLAIKKIKVKNFLVCGGVSSNRLLRIKLKKLIYKNQRKLYFSK 298 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATA-DLGVSVRPRWPLAE 333 +FCTDN MIAY G ++++ G + + S+ P +++ Sbjct: 299 KKFCTDNAGMIAYLGFLKYQQGMYSYNKSFSIYPNLLISD 338 >UniRef50_Q2JXG9 Probable O-sialoglycoprotein endopeptidase n=31 Tax=Bacteria RepID=GCP_SYNJA Length = 366 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 148/345 (42%), Positives = 203/345 (58%), Gaps = 10/345 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGL-------LANQLYSQVKLHADYGGVVPELASRDHVR 53 +R+L IETSCDET +A+ + + L++ + SQ+ LHA YGGVVPE+A+R HV Sbjct: 2 LRLLAIETSCDETAVAVVEADAAWPTFAPRQLSSVVASQIDLHAAYGGVVPEVAARRHVE 61 Query: 54 KTVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 ++++AL+++GL ++DAVA T PGLVG+LLVG ++LA ++ P I VHH+E Sbjct: 62 TLPFVLESALQQAGLGMAEVDAVAVTCAPGLVGSLLVGLMAAKTLALLYNKPLIGVHHLE 121 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 GHL + L P + LLVSGGHT LI + G+Y+ +G + DDAAGEAFDK A+LL Sbjct: 122 GHLFSGFLAAADLRPPCLGLLVSGGHTSLIWMKDYGEYQTMGRTRDDAAGEAFDKVARLL 181 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQT 231 GL YPGGP + + A QG RF P D P D SFSGLKT + + Sbjct: 182 GLGYPGGPQIDRWAQQGDPDRFPLPEGKLDHP-YDTSFSGLKTAVLRLVQQLQQEGQELP 240 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 ADIA +F+ + L K + G L++ GGV+ANR LRA+L E +++ V Sbjct: 241 VADIAASFQACLTRVLTEKAVACAEALGLSTLLVTGGVAANRELRARLLEAGRQKGLRVV 300 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 P CTDN AMI AG+ + G T+ L + V R L E+PA Sbjct: 301 IPPPNLCTDNAAMIGAAGLCHWLRGETSPLELGVASRLTLEEIPA 345 >UniRef50_B3DVR7 Metal-dependent protease with possible chaperone activity n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVR7_METI4 Length = 353 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 130/342 (38%), Positives = 195/342 (57%), Gaps = 8/342 (2%) Query: 1 MRVLGIETSCDETGIAIYD---DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 M LGIE+SCDET IA+ + ++A++ +Q LH +GG+VPE A R+H + Sbjct: 3 MLWLGIESSCDETAIALVKTIAGKNVVMADRCITQAPLHKPFGGIVPEYAVREHSKNLPL 62 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L+Q+ ++ L K++ A+A T GPGL+ +LLVG R LA +P V+H+EGHL Sbjct: 63 LLQSMIRSKSLNLKEVQAIAVTEGPGLMASLLVGNAFARGLALGLGIPVFGVNHLEGHLF 122 Query: 118 APMLE-DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 +P + + +FPF+ L+VSGGHT L V G QY ++G +IDDAAGEAFDK A+LLGL Sbjct: 123 SPFIGREEKLKFPFLGLVVSGGHTLLARVEGPRQYSMIGSTIDDAAGEAFDKVARLLGLS 182 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG----TDDQTR 232 YPGGP + K A +G FP + ++ +FSFSGLKT + N + + Sbjct: 183 YPGGPEIEKQAERGNPHSHNFPISLIEKNNYNFSFSGLKTAVKYFLEKNKESLSKNKEFL 242 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 AD+ +F+++V + K A + +GGV AN+ +R L + + EV + Sbjct: 243 ADVCASFQESVARVIQEKTIAAAKSFSLSLIAASGGVLANKRIRELLEKKALEEGIEVLF 302 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 A+ +FCTDN MIA+AG + + G + P + L++ Sbjct: 303 AKRQFCTDNAVMIAFAGALFYALGLPITKSFELNPNFSLSDF 344 >UniRef50_B9JCG8 Probable O-sialoglycoprotein endopeptidase n=86 Tax=Alphaproteobacteria RepID=GCP_AGRRK Length = 365 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 159/347 (45%), Positives = 201/347 (57%), Gaps = 15/347 (4%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 +R+LGIETSCDET AI D + ++ + SQ+ H+ YGGVVPE+A+R HV Sbjct: 5 LRILGIETSCDETAAAIVERQDDGTAIVRSDVVLSQLDEHSAYGGVVPEIAARAHVEALD 64 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 LI ALK + ++ D+DA+A T+GPGL+G LLVG G++++ A P ++H+EGH Sbjct: 65 TLIDEALKRANVSLADVDAIAATSGPGLIGGLLVGLMTGKAISKATGKPLYAINHLEGHA 124 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L L D FP++ LLVSGGHTQLI V G+GQYE G +IDDA GEAFDKTAKLLGL Sbjct: 125 LTARLTDG-LAFPYLMLLVSGGHTQLILVRGVGQYERWGTTIDDALGEAFDKTAKLLGLP 183 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADI 235 YPGGP + A +G RF PRP+ LDFSFSGLKT +Q ADI Sbjct: 184 YPGGPAVEAAAKKGNPDRFDLPRPLVGETRLDFSFSGLKTAVRLAATTIAPVSEQDIADI 243 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGF--------KRLVMAGGVSANRTLRAKLAEMMKKRR 287 +F+ AV TL + R L + LV+AGGV+AN LR L E+ Sbjct: 244 CASFQKAVSRTLKDRIGRGLQRFKSEFPKTAEKPALVVAGGVAANLELRRTLQELCDLNG 303 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAE 333 CTDN MIA+AG+ R GA D L V R RWPL + Sbjct: 304 FRFIAPPLSLCTDNAVMIAWAGLERMATGAAPDGLDVQPRSRWPLDQ 350 >UniRef50_A6DFV1 Metalloendopeptidase, putative, glycoprotease family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFV1_9BACT Length = 355 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 141/353 (39%), Positives = 206/353 (58%), Gaps = 17/353 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LG+E+SCDET +++ + +LAN + SQ+K HA+YGGV+PELA+R+H+ P + Sbjct: 1 MIILGVESSCDETAVSLVRNGHEVLANAISSQIKDHANYGGVIPELAAREHLNNVRPTLN 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ L DID +A TA PGL+ ALLVGA LA + ++H+ H+ + Sbjct: 61 EALEKAALKLDDIDGIAVTAQPGLLPALLVGAGFANGLALSLGKKVCGINHLAAHIYGGL 120 Query: 121 LE-----DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 +E NP FP ALL+SGG+TQL + G EL+G +IDDAAGEAFDK AK+LGL Sbjct: 121 IERQDILSNPNAFPLCALLISGGNTQLFIIKKTGDCELVGSTIDDAAGEAFDKAAKILGL 180 Query: 176 DYPGGPLLSKMAAQGTAGRFVFP-------RPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 YPGGP++ ++A G ++ FP R ++ L+FSFSG+KT N ++ N D Sbjct: 181 PYPGGPIIDRLAKSGDKNKYKFPRSFLPKTRSYSEEHKLNFSFSGVKTSLLNLVKKNWKD 240 Query: 229 ----DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 D D+ +++DA+VD L K K A + G + L++ GGV+ N +R ++ +M Sbjct: 241 GMVPDGDLPDLLASYQDAIVDVLSTKLKMAAESYGARTLLLCGGVACNSAIRERVQKMAI 300 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE-LPA 336 + E+ P++CTDN AMIA G K V R P+ E LP Sbjct: 301 QTAKELVLTPPKYCTDNAAMIAGLGYHYLKDPNFTGDFVEASGRAPIIEKLPV 353 >UniRef50_B9XP92 Metalloendopeptidase, glycoprotease family n=1 Tax=bacterium Ellin514 RepID=B9XP92_9BACT Length = 341 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 142/342 (41%), Positives = 203/342 (59%), Gaps = 11/342 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L +ETSCDET +AI + K +L+ + SQ+KLHA+YGGVVPELA+R+H+ +P+ Sbjct: 1 MILLAVETSCDETSVAIIRNGK-VLSTIVSSQIKLHAEYGGVVPELAAREHLANLIPVAN 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AA+ + + + +DA+A T GPGL GAL+VG + +AFA + P ++H E HL +P Sbjct: 60 AAMTAAEVQSDQVDAIAATQGPGLPGALVVGLKAAQGMAFALNKPFFGINHHEAHLYSPW 119 Query: 121 LEDNPPE------FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 + +PP P ++L+VSGGHT LI V ++ +LG +IDDAAGE FDK AKL+G Sbjct: 120 ITGSPPVADFDSFQPNISLIVSGGHTMLIHVESELKHHVLGSTIDDAAGECFDKVAKLIG 179 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT---DDQT 231 L YPGGP + ++A+ G + FPRPM DFSFSGLKT IRDN Q Sbjct: 180 LPYPGGPEIDRLASAGNPKAYDFPRPMLRDASDDFSFSGLKTSVRYFIRDNPAVLDSLQK 239 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 D+ + ++A+V+ L+ K RA ++ K + +GGV+ NR LR+ L K++ + Sbjct: 240 LQDLCASVQEAIVEVLVTKTVRAANRLQVKCVTASGGVTCNRALRSALETACKRKHLTLR 299 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPRWPLA 332 A CTDN AMI + +T L + P W LA Sbjct: 300 LAEKSLCTDNAAMIGVLAERKLLHSSTPTSLDSEIMPGWALA 341 >UniRef50_Q6MD07 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Parachlamydiaceae RepID=GCP_PARUW Length = 343 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 136/343 (39%), Positives = 193/343 (56%), Gaps = 11/343 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIE++CDET AI D K +L+N + SQ+ LH +YGGVVPELA R H+ +P+I Sbjct: 1 MLVLGIESTCDETACAIVRDGKDILSNIVASQIDLHKEYGGVVPELACRRHIDLIIPVID 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ LT + ID +A GPGL+GALL+G ++LA A P I ++H+E HL A + Sbjct: 61 QALNQAKLTLEQIDLIAVANGPGLIGALLIGLNTAKALALALRKPFIGINHVEAHLYAAI 120 Query: 121 LEDN-PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + +FP + +++SGGHT L+ + IGQYEL+G+++DDA GEAFDK AK+L L YPG Sbjct: 121 MSHPQDFQFPCLGVVLSGGHTALVLIKQIGQYELIGQTVDDAVGEAFDKVAKMLNLPYPG 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-------DDQTR 232 GP + +A G + +F F LDFSFSGLKT I+D + Sbjct: 181 GPEIENLARHGRSVKFNFKAGQVKGRPLDFSFSGLKTAVLYAIKDPKALKEMVLLSSEMT 240 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +F++A ++ K A Q G L+ GGV+ N LR + + + Sbjct: 241 QDIAASFQEAACSDIVKKSLLAAKQYGVNTLLFGGGVTNNCYLRKLFS--VANSNLNYIW 298 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAEL 334 DN AMIA G R++ +D + + R PL + Sbjct: 299 PSAGLSLDNAAMIAGLGYYRYQLQNKSDSMDLEPLTRTPLQSV 341 >UniRef50_C0Q8X7 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Desulfobacteraceae RepID=GCP_DESAH Length = 333 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 146/329 (44%), Positives = 201/329 (61%), Gaps = 1/329 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIE+SCD+T A+ D +L++ + SQV +H YGGVVPELASR H+ P++ Sbjct: 1 MIILGIESSCDDTAAAVVSDHNTVLSSVVSSQVDVHHRYGGVVPELASRMHIEAISPVVA 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ ++G++ I+ VA T GPGL+GALLVG + ++ A+A ++P V+H+EGH+ + + Sbjct: 61 QAVDQAGISPDQIEGVAVTRGPGLIGALLVGFSFAKAFAWAKNIPWAGVNHLEGHIYSLL 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L D+PP FPF ALL SGGHT + V ++ELLG++ DDAAGEAFDK AK+LGL YPGG Sbjct: 121 LSDDPPAFPFTALLASGGHTSIFHVVSQDRFELLGQTRDDAAGEAFDKVAKMLGLGYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-DQTRADIARAF 239 ++ +AA+G FPR D+ G DFSFSGLK+ A ++ N + + IA F Sbjct: 181 AVVEALAAKGDPCLIPFPRSFLDKDGFDFSFSGLKSAVARYVQLNRENLGEMMPHIAAGF 240 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + AV D L K A TG R+ +AGGVSANR L +++ K ++ P FC Sbjct: 241 QSAVTDVLAFKLIHAARATGCSRIAIAGGVSANRFLASRMKIEAAKHNMALYLPPPSFCG 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPR 328 DN AMIA G G L V R Sbjct: 301 DNAAMIAARGHRLISQGDLCQLDSDVFSR 329 >UniRef50_B3R0M3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Candidatus Phytoplasma mali RepID=GCP_PHYMT Length = 329 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 126/329 (38%), Positives = 200/329 (60%), Gaps = 8/329 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCDET ++ D K +++N ++SQ+K H+ GGV+PELASR+H++ +++ Sbjct: 1 MIILSIETSCDETSASVTQDGKKVISNIVFSQIKEHSLNGGVIPELASREHLKNITLVLE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +LKE+ + ++ID VA+T GPGL+G+LLVG + + P + V+H+ GH+ A Sbjct: 61 KSLKEANIQPQEIDLVAFTQGPGLIGSLLVGINCALVFGYIYKKPVLGVNHLLGHIYAAQ 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E N EFP + L++SGGHT+L+++ Q + LG + DDA GEA+DK +++LG YPGG Sbjct: 121 IE-NEIEFPSLVLIISGGHTELLALENYLQIKKLGFTCDDAVGEAYDKVSRILGFGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ ++A +G F F RP +FSFSGLK+ N + N + Q + +I +F+ Sbjct: 180 PIIDELAQKGK-DIFNFVRPYLKNDNFNFSFSGLKSSIFNLVSKNNFNLQEKINICSSFQ 238 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+D L+ K KR L + FK+L++ GGV+AN +LR + + +V ++C D Sbjct: 239 SSVIDVLVEKTKRVLKKYSFKQLIITGGVAANYSLRKRFLSEFSQ--LKVIIPSLKYCGD 296 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRW 329 AMI A +FK L + W Sbjct: 297 QAAMIGIAAYYQFK----YQLKFNQNYHW 321 >UniRef50_B7CBT6 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CBT6_9FIRM Length = 333 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 133/335 (39%), Positives = 206/335 (61%), Gaps = 7/335 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M ++GIE+SCDET +A+ D+K +L++ + SQ+ +H ++GGVVPE+ASR HV I+ Sbjct: 1 MIIIGIESSCDETAVAVVKDKKEVLSSVVASQIDVHTEFGGVVPEVASRIHVENISYCIE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ +T +D+DAVA T GPGL+G L VG ++LAFA+ P +PVHH+ GH+ A Sbjct: 61 KALKDANITMEDVDAVAVTQGPGLIGCLHVGVQAAKTLAFAYHKPLVPVHHLAGHIYANE 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + ++P +AL+VSGG+T+L+ + +E+LGE+ DDA GEAFDK A++LGL YPGG Sbjct: 121 LVVD-MKYPVLALVVSGGNTELVYMKDETSFEILGETQDDAIGEAFDKVARVLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT--RADIARA 238 P + K+A +G + +P T + DFSFSGLK+ + +T AD+A + Sbjct: 180 PKIDKLAKEGKP-VYELAKPKT-QGRYDFSFSGLKSSVLQFTKRMERQGKTFDMADLACS 237 Query: 239 FEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 F++ +D + + + LD + V+ GGVSAN LR K+ E+ + F P + Sbjct: 238 FQECALDEIFSRVRAVLDDHKDIRHFVVGGGVSANSRLREKVEELRNEYPEVEFTVPPMY 297 Query: 298 -CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 CTDN +MI AG + + +G + ++ + Sbjct: 298 CCTDNASMIGVAGTIAYLSGRRGNASLTADSSLEI 332 >UniRef50_Q5FLZ3 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Lactobacillus RepID=GCP_LACAC Length = 349 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 135/342 (39%), Positives = 204/342 (59%), Gaps = 11/342 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L E+SCDET A+ + + + + + +Q+K H +GGVVPE+ASR H+ + + Sbjct: 8 RILAYESSCDETSTAVIKNGREIESLIVATQIKSHQRFGGVVPEVASRHHIEVVSQITKE 67 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL E+ + KDIDA+A T GPGLVGALL+G + ++++ A +P I V H+ GH++A L Sbjct: 68 ALNEANCSWKDIDAIAVTYGPGLVGALLIGVSAAKAVSMATGIPLIGVDHIMGHIMAAQL 127 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 +D E+P +AL VSGGHT+++ + +E++G++ DDAAGEA+DK ++LG++YP G Sbjct: 128 KDE-IEYPAIALQVSGGHTEIVLLKDPTHFEIIGDTRDDAAGEAYDKIGRVLGVNYPAGK 186 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAF 239 + A QG F FPR M + DFSFSGLK+ NT D + + D+A +F Sbjct: 187 TIDAWAHQGK-DTFNFPRAMLEDDDYDFSFSGLKSAFINTCHHADQIHEKLNKYDLAASF 245 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR----RGEVFYARP 295 + AV+D L K RA+ + K +M GGV+AN+ LR +++E + K + +V Sbjct: 246 QAAVIDVLAHKTIRAIKEYKPKTFIMGGGVAANQGLRDRMSEEIAKLPKADQPKVILPDL 305 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 + C DN AMI A + G ADL ++ P ELP A Sbjct: 306 KLCGDNAAMIGAAAYNLYNGGQFADLTLNADPSL---ELPYA 344 >UniRef50_B1V8Z6 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Candidatus Phytoplasma RepID=GCP_PHYAS Length = 328 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 129/332 (38%), Positives = 189/332 (56%), Gaps = 9/332 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCDET +AI D K +L+N ++SQ+K H +GGVVPE+ASR HV +++ Sbjct: 1 MNILSIETSCDETSVAITQDGKKVLSNIVFSQIKDHQMFGGVVPEIASRKHVELITLILE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +++ LT ++ID VA T GPGLVG+LLVG A+ + P + ++H+ GHL A Sbjct: 61 KAFQKACLTPQEIDLVAVTQGPGLVGSLLVGINAANVFAYTYQKPLLGINHLLGHLYAAQ 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E + LLVSGGHT+L+ Q E+LG ++DDA GE +DK AK L L YPGG Sbjct: 121 IEHQIKPNALI-LLVSGGHTELLHFKNHDQIEVLGTTLDDALGEVYDKIAKALHLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 PL+ ++A G + RP +FSFSGLK+ N + D +I +F+ Sbjct: 180 PLIDQLAQTGK-DTYHLVRPYLKNNNFNFSFSGLKSHLVNLLLKQNIQDLNIPNICASFQ 238 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+D L+ K KR L + ++L++ GGV++N LR K+ E EV + ++CTD Sbjct: 239 ASVIDVLLTKTKRVLKKLPIQQLIVTGGVASNSALRKKMKETF--LDLEVIFPSVQYCTD 296 Query: 301 NGAMIAYAGMVRFKAGATAD---LGVSVRPRW 329 AMI A ++ T ++ P Sbjct: 297 QAAMIGIAAF--YQKNITPPSYKYDLTALPNL 326 >UniRef50_C7H0S4 Putative glycoprotease GCP n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7H0S4_9FIRM Length = 371 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 131/330 (39%), Positives = 199/330 (60%), Gaps = 3/330 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VL IETSCDET +I + + +L+N +++Q+ +H +YGGVVPE+ASR+H+ K ++ Sbjct: 22 NVLAIETSCDETACSIVRNGREVLSNAIFTQMHIHREYGGVVPEIASRNHLEKINDVVDK 81 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ ++GL +DID +A T+ PGL+GAL+VG +++A+A P + VHH+ GH+ A L Sbjct: 82 AILDAGLHKEDIDVIAVTSTPGLIGALVVGVATAKTMAYALSKPLVGVHHIAGHIAANYL 141 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + E PF++L++SGGHT +I V ++E++G+++DDAAGEAFDK LLGL YP G Sbjct: 142 DHGELEPPFISLVISGGHTSVIDVKDYNEHEVIGQTLDDAAGEAFDKVGILLGLTYPAGK 201 Query: 182 LLSKMAA---QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 + ++A + F R ++ FSFSG+KT N IR N D + IA Sbjct: 202 DMDELARSAIKNNVSPVYFKRTYLEKGSPHFSFSGIKTRVMNYIRANKDDPIDKEAIALG 261 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F +AV D L+ K + K++V+AGGV+AN +R K E + + EV+ C Sbjct: 262 FHEAVTDVLVKKTMDMAKRRNRKKIVLAGGVAANSLIRNKFKEEGEAQGFEVYLPGLGMC 321 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 TDN AMIA AG ++ +G +D + Sbjct: 322 TDNAAMIASAGYYKYISGGISDYYLDAVSN 351 >UniRef50_Q47LN7 Probable O-sialoglycoprotein endopeptidase n=58 Tax=Bacteria RepID=GCP_THEFY Length = 347 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 148/337 (43%), Positives = 197/337 (58%), Gaps = 5/337 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++GIE+SCDETG+A LLA+++ S V HA +GGVVPE+ASR H+ P ++ Sbjct: 9 LIMGIESSCDETGVAFVRGC-ELLADEVASSVDEHARFGGVVPEVASRAHLEAMTPTVRR 67 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A + +G+ D+DA+A T GPGL GALLVG + ++ A A D P V+H+ GH+ L Sbjct: 68 AAERAGVRLSDVDAIAVTVGPGLAGALLVGLSAAKAYALALDKPLYGVNHLVGHVAVDQL 127 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIG-QYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 E P P VALLVSGGHT L+ V + +LLGE++DDAAGEA+DK A+LL L YPGG Sbjct: 128 EHGPLPKPVVALLVSGGHTSLLLVRDLATDVQLLGETVDDAAGEAYDKVARLLNLPYPGG 187 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARA 238 P + + A G FPR DFSFSGLKT A + D + + D+A A Sbjct: 188 PPIDRAARDGDGTAIHFPRGKWGDGTYDFSFSGLKTAVARWVEDAERQGRPVSVPDVAAA 247 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++AV D L K A + G + LV++GGV+AN LRA E EV RP C Sbjct: 248 FQEAVADVLTRKAVDACREHGVRHLVISGGVAANSRLRALAEERCAAAGIEVRVPRPRLC 307 Query: 299 TDNGAMIAYAGMVRFKAGA-TADLGVSVRPRWPLAEL 334 TDNGAMIA G AG + L ++V P++ + Sbjct: 308 TDNGAMIAALGAEVVAAGLPPSPLDMAVDTSLPVSSV 344 >UniRef50_Q127W3 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Proteobacteria RepID=GCP_POLSJ Length = 347 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 179/339 (52%), Positives = 228/339 (67%), Gaps = 8/339 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEK----GLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIE+SCDETG+A+ D LL++ L+SQ+++H YGGVVPELASRDH+R+ + Sbjct: 1 MLVLGIESSCDETGVALVDAGGSEVPRLLSHALFSQIQMHQAYGGVVPELASRDHIRRVL 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 PL + + ++G + +D VAYT GPGL GALLVGA V +LA A P + VHH+EGHL Sbjct: 61 PLTRQVMAQAGRSLAQVDVVAYTRGPGLAGALLVGAGVACALAAALGKPVMGVHHLEGHL 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L+P L +PP FPFVALLVSGGHTQL+ V +G YELLGE+IDDAAGEAFDK+AKL+GL Sbjct: 121 LSPFLSADPPVFPFVALLVSGGHTQLMRVDRVGSYELLGETIDDAAGEAFDKSAKLMGLP 180 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-DQTRADI 235 YPGGP L+ +A QG F PRP+ LDFSF+GLKT + G + + +AD+ Sbjct: 181 YPGGPHLADLARQGDGTAFKLPRPLLHSGDLDFSFAGLKTAVLTQAKKLGPELENRKADL 240 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A A + A+VD L+ K A+ QTG KRLV+AGGV AN LR++L ++R V Y Sbjct: 241 AAATQAAIVDVLVKKSLAAMAQTGLKRLVVAGGVGANALLRSQLNAACQQRGIRVHYPEL 300 Query: 296 EFCTDNGAMIAYAGMVRFKAGATA---DLGVSVRPRWPL 331 CTDNGAMIA A +R +AG V+PRW L Sbjct: 301 HLCTDNGAMIALAAGMRLQAGLETLQRGYTFDVKPRWSL 339 >UniRef50_B0VHD4 Putative metalloendopeptidase, , glycoprotease family n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VHD4_9BACT Length = 338 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 125/325 (38%), Positives = 194/325 (59%), Gaps = 3/325 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L E+SCD+T +AI D + ++ N + SQ H ++GG++PELASR H++ V L +AA Sbjct: 6 ILAFESSCDDTSVAIVDTDYNVIVNLISSQ-PEHLEFGGILPELASRLHLKNIVTLTKAA 64 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L S L +DI A+A + PGL+G+L+VG + LA++ +P I V+H+ H+ A +E Sbjct: 65 LNASKLNLQDISAIAVSINPGLIGSLIVGLAFAKGLAWSLSLPLITVNHILSHIFANFIE 124 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 E PF+AL+VSGGHT+L+ + + ++G+++DDAAGE+FDK AKLLGL +PGGP Sbjct: 125 HKAVEPPFLALVVSGGHTELVHFDTLTTFTVVGKTLDDAAGESFDKAAKLLGLGFPGGPA 184 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARAFE 240 + ++A +G FPR + + +FS+SGLKT + ++ T DIA + + Sbjct: 185 IDELAQKGNPNFIKFPRALPQKNNFNFSYSGLKTAIRTWLVNQNPETLQAELPDIAASVQ 244 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 A++D L+ K Q +++AGGV+AN LR +L K +VFY C D Sbjct: 245 QAIIDPLVHKTVLWARQHKIPYILLAGGVAANSALRQQLTTTSAKYGIKVFYPSNALCMD 304 Query: 301 NGAMIAYAGMVRFKAGATADLGVSV 325 N AM+ A + +F A L ++V Sbjct: 305 NAAMVGAAAIPKFLTKNYAPLSINV 329 >UniRef50_Q045T6 Probable O-sialoglycoprotein endopeptidase n=433 Tax=cellular organisms RepID=GCP_LACGA Length = 348 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 134/343 (39%), Positives = 204/343 (59%), Gaps = 11/343 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R+L E+SCDET A+ + + + + + +Q+K H +GGVVPE+ASR H+ + + Sbjct: 6 IRILAFESSCDETSTAVIKNGREIESLIVATQIKSHQRFGGVVPEVASRHHIEVITQITK 65 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL E+ T DIDA+A T GPGLVGALL+G + ++ + A +P I V H+ GH++A Sbjct: 66 EALAEANATWDDIDAIAVTYGPGLVGALLIGVSAAKAASMATGIPLIGVDHIMGHIMAAQ 125 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+D E+P +AL VSGGHT+++ + +E++G++ DDAAGEA+DK ++LG++YP G Sbjct: 126 LKDE-IEYPALALQVSGGHTEIVLMKDPIHFEIVGDTRDDAAGEAYDKIGRVLGVNYPAG 184 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARA 238 + + A +G F FPR M + DFS SGLK+ NT D + + D+A + Sbjct: 185 KTIDEWAHKGK-DTFHFPRAMMEDDDYDFSLSGLKSAFINTCHHADQIHEKLDKYDLAAS 243 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR----RGEVFYAR 294 F+ +VVD L K RA+ + K ++ GGV+AN LR +LAE ++K + +V Sbjct: 244 FQASVVDVLSHKTIRAIKEYKPKTFILGGGVAANHGLRDRLAEEIEKLPADIKPKVILPD 303 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 + C DN AMI A +KAG +D ++ P ELP A Sbjct: 304 LKLCGDNAAMIGAAAYNLYKAGKFSDENLNADPSL---ELPYA 343 >UniRef50_B6JAE9 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Alphaproteobacteria RepID=GCP_OLICO Length = 357 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 144/342 (42%), Positives = 200/342 (58%), Gaps = 11/342 (3%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIET+CDET A+ D +L+N + SQ+ HA +GGVVPE+A+R HV Sbjct: 1 MLVLGIETTCDETAAAVIERQADGSGRILSNIVRSQIAEHAPFGGVVPEIAARAHVEMLD 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ A++E+G+ +D +A AGPGL+G ++VG T +++A D P I V+H+E H Sbjct: 61 VLVDRAMREAGVDFAQLDGIAAAAGPGLIGGVIVGLTTAKAIALVHDTPLIAVNHLEAHA 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L P L P FP+ L SGGHTQ+++V G+G+Y +G ++DDA GEAFDK AK+L L Sbjct: 121 LTPRL-TVPLAFPYCLFLASGGHTQIVAVLGVGEYVRIGTTVDDALGEAFDKVAKMLDLP 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-QTRADI 235 YPGGP + + A +G RF FPRPM R +FS SGLKT N + Q AD+ Sbjct: 180 YPGGPQVERAAREGDPTRFDFPRPMLGRKDANFSLSGLKTAVRNEASRLMPLELQDIADL 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 +F+ AV+D++ + + L + LV AGGV+AN +R L E+ + Sbjct: 240 CASFQAAVLDSIADRIRSGLRLFREQFGTPRALVAAGGVAANVAIRNALQEIAADDEITM 299 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 P+ CTDNGAMIA+AG R G T + + R RW L Sbjct: 300 IVPPPQLCTDNGAMIAWAGAERLALGLTDTMEAAPRARWKLD 341 >UniRef50_A9FDL0 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Deltaproteobacteria RepID=GCP_SORC5 Length = 356 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 163/345 (47%), Positives = 212/345 (61%), Gaps = 14/345 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGIETSCDET A+ + +L++ + SQV LHA YGGVVPE+A+RDH R VP+++ Sbjct: 1 MRVLGIETSCDETAAAVVTEGGDVLSDVVRSQVALHAPYGGVVPEVAARDHARAVVPVVR 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL +G++A D+D +A T+ PGL GALLVG + LA+A P + V H+ GHLLA Sbjct: 61 EALSRAGVSAADLDGIAVTSRPGLAGALLVGLQAAKGLAWAAGKPLVGVDHLVGHLLAVF 120 Query: 121 L---------EDNPPEFPFVALLVSGGHTQLISVTG--IGQYELLGESIDDAAGEAFDKT 169 L E P FP+VALL SGGHT + V G +G LG + DDAAGEAFDK Sbjct: 121 LRRGGAPLSDERERPSFPYVALLASGGHTAIYRVDGPALGAIRELGATRDDAAGEAFDKV 180 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRP--MTDRPGLDFSFSGLKTFAANTIRDNGT 227 AKLLGL YPGGP++ ++AA G A P M + L+FSFSG+K+ A + G Sbjct: 181 AKLLGLGYPGGPVVDRLAAGGDAAAAADAVPALMARKESLEFSFSGIKSSVARHVAKRGR 240 Query: 228 DD-QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 + Q D+ AF+ AVVD L+ K RA G R+V+ GGV+AN+ LRAK+A ++R Sbjct: 241 PEGQALRDLCAAFQGAVVDALVQKTVRAARAEGIGRVVLGGGVAANQGLRAKMAAACERR 300 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +F CTDNGAMIAYAG +R AG L ++ R L Sbjct: 301 GLALFVPPLASCTDNGAMIAYAGALRLAAGERDTLDLAPETRTAL 345 >UniRef50_B5RQA5 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Borrelia RepID=GCP_BORRA Length = 338 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 125/332 (37%), Positives = 190/332 (57%), Gaps = 10/332 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIE+SCD+ AI ++ +L+N SQ K H Y G+VPE+ASR H + + Q Sbjct: 1 MKVLGIESSCDDCCAAIVENGNTILSNIKLSQ-KEHKKYYGIVPEIASRLHTEFIMYVCQ 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ + + +ID +A T+ PGL+G+L+VG + L+ A P I + H+ GHL AP+ Sbjct: 60 QAIISAKINISEIDLIAVTSQPGLIGSLIVGVNFAKGLSIALKKPLICIDHILGHLYAPL 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L ++ E+PF++L++SGGHT L E+LG ++DDA GEAFDK AK + +PGG Sbjct: 120 L-NHTIEYPFLSLVLSGGHTILAKQNNFDDIEILGRTLDDACGEAFDKIAKHYKMGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDR--PGLDFSFSGLKTFAANTIRDNG--TDDQTRADIA 236 P + K+A G F FP + D+ DFS+SGLKT + + T +IA Sbjct: 179 PNIEKLAIDGNQYAFNFPITIFDKKENRYDFSYSGLKTACIHQLEKFKNNNAQITNNNIA 238 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ A + L+I KRA+ T K+L+++GGV++N LR K+ + E +Y + Sbjct: 239 ASFQRAAFENLIIPIKRAIKDTNIKKLIISGGVASNLYLREKIKNL----EIETYYPPID 294 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 CTDN AMIA G + + + + + R Sbjct: 295 LCTDNAAMIAGIGYLMYLKYGASSIETNANSR 326 >UniRef50_C7LR95 Metalloendopeptidase, glycoprotease family n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LR95_DESBD Length = 356 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 138/342 (40%), Positives = 196/342 (57%), Gaps = 16/342 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A++DD L+ + +++Q+ +H+ +GGVVPELASR+H+R L+ Sbjct: 1 MICLGIETSCDETSVALWDD-GHLVTDLVHTQIPMHSVFGGVVPELASREHLRLLDGLVS 59 Query: 61 AALKES-GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + L+ + + ID +A T GPGL+GALLVG + +SL+ + VP I V+H+ HLLA Sbjct: 60 SVLQSAERPAGQGIDLIAVTRGPGLLGALLVGISYAKSLSLSLGVPVIGVNHLYAHLLAC 119 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 P E+P + +LVSGGHT + + ++ LLG+++DDAAGEAFDK AKLL L YPG Sbjct: 120 DF-TEPIEYPALGVLVSGGHTHIYEMPAPCEFNLLGKTLDDAAGEAFDKIAKLLNLPYPG 178 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD----------- 228 G + +A GTA +F +P DFSFSGLKT A + Sbjct: 179 GKYIDILARLGTADPRLFSKPYLQNDNCDFSFSGLKTAVAQYVHKKSFAAIDYAAFDVEL 238 Query: 229 -DQTRADIARAFEDAVVDTLMIKCKRALDQT-GFKRLVMAGGVSANRTLRAKLAEMMKKR 286 Q D+ + +V+TL+ K +RA+ + K L +AGGV+AN LR K + R Sbjct: 239 IPQEIKDLCATVNETIVETLLEKTRRAVARCHDVKTLCLAGGVAANSHLRHKFSAFAHAR 298 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +C DN AMIAYAG+ K G + + PR Sbjct: 299 GFKFLAPAQNYCGDNAAMIAYAGVQWAKKGLMSSMDFEAVPR 340 >UniRef50_Q7UM42 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Planctomycetaceae RepID=GCP_RHOBA Length = 358 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 137/345 (39%), Positives = 193/345 (55%), Gaps = 20/345 (5%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE++CDET A+ + +L + +Q LH +GGVVPE+A+R H+ + +P+I Sbjct: 9 LLLSIESTCDETAAAVIRRDGTVLGQCIATQETLHEQFGGVVPEIAARAHLERILPVIDT 68 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL ++ + +D+ A+A PGL G+LLVG ++LA AW+ P I ++H+ HL A L Sbjct: 69 ALTQAKVRGEDLTAIAVADRPGLAGSLLVGVVAAKTLALAWNKPLISLNHLHAHLYACQL 128 Query: 122 EDNPPE--FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + P +P + L+VSGGHT L E LG +IDDAAGEAFDK A +L L +PG Sbjct: 129 IEGAPANIYPAIGLIVSGGHTSLYVCRTAIDLEYLGGTIDDAAGEAFDKVAAMLSLPFPG 188 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--------DDQT 231 G ++K+A+QG + FPR M PG DFSFSGLKT I G DQ Sbjct: 189 GIEVAKLASQGNDKAYSFPRSMIHDPGDDFSFSGLKTAVRYAIVGPGRQDFASLDISDQV 248 Query: 232 RADIARAFEDAVVDTLMIKCKRALD---------QTGFKRLVMAGGVSANRTLRAKLAEM 282 + D+ +FE AVVD L+ KC+RA+ Q RL++ GGV+AN+ LR L Sbjct: 249 KRDVCASFEAAVVDVLVSKCRRAIKRHRNRNNDPQNSINRLIVGGGVAANQRLRRDLQAA 308 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 K E++ A P CTDN M +F+A A L + + P Sbjct: 309 ADKDGFELWIAPPHLCTDNAVM-GAIAWKKFEAEQFASLDLDITP 352 >UniRef50_B1XJF0 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Synechococcus sp. PCC 7002 RepID=GCP_SYNP2 Length = 355 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 146/340 (42%), Positives = 200/340 (58%), Gaps = 8/340 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VL IETSCDET +AI + + +L N + SQ+ +H ++GGVVPE+ASR H+ I Sbjct: 3 IVLAIETSCDETAVAIV-NNRKVLGNVVASQIDIHREFGGVVPEVASRHHLESINACIDT 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A ++SGL+ +I+A+A T PGLVGALL+GA G++LA + P I VHH+EGH+ A L Sbjct: 62 AFEQSGLSWSEIEAIATTCAPGLVGALLLGAAAGKTLAMIHNKPFIGVHHLEGHIYASYL 121 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E PF+ LLVSGGHT I V G G+Y+LLGE+ DDAAGEAFDK A+LL + YPGGP Sbjct: 122 SQPELEPPFLCLLVSGGHTSFIEVRGCGEYKLLGETRDDAAGEAFDKVARLLRVGYPGGP 181 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPG-----LDFSFSGLKTFAANTIRDNGTDDQ--TRAD 234 ++ ++A G F P PG D SFSGLKT ++ T + AD Sbjct: 182 VIDRLAKTGDPQAFKLPEGRISLPGGGYHPYDCSFSGLKTAVLRLVQQFETQGKAVPVAD 241 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 IA +F+ V L + R + +V+ GGV+AN LR L + +V++ Sbjct: 242 IAASFQYTVAQALTKRAVRCAGDRQLQTIVVGGGVAANSGLRQILTAAAAEAGIQVYFPP 301 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 +FCTDN AMIA A F+ G + L + V R P+ ++ Sbjct: 302 LKFCTDNAAMIACAAAEHFQKGDRSRLDLPVASRLPITQV 341 >UniRef50_B2KE20 Metalloendopeptidase, glycoprotease family n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KE20_ELUMP Length = 342 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 135/339 (39%), Positives = 201/339 (59%), Gaps = 16/339 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +LGIET+CDET AI + L++N +++Q+ +H Y GVVPELASR H K +++ Sbjct: 7 ITILGIETTCDETSAAILKSGRDLVSNVVHTQIDIHKKYCGVVPELASRAHAVKVAEVVK 66 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ID VA+ +GPGL G L+VG +++ +VP I V+H+EGHL A Sbjct: 67 EALGNHK-----IDLVAFASGPGLPGGLMVGRVAAEAVSALKNVPIIGVNHLEGHLFACE 121 Query: 121 LE--------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL 172 + D +FP +AL+VSGGHT+L V G Y++LG + DDAAGEAFDK AKL Sbjct: 122 FDAKEGKIAADKQLKFPLIALIVSGGHTELWYVKNYGDYKMLGRTRDDAAGEAFDKVAKL 181 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL YPGGP+++K A +G FPRPM + +FSFSG+KT + +RD+ D + Sbjct: 182 LGLGYPGGPVVAKEALKGNPEAIKFPRPMM-KGTFEFSFSGIKTAVSYYLRDHK--DIKK 238 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 D+ +F+ A+V+TL+ K +A+ + K + + GGV+AN L+ + + +K +V + Sbjct: 239 EDVCASFQAAMVETLVAKTFQAVKKYKVKNVAVGGGVAANELLKESMVKRGQKEGVDVSF 298 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +DNGAMIA AG +F + + + P + Sbjct: 299 VPRALSSDNGAMIALAGYKKFMFAGKFNANIRINPNMRI 337 >UniRef50_C8WN77 Metalloendopeptidase, glycoprotease family n=3 Tax=Bacteria RepID=C8WN77_EGGLE Length = 891 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 135/341 (39%), Positives = 183/341 (53%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D L+A+ + SQ+ HA +GGVVPE+ASR H+ + Sbjct: 549 LILAIESSCDETAAAIVDGNGTLIADVVASQIDFHARFGGVVPEIASRKHIEAICGVCDE 608 Query: 62 ALKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + LT +D+D++A T PGLVGAL+VG + A+A P I V+H+EG Sbjct: 609 CFDVAASALGIERLTWRDLDSIAVTYAPGLVGALVVGVAFAKGAAWAAGKPFIGVNHLEG 668 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A + + P V LVSGG+T L+ + G G YE LG +IDDA GEAFDK AK LG Sbjct: 669 HLYANKIGAPDFQPPAVVSLVSGGNTLLVHMKGWGDYETLGATIDDAVGEAFDKVAKALG 728 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTR 232 L YPGGP++S+ AA+G FPR M L FS SGLKT I + + Sbjct: 729 LGYPGGPVISREAAKGDPNAIPFPRAMMHSGDLRFSLSGLKTAVVTYINNERAAGRELNV 788 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 +I +F+ AVVD + K + AL+QTG + + GGV+AN LR ++ ++ + Sbjct: 789 PNICASFQQAVVDVQVKKAEMALEQTGARTFCLGGGVAANPALRDAYEQLCERLHVRLTL 848 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN MIA + R G L + L E Sbjct: 849 PPLSACGDNAGMIALVALDRHNQGKFFTLEADAQAHANLDE 889 Score = 76.0 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 30/112 (26%), Positives = 50/112 (44%), Gaps = 10/112 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL +T+ + I + G+L + ++L A V A R + +P I AA Sbjct: 18 VLAFDTANEIIAIGL-----GVL-HASSRMIELTAS----VEAEARRASNTQLLPRIDAA 67 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L E G+ +DI VA GPG + + + +A A +VP + V ++ Sbjct: 68 LAEHGVAREDIACVAVGRGPGSFTGVRIAMATAKGIASALEVPLVGVSSLDA 119 >UniRef50_C9RIN4 Metalloendopeptidase, glycoprotease family n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RIN4_FIBSS Length = 335 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. Identities = 139/305 (45%), Positives = 186/305 (60%), Gaps = 3/305 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEK-GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M LGIE+SCDET A+ D+ +L+N LYSQ+ HA YGGVVPE+A+R H++K P+ Sbjct: 1 MIWLGIESSCDETACAVLQDDPLKVLSNPLYSQIDEHALYGGVVPEIAARAHLQKIAPIA 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 +AA+KE+G+ KDIDA+AYT GPGL+G LLVGA+ + LA ++PA ++H+EGHL A Sbjct: 61 EAAVKEAGVELKDIDAIAYTTGPGLMGPLLVGASFAKGLARDLNIPAYGMNHLEGHLAAA 120 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L + E PF+ L VSGGHT+L+ +Y +G + DDAAGEAFDK KL+GL YP Sbjct: 121 WLSNPDIEPPFLTLTVSGGHTELVMEEPGFKYTSIGRTRDDAAGEAFDKCGKLIGLKYPA 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRADIAR 237 G +S++ FPR + +FSFSGLKT + + Q DI Sbjct: 181 GATISRLGKDHNRKFVEFPRALHTHDSCEFSFSGLKTAVLRYTETHDPEFIQQNLGDICA 240 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + EDA+VD+L+ K AL +T K LVM GGVSAN LR +L + K+ Sbjct: 241 SLEDAIVDSLVTKTINALKKTKMKTLVMGGGVSANSWLRTRLQDYCDKKGIRFCVPDRSL 300 Query: 298 CTDNG 302 TDNG Sbjct: 301 STDNG 305 >UniRef50_Q0ATQ2 Probable O-sialoglycoprotein endopeptidase n=44 Tax=Proteobacteria RepID=GCP_MARMM Length = 377 Score = 366 bits (939), Expect = e-99, Method: Composition-based stats. Identities = 147/349 (42%), Positives = 202/349 (57%), Gaps = 14/349 (4%) Query: 1 MRVLGIETSCDETGIAI----YDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 + VLG+E+SCDET AI D +LA+++ Q HA +GGVVPE+A+R H Sbjct: 15 LTVLGLESSCDETAAAILRREVDGSVTVLADRVLGQNDAHAPFGGVVPEIAARAHAEAMD 74 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ AL E+GL D+D +A T+GPGL+G ++ + LA P I V+H+EGH Sbjct: 75 GLVSQALAEAGLAVADLDGIAATSGPGLIGGVMAALMTAKGLALGAGKPLIAVNHLEGHA 134 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L+P + + P FP++ LLVSGGHTQL+ G+G Y LG ++DDAAGEAFDKTAK++GL Sbjct: 135 LSPRISE-PLAFPYLLLLVSGGHTQLLIAEGVGVYHRLGSTMDDAAGEAFDKTAKVMGLG 193 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADI 235 +PGGP L + A G A RF P P+ +PG DFSF+GLKT A + DQ RAD+ Sbjct: 194 FPGGPALERCAQSGDATRFALPVPLKGKPGCDFSFAGLKTAARQIWDGLDAPSDQDRADL 253 Query: 236 ARAFEDAVVDTLMIKCKRAL--------DQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 + + A+ L + +RAL D + LV+AGGV+AN+ +RA L + Sbjct: 254 SACVQAAIARALSSRTRRALAMFVDRFPDASRPMALVVAGGVAANKAVRAALEDEAAAAG 313 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 + ++CTDN AMIA G+ + G L R RWPL A Sbjct: 314 FRLVAPPMKWCTDNAAMIALVGLEKLARGQIDGLDAPARARWPLDGAAA 362 >UniRef50_Q254Q0 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Chlamydiaceae RepID=GCP_CHLFF Length = 344 Score = 365 bits (938), Expect = e-99, Method: Composition-based stats. Identities = 125/339 (36%), Positives = 186/339 (54%), Gaps = 16/339 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG+E+SCDET A+ D + ++AN + SQ + HA YGGVVPELASR H++ ++ Sbjct: 1 MLTLGLESSCDETACALVDADAQIVANVVSSQ-QYHASYGGVVPELASRAHLQMLPSVVN 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +AL++SG++ DID +A T PGL+G+L VG + LA P I V+H+E HL A Sbjct: 60 SALEKSGVSLDDIDLIAVTHTPGLIGSLAVGVNFAKGLAIGSQKPMIGVNHVEAHLYAAY 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E EFP + L++SG HT + + Y+L+G + DDA GE FDK + LGL YPGG Sbjct: 120 MEAKNVEFPALGLVMSGAHTSMFLMEDPLSYKLIGNTRDDAIGETFDKVGRFLGLPYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD---------QT 231 L+ MA+QG + F PG D SFSGLKT I+ N ++ + Sbjct: 180 ALIEMMASQGCEESYPFSAAKV--PGYDLSFSGLKTAVLYAIKGNNSNSRSPLPDLSQKE 237 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + +IA +F+ A T+ K + + + +++ GGV+ N+ + L + ++ Sbjct: 238 KNNIAASFQKAAFMTIAQKLPKIIKNFSCRSILVGGGVANNKYFQTLLQNTL---NLPLY 294 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATA-DLGVSVRPRW 329 + + CTDN AMIA G F + T + R RW Sbjct: 295 FPSSKLCTDNAAMIAGLGRELFLSRKTTQGITPCARYRW 333 >UniRef50_B2S3R9 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Treponema RepID=GCP_TREPS Length = 352 Score = 365 bits (937), Expect = 1e-99, Method: Composition-based stats. Identities = 134/332 (40%), Positives = 183/332 (55%), Gaps = 8/332 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCDET +AI D + +N + +Q+ HA Y G+VPELASR H+ +P ++ Sbjct: 1 MNVLGIETSCDETAVAIVKDGTHVCSNVVATQIPFHAPYRGIVPELASRKHIEWILPTVK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + LT DID +A T PGL G+LLVG T ++LA++ +P I V+H+ H A Sbjct: 61 EALARAQLTLADIDGIAVTHAPGLTGSLLVGLTFAKTLAWSMHLPFIAVNHLHAHFCAAH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E + +P+V LL SGGH + V Q E LG +IDDA GEAFDK A G YPGG Sbjct: 121 VEHD-LAYPYVGLLASGGHALVCVVHDFDQVEALGATIDDAPGEAFDKVAAFYGFGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG--LDFSFSGLKTFAANTIRDNGTDDQTR--ADIA 236 ++ +A QG A FP P G D S+SGLKT + + + R +IA Sbjct: 180 KVIETLAEQGDARAARFPLPHFHGKGHRYDVSYSGLKTAVIHQLDHFWNKEYERTAQNIA 239 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 AF+ ++ L+ RAL TG V+ GGV+AN LR +A+ + + E Sbjct: 240 AAFQACAINILLRPLARALQDTGLPTAVVCGGVAANSLLRKSVADW---KHARCVFPSRE 296 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +CTDN M+A G G + GV+ R R Sbjct: 297 YCTDNAVMVAALGYRYLIRGDRSFYGVTERSR 328 >UniRef50_Q3SVF4 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Rhizobiales RepID=GCP_NITWN Length = 357 Score = 364 bits (935), Expect = 3e-99, Method: Composition-based stats. Identities = 151/342 (44%), Positives = 198/342 (57%), Gaps = 11/342 (3%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIET+CDET A+ D +L+N + SQ + HA YGGVVPE+A+R HV Sbjct: 1 MLVLGIETTCDETAAAVVERLPDGSARILSNIVRSQTEEHAPYGGVVPEIAARAHVELLD 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 LI A+ ESG+ + + VA AGPGL+G ++VG T +++A P V+H+E H Sbjct: 61 GLIARAMTESGVGFRQLSGVAAAAGPGLIGGVIVGLTTAKAIALVHGTPLTAVNHLEAHA 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L P L EFP+ L SGGHTQ+++V G+G Y LG ++DDA GEAFDK AK+LGL Sbjct: 121 LTPRLTSR-LEFPYCLFLASGGHTQIVAVLGVGNYVRLGTTVDDAMGEAFDKVAKMLGLP 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADI 235 YPGGP + + AA G A RF FPRPM RP +FS SGLKT N + + + +D+ Sbjct: 180 YPGGPEVERAAASGDATRFNFPRPMLGRPDANFSLSGLKTAVRNEAARIDPLEPRDISDL 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 F+ AV++ + L + LV AGGV+AN+ +RA L + K R + Sbjct: 240 CAGFQAAVLEATADRLGVGLRLFEERFGRPRALVAAGGVAANQAIRASLEGVAAKARTSL 299 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 P CTDNGAMIA+AG R AG T L R RW L Sbjct: 300 IIPPPALCTDNGAMIAWAGAERLAAGLTDSLETPPRARWLLD 341 >UniRef50_B4U8B7 Metalloendopeptidase, glycoprotease family n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U8B7_HYDS0 Length = 343 Score = 364 bits (934), Expect = 3e-99, Method: Composition-based stats. Identities = 139/336 (41%), Positives = 205/336 (61%), Gaps = 16/336 (4%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIETSCD+T +A+Y ++GL+ N L SQV H Y G+VPEL SR+H + L L Sbjct: 10 LGIETSCDDTALALYSSKRGLIDNLLSSQVNAHKIYNGIVPELCSREHTKNLYILFYELL 69 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 ++ + DID +A T PGL+ +LLVGA+ L++A D+P +PVHH+E H+ + LE Sbjct: 70 EKHKIKPSDIDFLAVTIAPGLILSLLVGASFASGLSYALDIPIVPVHHIEAHIYSVFLEY 129 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLL 183 N E+PF+AL+VSGGHT++ V G YEL+G+++DDAAGEAFDK A LLGL YPGGP + Sbjct: 130 N-VEYPFLALVVSGGHTEIYLVKGFEHYELIGKTLDDAAGEAFDKGAVLLGLQYPGGPAI 188 Query: 184 SK-MAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 K +++ FP P+ D + FSFSGLKTF D + + ++++A Sbjct: 189 EKFLSSYENPETIDFPIPIKDDR-IAFSFSGLKTFL-----RENKDKYPKDALVFSYQEA 242 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 +V+ ++ ++A+ +T RLV+ GGV+AN+ LR KL + E + ++CTDN Sbjct: 243 IVNHIIRTLQKAIKKTAVNRLVVVGGVAANKRLREKLNAL----DIECYIPSIKYCTDNA 298 Query: 303 AMIAYAGMVRFKAGAT---ADL-GVSVRPRWPLAEL 334 AM++ G +RF G +DL ++ P L + Sbjct: 299 AMVSLVGNMRFLKGKYYKKSDLHKLNPDPSLRLEDF 334 >UniRef50_C5ZWF6 Metal-dependent protease n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZWF6_9HELI Length = 351 Score = 363 bits (933), Expect = 4e-99, Method: Composition-based stats. Identities = 119/334 (35%), Positives = 181/334 (54%), Gaps = 12/334 (3%) Query: 2 RVLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +L IE+SCD++ IAI +K ++ +Q SQ + H+ YGGVVPE+ASR H +P I Sbjct: 19 LILSIESSCDDSSIAITQIKDKKIVFHQKISQEREHSSYGGVVPEIASRLHAE-ILPQI- 76 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L+ + KD+ A+A T PGL L+ G + ++L+FA ++P I V+H++GHL + Sbjct: 77 --LEHTKPYFKDLKAIAVTTEPGLNITLMEGLMMAKTLSFALEIPLISVNHLKGHLYSLF 134 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE FP ALLVSGGHT L+ + ++ ++IDD+ GE+FDK +K+LGL YPGG Sbjct: 135 LEQEAI-FPLGALLVSGGHTMLLEARSFNEINIIAQTIDDSFGESFDKVSKMLGLGYPGG 193 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT-RADIARAF 239 P++ A +G F P P+ R FSFSGLK I+ + DI +F Sbjct: 194 PIVEFQAQKGNDRAFELPLPLKSRKDFAFSFSGLKNAVRLVIQKQEIQSKAFVEDICASF 253 Query: 240 EDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + ++ L K + ++ +K + GG SAN LR ++ + + A Sbjct: 254 QRVAIEHLSKKTQIFFEKNSKSMDSWKYFGVIGGASANLVLRNEIQRICDYYGVTLLLAP 313 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 E+C+DN AMI + + G D + V+PR Sbjct: 314 LEYCSDNAAMIGRVALESYLRGEFGDFNLQVKPR 347 >UniRef50_B5ZLG0 Metalloendopeptidase, glycoprotease family n=11 Tax=Rhodospirillales RepID=B5ZLG0_GLUDA Length = 382 Score = 363 bits (933), Expect = 4e-99, Method: Composition-based stats. Identities = 141/345 (40%), Positives = 195/345 (56%), Gaps = 14/345 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCD+T AI + +LA + SQ H +GGVVPE+A+R H+ L++ Sbjct: 30 ILAIESSCDDTACAILAPDGTILAETVLSQAG-HVPFGGVVPEIAARAHLAALPALVRHT 88 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L + L A+ + A+A + GPGL+G L+VGA + + LA A P + V+H+E H L L Sbjct: 89 LDVAALPAEALGAIAASTGPGLIGGLIVGAGMAKGLAVALGRPFVAVNHIEAHALTARLP 148 Query: 123 D---NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 FP++ LLVSGGH Q I+V G+G+Y LG +IDDAAGEAFDK AK+LGL +PG Sbjct: 149 GLVPGGASFPYLLLLVSGGHCQCIAVEGVGRYRKLGGTIDDAAGEAFDKVAKMLGLGWPG 208 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR---ADIA 236 GP + +A +G + PRP+ RPG DFSFSGLKT A + R A IA Sbjct: 209 GPAVEALAREGDPAPWPLPRPLRGRPGCDFSFSGLKTAVAQKLAPFAAGALPRTAAAGIA 268 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVM-AGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F+DAV D + + ALD L++ AGGV+AN LR +L + R Sbjct: 269 ASFQDAVADIVADRVAHALDMMPQATLLVAAGGVAANTALRTRLTTLATSRALPFAAPPL 328 Query: 296 EFCTDNGAMIAYAGMVRFKAGA------TADLGVSVRPRWPLAEL 334 CTDN M+ +A + + T DL + RPRWPL ++ Sbjct: 329 RLCTDNAVMVGWAAIETLRERRRLGLPPTDDLDLLPRPRWPLEQM 373 >UniRef50_A5GMV4 Probable O-sialoglycoprotein endopeptidase n=17 Tax=cellular organisms RepID=GCP_SYNPW Length = 356 Score = 363 bits (933), Expect = 4e-99, Method: Composition-based stats. Identities = 154/342 (45%), Positives = 201/342 (58%), Gaps = 11/342 (3%) Query: 2 RVLGIETSCDETGIAIYD---DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +VL +ETSCDE+ A+ +LA+++ SQV+ HA +GGVVPE+ASR HV L Sbjct: 3 KVLALETSCDESAAAVVQHSAGGLEVLAHRIASQVEEHAQWGGVVPEIASRRHVEALPHL 62 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 I A L E+GL ++DAVA T PGLVGAL+VG+ GR+LA P + VHH+E HL + Sbjct: 63 ISAVLDEAGLAVGEMDAVAATVTPGLVGALMVGSLTGRTLAALHHKPFLGVHHLEAHLAS 122 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 L +PPE P+V LLVSGGHT+LI V + LG S DDAAGEAFDK A+LLGL YP Sbjct: 123 VRLASSPPEAPYVVLLVSGGHTELILVDSDSGLQRLGRSHDDAAGEAFDKVARLLGLAYP 182 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPG-----LDFSFSGLKTFAANTIR--DNGTDDQT 231 GGP + A G RF P+ RP DFSFSGLKT + +D Sbjct: 183 GGPAIQAAAKAGDPKRFSLPKGRVSRPEGGFYPYDFSFSGLKTAMLRQVESLKAQSDALP 242 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 D+A +FE VVD L+ + R G LVM GGV+AN LR ++ + ++R V Sbjct: 243 LEDLAASFEQIVVDVLVERSLRCCLDRGLSTLVMVGGVAANVRLRVQMEQQGRERGVSVH 302 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGA-TADLGVSVRPRWPLA 332 A +CTDN AM+ A + R +AG ++ + + V RWPL Sbjct: 303 LAPLAYCTDNAAMVGAAALGRLQAGWGSSSIRLGVSARWPLE 344 >UniRef50_C0QY51 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Brachyspira RepID=GCP_BRAHW Length = 340 Score = 363 bits (933), Expect = 5e-99, Method: Composition-based stats. Identities = 121/331 (36%), Positives = 195/331 (58%), Gaps = 7/331 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGI+TSCD+T AI +D K +L++ L S + H ++ GVVPE+A+R H+ + +I Sbjct: 1 MKILGIDTSCDDTSAAIVEDGKNVLSSVLSSSIDAHKEFQGVVPEIAARKHLEAILYVID 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ T DID A T PGL+G+LLVG +SLAF+ + P + + H+ H+ +P Sbjct: 61 KALKDANTTLDDIDLFAVTNRPGLLGSLLVGVASAKSLAFSLNKPLLALDHIAAHIYSPH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L N EFP++AL+VSGGHT + V G+Y+++G ++DDA GEA+DK +K L L YPGG Sbjct: 121 L-TNDIEFPYIALVVSGGHTIITEVHDYGEYKVVGTTLDDAVGEAYDKVSKFLNLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDR-PGLDFSFSGLKTFAANTIRDNGTD--DQTRADIAR 237 P++ ++A +G +P + + +FS+SGLKT + + + + T +IA Sbjct: 180 PIIDRLAKEGNKEAIKYPIVLLNGIDEFNFSYSGLKTACVYSTKKYLKEGYEATNENIAA 239 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 AF+ + ++ L IK + +++G KR+ ++GGV+ N LR + + E + ++ Sbjct: 240 AFQISAIEPLYIKTLKYAEKSGIKRVTLSGGVACNSYLRDRFGN---SKDFECYLPALKY 296 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 TDN AM+A AD + R Sbjct: 297 TTDNAAMVAGLAYHMKDKQNFADYNLDCFSR 327 >UniRef50_D1N4S8 Metalloendopeptidase, glycoprotease family n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N4S8_9BACT Length = 359 Score = 363 bits (933), Expect = 5e-99, Method: Composition-based stats. Identities = 140/354 (39%), Positives = 206/354 (58%), Gaps = 21/354 (5%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCDET A+ D +L++ + SQ+ HA +GGVVPELA+R+H+ P+++ Sbjct: 3 LILGIESSCDETAAAVVRDGYQVLSSCVASQIAKHAVHGGVVPELAAREHLVALNPVVEG 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+E+G+T K+IDA+A T GPGL+ ALLVG + + LA P I V+H H+ L Sbjct: 63 ALREAGVTMKEIDAIAVTQGPGLIPALLVGLSFAKGLAMGNGKPLIGVNHFIAHIYGAFL 122 Query: 122 E------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 + +NP +P +AL+VSGGHT L+ + G+ LG +IDDAAGEA DK AKLLGL Sbjct: 123 DEAHGVLENPATYPLLALVVSGGHTSLMLIERDGKARQLGCTIDDAAGEALDKGAKLLGL 182 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPG--------LDFSFSGLKTFAANTIRDNGT 227 YPGGP++ K A G ++ FPRP+T G +FSFSG+KT ++ + Sbjct: 183 GYPGGPIMQKTAEGGDPHKYEFPRPLTGGAGKPLAPENLYNFSFSGIKTALLYHVKHHAG 242 Query: 228 DD-----QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEM 282 D + D ++++AVVD L K A G K +V+AGGV+ N LR + E Sbjct: 243 ADGKLPAELLQDTVASYQEAVVDVLTRKTLLAAKNFGAKTIVVAGGVACNSVLRERF-EA 301 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWP-LAELP 335 + + ++ A ++CTDN AM+ G + A + L + R P + ++P Sbjct: 302 LTPKHVQLRLAARKYCTDNAAMVGGLGWHYHRKQAYSPLNIDSFARLPQITQVP 355 >UniRef50_Q2SR45 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Mollicutes RepID=GCP_MYCCT Length = 319 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 118/315 (37%), Positives = 191/315 (60%), Gaps = 6/315 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L IE+SCDE I+I D+ K +L N + SQ+K H +GGVVPELA+R HV+ +++ Sbjct: 1 MKILAIESSCDEFSISIIDNNK-ILTNIISSQIKDHQVFGGVVPELAARLHVQNFNWVLK 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AAL +S L ++ID +AYT PGL+G+L++G V +++ + P + + H++GH+ Sbjct: 60 AALSQSNLNIEEIDYIAYTKSPGLIGSLIIGKLVAETISLYINKPILALDHIQGHIFGAS 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E N +P +A++VSGGHTQ+ + ++++G + DDA GE +DK A++LGL YPGG Sbjct: 120 IE-NEFIYPVLAMVVSGGHTQIEIINSANDFQIIGSTRDDAIGECYDKVARVLGLSYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARA 238 P+L K+A +G + P + D DFS+SGLKT N I + + + A + Sbjct: 179 PILDKLALKGNKDFYSLPV-LKDDNTYDFSYSGLKTACINLIHNLNQKKQEINLENFAAS 237 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE-VFYARPEF 297 F+ + + K ++A+ + K L +AGGVSAN +R + ++ +K + F + + Sbjct: 238 FQYTATNIIEKKLEKAIKEFKPKTLTVAGGVSANSEIRKIILKLGQKYNIKNTFVPKMSY 297 Query: 298 CTDNGAMIAYAGMVR 312 CTDN AMIA + Sbjct: 298 CTDNAAMIAKLAYEK 312 >UniRef50_A0JZ01 Probable O-sialoglycoprotein endopeptidase n=98 Tax=Bacteria RepID=GCP_ARTS2 Length = 356 Score = 362 bits (929), Expect = 1e-98, Method: Composition-based stats. Identities = 142/351 (40%), Positives = 198/351 (56%), Gaps = 17/351 (4%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIE+SCDETG+ I LL+N + S ++ H +GGV+PE+ASR H+ VP +Q Sbjct: 7 LVLGIESSCDETGVGIVRGT-ALLSNTVSSSMEEHVRFGGVIPEIASRAHLDAFVPTLQE 65 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL ++G+ D+DA+A T+GPGL GAL+VG ++LA A P ++H+ H+ +L Sbjct: 66 ALADAGVQLDDVDAIAVTSGPGLAGALMVGVCAAKALAVATGKPLYAINHLVAHVGVGLL 125 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGI-GQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 E+N ALLVSGGHT+++ + I ELLG +IDDAAGEA+DK A+LLGL YPG Sbjct: 126 QEENTLPEHLGALLVSGGHTEILRIRSITDDVELLGSTIDDAAGEAYDKVARLLGLGYPG 185 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRP-----------GLDFSFSGLKTFAANTIR--DNG 226 GP + K+A G A FPR +T D+SFSGLKT A + + Sbjct: 186 GPAIDKLARTGNAKAIRFPRGLTQPKYMGTADEPGPHRYDWSFSGLKTAVARCVEQFEAR 245 Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 D+ ADIA AF++AVVD + K A + G L++ GGV+AN LR + + Sbjct: 246 GDEVPVADIAAAFQEAVVDVITSKAVLACTENGITELLLGGGVAANSRLRQLTEQRCRAA 305 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAG-ATADLGVSVRPRWPLAELPA 336 + E CTDNGAM+A G AG + + + P+ + A Sbjct: 306 GIRLTVPPLELCTDNGAMVAALGAQLVMAGIEPSGISFAPDSSMPVTTVSA 356 >UniRef50_Q30ZN1 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Proteobacteria RepID=GCP_DESDG Length = 367 Score = 354 bits (909), Expect = 3e-96, Method: Composition-based stats. Identities = 140/356 (39%), Positives = 191/356 (53%), Gaps = 30/356 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR LGIE+SCDET +AI DD + L+ + +Q +LHA +GGVVPELASR+H R + Sbjct: 1 MRCLGIESSCDETALAIVDDGR-LVDAVMSTQAELHALFGGVVPELASREHYRLIGRMFD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + + GL +DID ++ GPGL+G+LLVG + LA A + V+H+ HLLA Sbjct: 60 SLMLRCGLGVQDIDVISVARGPGLLGSLLVGVGFAKGLALAGGQRLVGVNHLHAHLLAAG 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE FP + +LVSGGHT L + + L+G ++DDAAGEAFDK AK+L L YPGG Sbjct: 120 LEHRLV-FPALGVLVSGGHTHLYRIDSPRNFTLVGRTLDDAAGEAFDKVAKMLNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------------ 228 + + +FPRP TD LDFSFSGLKT + ++ +G Sbjct: 179 RFIDVLGHMADPDDSMFPRPYTDNDNLDFSFSGLKTAVSTWLKAHGGTALAAPPAESELQ 238 Query: 229 ------------DQTRADIARAFEDAVVDTLMIKCKRALDQTG----FKRLVMAGGVSAN 272 + +F AV DTL IK +RAL + G + +V+AGGV+AN Sbjct: 239 AMLQNNVLPSGMPADMPLVCASFNAAVADTLYIKARRALQRLGGRGQIRSVVVAGGVAAN 298 Query: 273 RTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +R + + + + P CTDNGAMIAY G + G L + PR Sbjct: 299 SRVRTSMQRLAAEEGLHLHLPSPALCTDNGAMIAYTGWLLASEGLHHSLELETMPR 354 >UniRef50_A1R8N0 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Bacteria RepID=GCP_ARTAT Length = 368 Score = 353 bits (907), Expect = 4e-96, Method: Composition-based stats. Identities = 137/367 (37%), Positives = 196/367 (53%), Gaps = 34/367 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDETG+ I LL N + S + H +GGV+PE+ASR H+ VP +Q + Sbjct: 1 MLGIESSCDETGVGIVRGT-TLLTNTVSSSMDEHVRFGGVIPEIASRAHLDAFVPTLQES 59 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+G+T +DIDA+A T+GPGL GAL+VG ++LA A P ++H+ H+ +L+ Sbjct: 60 LQEAGVTLEDIDAIAVTSGPGLAGALMVGVCAAKALAVATGKPLYAINHLVAHVGVGLLD 119 Query: 123 DNPPEFP------------------FVALLVSGGHTQLISVTGI-GQYELLGESIDDAAG 163 N ALLVSGGHT+++ + I ELLG +IDDAAG Sbjct: 120 GNRVSEGKHDAVAAAGLGAGKLPENLGALLVSGGHTEILRIRSITDDVELLGSTIDDAAG 179 Query: 164 EAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRP-----------GLDFSFS 212 EA+DK A++LGL YPGGP + K+A QG FPR +T D+SFS Sbjct: 180 EAYDKVARILGLGYPGGPAIDKLAHQGNPKSIRFPRGLTQPKYMGTAEEKGPHRYDWSFS 239 Query: 213 GLKTFAANTIR--DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVS 270 GLKT A + + ++ ADIA AF++AVVD + K A + G +++ GGV+ Sbjct: 240 GLKTAVARCVEQFEARGEEVPVADIAAAFQEAVVDVISSKAVLACKEHGITDVLLGGGVA 299 Query: 271 ANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPRW 329 AN LR + + CTDNGAM+A G AG + + + + Sbjct: 300 ANSRLRELTGQRCASAGITLHVPPLGLCTDNGAMVAALGAQLIMAGISPSGVSFAPDSSM 359 Query: 330 PLAELPA 336 P+ + Sbjct: 360 PVTTVSV 366 >UniRef50_UPI0000D561DB PREDICTED: similar to AGAP005215-PA n=1 Tax=Tribolium castaneum RepID=UPI0000D561DB Length = 406 Score = 352 bits (905), Expect = 7e-96, Method: Composition-based stats. Identities = 116/358 (32%), Positives = 174/358 (48%), Gaps = 27/358 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCD+TG A+ D E +L L+SQ +H GG++P +A H ++ Sbjct: 21 LILGIETSCDDTGCAVVDTEGNILGEALHSQHLIHLANGGIIPPIAQNLHRENIESVVNT 80 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+K S + +D+ AVA T PGL +L +G G+ L ++ P IP+HHME H L + Sbjct: 81 AVKNSNYSFRDLSAVAVTVKPGLPLSLTIGMKYGKYLCRLYNKPFIPIHHMEAHALTARM 140 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 D EFPF+ LL+SGGH L +G++ LLG + DDA GEAFDK A+ + L Sbjct: 141 HDKTVEFPFLVLLISGGHCLLAVAQDVGRFFLLGSTRDDAPGEAFDKVARRMKLTNLSEF 200 Query: 176 -DYPGGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 GG + A++ +F F P+T FS +GLKT + + Sbjct: 201 SKLSGGQAIELAASRAKNPLQFKFTIPLTQYRDCKFSLAGLKTQVRRHLLEEEKKHNVPP 260 Query: 234 D--------IARAFEDAVVDTLMIKCKRALDQTGFK--------RLVMAGGVSANRTLRA 277 D + F+ AV + + +RA+ K LV++GG + N + Sbjct: 261 DGLIPDVFNLCAGFQLAVTRHICQRVQRAMVYARRKEMIPENSQTLVVSGGAACNNFIAR 320 Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA--GATADL-GVSVRPRWPLA 332 L + + + P+ C DNG MIA+ G+ R++A G D V ++ PL Sbjct: 321 GLQLVCDEMAYKFVRPPPKLCLDNGVMIAWNGVERWRAKLGVLHDYASVEIQKSCPLG 378 >UniRef50_A1BJ68 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Chlorobiaceae RepID=GCP_CHLPD Length = 353 Score = 352 bits (905), Expect = 9e-96, Method: Composition-based stats. Identities = 131/329 (39%), Positives = 192/329 (58%), Gaps = 16/329 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCDET A+ + +N + SQ+ H +GGVVPELASR+H R V ++ Sbjct: 1 MKILGIETSCDETSAAVL-SNGSVCSNIVSSQL-CHTSFGGVVPELASREHERLIVSIVD 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +AL E+ +T D+D +A TAGPGL+GA++VG G+++A+A +P +PV+H+E H+ + Sbjct: 59 SALSEANITKNDLDVIAATAGPGLIGAVMVGLCFGQAMAYALAIPFVPVNHIEAHIFSAF 118 Query: 121 LEDNPP----EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 +++ P E F++L VSGGHT L V YE++G ++DDAAGEAFDKT K+LGL Sbjct: 119 IQETPHHQAPEGDFISLTVSGGHTLLSHVHKDFTYEVIGRTLDDAAGEAFDKTGKMLGLP 178 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMT--------DRPGLDFSFSGLKTFAANTIRDNGTD 228 YP GP++ ++A G FPR +T R DFSFSGLKT ++ + Sbjct: 179 YPAGPVIDRLAKNGDPFFHEFPRALTAHSQTSKNYRGNSDFSFSGLKTSVLTFLKKQSPE 238 Query: 229 --DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 ++ DIA + + A+V L+ K A K + +AGGVSAN LR + + ++ Sbjct: 239 FIEKHLPDIAASVQKAIVSVLVEKTVSAALAGNVKAISIAGGVSANSALRTSMKKACEQH 298 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKA 315 E+ TDN AMIA + Sbjct: 299 GIAFHVPNAEYSTDNAAMIATLAGLLLAH 327 >UniRef50_A4RXP4 Predicted protein n=6 Tax=Eukaryota RepID=A4RXP4_OSTLU Length = 492 Score = 352 bits (904), Expect = 9e-96, Method: Composition-based stats. Identities = 144/367 (39%), Positives = 201/367 (54%), Gaps = 36/367 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+T A+ + +L + SQ +H +GGVVP LA H +++ A Sbjct: 82 VLGIETSCDDTAAAVVRGDGVVLGEAIASQAAIHGPWGGVVPNLARAAHEEVIDDVVRRA 141 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L E+G++A D+ AVA T GPGL L VG + ++ + +P PVHH+E H L L Sbjct: 142 LTEAGVSAADLSAVAVTCGPGLSMCLRVGVRKAQRMSAEYGIPIAPVHHVEAHALVSRLC 201 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK--LLGLDYP 178 +FPF+ALLVSGGH LI G+G Y +LG ++DDA GEA+DKTA+ L + Sbjct: 202 AGTETVKFPFLALLVSGGHNLLIKARGVGDYTILGTTLDDALGEAYDKTARLLGLPVGGG 261 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR----------DNGTD 228 GGP L K+A +G RF FP P+ R DFS++GLKT A I D Sbjct: 262 GGPALEKLALEGDEKRFKFPVPLRQRKNCDFSYAGLKTAARMAIDAEIGGEDVEWDGVDK 321 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRAL-----DQTGFKRLVMAGGVSANRTLRAKLAEMM 283 QTRADIA +F+ V L + +RAL D +V+AGGV+AN T+R+ L +++ Sbjct: 322 RQTRADIAASFQAKAVKHLEERMRRALTWALEDTPDLSCVVVAGGVAANATVRSTLVKVV 381 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATA-----------------DLGVSVR 326 ++ + + P++CTDNG M+A+ G R G D+ V++ Sbjct: 382 EETGLPLVFPPPKWCTDNGVMVAWTGCERLALGLAEAPVDAELEAKHAMMDPRDVHVNLL 441 Query: 327 PRWPLAE 333 PRWPL E Sbjct: 442 PRWPLGE 448 >UniRef50_Q54EW4 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54EW4_DICDI Length = 468 Score = 350 bits (899), Expect = 4e-95, Method: Composition-based stats. Identities = 126/417 (30%), Positives = 195/417 (46%), Gaps = 87/417 (20%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 V+GIETSCD+T I I + E ++A Q LH + G+VP +A H + I+ Sbjct: 19 VIGIETSCDDTSIGIVNSEGKIMAEYSKPQWSLHKVHNGIVPSIAFEAHQNEIDNAIEKT 78 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ++G+T +DID +A T GPG+ +L VG + L + P V+HMEGH L +E Sbjct: 79 LDKAGMTMEDIDVIAVTTGPGMGKSLEVGLNKAKQLYREFKKPFCSVNHMEGHSLVVRME 138 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY----- 177 ++ EFPF+ +LVSGGH+Q++ + +Y+L+G ++DD+ GEA DK A++LG Y Sbjct: 139 NHSIEFPFLIVLVSGGHSQILICNDVSKYQLIGNTLDDSIGEALDKAARILGCPYGQVWD 198 Query: 178 --------PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR------ 223 GG + +A++G F PM D DFSFSG+K+ A ++ Sbjct: 199 GQSLIENIHGGQAIEILASKGDPNSHHFTLPMKDSNNCDFSFSGIKSSLARLVKEIKSKS 258 Query: 224 ----------------------------------DNGTDDQTRADIARAFEDAVVDTLMI 249 +N + ++A +F++ + L Sbjct: 259 SSSSSITNNTTTKTTTTTTTTTIITTETNNLITDENELSFVDKCNLAASFQNVAFNHLEH 318 Query: 250 KCKRALDQT--------------------------------GFKRLVMAGGVSANRTLRA 277 + K++LD K +V++GGVS N LR Sbjct: 319 RIKKSLDWYYNFKTPKQKKNELLASKTKSGKPPAIEIIKREPLKGIVVSGGVSKNNNLRK 378 Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL--GVSVRPRWPLA 332 ++ ++ K+ +++ RPE C DNG MIA+AG+ FK G T D V P WPL Sbjct: 379 RIDDIGKRYNLPIYFPRPELCNDNGTMIAWAGVEMFKKGMTVDDPEKVIYLPVWPLD 435 >UniRef50_D1B582 Metalloendopeptidase, glycoprotease family n=5 Tax=Campylobacterales RepID=D1B582_SULD5 Length = 334 Score = 349 bits (897), Expect = 6e-95, Method: Composition-based stats. Identities = 120/324 (37%), Positives = 181/324 (55%), Gaps = 9/324 (2%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ IAI ++K LL ++ SQ + HA YGGVVPELA+R H Sbjct: 2 ILSIESSCDDSSIAITRIEDKKLLFHKKISQDEEHAKYGGVVPELAARLHAITLP----K 57 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E+ + + A+A T PGL +L+ G ++ ++L+ A +P + ++H++GH+ + + Sbjct: 58 ILEETQPYFEALKAIAVTNEPGLSVSLVEGVSMAKALSVALHLPLLGINHLKGHICSLFI 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E+ FP LLVSGGHTQL+ V + Q ELL ++DD+ GE+FDK K+LGL YP G Sbjct: 118 EEE-TRFPMDVLLVSGGHTQLLHVKSLEQIELLATTMDDSFGESFDKVGKMLGLPYPAGA 176 Query: 182 LLSKMAAQGTAGRFVFPRPM--TDRPGLDFSFSGLKTFAANTIR-DNGTDDQTRADIARA 238 ++ A +G A F F P+ T L FS+SGLK I D+Q DI + Sbjct: 177 IIETYAQKGDAKCFDFTIPLQGTSSSMLAFSYSGLKNQVRLCIEAQERMDEQILCDICAS 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+ V LM K K+A + + GG SAN LR +L ++ +++ A+ FC Sbjct: 237 FQRVAVAHLMQKIKKAYQARKVEHFGVVGGASANLYLRGELERFCASKKAQLYTAKMAFC 296 Query: 299 TDNGAMIAYAGMVRFKAGATADLG 322 +DN AMI G+ ++ G L Sbjct: 297 SDNAAMIGRCGVEAYQKGVFVSLE 320 >UniRef50_B8LEI0 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8LEI0_THAPS Length = 342 Score = 349 bits (897), Expect = 6e-95, Method: Composition-based stats. Identities = 123/338 (36%), Positives = 183/338 (54%), Gaps = 21/338 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE+SCD+TG A+ + +L L SQ +H +GGV P LA H + +I A Sbjct: 5 VLGIESSCDDTGAAVLRSDGLILGESLASQHAIHEQFGGVFPGLAKAAHEQNIQTVISTA 64 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L+ + +T +D+DAV T GPGL L VG GR LA + P + +HH+E H+L + Sbjct: 65 LQNANMTMEDVDAVGVTVGPGLEICLRVGCNWGRELAMEYGKPFVGIHHLEAHILMARIP 124 Query: 122 --EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK--LLGLDY 177 + + EFPF+ALLVSGGH Q++ GIGQY ++G ++DD+ GEAFDKTA+ L + Sbjct: 125 SEKYDTMEFPFLALLVSGGHCQILKCLGIGQYSIVGGTLDDSLGEAFDKTARLLGLPVGG 184 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD----------NGT 227 GGP + ++A G P P+ R DFS++GLKT Sbjct: 185 GGGPAIEQLAKDGDPKSVKLPIPLQKRKDCDFSYAGLKTAVRLATEKICVERGVESAEEL 244 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALD----QTGFKRLVMAGGVSANRTLRAKLAEMM 283 Q +A++A +F+ + I+ RA++ + G L + GGV+AN+ LR++L + Sbjct: 245 PHQDKANVAASFQHTAFRHVEIRLGRAMERVEKEDGISTLAVVGGVAANKELRSRLNALC 304 Query: 284 KKRR--GEVFYARPEFCTDNGAMIAYAGMVRFKAGATA 319 R ++ P CTD GAM A+A + R G++ Sbjct: 305 SDRAEPWKMMVPPPRLCTDQGAMSAWAAVERLMVGSSD 342 >UniRef50_B0D096 Predicted protein n=2 Tax=Agaricales RepID=B0D096_LACBS Length = 379 Score = 348 bits (893), Expect = 2e-94, Method: Composition-based stats. Identities = 126/355 (35%), Positives = 189/355 (53%), Gaps = 20/355 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL E+S D+T A+ K +L+N + Q LH YGG+ P A H R ++ A Sbjct: 19 VLAFESSADDTCAAVVHSSKSILSNVVIKQNNLHEQYGGIYPITAIDAHQRNMPYAVRRA 78 Query: 63 LKESGLTA-KDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 LKE+ + KDI+ +A+T GPG+ G L VG ++LA A + P + VHHM+GH L P+L Sbjct: 79 LKEANVDLVKDINGIAFTRGPGMPGCLSVGMNAAKTLAAALNKPIVGVHHMQGHALTPLL 138 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--- 177 NPP+FPF++LLVSGGHT L+ T + +++L ++D++ G A D+ +KLL L + Sbjct: 139 TSSNPPKFPFLSLLVSGGHTLLLLATSLDSFQILATTVDESIGRAIDQVSKLLDLKWTSL 198 Query: 178 -PGGPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRD----NGTDDQT 231 PG L A + V P P G L FS+SGL + I N D T Sbjct: 199 GPGDALEKFCAQKVDTDSIVIPLPRVTMAGKLSFSYSGLHSRVERYIETLGGINNIDLPT 258 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLRAKLAEMMKKR 286 R IARAF+ + + L K L + +V++GGV++N+ LR +L + + K Sbjct: 259 RMAIARAFQKSAMAQLEDKLLLGLQWCQQKDIPVRHVVLSGGVASNQYLRERLHQCILKA 318 Query: 287 R----GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 ++ + P CTDN MI +A M RF A + + RP+W + +L ++ Sbjct: 319 DLALSIDLVFPPPPLCTDNAVMIGWASMHRFLANDFDEYDIESRPKWSIDQLASS 373 >UniRef50_B3MQN2 GF20469 n=4 Tax=Drosophila RepID=B3MQN2_DROAN Length = 416 Score = 347 bits (892), Expect = 2e-94, Method: Composition-based stats. Identities = 125/349 (35%), Positives = 174/349 (49%), Gaps = 27/349 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D + AN L SQ + H YGG++P A H + + Sbjct: 34 VLGIETSCDDTGIAIVDTSGNVKANVLDSQQEFHTRYGGIIPPRAQDLHRARIHSAYERC 93 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+ L + + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 94 LEEANLQPEQLAAIAVTTRPGLPLSLLVGVRFARHLARRLKKPLLPVHHMEAHALQARME 153 Query: 123 DNP-PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD----- 176 FPF+ LLVSGGH QL V G G+ LLG+++DDA GEAFDK A+ L L Sbjct: 154 HPDAIPFPFLCLLVSGGHCQLAMVHGPGRLTLLGQTLDDAPGEAFDKIARRLRLYILPEY 213 Query: 177 --YPGGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR- 232 + GG + A + FP P++ + +FSF+G+K + IR ++T Sbjct: 214 RLWNGGRAIEHAARLATDPSAYDFPLPLSQQRNCNFSFAGIKNNSFRAIRKKERMERTPP 273 Query: 233 -------ADIARAFEDAVVDTLMIKCKRALDQT----------GFKRLVMAGGVSANRTL 275 AD AV LM + +RAL+ G LV++GGV+ N T+ Sbjct: 274 DGIISNYADFCAGLLRAVSRHLMHRTQRALEYCLQPQVRFFGDGQPTLVVSGGVANNDTI 333 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS 324 A + + + F +C+DNG MIA+ G+ + + L Sbjct: 334 FANIQHLAAQYGCRSFRPSKRYCSDNGVMIAWHGVEQLLQDGDSSLRFD 382 >UniRef50_B8PI87 Predicted protein n=2 Tax=Postia placenta Mad-698-R RepID=B8PI87_POSPM Length = 691 Score = 346 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 120/370 (32%), Positives = 183/370 (49%), Gaps = 37/370 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IE+S D+T A+ ++ +L+N + Q H YGG+ P +A H + +Q A Sbjct: 318 VLAIESSADDTCAAVVTSDRQILSNVVVRQDSFHESYGGIHPYIAIEAHQQNMPGAVQKA 377 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ +G++A D+D +A+T GPG+ G L VG+ ++LA A + P + VHHM+ H L P L Sbjct: 378 LQVAGMSATDVDGIAFTRGPGIGGCLSVGSNAAKTLAAALNKPLVGVHHMQAHALTPFLT 437 Query: 123 DNPPE---FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP- 178 +PF+ LLVSGGHT L+ T + +L ++D++ G AFDK +++L L + Sbjct: 438 TPANSLPTYPFLTLLVSGGHTLLLLATSPRAFRVLATTLDESIGRAFDKVSRMLALPWSA 497 Query: 179 --GGPLLSKMAAQ-----------------GTAGRFVFPRPMTDRPGLDFSFSGLKTFAA 219 G L + A P PM R L FS++GL + Sbjct: 498 HGPGAALEQFCRDGPAGGTGAPGGEEIGSGEPAEAPHIPLPM--RGRLAFSYTGLHSSVE 555 Query: 220 NTIRDNG--TDDQTRADIARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSAN 272 + G D +T+ IA F+ V L K L + +V++GGV++N Sbjct: 556 RFLHARGGVVDARTKHAIATTFQKNAVGQLEEKLALGLQLCRRKGIQIRHVVVSGGVASN 615 Query: 273 RTLRAKLAEMMKK----RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 LR +L + + + + P CTDN MIA+A M RF AG T D V +R + Sbjct: 616 SYLRERLRICLDEASPDEHIALIFPPPSLCTDNAVMIAWASMHRFLAGDTDDYTVELRRK 675 Query: 329 WPLAEL-PAA 337 W + EL P+A Sbjct: 676 WSIEELDPSA 685 >UniRef50_B3RQR7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RQR7_TRIAD Length = 405 Score = 346 bits (887), Expect = 8e-94, Method: Composition-based stats. Identities = 124/355 (34%), Positives = 180/355 (50%), Gaps = 26/355 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 V+GIETSCD+TG+AI DD+ LL + L SQ +H G G+ P A++ H R ++Q+ Sbjct: 38 VMGIETSCDDTGVAIVDDQGRLLGDALQSQSSIHKPLGWGIHPVTAAQLHERNIHAVVQS 97 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL +S L +DI +A T GPGL +L VG + L + I VHHM H L + Sbjct: 98 ALHKSNLKIEDIHTIATTVGPGLAFSLNVGLDYSKKLLQQHNKRFIAVHHMAAHALTVRM 157 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 NP EFP++ LLVSGGH L V G ++ LG ++DDA GE FDK A+ L L Sbjct: 158 L-NPIEFPYLVLLVSGGHCILAVVNGPCEFYRLGSTLDDAPGEVFDKVARTLELHTHPEV 216 Query: 176 -DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-----D 229 D GG + +A G F P M +FSF+G K+ ++ D Sbjct: 217 GDIAGGRAIEIVAKLGDEKAFKLPHIMAGVRNCNFSFAGFKSAVNAHLKRVSFASLSDWD 276 Query: 230 QTR----ADIARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRTLRAKLA 280 Q + A++A +F+ + + + +RAL + LV++GGV+ N +R +L Sbjct: 277 QQKMTIAANMAASFQYYLTWHIAKRVRRALVFCKTFNPKCRTLVISGGVACNNYIRNELD 336 Query: 281 EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATA---DLGVSVRPRWPLA 332 + ++ P CTDNG MIA+AG+ K+ V +P+WPL Sbjct: 337 KCATAFGFQLACPPPYLCTDNGIMIAWAGVEHLKSNTATILNPQSVIYQPKWPLG 391 >UniRef50_Q1IUF1 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Acidobacteria RepID=GCP_ACIBL Length = 381 Score = 346 bits (887), Expect = 9e-94, Method: Composition-based stats. Identities = 138/375 (36%), Positives = 193/375 (51%), Gaps = 46/375 (12%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDET A+ + +L++ ++SQ+ H YGGVVPELASR+H++ VP+++ A Sbjct: 6 ILGIESSCDETAAAVIRNGAEILSSVVFSQIYTHMRYGGVVPELASREHLKAIVPVVRQA 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++++G + IDA+A T GPGL GALLVG + ++L+FA D P I V+H+EGH+ +LE Sbjct: 66 VEDAGQSYDKIDAIAVTRGPGLAGALLVGVSYAKALSFALDKPLIGVNHLEGHIHVVLLE 125 Query: 123 DN-----PPEFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKTAKLLG 174 +FP +AL+VSGGHT L Y +G + DDAAGEA+DK AKLLG Sbjct: 126 QKQQGVGEIQFPVLALVVSGGHTHLYLAEKKDAGWTYRDVGHTRDDAAGEAYDKVAKLLG 185 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPG-------------LDFSFSGLKTFAANT 221 L YPGGP+L +A G FP +DFS+SG+KT Sbjct: 186 LGYPGGPILDGLAKHGDPRAVRFPFAQIKHRDRNPQNRHEDDDARVDFSYSGIKTAVLRY 245 Query: 222 -------------------IRDNGTDDQTRA------DIARAFEDAVVDTLMIKCKRALD 256 I DD R D+ +F+ AVV+ L+ K A Sbjct: 246 VETHEMKAAIEARRTALKEIEKPSQDDYLRVCDRQTLDLIASFQRAVVNDLVSKALHAAA 305 Query: 257 QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 + L++ GGV+AN LR + V++ TDN AMIA A RF +G Sbjct: 306 ENNAATLLVTGGVAANSELRETFERRAGELGLPVYFPSRPLSTDNAAMIAAAAYPRFLSG 365 Query: 317 ATADLGVSVRPRWPL 331 A +S L Sbjct: 366 EFAAPDLSAEANLRL 380 >UniRef50_Q04RH4 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Leptospira RepID=GCP_LEPBJ Length = 338 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 129/334 (38%), Positives = 193/334 (57%), Gaps = 8/334 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +GIETSCDET I I D K LL+ ++SQ+ LH YGG+VPE+ASR H+ K L++ Sbjct: 1 MIGMGIETSCDETSIGIIRDGKELLSLGIFSQIDLHKPYGGIVPEIASRAHLEKINLLLE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ++E+ + +D+ VA T+ PGL G+L+VGA + R + ++ P +PV H++ H Sbjct: 61 ETMEEAKIRFEDLSYVAVTSSPGLTGSLMVGAQMARCINMVYETPILPVCHLQSHFAVLH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE P EFP + LL+SGG++ + + G+ EL+G+++DDA GEAFDK A LL L YPGG Sbjct: 121 LEGVPTEFPVLGLLLSGGNSAVYILQEFGRMELVGDTMDDALGEAFDKVAGLLDLPYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPR-PMTDRP----GLDFSFSGLKTFAANTIRDNGTDDQTRADI 235 P + A + P P+ R + FSFSGLKT + + ++ I Sbjct: 181 PHIEAKANEYIPTPDEKPILPLLLRNLPQGEVSFSFSGLKTAVMVLLEKQK--EVSKEQI 238 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 F+++ D + KRA+ +TG +++ AGGV AN TL+ +L K E+F + Sbjct: 239 CWNFQNSAFDLVERNLKRAVAKTGIRKVFAAGGVLANTTLQKRLEVWAGKNSVELFTPKK 298 Query: 296 E-FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +CTDNGAM+A G F+ G + +V P Sbjct: 299 KIYCTDNGAMVASLGYHLFRKGYKKGVDFTVNPS 332 >UniRef50_Q29HY2 GA12844 n=3 Tax=Sophophora RepID=Q29HY2_DROPS Length = 427 Score = 343 bits (881), Expect = 4e-93, Method: Composition-based stats. Identities = 121/351 (34%), Positives = 172/351 (49%), Gaps = 27/351 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D + + +N LYSQ + H YGG++P A H + Sbjct: 30 VLGIETSCDDTGIAIVDTDGRVHSNVLYSQQEFHTRYGGIIPPRAQDLHRARIEDAYNRC 89 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ L + + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 90 LVEADLRPEQLTAIAVTNRPGLPLSLLVGLRFARHLARRLQKPLLPVHHMEAHALQARME 149 Query: 123 D-NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 + + FPF+ LL+SGGH QL V G G+ LLG+++DDA GEAFDK A+ L L Sbjct: 150 NISAISFPFLCLLISGGHCQLALVRGPGRLTLLGQTLDDAPGEAFDKIARRLRLYVLPQY 209 Query: 176 -DYPGGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + GG + A + FP P+ + +FSF+G+K + IR +QT Sbjct: 210 RAWNGGQAIEHAAQSAVCPDAYDFPLPLAQQRNCNFSFAGIKNNSFRAIRARERLEQTPP 269 Query: 234 --------DIARAFEDAVVDTLMIKCKRALDQT----------GFKRLVMAGGVSANRTL 275 D AV LM + +RAL+ LV++GGV+ N + Sbjct: 270 DGIISNYSDFCAGLLQAVSRHLMHRTQRALEYCLRPENGLFGDASPTLVVSGGVANNDVI 329 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVR 326 + + + + +C+DNG MIA+ G+ + A + L Sbjct: 330 YRNIEHLAGQYNCRSYRPFKRYCSDNGVMIAWHGIEQLLANSAQHLRFDYH 380 >UniRef50_B0B9U7 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Chlamydia trachomatis RepID=GCP_CHLT2 Length = 338 Score = 340 bits (873), Expect = 4e-92, Method: Composition-based stats. Identities = 120/338 (35%), Positives = 181/338 (53%), Gaps = 16/338 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG+E+SCDET ++ + K +LAN++ SQ +HA YGGV+PELASR H++ L+ Sbjct: 1 MLTLGLESSCDETSCSLVQNGK-ILANKIASQ-DIHASYGGVIPELASRAHLQTFPELLT 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AA + +G++ +DI+ ++ PGL+GAL +G + LA P I V+H+E HL A Sbjct: 59 AATQSAGVSLEDIELISVANTPGLIGALSIGVNFAKGLASGLKRPLIGVNHVEAHLYAAC 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E +FP + L +SG HT L + + L+G++ DDA GE FDK A+ LGL YPGG Sbjct: 119 MEAPATQFPALGLAISGAHTSLFLMPDATTFLLIGKTRDDAIGETFDKVARFLGLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT---------DDQT 231 L ++A +G A F F G DFSFSGLKT ++ N + + Sbjct: 179 QKLEELAREGDADAFAFSPARVS--GYDFSFSGLKTAVLYALKGNNSSAKAPFPEVSETQ 236 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + +IA +F+ AV T+ K + + L++ GGV+ N R L ++ ++ Sbjct: 237 KRNIAASFQKAVFMTIAQKLPDIVKAFSCESLIVGGGVANNSYFRRLLNQICS---LPIY 293 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 + + C+DN AMIA G F V R+ Sbjct: 294 FPSSQLCSDNAAMIAGLGERLFCNRTHVSKEVIPCARY 331 >UniRef50_A3EUW9 O-sialoglycoprotein endopeptidase n=3 Tax=Leptospirillum RepID=A3EUW9_9BACT Length = 345 Score = 340 bits (872), Expect = 5e-92, Method: Composition-based stats. Identities = 120/320 (37%), Positives = 182/320 (56%), Gaps = 1/320 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T +A+ D +L +Q++SQ LH YGGVVPE+ASR HV L+++A Sbjct: 2 ILGIETSCDDTSVALVDMTGAILFHQIHSQESLHGTYGGVVPEVASRAHVEVLPSLVRSA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++GL+ + +A T GPGL+G+LL G + + + A+ +P I V H++ HL A + Sbjct: 62 FLDTGLSPSQLQGIAVTRGPGLLGSLLTGISFAKGIGSAFRLPLIGVDHVQAHLRACVDS 121 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + L++SGGHT L + EL+ +++DDAAGEAFDK AKLLGL YPGGP Sbjct: 122 MESLRGKTIGLVISGGHTHLFRIENWPTMELVSQTVDDAAGEAFDKGAKLLGLPYPGGPS 181 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 + K A + T + LDFSFSGLKT + +R +++TR +A + + Sbjct: 182 IQKEAEKNTLPLLPLTKKRIRTENPLDFSFSGLKTAFSLLVRKTELNERTRPLLAASLQH 241 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 A+V+ ++ + ++ + Q L++ GGVSAN LR KL +++ + + DN Sbjct: 242 AIVEHVLDRIEQTVIQESPSHLLVGGGVSANALLRKKLQVFSEQQGMTLHLSPLSLARDN 301 Query: 302 GAMIAYAGMVRFKAGATADL 321 MIA G F +G Sbjct: 302 ALMIARHGRELFLSGMYTPY 321 >UniRef50_Q9H4B0 Probable O-sialoglycoprotein endopeptidase 2 n=31 Tax=Bilateria RepID=OSGP2_HUMAN Length = 414 Score = 339 bits (871), Expect = 7e-92, Method: Composition-based stats. Identities = 121/357 (33%), Positives = 176/357 (49%), Gaps = 27/357 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIETSCD+T A+ D+ +L ++SQ ++H GG+VP A + H ++Q Sbjct: 38 IVLGIETSCDDTAAAVVDETGNVLGEAIHSQTEVHLKTGGIVPPAAQQLHRENIQRIVQE 97 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL SG++ D+ A+A T PGL +L VG + L P IP+HHME H L L Sbjct: 98 ALSASGVSPSDLSAIATTIKPGLALSLGVGLSFSLQLVGQLKKPFIPIHHMEAHALTIRL 157 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYP-- 178 N EFPF+ LL+SGGH L V G+ + LLG+S+D A G+ DK A+ L L +P Sbjct: 158 -TNKVEFPFLVLLISGGHCLLALVQGVSDFLLLGKSLDIAPGDMLDKVARRLSLIKHPEC 216 Query: 179 ----GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ---- 230 GG + +A QG F P+ DFSF+GL+ I ++ Sbjct: 217 STMSGGKAIEHLAKQGNRFHFDIKPPLHHAKNCDFSFTGLQHVTDKIIMKKEKEEGIEKG 276 Query: 231 ----TRADIARAFEDAVVDTLMIKCKRALDQTGFKR--------LVMAGGVSANRTLRAK 278 + ADIA + + L+ + RA+ + LV +GGV++N +R Sbjct: 277 QILSSAADIAATVQHTMACHLVKRTHRAILFCKQRDLLPQNNAVLVASGGVASNFYIRRA 336 Query: 279 LAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA--GATADLG-VSVRPRWPLA 332 L + + + P CTDNG MIA+ G+ R +A G D+ + P+ PL Sbjct: 337 LEILTNATQCTLLCPPPRLCTDNGIMIAWNGIERLRAGLGILHDIEGIRYEPKCPLG 393 >UniRef50_A7H0K1 Probable O-sialoglycoprotein endopeptidase n=26 Tax=Epsilonproteobacteria RepID=GCP_CAMC5 Length = 339 Score = 339 bits (870), Expect = 8e-92, Method: Composition-based stats. Identities = 126/340 (37%), Positives = 184/340 (54%), Gaps = 19/340 (5%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCD++ +A+ D LL ++ SQ H+ +GGVVPELA+R H R A Sbjct: 2 ILGIESSCDDSSVALLDIKNLKLLYHKKISQESEHSPFGGVVPELAARLHTRALP----A 57 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E KDI A+A T PGL +L+ G ++ ++L+ A +VP I V+H+ GH+ + L Sbjct: 58 LLEEIKPKFKDIKAIAVTNEPGLSVSLIGGVSMAKALSVALNVPLIAVNHLVGHIYSLFL 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + FP LLVSGGHT ++ + G+ LL + DD+ GE+FDK AK++ L YPGG Sbjct: 118 DCEA-RFPLGVLLVSGGHTMVLDIDAAGKISLLAGTSDDSFGESFDKVAKMMQLGYPGGA 176 Query: 182 LLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR-----------DNGTDD 229 + +A Q RF F P L++SFSGLK I D + Sbjct: 177 AVQNLAWQCKDKRRFKFTIPFLHDKRLEYSFSGLKNQVRLEIEKIKGQNLAGATDRELSN 236 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 ADI AFE+A + +M K + + FKR + GG SAN LR+++ + + E Sbjct: 237 DDMADICYAFENAACEHIMDKLTKIFKERSFKRFGIVGGASANLNLRSRIERLCLENGCE 296 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPR 328 + A EFC+DN AMIA AG ++ G +++ PR Sbjct: 297 LLLAPLEFCSDNAAMIARAGREKYLKGGFVKHNELNINPR 336 >UniRef50_Q17CG3 O-sialoglycoprotein endopeptidase n=2 Tax=Culicini RepID=Q17CG3_AEDAE Length = 400 Score = 337 bits (866), Expect = 3e-91, Method: Composition-based stats. Identities = 111/345 (32%), Positives = 168/345 (48%), Gaps = 26/345 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD++G AI +L + ++SQ H +GG++P +A H ++Q Sbjct: 28 ILGIETSCDDSGAAIVSGNGTVLGDCIHSQQNSHLKFGGIIPPVAQDFHRLNIDNVVQET 87 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + S + +DA+A T PGL +L+VG + LA + P IP+HHME H L + Sbjct: 88 FRRSDIDCSQLDAIAVTNRPGLPLSLIVGLRYAKYLARKYRKPIIPIHHMEAHALMARM- 146 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYP--- 178 N FPF+ +L+SGGH+ L V Q+ LLGE++DDA GEAFDK A+ L L + P Sbjct: 147 TNKVPFPFLCILISGGHSLLTLVKSTSQFYLLGETLDDAPGEAFDKIARRLKLRNLPEYA 206 Query: 179 ---GGPLLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------ 228 GG + + A ++ FP P++ FSF+GLK A I + Sbjct: 207 WLSGGRSIEQAAMSSDNPRKYDFPLPLSHYRDCQFSFAGLKNTATRHILQQERELDLDPD 266 Query: 229 --DQTRADIARAFEDAVVDTLMIKCKRALDQT---------GFKRLVMAGGVSANRTLRA 277 D+ F +A + + +RA+ K LV++GGV+ N + Sbjct: 267 AVLPDYQDLCAGFLNAAARHISQRTQRAIRFCEKEKLIGSDDAKFLVISGGVACNDAIFN 326 Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG 322 ++ M K + CTDNG MIA+ G+ +F G + Sbjct: 327 TVSNMAKGFGYTTVRPERQHCTDNGIMIAWNGVEKFLVGEDVTMD 371 >UniRef50_Q4A734 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma synoviae 53 RepID=GCP_MYCS5 Length = 307 Score = 336 bits (863), Expect = 5e-91, Method: Composition-based stats. Identities = 113/316 (35%), Positives = 183/316 (57%), Gaps = 10/316 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETS D++ IAI +D K +L SQ+ + YGG +PE+ASR+HV+ ++Q Sbjct: 1 MIILGIETSHDDSSIAILEDGK-VLNMWSISQIDIFKKYGGTIPEIASREHVKNIA-ILQ 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L+E + ID +AYT+ PGL+G L VG +L+ A + P I ++H++GH + Sbjct: 59 NFLQE-FIDLNKIDHIAYTSEPGLIGCLQVGFLFASALSIALNKPLIKINHLDGHFFSGA 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +++ ++P + L+VSGGH+Q+I ++++GE++DDA GE +DK + L L +PGG Sbjct: 118 IDNKEIKYPALGLIVSGGHSQIIYAKNKFDFQIVGETLDDAIGECYDKVSSRLNLGFPGG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ K+ A +P T DFSFSG+KT N N ++ IA +F+ Sbjct: 178 PIIDKIHASYKGKYLKLTKPKT-SGEFDFSFSGIKTQVLNAF--NNKKYESIEQIAASFQ 234 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + ++ L+ K K A+D+ + +++ GGVSAN+ LR K ++ + ++ TD Sbjct: 235 EVAINYLIEKFKLAIDKFKPESILLGGGVSANKYLREKFKDL----HKNTIFPEIKYATD 290 Query: 301 NGAMIAYAGMVRFKAG 316 NGAMIA +R K Sbjct: 291 NGAMIAMCAYLRMKKN 306 >UniRef50_A6Q6J3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Epsilonproteobacteria RepID=GCP_SULNB Length = 337 Score = 335 bits (860), Expect = 1e-90, Method: Composition-based stats. Identities = 119/323 (36%), Positives = 181/323 (56%), Gaps = 10/323 (3%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ IA+ + K +L ++ SQ H+ YGGVVPELASR H Sbjct: 2 ILSIESSCDDSSIAVTETSTKKILYHKKISQEAEHSCYGGVVPELASRLHAVALP----K 57 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E+ + AVA T PGL LL G + +++A ++P IPVHH++GH+ + + Sbjct: 58 ILEETKPWFDKLKAVAVTNQPGLGVTLLEGIAMAKTVAVLQNIPLIPVHHLKGHIYSLFI 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E FP + LL+SGGHTQ+I V E+L S+DD+ GE+FDK AK++ L YPGGP Sbjct: 118 EKKTL-FPLLVLLISGGHTQIIRVKDFEHMEILATSMDDSVGESFDKCAKMMHLGYPGGP 176 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG----TDDQTRADIAR 237 L+ +A +G RF P P+ + P + FS SGLK T+ G +Q AD++ Sbjct: 177 LIEALALKGDENRFDLPVPLRNSPLIAFSLSGLKNAVRLTVEKLGGAEKMTEQDEADLSA 236 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 +F+ AV L+ K K+ + + + GG SAN+ LR A++ ++ R + A ++ Sbjct: 237 SFQKAVKLHLLQKSKKIFAKEPIRDFAIVGGASANQYLRGAYADLCREFRKTMHVAPLQY 296 Query: 298 CTDNGAMIAYAGMVRFKAGATAD 320 C+DN AMI + ++ D Sbjct: 297 CSDNAAMIGRYAIDAYEREQFID 319 >UniRef50_Q9VWD6 Probable O-sialoglycoprotein endopeptidase 2 n=6 Tax=Diptera RepID=OSGP2_DROME Length = 409 Score = 335 bits (860), Expect = 1e-90, Method: Composition-based stats. Identities = 115/340 (33%), Positives = 165/340 (48%), Gaps = 27/340 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D ++AN L SQ + H YGG++P A H + Q Sbjct: 27 VLGIETSCDDTGIAIVDTTGRVIANVLESQQEFHTRYGGIIPPRAQDLHRARIESAYQRC 86 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ + L + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 87 MEAAQLKPDQLTAIAVTTRPGLPLSLLVGVRFARHLARRLQKPLLPVHHMEAHALQARME 146 Query: 123 DNPP-EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD----- 176 +PF+ LL SGGH QL+ G G+ LLG+++DDA GEAFDK + L L Sbjct: 147 HPEQIGYPFLCLLASGGHCQLVVANGPGRLTLLGQTLDDAPGEAFDKIGRRLRLHILPEY 206 Query: 177 --YPGGPLLSKMAA-QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + GG + A + FP P+ + +FSF+G+K + IR ++T Sbjct: 207 RLWNGGRAIEHAAQLASDPLAYEFPLPLAQQRNCNFSFAGIKNNSFRAIRARERAERTPP 266 Query: 234 --------DIARAFEDAVVDTLMIKCKRALDQTGF----------KRLVMAGGVSANRTL 275 D +V LM + +RA++ LVM+GGV+ N + Sbjct: 267 DGVISNYGDFCAGLLRSVSRHLMHRTQRAIEYCLLPHRQLFGDTPPTLVMSGGVANNDAI 326 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA 315 A + + + F +C+DNG MIA+ G+ + Sbjct: 327 YANIEHLAAQYGCRSFRPSKRYCSDNGVMIAWHGVEQLLQ 366 >UniRef50_UPI000058820F PREDICTED: hypothetical protein n=2 Tax=Strongylocentrotus purpuratus RepID=UPI000058820F Length = 400 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 120/340 (35%), Positives = 183/340 (53%), Gaps = 28/340 (8%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIET+CD+TG A+ D+ +LA +L++Q ++HA GG++P LA H + P++Q Sbjct: 45 LVLGIETTCDDTGAAVMDETGRVLAERLHTQKRIHAKNGGIIPPLAQALHRQFIDPVVQG 104 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAW-DVPAIPVHHMEGHLLAPM 120 +K++G+ KD+ AVA + PG+ +L VG + + +P IP+HHME H L Sbjct: 105 TIKDAGIEMKDLSAVALSTMPGMPLSLRVGLDYTKDMLLRHPHLPLIPIHHMEAHALTVR 164 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP-- 178 + + +FPF+ LLVSGG+ L G+G +++LG + DDA GEAFDK A+ L L + Sbjct: 165 MVER-VDFPFLVLLVSGGNCILAVARGVGDFKVLGVTWDDAPGEAFDKVARRLKLQHHPD 223 Query: 179 -----GGPLLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD--- 229 GG + KMA G PM+ +FSF+GLK A I+ + Sbjct: 224 CLGLCGGQAIEKMAENGNFRLLIERGVPMSRHRDCNFSFAGLKNMANWLIQHHEVRQGLT 283 Query: 230 -------QTRADIARAFEDAVVDTLMIKCKRALDQT--------GFKRLVMAGGVSANRT 274 T +DIA +F+ V L+I+ RA+ G + LV++GGV++N Sbjct: 284 ASDDHHLATISDIAASFQHKVTQHLVIRIARAMLYCQQTGLIPEGNQTLVVSGGVASNDY 343 Query: 275 LRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFK 314 +R L + ++ P CTDNG MIA+AG+ R + Sbjct: 344 IRKALDFTTSLFKYKLICPPPYLCTDNGVMIAWAGVERLR 383 >UniRef50_UPI000180B634 PREDICTED: similar to Probable O-sialoglycoprotein endopeptidase 2 (O-sialoglycoprotein endopeptidase-like protein 1) n=1 Tax=Ciona intestinalis RepID=UPI000180B634 Length = 386 Score = 334 bits (856), Expect = 3e-90, Method: Composition-based stats. Identities = 114/350 (32%), Positives = 167/350 (47%), Gaps = 21/350 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE++ D+TG AI D + + + +Q K H GGV P +A H +++A Sbjct: 20 VLGIESTFDDTGAAIVDCDATIHGEAIATQTKAHVKAGGVDPRIAELLHRDNLPRVVEAV 79 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+++G+ +D+DAVA PG L G + + + IPVHHME HLL + Sbjct: 80 LQQAGIRYQDLDAVATATRPGNPFCLKRGLEFTKMIVERHSLRFIPVHHMEAHLLTARM- 138 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--GLDYPG- 179 +N FPF+ LL +GGH + +G +++LGE+ID+ G FDK A+ L LD P Sbjct: 139 NNEVNFPFLGLLATGGHCIITITHDLGNHQILGEAIDEPPGAVFDKVARALQVKLDRPDT 198 Query: 180 ------GPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRDNGTDDQTR 232 G + ++A +G + P+ P LDFSFSGL+T I D Sbjct: 199 HERLWNGGDVERLACEGDRSKVKLTTPLRQSPRVLDFSFSGLQTQTLRVI-DQPEPGVKY 257 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFK------RLVMAGGVSANRTLRAKLAEMMKKR 286 ADIA +F+ + ++ + RA+ + K LV+AGGV N LR L+ + Sbjct: 258 ADIAASFQHTMTQHILSRVHRAILMSRDKLNQESPTLVVAGGVVCNSYLRNALSRLCDIT 317 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT---ADLGVSVRPRWPLAE 333 + C DNG MIA+ GM K G P+ L E Sbjct: 318 NITIVCPPLPLCVDNGVMIAWTGMEYLKRGKGISPHPYNERYEPKCRLGE 367 >UniRef50_Q0BPC9 Probable O-sialoglycoprotein endopeptidase n=14 Tax=Alphaproteobacteria RepID=GCP_GRABC Length = 370 Score = 334 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 154/345 (44%), Positives = 205/345 (59%), Gaps = 15/345 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCDET A+ D +LA + SQ HA +GGVVPE+A+R H+ ++ Sbjct: 14 VLGIETSCDETAAAVLDGSGRILAEIVLSQYDDHARFGGVVPEIAARAHLAYLPGMVTEV 73 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + ++GL +D+ A+A T+GPGL+G LLVGA +G+ LA A P I ++H+E H LA +L Sbjct: 74 MDKAGLRFQDLAAIAATSGPGLIGGLLVGAGLGKGLALAAKRPFIAINHLEAHALAALLP 133 Query: 123 --------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 + FPF+ +L+SGGH Q I V G+G+Y LG +IDDA GEAFDK KLLG Sbjct: 134 ALGGVAEITSGEHFPFLLMLLSGGHCQCILVEGVGRYRRLGGTIDDAVGEAFDKVGKLLG 193 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR---DNGTDDQT 231 L +PGGP L ++A QG FPRPM R G DFSFSGLKT A + D Sbjct: 194 LGWPGGPALERLALQGNPHALAFPRPMKGRVGCDFSFSGLKTAVAQYVARFPDGPLPLSD 253 Query: 232 RADIARAFEDAVVDTLMIKCKRAL----DQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 ADIA +F+ AV D + + AL + K LV++GGV+AN +RA L+ + R Sbjct: 254 AADIAASFQAAVADVMADRATAALAMADEIAPAKMLVVSGGVAANAAIRAALSTAAEHRG 313 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + P CTDN M+A+AG+ R K GA + L + PRWPL Sbjct: 314 IAMLAPPPRLCTDNAVMVAWAGLHRLKYGAVSGLDHAPLPRWPLD 358 >UniRef50_Q6L243 Putative O-sialoglycoprotein endopeptidase n=3 Tax=Thermoplasmatales RepID=GCP_PICTO Length = 529 Score = 333 bits (855), Expect = 5e-90, Method: Composition-based stats. Identities = 106/337 (31%), Positives = 179/337 (53%), Gaps = 16/337 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLG+E + I D EK +L+N + V H GG+ P A+ H K +I+ Sbjct: 1 MIVLGLEGTAHTISAGIVD-EKSILSNVSSTYVPEH---GGIHPREAAVHHADKIYDVIK 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + +GL +D+D +A++ GPGL L V +T R+L+ + P + V+H GH+ Sbjct: 57 RSFDNAGLKPEDLDLIAFSMGPGLGPCLRVVSTAARALSIKYSKPLLGVNHPLGHVEIGR 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + L +SGG+TQ+I+ G+Y +LGE++D G DK A+ LG+ +PGG Sbjct: 117 KLSGARDP--IMLYISGGNTQVIAHLN-GRYRVLGETMDIGLGNMLDKFARDLGIPFPGG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ +MA G + P + + G+D SFSG+ T A + + + + DI + + Sbjct: 174 PVIERMALDGKD---LLELPYSVK-GMDTSFSGIYTAAKRYL----SLGKNKNDICYSLQ 225 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + ++ +RA+ T +++AGGV+ N LR+ + +M + + + E+C D Sbjct: 226 ETSFSMVVEVLERAMYYTNKNEILLAGGVARNDRLRSMVNDMARDSGYKAYLTDKEYCMD 285 Query: 301 NGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAELPA 336 NGAMIA AGM+ + GA D + + R+ + E+PA Sbjct: 286 NGAMIAQAGMLMYMHGARQDIMETRINQRFRIDEVPA 322 >UniRef50_A5UMH5 Putative O-sialoglycoprotein endopeptidase n=5 Tax=Methanobacteriaceae RepID=GCP_METS3 Length = 538 Score = 330 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 106/339 (31%), Positives = 173/339 (51%), Gaps = 20/339 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + LGIE + ++TG+ I D + +LA + +L + GG+ P +A+ H LI Sbjct: 4 LICLGIEGTAEKTGVGIVDSDGNILA---MAGEQLFPEKGGIHPRIAAEHHGYWIPKLIP 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ E+G++ D+D ++++ GPGL AL + AT R+LA + + P I V+H GH+ Sbjct: 61 KAIDEAGISYDDLDLISFSQGPGLGPALRIVATSARTLALSLNKPIIGVNHCIGHVEVGK 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+ V L VSGG++Q+IS G+Y + GE++D AAG D + GL +PGG Sbjct: 121 LDTGAVNP--VTLYVSGGNSQVISHES-GRYRIFGETLDIAAGNCLDHFGRETGLGHPGG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ K+A +G P + G+DFSFSGL + A ++ D+ + + Sbjct: 178 PVIEKLAKKG--SYVDLPYVV---KGMDFSFSGLLSAALREVKKGT----PIEDVCFSLQ 228 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL T +++ GGVSAN LR L M ++ + + C D Sbjct: 229 ETAFSMLVEVTERALSHTQKDEVMLCGGVSANSRLREMLKVMAEEHGAKFCMPEMKLCGD 288 Query: 301 NGAMIAYAGMVRFKAGATADLGVS---VRPRWPLAELPA 336 NG MIA+ G++ L + + R+ E+ A Sbjct: 289 NGVMIAWLGLIM--HNQFGPLDIKDTGIIQRFRTDEVEA 325 >UniRef50_B1ZYF9 Metalloendopeptidase, glycoprotease family n=3 Tax=Verrucomicrobia RepID=B1ZYF9_OPITP Length = 349 Score = 330 bits (846), Expect = 5e-89, Method: Composition-based stats. Identities = 139/346 (40%), Positives = 203/346 (58%), Gaps = 17/346 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +E+SCDET +A++D +GL+ ++SQ+ LH +GGVVP+LA+R+H+R PL++ A Sbjct: 2 ILALESSCDETAVAVFDPARGLVGEWVHSQIALHERHGGVVPDLATREHLRHFAPLLERA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 ++ + I VA T GPGL L +G ++LA W VP + V+H+ GH+ +P + Sbjct: 62 --QAAVPFDAITQVAVTNGPGLAACLAIGVAAAKALALQWRVPLVGVNHLRGHVWSPFIR 119 Query: 122 --EDNPPEF--------PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 D P EF P + L+VSGG+T L +V Q +L + DDAAGEA DK AK Sbjct: 120 LHADAPAEFGDRLAALLPHLGLIVSGGNTLLFAVDRARQVTVLSTTRDDAAGEALDKGAK 179 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-- 229 LLGL YPGGPL+ K+AA G A + FPR + R LDFSFSGLKT I ++ Sbjct: 180 LLGLSYPGGPLIEKLAATGRADAYDFPRGIGRRDELDFSFSGLKTSLRYLIEKLSPEEVV 239 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQ--TGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 R+D+ +++ AVVD L+ K + AL Q ++ L ++GGV+ NRTLRA L ++ + Sbjct: 240 ARRSDLCASYQQAVVDALVRKTRAALRQGEGDYRSLGLSGGVANNRTLRAALEREAQRSQ 299 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 F A+P+ DN MIA+A A + ++V P + E Sbjct: 300 IPFFAAQPQHTGDNAGMIAFAAWADSAGTDAAGMKLTVEPSATIGE 345 >UniRef50_C2KP25 O-sialoglycoprotein endopeptidase n=3 Tax=Mobiluncus RepID=C2KP25_9ACTO Length = 375 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 133/363 (36%), Positives = 192/363 (52%), Gaps = 30/363 (8%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 LGIE++CDETG A+ + L+AN + + + +A YGG++PE+ASR H+ +P++ + Sbjct: 11 LTLGIESTCDETGAALVAGKTKLIANVVATSMDQYARYGGIIPEIASRAHLESFLPVVTS 70 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G+ +DID + + GPGL+G+L VG ++LA A P V+H+ GHL L Sbjct: 71 ALEQAGVKLEDIDRIGVSGGPGLIGSLAVGIAGAKALALALGKPLYGVNHVIGHLAVDQL 130 Query: 122 EDNPP-EFPFVALLVSGGHTQLISVTG---IGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 + P V L+VSGGHT L+ + G LG ++DDA+GEAFDK ++LGL Y Sbjct: 131 ASEEMLKLPAVGLVVSGGHTNLLYIEDFAAPGGIRELGGTLDDASGEAFDKVGRILGLPY 190 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMT-----DRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 PGGP + +M+ QGT G FPR ++ DFSFSGLKT A I + R Sbjct: 191 PGGPNVDRMSQQGTLGAIDFPRGLSGAKYAKSHPYDFSFSGLKTAVARYIASLEASPEAR 250 Query: 233 ---------------------ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSA 271 ADI + F +++ D+L+ K +AL TG K LV+ GG SA Sbjct: 251 SHPEFTEDYQATREGKPWLPVADICKGFSESINDSLVSKTLKALQDTGAKTLVVGGGYSA 310 Query: 272 NRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 N LR+ LAE + + +FCTDNGA IA + S L Sbjct: 311 NSRLRSWLAEACPEIGVTLRIPPLKFCTDNGAQIAAITAEIADRHEPSRPDFSPVSALDL 370 Query: 332 AEL 334 + Sbjct: 371 TRV 373 >UniRef50_UPI0001C42124 glycoprotease M22 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C42124 Length = 565 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 107/337 (31%), Positives = 174/337 (51%), Gaps = 16/337 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIE + ++TGI I D + +LA + +L+ + GG+ P A+ H + LI Sbjct: 1 MISLGIEGTAEKTGIGIVDSDGNVLA---MAGKQLYPEVGGIHPREAAEHHAKWIPQLIP 57 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A++E+GL KDID ++++ GPGL AL + A+ RSLA + +P + V+H GH+ Sbjct: 58 QAMEEAGLDYKDIDLISFSQGPGLGPALRIVASSARSLALSLGIPIVGVNHCIGHVEIGK 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+ V L VSGG++Q+I+ G+Y + GE++D A G D + GL +PGG Sbjct: 118 LDTGAKNP--VTLYVSGGNSQVIAYES-GRYRIFGETLDIAIGNCLDHFGRETGLGHPGG 174 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ K+A G P + G+DFSFSGL + A + + DI + + Sbjct: 175 PVVEKLAKDG--SYIDLPYVV---KGMDFSFSGLLSSALRAHE----NGERIEDICFSLQ 225 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL T +++ GGVSAN LR + M ++ + + ++ D Sbjct: 226 ETAFAMLVEVTERALAHTEKDEVLLCGGVSANSRLRDMMKIMAEEHYAKFYMPEMKYSGD 285 Query: 301 NGAMIAYAGMVRFKA-GATADLGVSVRPRWPLAELPA 336 NG MIA+ G + + G ++ R+ E+ A Sbjct: 286 NGVMIAWLGQLMYDNFGPLDIKDTAIIQRFRTDEVDA 322 >UniRef50_B6JWU0 Glycoprotease pgp1 n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JWU0_SCHJY Length = 412 Score = 328 bits (841), Expect = 2e-88, Method: Composition-based stats. Identities = 117/359 (32%), Positives = 177/359 (49%), Gaps = 26/359 (7%) Query: 1 MRVLGIETSCDETGIAIY------DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRK 54 + VLGIETSCD+ +A+ ++ +L + + L+ YGG+ P + +H R+ Sbjct: 33 INVLGIETSCDDCSVAVCQYDQSRNEPSKVLLQKTRRTIHLYEKYGGIHPNIVMHEHQRQ 92 Query: 55 TVPLIQAALKES-GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 PLIQ+ L E+ L A ID V+ T GPG++G L VG + LA VP I VHHM Sbjct: 93 LAPLIQSVLTEAEKLDASIIDIVSVTRGPGMLGPLAVGLNTAKGLAVGLKVPLIGVHHML 152 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 GHLLAP LE N +FPF++LLVSGGHT L+ + +E+L ++D A G+ DK A+LL Sbjct: 153 GHLLAPKLERN-IDFPFLSLLVSGGHTMLVYSKSLFDHEILATTLDIAVGDYLDKCARLL 211 Query: 174 GLDYPG---GPLLSKMAAQGTAGRFVFPRPMTDRPGLD---FSFSGLKTFAANTIRDNGT 227 + + G L + + F P++ FSF+GL+T + G Sbjct: 212 RIPWNGEMPAAALERYSVVSDVTEFPLHVPLSKNAKTRLHCFSFAGLQTQVEKVLTCLGG 271 Query: 228 D---DQTRADIARAFEDAVVDTLMIKCKRALD---QTGFKRLVMAGGVSANRTLRAKLAE 281 + + + IA A + D + K + ++ V +GGV+ NR LR L Sbjct: 272 ETAPENVKRRIAYAVQSIAFDHICRKVRLCMNDLVDKPISAFVCSGGVARNRYLRNMLVV 331 Query: 282 MMKK------RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 M+ + + C+DN +MIA A + +K G T+ L + +W L L Sbjct: 332 MLSNFETDTSHSIPLVCPSADLCSDNASMIANAAIEMYKHGITSPLTIEPTSKWSLDAL 390 >UniRef50_C7M316 Metalloendopeptidase, glycoprotease family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M316_ACIFD Length = 347 Score = 327 bits (839), Expect = 3e-88, Method: Composition-based stats. Identities = 129/296 (43%), Positives = 180/296 (60%), Gaps = 5/296 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IETSCD+T +A+ + AN + SQ LHA +GGVVPE+A+R H V +++ A Sbjct: 18 VLAIETSCDDTAVAVV-AGGRVAANVVRSQAALHAPFGGVVPEVAARAHDAAMVEVVEEA 76 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ESG+ A +++A+A T GPGL G+L+VG LA D P I V HMEGHL A +E Sbjct: 77 LAESGIDAHEVEAIAVTKGPGLPGSLVVGVGAALGLAVGLDRPLIGVDHMEGHLYAATIE 136 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P P ++LLVSGGH++L+ + +Y LLG + DDAAGEAFDK A++LGL +PGGP Sbjct: 137 -GPVALPALSLLVSGGHSELVVIEAPFRYRLLGRTRDDAAGEAFDKVARILGLGFPGGPA 195 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 + A G F R + ++ G D SFSG+KT A + G AD+A +F++A Sbjct: 196 IEAAARDGRPDAIRFARALRNQ-GFDLSFSGIKTEVARYLE--GARAAEVADVAASFQEA 252 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 VVD L+ K +RAL+ + +V+ GGV+AN LR ++AE+ + R C Sbjct: 253 VVDVLVAKLERALESERVETVVIGGGVAANGPLRERVAELARARGVGAHIPARSLC 308 >UniRef50_UPI0000F51796 O-sialoglycoprotein endopeptidase/protein kinase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0000F51796 Length = 531 Score = 326 bits (836), Expect = 7e-88, Method: Composition-based stats. Identities = 101/335 (30%), Positives = 175/335 (52%), Gaps = 17/335 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLG+E + I DD + +++N + + GG+ P A+ H +P+++ Sbjct: 1 MKVLGLEGTAHTISAGIVDDNR-IISNFSSTYI---PKNGGIHPREAAIHHADNILPVMK 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +ESGL+ I+ VA++ GPGL L V AT R+ + + +P I V+H GH+ Sbjct: 57 KAFEESGLSPGQINLVAFSMGPGLGPCLRVVATAARAFSIKYGIPLIGVNHPLGHVEIGR 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + L +SGG+TQ+I+ Y++LGE++D G DK A+ +G+ +PGG Sbjct: 117 KLSGAKDP--IMLYISGGNTQIIA-HEENSYKVLGETMDIGLGNLLDKLARDVGIPFPGG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P + + A +G + P + + G+D SFSG+ T A N I ++ +I + + Sbjct: 174 PKIEEFALKGDK---LLDLPYSVK-GMDTSFSGIYTAARNYI-----GRESIENICYSVQ 224 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL T + +++AGGV+ N LR+ ++ M K + ++C D Sbjct: 225 ETTFSMLVEVLERALYYTDKREILLAGGVARNDRLRSMVSHMAKSSGYVAYLTDKKYCMD 284 Query: 301 NGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAEL 334 NGAMIA AGM+ + +G + V + + E+ Sbjct: 285 NGAMIAQAGMLMYLSGQRQHIMDTKVNQSFRIDEV 319 >UniRef50_B3PND6 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Mycoplasma RepID=GCP_MYCA5 Length = 311 Score = 325 bits (835), Expect = 1e-87, Method: Composition-based stats. Identities = 109/318 (34%), Positives = 167/318 (52%), Gaps = 10/318 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M + IE+S D+T A+ DD K + + +Q ++H YGG VPE+ASR HV+ LI+ Sbjct: 1 MIIFAIESSHDDTSFALLDDNKPI-WMKTITQTEIHKQYGGTVPEIASRLHVKNIGILIE 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +S + ID +AYT PGLVG+L VG V +SLA + + ++H+EGH + Sbjct: 60 DI--KSQININKIDLIAYTKEPGLVGSLHVGYVVAQSLALILNKKIVGLNHLEGHFYSAF 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + +P + LLVSGGH+QL+ ++++G++ DDA GE +DK A+ L L +PGG Sbjct: 118 IG-KEVIYPALGLLVSGGHSQLVLYNSKDDFKIIGQTQDDAVGEVYDKVARKLNLGFPGG 176 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD--NGTDDQTRADIARA 238 PL+ ++ DFSFSG+KT N I + + + IA Sbjct: 177 PLIDQIWKNNHKLYTAHLTIPKTEGFFDFSFSGIKTNVINLINNCASRNEQINVNQIATE 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++ +V+ L + A+ + K +V+AGGVSAN +R + VF E+ Sbjct: 237 FQNTIVEYLKEHMETAIKKFSPKCIVLAGGVSANFAIREMFYSL----HKNVFLPDLEYT 292 Query: 299 TDNGAMIAYAGMVRFKAG 316 TDN MIA +F+ Sbjct: 293 TDNAMMIARLAYEKFRYN 310 >UniRef50_D2LQ34 Metalloendopeptidase, glycoprotease family n=1 Tax=Aciduliprofundum boonei T469 RepID=D2LQ34_9EURY Length = 530 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 104/338 (30%), Positives = 174/338 (51%), Gaps = 19/338 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIE + G+ I EK +LAN + GG+ P A+ HV+ L+ Sbjct: 1 MLVLGIEGTAHTVGVGIV-TEKEVLANVSH---MYRPPEGGIHPREAANHHVQYLPKLLN 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A + + + +++D ++++ GPGL L AT R L+ ++P + V+H HL Sbjct: 57 EAFRIANVKPEELDGISFSQGPGLGPCLRTVATAARVLSVKLNIPIVGVNHCIAHLEIGR 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + V L VSGG+TQ+IS G+Y + GE++D G DK A+ +G+ +PGG Sbjct: 117 FSTGAEDP--VMLYVSGGNTQIISFAS-GRYRVFGETLDIGVGNMLDKLAREMGIPFPGG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P + K+A +G P P + + G+D +FSG+ T A N + + +++ DIA + + Sbjct: 174 PRIEKLALEGKKY---IPLPYSIK-GMDMAFSGILTAAINKLNN-----ESKEDIAYSVQ 224 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + V L+ +RAL +++AGGV+ N+ L+ L M ++R + + C D Sbjct: 225 ETVFAMLVEATERALTHLRKDEVLLAGGVARNKRLQEMLEIMAEERGARFYVPPADLCVD 284 Query: 301 NGAMIAYAGMVRFKAGATADL-GVSVRPRWPLA--ELP 335 NGAMIAY G++ K G ++ V ++ ++P Sbjct: 285 NGAMIAYLGLLFLKNGKRMEIGDTQVIQKFRTDAVDIP 322 >UniRef50_O94710 Glycoprotease pgp1, mitochondrial n=1 Tax=Schizosaccharomyces pombe RepID=PGP1_SCHPO Length = 412 Score = 325 bits (833), Expect = 1e-87, Method: Composition-based stats. Identities = 104/358 (29%), Positives = 172/358 (48%), Gaps = 28/358 (7%) Query: 4 LGIETSCDETGIAIYDD-------EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 L IETSCD+T +++ + ++ + + + YGG+ P + +H + Sbjct: 42 LAIETSCDDTSVSVVRTSDSSSHCQNEIICLNTHRTISKYEAYGGIHPTIVIHEHQKNLA 101 Query: 57 PLIQAALKESGLT-AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 +IQ + ++ + D D +A T GPG++G L VG + LA P + VHHM+ H Sbjct: 102 KVIQRTISDAARSGITDFDLIAVTRGPGMIGPLAVGLNTAKGLAVGLQKPLLAVHHMQAH 161 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 LA LE +FP++ +LVSGGHT L+ + +E++ + D A G+ DK AK LG+ Sbjct: 162 ALAVQLE-KSIDFPYLNILVSGGHTMLVYSNSLLNHEIIVTTSDIAVGDYLDKCAKYLGI 220 Query: 176 DY---PGGPLLSKMAA---QGTAGRFVFPRPMTDRPGL---DFSFSGLKTFAANTIRDNG 226 + L + A+ T+ P P+ R + FSFSGL+++A IR Sbjct: 221 PWDNEMPAAALEQFASPEINSTSYSLKPPIPLNTREKVHSASFSFSGLESYACRIIRKTP 280 Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQ---TGFKRLVMAGGVSANRTLRAKLAEMM 283 + + A + A + K AL + + K LV +GGV+ N L+ L + + Sbjct: 281 LNLSEKKFFAYQLQYAAFQHICQKTLLALKRLDLSKVKYLVCSGGVARNELLKKMLNDTL 340 Query: 284 -------KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 + ++ Y P+ C+DN AMI Y + FKAG T+ V +WP+ ++ Sbjct: 341 MVLQFEHQPTDIKLVYPSPDICSDNAAMIGYTAIQMFKAGYTSSFDVEPIRKWPINQI 398 >UniRef50_D2L1E2 Metalloendopeptidase, glycoprotease family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L1E2_9DELT Length = 371 Score = 325 bits (833), Expect = 2e-87, Method: Composition-based stats. Identities = 151/347 (43%), Positives = 197/347 (56%), Gaps = 21/347 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A+ DD + LL +L SQ+KLHA +GGVVPELASR+H+R+ PL+ Sbjct: 1 MLCLGIETSCDETAVALCDDGRPLL-EKLASQIKLHALFGGVVPELASREHLRRMGPLLD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +E+GL D+DAVA GPGL+G+LL+G V + LA A P I V H+ HLLA Sbjct: 60 ALFREAGLGLADVDAVAVARGPGLLGSLLIGLAVAKGLALAAGKPLIGVDHLHAHLLAAT 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L +P + LLVSGGHTQ++ + +LG ++DDAAGEAFDK AK L L YPGG Sbjct: 120 LG-REVAYPALGLLVSGGHTQIVLLRSPLDLTVLGRTVDDAAGEAFDKAAKSLNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------------ 228 + ++ A R +FPRP + LDFSFSGLKT A + + Sbjct: 179 VFVDRLGAGIEPDRALFPRPNLENTHLDFSFSGLKTAVATHVARHPGLRLAVMPAPDGPV 238 Query: 229 -----DQTRADIARAFEDAVVDTLMIKCKRALDQTGFK--RLVMAGGVSANRTLRAKLAE 281 + + AV DTL +K +RALD ++ AGGV+AN +RA L Sbjct: 239 DAAAWPLDLRRVCSSLNFAVADTLRVKMERALDGLDVPAVSILAAGGVAANSRIRAMLEA 298 Query: 282 MMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +R F P C DN MIA AG + +AG T DL + PR Sbjct: 299 LGARRGLPCFLPEPALCADNATMIAAAGCLLGRAGLTHDLALDAVPR 345 >UniRef50_UPI000186D055 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D055 Length = 419 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 102/338 (30%), Positives = 166/338 (49%), Gaps = 25/338 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE+SCD+TG ++ +D +L SQ +H + GG++P +AS H ++ +A Sbjct: 27 VLGIESSCDDTGASVVNDSGKVLGESHCSQSVIHVEAGGILPHVASALHKNNLKHVVNSA 86 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + +S L +++D +A T PGL+ +L G ++L ++ P IP+HHME H L + Sbjct: 87 MLQSKLKFENLDVIAVTVKPGLILSLTEGVNYAKNLCTLYNKPLIPIHHMEAHALTVRII 146 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------- 175 D +FPF+ L+SGGH L + ++ LG+S D++ G+ FDK A+ L Sbjct: 147 DE-VKFPFLVFLLSGGHCILALANSVRKFYKLGDSNDNSPGQVFDKIARRAKLINLNELK 205 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPR-PMTDRPGLDFSFSGLKTFAANTIRDNGTDD----- 229 GG + K A G F + + + +FSFSG T A N I+ + Sbjct: 206 GLVGGAAIEKAAKTGNPTAIPFSQTTLKSQKNCNFSFSGYITSAYNYIQSQEINLNLSPD 265 Query: 230 ---QTRADIARAFEDAVVDTLMIKCKRALD--------QTGFKRLVMAGGVSANRTLRAK 278 D +F+ ++ L + + A+ KRLV++GGV++N ++ Sbjct: 266 AVIPDINDFCASFQWSLTTHLCQRLEMAIKYVEERKLLNEDEKRLVVSGGVASNSLIKNA 325 Query: 279 LAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 L + ++F P CTDNG MIA+ G+ K Sbjct: 326 LKFVCNHYNYKIFIPPPRLCTDNGVMIAWNGVELLKEN 363 >UniRef50_A9WHP1 Metalloendopeptidase, glycoprotease family n=4 Tax=Chloroflexi (class) RepID=A9WHP1_CHLAA Length = 355 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 146/349 (41%), Positives = 195/349 (55%), Gaps = 23/349 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +ETSCDET A+ + +L+N + SQ+ H YGGVVPE+ASR H+ P+++AA Sbjct: 10 ILALETSCDETAAAVVRGGRTVLSNVVASQMATHERYGGVVPEIASRQHILSLAPVVRAA 69 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L D+ AVA T GPGL GALL G +++A+ +P + V+H+E HL A L Sbjct: 70 LAVLPNGWADVHAVAATHGPGLSGALLTGLNAAKAMAWRRGLPFVAVNHLEAHLYAGWLG 129 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 D PP FP VALLVSGGHT L+ + G Y+LLG++ DDAAGEAFDK A++LGL YPGGP Sbjct: 130 SDPPPPFPLVALLVSGGHTLLVLLRDHGNYQLLGQTRDDAAGEAFDKVARILGLGYPGGP 189 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD----------------- 224 + AA T G PR R DFSFSGLKT + ++D Sbjct: 190 AIQAAAANATPGGV-LPRAWL-RDSYDFSFSGLKTAVLHRVQDRLAQQSRLSGRKGAGET 247 Query: 225 NGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 D A +A AF+++VVD L+ K A + + +++AGGV+ANR LR +L Sbjct: 248 PQLDAPFVAQMAYAFQESVVDVLVTKTVDAARRYQAQAILLAGGVAANRRLREELIRRAS 307 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 V + CTDN AM+A A RF +G V V PL + Sbjct: 308 ---VPVHLPAFDLCTDNAAMVAAAAFYRFHSGVQYGWDVDVTANLPLEQ 353 >UniRef50_C1F9R2 Metalloendopeptidase, glycoprotease family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F9R2_ACIC5 Length = 401 Score = 323 bits (829), Expect = 4e-87, Method: Composition-based stats. Identities = 136/387 (35%), Positives = 192/387 (49%), Gaps = 56/387 (14%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCDET A+ + L+N + SQ+ +HA +GGVVPELASR+H+R VP+++ Sbjct: 14 LILGIESSCDETSAAVVRGGREALSNVIASQIAVHAPFGGVVPELASREHLRAIVPVVEQ 73 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ +G+ D+DAVA T GPGL GALLVG + ++LA A P I V+H+EGH+ A +L Sbjct: 74 AMAGAGVAFDDLDAVAVTEGPGLPGALLVGVSYAKALALALGKPLIAVNHLEGHIHAVLL 133 Query: 122 E----------DNPPEFPFVALLVSGGHTQLISVTGIGQ---YELLGESIDDAAGEAFDK 168 E P +AL+VSGGHT L Y +G ++DDAAGEAFDK Sbjct: 134 ERVLQPAETQATPEHGQPKLALVVSGGHTHLYLAQETHHAWTYRNVGRTVDDAAGEAFDK 193 Query: 169 TAKLLGLDYPGGPLLSKMAAQGTAGRFVF--------------PRPMTDRPGLDFSFSGL 214 AKLLGL YPGGP + +A G A F P + FSFSG+ Sbjct: 194 VAKLLGLGYPGGPWVDALAPFGDARAVPFSFAQVKAKAHRRADPVALHPEEATYFSFSGI 253 Query: 215 KTFAANTIRDNGTD-----------------------------DQTRADIARAFEDAVVD 245 KT ++ + + DQ D+ +F+ AVV Sbjct: 254 KTAVLRYVQTHDMEARIAARRQAMATMPDASPRRDLEAVRALCDQESLDLLASFQRAVVG 313 Query: 246 TLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMI 305 L+ K RA ++ ++++GGV+ANR LR + + V + + TDN AMI Sbjct: 314 DLVRKTFRAAERYDVAEILVSGGVAANRELRERFTAEAAAQGLPVAFPSLKLATDNAAMI 373 Query: 306 AYAGMVRFKAGATADLGVSVRPRWPLA 332 A A + A ++ L Sbjct: 374 AAAAWPKLITSEFAGETLTAAAGLKLG 400 >UniRef50_Q5ZZQ1 Probable O-sialoglycoprotein endopeptidase n=8 Tax=Mycoplasma RepID=GCP_MYCH2 Length = 322 Score = 323 bits (828), Expect = 6e-87, Method: Composition-based stats. Identities = 118/322 (36%), Positives = 175/322 (54%), Gaps = 13/322 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETS D+ +A++ + K + SQ +LH +GG VPELASR+H R +++ Sbjct: 1 MKILGIETSHDDASVALFSENKVEIL-LTISQFELHEQFGGTVPELASREHSRNLAIILE 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L + + IDA+AYT PGL+G L +G +L+ ++ P IP+ H+ GH + Sbjct: 60 KLLGK-NIDFSTIDAIAYTKNPGLIGPLKIGFLFASALSLFFNKPLIPIDHLLGHFWSAA 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E N EFP ++LL+SGGHTQLI E++G ++DDA GE +DK + LG YPGG Sbjct: 119 IE-NDLEFPVLSLLISGGHTQLIFAENKNNLEIIGSTVDDALGEIYDKIGRSLGCGYPGG 177 Query: 181 PLLSKMAAQGTAGR---FVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD----DQTRA 233 P + + Q F P LDFSFSGLKT N + + + Sbjct: 178 PKIDLIWQQNNVRNMELIDFSLPKVLENPLDFSFSGLKTQVINYTNNLKENYLFSQKKVV 237 Query: 234 DIARAFEDAVVDTLMIKCKRALD-QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 +IA +F+ V+ L + AL + K + + GGV+AN +R + + + +V Sbjct: 238 EIAVSFQKTVIKYLKRQLDLALKTKKNVKTITLVGGVAANSEIRKLIKTY--ENKYKVVI 295 Query: 293 ARPEFCTDNGAMIAYAGMVRFK 314 + EFCTDNGAMIA A + K Sbjct: 296 PKKEFCTDNGAMIAKAAQIFLK 317 >UniRef50_Q6C9V8 YALI0D07920p n=1 Tax=Yarrowia lipolytica RepID=Q6C9V8_YARLI Length = 376 Score = 322 bits (827), Expect = 8e-87, Method: Composition-based stats. Identities = 120/352 (34%), Positives = 177/352 (50%), Gaps = 22/352 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVPL 58 VL IETSCD+T AI ++ L VK+ D GG+ P LA+ H + PL Sbjct: 26 NVLAIETSCDDTCAAIISRDREKNTAALIDHVKITLDSSLQGGINPALATAHHHQSVGPL 85 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 I+ LK+ T ID V T GPGL G L G T + L+ VP + VHHM HLL Sbjct: 86 IRDVLKKHADTT--IDLVCATRGPGLPGCLSSGVTFAKGLSLGLGVPYLGVHHMLAHLLT 143 Query: 119 PML-------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 P L + EFPF++LLVSGGHT L+ + + +L + D A G+A DK A+ Sbjct: 144 PRLFEAAEGYSGHKTEFPFLSLLVSGGHTMLVLSKSLYDHTVLCNTADVAIGDALDKCAR 203 Query: 172 LLGL-DYPGGPLLSKMAAQGT--AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 LG G ++ + + ++ P P+ ++ + +SF+ ++ ++ T Sbjct: 204 TLGFQGNMLGKVMDQYCRSADTPSSQWSIPMPVDNKNDIRYSFAAFHSYIG--MKKKETQ 261 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTGFK-----RLVMAGGVSANRTLRAKLAEMM 283 +T ++A + A+ + LM K K A + + LV +GGV+AN LR L E+ Sbjct: 262 AETTPELALEVQTAIFNHLMKKTKAAFNIYKKEIASATTLVCSGGVAANPRLREALQELC 321 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 K + E + P +CTDN AMI +AG+ + G +DL P+WPLAE Sbjct: 322 AKYKLEAVFPDPYWCTDNAAMIGWAGIELHEDGYRSDLEGFQIPKWPLAEFE 373 >UniRef50_Q17Z01 Probable O-sialoglycoprotein endopeptidase n=13 Tax=Helicobacter RepID=GCP_HELAH Length = 342 Score = 320 bits (821), Expect = 5e-86, Method: Composition-based stats. Identities = 106/335 (31%), Positives = 170/335 (50%), Gaps = 6/335 (1%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ +A+ ++ L+A+ SQ K H+ YGGVVPELASR H L++ Sbjct: 2 ILSIESSCDDSSLALTRIEDAKLIAHFKISQEKHHSSYGGVVPELASRLHAENLPLLLER 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 + A+A T PGL L+ G + ++L+ + ++P I H+ GH+ + + Sbjct: 62 IKISLNKDFSKLKAIAITNQPGLSVTLIEGLMMAKALSLSLNLPLILEDHLRGHVYSLFI 121 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + P LLVSGGH+ ++ +++ S+DD+ GE+FDK +K+L L YPGGP Sbjct: 122 NEKKTCMPLSVLLVSGGHSLILEARNYEDIKIMATSLDDSFGESFDKVSKMLNLGYPGGP 181 Query: 182 LLSKMAAQGTAG--RFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTRADIAR 237 ++ K+A +FP P+ + L FSFSGLK I N ++ T+ I Sbjct: 182 VIEKLALDYAHKNEPLMFPIPLKNSLNLAFSFSGLKNAVRLEIEKNAPNLNEITKQKIGY 241 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 F+ ++ L+ + KR K + GG S N LR + + ++ A EF Sbjct: 242 HFQSVAIEHLIQQTKRYFKTKRPKIFGIVGGASQNLVLRKAFENLCDEFDCKLVLAPLEF 301 Query: 298 CTDNGAMIAYAGMVRFKAGATADLG-VSVRPRWPL 331 C+DN AMI + + ++ L S+ PR L Sbjct: 302 CSDNAAMIGRSSLEAYQKKHFVPLEKASISPRTLL 336 >UniRef50_Q46FS9 Putative O-sialoglycoprotein endopeptidase n=17 Tax=root RepID=GCP_METBF Length = 545 Score = 320 bits (820), Expect = 5e-86, Method: Composition-based stats. Identities = 105/341 (30%), Positives = 166/341 (48%), Gaps = 23/341 (6%) Query: 1 MR---VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 M+ +LGIE + AI E ++A + GG+ P A++ H + Sbjct: 1 MKNTFILGIEGTAWNLSAAIV-TETEIIAEVTETY---KPTAGGIHPREAAQHHAKYAAS 56 Query: 58 LIQAALKES---GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 +I+ L E+ G+ DID +A++ GPGL L AT R L+ + +P I V+H Sbjct: 57 VIKRLLAEAKEKGVKPSDIDGIAFSQGPGLGPCLRTVATAARMLSISLGIPLIGVNHCIA 116 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 H+ + + V L VSG ++Q+IS G G+Y + GE++D G A DK A+ Sbjct: 117 HIEIGIWRTPAMDP--VVLYVSGANSQVISYMG-GRYRVFGETLDIGLGNALDKFARGAN 173 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 L +PGGP + A T P + G+D SFSGL T A+ ++ D Sbjct: 174 LPHPGGPKIEAYAKNATKY---IHLPYVIK-GMDLSFSGLSTAASEALKK-----APLED 224 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + ++++ ++ +RAL TG K +++AGGV AN LR L +M + R + + Sbjct: 225 VCYSYQETAFAMVVEVAERALAHTGKKEVLLAGGVGANTRLREMLNDMCEARGAKFYVPE 284 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAEL 334 F DNG MIAY G++ +K+G T L V P + ++ Sbjct: 285 KRFMGDNGTMIAYTGLLMYKSGNTLSLEDSRVNPSYRTDDV 325 >UniRef50_UPI0001979AA5 putative DNA-binding/iron metalloprotein/AP endonuclease n=1 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI0001979AA5 Length = 380 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 109/359 (30%), Positives = 173/359 (48%), Gaps = 40/359 (11%) Query: 3 VLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ +A+ +K L+ + SQ H+ YGG+VPE+ASR H ++ +++ Sbjct: 2 ILSIESSCDDSSLALTRIIDKKLIYHIKISQDSEHSTYGGIVPEIASRLHAKRLPEILKK 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA--- 118 I AVA T PGL L+ G + ++L +P I V+H++GH+ + Sbjct: 62 LKMFLNNDLSLIKAVAVTTRPGLSVTLIEGLMMAKTLCLGLQIPLICVNHLKGHIYSLCI 121 Query: 119 ------------------PMLEDN-------------PPEFPFVALLVSGGHTQLISVTG 147 P+LE + + LLVSGGHTQ++ V Sbjct: 122 SKDFATDSAKDSRKNAPPPLLESHLKSHTESLLESRQNKQDSLGVLLVSGGHTQILQVND 181 Query: 148 IGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGT---AGRFVFPRPMTDR 204 ++ +S+DD+ GE+FDK AK L L YPGGP + + A + FP P+ Sbjct: 182 FHHISIIAQSLDDSFGESFDKVAKHLNLGYPGGPQVERYAKNCEINQYKPYEFPIPLLHN 241 Query: 205 PGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKR 262 L+FSFSGLK I+ + Q A I++ F++A + ++ K + K Sbjct: 242 KKLEFSFSGLKNAVRLAIQEMEQPLSLQDIASISKGFQNAACEHIVRKTRLFFQHFEGKY 301 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL 321 + GG SAN LR +++E+ + E++ A EFC+DN AMI G+ + L Sbjct: 302 FAIVGGASANTYLRERMSELCNEFDKELYLADLEFCSDNAAMIGRVGVEHYLRDEFTPL 360 >UniRef50_D0JBS4 Glycoprotease M22 family domain-containing protein n=2 Tax=Blattabacterium RepID=D0JBS4_BLASB Length = 327 Score = 316 bits (810), Expect = 9e-85, Method: Composition-based stats. Identities = 108/303 (35%), Positives = 178/303 (58%), Gaps = 15/303 (4%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCD+T ++I + +L+N + Q ++H YGGVVPELASR H + P + Sbjct: 19 IILGIESSCDDTAVSII-KNRDVLSNIIIHQ-EIHKQYGGVVPELASRLHDQNMTPAVNQ 76 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ + + +IDAV++T GPGL+G+LLVGA+ +S + ++P + V+H++ H+L + Sbjct: 77 AIHSAKIKKNEIDAVSFTLGPGLIGSLLVGASFAKSFSMGLEIPLLTVNHVQAHILTHFI 136 Query: 122 EDNPP-----EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 ++ +FPF+ L++SGGHTQ++ V + E+LG ++DD+ G+ FDK A+LLG Sbjct: 137 KNANMNNSYPKFPFLGLVISGGHTQIVKVNDFFKMEILGSTLDDSIGDTFDKIARLLGFH 196 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-----DQT 231 YPGGP++ + G +F F +P L+FSFSG K+ I+ Q Sbjct: 197 YPGGPMIELFSKNGNCKKFGFSKP--SVNDLNFSFSGFKSHVLQFIKKKSKKNPLFIKQN 254 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK-RRGEV 290 +DI + + + + L+ K ++A T R+ +AGGVSAN +R K+ ++ E+ Sbjct: 255 LSDICASIQRIIAEILLEKVEKATLITDIFRVALAGGVSANCEIRRMFISFAKRNKKWEI 314 Query: 291 FYA 293 F Sbjct: 315 FIP 317 >UniRef50_P43122 Putative protease QRI7 n=12 Tax=Saccharomycetaceae RepID=QRI7_YEAST Length = 407 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. Identities = 119/368 (32%), Positives = 179/368 (48%), Gaps = 40/368 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKG-----LLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 +VL IETSCD+T +++ D +LAN + + D GG++P A H + Sbjct: 34 KVLAIETSCDDTCVSVLDRFSKSAAPNVLANLKDTLDSI--DEGGIIPTKAHIHHQARIG 91 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 PL + AL ES + ID + T GPG+ G+L G + LA AW+ P I VHHM GHL Sbjct: 92 PLTERALIESNAR-EGIDLICVTRGPGMPGSLSGGLDFAKGLAVAWNKPLIGVHHMLGHL 150 Query: 117 LAPMLEDN--PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 L P + N P+FPFV+LLVSGGHT + I +E+L ++ID A G++ DK + LG Sbjct: 151 LIPRMGTNGKVPQFPFVSLLVSGGHTTFVLSRAIDDHEILCDTIDIAVGDSLDKCGRELG 210 Query: 175 LDYPGGPLLSKMAA--------QGTAGRFVFPRPMTD----RPGLDFSFSGLKTFAANTI 222 + G + +M Q A + P P+ + R L FSFS T + Sbjct: 211 --FKGTMIAREMEKFINQDINDQDFALKLEMPSPLKNSASKRNMLSFSFSAFITALRTNL 268 Query: 223 RDNGT------DDQTRADIARAFEDAVVDTLMIKCKRALDQ-----TGFKRLVMAGGVSA 271 G ++ IA +++V D ++ K K L + V +GGVS+ Sbjct: 269 TKLGKTEIQELPEREIRSIAYQVQESVFDHIINKLKHVLKSQPEKFKNVREFVCSGGVSS 328 Query: 272 NRTLRAKLAEMMKKRR----GEVFYARPEFCTDNGAMIAYAGMVRFKA-GATADLGVSVR 326 N+ LR KL + +Y + C+DN MI +AG+ +++ +DL + Sbjct: 329 NQRLRTKLETELGTLNSTSFFNFYYPPMDLCSDNSIMIGWAGIEIWESLRLVSDLDICPI 388 Query: 327 PRWPLAEL 334 +WPL +L Sbjct: 389 RQWPLNDL 396 >UniRef50_C3XEQ4 O-sialoglycoprotein endopeptidase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XEQ4_9HELI Length = 500 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. Identities = 109/340 (32%), Positives = 173/340 (50%), Gaps = 21/340 (6%) Query: 1 MRVLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 MR+L IE+SCD++ +A D LL ++ SQ H+ YGGVVPELASR R V L+ Sbjct: 1 MRILSIESSCDDSALAYTDGTNTKLLWHEKISQEASHSHYGGVVPELASRLFARDLVQLL 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + + + KDI +A T PGL +LL G + ++LA + ++P + ++H++GH+ + Sbjct: 61 ENF--KQNFSLKDITHIAVTNEPGLSTSLLEGVMMAKALALSLNIPLLGINHLKGHIYSL 118 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 +E P LLVSGGHT L+ ++ ++DD+ GE +DK AK+LGL YPG Sbjct: 119 FIESEAI-LPLCVLLVSGGHTMLLECYSYNDMRVIANTLDDSFGECYDKAAKMLGLGYPG 177 Query: 180 GPLLSKMAAQGTAG---RFVFPRPMTDRPGLDFSFSGLKTF---------AANTIRDNGT 227 G ++ MA P P+ ++ FSFSGLK I+D+ T Sbjct: 178 GMIIDSMAQMALKENIAPIALPIPLVNQNIQSFSFSGLKNAFRLQLEKMELKTLIQDSKT 237 Query: 228 DDQTRA----DIARAFEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEM 282 D + +A +++ L+ KC+ + Q K + GG SAN LR K + Sbjct: 238 QDIKNSTQAKALALGLQESATTHLIQKCRSYMKQNSHIKHFAIVGGASANSMLREKAQSL 297 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG 322 + ++ + ++C+DN AMI A + + + D+ Sbjct: 298 AAQFDNKLLMSELKYCSDNAAMIGRAAIAKIRHENMIDIE 337 >UniRef50_Q9NPF4 Probable O-sialoglycoprotein endopeptidase n=81 Tax=Eukaryota RepID=OSGEP_HUMAN Length = 335 Score = 314 bits (804), Expect = 4e-84, Method: Composition-based stats. Identities = 114/339 (33%), Positives = 174/339 (51%), Gaps = 14/339 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG E S ++ G+ + D K +LAN + V G +P +R H + L+Q A Sbjct: 4 VLGFEGSANKIGVGVVRDGK-VLANPRRTYVTPPGT--GFLPGDTARHHRAVILDLLQEA 60 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ESGLT++DID +AYT GPG+ L+ A V R++A W+ P + V+H GH+ L Sbjct: 61 LTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNKPLVGVNHCIGHIEMGRLI 120 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYPGG 180 L VSGG+TQ+I+ + +Y + GE+ID A G D+ A++L + D G Sbjct: 121 TGATSP--TVLYVSGGNTQVIAYSEH-RYRIFGETIDIAVGNCLDRFARVLKISNDPSPG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADIARAF 239 + +MA +G + P T + G+D SFSG+ +F + T + T D+ + Sbjct: 178 YNIEQMAKRGKK---LVELPYTVK-GMDVSFSGILSFIEDVAHRMLATGECTPEDLCFSL 233 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ++ V L+ +RA+ G + ++ GGV N L+ +A M ++R +F FC Sbjct: 234 QETVFAMLVEITERAMAHCGSQEALIVGGVGCNVRLQEMMATMCQERGARLFATDERFCI 293 Query: 300 DNGAMIAYAGMVRFKAGATADLGVS-VRPRWPLAELPAA 337 DNGAMIA AG F+AG L S V R+ E+ Sbjct: 294 DNGAMIAQAGWEMFRAGHRTPLSDSGVTQRYRTDEVEVT 332 >UniRef50_C4XSD3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Desulfovibrio RepID=GCP_DESMR Length = 371 Score = 311 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 142/347 (40%), Positives = 193/347 (55%), Gaps = 21/347 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A++++ + +L +L SQ LHA +GGVVPELASR+H+R+ PL+Q Sbjct: 1 MLCLGIETSCDETAVALFENGRPVL-EKLASQADLHAVFGGVVPELASREHLRRLGPLLQ 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A SG + D+DA+A GPGL+G+LLVG + L+ A P I V H+ HLLA Sbjct: 60 ALFAASGRSLADVDAIAVARGPGLLGSLLVGLAAAKGLSLATGKPLIGVDHLHAHLLAAT 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + FP + LLVSGGHTQ++ + E+LG ++DDAAGEAFDK AK L YPGG Sbjct: 120 IG-RDVAFPALGLLVSGGHTQIVRLESALSLEVLGRTLDDAAGEAFDKAAKSFNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------------ 228 + + + +FPRP D DFSFSGLKT A+ + Sbjct: 179 VYIDVLGRGIAPDKTLFPRPFLDNDHFDFSFSGLKTAVASYAAAHPELRAGSLAEAGGAI 238 Query: 229 -----DQTRADIARAFEDAVVDTLMIKCKRALDQT--GFKRLVMAGGVSANRTLRAKLAE 281 + A+ +TL IK +RALD+ L+ AGGV+AN +RA LA+ Sbjct: 239 DPEAWPMALRRACSSLNFAIAETLRIKFERALDRQPGPPASLIAAGGVAANGPIRAMLAD 298 Query: 282 MMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +R ++ P C DN MIA AG +AG DL ++ PR Sbjct: 299 LAARRGLPLYLPEPALCADNAVMIAAAGSRLAEAGYAHDLALTAVPR 345 >UniRef50_C4QZU9 Putative metalloprotease, similar to O-sialoglycoprotein metallopeptidase from P. haemolytica n=1 Tax=Pichia pastoris GS115 RepID=C4QZU9_PICPG Length = 373 Score = 310 bits (796), Expect = 3e-83, Method: Composition-based stats. Identities = 111/361 (30%), Positives = 181/361 (50%), Gaps = 27/361 (7%) Query: 2 RVLGIETSCDETGIAIYDDE---KGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +VL IE+SCD++ +++ D K ++ + + S + GGV+P A H + L Sbjct: 13 KVLAIESSCDDSCVSLIDRSAGAKPIVLDHVKSTLNS-VKAGGVIPTSAHLHHQKSIAGL 71 Query: 59 IQAALKESGLTAKDI-DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 ++ L++ ++ + + V T GPG+ G+L +G + L+ AW + VHHM GHLL Sbjct: 72 VKQVLQKHNISGVNCPELVCVTRGPGMPGSLSIGVDTAKGLSVAWGSQFLGVHHMLGHLL 131 Query: 118 APMLEDN--PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 P LE N P+FPF++LL SGGHT L+ + +E+L +ID AAG+A DK A+ +G+ Sbjct: 132 IPRLESNGEEPQFPFLSLLASGGHTMLVLSRSLLDHEILVNTIDIAAGDALDKCAREIGI 191 Query: 176 -DYPGGPLLS----KMAAQGTAG-RFVFPRPMTDRPG----LDFSF----SGLKTFAANT 221 G L K + P+P+ ++P L FSF SG+K Sbjct: 192 RGNMIGKELELFLNKNPQLSLKDIPWEMPQPLKNKPKRVDTLGFSFTPFISGVKLSLERY 251 Query: 222 IRDNGTDDQTRADIARAFEDAVVDTLMIKCKRA----LDQTGFKRLVMAGGVSANRTLRA 277 +N D+ + ++A+ D ++ + A + K V +GGV AN+ LR Sbjct: 252 -HNNEVKDELMPAMGFRIQEAIFDHIIDRVLVAYKVRPELNQIKTFVGSGGVVANQRLRV 310 Query: 278 KLAEMMKKRRGE-VFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 KL +K E + P CTDN MI +AG+ ++ G T++L V+ +W + L Sbjct: 311 KLQAALKSHGVENFHFPPPALCTDNAIMIGWAGIELYENGVTSELDVTPLRKWSVEGLEK 370 Query: 337 A 337 + Sbjct: 371 S 371 >UniRef50_Q1IZH8 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Deinococci RepID=GCP_DEIGD Length = 333 Score = 310 bits (794), Expect = 6e-83, Method: Composition-based stats. Identities = 135/303 (44%), Positives = 179/303 (59%), Gaps = 14/303 (4%) Query: 3 VLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +LGI+TSCD+TG+ + D + AN+++SQ +HA YGGV+PELASR+HV + + Sbjct: 7 ILGIDTSCDDTGVGVVELAPDGSVQVRANRVWSQ-TVHAQYGGVLPELASREHVERIDTV 65 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 AL E+GLT D+ AVA T+GPGLVGALLVG G+ LA A +VP HH+EGH+ A Sbjct: 66 TGDALAEAGLTVGDLAAVAATSGPGLVGALLVGLMYGKGLAQALNVPFYAAHHLEGHIFA 125 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 + + P++AL+VSGGHT L V G+Y L+G + DDAAGEAFDK A+L GL YP Sbjct: 126 AA-SEADLQAPYLALVVSGGHTHLFDVPREGEYVLVGATRDDAAGEAFDKVARLAGLGYP 184 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP +S+ A +G F P+ + G DFSFSGLKT A R + D+A Sbjct: 185 GGPAISEAARRGDPEAVPFKEPLQGQKGFDFSFSGLKTAALLAHRAGAKPE----DLAAG 240 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 FE A V L+ RA G + +V++GGV+ANR LR + Sbjct: 241 FERAAVRFLVGTTLRAARAYGRETVVVSGGVAANRALREAF----AASPVRAVFPGKGLN 296 Query: 299 TDN 301 TDN Sbjct: 297 TDN 299 >UniRef50_A6VJ51 Putative O-sialoglycoprotein endopeptidase n=26 Tax=cellular organisms RepID=GCP_METM7 Length = 547 Score = 306 bits (785), Expect = 5e-82, Method: Composition-based stats. Identities = 95/331 (28%), Positives = 162/331 (48%), Gaps = 17/331 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +G E + ++TG+ I + +L N+ + G+ P A+ H V L++ Sbjct: 7 LICIGFEGTAEKTGVGIITSKGEVLFNKT---IIYTPPVQGIHPREAADHHAETFVKLLK 63 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + + ID V+++ GPGL +L V AT R+L+ + + P I V+H H+ Sbjct: 64 EALTV--VPIEKIDLVSFSLGPGLGPSLRVTATTARALSLSINKPIIGVNHCISHVEIGK 121 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+ + + + L VSGG+TQ+++ TG +Y ++GE++D A G D+ A+ + +PGG Sbjct: 122 LKTDAVDP--LTLYVSGGNTQVLAYTGK-KYRVIGETLDIAIGNCLDQFARHCNMPHPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 + K A G P T + G+D S SGL T A + D+ + + Sbjct: 179 VYVEKYAKDGNK---FMKLPYTVK-GMDISLSGLLTAAMKKY----DSKERIEDVCYSLQ 230 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L +RAL T +++ GGV+AN L+ L M ++ + + EFC D Sbjct: 231 ETSFSMLTEITERALAHTNKAEVMLVGGVAANNRLKEMLDVMCSEQNVDFYVPEREFCGD 290 Query: 301 NGAMIAYAGMVRFKAGATADL-GVSVRPRWP 330 NGAMIA+ G++++ G DL + Sbjct: 291 NGAMIAWLGILQYLNGKRMDLADTKPISNYR 321 >UniRef50_A4VEZ5 O-sialoglycoprotein endopeptidase n=1 Tax=Tetrahymena thermophila SB210 RepID=A4VEZ5_TETTH Length = 377 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 107/379 (28%), Positives = 175/379 (46%), Gaps = 54/379 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIE S ++ G+ I + +LAN + + G +P + H K + ++ Sbjct: 1 MIALGIEGSANKIGVGIVKSDGTILANPKTTFITPPGT--GFLPNETAVHHRSKILDIVD 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ LT KDI + YT GPG+ L +GA V R+L+ ++P I V+H GH+ Sbjct: 59 QALKEANLTFKDIGLICYTKGPGMGPPLSIGAIVSRTLSLLHNIPLIGVNHCIGHIEMGR 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYP 178 L L VSGG+TQ+I+ + +Y + GE++D A G D+ A+++ L D Sbjct: 119 LATGITHPA--VLYVSGGNTQVIAYSNQ-RYRIFGEALDIAVGNCLDRFARIINLSNDPA 175 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG------------ 226 G + ++A QG + P T + G+D SFSG+ ++ + + N Sbjct: 176 PGYNIEQLAKQG---KQFIQVPYTVK-GMDMSFSGILSYFEDIVAQNPHLQYEDGVVPEK 231 Query: 227 ------------------------------TDDQTRADIARAFEDAVVDTLMIKCKRALD 256 D TRAD+ + ++ + L +RA+ Sbjct: 232 DAKQQDEDDSLDNRKRKKNKKVVNKKILDLPKDITRADLCYSLQETIFAMLTEVTERAMA 291 Query: 257 QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 +++ GGV N L+ + +M+ +R G+V +C DNGAMIAYAG++ ++AG Sbjct: 292 HCNSNEVIIVGGVGCNVRLQEMIGQMVSERGGKVGAMDHRYCIDNGAMIAYAGILEYEAG 351 Query: 317 ATADLGVSV-RPRWPLAEL 334 D S R+ E+ Sbjct: 352 GRMDFKDSYFTQRFRTDEV 370 >UniRef50_Q8EUQ9 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma penetrans RepID=GCP_MYCPE Length = 306 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 104/314 (33%), Positives = 166/314 (52%), Gaps = 12/314 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCD+T +AI +D K +L+ + + K +GG+VPE+ +R H + + Sbjct: 1 MYILSIETSCDDTSVAILEDNK-VLSCIIKNDSKQLNPFGGIVPEIVARYHEENIIKALD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ES ++ ID VAYT PGL G+L VG +++A+A DV +P++H+ GH+L+P Sbjct: 60 LALQESNISLNQIDKVAYTNQPGLPGSLFVGEIFAKTMAYALDVECVPINHIHGHILSPF 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + P ++PF++L+ SG T + V + L ++ DDA GE FDK K LG DYP G Sbjct: 120 INSVP-KYPFMSLIASGKTTSIFLVKSANEIIELTKTRDDAIGEIFDKVGKALGYDYPAG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARA 238 P L K A P+ + DFSFSG+K + I ++ I + Sbjct: 179 PKLDKYFDISKATITPSFPPVKN----DFSFSGIKNKFLSIINSSKMKNEEIDTITIGSS 234 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F +D ++ K K D+ + + GGV+ N + ++ ++ + F ++ Sbjct: 235 FLKYSIDLIIKKLKYYKDEYSVDCVCIGGGVANNNYFKQEIKKLFS----DSFVPESKYS 290 Query: 299 TDNGAMIAYAGMVR 312 TDN AMI +A + Sbjct: 291 TDNAAMIGFAYYEK 304 >UniRef50_Q4PGZ6 Putative uncharacterized protein n=2 Tax=Ustilaginomycotina RepID=Q4PGZ6_USTMA Length = 414 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 118/368 (32%), Positives = 185/368 (50%), Gaps = 39/368 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCD++ +I ++ +L++ + Q H+ GG+ P A+ H I A Sbjct: 51 LILGIETSCDDSCASIVSSDRTILSSIVTKQD--HSSTGGIHPLSAALGHHSNLASTIAA 108 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A++++ +TA D+ A+A T GPG+ +L VG + ++L+ +P I VHHM+ H L P+L Sbjct: 109 AIEQARITASDLHAIAVTQGPGMASSLGVGLSAAKTLSAVLHIPLIYVHHMQAHALTPLL 168 Query: 122 EDN-PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--- 177 + PP+ PF+ LLVSGGHT L+ + + +L + DD+ G+AFDK A+ LG+ + Sbjct: 169 TEPDPPKLPFLVLLVSGGHTMLVLARSVTHFRILATTSDDSIGDAFDKVARDLGIPWTSA 228 Query: 178 PGGPLLSKMAAQGTAGR-FVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD---DQTRA 233 PG L + A G VFP P +P FS+SGLK I D + ++ Sbjct: 229 PGAALEALAARAEAHGDGLVFPTPCKGQPT--FSYSGLKAAVQRHIASCSPDAMAESAKS 286 Query: 234 DIARAFEDAVVDTLMIKCKRALD-----------------------QTGFKRLVMAGGVS 270 IA AF+ A L K L K +V +GGV+ Sbjct: 287 SIAAAFQRAACAQLEDKLSMVLRPSHVSQDSRHRPFARIELLDGVSSDDVKTVVCSGGVA 346 Query: 271 ANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +N +R++L E + + ++ + CTDN AMIA+ G + + T D RP Sbjct: 347 SNAFIRSRLREHLDRLGRTDVDLQFPPLSLCTDNAAMIAWVGHLIY-HQRTRDYTRHARP 405 Query: 328 RWPLAELP 335 +W L ++P Sbjct: 406 KWSLQDIP 413 >UniRef50_P75055 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Mycoplasma RepID=GCP_MYCPN Length = 319 Score = 300 bits (768), Expect = 5e-80, Method: Composition-based stats. Identities = 109/313 (34%), Positives = 170/313 (54%), Gaps = 19/313 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIET+CD+T I + + K + A+ + S KLHA GGVVPE+A+R H + + A Sbjct: 7 ILGIETTCDDTSIGVITESK-VQAHIVLSSAKLHAQTGGVVPEVAARSHEQNLL----KA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L++SG+ + I +AY A PGL G L VGAT RSL+F D P +P++H+ H+ + +++ Sbjct: 62 LQQSGVVLEQITHIAYAANPGLPGCLHVGATFARSLSFLLDKPLLPINHLYAHIFSALID 121 Query: 123 D--NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 N + P + L+VSGGHT + + + EL+ E+ DDA GE +DK + +G YP G Sbjct: 122 QDINQLKLPALGLVVSGGHTAIYLIKSLFDLELIAETSDDAIGEVYDKVGRAMGFPYPAG 181 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA------D 234 P L + F RP T FS+SGLK+ I+ + Sbjct: 182 PQLDSLFQPELVKSHYFFRPSTKWTK--FSYSGLKSQCFTKIKQLRERKGFNPQTHDWNE 239 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 A F+ ++D + K A+ Q + L++ GGVSAN+ LR ++ ++ + A Sbjct: 240 FASNFQATIIDHYINHVKDAIQQHQPQMLLLGGGVSANKYLREQVTQL----QLPYLIAP 295 Query: 295 PEFCTDNGAMIAY 307 ++ +DNGAMI + Sbjct: 296 LKYTSDNGAMIGF 308 >UniRef50_C4PYC5 Mername-AA018 peptidase (M22 family) n=1 Tax=Schistosoma mansoni RepID=C4PYC5_SCHMA Length = 388 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 105/341 (30%), Positives = 164/341 (48%), Gaps = 35/341 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TG A+ + LL + L SQ ++ GGV+P +A+ H ++ A Sbjct: 36 VLGIETSCDDTGAAVIETSGKLLGDCLSSQSRISVMLGGVLPSVAAELHKENIESVVNTA 95 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + +S + +D++ VA T PG+ +L +G + +SLA +P IP+ HME H L + Sbjct: 96 MAKSNIGLRDLNFVAVTVKPGMPLSLKIGVSFAKSLASRLKIPIIPIDHMEAHALTALFT 155 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------- 175 D +FP++ LL+SGGH L V G+ Y LLG ++D + G+ DK ++ L L Sbjct: 156 DPQLKFPYMILLISGGHGILGIVQGLEDYVLLGTALDASPGDVLDKLSRRLKLNRLSDEC 215 Query: 176 --DYPGGPLLSKMAA--QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 GG + +A G RF P P + DFSF+G+ A I ++++ Sbjct: 216 LKGVAGGKAIEIIAKTYNGDHQRFNLPLPRSQSKDCDFSFTGIHAAAEQLINKLESENR- 274 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 T + C + Q V++GGV +N +RA L E+ Sbjct: 275 -------------GTFYLPCSIFISQMK----VVSGGVGSNCVIRAGLTEVANHYNLRFV 317 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT------ADLGVSVR 326 P CTDNG MIA+ G++ K ++ + + R Sbjct: 318 APPPSLCTDNGIMIAWNGVLLQKENSSRIIEDISSVDFCPR 358 >UniRef50_Q74M58 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Nanoarchaeum equitans RepID=GCP_NANEQ Length = 314 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 98/331 (29%), Positives = 169/331 (51%), Gaps = 19/331 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIE + G+ I+D EKG+LAN+ K+ G+ P A+ H+++ ++ Sbjct: 1 MKVLGIECTAHTFGVGIFDSEKGVLANE-----KVTYKGYGIHPREAAELHLKEFDKVLL 55 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ ++ KDID +A ++GPGL+ L +G + L + P I V+H+ H Sbjct: 56 KALEKANISLKDIDLIAVSSGPGLLPTLKLGNYIAVYLGKKLNKPVIGVNHIVAHNEFAR 115 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + F + VSG +TQ +++ + L+GE++D G DK A+ LGL++PGG Sbjct: 116 YLAKAKDPLF--VYVSGANTQFLAIVN-NSWFLVGETLDMGVGNLIDKVARDLGLEFPGG 172 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P + ++A +G + + P T + GL+ G+ T+ D ++ DIA + + Sbjct: 173 PKIEELAKKG---KNLIELPYTIK-GLNLQLGGIYTYIKRI-----KDQYSKEDIAYSLQ 223 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + V ++ +RA+ K L++ GGV+ N L +M K+ + + ++ TD Sbjct: 224 EWVFALILEIAERAMHMLDKKELILTGGVACNNRLNDMAEQMAKENNFKFYRLPCQYLTD 283 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 NGAMIAY G + G RP W + Sbjct: 284 NGAMIAYLGYYWYSQGIY--YEPKPRPYWRI 312 >UniRef50_B7XIP4 O-sialoglycoprotein endopeptidase n=2 Tax=Eukaryota RepID=B7XIP4_ENTBH Length = 360 Score = 295 bits (755), Expect = 2e-78, Method: Composition-based stats. Identities = 104/327 (31%), Positives = 166/327 (50%), Gaps = 20/327 (6%) Query: 3 VLGIETSCDETGIAIY---DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 VLGIE+S ++ G+ I ++ LLAN+ + A GV+P A++ H + LI Sbjct: 13 VLGIESSANKIGVGILKIMNENVELLANERKTY--TPAPGAGVIPIDAAKHHRDVILELI 70 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 +L++S L +DID AYT GPG+ L+VG V R+LA + P +PV+H H+ Sbjct: 71 DVSLQKSNLVIQDIDLYAYTKGPGMYQLLVVGCVVARTLALYHNKPLVPVNHCVAHIEMG 130 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTG--IGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 + L SGG+TQ+I+ +Y++ GE+ID A G FDK A+ LGLD Sbjct: 131 RFITGAKNP--IVLYASGGNTQIINRISGKTNKYKIFGETIDVAVGNCFDKVARALGLDN 188 Query: 178 PGGP--LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA-- 233 P + + A ++ P P T + G+D SFSG+ + I+D + + + A Sbjct: 189 APSPGFNIERQAELNHEKKY-IPLPYTIK-GMDMSFSGILSTCLKLIKDFKSTNPSSAQF 246 Query: 234 -----DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 +I + ++ + L+ +R +++ GGV N L+ + +M+ +R G Sbjct: 247 KKFISEICFSLQETMFSILVEATERCCSFVESNEVLIVGGVGCNLRLQEMIHKMITQRGG 306 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKA 315 V+ +C DNGAMIAY G + FK Sbjct: 307 TVYSMNEAYCIDNGAMIAYTGYLIFKH 333 >UniRef50_Q4UA14 Glycoprotein endopeptidase, putative n=3 Tax=Piroplasmida RepID=Q4UA14_THEAN Length = 363 Score = 294 bits (754), Expect = 2e-78, Method: Composition-based stats. Identities = 99/350 (28%), Positives = 160/350 (45%), Gaps = 22/350 (6%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIE S ++ GIA+ + +L+N + D G +P S+ H L+ AL Sbjct: 15 LGIEGSANKLGIAVIRGDGEILSNVRRTY--SPPDGEGFLPRQVSKHHRENMASLLMEAL 72 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 +++G+T D+ + YT GPG+ L VGA +++ F P + V+H H+ Sbjct: 73 EKAGITLSDLSLICYTKGPGIGSGLHVGALAAKTIHFITGKPIVGVNHCVAHVEMGRFLS 132 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQ-YELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + L VSGG+TQ++S + Y +LGE++D A G D+ A+LL L P Sbjct: 133 GYKKPA--ILYVSGGNTQVLSYDEKRKVYSVLGETLDIAIGNVLDRIARLLHLPNKPAPG 190 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-----------DQT 231 LS + + + P P + G+D S SGL T + I T +Q Sbjct: 191 LSIELQARKSSKNLIPLPFVVK-GMDCSLSGLLTKCEDLIEHFKTKLIMSEDSAFEYEQF 249 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + D+ + ++ L+ +RA+ T +++ GGV N L+ M K+R ++F Sbjct: 250 KVDLCFSVQEHTFAMLIEMLERAMSFTDSDEILLVGGVGCNLRLQEMANLMAKERNAKLF 309 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT-----ADLGVSVRPRWPLAELPA 336 +C DNGAMI Y GM+ + G V+V R+ + P Sbjct: 310 PMDERYCIDNGAMIGYTGMIDYLYGLKEKCVLEPKEVTVSQRYRTDQAPV 359 >UniRef50_B1AJ51 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Ureaplasma RepID=GCP_UREP2 Length = 320 Score = 294 bits (752), Expect = 4e-78, Method: Composition-based stats. Identities = 108/323 (33%), Positives = 167/323 (51%), Gaps = 13/323 (4%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET +A++++ K L+A+++ S + + +GGVVPELASR H + L Sbjct: 6 LILSIESSCDETSLALFENNK-LIAHKISSSASIQSLHGGVVPELASRYHEQNINHLFNE 64 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E+ + I VAYTA PGL G L VG + LA + +P++H+ H+ + + Sbjct: 65 ILNETKINPLTITHVAYTAMPGLPGCLHVGKVFAKQLAVLINAELVPINHLHAHVFSASI 124 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 N FPF+ L+VSGG + + V + ++L ++ DDA GE +DK A++LG YPGGP Sbjct: 125 NQNLT-FPFLGLVVSGGESCIYLVNDYDEIKVLNQTHDDAIGECYDKIARVLGWKYPGGP 183 Query: 182 LLSKMAAQGTAG-RFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT--RADIARA 238 ++ K + A F+ +P DFSFSGLKT N I + + +A + Sbjct: 184 IIDKNYQENLATLEFIKSQPAAK----DFSFSGLKTAVINYIHNAKQKKISFDPVVVASS 239 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+ ++ ++ K K L+ L + GGVSAN LR K+ + + + Sbjct: 240 FQKFAINEIIKKIKYYLNLYKLNHLAIGGGVSANSLLRKKIQSL----DVISYIPEMIYT 295 Query: 299 TDNGAMIAYAGMVRFKAGATADL 321 DN AMI K + L Sbjct: 296 GDNAAMIGAYAYALIKNHKKSIL 318 >UniRef50_UPI0000DB7930 PREDICTED: similar to O-sialoglycoprotein endopeptidase-like 1 n=1 Tax=Apis mellifera RepID=UPI0000DB7930 Length = 385 Score = 293 bits (751), Expect = 5e-78, Method: Composition-based stats. Identities = 100/332 (30%), Positives = 157/332 (47%), Gaps = 37/332 (11%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCD+T I D +L + SQ H ++GG++P A HV + Sbjct: 30 IILGIESSCDDTAFGIVDSNGNILGESINSQYLTHLNFGGIIPTFARSLHVNNITKTCED 89 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+ + L +DIDA+A T G+ LA P IP+HHME H L + Sbjct: 90 ALRAANLRIRDIDAIATT--------------FGKYLAKIGGKPFIPIHHMEAHALTARI 135 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYP-- 178 + +FP++ALL+SGGH L V + ++ LLG S+ + G+ F+K A+ L L + P Sbjct: 136 -NKKIDFPYLALLISGGHCLLAIVENVNKFYLLGTSLSNTPGDVFNKVARRLKLRNIPEF 194 Query: 179 ----GGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 GG + A++ +F+FP M +FSFSGL F + I Sbjct: 195 STLNGGQAIELAASKASNVNQFLFPLIMMQFRNCNFSFSGLLNFFGDMI------IPDVY 248 Query: 234 DIARAFEDAVVDTLMIKCKRALDQ--------TGFKRLVMAGGVSANRTLRAKLAEMMKK 285 + AF+ A+ + + +RA++ + LV++GGV+ N L L + + Sbjct: 249 NFCAAFQLALTTHICQRTQRAMEFINKMSLFPENKQTLVISGGVACNNFLAKALNIVSTE 308 Query: 286 RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGA 317 + CTDNG MIA+ G+ ++ Sbjct: 309 LGYTFVRTPSKLCTDNGIMIAWNGVEKWIQNI 340 >UniRef50_A2BJY9 Putative O-sialoglycoprotein endopeptidase n=22 Tax=Thermoprotei RepID=GCP_HYPBU Length = 363 Score = 292 bits (749), Expect = 1e-77, Method: Composition-based stats. Identities = 109/339 (32%), Positives = 167/339 (49%), Gaps = 15/339 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE++ G+ I + + + H GG+ P A+ H R +I A Sbjct: 32 VLGIESTAHTFGVGIASTKPPYILVSVR--DTYHPPKGGIHPREAASHHARVASEVILDA 89 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ GL+ +DIDAVA GPGL AL VGAT+ R LA + P +PV+H H+ L Sbjct: 90 LRTVGLSIRDIDAVAVALGPGLGPALRVGATIARGLAAYYGKPLVPVNHAVAHIEIARLY 149 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP---G 179 + V L VSGG+T +++ +Y + GE++D A G D A+ G+ P Sbjct: 150 TGLGDP--VVLYVSGGNT-VVAAYAKARYRVFGETLDIALGNLLDTFARDAGIAPPYIVS 206 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 G + A+ + P + G+D SFSGL T A G++D+ +A + Sbjct: 207 GLHIVDRCAEAASKPADLPYVV---KGMDVSFSGLLTAALRLWTKAGSEDE-KAAVCLGL 262 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + +++ +RAL T K +++ GGV+A+ LR K+ M + P+ Sbjct: 263 REVAYGSVVEVTERALAHTRKKSVMLTGGVAASPILRNKVRSMASYHGAVADWPPPQLAG 322 Query: 300 DNGAMIAYAGMVRFKAGATADLGVS-VRPRWPLA--ELP 335 DNGAMIA+ G++ + AG T D+ S V+ RW L E+P Sbjct: 323 DNGAMIAWTGLLNYLAGITVDVEESVVKQRWRLDVVEIP 361 >UniRef50_D2RYV2 Metalloendopeptidase, glycoprotease family n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RYV2_9EURY Length = 578 Score = 292 bits (749), Expect = 1e-77, Method: Composition-based stats. Identities = 104/359 (28%), Positives = 157/359 (43%), Gaps = 37/359 (10%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R+LGIE + A+YD + D GG+ P A+ +++ Sbjct: 5 IRILGIEGTAWAASAAVYDSATD---DVFIESDAYQPDSGGIHPREAAEHMHDAIPRVVE 61 Query: 61 AALKES---------------------GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLA 99 AL+ + G A +DA+A++ GPGL L + T R+L+ Sbjct: 62 TALEHARETHDGPAGEAPVDVDERSSSGQQAAPVDAIAFSRGPGLGPCLRIVGTAARALS 121 Query: 100 FAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESID 159 A +VP + V+HM HL + V L SG + L++ G+Y +LGE++D Sbjct: 122 QALEVPLVGVNHMVAHLEIGRHTADFDSP--VCLNASGANAHLLAYRN-GRYRVLGETMD 178 Query: 160 DAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAA 219 G A DK + +G +PGGP + AA P + G+DFSFSG+ + A Sbjct: 179 TGVGNAIDKFTRHVGWSHPGGPKVE--AAAEDGEYVDLPYVV---KGMDFSFSGIMSAAK 233 Query: 220 NTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKL 279 D+ DI + ++ + L +RAL TG LV+ GGV N LR L Sbjct: 234 QAY----DDETPVEDICFSLQENIFGMLTEVAERALSLTGSDELVLGGGVGQNERLREML 289 Query: 280 AEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPAA 337 AEM +R E P F DN MIA G ++AG T ++ V P + ++P Sbjct: 290 AEMCAQRGAEFHAPEPRFLRDNAGMIAVLGAKMYEAGDTLEIEDSQVDPNYRPDQVPVT 348 >UniRef50_A2QMR2 Function: O-sialoglycoprotein endopeptidase is a neutral metalloprotease n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QMR2_ASPNC Length = 430 Score = 292 bits (747), Expect = 1e-77, Method: Composition-based stats. Identities = 117/382 (30%), Positives = 169/382 (44%), Gaps = 54/382 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVP 57 + L IETSCD+T +AI + E A Q++ K+ D Y G+ P +A H Sbjct: 30 LLTLAIETSCDDTSVAIVEKESN--AVQIHFLDKVTCDTSAYQGIHPVVALESHQENIAS 87 Query: 58 LIQA--ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L Q +S L K D V T GPG L VG G++L+ AW VP + VHHM+ H Sbjct: 88 LQQTINVSSDSQLRRKP-DFVCSTRGPGFRSNLFVGLDTGKALSVAWQVPFVGVHHMQAH 146 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 LL P L P EFPF+++L+SGGHT L+ + I +E++ ++D A GEA DK A+ + Sbjct: 147 LLTPRLPITP-EFPFLSILISGGHTMLVKSSSITDHEIMASTVDRALGEALDKAAREIIP 205 Query: 176 DY--------PGGPLLSKMA----------------------AQGTAGRFVFPRPMTDRP 205 + G LL + A + + F P Sbjct: 206 PFLLQTSKSTMYGKLLEEFAFPNGKADYADYQAPKSRHDELIPRENPWGWSFTEPWAHSR 265 Query: 206 GLDFSFSGLKT-----FAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALD---- 256 L +SF + + F+A + R +AR + L + AL+ Sbjct: 266 QLQYSFCFIGSTLARIFSAREAAGQTISHEERIALAREAMRTSFEHLASRTIMALESLAK 325 Query: 257 ---QTGFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGM 310 + K LV++GGV+AN+ L L + R + P CTDN AMIA+AGM Sbjct: 326 QGPEKEVKTLVVSGGVAANQYLMTVLRSWLDARGFGHVGLVAPPPYLCTDNAAMIAWAGM 385 Query: 311 VRFKAGATADLGVSVRPRWPLA 332 F+AG +L +W L Sbjct: 386 EMFEAGWRTNLTSRAIRKWSLD 407 >UniRef50_A5DGU9 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DGU9_PICGU Length = 408 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. Identities = 110/376 (29%), Positives = 177/376 (47%), Gaps = 44/376 (11%) Query: 2 RVLGIETSCDETGIAIYD--DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 RVL IE+SCD+ IA+ D D K + +Q+ S + A GGV+P A H + Sbjct: 24 RVLAIESSCDDACIALLDRKDGKTTVIDQVKSTLNSVA-AGGVIPTEAHGFHQYQIASQA 82 Query: 60 QAALKESGLTAKD-IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 ++ +++++ D + T GPG+VG+L G + L+ AWD P + VHHM GHL+ Sbjct: 83 SQFFQKHKISSQNSPDLICCTRGPGMVGSLSAGLQFAKGLSVAWDKPLVGVHHMLGHLMI 142 Query: 119 PMLE-----DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 L + PP FPF++LL SGGHT L+ + ++++L ++D A G+A DK A+ L Sbjct: 143 ASLTSESQTNPPPRFPFLSLLCSGGHTMLVLSESLAKHQVLVNTVDIACGDALDKCARKL 202 Query: 174 GL-DYPGGPLLS---------------KMAAQGTAGRFVF--------PRPMTDRPGLDF 209 GL G L K+ F F P+ + + F Sbjct: 203 GLKGNMLGKELETFVNSFSKEELDEFTKIKTHTRDNPFNFQLKLPMRSPKHPRNAESVQF 262 Query: 210 SF-SGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQT-----GFKRL 263 SF S L T A + + +A + + + ++ + K A+D+ + Sbjct: 263 SFASFLSTLDAYSPPPGMEKSKVTKFLAFKVQQKIFEHIVDRIKLAVDKNETLFANVNDI 322 Query: 264 VMAGGVSANRTLRAK----LAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA-GAT 318 V++GGV++N TLR L + MK+ + CTDN MI AG+ ++ Sbjct: 323 VLSGGVASNSTLRRMLKDGLNDKMKRPNLNFHFPEIALCTDNAIMIGVAGIEIYENLNVV 382 Query: 319 ADLGVSVRPRWPLAEL 334 +DL ++ +WPL +L Sbjct: 383 SDLSITPIRKWPLDQL 398 >UniRef50_Q93170 Protein C01G10.10, confirmed by transcript evidence n=3 Tax=Caenorhabditis RepID=Q93170_CAEEL Length = 421 Score = 290 bits (742), Expect = 6e-77, Method: Composition-based stats. Identities = 101/349 (28%), Positives = 176/349 (50%), Gaps = 22/349 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +VLGIETSCD+T +AI ++++ +L+++ Y++ + GG+ P + + H LI+ Sbjct: 24 KVLGIETSCDDTAVAIVNEKREILSSERYTERAIQRQQGGINPSVCALQHRENLPRLIEK 83 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L ++G + KD+DAVA T PGLV AL G + A +P IPVHHM H L+ +L Sbjct: 84 CLNDAGTSPKDLDAVAVTVTPGLVIALKEGISAAIGFAKKHRLPLIPVHHMRAHALSILL 143 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK---LLGLDYP 178 D+ FPF A+L+SGGH + + +++L G+S+ + GE DK A+ LG ++ Sbjct: 144 VDDSVRFPFSAVLLSGGHALISVAEDVEKFKLYGQSVSGSPGECIDKVARQLGDLGSEFD 203 Query: 179 G---GPLLSKMAAQGTA-GRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT--- 231 G G + +A++ +A G +P + + P + +F +K N + + +T Sbjct: 204 GIHVGAAVEILASRASADGHLRYPIFLPNVPKANMNFDQIKGSYLNLLERLRKNSETSID 263 Query: 232 RADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 D + ++ V + K + + K+LV+ GGV+AN+ + ++++ Sbjct: 264 IPDFCASLQNTVARHISSKLHIFFESLSEQEKLPKQLVIGGGVAANQYIFGAISKLSAAH 323 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 CTDN MIAY+G++ + A W ++P Sbjct: 324 NVTTIKVLLSLCTDNAEMIAYSGLLMLVNRSEAIW-------WRPNDIP 365 >UniRef50_Q7NB15 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma gallisepticum RepID=GCP_MYCGA Length = 321 Score = 289 bits (739), Expect = 1e-76, Method: Composition-based stats. Identities = 101/313 (32%), Positives = 160/313 (51%), Gaps = 18/313 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCD+ IAI D K ++ + S +HA+YGGVVPE+A+R H + A Sbjct: 7 ILGIESSCDDLSIAIAIDNK-IVTTKTKSSSSVHANYGGVVPEIAARYHEEILHQTLNEA 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ LT ID + YT PGL+ L V +L + +PA ++H+ GH+ +PM++ Sbjct: 66 LTEANLTINKIDLITYTENPGLLNCLHVAKVFANTLGYLLKIPAQGINHLYGHIFSPMID 125 Query: 123 DNPP-------EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 D +P + ++VSGGHT + V + LL E++DDA GE +DK + LGL Sbjct: 126 DGDCLYQKSDLIYPALGIVVSGGHTAIYDVQSPSKITLLDETLDDAIGEVYDKVGRALGL 185 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR-DNGTDDQTRAD 234 YP G + ++ A F + FS+SG K+ I + D Sbjct: 186 QYPAGAKIDQLYNPEQAETVEF---LKTNKLSAFSYSGFKSAVLRYIELNKNQPDFNLVQ 242 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFK--RLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 +F+ ++D + + K +++ K +++ GGVSAN LR++L E+ + Sbjct: 243 AVSSFQKFIIDDFIDRIKNVINKADSKYQTILLGGGVSANSYLRSELKELA----IKTLV 298 Query: 293 ARPEFCTDNGAMI 305 +P + DN AMI Sbjct: 299 PKPIYSGDNAAMI 311 >UniRef50_B6GZQ3 Pc12g05880 protein n=9 Tax=Trichocomaceae RepID=B6GZQ3_PENCW Length = 457 Score = 287 bits (736), Expect = 3e-76, Method: Composition-based stats. Identities = 116/407 (28%), Positives = 178/407 (43%), Gaps = 75/407 (18%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLL--ANQLYSQVKLHAD---YGGVVPELASRDHVRKT 55 + L IETSCD+T +AI + K A +++ + AD + G+ P +A H Sbjct: 34 LLTLAIETSCDDTSVAIVEKTKKESGSAAKIHFLENVTADTRAHRGIHPIIALESHQDNL 93 Query: 56 VPLIQAALK-------ESGLTAKD------IDAVAYTAGPGLVGALLVGATVGRSLAFAW 102 L+Q AL GL D D ++ T GPG+ L VG G++L+ AW Sbjct: 94 ATLVQKALNYLPESKTSDGLKLADGTRRRLPDFISATRGPGMRSNLSVGLDTGKALSVAW 153 Query: 103 DVPAIPVHHMEGHLLAPMLEDN------------PPEFPFVALLVSGGHTQLISVTGIGQ 150 +P + VHHM+ HLL P L PEFPF+++LVSGGHT L+ GI Sbjct: 154 QIPMVGVHHMQAHLLTPGLVTCLENASKAGPPAIAPEFPFLSILVSGGHTTLVQSKGITD 213 Query: 151 YELLGESIDDAAGEAFDKTAKLLGLD-------------------YPGGPL--------- 182 +++L S D A GEA DK+A+ + D +P G Sbjct: 214 HKILATSEDIAIGEALDKSARDILPDSLLQEAKSTMYGKNLEQFVFPNGKADFADYSPPD 273 Query: 183 --LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD----DQTRADIA 236 ++ + + + P + + FSFS + + ++ +GT+ R D+ Sbjct: 274 TRGQEITKRVSDWGWSLTTPFANTRMMQFSFSSISSMVGKIVQRSGTNIKMSHAERVDLG 333 Query: 237 RAFEDAVVDTLMIKCKRALD--------QTGFKRLVMAGGVSANRTLRAKLAEMMKKRR- 287 R + L + AL+ + K LV++GGV+AN+ L L ++ R Sbjct: 334 REAMRVCFEHLASRTVIALETLRPHNTGKDEIKTLVVSGGVAANQFLMKVLTSFLEVRGF 393 Query: 288 --GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + P CTDN AMI +AG+ F+AG +DL +W L Sbjct: 394 GNINIVAPPPYLCTDNAAMIGWAGIEMFEAGFRSDLSCRPLRKWTLD 440 >UniRef50_UPI000023E24C hypothetical protein FG06887.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023E24C Length = 1434 Score = 287 bits (736), Expect = 3e-76, Method: Composition-based stats. Identities = 120/405 (29%), Positives = 174/405 (42%), Gaps = 75/405 (18%) Query: 1 MRVLGIETSCDETGIAIYDDE---KGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 + L IETSCD+TG+A+ LL N+ S + + G+ P +A++ H P Sbjct: 1015 LTTLAIETSCDDTGVAVLRHTSQSTELLFNERISSD--NRAFKGIHPIVAAKGHSVSLAP 1072 Query: 58 LIQAALKE---------------SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAW 102 L++ AL SG+ + D V+ T GPG+ L +G + + LA AW Sbjct: 1073 LVRRALNALPAAEDGDNKRICYASGVRKQVPDFVSVTRGPGMRSNLGIGLDMAKGLAVAW 1132 Query: 103 DVPAIPVHHMEGHLLAPML-------------EDNPPEFPFVALLVSGGHTQLISVTGIG 149 DVP + VHHM+ H L P L PEFPF++LLVSGGHTQL+ TG+ Sbjct: 1133 DVPLVGVHHMQAHALTPRLARALGMSMGEAEESRKGPEFPFLSLLVSGGHTQLVHSTGLT 1192 Query: 150 QYELLGESIDDAAGEAFDKTAKLL-------------------GLDYPGGPLL------- 183 + ++ S D A G D+TA+ + +P G Sbjct: 1193 DHSIIATSGDIAIGNLLDQTARDILPSEVFDASEHVMYGRLLEAFAFPTGADTTSAYEAV 1252 Query: 184 --------SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI-RDNGTDDQTRAD 234 +M T + P P L FSFS + T + R Sbjct: 1253 FTPPASRSEEMTPVSTGYDWNIPTPFRQSRKLAFSFSSIYTHVHDIATARPSMSTSERRA 1312 Query: 235 IARAFEDAVVDTLMIKCKRALDQTG----FKRLVMAGGVSANRTLRAKLAEMMKKR---R 287 +A+ A L + ALD K LVMAGGV++N+ L L M+ R Sbjct: 1313 LAQHTMMAAFVHLAGRLCIALDDKPELQAAKTLVMAGGVASNKFLMHVLRSMLAIRGYEG 1372 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 E+ E CTDN AMIA+ G+ F+AG ++L ++ +WP+ Sbjct: 1373 IEIVAPPVELCTDNAAMIAWTGIEMFQAGYESELSITGIGKWPMD 1417 >UniRef50_B5Y892 O-sialoglycoprotein endopeptidase n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y892_COPPD Length = 316 Score = 287 bits (736), Expect = 3e-76, Method: Composition-based stats. Identities = 110/328 (33%), Positives = 172/328 (52%), Gaps = 20/328 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L IETSCDET +A + +K ++ ++++SQ+ LH +GGV+PE A+R H+ L++ Sbjct: 4 RILAIETSCDETAVACLNGDK-VVQSKVFSQIDLHEAFGGVLPEAAARRHLEVLPVLLKD 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 D +A TAGPGL+ ALL G +V L+ W VP + ++H+ H+ A L Sbjct: 63 V--------AKPDLIAVTAGPGLLPALLTGVSVALGLSRGWQVPVMGINHVVAHVAAAAL 114 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E E P + L+VSGGHT + ++LG + DDAAGE DK + LG+ YP G Sbjct: 115 ERR-IELPVLGLVVSGGHTSFYLIEKWSDPKVLGWTYDDAAGECLDKVGRALGMKYPAGA 173 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 + +A R P P+ + +FSFSGLKT A + +A + + Sbjct: 174 EIDNLAL-TIKERVTMPLPLKNEDSFNFSFSGLKTAALKY-----KGKISNEVLAASLME 227 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 AVV+ L+ + ++ L + + LV+ GGVSA++ LR ++ E +R V + + TDN Sbjct: 228 AVVNHLLDRIEKVLKKYPYP-LVVGGGVSASKFLRQRMHEHFGER---VIFPSAQLSTDN 283 Query: 302 GAMIAYAGMVRFKAGATADLGVSVRPRW 329 M+A + + G V+ P Sbjct: 284 ADMVAVYAALLLQEGIVPGSCVTPDPNM 311 >UniRef50_Q6L4N8 Os05g0194600 protein n=21 Tax=Eukaryota RepID=Q6L4N8_ORYSJ Length = 380 Score = 280 bits (717), Expect = 4e-74, Method: Composition-based stats. Identities = 104/346 (30%), Positives = 169/346 (48%), Gaps = 22/346 (6%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LG+E+S ++ GI + +L+N ++ + G +P + H+ +PL++AAL Sbjct: 17 LGLESSANKIGIGVVSLSGEILSNPRHTY--VTPPGHGFLPRETAHHHLAHLLPLLRAAL 74 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 E+G+T D+ V YT GPG+ L V A R+L+ W P + V+H H+ Sbjct: 75 GEAGVTPADLACVCYTKGPGMGAPLQVAAAAARALSLLWGKPLVGVNHCVAHVEMGRAVT 134 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYPGGP 181 + V L VSGG+TQ+I+ G+Y + GE+ID A G D+ A++L L D G Sbjct: 135 GAVDP--VVLYVSGGNTQVIAY-SEGRYRIFGETIDIAVGNCLDRFARVLELSNDPSPGY 191 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFA-ANTIRDNGTDDQTRADIARAFE 240 + ++A +G P + G+D SFSG+ +F A I ++ T AD+ + + Sbjct: 192 NIEQLAKKG-EKFIDLPYVV---KGMDVSFSGILSFIEATAIEKLKNNECTPADLCYSLQ 247 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + + L+ +RA+ K +++ GGV N L+ + M +R G +F +C D Sbjct: 248 ETLFAMLVEITERAMAHCDSKDVLIVGGVGCNERLQEMMRIMCSERGGRLFATDDRYCID 307 Query: 301 NGAMIAYAGMVRFKAGATADLGV----------SVRPRWPLAELPA 336 NGAMIAY G++ + G T L V W E+P Sbjct: 308 NGAMIAYTGLLAYAHGMTTPLEESTFTQRFRTDEVHAIWREKEMPV 353 >UniRef50_B9WFF4 Metalloprotease, putative n=8 Tax=Saccharomycetales RepID=B9WFF4_CANDC Length = 440 Score = 277 bits (709), Expect = 4e-73, Method: Composition-based stats. Identities = 106/396 (26%), Positives = 170/396 (42%), Gaps = 63/396 (15%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVK--LH-ADYGGVVPELASRDHVRKTVPL 58 RV+ IE+SCD++ +A+ + ++ Q K LH AD GG++P A H+ + Sbjct: 25 RVMAIESSCDDSCVALLEKSHPDTPPKIIDQFKRTLHSADIGGILPTAAYNYHMATIANM 84 Query: 59 IQAALKESGLT-AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +Q + ++ D + T GPG+ G+L + L+ AW VP I VHHM GHLL Sbjct: 85 VQEFCSKHQISALNPPDLLCVTRGPGMAGSLSTSTEFAKGLSVAWGVPLIGVHHMLGHLL 144 Query: 118 APML------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 L + PP++PF++LL SGGHT L+ + ++E++ D A G++ DK A+ Sbjct: 145 TANLPKSEQPDSPPPKYPFLSLLCSGGHTMLVLSKSLTEHEIVVNVGDIAVGDSLDKCAR 204 Query: 172 LLGL-DYPGGPLLSKM------------------AAQGTAGRFVFPRPMT-------DRP 205 LG+ G L K F P + + Sbjct: 205 ELGMYGNMLGKELEKYINSIPEETRNRYEKLSVNTRIANPYNFRLTLPYSAPKYGIPEDV 264 Query: 206 GLDFS--FSGLKTFAANTIRDNG-------TDDQTRADIARAFEDAVVDTLMIKCKRALD 256 FS S ++ + A +G D++T+ IA ++ + D ++ + A Sbjct: 265 KFAFSHFLSNIQEYKAMHYNKSGGGEIDVALDEETKQFIAYKTQEFIFDHIVDRINIAFK 324 Query: 257 QTGF------------KRLVMAGGVSANRTLRAKLAEMMKKR-----RGEVFYARPEFCT 299 + G K + +GGV+AN+ LR KL E + + + CT Sbjct: 325 KHGIKNRNSDGTFIGVKDFICSGGVAANKRLREKLRENLDFQEIGADNVNFHFPDLSLCT 384 Query: 300 DNGAMIAYAGMVRFKA-GATADLGVSVRPRWPLAEL 334 DN MI AG+ F+ DL +WPL +L Sbjct: 385 DNAIMIGAAGIEIFEKLRLRTDLSFLPIRKWPLNKL 420 >UniRef50_Q18KI0 Putative O-sialoglycoprotein endopeptidase n=14 Tax=Euryarchaeota RepID=GCP_HALWD Length = 533 Score = 275 bits (704), Expect = 2e-72, Method: Composition-based stats. Identities = 98/329 (29%), Positives = 145/329 (44%), Gaps = 19/329 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIE + A+Y+ + + D GG+ P A+ +I Sbjct: 1 MRILGIEGTAWAASAALYNTHDETI---VIESDPYQPDSGGLHPREAAEHMSTALPEVIS 57 Query: 61 AALKES----GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ + A IDA+A++ GPGL L V T R+L A VP I V+HM HL Sbjct: 58 TILERAVSSGNTDAIGIDAIAFSRGPGLGPCLRVVGTAARTLTQALSVPLIGVNHMIAHL 117 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 + V L SG + L+ QY++LGE++D G A DK + LG + Sbjct: 118 EIGRHQSGFTTP--VCLNASGANAHLLGYHRR-QYQVLGETMDTGVGNAIDKFTRHLGWN 174 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIA 236 +PGGP + A G+ + G+DFSFSG+ + A + + ++ D+ Sbjct: 175 HPGGPKVEAAATDGSYHDLPY-----VVKGMDFSFSGIMSAAKDAV----DNEVPVVDVC 225 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 ++ + L +RAL TG LV+ GGV N LR L+ M R + Sbjct: 226 TGLQETIFAMLTEVAERALSLTGSNELVLGGGVGQNDRLREMLSTMCTARGASFYAPESR 285 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSV 325 F DN MIA G ++AG T + S Sbjct: 286 FLRDNAGMIAVLGAAMYEAGQTISVNDSA 314 >UniRef50_Q2GXN6 Putative glycoprotein endopeptidase KAE1 n=18 Tax=Eukaryota RepID=KAE1_CHAGB Length = 356 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 100/345 (28%), Positives = 163/345 (47%), Gaps = 22/345 (6%) Query: 4 LGIETSCDETGIAIY---DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 LG E S ++ GI + D +L+N ++ V G +P+ ++ H V + + Sbjct: 14 LGCEGSANKLGIGVILHEGDTSTVLSNVRHTFVSPAGT--GFLPKDTAQHHRAFFVRVAK 71 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++G+ DID + YT GPG+ G L A R+LA W + V+H GH+ Sbjct: 72 QALSDAGIRIADIDCICYTRGPGMGGPLASVAVAARTLALLWGKELVGVNHCVGHIEMGR 131 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYP 178 V L VSGG+TQ+I+ +Y + GE++D A G D+ A+ L + D Sbjct: 132 TITGADHP--VVLYVSGGNTQVIAYAEQ-RYRIFGETLDIAVGNCLDRFARALNISNDPA 188 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD--------Q 230 G + +A +G GR + P + G+D SFSG+ T A ++ Sbjct: 189 PGYNIEVLARKG--GRVLLDLPYAVK-GMDCSFSGILTRAEELAAQMKANEGKGTDGEPF 245 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 T AD+ + ++ V L+ +RA+ G ++++ GGV N L+ + M R G V Sbjct: 246 TGADLCFSLQETVFAMLVEITERAMAHVGSNQVLIVGGVGCNERLQEMMGLMAADRGGSV 305 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSV-RPRWPLAEL 334 + FC DNG MIA+AG++ ++ G + S R+ E+ Sbjct: 306 YATDERFCIDNGIMIAHAGLLAYETGFRTPIEESTCTQRFRTDEV 350 >UniRef50_Q83I95 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Tropheryma whipplei RepID=GCP_TROW8 Length = 401 Score = 274 bits (702), Expect = 3e-72, Method: Composition-based stats. Identities = 126/399 (31%), Positives = 176/399 (44%), Gaps = 68/399 (17%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCDETG+ I +LAN++ S H +GGV+PE+A+R H+ L++ Sbjct: 3 IILGIETSCDETGVGIV-SGSTVLANEVASSSLRHKPFGGVIPEIAARAHLEYLPNLLEL 61 Query: 62 ALKESGLTAKDIDA--------------VAYTAG---------PGLVGALLVGATVGR-- 96 AL+ + L KDID V +A P LVG V Sbjct: 62 ALETAQLCIKDIDGIAVTAGPGLVTSLSVGVSAAKALGLSTGTPVYGVNHLVGHAVSAFL 121 Query: 97 ------SLAFAWDVPAIPVHHMEG----------------H----LLAP--MLEDNPPEF 128 L +I + +E H + P + + ++ Sbjct: 122 DDYTNDGLGVIHRRDSIGSNGIENDASSTHSHTHTTQVNRHSNLCVYTPPRRVLRDVCKY 181 Query: 129 PFV----ALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLS 184 V LL SGGH+ L+ + + LLGE++DDAAGEAFDK A+L+GL YPGGP + Sbjct: 182 MHVRDSVVLLASGGHSCLLKIHN-NKISLLGETLDDAAGEAFDKIARLMGLQYPGGPAIE 240 Query: 185 KMAAQGTAGRFVFPRPMT----DRPGLDFSFSGLKTFAANTIRDNGTDDQ----TRADIA 236 +A+ G FPR + + FSFSGLKT + ++ DIA Sbjct: 241 MLASSGNPNAVEFPRALLTHFEEHNRYSFSFSGLKTAVGRVVERIKSNPAHSIPKIEDIA 300 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F++AV D L K A + +VM GGV+AN +R L E K +V Sbjct: 301 ASFQEAVADVLTAKTVAAALASDVDLIVMGGGVAANNRIREMLCERAKIHGLDVKIPPIA 360 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLG-VSVRPRWPLAEL 334 CTDNGAMIA AG + G S PL ++ Sbjct: 361 LCTDNGAMIAAAGSWLMQLGYNPSHSRFSPVSIMPLTQM 399 >UniRef50_B7QJD9 O-sialoglycoprotein endopeptidase, putative n=3 Tax=Arthropoda RepID=B7QJD9_IXOSC Length = 309 Score = 273 bits (700), Expect = 4e-72, Method: Composition-based stats. Identities = 102/289 (35%), Positives = 142/289 (49%), Gaps = 30/289 (10%) Query: 73 IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVA 132 + A+A T PG+ +LLVG R LA P IP+HHME H LA L +FP++ Sbjct: 1 MSAIAVTVRPGMSLSLLVGLNFARRLAAKHGKPLIPIHHMEAHALAVRLVQR-VDFPYLV 59 Query: 133 LLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-------DYPGGPLLSK 185 LLVSGGH QL V I + LLG+++DDA GE FDK A+ L L GG L Sbjct: 60 LLVSGGHCQLAVVRDIDDFLLLGQTMDDAPGETFDKVARRLKLSNLPECRGLSGGRALEF 119 Query: 186 MAAQ--GTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD--------QTRADI 235 +A + G + FP P+T +FSFSGLK I + AD+ Sbjct: 120 LAERDSGNPLAYRFPEPLTSYRTCNFSFSGLKNSVYRKIEALEKEHGLEADALLPEIADL 179 Query: 236 ARAFEDAVVDTLMIKCKRALDQT--------GFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 + + AV L + +RAL G LV+AGGV+AN L L+++ +K Sbjct: 180 CASTQHAVAYHLTRRTQRALAFCDQQGLLPEGKPTLVVAGGVAANAYLGRLLSQLCEKLD 239 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGA----TADLGVSVRPRWPLA 332 P+ C+DNG MIA+ G+ R++A + + + + PR PL Sbjct: 240 VAYVPTPPKLCSDNGLMIAWNGVERWRAASGIVTESFDSLDITPRCPLG 288 >UniRef50_A3CXS0 Putative O-sialoglycoprotein endopeptidase n=5 Tax=Euryarchaeota RepID=GCP_METMJ Length = 527 Score = 273 bits (700), Expect = 5e-72, Method: Composition-based stats. Identities = 95/337 (28%), Positives = 150/337 (44%), Gaps = 25/337 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLG+E + A++ D+ L + GG+ P A++ H ++ Sbjct: 10 LVLGLEGTAWNLSAALFGDDLVALHSS-----PYVPPKGGIHPREAAQHHASAMKEVVSR 64 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E + I AVA++ GPGL +L AT R+L+ A DVP + V+H H+ Sbjct: 65 VLTE----PERIRAVAFSQGPGLGPSLRTVATAARALSIALDVPLVGVNHCVAHVEIGRW 120 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + + L SG +TQ++ G+Y + GE++D G DK A+ L +PGGP Sbjct: 121 ATGFSDP--IVLYASGANTQVLGYLN-GRYRIFGETLDIGLGNGLDKFARSHDLPHPGGP 177 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 + ++A +G P T + G+D +FSGL + A D+ ++ Sbjct: 178 AIERLAREGNY----IELPYTVK-GMDLAFSGLVSAAQ-------ESSAPLEDVCFGLQE 225 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 + +RAL G +++ GGV AN L+ L M ++R F DN Sbjct: 226 TAFAMCVEVTERALAHAGKDEVLLVGGVGANGRLQEMLRVMCEERGAAFAVPERTFLGDN 285 Query: 302 GAMIAYAGMVRFKAGATADLGVS-VRPRWPLAELPAA 337 GAMIAY G + + G L S +RP + E+ A Sbjct: 286 GAMIAYTGKIMLEHGVVLPLDQSQIRPGYRADEVEVA 322 >UniRef50_Q2HG58 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2HG58_CHAGB Length = 1550 Score = 272 bits (697), Expect = 1e-71, Method: Composition-based stats. Identities = 107/406 (26%), Positives = 170/406 (41%), Gaps = 77/406 (18%) Query: 3 VLGIETSCDETGIAIYDDEK---GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 L IETSCD+T + + + +L N + + +GG+ P+ A + H ++ Sbjct: 1068 TLAIETSCDDTCVTVLEKSGDAARVLFNAKVTSD--NRRFGGIKPDEAVQGHSSSLPGIV 1125 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 QAA+++ D ++ T GPG+ AL +G T+ + LA AWD P + VHHM+ H L P Sbjct: 1126 QAAIQKLPADRPKPDFISVTRGPGITSALSIGLTMAKGLAVAWDRPLVAVHHMQAHALTP 1185 Query: 120 MLED--------------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEA 165 L + P +PF++LLVSGGH+QL+ + L E+ + A G+ Sbjct: 1186 RLVEALANGQQQPPHQGGARPAYPFLSLLVSGGHSQLLLTRSAVSHATLAEAANVAIGDM 1245 Query: 166 FDKTAKLL--------GLDYPGGPLLSKMAAQGTAGR----------------------- 194 DK A+ + D L + A T + Sbjct: 1246 LDKCARAILPSDILASTPDVMYAAELERFAFAPTPTQTQTHQTQTQHPSNPYTNYHPPTT 1305 Query: 195 --------------FVFPRPMTDRPGLDFSFSGLKTFAANTI-RDNGTDDQTRADIARAF 239 + P+ +R + F FSGL + R+ D RA++AR Sbjct: 1306 RRDEIRPYTSPTHGWTLTPPLHERRDMAFDFSGLGGQVQAIMQRNPSMDPPQRAELARET 1365 Query: 240 EDAVVDTLMIKCKRALD-------QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG---- 288 + L + ALD + LV++GGV+AN L L ++ R Sbjct: 1366 MRVAFEHLASRVIFALDGMRTQAAALPVRTLVVSGGVAANGFLMHVLGRVLAVRGYGPEK 1425 Query: 289 -EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 V CTDN M+A+AG+ ++AG ++L V R RW + + Sbjct: 1426 VAVVRPPRGLCTDNAVMVAWAGVEMWEAGWESELSVLPRRRWEMDD 1471 >UniRef50_C8V9Q8 PROBABLE O-SIALOGLYCOPROTEIN ENDOPEPTIDASE (AFU_orthologue; AFUA_7G05240) n=2 Tax=Emericella nidulans RepID=C8V9Q8_EMENI Length = 497 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. Identities = 105/440 (23%), Positives = 159/440 (36%), Gaps = 107/440 (24%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVP 57 + L IETSCD+T +AI A +++ + D Y G+ P A H + Sbjct: 32 LLTLAIETSCDDTSVAIVHKNDKSGAAKIHFLENITPDLTAYQGIHPVRALESHQQNVAK 91 Query: 58 LIQAALKESGLTAKD------------------IDAVAYTAGPGLVGALLVGATVGRSLA 99 L+ AL ++ + D ++ T GPG+ L G + LA Sbjct: 92 LVNKALSHLPYSSAESQNDPTKIVSLGDGNRQKPDFISVTRGPGMRSNLFAGLDTAKGLA 151 Query: 100 FAWDVPAIPVHHMEGHLLAPMLEDN---------------------PPEFPFVALLVSGG 138 AW VP + VHHM+ HLL P L P FPF+++L SGG Sbjct: 152 VAWQVPFVGVHHMQAHLLTPRLVSALALSPGSSPNNTDRQNEKGELQPAFPFLSILASGG 211 Query: 139 HTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL-------------------GLDYPG 179 HT L++ + + + +L + D A GEA DK A+ + +P Sbjct: 212 HTLLVNSSSLTDHRILATTTDVALGEALDKAAREILPSSLLSTSKNTMYGKLLEQYAFPN 271 Query: 180 GPL--LSKMAAQGTAGR-----------FVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG 226 G +A + + P L FSF+ L T +T+ Sbjct: 272 GRADYADYVAPKSRGDEIAVSKVVSKYGWSLTTPYAQTRELAFSFAFLATAVNHTLAKAR 331 Query: 227 T-------DDQTRADIARAFEDAVVDTLMIKCKRALDQT--------------------- 258 D+ R +AR + L + AL+ Sbjct: 332 KRAGETGLSDEERVFLAREVMRVTFEHLASRTIIALESLCQWVPLVPNNPNDKRQKPLPS 391 Query: 259 --GFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRF 313 LV++GGV+AN+ L L + R V CTDN AM+ +AG+ F Sbjct: 392 SVPVSTLVVSGGVAANKFLMHVLRTWLDGRGFGHVGVVAPPISLCTDNAAMVGWAGIEMF 451 Query: 314 KAGATADLGVSVRPRWPLAE 333 +AG + +W L E Sbjct: 452 EAGWRSAFEARALRKWGLEE 471 >UniRef50_B8MFK9 Glycoprotease family protein, putative n=5 Tax=Leotiomyceta RepID=B8MFK9_TALSN Length = 883 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. Identities = 112/441 (25%), Positives = 169/441 (38%), Gaps = 106/441 (24%) Query: 1 MRVLGIETSCDETGIAIYDDE-----------KGLLANQLYSQVKLHAD---YGGVVPEL 46 + L IE+SCD+T +AI + + G A +++ + AD Y G+ P Sbjct: 426 LLTLAIESSCDDTSVAIVEKDSFHKSFETPRHTGHAAAEVHFLENITADTRKYRGIHPIE 485 Query: 47 ASRDHVRKTVPLIQAALKESGLTAKDI--------------------------DAVAYTA 80 A + H L+Q A++ A+D + ++ T Sbjct: 486 ALQSHQENLAKLVQKAVRSLPPVAEDYSPEDGAVISHIIPKNKNGKSTRHRLPNFISVTR 545 Query: 81 GPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML----------EDNPPEFPF 130 GPG+ L VG + LA AW +P + VHHM+ HLL P L +D P FPF Sbjct: 546 GPGMRSNLSVGLDTAKGLAVAWQIPLVGVHHMQAHLLTPRLVSALNRSVLTDDLQPNFPF 605 Query: 131 VALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL----------------- 173 +++LVSGGH+ L+ + ++E+L + D A GE DK+A+L+ Sbjct: 606 LSILVSGGHSMLVHSKSLLEHEILATTADIAIGETLDKSARLILPESVLESANTTMYGKL 665 Query: 174 --GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT--------------- 216 +PGGP + D G F+ T Sbjct: 666 LEKFAFPGGPADYADYQALKTRGEEVVKRDNDTWGWSFTTPYANTRDLKFSFSSVSSTVS 725 Query: 217 --FAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQ---------------TG 259 A D R +AR + L + AL+ Sbjct: 726 RIMANKEKADVRVTRDERVALARESMRVCFEHLASRTLIALELLRKQLRKQYNTSGSGQE 785 Query: 260 FKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 LV++GGV+AN+ L L + R +V P CTDN AMI +AG+ F+AG Sbjct: 786 IDTLVVSGGVAANQFLMTVLRAFLDVRGFSHIKVIAPPPYLCTDNAAMIGWAGIEMFEAG 845 Query: 317 ATADLGVSVRPRWPLAELPAA 337 + DL +W L P+A Sbjct: 846 YSTDLSCRAIRKWTLD--PSA 864 >UniRef50_Q4U8J6 Glycoprotease, putative n=2 Tax=Theileria RepID=Q4U8J6_THEAN Length = 630 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. Identities = 94/335 (28%), Positives = 172/335 (51%), Gaps = 31/335 (9%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IETS D+T IA+ + +L+++ SQ ++ +YGG+ P A +H++K L Sbjct: 97 NILSIETSFDDTCIAVVRSDGKILSDKKLSQEEVVKEYGGIKPVCAKLEHIKKIESLTDK 156 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 ++ESGL +DID +A T GPG L VG + L+ + +P + +H+ GH L+P++ Sbjct: 157 VIEESGLKIQDIDEIAVTRGPGTELCLRVGYNYAKELSEKYKIPLVSENHIAGHCLSPLI 216 Query: 122 ED--------------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFD 167 ++ N +FP++ LL+SGGH+Q+ V ++ L+ E+ D+ G D Sbjct: 217 DEHQFKYTVEGTPIKSNDLKFPYLCLLLSGGHSQIYLVENPSKFHLMCETQDEFVGNVLD 276 Query: 168 KTAKLLGLDYP--GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFA----ANT 221 K AKLLGLD GG L K+A + + ++ P ++F FSG+++ Sbjct: 277 KCAKLLGLDLSKGGGAELEKIADEVSDSKYKLTIPNKYNHYMEFCFSGVQSQLGLKTEQL 336 Query: 222 IRDNGTDDQTR------ADIARAFEDAVVDTLMIKCKRALDQ----TGFKRLVMAGGVSA 271 ++ + +D R +++A + V + ++I+ + +L+ +L + GGV++ Sbjct: 337 VKSHNVEDAKRLPRKILSELAYGLQSTVFEGILIQLEMSLNAVETLFPINQLALVGGVAS 396 Query: 272 NRTLRAKLAEMMKKRRGEVFYARPE-FCTDNGAMI 305 N L+ + ++ R V ++ E F T M+ Sbjct: 397 NDKLKKMILDLFYLRDESVRFSEQEMFLTRTKNMV 431 Score = 43.3 bits (101), Expect = 0.013, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 30/64 (46%), Gaps = 7/64 (10%) Query: 275 LRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT---ADLGV---SVRPR 328 LR + + RR +++ ++CTDN MI ++ + + + G + + V PR Sbjct: 549 LRRH-GDEISDRRWDLYTTSKKYCTDNAVMIGFSLIQKNRMGIKEINSPEKINGKDVAPR 607 Query: 329 WPLA 332 W L Sbjct: 608 WDLG 611 >UniRef50_P36174 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Haloarcula marismortui RepID=GCP_HALMA Length = 548 Score = 270 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 97/349 (27%), Positives = 151/349 (43%), Gaps = 24/349 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLA----NQLYSQVKLHADYGGVVPELASRDHVRKTV 56 MR+LGIE + ++++ + D GG+ P A+ Sbjct: 1 MRILGIEGTAWAASASVFETPDPARVTDDDHVFIETDAYAPDSGGIHPREAAEHMGEAIP 60 Query: 57 PLIQAALKE----SGLTAKD---IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 +++ A++ +G D IDAVA+ GPGL L + AT R++A +DVP + V Sbjct: 61 TVVETAIEHTHGRAGRDGDDSAPIDAVAFARGPGLGPCLRIVATAARAVAQRFDVPLVGV 120 Query: 110 HHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKT 169 +HM HL V L SG + ++ G+Y +LGE++D G A DK Sbjct: 121 NHMVAHLEVGR--HRSGFDSPVCLNASGANAHILGYRN-GRYRVLGETMDTGVGNAIDKF 177 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 + +G +PGGP + + A G + G+DFSFSG+ + A + D Sbjct: 178 TRHIGWSHPGGPKVEQHARDGEYHELPY-----VVKGMDFSFSGIMSAAKQAV----DDG 228 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 ++ R E+ + L +RAL TG LV+ GGV N L+ L EM ++R E Sbjct: 229 VPVENVCRGMEETIFAMLTEVSERALSLTGADELVLGGGVGQNARLQRMLGEMCEQREAE 288 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPAA 337 + F DN MIA G + AG T + + + E+ Sbjct: 289 FYAPENRFLRDNAGMIAMLGAKMYAAGDTIAIEDSRIDSNFRPDEVAVT 337 >UniRef50_C4Y0N8 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y0N8_CLAL4 Length = 443 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. Identities = 103/374 (27%), Positives = 170/374 (45%), Gaps = 41/374 (10%) Query: 2 RVLGIETSCDETGIAIYDD---EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 RVL IE+SCD++ +++ + +LA A GG++P A H + L Sbjct: 43 RVLAIESSCDDSCVSLLEKKSPNGPVLAIDEIKATLSSAKVGGIIPTAAHEFHSAQISQL 102 Query: 59 IQAALKESGLTAKDI-DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 + ++ +++ + D + T GPG+VG+L + LA AW P + VHHM GHLL Sbjct: 103 VGEFCRKHEISSSNPPDLLCVTRGPGMVGSLSASIQFAKGLAVAWQRPLVGVHHMLGHLL 162 Query: 118 APML----EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 P L P++PF++LL SGGHT L+ + +E++ ++ D AAG++ DK A+ L Sbjct: 163 TPNLTVEGSSCGPQYPFLSLLCSGGHTMLVLSKSLTNHEIIIDTSDIAAGDSLDKCAREL 222 Query: 174 GL-DYPGGPLLSKMAAQGTA---GRFVFPRPMTDRPGLDFS------------------- 210 G GP L K A RF TD+ F Sbjct: 223 GFEGNMLGPELEKYVANIDPVTKERFAGINTNTDQNEFGFRLRMPMRTAKHKKIPDVIQF 282 Query: 211 -FSGLKTFAA--NTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG-----FKR 262 F+ + + ++QTR +A ++ + D ++ + A + + Sbjct: 283 GFASFLSSVEGFKMKSRDSWNEQTRQFVAFKLQEVLFDHIINRINVAFAKDPQKFALVRD 342 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRR-GEVFYARPEFCTDNGAMIAYAGMVRFKA-GATAD 320 V +GGV+AN+ LRAKL ++ + + P+ CTDN MI AG+ F+ + Sbjct: 343 FVCSGGVAANKVLRAKLMHNIRSASTLKFHFPAPKLCTDNATMIGNAGIDIFENLRLKSR 402 Query: 321 LGVSVRPRWPLAEL 334 L + +WPL ++ Sbjct: 403 LSMLPIRKWPLHDI 416 >UniRef50_A3MSX6 Putative O-sialoglycoprotein endopeptidase n=2 Tax=Pyrobaculum RepID=GCP_PYRCJ Length = 339 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 104/333 (31%), Positives = 166/333 (49%), Gaps = 15/333 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 ++G+E++ + + +L + + G+ P A+ H + L + Sbjct: 10 IIGVESTAHTFSLGLV-SGGRVLGQVGKTY--VPPAGRGIHPREAAEHHAKAAPQLFRKL 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++E ++ D++AVAY+AGPGL AL VGA R+LA VP +PVHH H+ Sbjct: 67 IEEFNVSLGDVEAVAYSAGPGLGPALRVGAVFARALAIKLGVPLVPVHHGVAHVEIARYA 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + + LL+SGGHT +++ G+Y + GE++D A G A D A+ +GL +PG P Sbjct: 127 TGSCDP--LVLLISGGHT-VVAGFSDGRYRVFGETLDVAIGNAIDMFAREVGLGFPGVPA 183 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 + K A + FP P+ G D S++GL T+A ++ + R+ + Sbjct: 184 VEKCA-EAAEELVAFPMPIV---GQDLSYAGLTTYALQLVKR----GIPLPVVCRSLVET 235 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 L +RAL T + LV+AGGV+ +R LR L E+ ++ EV + E+ DNG Sbjct: 236 AYYMLAEVTERALAFTKKRELVVAGGVARSRRLREILYEVGREHGAEVKFVPDEYAGDNG 295 Query: 303 AMIAYAGMVRFKAGATADLGVS-VRPRWPLAEL 334 AMIA G ++ G + G S VR RW L + Sbjct: 296 AMIALTGYYAYRRGIAVEPGESFVRQRWRLDTV 328 >UniRef50_P36132 Putative glycoprotein endopeptidase KAE1 n=40 Tax=Eukaryota RepID=KAE1_YEAST Length = 386 Score = 267 bits (683), Expect = 4e-70, Method: Composition-based stats. Identities = 102/371 (27%), Positives = 170/371 (45%), Gaps = 43/371 (11%) Query: 4 LGIETSCDETGIAIY---------------DDEKGLLANQLYSQVKLHADYGGVVPELAS 48 LG+E S ++ G+ I D E +L+N + V + G +P + Sbjct: 19 LGLEGSANKLGVGIVKHPLLPKHANSDLSYDCEAEMLSNIRDTYVTPPGE--GFLPRDTA 76 Query: 49 RDHVRKTVPLIQAALKESGLTAK--DIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPA 106 R H + LI+ AL E+ + + DID + +T GPG+ L R+ + WDVP Sbjct: 77 RHHRNWCIRLIKQALAEADIKSPTLDIDVICFTKGPGMGAPLHSVVIAARTCSLLWDVPL 136 Query: 107 IPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAF 166 + V+H GH+ E + P V L VSGG+TQ+I+ + +Y + GE++D A G Sbjct: 137 VGVNHCIGHIEMGR-EITKAQNP-VVLYVSGGNTQVIAYSEK-RYRIFGETLDIAIGNCL 193 Query: 167 DKTAKLLGLDYPG--GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGL---------- 214 D+ A+ L + G + ++A + + P T + G+D S SG+ Sbjct: 194 DRFARTLKIPNEPSPGYNIEQLAKKAPHKENLVELPYTVK-GMDLSMSGILASIDLLAKD 252 Query: 215 --KTFAANTI---RDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGV 269 K N I + G T D+ + ++ + L+ +RA+ ++++ GGV Sbjct: 253 LFKGNKKNKILFDKTTGEQKVTVEDLCYSLQENLFAMLVEITERAMAHVNSNQVLIVGGV 312 Query: 270 SANRTLRAKLAEMMKKR-RGEVFYARPEFCTDNGAMIAYAGMVRFKA-GATADLGVS-VR 326 N L+ +A+M K R G+V FC DNG MIA AG++ ++ G D + V Sbjct: 313 GCNVRLQEMMAQMCKDRANGQVHATDNRFCIDNGVMIAQAGLLEYRMGGIVKDFSETVVT 372 Query: 327 PRWPLAELPAA 337 ++ E+ AA Sbjct: 373 QKFRTDEVYAA 383 >UniRef50_C1GKA7 Glycoprotease pgp1 n=11 Tax=Onygenales RepID=C1GKA7_PARBD Length = 642 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 113/452 (25%), Positives = 169/452 (37%), Gaps = 123/452 (27%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHA-------DYGGVVPELASRDHVRK 54 ++ G D+T +AI EK +++ S + Y G+ P +A H Sbjct: 169 KIFGANKPFDDTSVAII--EKHGVSSPSRSSILFLENITADSRKYQGIHPAVALDSHQAN 226 Query: 55 TVPLIQAALKESGL----TAKDI----------------------DAVAYTAGPGLVGAL 88 T L+ AL L +A D+ D ++ T GPG+ L Sbjct: 227 TAKLVNKALAHLPLAQFPSANDVGRVICLPSSATDGITPHLRRKPDFISVTRGPGMRSNL 286 Query: 89 LVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE-----------------DNPPEFPFV 131 VG + L+ AW VP + VHHM+ HLL P L N P FPF+ Sbjct: 287 SVGLDTAKGLSVAWQVPIVGVHHMQAHLLTPRLAASLQQQQLQSSENSSAFRNSPSFPFM 346 Query: 132 ALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL------------------ 173 ++LVSGGHT L+ I +E+L + D A G+A DKTA++L Sbjct: 347 SILVSGGHTLLVHSKSIVDHEILASTSDSAIGDALDKTARMLLPQSFLAKSTTTMYGKML 406 Query: 174 -GLDYPGGPL------------LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAAN 220 +P GP + + + F P + ++FSFSG+ T A + Sbjct: 407 EEFAFPNGPSDYADYRPPATRGEELVKLKSERWGWSFGMPFAENRRMEFSFSGVTTRARD 466 Query: 221 TIRDNGT-------------DDQTRADIARAFEDAVVDTLMIKCKRALD----------- 256 + + R + ARAF L + AL Sbjct: 467 IYLNRRKQWEAAGNSGEGFMSNDERIEFARAFMTVCFGHLASRTIIALQELRRQQQQQQQ 526 Query: 257 -------------QTGFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTD 300 + L+++GGV AN+ L+ + R +V P CTD Sbjct: 527 QQQQQERENQSPPAEDIQSLIISGGVGANQFLKKLFRSYLDIRGFPHVDVIAPPPYLCTD 586 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 N AMI +AG+ F+AG +DL +W L Sbjct: 587 NAAMIGWAGIEMFEAGWRSDLRCRPLRKWTLD 618 >UniRef50_A8QDL6 Glycoprotease family protein n=1 Tax=Brugia malayi RepID=A8QDL6_BRUMA Length = 415 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 84/324 (25%), Positives = 148/324 (45%), Gaps = 21/324 (6%) Query: 3 VLGIET-SCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 V+GIET CD+T + I + ++ +L+++ Y+ ++ GG+ P + H + Sbjct: 33 VMGIETRHCDDTAVCILNSDRKILSSRRYADREVQKRLGGICPAAVADQHRSYIDLFVDE 92 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L ES + ++D +A T PGLV L VG SLA +P IPVHHM+ H L Sbjct: 93 CLDESRVRLCNLDGIAVTTQPGLVICLRVGTEKAISLARKGCIPLIPVHHMQAHATVATL 152 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP--- 178 +P+V++L+SGGH+ + G +E+L S+ + GE DK ++ L + P Sbjct: 153 MTE-IXYPYVSVLISGGHSIIAVTNGPDDFEVLLTSMCGSPGECMDKISRALHFEEPELL 211 Query: 179 ---GGPLLSKMAAQGT---AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 G L +A++ + R+ + L F+FS +KT I + Sbjct: 212 GLHPGAALEVIASRSSVDGYKRYPIDVNKFMKMALHFNFSWIKTTYLAMISRQSI--LSV 269 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTG--------FKRLVMAGGVSANRTLRAKLAEMMK 284 D + + ++ + L K L + + ++GGV++N+ + A+ + + Sbjct: 270 PDFCASVQHSIANYLAEKLSCCLQYLNDSNKIPSRNRLVFVSGGVASNKYILARFNNVCE 329 Query: 285 KRRGEVFYARPEFCTDNGAMIAYA 308 V+ +C DN MIA+ Sbjct: 330 PLGYSVYAPSQFYCCDNAEMIAWN 353 >UniRef50_B2A533 O-sialoglycoprotein endopeptidase n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A533_NATTJ Length = 322 Score = 253 bits (646), Expect = 9e-66, Method: Composition-based stats. Identities = 80/327 (24%), Positives = 152/327 (46%), Gaps = 24/327 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 +G++TSC T +A+ + + ++A + +++ GG+ A H+ + Sbjct: 1 MGLDTSCYTTSMAVINKQGKIIA-KTERPLEVAMGKGGLRQSEAVFQHINNLPQGLTEIK 59 Query: 64 KESGLTAKDID--AVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 K+ + D++ A+A ++ P + VG + ++L+ + +P + H EGH+ Sbjct: 60 KQLNVNNLDLNLAAIAVSSRPRPIEGSYMPVFKVGDSYAKALSLSSGIPLLEYTHQEGHI 119 Query: 117 LAPMLEDNPPE-----FPFVALLVSGGHTQLISVTGIGQY-----ELLGESIDDAAGEAF 166 + + E + F+ VSGG T+L+ G++ E++G + D AAG+ Sbjct: 120 ASIVYEKSNNIRLEDMDKFLVFHVSGGTTELLICHTKGKFSSFDIEIIGGTKDIAAGQLI 179 Query: 167 DKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG 226 D+TAKL+ L +PGGP L K+ Q P + D +FSG +T I + Sbjct: 180 DRTAKLMNLPFPGGPHLEKLGDQSGQTDISVPFSVEDTK---INFSGPETHIKRLIHN-- 234 Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 +D + +AR E V +L+ + AL + K ++ GGV +N ++ L++ + Sbjct: 235 -EDYPKPAVARGIEQCVAKSLLTVLENALKKHQVKNILFVGGVMSNSYIKNYLSKNISNE 293 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRF 313 + + + PE DN +A+ G F Sbjct: 294 KYNLIFGSPELSKDNAVGVAWLGYNNF 320 >UniRef50_UPI0000E8089C PREDICTED: similar to Osgepl1 protein n=1 Tax=Gallus gallus RepID=UPI0000E8089C Length = 513 Score = 250 bits (638), Expect = 6e-65, Method: Composition-based stats. Identities = 87/235 (37%), Positives = 124/235 (52%), Gaps = 8/235 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIETSCD+TG A+ D+ +L L SQ ++H GG++P +A + H +++ Sbjct: 110 LVLGIETSCDDTGAAVLDEAGTVLGEALQSQKEVHLKAGGIIPHVAQQLHRESIQQVVKE 169 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL SG++ ++ A+A T PGL +L VG L + P IP+HHME H L L Sbjct: 170 ALSASGVSVNELAAIATTVKPGLALSLEVGLQYSLQLVDRYQKPFIPIHHMEAHALTIRL 229 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 + EFPF+ LL+SGGH L G+ + LLG+SID A G+ DK A+ L L Sbjct: 230 TEQ-VEFPFLVLLLSGGHCILAVARGVSDFLLLGQSIDIAPGDMLDKVARRLSLVKHPEC 288 Query: 176 -DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 GG + +A G ++ F PM DFSFSGL++ I ++ Sbjct: 289 HGMAGGKAIEHLAQTGDWQQYTFRLPMQQYRNCDFSFSGLQSLVNKAILQKEKEE 343 >UniRef50_A8BDD4 O-sialoglycoprotein endopeptidase n=2 Tax=Giardia intestinalis RepID=A8BDD4_GIALA Length = 396 Score = 250 bits (638), Expect = 6e-65, Method: Composition-based stats. Identities = 100/402 (24%), Positives = 160/402 (39%), Gaps = 78/402 (19%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LG+E S ++ G+ I D + AN + G P + H + + LI+ A Sbjct: 2 ILGLEGSANKLGVGIVDASGVVHANLRSTYNAPPGQ--GFQPNDVAAHHRQHIIGLIERA 59 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ +++ I +AYT GPGL L A V R+L+ W VP + V+H H+ L Sbjct: 60 LLEAEISSDKITHIAYTRGPGLGAPLAAVAVVARTLSQLWKVPLLAVNHCVAHIEMGRLV 119 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD--YPGG 180 P V L SGG+TQ+I+ G+Y + GE++D A G A D+ A+ L + G Sbjct: 120 TQLPNP--VVLYASGGNTQVIAY-SQGRYRVFGEALDIAVGNALDRIARYLLISNTPAPG 176 Query: 181 PLLSKMAAQ---------------------------------------------GTAGRF 195 + ++AA+ G + Sbjct: 177 LNIERLAAEWAAIFREEDCVHLDPDIVPRYTTLPRSKELLKEQLELYSANHPEAGIDTSY 236 Query: 196 VFPRPMT---DRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCK 252 P T G+D S SG+ T+ + + + D I + ++ + +L+ + Sbjct: 237 DIPIITTIPVPIKGMDISCSGISTYLKTYVETHTSLDPRL--ICYSLQETLFGSLVEITE 294 Query: 253 RALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVR 312 RA G ++ GGV N L+ L M +R G + +C DNGAMIA+ G Sbjct: 295 RAAAHVGAADILAVGGVGCNLRLQEMLQIMAAERNGRLGAMDDSYCVDNGAMIAWCGACM 354 Query: 313 FKAGATADL---------------------GVSVRPRWPLAE 333 +A + DL V +WPL + Sbjct: 355 LQAPLSMDLLIPYTEVNCATVTQRYRTDSVDVPWHSKWPLTQ 396 >UniRef50_A8WMS3 Putative uncharacterized protein n=1 Tax=Caenorhabditis briggsae RepID=A8WMS3_CAEBR Length = 386 Score = 249 bits (637), Expect = 8e-65, Method: Composition-based stats. Identities = 91/295 (30%), Positives = 151/295 (51%), Gaps = 14/295 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 VLGIE S ++ G+ I D +L+N + HA G G P ++ H ++ V L+ Sbjct: 4 VLGIEGSANKIGVGIIRD-GVVLSNPRAT---FHAPPGEGFRPTETAQHHRQQIVRLVGE 59 Query: 62 ALKESGLT--AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 A++E+G+ K+ID +A+T GPG+ L VGA V R+L+ W P IPV+H GH+ Sbjct: 60 AIREAGIQDPEKEIDGIAFTKGPGMGAPLQVGAIVARTLSLRWQKPIIPVNHCVGHIEMG 119 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L V L VSGG+TQ+ +Y + GE+ID A G D+ A++L L Sbjct: 120 RLITGADNP--VVLYVSGGNTQVFLPN--KRYRIFGETIDIAVGNCLDRFARVLKLPNAP 175 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFA-ANTIRDNGTDDQTRADIARA 238 P + +G +F P T + +D S SG+ + + + + + T AD+ + Sbjct: 176 SPGY-NIEQLAKSGAKLFELPYTVKARMDVSLSGILSCIESRAPQLLESREYTPADLCFS 234 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRA-KLAEMMKKRRGEVFY 292 ++ V L+ +RA+ TG + L++ GGV N L+ ++ ++K R ++ + Sbjct: 235 LQETVFAMLIEITERAMAHTGSRELLIVGGVGCNLRLQVLEIVFLVKIRLKKLIF 289 >UniRef50_C5FT24 Glycoprotease family protein n=2 Tax=Onygenales RepID=C5FT24_NANOT Length = 492 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. Identities = 105/456 (23%), Positives = 161/456 (35%), Gaps = 134/456 (29%) Query: 11 DETGIAIYDDEKGLLANQLYSQVK-----LH---------ADYGGVVPELASRDHVRKTV 56 D+T +AI + + + Q LH +Y G+ P ++ H Sbjct: 10 DDTSVAIVEKHGTRINDASSLQTPRPHTTLHFLANITADSREYRGIHPIVSLESHQANLS 69 Query: 57 PLIQAAL----KESGLTAKDIDA-----------------------------VAYTAGPG 83 L+ AL SGL+ K+ DA ++ T GPG Sbjct: 70 DLVDKALWYLPSASGLSHKEPDALRQYASRTIQLSPEGGKARDTVNKLKPDFISVTRGPG 129 Query: 84 LVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDN---------------PPEF 128 + L VG + + LA AW VP + VHHM+ HLL P L D P+F Sbjct: 130 MRSNLSVGLELAKGLAVAWQVPMVGVHHMQAHLLTPRLADALDIPSVEENDSIRALKPDF 189 Query: 129 PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL-LGLDYPGGPLLSKMA 187 PF+++L+SGGHT L + ++ L ++D A G+ DK A++ L Y + Sbjct: 190 PFISVLISGGHTFLAHSKSLTDHKTLASTVDVAIGDVLDKFARMALPRSYIDQSKTTMYG 249 Query: 188 AQGTAGRF------------------------------VFPRPMTDRPGLDFSFSGLKTF 217 Q A F P D + F+F+GL + Sbjct: 250 KQLEAYAFPNGYSDYADYEPPATRGQETKPIINAKYGWSLTLPYPDSKKMAFTFAGLFSA 309 Query: 218 AANTI----------RDNGTDDQT-----------RADIARAFEDAVVDTLMIKCKRALD 256 A + R ++ R + R F + L + AL+ Sbjct: 310 AQRQVDIMVNGKVEQRKKTKEEMDSLNLDFLPHDGRVEFCRDFMRVCFEHLASRIVLALE 369 Query: 257 QT-----------------GFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPE 296 K +V++GGV+AN+ LR L + R ++ Sbjct: 370 NALSSVPNTARKEQIEPGPSVKTIVVSGGVAANQYLRHILRAFLDIRGFSDVDIVAPPLY 429 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 CTDN AMI +AG+ F+AG +W L Sbjct: 430 LCTDNAAMIGWAGIEMFEAGWRTSRKSQAIRKWNLD 465 >UniRef50_Q7SD85 Predicted protein n=2 Tax=Sordariaceae RepID=Q7SD85_NEUCR Length = 538 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. Identities = 101/481 (20%), Positives = 175/481 (36%), Gaps = 149/481 (30%) Query: 1 MRVLGIETSCDETGIAI-------YDDEKGLLANQLYSQVKLHAD---YGGVVPELASRD 50 + L IETSCD+T +A+ E + +L K+ +D +GGV P +A Sbjct: 38 LLTLAIETSCDDTCVALLQSYESTVRTETPEMVARLLFNKKITSDQRQFGGVHPAVAVEW 97 Query: 51 HVRKTVPLIQAAL-----------KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLA 99 H R L++ A+ K + L + D +A T GPG+ +L G V + LA Sbjct: 98 HQRHLATLVEEAIRSLPEGKTPAYKNTRLPYRAPDLIAVTRGPGMPTSLATGMEVAKGLA 157 Query: 100 FAWDVPAIPVHHMEGHLLAPML-------------------------------------E 122 AW +P + VHHM+ H L P L + Sbjct: 158 LAWGIPIVGVHHMQAHALTPQLVEALDRPPAPSVASSPWEERQQVDAEVKTASRQQEEAQ 217 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--------- 173 ++P++ LLVSGGHTQL+ + + +L + + A G+ DK A+ + Sbjct: 218 HPNLDYPYLNLLVSGGHTQLVYSASLTSHLILCTTDNIALGDMLDKAARKILPPSMLNSG 277 Query: 174 --------------------------GLDY-PGGPLLSKMAAQGTAGRFVFPRPMTDRPG 206 Y P +++ + + P+ Sbjct: 278 QNVMYAAALERFAFPRFPAGADEREYNFKYTPPATRAAEIEQHKSPYGWHLSPPLYASRK 337 Query: 207 LDFSFSGLKTFAAN--------------------------------------------TI 222 ++++F+GL + A + Sbjct: 338 MEYNFTGLGSQAQRIAESLDISSSYENHTEHILSLENSPKSGSDLAPSPDSSTTILSPAL 397 Query: 223 RDNGTDDQTRADIARAFEDAVVDTLMIKC--------KRALDQTGFKRLVMAGGVSANRT 274 ++ + R +ARA + L + K + +Q K LV++GGV++N+ Sbjct: 398 KEEDHQIEQRRYLARATMQLAFEHLASRIVMVLQQQAKTSCEQQKVKTLVVSGGVASNQF 457 Query: 275 LRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 LR L +++ R + CTDN AMIA+ G ++AG + L + +W + Sbjct: 458 LRHVLRRVLEVRGFGHIRIMAPPVNLCTDNAAMIAWTGSEMYRAGWVSKLDMLPIKKWSM 517 Query: 332 A 332 + Sbjct: 518 S 518 >UniRef50_C7DHT9 Metalloendopeptidase, glycoprotease family n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHT9_9EURY Length = 324 Score = 247 bits (630), Expect = 6e-64, Method: Composition-based stats. Identities = 98/338 (28%), Positives = 164/338 (48%), Gaps = 20/338 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M V+GIE+S G+ I + K +LAN+ + G++P + H + +I+ Sbjct: 1 MAVIGIESSAHTFGVGIVEKGK-ILANEK---MMYPISDKGIIPAKVAEYHAKNASAVIR 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + +DI+AV YT GPGL L +G ++L +P P++H GH+ Sbjct: 57 RALSVAHAALEDIEAVGYTKGPGLGPCLEIGMLAAKTLHEKLGIPIYPINHAVGHIEITK 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + L VSGG++Q++S+ G G Y + GE++D G D A+ G+ G Sbjct: 117 HLSGFADP--IVLYVSGGNSQILSLAG-GHYHVHGETLDIGVGNMLDNFARAAGMKPAWG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 ++K A G R + G+DF+F+GL T A T+ + AD++ + + Sbjct: 174 STVAKFATGGKYVRLPYTV-----KGMDFTFTGLLTAAIKTL-----PSSSIADVSFSIQ 223 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL +G +++ GGV+ + LR LA M + + A +F D Sbjct: 224 ETAFSMLVEATERALLLSGKDSVILCGGVAQSLRLREMLATMSASHKKRFYVADNQFNAD 283 Query: 301 NGAMIAYAGMVRFKAGA---TADLGVSVRPRWPLAELP 335 NGAMIAY ++G +DL ++ + R A +P Sbjct: 284 NGAMIAYVAEKMDESGYAPARSDLTINQKFRIEKAGVP 321 >UniRef50_A6TR37 O-sialoglycoprotein endopeptidase n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TR37_ALKMQ Length = 330 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 71/322 (22%), Positives = 143/322 (44%), Gaps = 19/322 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGI+TS T +AI + + LL+ + S + + G+ A H++ L + Sbjct: 8 ILGIDTSNYMTSLAIMNLQGALLSEE-RSLLPVKTGNLGLRQSDALFHHIKNLPVLCKKL 66 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +++ + + +I ++ + P + L + S+A +VP H EGH+ Sbjct: 67 MQQ--VDSINIVGISASVKPRPLADSYMPVFLASQSFATSMASLMNVPFYSFSHQEGHIE 124 Query: 118 APMLED-NPPEFPFVALLVSGGHTQ---LISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 A F+ L +SGG T+ ++ E++G S D +AG+ D+ L Sbjct: 125 AGFWSQARTCTQEFLVLHISGGTTEMLKVVPYDNRYDIEIVGGSKDISAGQLIDRIGVRL 184 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + +P GP L ++ + + P + + +FSGL+T + + + Sbjct: 185 DMPFPAGPHLESLSLEWQGPKIKLPISVKEGW---VNFSGLETHITRLLNQ----EYSSQ 237 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 IA + + +L++ K A Q+ K ++ GGV++N+ +R + + + EV + Sbjct: 238 QIASSLFHTIGQSLVLMIKTAKFQSLIKTALVVGGVASNQQIRTLIEKELSSENIEVLFG 297 Query: 294 RPEFCTDNGAMIAYAGMVRFKA 315 + ++C+DN IA G+ + Sbjct: 298 QTQYCSDNAVGIAALGVKSYLN 319 >UniRef50_A8MFJ2 O-sialoglycoprotein endopeptidase n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MFJ2_ALKOO Length = 328 Score = 237 bits (605), Expect = 5e-61, Method: Composition-based stats. Identities = 76/325 (23%), Positives = 148/325 (45%), Gaps = 18/325 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LG++TS T +++ + L+ + H G+ A HV++ L Sbjct: 6 ILGLDTSNYTTSMSLMSLDGELVYDARKLLPVDHGK-RGLRQSEALFYHVQQLPYLSNEI 64 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 ++S I A++ + P + L + G + + +P H EGH+ Sbjct: 65 SQKSDEFH--IVAISASTRPRPVEDSYMPVFLAAKSYGEITSNLFHIPFYEFSHQEGHIE 122 Query: 118 APMLEDNPP-EFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKTAKLL 173 A + +N + F+A+ +SGG T+++ V E++G + D +AG+ D+ + Sbjct: 123 AALWSENIHMKEEFIAIHISGGTTEVLVVKPRDIGYDIEIIGGTSDLSAGQFIDRVGVAM 182 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 GL++P G L +++ + P +T SFSG +T + I++ + ++A Sbjct: 183 GLEFPSGKSLEEISRGCSELSLNVPVSVTKNK---ISFSGPETHFSRLIKE---SNASKA 236 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 DIA V +L + K Q K L++ GGV++N +R+ L E + +++A Sbjct: 237 DIAYGVFHCVARSLELLVKNIGKQYPIKNLLIVGGVASNNQIRSYLLEKLAPENIHIYFA 296 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGAT 318 P++CTDN I+ G+ ++ + Sbjct: 297 APKYCTDNAVGISSLGVSKYLKQNS 321 >UniRef50_C0ZC04 Peptidase M22 family protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZC04_BREBN Length = 320 Score = 234 bits (597), Expect = 4e-60, Method: Composition-based stats. Identities = 87/327 (26%), Positives = 148/327 (45%), Gaps = 24/327 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGI+TS T + + +++ ++A +K+ G+ A HV Sbjct: 5 MLGIDTSNYRTSLCLAEEDGRIVAEAKR-LLKVKEGKRGLQQSEAVFQHVMNLP----EL 59 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 E +I A+ + P + VG + +SLA VP H EGH+ Sbjct: 60 SDEMKWKDYEIAAICVSEKPRPQDGSYMPVFKVGEGLAKSLATYLRVPLHLTTHQEGHIA 119 Query: 118 APML--EDNPPEFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKTAKL 172 A E P E F+A+ +SGG ++L+ E +G +ID AG+ D+ Sbjct: 120 AGEYTAEVRPTEDRFLAVHLSGGTSELLLCERHAAGYTIEKIGGTIDLHAGQLVDRIGVA 179 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL +P GP L ++A + T G F + GL FSFSG + + T + Sbjct: 180 LGLSFPAGPALEQLAKEAT-GEFRVSSAV---DGLSFSFSGPEASLLREVEKGST---SP 232 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK--RRGEV 290 A+IARA E + + L + A++Q K +++ GGV+AN +R +L + ++ + ++ Sbjct: 233 AEIARATEQCIANALEKSLRHAVEQGYPKDILIVGGVAANYYIRERLIKRLEHPAVKAKL 292 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGA 317 ++ P + DN +A G ++ KA Sbjct: 293 YFCDPVYSGDNAYGVAMLGWMKQKANI 319 >UniRef50_A4RG35 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4RG35_MAGGR Length = 596 Score = 233 bits (596), Expect = 6e-60, Method: Composition-based stats. Identities = 110/485 (22%), Positives = 175/485 (36%), Gaps = 155/485 (31%) Query: 1 MRVLGIETSCDETGIAIYDDEKG------LLANQLYSQVKLHADYGGVVPELASRDHVRK 54 + L IETSCD+T +A+ + E+G +L +Q + ++ +GG+ P H Sbjct: 66 LLTLAIETSCDDTCVALVEKERGPGGAARVLFHQRAT--ADNSMFGGINPLPTLESHTAL 123 Query: 55 TVPLIQAALK----------------------ESGLTAKDIDAVAYTAGPGLVGALLVGA 92 ++++A+ +S + + D V+ T GPG+ AL VG Sbjct: 124 LAKMVRSAVNALPQDAATGNSSFSTAFTRSKPDSSIPRRLPDFVSVTRGPGMAAALSVGL 183 Query: 93 TVGRSLAFAWDVPAIPVHHMEGHLLAPML------------------------------- 121 + + LA AW VP + VHHM+ HLL P L Sbjct: 184 STAKGLAVAWKVPLVGVHHMQAHLLTPRLMSAMRKPFYEWEKERAALTREAFVSEKEEKS 243 Query: 122 -------------------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAA 162 E + P +PF LLVSGGHT L+ + Q+ + E AA Sbjct: 244 GSLKKARSSQSDPKAQDPKEYDWPRYPFFTLLVSGGHTMLMRSKNLVQHSTVAEVEGFAA 303 Query: 163 GEAFDKTAK-LLGLDYPG-----GPLLSKM------------------------------ 186 G+A DK A+ +L Y G G LL + Sbjct: 304 GDALDKCARAILPPKYQGKTSSFGQLLEEFVFPKNLKDYSSVYRAPRNRAEHSSTVSPRR 363 Query: 187 -------------------AAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT 227 G + M + + ++F GL +++ Sbjct: 364 RVLMDRAAERRSPLNIIYTEENGPRYPWALKPMMAESREMKYAFGGLLDQVLRIVKERTA 423 Query: 228 ----DDQTRADIARAFEDAVVDTLMIKCKRALDQTG-------------FKRLVMAGGVS 270 D + R + + + L + +L RL+M+GGV+ Sbjct: 424 AGAFDLEERRVLGYETMRIMFEHLASRVVLSLTSYRDSSRKKNPGQGPTAARLLMSGGVA 483 Query: 271 ANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +N+ LR + M++ +V P C DN AMI +AG+ F+ G T DLGV + Sbjct: 484 SNKFLRYVVRSMLEAYHFNPVQVIGPPPHLCVDNAAMIGWAGLEMFEEGFTTDLGVLPKK 543 Query: 328 RWPLA 332 +W L Sbjct: 544 KWSLD 548 >UniRef50_C0GE31 O-sialoglycoprotein endopeptidase n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GE31_9FIRM Length = 307 Score = 233 bits (595), Expect = 7e-60, Method: Composition-based stats. Identities = 94/310 (30%), Positives = 142/310 (45%), Gaps = 20/310 (6%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TSC T +A+ D + LL + + + + G+ H++ L + Sbjct: 3 LGIDTSCYTTSLAVMDTQGRLLCEK-RTLLTVPKGERGLRQSDGVFQHLQNLPRLAEEVA 61 Query: 64 KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 E G + AVA + P + VG + GRSLA A+ VP + + H EGH+LA Sbjct: 62 GEVG--PLKLQAVAASVCPRPVEGSYMPVFTVGTSFGRSLAAAFGVPFLSLSHQEGHILA 119 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYEL--LGESIDDAAGEAFDKTAKLLGLD 176 M F AL VSGG T+L+ V ++ LG S D AG+ D+ LGL Sbjct: 120 GMWSAGVDWPEFYALQVSGGTTELLFVRQNNGLKVAELGGSADLHAGQFIDRVGVALGLS 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIA 236 +P GP + K+ G V P P++ + G + SFSG ++ I + A +A Sbjct: 180 FPAGPAVEKL---GNDALEVLPVPVSVQ-GSNLSFSGPESHVQRVIASG---EYAPAAVA 232 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 R E V ++L + + G K ++ GGV AN+ +R L +K E +A+ Sbjct: 233 RGVEKCVAESLWRVLRTVRKEHGAKPVLFVGGVMANQFIRGFL---AEKLGDEAAFAQIR 289 Query: 297 FCTDNGAMIA 306 F DN A A Sbjct: 290 FAGDNAAGAA 299 >UniRef50_D1BMJ2 Metal-dependent protease with possible chaperone activity n=3 Tax=Veillonella RepID=D1BMJ2_VEIPT Length = 317 Score = 231 bits (590), Expect = 3e-59, Method: Composition-based stats. Identities = 84/319 (26%), Positives = 145/319 (45%), Gaps = 13/319 (4%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TSC T AI D++ ++ +++ G+ H K +P + + L Sbjct: 6 LGIDTSCYTTSCAIIDNDFHIVGEARKI-LEVKLGERGLQQSNMVFQHT-KALPKLMSEL 63 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE- 122 + ++ + + A +VG G++L+ +VP H E H+LA + + Sbjct: 64 PQVPISGIGVSGFPRREERSYMPAFMVGLGQGQTLSHLMNVPLHIFAHQENHILAALRDL 123 Query: 123 DNPPEFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 N P PF+AL +SGG T+L+ GI + ++G S D G+ D+ LGL +P Sbjct: 124 KNIPNEPFLALHLSGGTTELVYCHYQGNGIFESHIVGGSKDLQGGQYVDRIGVALGLPFP 183 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 G L +A Q T P P + + G SF+G + A I +N D ++ +ARA Sbjct: 184 AGKHLEALALQTTEYE---PLPSSVKDGW-ISFAGPCSAAMRRI-NNAMSDIDKSKLARA 238 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 ++ + L + + L+ GGV +N LR ++ K+ ++ A+P+F Sbjct: 239 VFTSIGNALEKMITYHTKEKSVRALIAVGGVISNSLLRKRMETYCKRNHLQLHVAQPQFS 298 Query: 299 TDNGAMIAY-AGMVRFKAG 316 DN A+ A ++ G Sbjct: 299 VDNATGNAFGAAYLQESRG 317 >UniRef50_A7APL5 Glycoprotease family protein n=1 Tax=Babesia bovis RepID=A7APL5_BABBO Length = 406 Score = 230 bits (586), Expect = 7e-59, Method: Composition-based stats. Identities = 75/298 (25%), Positives = 135/298 (45%), Gaps = 30/298 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IETSCD+ A+ +++ + S +GG+ P+ + R H+ ++ Sbjct: 101 ILAIETSCDDCCAAVVSSNGDVVSEERASNPDSLIKFGGIKPDESYRFHLDNIDRIMNEV 160 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + ++ L +DI + T GPG+ L G ++ + +P I +H+ GH L+P ++ Sbjct: 161 VSKAKLKFEDIGYIVATRGPGMRICLNAGYDAAERISKTYSIPLIGENHLAGHCLSPFIK 220 Query: 123 --------------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDK 168 ++P+++LL+SGGH+Q+ V QY +L +++D AG K Sbjct: 221 GHQLRMTHDRGSVASEELKYPYLSLLLSGGHSQIYVVESPYQYHMLVDTMDHYAGNVLYK 280 Query: 169 TAKLLGLDYP--GGPLLSKMAAQGTAGR-FVFPRPMTDRPGLDFSFSGLKTFAANT---I 222 AK LGL GGP + + A + F P F FSG++T + I Sbjct: 281 CAKELGLPIDTGGGPSIEEAARKRQGRPMFRMTEPCKGMSFTSFCFSGIQTQLRSMVSKI 340 Query: 223 RDNGTDDQTRAD------IARAFEDAVVDTLMIKCKRALDQT----GFKRLVMAGGVS 270 R + +D D +A ++ + ++ + +ALD G ++V+ GG S Sbjct: 341 RQDLGEDALSEDPKLVNHLAYTCQEVTFNQVIRQLDKALDICETLFGISQIVVVGGRS 398 >UniRef50_C5KYH6 Glycoprotein endopeptidase, putative n=4 Tax=Perkinsus marinus ATCC 50983 RepID=C5KYH6_9ALVE Length = 298 Score = 229 bits (585), Expect = 1e-58, Method: Composition-based stats. Identities = 91/256 (35%), Positives = 135/256 (52%), Gaps = 38/256 (14%) Query: 112 MEGHLLAPMLEDNPP------------EFPFVALLVSGGHTQLISVTGIGQYELLGESID 159 ME H+L D P EFPFV LLVSGGH + G+G + +LG ++D Sbjct: 1 MESHMLVTRKPDPTPTASSTSPSPHRPEFPFVTLLVSGGHNMAVLTRGMGDHIILGSTLD 60 Query: 160 DAAGEAFDKTAKLLGL-DYPGGPLLSKMAAQGTAGR--FVFPRPMTD------RPGLDFS 210 D+ GE FDK A+LL + D PGGP+L K+A++G +P+ + G DFS Sbjct: 61 DSVGECFDKVARLLDIHDVPGGPVLEKLASEGNPRACLRELAKPLAKTRDLELKNGCDFS 120 Query: 211 FSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQT-----GFKRLVM 265 F+GLKT + I ++ D+A +F+ VD L+ + RA+D K LV+ Sbjct: 121 FAGLKTSMRHLIEGGK---YSKPDMAASFQKRCVDHLVERAGRAIDWALEIDGSIKDLVV 177 Query: 266 AGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG--------- 316 AGGV+AN+++R+ + E+ K++ ++ CTDNG M+A+ + K G Sbjct: 178 AGGVAANKSVRSNMQELAKEKGLMLYCPPTRLCTDNGTMVAWNAIEHLKEGLYERAPCTA 237 Query: 317 ATADLGVSVRPRWPLA 332 +A+ V VRPRWPL Sbjct: 238 ESAEKFVEVRPRWPLG 253 >UniRef50_D2RJI3 Peptidase M22 glycoprotease n=2 Tax=Acidaminococcus RepID=D2RJI3_ACIFE Length = 319 Score = 227 bits (580), Expect = 4e-58, Method: Composition-based stats. Identities = 88/320 (27%), Positives = 139/320 (43%), Gaps = 23/320 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG++TSC T A+ D LL +Q +++ + G+V H R L+ A Sbjct: 7 VLGLDTSCYTTSAALMDLHGHLLGDQ-RRLLRVKPGHRGLVQSEMVFQHTRNLPDLL-EA 64 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L SG+ + A+ +A P + A LVG + RSL +P H H+ Sbjct: 65 LDLSGV---QVKAIGVSAKPRPREESYMPAFLVGLGMARSLGKLMGLPVHRFTHQHNHMF 121 Query: 118 APMLE-DNPPEFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTAKL 172 A + P F+ + +SGG T L+ G E G SID AG+ D+ Sbjct: 122 AGLWSVGKPAPDRFLLVHISGGTTDLLLCERQPDGNFSLEPRGTSIDLHAGQFIDRVGVA 181 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL +P G L K+A + P + R G + S SG T I + Sbjct: 182 LGLPFPAGAPLEKLAETASEAH---PLKVWSREG-ELSLSGPCTQTLRAIEKG----EDP 233 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 A +A E A+ L ++ ++++AGGVSANR +R +L + + +R+ ++ Sbjct: 234 AALALGVEQAIGKALARTISWVCEKEQLSQVLLAGGVSANREIRRQLEDFLGQRQIGLWA 293 Query: 293 ARPEFCTDNGAMIAYAGMVR 312 P + D A+A ++R Sbjct: 294 PDPRYSVDGAVGNAWAALLR 313 >UniRef50_A6NUZ4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NUZ4_9BACE Length = 313 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. Identities = 89/329 (27%), Positives = 146/329 (44%), Gaps = 32/329 (9%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R LG++TS T +A++D G + + + + G+ A HV++ L + Sbjct: 4 LRCLGLDTSNYTTSVAVFDGTTG---ENIGRLLDVPSGTLGLRQSDALFQHVKRLPGLFE 60 Query: 61 AALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L E L ++ AV + P + L G GR+L+ +VP PV H +GH Sbjct: 61 Q-LHEKDL-LGELRAVGASTRPRAVDGSYMPCFLAGEGQGRALSATLNVPFFPVSHQQGH 118 Query: 116 LLAPMLEDNPP---EFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKT 169 + A + P +A +SGG T+L+ V G + + +G + D +AG+ D+T Sbjct: 119 IAAAAWSAGRLGLLDEPMLAWHLSGGTTELLYVEPEGVNVRAQAIGGTSDISAGQLIDRT 178 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 KLLGLD+P G L +A + + + R G FS SG++ Sbjct: 179 GKLLGLDFPAGKALDALARESQSEK----RFKVKLNGCSFSLSGVENQVKAMAER----G 230 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 + ADIAR + + D + AL++ ++ +GGV++N LR KL Sbjct: 231 EAPADIARFALNTIADAVARATAAALEERPGLNVLCSGGVASNSLLREKLKNA------- 283 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGAT 318 +A P + TDN +A +AG T Sbjct: 284 -VFAEPRYSTDNAMGVAILAWRSLQAGET 311 >UniRef50_Q2RIB0 O-sialoglycoprotein endopeptidase n=5 Tax=Clostridia RepID=Q2RIB0_MOOTA Length = 321 Score = 221 bits (565), Expect = 2e-56, Method: Composition-based stats. Identities = 83/322 (25%), Positives = 139/322 (43%), Gaps = 19/322 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGI+TS A + LLA + + G+ A HV+ ++ Sbjct: 1 MAILGIDTSAYTCSAAAVSQDGELLAAH-RRLLPVPPGERGLQQATAVFHHVQILPEVLS 59 Query: 61 AALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 + + A I V + P + V A GR LA A VP H EGH Sbjct: 60 EVF--AAVPAARIRRVVASVKPRPVEGSYMPVFTVAAGQGRILAAALGVPFRATTHQEGH 117 Query: 116 LLAPMLEDN-PPEFPFVALLVSGGHTQLISVT---GIGQYELLGESIDDAAGEAFDKTAK 171 + A + P F+A+ +SGG ++++ V+ G E LG ++D AG+ D+ Sbjct: 118 IQAGLWSSGWQPSDSFLAVHLSGGTSEVLLVSRKPGGFTIEKLGGTLDLHAGQLVDRAGV 177 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 L+GL++P GP L ++A + + G +FSFSG + A + Sbjct: 178 LMGLEFPAGPALERLAREAGPEMEKVHL-TSAVRGYNFSFSGPASQAERLLAAGAPPAAV 236 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG--E 289 + E + +TL + A++ TG + +++ GGV+AN LR +L ++ Sbjct: 237 ARAV----EQCIANTLERVLRPAVEATGLRDILIVGGVAANNYLRQRLRHRLEHPAVAAR 292 Query: 290 VFYARPEFCTDNGAMIAYAGMV 311 + +A PE +DN +A G+ Sbjct: 293 LHFAAPEHSSDNAIGVALLGLE 314 >UniRef50_A6S1G0 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6S1G0_BOTFB Length = 323 Score = 220 bits (561), Expect = 5e-56, Method: Composition-based stats. Identities = 75/305 (24%), Positives = 129/305 (42%), Gaps = 56/305 (18%) Query: 84 LVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML--------------EDNPPEFP 129 + L+ G + LA AW +P + V+HM+ H L P + +N P +P Sbjct: 1 MRANLITGIDTAKGLAVAWQIPLLGVNHMQAHALTPRMVSALEAGNNSKTEKHENDPAYP 60 Query: 130 FVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL---------------- 173 F++LLVSGGHT L+ + +E+L + D A G+ DKTA+ + Sbjct: 61 FLSLLVSGGHTMLVHSRQLCDHEILATTSDLAVGDMVDKTARDILPASVIESASDVMYGR 120 Query: 174 ---GLDYPGG-------PLLSKMAAQGTAGRFV--FPRPMTDRP-------GLDFSFSGL 214 +P P +A ++ P +FS+SG+ Sbjct: 121 VMEEFAFPDANSSYDYEPSHKSIAQTSRPTKYEWTLTPPYMSTGHRPLKSYNSEFSYSGV 180 Query: 215 KTFAANTI-RDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFK---RLVMAGGVS 270 + + R+ D R +A+ + L + L++ K LV++GGV+ Sbjct: 181 GSQIKRIMNRNPEMDIAERRLLAQETMRVAFEHLASRVILNLERPDLKDTKTLVVSGGVA 240 Query: 271 ANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 AN+ L+ L ++ + + P+FCTDN AMI + G+ ++AG +DL + Sbjct: 241 ANQYLKYILRSLLDAWGHKTMRLIFPPPKFCTDNAAMIGWTGIEMWEAGWRSDLDILAAR 300 Query: 328 RWPLA 332 +WP+ Sbjct: 301 KWPID 305 >UniRef50_Q3AAM2 Glycoprotease family protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AAM2_CARHZ Length = 319 Score = 216 bits (551), Expect = 8e-55, Method: Composition-based stats. Identities = 80/313 (25%), Positives = 144/313 (46%), Gaps = 24/313 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LG +TS T A D E L+ + L + + G+ H+R ++Q Sbjct: 6 LGFDTSNYTTSFAAVDGEGRLIFD-LRKILPVPEGEVGLRQRDVVFLHLRHLKEMVQEGF 64 Query: 64 KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 ++ + + + P + + L G + +L+ A DVP + H EGHL+A Sbjct: 65 NR--ISRDQVRGIGVSVKPRPLPESYMPSFLAGEVIASTLSLALDVPLVKTTHQEGHLVA 122 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQ---YELLGESIDDAAGEAFDKTAKLLGL 175 + F+A+ SGG ++++ V Q ++LG+S+D +AG+ D+ LLGL Sbjct: 123 ALWSLKKDFPRFLAIHFSGGTSEILEVEKEPQGYKVKVLGKSLDISAGQLVDRIGVLLGL 182 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADI 235 +P G L ++A + + P T G ++ FSG + + ++D + I Sbjct: 183 PFPSGKFLEELAQKAVG---ILKVPATFVNG-NWHFSGAEAYLKRKLKDFPAFE-----I 233 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR--GEVFYA 293 ARA E+ + TL + +V+ GGV+AN ++ L E +KKRR ++++A Sbjct: 234 ARAVEEVIARTLFKIIQYHAKDN--LPVVLMGGVAANNYIKNFLLEKLKKRRVAVDLYFA 291 Query: 294 RPEFCTDNGAMIA 306 ++ +DN +A Sbjct: 292 EVQYASDNAVGVA 304 >UniRef50_C9LLA9 Glycoprotease family protein n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLA9_9FIRM Length = 319 Score = 216 bits (551), Expect = 8e-55, Method: Composition-based stats. Identities = 79/321 (24%), Positives = 134/321 (41%), Gaps = 21/321 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 + LGI+TSC T A+YD +G++ + + A G+ HVR P+I Sbjct: 3 KFLGIDTSCYTTSAAVYDSTEGIVGESRII-LSVKAGKRGLSQSEMVFQHVRNL-PVILG 60 Query: 62 ALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ I+ + + P + A LVG + SL+ VP H E H Sbjct: 61 QLE---PWIDQINGIGVSVFPRRRADSYMPAFLVGKGMAESLSHVLRVPVFEFSHQENHA 117 Query: 117 LAPMLEDNPPEF-PFVALLVSGGHTQLISV---TGIGQYELLGESIDDAAGEAFDKTAKL 172 LA + PF + +SGG ++SV I Q L S D AG+ D+ Sbjct: 118 LAAIQNMPEIWGTPFYMMHLSGGTQDVLSVEWEKDIMQIVDLIHSADITAGQFIDRVGVS 177 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LG+ +P GP + ++A + + ++ P+ + FSF+G + I+ T T Sbjct: 178 LGMPFPAGPSMERLAMK---HQQLYKVPVANVKN-GFSFAGPEAQVQRDIQ---TKRYTP 230 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA ++ +L + + GGV +N LR + E+ + + + Sbjct: 231 EDIAYGVFSSIGKSLHKVLDSYNGFIEGRTFIAVGGVMSNGYLRKSITEICRHKSLHPCF 290 Query: 293 ARPEFCTDNGAMIAYAGMVRF 313 A ++ +DN A+ +R+ Sbjct: 291 AEVKYSSDNATGNAFGAFMRY 311 >UniRef50_C8WXH0 Peptidase M22 glycoprotease n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WXH0_ALIAD Length = 329 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 81/327 (24%), Positives = 138/327 (42%), Gaps = 24/327 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLG++TS T + D G + + +++ G+ A+ HV+ ++ Sbjct: 7 LVLGVDTSNYTTSVCAVDAVHGRMVAEARRPLRVPRGERGLRQSEAAFQHVQNFPTVMAE 66 Query: 62 ALKE---SGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 L G+ VA + P + G V SLA + VP H E Sbjct: 67 LLDRLMAEGVRPA-WRRVAVSVRPRPWASSYMPVFQSGFAVAASLAHSLGVPLTRTSHQE 125 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQ---YELLGESIDDAAGEAFDKTA 170 GHL A P PFVA+ +SGG ++ +GE++D G+ D+ Sbjct: 126 GHLAAAEYFAPMPGAPFVAVHMSGGTCDVVIARRTPSGYAITRVGEALDLHPGQLVDRVG 185 Query: 171 KLLGLDYPGGPLLSKMAAQG--TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 LGL +P GP L ++A + + G + P+ G SFSG T A ++ Sbjct: 186 VALGLPFPAGPHLEQLARRCGTSPGELLLKAPV---RGASMSFSGPLTAALRAVQAGAPA 242 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEMMKKR- 286 + +ARA E + ++ + A+ + +++AGGV++N+ ++ + +++R Sbjct: 243 HE----VARAVEACIARSVAKAVEYAVRHAQTARHVLIAGGVASNQFIQCTIRSRLERRV 298 Query: 287 -RGEVFYARPEFCTDNGAMIAYAGMVR 312 V +A PEF DN +A G R Sbjct: 299 PGIHVAFAPPEFARDNALGVATIGYWR 325 >UniRef50_B0AAV1 Putative uncharacterized protein n=2 Tax=Clostridium RepID=B0AAV1_9CLOT Length = 326 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 68/328 (20%), Positives = 135/328 (41%), Gaps = 21/328 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 V+GI+TSC T IA K ++ N+ + + + G+ A HV Sbjct: 8 IVIGIDTSCYTTSIAAISLNKEIIFNEKIM-LNVDTNSKGLRQSEAVFKHVSNI----GQ 62 Query: 62 ALKESGLTAKDIDAVAYTAGP-------GLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + +D + V A + VG +G+ L+ + P H E Sbjct: 63 ISENIAEKLRDYNIVGVCASEKPRPIKGSYMPVFTVGLNIGKLLSSTHNCPFFKTSHQEN 122 Query: 115 HLLAPMLEDNP-PEFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKT 169 H+ + +L N + F+A+ +SGG T+++ V G ++E++G + D + G+ D+ Sbjct: 123 HIESSLLGKNLLDKNRFIAVHMSGGTTEIVLVNKGKCGKYEFEIIGGTKDVSFGQLIDRL 182 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 L ++P G + K A + + ++ SG++ + + + Sbjct: 183 GVKLSYNFPCGKYIDKNALEYEKTIENGLKTSVKEGYMN--LSGIENQLDKIMSNQK--E 238 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 ++ +++ DA++ + ++ G +V AGGVSA++ + L + +KK R + Sbjct: 239 IDKSFLSKLLMDAIIRNMFKSLSYLCEKHGVYEVVFAGGVSASKYISKNLTQKLKKYRIK 298 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGA 317 + + TDN A G+ G Sbjct: 299 THFTHADLATDNAVGCALIGIQNLNLGE 326 >UniRef50_Q97ZY8 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Sulfolobus solfataricus RepID=GCP_SULSO Length = 246 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. Identities = 82/252 (32%), Positives = 129/252 (51%), Gaps = 17/252 (6%) Query: 88 LLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTG 147 + VGAT+ R++A ++ +PV+H GH+ L + + L +SGG+T +I+ Sbjct: 1 MRVGATLARAIALKYNKKLVPVNHGIGHIEIGYLTTEARDP--LILYLSGGNT-IITTFY 57 Query: 148 IGQYELLGESIDDAAGEAFDKTAKLLGLDYP----GGPLLSKMAAQGTAGRFVFPRPMTD 203 G++ + GE++D A G D + + L P G ++ A +G + P Sbjct: 58 KGRFRVFGETLDIALGNMMDVFVREVSLAPPYIINGIHVIDICAEKGNK---LLKLPYVV 114 Query: 204 RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRL 263 + G D SFSGL T A + + DI + + D L+ +RAL T K L Sbjct: 115 K-GQDMSFSGLLTAALRVVGK-----EKLEDICYSVREIAFDMLLEATERALALTSKKEL 168 Query: 264 VMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGV 323 ++ GGV+A+ +LR KL E+ K+ ++ PEF DNGAMIAYAGM+ G D+ Sbjct: 169 MIVGGVAASVSLRKKLEELGKEWNVQIKIVPPEFAGDNGAMIAYAGMLAASKGVFIDVDK 228 Query: 324 S-VRPRWPLAEL 334 S +RPRW + E+ Sbjct: 229 SYIRPRWRVDEV 240 >UniRef50_Q0AZF6 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AZF6_SYNWW Length = 326 Score = 213 bits (542), Expect = 9e-54, Method: Composition-based stats. Identities = 81/325 (24%), Positives = 142/325 (43%), Gaps = 23/325 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TS + +A+ D+E+ ++A++ +++ A G+ A H++ P + A L Sbjct: 5 LGIDTSAYTSSLALVDEEQNIIADERMI-LQVGAGKRGLRQSEAFFQHIKNL-PFLFARL 62 Query: 64 KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 S + A+ +A P + G + L+ +P H EGH++A Sbjct: 63 --SSYFDAPVKAIGASAWPRRVEGSYMPVFSAGFSQAVVLSSFTGIPLYSFSHQEGHIIA 120 Query: 119 PMLEDNP--PEFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTAKL 172 + + F+A+ SGG ++L+ V G+ +D AG+ D+ Sbjct: 121 GIKGNEALLGRAEFLAVHFSGGTSELLHVRQQQGGLLDISPALAGLDLHAGQLVDRVGVA 180 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 +GLD+P G L KMA Q + G FSFSG +T A R + + Sbjct: 181 MGLDFPCGSELEKMARQSSGENLPLMPSSVSDKG--FSFSGAETRA----RKLMAEGISY 234 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK--RRGEV 290 DIA A + +TL + D+ G K +++ GGV AN ++ +L ++ ++ Sbjct: 235 PDIALASLRCIANTLEKSILQESDKKGIKDVLLVGGVMANSIIKERLQARLEHPAVGLKL 294 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKA 315 F+A P +DN +A A + Sbjct: 295 FFASPRLSSDNAVGVALAAQFILRK 319 >UniRef50_D2VC41 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VC41_NAEGR Length = 415 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. Identities = 85/256 (33%), Positives = 124/256 (48%), Gaps = 34/256 (13%) Query: 23 GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAALKE-SGLTAKD---IDAVAY 78 +L Q+ + +L YGGV P + H +I+ AL++ S L + +D VA Sbjct: 5 KILHEQVITHHELVNQYGGVHPTEMAHMHRATLDGMIENALEKVSNLDSNRERVVDYVAV 64 Query: 79 TAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGG 138 T GPGL L G +VP IPVHH+E HLL P++ FP++ LL SGG Sbjct: 65 TVGPGLPPCLSAGLDTAMKYCEKLNVPVIPVHHLEAHLLVPLMFSENTNFPYLVLLASGG 124 Query: 139 HTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL------------------GLDYPGG 180 H ++ GIGQYE++G + DD+ GEAFDKTA+LL +Y GG Sbjct: 125 HCLVVFSRGIGQYEIVGGTEDDSIGEAFDKTARLLQESIDFNLNDYVNEKFGTRENYSGG 184 Query: 181 PLLSKMAAQGTAGRFVFPRPMTD---RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 L+ K+A G + + FP P+ R + FSFSG+KT T+R ++ D+ Sbjct: 185 ALVEKLALLGDSSSYNFPIPLRKGNRRNDITFSFSGIKTDVLRTVRKEQNQGISKRDL-- 242 Query: 238 AFEDAVVDTLMIKCKR 253 L+ + + Sbjct: 243 -------HHLLNRLRN 251 Score = 117 bits (295), Expect = 4e-25, Method: Composition-based stats. Identities = 32/132 (24%), Positives = 61/132 (46%), Gaps = 10/132 (7%) Query: 213 GLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG------FKRLVMA 266 G ++ + +++ ++ +I+ +F+ L+ K + A+ + L+++ Sbjct: 276 GSQSLSTIELKNEKLSEEVVCNISASFQKCAFTHLIDKLEMAMHRYRANVDEYPNSLIVS 335 Query: 267 GGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS-- 324 GGVSAN+ R +L ++ K ++ A ++CTDN MI YA R + V Sbjct: 336 GGVSANQYFRHELTKLSDKYEYDLKVAPMKYCTDNAVMIGYAAFQRLFNECHKPVEVCDK 395 Query: 325 --VRPRWPLAEL 334 PRWP+ L Sbjct: 396 ERYIPRWPITTL 407 >UniRef50_Q5KFY5 Mitochondrion protein, putative n=2 Tax=Filobasidiella neoformans RepID=Q5KFY5_CRYNE Length = 307 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 81/286 (28%), Positives = 136/286 (47%), Gaps = 35/286 (12%) Query: 84 LVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML-EDNPPEFPFVALLVSGGHTQL 142 + G L VG R+LA A + VHHM+ H L P+L PEFPF+ LL+SGGHTQL Sbjct: 1 MPGCLSVGQGTARALAAALGKRLVGVHHMQAHALTPLLTSAAAPEFPFLILLLSGGHTQL 60 Query: 143 ISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD------------YPGGPLLSKMAAQG 190 + G+ ++++L +++D G+ F+K+A+LL L Y P L Sbjct: 61 VLAKGLFKFKILLDTLDSKIGDVFEKSARLLALPSGPKAPGAILEHYASLPALPPYDTHP 120 Query: 191 TAGRFVFPRPMTD---RPGLDFSFSGLKTFAANTIRDNGT-----DDQTRADIARAFEDA 242 + P P+T + L +SF+G+ + D D+ R A + A Sbjct: 121 LPASQLIPIPLTTLHAKNTLAWSFAGMLAALQRAVHDRRQRQPAWDEPDRRAFANLVQTA 180 Query: 243 VVDTLMIKCKRALDQTGFKR------LVMAGGVSANRTLRAKLAEMMKKRRG-------E 289 + L+ K + + +V++GGV++N +R++L ++K G Sbjct: 181 LTTHLLTKLAQRIALLPPDTRAQLGGIVVSGGVASNAYIRSQLDRLVKTENGLFPPAGRN 240 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAEL 334 ++Y CTDN AMIA+ ++R + G +D + +R +W L ++ Sbjct: 241 LYYPPLHLCTDNAAMIAHTALIRLQTGLRSDPDDLKLRAKWSLEDM 286 >UniRef50_C7H6X1 Glycoprotease family protein n=2 Tax=Faecalibacterium prausnitzii RepID=C7H6X1_9FIRM Length = 312 Score = 210 bits (535), Expect = 6e-53, Method: Composition-based stats. Identities = 70/317 (22%), Positives = 129/317 (40%), Gaps = 23/317 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGI+TS T +A++ ++ + + + G+ A H +++ Sbjct: 5 TLGIDTSNYATSLAVFHTAGEVVCAKKRF-LPVKEGQLGLRQSDALFHHTAALPEMLEEL 63 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +E LT I AV + P + L G + + A A +P I H +GH Sbjct: 64 GREFDLT--QISAVGVSQKPRPVEGSYMPCFLAGVSAATAFAQARGIPLIHTTHQQGHAA 121 Query: 118 APMLEDNPPEF---PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 A + E + +SGG T L+ + + LG S D AG+A D+ LG Sbjct: 122 AALFAAKGEELFRQKVLLFHISGGTTDLLLCNEVKEITTLGTSTDLYAGQAVDRVGVKLG 181 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 +P G +S++AA +P + G+ S SGL+ + + T + Sbjct: 182 FGFPAGVEVSRLAALCEEP----IKPRSSVKGMQCSLSGLENQCNALLNEGKTPEY---- 233 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + + V DT++ K A + +V AGGV ++ +RA + + + +V++ Sbjct: 234 VCKYCLLCVADTVVKMTKAAQKEYPGLPVVCAGGVMSSDIIRAWVQQRL----PQVYFVP 289 Query: 295 PEFCTDNGAMIAYAGMV 311 ++ +DN ++ Sbjct: 290 GQYSSDNAIGVSILAAQ 306 >UniRef50_D1PKV9 Glycoprotease family protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PKV9_9FIRM Length = 315 Score = 206 bits (525), Expect = 8e-52, Method: Composition-based stats. Identities = 73/317 (23%), Positives = 123/317 (38%), Gaps = 22/317 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGI+TS T +A++D G + + + A G+ A H ++ Sbjct: 1 MLTLGIDTSNYATSLAVFDTNAGEVVCDCKKFLPVKAGQMGLRQSDALFHHTSALPQMLL 60 Query: 61 AALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 +++ L+ I AV +A P + L G + A A +P H +GH Sbjct: 61 ELGEKTDLSR--IGAVGVSAKPRPVEGSYMPCFLAGVNTATAFALARKIPMFKTTHQQGH 118 Query: 116 LLAPMLEDNPPEF---PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL 172 + A + + VSGG T L+ G LG S D AG+A D+ Sbjct: 119 IAAALFATGVHSLFMQEALVFHVSGGTTDLLLCHGADTVVPLGTSSDLYAGQAVDRLGVK 178 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LG +P G +S+ AA RP G++ S SGL+ + + + Sbjct: 179 LGYPFPAGVYVSEQAALCAEK----IRPKVSVRGMECSLSGLENQCNRMLE----EGKNA 230 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 + + + + +TL+ AL + ++ AGGV ++ +R + R + Sbjct: 231 SYVCKYCLLCIGETLVRMAGTALQEHPGLPVIFAGGVMSSDLIRTYVMH----RVPGAHF 286 Query: 293 ARPEFCTDNGAMIAYAG 309 +F +DN IA Sbjct: 287 VPGKFASDNAIGIAVLA 303 >UniRef50_Q18B67 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Clostridium difficile RepID=Q18B67_CLOD6 Length = 356 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 138/351 (39%), Gaps = 43/351 (12%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++GI+TSC T IA +K ++ N+ +++ + G+ A H+ + ++ Sbjct: 7 IIIGIDTSCYTTSIAAISLDKKVIFNEKIM-LEVRDNSKGLRQSEAVFQHI-NNLGILSD 64 Query: 62 ALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 +K ++ V + P + VG G+ L+ + H E H+ Sbjct: 65 RIKSFKDKFN-VEGVCSSKKPRPVENSYMPVFNVGHNFGKLLSSIYGCRFYETTHQENHI 123 Query: 117 LAPMLEDN-PPEFPFVALLVSGGHTQLI-------------------------------S 144 A +L F+++ +SGG T+++ Sbjct: 124 EASLLNSKLKNNNKFISVHMSGGTTEILLTSKQDSHHNVCDTNLGKIAKISIKKDDKSKL 183 Query: 145 VTGIG-QYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTD 203 G +++G S D + G+ D+ LG +P G L + A + + Sbjct: 184 YNNFGYNIDIIGGSKDISFGQLIDRVGIKLGYKFPSGKYLDENALNCN-LKIESGLKTSV 242 Query: 204 RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRL 263 R G + SGL+ I DNG + + I++ D+VV + + + Sbjct: 243 RDGY-MNLSGLENQVNKIINDNGDNTNQKEYISKLVLDSVVRNMFKSLVYLCETYNVNEV 301 Query: 264 VMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFK 314 + AGGVSA++ + +L+ ++K+ E ++ P++ TDN A G+ F Sbjct: 302 IFAGGVSASKYILRELSMKLRKKHIEAYFTEPQYSTDNAVGCAIIGLNNFL 352 >UniRef50_UPI0000DD8AA6 Os01g0295900 n=1 Tax=Oryza sativa Japonica Group RepID=UPI0000DD8AA6 Length = 288 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. Identities = 81/333 (24%), Positives = 128/333 (38%), Gaps = 102/333 (30%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +LGIETSCD+T A+ + +L+ + SQ L +GGV P++A H+ ++Q Sbjct: 16 LLMLGIETSCDDTAAAVVRGDGEILSQVVSSQEDLLVRWGGVAPKMAEEAHLLAIDRVVQ 75 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + ++ D+ AVA T GPGL Sbjct: 76 KALDNANVSESDLSAVAVTVGPGL------------------------------------ 99 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +L + G T I+ + Q K +K++ Sbjct: 100 -----------SLCLRGYLTNHINCSWCSQSS---------------KNSKIIS------ 127 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN-------------GT 227 P ++ G G F M +FS++GLKT I Sbjct: 128 PAYCWSSSYGGTG-ISFQVSMRQHKDCNFSYAGLKTQVRLAIESRNISTDDIPISSATKD 186 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 D Q RA+IA +F+ L+ V++GGV++N+ +R L ++ +K Sbjct: 187 DRQIRANIAASFQ------LLK--------------VVSGGVASNQYVRTHLNQIAEKNG 226 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATAD 320 ++ P CTDNG MIA+ G+ F AG D Sbjct: 227 LQLVCPPPRLCTDNGVMIAWTGIEHFIAGRFDD 259 >UniRef50_A7VX43 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A7VX43_9CLOT Length = 315 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 87/329 (26%), Positives = 143/329 (43%), Gaps = 27/329 (8%) Query: 1 MRV-LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M + LGI+TS T +A+YD + + Q+ + + G+ A HV++ P + Sbjct: 1 MNLALGIDTSNYTTSLALYDAQAHEIC-QVKRLLPVKEGEKGLRQSDAVFHHVQQL-PEL 58 Query: 60 QAALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L + G K + AV +A P + VG T R L+ A +VP H G Sbjct: 59 MDKLWKPGC-GKALSAVGVSARPRDAEGSYMPCFTVGLTYARLLSTALEVPFYTFSHQAG 117 Query: 115 HLLAPMLEDNPP---EFPFVALLVSGGHTQLISVTGIGQ----YELLGESIDDAAGEAFD 167 H+ A + + PF+A VSGG T+ + V+ Q +L +++D AG+ D Sbjct: 118 HIAAALYSSGSLSLLKQPFLAFHVSGGTTEALLVSPDDQRILSCQLAAKTLDLNAGQLID 177 Query: 168 KTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT 227 + +LGL +P GP L ++A + RP G D SG + +R Sbjct: 178 RVGVMLGLGFPAGPALERLALTCESKGLRGARPAM--KGNDCCLSGGENLCIKLLR---- 231 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 + + A IA + V L C+ L++ G +V AGGV +N LR ++ + Sbjct: 232 EGKEPAYIAAFCLEYVKAALDQMCRGLLERYGRLPVVFAGGVMSNSILREYFSK-----Q 286 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 +A P+F +DN + ++ G Sbjct: 287 YGAMFAEPQFSSDNAGGVGVLTAIKAGLG 315 >UniRef50_A0RY43 O-sialoglycoprotein endopeptidase n=4 Tax=Thaumarchaeota RepID=A0RY43_CENSY Length = 237 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 71/245 (28%), Positives = 114/245 (46%), Gaps = 14/245 (5%) Query: 91 GATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQ 150 GA V R+L+ +P PV+H GH+ L + + LLVSGGHT L++ G G+ Sbjct: 2 GAVVARALSSYHGIPIYPVNHAIGHIELGKLLTGAQDP--LVLLVSGGHTMLLAFVG-GR 58 Query: 151 YELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFS 210 + + GE++D G+ D+ + LG P G + ++AA+ + P + G D S Sbjct: 59 WRVFGETLDITLGQLLDQFGRSLGFPSPCGRQVEELAAESSEYT-DLPYSV---KGNDVS 114 Query: 211 FSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVS 270 FSGL + A R + + + ++ + +RAL T + L++ GGV+ Sbjct: 115 FSGLLSAAKTAARRG------KETASYSLQETAFAMVAEAVERALSFTRKRELMVVGGVA 168 Query: 271 ANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRW 329 AN+ L L ++R +F P + D GA IA G++ A L VR W Sbjct: 169 ANKRLAGMLEGACGRQRCRLFVVPPVYSGDCGAQIACTGLLEASIKDGAPLADTFVRQSW 228 Query: 330 PLAEL 334 L + Sbjct: 229 RLDTV 233 >UniRef50_B0TEI7 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TEI7_HELMI Length = 385 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 104/375 (27%), Positives = 155/375 (41%), Gaps = 65/375 (17%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGI+TSC T +A + LLA Q + + G+ A H R+ P + A Sbjct: 9 VLGIDTSCYTTSVAFASLDGRLLA-QKRQLLPVKPGERGLRQGDAFFLHGRQL-PHVMEA 66 Query: 63 L---------KESGLTAKDIDAVAYTAGPG-----LVGALLVGATVGRSLAFAWDVPAIP 108 L ++G ++AVA + P + L G VGRS+A A VP Sbjct: 67 LFADLRCSGEAKAGREGLRVEAVAASTRPRPEEGAYLPVFLAGEAVGRSVAAAQGVPFFA 126 Query: 109 VHHMEGHLLAPMLEDNPPEFP-------FVALLVSGGHTQLISVT-----GIGQYELLGE 156 H EGH++A + E F+++ +SGG T+L+ V + E LG Sbjct: 127 TTHQEGHIMAGIASLEDREQAEALLKKGFLSVHLSGGTTELLRVRFDGASAVFSIEKLGA 186 Query: 157 SIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGR--------------FVFPRPMT 202 + D AG+ D+ LGL +P GP L +AAQ GR P P + Sbjct: 187 TTDLHAGQLVDRVGVALGLPFPAGPHLEALAAQCDGGRCAAEGAAEGSTEAIEAIPFPAS 246 Query: 203 DRPGLDFSFSGLKTFAANTIRDNGTDDQ--------------------TRADIARAFEDA 242 + G + SFSG + A I ++ IAR E Sbjct: 247 VK-GYNVSFSGAEAQALRLIEKWRKANEAASPAAIATLPGDPAHPGIPALPAIARGIEGC 305 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR--RGEVFYARPEFCTD 300 + TL +RA+ +TG + +++ GGV+AN LR +L E ++ R G + +A D Sbjct: 306 LASTLEKILRRAIAETGCRDVLIVGGVAANGFLRRRLRERLEHRAVGGRLAFATTALSGD 365 Query: 301 NGAMIAYAGMVRFKA 315 N A +A G A Sbjct: 366 NAAGVALLGAKFLSA 380 >UniRef50_UPI000187E9E4 hypothetical protein MPER_08009 n=1 Tax=Moniliophthora perniciosa FA553 RepID=UPI000187E9E4 Length = 276 Score = 178 bits (451), Expect = 4e-43, Method: Composition-based stats. Identities = 61/283 (21%), Positives = 109/283 (38%), Gaps = 43/283 (15%) Query: 4 LGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 LG+E S ++ G I D +L+N ++ + + G P + H + +I Sbjct: 21 LGLEGSANKLGAGIIKHSEDGSATVLSNIRHTYITPPGE--GFQPRDTALHHREWAMKVI 78 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 L ++ ++ D+D + YT GPG+ L A V R+L+ +D P + V+H GH+ Sbjct: 79 DECLTKAEVSMHDLDCICYTKGPGMGAPLQSVALVARTLSMLFDKPIVGVNHCVGHIEMG 138 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 ++G ++ G+Y + F L D Sbjct: 139 R-------------EITGAQNPVVLYVSRGEY---------PSDSVFAAMLSYLWRD--T 174 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG------------T 227 G + + GR + P P + G+D S SG+ + D Sbjct: 175 GHCWYNIEQESKKGRRLLPLPYATK-GMDISLSGVLSSVEAYTNDKMFRQTPTSDEEKDE 233 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVS 270 T AD+ + ++ V L+ +RA+ G K +++ GGV Sbjct: 234 SVITPADLCFSLQETVFAMLVEITERAMAHIGSKEVLIVGGVG 276 >UniRef50_D2EF31 O-sialoglycoprotein endopeptidase (Fragment) n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EF31_9EURY Length = 242 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 61/187 (32%), Positives = 98/187 (52%), Gaps = 7/187 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGIE++ G+ I +++K ++AN+ + L GG++P A+ H + +I+ A Sbjct: 26 TLGIESTAHTFGVGISENDK-IIANERDT---LKPTSGGIIPREAAMHHFKLAPEIIKRA 81 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L +SGL KDID A++ GPG++ AL VGA V L+ + I V+H HL L Sbjct: 82 LDKSGLKLKDIDLFAFSQGPGIIPALKVGAQVSTFLSNKYKKKLIGVNHCIAHLEIARLY 141 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + V L VSGG+TQ+I+ G Y + GE+ D G DK + + + +P G Sbjct: 142 TKLKDP--VMLYVSGGNTQIITYYN-GTYIVFGETQDIGIGNLIDKIGRRMDIPFPDGTK 198 Query: 183 LSKMAAQ 189 + + + Sbjct: 199 IEETCHE 205 >UniRef50_Q8IJ99 Glycoprotease, putative n=5 Tax=Plasmodium RepID=Q8IJ99_PLAF7 Length = 598 Score = 167 bits (423), Expect = 7e-40, Method: Composition-based stats. Identities = 51/190 (26%), Positives = 97/190 (51%), Gaps = 7/190 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE S ++ GI+I +++ +L N + + G +P S H + +I++ Sbjct: 17 ILGIEGSANKLGISIINEDMNILVNMRRTYIS--EIGCGFIPREISAHHKYYIIDMIKSC 74 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 LK+ + DI + YT GPG+ AL +G + + L +++P + V+H H+ + Sbjct: 75 LKKVNIKISDITLICYTKGPGIGSALYIGYNIAKILYSYFNIPVVGVNHCIAHIEMGIFI 134 Query: 123 DNPPEFPFVALLVSGGHTQLISVTG-IGQYELLGESIDDAAGEAFDKTAKLLGL--DYPG 179 + + L VSG +TQ+I +YE++GE++D A G D++A++L + Sbjct: 135 TKL--YNPIVLYVSGSNTQIIYYNDHKKKYEIIGETLDIAIGNVIDRSARILKISNAPSP 192 Query: 180 GPLLSKMAAQ 189 G + +A + Sbjct: 193 GYNVELLARK 202 Score = 117 bits (294), Expect = 5e-25, Method: Composition-based stats. Identities = 26/115 (22%), Positives = 57/115 (49%), Gaps = 4/115 (3%) Query: 224 DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM 283 D +++ + I + + + L+ +RA+ T K +++ GGV N L+ + +M Sbjct: 479 DLTEEEKRKIQICYSLQHHIFSMLIEITERAIAFTNSKEVIIVGGVGCNLFLQNMMKKMA 538 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL----GVSVRPRWPLAEL 334 K++ ++ + +C DNGAMIAY G + + D+ +++ R+ ++ Sbjct: 539 KQKNIKIGFMDHSYCVDNGAMIAYTGYLEYLHAKNKDIYNFNNITIHQRYRTDDV 593 >UniRef50_B2WBX5 Glycoprotease pgp1, mitochondrial n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2WBX5_PYRTR Length = 417 Score = 163 bits (414), Expect = 7e-39, Method: Composition-based stats. Identities = 64/227 (28%), Positives = 97/227 (42%), Gaps = 58/227 (25%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLH-------ADYGGVVPELASRDHVRKT 55 L IETSCD+T +A+ +KG ++ +Q+ H ++Y GV P ++ + H Sbjct: 2 TLAIETSCDDTSVAVV--KKGCKNDRTTAQILFHKKVTSNNSEYQGVHPIVSLQSHQESL 59 Query: 56 VPLIQAALK--------------ESGLTAKDI------DAVAYTAGPGLVGALLVGATVG 95 L+ A++ +G DI D V+ T GPG+ L G Sbjct: 60 ATLVGEAIRCLPMQDGELPSEDDRTGPIPVDITTRTLPDFVSVTRGPGMRSNLFTGLDTA 119 Query: 96 RSLAFAWDVPAIPVHHMEGHLLAPML-----------------------------EDNPP 126 + LA AW P + VHHM+ H L L P Sbjct: 120 KGLAVAWQKPLVGVHHMQAHALTSRLVSALDAYKELNEPEAECLPNGTIGRNPTQAHVSP 179 Query: 127 EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 +FPF+++L SGGHT LI + + +LG + D A GE DK A+++ Sbjct: 180 DFPFLSVLASGGHTLLIHSASLTDHRVLGSTNDIAIGECLDKIARVV 226 Score = 53.3 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 29/122 (23%), Positives = 41/122 (33%), Gaps = 27/122 (22%) Query: 184 SKMAAQGTAGRFVFPRPMTDRPG------LDFSFSGLKTFAANTIR-------------- 223 M TA + RP+T G L+ SFSG+ T +R Sbjct: 296 DSMIRNVTAWGWALNRPLTKSGGGIKINSLEMSFSGITTMIERIVRYGMDPITRKLNKKE 355 Query: 224 --DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLR 276 + R D+AR A + + + L +VMAGGV+AN R Sbjct: 356 RAATEVSLEERRDLARETMRAAFEHVASRVVLGLQSQQELLEANPAVVMAGGVAANSFFR 415 Query: 277 AK 278 Sbjct: 416 HM 417 >UniRef50_A5KDZ1 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Plasmodium vivax RepID=A5KDZ1_PLAVI Length = 574 Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats. Identities = 45/189 (23%), Positives = 91/189 (48%), Gaps = 7/189 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LG+E S ++ G++I + +L N + + G +P + H + +I+ Sbjct: 20 ILGLEGSANKLGVSIINSNFEILVNMRRTYIS--EIGCGFIPRQINAHHKYYIIEMIKDC 77 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L + + D+ + YT GPG+ AL + + + + +++P I V+H H+ + Sbjct: 78 LTKLKIKITDVHLICYTKGPGIGSALYIAYNISKFFSLLFNIPVIGVNHCIAHIEMGIFI 137 Query: 123 DNPPEFPFVALLVSGGHTQLISVTG-IGQYELLGESIDDAAGEAFDKTAKLLGL--DYPG 179 + + L VSG +TQ+I +YE++GE++D A G D++A++L + Sbjct: 138 TKL--YHPIILYVSGSNTQIIYFNDHKKRYEIIGETLDIAIGNVIDRSARILRISNSPSP 195 Query: 180 GPLLSKMAA 188 G + +A Sbjct: 196 GYNVEILAR 204 Score = 113 bits (284), Expect = 8e-24, Method: Composition-based stats. Identities = 26/112 (23%), Positives = 56/112 (50%), Gaps = 4/112 (3%) Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 +++ + I + + + L+ +RA+ T K +++ GGV N L+ + +M K++ Sbjct: 458 DEEKRKIQICYSLQHHIFSMLIEITERAIAFTNSKEVIIVGGVGCNVFLQNMMKKMAKQK 517 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL----GVSVRPRWPLAEL 334 ++ + +C DNGAMIAY G + F ++ +S+ R+ ++ Sbjct: 518 NIKIGFMDHSYCVDNGAMIAYTGYLEFANTKNREIYGFDNISIHQRYRTDDV 569 >UniRef50_C5KJ57 Putative uncharacterized protein (Fragment) n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KJ57_9ALVE Length = 203 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 51/155 (32%), Positives = 79/155 (50%), Gaps = 17/155 (10%) Query: 192 AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKC 251 A R + + G DFSF+GLKT + I ++ D+A +F+ VD L+ + Sbjct: 7 AKPLAKTRDLELKNGCDFSFAGLKTSMRHLIEGGK---YSKPDMAASFQKRCVDHLVERA 63 Query: 252 KRALD-----QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIA 306 RA+D K LV+AGGV+AN+++R+ + E+ K++ + CTDNG M+A Sbjct: 64 GRAIDWALEIDDSIKDLVVAGGVAANKSVRSNMQELAKEKGLTLHCPPTRLCTDNGTMVA 123 Query: 307 YAGMVRFKAG---------ATADLGVSVRPRWPLA 332 + + K G +A+ V VRPRWPL Sbjct: 124 WNAIEHLKEGLYERAPCTAESAEKFVEVRPRWPLG 158 >UniRef50_Q0V4Z5 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0V4Z5_PHANO Length = 497 Score = 146 bits (370), Expect = 9e-34, Method: Composition-based stats. Identities = 70/244 (28%), Positives = 98/244 (40%), Gaps = 62/244 (25%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLH-------ADYGGVVPELASRDHVR 53 + L IETSCD+T +AI EK + + +Q+ H A+Y GV P ++ R H Sbjct: 28 LMTLAIETSCDDTSVAIV--EKKVENGRAVAQLHFHKKVTANNAEYQGVHPLVSLRSHQE 85 Query: 54 KTVPLIQAALKESGLTAK--------------------DI------DAVAYTAGPGLVGA 87 L+ A+ D+ D V+ T GPG+ Sbjct: 86 NLADLVSEAISHLPPKTASRDHDFEHGGLEAQRPEAVLDVTKKRLPDFVSVTRGPGMRSN 145 Query: 88 LLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDN---------PPEFPFVALLVSGG 138 L G + LA AW + H L P L P+FPF+++L SGG Sbjct: 146 LFTGLDTAKGLAVAW----------QAHALTPRLVSALEPSATPTLEPDFPFLSVLASGG 195 Query: 139 HTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--------GLDYPGGPLLSKMAAQG 190 HT LI + + LLG + D A GE DK A++L G LL K A +G Sbjct: 196 HTLLIQSASLNDHHLLGTTNDIAVGEYLDKVARILLPTELLQSTRSTMYGALLEKFAFEG 255 Query: 191 TAGR 194 A + Sbjct: 256 NASQ 259 Score = 133 bits (336), Expect = 7e-30, Method: Composition-based stats. Identities = 46/155 (29%), Positives = 68/155 (43%), Gaps = 23/155 (14%) Query: 203 DRPGLDFSFSGLKTFAANTI--------RDNGTDD--------QTRADIARAFEDAVVDT 246 ++ SFSGL T I R + + + IAR A + Sbjct: 336 KVNDIELSFSGLLTAVERVIGYQTDPVTRKRTKIERTLDEISLEEKKHIAREAMRAAFEH 395 Query: 247 LMIKCKRALD----QTGFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCT 299 + + AL + +V+AGGV+AN LR LA + R +++ P FCT Sbjct: 396 VAYRVVLALRSLASDPAPRSVVLAGGVAANSFLRHILASTLCARGFSHINLYFPPPSFCT 455 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 DN AMIA+ G+ F+AG T L + +WPL +L Sbjct: 456 DNAAMIAWTGIEMFEAGHTDTLSIRAIRKWPLNQL 490 >UniRef50_A9UYP5 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UYP5_MONBE Length = 230 Score = 145 bits (366), Expect = 3e-33, Method: Composition-based stats. Identities = 45/141 (31%), Positives = 68/141 (48%), Gaps = 10/141 (7%) Query: 201 MTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG- 259 D DFSFSGLKT A N + D+ +A +F+ + D L+++ +RAL Sbjct: 68 TQDHSNCDFSFSGLKTRAINLSSEYAKRDE-LPLLAASFQRTIADHLLVRLERALRFCDQ 126 Query: 260 ----FKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA 315 +R V AGGV N +R +L + V + P C DNG MIA+AG++ F Sbjct: 127 QGRRPRRFVAAGGVLCNAYIRQRLHAFARFHDLPVEFPAPPLCVDNGVMIAWAGLLHFLR 186 Query: 316 GATA----DLGVSVRPRWPLA 332 G ++ + P+WP+ Sbjct: 187 GTSSVARDPQALRYHPKWPIG 207 Score = 44.4 bits (104), Expect = 0.007, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 24/37 (64%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADY 39 VLGIE++ D+T I I + ++ +LA+ +Q + D+ Sbjct: 40 VLGIESTFDDTAIGIVNHQRQILADVRRTQDHSNCDF 76 >UniRef50_C1BYL4 Probable O-sialoglycoprotein endopeptidase 2 n=1 Tax=Esox lucius RepID=C1BYL4_ESOLU Length = 235 Score = 144 bits (365), Expect = 3e-33, Method: Composition-based stats. Identities = 48/190 (25%), Positives = 79/190 (41%), Gaps = 26/190 (13%) Query: 169 TAKLLGL-DYP------GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANT 221 A+ L L +P GG + +A G +F F PM +FSF+GL+ T Sbjct: 23 VARRLPLIKHPKCSTISGGQAIELLAQDGDRLKFHFRPPMGAHYDCNFSFAGLRNQVKMT 82 Query: 222 IRDNGTDD--------QTRADIARAFEDAVVDTLMIKCKRALDQTGFK--------RLVM 265 I+ ++ DIA A + V + + RA+ + LV+ Sbjct: 83 IQKKEAEEGVEPGTLLSCVNDIAAAMQHTVAFHIAKRTHRAILFCKAQGLLPSFNPTLVV 142 Query: 266 AGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSV 325 +GGV++N+ +R L + ++ + C DNG MIA+ G+ R + G + V Sbjct: 143 SGGVASNQYIRKTLKIVTDATGLDLLCPPSKLCNDNGVMIAWNGVERLREGKGILFYIDV 202 Query: 326 ---RPRWPLA 332 P+ PL Sbjct: 203 VRYEPKAPLG 212 >UniRef50_D1IQV9 Whole genome shotgun sequence of line PN40024, scaffold_2082.assembly12x (Fragment) n=4 Tax=Eukaryota RepID=D1IQV9_VITVI Length = 151 Score = 143 bits (361), Expect = 9e-33, Method: Composition-based stats. Identities = 42/132 (31%), Positives = 68/132 (51%), Gaps = 2/132 (1%) Query: 207 LDFSFSGLKTFA-ANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVM 265 +D SFSGL ++ A + ++ T AD+ + ++ V L+ +RA+ K +++ Sbjct: 1 MDVSFSGLLSYIEATAVEKLQNNECTPADLCYSLQETVFAMLVEITERAMAHCDKKDVLI 60 Query: 266 AGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS- 324 GGV N L+ + M +R G +F +C DNGAMIAY G++ + GAT L S Sbjct: 61 VGGVGCNERLQEMMRVMCSERSGRLFATDDRYCIDNGAMIAYTGLLAYAHGATTPLEEST 120 Query: 325 VRPRWPLAELPA 336 R+ E+ A Sbjct: 121 FTQRFRTDEVHA 132 >UniRef50_B2AYU1 Predicted CDS Pa_1_12230 (Fragment) n=1 Tax=Podospora anserina RepID=B2AYU1_PODAN Length = 290 Score = 141 bits (357), Expect = 3e-32, Method: Composition-based stats. Identities = 69/239 (28%), Positives = 107/239 (44%), Gaps = 40/239 (16%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKL-HADYGGVVPELASRDHVRKTVPLI 59 + + IETSCD+T +AI + Q ++ H ++ G+ P +AS+ H + L+ Sbjct: 40 LLTIAIETSCDDTCVAILEKAGPAARLQFNKRIPSNHVEFKGIHPTIASKSHEIQLAKLV 99 Query: 60 QAALKESG-----------LTAKDI-----------DAVAYTAGPGLVGALLVGATVGRS 97 A++ ++ +D D V+ T GPG L VG V + Sbjct: 100 NEAVQSLPKHTNHSPEVKTISIRDPQTGKSTPRRLPDFVSVTRGPGFPRCLDVGLGVAKG 159 Query: 98 LAFAWDVPAIPVHHMEGHLLAPMLED----------------NPPEFPFVALLVSGGHTQ 141 L+ AW VP + VHHM+GH L P L+ P+FPF+ LL SGGHTQ Sbjct: 160 LSVAWQVPFLGVHHMQGHALTPRLDHALQQPFPPSSSTPSSKLSPKFPFLTLLASGGHTQ 219 Query: 142 LISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRP 200 L+ T + + +L + + G+ DK A+ + L L +A +F FP P Sbjct: 220 LLLSTTLTTHTILATVTNISLGDMLDKAAREI-LPPSLLSSLPNIAYAAALEQFAFPSP 277 >UniRef50_C9SIA9 Glycoprotease pgp1 n=2 Tax=Sordariomycetes RepID=C9SIA9_VERA1 Length = 208 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. Identities = 44/173 (25%), Positives = 69/173 (39%), Gaps = 10/173 (5%) Query: 170 AKLLGLDY-PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR-DNGT 227 + + Y P ++A + + P + + FSG + Sbjct: 17 GRAVDYAYTPPRIRADEIATYRSPHGWHLKPPFATTRRMRYDFSGFGSQVQRIAEARPAM 76 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQ-----TGFKRLVMAGGVSANRTLRAKLAEM 282 + R D+ R + + L + AL + LV+AGGV++NR L L Sbjct: 77 SEAERRDLGRDTMRILFEHLASRVVLALGNEEMGLKDVRTLVVAGGVASNRYLMHVLRAF 136 Query: 283 MKKRRG---EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + R E+ E CTDN AMIA+ GM F+AG ++L V +WPL Sbjct: 137 LDVRGYDGIEITAPPVELCTDNAAMIAWTGMEMFEAGYESELSVHSIKKWPLD 189 >UniRef50_B9PG42 O-sialoglycoprotein endopeptidase, putative n=3 Tax=Toxoplasma gondii RepID=B9PG42_TOXGO Length = 1323 Score = 124 bits (311), Expect = 5e-27, Method: Composition-based stats. Identities = 43/99 (43%), Positives = 62/99 (62%), Gaps = 1/99 (1%) Query: 2 RVLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +LGIETSCD+T + I D + +LAN Q +L YGGV P A+ H R+ +++ Sbjct: 146 LILGIETSCDDTCVGIVDWESGRILANICTPQPELLIKYGGVHPSEAAAAHDRRMQSVVR 205 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLA 99 AL+E+G++ DID +A+T GPGLV L VGA+ +A Sbjct: 206 NALQEAGVSLLDIDVIAFTRGPGLVPCLSVGASAALEIA 244 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 7/116 (6%) Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRDNG-TDDQTRADIA 236 GG ++ +MAA G P ++ +P L+FSFSG+K+ A + G D++ + D A Sbjct: 745 GGAMMEQMAASGNDKAVPLPNMLSLKPKTLNFSFSGMKSAFAAAVSKMGRQDEKAKCDFA 804 Query: 237 RAFEDAVVDTLMIKCKRALDQTGF-----KRLVMAGGVSANRTLRAKLAEMMKKRR 287 + + AV L + ++ + F +RL + GGVS N TLR +L ++ + R Sbjct: 805 ASLQAAVFKHLEDQLRKTMWLYEFLEDFPRRLAVVGGVSCNETLRRRLRKLCESRG 860 Score = 44.8 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 32/76 (42%), Gaps = 18/76 (23%) Query: 96 RSLAFAWDVPAI-PVHHMEGHLLAPMLEDNPPEF-----------------PFVALLVSG 137 + ++ VP + V+H+ GHLL+ + + + F+ALLVSG Sbjct: 380 KVVSRGRGVPLVLAVNHLHGHLLSAGRDSSQQQHASDRQGDSCSPASALPPKFLALLVSG 439 Query: 138 GHTQLISVTGIGQYEL 153 GHT V Y L Sbjct: 440 GHTFSCIVKACEDYHL 455 >UniRef50_B9HH45 Predicted protein n=7 Tax=Eukaryota RepID=B9HH45_POPTR Length = 139 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. Identities = 36/128 (28%), Positives = 67/128 (52%), Gaps = 2/128 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG E S ++ G+ + + +L+N ++ + G +P ++ H++ +PLI+ Sbjct: 4 MTALGFEGSANKIGVGVDTLDGTILSNPRHTYITPAGQ--GFLPRETAQHHLQHVLPLIK 61 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +AL+ +G+T+ +ID + YT GPG+ L V A V R L+ W P + V+H H+ Sbjct: 62 SALETAGITSDEIDCLCYTKGPGMGAPLQVSAVVVRVLSQLWKKPIVAVNHCVAHIEMGR 121 Query: 121 LEDNPPEF 128 + + Sbjct: 122 IVTGADDP 129 >UniRef50_C0YUE5 Possible M22 family non-peptidase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YUE5_9FLAO Length = 226 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 44/180 (24%), Positives = 78/180 (43%), Gaps = 25/180 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L +ETS +A+ D+EK L + S+ ++ Sbjct: 1 MKILYLETSSKNCSVAVSDNEKLLCLCEEVSENY---------------KQSESLHTYVE 45 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ +G++ K+I+AV+ GPG L +GA + + VP + V+ +E + P Sbjct: 46 WALEGAGISLKEIEAVSLGKGPGSYTGLRIGAASAKGFCYGLKVPLVAVNSLESMIE-PF 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLISV-----TGIGQYELLGESIDDAAGEAF-DKTAKLLG 174 L DN + + LV ++ + TG E + +D+A+ E F DK +G Sbjct: 105 LGDN---YDLIVPLVDARRMEVYTAVYDGKTGKELSETEAKILDEASFEEFKDKKVLFVG 161 >UniRef50_Q11BE2 Peptidase M22, glycoprotease n=2 Tax=Phyllobacteriaceae RepID=Q11BE2_MESSB Length = 225 Score = 112 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 49/193 (25%), Positives = 83/193 (43%), Gaps = 25/193 (12%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L I+T+ +YD + A+ G + H + +I+ Sbjct: 1 MNILAIDTAAAFCSACVYDAQA--------------AEEKGRAVLDLGKGHAEHIMDVIE 46 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ L+ K++DA+A + GPG L VG + R LA A +PAI V L + Sbjct: 47 KALGQARLSFKEMDAIAVSVGPGSFTGLRVGISAARGLALALKIPAIGVTT-----LEAL 101 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + FP ++L + G + ++ + Q++ G +D G L+ + G Sbjct: 102 AAEARAAFPGRSVLCALGRAEPVA---MAQFDAAGHCLD---GPRLTSVENLVRIAGEGR 155 Query: 181 PLLSKMAAQGTAG 193 PLL AA+ AG Sbjct: 156 PLLVGDAAERIAG 168 >UniRef50_B3PIE3 Glycoprotease family protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PIE3_CELJU Length = 254 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 29/133 (21%), Positives = 59/133 (44%), Gaps = 17/133 (12%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L +++S D +A+ D K G+ E+A++ H ++ +P++ Sbjct: 13 LILALDSSTDACSVALNRDGKL-----------------GIRHEIATKSHTQRLLPMVDE 55 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E G++ ++D +A+ GPG L + + + LA+ +P +PV +E L Sbjct: 56 VLGEEGISVSEVDVIAFGRGPGSFTGLRICMGIVQGLAYGSGIPVVPVSTLEAMALQVYR 115 Query: 122 EDNPPEFPFVALL 134 + P + L Sbjct: 116 QHPEWRGPVMVAL 128 >UniRef50_C6WZF1 Putative glycoprotease family exported protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6WZF1_FLAB3 Length = 239 Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats. Identities = 45/224 (20%), Positives = 83/224 (37%), Gaps = 37/224 (16%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L IETS +AI D ++ L + S+ ++ Sbjct: 14 MKILHIETSSRNCSVAISDGDELLCLCEEVSENY---------------KQSESLHTFVE 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ +G+ D+DAV+ GPG L +G++ + + +P I V+ +E + P Sbjct: 59 WALEGAGIALNDLDAVSLGMGPGSYTGLRIGSSAAKGFCYGLQIPLIAVNSLETMIE-PF 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQY-----ELLGESID-DAAGEAFDKTAKLLG 174 L+ N F F+ L+ ++ + G + ID D+ E K +G Sbjct: 118 LDQN---FDFIVPLLDARRMEVYTAHFDGNSGQMLTQTEASIIDQDSFQEFLGKKVVFVG 174 Query: 175 ---------LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDF 209 L P + + +F+ + + DF Sbjct: 175 DGALKAKGVLQLPDA---EFNSDVYPSAKFLIKKAVEKFRNKDF 215 >UniRef50_C8PCP6 Putative uncharacterized protein n=1 Tax=Lactobacillus iners DSM 13335 RepID=C8PCP6_9LACO Length = 245 Score = 110 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 50/188 (26%), Positives = 77/188 (40%), Gaps = 39/188 (20%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VL I T+ D +A+ D E+ ++A + EL +H + PLI Sbjct: 1 MKVLSITTATDHLSVALTDGEQ-IIAEKN---------------ELGMHNHAERLDPLID 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH----- 115 LK++ LT ++ID A GPG L + T + A + P + V ++ Sbjct: 45 ELLKQNQLTLQEIDRFAVAQGPGSYTGLRISITTAKMFASILNKPLVGVSTLKALAQGVT 104 Query: 116 -----LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTA 170 L++ + N F V L G QL++ G Y L + DK A Sbjct: 105 SNREILISELDARNLNFFAGVYLKEDGQLKQLLAD---GHYNL---------SKLLDKVA 152 Query: 171 KLLGLDYP 178 + L LDYP Sbjct: 153 Q-LELDYP 159 >UniRef50_D1AUR9 Putative endopeptidase n=1 Tax=Anaplasma centrale str. Israel RepID=D1AUR9_ANACI Length = 142 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 35/90 (38%), Positives = 57/90 (63%) Query: 56 VPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L+ A+ +GL D+ A+A T+GPGLVG+L+VG + +++++ P I V+H+E H Sbjct: 25 ASLVSRAMDSAGLGFSDLSAIAVTSGPGLVGSLIVGVMLAKAISYVAGKPIIAVNHLEAH 84 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISV 145 L + + EFPF+ L++SGGH Q + V Sbjct: 85 ALVARMVRDDLEFPFLVLIISGGHCQFLVV 114 >UniRef50_Q9U0J7 Peptidase, M22 family, putative n=3 Tax=Plasmodium RepID=Q9U0J7_PLAF7 Length = 693 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 43/195 (22%), Positives = 75/195 (38%), Gaps = 26/195 (13%) Query: 121 LEDNPPEFPFVALLVSGGHTQLISV----TGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 ++ + ++ +LVSGG T + V + ++D G+ DK +LL L Sbjct: 327 IQTEYMKDGYLCILVSGGSTDVYKVQKDTKNAINVCKISTTMDITIGDVIDKVTRLLELP 386 Query: 177 --YPGGPLLSKMAAQG-------------TAGRFVFPRPMTDRPGLDFSFSGLKTFAANT 221 GGP L K A + FP P + +DFSFSG+ + Sbjct: 387 VGLGGGPFLEKEAQKYLTNLKSASSENLQNDPFQPFPNPFSTNNIIDFSFSGIYNHMSKI 446 Query: 222 IRDNGTD---DQTRADIARAFEDAVVDTLMIKCKRAL----DQTGFKRLVMAGGVSANRT 274 I+ ++ ++ + A + + L+ + + + K + + GGV N Sbjct: 447 IKKLKSEKSFEKEKGRYAYYCQKNIFHHLLKQVNKIMYFSELHFNIKNVFIVGGVGCNNF 506 Query: 275 LRAKLAEMMKKRRGE 289 L L +M KR E Sbjct: 507 LYQSLKDMAAKRDNE 521 Score = 66.7 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 53/122 (43%), Gaps = 3/122 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 ++GIE +CD+T I + D + ++ N + S K+ Y GV P S + + Sbjct: 87 IVGIENTCDDTCICVIDTDLNIIKNVIISHYKVVHSYEGVYPFFISSLNSLFLKHYVNKI 146 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATV-GRSLAFAWDVPAIPVHHMEGHLLAPML 121 L + K + +++ PG+ + G ++ V+H+ H+L+P+ Sbjct: 147 LD--NIDPKHVICYGFSSCPGIAKNMEAAKNYIGEKKKQNENIKISAVNHIFAHILSPLF 204 Query: 122 ED 123 + Sbjct: 205 FN 206 >UniRef50_B2A5P8 Peptidase M22 glycoprotease n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A5P8_NATTJ Length = 236 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 27/117 (23%), Positives = 50/117 (42%), Gaps = 16/117 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGI+T+ +A+ D K L+ + + + H + +PLI Sbjct: 1 MKVLGIDTATKTCCVALIDGNK-LMGEFILNNFQT---------------HSERLMPLID 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L G+ +I+ +A + GPG L +G + LA ++P + V ++ Sbjct: 45 KLLDSLGIKIDEIEGIAVSRGPGAFTGLRIGIGTAQGLAMGNEIPLVGVSTLDALAY 101 >UniRef50_Q5E439 Predicted peptidase n=5 Tax=Vibrionaceae RepID=Q5E439_VIBF1 Length = 233 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 31/119 (26%), Positives = 59/119 (49%), Gaps = 17/119 (14%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++L ++T+ + +A+ D K +YS+ + A R+H K +P + Sbjct: 4 KILAVDTATENCSVALIVDGK------VYSRRAV-----------APREHTIKILPFVDE 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 LKE+G+ +D+DA+A+ GPG + +G + + LAF D+P + + +E A Sbjct: 47 VLKEAGVRLQDLDALAFGQGPGSFTGVRIGIGIAQGLAFGADLPMVGISTLEAMAQAGY 105 >UniRef50_Q5ZU86 Glycoprotease (O-sialoglycoprotein endopeptidase) n=6 Tax=Legionella RepID=Q5ZU86_LEGPH Length = 223 Score = 106 bits (266), Expect = 9e-22, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 59/134 (44%), Gaps = 17/134 (12%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L I+TS + +AI + +++ + SQ R H + +P+I Sbjct: 2 MKLLAIDTSTELASVAILIGD-EIISREQDSQ----------------RIHAQLILPMID 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + ++GL +D + + GPG L + ++ + LA+A D+P +PV + Sbjct: 45 ELIAQTGLGLNQLDGIIFGCGPGSFTGLRIACSIAKGLAYANDLPLVPVSSLAAIAWTAR 104 Query: 121 LEDNPPEFPFVALL 134 P +++L Sbjct: 105 EIKEDFNQPVLSVL 118 >UniRef50_A0YCJ3 Inactive metal-dependent protease-like protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YCJ3_9GAMM Length = 236 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 54/111 (48%), Gaps = 13/111 (11%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L ++ S + +A+ DD G ++ ELA R H ++ +PL++ Sbjct: 1 MKLLALDCSTEACSVALLDDSSGNISIDEIF-------------ELAPRQHTQRILPLVE 47 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHH 111 L +S ++ ++DA+AY GPG L + + LA+ ++P + V Sbjct: 48 QLLSDSHVSLNELDAIAYGRGPGSFTGLRICLGAVQGLAYGAELPVVGVST 98 >UniRef50_P76256 M22 peptidase homolog yeaZ n=236 Tax=Gammaproteobacteria RepID=YEAZ_ECOLI Length = 231 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 31/127 (24%), Positives = 60/127 (47%), Gaps = 17/127 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+L I+T+ + +A+++D ++A + EL R+H ++ +P++Q Sbjct: 1 MRILAIDTATEACSVALWNDG------------TVNAHF-----ELCPREHTQRILPMVQ 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L SG + DI+A+AY GPG + +G + + LA ++P I V + Sbjct: 44 DILTTSGTSLTDINALAYGRGPGSFTGVRIGIGIAQGLALGAELPMIGVSTLMTMAQGAW 103 Query: 121 LEDNPPE 127 ++ Sbjct: 104 RKNGATR 110 >UniRef50_A3HX68 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HX68_9SPHI Length = 230 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 59/142 (41%), Gaps = 18/142 (12%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++L +ETS +A++D + G+ + H K + LI+ Sbjct: 3 KILSLETSTPVCSVALHDSGNIM----------------GLKEIEENGAHSEKLIKLIEE 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E + K++DA+A + GPG L +G + + LAFAW P I V + L Sbjct: 47 LLDELQVDRKEVDAIAVSEGPGSYTGLRIGVSTAKGLAFAWGKPLIAVSTLAALARGATL 106 Query: 122 EDNPPEFPFVALLVSGGHTQLI 143 ++N V ++ ++ Sbjct: 107 DENNSS--VVIAMLDARRMEVY 126 >UniRef50_UPI00019087BD O-sialoglycoprotein endopeptidase n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI00019087BD Length = 120 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 35/99 (35%), Positives = 46/99 (46%), Gaps = 3/99 (3%) Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 I R + + + R +D+ +R + GGV+AN LRA L + K Sbjct: 8 IGRGLQR--FKDGISRGDRPIDRRESRRWSLPGGVAANLELRATLQALCDKNGFRFIAPP 65 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLA 332 CTDN MIA+AG+ R GA D L V R RWPL Sbjct: 66 LSLCTDNAVMIAWAGLERMATGAAPDPLDVQPRSRWPLD 104 >UniRef50_B8I821 Peptidase M22 glycoprotease n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I821_CLOCE Length = 236 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 43/168 (25%), Positives = 75/168 (44%), Gaps = 27/168 (16%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+L ++TS + AI +DE + G + H ++ +P++Q Sbjct: 1 MRILAVDTSTNVASAAILEDEVII----------------GEYNCNRGKTHSQRLMPMVQ 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL--- 117 ++ +GLT DIDA + + GPG L +G T +++AFA + P I VH ++ Sbjct: 45 HLMETAGLTVSDIDAFSASIGPGSFTGLRIGVTTIKAMAFAAEKPVISVHTLDALAYNIP 104 Query: 118 ------APMLE-DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESI 158 PM++ N F + + G +L GI E L +++ Sbjct: 105 FAENLVCPMIDARNNQVFTAIYRFIGGKLERLTEYLGIPVTE-LADTL 151 >UniRef50_P43990 Probable M22 peptidase homolog HI0388 n=24 Tax=Pasteurellaceae RepID=Y388_HAEIN Length = 236 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 57/120 (47%), Gaps = 17/120 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +L ++TS + +A+ Y K H + ELA R H ++ +P+I Sbjct: 4 LTLLALDTSTEACSVALL-----------YRGEKTH------INELAQRTHTKRILPMID 46 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L SGL +DA+A+ GPG + VGA + + LAF D+P IP+ ++ A Sbjct: 47 EILANSGLGLNQVDALAFGRGPGSFTGVRVGAGIAQGLAFGADLPVIPISNLTAMAQAAF 106 >UniRef50_A5FJB4 Peptidase family M22-like protein n=10 Tax=Flavobacteriales RepID=A5FJB4_FLAJ1 Length = 223 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 28/131 (21%), Positives = 55/131 (41%), Gaps = 15/131 (11%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IET+ ++I + + +L ++ H K I+ A Sbjct: 4 ILNIETATKNCSVSIAKNGETILCKEIA---------------EEGYSHAEKLHVFIEEA 48 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + ESG++ +D++AVA + GPG L +G + + L +A ++P I V ++ + Sbjct: 49 IAESGVSIQDLNAVAVSQGPGSYTGLRIGVSAAKGLCYALNIPLIAVDTLQTLASKAKIS 108 Query: 123 DNPPEFPFVAL 133 + A Sbjct: 109 EGKIIPMLDAR 119 >UniRef50_A8SIF9 Putative uncharacterized protein n=1 Tax=Parvimonas micra ATCC 33270 RepID=A8SIF9_9FIRM Length = 227 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 67/173 (38%), Gaps = 27/173 (15%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L I+TS + A+ +D ++ + +Q S H + ++ Sbjct: 1 MKILAIDTSTTHSSCAVMEDN-NIVGDFSINQ---------------SMSHNEILLVMVD 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 LK+ + +DID GPG + +G TV ++LA A + P + V+ +E Sbjct: 45 EMLKKLNIDIEDIDLFVAVTGPGSFTGIRIGVTVVKALAMALNKPIVAVNTLEALSFGVF 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLIS--VTGIGQYELLGE---SIDDAAGEAFDK 168 + L+ ++ G+ ++ +ID+ E DK Sbjct: 105 TDKKKIP------LIDARGERVYYGVYEGLENKNIVAPALLTIDELLEEFLDK 151 >UniRef50_Q31G60 Peptidase M22 glycoprotease family protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31G60_THICR Length = 223 Score = 104 bits (259), Expect = 7e-21, Method: Composition-based stats. Identities = 33/127 (25%), Positives = 55/127 (43%), Gaps = 17/127 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VL +E+S + + DEK V E+A + H +P+++ Sbjct: 1 MNVLAVESSTKACSVCLKVDEKAY-----------------VEFEMAPQRHANLMLPMVE 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L +SG+T DI A+A++ GPG + + A V + LA W P + V +E Sbjct: 44 KVLNQSGITPDDIHALAFSEGPGAFTGIRIAAGVTQGLALGWGKPVLAVSTLEALAWQAY 103 Query: 121 LEDNPPE 127 + N + Sbjct: 104 KDTNQTD 110 >UniRef50_B0KBT6 Peptidase M22, glycoprotease n=11 Tax=Thermoanaerobacterales RepID=B0KBT6_THEP3 Length = 230 Score = 103 bits (258), Expect = 8e-21, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 66/163 (40%), Gaps = 24/163 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L I++S +A+ D EKG++ + ++ H +P+I Sbjct: 1 MKILAIDSSSKTATVALVD-EKGIIGEYSINYLR----------------HSVILMPMID 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 LK+ + I VA + GPG L +GA + LA A ++P + V + Sbjct: 44 ELLKKCEVPINQITHVAVSEGPGSFTGLRIGAATAKGLAHALNIPIVGVSSLLALAYNVS 103 Query: 121 LEDNPPEFPFVAL-------LVSGGHTQLISVTGIGQYELLGE 156 + AL L+ GG+ +++ G+ E + E Sbjct: 104 EFEGLICPVIDALNENVYGMLIRGGNFEVLIDAGVYSLEEITE 146 >UniRef50_A0NUI5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NUI5_9RHOB Length = 225 Score = 103 bits (258), Expect = 9e-21, Method: Composition-based stats. Identities = 56/228 (24%), Positives = 81/228 (35%), Gaps = 40/228 (17%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVL I+T+ A+ DD + E R H K + +I Sbjct: 1 MRVLAIDTALANCAAAVLDDGAETACFEACG-------------EEIGRGHAEKLMDMIG 47 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + ES T ++D VA T GPG L VG V R P + V + Sbjct: 48 EVMAESSTTFSELDRVAVTIGPGSFTGLRVGLAVARGFGLVLGKPVVGVTTLAA---IAR 104 Query: 121 LEDNPPEFPFVALLVSGG----HTQLISVTGIGQYELLGESI-DDA---------AGEAF 166 E V + ++G + QL +G E +I D A AG A Sbjct: 105 SAVPGDEGGPVLVALTGKADEVYCQLYHASGTAADEAGVRTIADLAANLPKDVRLAGSAS 164 Query: 167 DKTAKLLGLD---------YPGGPLLSKMAAQGTAGRFVFPRPMTDRP 205 +K A+ LGL +PG ++++ R P P+ RP Sbjct: 165 EKIARELGLPESAILSRSTFPGIRDVAELGISADPSRSS-PSPLYLRP 211 >UniRef50_Q0JNG2 Os01g0295900 protein n=5 Tax=Oryza sativa RepID=Q0JNG2_ORYSJ Length = 133 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 30/79 (37%), Positives = 44/79 (55%), Gaps = 9/79 (11%) Query: 264 VMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT----- 318 V++GGV++N+ +R L ++ +K ++ P CTDNG MIA+ G+ F AG Sbjct: 25 VVSGGVASNQYVRTHLNQIAEKNGLQLVCPPPRLCTDNGVMIAWTGIEHFIAGRFDDPPA 84 Query: 319 ----ADLGVSVRPRWPLAE 333 DL +RPRWPL E Sbjct: 85 VDEPDDLQYDLRPRWPLGE 103 >UniRef50_Q3SMU0 Peptidase M22, glycoprotease n=14 Tax=Bradyrhizobiaceae RepID=Q3SMU0_NITWN Length = 232 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 59/143 (41%), Gaps = 17/143 (11%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VL I+T+ D I D ++G L Q +K R H +PLI Sbjct: 1 MLVLAIDTALDACAAGILDTDEGRLIAQETLPMK--------------RGHAETLMPLIA 46 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ++ +G+ K +D +A T GPG L VG + R +A A D P + V + + Sbjct: 47 RVIESAGIGFKALDRIAATTGPGSFTGLRVGLSAARGIALAADKPVVGVTTLAAFAAPVI 106 Query: 121 LEDNPPEFPFVALLVSGGHTQLI 143 ED P V + H Q+ Sbjct: 107 GEDRK---PPVISAIDARHDQVY 126 >UniRef50_Q18CP2 Putative glycoprotease n=9 Tax=Clostridium RepID=Q18CP2_CLOD6 Length = 238 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 29/113 (25%), Positives = 57/113 (50%), Gaps = 16/113 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LG++TS +A+ +D+K + + ++ + H +K +P+I+ Sbjct: 1 MKILGMDTSSMAASVAVVEDDKLICEFTVNNK----------------KTHSQKLMPMIE 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 L S L+ KD+D +A GPG L +G +++A ++P I V+ +E Sbjct: 45 NMLSMSDLSIKDMDLLAVCIGPGSFTGLRIGMATVKAMAHVNNIPIIAVNSLE 97 >UniRef50_A8U9X9 Glycoprotein endopeptidase n=1 Tax=Carnobacterium sp. AT7 RepID=A8U9X9_9LACT Length = 240 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 62/144 (43%), Gaps = 21/144 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VL I+TS IA+ DDEK + G + R+H + +P I Sbjct: 1 MKVLAIDTSNQAMSIAVLDDEKVI----------------GEITTNIKRNHSERLMPAID 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +K+ + +++ + GPG L +G TV ++LA+ V + + ++ +LA Sbjct: 45 ELMKDVQWQSSELNRIVVAKGPGSYTGLRIGVTVAKTLAWTLGVELVGISSLK--ILAGN 102 Query: 121 LEDNPPEFPFVALLVSGGHTQLIS 144 E +P ++ L + + Sbjct: 103 CESSPH---YLVPLFDARRKNIYT 123 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q7MNZ9 Probable O-sialoglycoprotein endopeptidase n=19 ... 432 e-120 UniRef50_B0TIN7 Probable O-sialoglycoprotein endopeptidase n=130... 425 e-118 UniRef50_P36175 O-sialoglycoprotein endopeptidase n=366 Tax=cell... 414 e-114 UniRef50_C0QTG9 Probable O-sialoglycoprotein endopeptidase n=3 T... 409 e-113 UniRef50_C4Z311 O-sialoglycoprotein endopeptidase n=13 Tax=Bacte... 405 e-111 UniRef50_Q18CP0 Probable O-sialoglycoprotein endopeptidase n=22 ... 404 e-111 UniRef50_Q8D283 Probable O-sialoglycoprotein endopeptidase n=11 ... 399 e-110 UniRef50_A5G3X1 Probable O-sialoglycoprotein endopeptidase n=20 ... 397 e-109 UniRef50_Q8RC98 Probable O-sialoglycoprotein endopeptidase n=12 ... 397 e-109 UniRef50_B2V910 Probable O-sialoglycoprotein endopeptidase n=4 T... 396 e-109 UniRef50_Q2RGJ3 Probable O-sialoglycoprotein endopeptidase n=10 ... 395 e-109 UniRef50_A7HLB0 Probable O-sialoglycoprotein endopeptidase n=3 T... 394 e-108 UniRef50_D1B623 Metalloendopeptidase, glycoprotease family n=3 T... 393 e-108 UniRef50_C1TLM6 O-sialoglycoprotein endopeptidase n=1 Tax=Dethio... 393 e-108 UniRef50_Q4FNV6 Probable O-sialoglycoprotein endopeptidase n=15 ... 388 e-106 UniRef50_A1AXM9 Probable O-sialoglycoprotein endopeptidase n=36 ... 388 e-106 UniRef50_B3WUZ1 Probable O-sialoglycoprotein endopeptidase n=1 T... 386 e-106 UniRef50_B6BRQ7 O-sialoglycoprotein endopeptidase n=1 Tax=Candid... 385 e-105 UniRef50_Q6AL73 Probable O-sialoglycoprotein endopeptidase n=3 T... 385 e-105 UniRef50_Q0AVU0 Probable O-sialoglycoprotein endopeptidase n=27 ... 381 e-104 UniRef50_Q11TP2 Probable O-sialoglycoprotein endopeptidase n=87 ... 379 e-104 UniRef50_C6P1W3 Metalloendopeptidase, glycoprotease family n=1 T... 379 e-104 UniRef50_Q6MQ48 Probable O-sialoglycoprotein endopeptidase n=1 T... 378 e-103 UniRef50_B8BPP0 Putative uncharacterized protein n=1 Tax=Oryza s... 378 e-103 UniRef50_B0TX13 Probable O-sialoglycoprotein endopeptidase n=19 ... 378 e-103 UniRef50_D0ME01 Metalloendopeptidase, glycoprotease family n=4 T... 377 e-103 UniRef50_A0L5L8 Probable O-sialoglycoprotein endopeptidase n=24 ... 376 e-103 UniRef50_A5CE49 Probable O-sialoglycoprotein endopeptidase n=2 T... 376 e-103 UniRef50_B9KXJ0 Probable O-sialoglycoprotein endopeptidase n=4 T... 375 e-103 UniRef50_C7N1K1 Ribosomal-protein-alanine acetyltransferase n=1 ... 375 e-102 UniRef50_D1IZQ0 Whole genome shotgun sequence of line PN40024, s... 374 e-102 UniRef50_A8GM49 Probable O-sialoglycoprotein endopeptidase n=15 ... 374 e-102 UniRef50_Q8DLI9 Probable O-sialoglycoprotein endopeptidase n=12 ... 374 e-102 UniRef50_D0RQS5 Putative glycoprotease GCP n=1 Tax=alpha proteob... 372 e-101 UniRef50_Q0SM86 Probable O-sialoglycoprotein endopeptidase n=18 ... 370 e-101 UniRef50_Q3YS67 Probable O-sialoglycoprotein endopeptidase n=24 ... 370 e-101 UniRef50_A4EBV8 Putative uncharacterized protein n=5 Tax=Bacteri... 369 e-101 UniRef50_B9JCG8 Probable O-sialoglycoprotein endopeptidase n=86 ... 368 e-100 UniRef50_B1GZV6 Probable O-sialoglycoprotein endopeptidase n=1 T... 367 e-100 UniRef50_Q2GEG6 Probable O-sialoglycoprotein endopeptidase n=2 T... 367 e-100 UniRef50_B2GAG0 Probable O-sialoglycoprotein endopeptidase n=56 ... 367 e-100 UniRef50_B3DVR7 Metal-dependent protease with possible chaperone... 365 1e-99 UniRef50_C7ND80 Metalloendopeptidase, glycoprotease family n=3 T... 364 2e-99 UniRef50_C1SJZ8 Metalloendopeptidase, putative, glycoprotease fa... 364 3e-99 UniRef50_B1V8Z6 Probable O-sialoglycoprotein endopeptidase n=6 T... 364 3e-99 UniRef50_D0N6Q4 O-sialoglycoprotein endopeptidase, putative n=1 ... 363 7e-99 UniRef50_B9XP92 Metalloendopeptidase, glycoprotease family n=1 T... 362 1e-98 UniRef50_A6DFV1 Metalloendopeptidase, putative, glycoprotease fa... 361 1e-98 UniRef50_C7H0S4 Putative glycoprotease GCP n=1 Tax=Eubacterium s... 361 2e-98 UniRef50_B2UQZ0 Metalloendopeptidase, glycoprotease family n=3 T... 361 3e-98 UniRef50_Q6MD07 Probable O-sialoglycoprotein endopeptidase n=2 T... 361 3e-98 UniRef50_Q5FLZ3 Probable O-sialoglycoprotein endopeptidase n=10 ... 361 3e-98 UniRef50_Q045T6 Probable O-sialoglycoprotein endopeptidase n=433... 360 4e-98 UniRef50_C7MKR9 Ribosomal-protein-alanine acetyltransferase n=10... 360 4e-98 UniRef50_D1AVQ5 Metalloendopeptidase, glycoprotease family n=1 T... 360 5e-98 UniRef50_B3R0M3 Probable O-sialoglycoprotein endopeptidase n=2 T... 359 5e-98 UniRef50_B0VHD4 Putative metalloendopeptidase, , glycoprotease f... 359 1e-97 UniRef50_B7CBT6 Putative uncharacterized protein n=1 Tax=Eubacte... 358 2e-97 UniRef50_B5RQA5 Probable O-sialoglycoprotein endopeptidase n=4 T... 357 3e-97 UniRef50_D0WGH2 O-sialoglycoprotein endopeptidase n=1 Tax=Slacki... 357 3e-97 UniRef50_C8W929 Metalloendopeptidase, glycoprotease family n=2 T... 357 3e-97 UniRef50_C0Q8X7 Probable O-sialoglycoprotein endopeptidase n=2 T... 357 4e-97 UniRef50_Q2JXG9 Probable O-sialoglycoprotein endopeptidase n=31 ... 355 1e-96 UniRef50_Q47LN7 Probable O-sialoglycoprotein endopeptidase n=58 ... 355 2e-96 UniRef50_C7LR95 Metalloendopeptidase, glycoprotease family n=1 T... 354 2e-96 UniRef50_D1N4S8 Metalloendopeptidase, glycoprotease family n=1 T... 354 3e-96 UniRef50_Q058D1 Probable O-sialoglycoprotein endopeptidase n=1 T... 353 4e-96 UniRef50_B1XJF0 Probable O-sialoglycoprotein endopeptidase n=1 T... 353 8e-96 UniRef50_C5ZWF6 Metal-dependent protease n=2 Tax=Helicobacter ca... 351 2e-95 UniRef50_Q127W3 Probable O-sialoglycoprotein endopeptidase n=4 T... 350 3e-95 UniRef50_Q7UM42 Probable O-sialoglycoprotein endopeptidase n=5 T... 350 4e-95 UniRef50_B4U8B7 Metalloendopeptidase, glycoprotease family n=1 T... 350 4e-95 UniRef50_B6JAE9 Probable O-sialoglycoprotein endopeptidase n=5 T... 348 1e-94 UniRef50_C1A601 Probable O-sialoglycoprotein endopeptidase n=1 T... 348 1e-94 UniRef50_A0JZ01 Probable O-sialoglycoprotein endopeptidase n=98 ... 346 5e-94 UniRef50_C9RIN4 Metalloendopeptidase, glycoprotease family n=1 T... 346 6e-94 UniRef50_C0QY51 Probable O-sialoglycoprotein endopeptidase n=2 T... 346 7e-94 UniRef50_A5GMV4 Probable O-sialoglycoprotein endopeptidase n=17 ... 345 1e-93 UniRef50_B2S3R9 Probable O-sialoglycoprotein endopeptidase n=4 T... 345 1e-93 UniRef50_A9FDL0 Probable O-sialoglycoprotein endopeptidase n=5 T... 345 1e-93 UniRef50_B2KE20 Metalloendopeptidase, glycoprotease family n=1 T... 344 3e-93 UniRef50_C8WN77 Metalloendopeptidase, glycoprotease family n=3 T... 343 4e-93 UniRef50_Q0ATQ2 Probable O-sialoglycoprotein endopeptidase n=44 ... 343 6e-93 UniRef50_Q254Q0 Probable O-sialoglycoprotein endopeptidase n=6 T... 342 9e-93 UniRef50_A1BJ68 Probable O-sialoglycoprotein endopeptidase n=12 ... 340 5e-92 UniRef50_A1R8N0 Probable O-sialoglycoprotein endopeptidase n=12 ... 339 9e-92 UniRef50_B5ZLG0 Metalloendopeptidase, glycoprotease family n=11 ... 339 1e-91 UniRef50_UPI0000D561DB PREDICTED: similar to AGAP005215-PA n=1 T... 338 2e-91 UniRef50_Q3SVF4 Probable O-sialoglycoprotein endopeptidase n=10 ... 338 3e-91 UniRef50_Q2SR45 Probable O-sialoglycoprotein endopeptidase n=5 T... 337 3e-91 UniRef50_B3MQN2 GF20469 n=4 Tax=Drosophila RepID=B3MQN2_DROAN 334 3e-90 UniRef50_D1B582 Metalloendopeptidase, glycoprotease family n=5 T... 332 1e-89 UniRef50_A3EUW9 O-sialoglycoprotein endopeptidase n=3 Tax=Leptos... 331 2e-89 UniRef50_Q04RH4 Probable O-sialoglycoprotein endopeptidase n=6 T... 331 2e-89 UniRef50_UPI0001C42124 glycoprotease M22 family n=1 Tax=Methanob... 331 2e-89 UniRef50_A4RXP4 Predicted protein n=6 Tax=Eukaryota RepID=A4RXP4... 331 3e-89 UniRef50_Q30ZN1 Probable O-sialoglycoprotein endopeptidase n=12 ... 331 3e-89 UniRef50_B0D096 Predicted protein n=2 Tax=Agaricales RepID=B0D09... 330 4e-89 UniRef50_A5UMH5 Putative O-sialoglycoprotein endopeptidase n=5 T... 329 6e-89 UniRef50_B3RQR7 Putative uncharacterized protein n=1 Tax=Trichop... 329 1e-88 UniRef50_B8LEI0 Predicted protein (Fragment) n=1 Tax=Thalassiosi... 328 1e-88 UniRef50_Q54EW4 Putative uncharacterized protein n=1 Tax=Dictyos... 327 3e-88 UniRef50_Q1IUF1 Probable O-sialoglycoprotein endopeptidase n=2 T... 326 7e-88 UniRef50_Q29HY2 GA12844 n=3 Tax=Sophophora RepID=Q29HY2_DROPS 326 8e-88 UniRef50_B8PI87 Predicted protein n=2 Tax=Postia placenta Mad-69... 325 1e-87 UniRef50_A7H0K1 Probable O-sialoglycoprotein endopeptidase n=26 ... 325 1e-87 UniRef50_Q17CG3 O-sialoglycoprotein endopeptidase n=2 Tax=Culici... 323 4e-87 UniRef50_Q9VWD6 Probable O-sialoglycoprotein endopeptidase 2 n=6... 323 4e-87 UniRef50_Q4A734 Probable O-sialoglycoprotein endopeptidase n=1 T... 322 8e-87 UniRef50_B0B9U7 Probable O-sialoglycoprotein endopeptidase n=6 T... 322 1e-86 UniRef50_UPI0000F51796 O-sialoglycoprotein endopeptidase/protein... 322 1e-86 UniRef50_Q9H4B0 Probable O-sialoglycoprotein endopeptidase 2 n=3... 322 1e-86 UniRef50_Q6L243 Putative O-sialoglycoprotein endopeptidase n=3 T... 322 1e-86 UniRef50_A6Q6J3 Probable O-sialoglycoprotein endopeptidase n=2 T... 321 2e-86 UniRef50_D2LQ34 Metalloendopeptidase, glycoprotease family n=1 T... 320 5e-86 UniRef50_Q0BPC9 Probable O-sialoglycoprotein endopeptidase n=14 ... 318 2e-85 UniRef50_C2KP25 O-sialoglycoprotein endopeptidase n=3 Tax=Mobilu... 317 3e-85 UniRef50_Q5ZZQ1 Probable O-sialoglycoprotein endopeptidase n=8 T... 316 9e-85 UniRef50_UPI000058820F PREDICTED: hypothetical protein n=2 Tax=S... 315 2e-84 UniRef50_Q46FS9 Putative O-sialoglycoprotein endopeptidase n=17 ... 315 2e-84 UniRef50_B6JWU0 Glycoprotease pgp1 n=1 Tax=Schizosaccharomyces j... 313 6e-84 UniRef50_UPI0001979AA5 putative DNA-binding/iron metalloprotein/... 313 7e-84 UniRef50_Q17Z01 Probable O-sialoglycoprotein endopeptidase n=13 ... 312 7e-84 UniRef50_B3PND6 Probable O-sialoglycoprotein endopeptidase n=2 T... 310 6e-83 UniRef50_UPI000180B634 PREDICTED: similar to Probable O-sialogly... 309 1e-82 UniRef50_O94710 Glycoprotease pgp1, mitochondrial n=1 Tax=Schizo... 308 1e-82 UniRef50_UPI000186D055 conserved hypothetical protein n=1 Tax=Pe... 308 2e-82 UniRef50_D2L1E2 Metalloendopeptidase, glycoprotease family n=1 T... 307 3e-82 UniRef50_Q6C9V8 YALI0D07920p n=1 Tax=Yarrowia lipolytica RepID=Q... 306 5e-82 UniRef50_B1ZYF9 Metalloendopeptidase, glycoprotease family n=3 T... 305 1e-81 UniRef50_C1F9R2 Metalloendopeptidase, glycoprotease family n=1 T... 305 2e-81 UniRef50_C7M316 Metalloendopeptidase, glycoprotease family n=1 T... 303 6e-81 UniRef50_A6VJ51 Putative O-sialoglycoprotein endopeptidase n=26 ... 302 2e-80 UniRef50_Q9NPF4 Probable O-sialoglycoprotein endopeptidase n=81 ... 301 2e-80 UniRef50_D0JBS4 Glycoprotease M22 family domain-containing prote... 297 4e-79 UniRef50_C4XSD3 Probable O-sialoglycoprotein endopeptidase n=2 T... 296 6e-79 UniRef50_C3XEQ4 O-sialoglycoprotein endopeptidase n=1 Tax=Helico... 296 6e-79 UniRef50_C4QZU9 Putative metalloprotease, similar to O-sialoglyc... 296 8e-79 UniRef50_A9WHP1 Metalloendopeptidase, glycoprotease family n=4 T... 295 2e-78 UniRef50_P43122 Putative protease QRI7 n=12 Tax=Saccharomycetace... 292 1e-77 UniRef50_Q74M58 Putative O-sialoglycoprotein endopeptidase n=1 T... 292 1e-77 UniRef50_Q4UA14 Glycoprotein endopeptidase, putative n=3 Tax=Pir... 292 2e-77 UniRef50_A4VEZ5 O-sialoglycoprotein endopeptidase n=1 Tax=Tetrah... 291 2e-77 UniRef50_A2QMR2 Function: O-sialoglycoprotein endopeptidase is a... 288 2e-76 UniRef50_Q1IZH8 Probable O-sialoglycoprotein endopeptidase n=4 T... 288 2e-76 UniRef50_Q83I95 Probable O-sialoglycoprotein endopeptidase n=2 T... 287 4e-76 UniRef50_B5Y892 O-sialoglycoprotein endopeptidase n=1 Tax=Coprot... 287 4e-76 UniRef50_UPI0000DB7930 PREDICTED: similar to O-sialoglycoprotein... 287 4e-76 UniRef50_UPI000023E24C hypothetical protein FG06887.1 n=1 Tax=Gi... 287 4e-76 UniRef50_D2RYV2 Metalloendopeptidase, glycoprotease family n=1 T... 286 8e-76 UniRef50_P75055 Probable O-sialoglycoprotein endopeptidase n=2 T... 285 2e-75 UniRef50_C4PYC5 Mername-AA018 peptidase (M22 family) n=1 Tax=Sch... 285 2e-75 UniRef50_B6GZQ3 Pc12g05880 protein n=9 Tax=Trichocomaceae RepID=... 285 2e-75 UniRef50_Q8EUQ9 Probable O-sialoglycoprotein endopeptidase n=1 T... 284 3e-75 UniRef50_B7XIP4 O-sialoglycoprotein endopeptidase n=2 Tax=Eukary... 282 9e-75 UniRef50_Q4PGZ6 Putative uncharacterized protein n=2 Tax=Ustilag... 282 1e-74 UniRef50_B1AJ51 Probable O-sialoglycoprotein endopeptidase n=15 ... 282 2e-74 UniRef50_Q93170 Protein C01G10.10, confirmed by transcript evide... 278 2e-73 UniRef50_Q7NB15 Probable O-sialoglycoprotein endopeptidase n=1 T... 277 6e-73 UniRef50_A2BJY9 Putative O-sialoglycoprotein endopeptidase n=22 ... 276 7e-73 UniRef50_Q6L4N8 Os05g0194600 protein n=21 Tax=Eukaryota RepID=Q6... 275 1e-72 UniRef50_Q18KI0 Putative O-sialoglycoprotein endopeptidase n=14 ... 273 7e-72 UniRef50_Q2HG58 Putative uncharacterized protein n=1 Tax=Chaetom... 272 8e-72 UniRef50_B8MFK9 Glycoprotease family protein, putative n=5 Tax=L... 271 3e-71 UniRef50_A5DGU9 Putative uncharacterized protein n=2 Tax=Pichia ... 271 3e-71 UniRef50_A3CXS0 Putative O-sialoglycoprotein endopeptidase n=5 T... 270 4e-71 UniRef50_P36174 Putative O-sialoglycoprotein endopeptidase n=1 T... 268 2e-70 UniRef50_C8V9Q8 PROBABLE O-SIALOGLYCOPROTEIN ENDOPEPTIDASE (AFU_... 265 1e-69 UniRef50_Q2GXN6 Putative glycoprotein endopeptidase KAE1 n=18 Ta... 265 1e-69 UniRef50_A3MSX6 Putative O-sialoglycoprotein endopeptidase n=2 T... 265 2e-69 UniRef50_B9WFF4 Metalloprotease, putative n=8 Tax=Saccharomyceta... 264 2e-69 UniRef50_B7QJD9 O-sialoglycoprotein endopeptidase, putative n=3 ... 262 9e-69 UniRef50_C4Y0N8 Putative uncharacterized protein n=1 Tax=Clavisp... 262 2e-68 UniRef50_P36132 Putative glycoprotein endopeptidase KAE1 n=40 Ta... 256 8e-67 UniRef50_Q7SD85 Predicted protein n=2 Tax=Sordariaceae RepID=Q7S... 252 1e-65 UniRef50_Q4U8J6 Glycoprotease, putative n=2 Tax=Theileria RepID=... 252 1e-65 UniRef50_A8QDL6 Glycoprotease family protein n=1 Tax=Brugia mala... 247 5e-64 UniRef50_C5FT24 Glycoprotease family protein n=2 Tax=Onygenales ... 245 2e-63 UniRef50_C1GKA7 Glycoprotease pgp1 n=11 Tax=Onygenales RepID=C1G... 243 6e-63 UniRef50_A8BDD4 O-sialoglycoprotein endopeptidase n=2 Tax=Giardi... 240 4e-62 UniRef50_A6TR37 O-sialoglycoprotein endopeptidase n=1 Tax=Alkali... 239 9e-62 UniRef50_B2A533 O-sialoglycoprotein endopeptidase n=1 Tax=Natran... 239 1e-61 UniRef50_A8WMS3 Putative uncharacterized protein n=1 Tax=Caenorh... 238 2e-61 UniRef50_A8MFJ2 O-sialoglycoprotein endopeptidase n=1 Tax=Alkali... 237 4e-61 UniRef50_C7DHT9 Metalloendopeptidase, glycoprotease family n=1 T... 237 4e-61 UniRef50_UPI0000E8089C PREDICTED: similar to Osgepl1 protein n=1... 230 4e-59 UniRef50_C0ZC04 Peptidase M22 family protein n=1 Tax=Brevibacill... 229 9e-59 UniRef50_C0GE31 O-sialoglycoprotein endopeptidase n=1 Tax=Dethio... 228 3e-58 UniRef50_D1BMJ2 Metal-dependent protease with possible chaperone... 227 3e-58 UniRef50_A4RG35 Putative uncharacterized protein n=1 Tax=Magnapo... 225 2e-57 UniRef50_D2RJI3 Peptidase M22 glycoprotease n=2 Tax=Acidaminococ... 223 6e-57 UniRef50_A6S1G0 Putative uncharacterized protein n=1 Tax=Botryot... 222 1e-56 UniRef50_Q2RIB0 O-sialoglycoprotein endopeptidase n=5 Tax=Clostr... 220 4e-56 UniRef50_A7APL5 Glycoprotease family protein n=1 Tax=Babesia bov... 220 5e-56 UniRef50_C5KYH6 Glycoprotein endopeptidase, putative n=4 Tax=Per... 220 8e-56 UniRef50_Q97ZY8 Putative O-sialoglycoprotein endopeptidase n=1 T... 216 1e-54 UniRef50_C9LLA9 Glycoprotease family protein n=1 Tax=Dialister i... 215 2e-54 UniRef50_A6NUZ4 Putative uncharacterized protein n=1 Tax=Bactero... 213 8e-54 UniRef50_Q3AAM2 Glycoprotease family protein n=1 Tax=Carboxydoth... 211 4e-53 UniRef50_C8WXH0 Peptidase M22 glycoprotease n=2 Tax=Alicyclobaci... 210 4e-53 UniRef50_C7H6X1 Glycoprotease family protein n=2 Tax=Faecalibact... 207 5e-52 UniRef50_D1PKV9 Glycoprotease family protein n=1 Tax=Subdoligran... 207 6e-52 UniRef50_B0AAV1 Putative uncharacterized protein n=2 Tax=Clostri... 206 1e-51 UniRef50_Q5KFY5 Mitochondrion protein, putative n=2 Tax=Filobasi... 205 2e-51 UniRef50_Q0AZF6 Putative uncharacterized protein n=1 Tax=Syntrop... 205 3e-51 UniRef50_A7VX43 Putative uncharacterized protein n=4 Tax=Clostri... 204 4e-51 UniRef50_A0RY43 O-sialoglycoprotein endopeptidase n=4 Tax=Thauma... 196 1e-48 UniRef50_Q18B67 Probable O-sialoglycoprotein endopeptidase n=6 T... 195 1e-48 UniRef50_D2VC41 Predicted protein n=1 Tax=Naegleria gruberi RepI... 194 4e-48 UniRef50_UPI0000DD8AA6 Os01g0295900 n=1 Tax=Oryza sativa Japonic... 191 3e-47 UniRef50_B0TEI7 O-sialoglycoprotein endopeptidase, putative n=1 ... 190 4e-47 UniRef50_B2WBX5 Glycoprotease pgp1, mitochondrial n=1 Tax=Pyreno... 184 3e-45 UniRef50_UPI000187E9E4 hypothetical protein MPER_08009 n=1 Tax=M... 170 5e-41 UniRef50_D2EF31 O-sialoglycoprotein endopeptidase (Fragment) n=1... 160 6e-38 UniRef50_Q8IJ99 Glycoprotease, putative n=5 Tax=Plasmodium RepID... 155 3e-36 UniRef50_C1BYL4 Probable O-sialoglycoprotein endopeptidase 2 n=1... 151 4e-35 UniRef50_C5KJ57 Putative uncharacterized protein (Fragment) n=1 ... 151 4e-35 UniRef50_D1IQV9 Whole genome shotgun sequence of line PN40024, s... 151 5e-35 UniRef50_C9SIA9 Glycoprotease pgp1 n=2 Tax=Sordariomycetes RepID... 148 3e-34 UniRef50_A5KDZ1 O-sialoglycoprotein endopeptidase, putative n=1 ... 144 5e-33 UniRef50_A9UYP5 Predicted protein n=1 Tax=Monosiga brevicollis R... 141 2e-32 UniRef50_Q0V4Z5 Putative uncharacterized protein n=1 Tax=Phaeosp... 141 3e-32 UniRef50_B2AYU1 Predicted CDS Pa_1_12230 (Fragment) n=1 Tax=Podo... 132 2e-29 UniRef50_C6WZF1 Putative glycoprotease family exported protein n... 131 5e-29 UniRef50_C6XTX3 Peptidase M22 glycoprotease n=3 Tax=Sphingobacte... 127 5e-28 UniRef50_C0YUE5 Possible M22 family non-peptidase n=1 Tax=Chryse... 127 6e-28 UniRef50_P76256 M22 peptidase homolog yeaZ n=236 Tax=Gammaproteo... 127 7e-28 UniRef50_B2A5P8 Peptidase M22 glycoprotease n=1 Tax=Natranaerobi... 126 8e-28 UniRef50_A3HX68 Putative uncharacterized protein n=1 Tax=Algorip... 126 1e-27 UniRef50_B3PIE3 Glycoprotease family protein n=1 Tax=Cellvibrio ... 125 2e-27 UniRef50_B8I821 Peptidase M22 glycoprotease n=1 Tax=Clostridium ... 125 2e-27 UniRef50_Q11YX3 Probable peptidase M22, glycoprotease family n=1... 125 2e-27 UniRef50_A1SX27 Peptidase M22, glycoprotease n=3 Tax=Gammaproteo... 124 3e-27 UniRef50_A8SIF9 Putative uncharacterized protein n=1 Tax=Parvimo... 124 3e-27 UniRef50_A1ZHG0 Glycoprotease family n=1 Tax=Microscilla marina ... 123 6e-27 UniRef50_A6TLG1 Peptidase M22, glycoprotease n=2 Tax=Alkaliphilu... 123 7e-27 UniRef50_Q9U0J7 Peptidase, M22 family, putative n=3 Tax=Plasmodi... 123 7e-27 UniRef50_C9CSP7 Peptidase M22, glycoprotease n=3 Tax=Rhodobacter... 123 9e-27 UniRef50_B8D0Y9 O-sialoglycoprotein endopeptidase n=1 Tax=Haloth... 122 1e-26 UniRef50_UPI0001BC4F5F O-sialoglycoprotein endopeptidase n=1 Tax... 122 2e-26 UniRef50_B3ERC8 Putative uncharacterized protein n=1 Tax=Candida... 122 2e-26 UniRef50_A5FJB4 Peptidase family M22-like protein n=10 Tax=Flavo... 122 2e-26 UniRef50_A6ECY4 Putative glycoprotease family exported protein n... 122 2e-26 UniRef50_A0LXU5 Peptidase, family M22 n=5 Tax=Bacteroidetes RepI... 121 3e-26 UniRef50_A8U9X9 Glycoprotein endopeptidase n=1 Tax=Carnobacteriu... 121 3e-26 UniRef50_C0GCU7 Peptidase M22 glycoprotease n=1 Tax=Dethiobacter... 121 4e-26 UniRef50_C3JGW8 Universal bacterial protein YeaZ n=2 Tax=Rhodoco... 121 5e-26 UniRef50_C4LDA2 Peptidase M22 glycoprotease n=1 Tax=Tolumonas au... 120 7e-26 UniRef50_Q5E439 Predicted peptidase n=5 Tax=Vibrionaceae RepID=Q... 120 8e-26 UniRef50_P43990 Probable M22 peptidase homolog HI0388 n=24 Tax=P... 119 9e-26 Sequences not found previously or not previously below threshold: >UniRef50_Q7MNZ9 Probable O-sialoglycoprotein endopeptidase n=19 Tax=Gammaproteobacteria RepID=GCP_VIBVY Length = 339 Score = 432 bits (1111), Expect = e-120, Method: Composition-based stats. Identities = 259/334 (77%), Positives = 294/334 (88%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIETSCDETGIAIYDDEKGLLA++LYSQ+KLHADYGGVVPELASRDHV+KT+PLI+ Sbjct: 1 MRILGIETSCDETGIAIYDDEKGLLAHKLYSQIKLHADYGGVVPELASRDHVKKTIPLIK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ LTAKDID VAYTAGPGLVGALLVGAT+GRSLA+AW VPA+PVHHMEGHLLAPM Sbjct: 61 EALKEANLTAKDIDGVAYTAGPGLVGALLVGATIGRSLAYAWGVPAVPVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LEDNPP FPFVA+LVSGGH+ ++ V GIG+Y++LGESIDDAAGEAFDKTAKL+GLDYPGG Sbjct: 121 LEDNPPPFPFVAVLVSGGHSMMVEVKGIGEYKILGESIDDAAGEAFDKTAKLMGLDYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 PLLSK+A +GT GRF FPRPMT+ PGLD SFSGLKTF ANTI NG D+QTRADIA AFE Sbjct: 181 PLLSKLAEKGTPGRFKFPRPMTNVPGLDMSFSGLKTFTANTIAANGDDEQTRADIAYAFE 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +AV TL IKCKRAL+QTG KR+V+AGGVSANR LRA+L ++ K G+V+Y R EFCTD Sbjct: 241 EAVCATLAIKCKRALEQTGMKRIVIAGGVSANRRLRAELEKLAHKVGGDVYYPRTEFCTD 300 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 NGAMIAYAGM R K +DL V RPRWP+ +L Sbjct: 301 NGAMIAYAGMQRLKNNEVSDLAVEARPRWPIDQL 334 >UniRef50_B0TIN7 Probable O-sialoglycoprotein endopeptidase n=130 Tax=Gammaproteobacteria RepID=GCP_SHEHH Length = 338 Score = 425 bits (1094), Expect = e-118, Method: Composition-based stats. Identities = 247/335 (73%), Positives = 284/335 (84%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGIETSCDETGIA+YDDEKGLL++ LYSQVKLHADYGGVVPELASRDHVRK VPLI+ Sbjct: 1 MRVLGIETSCDETGIAVYDDEKGLLSHALYSQVKLHADYGGVVPELASRDHVRKIVPLIR 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ +T +D+D +AYT GPGL+GALLVGA VGR+LAF+WD PAI VHHMEGHLLAPM Sbjct: 61 QALADADMTIEDLDGIAYTKGPGLIGALLVGACVGRALAFSWDKPAIGVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LED+ PEFPF+ALLVSGGH+ L+ V GIG+Y +LGES+DDAAGEAFDKTAKL+GLDYPGG Sbjct: 121 LEDDVPEFPFLALLVSGGHSMLVGVEGIGRYTVLGESVDDAAGEAFDKTAKLMGLDYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P LSK+AA+G + FPRPMTD+PGL+ SFSGLKTFAANTI D+QTRA+IA AFE Sbjct: 181 PRLSKLAAKGVPNSYRFPRPMTDKPGLNMSFSGLKTFAANTIAAEPKDEQTRANIACAFE 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +AVVDTL IKCKRAL QTG+K LV+AGGVSAN LRA L+EMM+ G+V+Y R EFCTD Sbjct: 241 EAVVDTLGIKCKRALKQTGYKNLVIAGGVSANTRLRASLSEMMQGLGGKVYYPRGEFCTD 300 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 NGAMIAYAG+ R KAG DL V +PRWPL L Sbjct: 301 NGAMIAYAGLQRLKAGQVEDLAVKGQPRWPLDTLE 335 >UniRef50_P36175 O-sialoglycoprotein endopeptidase n=366 Tax=cellular organisms RepID=GCP_PASHA Length = 325 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 245/319 (76%), Positives = 279/319 (87%), Gaps = 5/319 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIETSCDETG+AIYD++KGL+ANQLYSQ+ +HADYGGVVPELASRDH+RKT+PLIQ Sbjct: 1 MRILGIETSCDETGVAIYDEDKGLVANQLYSQIDMHADYGGVVPELASRDHIRKTLPLIQ 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ L DID +AYTAGPGLVGALLVG+T+ RSLA+AW+VPA+ VHHMEGHLLAPM Sbjct: 61 EALKEANLQPSDIDGIAYTAGPGLVGALLVGSTIARSLAYAWNVPALGVHHMEGHLLAPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE+N PEFPFVALL+SGGHTQL+ V G+GQYELLGESIDDAAGEAFDKT KLLGLDYP G Sbjct: 121 LEENAPEFPFVALLISGGHTQLVKVDGVGQYELLGESIDDAAGEAFDKTGKLLGLDYPAG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI-----RDNGTDDQTRADI 235 +SK+A GT RF FPRPMTDRPGLDFSFSGLKTFAANTI + D+QT+ DI Sbjct: 181 VAMSKLAESGTPNRFKFPRPMTDRPGLDFSFSGLKTFAANTIKANLNENGELDEQTKCDI 240 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A AF+ AVVDT++IKCKRAL+QTG+KRLVMAGGVSAN+ LRA LAEMMKK +GEVFY RP Sbjct: 241 AHAFQQAVVDTILIKCKRALEQTGYKRLVMAGGVSANKQLRADLAEMMKKLKGEVFYPRP 300 Query: 296 EFCTDNGAMIAYAGMVRFK 314 +FCTDNGAMIAY G +R K Sbjct: 301 QFCTDNGAMIAYTGFLRLK 319 >UniRef50_C0QTG9 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Bacteria RepID=GCP_PERMH Length = 344 Score = 409 bits (1051), Expect = e-113, Method: Composition-based stats. Identities = 141/338 (41%), Positives = 218/338 (64%), Gaps = 9/338 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCD+T +++YD E+GLL+N + SQ+K+H ++GGV P+LA+R+H + +P++ Sbjct: 1 MKILGIETSCDDTAVSVYDSEEGLLSNVVSSQIKMHEEWGGVYPDLAAREHTKNIIPVLD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ + KDID +A T PGL+ +L++G +V ++L++ + P IPVHH+E H+ A Sbjct: 61 RALKEASVNIKDIDGIAVTVAPGLIVSLVIGISVAKTLSWIYRKPLIPVHHIEAHIFASF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + ++PF+AL+VSGGHT+L + G Y LG ++DDA GEA+DK A++LGL YPGG Sbjct: 121 ITE-KIDYPFIALVVSGGHTELYLIKGFEDYRYLGGTLDDAVGEAYDKVARMLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG---LDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 P++ +++ +G PRP+ + G +FSFSGLKT I+ + DIAR Sbjct: 180 PVIDRLSKEGEDT-VKLPRPLINDRGKNRFNFSFSGLKTAVLREIQKG---VYRKEDIAR 235 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 +F++A D L+ K A+ + K +V+AGGVSAN LR K E + + ++ Sbjct: 236 SFQEAATDVLLAKTIDAMKEFNIKNVVIAGGVSANSRLREKFKEAEENHGIKAYFPPLYL 295 Query: 298 CTDNGAMIAYAGMVRFK-AGATADLGVSVRPRWPLAEL 334 CTDNGAM+A+ G RFK +G T D + R + + Sbjct: 296 CTDNGAMVAFTGYKRFKESGTTVDYSFEGKARLRMDKF 333 >UniRef50_C4Z311 O-sialoglycoprotein endopeptidase n=13 Tax=Bacteria RepID=C4Z311_EUBE2 Length = 352 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 146/334 (43%), Positives = 206/334 (61%), Gaps = 2/334 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET A+ + + +L+N + +Q+ +H +YGGVVPE+ASR H+ P+I+ Sbjct: 17 LILAIESSCDETAAAVVKNGREVLSNVINTQIAIHTEYGGVVPEIASRKHIENINPVIRK 76 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G+T DIDA+ T GPGLVGALLVG +++AFA + P + VHH+EGH+ A + Sbjct: 77 ALEDAGVTLDDIDAIGVTYGPGLVGALLVGVAEAKAIAFAKNKPLVGVHHIEGHISANYV 136 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E+ E PFVAL+VSGGHT L+ V G+YE++G + DDAAGEAFDK A+ +GL YPGGP Sbjct: 137 ENKELEPPFVALVVSGGHTHLVKVNDYGEYEIVGRTRDDAAGEAFDKVARAIGLGYPGGP 196 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAF 239 + K+A +G FPR D DFSFSG+K+ N I + + RAD+A +F Sbjct: 197 KIDKLAKEGNPDAIEFPRAHVDDAPYDFSFSGIKSAVLNYINSANMQGKEINRADVAASF 256 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + AVVD L+ + R + G +L +AGGV++N LRA + E K + P CT Sbjct: 257 QKAVVDALVSRAVRLAKECGMDKLAIAGGVASNSALRAAIQEACAKNNIGFYSPSPILCT 316 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 DN AMI A + G ++ P L E Sbjct: 317 DNAAMIGAAAYYEYIKGVRHGYDLNAVPNLKLGE 350 >UniRef50_Q18CP0 Probable O-sialoglycoprotein endopeptidase n=22 Tax=Bacteria RepID=GCP_CLOD6 Length = 338 Score = 404 bits (1038), Expect = e-111, Method: Composition-based stats. Identities = 134/334 (40%), Positives = 204/334 (61%), Gaps = 2/334 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 L IE+SCDET ++ + + +L+N + +Q++ H +GGVVPE+ASR HV ++Q Sbjct: 5 ITLAIESSCDETAASVLKNGREVLSNIISTQIETHKKFGGVVPEVASRKHVENIDIVVQE 64 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL ++ + DID +A T GPGLVGALLVG + ++LA+ ++P + V+H+EGHL A + Sbjct: 65 ALDKANIGFNDIDHIAVTYGPGLVGALLVGLSYAKALAYTLNIPLVGVNHIEGHLSANYI 124 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E + PF+ L+VSGGHT L+ V G+YE+LG++ DDA+GEAFDK ++ + L YPGGP Sbjct: 125 EHKDLKPPFITLIVSGGHTHLVEVKDYGKYEILGKTRDDASGEAFDKISRAMNLGYPGGP 184 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARAF 239 ++ +A G FPR + DFSFSGLK+ N + + ++ D+A +F Sbjct: 185 IIDNLAKNGNKHAIEFPRAYLEEDSYDFSFSGLKSSVLNYLNGKRMKNEEIVVEDVAASF 244 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ++AVV+ L K +A+ G+ + ++GGV++N LRAK+ E+ K V Y CT Sbjct: 245 QEAVVEVLSTKALKAVKDKGYNIITLSGGVASNSGLRAKITELAKDNGITVKYPPLILCT 304 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 DN AMI AG F G T D+ ++ P + + Sbjct: 305 DNAAMIGCAGYYNFINGKTHDMSLNAVPNLKINQ 338 >UniRef50_Q8D283 Probable O-sialoglycoprotein endopeptidase n=11 Tax=Gammaproteobacteria RepID=GCP_WIGBR Length = 340 Score = 399 bits (1026), Expect = e-110, Method: Composition-based stats. Identities = 179/336 (53%), Positives = 247/336 (73%), Gaps = 1/336 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETSCD+TG AIYD EKGL+ +++ SQ +H+ YGGVVPE +S+ H++ PL++ Sbjct: 1 MLILGIETSCDDTGAAIYDLEKGLIIHKVISQNNIHSKYGGVVPEKSSKYHLKNIQPLVE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 K S ++ ID +AYTAGPGLVG+L++GAT SLA+ +P+I ++H+EGHLL PM Sbjct: 61 NIFKNSNISLSKIDGIAYTAGPGLVGSLIIGATFACSLAYTLQIPSIAINHLEGHLLTPM 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 ++ P+FPF+ L++SG HTQ + IG+Y+++G+ +DDA GEAFDK AKLLG+ YPGG Sbjct: 121 IKYKRPKFPFLGLIISGAHTQFVLAEDIGKYKIIGDCLDDALGEAFDKVAKLLGIKYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-DDQTRADIARAF 239 LS +A QG + RF FPRPMT +PG++FSFSGLKT+A N + D+QT+ DIARAF Sbjct: 181 KKLSIIAKQGNSKRFFFPRPMTKKPGINFSFSGLKTYAKNLVSSFSKIDNQTKCDIARAF 240 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ED+++DT++IKCKRALD T K L+++GGVSAN LR L +MK R G++F+++ CT Sbjct: 241 EDSIIDTVIIKCKRALDITNSKILLISGGVSANEPLRKNLRNLMKSRNGKLFFSKKSLCT 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 DN AMIAY G +RFK T DL V + P+W L ++ Sbjct: 301 DNAAMIAYVGSIRFKKNKTKDLSVLINPKWSLEDIS 336 >UniRef50_A5G3X1 Probable O-sialoglycoprotein endopeptidase n=20 Tax=Bacteria RepID=GCP_GEOUR Length = 343 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. Identities = 150/338 (44%), Positives = 212/338 (62%), Gaps = 3/338 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IE+SCDET A+ D + +L+N + SQ+ +HA YGGVVPE+ASR H+ +I+ Sbjct: 1 MLLLAIESSCDETAAAVVRDGRIILSNIVASQISVHAGYGGVVPEIASRKHLETISTVIE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ +G++ D+D +A T GPGL GALLVG + +++A+A VP V+H+E H+LA Sbjct: 61 EALQAAGVSLTDVDGIAVTQGPGLAGALLVGISTAKAMAYALGVPIAGVNHIESHILAIF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE EFPFVAL VSGGHT L V +G+Y+ LG+++DDAAGEAFDK AKLLGL YPGG Sbjct: 121 LE-RSIEFPFVALAVSGGHTHLYLVEAVGRYKTLGQTLDDAAGEAFDKVAKLLGLPYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG--TDDQTRADIARA 238 L+ ++AA+G FPRP+ +FSFSGLKT N ++ N D + D+ + Sbjct: 180 ALIDRLAAEGDPEAIRFPRPLMRDESFNFSFSGLKTSVLNYLQKNPAAADGRALNDLCAS 239 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+ AV D L+ K A+ TG KR+V+AGGV+ N LR +++ + + + E+ P C Sbjct: 240 FQAAVCDVLVSKTAAAVSATGIKRVVVAGGVACNNGLRREMSRLAELKGIELHIPSPLLC 299 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 +DN AMIA G + + P WPL + + Sbjct: 300 SDNAAMIAVPGDYYLSNNILSGFDIDALPVWPLDSIAS 337 >UniRef50_Q8RC98 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Bacteria RepID=GCP_THETN Length = 341 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 138/333 (41%), Positives = 202/333 (60%), Gaps = 3/333 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCDET + + K +L+N +YSQ+ +H YGGVVPE+ASR H+ +++ A Sbjct: 7 ILGIETSCDETAAGVVKNGKEVLSNVIYSQINVHKKYGGVVPEIASRKHIEAISFVVEEA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ L+ ++DA+A T GPGLVG LLVG + G++LA+A P I V+H++GH+ A + Sbjct: 67 LNEAKLSLDEVDAIAATYGPGLVGPLLVGLSYGKALAYAKGKPFIGVNHIDGHIAANYI- 125 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 PFV L+ SGGH+ ++ V G+YE++G+++DDAAGEAFDK A+ LGL YPGGP Sbjct: 126 GGNLTPPFVCLVASGGHSHIVYVKDYGEYEVMGKTLDDAAGEAFDKVARALGLGYPGGPA 185 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARAFE 240 + K A G FP+ + DFSFSG+KT N + + ++ D+A +F+ Sbjct: 186 IEKAAKLGNMEAIEFPKSFMEEGNFDFSFSGVKTAVLNYLNRQKQKGEEVNIYDVAASFQ 245 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+ L+ K A ++ +AGGV++N LR KL E KK V+Y +CTD Sbjct: 246 RNIVEVLVKKLVEAARFKNVSKVSIAGGVASNGFLRQKLEEDAKKFGLSVYYPEKIYCTD 305 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 NGAMIA A F G + + ++ P + E Sbjct: 306 NGAMIAAAAYYDFVKGKFSGMDLNAIPYLKIGE 338 >UniRef50_B2V910 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Aquificales RepID=GCP_SULSY Length = 337 Score = 396 bits (1017), Expect = e-109, Method: Composition-based stats. Identities = 147/338 (43%), Positives = 211/338 (62%), Gaps = 12/338 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCD+T IA+YD EKG+ +N + SQ+ +HA +GGV PE+A+R+H + +P++ Sbjct: 1 MVVLGIETSCDDTSIAVYDSEKGIPSNVVTSQL-IHAQFGGVYPEIAAREHTKNFLPVLD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ +T DIDA+A T PGL+ +L+ G + ++L+F+ P IPVHH+E H+ A Sbjct: 60 KALRDASITLSDIDAIATTFMPGLIVSLVAGVSGAKTLSFSLKKPLIPVHHIEAHIFANF 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + E+PF+AL+VSGGHT+LI V Y LG ++DDA GE +DK A+ LGL +PGG Sbjct: 120 I-TKEIEYPFLALVVSGGHTELILVKEFEDYIYLGGTLDDAVGEVYDKVARALGLGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDR--PGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 PL+ K+A +G FPRP+ + +FSFSGLK+ I + DI ++ Sbjct: 179 PLIDKLAKEGK-EAIKFPRPLLNDEENKYNFSFSGLKSAVIREINKG---IYKKEDITKS 234 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++AVVD L+ K A + G R+V+AGGVSAN LR E + + EV + C Sbjct: 235 FQNAVVDVLVKKTVLACKEFGINRVVVAGGVSANSQLR---EEFLNIKDLEVHFPPMHLC 291 Query: 299 TDNGAMIAYAGMVRFK-AGATADLGVSVRPRWPLAELP 335 TDNGAM+AY G RFK G + L + R + + P Sbjct: 292 TDNGAMVAYTGYKRFKEKGISVSLDFEAKARCRIDKFP 329 >UniRef50_Q2RGJ3 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Bacteria RepID=GCP_MOOTA Length = 342 Score = 395 bits (1016), Expect = e-109, Method: Composition-based stats. Identities = 145/333 (43%), Positives = 194/333 (58%), Gaps = 2/333 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D + AN + SQ+ +H +GGVVPE+ASR H+ VP++ Sbjct: 10 NILAIESSCDETAAAIVSDGTRVRANIIASQIAVHRRFGGVVPEIASRHHMENIVPVVSE 69 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL +GL D+DAVA T GPGLVGALLVG +SLA+A P I VHH+ GH+ A L Sbjct: 70 ALATAGLAFSDVDAVAVTYGPGLVGALLVGVAYAKSLAYALGKPLIGVHHLLGHIYAGFL 129 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 P V+L+VSGGHT L+ + +LG + DDAAGEAFDK A++LGL YPGGP Sbjct: 130 AYPGLPLPAVSLVVSGGHTNLVYLEDHTTRRILGSTRDDAAGEAFDKVARVLGLPYPGGP 189 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTRADIARAF 239 L K+A +G FPR + LDFSFSGLK+ N + + RAD+A +F Sbjct: 190 ELEKLAREGNPRAIPFPRAWLEENSLDFSFSGLKSAVINYLHHARQVGQEVNRADVAASF 249 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + AV + L+ K A + +++AGGV+AN LR +L ++ VF+ E CT Sbjct: 250 QAAVAEVLVTKTLLAATSYRARSILLAGGVAANSVLRRELRSAGEQAGLPVFFPPRELCT 309 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 DN AMI A ++ A L ++ P PL Sbjct: 310 DNAAMIGCAAYYQYLRRDFAPLSLNAIPDLPLN 342 >UniRef50_A7HLB0 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Thermotogaceae RepID=GCP_FERNB Length = 337 Score = 394 bits (1012), Expect = e-108, Method: Composition-based stats. Identities = 130/334 (38%), Positives = 199/334 (59%), Gaps = 8/334 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCDET +A+ +D ++AN +YSQ+++H +GGVVPE+A+R+H+++ L Sbjct: 1 MIVLGIETSCDETSVALVEDN-TVIANLVYSQIQIHKKFGGVVPEIAAREHLKRLPILFS 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + ++ + + ID +A T GPGL+GALLVG + + LA + P + ++H+ GH+ + Sbjct: 60 ELISQTNINIERIDGIAVTKGPGLIGALLVGVSFAKGLALRYKKPLVGINHIIGHVYSNY 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + P++ L+VSGGHT ++ V +LG S+DDA GEAFDK A+LLGL YPGG Sbjct: 120 LAYPDLKPPYIVLMVSGGHTLILKVEENNNVTILGRSVDDAVGEAFDKIARLLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT-----RADI 235 P + K++ G F FP+P P +FSFSGLKT I+ + D+ Sbjct: 180 PEIDKISKNGNPNAFNFPKPKMYDPDYNFSFSGLKTAVLYEIKRLTKSGYSENNLPIPDL 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A + ++ ++D L+ K +A K +V+AGGV+AN LR K+ + ++ + Sbjct: 240 AASAQEVMIDVLLHKVTKAARDNNLKNIVLAGGVAANSRLREKIRALSEEFN--FYIPPL 297 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 E+C+DN AMIA AG+ R K+G L P + Sbjct: 298 EYCSDNAAMIARAGLERIKSGENDGLNFEPVPNF 331 >UniRef50_D1B623 Metalloendopeptidase, glycoprotease family n=3 Tax=Synergistaceae RepID=D1B623_THEAS Length = 342 Score = 393 bits (1009), Expect = e-108, Method: Composition-based stats. Identities = 139/334 (41%), Positives = 209/334 (62%), Gaps = 5/334 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIE+SCD+T +A+ ++ + + A+ + SQV+ HA +GGVVPELASR H + L++ Sbjct: 8 LVLGIESSCDDTAVAVLEEPRRIRASLVMSQVEDHAPHGGVVPELASRRHQEAIMGLVRR 67 Query: 62 ALKESGLT--AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 L ++G++ + + +A TAGPGL+G+LLVG + L+ W+VP + V+HMEGHL A Sbjct: 68 CLWQAGVSNPMRQLSLIAVTAGPGLMGSLLVGVMAAKGLSQGWEVPIMGVNHMEGHLFAN 127 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 +L + PF+ L+VSGGHT++ V G Y LLG + DDA GEA+DK AK+LGL YPG Sbjct: 128 VLAHPDLKPPFLCLIVSGGHTEVHLVRSFGDYRLLGATRDDAVGEAYDKVAKMLGLGYPG 187 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 GP++ ++A +G R+ P P ++FSFSGLKT +R + + D+ +F Sbjct: 188 GPVIDRLAREGDPDRYQLPVPFKGSSQVEFSFSGLKTAVLWLVRR-EGEALSVPDLCASF 246 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR--GEVFYARPEF 297 + A V++L+ K K A++QTG + + ++GGV+ANR LR +L ++ V+ E Sbjct: 247 QRAAVESLVSKVKLAMNQTGVRTVAVSGGVAANRELRRRLEDLAGSSGGRVRVYLPPLEL 306 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 CTDN AM+A AG+ ++ G DL P W L Sbjct: 307 CTDNAAMVAAAGLWAYRRGVRDDLSFRADPSWEL 340 >UniRef50_C1TLM6 O-sialoglycoprotein endopeptidase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TLM6_9BACT Length = 336 Score = 393 bits (1009), Expect = e-108, Method: Composition-based stats. Identities = 133/334 (39%), Positives = 202/334 (60%), Gaps = 5/334 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 L IE+SCD+T +A+ D ++ +L++ + SQV+ HA +GGVVPE ASR H+ +PL+ Sbjct: 4 LTLAIESSCDDTAVAVIDGQRNVLSSTMSSQVESHAPFGGVVPEYASRMHLEAILPLVDR 63 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL E+ D+D +A TAGPGL+G+LLVG + LA AW P + V+H+EGH+ A ++ Sbjct: 64 ALAEADAKPSDLDLIAVTAGPGLMGSLLVGVMTAKGLAQAWGKPILGVNHLEGHVFANVV 123 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + PF+A++VSGGHT+++ V +G Y +LG + DDAAGEA+DK AKLLGL YPGGP Sbjct: 124 NHPDLDPPFIAMIVSGGHTEVVLVEDLGFYRILGGTKDDAAGEAYDKVAKLLGLAYPGGP 183 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA--DIARAF 239 ++ ++A G F FP P+ + FSFSGLKT + + + DI +F Sbjct: 184 IVDELAKDGDPQAFDFPVPLKRSDEISFSFSGLKTAVLWQVERIKKEGASLPVEDICASF 243 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + A V+ L+ K A+ +TG +++V++GGV+AN LR + + + +CT Sbjct: 244 QRAAVEALICKLDLAVQKTGVEKVVLSGGVAANSCLRDLVLNRGDWKG---YVPDMFYCT 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 DN MI AG + G + L ++ P W + + Sbjct: 301 DNAVMIGAAGYHGWMRGRRSGLDLAPSPSWSIMD 334 >UniRef50_Q4FNV6 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Alphaproteobacteria RepID=GCP_PELUB Length = 357 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 137/341 (40%), Positives = 212/341 (62%), Gaps = 12/341 (3%) Query: 2 RVLGIETSCDETGIAIYDDEK----GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 +LGIE+SCDET +I + + +L++ + SQV +H ++GGVVPELA+R H+ K Sbjct: 6 IILGIESSCDETAASIITENEQGMPTILSSIVSSQVDVHKEFGGVVPELAARSHMEKIDL 65 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 + + A +SG+ +D+DA+A TAGPGL+ L VG + G+++A + + P I V+H+EGH L Sbjct: 66 ITKKAFDKSGVKMEDLDAIAATAGPGLMVCLSVGLSFGKAMASSLNKPFIAVNHLEGHAL 125 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 +P L ++ +P++ LL+SGGHTQ +SV G+G Y+ LG +IDDA GEAFDKTAKLLG+++ Sbjct: 126 SPKL-NSELNYPYLLLLISGGHTQFLSVQGLGNYKRLGTTIDDAVGEAFDKTAKLLGIEF 184 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 PGGP + A +G ++ P+P+ + G + SF+GLKT I +Q + D+A Sbjct: 185 PGGPQIEVYAKKGDPNKYELPKPIFHKGGCNLSFAGLKTAVLK-ISKQIKTEQEKYDLAA 243 Query: 238 AFEDAVVDTLMIKCKRALDQT------GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 +F+ + + L K K A ++ + V+AGGV+AN+ +R L + K+ E Sbjct: 244 SFQKTIEEILYKKSKIAFEEFKKMNTINKNKFVVAGGVAANKRIREVLTNLCKEEEFEAI 303 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + C DN AMIA G+ +FK ++L +PRWPL Sbjct: 304 FPPINLCGDNAAMIAMVGLEKFKLKQFSELDSPAKPRWPLD 344 >UniRef50_A1AXM9 Probable O-sialoglycoprotein endopeptidase n=36 Tax=Proteobacteria RepID=GCP_RUTMC Length = 356 Score = 388 bits (996), Expect = e-106, Method: Composition-based stats. Identities = 202/335 (60%), Positives = 248/335 (74%), Gaps = 4/335 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 LGIE+SCDETGI +Y E GL+ ++L+S VK+HA+YGGVVPELASRDH+++ +PLI+A Sbjct: 22 ITLGIESSCDETGIGLYHSELGLIGHELFSSVKIHAEYGGVVPELASRDHIQRVLPLIKA 81 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L + T +D+ +AYTAGPGL GALLVG V +SLA++ D+P++ VHHMEGHLL P+L Sbjct: 82 VLADVKFTLQDLSGIAYTAGPGLAGALLVGCAVAKSLAWSLDIPSLAVHHMEGHLLTPLL 141 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E++ PEFPFVALLVSGGHT LI V IGQY++LGES+DDA GEAFDKTAK+LGL YPGGP Sbjct: 142 EESQPEFPFVALLVSGGHTMLIDVKAIGQYKILGESLDDAVGEAFDKTAKILGLGYPGGP 201 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 L+ +A QG G F FP PM RPGLDFSFSGLKTF NT + + DIA+AFE Sbjct: 202 ALAMLAEQGNYGAFKFPCPMVGRPGLDFSFSGLKTFVRNTFAKYPSK---KEDIAKAFEV 258 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 A TLMIKC+RAL+QT + LV+AGGVSAN +LR KL +M +K VFY R EFCTDN Sbjct: 259 ATTQTLMIKCRRALEQTKYATLVVAGGVSANLSLRKKLNQMGQKLDVNVFYPRQEFCTDN 318 Query: 302 GAMIAYAGMVRFKAGATA-DLGVSVRPRWPLAELP 335 GAMIA G R G ++++PRW L EL Sbjct: 319 GAMIALVGYFRLSHGQHDTHHEINIKPRWSLEELS 353 >UniRef50_B3WUZ1 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Escherichia coli B171 RepID=B3WUZ1_ECOLX Length = 332 Score = 386 bits (992), Expect = e-106, Method: Composition-based stats. Identities = 146/328 (44%), Positives = 209/328 (63%), Gaps = 5/328 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M L IETSCDETG+A++ ++ L+++ LYSQV +H+ +GG+VPE+ASR + PLI+ Sbjct: 1 ML-LAIETSCDETGVALFSEDGKLISHLLYSQVAIHSPFGGIVPEIASRKQLEVLYPLIK 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 LK++ + + AVA T GPGL+G+LLVG ++ ++++FA +P I V H++ HLLA Sbjct: 60 ELLKQNNIEISQLKAVAATFGPGLIGSLLVGVSLAKAISFALKIPLIAVDHLQAHLLAVF 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE EFPF+ LLVSGGHT L + ++ ++G + DDAAGEAFDK AKLLGL YPGG Sbjct: 120 LE-KEIEFPFIGLLVSGGHTALFLINSFFEFYVIGHTKDDAAGEAFDKVAKLLGLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++S++A +G PRP+ + LDFSFSGLKT N I+++ D+ FE Sbjct: 179 PIISQLAEKGDPKAINLPRPLLEDKSLDFSFSGLKTAVLNYIKNHS---YRVEDLCAGFE 235 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +AV D L+ K RA+D R+V+AGGV+AN+ LR + E E+++ EFCTD Sbjct: 236 EAVCDVLVYKTFRAVDLFKVPRVVVAGGVAANKRLRQRFREKAFNTGVEIYFPSLEFCTD 295 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPR 328 N AM+ G +++ ADL R Sbjct: 296 NAAMVGLLGYKQWQEKKYADLNTEAYAR 323 >UniRef50_B6BRQ7 O-sialoglycoprotein endopeptidase n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BRQ7_9RICK Length = 357 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 141/342 (41%), Positives = 217/342 (63%), Gaps = 12/342 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKG----LLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 +LGIE+SCDET ++ + + +L+N + SQV++H ++GGVVPELA+R H+ K Sbjct: 6 LILGIESSCDETAASLITENEQGIPIVLSNIISSQVEVHKEFGGVVPELAARSHMEKIDW 65 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +++ A+ +SG ++IDAVA TAGPGL+ L VG + G++ A A + P I V+H+EGH L Sbjct: 66 IVEKAINDSGRKIEEIDAVASTAGPGLIVCLSVGLSFGKAFASALNKPFIAVNHLEGHAL 125 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 +P L ++ +P++ LL+SGGH+Q ++V +G+Y+ LG +IDDA GEAFDKTAKLLG+++ Sbjct: 126 SPKL-NSKLNYPYLVLLISGGHSQFLNVQDLGKYKRLGTTIDDALGEAFDKTAKLLGVEF 184 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 PGGP + MA +G + ++ P+P+ ++ G + SF+GLKT I N DQ + D+A Sbjct: 185 PGGPQIEIMAEKGDSNKYDLPKPIFNKGGCNLSFAGLKTAILK-ITKNIKTDQEKFDLAA 243 Query: 238 AFEDAVVDTLMIKCKRALDQTGF------KRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 +F+ V + L K K A ++ K V+AGGV+AN+ +R L + + + Sbjct: 244 SFQKTVEEILYKKTKIAFNEFEKQNKLKDKIFVVAGGVAANKKIRTMLINLCNENNYKGI 303 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 + E C DN AMIA G+ +FK + L +PRWPL E Sbjct: 304 FPPIELCGDNAAMIAMVGLEKFKLKQFSALDHPAKPRWPLDE 345 >UniRef50_Q6AL73 Probable O-sialoglycoprotein endopeptidase n=3 Tax=Deltaproteobacteria RepID=GCP_DESPS Length = 344 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 142/337 (42%), Positives = 194/337 (57%), Gaps = 5/337 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIE+SCD+T A+ D + +N + Q ++H +GGVVPELASR H+ P+++ Sbjct: 8 MIILGIESSCDDTSAAVVIDGTAIQSNVISGQEEIHNCFGGVVPELASRSHLSAIQPVVE 67 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ ++ DID +A T GPGL G+LLVG + +SL+ +P + V HM GH LA + Sbjct: 68 KALSDAKISLDDIDLIATTQGPGLSGSLLVGYSYAKSLSLVKKIPFVGVDHMAGHALAIL 127 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE+ P+FPF+AL SGG + + V +ELLG + DDAAGEAFDK AK+LGL YPGG Sbjct: 128 LEEETPDFPFIALTASGGTSSIFLVKSSTDFELLGRTRDDAAGEAFDKVAKVLGLPYPGG 187 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-----NGTDDQTRADI 235 P ++ A G FPR D+ G DFSFSGLKT N + RADI Sbjct: 188 PHIAAHAETGDEKSIKFPRAWLDKDGFDFSFSGLKTAVLNYHNKIVQKNGSITKEERADI 247 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F+ AV+D L+ K A G +V+ GGVS+NR LR + K + + F Sbjct: 248 CASFQQAVIDVLVTKTINAARTHGISTVVLGGGVSSNRALRLAFSHECDKCKLQFFVPAA 307 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + CTDN AMIA AG ++ +L V R L Sbjct: 308 KLCTDNAAMIAVAGYHKYLRFGPGNLSDDVYSRSQLG 344 >UniRef50_Q0AVU0 Probable O-sialoglycoprotein endopeptidase n=27 Tax=Bacteria RepID=GCP_SYNWW Length = 339 Score = 381 bits (978), Expect = e-104, Method: Composition-based stats. Identities = 141/329 (42%), Positives = 199/329 (60%), Gaps = 1/329 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCDET AI + K +L+N + SQ+ +H +GGVVPE+ASR H+ ++ Sbjct: 8 LILGIETSCDETAAAIVRNGKEILSNIVNSQIDIHQQFGGVVPEVASRKHIENIAGVVHR 67 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A E+ L IDAVA T PGLVGALLVG + ++ A+A + P I V+H+ GH+ A L Sbjct: 68 AFSEAQLAYSAIDAVAVTNRPGLVGALLVGVSFAKAFAYALEKPLIAVNHLHGHIYANFL 127 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E EFP + L+VSGGHT L+ ++ + E+LGE+ DDAAGEAFDK A+ LGL YPGGP Sbjct: 128 EHRDIEFPAICLVVSGGHTSLLLMSNPNKMEVLGETRDDAAGEAFDKVARFLGLGYPGGP 187 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ-TRADIARAFE 240 + + A +G AG+ PR DR +FSFSGLKT A N Q D+A F+ Sbjct: 188 AIQEAATKGKAGQLQLPRVFLDRNDFEFSFSGLKTAAMNQWNKLQRRGQANVFDMAAEFQ 247 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 A+V+ L+ K +A + + ++MAGGV+AN+ LR + + K+ ++FY + CTD Sbjct: 248 AALVEVLVEKSIKAAAKYQVRTIMMAGGVAANQELRNLMKKRTKEAGLKLFYPSLKLCTD 307 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRW 329 N AM+A + + A L ++ P Sbjct: 308 NAAMVAANAHYHYGNRSFAPLSLNAYPSL 336 >UniRef50_Q11TP2 Probable O-sialoglycoprotein endopeptidase n=87 Tax=Bacteria RepID=GCP_CYTH3 Length = 343 Score = 379 bits (975), Expect = e-104, Method: Composition-based stats. Identities = 146/335 (43%), Positives = 206/335 (61%), Gaps = 9/335 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCDET A+ + +L N + SQ ++H YGG+VPELASR H + +P++ A Sbjct: 10 LLAIESSCDETAAAVIQ-DGNILCNIVASQ-RIHEKYGGIVPELASRAHQQHIIPVVAQA 67 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ + D++AVA T+GPGL+GALLVG + ++ A A +P I V+HM+ H+LA + Sbjct: 68 LLEANIQKSDLNAVACTSGPGLLGALLVGVSFSKAFASALHIPVIKVNHMKAHILAHFIG 127 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 D P FPF+ + VSGGHTQL+ V + E++GE+ DDA GEAFDKTAKL+GL YPGGPL Sbjct: 128 DVKPSFPFICMTVSGGHTQLVIVRNYLEMEVVGETQDDAVGEAFDKTAKLMGLPYPGGPL 187 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-----QTRADIAR 237 + A QG FP P D PG ++SFSG+KT ++ N D + DI Sbjct: 188 IDSYAKQGNP--LAFPFPTVDMPGYNYSFSGIKTAFMYFLKKNTAVDPDFIQKNLPDICA 245 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + + A++D LM K KR + TG R+ +AGGVSAN LR + + ++ +V+ E+ Sbjct: 246 SVQHALIDVLMRKLKRLVVDTGINRVAIAGGVSANSGLRKAMEQKREQEGWDVYIPAFEY 305 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 CTDN AMIA AG ++ A +S PR + Sbjct: 306 CTDNAAMIAVAGYHQYLENDFAGWDLSPEPRLRIG 340 >UniRef50_C6P1W3 Metalloendopeptidase, glycoprotease family n=1 Tax=Sideroxydans lithotrophicus ES-1 RepID=C6P1W3_9PROT Length = 383 Score = 379 bits (975), Expect = e-104, Method: Composition-based stats. Identities = 206/377 (54%), Positives = 254/377 (67%), Gaps = 41/377 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCDETGIA+Y E+GLLA+ L+SQ+ LH +YGGVVPELASRDHVR +PLI++ Sbjct: 6 LILGIESSCDETGIALYHTERGLLAHTLHSQIALHNEYGGVVPELASRDHVRHALPLIRS 65 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G DIDA+AYT GPGL GALLVG+++ +LA+ DVP I VHH+EGHLL+P+L Sbjct: 66 ALQKAGCALSDIDAIAYTQGPGLSGALLVGSSIACALAYTLDVPTIGVHHLEGHLLSPLL 125 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 PEFPFVALLVSGGHTQL+ V G+G Y LLGES+DDAAGEAFDK+AKLLGLDYPGG Sbjct: 126 SRPAPEFPFVALLVSGGHTQLMRVDGVGHYTLLGESVDDAAGEAFDKSAKLLGLDYPGGA 185 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI------------------- 222 LLSK+A +GT GRF PRPM LDFSFSGLKT + Sbjct: 186 LLSKLAQRGTPGRFKLPRPMLHSGNLDFSFSGLKTAVLTLVNQQIDIPHPNPDGTTSHST 245 Query: 223 ---------------------RDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFK 261 R+ T +QTRADIA A ++A+VD L+ K AL QTG Sbjct: 246 KPASGQVAGYLPEGEGANESLREFPTPEQTRADIAHAAQEAIVDVLVNKALAALKQTGLN 305 Query: 262 RLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATA-D 320 +LV+AGGV AN+ LR++L + K G VFY EFCTDNGAMIA+AG +R + D Sbjct: 306 QLVVAGGVGANQLLRSRLNASVGKHDGNVFYPELEFCTDNGAMIAFAGAMRLQQQVAQRD 365 Query: 321 LGVSVRPRWPLAELPAA 337 +V+PRW L E+ A Sbjct: 366 YRFNVKPRWDLREMNYA 382 >UniRef50_Q6MQ48 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Bdellovibrio bacteriovorus RepID=GCP_BDEBA Length = 345 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 136/334 (40%), Positives = 191/334 (57%), Gaps = 8/334 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 RVL IETSCD+T +AI D + + SQ H YGG+VPE+A+R+H +PLI+ Sbjct: 4 RVLAIETSCDDTSVAIVDRTGWVHSVVAASQDLDHEIYGGIVPEIAARNHSIALIPLIEE 63 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A K++ + D+ +A T PGL+GAL+VG +SL+ A +P + V+H+EGHLLAP L Sbjct: 64 AFKKANMNWSDVQGIAVTNRPGLIGALIVGLVTAKSLSQAKHLPFLGVNHLEGHLLAPFL 123 Query: 122 EDNPPE------FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 D+ +P+V L +SGGHT L + G+G Y +LG + DDAAGE FDK AK+ GL Sbjct: 124 RDDKYAPPEDFGYPYVGLAISGGHTSLYQIKGLGDYRILGATKDDAAGECFDKFAKMAGL 183 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRA 233 +PGG + +MA G F FPR M D SFSGLK+ + G + + Sbjct: 184 GFPGGVRVDQMAKAGNPQAFEFPRSMIHDDTFDMSFSGLKSSGQRMLEQLGPELVQERLP 243 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 D+ +F++A+VD L+ K RA KR+++ GGVSAN LR + E K+ + Sbjct: 244 DLCASFQEAIVDVLIAKLDRAAKVFRSKRVILTGGVSANSRLRQRAQEWADKKGYTLVIP 303 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +CTDN AMI Y G +R G + L + P Sbjct: 304 PLRYCTDNAAMIGYVGALRMARGEVSALDLGPSP 337 >UniRef50_B8BPP0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BPP0_ORYSI Length = 401 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 135/362 (37%), Positives = 198/362 (54%), Gaps = 29/362 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +LGIETSCD+T A+ + +L+ + SQ L +GGV P++A H ++Q Sbjct: 10 LLMLGIETSCDDTAAAVVRGDGEILSQVVSSQEDLLVRWGGVAPKMAEEAHSLAIDQVVQ 69 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ ++ D+ AVA T GPGL L VG R +A ++ +P + VHHME H L Sbjct: 70 KALDDANVSENDLSAVAVTVGPGLSLCLRVGVHKARKIAKSFRLPIVGVHHMEAHALVSR 129 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--P 178 L + +FPF+ALL+SGGH L+ G+GQY LG +IDDA GEA+DK+A+ LGLD Sbjct: 130 LVNKDLDFPFLALLISGGHNLLVLAHGLGQYVQLGTTIDDAIGEAYDKSARWLGLDMRKG 189 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN------------- 225 GGP L ++A +G F PM +FS++GLKT I Sbjct: 190 GGPALEQLALEGDPNAVKFSVPMRQHKDCNFSYAGLKTQVRLAIESRNISTDDIPISSAT 249 Query: 226 GTDDQTRADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLA 280 D Q RA+IA +F+ V L +C+RA++ + K V++GGV++N+ +R L Sbjct: 250 KDDRQIRANIAASFQRVAVLHLEERCQRAVEWALKMEPSIKYFVVSGGVASNQYVRTHLN 309 Query: 281 EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT---------ADLGVSVRPRWPL 331 ++ +K ++ P+ CTDNG MIA+ G+ F AG D+ +RPRWPL Sbjct: 310 QIAEKNGLQLVCPPPKLCTDNGVMIAWTGIEHFIAGRFDDPPAVDEPDDMQYDLRPRWPL 369 Query: 332 AE 333 E Sbjct: 370 GE 371 >UniRef50_B0TX13 Probable O-sialoglycoprotein endopeptidase n=19 Tax=Francisella RepID=GCP_FRAP2 Length = 336 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 171/336 (50%), Positives = 236/336 (70%), Gaps = 5/336 (1%) Query: 1 MRVLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M VLGIE+SCDETG+AIYD K L+A+ LYSQ+ LH YGGVVPELASR+H+ K L Sbjct: 1 MLVLGIESSCDETGLAIYDYTSKTLVADVLYSQIDLHKKYGGVVPELASREHIAKLNILT 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + L + + D+ +AYTA PGL+GAL+VGAT ++L ++ + VHH+EGHLL+P Sbjct: 61 KELLSNANINFNDLSCIAYTAMPGLIGALMVGATFAKTLGLIHNIDTVAVHHLEGHLLSP 120 Query: 120 MLE-DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 +L+ + ++PFVALLVSGGHTQL V G+Y LLGESIDDAAGEAFDKTAKLLG+ YP Sbjct: 121 LLDQSSDIKYPFVALLVSGGHTQLFEVREFGEYSLLGESIDDAAGEAFDKTAKLLGMSYP 180 Query: 179 GGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANT-IRDNGTDDQTRADIA 236 GG ++ +A + ++ PRPM ++P LDFSFSGLKT NT + + +A++ Sbjct: 181 GGVEVANLAEKATDKKKYDLPRPMKNKPNLDFSFSGLKTAVLNTWYSETDQSYENKANLC 240 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 AF++A +D L+ KC++AL +TG KRLV++GGVSAN+ LR+KL + K + E+F+ + Sbjct: 241 YAFQEAAIDVLVTKCEKALQKTGNKRLVISGGVSANKLLRSKLDILSKNKGYEIFFPPMK 300 Query: 297 FCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPRWPL 331 +CTDNGAMIA AG R+ ++L ++V+ R + Sbjct: 301 YCTDNGAMIALAGAYRYANSFRDSNLEINVKARAQI 336 >UniRef50_D0ME01 Metalloendopeptidase, glycoprotease family n=4 Tax=Bacteria RepID=D0ME01_RHOM4 Length = 339 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 151/331 (45%), Positives = 208/331 (62%), Gaps = 9/331 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T A+ + K L +N + SQ H YGGVVPELASRDH R+ VP+++ A Sbjct: 8 ILGIETSCDDTAAAVVVEGK-LRSNVVASQQATHLRYGGVVPELASRDHQRRIVPVVRQA 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+GLT +D+DAVA T GPGLVG+LLVG + ++ A P I V+H+EGH+ + +E Sbjct: 67 LQEAGLTPRDLDAVAVTYGPGLVGSLLVGLSFAKAFALGLGRPLIGVNHLEGHIYSVFIE 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P FP++ L+VSGGHTQL+ V ++ LLG + DDAAGEAFDK A+LLGL YPGGP Sbjct: 127 PPSPPFPYLCLIVSGGHTQLMRVDEGFRHTLLGRTRDDAAGEAFDKVARLLGLGYPGGPE 186 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI------RDNGTDDQTRADIA 236 + ++A QG FPRP + G DFSFSGLKT + +Q RAD+ Sbjct: 187 IDRLARQGDPNFVAFPRPRLE--GYDFSFSGLKTAVRYYLDQFSEAERARLLEQHRADLC 244 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ AVVD L+ +RA+ TG + + + GGVSAN LRA + ++ ++ Sbjct: 245 ASFQQAVVDVLIDSLRRAIQDTGLRHVAIVGGVSANSALRAAAQALAEELDVRLYIPPLA 304 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +C DN AMIA G + +AG + L ++ P Sbjct: 305 YCMDNAAMIAITGYFKARAGLESPLTLAAVP 335 >UniRef50_A0L5L8 Probable O-sialoglycoprotein endopeptidase n=24 Tax=Bacteria RepID=GCP_MAGSM Length = 353 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 168/347 (48%), Positives = 220/347 (63%), Gaps = 12/347 (3%) Query: 1 MRVLGIETSCDETGIAIYDD-------EKGLLANQLYSQVKLHADYGGVVPELASRDHVR 53 +RVLGIE+SCDET A+ + + +N ++SQ+++HA YGGVVPELASR H+R Sbjct: 2 LRVLGIESSCDETAAAVVEGAEHGHPHGVVVRSNVVWSQLEVHALYGGVVPELASRAHIR 61 Query: 54 KTVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 P+I+ AL E+G+ + +DA+A T PGLVGALLVG + LA A D P +PVHHME Sbjct: 62 HIQPVIEQALAEAGVRPQQLDAIAVTVAPGLVGALLVGVAAAQGLAVALDKPLVPVHHME 121 Query: 114 GHLLAPMLED---NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTA 170 GHL++P L EFPFVALLVSGGHT L+ G Y+LLG++ DDA GEAFDK A Sbjct: 122 GHLMSPFLMAGVVPAMEFPFVALLVSGGHTLLLHARDFGDYQLLGQTRDDAVGEAFDKGA 181 Query: 171 KLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-- 228 ++LGL YPGGP ++ +A G FPR + DR DFSFSGLKT + + Sbjct: 182 RMLGLGYPGGPEVAALAQSGDRQAVAFPRVLLDRSQFDFSFSGLKTALRTHLLKFPPESG 241 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 + AD+A ++++A+VDTL+IK A G RLV+AGGV ANR LR KLA+ K+ Sbjct: 242 GPSLADVAASYQEAIVDTLVIKSLSACRHVGVSRLVIAGGVGANRRLREKLAKQALKQGV 301 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 +++ CTDNGAMIA AG+ R G A V+ PR P+ EL Sbjct: 302 QLYAPPIHLCTDNGAMIASAGVCRLARGDQARGVVNAVPRLPIHELE 348 >UniRef50_A5CE49 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Orientia tsutsugamushi RepID=GCP_ORITB Length = 344 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 135/346 (39%), Positives = 202/346 (58%), Gaps = 14/346 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M V+GIE+SCD+T IAI + + ++AN + SQ H Y GVVPE+A+R H++ ++ Sbjct: 1 MNVIGIESSCDDTAIAIVNSNREIIANVVISQYTEHLPYSGVVPEIAARAHLKNLQYAMK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L ++ + DID +A T+GPGL+G ++VG+ G+++A A I V+H+EGH+LA Sbjct: 61 ETLNQAKINFTDIDVIAATSGPGLIGGIIVGSVFGQAIACALGKDFIAVNHLEGHILAVR 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + FP++ LLVSGGH Q I+V G+G+Y++LG++IDDA GEAFDKTA+LL L YPGG Sbjct: 121 L-NENISFPYLVLLVSGGHCQFIAVLGVGKYKILGQTIDDAVGEAFDKTARLLKLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADIARAF 239 P++ K+A++G ++ P MT + G D SFSGLKT I ++ DI +F Sbjct: 180 PIIEKLASKGDPHKYSLPLSMTKKSGCDLSFSGLKTAVKQLIFSIESLSEKVICDICASF 239 Query: 240 EDAVVDTLMIKCKRALDQT-----------GFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 + VV L+ + A+ V++GGV+AN+ LR ++ + Sbjct: 240 QYTVVQILLCRSINAIKLFESYCSNNFKINRKNYFVISGGVAANQYLRQEIFNLANTYGY 299 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 CTDN AMIA+AG+ R A + R +W + EL Sbjct: 300 CGVAPPSNLCTDNAAMIAWAGIERLNANLFSS-NFVPRAKWSVEEL 344 >UniRef50_B9KXJ0 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Chloroflexi RepID=GCP_THERP Length = 365 Score = 375 bits (964), Expect = e-103, Method: Composition-based stats. Identities = 165/358 (46%), Positives = 207/358 (57%), Gaps = 31/358 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETSCDET A+ D + +L+N + SQV LH YGGVVPELASR HV VP++ Sbjct: 1 MIILGIETSCDETAAAVVRDGRFVLSNIIRSQVDLHQRYGGVVPELASRRHVTSIVPVLD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++G+ IDA+A T GPGL G+LLVG V ++LAF W+ P IPV+H+EGH+ A Sbjct: 61 LALEQAGIGPSAIDAIAVTEGPGLAGSLLVGINVAKTLAFVWEKPLIPVNHLEGHIYANW 120 Query: 121 L------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 L E P FP V L+VSGGHT+L+ + G G Y LLG ++DDAAGEAFDK A+LLG Sbjct: 121 LTLPGQDEVPEPTFPLVCLIVSGGHTELVLMRGHGDYVLLGRTLDDAAGEAFDKAARLLG 180 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-- 232 L +PGGP + K A QG GRF PR DFSFSGLKT + R Sbjct: 181 LGFPGGPAIQKAAEQGRPGRFSLPRAWLGE-SYDFSFSGLKTALLRVLEQYQRRPARRVA 239 Query: 233 -------------------ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANR 273 AD+A F+ AVV+ L K RA + G +++AGGV+AN Sbjct: 240 AGQPFPEYVAPEYGPSVPIADLAAEFQAAVVEVLAEKTARAAREFGATMVLLAGGVAANA 299 Query: 274 TLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 LR +L E+ V Y P CTDN AMIA A + G ADL + V PL Sbjct: 300 ALRQRLREI---SPVPVRYPPPILCTDNAAMIAGAAYYLAQRGVRADLDLDVHAHLPL 354 >UniRef50_C7N1K1 Ribosomal-protein-alanine acetyltransferase n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1K1_SLAHD Length = 781 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 151/341 (44%), Positives = 186/341 (54%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D E +L++ + SQ+ HA +GGVVPE+ASR H+ + Sbjct: 439 LILAIESSCDETAAAIIDGEGSMLSDVVASQIDFHARFGGVVPEIASRKHIEAICGVTDE 498 Query: 62 ALK-------ESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L S L +D+DAVA T PGLVGAL+VG + A+ D+P I V+H+EG Sbjct: 499 CLDVAARALGRSRLRWRDLDAVAVTYAPGLVGALVVGVAFAKGAAWGADLPIIAVNHLEG 558 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A L + + P V LVSGGHT L+ V G YE LG +IDDA GEAFDK +K LG Sbjct: 559 HLYANRLAEPDIQPPMVVSLVSGGHTMLVHVKDWGDYETLGSTIDDAVGEAFDKVSKALG 618 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTR 232 L YPGGP++SK AAQG A FPR + L FS SGLKT I + Sbjct: 619 LGYPGGPIISKYAAQGDAKAIAFPRALMHSGDLRFSLSGLKTAVTTYINKEREAGRELNI 678 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +FE AVVD + K AL TG + + GGV+AN LR M KK + Sbjct: 679 PDIAASFEAAVVDVQVSKAHTALKDTGARTFCLGGGVAANPALRGAYEAMCKKHGYRLVM 738 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN AMIA RF G AD + V PL E Sbjct: 739 PPLSACGDNAAMIAEVARDRFAQGKFADWSLDVTAHAPLDE 779 Score = 83.0 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 50/150 (33%), Gaps = 8/150 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +T+ + + + G L + A+ A R + I A Sbjct: 5 ILAFDTANEVVAVGV-----GRLPDDAVDITAAQAECVASASVSARRASNTTLIARIDEA 59 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L +G+T + AV GPG + + + +A A +VP + V ++ E Sbjct: 60 LASAGVTKDQVAAVVCGRGPGSFTGVRICMATAKGMASALEVPLLGVSTLDAVAWRAWAE 119 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYE 152 + ++ ++ V E Sbjct: 120 GVR---GALLVVADAMRKEVYPVLFRLDDE 146 >UniRef50_D1IZQ0 Whole genome shotgun sequence of line PN40024, scaffold_48.assembly12x (Fragment) n=15 Tax=Magnoliophyta RepID=D1IZQ0_VITVI Length = 468 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 142/359 (39%), Positives = 203/359 (56%), Gaps = 28/359 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+T AI +L+ + SQ L A YGGV P++A H++ ++Q A Sbjct: 77 VLGIETSCDDTAAAIVRSNGDILSQVVSSQADLLARYGGVAPKMAEGAHMQVIDRVVQDA 136 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ + LT +D+ AVA T GPGL L VG R +A + ++P + VHHME H L L Sbjct: 137 LENANLTERDLSAVAVTIGPGLSLCLRVGVQKARKIAGSHNLPIVGVHHMEAHALVARLI 196 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--PGG 180 + +FPF+ALL+SGGH LI +G Y LG +IDDA GEA+DKTAK LGLD GG Sbjct: 197 EKDLQFPFMALLISGGHNLLILARDLGHYIQLGTTIDDAIGEAYDKTAKWLGLDLRRSGG 256 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ---------- 230 P + ++A +G A F PM +FS++GLKT I + + Sbjct: 257 PAIEELAREGDAKAVKFSTPMKQHKDCNFSYAGLKTQVRLAIESRNINAEIPISSASSED 316 Query: 231 --TRADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEMM 283 +RADIA +F+ V L +C+RA++ + K LV++GGV++N+ +RA+L +++ Sbjct: 317 RSSRADIAASFQRVAVLHLEERCERAIEWALKIEPSIKHLVVSGGVASNQYVRAQLDQVV 376 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATA---------DLGVSVRPRWPLAE 333 KK+ ++ P CTDNG M+A+ G+ F+ G D +RPRWPL E Sbjct: 377 KKKSLQLVCPPPSLCTDNGVMVAWTGLEHFRMGRYDPPPPANEPEDYVYDLRPRWPLGE 435 >UniRef50_A8GM49 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Rickettsia RepID=GCP_RICAH Length = 386 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 132/384 (34%), Positives = 201/384 (52%), Gaps = 51/384 (13%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++LGIE+SCD+T ++I + + +L+N + SQ HA +GGVVPE+A+R H+ + Sbjct: 3 KILGIESSCDDTAVSIITENREILSNIIISQNTEHAVFGGVVPEIAARSHLSNLDKALTN 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 LKES +I A+A T+GPGL+G ++VG+ RSL+ P I ++H+EGH L L Sbjct: 63 VLKESNTKLIEISAIAATSGPGLIGGVIVGSMFARSLSSTLKKPFIAINHLEGHALTARL 122 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 DN P +P++ LL SGGH Q ++V G+G+Y++LG +IDDA GE FDK AK+L L +PGGP Sbjct: 123 TDNIP-YPYLLLLASGGHCQFVAVLGLGKYKILGSTIDDAVGETFDKVAKMLNLAFPGGP 181 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-------------- 227 + + A G ++ FP+P+ + + SFSGLKT I + Sbjct: 182 EIEQKAKLGDPHKYKFPKPIINSGNCNMSFSGLKTAVRTLIMNLQEINYNECNHLESVRQ 241 Query: 228 -----------------------------DDQTRADIARAFEDAVVDTLMIKCKRALDQT 258 +D DIA +F+ + + L K + A+ Sbjct: 242 DEVQEEFAQRTKVHEHRRKLQNSLVSSFLNDSVINDIAASFQFTIGEILSSKVQDAIRAY 301 Query: 259 -------GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMV 311 K +++AGGV+AN+ L+ L+ K ++ Y CTDN AMIAYAG+ Sbjct: 302 EQITNNFDKKNIIIAGGVAANKYLQEILSNCAKTYGYQLIYPPIHLCTDNAAMIAYAGLE 361 Query: 312 RFKAGATADLGVSVRPRWPLAELP 335 R+ L + RW L ++ Sbjct: 362 RYNNKLFTPLNFCPKARWSLEDIS 385 >UniRef50_Q8DLI9 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Cyanobacteria RepID=GCP_THEEB Length = 353 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 146/342 (42%), Positives = 193/342 (56%), Gaps = 8/342 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L IETSCDET A+ ++ + +N + SQV H +GGVVPE+ASR H+ +I A Sbjct: 3 RILAIETSCDETAAAVVR-DRAIESNVIASQVCAHQPFGGVVPEVASRAHLENINGVITA 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ E+G IDA+A T PGLVG+LL+G T ++LA P + +HH+EGHL A L Sbjct: 62 AISEAGCDWSAIDAIAVTCAPGLVGSLLIGVTAAKTLALVHQKPLLGIHHLEGHLYASYL 121 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + E PF+ LLVSGGHT LI V G G+Y+L G++ DDAAGEA+DK A+L+GL YPGGP Sbjct: 122 AEPTLEPPFLCLLVSGGHTSLIGVYGCGEYQLFGQTRDDAAGEAYDKVARLMGLGYPGGP 181 Query: 182 LLSKMAAQGTAGRFVFP-----RPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRAD 234 LL + A QG F P P D SFSGLKT A + + AD Sbjct: 182 LLDRWAQQGNPEAFDLPEGNIRLPDGKVHPYDASFSGLKTAVARLVAELRQTHPELPVAD 241 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 +A +F+ AV L + A GFK L + GGV+AN LR L + + + Sbjct: 242 LAASFQKAVAQALTKRAIAAAVDHGFKTLAIGGGVAANSGLRQHLTAAAEPLGLRLIFPP 301 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 CTDN AMI A F+ G + L ++ R R L E+ A Sbjct: 302 LRLCTDNAAMIGCAAADHFQRGDRSPLDLTARSRLSLLEISA 343 >UniRef50_D0RQS5 Putative glycoprotease GCP n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RQS5_9RICK Length = 358 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. Identities = 138/344 (40%), Positives = 210/344 (61%), Gaps = 13/344 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKG----LLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 + LGIETSCDET A+ K +L+N + SQ +H +GGVVPELA+R H K Sbjct: 5 LIFLGIETSCDETAAALVKKSKNGKVKILSNVVSSQEIVHKKFGGVVPELAARAHSEKID 64 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 +I+ A+K+S ++ ID VA TAGPGL+ L+VG T G+++A A P +H+EGH Sbjct: 65 LIIKEAIKKSKVSIHQIDGVACTAGPGLLICLMVGMTAGKTIASALKKPFFGTNHLEGHA 124 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L L P +FP++ LL+SGGH+Q +SV G+G+Y+ LG +IDDA GEAFDKTAK+LG++ Sbjct: 125 LTMGLI-RPVKFPYLLLLISGGHSQFLSVEGVGKYKRLGTTIDDALGEAFDKTAKILGIE 183 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIA 236 +PGGP + A G F P+P+ + G + S++GLKT + N Q + D+A Sbjct: 184 FPGGPKIETFAKFGNENSFDLPKPILHKSGCNMSYAGLKTAVLHA-SKNIKSKQDKYDLA 242 Query: 237 RAFEDAVVDTLMIKCKRALDQT-------GFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 +F+ + + L +KC +A++ K V+AGGV++N+++R + ++ + Sbjct: 243 ASFQKTINEILKVKCAKAIEMFLEKHKKIKNKNFVVAGGVASNQSIRKTIKQVSSTLKFN 302 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 + CTDN AMIA+AG+ ++AG +L + +PRWPL + Sbjct: 303 THFPPLNLCTDNAAMIAWAGLQNYEAGKKPNLKIISQPRWPLDQ 346 >UniRef50_Q0SM86 Probable O-sialoglycoprotein endopeptidase n=18 Tax=Borrelia burgdorferi group RepID=GCP_BORAP Length = 346 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 126/332 (37%), Positives = 190/332 (57%), Gaps = 10/332 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIETSCD+ +A+ ++ +L+N SQ K H Y GVVPE+ASR H + + Sbjct: 1 MKVLGIETSCDDCCVAVVENGIHILSNIKLSQ-KEHEKYYGVVPEIASRLHTEAIMSVCI 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ +ID +A T+ PGL+G+L+VG + LA + P I + H+ GHL AP Sbjct: 60 KALKKANTKISEIDLIAVTSRPGLIGSLIVGLNFAKGLAISLKKPIICIDHILGHLYAP- 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L + E+PF++LL+SGGHT + E+LG ++DD+ GEAFDK AK + +PGG Sbjct: 119 LMHSKIEYPFISLLLSGGHTLIAKQKNFDDVEILGRTLDDSCGEAFDKVAKHYDIGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG--LDFSFSGLKTFAANTIR--DNGTDDQTRADIA 236 P + +++ G F FP + DFS+SGLKT + + N + T+ +IA Sbjct: 179 PNIEQISKNGDENTFKFPVTTFRKKENWYDFSYSGLKTACIHQLEKFKNKDNPTTKNNIA 238 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ A + L+ KRA+ T K+LV+AGGV++N LR K+ K + + +Y + Sbjct: 239 ASFQKAAFENLITPLKRAIKDTQIKKLVIAGGVASNLYLREKI----DKLKIQTYYPPLD 294 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 CTDNGAMIA G + + + + R Sbjct: 295 LCTDNGAMIAGLGFNMYLKYGESPIEIEANSR 326 >UniRef50_Q3YS67 Probable O-sialoglycoprotein endopeptidase n=24 Tax=Rickettsiales RepID=GCP_EHRCJ Length = 350 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 142/338 (42%), Positives = 205/338 (60%), Gaps = 8/338 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCDET +AI + K +L++++ SQ K HA+YGGVVPE+ASR H+ L + Sbjct: 8 VLGIETSCDETAVAIVNSNKEVLSHKILSQ-KEHAEYGGVVPEIASRAHINYLYDLTVSC 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ES L+ +IDAVA T+GPGL+G L+VG + + +A P I ++H+E H L + Sbjct: 67 IEESQLSLNNIDAVAVTSGPGLIGGLIVGVMIAKGIASVTGKPIIEINHLEAHALIVRMF 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 FPF+ L++SGGH Q + V +G Y LG S+DD+ GE FDK AK+L L YPGGP+ Sbjct: 127 -YEINFPFLLLIISGGHCQFLIVYNVGCYHKLGSSLDDSLGEVFDKVAKMLNLGYPGGPV 185 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG-TDDQTRADIARAFED 241 + K + G + FV PR +T R G DFSFSGLKT N I ++ D++ DI+ +F++ Sbjct: 186 IEKKSLSGDSKSFVLPRALTGRCGCDFSFSGLKTAVRNIIMNHEYIDNKLICDISASFQE 245 Query: 242 AVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 V D L+ + A+ +LV+ GGV+AN+ LR ++ E+FY + Sbjct: 246 CVGDILVNRINNAIAMSKAIDKRIDKLVVTGGVAANKLLRERMLRCASDNNFEIFYPPSK 305 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 CTDNG MI +AG+ ++L + + RWPL L Sbjct: 306 LCTDNGIMIGWAGIENLVKDYVSNLDFAPKARWPLESL 343 >UniRef50_A4EBV8 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A4EBV8_9ACTN Length = 794 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 147/341 (43%), Positives = 192/341 (56%), Gaps = 11/341 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VL IE+SCDET +AI D + +LANQ+ +Q+ HA +GGVVPE+ASR HV V ++ A Sbjct: 454 LVLAIESSCDETAVAIIDADGNMLANQVSTQIDFHARFGGVVPEIASRKHVEVIVSVVDA 513 Query: 62 ALKESG---------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHM 112 AL+++ + ++ AV T GPGLVGAL+VG + A+A P + V+H+ Sbjct: 514 ALEDAAASLGLTGGAIAPSELAAVGVTQGPGLVGALVVGVAFAKGFAYAAGKPLVCVNHL 573 Query: 113 EGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL 172 EGHL A +L + PF+ LVSGGHT L+ V G YE+LGE++DDA GEAFDK AK Sbjct: 574 EGHLFANLLAQPDLKPPFIFTLVSGGHTMLVHVKAWGDYEVLGETLDDAVGEAFDKVAKA 633 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL YPGGP++SK+A G FPR + R FS SGLKT I +T Sbjct: 634 LGLGYPGGPIISKLAETGNPKAIDFPRALNSRGDYRFSLSGLKTAVTLYIEQETKAGRTI 693 Query: 233 --ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 D+A +FE AV D K K AL TG K + GGVSAN LR + + + ++ V Sbjct: 694 HLPDLAASFEAAVFDVQYKKAKNALHATGCKEYCIGGGVSANPHLREMMIKKLGRQGIRV 753 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 CTDN AMIA +F G + V P L Sbjct: 754 TVPPLSACTDNAAMIAEVARRKFDRGEISPFDVDADPNMTL 794 Score = 89.6 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 48/148 (32%), Gaps = 16/148 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRK-TVPLIQ 60 V+ ++TS D +A+ + Q G R H V + Sbjct: 9 LVVALDTSTDMLAC---------VASWIDGQTGETKLVSGDH---MCRRHANVELVNTVD 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L ++GL D+ GPG + +G + + LA +VP + V ++ Sbjct: 57 GLLAQAGLDRSDVGCYVVGRGPGSFTGVRIGISTAKGLARGANVPLLGVSTLDACAWTAW 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGI 148 + + +L ++ + Sbjct: 117 KAGVRGK---LGILADAMRGEVYPALYM 141 >UniRef50_B9JCG8 Probable O-sialoglycoprotein endopeptidase n=86 Tax=Alphaproteobacteria RepID=GCP_AGRRK Length = 365 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 159/347 (45%), Positives = 201/347 (57%), Gaps = 15/347 (4%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 +R+LGIETSCDET AI D + ++ + SQ+ H+ YGGVVPE+A+R HV Sbjct: 5 LRILGIETSCDETAAAIVERQDDGTAIVRSDVVLSQLDEHSAYGGVVPEIAARAHVEALD 64 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 LI ALK + ++ D+DA+A T+GPGL+G LLVG G++++ A P ++H+EGH Sbjct: 65 TLIDEALKRANVSLADVDAIAATSGPGLIGGLLVGLMTGKAISKATGKPLYAINHLEGHA 124 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L L D FP++ LLVSGGHTQLI V G+GQYE G +IDDA GEAFDKTAKLLGL Sbjct: 125 LTARLTD-GLAFPYLMLLVSGGHTQLILVRGVGQYERWGTTIDDALGEAFDKTAKLLGLP 183 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADI 235 YPGGP + A +G RF PRP+ LDFSFSGLKT +Q ADI Sbjct: 184 YPGGPAVEAAAKKGNPDRFDLPRPLVGETRLDFSFSGLKTAVRLAATTIAPVSEQDIADI 243 Query: 236 ARAFEDAVVDTLMIKCKRALDQTG--------FKRLVMAGGVSANRTLRAKLAEMMKKRR 287 +F+ AV TL + R L + LV+AGGV+AN LR L E+ Sbjct: 244 CASFQKAVSRTLKDRIGRGLQRFKSEFPKTAEKPALVVAGGVAANLELRRTLQELCDLNG 303 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAE 333 CTDN MIA+AG+ R GA D L V R RWPL + Sbjct: 304 FRFIAPPLSLCTDNAVMIAWAGLERMATGAAPDGLDVQPRSRWPLDQ 350 >UniRef50_B1GZV6 Probable O-sialoglycoprotein endopeptidase n=1 Tax=uncultured Termite group 1 bacterium phylotype Rs-D17 RepID=GCP_UNCTG Length = 342 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 144/339 (42%), Positives = 205/339 (60%), Gaps = 8/339 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M + IETSCDET ++ + + + +YSQ+K+HA + GVVPELASR H+ +I Sbjct: 1 MNIFAIETSCDETSASVVLNGLKVKSVVIYSQIKIHAGFFGVVPELASRSHIENINLVIW 60 Query: 61 AALKESGLTAKD----IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 AL ++G+ D IDA+A+T+GPGL GALLVGA +SLA + P IPV+H++GHL Sbjct: 61 RALSDAGINFTDFSQKIDALAFTSGPGLAGALLVGAIAAKSLACVYKKPLIPVNHLDGHL 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 + ++E+ + PF++L++SGGHT+L+ V G+Y++LG + DDAAGEAFDK AK+LGL Sbjct: 121 YSSLIENRSVKLPFLSLIISGGHTELVVVEDFGKYKVLGSTRDDAAGEAFDKAAKMLGLS 180 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG-TDDQTRADI 235 YPGGP++ K+A G F RP + DFSFSG+KT N ++ N +++ DI Sbjct: 181 YPGGPIIDKIAESGNPEAVRFTRPYL-KGSWDFSFSGIKTALLNYLKTNPVRNEKQLNDI 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F AV +TL K A + KR+V+ GGVSAN +R E +K +VF Sbjct: 240 CASFRQAVAETLCFKSFEAAKKFNLKRIVLGGGVSANSLIRKIFLETGQKNNTKVFIPSL 299 Query: 296 EFCTDNGAMIAYAGMVRFKA-GATAD-LGVSVRPRWPLA 332 + TDN AMI A + K G D + + P PL Sbjct: 300 IYSTDNAAMIGCAAYFKQKKCGLKYDNIQLKPNPSLPLE 338 >UniRef50_Q2GEG6 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Neorickettsia RepID=GCP_NEOSM Length = 329 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 138/329 (41%), Positives = 202/329 (61%), Gaps = 7/329 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LG+ETSCDET +AI +E + +++++Q H+ Y GV PE ASR+H++ +++ Sbjct: 5 LILGVETSCDETSVAIVSEEGEVCFHEIFTQD--HSKYNGVYPEFASREHLKILPQILRR 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A++ + + A+A T GPGLVG+L+VG + R LAF+ P V+H+EGHLLA L Sbjct: 63 AVQAH--DLEKLTAIACTVGPGLVGSLIVGVMMARGLAFSLKKPVFGVNHLEGHLLAVRL 120 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + FPFV L++SGGH+QLI GIG Y LLGE++DDA GEAFDK A +LG YPGG Sbjct: 121 VE-KINFPFVCLVISGGHSQLIDARGIGDYVLLGETLDDAFGEAFDKLATMLGFTYPGGK 179 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-DDQTRADIARAFE 240 + K+A +G + RF P M ++ G +FS SG+KT I ++ +ADI +F+ Sbjct: 180 TVEKLAIKGDSERFRLPAAMINQSGCNFSLSGIKTALKKIITSLPQITEKDKADICASFQ 239 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 V ++ K ++A+ G R+V+AGGV +NR +R L E K + + CTD Sbjct: 240 ACVARIMVNKLEQAVKICGHSRIVLAGGVGSNRYIRETLEEFAKNHNLSLHFPEGILCTD 299 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRW 329 N AMIA+A + R KAG T +L + +PR Sbjct: 300 NAAMIAWAAIERLKAGCT-ELSLEPQPRL 327 >UniRef50_B2GAG0 Probable O-sialoglycoprotein endopeptidase n=56 Tax=Lactobacillales RepID=GCP_LACF3 Length = 344 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 134/337 (39%), Positives = 213/337 (63%), Gaps = 6/337 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L E+SCDET +++ +D +L+N + +Q+ H +GGVVPE+ASR H+ + + Sbjct: 6 LILAFESSCDETSVSVIEDGHRVLSNIVATQIASHQRFGGVVPEVASRHHIEQITKCTKE 65 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G++ +D+ AVA T GPGLVG+LL+G T +++A+A +P +PV+HM GHL A Sbjct: 66 ALEQAGVSYQDLTAVAVTYGPGLVGSLLIGVTAAKTIAWAHQLPLVPVNHMAGHLYAARF 125 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + +P + LLVSGGHT+L+ + Y+++GE+ DDAAGEA+DK +++G++YP G Sbjct: 126 VSDFT-YPMLGLLVSGGHTELVYMKEEHDYQIIGETRDDAAGEAYDKVGRVMGINYPAGK 184 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARAF 239 + + AA+G F FPR M DFSFSGLK+ NT+ + + + D+A +F Sbjct: 185 TVDQWAAKG-HDTFHFPRAMEKEDNFDFSFSGLKSAFINTVHNADQRGEVLDKYDLAASF 243 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR--GEVFYARPEF 297 + +VVD L+ K RALD+ K+L++AGGV+AN+ LR +L+ ++ + ++ A ++ Sbjct: 244 QQSVVDVLVAKTIRALDEFPVKQLILAGGVAANQGLRKQLSAGLQAKHPEVQLLQAPLKY 303 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 C DN AMI AG V + G AD ++ P A L Sbjct: 304 CGDNAAMIGAAGYVNYLHGDRADGSLNAVPGLSFAHL 340 >UniRef50_B3DVR7 Metal-dependent protease with possible chaperone activity n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVR7_METI4 Length = 353 Score = 365 bits (938), Expect = 1e-99, Method: Composition-based stats. Identities = 130/342 (38%), Positives = 195/342 (57%), Gaps = 8/342 (2%) Query: 1 MRVLGIETSCDETGIAIYD---DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 M LGIE+SCDET IA+ + ++A++ +Q LH +GG+VPE A R+H + Sbjct: 3 MLWLGIESSCDETAIALVKTIAGKNVVMADRCITQAPLHKPFGGIVPEYAVREHSKNLPL 62 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L+Q+ ++ L K++ A+A T GPGL+ +LLVG R LA +P V+H+EGHL Sbjct: 63 LLQSMIRSKSLNLKEVQAIAVTEGPGLMASLLVGNAFARGLALGLGIPVFGVNHLEGHLF 122 Query: 118 APMLE-DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 +P + + +FPF+ L+VSGGHT L V G QY ++G +IDDAAGEAFDK A+LLGL Sbjct: 123 SPFIGREEKLKFPFLGLVVSGGHTLLARVEGPRQYSMIGSTIDDAAGEAFDKVARLLGLS 182 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG----TDDQTR 232 YPGGP + K A +G FP + ++ +FSFSGLKT + N + + Sbjct: 183 YPGGPEIEKQAERGNPHSHNFPISLIEKNNYNFSFSGLKTAVKYFLEKNKESLSKNKEFL 242 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 AD+ +F+++V + K A + +GGV AN+ +R L + + EV + Sbjct: 243 ADVCASFQESVARVIQEKTIAAAKSFSLSLIAASGGVLANKRIRELLEKKALEEGIEVLF 302 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 A+ +FCTDN MIA+AG + + G + P + L++ Sbjct: 303 AKRQFCTDNAVMIAFAGALFYALGLPITKSFELNPNFSLSDF 344 >UniRef50_C7ND80 Metalloendopeptidase, glycoprotease family n=3 Tax=Leptotrichia RepID=C7ND80_LEPBD Length = 339 Score = 364 bits (936), Expect = 2e-99, Method: Composition-based stats. Identities = 129/341 (37%), Positives = 206/341 (60%), Gaps = 18/341 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L ETSCDET +A+ +D K +L+N + +Q+ +H ++GGVVPE+ASR H+ +P+ Sbjct: 1 MKILAFETSCDETSVAVVEDGKKILSNIISTQIDIHKEFGGVVPEIASRHHIENILPVFT 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ DID +A T PGL+G+LLVG +SL++A ++P +PV+H+ GH+ + Sbjct: 61 EALEKANCELSDIDYIAVTNTPGLIGSLLVGLMFAKSLSYANNIPLLPVNHINGHIFSSF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISV---TGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 +++ + P ++L+VSGGHT L + G +LLGE++DDA GE +DK A++LGLDY Sbjct: 121 IDN-DVKLPAISLVVSGGHTNLYYIYEENGKIITDLLGETLDDAVGETYDKIARILGLDY 179 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTRADI 235 PGGP + K++ G +P D G +FSFSG+KTF N + + + ++ DI Sbjct: 180 PGGPHIDKLSINGE-DILKIKKPKVD--GYNFSFSGIKTFITNYVNNQKMKGNAISKEDI 236 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM-----KKRRGEV 290 A++ ++ +V+ L K A+ + K +++AGGVSAN+ LR K +E K + V Sbjct: 237 AKSLQEIIVNVLYDKILMAVKEKDVKTILVAGGVSANKRLREKFSEFTNIKTDKNEQIAV 296 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADL----GVSVRP 327 + + E+CTDN AMI A K + +L V Sbjct: 297 HFPKMEYCTDNAAMIGVAAYYDLKNNSQVELGKQYDVDAIS 337 >UniRef50_C1SJZ8 Metalloendopeptidase, putative, glycoprotease family n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJZ8_9BACT Length = 327 Score = 364 bits (935), Expect = 3e-99, Method: Composition-based stats. Identities = 145/330 (43%), Positives = 202/330 (61%), Gaps = 9/330 (2%) Query: 1 MRVLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M +LGIE+SCDET +A+YD + + A SQ +LH+ +GGVVPE+ASR+H+ K L Sbjct: 1 MIILGIESSCDETSLAVYDSVNRSVKATFTSSQAELHSKFGGVVPEVASRNHILKIESLF 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + + E+G+T +DIDA+ T PGL+GAL VG + ++L +A +P IPV+H+ H+LA Sbjct: 61 EQCMTEAGITPQDIDAIGVTNAPGLIGALFVGVSFAKALGYALKIPVIPVNHLSAHILAS 120 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L + + P++AL++SGGHT + V +ELL +IDDAAGE+FDK AK+LGL YPG Sbjct: 121 ELTNQELKAPYLALIISGGHTHIYDVDEAYNFELLARTIDDAAGESFDKVAKMLGLGYPG 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 GP + K+A G + P + +P DFSFSGLKT N I D D ADIA +F Sbjct: 181 GPAIEKLAESGDENKVTLPIAIKKKP--DFSFSGLKTAVLNKINDKSESD---ADIAASF 235 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + V +TL +K R + G ++V+AGGV+ N +R M+K+ EVF+ P CT Sbjct: 236 QKTVAETLTLKTLRMAESLGRNKIVVAGGVACNGYIRRAF---MEKQGYEVFFPSPRLCT 292 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 DNG MIAYA F A L + R Sbjct: 293 DNGDMIAYAASKFFGQRKFASLDETAHDRM 322 >UniRef50_B1V8Z6 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Candidatus Phytoplasma RepID=GCP_PHYAS Length = 328 Score = 364 bits (935), Expect = 3e-99, Method: Composition-based stats. Identities = 129/332 (38%), Positives = 190/332 (57%), Gaps = 9/332 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCDET +AI D K +L+N ++SQ+K H +GGVVPE+ASR HV +++ Sbjct: 1 MNILSIETSCDETSVAITQDGKKVLSNIVFSQIKDHQMFGGVVPEIASRKHVELITLILE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +++ LT ++ID VA T GPGLVG+LLVG A+ + P + ++H+ GHL A Sbjct: 61 KAFQKACLTPQEIDLVAVTQGPGLVGSLLVGINAANVFAYTYQKPLLGINHLLGHLYAAQ 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E + + LLVSGGHT+L+ Q E+LG ++DDA GE +DK AK L L YPGG Sbjct: 121 IEHQ-IKPNALILLVSGGHTELLHFKNHDQIEVLGTTLDDALGEVYDKIAKALHLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 PL+ ++A G + RP +FSFSGLK+ N + D +I +F+ Sbjct: 180 PLIDQLAQTGKDT-YHLVRPYLKNNNFNFSFSGLKSHLVNLLLKQNIQDLNIPNICASFQ 238 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+D L+ K KR L + ++L++ GGV++N LR K+ E EV + ++CTD Sbjct: 239 ASVIDVLLTKTKRVLKKLPIQQLIVTGGVASNSALRKKMKETF--LDLEVIFPSVQYCTD 296 Query: 301 NGAMIAYAGMVRFKAGATAD---LGVSVRPRW 329 AMI A ++ T ++ P Sbjct: 297 QAAMIGIAAF--YQKNITPPSYKYDLTALPNL 326 >UniRef50_D0N6Q4 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0N6Q4_PHYIN Length = 374 Score = 363 bits (931), Expect = 7e-99, Method: Composition-based stats. Identities = 135/351 (38%), Positives = 201/351 (57%), Gaps = 19/351 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGIETSCD+T A+ D + +L+N + SQ +L+A + G+VP LA+R H +I AA Sbjct: 21 TLGIETSCDDTAAAVLDQDGRVLSNVISSQWELNAKWRGIVPALAARAHENNLPHVINAA 80 Query: 63 LKESGL-TAKDIDAVAYTAGPGLVGALLVGATVGRSLAF-AWDVPAIPVHHMEGHLLA-- 118 L++SGL + + + AVA T+GPGL L VG R + D+ + ++H+E H+L Sbjct: 81 LEQSGLESLQQLSAVAVTSGPGLAPCLDVGLRTARQICLDNPDIAFLQINHLEAHVLVSR 140 Query: 119 -PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL---- 173 P LE PEFPFV LLVSGGH L+ G+G YELLG ++DD+ GEA+DK A++L Sbjct: 141 LPQLETPRPEFPFVVLLVSGGHCCLVLAKGLGDYELLGNTLDDSIGEAYDKVARMLDITA 200 Query: 174 --GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQ 230 G GG L+ MAA+G F F PM R DFS+SG+KT ++ D++ Sbjct: 201 SSGKGVHGGKLIEDMAARGNDRAFPFTEPMKHRKDCDFSYSGIKTAMLREVKKLGELDEK 260 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQTGFK------RLVMAGGVSANRTLRAKLAEMMK 284 + D+ +F+ VD L+ + +RA + + LV+ GGV++N+ LR ++ Sbjct: 261 MKEDLCASFQRKAVDQLITRTRRACQWSKDRLGDNITSLVVCGGVASNQYLRDRMQAAAA 320 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG-VSVRPRWPLAEL 334 + + ++CTDNG M+A+AG+ R+ G +D +PRWPL L Sbjct: 321 EEEVAAVFPPAKYCTDNGVMVAWAGLERYAKGMRSDPEPARYQPRWPLETL 371 >UniRef50_B9XP92 Metalloendopeptidase, glycoprotease family n=1 Tax=bacterium Ellin514 RepID=B9XP92_9BACT Length = 341 Score = 362 bits (929), Expect = 1e-98, Method: Composition-based stats. Identities = 141/341 (41%), Positives = 202/341 (59%), Gaps = 11/341 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L +ETSCDET +AI + K +L+ + SQ+KLHA+YGGVVPELA+R+H+ +P+ Sbjct: 1 MILLAVETSCDETSVAIIRNGK-VLSTIVSSQIKLHAEYGGVVPELAAREHLANLIPVAN 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AA+ + + + +DA+A T GPGL GAL+VG + +AFA + P ++H E HL +P Sbjct: 60 AAMTAAEVQSDQVDAIAATQGPGLPGALVVGLKAAQGMAFALNKPFFGINHHEAHLYSPW 119 Query: 121 LEDNPPEFPF------VALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 + +PP F ++L+VSGGHT LI V ++ +LG +IDDAAGE FDK AKL+G Sbjct: 120 ITGSPPVADFDSFQPNISLIVSGGHTMLIHVESELKHHVLGSTIDDAAGECFDKVAKLIG 179 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG---TDDQT 231 L YPGGP + ++A+ G + FPRPM DFSFSGLKT IRDN Q Sbjct: 180 LPYPGGPEIDRLASAGNPKAYDFPRPMLRDASDDFSFSGLKTSVRYFIRDNPAVLDSLQK 239 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 D+ + ++A+V+ L+ K RA ++ K + +GGV+ NR LR+ L K++ + Sbjct: 240 LQDLCASVQEAIVEVLVTKTVRAANRLQVKCVTASGGVTCNRALRSALETACKRKHLTLR 299 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPRWPL 331 A CTDN AMI + +T L + P W L Sbjct: 300 LAEKSLCTDNAAMIGVLAERKLLHSSTPTSLDSEIMPGWAL 340 >UniRef50_A6DFV1 Metalloendopeptidase, putative, glycoprotease family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFV1_9BACT Length = 355 Score = 361 bits (928), Expect = 1e-98, Method: Composition-based stats. Identities = 140/353 (39%), Positives = 206/353 (58%), Gaps = 17/353 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LG+E+SCDET +++ + +LAN + SQ+K HA+YGGV+PELA+R+H+ P + Sbjct: 1 MIILGVESSCDETAVSLVRNGHEVLANAISSQIKDHANYGGVIPELAAREHLNNVRPTLN 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ L DID +A TA PGL+ ALLVGA LA + ++H+ H+ + Sbjct: 61 EALEKAALKLDDIDGIAVTAQPGLLPALLVGAGFANGLALSLGKKVCGINHLAAHIYGGL 120 Query: 121 LE-----DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 +E NP FP ALL+SGG+TQL + G EL+G +IDDAAGEAFDK AK+LGL Sbjct: 121 IERQDILSNPNAFPLCALLISGGNTQLFIIKKTGDCELVGSTIDDAAGEAFDKAAKILGL 180 Query: 176 DYPGGPLLSKMAAQGTAGRFVFP-------RPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 YPGGP++ ++A G ++ FP R ++ L+FSFSG+KT N ++ N D Sbjct: 181 PYPGGPIIDRLAKSGDKNKYKFPRSFLPKTRSYSEEHKLNFSFSGVKTSLLNLVKKNWKD 240 Query: 229 ----DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 D D+ +++DA+VD L K K A + G + L++ GGV+ N +R ++ +M Sbjct: 241 GMVPDGDLPDLLASYQDAIVDVLSTKLKMAAESYGARTLLLCGGVACNSAIRERVQKMAI 300 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWP-LAELPA 336 + E+ P++CTDN AMIA G K V R P + +LP Sbjct: 301 QTAKELVLTPPKYCTDNAAMIAGLGYHYLKDPNFTGDFVEASGRAPIIEKLPV 353 >UniRef50_C7H0S4 Putative glycoprotease GCP n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7H0S4_9FIRM Length = 371 Score = 361 bits (928), Expect = 2e-98, Method: Composition-based stats. Identities = 131/330 (39%), Positives = 199/330 (60%), Gaps = 3/330 (0%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VL IETSCDET +I + + +L+N +++Q+ +H +YGGVVPE+ASR+H+ K ++ Sbjct: 22 NVLAIETSCDETACSIVRNGREVLSNAIFTQMHIHREYGGVVPEIASRNHLEKINDVVDK 81 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ ++GL +DID +A T+ PGL+GAL+VG +++A+A P + VHH+ GH+ A L Sbjct: 82 AILDAGLHKEDIDVIAVTSTPGLIGALVVGVATAKTMAYALSKPLVGVHHIAGHIAANYL 141 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + E PF++L++SGGHT +I V ++E++G+++DDAAGEAFDK LLGL YP G Sbjct: 142 DHGELEPPFISLVISGGHTSVIDVKDYNEHEVIGQTLDDAAGEAFDKVGILLGLTYPAGK 201 Query: 182 LLSKMAA---QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 + ++A + F R ++ FSFSG+KT N IR N D + IA Sbjct: 202 DMDELARSAIKNNVSPVYFKRTYLEKGSPHFSFSGIKTRVMNYIRANKDDPIDKEAIALG 261 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F +AV D L+ K + K++V+AGGV+AN +R K E + + EV+ C Sbjct: 262 FHEAVTDVLVKKTMDMAKRRNRKKIVLAGGVAANSLIRNKFKEEGEAQGFEVYLPGLGMC 321 Query: 299 TDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 TDN AMIA AG ++ +G +D + Sbjct: 322 TDNAAMIASAGYYKYISGGISDYYLDAVSN 351 >UniRef50_B2UQZ0 Metalloendopeptidase, glycoprotease family n=3 Tax=Verrucomicrobia RepID=B2UQZ0_AKKM8 Length = 360 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. Identities = 144/343 (41%), Positives = 200/343 (58%), Gaps = 12/343 (3%) Query: 3 VLGIETSCDETGIAIYDDEKG-----LLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 VLGIE+SCDET +AI +L++ + SQ+ +H +GGVVPELASR+H Sbjct: 7 VLGIESSCDETAVAILRSAGEEKAPEILSSVISSQIAIHRQHGGVVPELASRNHSADLPG 66 Query: 58 LIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +I+ A +E+G DID T GPGLV ALLVG + ++LA A P + V+H+EGHLL Sbjct: 67 IIRTACREAGTAPADIDVFGATGGPGLVAALLVGNSTAKALALAAGRPFVSVNHLEGHLL 126 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 +P L+ P + ++VSGGHT + V G+G Y LLG S+DDAAGEAFDK K+LGL Y Sbjct: 127 SPFLKRPGGPVPHLGMVVSGGHTLFVDVRGVGNYRLLGRSLDDAAGEAFDKVGKMLGLPY 186 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-------NGTDDQ 230 PGGP + ++AA+G F FPR + + SFSGLKT T+ +G Q Sbjct: 187 PGGPEIDRLAAEGDPEAFSFPRALMKEHTANVSFSGLKTAVLYTLPKITKNGDPHGLPRQ 246 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 T D+ +F+ AV D L+ K +AL +G + L ++GGVS NR LR++L + + ++ Sbjct: 247 TLRDLCASFQRAVTDVLIHKALKALRASGHRTLSISGGVSCNRELRSRLKTACDREKVKL 306 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 + TDN AMIAY ++ + G L V P L E Sbjct: 307 VLPDFDLTTDNAAMIAYVTCLKARRGLFHSLDEDVDPNLKLTE 349 >UniRef50_Q6MD07 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Parachlamydiaceae RepID=GCP_PARUW Length = 343 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. Identities = 136/343 (39%), Positives = 193/343 (56%), Gaps = 11/343 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIE++CDET AI D K +L+N + SQ+ LH +YGGVVPELA R H+ +P+I Sbjct: 1 MLVLGIESTCDETACAIVRDGKDILSNIVASQIDLHKEYGGVVPELACRRHIDLIIPVID 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++ LT + ID +A GPGL+GALL+G ++LA A P I ++H+E HL A + Sbjct: 61 QALNQAKLTLEQIDLIAVANGPGLIGALLIGLNTAKALALALRKPFIGINHVEAHLYAAI 120 Query: 121 LEDNPP-EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + +FP + +++SGGHT L+ + IGQYEL+G+++DDA GEAFDK AK+L L YPG Sbjct: 121 MSHPQDFQFPCLGVVLSGGHTALVLIKQIGQYELIGQTVDDAVGEAFDKVAKMLNLPYPG 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR------- 232 GP + +A G + +F F LDFSFSGLKT I+D + Sbjct: 181 GPEIENLARHGRSVKFNFKAGQVKGRPLDFSFSGLKTAVLYAIKDPKALKEMVLLSSEMT 240 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +F++A ++ K A Q G L+ GGV+ N LR + + + Sbjct: 241 QDIAASFQEAACSDIVKKSLLAAKQYGVNTLLFGGGVTNNCYLRKLFS--VANSNLNYIW 298 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAEL 334 DN AMIA G R++ +D + + R PL + Sbjct: 299 PSAGLSLDNAAMIAGLGYYRYQLQNKSDSMDLEPLTRTPLQSV 341 >UniRef50_Q5FLZ3 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Lactobacillus RepID=GCP_LACAC Length = 349 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. Identities = 135/342 (39%), Positives = 204/342 (59%), Gaps = 11/342 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L E+SCDET A+ + + + + + +Q+K H +GGVVPE+ASR H+ + + Sbjct: 8 RILAYESSCDETSTAVIKNGREIESLIVATQIKSHQRFGGVVPEVASRHHIEVVSQITKE 67 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL E+ + KDIDA+A T GPGLVGALL+G + ++++ A +P I V H+ GH++A L Sbjct: 68 ALNEANCSWKDIDAIAVTYGPGLVGALLIGVSAAKAVSMATGIPLIGVDHIMGHIMAAQL 127 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 +D E+P +AL VSGGHT+++ + +E++G++ DDAAGEA+DK ++LG++YP G Sbjct: 128 KDE-IEYPAIALQVSGGHTEIVLLKDPTHFEIIGDTRDDAAGEAYDKIGRVLGVNYPAGK 186 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAF 239 + A QG F FPR M + DFSFSGLK+ NT D + + D+A +F Sbjct: 187 TIDAWAHQGKDT-FNFPRAMLEDDDYDFSFSGLKSAFINTCHHADQIHEKLNKYDLAASF 245 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR----RGEVFYARP 295 + AV+D L K RA+ + K +M GGV+AN+ LR +++E + K + +V Sbjct: 246 QAAVIDVLAHKTIRAIKEYKPKTFIMGGGVAANQGLRDRMSEEIAKLPKADQPKVILPDL 305 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 + C DN AMI A + G ADL ++ P ELP A Sbjct: 306 KLCGDNAAMIGAAAYNLYNGGQFADLTLNADPSL---ELPYA 344 >UniRef50_Q045T6 Probable O-sialoglycoprotein endopeptidase n=433 Tax=cellular organisms RepID=GCP_LACGA Length = 348 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. Identities = 134/342 (39%), Positives = 203/342 (59%), Gaps = 11/342 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L E+SCDET A+ + + + + + +Q+K H +GGVVPE+ASR H+ + + Sbjct: 7 RILAFESSCDETSTAVIKNGREIESLIVATQIKSHQRFGGVVPEVASRHHIEVITQITKE 66 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL E+ T DIDA+A T GPGLVGALL+G + ++ + A +P I V H+ GH++A L Sbjct: 67 ALAEANATWDDIDAIAVTYGPGLVGALLIGVSAAKAASMATGIPLIGVDHIMGHIMAAQL 126 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 +D E+P +AL VSGGHT+++ + +E++G++ DDAAGEA+DK ++LG++YP G Sbjct: 127 KDE-IEYPALALQVSGGHTEIVLMKDPIHFEIVGDTRDDAAGEAYDKIGRVLGVNYPAGK 185 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARAF 239 + + A +G F FPR M + DFS SGLK+ NT D + + D+A +F Sbjct: 186 TIDEWAHKGKDT-FHFPRAMMEDDDYDFSLSGLKSAFINTCHHADQIHEKLDKYDLAASF 244 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR----RGEVFYARP 295 + +VVD L K RA+ + K ++ GGV+AN LR +LAE ++K + +V Sbjct: 245 QASVVDVLSHKTIRAIKEYKPKTFILGGGVAANHGLRDRLAEEIEKLPADIKPKVILPDL 304 Query: 296 EFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 + C DN AMI A +KAG +D ++ P ELP A Sbjct: 305 KLCGDNAAMIGAAAYNLYKAGKFSDENLNADPSL---ELPYA 343 >UniRef50_C7MKR9 Ribosomal-protein-alanine acetyltransferase n=10 Tax=Bacteria RepID=C7MKR9_CRYCD Length = 860 Score = 360 bits (924), Expect = 4e-98, Method: Composition-based stats. Identities = 147/341 (43%), Positives = 192/341 (56%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D ++A+ + SQ+ HA +GGVVPE+ASR HV ++QA Sbjct: 518 LILAIESSCDETAAAIVDGHGRIIADVVASQIDFHARFGGVVPEIASRKHVEAICGVVQA 577 Query: 62 ALKES-------GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L E+ L+ +DAVA T PGLVGAL+VG ++ A+A +P I V+H+EG Sbjct: 578 CLDEAAEHLGTANLSWNSLDAVAVTYAPGLVGALVVGVAYAKAAAWAAGIPFIKVNHLEG 637 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A L + + P V LVSGGHT L+ V G Y ++G +IDDA GEAFDK AK LG Sbjct: 638 HLYANKLARSDIKPPLVVSLVSGGHTMLVHVRDWGDYCVMGSTIDDAVGEAFDKVAKALG 697 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-- 232 L YPGGP++S++A QG FPR M L FS SGLKT I + Sbjct: 698 LGYPGGPVISRLAQQGNPAAIHFPRAMMHSGDLRFSLSGLKTAVVTYIHNQQQQKADLNV 757 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +F+ AV+D + K ALD+TG K + GGV+AN LRA +R + Sbjct: 758 PDIAASFQAAVIDVQVAKATAALDETGAKEFCLGGGVAANPALRAAYESACAQRGVRLTM 817 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 CTDN AMIA + R++AG T+ L L E Sbjct: 818 PPARACTDNAAMIALVALDRYQAGKTSGLDTDAAAHSNLEE 858 Score = 82.6 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 21/169 (12%) Query: 3 VLGIETSCD--ETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 VL +T+ + G+ + DD + ++ H R + +P I Sbjct: 11 VLAFDTANEVIALGLGVLDDTTQTVRCVASKRIPAH------------RSSNTRLLPEID 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A L + DI V GPG + + + +A A VP I V ++ Sbjct: 59 ALLTAEKMERADIATVCCGRGPGSFTGVRICIATAKGIAQALGVPLIGVSTLDAIAWQ-- 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEA 165 +L ++ V G + L +++ A EA Sbjct: 117 -MHEAGIRGQALVLADAMRKEVYPVRFTLDDAGVHRLELDTVVKAQEEA 164 >UniRef50_D1AVQ5 Metalloendopeptidase, glycoprotease family n=1 Tax=Streptobacillus moniliformis DSM 12112 RepID=D1AVQ5_STRM9 Length = 332 Score = 360 bits (924), Expect = 5e-98, Method: Composition-based stats. Identities = 130/314 (41%), Positives = 193/314 (61%), Gaps = 6/314 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IE+SCDET +AI D K +L+N + +Q+ +H +YGGVVPE+ASR H+ + + Sbjct: 1 MLILAIESSCDETSVAILKDGKNVLSNVIATQIDIHKEYGGVVPEIASRHHIENILTVYD 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ DI +A T PGL+G+LLVG + L+ + ++P IPV+H+EGH+ + Sbjct: 61 KALKEANCKISDISYIAVTNTPGLIGSLLVGLMFAKGLSLSNNIPLIPVNHIEGHIFSTF 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + D P+ P + L+ SGGHT L + LLGE++DDA GEA+DK A++LGL+YPGG Sbjct: 121 I-DYEPKLPMLTLVASGGHTSLYLIDENKDLTLLGETLDDAIGEAYDKVARILGLEYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTRADIARA 238 PLL KMA G F P P G DFSFSG+KTF N + + +D + D+A+ Sbjct: 180 PLLEKMAIMG-HNSFDIPTPKVS--GYDFSFSGIKTFITNYVNRKKMKGEDFNKEDLAKT 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+D +++ L+ K +A + K + + GGVSAN+ +R + ++ + + E+C Sbjct: 237 FQDKIIEVLIDKLSKASRKNNIKTISVVGGVSANKAIREAIINSEYFENVDILFPKFEYC 296 Query: 299 TDNGAMIAYAGMVR 312 TDN AMIA A + Sbjct: 297 TDNAAMIASACYHK 310 >UniRef50_B3R0M3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Candidatus Phytoplasma mali RepID=GCP_PHYMT Length = 329 Score = 359 bits (923), Expect = 5e-98, Method: Composition-based stats. Identities = 125/329 (37%), Positives = 202/329 (61%), Gaps = 8/329 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCDET ++ D K +++N ++SQ+K H+ GGV+PELASR+H++ +++ Sbjct: 1 MIILSIETSCDETSASVTQDGKKVISNIVFSQIKEHSLNGGVIPELASREHLKNITLVLE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +LKE+ + ++ID VA+T GPGL+G+LLVG + + P + V+H+ GH+ A Sbjct: 61 KSLKEANIQPQEIDLVAFTQGPGLIGSLLVGINCALVFGYIYKKPVLGVNHLLGHIYAAQ 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E+ EFP + L++SGGHT+L+++ Q + LG + DDA GEA+DK +++LG YPGG Sbjct: 121 IENE-IEFPSLVLIISGGHTELLALENYLQIKKLGFTCDDAVGEAYDKVSRILGFGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ ++A +G F F RP +FSFSGLK+ N + N + Q + +I +F+ Sbjct: 180 PIIDELAQKGK-DIFNFVRPYLKNDNFNFSFSGLKSSIFNLVSKNNFNLQEKINICSSFQ 238 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +V+D L+ K KR L + FK+L++ GGV+AN +LR + + + + +V ++C D Sbjct: 239 SSVIDVLVEKTKRVLKKYSFKQLIITGGVAANYSLRKRF--LSEFSQLKVIIPSLKYCGD 296 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRW 329 AMI A +FK L + W Sbjct: 297 QAAMIGIAAYYQFKYQ----LKFNQNYHW 321 >UniRef50_B0VHD4 Putative metalloendopeptidase, , glycoprotease family n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VHD4_9BACT Length = 338 Score = 359 bits (921), Expect = 1e-97, Method: Composition-based stats. Identities = 124/327 (37%), Positives = 193/327 (59%), Gaps = 3/327 (0%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L E+SCD+T +AI D + ++ N + SQ H ++GG++PELASR H++ V L +AA Sbjct: 6 ILAFESSCDDTSVAIVDTDYNVIVNLISSQ-PEHLEFGGILPELASRLHLKNIVTLTKAA 64 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L S L +DI A+A + PGL+G+L+VG + LA++ +P I V+H+ H+ A +E Sbjct: 65 LNASKLNLQDISAIAVSINPGLIGSLIVGLAFAKGLAWSLSLPLITVNHILSHIFANFIE 124 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 E PF+AL+VSGGHT+L+ + + ++G+++DDAAGE+FDK AKLLGL +PGGP Sbjct: 125 HKAVEPPFLALVVSGGHTELVHFDTLTTFTVVGKTLDDAAGESFDKAAKLLGLGFPGGPA 184 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRADIARAFE 240 + ++A +G FPR + + +FS+SGLKT + + + DIA + + Sbjct: 185 IDELAQKGNPNFIKFPRALPQKNNFNFSYSGLKTAIRTWLVNQNPETLQAELPDIAASVQ 244 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 A++D L+ K Q +++AGGV+AN LR +L K +VFY C D Sbjct: 245 QAIIDPLVHKTVLWARQHKIPYILLAGGVAANSALRQQLTTTSAKYGIKVFYPSNALCMD 304 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRP 327 N AM+ A + +F A L ++V Sbjct: 305 NAAMVGAAAIPKFLTKNYAPLSINVSS 331 >UniRef50_B7CBT6 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CBT6_9FIRM Length = 333 Score = 358 bits (919), Expect = 2e-97, Method: Composition-based stats. Identities = 131/335 (39%), Positives = 201/335 (60%), Gaps = 7/335 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M ++GIE+SCDET +A+ D+K +L++ + SQ+ +H ++GGVVPE+ASR HV I+ Sbjct: 1 MIIIGIESSCDETAVAVVKDKKEVLSSVVASQIDVHTEFGGVVPEVASRIHVENISYCIE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ +T +D+DAVA T GPGL+G L VG ++LAFA+ P +PVHH+ GH+ A Sbjct: 61 KALKDANITMEDVDAVAVTQGPGLIGCLHVGVQAAKTLAFAYHKPLVPVHHLAGHIYANE 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L ++P +AL+VSGG+T+L+ + +E+LGE+ DDA GEAFDK A++LGL YPGG Sbjct: 121 LV-VDMKYPVLALVVSGGNTELVYMKDETSFEILGETQDDAIGEAFDKVARVLGLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARA 238 P + K+A +G + +P T DFSFSGLK+ + + AD+A + Sbjct: 180 PKIDKLAKEGKP-VYELAKPKTQGR-YDFSFSGLKSSVLQFTKRMERQGKTFDMADLACS 237 Query: 239 FEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEMMKKR-RGEVFYARPE 296 F++ +D + + + LD + V+ GGVSAN LR K+ E+ + E Sbjct: 238 FQECALDEIFSRVRAVLDDHKDIRHFVVGGGVSANSRLREKVEELRNEYPEVEFTVPPMY 297 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 CTDN +MI AG + + +G + ++ + Sbjct: 298 CCTDNASMIGVAGTIAYLSGRRGNASLTADSSLEI 332 >UniRef50_B5RQA5 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Borrelia RepID=GCP_BORRA Length = 338 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. Identities = 125/332 (37%), Positives = 190/332 (57%), Gaps = 10/332 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIE+SCD+ AI ++ +L+N SQ K H Y G+VPE+ASR H + + Q Sbjct: 1 MKVLGIESSCDDCCAAIVENGNTILSNIKLSQ-KEHKKYYGIVPEIASRLHTEFIMYVCQ 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ + + +ID +A T+ PGL+G+L+VG + L+ A P I + H+ GHL AP+ Sbjct: 60 QAIISAKINISEIDLIAVTSQPGLIGSLIVGVNFAKGLSIALKKPLICIDHILGHLYAPL 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L ++ E+PF++L++SGGHT L E+LG ++DDA GEAFDK AK + +PGG Sbjct: 120 L-NHTIEYPFLSLVLSGGHTILAKQNNFDDIEILGRTLDDACGEAFDKIAKHYKMGFPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRP--GLDFSFSGLKTFAANTIRDNG--TDDQTRADIA 236 P + K+A G F FP + D+ DFS+SGLKT + + T +IA Sbjct: 179 PNIEKLAIDGNQYAFNFPITIFDKKENRYDFSYSGLKTACIHQLEKFKNNNAQITNNNIA 238 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F+ A + L+I KRA+ T K+L+++GGV++N LR K+ + E +Y + Sbjct: 239 ASFQRAAFENLIIPIKRAIKDTNIKKLIISGGVASNLYLREKIKNL----EIETYYPPID 294 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 CTDN AMIA G + + + + + R Sbjct: 295 LCTDNAAMIAGIGYLMYLKYGASSIETNANSR 326 >UniRef50_D0WGH2 O-sialoglycoprotein endopeptidase n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WGH2_9ACTN Length = 807 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. Identities = 141/341 (41%), Positives = 188/341 (55%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 + E+SCDET +I + +L++ + SQV HA +GGVVPE+ASR H+ + Sbjct: 465 LICAFESSCDETASSIIAGDGTILSDVVASQVDFHARFGGVVPEIASRKHIEAICGVADE 524 Query: 62 ALKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L+ + L +D+DA+A T PGLVGAL+VG + + LA+ +VP + V+H+EG Sbjct: 525 CLERAAVALGRPSLRWRDLDAIAVTYAPGLVGALVVGVSFAKGLAWGSEVPLVAVNHLEG 584 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A + D P V LVSGGHT L+ V YE LG +IDDAAGEAFDK +K LG Sbjct: 585 HLYANKIADPAIAPPMVVSLVSGGHTMLVHVKDWANYETLGSTIDDAAGEAFDKVSKALG 644 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTR 232 L YPGGP++S+ AA+G FPR + L FS SGLKT I Sbjct: 645 LGYPGGPIISRYAAKGNPRAIDFPRALMHSGDLRFSLSGLKTAVITYIHKQQEAGMPLNI 704 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA +F+ AVVD + K + AL +TG + + GGV+AN LRA +M K + Sbjct: 705 PDIAASFQQAVVDVQVAKARTALIETGSRTFCLGGGVAANPALRAAYEKMCAKNGFRLVM 764 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN AMIA + RF G AD + V+ PL E Sbjct: 765 PPLSACGDNAAMIAEVALDRFAQGKLADFTLDVKAHAPLDE 805 Score = 82.2 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 43/145 (29%), Gaps = 7/145 (4%) Query: 3 VLGIETSCDETGIAI--YDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 VL +T+ + + I D + + + G A R +P+I Sbjct: 4 VLAFDTANEAVVVGIGSVDADGAEAGRIVLKEAPARLVAGEAR--AAHRASNTVLIPMID 61 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + + DI AV GPG + + + +A +VP V ++ Sbjct: 62 ELMAGENIEKDDIAAVVCGRGPGSFTGVRICMAAAKGIASGLEVPLFGVSTLDAVAWGVW 121 Query: 121 LEDNPPEFPFVALLVSGGHTQLISV 145 + + ++ Sbjct: 122 ESGYR---GAMIVAADAMRKEVYPA 143 >UniRef50_C8W929 Metalloendopeptidase, glycoprotease family n=2 Tax=Atopobium RepID=C8W929_ATOPD Length = 832 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. Identities = 146/340 (42%), Positives = 187/340 (55%), Gaps = 9/340 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L +E+SCDET + I D + AN + +Q+ HA +GGVVPE+ASR H V L + Sbjct: 479 LILSLESSCDETAMCIMDSHGVVCANVVATQIDFHARFGGVVPEIASRKHTEAIVGLFEE 538 Query: 62 ALKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + +G L D+ AV TAGPGLVGAL+VG + A D+P IPVHH+EG Sbjct: 539 TMARAGAHFGCDTLVPSDLAAVGVTAGPGLVGALVVGVAFAKGFCVATDLPLIPVHHLEG 598 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HLLA + E E PFVA LVSGG+T L+ V G Y +LG +IDDA GEAFDK AK LG Sbjct: 599 HLLANLFETPDLEPPFVASLVSGGNTMLVHVRAWGDYVVLGSTIDDAVGEAFDKVAKALG 658 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQTR 232 L YPGGP++SK+AAQG FPR M FS SGLKT I + Sbjct: 659 LGYPGGPVISKLAAQGNPKAIHFPRAMMHSGDYSFSLSGLKTAVITYIEGENRAGRAINL 718 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 D+A +FE AV+D + K K A+++TG + GGV+AN LRA E K V Sbjct: 719 PDLAASFEQAVIDVQVAKAKTAVEETGVSDFCVGGGVAANPALRAAYKETFGKMGVRVTV 778 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 C DN AMIA + ++ + L + P L Sbjct: 779 PPMSVCGDNAAMIAVGALRSYRTQGFSPLTLDANPNAQLG 818 Score = 79.9 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 33/149 (22%), Positives = 53/149 (35%), Gaps = 11/149 (7%) Query: 3 VLGIETSCD--ETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHV------RK 54 VL ++TS D +A A + AD G V LAS DH+ + Sbjct: 11 VLAVDTSTDMLACTVARLTKRNADAAVAAAADGASAADGGFNVEVLASTDHLCRRQANVE 70 Query: 55 TVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 V +Q AL + LT D+DAV GPG + +G + ++ +P ++ Sbjct: 71 LVSSVQEALVAADLTMADVDAVIAGRGPGSFTGVRIGVATAKGISCGSGLPLYGASALDA 130 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLI 143 + VA ++ Sbjct: 131 MAFSAWKVGVRGTVGVVA---DAMRGEVY 156 >UniRef50_C0Q8X7 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Desulfobacteraceae RepID=GCP_DESAH Length = 333 Score = 357 bits (916), Expect = 4e-97, Method: Composition-based stats. Identities = 146/331 (44%), Positives = 201/331 (60%), Gaps = 1/331 (0%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIE+SCD+T A+ D +L++ + SQV +H YGGVVPELASR H+ P++ Sbjct: 1 MIILGIESSCDDTAAAVVSDHNTVLSSVVSSQVDVHHRYGGVVPELASRMHIEAISPVVA 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ ++G++ I+ VA T GPGL+GALLVG + ++ A+A ++P V+H+EGH+ + + Sbjct: 61 QAVDQAGISPDQIEGVAVTRGPGLIGALLVGFSFAKAFAWAKNIPWAGVNHLEGHIYSLL 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L D+PP FPF ALL SGGHT + V ++ELLG++ DDAAGEAFDK AK+LGL YPGG Sbjct: 121 LSDDPPAFPFTALLASGGHTSIFHVVSQDRFELLGQTRDDAAGEAFDKVAKMLGLGYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-DQTRADIARAF 239 ++ +AA+G FPR D+ G DFSFSGLK+ A ++ N + + IA F Sbjct: 181 AVVEALAAKGDPCLIPFPRSFLDKDGFDFSFSGLKSAVARYVQLNRENLGEMMPHIAAGF 240 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + AV D L K A TG R+ +AGGVSANR L +++ K ++ P FC Sbjct: 241 QSAVTDVLAFKLIHAARATGCSRIAIAGGVSANRFLASRMKIEAAKHNMALYLPPPSFCG 300 Query: 300 DNGAMIAYAGMVRFKAGATADLGVSVRPRWP 330 DN AMIA G G L V R Sbjct: 301 DNAAMIAARGHRLISQGDLCQLDSDVFSRTR 331 >UniRef50_Q2JXG9 Probable O-sialoglycoprotein endopeptidase n=31 Tax=Bacteria RepID=GCP_SYNJA Length = 366 Score = 355 bits (912), Expect = 1e-96, Method: Composition-based stats. Identities = 148/345 (42%), Positives = 203/345 (58%), Gaps = 10/345 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEK-------GLLANQLYSQVKLHADYGGVVPELASRDHVR 53 +R+L IETSCDET +A+ + + L++ + SQ+ LHA YGGVVPE+A+R HV Sbjct: 2 LRLLAIETSCDETAVAVVEADAAWPTFAPRQLSSVVASQIDLHAAYGGVVPEVAARRHVE 61 Query: 54 KTVPLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 ++++AL+++GL ++DAVA T PGLVG+LLVG ++LA ++ P I VHH+E Sbjct: 62 TLPFVLESALQQAGLGMAEVDAVAVTCAPGLVGSLLVGLMAAKTLALLYNKPLIGVHHLE 121 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 GHL + L P + LLVSGGHT LI + G+Y+ +G + DDAAGEAFDK A+LL Sbjct: 122 GHLFSGFLAAADLRPPCLGLLVSGGHTSLIWMKDYGEYQTMGRTRDDAAGEAFDKVARLL 181 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--RDNGTDDQT 231 GL YPGGP + + A QG RF P D P D SFSGLKT + + Sbjct: 182 GLGYPGGPQIDRWAQQGDPDRFPLPEGKLDHP-YDTSFSGLKTAVLRLVQQLQQEGQELP 240 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 ADIA +F+ + L K + G L++ GGV+ANR LRA+L E +++ V Sbjct: 241 VADIAASFQACLTRVLTEKAVACAEALGLSTLLVTGGVAANRELRARLLEAGRQKGLRVV 300 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 P CTDN AMI AG+ + G T+ L + V R L E+PA Sbjct: 301 IPPPNLCTDNAAMIGAAGLCHWLRGETSPLELGVASRLTLEEIPA 345 >UniRef50_Q47LN7 Probable O-sialoglycoprotein endopeptidase n=58 Tax=Bacteria RepID=GCP_THEFY Length = 347 Score = 355 bits (911), Expect = 2e-96, Method: Composition-based stats. Identities = 148/337 (43%), Positives = 197/337 (58%), Gaps = 5/337 (1%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++GIE+SCDETG+A LLA+++ S V HA +GGVVPE+ASR H+ P ++ Sbjct: 9 LIMGIESSCDETGVAFVR-GCELLADEVASSVDEHARFGGVVPEVASRAHLEAMTPTVRR 67 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A + +G+ D+DA+A T GPGL GALLVG + ++ A A D P V+H+ GH+ L Sbjct: 68 AAERAGVRLSDVDAIAVTVGPGLAGALLVGLSAAKAYALALDKPLYGVNHLVGHVAVDQL 127 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIG-QYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 E P P VALLVSGGHT L+ V + +LLGE++DDAAGEA+DK A+LL L YPGG Sbjct: 128 EHGPLPKPVVALLVSGGHTSLLLVRDLATDVQLLGETVDDAAGEAYDKVARLLNLPYPGG 187 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARA 238 P + + A G FPR DFSFSGLKT A + D + + D+A A Sbjct: 188 PPIDRAARDGDGTAIHFPRGKWGDGTYDFSFSGLKTAVARWVEDAERQGRPVSVPDVAAA 247 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++AV D L K A + G + LV++GGV+AN LRA E EV RP C Sbjct: 248 FQEAVADVLTRKAVDACREHGVRHLVISGGVAANSRLRALAEERCAAAGIEVRVPRPRLC 307 Query: 299 TDNGAMIAYAGMVRFKAGA-TADLGVSVRPRWPLAEL 334 TDNGAMIA G AG + L ++V P++ + Sbjct: 308 TDNGAMIAALGAEVVAAGLPPSPLDMAVDTSLPVSSV 344 >UniRef50_C7LR95 Metalloendopeptidase, glycoprotease family n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LR95_DESBD Length = 356 Score = 354 bits (910), Expect = 2e-96, Method: Composition-based stats. Identities = 137/342 (40%), Positives = 197/342 (57%), Gaps = 16/342 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A++D + L+ + +++Q+ +H+ +GGVVPELASR+H+R L+ Sbjct: 1 MICLGIETSCDETSVALWD-DGHLVTDLVHTQIPMHSVFGGVVPELASREHLRLLDGLVS 59 Query: 61 AALKES-GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + L+ + + ID +A T GPGL+GALLVG + +SL+ + VP I V+H+ HLLA Sbjct: 60 SVLQSAERPAGQGIDLIAVTRGPGLLGALLVGISYAKSLSLSLGVPVIGVNHLYAHLLAC 119 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + P E+P + +LVSGGHT + + ++ LLG+++DDAAGEAFDK AKLL L YPG Sbjct: 120 DFTE-PIEYPALGVLVSGGHTHIYEMPAPCEFNLLGKTLDDAAGEAFDKIAKLLNLPYPG 178 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD----------- 228 G + +A GTA +F +P DFSFSGLKT A + Sbjct: 179 GKYIDILARLGTADPRLFSKPYLQNDNCDFSFSGLKTAVAQYVHKKSFAAIDYAAFDVEL 238 Query: 229 -DQTRADIARAFEDAVVDTLMIKCKRALDQT-GFKRLVMAGGVSANRTLRAKLAEMMKKR 286 Q D+ + +V+TL+ K +RA+ + K L +AGGV+AN LR K + R Sbjct: 239 IPQEIKDLCATVNETIVETLLEKTRRAVARCHDVKTLCLAGGVAANSHLRHKFSAFAHAR 298 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +C DN AMIAYAG+ K G + + PR Sbjct: 299 GFKFLAPAQNYCGDNAAMIAYAGVQWAKKGLMSSMDFEAVPR 340 >UniRef50_D1N4S8 Metalloendopeptidase, glycoprotease family n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N4S8_9BACT Length = 359 Score = 354 bits (908), Expect = 3e-96, Method: Composition-based stats. Identities = 140/354 (39%), Positives = 205/354 (57%), Gaps = 21/354 (5%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCDET A+ D +L++ + SQ+ HA +GGVVPELA+R+H+ P+++ Sbjct: 3 LILGIESSCDETAAAVVRDGYQVLSSCVASQIAKHAVHGGVVPELAAREHLVALNPVVEG 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+E+G+T K+IDA+A T GPGL+ ALLVG + + LA P I V+H H+ L Sbjct: 63 ALREAGVTMKEIDAIAVTQGPGLIPALLVGLSFAKGLAMGNGKPLIGVNHFIAHIYGAFL 122 Query: 122 E------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 + +NP +P +AL+VSGGHT L+ + G+ LG +IDDAAGEA DK AKLLGL Sbjct: 123 DEAHGVLENPATYPLLALVVSGGHTSLMLIERDGKARQLGCTIDDAAGEALDKGAKLLGL 182 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPG--------LDFSFSGLKTFAANTIRDNGT 227 YPGGP++ K A G ++ FPRP+T G +FSFSG+KT ++ + Sbjct: 183 GYPGGPIMQKTAEGGDPHKYEFPRPLTGGAGKPLAPENLYNFSFSGIKTALLYHVKHHAG 242 Query: 228 DD-----QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEM 282 D + D ++++AVVD L K A G K +V+AGGV+ N LR + + Sbjct: 243 ADGKLPAELLQDTVASYQEAVVDVLTRKTLLAAKNFGAKTIVVAGGVACNSVLRERFEAL 302 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWP-LAELP 335 K ++ A ++CTDN AM+ G + A + L + R P + ++P Sbjct: 303 TPKH-VQLRLAARKYCTDNAAMVGGLGWHYHRKQAYSPLNIDSFARLPQITQVP 355 >UniRef50_Q058D1 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Buchnera aphidicola str. Cc (Cinara cedri) RepID=GCP_BUCCC Length = 343 Score = 353 bits (907), Expect = 4e-96, Method: Composition-based stats. Identities = 150/340 (44%), Positives = 228/340 (67%), Gaps = 9/340 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCD+T +AIYD + GL+ +Q +Q +H+ Y G+VPELA+R H+ + LI+ Sbjct: 1 MKILGIETSCDDTSVAIYDKKLGLIDHQTLNQNSVHSKYHGIVPELAARSHLNQLNFLIK 60 Query: 61 AALKE------SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + S K AVAYT GPGL G+++V + RS+A + D+P I ++H+EG Sbjct: 61 NIFSKYFLYNSSNFKKKFFKAVAYTVGPGLSGSIVVHSC--RSIALSLDIPYILINHLEG 118 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HLL+ ML FPF+ALLVSG +TQLI +G+Y +LG+++DDA G FD AK+LG Sbjct: 119 HLLSVMLSYKKNLFPFLALLVSGANTQLIYAKYLGKYIILGQTLDDAVGNVFDYIAKILG 178 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 L +PGG LS +A G +G++ FPRPMT L+FSFSGLKT N I ++ Q +++ Sbjct: 179 LGFPGGKNLSDLAKYGISGKYFFPRPMTKYSNLNFSFSGLKTHVKNVILNSSDSFQEKSN 238 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 IA++FE+A+VDTL+IKCK A+ + K ++ GGVS+NR LR KL +++ K + ++++++ Sbjct: 239 IAKSFEEAIVDTLIIKCKLAIKKIKVKNFLVCGGVSSNRLLRIKLKKLIYKNQRKLYFSK 298 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATA-DLGVSVRPRWPLAE 333 +FCTDN MIAY G ++++ G + + S+ P +++ Sbjct: 299 KKFCTDNAGMIAYLGFLKYQQGMYSYNKSFSIYPNLLISD 338 >UniRef50_B1XJF0 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Synechococcus sp. PCC 7002 RepID=GCP_SYNP2 Length = 355 Score = 353 bits (905), Expect = 8e-96, Method: Composition-based stats. Identities = 145/340 (42%), Positives = 199/340 (58%), Gaps = 8/340 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VL IETSCDET +AI + + +L N + SQ+ +H ++GGVVPE+ASR H+ I Sbjct: 3 IVLAIETSCDETAVAIV-NNRKVLGNVVASQIDIHREFGGVVPEVASRHHLESINACIDT 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A ++SGL+ +I+A+A T PGLVGALL+GA G++LA + P I VHH+EGH+ A L Sbjct: 62 AFEQSGLSWSEIEAIATTCAPGLVGALLLGAAAGKTLAMIHNKPFIGVHHLEGHIYASYL 121 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E PF+ LLVSGGHT I V G G+Y+LLGE+ DDAAGEAFDK A+LL + YPGGP Sbjct: 122 SQPELEPPFLCLLVSGGHTSFIEVRGCGEYKLLGETRDDAAGEAFDKVARLLRVGYPGGP 181 Query: 182 LLSKMAAQGTAGRFVFP-----RPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRAD 234 ++ ++A G F P P D SFSGLKT ++ T + AD Sbjct: 182 VIDRLAKTGDPQAFKLPEGRISLPGGGYHPYDCSFSGLKTAVLRLVQQFETQGKAVPVAD 241 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 IA +F+ V L + R + +V+ GGV+AN LR L + +V++ Sbjct: 242 IAASFQYTVAQALTKRAVRCAGDRQLQTIVVGGGVAANSGLRQILTAAAAEAGIQVYFPP 301 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 +FCTDN AMIA A F+ G + L + V R P+ ++ Sbjct: 302 LKFCTDNAAMIACAAAEHFQKGDRSRLDLPVASRLPITQV 341 >UniRef50_C5ZWF6 Metal-dependent protease n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZWF6_9HELI Length = 351 Score = 351 bits (901), Expect = 2e-95, Method: Composition-based stats. Identities = 117/334 (35%), Positives = 177/334 (52%), Gaps = 12/334 (3%) Query: 2 RVLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +L IE+SCD++ IAI +K ++ +Q SQ + H+ YGGVVPE+ASR H Sbjct: 19 LILSIESSCDDSSIAITQIKDKKIVFHQKISQEREHSSYGGVVPEIASRLHAEILP---- 74 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L+ + KD+ A+A T PGL L+ G + ++L+FA ++P I V+H++GHL + Sbjct: 75 QILEHTKPYFKDLKAIAVTTEPGLNITLMEGLMMAKTLSFALEIPLISVNHLKGHLYSLF 134 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE FP ALLVSGGHT L+ + ++ ++IDD+ GE+FDK +K+LGL YPGG Sbjct: 135 LEQEAI-FPLGALLVSGGHTMLLEARSFNEINIIAQTIDDSFGESFDKVSKMLGLGYPGG 193 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT-RADIARAF 239 P++ A +G F P P+ R FSFSGLK I+ + DI +F Sbjct: 194 PIVEFQAQKGNDRAFELPLPLKSRKDFAFSFSGLKNAVRLVIQKQEIQSKAFVEDICASF 253 Query: 240 EDAVVDTLMIKCKRALDQTGF-----KRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + ++ L K + ++ K + GG SAN LR ++ + + A Sbjct: 254 QRVAIEHLSKKTQIFFEKNSKSMDSWKYFGVIGGASANLVLRNEIQRICDYYGVTLLLAP 313 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 E+C+DN AMI + + G D + V+PR Sbjct: 314 LEYCSDNAAMIGRVALESYLRGEFGDFNLQVKPR 347 >UniRef50_Q127W3 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Proteobacteria RepID=GCP_POLSJ Length = 347 Score = 350 bits (899), Expect = 3e-95, Method: Composition-based stats. Identities = 179/340 (52%), Positives = 228/340 (67%), Gaps = 8/340 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEK----GLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIE+SCDETG+A+ D LL++ L+SQ+++H YGGVVPELASRDH+R+ + Sbjct: 1 MLVLGIESSCDETGVALVDAGGSEVPRLLSHALFSQIQMHQAYGGVVPELASRDHIRRVL 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 PL + + ++G + +D VAYT GPGL GALLVGA V +LA A P + VHH+EGHL Sbjct: 61 PLTRQVMAQAGRSLAQVDVVAYTRGPGLAGALLVGAGVACALAAALGKPVMGVHHLEGHL 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L+P L +PP FPFVALLVSGGHTQL+ V +G YELLGE+IDDAAGEAFDK+AKL+GL Sbjct: 121 LSPFLSADPPVFPFVALLVSGGHTQLMRVDRVGSYELLGETIDDAAGEAFDKSAKLMGLP 180 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-DQTRADI 235 YPGGP L+ +A QG F PRP+ LDFSF+GLKT + G + + +AD+ Sbjct: 181 YPGGPHLADLARQGDGTAFKLPRPLLHSGDLDFSFAGLKTAVLTQAKKLGPELENRKADL 240 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 A A + A+VD L+ K A+ QTG KRLV+AGGV AN LR++L ++R V Y Sbjct: 241 AAATQAAIVDVLVKKSLAAMAQTGLKRLVVAGGVGANALLRSQLNAACQQRGIRVHYPEL 300 Query: 296 EFCTDNGAMIAYAGMVRFKAGATA---DLGVSVRPRWPLA 332 CTDNGAMIA A +R +AG V+PRW L Sbjct: 301 HLCTDNGAMIALAAGMRLQAGLETLQRGYTFDVKPRWSLT 340 >UniRef50_Q7UM42 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Planctomycetaceae RepID=GCP_RHOBA Length = 358 Score = 350 bits (899), Expect = 4e-95, Method: Composition-based stats. Identities = 135/347 (38%), Positives = 192/347 (55%), Gaps = 20/347 (5%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE++CDET A+ + +L + +Q LH +GGVVPE+A+R H+ + +P+I Sbjct: 9 LLLSIESTCDETAAAVIRRDGTVLGQCIATQETLHEQFGGVVPEIAARAHLERILPVIDT 68 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL ++ + +D+ A+A PGL G+LLVG ++LA AW+ P I ++H+ HL A L Sbjct: 69 ALTQAKVRGEDLTAIAVADRPGLAGSLLVGVVAAKTLALAWNKPLISLNHLHAHLYACQL 128 Query: 122 EDNPPE--FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 + P +P + L+VSGGHT L E LG +IDDAAGEAFDK A +L L +PG Sbjct: 129 IEGAPANIYPAIGLIVSGGHTSLYVCRTAIDLEYLGGTIDDAAGEAFDKVAAMLSLPFPG 188 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI--------RDNGTDDQT 231 G ++K+A+QG + FPR M PG DFSFSGLKT I DQ Sbjct: 189 GIEVAKLASQGNDKAYSFPRSMIHDPGDDFSFSGLKTAVRYAIVGPGRQDFASLDISDQV 248 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTG---------FKRLVMAGGVSANRTLRAKLAEM 282 + D+ +FE AVVD L+ KC+RA+ + RL++ GGV+AN+ LR L Sbjct: 249 KRDVCASFEAAVVDVLVSKCRRAIKRHRNRNNDPQNSINRLIVGGGVAANQRLRRDLQAA 308 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 K E++ A P CTDN M +F+A A L + + P Sbjct: 309 ADKDGFELWIAPPHLCTDNAVM-GAIAWKKFEAEQFASLDLDITPGL 354 >UniRef50_B4U8B7 Metalloendopeptidase, glycoprotease family n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U8B7_HYDS0 Length = 343 Score = 350 bits (898), Expect = 4e-95, Method: Composition-based stats. Identities = 138/338 (40%), Positives = 204/338 (60%), Gaps = 16/338 (4%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 LGIETSCD+T +A+Y ++GL+ N L SQV H Y G+VPEL SR+H + L Sbjct: 8 LWLGIETSCDDTALALYSSKRGLIDNLLSSQVNAHKIYNGIVPELCSREHTKNLYILFYE 67 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L++ + DID +A T PGL+ +LLVGA+ L++A D+P +PVHH+E H+ + L Sbjct: 68 LLEKHKIKPSDIDFLAVTIAPGLILSLLVGASFASGLSYALDIPIVPVHHIEAHIYSVFL 127 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E E+PF+AL+VSGGHT++ V G YEL+G+++DDAAGEAFDK A LLGL YPGGP Sbjct: 128 E-YNVEYPFLALVVSGGHTEIYLVKGFEHYELIGKTLDDAAGEAFDKGAVLLGLQYPGGP 186 Query: 182 LLSK-MAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 + K +++ FP P+ D + FSFSGLKTF D + + +++ Sbjct: 187 AIEKFLSSYENPETIDFPIPIKDDR-IAFSFSGLKTFL-----RENKDKYPKDALVFSYQ 240 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 +A+V+ ++ ++A+ +T RLV+ GGV+AN+ LR KL + E + ++CTD Sbjct: 241 EAIVNHIIRTLQKAIKKTAVNRLVVVGGVAANKRLREKLNAL----DIECYIPSIKYCTD 296 Query: 301 NGAMIAYAGMVRFKAGAT---ADL-GVSVRPRWPLAEL 334 N AM++ G +RF G +DL ++ P L + Sbjct: 297 NAAMVSLVGNMRFLKGKYYKKSDLHKLNPDPSLRLEDF 334 >UniRef50_B6JAE9 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Alphaproteobacteria RepID=GCP_OLICO Length = 357 Score = 348 bits (894), Expect = 1e-94, Method: Composition-based stats. Identities = 144/342 (42%), Positives = 200/342 (58%), Gaps = 11/342 (3%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIET+CDET A+ D +L+N + SQ+ HA +GGVVPE+A+R HV Sbjct: 1 MLVLGIETTCDETAAAVIERQADGSGRILSNIVRSQIAEHAPFGGVVPEIAARAHVEMLD 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ A++E+G+ +D +A AGPGL+G ++VG T +++A D P I V+H+E H Sbjct: 61 VLVDRAMREAGVDFAQLDGIAAAAGPGLIGGVIVGLTTAKAIALVHDTPLIAVNHLEAHA 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L P L P FP+ L SGGHTQ+++V G+G+Y +G ++DDA GEAFDK AK+L L Sbjct: 121 LTPRL-TVPLAFPYCLFLASGGHTQIVAVLGVGEYVRIGTTVDDALGEAFDKVAKMLDLP 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-QTRADI 235 YPGGP + + A +G RF FPRPM R +FS SGLKT N + Q AD+ Sbjct: 180 YPGGPQVERAAREGDPTRFDFPRPMLGRKDANFSLSGLKTAVRNEASRLMPLELQDIADL 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 +F+ AV+D++ + + L + LV AGGV+AN +R L E+ + Sbjct: 240 CASFQAAVLDSIADRIRSGLRLFREQFGTPRALVAAGGVAANVAIRNALQEIAADDEITM 299 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 P+ CTDNGAMIA+AG R G T + + R RW L Sbjct: 300 IVPPPQLCTDNGAMIAWAGAERLALGLTDTMEAAPRARWKLD 341 >UniRef50_C1A601 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=GCP_GEMAT Length = 357 Score = 348 bits (894), Expect = 1e-94, Method: Composition-based stats. Identities = 163/347 (46%), Positives = 208/347 (59%), Gaps = 16/347 (4%) Query: 1 MRVLGIETSCDETGIAIYDD--EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 MRVLGIETSCDET A+ E L + + +H +GGVVPE+ASR H+ VP Sbjct: 1 MRVLGIETSCDETSAAVVSGTPEAMTLESCVILSQDVHRLFGGVVPEIASRQHLIGIVPA 60 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + AAL+E+ ++ DIDAVA T PGLVGALLVG + +SLA ++D P +PVHH+EGHL A Sbjct: 61 VAAALQEAQVSLSDIDAVAVTHAPGLVGALLVGTSFAKSLALSYDKPLVPVHHLEGHLFA 120 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 +LE PF ALLVSGGHT L+ V G+Y LLG++ DDA GEAFDK AKLLGL YP Sbjct: 121 TLLEHPDAAPPFTALLVSGGHTLLLDVPAWGEYRLLGQTRDDAVGEAFDKVAKLLGLPYP 180 Query: 179 GGPLLSKMAAQGT----AGRFVFPRPMTDRPG-------LDFSFSGLKTFAANTIRDNGT 227 GG + ++AA F RPM + D SFSGLKT +RD Sbjct: 181 GGRPIEQLAATAEAPVHKHPHRFARPMLRKSSTPADEDYYDCSFSGLKTAVLYAVRDAER 240 Query: 228 D---DQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 D RA IAR F+DAV+DTL+ K RA Q R+V+ GGV+ N+ L+A + M+ Sbjct: 241 TGTLDDARASIARGFQDAVIDTLVEKVVRAARQHRRSRVVLGGGVACNQALQAAMRNAME 300 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +R+G VF P TDN AMIA AG+ R + G A ++ P+ Sbjct: 301 QRKGHVFAPSPRLATDNAAMIAAAGIFRLQRGEFAAPDMTATASLPI 347 >UniRef50_A0JZ01 Probable O-sialoglycoprotein endopeptidase n=98 Tax=Bacteria RepID=GCP_ARTS2 Length = 356 Score = 346 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 140/351 (39%), Positives = 197/351 (56%), Gaps = 17/351 (4%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIE+SCDETG+ I LL+N + S ++ H +GGV+PE+ASR H+ VP +Q Sbjct: 7 LVLGIESSCDETGVGIVR-GTALLSNTVSSSMEEHVRFGGVIPEIASRAHLDAFVPTLQE 65 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL ++G+ D+DA+A T+GPGL GAL+VG ++LA A P ++H+ H+ +L Sbjct: 66 ALADAGVQLDDVDAIAVTSGPGLAGALMVGVCAAKALAVATGKPLYAINHLVAHVGVGLL 125 Query: 122 EDNPPEFPFV-ALLVSGGHTQLISVTGI-GQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 ++ + ALLVSGGHT+++ + I ELLG +IDDAAGEA+DK A+LLGL YPG Sbjct: 126 QEENTLPEHLGALLVSGGHTEILRIRSITDDVELLGSTIDDAAGEAYDKVARLLGLGYPG 185 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRP-----------GLDFSFSGLKTFAANTIRDNGT- 227 GP + K+A G A FPR +T D+SFSGLKT A + Sbjct: 186 GPAIDKLARTGNAKAIRFPRGLTQPKYMGTADEPGPHRYDWSFSGLKTAVARCVEQFEAR 245 Query: 228 -DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 D+ ADIA AF++AVVD + K A + G L++ GGV+AN LR + + Sbjct: 246 GDEVPVADIAAAFQEAVVDVITSKAVLACTENGITELLLGGGVAANSRLRQLTEQRCRAA 305 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAG-ATADLGVSVRPRWPLAELPA 336 + E CTDNGAM+A G AG + + + P+ + A Sbjct: 306 GIRLTVPPLELCTDNGAMVAALGAQLVMAGIEPSGISFAPDSSMPVTTVSA 356 >UniRef50_C9RIN4 Metalloendopeptidase, glycoprotease family n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RIN4_FIBSS Length = 335 Score = 346 bits (888), Expect = 6e-94, Method: Composition-based stats. Identities = 139/305 (45%), Positives = 186/305 (60%), Gaps = 3/305 (0%) Query: 1 MRVLGIETSCDETGIAIYDDE-KGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M LGIE+SCDET A+ D+ +L+N LYSQ+ HA YGGVVPE+A+R H++K P+ Sbjct: 1 MIWLGIESSCDETACAVLQDDPLKVLSNPLYSQIDEHALYGGVVPEIAARAHLQKIAPIA 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 +AA+KE+G+ KDIDA+AYT GPGL+G LLVGA+ + LA ++PA ++H+EGHL A Sbjct: 61 EAAVKEAGVELKDIDAIAYTTGPGLMGPLLVGASFAKGLARDLNIPAYGMNHLEGHLAAA 120 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 L + E PF+ L VSGGHT+L+ +Y +G + DDAAGEAFDK KL+GL YP Sbjct: 121 WLSNPDIEPPFLTLTVSGGHTELVMEEPGFKYTSIGRTRDDAAGEAFDKCGKLIGLKYPA 180 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD--DQTRADIAR 237 G +S++ FPR + +FSFSGLKT + + Q DI Sbjct: 181 GATISRLGKDHNRKFVEFPRALHTHDSCEFSFSGLKTAVLRYTETHDPEFIQQNLGDICA 240 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 + EDA+VD+L+ K AL +T K LVM GGVSAN LR +L + K+ Sbjct: 241 SLEDAIVDSLVTKTINALKKTKMKTLVMGGGVSANSWLRTRLQDYCDKKGIRFCVPDRSL 300 Query: 298 CTDNG 302 TDNG Sbjct: 301 STDNG 305 >UniRef50_C0QY51 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Brachyspira RepID=GCP_BRAHW Length = 340 Score = 346 bits (888), Expect = 7e-94, Method: Composition-based stats. Identities = 121/331 (36%), Positives = 195/331 (58%), Gaps = 7/331 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGI+TSCD+T AI +D K +L++ L S + H ++ GVVPE+A+R H+ + +I Sbjct: 1 MKILGIDTSCDDTSAAIVEDGKNVLSSVLSSSIDAHKEFQGVVPEIAARKHLEAILYVID 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALK++ T DID A T PGL+G+LLVG +SLAF+ + P + + H+ H+ +P Sbjct: 61 KALKDANTTLDDIDLFAVTNRPGLLGSLLVGVASAKSLAFSLNKPLLALDHIAAHIYSPH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L N EFP++AL+VSGGHT + V G+Y+++G ++DDA GEA+DK +K L L YPGG Sbjct: 121 L-TNDIEFPYIALVVSGGHTIITEVHDYGEYKVVGTTLDDAVGEAYDKVSKFLNLGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDR-PGLDFSFSGLKTFAANTIRDNGTD--DQTRADIAR 237 P++ ++A +G +P + + +FS+SGLKT + + + + T +IA Sbjct: 180 PIIDRLAKEGNKEAIKYPIVLLNGIDEFNFSYSGLKTACVYSTKKYLKEGYEATNENIAA 239 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 AF+ + ++ L IK + +++G KR+ ++GGV+ N LR + + E + ++ Sbjct: 240 AFQISAIEPLYIKTLKYAEKSGIKRVTLSGGVACNSYLRDRF---GNSKDFECYLPALKY 296 Query: 298 CTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 TDN AM+A AD + R Sbjct: 297 TTDNAAMVAGLAYHMKDKQNFADYNLDCFSR 327 >UniRef50_A5GMV4 Probable O-sialoglycoprotein endopeptidase n=17 Tax=cellular organisms RepID=GCP_SYNPW Length = 356 Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 154/342 (45%), Positives = 200/342 (58%), Gaps = 11/342 (3%) Query: 2 RVLGIETSCDETGIAIYD---DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +VL +ETSCDE+ A+ +LA+++ SQV+ HA +GGVVPE+ASR HV L Sbjct: 3 KVLALETSCDESAAAVVQHSAGGLEVLAHRIASQVEEHAQWGGVVPEIASRRHVEALPHL 62 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 I A L E+GL ++DAVA T PGLVGAL+VG+ GR+LA P + VHH+E HL + Sbjct: 63 ISAVLDEAGLAVGEMDAVAATVTPGLVGALMVGSLTGRTLAALHHKPFLGVHHLEAHLAS 122 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 L +PPE P+V LLVSGGHT+LI V + LG S DDAAGEAFDK A+LLGL YP Sbjct: 123 VRLASSPPEAPYVVLLVSGGHTELILVDSDSGLQRLGRSHDDAAGEAFDKVARLLGLAYP 182 Query: 179 GGPLLSKMAAQGTAGRFVFP-----RPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQT 231 GGP + A G RF P RP DFSFSGLKT + +D Sbjct: 183 GGPAIQAAAKAGDPKRFSLPKGRVSRPEGGFYPYDFSFSGLKTAMLRQVESLKAQSDALP 242 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 D+A +FE VVD L+ + R G LVM GGV+AN LR ++ + ++R V Sbjct: 243 LEDLAASFEQIVVDVLVERSLRCCLDRGLSTLVMVGGVAANVRLRVQMEQQGRERGVSVH 302 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGA-TADLGVSVRPRWPLA 332 A +CTDN AM+ A + R +AG ++ + + V RWPL Sbjct: 303 LAPLAYCTDNAAMVGAAALGRLQAGWGSSSIRLGVSARWPLE 344 >UniRef50_B2S3R9 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Treponema RepID=GCP_TREPS Length = 352 Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 133/332 (40%), Positives = 180/332 (54%), Gaps = 8/332 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIETSCDET +AI D + +N + +Q+ HA Y G+VPELASR H+ +P ++ Sbjct: 1 MNVLGIETSCDETAVAIVKDGTHVCSNVVATQIPFHAPYRGIVPELASRKHIEWILPTVK 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + LT DID +A T PGL G+LLVG T ++LA++ +P I V+H+ H A Sbjct: 61 EALARAQLTLADIDGIAVTHAPGLTGSLLVGLTFAKTLAWSMHLPFIAVNHLHAHFCAAH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E +P+V LL SGGH + V Q E LG +IDDA GEAFDK A G YPGG Sbjct: 121 VEH-DLAYPYVGLLASGGHALVCVVHDFDQVEALGATIDDAPGEAFDKVAAFYGFGYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPG--LDFSFSGLKTFAANTIRDNGTDDQTR--ADIA 236 ++ +A QG A FP P G D S+SGLKT + + + R +IA Sbjct: 180 KVIETLAEQGDARAARFPLPHFHGKGHRYDVSYSGLKTAVIHQLDHFWNKEYERTAQNIA 239 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 AF+ ++ L+ RAL TG V+ GGV+AN LR + + + E Sbjct: 240 AAFQACAINILLRPLARALQDTGLPTAVVCGGVAANSLLR---KSVADWKHARCVFPSRE 296 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +CTDN M+A G G + GV+ R R Sbjct: 297 YCTDNAVMVAALGYRYLIRGDRSFYGVTERSR 328 >UniRef50_A9FDL0 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Deltaproteobacteria RepID=GCP_SORC5 Length = 356 Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 163/345 (47%), Positives = 212/345 (61%), Gaps = 14/345 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGIETSCDET A+ + +L++ + SQV LHA YGGVVPE+A+RDH R VP+++ Sbjct: 1 MRVLGIETSCDETAAAVVTEGGDVLSDVVRSQVALHAPYGGVVPEVAARDHARAVVPVVR 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL +G++A D+D +A T+ PGL GALLVG + LA+A P + V H+ GHLLA Sbjct: 61 EALSRAGVSAADLDGIAVTSRPGLAGALLVGLQAAKGLAWAAGKPLVGVDHLVGHLLAVF 120 Query: 121 L---------EDNPPEFPFVALLVSGGHTQLISVTG--IGQYELLGESIDDAAGEAFDKT 169 L E P FP+VALL SGGHT + V G +G LG + DDAAGEAFDK Sbjct: 121 LRRGGAPLSDERERPSFPYVALLASGGHTAIYRVDGPALGAIRELGATRDDAAGEAFDKV 180 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRP--MTDRPGLDFSFSGLKTFAANTIRDNGT 227 AKLLGL YPGGP++ ++AA G A P M + L+FSFSG+K+ A + G Sbjct: 181 AKLLGLGYPGGPVVDRLAAGGDAAAAADAVPALMARKESLEFSFSGIKSSVARHVAKRGR 240 Query: 228 DD-QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 + Q D+ AF+ AVVD L+ K RA G R+V+ GGV+AN+ LRAK+A ++R Sbjct: 241 PEGQALRDLCAAFQGAVVDALVQKTVRAARAEGIGRVVLGGGVAANQGLRAKMAAACERR 300 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +F CTDNGAMIAYAG +R AG L ++ R L Sbjct: 301 GLALFVPPLASCTDNGAMIAYAGALRLAAGERDTLDLAPETRTAL 345 >UniRef50_B2KE20 Metalloendopeptidase, glycoprotease family n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KE20_ELUMP Length = 342 Score = 344 bits (882), Expect = 3e-93, Method: Composition-based stats. Identities = 135/337 (40%), Positives = 199/337 (59%), Gaps = 16/337 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIET+CDET AI + L++N +++Q+ +H Y GVVPELASR H K +++ A Sbjct: 9 ILGIETTCDETSAAILKSGRDLVSNVVHTQIDIHKKYCGVVPELASRAHAVKVAEVVKEA 68 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ID VA+ +GPGL G L+VG +++ +VP I V+H+EGHL A + Sbjct: 69 LGNHK-----IDLVAFASGPGLPGGLMVGRVAAEAVSALKNVPIIGVNHLEGHLFACEFD 123 Query: 123 --------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 D +FP +AL+VSGGHT+L V G Y++LG + DDAAGEAFDK AKLLG Sbjct: 124 AKEGKIAADKQLKFPLIALIVSGGHTELWYVKNYGDYKMLGRTRDDAAGEAFDKVAKLLG 183 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 L YPGGP+++K A +G FPRPM +FSFSG+KT + +RD+ D + D Sbjct: 184 LGYPGGPVVAKEALKGNPEAIKFPRPMMKGT-FEFSFSGIKTAVSYYLRDHK--DIKKED 240 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + +F+ A+V+TL+ K +A+ + K + + GGV+AN L+ + + +K +V + Sbjct: 241 VCASFQAAMVETLVAKTFQAVKKYKVKNVAVGGGVAANELLKESMVKRGQKEGVDVSFVP 300 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 +DNGAMIA AG +F + + + P + Sbjct: 301 RALSSDNGAMIALAGYKKFMFAGKFNANIRINPNMRI 337 >UniRef50_C8WN77 Metalloendopeptidase, glycoprotease family n=3 Tax=Bacteria RepID=C8WN77_EGGLE Length = 891 Score = 343 bits (881), Expect = 4e-93, Method: Composition-based stats. Identities = 135/341 (39%), Positives = 183/341 (53%), Gaps = 9/341 (2%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET AI D L+A+ + SQ+ HA +GGVVPE+ASR H+ + Sbjct: 549 LILAIESSCDETAAAIVDGNGTLIADVVASQIDFHARFGGVVPEIASRKHIEAICGVCDE 608 Query: 62 ALKESG-------LTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 + LT +D+D++A T PGLVGAL+VG + A+A P I V+H+EG Sbjct: 609 CFDVAASALGIERLTWRDLDSIAVTYAPGLVGALVVGVAFAKGAAWAAGKPFIGVNHLEG 668 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HL A + + P V LVSGG+T L+ + G G YE LG +IDDA GEAFDK AK LG Sbjct: 669 HLYANKIGAPDFQPPAVVSLVSGGNTLLVHMKGWGDYETLGATIDDAVGEAFDKVAKALG 728 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT--DDQTR 232 L YPGGP++S+ AA+G FPR M L FS SGLKT I + + Sbjct: 729 LGYPGGPVISREAAKGDPNAIPFPRAMMHSGDLRFSLSGLKTAVVTYINNERAAGRELNV 788 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 +I +F+ AVVD + K + AL+QTG + + GGV+AN LR ++ ++ + Sbjct: 789 PNICASFQQAVVDVQVKKAEMALEQTGARTFCLGGGVAANPALRDAYEQLCERLHVRLTL 848 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 C DN MIA + R G L + L E Sbjct: 849 PPLSACGDNAGMIALVALDRHNQGKFFTLEADAQAHANLDE 889 Score = 93.0 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 55/146 (37%), Gaps = 13/146 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL +T+ + I + L A+ ++ V A R + +P I AA Sbjct: 18 VLAFDTANEIIAIGL----GVLHASSRMIELTAS------VEAEARRASNTQLLPRIDAA 67 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E G+ +DI VA GPG + + + +A A +VP + V ++ Sbjct: 68 LAEHGVAREDIACVAVGRGPGSFTGVRIAMATAKGIASALEVPLVGVSSLDAVAWNAWAA 127 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGI 148 ++++ ++ V + Sbjct: 128 GERGP---LSVVADAMRKEVYPVRYL 150 >UniRef50_Q0ATQ2 Probable O-sialoglycoprotein endopeptidase n=44 Tax=Proteobacteria RepID=GCP_MARMM Length = 377 Score = 343 bits (880), Expect = 6e-93, Method: Composition-based stats. Identities = 146/347 (42%), Positives = 198/347 (57%), Gaps = 14/347 (4%) Query: 3 VLGIETSCDETGIAI----YDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 VLG+E+SCDET AI D +LA+++ Q HA +GGVVPE+A+R H L Sbjct: 17 VLGLESSCDETAAAILRREVDGSVTVLADRVLGQNDAHAPFGGVVPEIAARAHAEAMDGL 76 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 + AL E+GL D+D +A T+GPGL+G ++ + LA P I V+H+EGH L+ Sbjct: 77 VSQALAEAGLAVADLDGIAATSGPGLIGGVMAALMTAKGLALGAGKPLIAVNHLEGHALS 136 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 P + + P FP++ LLVSGGHTQL+ G+G Y LG ++DDAAGEAFDKTAK++GL +P Sbjct: 137 PRISE-PLAFPYLLLLVSGGHTQLLIAEGVGVYHRLGSTMDDAAGEAFDKTAKVMGLGFP 195 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG-TDDQTRADIAR 237 GGP L + A G A RF P P+ +PG DFSF+GLKT A DQ RAD++ Sbjct: 196 GGPALERCAQSGDATRFALPVPLKGKPGCDFSFAGLKTAARQIWDGLDAPSDQDRADLSA 255 Query: 238 AFEDAVVDTLMIKCKRALDQT--------GFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 + A+ L + +RAL LV+AGGV+AN+ +RA L + Sbjct: 256 CVQAAIARALSSRTRRALAMFVDRFPDASRPMALVVAGGVAANKAVRAALEDEAAAAGFR 315 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 + ++CTDN AMIA G+ + G L R RWPL A Sbjct: 316 LVAPPMKWCTDNAAMIALVGLEKLARGQIDGLDAPARARWPLDGAAA 362 >UniRef50_Q254Q0 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Chlamydiaceae RepID=GCP_CHLFF Length = 344 Score = 342 bits (878), Expect = 9e-93, Method: Composition-based stats. Identities = 125/339 (36%), Positives = 186/339 (54%), Gaps = 16/339 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG+E+SCDET A+ D + ++AN + SQ + HA YGGVVPELASR H++ ++ Sbjct: 1 MLTLGLESSCDETACALVDADAQIVANVVSSQ-QYHASYGGVVPELASRAHLQMLPSVVN 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +AL++SG++ DID +A T PGL+G+L VG + LA P I V+H+E HL A Sbjct: 60 SALEKSGVSLDDIDLIAVTHTPGLIGSLAVGVNFAKGLAIGSQKPMIGVNHVEAHLYAAY 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E EFP + L++SG HT + + Y+L+G + DDA GE FDK + LGL YPGG Sbjct: 120 MEAKNVEFPALGLVMSGAHTSMFLMEDPLSYKLIGNTRDDAIGETFDKVGRFLGLPYPGG 179 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD---------DQT 231 L+ MA+QG + F PG D SFSGLKT I+ N ++ + Sbjct: 180 ALIEMMASQGCEESYPFSAAKV--PGYDLSFSGLKTAVLYAIKGNNSNSRSPLPDLSQKE 237 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + +IA +F+ A T+ K + + + +++ GGV+ N+ + L + ++ Sbjct: 238 KNNIAASFQKAAFMTIAQKLPKIIKNFSCRSILVGGGVANNKYFQTLLQNTL---NLPLY 294 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATA-DLGVSVRPRW 329 + + CTDN AMIA G F + T + R RW Sbjct: 295 FPSSKLCTDNAAMIAGLGRELFLSRKTTQGITPCARYRW 333 >UniRef50_A1BJ68 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Chlorobiaceae RepID=GCP_CHLPD Length = 353 Score = 340 bits (872), Expect = 5e-92, Method: Composition-based stats. Identities = 134/350 (38%), Positives = 198/350 (56%), Gaps = 22/350 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETSCDET A+ + +N + SQ+ H +GGVVPELASR+H R V ++ Sbjct: 1 MKILGIETSCDETSAAVL-SNGSVCSNIVSSQL-CHTSFGGVVPELASREHERLIVSIVD 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +AL E+ +T D+D +A TAGPGL+GA++VG G+++A+A +P +PV+H+E H+ + Sbjct: 59 SALSEANITKNDLDVIAATAGPGLIGAVMVGLCFGQAMAYALAIPFVPVNHIEAHIFSAF 118 Query: 121 LEDNP----PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 +++ P PE F++L VSGGHT L V YE++G ++DDAAGEAFDKT K+LGL Sbjct: 119 IQETPHHQAPEGDFISLTVSGGHTLLSHVHKDFTYEVIGRTLDDAAGEAFDKTGKMLGLP 178 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRP--------GLDFSFSGLKTFAANTIRDNGTD 228 YP GP++ ++A G FPR +T DFSFSGLKT ++ + Sbjct: 179 YPAGPVIDRLAKNGDPFFHEFPRALTAHSQTSKNYRGNSDFSFSGLKTSVLTFLKKQSPE 238 Query: 229 DQTR--ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 + DIA + + A+V L+ K A K + +AGGVSAN LR + + ++ Sbjct: 239 FIEKHLPDIAASVQKAIVSVLVEKTVSAALAGNVKAISIAGGVSANSALRTSMKKACEQH 298 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 E+ TDN AMIA + + R R+ +A + Sbjct: 299 GIAFHVPNAEYSTDNAAMIATLAGLLLAH------DLVPRNRYNIAPFAS 342 >UniRef50_A1R8N0 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Bacteria RepID=GCP_ARTAT Length = 368 Score = 339 bits (870), Expect = 9e-92, Method: Composition-based stats. Identities = 137/367 (37%), Positives = 195/367 (53%), Gaps = 34/367 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDETG+ I LL N + S + H +GGV+PE+ASR H+ VP +Q + Sbjct: 1 MLGIESSCDETGVGIVR-GTTLLTNTVSSSMDEHVRFGGVIPEIASRAHLDAFVPTLQES 59 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+G+T +DIDA+A T+GPGL GAL+VG ++LA A P ++H+ H+ +L+ Sbjct: 60 LQEAGVTLEDIDAIAVTSGPGLAGALMVGVCAAKALAVATGKPLYAINHLVAHVGVGLLD 119 Query: 123 DNPPEFP------------------FVALLVSGGHTQLISVTGI-GQYELLGESIDDAAG 163 N ALLVSGGHT+++ + I ELLG +IDDAAG Sbjct: 120 GNRVSEGKHDAVAAAGLGAGKLPENLGALLVSGGHTEILRIRSITDDVELLGSTIDDAAG 179 Query: 164 EAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRP-----------GLDFSFS 212 EA+DK A++LGL YPGGP + K+A QG FPR +T D+SFS Sbjct: 180 EAYDKVARILGLGYPGGPAIDKLAHQGNPKSIRFPRGLTQPKYMGTAEEKGPHRYDWSFS 239 Query: 213 GLKTFAANTIRDNGT--DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVS 270 GLKT A + ++ ADIA AF++AVVD + K A + G +++ GGV+ Sbjct: 240 GLKTAVARCVEQFEARGEEVPVADIAAAFQEAVVDVISSKAVLACKEHGITDVLLGGGVA 299 Query: 271 ANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT-ADLGVSVRPRW 329 AN LR + + CTDNGAM+A G AG + + + + Sbjct: 300 ANSRLRELTGQRCASAGITLHVPPLGLCTDNGAMVAALGAQLIMAGISPSGVSFAPDSSM 359 Query: 330 PLAELPA 336 P+ + Sbjct: 360 PVTTVSV 366 >UniRef50_B5ZLG0 Metalloendopeptidase, glycoprotease family n=11 Tax=Rhodospirillales RepID=B5ZLG0_GLUDA Length = 382 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. Identities = 141/346 (40%), Positives = 195/346 (56%), Gaps = 14/346 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IE+SCD+T AI + +LA + SQ H +GGVVPE+A+R H+ L++ Sbjct: 30 ILAIESSCDDTACAILAPDGTILAETVLSQAG-HVPFGGVVPEIAARAHLAALPALVRHT 88 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L + L A+ + A+A + GPGL+G L+VGA + + LA A P + V+H+E H L L Sbjct: 89 LDVAALPAEALGAIAASTGPGLIGGLIVGAGMAKGLAVALGRPFVAVNHIEAHALTARLP 148 Query: 123 DNPP---EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 P FP++ LLVSGGH Q I+V G+G+Y LG +IDDAAGEAFDK AK+LGL +PG Sbjct: 149 GLVPGGASFPYLLLLVSGGHCQCIAVEGVGRYRKLGGTIDDAAGEAFDKVAKMLGLGWPG 208 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR---ADIA 236 GP + +A +G + PRP+ RPG DFSFSGLKT A + R A IA Sbjct: 209 GPAVEALAREGDPAPWPLPRPLRGRPGCDFSFSGLKTAVAQKLAPFAAGALPRTAAAGIA 268 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVM-AGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 +F+DAV D + + ALD L++ AGGV+AN LR +L + R Sbjct: 269 ASFQDAVADIVADRVAHALDMMPQATLLVAAGGVAANTALRTRLTTLATSRALPFAAPPL 328 Query: 296 EFCTDNGAMIAYAGMVRFKAGAT------ADLGVSVRPRWPLAELP 335 CTDN M+ +A + + DL + RPRWPL ++ Sbjct: 329 RLCTDNAVMVGWAAIETLRERRRLGLPPTDDLDLLPRPRWPLEQMA 374 >UniRef50_UPI0000D561DB PREDICTED: similar to AGAP005215-PA n=1 Tax=Tribolium castaneum RepID=UPI0000D561DB Length = 406 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 116/359 (32%), Positives = 174/359 (48%), Gaps = 27/359 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCD+TG A+ D E +L L+SQ +H GG++P +A H ++ Sbjct: 21 LILGIETSCDDTGCAVVDTEGNILGEALHSQHLIHLANGGIIPPIAQNLHRENIESVVNT 80 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+K S + +D+ AVA T PGL +L +G G+ L ++ P IP+HHME H L + Sbjct: 81 AVKNSNYSFRDLSAVAVTVKPGLPLSLTIGMKYGKYLCRLYNKPFIPIHHMEAHALTARM 140 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 D EFPF+ LL+SGGH L +G++ LLG + DDA GEAFDK A+ + L Sbjct: 141 HDKTVEFPFLVLLISGGHCLLAVAQDVGRFFLLGSTRDDAPGEAFDKVARRMKLTNLSEF 200 Query: 176 -DYPGGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 GG + A++ +F F P+T FS +GLKT + + Sbjct: 201 SKLSGGQAIELAASRAKNPLQFKFTIPLTQYRDCKFSLAGLKTQVRRHLLEEEKKHNVPP 260 Query: 234 D--------IARAFEDAVVDTLMIKCKRALDQTGFK--------RLVMAGGVSANRTLRA 277 D + F+ AV + + +RA+ K LV++GG + N + Sbjct: 261 DGLIPDVFNLCAGFQLAVTRHICQRVQRAMVYARRKEMIPENSQTLVVSGGAACNNFIAR 320 Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA--GATADL-GVSVRPRWPLAE 333 L + + + P+ C DNG MIA+ G+ R++A G D V ++ PL Sbjct: 321 GLQLVCDEMAYKFVRPPPKLCLDNGVMIAWNGVERWRAKLGVLHDYASVEIQKSCPLGT 379 >UniRef50_Q3SVF4 Probable O-sialoglycoprotein endopeptidase n=10 Tax=Rhizobiales RepID=GCP_NITWN Length = 357 Score = 338 bits (866), Expect = 3e-91, Method: Composition-based stats. Identities = 151/342 (44%), Positives = 198/342 (57%), Gaps = 11/342 (3%) Query: 1 MRVLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 M VLGIET+CDET A+ D +L+N + SQ + HA YGGVVPE+A+R HV Sbjct: 1 MLVLGIETTCDETAAAVVERLPDGSARILSNIVRSQTEEHAPYGGVVPEIAARAHVELLD 60 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 LI A+ ESG+ + + VA AGPGL+G ++VG T +++A P V+H+E H Sbjct: 61 GLIARAMTESGVGFRQLSGVAAAAGPGLIGGVIVGLTTAKAIALVHGTPLTAVNHLEAHA 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 L P L EFP+ L SGGHTQ+++V G+G Y LG ++DDA GEAFDK AK+LGL Sbjct: 121 LTPRLTSR-LEFPYCLFLASGGHTQIVAVLGVGNYVRLGTTVDDAMGEAFDKVAKMLGLP 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADI 235 YPGGP + + AA G A RF FPRPM RP +FS SGLKT N + + + +D+ Sbjct: 180 YPGGPEVERAAASGDATRFNFPRPMLGRPDANFSLSGLKTAVRNEAARIDPLEPRDISDL 239 Query: 236 ARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 F+ AV++ + L + LV AGGV+AN+ +RA L + K R + Sbjct: 240 CAGFQAAVLEATADRLGVGLRLFEERFGRPRALVAAGGVAANQAIRASLEGVAAKARTSL 299 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 P CTDNGAMIA+AG R AG T L R RW L Sbjct: 300 IIPPPALCTDNGAMIAWAGAERLAAGLTDSLETPPRARWLLD 341 >UniRef50_Q2SR45 Probable O-sialoglycoprotein endopeptidase n=5 Tax=Mollicutes RepID=GCP_MYCCT Length = 319 Score = 337 bits (865), Expect = 3e-91, Method: Composition-based stats. Identities = 117/317 (36%), Positives = 191/317 (60%), Gaps = 6/317 (1%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L IE+SCDE I+I D+ K +L N + SQ+K H +GGVVPELA+R HV+ +++ Sbjct: 1 MKILAIESSCDEFSISIIDNNK-ILTNIISSQIKDHQVFGGVVPELAARLHVQNFNWVLK 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AAL +S L ++ID +AYT PGL+G+L++G V +++ + P + + H++GH+ Sbjct: 60 AALSQSNLNIEEIDYIAYTKSPGLIGSLIIGKLVAETISLYINKPILALDHIQGHIFGAS 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E+ +P +A++VSGGHTQ+ + ++++G + DDA GE +DK A++LGL YPGG Sbjct: 120 IENEFI-YPVLAMVVSGGHTQIEIINSANDFQIIGSTRDDAIGECYDKVARVLGLSYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARA 238 P+L K+A +G + P + D DFS+SGLKT N I + + + A + Sbjct: 179 PILDKLALKGNKDFYSLPV-LKDDNTYDFSYSGLKTACINLIHNLNQKKQEINLENFAAS 237 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE-VFYARPEF 297 F+ + + K ++A+ + K L +AGGVSAN +R + ++ +K + F + + Sbjct: 238 FQYTATNIIEKKLEKAIKEFKPKTLTVAGGVSANSEIRKIILKLGQKYNIKNTFVPKMSY 297 Query: 298 CTDNGAMIAYAGMVRFK 314 CTDN AMIA + Sbjct: 298 CTDNAAMIAKLAYEKIL 314 >UniRef50_B3MQN2 GF20469 n=4 Tax=Drosophila RepID=B3MQN2_DROAN Length = 416 Score = 334 bits (856), Expect = 3e-90, Method: Composition-based stats. Identities = 124/349 (35%), Positives = 173/349 (49%), Gaps = 27/349 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D + AN L SQ + H YGG++P A H + + Sbjct: 34 VLGIETSCDDTGIAIVDTSGNVKANVLDSQQEFHTRYGGIIPPRAQDLHRARIHSAYERC 93 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+E+ L + + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 94 LEEANLQPEQLAAIAVTTRPGLPLSLLVGVRFARHLARRLKKPLLPVHHMEAHALQARME 153 Query: 123 DN-PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD----- 176 FPF+ LLVSGGH QL V G G+ LLG+++DDA GEAFDK A+ L L Sbjct: 154 HPDAIPFPFLCLLVSGGHCQLAMVHGPGRLTLLGQTLDDAPGEAFDKIARRLRLYILPEY 213 Query: 177 --YPGGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + GG + A + FP P++ + +FSF+G+K + IR ++T Sbjct: 214 RLWNGGRAIEHAARLATDPSAYDFPLPLSQQRNCNFSFAGIKNNSFRAIRKKERMERTPP 273 Query: 234 --------DIARAFEDAVVDTLMIKCKRALDQT----------GFKRLVMAGGVSANRTL 275 D AV LM + +RAL+ G LV++GGV+ N T+ Sbjct: 274 DGIISNYADFCAGLLRAVSRHLMHRTQRALEYCLQPQVRFFGDGQPTLVVSGGVANNDTI 333 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS 324 A + + + F +C+DNG MIA+ G+ + + L Sbjct: 334 FANIQHLAAQYGCRSFRPSKRYCSDNGVMIAWHGVEQLLQDGDSSLRFD 382 >UniRef50_D1B582 Metalloendopeptidase, glycoprotease family n=5 Tax=Campylobacterales RepID=D1B582_SULD5 Length = 334 Score = 332 bits (851), Expect = 1e-89, Method: Composition-based stats. Identities = 120/324 (37%), Positives = 181/324 (55%), Gaps = 9/324 (2%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ IAI ++K LL ++ SQ + HA YGGVVPELA+R H Sbjct: 2 ILSIESSCDDSSIAITRIEDKKLLFHKKISQDEEHAKYGGVVPELAARLHAITLP----K 57 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E+ + + A+A T PGL +L+ G ++ ++L+ A +P + ++H++GH+ + + Sbjct: 58 ILEETQPYFEALKAIAVTNEPGLSVSLVEGVSMAKALSVALHLPLLGINHLKGHICSLFI 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E+ FP LLVSGGHTQL+ V + Q ELL ++DD+ GE+FDK K+LGL YP G Sbjct: 118 EEET-RFPMDVLLVSGGHTQLLHVKSLEQIELLATTMDDSFGESFDKVGKMLGLPYPAGA 176 Query: 182 LLSKMAAQGTAGRFVFPRPM--TDRPGLDFSFSGLKTFAANTIRDNGT-DDQTRADIARA 238 ++ A +G A F F P+ T L FS+SGLK I D+Q DI + Sbjct: 177 IIETYAQKGDAKCFDFTIPLQGTSSSMLAFSYSGLKNQVRLCIEAQERMDEQILCDICAS 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F+ V LM K K+A + + GG SAN LR +L ++ +++ A+ FC Sbjct: 237 FQRVAVAHLMQKIKKAYQARKVEHFGVVGGASANLYLRGELERFCASKKAQLYTAKMAFC 296 Query: 299 TDNGAMIAYAGMVRFKAGATADLG 322 +DN AMI G+ ++ G L Sbjct: 297 SDNAAMIGRCGVEAYQKGVFVSLE 320 >UniRef50_A3EUW9 O-sialoglycoprotein endopeptidase n=3 Tax=Leptospirillum RepID=A3EUW9_9BACT Length = 345 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 122/330 (36%), Positives = 184/330 (55%), Gaps = 4/330 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD+T +A+ D +L +Q++SQ LH YGGVVPE+ASR HV L+++A Sbjct: 2 ILGIETSCDDTSVALVDMTGAILFHQIHSQESLHGTYGGVVPEVASRAHVEVLPSLVRSA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++GL+ + +A T GPGL+G+LL G + + + A+ +P I V H++ HL A + Sbjct: 62 FLDTGLSPSQLQGIAVTRGPGLLGSLLTGISFAKGIGSAFRLPLIGVDHVQAHLRACVDS 121 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + L++SGGHT L + EL+ +++DDAAGEAFDK AKLLGL YPGGP Sbjct: 122 MESLRGKTIGLVISGGHTHLFRIENWPTMELVSQTVDDAAGEAFDKGAKLLGLPYPGGPS 181 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 + K A + T + LDFSFSGLKT + +R +++TR +A + + Sbjct: 182 IQKEAEKNTLPLLPLTKKRIRTENPLDFSFSGLKTAFSLLVRKTELNERTRPLLAASLQH 241 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 A+V+ ++ + ++ + Q L++ GGVSAN LR KL +++ + + DN Sbjct: 242 AIVEHVLDRIEQTVIQESPSHLLVGGGVSANALLRKKLQVFSEQQGMTLHLSPLSLARDN 301 Query: 302 GAMIAYAGMVRFKAGATADL---GVSVRPR 328 MIA G F +G S R Sbjct: 302 ALMIARHGRELFLSGMYTPYPYTSFSPYTR 331 >UniRef50_Q04RH4 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Leptospira RepID=GCP_LEPBJ Length = 338 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 129/334 (38%), Positives = 193/334 (57%), Gaps = 8/334 (2%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +GIETSCDET I I D K LL+ ++SQ+ LH YGG+VPE+ASR H+ K L++ Sbjct: 1 MIGMGIETSCDETSIGIIRDGKELLSLGIFSQIDLHKPYGGIVPEIASRAHLEKINLLLE 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ++E+ + +D+ VA T+ PGL G+L+VGA + R + ++ P +PV H++ H Sbjct: 61 ETMEEAKIRFEDLSYVAVTSSPGLTGSLMVGAQMARCINMVYETPILPVCHLQSHFAVLH 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE P EFP + LL+SGG++ + + G+ EL+G+++DDA GEAFDK A LL L YPGG Sbjct: 121 LEGVPTEFPVLGLLLSGGNSAVYILQEFGRMELVGDTMDDALGEAFDKVAGLLDLPYPGG 180 Query: 181 PLLSKMAAQGTAGRFVFP-RPMTDRP----GLDFSFSGLKTFAANTIRDNGTDDQTRADI 235 P + A + P P+ R + FSFSGLKT + + ++ I Sbjct: 181 PHIEAKANEYIPTPDEKPILPLLLRNLPQGEVSFSFSGLKTAVMVLLEKQK--EVSKEQI 238 Query: 236 ARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARP 295 F+++ D + KRA+ +TG +++ AGGV AN TL+ +L K E+F + Sbjct: 239 CWNFQNSAFDLVERNLKRAVAKTGIRKVFAAGGVLANTTLQKRLEVWAGKNSVELFTPKK 298 Query: 296 E-FCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +CTDNGAM+A G F+ G + +V P Sbjct: 299 KIYCTDNGAMVASLGYHLFRKGYKKGVDFTVNPS 332 >UniRef50_UPI0001C42124 glycoprotease M22 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C42124 Length = 565 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 107/337 (31%), Positives = 174/337 (51%), Gaps = 16/337 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIE + ++TGI I D + +LA + +L+ + GG+ P A+ H + LI Sbjct: 1 MISLGIEGTAEKTGIGIVDSDGNVLA---MAGKQLYPEVGGIHPREAAEHHAKWIPQLIP 57 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A++E+GL KDID ++++ GPGL AL + A+ RSLA + +P + V+H GH+ Sbjct: 58 QAMEEAGLDYKDIDLISFSQGPGLGPALRIVASSARSLALSLGIPIVGVNHCIGHVEIGK 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+ V L VSGG++Q+I+ G+Y + GE++D A G D + GL +PGG Sbjct: 118 LDTGAKNP--VTLYVSGGNSQVIAYES-GRYRIFGETLDIAIGNCLDHFGRETGLGHPGG 174 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ K+A G+ P + G+DFSFSGL + A + DI + + Sbjct: 175 PVVEKLAKDGSY--IDLPYVV---KGMDFSFSGLLSSALRAHENGER----IEDICFSLQ 225 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL T +++ GGVSAN LR + M ++ + + ++ D Sbjct: 226 ETAFAMLVEVTERALAHTEKDEVLLCGGVSANSRLRDMMKIMAEEHYAKFYMPEMKYSGD 285 Query: 301 NGAMIAYAGMVRFKA-GATADLGVSVRPRWPLAELPA 336 NG MIA+ G + + G ++ R+ E+ A Sbjct: 286 NGVMIAWLGQLMYDNFGPLDIKDTAIIQRFRTDEVDA 322 >UniRef50_A4RXP4 Predicted protein n=6 Tax=Eukaryota RepID=A4RXP4_OSTLU Length = 492 Score = 331 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 144/367 (39%), Positives = 201/367 (54%), Gaps = 36/367 (9%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+T A+ + +L + SQ +H +GGVVP LA H +++ A Sbjct: 82 VLGIETSCDDTAAAVVRGDGVVLGEAIASQAAIHGPWGGVVPNLARAAHEEVIDDVVRRA 141 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L E+G++A D+ AVA T GPGL L VG + ++ + +P PVHH+E H L L Sbjct: 142 LTEAGVSAADLSAVAVTCGPGLSMCLRVGVRKAQRMSAEYGIPIAPVHHVEAHALVSRLC 201 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK--LLGLDYP 178 +FPF+ALLVSGGH LI G+G Y +LG ++DDA GEA+DKTA+ L + Sbjct: 202 AGTETVKFPFLALLVSGGHNLLIKARGVGDYTILGTTLDDALGEAYDKTARLLGLPVGGG 261 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR----------DNGTD 228 GGP L K+A +G RF FP P+ R DFS++GLKT A I D Sbjct: 262 GGPALEKLALEGDEKRFKFPVPLRQRKNCDFSYAGLKTAARMAIDAEIGGEDVEWDGVDK 321 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRAL-----DQTGFKRLVMAGGVSANRTLRAKLAEMM 283 QTRADIA +F+ V L + +RAL D +V+AGGV+AN T+R+ L +++ Sbjct: 322 RQTRADIAASFQAKAVKHLEERMRRALTWALEDTPDLSCVVVAGGVAANATVRSTLVKVV 381 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG-----------------ATADLGVSVR 326 ++ + + P++CTDNG M+A+ G R G D+ V++ Sbjct: 382 EETGLPLVFPPPKWCTDNGVMVAWTGCERLALGLAEAPVDAELEAKHAMMDPRDVHVNLL 441 Query: 327 PRWPLAE 333 PRWPL E Sbjct: 442 PRWPLGE 448 >UniRef50_Q30ZN1 Probable O-sialoglycoprotein endopeptidase n=12 Tax=Proteobacteria RepID=GCP_DESDG Length = 367 Score = 331 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 140/356 (39%), Positives = 191/356 (53%), Gaps = 30/356 (8%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR LGIE+SCDET +AI DD + + A + +Q +LHA +GGVVPELASR+H R + Sbjct: 1 MRCLGIESSCDETALAIVDDGRLVDA-VMSTQAELHALFGGVVPELASREHYRLIGRMFD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + + GL +DID ++ GPGL+G+LLVG + LA A + V+H+ HLLA Sbjct: 60 SLMLRCGLGVQDIDVISVARGPGLLGSLLVGVGFAKGLALAGGQRLVGVNHLHAHLLAAG 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 LE FP + +LVSGGHT L + + L+G ++DDAAGEAFDK AK+L L YPGG Sbjct: 120 LEHRLV-FPALGVLVSGGHTHLYRIDSPRNFTLVGRTLDDAAGEAFDKVAKMLNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------------ 228 + + +FPRP TD LDFSFSGLKT + ++ +G Sbjct: 179 RFIDVLGHMADPDDSMFPRPYTDNDNLDFSFSGLKTAVSTWLKAHGGTALAAPPAESELQ 238 Query: 229 ------------DQTRADIARAFEDAVVDTLMIKCKRALDQTG----FKRLVMAGGVSAN 272 + +F AV DTL IK +RAL + G + +V+AGGV+AN Sbjct: 239 AMLQNNVLPSGMPADMPLVCASFNAAVADTLYIKARRALQRLGGRGQIRSVVVAGGVAAN 298 Query: 273 RTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 +R + + + + P CTDNGAMIAY G + G L + PR Sbjct: 299 SRVRTSMQRLAAEEGLHLHLPSPALCTDNGAMIAYTGWLLASEGLHHSLELETMPR 354 >UniRef50_B0D096 Predicted protein n=2 Tax=Agaricales RepID=B0D096_LACBS Length = 379 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. Identities = 126/355 (35%), Positives = 189/355 (53%), Gaps = 20/355 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL E+S D+T A+ K +L+N + Q LH YGG+ P A H R ++ A Sbjct: 19 VLAFESSADDTCAAVVHSSKSILSNVVIKQNNLHEQYGGIYPITAIDAHQRNMPYAVRRA 78 Query: 63 LKESGLTA-KDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 LKE+ + KDI+ +A+T GPG+ G L VG ++LA A + P + VHHM+GH L P+L Sbjct: 79 LKEANVDLVKDINGIAFTRGPGMPGCLSVGMNAAKTLAAALNKPIVGVHHMQGHALTPLL 138 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY--- 177 NPP+FPF++LLVSGGHT L+ T + +++L ++D++ G A D+ +KLL L + Sbjct: 139 TSSNPPKFPFLSLLVSGGHTLLLLATSLDSFQILATTVDESIGRAIDQVSKLLDLKWTSL 198 Query: 178 -PGGPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRD----NGTDDQT 231 PG L A + V P P G L FS+SGL + I N D T Sbjct: 199 GPGDALEKFCAQKVDTDSIVIPLPRVTMAGKLSFSYSGLHSRVERYIETLGGINNIDLPT 258 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRTLRAKLAEMMKKR 286 R IARAF+ + + L K L + +V++GGV++N+ LR +L + + K Sbjct: 259 RMAIARAFQKSAMAQLEDKLLLGLQWCQQKDIPVRHVVLSGGVASNQYLRERLHQCILKA 318 Query: 287 R----GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPAA 337 ++ + P CTDN MI +A M RF A + + RP+W + +L ++ Sbjct: 319 DLALSIDLVFPPPPLCTDNAVMIGWASMHRFLANDFDEYDIESRPKWSIDQLASS 373 >UniRef50_A5UMH5 Putative O-sialoglycoprotein endopeptidase n=5 Tax=Methanobacteriaceae RepID=GCP_METS3 Length = 538 Score = 329 bits (845), Expect = 6e-89, Method: Composition-based stats. Identities = 105/339 (30%), Positives = 173/339 (51%), Gaps = 20/339 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + LGIE + ++TG+ I D + +LA + +L + GG+ P +A+ H LI Sbjct: 4 LICLGIEGTAEKTGVGIVDSDGNILA---MAGEQLFPEKGGIHPRIAAEHHGYWIPKLIP 60 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A+ E+G++ D+D ++++ GPGL AL + AT R+LA + + P I V+H GH+ Sbjct: 61 KAIDEAGISYDDLDLISFSQGPGLGPALRIVATSARTLALSLNKPIIGVNHCIGHVEVGK 120 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+ V L VSGG++Q+IS G+Y + GE++D AAG D + GL +PGG Sbjct: 121 LDTGAVNP--VTLYVSGGNSQVISHES-GRYRIFGETLDIAAGNCLDHFGRETGLGHPGG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ K+A +G+ + G+DFSFSGL + A ++ D+ + + Sbjct: 178 PVIEKLAKKGSYVDLPYVV-----KGMDFSFSGLLSAALREVKKGT----PIEDVCFSLQ 228 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL T +++ GGVSAN LR L M ++ + + C D Sbjct: 229 ETAFSMLVEVTERALSHTQKDEVMLCGGVSANSRLREMLKVMAEEHGAKFCMPEMKLCGD 288 Query: 301 NGAMIAYAGMVRFKAGATADLGVS---VRPRWPLAELPA 336 NG MIA+ G++ L + + R+ E+ A Sbjct: 289 NGVMIAWLGLIM--HNQFGPLDIKDTGIIQRFRTDEVEA 325 >UniRef50_B3RQR7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RQR7_TRIAD Length = 405 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 124/355 (34%), Positives = 181/355 (50%), Gaps = 26/355 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 V+GIETSCD+TG+AI DD+ LL + L SQ +H G G+ P A++ H R ++Q+ Sbjct: 38 VMGIETSCDDTGVAIVDDQGRLLGDALQSQSSIHKPLGWGIHPVTAAQLHERNIHAVVQS 97 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL +S L +DI +A T GPGL +L VG + L + I VHHM H L + Sbjct: 98 ALHKSNLKIEDIHTIATTVGPGLAFSLNVGLDYSKKLLQQHNKRFIAVHHMAAHALTVRM 157 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 + P EFP++ LLVSGGH L V G ++ LG ++DDA GE FDK A+ L L Sbjct: 158 LN-PIEFPYLVLLVSGGHCILAVVNGPCEFYRLGSTLDDAPGEVFDKVARTLELHTHPEV 216 Query: 176 -DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-----D 229 D GG + +A G F P M +FSF+G K+ ++ D Sbjct: 217 GDIAGGRAIEIVAKLGDEKAFKLPHIMAGVRNCNFSFAGFKSAVNAHLKRVSFASLSDWD 276 Query: 230 QTR----ADIARAFEDAVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLRAKLA 280 Q + A++A +F+ + + + +RAL + LV++GGV+ N +R +L Sbjct: 277 QQKMTIAANMAASFQYYLTWHIAKRVRRALVFCKTFNPKCRTLVISGGVACNNYIRNELD 336 Query: 281 EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL---GVSVRPRWPLA 332 + ++ P CTDNG MIA+AG+ K+ L V +P+WPL Sbjct: 337 KCATAFGFQLACPPPYLCTDNGIMIAWAGVEHLKSNTATILNPQSVIYQPKWPLG 391 >UniRef50_B8LEI0 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8LEI0_THAPS Length = 342 Score = 328 bits (842), Expect = 1e-88, Method: Composition-based stats. Identities = 123/338 (36%), Positives = 180/338 (53%), Gaps = 21/338 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE+SCD+TG A+ + +L L SQ +H +GGV P LA H + +I A Sbjct: 5 VLGIESSCDDTGAAVLRSDGLILGESLASQHAIHEQFGGVFPGLAKAAHEQNIQTVISTA 64 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ + +T +D+DAV T GPGL L VG GR LA + P + +HH+E H+L + Sbjct: 65 LQNANMTMEDVDAVGVTVGPGLEICLRVGCNWGRELAMEYGKPFVGIHHLEAHILMARIP 124 Query: 123 DNP---PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK--LLGLDY 177 EFPF+ALLVSGGH Q++ GIGQY ++G ++DD+ GEAFDKTA+ L + Sbjct: 125 SEKYDTMEFPFLALLVSGGHCQILKCLGIGQYSIVGGTLDDSLGEAFDKTARLLGLPVGG 184 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD----------NGT 227 GGP + ++A G P P+ R DFS++GLKT Sbjct: 185 GGGPAIEQLAKDGDPKSVKLPIPLQKRKDCDFSYAGLKTAVRLATEKICVERGVESAEEL 244 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRAL----DQTGFKRLVMAGGVSANRTLRAKLAEMM 283 Q +A++A +F+ + I+ RA+ + G L + GGV+AN+ LR++L + Sbjct: 245 PHQDKANVAASFQHTAFRHVEIRLGRAMERVEKEDGISTLAVVGGVAANKELRSRLNALC 304 Query: 284 KKRR--GEVFYARPEFCTDNGAMIAYAGMVRFKAGATA 319 R ++ P CTD GAM A+A + R G++ Sbjct: 305 SDRAEPWKMMVPPPRLCTDQGAMSAWAAVERLMVGSSD 342 >UniRef50_Q54EW4 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54EW4_DICDI Length = 468 Score = 327 bits (839), Expect = 3e-88, Method: Composition-based stats. Identities = 126/417 (30%), Positives = 195/417 (46%), Gaps = 87/417 (20%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 V+GIETSCD+T I I + E ++A Q LH + G+VP +A H + I+ Sbjct: 19 VIGIETSCDDTSIGIVNSEGKIMAEYSKPQWSLHKVHNGIVPSIAFEAHQNEIDNAIEKT 78 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ++G+T +DID +A T GPG+ +L VG + L + P V+HMEGH L +E Sbjct: 79 LDKAGMTMEDIDVIAVTTGPGMGKSLEVGLNKAKQLYREFKKPFCSVNHMEGHSLVVRME 138 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY----- 177 ++ EFPF+ +LVSGGH+Q++ + +Y+L+G ++DD+ GEA DK A++LG Y Sbjct: 139 NHSIEFPFLIVLVSGGHSQILICNDVSKYQLIGNTLDDSIGEALDKAARILGCPYGQVWD 198 Query: 178 --------PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR------ 223 GG + +A++G F PM D DFSFSG+K+ A ++ Sbjct: 199 GQSLIENIHGGQAIEILASKGDPNSHHFTLPMKDSNNCDFSFSGIKSSLARLVKEIKSKS 258 Query: 224 ----------------------------------DNGTDDQTRADIARAFEDAVVDTLMI 249 +N + ++A +F++ + L Sbjct: 259 SSSSSITNNTTTKTTTTTTTTTIITTETNNLITDENELSFVDKCNLAASFQNVAFNHLEH 318 Query: 250 KCKRALDQT--------------------------------GFKRLVMAGGVSANRTLRA 277 + K++LD K +V++GGVS N LR Sbjct: 319 RIKKSLDWYYNFKTPKQKKNELLASKTKSGKPPAIEIIKREPLKGIVVSGGVSKNNNLRK 378 Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL--GVSVRPRWPLA 332 ++ ++ K+ +++ RPE C DNG MIA+AG+ FK G T D V P WPL Sbjct: 379 RIDDIGKRYNLPIYFPRPELCNDNGTMIAWAGVEMFKKGMTVDDPEKVIYLPVWPLD 435 >UniRef50_Q1IUF1 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Acidobacteria RepID=GCP_ACIBL Length = 381 Score = 326 bits (836), Expect = 7e-88, Method: Composition-based stats. Identities = 134/375 (35%), Positives = 193/375 (51%), Gaps = 46/375 (12%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCDET A+ + +L++ ++SQ+ H YGGVVPELASR+H++ VP+++ A Sbjct: 6 ILGIESSCDETAAAVIRNGAEILSSVVFSQIYTHMRYGGVVPELASREHLKAIVPVVRQA 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++++G + IDA+A T GPGL GALLVG + ++L+FA D P I V+H+EGH+ +LE Sbjct: 66 VEDAGQSYDKIDAIAVTRGPGLAGALLVGVSYAKALSFALDKPLIGVNHLEGHIHVVLLE 125 Query: 123 DNP-----PEFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKTAKLLG 174 +FP +AL+VSGGHT L Y +G + DDAAGEA+DK AKLLG Sbjct: 126 QKQQGVGEIQFPVLALVVSGGHTHLYLAEKKDAGWTYRDVGHTRDDAAGEAYDKVAKLLG 185 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPG-------------LDFSFSGLKTFAANT 221 L YPGGP+L +A G FP +DFS+SG+KT Sbjct: 186 LGYPGGPILDGLAKHGDPRAVRFPFAQIKHRDRNPQNRHEDDDARVDFSYSGIKTAVLRY 245 Query: 222 IRDNGT-----------DDQTRA--------------DIARAFEDAVVDTLMIKCKRALD 256 + + + + D+ +F+ AVV+ L+ K A Sbjct: 246 VETHEMKAAIEARRTALKEIEKPSQDDYLRVCDRQTLDLIASFQRAVVNDLVSKALHAAA 305 Query: 257 QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 + L++ GGV+AN LR + V++ TDN AMIA A RF +G Sbjct: 306 ENNAATLLVTGGVAANSELRETFERRAGELGLPVYFPSRPLSTDNAAMIAAAAYPRFLSG 365 Query: 317 ATADLGVSVRPRWPL 331 A +S L Sbjct: 366 EFAAPDLSAEANLRL 380 >UniRef50_Q29HY2 GA12844 n=3 Tax=Sophophora RepID=Q29HY2_DROPS Length = 427 Score = 326 bits (836), Expect = 8e-88, Method: Composition-based stats. Identities = 121/351 (34%), Positives = 172/351 (49%), Gaps = 27/351 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D + + +N LYSQ + H YGG++P A H + Sbjct: 30 VLGIETSCDDTGIAIVDTDGRVHSNVLYSQQEFHTRYGGIIPPRAQDLHRARIEDAYNRC 89 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ L + + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 90 LVEADLRPEQLTAIAVTNRPGLPLSLLVGLRFARHLARRLQKPLLPVHHMEAHALQARME 149 Query: 123 D-NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 + + FPF+ LL+SGGH QL V G G+ LLG+++DDA GEAFDK A+ L L Sbjct: 150 NISAISFPFLCLLISGGHCQLALVRGPGRLTLLGQTLDDAPGEAFDKIARRLRLYVLPQY 209 Query: 176 -DYPGGPLLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + GG + A + FP P+ + +FSF+G+K + IR +QT Sbjct: 210 RAWNGGQAIEHAAQSAVCPDAYDFPLPLAQQRNCNFSFAGIKNNSFRAIRARERLEQTPP 269 Query: 234 --------DIARAFEDAVVDTLMIKCKRALDQT----------GFKRLVMAGGVSANRTL 275 D AV LM + +RAL+ LV++GGV+ N + Sbjct: 270 DGIISNYSDFCAGLLQAVSRHLMHRTQRALEYCLRPENGLFGDASPTLVVSGGVANNDVI 329 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVR 326 + + + + +C+DNG MIA+ G+ + A + L Sbjct: 330 YRNIEHLAGQYNCRSYRPFKRYCSDNGVMIAWHGIEQLLANSAQHLRFDYH 380 >UniRef50_B8PI87 Predicted protein n=2 Tax=Postia placenta Mad-698-R RepID=B8PI87_POSPM Length = 691 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 118/368 (32%), Positives = 182/368 (49%), Gaps = 33/368 (8%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IE+S D+T A+ ++ +L+N + Q H YGG+ P +A H + +Q A Sbjct: 318 VLAIESSADDTCAAVVTSDRQILSNVVVRQDSFHESYGGIHPYIAIEAHQQNMPGAVQKA 377 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ +G++A D+D +A+T GPG+ G L VG+ ++LA A + P + VHHM+ H L P L Sbjct: 378 LQVAGMSATDVDGIAFTRGPGIGGCLSVGSNAAKTLAAALNKPLVGVHHMQAHALTPFLT 437 Query: 123 DNP---PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP- 178 P +PF+ LLVSGGHT L+ T + +L ++D++ G AFDK +++L L + Sbjct: 438 TPANSLPTYPFLTLLVSGGHTLLLLATSPRAFRVLATTLDESIGRAFDKVSRMLALPWSA 497 Query: 179 --GGPLLSKMAAQ---------------GTAGRFVFPRPMTDRPGLDFSFSGLKTFAANT 221 G L + P+ R L FS++GL + Sbjct: 498 HGPGAALEQFCRDGPAGGTGAPGGEEIGSGEPAEAPHIPLPMRGRLAFSYTGLHSSVERF 557 Query: 222 IRDNGT--DDQTRADIARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRT 274 + G D +T+ IA F+ V L K L + +V++GGV++N Sbjct: 558 LHARGGVVDARTKHAIATTFQKNAVGQLEEKLALGLQLCRRKGIQIRHVVVSGGVASNSY 617 Query: 275 LRAKLAEMMKK----RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWP 330 LR +L + + + + P CTDN MIA+A M RF AG T D V +R +W Sbjct: 618 LRERLRICLDEASPDEHIALIFPPPSLCTDNAVMIAWASMHRFLAGDTDDYTVELRRKWS 677 Query: 331 LAEL-PAA 337 + EL P+A Sbjct: 678 IEELDPSA 685 >UniRef50_A7H0K1 Probable O-sialoglycoprotein endopeptidase n=26 Tax=Epsilonproteobacteria RepID=GCP_CAMC5 Length = 339 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 126/340 (37%), Positives = 183/340 (53%), Gaps = 19/340 (5%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCD++ +A+ D LL ++ SQ H+ +GGVVPELA+R H R A Sbjct: 2 ILGIESSCDDSSVALLDIKNLKLLYHKKISQESEHSPFGGVVPELAARLHTRALP----A 57 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E KDI A+A T PGL +L+ G ++ ++L+ A +VP I V+H+ GH+ + L Sbjct: 58 LLEEIKPKFKDIKAIAVTNEPGLSVSLIGGVSMAKALSVALNVPLIAVNHLVGHIYSLFL 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 D FP LLVSGGHT ++ + G+ LL + DD+ GE+FDK AK++ L YPGG Sbjct: 118 -DCEARFPLGVLLVSGGHTMVLDIDAAGKISLLAGTSDDSFGESFDKVAKMMQLGYPGGA 176 Query: 182 LLSKMAAQG-TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-----------NGTDD 229 + +A Q RF F P L++SFSGLK I + Sbjct: 177 AVQNLAWQCKDKRRFKFTIPFLHDKRLEYSFSGLKNQVRLEIEKIKGQNLAGATDRELSN 236 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 ADI AFE+A + +M K + + FKR + GG SAN LR+++ + + E Sbjct: 237 DDMADICYAFENAACEHIMDKLTKIFKERSFKRFGIVGGASANLNLRSRIERLCLENGCE 296 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATA-DLGVSVRPR 328 + A EFC+DN AMIA AG ++ G +++ PR Sbjct: 297 LLLAPLEFCSDNAAMIARAGREKYLKGGFVKHNELNINPR 336 >UniRef50_Q17CG3 O-sialoglycoprotein endopeptidase n=2 Tax=Culicini RepID=Q17CG3_AEDAE Length = 400 Score = 323 bits (829), Expect = 4e-87, Method: Composition-based stats. Identities = 110/345 (31%), Positives = 167/345 (48%), Gaps = 26/345 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIETSCD++G AI +L + ++SQ H +GG++P +A H ++Q Sbjct: 28 ILGIETSCDDSGAAIVSGNGTVLGDCIHSQQNSHLKFGGIIPPVAQDFHRLNIDNVVQET 87 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + S + +DA+A T PGL +L+VG + LA + P IP+HHME H L + Sbjct: 88 FRRSDIDCSQLDAIAVTNRPGLPLSLIVGLRYAKYLARKYRKPIIPIHHMEAHALMARMT 147 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD------ 176 + P FPF+ +L+SGGH+ L V Q+ LLGE++DDA GEAFDK A+ L L Sbjct: 148 NKVP-FPFLCILISGGHSLLTLVKSTSQFYLLGETLDDAPGEAFDKIARRLKLRNLPEYA 206 Query: 177 -YPGGPLLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------ 228 GG + + A ++ FP P++ FSF+GLK A I + Sbjct: 207 WLSGGRSIEQAAMSSDNPRKYDFPLPLSHYRDCQFSFAGLKNTATRHILQQERELDLDPD 266 Query: 229 --DQTRADIARAFEDAVVDTLMIKCKRALDQT---------GFKRLVMAGGVSANRTLRA 277 D+ F +A + + +RA+ K LV++GGV+ N + Sbjct: 267 AVLPDYQDLCAGFLNAAARHISQRTQRAIRFCEKEKLIGSDDAKFLVISGGVACNDAIFN 326 Query: 278 KLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG 322 ++ M K + CTDNG MIA+ G+ +F G + Sbjct: 327 TVSNMAKGFGYTTVRPERQHCTDNGIMIAWNGVEKFLVGEDVTMD 371 >UniRef50_Q9VWD6 Probable O-sialoglycoprotein endopeptidase 2 n=6 Tax=Diptera RepID=OSGP2_DROME Length = 409 Score = 323 bits (829), Expect = 4e-87, Method: Composition-based stats. Identities = 116/352 (32%), Positives = 166/352 (47%), Gaps = 27/352 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TGIAI D ++AN L SQ + H YGG++P A H + Q Sbjct: 27 VLGIETSCDDTGIAIVDTTGRVIANVLESQQEFHTRYGGIIPPRAQDLHRARIESAYQRC 86 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ + L + A+A T PGL +LLVG R LA P +PVHHME H L +E Sbjct: 87 MEAAQLKPDQLTAIAVTTRPGLPLSLLVGVRFARHLARRLQKPLLPVHHMEAHALQARME 146 Query: 123 DN-PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD----- 176 +PF+ LL SGGH QL+ G G+ LLG+++DDA GEAFDK + L L Sbjct: 147 HPEQIGYPFLCLLASGGHCQLVVANGPGRLTLLGQTLDDAPGEAFDKIGRRLRLHILPEY 206 Query: 177 --YPGGPLLSKMAA-QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + GG + A + FP P+ + +FSF+G+K + IR ++T Sbjct: 207 RLWNGGRAIEHAAQLASDPLAYEFPLPLAQQRNCNFSFAGIKNNSFRAIRARERAERTPP 266 Query: 234 --------DIARAFEDAVVDTLMIKCKRALDQT----------GFKRLVMAGGVSANRTL 275 D +V LM + +RA++ LVM+GGV+ N + Sbjct: 267 DGVISNYGDFCAGLLRSVSRHLMHRTQRAIEYCLLPHRQLFGDTPPTLVMSGGVANNDAI 326 Query: 276 RAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 A + + + F +C+DNG MIA+ G+ + A Sbjct: 327 YANIEHLAAQYGCRSFRPSKRYCSDNGVMIAWHGVEQLLQDKEASTRYDYDS 378 >UniRef50_Q4A734 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma synoviae 53 RepID=GCP_MYCS5 Length = 307 Score = 322 bits (827), Expect = 8e-87, Method: Composition-based stats. Identities = 112/316 (35%), Positives = 183/316 (57%), Gaps = 10/316 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGIETS D++ IAI +D K +L SQ+ + YGG +PE+ASR+HV+ ++Q Sbjct: 1 MIILGIETSHDDSSIAILEDGK-VLNMWSISQIDIFKKYGGTIPEIASREHVKNIA-ILQ 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L+E + ID +AYT+ PGL+G L VG +L+ A + P I ++H++GH + Sbjct: 59 NFLQEF-IDLNKIDHIAYTSEPGLIGCLQVGFLFASALSIALNKPLIKINHLDGHFFSGA 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +++ ++P + L+VSGGH+Q+I ++++GE++DDA GE +DK + L L +PGG Sbjct: 118 IDNKEIKYPALGLIVSGGHSQIIYAKNKFDFQIVGETLDDAIGECYDKVSSRLNLGFPGG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ K+ A +P T DFSFSG+KT N + ++ IA +F+ Sbjct: 178 PIIDKIHASYKGKYLKLTKPKT-SGEFDFSFSGIKTQVLNAFNN--KKYESIEQIAASFQ 234 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + ++ L+ K K A+D+ + +++ GGVSAN+ LR K ++ + ++ TD Sbjct: 235 EVAINYLIEKFKLAIDKFKPESILLGGGVSANKYLREKFKDL----HKNTIFPEIKYATD 290 Query: 301 NGAMIAYAGMVRFKAG 316 NGAMIA +R K Sbjct: 291 NGAMIAMCAYLRMKKN 306 >UniRef50_B0B9U7 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Chlamydia trachomatis RepID=GCP_CHLT2 Length = 338 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 120/338 (35%), Positives = 180/338 (53%), Gaps = 16/338 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LG+E+SCDET ++ + K +LAN++ SQ +HA YGGV+PELASR H++ L+ Sbjct: 1 MLTLGLESSCDETSCSLVQNGK-ILANKIASQ-DIHASYGGVIPELASRAHLQTFPELLT 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AA + +G++ +DI+ ++ PGL+GAL +G + LA P I V+H+E HL A Sbjct: 59 AATQSAGVSLEDIELISVANTPGLIGALSIGVNFAKGLASGLKRPLIGVNHVEAHLYAAC 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E +FP + L +SG HT L + + L+G++ DDA GE FDK A+ LGL YPGG Sbjct: 119 MEAPATQFPALGLAISGAHTSLFLMPDATTFLLIGKTRDDAIGETFDKVARFLGLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN---------GTDDQT 231 L ++A +G A F F G DFSFSGLKT ++ N + Sbjct: 179 QKLEELAREGDADAFAFSPARVS--GYDFSFSGLKTAVLYALKGNNSSAKAPFPEVSETQ 236 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + +IA +F+ AV T+ K + + L++ GGV+ N R L ++ ++ Sbjct: 237 KRNIAASFQKAVFMTIAQKLPDIVKAFSCESLIVGGGVANNSYFRRLLNQICS---LPIY 293 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 + + C+DN AMIA G F V R+ Sbjct: 294 FPSSQLCSDNAAMIAGLGERLFCNRTHVSKEVIPCARY 331 >UniRef50_UPI0000F51796 O-sialoglycoprotein endopeptidase/protein kinase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0000F51796 Length = 531 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 101/337 (29%), Positives = 172/337 (51%), Gaps = 17/337 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLG+E + I DD + +++N + + GG+ P A+ H +P+++ Sbjct: 1 MKVLGLEGTAHTISAGIVDDNR-IISNFSSTYI---PKNGGIHPREAAIHHADNILPVMK 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +ESGL+ I+ VA++ GPGL L V AT R+ + + +P I V+H GH+ Sbjct: 57 KAFEESGLSPGQINLVAFSMGPGLGPCLRVVATAARAFSIKYGIPLIGVNHPLGHVEIGR 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + L +SGG+TQ+I Y++LGE++D G DK A+ +G+ +PGG Sbjct: 117 KLSGAKDP--IMLYISGGNTQII-AHEENSYKVLGETMDIGLGNLLDKLARDVGIPFPGG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P + + A +G P + G+D SFSG+ T A N I ++ +I + + Sbjct: 174 PKIEEFALKGD-KLLDLPYSV---KGMDTSFSGIYTAARNYIGR-----ESIENICYSVQ 224 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL T + +++AGGV+ N LR+ ++ M K + ++C D Sbjct: 225 ETTFSMLVEVLERALYYTDKREILLAGGVARNDRLRSMVSHMAKSSGYVAYLTDKKYCMD 284 Query: 301 NGAMIAYAGMVRFKAGAT-ADLGVSVRPRWPLAELPA 336 NGAMIA AGM+ + +G + V + + E+ Sbjct: 285 NGAMIAQAGMLMYLSGQRQHIMDTKVNQSFRIDEVKV 321 >UniRef50_Q9H4B0 Probable O-sialoglycoprotein endopeptidase 2 n=31 Tax=Bilateria RepID=OSGP2_HUMAN Length = 414 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 119/357 (33%), Positives = 172/357 (48%), Gaps = 27/357 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIETSCD+T A+ D+ +L ++SQ ++H GG+VP A + H ++Q Sbjct: 38 IVLGIETSCDDTAAAVVDETGNVLGEAIHSQTEVHLKTGGIVPPAAQQLHRENIQRIVQE 97 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL SG++ D+ A+A T PGL +L VG + L P IP+HHME H L L Sbjct: 98 ALSASGVSPSDLSAIATTIKPGLALSLGVGLSFSLQLVGQLKKPFIPIHHMEAHALTIRL 157 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 N EFPF+ LL+SGGH L V G+ + LLG+S+D A G+ DK A+ L L Sbjct: 158 -TNKVEFPFLVLLISGGHCLLALVQGVSDFLLLGKSLDIAPGDMLDKVARRLSLIKHPEC 216 Query: 176 -DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ---- 230 GG + +A QG F P+ DFSF+GL+ I ++ Sbjct: 217 STMSGGKAIEHLAKQGNRFHFDIKPPLHHAKNCDFSFTGLQHVTDKIIMKKEKEEGIEKG 276 Query: 231 ----TRADIARAFEDAVVDTLMIKCKRALDQTGFKR--------LVMAGGVSANRTLRAK 278 + ADIA + + L+ + RA+ + LV +GGV++N +R Sbjct: 277 QILSSAADIAATVQHTMACHLVKRTHRAILFCKQRDLLPQNNAVLVASGGVASNFYIRRA 336 Query: 279 LAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS---VRPRWPLA 332 L + + + P CTDNG MIA+ G+ R +AG + P+ PL Sbjct: 337 LEILTNATQCTLLCPPPRLCTDNGIMIAWNGIERLRAGLGILHDIEGIRYEPKCPLG 393 >UniRef50_Q6L243 Putative O-sialoglycoprotein endopeptidase n=3 Tax=Thermoplasmatales RepID=GCP_PICTO Length = 529 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 106/337 (31%), Positives = 177/337 (52%), Gaps = 16/337 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLG+E + I D EK +L+N + V H GG+ P A+ H K +I+ Sbjct: 1 MIVLGLEGTAHTISAGIVD-EKSILSNVSSTYVPEH---GGIHPREAAVHHADKIYDVIK 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + +GL +D+D +A++ GPGL L V +T R+L+ + P + V+H GH+ Sbjct: 57 RSFDNAGLKPEDLDLIAFSMGPGLGPCLRVVSTAARALSIKYSKPLLGVNHPLGHVEIGR 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + L +SGG+TQ+I+ G+Y +LGE++D G DK A+ LG+ +PGG Sbjct: 117 KLSGARDP--IMLYISGGNTQVIAHLN-GRYRVLGETMDIGLGNMLDKFARDLGIPFPGG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P++ +MA G + P + G+D SFSG+ T A + + + DI + + Sbjct: 174 PVIERMALDGKD---LLELPYSV-KGMDTSFSGIYTAAKRYLS----LGKNKNDICYSLQ 225 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + ++ +RA+ T +++AGGV+ N LR+ + +M + + + E+C D Sbjct: 226 ETSFSMVVEVLERAMYYTNKNEILLAGGVARNDRLRSMVNDMARDSGYKAYLTDKEYCMD 285 Query: 301 NGAMIAYAGMVRFKAGATAD-LGVSVRPRWPLAELPA 336 NGAMIA AGM+ + GA D + + R+ + E+PA Sbjct: 286 NGAMIAQAGMLMYMHGARQDIMETRINQRFRIDEVPA 322 >UniRef50_A6Q6J3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Epsilonproteobacteria RepID=GCP_SULNB Length = 337 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 118/325 (36%), Positives = 180/325 (55%), Gaps = 10/325 (3%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ IA+ + K +L ++ SQ H+ YGGVVPELASR H Sbjct: 2 ILSIESSCDDSSIAVTETSTKKILYHKKISQEAEHSCYGGVVPELASRLHAVALP----K 57 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L+E+ + AVA T PGL LL G + +++A ++P IPVHH++GH+ + + Sbjct: 58 ILEETKPWFDKLKAVAVTNQPGLGVTLLEGIAMAKTVAVLQNIPLIPVHHLKGHIYSLFI 117 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E FP + LL+SGGHTQ+I V E+L S+DD+ GE+FDK AK++ L YPGGP Sbjct: 118 EKKTL-FPLLVLLISGGHTQIIRVKDFEHMEILATSMDDSVGESFDKCAKMMHLGYPGGP 176 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD----NGTDDQTRADIAR 237 L+ +A +G RF P P+ + P + FS SGLK T+ +Q AD++ Sbjct: 177 LIEALALKGDENRFDLPVPLRNSPLIAFSLSGLKNAVRLTVEKLGGAEKMTEQDEADLSA 236 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 +F+ AV L+ K K+ + + + GG SAN+ LR A++ ++ R + A ++ Sbjct: 237 SFQKAVKLHLLQKSKKIFAKEPIRDFAIVGGASANQYLRGAYADLCREFRKTMHVAPLQY 296 Query: 298 CTDNGAMIAYAGMVRFKAGATADLG 322 C+DN AMI + ++ D Sbjct: 297 CSDNAAMIGRYAIDAYEREQFIDPN 321 >UniRef50_D2LQ34 Metalloendopeptidase, glycoprotease family n=1 Tax=Aciduliprofundum boonei T469 RepID=D2LQ34_9EURY Length = 530 Score = 320 bits (820), Expect = 5e-86, Method: Composition-based stats. Identities = 103/335 (30%), Positives = 171/335 (51%), Gaps = 17/335 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M VLGIE + G+ I EK +LAN + GG+ P A+ HV+ L+ Sbjct: 1 MLVLGIEGTAHTVGVGIV-TEKEVLANVSHMYR---PPEGGIHPREAANHHVQYLPKLLN 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A + + + +++D ++++ GPGL L AT R L+ ++P + V+H HL Sbjct: 57 EAFRIANVKPEELDGISFSQGPGLGPCLRTVATAARVLSVKLNIPIVGVNHCIAHLEIGR 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + V L VSGG+TQ+IS G+Y + GE++D G DK A+ +G+ +PGG Sbjct: 117 FSTGAEDP--VMLYVSGGNTQIISFAS-GRYRVFGETLDIGVGNMLDKLAREMGIPFPGG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P + K+A +G P P + G+D +FSG+ T A N + + +++ DIA + + Sbjct: 174 PRIEKLALEGKKY---IPLPYS-IKGMDMAFSGILTAAINKLNN-----ESKEDIAYSVQ 224 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + V L+ +RAL +++AGGV+ N+ L+ L M ++R + + C D Sbjct: 225 ETVFAMLVEATERALTHLRKDEVLLAGGVARNKRLQEMLEIMAEERGARFYVPPADLCVD 284 Query: 301 NGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAEL 334 NGAMIAY G++ K G ++ V ++ + Sbjct: 285 NGAMIAYLGLLFLKNGKRMEIGDTQVIQKFRTDAV 319 >UniRef50_Q0BPC9 Probable O-sialoglycoprotein endopeptidase n=14 Tax=Alphaproteobacteria RepID=GCP_GRABC Length = 370 Score = 318 bits (816), Expect = 2e-85, Method: Composition-based stats. Identities = 153/345 (44%), Positives = 204/345 (59%), Gaps = 15/345 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCDET A+ D +LA + SQ HA +GGVVPE+A+R H+ ++ Sbjct: 14 VLGIETSCDETAAAVLDGSGRILAEIVLSQYDDHARFGGVVPEIAARAHLAYLPGMVTEV 73 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + ++GL +D+ A+A T+GPGL+G LLVGA +G+ LA A P I ++H+E H LA +L Sbjct: 74 MDKAGLRFQDLAAIAATSGPGLIGGLLVGAGLGKGLALAAKRPFIAINHLEAHALAALLP 133 Query: 123 --------DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 + FPF+ +L+SGGH Q I V G+G+Y LG +IDDA GEAFDK KLLG Sbjct: 134 ALGGVAEITSGEHFPFLLMLLSGGHCQCILVEGVGRYRRLGGTIDDAVGEAFDKVGKLLG 193 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR-- 232 L +PGGP L ++A QG FPRPM R G DFSFSGLKT A + Sbjct: 194 LGWPGGPALERLALQGNPHALAFPRPMKGRVGCDFSFSGLKTAVAQYVARFPDGPLPLSD 253 Query: 233 -ADIARAFEDAVVDTLMIKCKRAL----DQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 ADIA +F+ AV D + + AL + K LV++GGV+AN +RA L+ + R Sbjct: 254 AADIAASFQAAVADVMADRATAALAMADEIAPAKMLVVSGGVAANAAIRAALSTAAEHRG 313 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + P CTDN M+A+AG+ R K GA + L + PRWPL Sbjct: 314 IAMLAPPPRLCTDNAVMVAWAGLHRLKYGAVSGLDHAPLPRWPLD 358 >UniRef50_C2KP25 O-sialoglycoprotein endopeptidase n=3 Tax=Mobiluncus RepID=C2KP25_9ACTO Length = 375 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 133/363 (36%), Positives = 193/363 (53%), Gaps = 30/363 (8%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 LGIE++CDETG A+ + L+AN + + + +A YGG++PE+ASR H+ +P++ + Sbjct: 11 LTLGIESTCDETGAALVAGKTKLIANVVATSMDQYARYGGIIPEIASRAHLESFLPVVTS 70 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+++G+ +DID + + GPGL+G+L VG ++LA A P V+H+ GHL L Sbjct: 71 ALEQAGVKLEDIDRIGVSGGPGLIGSLAVGIAGAKALALALGKPLYGVNHVIGHLAVDQL 130 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTG---IGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 + + P V L+VSGGHT L+ + G LG ++DDA+GEAFDK ++LGL Y Sbjct: 131 ASEEMLKLPAVGLVVSGGHTNLLYIEDFAAPGGIRELGGTLDDASGEAFDKVGRILGLPY 190 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMT-----DRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 PGGP + +M+ QGT G FPR ++ DFSFSGLKT A I + R Sbjct: 191 PGGPNVDRMSQQGTLGAIDFPRGLSGAKYAKSHPYDFSFSGLKTAVARYIASLEASPEAR 250 Query: 233 ---------------------ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSA 271 ADI + F +++ D+L+ K +AL TG K LV+ GG SA Sbjct: 251 SHPEFTEDYQATREGKPWLPVADICKGFSESINDSLVSKTLKALQDTGAKTLVVGGGYSA 310 Query: 272 NRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 N LR+ LAE + + +FCTDNGA IA + S L Sbjct: 311 NSRLRSWLAEACPEIGVTLRIPPLKFCTDNGAQIAAITAEIADRHEPSRPDFSPVSALDL 370 Query: 332 AEL 334 + Sbjct: 371 TRV 373 >UniRef50_Q5ZZQ1 Probable O-sialoglycoprotein endopeptidase n=8 Tax=Mycoplasma RepID=GCP_MYCH2 Length = 322 Score = 316 bits (809), Expect = 9e-85, Method: Composition-based stats. Identities = 117/325 (36%), Positives = 175/325 (53%), Gaps = 13/325 (4%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++LGIETS D+ +A++ + K + SQ +LH +GG VPELASR+H R +++ Sbjct: 1 MKILGIETSHDDASVALFSENKVEIL-LTISQFELHEQFGGTVPELASREHSRNLAIILE 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L + + IDA+AYT PGL+G L +G +L+ ++ P IP+ H+ GH + Sbjct: 60 KLLGK-NIDFSTIDAIAYTKNPGLIGPLKIGFLFASALSLFFNKPLIPIDHLLGHFWSAA 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 +E+ EFP ++LL+SGGHTQLI E++G ++DDA GE +DK + LG YPGG Sbjct: 119 IEN-DLEFPVLSLLISGGHTQLIFAENKNNLEIIGSTVDDALGEIYDKIGRSLGCGYPGG 177 Query: 181 PLLSKMAAQG---TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD----DQTRA 233 P + + Q F P LDFSFSGLKT N + + + Sbjct: 178 PKIDLIWQQNNVRNMELIDFSLPKVLENPLDFSFSGLKTQVINYTNNLKENYLFSQKKVV 237 Query: 234 DIARAFEDAVVDTLMIKCKRALD-QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 +IA +F+ V+ L + AL + K + + GGV+AN +R + + + +V Sbjct: 238 EIAVSFQKTVIKYLKRQLDLALKTKKNVKTITLVGGVAANSEIRKLIKT--YENKYKVVI 295 Query: 293 ARPEFCTDNGAMIAYAGMVRFKAGA 317 + EFCTDNGAMIA A + K Sbjct: 296 PKKEFCTDNGAMIAKAAQIFLKFNE 320 >UniRef50_UPI000058820F PREDICTED: hypothetical protein n=2 Tax=Strongylocentrotus purpuratus RepID=UPI000058820F Length = 400 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 120/340 (35%), Positives = 183/340 (53%), Gaps = 28/340 (8%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIET+CD+TG A+ D+ +LA +L++Q ++HA GG++P LA H + P++Q Sbjct: 45 LVLGIETTCDDTGAAVMDETGRVLAERLHTQKRIHAKNGGIIPPLAQALHRQFIDPVVQG 104 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAW-DVPAIPVHHMEGHLLAPM 120 +K++G+ KD+ AVA + PG+ +L VG + + +P IP+HHME H L Sbjct: 105 TIKDAGIEMKDLSAVALSTMPGMPLSLRVGLDYTKDMLLRHPHLPLIPIHHMEAHALTVR 164 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP-- 178 + + +FPF+ LLVSGG+ L G+G +++LG + DDA GEAFDK A+ L L + Sbjct: 165 MVER-VDFPFLVLLVSGGNCILAVARGVGDFKVLGVTWDDAPGEAFDKVARRLKLQHHPD 223 Query: 179 -----GGPLLSKMAAQGT-AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD--- 229 GG + KMA G PM+ +FSF+GLK A I+ + Sbjct: 224 CLGLCGGQAIEKMAENGNFRLLIERGVPMSRHRDCNFSFAGLKNMANWLIQHHEVRQGLT 283 Query: 230 -------QTRADIARAFEDAVVDTLMIKCKRALDQT--------GFKRLVMAGGVSANRT 274 T +DIA +F+ V L+I+ RA+ G + LV++GGV++N Sbjct: 284 ASDDHHLATISDIAASFQHKVTQHLVIRIARAMLYCQQTGLIPEGNQTLVVSGGVASNDY 343 Query: 275 LRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFK 314 +R L + ++ P CTDNG MIA+AG+ R + Sbjct: 344 IRKALDFTTSLFKYKLICPPPYLCTDNGVMIAWAGVERLR 383 >UniRef50_Q46FS9 Putative O-sialoglycoprotein endopeptidase n=17 Tax=root RepID=GCP_METBF Length = 545 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 105/344 (30%), Positives = 165/344 (47%), Gaps = 23/344 (6%) Query: 1 MR---VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVP 57 M+ +LGIE + AI E ++A + GG+ P A++ H + Sbjct: 1 MKNTFILGIEGTAWNLSAAIV-TETEIIAEVTETY---KPTAGGIHPREAAQHHAKYAAS 56 Query: 58 LIQAALKES---GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 +I+ L E+ G+ DID +A++ GPGL L AT R L+ + +P I V+H Sbjct: 57 VIKRLLAEAKEKGVKPSDIDGIAFSQGPGLGPCLRTVATAARMLSISLGIPLIGVNHCIA 116 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 H+ + + V L VSG ++Q+IS G G+Y + GE++D G A DK A+ Sbjct: 117 HIEIGIWRTPAMDP--VVLYVSGANSQVISYMG-GRYRVFGETLDIGLGNALDKFARGAN 173 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 L +PGGP + A T P G+D SFSGL T A+ ++ D Sbjct: 174 LPHPGGPKIEAYAKNATKY---IHLPYV-IKGMDLSFSGLSTAASEALKKAP-----LED 224 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + ++++ ++ +RAL TG K +++AGGV AN LR L +M + R + + Sbjct: 225 VCYSYQETAFAMVVEVAERALAHTGKKEVLLAGGVGANTRLREMLNDMCEARGAKFYVPE 284 Query: 295 PEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPAA 337 F DNG MIAY G++ +K+G T L V P + ++ Sbjct: 285 KRFMGDNGTMIAYTGLLMYKSGNTLSLEDSRVNPSYRTDDVKVT 328 >UniRef50_B6JWU0 Glycoprotease pgp1 n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JWU0_SCHJY Length = 412 Score = 313 bits (802), Expect = 6e-84, Method: Composition-based stats. Identities = 116/359 (32%), Positives = 175/359 (48%), Gaps = 26/359 (7%) Query: 2 RVLGIETSCDETGIAIYDDEK------GLLANQLYSQVKLHADYGGVVPELASRDHVRKT 55 VLGIETSCD+ +A+ ++ +L + + L+ YGG+ P + +H R+ Sbjct: 34 NVLGIETSCDDCSVAVCQYDQSRNEPSKVLLQKTRRTIHLYEKYGGIHPNIVMHEHQRQL 93 Query: 56 VPLIQAALKES-GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 PLIQ+ L E+ L A ID V+ T GPG++G L VG + LA VP I VHHM G Sbjct: 94 APLIQSVLTEAEKLDASIIDIVSVTRGPGMLGPLAVGLNTAKGLAVGLKVPLIGVHHMLG 153 Query: 115 HLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 HLLAP LE +FPF++LLVSGGHT L+ + +E+L ++D A G+ DK A+LL Sbjct: 154 HLLAPKLE-RNIDFPFLSLLVSGGHTMLVYSKSLFDHEILATTLDIAVGDYLDKCARLLR 212 Query: 175 LDYPG---GPLLSKMAAQGTAGRFVFPRPMTDRPGLD---FSFSGLKTFAANTIRDNGTD 228 + + G L + + F P++ FSF+GL+T + G + Sbjct: 213 IPWNGEMPAAALERYSVVSDVTEFPLHVPLSKNAKTRLHCFSFAGLQTQVEKVLTCLGGE 272 Query: 229 ---DQTRADIARAFEDAVVDTLMIKCKRALD---QTGFKRLVMAGGVSANRTLRAKLAEM 282 + + IA A + D + K + ++ V +GGV+ NR LR L M Sbjct: 273 TAPENVKRRIAYAVQSIAFDHICRKVRLCMNDLVDKPISAFVCSGGVARNRYLRNMLVVM 332 Query: 283 MKK------RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 + + + C+DN +MIA A + +K G T+ L + +W L L Sbjct: 333 LSNFETDTSHSIPLVCPSADLCSDNASMIANAAIEMYKHGITSPLTIEPTSKWSLDALS 391 >UniRef50_UPI0001979AA5 putative DNA-binding/iron metalloprotein/AP endonuclease n=1 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI0001979AA5 Length = 380 Score = 313 bits (802), Expect = 7e-84, Method: Composition-based stats. Identities = 107/359 (29%), Positives = 169/359 (47%), Gaps = 40/359 (11%) Query: 3 VLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ +A+ +K L+ + SQ H+ YGG+VPE+ASR H ++ +++ Sbjct: 2 ILSIESSCDDSSLALTRIIDKKLIYHIKISQDSEHSTYGGIVPEIASRLHAKRLPEILKK 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA--- 118 I AVA T PGL L+ G + ++L +P I V+H++GH+ + Sbjct: 62 LKMFLNNDLSLIKAVAVTTRPGLSVTLIEGLMMAKTLCLGLQIPLICVNHLKGHIYSLCI 121 Query: 119 --------PMLEDNPPEFPFV-----------------------ALLVSGGHTQLISVTG 147 P + LLVSGGHTQ++ V Sbjct: 122 SKDFATDSAKDSRKNAPPPLLESHLKSHTESLLESRQNKQDSLGVLLVSGGHTQILQVND 181 Query: 148 IGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGT---AGRFVFPRPMTDR 204 ++ +S+DD+ GE+FDK AK L L YPGGP + + A + FP P+ Sbjct: 182 FHHISIIAQSLDDSFGESFDKVAKHLNLGYPGGPQVERYAKNCEINQYKPYEFPIPLLHN 241 Query: 205 PGLDFSFSGLKTFAANTIRD--NGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKR 262 L+FSFSGLK I++ Q A I++ F++A + ++ K + K Sbjct: 242 KKLEFSFSGLKNAVRLAIQEMEQPLSLQDIASISKGFQNAACEHIVRKTRLFFQHFEGKY 301 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL 321 + GG SAN LR +++E+ + E++ A EFC+DN AMI G+ + L Sbjct: 302 FAIVGGASANTYLRERMSELCNEFDKELYLADLEFCSDNAAMIGRVGVEHYLRDEFTPL 360 >UniRef50_Q17Z01 Probable O-sialoglycoprotein endopeptidase n=13 Tax=Helicobacter RepID=GCP_HELAH Length = 342 Score = 312 bits (801), Expect = 7e-84, Method: Composition-based stats. Identities = 105/332 (31%), Positives = 169/332 (50%), Gaps = 6/332 (1%) Query: 3 VLGIETSCDETGIAIYD-DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCD++ +A+ ++ L+A+ SQ K H+ YGGVVPELASR H L++ Sbjct: 2 ILSIESSCDDSSLALTRIEDAKLIAHFKISQEKHHSSYGGVVPELASRLHAENLPLLLER 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 + A+A T PGL L+ G + ++L+ + ++P I H+ GH+ + + Sbjct: 62 IKISLNKDFSKLKAIAITNQPGLSVTLIEGLMMAKALSLSLNLPLILEDHLRGHVYSLFI 121 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + P LLVSGGH+ ++ +++ S+DD+ GE+FDK +K+L L YPGGP Sbjct: 122 NEKKTCMPLSVLLVSGGHSLILEARNYEDIKIMATSLDDSFGESFDKVSKMLNLGYPGGP 181 Query: 182 LLSKMAAQGTAG--RFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN--GTDDQTRADIAR 237 ++ K+A +FP P+ + L FSFSGLK I N ++ T+ I Sbjct: 182 VIEKLALDYAHKNEPLMFPIPLKNSLNLAFSFSGLKNAVRLEIEKNAPNLNEITKQKIGY 241 Query: 238 AFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEF 297 F+ ++ L+ + KR K + GG S N LR + + ++ A EF Sbjct: 242 HFQSVAIEHLIQQTKRYFKTKRPKIFGIVGGASQNLVLRKAFENLCDEFDCKLVLAPLEF 301 Query: 298 CTDNGAMIAYAGMVRFKAGATADLG-VSVRPR 328 C+DN AMI + + ++ L S+ PR Sbjct: 302 CSDNAAMIGRSSLEAYQKKHFVPLEKASISPR 333 >UniRef50_B3PND6 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Mycoplasma RepID=GCP_MYCA5 Length = 311 Score = 310 bits (794), Expect = 6e-83, Method: Composition-based stats. Identities = 109/319 (34%), Positives = 167/319 (52%), Gaps = 10/319 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M + IE+S D+T A+ DD K + + +Q ++H YGG VPE+ASR HV+ LI+ Sbjct: 1 MIIFAIESSHDDTSFALLDDNKPI-WMKTITQTEIHKQYGGTVPEIASRLHVKNIGILIE 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +S + ID +AYT PGLVG+L VG V +SLA + + ++H+EGH + Sbjct: 60 DI--KSQININKIDLIAYTKEPGLVGSLHVGYVVAQSLALILNKKIVGLNHLEGHFYSAF 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + +P + LLVSGGH+QL+ ++++G++ DDA GE +DK A+ L L +PGG Sbjct: 118 I-GKEVIYPALGLLVSGGHSQLVLYNSKDDFKIIGQTQDDAVGEVYDKVARKLNLGFPGG 176 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD--NGTDDQTRADIARA 238 PL+ ++ DFSFSG+KT N I + + + IA Sbjct: 177 PLIDQIWKNNHKLYTAHLTIPKTEGFFDFSFSGIKTNVINLINNCASRNEQINVNQIATE 236 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F++ +V+ L + A+ + K +V+AGGVSAN +R + VF E+ Sbjct: 237 FQNTIVEYLKEHMETAIKKFSPKCIVLAGGVSANFAIREMFYSL----HKNVFLPDLEYT 292 Query: 299 TDNGAMIAYAGMVRFKAGA 317 TDN MIA +F+ Sbjct: 293 TDNAMMIARLAYEKFRYNN 311 >UniRef50_UPI000180B634 PREDICTED: similar to Probable O-sialoglycoprotein endopeptidase 2 (O-sialoglycoprotein endopeptidase-like protein 1) n=1 Tax=Ciona intestinalis RepID=UPI000180B634 Length = 386 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 114/350 (32%), Positives = 167/350 (47%), Gaps = 21/350 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE++ D+TG AI D + + + +Q K H GGV P +A H +++A Sbjct: 20 VLGIESTFDDTGAAIVDCDATIHGEAIATQTKAHVKAGGVDPRIAELLHRDNLPRVVEAV 79 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+++G+ +D+DAVA PG L G + + + IPVHHME HLL + Sbjct: 80 LQQAGIRYQDLDAVATATRPGNPFCLKRGLEFTKMIVERHSLRFIPVHHMEAHLLTARM- 138 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--GLDYPG- 179 +N FPF+ LL +GGH + +G +++LGE+ID+ G FDK A+ L LD P Sbjct: 139 NNEVNFPFLGLLATGGHCIITITHDLGNHQILGEAIDEPPGAVFDKVARALQVKLDRPDT 198 Query: 180 ------GPLLSKMAAQGTAGRFVFPRPMTDRPG-LDFSFSGLKTFAANTIRDNGTDDQTR 232 G + ++A +G + P+ P LDFSFSGL+T I D Sbjct: 199 HERLWNGGDVERLACEGDRSKVKLTTPLRQSPRVLDFSFSGLQTQTLRVI-DQPEPGVKY 257 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFK------RLVMAGGVSANRTLRAKLAEMMKKR 286 ADIA +F+ + ++ + RA+ + K LV+AGGV N LR L+ + Sbjct: 258 ADIAASFQHTMTQHILSRVHRAILMSRDKLNQESPTLVVAGGVVCNSYLRNALSRLCDIT 317 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGA---TADLGVSVRPRWPLAE 333 + C DNG MIA+ GM K G P+ L E Sbjct: 318 NITIVCPPLPLCVDNGVMIAWTGMEYLKRGKGISPHPYNERYEPKCRLGE 367 >UniRef50_O94710 Glycoprotease pgp1, mitochondrial n=1 Tax=Schizosaccharomyces pombe RepID=PGP1_SCHPO Length = 412 Score = 308 bits (790), Expect = 1e-82, Method: Composition-based stats. Identities = 104/358 (29%), Positives = 172/358 (48%), Gaps = 28/358 (7%) Query: 4 LGIETSCDETGIAIYDD-------EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 L IETSCD+T +++ + ++ + + + YGG+ P + +H + Sbjct: 42 LAIETSCDDTSVSVVRTSDSSSHCQNEIICLNTHRTISKYEAYGGIHPTIVIHEHQKNLA 101 Query: 57 PLIQAALKESGLT-AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 +IQ + ++ + D D +A T GPG++G L VG + LA P + VHHM+ H Sbjct: 102 KVIQRTISDAARSGITDFDLIAVTRGPGMIGPLAVGLNTAKGLAVGLQKPLLAVHHMQAH 161 Query: 116 LLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 LA LE +FP++ +LVSGGHT L+ + +E++ + D A G+ DK AK LG+ Sbjct: 162 ALAVQLE-KSIDFPYLNILVSGGHTMLVYSNSLLNHEIIVTTSDIAVGDYLDKCAKYLGI 220 Query: 176 DY---PGGPLLSKMAA---QGTAGRFVFPRPMTDRPGLD---FSFSGLKTFAANTIRDNG 226 + L + A+ T+ P P+ R + FSFSGL+++A IR Sbjct: 221 PWDNEMPAAALEQFASPEINSTSYSLKPPIPLNTREKVHSASFSFSGLESYACRIIRKTP 280 Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQ---TGFKRLVMAGGVSANRTLRAKLAEMM 283 + + A + A + K AL + + K LV +GGV+ N L+ L + + Sbjct: 281 LNLSEKKFFAYQLQYAAFQHICQKTLLALKRLDLSKVKYLVCSGGVARNELLKKMLNDTL 340 Query: 284 -------KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAEL 334 + ++ Y P+ C+DN AMI Y + FKAG T+ V +WP+ ++ Sbjct: 341 MVLQFEHQPTDIKLVYPSPDICSDNAAMIGYTAIQMFKAGYTSSFDVEPIRKWPINQI 398 >UniRef50_UPI000186D055 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D055 Length = 419 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. Identities = 102/338 (30%), Positives = 165/338 (48%), Gaps = 25/338 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE+SCD+TG ++ +D +L SQ +H + GG++P +AS H ++ +A Sbjct: 27 VLGIESSCDDTGASVVNDSGKVLGESHCSQSVIHVEAGGILPHVASALHKNNLKHVVNSA 86 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + +S L +++D +A T PGL+ +L G ++L ++ P IP+HHME H L + Sbjct: 87 MLQSKLKFENLDVIAVTVKPGLILSLTEGVNYAKNLCTLYNKPLIPIHHMEAHALTVRII 146 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------- 175 D +FPF+ L+SGGH L + ++ LG+S D++ G+ FDK A+ L Sbjct: 147 DE-VKFPFLVFLLSGGHCILALANSVRKFYKLGDSNDNSPGQVFDKIARRAKLINLNELK 205 Query: 176 DYPGGPLLSKMAAQGTAGRFVFP-RPMTDRPGLDFSFSGLKTFAANTIRDNGTD------ 228 GG + K A G F + + +FSFSG T A N I+ + Sbjct: 206 GLVGGAAIEKAAKTGNPTAIPFSQTTLKSQKNCNFSFSGYITSAYNYIQSQEINLNLSPD 265 Query: 229 --DQTRADIARAFEDAVVDTLMIKCKRALDQ--------TGFKRLVMAGGVSANRTLRAK 278 D +F+ ++ L + + A+ KRLV++GGV++N ++ Sbjct: 266 AVIPDINDFCASFQWSLTTHLCQRLEMAIKYVEERKLLNEDEKRLVVSGGVASNSLIKNA 325 Query: 279 LAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 L + ++F P CTDNG MIA+ G+ K Sbjct: 326 LKFVCNHYNYKIFIPPPRLCTDNGVMIAWNGVELLKEN 363 >UniRef50_D2L1E2 Metalloendopeptidase, glycoprotease family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L1E2_9DELT Length = 371 Score = 307 bits (787), Expect = 3e-82, Method: Composition-based stats. Identities = 151/347 (43%), Positives = 197/347 (56%), Gaps = 21/347 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A+ DD + LL +L SQ+KLHA +GGVVPELASR+H+R+ PL+ Sbjct: 1 MLCLGIETSCDETAVALCDDGRPLL-EKLASQIKLHALFGGVVPELASREHLRRMGPLLD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A +E+GL D+DAVA GPGL+G+LL+G V + LA A P I V H+ HLLA Sbjct: 60 ALFREAGLGLADVDAVAVARGPGLLGSLLIGLAVAKGLALAAGKPLIGVDHLHAHLLAAT 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L +P + LLVSGGHTQ++ + +LG ++DDAAGEAFDK AK L L YPGG Sbjct: 120 L-GREVAYPALGLLVSGGHTQIVLLRSPLDLTVLGRTVDDAAGEAFDKAAKSLNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------------ 228 + ++ A R +FPRP + LDFSFSGLKT A + + Sbjct: 179 VFVDRLGAGIEPDRALFPRPNLENTHLDFSFSGLKTAVATHVARHPGLRLAVMPAPDGPV 238 Query: 229 -----DQTRADIARAFEDAVVDTLMIKCKRALDQTGFK--RLVMAGGVSANRTLRAKLAE 281 + + AV DTL +K +RALD ++ AGGV+AN +RA L Sbjct: 239 DAAAWPLDLRRVCSSLNFAVADTLRVKMERALDGLDVPAVSILAAGGVAANSRIRAMLEA 298 Query: 282 MMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +R F P C DN MIA AG + +AG T DL + PR Sbjct: 299 LGARRGLPCFLPEPALCADNATMIAAAGCLLGRAGLTHDLALDAVPR 345 >UniRef50_Q6C9V8 YALI0D07920p n=1 Tax=Yarrowia lipolytica RepID=Q6C9V8_YARLI Length = 376 Score = 306 bits (785), Expect = 5e-82, Method: Composition-based stats. Identities = 120/352 (34%), Positives = 178/352 (50%), Gaps = 22/352 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADY---GGVVPELASRDHVRKTVPL 58 VL IETSCD+T AI ++ L VK+ D GG+ P LA+ H + PL Sbjct: 26 NVLAIETSCDDTCAAIISRDREKNTAALIDHVKITLDSSLQGGINPALATAHHHQSVGPL 85 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 I+ LK+ T ID V T GPGL G L G T + L+ VP + VHHM HLL Sbjct: 86 IRDVLKKHADTT--IDLVCATRGPGLPGCLSSGVTFAKGLSLGLGVPYLGVHHMLAHLLT 143 Query: 119 PMLED-------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 P L + + EFPF++LLVSGGHT L+ + + +L + D A G+A DK A+ Sbjct: 144 PRLFEAAEGYSGHKTEFPFLSLLVSGGHTMLVLSKSLYDHTVLCNTADVAIGDALDKCAR 203 Query: 172 LLGL-DYPGGPLLSKMAAQGT--AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 LG G ++ + + ++ P P+ ++ + +SF+ ++ ++ T Sbjct: 204 TLGFQGNMLGKVMDQYCRSADTPSSQWSIPMPVDNKNDIRYSFAAFHSYIG--MKKKETQ 261 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTGFK-----RLVMAGGVSANRTLRAKLAEMM 283 +T ++A + A+ + LM K K A + + LV +GGV+AN LR L E+ Sbjct: 262 AETTPELALEVQTAIFNHLMKKTKAAFNIYKKEIASATTLVCSGGVAANPRLREALQELC 321 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 K + E + P +CTDN AMI +AG+ + G +DL P+WPLAE Sbjct: 322 AKYKLEAVFPDPYWCTDNAAMIGWAGIELHEDGYRSDLEGFQIPKWPLAEFE 373 >UniRef50_B1ZYF9 Metalloendopeptidase, glycoprotease family n=3 Tax=Verrucomicrobia RepID=B1ZYF9_OPITP Length = 349 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 135/346 (39%), Positives = 200/346 (57%), Gaps = 17/346 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +E+SCDET +A++D +GL+ ++SQ+ LH +GGVVP+LA+R+H+R PL++ A Sbjct: 2 ILALESSCDETAVAVFDPARGLVGEWVHSQIALHERHGGVVPDLATREHLRHFAPLLERA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ + I VA T GPGL L +G ++LA W VP + V+H+ GH+ +P + Sbjct: 62 --QAAVPFDAITQVAVTNGPGLAACLAIGVAAAKALALQWRVPLVGVNHLRGHVWSPFIR 119 Query: 123 DNPPEF-----------PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 + P + L+VSGG+T L +V Q +L + DDAAGEA DK AK Sbjct: 120 LHADAPAEFGDRLAALLPHLGLIVSGGNTLLFAVDRARQVTVLSTTRDDAAGEALDKGAK 179 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-- 229 LLGL YPGGPL+ K+AA G A + FPR + R LDFSFSGLKT I ++ Sbjct: 180 LLGLSYPGGPLIEKLAATGRADAYDFPRGIGRRDELDFSFSGLKTSLRYLIEKLSPEEVV 239 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQ--TGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 R+D+ +++ AVVD L+ K + AL Q ++ L ++GGV+ NRTLRA L ++ + Sbjct: 240 ARRSDLCASYQQAVVDALVRKTRAALRQGEGDYRSLGLSGGVANNRTLRAALEREAQRSQ 299 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 F A+P+ DN MIA+A A + ++V P + E Sbjct: 300 IPFFAAQPQHTGDNAGMIAFAAWADSAGTDAAGMKLTVEPSATIGE 345 >UniRef50_C1F9R2 Metalloendopeptidase, glycoprotease family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F9R2_ACIC5 Length = 401 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 136/387 (35%), Positives = 192/387 (49%), Gaps = 56/387 (14%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCDET A+ + L+N + SQ+ +HA +GGVVPELASR+H+R VP+++ Sbjct: 14 LILGIESSCDETSAAVVRGGREALSNVIASQIAVHAPFGGVVPELASREHLRAIVPVVEQ 73 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ +G+ D+DAVA T GPGL GALLVG + ++LA A P I V+H+EGH+ A +L Sbjct: 74 AMAGAGVAFDDLDAVAVTEGPGLPGALLVGVSYAKALALALGKPLIAVNHLEGHIHAVLL 133 Query: 122 E----------DNPPEFPFVALLVSGGHTQLISVTGIGQ---YELLGESIDDAAGEAFDK 168 E P +AL+VSGGHT L Y +G ++DDAAGEAFDK Sbjct: 134 ERVLQPAETQATPEHGQPKLALVVSGGHTHLYLAQETHHAWTYRNVGRTVDDAAGEAFDK 193 Query: 169 TAKLLGLDYPGGPLLSKMAAQGTAGRFVF--------------PRPMTDRPGLDFSFSGL 214 AKLLGL YPGGP + +A G A F P + FSFSG+ Sbjct: 194 VAKLLGLGYPGGPWVDALAPFGDARAVPFSFAQVKAKAHRRADPVALHPEEATYFSFSGI 253 Query: 215 KTFAANTIRDNGTD-----------------------------DQTRADIARAFEDAVVD 245 KT ++ + + DQ D+ +F+ AVV Sbjct: 254 KTAVLRYVQTHDMEARIAARRQAMATMPDASPRRDLEAVRALCDQESLDLLASFQRAVVG 313 Query: 246 TLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMI 305 L+ K RA ++ ++++GGV+ANR LR + + V + + TDN AMI Sbjct: 314 DLVRKTFRAAERYDVAEILVSGGVAANRELRERFTAEAAAQGLPVAFPSLKLATDNAAMI 373 Query: 306 AYAGMVRFKAGATADLGVSVRPRWPLA 332 A A + A ++ L Sbjct: 374 AAAAWPKLITSEFAGETLTAAAGLKLG 400 >UniRef50_C7M316 Metalloendopeptidase, glycoprotease family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M316_ACIFD Length = 347 Score = 303 bits (776), Expect = 6e-81, Method: Composition-based stats. Identities = 128/296 (43%), Positives = 180/296 (60%), Gaps = 5/296 (1%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VL IETSCD+T +A+ + AN + SQ LHA +GGVVPE+A+R H V +++ A Sbjct: 18 VLAIETSCDDTAVAVV-AGGRVAANVVRSQAALHAPFGGVVPEVAARAHDAAMVEVVEEA 76 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ESG+ A +++A+A T GPGL G+L+VG LA D P I V HMEGHL A +E Sbjct: 77 LAESGIDAHEVEAIAVTKGPGLPGSLVVGVGAALGLAVGLDRPLIGVDHMEGHLYAATIE 136 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 P P ++LLVSGGH++L+ + +Y LLG + DDAAGEAFDK A++LGL +PGGP Sbjct: 137 -GPVALPALSLLVSGGHSELVVIEAPFRYRLLGRTRDDAAGEAFDKVARILGLGFPGGPA 195 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 + A G F R + ++ G D SFSG+KT A + + AD+A +F++A Sbjct: 196 IEAAARDGRPDAIRFARALRNQ-GFDLSFSGIKTEVARYLEGARAAE--VADVAASFQEA 252 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 VVD L+ K +RAL+ + +V+ GGV+AN LR ++AE+ + R C Sbjct: 253 VVDVLVAKLERALESERVETVVIGGGVAANGPLRERVAELARARGVGAHIPARSLC 308 >UniRef50_A6VJ51 Putative O-sialoglycoprotein endopeptidase n=26 Tax=cellular organisms RepID=GCP_METM7 Length = 547 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 95/337 (28%), Positives = 161/337 (47%), Gaps = 17/337 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +G E + ++TG+ I + +L N+ G+ P A+ H V L++ Sbjct: 7 LICIGFEGTAEKTGVGIITSKGEVLFNKTIIY---TPPVQGIHPREAADHHAETFVKLLK 63 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + + ID V+++ GPGL +L V AT R+L+ + + P I V+H H+ Sbjct: 64 EALTVVPI--EKIDLVSFSLGPGLGPSLRVTATTARALSLSINKPIIGVNHCISHVEIGK 121 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 L+ + + + L VSGG+TQ+++ TG +Y ++GE++D A G D+ A+ + +PGG Sbjct: 122 LKTDAVDP--LTLYVSGGNTQVLAYTGK-KYRVIGETLDIAIGNCLDQFARHCNMPHPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 + K A G P T G+D S SGL T A + D+ + + Sbjct: 179 VYVEKYAKDGNKF---MKLPYTV-KGMDISLSGLLTAAMKKY----DSKERIEDVCYSLQ 230 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L +RAL T +++ GGV+AN L+ L M ++ + + EFC D Sbjct: 231 ETSFSMLTEITERALAHTNKAEVMLVGGVAANNRLKEMLDVMCSEQNVDFYVPEREFCGD 290 Query: 301 NGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPA 336 NGAMIA+ G++++ G DL + + Sbjct: 291 NGAMIAWLGILQYLNGKRMDLADTKPISNYRSDMVEV 327 >UniRef50_Q9NPF4 Probable O-sialoglycoprotein endopeptidase n=81 Tax=Eukaryota RepID=OSGEP_HUMAN Length = 335 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 111/339 (32%), Positives = 170/339 (50%), Gaps = 14/339 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG E S ++ G+ + D K +LAN + + G +P +R H + L+Q A Sbjct: 4 VLGFEGSANKIGVGVVRDGK-VLANPRRTY--VTPPGTGFLPGDTARHHRAVILDLLQEA 60 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ESGLT++DID +AYT GPG+ L+ A V R++A W+ P + V+H GH+ L Sbjct: 61 LTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNKPLVGVNHCIGHIEMGRLI 120 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYPGG 180 L VSGG+TQ+I+ +Y + GE+ID A G D+ A++L + D G Sbjct: 121 TGATSP--TVLYVSGGNTQVIAY-SEHRYRIFGETIDIAVGNCLDRFARVLKISNDPSPG 177 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD-NGTDDQTRADIARAF 239 + +MA +G P + G+D SFSG+ +F + T + T D+ + Sbjct: 178 YNIEQMAKRGK-KLVELPYTV---KGMDVSFSGILSFIEDVAHRMLATGECTPEDLCFSL 233 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 ++ V L+ +RA+ G + ++ GGV N L+ +A M ++R +F FC Sbjct: 234 QETVFAMLVEITERAMAHCGSQEALIVGGVGCNVRLQEMMATMCQERGARLFATDERFCI 293 Query: 300 DNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPAA 337 DNGAMIA AG F+AG L V R+ E+ Sbjct: 294 DNGAMIAQAGWEMFRAGHRTPLSDSGVTQRYRTDEVEVT 332 >UniRef50_D0JBS4 Glycoprotease M22 family domain-containing protein n=2 Tax=Blattabacterium RepID=D0JBS4_BLASB Length = 327 Score = 297 bits (761), Expect = 4e-79, Method: Composition-based stats. Identities = 109/303 (35%), Positives = 180/303 (59%), Gaps = 15/303 (4%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCD+T ++I + +L+N + Q ++H YGGVVPELASR H + P + Sbjct: 19 IILGIESSCDDTAVSII-KNRDVLSNIIIHQ-EIHKQYGGVVPELASRLHDQNMTPAVNQ 76 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A+ + + +IDAV++T GPGL+G+LLVGA+ +S + ++P + V+H++ H+L + Sbjct: 77 AIHSAKIKKNEIDAVSFTLGPGLIGSLLVGASFAKSFSMGLEIPLLTVNHVQAHILTHFI 136 Query: 122 EDNP-----PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 ++ P+FPF+ L++SGGHTQ++ V + E+LG ++DD+ G+ FDK A+LLG Sbjct: 137 KNANMNNSYPKFPFLGLVISGGHTQIVKVNDFFKMEILGSTLDDSIGDTFDKIARLLGFH 196 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-----DQT 231 YPGGP++ + G +F F +P + L+FSFSG K+ I+ Q Sbjct: 197 YPGGPMIELFSKNGNCKKFGFSKPSVN--DLNFSFSGFKSHVLQFIKKKSKKNPLFIKQN 254 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK-RRGEV 290 +DI + + + + L+ K ++A T R+ +AGGVSAN +R K+ ++ E+ Sbjct: 255 LSDICASIQRIIAEILLEKVEKATLITDIFRVALAGGVSANCEIRRMFISFAKRNKKWEI 314 Query: 291 FYA 293 F Sbjct: 315 FIP 317 >UniRef50_C4XSD3 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Desulfovibrio RepID=GCP_DESMR Length = 371 Score = 296 bits (759), Expect = 6e-79, Method: Composition-based stats. Identities = 142/347 (40%), Positives = 193/347 (55%), Gaps = 21/347 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIETSCDET +A++++ + +L +L SQ LHA +GGVVPELASR+H+R+ PL+Q Sbjct: 1 MLCLGIETSCDETAVALFENGRPVL-EKLASQADLHAVFGGVVPELASREHLRRLGPLLQ 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A SG + D+DA+A GPGL+G+LLVG + L+ A P I V H+ HLLA Sbjct: 60 ALFAASGRSLADVDAIAVARGPGLLGSLLVGLAAAKGLSLATGKPLIGVDHLHAHLLAAT 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + FP + LLVSGGHTQ++ + E+LG ++DDAAGEAFDK AK L YPGG Sbjct: 120 I-GRDVAFPALGLLVSGGHTQIVRLESALSLEVLGRTLDDAAGEAFDKAAKSFNLPYPGG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD------------ 228 + + + +FPRP D DFSFSGLKT A+ + Sbjct: 179 VYIDVLGRGIAPDKTLFPRPFLDNDHFDFSFSGLKTAVASYAAAHPELRAGSLAEAGGAI 238 Query: 229 -----DQTRADIARAFEDAVVDTLMIKCKRALDQT--GFKRLVMAGGVSANRTLRAKLAE 281 + A+ +TL IK +RALD+ L+ AGGV+AN +RA LA+ Sbjct: 239 DPEAWPMALRRACSSLNFAIAETLRIKFERALDRQPGPPASLIAAGGVAANGPIRAMLAD 298 Query: 282 MMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPR 328 + +R ++ P C DN MIA AG +AG DL ++ PR Sbjct: 299 LAARRGLPLYLPEPALCADNAVMIAAAGSRLAEAGYAHDLALTAVPR 345 >UniRef50_C3XEQ4 O-sialoglycoprotein endopeptidase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XEQ4_9HELI Length = 500 Score = 296 bits (759), Expect = 6e-79, Method: Composition-based stats. Identities = 111/356 (31%), Positives = 172/356 (48%), Gaps = 24/356 (6%) Query: 1 MRVLGIETSCDETGIAIYDD-EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 MR+L IE+SCD++ +A D LL ++ SQ H+ YGGVVPELASR R V L+ Sbjct: 1 MRILSIESSCDDSALAYTDGTNTKLLWHEKISQEASHSHYGGVVPELASRLFARDLVQLL 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 + + + KDI +A T PGL +LL G + ++LA + ++P + ++H++GH+ + Sbjct: 61 ENF--KQNFSLKDITHIAVTNEPGLSTSLLEGVMMAKALALSLNIPLLGINHLKGHIYSL 118 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 +E P LLVSGGHT L+ ++ ++DD+ GE +DK AK+LGL YPG Sbjct: 119 FIESEAI-LPLCVLLVSGGHTMLLECYSYNDMRVIANTLDDSFGECYDKAAKMLGLGYPG 177 Query: 180 GPLLSKMAAQGTAG---RFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT----DDQTR 232 G ++ MA P P+ ++ FSFSGLK + D Sbjct: 178 GMIIDSMAQMALKENIAPIALPIPLVNQNIQSFSFSGLKNAFRLQLEKMELKTLIQDSKT 237 Query: 233 ADI---------ARAFEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEM 282 DI A +++ L+ KC+ + Q K + GG SAN LR K + Sbjct: 238 QDIKNSTQAKALALGLQESATTHLIQKCRSYMKQNSHIKHFAIVGGASANSMLREKAQSL 297 Query: 283 MKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSV---RPRWPLAELP 335 + ++ + ++C+DN AMI A + + + D+ W L P Sbjct: 298 AAQFDNKLLMSELKYCSDNAAMIGRAAIAKIRHENMIDIESKSSINEAIWNLDSPP 353 >UniRef50_C4QZU9 Putative metalloprotease, similar to O-sialoglycoprotein metallopeptidase from P. haemolytica n=1 Tax=Pichia pastoris GS115 RepID=C4QZU9_PICPG Length = 373 Score = 296 bits (758), Expect = 8e-79, Method: Composition-based stats. Identities = 110/361 (30%), Positives = 180/361 (49%), Gaps = 27/361 (7%) Query: 2 RVLGIETSCDETGIAIYD---DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +VL IE+SCD++ +++ D K ++ + + S + GGV+P A H + L Sbjct: 13 KVLAIESSCDDSCVSLIDRSAGAKPIVLDHVKSTLNS-VKAGGVIPTSAHLHHQKSIAGL 71 Query: 59 IQAALKESGLTAKD-IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 ++ L++ ++ + + V T GPG+ G+L +G + L+ AW + VHHM GHLL Sbjct: 72 VKQVLQKHNISGVNCPELVCVTRGPGMPGSLSIGVDTAKGLSVAWGSQFLGVHHMLGHLL 131 Query: 118 APMLEDN--PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 P LE N P+FPF++LL SGGHT L+ + +E+L +ID AAG+A DK A+ +G+ Sbjct: 132 IPRLESNGEEPQFPFLSLLASGGHTMLVLSRSLLDHEILVNTIDIAAGDALDKCAREIGI 191 Query: 176 -DYPGGPLLSKMAAQG-----TAGRFVFPRPMTDRPG----LDFSF----SGLKTFAANT 221 G L + + P+P+ ++P L FSF SG+K Sbjct: 192 RGNMIGKELELFLNKNPQLSLKDIPWEMPQPLKNKPKRVDTLGFSFTPFISGVKLSLERY 251 Query: 222 IRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALD----QTGFKRLVMAGGVSANRTLRA 277 +N D+ + ++A+ D ++ + A K V +GGV AN+ LR Sbjct: 252 -HNNEVKDELMPAMGFRIQEAIFDHIIDRVLVAYKVRPELNQIKTFVGSGGVVANQRLRV 310 Query: 278 KLAEMMKKRRGE-VFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELPA 336 KL +K E + P CTDN MI +AG+ ++ G T++L V+ +W + L Sbjct: 311 KLQAALKSHGVENFHFPPPALCTDNAIMIGWAGIELYENGVTSELDVTPLRKWSVEGLEK 370 Query: 337 A 337 + Sbjct: 371 S 371 >UniRef50_A9WHP1 Metalloendopeptidase, glycoprotease family n=4 Tax=Chloroflexi (class) RepID=A9WHP1_CHLAA Length = 355 Score = 295 bits (755), Expect = 2e-78, Method: Composition-based stats. Identities = 146/349 (41%), Positives = 195/349 (55%), Gaps = 23/349 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L +ETSCDET A+ + +L+N + SQ+ H YGGVVPE+ASR H+ P+++AA Sbjct: 10 ILALETSCDETAAAVVRGGRTVLSNVVASQMATHERYGGVVPEIASRQHILSLAPVVRAA 69 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 L D+ AVA T GPGL GALL G +++A+ +P + V+H+E HL A L Sbjct: 70 LAVLPNGWADVHAVAATHGPGLSGALLTGLNAAKAMAWRRGLPFVAVNHLEAHLYAGWLG 129 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 D PP FP VALLVSGGHT L+ + G Y+LLG++ DDAAGEAFDK A++LGL YPGGP Sbjct: 130 SDPPPPFPLVALLVSGGHTLLVLLRDHGNYQLLGQTRDDAAGEAFDKVARILGLGYPGGP 189 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD----------------- 224 + AA T G PR R DFSFSGLKT + ++D Sbjct: 190 AIQAAAANATPGGV-LPRAWL-RDSYDFSFSGLKTAVLHRVQDRLAQQSRLSGRKGAGET 247 Query: 225 NGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMK 284 D A +A AF+++VVD L+ K A + + +++AGGV+ANR LR +L Sbjct: 248 PQLDAPFVAQMAYAFQESVVDVLVTKTVDAARRYQAQAILLAGGVAANRRLREELIRRAS 307 Query: 285 KRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 V + CTDN AM+A A RF +G V V PL + Sbjct: 308 ---VPVHLPAFDLCTDNAAMVAAAAFYRFHSGVQYGWDVDVTANLPLEQ 353 >UniRef50_P43122 Putative protease QRI7 n=12 Tax=Saccharomycetaceae RepID=QRI7_YEAST Length = 407 Score = 292 bits (748), Expect = 1e-77, Method: Composition-based stats. Identities = 116/368 (31%), Positives = 176/368 (47%), Gaps = 40/368 (10%) Query: 2 RVLGIETSCDETGIAIYD-----DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTV 56 +VL IETSCD+T +++ D +LAN + + D GG++P A H + Sbjct: 34 KVLAIETSCDDTCVSVLDRFSKSAAPNVLANLKDTLDSI--DEGGIIPTKAHIHHQARIG 91 Query: 57 PLIQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 PL + AL ES + ID + T GPG+ G+L G + LA AW+ P I VHHM GHL Sbjct: 92 PLTERALIESNAR-EGIDLICVTRGPGMPGSLSGGLDFAKGLAVAWNKPLIGVHHMLGHL 150 Query: 117 LAPMLEDNP--PEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 L P + N P+FPFV+LLVSGGHT + I +E+L ++ID A G++ DK + LG Sbjct: 151 LIPRMGTNGKVPQFPFVSLLVSGGHTTFVLSRAIDDHEILCDTIDIAVGDSLDKCGRELG 210 Query: 175 LDYPGGPLLSKMAAQGTAG--------RFVFPRPMTD----RPGLDFSFSGLKTFAANTI 222 + G + +M + P P+ + R L FSFS T + Sbjct: 211 --FKGTMIAREMEKFINQDINDQDFALKLEMPSPLKNSASKRNMLSFSFSAFITALRTNL 268 Query: 223 RD------NGTDDQTRADIARAFEDAVVDTLMIKCKRAL-----DQTGFKRLVMAGGVSA 271 ++ IA +++V D ++ K K L + V +GGVS+ Sbjct: 269 TKLGKTEIQELPEREIRSIAYQVQESVFDHIINKLKHVLKSQPEKFKNVREFVCSGGVSS 328 Query: 272 NRTLRAKLAEMMKKRR----GEVFYARPEFCTDNGAMIAYAGMVRFKA-GATADLGVSVR 326 N+ LR KL + +Y + C+DN MI +AG+ +++ +DL + Sbjct: 329 NQRLRTKLETELGTLNSTSFFNFYYPPMDLCSDNSIMIGWAGIEIWESLRLVSDLDICPI 388 Query: 327 PRWPLAEL 334 +WPL +L Sbjct: 389 RQWPLNDL 396 >UniRef50_Q74M58 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Nanoarchaeum equitans RepID=GCP_NANEQ Length = 314 Score = 292 bits (748), Expect = 1e-77, Method: Composition-based stats. Identities = 96/331 (29%), Positives = 165/331 (49%), Gaps = 19/331 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGIE + G+ I+D EKG+LAN+ + G+ P A+ H+++ ++ Sbjct: 1 MKVLGIECTAHTFGVGIFDSEKGVLANEKVTY-----KGYGIHPREAAELHLKEFDKVLL 55 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+++ ++ KDID +A ++GPGL+ L +G + L + P I V+H+ H Sbjct: 56 KALEKANISLKDIDLIAVSSGPGLLPTLKLGNYIAVYLGKKLNKPVIGVNHIVAHNEFAR 115 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + F + VSG +TQ +++ + L+GE++D G DK A+ LGL++PGG Sbjct: 116 YLAKAKDPLF--VYVSGANTQFLAIVN-NSWFLVGETLDMGVGNLIDKVARDLGLEFPGG 172 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 P + ++A +G P + GL+ G+ T+ D ++ DIA + + Sbjct: 173 PKIEELAKKGK-NLIELPYTI---KGLNLQLGGIYTYIKRI-----KDQYSKEDIAYSLQ 223 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + V ++ +RA+ K L++ GGV+ N L +M K+ + + ++ TD Sbjct: 224 EWVFALILEIAERAMHMLDKKELILTGGVACNNRLNDMAEQMAKENNFKFYRLPCQYLTD 283 Query: 301 NGAMIAYAGMVRFKAGATADLGVSVRPRWPL 331 NGAMIAY G + G RP W + Sbjct: 284 NGAMIAYLGYYWYSQGIY--YEPKPRPYWRI 312 >UniRef50_Q4UA14 Glycoprotein endopeptidase, putative n=3 Tax=Piroplasmida RepID=Q4UA14_THEAN Length = 363 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. Identities = 99/350 (28%), Positives = 159/350 (45%), Gaps = 22/350 (6%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGIE S ++ GIA+ + +L+N + D G +P S+ H L+ AL Sbjct: 15 LGIEGSANKLGIAVIRGDGEILSNVRRTY--SPPDGEGFLPRQVSKHHRENMASLLMEAL 72 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 +++G+T D+ + YT GPG+ L VGA +++ F P + V+H H+ Sbjct: 73 EKAGITLSDLSLICYTKGPGIGSGLHVGALAAKTIHFITGKPIVGVNHCVAHVEMGRFLS 132 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQ-YELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + L VSGG+TQ++S + Y +LGE++D A G D+ A+LL L P Sbjct: 133 GYKKPA--ILYVSGGNTQVLSYDEKRKVYSVLGETLDIAIGNVLDRIARLLHLPNKPAPG 190 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD-----------DQT 231 LS + + + P P G+D S SGL T + I T +Q Sbjct: 191 LSIELQARKSSKNLIPLPFVV-KGMDCSLSGLLTKCEDLIEHFKTKLIMSEDSAFEYEQF 249 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 + D+ + ++ L+ +RA+ T +++ GGV N L+ M K+R ++F Sbjct: 250 KVDLCFSVQEHTFAMLIEMLERAMSFTDSDEILLVGGVGCNLRLQEMANLMAKERNAKLF 309 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT-----ADLGVSVRPRWPLAELPA 336 +C DNGAMI Y GM+ + G V+V R+ + P Sbjct: 310 PMDERYCIDNGAMIGYTGMIDYLYGLKEKCVLEPKEVTVSQRYRTDQAPV 359 >UniRef50_A4VEZ5 O-sialoglycoprotein endopeptidase n=1 Tax=Tetrahymena thermophila SB210 RepID=A4VEZ5_TETTH Length = 377 Score = 291 bits (746), Expect = 2e-77, Method: Composition-based stats. Identities = 107/379 (28%), Positives = 173/379 (45%), Gaps = 54/379 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGIE S ++ G+ I + +LAN + + G +P + H K + ++ Sbjct: 1 MIALGIEGSANKIGVGIVKSDGTILANPKTTF--ITPPGTGFLPNETAVHHRSKILDIVD 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ALKE+ LT KDI + YT GPG+ L +GA V R+L+ ++P I V+H GH+ Sbjct: 59 QALKEANLTFKDIGLICYTKGPGMGPPLSIGAIVSRTLSLLHNIPLIGVNHCIGHIEMGR 118 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYP 178 L L VSGG+TQ+I+ + +Y + GE++D A G D+ A+++ L D Sbjct: 119 LATGITHPA--VLYVSGGNTQVIAYSNQ-RYRIFGEALDIAVGNCLDRFARIINLSNDPA 175 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG------------ 226 G + ++A QG P T G+D SFSG+ ++ + + N Sbjct: 176 PGYNIEQLAKQGKQF---IQVPYTV-KGMDMSFSGILSYFEDIVAQNPHLQYEDGVVPEK 231 Query: 227 ------------------------------TDDQTRADIARAFEDAVVDTLMIKCKRALD 256 D TRAD+ + ++ + L +RA+ Sbjct: 232 DAKQQDEDDSLDNRKRKKNKKVVNKKILDLPKDITRADLCYSLQETIFAMLTEVTERAMA 291 Query: 257 QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 +++ GGV N L+ + +M+ +R G+V +C DNGAMIAYAG++ ++AG Sbjct: 292 HCNSNEVIIVGGVGCNVRLQEMIGQMVSERGGKVGAMDHRYCIDNGAMIAYAGILEYEAG 351 Query: 317 ATADLGVSV-RPRWPLAEL 334 D S R+ E+ Sbjct: 352 GRMDFKDSYFTQRFRTDEV 370 >UniRef50_A2QMR2 Function: O-sialoglycoprotein endopeptidase is a neutral metalloprotease n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QMR2_ASPNC Length = 430 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 112/380 (29%), Positives = 164/380 (43%), Gaps = 50/380 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKG-LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 + L IETSCD+T +AI + E + + L + Y G+ P +A H L Sbjct: 30 LLTLAIETSCDDTSVAIVEKESNAVQIHFLDKVTCDTSAYQGIHPVVALESHQENIASLQ 89 Query: 60 QAA--LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 Q +S L + D V T GPG L VG G++L+ AW VP + VHHM+ HLL Sbjct: 90 QTINVSSDSQLR-RKPDFVCSTRGPGFRSNLFVGLDTGKALSVAWQVPFVGVHHMQAHLL 148 Query: 118 APMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDY 177 P L PEFPF+++L+SGGHT L+ + I +E++ ++D A GEA DK A+ + + Sbjct: 149 TPRLP-ITPEFPFLSILISGGHTMLVKSSSITDHEIMASTVDRALGEALDKAAREIIPPF 207 Query: 178 --------PGGPLLSKMA----------------------AQGTAGRFVFPRPMTDRPGL 207 G LL + A + + F P L Sbjct: 208 LLQTSKSTMYGKLLEEFAFPNGKADYADYQAPKSRHDELIPRENPWGWSFTEPWAHSRQL 267 Query: 208 DFSFSGLKTFAANTIRDNGT-----DDQTRADIARAFEDAVVDTLMIKCKRALD------ 256 +SF + + A + R +AR + L + AL+ Sbjct: 268 QYSFCFIGSTLARIFSAREAAGQTISHEERIALAREAMRTSFEHLASRTIMALESLAKQG 327 Query: 257 -QTGFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVR 312 + K LV++GGV+AN+ L L + R + P CTDN AMIA+AGM Sbjct: 328 PEKEVKTLVVSGGVAANQYLMTVLRSWLDARGFGHVGLVAPPPYLCTDNAAMIAWAGMEM 387 Query: 313 FKAGATADLGVSVRPRWPLA 332 F+AG +L +W L Sbjct: 388 FEAGWRTNLTSRAIRKWSLD 407 >UniRef50_Q1IZH8 Probable O-sialoglycoprotein endopeptidase n=4 Tax=Deinococci RepID=GCP_DEIGD Length = 333 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 135/303 (44%), Positives = 179/303 (59%), Gaps = 14/303 (4%) Query: 3 VLGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 +LGI+TSCD+TG+ + D + AN+++SQ +HA YGGV+PELASR+HV + + Sbjct: 7 ILGIDTSCDDTGVGVVELAPDGSVQVRANRVWSQ-TVHAQYGGVLPELASREHVERIDTV 65 Query: 59 IQAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 AL E+GLT D+ AVA T+GPGLVGALLVG G+ LA A +VP HH+EGH+ A Sbjct: 66 TGDALAEAGLTVGDLAAVAATSGPGLVGALLVGLMYGKGLAQALNVPFYAAHHLEGHIFA 125 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP 178 + + P++AL+VSGGHT L V G+Y L+G + DDAAGEAFDK A+L GL YP Sbjct: 126 A-ASEADLQAPYLALVVSGGHTHLFDVPREGEYVLVGATRDDAAGEAFDKVARLAGLGYP 184 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 GGP +S+ A +G F P+ + G DFSFSGLKT A R + D+A Sbjct: 185 GGPAISEAARRGDPEAVPFKEPLQGQKGFDFSFSGLKTAALLAHRAGAKPE----DLAAG 240 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 FE A V L+ RA G + +V++GGV+ANR LR + Sbjct: 241 FERAAVRFLVGTTLRAARAYGRETVVVSGGVAANRALREAF----AASPVRAVFPGKGLN 296 Query: 299 TDN 301 TDN Sbjct: 297 TDN 299 >UniRef50_Q83I95 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Tropheryma whipplei RepID=GCP_TROW8 Length = 401 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 130/399 (32%), Positives = 181/399 (45%), Gaps = 68/399 (17%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCDETG+ I +LAN++ S H +GGV+PE+A+R H+ L++ Sbjct: 3 IILGIETSCDETGVGIV-SGSTVLANEVASSSLRHKPFGGVIPEIAARAHLEYLPNLLEL 61 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLV--------GATVGR----------------- 96 AL+ + L KDID +A TAGPGLV +L V G + G Sbjct: 62 ALETAQLCIKDIDGIAVTAGPGLVTSLSVGVSAAKALGLSTGTPVYGVNHLVGHAVSAFL 121 Query: 97 ------SLAFAWDVPAIPVH--------------------HMEGHLLAP--MLEDNPPEF 128 L +I + H + P + + ++ Sbjct: 122 DDYTNDGLGVIHRRDSIGSNGIENDASSTHSHTHTTQVNRHSNLCVYTPPRRVLRDVCKY 181 Query: 129 PFV----ALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLS 184 V LL SGGH+ L+ + + LLGE++DDAAGEAFDK A+L+GL YPGGP + Sbjct: 182 MHVRDSVVLLASGGHSCLLKIHN-NKISLLGETLDDAAGEAFDKIARLMGLQYPGGPAIE 240 Query: 185 KMAAQGTAGRFVFPRPMT----DRPGLDFSFSGLKTFAANTIRDNGTDD----QTRADIA 236 +A+ G FPR + + FSFSGLKT + ++ DIA Sbjct: 241 MLASSGNPNAVEFPRALLTHFEEHNRYSFSFSGLKTAVGRVVERIKSNPAHSIPKIEDIA 300 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 +F++AV D L K A + +VM GGV+AN +R L E K +V Sbjct: 301 ASFQEAVADVLTAKTVAAALASDVDLIVMGGGVAANNRIREMLCERAKIHGLDVKIPPIA 360 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLG-VSVRPRWPLAEL 334 CTDNGAMIA AG + G S PL ++ Sbjct: 361 LCTDNGAMIAAAGSWLMQLGYNPSHSRFSPVSIMPLTQM 399 >UniRef50_B5Y892 O-sialoglycoprotein endopeptidase n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y892_COPPD Length = 316 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 109/328 (33%), Positives = 169/328 (51%), Gaps = 20/328 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+L IETSCDET +A + +K + + +++SQ+ LH +GGV+PE A+R H+ L++ Sbjct: 4 RILAIETSCDETAVACLNGDKVVQS-KVFSQIDLHEAFGGVLPEAAARRHLEVLPVLLKD 62 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 D +A TAGPGL+ ALL G +V L+ W VP + ++H+ H+ A L Sbjct: 63 V--------AKPDLIAVTAGPGLLPALLTGVSVALGLSRGWQVPVMGINHVVAHVAAAAL 114 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 E E P + L+VSGGHT + ++LG + DDAAGE DK + LG+ YP G Sbjct: 115 E-RRIELPVLGLVVSGGHTSFYLIEKWSDPKVLGWTYDDAAGECLDKVGRALGMKYPAGA 173 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 + +A R P P+ + +FSFSGLKT A + +A + + Sbjct: 174 EIDNLALT-IKERVTMPLPLKNEDSFNFSFSGLKTAALKYKGK-----ISNEVLAASLME 227 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 AVV+ L+ + ++ L + + LV+ GGVSA++ LR ++ E V + + TDN Sbjct: 228 AVVNHLLDRIEKVLKKYPYP-LVVGGGVSASKFLRQRMHE---HFGERVIFPSAQLSTDN 283 Query: 302 GAMIAYAGMVRFKAGATADLGVSVRPRW 329 M+A + + G V+ P Sbjct: 284 ADMVAVYAALLLQEGIVPGSCVTPDPNM 311 >UniRef50_UPI0000DB7930 PREDICTED: similar to O-sialoglycoprotein endopeptidase-like 1 n=1 Tax=Apis mellifera RepID=UPI0000DB7930 Length = 385 Score = 287 bits (734), Expect = 4e-76, Method: Composition-based stats. Identities = 99/339 (29%), Positives = 155/339 (45%), Gaps = 37/339 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIE+SCD+T I D +L + SQ H ++GG++P A HV + Sbjct: 30 IILGIESSCDDTAFGIVDSNGNILGESINSQYLTHLNFGGIIPTFARSLHVNNITKTCED 89 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL+ + L +DIDA+A T G+ LA P IP+HHME H L + Sbjct: 90 ALRAANLRIRDIDAIATT--------------FGKYLAKIGGKPFIPIHHMEAHALTARI 135 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 + +FP++ALL+SGGH L V + ++ LLG S+ + G+ F+K A+ L L Sbjct: 136 -NKKIDFPYLALLISGGHCLLAIVENVNKFYLLGTSLSNTPGDVFNKVARRLKLRNIPEF 194 Query: 176 -DYPGGPLLSKMA-AQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 GG + A +F+FP M +FSFSGL F + I Sbjct: 195 STLNGGQAIELAASKASNVNQFLFPLIMMQFRNCNFSFSGLLNFFGDMI------IPDVY 248 Query: 234 DIARAFEDAVVDTLMIKCKRALDQ--------TGFKRLVMAGGVSANRTLRAKLAEMMKK 285 + AF+ A+ + + +RA++ + LV++GGV+ N L L + + Sbjct: 249 NFCAAFQLALTTHICQRTQRAMEFINKMSLFPENKQTLVISGGVACNNFLAKALNIVSTE 308 Query: 286 RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS 324 + CTDNG MIA+ G+ ++ ++ Sbjct: 309 LGYTFVRTPSKLCTDNGIMIAWNGVEKWIQNIDVIRDIN 347 >UniRef50_UPI000023E24C hypothetical protein FG06887.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023E24C Length = 1434 Score = 287 bits (734), Expect = 4e-76, Method: Composition-based stats. Identities = 120/403 (29%), Positives = 172/403 (42%), Gaps = 75/403 (18%) Query: 3 VLGIETSCDETGIAIYD---DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 L IETSCD+TG+A+ LL N+ S + G+ P +A++ H PL+ Sbjct: 1017 TLAIETSCDDTGVAVLRHTSQSTELLFNERISSDN--RAFKGIHPIVAAKGHSVSLAPLV 1074 Query: 60 QAALKE---------------SGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDV 104 + AL SG+ + D V+ T GPG+ L +G + + LA AWDV Sbjct: 1075 RRALNALPAAEDGDNKRICYASGVRKQVPDFVSVTRGPGMRSNLGIGLDMAKGLAVAWDV 1134 Query: 105 PAIPVHHMEGHLLAPML-------------EDNPPEFPFVALLVSGGHTQLISVTGIGQY 151 P + VHHM+ H L P L PEFPF++LLVSGGHTQL+ TG+ + Sbjct: 1135 PLVGVHHMQAHALTPRLARALGMSMGEAEESRKGPEFPFLSLLVSGGHTQLVHSTGLTDH 1194 Query: 152 ELLGESIDDAAGEAFDKTAKLL-------------------GLDYPGGPLL--------- 183 ++ S D A G D+TA+ + +P G Sbjct: 1195 SIIATSGDIAIGNLLDQTARDILPSEVFDASEHVMYGRLLEAFAFPTGADTTSAYEAVFT 1254 Query: 184 ------SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI-RDNGTDDQTRADIA 236 +M T + P P L FSFS + T + R +A Sbjct: 1255 PPASRSEEMTPVSTGYDWNIPTPFRQSRKLAFSFSSIYTHVHDIATARPSMSTSERRALA 1314 Query: 237 RAFEDAVVDTLMIKCKRALDQTG----FKRLVMAGGVSANRTLRAKLAEMMKKRRGE--- 289 + A L + ALD K LVMAGGV++N+ L L M+ R E Sbjct: 1315 QHTMMAAFVHLAGRLCIALDDKPELQAAKTLVMAGGVASNKFLMHVLRSMLAIRGYEGIE 1374 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + E CTDN AMIA+ G+ F+AG ++L ++ +WP+ Sbjct: 1375 IVAPPVELCTDNAAMIAWTGIEMFQAGYESELSITGIGKWPMD 1417 >UniRef50_D2RYV2 Metalloendopeptidase, glycoprotease family n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RYV2_9EURY Length = 578 Score = 286 bits (732), Expect = 8e-76, Method: Composition-based stats. Identities = 103/358 (28%), Positives = 155/358 (43%), Gaps = 37/358 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 R+LGIE + A+YD + D GG+ P A+ +++ Sbjct: 6 RILGIEGTAWAASAAVYDSATD---DVFIESDAYQPDSGGIHPREAAEHMHDAIPRVVET 62 Query: 62 ALKES---------------------GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAF 100 AL+ + G A +DA+A++ GPGL L + T R+L+ Sbjct: 63 ALEHARETHDGPAGEAPVDVDERSSSGQQAAPVDAIAFSRGPGLGPCLRIVGTAARALSQ 122 Query: 101 AWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDD 160 A +VP + V+HM HL + V L SG + L++ G+Y +LGE++D Sbjct: 123 ALEVPLVGVNHMVAHLEIGRHTADFDSP--VCLNASGANAHLLAYRN-GRYRVLGETMDT 179 Query: 161 AAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAAN 220 G A DK + +G +PGGP + A G + G+DFSFSG+ + A Sbjct: 180 GVGNAIDKFTRHVGWSHPGGPKVEAAAEDGEYVDLPYVV-----KGMDFSFSGIMSAAKQ 234 Query: 221 TIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLA 280 D+ DI + ++ + L +RAL TG LV+ GGV N LR LA Sbjct: 235 AY----DDETPVEDICFSLQENIFGMLTEVAERALSLTGSDELVLGGGVGQNERLREMLA 290 Query: 281 EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPAA 337 EM +R E P F DN MIA G ++AG T ++ V P + ++P Sbjct: 291 EMCAQRGAEFHAPEPRFLRDNAGMIAVLGAKMYEAGDTLEIEDSQVDPNYRPDQVPVT 348 >UniRef50_P75055 Probable O-sialoglycoprotein endopeptidase n=2 Tax=Mycoplasma RepID=GCP_MYCPN Length = 319 Score = 285 bits (729), Expect = 2e-75, Method: Composition-based stats. Identities = 109/315 (34%), Positives = 169/315 (53%), Gaps = 19/315 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIET+CD+T I + + K + A+ + S KLHA GGVVPE+A+R H + + A Sbjct: 7 ILGIETTCDDTSIGVITESK-VQAHIVLSSAKLHAQTGGVVPEVAARSHEQNLL----KA 61 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L++SG+ + I +AY A PGL G L VGAT RSL+F D P +P++H+ H+ + +++ Sbjct: 62 LQQSGVVLEQITHIAYAANPGLPGCLHVGATFARSLSFLLDKPLLPINHLYAHIFSALID 121 Query: 123 D--NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 N + P + L+VSGGHT + + + EL+ E+ DDA GE +DK + +G YP G Sbjct: 122 QDINQLKLPALGLVVSGGHTAIYLIKSLFDLELIAETSDDAIGEVYDKVGRAMGFPYPAG 181 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD------ 234 P L + F RP T FS+SGLK+ I+ Sbjct: 182 PQLDSLFQPELVKSHYFFRPSTKWTK--FSYSGLKSQCFTKIKQLRERKGFNPQTHDWNE 239 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 A F+ ++D + K A+ Q + L++ GGVSAN+ LR ++ ++ + A Sbjct: 240 FASNFQATIIDHYINHVKDAIQQHQPQMLLLGGGVSANKYLREQVTQL----QLPYLIAP 295 Query: 295 PEFCTDNGAMIAYAG 309 ++ +DNGAMI + Sbjct: 296 LKYTSDNGAMIGFYA 310 >UniRef50_C4PYC5 Mername-AA018 peptidase (M22 family) n=1 Tax=Schistosoma mansoni RepID=C4PYC5_SCHMA Length = 388 Score = 285 bits (729), Expect = 2e-75, Method: Composition-based stats. Identities = 105/342 (30%), Positives = 164/342 (47%), Gaps = 35/342 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIETSCD+TG A+ + LL + L SQ ++ GGV+P +A+ H ++ A Sbjct: 36 VLGIETSCDDTGAAVIETSGKLLGDCLSSQSRISVMLGGVLPSVAAELHKENIESVVNTA 95 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + +S + +D++ VA T PG+ +L +G + +SLA +P IP+ HME H L + Sbjct: 96 MAKSNIGLRDLNFVAVTVKPGMPLSLKIGVSFAKSLASRLKIPIIPIDHMEAHALTALFT 155 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------- 175 D +FP++ LL+SGGH L V G+ Y LLG ++D + G+ DK ++ L L Sbjct: 156 DPQLKFPYMILLISGGHGILGIVQGLEDYVLLGTALDASPGDVLDKLSRRLKLNRLSDEC 215 Query: 176 --DYPGGPLLSKMAA--QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 GG + +A G RF P P + DFSF+G+ A I ++++ Sbjct: 216 LKGVAGGKAIEIIAKTYNGDHQRFNLPLPRSQSKDCDFSFTGIHAAAEQLINKLESENR- 274 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 T + C + Q V++GGV +N +RA L E+ Sbjct: 275 -------------GTFYLPCSIFISQMK----VVSGGVGSNCVIRAGLTEVANHYNLRFV 317 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGAT------ADLGVSVRP 327 P CTDNG MIA+ G++ K ++ + + R Sbjct: 318 APPPSLCTDNGIMIAWNGVLLQKENSSRIIEDISSVDFCPRS 359 >UniRef50_B6GZQ3 Pc12g05880 protein n=9 Tax=Trichocomaceae RepID=B6GZQ3_PENCW Length = 457 Score = 285 bits (729), Expect = 2e-75, Method: Composition-based stats. Identities = 114/407 (28%), Positives = 175/407 (42%), Gaps = 75/407 (18%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLL--ANQLYSQVKLHAD---YGGVVPELASRDHVRKT 55 + L IETSCD+T +AI + K A +++ + AD + G+ P +A H Sbjct: 34 LLTLAIETSCDDTSVAIVEKTKKESGSAAKIHFLENVTADTRAHRGIHPIIALESHQDNL 93 Query: 56 VPLIQAALKES-------GLTAKD------IDAVAYTAGPGLVGALLVGATVGRSLAFAW 102 L+Q AL GL D D ++ T GPG+ L VG G++L+ AW Sbjct: 94 ATLVQKALNYLPESKTSDGLKLADGTRRRLPDFISATRGPGMRSNLSVGLDTGKALSVAW 153 Query: 103 DVPAIPVHHMEGHLLAPMLEDN------------PPEFPFVALLVSGGHTQLISVTGIGQ 150 +P + VHHM+ HLL P L PEFPF+++LVSGGHT L+ GI Sbjct: 154 QIPMVGVHHMQAHLLTPGLVTCLENASKAGPPAIAPEFPFLSILVSGGHTTLVQSKGITD 213 Query: 151 YELLGESIDDAAGEAFDKTAKLLGLD-------------------YPGGPLL-------- 183 +++L S D A GEA DK+A+ + D +P G Sbjct: 214 HKILATSEDIAIGEALDKSARDILPDSLLQEAKSTMYGKNLEQFVFPNGKADFADYSPPD 273 Query: 184 ---SKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN----GTDDQTRADIA 236 ++ + + + P + + FSFS + + ++ + R D+ Sbjct: 274 TRGQEITKRVSDWGWSLTTPFANTRMMQFSFSSISSMVGKIVQRSGTNIKMSHAERVDLG 333 Query: 237 RAFEDAVVDTLMIKCKRALD--------QTGFKRLVMAGGVSANRTLRAKLAEMMKKRR- 287 R + L + AL+ + K LV++GGV+AN+ L L ++ R Sbjct: 334 REAMRVCFEHLASRTVIALETLRPHNTGKDEIKTLVVSGGVAANQFLMKVLTSFLEVRGF 393 Query: 288 --GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + P CTDN AMI +AG+ F+AG +DL +W L Sbjct: 394 GNINIVAPPPYLCTDNAAMIGWAGIEMFEAGFRSDLSCRPLRKWTLD 440 >UniRef50_Q8EUQ9 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma penetrans RepID=GCP_MYCPE Length = 306 Score = 284 bits (728), Expect = 3e-75, Method: Composition-based stats. Identities = 104/314 (33%), Positives = 168/314 (53%), Gaps = 12/314 (3%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L IETSCD+T +AI +D K +L+ + + K +GG+VPE+ +R H + + Sbjct: 1 MYILSIETSCDDTSVAILEDNK-VLSCIIKNDSKQLNPFGGIVPEIVARYHEENIIKALD 59 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ES ++ ID VAYT PGL G+L VG +++A+A DV +P++H+ GH+L+P Sbjct: 60 LALQESNISLNQIDKVAYTNQPGLPGSLFVGEIFAKTMAYALDVECVPINHIHGHILSPF 119 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + ++ P++PF++L+ SG T + V + L ++ DDA GE FDK K LG DYP G Sbjct: 120 I-NSVPKYPFMSLIASGKTTSIFLVKSANEIIELTKTRDDAIGEIFDKVGKALGYDYPAG 178 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR--DNGTDDQTRADIARA 238 P L K A P+ + DFSFSG+K + I ++ I + Sbjct: 179 PKLDKYFDISKATITPSFPPVKN----DFSFSGIKNKFLSIINSSKMKNEEIDTITIGSS 234 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 F +D ++ K K D+ + + GGV+ N + ++ ++ + F ++ Sbjct: 235 FLKYSIDLIIKKLKYYKDEYSVDCVCIGGGVANNNYFKQEIKKLFS----DSFVPESKYS 290 Query: 299 TDNGAMIAYAGMVR 312 TDN AMI +A + Sbjct: 291 TDNAAMIGFAYYEK 304 >UniRef50_B7XIP4 O-sialoglycoprotein endopeptidase n=2 Tax=Eukaryota RepID=B7XIP4_ENTBH Length = 360 Score = 282 bits (723), Expect = 9e-75, Method: Composition-based stats. Identities = 105/352 (29%), Positives = 171/352 (48%), Gaps = 23/352 (6%) Query: 3 VLGIETSCDETGIAIY---DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 VLGIE+S ++ G+ I ++ LLAN+ + A GV+P A++ H + LI Sbjct: 13 VLGIESSANKIGVGILKIMNENVELLANERKTY--TPAPGAGVIPIDAAKHHRDVILELI 70 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 +L++S L +DID AYT GPG+ L+VG V R+LA + P +PV+H H+ Sbjct: 71 DVSLQKSNLVIQDIDLYAYTKGPGMYQLLVVGCVVARTLALYHNKPLVPVNHCVAHIEMG 130 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTG--IGQYELLGESIDDAAGEAFDKTAKLLGLD- 176 + L SGG+TQ+I+ +Y++ GE+ID A G FDK A+ LGLD Sbjct: 131 RFITGAKNP--IVLYASGGNTQIINRISGKTNKYKIFGETIDVAVGNCFDKVARALGLDN 188 Query: 177 -YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT---- 231 G + + A ++ P P T G+D SFSG+ + I+D + + + Sbjct: 189 APSPGFNIERQAELNHEKKY-IPLPYT-IKGMDMSFSGILSTCLKLIKDFKSTNPSSAQF 246 Query: 232 ---RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG 288 ++I + ++ + L+ +R +++ GGV N L+ + +M+ +R G Sbjct: 247 KKFISEICFSLQETMFSILVEATERCCSFVESNEVLIVGGVGCNLRLQEMIHKMITQRGG 306 Query: 289 EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVS---VRPRWPLAELPAA 337 V+ +C DNGAMIAY G + FK + + V R+ + Sbjct: 307 TVYSMNEAYCIDNGAMIAYTGYLIFKHQSKYVTNLEDCYVTQRFRTDSVDIT 358 >UniRef50_Q4PGZ6 Putative uncharacterized protein n=2 Tax=Ustilaginomycotina RepID=Q4PGZ6_USTMA Length = 414 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. Identities = 115/368 (31%), Positives = 182/368 (49%), Gaps = 39/368 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LGIETSCD++ +I ++ +L++ + Q H+ GG+ P A+ H I A Sbjct: 51 LILGIETSCDDSCASIVSSDRTILSSIVTKQD--HSSTGGIHPLSAALGHHSNLASTIAA 108 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 A++++ +TA D+ A+A T GPG+ +L VG + ++L+ +P I VHHM+ H L P+L Sbjct: 109 AIEQARITASDLHAIAVTQGPGMASSLGVGLSAAKTLSAVLHIPLIYVHHMQAHALTPLL 168 Query: 122 EDN-PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + PP+ PF+ LLVSGGHT L+ + + +L + DD+ G+AFDK A+ LG+ + Sbjct: 169 TEPDPPKLPFLVLLVSGGHTMLVLARSVTHFRILATTSDDSIGDAFDKVARDLGIPWTSA 228 Query: 181 P----LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD---DQTRA 233 P A+ VFP P +P FS+SGLK I D + ++ Sbjct: 229 PGAALEALAARAEAHGDGLVFPTPCKGQP--TFSYSGLKAAVQRHIASCSPDAMAESAKS 286 Query: 234 DIARAFEDAVVDTLMIKCKRALD-----------------------QTGFKRLVMAGGVS 270 IA AF+ A L K L K +V +GGV+ Sbjct: 287 SIAAAFQRAACAQLEDKLSMVLRPSHVSQDSRHRPFARIELLDGVSSDDVKTVVCSGGVA 346 Query: 271 ANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +N +R++L E + + ++ + CTDN AMIA+ G + + T D RP Sbjct: 347 SNAFIRSRLREHLDRLGRTDVDLQFPPLSLCTDNAAMIAWVGHLIY-HQRTRDYTRHARP 405 Query: 328 RWPLAELP 335 +W L ++P Sbjct: 406 KWSLQDIP 413 >UniRef50_B1AJ51 Probable O-sialoglycoprotein endopeptidase n=15 Tax=Ureaplasma RepID=GCP_UREP2 Length = 320 Score = 282 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 105/322 (32%), Positives = 163/322 (50%), Gaps = 11/322 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IE+SCDET +A++++ K L+A+++ S + + +GGVVPELASR H + L Sbjct: 6 LILSIESSCDETSLALFENNK-LIAHKISSSASIQSLHGGVVPELASRYHEQNINHLFNE 64 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E+ + I VAYTA PGL G L VG + LA + +P++H+ H+ + + Sbjct: 65 ILNETKINPLTITHVAYTAMPGLPGCLHVGKVFAKQLAVLINAELVPINHLHAHVFSASI 124 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + FPF+ L+VSGG + + V + ++L ++ DDA GE +DK A++LG YPGGP Sbjct: 125 -NQNLTFPFLGLVVSGGESCIYLVNDYDEIKVLNQTHDDAIGECYDKIARVLGWKYPGGP 183 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ--TRADIARAF 239 ++ K + A + DFSFSGLKT N I + +A +F Sbjct: 184 IIDKNYQENLAT---LEFIKSQPAAKDFSFSGLKTAVINYIHNAKQKKISFDPVVVASSF 240 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + ++ ++ K K L+ L + GGVSAN LR K+ + + + Sbjct: 241 QKFAINEIIKKIKYYLNLYKLNHLAIGGGVSANSLLRKKIQSL----DVISYIPEMIYTG 296 Query: 300 DNGAMIAYAGMVRFKAGATADL 321 DN AMI K + L Sbjct: 297 DNAAMIGAYAYALIKNHKKSIL 318 >UniRef50_Q93170 Protein C01G10.10, confirmed by transcript evidence n=3 Tax=Caenorhabditis RepID=Q93170_CAEEL Length = 421 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 99/349 (28%), Positives = 170/349 (48%), Gaps = 22/349 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +VLGIETSCD+T +AI ++++ +L+++ Y++ + GG+ P + + H LI+ Sbjct: 24 KVLGIETSCDDTAVAIVNEKREILSSERYTERAIQRQQGGINPSVCALQHRENLPRLIEK 83 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L ++G + KD+DAVA T PGLV AL G + A +P IPVHHM H L+ +L Sbjct: 84 CLNDAGTSPKDLDAVAVTVTPGLVIALKEGISAAIGFAKKHRLPLIPVHHMRAHALSILL 143 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK---LLGLDYP 178 D+ FPF A+L+SGGH + + +++L G+S+ + GE DK A+ LG ++ Sbjct: 144 VDDSVRFPFSAVLLSGGHALISVAEDVEKFKLYGQSVSGSPGECIDKVARQLGDLGSEFD 203 Query: 179 G---GPLLSKMA-AQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQ---T 231 G G + +A G +P + + P + +F +K N + + + Sbjct: 204 GIHVGAAVEILASRASADGHLRYPIFLPNVPKANMNFDQIKGSYLNLLERLRKNSETSID 263 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLRAKLAEMMKKR 286 D + ++ V + K + K+LV+ GGV+AN+ + ++++ Sbjct: 264 IPDFCASLQNTVARHISSKLHIFFESLSEQEKLPKQLVIGGGVAANQYIFGAISKLSAAH 323 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAELP 335 CTDN MIAY+G++ + A W ++P Sbjct: 324 NVTTIKVLLSLCTDNAEMIAYSGLLMLVNRSEAIW-------WRPNDIP 365 >UniRef50_Q7NB15 Probable O-sialoglycoprotein endopeptidase n=1 Tax=Mycoplasma gallisepticum RepID=GCP_MYCGA Length = 321 Score = 277 bits (708), Expect = 6e-73, Method: Composition-based stats. Identities = 101/313 (32%), Positives = 160/313 (51%), Gaps = 18/313 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE+SCD+ IAI D K ++ + S +HA+YGGVVPE+A+R H + A Sbjct: 7 ILGIESSCDDLSIAIAIDNK-IVTTKTKSSSSVHANYGGVVPEIAARYHEEILHQTLNEA 65 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ LT ID + YT PGL+ L V +L + +PA ++H+ GH+ +PM++ Sbjct: 66 LTEANLTINKIDLITYTENPGLLNCLHVAKVFANTLGYLLKIPAQGINHLYGHIFSPMID 125 Query: 123 DNPP-------EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL 175 D +P + ++VSGGHT + V + LL E++DDA GE +DK + LGL Sbjct: 126 DGDCLYQKSDLIYPALGIVVSGGHTAIYDVQSPSKITLLDETLDDAIGEVYDKVGRALGL 185 Query: 176 DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR-DNGTDDQTRAD 234 YP G + ++ A F + FS+SG K+ I + D Sbjct: 186 QYPAGAKIDQLYNPEQAETVEF---LKTNKLSAFSYSGFKSAVLRYIELNKNQPDFNLVQ 242 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFK--RLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 +F+ ++D + + K +++ K +++ GGVSAN LR++L E+ + Sbjct: 243 AVSSFQKFIIDDFIDRIKNVINKADSKYQTILLGGGVSANSYLRSELKELA----IKTLV 298 Query: 293 ARPEFCTDNGAMI 305 +P + DN AMI Sbjct: 299 PKPIYSGDNAAMI 311 >UniRef50_A2BJY9 Putative O-sialoglycoprotein endopeptidase n=22 Tax=Thermoprotei RepID=GCP_HYPBU Length = 363 Score = 276 bits (707), Expect = 7e-73, Method: Composition-based stats. Identities = 107/337 (31%), Positives = 165/337 (48%), Gaps = 13/337 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGIE++ G+ I + + + H GG+ P A+ H R +I A Sbjct: 32 VLGIESTAHTFGVGIASTKPPYILVSVR--DTYHPPKGGIHPREAASHHARVASEVILDA 89 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L+ GL+ +DIDAVA GPGL AL VGAT+ R LA + P +PV+H H+ L Sbjct: 90 LRTVGLSIRDIDAVAVALGPGLGPALRVGATIARGLAAYYGKPLVPVNHAVAHIEIARLY 149 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP---G 179 + V L VSGG+T +++ +Y + GE++D A G D A+ G+ P Sbjct: 150 TGLGDP--VVLYVSGGNT-VVAAYAKARYRVFGETLDIALGNLLDTFARDAGIAPPYIVS 206 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAF 239 G + A+ + P + G+D SFSGL T A G++D+ +A + Sbjct: 207 GLHIVDRCAEAASKPADLPYVV---KGMDVSFSGLLTAALRLWTKAGSEDE-KAAVCLGL 262 Query: 240 EDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCT 299 + +++ +RAL T K +++ GGV+A+ LR K+ M + P+ Sbjct: 263 REVAYGSVVEVTERALAHTRKKSVMLTGGVAASPILRNKVRSMASYHGAVADWPPPQLAG 322 Query: 300 DNGAMIAYAGMVRFKAGATADLGVS-VRPRWPLAELP 335 DNGAMIA+ G++ + AG T D+ S V+ RW L + Sbjct: 323 DNGAMIAWTGLLNYLAGITVDVEESVVKQRWRLDVVE 359 >UniRef50_Q6L4N8 Os05g0194600 protein n=21 Tax=Eukaryota RepID=Q6L4N8_ORYSJ Length = 380 Score = 275 bits (704), Expect = 1e-72, Method: Composition-based stats. Identities = 102/337 (30%), Positives = 169/337 (50%), Gaps = 13/337 (3%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LG+E+S ++ GI + +L+N ++ + G +P + H+ +PL++AAL Sbjct: 17 LGLESSANKIGIGVVSLSGEILSNPRHTY--VTPPGHGFLPRETAHHHLAHLLPLLRAAL 74 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLED 123 E+G+T D+ V YT GPG+ L V A R+L+ W P + V+H H+ Sbjct: 75 GEAGVTPADLACVCYTKGPGMGAPLQVAAAAARALSLLWGKPLVGVNHCVAHVEMGRAVT 134 Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYPGGP 181 + V L VSGG+TQ+I+ G+Y + GE+ID A G D+ A++L L D G Sbjct: 135 GAVDP--VVLYVSGGNTQVIAY-SEGRYRIFGETIDIAVGNCLDRFARVLELSNDPSPGY 191 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAA-NTIRDNGTDDQTRADIARAFE 240 + ++A +G P + G+D SFSG+ +F I ++ T AD+ + + Sbjct: 192 NIEQLAKKGE-KFIDLPYVV---KGMDVSFSGILSFIEATAIEKLKNNECTPADLCYSLQ 247 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + + L+ +RA+ K +++ GGV N L+ + M +R G +F +C D Sbjct: 248 ETLFAMLVEITERAMAHCDSKDVLIVGGVGCNERLQEMMRIMCSERGGRLFATDDRYCID 307 Query: 301 NGAMIAYAGMVRFKAGATADLGV-SVRPRWPLAELPA 336 NGAMIAY G++ + G T L + R+ E+ A Sbjct: 308 NGAMIAYTGLLAYAHGMTTPLEESTFTQRFRTDEVHA 344 >UniRef50_Q18KI0 Putative O-sialoglycoprotein endopeptidase n=14 Tax=Euryarchaeota RepID=GCP_HALWD Length = 533 Score = 273 bits (698), Expect = 7e-72, Method: Composition-based stats. Identities = 99/342 (28%), Positives = 149/342 (43%), Gaps = 20/342 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+LGIE + A+Y+ + + D GG+ P A+ +I Sbjct: 1 MRILGIEGTAWAASAALYNTHDETI---VIESDPYQPDSGGLHPREAAEHMSTALPEVIS 57 Query: 61 AALKES----GLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L+ + A IDA+A++ GPGL L V T R+L A VP I V+HM HL Sbjct: 58 TILERAVSSGNTDAIGIDAIAFSRGPGLGPCLRVVGTAARTLTQALSVPLIGVNHMIAHL 117 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 + V L SG + L+ QY++LGE++D G A DK + LG + Sbjct: 118 EIGRHQSGFTTP--VCLNASGANAHLLGYH-RRQYQVLGETMDTGVGNAIDKFTRHLGWN 174 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIA 236 +PGGP + A G+ + G+DFSFSG+ + A + + ++ D+ Sbjct: 175 HPGGPKVEAAATDGSYHDLPYVV-----KGMDFSFSGIMSAAKDAV----DNEVPVVDVC 225 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 ++ + L +RAL TG LV+ GGV N LR L+ M R + Sbjct: 226 TGLQETIFAMLTEVAERALSLTGSNELVLGGGVGQNDRLREMLSTMCTARGASFYAPESR 285 Query: 297 FCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPAA 337 F DN MIA G ++AG T + +V P + + Sbjct: 286 FLRDNAGMIAVLGAAMYEAGQTISVNDSAVDPTFRPDAVTVT 327 >UniRef50_Q2HG58 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2HG58_CHAGB Length = 1550 Score = 272 bits (697), Expect = 8e-72, Method: Composition-based stats. Identities = 106/406 (26%), Positives = 167/406 (41%), Gaps = 77/406 (18%) Query: 3 VLGIETSCDETGIAIYDDEK---GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 L IETSCD+T + + + +L N + +GG+ P+ A + H ++ Sbjct: 1068 TLAIETSCDDTCVTVLEKSGDAARVLFNAKVTSDN--RRFGGIKPDEAVQGHSSSLPGIV 1125 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 QAA+++ D ++ T GPG+ AL +G T+ + LA AWD P + VHHM+ H L P Sbjct: 1126 QAAIQKLPADRPKPDFISVTRGPGITSALSIGLTMAKGLAVAWDRPLVAVHHMQAHALTP 1185 Query: 120 MLED--------------NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEA 165 L + P +PF++LLVSGGH+QL+ + L E+ + A G+ Sbjct: 1186 RLVEALANGQQQPPHQGGARPAYPFLSLLVSGGHSQLLLTRSAVSHATLAEAANVAIGDM 1245 Query: 166 FDKTAKLL--------GLDYPGGPLLSKMAAQ---------------------------- 189 DK A+ + D L + A Sbjct: 1246 LDKCARAILPSDILASTPDVMYAAELERFAFAPTPTQTQTHQTQTQHPSNPYTNYHPPTT 1305 Query: 190 ---------GTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG-TDDQTRADIARAF 239 + P+ +R + F FSGL ++ N D RA++AR Sbjct: 1306 RRDEIRPYTSPTHGWTLTPPLHERRDMAFDFSGLGGQVQAIMQRNPSMDPPQRAELARET 1365 Query: 240 EDAVVDTLMIKCKRALD-------QTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG---- 288 + L + ALD + LV++GGV+AN L L ++ R Sbjct: 1366 MRVAFEHLASRVIFALDGMRTQAAALPVRTLVVSGGVAANGFLMHVLGRVLAVRGYGPEK 1425 Query: 289 -EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLAE 333 V CTDN M+A+AG+ ++AG ++L V R RW + + Sbjct: 1426 VAVVRPPRGLCTDNAVMVAWAGVEMWEAGWESELSVLPRRRWEMDD 1471 >UniRef50_B8MFK9 Glycoprotease family protein, putative n=5 Tax=Leotiomyceta RepID=B8MFK9_TALSN Length = 883 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. Identities = 110/436 (25%), Positives = 166/436 (38%), Gaps = 104/436 (23%) Query: 1 MRVLGIETSCDETGIAIYDDE-----------KGLLANQLYSQVKLHAD---YGGVVPEL 46 + L IE+SCD+T +AI + + G A +++ + AD Y G+ P Sbjct: 426 LLTLAIESSCDDTSVAIVEKDSFHKSFETPRHTGHAAAEVHFLENITADTRKYRGIHPIE 485 Query: 47 ASRDHVRKTVPLIQAALKESGLTAKDI--------------------------DAVAYTA 80 A + H L+Q A++ A+D + ++ T Sbjct: 486 ALQSHQENLAKLVQKAVRSLPPVAEDYSPEDGAVISHIIPKNKNGKSTRHRLPNFISVTR 545 Query: 81 GPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML----------EDNPPEFPF 130 GPG+ L VG + LA AW +P + VHHM+ HLL P L +D P FPF Sbjct: 546 GPGMRSNLSVGLDTAKGLAVAWQIPLVGVHHMQAHLLTPRLVSALNRSVLTDDLQPNFPF 605 Query: 131 VALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL----------------- 173 +++LVSGGH+ L+ + ++E+L + D A GE DK+A+L+ Sbjct: 606 LSILVSGGHSMLVHSKSLLEHEILATTADIAIGETLDKSARLILPESVLESANTTMYGKL 665 Query: 174 --GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKT--------------- 216 +PGGP + D G F+ T Sbjct: 666 LEKFAFPGGPADYADYQALKTRGEEVVKRDNDTWGWSFTTPYANTRDLKFSFSSVSSTVS 725 Query: 217 --FAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG--------------- 259 A D R +AR + L + AL+ Sbjct: 726 RIMANKEKADVRVTRDERVALARESMRVCFEHLASRTLIALELLRKQLRKQYNTSGSGQE 785 Query: 260 FKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 LV++GGV+AN+ L L + R +V P CTDN AMI +AG+ F+AG Sbjct: 786 IDTLVVSGGVAANQFLMTVLRAFLDVRGFSHIKVIAPPPYLCTDNAAMIGWAGIEMFEAG 845 Query: 317 ATADLGVSVRPRWPLA 332 + DL +W L Sbjct: 846 YSTDLSCRAIRKWTLD 861 >UniRef50_A5DGU9 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DGU9_PICGU Length = 408 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. Identities = 110/376 (29%), Positives = 175/376 (46%), Gaps = 44/376 (11%) Query: 2 RVLGIETSCDETGIAIYD--DEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 RVL IE+SCD+ IA+ D D K + +Q+ S + A GGV+P A H + Sbjct: 24 RVLAIESSCDDACIALLDRKDGKTTVIDQVKSTLNSVAA-GGVIPTEAHGFHQYQIASQA 82 Query: 60 QAALKESGLTAKD-IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 ++ +++++ D + T GPG+VG+L G + L+ AWD P + VHHM GHL+ Sbjct: 83 SQFFQKHKISSQNSPDLICCTRGPGMVGSLSAGLQFAKGLSVAWDKPLVGVHHMLGHLMI 142 Query: 119 PML-----EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 L + PP FPF++LL SGGHT L+ + ++++L ++D A G+A DK A+ L Sbjct: 143 ASLTSESQTNPPPRFPFLSLLCSGGHTMLVLSESLAKHQVLVNTVDIACGDALDKCARKL 202 Query: 174 GL-DYPGGPLLS-----------------KMAAQGTAGRFVFPRPMTDRP------GLDF 209 GL G L K + F PM + F Sbjct: 203 GLKGNMLGKELETFVNSFSKEELDEFTKIKTHTRDNPFNFQLKLPMRSPKHPRNAESVQF 262 Query: 210 SF-SGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQT-----GFKRL 263 SF S L T A + + +A + + + ++ + K A+D+ + Sbjct: 263 SFASFLSTLDAYSPPPGMEKSKVTKFLAFKVQQKIFEHIVDRIKLAVDKNETLFANVNDI 322 Query: 264 VMAGGVSANRTLRAKLA----EMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKA-GAT 318 V++GGV++N TLR L + MK+ + CTDN MI AG+ ++ Sbjct: 323 VLSGGVASNSTLRRMLKDGLNDKMKRPNLNFHFPEIALCTDNAIMIGVAGIEIYENLNVV 382 Query: 319 ADLGVSVRPRWPLAEL 334 +DL ++ +WPL +L Sbjct: 383 SDLSITPIRKWPLDQL 398 >UniRef50_A3CXS0 Putative O-sialoglycoprotein endopeptidase n=5 Tax=Euryarchaeota RepID=GCP_METMJ Length = 527 Score = 270 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 92/337 (27%), Positives = 147/337 (43%), Gaps = 25/337 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLG+E + A++ D+ L + GG+ P A++ H ++ Sbjct: 10 LVLGLEGTAWNLSAALFGDDLVALHSS-----PYVPPKGGIHPREAAQHHASAMKEVVSR 64 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E + I AVA++ GPGL +L AT R+L+ A DVP + V+H H+ Sbjct: 65 VLTE----PERIRAVAFSQGPGLGPSLRTVATAARALSIALDVPLVGVNHCVAHVEIGRW 120 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGP 181 + + L SG +TQ++ G+Y + GE++D G DK A+ L +PGGP Sbjct: 121 ATGFSDP--IVLYASGANTQVLGYLN-GRYRIFGETLDIGLGNGLDKFARSHDLPHPGGP 177 Query: 182 LLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFED 241 + ++A +G + G+D +FSGL + A D+ ++ Sbjct: 178 AIERLAREGNYIELPYTV-----KGMDLAFSGLVSAAQ-------ESSAPLEDVCFGLQE 225 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDN 301 + +RAL G +++ GGV AN L+ L M ++R F DN Sbjct: 226 TAFAMCVEVTERALAHAGKDEVLLVGGVGANGRLQEMLRVMCEERGAAFAVPERTFLGDN 285 Query: 302 GAMIAYAGMVRFKAGATADLG-VSVRPRWPLAELPAA 337 GAMIAY G + + G L +RP + E+ A Sbjct: 286 GAMIAYTGKIMLEHGVVLPLDQSQIRPGYRADEVEVA 322 >UniRef50_P36174 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Haloarcula marismortui RepID=GCP_HALMA Length = 548 Score = 268 bits (685), Expect = 2e-70, Method: Composition-based stats. Identities = 97/349 (27%), Positives = 151/349 (43%), Gaps = 24/349 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLA----NQLYSQVKLHADYGGVVPELASRDHVRKTV 56 MR+LGIE + ++++ + D GG+ P A+ Sbjct: 1 MRILGIEGTAWAASASVFETPDPARVTDDDHVFIETDAYAPDSGGIHPREAAEHMGEAIP 60 Query: 57 PLIQAALKE----SGLTAKD---IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPV 109 +++ A++ +G D IDAVA+ GPGL L + AT R++A +DVP + V Sbjct: 61 TVVETAIEHTHGRAGRDGDDSAPIDAVAFARGPGLGPCLRIVATAARAVAQRFDVPLVGV 120 Query: 110 HHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKT 169 +HM HL V L SG + ++ G+Y +LGE++D G A DK Sbjct: 121 NHMVAHLEVGRHRSGFDSP--VCLNASGANAHILGYRN-GRYRVLGETMDTGVGNAIDKF 177 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 + +G +PGGP + + A G + G+DFSFSG+ + A + D Sbjct: 178 TRHIGWSHPGGPKVEQHARDGEYHELPYVV-----KGMDFSFSGIMSAAKQAV----DDG 228 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 ++ R E+ + L +RAL TG LV+ GGV N L+ L EM ++R E Sbjct: 229 VPVENVCRGMEETIFAMLTEVSERALSLTGADELVLGGGVGQNARLQRMLGEMCEQREAE 288 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAELPAA 337 + F DN MIA G + AG T + + + E+ Sbjct: 289 FYAPENRFLRDNAGMIAMLGAKMYAAGDTIAIEDSRIDSNFRPDEVAVT 337 >UniRef50_C8V9Q8 PROBABLE O-SIALOGLYCOPROTEIN ENDOPEPTIDASE (AFU_orthologue; AFUA_7G05240) n=2 Tax=Emericella nidulans RepID=C8V9Q8_EMENI Length = 497 Score = 265 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 106/440 (24%), Positives = 161/440 (36%), Gaps = 107/440 (24%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHAD---YGGVVPELASRDHVRKTVP 57 + L IETSCD+T +AI A +++ + D Y G+ P A H + Sbjct: 32 LLTLAIETSCDDTSVAIVHKNDKSGAAKIHFLENITPDLTAYQGIHPVRALESHQQNVAK 91 Query: 58 LIQAALKESGLTAKD------------------IDAVAYTAGPGLVGALLVGATVGRSLA 99 L+ AL ++ + D ++ T GPG+ L G + LA Sbjct: 92 LVNKALSHLPYSSAESQNDPTKIVSLGDGNRQKPDFISVTRGPGMRSNLFAGLDTAKGLA 151 Query: 100 FAWDVPAIPVHHMEGHLLAPML---------------------EDNPPEFPFVALLVSGG 138 AW VP + VHHM+ HLL P L + P FPF+++L SGG Sbjct: 152 VAWQVPFVGVHHMQAHLLTPRLVSALALSPGSSPNNTDRQNEKGELQPAFPFLSILASGG 211 Query: 139 HTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--------GLDYPGGPLLSKMA--- 187 HT L++ + + + +L + D A GEA DK A+ + + G LL + A Sbjct: 212 HTLLVNSSSLTDHRILATTTDVALGEALDKAAREILPSSLLSTSKNTMYGKLLEQYAFPN 271 Query: 188 ---------------------AQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG 226 + + P L FSF+ L T +T+ Sbjct: 272 GRADYADYVAPKSRGDEIAVSKVVSKYGWSLTTPYAQTRELAFSFAFLATAVNHTLAKAR 331 Query: 227 T-------DDQTRADIARAFEDAVVDTLMIKCKRALDQT--------------------- 258 D+ R +AR + L + AL+ Sbjct: 332 KRAGETGLSDEERVFLAREVMRVTFEHLASRTIIALESLCQWVPLVPNNPNDKRQKPLPS 391 Query: 259 --GFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRF 313 LV++GGV+AN+ L L + R V CTDN AM+ +AG+ F Sbjct: 392 SVPVSTLVVSGGVAANKFLMHVLRTWLDGRGFGHVGVVAPPISLCTDNAAMVGWAGIEMF 451 Query: 314 KAGATADLGVSVRPRWPLAE 333 +AG + +W L E Sbjct: 452 EAGWRSAFEARALRKWGLEE 471 >UniRef50_Q2GXN6 Putative glycoprotein endopeptidase KAE1 n=18 Tax=Eukaryota RepID=KAE1_CHAGB Length = 356 Score = 265 bits (678), Expect = 1e-69, Method: Composition-based stats. Identities = 96/345 (27%), Positives = 160/345 (46%), Gaps = 22/345 (6%) Query: 4 LGIETSCDETGIAIY---DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 LG E S ++ GI + D +L+N ++ + G +P+ ++ H V + + Sbjct: 14 LGCEGSANKLGIGVILHEGDTSTVLSNVRHTF--VSPAGTGFLPKDTAQHHRAFFVRVAK 71 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL ++G+ DID + YT GPG+ G L A R+LA W + V+H GH+ Sbjct: 72 QALSDAGIRIADIDCICYTRGPGMGGPLASVAVAARTLALLWGKELVGVNHCVGHIEMGR 131 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL--DYP 178 V L VSGG+TQ+I+ +Y + GE++D A G D+ A+ L + D Sbjct: 132 TITGADHP--VVLYVSGGNTQVIAYAEQ-RYRIFGETLDIAVGNCLDRFARALNISNDPA 188 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD--------Q 230 G + +A +G P + G+D SFSG+ T A ++ Sbjct: 189 PGYNIEVLARKGGRVLLDLPYAV---KGMDCSFSGILTRAEELAAQMKANEGKGTDGEPF 245 Query: 231 TRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEV 290 T AD+ + ++ V L+ +RA+ G ++++ GGV N L+ + M R G V Sbjct: 246 TGADLCFSLQETVFAMLVEITERAMAHVGSNQVLIVGGVGCNERLQEMMGLMAADRGGSV 305 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGATADLGV-SVRPRWPLAEL 334 + FC DNG MIA+AG++ ++ G + + R+ E+ Sbjct: 306 YATDERFCIDNGIMIAHAGLLAYETGFRTPIEESTCTQRFRTDEV 350 >UniRef50_A3MSX6 Putative O-sialoglycoprotein endopeptidase n=2 Tax=Pyrobaculum RepID=GCP_PYRCJ Length = 339 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 102/335 (30%), Positives = 162/335 (48%), Gaps = 15/335 (4%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 ++G+E++ + + +L + + G+ P A+ H + L + Sbjct: 10 IIGVESTAHTFSLGLV-SGGRVLGQVGKTY--VPPAGRGIHPREAAEHHAKAAPQLFRKL 66 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++E ++ D++AVAY+AGPGL AL VGA R+LA VP +PVHH H+ Sbjct: 67 IEEFNVSLGDVEAVAYSAGPGLGPALRVGAVFARALAIKLGVPLVPVHHGVAHVEIARYA 126 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + + LL+SGGHT +++ G+Y + GE++D A G A D A+ +GL +PG P Sbjct: 127 TGSCDP--LVLLISGGHT-VVAGFSDGRYRVFGETLDVAIGNAIDMFAREVGLGFPGVPA 183 Query: 183 LSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDA 242 + K A FP P+ G D S++GL T+A ++ + R+ + Sbjct: 184 VEKCAEAAE-ELVAFPMPIV---GQDLSYAGLTTYALQLVKRG----IPLPVVCRSLVET 235 Query: 243 VVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNG 302 L +RAL T + LV+AGGV+ +R LR L E+ ++ EV + E+ DNG Sbjct: 236 AYYMLAEVTERALAFTKKRELVVAGGVARSRRLREILYEVGREHGAEVKFVPDEYAGDNG 295 Query: 303 AMIAYAGMVRFKAGATA-DLGVSVRPRWPLAELPA 336 AMIA G ++ G VR RW L + Sbjct: 296 AMIALTGYYAYRRGIAVEPGESFVRQRWRLDTVDV 330 >UniRef50_B9WFF4 Metalloprotease, putative n=8 Tax=Saccharomycetales RepID=B9WFF4_CANDC Length = 440 Score = 264 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 103/396 (26%), Positives = 165/396 (41%), Gaps = 63/396 (15%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLH---ADYGGVVPELASRDHVRKTVPL 58 RV+ IE+SCD++ +A+ + ++ Q K AD GG++P A H+ + Sbjct: 25 RVMAIESSCDDSCVALLEKSHPDTPPKIIDQFKRTLHSADIGGILPTAAYNYHMATIANM 84 Query: 59 IQAALKESGLT-AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +Q + ++ D + T GPG+ G+L + L+ AW VP I VHHM GHLL Sbjct: 85 VQEFCSKHQISALNPPDLLCVTRGPGMAGSLSTSTEFAKGLSVAWGVPLIGVHHMLGHLL 144 Query: 118 APML------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK 171 L + PP++PF++LL SGGHT L+ + ++E++ D A G++ DK A+ Sbjct: 145 TANLPKSEQPDSPPPKYPFLSLLCSGGHTMLVLSKSLTEHEIVVNVGDIAVGDSLDKCAR 204 Query: 172 LLGL-DYPGGPLLSKM------------------AAQGTAGRFVFPRPMTDRP-----GL 207 LG+ G L K F P + + Sbjct: 205 ELGMYGNMLGKELEKYINSIPEETRNRYEKLSVNTRIANPYNFRLTLPYSAPKYGIPEDV 264 Query: 208 DFSFSGLKTFAANTIR-----------DNGTDDQTRADIARAFEDAVVDTLMIKCKRALD 256 F+FS + D D++T+ IA ++ + D ++ + A Sbjct: 265 KFAFSHFLSNIQEYKAMHYNKSGGGEIDVALDEETKQFIAYKTQEFIFDHIVDRINIAFK 324 Query: 257 QTGF------------KRLVMAGGVSANRTLRAKLAEMMKKR-----RGEVFYARPEFCT 299 + G K + +GGV+AN+ LR KL E + + + CT Sbjct: 325 KHGIKNRNSDGTFIGVKDFICSGGVAANKRLREKLRENLDFQEIGADNVNFHFPDLSLCT 384 Query: 300 DNGAMIAYAGMVRFKA-GATADLGVSVRPRWPLAEL 334 DN MI AG+ F+ DL +WPL +L Sbjct: 385 DNAIMIGAAGIEIFEKLRLRTDLSFLPIRKWPLNKL 420 >UniRef50_B7QJD9 O-sialoglycoprotein endopeptidase, putative n=3 Tax=Arthropoda RepID=B7QJD9_IXOSC Length = 309 Score = 262 bits (671), Expect = 9e-69, Method: Composition-based stats. Identities = 102/289 (35%), Positives = 142/289 (49%), Gaps = 30/289 (10%) Query: 73 IDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVA 132 + A+A T PG+ +LLVG R LA P IP+HHME H LA L +FP++ Sbjct: 1 MSAIAVTVRPGMSLSLLVGLNFARRLAAKHGKPLIPIHHMEAHALAVRLV-QRVDFPYLV 59 Query: 133 LLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-------DYPGGPLLSK 185 LLVSGGH QL V I + LLG+++DDA GE FDK A+ L L GG L Sbjct: 60 LLVSGGHCQLAVVRDIDDFLLLGQTMDDAPGETFDKVARRLKLSNLPECRGLSGGRALEF 119 Query: 186 MAAQ--GTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD--------QTRADI 235 +A + G + FP P+T +FSFSGLK I + AD+ Sbjct: 120 LAERDSGNPLAYRFPEPLTSYRTCNFSFSGLKNSVYRKIEALEKEHGLEADALLPEIADL 179 Query: 236 ARAFEDAVVDTLMIKCKRALDQT--------GFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 + + AV L + +RAL G LV+AGGV+AN L L+++ +K Sbjct: 180 CASTQHAVAYHLTRRTQRALAFCDQQGLLPEGKPTLVVAGGVAANAYLGRLLSQLCEKLD 239 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGA----TADLGVSVRPRWPLA 332 P+ C+DNG MIA+ G+ R++A + + + + PR PL Sbjct: 240 VAYVPTPPKLCSDNGLMIAWNGVERWRAASGIVTESFDSLDITPRCPLG 288 >UniRef50_C4Y0N8 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y0N8_CLAL4 Length = 443 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 102/374 (27%), Positives = 169/374 (45%), Gaps = 41/374 (10%) Query: 2 RVLGIETSCDETGIAIYDD---EKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPL 58 RVL IE+SCD++ +++ + +LA A GG++P A H + L Sbjct: 43 RVLAIESSCDDSCVSLLEKKSPNGPVLAIDEIKATLSSAKVGGIIPTAAHEFHSAQISQL 102 Query: 59 IQAALKESGLTAKDI-DAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 + ++ +++ + D + T GPG+VG+L + LA AW P + VHHM GHLL Sbjct: 103 VGEFCRKHEISSSNPPDLLCVTRGPGMVGSLSASIQFAKGLAVAWQRPLVGVHHMLGHLL 162 Query: 118 APML----EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 P L P++PF++LL SGGHT L+ + +E++ ++ D AAG++ DK A+ L Sbjct: 163 TPNLTVEGSSCGPQYPFLSLLCSGGHTMLVLSKSLTNHEIIIDTSDIAAGDSLDKCAREL 222 Query: 174 GL-DYPGGPLLSKMAAQGTA-----------------GRFVFPRPMTDRPG------LDF 209 G GP L K A F PM + F Sbjct: 223 GFEGNMLGPELEKYVANIDPVTKERFAGINTNTDQNEFGFRLRMPMRTAKHKKIPDVIQF 282 Query: 210 SFSGLKTFAA--NTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG-----FKR 262 F+ + + ++QTR +A ++ + D ++ + A + + Sbjct: 283 GFASFLSSVEGFKMKSRDSWNEQTRQFVAFKLQEVLFDHIINRINVAFAKDPQKFALVRD 342 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRR-GEVFYARPEFCTDNGAMIAYAGMVRFKA-GATAD 320 V +GGV+AN+ LRAKL ++ + + P+ CTDN MI AG+ F+ + Sbjct: 343 FVCSGGVAANKVLRAKLMHNIRSASTLKFHFPAPKLCTDNATMIGNAGIDIFENLRLKSR 402 Query: 321 LGVSVRPRWPLAEL 334 L + +WPL ++ Sbjct: 403 LSMLPIRKWPLHDI 416 >UniRef50_P36132 Putative glycoprotein endopeptidase KAE1 n=40 Tax=Eukaryota RepID=KAE1_YEAST Length = 386 Score = 256 bits (655), Expect = 8e-67, Method: Composition-based stats. Identities = 97/371 (26%), Positives = 162/371 (43%), Gaps = 43/371 (11%) Query: 4 LGIETSCDETGIAIY---------------DDEKGLLANQLYSQVKLHADYGGVVPELAS 48 LG+E S ++ G+ I D E +L+N + + G +P + Sbjct: 19 LGLEGSANKLGVGIVKHPLLPKHANSDLSYDCEAEMLSNIRDTY--VTPPGEGFLPRDTA 76 Query: 49 RDHVRKTVPLIQAALKESGLTAK--DIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPA 106 R H + LI+ AL E+ + + DID + +T GPG+ L R+ + WDVP Sbjct: 77 RHHRNWCIRLIKQALAEADIKSPTLDIDVICFTKGPGMGAPLHSVVIAARTCSLLWDVPL 136 Query: 107 IPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAF 166 + V+H GH+ V L VSGG+TQ+I+ +Y + GE++D A G Sbjct: 137 VGVNHCIGHIEMGREITKAQNP--VVLYVSGGNTQVIAY-SEKRYRIFGETLDIAIGNCL 193 Query: 167 DKTAKLLGLD--YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRD 224 D+ A+ L + G + ++A + + P T G+D S SG+ +D Sbjct: 194 DRFARTLKIPNEPSPGYNIEQLAKKAPHKENLVELPYTV-KGMDLSMSGILASIDLLAKD 252 Query: 225 N---------------GTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGV 269 G T D+ + ++ + L+ +RA+ ++++ GGV Sbjct: 253 LFKGNKKNKILFDKTTGEQKVTVEDLCYSLQENLFAMLVEITERAMAHVNSNQVLIVGGV 312 Query: 270 SANRTLRAKLAEMMKKR-RGEVFYARPEFCTDNGAMIAYAGMVRFKA-GATADLGVS-VR 326 N L+ +A+M K R G+V FC DNG MIA AG++ ++ G D + V Sbjct: 313 GCNVRLQEMMAQMCKDRANGQVHATDNRFCIDNGVMIAQAGLLEYRMGGIVKDFSETVVT 372 Query: 327 PRWPLAELPAA 337 ++ E+ AA Sbjct: 373 QKFRTDEVYAA 383 >UniRef50_Q7SD85 Predicted protein n=2 Tax=Sordariaceae RepID=Q7SD85_NEUCR Length = 538 Score = 252 bits (645), Expect = 1e-65, Method: Composition-based stats. Identities = 99/482 (20%), Positives = 172/482 (35%), Gaps = 153/482 (31%) Query: 1 MRVLGIETSCDETGIAIYDDE------------KGLLANQLYSQVKLHADYGGVVPELAS 48 + L IETSCD+T +A+ LL N+ + + +GGV P +A Sbjct: 38 LLTLAIETSCDDTCVALLQSYESTVRTETPEMVARLLFNKKITSDQ--RQFGGVHPAVAV 95 Query: 49 RDHVRKTVPLIQAALK-----------ESGLTAKDIDAVAYTAGPGLVGALLVGATVGRS 97 H R L++ A++ + L + D +A T GPG+ +L G V + Sbjct: 96 EWHQRHLATLVEEAIRSLPEGKTPAYKNTRLPYRAPDLIAVTRGPGMPTSLATGMEVAKG 155 Query: 98 LAFAWDVPAIPVHHMEGHLLAPML------------------------------------ 121 LA AW +P + VHHM+ H L P L Sbjct: 156 LALAWGIPIVGVHHMQAHALTPQLVEALDRPPAPSVASSPWEERQQVDAEVKTASRQQEE 215 Query: 122 -EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD---- 176 + ++P++ LLVSGGHTQL+ + + +L + + A G+ DK A+ + Sbjct: 216 AQHPNLDYPYLNLLVSGGHTQLVYSASLTSHLILCTTDNIALGDMLDKAARKILPPSMLN 275 Query: 177 ------------------YPGGPL--------------LSKMAAQGTAGRFVFPRPMTDR 204 +P G +++ + + P+ Sbjct: 276 SGQNVMYAAALERFAFPRFPAGADEREYNFKYTPPATRAAEIEQHKSPYGWHLSPPLYAS 335 Query: 205 PGLDFSFSGLKTFAANT------------------------------------------- 221 ++++F+GL + A Sbjct: 336 RKMEYNFTGLGSQAQRIAESLDISSSYENHTEHILSLENSPKSGSDLAPSPDSSTTILSP 395 Query: 222 -IRDNGTDDQTRADIARAFEDAVVDTLMIKCKRAL--------DQTGFKRLVMAGGVSAN 272 +++ + R +ARA + L + L +Q K LV++GGV++N Sbjct: 396 ALKEEDHQIEQRRYLARATMQLAFEHLASRIVMVLQQQAKTSCEQQKVKTLVVSGGVASN 455 Query: 273 RTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRW 329 + LR L +++ R + CTDN AMIA+ G ++AG + L + +W Sbjct: 456 QFLRHVLRRVLEVRGFGHIRIMAPPVNLCTDNAAMIAWTGSEMYRAGWVSKLDMLPIKKW 515 Query: 330 PL 331 + Sbjct: 516 SM 517 >UniRef50_Q4U8J6 Glycoprotease, putative n=2 Tax=Theileria RepID=Q4U8J6_THEAN Length = 630 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 93/335 (27%), Positives = 172/335 (51%), Gaps = 31/335 (9%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IETS D+T IA+ + +L+++ SQ ++ +YGG+ P A +H++K L Sbjct: 97 NILSIETSFDDTCIAVVRSDGKILSDKKLSQEEVVKEYGGIKPVCAKLEHIKKIESLTDK 156 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 ++ESGL +DID +A T GPG L VG + L+ + +P + +H+ GH L+P++ Sbjct: 157 VIEESGLKIQDIDEIAVTRGPGTELCLRVGYNYAKELSEKYKIPLVSENHIAGHCLSPLI 216 Query: 122 EDN--------------PPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFD 167 +++ +FP++ LL+SGGH+Q+ V ++ L+ E+ D+ G D Sbjct: 217 DEHQFKYTVEGTPIKSNDLKFPYLCLLLSGGHSQIYLVENPSKFHLMCETQDEFVGNVLD 276 Query: 168 KTAKLLGLDYP--GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFA----ANT 221 K AKLLGLD GG L K+A + + ++ P ++F FSG+++ Sbjct: 277 KCAKLLGLDLSKGGGAELEKIADEVSDSKYKLTIPNKYNHYMEFCFSGVQSQLGLKTEQL 336 Query: 222 IRDNGTDDQTR------ADIARAFEDAVVDTLMIKCKRALDQ----TGFKRLVMAGGVSA 271 ++ + +D R +++A + V + ++I+ + +L+ +L + GGV++ Sbjct: 337 VKSHNVEDAKRLPRKILSELAYGLQSTVFEGILIQLEMSLNAVETLFPINQLALVGGVAS 396 Query: 272 NRTLRAKLAEMMKKRRGEVFYARPE-FCTDNGAMI 305 N L+ + ++ R V ++ E F T M+ Sbjct: 397 NDKLKKMILDLFYLRDESVRFSEQEMFLTRTKNMV 431 Score = 43.3 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 30/65 (46%), Gaps = 7/65 (10%) Query: 275 LRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGAT---ADLGV---SVRPR 328 LR E+ +R +++ ++CTDN MI ++ + + + G + + V PR Sbjct: 549 LRRHGDEISDRR-WDLYTTSKKYCTDNAVMIGFSLIQKNRMGIKEINSPEKINGKDVAPR 607 Query: 329 WPLAE 333 W L Sbjct: 608 WDLGT 612 >UniRef50_A8QDL6 Glycoprotease family protein n=1 Tax=Brugia malayi RepID=A8QDL6_BRUMA Length = 415 Score = 247 bits (630), Expect = 5e-64, Method: Composition-based stats. Identities = 84/324 (25%), Positives = 148/324 (45%), Gaps = 21/324 (6%) Query: 3 VLGIET-SCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 V+GIET CD+T + I + ++ +L+++ Y+ ++ GG+ P + H + Sbjct: 33 VMGIETRHCDDTAVCILNSDRKILSSRRYADREVQKRLGGICPAAVADQHRSYIDLFVDE 92 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L ES + ++D +A T PGLV L VG SLA +P IPVHHM+ H L Sbjct: 93 CLDESRVRLCNLDGIAVTTQPGLVICLRVGTEKAISLARKGCIPLIPVHHMQAHATVATL 152 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYP--- 178 +P+V++L+SGGH+ + G +E+L S+ + GE DK ++ L + P Sbjct: 153 MTE-IXYPYVSVLISGGHSIIAVTNGPDDFEVLLTSMCGSPGECMDKISRALHFEEPELL 211 Query: 179 ---GGPLLSKMAAQGT---AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 G L +A++ + R+ + L F+FS +KT I + Sbjct: 212 GLHPGAALEVIASRSSVDGYKRYPIDVNKFMKMALHFNFSWIKTTYLAMISRQSI--LSV 269 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTG--------FKRLVMAGGVSANRTLRAKLAEMMK 284 D + + ++ + L K L + + ++GGV++N+ + A+ + + Sbjct: 270 PDFCASVQHSIANYLAEKLSCCLQYLNDSNKIPSRNRLVFVSGGVASNKYILARFNNVCE 329 Query: 285 KRRGEVFYARPEFCTDNGAMIAYA 308 V+ +C DN MIA+ Sbjct: 330 PLGYSVYAPSQFYCCDNAEMIAWN 353 >UniRef50_C5FT24 Glycoprotease family protein n=2 Tax=Onygenales RepID=C5FT24_NANOT Length = 492 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 103/456 (22%), Positives = 160/456 (35%), Gaps = 134/456 (29%) Query: 11 DETGIAIYDDEKGLLANQLYSQVKLHA--------------DYGGVVPELASRDHVRKTV 56 D+T +AI + + + Q +Y G+ P ++ H Sbjct: 10 DDTSVAIVEKHGTRINDASSLQTPRPHTTLHFLANITADSREYRGIHPIVSLESHQANLS 69 Query: 57 PLIQAAL----KESGLTAKDIDA-----------------------------VAYTAGPG 83 L+ AL SGL+ K+ DA ++ T GPG Sbjct: 70 DLVDKALWYLPSASGLSHKEPDALRQYASRTIQLSPEGGKARDTVNKLKPDFISVTRGPG 129 Query: 84 LVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDN---------------PPEF 128 + L VG + + LA AW VP + VHHM+ HLL P L D P+F Sbjct: 130 MRSNLSVGLELAKGLAVAWQVPMVGVHHMQAHLLTPRLADALDIPSVEENDSIRALKPDF 189 Query: 129 PFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL-LGLDY-------PGG 180 PF+++L+SGGHT L + ++ L ++D A G+ DK A++ L Y G Sbjct: 190 PFISVLISGGHTFLAHSKSLTDHKTLASTVDVAIGDVLDKFARMALPRSYIDQSKTTMYG 249 Query: 181 PLLSKMA------------------AQGTA-----GRFVFPRPMTDRPGLDFSFSGLKTF 217 L A + + P D + F+F+GL + Sbjct: 250 KQLEAYAFPNGYSDYADYEPPATRGQETKPIINAKYGWSLTLPYPDSKKMAFTFAGLFSA 309 Query: 218 AANTI----------RDNGTDDQT-----------RADIARAFEDAVVDTLMIKCKRALD 256 A + R ++ R + R F + L + AL+ Sbjct: 310 AQRQVDIMVNGKVEQRKKTKEEMDSLNLDFLPHDGRVEFCRDFMRVCFEHLASRIVLALE 369 Query: 257 QT-----------------GFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPE 296 K +V++GGV+AN+ LR L + R ++ Sbjct: 370 NALSSVPNTARKEQIEPGPSVKTIVVSGGVAANQYLRHILRAFLDIRGFSDVDIVAPPLY 429 Query: 297 FCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 CTDN AMI +AG+ F+AG +W L Sbjct: 430 LCTDNAAMIGWAGIEMFEAGWRTSRKSQAIRKWNLD 465 >UniRef50_C1GKA7 Glycoprotease pgp1 n=11 Tax=Onygenales RepID=C1GKA7_PARBD Length = 642 Score = 243 bits (621), Expect = 6e-63, Method: Composition-based stats. Identities = 110/443 (24%), Positives = 164/443 (37%), Gaps = 123/443 (27%) Query: 11 DETGIAIYDDEKG-------LLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 D+T +AI + +L + + Y G+ P +A H T L+ AL Sbjct: 178 DDTSVAIIEKHGVSSPSRSSILFLENITADS--RKYQGIHPAVALDSHQANTAKLVNKAL 235 Query: 64 KESGL----TAKDI----------------------DAVAYTAGPGLVGALLVGATVGRS 97 L +A D+ D ++ T GPG+ L VG + Sbjct: 236 AHLPLAQFPSANDVGRVICLPSSATDGITPHLRRKPDFISVTRGPGMRSNLSVGLDTAKG 295 Query: 98 LAFAWDVPAIPVHHMEGHLLAPMLED-----------------NPPEFPFVALLVSGGHT 140 L+ AW VP + VHHM+ HLL P L N P FPF+++LVSGGHT Sbjct: 296 LSVAWQVPIVGVHHMQAHLLTPRLAASLQQQQLQSSENSSAFRNSPSFPFMSILVSGGHT 355 Query: 141 QLISVTGIGQYELLGESIDDAAGEAFDKTAKLL-------------------GLDYPGGP 181 L+ I +E+L + D A G+A DKTA++L +P GP Sbjct: 356 LLVHSKSIVDHEILASTSDSAIGDALDKTARMLLPQSFLAKSTTTMYGKMLEEFAFPNGP 415 Query: 182 LLSKMAA------------QGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT-- 227 + + F P + ++FSFSG+ T A + + Sbjct: 416 SDYADYRPPATRGEELVKLKSERWGWSFGMPFAENRRMEFSFSGVTTRARDIYLNRRKQW 475 Query: 228 -----------DDQTRADIARAFEDAVVDTLMIKCKRALDQT------------------ 258 + R + ARAF L + AL + Sbjct: 476 EAAGNSGEGFMSNDERIEFARAFMTVCFGHLASRTIIALQELRRQQQQQQQQQQQQEREN 535 Query: 259 ------GFKRLVMAGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAG 309 + L+++GGV AN+ L+ + R +V P CTDN AMI +AG Sbjct: 536 QSPPAEDIQSLIISGGVGANQFLKKLFRSYLDIRGFPHVDVIAPPPYLCTDNAAMIGWAG 595 Query: 310 MVRFKAGATADLGVSVRPRWPLA 332 + F+AG +DL +W L Sbjct: 596 IEMFEAGWRSDLRCRPLRKWTLD 618 >UniRef50_A8BDD4 O-sialoglycoprotein endopeptidase n=2 Tax=Giardia intestinalis RepID=A8BDD4_GIALA Length = 396 Score = 240 bits (614), Expect = 4e-62, Method: Composition-based stats. Identities = 98/392 (25%), Positives = 159/392 (40%), Gaps = 65/392 (16%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LG+E S ++ G+ I D + AN + G P + H + + LI+ A Sbjct: 2 ILGLEGSANKLGVGIVDASGVVHANLRSTYNA--PPGQGFQPNDVAAHHRQHIIGLIERA 59 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L E+ +++ I +AYT GPGL L A V R+L+ W VP + V+H H+ L Sbjct: 60 LLEAEISSDKITHIAYTRGPGLGAPLAAVAVVARTLSQLWKVPLLAVNHCVAHIEMGRLV 119 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD--YPGG 180 P V L SGG+TQ+I+ G+Y + GE++D A G A D+ A+ L + G Sbjct: 120 TQLPNP--VVLYASGGNTQVIAY-SQGRYRVFGEALDIAVGNALDRIARYLLISNTPAPG 176 Query: 181 PLLSKMAAQ---------------------------------------------GTAGRF 195 + ++AA+ G + Sbjct: 177 LNIERLAAEWAAIFREEDCVHLDPDIVPRYTTLPRSKELLKEQLELYSANHPEAGIDTSY 236 Query: 196 VFPRPMT---DRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCK 252 P T G+D S SG+ T+ + + + D I + ++ + +L+ + Sbjct: 237 DIPIITTIPVPIKGMDISCSGISTYLKTYVETHTSLDPRL--ICYSLQETLFGSLVEITE 294 Query: 253 RALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVR 312 RA G ++ GGV N L+ L M +R G + +C DNGAMIA+ G Sbjct: 295 RAAAHVGAADILAVGGVGCNLRLQEMLQIMAAERNGRLGAMDDSYCVDNGAMIAWCGACM 354 Query: 313 FKAGATADL--------GVSVRPRWPLAELPA 336 +A + DL +V R+ + Sbjct: 355 LQAPLSMDLLIPYTEVNCATVTQRYRTDSVDV 386 >UniRef50_A6TR37 O-sialoglycoprotein endopeptidase n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TR37_ALKMQ Length = 330 Score = 239 bits (611), Expect = 9e-62, Method: Composition-based stats. Identities = 71/324 (21%), Positives = 143/324 (44%), Gaps = 19/324 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGI+TS T +AI + + LL+ + S + + G+ A H++ L + Sbjct: 8 ILGIDTSNYMTSLAIMNLQGALLSEER-SLLPVKTGNLGLRQSDALFHHIKNLPVLCKKL 66 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +++ + + +I ++ + P + L + S+A +VP H EGH+ Sbjct: 67 MQQ--VDSINIVGISASVKPRPLADSYMPVFLASQSFATSMASLMNVPFYSFSHQEGHIE 124 Query: 118 APMLE-DNPPEFPFVALLVSGGHTQ---LISVTGIGQYELLGESIDDAAGEAFDKTAKLL 173 A F+ L +SGG T+ ++ E++G S D +AG+ D+ L Sbjct: 125 AGFWSQARTCTQEFLVLHISGGTTEMLKVVPYDNRYDIEIVGGSKDISAGQLIDRIGVRL 184 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 + +P GP L ++ + + P + + +FSGL+T + + + Sbjct: 185 DMPFPAGPHLESLSLEWQGPKIKLPISVKEGW---VNFSGLETHITRLLN----QEYSSQ 237 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 IA + + +L++ K A Q+ K ++ GGV++N+ +R + + + EV + Sbjct: 238 QIASSLFHTIGQSLVLMIKTAKFQSLIKTALVVGGVASNQQIRTLIEKELSSENIEVLFG 297 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGA 317 + ++C+DN IA G+ + Sbjct: 298 QTQYCSDNAVGIAALGVKSYLNRN 321 >UniRef50_B2A533 O-sialoglycoprotein endopeptidase n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A533_NATTJ Length = 322 Score = 239 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 80/327 (24%), Positives = 149/327 (45%), Gaps = 24/327 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 +G++TSC T +A+ + + ++A + +++ GG+ A H+ + Sbjct: 1 MGLDTSCYTTSMAVINKQGKIIA-KTERPLEVAMGKGGLRQSEAVFQHINNLPQGLTEIK 59 Query: 64 KESGLTAKDI--DAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 K+ + D+ A+A ++ P + VG + ++L+ + +P + H EGH+ Sbjct: 60 KQLNVNNLDLNLAAIAVSSRPRPIEGSYMPVFKVGDSYAKALSLSSGIPLLEYTHQEGHI 119 Query: 117 LAPMLEDNPPE-----FPFVALLVSGGHTQLISVTGIG-----QYELLGESIDDAAGEAF 166 + + E + F+ VSGG T+L+ G E++G + D AAG+ Sbjct: 120 ASIVYEKSNNIRLEDMDKFLVFHVSGGTTELLICHTKGKFSSFDIEIIGGTKDIAAGQLI 179 Query: 167 DKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG 226 D+TAKL+ L +PGGP L K+ Q P + D +FSG +T I + Sbjct: 180 DRTAKLMNLPFPGGPHLEKLGDQSGQTDISVPFSVEDTK---INFSGPETHIKRLIHN-- 234 Query: 227 TDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR 286 +D + +AR E V +L+ + AL + K ++ GGV +N ++ L++ + Sbjct: 235 -EDYPKPAVARGIEQCVAKSLLTVLENALKKHQVKNILFVGGVMSNSYIKNYLSKNISNE 293 Query: 287 RGEVFYARPEFCTDNGAMIAYAGMVRF 313 + + + PE DN +A+ G F Sbjct: 294 KYNLIFGSPELSKDNAVGVAWLGYNNF 320 >UniRef50_A8WMS3 Putative uncharacterized protein n=1 Tax=Caenorhabditis briggsae RepID=A8WMS3_CAEBR Length = 386 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 91/297 (30%), Positives = 152/297 (51%), Gaps = 18/297 (6%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYG-GVVPELASRDHVRKTVPLIQA 61 VLGIE S ++ G+ I + +L+N + HA G G P ++ H ++ V L+ Sbjct: 4 VLGIEGSANKIGVGIIR-DGVVLSNPRAT---FHAPPGEGFRPTETAQHHRQQIVRLVGE 59 Query: 62 ALKESGLT--AKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 A++E+G+ K+ID +A+T GPG+ L VGA V R+L+ W P IPV+H GH+ Sbjct: 60 AIREAGIQDPEKEIDGIAFTKGPGMGAPLQVGAIVARTLSLRWQKPIIPVNHCVGHIEMG 119 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD--Y 177 L V L VSGG+TQ+ +Y + GE+ID A G D+ A++L L Sbjct: 120 RLITGADNP--VVLYVSGGNTQVFLPNK--RYRIFGETIDIAVGNCLDRFARVLKLPNAP 175 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAA-NTIRDNGTDDQTRADIA 236 G + ++A G +F P T + +D S SG+ + + + + T AD+ Sbjct: 176 SPGYNIEQLAKSGAK---LFELPYTVKARMDVSLSGILSCIESRAPQLLESREYTPADLC 232 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRA-KLAEMMKKRRGEVFY 292 + ++ V L+ +RA+ TG + L++ GGV N L+ ++ ++K R ++ + Sbjct: 233 FSLQETVFAMLIEITERAMAHTGSRELLIVGGVGCNLRLQVLEIVFLVKIRLKKLIF 289 >UniRef50_A8MFJ2 O-sialoglycoprotein endopeptidase n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MFJ2_ALKOO Length = 328 Score = 237 bits (606), Expect = 4e-61, Method: Composition-based stats. Identities = 76/325 (23%), Positives = 148/325 (45%), Gaps = 18/325 (5%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LG++TS T +++ + L+ + H G+ A HV++ L Sbjct: 6 ILGLDTSNYTTSMSLMSLDGELVYDARKLLPVDH-GKRGLRQSEALFYHVQQLPYLSNEI 64 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 ++S I A++ + P + L + G + + +P H EGH+ Sbjct: 65 SQKSDEF--HIVAISASTRPRPVEDSYMPVFLAAKSYGEITSNLFHIPFYEFSHQEGHIE 122 Query: 118 APMLEDN-PPEFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKTAKLL 173 A + +N + F+A+ +SGG T+++ V E++G + D +AG+ D+ + Sbjct: 123 AALWSENIHMKEEFIAIHISGGTTEVLVVKPRDIGYDIEIIGGTSDLSAGQFIDRVGVAM 182 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 GL++P G L +++ + P +T SFSG +T + I++ + ++A Sbjct: 183 GLEFPSGKSLEEISRGCSELSLNVPVSVTKNK---ISFSGPETHFSRLIKE---SNASKA 236 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYA 293 DIA V +L + K Q K L++ GGV++N +R+ L E + +++A Sbjct: 237 DIAYGVFHCVARSLELLVKNIGKQYPIKNLLIVGGVASNNQIRSYLLEKLAPENIHIYFA 296 Query: 294 RPEFCTDNGAMIAYAGMVRFKAGAT 318 P++CTDN I+ G+ ++ + Sbjct: 297 APKYCTDNAVGISSLGVSKYLKQNS 321 >UniRef50_C7DHT9 Metalloendopeptidase, glycoprotease family n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHT9_9EURY Length = 324 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 94/333 (28%), Positives = 160/333 (48%), Gaps = 18/333 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M V+GIE+S G+ I + K +LAN+ G++P + H + +I+ Sbjct: 1 MAVIGIESSAHTFGVGIVEKGK-ILANEKMMY---PISDKGIIPAKVAEYHAKNASAVIR 56 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + +DI+AV YT GPGL L +G ++L +P P++H GH+ Sbjct: 57 RALSVAHAALEDIEAVGYTKGPGLGPCLEIGMLAAKTLHEKLGIPIYPINHAVGHIEITK 116 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 + + L VSGG++Q++S+ G G Y + GE++D G D A+ G+ G Sbjct: 117 HLSGFADP--IVLYVSGGNSQILSLAG-GHYHVHGETLDIGVGNMLDNFARAAGMKPAWG 173 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE 240 ++K A G R + G+DF+F+GL T A T+ + AD++ + + Sbjct: 174 STVAKFATGGKYVRLPYTV-----KGMDFTFTGLLTAAIKTL-----PSSSIADVSFSIQ 223 Query: 241 DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTD 300 + L+ +RAL +G +++ GGV+ + LR LA M + + A +F D Sbjct: 224 ETAFSMLVEATERALLLSGKDSVILCGGVAQSLRLREMLATMSASHKKRFYVADNQFNAD 283 Query: 301 NGAMIAYAGMVRFKAGA-TADLGVSVRPRWPLA 332 NGAMIAY ++G A +++ ++ + Sbjct: 284 NGAMIAYVAEKMDESGYAPARSDLTINQKFRIE 316 >UniRef50_UPI0000E8089C PREDICTED: similar to Osgepl1 protein n=1 Tax=Gallus gallus RepID=UPI0000E8089C Length = 513 Score = 230 bits (588), Expect = 4e-59, Method: Composition-based stats. Identities = 87/235 (37%), Positives = 124/235 (52%), Gaps = 8/235 (3%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLGIETSCD+TG A+ D+ +L L SQ ++H GG++P +A + H +++ Sbjct: 110 LVLGIETSCDDTGAAVLDEAGTVLGEALQSQKEVHLKAGGIIPHVAQQLHRESIQQVVKE 169 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 AL SG++ ++ A+A T PGL +L VG L + P IP+HHME H L L Sbjct: 170 ALSASGVSVNELAAIATTVKPGLALSLEVGLQYSLQLVDRYQKPFIPIHHMEAHALTIRL 229 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL------ 175 + EFPF+ LL+SGGH L G+ + LLG+SID A G+ DK A+ L L Sbjct: 230 TEQ-VEFPFLVLLLSGGHCILAVARGVSDFLLLGQSIDIAPGDMLDKVARRLSLVKHPEC 288 Query: 176 -DYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 GG + +A G ++ F PM DFSFSGL++ I ++ Sbjct: 289 HGMAGGKAIEHLAQTGDWQQYTFRLPMQQYRNCDFSFSGLQSLVNKAILQKEKEE 343 >UniRef50_C0ZC04 Peptidase M22 family protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZC04_BREBN Length = 320 Score = 229 bits (585), Expect = 9e-59, Method: Composition-based stats. Identities = 85/327 (25%), Positives = 144/327 (44%), Gaps = 24/327 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGI+TS T + + +++ ++A +K+ G+ A HV Sbjct: 5 MLGIDTSNYRTSLCLAEEDGRIVAEAKR-LLKVKEGKRGLQQSEAVFQHVMNLP----EL 59 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 E +I A+ + P + VG + +SLA VP H EGH+ Sbjct: 60 SDEMKWKDYEIAAICVSEKPRPQDGSYMPVFKVGEGLAKSLATYLRVPLHLTTHQEGHIA 119 Query: 118 APMLED--NPPEFPFVALLVSGGHTQLISVTGI---GQYELLGESIDDAAGEAFDKTAKL 172 A P E F+A+ +SGG ++L+ E +G +ID AG+ D+ Sbjct: 120 AGEYTAEVRPTEDRFLAVHLSGGTSELLLCERHAAGYTIEKIGGTIDLHAGQLVDRIGVA 179 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL +P GP L ++A + T R + GL FSFSG + + T Sbjct: 180 LGLSFPAGPALEQLAKEATGEF----RVSSAVDGLSFSFSGPEASLLREVEKGSTS---P 232 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG--EV 290 A+IARA E + + L + A++Q K +++ GGV+AN +R +L + ++ ++ Sbjct: 233 AEIARATEQCIANALEKSLRHAVEQGYPKDILIVGGVAANYYIRERLIKRLEHPAVKAKL 292 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKAGA 317 ++ P + DN +A G ++ KA Sbjct: 293 YFCDPVYSGDNAYGVAMLGWMKQKANI 319 >UniRef50_C0GE31 O-sialoglycoprotein endopeptidase n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GE31_9FIRM Length = 307 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. Identities = 94/311 (30%), Positives = 142/311 (45%), Gaps = 20/311 (6%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TSC T +A+ D + LL + + + + G+ H++ L + Sbjct: 3 LGIDTSCYTTSLAVMDTQGRLLCEKR-TLLTVPKGERGLRQSDGVFQHLQNLPRLAEEVA 61 Query: 64 KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 E G + AVA + P + VG + GRSLA A+ VP + + H EGH+LA Sbjct: 62 GEVG--PLKLQAVAASVCPRPVEGSYMPVFTVGTSFGRSLAAAFGVPFLSLSHQEGHILA 119 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYEL--LGESIDDAAGEAFDKTAKLLGLD 176 M F AL VSGG T+L+ V ++ LG S D AG+ D+ LGL Sbjct: 120 GMWSAGVDWPEFYALQVSGGTTELLFVRQNNGLKVAELGGSADLHAGQFIDRVGVALGLS 179 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIA 236 +P GP + K+ G V P P++ + G + SFSG ++ I + A +A Sbjct: 180 FPAGPAVEKL---GNDALEVLPVPVSVQ-GSNLSFSGPESHVQRVIASG---EYAPAAVA 232 Query: 237 RAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPE 296 R E V ++L + + G K ++ GGV AN+ +R L +K E +A+ Sbjct: 233 RGVEKCVAESLWRVLRTVRKEHGAKPVLFVGGVMANQFIRGFL---AEKLGDEAAFAQIR 289 Query: 297 FCTDNGAMIAY 307 F DN A A Sbjct: 290 FAGDNAAGAAV 300 >UniRef50_D1BMJ2 Metal-dependent protease with possible chaperone activity n=3 Tax=Veillonella RepID=D1BMJ2_VEIPT Length = 317 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 82/319 (25%), Positives = 142/319 (44%), Gaps = 13/319 (4%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TSC T AI D++ ++ +++ G+ H + L+ L Sbjct: 6 LGIDTSCYTTSCAIIDNDFHIVGEARKI-LEVKLGERGLQQSNMVFQHTKALPKLMSE-L 63 Query: 64 KESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE- 122 + ++ + + A +VG G++L+ +VP H E H+LA + + Sbjct: 64 PQVPISGIGVSGFPRREERSYMPAFMVGLGQGQTLSHLMNVPLHIFAHQENHILAALRDL 123 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIG----QYELLGESIDDAAGEAFDKTAKLLGLDYP 178 N P PF+AL +SGG T+L+ G + ++G S D G+ D+ LGL +P Sbjct: 124 KNIPNEPFLALHLSGGTTELVYCHYQGNGIFESHIVGGSKDLQGGQYVDRIGVALGLPFP 183 Query: 179 GGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARA 238 G L +A Q T P P + + G SF+G + A I +N D ++ +ARA Sbjct: 184 AGKHLEALALQTTEYE---PLPSSVKDGW-ISFAGPCSAAMRRI-NNAMSDIDKSKLARA 238 Query: 239 FEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFC 298 ++ + L + + L+ GGV +N LR ++ K+ ++ A+P+F Sbjct: 239 VFTSIGNALEKMITYHTKEKSVRALIAVGGVISNSLLRKRMETYCKRNHLQLHVAQPQFS 298 Query: 299 TDNGAMIAY-AGMVRFKAG 316 DN A+ A ++ G Sbjct: 299 VDNATGNAFGAAYLQESRG 317 >UniRef50_A4RG35 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4RG35_MAGGR Length = 596 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. Identities = 109/485 (22%), Positives = 174/485 (35%), Gaps = 155/485 (31%) Query: 1 MRVLGIETSCDETGIAIYDDEK------GLLANQLYSQVKLHADYGGVVPELASRDHVRK 54 + L IETSCD+T +A+ + E+ +L +Q + ++ +GG+ P H Sbjct: 66 LLTLAIETSCDDTCVALVEKERGPGGAARVLFHQRAT--ADNSMFGGINPLPTLESHTAL 123 Query: 55 TVPLIQAALK----------------------ESGLTAKDIDAVAYTAGPGLVGALLVGA 92 ++++A+ +S + + D V+ T GPG+ AL VG Sbjct: 124 LAKMVRSAVNALPQDAATGNSSFSTAFTRSKPDSSIPRRLPDFVSVTRGPGMAAALSVGL 183 Query: 93 TVGRSLAFAWDVPAIPVHHMEGHLLAPML------------------------------- 121 + + LA AW VP + VHHM+ HLL P L Sbjct: 184 STAKGLAVAWKVPLVGVHHMQAHLLTPRLMSAMRKPFYEWEKERAALTREAFVSEKEEKS 243 Query: 122 -------------------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAA 162 E + P +PF LLVSGGHT L+ + Q+ + E AA Sbjct: 244 GSLKKARSSQSDPKAQDPKEYDWPRYPFFTLLVSGGHTMLMRSKNLVQHSTVAEVEGFAA 303 Query: 163 GEAFDKTAK-LLGLDYPG-----GPLLSKM------------------------------ 186 G+A DK A+ +L Y G G LL + Sbjct: 304 GDALDKCARAILPPKYQGKTSSFGQLLEEFVFPKNLKDYSSVYRAPRNRAEHSSTVSPRR 363 Query: 187 -------------------AAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT 227 G + M + + ++F GL +++ Sbjct: 364 RVLMDRAAERRSPLNIIYTEENGPRYPWALKPMMAESREMKYAFGGLLDQVLRIVKERTA 423 Query: 228 ----DDQTRADIARAFEDAVVDTLMIKCKRALDQTG-------------FKRLVMAGGVS 270 D + R + + + L + +L RL+M+GGV+ Sbjct: 424 AGAFDLEERRVLGYETMRIMFEHLASRVVLSLTSYRDSSRKKNPGQGPTAARLLMSGGVA 483 Query: 271 ANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 +N+ LR + M++ +V P C DN AMI +AG+ F+ G T DLGV + Sbjct: 484 SNKFLRYVVRSMLEAYHFNPVQVIGPPPHLCVDNAAMIGWAGLEMFEEGFTTDLGVLPKK 543 Query: 328 RWPLA 332 +W L Sbjct: 544 KWSLD 548 >UniRef50_D2RJI3 Peptidase M22 glycoprotease n=2 Tax=Acidaminococcus RepID=D2RJI3_ACIFE Length = 319 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 84/320 (26%), Positives = 135/320 (42%), Gaps = 23/320 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLG++TSC T A+ D LL +Q +++ + G+V H R Sbjct: 7 VLGLDTSCYTTSAALMDLHGHLLGDQRR-LLRVKPGHRGLVQSEMVFQHTRNLP----DL 61 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 L+ L+ + A+ +A P + A LVG + RSL +P H H+ Sbjct: 62 LEALDLSGVQVKAIGVSAKPRPREESYMPAFLVGLGMARSLGKLMGLPVHRFTHQHNHMF 121 Query: 118 APMLEDNPPEFP-FVALLVSGGHTQLISVTGIGQY----ELLGESIDDAAGEAFDKTAKL 172 A + P F+ + +SGG T L+ E G SID AG+ D+ Sbjct: 122 AGLWSVGKPAPDRFLLVHISGGTTDLLLCERQPDGNFSLEPRGTSIDLHAGQFIDRVGVA 181 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LGL +P G L K+A + P + R G + S SG T I + Sbjct: 182 LGLPFPAGAPLEKLAETASE---AHPLKVWSREG-ELSLSGPCTQTLRAIEKG----EDP 233 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 A +A E A+ L ++ ++++AGGVSANR +R +L + + +R+ ++ Sbjct: 234 AALALGVEQAIGKALARTISWVCEKEQLSQVLLAGGVSANREIRRQLEDFLGQRQIGLWA 293 Query: 293 ARPEFCTDNGAMIAYAGMVR 312 P + D A+A ++R Sbjct: 294 PDPRYSVDGAVGNAWAALLR 313 >UniRef50_A6S1G0 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6S1G0_BOTFB Length = 323 Score = 222 bits (567), Expect = 1e-56, Method: Composition-based stats. Identities = 75/305 (24%), Positives = 128/305 (41%), Gaps = 56/305 (18%) Query: 84 LVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML--------------EDNPPEFP 129 + L+ G + LA AW +P + V+HM+ H L P + +N P +P Sbjct: 1 MRANLITGIDTAKGLAVAWQIPLLGVNHMQAHALTPRMVSALEAGNNSKTEKHENDPAYP 60 Query: 130 FVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAK------------------ 171 F++LLVSGGHT L+ + +E+L + D A G+ DKTA+ Sbjct: 61 FLSLLVSGGHTMLVHSRQLCDHEILATTSDLAVGDMVDKTARDILPASVIESASDVMYGR 120 Query: 172 -LLGLDYPGG-------PLLSKMAAQGTAGRFV--FPRPMT-------DRPGLDFSFSGL 214 + +P P +A ++ P +FS+SG+ Sbjct: 121 VMEEFAFPDANSSYDYEPSHKSIAQTSRPTKYEWTLTPPYMSTGHRPLKSYNSEFSYSGV 180 Query: 215 KTFAANTIRDNGTDDQ-TRADIARAFEDAVVDTLMIKCKRALDQTGFK---RLVMAGGVS 270 + + N D R +A+ + L + L++ K LV++GGV+ Sbjct: 181 GSQIKRIMNRNPEMDIAERRLLAQETMRVAFEHLASRVILNLERPDLKDTKTLVVSGGVA 240 Query: 271 ANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRP 327 AN+ L+ L ++ + + P+FCTDN AMI + G+ ++AG +DL + Sbjct: 241 ANQYLKYILRSLLDAWGHKTMRLIFPPPKFCTDNAAMIGWTGIEMWEAGWRSDLDILAAR 300 Query: 328 RWPLA 332 +WP+ Sbjct: 301 KWPID 305 >UniRef50_Q2RIB0 O-sialoglycoprotein endopeptidase n=5 Tax=Clostridia RepID=Q2RIB0_MOOTA Length = 321 Score = 220 bits (562), Expect = 4e-56, Method: Composition-based stats. Identities = 84/322 (26%), Positives = 139/322 (43%), Gaps = 19/322 (5%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +LGI+TS A + LLA + + G+ A HV+ ++ Sbjct: 1 MAILGIDTSAYTCSAAAVSQDGELLAAHRR-LLPVPPGERGLQQATAVFHHVQILPEVLS 59 Query: 61 AALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 + + A I V + P + V A GR LA A VP H EGH Sbjct: 60 EVF--AAVPAARIRRVVASVKPRPVEGSYMPVFTVAAGQGRILAAALGVPFRATTHQEGH 117 Query: 116 LLAPMLED-NPPEFPFVALLVSGGHTQLISVT---GIGQYELLGESIDDAAGEAFDKTAK 171 + A + P F+A+ +SGG ++++ V+ G E LG ++D AG+ D+ Sbjct: 118 IQAGLWSSGWQPSDSFLAVHLSGGTSEVLLVSRKPGGFTIEKLGGTLDLHAGQLVDRAGV 177 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 L+GL++P GP L ++A + + G +FSFSG + A + Sbjct: 178 LMGLEFPAGPALERLAREAGPEMEKVHL-TSAVRGYNFSFSGPASQAERLLAAGAPPAAV 236 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRG--E 289 A E + +TL + A++ TG + +++ GGV+AN LR +L ++ Sbjct: 237 AR----AVEQCIANTLERVLRPAVEATGLRDILIVGGVAANNYLRQRLRHRLEHPAVAAR 292 Query: 290 VFYARPEFCTDNGAMIAYAGMV 311 + +A PE +DN +A G+ Sbjct: 293 LHFAAPEHSSDNAIGVALLGLE 314 >UniRef50_A7APL5 Glycoprotease family protein n=1 Tax=Babesia bovis RepID=A7APL5_BABBO Length = 406 Score = 220 bits (561), Expect = 5e-56, Method: Composition-based stats. Identities = 72/298 (24%), Positives = 131/298 (43%), Gaps = 30/298 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IETSCD+ A+ +++ + S +GG+ P+ + R H+ ++ Sbjct: 101 ILAIETSCDDCCAAVVSSNGDVVSEERASNPDSLIKFGGIKPDESYRFHLDNIDRIMNEV 160 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML- 121 + ++ L +DI + T GPG+ L G ++ + +P I +H+ GH L+P + Sbjct: 161 VSKAKLKFEDIGYIVATRGPGMRICLNAGYDAAERISKTYSIPLIGENHLAGHCLSPFIK 220 Query: 122 -------------EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDK 168 ++P+++LL+SGGH+Q+ V QY +L +++D AG K Sbjct: 221 GHQLRMTHDRGSVASEELKYPYLSLLLSGGHSQIYVVESPYQYHMLVDTMDHYAGNVLYK 280 Query: 169 TAKLLGLDY--PGGPLLSKMAAQGTAGR-FVFPRPMTDRPGLDFSFSGLKTFAANTIRD- 224 AK LGL GGP + + A + F P F FSG++T + + Sbjct: 281 CAKELGLPIDTGGGPSIEEAARKRQGRPMFRMTEPCKGMSFTSFCFSGIQTQLRSMVSKI 340 Query: 225 --------NGTDDQTRADIARAFEDAVVDTLMIKCKRALDQT----GFKRLVMAGGVS 270 D + +A ++ + ++ + +ALD G ++V+ GG S Sbjct: 341 RQDLGEDALSEDPKLVNHLAYTCQEVTFNQVIRQLDKALDICETLFGISQIVVVGGRS 398 >UniRef50_C5KYH6 Glycoprotein endopeptidase, putative n=4 Tax=Perkinsus marinus ATCC 50983 RepID=C5KYH6_9ALVE Length = 298 Score = 220 bits (560), Expect = 8e-56, Method: Composition-based stats. Identities = 86/232 (37%), Positives = 129/232 (55%), Gaps = 26/232 (11%) Query: 124 NPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGL-DYPGGPL 182 + PEFPFV LLVSGGH + G+G + +LG ++DD+ GE FDK A+LL + D PGGP+ Sbjct: 25 HRPEFPFVTLLVSGGHNMAVLTRGMGDHIILGSTLDDSVGECFDKVARLLDIHDVPGGPV 84 Query: 183 LSKMAAQGTAGR--FVFPRPMTD------RPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 L K+A++G +P+ + G DFSF+GLKT + I ++ D Sbjct: 85 LEKLASEGNPRACLRELAKPLAKTRDLELKNGCDFSFAGLKTSMRHLIEGGK---YSKPD 141 Query: 235 IARAFEDAVVDTLMIKCKRALDQT-----GFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 +A +F+ VD L+ + RA+D K LV+AGGV+AN+++R+ + E+ K++ Sbjct: 142 MAASFQKRCVDHLVERAGRAIDWALEIDGSIKDLVVAGGVAANKSVRSNMQELAKEKGLM 201 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGAT---------ADLGVSVRPRWPLA 332 ++ CTDNG M+A+ + K G A+ V VRPRWPL Sbjct: 202 LYCPPTRLCTDNGTMVAWNAIEHLKEGLYERAPCTAESAEKFVEVRPRWPLG 253 >UniRef50_Q97ZY8 Putative O-sialoglycoprotein endopeptidase n=1 Tax=Sulfolobus solfataricus RepID=GCP_SULSO Length = 246 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 82/252 (32%), Positives = 128/252 (50%), Gaps = 17/252 (6%) Query: 88 LLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTG 147 + VGAT+ R++A ++ +PV+H GH+ L + + L +SGG+T +I+ Sbjct: 1 MRVGATLARAIALKYNKKLVPVNHGIGHIEIGYLTTEARDP--LILYLSGGNT-IITTFY 57 Query: 148 IGQYELLGESIDDAAGEAFDKTAKLLGLDYP----GGPLLSKMAAQGTAGRFVFPRPMTD 203 G++ + GE++D A G D + + L P G ++ A +G + P Sbjct: 58 KGRFRVFGETLDIALGNMMDVFVREVSLAPPYIINGIHVIDICAEKGNK---LLKLPYVV 114 Query: 204 RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRL 263 G D SFSGL T A + + DI + + D L+ +RAL T K L Sbjct: 115 -KGQDMSFSGLLTAALRVVGK-----EKLEDICYSVREIAFDMLLEATERALALTSKKEL 168 Query: 264 VMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGV 323 ++ GGV+A+ +LR KL E+ K+ ++ PEF DNGAMIAYAGM+ G D+ Sbjct: 169 MIVGGVAASVSLRKKLEELGKEWNVQIKIVPPEFAGDNGAMIAYAGMLAASKGVFIDVDK 228 Query: 324 S-VRPRWPLAEL 334 S +RPRW + E+ Sbjct: 229 SYIRPRWRVDEV 240 >UniRef50_C9LLA9 Glycoprotease family protein n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLA9_9FIRM Length = 319 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. Identities = 78/321 (24%), Positives = 133/321 (41%), Gaps = 21/321 (6%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 + LGI+TSC T A+YD +G++ + + A G+ HVR P+I Sbjct: 3 KFLGIDTSCYTTSAAVYDSTEGIVGESRII-LSVKAGKRGLSQSEMVFQHVRNL-PVI-- 58 Query: 62 ALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 L + I+ + + P + A LVG + SL+ VP H E H Sbjct: 59 -LGQLEPWIDQINGIGVSVFPRRRADSYMPAFLVGKGMAESLSHVLRVPVFEFSHQENHA 117 Query: 117 LAPMLEDNPPEF-PFVALLVSGGHTQLISV---TGIGQYELLGESIDDAAGEAFDKTAKL 172 LA + PF + +SGG ++SV I Q L S D AG+ D+ Sbjct: 118 LAAIQNMPEIWGTPFYMMHLSGGTQDVLSVEWEKDIMQIVDLIHSADITAGQFIDRVGVS 177 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LG+ +P GP + ++A + + ++ P+ + FSF+G + I+ T Sbjct: 178 LGMPFPAGPSMERLAMK---HQQLYKVPVANVKN-GFSFAGPEAQVQRDIQTKR---YTP 230 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 DIA ++ +L + + GGV +N LR + E+ + + + Sbjct: 231 EDIAYGVFSSIGKSLHKVLDSYNGFIEGRTFIAVGGVMSNGYLRKSITEICRHKSLHPCF 290 Query: 293 ARPEFCTDNGAMIAYAGMVRF 313 A ++ +DN A+ +R+ Sbjct: 291 AEVKYSSDNATGNAFGAFMRY 311 >UniRef50_A6NUZ4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NUZ4_9BACE Length = 313 Score = 213 bits (542), Expect = 8e-54, Method: Composition-based stats. Identities = 88/329 (26%), Positives = 144/329 (43%), Gaps = 32/329 (9%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 +R LG++TS T +A++D + + + + G+ A HV++ L + Sbjct: 4 LRCLGLDTSNYTTSVAVFDGTT---GENIGRLLDVPSGTLGLRQSDALFQHVKRLPGLFE 60 Query: 61 AALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 L E L ++ AV + P + L G GR+L+ +VP PV H +GH Sbjct: 61 Q-LHEKDL-LGELRAVGASTRPRAVDGSYMPCFLAGEGQGRALSATLNVPFFPVSHQQGH 118 Query: 116 LLAPMLEDNPP---EFPFVALLVSGGHTQLISVTGIG---QYELLGESIDDAAGEAFDKT 169 + A + P +A +SGG T+L+ V G + + +G + D +AG+ D+T Sbjct: 119 IAAAAWSAGRLGLLDEPMLAWHLSGGTTELLYVEPEGVNVRAQAIGGTSDISAGQLIDRT 178 Query: 170 AKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD 229 KLLGLD+P G L +A + + + R G FS SG++ Sbjct: 179 GKLLGLDFPAGKALDALARESQSEK----RFKVKLNGCSFSLSGVENQVKAMAERGEA-- 232 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGE 289 ADIAR + + D + AL++ ++ +GGV++N LR KL Sbjct: 233 --PADIARFALNTIADAVARATAAALEERPGLNVLCSGGVASNSLLREKLKNA------- 283 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGAT 318 +A P + TDN +A +AG T Sbjct: 284 -VFAEPRYSTDNAMGVAILAWRSLQAGET 311 >UniRef50_Q3AAM2 Glycoprotease family protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AAM2_CARHZ Length = 319 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 79/315 (25%), Positives = 143/315 (45%), Gaps = 24/315 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 LG +TS T A D E L+ + + + G+ H+R ++Q Sbjct: 4 IFLGFDTSNYTTSFAAVDGEGRLIFDLRKI-LPVPEGEVGLRQRDVVFLHLRHLKEMVQE 62 Query: 62 ALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 ++ + + + P + + L G + +L+ A DVP + H EGHL Sbjct: 63 GFNR--ISRDQVRGIGVSVKPRPLPESYMPSFLAGEVIASTLSLALDVPLVKTTHQEGHL 120 Query: 117 LAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQ---YELLGESIDDAAGEAFDKTAKLL 173 +A + F+A+ SGG ++++ V Q ++LG+S+D +AG+ D+ LL Sbjct: 121 VAALWSLKKDFPRFLAIHFSGGTSEILEVEKEPQGYKVKVLGKSLDISAGQLVDRIGVLL 180 Query: 174 GLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRA 233 GL +P G L ++A + + P T G ++ FSG + + ++D + Sbjct: 181 GLPFPSGKFLEELAQKAVG---ILKVPATFVNG-NWHFSGAEAYLKRKLKDFPAFE---- 232 Query: 234 DIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR--GEVF 291 IARA E+ + TL + +V+ GGV+AN ++ L E +KKRR +++ Sbjct: 233 -IARAVEEVIARTLFKIIQYHAKDNLP--VVLMGGVAANNYIKNFLLEKLKKRRVAVDLY 289 Query: 292 YARPEFCTDNGAMIA 306 +A ++ +DN +A Sbjct: 290 FAEVQYASDNAVGVA 304 >UniRef50_C8WXH0 Peptidase M22 glycoprotease n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WXH0_ALIAD Length = 329 Score = 210 bits (536), Expect = 4e-53, Method: Composition-based stats. Identities = 81/327 (24%), Positives = 138/327 (42%), Gaps = 24/327 (7%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 VLG++TS T + D G + + +++ G+ A+ HV+ ++ Sbjct: 7 LVLGVDTSNYTTSVCAVDAVHGRMVAEARRPLRVPRGERGLRQSEAAFQHVQNFPTVMAE 66 Query: 62 ALKES---GLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHME 113 L G+ VA + P + G V SLA + VP H E Sbjct: 67 LLDRLMAEGVRPA-WRRVAVSVRPRPWASSYMPVFQSGFAVAASLAHSLGVPLTRTSHQE 125 Query: 114 GHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELL---GESIDDAAGEAFDKTA 170 GHL A P PFVA+ +SGG ++ + GE++D G+ D+ Sbjct: 126 GHLAAAEYFAPMPGAPFVAVHMSGGTCDVVIARRTPSGYAITRVGEALDLHPGQLVDRVG 185 Query: 171 KLLGLDYPGGPLLSKMAAQG--TAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTD 228 LGL +P GP L ++A + + G + P+ G SFSG T A ++ Sbjct: 186 VALGLPFPAGPHLEQLARRCGTSPGELLLKAPV---RGASMSFSGPLTAALRAVQAGAPA 242 Query: 229 DQTRADIARAFEDAVVDTLMIKCKRALDQTG-FKRLVMAGGVSANRTLRAKLAEMMKKR- 286 + +ARA E + ++ + A+ + +++AGGV++N+ ++ + +++R Sbjct: 243 HE----VARAVEACIARSVAKAVEYAVRHAQTARHVLIAGGVASNQFIQCTIRSRLERRV 298 Query: 287 -RGEVFYARPEFCTDNGAMIAYAGMVR 312 V +A PEF DN +A G R Sbjct: 299 PGIHVAFAPPEFARDNALGVATIGYWR 325 >UniRef50_C7H6X1 Glycoprotease family protein n=2 Tax=Faecalibacterium prausnitzii RepID=C7H6X1_9FIRM Length = 312 Score = 207 bits (527), Expect = 5e-52, Method: Composition-based stats. Identities = 69/317 (21%), Positives = 129/317 (40%), Gaps = 23/317 (7%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGI+TS T +A++ ++ + + + G+ A H +++ Sbjct: 5 TLGIDTSNYATSLAVFHTAGEVVCAKKRF-LPVKEGQLGLRQSDALFHHTAALPEMLEEL 63 Query: 63 LKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLL 117 +E LT I AV + P + L G + + A A +P I H +GH Sbjct: 64 GREFDLT--QISAVGVSQKPRPVEGSYMPCFLAGVSAATAFAQARGIPLIHTTHQQGHAA 121 Query: 118 APMLE---DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 A + + + +SGG T L+ + + LG S D AG+A D+ LG Sbjct: 122 AALFAAKGEELFRQKVLLFHISGGTTDLLLCNEVKEITTLGTSTDLYAGQAVDRVGVKLG 181 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRAD 234 +P G +S++AA +P + G+ S SGL+ + + T + Sbjct: 182 FGFPAGVEVSRLAALCEEPI----KPRSSVKGMQCSLSGLENQCNALLNEGKTPEY---- 233 Query: 235 IARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYAR 294 + + V DT++ K A + +V AGGV ++ +RA + + + +V++ Sbjct: 234 VCKYCLLCVADTVVKMTKAAQKEYPGLPVVCAGGVMSSDIIRAWVQQRL----PQVYFVP 289 Query: 295 PEFCTDNGAMIAYAGMV 311 ++ +DN ++ Sbjct: 290 GQYSSDNAIGVSILAAQ 306 >UniRef50_D1PKV9 Glycoprotease family protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PKV9_9FIRM Length = 315 Score = 207 bits (526), Expect = 6e-52, Method: Composition-based stats. Identities = 72/319 (22%), Positives = 121/319 (37%), Gaps = 22/319 (6%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M LGI+TS T +A++D G + + + A G+ A H ++ Sbjct: 1 MLTLGIDTSNYATSLAVFDTNAGEVVCDCKKFLPVKAGQMGLRQSDALFHHTSALPQMLL 60 Query: 61 AALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGH 115 +++ L+ I AV +A P + L G + A A +P H +GH Sbjct: 61 ELGEKTDLSR--IGAVGVSAKPRPVEGSYMPCFLAGVNTATAFALARKIPMFKTTHQQGH 118 Query: 116 LLAPMLEDNPPE---FPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKL 172 + A + + VSGG T L+ G LG S D AG+A D+ Sbjct: 119 IAAALFATGVHSLFMQEALVFHVSGGTTDLLLCHGADTVVPLGTSSDLYAGQAVDRLGVK 178 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 LG +P G +S+ AA RP G++ S SGL+ + + Sbjct: 179 LGYPFPAGVYVSEQAALCAEKI----RPKVSVRGMECSLSGLENQCNRMLEEGKNASY-- 232 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFY 292 + + + +TL+ AL + ++ AGGV ++ +R + + + Sbjct: 233 --VCKYCLLCIGETLVRMAGTALQEHPGLPVIFAGGVMSSDLIRTYVMHRV----PGAHF 286 Query: 293 ARPEFCTDNGAMIAYAGMV 311 +F +DN IA Sbjct: 287 VPGKFASDNAIGIAVLAAK 305 >UniRef50_B0AAV1 Putative uncharacterized protein n=2 Tax=Clostridium RepID=B0AAV1_9CLOT Length = 326 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. Identities = 69/326 (21%), Positives = 139/326 (42%), Gaps = 17/326 (5%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 V+GI+TSC T IA K ++ N+ + + + G+ A HV + + Sbjct: 8 IVIGIDTSCYTTSIAAISLNKEIIFNEKIM-LNVDTNSKGLRQSEAVFKHVSNIGQISEN 66 Query: 62 ALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 ++ L +I V + P + VG +G+ L+ + P H E H+ Sbjct: 67 IAEK--LRDYNIVGVCASEKPRPIKGSYMPVFTVGLNIGKLLSSTHNCPFFKTSHQENHI 124 Query: 117 LAPMLEDNPPE-FPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTAK 171 + +L N + F+A+ +SGG T+++ V G ++E++G + D + G+ D+ Sbjct: 125 ESSLLGKNLLDKNRFIAVHMSGGTTEIVLVNKGKCGKYEFEIIGGTKDVSFGQLIDRLGV 184 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 L ++P G + K A + + ++ SG++ + + + Sbjct: 185 KLSYNFPCGKYIDKNALEYEKTIENGLKTSVKEGYMN--LSGIENQLDKIMSNQK--EID 240 Query: 232 RADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVF 291 ++ +++ DA++ + ++ G +V AGGVSA++ + L + +KK R + Sbjct: 241 KSFLSKLLMDAIIRNMFKSLSYLCEKHGVYEVVFAGGVSASKYISKNLTQKLKKYRIKTH 300 Query: 292 YARPEFCTDNGAMIAYAGMVRFKAGA 317 + + TDN A G+ G Sbjct: 301 FTHADLATDNAVGCALIGIQNLNLGE 326 >UniRef50_Q5KFY5 Mitochondrion protein, putative n=2 Tax=Filobasidiella neoformans RepID=Q5KFY5_CRYNE Length = 307 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 80/286 (27%), Positives = 135/286 (47%), Gaps = 35/286 (12%) Query: 84 LVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML-EDNPPEFPFVALLVSGGHTQL 142 + G L VG R+LA A + VHHM+ H L P+L PEFPF+ LL+SGGHTQL Sbjct: 1 MPGCLSVGQGTARALAAALGKRLVGVHHMQAHALTPLLTSAAAPEFPFLILLLSGGHTQL 60 Query: 143 ISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD------------YPGGPLLSKMAAQG 190 + G+ ++++L +++D G+ F+K+A+LL L Y P L Sbjct: 61 VLAKGLFKFKILLDTLDSKIGDVFEKSARLLALPSGPKAPGAILEHYASLPALPPYDTHP 120 Query: 191 TAGRFVFPRPMTD---RPGLDFSFSGLKTFAANTIRDNG-----TDDQTRADIARAFEDA 242 + P P+T + L +SF+G+ + D D+ R A + A Sbjct: 121 LPASQLIPIPLTTLHAKNTLAWSFAGMLAALQRAVHDRRQRQPAWDEPDRRAFANLVQTA 180 Query: 243 VVDTLMIKCKRALDQTGFKR------LVMAGGVSANRTLRAKLAEMMKKR-------RGE 289 + L+ K + + +V++GGV++N +R++L ++K Sbjct: 181 LTTHLLTKLAQRIALLPPDTRAQLGGIVVSGGVASNAYIRSQLDRLVKTENGLFPPAGRN 240 Query: 290 VFYARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRWPLAEL 334 ++Y CTDN AMIA+ ++R + G +D + +R +W L ++ Sbjct: 241 LYYPPLHLCTDNAAMIAHTALIRLQTGLRSDPDDLKLRAKWSLEDM 286 >UniRef50_Q0AZF6 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AZF6_SYNWW Length = 326 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 78/325 (24%), Positives = 139/325 (42%), Gaps = 23/325 (7%) Query: 4 LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAAL 63 LGI+TS + +A+ D+E+ ++A++ +++ A G+ A H++ L Sbjct: 5 LGIDTSAYTSSLALVDEEQNIIADERMI-LQVGAGKRGLRQSEAFFQHIKNLPFLFARL- 62 Query: 64 KESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA 118 S + A+ +A P + G + L+ +P H EGH++A Sbjct: 63 --SSYFDAPVKAIGASAWPRRVEGSYMPVFSAGFSQAVVLSSFTGIPLYSFSHQEGHIIA 120 Query: 119 PMLEDNPP--EFPFVALLVSGGHTQLISVT----GIGQYELLGESIDDAAGEAFDKTAKL 172 + + F+A+ SGG ++L+ V G+ +D AG+ D+ Sbjct: 121 GIKGNEALLGRAEFLAVHFSGGTSELLHVRQQQGGLLDISPALAGLDLHAGQLVDRVGVA 180 Query: 173 LGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTR 232 +GLD+P G L KMA Q + G FSFSG +T A + + + Sbjct: 181 MGLDFPCGSELEKMARQSSGENLPLMPSSVSDKG--FSFSGAETRARKLMA----EGISY 234 Query: 233 ADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR--RGEV 290 DIA A + +TL + D+ G K +++ GGV AN ++ +L ++ ++ Sbjct: 235 PDIALASLRCIANTLEKSILQESDKKGIKDVLLVGGVMANSIIKERLQARLEHPAVGLKL 294 Query: 291 FYARPEFCTDNGAMIAYAGMVRFKA 315 F+A P +DN +A A + Sbjct: 295 FFASPRLSSDNAVGVALAAQFILRK 319 >UniRef50_A7VX43 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A7VX43_9CLOT Length = 315 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 87/329 (26%), Positives = 143/329 (43%), Gaps = 27/329 (8%) Query: 1 MRV-LGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M + LGI+TS T +A+YD + + Q+ + + G+ A HV++ L+ Sbjct: 1 MNLALGIDTSNYTTSLALYDAQAHEIC-QVKRLLPVKEGEKGLRQSDAVFHHVQQLPELM 59 Query: 60 QAALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEG 114 L + G K + AV +A P + VG T R L+ A +VP H G Sbjct: 60 DK-LWKPG-CGKALSAVGVSARPRDAEGSYMPCFTVGLTYARLLSTALEVPFYTFSHQAG 117 Query: 115 HLLAPMLEDNPP---EFPFVALLVSGGHTQLISVTGIGQ----YELLGESIDDAAGEAFD 167 H+ A + + PF+A VSGG T+ + V+ Q +L +++D AG+ D Sbjct: 118 HIAAALYSSGSLSLLKQPFLAFHVSGGTTEALLVSPDDQRILSCQLAAKTLDLNAGQLID 177 Query: 168 KTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGT 227 + +LGL +P GP L ++A + RP G D SG + +R Sbjct: 178 RVGVMLGLGFPAGPALERLALTCESKGLRGARP--AMKGNDCCLSGGENLCIKLLR---- 231 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 + + A IA + V L C+ L++ G +V AGGV +N LR ++ + Sbjct: 232 EGKEPAYIAAFCLEYVKAALDQMCRGLLERYGRLPVVFAGGVMSNSILREYFSK-----Q 286 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAG 316 +A P+F +DN + ++ G Sbjct: 287 YGAMFAEPQFSSDNAGGVGVLTAIKAGLG 315 >UniRef50_A0RY43 O-sialoglycoprotein endopeptidase n=4 Tax=Thaumarchaeota RepID=A0RY43_CENSY Length = 237 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 71/247 (28%), Positives = 113/247 (45%), Gaps = 14/247 (5%) Query: 91 GATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQ 150 GA V R+L+ +P PV+H GH+ L + + LLVSGGHT L++ G G+ Sbjct: 2 GAVVARALSSYHGIPIYPVNHAIGHIELGKLLTGAQDP--LVLLVSGGHTMLLAFVG-GR 58 Query: 151 YELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFS 210 + + GE++D G+ D+ + LG P G + ++AA+ + P + G D S Sbjct: 59 WRVFGETLDITLGQLLDQFGRSLGFPSPCGRQVEELAAESSEYT-DLPYSV---KGNDVS 114 Query: 211 FSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVS 270 FSGL + A R + + ++ + +RAL T + L++ GGV+ Sbjct: 115 FSGLLSAAKTAARRGKETA------SYSLQETAFAMVAEAVERALSFTRKRELMVVGGVA 168 Query: 271 ANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL-GVSVRPRW 329 AN+ L L ++R +F P + D GA IA G++ A L VR W Sbjct: 169 ANKRLAGMLEGACGRQRCRLFVVPPVYSGDCGAQIACTGLLEASIKDGAPLADTFVRQSW 228 Query: 330 PLAELPA 336 L + Sbjct: 229 RLDTVDV 235 >UniRef50_Q18B67 Probable O-sialoglycoprotein endopeptidase n=6 Tax=Clostridium difficile RepID=Q18B67_CLOD6 Length = 356 Score = 195 bits (497), Expect = 1e-48, Method: Composition-based stats. Identities = 70/351 (19%), Positives = 135/351 (38%), Gaps = 43/351 (12%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++GI+TSC T IA +K ++ N+ +++ + G+ A H+ L Sbjct: 7 IIIGIDTSCYTTSIAAISLDKKVIFNEKIM-LEVRDNSKGLRQSEAVFQHINNLGILSDR 65 Query: 62 ALKESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHL 116 +S +++ V + P + VG G+ L+ + H E H+ Sbjct: 66 I--KSFKDKFNVEGVCSSKKPRPVENSYMPVFNVGHNFGKLLSSIYGCRFYETTHQENHI 123 Query: 117 LAPMLEDN-PPEFPFVALLVSGGHTQLI-------------------------------S 144 A +L F+++ +SGG T+++ Sbjct: 124 EASLLNSKLKNNNKFISVHMSGGTTEILLTSKQDSHHNVCDTNLGKIAKISIKKDDKSKL 183 Query: 145 VTGIG-QYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTD 203 G +++G S D + G+ D+ LG +P G L + A + Sbjct: 184 YNNFGYNIDIIGGSKDISFGQLIDRVGIKLGYKFPSGKYLDENALNCNLKIESGLKTSVR 243 Query: 204 RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRL 263 ++ SGL+ I DNG + + I++ D+VV + + + Sbjct: 244 DGYMN--LSGLENQVNKIINDNGDNTNQKEYISKLVLDSVVRNMFKSLVYLCETYNVNEV 301 Query: 264 VMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFK 314 + AGGVSA++ + +L+ ++K+ E ++ P++ TDN A G+ F Sbjct: 302 IFAGGVSASKYILRELSMKLRKKHIEAYFTEPQYSTDNAVGCAIIGLNNFL 352 >UniRef50_D2VC41 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VC41_NAEGR Length = 415 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 85/256 (33%), Positives = 124/256 (48%), Gaps = 34/256 (13%) Query: 23 GLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAALKE-SGLTAKD---IDAVAY 78 +L Q+ + +L YGGV P + H +I+ AL++ S L + +D VA Sbjct: 5 KILHEQVITHHELVNQYGGVHPTEMAHMHRATLDGMIENALEKVSNLDSNRERVVDYVAV 64 Query: 79 TAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGG 138 T GPGL L G +VP IPVHH+E HLL P++ FP++ LL SGG Sbjct: 65 TVGPGLPPCLSAGLDTAMKYCEKLNVPVIPVHHLEAHLLVPLMFSENTNFPYLVLLASGG 124 Query: 139 HTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL------------------GLDYPGG 180 H ++ GIGQYE++G + DD+ GEAFDKTA+LL +Y GG Sbjct: 125 HCLVVFSRGIGQYEIVGGTEDDSIGEAFDKTARLLQESIDFNLNDYVNEKFGTRENYSGG 184 Query: 181 PLLSKMAAQGTAGRFVFPRPMTD---RPGLDFSFSGLKTFAANTIRDNGTDDQTRADIAR 237 L+ K+A G + + FP P+ R + FSFSG+KT T+R ++ D+ Sbjct: 185 ALVEKLALLGDSSSYNFPIPLRKGNRRNDITFSFSGIKTDVLRTVRKEQNQGISKRDL-- 242 Query: 238 AFEDAVVDTLMIKCKR 253 L+ + + Sbjct: 243 -------HHLLNRLRN 251 Score = 124 bits (311), Expect = 5e-27, Method: Composition-based stats. Identities = 33/137 (24%), Positives = 63/137 (45%), Gaps = 10/137 (7%) Query: 209 FSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG------FKR 262 +S G ++ + +++ ++ +I+ +F+ L+ K + A+ + Sbjct: 272 YSREGSQSLSTIELKNEKLSEEVVCNISASFQKCAFTHLIDKLEMAMHRYRANVDEYPNS 331 Query: 263 LVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG 322 L+++GGVSAN+ R +L ++ K ++ A ++CTDN MI YA R + Sbjct: 332 LIVSGGVSANQYFRHELTKLSDKYEYDLKVAPMKYCTDNAVMIGYAAFQRLFNECHKPVE 391 Query: 323 VS----VRPRWPLAELP 335 V PRWP+ L Sbjct: 392 VCDKERYIPRWPITTLS 408 >UniRef50_UPI0000DD8AA6 Os01g0295900 n=1 Tax=Oryza sativa Japonica Group RepID=UPI0000DD8AA6 Length = 288 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 78/333 (23%), Positives = 126/333 (37%), Gaps = 102/333 (30%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + +LGIETSCD+T A+ + +L+ + SQ L +GGV P++A H+ ++Q Sbjct: 16 LLMLGIETSCDDTAAAVVRGDGEILSQVVSSQEDLLVRWGGVAPKMAEEAHLLAIDRVVQ 75 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL + ++ D+ AVA T G Sbjct: 76 KALDNANVSESDLSAVAVTVG--------------------------------------- 96 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGG 180 P ++L + G T I+ + Q K +K++ Sbjct: 97 --------PGLSLCLRGYLTNHINCSWCSQSS---------------KNSKIIS------ 127 Query: 181 PLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDN-------------GT 227 P ++ G F M +FS++GLKT I Sbjct: 128 PAYCWSSSYG-GTGISFQVSMRQHKDCNFSYAGLKTQVRLAIESRNISTDDIPISSATKD 186 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRR 287 D Q RA+IA +F+ L+ V++GGV++N+ +R L ++ +K Sbjct: 187 DRQIRANIAASFQ------LLK--------------VVSGGVASNQYVRTHLNQIAEKNG 226 Query: 288 GEVFYARPEFCTDNGAMIAYAGMVRFKAGATAD 320 ++ P CTDNG MIA+ G+ F AG D Sbjct: 227 LQLVCPPPRLCTDNGVMIAWTGIEHFIAGRFDD 259 >UniRef50_B0TEI7 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TEI7_HELMI Length = 385 Score = 190 bits (484), Expect = 4e-47, Method: Composition-based stats. Identities = 99/376 (26%), Positives = 153/376 (40%), Gaps = 67/376 (17%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 VLGI+TSC T +A + LLA + + + G+ A H R+ +++A Sbjct: 9 VLGIDTSCYTTSVAFASLDGRLLAQKRQ-LLPVKPGERGLRQGDAFFLHGRQLPHVMEAL 67 Query: 63 LK--------ESGLTAKDIDAVAYTAGP-----GLVGALLVGATVGRSLAFAWDVPAIPV 109 ++G ++AVA + P + L G VGRS+A A VP Sbjct: 68 FADLRCSGEAKAGREGLRVEAVAASTRPRPEEGAYLPVFLAGEAVGRSVAAAQGVPFFAT 127 Query: 110 HHMEGHLLAPMLE-------DNPPEFPFVALLVSGGHTQLISVTGIG-----QYELLGES 157 H EGH++A + + + F+++ +SGG T+L+ V G E LG + Sbjct: 128 THQEGHIMAGIASLEDREQAEALLKKGFLSVHLSGGTTELLRVRFDGASAVFSIEKLGAT 187 Query: 158 IDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTA----------------GRFVFPRPM 201 D AG+ D+ LGL +P GP L +AAQ FP + Sbjct: 188 TDLHAGQLVDRVGVALGLPFPAGPHLEALAAQCDGGRCAAEGAAEGSTEAIEAIPFPASV 247 Query: 202 TDRPGLDFSFSGLKTFAANTIRDNGTDDQT--------------------RADIARAFED 241 G + SFSG + A I ++ IAR E Sbjct: 248 ---KGYNVSFSGAEAQALRLIEKWRKANEAASPAAIATLPGDPAHPGIPALPAIARGIEG 304 Query: 242 AVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKR--RGEVFYARPEFCT 299 + TL +RA+ +TG + +++ GGV+AN LR +L E ++ R G + +A Sbjct: 305 CLASTLEKILRRAIAETGCRDVLIVGGVAANGFLRRRLRERLEHRAVGGRLAFATTALSG 364 Query: 300 DNGAMIAYAGMVRFKA 315 DN A +A G A Sbjct: 365 DNAAGVALLGAKFLSA 380 >UniRef50_B2WBX5 Glycoprotease pgp1, mitochondrial n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2WBX5_PYRTR Length = 417 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. Identities = 93/418 (22%), Positives = 137/418 (32%), Gaps = 144/418 (34%) Query: 3 VLGIETSCDETGIAIYDDE-------KGLLANQLYSQVKLHADYGGVVPELASRDHVRKT 55 L IETSCD+T +A+ +L ++ + ++Y GV P ++ + H Sbjct: 2 TLAIETSCDDTSVAVVKKGCKNDRTTAQILFHKKVTSNN--SEYQGVHPIVSLQSHQESL 59 Query: 56 VPLIQAALK--------------ESGLTAKDI------DAVAYTAGPGLVGALLVGATVG 95 L+ A++ +G DI D V+ T GPG+ L G Sbjct: 60 ATLVGEAIRCLPMQDGELPSEDDRTGPIPVDITTRTLPDFVSVTRGPGMRSNLFTGLDTA 119 Query: 96 RSLAFAWDVPAIPVHHMEGHLLAPML-----------------------------EDNPP 126 + LA AW P + VHHM+ H L L P Sbjct: 120 KGLAVAWQKPLVGVHHMQAHALTSRLVSALDAYKELNEPEAECLPNGTIGRNPTQAHVSP 179 Query: 127 EFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD--------YP 178 +FPF+++L SGGHT LI + + +LG + D A GE DK A+++ Sbjct: 180 DFPFLSVLASGGHTLLIHSASLTDHRVLGSTNDIAIGECLDKIARVVLPPEVLQATKSTM 239 Query: 179 GGPLLSKMA--------------------------------------------------- 187 G LL A Sbjct: 240 YGALLEDFAFPVSLREEPASHDTPTLSSDLPVCSADKYKNTFQHQYSWYEVPLNNEDSMI 299 Query: 188 AQGTAGRFVFPRPMTDRPG------LDFSFSGLKTFAANTIR----------------DN 225 TA + RP+T G L+ SFSG+ T +R Sbjct: 300 RNVTAWGWALNRPLTKSGGGIKINSLEMSFSGITTMIERIVRYGMDPITRKLNKKERAAT 359 Query: 226 GTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG-----FKRLVMAGGVSANRTLRAK 278 + R D+AR A + + + L +VMAGGV+AN R Sbjct: 360 EVSLEERRDLARETMRAAFEHVASRVVLGLQSQQELLEANPAVVMAGGVAANSFFRHM 417 >UniRef50_UPI000187E9E4 hypothetical protein MPER_08009 n=1 Tax=Moniliophthora perniciosa FA553 RepID=UPI000187E9E4 Length = 276 Score = 170 bits (432), Expect = 5e-41, Method: Composition-based stats. Identities = 61/283 (21%), Positives = 108/283 (38%), Gaps = 43/283 (15%) Query: 4 LGIETSCDETGIAIY----DDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLI 59 LG+E S ++ G I D +L+N ++ + + G P + H + +I Sbjct: 21 LGLEGSANKLGAGIIKHSEDGSATVLSNIRHTYITPPGE--GFQPRDTALHHREWAMKVI 78 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 L ++ ++ D+D + YT GPG+ L A V R+L+ +D P + V+H GH+ Sbjct: 79 DECLTKAEVSMHDLDCICYTKGPGMGAPLQSVALVARTLSMLFDKPIVGVNHCVGHIEMG 138 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 ++G ++ G+Y + F L D Sbjct: 139 R-------------EITGAQNPVVLYVSRGEY---------PSDSVFAAMLSYLWRD--T 174 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNG------------T 227 G + + GR + P P G+D S SG+ + D Sbjct: 175 GHCWYNIEQESKKGRRLLPLPYA-TKGMDISLSGVLSSVEAYTNDKMFRQTPTSDEEKDE 233 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVS 270 T AD+ + ++ V L+ +RA+ G K +++ GGV Sbjct: 234 SVITPADLCFSLQETVFAMLVEITERAMAHIGSKEVLIVGGVG 276 >UniRef50_D2EF31 O-sialoglycoprotein endopeptidase (Fragment) n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EF31_9EURY Length = 242 Score = 160 bits (405), Expect = 6e-38, Method: Composition-based stats. Identities = 61/187 (32%), Positives = 98/187 (52%), Gaps = 7/187 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 LGIE++ G+ I +++K ++AN+ + L GG++P A+ H + +I+ A Sbjct: 26 TLGIESTAHTFGVGISENDK-IIANERDT---LKPTSGGIIPREAAMHHFKLAPEIIKRA 81 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L +SGL KDID A++ GPG++ AL VGA V L+ + I V+H HL L Sbjct: 82 LDKSGLKLKDIDLFAFSQGPGIIPALKVGAQVSTFLSNKYKKKLIGVNHCIAHLEIARLY 141 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPL 182 + V L VSGG+TQ+I+ G Y + GE+ D G DK + + + +P G Sbjct: 142 TKLKDP--VMLYVSGGNTQIITYYN-GTYIVFGETQDIGIGNLIDKIGRRMDIPFPDGTK 198 Query: 183 LSKMAAQ 189 + + + Sbjct: 199 IEETCHE 205 >UniRef50_Q8IJ99 Glycoprotease, putative n=5 Tax=Plasmodium RepID=Q8IJ99_PLAF7 Length = 598 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 51/190 (26%), Positives = 96/190 (50%), Gaps = 7/190 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LGIE S ++ GI+I +++ +L N + + G +P S H + +I++ Sbjct: 17 ILGIEGSANKLGISIINEDMNILVNMRRTYIS--EIGCGFIPREISAHHKYYIIDMIKSC 74 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 LK+ + DI + YT GPG+ AL +G + + L +++P + V+H H+ + Sbjct: 75 LKKVNIKISDITLICYTKGPGIGSALYIGYNIAKILYSYFNIPVVGVNHCIAHIEMGIFI 134 Query: 123 DNPPEFPFVALLVSGGHTQLISVTG-IGQYELLGESIDDAAGEAFDKTAKLLGL--DYPG 179 + L VSG +TQ+I +YE++GE++D A G D++A++L + Sbjct: 135 TKLYNP--IVLYVSGSNTQIIYYNDHKKKYEIIGETLDIAIGNVIDRSARILKISNAPSP 192 Query: 180 GPLLSKMAAQ 189 G + +A + Sbjct: 193 GYNVELLARK 202 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 57/118 (48%), Gaps = 4/118 (3%) Query: 224 DNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMM 283 D +++ + I + + + L+ +RA+ T K +++ GGV N L+ + +M Sbjct: 479 DLTEEEKRKIQICYSLQHHIFSMLIEITERAIAFTNSKEVIIVGGVGCNLFLQNMMKKMA 538 Query: 284 KKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL----GVSVRPRWPLAELPAA 337 K++ ++ + +C DNGAMIAY G + + D+ +++ R+ ++ Sbjct: 539 KQKNIKIGFMDHSYCVDNGAMIAYTGYLEYLHAKNKDIYNFNNITIHQRYRTDDVFVT 596 >UniRef50_C1BYL4 Probable O-sialoglycoprotein endopeptidase 2 n=1 Tax=Esox lucius RepID=C1BYL4_ESOLU Length = 235 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 47/184 (25%), Positives = 76/184 (41%), Gaps = 24/184 (13%) Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDD-------- 229 GG + +A G +F F PM +FSF+GL+ TI+ ++ Sbjct: 39 SGGQAIELLAQDGDRLKFHFRPPMGAHYDCNFSFAGLRNQVKMTIQKKEAEEGVEPGTLL 98 Query: 230 QTRADIARAFEDAVVDTLMIKCKRALDQTGF--------KRLVMAGGVSANRTLRAKLAE 281 DIA A + V + + RA+ LV++GGV++N+ +R L Sbjct: 99 SCVNDIAAAMQHTVAFHIAKRTHRAILFCKAQGLLPSFNPTLVVSGGVASNQYIRKTLKI 158 Query: 282 MMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVR---PRWPLA-----E 333 + ++ + C DNG MIA+ G+ R + G + V P+ PL E Sbjct: 159 VTDATGLDLLCPPSKLCNDNGVMIAWNGVERLREGKGILFYIDVVRYEPKAPLGVDITAE 218 Query: 334 LPAA 337 + AA Sbjct: 219 VEAA 222 >UniRef50_C5KJ57 Putative uncharacterized protein (Fragment) n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KJ57_9ALVE Length = 203 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 51/155 (32%), Positives = 78/155 (50%), Gaps = 17/155 (10%) Query: 192 AGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKC 251 A R + + G DFSF+GLKT + I ++ D+A +F+ VD L+ + Sbjct: 7 AKPLAKTRDLELKNGCDFSFAGLKTSMRHLIEGGK---YSKPDMAASFQKRCVDHLVERA 63 Query: 252 KRALDQT-----GFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIA 306 RA+D K LV+AGGV+AN+++R+ + E+ K++ + CTDNG M+A Sbjct: 64 GRAIDWALEIDDSIKDLVVAGGVAANKSVRSNMQELAKEKGLTLHCPPTRLCTDNGTMVA 123 Query: 307 YAGMVRFKAGAT---------ADLGVSVRPRWPLA 332 + + K G A+ V VRPRWPL Sbjct: 124 WNAIEHLKEGLYERAPCTAESAEKFVEVRPRWPLG 158 >UniRef50_D1IQV9 Whole genome shotgun sequence of line PN40024, scaffold_2082.assembly12x (Fragment) n=4 Tax=Eukaryota RepID=D1IQV9_VITVI Length = 151 Score = 151 bits (381), Expect = 5e-35, Method: Composition-based stats. Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 2/132 (1%) Query: 207 LDFSFSGLKTFAA-NTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVM 265 +D SFSGL ++ + ++ T AD+ + ++ V L+ +RA+ K +++ Sbjct: 1 MDVSFSGLLSYIEATAVEKLQNNECTPADLCYSLQETVFAMLVEITERAMAHCDKKDVLI 60 Query: 266 AGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGV-S 324 GGV N L+ + M +R G +F +C DNGAMIAY G++ + GAT L + Sbjct: 61 VGGVGCNERLQEMMRVMCSERSGRLFATDDRYCIDNGAMIAYTGLLAYAHGATTPLEEST 120 Query: 325 VRPRWPLAELPA 336 R+ E+ A Sbjct: 121 FTQRFRTDEVHA 132 >UniRef50_C9SIA9 Glycoprotease pgp1 n=2 Tax=Sordariomycetes RepID=C9SIA9_VERA1 Length = 208 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 44/173 (25%), Positives = 69/173 (39%), Gaps = 10/173 (5%) Query: 170 AKLLGLDY-PGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIR-DNGT 227 + + Y P ++A + + P + + FSG + Sbjct: 17 GRAVDYAYTPPRIRADEIATYRSPHGWHLKPPFATTRRMRYDFSGFGSQVQRIAEARPAM 76 Query: 228 DDQTRADIARAFEDAVVDTLMIKCKRALD-----QTGFKRLVMAGGVSANRTLRAKLAEM 282 + R D+ R + + L + AL + LV+AGGV++NR L L Sbjct: 77 SEAERRDLGRDTMRILFEHLASRVVLALGNEEMGLKDVRTLVVAGGVASNRYLMHVLRAF 136 Query: 283 MKKRRG---EVFYARPEFCTDNGAMIAYAGMVRFKAGATADLGVSVRPRWPLA 332 + R E+ E CTDN AMIA+ GM F+AG ++L V +WPL Sbjct: 137 LDVRGYDGIEITAPPVELCTDNAAMIAWTGMEMFEAGYESELSVHSIKKWPLD 189 >UniRef50_A5KDZ1 O-sialoglycoprotein endopeptidase, putative n=1 Tax=Plasmodium vivax RepID=A5KDZ1_PLAVI Length = 574 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 45/189 (23%), Positives = 91/189 (48%), Gaps = 7/189 (3%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +LG+E S ++ G++I + +L N + + G +P + H + +I+ Sbjct: 20 ILGLEGSANKLGVSIINSNFEILVNMRRTYIS--EIGCGFIPRQINAHHKYYIIEMIKDC 77 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L + + D+ + YT GPG+ AL + + + + +++P I V+H H+ + Sbjct: 78 LTKLKIKITDVHLICYTKGPGIGSALYIAYNISKFFSLLFNIPVIGVNHCIAHIEMGIFI 137 Query: 123 DNPPEFPFVALLVSGGHTQLISVTG-IGQYELLGESIDDAAGEAFDKTAKLLGL--DYPG 179 + + L VSG +TQ+I +YE++GE++D A G D++A++L + Sbjct: 138 TKL--YHPIILYVSGSNTQIIYFNDHKKRYEIIGETLDIAIGNVIDRSARILRISNSPSP 195 Query: 180 GPLLSKMAA 188 G + +A Sbjct: 196 GYNVEILAR 204 Score = 121 bits (304), Expect = 3e-26, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 56/116 (48%), Gaps = 4/116 (3%) Query: 226 GTDDQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKK 285 +++ + I + + + L+ +RA+ T K +++ GGV N L+ + +M K+ Sbjct: 457 TDEEKRKIQICYSLQHHIFSMLIEITERAIAFTNSKEVIIVGGVGCNVFLQNMMKKMAKQ 516 Query: 286 RRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATADL----GVSVRPRWPLAELPAA 337 + ++ + +C DNGAMIAY G + F ++ +S+ R+ ++ Sbjct: 517 KNIKIGFMDHSYCVDNGAMIAYTGYLEFANTKNREIYGFDNISIHQRYRTDDVLVT 572 >UniRef50_A9UYP5 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UYP5_MONBE Length = 230 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 45/139 (32%), Positives = 68/139 (48%), Gaps = 10/139 (7%) Query: 203 DRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFEDAVVDTLMIKCKRALDQTG--- 259 D DFSFSGLKT A N + D+ +A +F+ + D L+++ +RAL Sbjct: 70 DHSNCDFSFSGLKTRAINLSSEYAKRDE-LPLLAASFQRTIADHLLVRLERALRFCDQQG 128 Query: 260 --FKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGA 317 +R V AGGV N +R +L + V + P C DNG MIA+AG++ F G Sbjct: 129 RRPRRFVAAGGVLCNAYIRQRLHAFARFHDLPVEFPAPPLCVDNGVMIAWAGLLHFLRGT 188 Query: 318 TA----DLGVSVRPRWPLA 332 ++ + P+WP+ Sbjct: 189 SSVARDPQALRYHPKWPIG 207 >UniRef50_Q0V4Z5 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0V4Z5_PHANO Length = 497 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 50/192 (26%), Positives = 76/192 (39%), Gaps = 30/192 (15%) Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFPRPMT------DRPGLDFSFSGLKTFAANTI--- 222 P + T + F +P+ ++ SFSGL T I Sbjct: 300 YTWYKVPPNHE-EALKTAMTKWGWAFNQPLVTAHCGLKVNDIELSFSGLLTAVERVIGYQ 358 Query: 223 -----RDNGTDD--------QTRADIARAFEDAVVDTLMIKCKRALD----QTGFKRLVM 265 R + + + IAR A + + + AL + +V+ Sbjct: 359 TDPVTRKRTKIERTLDEISLEEKKHIAREAMRAAFEHVAYRVVLALRSLASDPAPRSVVL 418 Query: 266 AGGVSANRTLRAKLAEMMKKRR---GEVFYARPEFCTDNGAMIAYAGMVRFKAGATADLG 322 AGGV+AN LR LA + R +++ P FCTDN AMIA+ G+ F+AG T L Sbjct: 419 AGGVAANSFLRHILASTLCARGFSHINLYFPPPSFCTDNAAMIAWTGIEMFEAGHTDTLS 478 Query: 323 VSVRPRWPLAEL 334 + +WPL +L Sbjct: 479 IRAIRKWPLNQL 490 Score = 133 bits (335), Expect = 9e-30, Method: Composition-based stats. Identities = 68/244 (27%), Positives = 96/244 (39%), Gaps = 62/244 (25%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLH-------ADYGGVVPELASRDHVR 53 + L IETSCD+T +AI + K + + +Q+ H A+Y GV P ++ R H Sbjct: 28 LMTLAIETSCDDTSVAIVE--KKVENGRAVAQLHFHKKVTANNAEYQGVHPLVSLRSHQE 85 Query: 54 KTVPLIQAALKESGLTAKD--------------------------IDAVAYTAGPGLVGA 87 L+ A+ D V+ T GPG+ Sbjct: 86 NLADLVSEAISHLPPKTASRDHDFEHGGLEAQRPEAVLDVTKKRLPDFVSVTRGPGMRSN 145 Query: 88 LLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDN---------PPEFPFVALLVSGG 138 L G + LA AW + H L P L P+FPF+++L SGG Sbjct: 146 LFTGLDTAKGLAVAW----------QAHALTPRLVSALEPSATPTLEPDFPFLSVLASGG 195 Query: 139 HTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLL--------GLDYPGGPLLSKMAAQG 190 HT LI + + LLG + D A GE DK A++L G LL K A +G Sbjct: 196 HTLLIQSASLNDHHLLGTTNDIAVGEYLDKVARILLPTELLQSTRSTMYGALLEKFAFEG 255 Query: 191 TAGR 194 A + Sbjct: 256 NASQ 259 >UniRef50_B2AYU1 Predicted CDS Pa_1_12230 (Fragment) n=1 Tax=Podospora anserina RepID=B2AYU1_PODAN Length = 290 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 68/239 (28%), Positives = 106/239 (44%), Gaps = 40/239 (16%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKL-HADYGGVVPELASRDHVRKTVPLI 59 + + IETSCD+T +AI + Q ++ H ++ G+ P +AS+ H + L+ Sbjct: 40 LLTIAIETSCDDTCVAILEKAGPAARLQFNKRIPSNHVEFKGIHPTIASKSHEIQLAKLV 99 Query: 60 QAALKESG-----------LTAKDI-----------DAVAYTAGPGLVGALLVGATVGRS 97 A++ ++ +D D V+ T GPG L VG V + Sbjct: 100 NEAVQSLPKHTNHSPEVKTISIRDPQTGKSTPRRLPDFVSVTRGPGFPRCLDVGLGVAKG 159 Query: 98 LAFAWDVPAIPVHHMEGHLLAPMLED----------------NPPEFPFVALLVSGGHTQ 141 L+ AW VP + VHHM+GH L P L+ P+FPF+ LL SGGHTQ Sbjct: 160 LSVAWQVPFLGVHHMQGHALTPRLDHALQQPFPPSSSTPSSKLSPKFPFLTLLASGGHTQ 219 Query: 142 LISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRP 200 L+ T + + +L + + G+ DK A+ + L +A +F FP P Sbjct: 220 LLLSTTLTTHTILATVTNISLGDMLDKAAREILPPSLL-SSLPNIAYAAALEQFAFPSP 277 >UniRef50_C6WZF1 Putative glycoprotease family exported protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6WZF1_FLAB3 Length = 239 Score = 131 bits (329), Expect = 5e-29, Method: Composition-based stats. Identities = 47/221 (21%), Positives = 88/221 (39%), Gaps = 31/221 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L IETS +AI D ++ L + S+ ++ Sbjct: 14 MKILHIETSSRNCSVAISDGDELLCLCEEVSENY---------------KQSESLHTFVE 58 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ +G+ D+DAV+ GPG L +G++ + + +P I V+ +E + P Sbjct: 59 WALEGAGIALNDLDAVSLGMGPGSYTGLRIGSSAAKGFCYGLQIPLIAVNSLETMIE-PF 117 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQY-ELLGES----ID-DAAGEAFDKTAKLLG 174 L+ N F F+ L+ ++ + G ++L ++ ID D+ E K +G Sbjct: 118 LDQN---FDFIVPLLDARRMEVYTAHFDGNSGQMLTQTEASIIDQDSFQEFLGKKVVFVG 174 Query: 175 ---LDYPG---GPLLSKMAAQGTAGRFVFPRPMTDRPGLDF 209 L G P + + +F+ + + DF Sbjct: 175 DGALKAKGVLQLPDAEFNSDVYPSAKFLIKKAVEKFRNKDF 215 >UniRef50_C6XTX3 Peptidase M22 glycoprotease n=3 Tax=Sphingobacteriaceae RepID=C6XTX3_PEDHD Length = 230 Score = 127 bits (320), Expect = 5e-28, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 63/162 (38%), Gaps = 17/162 (10%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IET+ +AI + + + + ELA+ H I+ A Sbjct: 6 ILQIETATQACSVAISRNGETIALKE----------------ELANNIHAGSLTLFIETA 49 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 ++ +GL D+DAVA + GPG L +G + + L FA D P I + ++ A L Sbjct: 50 METAGLHFNDLDAVAVSKGPGSYTGLRIGVSTAKGLCFALDKPLIAIDTLQTMA-AGFLL 108 Query: 123 DNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGE 164 + + ++ ++ + L ++ E Sbjct: 109 EQAGYEGLICTMIDARRMEVFTAVFDADLNYLVPTVAKIIDE 150 >UniRef50_C0YUE5 Possible M22 family non-peptidase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YUE5_9FLAO Length = 226 Score = 127 bits (319), Expect = 6e-28, Method: Composition-based stats. Identities = 43/180 (23%), Positives = 78/180 (43%), Gaps = 25/180 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L +ETS +A+ D+EK L + S+ ++ Sbjct: 1 MKILYLETSSKNCSVAVSDNEKLLCLCEEVSENY---------------KQSESLHTYVE 45 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 AL+ +G++ K+I+AV+ GPG L +GA + + VP + V+ +E + P Sbjct: 46 WALEGAGISLKEIEAVSLGKGPGSYTGLRIGAASAKGFCYGLKVPLVAVNSLESMIE-PF 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQY-----ELLGESIDDAAGEAF-DKTAKLLG 174 L DN + + LV ++ + G+ E + +D+A+ E F DK +G Sbjct: 105 LGDN---YDLIVPLVDARRMEVYTAVYDGKTGKELSETEAKILDEASFEEFKDKKVLFVG 161 >UniRef50_P76256 M22 peptidase homolog yeaZ n=236 Tax=Gammaproteobacteria RepID=YEAZ_ECOLI Length = 231 Score = 127 bits (319), Expect = 7e-28, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 60/150 (40%), Gaps = 20/150 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+L I+T+ + +A+++D EL R+H ++ +P++Q Sbjct: 1 MRILAIDTATEACSVALWNDGTV-----------------NAHFELCPREHTQRILPMVQ 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L SG + DI+A+AY GPG + +G + + LA ++P I V + Sbjct: 44 DILTTSGTSLTDINALAYGRGPGSFTGVRIGIGIAQGLALGAELPMIGVSTLMTMAQGAW 103 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQ 150 ++ V + ++ Sbjct: 104 RKNGATR---VLAAIDARMGEVYWAEYQRD 130 >UniRef50_B2A5P8 Peptidase M22 glycoprotease n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A5P8_NATTJ Length = 236 Score = 126 bits (318), Expect = 8e-28, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 61/159 (38%), Gaps = 22/159 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VLGI+T+ +A+ D K L+ + + + H + +PLI Sbjct: 1 MKVLGIDTATKTCCVALIDGNK-LMGEFILNNF---------------QTHSERLMPLID 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L G+ +I+ +A + GPG L +G + LA ++P + V ++ Sbjct: 45 KLLDSLGIKIDEIEGIAVSRGPGAFTGLRIGIGTAQGLAMGNEIPLVGVSTLDALAY--- 101 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESID 159 ++ ++ +L + + + + D Sbjct: 102 ---QRATLGYICPIMDAKKQELYTSLYYVNEKEIEQVWD 137 >UniRef50_A3HX68 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HX68_9SPHI Length = 230 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 59/142 (41%), Gaps = 18/142 (12%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++L +ETS +A++D + G+ + H K + LI+ Sbjct: 3 KILSLETSTPVCSVALHDSGNIM----------------GLKEIEENGAHSEKLIKLIEE 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E + K++DA+A + GPG L +G + + LAFAW P I V + L Sbjct: 47 LLDELQVDRKEVDAIAVSEGPGSYTGLRIGVSTAKGLAFAWGKPLIAVSTLAALARGATL 106 Query: 122 EDNPPEFPFVALLVSGGHTQLI 143 ++N V ++ ++ Sbjct: 107 DENNSS--VVIAMLDARRMEVY 126 >UniRef50_B3PIE3 Glycoprotease family protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PIE3_CELJU Length = 254 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 61/143 (42%), Gaps = 18/143 (12%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L +++S D +A+ D K G+ E+A++ H ++ +P++ Sbjct: 13 LILALDSSTDACSVALNRDGKL-----------------GIRHEIATKSHTQRLLPMVDE 55 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 L E G++ ++D +A+ GPG L + + + LA+ +P +PV +E L Sbjct: 56 VLGEEGISVSEVDVIAFGRGPGSFTGLRICMGIVQGLAYGSGIPVVPVSTLEAMALQVYR 115 Query: 122 EDNPPEFPFVALLVSGGHTQLIS 144 + P + L ++ Sbjct: 116 QHPEWRGPVMVAL-DARMDEVYW 137 >UniRef50_B8I821 Peptidase M22 glycoprotease n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I821_CLOCE Length = 236 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 42/170 (24%), Positives = 74/170 (43%), Gaps = 24/170 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MR+L ++TS + AI +DE + G + H ++ +P++Q Sbjct: 1 MRILAVDTSTNVASAAILEDEVII----------------GEYNCNRGKTHSQRLMPMVQ 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 ++ +GLT DIDA + + GPG L +G T +++AFA + P I VH ++ Sbjct: 45 HLMETAGLTVSDIDAFSASIGPGSFTGLRIGVTTIKAMAFAAEKPVISVHTLDALAYNIP 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGI---GQYELLGESIDDAAGEAFD 167 +N V ++ + Q+ + G+ E L E + E D Sbjct: 105 FAEN-----LVCPMIDARNNQVFTAIYRFIGGKLERLTEYLGIPVTELAD 149 >UniRef50_Q11YX3 Probable peptidase M22, glycoprotease family n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11YX3_CYTH3 Length = 225 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 74/210 (35%), Gaps = 28/210 (13%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L I+TS +A++ D K + + + R H R +I Sbjct: 4 ILSIDTSTSICSVALHTDGKLIAHTETFLD----------------RSHSRNISHMIDHI 47 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L ++ D+ A A +AGPG + +G + + FA D P I V + LA LE Sbjct: 48 LAICEISMNDLSAYAVSAGPGSYTGMRIGTSTAKGFCFALDKPLISVSSLYS--LAAKLE 105 Query: 123 DNPPEFPFVALLVSGGHTQLIS------VTGIGQYELLGESIDDAAGEAFDKTAKLLGLD 176 P + ++ ++ + + I + + L + + + DK + Sbjct: 106 HKQPGI-YYVPMIDARRMEVYTTIYDSGLNEIAEEQALILTEESFQEQLIDKK---VLFG 161 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRPMTDRPG 206 G ++ + A P + G Sbjct: 162 GDGSRKFQEICSHSNAFFANDAYPSAEFMG 191 >UniRef50_A1SX27 Peptidase M22, glycoprotease n=3 Tax=Gammaproteobacteria RepID=A1SX27_PSYIN Length = 237 Score = 124 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 75/207 (36%), Gaps = 25/207 (12%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 + VL ++TS + +A+ + + Q LA R+H K +P ++ Sbjct: 5 LNVLCVDTSTEACSVAVLC--QTAAGQVINDQFM-----------LAPREHTTKILPTVE 51 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L+ +G+ D+D +AY GPG + +G ++ + LAF + + V ++ Sbjct: 52 QVLQSAGVNLSDMDFIAYGRGPGSFTGVRIGISIAQGLAFGSEKNMVGVSTLQAMAQQAF 111 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESID---------DAAGEAFDKTAK 171 + V + ++ +L+ D A E + A Sbjct: 112 KMKGAQD---VYAAIDARMGEVYFGHYQLDKKLMVLVNDEVVIKPADLIARQENIAENAV 168 Query: 172 LLGLDYPGGPLLSKMAAQGTAGRFVFP 198 L+G + P LS+ FP Sbjct: 169 LVGSGWAAYPELSEHFNAPEETEIEFP 195 >UniRef50_A8SIF9 Putative uncharacterized protein n=1 Tax=Parvimonas micra ATCC 33270 RepID=A8SIF9_9FIRM Length = 227 Score = 124 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 67/175 (38%), Gaps = 27/175 (15%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L I+TS + A+ +D ++ + +Q S H + ++ Sbjct: 1 MKILAIDTSTTHSSCAVMEDN-NIVGDFSINQ---------------SMSHNEILLVMVD 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 LK+ + +DID GPG + +G TV ++LA A + P + V+ +E Sbjct: 45 EMLKKLNIDIEDIDLFVAVTGPGSFTGIRIGVTVVKALAMALNKPIVAVNTLEALSFGVF 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLIS--VTGIGQYELLGE---SIDDAAGEAFDKTA 170 + L+ ++ G+ ++ +ID+ E DK Sbjct: 105 TDKKKI------PLIDARGERVYYGVYEGLENKNIVAPALLTIDELLEEFLDKGE 153 >UniRef50_A1ZHG0 Glycoprotease family n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHG0_9SPHI Length = 230 Score = 123 bits (310), Expect = 6e-27, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 70/209 (33%), Gaps = 22/209 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++ IETS +A++ E LL + ++ H LI+ Sbjct: 3 LIVSIETSTKVCSVALHQ-EGELLGDATL---------------WVAQSHSVMLTSLIKD 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 + + +D+DA+A GPG L +G + L FA D P + ++ + H +A L Sbjct: 47 VVSHAQQKLEDLDAIALGKGPGSYTGLRIGTATAKGLCFALDKPLVAINSL--HAMAAAL 104 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGES----IDDAAGEAFDKTAKLLGLDY 177 + + + ++ ++ + + ID + E T K++ Sbjct: 105 QHTSVDKHWFCPMIDARRMEVYCAVYDEDLQEQQATEAKIIDANSFEDILSTQKVVFFGD 164 Query: 178 PGGPLLSKMAAQGTAGRFVFPRPMTDRPG 206 + A P G Sbjct: 165 GAAKCKEVLGNNSNALFVDNFHPTARSVG 193 >UniRef50_A6TLG1 Peptidase M22, glycoprotease n=2 Tax=Alkaliphilus RepID=A6TLG1_ALKMQ Length = 236 Score = 123 bits (310), Expect = 7e-27, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 84/237 (35%), Gaps = 31/237 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L ++TS +A+ D EK LA ++ K R H ++ +P+IQ Sbjct: 1 MKILALDTSSIVGTVALLDGEK--LAGEIIVNYK--------------RTHSQQLMPMIQ 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L+ L KDID A + GPG L +G + +++A A D P + + ++G + Sbjct: 45 DLLESCALKPKDIDVFAVSLGPGSFTGLRIGVSTMKAMAQALDKPIVGISTLDGLAFNLL 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESID------DAAGEAFDKTAKLLG 174 + ++ + + + E + D D + FD + + Sbjct: 105 YSQ-----GIICPIIDAQRDMVYTASYRWSGEDFQQVKDYEMIHIDEMIQRFDGETESII 159 Query: 175 LDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQT 231 L + R VFP M S + A + + Sbjct: 160 FVGDAVEKLKERIQHSLKKRAVFPPGMVAMARA----SAIGELARRAVIEGRVQKPE 212 >UniRef50_Q9U0J7 Peptidase, M22 family, putative n=3 Tax=Plasmodium RepID=Q9U0J7_PLAF7 Length = 693 Score = 123 bits (310), Expect = 7e-27, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 77/194 (39%), Gaps = 26/194 (13%) Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYEL----LGESIDDAAGEAFDKTAKLLGL 175 +++ + ++ +LVSGG T + V + + + ++D G+ DK +LL L Sbjct: 326 IIQTEYMKDGYLCILVSGGSTDVYKVQKDTKNAINVCKISTTMDITIGDVIDKVTRLLEL 385 Query: 176 DYP--GGPLLSKMAAQG-------------TAGRFVFPRPMTDRPGLDFSFSGLKTFAAN 220 GGP L K A + FP P + +DFSFSG+ + Sbjct: 386 PVGLGGGPFLEKEAQKYLTNLKSASSENLQNDPFQPFPNPFSTNNIIDFSFSGIYNHMSK 445 Query: 221 TIRDNGTD---DQTRADIARAFEDAVVDTLMIKCKRAL----DQTGFKRLVMAGGVSANR 273 I+ ++ ++ + A + + L+ + + + K + + GGV N Sbjct: 446 IIKKLKSEKSFEKEKGRYAYYCQKNIFHHLLKQVNKIMYFSELHFNIKNVFIVGGVGCNN 505 Query: 274 TLRAKLAEMMKKRR 287 L L +M KR Sbjct: 506 FLYQSLKDMAAKRD 519 Score = 81.9 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 53/122 (43%), Gaps = 3/122 (2%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 ++GIE +CD+T I + D + ++ N + S K+ Y GV P S + + Sbjct: 87 IVGIENTCDDTCICVIDTDLNIIKNVIISHYKVVHSYEGVYPFFISSLNSLFLKHYVNKI 146 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATV-GRSLAFAWDVPAIPVHHMEGHLLAPML 121 L + K + +++ PG+ + G ++ V+H+ H+L+P+ Sbjct: 147 LD--NIDPKHVICYGFSSCPGIAKNMEAAKNYIGEKKKQNENIKISAVNHIFAHILSPLF 204 Query: 122 ED 123 + Sbjct: 205 FN 206 >UniRef50_C9CSP7 Peptidase M22, glycoprotease n=3 Tax=Rhodobacteraceae RepID=C9CSP7_9RHOB Length = 215 Score = 123 bits (309), Expect = 9e-27, Method: Composition-based stats. Identities = 44/203 (21%), Positives = 74/203 (36%), Gaps = 21/203 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +LG +TS A+ + L E ++ + +PL++ Sbjct: 7 LILGFDTSSAHCAAALLRGDSVLAQ----------------RREEMAKGQAERLMPLLEE 50 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLA--- 118 L E G+T D+DA+A GPG + + + R LA DVPAI V +E Sbjct: 51 LLTEGGVTWADLDAIAVGIGPGNFTGIRISVSAARGLALGLDVPAIGVSSLEAQAFGQEK 110 Query: 119 PMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLD-Y 177 P++ ++ L + H L + + I A E + + Y Sbjct: 111 PVISSLDARRDYLYLQIDAEHPGLFTADNLPPLATGARCIGHRADEIAARCGGTVAEARY 170 Query: 178 PGGPLLSKMAAQGTAGRFVFPRP 200 P ++++AA A P P Sbjct: 171 PMAEAIARVAATRLAQP-DLPAP 192 >UniRef50_B8D0Y9 O-sialoglycoprotein endopeptidase n=1 Tax=Halothermothrix orenii H 168 RepID=B8D0Y9_HALOH Length = 239 Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 23/169 (13%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M V+GI+TS + +Y+D+ L G + R H + +P+I Sbjct: 1 MLVMGIDTSGAVGSVGLYNDDGVL----------------GEINIKLKRRHSERLLPVID 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L E G I+ V GPG L +G + +S A ++P + + ++ + Sbjct: 45 RLLMECGREIDQINGVGVVTGPGSFTGLRIGMSTAKSFAQVLNIPVVGLSSLDILAYNLI 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLI--SVTGIGQYELLGESIDDAAGEAFD 167 + + + ++ + ++ G + + D+ A D Sbjct: 105 IAEGW-----IVPVIDARNARVYTSLYRGWSRDIKNAKVRDERALSVND 148 >UniRef50_UPI0001BC4F5F O-sialoglycoprotein endopeptidase n=1 Tax=Fusobacterium ulcerans ATCC 49185 RepID=UPI0001BC4F5F Length = 231 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 69/174 (39%), Gaps = 20/174 (11%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M +L I+T+ +A+Y+D+ G++ + VK+ +H + + Sbjct: 1 MLILAIDTATKIGSVALYEDKTGIIGE-INLYVKV--------------NHSNVIMKAVD 45 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 + SG T KD+D +A T GPG + +G + + LA++ + P I ++ ++ Sbjct: 46 SLFDLSGYTIKDVDKIAVTTGPGSFTGIRIGVAIAKGLAYSLEKPIIGINELDVLAETG- 104 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLG 174 + L+ ++ + L + GE D KL G Sbjct: 105 ----EEREGLIVPLIDARKERVYYSQYKYENRKLVRKEEYKDGELRDILEKLKG 154 >UniRef50_B3ERC8 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ERC8_AMOA5 Length = 228 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 60/142 (42%), Gaps = 18/142 (12%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IETS +A++ + K L L+ +R H + +I+ Sbjct: 3 LILSIETSTSVCSVALHREGKLLAYQSLF----------------IARSHAESLLTIIEH 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 ++ S T KD+ A+A + GPG L +GAT L +A ++P I V+ +E +LA Sbjct: 47 IVQLSQYTLKDLQAIAISKGPGSYTGLRIGATTATGLCYALNIPLISVNTLEAMVLAVKP 106 Query: 122 EDNPPEFPFVALLVSGGHTQLI 143 + ++ ++ Sbjct: 107 FN--INSALCCPMIDARRMEVY 126 >UniRef50_A5FJB4 Peptidase family M22-like protein n=10 Tax=Flavobacteriales RepID=A5FJB4_FLAJ1 Length = 223 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 59/141 (41%), Gaps = 20/141 (14%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L IET+ ++I + + +L ++ H K I+ A Sbjct: 4 ILNIETATKNCSVSIAKNGETILCKEIA---------------EEGYSHAEKLHVFIEEA 48 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 + ESG++ +D++AVA + GPG L +G + + L +A ++P I V ++ + Sbjct: 49 IAESGVSIQDLNAVAVSQGPGSYTGLRIGVSAAKGLCYALNIPLIAVDTLQTLASKAKIS 108 Query: 123 DNPPEFPFVALLVSGGHTQLI 143 + + ++ ++ Sbjct: 109 EGK-----IIPMLDARRMEVY 124 >UniRef50_A6ECY4 Putative glycoprotease family exported protein n=3 Tax=Bacteroidetes RepID=A6ECY4_9SPHI Length = 227 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 68/179 (37%), Gaps = 18/179 (10%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L IETS AI D ++ + E+AS H IQ Sbjct: 3 IILQIETSTQVCSAAISRDGHTIVLKE----------------EMASNIHAGSLTLFIQD 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 +K +G+ +DAVA + GPG L +G + + L +A + P I V ++ A L Sbjct: 47 VMKTAGIGFDALDAVAVSKGPGSYTGLRIGVSTAKGLCYALETPLIAVDTLQMMA-AGFL 105 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGES-IDDAAGEAFDKTAKLLGLDYPG 179 +P + ++ ++ + + ++ E++ K+ + + G Sbjct: 106 SQHPHFEGLICPMIDARRMEVFTAVFDPELLMVRPVEARIITEESYTDLLKMHTISFMG 164 >UniRef50_A0LXU5 Peptidase, family M22 n=5 Tax=Bacteroidetes RepID=A0LXU5_GRAFK Length = 219 Score = 121 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 27/204 (13%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 +L +ET+ + I D K L + S+ H K I+ Sbjct: 3 IILCLETATTNCSVGIAKDGKLLSLKEDNSKNY---------------SHAEKLHVFIEN 47 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 LKE+GL D+DA+A + GPG L +G + + L F+ D+P I V ++ L Sbjct: 48 ILKETGLKVDDLDAIAVSKGPGSYTGLRIGVSAAKGLCFSLDIPLISVPTLDLLAY--KL 105 Query: 122 EDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGES----IDD-AAGEAFDKTAKLLGLD 176 +D F ++ ++ S + + + ++ +D+ + E +K+ + Sbjct: 106 KDRDGIF---VSMLDARRMEVYSAVYDAEIKQIRDTEAQILDENSFSEYLEKSE--VHFI 160 Query: 177 YPGGPLLSKMAAQGTAGRFVFPRP 200 G ++ A P Sbjct: 161 GNGVTKFEEICKHSNAVFHKLKYP 184 >UniRef50_A8U9X9 Glycoprotein endopeptidase n=1 Tax=Carnobacterium sp. AT7 RepID=A8U9X9_9LACT Length = 240 Score = 121 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 29/144 (20%), Positives = 57/144 (39%), Gaps = 21/144 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M+VL I+TS IA+ DDEK + G + R+H + +P I Sbjct: 1 MKVLAIDTSNQAMSIAVLDDEKVI----------------GEITTNIKRNHSERLMPAID 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 +K+ + +++ + GPG L +G TV ++LA+ V + + ++ Sbjct: 45 ELMKDVQWQSSELNRIVVAKGPGSYTGLRIGVTVAKTLAWTLGVELVGISSLKILA---- 100 Query: 121 LEDNPPEFPFVALLVSGGHTQLIS 144 + ++ L + + Sbjct: 101 -GNCESSPHYLVPLFDARRKNIYT 123 >UniRef50_C0GCU7 Peptidase M22 glycoprotease n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GCU7_9FIRM Length = 242 Score = 121 bits (304), Expect = 4e-26, Method: Composition-based stats. Identities = 39/145 (26%), Positives = 63/145 (43%), Gaps = 21/145 (14%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 MRVLGI+++ +A+ +EK L + QVK + H + +PLI Sbjct: 1 MRVLGIDSATLVCSVALVSEEKTL--AEYNLQVK--------------KTHSERLLPLIA 44 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 A L+++GL D+D VA AGPG + +G +SL A VP V ++ Sbjct: 45 AMLRDTGLKPADLDGVAVAAGPGSFTGVRIGMVTAKSLGQALAVPLAGVSTLQALA---- 100 Query: 121 LEDNPPEFPFVALLVSGGHTQLISV 145 +P V ++ Q+ + Sbjct: 101 -AQHPHFPGVVCPILDARRDQVYNA 124 >UniRef50_C3JGW8 Universal bacterial protein YeaZ n=2 Tax=Rhodococcus erythropolis RepID=C3JGW8_RHOER Length = 227 Score = 121 bits (303), Expect = 5e-26, Method: Composition-based stats. Identities = 49/223 (21%), Positives = 76/223 (34%), Gaps = 24/223 (10%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQL-YSQVKLHADYGGVVPELASRDHVRKTVPLI 59 M VL I+TS + ++ + V + R H P I Sbjct: 1 MLVLAIDTSTPAVTAGVVSLSASSPDPVSPDAESPDTVETLAVRVTVNPRAHAEVLTPHI 60 Query: 60 QAALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAP 119 L E+GLT D++AV GPG L VG G + A VP V ++ A Sbjct: 61 LECLAEAGLTPADLNAVVVGVGPGPYTGLRVGMATGAAFGDALGVPVYGVCSLDAIAAAV 120 Query: 120 MLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAAGEAFDKTAKLLGLDYPG 179 P P + ++ ++ G + G +++ A +D Sbjct: 121 ------PTTPSLLVVTDARRREIYWARYDGGVRVEGPAVNSAGD-----------VDPSP 163 Query: 180 GPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTI 222 L++ A+ F P + P S +GL T AA I Sbjct: 164 SMLIAGSASHVD--FFDLPVDPAETP----SPAGLVTVAAREI 200 >UniRef50_C4LDA2 Peptidase M22 glycoprotease n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LDA2_TOLAT Length = 235 Score = 120 bits (301), Expect = 7e-26, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 63/162 (38%), Gaps = 20/162 (12%) Query: 1 MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQ 60 M++L I+T+ + A+ ++ L Q+ A + H R +P++ Sbjct: 1 MKILAIDTATEGCSAALLWNDAVLTREQV-----------------APQAHTRLILPMVS 43 Query: 61 AALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPM 120 L E+G + +DA+A+ GPG + +G + LA+ VP I V ++ Sbjct: 44 ELLAEAGASLSGLDAIAFGRGPGSFTGVRIGIGAAQGLAYGAGVPLIGVSTLQMLAQGAY 103 Query: 121 LEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDDAA 162 + + ++ + + L+ +D+A Sbjct: 104 RRQQAEKA---VAAIDARMNEIYIGAFLLRDGLMQSVVDEAV 142 >UniRef50_Q5E439 Predicted peptidase n=5 Tax=Vibrionaceae RepID=Q5E439_VIBF1 Length = 233 Score = 120 bits (301), Expect = 8e-26, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 65/142 (45%), Gaps = 20/142 (14%) Query: 2 RVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQA 61 ++L ++T+ + +A+ D K +YS+ + A R+H K +P + Sbjct: 4 KILAVDTATENCSVALIVDGK------VYSRRAV-----------APREHTIKILPFVDE 46 Query: 62 ALKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPML 121 LKE+G+ +D+DA+A+ GPG + +G + + LAF D+P + + +E A Sbjct: 47 VLKEAGVRLQDLDALAFGQGPGSFTGVRIGIGIAQGLAFGADLPMVGISTLEAMAQAGYR 106 Query: 122 EDNPPEFPFVALLVSGGHTQLI 143 + VA + ++ Sbjct: 107 LHGATQ---VAASIDARMGEVY 125 >UniRef50_P43990 Probable M22 peptidase homolog HI0388 n=24 Tax=Pasteurellaceae RepID=Y388_HAEIN Length = 236 Score = 119 bits (300), Expect = 9e-26, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 20/141 (14%) Query: 3 VLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAA 62 +L ++TS + +A+ Y K H + ELA R H ++ +P+I Sbjct: 6 LLALDTSTEACSVALL-----------YRGEKTH------INELAQRTHTKRILPMIDEI 48 Query: 63 LKESGLTAKDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLE 122 L SGL +DA+A+ GPG + VGA + + LAF D+P IP+ ++ A Sbjct: 49 LANSGLGLNQVDALAFGRGPGSFTGVRVGAGIAQGLAFGADLPVIPISNLTAMAQAAFEL 108 Query: 123 DNPPEFPFVALLVSGGHTQLI 143 V + ++ Sbjct: 109 HQAEN---VVAAIDARMNEVY 126 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.152 0.444 Lambda K H 0.267 0.0465 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,175,616,598 Number of Sequences: 3077464 Number of extensions: 91231077 Number of successful extensions: 275756 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 1767 Number of HSP's successfully gapped in prelim test: 283 Number of HSP's that attempted gapping in prelim test: 270183 Number of HSP's gapped (non-prelim): 2401 length of query: 337 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 208 effective length of database: 643,403,500 effective search space: 133827928000 effective search space used: 133827928000 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 94 (40.6 bits)