BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (217 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0ABU5 Enhancing lycopene biosynthesis protein 2 n=267 ... 439 e-122 UniRef50_B6XFP3 Putative uncharacterized protein n=1 Tax=Provide... 288 1e-76 UniRef50_B4EXM9 Enhancing lycopene biosynthesis protein 2 n=2 Ta... 273 3e-72 UniRef50_Q2NWH9 Sigma cross-reacting protein 27A (SCRP-27A) n=1 ... 238 8e-62 UniRef50_Q3AU80 Es1 family protein n=30 Tax=Bacteria RepID=Q3AU8... 226 5e-58 UniRef50_Q6APY5 Putative uncharacterized protein n=1 Tax=Desulfo... 200 3e-50 UniRef50_Q6MQ93 Enhancing lycopene biosynthesis protein 2 n=3 Ta... 197 2e-49 UniRef50_A9V3H5 Predicted protein n=5 Tax=Fungi/Metazoa group Re... 195 9e-49 UniRef50_A3M9Q1 Enhancing lycopene biosynthesis protein 2 n=5 Ta... 194 1e-48 UniRef50_Q2RPB9 ThiJ/PfpI n=7 Tax=Bacteria RepID=Q2RPB9_RHORT 189 6e-47 UniRef50_P30042 ES1 protein homolog, mitochondrial n=45 Tax=Meta... 187 2e-46 UniRef50_A4IY89 DJ-1/PfpI family protein n=24 Tax=Gammaproteobac... 187 2e-46 UniRef50_B9EL82 ES1 protein homolog, mitochondrial n=11 Tax=Eume... 175 9e-43 UniRef50_C9J1C8 Putative uncharacterized protein C21orf33 (Fragm... 172 7e-42 UniRef50_C5LKS2 Putative uncharacterized protein n=4 Tax=Eukaryo... 161 1e-38 UniRef50_Q21UI1 ThiJ/PfpI n=1 Tax=Rhodoferax ferrireducens T118 ... 161 1e-38 UniRef50_C0R4T2 Enhancing lycopene biosynthesis protein 2, putat... 158 1e-37 UniRef50_Q2GLV6 Es1 family protein n=5 Tax=Anaplasma RepID=Q2GLV... 156 4e-37 UniRef50_B9Z0T4 ThiJ/PfpI domain protein n=1 Tax=Lutiella nitrof... 150 2e-35 UniRef50_D2W324 Glutamine amidotransferase domain-containing pro... 150 4e-35 UniRef50_Q90257 ES1 protein, mitochondrial n=3 Tax=Danio rerio R... 146 4e-34 UniRef50_P30042-2 Isoform Short of ES1 protein homolog, mitochon... 140 4e-32 UniRef50_Q8WQI1 Lycopene biosynthesis-enhancing protein n=2 Tax=... 139 6e-32 UniRef50_UPI0000DB6CF4 PREDICTED: similar to es1 protein n=2 Tax... 130 4e-29 UniRef50_C5CG98 ThiJ/PfpI domain protein n=1 Tax=Kosmotoga olear... 127 3e-28 UniRef50_UPI000186E026 conserved hypothetical protein n=1 Tax=Pe... 112 1e-23 UniRef50_Q48464 Enhancing lycopene biosynthesis protein 2 homolo... 111 2e-23 UniRef50_C1C1F4 Enhancing lycopene biosynthesis protein 2 n=2 Ta... 100 6e-20 UniRef50_UPI0000E47ADE PREDICTED: similar to KNP-Ia, partial n=2... 79 7e-14 UniRef50_D2V705 Predicted protein n=1 Tax=Naegleria gruberi RepI... 74 2e-12 UniRef50_A6NYE6 Putative uncharacterized protein n=1 Tax=Bactero... 73 7e-12 UniRef50_Q29CB8 GA12322 n=5 Tax=Endopterygota RepID=Q29CB8_DROPS 44 0.004 UniRef50_C8WTD3 ThiJ/PfpI domain protein n=5 Tax=Bacillales RepI... 42 0.020 UniRef50_B0SBM0 Transcription regulator, DJ-1/PfpI family intrac... 41 0.025 UniRef50_A4X5M7 ThiJ/PfpI domain protein n=4 Tax=Actinomycetales... 40 0.049 >UniRef50_P0ABU5 Enhancing lycopene biosynthesis protein 2 n=267 Tax=Bacteria RepID=ELBB_ECOLI Length = 217 Score = 439 bits (1129), Expect = e-122, Method: Compositional matrix adjust. Identities = 217/217 (100%), Positives = 217/217 (100%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET Sbjct: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL Sbjct: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV Sbjct: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE Sbjct: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 >UniRef50_B6XFP3 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XFP3_9ENTR Length = 232 Score = 288 bits (736), Expect = 1e-76, Method: Compositional matrix adjust. Identities = 134/216 (62%), Positives = 171/216 (79%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK I VILSGCGV+DGSEIHE+VLT+LA+S++ A+ FAPD+ Q VINH+ GE TET Sbjct: 17 MKSIAVILSGCGVFDGSEIHESVLTMLALSKNNAEVHFFAPDEDQATVINHINGELKTET 76 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RN + E+ARI+RG+I PL+ D ++LDALI+PGGFG AKNL NFA+ GSEC ++++L +L Sbjct: 77 RNQMEESARISRGKIAPLSSVDPSKLDALIIPGGFGVAKNLCNFATKGSECEINKQLLSL 136 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q MHQ KPLG MCIAP MLPK+ + ++LTIG D +T +E+MG HV C VD+IVV Sbjct: 137 VQVMHQQKKPLGLMCIAPVMLPKMLNTSVKLTIGNDTETIAQIEKMGGLHVECTVDNIVV 196 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 DE+NK+VTTPAYMLAQ+IAEA GI+KLV +VL +A Sbjct: 197 DENNKVVTTPAYMLAQSIAEANVGINKLVEKVLEMA 232 >UniRef50_B4EXM9 Enhancing lycopene biosynthesis protein 2 n=2 Tax=Proteus mirabilis RepID=B4EXM9_PROMH Length = 216 Score = 273 bits (698), Expect = 3e-72, Method: Compositional matrix adjust. Identities = 126/216 (58%), Positives = 168/216 (77%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK I VILSGCGV+DGSEIHE+VLT+LA+S++ A+ F+P+ Q VINH+TGE E Sbjct: 1 MKSIAVILSGCGVFDGSEIHESVLTMLALSQNKAEVHYFSPNDFQPTVINHITGEEKAEK 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RN++ EA+RI+RG+I PL+ A A DA+I+PGGFGAAKNL NFA+ G +C ++++L Sbjct: 61 RNMMEEASRISRGKISPLSDAKAENFDAVIIPGGFGAAKNLCNFATKGVQCEINQQLLTF 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q MHQ KPLG MCIAP MLPK+ + P++LTIG D TAE++ EMG H+ CPVD+IVV Sbjct: 121 VQKMHQQKKPLGLMCIAPVMLPKMLNAPVKLTIGNDAKTAEMITEMGGIHINCPVDEIVV 180 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 D++ ++VTTPAYMLA++IA+A GI+KLV +VL +A Sbjct: 181 DDEYRVVTTPAYMLAESIAQAQVGIEKLVKKVLEMA 216 >UniRef50_Q2NWH9 Sigma cross-reacting protein 27A (SCRP-27A) n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NWH9_SODGM Length = 171 Score = 238 bits (608), Expect = 8e-62, Method: Compositional matrix adjust. Identities = 115/168 (68%), Positives = 135/168 (80%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK+IGV+LSGCGV DGSEI EAVLTLLAI R+G AVCFA DK Q+ V+NHL+GE E Sbjct: 1 MKRIGVVLSGCGVNDGSEIQEAVLTLLAIDRTGLDAVCFATDKPQLQVVNHLSGEQTDER 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+EAARI RG+I+PLA A A +LDALIVPGGFG AKNLSN A G++C VD EL L Sbjct: 61 RNVLVEAARIARGQIQPLAAASAEDLDALIVPGGFGVAKNLSNLAQTGADCEVDAELAQL 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGA 168 QA+H KPLGF+CIAPA+LPKI PLRLT+GTD+D AE+++ MGA Sbjct: 121 VQALHLQRKPLGFICIAPALLPKILAVPLRLTLGTDVDAAEMVDTMGA 168 >UniRef50_Q3AU80 Es1 family protein n=30 Tax=Bacteria RepID=Q3AU80_CHLCH Length = 225 Score = 226 bits (575), Expect = 5e-58, Method: Compositional matrix adjust. Identities = 114/217 (52%), Positives = 153/217 (70%), Gaps = 2/217 (0%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTG-EAMTETR 61 +IGV+L+GCG DGSEIHEAVLTLLAIS+ GAQA+C APD Q V+NHLTG E + E+R Sbjct: 8 RIGVLLAGCGYLDGSEIHEAVLTLLAISKKGAQAICLAPDMVQHHVVNHLTGQEVIGESR 67 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL+EAARI RG I L+ + LDA IVPGG+GAAKNLS+FA G+ CT+ ++ Sbjct: 68 NVLVEAARIARGAIHNLSDIASLHLDAFIVPGGYGAAKNLSSFAFDGTPCTIHPDVATAI 127 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDF-PLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q ++AGKP+GF+CI+P + K+ + +TIG D TA +E MGA H+ C V V Sbjct: 128 QLFYKAGKPMGFICISPVLAAKVLGSEKIEVTIGNDASTAASIEAMGARHINCVVTKAHV 187 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 + + IV+TPAYML ++A+ A+GI++LV V+ L + Sbjct: 188 SKPHNIVSTPAYMLEASLADIATGIEQLVGNVVELVK 224 >UniRef50_Q6APY5 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6APY5_DESPS Length = 218 Score = 200 bits (508), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 103/215 (47%), Positives = 138/215 (64%), Gaps = 1/215 (0%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 KI VILSGCG DGSEIHEA ++L AI G CFAPD Q+ VINHL GE E+RN Sbjct: 5 KIAVILSGCGHLDGSEIHEATMSLWAIHSHGCDYHCFAPDIDQLHVINHLNGEETGESRN 64 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL+E+ARI RG+I L Q A + DALI+PGGFGAAKNLS++ S G C V+ E+K Sbjct: 65 VLVESARIARGKISDLNQFKAEDYDALIIPGGFGAAKNLSDYFSAGVNCQVNPEVKKAII 124 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 MHQA KP+G +CIAP +L ++ + +T+G D + + E+MGA IV+D Sbjct: 125 DMHQAKKPIGALCIAPMLLARLIS-GVEITLGQDPISHQNAEKMGASTQTTDHGQIVIDR 183 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 N +V+TP YML + + +G D L++ ++ + E Sbjct: 184 KNLVVSTPCYMLDARVDQIGAGADALMTEIVEMME 218 >UniRef50_Q6MQ93 Enhancing lycopene biosynthesis protein 2 n=3 Tax=Bacteria RepID=Q6MQ93_BDEBA Length = 217 Score = 197 bits (500), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 96/195 (49%), Positives = 135/195 (69%), Gaps = 2/195 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKI V+LSGCG DGSEI E+V L+ + ++GA+ CFAPD Q+ + NH+ GEA E Sbjct: 1 MKKIAVVLSGCGHRDGSEITESVSLLIGLHQAGAEVHCFAPD-IQIPITNHINGEAQGEK 59 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R++L EAARI RG I+ L + A + DA++ PGG+GAAKNLSN+A G++C V+ ++K + Sbjct: 60 RSLLTEAARIARGHIQSLDKLHAKDFDAVVFPGGYGAAKNLSNWAEKGAQCEVNPDVKRV 119 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIF-DFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 H A KP+G +CIAP ++ K+ D + +TIG D TA +E+ GA H CPV+D + Sbjct: 120 ILEFHSASKPIGALCIAPVLVAKVLGDKKVTVTIGDDAATAAEIEKTGAIHEECPVNDYI 179 Query: 180 VDEDNKIVTTPAYML 194 D ++K+VTTPAYM Sbjct: 180 TDRESKVVTTPAYMY 194 >UniRef50_A9V3H5 Predicted protein n=5 Tax=Fungi/Metazoa group RepID=A9V3H5_MONBE Length = 245 Score = 195 bits (496), Expect = 9e-49, Method: Compositional matrix adjust. Identities = 101/221 (45%), Positives = 139/221 (62%), Gaps = 10/221 (4%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ ++LSG GVYDG+E+HEA + A+SR GA FAPDKQQ V+NH+TGE M E+RN Sbjct: 23 RVALVLSGSGVYDGTEVHEASAAMGALSRQGADYKIFAPDKQQHHVVNHMTGEEMDESRN 82 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL+EAARI RG I+ L + A+ DA++VPGGFGAAKNLSNFA G+ CTVD L + + Sbjct: 83 VLVEAARIARGNIQALDKLQVADFDAVVVPGGFGAAKNLSNFAVEGAACTVDATLTDILK 142 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDT--------AEVLEEMGAEHVPCP 174 H KP+GF CIAP + +F +T+G+D ++ A +EMGA++V Sbjct: 143 KFHAEQKPMGFCCIAPVIAANLFKG--EVTMGSDTESDKWPFAGAAGACKEMGADYVVGD 200 Query: 175 VDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVL 215 + VD+ N+IVT PA+M + + +V VL L Sbjct: 201 ESKVHVDQANRIVTAPAFMANTAVHLVQDNVTNMVQTVLEL 241 >UniRef50_A3M9Q1 Enhancing lycopene biosynthesis protein 2 n=5 Tax=Acinetobacter RepID=A3M9Q1_ACIBT Length = 220 Score = 194 bits (494), Expect = 1e-48, Method: Compositional matrix adjust. Identities = 105/220 (47%), Positives = 143/220 (65%), Gaps = 3/220 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTE 59 MKK+ VILSGCG DGSEI E+VLTLLA+ + FAPD+ VI+H++GE MTE Sbjct: 1 MKKVAVILSGCGYLDGSEIRESVLTLLALDTVNIEYQIFAPDEPLFHVIDHVSGEINMTE 60 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 RN+L EA RI RG+I L Q + E D LI+PGGFG AKNLS FA G+E V + + Sbjct: 61 RRNILQEAGRIARGKISSLDQLNENEFDGLILPGGFGVAKNLSTFAFKGAEARVHGTVAS 120 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIF-DFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + +A HQ+ KP+G +CI+PA+L F + +T+G+D++ A+ +E+ G+ H C D Sbjct: 121 ILKAFHQSKKPIGAICISPALLALTFGELHPTITLGSDLNIAKEIEKTGSIHHVCQTSDC 180 Query: 179 VVDEDNKIVTTPAYMLAQ-NIAEAASGIDKLVSRVLVLAE 217 VVD+ N VTTPAYM Q N+ + +GI LV+ + LA Sbjct: 181 VVDKQNLFVTTPAYMDDQANLKDIYTGITSLVNTMTALAN 220 >UniRef50_Q2RPB9 ThiJ/PfpI n=7 Tax=Bacteria RepID=Q2RPB9_RHORT Length = 227 Score = 189 bits (480), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 104/213 (48%), Positives = 142/213 (66%), Gaps = 1/213 (0%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 + V+LSGCGV+DG+EIHE+VLTLLAI R G A CFAPD+ Q VI+H +G+ ETRN Sbjct: 6 RFAVLLSGCGVFDGAEIHESVLTLLAIDRQGGVARCFAPDRPQYHVIDHRSGQPTGETRN 65 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL E+ARI RG I LA D A DALI+PGGFGAAKNL +FA G++C VD ++ + Sbjct: 66 VLCESARIARGAIDDLADFDPAAFDALILPGGFGAAKNLCSFAIDGADCAVDPTVERALR 125 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A AG +G +CIAP +L ++F + LTIG+D TAE + +GA H ++VVD Sbjct: 126 AARAAGLAIGALCIAPVVLARVFGEGV-LTIGSDAATAEAITALGAHHQKATHAEVVVDR 184 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVL 215 ++V++P YML +I++ A G + V ++ L Sbjct: 185 ALRLVSSPCYMLDASISQIAEGAENTVKALIAL 217 >UniRef50_P30042 ES1 protein homolog, mitochondrial n=45 Tax=Metazoa RepID=ES1_HUMAN Length = 268 Score = 187 bits (476), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 101/223 (45%), Positives = 142/223 (63%), Gaps = 11/223 (4%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTETR 61 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ + E+R Sbjct: 44 RVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGESR 103 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL E+ARI RG+I LA AA DA I PGGFGAAKNLS FA G +C V++E++ + Sbjct: 104 NVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLSTFAVDGKDCKVNKEVERVL 163 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID---------TAEVLEEMGAEHVP 172 + HQAGKP+G CIAP + K+ + +T+G + + TAE ++ +GA+H Sbjct: 164 KEFHQAGKPIGLCCIAPVLAAKVLR-GVEVTVGHEQEEGGKWPYAGTAEAIKALGAKHCV 222 Query: 173 CPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVL 215 V + VD+ NK+VTTPA+M + GI +V +VL L Sbjct: 223 KEVVEAHVDQKNKVVTTPAFMCETALHYIHDGIGAMVRKVLEL 265 >UniRef50_A4IY89 DJ-1/PfpI family protein n=24 Tax=Gammaproteobacteria RepID=A4IY89_FRATW Length = 219 Score = 187 bits (475), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 104/216 (48%), Positives = 147/216 (68%), Gaps = 3/216 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHL--TGEAMT 58 M K+ V+LSGCG DGSEIHE VLT+LA+ + G + A ++ Q VINHL + ++ Sbjct: 1 MAKVAVVLSGCGYLDGSEIHETVLTILALEKQGVEWQGVALNRDQKQVINHLHQSVDSKA 60 Query: 59 ETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSEC-TVDREL 117 RN+L E+ARITRG + +A AD+ + DA+I PGGFGAAKN+ +FA +G++ +D E+ Sbjct: 61 SPRNILEESARITRGNVIDIADADSDDYDAIIFPGGFGAAKNIMDFAFVGNDSYQMDEEV 120 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDD 177 A+A + A KP G++CIAP M+P ++ + T+GTD +T +L + GAE + D Sbjct: 121 LKFARAFYLADKPAGYICIAPLMIPLVYPEGTKATVGTDENTTAILAKKGAEAIIMDATD 180 Query: 178 IVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVL 213 I VDE KIV+TPAYM A+NI EAA GI+KLV +V+ Sbjct: 181 ICVDESVKIVSTPAYMCARNILEAAQGIEKLVEKVV 216 >UniRef50_B9EL82 ES1 protein homolog, mitochondrial n=11 Tax=Eumetazoa RepID=B9EL82_SALSA Length = 259 Score = 175 bits (444), Expect = 9e-43, Method: Compositional matrix adjust. Identities = 94/224 (41%), Positives = 140/224 (62%), Gaps = 11/224 (4%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTG-EAMTETR 61 K+ V+LSGCGVYDG+EIHEA L+ +SR GA+ +APD Q+ VI+H G A E+R Sbjct: 35 KVAVVLSGCGVYDGTEIHEASAILVHLSRGGAEVQMYAPDVSQMHVIDHGKGVPAENESR 94 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL E+ARI RG I L + + DA+I PGGFGAAKNLS FA G +C ++ +++ + Sbjct: 95 NVLSESARIARGNITDLVKLSVSNHDAIIFPGGFGAAKNLSTFAVDGPDCKINADVERVL 154 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID---------TAEVLEEMGAEHVP 172 + H+AGKP+G CI+P + K+ + +T+G + + TA ++ +GA+H+ Sbjct: 155 KDFHKAGKPIGLCCISPVLAAKLLP-GVEVTVGHEEEKGGKWPYAGTAGAIKALGAKHIV 213 Query: 173 CPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 V + VD+ NK+VT+PA+M + GI +V+ VL L+ Sbjct: 214 KEVTEAHVDQKNKVVTSPAFMCETQLHLIFDGIGAMVTNVLKLS 257 >UniRef50_C9J1C8 Putative uncharacterized protein C21orf33 (Fragment) n=1 Tax=Homo sapiens RepID=C9J1C8_HUMAN Length = 257 Score = 172 bits (436), Expect = 7e-42, Method: Compositional matrix adjust. Identities = 101/250 (40%), Positives = 142/250 (56%), Gaps = 38/250 (15%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTETR 61 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ + E+R Sbjct: 6 RVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGESR 65 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNL-------------------- 101 NVL E+ARI RG+I LA AA DA I PGGFGAAKNL Sbjct: 66 NVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLCVFELQGLPLSMWSRWEGGA 125 Query: 102 -------SNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIG 154 S FA G +C V++E++ + + HQAGKP+G CIAP + K+ + +T+G Sbjct: 126 PVCCPMWSTFAVDGKDCKVNKEVERVLKEFHQAGKPIGLCCIAPVLAAKVLR-GVEVTVG 184 Query: 155 TDID---------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGI 205 + + TAE ++ +GA+H V + VD+ NK+VTTPA+M + GI Sbjct: 185 HEQEEGGKWPYAGTAEAIKALGAKHCVKEVVEAHVDQKNKVVTTPAFMCETALHYIHDGI 244 Query: 206 DKLVSRVLVL 215 +V +VL L Sbjct: 245 GAMVRKVLEL 254 >UniRef50_C5LKS2 Putative uncharacterized protein n=4 Tax=Eukaryota RepID=C5LKS2_9ALVE Length = 348 Score = 161 bits (408), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 86/201 (42%), Positives = 123/201 (61%), Gaps = 11/201 (5%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMT-ET 60 K++ V+LSGCG DGSEI EAV L +SR+ + CFAPD QQ+ V++H G + Sbjct: 124 KRVAVVLSGCGHLDGSEIREAVFVLTELSRANTKYQCFAPDIQQMHVVDHSDGSIDSGSK 183 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+EA+RI R PL A + DAL+ PGGFGAAKNLSNFA GS +V E++ Sbjct: 184 RNVLVEASRIGREGTLPLTDLKAKDYDALMFPGGFGAAKNLSNFAVKGSGMSVHPEVERA 243 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTD--------IDTAEVLEEMGAEHVP 172 + +QAG P+G +CIAP + K+ + +T+G+D A+ ++E+G +H Sbjct: 244 IKEFNQAGHPIGLVCIAPVLAAKVLN--AEVTMGSDEVSEEYPNASAAQAVKEIGGKHFN 301 Query: 173 CPVDDIVVDEDNKIVTTPAYM 193 +++ VD K+VT+ AYM Sbjct: 302 TKLNEAHVDTAKKVVTSAAYM 322 >UniRef50_Q21UI1 ThiJ/PfpI n=1 Tax=Rhodoferax ferrireducens T118 RepID=Q21UI1_RHOFD Length = 226 Score = 161 bits (408), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 93/219 (42%), Positives = 136/219 (62%), Gaps = 7/219 (3%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMT-ET 60 +KI V+L+GCG DG+E+ EAVLTLLA+ + GA C AP+ Q VINH+TGE + Sbjct: 5 RKIAVLLAGCGHLDGAEVREAVLTLLALDQHGAAFQCIAPNAPQFHVINHITGEPVAGAQ 64 Query: 61 RNVLIEAARITR-GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 RN+L E++RI R G+ LA+A A+ DAL++PGG+G AKN +FA G++ V ++ A Sbjct: 65 RNILEESSRIARLGQCLDLAKAKVADYDALVMPGGYGVAKNNCSFAFKGADAEVRPDVAA 124 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIF---DFPLRLTIGTDIDTAEVLEEMGAEHVPCP-V 175 + A KP+G +CIAPA++ D LT+G D A + ++G H P Sbjct: 125 FVRGFFDAKKPVGAICIAPALVALALHQVDDSATLTLGNDAGVAAAMGQLGQRHQNTPNA 184 Query: 176 DDIVVDEDNKIVTTPAYMLAQ-NIAEAASGIDKLVSRVL 213 +IV+DE +K+VTTPAYM +++ GI++ V+ VL Sbjct: 185 REIVIDEAHKLVTTPAYMFDDARLSDVFVGIERCVAEVL 223 >UniRef50_C0R4T2 Enhancing lycopene biosynthesis protein 2, putative n=13 Tax=Rickettsiales RepID=C0R4T2_WOLWR Length = 241 Score = 158 bits (399), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 83/198 (41%), Positives = 115/198 (58%), Gaps = 9/198 (4%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K V+LSGCG DG E+ EAVL+LL + + CFAPD V+NH T EA E RN Sbjct: 13 KAAVVLSGCGHLDGVEVREAVLSLLVLDQQEVDVKCFAPDINITQVMNHRTKEATKEKRN 72 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL+EAARI RGEI L +A A D L+VPGG+G AKNLS+ A TV E + L Sbjct: 73 VLVEAARIARGEIYDLKEAKAENFDMLVVPGGYGVAKNLSDLAESKDMVTVMPEFERLVS 132 Query: 123 AMHQAGKPLGFMCIAPAMLPKIF-------DFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 KP+G +CI+PA++ I + +++TIG D +++E +G EH+ C Sbjct: 133 EFFVTKKPIGAICISPAIIVSILSSKIGKEESKVKVTIGD--DREQLIERLGGEHIKCDT 190 Query: 176 DDIVVDEDNKIVTTPAYM 193 + + DE++ + + AYM Sbjct: 191 ELSIEDEEHNVFSCSAYM 208 >UniRef50_Q2GLV6 Es1 family protein n=5 Tax=Anaplasma RepID=Q2GLV6_ANAPZ Length = 221 Score = 156 bits (395), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 84/206 (40%), Positives = 120/206 (58%), Gaps = 4/206 (1%) Query: 6 VILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLI 65 V+L GCG DGSEI EAVL LLA+ G C AP+ +QVDV++HL+G + E R+++ Sbjct: 5 VLLCGCGHMDGSEIREAVLALLALDSYGINVTCCAPNIKQVDVVDHLSGSTLEEERDIMS 64 Query: 66 EAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMH 125 E+ARI RG + + D LI+PGGFG AKN S+ S +V E+K H Sbjct: 65 ESARIARGNVVDPKDISPNDFDMLILPGGFGVAKNYSDILKGESPVSVLEEVKQTIVKFH 124 Query: 126 QAGKPLGFMCIAPAMLPKIFD--FPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDED 183 + K +G +CIAPA++ +++T+G DID+ ++ G EHV C DD V D D Sbjct: 125 KEKKAIGAICIAPAIVAASLSSVSKVKVTLGEDIDS--IISRCGGEHVFCETDDYVADID 182 Query: 184 NKIVTTPAYMLAQNIAEAASGIDKLV 209 + +TPAYM ++ + GI K+V Sbjct: 183 MGVFSTPAYMRKDSLHKIHVGIHKMV 208 >UniRef50_B9Z0T4 ThiJ/PfpI domain protein n=1 Tax=Lutiella nitroferrum 2002 RepID=B9Z0T4_9NEIS Length = 226 Score = 150 bits (380), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 80/202 (39%), Positives = 120/202 (59%), Gaps = 7/202 (3%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 + + ++LSGCGVYDGSEI EAV ++A+S++G +APD+ Q+ V++H G+ E R Sbjct: 3 QTVAIVLSGCGVYDGSEITEAVGVVIALSQAGLPYAFYAPDRAQMHVVDHARGQESGEAR 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N+L EAARI RG+IRPL + DAA+ A++ PGGFGAAKNL+ F G + + ++ A Sbjct: 63 NILSEAARIARGQIRPLTELDAAQHSAIVFPGGFGAAKNLTTFIKDGRDAVLYDDVAAAV 122 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFD----FPLRLTIGTDID---TAEVLEEMGAEHVPCP 174 + Q KP+ +C AP + I + +T G+ + A+ L G HV P Sbjct: 123 RPFVQQHKPVVALCAAPLVQGLIARDEGLAGVNITFGSYAEGQAMADALTSWGQTHVETP 182 Query: 175 VDDIVVDEDNKIVTTPAYMLAQ 196 VD VD ++ ++ PAYM + Sbjct: 183 VDQACVDLAHRFISAPAYMYGE 204 >UniRef50_D2W324 Glutamine amidotransferase domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2W324_NAEGR Length = 305 Score = 150 bits (378), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 89/215 (41%), Positives = 120/215 (55%), Gaps = 4/215 (1%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTETRN 62 + VILSGCG DGSEI EAV ++ +S+ G + FAPD Q + +HLT E RN Sbjct: 49 VAVILSGCGYLDGSEITEAVSVMVHLSKKGYKLAFFAPDINQEETYDHLTKNVEKNEVRN 108 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + EA+RITR + L Q +ALI+PGGFG AKNLSN+A + + E++ Sbjct: 109 IRTEASRITRQNVLRLDQFRPQAFEALIIPGGFGVAKNLSNYAENPTNFKIHPEVEKAIV 168 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIG-TDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + H+A P+G CIAP + K +T+G +D E + GA VP +VVD Sbjct: 169 STHEAKIPIGMCCIAPVLAAKAIP-QCSITLGDSDPSVTEHAKSYGANCVPKSTSQVVVD 227 Query: 182 EDNKIVTTPAYMLAQNIA-EAASGIDKLVSRVLVL 215 +DNK+VTTPAYM A E GI +V V+ L Sbjct: 228 KDNKLVTTPAYMGKNPTAFEVFDGIGSMVDSVIEL 262 >UniRef50_Q90257 ES1 protein, mitochondrial n=3 Tax=Danio rerio RepID=ES1_DANRE Length = 270 Score = 146 bits (369), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 73/232 (31%), Positives = 133/232 (57%), Gaps = 20/232 (8%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTETR 61 I V+ SGCG +DG++IHEA T+ +SR+GA+ FAP++QQ+ V++H+ + + ++ R Sbjct: 37 NIAVVFSGCGWWDGTDIHEAAYTMYHLSRNGARFQIFAPNQQQMHVMDHMKMQPSSSDNR 96 Query: 62 NVLIEAARITRG----EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 N+++E+AR + G ++ L++ DA DA+I PGG G KN+S F+ G +C ++ ++ Sbjct: 97 NIMMESARFSHGQGMMQMNDLSKLDANSFDAVIFPGGHGIVKNMSTFSKDGKDCKLNNDV 156 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTA------------EVLEE 165 + + + H+A KP+G +AP + ++ L +T+G + D + + ++ Sbjct: 157 ERVLKDFHRARKPIGLSSMAPLLACRVLP-SLEVTMGYERDESSRWGRWPNTNMVQAVKS 215 Query: 166 MGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAA--SGIDKLVSRVLVL 215 MGA H + VDE NK+++TP +M + GI +V V+ + Sbjct: 216 MGARHNTREPYEAYVDEKNKVISTPTFMWETDYHYHYIFDGIGNMVKHVMRM 267 >UniRef50_P30042-2 Isoform Short of ES1 protein homolog, mitochondrial n=6 Tax=Euarchontoglires RepID=P30042-2 Length = 237 Score = 140 bits (352), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 87/223 (39%), Positives = 120/223 (53%), Gaps = 42/223 (18%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTETR 61 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ + E+R Sbjct: 44 RVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGESR 103 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL E+ARI RG+I LA AA DA I PGGFGAAKNL Sbjct: 104 NVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNL-------------------- 143 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID---------TAEVLEEMGAEHVP 172 CIAP + K+ + +T+G + + TAE ++ +GA+H Sbjct: 144 -----------LCCIAPVLAAKVLR-GVEVTVGHEQEEGGKWPYAGTAEAIKALGAKHCV 191 Query: 173 CPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVL 215 V + VD+ NK+VTTPA+M + GI +V +VL L Sbjct: 192 KEVVEAHVDQKNKVVTTPAFMCETALHYIHDGIGAMVRKVLEL 234 >UniRef50_Q8WQI1 Lycopene biosynthesis-enhancing protein n=2 Tax=Tetrahymena thermophila RepID=Q8WQI1_TETTH Length = 380 Score = 139 bits (351), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 60/124 (48%), Positives = 91/124 (73%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 KK+ +ILSGCGVYDGSE+ E V ++ +++S CFAP++ Q+ V+NH+TGE TET Sbjct: 46 FKKVAIILSGCGVYDGSEVTEVVSLMVHLNKSHVSFQCFAPNQDQLHVVNHITGETTTET 105 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+E+ARI RGE++ + Q + A+++PGGFGAAKNLS++A G+ TV+ E++ + Sbjct: 106 RNVLVESARIARGEVKDITQLKGEDYQAVLLPGGFGAAKNLSDYAVNGTNFTVNSEVERV 165 Query: 121 AQAM 124 + + Sbjct: 166 LRVI 169 >UniRef50_UPI0000DB6CF4 PREDICTED: similar to es1 protein n=2 Tax=Apocrita RepID=UPI0000DB6CF4 Length = 226 Score = 130 bits (326), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 72/217 (33%), Positives = 123/217 (56%), Gaps = 12/217 (5%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTG--EAMTETR 61 + VIL GCG DG+EI EA+ ++ I + +APD + ++H + + +R Sbjct: 3 VAVILCGCGYLDGTEISEAMSAMIHICLKDMKPHFYAPDVNICETVDHFIKKPDPDSPSR 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N L+EAARI R +I+PL Q A + +AL++PGGFGAAK LSNFA G++CT+ +L+ + Sbjct: 63 NALVEAARIARSDIKPLCQCQACKHEALVIPGGFGAAKTLSNFAEKGADCTIHPDLEQII 122 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTD--------IDTAEVLEEMGAEHVPC 173 + + GKP+ +CI+ ++ ++ +++T+G + D + ++MGA+ Sbjct: 123 EDFYYEGKPIASICISSVLVARVLK-GVKITLGKESPAEEWPFADAIKKAKDMGAKIEQK 181 Query: 174 PVDDIVVDEDNKIVTTPAYMLAQ-NIAEAASGIDKLV 209 V + + + +TPA+M AE +GI KL+ Sbjct: 182 SVKGMTKCKKYNVFSTPAWMYKPATFAEIYTGIGKLI 218 >UniRef50_C5CG98 ThiJ/PfpI domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CG98_KOSOT Length = 206 Score = 127 bits (319), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 68/190 (35%), Positives = 108/190 (56%), Gaps = 2/190 (1%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K G++LSGCG+ DG++I E +LT L++ + G + FAP++ Q DVI+H T + E RN Sbjct: 2 KAGILLSGCGLGDGTQIEEVMLTYLSLDKYGIDYITFAPNEMQHDVIDHYTEKPQNEKRN 61 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 +LIE+ARI RG+I + + ++DA+I+PGG G KNLS F TV++ + L + Sbjct: 62 ILIESARIGRGKICDIREVSCKDIDAIIIPGGLGVFKNLSTFIVDKKSFTVNKNVDDLLK 121 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLR-LTIGTDIDT-AEVLEEMGAEHVPCPVDDIVV 180 AM+ + K + +C A ++ K + L + T D E+L E+ V C + V+ Sbjct: 122 AMYLSKKSIAGICGAVILIAKSLSQHVSDLKVATANDAYGELLSELNVNAVNCSAKECVI 181 Query: 181 DEDNKIVTTP 190 D + P Sbjct: 182 DRKKQSSNYP 191 >UniRef50_UPI000186E026 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186E026 Length = 256 Score = 112 bits (279), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 72/211 (34%), Positives = 105/211 (49%), Gaps = 9/211 (4%) Query: 7 ILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIE 66 +LSG G+ DG+EIHEA + +SR Q F+ Q DVI+H E RN L+E Sbjct: 39 VLSGSGMMDGTEIHEASACAVHLSRLDIQPKFFSVPCPQTDVIDHYKLSPTNEMRNALVE 98 Query: 67 AARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQ 126 +ARI RG+I + + E D LI PGGFG AK L+ F G+ C V+ E+ + Sbjct: 99 SARIARGKICSINSLTSDEADVLIFPGGFGVAKTLTTFDKDGANCGVNEEVVRVVNEFCA 158 Query: 127 AGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTA--------EVLEEMGAEHVPCPVDDI 178 KP+ F CIA + +IF +++T+G D + + +MGA VD Sbjct: 159 CRKPMAFTCIAAILPARIFP-GVKVTLGKKGDPKKWPHSEAIDTVSDMGAVVEVKNVDSF 217 Query: 179 VVDEDNKIVTTPAYMLAQNIAEAASGIDKLV 209 D+ + TTPA+M + GI ++ Sbjct: 218 TFDKQFLVFTTPAFMYEGTFYQIFEGIGNMI 248 >UniRef50_Q48464 Enhancing lycopene biosynthesis protein 2 homolog (Fragment) n=3 Tax=Enterobacteriaceae RepID=ELBB_KLEOX Length = 80 Score = 111 bits (278), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 56/67 (83%), Positives = 62/67 (92%), Gaps = 3/67 (4%) Query: 149 LRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKL 208 LRL D+DTA+ +EEMGAEHVPCPVDDIVVDEDNK+VTTPAYMLAQNIAEAASGI+KL Sbjct: 15 LRLC---DLDTADAVEEMGAEHVPCPVDDIVVDEDNKVVTTPAYMLAQNIAEAASGIEKL 71 Query: 209 VSRVLVL 215 V+RVLVL Sbjct: 72 VARVLVL 78 >UniRef50_C1C1F4 Enhancing lycopene biosynthesis protein 2 n=2 Tax=Caligus RepID=C1C1F4_9MAXI Length = 231 Score = 99.8 bits (247), Expect = 6e-20, Method: Compositional matrix adjust. Identities = 75/228 (32%), Positives = 113/228 (49%), Gaps = 15/228 (6%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 + V+LSGCG DGS+ E L LA+SR + + +AP +NH+ G RN Sbjct: 5 NVAVLLSGCGHLDGSDPLEVSLLCLALSRLDIKPIFYAPYMSMSTGVNHVNGAEAETGRN 64 Query: 63 VLIEAARITRGEIRPLAQADAAE--LDALIVPGGFGAAKNLSNF-ASLGSECTVDRELKA 119 VLIE+AR+ + + L + D ++ L ALI+PGG G N S+F SL + +V +E+ Sbjct: 65 VLIESARLVKESVLKLDELDPSDETLSALIIPGGHGPLNNFSDFKTSLETPPSVIKEILG 124 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFP-LRLTIGTDID--------TAEVLEEMGAEH 170 + + AGKP+G C + A + P + +T+G+ + A L E G Sbjct: 125 IIEGFKAAGKPIG--CTSHANILVALAIPNIEITLGSRDEEECPVASLVAPGLIEQGTTV 182 Query: 171 VPCPVDDIVVDEDNKIVTTPAYMLAQ-NIAEAASGIDKLVSRVLVLAE 217 P V ++ VD +NKIVT A + A E A I ++ L E Sbjct: 183 TPTSVYEVQVDFENKIVTAAASLFASAKYHEVADQITIFFDMLMTLIE 230 >UniRef50_UPI0000E47ADE PREDICTED: similar to KNP-Ia, partial n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47ADE Length = 71 Score = 79.3 bits (194), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 36/71 (50%), Positives = 52/71 (73%) Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL+E+ARI RG+I L+ + DA++ PGGFGAAKNLS+FA G+ CTV+ +++ + Sbjct: 1 NVLVESARIARGKITALSGLSSGNFDAVVFPGGFGAAKNLSDFAVNGAGCTVNPDVERVI 60 Query: 122 QAMHQAGKPLG 132 + HQA KP+G Sbjct: 61 KEFHQAKKPIG 71 >UniRef50_D2V705 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V705_NAEGR Length = 250 Score = 74.3 bits (181), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 46/134 (34%), Positives = 75/134 (55%), Gaps = 4/134 (2%) Query: 6 VILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTETRNVL 64 VILSGCG DGS++ E+V ++ ++R G F+P ++ + N++T + +E R + Sbjct: 18 VILSGCGFMDGSDVVESVSVIVELTRKGIVPRFFSPHEEIDESYNYITKQIDSSEERYMH 77 Query: 65 IEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLG---SECTVDRELKALA 121 E+ARI R +I + Q A + D L++PGG G +NLSNF +E V+ ++ Sbjct: 78 KESARIAREKILSIDQLRADQFDMLVIPGGNGVVRNLSNFEQEEYNVNEVEVNSHVEKAI 137 Query: 122 QAMHQAGKPLGFMC 135 + KP+GFM Sbjct: 138 VDFFKQKKPIGFMS 151 >UniRef50_A6NYE6 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NYE6_9BACE Length = 189 Score = 72.8 bits (177), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 65/214 (30%), Positives = 89/214 (41%), Gaps = 34/214 (15%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K V+L+GCG+ DGS I E VLT A+ + G A D V ++H+T E E R+ Sbjct: 2 KFLVLLAGCGLGDGSCIEEVVLTYTALDKYGCDYTPAAAD-MLVPSMDHIT-EQPGEKRS 59 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL E+AR RG IR L + DAL++PGG G N RE +A Sbjct: 60 VLTESARTGRGRIRNLHDISPDDYDALLIPGGIGLVVNY-------------RESGLVAD 106 Query: 123 AMH---QAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 ++ Q KP+G MC L I L D+D Sbjct: 107 WVNRFVQQKKPIGTMCAGIDFLRGILGAGLLREEVRDLDAVSFCR--------------- 151 Query: 180 VDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVL 213 D I TPA+ + + G+D +V +L Sbjct: 152 -DTSGAIFYTPAFRKTGSCHDVMLGVDAMVHAML 184 >UniRef50_Q29CB8 GA12322 n=5 Tax=Endopterygota RepID=Q29CB8_DROPS Length = 187 Score = 43.9 bits (102), Expect = 0.004, Method: Compositional matrix adjust. Identities = 39/145 (26%), Positives = 71/145 (48%), Gaps = 22/145 (15%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 LA+A + D +++PGG G + + + A++G L +A AG + +C A Sbjct: 57 LAKAACDKFDVVVLPGGLGGSNAMGDSAAVGD----------LLRAQESAGGLIAAICAA 106 Query: 138 PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT----PAYM 193 P +L K + G + + ++E + C VDD V +D ++T+ AY Sbjct: 107 PTVLAK-----HGIAAGKSLTSYPSMKEQLVDKY-CYVDDKSVVKDGNLITSRGPGTAYD 160 Query: 194 LAQNIAEAASGIDKL--VSRVLVLA 216 A IAE +G++K+ V++ L+L+ Sbjct: 161 FALKIAEELAGLEKVKEVAKGLLLS 185 >UniRef50_C8WTD3 ThiJ/PfpI domain protein n=5 Tax=Bacillales RepID=C8WTD3_ALIAD Length = 223 Score = 41.6 bits (96), Expect = 0.020, Method: Compositional matrix adjust. Identities = 42/147 (28%), Positives = 65/147 (44%), Gaps = 31/147 (21%) Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N EA I + I PL+Q A++ DA+ +PGG G + + A EL+AL Sbjct: 66 NQWPEAVEILKQTI-PLSQVSASDYDAIFLPGGHGTMFDFPDSA----------ELQALI 114 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIF---DFPL----RLTIGTDIDTAEV------------ 162 + ++GK + +C PA L + PL R+T TD + V Sbjct: 115 RTFAESGKVVAAVCHGPAGLVNVRLSNGDPLVKGKRVTAFTDEEERAVKLDDKVPFMLET 174 Query: 163 -LEEMGAEHVPCPVDDIVVDEDNKIVT 188 L E+GA+ V P+ V+ D ++T Sbjct: 175 RLRELGAQFVAQPMWSDHVERDGNLIT 201 >UniRef50_B0SBM0 Transcription regulator, DJ-1/PfpI family intracellular protease n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SBM0_LEPBA Length = 188 Score = 41.2 bits (95), Expect = 0.025, Method: Compositional matrix adjust. Identities = 28/91 (30%), Positives = 42/91 (46%), Gaps = 17/91 (18%) Query: 53 TGEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECT 112 T E + +RN I + T EI + E DA+++PGG KNL Sbjct: 38 TKEPVVASRNT-IHISDTTFSEI------NVDEFDAIVLPGGMNGTKNL----------M 80 Query: 113 VDRELKALAQAMHQAGKPLGFMCIAPAMLPK 143 D E++ + H + K +G +C APA+L K Sbjct: 81 ADTEIQKILSIFHSSKKHIGAICAAPAVLRK 111 >UniRef50_A4X5M7 ThiJ/PfpI domain protein n=4 Tax=Actinomycetales RepID=A4X5M7_SALTO Length = 231 Score = 40.0 bits (92), Expect = 0.049, Method: Compositional matrix adjust. Identities = 44/157 (28%), Positives = 67/157 (42%), Gaps = 28/157 (17%) Query: 1 MKKIGVILSGCGVYD---------GSEIHEAVLTLLAISRSGAQAVCFAPDK--QQVD-- 47 M KI +++G ++ G E V+ ++ +G + V PD +VD Sbjct: 1 MSKILFVVTGADHWELADGTRHPTGVWAEEIVVPHEMLTSAGHEVVIATPDGVVPRVDRG 60 Query: 48 -VINHLTGEAMTETRNVLIEAARITRGEIRP--LAQADAAELDALIVPGGFGAAKNLSNF 104 ++ TG R + EA G +P LA+ D E A+ PGG G ++L+ Sbjct: 61 SLLPEFTGGPAGAVR--MTEAVEGLEGLRKPIRLAEVDLDEYQAVFYPGGHGPMEDLA-- 116 Query: 105 ASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML 141 VD++ L A QA KPLG +C PA L Sbjct: 117 --------VDQDSGRLLVAAQQAKKPLGIVCHGPAAL 145 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0ABU5 Enhancing lycopene biosynthesis protein 2 n=267 ... 319 5e-86 UniRef50_B4EXM9 Enhancing lycopene biosynthesis protein 2 n=2 Ta... 313 2e-84 UniRef50_B6XFP3 Putative uncharacterized protein n=1 Tax=Provide... 311 7e-84 UniRef50_P30042 ES1 protein homolog, mitochondrial n=45 Tax=Meta... 310 2e-83 UniRef50_C9J1C8 Putative uncharacterized protein C21orf33 (Fragm... 302 6e-81 UniRef50_B9EL82 ES1 protein homolog, mitochondrial n=11 Tax=Eume... 301 1e-80 UniRef50_Q6APY5 Putative uncharacterized protein n=1 Tax=Desulfo... 299 4e-80 UniRef50_A3M9Q1 Enhancing lycopene biosynthesis protein 2 n=5 Ta... 292 6e-78 UniRef50_Q3AU80 Es1 family protein n=30 Tax=Bacteria RepID=Q3AU8... 288 7e-77 UniRef50_Q6MQ93 Enhancing lycopene biosynthesis protein 2 n=3 Ta... 280 2e-74 UniRef50_Q90257 ES1 protein, mitochondrial n=3 Tax=Danio rerio R... 279 5e-74 UniRef50_C5LKS2 Putative uncharacterized protein n=4 Tax=Eukaryo... 278 1e-73 UniRef50_A9V3H5 Predicted protein n=5 Tax=Fungi/Metazoa group Re... 272 4e-72 UniRef50_D2W324 Glutamine amidotransferase domain-containing pro... 272 7e-72 UniRef50_Q2RPB9 ThiJ/PfpI n=7 Tax=Bacteria RepID=Q2RPB9_RHORT 266 5e-70 UniRef50_Q2GLV6 Es1 family protein n=5 Tax=Anaplasma RepID=Q2GLV... 263 3e-69 UniRef50_Q21UI1 ThiJ/PfpI n=1 Tax=Rhodoferax ferrireducens T118 ... 262 6e-69 UniRef50_C0R4T2 Enhancing lycopene biosynthesis protein 2, putat... 255 7e-67 UniRef50_UPI0000DB6CF4 PREDICTED: similar to es1 protein n=2 Tax... 249 7e-65 UniRef50_P30042-2 Isoform Short of ES1 protein homolog, mitochon... 247 2e-64 UniRef50_A4IY89 DJ-1/PfpI family protein n=24 Tax=Gammaproteobac... 243 3e-63 UniRef50_UPI000186E026 conserved hypothetical protein n=1 Tax=Pe... 242 4e-63 UniRef50_Q2NWH9 Sigma cross-reacting protein 27A (SCRP-27A) n=1 ... 242 5e-63 UniRef50_B9Z0T4 ThiJ/PfpI domain protein n=1 Tax=Lutiella nitrof... 241 1e-62 UniRef50_C5CG98 ThiJ/PfpI domain protein n=1 Tax=Kosmotoga olear... 224 2e-57 UniRef50_C1C1F4 Enhancing lycopene biosynthesis protein 2 n=2 Ta... 205 1e-51 UniRef50_A6NYE6 Putative uncharacterized protein n=1 Tax=Bactero... 196 5e-49 UniRef50_Q8WQI1 Lycopene biosynthesis-enhancing protein n=2 Tax=... 195 7e-49 UniRef50_D2V705 Predicted protein n=1 Tax=Naegleria gruberi RepI... 165 6e-40 UniRef50_UPI0000E47ADE PREDICTED: similar to KNP-Ia, partial n=2... 119 6e-26 UniRef50_Q48464 Enhancing lycopene biosynthesis protein 2 homolo... 85 1e-15 Sequences not found previously or not previously below threshold: UniRef50_C5DA95 Intracellular protease, PfpI family n=11 Tax=cel... 68 3e-10 UniRef50_A1K1D1 ThiJ/PfpI family protein n=17 Tax=cellular organ... 62 1e-08 UniRef50_B1KEU0 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria ... 61 2e-08 UniRef50_C9RGD3 Intracellular protease, PfpI family n=1 Tax=Meth... 61 4e-08 UniRef50_Q7NER9 Glr3809 protein n=1 Tax=Gloeobacter violaceus Re... 60 6e-08 UniRef50_C8WTD3 ThiJ/PfpI domain protein n=5 Tax=Bacillales RepI... 59 1e-07 UniRef50_B8DQK9 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=... 58 3e-07 UniRef50_B9TI86 Protease C56, putative n=1 Tax=Ricinus communis ... 58 3e-07 UniRef50_B0SBM0 Transcription regulator, DJ-1/PfpI family intrac... 58 3e-07 UniRef50_Q0ULG5 Putative uncharacterized protein n=2 Tax=Pleospo... 57 4e-07 UniRef50_Q5ZU31 Intracellular protease, ThiJ/PfpI family n=4 Tax... 57 5e-07 UniRef50_A1VNH7 Intracellular protease, PfpI family n=12 Tax=Pro... 56 6e-07 UniRef50_Q5HPG8 ThiJ/PfpI family protein n=9 Tax=Staphylococcus ... 56 6e-07 UniRef50_UPI0001B4C882 ThiJ/PfpI domain-containing protein n=1 T... 56 8e-07 UniRef50_A1TMJ4 Intracellular protease, PfpI family n=11 Tax=Bac... 56 1e-06 UniRef50_D1CDU1 Intracellular protease, PfpI family n=40 Tax=Bac... 55 1e-06 UniRef50_C6CWE5 Intracellular protease, PfpI family n=6 Tax=Bact... 55 1e-06 UniRef50_Q12ZS1 Intracellular protease 1 n=3 Tax=Methanosarcinac... 55 2e-06 UniRef50_A4X5M7 ThiJ/PfpI domain protein n=4 Tax=Actinomycetales... 54 3e-06 UniRef50_Q1QTA8 Peptidase C56, PfpI n=27 Tax=Bacteria RepID=Q1QT... 53 5e-06 UniRef50_B2UQP3 DJ-1 family protein n=1 Tax=Akkermansia muciniph... 53 6e-06 UniRef50_A7GXC1 DJ-1 family protein n=6 Tax=Campylobacter RepID=... 53 7e-06 UniRef50_Q313C6 Peptidase C56, PfpI n=12 Tax=cellular organisms ... 52 2e-05 UniRef50_Q464Y3 Putative intracellular protease n=2 Tax=cellular... 51 2e-05 UniRef50_Q1DD54 Peptidase, C56 (PfpI) family n=2 Tax=Cystobacter... 51 2e-05 UniRef50_A5WBS8 ThiJ/PfpI domain protein n=2 Tax=Psychrobacter R... 51 2e-05 UniRef50_B9XL12 Intracellular protease, PfpI family n=1 Tax=bact... 51 2e-05 UniRef50_B2JT66 ThiJ/PfpI domain protein n=4 Tax=Burkholderiales... 51 2e-05 UniRef50_A3IN92 ThiJ/PfpI n=2 Tax=Cyanothece RepID=A3IN92_9CHRO 51 3e-05 UniRef50_D1CA42 Intracellular protease, PfpI family n=4 Tax=Bact... 51 3e-05 UniRef50_Q0JPK7 Os01g0217800 protein n=16 Tax=Magnoliophyta RepI... 51 3e-05 UniRef50_Q27SQ0 Protease/amidase (Fragment) n=1 Tax=Pavlova luth... 50 4e-05 UniRef50_Q0W5Q2 Intracellular protease (C56 family) n=3 Tax=cell... 50 6e-05 UniRef50_D1YEB9 Intracellular protease, PfpI family n=4 Tax=Acti... 50 6e-05 UniRef50_A9HQ45 Putative transcriptional regulator n=1 Tax=Gluco... 50 7e-05 UniRef50_Q26CT8 Intracellular protease, PfpI family n=3 Tax=Bact... 50 8e-05 UniRef50_B0N5W0 Putative uncharacterized protein n=2 Tax=Bacteri... 49 9e-05 UniRef50_Q04P14 ThiJ/PfpI family protein n=5 Tax=Bacteria RepID=... 49 9e-05 UniRef50_C5CQ44 Intracellular protease, PfpI family n=11 Tax=Bac... 49 1e-04 UniRef50_B2IIZ8 ThiJ/PfpI domain protein n=1 Tax=Beijerinckia in... 49 1e-04 UniRef50_A4FDV5 Protease I n=8 Tax=Bacteria RepID=A4FDV5_SACEN 49 1e-04 UniRef50_B8DF54 Intracellular protease 1 (Intracellular protease... 49 1e-04 UniRef50_A8RCF9 Putative uncharacterized protein n=2 Tax=Firmicu... 49 1e-04 UniRef50_Q0B5J2 ThiJ/PfpI domain protein n=18 Tax=Proteobacteria... 48 2e-04 UniRef50_D2S4J1 Intracellular protease, PfpI family n=2 Tax=Fran... 48 2e-04 UniRef50_Q2NCQ5 Protease n=5 Tax=Proteobacteria RepID=Q2NCQ5_ERYLH 48 2e-04 UniRef50_B3DRT7 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 48 2e-04 UniRef50_C3MPE6 Intracellular protease, PfpI family n=11 Tax=Sul... 48 2e-04 UniRef50_Q7MQ54 Putative uncharacterized protein VV0154 n=1 Tax=... 48 2e-04 UniRef50_A8P0K7 Putative uncharacterized protein n=1 Tax=Coprino... 48 2e-04 UniRef50_B5HD84 Protease I n=1 Tax=Streptomyces pristinaespirali... 48 2e-04 UniRef50_Q58377 Uncharacterized protein MJ0967 n=8 Tax=Euryarcha... 48 2e-04 UniRef50_Q4KCJ7 ThiJ/PfpI family protein, putative n=2 Tax=Pseud... 48 2e-04 UniRef50_C2BZS9 C56 family peptidase n=1 Tax=Listeria grayi DSM ... 48 2e-04 UniRef50_Q0SRB0 DJ-1 family protein n=35 Tax=Bacteria RepID=Q0SR... 48 3e-04 UniRef50_C5VG63 DJ-1 family protein n=6 Tax=Prevotella RepID=C5V... 48 3e-04 UniRef50_D2W0D5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 48 3e-04 UniRef50_UPI00017898AF intracellular protease, PfpI family n=1 T... 48 3e-04 UniRef50_A7HTR2 ThiJ/PfpI domain protein n=1 Tax=Parvibaculum la... 48 3e-04 UniRef50_C6RJA7 Intracellular protease, PfpI family n=3 Tax=Acin... 47 4e-04 UniRef50_B5Y9N6 Intracellular protease 1 (Intracellular protease... 47 4e-04 UniRef50_UPI0001979AA3 4-methyl-5(beta-hydroxyethyl)-thiazole mo... 47 5e-04 UniRef50_B8KZH8 Intracellular proteinase PfpI n=1 Tax=Stenotroph... 47 5e-04 UniRef50_A6Q7R5 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosp... 47 5e-04 UniRef50_B4U9D2 DJ-1 family protein n=1 Tax=Hydrogenobaculum sp.... 46 6e-04 UniRef50_A4XRE4 ThiJ/PfpI domain protein n=3 Tax=Gammaproteobact... 46 6e-04 UniRef50_C7N926 DJ-1 family protein n=3 Tax=Leptotrichia RepID=C... 46 6e-04 UniRef50_B8FFC8 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria ... 46 6e-04 UniRef50_Q7M905 MONOPHOSPHATE SYNTHESISPROTEIN n=1 Tax=Wolinella... 46 7e-04 UniRef50_B1YM90 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium... 46 7e-04 UniRef50_Q5FQ93 Protease I n=1 Tax=Gluconobacter oxydans RepID=Q... 46 7e-04 UniRef50_Q21F36 ThiJ/PfpI n=1 Tax=Saccharophagus degradans 2-40 ... 46 0.001 UniRef50_C0WIV6 C56 family peptidase n=8 Tax=Actinomycetales Rep... 46 0.001 UniRef50_C4G3X0 Putative uncharacterized protein n=1 Tax=Abiotro... 46 0.001 UniRef50_P80876 General stress protein 18 n=22 Tax=Bacteria RepI... 46 0.001 UniRef50_C7PRI4 ThiJ/PfpI domain protein n=1 Tax=Chitinophaga pi... 46 0.001 UniRef50_C1SLG0 DJ-1 family protein n=1 Tax=Denitrovibrio acetip... 46 0.001 UniRef50_D0SXB4 Intracellular protease n=2 Tax=Acinetobacter Rep... 46 0.001 UniRef50_D2RSY5 Intracellular protease, PfpI family n=1 Tax=Halo... 46 0.001 UniRef50_C3M432 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 46 0.001 UniRef50_B1K8Q4 ThiJ/PfpI domain protein n=220 Tax=cellular orga... 46 0.001 UniRef50_UPI0001AEB94E ThiJ/PfpI domain-containing protein n=1 T... 46 0.001 UniRef50_Q045Z0 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 46 0.001 UniRef50_A4WJ08 ThiJ/PfpI domain protein n=1 Tax=Pyrobaculum ars... 45 0.001 UniRef50_A1AQV7 Metal dependent phosphohydrolase n=3 Tax=Bacteri... 45 0.001 UniRef50_Q9ZV19 ProteaseI (PfpI)-like protein n=4 Tax=Arabidopsi... 45 0.002 UniRef50_A8QA68 Putative uncharacterized protein n=1 Tax=Malasse... 45 0.002 UniRef50_D0STJ8 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 45 0.002 UniRef50_A1ZY79 ThiJ/PfpI n=3 Tax=Bacteria RepID=A1ZY79_9SPHI 45 0.002 UniRef50_Q6F1K3 Putative intracellular protease/amidase n=1 Tax=... 45 0.002 UniRef50_C6PQH3 ThiJ/PfpI domain protein n=11 Tax=Bacteria RepID... 45 0.002 UniRef50_C5NVZ3 DJ-1 family protein n=1 Tax=Gemella haemolysans ... 45 0.002 UniRef50_C4L0C2 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium... 45 0.002 UniRef50_C5A5Q9 Peptidase C56, intracellular protease PfpI famil... 45 0.002 UniRef50_B3T0X6 Putative DJ-1/PfpI family protein n=1 Tax=uncult... 45 0.003 UniRef50_B0UB01 ThiJ/PfpI domain protein n=2 Tax=Alphaproteobact... 45 0.003 UniRef50_D1YUX4 Putative uncharacterized protein n=1 Tax=Methano... 45 0.003 UniRef50_Q13FG4 Peptidase C56, PfpI n=1 Tax=Burkholderia xenovor... 45 0.003 UniRef50_Q24FT7 DJ-1/PfpI family protein n=1 Tax=Tetrahymena the... 45 0.003 UniRef50_A9F9P4 Putative uncharacterized protein n=1 Tax=Sorangi... 44 0.003 UniRef50_B1YMA0 Intracellular protease, PfpI family n=14 Tax=Bac... 44 0.003 UniRef50_C0W6P5 Possible transcriptional regulator n=1 Tax=Actin... 44 0.003 UniRef50_B1GZW9 Putative intracellular protease n=1 Tax=uncultur... 44 0.003 UniRef50_Q0SMN5 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 44 0.004 UniRef50_A1VCT8 Intracellular protease, PfpI family n=14 Tax=Bac... 44 0.004 UniRef50_C8V2F0 ThiJ/PfpI family protein (AFU_orthologue; AFUA_3... 44 0.004 UniRef50_Q15SH5 ThiJ/PfpI n=14 Tax=Gammaproteobacteria RepID=Q15... 44 0.004 UniRef50_A4XSU5 Intracellular protease, PfpI family n=11 Tax=cel... 44 0.004 UniRef50_Q2NIP2 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 44 0.004 UniRef50_D1BGR7 Intracellular protease, PfpI family n=14 Tax=Act... 44 0.005 UniRef50_C0SMV9 Putative uncharacterized protein n=1 Tax=Strepto... 43 0.005 UniRef50_B9L1B0 Protease I n=2 Tax=Thermomicrobia (class) RepID=... 43 0.005 UniRef50_C3WFG1 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 43 0.005 UniRef50_Q54MG7 Protein DJ-1 n=21 Tax=cellular organisms RepID=P... 43 0.005 UniRef50_A0Q9X2 Intracellular protease, PfpI family protein n=4 ... 43 0.005 UniRef50_Q5JGM7 Intracellular protease 1 n=22 Tax=cellular organ... 43 0.005 UniRef50_C2BGT7 Possible transcriptional regulator n=2 Tax=Anaer... 43 0.006 UniRef50_Q1PX19 Similar to intracellular proteinase I n=1 Tax=Ca... 43 0.006 UniRef50_Q8LGH3 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 43 0.006 UniRef50_B5JP08 DJ-1 family protein n=1 Tax=Verrucomicrobiae bac... 43 0.007 UniRef50_A7HFM3 ThiJ/PfpI domain protein n=1 Tax=Anaeromyxobacte... 43 0.007 UniRef50_A0K1V5 ThiJ/PfpI domain protein n=6 Tax=Actinomycetales... 43 0.007 UniRef50_Q2FT73 ThiJ/PfpI n=1 Tax=Methanospirillum hungatei JF-1... 43 0.008 UniRef50_Q29CB8 GA12322 n=5 Tax=Endopterygota RepID=Q29CB8_DROPS 43 0.008 UniRef50_A3CWP5 Intracellular protease, PfpI family n=1 Tax=Meth... 43 0.009 UniRef50_A8I9F1 Type 1 glutamine amidotransferase n=3 Tax=Alphap... 43 0.009 UniRef50_B9KCD0 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosp... 43 0.010 UniRef50_Q9V1F8 Intracellular protease 1 n=6 Tax=cellular organi... 43 0.010 UniRef50_B1YY58 ThiJ/PfpI domain protein n=10 Tax=Proteobacteria... 43 0.010 UniRef50_O06006 Putative cysteine protease yraA n=81 Tax=cellula... 42 0.011 UniRef50_C2AV24 Putative intracellular protease/amidase n=1 Tax=... 42 0.011 UniRef50_Q8RHW4 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 42 0.011 UniRef50_Q72HB0 Putative amidotransferase n=2 Tax=Thermus RepID=... 42 0.011 UniRef50_Q488G4 DJ-1/PfpI family protein n=14 Tax=Bacteria RepID... 42 0.011 UniRef50_C8PUR4 Intracellular protease 1 n=1 Tax=Enhydrobacter a... 42 0.011 UniRef50_Q4V0N9 NonF-related protein n=5 Tax=cellular organisms ... 42 0.012 UniRef50_Q0C397 Intracellular protease, PfpI family n=6 Tax=Alph... 42 0.012 UniRef50_Q5KKT1 Putative uncharacterized protein n=3 Tax=Agarico... 42 0.012 UniRef50_O28987 Uncharacterized protein AF_1281 n=11 Tax=cellula... 42 0.014 UniRef50_C7PM17 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=... 42 0.015 UniRef50_A5TUD8 Possible transcriptional regulator n=10 Tax=Fuso... 42 0.015 UniRef50_B8I402 ThiJ/PfpI domain protein n=1 Tax=Clostridium cel... 42 0.015 UniRef50_C7Z460 Putative uncharacterized protein n=1 Tax=Nectria... 42 0.017 UniRef50_B1ZX06 ThiJ/PfpI domain protein n=1 Tax=Opitutus terrae... 42 0.018 UniRef50_Q9M8R4 F13E7.34 protein n=273 Tax=cellular organisms Re... 41 0.018 UniRef50_B0NJL3 Putative uncharacterized protein n=1 Tax=Clostri... 41 0.018 UniRef50_C5AII4 Intracellular protease, PfpI family protein n=1 ... 41 0.018 UniRef50_C5RPJ4 DJ-1 family protein n=1 Tax=Clostridium cellulov... 41 0.022 UniRef50_C5PN70 C56 family peptidase n=2 Tax=Sphingobacterium sp... 41 0.022 UniRef50_P45470 Protein yhbO n=95 Tax=Bacteria RepID=YHBO_ECOLI 41 0.022 UniRef50_D0BJF4 DJ-1 family protein n=1 Tax=Granulicatella elega... 41 0.022 UniRef50_Q7MWH8 ThiJ/PfpI family protein n=2 Tax=Porphyromonas g... 41 0.023 UniRef50_D1PPH9 ThiJ/PfpI family protein n=1 Tax=Subdoligranulum... 41 0.025 UniRef50_C7MB29 Intracellular protease, PfpI family n=1 Tax=Brac... 41 0.025 UniRef50_C9KP75 ThiJ/PfpI family protein n=1 Tax=Mitsuokella mul... 41 0.028 UniRef50_B7AW87 Putative uncharacterized protein n=2 Tax=Bacteri... 41 0.029 UniRef50_B0N2E2 Putative uncharacterized protein n=2 Tax=Bacteri... 41 0.029 UniRef50_A6CIJ2 Predicted intracellular protease/amidase, ThiJ/P... 41 0.029 UniRef50_B0SGM8 ThiJ/PfpI family intracellular protease n=2 Tax=... 41 0.031 UniRef50_Q11NL5 Putative uncharacterized protein n=1 Tax=Cytopha... 41 0.032 UniRef50_A4XSL6 CheA signal transduction histidine kinase n=1 Ta... 41 0.036 UniRef50_C5F0X7 Putative uncharacterized protein n=2 Tax=Helicob... 40 0.046 UniRef50_D2RN75 ThiJ/PfpI domain protein n=2 Tax=Veillonellaceae... 40 0.046 UniRef50_C0WAN6 Glutamine amidotransferase n=1 Tax=Acidaminococc... 40 0.048 UniRef50_Q051B9 Transcription regulator, DJ-1/PfpI family intrac... 40 0.050 UniRef50_A7NLT5 ThiJ/PfpI domain protein n=7 Tax=Bacteria RepID=... 40 0.051 UniRef50_D1VVY7 DJ-1 family protein n=1 Tax=Peptoniphilus lacrim... 40 0.051 UniRef50_B2B3G0 Predicted CDS Pa_6_120 n=5 Tax=Dikarya RepID=B2B... 40 0.051 UniRef50_Q10356 Uncharacterized protein C22E12.03c n=2 Tax=Schiz... 40 0.051 UniRef50_C7NZR3 ThiJ/PfpI domain protein n=10 Tax=Halobacteriace... 40 0.056 UniRef50_A4WTY0 Intracellular protease, PfpI family n=7 Tax=Bact... 40 0.064 UniRef50_C8W7V5 DJ-1 family protein n=1 Tax=Atopobium parvulum D... 40 0.064 UniRef50_C4ZGF1 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosp... 40 0.069 UniRef50_D1QMP0 ThiJ/PfpI family protein n=3 Tax=Prevotella RepI... 40 0.069 UniRef50_D1AAI5 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=... 40 0.070 UniRef50_Q9RX24 Putative uncharacterized protein n=1 Tax=Deinoco... 40 0.082 UniRef50_Q8PSJ5 Protease I n=1 Tax=Methanosarcina mazei RepID=Q8... 39 0.090 UniRef50_C3XNX9 Putative uncharacterized protein n=1 Tax=Helicob... 39 0.091 UniRef50_A5FSI4 DJ-1 family protein n=5 Tax=Dehalococcoides RepI... 39 0.094 >UniRef50_P0ABU5 Enhancing lycopene biosynthesis protein 2 n=267 Tax=Bacteria RepID=ELBB_ECOLI Length = 217 Score = 319 bits (817), Expect = 5e-86, Method: Composition-based stats. Identities = 217/217 (100%), Positives = 217/217 (100%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET Sbjct: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL Sbjct: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV Sbjct: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE Sbjct: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 >UniRef50_B4EXM9 Enhancing lycopene biosynthesis protein 2 n=2 Tax=Proteus mirabilis RepID=B4EXM9_PROMH Length = 216 Score = 313 bits (803), Expect = 2e-84, Method: Composition-based stats. Identities = 126/216 (58%), Positives = 168/216 (77%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK I VILSGCGV+DGSEIHE+VLT+LA+S++ A+ F+P+ Q VINH+TGE E Sbjct: 1 MKSIAVILSGCGVFDGSEIHESVLTMLALSQNKAEVHYFSPNDFQPTVINHITGEEKAEK 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RN++ EA+RI+RG+I PL+ A A DA+I+PGGFGAAKNL NFA+ G +C ++++L Sbjct: 61 RNMMEEASRISRGKISPLSDAKAENFDAVIIPGGFGAAKNLCNFATKGVQCEINQQLLTF 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q MHQ KPLG MCIAP MLPK+ + P++LTIG D TAE++ EMG H+ CPVD+IVV Sbjct: 121 VQKMHQQKKPLGLMCIAPVMLPKMLNAPVKLTIGNDAKTAEMITEMGGIHINCPVDEIVV 180 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 D++ ++VTTPAYMLA++IA+A GI+KLV +VL +A Sbjct: 181 DDEYRVVTTPAYMLAESIAQAQVGIEKLVKKVLEMA 216 >UniRef50_B6XFP3 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XFP3_9ENTR Length = 232 Score = 311 bits (798), Expect = 7e-84, Method: Composition-based stats. Identities = 134/216 (62%), Positives = 171/216 (79%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK I VILSGCGV+DGSEIHE+VLT+LA+S++ A+ FAPD+ Q VINH+ GE TET Sbjct: 17 MKSIAVILSGCGVFDGSEIHESVLTMLALSKNNAEVHFFAPDEDQATVINHINGELKTET 76 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RN + E+ARI+RG+I PL+ D ++LDALI+PGGFG AKNL NFA+ GSEC ++++L +L Sbjct: 77 RNQMEESARISRGKIAPLSSVDPSKLDALIIPGGFGVAKNLCNFATKGSECEINKQLLSL 136 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q MHQ KPLG MCIAP MLPK+ + ++LTIG D +T +E+MG HV C VD+IVV Sbjct: 137 VQVMHQQKKPLGLMCIAPVMLPKMLNTSVKLTIGNDTETIAQIEKMGGLHVECTVDNIVV 196 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 DE+NK+VTTPAYMLAQ+IAEA GI+KLV +VL +A Sbjct: 197 DENNKVVTTPAYMLAQSIAEANVGINKLVEKVLEMA 232 >UniRef50_P30042 ES1 protein homolog, mitochondrial n=45 Tax=Metazoa RepID=ES1_HUMAN Length = 268 Score = 310 bits (795), Expect = 2e-83, Method: Composition-based stats. Identities = 101/224 (45%), Positives = 141/224 (62%), Gaps = 11/224 (4%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TET 60 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ E+ Sbjct: 43 ARVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES 102 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL E+ARI RG+I LA AA DA I PGGFGAAKNLS FA G +C V++E++ + Sbjct: 103 RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLSTFAVDGKDCKVNKEVERV 162 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID---------TAEVLEEMGAEHV 171 + HQAGKP+G CIAP + K+ + +T+G + + TAE ++ +GA+H Sbjct: 163 LKEFHQAGKPIGLCCIAPVLAAKVLR-GVEVTVGHEQEEGGKWPYAGTAEAIKALGAKHC 221 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVL 215 V + VD+ NK+VTTPA+M + GI +V +VL L Sbjct: 222 VKEVVEAHVDQKNKVVTTPAFMCETALHYIHDGIGAMVRKVLEL 265 >UniRef50_C9J1C8 Putative uncharacterized protein C21orf33 (Fragment) n=1 Tax=Homo sapiens RepID=C9J1C8_HUMAN Length = 257 Score = 302 bits (773), Expect = 6e-81, Method: Composition-based stats. Identities = 101/251 (40%), Positives = 141/251 (56%), Gaps = 38/251 (15%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TET 60 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ E+ Sbjct: 5 ARVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES 64 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNL------------------- 101 RNVL E+ARI RG+I LA AA DA I PGGFGAAKNL Sbjct: 65 RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLCVFELQGLPLSMWSRWEGG 124 Query: 102 --------SNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTI 153 S FA G +C V++E++ + + HQAGKP+G CIAP + K+ + +T+ Sbjct: 125 APVCCPMWSTFAVDGKDCKVNKEVERVLKEFHQAGKPIGLCCIAPVLAAKVLR-GVEVTV 183 Query: 154 GTDID---------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASG 204 G + + TAE ++ +GA+H V + VD+ NK+VTTPA+M + G Sbjct: 184 GHEQEEGGKWPYAGTAEAIKALGAKHCVKEVVEAHVDQKNKVVTTPAFMCETALHYIHDG 243 Query: 205 IDKLVSRVLVL 215 I +V +VL L Sbjct: 244 IGAMVRKVLEL 254 >UniRef50_B9EL82 ES1 protein homolog, mitochondrial n=11 Tax=Eumetazoa RepID=B9EL82_SALSA Length = 259 Score = 301 bits (771), Expect = 1e-80, Method: Composition-based stats. Identities = 93/225 (41%), Positives = 139/225 (61%), Gaps = 11/225 (4%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTET 60 K+ V+LSGCGVYDG+EIHEA L+ +SR GA+ +APD Q+ VI+H G E+ Sbjct: 34 AKVAVVLSGCGVYDGTEIHEASAILVHLSRGGAEVQMYAPDVSQMHVIDHGKGVPAENES 93 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL E+ARI RG I L + + DA+I PGGFGAAKNLS FA G +C ++ +++ + Sbjct: 94 RNVLSESARIARGNITDLVKLSVSNHDAIIFPGGFGAAKNLSTFAVDGPDCKINADVERV 153 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID---------TAEVLEEMGAEHV 171 + H+AGKP+G CI+P + K+ + +T+G + + TA ++ +GA+H+ Sbjct: 154 LKDFHKAGKPIGLCCISPVLAAKLLP-GVEVTVGHEEEKGGKWPYAGTAGAIKALGAKHI 212 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 V + VD+ NK+VT+PA+M + GI +V+ VL L+ Sbjct: 213 VKEVTEAHVDQKNKVVTSPAFMCETQLHLIFDGIGAMVTNVLKLS 257 >UniRef50_Q6APY5 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6APY5_DESPS Length = 218 Score = 299 bits (766), Expect = 4e-80, Method: Composition-based stats. Identities = 103/215 (47%), Positives = 138/215 (64%), Gaps = 1/215 (0%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 KI VILSGCG DGSEIHEA ++L AI G CFAPD Q+ VINHL GE E+RN Sbjct: 5 KIAVILSGCGHLDGSEIHEATMSLWAIHSHGCDYHCFAPDIDQLHVINHLNGEETGESRN 64 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL+E+ARI RG+I L Q A + DALI+PGGFGAAKNLS++ S G C V+ E+K Sbjct: 65 VLVESARIARGKISDLNQFKAEDYDALIIPGGFGAAKNLSDYFSAGVNCQVNPEVKKAII 124 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 MHQA KP+G +CIAP +L ++ + +T+G D + + E+MGA IV+D Sbjct: 125 DMHQAKKPIGALCIAPMLLARLIS-GVEITLGQDPISHQNAEKMGASTQTTDHGQIVIDR 183 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 N +V+TP YML + + +G D L++ ++ + E Sbjct: 184 KNLVVSTPCYMLDARVDQIGAGADALMTEIVEMME 218 >UniRef50_A3M9Q1 Enhancing lycopene biosynthesis protein 2 n=5 Tax=Acinetobacter RepID=A3M9Q1_ACIBT Length = 220 Score = 292 bits (747), Expect = 6e-78, Method: Composition-based stats. Identities = 105/220 (47%), Positives = 142/220 (64%), Gaps = 3/220 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTE 59 MKK+ VILSGCG DGSEI E+VLTLLA+ + FAPD+ VI+H++GE MTE Sbjct: 1 MKKVAVILSGCGYLDGSEIRESVLTLLALDTVNIEYQIFAPDEPLFHVIDHVSGEINMTE 60 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 RN+L EA RI RG+I L Q + E D LI+PGGFG AKNLS FA G+E V + + Sbjct: 61 RRNILQEAGRIARGKISSLDQLNENEFDGLILPGGFGVAKNLSTFAFKGAEARVHGTVAS 120 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDF-PLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + +A HQ+ KP+G +CI+PA+L F +T+G+D++ A+ +E+ G+ H C D Sbjct: 121 ILKAFHQSKKPIGAICISPALLALTFGELHPTITLGSDLNIAKEIEKTGSIHHVCQTSDC 180 Query: 179 VVDEDNKIVTTPAYMLAQ-NIAEAASGIDKLVSRVLVLAE 217 VVD+ N VTTPAYM Q N+ + +GI LV+ + LA Sbjct: 181 VVDKQNLFVTTPAYMDDQANLKDIYTGITSLVNTMTALAN 220 >UniRef50_Q3AU80 Es1 family protein n=30 Tax=Bacteria RepID=Q3AU80_CHLCH Length = 225 Score = 288 bits (738), Expect = 7e-77, Method: Composition-based stats. Identities = 113/217 (52%), Positives = 153/217 (70%), Gaps = 2/217 (0%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TETR 61 +IGV+L+GCG DGSEIHEAVLTLLAIS+ GAQA+C APD Q V+NHLTG+ + E+R Sbjct: 8 RIGVLLAGCGYLDGSEIHEAVLTLLAISKKGAQAICLAPDMVQHHVVNHLTGQEVIGESR 67 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL+EAARI RG I L+ + LDA IVPGG+GAAKNLS+FA G+ CT+ ++ Sbjct: 68 NVLVEAARIARGAIHNLSDIASLHLDAFIVPGGYGAAKNLSSFAFDGTPCTIHPDVATAI 127 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDF-PLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q ++AGKP+GF+CI+P + K+ + +TIG D TA +E MGA H+ C V V Sbjct: 128 QLFYKAGKPMGFICISPVLAAKVLGSEKIEVTIGNDASTAASIEAMGARHINCVVTKAHV 187 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 + + IV+TPAYML ++A+ A+GI++LV V+ L + Sbjct: 188 SKPHNIVSTPAYMLEASLADIATGIEQLVGNVVELVK 224 >UniRef50_Q6MQ93 Enhancing lycopene biosynthesis protein 2 n=3 Tax=Bacteria RepID=Q6MQ93_BDEBA Length = 217 Score = 280 bits (716), Expect = 2e-74, Method: Composition-based stats. Identities = 100/218 (45%), Positives = 142/218 (65%), Gaps = 3/218 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKI V+LSGCG DGSEI E+V L+ + ++GA+ CFAPD Q+ + NH+ GEA E Sbjct: 1 MKKIAVVLSGCGHRDGSEITESVSLLIGLHQAGAEVHCFAPDI-QIPITNHINGEAQGEK 59 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R++L EAARI RG I+ L + A + DA++ PGG+GAAKNLSN+A G++C V+ ++K + Sbjct: 60 RSLLTEAARIARGHIQSLDKLHAKDFDAVVFPGGYGAAKNLSNWAEKGAQCEVNPDVKRV 119 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDF-PLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 H A KP+G +CIAP ++ K+ + +TIG D TA +E+ GA H CPV+D + Sbjct: 120 ILEFHSASKPIGALCIAPVLVAKVLGDKKVTVTIGDDAATAAEIEKTGAIHEECPVNDYI 179 Query: 180 VDEDNKIVTTPAYML-AQNIAEAASGIDKLVSRVLVLA 216 D ++K+VTTPAYM E +GI L ++ A Sbjct: 180 TDRESKVVTTPAYMYGDAKPNEVFAGIFGLAHEIVEWA 217 >UniRef50_Q90257 ES1 protein, mitochondrial n=3 Tax=Danio rerio RepID=ES1_DANRE Length = 270 Score = 279 bits (713), Expect = 5e-74, Method: Composition-based stats. Identities = 73/232 (31%), Positives = 131/232 (56%), Gaps = 20/232 (8%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMT-ETR 61 I V+ SGCG +DG++IHEA T+ +SR+GA+ FAP++QQ+ V++H+ + + + R Sbjct: 37 NIAVVFSGCGWWDGTDIHEAAYTMYHLSRNGARFQIFAPNQQQMHVMDHMKMQPSSSDNR 96 Query: 62 NVLIEAARITRGE----IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 N+++E+AR + G+ + L++ DA DA+I PGG G KN+S F+ G +C ++ ++ Sbjct: 97 NIMMESARFSHGQGMMQMNDLSKLDANSFDAVIFPGGHGIVKNMSTFSKDGKDCKLNNDV 156 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID------------TAEVLEE 165 + + + H+A KP+G +AP + ++ L +T+G + D + ++ Sbjct: 157 ERVLKDFHRARKPIGLSSMAPLLACRVLPS-LEVTMGYERDESSRWGRWPNTNMVQAVKS 215 Query: 166 MGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQN--IAEAASGIDKLVSRVLVL 215 MGA H + VDE NK+++TP +M + GI +V V+ + Sbjct: 216 MGARHNTREPYEAYVDEKNKVISTPTFMWETDYHYHYIFDGIGNMVKHVMRM 267 >UniRef50_C5LKS2 Putative uncharacterized protein n=4 Tax=Eukaryota RepID=C5LKS2_9ALVE Length = 348 Score = 278 bits (711), Expect = 1e-73, Method: Composition-based stats. Identities = 91/226 (40%), Positives = 129/226 (57%), Gaps = 12/226 (5%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTET 60 K++ V+LSGCG DGSEI EAV L +SR+ + CFAPD QQ+ V++H G Sbjct: 124 KRVAVVLSGCGHLDGSEIREAVFVLTELSRANTKYQCFAPDIQQMHVVDHSDGSIDSGSK 183 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+EA+RI R PL A + DAL+ PGGFGAAKNLSNFA GS +V E++ Sbjct: 184 RNVLVEASRIGREGTLPLTDLKAKDYDALMFPGGFGAAKNLSNFAVKGSGMSVHPEVERA 243 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTD--------IDTAEVLEEMGAEHVP 172 + +QAG P+G +CIAP + K+ + +T+G+D A+ ++E+G +H Sbjct: 244 IKEFNQAGHPIGLVCIAPVLAAKVLNA--EVTMGSDEVSEEYPNASAAQAVKEIGGKHFN 301 Query: 173 CPVDDIVVDEDNKIVTTPAYMLAQN-IAEAASGIDKLVSRVLVLAE 217 +++ VD K+VT+ AYM I E + +V L L Sbjct: 302 TKLNEAHVDTAKKVVTSAAYMNNTAPIHEIHESVAAMVRETLKLIN 347 >UniRef50_A9V3H5 Predicted protein n=5 Tax=Fungi/Metazoa group RepID=A9V3H5_MONBE Length = 245 Score = 272 bits (697), Expect = 4e-72, Method: Composition-based stats. Identities = 101/222 (45%), Positives = 138/222 (62%), Gaps = 10/222 (4%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ ++LSG GVYDG+E+HEA + A+SR GA FAPDKQQ V+NH+TGE M E+RN Sbjct: 23 RVALVLSGSGVYDGTEVHEASAAMGALSRQGADYKIFAPDKQQHHVVNHMTGEEMDESRN 82 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL+EAARI RG I+ L + A+ DA++VPGGFGAAKNLSNFA G+ CTVD L + + Sbjct: 83 VLVEAARIARGNIQALDKLQVADFDAVVVPGGFGAAKNLSNFAVEGAACTVDATLTDILK 142 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID--------TAEVLEEMGAEHVPCP 174 H KP+GF CIAP + +F +T+G+D + A +EMGA++V Sbjct: 143 KFHAEQKPMGFCCIAPVIAANLFKG--EVTMGSDTESDKWPFAGAAGACKEMGADYVVGD 200 Query: 175 VDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 + VD+ N+IVT PA+M + + +V VL L Sbjct: 201 ESKVHVDQANRIVTAPAFMANTAVHLVQDNVTNMVQTVLELV 242 >UniRef50_D2W324 Glutamine amidotransferase domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2W324_NAEGR Length = 305 Score = 272 bits (695), Expect = 7e-72, Method: Composition-based stats. Identities = 89/215 (41%), Positives = 120/215 (55%), Gaps = 4/215 (1%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTETRN 62 + VILSGCG DGSEI EAV ++ +S+ G + FAPD Q + +HLT E RN Sbjct: 49 VAVILSGCGYLDGSEITEAVSVMVHLSKKGYKLAFFAPDINQEETYDHLTKNVEKNEVRN 108 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + EA+RITR + L Q +ALI+PGGFG AKNLSN+A + + E++ Sbjct: 109 IRTEASRITRQNVLRLDQFRPQAFEALIIPGGFGVAKNLSNYAENPTNFKIHPEVEKAIV 168 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIG-TDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + H+A P+G CIAP + K +T+G +D E + GA VP +VVD Sbjct: 169 STHEAKIPIGMCCIAPVLAAKAIPQ-CSITLGDSDPSVTEHAKSYGANCVPKSTSQVVVD 227 Query: 182 EDNKIVTTPAYMLAQNIA-EAASGIDKLVSRVLVL 215 +DNK+VTTPAYM A E GI +V V+ L Sbjct: 228 KDNKLVTTPAYMGKNPTAFEVFDGIGSMVDSVIEL 262 >UniRef50_Q2RPB9 ThiJ/PfpI n=7 Tax=Bacteria RepID=Q2RPB9_RHORT Length = 227 Score = 266 bits (679), Expect = 5e-70, Method: Composition-based stats. Identities = 104/214 (48%), Positives = 142/214 (66%), Gaps = 1/214 (0%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 + V+LSGCGV+DG+EIHE+VLTLLAI R G A CFAPD+ Q VI+H +G+ ETRN Sbjct: 6 RFAVLLSGCGVFDGAEIHESVLTLLAIDRQGGVARCFAPDRPQYHVIDHRSGQPTGETRN 65 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL E+ARI RG I LA D A DALI+PGGFGAAKNL +FA G++C VD ++ + Sbjct: 66 VLCESARIARGAIDDLADFDPAAFDALILPGGFGAAKNLCSFAIDGADCAVDPTVERALR 125 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A AG +G +CIAP +L ++F + LTIG+D TAE + +GA H ++VVD Sbjct: 126 AARAAGLAIGALCIAPVVLARVFGEGV-LTIGSDAATAEAITALGAHHQKATHAEVVVDR 184 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 ++V++P YML +I++ A G + V ++ L Sbjct: 185 ALRLVSSPCYMLDASISQIAEGAENTVKALIALI 218 >UniRef50_Q2GLV6 Es1 family protein n=5 Tax=Anaplasma RepID=Q2GLV6_ANAPZ Length = 221 Score = 263 bits (672), Expect = 3e-69, Method: Composition-based stats. Identities = 84/217 (38%), Positives = 123/217 (56%), Gaps = 4/217 (1%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 V+L GCG DGSEI EAVL LLA+ G C AP+ +QVDV++HL+G + E R+ Sbjct: 2 NCAVLLCGCGHMDGSEIREAVLALLALDSYGINVTCCAPNIKQVDVVDHLSGSTLEEERD 61 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 ++ E+ARI RG + + D LI+PGGFG AKN S+ S +V E+K Sbjct: 62 IMSESARIARGNVVDPKDISPNDFDMLILPGGFGVAKNYSDILKGESPVSVLEEVKQTIV 121 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFD--FPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 H+ K +G +CIAPA++ +++T+G DID+ ++ G EHV C DD V Sbjct: 122 KFHKEKKAIGAICIAPAIVAASLSSVSKVKVTLGEDIDS--IISRCGGEHVFCETDDYVA 179 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 D D + +TPAYM ++ + GI K+V ++ + Sbjct: 180 DIDMGVFSTPAYMRKDSLHKIHVGIHKMVGAMVDFVK 216 >UniRef50_Q21UI1 ThiJ/PfpI n=1 Tax=Rhodoferax ferrireducens T118 RepID=Q21UI1_RHOFD Length = 226 Score = 262 bits (670), Expect = 6e-69, Method: Composition-based stats. Identities = 93/220 (42%), Positives = 135/220 (61%), Gaps = 7/220 (3%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTET 60 +KI V+L+GCG DG+E+ EAVLTLLA+ + GA C AP+ Q VINH+TGE Sbjct: 5 RKIAVLLAGCGHLDGAEVREAVLTLLALDQHGAAFQCIAPNAPQFHVINHITGEPVAGAQ 64 Query: 61 RNVLIEAARITR-GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 RN+L E++RI R G+ LA+A A+ DAL++PGG+G AKN +FA G++ V ++ A Sbjct: 65 RNILEESSRIARLGQCLDLAKAKVADYDALVMPGGYGVAKNNCSFAFKGADAEVRPDVAA 124 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIF---DFPLRLTIGTDIDTAEVLEEMGAEHVPCP-V 175 + A KP+G +CIAPA++ D LT+G D A + ++G H P Sbjct: 125 FVRGFFDAKKPVGAICIAPALVALALHQVDDSATLTLGNDAGVAAAMGQLGQRHQNTPNA 184 Query: 176 DDIVVDEDNKIVTTPAYMLAQ-NIAEAASGIDKLVSRVLV 214 +IV+DE +K+VTTPAYM +++ GI++ V+ VL Sbjct: 185 REIVIDEAHKLVTTPAYMFDDARLSDVFVGIERCVAEVLK 224 >UniRef50_C0R4T2 Enhancing lycopene biosynthesis protein 2, putative n=13 Tax=Rickettsiales RepID=C0R4T2_WOLWR Length = 241 Score = 255 bits (652), Expect = 7e-67, Method: Composition-based stats. Identities = 85/223 (38%), Positives = 123/223 (55%), Gaps = 10/223 (4%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K V+LSGCG DG E+ EAVL+LL + + CFAPD V+NH T EA E RN Sbjct: 13 KAAVVLSGCGHLDGVEVREAVLSLLVLDQQEVDVKCFAPDINITQVMNHRTKEATKEKRN 72 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL+EAARI RGEI L +A A D L+VPGG+G AKNLS+ A TV E + L Sbjct: 73 VLVEAARIARGEIYDLKEAKAENFDMLVVPGGYGVAKNLSDLAESKDMVTVMPEFERLVS 132 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFD-------FPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 KP+G +CI+PA++ I +++TIG D + +++E +G EH+ C Sbjct: 133 EFFVTKKPIGAICISPAIIVSILSSKIGKEESKVKVTIGDDRE--QLIERLGGEHIKCDT 190 Query: 176 DDIVVDEDNKIVTTPAYML-AQNIAEAASGIDKLVSRVLVLAE 217 + + DE++ + + AYM ++ GI ++ ++ Sbjct: 191 ELSIEDEEHNVFSCSAYMRSDESTYSVYQGIKHMIDSMVKKIN 233 >UniRef50_UPI0000DB6CF4 PREDICTED: similar to es1 protein n=2 Tax=Apocrita RepID=UPI0000DB6CF4 Length = 226 Score = 249 bits (635), Expect = 7e-65, Method: Composition-based stats. Identities = 72/220 (32%), Positives = 124/220 (56%), Gaps = 12/220 (5%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTG--EAMTETR 61 + VIL GCG DG+EI EA+ ++ I + +APD + ++H + + +R Sbjct: 3 VAVILCGCGYLDGTEISEAMSAMIHICLKDMKPHFYAPDVNICETVDHFIKKPDPDSPSR 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N L+EAARI R +I+PL Q A + +AL++PGGFGAAK LSNFA G++CT+ +L+ + Sbjct: 63 NALVEAARIARSDIKPLCQCQACKHEALVIPGGFGAAKTLSNFAEKGADCTIHPDLEQII 122 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTD--------IDTAEVLEEMGAEHVPC 173 + + GKP+ +CI+ ++ ++ +++T+G + D + ++MGA+ Sbjct: 123 EDFYYEGKPIASICISSVLVARVLK-GVKITLGKESPAEEWPFADAIKKAKDMGAKIEQK 181 Query: 174 PVDDIVVDEDNKIVTTPAYMLAQ-NIAEAASGIDKLVSRV 212 V + + + +TPA+M AE +GI KL+ + Sbjct: 182 SVKGMTKCKKYNVFSTPAWMYKPATFAEIYTGIGKLIGTM 221 >UniRef50_P30042-2 Isoform Short of ES1 protein homolog, mitochondrial n=6 Tax=Euarchontoglires RepID=P30042-2 Length = 237 Score = 247 bits (630), Expect = 2e-64, Method: Composition-based stats. Identities = 87/224 (38%), Positives = 119/224 (53%), Gaps = 42/224 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TET 60 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ E+ Sbjct: 43 ARVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES 102 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL E+ARI RG+I LA AA DA I PGGFGAAKNL Sbjct: 103 RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNL------------------- 143 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID---------TAEVLEEMGAEHV 171 CIAP + K+ + +T+G + + TAE ++ +GA+H Sbjct: 144 ------------LCCIAPVLAAKVLR-GVEVTVGHEQEEGGKWPYAGTAEAIKALGAKHC 190 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVL 215 V + VD+ NK+VTTPA+M + GI +V +VL L Sbjct: 191 VKEVVEAHVDQKNKVVTTPAFMCETALHYIHDGIGAMVRKVLEL 234 >UniRef50_A4IY89 DJ-1/PfpI family protein n=24 Tax=Gammaproteobacteria RepID=A4IY89_FRATW Length = 219 Score = 243 bits (621), Expect = 3e-63, Method: Composition-based stats. Identities = 104/216 (48%), Positives = 147/216 (68%), Gaps = 3/216 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHL--TGEAMT 58 M K+ V+LSGCG DGSEIHE VLT+LA+ + G + A ++ Q VINHL + ++ Sbjct: 1 MAKVAVVLSGCGYLDGSEIHETVLTILALEKQGVEWQGVALNRDQKQVINHLHQSVDSKA 60 Query: 59 ETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSEC-TVDREL 117 RN+L E+ARITRG + +A AD+ + DA+I PGGFGAAKN+ +FA +G++ +D E+ Sbjct: 61 SPRNILEESARITRGNVIDIADADSDDYDAIIFPGGFGAAKNIMDFAFVGNDSYQMDEEV 120 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDD 177 A+A + A KP G++CIAP M+P ++ + T+GTD +T +L + GAE + D Sbjct: 121 LKFARAFYLADKPAGYICIAPLMIPLVYPEGTKATVGTDENTTAILAKKGAEAIIMDATD 180 Query: 178 IVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVL 213 I VDE KIV+TPAYM A+NI EAA GI+KLV +V+ Sbjct: 181 ICVDESVKIVSTPAYMCARNILEAAQGIEKLVEKVV 216 >UniRef50_UPI000186E026 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186E026 Length = 256 Score = 242 bits (619), Expect = 4e-63, Method: Composition-based stats. Identities = 72/218 (33%), Positives = 107/218 (49%), Gaps = 9/218 (4%) Query: 7 ILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIE 66 +LSG G+ DG+EIHEA + +SR Q F+ Q DVI+H E RN L+E Sbjct: 39 VLSGSGMMDGTEIHEASACAVHLSRLDIQPKFFSVPCPQTDVIDHYKLSPTNEMRNALVE 98 Query: 67 AARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQ 126 +ARI RG+I + + E D LI PGGFG AK L+ F G+ C V+ E+ + Sbjct: 99 SARIARGKICSINSLTSDEADVLIFPGGFGVAKTLTTFDKDGANCGVNEEVVRVVNEFCA 158 Query: 127 AGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID--------TAEVLEEMGAEHVPCPVDDI 178 KP+ F CIA + +IF +++T+G D + + +MGA VD Sbjct: 159 CRKPMAFTCIAAILPARIFP-GVKVTLGKKGDPKKWPHSEAIDTVSDMGAVVEVKNVDSF 217 Query: 179 VVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 D+ + TTPA+M + GI ++ + + Sbjct: 218 TFDKQFLVFTTPAFMYEGTFYQIFEGIGNMIEALQKIM 255 >UniRef50_Q2NWH9 Sigma cross-reacting protein 27A (SCRP-27A) n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NWH9_SODGM Length = 171 Score = 242 bits (618), Expect = 5e-63, Method: Composition-based stats. Identities = 115/168 (68%), Positives = 135/168 (80%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK+IGV+LSGCGV DGSEI EAVLTLLAI R+G AVCFA DK Q+ V+NHL+GE E Sbjct: 1 MKRIGVVLSGCGVNDGSEIQEAVLTLLAIDRTGLDAVCFATDKPQLQVVNHLSGEQTDER 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+EAARI RG+I+PLA A A +LDALIVPGGFG AKNLSN A G++C VD EL L Sbjct: 61 RNVLVEAARIARGQIQPLAAASAEDLDALIVPGGFGVAKNLSNLAQTGADCEVDAELAQL 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGA 168 QA+H KPLGF+CIAPA+LPKI PLRLT+GTD+D AE+++ MGA Sbjct: 121 VQALHLQRKPLGFICIAPALLPKILAVPLRLTLGTDVDAAEMVDTMGA 168 >UniRef50_B9Z0T4 ThiJ/PfpI domain protein n=1 Tax=Lutiella nitroferrum 2002 RepID=B9Z0T4_9NEIS Length = 226 Score = 241 bits (615), Expect = 1e-62, Method: Composition-based stats. Identities = 83/222 (37%), Positives = 126/222 (56%), Gaps = 8/222 (3%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 + + ++LSGCGVYDGSEI EAV ++A+S++G +APD+ Q+ V++H G+ E R Sbjct: 3 QTVAIVLSGCGVYDGSEITEAVGVVIALSQAGLPYAFYAPDRAQMHVVDHARGQESGEAR 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N+L EAARI RG+IRPL + DAA+ A++ PGGFGAAKNL+ F G + + ++ A Sbjct: 63 NILSEAARIARGQIRPLTELDAAQHSAIVFPGGFGAAKNLTTFIKDGRDAVLYDDVAAAV 122 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFD----FPLRLTIGTDID---TAEVLEEMGAEHVPCP 174 + Q KP+ +C AP + I + +T G+ + A+ L G HV P Sbjct: 123 RPFVQQHKPVVALCAAPLVQGLIARDEGLAGVNITFGSYAEGQAMADALTSWGQTHVETP 182 Query: 175 VDDIVVDEDNKIVTTPAYML-AQNIAEAASGIDKLVSRVLVL 215 VD VD ++ ++ PAYM AE + ++ + L Sbjct: 183 VDQACVDLAHRFISAPAYMYGEATPAEVFASCQAAITALKSL 224 >UniRef50_C5CG98 ThiJ/PfpI domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CG98_KOSOT Length = 206 Score = 224 bits (571), Expect = 2e-57, Method: Composition-based stats. Identities = 68/190 (35%), Positives = 108/190 (56%), Gaps = 2/190 (1%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K G++LSGCG+ DG++I E +LT L++ + G + FAP++ Q DVI+H T + E RN Sbjct: 2 KAGILLSGCGLGDGTQIEEVMLTYLSLDKYGIDYITFAPNEMQHDVIDHYTEKPQNEKRN 61 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 +LIE+ARI RG+I + + ++DA+I+PGG G KNLS F TV++ + L + Sbjct: 62 ILIESARIGRGKICDIREVSCKDIDAIIIPGGLGVFKNLSTFIVDKKSFTVNKNVDDLLK 121 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLR-LTIGTDIDT-AEVLEEMGAEHVPCPVDDIVV 180 AM+ + K + +C A ++ K + L + T D E+L E+ V C + V+ Sbjct: 122 AMYLSKKSIAGICGAVILIAKSLSQHVSDLKVATANDAYGELLSELNVNAVNCSAKECVI 181 Query: 181 DEDNKIVTTP 190 D + P Sbjct: 182 DRKKQSSNYP 191 >UniRef50_C1C1F4 Enhancing lycopene biosynthesis protein 2 n=2 Tax=Caligus RepID=C1C1F4_9MAXI Length = 231 Score = 205 bits (521), Expect = 1e-51, Method: Composition-based stats. Identities = 73/227 (32%), Positives = 111/227 (48%), Gaps = 13/227 (5%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 + V+LSGCG DGS+ E L LA+SR + + +AP +NH+ G RN Sbjct: 5 NVAVLLSGCGHLDGSDPLEVSLLCLALSRLDIKPIFYAPYMSMSTGVNHVNGAEAETGRN 64 Query: 63 VLIEAARITRGEIRPLAQADAAE--LDALIVPGGFGAAKNLSNF-ASLGSECTVDRELKA 119 VLIE+AR+ + + L + D ++ L ALI+PGG G N S+F SL + +V +E+ Sbjct: 65 VLIESARLVKESVLKLDELDPSDETLSALIIPGGHGPLNNFSDFKTSLETPPSVIKEILG 124 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID--------TAEVLEEMGAEHV 171 + + AGKP+G A ++ + +T+G+ + A L E G Sbjct: 125 IIEGFKAAGKPIGCTSHANILVALAIPN-IEITLGSRDEEECPVASLVAPGLIEQGTTVT 183 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLA-QNIAEAASGIDKLVSRVLVLAE 217 P V ++ VD +NKIVT A + A E A I ++ L E Sbjct: 184 PTSVYEVQVDFENKIVTAAASLFASAKYHEVADQITIFFDMLMTLIE 230 >UniRef50_A6NYE6 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NYE6_9BACE Length = 189 Score = 196 bits (498), Expect = 5e-49, Method: Composition-based stats. Identities = 61/212 (28%), Positives = 84/212 (39%), Gaps = 28/212 (13%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K V+L+GCG+ DGS I E VLT A+ + G A D V ++H+T + E R+ Sbjct: 2 KFLVLLAGCGLGDGSCIEEVVLTYTALDKYGCDYTPAAADM-LVPSMDHITEQP-GEKRS 59 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL E+AR RG IR L + DAL++PGG G N + Sbjct: 60 VLTESARTGRGRIRNLHDISPDDYDALLIPGGIGLVVNYRESGL----------VADWVN 109 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 Q KP+G MC L I L D+D D Sbjct: 110 RFVQQKKPIGTMCAGIDFLRGILGAGLLREEVRDLDAVSFCR----------------DT 153 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVSRVLV 214 I TPA+ + + G+D +V +L Sbjct: 154 SGAIFYTPAFRKTGSCHDVMLGVDAMVHAMLE 185 >UniRef50_Q8WQI1 Lycopene biosynthesis-enhancing protein n=2 Tax=Tetrahymena thermophila RepID=Q8WQI1_TETTH Length = 380 Score = 195 bits (497), Expect = 7e-49, Method: Composition-based stats. Identities = 60/122 (49%), Positives = 90/122 (73%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 KK+ +ILSGCGVYDGSE+ E V ++ +++S CFAP++ Q+ V+NH+TGE TET Sbjct: 46 FKKVAIILSGCGVYDGSEVTEVVSLMVHLNKSHVSFQCFAPNQDQLHVVNHITGETTTET 105 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+E+ARI RGE++ + Q + A+++PGGFGAAKNLS++A G+ TV+ E++ + Sbjct: 106 RNVLVESARIARGEVKDITQLKGEDYQAVLLPGGFGAAKNLSDYAVNGTNFTVNSEVERV 165 Query: 121 AQ 122 + Sbjct: 166 LR 167 >UniRef50_D2V705 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V705_NAEGR Length = 250 Score = 165 bits (419), Expect = 6e-40, Method: Composition-based stats. Identities = 57/230 (24%), Positives = 107/230 (46%), Gaps = 18/230 (7%) Query: 6 VILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTETRNVL 64 VILSGCG DGS++ E+V ++ ++R G F+P ++ + N++T + +E R + Sbjct: 18 VILSGCGFMDGSDVVESVSVIVELTRKGIVPRFFSPHEEIDESYNYITKQIDSSEERYMH 77 Query: 65 IEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLG---SECTVDRELKALA 121 E+ARI R +I + Q A + D L++PGG G +NLSNF +E V+ ++ Sbjct: 78 KESARIAREKILSIDQLRADQFDMLVIPGGNGVVRNLSNFEQEEYNVNEVEVNSHVEKAI 137 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDF-------PLRLTIGTDID---TAEVLEEMGAE-- 169 + KP+GFM + + K + + +G +D ++ + G E Sbjct: 138 VDFFKQKKPIGFMSNSVILGAKALGKVSGKTGNGIAVALGKTLDRVFVETLMTKFGNELS 197 Query: 170 HVPCPVDDIVVDEDNKIVTTPAYMLAQNIA--EAASGIDKLVSRVLVLAE 217 + + D ++I + + A + E + L+ ++ L + Sbjct: 198 QESGDAEVVCTDSSHRIASVASVSAAGTVQPNEIHAAAKNLIEELIDLTK 247 >UniRef50_UPI0000E47ADE PREDICTED: similar to KNP-Ia, partial n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47ADE Length = 71 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 36/71 (50%), Positives = 52/71 (73%) Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL+E+ARI RG+I L+ + DA++ PGGFGAAKNLS+FA G+ CTV+ +++ + Sbjct: 1 NVLVESARIARGKITALSGLSSGNFDAVVFPGGFGAAKNLSDFAVNGAGCTVNPDVERVI 60 Query: 122 QAMHQAGKPLG 132 + HQA KP+G Sbjct: 61 KEFHQAKKPIG 71 >UniRef50_Q48464 Enhancing lycopene biosynthesis protein 2 homolog (Fragment) n=3 Tax=Enterobacteriaceae RepID=ELBB_KLEOX Length = 80 Score = 85.4 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 55/74 (74%), Positives = 62/74 (83%), Gaps = 4/74 (5%) Query: 146 DFPLRLTIGT----DIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEA 201 R +G D+DTA+ +EEMGAEHVPCPVDDIVVDEDNK+VTTPAYMLAQNIAEA Sbjct: 5 PPGCRHCVGQLRLCDLDTADAVEEMGAEHVPCPVDDIVVDEDNKVVTTPAYMLAQNIAEA 64 Query: 202 ASGIDKLVSRVLVL 215 ASGI+KLV+RVLVL Sbjct: 65 ASGIEKLVARVLVL 78 >UniRef50_C5DA95 Intracellular protease, PfpI family n=11 Tax=cellular organisms RepID=C5DA95_GEOSW Length = 184 Score = 67.7 bits (164), Expect = 3e-10, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 76/203 (37%), Gaps = 34/203 (16%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH-LTG-EAMT 58 M K +I++G D E E + G AP K+++ + H TG + Sbjct: 1 MSKKVLIVTG----DAVEALEVYYPYYRLLEEGYDVTIAAPKKKKLQTVVHDFTGWDTYE 56 Query: 59 ETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELK 118 E + LIEA A D + D +++PGG +D +L+ Sbjct: 57 EKQAYLIEAH-------AAFADIDPTQYDGIVIPGG-----------RAPEYIRLDADLQ 98 Query: 119 ALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + + +A KP+ +C A + + D ++ I +E +GA +V Sbjct: 99 RIVRHFFEANKPIAAICHASLIFETMPDLLKGRSLTAYIACKPGVEALGATYVSDSTT-- 156 Query: 179 VVDEDNKIVTT------PAYMLA 195 VD+ +V+ P +M Sbjct: 157 HVDQ--NLVSAHAWPDLPVFMRE 177 >UniRef50_A1K1D1 ThiJ/PfpI family protein n=17 Tax=cellular organisms RepID=A1K1D1_AZOSB Length = 193 Score = 62.3 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 28/115 (24%), Positives = 43/115 (37%), Gaps = 16/115 (13%) Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC-IA 137 A + DAL++PGG ++ ++ A Q A KP+ +C A Sbjct: 74 DSLRAEDYDALVIPGG-----------RAPEYLRLNPKVIAAVQHFFAADKPVAAICHGA 122 Query: 138 PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAY 192 + T A ++ G E+ PVD VD K+VT PA+ Sbjct: 123 QVLAAAGVLKG--RTCSAYPACAPEVKAAGGEYAEIPVDKARVD--GKLVTAPAW 173 >UniRef50_B1KEU0 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria RepID=B1KEU0_SHEWM Length = 255 Score = 61.1 bits (147), Expect = 2e-08, Method: Composition-based stats. Identities = 33/151 (21%), Positives = 60/151 (39%), Gaps = 21/151 (13%) Query: 2 KKIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE 55 K++ +++S G G+ E + + G + V + + I+ L+ + Sbjct: 30 KRVLIVMSSESAMGISGKLTGTWFEEVATPYYTLRKEGYEVVMASLEGGDAP-IDLLSMQ 88 Query: 56 AMTETRNV---LIE-AARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSEC 111 A T N L + A L++ + + DAL PGG+G L + AS Sbjct: 89 APFTTPNTDKFLNDIVAMHALENTNKLSEINPDDFDALFFPGGYGL---LWDLASDSMTI 145 Query: 112 TVDRELKALAQAMHQAGKPLGFMCIAPAMLP 142 + + + A KP+ +C APA+L Sbjct: 146 KM-------IEDFYAANKPIAMVCHAPAILR 169 >UniRef50_C9RGD3 Intracellular protease, PfpI family n=1 Tax=Methanocaldococcus vulcanius M7 RepID=C9RGD3_METVM Length = 208 Score = 60.7 bits (146), Expect = 4e-08, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 54/116 (46%), Gaps = 14/116 (12%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + + + E A+++PGG G+ + L N + EL +L + ++ K + +C++ Sbjct: 89 INEVNPNEYVAIVIPGGIGSKEYLWN----------NTELLSLVKKFYEDHKVVAAICLS 138 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAY 192 P +L + + T+ D + E L++ GA + V VVD + +P Y Sbjct: 139 PVVLARAGILKGKKATVFPDPEAIEELKKYGAIYEDKGV---VVDGNIITAQSPNY 191 >UniRef50_Q7NER9 Glr3809 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NER9_GLOVI Length = 526 Score = 59.6 bits (143), Expect = 6e-08, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 51/144 (35%), Gaps = 34/144 (23%) Query: 65 IEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAM 124 IEA+R PLA + DA+ +PGG G +L + + +L L Sbjct: 74 IEASR----ATLPLAGMASENFDAIFLPGGHGPMFDLPD----------NPDLARLLTEF 119 Query: 125 HQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAEV-------------LE 164 ++AGK + +C PA L + LT T + L Sbjct: 120 YKAGKIIAAICHGPAGLVGARRPDGAPLVAGVTLTSYTASEEVAAELDKEVPFILEDRLR 179 Query: 165 EMGAEHVPCPVDDIVVDEDNKIVT 188 +GA + ++ D + +T Sbjct: 180 ALGAHFIARENKADHIERDGQFIT 203 >UniRef50_C8WTD3 ThiJ/PfpI domain protein n=5 Tax=Bacillales RepID=C8WTD3_ALIAD Length = 223 Score = 59.2 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 47/197 (23%), Positives = 78/197 (39%), Gaps = 34/197 (17%) Query: 12 GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARIT 71 G G + E ++G + +P Q I+ + + N EA I Sbjct: 19 GHPTGLWLSEFAEPYTEFKQAGYEVTVASPRGGQAP-IDERSVQ--NGELNQWPEAVEIL 75 Query: 72 RGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPL 131 + I PL+Q A++ DA+ +PGG G + + A EL+AL + ++GK + Sbjct: 76 KQTI-PLSQVSASDYDAIFLPGGHGTMFDFPDSA----------ELQALIRTFAESGKVV 124 Query: 132 GFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAEV-------------LEEMGAEHV 171 +C PA L + R+T TD + V L E+GA+ V Sbjct: 125 AAVCHGPAGLVNVRLSNGDPLVKGKRVTAFTDEEERAVKLDDKVPFMLETRLRELGAQFV 184 Query: 172 PCPVDDIVVDEDNKIVT 188 P+ V+ D ++T Sbjct: 185 AQPMWSDHVERDGNLIT 201 >UniRef50_B8DQK9 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=B8DQK9_DESVM Length = 246 Score = 57.6 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 70/201 (34%), Gaps = 35/201 (17%) Query: 12 GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLI----EA 67 G G + E + +GA+ +P V + + + R A Sbjct: 17 GHKTGLWLEELAAPYYVFTDAGARVTLASPKGGAAPV-DPRSETEEAQNRTTRRFTADPA 75 Query: 68 ARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQA 127 A + PLA+ + D L PGG G +L + D A+ + MH+A Sbjct: 76 AMAALKDTVPLAEVRPEDYDVLFYPGGHGPLWDLVD----------DARSLAIIEKMHRA 125 Query: 128 GKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTA--------------------EVLEEMG 167 GKP+ +C PA+L + + + T + L +G Sbjct: 126 GKPVAAVCHGPAVLVRATTPDGKPLVARRNMTGFSNAEEDAVGLSQVVPFLLQDELTRLG 185 Query: 168 AEHVPCPVDDIVVDEDNKIVT 188 A++ P+ + V D +VT Sbjct: 186 AKYERGPLWEPHVVADGLLVT 206 >UniRef50_B9TI86 Protease C56, putative n=1 Tax=Ricinus communis RepID=B9TI86_RICCO Length = 199 Score = 57.6 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 43/200 (21%), Positives = 74/200 (37%), Gaps = 45/200 (22%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFA--PDKQQVDVINHLTGEAMTE 59 K I V+++ DG E E + + GA+ + P + V NHLT + Sbjct: 23 KHIAVLMT-----DGVEQVEYTQPRQFLEQQGAEVTLVSTKPKGEAVQGFNHLTPANTFD 77 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 +E + A + DAL++PGG NL ++ Sbjct: 78 -----VE---------LDVRDARPVDFDALVLPGGVANPDNL----------RLNTTAIT 113 Query: 120 LAQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + + KP+ +C P ++ R+T + + L GA+ V + + Sbjct: 114 FIREFARENKPIAAICHGPWTLIDAGVAQGKRMT--SWPSLKQDLSNAGAQWVD---EQV 168 Query: 179 VVDEDNKIVTT------PAY 192 VVD K+VT+ PA+ Sbjct: 169 VVD--GKLVTSRKPDDIPAF 186 >UniRef50_B0SBM0 Transcription regulator, DJ-1/PfpI family intracellular protease n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SBM0_LEPBA Length = 188 Score = 57.6 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 34/144 (23%), Positives = 58/144 (40%), Gaps = 37/144 (25%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ + L G E EA++ + + R + V + T E + +R Sbjct: 3 KKVLIPLC-----PGFEEMEAIILIDVLRRGNVEVVSAS-----------KTKEPVVASR 46 Query: 62 NVLIEAARITRGEIRP--LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 N + I ++ + E DA+++PGG KNL D E++ Sbjct: 47 NTI---------HISDTTFSEINVDEFDAIVLPGGMNGTKNLM----------ADTEIQK 87 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPK 143 + H + K +G +C APA+L K Sbjct: 88 ILSIFHSSKKHIGAICAAPAVLRK 111 >UniRef50_Q0ULG5 Putative uncharacterized protein n=2 Tax=Pleosporineae RepID=Q0ULG5_PHANO Length = 236 Score = 56.9 bits (136), Expect = 4e-07, Method: Composition-based stats. Identities = 46/215 (21%), Positives = 84/215 (39%), Gaps = 44/215 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K ++++ DGSE E V ++R+G + D + + H+T Sbjct: 44 MPKALILIA-----DGSEEIEFVTPYDVLTRAGFEVQSVGVDLK-NEGYAHMT------- 90 Query: 61 RNVLIEAARIT--RGEIRP-LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 RNV RI + Q D LI+PGG AK S + + Sbjct: 91 RNV-----RIVPDHTNLTSFPHQLAHEHYDILILPGGGPGAKTFST----------NPSV 135 Query: 118 KALAQAMHQAGKPLGFMCIA-PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 L ++ ++GK + +C A++ + + + + + ++ G E+ + Sbjct: 136 LQLIKSFVRSGKFVAAICAGTTALVAAGIEKKI---VTSHPSVMQEIKGAGWEY---SEE 189 Query: 177 DIVVDEDNKIVTT----PAYMLAQNIAEAASGIDK 207 +VVD K+VT+ A + + I E G +K Sbjct: 190 RVVVD--GKVVTSRGPGTALLFSLTIVEVMVGKEK 222 >UniRef50_Q5ZU31 Intracellular protease, ThiJ/PfpI family n=4 Tax=Legionella pneumophila RepID=Q5ZU31_LEGPH Length = 198 Score = 56.9 bits (136), Expect = 5e-07, Method: Composition-based stats. Identities = 28/131 (21%), Positives = 51/131 (38%), Gaps = 18/131 (13%) Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC- 135 ++ + DALI+PGG A +++++ A+ + H A KP+ +C Sbjct: 79 DFSKITLKDYDALIIPGGRAA-----------EYQRLNKDILAIIRHFHDADKPIACICH 127 Query: 136 -IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYML 194 I I + T+G + + G + +D +VVD K+VT ++ Sbjct: 128 GIQILAEAGILEDKKCTTVGF---CEPDVRKAGGHFIDTGMDGVVVD--GKLVTGATWLG 182 Query: 195 AQNIAEAASGI 205 A I Sbjct: 183 NAPWMRAFLHI 193 >UniRef50_A1VNH7 Intracellular protease, PfpI family n=12 Tax=Proteobacteria RepID=A1VNH7_POLNA Length = 204 Score = 56.5 bits (135), Expect = 6e-07, Method: Composition-based stats. Identities = 43/190 (22%), Positives = 73/190 (38%), Gaps = 35/190 (18%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 I ++ V DG E E A+ SG + QV + H + ++ + Sbjct: 13 NIAIL-----VTDGFEQEEMTGPQAALEESGVMIRLLSDRTGQVQGVRH---DQPGDSFD 64 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 V + A E DA+++PGG A S + + + L + Sbjct: 65 VDT-----------TFDKVTADEFDAVLLPGG----------AVNASRIRNNADAQELVR 103 Query: 123 AMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 M Q GKPL +C AP ++ ++T + + + LE+ GA+ V + +VVD Sbjct: 104 QMDQQGKPLAMICHAPWLLVSAGLVKGRKMT--SAPELQKDLEQAGAQWVD---EKVVVD 158 Query: 182 EDNKIVTTPA 191 + PA Sbjct: 159 RNWVSSRKPA 168 >UniRef50_Q5HPG8 ThiJ/PfpI family protein n=9 Tax=Staphylococcus RepID=Q5HPG8_STAEQ Length = 219 Score = 56.5 bits (135), Expect = 6e-07, Method: Composition-based stats. Identities = 45/214 (21%), Positives = 81/214 (37%), Gaps = 43/214 (20%) Query: 2 KKIGVIL-SGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 KK+ +L S DG+E T L + +GA + VDVI+ G+ + Sbjct: 3 KKVLFVLTSTSQFTDGTE------TGLWLEEAGAPYNILTEEGINVDVISIKGGKVNLDP 56 Query: 61 RNVLIEA----ARITR--GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVD 114 +V E+ A+ + + +A E DA+ +PGG G + +N + Sbjct: 57 NSVSNESLNQYAKFVSHLNDTPSIENVNADEYDAIYLPGGHGTVYDFAN----------N 106 Query: 115 RELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID--------- 158 +L + + K + +C P++ + +++T TD + Sbjct: 107 EKLADILLQFKNSNKIISSVCHGPSVFVGVKDANNHYLVDGVKITSFTDSEEKAMGFENK 166 Query: 159 ----TAEVLEEMGAEHVPCPVDDIVVDEDNKIVT 188 T LEE GA V V++D + +T Sbjct: 167 VPFLTQSKLEEQGANFVVKDDFTSHVEKDGQFIT 200 >UniRef50_UPI0001B4C882 ThiJ/PfpI domain-containing protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4C882 Length = 230 Score = 56.1 bits (134), Expect = 8e-07, Method: Composition-based stats. Identities = 58/255 (22%), Positives = 83/255 (32%), Gaps = 77/255 (30%) Query: 1 MKKIGVILSGCGVY---DGSE------------IHE-------AV------LTLLAISRS 32 M KI +++S ++ DGSE HE AV L + Sbjct: 1 MSKILIVMSAASIWERTDGSEYPTGYWAEELAAPHEKFVQAGFAVDFASPGGVLQPLDAH 60 Query: 33 GAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVP 92 A PD + H L E G + L + D + A+++P Sbjct: 61 SADPEIAGPDCA--HYVEHAAR--------ALSEF-----GPLLKLDEIDINDYVAVVIP 105 Query: 93 GGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFD------ 146 GG G +L DR+L L AGK +G +C PA L D Sbjct: 106 GGHGPVVDLYK----------DRDLGRLLTEADAAGKIIGAVCHGPAGLLSAVDENGKWL 155 Query: 147 -FPLRLTIGTDIDT-------------AEVLEEMGAEHVPCPVDDIVVDEDNKIVT--TP 190 +T TD + A L + GA+H P +D + T P Sbjct: 156 FAGREMTAFTDEEEQSFGTAEGAPWLLASTLRQKGAKHSGGPAYQAYNVQDRNLFTGQNP 215 Query: 191 AYMLAQNIAEAASGI 205 A + +AEA G Sbjct: 216 AS--SAPMAEAMIGA 228 >UniRef50_A1TMJ4 Intracellular protease, PfpI family n=11 Tax=Bacteria RepID=A1TMJ4_ACIAC Length = 199 Score = 55.7 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 45/175 (25%), Positives = 66/175 (37%), Gaps = 32/175 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 ++I ++ V DG E E A+ +G A AP QV NH+ Sbjct: 18 RRIALL-----VTDGFEQAELTGPRDALEGAGFDAQIVAPKPGQVQGFNHVDK------- 65 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 A R + L QA DA+++PGG E D + +A Sbjct: 66 -----ADRFDVDQT--LDQASPDAFDAVVLPGG----------VVNADELRTDEKARAFV 108 Query: 122 QAMHQAGKPLGFMC-IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 QA+ +AGKP+ +C A ++ LT + A L GA+ V PV Sbjct: 109 QAIDRAGKPVAVICHGAWLLIDAGLVKGKTLT--SWPSLATDLRNAGAQWVDRPV 161 >UniRef50_D1CDU1 Intracellular protease, PfpI family n=40 Tax=Bacteria RepID=D1CDU1_THET1 Length = 197 Score = 55.3 bits (132), Expect = 1e-06, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 83/213 (38%), Gaps = 46/213 (21%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI + + DG E E A+++ GA+A + ++ +N + + Sbjct: 8 KKIAFLAT-----DGVEQVELTEPWKAVTQEGAEAHLISIKSGEIQGVNGMDKADTFKVD 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + + Q A+E DAL++PGG + + ++++ L Sbjct: 63 --------------KTVDQVSASEYDALVLPGG----------VANPDKLRMNQDAVRLV 98 Query: 122 QAMHQAGKPLGFMCIAP--AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + ++GKP+ +C P + + T+ + ++ G V ++V Sbjct: 99 REFVESGKPVAAICHGPWTLVEADVVRG---RTLTSYPSLKTDIKNAGGNWVD---QEVV 152 Query: 180 VDEDNKIVTT------PAYMLAQNIAEAASGID 206 VD+ I+T+ PA+ A+ I E GI Sbjct: 153 VDQ--GIITSRNPNDLPAF-CAKLIEEVQEGIH 182 >UniRef50_C6CWE5 Intracellular protease, PfpI family n=6 Tax=Bacteria RepID=C6CWE5_PAESJ Length = 200 Score = 55.3 bits (132), Expect = 1e-06, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 76/206 (36%), Gaps = 38/206 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH--LTGEAMT 58 M +I++G D +E+ E + G +AV +P ++ + + H + G Sbjct: 1 MSNKVLIVTG----DAAEVLEVYYPYYRLLEEGYEAVIASPTQKILHTVCHDFIEGWDTY 56 Query: 59 ETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELK 118 + + + + D ++ A+I+PGG + EL Sbjct: 57 TEKPAHQLQSHLG------FSDVDPSDYAAIIIPGG-----------RAPEYIRGNAELP 99 Query: 119 ALAQAMHQAGKPLGFMC-IAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 + Q A KP+G +C A L + + T+ + +E +GA C D Sbjct: 100 RILQHFIDADKPIGAICHGAQVFLSLPDYSYFNGRTMTAYNASRLEVERLGA----CYAD 155 Query: 177 D-IVVDEDNKIVTT------PAYMLA 195 + + VD K+VT P +M Sbjct: 156 ETLHVD--GKLVTGHAWPDLPGFMRE 179 >UniRef50_Q12ZS1 Intracellular protease 1 n=3 Tax=Methanosarcinaceae RepID=Q12ZS1_METBU Length = 174 Score = 54.6 bits (130), Expect = 2e-06, Method: Composition-based stats. Identities = 22/100 (22%), Positives = 49/100 (49%), Gaps = 11/100 (11%) Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 ++ + + DA+ + GG GA + L + ++EL+ + + ++ GK +G +CI Sbjct: 58 SISDVNIDDYDAISITGGGGAKQYLWD----------NKELQEIVRKANEQGKIIGAICI 107 Query: 137 APAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 AP +L T+ + +T ++L++ A++ V Sbjct: 108 APVVLANAGLLEGKMTTVFKNEETVKILKDNDAKYRDKDV 147 >UniRef50_A4X5M7 ThiJ/PfpI domain protein n=4 Tax=Actinomycetales RepID=A4X5M7_SALTO Length = 231 Score = 54.2 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 46/182 (25%), Positives = 71/182 (39%), Gaps = 29/182 (15%) Query: 1 MKKIGVILSGCGVY---DGSE------IHEAVLTLLAISRSGAQAVCFAPDKQQVDV--- 48 M KI +++G + DG+ E V+ ++ +G + V PD V Sbjct: 1 MSKILFVVTGADHWELADGTRHPTGVWAEEIVVPHEMLTSAGHEVVIATPDGVVPRVDRG 60 Query: 49 --INHLTGEAMTETRNVLIEAARITRG--EIRPLAQADAAELDALIVPGGFGAAKNLSNF 104 + TG R + EA G + LA+ D E A+ PGG G ++L+ Sbjct: 61 SLLPEFTGGPAGAVR--MTEAVEGLEGLRKPIRLAEVDLDEYQAVFYPGGHGPMEDLA-- 116 Query: 105 ASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVL 163 VD++ L A QA KPLG +C P A+L + G + + Sbjct: 117 --------VDQDSGRLLVAAQQAKKPLGIVCHGPAALLAAVTADGSNAFAGCRVAAFTNV 168 Query: 164 EE 165 EE Sbjct: 169 EE 170 >UniRef50_Q1QTA8 Peptidase C56, PfpI n=27 Tax=Bacteria RepID=Q1QTA8_CHRSD Length = 204 Score = 53.4 bits (127), Expect = 5e-06, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 68/190 (35%), Gaps = 35/190 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K++ ++ + G E E A+ G + +PD + + E Sbjct: 25 KRVAILAT-----HGFEESELSAPRAALRSQGVEVHVVSPDGKGIRAWAETDWGDTYEAD 79 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + L+ D+ + AL++PGG E ++ + Sbjct: 80 --------------KALSDVDSTDYHALVLPGGL----------FNPDELRLNDQALDFV 115 Query: 122 QAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + +AGKP+ +C AP ++ R+T + AE L+ GAE V + +VV Sbjct: 116 RGFFEAGKPVAAICHAPWILINAGVVEGRRMT--SVASVAEDLKNAGAEWVD---EKVVV 170 Query: 181 DEDNKIVTTP 190 D TP Sbjct: 171 DNGLVTSRTP 180 >UniRef50_B2UQP3 DJ-1 family protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQP3_AKKM8 Length = 187 Score = 53.4 bits (127), Expect = 6e-06, Method: Composition-based stats. Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 15/119 (12%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L + A +LDALI+PGG G ++ + E+ L + MH+AGK + +C A Sbjct: 55 LDKLHADKLDALILPGGAG------SWVLRDT-----PEVIHLVKKMHEAGKLVAAICAA 103 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP-AYML 194 P +L K +T D L E GA H+ +++V+D + P A ML Sbjct: 104 PIVLAKAGLVRDRNVTAYPAQDVYRELNEAGA-HIVKD-ENVVLDGNMLTANGPGAAML 160 >UniRef50_A7GXC1 DJ-1 family protein n=6 Tax=Campylobacter RepID=A7GXC1_CAMC5 Length = 185 Score = 53.0 bits (126), Expect = 7e-06, Method: Composition-based stats. Identities = 34/144 (23%), Positives = 61/144 (42%), Gaps = 33/144 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK++ VIL+ +G E EA+ + + R+ A+C D+ V + ++ + Sbjct: 1 MKRVAVILA-----NGFEEIEALSVVDILRRADIDALCVGLDRALVVGAHGVSVKVD--- 52 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 L++ ELDA+++PGG A+NL++ +EL + Sbjct: 53 ---------------LLLSELREIELDAIVLPGGLPGAQNLAD----------SKELGEI 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI 144 + GK + +C AP L K Sbjct: 88 LRRFDDNGKLICAICAAPMALAKA 111 >UniRef50_Q313C6 Peptidase C56, PfpI n=12 Tax=cellular organisms RepID=Q313C6_DESDG Length = 255 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 35/119 (29%), Gaps = 16/119 (13%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 D A DAL+VPGG ++ + + + +A KP+ +C Sbjct: 131 CDFDSVDVAHYDALVVPGG-----------RAPEYIRLNARVIEIVRQFDKARKPIAAVC 179 Query: 136 IA--PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAY 192 + + T +E GA +VT PA+ Sbjct: 180 HGQQVLVTAGVLQGH---TCTAYPAVKPDVEAAGATWCEVNDTASNACVSGHVVTAPAW 235 >UniRef50_Q464Y3 Putative intracellular protease n=2 Tax=cellular organisms RepID=Q464Y3_METBF Length = 189 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 43/194 (22%), Positives = 75/194 (38%), Gaps = 27/194 (13%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K +IL+G D +E +E + ++ G Q AP+K+ DV+ + + T Sbjct: 1 MSKKILILTG----DCAEDYEVKVPQQSLQMLGYQVDIAAPNKKTGDVLQLVVHDFTTLD 56 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + RI +++ A + L+VPGG + E L Sbjct: 57 TYIELPGHRIPVD--VSVSEVKADDYAGLVVPGG-----------RAPEYIRMYDETIKL 103 Query: 121 AQAMHQAGKPLGFMCIAPAML--PKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 Q AGKP+ +C +L K+ ++T + A GA V Sbjct: 104 VQDFFAAGKPVAVICHGLQLLAAAKVL-EGYKVT--SYPACAPECRLAGANWQSESV--- 157 Query: 179 VVDEDNKIVTTPAY 192 ++D+ +VT A+ Sbjct: 158 IIDK--NLVTAQAW 169 >UniRef50_Q1DD54 Peptidase, C56 (PfpI) family n=2 Tax=Cystobacterineae RepID=Q1DD54_MYXXD Length = 223 Score = 51.1 bits (121), Expect = 2e-05, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 65/196 (33%), Gaps = 51/196 (26%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ V+ + DG E E + + R GA +P K ++ + N Sbjct: 8 RVAVLAA-----DGFEQVELTAPVKKLERQGADVTIVSPHKGRIRGM------------N 50 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA--- 119 +LI R++ L + AA+ DA+++PGGF V+ +L Sbjct: 51 LLIPGKRVSVDA--SLREVKAADFDAVLLPGGF-----------------VNPDLLRQSA 91 Query: 120 ----LAQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCP 174 + P+ +C P ++ + + + G V P Sbjct: 92 LALDFVRDADALDMPIAVICHGPWVLISAGLVEG--RALAAWPGIRDDVRNAGGRWVDEP 149 Query: 175 VDDIVVDEDNKIVTTP 190 V D V++P Sbjct: 150 VM-----RDGNWVSSP 160 >UniRef50_A5WBS8 ThiJ/PfpI domain protein n=2 Tax=Psychrobacter RepID=A5WBS8_PSYWF Length = 175 Score = 51.1 bits (121), Expect = 2e-05, Method: Composition-based stats. Identities = 28/131 (21%), Positives = 49/131 (37%), Gaps = 21/131 (16%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L++A A + DA+++PGG +D+ + + + A KP+ +C A Sbjct: 62 LSEASAEDYDAVVLPGG----------TVNADTIRIDKSAQNFVKQFYDANKPVAAICHA 111 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQ 196 P +L +T +E G V V D I+T+ Sbjct: 112 PWLLVNSGLVKGKTVT--AYPSLQTDIENAGGTFVDKSVQQ-----DGNIITS---RKPD 161 Query: 197 NIAEAASGIDK 207 +I + + IDK Sbjct: 162 DIEDFVAAIDK 172 >UniRef50_B9XL12 Intracellular protease, PfpI family n=1 Tax=bacterium Ellin514 RepID=B9XL12_9BACT Length = 236 Score = 51.1 bits (121), Expect = 2e-05, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 56/170 (32%), Gaps = 30/170 (17%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ V+ + DG E E + + + GAQ + ++ +N L +N Sbjct: 10 RVAVLAA-----DGVEQIELTSPVKHLEKHGAQIEVISLHPGKIKGMNLLL-----PGKN 59 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + + R + +A+ DAL++PGG + + Sbjct: 60 IKVN---------RTIFRANPDNYDALLIPGGH----------INPDFLRQSDSVLQFVR 100 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVP 172 A KP+ +C P +L T+ + + + G V Sbjct: 101 EFDAANKPIAVICHGPWVLVSA-GVVKNRTLTSWPGIKDDVINAGGNWVN 149 >UniRef50_B2JT66 ThiJ/PfpI domain protein n=4 Tax=Burkholderiales RepID=B2JT66_BURP8 Length = 230 Score = 51.1 bits (121), Expect = 2e-05, Method: Composition-based stats. Identities = 27/97 (27%), Positives = 40/97 (41%), Gaps = 16/97 (16%) Query: 54 GEAMTETRNVLIE------AARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASL 107 E + L E AA+ L A + DA+ PGG G +L+ Sbjct: 56 KSDTPEGKTELTERFKNDPAAQKVLANTVKLDTVKADDYDAVFYPGGHGPMWDLAE---- 111 Query: 108 GSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI 144 DR AL ++ + AGKP+ F+C AP +L + Sbjct: 112 ------DRRSIALIESFYNAGKPVAFVCHAPGVLRHV 142 >UniRef50_A3IN92 ThiJ/PfpI n=2 Tax=Cyanothece RepID=A3IN92_9CHRO Length = 234 Score = 51.1 bits (121), Expect = 3e-05, Method: Composition-based stats. Identities = 41/197 (20%), Positives = 73/197 (37%), Gaps = 35/197 (17%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH---LTGEAMTETRNVLI-EAARIT 71 G + EAV +G + +P+ +V I+ L +TR+ E A+ Sbjct: 23 GIWLEEAVNPYYRFLEAGFEVTLASPNGGEVP-IDEKSILDDAQTEDTRHFFQDETAQKC 81 Query: 72 RGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPL 131 L + +A DA+ +PGG G +L + +L L +A +A K + Sbjct: 82 FKNTVRLTEVEADNYDAIFIPGGHGPMWDLCE----------NEKLANLVEAFDRADKVI 131 Query: 132 GFMC--IAPAMLPKI-----FDFPLRLT--IGTDIDTAEV-----------LEEMGAEHV 171 +C A + K F LT ++ +T + L+E+GA + Sbjct: 132 AAVCHGSAGLLSAKKADGTPFVAGKELTSFSNSEEETVGLHELVPFLLESRLKELGANYT 191 Query: 172 PCPVDDIVVDEDNKIVT 188 V +D ++T Sbjct: 192 NADDFQAKVVQDGNLIT 208 >UniRef50_D1CA42 Intracellular protease, PfpI family n=4 Tax=Bacteria RepID=D1CA42_SPHTD Length = 238 Score = 51.1 bits (121), Expect = 3e-05, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 74/202 (36%), Gaps = 45/202 (22%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDV--INHLTGEAMTE 59 +++ +L+ DG E E + A+ +GA+ A +V +H E + Sbjct: 24 QRVAALLT-----DGVEQVELTEPMKALQEAGAEVKIVALKSGKVKAWDFDHWGEEFDVD 78 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + A+ + AL++PGG ++ + Sbjct: 79 ----------------LTIDHANPNDFQALLLPGG----------VMNPDTLRMNEKAVQ 112 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + M ++GKP+ +C P ML + D T+ + + G + V ++V Sbjct: 113 FVRQMVRSGKPVASICHGPWMLVEA-DVVEGRTLTSYPSLQTDIRNAGGKWVD---QEVV 168 Query: 180 VDEDNKIVTT------PAYMLA 195 VD+ IVT+ PA++ Sbjct: 169 VDQ--GIVTSRNPNDLPAFIRK 188 >UniRef50_Q0JPK7 Os01g0217800 protein n=16 Tax=Magnoliophyta RepID=Q0JPK7_ORYSJ Length = 428 Score = 50.7 bits (120), Expect = 3e-05, Method: Composition-based stats. Identities = 43/195 (22%), Positives = 73/195 (37%), Gaps = 38/195 (19%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAA---- 68 V DG+E EA T ++R+GA+ T + + R +L+EAA Sbjct: 42 VADGTEPVEAAATADVLNRAGARVTV-------------ATADPAGDDRGLLVEAAFGVK 88 Query: 69 RITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAG 128 + G + L + D + +PGG + NL +C V L+ + + + G Sbjct: 89 LVADGRVADL---EGEAFDLIALPGGMPGSANL-------RDCKV---LEKMVKKQAEQG 135 Query: 129 KPLGFMCIAPA--MLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKI 186 +C PA + L+ T +E+ AE +P +VVD + Sbjct: 136 GLYAAICATPAVTLAHWGLLKGLKATC-----YPSFMEKFTAEIIPVN-SRVVVDRNAVT 189 Query: 187 VTTPAYMLAQNIAEA 201 PA + +A Sbjct: 190 SQGPATAIEYALALV 204 >UniRef50_Q27SQ0 Protease/amidase (Fragment) n=1 Tax=Pavlova lutheri RepID=Q27SQ0_PAVLU Length = 161 Score = 50.3 bits (119), Expect = 4e-05, Method: Composition-based stats. Identities = 28/116 (24%), Positives = 44/116 (37%), Gaps = 13/116 (11%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + DAL++PGGF + + A M + GKP+G +C Sbjct: 47 IDNVSPDAFDALVIPGGF-----------SPDYMRRNPAMLAFIVRMLEQGKPVGAICHG 95 Query: 138 PAMLPKIFDFPLR-LTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVT-TPA 191 P ML D + + G + +++ VD+ VV + N I TPA Sbjct: 96 PWMLCSARDASGKPVCSGVRCTSFGAIKDDVINAGGMWVDEPVVVDANIITARTPA 151 >UniRef50_Q0W5Q2 Intracellular protease (C56 family) n=3 Tax=cellular organisms RepID=Q0W5Q2_UNCMA Length = 189 Score = 49.9 bits (118), Expect = 6e-05, Method: Composition-based stats. Identities = 21/104 (20%), Positives = 36/104 (34%), Gaps = 24/104 (23%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + E DAL++PGG + E+ Q + GK + +C Sbjct: 62 IRDLAPDEFDALVIPGG-----------QSPDHIRIYPEVIKFVQDFDRTGKTIAAVCHG 110 Query: 138 P--AMLPKIFDFPLRLTIGTDIDTAEVL----EEMGAEHVPCPV 175 P + ++ G D + L E+ GA ++ PV Sbjct: 111 PQILITARLLK-------GKDATAWKSLRVDMEDAGANYIDKPV 147 >UniRef50_D1YEB9 Intracellular protease, PfpI family n=4 Tax=Actinomycetales RepID=D1YEB9_PROAC Length = 179 Score = 49.9 bits (118), Expect = 6e-05, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 62/164 (37%), Gaps = 26/164 (15%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G E E V L A+ ++G + + + I +TG+ ++ V ++ Sbjct: 13 GVEEAELVEPLNALKKAGIEVTVAS---NSGESIQTVTGDKDWASK-VNADS-------- 60 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 LA A A++ D L++PGG +D + + L + AGKP+G +C Sbjct: 61 -RLADAKASDYDLLVIPGG----------TVNADTLRIDEDGRRLVKEFATAGKPVGAIC 109 Query: 136 IAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 P ++ +T + I LE G V + Sbjct: 110 HGPWVLIDADVAKGKTMT--SYISIRPDLENAGVSWVDKELFRC 151 >UniRef50_A9HQ45 Putative transcriptional regulator n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HQ45_GLUDA Length = 292 Score = 49.6 bits (117), Expect = 7e-05, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 59/167 (35%), Gaps = 32/167 (19%) Query: 2 KKIGVILSGCGVYD---------GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHL 52 K++ VILS D G ++E + A+ +G V P V +H Sbjct: 32 KRVLVILSSARYLDLQKHKKYETGFYLNELAVPAKALVTAGYDLVFTNPKGNIV-TWDHH 90 Query: 53 TGEAMTETRNVLIE-----------AARITRGEIRPLAQADAAELDALIVPGGFGAAKNL 101 + A+ +++ E R R + + + + DA+ +PGG ++L Sbjct: 91 SANALYFNKDLKQEQEAEHFVEHLLTVRHPR-SLSSVRKEGVDDYDAVFIPGGHAPMQDL 149 Query: 102 SNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFP 148 + + +L A+ H+ + +C P L P Sbjct: 150 AT----------NPDLGAILSEFHKRHRITALICHGPIALLSTLTSP 186 >UniRef50_Q26CT8 Intracellular protease, PfpI family n=3 Tax=Bacteria RepID=Q26CT8_9BACT Length = 182 Score = 49.6 bits (117), Expect = 8e-05, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 64/191 (33%), Gaps = 37/191 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ ++ + +G E E A+ +GA +P + N E Sbjct: 3 KKVAILAT-----NGFEEIELTSPKKALEDAGATVHIISPTGDSIKAWNGGNWSQTYEVD 57 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 ++ A++ ++L++PGG + D + Sbjct: 58 YA--------------VSDVSASDYNSLMLPGG----------VLNPDQLRQDEKSIDFI 93 Query: 122 QAMHQAGKPLGFMC--IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + + KP+ +C I P + + + + + + +E G V +++V Sbjct: 94 KDFFKQQKPVSAICHGIQPLIDADVVNG---RKLTSYPSLKKDVENAGGHWVD---EEVV 147 Query: 180 VDEDNKIVTTP 190 VDE TP Sbjct: 148 VDEGFTTSRTP 158 >UniRef50_B0N5W0 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0N5W0_9FIRM Length = 183 Score = 49.2 bits (116), Expect = 9e-05, Method: Composition-based stats. Identities = 31/141 (21%), Positives = 53/141 (37%), Gaps = 35/141 (24%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ V+ +DG E EA+ + + R+ + DK +V + + Sbjct: 1 MKKVAVL-----FHDGFEEVEALSVVDIMRRANVECTMVGMDKLEVTSSHQIK------- 48 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 I +I DA+++PGG A NL + D + L Sbjct: 49 ---------IKMDQIYD----GLDNYDAVVIPGGMPGASNLRD----------DSRVIDL 85 Query: 121 AQAMHQAGKPLGFMCIAPAML 141 + + GK +G +C P +L Sbjct: 86 VKQFNHDGKIIGAICAGPIVL 106 >UniRef50_Q04P14 ThiJ/PfpI family protein n=5 Tax=Bacteria RepID=Q04P14_LEPBJ Length = 264 Score = 49.2 bits (116), Expect = 9e-05, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 31/91 (34%), Gaps = 9/91 (9%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 + + LI+PGG A + + +EL+ + GKPLG +C Sbjct: 87 LSYKDLKPEDFEGLILPGGH--APGMKAYLES-------KELQEFVGSFFATGKPLGAIC 137 Query: 136 IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEM 166 + + +I T +L+ Sbjct: 138 HGVVLAARSKIPGTDRSILYGKKTTALLKSQ 168 >UniRef50_C5CQ44 Intracellular protease, PfpI family n=11 Tax=Bacteria RepID=C5CQ44_VARPS Length = 191 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 32/136 (23%), Positives = 57/136 (41%), Gaps = 29/136 (21%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K+ ++++ DG E E A+ +GAQ +P L G + Sbjct: 14 KVAILVA-----DGFEQAEMTEPRKALELAGAQTQIVSP----------LDGSVRAWKQ- 57 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 +E A ++ PL ADA + DAL++PGG + ++++ A + Sbjct: 58 --LEPADTFEVDV-PLKNADADDFDALLLPGG----------VANPDALRINQKAVAFVR 104 Query: 123 AMHQAGKPLGFMCIAP 138 A ++GKP+ +C P Sbjct: 105 AFVESGKPIAAICHGP 120 >UniRef50_B2IIZ8 ThiJ/PfpI domain protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IIZ8_BEII9 Length = 242 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 49/141 (34%), Gaps = 28/141 (19%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC-I 136 + + + DA+ VPGG A + V+ + Q GKP+G +C Sbjct: 129 IEEVSVEDYDAVYVPGG----------AWNPDQLRVNPAVLKYLQDFQSTGKPVGALCHG 178 Query: 137 APAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQ 196 + L + T + E + GA + PV VVD + V T ++ Sbjct: 179 SQVFLSAKLLKGRKAT--GYWNIMEDMANAGAHVLDEPV---VVDGN---VITSRFIYD- 229 Query: 197 NIAEAASGIDKLVSRVLVLAE 217 I + V ++ L Sbjct: 230 --------IPQFVKAIIDLLN 242 >UniRef50_A4FDV5 Protease I n=8 Tax=Bacteria RepID=A4FDV5_SACEN Length = 189 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 49/137 (35%), Gaps = 28/137 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 ++I V+ + DG E E A+ ++G Q + ++ +N G+ R Sbjct: 8 RRIAVLAT-----DGVEQVEYEQPRQAVEQAGGQVSLVSVHDGEIQAMN---GDIDKGDR 59 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + ++ + D L++PGG +D + Sbjct: 60 FTVD----------AKVSDVSPDDFDGLVLPGG----------TINPDRLRIDADAVGFV 99 Query: 122 QAMHQAGKPLGFMCIAP 138 ++ Q GKP+G +C P Sbjct: 100 RSFVQQGKPVGAICHGP 116 >UniRef50_B8DF54 Intracellular protease 1 (Intracellular protease I) n=26 Tax=Firmicutes RepID=B8DF54_LISMH Length = 173 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 46/121 (38%), Gaps = 21/121 (17%) Query: 63 VLIEAARITRGEI-------RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDR 115 V EA ++ G+ A + D ++VPGG+ K L F S Sbjct: 38 VAEEAKKVYHGKYGVPVTSDYDFDSVRAEDYDGILVPGGWSPDK-LRRFDS--------- 87 Query: 116 ELKALAQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCP 174 + L +A +A KP+G +C A ++ + +T + + + GA P Sbjct: 88 -VLNLVRAFDKAKKPIGQICHAGWVLVSAGILEGVNVT--STPGIKDDMTNAGAIWHNEP 144 Query: 175 V 175 V Sbjct: 145 V 145 >UniRef50_A8RCF9 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=A8RCF9_9FIRM Length = 177 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 52/134 (38%), Gaps = 29/134 (21%) Query: 11 CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARI 70 C + DG E EAV T+ + R+G D +V +LT + Sbjct: 5 CIMKDGFEELEAVGTIALLRRAGIDVDVCTSDANKVSGRFNLTLQP-------------- 50 Query: 71 TRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKP 130 ++ L + D DAL +PGG ++ +L S+ + + + + K Sbjct: 51 ----VKDLKEVDPTSYDALFLPGG-------PHYQTLESDAYIME----ILSSYIHSNKV 95 Query: 131 LGFMCIAPAMLPKI 144 + +C AP +L + Sbjct: 96 VAAICAAPTILGRA 109 >UniRef50_Q0B5J2 ThiJ/PfpI domain protein n=18 Tax=Proteobacteria RepID=Q0B5J2_BURCM Length = 237 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 76/218 (34%), Gaps = 48/218 (22%) Query: 21 EAVLTLLAISRSGAQAVCFA-----PDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 E L + +G + P +++ + + ++ A R Sbjct: 36 EVTHPLAELDAAGIPVEFASIQGGEPPVDGLELTDEVNARYWNDS------AFRDALRHT 89 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 + L DA++ A+ GG GA + + +++ + +A+++AG +G +C Sbjct: 90 QRLGDVDASKYAAVFFAGGHGAM----------WDFPGNADVQQVTRAVYEAGGVVGAVC 139 Query: 136 IAP-AMLPKIFDFPLRLTIGTDIDT-------------------AEVLEEMGAEHVPCPV 175 P A++ L G ++ A L + GA H P P Sbjct: 140 HGPAALVDVTLGDGTYLVAGKNLGAFTDEEERAVQLDHVVPFLLASTLTQRGAHHHPAPS 199 Query: 176 DDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVL 213 V D ++VT ++A+G+ + +L Sbjct: 200 WTAKVVVDGRLVT-------GQNPQSAAGVGAAIRYLL 230 >UniRef50_D2S4J1 Intracellular protease, PfpI family n=2 Tax=Frankineae RepID=D2S4J1_9ACTO Length = 189 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 29/130 (22%), Positives = 47/130 (36%), Gaps = 24/130 (18%) Query: 15 DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGE 74 DG E E A+ +GAQ + + ++ H+ + V+ + Sbjct: 16 DGVEQVELDRPWQALEEAGAQPELVSLEAGEITAYEHIDKGDSKKVDAVVSSS------- 68 Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 D E DAL++PGG G D + A +A AGKP+ + Sbjct: 69 -------DPDEYDALVLPGG----------VINGDFVRADADAVAFVKAFFDAGKPVAAI 111 Query: 135 CIAPAMLPKI 144 C A +L + Sbjct: 112 CHAGWVLAEA 121 >UniRef50_Q2NCQ5 Protease n=5 Tax=Proteobacteria RepID=Q2NCQ5_ERYLH Length = 179 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 10/67 (14%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + + A + AL++PGG ++ + L + ++A KP+ +C A Sbjct: 59 VEEVSADDYGALLLPGG----------QINPDALRMNDTVIGLIREFNEANKPIAAICHA 108 Query: 138 PAMLPKI 144 P +L + Sbjct: 109 PWLLAEA 115 >UniRef50_B3DRT7 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme n=6 Tax=Bifidobacterium RepID=B3DRT7_BIFLD Length = 182 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 38/171 (22%), Positives = 56/171 (32%), Gaps = 27/171 (15%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G E E L + +G + A + + H E T L AR Sbjct: 16 GIEETELTRPLRDLKAAGVKVTLAATTLDPCETVQHDRYEGET-----LTPDAR------ 64 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 L+ AA+ D L+VPGG V+ + LAQ GKP+ +C Sbjct: 65 --LSDVQAADYDLLVVPGG----------TCNVDRIRVNEDAITLAQEFAHEGKPIAAIC 112 Query: 136 IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKI 186 +L T A +E G +V + + VD+ N Sbjct: 113 HGAWLLVNA-GLVAGKTAAPCRYIAADIENAGGHYVD---EQLHVDDANGF 159 >UniRef50_C3MPE6 Intracellular protease, PfpI family n=11 Tax=Sulfolobus RepID=C3MPE6_SULIL Length = 173 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 48/129 (37%), Gaps = 25/129 (19%) Query: 75 IRPLA--QADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLG 132 I +A + AL++PGG G E+K L + + KP+ Sbjct: 53 ISDIAFKDVRPEDYVALVIPGGRGP-----------EHIRTLEEVKNLTRKFFELKKPVA 101 Query: 133 FMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT-- 189 +C P ++ RLT T + + G ++ +D+VVDE N I + Sbjct: 102 AICHGPQILISANLVKGRRLTSVT--SIKDDVIAAGGIYID---NDVVVDE-NLISSRVP 155 Query: 190 ---PAYMLA 195 PA+ Sbjct: 156 SDLPAFAFT 164 >UniRef50_Q7MQ54 Putative uncharacterized protein VV0154 n=1 Tax=Vibrio vulnificus YJ016 RepID=Q7MQ54_VIBVY Length = 261 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 34/145 (23%), Positives = 55/145 (37%), Gaps = 23/145 (15%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI ++ + DG E E ++ L + GA AP K+ G + E R Sbjct: 73 KKIAILAT-----DGVEELEILVPLNYLREVGANVTIVAPRKKIYP---ETLGLKIPENR 124 Query: 62 NVLIEAARITRG----EIRP-LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 I R+ +I + + + D L++PGG A D E Sbjct: 125 RTHIMTVRLMENSGWLKIDKYIDEVSFDDFDGLVLPGG----------AWNPDFLRTDVE 174 Query: 117 LKALAQAMHQAGKPLGFMCIAPAML 141 + L + + + KPL +C P +L Sbjct: 175 AQNLVREIVNSNKPLATICHGPLVL 199 >UniRef50_A8P0K7 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P0K7_COPC7 Length = 197 Score = 48.0 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 32/149 (21%), Positives = 55/149 (36%), Gaps = 25/149 (16%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAV-CFAPDKQQVDVINHLTGEAMTE 59 M V+L+ DG+E E +T + R+G Q V F P + + + A Sbjct: 1 MPSAVVLLA-----DGTEEMEFTITYDTLVRAGVQVVSAFVPAQSPGASV---SPPAAKC 52 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 +R V RI + + D L++PGG A + + + ++ Sbjct: 53 SRGV-----RILPDSYLDPTECGPDKHDLLVIPGG----------AVGAATMSANATVQK 97 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFP 148 L QA K +G +C + + P Sbjct: 98 LIQAYLDKKKYVGMICAGS-LAARTAKLP 125 >UniRef50_B5HD84 Protease I n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HD84_STRPR Length = 186 Score = 48.0 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 44/129 (34%), Gaps = 15/129 (11%) Query: 14 YDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRG 73 D +E E + + G + AP ++++ + H + E T Sbjct: 55 GDAAESLEVLYPYQRLLEEGYEVHIAAPARKKLQFVVH----DFEPGFDTYTEKPGYTWQ 110 Query: 74 EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGF 133 ++ + A+++PGG D EL+ + +A + KP+ Sbjct: 111 ADLAFSEVEPGAYAAIVIPGG-----------RAPEYLRNDPELRKILKAFFDSDKPVAQ 159 Query: 134 MCIAPAMLP 142 +C P + Sbjct: 160 ICHGPLLTA 168 >UniRef50_Q58377 Uncharacterized protein MJ0967 n=8 Tax=Euryarchaeota RepID=Y967_METJA Length = 205 Score = 48.0 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 49/116 (42%), Gaps = 14/116 (12%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + + + A+++ GG G+ + L N + +L L + + K + +C++ Sbjct: 86 IYDVNPDDYVAIVIVGGIGSKEYLWN----------NTKLIELVKEFYNKNKVVSAICLS 135 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAY 192 P +L + + T+ + E L++ GA + V VVD + +P Y Sbjct: 136 PVVLARAGILKGKKATVYPAPEAIEELKKAGAIYEDRGV---VVDGNVITAKSPDY 188 >UniRef50_Q4KCJ7 ThiJ/PfpI family protein, putative n=2 Tax=Pseudomonas fluorescens RepID=Q4KCJ7_PSEF5 Length = 261 Score = 48.0 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 31/71 (43%), Gaps = 9/71 (12%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 P A D A + L++PGG A+ + + E++ + A QA KP+ +C Sbjct: 81 LPYAGVDPAAYEGLLIPGGH--ARGMRSLLESE-------EVRRIILAFFQADKPVAAVC 131 Query: 136 IAPAMLPKIFD 146 P L + D Sbjct: 132 HGPLALARCID 142 >UniRef50_C2BZS9 C56 family peptidase n=1 Tax=Listeria grayi DSM 20601 RepID=C2BZS9_LISGR Length = 201 Score = 48.0 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 37/107 (34%), Gaps = 17/107 (15%) Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 + + D ++VPGG+ K ++ + + KP+G +C Sbjct: 87 SFDEIKPEDYDGILVPGGWSPDK-----------LRRYEKVLDFIKYFDREKKPIGQICH 135 Query: 137 A--PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAE-HVPCPVDDIVV 180 A + I D + +T + + + GA H V D + Sbjct: 136 AGWVLISAGILD-GVNVT--STPGIKDDMTNAGAIWHNEAVVTDRHI 179 >UniRef50_Q0SRB0 DJ-1 family protein n=35 Tax=Bacteria RepID=Q0SRB0_CLOPS Length = 191 Score = 47.6 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 74/191 (38%), Gaps = 37/191 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ V L+ +G E EA+ + +R A+ C A + +N G Sbjct: 1 MKKVLVFLA-----EGFETIEALSVVDVCNR--AKVTCHACSLTENRTVNSAHGTM---- 49 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 VL + + ++ D DA+++PGG + NL + + ++++L Sbjct: 50 --VLCD---------KLISDNDLETYDAIVLPGGMPGSTNLRD----------NEKVQSL 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + ++ K + +C AP L K + G + + +E D +VV Sbjct: 89 IKKYNEENKIVAAICAAPIALAKA-----GVIEGKKVTSYPGFKEELGNVNYVEEDTVVV 143 Query: 181 DEDNKIVTTPA 191 D + PA Sbjct: 144 DGNTITSRGPA 154 >UniRef50_C5VG63 DJ-1 family protein n=6 Tax=Prevotella RepID=C5VG63_9BACT Length = 189 Score = 47.6 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 72/200 (36%), Gaps = 37/200 (18%) Query: 15 DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGE 74 +G E EA+ + + R G + + +TG + E+ + ++ A + Sbjct: 10 NGFEEVEALAPVDILRRGGVEVKMVS-----------ITGSNLVESSHGVVVKADLLFEN 58 Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 I + A D L++PGG +KNL+ ++ + + GK + + Sbjct: 59 ITDFSDA-----DLLMLPGGMPGSKNLNE----------HEGVRKALKEQFEKGKRIAAI 103 Query: 135 CIAPAMLPKI-FDFPLRLT--------IGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNK 185 C AP +L + + T +G D + L + + Sbjct: 104 CAAPLVLASVGLLKGKKATIYPGMESYLGEDAEYTGALIQEDGNVTTGAGPAASFPYGYQ 163 Query: 186 IVTTPAYMLAQNIAEAASGI 205 +++ ++ A+ + E G+ Sbjct: 164 LLSY--FLPAEKVEEIKKGM 181 >UniRef50_D2W0D5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W0D5_NAEGR Length = 197 Score = 47.6 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 43/194 (22%), Positives = 68/194 (35%), Gaps = 40/194 (20%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAP-----DKQQVDVINHLTGEAMTET---RNVL 64 V D E +EA++ A++ G +P DK + + L GE N Sbjct: 10 VGDYVEDYEAMVPYQALTMVGHSVSVISPGKKKGDKVVTAIHDFLPGEQTYTELKGHNFA 69 Query: 65 IEAARITRGEIRPLAQA--DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 I A + LI+PGG S + + + + + Sbjct: 70 ITA---------DFDDVLSNLDNFGGLILPGG-----RCSEYL------RLHDNVLTIVK 109 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLE----EMGAEHVPCPVDDI 178 + KP+ +C P +L F L+ G I + GAE+V C +D Sbjct: 110 HFLEKKKPIAAICHGPLILTP-FPEHLK---GKRISAYFACKHDIQNTGAEYVQCGAEDA 165 Query: 179 VVDEDNKIVTTPAY 192 VV + IVT A+ Sbjct: 166 VV--SDNIVTGVAW 177 >UniRef50_UPI00017898AF intracellular protease, PfpI family n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017898AF Length = 168 Score = 47.6 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 26/113 (23%), Positives = 45/113 (39%), Gaps = 15/113 (13%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 +A + + DA+++PGG S +D + A +AGKP+ +C Sbjct: 57 IADVNINDYDAVVIPGG-----------SSPENLRLDSHILQFVAAADKAGKPIASICHG 105 Query: 138 PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 P +L D TI + + + GA +++VVD + TP Sbjct: 106 PQILASA-DLLKGRTITSYPPLQDDMVNAGANFKD---EEVVVDRNFITSRTP 154 >UniRef50_A7HTR2 ThiJ/PfpI domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HTR2_PARL1 Length = 238 Score = 47.6 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 47/132 (35%), Gaps = 30/132 (22%) Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 P+++ +A + DA+ +PGG G ++ + + L L +++ G +G +C Sbjct: 90 PVSELNAKDFDAVYLPGGHGCMWDMPD----------NDALSRLISEVYEKGGVVGAVCH 139 Query: 137 APA-MLPKIFDFPL---------------RLTIGTDIDT----AEVLEEMGAEHVPCPVD 176 PA +L +G D L +GA Sbjct: 140 GPAGLLGARLSDGTPFVKDRLINSFTDEEERKVGKDKAVPFLLETQLRGLGARFEGGKPF 199 Query: 177 DIVVDEDNKIVT 188 + V + ++VT Sbjct: 200 ERHVCREGRVVT 211 >UniRef50_C6RJA7 Intracellular protease, PfpI family n=3 Tax=Acinetobacter RepID=C6RJA7_ACIRA Length = 181 Score = 47.2 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 10/59 (16%) Query: 80 QADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 + D + D LIVPGG ++ +++ L Q GKP+ +C AP Sbjct: 64 KVDPTDYDILIVPGG----------TVNADTLRINEQVQQLIQHFTDNGKPIAMICHAP 112 >UniRef50_B5Y9N6 Intracellular protease 1 (Intracellular protease I) n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y9N6_COPPD Length = 169 Score = 46.9 bits (110), Expect = 4e-04, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 19/113 (16%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 LA+ E L +PGG + L F E+K A ++ G+P+G +C Sbjct: 56 LAKHKPEEFVGLYIPGGHAPDR-LRRF----------DEVKEFVSAFYKLGRPIGTICHG 104 Query: 138 P-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 P ++ + +T + I + LE GA V PV VVD+ IV++ Sbjct: 105 PQVLISAKVVEGVTMTSVSAI--KDDLENAGAIWVNQPV---VVDK--NIVSS 150 >UniRef50_UPI0001979AA3 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis protein ThiJ n=1 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI0001979AA3 Length = 193 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 59/144 (40%), Gaps = 32/144 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK + V ++ G E E V + + R+G + V + D + + H Sbjct: 1 MKNVMVPIA-----RGFEEIELVSVVDILRRAGVRVVLVSLDSHKRVLGAH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N++IEA L + D+ + DA+I+ GG+ +NL+N + + Sbjct: 47 -NIVIEAD-------NALPEFDSEDFDAIILVGGYNGMQNLAN----------NELVTLW 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI 144 + + K + +C +P +L K Sbjct: 89 LKQFENSQKLIAAICASPIVLDKA 112 >UniRef50_B8KZH8 Intracellular proteinase PfpI n=1 Tax=Stenotrophomonas sp. SKA14 RepID=B8KZH8_9GAMM Length = 221 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 29/124 (23%), Positives = 49/124 (39%), Gaps = 24/124 (19%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 PL +ADA DAL++PGG D +++ +AGKP+ +C Sbjct: 105 LPLDEADAGRFDALVLPGG----------VINPDTLRTDEAALGFIRSVAEAGKPVAAIC 154 Query: 136 IAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT----- 189 P ++ LT + + L GA+ ++VVD ++T+ Sbjct: 155 HGPWLLINSGLADGRELT--SWPSLQQDLANAGAKWRN---AEVVVD--GNVITSRKPDD 207 Query: 190 -PAY 192 PA+ Sbjct: 208 IPAF 211 >UniRef50_A6Q7R5 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosphate synthesis protein n=2 Tax=unclassified Epsilonproteobacteria RepID=A6Q7R5_SULNB Length = 186 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 66/169 (39%), Gaps = 35/169 (20%) Query: 16 GSEIHEAVLTLLAISRSGAQAVC-FAPDKQQVDVI---NHLTGEAMTETRNVLIEAARIT 71 G E EAV + + R G + + D+ Q D++ N +T +A T +NV+ Sbjct: 11 GFEELEAVALIDVMRRGGIEVRVAYLEDEMQSDLVLGANGITVKADTSIKNVI------- 63 Query: 72 RGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPL 131 + + D +++PGG+G +A + ++ L + +A K + Sbjct: 64 -----------SDDFDMMVLPGGWG-----GTYALAE-----NTRVQELLREF-KAKKIV 101 Query: 132 GFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 G MC AP L + R T E ++ G V+D V Sbjct: 102 GAMCAAPFALKQAGVLGERYT--AYPGAVEEIDHPGYVADEKVVEDGNV 148 >UniRef50_B4U9D2 DJ-1 family protein n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U9D2_HYDS0 Length = 183 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 34/141 (24%), Positives = 53/141 (37%), Gaps = 32/141 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+L+ G E EA+ + + R G + + + + + Sbjct: 1 MAKVAVLLA-----PGFEEVEAIAPIDILRRGGVEVLIVG-----------VKDKVIPSA 44 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNV IE I L D LD +I+PGG +NL E+K L Sbjct: 45 RNVKIE----VDVTIDELKDVD--NLDMIIIPGGMIGVENL----------KKSEEVKNL 88 Query: 121 AQAMHQAGKPLGFMCIAPAML 141 M+ K + +C P +L Sbjct: 89 INQMNAKKKYVSAICAGPLVL 109 >UniRef50_A4XRE4 ThiJ/PfpI domain protein n=3 Tax=Gammaproteobacteria RepID=A4XRE4_PSEMY Length = 284 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 27/109 (24%), Positives = 38/109 (34%), Gaps = 21/109 (19%) Query: 82 DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML 141 D L++PGG +L+N + E+ AL + HQAGKP +C P L Sbjct: 124 DLQRYAGLLIPGGHAPLIDLAN----------NPEVGALLRHFHQAGKPTAAICHGPIAL 173 Query: 142 PKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 + D A + P D I I +TP Sbjct: 174 -----------LSAQRDPAAYQAALANGETPAAADWIYQGYRMTIFSTP 211 >UniRef50_C7N926 DJ-1 family protein n=3 Tax=Leptotrichia RepID=C7N926_LEPBD Length = 187 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 73/191 (38%), Gaps = 38/191 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V L+ +G E EA+ + + R+G T + ++ T Sbjct: 4 KKVAVFLA-----NGFEEIEAITPIDLLERAGI------------------TVDTVSITE 40 Query: 62 NVLIEAARITRGEIRP-LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N L+E+AR R + + + +E D LI+PGG G + L ++++ Sbjct: 41 NNLVESARKVRVLADKVIKEINFSEYDMLILPGGPGFKNYFDSQLLLDKIVEFSKDVEN- 99 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 K + +C AP +L L + G EE + P + VV Sbjct: 100 --------KKVAAICAAPTVL-----SSLGILEGKKAVCFPACEEDLLKGNPILTRERVV 146 Query: 181 DEDNKIVTTPA 191 ++N I + A Sbjct: 147 KDENIITSRSA 157 >UniRef50_B8FFC8 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria RepID=B8FFC8_DESAA Length = 199 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 29/117 (24%), Positives = 43/117 (36%), Gaps = 14/117 (11%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L D AE DAL +PGGF A + + + + GKP+ +C+ Sbjct: 60 LKDLDLAEFDALAIPGGFQRAGFYED--------AYHEDFLEAVRHFDKTGKPIAAICVG 111 Query: 138 PAMLPKI-FDFPLRLTIGTDIDT--AEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPA 191 + K T ++ + L E GA + V VVD + T PA Sbjct: 112 AMPVGKSGVLTGRNATTYHLVNARRRKQLAEFGAVVLDQHV---VVDRNIITSTGPA 165 >UniRef50_Q7M905 MONOPHOSPHATE SYNTHESISPROTEIN n=1 Tax=Wolinella succinogenes RepID=Q7M905_WOLSU Length = 185 Score = 46.5 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 31/58 (53%), Gaps = 10/58 (17%) Query: 81 ADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 + +D +++PGG+G L++ ++++ +A+H+ KP+G +C AP Sbjct: 61 IEPQNIDMIVLPGGWGGTVALAS----------HPLVRSMVEALHKRQKPIGAICAAP 108 >UniRef50_B1YM90 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YM90_EXIS2 Length = 218 Score = 46.1 bits (108), Expect = 7e-04, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 10/77 (12%) Query: 68 ARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQA 127 AR + PLA D A DA+ PGG GA + N + + +AM + Sbjct: 68 ARRALHDTTPLADVDPASFDAVYFPGGHGAVVDFPN----------NPLVAGAIEAMVRK 117 Query: 128 GKPLGFMCIAPAMLPKI 144 + +C PA + Sbjct: 118 DGVVASVCHGPAAFAHV 134 >UniRef50_Q5FQ93 Protease I n=1 Tax=Gluconobacter oxydans RepID=Q5FQ93_GLUOX Length = 175 Score = 46.1 bits (108), Expect = 7e-04, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 70/190 (36%), Gaps = 32/190 (16%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ + + DG E E A+ ++GA + + +N + Sbjct: 1 MVKVAALAT-----DGLEEIELTGPQEALEKAGATVTVISLKAGEFQAVN-KDIYPSNKI 54 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R L +A A + DAL++PGG + +++++ A Sbjct: 55 RADL------------AIADAKVEDYDALLLPGGL----------ASPDALRMNKDVVAF 92 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 A+A +A KP+ +C L ++ + T+ + LE GA V +++VV Sbjct: 93 ARAFVKANKPIAAICHGAQTLIEVNELK-GRTVTSWPAIRTDLENAGASWVD---NEVVV 148 Query: 181 DEDNKIVTTP 190 D P Sbjct: 149 DGPYVFSRCP 158 >UniRef50_Q21F36 ThiJ/PfpI n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21F36_SACD2 Length = 366 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 27/132 (20%), Positives = 54/132 (40%), Gaps = 16/132 (12%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G E+ E + +G + + + ++ + + M + + A I + +I Sbjct: 80 GYELTELSRAYYVLQANGFEVDVASTQGGKPKMV--IDTDDMGQHDYAFLNDA-IAQTKI 136 Query: 76 R---PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLG 132 P+ Q A++ DA+ GG GA + N + ++ L + M+Q GK + Sbjct: 137 TNTIPINQVSASQYDAIYFVGGKGALFDFPN----------NTAIQTLVRDMYQQGKVIA 186 Query: 133 FMCIAPAMLPKI 144 +C PA L + Sbjct: 187 AICHGPAALVNV 198 >UniRef50_C0WIV6 C56 family peptidase n=8 Tax=Actinomycetales RepID=C0WIV6_9CORY Length = 243 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 60/171 (35%), Gaps = 31/171 (18%) Query: 39 FAPDKQQVDVINHLTGEAMTETRNVLIEA--ARITR--GEIRPLAQADAAELDALIVPGG 94 FA + V++ + E ++E E A I + LA A+ D + PGG Sbjct: 65 FATPNAKAPVVDEYSLEVLSEGDRAEQENYLAEIGPDLEDPLNLADVKEADYDLVFYPGG 124 Query: 95 FGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI---------- 144 G ++L+ D++ L Q ++G+ L +C APA L + Sbjct: 125 HGPMEDLA----------YDQDSAKLLQERIESGRALSLVCHAPAALLALDNDNWPLKGY 174 Query: 145 -FDFPLRLTIGTDIDTAEV------LEEMGAEHVPCPVDDIVVDEDNKIVT 188 G + A L E+GA+ V+ D + T Sbjct: 175 TMTGFTNAEEGEETIAAAKWVVETRLRELGADFKQTDPMQPYVEVDRNLYT 225 >UniRef50_C4G3X0 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3X0_ABIDE Length = 182 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 39/83 (46%), Gaps = 17/83 (20%) Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N+ I++ R+ L + + E +A+I+PGG A NL + D ++ + Sbjct: 44 NITIKSDRL-------LEEIKSEEYNAVILPGGLPGATNLRD----------DDKVITIL 86 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI 144 + M+ GK + +C AP L + Sbjct: 87 KEMNNEGKIVAAICAAPIALERA 109 >UniRef50_P80876 General stress protein 18 n=22 Tax=Bacteria RepID=GS18_BACSU Length = 172 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 25/115 (21%), Positives = 45/115 (39%), Gaps = 16/115 (13%) Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 + ++++ DAL++PGGF + D +A KP+ +C Sbjct: 57 SIDDVNSSDFDALLIPGGF-----------SPDQLRADDRFVQFTKAFMTDKKPVFAICH 105 Query: 137 APAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVT-TP 190 P +L R G + +E GA+ V ++VV +D + + TP Sbjct: 106 GPQLLINAKALDGRKATGYTSIRVD-MENAGADVVDK---EVVVCQDQLVTSRTP 156 >UniRef50_C7PRI4 ThiJ/PfpI domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRI4_CHIPD Length = 257 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 60/161 (37%), Gaps = 38/161 (23%) Query: 2 KKIG-VILSGCGVYDGSE----IHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 KK+ V+ S + DG++ + E + + + +P+ G+A Sbjct: 31 KKVLIVVTSFSALKDGTKMGLWLEEFTTPYYLLKENNIELTIASPE----------GGKA 80 Query: 57 MTETRNVL----IEAARITRG---------EIRPLAQADAAELDALIVPGGFGAAKNLSN 103 + R++L +A+ G L+ A + DA+ PGG +L Sbjct: 81 PVDPRSILPDFLTPSAKQFLGDGQAQKVLNNTVKLSTVKAKDYDAVFYPGGHAPMWDLPE 140 Query: 104 FASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI 144 + + AL QA + KP+ F+C PA L I Sbjct: 141 ----------NAKSVALIQAFIEQQKPVAFVCHGPAALKNI 171 >UniRef50_C1SLG0 DJ-1 family protein n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SLG0_9BACT Length = 187 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 48/146 (32%), Gaps = 41/146 (28%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+L+ DG E EAV + + R+ + C A K G Sbjct: 1 MGKVIVVLA-----DGFEEIEAVSVIDILRRADVEV-CAAGVKD---------GNVKG-- 43 Query: 61 RNVLIEAARITRGEIRP----LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 G I L D + D +++PGG A ++ Sbjct: 44 ----------AHGLIVKPDSTLEDIDEDDYDMIVLPGG----------AVGAENIGKSKD 83 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLP 142 + + + K + +C AP +L Sbjct: 84 ADDILRKFKKDDKYIAAICAAPKILA 109 >UniRef50_D0SXB4 Intracellular protease n=2 Tax=Acinetobacter RepID=D0SXB4_ACILW Length = 188 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 22/63 (34%), Gaps = 10/63 (15%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 D + D L++PGG ++++ + + Q KP+ +C Sbjct: 65 TSFDHVDPNDYDLLVIPGG----------TVNADTLRINQDAQKIIQHFADNHKPIAAIC 114 Query: 136 IAP 138 P Sbjct: 115 HGP 117 >UniRef50_D2RSY5 Intracellular protease, PfpI family n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RSY5_9EURY Length = 187 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 33/133 (24%), Positives = 52/133 (39%), Gaps = 25/133 (18%) Query: 15 DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVD----VINHLTGEAMTETRNVL-----I 65 DGSE+ E V + +++ G + V F V ++ + G E + V Sbjct: 7 DGSEL-EDVTVAVFLAQEGTEEVEFVEPTDLVTDAGATVD-VVGSETGEGQTVNNDLEGS 64 Query: 66 EAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMH 125 E+ I + + A + DA+IVPGG A L + E L + Sbjct: 65 ESYEIKK----SFDEISADDYDAVIVPGGTVGADTLRTY----------DEGVDLLRQHV 110 Query: 126 QAGKPLGFMCIAP 138 +AGKP +C P Sbjct: 111 EAGKPTAVICHGP 123 >UniRef50_C3M432 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme n=33 Tax=Vibrio RepID=C3M432_VIBC3 Length = 205 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 36/127 (28%), Positives = 52/127 (40%), Gaps = 29/127 (22%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAP-DKQQVDVINHLTGEAMTETRNVLIEAARITRGE 74 GSE E V+ + + R+G Q A DK QV +R V + A + Sbjct: 16 GSEEMETVIIVDTLVRAGFQVTMAAVGDKLQVQG-----------SRGVWLTAEQT---- 60 Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 L A DAL +PGG G A+ ++ + L AL A Q GK + + Sbjct: 61 ---LEACSAEAFDALALPGGVGGAQAFADSTA----------LLALIDAFSQQGKLVAAI 107 Query: 135 CIAPAML 141 C PA++ Sbjct: 108 CATPALV 114 >UniRef50_B1K8Q4 ThiJ/PfpI domain protein n=220 Tax=cellular organisms RepID=B1K8Q4_BURCC Length = 228 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 21/69 (30%), Positives = 29/69 (42%), Gaps = 10/69 (14%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 R LA A + DA+ PGG G L + A L + AGKP+ +C Sbjct: 84 RKLADVSADDYDAVFYPGGHGP---LWDLAED-------LHSIGLIERALAAGKPVAAVC 133 Query: 136 IAPAMLPKI 144 AP +L + Sbjct: 134 HAPGVLRHV 142 >UniRef50_UPI0001AEB94E ThiJ/PfpI domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB94E Length = 288 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 22/114 (19%), Positives = 37/114 (32%), Gaps = 18/114 (15%) Query: 34 AQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIRPLAQADA------AELD 87 A AP + VI ++ R++ A I I E + Sbjct: 79 ATPKGNAPSVDKKSVIPQYFDGGESQMRDIQTFVASI--EGIDDTLSLSEVIGQGLEEFE 136 Query: 88 ALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML 141 + +PGG +L+N + ++ + H GKP +C P L Sbjct: 137 GVFIPGGHAPLIDLAN----------NPQVGEILSHFHSEGKPTAAICHGPIAL 180 >UniRef50_Q045Z0 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme, amidase family n=37 Tax=Lactobacillales RepID=Q045Z0_LACGA Length = 194 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 44/201 (21%), Positives = 73/201 (36%), Gaps = 40/201 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+ + DG E E + + + R DK+++D +H+ Sbjct: 1 MTKVAVVFA-----DGCEEVEGLSVVDVLRRLNIDCDMVGLDKKEIDGDHHI-------- 47 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +L + + D + PGG A NL N +++L L Sbjct: 48 --LLT---------CDKVVDDSLLDYDLVAFPGGRTGALNLRN----------NKKLADL 86 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 ++AGK MC AP L T + ++ EE H + V Sbjct: 87 MIQRNKAGKWDAAMCAAPIALGHYGLLEGANYTCYPGFE-KQIEEECPNGHFSTDIT--V 143 Query: 180 VDEDNKIVTT--PAYMLAQNI 198 VD+++KI+T+ PA A Sbjct: 144 VDKEHKIITSRGPATAWAYAY 164 >UniRef50_A4WJ08 ThiJ/PfpI domain protein n=1 Tax=Pyrobaculum arsenaticum DSM 13514 RepID=A4WJ08_PYRAR Length = 158 Score = 45.3 bits (106), Expect = 0.001, Method: Composition-based stats. Identities = 26/124 (20%), Positives = 44/124 (35%), Gaps = 24/124 (19%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 LA+ E D L++PG ++K + + + + P+ +C A Sbjct: 45 LAEVKPEEYDGLVIPG---------RRVPEYVRVVASGDVKRVVRHIFERNTPVAAICYA 95 Query: 138 PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT------PA 191 PA + +T + I E G V ++VVD +VT PA Sbjct: 96 PATARVV--KGREVT--SHIAVGPEAENNGGIWVD---QEVVVD--GNLVTARAWLDNPA 146 Query: 192 YMLA 195 +M Sbjct: 147 WMRE 150 >UniRef50_A1AQV7 Metal dependent phosphohydrolase n=3 Tax=Bacteria RepID=A1AQV7_PELPD Length = 388 Score = 45.3 bits (106), Expect = 0.001, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 54/142 (38%), Gaps = 35/142 (24%) Query: 15 DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGE 74 DG E EA+ + + R+G + V+ L G +E+ R R Sbjct: 10 DGFEEIEAMTVVDVLRRAGFEV-----------VLAGLHGGP--------VESVR--RVS 48 Query: 75 IRP---LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPL 131 + P + A + + D +I+PGG A NLS D + L + K + Sbjct: 49 VIPDATIDAARSDQFDMVILPGGQPGAANLS----------ADVRVIRLLNDFSKDNKLI 98 Query: 132 GFMCIAPAMLPKI-FDFPLRLT 152 G +C A +L + R+T Sbjct: 99 GAICAATTVLSEAGLIRGKRVT 120 >UniRef50_Q9ZV19 ProteaseI (PfpI)-like protein n=4 Tax=Arabidopsis thaliana RepID=Q9ZV19_ARATH Length = 398 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 55/160 (34%), Gaps = 31/160 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQ-----QVDVINHLTGE 55 ++K ++L G D E +E ++ L + G C +P++ + + L E Sbjct: 5 VQKSALLLCG----DYMEAYETIVPLYVLQSFGVSVHCVSPNRNAGDRCVMSAHDFLGLE 60 Query: 56 AMTE-TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVD 114 TE + L A D +I+PGG F L + D Sbjct: 61 LYTELVVDQLTLNA--------NFDDVTPENYDVIIIPGG--------RFTEL---LSAD 101 Query: 115 RELKALAQAMHQAGKPLGFMCIAPAML--PKIFDFPLRLT 152 + L ++ K + C + ML I ++ T Sbjct: 102 EKCVDLVARFAESKKLIFTSCHSQVMLMAAGILAGGVKCT 141 >UniRef50_A8QA68 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8QA68_MALGO Length = 159 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 31/89 (34%), Gaps = 11/89 (12%) Query: 82 DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML 141 A + DA I+PGG G A LS D + + + H GK +G +C + Sbjct: 72 HAGDYDAYIIPGGAGGANTLSK----------DPTVLQILRDSHANGKIVGMICAGS-LA 120 Query: 142 PKIFDFPLRLTIGTDIDTAEVLEEMGAEH 170 L I + + L H Sbjct: 121 ALEARVGLGGPITSHPSVKDKLASCTYAH 149 >UniRef50_D0STJ8 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0STJ8_ACILW Length = 175 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 37/143 (25%), Positives = 55/143 (38%), Gaps = 32/143 (22%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I V+L+ ++ E E V LL S A + V+ L G Sbjct: 6 KRIAVLLTN--NFEDQEYLEPVAALLEAKHSICNIEFNAGN-----VVYGLHG------- 51 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 A I RG + Q + DAL++PGG A +LS D + Sbjct: 52 ---CSAVTIDRG----IEQISIDDFDALLIPGGESAV-SLST----------DLRVLQFI 93 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI 144 QA +QA K + +C A +L + Sbjct: 94 QAFNQAKKTIFSLCDASLLLCEA 116 >UniRef50_A1ZY79 ThiJ/PfpI n=3 Tax=Bacteria RepID=A1ZY79_9SPHI Length = 320 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 22/72 (30%), Gaps = 10/72 (13%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + Q DA+ VPGG +L + + L H KP G +C Sbjct: 158 IEQIGIENFDAVFVPGGHAPLGDLVD----------NDLLSKFLHHFHAKSKPTGLVCHG 207 Query: 138 PAMLPKIFDFPL 149 P L Sbjct: 208 PVALLSSLPNSA 219 >UniRef50_Q6F1K3 Putative intracellular protease/amidase n=1 Tax=Mesoplasma florum RepID=Q6F1K3_MESFL Length = 181 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 55/144 (38%), Gaps = 33/144 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ +IL + E EA++T+ + RS + + + V +H Sbjct: 1 MKKVAIIL-----HKNFEESEAIVTIDILRRSEIIVDIYNIENKDFQVGSH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N++++ + ++ D +++PGG G +E + L L Sbjct: 47 -NIIVKTE-------YNIQSLNSQNYDGIVIPGGPGV-----------NELFDNEILLNL 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI 144 + + K + +C AP +L Sbjct: 88 IKDFNDKNKMVSAICAAPQILGLA 111 >UniRef50_C6PQH3 ThiJ/PfpI domain protein n=11 Tax=Bacteria RepID=C6PQH3_9CLOT Length = 192 Score = 44.9 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 54/141 (38%), Gaps = 15/141 (10%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L++ + E DA+ +PGGF A F + L + +A K + +C+A Sbjct: 60 LSEVNVEEFDAVAIPGGFEEA----GFYED----AFSEDFLNLIREFDKADKIIASICVA 111 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP--AYML 194 + K T + L E G + P IV+D++ P A+ + Sbjct: 112 ALPIGKSGVLNGRNATTYNLGKRQKQLSEFGVNVI--PDKPIVIDKNIITSYNPSTAFNV 169 Query: 195 AQNIAEAASGIDKL--VSRVL 213 A + E + + V R++ Sbjct: 170 AFKLLELLTSKENCNNVKRLM 190 >UniRef50_C5NVZ3 DJ-1 family protein n=1 Tax=Gemella haemolysans ATCC 10379 RepID=C5NVZ3_9BACL Length = 191 Score = 44.5 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 58/143 (40%), Gaps = 31/143 (21%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ + V +GSE E + L + R+ Q + + E +T + Sbjct: 3 KKVALF-----VENGSEELELIAPLDILRRANIQVDLISAN----------NEEYITSSH 47 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 +V I I +I + + DA+++PGG + L + + ++ Sbjct: 48 DVKI----IVDKKINDIDNI--LDYDAIVIPGGMPGSTLLRD----------NDKIIKFY 91 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI 144 Q M+ AGK + +C AP +L K Sbjct: 92 QEMYNAGKLVAAICAAPIVLSKA 114 >UniRef50_C4L0C2 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L0C2_EXISA Length = 219 Score = 44.5 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 35/85 (41%), Gaps = 18/85 (21%) Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 TR +L + AR L Q DA+ +PGG G + + L+ Sbjct: 70 TRELLKDTAR--------LDQVADESYDAIFLPGGHGTV----------VDFPENETLQR 111 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKI 144 + + ++++G +G +C P L + Sbjct: 112 VIRNVYESGNIVGAVCHGPIGLVNV 136 >UniRef50_C5A5Q9 Peptidase C56, intracellular protease PfpI family (PfpI) n=4 Tax=Euryarchaeota RepID=C5A5Q9_THEGJ Length = 166 Score = 44.5 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 52/144 (36%), Gaps = 32/144 (22%) Query: 68 ARITRGEI-----------RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 A RG I + D E DAL++PGG ++ + Sbjct: 33 ASFERGRITGKHGYSVEVHLRFDEVDPDEFDALVLPGG-----------RAPERIRLNEK 81 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 A+A+ M + GKP+ +C P +L + G + +++ VD Sbjct: 82 AVAIAKKMFEDGKPVATICHGPQIL---ISAGVLK--GRKGTSYAGIKDDMINAGVKWVD 136 Query: 177 DIVVDEDNKIVT-TP----AYMLA 195 + VV + N + + P A+M Sbjct: 137 EPVVVDGNWVSSRHPEDLYAWMRE 160 >UniRef50_B3T0X6 Putative DJ-1/PfpI family protein n=1 Tax=uncultured marine microorganism HF4000_007I05 RepID=B3T0X6_9ZZZZ Length = 175 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 44/120 (36%), Gaps = 20/120 (16%) Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 ++ + Q + D L++PGG A + + D+ H+A K + + Sbjct: 57 VKDINQVKVNDYDLLVLPGGVKALE----------KTRQDKRFIKFIADFHKADKVIACI 106 Query: 135 CIAPAML--PKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAY 192 C +L KI I + + G + P VVD +KIVTT Y Sbjct: 107 CSGVQLLISAKIIKGK---KIAGYYSLEDDIVNAGGIYTDQP---AVVD--SKIVTTAHY 158 >UniRef50_B0UB01 ThiJ/PfpI domain protein n=2 Tax=Alphaproteobacteria RepID=B0UB01_METS4 Length = 242 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 32/89 (35%), Gaps = 10/89 (11%) Query: 56 AMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDR 115 + R + E AR L D DA+ +PGG G + D Sbjct: 80 PASVRRFLADERARAVAKNSPALTSVDPQAFDAVFLPGGHGPM----------WDAANDD 129 Query: 116 ELKALAQAMHQAGKPLGFMCIAPAMLPKI 144 L + +M AGK + +C PA L + Sbjct: 130 TLARIIGSMIDAGKFVAAVCHGPAGLVRA 158 >UniRef50_D1YUX4 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YUX4_METPS Length = 186 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 54/124 (43%), Gaps = 16/124 (12%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + L+AL++ GG GA ++L + + L + ++ GK +G +CI+ Sbjct: 62 IEDIKVDSLNALVIAGGKGAREHLWH----------NEALLRKVREANEKGKVIGAICIS 111 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQ 196 A+ R T+ D EVL+E G +V + +VVD +VT A+ Sbjct: 112 GAIPAIAGIMRGRRGTVYPDTGALEVLKENGETYVN---EGVVVD--GNVVTGAGPTYAK 166 Query: 197 NIAE 200 AE Sbjct: 167 EFAE 170 >UniRef50_Q13FG4 Peptidase C56, PfpI n=1 Tax=Burkholderia xenovorans LB400 RepID=Q13FG4_BURXL Length = 227 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 43/115 (37%), Gaps = 19/115 (16%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 + A + DA+++PGGF + + M G+ + +C Sbjct: 58 LSIDDVKAEDFDAVVIPGGFAP-----------EGMRRHPAMIEFVREMDAQGRLIAAIC 106 Query: 136 IAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 A +L +LT + + + GA +V + +V+D + I+T+ Sbjct: 107 HAGLVLASAQIARNRKLTCVS--LVKDDVINAGANYVN---EGLVID--HNIITS 154 >UniRef50_Q24FT7 DJ-1/PfpI family protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FT7_TETTH Length = 204 Score = 44.5 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 60/183 (32%), Gaps = 20/183 (10%) Query: 14 YDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLT--GEAMTETRNVLIEAARIT 71 D E +E ++ + G P+K+ +++T E E +I Sbjct: 10 GDFGEDYEVMVPFQVLHAIGYTVHTVCPNKKAG---DYVTCVVEEGGEIEKFQTYTEKIG 66 Query: 72 RGEIR--PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGK 129 Q E AL++ GG D + L + + K Sbjct: 67 HRFFLNYDFDQVKPEEYYALVLAGG-----------RAPEYLKYDPSVLKLVKHFTDSKK 115 Query: 130 PLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 + +C +L + + +G T+ + G + ++D ++ N ++T Sbjct: 116 SILVICHGYQILCALQGCIEGIVLGGPTPTSYEITNAGGIYQQIKMEDALL--YNNFIST 173 Query: 190 PAY 192 PAY Sbjct: 174 PAY 176 >UniRef50_A9F9P4 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F9P4_SORC5 Length = 327 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 25/104 (24%), Positives = 43/104 (41%), Gaps = 15/104 (14%) Query: 83 AAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLP 142 A D ++VPGG G +L + + +L AL +P+G +C APA+L Sbjct: 175 ADSFDGIVVPGGQGVMVDLLD----------NADLHALLAHFGAKHQPVGLICHAPAVLT 224 Query: 143 KIFDFPLR--LTIGTDIDTAEVLEE---MGAEHVPCPVDDIVVD 181 ++ T+ + E E MGA+ + + + D Sbjct: 225 RMRQPHPLSGRTVTSVSSFEEFYIETFVMGADAQVRGIGEQLED 268 >UniRef50_B1YMA0 Intracellular protease, PfpI family n=14 Tax=Bacteria RepID=B1YMA0_EXIS2 Length = 176 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 43/120 (35%), Gaps = 17/120 (14%) Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA- 137 + + A+ DA++VPGG+ + + + ++ +P+G +C A Sbjct: 62 DEINPADYDAILVPGGW-----------SPDLLRRFDSVLTMVRHFNETKQPIGQICHAG 110 Query: 138 -PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAE-HVPCPVDDIVVDEDNKIVTTPAYMLA 195 + + + +T + + + GA H V D + + P YM Sbjct: 111 WVLISAGVLK-GINVT--STPGIKDDMTNAGATWHDEPVVVDGHIISSRRPPDLPDYMRE 167 >UniRef50_C0W6P5 Possible transcriptional regulator n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6P5_9ACTO Length = 196 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 42/181 (23%), Positives = 74/181 (40%), Gaps = 35/181 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V+++ G E EA+ L + R+G A + I H + + + + Sbjct: 10 KKVAVLVA-----PGLEEVEALAPLDILFRAGIPAHLIS--------ITH-SRQVTSSHQ 55 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 VL A + L++AD D + +PGG NL ++ V + Sbjct: 56 VVLSCTA-----TLDELSEADLDSYDMVFLPGGIPGTPNL------KADARVRELVTQRV 104 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + A +P+ +C AP++L ++ R T + +VL + GA+ V VV Sbjct: 105 R----ADRPVAAICAAPSILAELGLLEGRRAT--ANPSFVQVLADHGAQVSEASV---VV 155 Query: 181 D 181 D Sbjct: 156 D 156 >UniRef50_B1GZW9 Putative intracellular protease n=1 Tax=uncultured Termite group 1 bacterium phylotype Rs-D17 RepID=B1GZW9_UNCTG Length = 168 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 22/100 (22%), Positives = 39/100 (39%), Gaps = 14/100 (14%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + + + + DA++ GG G S + LA + KP+ +CIA Sbjct: 56 IQEINPNDFDAIVYIGGNG-----SVIFFD------NHYALKLANDFFKQRKPVSSICIA 104 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 +L + T+ ID E L + GA + P++ Sbjct: 105 GVILANAGILKGKKATV--FIDGKEALIKGGAIYTGNPLE 142 >UniRef50_Q0SMN5 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis protein n=21 Tax=Borrelia RepID=Q0SMN5_BORAP Length = 184 Score = 44.2 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 30/64 (46%), Gaps = 10/64 (15%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 ++ + D +I+PGG A NL N +EL + + M+ GK + +C + Sbjct: 55 ISNCNENRFDLIILPGGMPGATNLFN----------SKELDLILKDMNAKGKFIAAICAS 104 Query: 138 PAML 141 P ++ Sbjct: 105 PVVV 108 >UniRef50_A1VCT8 Intracellular protease, PfpI family n=14 Tax=Bacteria RepID=A1VCT8_DESVV Length = 204 Score = 43.8 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 32/123 (26%), Positives = 49/123 (39%), Gaps = 27/123 (21%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 +A +AA+ D L++PGGF K D ++ L + MH AGK + +C A Sbjct: 92 IADMNAADFDLLVIPGGFAPDK-----------LRRDPKVLELTRQMHHAGKIVAHICHA 140 Query: 138 ---PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT----- 189 P + + T G D GA ++VVD N+I + Sbjct: 141 GWIPISAGIMRGYRCTSTPGIKDDLINA----GALWEN---SEVVVDR-NQISSRKPGDL 192 Query: 190 PAY 192 PA+ Sbjct: 193 PAF 195 >UniRef50_C8V2F0 ThiJ/PfpI family protein (AFU_orthologue; AFUA_3G01210) n=2 Tax=Emericella nidulans RepID=C8V2F0_EMENI Length = 933 Score = 43.8 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 14/109 (12%), Positives = 39/109 (35%), Gaps = 17/109 (15%) Query: 74 EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGF 133 ++ ++ + + +PGG +L + +++L + + H+ KP Sbjct: 787 KLNSISDDELKNFAGVFIPGGHAPLADLGD----------NKDLGRILEYFHKENKPTAA 836 Query: 134 MCIAP--AMLPKIFD-----FPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 +C P + K+ ++T ++ + + +G E Sbjct: 837 ICHGPYALLSTKVSGGEFAYKGYKITSWSNAEEKVMESMLGGEVEKVET 885 >UniRef50_Q15SH5 ThiJ/PfpI n=14 Tax=Gammaproteobacteria RepID=Q15SH5_PSEA6 Length = 224 Score = 43.8 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 11/76 (14%) Query: 74 EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGF 133 + + + A+ DA+ PGGFG LS+ A + + L ++ +AG +G Sbjct: 80 NTKKASDVNPADYDAVFYPGGFGL---LSDLADDEN-------VAKLTASIFEAGAVVGA 129 Query: 134 MCIAPA-MLPKIFDFP 148 +C PA +LP Sbjct: 130 VCHGPAGLLPIKLSNG 145 >UniRef50_A4XSU5 Intracellular protease, PfpI family n=11 Tax=cellular organisms RepID=A4XSU5_PSEMY Length = 186 Score = 43.8 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 10/64 (15%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L AA+ D L++PGG D + + L Q QA K + +C Sbjct: 67 LKGLSAADYDLLVIPGG----------TVNADTLRQDSDAQRLVQEFRQASKTVAAICHG 116 Query: 138 PAML 141 P +L Sbjct: 117 PWLL 120 >UniRef50_Q2NIP2 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme n=4 Tax=Candidatus Phytoplasma RepID=Q2NIP2_AYWBP Length = 180 Score = 43.8 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 25/99 (25%), Positives = 40/99 (40%), Gaps = 13/99 (13%) Query: 49 INHLTGEAMTETRNVLIE---AARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFA 105 +NH + + T N +E ++ + LA + E D L++PGG A+ + Sbjct: 23 LNHASLPLTSATLNPQLEVVSSSGLCVKANANLATINPLEYDFLVIPGGPYVAQIIEKET 82 Query: 106 SLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI 144 L L Q A K +G +C AP L K+ Sbjct: 83 F----------LLQLIQIFFDANKVIGAICAAPMFLGKL 111 >UniRef50_D1BGR7 Intracellular protease, PfpI family n=14 Tax=Actinomycetales RepID=D1BGR7_SANKS Length = 197 Score = 43.8 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 33/138 (23%), Positives = 53/138 (38%), Gaps = 21/138 (15%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G E E V+ L + +GA AP++ V+ L G+ R V EA R + Sbjct: 26 GVEQDELVVPLEHLRAAGAHVDVAAPERGTVET---LVGD-KDPGRPV--EADR----AL 75 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 L AD D L+VPGG ++ + A ++G+P+ +C Sbjct: 76 GDLTDADLDSYDLLLVPGG----------TINADALRLEEKAVAAVGTFARSGRPVAAIC 125 Query: 136 IAPAMLPKI-FDFPLRLT 152 P ++ + LT Sbjct: 126 HGPWLVVEAGLATGKTLT 143 >UniRef50_C0SMV9 Putative uncharacterized protein n=1 Tax=Streptomyces spiroverticillatus RepID=C0SMV9_9ACTO Length = 231 Score = 43.4 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 56/155 (36%), Gaps = 28/155 (18%) Query: 1 MKKIGVILSGCGVY---DGSE------IHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH 51 M K+ I+SG + DG+ E ++ +G + V P+ ++ Sbjct: 1 MAKVLFIVSGATYWVLKDGTRHATGYWAEEFANPYKILTDAGHEVVVATPNGV-TPTVDM 59 Query: 52 LTGEAMTETRN-------VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNF 104 ++ N +I +A + R L+ + DA+ +PGG G Sbjct: 60 MSLRPEMVGGNDSALELEAIIRSAEVMR-RPLQLSDVRLEDYDAVYLPGGHGPM------ 112 Query: 105 ASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPA 139 ++ D ++ L +G PL +C PA Sbjct: 113 ----ADLAWDADVGRLLTQQLTSGNPLFVVCHGPA 143 >UniRef50_B9L1B0 Protease I n=2 Tax=Thermomicrobia (class) RepID=B9L1B0_THERP Length = 179 Score = 43.4 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 35/129 (27%), Positives = 58/129 (44%), Gaps = 23/129 (17%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + + E DAL++PGG G+ +NL +D A +A ++GKP+ +C Sbjct: 60 IDEVTVEEFDALVIPGG-GSPENL----------RIDDRAVAFTRAFVESGKPVAAICHG 108 Query: 138 PAML--PKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVT-TPAYM- 193 P +L + T+ + ++ GA + VD+ VV + N I + PA + Sbjct: 109 PQLLISADVLRG---RTVTCVKKIRDDVKNAGAIY----VDEAVVIDGNLITSRVPADLP 161 Query: 194 -LAQNIAEA 201 Q IAEA Sbjct: 162 FFDQAIAEA 170 >UniRef50_C3WFG1 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme n=3 Tax=Fusobacterium RepID=C3WFG1_FUSMR Length = 183 Score = 43.4 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 60/142 (42%), Gaps = 32/142 (22%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V+L+ +G E+ EA+ + + R GA+ V + +T+ R Sbjct: 3 KKVYVLLA-----EGFELIEAMTPVDVLRRGGAEVVTVS----------------ITDNR 41 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 V A ++ L + D + D +I+PGG+ NL +E+ + Sbjct: 42 EV-TSAQKVPVISDTTLKEKDITDGDMIILPGGYPGYVNLGE----------SQEVGKVL 90 Query: 122 QAMHQAGKPLGFMCIAPAMLPK 143 + + K +G +C AP +L K Sbjct: 91 KYYVENNKFVGAICGAPTVLAK 112 >UniRef50_Q54MG7 Protein DJ-1 n=21 Tax=cellular organisms RepID=PARK7_DICDI Length = 205 Score = 43.4 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 8/61 (13%) Query: 84 AELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPK 143 E DAL +PGGF +N S + SE ++ L + GK + +C+A L K Sbjct: 74 DEFDALAIPGGF---ENYSFYEEAYSE-----DVSQLIRDFDSKGKHIASVCVAALALGK 125 Query: 144 I 144 Sbjct: 126 S 126 >UniRef50_A0Q9X2 Intracellular protease, PfpI family protein n=4 Tax=Bacteria RepID=A0Q9X2_MYCA1 Length = 180 Score = 43.4 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 54/171 (31%), Gaps = 35/171 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTE-- 59 KKI ++ + DG E E A+ +GA + ++ NH E Sbjct: 4 KKIAILAA-----DGVEKVELEQPAAALREAGAGVEVVSLQDGEIQARNH-DLEPAGTFT 57 Query: 60 -TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELK 118 R V A A + D L++PGG + +D Sbjct: 58 VDRKV---------------ADASVDDFDGLVLPGG----------TVNPDKLRLDDTAV 92 Query: 119 ALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAE 169 + + +GKP+ +C P L + T+ + L GA Sbjct: 93 SFVRDFVGSGKPVAAICHGPWTLVEA-GVAAGRTLTSYPSIRTDLRNAGAH 142 >UniRef50_Q5JGM7 Intracellular protease 1 n=22 Tax=cellular organisms RepID=PFPI_PYRKO Length = 166 Score = 43.4 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 53/148 (35%), Gaps = 40/148 (27%) Query: 68 ARITRGEI-----------RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 A RG+I + D E DAL++PGG ++ + Sbjct: 33 ASFQRGKITGKHGYTVNVDLAFDEVDPDEFDALVLPGG-----------RAPEIVRLNEK 81 Query: 117 LKALAQAMHQAGKPLGFMCIAP--AMLPKIFDFP---LRLTIGTDIDTAEVLEEMGAEHV 171 A+ + M + GKP+ +C P + + +TI D ++ GAE + Sbjct: 82 AVAITKKMFEDGKPVASICHGPQILISAGVLKGRKGTSTVTIRDD------VKNAGAEWI 135 Query: 172 PCPVDDIVVDEDNKIVTTP----AYMLA 195 ++VVD + P A+M Sbjct: 136 D---AEVVVDGNWVSSRHPGDLYAWMRE 160 >UniRef50_C2BGT7 Possible transcriptional regulator n=2 Tax=Anaerococcus RepID=C2BGT7_9FIRM Length = 193 Score = 43.4 bits (101), Expect = 0.006, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 10/67 (14%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L + + + D + +PGG A+ L + D + + + A K + +C Sbjct: 55 LEEINPEDYDGVYIPGGTKGAETLRD----------DDRVIEIVKKFEAANKLIAAICAG 104 Query: 138 PAMLPKI 144 P +L K Sbjct: 105 PIVLDKA 111 >UniRef50_Q1PX19 Similar to intracellular proteinase I n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PX19_9BACT Length = 182 Score = 43.0 bits (100), Expect = 0.006, Method: Composition-based stats. Identities = 34/145 (23%), Positives = 56/145 (38%), Gaps = 24/145 (16%) Query: 53 TGEAMTETRNVLIEAARITRGEIRP---LAQADAAELDALIVPGGFGAAKNLSNFASLGS 109 G A+T + L E+ + ++ P + DA+I GG G+ S Sbjct: 41 NGAAVTIASSSLKESKGMLGAKVSPDILFTDIAVGDYDAVIFVGGTGS-----------S 89 Query: 110 ECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGA 168 E + +A+ ++ K +G +CIAP L K + T T T ++ GA Sbjct: 90 EYWDNPTAHTIAKEANKVNKIVGAICIAPVTLAKAGLLKGKKAT--TYSSTVNDIKSEGA 147 Query: 169 EHVPCPVDDIVVDEDNKIVTT--PA 191 + V+ D I+T PA Sbjct: 148 NY-----TGEGVERDGNIITADGPA 167 >UniRef50_Q8LGH3 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis protein, putative n=23 Tax=Eukaryota RepID=Q8LGH3_ARATH Length = 438 Score = 43.0 bits (100), Expect = 0.006, Method: Composition-based stats. Identities = 32/127 (25%), Positives = 55/127 (43%), Gaps = 27/127 (21%) Query: 15 DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGE 74 DGSE EAV + + R+ A V A ++V+ + + + VL Sbjct: 266 DGSEEMEAVAIIDVLKRAKANVVVAALG-NSLEVVASRKVKLVAD---VL---------- 311 Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 L +A+ D +++PGG G A+ FAS L + + ++ KP G + Sbjct: 312 ---LDEAEKNLYDLIVLPGGLGGAE---AFASSEK-------LVNMLKKQAESNKPYGAI 358 Query: 135 CIAPAML 141 C +PA++ Sbjct: 359 CASPALV 365 >UniRef50_B5JP08 DJ-1 family protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JP08_9BACT Length = 187 Score = 43.0 bits (100), Expect = 0.007, Method: Composition-based stats. Identities = 29/130 (22%), Positives = 48/130 (36%), Gaps = 28/130 (21%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITR 72 V+DG E EA+ + + R+ + + + G RN + AA Sbjct: 12 VFDGIEEIEALTPVDILRRAEIKVTVAS-----------VNGLPTVTGRNQITFAAD--- 57 Query: 73 GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLG 132 + + D +I+PGG G + L N A + + A +A K L Sbjct: 58 ---TSITRVAEDSFDLVILPGGPGVLELLENQA-----------VSHILVAQDKAQKELA 103 Query: 133 FMCIAPAMLP 142 +C AP +L Sbjct: 104 AICAAPKVLA 113 >UniRef50_A7HFM3 ThiJ/PfpI domain protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HFM3_ANADF Length = 194 Score = 43.0 bits (100), Expect = 0.007, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 32/85 (37%), Gaps = 20/85 (23%) Query: 64 LIEAARITRGEIRPL----------AQADAAELDALIVPGGFGAAKNLSNFASLGSECTV 113 L EAA R E+ L A A+ DAL++PGG + Sbjct: 40 LAEAAEGARLEVHRLGRRVPDRRFPRSAVPADFDALLLPGG----------VLNPDRLRI 89 Query: 114 DRELKALAQAMHQAGKPLGFMCIAP 138 + A A++ A KP+ +C P Sbjct: 90 EPRAVAFAKSFFDAAKPVAAICHGP 114 >UniRef50_A0K1V5 ThiJ/PfpI domain protein n=6 Tax=Actinomycetales RepID=A0K1V5_ARTS2 Length = 233 Score = 43.0 bits (100), Expect = 0.007, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 47/131 (35%), Gaps = 30/131 (22%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 LA D + DA+++PGG G ++ D +L + +AGK + C Sbjct: 93 LADVDPSGYDAVVMPGGHGPM----------ADLYQDADLGRILAEADRAGKVIAPFCHG 142 Query: 138 PAMLPKIFD-------FPLRLTIGTDID-------------TAEVLEEMGAEHVPCPVDD 177 PA L D LT+ +D + +VL+E GA Sbjct: 143 PAGLLSAVDGDGKFAFAGRHLTVFSDDEELSGGTGPNTPWFVEDVLKEKGAIVENGAAWG 202 Query: 178 IVVDEDNKIVT 188 V D ++T Sbjct: 203 SNVVRDRNLIT 213 >UniRef50_Q2FT73 ThiJ/PfpI n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FT73_METHJ Length = 193 Score = 43.0 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 25/63 (39%), Gaps = 10/63 (15%) Query: 82 DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML 141 E DAL + GG GA + N + +L L + K +G + AP ++ Sbjct: 62 REDEFDALAILGGHGAQAHFWN----------NPDLLELVKIFRIHRKVIGAISTAPVVM 111 Query: 142 PKI 144 + Sbjct: 112 ARA 114 >UniRef50_Q29CB8 GA12322 n=5 Tax=Endopterygota RepID=Q29CB8_DROPS Length = 187 Score = 42.6 bits (99), Expect = 0.008, Method: Composition-based stats. Identities = 37/137 (27%), Positives = 60/137 (43%), Gaps = 22/137 (16%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 LA+A + D +++PGG G + + + A+ + L +A AG + +C Sbjct: 55 TSLAKAACDKFDVVVLPGGLGGSNAMGDSAA----------VGDLLRAQESAGGLIAAIC 104 Query: 136 IAPAMLPK-IFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT----P 190 AP +L K LT + E L + C VDD V +D ++T+ Sbjct: 105 AAPTVLAKHGIAAGKSLT--SYPSMKEQLVDK-----YCYVDDKSVVKDGNLITSRGPGT 157 Query: 191 AYMLAQNIAEAASGIDK 207 AY A IAE +G++K Sbjct: 158 AYDFALKIAEELAGLEK 174 >UniRef50_A3CWP5 Intracellular protease, PfpI family n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CWP5_METMJ Length = 171 Score = 42.6 bits (99), Expect = 0.009, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 40/97 (41%), Gaps = 11/97 (11%) Query: 80 QADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPA 139 AD + +++ GG G+ ++L L+ L ++ + GK + +C+AP Sbjct: 57 DADPDDYVGIVIVGGSGSEEHLWG----------SERLRDLVRSFFEQGKVVAAICLAPV 106 Query: 140 MLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 +L + T+ +++ GA + PV Sbjct: 107 VLARAGILAGRHATVYRSPAAVAEMKKAGANLLEIPV 143 >UniRef50_A8I9F1 Type 1 glutamine amidotransferase n=3 Tax=Alphaproteobacteria RepID=A8I9F1_AZOC5 Length = 290 Score = 42.6 bits (99), Expect = 0.009, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 49/141 (34%), Gaps = 21/141 (14%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVIN-------HLTGEAMTETRNVLIEAA 68 G ++E V+ +A+ +G V PD + I+ H G+ R A Sbjct: 51 GQYLNETVVPAMAVIAAGYDVVLATPDGTRPH-IDPASDSAVHFEGDEAAYGRAKAFYAE 109 Query: 69 RITRGEIRPLAQADAAELDA---LIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMH 125 +R L LDA + VPGG +L D ++ + + H Sbjct: 110 HPAMTNVRTLRAVIEEGLDAYAGVFVPGGQAPVVDLMQ----------DADVGFILRHFH 159 Query: 126 QAGKPLGFMCIAPAMLPKIFD 146 + KP +C P ++ Sbjct: 160 ERAKPTALLCHGPIVVAAAMP 180 >UniRef50_B9KCD0 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosphate synthesis protein n=2 Tax=Campylobacterales RepID=B9KCD0_CAMLR Length = 188 Score = 42.6 bits (99), Expect = 0.010, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 10/68 (14%) Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 LA D LDA+ + GGF NL N + + + Q +H K + +C Sbjct: 60 SLASVDQENLDAIALAGGFEGMMNLKN----------NSLIIKIIQDLHAKKKIVAAICA 109 Query: 137 APAMLPKI 144 +P +L K Sbjct: 110 SPMVLAKA 117 >UniRef50_Q9V1F8 Intracellular protease 1 n=6 Tax=cellular organisms RepID=PFPI_PYRAB Length = 166 Score = 42.6 bits (99), Expect = 0.010, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 55/155 (35%), Gaps = 34/155 (21%) Query: 57 MTETRNVLIEAARITRGEIR---------PLA--QADAAELDALIVPGGFGAAKNLSNFA 105 E VL+ A RG I LA + + E DAL++PGG Sbjct: 24 KEEGHEVLV--ASFKRGVITGKHGYTVNVDLAFEEVNPDEFDALVLPGG----------- 70 Query: 106 SLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEE 165 ++ + +A+ M GKP+ +C P +L + G + +++ Sbjct: 71 RAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQIL---ISAGVLR--GRRGTSYPGIKD 125 Query: 166 MGAEHVPCPVD-DIVVDEDNKIVTTP----AYMLA 195 VD ++VVD + P A+M Sbjct: 126 DMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMRE 160 >UniRef50_B1YY58 ThiJ/PfpI domain protein n=10 Tax=Proteobacteria RepID=B1YY58_BURA4 Length = 280 Score = 42.6 bits (99), Expect = 0.010, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 57/174 (32%), Gaps = 34/174 (19%) Query: 2 KKIGVILSGCGVYD---------GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVIN-- 50 + V+LS D G ++E + + A+ +G Q FA + + ++ Sbjct: 23 ANVLVVLSDSDHLDLKDGKVFATGFYLNELMQPVKALLDAGHQVA-FATPEGKAPTMDTT 81 Query: 51 -----HLTGEAMT--ETRNVLIEAARITRGE-----IRPLAQADAAELDALIVPGGFGAA 98 + G+ + R +L + +R + + Q DA+ VPGG Sbjct: 82 SADKMYFNGDEKAMRDYRALLDKLQITSRVHSPVISLSRVEQIGYGHFDAVYVPGGHAPM 141 Query: 99 KNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLT 152 ++L + + L H GK +C P L T Sbjct: 142 QDLLS----------SPAVGRLLADFHARGKTTALVCHGPIALLSTLPDAPGFT 185 >UniRef50_O06006 Putative cysteine protease yraA n=81 Tax=cellular organisms RepID=YRAA_BACSU Length = 169 Score = 42.2 bits (98), Expect = 0.011, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 44/113 (38%), Gaps = 19/113 (16%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 ++ DA++ DAL++PGGF D A+A + KP+ +C Sbjct: 57 ISDVDASDFDALLIPGGF-----------SPDLLRADDRPGEFAKAFVENKKPVFAICHG 105 Query: 138 P-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 P ++ +T G +++ GA + D V + IVT+ Sbjct: 106 PQVLIDTDLLKGKDIT-GYRSIRKDLINA-GANY-----KDAEVVVSHNIVTS 151 >UniRef50_C2AV24 Putative intracellular protease/amidase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2AV24_TSUPA Length = 230 Score = 42.2 bits (98), Expect = 0.011, Method: Composition-based stats. Identities = 35/167 (20%), Positives = 58/167 (34%), Gaps = 40/167 (23%) Query: 68 ARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQA 127 ARI L DA++ DA+ + GG G + + D L L + +A Sbjct: 80 ARIA--NTPSLNDIDASDFDAIYLTGGHGVMFDFPD----------DARLATLLREFDEA 127 Query: 128 GKPLGFMC-------------IAPAMLPKI---FDFPLRLTIGTDI----DTAEVLEEMG 167 GK + +C AP + + F + + G D + E + E G Sbjct: 128 GKVVSAVCHGTAGLLGATKADGAPLIAGRRISGFSWNEEVLAGLDAIVPFNLEERIAERG 187 Query: 168 AEHVPCPVDDIVVD-EDNKIVT--TPAYMLAQNIAEAASGIDKLVSR 211 A ++ D +VT PA + A G+ L+ + Sbjct: 188 ATYIEADEAWAPFAVTDGNVVTGQNPA-----SAHPVAQGVLTLLDK 229 >UniRef50_Q8RHW4 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme n=10 Tax=Fusobacterium RepID=Q8RHW4_FUSNN Length = 200 Score = 42.2 bits (98), Expect = 0.011, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 42/105 (40%), Gaps = 10/105 (9%) Query: 85 ELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI 144 E DAL++PGGFG A NF + K L + + K + +C A L + Sbjct: 73 EYDALVIPGGFGKA----NFFKDND----NEIFKKLIKYFSENNKVIVAICSAVINLLET 124 Query: 145 FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 + +D ++ ++ ++IV+D N + T Sbjct: 125 TYIRDKKVTTYLLDNKRYFNQLKNYNIIPVEEEIVID--NNLFTC 167 >UniRef50_Q72HB0 Putative amidotransferase n=2 Tax=Thermus RepID=Q72HB0_THET2 Length = 166 Score = 42.2 bits (98), Expect = 0.011, Method: Composition-based stats. Identities = 30/121 (24%), Positives = 50/121 (41%), Gaps = 33/121 (27%) Query: 86 LDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP-AMLPKI 144 L+ L++PGGF ++ E+ AL + + + GKPLG +C P ++ Sbjct: 61 LEGLLIPGGFAP-----DYLRR------SPEVLALVRKVAEEGKPLGAICHGPWVLVSAG 109 Query: 145 FDFPLRLT----IGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT------PAYML 194 ++T I D++ A L + +VVD +VT PA+M Sbjct: 110 LVRGRKVTGFFSIRDDLENAGGLYR---------EEGVVVD--GNLVTAQGPKDLPAFMR 158 Query: 195 A 195 A Sbjct: 159 A 159 >UniRef50_Q488G4 DJ-1/PfpI family protein n=14 Tax=Bacteria RepID=Q488G4_COLP3 Length = 207 Score = 42.2 bits (98), Expect = 0.011, Method: Composition-based stats. Identities = 26/115 (22%), Positives = 43/115 (37%), Gaps = 15/115 (13%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L D A+ DA+ +PGGF S F + + ++ GK + +C++ Sbjct: 60 LQDIDLADYDAIAIPGGFEP----SGFYVD----ALSEPFIKAIKYFNEQGKTIASVCVS 111 Query: 138 PAML--PKIFDFPLRLTIGT-DIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 L I T + LEE GA + P +V + + I +T Sbjct: 112 SIALGNAGILTGKKATTYHQVGGKRKQQLEESGAIFIDRP----IVQDQHIITST 162 >UniRef50_C8PUR4 Intracellular protease 1 n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PUR4_9GAMM Length = 176 Score = 42.2 bits (98), Expect = 0.011, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 19/102 (18%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTV---DRELKALAQAMHQAGKPLGFM 134 + +A+AA+ DA+++PGG G+ V + + +A+ +A GKP+ + Sbjct: 60 IGEANAADYDAIVLPGG-------------GANADVLRANTDAQAMVKAFMNVGKPVAAI 106 Query: 135 CIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 C AP + +LT A L+ GA+ V Sbjct: 107 CHAPWIFADTEIARGKKLT--AYKTIATDLKNAGAQFEDKSV 146 >UniRef50_Q4V0N9 NonF-related protein n=5 Tax=cellular organisms RepID=Q4V0N9_XANC8 Length = 225 Score = 42.2 bits (98), Expect = 0.012, Method: Composition-based stats. Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 13/101 (12%) Query: 68 ARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQA 127 R+ L++ D + DA+ VPGG G ++S +R+++AL + +A Sbjct: 76 GRMAHSR--KLSEVDVRDYDAVFVPGGLGPMVDVSG----------NRDVQALIKQAWEA 123 Query: 128 GKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMG 167 + +C P A+L D L G + EE G Sbjct: 124 DMLVAAVCHGPSALLGITLDDGTALVQGRRVTGFSTAEEDG 164 >UniRef50_Q0C397 Intracellular protease, PfpI family n=6 Tax=Alphaproteobacteria RepID=Q0C397_HYPNA Length = 208 Score = 42.2 bits (98), Expect = 0.012, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 46/120 (38%), Gaps = 22/120 (18%) Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 +Q + DALI+PGG + + A A + KP+ +C P Sbjct: 95 SQVHVDDFDALIIPGG----------TVGSDKLRASLDAVAFVSAFFRQSKPVAAICHGP 144 Query: 139 AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT------PAY 192 ++ + + + ++ G + V +++VVD N ++T+ PA+ Sbjct: 145 WLIVEA-GAAKGRKLTSYSSLRTDIQNAGGDWVD---EEVVVD--NGLITSRSPDDLPAF 198 >UniRef50_Q5KKT1 Putative uncharacterized protein n=3 Tax=Agaricomycotina RepID=Q5KKT1_CRYNE Length = 233 Score = 42.2 bits (98), Expect = 0.012, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 25/65 (38%), Gaps = 10/65 (15%) Query: 74 EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGF 133 + + AA+ +A+ V GG G L + A L + + A KP+ Sbjct: 84 NTKKVEDVKAADYEAMFVIGGHGP---LIDLAKSEK-------FAKLVEDFYVAKKPVSA 133 Query: 134 MCIAP 138 +C P Sbjct: 134 VCHGP 138 >UniRef50_O28987 Uncharacterized protein AF_1281 n=11 Tax=cellular organisms RepID=Y1281_ARCFU Length = 168 Score = 41.9 bits (97), Expect = 0.014, Method: Composition-based stats. Identities = 21/110 (19%), Positives = 39/110 (35%), Gaps = 17/110 (15%) Query: 80 QADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPA 139 + L++PGG ++ + + + GKP+ +C P Sbjct: 56 DVKVEDYAGLVIPGG-----------KSPERVRINERAVEIVKDFLELGKPVAAICHGPQ 104 Query: 140 MLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 +L R + + I + L GA + PV VVD ++T+ Sbjct: 105 LLISAMAVKGRR-MTSWIGIRDDLIAAGALYEDRPV---VVD--GNVITS 148 >UniRef50_C7PM17 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=C7PM17_CHIPD Length = 223 Score = 41.9 bits (97), Expect = 0.015, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 48/134 (35%), Gaps = 16/134 (11%) Query: 33 GAQAVC-FAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIV 91 A FA V +++LT + + + R L+ D + DA+ V Sbjct: 38 DAHYQIDFASITGGVPAVDNLTASEESSNARFIKDGGLAKMQHNRKLSDVDTSGYDAVFV 97 Query: 92 PGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRL 151 PGG ++ D LK + + +GK +G +C P L +RL Sbjct: 98 PGGLAPMVDMPE----------DPLLKKVIAGFYDSGKIVGAVCHGPVSLLN-----VRL 142 Query: 152 TIGTDIDTAEVLEE 165 G+ + + + Sbjct: 143 NDGSYLIAGKNITS 156 >UniRef50_A5TUD8 Possible transcriptional regulator n=10 Tax=Fusobacterium RepID=A5TUD8_FUSNP Length = 184 Score = 41.9 bits (97), Expect = 0.015, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 49/139 (35%), Gaps = 26/139 (18%) Query: 15 DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGE 74 DG E E + + R GA+ + + K + +N++ + + Sbjct: 9 DGFEPLEVFAPVDVLKRCGAEVIMVSTGKDLF-------VASSGSQKNII-------KAD 54 Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 + L+ D D +I+PGG+ NL ++E+ + + K + + Sbjct: 55 VM-LSDIDYKAADLVIIPGGYPGYVNLRE----------NKEVVDIVKYFLDNDKYVASI 103 Query: 135 CIAPAMLP-KIFDFPLRLT 152 C P + +LT Sbjct: 104 CGGPTIFSYNNLANGTKLT 122 >UniRef50_B8I402 ThiJ/PfpI domain protein n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I402_CLOCE Length = 192 Score = 41.9 bits (97), Expect = 0.015, Method: Composition-based stats. Identities = 17/70 (24%), Positives = 28/70 (40%), Gaps = 14/70 (20%) Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 + +A+A A E + ++PGG+ V E+ L A + GK + + Sbjct: 53 VITVAEAKADEYEGFLIPGGW--------------NPVVKVEILDLINAFYSGGKLIAAI 98 Query: 135 CIAPAMLPKI 144 C P L K Sbjct: 99 CAGPRYLAKA 108 >UniRef50_C7Z460 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7Z460_NECH7 Length = 233 Score = 41.9 bits (97), Expect = 0.017, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 25/63 (39%), Gaps = 10/63 (15%) Query: 82 DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML 141 +E DA+ PGG G +L + + + + + +H K + +C PA L Sbjct: 94 RTSEFDAVFYPGGHGPMFDLIS----------NPDSLRILRDLHAEDKVIAAVCHGPAAL 143 Query: 142 PKI 144 Sbjct: 144 VNA 146 >UniRef50_B1ZX06 ThiJ/PfpI domain protein n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZX06_OPITP Length = 178 Score = 41.9 bits (97), Expect = 0.018, Method: Composition-based stats. Identities = 33/124 (26%), Positives = 48/124 (38%), Gaps = 19/124 (15%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L A A DA++ GG GA + E LA+ K +G +C+A Sbjct: 62 LKDAQADNFDAVVFVGGMGARRLYHE-----------PEALRLARQAVAHNKVVGAICLA 110 Query: 138 PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQN 197 P +L + + + TA E AE V V + VV D +I+T A Sbjct: 111 PNILAQ---AGV---LKNRRATAWAFEVPAAEAVN-QVGESVV-RDGRIITANGPQAATP 162 Query: 198 IAEA 201 A+A Sbjct: 163 FAQA 166 >UniRef50_Q9M8R4 F13E7.34 protein n=273 Tax=cellular organisms RepID=Q9M8R4_ARATH Length = 388 Score = 41.5 bits (96), Expect = 0.018, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 37/117 (31%), Gaps = 13/117 (11%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 ++ DAL++PGG ++ + + + + KP+ +C Sbjct: 265 TNFDDLVSSSYDALVIPGG-----------RAPEYLALNEHVLNIVKEFMNSEKPVASIC 313 Query: 136 IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAY 192 +L R V+ G P P+D D +VT A+ Sbjct: 314 HGQQILAAAGVLKGRKCTAYPAVKLNVVLGGGTWLEPDPIDRCFTD--GNLVTGAAW 368 Score = 41.5 bits (96), Expect = 0.019, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 65/211 (30%), Gaps = 33/211 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAP-----DKQQVDVINHLTGE 55 M +L CG D E +E ++ A+ G P D V + + Sbjct: 1 MANSRTVLILCG--DYMEDYEVMVPFQALQAFGITVHTVCPGKKAGDSCPTAVHDFCGHQ 58 Query: 56 AMTETR--NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTV 113 E+R N + A + D ++ D L++PGG + Sbjct: 59 TYFESRGHNFTLNAT---------FDEVDLSKYDGLVIPGG-----------RAPEYLAL 98 Query: 114 DRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAE-HVP 172 + L + ++GKP+ +C +L R L GA+ P Sbjct: 99 TASVVELVKEFSRSGKPIASICHGQLILAAADTVNGRKCTA-YATVGPSLVAAGAKWVEP 157 Query: 173 CPVDDIVVDEDNKIVTTPAYMLAQNIAEAAS 203 D VVD ++T Y + Sbjct: 158 ITPDVCVVD--GSLITAATYEGHPEFIQLFV 186 >UniRef50_B0NJL3 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NJL3_EUBSP Length = 187 Score = 41.5 bits (96), Expect = 0.018, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 56/157 (35%), Gaps = 41/157 (26%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQV----DVINHLTGEA 56 MKK+ VIL+ DG E EA+ + + R+ + ++ IN T + Sbjct: 4 MKKVSVILA-----DGFEEIEALTAVDLLRRAQIYVDTVSITEEYTVHGAHGINVQTEDL 58 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 E +E+ D +++PGG NL Sbjct: 59 FEEVN--FVES-------------------DMIVLPGGMPGTLNL----------DAHSG 87 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLT 152 ++ + + + GK +G +C P +L + R+T Sbjct: 88 VRRVVKDFFEEGKYIGAICAGPTVLANLGLLKGKRIT 124 >UniRef50_C5AII4 Intracellular protease, PfpI family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AII4_BURGB Length = 196 Score = 41.5 bits (96), Expect = 0.018, Method: Composition-based stats. Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 10/56 (17%) Query: 80 QADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 QA A + DA+++PGG G + + A HQA KP+ +C Sbjct: 67 QAKAGDYDAVVLPGG----------VVNGDAIRMIPAAREFVVAAHQADKPIAVIC 112 >UniRef50_C5RPJ4 DJ-1 family protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RPJ4_CLOCL Length = 187 Score = 41.5 bits (96), Expect = 0.022, Method: Composition-based stats. Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 10/69 (14%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 + + + D +++PGG + NL D + L + + K +G +C Sbjct: 53 KDFDLVNFKDYDIIVLPGGMPGSTNL----------RADDRVINLVKDFNNKNKFIGAIC 102 Query: 136 IAPAMLPKI 144 AP +L K Sbjct: 103 AAPIVLEKA 111 >UniRef50_C5PN70 C56 family peptidase n=2 Tax=Sphingobacterium spiritivorum RepID=C5PN70_9SPHI Length = 177 Score = 41.5 bits (96), Expect = 0.022, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 45/136 (33%), Gaps = 30/136 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAP-DKQQVDVINHLTGEAMTE 59 M KI ++ + DG + E + + G Q +P D + V NH Sbjct: 1 MNKIAILAA-----DGFKEIELKSPKIYLQNKGFQVDIVSPKDIEFVRSWNHFDWGPSYP 55 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 +V LA+AD A +ALI+PGG V + Sbjct: 56 I-DVH-------------LAEADPAVYEALILPGG----------TLSPDALRVLPKALD 91 Query: 120 LAQAMHQAGKPLGFMC 135 + + K + +C Sbjct: 92 FIKHFIEQKKLIAAIC 107 >UniRef50_P45470 Protein yhbO n=95 Tax=Bacteria RepID=YHBO_ECOLI Length = 172 Score = 41.5 bits (96), Expect = 0.022, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 46/118 (38%), Gaps = 20/118 (16%) Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 + + + AE DAL++PGG ++ D + +GKP+ +C Sbjct: 56 KSIDEVTPAEFDALLLPGGHSP-----DYLRG------DNRFVTFTRDFVNSGKPVFAIC 104 Query: 136 IAPAMLPKIFDFPLRLTIGTDIDTAE--VLEEMGAEHVPCPVDDIVVDEDNKIVT-TP 190 P +L + G + + +++ A ++VVD+D + + TP Sbjct: 105 HGPQLL-----ISADVIRGRKLTAVKPIIIDVKNAGAEFYD-QEVVVDKDQLVTSRTP 156 >UniRef50_D0BJF4 DJ-1 family protein n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BJF4_9LACT Length = 183 Score = 41.5 bits (96), Expect = 0.022, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 51/141 (36%), Gaps = 33/141 (23%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK+ ++L+ G E EA+ + + R+G + D + ++LT E Sbjct: 1 MKRAAIVLT-----TGFEEIEAIAPMDILRRAGVEVDIVGVDAKVATGSHNLTISTDKEL 55 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 V+ E D +I+PGG AK L N + ++ Sbjct: 56 VEVMNEL------------------YDIVILPGGMPGAKLLKN----------HQAVQDF 87 Query: 121 AQAMHQAGKPLGFMCIAPAML 141 + + GK + C AP + Sbjct: 88 VKRHYDVGKLVAANCAAPIAI 108 >UniRef50_Q7MWH8 ThiJ/PfpI family protein n=2 Tax=Porphyromonas gingivalis RepID=Q7MWH8_PORGI Length = 181 Score = 41.5 bits (96), Expect = 0.023, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 54/141 (38%), Gaps = 35/141 (24%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK ++ G E EAV TL + R G A + +T E Sbjct: 1 MKKTALVFLAPGF----EETEAVGTLDILRRGGVVAEFVS-----------ITDSLYVEG 45 Query: 61 RN-VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 N + ++A R+ + L DAL++PGG A NL + C L+ Sbjct: 46 ANGITVKADRL----MTDLPTV-----DALVLPGGLPGADNL-------NSCE---PLRR 86 Query: 120 LAQAMHQAGKPLGFMCIAPAM 140 L + A K + +C AP + Sbjct: 87 LLSEHYAAQKLVAAICAAPLV 107 >UniRef50_D1PPH9 ThiJ/PfpI family protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PPH9_9FIRM Length = 184 Score = 41.1 bits (95), Expect = 0.025, Method: Composition-based stats. Identities = 29/127 (22%), Positives = 46/127 (36%), Gaps = 27/127 (21%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G E E +L + + R+G + A Q +H N++ +A Sbjct: 11 GLEECEGLLCVDLLRRAGVEVTIAAVGGSQTVTSSHHV--------NIVADAL------- 55 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 Q D + DA I+PGG NL D ++ + Q AGK + +C Sbjct: 56 --AEQVDYSAYDACILPGGIPGVNNL----------KADATVRKVCQDYAAAGKTVAAIC 103 Query: 136 IAPAMLP 142 P +L Sbjct: 104 AGPTVLA 110 >UniRef50_C7MB29 Intracellular protease, PfpI family n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MB29_BRAFD Length = 184 Score = 41.1 bits (95), Expect = 0.025, Method: Composition-based stats. Identities = 33/138 (23%), Positives = 53/138 (38%), Gaps = 24/138 (17%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G+E E L A+ +GAQ + AP+ G T R+ A + + Sbjct: 18 GTETDEIQHPLAALREAGAQVIVAAPEA----------GSVATLQRD-REPGADVPVDTV 66 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 A ++DAL++PGG D + L +++ AGKP+ +C Sbjct: 67 YD--TVKAKDVDALVLPGG----------TLNADTLRADETAQFLVRSVAAAGKPVAAIC 114 Query: 136 IAPAMLPKI-FDFPLRLT 152 AP +L + LT Sbjct: 115 HAPWLLVETGLANGRTLT 132 >UniRef50_C9KP75 ThiJ/PfpI family protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KP75_9FIRM Length = 289 Score = 41.1 bits (95), Expect = 0.028, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 27/76 (35%), Gaps = 10/76 (13%) Query: 74 EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGF 133 +I+ A+ VPGG +L D++ + + H+ GKP F Sbjct: 121 KIKDAIAEGLDNYAAVYVPGGHAPMNDLMQ----------DKDFGKVLRYFHEKGKPTAF 170 Query: 134 MCIAPAMLPKIFDFPL 149 +C P D P Sbjct: 171 LCHGPIASLAALDDPA 186 >UniRef50_B7AW87 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B7AW87_9BACE Length = 186 Score = 41.1 bits (95), Expect = 0.029, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 53/130 (40%), Gaps = 28/130 (21%) Query: 15 DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGE 74 DG E EA+ + + R+G + I+ + G + +N++++A + Sbjct: 14 DGFETVEALAVVDVLRRAGMNV----------ETISLMDGLEVKSAQNIIVKADK----- 58 Query: 75 IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFM 134 A D A+ D +PGG G KN E + ++ G+ + + Sbjct: 59 --EFAGYDFADTDVFFLPGGPGT-KNYET----------KPEFIDVIANAYKEGRLITAI 105 Query: 135 CIAPAMLPKI 144 C AP++L K+ Sbjct: 106 CAAPSVLGKM 115 >UniRef50_B0N2E2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0N2E2_9FIRM Length = 186 Score = 41.1 bits (95), Expect = 0.029, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 53/144 (36%), Gaps = 34/144 (23%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M+K V+ V DG E E V + + R+G + F +Q V + Sbjct: 4 MRKAAVL-----VVDGYEESETVTIVDLLRRAGIECHTFGFAEQYVRGM----------- 47 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + ++I+ +I EI D L++PGG G + + + Sbjct: 48 QGMMIKVDKIFSDEI--------KNYDMLVLPGG----------RPGGVNLGANPLVIEM 89 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI 144 Q ++ GK L +C +L K Sbjct: 90 VQYYNENGKYLAAICSGTIVLSKA 113 >UniRef50_A6CIJ2 Predicted intracellular protease/amidase, ThiJ/PfpL family protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIJ2_9BACI Length = 225 Score = 41.1 bits (95), Expect = 0.029, Method: Composition-based stats. Identities = 34/166 (20%), Positives = 61/166 (36%), Gaps = 25/166 (15%) Query: 1 MKKIGVILSGCGVYD-------GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLT 53 M K+ +LS G D G E L A+ ++G Q +P V++ ++ Sbjct: 1 MAKVLAVLS-SGYKDEENNYETGWWGEELFAPLEALEKAGHQVDIASP-LGGKPVVDQVS 58 Query: 54 GEAMTETR---NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSE 110 + L E+ R E + L + A E D ++V GG GA + + A Sbjct: 59 FLPDYDPEGTYKALYESGRA--DETQKLTEVLAEEYDVVLVVGGHGA---MYDLAKDE-- 111 Query: 111 CTVDRELKALAQAMHQAGKPLGFMCIAPA-MLPKIFDFPLRLTIGT 155 +L + ++ G + C PA ++ + + G Sbjct: 112 -----DLHRIINTVYDNGGIVAAECHGPAPLIWTLRPDGKSIIEGK 152 >UniRef50_B0SGM8 ThiJ/PfpI family intracellular protease n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SGM8_LEPBA Length = 308 Score = 40.7 bits (94), Expect = 0.031, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 10/60 (16%) Query: 82 DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML 141 D ++VPGG G ++ D ++ L + H+ KP+G +C APA+L Sbjct: 155 DDQNFVGILVPGGQGLM----------TDLLYDEKIPELLRRFHKKQKPIGIVCHAPALL 204 >UniRef50_Q11NL5 Putative uncharacterized protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11NL5_CYTH3 Length = 175 Score = 40.7 bits (94), Expect = 0.032, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 43/112 (38%), Gaps = 13/112 (11%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L + DA D + GG G+A+ L + + ++ + KP+ +C A Sbjct: 56 LEEIDARTFDGIYFVGGAGSAQYLQDEIAK-----------SVFNSFLHLNKPIAAICAA 104 Query: 138 PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 P L K + G D D + +M AEH + V D I+T Sbjct: 105 PRNLLKWDMLKNKRATGFDAD--GIFSKMAAEHGAIALPQEKVVTDGLILTA 154 >UniRef50_A4XSL6 CheA signal transduction histidine kinase n=1 Tax=Pseudomonas mendocina ymp RepID=A4XSL6_PSEMY Length = 739 Score = 40.7 bits (94), Expect = 0.036, Method: Composition-based stats. Identities = 29/125 (23%), Positives = 46/125 (36%), Gaps = 10/125 (8%) Query: 55 EAMTETRNVLIEAAR---ITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSEC 111 E + R + E R I RG I A ++ AL+ GF A ++S+ + G Sbjct: 494 EVADDGRGLNTERIRQKAIDRGLIDAQANLAEPDIHALVFAAGFSTADSVSDLSGRGVGM 553 Query: 112 TVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFD--FPLRLTIGTD-----IDTAEVLE 164 V R + + + LG C LP +++G + +D Sbjct: 554 DVVRSVIDGLRGSIEIDSVLGAGCTFRIRLPLTLAIIDGFLISLGDEYFVVPLDMVTECL 613 Query: 165 EMGAE 169 EM AE Sbjct: 614 EMDAE 618 >UniRef50_C5F0X7 Putative uncharacterized protein n=2 Tax=Helicobacter RepID=C5F0X7_9HELI Length = 183 Score = 40.3 bits (93), Expect = 0.046, Method: Composition-based stats. Identities = 35/144 (24%), Positives = 62/144 (43%), Gaps = 32/144 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M KI V L G E EA+ + + R+G Q + A K ++V++ + + + Sbjct: 1 MVKILVPL-----GKGFEELEAISIIDVLRRAGCQVII-ASLKDNLEVLSQGGVKIIAD- 53 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +++ +A ++DA++ PGG+ +NL EC +EL+ L Sbjct: 54 ---------------VDVSKVEALKIDAVVFPGGWEGTENL-------IEC---KELREL 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI 144 M K + +C AP L K+ Sbjct: 89 VLEMDSQRKIIAAICAAPYALFKM 112 >UniRef50_D2RN75 ThiJ/PfpI domain protein n=2 Tax=Veillonellaceae RepID=D2RN75_ACIFE Length = 274 Score = 40.3 bits (93), Expect = 0.046, Method: Composition-based stats. Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 10/52 (19%) Query: 84 AELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 DAL +PGG +L +++L + + H KP +C Sbjct: 119 DRYDALFIPGGHAPMGSLME----------NQDLGTILKDFHAKEKPTALIC 160 >UniRef50_C0WAN6 Glutamine amidotransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WAN6_9FIRM Length = 246 Score = 40.3 bits (93), Expect = 0.048, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 33/81 (40%), Gaps = 8/81 (9%) Query: 77 PLAQADAAEL----DALIVPGGFGAAKNL----SNFASLGSECTVDRELKALAQAMHQAG 128 + A + DAL +PGG L + G++ +DR L +A ++AG Sbjct: 49 DVEGASGKDYVDLADALFLPGGQDVDPTLFGEEPTWKVGGADYKMDRFEIDLIRAFYEAG 108 Query: 129 KPLGFMCIAPAMLPKIFDFPL 149 KP+ +C +L + Sbjct: 109 KPIFGICRGIQVLNIALGGTV 129 >UniRef50_Q051B9 Transcription regulator, DJ-1/PfpI family intracellular protease n=4 Tax=Leptospira RepID=Q051B9_LEPBL Length = 178 Score = 40.3 bits (93), Expect = 0.050, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 10/64 (15%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L + + D +++PG G K L D ++ Q + K +G +C A Sbjct: 52 LDAVNLKDFDMIVLPGRAGGTKVL----------GADPKIADFLQEAKKENKWIGAICAA 101 Query: 138 PAML 141 P++L Sbjct: 102 PSIL 105 >UniRef50_A7NLT5 ThiJ/PfpI domain protein n=7 Tax=Bacteria RepID=A7NLT5_ROSCS Length = 174 Score = 40.3 bits (93), Expect = 0.051, Method: Composition-based stats. Identities = 34/134 (25%), Positives = 54/134 (40%), Gaps = 34/134 (25%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I ++L+ +G E E + ++ + GA+ V D + V N L T Sbjct: 6 KRIAILLA-----EGVEDLEFYVPMMRLQEEGAEVVAAGLDLRPVRGKNGLEITPTTT-- 58 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 +A+ A +L AL++PGG+ K R + L Sbjct: 59 ----------------IAELRADDLFALVLPGGWAPDK-----------LRRYRAVTDLV 91 Query: 122 QAMHQAGKPLGFMC 135 QAMH AGK +G +C Sbjct: 92 QAMHAAGKVIGIIC 105 >UniRef50_D1VVY7 DJ-1 family protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVY7_9FIRM Length = 193 Score = 40.3 bits (93), Expect = 0.051, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 46/141 (32%), Gaps = 32/141 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK + V L+ DG E EA+ + + R G + I + Sbjct: 1 MKDLLVFLA-----DGFEEVEALSVVDILRRGGLSVDTCS--------IKDSKKVTSSHQ 47 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 VL + + A +PGG A NL N DR + + Sbjct: 48 VTVLAD---------VHIDDIKIDNYKACYIPGGQPGATNLQN----------DRRIIQI 88 Query: 121 AQAMHQAGKPLGFMCIAPAML 141 + + GK + +C P +L Sbjct: 89 VEMFKEQGKLVAAICAGPQVL 109 >UniRef50_B2B3G0 Predicted CDS Pa_6_120 n=5 Tax=Dikarya RepID=B2B3G0_PODAN Length = 293 Score = 40.3 bits (93), Expect = 0.051, Method: Composition-based stats. Identities = 23/86 (26%), Positives = 32/86 (37%), Gaps = 11/86 (12%) Query: 84 AELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPK 143 DAL PGG G F G D+E + + + +AGK + +C PA L Sbjct: 160 DSYDALFFPGGHGPM-----FDLAG-----DKESQEIVKRFWEAGKIVSAVCHGPAALVN 209 Query: 144 I-FDFPLRLTIGTDIDTAEVLEEMGA 168 + L G + EE G Sbjct: 210 VKLSNGDYLLKGKKVTAFSNSEEDGV 235 >UniRef50_Q10356 Uncharacterized protein C22E12.03c n=2 Tax=Schizosaccharomyces RepID=YDB3_SCHPO Length = 191 Score = 40.3 bits (93), Expect = 0.051, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 23/132 (17%) Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 +R+V + A R + EI A + D I+PGG AK LS ++ Sbjct: 47 SRDVEMYANR-SYKEIPSADDF-AKQYDIAIIPGGGLGAKTLST----------TPFVQQ 94 Query: 120 LAQAMHQAGKP---LGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 + + + KP +G +C A + K P + G LEE G ++ +D Sbjct: 95 VVKEFY--KKPNKWIGMIC-AGTLTAKTSGLPNKQITGH-PSVRGQLEEGGYKY----LD 146 Query: 177 DIVVDEDNKIVT 188 VV E+N I + Sbjct: 147 QPVVLEENLITS 158 >UniRef50_C7NZR3 ThiJ/PfpI domain protein n=10 Tax=Halobacteriaceae RepID=C7NZR3_HALMD Length = 228 Score = 39.9 bits (92), Expect = 0.056, Method: Composition-based stats. Identities = 30/103 (29%), Positives = 42/103 (40%), Gaps = 12/103 (11%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLT------G 54 M I+S G + G E E L ++ +G + P VI+ + G Sbjct: 1 MTTALFIVSEEGYW-GEECIEP---LTTLTEAGVEVTVATPTGN-PPVIDERSIDPEEVG 55 Query: 55 EAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGA 97 E E + E+ I LA ADA E DA++ PGG GA Sbjct: 56 EETAERVTSVAESDDRLNDPIA-LADADAQEYDAVVFPGGHGA 97 >UniRef50_A4WTY0 Intracellular protease, PfpI family n=7 Tax=Bacteria RepID=A4WTY0_RHOS5 Length = 189 Score = 39.9 bits (92), Expect = 0.064, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 40/106 (37%), Gaps = 16/106 (15%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 ++ A+ D L++PGG + E A + +AGKP+ +C Sbjct: 65 VSDVSASGFDGLVIPGG----------TVGADKIRGSAEAVAFVRGFFEAGKPVAAICHG 114 Query: 138 P-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 P A++ RLT + A + G ++VVD+ Sbjct: 115 PWALVEAGVLEGRRLT--SFPSLATDIRNAGGHWTD---AEVVVDQ 155 >UniRef50_C8W7V5 DJ-1 family protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W7V5_ATOPD Length = 190 Score = 39.9 bits (92), Expect = 0.064, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 16/105 (15%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 L+Q + L++PGG NL L A G+ L +C A Sbjct: 62 LSQISFDDYSMLVLPGGLPGTTNL----------EACEPLMQAVDAFAADGRALAAICAA 111 Query: 138 PAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 P++ K + T ++ L E GA+ D + VD Sbjct: 112 PSIYAKRGLLQGKKAT--SNPGFQHFLSENGAKLTK---DAVCVD 151 >UniRef50_C4ZGF1 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosphate synthesis protein n=2 Tax=Clostridiales RepID=C4ZGF1_EUBR3 Length = 181 Score = 39.5 bits (91), Expect = 0.069, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 10/63 (15%) Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 AQAD DA+++PGG NL D + + GK + +C AP Sbjct: 57 AQADFDSYDAIVLPGGMPGTLNL----------GADETVVKTIKRFAAEGKLVAAICAAP 106 Query: 139 AML 141 ++L Sbjct: 107 SVL 109 >UniRef50_D1QMP0 ThiJ/PfpI family protein n=3 Tax=Prevotella RepID=D1QMP0_9BACT Length = 191 Score = 39.5 bits (91), Expect = 0.069, Method: Composition-based stats. Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 10/62 (16%) Query: 81 ADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAM 140 AD + D L++PGG A NL +KA + +AGK +G +C AP + Sbjct: 62 ADFNDADILLLPGGMPGATNL----------NAHEGVKAALKKQIEAGKRVGAICAAPMV 111 Query: 141 LP 142 L Sbjct: 112 LA 113 >UniRef50_D1AAI5 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=D1AAI5_THECD Length = 226 Score = 39.5 bits (91), Expect = 0.070, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 49/130 (37%), Gaps = 30/130 (23%) Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 Q D + A+ GG G + +F + A A+ +++AG + +C P Sbjct: 87 DQVDPSGYAAIFYVGGHG---TMWDFPQDAAL-------AAAARRIYEAGGVVAAVCHGP 136 Query: 139 A-MLPKIFDFPLRLTIGTDIDT-------------------AEVLEEMGAEHVPCPVDDI 178 A +LP L G D+ + ++ LE++GA H P Sbjct: 137 AGLLPITLSDGRPLVEGRDLTSFTNEEEADQGLTDVVPFLLSDALEKLGARHHGKPAYQA 196 Query: 179 VVDEDNKIVT 188 V D +++T Sbjct: 197 NVVVDGRLIT 206 >UniRef50_Q9RX24 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RX24_DEIRA Length = 261 Score = 39.5 bits (91), Expect = 0.082, Method: Composition-based stats. Identities = 28/137 (20%), Positives = 46/137 (33%), Gaps = 17/137 (12%) Query: 84 AELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPK 143 AL +PGG ++ D L + + H A KP +C AP L Sbjct: 107 QAYAALFLPGGHAPM----------ADLMADPALGEILRQFHAAAKPTALICHAPTALLA 156 Query: 144 IFDFPLRLTIGTDID---TAEVLEEMGAEHVPCPVDDIVVDEDN---KIVTTPAYMLAQN 197 P+ G + +A+ G D+ E ++ PA L Q Sbjct: 157 AQADPVAFQKGMEAGEQPSAQDFTYQGYRVTVFANDEEEATEKTFEAPMLYYPADALTQG 216 Query: 198 IAEAASGIDKLVSRVLV 214 A+ +G + + V+ Sbjct: 217 GAQVENG-EAMKPNVVR 232 >UniRef50_Q8PSJ5 Protease I n=1 Tax=Methanosarcina mazei RepID=Q8PSJ5_METMA Length = 144 Score = 39.2 bits (90), Expect = 0.090, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 41/115 (35%), Gaps = 21/115 (18%) Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 + + D L++ GG G K +D++ + + + KP+ +C P Sbjct: 33 KDVNPEDYDILVISGGKGPEK-----------MRLDKDALEITKHFFEKNKPVAAICHGP 81 Query: 139 -AMLPKIFDFPLRLT--IGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 ++ + T IG D GA + ++V+D + +P Sbjct: 82 QVLVSAGVIKGRKATCWIGIRDDIIAA----GALYED---SEVVIDGNFVSSRSP 129 >UniRef50_C3XNX9 Putative uncharacterized protein n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XNX9_9HELI Length = 191 Score = 39.2 bits (90), Expect = 0.091, Method: Composition-based stats. Identities = 29/131 (22%), Positives = 55/131 (41%), Gaps = 26/131 (19%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEA---ARITR 72 G E EA+ + + R+G A + + DV++ N+++E+ +I Sbjct: 12 GFEELEAISVIDVLRRAGCDV-IVAKVESKNDVLD----------SNLIVESQKGVKIVA 60 Query: 73 GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLG 132 + L+ D LD ++ PGG+ +NL LK + + ++ G+ + Sbjct: 61 DKF--LSAVDCECLDGIVFPGGWEGTQNL----------IASSSLKEVLEKLNAKGRIIA 108 Query: 133 FMCIAPAMLPK 143 +C AP L K Sbjct: 109 AICAAPLALFK 119 >UniRef50_A5FSI4 DJ-1 family protein n=5 Tax=Dehalococcoides RepID=A5FSI4_DEHSB Length = 180 Score = 39.2 bits (90), Expect = 0.094, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 44/113 (38%), Gaps = 22/113 (19%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + + + L++PGG N+ D+ + L + H K L +C Sbjct: 55 IDDLKTTDYEVLVLPGGNPGFINMGK----------DQRVLELIRTAHAENKYLAAICAG 104 Query: 138 PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHV--PCPVDDIVVDEDNKIVT 188 PA+L + + ID EV G +H+ C D+ V + +++T Sbjct: 105 PAVLSR---AGV-------IDGKEVAIYPGVKHLLKNCTACDLRVKVEGRLIT 147 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B4EXM9 Enhancing lycopene biosynthesis protein 2 n=2 Ta... 256 5e-67 UniRef50_P0ABU5 Enhancing lycopene biosynthesis protein 2 n=267 ... 255 7e-67 UniRef50_B6XFP3 Putative uncharacterized protein n=1 Tax=Provide... 254 1e-66 UniRef50_P30042 ES1 protein homolog, mitochondrial n=45 Tax=Meta... 245 6e-64 UniRef50_Q6APY5 Putative uncharacterized protein n=1 Tax=Desulfo... 245 1e-63 UniRef50_B9EL82 ES1 protein homolog, mitochondrial n=11 Tax=Eume... 243 4e-63 UniRef50_A3M9Q1 Enhancing lycopene biosynthesis protein 2 n=5 Ta... 240 4e-62 UniRef50_C9J1C8 Putative uncharacterized protein C21orf33 (Fragm... 236 3e-61 UniRef50_Q6MQ93 Enhancing lycopene biosynthesis protein 2 n=3 Ta... 232 7e-60 UniRef50_Q3AU80 Es1 family protein n=30 Tax=Bacteria RepID=Q3AU8... 229 4e-59 UniRef50_C5LKS2 Putative uncharacterized protein n=4 Tax=Eukaryo... 229 6e-59 UniRef50_Q2GLV6 Es1 family protein n=5 Tax=Anaplasma RepID=Q2GLV... 224 1e-57 UniRef50_D2W324 Glutamine amidotransferase domain-containing pro... 221 2e-56 UniRef50_Q90257 ES1 protein, mitochondrial n=3 Tax=Danio rerio R... 218 9e-56 UniRef50_A9V3H5 Predicted protein n=5 Tax=Fungi/Metazoa group Re... 218 2e-55 UniRef50_C0R4T2 Enhancing lycopene biosynthesis protein 2, putat... 214 2e-54 UniRef50_Q21UI1 ThiJ/PfpI n=1 Tax=Rhodoferax ferrireducens T118 ... 211 1e-53 UniRef50_Q2RPB9 ThiJ/PfpI n=7 Tax=Bacteria RepID=Q2RPB9_RHORT 209 7e-53 UniRef50_B9Z0T4 ThiJ/PfpI domain protein n=1 Tax=Lutiella nitrof... 200 2e-50 UniRef50_A4IY89 DJ-1/PfpI family protein n=24 Tax=Gammaproteobac... 196 4e-49 UniRef50_UPI0000DB6CF4 PREDICTED: similar to es1 protein n=2 Tax... 194 1e-48 UniRef50_P30042-2 Isoform Short of ES1 protein homolog, mitochon... 192 6e-48 UniRef50_UPI000186E026 conserved hypothetical protein n=1 Tax=Pe... 189 7e-47 UniRef50_Q2NWH9 Sigma cross-reacting protein 27A (SCRP-27A) n=1 ... 186 5e-46 UniRef50_C5CG98 ThiJ/PfpI domain protein n=1 Tax=Kosmotoga olear... 178 1e-43 UniRef50_C1C1F4 Enhancing lycopene biosynthesis protein 2 n=2 Ta... 167 3e-40 UniRef50_A6NYE6 Putative uncharacterized protein n=1 Tax=Bactero... 162 6e-39 UniRef50_Q8WQI1 Lycopene biosynthesis-enhancing protein n=2 Tax=... 151 1e-35 UniRef50_D2V705 Predicted protein n=1 Tax=Naegleria gruberi RepI... 142 9e-33 UniRef50_D1CDU1 Intracellular protease, PfpI family n=40 Tax=Bac... 137 3e-31 UniRef50_B9TI86 Protease C56, putative n=1 Tax=Ricinus communis ... 134 3e-30 UniRef50_C5DA95 Intracellular protease, PfpI family n=11 Tax=cel... 131 1e-29 UniRef50_Q1QTA8 Peptidase C56, PfpI n=27 Tax=Bacteria RepID=Q1QT... 130 2e-29 UniRef50_D1CA42 Intracellular protease, PfpI family n=4 Tax=Bact... 130 3e-29 UniRef50_Q26CT8 Intracellular protease, PfpI family n=3 Tax=Bact... 129 6e-29 UniRef50_B9XL12 Intracellular protease, PfpI family n=1 Tax=bact... 129 6e-29 UniRef50_C8WTD3 ThiJ/PfpI domain protein n=5 Tax=Bacillales RepI... 125 7e-28 UniRef50_Q2NCQ5 Protease n=5 Tax=Proteobacteria RepID=Q2NCQ5_ERYLH 125 1e-27 UniRef50_A1VNH7 Intracellular protease, PfpI family n=12 Tax=Pro... 124 2e-27 UniRef50_A1TMJ4 Intracellular protease, PfpI family n=11 Tax=Bac... 124 3e-27 UniRef50_B8DQK9 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=... 124 3e-27 UniRef50_C6CWE5 Intracellular protease, PfpI family n=6 Tax=Bact... 122 6e-27 UniRef50_A1K1D1 ThiJ/PfpI family protein n=17 Tax=cellular organ... 122 9e-27 UniRef50_C5CQ44 Intracellular protease, PfpI family n=11 Tax=Bac... 121 1e-26 UniRef50_A3IN92 ThiJ/PfpI n=2 Tax=Cyanothece RepID=A3IN92_9CHRO 121 2e-26 UniRef50_Q5ZU31 Intracellular protease, ThiJ/PfpI family n=4 Tax... 120 3e-26 UniRef50_Q7MQ54 Putative uncharacterized protein VV0154 n=1 Tax=... 119 5e-26 UniRef50_Q5FQ93 Protease I n=1 Tax=Gluconobacter oxydans RepID=Q... 119 5e-26 UniRef50_Q464Y3 Putative intracellular protease n=2 Tax=cellular... 118 1e-25 UniRef50_Q1DD54 Peptidase, C56 (PfpI) family n=2 Tax=Cystobacter... 118 1e-25 UniRef50_Q0ULG5 Putative uncharacterized protein n=2 Tax=Pleospo... 117 3e-25 UniRef50_D2S4J1 Intracellular protease, PfpI family n=2 Tax=Fran... 116 6e-25 UniRef50_UPI00017898AF intracellular protease, PfpI family n=1 T... 115 7e-25 UniRef50_Q313C6 Peptidase C56, PfpI n=12 Tax=cellular organisms ... 115 9e-25 UniRef50_B2JT66 ThiJ/PfpI domain protein n=4 Tax=Burkholderiales... 113 4e-24 UniRef50_Q5HPG8 ThiJ/PfpI family protein n=9 Tax=Staphylococcus ... 112 6e-24 UniRef50_A4FDV5 Protease I n=8 Tax=Bacteria RepID=A4FDV5_SACEN 112 6e-24 UniRef50_B1KEU0 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria ... 112 7e-24 UniRef50_D2W0D5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 111 2e-23 UniRef50_B1K8Q4 ThiJ/PfpI domain protein n=220 Tax=cellular orga... 109 7e-23 UniRef50_D1YEB9 Intracellular protease, PfpI family n=4 Tax=Acti... 109 7e-23 UniRef50_B2UQP3 DJ-1 family protein n=1 Tax=Akkermansia muciniph... 109 9e-23 UniRef50_C7PRI4 ThiJ/PfpI domain protein n=1 Tax=Chitinophaga pi... 109 1e-22 UniRef50_B2IIZ8 ThiJ/PfpI domain protein n=1 Tax=Beijerinckia in... 108 1e-22 UniRef50_Q0SRB0 DJ-1 family protein n=35 Tax=Bacteria RepID=Q0SR... 108 1e-22 UniRef50_B8KZH8 Intracellular proteinase PfpI n=1 Tax=Stenotroph... 108 2e-22 UniRef50_Q0B5J2 ThiJ/PfpI domain protein n=18 Tax=Proteobacteria... 107 3e-22 UniRef50_B3DRT7 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 106 5e-22 UniRef50_UPI0001B4C882 ThiJ/PfpI domain-containing protein n=1 T... 105 8e-22 UniRef50_A5WBS8 ThiJ/PfpI domain protein n=2 Tax=Psychrobacter R... 105 8e-22 UniRef50_A1AQV7 Metal dependent phosphohydrolase n=3 Tax=Bacteri... 105 8e-22 UniRef50_C2BZS9 C56 family peptidase n=1 Tax=Listeria grayi DSM ... 105 1e-21 UniRef50_D0SXB4 Intracellular protease n=2 Tax=Acinetobacter Rep... 104 2e-21 UniRef50_UPI0001979AA3 4-methyl-5(beta-hydroxyethyl)-thiazole mo... 104 2e-21 UniRef50_Q7NER9 Glr3809 protein n=1 Tax=Gloeobacter violaceus Re... 104 2e-21 UniRef50_B0N5W0 Putative uncharacterized protein n=2 Tax=Bacteri... 104 3e-21 UniRef50_C9RGD3 Intracellular protease, PfpI family n=1 Tax=Meth... 104 3e-21 UniRef50_A4X5M7 ThiJ/PfpI domain protein n=4 Tax=Actinomycetales... 103 4e-21 UniRef50_Q045Z0 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 102 6e-21 UniRef50_C7N926 DJ-1 family protein n=3 Tax=Leptotrichia RepID=C... 102 7e-21 UniRef50_B5HD84 Protease I n=1 Tax=Streptomyces pristinaespirali... 102 7e-21 UniRef50_P80876 General stress protein 18 n=22 Tax=Bacteria RepI... 101 1e-20 UniRef50_C5VG63 DJ-1 family protein n=6 Tax=Prevotella RepID=C5V... 101 2e-20 UniRef50_B0SBM0 Transcription regulator, DJ-1/PfpI family intrac... 101 2e-20 UniRef50_A7GXC1 DJ-1 family protein n=6 Tax=Campylobacter RepID=... 100 2e-20 UniRef50_B8DF54 Intracellular protease 1 (Intracellular protease... 100 3e-20 UniRef50_A7HTR2 ThiJ/PfpI domain protein n=1 Tax=Parvibaculum la... 100 3e-20 UniRef50_D2RSY5 Intracellular protease, PfpI family n=1 Tax=Halo... 99 1e-19 UniRef50_C1SLG0 DJ-1 family protein n=1 Tax=Denitrovibrio acetip... 98 2e-19 UniRef50_B5Y9N6 Intracellular protease 1 (Intracellular protease... 98 2e-19 UniRef50_C6PQH3 ThiJ/PfpI domain protein n=11 Tax=Bacteria RepID... 98 2e-19 UniRef50_C6RJA7 Intracellular protease, PfpI family n=3 Tax=Acin... 98 2e-19 UniRef50_Q58377 Uncharacterized protein MJ0967 n=8 Tax=Euryarcha... 97 3e-19 UniRef50_A4XRE4 ThiJ/PfpI domain protein n=3 Tax=Gammaproteobact... 97 3e-19 UniRef50_A9HQ45 Putative transcriptional regulator n=1 Tax=Gluco... 97 4e-19 UniRef50_Q0W5Q2 Intracellular protease (C56 family) n=3 Tax=cell... 97 4e-19 UniRef50_A8RCF9 Putative uncharacterized protein n=2 Tax=Firmicu... 96 7e-19 UniRef50_Q27SQ0 Protease/amidase (Fragment) n=1 Tax=Pavlova luth... 95 2e-18 UniRef50_Q0JPK7 Os01g0217800 protein n=16 Tax=Magnoliophyta RepI... 95 2e-18 UniRef50_Q12ZS1 Intracellular protease 1 n=3 Tax=Methanosarcinac... 94 2e-18 UniRef50_UPI0000E47ADE PREDICTED: similar to KNP-Ia, partial n=2... 94 3e-18 UniRef50_B4U9D2 DJ-1 family protein n=1 Tax=Hydrogenobaculum sp.... 94 5e-18 UniRef50_C3MPE6 Intracellular protease, PfpI family n=11 Tax=Sul... 93 6e-18 UniRef50_Q21F36 ThiJ/PfpI n=1 Tax=Saccharophagus degradans 2-40 ... 92 2e-17 UniRef50_C0WIV6 C56 family peptidase n=8 Tax=Actinomycetales Rep... 92 2e-17 UniRef50_B1YM90 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium... 91 3e-17 UniRef50_A8P0K7 Putative uncharacterized protein n=1 Tax=Coprino... 91 4e-17 UniRef50_B8FFC8 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria ... 88 2e-16 UniRef50_C4G3X0 Putative uncharacterized protein n=1 Tax=Abiotro... 88 2e-16 UniRef50_A6Q7R5 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosp... 87 3e-16 UniRef50_Q7M905 MONOPHOSPHATE SYNTHESISPROTEIN n=1 Tax=Wolinella... 87 4e-16 UniRef50_Q9ZV19 ProteaseI (PfpI)-like protein n=4 Tax=Arabidopsi... 87 5e-16 UniRef50_UPI0001AEB94E ThiJ/PfpI domain-containing protein n=1 T... 87 5e-16 UniRef50_Q6F1K3 Putative intracellular protease/amidase n=1 Tax=... 87 5e-16 UniRef50_Q04P14 ThiJ/PfpI family protein n=5 Tax=Bacteria RepID=... 86 8e-16 UniRef50_A4WJ08 ThiJ/PfpI domain protein n=1 Tax=Pyrobaculum ars... 85 2e-15 UniRef50_A8QA68 Putative uncharacterized protein n=1 Tax=Malasse... 81 2e-14 Sequences not found previously or not previously below threshold: UniRef50_Q5JGM7 Intracellular protease 1 n=22 Tax=cellular organ... 110 4e-23 UniRef50_Q48CJ9 Protease PfpI n=12 Tax=Bacteria RepID=Q48CJ9_PSE14 109 7e-23 UniRef50_C5PN70 C56 family peptidase n=2 Tax=Sphingobacterium sp... 107 2e-22 UniRef50_Q9V1F8 Intracellular protease 1 n=6 Tax=cellular organi... 107 3e-22 UniRef50_A0Q9X2 Intracellular protease, PfpI family protein n=4 ... 106 5e-22 UniRef50_Q88JC4 Protease PfpI n=3 Tax=Bacteria RepID=Q88JC4_PSEPK 106 6e-22 UniRef50_Q47QQ3 Peptidase C56, PfpI n=12 Tax=Bacteria RepID=Q47Q... 106 6e-22 UniRef50_C5RPJ4 DJ-1 family protein n=1 Tax=Clostridium cellulov... 105 8e-22 UniRef50_C5A5Q9 Peptidase C56, intracellular protease PfpI famil... 105 8e-22 UniRef50_Q0C397 Intracellular protease, PfpI family n=6 Tax=Alph... 105 1e-21 UniRef50_A7H8I0 Intracellular protease, PfpI family n=6 Tax=Bact... 104 2e-21 UniRef50_B9M1L7 Intracellular protease, PfpI family n=6 Tax=Bact... 104 2e-21 UniRef50_A9ARH9 Intracellular protease, PfpI family n=5 Tax=Burk... 103 4e-21 UniRef50_B9L1B0 Protease I n=2 Tax=Thermomicrobia (class) RepID=... 103 4e-21 UniRef50_A4XSU5 Intracellular protease, PfpI family n=11 Tax=cel... 103 5e-21 UniRef50_C5AII4 Intracellular protease, PfpI family protein n=1 ... 102 7e-21 UniRef50_Q9M8R4 F13E7.34 protein n=273 Tax=cellular organisms Re... 102 1e-20 UniRef50_B3XE12 Peptidase C56, PfpI n=6 Tax=Escherichia RepID=B3... 102 1e-20 UniRef50_B0UB01 ThiJ/PfpI domain protein n=2 Tax=Alphaproteobact... 101 2e-20 UniRef50_P45470 Protein yhbO n=95 Tax=Bacteria RepID=YHBO_ECOLI 101 2e-20 UniRef50_C8PUR4 Intracellular protease 1 n=1 Tax=Enhydrobacter a... 101 2e-20 UniRef50_A4WTY0 Intracellular protease, PfpI family n=7 Tax=Bact... 100 3e-20 UniRef50_A8L5X9 Intracellular protease, PfpI family n=32 Tax=Bac... 100 4e-20 UniRef50_C0QRM2 Intracellular protease 1 (Intracellular protease... 99 8e-20 UniRef50_C6CHL2 ThiJ/PfpI domain protein n=4 Tax=Enterobacteriac... 99 1e-19 UniRef50_O28987 Uncharacterized protein AF_1281 n=11 Tax=cellula... 99 1e-19 UniRef50_D1BGR7 Intracellular protease, PfpI family n=14 Tax=Act... 98 2e-19 UniRef50_C0SMV9 Putative uncharacterized protein n=1 Tax=Strepto... 98 2e-19 UniRef50_C5D5R3 ThiJ/PfpI domain protein n=6 Tax=Bacillaceae Rep... 97 3e-19 UniRef50_Q026F0 Intracellular protease, PfpI family n=3 Tax=Bact... 97 4e-19 UniRef50_B1YMA0 Intracellular protease, PfpI family n=14 Tax=Bac... 96 6e-19 UniRef50_C4L0C2 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium... 96 7e-19 UniRef50_B2A567 Intracellular protease, PfpI family n=1 Tax=Natr... 96 8e-19 UniRef50_C0GN28 Intracellular protease, PfpI family n=1 Tax=Desu... 96 1e-18 UniRef50_O06006 Putative cysteine protease yraA n=81 Tax=cellula... 95 1e-18 UniRef50_C5RB54 C56 family peptidase n=12 Tax=Lactobacillales Re... 94 2e-18 UniRef50_B5XSZ0 Intracellular protease, PfpI family n=7 Tax=Bact... 94 2e-18 UniRef50_Q2SRY0 DJ-1 family protein n=3 Tax=Mycoplasma mycoides ... 94 3e-18 UniRef50_B4RT08 Protease n=2 Tax=Alteromonadales RepID=B4RT08_ALTMD 94 3e-18 UniRef50_A1HSG1 Intracellular protease, PfpI family n=1 Tax=Ther... 93 5e-18 UniRef50_C7MB29 Intracellular protease, PfpI family n=1 Tax=Brac... 93 6e-18 UniRef50_A1VCT8 Intracellular protease, PfpI family n=14 Tax=Bac... 93 7e-18 UniRef50_Q03XP7 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 92 8e-18 UniRef50_B9ZSD7 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=... 92 8e-18 UniRef50_C5NVZ3 DJ-1 family protein n=1 Tax=Gemella haemolysans ... 92 1e-17 UniRef50_Q0SMN5 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 91 2e-17 UniRef50_C4RK05 Intracellular proteinase I n=1 Tax=Micromonospor... 91 2e-17 UniRef50_A7HFM3 ThiJ/PfpI domain protein n=1 Tax=Anaeromyxobacte... 91 2e-17 UniRef50_A0K1V5 ThiJ/PfpI domain protein n=6 Tax=Actinomycetales... 91 2e-17 UniRef50_A0B7B8 DJ-1 family protein n=3 Tax=cellular organisms R... 91 3e-17 UniRef50_C2BGT7 Possible transcriptional regulator n=2 Tax=Anaer... 91 3e-17 UniRef50_Q72HB0 Putative amidotransferase n=2 Tax=Thermus RepID=... 91 3e-17 UniRef50_C4L1D0 Intracellular protease, PfpI family n=2 Tax=Bact... 91 3e-17 UniRef50_Q4V0N9 NonF-related protein n=5 Tax=cellular organisms ... 90 4e-17 UniRef50_Q24FT7 DJ-1/PfpI family protein n=1 Tax=Tetrahymena the... 90 5e-17 UniRef50_B2GGQ1 ThiJ/PfpI family protein n=6 Tax=Actinomycetales... 90 6e-17 UniRef50_B2IWY8 Intracellular protease, PfpI family n=3 Tax=Cyan... 90 7e-17 UniRef50_C9RKY9 DJ-1 family protein n=1 Tax=Fibrobacter succinog... 89 8e-17 UniRef50_B0N2E2 Putative uncharacterized protein n=2 Tax=Bacteri... 89 8e-17 UniRef50_D2R736 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=... 89 1e-16 UniRef50_B0NJL3 Putative uncharacterized protein n=1 Tax=Clostri... 89 1e-16 UniRef50_B4PML8 GE10903 n=8 Tax=Neoptera RepID=B4PML8_DROYA 89 1e-16 UniRef50_Q0BE72 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria ... 89 1e-16 UniRef50_A8UMK5 DJ-1/PfpI family protein n=1 Tax=Flavobacteriale... 89 1e-16 UniRef50_A2BLI2 Protease I n=1 Tax=Hyperthermus butylicus DSM 54... 89 1e-16 UniRef50_D0BJF4 DJ-1 family protein n=1 Tax=Granulicatella elega... 89 1e-16 UniRef50_Q311F3 Peptidase C56, PfpI n=2 Tax=Bacteria RepID=Q311F... 88 2e-16 UniRef50_Q13FG4 Peptidase C56, PfpI n=1 Tax=Burkholderia xenovor... 88 2e-16 UniRef50_B5YN08 Predicted protein n=2 Tax=Bacillariophyta RepID=... 88 2e-16 UniRef50_B5ZFI3 ThiJ/PfpI domain protein n=1 Tax=Gluconacetobact... 88 2e-16 UniRef50_Q47L11 Peptidase C56, PfpI n=2 Tax=Bacteria RepID=Q47L1... 88 2e-16 UniRef50_C0W6P5 Possible transcriptional regulator n=1 Tax=Actin... 88 2e-16 UniRef50_C5F0X7 Putative uncharacterized protein n=2 Tax=Helicob... 88 2e-16 UniRef50_Q30R88 DJ-1 n=4 Tax=Epsilonproteobacteria RepID=Q30R88_... 88 2e-16 UniRef50_B5E3I6 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 87 3e-16 UniRef50_C1QDE2 Intracellular protease, PfpI family n=1 Tax=Brac... 87 3e-16 UniRef50_C8N9N9 ThiJ/PfpI family protein n=1 Tax=Cardiobacterium... 87 3e-16 UniRef50_C7PM17 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=... 87 3e-16 UniRef50_Q5HN59 Uncharacterized protein SERP1413 n=71 Tax=Bacter... 87 3e-16 UniRef50_D1AAI5 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=... 87 3e-16 UniRef50_C7RFT2 DJ-1 family protein n=2 Tax=Anaerococcus RepID=C... 87 3e-16 UniRef50_A7NLT5 ThiJ/PfpI domain protein n=7 Tax=Bacteria RepID=... 87 3e-16 UniRef50_C1XV98 Putative intracellular protease/amidase n=1 Tax=... 87 3e-16 UniRef50_C8NVB7 ThiJ/PfpI family protein n=1 Tax=Corynebacterium... 87 4e-16 UniRef50_B7HF13 ThiJ/pfpI family protein n=61 Tax=Bacillus RepID... 87 4e-16 UniRef50_C2EQY3 Possible transcriptional regulator n=10 Tax=Lact... 87 5e-16 UniRef50_Q01V75 Intracellular protease, PfpI family n=15 Tax=Bac... 87 5e-16 UniRef50_Q0SC28 Possible transcriptional regulator n=14 Tax=Acti... 87 5e-16 UniRef50_B4D2S1 ThiJ/PfpI domain protein n=8 Tax=Bacteria RepID=... 87 5e-16 UniRef50_Q486I7 DJ-1/PfpI family protein n=1 Tax=Colwellia psych... 86 6e-16 UniRef50_Q8YYF7 Alr0893 protein n=3 Tax=Nostocaceae RepID=Q8YYF7... 86 7e-16 UniRef50_Q29CB8 GA12322 n=5 Tax=Endopterygota RepID=Q29CB8_DROPS 86 9e-16 UniRef50_A9KDZ3 Protease I n=3 Tax=Coxiella burnetii RepID=A9KDZ... 86 1e-15 UniRef50_B1WZ24 ThiJ/PfpI peptidase C56 family n=5 Tax=Cyanobact... 86 1e-15 UniRef50_Q47L13 Intracellular protease/amidase, putative n=2 Tax... 86 1e-15 UniRef50_Q47YD7 DJ-1/PfpI family protein n=1 Tax=Colwellia psych... 86 1e-15 UniRef50_C7RHW9 Intracellular protease, PfpI family n=1 Tax=Anae... 86 1e-15 UniRef50_A8A9H6 Intracellular protease, PfpI family n=1 Tax=Igni... 85 1e-15 UniRef50_B1L3Q5 Intracellular protease, PfpI family n=1 Tax=Cand... 85 1e-15 UniRef50_A1ZP43 ThiJ/PfpI family n=1 Tax=Microscilla marina ATCC... 85 2e-15 UniRef50_D1VVY7 DJ-1 family protein n=1 Tax=Peptoniphilus lacrim... 85 2e-15 UniRef50_Q97KC8 Putative intracellular protease n=1 Tax=Clostrid... 85 2e-15 UniRef50_Q49YS0 Uncharacterized protein SSP0918 n=14 Tax=Bacilli... 85 2e-15 UniRef50_B4SJZ4 ThiJ/PfpI domain protein n=1 Tax=Stenotrophomona... 84 2e-15 UniRef50_Q15SH5 ThiJ/PfpI n=14 Tax=Gammaproteobacteria RepID=Q15... 84 2e-15 UniRef50_B8CX62 DJ-1 family protein n=1 Tax=Halothermothrix oren... 84 2e-15 UniRef50_Q1GFT3 ThiJ/PfpI n=5 Tax=Proteobacteria RepID=Q1GFT3_SILST 84 3e-15 UniRef50_C3XNX9 Putative uncharacterized protein n=1 Tax=Helicob... 84 3e-15 UniRef50_C7HVF0 DJ-1 family protein n=1 Tax=Anaerococcus vaginal... 84 3e-15 UniRef50_A6CIJ2 Predicted intracellular protease/amidase, ThiJ/P... 84 3e-15 UniRef50_B6YYX0 Intracellular protease 1 n=1 Tax=Pseudovibrio sp... 84 3e-15 UniRef50_C8KYS8 Type 1 glutamine amidotransferase n=1 Tax=Actino... 84 3e-15 UniRef50_B3T0X6 Putative DJ-1/PfpI family protein n=1 Tax=uncult... 84 4e-15 UniRef50_B9W435 Putative DJ-1 family protein (Fragment) n=1 Tax=... 84 4e-15 UniRef50_C5VYK3 DJ-1/PfpI family protein n=6 Tax=Streptococcus s... 84 4e-15 UniRef50_C2AV24 Putative intracellular protease/amidase n=1 Tax=... 84 5e-15 UniRef50_B1YFC6 ThiJ/PfpI domain protein n=4 Tax=Bacillales RepI... 84 5e-15 UniRef50_C4KYU5 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium... 83 5e-15 UniRef50_C5EU56 DJ-1 family protein n=3 Tax=Clostridiales RepID=... 83 5e-15 UniRef50_Q16AF6 Protease, putative n=10 Tax=Alphaproteobacteria ... 83 6e-15 UniRef50_A6DTE4 Putative intracellular protease/amidase, ThiJ fa... 83 7e-15 UniRef50_A6WAG0 ThiJ/PfpI domain protein n=1 Tax=Kineococcus rad... 83 8e-15 UniRef50_A6C613 Intracellular proteinase pfpI n=1 Tax=Planctomyc... 82 8e-15 UniRef50_D1PPH9 ThiJ/PfpI family protein n=1 Tax=Subdoligranulum... 82 9e-15 UniRef50_Q15TQ8 ThiJ/PfpI n=1 Tax=Pseudoalteromonas atlantica T6... 82 1e-14 UniRef50_C4ZGF1 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosp... 82 1e-14 UniRef50_C3WFG1 4-methyl-5(B-hydroxyethyl)-thiazole monophosphat... 82 1e-14 UniRef50_A8RKB7 Putative uncharacterized protein n=2 Tax=Clostri... 82 1e-14 UniRef50_A9KSW4 DJ-1 family protein n=14 Tax=Clostridiales RepID... 82 1e-14 UniRef50_A8SLZ0 Putative uncharacterized protein n=1 Tax=Parvimo... 82 2e-14 UniRef50_C8V2F0 ThiJ/PfpI family protein (AFU_orthologue; AFUA_3... 82 2e-14 UniRef50_C3QHA6 ThiJ family intracellular protease/amidase n=28 ... 82 2e-14 UniRef50_Q8PSJ5 Protease I n=1 Tax=Methanosarcina mazei RepID=Q8... 81 2e-14 >UniRef50_B4EXM9 Enhancing lycopene biosynthesis protein 2 n=2 Tax=Proteus mirabilis RepID=B4EXM9_PROMH Length = 216 Score = 256 bits (653), Expect = 5e-67, Method: Composition-based stats. Identities = 126/216 (58%), Positives = 168/216 (77%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK I VILSGCGV+DGSEIHE+VLT+LA+S++ A+ F+P+ Q VINH+TGE E Sbjct: 1 MKSIAVILSGCGVFDGSEIHESVLTMLALSQNKAEVHYFSPNDFQPTVINHITGEEKAEK 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RN++ EA+RI+RG+I PL+ A A DA+I+PGGFGAAKNL NFA+ G +C ++++L Sbjct: 61 RNMMEEASRISRGKISPLSDAKAENFDAVIIPGGFGAAKNLCNFATKGVQCEINQQLLTF 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q MHQ KPLG MCIAP MLPK+ + P++LTIG D TAE++ EMG H+ CPVD+IVV Sbjct: 121 VQKMHQQKKPLGLMCIAPVMLPKMLNAPVKLTIGNDAKTAEMITEMGGIHINCPVDEIVV 180 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 D++ ++VTTPAYMLA++IA+A GI+KLV +VL +A Sbjct: 181 DDEYRVVTTPAYMLAESIAQAQVGIEKLVKKVLEMA 216 >UniRef50_P0ABU5 Enhancing lycopene biosynthesis protein 2 n=267 Tax=Bacteria RepID=ELBB_ECOLI Length = 217 Score = 255 bits (651), Expect = 7e-67, Method: Composition-based stats. Identities = 217/217 (100%), Positives = 217/217 (100%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET Sbjct: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL Sbjct: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV Sbjct: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE Sbjct: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 >UniRef50_B6XFP3 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XFP3_9ENTR Length = 232 Score = 254 bits (649), Expect = 1e-66, Method: Composition-based stats. Identities = 134/216 (62%), Positives = 171/216 (79%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK I VILSGCGV+DGSEIHE+VLT+LA+S++ A+ FAPD+ Q VINH+ GE TET Sbjct: 17 MKSIAVILSGCGVFDGSEIHESVLTMLALSKNNAEVHFFAPDEDQATVINHINGELKTET 76 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RN + E+ARI+RG+I PL+ D ++LDALI+PGGFG AKNL NFA+ GSEC ++++L +L Sbjct: 77 RNQMEESARISRGKIAPLSSVDPSKLDALIIPGGFGVAKNLCNFATKGSECEINKQLLSL 136 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q MHQ KPLG MCIAP MLPK+ + ++LTIG D +T +E+MG HV C VD+IVV Sbjct: 137 VQVMHQQKKPLGLMCIAPVMLPKMLNTSVKLTIGNDTETIAQIEKMGGLHVECTVDNIVV 196 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 DE+NK+VTTPAYMLAQ+IAEA GI+KLV +VL +A Sbjct: 197 DENNKVVTTPAYMLAQSIAEANVGINKLVEKVLEMA 232 >UniRef50_P30042 ES1 protein homolog, mitochondrial n=45 Tax=Metazoa RepID=ES1_HUMAN Length = 268 Score = 245 bits (626), Expect = 6e-64, Method: Composition-based stats. Identities = 101/225 (44%), Positives = 139/225 (61%), Gaps = 11/225 (4%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TET 60 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ E+ Sbjct: 43 ARVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES 102 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL E+ARI RG+I LA AA DA I PGGFGAAKNLS FA G +C V++E++ + Sbjct: 103 RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLSTFAVDGKDCKVNKEVERV 162 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIG---------TDIDTAEVLEEMGAEHV 171 + HQAGKP+G CIAP + K+ + +T+G TAE ++ +GA+H Sbjct: 163 LKEFHQAGKPIGLCCIAPVLAAKVLR-GVEVTVGHEQEEGGKWPYAGTAEAIKALGAKHC 221 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 V + VD+ NK+VTTPA+M + GI +V +VL L Sbjct: 222 VKEVVEAHVDQKNKVVTTPAFMCETALHYIHDGIGAMVRKVLELT 266 >UniRef50_Q6APY5 Putative uncharacterized protein n=1 Tax=Desulfotalea psychrophila RepID=Q6APY5_DESPS Length = 218 Score = 245 bits (624), Expect = 1e-63, Method: Composition-based stats. Identities = 105/219 (47%), Positives = 139/219 (63%), Gaps = 3/219 (1%) Query: 1 MKK--IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMT 58 MKK I VILSGCG DGSEIHEA ++L AI G CFAPD Q+ VINHL GE Sbjct: 1 MKKHKIAVILSGCGHLDGSEIHEATMSLWAIHSHGCDYHCFAPDIDQLHVINHLNGEETG 60 Query: 59 ETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELK 118 E+RNVL+E+ARI RG+I L Q A + DALI+PGGFGAAKNLS++ S G C V+ E+K Sbjct: 61 ESRNVLVESARIARGKISDLNQFKAEDYDALIIPGGFGAAKNLSDYFSAGVNCQVNPEVK 120 Query: 119 ALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 MHQA KP+G +CIAP +L + + +T+G D + + E+MGA I Sbjct: 121 KAIIDMHQAKKPIGALCIAPMLLAR-LISGVEITLGQDPISHQNAEKMGASTQTTDHGQI 179 Query: 179 VVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 V+D N +V+TP YML + + +G D L++ ++ + E Sbjct: 180 VIDRKNLVVSTPCYMLDARVDQIGAGADALMTEIVEMME 218 >UniRef50_B9EL82 ES1 protein homolog, mitochondrial n=11 Tax=Eumetazoa RepID=B9EL82_SALSA Length = 259 Score = 243 bits (619), Expect = 4e-63, Method: Composition-based stats. Identities = 93/224 (41%), Positives = 136/224 (60%), Gaps = 11/224 (4%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTET 60 K+ V+LSGCGVYDG+EIHEA L+ +SR GA+ +APD Q+ VI+H G E+ Sbjct: 34 AKVAVVLSGCGVYDGTEIHEASAILVHLSRGGAEVQMYAPDVSQMHVIDHGKGVPAENES 93 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL E+ARI RG I L + + DA+I PGGFGAAKNLS FA G +C ++ +++ + Sbjct: 94 RNVLSESARIARGNITDLVKLSVSNHDAIIFPGGFGAAKNLSTFAVDGPDCKINADVERV 153 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIG---------TDIDTAEVLEEMGAEHV 171 + H+AGKP+G CI+P + K+ + +T+G TA ++ +GA+H+ Sbjct: 154 LKDFHKAGKPIGLCCISPVLAAKLLP-GVEVTVGHEEEKGGKWPYAGTAGAIKALGAKHI 212 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVL 215 V + VD+ NK+VT+PA+M + GI +V+ VL L Sbjct: 213 VKEVTEAHVDQKNKVVTSPAFMCETQLHLIFDGIGAMVTNVLKL 256 >UniRef50_A3M9Q1 Enhancing lycopene biosynthesis protein 2 n=5 Tax=Acinetobacter RepID=A3M9Q1_ACIBT Length = 220 Score = 240 bits (611), Expect = 4e-62, Method: Composition-based stats. Identities = 104/220 (47%), Positives = 141/220 (64%), Gaps = 3/220 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTE 59 MKK+ VILSGCG DGSEI E+VLTLLA+ + FAPD+ VI+H++GE MTE Sbjct: 1 MKKVAVILSGCGYLDGSEIRESVLTLLALDTVNIEYQIFAPDEPLFHVIDHVSGEINMTE 60 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 RN+L EA RI RG+I L Q + E D LI+PGGFG AKNLS FA G+E V + + Sbjct: 61 RRNILQEAGRIARGKISSLDQLNENEFDGLILPGGFGVAKNLSTFAFKGAEARVHGTVAS 120 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDF-PLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + +A HQ+ KP+G +CI+PA+L F +T+G+D++ A+ +E+ G+ H C D Sbjct: 121 ILKAFHQSKKPIGAICISPALLALTFGELHPTITLGSDLNIAKEIEKTGSIHHVCQTSDC 180 Query: 179 VVDEDNKIVTTPAYMLA-QNIAEAASGIDKLVSRVLVLAE 217 VVD+ N VTTPAYM N+ + +GI LV+ + LA Sbjct: 181 VVDKQNLFVTTPAYMDDQANLKDIYTGITSLVNTMTALAN 220 >UniRef50_C9J1C8 Putative uncharacterized protein C21orf33 (Fragment) n=1 Tax=Homo sapiens RepID=C9J1C8_HUMAN Length = 257 Score = 236 bits (603), Expect = 3e-61, Method: Composition-based stats. Identities = 101/252 (40%), Positives = 139/252 (55%), Gaps = 38/252 (15%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TET 60 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ E+ Sbjct: 5 ARVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES 64 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNL------------------- 101 RNVL E+ARI RG+I LA AA DA I PGGFGAAKNL Sbjct: 65 RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLCVFELQGLPLSMWSRWEGG 124 Query: 102 --------SNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTI 153 S FA G +C V++E++ + + HQAGKP+G CIAP + K+ + +T+ Sbjct: 125 APVCCPMWSTFAVDGKDCKVNKEVERVLKEFHQAGKPIGLCCIAPVLAAKVLR-GVEVTV 183 Query: 154 G---------TDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASG 204 G TAE ++ +GA+H V + VD+ NK+VTTPA+M + G Sbjct: 184 GHEQEEGGKWPYAGTAEAIKALGAKHCVKEVVEAHVDQKNKVVTTPAFMCETALHYIHDG 243 Query: 205 IDKLVSRVLVLA 216 I +V +VL L Sbjct: 244 IGAMVRKVLELT 255 >UniRef50_Q6MQ93 Enhancing lycopene biosynthesis protein 2 n=3 Tax=Bacteria RepID=Q6MQ93_BDEBA Length = 217 Score = 232 bits (591), Expect = 7e-60, Method: Composition-based stats. Identities = 101/218 (46%), Positives = 143/218 (65%), Gaps = 3/218 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKI V+LSGCG DGSEI E+V L+ + ++GA+ CFAPD Q+ + NH+ GEA E Sbjct: 1 MKKIAVVLSGCGHRDGSEITESVSLLIGLHQAGAEVHCFAPDI-QIPITNHINGEAQGEK 59 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R++L EAARI RG I+ L + A + DA++ PGG+GAAKNLSN+A G++C V+ ++K + Sbjct: 60 RSLLTEAARIARGHIQSLDKLHAKDFDAVVFPGGYGAAKNLSNWAEKGAQCEVNPDVKRV 119 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIF-DFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 H A KP+G +CIAP ++ K+ D + +TIG D TA +E+ GA H CPV+D + Sbjct: 120 ILEFHSASKPIGALCIAPVLVAKVLGDKKVTVTIGDDAATAAEIEKTGAIHEECPVNDYI 179 Query: 180 VDEDNKIVTTPAYML-AQNIAEAASGIDKLVSRVLVLA 216 D ++K+VTTPAYM E +GI L ++ A Sbjct: 180 TDRESKVVTTPAYMYGDAKPNEVFAGIFGLAHEIVEWA 217 >UniRef50_Q3AU80 Es1 family protein n=30 Tax=Bacteria RepID=Q3AU80_CHLCH Length = 225 Score = 229 bits (584), Expect = 4e-59, Method: Composition-based stats. Identities = 113/218 (51%), Positives = 153/218 (70%), Gaps = 2/218 (0%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TET 60 +IGV+L+GCG DGSEIHEAVLTLLAIS+ GAQA+C APD Q V+NHLTG+ + E+ Sbjct: 7 PRIGVLLAGCGYLDGSEIHEAVLTLLAISKKGAQAICLAPDMVQHHVVNHLTGQEVIGES 66 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+EAARI RG I L+ + LDA IVPGG+GAAKNLS+FA G+ CT+ ++ Sbjct: 67 RNVLVEAARIARGAIHNLSDIASLHLDAFIVPGGYGAAKNLSSFAFDGTPCTIHPDVATA 126 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIF-DFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 Q ++AGKP+GF+CI+P + K+ + +TIG D TA +E MGA H+ C V Sbjct: 127 IQLFYKAGKPMGFICISPVLAAKVLGSEKIEVTIGNDASTAASIEAMGARHINCVVTKAH 186 Query: 180 VDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 V + + IV+TPAYML ++A+ A+GI++LV V+ L + Sbjct: 187 VSKPHNIVSTPAYMLEASLADIATGIEQLVGNVVELVK 224 >UniRef50_C5LKS2 Putative uncharacterized protein n=4 Tax=Eukaryota RepID=C5LKS2_9ALVE Length = 348 Score = 229 bits (583), Expect = 6e-59, Method: Composition-based stats. Identities = 91/226 (40%), Positives = 129/226 (57%), Gaps = 12/226 (5%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTET 60 K++ V+LSGCG DGSEI EAV L +SR+ + CFAPD QQ+ V++H G Sbjct: 124 KRVAVVLSGCGHLDGSEIREAVFVLTELSRANTKYQCFAPDIQQMHVVDHSDGSIDSGSK 183 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+EA+RI R PL A + DAL+ PGGFGAAKNLSNFA GS +V E++ Sbjct: 184 RNVLVEASRIGREGTLPLTDLKAKDYDALMFPGGFGAAKNLSNFAVKGSGMSVHPEVERA 243 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTD--------IDTAEVLEEMGAEHVP 172 + +QAG P+G +CIAP + K+ + +T+G+D A+ ++E+G +H Sbjct: 244 IKEFNQAGHPIGLVCIAPVLAAKVLNA--EVTMGSDEVSEEYPNASAAQAVKEIGGKHFN 301 Query: 173 CPVDDIVVDEDNKIVTTPAYMLA-QNIAEAASGIDKLVSRVLVLAE 217 +++ VD K+VT+ AYM I E + +V L L Sbjct: 302 TKLNEAHVDTAKKVVTSAAYMNNTAPIHEIHESVAAMVRETLKLIN 347 >UniRef50_Q2GLV6 Es1 family protein n=5 Tax=Anaplasma RepID=Q2GLV6_ANAPZ Length = 221 Score = 224 bits (571), Expect = 1e-57, Method: Composition-based stats. Identities = 80/215 (37%), Positives = 117/215 (54%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 V+L GCG DGSEI EAVL LLA+ G C AP+ +QVDV++HL+G + E R+ Sbjct: 2 NCAVLLCGCGHMDGSEIREAVLALLALDSYGINVTCCAPNIKQVDVVDHLSGSTLEEERD 61 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 ++ E+ARI RG + + D LI+PGGFG AKN S+ S +V E+K Sbjct: 62 IMSESARIARGNVVDPKDISPNDFDMLILPGGFGVAKNYSDILKGESPVSVLEEVKQTIV 121 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 H+ K +G +CIAPA++ ++ + D ++ G EHV C DD V D Sbjct: 122 KFHKEKKAIGAICIAPAIVAASLSSVSKVKVTLGEDIDSIISRCGGEHVFCETDDYVADI 181 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 D + +TPAYM ++ + GI K+V ++ + Sbjct: 182 DMGVFSTPAYMRKDSLHKIHVGIHKMVGAMVDFVK 216 >UniRef50_D2W324 Glutamine amidotransferase domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2W324_NAEGR Length = 305 Score = 221 bits (562), Expect = 2e-56, Method: Composition-based stats. Identities = 89/216 (41%), Positives = 120/216 (55%), Gaps = 4/216 (1%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTETRN 62 + VILSGCG DGSEI EAV ++ +S+ G + FAPD Q + +HLT E RN Sbjct: 49 VAVILSGCGYLDGSEITEAVSVMVHLSKKGYKLAFFAPDINQEETYDHLTKNVEKNEVRN 108 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + EA+RITR + L Q +ALI+PGGFG AKNLSN+A + + E++ Sbjct: 109 IRTEASRITRQNVLRLDQFRPQAFEALIIPGGFGVAKNLSNYAENPTNFKIHPEVEKAIV 168 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIG-TDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + H+A P+G CIAP + K +T+G +D E + GA VP +VVD Sbjct: 169 STHEAKIPIGMCCIAPVLAAKAIPQ-CSITLGDSDPSVTEHAKSYGANCVPKSTSQVVVD 227 Query: 182 EDNKIVTTPAYMLAQNIA-EAASGIDKLVSRVLVLA 216 +DNK+VTTPAYM A E GI +V V+ L Sbjct: 228 KDNKLVTTPAYMGKNPTAFEVFDGIGSMVDSVIELT 263 >UniRef50_Q90257 ES1 protein, mitochondrial n=3 Tax=Danio rerio RepID=ES1_DANRE Length = 270 Score = 218 bits (556), Expect = 9e-56, Method: Composition-based stats. Identities = 73/233 (31%), Positives = 131/233 (56%), Gaps = 20/233 (8%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMT-ETR 61 I V+ SGCG +DG++IHEA T+ +SR+GA+ FAP++QQ+ V++H+ + + + R Sbjct: 37 NIAVVFSGCGWWDGTDIHEAAYTMYHLSRNGARFQIFAPNQQQMHVMDHMKMQPSSSDNR 96 Query: 62 NVLIEAARITRGE----IRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 N+++E+AR + G+ + L++ DA DA+I PGG G KN+S F+ G +C ++ ++ Sbjct: 97 NIMMESARFSHGQGMMQMNDLSKLDANSFDAVIFPGGHGIVKNMSTFSKDGKDCKLNNDV 156 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID------------TAEVLEE 165 + + + H+A KP+G +AP + ++ L +T+G + D + ++ Sbjct: 157 ERVLKDFHRARKPIGLSSMAPLLACRVLPS-LEVTMGYERDESSRWGRWPNTNMVQAVKS 215 Query: 166 MGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQN--IAEAASGIDKLVSRVLVLA 216 MGA H + VDE NK+++TP +M + GI +V V+ + Sbjct: 216 MGARHNTREPYEAYVDEKNKVISTPTFMWETDYHYHYIFDGIGNMVKHVMRMT 268 >UniRef50_A9V3H5 Predicted protein n=5 Tax=Fungi/Metazoa group RepID=A9V3H5_MONBE Length = 245 Score = 218 bits (554), Expect = 2e-55, Method: Composition-based stats. Identities = 101/223 (45%), Positives = 138/223 (61%), Gaps = 10/223 (4%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 ++ ++LSG GVYDG+E+HEA + A+SR GA FAPDKQQ V+NH+TGE M E+R Sbjct: 22 PRVALVLSGSGVYDGTEVHEASAAMGALSRQGADYKIFAPDKQQHHVVNHMTGEEMDESR 81 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL+EAARI RG I+ L + A+ DA++VPGGFGAAKNLSNFA G+ CTVD L + Sbjct: 82 NVLVEAARIARGNIQALDKLQVADFDAVVVPGGFGAAKNLSNFAVEGAACTVDATLTDIL 141 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID--------TAEVLEEMGAEHVPC 173 + H KP+GF CIAP + +F +T+G+D + A +EMGA++V Sbjct: 142 KKFHAEQKPMGFCCIAPVIAANLFKG--EVTMGSDTESDKWPFAGAAGACKEMGADYVVG 199 Query: 174 PVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 + VD+ N+IVT PA+M + + +V VL L Sbjct: 200 DESKVHVDQANRIVTAPAFMANTAVHLVQDNVTNMVQTVLELV 242 >UniRef50_C0R4T2 Enhancing lycopene biosynthesis protein 2, putative n=13 Tax=Rickettsiales RepID=C0R4T2_WOLWR Length = 241 Score = 214 bits (545), Expect = 2e-54, Method: Composition-based stats. Identities = 85/223 (38%), Positives = 124/223 (55%), Gaps = 10/223 (4%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K V+LSGCG DG E+ EAVL+LL + + CFAPD V+NH T EA E RN Sbjct: 13 KAAVVLSGCGHLDGVEVREAVLSLLVLDQQEVDVKCFAPDINITQVMNHRTKEATKEKRN 72 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL+EAARI RGEI L +A A D L+VPGG+G AKNLS+ A TV E + L Sbjct: 73 VLVEAARIARGEIYDLKEAKAENFDMLVVPGGYGVAKNLSDLAESKDMVTVMPEFERLVS 132 Query: 123 AMHQAGKPLGFMCIAPAMLPKIF-------DFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 KP+G +CI+PA++ I + +++TIG D + +++E +G EH+ C Sbjct: 133 EFFVTKKPIGAICISPAIIVSILSSKIGKEESKVKVTIGDDRE--QLIERLGGEHIKCDT 190 Query: 176 DDIVVDEDNKIVTTPAYML-AQNIAEAASGIDKLVSRVLVLAE 217 + + DE++ + + AYM ++ GI ++ ++ Sbjct: 191 ELSIEDEEHNVFSCSAYMRSDESTYSVYQGIKHMIDSMVKKIN 233 >UniRef50_Q21UI1 ThiJ/PfpI n=1 Tax=Rhodoferax ferrireducens T118 RepID=Q21UI1_RHOFD Length = 226 Score = 211 bits (537), Expect = 1e-53, Method: Composition-based stats. Identities = 93/220 (42%), Positives = 135/220 (61%), Gaps = 7/220 (3%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA-MTET 60 +KI V+L+GCG DG+E+ EAVLTLLA+ + GA C AP+ Q VINH+TGE Sbjct: 5 RKIAVLLAGCGHLDGAEVREAVLTLLALDQHGAAFQCIAPNAPQFHVINHITGEPVAGAQ 64 Query: 61 RNVLIEAARITR-GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 RN+L E++RI R G+ LA+A A+ DAL++PGG+G AKN +FA G++ V ++ A Sbjct: 65 RNILEESSRIARLGQCLDLAKAKVADYDALVMPGGYGVAKNNCSFAFKGADAEVRPDVAA 124 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIF---DFPLRLTIGTDIDTAEVLEEMGAEHVPCP-V 175 + A KP+G +CIAPA++ D LT+G D A + ++G H P Sbjct: 125 FVRGFFDAKKPVGAICIAPALVALALHQVDDSATLTLGNDAGVAAAMGQLGQRHQNTPNA 184 Query: 176 DDIVVDEDNKIVTTPAYMLAQN-IAEAASGIDKLVSRVLV 214 +IV+DE +K+VTTPAYM +++ GI++ V+ VL Sbjct: 185 REIVIDEAHKLVTTPAYMFDDARLSDVFVGIERCVAEVLK 224 >UniRef50_Q2RPB9 ThiJ/PfpI n=7 Tax=Bacteria RepID=Q2RPB9_RHORT Length = 227 Score = 209 bits (531), Expect = 7e-53, Method: Composition-based stats. Identities = 104/215 (48%), Positives = 142/215 (66%), Gaps = 1/215 (0%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 + V+LSGCGV+DG+EIHE+VLTLLAI R G A CFAPD+ Q VI+H +G+ ETR Sbjct: 5 PRFAVLLSGCGVFDGAEIHESVLTLLAIDRQGGVARCFAPDRPQYHVIDHRSGQPTGETR 64 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL E+ARI RG I LA D A DALI+PGGFGAAKNL +FA G++C VD ++ Sbjct: 65 NVLCESARIARGAIDDLADFDPAAFDALILPGGFGAAKNLCSFAIDGADCAVDPTVERAL 124 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 +A AG +G +CIAP +L ++F + LTIG+D TAE + +GA H ++VVD Sbjct: 125 RAARAAGLAIGALCIAPVVLARVFGEGV-LTIGSDAATAEAITALGAHHQKATHAEVVVD 183 Query: 182 EDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 ++V++P YML +I++ A G + V ++ L Sbjct: 184 RALRLVSSPCYMLDASISQIAEGAENTVKALIALI 218 >UniRef50_B9Z0T4 ThiJ/PfpI domain protein n=1 Tax=Lutiella nitroferrum 2002 RepID=B9Z0T4_9NEIS Length = 226 Score = 200 bits (509), Expect = 2e-50, Method: Composition-based stats. Identities = 83/221 (37%), Positives = 125/221 (56%), Gaps = 8/221 (3%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNV 63 + ++LSGCGVYDGSEI EAV ++A+S++G +APD+ Q+ V++H G+ E RN+ Sbjct: 5 VAIVLSGCGVYDGSEITEAVGVVIALSQAGLPYAFYAPDRAQMHVVDHARGQESGEARNI 64 Query: 64 LIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQA 123 L EAARI RG+IRPL + DAA+ A++ PGGFGAAKNL+ F G + + ++ A + Sbjct: 65 LSEAARIARGQIRPLTELDAAQHSAIVFPGGFGAAKNLTTFIKDGRDAVLYDDVAAAVRP 124 Query: 124 MHQAGKPLGFMCIAPAMLPKIFDF----PLRLTIGTDID---TAEVLEEMGAEHVPCPVD 176 Q KP+ +C AP + I + +T G+ + A+ L G HV PVD Sbjct: 125 FVQQHKPVVALCAAPLVQGLIARDEGLAGVNITFGSYAEGQAMADALTSWGQTHVETPVD 184 Query: 177 DIVVDEDNKIVTTPAYML-AQNIAEAASGIDKLVSRVLVLA 216 VD ++ ++ PAYM AE + ++ + L Sbjct: 185 QACVDLAHRFISAPAYMYGEATPAEVFASCQAAITALKSLL 225 >UniRef50_A4IY89 DJ-1/PfpI family protein n=24 Tax=Gammaproteobacteria RepID=A4IY89_FRATW Length = 219 Score = 196 bits (498), Expect = 4e-49, Method: Composition-based stats. Identities = 104/219 (47%), Positives = 147/219 (67%), Gaps = 3/219 (1%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHL--TGEAMT 58 M K+ V+LSGCG DGSEIHE VLT+LA+ + G + A ++ Q VINHL + ++ Sbjct: 1 MAKVAVVLSGCGYLDGSEIHETVLTILALEKQGVEWQGVALNRDQKQVINHLHQSVDSKA 60 Query: 59 ETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSEC-TVDREL 117 RN+L E+ARITRG + +A AD+ + DA+I PGGFGAAKN+ +FA +G++ +D E+ Sbjct: 61 SPRNILEESARITRGNVIDIADADSDDYDAIIFPGGFGAAKNIMDFAFVGNDSYQMDEEV 120 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDD 177 A+A + A KP G++CIAP M+P ++ + T+GTD +T +L + GAE + D Sbjct: 121 LKFARAFYLADKPAGYICIAPLMIPLVYPEGTKATVGTDENTTAILAKKGAEAIIMDATD 180 Query: 178 IVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 I VDE KIV+TPAYM A+NI EAA GI+KLV +V+ Sbjct: 181 ICVDESVKIVSTPAYMCARNILEAAQGIEKLVEKVVSYI 219 >UniRef50_UPI0000DB6CF4 PREDICTED: similar to es1 protein n=2 Tax=Apocrita RepID=UPI0000DB6CF4 Length = 226 Score = 194 bits (494), Expect = 1e-48, Method: Composition-based stats. Identities = 72/225 (32%), Positives = 124/225 (55%), Gaps = 12/225 (5%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHL--TGEAMTETR 61 + VIL GCG DG+EI EA+ ++ I + +APD + ++H + + +R Sbjct: 3 VAVILCGCGYLDGTEISEAMSAMIHICLKDMKPHFYAPDVNICETVDHFIKKPDPDSPSR 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N L+EAARI R +I+PL Q A + +AL++PGGFGAAK LSNFA G++CT+ +L+ + Sbjct: 63 NALVEAARIARSDIKPLCQCQACKHEALVIPGGFGAAKTLSNFAEKGADCTIHPDLEQII 122 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTD--------IDTAEVLEEMGAEHVPC 173 + + GKP+ +CI+ ++ ++ +++T+G + D + ++MGA+ Sbjct: 123 EDFYYEGKPIASICISSVLVARVLK-GVKITLGKESPAEEWPFADAIKKAKDMGAKIEQK 181 Query: 174 PVDDIVVDEDNKIVTTPAYMLA-QNIAEAASGIDKLVSRVLVLAE 217 V + + + +TPA+M AE +GI KL+ + Sbjct: 182 SVKGMTKCKKYNVFSTPAWMYKPATFAEIYTGIGKLIGTMKKHMN 226 >UniRef50_P30042-2 Isoform Short of ES1 protein homolog, mitochondrial n=6 Tax=Euarchontoglires RepID=P30042-2 Length = 237 Score = 192 bits (488), Expect = 6e-48, Method: Composition-based stats. Identities = 87/225 (38%), Positives = 117/225 (52%), Gaps = 42/225 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-TET 60 ++ ++LSGCGVYDG+EIHEA L+ +SR GA+ FAPD Q+ VI+H G+ E+ Sbjct: 43 ARVALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES 102 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL E+ARI RG+I LA AA DA I PGGFGAAKNL Sbjct: 103 RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNL------------------- 143 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIG---------TDIDTAEVLEEMGAEHV 171 CIAP + K+ + +T+G TAE ++ +GA+H Sbjct: 144 ------------LCCIAPVLAAKVLR-GVEVTVGHEQEEGGKWPYAGTAEAIKALGAKHC 190 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 V + VD+ NK+VTTPA+M + GI +V +VL L Sbjct: 191 VKEVVEAHVDQKNKVVTTPAFMCETALHYIHDGIGAMVRKVLELT 235 >UniRef50_UPI000186E026 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186E026 Length = 256 Score = 189 bits (479), Expect = 7e-47, Method: Composition-based stats. Identities = 71/218 (32%), Positives = 107/218 (49%), Gaps = 9/218 (4%) Query: 7 ILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIE 66 +LSG G+ DG+EIHEA + +SR Q F+ Q DVI+H E RN L+E Sbjct: 39 VLSGSGMMDGTEIHEASACAVHLSRLDIQPKFFSVPCPQTDVIDHYKLSPTNEMRNALVE 98 Query: 67 AARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQ 126 +ARI RG+I + + E D LI PGGFG AK L+ F G+ C V+ E+ + Sbjct: 99 SARIARGKICSINSLTSDEADVLIFPGGFGVAKTLTTFDKDGANCGVNEEVVRVVNEFCA 158 Query: 127 AGKPLGFMCIAPAMLPKIFDFPLRLTIGT--------DIDTAEVLEEMGAEHVPCPVDDI 178 KP+ F CIA + +IF +++T+G + + + +MGA VD Sbjct: 159 CRKPMAFTCIAAILPARIFP-GVKVTLGKKGDPKKWPHSEAIDTVSDMGAVVEVKNVDSF 217 Query: 179 VVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 D+ + TTPA+M + GI ++ + + Sbjct: 218 TFDKQFLVFTTPAFMYEGTFYQIFEGIGNMIEALQKIM 255 >UniRef50_Q2NWH9 Sigma cross-reacting protein 27A (SCRP-27A) n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NWH9_SODGM Length = 171 Score = 186 bits (472), Expect = 5e-46, Method: Composition-based stats. Identities = 115/169 (68%), Positives = 135/169 (79%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK+IGV+LSGCGV DGSEI EAVLTLLAI R+G AVCFA DK Q+ V+NHL+GE E Sbjct: 1 MKRIGVVLSGCGVNDGSEIQEAVLTLLAIDRTGLDAVCFATDKPQLQVVNHLSGEQTDER 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+EAARI RG+I+PLA A A +LDALIVPGGFG AKNLSN A G++C VD EL L Sbjct: 61 RNVLVEAARIARGQIQPLAAASAEDLDALIVPGGFGVAKNLSNLAQTGADCEVDAELAQL 120 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAE 169 QA+H KPLGF+CIAPA+LPKI PLRLT+GTD+D AE+++ MGA Sbjct: 121 VQALHLQRKPLGFICIAPALLPKILAVPLRLTLGTDVDAAEMVDTMGAI 169 >UniRef50_C5CG98 ThiJ/PfpI domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CG98_KOSOT Length = 206 Score = 178 bits (451), Expect = 1e-43, Method: Composition-based stats. Identities = 65/190 (34%), Positives = 105/190 (55%), Gaps = 2/190 (1%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K G++LSGCG+ DG++I E +LT L++ + G + FAP++ Q DVI+H T + E RN Sbjct: 2 KAGILLSGCGLGDGTQIEEVMLTYLSLDKYGIDYITFAPNEMQHDVIDHYTEKPQNEKRN 61 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 +LIE+ARI RG+I + + ++DA+I+PGG G KNLS F TV++ + L + Sbjct: 62 ILIESARIGRGKICDIREVSCKDIDAIIIPGGLGVFKNLSTFIVDKKSFTVNKNVDDLLK 121 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLR--LTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 AM+ + K + +C A ++ K + + E+L E+ V C + V+ Sbjct: 122 AMYLSKKSIAGICGAVILIAKSLSQHVSDLKVATANDAYGELLSELNVNAVNCSAKECVI 181 Query: 181 DEDNKIVTTP 190 D + P Sbjct: 182 DRKKQSSNYP 191 >UniRef50_C1C1F4 Enhancing lycopene biosynthesis protein 2 n=2 Tax=Caligus RepID=C1C1F4_9MAXI Length = 231 Score = 167 bits (422), Expect = 3e-40, Method: Composition-based stats. Identities = 73/227 (32%), Positives = 111/227 (48%), Gaps = 13/227 (5%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 + V+LSGCG DGS+ E L LA+SR + + +AP +NH+ G RN Sbjct: 5 NVAVLLSGCGHLDGSDPLEVSLLCLALSRLDIKPIFYAPYMSMSTGVNHVNGAEAETGRN 64 Query: 63 VLIEAARITRGEIRPLAQADAAE--LDALIVPGGFGAAKNLSNF-ASLGSECTVDRELKA 119 VLIE+AR+ + + L + D ++ L ALI+PGG G N S+F SL + +V +E+ Sbjct: 65 VLIESARLVKESVLKLDELDPSDETLSALIIPGGHGPLNNFSDFKTSLETPPSVIKEILG 124 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDID--------TAEVLEEMGAEHV 171 + + AGKP+G A ++ + +T+G+ + A L E G Sbjct: 125 IIEGFKAAGKPIGCTSHANILVALAIPN-IEITLGSRDEEECPVASLVAPGLIEQGTTVT 183 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLA-QNIAEAASGIDKLVSRVLVLAE 217 P V ++ VD +NKIVT A + A E A I ++ L E Sbjct: 184 PTSVYEVQVDFENKIVTAAASLFASAKYHEVADQITIFFDMLMTLIE 230 >UniRef50_A6NYE6 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NYE6_9BACE Length = 189 Score = 162 bits (410), Expect = 6e-39, Method: Composition-based stats. Identities = 61/212 (28%), Positives = 84/212 (39%), Gaps = 28/212 (13%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K V+L+GCG+ DGS I E VLT A+ + G A D V ++H+T + E R+ Sbjct: 2 KFLVLLAGCGLGDGSCIEEVVLTYTALDKYGCDYTPAAADM-LVPSMDHITEQP-GEKRS 59 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 VL E+AR RG IR L + DAL++PGG G N + Sbjct: 60 VLTESARTGRGRIRNLHDISPDDYDALLIPGGIGLVVNYRESGL----------VADWVN 109 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 Q KP+G MC L I L D+D D Sbjct: 110 RFVQQKKPIGTMCAGIDFLRGILGAGLLREEVRDLDAVSFCR----------------DT 153 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVSRVLV 214 I TPA+ + + G+D +V +L Sbjct: 154 SGAIFYTPAFRKTGSCHDVMLGVDAMVHAMLE 185 >UniRef50_Q8WQI1 Lycopene biosynthesis-enhancing protein n=2 Tax=Tetrahymena thermophila RepID=Q8WQI1_TETTH Length = 380 Score = 151 bits (382), Expect = 1e-35, Method: Composition-based stats. Identities = 60/122 (49%), Positives = 90/122 (73%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 KK+ +ILSGCGVYDGSE+ E V ++ +++S CFAP++ Q+ V+NH+TGE TET Sbjct: 46 FKKVAIILSGCGVYDGSEVTEVVSLMVHLNKSHVSFQCFAPNQDQLHVVNHITGETTTET 105 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNVL+E+ARI RGE++ + Q + A+++PGGFGAAKNLS++A G+ TV+ E++ + Sbjct: 106 RNVLVESARIARGEVKDITQLKGEDYQAVLLPGGFGAAKNLSDYAVNGTNFTVNSEVERV 165 Query: 121 AQ 122 + Sbjct: 166 LR 167 >UniRef50_D2V705 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V705_NAEGR Length = 250 Score = 142 bits (357), Expect = 9e-33, Method: Composition-based stats. Identities = 58/230 (25%), Positives = 106/230 (46%), Gaps = 18/230 (7%) Query: 6 VILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE-AMTETRNVL 64 VILSGCG DGS++ E+V ++ ++R G F+P ++ + N++T + +E R + Sbjct: 18 VILSGCGFMDGSDVVESVSVIVELTRKGIVPRFFSPHEEIDESYNYITKQIDSSEERYMH 77 Query: 65 IEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLG---SECTVDRELKALA 121 E+ARI R +I + Q A + D L++PGG G +NLSNF +E V+ ++ Sbjct: 78 KESARIAREKILSIDQLRADQFDMLVIPGGNGVVRNLSNFEQEEYNVNEVEVNSHVEKAI 137 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLT-------IGTDID---TAEVLEEMGAE-- 169 + KP+GFM + + K T +G +D ++ + G E Sbjct: 138 VDFFKQKKPIGFMSNSVILGAKALGKVSGKTGNGIAVALGKTLDRVFVETLMTKFGNELS 197 Query: 170 HVPCPVDDIVVDEDNKIVTTPAYMLAQNIA--EAASGIDKLVSRVLVLAE 217 + + D ++I + + A + E + L+ ++ L + Sbjct: 198 QESGDAEVVCTDSSHRIASVASVSAAGTVQPNEIHAAAKNLIEELIDLTK 247 >UniRef50_D1CDU1 Intracellular protease, PfpI family n=40 Tax=Bacteria RepID=D1CDU1_THET1 Length = 197 Score = 137 bits (344), Expect = 3e-31, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 80/209 (38%), Gaps = 38/209 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI + + DG E E A+++ GA+A + ++ +N + + Sbjct: 8 KKIAFLAT-----DGVEQVELTEPWKAVTQEGAEAHLISIKSGEIQGVNGMDKADTFKVD 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + + Q A+E DAL++PGG + + ++++ L Sbjct: 63 --------------KTVDQVSASEYDALVLPGG----------VANPDKLRMNQDAVRLV 98 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + ++GKP+ +C P L + D T+ + ++ G V ++VVD Sbjct: 99 REFVESGKPVAAICHGPWTLVEA-DVVRGRTLTSYPSLKTDIKNAGGNWVD---QEVVVD 154 Query: 182 EDNKIVTT----PAYMLAQNIAEAASGID 206 + PA+ A+ I E GI Sbjct: 155 QGIITSRNPNDLPAF-CAKLIEEVQEGIH 182 >UniRef50_B9TI86 Protease C56, putative n=1 Tax=Ricinus communis RepID=B9TI86_RICCO Length = 199 Score = 134 bits (336), Expect = 3e-30, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 72/203 (35%), Gaps = 45/203 (22%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFA--PDKQQVDVINHLTGEAMTE 59 K I V+++ DG E E + + GA+ + P + V NHLT + Sbjct: 23 KHIAVLMT-----DGVEQVEYTQPRQFLEQQGAEVTLVSTKPKGEAVQGFNHLTPANTFD 77 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + A + DAL++PGG + ++ Sbjct: 78 VE--------------LDVRDARPVDFDALVLPGG----------VANPDNLRLNTTAIT 113 Query: 120 LAQAMHQAGKPLGFMCIAPA-MLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + + KP+ +C P ++ R+T + + L GA+ V + + Sbjct: 114 FIREFARENKPIAAICHGPWTLIDAGVAQGKRMT--SWPSLKQDLSNAGAQWVD---EQV 168 Query: 179 VVDEDNKIVTT------PAYMLA 195 VVD K+VT+ PA+ A Sbjct: 169 VVD--GKLVTSRKPDDIPAFNKA 189 >UniRef50_C5DA95 Intracellular protease, PfpI family n=11 Tax=cellular organisms RepID=C5DA95_GEOSW Length = 184 Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 72/193 (37%), Gaps = 26/193 (13%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH-LTGEAMTE 59 M K +I++G D E E + G AP K+++ + H TG E Sbjct: 1 MSKKVLIVTG----DAVEALEVYYPYYRLLEEGYDVTIAAPKKKKLQTVVHDFTGWDTYE 56 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + + A A D + D +++PGG +D +L+ Sbjct: 57 EKQAYLIEAHAA------FADIDPTQYDGIVIPGG-----------RAPEYIRLDADLQR 99 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + + +A KP+ +C A + + D ++ I +E +GA +V Sbjct: 100 IVRHFFEANKPIAAICHASLIFETMPDLLKGRSLTAYIACKPGVEALGATYVSDSTT--H 157 Query: 180 VDEDNKIVTTPAY 192 VD+ +V+ A+ Sbjct: 158 VDQ--NLVSAHAW 168 >UniRef50_Q1QTA8 Peptidase C56, PfpI n=27 Tax=Bacteria RepID=Q1QTA8_CHRSD Length = 204 Score = 130 bits (328), Expect = 2e-29, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K++ ++ + G E E A+ G + +PD + + E Sbjct: 25 KRVAILAT-----HGFEESELSAPRAALRSQGVEVHVVSPDGKGIRAWAETDWGDTYEAD 79 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + L+ D+ + AL++PGG E ++ + Sbjct: 80 --------------KALSDVDSTDYHALVLPGG----------LFNPDELRLNDQALDFV 115 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + +AGKP+ +C AP +L + + AE L+ GAE V + +VVD Sbjct: 116 RGFFEAGKPVAAICHAPWILINA-GVVEGRRMTSVASVAEDLKNAGAEWVD---EKVVVD 171 Query: 182 EDNKIVTTP----AYMLAQNIAEAASG 204 TP A+ + I E A G Sbjct: 172 NGLVTSRTPKDLDAFN-DKLIEELAEG 197 >UniRef50_D1CA42 Intracellular protease, PfpI family n=4 Tax=Bacteria RepID=D1CA42_SPHTD Length = 238 Score = 130 bits (326), Expect = 3e-29, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 65/189 (34%), Gaps = 33/189 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 +++ +L+ DG E E + A+ +GA+ A +V + + Sbjct: 24 QRVAALLT-----DGVEQVELTEPMKALQEAGAEVKIVALKSGKVKAWDFDHWGEEFDVD 78 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + A+ + AL++PGG ++ + Sbjct: 79 --------------LTIDHANPNDFQALLLPGG----------VMNPDTLRMNEKAVQFV 114 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + M ++GKP+ +C P ML + D T+ + + G + V ++VVD Sbjct: 115 RQMVRSGKPVASICHGPWMLVEA-DVVEGRTLTSYPSLQTDIRNAGGKWVD---QEVVVD 170 Query: 182 EDNKIVTTP 190 + P Sbjct: 171 QGIVTSRNP 179 >UniRef50_Q26CT8 Intracellular protease, PfpI family n=3 Tax=Bacteria RepID=Q26CT8_9BACT Length = 182 Score = 129 bits (325), Expect = 6e-29, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 69/210 (32%), Gaps = 40/210 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ ++ + +G E E A+ +GA +P + N E Sbjct: 3 KKVAILAT-----NGFEEIELTSPKKALEDAGATVHIISPTGDSIKAWNGGNWSQTYEVD 57 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 ++ A++ ++L++PGG + D + Sbjct: 58 YA--------------VSDVSASDYNSLMLPGG----------VLNPDQLRQDEKSIDFI 93 Query: 122 QAMHQAGKPLGFMCIA--PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + + KP+ +C P + + + + + + +E G V +++V Sbjct: 94 KDFFKQQKPVSAICHGIQPLIDADVVN---GRKLTSYPSLKKDVENAGGHWVD---EEVV 147 Query: 180 VDEDNKIVTTP---AYMLAQNIAEAASGID 206 VDE TP A A+ + E G Sbjct: 148 VDEGFTTSRTPDDLAAFNAKLVEEVKEGKH 177 >UniRef50_B9XL12 Intracellular protease, PfpI family n=1 Tax=bacterium Ellin514 RepID=B9XL12_9BACT Length = 236 Score = 129 bits (325), Expect = 6e-29, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 62/188 (32%), Gaps = 33/188 (17%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ V+ + DG E E + + + GAQ + ++ +N L +N Sbjct: 10 RVAVLAA-----DGVEQIELTSPVKHLEKHGAQIEVISLHPGKIKGMNLL-----LPGKN 59 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + + R + +A+ DAL++PGG + + Sbjct: 60 IKVN---------RTIFRANPDNYDALLIPGGH----------INPDFLRQSDSVLQFVR 100 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A KP+ +C P +L T+ + + + G V + V D Sbjct: 101 EFDAANKPIAVICHGPWVLVSA-GVVKNRTLTSWPGIKDDVINAGGNWVN---NAAVRDG 156 Query: 183 DNKIVTTP 190 + +P Sbjct: 157 NWISSRSP 164 >UniRef50_C8WTD3 ThiJ/PfpI domain protein n=5 Tax=Bacillales RepID=C8WTD3_ALIAD Length = 223 Score = 125 bits (315), Expect = 7e-28, Method: Composition-based stats. Identities = 49/213 (23%), Positives = 85/213 (39%), Gaps = 39/213 (18%) Query: 2 KKIGVILSGC-----GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 KK+ ++++ G G + E ++G + +P Q I+ + + Sbjct: 4 KKVLMVVTSADKMTDGHPTGLWLSEFAEPYTEFKQAGYEVTVASPRGGQAP-IDERSVQ- 61 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 N EA I + I PL+Q A++ DA+ +PGG G + + A E Sbjct: 62 -NGELNQWPEAVEILKQTI-PLSQVSASDYDAIFLPGGHGTMFDFPDSA----------E 109 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAEV------- 162 L+AL + ++GK + +C PA L + R+T TD + V Sbjct: 110 LQALIRTFAESGKVVAAVCHGPAGLVNVRLSNGDPLVKGKRVTAFTDEEERAVKLDDKVP 169 Query: 163 ------LEEMGAEHVPCPVDDIVVDEDNKIVTT 189 L E+GA+ V P+ V+ D ++T Sbjct: 170 FMLETRLRELGAQFVAQPMWSDHVERDGNLITG 202 >UniRef50_Q2NCQ5 Protease n=5 Tax=Proteobacteria RepID=Q2NCQ5_ERYLH Length = 179 Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 66/201 (32%), Gaps = 33/201 (16%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK++ +I + DG E E + + +G + ++ ++ + Sbjct: 1 MKRVLIIAT-----DGFEQSELMKPKKRLEEAGIDTTVASLEEGEITGWKDKNWGDSVKV 55 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + A + AL++PGG ++ + L Sbjct: 56 D--------------LTVEEVSADDYGALLLPGG----------QINPDALRMNDTVIGL 91 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + ++A KP+ +C AP +L + + T+ L GA + + V Sbjct: 92 IREFNEANKPIAAICHAPWLLAEA-NIIRDRTVTGWPSIRTDLSNAGANVID---SEAAV 147 Query: 181 DEDNKIVTTPAYMLAQNIAEA 201 D + P + A + A Sbjct: 148 DGNLITARNPDDIPAFSNALI 168 >UniRef50_A1VNH7 Intracellular protease, PfpI family n=12 Tax=Proteobacteria RepID=A1VNH7_POLNA Length = 204 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 47/203 (23%), Positives = 78/203 (38%), Gaps = 35/203 (17%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 I ++++ DG E E A+ SG + QV + H + ++ + Sbjct: 13 NIAILVT-----DGFEQEEMTGPQAALEESGVMIRLLSDRTGQVQGVRH---DQPGDSFD 64 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 V + A E DA+++PGG A S + + + L + Sbjct: 65 VDT-----------TFDKVTADEFDAVLLPGG----------AVNASRIRNNADAQELVR 103 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 M Q GKPL +C AP +L + + + + LE+ GA+ V + +VVD Sbjct: 104 QMDQQGKPLAMICHAPWLLVSA-GLVKGRKMTSAPELQKDLEQAGAQWVD---EKVVVDR 159 Query: 183 DNKIVTTPAYMLAQNIAEAASGI 205 + PA + A N A GI Sbjct: 160 NWVSSRKPADIPAFNAA--FKGI 180 >UniRef50_A1TMJ4 Intracellular protease, PfpI family n=11 Tax=Bacteria RepID=A1TMJ4_ACIAC Length = 199 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 43/189 (22%), Positives = 68/189 (35%), Gaps = 33/189 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 ++I ++++ DG E E A+ +G A AP QV NH+ + Sbjct: 18 RRIALLVT-----DGFEQAELTGPRDALEGAGFDAQIVAPKPGQVQGFNHVDKADRFDVD 72 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + L QA DA+++PGG E D + +A Sbjct: 73 --------------QTLDQASPDAFDAVVLPGG----------VVNADELRTDEKARAFV 108 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 QA+ +AGKP+ +C +L T+ + A L GA+ V PV V+ Sbjct: 109 QAIDRAGKPVAVICHGAWLLIDA-GLVKGKTLTSWPSLATDLRNAGAQWVDRPVQ---VE 164 Query: 182 EDNKIVTTP 190 P Sbjct: 165 GRWISSRKP 173 >UniRef50_B8DQK9 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=B8DQK9_DESVM Length = 246 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 48/221 (21%), Positives = 77/221 (34%), Gaps = 40/221 (18%) Query: 1 MKKIGVILSG-----CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE 55 MK + ++ S G G + E + +GA+ +P V E Sbjct: 1 MKVLMIVTSNDRLGDTGHKTGLWLEELAAPYYVFTDAGARVTLASPKGGAAPVDPRSETE 60 Query: 56 AM---TETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECT 112 T R AA + PLA+ + D L PGG G +L + Sbjct: 61 EAQNRTTRRFTADPAAMAALKDTVPLAEVRPEDYDVLFYPGGHGPLWDLVD--------- 111 Query: 113 VDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDT------------- 159 D A+ + MH+AGKP+ +C PA+L + + + T Sbjct: 112 -DARSLAIIEKMHRAGKPVAAVCHGPAVLVRATTPDGKPLVARRNMTGFSNAEEDAVGLS 170 Query: 160 -------AEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PA 191 + L +GA++ P+ + V D +VT PA Sbjct: 171 QVVPFLLQDELTRLGAKYERGPLWEPHVVADGLLVTGQNPA 211 >UniRef50_C6CWE5 Intracellular protease, PfpI family n=6 Tax=Bacteria RepID=C6CWE5_PAESJ Length = 200 Score = 122 bits (307), Expect = 6e-27, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 73/203 (35%), Gaps = 32/203 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M +I++G D +E+ E + G +AV +P ++ + + H E Sbjct: 1 MSNKVLIVTG----DAAEVLEVYYPYYRLLEEGYEAVIASPTQKILHTVCH----DFIEG 52 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + E + D ++ A+I+PGG + EL + Sbjct: 53 WDTYTEKPAHQLQSHLGFSDVDPSDYAAIIIPGG-----------RAPEYIRGNAELPRI 101 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI--FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 Q A KP+G +C + + + + T+ + +E +GA + + + Sbjct: 102 LQHFIDADKPIGAICHGAQVFLSLPDYSYFNGRTMTAYNASRLEVERLGACYAD---ETL 158 Query: 179 VVDEDNKIVTT------PAYMLA 195 VD K+VT P +M Sbjct: 159 HVD--GKLVTGHAWPDLPGFMRE 179 >UniRef50_A1K1D1 ThiJ/PfpI family protein n=17 Tax=cellular organisms RepID=A1K1D1_AZOSB Length = 193 Score = 122 bits (306), Expect = 9e-27, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 67/191 (35%), Gaps = 21/191 (10%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI +I D +E +E ++ A+ +G P K+ + + T E Sbjct: 4 KKILMI-----TGDYTEDYETMVPFQALLMAGHTVHAACPGKKAGEQV--RTAIHDFEGD 56 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 E A + DAL++PGG ++ ++ A Sbjct: 57 QTYSEKPGHNFTLNAEFDSLRAEDYDALVIPGG-----------RAPEYLRLNPKVIAAV 105 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 Q A KP+ +C +L T A ++ G E+ PVD VD Sbjct: 106 QHFFAADKPVAAICHGAQVLAAA-GVLKGRTCSAYPACAPEVKAAGGEYAEIPVDKARVD 164 Query: 182 EDNKIVTTPAY 192 K+VT PA+ Sbjct: 165 --GKLVTAPAW 173 >UniRef50_C5CQ44 Intracellular protease, PfpI family n=11 Tax=Bacteria RepID=C5CQ44_VARPS Length = 191 Score = 121 bits (304), Expect = 1e-26, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 69/201 (34%), Gaps = 37/201 (18%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K+ ++++ DG E E A+ +GAQ +P V L E Sbjct: 14 KVAILVA-----DGFEQAEMTEPRKALELAGAQTQIVSPLDGSVRAWKQLEPADTFEVD- 67 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 PL ADA + DAL++PGG + ++++ A + Sbjct: 68 -------------VPLKNADADDFDALLLPGG----------VANPDALRINQKAVAFVR 104 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A ++GKP+ +C P L + + L GA+ V ++VVD Sbjct: 105 AFVESGKPIAAICHGPWTLIDA-GGVQGRRMTSWPSLRADLHNAGAKWVDK---EVVVDS 160 Query: 183 DNKIVTT----PAYMLAQNIA 199 PA++ Sbjct: 161 GLVTSRRPGDLPAFIREMTTM 181 >UniRef50_A3IN92 ThiJ/PfpI n=2 Tax=Cyanothece RepID=A3IN92_9CHRO Length = 234 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 79/221 (35%), Gaps = 41/221 (18%) Query: 2 KKIGVILSG---CGVYD---GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVIN--HLT 53 K+I +L+ G D G + EAV +G + +P+ +V + L Sbjct: 3 KQILFVLTSHRQLGDTDQKTGIWLEEAVNPYYRFLEAGFEVTLASPNGGEVPIDEKSILD 62 Query: 54 GEAMTETRNVLI-EAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECT 112 +TR+ E A+ L + +A DA+ +PGG G +L Sbjct: 63 DAQTEDTRHFFQDETAQKCFKNTVRLTEVEADNYDAIFIPGGHGPMWDLCE--------- 113 Query: 113 VDRELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID------- 158 + +L L +A +A K + +C A L F LT ++ + Sbjct: 114 -NEKLANLVEAFDRADKVIAAVCHGSAGLLSAKKADGTPFVAGKELTSFSNSEEETVGLH 172 Query: 159 ------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PA 191 L+E+GA + V +D ++T PA Sbjct: 173 ELVPFLLESRLKELGANYTNADDFQAKVVQDGNLITGQNPA 213 >UniRef50_Q5ZU31 Intracellular protease, ThiJ/PfpI family n=4 Tax=Legionella pneumophila RepID=Q5ZU31_LEGPH Length = 198 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 73/206 (35%), Gaps = 23/206 (11%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI +I V D +E E A+ + +P K + D I + Sbjct: 9 KKILMI-----VGDFNEDLEVYFPYQAMLMNNYLIDTVSPGKNKGDFIQTAVHDLDKPHL 63 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 E ++ + DALI+PGG A +++++ A+ Sbjct: 64 QSYTEKLGHLFPITADFSKITLKDYDALIIPGGRAA-----------EYQRLNKDILAII 112 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI--FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + H A KP+ +C +L + + T+G + + G + +D +V Sbjct: 113 RHFHDADKPIACICHGIQILAEAGILEDKKCTTVGF---CEPDVRKAGGHFIDTGMDGVV 169 Query: 180 VDEDNKIVTTPAYMLAQNIAEAASGI 205 VD K+VT ++ A I Sbjct: 170 VD--GKLVTGATWLGNAPWMRAFLHI 193 >UniRef50_Q7MQ54 Putative uncharacterized protein VV0154 n=1 Tax=Vibrio vulnificus YJ016 RepID=Q7MQ54_VIBVY Length = 261 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 44/194 (22%), Positives = 69/194 (35%), Gaps = 27/194 (13%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI ++ + DG E E ++ L + GA AP K+ G + E R Sbjct: 73 KKIAILAT-----DGVEELEILVPLNYLREVGANVTIVAPRKKIYP---ETLGLKIPENR 124 Query: 62 NVLIEAARITRG----EIRP-LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 I R+ +I + + + D L++PGG A D E Sbjct: 125 RTHIMTVRLMENSGWLKIDKYIDEVSFDDFDGLVLPGG----------AWNPDFLRTDVE 174 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 + L + + + KPL +C P +L I + LE GA+ PV Sbjct: 175 AQNLVREIVNSNKPLATICHGPLVLINS-GLVKDRKITGYWAIMKDLENAGAKVYDQPV- 232 Query: 177 DIVVDEDNKIVTTP 190 V+D + P Sbjct: 233 --VIDGNLISSRFP 244 >UniRef50_Q5FQ93 Protease I n=1 Tax=Gluconobacter oxydans RepID=Q5FQ93_GLUOX Length = 175 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 70/190 (36%), Gaps = 32/190 (16%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ + + DG E E A+ ++GA + + +N + Sbjct: 1 MVKVAALAT-----DGLEEIELTGPQEALEKAGATVTVISLKAGEFQAVN-KDIYPSNKI 54 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R L +A A + DAL++PGG + +++++ A Sbjct: 55 RADL------------AIADAKVEDYDALLLPGG----------LASPDALRMNKDVVAF 92 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 A+A +A KP+ +C L ++ + T+ + LE GA V +++VV Sbjct: 93 ARAFVKANKPIAAICHGAQTLIEV-NELKGRTVTSWPAIRTDLENAGASWVD---NEVVV 148 Query: 181 DEDNKIVTTP 190 D P Sbjct: 149 DGPYVFSRCP 158 >UniRef50_Q464Y3 Putative intracellular protease n=2 Tax=cellular organisms RepID=Q464Y3_METBF Length = 189 Score = 118 bits (296), Expect = 1e-25, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 73/203 (35%), Gaps = 23/203 (11%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K +IL+G D +E +E + ++ G Q AP+K+ DV+ + + T Sbjct: 1 MSKKILILTG----DCAEDYEVKVPQQSLQMLGYQVDIAAPNKKTGDVLQLVVHDFTTLD 56 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + RI +++ A + L+VPGG + E L Sbjct: 57 TYIELPGHRIPVD--VSVSEVKADDYAGLVVPGG-----------RAPEYIRMYDETIKL 103 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q AGKP+ +C +L + + A GA + +++ Sbjct: 104 VQDFFAAGKPVAVICHGLQLLAAA-KVLEGYKVTSYPACAPECRLAGANWQS---ESVII 159 Query: 181 DEDNKIVTTPAYMLAQNIAEAAS 203 D+ +VT A+ A Sbjct: 160 DK--NLVTAQAWPDHPAWLRAFV 180 >UniRef50_Q1DD54 Peptidase, C56 (PfpI) family n=2 Tax=Cystobacterineae RepID=Q1DD54_MYXXD Length = 223 Score = 118 bits (295), Expect = 1e-25, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 66/208 (31%), Gaps = 41/208 (19%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ V+ + DG E E + + R GA +P K ++ + N Sbjct: 8 RVAVLAA-----DGFEQVELTAPVKKLERQGADVTIVSPHKGRIRGM------------N 50 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 +LI R++ L + AA+ DA+++PGGF + Sbjct: 51 LLIPGKRVSVDA--SLREVKAADFDAVLLPGGF----------VNPDLLRQSALALDFVR 98 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 P+ +C P +L + + + G V PV Sbjct: 99 DADALDMPIAVICHGPWVLISA-GLVEGRALAAWPGIRDDVRNAGGRWVDEPVM-----R 152 Query: 183 DNKIVTTPAYMLAQNIAEAASGIDKLVS 210 D V++P + + I +V Sbjct: 153 DGNWVSSPG------PRQMFAFIKGMVE 174 >UniRef50_Q0ULG5 Putative uncharacterized protein n=2 Tax=Pleosporineae RepID=Q0ULG5_PHANO Length = 236 Score = 117 bits (292), Expect = 3e-25, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 74/209 (35%), Gaps = 32/209 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K ++++ DGSE E V ++R+G + D + + H+T Sbjct: 44 MPKALILIA-----DGSEEIEFVTPYDVLTRAGFEVQSVGVDL-KNEGYAHMTRNVRIVP 97 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + + Q D LI+PGG AK S + + L Sbjct: 98 DHTNLTSF---------PHQLAHEHYDILILPGGGPGAKTFST----------NPSVLQL 138 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 ++ ++GK + +C L +T + + ++ G E+ + +VV Sbjct: 139 IKSFVRSGKFVAAICAGTTALVAAGIEKKIVT--SHPSVMQEIKGAGWEY---SEERVVV 193 Query: 181 DEDNKIVTTP--AYMLAQNIAEAASGIDK 207 D P A + + I E G +K Sbjct: 194 DGKVVTSRGPGTALLFSLTIVEVMVGKEK 222 >UniRef50_D2S4J1 Intracellular protease, PfpI family n=2 Tax=Frankineae RepID=D2S4J1_9ACTO Length = 189 Score = 116 bits (290), Expect = 6e-25, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 66/187 (35%), Gaps = 33/187 (17%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNV 63 + V+ + DG E E A+ +GAQ + + ++ H+ + V Sbjct: 10 VAVLAT-----DGVEQVELDRPWQALEEAGAQPELVSLEAGEITAYEHIDKGDSKKVDAV 64 Query: 64 LIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQA 123 + + D E DAL++PGG G D + A +A Sbjct: 65 VSSS--------------DPDEYDALVLPGG----------VINGDFVRADADAVAFVKA 100 Query: 124 MHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDED 183 AGKP+ +C A +L + D + + L GA V +++VVD + Sbjct: 101 FFDAGKPVAAICHAGWVLAEA-DVVRGRRMTSWPSLQTDLRNAGATWVD---EEVVVDGN 156 Query: 184 NKIVTTP 190 P Sbjct: 157 LVTSRNP 163 >UniRef50_UPI00017898AF intracellular protease, PfpI family n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017898AF Length = 168 Score = 115 bits (289), Expect = 7e-25, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 64/190 (33%), Gaps = 36/190 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ +L+ D E E A+ +G QA + Q T+ Sbjct: 1 MSKVAFLLA-----DQFEDSEMKTPYDAVKEAGHQADIIGLKQGQEVKGKQGKASYTTDK 55 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +A + + DA+++PGG S +D + Sbjct: 56 ----------------AIADVNINDYDAVVIPGG-----------SSPENLRLDSHILQF 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 A +AGKP+ +C P +L D TI + + + GA +++VV Sbjct: 89 VAAADKAGKPIASICHGPQILASA-DLLKGRTITSYPPLQDDMVNAGANFKD---EEVVV 144 Query: 181 DEDNKIVTTP 190 D + TP Sbjct: 145 DRNFITSRTP 154 >UniRef50_Q313C6 Peptidase C56, PfpI n=12 Tax=cellular organisms RepID=Q313C6_DESDG Length = 255 Score = 115 bits (288), Expect = 9e-25, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 58/191 (30%), Gaps = 19/191 (9%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I ++ V D E +EA++ + G P K+ + + T E Sbjct: 64 KRILML-----VGDFVEDYEAMVPFQMLGMVGHTVHAVCPGKKAGETV--KTAVHDFEGD 116 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 E D A DAL+VPGG ++ + + Sbjct: 117 QTYTEKPGHNFMLNCDFDSVDVAHYDALVVPGG-----------RAPEYIRLNARVIEIV 165 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + +A KP+ +C +L T +E GA Sbjct: 166 RQFDKARKPIAAVCHGQQVLVTA-GVLQGHTCTAYPAVKPDVEAAGATWCEVNDTASNAC 224 Query: 182 EDNKIVTTPAY 192 +VT PA+ Sbjct: 225 VSGHVVTAPAW 235 >UniRef50_B2JT66 ThiJ/PfpI domain protein n=4 Tax=Burkholderiales RepID=B2JT66_BURP8 Length = 230 Score = 113 bits (283), Expect = 4e-24, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 76/225 (33%), Gaps = 40/225 (17%) Query: 3 KIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDV---INHLT 53 KI ++L+ G G + E +G +P Q + + Sbjct: 2 KILMVLTSHDQLGNTGKKTGFWLEEFAAPYFTFLDAGVTLTVSSPKGGQPPLDPKSDTPE 61 Query: 54 GEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTV 113 G+ R AA+ L A + DA+ PGG G +L+ Sbjct: 62 GKTELTERFKNDPAAQKVLANTVKLDTVKADDYDAVFYPGGHGPMWDLAE---------- 111 Query: 114 DRELKALAQAMHQAGKPLGFMCIAPAMLPKI------FDFPLRLTIGTDID--------- 158 DR AL ++ + AGKP+ F+C AP +L + R+T T+ + Sbjct: 112 DRRSIALIESFYNAGKPVAFVCHAPGVLRHVKVNGEPLVKGKRVTGFTNSEEEAVQLTKV 171 Query: 159 ----TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PAYMLAQN 197 + L+ +G + D ++VT PA A Sbjct: 172 VPFLVEDELKRLGGHFEKVDDWQPLSIIDGRLVTGQNPASSTAGA 216 >UniRef50_Q5HPG8 ThiJ/PfpI family protein n=9 Tax=Staphylococcus RepID=Q5HPG8_STAEQ Length = 219 Score = 112 bits (281), Expect = 6e-24, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 74/213 (34%), Gaps = 39/213 (18%) Query: 2 KKIGVIL-SGCGVYDGSE----IHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 KK+ +L S DG+E + EA ++ G + + +N Sbjct: 3 KKVLFVLTSTSQFTDGTETGLWLEEAGAPYNILTEEGINVDVISIKGGK---VNLDPNSV 59 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 E+ N + + + +A E DA+ +PGG G + +N + + Sbjct: 60 SNESLNQYAKFVSH-LNDTPSIENVNADEYDAIYLPGGHGTVYDFAN----------NEK 108 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID----------- 158 L + + K + +C P++ + +++T TD + Sbjct: 109 LADILLQFKNSNKIISSVCHGPSVFVGVKDANNHYLVDGVKITSFTDSEEKAMGFENKVP 168 Query: 159 --TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 T LEE GA V V++D + +T Sbjct: 169 FLTQSKLEEQGANFVVKDDFTSHVEKDGQFITG 201 >UniRef50_A4FDV5 Protease I n=8 Tax=Bacteria RepID=A4FDV5_SACEN Length = 189 Score = 112 bits (281), Expect = 6e-24, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 65/189 (34%), Gaps = 32/189 (16%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 ++I V+ + DG E E A+ ++G Q + ++ +N G+ R Sbjct: 8 RRIAVLAT-----DGVEQVEYEQPRQAVEQAGGQVSLVSVHDGEIQAMN---GDIDKGDR 59 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + ++ + D L++PGG +D + Sbjct: 60 FTVD----------AKVSDVSPDDFDGLVLPGG----------TINPDRLRIDADAVGFV 99 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 ++ Q GKP+G +C P L + D + + L GAE V + +V D Sbjct: 100 RSFVQQGKPVGAICHGPWTLVEA-DVVRGRRVTSFPSIRTDLRNAGAEVVD---EQVVTD 155 Query: 182 EDNKIVTTP 190 + P Sbjct: 156 QGLVTSRNP 164 >UniRef50_B1KEU0 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria RepID=B1KEU0_SHEWM Length = 255 Score = 112 bits (281), Expect = 7e-24, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 79/223 (35%), Gaps = 44/223 (19%) Query: 2 KKIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE 55 K++ +++S G G+ E + + G + V + + I+ L+ + Sbjct: 30 KRVLIVMSSESAMGISGKLTGTWFEEVATPYYTLRKEGYEVVMASLEGGDAP-IDLLSMQ 88 Query: 56 AMTETRNVLIE----AARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSEC 111 A T N A L++ + + DAL PGG+G +L++ Sbjct: 89 APFTTPNTDKFLNDIVAMHALENTNKLSEINPDDFDALFFPGGYGLLWDLAS-------- 140 Query: 112 TVDRELKALAQAMHQAGKPLGFMCIAPAML-----PKIFDFPLRLTIGTDIDTAEV---- 162 D + + + A KP+ +C APA+L LT+ ++ + Sbjct: 141 --DSMTIKMIEDFYAANKPIAMVCHAPAILRDAKKANGEPLVKGLTVTGFMNAEDDELDL 198 Query: 163 -----------LEEMGAEHVPCPVDDI-VVDEDNKIVTT--PA 191 L+ G + + + ++ D ++T PA Sbjct: 199 SRHLLFSLEDMLKANGGNYKKTKKNWVAHIEIDGPLMTGQNPA 241 >UniRef50_D2W0D5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W0D5_NAEGR Length = 197 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 64/192 (33%), Gaps = 16/192 (8%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITR 72 V D E +EA++ A++ G +P K++ D + + + + E Sbjct: 10 VGDYVEDYEAMVPYQALTMVGHSVSVISPGKKKGDKVVTAIHDFLPGEQ-TYTELKGHNF 68 Query: 73 GEIRPLAQA--DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKP 130 + LI+PGG + + + + + KP Sbjct: 69 AITADFDDVLSNLDNFGGLILPGG-----------RCSEYLRLHDNVLTIVKHFLEKKKP 117 Query: 131 LGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 + +C P +L + I ++ GAE+V C +D VV + IVT Sbjct: 118 IAAICHGPLILTPFPEHLKGKRISAYFACKHDIQNTGAEYVQCGAEDAVV--SDNIVTGV 175 Query: 191 AYMLAQNIAEAA 202 A+ A Sbjct: 176 AWPGHPKWLRAF 187 >UniRef50_Q5JGM7 Intracellular protease 1 n=22 Tax=cellular organisms RepID=PFPI_PYRKO Length = 166 Score = 110 bits (274), Expect = 4e-23, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 67/197 (34%), Gaps = 42/197 (21%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K+ ++ + DG E E + L I G + + + ++ + T Sbjct: 2 KVLILSA-----DGFEDLELIYPLHRIKEEGHEVYVASFQRGKITGKHGYTVNVD----- 51 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + D E DAL++PGG ++ + A+ + Sbjct: 52 -------------LAFDEVDPDEFDALVLPGG-----------RAPEIVRLNEKAVAITK 87 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 M + GKP+ +C P +L + + + ++ GAE + ++VVD Sbjct: 88 KMFEDGKPVASICHGPQILISA-GVLKGRKGTSTVTIRDDVKNAGAEWIDA---EVVVDG 143 Query: 183 DNKIVTTP----AYMLA 195 + P A+M Sbjct: 144 NWVSSRHPGDLYAWMRE 160 >UniRef50_B1K8Q4 ThiJ/PfpI domain protein n=220 Tax=cellular organisms RepID=B1K8Q4_BURCC Length = 228 Score = 109 bits (272), Expect = 7e-23, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 70/217 (32%), Gaps = 40/217 (18%) Query: 3 KIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHL---T 53 KI V+L+ G G + E +G + +P Q + T Sbjct: 2 KILVVLTSHDTLGDTGKKTGFWLEELAAPYYTFKDAGIELTLASPKGGQPPLDPKSSDPT 61 Query: 54 GEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTV 113 + R AA+ R LA A + DA+ PGG G +L+ Sbjct: 62 AQTDATRRFDADAAAKAELASTRKLADVSADDYDAVFYPGGHGPLWDLAE---------- 111 Query: 114 DRELKALAQAMHQAGKPLGFMCIAPAMLPKI--------FDFPLRLTIGTDID------- 158 D L + AGKP+ +C AP +L + R T T+ + Sbjct: 112 DLHSIGLIERALAAGKPVAAVCHAPGVLRHVKNPQTGESVVRGKRATGFTNSEEAAVELT 171 Query: 159 ------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 ++L+ GAE V D ++T Sbjct: 172 EVVPFLVEDMLKTNGAEFERSADWAPHVVTDGLLITG 208 >UniRef50_D1YEB9 Intracellular protease, PfpI family n=4 Tax=Actinomycetales RepID=D1YEB9_PROAC Length = 179 Score = 109 bits (272), Expect = 7e-23, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 60/163 (36%), Gaps = 24/163 (14%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G E E V L A+ ++G + + + + + TG+ ++ Sbjct: 13 GVEEAELVEPLNALKKAGIEVTVASNSGESIQTV---TGDKDWASK----------VNAD 59 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 LA A A++ D L++PGG +D + + L + AGKP+G +C Sbjct: 60 SRLADAKASDYDLLVIPGG----------TVNADTLRIDEDGRRLVKEFATAGKPVGAIC 109 Query: 136 IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 P +L D T+ + I LE G V + Sbjct: 110 HGPWVLIDA-DVAKGKTMTSYISIRPDLENAGVSWVDKELFRC 151 >UniRef50_Q48CJ9 Protease PfpI n=12 Tax=Bacteria RepID=Q48CJ9_PSE14 Length = 228 Score = 109 bits (272), Expect = 7e-23, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 70/197 (35%), Gaps = 41/197 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I ++++ DG E E A+ ++GA + ++ +V NH + Sbjct: 57 KRIAILVT-----DGFEQVELTGPKEALEQAGATVEILSTEEGKVKGWNHDKPADDFKID 111 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 A+A++ +++PGG +D + + L Sbjct: 112 RT--------------FKAANASDYHGVVLPGG----------VQNSDTIRIDTDAQKLV 147 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + + +GKP+ +C +L T+ + + L GA+ V V Sbjct: 148 KDIEGSGKPVAVICHGGWLLISA-GLVKGKTLTSFKTLKDDLVNAGAKWVDQE-----VV 201 Query: 182 EDNKIVTT------PAY 192 D ++++ PA+ Sbjct: 202 TDGTLISSRQPDDIPAF 218 >UniRef50_B2UQP3 DJ-1 family protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQP3_AKKM8 Length = 187 Score = 109 bits (271), Expect = 9e-23, Method: Composition-based stats. Identities = 45/191 (23%), Positives = 69/191 (36%), Gaps = 37/191 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ ++ + G E E + L + R V +V + +T T Sbjct: 1 MKKVAILAAP-----GFEEIELMAPLDILRRLNMDVVLAGVQSDKVVSTHEVTVSTDTM- 54 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 L + A +LDALI+PGG G E+ L Sbjct: 55 -----------------LDKLHADKLDALILPGGAG-----------SWVLRDTPEVIHL 86 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + MH+AGK + +C AP +L K +T D L E GA V +++V Sbjct: 87 VKKMHEAGKLVAAICAAPIVLAKAGLVRDRNVTAYPAQDVYRELNEAGAHIV--KDENVV 144 Query: 180 VDEDNKIVTTP 190 +D + P Sbjct: 145 LDGNMLTANGP 155 >UniRef50_C7PRI4 ThiJ/PfpI domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRI4_CHIPD Length = 257 Score = 109 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 81/234 (34%), Gaps = 51/234 (21%) Query: 2 KKIGVILSG-CGVYDGS----EIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 KK+ ++++ + DG+ + E + + + +P+ + V + Sbjct: 31 KKVLIVVTSFSALKDGTKMGLWLEEFTTPYYLLKENNIELTIASPEGGKAPV------DP 84 Query: 57 MTETRNVLIEAARITRG---------EIRPLAQADAAELDALIVPGGFGAAKNLSNFASL 107 + + L +A+ G L+ A + DA+ PGG +L Sbjct: 85 RSILPDFLTPSAKQFLGDGQAQKVLNNTVKLSTVKAKDYDAVFYPGGHAPMWDLPE---- 140 Query: 108 GSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID-- 158 + + AL QA + KP+ F+C PA L I F +T ++ + Sbjct: 141 ------NAKSVALIQAFIEQQKPVAFVCHGPAALKNIKTKSGAYFTSGKTVTGYSNNEEQ 194 Query: 159 -----------TAEVLEEMGAEHVPCPV-DDIVVDEDNKIVTTPAYMLAQNIAE 200 ++L+E GA++ +D ++T A A+ Sbjct: 195 TGQTTHLIPFSLEDMLKERGAKYEKSETPWGPFAVQDGLLITGQNPASAAPTAQ 248 >UniRef50_B2IIZ8 ThiJ/PfpI domain protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IIZ8_BEII9 Length = 242 Score = 108 bits (270), Expect = 1e-22, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 73/219 (33%), Gaps = 35/219 (15%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+IG+I + G E E + GA +P+ + + + +T Sbjct: 56 KRIGIIAT-----HGVEETEISIPRKWFEERGATCHLVSPNHIEYGATFGIQFPEIAKTH 110 Query: 62 NVLIEAARITRGEIRP--LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + I+ + + + + DA+ VPGG A + V+ + Sbjct: 111 VLAIQFTENSGWIPIDARIEEVSVEDYDAVYVPGG----------AWNPDQLRVNPAVLK 160 Query: 120 LAQAMHQAGKPLGFMCIA-PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 Q GKP+G +C L + T + E + GA + PV Sbjct: 161 YLQDFQSTGKPVGALCHGSQVFLSAKLLKGRKAT--GYWNIMEDMANAGAHVLDEPV--- 215 Query: 179 VVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE 217 VVD + V T ++ I + V ++ L Sbjct: 216 VVDGN---VITSRFIYD---------IPQFVKAIIDLLN 242 >UniRef50_Q0SRB0 DJ-1 family protein n=35 Tax=Bacteria RepID=Q0SRB0_CLOPS Length = 191 Score = 108 bits (269), Expect = 1e-22, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 73/194 (37%), Gaps = 37/194 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ V L+ +G E EA+ + +R+ + C A + +N G + Sbjct: 1 MKKVLVFLA-----EGFETIEALSVVDVCNRA--KVTCHACSLTENRTVNSAHGTMVLCD 53 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + ++ D DA+++PGG + NL + + ++++L Sbjct: 54 K---------------LISDNDLETYDAIVLPGGMPGSTNLRD----------NEKVQSL 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + ++ K + +C AP L K + G + + +E D +VV Sbjct: 89 IKKYNEENKIVAAICAAPIALAKA-----GVIEGKKVTSYPGFKEELGNVNYVEEDTVVV 143 Query: 181 DEDNKIVTTPAYML 194 D + PA L Sbjct: 144 DGNTITSRGPATAL 157 >UniRef50_B8KZH8 Intracellular proteinase PfpI n=1 Tax=Stenotrophomonas sp. SKA14 RepID=B8KZH8_9GAMM Length = 221 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 63/189 (33%), Gaps = 32/189 (16%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I ++ + G E E + GA+ +P K+ I + Sbjct: 49 KQIAILAT-----HGFEQSELTEPKRLLEAEGARVSVVSPAKE--ATIKGWKDKDWGGVV 101 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 V PL +ADA DAL++PGG D Sbjct: 102 AV-----------DLPLDEADAGRFDALVLPGG----------VINPDTLRTDEAALGFI 140 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 +++ +AGKP+ +C P +L + + + L GA+ ++VVD Sbjct: 141 RSVAEAGKPVAAICHGPWLLINSGLAD-GRELTSWPSLQQDLANAGAKWRNA---EVVVD 196 Query: 182 EDNKIVTTP 190 + P Sbjct: 197 GNVITSRKP 205 >UniRef50_C5PN70 C56 family peptidase n=2 Tax=Sphingobacterium spiritivorum RepID=C5PN70_9SPHI Length = 177 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 61/191 (31%), Gaps = 34/191 (17%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAP-DKQQVDVINHLTGEAMTE 59 M KI ++ + DG + E + + G Q +P D + V NH Sbjct: 1 MNKIAILAA-----DGFKEIELKSPKIYLQNKGFQVDIVSPKDIEFVRSWNHFDWGPSYP 55 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 LA+AD A +ALI+PGG V + Sbjct: 56 ID--------------VHLAEADPAVYEALILPGG----------TLSPDALRVLPKALD 91 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + + K + +C L + D+ + + + L+ GA + ++ Sbjct: 92 FIKHFIEQKKLIAAICHGAWPLVE-LDYVKGKRMTSVSNIRSDLKNAGAIWED---EAVI 147 Query: 180 VDEDNKIVTTP 190 D + TP Sbjct: 148 QDGNLISSRTP 158 >UniRef50_Q0B5J2 ThiJ/PfpI domain protein n=18 Tax=Proteobacteria RepID=Q0B5J2_BURCM Length = 237 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 79/236 (33%), Gaps = 44/236 (18%) Query: 4 IGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM 57 + +L+ G G + E L + +G + + V + L Sbjct: 13 VLFVLTSHATKGATGEPTGFYLGEVTHPLAELDAAGIPVEFASIQGGEPPV-DGLELTDE 71 Query: 58 TETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 R A R + L DA++ A+ GG GA + + ++ Sbjct: 72 VNARYWNDSAFRDALRHTQRLGDVDASKYAAVFFAGGHGAMWDFPG----------NADV 121 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEV-------------- 162 + + +A+++AG +G +C PA L + L G ++ Sbjct: 122 QQVTRAVYEAGGVVGAVCHGPAALVDVTLGDGTYLVAGKNLGAFTDEEERAVQLDHVVPF 181 Query: 163 -----LEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVL 213 L + GA H P P V D ++VT ++A+G+ + +L Sbjct: 182 LLASTLTQRGAHHHPAPSWTAKVVVDGRLVT-------GQNPQSAAGVGAAIRYLL 230 >UniRef50_Q9V1F8 Intracellular protease 1 n=6 Tax=cellular organisms RepID=PFPI_PYRAB Length = 166 Score = 107 bits (266), Expect = 3e-22, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 63/198 (31%), Gaps = 44/198 (22%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ ++ + D E E + + G + + + + + + T Sbjct: 2 RVLILSA-----DQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVD----- 51 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + + E DAL++PGG ++ + +A+ Sbjct: 52 -------------LAFEEVNPDEFDALVLPGG-----------RAPERVRLNEKAVEIAK 87 Query: 123 AMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 M GKP+ +C P ++ R T + + + G + V ++VVD Sbjct: 88 KMFSEGKPVASICHGPQILISAGVLRGRRGT--SYPGIKDDMINAGVDWVDA---EVVVD 142 Query: 182 EDNKIVTTP----AYMLA 195 + P A+M Sbjct: 143 GNWVSSRVPGDLYAWMRE 160 >UniRef50_A0Q9X2 Intracellular protease, PfpI family protein n=4 Tax=Bacteria RepID=A0Q9X2_MYCA1 Length = 180 Score = 106 bits (265), Expect = 5e-22, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 57/168 (33%), Gaps = 29/168 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI ++ + DG E E A+ +GA + ++ NH A T T Sbjct: 4 KKIAILAA-----DGVEKVELEQPAAALREAGAGVEVVSLQDGEIQARNHDLEPAGTFTV 58 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + R +A A + D L++PGG + +D + Sbjct: 59 D-------------RKVADASVDDFDGLVLPGG----------TVNPDKLRLDDTAVSFV 95 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAE 169 + +GKP+ +C P L + T+ + L GA Sbjct: 96 RDFVGSGKPVAAICHGPWTLVEA-GVAAGRTLTSYPSIRTDLRNAGAH 142 >UniRef50_B3DRT7 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme n=6 Tax=Bifidobacterium RepID=B3DRT7_BIFLD Length = 182 Score = 106 bits (265), Expect = 5e-22, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 67/190 (35%), Gaps = 33/190 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+ +I++ G E E L + +G + A + + H E T Sbjct: 6 SKVLIIVNNW----GIEETELTRPLRDLKAAGVKVTLAATTLDPCETVQHDRYEGET--- 58 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 L AR L+ AA+ D L+VPGG V+ + LA Sbjct: 59 --LTPDAR--------LSDVQAADYDLLVVPGG----------TCNVDRIRVNEDAITLA 98 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 Q GKP+ +C +L T A +E G +V + + VD Sbjct: 99 QEFAHEGKPIAAICHGAWLLVNA-GLVAGKTAAPCRYIAADIENAGGHYVD---EQLHVD 154 Query: 182 EDN--KIVTT 189 + N K++T+ Sbjct: 155 DANGFKLITS 164 >UniRef50_Q88JC4 Protease PfpI n=3 Tax=Bacteria RepID=Q88JC4_PSEPK Length = 180 Score = 106 bits (264), Expect = 6e-22, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 58/190 (30%), Gaps = 35/190 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K++ +++ DG E E A+ SGA + + V NH Sbjct: 9 KRVAFLVT-----DGFEQVELTGPREALENSGAVVDILSEKEGTVRGWNHDKPADAFSVD 63 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 A DAL++PGG + + L Sbjct: 64 AT--------------FDSAQLDLYDALVLPGG----------VQNSDTIRLIPGAQKLV 99 Query: 122 QAMHQAGKPLGFMCIAPAML-PKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 ++ AG+PL +C +L R+T + + + G V + +VV Sbjct: 100 KSHDAAGRPLAVICHGAWLLISSGLAKGKRMT--SYKTLQDDIRNAGGTWVD---EQVVV 154 Query: 181 DEDNKIVTTP 190 D + P Sbjct: 155 DGNLITSRQP 164 >UniRef50_Q47QQ3 Peptidase C56, PfpI n=12 Tax=Bacteria RepID=Q47QQ3_THEFY Length = 189 Score = 106 bits (264), Expect = 6e-22, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 67/200 (33%), Gaps = 40/200 (20%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNV 63 + V+++ +G+E E + AI +G + + +V +HL Sbjct: 11 VAVLIA----PEGAEQVELTVPWDAIRDAGGRPRLISTTGGRVQAFDHLDRADTFTVDTT 66 Query: 64 LIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQA 123 + Q A + DAL++PGG + A + Sbjct: 67 --------------VHQVTAHDFDALLLPGG----------VANPDYLRTHERAVAFVRE 102 Query: 124 MHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDED 183 G+P+ +C A +L + T+ + L GA V ++VV + Sbjct: 103 FFDTGRPVAAICHALWILIEA-GVVRGRTLTSYPSLRTDLVNAGATWVDR---EVVVSTE 158 Query: 184 N--KIVTT------PAYMLA 195 ++T+ PA+ A Sbjct: 159 GPSTLITSRNPKDLPAFTKA 178 >UniRef50_UPI0001B4C882 ThiJ/PfpI domain-containing protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4C882 Length = 230 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 50/238 (21%), Positives = 76/238 (31%), Gaps = 43/238 (18%) Query: 1 MKKIGVILS---------GCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH 51 M KI +++S G G E ++G +P + H Sbjct: 1 MSKILIVMSAASIWERTDGSEYPTGYWAEELAAPHEKFVQAGFAVDFASPGGVLQPLDAH 60 Query: 52 -LTGEAMTETRNVLIEAARITR---GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASL 107 E +E A G + L + D + A+++PGG G +L Sbjct: 61 SADPEIAGPDCAHYVEHAARALSEFGPLLKLDEIDINDYVAVVIPGGHGPVVDLYK---- 116 Query: 108 GSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDF-------PLRLTIGTDID-- 158 DR+L L AGK +G +C PA L D +T TD + Sbjct: 117 ------DRDLGRLLTEADAAGKIIGAVCHGPAGLLSAVDENGKWLFAGREMTAFTDEEEQ 170 Query: 159 -----------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGI 205 A L + GA+H P +D + T + +AEA G Sbjct: 171 SFGTAEGAPWLLASTLRQKGAKHSGGPAYQAYNVQDRNLFTGQNPASSAPMAEAMIGA 228 >UniRef50_A5WBS8 ThiJ/PfpI domain protein n=2 Tax=Psychrobacter RepID=A5WBS8_PSYWF Length = 175 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 71/208 (34%), Gaps = 38/208 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQ-AVCFAPDKQQVDVINHLTGEAMTET 60 K+I +L+ E E + G + + D + V + +A T T Sbjct: 3 KRIAFLLNSH-----FEQAEYADVDRLLQDKGYETVLITTNDNKTVQAMQQDVDKADTFT 57 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 ++ L++A A + DA+++PGG +D+ + Sbjct: 58 ADLF-------------LSEASAEDYDAVVLPGG----------TVNADTIRIDKSAQNF 94 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + A KP+ +C AP +L T+ +E G V V Sbjct: 95 VKQFYDANKPVAAICHAPWLLVNS-GLVKGKTVTAYPSLQTDIENAGGTFVDKSVQQ--- 150 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKL 208 D I+T+ +I + + IDK Sbjct: 151 --DGNIITS---RKPDDIEDFVAAIDKA 173 >UniRef50_C5RPJ4 DJ-1 family protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RPJ4_CLOCL Length = 187 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 67/207 (32%), Gaps = 39/207 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK V+ + G E EA+ + + R Q + + V Sbjct: 1 MKKAAVLFAT-----GFEEIEALTVVDVLRRGKVQCDMVSLYGENVVG------------ 43 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 A I + + + D +++PGG + NL D + L Sbjct: 44 ------AHAIEIKTDKDFDLVNFKDYDIIVLPGGMPGSTNL----------RADDRVINL 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + K +G +C AP +L K I + + LE A + +VV Sbjct: 88 VKDFNNKNKFIGAICAAPIVLEKAEVVGT-RKITSYPGS---LENQNA--FDYKEEIVVV 141 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDK 207 D + PA + ++ I K Sbjct: 142 DGNLITSRGPATAIEFSLKLIELLIGK 168 >UniRef50_C5A5Q9 Peptidase C56, intracellular protease PfpI family (PfpI) n=4 Tax=Euryarchaeota RepID=C5A5Q9_THEGJ Length = 166 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 65/197 (32%), Gaps = 42/197 (21%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K+ + + + E E + L I G + + ++ ++ + + E Sbjct: 2 KVLFLSA-----NDFEDVELIYPLHRIREEGHEVYIASFERGRITGKHGYSVEVH----- 51 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + D E DAL++PGG ++ + A+A+ Sbjct: 52 -------------LRFDEVDPDEFDALVLPGG-----------RAPERIRLNEKAVAIAK 87 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 M + GKP+ +C P +L + + + G + V PV VVD Sbjct: 88 KMFEDGKPVATICHGPQILISA-GVLKGRKGTSYAGIKDDMINAGVKWVDEPV---VVDG 143 Query: 183 DNKIVTTP----AYMLA 195 + P A+M Sbjct: 144 NWVSSRHPEDLYAWMRE 160 >UniRef50_A1AQV7 Metal dependent phosphohydrolase n=3 Tax=Bacteria RepID=A1AQV7_PELPD Length = 388 Score = 105 bits (263), Expect = 8e-22, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 70/202 (34%), Gaps = 41/202 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M + + L+ DG E EA+ + + R+G + V V+ + ++ Sbjct: 1 MPRALIPLA-----DGFEEIEAMTVVDVLRRAGFEVVLAGLHGGPVESVRRVSVIPDAT- 54 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + A + + D +I+PGG A NLS D + L Sbjct: 55 -----------------IDAARSDQFDMVILPGGQPGAANLS----------ADVRVIRL 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + K +G +C A +L + R+T D + L GA++ +V Sbjct: 88 LNDFSKDNKLIGAICAATTVLSEAGLIRGKRVT--AYPDYRDRL--PGAQYED---SAVV 140 Query: 180 VDEDNKIVTTPAYMLAQNIAEA 201 +D P +A +A Sbjct: 141 IDGKIITSQGPGTAMAFALAIV 162 >UniRef50_Q0C397 Intracellular protease, PfpI family n=6 Tax=Alphaproteobacteria RepID=Q0C397_HYPNA Length = 208 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 +KI ++++ GSE E A+ GA+ V + + + + + Sbjct: 35 RKIAILIA----PRGSEDSEFTEPRKAVEAEGAETVIISTQLGKAETM-----------K 79 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 L A + + +Q + DALI+PGG + + A Sbjct: 80 GDLDPAGQYDVDAV--FSQVHVDDFDALIIPGG----------TVGSDKLRASLDAVAFV 127 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 A + KP+ +C P ++ + + + ++ G + V +++VVD Sbjct: 128 SAFFRQSKPVAAICHGPWLIVEA-GAAKGRKLTSYSSLRTDIQNAGGDWVD---EEVVVD 183 Query: 182 EDNKIVTTP 190 +P Sbjct: 184 NGLITSRSP 192 >UniRef50_C2BZS9 C56 family peptidase n=1 Tax=Listeria grayi DSM 20601 RepID=C2BZS9_LISGR Length = 201 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 64/206 (31%), Gaps = 49/206 (23%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I ++S D E E +L + GA + V + G T Sbjct: 34 KRILALVS-----DDFEDLELWYPVLRVREEGATVDLVGEEAG--HVYHGKYGVPATSD- 85 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + + D ++VPGG+ + ++ Sbjct: 86 --------------YSFDEIKPEDYDGILVPGGW-----------SPDKLRRYEKVLDFI 120 Query: 122 QAMHQAGKPLGFMCIAPAML-PKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + KP+G +C A +L + +T + + + GA + VV Sbjct: 121 KYFDREKKPIGQICHAGWVLISAGILDGVNVT--STPGIKDDMTNAGAIWHN----EAVV 174 Query: 181 DEDNKIVTT---------PAYMLAQN 197 + + I + PAY+ A + Sbjct: 175 TDRHIISSRRPPDLPEYLPAYIKAFS 200 >UniRef50_D0SXB4 Intracellular protease n=2 Tax=Acinetobacter RepID=D0SXB4_ACILW Length = 188 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 61/198 (30%), Gaps = 35/198 (17%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 +KK+ I S G E E L + G + + A + V + T A T Sbjct: 7 VKKVLFITSNQ----GIEHDELTEPLNFLKSKGFEVIHAAEKNEDVATVKGDTKAATQYT 62 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + D + D L++PGG ++++ + + Sbjct: 63 PD-------------TSFDHVDPNDYDLLVIPGG----------TVNADTLRINQDAQKI 99 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q KP+ +C P L + + LE G + V V Sbjct: 100 IQHFADNHKPIAAICHGPWTLIDA-QRIKDKNLTSYKSIKLDLENAGGKWVDEQVHRCNT 158 Query: 181 DEDNKIVTT------PAY 192 D ++T+ PA+ Sbjct: 159 -GDWVLITSRNPDDLPAF 175 >UniRef50_UPI0001979AA3 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis protein ThiJ n=1 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI0001979AA3 Length = 193 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 74/199 (37%), Gaps = 37/199 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK + V ++ G E E V + + R+G + V + D + + H Sbjct: 1 MKNVMVPIAR-----GFEEIELVSVVDILRRAGVRVVLVSLDSHKRVLGAH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N++IEA L + D+ + DA+I+ GG+ +NL+N + + Sbjct: 47 -NIVIEAD-------NALPEFDSEDFDAIILVGGYNGMQNLAN----------NELVTLW 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + K + +C +P +L K T ++ +E+ +V Sbjct: 89 LKQFENSQKLIAAICASPIVLDKAGVLKGDFTCYPGCESQINMEK-----KNKKSSAVVK 143 Query: 181 DEDNKIVTTPAYMLAQNIA 199 + + T PA + + Sbjct: 144 NGNIITSTGPATAVVFALE 162 >UniRef50_Q7NER9 Glr3809 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NER9_GLOVI Length = 526 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 76/233 (32%), Gaps = 42/233 (18%) Query: 1 MKKIGVILSGCGVYDG-------SEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLT 53 MK I ++ S ++G + E + + + +P + V Sbjct: 2 MKTILMVTSSHDRFEGPDPRPTGVWLEEFAVPYMELLARKIGITVASPRGGAMPVDPRSN 61 Query: 54 GEAMTETR-NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECT 112 + + IEA+R PLA + DA+ +PGG G +L + Sbjct: 62 PTPEQQQQWQAAIEASR----ATLPLAGMASENFDAIFLPGGHGPMFDLPD--------- 108 Query: 113 VDRELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAEV--- 162 + +L L ++AGK + +C PA L + LT T + Sbjct: 109 -NPDLARLLTEFYKAGKIIAAICHGPAGLVGARRPDGAPLVAGVTLTSYTASEEVAAELD 167 Query: 163 ----------LEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGI 205 L +GA + ++ D + +T + +IA A Sbjct: 168 KEVPFILEDRLRALGAHFIARENKADHIERDGQFITGQNPNSSTSIARAIVAA 220 >UniRef50_A7H8I0 Intracellular protease, PfpI family n=6 Tax=Bacteria RepID=A7H8I0_ANADF Length = 189 Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 65/190 (34%), Gaps = 36/190 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M +I I V D E E + + +G + + ++ + + G+ E Sbjct: 1 MARIAFI-----VDDMFEDSELRVPYDRLRDAGHEVIVVGLEQGKR-----IEGKQKKEK 50 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 V A A ELDAL++PGG+ ++ L Sbjct: 51 LTVERAA-----------KDVRAQELDALVIPGGY-----------SPDHLRTSIDMVRL 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + M AGKP+ +C P ML + D T+ + L GA V ++V Sbjct: 89 TRDMFVAGKPVAAVCHGPWMLVEA-DAIDGRTVTSWPSLKTDLINAGARWVDR---EVVE 144 Query: 181 DEDNKIVTTP 190 D + P Sbjct: 145 DGNLITSRNP 154 >UniRef50_B9M1L7 Intracellular protease, PfpI family n=6 Tax=Bacteria RepID=B9M1L7_GEOSF Length = 246 Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 63/187 (33%), Gaps = 35/187 (18%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 +I ++ + DG E E + L A+ +GA + ++ +N Sbjct: 11 RIALLAA-----DGFEKVELEVPLKALRLAGATVDVVSLRPGRIRGVN------------ 53 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + E A + + + +AD + + +PGGF E + Sbjct: 54 -MHEPAGKVQVTMT-VQEADPKNYEGIFIPGGF----------INPDLLRQSAEAREFVH 101 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 + +GKP+ +C +L TI + + + GA + V Sbjct: 102 SFDVSGKPIATICHGAWVLASA-GMLRGRTITSWPGIRDDVVNAGAIWLDQA-----VVR 155 Query: 183 DNKIVTT 189 D ++T+ Sbjct: 156 DGNLITS 162 >UniRef50_B0N5W0 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0N5W0_9FIRM Length = 183 Score = 104 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 43/198 (21%), Positives = 72/198 (36%), Gaps = 41/198 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ V+ +DG E EA+ + + R+ + DK +V + + Sbjct: 1 MKKVAVL-----FHDGFEEVEALSVVDIMRRANVECTMVGMDKLEVTSSHQI-------- 47 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +I +I DA+++PGG A NL + D + L Sbjct: 48 --------KIKMDQIYD----GLDNYDAVVIPGGMPGASNLRD----------DSRVIDL 85 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + GK +G +C P +L + D T+ E L +G+ + V Sbjct: 86 VKQFNHDGKIIGAICAGPIVLQEA-DVIKGKTVTCYPGFEEQL--IGSNYQETLVQR--- 139 Query: 181 DEDNKIVTTPAYMLAQNI 198 DE+ PA LA Sbjct: 140 DENIITGKGPAAALAFGY 157 >UniRef50_C9RGD3 Intracellular protease, PfpI family n=1 Tax=Methanocaldococcus vulcanius M7 RepID=C9RGD3_METVM Length = 208 Score = 104 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 77/209 (36%), Gaps = 41/209 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVI--NHLTGEAMTE 59 KK+ ++++ D E + +G + + K + N +T Sbjct: 35 KKVLMVIAPKDFRD----EELFEPMAVFEANGLKVDVVSTKKGTCIGMLGNKITVNKT-- 88 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + + + E A+++PGG G+ + L N + EL + Sbjct: 89 ------------------INEVNPNEYVAIVIPGGIGSKEYLWN----------NTELLS 120 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 L + ++ K + +C++P +L + + T+ D + E L++ GA + V Sbjct: 121 LVKKFYEDHKVVAAICLSPVVLARAGILKGKKATVFPDPEAIEELKKYGAIYEDKGV--- 177 Query: 179 VVDEDNKIVTTPAYMLAQNIAEAASGIDK 207 VVD + +P Y + E I+ Sbjct: 178 VVDGNIITAQSPNYARVFGL-EVLKVIEN 205 >UniRef50_A9ARH9 Intracellular protease, PfpI family n=5 Tax=Burkholderia cepacia complex RepID=A9ARH9_BURM1 Length = 189 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 59/187 (31%), Gaps = 35/187 (18%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K+ ++ DG E E V A+S GAQ + ++ H+ A + + Sbjct: 9 KVAILA-----VDGFEEAELVEPQRALSAEGAQVDVISQQPGEIQGFRHVDKGARVKVDH 63 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 A + DA+++PGG G + + Sbjct: 64 T--------------FDDAKQGDYDAIVLPGG----------VVNGDAMRMIPAAREFVT 99 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A A KP+ +C +L T+ + + + G + V V Sbjct: 100 AAIGADKPVAVICHGGWLLVSA-GLVDGRTMTSWPSLQDDIRNAGGKWVDER-----VVR 153 Query: 183 DNKIVTT 189 D ++T+ Sbjct: 154 DGNLITS 160 >UniRef50_A4X5M7 ThiJ/PfpI domain protein n=4 Tax=Actinomycetales RepID=A4X5M7_SALTO Length = 231 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 46/223 (20%), Positives = 73/223 (32%), Gaps = 44/223 (19%) Query: 1 MKKIGVILSGCGVYD---------GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDV--- 48 M KI +++G ++ G E V+ ++ +G + V PD V Sbjct: 1 MSKILFVVTGADHWELADGTRHPTGVWAEEIVVPHEMLTSAGHEVVIATPDGVVPRVDRG 60 Query: 49 --INHLTGEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFAS 106 + TG R + LA+ D E A+ PGG G ++L+ Sbjct: 61 SLLPEFTGGPAGAVRMTEAVEGLEGLRKPIRLAEVDLDEYQAVFYPGGHGPMEDLA---- 116 Query: 107 LGSECTVDRELKALAQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEE 165 VD++ L A QA KPLG +C P A+L + G + +EE Sbjct: 117 ------VDQDSGRLLVAAQQAKKPLGIVCHGPAALLAAVTADGSNAFAGCRVAAFTNVEE 170 Query: 166 M-------------------GAEHVPCPVDDIVVDEDNKIVTT 189 G + V D ++T Sbjct: 171 AQAGFADKAKWLLQDRLVDIGVQFQEGEAWAPNVVVDGNLITG 213 >UniRef50_B9L1B0 Protease I n=2 Tax=Thermomicrobia (class) RepID=B9L1B0_THERP Length = 179 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 73/208 (35%), Gaps = 40/208 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K++ ++L+ E EA + GA+ V D+Q + + Sbjct: 7 KRVAMLLAKD-----FEDSEATDPKQYLETRGAEVVIVGLDRQPITGKKGTVLQPDKT-- 59 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + + E DAL++PGG +D A Sbjct: 60 ----------------IDEVTVEEFDALVIPGG-----------GSPENLRIDDRAVAFT 92 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 +A ++GKP+ +C P +L D T+ + ++ GA +V + +V+D Sbjct: 93 RAFVESGKPVAAICHGPQLLISA-DVLRGRTVTCVKKIRDDVKNAGAIYVD---EAVVID 148 Query: 182 EDNKIVTTPAYM--LAQNIAEAASGIDK 207 + PA + Q IAEA + + Sbjct: 149 GNLITSRVPADLPFFDQAIAEALARVPA 176 >UniRef50_A4XSU5 Intracellular protease, PfpI family n=11 Tax=cellular organisms RepID=A4XSU5_PSEMY Length = 186 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 70/202 (34%), Gaps = 35/202 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ I + G E E + + A+ GA + + + + Sbjct: 8 KKVLFITANS----GIERDELLKPMQALKDQGASVTHASVKGGEAETW------LKDSEK 57 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 +V +++ L AA+ D L++PGG D + + L Sbjct: 58 DVTVQSD-------TQLKGLSAADYDLLVIPGG----------TVNADTLRQDSDAQRLV 100 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 Q QA K + +C P +L + + + L GA+ V V + Sbjct: 101 QEFRQASKTVAAICHGPWLLIDAGVVSGKA-LTSYSSVRTDLVNAGADWVDAQVK-VCPG 158 Query: 182 EDNKIVTT------PAYMLAQN 197 ++ K++T+ PA+ A + Sbjct: 159 QNWKLITSRTPDDLPAFNEALS 180 >UniRef50_Q045Z0 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme, amidase family n=37 Tax=Lactobacillales RepID=Q045Z0_LACGA Length = 194 Score = 102 bits (255), Expect = 6e-21, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 71/201 (35%), Gaps = 40/201 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+ + DG E E + + + R DK+++D +H+ Sbjct: 1 MTKVAVVFA-----DGCEEVEGLSVVDVLRRLNIDCDMVGLDKKEIDGDHHI-------- 47 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + D + PGG A NL N +++L L Sbjct: 48 -----------LLTCDKVVDDSLLDYDLVAFPGGRTGALNLRN----------NKKLADL 86 Query: 121 AQAMHQAGKPLGFMCIAPAMLPK-IFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 ++AGK MC AP L T + ++ EE H + V Sbjct: 87 MIQRNKAGKWDAAMCAAPIALGHYGLLEGANYTCYPGFE-KQIEEECPNGHFSTDIT--V 143 Query: 180 VDEDNKIVTT--PAYMLAQNI 198 VD+++KI+T+ PA A Sbjct: 144 VDKEHKIITSRGPATAWAYAY 164 >UniRef50_C5AII4 Intracellular protease, PfpI family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AII4_BURGB Length = 196 Score = 102 bits (255), Expect = 7e-21, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 60/189 (31%), Gaps = 35/189 (18%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 +I ++ DG E E A+ GA+ + ++ H+ T Sbjct: 9 RIAILA-----VDGFEQVELTEPQRALQAEGAKVEVISQKPGEIQGFKHVDKGDRTRVD- 62 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 QA A + DA+++PGG G + + Sbjct: 63 -------------LTFEQAKAGDYDAVVLPGG----------VVNGDAIRMIPAAREFVV 99 Query: 123 AMHQAGKPLGFMCIAPAM-LPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 A HQA KP+ +C + + R+T + + + G + V + +V D Sbjct: 100 AAHQADKPIAVICHGGWLPVSAGIVEGRRMT--SWPSLQDDIRNAGGQWVD---ERVVKD 154 Query: 182 EDNKIVTTP 190 + P Sbjct: 155 GNLITSRKP 163 >UniRef50_C7N926 DJ-1 family protein n=3 Tax=Leptotrichia RepID=C7N926_LEPBD Length = 187 Score = 102 bits (255), Expect = 7e-21, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 70/191 (36%), Gaps = 38/191 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V L+ +G E EA+ + + R+G + T Sbjct: 4 KKVAVFLA-----NGFEEIEAITPIDLLERAGITVDTVSI------------------TE 40 Query: 62 NVLIEAARITRGEIRP-LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N L+E+AR R + + + +E D LI+PGG G + L ++++ Sbjct: 41 NNLVESARKVRVLADKVIKEINFSEYDMLILPGGPGFKNYFDSQLLLDKIVEFSKDVE-- 98 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 K + +C AP +L L + G EE + P + VV Sbjct: 99 -------NKKVAAICAAPTVLSS-----LGILEGKKAVCFPACEEDLLKGNPILTRERVV 146 Query: 181 DEDNKIVTTPA 191 ++N I + A Sbjct: 147 KDENIITSRSA 157 >UniRef50_B5HD84 Protease I n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HD84_STRPR Length = 186 Score = 102 bits (255), Expect = 7e-21, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 51/159 (32%), Gaps = 20/159 (12%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M +I++G D +E E + + G + AP ++++ + H Sbjct: 46 MTTKILIVTG----DAAESLEVLYPYQRLLEEGYEVHIAAPARKKLQFVVH----DFEPG 97 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + E T ++ + A+++PGG D EL+ + Sbjct: 98 FDTYTEKPGYTWQADLAFSEVEPGAYAAIVIPGG-----------RAPEYLRNDPELRKI 146 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDT 159 +A + KP+ +C P + + Sbjct: 147 LKAFFDSDKPVAQICHGPLLTA-ATGGLEGRRFTSYPGA 184 >UniRef50_Q9M8R4 F13E7.34 protein n=273 Tax=cellular organisms RepID=Q9M8R4_ARATH Length = 388 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 60/192 (31%), Gaps = 17/192 (8%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M +L CG D E +E ++ A+ G P K+ D T Sbjct: 1 MANSRTVLILCG--DYMEDYEVMVPFQALQAFGITVHTVCPGKKAGDSCP--TAVHDFCG 56 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 E+ + D ++ D L++PGG + + L Sbjct: 57 HQTYFESRGHNFTLNATFDEVDLSKYDGLVIPGG-----------RAPEYLALTASVVEL 105 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + ++GKP+ +C +L D L GA+ V D+ V Sbjct: 106 VKEFSRSGKPIASICHGQLILAAA-DTVNGRKCTAYATVGPSLVAAGAKWVEPITPDVCV 164 Query: 181 DEDNKIVTTPAY 192 D ++T Y Sbjct: 165 V-DGSLITAATY 175 Score = 99.0 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 57/191 (29%), Gaps = 20/191 (10%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I + CG D E +E + ++ G Q P+K+ D T E Sbjct: 198 KRILFL---CG--DYMEDYEVKVPFQSLQALGCQVDAVCPEKKAGDRCP--TAIHDFEGD 250 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 E T ++ DAL++PGG ++ + + Sbjct: 251 QTYSEKPGHTFALTTNFDDLVSSSYDALVIPGG-----------RAPEYLALNEHVLNIV 299 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + + KP+ +C +L + G + D Sbjct: 300 KEFMNSEKPVASICHGQQILAAA-GVLKGRKCTAYPAVKLNVVLGGGTWLEPDPIDRCF- 357 Query: 182 EDNKIVTTPAY 192 D +VT A+ Sbjct: 358 TDGNLVTGAAW 368 >UniRef50_B3XE12 Peptidase C56, PfpI n=6 Tax=Escherichia RepID=B3XE12_ECOLX Length = 191 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 66/213 (30%), Gaps = 23/213 (10%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKI +I D SE +E ++ A++ G + P K+ + I T E Sbjct: 1 MKKILLI-----TGDFSEDYEVMVPWQALNMLGFRVDVVCPGKRTGEFI--KTAIHDFEG 53 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 E E + + GG +++ + + Sbjct: 54 DQTYTEKPGHLFRLTASFDDIRLQEYSGVYISGG-----------RSSEYLRLNKSVLDI 102 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 P+ +C P +L + +E G + V D+ +V Sbjct: 103 VHYAMNLTLPVAAICHGPQILAAA-GVLKGRKLTGYFTVKPEVEMAGGQWVTAADDEAIV 161 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVL 213 D ++T +M I I ++ + ++ Sbjct: 162 D--GNLITATTWMGHPAILRHF--ITQMGTSII 190 >UniRef50_P80876 General stress protein 18 n=22 Tax=Bacteria RepID=GS18_BACSU Length = 172 Score = 101 bits (252), Expect = 1e-20, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 64/199 (32%), Gaps = 41/199 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI V+L+ E E A +G + +K + T E + Sbjct: 3 KKIAVVLTY-----YFEDSEYTEPAKAFKEAGHELTVIEKEKGKTVKGKQGTAEVTVDA- 56 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + ++++ DAL++PGGF + D Sbjct: 57 ---------------SIDDVNSSDFDALLIPGGF-----------SPDQLRADDRFVQFT 90 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 +A KP+ +C P +L R G +E GA+ V ++VV Sbjct: 91 KAFMTDKKPVFAICHGPQLLINAKALDGRKATG-YTSIRVDMENAGADVVDK---EVVVC 146 Query: 182 EDNKIVTT-----PAYMLA 195 +D + + PA+ Sbjct: 147 QDQLVTSRTPDDIPAFNRE 165 >UniRef50_B0UB01 ThiJ/PfpI domain protein n=2 Tax=Alphaproteobacteria RepID=B0UB01_METS4 Length = 242 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 69/229 (30%), Gaps = 52/229 (22%) Query: 3 KIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVD----VINHL 52 K+ V+ + G G + E + GA + ++ + Sbjct: 5 KVLVVATSHAELGSSGHRTGVWLEELATPYYVLQDGGADITLVSIRGGEIPFDPRSVPAE 64 Query: 53 TGEAMTET------------RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKN 100 G + R + E AR L D DA+ +PGG G + Sbjct: 65 AGRGPGDKPADQQEVPASVRRFLADERARAVAKNSPALTSVDPQAFDAVFLPGGHGPMWD 124 Query: 101 LSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTI 153 +N D L + +M AGK + +C PA L + R++ Sbjct: 125 AAN----------DDTLARIIGSMIDAGKFVAAVCHGPAGLVRAKRRDGHPIVEGRRVSA 174 Query: 154 GTDID-------------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 T+ + + L+E+G + P D ++T Sbjct: 175 FTNTEEEAVGLTKVVPFLLEDRLKELGGKFERGPDWQPYAVRDGNLITG 223 >UniRef50_P45470 Protein yhbO n=95 Tax=Bacteria RepID=YHBO_ECOLI Length = 172 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 67/201 (33%), Gaps = 43/201 (21%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI V+++ D E E ++G + +KQ + GEA Sbjct: 3 KKIAVLIT-----DEFEDSEFTSPADEFRKAGHEV--ITIEKQAGKTVKGKKGEASVTID 55 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + + + AE DAL++PGG D Sbjct: 56 --------------KSIDEVTPAEFDALLLPGGH-----------SPDYLRGDNRFVTFT 90 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + +GKP+ +C P +L +LT I ++ GAE ++VV Sbjct: 91 RDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIII--DVKNAGAEFYD---QEVVV 145 Query: 181 DEDNKIVTT-----PAYMLAQ 196 D+D + + PA+ Sbjct: 146 DKDQLVTSRTPDDLPAFNREA 166 >UniRef50_C5VG63 DJ-1 family protein n=6 Tax=Prevotella RepID=C5VG63_9BACT Length = 189 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 77/214 (35%), Gaps = 42/214 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ L+ +G E EA+ + + R G + + +TG + E+ Sbjct: 1 MAKVYEFLA-----NGFEEVEALAPVDILRRGGVEVKMVS-----------ITGSNLVES 44 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + ++ A + I + A D L++PGG +KNL+ ++ Sbjct: 45 SHGVVVKADLLFENITDFSDA-----DLLMLPGGMPGSKNLNE----------HEGVRKA 89 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLT--------IGTDIDTAEVLEEMGAEHV 171 + + GK + +C AP +L + + T +G D + L + Sbjct: 90 LKEQFEKGKRIAAICAAPLVLASVGLLKGKKATIYPGMESYLGEDAEYTGALIQEDGNVT 149 Query: 172 PCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGI 205 ++++ ++ A+ + E G+ Sbjct: 150 TGAGPAASFPYGYQLLSY--FLPAEKVEEIKKGM 181 >UniRef50_B0SBM0 Transcription regulator, DJ-1/PfpI family intracellular protease n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SBM0_LEPBA Length = 188 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 74/209 (35%), Gaps = 38/209 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ + L G E EA++ + + R + V + T E + +R Sbjct: 3 KKVLIPLC-----PGFEEMEAIILIDVLRRGNVEVVSAS-----------KTKEPVVASR 46 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 N + ++ + E DA+++PGG KNL D E++ + Sbjct: 47 NTI-------HISDTTFSEINVDEFDAIVLPGGMNGTKNLM----------ADTEIQKIL 89 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 H + K +G +C APA+L K T ++ + G + ++ Sbjct: 90 SIFHSSKKHIGAICAAPAVLRKWDIISGNDPYTAFPSTDDLAKGKGGRYTGNRIESFH-- 147 Query: 182 EDNKIVTTP--AYMLAQNIAEAASGIDKL 208 P A+ A + E G + + Sbjct: 148 -HIHTSVGPGSAFAFALYLLELFEGKEVM 175 >UniRef50_C8PUR4 Intracellular protease 1 n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PUR4_9GAMM Length = 176 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 76/212 (35%), Gaps = 39/212 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAP-DKQQVDVINHLTGEAMTE 59 M KI ++L E E T + G Q ++QV +NH Sbjct: 1 MAKIAILL-----DTDFEQVEYTQTNDLLKAKGHQTTLITTQPQKQVRGLNHTDPADTFT 55 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + +A+AA+ DA+++PGG + + + +A Sbjct: 56 AD--------------LLIGEANAADYDAIVLPGG----------GANADVLRANTDAQA 91 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + +A GKP+ +C AP + + + A L+ GA+ V V Sbjct: 92 MVKAFMNVGKPVAAICHAPWIFADT-EIARGKKLTAYKTIATDLKNAGAQFEDKSV---V 147 Query: 180 VDEDNKIVTTPAYMLAQNIAEAASGIDKLVSR 211 +D + P ++I + A ID+ +++ Sbjct: 148 IDGNLITSRQP-----EDIPDFAEAIDQALTK 174 >UniRef50_A7GXC1 DJ-1 family protein n=6 Tax=Campylobacter RepID=A7GXC1_CAMC5 Length = 185 Score = 100 bits (250), Expect = 2e-20, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 76/203 (37%), Gaps = 37/203 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK++ VIL+ +G E EA+ + + R+ A+C D+ V + ++ + Sbjct: 1 MKRVAVILA-----NGFEEIEALSVVDILRRADIDALCVGLDRALVVGAHGVSVKVD--- 52 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 L++ ELDA+++PGG A+NL++ +EL + Sbjct: 53 ---------------LLLSELREIELDAIVLPGGLPGAQNLAD----------SKELGEI 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + GK + +C AP L K T +T ++ G +++ Sbjct: 88 LRRFDDNGKLICAICAAPMALAKAGVLKGAFTCYPGFETNVRSDKNGYI----SDKNVIC 143 Query: 181 DEDNKIVTTPAYMLAQNIAEAAS 203 D + PA + + Sbjct: 144 DHNIITSRGPATAMEFALEIVKE 166 >UniRef50_A4WTY0 Intracellular protease, PfpI family n=7 Tax=Bacteria RepID=A4WTY0_RHOS5 Length = 189 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 75/210 (35%), Gaps = 36/210 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K I ++++ G+E E V A++ +GA+ V + + + +N Sbjct: 6 KTIAILIA----PRGTEDVEFVRPAKALADAGAKIVAVSLEAGAAETVNQD--------- 52 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 L +R ++ A+ D L++PGG A + E A Sbjct: 53 --LDPGSRHPVDAT--VSDVSASGFDGLVIPGGTVGA----------DKIRGSAEAVAFV 98 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + +AGKP+ +C P L + R + + A + G ++VVD Sbjct: 99 RGFFEAGKPVAAICHGPWALVEAGVLEGRR-LTSFPSLATDIRNAGGHWTDA---EVVVD 154 Query: 182 EDNKIVTTP----AYMLAQNIAEAASGIDK 207 + P A+ A+ I E A G Sbjct: 155 QGLVTSRKPDDLEAF-CARMIEEFAEGPHA 183 >UniRef50_B8DF54 Intracellular protease 1 (Intracellular protease I) n=26 Tax=Firmicutes RepID=B8DF54_LISMH Length = 173 Score = 100 bits (249), Expect = 3e-20, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 63/188 (33%), Gaps = 40/188 (21%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ ++S + E E +L + +GA A + ++ + H + Sbjct: 6 KKVIALVS-----EDFEDLELWYPVLRLREAGASVHLVAEEAKK---VYHGKYGVPVTSD 57 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 A + D ++VPGG+ + + L Sbjct: 58 Y--------------DFDSVRAEDYDGILVPGGW-----------SPDKLRRFDSVLNLV 92 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 +A +A KP+G +C A +L + +T + + + GA + VV Sbjct: 93 RAFDKAKKPIGQICHAGWVLVSAGILEGVNVT--STPGIKDDMTNAGAIWHN----EPVV 146 Query: 181 DEDNKIVT 188 + + I + Sbjct: 147 TDGHIISS 154 >UniRef50_A7HTR2 ThiJ/PfpI domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HTR2_PARL1 Length = 238 Score = 100 bits (249), Expect = 3e-20, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 71/223 (31%), Gaps = 43/223 (19%) Query: 2 KKIGVILSGCGVY------DGSEIHEAVLTLLAISRSGAQAVCFA-----PDKQQVDVIN 50 ++I ++++ G G E G + + P + + Sbjct: 4 QRIAIVVTSHGKLGNTGDDTGFHYEEMTTPYYIFLDDGCEVTLGSIQGGEPPADPSSLPD 63 Query: 51 HLTGEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSE 110 A + R + + A P+++ +A + DA+ +PGG G ++ + Sbjct: 64 DEEKRAESVRRFLKDKNAVKALKATVPVSELNAKDFDAVYLPGGHGCMWDMPD------- 116 Query: 111 CTVDRELKALAQAMHQAGKPLGFMCIAPA-MLPKIFDFPL---------------RLTIG 154 + L L +++ G +G +C PA +L +G Sbjct: 117 ---NDALSRLISEVYEKGGVVGAVCHGPAGLLGARLSDGTPFVKDRLINSFTDEEERKVG 173 Query: 155 TDIDT----AEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PA 191 D L +GA + V + ++VT PA Sbjct: 174 KDKAVPFLLETQLRGLGARFEGGKPFERHVCREGRVVTGQNPA 216 >UniRef50_A8L5X9 Intracellular protease, PfpI family n=32 Tax=Bacteria RepID=A8L5X9_FRASN Length = 195 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 55/183 (30%), Gaps = 32/183 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 +++ + + +G E E A+ +G V + ++ V +HL Sbjct: 16 RQVAFLTA----KEGVEQVELTGPWRAVRDAGFTPVLVSTEEGTVQAFHHLDRADTFPVD 71 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + A D DAL++PGG + + Sbjct: 72 ATVTGA--------------DPGGFDALVLPGG----------VANPDTLRWQPGVVPFV 107 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + G P+ +C P L + D + + L GA V D++V+ Sbjct: 108 RTFFDRGLPVAVICHGPWTLIEA-DVVRGRRVTSWPSLRTDLRNAGATWVD---DEVVIC 163 Query: 182 EDN 184 Sbjct: 164 RSG 166 >UniRef50_C0QRM2 Intracellular protease 1 (Intracellular protease I) n=3 Tax=Aquificales RepID=C0QRM2_PERMH Length = 169 Score = 99.4 bits (246), Expect = 8e-20, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 60/199 (30%), Gaps = 46/199 (23%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDV----INHLTGEA 56 MKKI ++L D E E + L G AP + I H + Sbjct: 1 MKKIAILL-----EDLVEDVEFIYPLYRFMEEGYVVDVLAPRVGEFSGKKGMIFHASKRV 55 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 + A DA+ VPGG+ D+E Sbjct: 56 DPDM----------------------ADYYDAVFVPGGYA-----------PDRFRRDKE 82 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 + M++ GK + +C P L + I + +E GA + PV+ Sbjct: 83 TIEFIRNMYKKGKIVAAICHGPWALISAKIVKGKR-ITAFFSIRDDIENAGAIYTGKPVE 141 Query: 177 DIVVDEDNKIVTTPAYMLA 195 VD + T P M Sbjct: 142 ---VDGNIVTATDPKAMPE 157 >UniRef50_D2RSY5 Intracellular protease, PfpI family n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RSY5_9EURY Length = 187 Score = 99.0 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 66/188 (35%), Gaps = 33/188 (17%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH-LTGEAMTETRN 62 + V L+ +G+E E V ++ +GA + + +N+ L G E + Sbjct: 16 VAVFLA----QEGTEEVEFVEPTDLVTDAGATVDVVGSETGEGQTVNNDLEGSESYEIK- 70 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + + A + DA+IVPGG A L + E L + Sbjct: 71 -------------KSFDEISADDYDAVIVPGGTVGADTLRTY----------DEGVDLLR 107 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 +AGKP +C P L + + + + + G E V + +VVD+ Sbjct: 108 QHVEAGKPTAVICHGPWTLVEADVVDGKQ-LTSYHSLQTDVRNAGGEWVD---EAVVVDD 163 Query: 183 DNKIVTTP 190 P Sbjct: 164 GLITSRNP 171 >UniRef50_C6CHL2 ThiJ/PfpI domain protein n=4 Tax=Enterobacteriaceae RepID=C6CHL2_DICZE Length = 261 Score = 99.0 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 71/226 (31%), Gaps = 36/226 (15%) Query: 3 KIGVILSGCGVYD-----GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM 57 KI +++S G E + ++G +P + + Sbjct: 34 KILIVVSSLDKKTENLVGGFWFPELTHPVKVFDKAGVDFDIASPKGGLAP-FDGFDLKDQ 92 Query: 58 TETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 R G+ L+ D ++ A+++ GG G + N + EL Sbjct: 93 ASLEFWTNPQHRNKLGQTIKLSDIDPSKYSAILLVGGHGPMWDFVN----------NTEL 142 Query: 118 KALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAE--------- 161 + + +++ + +C PA L + RLT T + A Sbjct: 143 SNIVRTLYENNGVISAVCHGPAGLINVKLSNGEDLIKGRRLTGFTAAEEASRQYDKIVPF 202 Query: 162 ----VLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAAS 203 L++ GA P+ V D +++T A + EA Sbjct: 203 ELQGALKKAGATFEEAPIFANNVVVDGRLITGQNPASATALGEAVV 248 >UniRef50_O28987 Uncharacterized protein AF_1281 n=11 Tax=cellular organisms RepID=Y1281_ARCFU Length = 168 Score = 98.6 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 50/174 (28%), Gaps = 33/174 (18%) Query: 17 SEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIR 76 E E L + G + + + G+ + R L Sbjct: 11 FEDLELFYPLYRLREEGLEVKVAS------SSLEVRVGKKGYQVRPDLT----------- 53 Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 + L++PGG ++ + + + GKP+ +C Sbjct: 54 -YEDVKVEDYAGLVIPGG-----------KSPERVRINERAVEIVKDFLELGKPVAAICH 101 Query: 137 APAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 P +L R + + I + L GA + PV VVD + P Sbjct: 102 GPQLLISAMAVKGRR-MTSWIGIRDDLIAAGALYEDRPV---VVDGNVITSRMP 151 >UniRef50_C1SLG0 DJ-1 family protein n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SLG0_9BACT Length = 187 Score = 98.2 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 64/201 (31%), Gaps = 38/201 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+L+ DG E EAV + + R+ + V + L + + Sbjct: 1 MGKVIVVLA-----DGFEEIEAVSVIDILRRADVEVCAAGVKDGNVKGAHGLIVKPDST- 54 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 L D + D +++PGG A+N ++ + Sbjct: 55 -----------------LEDIDEDDYDMIVLPGGAVGAEN----------IGKSKDADDI 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + K + +C AP +L L G + ++ A+ +VV Sbjct: 88 LRKFKKDDKYIAAICAAPKILA-----DKGLLNGCMATSYPSFKDAVAKDSDYQEAIVVV 142 Query: 181 DEDNKIVTTPAYMLAQNIAEA 201 DE+ PA Sbjct: 143 DENIITSRGPATAAEFAFTLV 163 >UniRef50_B5Y9N6 Intracellular protease 1 (Intracellular protease I) n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y9N6_COPPD Length = 169 Score = 97.9 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 60/188 (31%), Gaps = 38/188 (20%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 KI ++L DG E E + + V + + N L + Sbjct: 4 KIAILL-----DDGFEDLEFFYPYFRVQEDNFEPVVLGVEPKLAKGKNGLYFQITET--- 55 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 LA+ E L +PGG E+K Sbjct: 56 ---------------LAKHKPEEFVGLYIPGGHA-----------PDRLRRFDEVKEFVS 89 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A ++ G+P+G +C P +L +T+ + + LE GA V PV VVD+ Sbjct: 90 AFYKLGRPIGTICHGPQVLISA-KVVEGVTMTSVSAIKDDLENAGAIWVNQPV---VVDK 145 Query: 183 DNKIVTTP 190 + P Sbjct: 146 NIVSSRVP 153 >UniRef50_C6PQH3 ThiJ/PfpI domain protein n=11 Tax=Bacteria RepID=C6PQH3_9CLOT Length = 192 Score = 97.9 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 76/218 (34%), Gaps = 33/218 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKI ++L+ +G E EA + + + + + T Sbjct: 1 MKKILLLLA-----NGFEAVEASVFTDVLGWNKFE-------------GDGTTTLVTAGM 42 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R L T L++ + E DA+ +PGGF A + + L Sbjct: 43 RERLKCTWNFTIIPEMLLSEVNVEEFDAVAIPGGFEEAGFYED--------AFSEDFLNL 94 Query: 121 AQAMHQAGKPLGFMCIAPAMLPK-IFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + +A K + +C+A + K T + L E G + P IV Sbjct: 95 IREFDKADKIIASICVAALPIGKSGVLNGRNATTYNLGKRQKQLSEFGVNVI--PDKPIV 152 Query: 180 VDEDNKIVTTP--AYMLAQNIAEAASGIDKL--VSRVL 213 +D++ P A+ +A + E + + V R++ Sbjct: 153 IDKNIITSYNPSTAFNVAFKLLELLTSKENCNNVKRLM 190 >UniRef50_D1BGR7 Intracellular protease, PfpI family n=14 Tax=Actinomycetales RepID=D1BGR7_SANKS Length = 197 Score = 97.9 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 64/187 (34%), Gaps = 25/187 (13%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 +++ +++ G E E V+ L + +GA AP++ V+ + R Sbjct: 16 RRVLAVVTTY----GVEQDELVVPLEHLRAAGAHVDVAAPERGTVETL----VGDKDPGR 67 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 V EA R + L AD D L+VPGG ++ + A Sbjct: 68 PV--EADR----ALGDLTDADLDSYDLLLVPGG----------TINADALRLEEKAVAAV 111 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 ++G+P+ +C P ++ + T+ + + G V D Sbjct: 112 GTFARSGRPVAAICHGPWLVVEA-GLATGKTLTSYPTLQTDVRNAGGTWQDQSVVSDPTD 170 Query: 182 EDNKIVT 188 + + Sbjct: 171 GWTLVTS 177 >UniRef50_C0SMV9 Putative uncharacterized protein n=1 Tax=Streptomyces spiroverticillatus RepID=C0SMV9_9ACTO Length = 231 Score = 97.9 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 43/229 (18%), Positives = 75/229 (32%), Gaps = 50/229 (21%) Query: 1 MKKIGVILSGCGVY---------DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH 51 M K+ I+SG + G E ++ +G + V P+ ++ Sbjct: 1 MAKVLFIVSGATYWVLKDGTRHATGYWAEEFANPYKILTDAGHEVVVATPN-GVTPTVDM 59 Query: 52 LTGEAMTETRN-------VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNF 104 ++ N +I +A + R ++ L+ + DA+ +PGG G +L+ Sbjct: 60 MSLRPEMVGGNDSALELEAIIRSAEVMRRPLQ-LSDVRLEDYDAVYLPGGHGPMADLA-- 116 Query: 105 ASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLE 164 D ++ L +G PL +C PA + G +I E Sbjct: 117 --------WDADVGRLLTQQLTSGNPLFVVCHGPAAMLATRIHGESPFKGYNITCFTDEE 168 Query: 165 EM--------------------GAEHVPCPVDDIVVDEDNKIVTT--PA 191 E G P+ + V ED +VT PA Sbjct: 169 EDGVGLASRAPWLLETDVRTKVGVNFSRGPIWEPYVVEDRNLVTGQNPA 217 >UniRef50_C6RJA7 Intracellular protease, PfpI family n=3 Tax=Acinetobacter RepID=C6RJA7_ACIRA Length = 181 Score = 97.9 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 55/175 (31%), Gaps = 27/175 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K +I++ G E E + L + G +A+ A ++V + T Sbjct: 1 MAKKALIITSNA---GVEHDELIKPLEFLKSKGIEAIHAAEKNEEVQTMKGDKEPGPAYT 57 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + D + D LIVPGG ++ +++ L Sbjct: 58 PD-------------STFEKVDPTDYDILIVPGG----------TVNADTLRINEQVQQL 94 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 Q GKP+ +C AP L T+ L+ G V Sbjct: 95 IQHFTDNGKPIAMICHAPWTLINA-GRIEGKTVTGYQSLELDLKNAGGLWKDEAV 148 >UniRef50_Q58377 Uncharacterized protein MJ0967 n=8 Tax=Euryarchaeota RepID=Y967_METJA Length = 205 Score = 97.5 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 76/212 (35%), Gaps = 43/212 (20%) Query: 1 MK--KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVI--NHLTGEA 56 MK K+ ++++ D E + +G + + K + + N +T E Sbjct: 29 MKNAKVLMVIAPKDFRD----EELFEPMAVFESNGLKVDVVSTTKGECVGMLGNKITVEK 84 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 + + + A+++ GG G+ + L N + + Sbjct: 85 T--------------------IYDVNPDDYVAIVIVGGIGSKEYLWN----------NTK 114 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPV 175 L L + + K + +C++P +L + + T+ + E L++ GA + V Sbjct: 115 LIELVKEFYNKNKVVSAICLSPVVLARAGILKGKKATVYPAPEAIEELKKAGAIYEDRGV 174 Query: 176 DDIVVDEDNKIVTTPAYMLAQNIAEAASGIDK 207 VVD + +P Y + E I+K Sbjct: 175 ---VVDGNVITAKSPDYARLFGL-EVLKAIEK 202 >UniRef50_C5D5R3 ThiJ/PfpI domain protein n=6 Tax=Bacillaceae RepID=C5D5R3_GEOSW Length = 221 Score = 97.5 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 67/213 (31%), Gaps = 40/213 (18%) Query: 2 KKIGVILSG-----CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 K++ ++++ G + E + L G + +V L + Sbjct: 3 KRVLMVVTNHTTITDDHKTGLWLEEFAVPYLVFKEKGYDVKVASIQGGEVP----LDPRS 58 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 + E E A L++ DA DA+ +PGG G + + + Sbjct: 59 IEEKDPAWAE-AEKELKNTARLSEDDATGFDAIFLPGGHGTMFDFPD----------NET 107 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAE-------- 161 L+ + Q + G+ +G +C P+ L + +T TD + E Sbjct: 108 LQYVLQQFAEDGRIIGAVCHGPSGLVNVTYKDGTPLVKGKTVTAFTDEEEREVQLDQYMP 167 Query: 162 -----VLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 L GA V D ++T Sbjct: 168 FLLESTLRLRGANFVHGEKWTDFSVRDGNLITG 200 >UniRef50_A4XRE4 ThiJ/PfpI domain protein n=3 Tax=Gammaproteobacteria RepID=A4XRE4_PSEMY Length = 284 Score = 97.1 bits (240), Expect = 3e-19, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 67/205 (32%), Gaps = 39/205 (19%) Query: 4 IGVILS---------GCGVYDGSEIHEAVLTLLAISRSGAQAVCF-----APDKQQVDVI 49 + V+LS G G ++E + + ++G + V AP + V Sbjct: 28 VLVLLSSETQLPLKDGKQYTTGFYLNEFGVPADHLLKAGYELVLVTPKGNAPRVDENSVT 87 Query: 50 NHLTGEAMTETRNVLIEAARI-TRGEIRPLAQA---DAAELDALIVPGGFGAAKNLSNFA 105 G E + + + + L + D L++PGG +L+N Sbjct: 88 PQYFGGDEQEMQRIRSLVENLPGIDDTLSLKEVLEGDLQRYAGLLIPGGHAPLIDLAN-- 145 Query: 106 SLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEE 165 + E+ AL + HQAGKP +C P L + D A Sbjct: 146 --------NPEVGALLRHFHQAGKPTAAICHGPIAL-----------LSAQRDPAAYQAA 186 Query: 166 MGAEHVPCPVDDIVVDEDNKIVTTP 190 + P D I I +TP Sbjct: 187 LANGETPAAADWIYQGYRMTIFSTP 211 >UniRef50_A9HQ45 Putative transcriptional regulator n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HQ45_GLUDA Length = 292 Score = 97.1 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 59/166 (35%), Gaps = 30/166 (18%) Query: 2 KKIGVILSGCGVYD---------GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHL 52 K++ VILS D G ++E + A+ +G V P V +H Sbjct: 32 KRVLVILSSARYLDLQKHKKYETGFYLNELAVPAKALVTAGYDLVFTNPKGNIV-TWDHH 90 Query: 53 TGEAMTETRNVLIEA-------ARITRGEIRPLAQAD---AAELDALIVPGGFGAAKNLS 102 + A+ +++ E +T R L+ + DA+ +PGG ++L+ Sbjct: 91 SANALYFNKDLKQEQEAEHFVEHLLTVRHPRSLSSVRKEGVDDYDAVFIPGGHAPMQDLA 150 Query: 103 NFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFP 148 + +L A+ H+ + +C P L P Sbjct: 151 T----------NPDLGAILSEFHKRHRITALICHGPIALLSTLTSP 186 >UniRef50_Q026F0 Intracellular protease, PfpI family n=3 Tax=Bacteria RepID=Q026F0_SOLUE Length = 189 Score = 97.1 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 67/205 (32%), Gaps = 29/205 (14%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M KI VI DG E +E + + G + AP +++++++ H Sbjct: 1 MSKILVI-----TGDGGESYETLYAVHRFQEEGWEVAVAAPSRRRLNLVMH----DFKPG 51 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + IE + E A+++ GG + +L L Sbjct: 52 WDTYIERRGYGLDADLSFDEVKVDEYAAILLLGG-----------RAPEYLRNNAQLLEL 100 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 A+ + GK + +C +L R+T +E GA D V Sbjct: 101 ARDFDRQGKWIFAICHGVQILAAAGLAKGKRVTCYEH--VRLEVELSGATWH---TDQTV 155 Query: 180 VDEDNKIVTTPAYMLAQNIA-EAAS 203 D ++VT + + E + Sbjct: 156 --RDGRVVTAQTWQSHPSFYREIFA 178 >UniRef50_Q0W5Q2 Intracellular protease (C56 family) n=3 Tax=cellular organisms RepID=Q0W5Q2_UNCMA Length = 189 Score = 97.1 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 54/180 (30%), Gaps = 35/180 (19%) Query: 13 VYDGSEIHEAVLTLLAIS-RSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARIT 71 V DG E E + L + A +K Q H V EAA Sbjct: 13 VGDGFEDSELLYPLYRFRYEACADVTVAGIEKGQTLKGKH--------GVPVTSEAA--- 61 Query: 72 RGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPL 131 + E DAL++PGG + E+ Q + GK + Sbjct: 62 ------IRDLAPDEFDALVIPGG-----------QSPDHIRIYPEVIKFVQDFDRTGKTI 104 Query: 132 GFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 +C P ++ T +E+ GA ++ PV V+ + P Sbjct: 105 AAVCHGPQILITARLLKGKDAT--AWKSLRVDMEDAGANYIDKPV---VISQQYIFSRQP 159 >UniRef50_B1YMA0 Intracellular protease, PfpI family n=14 Tax=Bacteria RepID=B1YMA0_EXIS2 Length = 176 Score = 96.3 bits (238), Expect = 6e-19, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 62/199 (31%), Gaps = 43/199 (21%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI ++S + E E + + GA + + Sbjct: 7 KKIIQLVS-----NDFEDLELWYPVHRLREEGATVDIVGEKAGE---------------K 46 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + I + + + A+ DA++VPGG+ + + Sbjct: 47 YIGKYGVPIVSDKT--FDEINPADYDAILVPGGW-----------SPDLLRRFDSVLTMV 93 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + ++ +P+G +C A +L + + + + + GA + VV Sbjct: 94 RHFNETKQPIGQICHAGWVLISA-GVLKGINVTSTPGIKDDMTNAGATWHD----EPVVV 148 Query: 182 EDNKIVTT-----PAYMLA 195 + + I + P YM Sbjct: 149 DGHIISSRRPPDLPDYMRE 167 >UniRef50_C4L0C2 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L0C2_EXISA Length = 219 Score = 96.3 bits (238), Expect = 7e-19, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 71/213 (33%), Gaps = 39/213 (18%) Query: 2 KKIGVILSGC-----GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 K + ++ + G G + E + L + G + + + V + + Sbjct: 3 KHVLIVTTSANQLSNGHATGLWLEEFAVPYLLFEKEGYKVTVASIEGGDVP----IDANS 58 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 + + + I R + L Q DA+ +PGG G + + Sbjct: 59 LEDGLSEDILNTRELLKDTARLDQVADESYDAIFLPGGHGTVVDFPE----------NET 108 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAEV------- 162 L+ + + ++++G +G +C P L + ++T TD + E+ Sbjct: 109 LQRVIRNVYESGNIVGAVCHGPIGLVNVKLSNDEPLVKDKQVTGFTDAEEKEMQLDSAVP 168 Query: 163 ------LEEMGAEHVPCPVDDIVVDEDNKIVTT 189 L G + + V D ++VT Sbjct: 169 FLLETGLRNQGGQFKGADNWAVNVAVDERLVTG 201 >UniRef50_A8RCF9 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=A8RCF9_9FIRM Length = 177 Score = 95.9 bits (237), Expect = 7e-19, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 66/192 (34%), Gaps = 38/192 (19%) Query: 11 CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARI 70 C + DG E EAV T+ + R+G D +V +LT + Sbjct: 5 CIMKDGFEELEAVGTIALLRRAGIDVDVCTSDANKVSGRFNLTLQP-------------- 50 Query: 71 TRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKP 130 ++ L + D DAL +PGG D + + + + K Sbjct: 51 ----VKDLKEVDPTSYDALFLPGG-----------PHYQTLESDAYIMEILSSYIHSNKV 95 Query: 131 LGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 + +C AP +L + + T T ++ E+ G +V V+D + + Sbjct: 96 VAAICAAPTILGRAGYLKNKNYTCFTSMN-----EDFGGTYVDRY---AVIDGNIITGRS 147 Query: 190 PAYMLAQNIAEA 201 A ++ A Sbjct: 148 AAAVIDFAFALI 159 >UniRef50_B2A567 Intracellular protease, PfpI family n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A567_NATTJ Length = 170 Score = 95.9 bits (237), Expect = 8e-19, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 63/200 (31%), Gaps = 47/200 (23%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K ++L E E + +G + ++ L + + Sbjct: 4 KKAIVLCANMY----EERELWYPYYRLQEAGLEVELVGAEEGTYTGKAGLPCKVDKQ--- 56 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 ++Q +A E+DA+I+PGGF ++++ + + Sbjct: 57 ---------------ISQINAEEVDAVIIPGGFA-----------PDSLRRNQQILNVIK 90 Query: 123 AMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 ++ G + +C P +L +T + ++ GA + V Sbjct: 91 KVNNNGGLIAAICHGPWLLVSADIISGKHITCF--WAIIDDVKNAGAHYEDRE-----VI 143 Query: 182 EDNKIVTT------PAYMLA 195 D IVT+ P +M Sbjct: 144 RDGNIVTSRIPHDLPEFMKE 163 >UniRef50_C0GN28 Intracellular protease, PfpI family n=1 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GN28_9DELT Length = 174 Score = 95.6 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 58/182 (31%), Gaps = 38/182 (20%) Query: 20 HEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIRPLA 79 +E V + +GA AP+ ++ G T + Sbjct: 19 YEFVYPYYRLLEAGAHVDVVAPEAKK--TYPGKGGTTATSS---------------LAAK 61 Query: 80 QADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPA 139 A + +++PGGF + L + M GK + +C Sbjct: 62 DAVPGDYAGIVIPGGFA-----------PDFMRRHEAMVNLVREMFNQGKVVAAICHGGW 110 Query: 140 MLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT----PAYML 194 ML ++T + + L GA V +++VVD++ T PA+M Sbjct: 111 MLASARILQDKKVT--SFFAIKDDLIHAGANWVD---EEVVVDKNLITSRTPDDLPAFMR 165 Query: 195 AQ 196 A Sbjct: 166 AA 167 >UniRef50_O06006 Putative cysteine protease yraA n=81 Tax=cellular organisms RepID=YRAA_BACSU Length = 169 Score = 95.2 bits (235), Expect = 1e-18, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 64/189 (33%), Gaps = 37/189 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI V+++ D E E + A +G V + + H Sbjct: 3 KKIAVLVT-----DQFEDIEYTSPVKAYEEAGYSVVAIDLEAGKEVTGKHG--------- 48 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 E +I + ++ DA++ DAL++PGGF D A Sbjct: 49 ----EKVKIDK----AISDVDASDFDALLIPGGF-----------SPDLLRADDRPGEFA 89 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 +A + KP+ +C P +L D I + L GA + ++VV Sbjct: 90 KAFVENKKPVFAICHGPQVLIDT-DLLKGKDITGYRSIRKDLINAGANYKDA---EVVVS 145 Query: 182 EDNKIVTTP 190 + TP Sbjct: 146 HNIVTSRTP 154 >UniRef50_Q27SQ0 Protease/amidase (Fragment) n=1 Tax=Pavlova lutheri RepID=Q27SQ0_PAVLU Length = 161 Score = 94.8 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 51/180 (28%), Gaps = 35/180 (19%) Query: 17 SEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIR 76 E E + GA TG+ + Sbjct: 2 FEDLEVTYPQKRLEEEGAVVSVIGGAAAGTK----YTGKFGYPVISHAC----------- 46 Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 + DAL++PGGF + + A M + GKP+G +C Sbjct: 47 -IDNVSPDAFDALVIPGGF-----------SPDYMRRNPAMLAFIVRMLEQGKPVGAICH 94 Query: 137 APAMLPKIFDFPLR-----LTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPA 191 P ML D + + + + + G V PV VVD + TPA Sbjct: 95 GPWMLCSARDASGKPVCSGVRCTSFGAIKDDVINAGGMWVDEPV---VVDANIITARTPA 151 >UniRef50_Q0JPK7 Os01g0217800 protein n=16 Tax=Magnoliophyta RepID=Q0JPK7_ORYSJ Length = 428 Score = 94.8 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 75/203 (36%), Gaps = 37/203 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K++ V ++ DG+E EA T ++R+GA+ T + + R Sbjct: 36 KRVLVPVA-----DGTEPVEAAATADVLNRAGARVTVA-------------TADPAGDDR 77 Query: 62 NVLIEAA-RITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +L+EAA + +A + D + +PGG + NL + L+ + Sbjct: 78 GLLVEAAFGVKLVADGRVADLEGEAFDLIALPGGMPGSANLRDCKV----------LEKM 127 Query: 121 AQAMHQAGKPLGFMCIAPA--MLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + + G +C PA + L+ T +E+ AE +P + Sbjct: 128 VKKQAEQGGLYAAICATPAVTLAHWGLLKGLKATCYP-----SFMEKFTAEIIPVN-SRV 181 Query: 179 VVDEDNKIVTTPAYMLAQNIAEA 201 VVD + PA + +A Sbjct: 182 VVDRNAVTSQGPATAIEYALALV 204 Score = 77.1 bits (188), Expect = 4e-13, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 77/206 (37%), Gaps = 33/206 (16%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITR 72 V +GSE EA+ + + R+GA + + ++ V+ R+ A I Sbjct: 252 VANGSEEMEALNLIDILRRAGANVTVASVE-DKLQVV---------TRRHKFNLIADIM- 300 Query: 73 GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLG 132 + +A E D +++PGG A+ LS+ L L + ++ KP G Sbjct: 301 -----VEEAAKREFDLIVMPGGLPGAQKLSSTKV----------LVDLLKKQAESNKPYG 345 Query: 133 FMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP-- 190 +C +PA + + + A +L + A +VVD + P Sbjct: 346 AICASPAYVLEPHGLLKGKKATSFPPMAHLLTDQSA-----CDSRVVVDGNLITSKAPGS 400 Query: 191 AYMLAQNIAEAASGIDKLVSRVLVLA 216 A A I E G +K VS L Sbjct: 401 ATEFALAIVEKLFGREKAVSIAKELI 426 >UniRef50_Q12ZS1 Intracellular protease 1 n=3 Tax=Methanosarcinaceae RepID=Q12ZS1_METBU Length = 174 Score = 94.4 bits (233), Expect = 2e-18, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 71/190 (37%), Gaps = 36/190 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI ++++ D E + SGA+ + ++ + Sbjct: 5 KKILMVIAQENFRD----EEFLKPKKVFEDSGAKVTVASNTTKKAKGVLGKKVSPD---- 56 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 ++ + + DA+ + GG GA + L + ++EL+ + Sbjct: 57 --------------ISISDVNIDDYDAISITGGGGAKQYLWD----------NKELQEIV 92 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + ++ GK +G +CIAP +L T+ + +T ++L++ A++ V V Sbjct: 93 RKANEQGKIIGAICIAPVVLANAGLLEGKMTTVFKNEETVKILKDNDAKYRDKDVM---V 149 Query: 181 DEDNKIVTTP 190 D + P Sbjct: 150 DGNIITGRDP 159 >UniRef50_C5RB54 C56 family peptidase n=12 Tax=Lactobacillales RepID=C5RB54_WEIPA Length = 182 Score = 94.4 bits (233), Expect = 2e-18, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 57/189 (30%), Gaps = 38/189 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V+++ D E E + AI +G +K V + Sbjct: 17 KKVAVLVT-----DFVEDVEYTDPVKAIEDAGHDVTTIGFEKGTVTGKHGTKITID---- 67 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + +++ + DAL +PGGF + D+ Sbjct: 68 --------------QAISEVKPEDFDALFIPGGF-----------SPDQLRADQRFVDFV 102 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + K +C P ++ + T+ + + L GA PV V+D Sbjct: 103 KYFLAKNKLTASICHGPQLMIQTGLVH-GRTMTSYLTVQPDLYYAGARIEDKPV---VID 158 Query: 182 EDNKIVTTP 190 + P Sbjct: 159 GNLITSREP 167 >UniRef50_B5XSZ0 Intracellular protease, PfpI family n=7 Tax=Bacteria RepID=B5XSZ0_KLEP3 Length = 178 Score = 94.4 bits (233), Expect = 2e-18, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 62/200 (31%), Gaps = 41/200 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI V+++ D E E A +G Q +KQ + GEA Sbjct: 3 KKIAVLIT-----DDFEDSEFTSPADAFKLAGHQV--VTIEKQAGKTVKGKQGEAEVAID 55 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 R + E DAL++PGG+ + D Sbjct: 56 --------------RAIDDVTPGEFDALLLPGGY-----------SPDQLRGDERFVTFT 90 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + GKP+ +C P +L D + ++ G E ++VVD Sbjct: 91 RDFVNGGKPVFAICHGPQLLISA-DVIRGRKLTAVKPIVVDVKNAGGEFYD---QEVVVD 146 Query: 182 EDNKIVTT-----PAYMLAQ 196 + + + PA+ Sbjct: 147 NEQLVTSRTPDDLPAFNREA 166 >UniRef50_Q2SRY0 DJ-1 family protein n=3 Tax=Mycoplasma mycoides group RepID=Q2SRY0_MYCCT Length = 182 Score = 94.0 bits (232), Expect = 3e-18, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 73/209 (34%), Gaps = 41/209 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKI + L G E EAV + R+G + +H Sbjct: 1 MKKIALYL-----NPGFEEIEAVTPCDVLKRAGILVDMISTTDNLEVKGSH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N+ I+A ++ + D +I+PGG G + ++ L Sbjct: 47 -NITIKADKLW-------KDLNINYYDGMILPGGSGV-----------TSLFKNQTLIDN 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 ++ K + +C AP ++ + + TI L++ A V P VV Sbjct: 88 ILEFNKQNKLIASICAAPQVIGQTRLLDNK-TITHYPKCDLYLDK--ANVVNKP---YVV 141 Query: 181 DEDNKIVTTPAYMLAQNIA--EAASGIDK 207 D++ T+ + ++A E G +K Sbjct: 142 DQNFITATSAGTSMQFSLAIVEYLLGKEK 170 >UniRef50_B4RT08 Protease n=2 Tax=Alteromonadales RepID=B4RT08_ALTMD Length = 183 Score = 94.0 bits (232), Expect = 3e-18, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 64/200 (32%), Gaps = 44/200 (22%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFA-PDKQQVDVINHLTGEAMTET 60 KKI ++ + +G E E V + GA+ + D+ + + + Sbjct: 10 KKIAILAT-----NGFEQSELVEPKTLFTEQGAKVDILSIEDQTTIKAWDEDNWGKEVDV 64 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 ++ A+ + DAL++PGG + + A Sbjct: 65 D--------------LQVSSANLEDYDALVLPGG----------QINPDVLRTNNDAVAF 100 Query: 121 AQAMHQAG--KPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 + K +G +C P +L + T+ + L+ GA V Sbjct: 101 ISKANGTDSIKAIGAICHGPWLLVESGLAD-GATLTSFPSIKTDLQNAGATWVDEE---- 155 Query: 179 VVDEDNKIVTT------PAY 192 V K+VT+ PA+ Sbjct: 156 -VVNHEKLVTSRNPNDIPAF 174 >UniRef50_UPI0000E47ADE PREDICTED: similar to KNP-Ia, partial n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47ADE Length = 71 Score = 94.0 bits (232), Expect = 3e-18, Method: Composition-based stats. Identities = 36/71 (50%), Positives = 52/71 (73%) Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 NVL+E+ARI RG+I L+ + DA++ PGGFGAAKNLS+FA G+ CTV+ +++ + Sbjct: 1 NVLVESARIARGKITALSGLSSGNFDAVVFPGGFGAAKNLSDFAVNGAGCTVNPDVERVI 60 Query: 122 QAMHQAGKPLG 132 + HQA KP+G Sbjct: 61 KEFHQAKKPIG 71 >UniRef50_B4U9D2 DJ-1 family protein n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U9D2_HYDS0 Length = 183 Score = 93.6 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 67/193 (34%), Gaps = 38/193 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+L+ G E EA+ + + R G + + + + Sbjct: 1 MAKVAVLLA-----PGFEEVEAIAPIDILRRGGVEVLIVGVKDKVIPS-----------A 44 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 RNV IE I L LD +I+PGG +NL E+K L Sbjct: 45 RNVKIE----VDVTIDELKDV--DNLDMIIIPGGMIGVENL----------KKSEEVKNL 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 M+ K + +C P +L + I + ++ EH+ + +V Sbjct: 89 INQMNAKKKYVSAICAGPLVLKNA-----GVVENKHITSHPSVKLEFNEHLYKE-ESVVE 142 Query: 181 DEDNKIVTTPAYM 193 DE+ PA Sbjct: 143 DENIISSRGPATA 155 >UniRef50_A1HSG1 Intracellular protease, PfpI family n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HSG1_9FIRM Length = 168 Score = 93.2 bits (230), Expect = 5e-18, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 55/192 (28%), Gaps = 40/192 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ +++ DG E EA+ + +G AP K Sbjct: 1 MAKVMLLI-----EDGFEEMEALYPYYRMKEAGHTVDVVAPAKGVYKGKYGY-------- 47 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + A+ LI+PGG V L L Sbjct: 48 ----------PLTATLTPNEVNVADYAGLIIPGG-----------QAPDRMRVHDGLVDL 86 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + +AG + +C P ML + +T + GA + +++ Sbjct: 87 VKQADKAGLVVAAICHGPQMLIEADIVRGRNVTCYK--SILTDVLNAGAIYHDL---EVI 141 Query: 180 VDEDNKIVTTPA 191 +D PA Sbjct: 142 IDGKLITSRLPA 153 >UniRef50_C3MPE6 Intracellular protease, PfpI family n=11 Tax=Sulfolobus RepID=C3MPE6_SULIL Length = 173 Score = 93.2 bits (230), Expect = 6e-18, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 54/187 (28%), Gaps = 32/187 (17%) Query: 17 SEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIR 76 E E + + G + V + +I Sbjct: 14 FEDIELLYPYYRVIEEGFRPVIA---------WKEANARVTGKHGYTVISD--------I 56 Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 + AL++PGG G E+K L + + KP+ +C Sbjct: 57 AFKDVRPEDYVALVIPGGRG-----------PEHIRTLEEVKNLTRKFFELKKPVAAICH 105 Query: 137 APAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQ 196 P +L + + + + + G ++ +D+VVDE+ P+ + A Sbjct: 106 GPQILISA-NLVKGRRLTSVTSIKDDVIAAGGIYID---NDVVVDENLISSRVPSDLPAF 161 Query: 197 NIAEAAS 203 + Sbjct: 162 AFTLVKA 168 >UniRef50_C7MB29 Intracellular protease, PfpI family n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MB29_BRAFD Length = 184 Score = 93.2 bits (230), Expect = 6e-18, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 67/190 (35%), Gaps = 31/190 (16%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G+E E L A+ +GAQ + AP+ V + +V ++ Sbjct: 18 GTETDEIQHPLAALREAGAQVIVAAPEAGSVATLQ----RDREPGADVPVDTV------- 66 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 A ++DAL++PGG D + L +++ AGKP+ +C Sbjct: 67 --YDTVKAKDVDALVLPGG----------TLNADTLRADETAQFLVRSVAAAGKPVAAIC 114 Query: 136 IAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE----DNKIVTTPA 191 AP +L + T+ + L G E ++VVD+ TP Sbjct: 115 HAPWLLVET-GLANGRTLTSVPTIRTDLVNAGGEWTD---QEVVVDDTAGFRLITSRTPD 170 Query: 192 YMLAQNIAEA 201 + A A Sbjct: 171 DLDAFTTAII 180 >UniRef50_A1VCT8 Intracellular protease, PfpI family n=14 Tax=Bacteria RepID=A1VCT8_DESVV Length = 204 Score = 92.9 bits (229), Expect = 7e-18, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 66/194 (34%), Gaps = 40/194 (20%) Query: 1 MKKIG---VILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM 57 M+++ V++ V D E E L + GA V P+K + H Sbjct: 32 MQRLAGKRVLM---FVDDIYEDLELWYPRLRLEEEGATVVVAGPEKGRSYAGKH------ 82 Query: 58 TETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 +A +AA+ D L++PGGF + D ++ Sbjct: 83 -----------SYPCVADAAIADMNAADFDLLVIPGGFA-----------PDKLRRDPKV 120 Query: 118 KALAQAMHQAGKPLGFMCIAPAM-LPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVD 176 L + MH AGK + +C A + + R T + + L GA Sbjct: 121 LELTRQMHHAGKIVAHICHAGWIPISAGIMRGYRCT--STPGIKDDLINAGALWEN---S 175 Query: 177 DIVVDEDNKIVTTP 190 ++VVD + P Sbjct: 176 EVVVDRNQISSRKP 189 >UniRef50_Q03XP7 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme, amidase family n=2 Tax=Leuconostocaceae RepID=Q03XP7_LEUMM Length = 168 Score = 92.5 bits (228), Expect = 8e-18, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 56/189 (29%), Gaps = 38/189 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V+++ D E E + A+ SGA + + V + + Sbjct: 3 KKVAVLVT-----DLVEDIEFTDPVKALKESGASVTTISFSTEAVTGKHGTKIDID---- 53 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + + ++ DAL +PGGF + D Sbjct: 54 --------------KSIGDVAPSDFDALFIPGGF-----------SPDQLRADERFVNFT 88 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + K L +C P ++ L T+ L GA PV V+D Sbjct: 89 KHFLATNKSLFSICHGPQLMIPT-GLVLGRTMTAYRTVQPDLYYAGARIENKPV---VID 144 Query: 182 EDNKIVTTP 190 + P Sbjct: 145 GNLITSREP 153 >UniRef50_B9ZSD7 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=B9ZSD7_9GAMM Length = 224 Score = 92.5 bits (228), Expect = 8e-18, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 61/201 (30%), Gaps = 34/201 (16%) Query: 10 GCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAAR 69 G G G E + ++G + +P V ++ + E+ + AA+ Sbjct: 16 GDGHATGVWFDEFSVPYDRFRKAGFEIKVASPRGGPVP-LDPKSLESNHPGPAAV--AAQ 72 Query: 70 ITRGEIRPLAQ-ADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAG 128 + L + A+ PGG G +L + ++ L + Sbjct: 73 EALDDTIHLDEGMHHDAYAAIFFPGGHGTMFDLPE----------NPHVQRLVAEFLEND 122 Query: 129 KPLGFMCIAPA-MLPKIFDFPLRLTIGTDIDT-------------------AEVLEEMGA 168 K +G +C PA ++ + G + + L E+G Sbjct: 123 KVVGAVCHGPACLVGAMLKDGSPAVKGRKVAAFTNSEEKAVQLDQAVPFLLQDRLAELGG 182 Query: 169 EHVPCPVDDIVVDEDNKIVTT 189 E V D K+VT Sbjct: 183 EVETAEDWADHVVVDGKLVTG 203 >UniRef50_C5NVZ3 DJ-1 family protein n=1 Tax=Gemella haemolysans ATCC 10379 RepID=C5NVZ3_9BACL Length = 191 Score = 91.7 bits (226), Expect = 1e-17, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 67/191 (35%), Gaps = 38/191 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ + V +GSE E + L + R+ Q + + ++ +H Sbjct: 3 KKVALF-----VENGSEELELIAPLDILRRANIQVDLISANNEEYITSSH---------- 47 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + I +I + + DA+++PGG + L + + ++ Sbjct: 48 ----DVKIIVDKKINDIDNI--LDYDAIVIPGGMPGSTLLRD----------NDKIIKFY 91 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q M+ AGK + +C AP +L K +T D + ++ Sbjct: 92 QEMYNAGKLVAAICAAPIVLSKAGILEDKEVTSYPGFDKEINCK------TYDKEKAVIA 145 Query: 181 DEDNKIVTTPA 191 D++ PA Sbjct: 146 DKNVITAQGPA 156 >UniRef50_Q21F36 ThiJ/PfpI n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21F36_SACD2 Length = 366 Score = 91.7 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 36/198 (18%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAA--RITRG 73 G E+ E + +G + + + ++ + + M + + A + Sbjct: 80 GYELTELSRAYYVLQANGFEVDVASTQGGKPKMV--IDTDDMGQHDYAFLNDAIAQTKIT 137 Query: 74 EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGF 133 P+ Q A++ DA+ GG GA + N + ++ L + M+Q GK + Sbjct: 138 NTIPINQVSASQYDAIYFVGGKGALFDFPN----------NTAIQTLVRDMYQQGKVIAA 187 Query: 134 MCIAPAMLPKI-------FDFPLRLTIGTDID---------------TAEVLEEMGAEHV 171 +C PA L + ++T T+ + L++ GA Sbjct: 188 ICHGPAALVNVKLANGNFLLADKQVTSFTNEEELFLMPNAAKVFPFLLESELKQRGARFY 247 Query: 172 PCPVDDIVVDEDNKIVTT 189 P+ + D+K++T Sbjct: 248 AGPLYLDNLITDDKLITG 265 >UniRef50_C0WIV6 C56 family peptidase n=8 Tax=Actinomycetales RepID=C0WIV6_9CORY Length = 243 Score = 91.7 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 71/215 (33%), Gaps = 32/215 (14%) Query: 7 ILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIE 66 + G G E + +G P+ + V++ + E ++E E Sbjct: 34 LADGSARPTGYWAEELIAPHRVFKNAGWDIDFATPNA-KAPVVDEYSLEVLSEGDRAEQE 92 Query: 67 A--ARIT--RGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 A I + LA A+ D + PGG G ++L+ D++ L Q Sbjct: 93 NYLAEIGPDLEDPLNLADVKEADYDLVFYPGGHGPMEDLA----------YDQDSAKLLQ 142 Query: 123 AMHQAGKPLGFMCIAPAMLPK-----------IFDFPLRLTIGTDIDTAEV------LEE 165 ++G+ L +C APA L G + A L E Sbjct: 143 ERIESGRALSLVCHAPAALLALDNDNWPLKGYTMTGFTNAEEGEETIAAAKWVVETRLRE 202 Query: 166 MGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAE 200 +GA+ V+ D + T ++ +A+ Sbjct: 203 LGADFKQTDPMQPYVEVDRNLYTGQNPASSEPLAQ 237 >UniRef50_Q0SMN5 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis protein n=21 Tax=Borrelia RepID=Q0SMN5_BORAP Length = 184 Score = 91.3 bits (225), Expect = 2e-17, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 73/211 (34%), Gaps = 47/211 (22%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNV 63 +G+IL+ +G E EA++ + + R + + NV Sbjct: 3 VGIILA-----NGFEDIEAIVPIDILRRGNVNIQVISINDN-----------------NV 40 Query: 64 LIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQA 123 + + ++ ++ + D +I+PGG A NL N +EL + + Sbjct: 41 VTSSKGVSFLTDDIISNCNENRFDLIILPGGMPGATNLFN----------SKELDLILKD 90 Query: 124 MHQAGKPLGFMCIAP--AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 M+ GK + +C +P + K + T +EE V D V Sbjct: 91 MNAKGKFIAAICASPVVVLAAKGLLGFNKFTCY------PGMEE---NVVDGEFVDKNVV 141 Query: 182 EDNKIVTT----PAYMLAQNIAEAASGIDKL 208 N +T+ ++ A + E G + Sbjct: 142 RSNNFITSKGVGTSFEFAFTLLEIVKGKQIM 172 >UniRef50_C4RK05 Intracellular proteinase I n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RK05_9ACTO Length = 204 Score = 91.3 bits (225), Expect = 2e-17, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 59/188 (31%), Gaps = 25/188 (13%) Query: 5 GVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVL 64 ++L+G D +E + + + G + + V ++ H + Sbjct: 19 ALLLTG----DAAEELDTMYPYYRVQEGGWDVDVSSRTLRDVQLVIH----EFDPNSDAY 70 Query: 65 IEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAM 124 +E P A D DALI+PGG VD +++ + + Sbjct: 71 VEKNGRKLPVDVPWADVDVERYDALIIPGGRAPEW-----------IRVDADVRRITEHF 119 Query: 125 HQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDN 184 P+ +C + ++ +E GA + P VVD Sbjct: 120 FARNLPIALVCHGAQVPA-VYGLLKGRKTACFPPITGDMENAGATVIDAPD---VVD--G 173 Query: 185 KIVTTPAY 192 +V+ + Sbjct: 174 NLVSCRGW 181 >UniRef50_A7HFM3 ThiJ/PfpI domain protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HFM3_ANADF Length = 194 Score = 91.3 bits (225), Expect = 2e-17, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 60/190 (31%), Gaps = 37/190 (19%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ +++ DG E E T + R G + Sbjct: 8 RVAILI-----EDGFEQVEDDGTAESARRGGRRYE----------------DRLAEAAEG 46 Query: 63 VLIEAARITRGEIRP--LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +E R+ R A A+ DAL++PGG ++ A Sbjct: 47 ARLEVHRLGRRVPDRRFPRSAVPADFDALLLPGG----------VLNPDRLRIEPRAVAF 96 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 A++ A KP+ +C P + + I + L GAE V ++VV Sbjct: 97 AKSFFDAAKPVAAICHGPWTVIET-GAARGRRIASWPSLKTDLRNAGAEWVDR---EVVV 152 Query: 181 DEDNKIVTTP 190 D + + P Sbjct: 153 DGNLVLSRKP 162 >UniRef50_A0K1V5 ThiJ/PfpI domain protein n=6 Tax=Actinomycetales RepID=A0K1V5_ARTS2 Length = 233 Score = 90.9 bits (224), Expect = 2e-17, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 73/225 (32%), Gaps = 47/225 (20%) Query: 1 MKKIGVILS---------GCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH 51 M I +++S G G E V++ + +G P ++ Sbjct: 1 MANILMVVSAADSLTMRDGTQHPTGYWAEELVVSHQTLRDAGHTVHIATP-GGAKPTVDE 59 Query: 52 LTGEAMTETRNVLIEAARITRGEIRP-------LAQADAAELDALIVPGGFGAAKNLSNF 104 ++ A + + R +I LA D + DA+++PGG G +L Sbjct: 60 VSLAAESAGGQDRADGFREYLAKIDAELSAPLVLADVDPSGYDAVVMPGGHGPMADL--- 116 Query: 105 ASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDF-------PLRLTIGTDI 157 D +L + +AGK + C PA L D LT+ +D Sbjct: 117 -------YQDADLGRILAEADRAGKVIAPFCHGPAGLLSAVDGDGKFAFAGRHLTVFSDD 169 Query: 158 D-------------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 + +VL+E GA V D ++T Sbjct: 170 EELSGGTGPNTPWFVEDVLKEKGAIVENGAAWGSNVVRDRNLITG 214 >UniRef50_A0B7B8 DJ-1 family protein n=3 Tax=cellular organisms RepID=A0B7B8_METTP Length = 182 Score = 90.9 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 68/210 (32%), Gaps = 40/210 (19%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K+ V L+ +G E E V + + R+G V + + T Sbjct: 2 KVVVPLA-----EGFEEIEFVTVVDILRRAGIDVEIAGLRDGPVQGSHGVRVIPDTT--- 53 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + D DA+++PGG NL D + + Sbjct: 54 ---------------FDKVDLNSADAIVLPGGNPGFINLGK----------DERVLDAVR 88 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 M AGK + +C AP++L K + G +E A + +VVD Sbjct: 89 KMSAAGKYVAAICGAPSVLVKA-----GVLSGRMATVHPAGKEEVAACARYMDERVVVDG 143 Query: 183 DNKIVTTP--AYMLAQNIAEAASGIDKLVS 210 P A A + E +G + +++ Sbjct: 144 KMVTSQGPGTAMDFALKLVELLAGKEAMLN 173 >UniRef50_B1YM90 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YM90_EXIS2 Length = 218 Score = 90.9 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 53/169 (31%), Gaps = 18/169 (10%) Query: 1 MKKIGVILSGC----GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 M KI ++ + G G E +G + V + + Sbjct: 1 MSKILIVSTSADDMNGHKTGLWFEEFAAPYNLFKEAGHDVTVVSVKGGDVP----IDKAS 56 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 M + + AR + PLA D A DA+ PGG GA + N + Sbjct: 57 MVKEILPKFQEARRALHDTTPLADVDPASFDAVYFPGGHGAVVDFPN----------NPL 106 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEE 165 + +AM + + +C PA + G I+ EE Sbjct: 107 VAGAIEAMVRKDGVVASVCHGPAAFAHVTIDGKPFVSGRQINGFTDEEE 155 >UniRef50_C2BGT7 Possible transcriptional regulator n=2 Tax=Anaerococcus RepID=C2BGT7_9FIRM Length = 193 Score = 90.9 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 63/187 (33%), Gaps = 38/187 (20%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 KI +L+ DG+E E + + + R+ + + + V +H Sbjct: 2 KILELLA-----DGNETIELLTVVDYLRRADIKIDMVSTTGSKDLVTSH----------- 45 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + L + + + D + +PGG A+ L + D + + + Sbjct: 46 ------GVRYQADYLLEEINPEDYDGVYIPGGTKGAETLRD----------DDRVIEIVK 89 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A K + +C P +L K + + L+ +G +DD +V Sbjct: 90 KFEAANKLIAAICAGPIVLDKA-GVLADKKATSFPTIKQELKNVG-----EYMDDEIVVT 143 Query: 183 DNKIVTT 189 D + T Sbjct: 144 DGNVTTG 150 >UniRef50_Q72HB0 Putative amidotransferase n=2 Tax=Thermus RepID=Q72HB0_THET2 Length = 166 Score = 90.5 bits (223), Expect = 3e-17, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 71/201 (35%), Gaps = 48/201 (23%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M++IG++L+ D + E + + +G + P+ ++ Sbjct: 1 MRRIGILLA-----DLFDEREFLYPYYRVQEAGYAPMVLGPEAREYRA------------ 43 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 ++ A GE PL L++PGGF E+ AL Sbjct: 44 KSGFAWKAEAAAGEAPPLE--------GLLIPGGFA-----------PDYLRRSPEVLAL 84 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + + GKPLG +C P +L + + LE G + + +VV Sbjct: 85 VRKVAEEGKPLGAICHGPWVLVSA-GLVRGRKVTGFFSIRDDLENAGGLYRE---EGVVV 140 Query: 181 DEDNKIVTT------PAYMLA 195 D +VT PA+M A Sbjct: 141 D--GNLVTAQGPKDLPAFMRA 159 >UniRef50_C4L1D0 Intracellular protease, PfpI family n=2 Tax=Bacteria RepID=C4L1D0_EXISA Length = 174 Score = 90.5 bits (223), Expect = 3e-17, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 61/200 (30%), Gaps = 45/200 (22%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ ++S D E E + + GA + Sbjct: 6 KKVLQLVS-----DDFEDLELWYPVHRLREEGAHVILAGEKAD----------------- 43 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + I + D DA++VPGG+ +K Sbjct: 44 HAYIGKYGVPAKSDVAFDDVDITSFDAILVPGGW-----------SPDLLRRFDSVKGFV 92 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + M + +P+G +C A +L + +T + + +E GA + VV Sbjct: 93 RYMDEQKRPIGQICHAGWVLISANILDGVNVT--STPGIKDDMENAGAIWHD----EPVV 146 Query: 181 DEDNKIVTT-----PAYMLA 195 + + + + P YM A Sbjct: 147 VDGHIVSSRRPPDLPDYMRA 166 >UniRef50_A8P0K7 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P0K7_COPC7 Length = 197 Score = 90.5 bits (223), Expect = 4e-17, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 70/210 (33%), Gaps = 32/210 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAV-CFAPDKQQVDVINHLTGEAMTE 59 M V+L+ DG+E E +T + R+G Q V F P + + + A Sbjct: 1 MPSAVVLLA-----DGTEEMEFTITYDTLVRAGVQVVSAFVPAQSPGASV---SPPAAKC 52 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 +R V RI + + D L++PGG A +S + ++ Sbjct: 53 SRGV-----RILPDSYLDPTECGPDKHDLLVIPGGAVGAATMS----------ANATVQK 97 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 L QA K +G +C +T + LE D +V Sbjct: 98 LIQAYLDKKKYVGMICAGSLAARTAKLPKQPIT--SHPSVRGDLEAD----FEYSEDPVV 151 Query: 180 VDEDNKIVTTP--AYMLAQNIAEAASGIDK 207 V P A+ A + E G +K Sbjct: 152 VSGTLVTSRGPGTAFPFALTLVELLCGKEK 181 >UniRef50_Q4V0N9 NonF-related protein n=5 Tax=cellular organisms RepID=Q4V0N9_XANC8 Length = 225 Score = 90.2 bits (222), Expect = 4e-17, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 75/234 (32%), Gaps = 47/234 (20%) Query: 2 KKIGVILSGCGV------YDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE 55 K + +L+ G G E + + ++G +P E Sbjct: 4 KHVLFVLTNAGQIGPHQRPTGYFFPEVAHPVEVLEQAGIAIEFASPAGGAAP-------E 56 Query: 56 AMTETRNVLIEAARITRG-----EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSE 110 + + A R +R R L++ D + DA+ VPGG G ++S Sbjct: 57 DGYDASDAAQLAFRHSRAFGRMAHSRKLSEVDVRDYDAVFVPGGLGPMVDVSG------- 109 Query: 111 CTVDRELKALAQAMHQAGKPLGFMCIAPAML-------PKIFDFPLRLTIGTDI------ 157 +R+++AL + +A + +C P+ L R+T + Sbjct: 110 ---NRDVQALIKQAWEADMLVAAVCHGPSALLGITLDDGTALVQGRRVTGFSTAEEDGYA 166 Query: 158 ------DTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGI 205 D L E GA +V V D ++T A +A A Sbjct: 167 RADVPFDLESALREEGALYVAGADWQPHVVVDGTLITGQNPASAGPLAHALVAA 220 >UniRef50_Q24FT7 DJ-1/PfpI family protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FT7_TETTH Length = 204 Score = 90.2 bits (222), Expect = 5e-17, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 65/193 (33%), Gaps = 18/193 (9%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMT-E 59 M + ++++G D E +E ++ + G P+K+ D + + E E Sbjct: 1 MSQKILLITG----DFGEDYEVMVPFQVLHAIGYTVHTVCPNKKAGDYVTCVVEEGGEIE 56 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 E Q E AL++ GG D + Sbjct: 57 KFQTYTEKIGHRFFLNYDFDQVKPEEYYALVLAGG-----------RAPEYLKYDPSVLK 105 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 L + + K + +C +L + + +G T+ + G + ++D + Sbjct: 106 LVKHFTDSKKSILVICHGYQILCALQGCIEGIVLGGPTPTSYEITNAGGIYQQIKMEDAL 165 Query: 180 VDEDNKIVTTPAY 192 + N ++TPAY Sbjct: 166 L--YNNFISTPAY 176 >UniRef50_B2GGQ1 ThiJ/PfpI family protein n=6 Tax=Actinomycetales RepID=B2GGQ1_KOCRD Length = 260 Score = 89.8 bits (221), Expect = 6e-17, Method: Composition-based stats. Identities = 47/240 (19%), Positives = 87/240 (36%), Gaps = 44/240 (18%) Query: 1 MKKIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTG 54 MKKI ++L+ G G + EA +G + Q + + Sbjct: 6 MKKILLVLTSQDTLGDTGEATGYNVAEAAHPWKVFRDAGYFVDFASIAGGQPPQ-DAVQQ 64 Query: 55 EAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVD 114 + + E A+ + ++ D A+ DA+ + GG G + T + Sbjct: 65 DDPVQVEFTQDETAKASLYNTPKVSVVDPAQYDAVYLVGGHGTMWDF----------TGN 114 Query: 115 RELKALAQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDT-------------- 159 R+L+ L A++ AG +G +C P+ L + + L G + Sbjct: 115 RDLQKLVAAVYDAGGVVGAVCHGPSGLVDVELANGVNLLSGRKVAAFTTAEEEEVGKKDT 174 Query: 160 -----AEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLV 214 E LEE GA + V D ++VT ++A+G+ K ++++L Sbjct: 175 VPYLLQERLEEQGATVKVAENWEENVQVDERLVT-------GQNPQSAAGVAKEMTKLLT 227 >UniRef50_B2IWY8 Intracellular protease, PfpI family n=3 Tax=Cyanobacteria RepID=B2IWY8_NOSP7 Length = 365 Score = 89.8 bits (221), Expect = 7e-17, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 62/189 (32%), Gaps = 36/189 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ +++ E E + + ++G + V + G+ + Sbjct: 11 KKVAILIEQA-----VEDAEFTVPYNGLKQAGIEVVVLGSRMNEK--YKGKRGKLSIQAD 63 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 EA AA+ DA+I+PGG + Sbjct: 64 GTTTEAI--------------AAQFDAVIIPGGMA-----------PDRMRRNINTVRFV 98 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 Q Q GK + +C P +L + + G I A+ + GA+++ PV VVD Sbjct: 99 QEAIQQGKLVAAVCHGPQVLIEGNLLKGKQATGF-IAIAKDMINAGAKYLDEPV---VVD 154 Query: 182 EDNKIVTTP 190 + P Sbjct: 155 GNLITSREP 163 >UniRef50_C9RKY9 DJ-1 family protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RKY9_FIBSS Length = 181 Score = 89.4 bits (220), Expect = 8e-17, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 68/201 (33%), Gaps = 40/201 (19%) Query: 4 IGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNV 63 I V+L+ DG E E V+ + R+G + + DV++ L G Sbjct: 3 ILVLLA-----DGFEETEFVVPVDLWRRAGFKVTVASVSG--ADVVDGLHG--------- 46 Query: 64 LIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQA 123 I L++ + + DA+ +PGG +NL ++ + Sbjct: 47 ------IKVQADVALSKLEPTDFDAVFLPGGGVGVQNL----------KASAAVENTVCS 90 Query: 124 MHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 ++ K + +C AP +L K + T +T V E + +VVD Sbjct: 91 LNDDNKWVLAICAAPTVLSKARILVDRKATCYPGCETDLVCREF-------SEERVVVDG 143 Query: 183 DNKIVTTPAYMLAQNIAEAAS 203 + P + A Sbjct: 144 NIVTSRGPGTAEEFALKCIAV 164 >UniRef50_B0N2E2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0N2E2_9FIRM Length = 186 Score = 89.4 bits (220), Expect = 8e-17, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 71/214 (33%), Gaps = 41/214 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M+K V+ V DG E E V + + R+G + F +Q V + + Sbjct: 4 MRKAAVL-----VVDGYEESETVTIVDLLRRAGIECHTFGFAEQYVRGMQGM-------- 50 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 ++ + + D L++PGG NL + + + Sbjct: 51 -----------MIKVDKIFSDEIKNYDMLVLPGGRPGGVNL----------GANPLVIEM 89 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q ++ GK L +C +L K + G +++ G E V V V Sbjct: 90 VQYYNENGKYLAAICSGTIVLSKARVIDGKNVTGYTGYADKLV---GGEFVDKVV---VF 143 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLV 214 D++ PA + + + VS + Sbjct: 144 DQNIITSQGPATSYPFAF-KIIEVLGQDVSEMKE 176 >UniRef50_D2R736 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=D2R736_9PLAN Length = 190 Score = 89.0 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 62/215 (28%), Gaps = 27/215 (12%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M I + L D +E + + G + + P+ + + H + Sbjct: 1 MPTILMPL-----GDATEALDTFYPFFRLQEEGYKVIVCGPEARLYHTVLHEIPPDSSIP 55 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 ++ E ++ D + + GG D++L L Sbjct: 56 WDITQERPGYFIRSTAAFRDLKGSDCDGMFISGG-----------RAPEYIRYDKDLLRL 104 Query: 121 AQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 ++ AGKP+ +C + ++T +E+ G +V Sbjct: 105 VNEVNDAGKPIASVCHGVEILTAANIIQGKKVTTVA--KCKLDVEQGGGTYVNEE----- 157 Query: 180 VDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLV 214 V +V++ + + ++ L Sbjct: 158 VVLAGNLVSSRTWHDNAPLMREFL---AMIKANLK 189 >UniRef50_B0NJL3 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NJL3_EUBSP Length = 187 Score = 89.0 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 74/217 (34%), Gaps = 39/217 (17%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ VIL+ DG E EA+ + + R+ + ++ H Sbjct: 4 MKKVSVILA-----DGFEEIEALTAVDLLRRAQIYVDTVSITEEYTVHGAH--------G 50 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 NV E + + E D +++PGG NL ++ + Sbjct: 51 INVQTE---------DLFEEVNFVESDMIVLPGGMPGTLNLD----------AHSGVRRV 91 Query: 121 AQAMHQAGKPLGFMCIAPAMLPK-IFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + + GK +G +C P +L R+T + ++ VP VD+ V Sbjct: 92 VKDFFEEGKYIGAICAGPTVLANLGLLKGKRITC--HPTVEQDIQGAVITKVPVTVDNNV 149 Query: 180 VDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 + A A + E +G DK + + Sbjct: 150 ITGRG---AGAAVDFALKLIEVLAGSDKA-KEIGEMI 182 >UniRef50_B4PML8 GE10903 n=8 Tax=Neoptera RepID=B4PML8_DROYA Length = 187 Score = 88.6 bits (218), Expect = 1e-16, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 80/212 (37%), Gaps = 37/212 (17%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K +++ + G+E E ++ + R+G + + GE + + Sbjct: 1 MSKSALVI----LAPGAEEMEFIIAADVLRRAGIKVTVAGLNG----------GELVKCS 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R+V I LAQ A + D +++PGG G + ++ + + + Sbjct: 47 RDVQILPD-------TSLAQVAADQFDVVVLPGGLGGSNAMAESSV----------VGDI 89 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + +G + +C AP +L K + ++ + L + + +V Sbjct: 90 LRRQESSGGLIAAICAAPTVLAKHGIASGK-SLTSYPSMKPQLLD---NYSYVDDKTVVK 145 Query: 181 DEDNKIVTTP--AYMLAQNIAEAASGIDKLVS 210 D + P AY A IAE +G +K++ Sbjct: 146 DGNLLTSRGPGTAYEFALRIAEELAGKEKVLE 177 >UniRef50_Q0BE72 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria RepID=Q0BE72_BURCM Length = 234 Score = 88.6 bits (218), Expect = 1e-16, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 69/209 (33%), Gaps = 31/209 (14%) Query: 16 GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEI 75 G + E ++ +G +P + V + L + A R Sbjct: 27 GFYLSEVTHPHRVLADAGHAVDFVSPKGGKTHV-DGLDLDDPINAAFWNNAALRGATENA 85 Query: 76 RPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMC 135 AQ D A+ GG + + + EL A+A +++ G + +C Sbjct: 86 LAPAQVDPDAYAAIFYAGGHATMWDFPD----------NAELSAIAARIYERGGVVAAVC 135 Query: 136 IAPA-MLPKIFDFPLRLTIGTDIDT-------------------AEVLEEMGAEHVPCPV 175 PA ++ L G D+ A+ L+E GA HVP P Sbjct: 136 HGPAGLVNLKLSDGRYLVAGKDVSAFTNDEERAVGLYDTVPFLLADALQERGARHVPAPN 195 Query: 176 DDIVVDEDNKIVTTPAYMLAQNIAEAASG 204 V +++VT A+ +AEA Sbjct: 196 FQAQVVVSDRLVTGQNPASAKGVAEAMLS 224 >UniRef50_A8UMK5 DJ-1/PfpI family protein n=1 Tax=Flavobacteriales bacterium ALC-1 RepID=A8UMK5_9FLAO Length = 412 Score = 88.6 bits (218), Expect = 1e-16, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 68/217 (31%), Gaps = 42/217 (19%) Query: 3 KIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 KI +++ G G E+ E +G + +P + ++ + + Sbjct: 109 KILAVVTSTETMGNSGKSTGYELTELSRAYYVFEANGFEVDVASPLGGKPPIV--IDDDD 166 Query: 57 MTETRNVLIEA--ARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVD 114 M + + A+ + A + A+ GG GA + + Sbjct: 167 MGQFDYAFLNDSIAQYKTSHTIAVNNVKAEDYQAIFFVGGKGAMFDFPE----------N 216 Query: 115 RELKALAQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDT-------------- 159 + ++ + + +Q+ K +G +C PA L + D L I + Sbjct: 217 KAIQDIVKMYYQSDKVVGAVCHGPAALVNVTLDNGRHLLENKTISSFTNKEELLLIPEAE 276 Query: 160 -------AEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 + L GA+ + V D +VT Sbjct: 277 SIFPFLLQDKLAAQGAQFNEGAMYLNKVSHDKNLVTG 313 >UniRef50_A2BLI2 Protease I n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BLI2_HYPBU Length = 194 Score = 88.6 bits (218), Expect = 1e-16, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 58/203 (28%), Gaps = 32/203 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFA-------PDKQQVDVINHLT 53 M + I V + E + G + + P Sbjct: 1 MPRALFI-----VEPDFDDLEFFYAYHRLLEEGFEIDIASHAKYSDVPRYDPQTGRLEPR 55 Query: 54 GEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTV 113 + R +EA R + L D L++PGG Sbjct: 56 PLKIKGKRGFEVEATLSYREAVERL-----DSYDVLVIPGG-----------RSPERARQ 99 Query: 114 DRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPC 173 RE +A+ M + GKP+ +C P +L R G + L GAE+V Sbjct: 100 HREAVEIARRMAEKGKPIIAICHGPLLLASASVIRGRRVTG-YPGIKDDLVNAGAEYVDA 158 Query: 174 PVDDIVVDEDNKIVTTPAYMLAQ 196 V+D + V + M Sbjct: 159 G---AVLDGNIVTVRHTSSMGEG 178 >UniRef50_D0BJF4 DJ-1 family protein n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BJF4_9LACT Length = 183 Score = 88.6 bits (218), Expect = 1e-16, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 66/209 (31%), Gaps = 40/209 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK+ ++L+ G E EA+ + + R+G + D + ++LT E Sbjct: 1 MKRAAIVLTT-----GFEEIEAIAPMDILRRAGVEVDIVGVDAKVATGSHNLTISTDKEL 55 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 V+ E D +I+PGG AK L N + ++ Sbjct: 56 VEVMNEL------------------YDIVILPGGMPGAKLLKN----------HQAVQDF 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + GK + C AP + + + + + D + Sbjct: 88 VKRHYDVGKLVAANCAAPIAIENSGALKNCH-YTCYPGFEKQIVD--GTYTG---DFVHQ 141 Query: 181 DEDNKIVTTPAYMLAQNIAEA-ASGIDKL 208 D + PA + A GID Sbjct: 142 DGRVITGSGPAAAFEFSYTIVGALGIDAA 170 >UniRef50_Q311F3 Peptidase C56, PfpI n=2 Tax=Bacteria RepID=Q311F3_DESDG Length = 174 Score = 88.2 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 69/190 (36%), Gaps = 39/190 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI + V + E E + + GA+ V P TG+ Sbjct: 7 KKILMF-----VEEYYEDLELWYPKIRLQEEGAEVVVAGP----------ATGKVYKGKN 51 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 EA +A DA D L++ GG+ + D + + Sbjct: 52 GYPCEAD-------VAIADVDAGGYDGLVLCGGWA-----------PDKLRRDPRVLEIT 93 Query: 122 QAMHQAGKPLGFMCIAPAM-LPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + +H AGKP+ +C A + + +R T + + L+ GA+ V ++VV Sbjct: 94 RTIHDAGKPVAHICHAGWVPISAGVMKGIRCT--SVNAIRDDLQNAGAQWVD---QEVVV 148 Query: 181 DEDNKIVTTP 190 D+++ TP Sbjct: 149 DKNHITSRTP 158 >UniRef50_Q13FG4 Peptidase C56, PfpI n=1 Tax=Burkholderia xenovorans LB400 RepID=Q13FG4_BURXL Length = 227 Score = 88.2 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 50/172 (29%), Gaps = 32/172 (18%) Query: 19 IHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIRPL 78 E +L + +GA + PD TR V + Sbjct: 18 ELELWHPVLRLREAGANVLLVGPD-----------------TRLVYSSKLGYPARPDLSI 60 Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 A + DA+++PGGF + + M G+ + +C A Sbjct: 61 DDVKAEDFDAVVIPGGFA-----------PEGMRRHPAMIEFVREMDAQGRLIAAICHAG 109 Query: 139 AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 +L + + + GA +V + +V+D + P Sbjct: 110 LVLASA-QIARNRKLTCVSLVKDDVINAGANYVN---EGLVIDHNIITSQLP 157 >UniRef50_B8FFC8 ThiJ/PfpI domain protein n=2 Tax=Proteobacteria RepID=B8FFC8_DESAA Length = 199 Score = 88.2 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 64/201 (31%), Gaps = 32/201 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ + L+ G E +EA + + V + Sbjct: 1 MKKVLLFLAQ-----GFEEYEAAVFTDVL-------------GWSRVVGDVPVEVVTAGL 42 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R + + L D AE DAL +PGGF A + + Sbjct: 43 RPEIQCTWSLIVKPQAQLKDLDLAEFDALAIPGGFQRAGFYED--------AYHEDFLEA 94 Query: 121 AQAMHQAGKPLGFMCIAPAMLPK-IFDFPLRLTIGTDIDT--AEVLEEMGAEHVPCPVDD 177 + + GKP+ +C+ + K T ++ + L E GA + V Sbjct: 95 VRHFDKTGKPIAAICVGAMPVGKSGVLTGRNATTYHLVNARRRKQLAEFGAVVLDQHV-- 152 Query: 178 IVVDEDNKIVTTPAYMLAQNI 198 VVD + T PA L + Sbjct: 153 -VVDRNIITSTGPATGLEVAL 172 >UniRef50_B5YN08 Predicted protein n=2 Tax=Bacillariophyta RepID=B5YN08_THAPS Length = 247 Score = 88.2 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 74/224 (33%), Gaps = 34/224 (15%) Query: 6 VILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE---AMTETRN 62 +I + G G I E ++G + +P+ V + GE + Sbjct: 31 LISTLKGHKTGLWIEELAAPYYEFKKAGYEVEIASPEGGAVPIDAASLGEGFFTEPAKKF 90 Query: 63 VLIEAARITRGEIRPLAQAD-AAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + A LA D + DA+ + GG G + ++ + LKA Sbjct: 91 MHDAEAIGMLSHSTKLASIDFSTAADAIFICGGHGTCTDFAD----------NGVLKAAI 140 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID-------------TAE 161 + M+ + K + +C P LP+ +T D + Sbjct: 141 ETMYSSDKIVSAVCHGPVSLPQCNKPDGTPLVKDKVVTGFKDSEELAVQLEKLVPFMLET 200 Query: 162 VLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGI 205 L+E+G ++ + V D K+VT ++ A GI Sbjct: 201 KLKELGGKYESADDWNSKVCVDGKLVTGQNPQSSEACAAVVIGI 244 >UniRef50_B5ZFI3 ThiJ/PfpI domain protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=B5ZFI3_GLUDA Length = 267 Score = 87.8 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 69/221 (31%), Gaps = 42/221 (19%) Query: 3 KIGVILSGC------GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDV----INHL 52 KI +I + G G + E A +GA+ + V + + + Sbjct: 42 KILMIATSADRLGQGGHATGVWLEELTTPFYAFQDAGAEVTLASIAGGTVPIDTRSVKEV 101 Query: 53 TGEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECT 112 + R + A R T DAA DA+ +PGG G + Sbjct: 102 GQNEASVDRYLADPALRQTVAATPKFTAIDAAGFDAVFLPGGHGTMTDYPE--------- 152 Query: 113 VDRELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID------- 158 + L L + + G+ + +C PA L F R++ D + Sbjct: 153 -NLALAHLIEQFDRNGRIVSAVCHGPAGLLTARKPDGTPFVAGRRVSAFADSEERAVGLE 211 Query: 159 ------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PA 191 + L +GA + P D ++T PA Sbjct: 212 HAVPFLLEDRLRALGAVYEQGPDFASFAISDGNLITGQNPA 252 >UniRef50_Q47L11 Peptidase C56, PfpI n=2 Tax=Bacteria RepID=Q47L11_THEFY Length = 179 Score = 87.8 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 65/209 (31%), Gaps = 33/209 (15%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M + VI++G G D H+ V T + G + V + Sbjct: 1 MSRSAVIITGPGFQD----HDVVYTYYRLLEEGWHVDVATKEAAPVTGKYGVPLPMDKTA 56 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + D DA+I+ GG D+++ A Sbjct: 57 ------------APLISFSDLDVNNYDAVILTGGH----------EAPDRVRQDQQVLAF 94 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 AM AGK + +C P ++ R I + + GAE V D++V Sbjct: 95 VAAMADAGKVVAGLCHGPWIMVSAGVLKGRRAC-AYIGLRDDVINAGAEVVD---SDVIV 150 Query: 181 DEDNKIVTTPAYMLAQNIAE-AASGIDKL 208 D I+T Y ++KL Sbjct: 151 D--GNIITCSYYAYVGAFMRAVFETVEKL 177 >UniRef50_C4G3X0 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3X0_ABIDE Length = 182 Score = 87.8 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 65/196 (33%), Gaps = 40/196 (20%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 + VI +G E E + + + R +T N Sbjct: 2 RAAVI-----FMNGFEEVEGLTVVDMLRRLDITCDIVG------------KTSEVTGAHN 44 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + I++ R+ L + + E +A+I+PGG A NL + D ++ + + Sbjct: 45 ITIKSDRL-------LEEIKSEEYNAVILPGGLPGATNLRD----------DDKVITILK 87 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 M+ GK + +C AP L + E ++ G + + +V D Sbjct: 88 EMNNEGKIVAAICAAPIALERA-GVLEGKEFTAYPGVGENIK--GGKFRE---ELVVKDG 141 Query: 183 DNKIVTTPAYMLAQNI 198 + PA + Sbjct: 142 NVITSRGPATSMEFAF 157 >UniRef50_C0W6P5 Possible transcriptional regulator n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6P5_9ACTO Length = 196 Score = 87.8 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 42/200 (21%), Positives = 74/200 (37%), Gaps = 33/200 (16%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V+++ G E EA+ L + R+G A + I H + + + + Sbjct: 10 KKVAVLVAP-----GLEEVEALAPLDILFRAGIPAHLIS--------ITH-SRQVTSSHQ 55 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 VL A + L++AD D + +PGG NL D ++ L Sbjct: 56 VVLSCTA-----TLDELSEADLDSYDMVFLPGGIPGTPNL----------KADARVRELV 100 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 +A +P+ +C AP++L + + +VL + GA+ V VVD Sbjct: 101 TQRVRADRPVAAICAAPSILAE-LGLLEGRRATANPSFVQVLADHGAQVSEASV---VVD 156 Query: 182 EDNKIVTTPAYMLAQNIAEA 201 A + + Sbjct: 157 GRLLTSRGMATAVDLGLEMV 176 >UniRef50_C5F0X7 Putative uncharacterized protein n=2 Tax=Helicobacter RepID=C5F0X7_9HELI Length = 183 Score = 87.8 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 76/203 (37%), Gaps = 37/203 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M KI V L G E EA+ + + R+G Q + + K ++V++ + + + Sbjct: 1 MVKILVPL-----GKGFEELEAISIIDVLRRAGCQVIIASL-KDNLEVLSQGGVKIIAD- 53 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +++ +A ++DA++ PGG+ +NL +EL+ L Sbjct: 54 ---------------VDVSKVEALKIDAVVFPGGWEGTENLIEC----------KELREL 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 M K + +C AP L K+ R + +E+M +++ Sbjct: 89 VLEMDSQRKIIAAICAAPYALFKMGVLKNR-----NFTCYPSIEKMIDNPNYQDSKNVIH 143 Query: 181 DEDNKIVTTPAYMLAQNIAEAAS 203 DE+ PA L + Sbjct: 144 DENIITSKGPATALEFAYYLVKT 166 >UniRef50_Q30R88 DJ-1 n=4 Tax=Epsilonproteobacteria RepID=Q30R88_SULDN Length = 185 Score = 87.8 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 63/193 (32%), Gaps = 38/193 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M ++ V L+ DG E EA+ + + R + + + ++ H Sbjct: 1 MSRVLVPLA-----DGFEEIEAISIIDVLRRGDIEVIVASLGSKKEVRGAH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + D + L+ +++PGG+ L + D ++ + Sbjct: 47 --------GVVVLADVEIQSVDVSALEMIVLPGGWKGTLALRD----------DENVQKI 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + M + K +G +C AP L T + E G + +V Sbjct: 89 LKIMDRDAKLIGAICAAPLALHSAGVLKHNYTC--YPSVEAQIREDG----FSDKEMVVQ 142 Query: 181 DEDNKIVTTPAYM 193 DE+ PA Sbjct: 143 DENVITSRGPATA 155 >UniRef50_B5E3I6 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis protein, putative n=33 Tax=Bacteria RepID=B5E3I6_STRP4 Length = 184 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 69/201 (34%), Gaps = 41/201 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+L+ G E EA+ + + R+ ++Q +TG + Sbjct: 1 MVKVAVMLAQ-----GFEEIEALTVVDVLRRANITCDMVGFEEQ-------VTGSHAIQV 48 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R + D ++ D +++PGG + +L + ++ L Sbjct: 49 RADHVF-------------DGDLSDYDMIVLPGGMPGSAHLRD----------NQTLIQE 85 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q+ Q GK L +C AP L + + E + + +V + +VV Sbjct: 86 LQSFEQEGKKLAAICAAPIALNQAEILKNKR-YTCYDGVQEQI--LDGXYVK---ETVVV 139 Query: 181 DEDNKIVTTPAYMLAQNIAEA 201 D P+ LA Sbjct: 140 DGQLTTSRGPSTALAFAYELV 160 >UniRef50_C1QDE2 Intracellular protease, PfpI family n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QDE2_9SPIR Length = 166 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 46/175 (26%), Gaps = 35/175 (20%) Query: 17 SEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIR 76 E E + G A +K ++ EA Sbjct: 11 FEDSELFYPYFRLIEEGIDVDIAALNKGEIKGEYFFKAEAK------------------L 52 Query: 77 PLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCI 136 ++ D + ALI+PGG + ++K + + +G +C Sbjct: 53 NFSEVDPSNYKALIIPGG-----------RAPEAIRGNDDVKRIIKYFVDNNLTIGAICH 101 Query: 137 A-PAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 ++ T I + L A + + +VV + P Sbjct: 102 GQQTLISAKVLEGKDATC--YIGIRDDLMNAKANYKD---EKVVVCGNIVTSRCP 151 >UniRef50_C8N9N9 ThiJ/PfpI family protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N9N9_9GAMM Length = 221 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 66/219 (30%), Gaps = 37/219 (16%) Query: 2 KKIGVILSG----CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM 57 K+I IL+ G + E + +G Q +P + V + L Sbjct: 3 KRILFILTSHDRKGDAPSGYYLSEVSHPYYILRDAGYQIDFASPKGGKTHV-DGLDLSDP 61 Query: 58 TETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDREL 117 A R A +A + A+ GG +L + L Sbjct: 62 DNAAFWNDAALRAQTENTLAPADINADDYAAIYYAGGHATMWDLPH----------SPAL 111 Query: 118 KALAQAMHQAGKPLGFMCIAPAML----------------PKIFDFPLRLTIGTDIDT-- 159 A+A ++ G +G +C PA L F +G Sbjct: 112 DAIAARIYDNGGVVGAVCHGPAGLLNIRLANGRYLIENKQVAAFTNDEERAVGLYDTVPF 171 Query: 160 --AEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PAYML 194 A+ L+E GA H+P V ++V+ PA Sbjct: 172 LLADALQERGATHLPAANFQAQVVISERLVSGQNPASAR 210 >UniRef50_C7PM17 ThiJ/PfpI domain protein n=2 Tax=Bacteria RepID=C7PM17_CHIPD Length = 223 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 67/217 (30%), Gaps = 38/217 (17%) Query: 2 KKIGVILSGCGVY------DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE 55 KKI I+S G + E + + Q + V +++LT Sbjct: 3 KKILFIVSNASFIGPNNRQTGVFLDEVAHPYVEFDDAHYQIDFASITGG-VPAVDNLTAS 61 Query: 56 AMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDR 115 + + + R L+ D + DA+ VPGG ++ D Sbjct: 62 EESSNARFIKDGGLAKMQHNRKLSDVDTSGYDAVFVPGGLAPMVDMPE----------DP 111 Query: 116 ELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDI----------- 157 LK + + +GK +G +C P L + +T T+ Sbjct: 112 LLKKVIAGFYDSGKIVGAVCHGPVSLLNVRLNDGSYLIAGKNITSFTNEEEDNYAKNDVP 171 Query: 158 -DTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PA 191 + L GA+ D ++VT PA Sbjct: 172 FELETALTNQGAKFHAAAPWSSNSIADGRLVTGQNPA 208 >UniRef50_Q5HN59 Uncharacterized protein SERP1413 n=71 Tax=Bacteria RepID=Y1413_STAEQ Length = 172 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 62/191 (32%), Gaps = 39/191 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ +IL+ D E E A+ +G + D NH E Sbjct: 3 KKVAIILA-----DEFEDIELTSPKEALENAGFETEVIG------DTANHEVVGKHGEKV 51 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELK--A 119 V + +A A DAL++PGGF D E + Sbjct: 52 TVDV-----------SIADAKPENYDALLIPGGF-----------SPDHLRGDEEGRYGT 89 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 A+ + P +C P +L D TI I+ + L GA V + +V Sbjct: 90 FAKYFTKNDVPTFAICHGPLVLVDT-DDLKGRTITGVINVRKDLSNAGANVVD---ESVV 145 Query: 180 VDEDNKIVTTP 190 VD + P Sbjct: 146 VDNNIVTSRVP 156 >UniRef50_D1AAI5 ThiJ/PfpI domain protein n=3 Tax=Bacteria RepID=D1AAI5_THECD Length = 226 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 74/220 (33%), Gaps = 35/220 (15%) Query: 8 LSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEA 67 L G G + EA +G + +P + + G+ + + +E Sbjct: 17 LGDTGRATGFFLPEAAEPWKVFRDAGYRVDLVSPRGGRPPMYGQDPGDPV---QREFLED 73 Query: 68 ARITRGEIRPL--AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMH 125 +I R Q D + A+ GG G + A+L + + ++ Sbjct: 74 PQIARALAATPRSDQVDPSGYAAIFYVGGHGTMWDFPQDAALAAAA----------RRIY 123 Query: 126 QAGKPLGFMCIAPAML-------PKIFDFPLRLTIGTDIDTA-------------EVLEE 165 +AG + +C PA L + LT T+ + A + LE+ Sbjct: 124 EAGGVVAAVCHGPAGLLPITLSDGRPLVEGRDLTSFTNEEEADQGLTDVVPFLLSDALEK 183 Query: 166 MGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGI 205 +GA H P V D +++T + +AEA Sbjct: 184 LGARHHGKPAYQANVVVDGRLITGQNPASSTGVAEAVVAA 223 >UniRef50_C7RFT2 DJ-1 family protein n=2 Tax=Anaerococcus RepID=C7RFT2_ANAPD Length = 194 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 67/189 (35%), Gaps = 38/189 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K V+++ +G+E E + + R G + + ++ + Sbjct: 1 MNKFLVLVA-----NGNETIEIFTVIDYLRRIGVKLDIASTEESKE-------------- 41 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 L + ++ + + + +PGG A + + + ++ L Sbjct: 42 ---LKTSQDVSFKADISFSDIKEEDYMGVYIPGGTKGAYAMRD----------NEKVLDL 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + AGK +G +C P +L + + + D + +++ G VD+ +V Sbjct: 89 LRRFNDAGKIIGAICAGPVVLNEAGILSDKK-ATSFPDMKDEMDQTG-----EYVDNEIV 142 Query: 181 DEDNKIVTT 189 D I T Sbjct: 143 VTDGNITTG 151 >UniRef50_A7NLT5 ThiJ/PfpI domain protein n=7 Tax=Bacteria RepID=A7NLT5_ROSCS Length = 174 Score = 87.5 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 42/195 (21%), Positives = 71/195 (36%), Gaps = 36/195 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I ++L+ +G E E + ++ + GA+ V D + V N L T Sbjct: 6 KRIAILLA-----EGVEDLEFYVPMMRLQEEGAEVVAAGLDLRPVRGKNGLEITPTTT-- 58 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 +A+ A +L AL++PGG+ + R + L Sbjct: 59 ----------------IAELRADDLFALVLPGGWA-----------PDKLRRYRAVTDLV 91 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 QAMH AGK +G +C ++ R G+ + + L GA V Sbjct: 92 QAMHAAGKVIGIICHGGSIAISAGIVGGRRATGS-LGIKDDLINAGATWVDEAAFRDENL 150 Query: 182 EDNKIVTT-PAYMLA 195 ++V PA+ Sbjct: 151 VWGRVVADIPAFCRE 165 >UniRef50_A6Q7R5 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosphate synthesis protein n=2 Tax=unclassified Epsilonproteobacteria RepID=A6Q7R5_SULNB Length = 186 Score = 87.1 bits (214), Expect = 3e-16, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 64/181 (35%), Gaps = 34/181 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCF-APDKQQVDVINHLTGEAMTE 59 M + + L+ G E EAV + + R G + D+ Q D++ G Sbjct: 1 MASVLIPLAK-----GFEELEAVALIDVMRRGGIEVRVAYLEDEMQSDLVLGANG----- 50 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 IT + + + D +++PGG+G L+ + ++ Sbjct: 51 ----------ITVKADTSIKNVISDDFDMMVLPGGWGGTYALAE----------NTRVQE 90 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 L + +A K +G MC AP L + R T E ++ G V+D Sbjct: 91 LLREF-KAKKIVGAMCAAPFALKQAGVLGERYT--AYPGAVEEIDHPGYVADEKVVEDGN 147 Query: 180 V 180 V Sbjct: 148 V 148 >UniRef50_C1XV98 Putative intracellular protease/amidase n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XV98_9DEIN Length = 167 Score = 87.1 bits (214), Expect = 3e-16, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 66/202 (32%), Gaps = 48/202 (23%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M+ IG++L D + E V + +G Q + P T EA Sbjct: 1 MRAIGILL-----EDYFDEREVVYPYYRVQEAGFQPLMIGPKPGLYHGKTPFTFEATVAA 55 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + A +L LIVPGGF + L Sbjct: 56 ------------------SAVKAGDLAGLIVPGGFA-----------PDRIRRSEAMVQL 86 Query: 121 AQAMHQAGKPLGFMCIAPA-MLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + + +A KPLG +C A ++ RLT + LE GA + + Sbjct: 87 IREIDRAQKPLGAICHAGWALISAGVVRGRRLTGFS--SIRIDLENAGALY-----QEER 139 Query: 180 VDEDNKIVTT------PAYMLA 195 V + +VT PA+M A Sbjct: 140 VVTEGHLVTAQHADDLPAFMQA 161 >UniRef50_C8NVB7 ThiJ/PfpI family protein n=1 Tax=Corynebacterium genitalium ATCC 33030 RepID=C8NVB7_9CORY Length = 227 Score = 87.1 bits (214), Expect = 4e-16, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 67/231 (29%), Gaps = 45/231 (19%) Query: 10 GCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM-----TETRNVL 64 G++ G + E ++++G + + D V I+ ++ + + Sbjct: 15 SSGIHTGMWLGEFTHFYDVLTKAGHEVDLASVDGGAVP-IDPVSLKTPVIQMGGTNKRYK 73 Query: 65 IEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAM 124 + + D D + + GG G + + + E+ A Sbjct: 74 DPEFMALLDDTPAITDVDLDSYDGIYLIGGHGTMFDFT-----------NEEVTAAVAHF 122 Query: 125 HQAGKPLGFMCIAPAML-------PKIFDFPLRLTIGTDIDTA-------------EVLE 164 A K + +C P L +T + + E L+ Sbjct: 123 ADADKIVSAVCHGPVGLLDVTLADGSSLLDGRNVTGYSWAEEKLAQRTAEVPFSLEEKLK 182 Query: 165 EMGAEHVP-CPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLV 214 E E+ V D K+VT M A G+ + V +L Sbjct: 183 EQAGEYTTAKLPMTKHVVVDGKLVTGQNPMSAA-------GVGEAVLELLD 226 >UniRef50_B7HF13 ThiJ/pfpI family protein n=61 Tax=Bacillus RepID=B7HF13_BACC4 Length = 220 Score = 87.1 bits (214), Expect = 4e-16, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 79/236 (33%), Gaps = 41/236 (17%) Query: 1 MKKIGVILSGC----GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 +K+I ++ + G G + E + ++ + +V I+ ++ Sbjct: 2 LKRILLVSTSAHDMNGHPTGLWLEELAAPYHVLKKAKFDVDIVSIKGGRVP-IDRVSI-P 59 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 R A+ RP++ ++ DA++ GG GA + + Sbjct: 60 NGIPREFKHVAS--LLQNTRPISNVHFSDYDAVLFGGGHGAIVDFPG----------NPY 107 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDIDTAE-------- 161 + L + M+ + + +C + L + F R+T T+ + Sbjct: 108 VANLIENMYNNNRIVAAVCHGVSSLIGVKNKDGSFFVAGKRITGYTNDEERAVHLEKRVP 167 Query: 162 -----VLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRV 212 L+E GA P V D ++T Q+ E I + V+++ Sbjct: 168 FLLESKLKEEGALFYVAPNFTPHVVVDGHLITGQ---NPQSSVEIGKAIKRAVNKL 220 >UniRef50_Q7M905 MONOPHOSPHATE SYNTHESISPROTEIN n=1 Tax=Wolinella succinogenes RepID=Q7M905_WOLSU Length = 185 Score = 86.7 bits (213), Expect = 4e-16, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 80/202 (39%), Gaps = 40/202 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M+K+ ++ + G E EAV + + R+GA+ V D + Sbjct: 2 MQKVVLV----PLAKGFEELEAVTIIDVLRRAGARVVVAGLD-----------------S 40 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +++ I + + +D +++PGG+G L++ ++++ Sbjct: 41 TDLVQSQGGIFIRPDSKMELIEPQNIDMIVLPGGWGGTVALAS----------HPLVRSM 90 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 +A+H+ KP+G +C AP L +I + T E ++ V+ +VV Sbjct: 91 VEALHKRQKPIGAICAAPYALSEIGVIEGQYTC--YPGIQEKIQR------GEFVESLVV 142 Query: 181 DEDNKIVT-TPAYMLAQNIAEA 201 + D+ + PA L +A Sbjct: 143 ESDHIFTSQGPATALPFALALV 164 >UniRef50_Q9ZV19 ProteaseI (PfpI)-like protein n=4 Tax=Arabidopsis thaliana RepID=Q9ZV19_ARATH Length = 398 Score = 86.7 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 66/219 (30%), Gaps = 32/219 (14%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 ++K ++L G D E +E ++ L + G C +P++ D + + Sbjct: 5 VQKSALLLCG----DYMEAYETIVPLYVLQSFGVSVHCVSPNRNAGDRCVMSAHDFLGLE 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + ++T D +I+PGG + D + L Sbjct: 61 LYTELVVDQLTLNA--NFDDVTPENYDVIIIPGG-----------RFTELLSADEKCVDL 107 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCP------ 174 ++ K + C + ML + ++E G E P Sbjct: 108 VARFAESKKLIFTSCHSQVMLMAAGILAGGVKCTAFESIKPLIELSGGEWWQQPGIQSMF 167 Query: 175 -VDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRV 212 + D V D + +M GI L+ + Sbjct: 168 EITDCVKDGN--------FMSTVGWPTLGHGIKLLLESL 198 Score = 70.5 bits (171), Expect = 4e-11, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 67/204 (32%), Gaps = 25/204 (12%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK +L G D E + + A+ G + P+K++ +V + + R Sbjct: 207 KKQASVLFLIG--DYVEDYGINVPFRALQALGCKVDAVTPNKKKGEVCA-TAVYDLEDGR 263 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + E + D ++VPGG ++ + AL Sbjct: 264 QIPAEKRGHNFFVTASWDDICVDDYDCVVVPGG-----------RSPELLVMNEKAVALV 312 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 ++ + K + +L + + +++ G E V ++ V Sbjct: 313 KSFAEKDKVFAAIGQGKLLLAATGVLKGKR-CASGKGMKVMVKVAGGEAV---MEKGCV- 367 Query: 182 EDNKIVTT------PAYMLAQNIA 199 D K+VT PA++ + A Sbjct: 368 TDGKVVTAASATDLPAFLFDLSTA 391 >UniRef50_C2EQY3 Possible transcriptional regulator n=10 Tax=Lactobacillales RepID=C2EQY3_9LACO Length = 197 Score = 86.7 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 63/200 (31%), Gaps = 36/200 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V+ + DG E E + + + R + + V+ +++ Sbjct: 4 MTKVAVVFA-----DGCEEVEGLSVVDVLRRLNVETDMVGLTSKDVNGDHNIKL------ 52 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + + D + PGG + NL + D +L+ L Sbjct: 53 -------------TCDKVVDDSLLDYDLVAFPGGMTGSANLRD----------DTKLRDL 89 Query: 121 AQAMHQAGKPLGFMCIAPAMLPK-IFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 H+ GK MC AP L + + T I+ E L++ H + Sbjct: 90 MVKRHEQGKWDAAMCAAPRALARYGVLDDAKFTCYPGIE-KECLKDQPNAHFSEEITVTD 148 Query: 180 VDEDNKIVTTPAYMLAQNIA 199 D+ PA A A Sbjct: 149 NDKKILTSRGPATAWAFAYA 168 >UniRef50_Q01V75 Intracellular protease, PfpI family n=15 Tax=Bacteria RepID=Q01V75_SOLUE Length = 177 Score = 86.7 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 55/189 (29%), Gaps = 36/189 (19%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KKI +++ + E L +GA V GE Sbjct: 6 KKIAILVDTL-----YQEMEVWYPLFRFQEAGATVVTIGAKA----------GETYGSKL 50 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 +++ +A AA+ D ++VPGG+ + Sbjct: 51 GYPVKSQ-------LSYDEARAADFDGVVVPGGYA-----------PDHIRRHAKANQFV 92 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 M+ GK + +C P +L + + + GAE ++VVD Sbjct: 93 HDMNAQGKLVASICHGPWVLCSAGGMLKGRKATSFFAIKDDVVNAGAEWSDA---EVVVD 149 Query: 182 EDNKIVTTP 190 + P Sbjct: 150 GNLVTSRKP 158 >UniRef50_UPI0001AEB94E ThiJ/PfpI domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB94E Length = 288 Score = 86.7 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 72/237 (30%), Gaps = 59/237 (24%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCF-----APDKQQVDVINHLTGEAM 57 + + G G ++E + A+ +G + V AP + VI Sbjct: 43 NVMQLADGKTEPTGYYLNEFGVPAKALVDAGYKLVLATPKGNAPSVDKKSVIPQYFDGGE 102 Query: 58 TETRNVLIEAARI-TRGEIRPLAQA---DAAELDALIVPGGFGAAKNLSNFASLGSECTV 113 ++ R++ A I + L++ E + + +PGG +L+N Sbjct: 103 SQMRDIQTFVASIEGIDDTLSLSEVIGQGLEEFEGVFIPGGHAPLIDLAN---------- 152 Query: 114 DRELKALAQAMHQAGKPLGFMCIAPAMLPKI----------------------FDFPLRL 151 + ++ + H GKP +C P L ++ Sbjct: 153 NPQVGEILSHFHSEGKPTAAICHGPIALLSGQSNPQAFELALKSGEQASSESWIYDGYKM 212 Query: 152 TIGTDID----------------TAEVLEEMGA--EHVPCPVDDIVVDEDNKIVTTP 190 TI + + A+ + + G E ++VVD + P Sbjct: 213 TIFSTPEEAYFESTLNDATLLYYPADAMAQAGGKMEFKGMWAPNVVVDRELITGQNP 269 >UniRef50_Q6F1K3 Putative intracellular protease/amidase n=1 Tax=Mesoplasma florum RepID=Q6F1K3_MESFL Length = 181 Score = 86.7 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 76/207 (36%), Gaps = 37/207 (17%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKK+ +IL + E EA++T+ + RS + + + V +H Sbjct: 1 MKKVAIIL-----HKNFEESEAIVTIDILRRSEIIVDIYNIENKDFQVGSH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N++++ + ++ D +++PGG G +E + L L Sbjct: 47 -NIIVKTE-------YNIQSLNSQNYDGIVIPGGPGV-----------NELFDNEILLNL 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + K + +C AP +L + I + + LE+ + +D +V Sbjct: 88 IKDFNDKNKMVSAICAAPQILGLAGIID-DIKIVKFPTSNKYLEKALVQEKDSIIDKNIV 146 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDK 207 + P A NI E G ++ Sbjct: 147 TGSSIGTVVP---FALNIIEYLQGKEQ 170 >UniRef50_Q0SC28 Possible transcriptional regulator n=14 Tax=Actinobacteria (class) RepID=Q0SC28_RHOSR Length = 241 Score = 86.7 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 72/225 (32%), Gaps = 47/225 (20%) Query: 1 MKKIGVILS---------GCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINH 51 M K+ +++ G G E + + +G P + ++ Sbjct: 1 MTKVLFVVTAADRWTLNDGTVHPSGYWAEELAVPHRIFTEAGWDITVATP-GGKAPTLDQ 59 Query: 52 ----LTGEAMTETRNV--LIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFA 105 ++G + R+V +++ L D+ E D + PGG G ++L+ Sbjct: 60 LSLGISGGMPWKRRDVKNYLDSIADVLSHPASLDSVDSDEYDLVFYPGGHGPMEDLA--- 116 Query: 106 SLGSECTVDRELKALAQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDID------ 158 D+ L +GKPL +C AP A+L G + Sbjct: 117 -------YDKTSGELLSHRLASGKPLALLCHAPAAVLAATNPDGTSAFAGYRMTGLSNRE 169 Query: 159 -------------TAEVLEEMGAEHVPCPVD-DIVVDEDNKIVTT 189 + L+E+GAE+ + V D + T Sbjct: 170 ELLNRFAKKAPWLLEDKLKEVGAEYSKGLIPLRPHVVVDRNLYTG 214 >UniRef50_B4D2S1 ThiJ/PfpI domain protein n=8 Tax=Bacteria RepID=B4D2S1_9BACT Length = 199 Score = 86.7 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 57/183 (31%), Gaps = 24/183 (13%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+ VI V D +E + + + G Q V AP+K++ ++ H + Sbjct: 15 PKVLVI-----VGDATETVDTLYPYYRLIEGGYQPVVAAPEKRKYQMVLH----EVKPGW 65 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 + E + + + GG D++L + Sbjct: 66 TITKEWEGYSIDAEIAFKDIQPEDYAGIFFSGG-----------RAPEYIREDQDLLRIT 114 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + + KP+ +C + + L + T LE G +V P V+D Sbjct: 115 RWFWENKKPMASVCHGVEIPARA-GIVKGLRMATVAKCQFDLEVCGGIYVNEP---CVID 170 Query: 182 EDN 184 Sbjct: 171 RHM 173 >UniRef50_Q486I7 DJ-1/PfpI family protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q486I7_COLP3 Length = 357 Score = 86.3 bits (212), Expect = 6e-16, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 70/216 (32%), Gaps = 41/216 (18%) Query: 3 KIGVILS-----GCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAM 57 KI +++ G G E E + +G +P + V+ + GE M Sbjct: 61 KILAVVTSVDKMGEDQDTGYEHTELARAYWVFTANGFSVDIASPKGGKPPVV--IDGEDM 118 Query: 58 TETRNVLIEAARITRG--EIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDR 115 + I + LA+ + + +A+ GG G + + + Sbjct: 119 GAYDYAFLNDDTIQQKVANSIALAEVNPNDYEAVYFVGGKGTMFDFPD----------NP 168 Query: 116 ELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTD--------------IDTA- 160 ++ +A+ ++Q K + +C P+ L + + + D Sbjct: 169 YVQNIAKTLYQNNKVVSAVCHGPSALVNVVLDNGEMLLSNKKVSGFTNNEELFLIPDAKQ 228 Query: 161 -------EVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 + L E GA+ V +D K++T Sbjct: 229 IFPFLLEDKLIEQGAQFQSGTTYLEKVTQDGKLITG 264 >UniRef50_Q8YYF7 Alr0893 protein n=3 Tax=Nostocaceae RepID=Q8YYF7_ANASP Length = 364 Score = 86.3 bits (212), Expect = 7e-16, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 63/190 (33%), Gaps = 38/190 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ +++ E E ++ + ++G + V + G T+ Sbjct: 10 KKVAILIEQA-----VEDAEFIIPCNGLKQAGFEVVVLGSRTNEK--YKGKRGRLSTQAD 62 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 EA A+E DA+++PGG + + Sbjct: 63 GTTTEAI--------------ASEFDAVVIPGGMA-----------PDKMRRNPNTVRFV 97 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 Q Q GK + +C P +L + + T I + + GA++V + +VV Sbjct: 98 QEAMQQGKLVAAVCHGPQVLIESDLLRGKQATGF--IAISRDMINAGADYVD---EALVV 152 Query: 181 DEDNKIVTTP 190 D + P Sbjct: 153 DGNLITSREP 162 >UniRef50_Q04P14 ThiJ/PfpI family protein n=5 Tax=Bacteria RepID=Q04P14_LEPBJ Length = 264 Score = 85.9 bits (211), Expect = 8e-16, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 55/176 (31%), Gaps = 25/176 (14%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDV-INHLTGEAMTETR 61 KI + L + E+ + + +G + P+ + LTG+ + + Sbjct: 6 KILIPLPSAD----FDPSESSIPWKILKENGYEVFFSTPNGKPGSADFRMLTGKGLGIWK 61 Query: 62 NVLI--EAARIT---------RGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSE 110 +LI + AR + + LI+PGG Sbjct: 62 PLLIAHKKARTAYNEMISDSHFQNPLSYKDLKPEDFEGLILPGGHAP---------GMKA 112 Query: 111 CTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEM 166 +EL+ + GKPLG +C + + +I T +L+ Sbjct: 113 YLESKELQEFVGSFFATGKPLGAICHGVVLAARSKIPGTDRSILYGKKTTALLKSQ 168 >UniRef50_Q29CB8 GA12322 n=5 Tax=Endopterygota RepID=Q29CB8_DROPS Length = 187 Score = 85.9 bits (211), Expect = 9e-16, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 83/216 (38%), Gaps = 38/216 (17%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K +I+ + G+E E V+ + R+G + E + + Sbjct: 1 MSKTALII----LAPGAEEMEFVIAADVLRRAGIKVTVAGLK----------DSEPVKCS 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R+V+I LA+A + D +++PGG G + + + A+ + L Sbjct: 47 RDVVI-------VPDTSLAKAACDKFDVVVLPGGLGGSNAMGDSAA----------VGDL 89 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 +A AG + +C AP +L K ++ + E L + ++ +V Sbjct: 90 LRAQESAGGLIAAICAAPTVLAKHGIA-AGKSLTSYPSMKEQLVD---KYCYVDDKSVVK 145 Query: 181 DEDNKIVTTP--AYMLAQNIAEAASGIDKLVSRVLV 214 D + P AY A IAE +G++K V V Sbjct: 146 DGNLITSRGPGTAYDFALKIAEELAGLEK-VKEVAK 180 >UniRef50_A9KDZ3 Protease I n=3 Tax=Coxiella burnetii RepID=A9KDZ3_COXBN Length = 172 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 54/173 (31%), Gaps = 34/173 (19%) Query: 19 IHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIRPL 78 E LL + GA+ P K Q V G +T Sbjct: 18 ELELWYPLLRMKEEGAEVTIVGPKKNQ--VYKSKLGYPVTAE---------------ATP 60 Query: 79 AQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAP 138 ++DALI+PGG+ + + + L +++ + K + +C A Sbjct: 61 DSVSPDKIDALIIPGGYA-----------PDKMRAHKAMTDLVRSVFERQKTVAAICHAA 109 Query: 139 AMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTP 190 +L + T + L G ++ PV V DE+ P Sbjct: 110 WVLISANIINGKKATCYH--TVKDDLINAGGIYLDQPV---VKDENLITSRQP 157 >UniRef50_B1WZ24 ThiJ/PfpI peptidase C56 family n=5 Tax=Cyanobacteria RepID=B1WZ24_CYAA5 Length = 363 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 56/188 (29%), Gaps = 36/188 (19%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ +++ E E + A+ ++ A+ V D G+ + Sbjct: 11 RVAILIENH-----FEDSEFQIPYTALKQANAEVVVLGSRMN--DTYKGKRGKVSIKPDA 63 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 E + + D +I+PGG + + L Sbjct: 64 TATE--------------VRSEDFDVIIIPGG-----------AAPDAIRANPNAVRLVM 98 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 K + +C P +L + + G + ++ GA ++ PV +V E Sbjct: 99 NGMAQNKLIAAICHGPQVLIEADQLRGKRATGF-QAIRKDMQNAGAIYIDEPV---IVQE 154 Query: 183 DNKIVTTP 190 + P Sbjct: 155 NLITARRP 162 >UniRef50_Q47L13 Intracellular protease/amidase, putative n=2 Tax=cellular organisms RepID=Q47L13_THEFY Length = 227 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 67/218 (30%), Gaps = 39/218 (17%) Query: 2 KKIGVILSGCGVY------DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE 55 K++ + L+ G G + EA ++G + + Q E Sbjct: 6 KRVLLALTSHGSLGDTGKPTGYYVPEAAHPWEVFRKAGYEVSFVSVKGGQPPAT-GYNAE 64 Query: 56 AMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDR 115 + + + + + A+ A+ GG G + + A L + Sbjct: 65 DEVQREFLEDPEVKKALADTPTADSLNPADYAAVYFVGGHGTMWDFPDAADLAALA---- 120 Query: 116 ELKALAQAMHQAGKPLGFMCIAPA-MLPKIFDFPLRLTIGTDIDT--------------- 159 + +++AG + +C PA ++ L G + + Sbjct: 121 ------RDIYEAGGVVAAVCHGPAGLVNLKLSDGRYLVEGKEFASFTNEEEDAVNLSGVV 174 Query: 160 ----AEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PA 191 LEE G HV P + V ++VT PA Sbjct: 175 PFLLQSKLEERGGIHVKKPKFESCVVVSERLVTGQNPA 212 >UniRef50_Q47YD7 DJ-1/PfpI family protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47YD7_COLP3 Length = 270 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 72/221 (32%), Gaps = 36/221 (16%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 K+ +I+S G + E V + ++ + +P + I + + N Sbjct: 40 KVLIIISSDQH--GFWLPEVVEPYKLLEQAEFEIDIASPKGGK--GIASGSSRLSGKDSN 95 Query: 63 VLIEAARITR-GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 +++ + + L Q + + A+ GG G +L ++E + + Sbjct: 96 WFKQSSLPEKLEQSIELKQVISRQYRAVYFAGGAGPMFDLVE----------NQEAQRVT 145 Query: 122 QAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEE--------------- 165 + +++ G + C PA L + RL G + +EE Sbjct: 146 REIYENGGIISADCHGPAALINVKLSDGSRLISGRKLTAKANIEEGRWAKNNYPFLLEDK 205 Query: 166 ---MGAEHVPCPVDDIVVDEDNKIVTT--PAYMLAQNIAEA 201 +G + V D +++T PA A Sbjct: 206 IVSLGGIYTSAAKGKEHVIVDGRLITGQNPASAAPMTKALI 246 >UniRef50_C7RHW9 Intracellular protease, PfpI family n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RHW9_ANAPD Length = 167 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 59/213 (27%), Gaps = 48/213 (22%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MKKI V+L E E + + R V Sbjct: 1 MKKIVVLLESL-----FEKSELIYPYHRL-REDFDVVLV------------------GSE 36 Query: 61 RNVLIEAARITRGEIRPLA-QADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + V + + + ++ +A +E D + +PG A Sbjct: 37 KYVEYPSKAGYKVKSDIISKEAYPSEFDGVYIPG-----------AYSPDGMRKHEATIN 85 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + H+ GK + +C P ++ + + + L GA+ V Sbjct: 86 FVKKFHENGKSIAAVCHGPWVVSDAGLLD-GVKASSTPTIKKDLINAGAKWEDRE----V 140 Query: 180 VDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRV 212 V +N I + + + + V + Sbjct: 141 VVYNNIITRR-------SPKDLPAHVKAFVDAL 166 >UniRef50_A8A9H6 Intracellular protease, PfpI family n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A9H6_IGNH4 Length = 176 Score = 85.2 bits (209), Expect = 1e-15, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 73/206 (35%), Gaps = 36/206 (17%) Query: 5 GVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVL 64 VIL G V + +E V+ G + + GE + R V Sbjct: 3 AVILVGPMV----DEYEVVVPYSLFKAYGFEVDIAS----------FKAGEEVVGKRGVK 48 Query: 65 IEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAM 124 +E + ++ A E DA+I+ GG+ + D +K + M Sbjct: 49 LE-----LVPNKSFSELKADEYDAVIIAGGYA-----------PDKVRRDENVKRFVREM 92 Query: 125 HQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDN 184 ++ GK + +C +L + G+ + L GAE PVD+ +V D Sbjct: 93 YEKGKLVLSICHGGWVLISAGVAKGKKVTGSK-GIWDDLRNAGAE----PVDEPLV-IDG 146 Query: 185 KIVTTPAYMLAQNIAEAASGIDKLVS 210 +V+ + ++ + + K + Sbjct: 147 NVVSVKTWREFDHLIKNFDKVLKAIK 172 >UniRef50_B1L3Q5 Intracellular protease, PfpI family n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L3Q5_KORCO Length = 172 Score = 85.2 bits (209), Expect = 1e-15, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 54/183 (29%), Gaps = 33/183 (18%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITR 72 V D + E + + G + P+ + L +A L EA Sbjct: 7 VEDLFDERELIYPFYRLKEMGFRVDLVGPEAKTYRSKLGLEVKADVSAEPELAEA----- 61 Query: 73 GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLG 132 D + +PGG+ +++ + + + GK + Sbjct: 62 -------------YDVIWIPGGYA-----------PDRLRRSKKIVEMVRRAVERGKIVA 97 Query: 133 FMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAY 192 +C AP +L R G + L GA V ++ VD + T P Sbjct: 98 AVCHAPWVLISAGVVKGRRVTGFH-SIWDDLRNAGANLVE---EEATVDGNIITGTGPDA 153 Query: 193 MLA 195 M Sbjct: 154 MPE 156 >UniRef50_A1ZP43 ThiJ/PfpI family n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZP43_9SPHI Length = 366 Score = 85.2 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 75/225 (33%), Gaps = 36/225 (16%) Query: 2 KKIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGE 55 KKI +++ G+ + E ++ G Q +P + + Sbjct: 141 KKILFVVTNHEKLGNTSKKAGTYLPEITYPFEVFNQKGYQVDFVSPKGGMLAINGIANAA 200 Query: 56 AMTETRNVLIEAARI-TRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVD 114 TR+ + R+ + Q D + + GG G + + + Sbjct: 201 VDETTRHFFRDKQRLNELRKTLSPDQVDINQYAGIYYVGGKGTMWDFPD----------N 250 Query: 115 RELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID--------- 158 ++L+ + M+++ K +G +C P+ L + + T ++ + Sbjct: 251 KKLQKITAKMYESNKVVGAVCHGPSGLLNVKLSDGSYLLAGKKATGYSNAEDAKIKHILP 310 Query: 159 --TAEVLEEMGAEHVPCPVDDI-VVDEDNKIVTTPAYMLAQNIAE 200 + L+E G ++ V +++VT A +AE Sbjct: 311 FLLEDRLKERGVKYSKATKKQAKHVVVSDRLVTGQNPASAAGVAE 355 >UniRef50_D1VVY7 DJ-1 family protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVY7_9FIRM Length = 193 Score = 85.2 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 61/192 (31%), Gaps = 40/192 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK + V L+ DG E EA+ + + R G + + +H Sbjct: 1 MKDLLVFLA-----DGFEEVEALSVVDILRRGGLSVDTCSIKDSKKVTSSH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 ++T + A +PGG A NL N DR + + Sbjct: 47 --------QVTVLADVHIDDIKIDNYKACYIPGGQPGATNLQN----------DRRIIQI 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLP-KIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + + GK + +C P +L + T ++ ++ +V Sbjct: 89 VEMFKEQGKLVAAICAGPQVLDTAGVLTDEKFTCYPGVEERLKTKK-------RLDVPVV 141 Query: 180 VDEDNKIVTTPA 191 VD++ PA Sbjct: 142 VDDNIITAMGPA 153 >UniRef50_Q97KC8 Putative intracellular protease n=1 Tax=Clostridium acetobutylicum RepID=Q97KC8_CLOAB Length = 169 Score = 85.2 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 56/188 (29%), Gaps = 38/188 (20%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITR 72 V DG E E + L + +G T ++ Sbjct: 8 VEDGFEDIELLYPLYRLREAGYDVTLVG-----------------TGSKYNYTGTHGYIV 50 Query: 73 GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLG 132 + D D +I+PGG + K + ++K + + M + K L Sbjct: 51 DVDASAEEIDENNYDGVIIPGGSASYK-----------LRTNDDVKRIIRHMKEKDKLLA 99 Query: 133 FMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKI----V 187 +C ++ ++T I + + G E+V VD++ V Sbjct: 100 VICDGNYVLISAKVLSGHKVTCTEAIS--DDVINAGGEYVRIGN---CVDKNIITAKMQV 154 Query: 188 TTPAYMLA 195 P +M Sbjct: 155 NLPQFMSD 162 >UniRef50_A4WJ08 ThiJ/PfpI domain protein n=1 Tax=Pyrobaculum arsenaticum DSM 13514 RepID=A4WJ08_PYRAR Length = 158 Score = 84.8 bits (208), Expect = 2e-15, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 46/132 (34%), Gaps = 24/132 (18%) Query: 70 ITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGK 129 + R + LA+ E D L++PG ++K + + + + Sbjct: 37 LFRWVTKTLAEVKPEEYDGLVIPG---------RRVPEYVRVVASGDVKRVVRHIFERNT 87 Query: 130 PLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 P+ +C APA + + + I E G V ++VVD +VT Sbjct: 88 PVAAICYAPATARVV----KGREVTSHIAVGPEAENNGGIWVD---QEVVVD--GNLVTA 138 Query: 190 ------PAYMLA 195 PA+M Sbjct: 139 RAWLDNPAWMRE 150 >UniRef50_Q49YS0 Uncharacterized protein SSP0918 n=14 Tax=Bacilli RepID=Y918_STAS1 Length = 172 Score = 84.8 bits (208), Expect = 2e-15, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 64/191 (33%), Gaps = 39/191 (20%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ +IL+ E E AI +G + V V H T A+ Sbjct: 3 KKVAIILTNE-----FEDIELTSPKEAIEEAGHETVVIGDQANSEVVGKHGTKVAVD--- 54 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELK--A 119 +A A + D L++PGGF D E + Sbjct: 55 --------------VSIADAKPEDFDGLLIPGGF-----------SPDHLRGDAEGRYGT 89 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 A+ + P +C P +L D T+ ++ + L GA+ V + +V Sbjct: 90 FAKYFTKNDVPAFAICHGPQILIDT-DDLNGRTLTAVLNVRKDLANAGAQVVD---ESVV 145 Query: 180 VDEDNKIVTTP 190 VD++ TP Sbjct: 146 VDKNIVTSRTP 156 >UniRef50_B4SJZ4 ThiJ/PfpI domain protein n=1 Tax=Stenotrophomonas maltophilia R551-3 RepID=B4SJZ4_STRM5 Length = 376 Score = 84.4 bits (207), Expect = 2e-15, Method: Composition-based stats. Identities = 43/229 (18%), Positives = 76/229 (33%), Gaps = 38/229 (16%) Query: 2 KKIGVILSGCG-----VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 ++ +++S G + G E+ E L + R+G + +P V+ + E Sbjct: 31 PRVLLVVSSEGRDQGRIRPGFEMDEFAQAWLILRRNGFEIDVASPRGGAVEADKYNAAEP 90 Query: 57 MTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 A R P AQ A + ++V GG GA +L D Sbjct: 91 FNAAVLADPLAVR-ALAATLPTAQLRAGDYRGVLVIGGKGAMFDLP----------ADSA 139 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLP-------KIFDFPLRLTIGTDIDTA--------- 160 L+ + + G + +C PA L + +T ++ + A Sbjct: 140 LQRTIATIWEQGGVVAAVCHGPAALAGIRLGNGRALVEGRSMTGFSEEEEALFGKRWAKE 199 Query: 161 ------EVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAAS 203 + E+GA P+ V D ++VT +AEA Sbjct: 200 FAFQLEPRMRELGARWQEAPLMMPKVVVDGRLVTGQNPYSTPVLAEAFV 248 >UniRef50_Q15SH5 ThiJ/PfpI n=14 Tax=Gammaproteobacteria RepID=Q15SH5_PSEA6 Length = 224 Score = 84.4 bits (207), Expect = 2e-15, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 74/223 (33%), Gaps = 38/223 (17%) Query: 1 MKKIGVILSGCGVY------DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTG 54 MK + +IL+ +G+ E L A +G + + + V Sbjct: 1 MKNVLIILTNHATLGTTDEANGTFSPELTHALHAFLEAGYRYDLVSIKGGEAPVYGVDME 60 Query: 55 EAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVD 114 + + E + + + A+ DA+ PGGFG +L++ D Sbjct: 61 DDKINSDVFKSEDLASKLSNTKKASDVNPADYDAVFYPGGFGLLSDLAD----------D 110 Query: 115 RELKALAQAMHQAGKPLGFMCIAPAML-------PKIFDFPLRLTIGTDID--------- 158 + L ++ +AG +G +C PA L K + +T T + Sbjct: 111 ENVAKLTASIFEAGAVVGAVCHGPAGLLPIKLSNGKYLIEDIMVTGFTREEEVEYDTINK 170 Query: 159 ----TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PAYMLA 195 E L ++ V + ++++T PA A Sbjct: 171 IPFLLEEALTRRAGQYAKIAPWGEYVVKQDRVITGQNPASAGA 213 >UniRef50_B8CX62 DJ-1 family protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CX62_HALOH Length = 181 Score = 84.4 bits (207), Expect = 2e-15, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 73/215 (33%), Gaps = 39/215 (18%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 KI + L+ +G E EA+ ++ + R+G + + + + + +H Sbjct: 2 KILIPLA-----EGFEEIEAITSIDVLRRAGIEVITSSLTESTEVMGSH----------- 45 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 + L + LD +++PGG + NL + D + L + Sbjct: 46 ------DVKVTADTTLDKVSVDNLDGILLPGGMPGSANLKD----------DIRIIKLIK 89 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 +++ + +C AP +L K + + + ++ + + +VVD Sbjct: 90 RLNKKSGLIAAICAAPIVLEKAGVIKEKR-ATSYPGFDKEMKTCNYQ-----ENRVVVDG 143 Query: 183 DNKIVTTPAYMLAQNIAEA-ASGIDKLVSRVLVLA 216 + P + + + +V + Sbjct: 144 NIITGRGPGVAMEFALTVVNYLTSEDMVKELSEKM 178 >UniRef50_Q1GFT3 ThiJ/PfpI n=5 Tax=Proteobacteria RepID=Q1GFT3_SILST Length = 226 Score = 84.4 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 73/229 (31%), Gaps = 42/229 (18%) Query: 1 MKKIGVILSGC------GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLT- 53 M +I ++ + G G E A +G + + + V + + Sbjct: 1 MARILILSTAADVLGDTGKPTGVWYEELATPYYAFLDAGHEVTLVTLEGKPVPIDPNSDE 60 Query: 54 ---GEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSE 110 + TR AA + L D DAL +PGG GA +L+ A+ + Sbjct: 61 TGDAAPASVTRFRADAAATALLAKPGRLEDEDVTAYDALYIPGGHGAMYDLAESATAAAA 120 Query: 111 CTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFD-------FPLRLTIGTDID----- 158 +GK + +C PA K+ D ++T T+ + Sbjct: 121 ----------IGKAWDSGKVVASVCHGPAAFAKVVDAKGEPIVKGRKVTAFTNSEETAVG 170 Query: 159 --------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT--PAYMLAQN 197 L E+GAE + D ++VT PA A Sbjct: 171 LEKAVPFLLETKLRELGAEFENVADWQPLAVADGQLVTGQNPASSEAAA 219 >UniRef50_C3XNX9 Putative uncharacterized protein n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XNX9_9HELI Length = 191 Score = 84.4 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 77/205 (37%), Gaps = 32/205 (15%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ V+++ G E EA+ + + R+G A + + DV++ N Sbjct: 2 RVSVLVALAK---GFEELEAISVIDVLRRAGCDV-IVAKVESKNDVLD----------SN 47 Query: 63 VLIEAARITRGEIRP-LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 +++E+ + + L+ D LD ++ PGG+ +NL LK + Sbjct: 48 LIVESQKGVKIVADKFLSAVDCECLDGIVFPGGWEGTQNL----------IASSSLKEVL 97 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + ++ G+ + +C AP L F + G +E+M +++ D Sbjct: 98 EKLNAKGRIIAAICAAPLAL-----FKHGILKGQAFTCYPSIEKMIENPQYKTDSNVIQD 152 Query: 182 EDNKIVTTPAYMLAQN--IAEAASG 204 + PA L +A G Sbjct: 153 GNLITSRGPATALEFAFYLASVFVG 177 >UniRef50_C7HVF0 DJ-1 family protein n=1 Tax=Anaerococcus vaginalis ATCC 51170 RepID=C7HVF0_9FIRM Length = 208 Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 70/210 (33%), Gaps = 42/210 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M + V L+ DG E EA+ + R+ + V +H Sbjct: 15 MDRFLVFLA-----DGFEEIEALTLVDYFRRADIFVDMVSVGNDLFVVGSH--------- 60 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +++++A R+ D + +PGG AKNL + D+ + + Sbjct: 61 -DIIVKADRLIDDIDLDF-------YDGIYIPGGSLGAKNLRD----------DKRVIDI 102 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + K + +C P++L + + + L +G + Sbjct: 103 VKKFDEEKKIICAICAGPSVLDRA-GIVSDRNLVCHPSVEKSLLNVGNIKSDKL-----I 156 Query: 181 DEDNKIVTT----PAYMLAQNIAEAASGID 206 D I T+ + LA + E GID Sbjct: 157 VMDGNIFTSRGAGASVFLALKLIEMIKGID 186 >UniRef50_A6CIJ2 Predicted intracellular protease/amidase, ThiJ/PfpL family protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIJ2_9BACI Length = 225 Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats. Identities = 40/180 (22%), Positives = 66/180 (36%), Gaps = 31/180 (17%) Query: 1 MKKIGVILSGCGVYD-------GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLT 53 M K+ +LS G D G E L A+ ++G Q +P + V++ ++ Sbjct: 1 MAKVLAVLS-SGYKDEENNYETGWWGEELFAPLEALEKAGHQVDIASPLGGK-PVVDQVS 58 Query: 54 GEAMTETRNV---LIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSE 110 + L E+ R E + L + A E D ++V GG GA +L+ Sbjct: 59 FLPDYDPEGTYKALYESGRA--DETQKLTEVLAEEYDVVLVVGGHGAMYDLAK------- 109 Query: 111 CTVDRELKALAQAMHQAGKPLGFMCIAPAML-------PKIFDFPLRLTIGTDIDTAEVL 163 D +L + ++ G + C PA L K ++T D E L Sbjct: 110 ---DEDLHRIINTVYDNGGIVAAECHGPAPLIWTLRPDGKSIIEGKKVTGYPDEIEPEGL 166 >UniRef50_B6YYX0 Intracellular protease 1 n=1 Tax=Pseudovibrio sp. JE062 RepID=B6YYX0_9RHOB Length = 214 Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 65/203 (32%), Gaps = 21/203 (10%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M ++++G D E +E ++ + + G + P K++ D + + Sbjct: 1 MSSKILLITG----DFGEDYEVIVPFMLLKAIGYEVHVACPQKREGDTVASSIHDINKNY 56 Query: 61 RNVLI-EAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 + V E RI L + + AL++PGG ++ Sbjct: 57 QTVTEWEGHRININ--VALDHLNPTDYIALVLPGG-----------RSCEYLRTYPIVRE 103 Query: 120 LAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + KP+ +C +L T+ + ++ G + + +V Sbjct: 104 VTAHFITENKPIASICRGVQIL-LSTGLLKGKTMTGNFVCETEVQMAGNTYEKLNYEGVV 162 Query: 180 VDEDNKIVTTPAYMLAQNIAEAA 202 VD IV + +A Sbjct: 163 VD--GNIVYGVEWHGLWAWMQAF 183 >UniRef50_C8KYS8 Type 1 glutamine amidotransferase n=1 Tax=Actinobacillus minor 202 RepID=C8KYS8_9PAST Length = 288 Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 64/246 (26%), Gaps = 69/246 (28%) Query: 3 KIGVILS---------GCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVI-NHL 52 KI ++ S G G + E ++G + V PD + N L Sbjct: 31 KILLVASSANELTFKDGRKHPTGYYLPELSTPAQEFIKAGYEVVVATPDGNTPALDKNSL 90 Query: 53 TGEAMTETRNVLIEAARITRGEIR--------PLAQADAAELDALIVPGGFGAAKNLSNF 104 T T+ + L +A R + + DAL VPGG +L Sbjct: 91 TESLFTDGKTGLDQAMRFVLTHPSMQKPKRLGDVVKNGLGSFDALYVPGGHAPMIDLMQ- 149 Query: 105 ASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFD------------------ 146 + +L + H+ K +C P Sbjct: 150 ---------NADLGKALKHFHENNKVTAMLCHGPIAFSAAMKNSKAFRQAMVNGNHEQAK 200 Query: 147 --------FPLRLTIGTDID---------------TAEVLEEMGAEHVPCPVDDIVVDED 183 +T+ + + + L GA D V D Sbjct: 201 KLASDWPYKGYNMTVYSTQEEYGVEDWLKAKIEFYMEDALRNAGANVTVGKPDQPYVVVD 260 Query: 184 NKIVTT 189 ++VT Sbjct: 261 RELVTG 266 >UniRef50_B3T0X6 Putative DJ-1/PfpI family protein n=1 Tax=uncultured marine microorganism HF4000_007I05 RepID=B3T0X6_9ZZZZ Length = 175 Score = 83.6 bits (205), Expect = 4e-15, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 59/197 (29%), Gaps = 33/197 (16%) Query: 6 VILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLI 65 +I+SG D HE + + + ++ + V I + Sbjct: 5 IIISGALAQD----HEFIYPFYRLLEAESKLDVCLIGGKSVQGILGTNLPPTKDY----- 55 Query: 66 EAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMH 125 ++ + Q + D L++PGG + D+ H Sbjct: 56 --------PVKDINQVKVNDYDLLVLPGG----------VKALEKTRQDKRFIKFIADFH 97 Query: 126 QAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNK 185 +A K + +C +L + G + + G + P VVD K Sbjct: 98 KADKVIACICSGVQLLISAKIIKGKKIAGYY-SLEDDIVNAGGIYTDQP---AVVDS--K 151 Query: 186 IVTTPAYMLAQNIAEAA 202 IVTT Y AA Sbjct: 152 IVTTAHYKHMGPWMRAA 168 >UniRef50_B9W435 Putative DJ-1 family protein (Fragment) n=1 Tax=Histomonas meleagridis RepID=B9W435_9EUKA Length = 186 Score = 83.6 bits (205), Expect = 4e-15, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 65/211 (30%), Gaps = 38/211 (18%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 MK + +I G E EA+ + R+G + + + + H Sbjct: 1 MKAVILIA------PGFEEVEAITPADFLRRAGVEVILAS--------VGHDDLSIKG-- 44 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 A I + DA+I PGG NL+ D + Sbjct: 45 ------AHNIVIKCNAKFPEISKNIYDAIICPGGLPGTTNLAK----------DANVVEA 88 Query: 121 AQAMHQAGKPLGFMCIAP-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 +A AGK + +C AP +L + T + + E G V D + Sbjct: 89 IKAHLAAGKIVAAICAAPGFVLAEACGIMNGKTGCGYPGCDDKITENGGTKVE---DRVY 145 Query: 180 VDEDNKIVTTP--AYMLAQNIAEAASGIDKL 208 D + P A + A I G +K Sbjct: 146 ADGNIITSRGPGTASLFALEILRKLVGNEKA 176 >UniRef50_C5VYK3 DJ-1/PfpI family protein n=6 Tax=Streptococcus suis RepID=C5VYK3_STRSE Length = 179 Score = 83.6 bits (205), Expect = 4e-15, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 65/202 (32%), Gaps = 42/202 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M ++ V+L+ DG E EA+ ++ R+ D +V+ + + Sbjct: 1 MTRVAVVLA-----DGFEEIEALASVDVFRRAHFDCQIVGLDSHEVEGSHGI-------- 47 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R + + D + D +++PGG + +L + L Sbjct: 48 -----------RVQTDQVFDGDLSSFDLIVLPGGMPGSVHLRD----------SESLITE 86 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKI-FDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 Q +GK + +C AP +L K T + + + H+ +D+V Sbjct: 87 LQRAVASGKSVAAICAAPIVLDKAGLLDSRHYTCFPGKE--KDIPS--GIHLE---EDVV 139 Query: 180 VDEDNKIVTTPAYMLAQNIAEA 201 VD L Sbjct: 140 VDGPIITSRGAGTSLDFAYKLV 161 >UniRef50_C2AV24 Putative intracellular protease/amidase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2AV24_TSUPA Length = 230 Score = 83.6 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 75/242 (30%), Gaps = 44/242 (18%) Query: 1 MKKIGVILSG------CGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTG 54 M I V+ + G G + E + A+ +G + D V + G Sbjct: 1 MSNILVVTTSVPSYRVSGRRTGLWLGELTHFVDAVEAAGHTTTIASIDGGFVPIDPESLG 60 Query: 55 EA---MTETRNVLIEAARITR-GEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSE 110 TR + A + R L DA++ DA+ + GG G + + Sbjct: 61 HEVLAQGGTRARYDDPAFMARIANTPSLNDIDASDFDAIYLTGGHGVMFDFPD------- 113 Query: 111 CTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKI-------FDFPLRLTIGTDID----- 158 D L L + +AGK + +C A L R++ + + Sbjct: 114 ---DARLATLLREFDEAGKVVSAVCHGTAGLLGATKADGAPLIAGRRISGFSWNEEVLAG 170 Query: 159 --------TAEVLEEMGAEHVPCP-VDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLV 209 E + E GA ++ D +VT + A G+ L+ Sbjct: 171 LDAIVPFNLEERIAERGATYIEADEAWAPFAVTDGNVVTGQ---NPASAHPVAQGVLTLL 227 Query: 210 SR 211 + Sbjct: 228 DK 229 >UniRef50_B1YFC6 ThiJ/PfpI domain protein n=4 Tax=Bacillales RepID=B1YFC6_EXIS2 Length = 220 Score = 83.6 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 66/214 (30%), Gaps = 40/214 (18%) Query: 2 KKIGVILSGC-----GVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEA 56 K + ++++ G G + E +A + G +P+ + + Sbjct: 3 KHVLMVVTNASEMKEGHATGIWLSEFGEAYVAFQKEGYTITVASPNGGLSP----IDARS 58 Query: 57 MTETRNVLIEAARITRGEIRPLAQA-DAAELDALIVPGGFGAAKNLSNFASLGSECTVDR 115 + + I+A L D + DA+ +PGG G +L + Sbjct: 59 LEDEVPADIQATAPLLENTLDLESISDFSAFDAIFMPGGHGTMFDLPH----------SD 108 Query: 116 ELKALAQAMHQAGKPLGFMCIAPA-MLPKIFDFPLRLTIGTDIDTAEV------------ 162 L + + +AGK + +C PA ++ L G I T Sbjct: 109 ALNHALRTLFEAGKTVAAVCHGPAGLVSATLTDGTPLVAGKTIATFTDEEERATGLDIYM 168 Query: 163 -------LEEMGAEHVPCPVDDIVVDEDNKIVTT 189 L E+GA + D +VT Sbjct: 169 PFLLETRLRELGANIIVADNFTENFQVDGNLVTG 202 >UniRef50_C4KYU5 ThiJ/PfpI domain protein n=1 Tax=Exiguobacterium sp. AT1b RepID=C4KYU5_EXISA Length = 235 Score = 83.2 bits (204), Expect = 5e-15, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 69/225 (30%), Gaps = 46/225 (20%) Query: 1 MKKIGVILSGCGVYD------GSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVI----- 49 MK + +++S V G + E +S +G Q +PD +V+ Sbjct: 1 MKSVLLVVSNPTVSTTTNWPVGFWLSELTHPYDVLSATGHQLTIASPDGGKVEWDALSDP 60 Query: 50 ---NHLTGEAMTETRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFAS 106 + + + + ++ D + DA++V GG + Sbjct: 61 THESGYSKHDTLSLKYLDDADFLALLENTPKISDLDLSGFDAILVAGGQAPMFTFQDA-- 118 Query: 107 LGSECTVDRELKALAQAMHQAGKPLGFMCIAPAML------PKIFDFPLRLTIGTDID-- 158 LK +A + +GKP +C ++L + F +T T+ + Sbjct: 119 --------DSLKEAFEAFYASGKPSIALCHGTSLLLYLQEENRPFVEGKVMTGFTNEEED 170 Query: 159 --------------TAEVLEEMGAEHVPCPVDDIVVDEDNKIVTT 189 E ++GA V D ++T Sbjct: 171 QADASVGMKVMPFRIEEEAIKLGAHFKKAEPWTPYVVRDGHLITG 215 >UniRef50_C5EU56 DJ-1 family protein n=3 Tax=Clostridiales RepID=C5EU56_9FIRM Length = 194 Score = 83.2 bits (204), Expect = 5e-15, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 59/189 (31%), Gaps = 40/189 (21%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ L+ +G E E + + + RSG + + ++ +H Sbjct: 11 MAKVYAFLA-----EGLEEVECLAVVDVLRRSGVEVTMVSVSGKKEVTGSH--------- 56 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 I +AD + D L +PGG NL R L+ Sbjct: 57 --------GIRLMADALFEEADPDQADVLFLPGGMPGTNNL----------REHRGLREA 98 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + ++ G+ + +C AP++L T E L G + V Sbjct: 99 IERANKQGRRVAAICAAPSVLG-AMGLLKGRTATCYPGFEEQLT--GVSYTSQG-----V 150 Query: 181 DEDNKIVTT 189 D I T Sbjct: 151 VTDGNITTG 159 >UniRef50_Q16AF6 Protease, putative n=10 Tax=Alphaproteobacteria RepID=Q16AF6_ROSDO Length = 203 Score = 82.8 bits (203), Expect = 6e-15, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 60/191 (31%), Gaps = 19/191 (9%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 K+I +++ + SE +E + A+ G P+ ++ D + + Sbjct: 4 KRILMLI-----GEYSEEYEIFVVQQAMEAVGHTVHIICPETKKGDRVT-TSVHDFGPGV 57 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 E Q D ++ D++ + GG G ++ + Sbjct: 58 MTWTEHKGHGIEVDVDFDQVDTSDYDSVYIAGGRG-----------PEYIRTYPRVREIV 106 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVD 181 + H+ KP+ +C +L + + + + A +V + Sbjct: 107 REFHRDDKPIASICHGLQVLIAVPEVIAGKKVSGLFTVEPEVALTDATYVKIGPKAAL-- 164 Query: 182 EDNKIVTTPAY 192 D +VT + Sbjct: 165 RDGNLVTAEGW 175 >UniRef50_A6DTE4 Putative intracellular protease/amidase, ThiJ family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTE4_9BACT Length = 183 Score = 82.8 bits (203), Expect = 7e-15, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 57/180 (31%), Gaps = 36/180 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M ++ V+ + +G E EA+ + + R+ V Sbjct: 1 MPRLAVVFA-----EGFEEIEAITIVDVLRRAQIDVDMVGLRSLSVKG------------ 43 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + I + L + D D +++PGG + L + + +++ Sbjct: 44 ------SHDIEVKVEKLLKEVDPNLYDGVVLPGGLPGSFKLRD----------NEDVQNF 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 +A + GK +C AP L K + + + + P +D VV Sbjct: 88 IRAFN--GKLQAAVCAAPIALQKA-GALEGRHVTSHPSMKDEFSKQLYLESPAVIDGKVV 144 >UniRef50_A6WAG0 ThiJ/PfpI domain protein n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6WAG0_KINRD Length = 232 Score = 82.8 bits (203), Expect = 8e-15, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 51/192 (26%), Gaps = 38/192 (19%) Query: 3 KIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRN 62 ++ ++ + D E E G P + + + Sbjct: 63 RVAILTA-----DRVEDVEFFYPYYRFVEEGYAVDVITPSGGALTGYKGMGLKET----- 112 Query: 63 VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ 122 L Q D + D L +PGG E D A Q Sbjct: 113 -------------IALDQVDPRDYDLLFIPGG-----------LAPGELRRDPRAIAFVQ 148 Query: 123 AMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDE 182 A+ G +G +C P +L ++ + + A + G ++ +V D Sbjct: 149 AVAGWGTTIGAVCHGPQVLVDA-GLVAGRSMTSWHEVAPEITAAGGTYLD---QALVEDG 204 Query: 183 DNKIVTTPAYML 194 P M Sbjct: 205 QFITSRKPGDMP 216 >UniRef50_A6C613 Intracellular proteinase pfpI n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C613_9PLAN Length = 178 Score = 82.5 bits (202), Expect = 8e-15, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 59/192 (30%), Gaps = 38/192 (19%) Query: 4 IGVILSGCGVY----DGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTE 59 + + LSG + E E L + +GA+ + + + H Sbjct: 3 VSLPLSGQKFLLFTGEDYEDLELWYPKLRLEEAGAETTLAGQEAGKTYLGKHGYPSVSEA 62 Query: 60 TRNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKA 119 T + ++A+ +I GG+ + + ++ + Sbjct: 63 T-----------------IDSINSADYHGVICAGGW-----------MPDKLRRSEKVLS 94 Query: 120 LAQAMHQAGKPLGFMCIAPAM-LPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDI 178 L Q H++ K + +C M + +++T + L GA PV Sbjct: 95 LLQEFHESEKLIAAICHGGWMPISAGIYSGVKVTGS--PGIKDDLINAGAVWEDAPV--- 149 Query: 179 VVDEDNKIVTTP 190 VVD P Sbjct: 150 VVDRHFVCSRRP 161 >UniRef50_D1PPH9 ThiJ/PfpI family protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PPH9_9FIRM Length = 184 Score = 82.5 bits (202), Expect = 9e-15, Method: Composition-based stats. Identities = 47/204 (23%), Positives = 70/204 (34%), Gaps = 40/204 (19%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K + + G E E +L + + R+G + A Q +H Sbjct: 1 MSKAVIFFA-----PGLEECEGLLCVDLLRRAGVEVTIAAVGGSQTVTSSH--------- 46 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 +V I A + Q D + DA I+PGG NL D ++ + Sbjct: 47 -HVNIVADALA-------EQVDYSAYDACILPGGIPGVNNL----------KADATVRKV 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPK-IFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 Q AGK + +C P +L + T+ L E GAE+ P+ Sbjct: 89 CQDYAAAGKTVAAICAGPTVLASFGVLNGKKATV--YPGMYGALTEGGAEYTGLPLT--- 143 Query: 180 VDEDNKIVTTPAYMLAQNIAEAAS 203 D IVT A A A A + Sbjct: 144 --IDGNIVTGEALGAAIPFALALA 165 >UniRef50_Q15TQ8 ThiJ/PfpI n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15TQ8_PSEA6 Length = 186 Score = 82.5 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 60/191 (31%), Gaps = 34/191 (17%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK IL+ DG E E + T A+ +G + + + N Sbjct: 8 KKTAAILA----KDGFEQCELIETRDALIEAGVDVHIVSLEPGTIIGWNGSKWG------ 57 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 I + +++ A + DALI+PGG DR+ Sbjct: 58 --------IEVDVDKVVSKVSADDYDALILPGG----------LFNPDALLQDRDAVDFV 99 Query: 122 QAMH--QAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 +A KP+ + ML + D + + L GA+ V +V Sbjct: 100 KAFFVDAKLKPVVAINQGTWMLLEA-DVLRNRLVASFPTVLNRLRNAGAKVVDRD---LV 155 Query: 180 VDEDNKIVTTP 190 VD+ + Sbjct: 156 VDQGLYTSRSS 166 >UniRef50_C4ZGF1 4-methyl-5(Beta-hydroxyethyl)-thiazole monophosphate synthesis protein n=2 Tax=Clostridiales RepID=C4ZGF1_EUBR3 Length = 181 Score = 82.1 bits (201), Expect = 1e-14, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 70/203 (34%), Gaps = 42/203 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+G+ ++ DG E E + + + R+ + + ++ +H T Sbjct: 1 MIKVGIFMA-----DGCEEIEGLTVVDIVRRAKLEIETISITEKAEVTSSHQVTFKTDTT 55 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 + AQAD DA+++PGG NL D + Sbjct: 56 K-----------------AQADFDSYDAIVLPGGMPGTLNL----------GADETVVKT 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEE--MGAEHVPCPVDDI 178 + GK + +C AP++L + + G EE +GA+ + PV Sbjct: 89 IKRFAAEGKLVAAICAAPSVLGE-----NHILEGKKATCHPGFEEKLLGAQWLEQPV--- 140 Query: 179 VVDEDNKIVTTPAYMLAQNIAEA 201 VVD + +A + Sbjct: 141 VVDGNVITSRGMGTAIAFALELV 163 >UniRef50_C3WFG1 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis enzyme n=3 Tax=Fusobacterium RepID=C3WFG1_FUSMR Length = 183 Score = 82.1 bits (201), Expect = 1e-14, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 68/176 (38%), Gaps = 33/176 (18%) Query: 2 KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETR 61 KK+ V+L+ +G E+ EA+ + + R GA+ V + +T+ R Sbjct: 3 KKVYVLLA-----EGFELIEAMTPVDVLRRGGAEVVTVS----------------ITDNR 41 Query: 62 NVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALA 121 V A ++ L + D + D +I+PGG+ NL +E+ + Sbjct: 42 EV-TSAQKVPVISDTTLKEKDITDGDMIILPGGYPGYVNLGE----------SQEVGKVL 90 Query: 122 QAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDD 177 + + K +G +C AP +L K + L + E ++ D+ Sbjct: 91 KYYVENNKFVGAICGAPTVLAKN-EVFLGKELTCHSSVVEEMKRYNYNGSKAFTDE 145 >UniRef50_A8RKB7 Putative uncharacterized protein n=2 Tax=Clostridium RepID=A8RKB7_9CLOT Length = 191 Score = 82.1 bits (201), Expect = 1e-14, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 56/189 (29%), Gaps = 40/189 (21%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ L+ DG E E + + + RSG + + + +H Sbjct: 8 MAKVYAFLA-----DGLEEVECLAVVDVLRRSGVEVTLVSVTGDRKVTGSH--------- 53 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 I G + D L +PGG NL L+A Sbjct: 54 --------GIELGTDALFEDVNPDVADVLFLPGGMPGTNNL----------KAHMGLRAA 95 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + ++ G+ + +C AP++L T + L G + V Sbjct: 96 VECANKQGRRIAAICAAPSILG-SMGLLKGRTATCYPGFEDQLT--GVSYTSQG-----V 147 Query: 181 DEDNKIVTT 189 D I T Sbjct: 148 VTDGNITTG 156 >UniRef50_A9KSW4 DJ-1 family protein n=14 Tax=Clostridiales RepID=A9KSW4_CLOPH Length = 181 Score = 81.7 bits (200), Expect = 1e-14, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 66/202 (32%), Gaps = 41/202 (20%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K+ V L+ DG E EA+ + + R+G + T Sbjct: 1 MAKVYVFLA-----DGFEEIEALTVVDLLRRAGVDVTTVSI------------------T 37 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 N L+ A L + D +E D L++PGG +NL + LK L Sbjct: 38 ENNLVHGAHGIDVMADILFKDDLSEADMLVLPGGGLGTRNLLD----------HEGLKDL 87 Query: 121 AQAMHQAGKPLGFMCIAPAMLP-KIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIV 179 + G+ L +C AP++L R + L GA D +V Sbjct: 88 LIDYEKKGRYLAAICAAPSILGTHGLLKGKRAIC--YPGFEDKLT--GAVVTN---DKVV 140 Query: 180 VDEDNKIVTTPAYMLAQNIAEA 201 VD + ++ Sbjct: 141 VDGKIITSKGAGTSIEFSLELI 162 >UniRef50_A8SLZ0 Putative uncharacterized protein n=1 Tax=Parvimonas micra ATCC 33270 RepID=A8SLZ0_9FIRM Length = 196 Score = 81.7 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 38/216 (17%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M + +IL+ D E EA+ + + R+ + + + +T Sbjct: 1 MDRYILILT-----DTFEEVEALTQVDYLRRADIKVDMISITGKL----------QVTSN 45 Query: 61 RNVLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKAL 120 R + + A + + + E +I+PGG AA + D+ + + Sbjct: 46 RGITVLADDL-------IENINLKEYAGIIIPGGLPAAFD----------IREDKRVLEI 88 Query: 121 AQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVV 180 + + K + +C P++L K + G + +E+ + D I + Sbjct: 89 IKKFDEEHKLISAICAGPSVLAKA-----GVLSGRNAVIYPGMEDELLDANVKE-DAICI 142 Query: 181 DEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLA 216 D++ K+ + + Sbjct: 143 DDNIITARGAGLAGELAYTLIRKIKGKVQEKQVRFM 178 >UniRef50_C8V2F0 ThiJ/PfpI family protein (AFU_orthologue; AFUA_3G01210) n=2 Tax=Emericella nidulans RepID=C8V2F0_EMENI Length = 933 Score = 81.7 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 66/215 (30%), Gaps = 51/215 (23%) Query: 2 KKIGVILSGCGVYD----------------------GSEIHEAVLTLLAISRSGAQAVCF 39 K+I ++LS + G + E L + +G + Sbjct: 681 KRILIVLSDANYFPLKKPAGSGEGSSSNSKIVDQPSGFFLMELAKPLQKLLDAGHEVTFA 740 Query: 40 APDKQQVDVINH-----LTGEAMTETR--NVLIEAARITRG-----EIRPLAQADAAELD 87 +P+ ++ + E R N L+E + G ++ ++ + Sbjct: 741 SPEGREPQPDPNSESLLAFAGNFYERRRENELLERMKKENGFTKPRKLNSISDDELKNFA 800 Query: 88 ALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDF 147 + +PGG +L + +++L + + H+ KP +C P L Sbjct: 801 GVFIPGGHAPLADLGD----------NKDLGRILEYFHKENKPTAAICHGPYALLSTKVS 850 Query: 148 P-----LRLTIGTDIDTAEVLEE--MGAEHVPCPV 175 I + + E + E +G E Sbjct: 851 GGEFAYKGYKITSWSNAEEKVMESMLGGEVEKVET 885 >UniRef50_C3QHA6 ThiJ family intracellular protease/amidase n=28 Tax=Bacteroides RepID=C3QHA6_9BACE Length = 183 Score = 81.7 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 63/210 (30%), Gaps = 44/210 (20%) Query: 13 VYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITR 72 DG E EA + + R+G + ++ V H Sbjct: 8 FADGFEEIEAFTAIDTLRRAGLNVEIVSVTPDEIVVGAH-------------------DV 48 Query: 73 GEIRPLAQADAAELDA--LIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKP 130 + + + DA L++PGG A L L+ L GKP Sbjct: 49 SVLCDINFENCDFFDAELLLLPGGMPGAATLDK----------HEGLRKLILDFAAKGKP 98 Query: 131 LGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVT-- 188 + +C AP M+ + L+ GAE V V D I+T Sbjct: 99 IAAICAAP-MVLGKLGLLKGKKATCYPSFEQYLD--GAECVN-----AHVVRDGNIITGM 150 Query: 189 --TPAYMLAQNIAEAASGIDKLVSRVLVLA 216 A A I + G +K V ++ Sbjct: 151 GPGAAMEFALTIVDLLVGKEK-VDELVEAM 179 >UniRef50_A8QA68 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8QA68_MALGO Length = 159 Score = 81.3 bits (199), Expect = 2e-14, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 56/174 (32%), Gaps = 29/174 (16%) Query: 1 MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTET 60 M K V L+ G+E E +T + R G V E ++ Sbjct: 1 MPKAIVFLAQ-----GAEEMEFSITYDVLVRGGVDVT---------SVYVPGADEPLSPA 46 Query: 61 RNVLIEAARITRGEIRPLAQA----DAAELDALIVPGGFGAAKNLSNFASLGSECTVDRE 116 +++ + + G L A + DA I+PGG G A LS D Sbjct: 47 DGLVVASRGVKLGVDTTLEALTKSGHAGDYDAYIIPGGAGGANTLSK----------DPT 96 Query: 117 LKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEH 170 + + + H GK +G +C + L I + + L H Sbjct: 97 VLQILRDSHANGKIVGMICAGS-LAALEARVGLGGPITSHPSVKDKLASCTYAH 149 >UniRef50_Q8PSJ5 Protease I n=1 Tax=Methanosarcina mazei RepID=Q8PSJ5_METMA Length = 144 Score = 81.3 bits (199), Expect = 2e-14, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 17/119 (14%) Query: 78 LAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIA 137 + + D L++ GG G K +D++ + + + KP+ +C Sbjct: 32 FKDVNPEDYDILVISGGKGPEK-----------MRLDKDALEITKHFFEKNKPVAAICHG 80 Query: 138 P-AMLPKIFDFPLRLTIGTDIDTAEVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLA 195 P ++ + T I + + GA + ++V+D + +P + A Sbjct: 81 PQVLVSAGVIKGRKATC--WIGIRDDIIAAGALYED---SEVVIDGNFVSSRSPDDLYA 134 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.130 0.349 Lambda K H 0.267 0.0404 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,170,925,825 Number of Sequences: 3077464 Number of extensions: 41716757 Number of successful extensions: 135380 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 314 Number of HSP's successfully gapped in prelim test: 629 Number of HSP's that attempted gapping in prelim test: 133668 Number of HSP's gapped (non-prelim): 991 length of query: 217 length of database: 1,040,396,356 effective HSP length: 124 effective length of query: 93 effective length of database: 658,790,820 effective search space: 61267546260 effective search space used: 61267546260 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 90 (39.3 bits)