A Privacy-Preserving Approach to Conformance Checking
Pith reviewed 2026-05-09 19:32 UTC · model grok-4.3
The pith
Conformance checking between a process model and event log can be performed securely using homomorphic encryption so neither owner sees the other's data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We reduce alignment-based conformance checking to string processing algorithms that existing homomorphic encryption schemes can evaluate. The process model and event log are encrypted so that the model owner and log owner can jointly compute the optimal alignments and the resulting fitness and precision metrics without revealing the plaintext contents. Tests with synthetic logs and a real-world event log demonstrate that the encrypted computation yields the same conformance outcomes as the standard method.
What carries the argument
Reduction of alignment computation to homomorphic-encryption-compatible string processing operations that allow joint evaluation of model-log fitness without plaintext disclosure.
If this is right
- Conformance checking becomes possible in settings where the model and log must remain confidential to their respective owners.
- The same string-processing reduction can support other alignment-derived measures such as fitness and precision under encryption.
- The approach works on both artificial and real event logs, establishing basic feasibility.
- Computational cost grows with log and model size, limiting immediate use to modest-sized inputs.
Where Pith is reading between the lines
- Organizations could perform cross-company process audits without exchanging proprietary models or logs.
- Advances in homomorphic encryption efficiency would directly expand the scale of logs this method can handle.
- The same encryption layer might apply to related process mining tasks that also rely on alignments.
Load-bearing premise
The alignment problem can be translated entirely into string operations that homomorphic encryption handles accurately without losing the ability to derive usable conformance metrics.
What would settle it
Running the encrypted method and the standard unencrypted alignment algorithm on identical inputs and observing whether the resulting alignments and conformance scores match exactly.
Figures
read the original abstract
Conformance checking, one of the main process mining operations, aims to identify discrepancies between a process model and an event log. The model represents the expected behaviour, whereas the event log represents the actual process behaviour as captured in information systems records. Traditionally, the process model and the event log are both accessible to the business analyst performing the conformance checking. However, in some contexts, it is necessary to keep either the model or the log private to protect critical or sensitive information. In this paper, we propose a secure approach to conformance checking based on string processing algorithms and homomorphic encryption, where the process model and event log ar not visible to either the model's or event log's owner. The proposed technique is based on alignments, a well-known formalism used for conformance checking. An evaluation is performed using a synthetic and a real-world event log, showing that conformance checking can be securely computed at the expense of high memory and processing requirements.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to introduce a privacy-preserving conformance checking method for process mining. It leverages homomorphic encryption and string processing algorithms to compute alignments between a process model and an event log without either party revealing their data to the other. An evaluation on synthetic and real-world event logs is presented to demonstrate that the approach functions despite significant computational overhead.
Significance. If the string-based reduction accurately captures the semantics of standard alignments (including handling of concurrency and branching), this could be significant for enabling conformance checking in privacy-sensitive settings like inter-organizational processes. The approach builds on established primitives, which is a positive, but the high overhead noted suggests limited practicality without further optimization.
major comments (2)
- [Abstract] The central claim relies on reducing alignment-based conformance checking to string processing operations compatible with homomorphic encryption. However, standard alignments operate on the state space of the process model (e.g., Petri net markings), not raw strings; the abstract provides no evidence that the encoding preserves model semantics such as parallel branches or silent transitions, which is load-bearing for the utility of the conformance metrics.
- [Evaluation] The evaluation mentions support for functionality on synthetic and real-world logs but lacks any quantitative comparison of conformance results (e.g., fitness or precision values) between the proposed method and the non-private baseline, error metrics, or performance benchmarks beyond qualitative 'high requirements'. This undermines assessment of whether the results are usable.
minor comments (2)
- [Abstract] There is a typo: 'ar not visible' should read 'are not visible'.
- The manuscript would benefit from including security proofs or formal arguments for the privacy guarantees, as well as more detailed algorithm descriptions.
Simulated Author's Rebuttal
Thank you for the opportunity to revise our manuscript based on the referee's insightful comments. We have carefully considered each point and provide detailed responses below, along with planned revisions to strengthen the paper.
read point-by-point responses
-
Referee: [Abstract] The central claim relies on reducing alignment-based conformance checking to string processing operations compatible with homomorphic encryption. However, standard alignments operate on the state space of the process model (e.g., Petri net markings), not raw strings; the abstract provides no evidence that the encoding preserves model semantics such as parallel branches or silent transitions, which is load-bearing for the utility of the conformance metrics.
Authors: We thank the referee for this observation. The manuscript details a reduction of alignment computation to string operations that are designed to preserve the necessary semantics of the process model, including handling of concurrency and silent transitions through appropriate encoding of the model's behavior into strings. However, we agree that the abstract does not sufficiently highlight this, and the main text could benefit from more explicit discussion or examples. We will revise the abstract to better reflect this and add a paragraph or subsection providing evidence (such as a formal mapping or illustrative example) that the encoding maintains equivalence to standard state-space based alignments for models with parallel branches and silent transitions. revision: yes
-
Referee: [Evaluation] The evaluation mentions support for functionality on synthetic and real-world logs but lacks any quantitative comparison of conformance results (e.g., fitness or precision values) between the proposed method and the non-private baseline, error metrics, or performance benchmarks beyond qualitative 'high requirements'. This undermines assessment of whether the results are usable.
Authors: We agree that providing quantitative validation is essential to demonstrate the correctness and usability of the results. The current evaluation focuses on showing that the private computation is feasible and produces valid outputs, but we did not include direct comparisons. In the revised manuscript, we will add a new subsection in the evaluation with tables reporting fitness and precision values for both the private method and the standard alignment-based conformance checking on the same datasets. We will also include error metrics (e.g., difference in alignment costs) and more detailed performance benchmarks such as exact runtime and memory consumption figures for different log sizes. This will allow readers to assess the trade-off between privacy and computational cost. revision: yes
Circularity Check
No circularity: construction from standard alignments and HE primitives
full rationale
The paper presents a new protocol that encodes alignments as string operations evaluable under homomorphic encryption. All load-bearing steps cite well-known external formalisms (alignments) and standard cryptographic primitives rather than reducing to self-defined quantities, fitted parameters, or author-overlapping citations. Evaluation on synthetic and real logs is independent of the construction itself. No equation or claim is equivalent to its inputs by definition.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Homomorphic encryption permits meaningful computation on encrypted data without decryption.
- domain assumption Conformance checking alignments can be computed using string processing algorithms.
Reference graph
Works this paper leans on
-
[1]
van der Aalst, W., Adriansyah, A., van Dongen, B.: Replaying history on process models for conformance checking and performance analysis. WIREs Data Mining Knowl. Discov.2(2), 182–192 (2012)
work page 2012
-
[2]
van der Aalst, W., Carmona, J.: Process Mining Handbook, LNBIP, vol. 448. Springer (2022)
work page 2022
-
[3]
Armas-Cervantes, A., Baldan, P., Dumas, M., Garcia-Bañuelos, L.: Diagnosing behavioral differences between business process models: An approach based on event structures. Inf. Syst. pp. 304–325 (2016)
work page 2016
-
[4]
Burrows, M., Wheeler, D.J.: A block-sorting lossless data compression algorithm. In: SRC-RR-124. HP Labs Technical Reports (1994)
work page 1994
-
[5]
Carmona, J., van Dongen, B.F., Solti, A., Weidlich, M.: Conformance Checking - Relating Processes and Models. Springer (2018)
work page 2018
- [6]
-
[7]
Esparza, J., Römer, S., V ogler, W.: An Improvement of McMillan’s Unfolding Algorithm. Formal Methods Syst. Des.30(2), 285–310 (2002)
work page 2002
-
[8]
Ferragina, P., Manzini, G.: Opportunistic data structures with applica- tions. In: FOCS 2000. pp. 390–398. IEEE Computer Society (2000)
work page 2000
-
[9]
In: Rosa, M.L., Loos, P., Pastor, O
Lu, X., Fahland, D., van den Biggelaar, F.J.H.M., van der Aalst, W.M.P.: Handling duplicated tasks in process discovery by refining event labels. In: Rosa, M.L., Loos, P., Pastor, O. (eds.) BPM 2016. LNCS, vol. 9850, pp. 90–107. Springer (2016)
work page 2016
-
[10]
McMillan, K.L., McMillan, K.L.: Symbolic model checking. Springer (1993)
work page 1993
-
[11]
Montanari, U., Rossi, F.: Contextual occurrence nets and concurrent con- straint programming. pp. 280–295. Springer Berlin Heidelberg (1994)
work page 1994
-
[12]
Proceedings of the IEEE77(4), 541–580 (apr 1989) 8
Murata, T.: Petri nets: Properties, analysis and applications. Proceedings of the IEEE77(4), 541–580 (apr 1989) 8
work page 1989
-
[13]
Nong, G., Zhang, S., Chan, W.H.: Two efficient algorithms for linear time suffix array construction. IEEE Trans. Computers60(10), 1471– 1484 (2011)
work page 2011
-
[14]
Sudo, H., Jimbo, M., Nuida, K., Shimizu, K.: Secure wavelet matrix: Alphabet-friendly privacy-preserving string search for bioinformatics. IEEE ACM Trans. Comput. Biol. Bioinform.16(5), 1675–1684 (2019) 9
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.