Developing a Totally Unimodular Linear Program for Optimal Conformance Checking: When and Why It Complements A*
Pith reviewed 2026-06-29 17:01 UTC · model grok-4.3
The pith
Reformulating conformance checking as a totally unimodular linear program guarantees integral optimal solutions via standard LP solvers.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By modeling alignment as a minimum-cost flow on the reachability graph of the synchronous product and exploiting its network-flow structure, the LP constraint matrix is totally unimodular; therefore the optimal extreme-point solution of the relaxed LP is guaranteed to be integral and can be obtained without combinatorial overhead.
What carries the argument
The totally unimodular linear program on the reachability graph of the synchronous product, whose network-flow structure supplies the total unimodularity that converts the relaxed LP into an exact integer solution.
If this is right
- The LP formulation supplies substantial speedups precisely on longer traces that contain deviations.
- A* remains preferable on short traces that conform well to the model.
- A hybrid selector that chooses between the two methods based on trace characteristics achieves 38.6 percent average runtime savings at 96 percent accuracy.
- The approach eliminates the exponential worst-case behavior of A* when substantial deviations are present.
Where Pith is reading between the lines
- The same network-flow structure may appear in other alignment variants, allowing analogous LP reformulations without integer variables.
- Process-mining platforms could embed both solvers and apply the selection rule to handle mixed workloads efficiently in practice.
- Sensitivity information from the LP optimum could be used to rank which deviations matter most, an analysis not directly available from A*.
Load-bearing premise
The reachability graph of the synchronous product has an underlying network-flow structure that makes the LP constraint matrix totally unimodular.
What would settle it
An instance in which the optimal solution to the LP relaxation is fractional rather than integral.
Figures
read the original abstract
Alignment-based conformance checking is the state-of-the-art approach for comparing observed process executions with normative process models. The standard exact solution relies on an A*-based heuristic search, which can exhibit exponential runtime in the presence of long traces or substantial deviations. This paper introduces a reformulation of alignment-based conformance checking as a totally unimodular linear program (LP) defined on the reachability graph of the synchronous product. By exploiting the underlying network-flow structure, the proposed formulation guarantees the existence of an integral optimal extreme-point solution through LP relaxation, thereby avoiding the combinatorial overhead associated with integer variables and branch-and-bound search. We conduct an extensive empirical evaluation on more than 2.1 million conformance checking instances derived from real-world and synthetic benchmark datasets. The results show that A* and the LP approach exhibit complementary performance characteristics: the former performs best on short, well-conforming traces, while the LP formulation provides substantial speedups for longer traces with deviations, precisely where conformance checking is most informative. Based on these findings, we derive simple algorithm-selection guidelines that combine both approaches, achieving average runtime savings of 38.6% with 96% selection accuracy compared to always using A*.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that alignment-based conformance checking can be reformulated as a totally unimodular linear program on the reachability graph of the synchronous product. By exploiting an underlying network-flow structure, the LP relaxation is asserted to yield integral optimal solutions without branch-and-bound. Large-scale experiments on >2.1 million instances from real and synthetic data show complementary runtime behavior with A* (LP faster on long traces with deviations), leading to a selection heuristic with 38.6% average savings at 96% accuracy.
Significance. If the total-unimodularity claim is correct, the work supplies a useful complementary exact method for conformance checking that scales better than A* precisely on the harder instances. The scale of the empirical evaluation (millions of instances) is a clear strength and supports the practical value of the derived selection guidelines.
major comments (2)
- [Abstract] Abstract (reformulation paragraph): the central guarantee that 'the proposed formulation guarantees the existence of an integral optimal extreme-point solution through LP relaxation' rests on the constraint matrix being totally unimodular because of a network-flow structure, yet no explicit LP (variables for log/model/synchronous moves, flow-conservation equalities, objective) or proof that the matrix satisfies the network-matrix TU characterization (at most two nonzero entries per row of opposite sign) is supplied.
- [Empirical evaluation] Empirical evaluation (results paragraph): the reported 38.6% average runtime savings and 96% selection accuracy are presented without any table, per-dataset breakdown, or raw timing data, making it impossible to verify the complementarity claim or the conditions under which the LP is selected.
minor comments (1)
- The abstract states 'more than 2.1 million' instances but does not give the exact count or the split between real-world and synthetic benchmarks; this should be stated precisely in the experimental section.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of the work's significance and the scale of the empirical evaluation. We address each major comment below and will revise the manuscript to improve clarity and verifiability of the claims.
read point-by-point responses
-
Referee: [Abstract] Abstract (reformulation paragraph): the central guarantee that 'the proposed formulation guarantees the existence of an integral optimal extreme-point solution through LP relaxation' rests on the constraint matrix being totally unimodular because of a network-flow structure, yet no explicit LP (variables for log/model/synchronous moves, flow-conservation equalities, objective) or proof that the matrix satisfies the network-matrix TU characterization (at most two nonzero entries per row of opposite sign) is supplied.
Authors: The abstract is intentionally concise and does not contain the explicit LP or TU proof. The full manuscript (Section 3) defines the LP with variables for log moves, model moves, and synchronous moves on the reachability graph of the synchronous product, along with flow-conservation equalities and the standard alignment cost objective. The TU property is established by showing the constraint matrix is a network matrix (each row has at most two nonzero entries of opposite sign) due to the flow structure. We will revise the abstract to include a one-sentence pointer to this structure and add an explicit statement of the LP and a short proof sketch (or reference to the network-matrix characterization) in the main text or an appendix. revision: yes
-
Referee: [Empirical evaluation] Empirical evaluation (results paragraph): the reported 38.6% average runtime savings and 96% selection accuracy are presented without any table, per-dataset breakdown, or raw timing data, making it impossible to verify the complementarity claim or the conditions under which the LP is selected.
Authors: We agree that summary statistics alone limit verifiability. The manuscript will be revised to include a table (or set of tables) providing per-dataset breakdowns of runtime savings, selection accuracy, and the simple selection rules (e.g., based on trace length and number of deviations). We will also add a link to a public repository containing the raw timing data for the >2.1 million instances to allow independent verification of the complementarity between A* and the LP approach. revision: yes
Circularity Check
No circularity: TU property asserted from graph encoding, not reduced to self-fit or self-citation
full rationale
The paper's central claim is that the reformulation of conformance checking as an LP on the synchronous-product reachability graph yields a totally unimodular constraint matrix due to its network-flow structure, guaranteeing integral optima via LP relaxation. No equations, fitted parameters, or self-citations are exhibited that would make the TU guarantee equivalent to its inputs by construction (e.g., no redefinition of the matrix via the result itself, no renaming of known results, and no load-bearing self-citation chain). The derivation is presented as a direct structural property of the encoding, making the result self-contained against external verification rather than internally circular.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The reachability graph of the synchronous product has a network-flow structure that makes the LP constraint matrix totally unimodular.
Reference graph
Works this paper leans on
-
[1]
PhD The- sis.TechnischeUniversiteitEindhoven
Aligning observed and modeled behavior. PhD The- sis.TechnischeUniversiteitEindhoven. URL:https://doi.org/10.6100/ IR770080, doi:10.6100/IR770080. Adriansyah, A., van Dongen, B.F., van der Aalst, W.M.,
-
[2]
Efficient approximate conformance checking using trie data structures, in: 2021 3rd International Confer- ence on Process Mining (ICPM), pp. 1–8. doi:10.1109/ICPM53251.2021. 9576845. Bauer, M., Van der Aa, H., Weidlich, M.,
- [3]
-
[4]
arXiv preprint arXiv:2406.05439
A scalable and near-optimal confor- mance checking approach for long traces. arXiv preprint arXiv:2406.05439 . Boltenhagen, M., Chatain, T., Carmona, J.,
-
[5]
Aligning event logs and process models for multi-perspective conformance checking: An approach based on integer linear programming, in: Business Process Management: 11th International Conference, BPM 2013, Beijing, China, August 26-30,
2013
-
[7]
Efficiently computing alignments: Using the extended marking equation, in: Business Process Management (BPM), Springer. pp. 197–214. doi:10.1007/978-3-319-98648-7_12. Fani Sani, M., van Zelst, S.J., van der Aalst, W.M.,
-
[10]
BPI Challenge 2013, incidents. 4TU.ResearchData. URL:https://doi.org/10.4121/uuid: 500573e6-accc-4b0c-9576-aa5468b10cee, doi:10.4121/uuid: 500573e6-accc-4b0c-9576-aa5468b10cee. Tardos, É.,
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.