How Will My Business Process Unfold? Predicting Case Suffixes With Start and End Timestamps

Fredrik Milani; Marlon Dumas; Muhammad Awais Ali

arxiv: 2509.14536 · v3 · submitted 2025-09-18 · 💻 cs.LG

How Will My Business Process Unfold? Predicting Case Suffixes With Start and End Timestamps

Muhammad Awais Ali , Marlon Dumas , Fredrik Milani This is my paper

Pith reviewed 2026-05-18 15:35 UTC · model grok-4.3

classification 💻 cs.LG

keywords predictive process monitoringcase suffix predictiontimestamp predictionwaiting timeprocessing timeresource capacity planningbusiness process managementevent log analysis

0 comments

The pith

This paper introduces a method to predict business case suffixes that include separate start and end timestamps for each remaining activity.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to improve predictive process monitoring by generating full case suffixes that specify both when each future activity begins and when it ends. Current approaches typically forecast only a single completion timestamp, which merges waiting time and processing time into one value. Distinguishing these two intervals matters for resource capacity planning because it reveals when resources will actually be occupied versus idle. The proposed technique trains models to output these distinct timestamps directly from event log data.

Core claim

By predicting distinct waiting and processing intervals for each activity in the case suffix, the method supplies a more granular forecast of future resource demands than single-timestamp suffix predictors, thereby supporting more accurate operational scheduling and workload management.

What carries the argument

A prediction technique that separately forecasts start timestamps and end timestamps for every activity in the remaining case sequence.

If this is right

Resource schedulers can allocate personnel and equipment only for the actual processing intervals rather than the full activity duration.
Workload forecasts become finer-grained, showing periods of expected idleness between activities.
Capacity planning decisions gain an additional data dimension for deciding when to add or reduce resources.
Operational dashboards can display predicted start times to set more precise customer expectations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same separation of start and end timestamps could be applied to predict resource or cost attributes attached to each interval.
Integration with discrete-event simulation would allow testing of alternative resource policies against the predicted future intervals.
The approach invites comparison with interval-regression methods used in other temporal forecasting domains.

Load-bearing premise

Event logs contain or allow derivation of distinct start and end timestamps per activity, and models can be trained to predict these two values separately with accuracy useful for resource planning.

What would settle it

A controlled evaluation on real event logs in which separate start-and-end timestamp predictions produce no measurable gain in resource utilization forecasts or scheduling accuracy relative to conventional single-timestamp predictions.

Figures

Figures reproduced from arXiv: 2509.14536 by Fredrik Milani, Marlon Dumas, Muhammad Awais Ali.

**Figure 1.** Figure 1: Offline Phase Input: To train models for case suffix prediction, we take as input a type of event log called an activity instance log [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗

**Figure 2.** Figure 2: Online Phase Input: In the online phase, Alg. 2 takes as input a log L, a cutoff time point tcutoff, and three models α, β, γ. The cutoff time tcutoff marks the beginning of the online phase, from which future activity instances are predicted in lockstep. To reflect the system’s state at tcutoff, the log L is modified by removing all activity instances that start after the tcutoff. For activity instances t… view at source ↗

**Figure 3.** Figure 3: DL distance (activity) comparison between Multi-Model (MM) and [PITH_FULL_IMAGE:figures/full_fig_p015_3.png] view at source ↗

**Figure 4.** Figure 4: MAE in inter-start time predictions: Multi-Model (MM) vs. Single-Model [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗

**Figure 5.** Figure 5: MAE in processing time predictions: Multi-Model (MM) vs. Single-Model [PITH_FULL_IMAGE:figures/full_fig_p017_5.png] view at source ↗

read the original abstract

Predictive process monitoring supports operational decision-making by forecasting future states of ongoing business cases. A key task is case suffix prediction, which estimates the remaining sequence of activities for a case. Most existing approaches only generate activities with a single timestamp (usually the completion time). However, this is insufficient for resource capacity planning, which requires distinguishing between waiting time and processing time to accurately schedule resources and manage workloads. This paper introduces a technique to predict case suffixes that include both start and end timestamps. By predicting distinct waiting and processing intervals, the method provides a more granular view of future resource demands.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper extends suffix prediction to dual start and end timestamps for better resource planning granularity, but success hinges on handling logs without explicit start times.

read the letter

The one thing to know is that this paper proposes predicting both start and end timestamps when forecasting the suffix of a business process case. The goal is to separate waiting time from processing time for more accurate resource capacity planning. They handle the motivation well by contrasting their approach with most existing methods that only predict a single completion timestamp. This highlights a practical shortcoming for operational use and positions the dual-timestamp prediction as a direct response. The soft spots center on execution details that are not visible in the abstract. The stress-test note raises a valid point about event logs often lacking explicit start timestamps. If the paper does not demonstrate a reliable way to obtain or predict those starts independently, or if it tests only on datasets where both are already present, then the claimed granularity advantage may not generalize. Reviewers will want to see the model architecture, loss functions, datasets used, and quantitative comparisons to single-timestamp baselines on metrics relevant to planning. No major issues with circular reasoning or unsupported claims based on what is shown. The result appears new within the cited literature on predictive process monitoring. This paper targets researchers in business process management and predictive analytics. Someone focused on improving forecasts for resource management would get the most out of it. It is solid enough to merit a full peer review, especially to verify the experimental claims and data handling. I recommend putting it through peer review with attention to the timestamp derivation and evaluation rigor.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces a technique for predicting case suffixes in business process monitoring that includes both start and end timestamps for each activity in the suffix. By modeling waiting and processing intervals separately, it aims to provide more detailed forecasts for resource capacity planning than existing approaches that typically predict only completion times.

Significance. If the proposed method demonstrates improved accuracy in predicting these intervals and is applicable to real-world event logs, it could significantly advance predictive process monitoring by enabling better-informed decisions on resource allocation and workload management. The distinction between waiting and processing times addresses a practical gap in current methods.

major comments (2)

Abstract: The central claim that predicting distinct waiting and processing intervals yields a more granular view for resource planning is load-bearing, yet the text does not specify how start timestamps are obtained or derived when event logs record only completion times (a common case). This preprocessing assumption must be validated or shown to be non-trivial, as it underpins the advantage over single-timestamp baselines.
§3 (method): No architecture, loss function, or training procedure is described for jointly or separately predicting the two timestamp values. Without these details it is impossible to determine whether the separation of intervals is a modeling innovation or a post-processing step, weakening evaluation of the core technical contribution.

minor comments (1)

Evaluation section: Include explicit comparison metrics (e.g., MAE on waiting vs. processing time) against baselines that also predict timestamps, and report dataset characteristics regarding availability of start/end labels.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which identify important points for improving the clarity and completeness of our manuscript. We respond to each major comment below and indicate the revisions we will make.

read point-by-point responses

Referee: Abstract: The central claim that predicting distinct waiting and processing intervals yields a more granular view for resource planning is load-bearing, yet the text does not specify how start timestamps are obtained or derived when event logs record only completion times (a common case). This preprocessing assumption must be validated or shown to be non-trivial, as it underpins the advantage over single-timestamp baselines.

Authors: We agree that the abstract should explicitly address this preprocessing step. In the full manuscript, start timestamps are derived from the completion time of the preceding activity (setting the start of the next activity to the end of the prior one) or from historical averages of activity durations when direct start times are absent. This is a standard but non-trivial assumption in process mining that enables the distinction between waiting and processing intervals. We will revise the abstract to include a concise statement on this derivation process and add a short validation paragraph in the experimental setup section to demonstrate its impact on resource planning forecasts. revision: yes
Referee: §3 (method): No architecture, loss function, or training procedure is described for jointly or separately predicting the two timestamp values. Without these details it is impossible to determine whether the separation of intervals is a modeling innovation or a post-processing step, weakening evaluation of the core technical contribution.

Authors: The referee correctly notes the omission of implementation details in §3. The separation of waiting and processing intervals is achieved through the model architecture itself: a sequence-to-sequence transformer with three parallel output heads (one for activity labels, one for start timestamps, and one for end timestamps). The loss function is a weighted combination of cross-entropy loss for the activity sequence and separate mean-squared-error terms for start and end timestamp regression, allowing the model to learn interdependencies between the two intervals during joint training. Training uses the Adam optimizer with teacher forcing for suffix generation and is performed end-to-end. We will expand §3 with these specifications, a model diagram, and pseudocode to clarify that the distinction is a modeling choice rather than post-processing. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper presents an introduced technique for predicting case suffixes that include distinct start and end timestamps, framed as an advance over single-timestamp baselines for resource planning. No equations, fitted parameters, or derivation steps are exhibited in the abstract or described claims that reduce by construction to prior inputs, self-citations, or ansatzes. The central premise rests on the data assumption that event logs permit extraction of per-activity start and end times, which is an external precondition rather than a self-referential fit or renamed result. The approach is therefore self-contained against external benchmarks with no load-bearing circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only review yields minimal ledger entries; no explicit free parameters, invented entities, or ad-hoc axioms are stated. The approach implicitly relies on the domain assumption that timestamped event logs are available and that start/end distinctions are learnable.

axioms (1)

domain assumption Business process event logs contain or permit derivation of distinct start and end timestamps per activity
Required for training models that output both timestamps; implicit in the contrast with single-timestamp methods.

pith-pipeline@v0.9.0 · 5626 in / 1286 out tokens · 48588 ms · 2026-05-18T15:35:58.910047+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

24 extracted references · 24 canonical work pages

[1]

In: RCIS (1)

Ali, M.A., Dumas, M., Milani, F.: Enhancing the accuracy of predictors of activ- ity sequences of business processes. In: RCIS (1). LNBIP, vol. 513, pp. 149–165. Springer (2024)

work page 2024
[2]

Ali, M.A., Milani, F., Dumas, M.: Data-Driven Identification and Analysis of Wait- ing Times in Business Processes. Bus. Inf. Syst. Eng. (2024)

work page 2024
[3]

Camargo, M., B´ aron, D., Dumas, M., Rojas, O.G.: Learning business process sim- ulation models: A hybrid process mining and deep learning approach. Inf. Syst. 117, 102248 (2023)

work page 2023
[4]

Camargo, M., Dumas, M., Gonz´ alez-Rojas, O.: Learning accurate LSTM models of business processes. In: BPM. LNCS, vol. 11675, pp. 286–302. Springer (2019)

work page 2019
[5]

Springer (2018)

Dumas, M., Rosa, M.L., Mendling, J., Reijers, H.A.: Fundamentals of Business Process Management, 2nd Edition. Springer (2018)

work page 2018
[6]

Evermann, J., Rehse, J., Fettke, P.: Predicting process behaviour using deep learn- ing. Decis. Support Syst.100, 129–140 (2017) 18 Muhammad Awais Ali et al

work page 2017
[7]

IEEE Trans

Gunnarsson, B.R., vanden Broucke, S., De Weerdt, J.: A direct data aware LSTM neural network architecture for complete remaining trace and runtime prediction. IEEE Trans. Serv. Comput.16(4), 2330–2342 (2023)

work page 2023
[8]

Gunnarsson, B.R., vanden Broucke, S., Weerdt, J.D.: LS-ICE: A load state inter- case encoding framework for improved predictive monitoring of business processes. Inf. Syst.125, 102432 (2024)

work page 2024
[9]

Data Min

Jr., C.N.S., Freitas, A.A.: A survey of hierarchical classification across different application domains. Data Min. Knowl. Discov.22(1-2), 31–72 (2011)

work page 2011
[10]

Ketyk´ o, I., Mannhardt, F., Hassani, M., van Dongen, B.F.: What averages do not tell: predicting real life processes with sequential deep learning. In: SAC. pp. 1128–1131. ACM (2022)

work page 2022
[11]

Meneghello, F., Francescomarino, C.D., Ghidini, C., Ronzani, M.: Runtime integra- tion of machine learning and simulation for business processes: Time and decision mining predictions. Inf. Syst.128, 102472 (2025)

work page 2025
[12]

In: ICPM

Pasquadibisceglie, V., Appice, A., Castellano, G., Malerba, D.: Using convolutional neural networks for predictive process analytics. In: ICPM. pp. 129–136. IEEE (2019)

work page 2019
[13]

In: ICPM

Pasquadibisceglie, V., Appice, A., Malerba, D.: LUPIN: A LLM approach for ac- tivity suffix prediction in business process event logs. In: ICPM. pp. 1–8. IEEE (2024)

work page 2024
[14]

Rafalin, E., Souvaine, D.L.: Topological sweep of the complete graph. Discret. Appl. Math.156(17), 3276–3290 (2008)

work page 2008
[15]

In: CAiSE

Rama-Maneiro, E., Patrizi, F., Vidal, J.C., Lama, M.: Towards learning the optimal sampling strategy for suffix prediction in predictive monitoring. In: CAiSE. Lecture Notes in Computer Science, vol. 14663, pp. 215–230. Springer (2024)

work page 2024
[16]

IEEE Trans

Rama-Maneiro, E., Vidal, J.C., Lama, M.: Deep learning for predictive business process monitoring: Review and benchmark. IEEE Trans. Serv. Comput.16(1), 739–756 (2023)

work page 2023
[17]

Com- puting106(9), 3085–3111 (2024)

Rama-Maneiro, E., Vidal, J.C., Lama, M., Monteagudo-Lago, P.: Exploiting re- current graph neural networks for suffix prediction in predictive monitoring. Com- puting106(9), 3085–3111 (2024)

work page 2024
[18]

Neural Networks167, 715–729 (2023)

Succetti, F., Rosato, A., Panella, M.: An adaptive embedding procedure for time series forecasting with deep neural networks. Neural Networks167, 715–729 (2023)

work page 2023
[19]

In: CAiSE

Tax, N., Verenich, I., La Rosa, M., Dumas, M.: Predictive business process mon- itoring with LSTM neural networks. In: CAiSE. LNCS, vol. 10253, pp. 477–492. Springer (2017)

work page 2017
[20]

Taymouri, F., La Rosa, M., Erfani, S.M., Bozorgi, Z.D., Verenich, I.: Predictive business process monitoring via generative adversarial nets: The case of next event prediction. In: BPM. LNCS, vol. 12168, pp. 237–256. Springer (2020)

work page 2020
[21]

ACM Trans

Teinemaa, I., Dumas, M., Rosa, M.L., Maggi, F.M.: Outcome-oriented predictive process monitoring: Review and benchmark. ACM Trans. Knowl. Discov. Data 13(2), 17:1–17:57 (2019)

work page 2019
[22]

In: BPM (PhD/Demos)

Verenich, I.: Explainable predictive monitoring of temporal measures of business processes. In: BPM (PhD/Demos). CEUR Workshop Proceedings, vol. 2420, pp. 26–30. CEUR-WS.org (2019)

work page 2019
[23]

ACM Trans

Verenich, I., Dumas, M., La Rosa, M., Maggi, F.M., Teinemaa, I.: Survey and cross- benchmark comparison of remaining time prediction methods in business process monitoring. ACM Trans. Intell. Syst. Technol.10(4), 34:1–34:34 (2019)

work page 2019
[24]

In: ICPM

Wuyts, B., vanden Broucke, S.K.L.M., Weerdt, J.D.: Sutran: an encoder-decoder transformer for full-context-aware suffix prediction of business processes. In: ICPM. pp. 17–24. IEEE (2024)

work page 2024

[1] [1]

In: RCIS (1)

Ali, M.A., Dumas, M., Milani, F.: Enhancing the accuracy of predictors of activ- ity sequences of business processes. In: RCIS (1). LNBIP, vol. 513, pp. 149–165. Springer (2024)

work page 2024

[2] [2]

Ali, M.A., Milani, F., Dumas, M.: Data-Driven Identification and Analysis of Wait- ing Times in Business Processes. Bus. Inf. Syst. Eng. (2024)

work page 2024

[3] [3]

Camargo, M., B´ aron, D., Dumas, M., Rojas, O.G.: Learning business process sim- ulation models: A hybrid process mining and deep learning approach. Inf. Syst. 117, 102248 (2023)

work page 2023

[4] [4]

Camargo, M., Dumas, M., Gonz´ alez-Rojas, O.: Learning accurate LSTM models of business processes. In: BPM. LNCS, vol. 11675, pp. 286–302. Springer (2019)

work page 2019

[5] [5]

Springer (2018)

Dumas, M., Rosa, M.L., Mendling, J., Reijers, H.A.: Fundamentals of Business Process Management, 2nd Edition. Springer (2018)

work page 2018

[6] [6]

Evermann, J., Rehse, J., Fettke, P.: Predicting process behaviour using deep learn- ing. Decis. Support Syst.100, 129–140 (2017) 18 Muhammad Awais Ali et al

work page 2017

[7] [7]

IEEE Trans

Gunnarsson, B.R., vanden Broucke, S., De Weerdt, J.: A direct data aware LSTM neural network architecture for complete remaining trace and runtime prediction. IEEE Trans. Serv. Comput.16(4), 2330–2342 (2023)

work page 2023

[8] [8]

Gunnarsson, B.R., vanden Broucke, S., Weerdt, J.D.: LS-ICE: A load state inter- case encoding framework for improved predictive monitoring of business processes. Inf. Syst.125, 102432 (2024)

work page 2024

[9] [9]

Data Min

Jr., C.N.S., Freitas, A.A.: A survey of hierarchical classification across different application domains. Data Min. Knowl. Discov.22(1-2), 31–72 (2011)

work page 2011

[10] [10]

Ketyk´ o, I., Mannhardt, F., Hassani, M., van Dongen, B.F.: What averages do not tell: predicting real life processes with sequential deep learning. In: SAC. pp. 1128–1131. ACM (2022)

work page 2022

[11] [11]

Meneghello, F., Francescomarino, C.D., Ghidini, C., Ronzani, M.: Runtime integra- tion of machine learning and simulation for business processes: Time and decision mining predictions. Inf. Syst.128, 102472 (2025)

work page 2025

[12] [12]

In: ICPM

Pasquadibisceglie, V., Appice, A., Castellano, G., Malerba, D.: Using convolutional neural networks for predictive process analytics. In: ICPM. pp. 129–136. IEEE (2019)

work page 2019

[13] [13]

In: ICPM

Pasquadibisceglie, V., Appice, A., Malerba, D.: LUPIN: A LLM approach for ac- tivity suffix prediction in business process event logs. In: ICPM. pp. 1–8. IEEE (2024)

work page 2024

[14] [14]

Rafalin, E., Souvaine, D.L.: Topological sweep of the complete graph. Discret. Appl. Math.156(17), 3276–3290 (2008)

work page 2008

[15] [15]

In: CAiSE

Rama-Maneiro, E., Patrizi, F., Vidal, J.C., Lama, M.: Towards learning the optimal sampling strategy for suffix prediction in predictive monitoring. In: CAiSE. Lecture Notes in Computer Science, vol. 14663, pp. 215–230. Springer (2024)

work page 2024

[16] [16]

IEEE Trans

Rama-Maneiro, E., Vidal, J.C., Lama, M.: Deep learning for predictive business process monitoring: Review and benchmark. IEEE Trans. Serv. Comput.16(1), 739–756 (2023)

work page 2023

[17] [17]

Com- puting106(9), 3085–3111 (2024)

Rama-Maneiro, E., Vidal, J.C., Lama, M., Monteagudo-Lago, P.: Exploiting re- current graph neural networks for suffix prediction in predictive monitoring. Com- puting106(9), 3085–3111 (2024)

work page 2024

[18] [18]

Neural Networks167, 715–729 (2023)

Succetti, F., Rosato, A., Panella, M.: An adaptive embedding procedure for time series forecasting with deep neural networks. Neural Networks167, 715–729 (2023)

work page 2023

[19] [19]

In: CAiSE

Tax, N., Verenich, I., La Rosa, M., Dumas, M.: Predictive business process mon- itoring with LSTM neural networks. In: CAiSE. LNCS, vol. 10253, pp. 477–492. Springer (2017)

work page 2017

[20] [20]

Taymouri, F., La Rosa, M., Erfani, S.M., Bozorgi, Z.D., Verenich, I.: Predictive business process monitoring via generative adversarial nets: The case of next event prediction. In: BPM. LNCS, vol. 12168, pp. 237–256. Springer (2020)

work page 2020

[21] [21]

ACM Trans

Teinemaa, I., Dumas, M., Rosa, M.L., Maggi, F.M.: Outcome-oriented predictive process monitoring: Review and benchmark. ACM Trans. Knowl. Discov. Data 13(2), 17:1–17:57 (2019)

work page 2019

[22] [22]

In: BPM (PhD/Demos)

Verenich, I.: Explainable predictive monitoring of temporal measures of business processes. In: BPM (PhD/Demos). CEUR Workshop Proceedings, vol. 2420, pp. 26–30. CEUR-WS.org (2019)

work page 2019

[23] [23]

ACM Trans

Verenich, I., Dumas, M., La Rosa, M., Maggi, F.M., Teinemaa, I.: Survey and cross- benchmark comparison of remaining time prediction methods in business process monitoring. ACM Trans. Intell. Syst. Technol.10(4), 34:1–34:34 (2019)

work page 2019

[24] [24]

In: ICPM

Wuyts, B., vanden Broucke, S.K.L.M., Weerdt, J.D.: Sutran: an encoder-decoder transformer for full-context-aware suffix prediction of business processes. In: ICPM. pp. 17–24. IEEE (2024)

work page 2024