Bridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semantics

Jonathan Hoss; Noah Klarmann

arxiv: 2605.29078 · v1 · pith:4YZTHAJ5new · submitted 2026-05-27 · 💻 cs.AI · cs.LG

Bridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semantics

Jonathan Hoss , Noah Klarmann This is my paper

Pith reviewed 2026-06-29 12:01 UTC · model grok-4.3

classification 💻 cs.AI cs.LG

keywords reinforcement learningindustrial dispatchingsim-to-real gapexecution semanticsscheduling policiesevent-driven systemsdeployment mismatchobservation lag

0 comments

The pith

A policy-neutral execution layer records divergences between policy intent and physical results to make industrial RL deployment mismatches observable and attributable.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper identifies problems in event-driven scheduling policies for industrial environments, where asynchronous and partially observed states lead to temporally inconsistent decisions, undefined action admissibility, and ambiguous execution error origins. It proposes a policy-neutral execution and measurement layer that constructs decision-valid snapshots from event streams, defines a standardized execution contract with explicit admissibility, and records outcomes as divergences among policy intent, transactional outcomes, physical execution, and human intervention. This structure separates decision semantics from execution behavior and renders deployment mismatches observable and structurally attributable. Discrete-event simulation evaluation demonstrates analytical benefits across all observation lag regimes by converting undifferentiated failures into structured typed outcomes with full attribution coverage, with operational benefits strongest under low lag where errors can be prevented before commitment.

Core claim

The proposed framework introduces a policy-neutral execution and measurement layer to mediate between scheduling policies and the industrial execution environment. The layer constructs decision-valid snapshots from asynchronous event streams, defines a standardized execution contract with explicit action admissibility, and records outcomes as divergences between policy intent, transactional outcomes, physical execution, and human intervention. This enables separation between decision semantics and execution behavior and makes deployment mismatch observable and structurally attributable. The framework turns execution uncertainty into supervisory data for evaluation and policy refinement.

What carries the argument

The policy-neutral execution and measurement layer, which constructs snapshots from event streams, defines an execution contract, and records typed divergences to separate decision semantics from execution behavior.

If this is right

Undifferentiated execution failures are transformed into structured, typed outcomes with full attribution coverage.
Analytical benefits are obtained across all observation lag regimes.
Operational benefits are strongest under low observation lag, where avoidable execution errors can be prevented before commitment.
Execution uncertainty is converted into supervisory data usable for policy evaluation and refinement.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same layer structure could support attribution in other asynchronous RL settings such as robotic control or network routing.
Typed divergence data might enable automated detection of recurring mismatch patterns for targeted policy updates.
Real-world use would require confirming that the added layer does not itself increase latency or create new attribution blind spots.

Load-bearing premise

A standardized policy-neutral execution contract can be defined and divergences between policy intent, transactional outcomes, physical execution, and human intervention can be recorded accurately without introducing new errors or latency.

What would settle it

A controlled industrial deployment in which recorded divergences are cross-checked against independent ground-truth logs of execution causes to verify whether attribution matches actual failure sources and covers all cases.

Figures

Figures reproduced from arXiv: 2605.29078 by Jonathan Hoss, Noah Klarmann.

read the original abstract

Event-driven scheduling policies are increasingly deployed in industrial environments, where decisions are made under asynchronous and partially observed system states. As a result, decision states are not temporally consistent, action admissibility is not explicitly defined, and the origin of execution errors remains ambiguous. These issues limit both reliability and interpretability. To address this gap, a policy-neutral execution and measurement layer is proposed to mediate between scheduling policies and the industrial execution environment. The layer constructs decision-valid snapshots from asynchronous event streams, defines a standardized execution contract with explicit action admissibility, and records outcomes as divergences between policy intent, transactional outcomes, physical execution, and human intervention. This enables a separation between decision semantics and execution behavior and makes deployment mismatch observable and structurally attributable. The proposed framework is evaluated using a discrete-event simulation. The results show analytical benefits across all observation lag regimes, as undifferentiated execution failures are transformed into structured, typed outcomes with full attribution coverage. Operational benefits are strongest under low observation lag, where avoidable execution errors can be prevented before commitment. Overall, the layer turns execution uncertainty into supervisory data for evaluation and policy refinement.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper defines a policy-neutral layer to record four kinds of execution divergences in event-driven industrial scheduling, but evaluates the whole thing only inside a discrete-event simulator.

read the letter

The core contribution is a three-part execution layer: it builds decision-valid snapshots from async event streams, imposes a standardized contract that declares which actions are admissible, and logs divergences across policy intent, transactional results, physical outcomes, and human overrides. That framing turns undifferentiated failures into typed, attributable events, which is a clean way to make deployment mismatch visible for later policy work.

The simulation results show the expected pattern—stronger gains when observation lag is low, because errors can be caught before commitment—and the attribution coverage looks complete within the model. The authors are right that this structure gives supervisory data that plain RL setups usually lack.

The soft spot is exactly what the stress-test flags: the title promises a sim-to-real bridge, yet every experiment stays inside the same discrete-event simulator that generates both the policy decisions and the execution semantics. No hardware, no sensor noise outside the model, no external human interventions. So the claim that the contract can be realized without adding latency or recording errors in actual plants remains untested. The weakest assumption in the abstract—that a policy-neutral contract can be defined and maintained in real environments—is therefore still an assumption.

This is for people already working on RL for manufacturing dispatch who need better diagnostics on execution mismatch. It is coherent on its own terms and shows clear thinking about the problem, even if the evidence is limited to simulation. I would send it to peer review so the authors can either add a real-plant pilot or tighten the claims to what the sim actually demonstrates.

Referee Report

2 major / 0 minor

Summary. The paper proposes a policy-neutral execution and measurement layer to mediate between event-driven RL scheduling policies and industrial execution environments. The layer builds decision-valid snapshots from asynchronous event streams, defines a standardized execution contract with explicit action admissibility, and records outcomes as divergences among policy intent, transactional results, physical execution, and human intervention. This separation is claimed to make deployment mismatches observable and structurally attributable. The framework is evaluated in a discrete-event simulation, where it converts undifferentiated execution failures into typed outcomes with full attribution coverage, yielding analytical benefits across observation lag regimes and operational benefits under low lag.

Significance. If the execution contract can be implemented in real industrial settings without introducing latency or recording errors, the approach would supply structured supervisory data for policy refinement and improve interpretability of RL dispatching under partial observability. The conceptual distinction between decision semantics and execution behavior targets a known sim-to-real challenge in asynchronous industrial systems. The manuscript provides no machine-checked proofs or reproducible code, and the evaluation remains confined to simulation.

major comments (2)

[Abstract] Abstract (evaluation paragraph): The central claim that the layer bridges the sim-to-real gap by making deployment mismatches observable and attributable in real environments is load-bearing, yet the evaluation occurs only inside a discrete-event simulator in which both policy decisions and execution semantics are generated by the same model. No physical hardware, sensor noise, or human interventions outside the modeled contract are present, so the results demonstrate only intra-sim attribution improvements and leave the assumption that a policy-neutral contract can be realized without new errors or latency untested.
[Abstract] Abstract (layer description): The claim that divergences between policy intent, transactional outcomes, physical execution, and human intervention can be recorded accurately rests on the existence of a standardized, policy-neutral execution contract; the simulation does not introduce external physical divergences, so it cannot validate that the layer records such divergences without introducing new measurement artifacts in actual deployments.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments highlighting the scope of our evaluation. We respond point by point to the major comments below.

read point-by-point responses

Referee: [Abstract] Abstract (evaluation paragraph): The central claim that the layer bridges the sim-to-real gap by making deployment mismatches observable and attributable in real environments is load-bearing, yet the evaluation occurs only inside a discrete-event simulator in which both policy decisions and execution semantics are generated by the same model. No physical hardware, sensor noise, or human interventions outside the modeled contract are present, so the results demonstrate only intra-sim attribution improvements and leave the assumption that a policy-neutral contract can be realized without new errors or latency untested.

Authors: We agree that the evaluation is confined to discrete-event simulation and does not incorporate physical hardware, sensor noise, or unmodeled interventions. The manuscript's contribution centers on defining a policy-neutral execution layer whose attribution mechanisms can be validated internally before deployment; the simulation confirms that undifferentiated failures become typed, attributable outcomes. We acknowledge that demonstrating the contract can be realized in physical systems without introducing latency or recording errors requires separate empirical study, which lies outside the present scope. We will revise the abstract to qualify the bridging claim as a design objective supported by simulation evidence rather than a fully validated real-world outcome. revision: yes
Referee: [Abstract] Abstract (layer description): The claim that divergences between policy intent, transactional outcomes, physical execution, and human intervention can be recorded accurately rests on the existence of a standardized, policy-neutral execution contract; the simulation does not introduce external physical divergences, so it cannot validate that the layer records such divergences without introducing new measurement artifacts in actual deployments.

Authors: The simulation validates the layer's ability to apply the contract consistently and produce complete attribution within the modeled environment. Because the contract is defined to be policy-neutral and interface-based, it is intended to be realized by mapping to industrial control and logging systems; the simulation therefore tests the attribution logic that would apply to external divergences when they arise. We concur that the simulation cannot rule out new measurement artifacts in physical deployments. We will revise the abstract to separate the demonstrated internal consistency from the untested aspects of real-world measurement fidelity. revision: yes

Circularity Check

0 steps flagged

No circularity: conceptual framework with no equations, fitted parameters, or self-referential derivations

full rationale

The manuscript proposes a policy-neutral execution layer as a conceptual architecture for separating decision semantics from execution behavior in industrial dispatching. The central claim is that this layer makes deployment mismatches observable and attributable. Evaluation occurs entirely inside a discrete-event simulator, but the paper presents no mathematical derivations, predictions derived from fitted parameters, or first-principles results that reduce to their own inputs. No self-citations are invoked to justify uniqueness theorems or ansatzes. The load-bearing assumption (that a standardized execution contract can be realized without new errors in real environments) is stated but not derived from prior results within the paper; it remains an untested modeling choice rather than a circular reduction. This is a standard non-finding for a framework paper lacking quantitative self-referential structure.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

The abstract contains no explicit free parameters, mathematical axioms, or invented physical entities; the core addition is a conceptual software layer whose properties are stated without quantitative grounding.

invented entities (1)

policy-neutral execution and measurement layer no independent evidence
purpose: Mediates between RL scheduling policies and the industrial execution environment by constructing decision-valid snapshots, defining action admissibility, and recording typed divergences
Introduced as the central proposal in the abstract; no independent evidence or external validation is mentioned.

pith-pipeline@v0.9.1-grok · 5726 in / 1328 out tokens · 55316 ms · 2026-06-29T12:01:15.579560+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

18 extracted references · 16 canonical work pages · 1 internal anchor

[1]

A survey of dynamic scheduling in manufacturing systems,

D. Ouelhadj and S. Petrovic, “A survey of dynamic scheduling in manufacturing systems,”Journal of Scheduling, vol. 12, no. 4, pp. 417– 431, Aug. 2009, doi: 10.1007/s10951-008-0090-8

work page doi:10.1007/s10951-008-0090-8 2009
[2]

A literature review of reinforcement learn- ing methods applied to job-shop scheduling problems,

X. Zhang and G.-Y . Zhu, “A literature review of reinforcement learn- ing methods applied to job-shop scheduling problems,”Computers & Operations Research, vol. 175, Art. no. 106929, Mar. 2025, doi: 10.1016/j.cor.2024.106929

work page doi:10.1016/j.cor.2024.106929 2025
[3]

Graph neural networks for job shop scheduling problems: A survey,

I. G. Smitet al., “Graph neural networks for job shop scheduling problems: A survey,”Computers & Operations Research, vol. 176, Art. no. 106914, Apr. 2025, doi: 10.1016/j.cor.2024.106914

work page doi:10.1016/j.cor.2024.106914 2025
[4]

Offline reinforcement learning for learning to dispatch for job shop scheduling,

J. van Remmerden, Z. Bukhsh, and Y . Zhang, “Offline reinforcement learning for learning to dispatch for job shop scheduling,”Machine Learning, vol. 114, no. 8, Mar. 2025, doi: 10.1007/s10994-025-06826-w

work page doi:10.1007/s10994-025-06826-w 2025
[5]

Scalable Production Scheduling: Linear Complexity via Unified Homogeneous Graphs

J. Hoss, M. Link, and N. Klarmann, “Scalable production scheduling: Linear complexity via unified homogeneous graphs,”arXiv preprint arXiv:2604.23841, Apr. 2026, doi: 10.48550/arXiv.2604.23841

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.23841 2026
[6]

Reinforcement learning based dispatching solu- tions in semiconductor manufacturing: A literature review on validation and deployment,

P. St ¨ockermannet al., “Reinforcement learning based dispatching solu- tions in semiconductor manufacturing: A literature review on validation and deployment,”Production & Manufacturing Research, vol. 13, no. 1, Art. no. 2582472, 2025, doi: 10.1080/21693277.2025.2582472

work page doi:10.1080/21693277.2025.2582472 2025
[7]

Challenges of real- world reinforcement learning,

G. Dulac-Arnold, D. Mankowitz, and T. Hester, “Challenges of real- world reinforcement learning,”Machine Learning, vol. 110, pp. 2419– 2468, Sep. 2021, doi: 10.1007/s10994-021-05961-4

work page doi:10.1007/s10994-021-05961-4 2021
[8]

Dynamic scheduling for flexible job shop with new job insertions by deep reinforcement learning,

S. Luo, “Dynamic scheduling for flexible job shop with new job insertions by deep reinforcement learning,”Applied Soft Computing, vol. 91, Art. no. 106208, Jun. 2020, doi: 10.1016/j.asoc.2020.106208

work page doi:10.1016/j.asoc.2020.106208 2020
[9]

Designing an adaptive and deep learning based control framework for modular production systems,

M. Panzer and N. Gronau, “Designing an adaptive and deep learning based control framework for modular production systems,”Journal of Intelligent Manufacturing, vol. 35, no. 8, pp. 4113–4136, Dec. 2024, doi: 10.1007/s10845-023-02249-3

work page doi:10.1007/s10845-023-02249-3 2024
[10]

Action robust reinforcement learn- ing and applications in continuous control,

C. Tessler, Y . Efroni, and S. Mannor, “Action robust reinforcement learn- ing and applications in continuous control,” inInternational Conference on Machine Learning, 2019, pp. 6215–6224

2019
[11]

A production scheduling frame- work for reinforcement learning under real-world constraints,

J. Hoss, F. Schelling, and N. Klarmann, “A production scheduling frame- work for reinforcement learning under real-world constraints,” inProc. 2025 IEEE 21st Int. Conf. Automation Science and Engineering (CASE), 2025, pp. 1736–1743, doi: 10.1109/CASE58245.2025.11163982

work page doi:10.1109/case58245.2025.11163982 2025
[12]

https://doi.org/10.1109/SSCI47803.2020.9308468, https://doi.org/10.1109/SSCI47803.2020.9308468

W. Zhao, J. P. Queralta, and T. Westerlund, “Sim-to-real transfer in deep reinforcement learning for robotics: A survey,” inProc. 2020 IEEE Symposium Series on Computational Intelligence (SSCI), Dec. 2020, pp. 737–744, doi: 10.1109/SSCI47803.2020.9308468

work page doi:10.1109/ssci47803.2020.9308468 2020
[13]

DeepREM: Deep-learning- based radio environment map estimation from sparse measurements,

H. Xu, W. Yu, D. Griffith, and N. Golmie, “A survey on In- dustrial Internet of Things: A cyber-physical systems perspective,” IEEE Access, vol. 6, pp. 78238–78259, Dec. 2018, doi: 10.1109/AC- CESS.2018.2884906

work page doi:10.1109/ac- 2018
[14]

Automation pyramid as constructor for a complete digital twin, case study: A didactic manufacturing system,

E. M. Martinez, P. Ponce, I. Macias, and A. Molina, “Automation pyramid as constructor for a complete digital twin, case study: A didactic manufacturing system,”Sensors, vol. 21, no. 14, Art. no. 4656, Jul. 2021, doi: 10.3390/s21144656

work page doi:10.3390/s21144656 2021
[15]

Digital twins in Industry 5.0,

Z. Lv, “Digital twins in Industry 5.0,”Research, vol. 6, Art. no. 0071, Mar. 2023, doi: 10.34133/research.0071

work page doi:10.34133/research.0071 2023
[16]

Edge computing in Industrial Internet of Things: Architecture, advances and challenges,

T. Qiu, N. Chen, K. Li, D. Qiao, Z. Fu, and W. Si, “Edge computing in Industrial Internet of Things: Architecture, advances and challenges,” IEEE Communications Surveys & Tutorials, vol. 22, no. 4, pp. 2462– 2488, Jul. 2020, doi: 10.1109/COMST.2020.3009103

work page doi:10.1109/comst.2020.3009103 2020
[17]

Flexible job shop scheduling problem under Industry 5.0: A survey on human reintegration, environmental consideration and resilience improvement,

C. Destouet, H. Tlahig, B. Bettayeb, and B. Mazari, “Flexible job shop scheduling problem under Industry 5.0: A survey on human reintegration, environmental consideration and resilience improvement,” Journal of Manufacturing Systems, vol. 67, pp. 155–173, Apr. 2023, doi: 10.1016/j.jmsy.2023.01.004

work page doi:10.1016/j.jmsy.2023.01.004 2023
[18]

Scherfke and O

S. Scherfke and O. V olkmer,SimPy: Discrete Event Simulation for Python, version 4.1.1, [Online]. Available: https://simpy.readthedocs.io/

[1] [1]

A survey of dynamic scheduling in manufacturing systems,

D. Ouelhadj and S. Petrovic, “A survey of dynamic scheduling in manufacturing systems,”Journal of Scheduling, vol. 12, no. 4, pp. 417– 431, Aug. 2009, doi: 10.1007/s10951-008-0090-8

work page doi:10.1007/s10951-008-0090-8 2009

[2] [2]

A literature review of reinforcement learn- ing methods applied to job-shop scheduling problems,

X. Zhang and G.-Y . Zhu, “A literature review of reinforcement learn- ing methods applied to job-shop scheduling problems,”Computers & Operations Research, vol. 175, Art. no. 106929, Mar. 2025, doi: 10.1016/j.cor.2024.106929

work page doi:10.1016/j.cor.2024.106929 2025

[3] [3]

Graph neural networks for job shop scheduling problems: A survey,

I. G. Smitet al., “Graph neural networks for job shop scheduling problems: A survey,”Computers & Operations Research, vol. 176, Art. no. 106914, Apr. 2025, doi: 10.1016/j.cor.2024.106914

work page doi:10.1016/j.cor.2024.106914 2025

[4] [4]

Offline reinforcement learning for learning to dispatch for job shop scheduling,

J. van Remmerden, Z. Bukhsh, and Y . Zhang, “Offline reinforcement learning for learning to dispatch for job shop scheduling,”Machine Learning, vol. 114, no. 8, Mar. 2025, doi: 10.1007/s10994-025-06826-w

work page doi:10.1007/s10994-025-06826-w 2025

[5] [5]

Scalable Production Scheduling: Linear Complexity via Unified Homogeneous Graphs

J. Hoss, M. Link, and N. Klarmann, “Scalable production scheduling: Linear complexity via unified homogeneous graphs,”arXiv preprint arXiv:2604.23841, Apr. 2026, doi: 10.48550/arXiv.2604.23841

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.23841 2026

[6] [6]

Reinforcement learning based dispatching solu- tions in semiconductor manufacturing: A literature review on validation and deployment,

P. St ¨ockermannet al., “Reinforcement learning based dispatching solu- tions in semiconductor manufacturing: A literature review on validation and deployment,”Production & Manufacturing Research, vol. 13, no. 1, Art. no. 2582472, 2025, doi: 10.1080/21693277.2025.2582472

work page doi:10.1080/21693277.2025.2582472 2025

[7] [7]

Challenges of real- world reinforcement learning,

G. Dulac-Arnold, D. Mankowitz, and T. Hester, “Challenges of real- world reinforcement learning,”Machine Learning, vol. 110, pp. 2419– 2468, Sep. 2021, doi: 10.1007/s10994-021-05961-4

work page doi:10.1007/s10994-021-05961-4 2021

[8] [8]

Dynamic scheduling for flexible job shop with new job insertions by deep reinforcement learning,

S. Luo, “Dynamic scheduling for flexible job shop with new job insertions by deep reinforcement learning,”Applied Soft Computing, vol. 91, Art. no. 106208, Jun. 2020, doi: 10.1016/j.asoc.2020.106208

work page doi:10.1016/j.asoc.2020.106208 2020

[9] [9]

Designing an adaptive and deep learning based control framework for modular production systems,

M. Panzer and N. Gronau, “Designing an adaptive and deep learning based control framework for modular production systems,”Journal of Intelligent Manufacturing, vol. 35, no. 8, pp. 4113–4136, Dec. 2024, doi: 10.1007/s10845-023-02249-3

work page doi:10.1007/s10845-023-02249-3 2024

[10] [10]

Action robust reinforcement learn- ing and applications in continuous control,

C. Tessler, Y . Efroni, and S. Mannor, “Action robust reinforcement learn- ing and applications in continuous control,” inInternational Conference on Machine Learning, 2019, pp. 6215–6224

2019

[11] [11]

A production scheduling frame- work for reinforcement learning under real-world constraints,

J. Hoss, F. Schelling, and N. Klarmann, “A production scheduling frame- work for reinforcement learning under real-world constraints,” inProc. 2025 IEEE 21st Int. Conf. Automation Science and Engineering (CASE), 2025, pp. 1736–1743, doi: 10.1109/CASE58245.2025.11163982

work page doi:10.1109/case58245.2025.11163982 2025

[12] [12]

https://doi.org/10.1109/SSCI47803.2020.9308468, https://doi.org/10.1109/SSCI47803.2020.9308468

W. Zhao, J. P. Queralta, and T. Westerlund, “Sim-to-real transfer in deep reinforcement learning for robotics: A survey,” inProc. 2020 IEEE Symposium Series on Computational Intelligence (SSCI), Dec. 2020, pp. 737–744, doi: 10.1109/SSCI47803.2020.9308468

work page doi:10.1109/ssci47803.2020.9308468 2020

[13] [13]

DeepREM: Deep-learning- based radio environment map estimation from sparse measurements,

H. Xu, W. Yu, D. Griffith, and N. Golmie, “A survey on In- dustrial Internet of Things: A cyber-physical systems perspective,” IEEE Access, vol. 6, pp. 78238–78259, Dec. 2018, doi: 10.1109/AC- CESS.2018.2884906

work page doi:10.1109/ac- 2018

[14] [14]

Automation pyramid as constructor for a complete digital twin, case study: A didactic manufacturing system,

E. M. Martinez, P. Ponce, I. Macias, and A. Molina, “Automation pyramid as constructor for a complete digital twin, case study: A didactic manufacturing system,”Sensors, vol. 21, no. 14, Art. no. 4656, Jul. 2021, doi: 10.3390/s21144656

work page doi:10.3390/s21144656 2021

[15] [15]

Digital twins in Industry 5.0,

Z. Lv, “Digital twins in Industry 5.0,”Research, vol. 6, Art. no. 0071, Mar. 2023, doi: 10.34133/research.0071

work page doi:10.34133/research.0071 2023

[16] [16]

Edge computing in Industrial Internet of Things: Architecture, advances and challenges,

T. Qiu, N. Chen, K. Li, D. Qiao, Z. Fu, and W. Si, “Edge computing in Industrial Internet of Things: Architecture, advances and challenges,” IEEE Communications Surveys & Tutorials, vol. 22, no. 4, pp. 2462– 2488, Jul. 2020, doi: 10.1109/COMST.2020.3009103

work page doi:10.1109/comst.2020.3009103 2020

[17] [17]

Flexible job shop scheduling problem under Industry 5.0: A survey on human reintegration, environmental consideration and resilience improvement,

C. Destouet, H. Tlahig, B. Bettayeb, and B. Mazari, “Flexible job shop scheduling problem under Industry 5.0: A survey on human reintegration, environmental consideration and resilience improvement,” Journal of Manufacturing Systems, vol. 67, pp. 155–173, Apr. 2023, doi: 10.1016/j.jmsy.2023.01.004

work page doi:10.1016/j.jmsy.2023.01.004 2023

[18] [18]

Scherfke and O

S. Scherfke and O. V olkmer,SimPy: Discrete Event Simulation for Python, version 4.1.1, [Online]. Available: https://simpy.readthedocs.io/