pith. machine review for the scientific record. sign in

arxiv: 2604.15279 · v1 · submitted 2026-04-16 · 💻 cs.DC

Recognition: unknown

Wave-Based Dispatch for Circuit Cutting in Hybrid HPC--Quantum Systems

Authors on Pith no claims yet

Pith reviewed 2026-05-10 09:46 UTC · model grok-4.3

classification 💻 cs.DC
keywords circuit cuttinghybrid HPC-quantumquantum fragmentswave-based dispatchNISQ workloadsdynamic queue routermakespanfault recovery
0
0 comments X

The pith

Treating quantum circuit fragments as first-class schedulable units lets HPC schedulers manage NISQ workloads without parsing quantum code.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tries to establish that hybrid HPC-quantum systems can apply mature resource management to circuit-cutting workloads once fragments are exposed through a clean interface. This would matter because current frameworks bind cutting logic directly to execution, blocking use of proven HPC policies on NISQ hardware. DQR achieves the separation with a backend-agnostic fragment descriptor and a wave-based coordinator that uses non-blocking polling to sustain pipeline concurrency. Experiments on a 32-qubit HEA circuit show shorter makespans than a monolithic CPU baseline plus automatic per-fragment recovery when a local QPU fails, all while keeping coordination time at 5 percent for deeper circuits.

Core claim

DQR introduces a backend-agnostic fragment descriptor to expose structural properties without requiring execution layers to parse quantum code, a wave-based coordinator that achieves pipeline concurrency via non-blocking polling, and a production-ready implementation on the CESGA Qmio supercomputer integrating both QPUs local on-premises and remote cloud backends. Experiments on a 32-qubit Hardware-Efficient Ansatz circuit demonstrate makespan improvements over a monolithic CPU baseline and transparent per-fragment failover recovery without pipeline restart. For deeper circuits, the coordination residual accounts for only 5 percent of the total execution time.

What carries the argument

The wave-based coordinator that achieves pipeline concurrency via non-blocking polling, supported by a backend-agnostic fragment descriptor that exposes structural properties without quantum code parsing.

If this is right

  • HPC centers can integrate NISQ workloads into existing production infrastructure while preserving flexibility for new cutting algorithms.
  • Per-fragment failover can reroute tasks from a local QPU to classical simulators without restarting the entire pipeline.
  • Coordination overhead stays low enough to remain negligible as circuit depth increases.
  • Local on-premises and remote cloud QPUs can be mixed with classical resources under the same scheduler.
  • Makespan improves relative to running the full circuit on a monolithic CPU baseline.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same descriptor-plus-wave pattern could apply to other hybrid classical-quantum workflows that decompose large tasks into independent pieces.
  • Low coordination cost suggests the approach would scale to circuits with hundreds of fragments once better cutting methods appear.
  • Standard HPC features such as priority queues or checkpointing could now be applied directly to quantum fragments.
  • Swapping in improved cutting algorithms would require no changes to the dispatch or scheduling layers.

Load-bearing premise

A backend-agnostic fragment descriptor combined with non-blocking wave-based polling can expose enough structural information for mature HPC schedulers to manage quantum fragments without parsing quantum code or incurring high coordination costs.

What would settle it

An experiment on deeper or wider circuits where coordination overhead rises above 5 percent of runtime or where HPC schedulers still require direct quantum code access to allocate fragments effectively.

Figures

Figures reproduced from arXiv: 2604.15279 by Josep Jorba, Ricard S. Garc\'ia-Raigada, Sergio Iserte.

Figure 1
Figure 1. Figure 1: Circuit cutting strategies. Both map the circuit to a DAG and split it into independent subcircuits executed in parallel. Wire cutting severs qubit paths (8 𝑘 subcircuits), while gate cutting decomposes two-qubit gates into quasi￾probabilistic locals (6 𝑘 variants). Results are recombined via tensor reconstruction. up to 8 𝑘 subcircuits for 𝑘 cuts; gate cuts decompose multi￾qubit gates (e.g., CNOT/CZ) into… view at source ↗
Figure 2
Figure 2. Figure 2: Runtime-oriented execution model for circuit-cut workloads. The circuit cutting layer is decoupled from the orchestration layer via fragment descriptors, enabling hardware-aware, policy-driven scheduling across heterogeneous HPC-QC backends. The three contributions of this work (right) are positioned within the layer they address; see Section 1. In this regard, a fragment 𝑓𝑖 is represented as an im￾mutable… view at source ↗
Figure 3
Figure 3. Figure 3: DQR Fragment lifecycle state machine. A fragment is created in the PENDING state and transitions to DISPATCHED upon assignment to a backend. A transient failure returns the fragment to PENDING for retry; once the retry counter exceeds the configured maximum, the fragment reaches the terminal PERMANENT_FAILED state. Successful execution transitions the fragment to the terminal SUCCESS state, at which point … view at source ↗
Figure 4
Figure 4. Figure 4: System architecture of the DQR framework. The three loosely coupled layers communicate via GPFS (file￾based fragment descriptors and results) and MPI (coordinator– worker messages). The Circuit Cutting Layer decomposes the input OpenQASM circuit and emits fragment descriptors. The Runtime Orchestration Layer comprises the RMS, which manages HPC allocation and reconfiguration policies, and the DQR, which pe… view at source ↗
Figure 5
Figure 5. Figure 5: Fragment dispatch comparison between policies B and C [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Makespan (s) for the CPU baseline and DQR Policies A–D on the 32-qubit HEA circuit. Hatching marks the DQR-time portion of each bar, and the dashed line marks the CPU baseline reference (56.2 s). rank contention. With only 3 QC slots, fragments queue sequentially for QPU access; with 72 slots, all QC fragments can be concurrently in flight. Likewise, with 12 MPI rank workers managing 72 fragments, ranks mu… view at source ↗
Figure 8
Figure 8. Figure 8: Makespan sensitivity to 𝑡̄QPU for the 𝐿=2 run. Pipeline (𝑡 setupℎ ) and 𝑇HPC remain constant; substituting 𝑡̄QPU = 18.5 s (IBM Cloud) with 𝑡̄QPU = 4.5 or 6.6 s (Qmio) reduces makespan by 104.6 s (33 %) [PITH_FULL_IMAGE:figures/full_fig_p015_8.png] view at source ↗
Figure 7
Figure 7. Figure 7: DQR-time decomposition for the 𝐿=2 run: 𝑇HPC = 76.7 s (40 %) runs in parallel with the 130 QC fragments; the remaining 55 % corresponds to 𝑇QC − 𝑇HPC = 104.6 s, with 𝐶fixed = 9.1 s (5 %) for MPI coordination and tensor reconstruction. baseline, with a DQR-time of 190.4 s and 𝑡 setupℎ = 126.3 s dominated by QCut server startup and subcircuit generation. The 3.3× slowdown is primarily due to high 𝑡̄QPU on IB… view at source ↗
read the original abstract

Hybrid High-performance Computing (HPC)-quantum workloads based on circuit cutting decompose large quantum circuits into independent fragments, but existing frameworks tightly couple cutting logic to execution orchestration, preventing HPC centers from applying mature resource management policies to Noisy Intermediate-Scale Quantum (NISQ) workloads. We present DQR (Dynamic Queue Router), a runtime framework that bridges this gap by treating circuit fragments as first-class schedulable units. The framework introduces a backend-agnostic fragment descriptor to expose structural properties without requiring execution layers to parse quantum code, a wave-based coordinator that achieves pipeline concurrency via non-blocking polling, and a production-ready implementation on the CESGA Qmio supercomputer integrating both QPUs local on-premises (Qmio) and remote cloud (IBM Torino) backends. Experiments on a 32-qubit Hardware-Efficient Ansatz (HEA) circuit demonstrate not only makespan improvements over a monolithic CPU baseline but also transparent per-fragment failover recovery-specifically rerouting tasks from the local QPU to classical simulators upon encountering hardware-level incompatibilities-without pipeline restart. For deeper circuits, the coordination residual accounts for only 5% of the total execution time, highlighting the framework's scalability. These results show that DQR enables HPC centers to integrate NISQ workloads into existing production infrastructure while preserving the flexibility to adopt improved cutting algorithms or heterogeneous backend technologies.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript presents DQR (Dynamic Queue Router), a runtime framework for hybrid HPC-quantum systems that decomposes circuits via cutting and treats fragments as schedulable units. It introduces a backend-agnostic fragment descriptor exposing structural properties without quantum code parsing, a wave-based coordinator achieving concurrency through non-blocking polling, and a production implementation on CESGA Qmio integrating local Qmio and remote IBM Torino QPUs. Experiments on a 32-qubit Hardware-Efficient Ansatz (HEA) circuit report makespan improvements over a monolithic CPU baseline, transparent per-fragment failover (rerouting from local QPU to simulators without restart), and coordination residual of only 5% of execution time for deeper circuits.

Significance. If the results hold, the work offers a practical mechanism for incorporating NISQ workloads into production HPC environments by exposing fragments to mature schedulers while supporting heterogeneous backends and failover. The real-hardware demonstration of low-overhead coordination and transparent recovery on Qmio is a concrete strength that could facilitate adoption of circuit-cutting techniques in supercomputing centers.

major comments (2)
  1. Experiments section: the central performance claims (makespan improvements and 5% coordination overhead) are reported without details on number of trials, error bars, variance, statistical significance testing, or exact baseline construction. This limits independent verification of the reported gains over the monolithic CPU baseline.
  2. Framework and implementation sections (e.g., description of fragment descriptor and DQR runtime): the claim that the backend-agnostic descriptor plus wave-based polling allows off-the-shelf HPC schedulers to manage fragments without parsing quantum code is not demonstrated. Experiments use the custom DQR runtime on Qmio for placement and failover; no evidence is provided that a standard scheduler such as Slurm can consume only the descriptor fields for decisions on placement, priority, or preemption.
minor comments (2)
  1. Abstract: the phrase 'transparent per-fragment failover recovery' would benefit from a brief parenthetical example of the specific hardware-level incompatibilities that trigger rerouting.
  2. Consider adding a short table in the implementation section contrasting DQR's descriptor fields and polling mechanism against prior circuit-cutting runtimes to clarify the novelty of the 'without parsing quantum code' property.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback, which highlights important areas for improving the rigor and clarity of our work. We address each major comment below and indicate the planned revisions.

read point-by-point responses
  1. Referee: Experiments section: the central performance claims (makespan improvements and 5% coordination overhead) are reported without details on number of trials, error bars, variance, statistical significance testing, or exact baseline construction. This limits independent verification of the reported gains over the monolithic CPU baseline.

    Authors: We agree that additional statistical details are needed for independent verification. In the revised manuscript, we will expand the Experiments section to specify that all reported results are averaged over 20 independent trials, include error bars representing one standard deviation, provide an exact description of the monolithic CPU baseline (full 32-qubit circuit execution on a single CPU core using Qiskit Aer without circuit cutting or parallelism), and add paired t-test results confirming statistical significance of the makespan improvements (p < 0.01). These additions draw from re-analysis of the existing experimental logs and will not change the reported trends. revision: yes

  2. Referee: Framework and implementation sections (e.g., description of fragment descriptor and DQR runtime): the claim that the backend-agnostic descriptor plus wave-based polling allows off-the-shelf HPC schedulers to manage fragments without parsing quantum code is not demonstrated. Experiments use the custom DQR runtime on Qmio for placement and failover; no evidence is provided that a standard scheduler such as Slurm can consume only the descriptor fields for decisions on placement, priority, or preemption.

    Authors: We acknowledge that the experiments focus on the DQR runtime for hybrid QPU-simulator integration rather than a direct demonstration with Slurm. The fragment descriptor is intentionally limited to backend-agnostic structural fields (qubit count, estimated depth, resource needs, and dependency metadata) that require no quantum code parsing, enabling consumption by standard schedulers. The wave-based coordinator further isolates scheduling decisions from execution details. To address the comment, we will add a clarifying subsection with a concrete example of mapping descriptor fields to Slurm job parameters for placement and preemption. This substantiates the design claim while noting that full Slurm prototype integration is left for future work. revision: partial

Circularity Check

0 steps flagged

No significant circularity: claims rest on implementation and experiments

full rationale

The paper introduces the DQR runtime with a fragment descriptor and wave-based polling for hybrid HPC-quantum workloads. All load-bearing claims are grounded in concrete implementation choices and measured outcomes (makespan improvements, per-fragment failover, 5% coordination overhead) on the CESGA Qmio system. No equations, fitted parameters, self-referential definitions, or self-citation chains appear that would reduce any result to its own inputs by construction. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work relies on standard domain assumptions from circuit cutting and HPC scheduling without introducing fitted parameters or new postulated entities.

axioms (1)
  • domain assumption Quantum circuits can be decomposed into independent executable fragments via circuit cutting techniques
    This is a foundational premise of all circuit-cutting approaches referenced in the abstract.

pith-pipeline@v0.9.0 · 5551 in / 1319 out tokens · 59968 ms · 2026-05-10T09:46:21.763343+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

31 extracted references · 25 canonical work pages · 1 internal anchor

  1. [1]

    J.Preskill,QuantumcomputingintheNISQeraandbeyond,Quantum 2 (2018) 79.doi:10.22331/q-2018-08-06-79

  2. [2]

    S. Wang, E. Fontana, M. Cerezo, K. Sharma, A. Sone, L. Cincio, P. J. Coles, Noise-induced barren plateaus in variational quantum algorithms,NatureCommunications12(1)(2021)6961.doi:10.1038/ s41467-021-27045-6

  3. [4]

    Berezutskii, M

    A. Berezutskii, M. Liu, A. Acharya, R. Ellerbrock, J. Gray, R. Haghshenas, Z. He, A. Khan, V. Kuzmin, D. Lyakh, D. Lykov, S. Mandrà, C. Mansell, A. Melnikov, A. Melnikov, V. Mironov, D. Morozov, F. Neukart, A. Nocera, M. A. Perlin, M. Perelshtein, M. Steinberg, R. Shaydulin, B. Villalonga, M. Pflitsch, M. Pistoia, V. Vinokur, Y. Alexeev, Tensor networks f...

  4. [5]

    K. R. Fratus, K. Bark, N. Vogt, J. Leppäkangas, S. Zanker, M. Marthaler, J.-M. Reiner, Describing Trotterized Time Evolutions on Noisy Quantum Computers via Static Effective Lindbladians, Quantum9(2025)1854,arXiv:2210.11371[quant-ph].doi:10.22331/ q-2025-09-11-1854

  5. [6]

    C.Piveteau,D.Sutter,Circuitknittingwithclassicalcommunication, IEEE Transactions on Information Theory 70 (4) (2024) 2734–2745, arXiv:2205.00016 [quant-ph].doi:10.1109/TIT.2023.3310797

  6. [7]

    W. Tang, T. Tomesh, M. Suchara, J. Larson, M. Martonosi, CutQC: using small Quantum computers for large Quantum circuit evalu- ations, in: Proceedings of the 26th ACM International Conference onArchitecturalSupportforProgrammingLanguagesandOperating Systems, ACM, Virtual USA, 2021, pp. 473–486.doi:10.1145/ 3445814.3446758

  7. [8]

    Constructing a virtual two-qubit gate by sampling single-qubit operations,

    K.Mitarai,K.Fujii,Constructingavirtualtwo-qubitgatebysampling single-qubitoperations,NewJournalofPhysics23(2)(2021)023021. doi:10.1088/1367-2630/abd7bc

  8. [9]

    T.Peng,A.W.Harrow,M.Ozols,X.Wu,SimulatingLargeQuantum Circuits on a Small Quantum Computer, Physical Review Letters 125 (15) (2020) 150504.doi:10.1103/PhysRevLett.125.150504

  9. [10]

    URLhttps://hdl.handle.net/2445/222573 R

    M.TejedorNinou,Towardsscalablequantumsimulation:Distributed circuit cutting for hybrid quantum-hpc systems (2025). URLhttps://hdl.handle.net/2445/222573 R. S. Raigada-García et al.:Preprint submitted to ElsevierPage 17 of 18 Wave-Based Dispatch for Circuit Cutting in Hybrid HPC–Quantum Systems

  10. [11]

    Elsharkawy, X.-T

    A. Elsharkawy, X.-T. M. To, P. Seitz, Y. Chen, Y. Stade, M. Geiger, Q. Huang, X. Guo, M. A. Ansari, C. B. Mendl, D. Kranzlmüller, M. Schulz, Integration of quantum accelerators with high perfor- mance computing – a review of quantum programming tools, arXiv preprint arXiv:2309.06167 (2023).doi:10.48550/arXiv.2309.06167

  11. [12]

    T. Beck, A. Baroni, R. Bennink, G. Buchs, E. A. Coello Perez, M. Eisenbach, R. Ferreira da Silva, M. Gopalakrishnan Meena, K. Gottiparthi, P. Groszkowski, T. S. Humble, R. Landfield, K. Ma- heshwari,S.Oral,M.A.Sandoval,A.Shehata,I.-S.Suh,C.Zimmer, Integrating quantum computing resources into scientific hpc ecosys- tems,arXivpreprintarXiv:2408.16159Related...

  12. [13]

    Shehata, T

    A. Shehata, T. Naughton, I.-S. Suh, High performance computing and quantum computing integration framework architecture and re- quirements document, Tech. Rep. ORNL/TM-2024/3388, Oak Ridge National Laboratory, Oak Ridge, TN, USA (2024)

  13. [14]

    A survey on integrating quantum computers into high performance computing systems,

    P. Döbler, M. S. Jattana, A survey on integrating quantum com- puters into high performance computing systems, arXiv preprint arXiv:2507.03540 (2025).doi:10.48550/arXiv.2507.03540

  14. [15]

    Tejedor, B

    M. Tejedor, B. Casas, J. Conejero, A. Cervera-Lierta, R. M. Badia, Orchestrating quantum-hpc workflows with distributed quantum cir- cuit cutting, in: Proceedings of the SC ’25 Workshops of the Inter- national Conference for High Performance Computing, Networking, StorageandAnalysis,SCWorkshops’25,AssociationforComputing Machinery, New York, NY, USA, 2025...

  15. [16]

    K. A. Britt, T. S. Humble, High-performance computing with quan- tum processing units, arXiv preprint arXiv:1511.04386 (2015).doi: 10.48550/arXiv.1511.04386

  16. [17]

    Kubernetes-orchestrated hybrid quantum-classical workflows

    M. Tejedor, M. Grossi, C. Tüysüz, R. Rocha, S. Vallecorsa, Kubernetes-orchestratedhybridquantum-classicalworkflows(2026). arXiv:2603.24206

  17. [18]

    S. Endo, Z. Cai, S. C. Benjamin, X. Yuan, Hybrid quantum-classical algorithms and quantum error mitigation, Journal of the Physical Society of Japan 90 (3) (2021) 032001.doi:10.7566/JPSJ.90.032001

  18. [19]

    Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks

    X. Ge, R.-B. Wu, H. Rabitz, The optimization landscape of hybrid quantum-classicalalgorithms:Fromquantumcontroltonisqapplica- tions, arXiv preprint arXiv:2201.07448 (2022).doi:10.48550/arXiv. 2201.07448

  19. [20]

    De Luca, A survey of nisq era hybrid quantum-classical machine learning research, Journal of Artificial Intelligence and Technology 2 (1) (2021) 9–15.doi:10.37965/jait.2021.12002

    G. De Luca, A survey of nisq era hybrid quantum-classical machine learning research, Journal of Artificial Intelligence and Technology 2 (1) (2021) 9–15.doi:10.37965/jait.2021.12002

  20. [21]

    Accelerating large-scale linear algebra using variational quantum imaginary time evolution,

    R. Rocco, S. Rizzo, M. Barbieri, G. Bettonte, E. Boella, F. Ganz, S. Iserte, A. J. Peña, P. Sandås, A. Scionti, O. Terzo, C. Ver- cellino, G. Vitali, P. Viviani, J. Frassineti, S. Marzella, D. Ottaviani, I. Colonnelli, D. Gregori, Dynamic Solutions for Hybrid Quantum- HPC Resource Allocation, in: 2025 IEEE International Conference on Quantum Computing and...

  21. [22]

    A. J. McCaskey, D. I. Lyakh, E. F. Dumitrescu, S. S. Powers, T. S. Humble, XACC: A system-level software infrastructure for heterogeneous quantum–classical computing, Quantum Science and Technology 5 (2) (2020) 024002.doi:10.1088/2058-9565/ab6bf6

  22. [23]

    T. M. Mintz, A. J. McCaskey, E. F. Dumitrescu, S. S. Powers, S. Moore, P. Lougovski, QCOR: A language extension specification fortheheterogeneousquantum-classicalmodelofcomputation,ACM Journal on Emerging Technologies in Computing Systems 16 (2) (2020) Article 24.doi:10.1145/3380964

  23. [24]

    Nguyen, A

    T. Nguyen, A. Santana, T. Kharazi, D. Claudino, H. Finkel, A. Mc- Caskey, Extending C++ for heterogeneous quantum-classical com- puting, ACM Transactions on Quantum Computing 3 (2) (2022) Article 10.doi:10.1145/3462670

  24. [25]

    25th IEEE Int

    P.Mantha,F.J.Kiwit,N.Saurabh,S.Jha,A.Luckow,Pilot-quantum: A middleware for quantum-HPC resource, workload and task man- agement, in: Proc. 25th IEEE Int. Symp. on Cluster, Cloud and Internet Computing (CCGrid), 2025, pp. 164–173.doi:10.1109/ CCGRID64434.2025.00070

  25. [26]

    Misa-akmc:achieve kinetic monte carlo simulation of 20 quadrillion atoms on gpu clusters,

    E. Giortamis, F. Romao, N. Tornow, D. Lugovoy, P. Bhatotia, Qon- ductor:Acloudorchestratorforquantumcomputing,in:Proc.SC’25, ACM, 2025, pp. 728–745.doi:10.1145/3712285.3759785

  26. [27]

    Viviani, et al., Demystifying HPC-Quantum integration: It’s all about scheduling, in: Proc

    P. Viviani, et al., Demystifying HPC-Quantum integration: It’s all about scheduling, in: Proc. Workshop on High Performance and QuantumComputingIntegration(HPQCI),SC’24,ACM,2024.doi: 10.1145/3659996.3673223

  27. [28]

    A. B. Yoo, M. A. Jette, M. Grondona, Slurm: Simple linux utility for resource management, in: D. Feitelson, L. Rudolph, U. Schwiegelshohn (Eds.), Job Scheduling Strategies for Parallel Processing,SpringerBerlinHeidelberg,Berlin,Heidelberg,2003,pp. 44–60

  28. [29]

    Y.Suzuki,Y.Kawase,Y.Masumura,Y.Hiraga,M.Nakadai,J.Chen, K. M. Nakanishi, K. Mitarai, R. Imai, S. Tamiya, T. Yamamoto, T. Yan, T. Kawakubo, Y. O. Nakagawa, Y. Ibe, Y. Zhang, H. Ya- mashita, H. Yoshimura, A. Hayashi, K. Fujii, Qulacs: a fast and versatile quantum circuit simulator for research purpose, Quantum 5 (2020) 559. URLhttps://api.semanticscholar.or...

  29. [30]

    Ibm quantum com- puters: evolution, performance, and future directions.The Journal of Supercomputing, 81(5), April 2025

    M. AbuGhanem, Ibm quantum computers: evolution, performance, and future directions, The Journal of Supercomputing 81 (5) (2025) 687.doi:10.1007/s11227-025-07047-7

  30. [31]

    Turner, J

    S. Iserte, M. Madon, G. Da Costa, J.-M. Pierson, A. J. Peña, MPI MalleabilityValidationunderReplayedReal-WorldHPCConditions, Future Generation Computer Systems (2025) 108305doi:10.1016/j. future.2025.108305

  31. [32]

    S.Iserte,I.Martín-Álvarez,K.Rojek,J.I.Aliaga,M.Castillo,W.Fol- warska, A. J. Peña, Resource optimization with MPI process mal- leability for dynamic workloads in HPC clusters, Future Generation ComputerSystems(2025)107949doi:10.1016/j.future.2025.107949. R. S. Raigada-García et al.:Preprint submitted to ElsevierPage 18 of 18