pith. sign in

arxiv: 2604.10093 · v1 · submitted 2026-04-11 · 💻 cs.AR

Late Breaking Results: CHESSY: Coupled Hybrid Emulation with SystemC-FPGA Synchronization

Pith reviewed 2026-05-10 16:03 UTC · model grok-4.3

classification 💻 cs.AR
keywords hybrid emulationSystemCFPGA co-emulationvirtual platformsRISC-Vopen-sourcecyber-physical systemsJTAG synchronization
0
0 comments X

The pith

An open-source framework synchronizes SystemC virtual platforms with FPGA hardware to enable accurate full-system co-emulation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents CHESSY, an open-source framework that couples SystemC-based virtual platforms with FPGA emulation for hybrid co-simulation of cyber-physical systems. The FPGA runs the core digital subsystems for speed and timing accuracy, while the virtual platform handles peripherals and non-digital elements. A wrapper manages synchronization over JTAG. On a RISC-V SoC example from biosignals processing, the setup delivers up to 2500 times faster execution than RTL simulation yet keeps overall runtime below twice that of pure FPGA emulation.

Core claim

CHESSY connects SystemC virtual platforms to FPGA emulation through a JTAG-based wrapper that coordinates timing and data exchange. This lets designers accelerate the main digital components on hardware while retaining flexible simulation of the rest of the system, producing up to 2500x speedup over RTL simulation and less than 2x the runtime of standalone FPGA emulation on the evaluated RISC-V SoC.

What carries the argument

The JTAG-based wrapper that coordinates timing and communication between the FPGA emulation and the SystemC virtual platform.

If this is right

  • Designers gain an open, vendor-independent path to hybrid emulation instead of relying on proprietary tools.
  • Full-system validation of cyber-physical designs can combine hardware-accurate digital timing with simulated non-digital components.
  • RISC-V based SoCs become practical to evaluate at scale with both speed and accuracy for embedded applications.
  • The same synchronization approach could support additional peripheral models without rewriting the FPGA design.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Alternative communication links such as PCIe could be tested to lower synchronization cost further.
  • The framework might extend to other simulation languages or mixed-signal domains not covered in the RISC-V example.
  • Larger IoT or automotive systems could use the same pattern to balance accuracy and simulation speed.

Load-bearing premise

The JTAG wrapper can keep the FPGA and SystemC sides synchronized without adding enough delay or error to erase the reported speed gains in practical full-system runs.

What would settle it

Measure wall-clock time for the RISC-V SoC biosignals workload under CHESSY, pure RTL simulation, and pure FPGA emulation to check whether the 2500x speedup versus RTL and the sub-2x overhead versus pure FPGA actually appear.

Figures

Figures reproduced from arXiv: 2604.10093 by Alessio Burrello, Daniele Jahier Pagliari, Enrico Macii, Giovanni Pollo, Lorenzo Ruotolo, Massimo Poncino, Matteo Risso, Mohamed Amine Hamdi, Sara Vinco, Yukai Chen.

Figure 1
Figure 1. Figure 1: Overview of CHESSY. The background shades in the pseudocode [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Simulation time overhead [%] for different transfer sizes and interaction [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗
read the original abstract

The growing complexity of cyber-physical systems (CPSs) calls for early prototyping tools that combine accuracy, speed, and usability. Virtual Platforms (VPs) provide fast functional simulation, but hybrid co-emulation solutions, in which key digital components are deployed on FPGA, become necessary when accurate timing modelling is required and RTL simulation is too costly. However, existing hybrid emulation tools are mostly proprietary, and rely on vendor-specific FPGA features. To address this gap, we introduce an open-source framework that connects SystemC-based VPs with FPGA emulation, enabling full-system co-emulation of digital and non-digital components. The FPGA accelerates the execution of main digital subsystems, while a wrapper coordinates timing and communication with the VP through JTAG, maintaining synchronization with simulated peripherals. Evaluations using a RISC-V SoC, with an example in the biosignals processing domain, show up to 2500x speedup compared to RTL simulation, while maintaining less than 2x total simulation time relative to pure FPGA emulation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper introduces CHESSY, an open-source framework for coupled hybrid emulation that connects SystemC-based virtual platforms with FPGA emulation through a JTAG wrapper for synchronization. It enables full-system co-emulation of digital and non-digital components in cyber-physical systems. Evaluations on a RISC-V SoC for biosignal processing demonstrate up to 2500x speedup over RTL simulation and less than 2x the simulation time of pure FPGA emulation.

Significance. If the synchronization and performance claims hold under detailed scrutiny, this open-source framework could provide a practical tool for early prototyping of complex cyber-physical systems, combining FPGA speed for digital cores with SystemC flexibility for peripherals and addressing the limitations of proprietary or slow RTL-only approaches. The open-source release is a clear strength for reproducibility and adoption.

major comments (2)
  1. [Abstract] Abstract: The headline performance claims (up to 2500x speedup vs. RTL simulation and <2x total time vs. pure FPGA emulation) are stated without any description of the experimental setup, workloads, RISC-V SoC configuration details, measurement methodology, baselines, error bars, or variability. This absence prevents verification of the results.
  2. [Framework description] JTAG wrapper and synchronization: The wrapper is said to 'coordinate timing and communication' and 'maintain synchronization,' but no quantitative bounds are given on round-trip JTAG latency, maximum sustainable event rate, or measured timing error versus a pure-FPGA baseline. These data are load-bearing for the <2x overhead claim, especially under full-system peripheral loads in biosignal processing.
minor comments (1)
  1. [Title] The manuscript is presented as 'Late Breaking Results,' which may suit a conference format but leaves the evaluation section unusually brief for a journal submission; expanding the experimental reporting would strengthen it.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive review. The comments highlight opportunities to strengthen the clarity and verifiability of our results. We address each major comment below and will incorporate revisions in the next version of the manuscript.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The headline performance claims (up to 2500x speedup vs. RTL simulation and <2x total time vs. pure FPGA emulation) are stated without any description of the experimental setup, workloads, RISC-V SoC configuration details, measurement methodology, baselines, error bars, or variability. This absence prevents verification of the results.

    Authors: We agree that the abstract, due to its brevity, does not include the requested experimental details. The full manuscript provides these in Section IV (Evaluation), including the RISC-V SoC configuration (RV32IMAC core with peripherals), the biosignal processing workload, cycle-accurate measurement methodology on the FPGA, baselines (RTL simulation via Verilator and pure FPGA emulation), and reported speedups. To improve standalone readability of the abstract, we will revise it to include a concise sentence summarizing the key setup parameters, workload, and measurement approach while respecting length constraints. We will also ensure error bars and variability data from repeated runs are explicitly reported in the evaluation section if not already detailed. revision: yes

  2. Referee: [Framework description] JTAG wrapper and synchronization: The wrapper is said to 'coordinate timing and communication' and 'maintain synchronization,' but no quantitative bounds are given on round-trip JTAG latency, maximum sustainable event rate, or measured timing error versus a pure-FPGA baseline. These data are load-bearing for the <2x overhead claim, especially under full-system peripheral loads in biosignal processing.

    Authors: The JTAG wrapper implementation and synchronization protocol are detailed in Section III, including the use of a custom protocol over the JTAG interface for event-driven communication. We acknowledge that explicit quantitative characterization of round-trip latency, sustainable event rate, and timing error relative to pure FPGA execution would provide stronger support for the overhead claims. We will add these measurements in the revised manuscript, obtained from targeted micro-benchmarks under varying peripheral loads representative of the biosignal processing application, to directly substantiate the <2x total time claim. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical tool paper with measured timings only

full rationale

The paper describes an open-source hybrid SystemC-FPGA emulation framework and reports direct experimental results (up to 2500x speedup vs RTL, <2x overhead vs pure FPGA) from evaluations on a RISC-V SoC. No mathematical derivations, equations, parameter fitting, predictions, or first-principles claims exist that could reduce to inputs by construction. Synchronization is implemented via a JTAG wrapper whose behavior is described functionally; performance numbers are measured outcomes, not self-referential or fitted quantities presented as predictions. No self-citation chains or uniqueness theorems are invoked. This is a standard engineering tool paper whose central claims rest on external benchmarks and measurements, not internal circular logic.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The paper is an engineering tool contribution with no explicit mathematical axioms, free parameters, or invented physical entities; the framework itself is the main addition.

pith-pipeline@v0.9.0 · 5513 in / 1163 out tokens · 64107 ms · 2026-05-10T16:03:44.131512+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

24 extracted references · 24 canonical work pages

  1. [1]

    A survey of cyber-physical systems,

    J. Shiet al., “A survey of cyber-physical systems,” in2011 International Conference on Wireless Communications and Signal Processing (WCSP), 2011, pp. 1–6

  2. [2]

    Virtual platforms: Breaking new grounds,

    R. Leuperset al., “Virtual platforms: Breaking new grounds,” in2012 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2012, pp. 685–690

  3. [3]

    QEMU - a generic and open source machine emulator and virtualizer,

    QEMU, “QEMU - a generic and open source machine emulator and virtualizer,” https://www.qemu.org/

  4. [4]

    QEMU support for RISC-V: Current state and future releases,

    P. Adeltet al., “QEMU support for RISC-V: Current state and future releases,”International Workshop on RISC-V Research Activities, 2019

  5. [5]

    Develop your iot product with renode,

    antmicro, “Develop your iot product with renode,” https://renode.io/

  6. [6]

    Embedded system simulation using Renode,

    F. Speiseret al., “Embedded system simulation using Renode,”Engineer- ing Proceedings, vol. 79, no. 1, 2024

  7. [7]

    Ieee standard for standard systemc® language reference manual,

    “Ieee standard for standard systemc® language reference manual,”IEEE Std 1666-2023 (Revision of IEEE Std 1666-2011), pp. 1–618, 2023

  8. [8]

    SystemC analog/mixed-signal extensions,

    Accellera Systems Initiative, “SystemC analog/mixed-signal extensions,” https://systemc.org/overview/systemc-ams/, 2024

  9. [9]

    Fpga design automation: A survey,

    D. Chenet al., “Fpga design automation: A survey,”Foundations and Trends® in Electronic Design Automation, vol. 1, no. 3, pp. 195–330,

  10. [10]

    Available: http://dx.doi.org/10.1561/1000000003

    [Online]. Available: http://dx.doi.org/10.1561/1000000003

  11. [11]

    Sce-mi, standard co-emulation modeling interface,

    Accelera, “Sce-mi, standard co-emulation modeling interface,” https://www.accellera.org/downloads/standards/sce-mi

  12. [12]

    An hla-based formal co-simulation approach for rapid prototyping of heterogeneous mixed-signal socs,

    M. G. Seoket al., “An hla-based formal co-simulation approach for rapid prototyping of heterogeneous mixed-signal socs,”IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. 100, no. 7, pp. 1374–1383, 2017

  13. [13]

    Hw/sw co-verification environment for hybrid systems using qemu,

    Aldec, “Hw/sw co-verification environment for hybrid systems using qemu,” https://www.aldec.com/en/company/blog/176–hwsw-co- verification-environment-for-hybrid-systems-using-qemu

  14. [14]

    Systemc cosimulation and emulation of multiprocessor soc designs,

    L. Beniniet al., “Systemc cosimulation and emulation of multiprocessor soc designs,”Computer, vol. 36, no. 4, pp. 53–59, 2003

  15. [15]

    Sythil: A system level hardware-in-the-loop framework for fpga, systemc and qemu-based virtual platforms

    L. Jungeret al., “Sythil: A system level hardware-in-the-loop framework for fpga, systemc and qemu-based virtual platforms.”

  16. [16]

    Fpga-accelerated simulation technologies (fast): Fast, full-system, cycle-accurate simulators,

    D. Chiouet al., “Fpga-accelerated simulation technologies (fast): Fast, full-system, cycle-accurate simulators,” in40th Annual IEEE/ACM In- ternational Symposium on Microarchitecture (MICRO 2007), 2007, pp. 249–261

  17. [17]

    Matlab/simulink based hardware/software co-simulation for designing using fpga configured soft processors,

    J. Ou and V . K. Prasanna, “Matlab/simulink based hardware/software co-simulation for designing using fpga configured soft processors,” in 19th IEEE International Parallel and Distributed Processing Symposium. IEEE, 2005, pp. 8–pp

  18. [18]

    Chimera: A co-simulation framework combining with gem5 and fpga platform for efficient verification,

    C. Fuet al., “Chimera: A co-simulation framework combining with gem5 and fpga platform for efficient verification,” in2024 34th International Conference on Field-Programmable Logic and Applications (FPL), 2024, pp. 133–139

  19. [19]

    Firesim: Fpga-accelerated cycle-exact scale-out system simulation in the public cloud,

    S. Karandikaret al., “Firesim: Fpga-accelerated cycle-exact scale-out system simulation in the public cloud,” in2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA), 2018, pp. 29–42

  20. [20]

    Emunoc: Hybrid emulation for fast and flexible network- on-chip prototyping on fpgas,

    Y . Y . Tanet al., “Emunoc: Hybrid emulation for fast and flexible network- on-chip prototyping on fpgas,” in2022 32nd International Conference on Field-Programmable Logic and Applications (FPL). IEEE, 2022, pp. 334–341

  21. [21]

    Integrating SystemC-AMS power modeling with a RISC-V ISS for virtual prototyping of battery-operated embedded devices,

    M. A. Hamdiet al., “Integrating SystemC-AMS power modeling with a RISC-V ISS for virtual prototyping of battery-operated embedded devices,” inProc. of ACM International Conference on Computing Frontiers: Workshops and Special Sessions, 2024, p. 51–54

  22. [22]

    Astral, a space computing platform built around cheshire,

    PULP, “Astral, a space computing platform built around cheshire,” https://github.com/pulp-platform/astral

  23. [23]

    Energy efficient parallel computing on the PULP platform with support for openmp,

    D. Rossiet al., “Energy efficient parallel computing on the PULP platform with support for openmp,” inProc. of Convention of Electrical & Electronics Engineers in Israel (IEEEI), 2014

  24. [24]

    Robust real-time embedded emg recognition frame- work using temporal convolutional networks on a multicore iot proces- sor,

    M. Zanghieriet al., “Robust real-time embedded emg recognition frame- work using temporal convolutional networks on a multicore iot proces- sor,”IEEE Transactions on Biomedical Circuits and Systems, vol. 14, no. 2, pp. 244–256, 2020