pith. sign in

Communication Offloading on SmartNIC DPUs: A Quantitative Approach

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

SmartNIC Data Processing Units (DPUs) offer a promising solution for saving high-end CPU resources by offloading tasks to programmable cores near the network interface. In this work, we explore the feasibility of SmartNIC DPUs in supporting an asynchronous communication model called "fire-and-forget", particularly its core message routing service. We design a communication offloading engine called Buddy that decouples communication tasks from the application process. Buddy runs flexibly on SmartNIC DPUs such as the Nvidia BlueField-3 DPU and generic x86 CPUs. Our evaluation results in five applications identify the memory-to-communication ratio as a key predictor of the offloading performance. Host-dominated workloads, such as Quicksilver and Sparse Matrix Transpose, achieved up to 1.55x speedup with communication offloaded to the DPU. We further identify a 625x increase in DRAM traffic due to the absence of Direct Cache Access support on the DPU, highlighting a critical need in future SmartNIC designs.

citation-role summary

background 1

citation-polarity summary

fields

cs.ET 1

years

2026 1

verdicts

UNVERDICTED 1

roles

background 1

polarities

background 1

representative citing papers

Post-Moore Technologies for Plasma Simulation: A Community Roadmap

cs.ET · 2026-05-08 · unverdicted · novelty 4.0

No single post-Moore technology replaces current HPC for plasma simulations, but FPGA-class accelerators offer near-term kernel offload, non-von Neumann architectures medium-term operator acceleration, and quantum computing long-term potential for warm dense matter microphysics.

citing papers explorer

Showing 1 of 1 citing paper.

  • Post-Moore Technologies for Plasma Simulation: A Community Roadmap cs.ET · 2026-05-08 · unverdicted · none · ref 107 · internal anchor

    No single post-Moore technology replaces current HPC for plasma simulations, but FPGA-class accelerators offer near-term kernel offload, non-von Neumann architectures medium-term operator acceleration, and quantum computing long-term potential for warm dense matter microphysics.