SparseDPD: A Sparse Neural Network-based Digital Predistortion FPGA Accelerator for RF Power Amplifier Linearization
Pith reviewed 2026-05-19 08:50 UTC · model grok-4.3
The pith
SparseDPD deploys a 74% sparse neural network on FPGA to linearize RF power amplifiers with -59.4 dBc ACPR at 241 mW.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By applying unstructured pruning to a phase-normalized time-delay neural network, SparseDPD produces a 74% sparse model with 64 parameters that, when accelerated on a Xilinx Zynq-7Z010 FPGA at 170 MHz, achieves ACPR of -59.4 dBc, EVM of -54.0 dBc, and NMSE of -48.2 dB while consuming 241 mW dynamic power.
What carries the argument
Spatially sparse phase-normalized time-delay neural network (PNTDNN) created by unstructured pruning, which reduces multiply-accumulate operations for FPGA deployment while keeping linearization performance.
If this is right
- NN-based DPD becomes feasible on low-cost FPGAs for real-time wireless transmitters.
- Dynamic power for predistortion drops to levels suitable for battery or edge devices.
- Unstructured pruning offers a general route to shrink other signal-processing neural networks for hardware.
- The 64-parameter footprint leaves headroom for additional baseband processing on the same chip.
Where Pith is reading between the lines
- The same pruning strategy could be tested on larger FPGAs or ASICs to support wider bandwidth signals.
- Public code release makes it straightforward to adapt the accelerator to different PA models or frequency bands.
- Further sparsity levels or mixed-precision arithmetic might push power below 200 mW while retaining performance.
Load-bearing premise
Unstructured pruning keeps the neural network's linearization accuracy intact once the sparse model is placed on the target FPGA hardware.
What would settle it
Measure ACPR, EVM, and NMSE on the actual Xilinx Zynq-7Z010 FPGA running the SparseDPD bitstream and check whether the values reach or exceed the reported -59.4 dBc, -54.0 dBc, and -48.2 dB.
Figures
read the original abstract
Digital predistortion (DPD) is crucial for linearizing radio frequency (RF) power amplifiers (PAs), improving signal integrity and efficiency in wireless systems. Neural network (NN)-based DPD methods surpass traditional polynomial models but face computational challenges limiting their practical deployment. This paper introduces SparseDPD, an FPGA accelerator employing a spatially sparse phase-normalized time-delay neural network (PNTDNN), optimized through unstructured pruning to reduce computational load without accuracy loss. Implemented on a Xilinx Zynq-7Z010 FPGA, SparseDPD operates at 170 MHz, achieving exceptional linearization performance (ACPR: -59.4 dBc, EVM: -54.0 dBc, NMSE: -48.2 dB) with only 241 mW dynamic power, using 64 parameters with 74% sparsity. This work demonstrates FPGA-based acceleration, making NN-based DPD practical and efficient for real-time wireless communication applications. Code is publicly available at https://github.com/MannoVersluis/SparseDPD.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces SparseDPD, an FPGA accelerator for RF power amplifier linearization that employs unstructured pruning on a phase-normalized time-delay neural network (PNTDNN) to achieve 74% sparsity with 64 parameters. Implemented on a Xilinx Zynq-7Z010 at 170 MHz, it reports ACPR of -59.4 dBc, EVM of -54.0 dBc, NMSE of -48.2 dB, and 241 mW dynamic power, claiming no accuracy loss relative to the dense model while making the code publicly available.
Significance. If the no-accuracy-loss claim is substantiated, the work would demonstrate a practical route to deploying NN-based DPD on low-cost FPGAs, improving efficiency for real-time wireless applications. The public code release and measured hardware results (rather than purely simulated) strengthen the contribution to the field of hardware-accelerated signal processing.
major comments (1)
- [Results] Results section (performance metrics paragraph): The headline figures (ACPR -59.4 dBc, EVM -54.0 dBc, NMSE -48.2 dB) are reported exclusively for the 74%-sparse PNTDNN. No table or figure directly compares these metrics for the dense versus pruned model after FPGA synthesis, quantization, and routing under identical test waveforms and PA conditions; this comparison is required to substantiate the central claim that unstructured pruning incurs no accuracy loss on the target hardware.
minor comments (2)
- [Abstract] Abstract and §3: The measurement setup (test waveform, PA model, training dataset size, and verification procedure for the no-accuracy-loss claim) is not summarized; adding a concise description would improve reproducibility.
- [Method] Implementation details: Clarify how the 64-parameter count and 74% sparsity are obtained after pruning (e.g., exact pruning criterion and retraining protocol) to allow readers to assess the method's generality.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback on our manuscript. We appreciate the emphasis on substantiating the no-accuracy-loss claim with hardware-level comparisons and will revise the paper to address this.
read point-by-point responses
-
Referee: The headline figures (ACPR -59.4 dBc, EVM -54.0 dBc, NMSE -48.2 dB) are reported exclusively for the 74%-sparse PNTDNN. No table or figure directly compares these metrics for the dense versus pruned model after FPGA synthesis, quantization, and routing under identical test waveforms and PA conditions; this comparison is required to substantiate the central claim that unstructured pruning incurs no accuracy loss on the target hardware.
Authors: We concur that a side-by-side comparison of the metrics for the dense and pruned models post-FPGA synthesis, quantization, and routing is essential to robustly support our claim. The manuscript includes software-based evaluations (in the performance evaluation subsection) demonstrating that the 74% sparse PNTDNN achieves nearly identical ACPR, EVM, and NMSE to the dense model under the same test waveforms and PA conditions. Due to the dense model's significantly higher computational complexity, it could not be synthesized and routed on the resource-constrained Xilinx Zynq-7Z010 without exceeding available hardware resources. The sparse implementation is specifically designed to enable deployment on such low-cost FPGAs. In the revised version, we will include an additional table in the Results section that tabulates the software metrics for both models and explicitly discusses the hardware implementation feasibility, thereby clarifying the context of our no-accuracy-loss statement. revision: yes
Circularity Check
No circularity: performance claims rest on hardware measurements, not on equations or fits that reduce to inputs by construction
full rationale
The paper presents an FPGA implementation of a pruned PNTDNN for DPD, with reported metrics (ACPR, EVM, NMSE, power) obtained from physical synthesis and benchmarking on the Zynq-7Z010 at 170 MHz. No derivation chain, first-principles prediction, or fitted parameter is invoked whose output is mathematically forced to equal the input data or a prior self-citation. The central results are empirical measurements under stated test conditions rather than algebraic identities or renamed fits. Minor self-citations to prior NN-DPD work exist but are not load-bearing for the hardware claims, which remain independently verifiable via the public code and FPGA platform.
Axiom & Free-Parameter Ledger
free parameters (1)
- sparsity level
axioms (1)
- domain assumption PNTDNN is a suitable neural architecture for RF power amplifier linearization
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
SparseDPD accelerator employing a spatially sparse phase-normalized time-delay neural network (PNTDNN), optimized through unstructured pruning... 64 parameters with 74% sparsity... 170 MHz... ACPR -59.4 dBc
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
feature extraction... phase normalization... fully connected layers... phase denormalization
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Energy efficient extreme mimo: Design goals and directions,
S. Wesemann, J. Du, and H. Viswanathan, “Energy efficient extreme mimo: Design goals and directions,” IEEE Communications Magazine , vol. 61, no. 10, pp. 132–138, 2023
work page 2023
-
[2]
A generalized memory polynomial model for digital predistortion of rf power ampli- fiers,
D. Morgan, Z. Ma, J. Kim, M. Zierdt, and J. Pastalan, “A generalized memory polynomial model for digital predistortion of rf power ampli- fiers,” IEEE Transactions on Signal Processing , vol. 54, no. 10, pp. 3852–3860, 2006
work page 2006
-
[3]
D. Wang, M. Aziz, M. Helaoui, and F. M. Ghannouchi, “Augmented real-valued time-delay neural network for compensation of distortions and impairments in wireless transmitters,” IEEE Transactions on Neural Networks and Learning Systems , vol. 30, no. 1, pp. 242–254, Jun 2019
work page 2019
-
[4]
X. Chen, Z. Lu, S. Zhang, S. Xu, and Y . Wang, “An inter-modulation oriented learning based digital pre-distortion technique via joint interme- diate and radio frequency optimization,” IEEE Transactions on Wireless Communications, vol. 22, no. 6, pp. 3777–3789, 2023
work page 2023
-
[5]
Phase- normalized neural network for linearization of rf power amplifiers,
A. Fischer-B ¨uhner, L. Anttila, M. D. Gomony, and M. Valkama, “Phase- normalized neural network for linearization of rf power amplifiers,” IEEE Microwave and Wireless Technology Letters , vol. 33, no. 9, pp. 1357–1360, 2023
work page 2023
-
[6]
Learning both weights and connections for efficient neural networks,
S. Han, J. Pool, J. Tran, and W. J. Dally, “Learning both weights and connections for efficient neural networks,” in Proceedings of the 29th International Conference on Neural Information Processing Systems - V olume 1, ser. NIPS’15. Cambridge, MA, USA: MIT Press, 2015, p. 1135–1143
work page 2015
-
[7]
Parhami, Computer arithmetic: Algorithms and hardware designs
B. Parhami, Computer arithmetic: Algorithms and hardware designs . Oxford University Press, 2000
work page 2000
-
[8]
Y . Wu, A. Li, M. Beikmirza, G. D. Singh, Q. Chen, L. C. N. de Vreede, M. Alavi, and C. Gao, “MP-DPD: Low-complexity mixed-precision neural networks for energy-efficient digital predistortion of wideband power amplifiers,” IEEE Microwave and Wireless Technology Letters , vol. 34, no. 6, pp. 817–820, 2024
work page 2024
-
[9]
A sparse neural network-based power adaptive dpd design and its hardware implementation,
M. Tanio, N. Ishii, and N. Kamiya, “A sparse neural network-based power adaptive dpd design and its hardware implementation,” IEEE Access, vol. 10, pp. 114 673–114 682, 2022
work page 2022
-
[10]
T. Cappello, G. Jindal, J. Nunez-Yanez, and K. Morris, “Power con- sumption and linearization performance of a bit- and frequency-scalable am/am am/pm pre-distortion on fpga,” in 2022 International Workshop on Integrated Nonlinear Microwave and Millimetre-Wave Circuits (IN- MMiC), 2022, pp. 1–3
work page 2022
-
[11]
Y . Li, X. Wang, and A. Zhu, “Reducing power consumption of digital predistortion for rf power amplifiers using real-time model switching,” IEEE Transactions on Microwave Theory and Techniques , vol. 70, no. 3, pp. 1500–1508, 2022
work page 2022
-
[12]
H. Huang, J. Xia, and S. Boumaiza, “Novel parallel-processing-based hardware implementation of baseband digital predistorters for linearizing wideband 5g transmitters,”IEEE Transactions on Microwave Theory and Techniques, vol. 68, no. 9, pp. 4066–4076, 2020
work page 2020
-
[13]
W. Li, R. Criado, W. Thompson, C. G. K. Montoro, and P. L. Gilabert, “Gpu-based implementation of pruned artificial neural networks for dig- ital predistortion linearization of wideband power amplfiers,” TechRxiv, 2024
work page 2024
-
[14]
A. Li, H. Wu, Y . Wu, Q. Chen, L. C. N. de Vreede, and C. Gao, “DPD- NeuralEngine: A 22-nm 6.6-tops/w/mm 2 recurrent neural network accelerator for wideband power amplifier digital pre-distortion,” 2025. [Online]. Available: https://arxiv.org/abs/2410.11766
-
[15]
Y . Wu, G. D. Singh, M. Beikmirza, L. C. N. de Vreede, M. Alavi, and C. Gao, “OpenDPD: An open-source end-to-end learning & benchmarking framework for wideband power amplifier modeling and digital pre-distortion,” in 2024 IEEE International Symposium on Circuits and Systems (ISCAS) . IEEE, May 2024, p. 1–5. [Online]. Available: http://dx.doi.org/10.1109/I...
-
[16]
Energy-efficient activity- driven computing architectures for edge intelligence,
S.-C. Liu, C. Gao, K. Kim, and T. Delbruck, “Energy-efficient activity- driven computing architectures for edge intelligence,” in 2022 Interna- tional Electron Devices Meeting (IEDM) , 2022, pp. 21.2.1–21.2.4
work page 2022
-
[17]
Y . Wu, Y . Zhu, K. Qian, Q. Chen, A. Zhu, J. Gajadharsing, L. C. N. de Vreede, and C. Gao, “Deltadpd: Exploiting dynamic temporal sparsity in recurrent neural networks for energy-efficient wideband digital pre- distortion,” IEEE Microwave and Wireless Technology Letters , vol. 35, no. 6, pp. 772–775, 2025. 5
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.