Learning Energy-Efficient Modular Arrays under Hardware Non-linearities
Pith reviewed 2026-05-22 04:20 UTC · model grok-4.3
The pith
Unsupervised DNN jointly optimizes power allocation and modular activation to raise energy efficiency in sparse arrays under amplifier non-linearities.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The unsupervised DNN learns the non-linear mapping between singular-value and geometric channel features and the optimal energy-efficiency operating point, jointly outputting distortion-aware power allocation, total transmit power scaling, and modular sub-array activation; numerical results show that the resulting arrays achieve significant energy-efficiency gains over conventional sparse arrays.
What carries the argument
Unsupervised deep neural network that maps singular-value and geometric channel features to the joint choice of power allocation, total power scaling, and modular sub-array activation while maximizing energy efficiency under non-linear distortions.
If this is right
- The closed-form spectral-efficiency expression derived from Bussgang decomposition makes non-linear distortion effects tractable inside the energy-efficiency objective.
- The DNN replaces repeated solution of a combinatorial non-convex problem with a single forward pass once trained.
- Modular sub-array activation combined with per-antenna power scaling improves energy efficiency compared with fixed sparse layouts.
- The approach scales to array sizes where exhaustive search or convex relaxations become impractical.
Where Pith is reading between the lines
- The same unsupervised learning structure could be applied to joint optimization problems involving other hardware impairments such as phase noise or quantization.
- Adding more channel statistics or array geometry parameters as input features might tighten the learned mapping without increasing inference cost.
- The framework points toward online retraining or transfer learning when array size or amplifier characteristics change after deployment.
Load-bearing premise
The Bussgang decomposition supplies an accurate analytical expression for achievable spectral efficiency under power amplifier non-linearities, and singular-value plus geometric channel features are sufficient for the unsupervised DNN to learn the optimal joint operating point.
What would settle it
Running exhaustive or high-accuracy numerical search for the true energy-efficiency optimum on a held-out set of channel realizations and comparing those values to the energy efficiency produced by the DNN outputs would reveal whether the network reaches or falls short of the optimal operating points.
Figures
read the original abstract
This paper investigates the joint optimization of power allocation and antenna activation in sparse extremely large aperture array systems operating under power amplifier non-linearities. We first derive an analytical expression for the achievable spectral efficiency (SE) of point-to-point MIMO channels affected by non-linear distortions using the Bussgang decomposition. To address the combinatorial and non-convex nature of the energy-efficiency (EE) maximization problem, we employ an unsupervised deep neural network (DNN) that learns the non-linear mapping between the channel state information and the optimal EE operating point. The DNN jointly predicts distortion-aware power allocation, total transmit power scaling, and modular sub-array activation based on singular-value and geometric channel features. Numerical results demonstrate that the proposed DNN-based arrays achieve significant EE gains over the conventional sparse arrays.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper investigates joint optimization of power allocation and modular sub-array activation for energy efficiency (EE) in sparse extremely large aperture array (ELAA) MIMO systems impaired by power amplifier non-linearities. It first derives a closed-form achievable spectral efficiency (SE) expression via Bussgang decomposition for point-to-point MIMO channels, then trains an unsupervised DNN that maps singular-value and geometric channel features to distortion-aware power allocation, total power scaling, and binary activation decisions. Numerical results claim that the resulting DNN-based designs achieve significant EE gains relative to conventional sparse arrays.
Significance. If the Bussgang-derived SE remains an accurate surrogate once activations and scalings are jointly optimized, the unsupervised DNN approach offers a scalable alternative to combinatorial EE maximization in hardware-impaired massive MIMO. The explicit use of an analytical objective inside the unsupervised loss is a methodological strength that could enable reproducible, low-complexity inference at deployment. However, the overall significance hinges on whether the reported EE gains survive verification against the true (non-Bussgang) rate; without that check the numerical claims rest on an unconfirmed modeling assumption.
major comments (1)
- [SE derivation and numerical results] The headline numerical claim (significant EE gains) is obtained by training the DNN to maximize the Bussgang-based SE expression. Bussgang requires the effective transmit vector to be circularly symmetric complex Gaussian with distortion uncorrelated to the input; both conditions become approximate when the optimizer is free to activate or deactivate entire modules and to apply per-module power scaling. The manuscript should therefore include, in the numerical-results section, a direct comparison of the analytical SE against Monte-Carlo-estimated mutual information for the final optimized activation patterns; absence of this check leaves the central EE improvement claim only weakly supported.
minor comments (3)
- [Abstract and numerical results] The abstract and numerical section report 'significant EE gains' without stating the quantitative improvement (e.g., percentage or dB) or the precise baseline configurations (e.g., which conventional sparse arrays and power-allocation schemes are used).
- [DNN design and training] No description is given of the DNN architecture (layer sizes, activation functions), the precise unsupervised loss implementation, training hyperparameters, or the size and diversity of the channel realizations used for training and testing.
- [Numerical results] The paper does not report error bars, number of Monte-Carlo trials, or statistical significance tests on the EE curves, which is needed to substantiate the reliability of the plotted gains.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and for identifying a key point that can strengthen the validation of our numerical results. We address the major comment below.
read point-by-point responses
-
Referee: [SE derivation and numerical results] The headline numerical claim (significant EE gains) is obtained by training the DNN to maximize the Bussgang-based SE expression. Bussgang requires the effective transmit vector to be circularly symmetric complex Gaussian with distortion uncorrelated to the input; both conditions become approximate when the optimizer is free to activate or deactivate entire modules and to apply per-module power scaling. The manuscript should therefore include, in the numerical-results section, a direct comparison of the analytical SE against Monte-Carlo-estimated mutual information for the final optimized activation patterns; absence of this check leaves the central EE improvement claim only weakly supported.
Authors: We agree that confirming the accuracy of the Bussgang-based SE expression for the jointly optimized activation patterns and power scalings is essential to support the reported EE gains. Although the derivation applies the Bussgang decomposition to the effective (post-activation and scaled) transmit vector and the unsupervised loss directly maximizes this closed-form expression, the circularly symmetric Gaussian assumption becomes approximate under binary sub-array decisions. In the revised manuscript we will add, in the numerical-results section, a direct comparison of the analytical SE against Monte-Carlo estimates of mutual information evaluated on the final optimized activation patterns and power allocations. This verification will quantify any gap and thereby strengthen the central claims. revision: yes
Circularity Check
No significant circularity; SE derivation and DNN optimization remain independent
full rationale
The paper first derives a closed-form SE expression via Bussgang decomposition applied to the point-to-point MIMO channel under PA non-linearities. This analytical expression is then employed as the objective function inside the unsupervised DNN loss, which maps singular-value and geometric channel features to joint power allocation, total power scaling, and sub-array activation decisions. No equation or step reduces the final EE result to a fitted parameter by construction, nor does any load-bearing claim rest on a self-citation chain whose validity is internal to the present work. The Bussgang-based SE is a standard external tool whose assumptions are stated separately from the DNN training procedure, making the overall derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Bussgang decomposition accurately models the effect of power amplifier non-linearities on the transmitted signal in point-to-point MIMO channels.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Using Bussgang decomposition, we can write the received signal as y = (1 + 2ρ)Hx + Hη + n ... SE = log2(det(IM + (1 + 2ρ)² C_n^{-1} H Q H^H))
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The DNN jointly predicts distortion-aware power allocation, total transmit power scaling, and modular sub-array activation
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Near-field MIMO communications for 6G: Fundamentals, challenges, potenti als, and future directions,
M. Cui, Z. Wu, Y . Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potenti als, and future directions,” IEEE Commun. Mag. , vol. 61, no. 1, pp. 40–46, 2023
work page 2023
-
[2]
Ex ploiting the depth and angular domains for massive near-field spatial mul tiplexing,
P . Ramezani, A. Kosasih, A. Irshad, and E. Bj¨ ornson, “Ex ploiting the depth and angular domains for massive near-field spatial mul tiplexing,” IEEE BITS the Inf. Theory Mag. , pp. 1–12, Oct. 2023
work page 2023
-
[3]
S. Y e, M. Xiao, M.-W. Kwan, Z. Ma, Y . Huang, G. Karagiannid is, and P . Fan, “Extremely large aperture array (ELAA) communic ations: Foundations, research advances and challenges,” IEEE Open J. of the Commun. Soc. , vol. 5, pp. 7075–7120, 2024
work page 2024
-
[4]
C. Zhou, C. Y ou, H. Zhang, L. Chen, and S. Shi, “Sparse arra y enabled near-field communications: Beam pattern analysis a nd hybrid beamforming design,” IEEE Trans. on Wireless Commun. , Early Access 2025
work page 2025
-
[5]
Can sparse arrays outperform colloc ated arrays for future wireless communications?
H. Wang and Y . Zeng, “Can sparse arrays outperform colloc ated arrays for future wireless communications?” in Proc. IEEE Globecom W orkshops, Malaysia, Dec. 2023, pp. 667–672
work page 2023
-
[6]
Near-field multiu ser communications based on sparse arrays,
K. Chen, C. Qi, G. Y . Li, and O. A. Dobre, “Near-field multiu ser communications based on sparse arrays,” IEEE J. Sel. Top. Signal Process., vol. 18, no. 4, pp. 619–632, May 2024
work page 2024
-
[7]
Near-field beamfocusing, localization, and channel estimation with modular linear a rrays,
A. Kosasih, ¨O. T. Demir, and E. Bj¨ ornson, “Near-field beamfocusing, localization, and channel estimation with modular linear a rrays,” arXiv preprint arXiv:2505.07991, 2025
-
[8]
6G Workshop 2025 - Tdoc 6GWS-250004,
Nokia, “6G Workshop 2025 - Tdoc 6GWS-250004,” 3GPP Techn ical Document, Mar. 2025. [Online]. Available: https://www.3g pp.org/ftp/ workshop/2025-03-10 3GPP 6G WS/Docs/6GWS-250004.zip
work page 2025
-
[9]
6G Workshop 2025 - Tdoc 6GWS-250083,
Ericsson, “6G Workshop 2025 - Tdoc 6GWS-250083,” 3GPP Te chnical Document, Mar. 2025. [Online]. Available: https://www.3g pp.org/ftp/ workshop/2025-03-10 3GPP 6G WS/Docs/6GWS-250083.zip
work page 2025
-
[10]
Impact of backward crosstalk in 2 × 2 MIMO transmitters on NMSE and spec tral efficiency,
P . H¨ andel, ¨O. T. Demir, E. Bj¨ ornson, and D. R¨ onnow, “Impact of backward crosstalk in 2 × 2 MIMO transmitters on NMSE and spec tral efficiency,” IEEE Trans. on Commun. , vol. 68, no. 7, pp. 4277–4292, 2020
work page 2020
-
[11]
Hardware distortion corre- lation has negligible impact on ul massive MIMO spectral effi ciency,
E. Bj¨ ornson, L. Sanguinetti, and J. Hoydis, “Hardware distortion corre- lation has negligible impact on ul massive MIMO spectral effi ciency,” IEEE Trans. on Commun. , vol. 67, no. 2, pp. 1085–1098, 2019
work page 2019
-
[12]
K.-K. Wong, A. Shojaeifard, K.-F. Tong, and Y . Zhang, “F luid antenna systems,” IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1950–1962, 2021
work page 1950
-
[13]
A tutorial on movable antennas for wireless networks,
L. Zhu, W. Ma et al. , “A tutorial on movable antennas for wireless networks,” IEEE Commun. Surveys Tuts. , 2025, to appear
work page 2025
-
[14]
Antenna sel ection in measured massive mimo channels using convex optimization,
X. Gao, O. Edfors, J. Liu, and F. Tufvesson, “Antenna sel ection in measured massive mimo channels using convex optimization, ” in Proc. IEEE Globecom W orkshops, 2013, p. 129–134
work page 2013
-
[15]
N. Rajapaksha, J. Mohammadi, S. Wesemann, T. Wild, and N . Rajatheva, “Minimizing energy consumption in MU-MIMO via antenna muti ng by neural networks with asymmetric loss,” IEEE Trans. on V ehic. Technol., vol. 73, no. 5, pp. 6600–6613, May 2024
work page 2024
-
[16]
The Bussgang decomposition of non-linear systems: Basic theory and MIMO extensions,
¨O. T. Demir and E. Bj¨ ornson, “The Bussgang decomposition of non-linear systems: Basic theory and MIMO extensions,” CoRR, 2020. [Online]. Available: http://arxiv.org/abs/2005.01597
-
[17]
How much training is need ed in multiple-antenna wireless links?
B. Hassibi and B. M. Hochwald, “How much training is need ed in multiple-antenna wireless links?” IEEE Trans. on Inf. Theory , vol. 49, no. 4, pp. 951–963, 2003
work page 2003
-
[18]
Fundamentals of energy-efficient wireless links: Optimal ratios and scalin g behaviors,
A. Enqvist, ¨O. T. Demir, C. Cavdar, and E. Bj¨ ornson, “Fundamentals of energy-efficient wireless links: Optimal ratios and scalin g behaviors,” in 2024 IEEE 99th V ehicular Technology Conference (VTC2024-S pring). IEEE, 2024, pp. 1–6
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.