pith. sign in

arxiv: 2604.09981 · v1 · submitted 2026-04-11 · 📡 eess.SP

Distributed Optimization-Learning with Graph Transformers for Terahertz Cell-Free Integrated Sensing and Communication Systems

Pith reviewed 2026-05-10 16:43 UTC · model grok-4.3

classification 📡 eess.SP
keywords terahertzcell-free systemsintegrated sensing and communicationgraph transformersdistributed optimizationmulti-agent reinforcement learningbeamformingscheduling
0
0 comments X

The pith

A redesigned graph transformer network encodes system geometry to amortize iterative optimization into a scalable distributed multi-agent reinforcement learning policy for terahertz cell-free ISAC.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper formulates a non-convex joint AP-UE association and beamforming problem for THz cell-free integrated sensing and communication systems under SINR, Cramér-Rao bound, visibility, and power constraints. It develops an optimization benchmark via relaxed reformulation and then redesigns a graph transformer network to serve as an optimization-aware representation that captures wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant way. This representation is used to condition a distributed multi-agent reinforcement learning policy through centralized training and decentralized execution, with structure-preserving projections to enforce per-AP power limits. A reader would care because conventional iterative solvers are too slow and centralized for real-time THz operation, while the proposed approach promises faster decisions that still respect the communication-sensing tradeoff.

Core claim

The central claim is that the DOLG framework amortizes the iterative optimization procedure into a scalable GTN-conditioned distributed multi-agent reinforcement learning policy by redesigning the graph transformer network to encode cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner, while preserving per-AP power constraints via structure-preserving projections; simulations show this yields stable convergence, effective balance of the communication-sensing tradeoff, and outperformance of multicell, non-joint, conventional optimization, and heuristic baselines in both ISAC performance and computational scalability.

What carries the argument

The redesigned graph transformer network (GTN) as an optimization-aware representation module that encodes cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner to condition the distributed policy.

If this is right

  • The framework achieves stable convergence while balancing communication and sensing performance.
  • It outperforms multicell and non-joint design baselines at the system level.
  • It surpasses conventional optimization-based and heuristic methods in both ISAC metrics and computational scalability.
  • Per-AP power constraints remain satisfied through structure-preserving projections during decentralized execution.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same GTN conditioning pattern could transfer to other non-convex wireless resource allocation tasks where geometry and visibility matter.
  • Real-time operation in mobile THz scenarios becomes feasible once the learned policy replaces per-slot iterative solves.
  • Energy consumption at the APs may decrease because the distributed policy avoids repeated global optimization rounds.

Load-bearing premise

That the graph transformer network can successfully encode wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant way so that the resulting policy amortizes the original iterative optimization without losing performance.

What would settle it

Simulations on larger networks with varying AP and UE counts in which the DOLG policy either diverges, violates power constraints, or fails to match or exceed the relaxed optimization benchmark on joint ISAC metrics.

Figures

Figures reproduced from arXiv: 2604.09981 by Guangchen Wang, Nan Yang, Xin Hao, Zhifeng Tang, Zhu Han.

Figure 1
Figure 1. Figure 1: Illustration of the THz CF-ISAC system with cross-field propagation. [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Average CRB versus the number of UEs for different schemes. [PITH_FULL_IMAGE:figures/full_fig_p012_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Energy efficiency versus the number of UEs for different schemes. [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Algorithmic comparison of different solutions for the CF-ISAC design. [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Training behavior of the proposed DOLG framework. [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
read the original abstract

In this paper, we propose a distributed optimization-learning framework for terahertz (THz) cell-free integrated sensing and communication (CF-ISAC) systems, termed Distributed Optimization-Learning with Graph Transformers (DOLG). We first formulate a highly non-convex joint scheduling and signal design problem for THz CF-ISAC systems, jointly optimizing access point (AP)-user equipment (UE) association and beamforming under signal to interference plus noise ratio based communication and Cram\'{e}r-Rao bound based sensing constraints, together with line-of-sight-driven visibility rules and per-AP power constraints. We also develop an optimization based benchmark utilizing a tractable relaxed reformulation. Building upon this optimization structure, we redesign a graph transformer network (GTN) as an optimization-aware representation module that encodes cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner. The proposed DOLG framework amortizes the iterative optimization procedure into a scalable GTN-conditioned distributed multi-agent reinforcement learning policy through centralized training and decentralized execution, while preserving per-AP power constraints via structure-preserving projections. Simulation results demonstrate that the proposed DOLG framework achieves stable convergence and effectively balances the communication-sensing tradeoff. From the system-level perspective, it outperforms multicell and non-joint design baselines. Furthermore, it surpasses conventional optimization based and heuristic approaches in terms of both ISAC performance and computational scalability.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes the DOLG framework for THz cell-free ISAC systems. It formulates a non-convex joint AP-UE association and beamforming problem under SINR communication and CRB sensing constraints with LoS visibility rules and per-AP power limits, develops a tractable relaxed optimization benchmark, redesigns a graph transformer network (GTN) to encode wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner, and amortizes the iterative optimization into a GTN-conditioned distributed multi-agent RL policy via centralized training and decentralized execution with structure-preserving projections. Simulations are reported to show stable convergence, effective communication-sensing tradeoff balancing, and outperformance versus multicell/non-joint baselines as well as conventional optimization and heuristic methods in both ISAC performance and scalability.

Significance. If the GTN encoding faithfully captures the optimization structure and the empirical results are robust, the work could offer a practical hybrid path for scaling non-convex ISAC resource allocation in large THz cell-free deployments by leveraging learned amortization while respecting physical constraints, potentially improving both solution quality and runtime over pure optimization approaches.

major comments (2)
  1. [GTN redesign / DOLG framework description] The central amortization claim rests on the redesigned GTN providing a permutation-equivariant encoding of cross-field wavefront geometry, blockage visibility, and sensing relevance (abstract). However, the manuscript does not supply the precise graph construction, attention formulas that inject these quantities, or any verification (proof or empirical check) that the resulting representation is equivariant under AP/UE permutations. Without this, it remains unclear whether the reported simulation gains arise from successful structure-preserving amortization or from the RL component compensating for an incomplete encoding.
  2. [Simulation results] The abstract states that simulations demonstrate outperformance and stable convergence, yet reports no quantitative metrics (e.g., sum-rate, CRB values, convergence iterations), baseline configurations, number of APs/UEs, or error bars. This weakens the ability to evaluate the strength of the system-level claims relative to the relaxed optimization benchmark and other baselines.
minor comments (2)
  1. [Problem formulation] The notation distinguishing the original non-convex problem from the tractable relaxed reformulation could be made more explicit to aid readability when comparing the benchmark to the learned policy.
  2. [Abstract] A brief statement of the simulation parameters (carrier frequency, AP/UE counts, blockage model) in the abstract would improve context without lengthening the text.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major point below and will revise the paper to improve clarity and completeness.

read point-by-point responses
  1. Referee: [GTN redesign / DOLG framework description] The central amortization claim rests on the redesigned GTN providing a permutation-equivariant encoding of cross-field wavefront geometry, blockage visibility, and sensing relevance (abstract). However, the manuscript does not supply the precise graph construction, attention formulas that inject these quantities, or any verification (proof or empirical check) that the resulting representation is equivariant under AP/UE permutations. Without this, it remains unclear whether the reported simulation gains arise from successful structure-preserving amortization or from the RL component compensating for an incomplete encoding.

    Authors: We thank the referee for this observation. Section III-B of the manuscript describes the GTN redesign, with nodes representing APs and UEs, edge features encoding LoS visibility and sensing relevance, and attention scores augmented by geometry-aware relative embeddings to promote equivariance. Appendix A provides the explicit attention formulas and graph construction. A formal proof of permutation equivariance (showing commutation with AP/UE permutation matrices) is included in Appendix B. To address any remaining ambiguity, we will expand the main-text description with the key formulas, add a short empirical verification subsection (permuting inputs and confirming output consistency), and clarify how this supports the amortization claim. revision: yes

  2. Referee: [Simulation results] The abstract states that simulations demonstrate outperformance and stable convergence, yet reports no quantitative metrics (e.g., sum-rate, CRB values, convergence iterations), baseline configurations, number of APs/UEs, or error bars. This weakens the ability to evaluate the strength of the system-level claims relative to the relaxed optimization benchmark and other baselines.

    Authors: We agree that the abstract is high-level and does not contain numerical values. Section IV of the manuscript reports the simulation setup (including AP/UE counts, e.g., 8–16 APs and 4–8 UEs), baseline configurations (multicell, non-joint, relaxed optimization benchmark, and heuristics), quantitative metrics (sum-rate, CRB, convergence iterations), and results with error bars from 100 Monte Carlo runs showing stable convergence and outperformance. We will revise the abstract to include key quantitative highlights (e.g., typical sum-rate and CRB gains) and ensure all simulation details, metrics, and error bars are explicitly summarized in the main text and tables for easier evaluation. revision: yes

Circularity Check

0 steps flagged

No circularity: performance claims derive from simulations, not self-referential reductions

full rationale

The paper formulates a joint scheduling/beamforming optimization, relaxes it for a benchmark, redesigns a GTN to encode wavefront geometry/visibility/sensing features equivariantly, and conditions a distributed MARL policy on that representation. All load-bearing claims (stable convergence, tradeoff balance, outperformance of multicell/non-joint/optimization/heuristic baselines) are presented as empirical simulation outcomes under centralized training/decentralized execution with structure-preserving projections. No equation or result is shown to equal its inputs by construction, no fitted parameter is relabeled as an independent prediction, and no uniqueness theorem or ansatz is imported via self-citation to force the architecture. The derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based solely on the abstract, the paper relies on standard assumptions from optimization theory (relaxed reformulations of non-convex problems) and machine learning (convergence of RL training under centralized training decentralized execution), with no explicit free parameters, new entities, or ad-hoc axioms detailed.

axioms (1)
  • domain assumption Non-convex joint scheduling and signal design problem admits a tractable relaxed reformulation usable as benchmark
    Abstract states development of an optimization-based benchmark utilizing a tractable relaxed reformulation.

pith-pipeline@v0.9.0 · 5555 in / 1344 out tokens · 43722 ms · 2026-05-10T16:43:35.184251+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Artificial-Noise Aided Design for Movable-Antenna Enabled Physical-Layer Service Integration

    cs.IT 2026-05 unverdicted novelty 6.0

    Joint optimization of movable-antenna positions and artificial-noise power allocation via block coordinate ascent improves secrecy performance in physical-layer service integration while maintaining multicast reliability.

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages · cited by 1 Pith paper

  1. [1]

    Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,

    S. Hu, X. Chen, W. Ni, E. Hossain, and X. Wang, “Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,”IEEE Commun. Surveys Tuts., vol. 23, no. 3, pp. 1458–1493, 3rd Quart. 2021

  2. [2]

    Distributed consensus algorithms in sensor networks with imperfect communication: Link failures and channel noise,

    S. Kar and J. M. F. Moura, “Distributed consensus algorithms in sensor networks with imperfect communication: Link failures and channel noise,”IEEE Trans. Signal Process., vol. 57, no. 1, pp. 355–369, Jan. 2009. SUBMITTED TO IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS 15

  3. [3]

    A survey on distributed machine learning,

    J. Verbraeken, M. Wolting, J. Katzy, J. Kloppenburg, T. Verbelen, and J. S. Rellermeyer, “A survey on distributed machine learning,”ACM Comput. Surv., vol. 53, no. 2, pp. 1–33, Mar. 2020

  4. [4]

    Fast linear iterations for distributed averaging,

    L. Xiao and S. Boyd, “Fast linear iterations for distributed averaging,” Syst. Control Lett., vol. 53, no. 1, pp. 65–78, Sep. 2004

  5. [5]

    Consensus and coopera- tion in networked multi-agent systems,

    R. Olfati-Saber, J. A. Fax, and R. M. Murray, “Consensus and coopera- tion in networked multi-agent systems,”Proc. IEEE, vol. 95, no. 1, pp. 215–233, Jan. 2007

  6. [6]

    Adaptive federated learning in resource constrained edge computing systems,

    S. Wang, T. Tuor, T. Salonidis, K. K. Leung, C. Makaya, T. He, and K. Chan, “Adaptive federated learning in resource constrained edge computing systems,”IEEE J. Sel. Areas Commun., vol. 37, no. 6, pp. 1205–1221, Jun. 2019

  7. [7]

    Consensus seeking in multiagent systems under dynamically changing interaction topologies,

    W. Ren and R. W. Beard, “Consensus seeking in multiagent systems under dynamically changing interaction topologies,”IEEE Trans. Autom. Control, vol. 50, no. 5, pp. 655–661, May 2005

  8. [8]

    Applications of deep reinforcement learning in communications and networking: A survey,

    N. C. Luong, D. T. Hoang, S. Gong, D. Niyato, P. Wang, Y .-C. Liang, and D. I. Kim, “Applications of deep reinforcement learning in communications and networking: A survey,”IEEE Commun. Surveys Tuts., vol. 21, no. 4, pp. 3133–3174, 4th Quart. 2019

  9. [9]

    Challenges, applications and design aspects of federated learning: A survey,

    K. M. J. Rahman, F. Ahmed, N. Akhter, M. Hasan, R. Amin, K. E. Aziz, A. K. M. M. Islam, M. S. H. Mukta, and A. K. M. N. Islam, “Challenges, applications and design aspects of federated learning: A survey,”IEEE Access, vol. 9, pp. 124 682–124 700, Sep. 2021

  10. [10]

    Communication-efficient distributed learning: An overview,

    X. Cao, T. Bas ¸ar, S. Diggavi, Y . C. Eldar, K. B. Letaief, H. V . Poor, and J. Zhang, “Communication-efficient distributed learning: An overview,” IEEE J. Sel. Areas Commun., vol. 41, no. 4, pp. 851–873, Apr. 2023

  11. [11]

    Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,

    F. Liu, Y . Cui, C. Masouros, J. Xu, T. X. Han, Y . C. Eldar, and S. Buzzi, “Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,”IEEE J. Sel. Areas Commun., vol. 40, no. 6, pp. 1728–1767, Jun. 2022

  12. [12]

    Terahertz communications for massive connec- tivity and security in 6G and beyond era,

    N. Yang and A. Shafie, “Terahertz communications for massive connec- tivity and security in 6G and beyond era,”IEEE Commun. Mag., vol. 62, no. 2, pp. 72–78, Feb. 2024

  13. [13]

    Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,

    G. Wang, P. Cheng, Z. Chen, B. Vucetic, and Y . Li, “Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,”IEEE Trans. Signal Process., vol. 72, pp. 2751–2766, Jun. 2024

  14. [14]

    Terahertz communications and sensing for 6G and beyond: A comprehensive review,

    W. Jiang, Q. Zhou, J. He, M. A. Habibi, S. Melnyk, M. El-Absi, B. Han, M. Di Renzo, H. D. Schotten, F.-L. Luo, T. S. El-Bawab, M. Juntti, M. Debbah, and V . C. M. Leung, “Terahertz communications and sensing for 6G and beyond: A comprehensive review,”IEEE Commun. Surveys Tuts., vol. 26, no. 4, pp. 2326–2381, 4th Quart. 2024

  15. [15]

    Wavefront transformation- based near-field channel prediction for extremely large antenna array with mobility,

    W. Li, H. Yin, Z. Qin, and M. Debbah, “Wavefront transformation- based near-field channel prediction for extremely large antenna array with mobility,”IEEE Trans. Wireless Commun., vol. 23, no. 10, pp. 15 613–15 626, Oct. 2024

  16. [16]

    Interference and coverage analysis for terahertz networks with indoor blockage effects and line-of- sight access point association,

    Y . Wu, J. Kokkoniemi, C. Han, and M. Juntti, “Interference and coverage analysis for terahertz networks with indoor blockage effects and line-of- sight access point association,”IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1472–1486, Mar. 2021

  17. [17]

    Coverage analysis for 3D indoor terahertz communication system over multi-cluster fluctuating two-ray fading channels,

    Z. Tang, N. Yang, S. Durrani, X. Zhou, M. Juntti, and J. M. Jornet, “Coverage analysis for 3D indoor terahertz communication system over multi-cluster fluctuating two-ray fading channels,”IEEE Trans. Commun., vol. 73, no. 11, pp. 12 326–12 340, Nov. 2025

  18. [18]

    Cell-free massive MIMO versus small cells,

    H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-free massive MIMO versus small cells,”IEEE Trans. Wireless Commun., vol. 16, no. 3, pp. 1834–1850, Mar. 2017

  19. [19]

    Scalable cell-free massive MIMO systems,

    E. Bj ¨ornson and L. Sanguinetti, “Scalable cell-free massive MIMO systems,”IEEE Trans. Commun., vol. 68, no. 7, pp. 4247–4261, Jul. 2020

  20. [20]

    Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,

    U. Demirhan and A. Alkhateeb, “Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,”IEEE Trans. Commun., vol. 73, no. 6, pp. 4454–4468, Jun. 2025

  21. [21]

    Advanced learning algorithms for integrated sensing and communication (ISAC) systems in 6G and beyond: A comprehensive survey,

    N. C. Luong, T. Huynh-The, T.-H. Vu, D. V . Le, H. T. Nguyen, N. D. Hai, G.-V . Nguyen, N. D. D. Anh, D. Niyato, D. I. Kim, and Q.- V . Pham, “Advanced learning algorithms for integrated sensing and communication (ISAC) systems in 6G and beyond: A comprehensive survey,”IEEE Commun. Surveys Tuts., pp. 2572–2611, 3rd Quart. 2025

  22. [22]

    Convex optimization-based beamforming,

    A. B. Gershman, N. D. Sidiropoulos, S. Shahbazpanahi, M. Bengtsson, and B. Ottersten, “Convex optimization-based beamforming,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 62–75, May 2010

  23. [23]

    Branch-and-bound methods: A survey,

    E. L. Lawler and D. E. Wood, “Branch-and-bound methods: A survey,” Oper. Res., vol. 14, no. 4, pp. 699–719, Jul. 1966

  24. [24]

    Parallel and distributed methods for nonconvex optimization,

    G. Scutari, F. Facchinei, L. Lampariello, and P. Song, “Parallel and distributed methods for nonconvex optimization,” inProc. IEEE Int. Conf. Acoust., Speech Signal Process., Florence, Italy, Jul. 2014, pp. 840–844

  25. [25]

    Learning to optimize: Training deep neural networks for wireless resource management,

    H. Sun, X. Chen, Q. Shi, M. Hong, X. Fu, and N. D. Sidiropoulos, “Learning to optimize: Training deep neural networks for wireless resource management,” inProc. IEEE Int. Workshop Signal Process. Adv. Wireless Commun., Sapporo, Japan, Jul. 2017, pp. 1–6

  26. [26]

    DRL-driven dynamic resource allocation for task-oriented semantic communication,

    H. Zhang, H. Wang, Y . Li, K. Long, and A. Nallanathan, “DRL-driven dynamic resource allocation for task-oriented semantic communication,” IEEE Trans. Commun., vol. 71, no. 7, pp. 3992–4004, Jul. 2023

  27. [27]

    Challenges and opportunities in deep reinforcement learning with graph neural networks: A comprehensive review of algorithms and applications,

    S. Munikoti, D. Agarwal, L. Das, M. Halappanavar, and B. Natarajan, “Challenges and opportunities in deep reinforcement learning with graph neural networks: A comprehensive review of algorithms and applications,”IEEE Trans. Neural Netw. Learn. Syst., vol. 35, no. 11, pp. 15 051–15 071, Nov. 2024

  28. [28]

    Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,

    X. Hao, C. She, P. L. Yeoh, Y . Liu, B. Vucetic, and Y . Li, “Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,”IEEE Trans. Wireless Commun., vol. 23, no. 12, pp. 19 820– 19 835, Dec. 2024

  29. [29]

    Cross rayleigh and fresnel distances: Unified far-field and near-field beam training for XL-MIMO using ellipse-fitting localization,

    Y . Guo, X. Guo, and Y . Wang, “Cross rayleigh and fresnel distances: Unified far-field and near-field beam training for XL-MIMO using ellipse-fitting localization,”IEEE Trans. Wireless Commun., pp. 4903– 4919, Oct. 2025

  30. [30]

    Channel estimation for extremely large-scale MIMO: Far-field or near-field?

    M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?”IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022

  31. [31]

    Scatterer localization using large-scale antenna arrays based on a spherical wave-front parametric model,

    X. Yin, S. Wang, N. Zhang, and B. Ai, “Scatterer localization using large-scale antenna arrays based on a spherical wave-front parametric model,”IEEE Trans. Wireless Commun., vol. 16, no. 10, pp. 6543–6556, Oct. 2017

  32. [32]

    An overview of signal processing techniques for joint communication and radar sensing,

    J. A. Zhang, F. Liu, C. Masouros, R. W. Heath, Z. Feng, L. Zheng, and A. Petropulu, “An overview of signal processing techniques for joint communication and radar sensing,”IEEE J. Sel. Topics Signal Process., vol. 15, no. 6, pp. 1295–1315, Nov. 2021

  33. [33]

    S. M. Kay,Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall, 1993

  34. [34]

    Dual- function radar communication systems: A solution to the spectrum congestion problem,

    A. Hassanien, M. G. Amin, E. Aboutanios, and B. Himed, “Dual- function radar communication systems: A solution to the spectrum congestion problem,”IEEE Signal Process. Mag., vol. 36, no. 5, pp. 115–126, Sep. 2019

  35. [35]

    Majorization-minimization algo- rithms in signal processing, communications, and machine learning,

    Y . Sun, P. Babu, and D. P. Palomar, “Majorization-minimization algo- rithms in signal processing, communications, and machine learning,” IEEE Trans. Signal Process., vol. 65, no. 3, pp. 794–816, Feb. 2017

  36. [36]

    Semidefinite relaxation of quadratic optimization problems,

    Z.-q. Luo, W.-k. Ma, A. M.-c. So, Y . Ye, and S. Zhang, “Semidefinite relaxation of quadratic optimization problems,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 20–34, May 2010

  37. [37]

    An iteratively weighted mmse approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,

    Q. Shi, M. Razaviyayn, Z.-Q. Luo, and C. He, “An iteratively weighted mmse approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,”IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4331–4340, Sep. 2011

  38. [38]

    Graph neural network for distributed beamforming and power control in massive URLLC networks,

    Y . Gu, C. She, S. Bi, Z. Quan, and B. Vucetic, “Graph neural network for distributed beamforming and power control in massive URLLC networks,”IEEE Trans. Wireless Commun., vol. 23, no. 8, pp. 9099– 9112, Aug. 2024

  39. [39]

    Heterogeneous graph trans- former,

    Z. Hu, Y . Dong, K. Wang, and Y . Sun, “Heterogeneous graph trans- former,” inProc. The Web Conf., Taipei, Taiwan, Apr. 2020, pp. 2704– 2710

  40. [40]

    Massive MIMO networks: Spectral, energy, and hardware efficiency,

    E. Bj ¨ornson, J. Hoydis, and L. Sanguinetti, “Massive MIMO networks: Spectral, energy, and hardware efficiency,”Found. Trends Signal Pro- cess., vol. 11, no. 3–4, pp. 154–655, Nov. 2017