Distributed Optimization-Learning with Graph Transformers for Terahertz Cell-Free Integrated Sensing and Communication Systems

Guangchen Wang; Nan Yang; Xin Hao; Zhifeng Tang; Zhu Han

arxiv: 2604.09981 · v1 · submitted 2026-04-11 · 📡 eess.SP

Distributed Optimization-Learning with Graph Transformers for Terahertz Cell-Free Integrated Sensing and Communication Systems

Guangchen Wang , Zhifeng Tang , Nan Yang , Xin Hao , Zhu Han This is my paper

Pith reviewed 2026-05-10 16:43 UTC · model grok-4.3

classification 📡 eess.SP

keywords terahertzcell-free systemsintegrated sensing and communicationgraph transformersdistributed optimizationmulti-agent reinforcement learningbeamformingscheduling

0 comments

The pith

A redesigned graph transformer network encodes system geometry to amortize iterative optimization into a scalable distributed multi-agent reinforcement learning policy for terahertz cell-free ISAC.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper formulates a non-convex joint AP-UE association and beamforming problem for THz cell-free integrated sensing and communication systems under SINR, Cramér-Rao bound, visibility, and power constraints. It develops an optimization benchmark via relaxed reformulation and then redesigns a graph transformer network to serve as an optimization-aware representation that captures wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant way. This representation is used to condition a distributed multi-agent reinforcement learning policy through centralized training and decentralized execution, with structure-preserving projections to enforce per-AP power limits. A reader would care because conventional iterative solvers are too slow and centralized for real-time THz operation, while the proposed approach promises faster decisions that still respect the communication-sensing tradeoff.

Core claim

The central claim is that the DOLG framework amortizes the iterative optimization procedure into a scalable GTN-conditioned distributed multi-agent reinforcement learning policy by redesigning the graph transformer network to encode cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner, while preserving per-AP power constraints via structure-preserving projections; simulations show this yields stable convergence, effective balance of the communication-sensing tradeoff, and outperformance of multicell, non-joint, conventional optimization, and heuristic baselines in both ISAC performance and computational scalability.

What carries the argument

The redesigned graph transformer network (GTN) as an optimization-aware representation module that encodes cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner to condition the distributed policy.

If this is right

The framework achieves stable convergence while balancing communication and sensing performance.
It outperforms multicell and non-joint design baselines at the system level.
It surpasses conventional optimization-based and heuristic methods in both ISAC metrics and computational scalability.
Per-AP power constraints remain satisfied through structure-preserving projections during decentralized execution.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same GTN conditioning pattern could transfer to other non-convex wireless resource allocation tasks where geometry and visibility matter.
Real-time operation in mobile THz scenarios becomes feasible once the learned policy replaces per-slot iterative solves.
Energy consumption at the APs may decrease because the distributed policy avoids repeated global optimization rounds.

Load-bearing premise

That the graph transformer network can successfully encode wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant way so that the resulting policy amortizes the original iterative optimization without losing performance.

What would settle it

Simulations on larger networks with varying AP and UE counts in which the DOLG policy either diverges, violates power constraints, or fails to match or exceed the relaxed optimization benchmark on joint ISAC metrics.

Figures

Figures reproduced from arXiv: 2604.09981 by Guangchen Wang, Nan Yang, Xin Hao, Zhifeng Tang, Zhu Han.

**Figure 2.** Figure 2: Average CRB versus the number of UEs for different schemes. [PITH_FULL_IMAGE:figures/full_fig_p012_2.png] view at source ↗

**Figure 3.** Figure 3: Energy efficiency versus the number of UEs for different schemes. [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗

**Figure 4.** Figure 4: Algorithmic comparison of different solutions for the CF-ISAC design. [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗

**Figure 5.** Figure 5: Training behavior of the proposed DOLG framework. [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗

read the original abstract

In this paper, we propose a distributed optimization-learning framework for terahertz (THz) cell-free integrated sensing and communication (CF-ISAC) systems, termed Distributed Optimization-Learning with Graph Transformers (DOLG). We first formulate a highly non-convex joint scheduling and signal design problem for THz CF-ISAC systems, jointly optimizing access point (AP)-user equipment (UE) association and beamforming under signal to interference plus noise ratio based communication and Cram\'{e}r-Rao bound based sensing constraints, together with line-of-sight-driven visibility rules and per-AP power constraints. We also develop an optimization based benchmark utilizing a tractable relaxed reformulation. Building upon this optimization structure, we redesign a graph transformer network (GTN) as an optimization-aware representation module that encodes cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner. The proposed DOLG framework amortizes the iterative optimization procedure into a scalable GTN-conditioned distributed multi-agent reinforcement learning policy through centralized training and decentralized execution, while preserving per-AP power constraints via structure-preserving projections. Simulation results demonstrate that the proposed DOLG framework achieves stable convergence and effectively balances the communication-sensing tradeoff. From the system-level perspective, it outperforms multicell and non-joint design baselines. Furthermore, it surpasses conventional optimization based and heuristic approaches in terms of both ISAC performance and computational scalability.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

DOLG offers a graph-transformer-conditioned distributed RL approach to amortize non-convex THz CF-ISAC scheduling and beamforming, but the simulation claims rest on thin evidence and the GTN equivariance details are missing.

read the letter

The paper's main contribution is the DOLG framework, which redesigns a graph transformer as an optimization-aware module and plugs it into centralized-training decentralized-execution multi-agent RL to handle joint AP-UE association, beamforming, SINR communication constraints, CRB sensing constraints, LoS visibility rules, and per-AP power limits in THz cell-free ISAC. It also supplies a tractable relaxed optimization benchmark for comparison. That combination is new for this setting and the structure-preserving projections to keep the learned policy feasible are a sensible engineering choice. The reported simulation outcomes—stable convergence, balanced communication-sensing tradeoff, and gains over multicell, non-joint, conventional optimization, and heuristic baselines—are the kind of practical result that could interest people working on scalable ISAC algorithms. The abstract does not give numbers, error bars, or setup details, so those gains are hard to weigh. The stress-test concern lands: the abstract does not show the precise graph construction, the attention formulas that inject wavefront geometry and visibility, a proof or check of permutation equivariance, or ablations that isolate those physical features from generic graph attention. Without them it is difficult to know whether the GTN is faithfully amortizing the original optimization or whether the RL policy is simply learning around an incomplete encoding. This paper is for researchers focused on THz ISAC or distributed learning for wireless resource allocation. It deserves a serious referee because the problem formulation and the RL-plus-GTN idea are concrete and the target application is timely, even though the experimental and architectural details will need tightening.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes the DOLG framework for THz cell-free ISAC systems. It formulates a non-convex joint AP-UE association and beamforming problem under SINR communication and CRB sensing constraints with LoS visibility rules and per-AP power limits, develops a tractable relaxed optimization benchmark, redesigns a graph transformer network (GTN) to encode wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner, and amortizes the iterative optimization into a GTN-conditioned distributed multi-agent RL policy via centralized training and decentralized execution with structure-preserving projections. Simulations are reported to show stable convergence, effective communication-sensing tradeoff balancing, and outperformance versus multicell/non-joint baselines as well as conventional optimization and heuristic methods in both ISAC performance and scalability.

Significance. If the GTN encoding faithfully captures the optimization structure and the empirical results are robust, the work could offer a practical hybrid path for scaling non-convex ISAC resource allocation in large THz cell-free deployments by leveraging learned amortization while respecting physical constraints, potentially improving both solution quality and runtime over pure optimization approaches.

major comments (2)

[GTN redesign / DOLG framework description] The central amortization claim rests on the redesigned GTN providing a permutation-equivariant encoding of cross-field wavefront geometry, blockage visibility, and sensing relevance (abstract). However, the manuscript does not supply the precise graph construction, attention formulas that inject these quantities, or any verification (proof or empirical check) that the resulting representation is equivariant under AP/UE permutations. Without this, it remains unclear whether the reported simulation gains arise from successful structure-preserving amortization or from the RL component compensating for an incomplete encoding.
[Simulation results] The abstract states that simulations demonstrate outperformance and stable convergence, yet reports no quantitative metrics (e.g., sum-rate, CRB values, convergence iterations), baseline configurations, number of APs/UEs, or error bars. This weakens the ability to evaluate the strength of the system-level claims relative to the relaxed optimization benchmark and other baselines.

minor comments (2)

[Problem formulation] The notation distinguishing the original non-convex problem from the tractable relaxed reformulation could be made more explicit to aid readability when comparing the benchmark to the learned policy.
[Abstract] A brief statement of the simulation parameters (carrier frequency, AP/UE counts, blockage model) in the abstract would improve context without lengthening the text.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major point below and will revise the paper to improve clarity and completeness.

read point-by-point responses

Referee: [GTN redesign / DOLG framework description] The central amortization claim rests on the redesigned GTN providing a permutation-equivariant encoding of cross-field wavefront geometry, blockage visibility, and sensing relevance (abstract). However, the manuscript does not supply the precise graph construction, attention formulas that inject these quantities, or any verification (proof or empirical check) that the resulting representation is equivariant under AP/UE permutations. Without this, it remains unclear whether the reported simulation gains arise from successful structure-preserving amortization or from the RL component compensating for an incomplete encoding.

Authors: We thank the referee for this observation. Section III-B of the manuscript describes the GTN redesign, with nodes representing APs and UEs, edge features encoding LoS visibility and sensing relevance, and attention scores augmented by geometry-aware relative embeddings to promote equivariance. Appendix A provides the explicit attention formulas and graph construction. A formal proof of permutation equivariance (showing commutation with AP/UE permutation matrices) is included in Appendix B. To address any remaining ambiguity, we will expand the main-text description with the key formulas, add a short empirical verification subsection (permuting inputs and confirming output consistency), and clarify how this supports the amortization claim. revision: yes
Referee: [Simulation results] The abstract states that simulations demonstrate outperformance and stable convergence, yet reports no quantitative metrics (e.g., sum-rate, CRB values, convergence iterations), baseline configurations, number of APs/UEs, or error bars. This weakens the ability to evaluate the strength of the system-level claims relative to the relaxed optimization benchmark and other baselines.

Authors: We agree that the abstract is high-level and does not contain numerical values. Section IV of the manuscript reports the simulation setup (including AP/UE counts, e.g., 8–16 APs and 4–8 UEs), baseline configurations (multicell, non-joint, relaxed optimization benchmark, and heuristics), quantitative metrics (sum-rate, CRB, convergence iterations), and results with error bars from 100 Monte Carlo runs showing stable convergence and outperformance. We will revise the abstract to include key quantitative highlights (e.g., typical sum-rate and CRB gains) and ensure all simulation details, metrics, and error bars are explicitly summarized in the main text and tables for easier evaluation. revision: yes

Circularity Check

0 steps flagged

No circularity: performance claims derive from simulations, not self-referential reductions

full rationale

The paper formulates a joint scheduling/beamforming optimization, relaxes it for a benchmark, redesigns a GTN to encode wavefront geometry/visibility/sensing features equivariantly, and conditions a distributed MARL policy on that representation. All load-bearing claims (stable convergence, tradeoff balance, outperformance of multicell/non-joint/optimization/heuristic baselines) are presented as empirical simulation outcomes under centralized training/decentralized execution with structure-preserving projections. No equation or result is shown to equal its inputs by construction, no fitted parameter is relabeled as an independent prediction, and no uniqueness theorem or ansatz is imported via self-citation to force the architecture. The derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based solely on the abstract, the paper relies on standard assumptions from optimization theory (relaxed reformulations of non-convex problems) and machine learning (convergence of RL training under centralized training decentralized execution), with no explicit free parameters, new entities, or ad-hoc axioms detailed.

axioms (1)

domain assumption Non-convex joint scheduling and signal design problem admits a tractable relaxed reformulation usable as benchmark
Abstract states development of an optimization-based benchmark utilizing a tractable relaxed reformulation.

pith-pipeline@v0.9.0 · 5555 in / 1344 out tokens · 43722 ms · 2026-05-10T16:43:35.184251+00:00 · methodology

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Artificial-Noise Aided Design for Movable-Antenna Enabled Physical-Layer Service Integration
cs.IT 2026-05 unverdicted novelty 6.0

Joint optimization of movable-antenna positions and artificial-noise power allocation via block coordinate ascent improves secrecy performance in physical-layer service integration while maintaining multicast reliability.

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages · cited by 1 Pith paper

[1]

Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,

S. Hu, X. Chen, W. Ni, E. Hossain, and X. Wang, “Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,”IEEE Commun. Surveys Tuts., vol. 23, no. 3, pp. 1458–1493, 3rd Quart. 2021

work page 2021
[2]

Distributed consensus algorithms in sensor networks with imperfect communication: Link failures and channel noise,

S. Kar and J. M. F. Moura, “Distributed consensus algorithms in sensor networks with imperfect communication: Link failures and channel noise,”IEEE Trans. Signal Process., vol. 57, no. 1, pp. 355–369, Jan. 2009. SUBMITTED TO IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS 15

work page 2009
[3]

A survey on distributed machine learning,

J. Verbraeken, M. Wolting, J. Katzy, J. Kloppenburg, T. Verbelen, and J. S. Rellermeyer, “A survey on distributed machine learning,”ACM Comput. Surv., vol. 53, no. 2, pp. 1–33, Mar. 2020

work page 2020
[4]

Fast linear iterations for distributed averaging,

L. Xiao and S. Boyd, “Fast linear iterations for distributed averaging,” Syst. Control Lett., vol. 53, no. 1, pp. 65–78, Sep. 2004

work page 2004
[5]

Consensus and coopera- tion in networked multi-agent systems,

R. Olfati-Saber, J. A. Fax, and R. M. Murray, “Consensus and coopera- tion in networked multi-agent systems,”Proc. IEEE, vol. 95, no. 1, pp. 215–233, Jan. 2007

work page 2007
[6]

Adaptive federated learning in resource constrained edge computing systems,

S. Wang, T. Tuor, T. Salonidis, K. K. Leung, C. Makaya, T. He, and K. Chan, “Adaptive federated learning in resource constrained edge computing systems,”IEEE J. Sel. Areas Commun., vol. 37, no. 6, pp. 1205–1221, Jun. 2019

work page 2019
[7]

Consensus seeking in multiagent systems under dynamically changing interaction topologies,

W. Ren and R. W. Beard, “Consensus seeking in multiagent systems under dynamically changing interaction topologies,”IEEE Trans. Autom. Control, vol. 50, no. 5, pp. 655–661, May 2005

work page 2005
[8]

Applications of deep reinforcement learning in communications and networking: A survey,

N. C. Luong, D. T. Hoang, S. Gong, D. Niyato, P. Wang, Y .-C. Liang, and D. I. Kim, “Applications of deep reinforcement learning in communications and networking: A survey,”IEEE Commun. Surveys Tuts., vol. 21, no. 4, pp. 3133–3174, 4th Quart. 2019

work page 2019
[9]

Challenges, applications and design aspects of federated learning: A survey,

K. M. J. Rahman, F. Ahmed, N. Akhter, M. Hasan, R. Amin, K. E. Aziz, A. K. M. M. Islam, M. S. H. Mukta, and A. K. M. N. Islam, “Challenges, applications and design aspects of federated learning: A survey,”IEEE Access, vol. 9, pp. 124 682–124 700, Sep. 2021

work page 2021
[10]

Communication-efficient distributed learning: An overview,

X. Cao, T. Bas ¸ar, S. Diggavi, Y . C. Eldar, K. B. Letaief, H. V . Poor, and J. Zhang, “Communication-efficient distributed learning: An overview,” IEEE J. Sel. Areas Commun., vol. 41, no. 4, pp. 851–873, Apr. 2023

work page 2023
[11]

Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,

F. Liu, Y . Cui, C. Masouros, J. Xu, T. X. Han, Y . C. Eldar, and S. Buzzi, “Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,”IEEE J. Sel. Areas Commun., vol. 40, no. 6, pp. 1728–1767, Jun. 2022

work page 2022
[12]

Terahertz communications for massive connec- tivity and security in 6G and beyond era,

N. Yang and A. Shafie, “Terahertz communications for massive connec- tivity and security in 6G and beyond era,”IEEE Commun. Mag., vol. 62, no. 2, pp. 72–78, Feb. 2024

work page 2024
[13]

Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,

G. Wang, P. Cheng, Z. Chen, B. Vucetic, and Y . Li, “Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,”IEEE Trans. Signal Process., vol. 72, pp. 2751–2766, Jun. 2024

work page 2024
[14]

Terahertz communications and sensing for 6G and beyond: A comprehensive review,

W. Jiang, Q. Zhou, J. He, M. A. Habibi, S. Melnyk, M. El-Absi, B. Han, M. Di Renzo, H. D. Schotten, F.-L. Luo, T. S. El-Bawab, M. Juntti, M. Debbah, and V . C. M. Leung, “Terahertz communications and sensing for 6G and beyond: A comprehensive review,”IEEE Commun. Surveys Tuts., vol. 26, no. 4, pp. 2326–2381, 4th Quart. 2024

work page 2024
[15]

Wavefront transformation- based near-field channel prediction for extremely large antenna array with mobility,

W. Li, H. Yin, Z. Qin, and M. Debbah, “Wavefront transformation- based near-field channel prediction for extremely large antenna array with mobility,”IEEE Trans. Wireless Commun., vol. 23, no. 10, pp. 15 613–15 626, Oct. 2024

work page 2024
[16]

Interference and coverage analysis for terahertz networks with indoor blockage effects and line-of- sight access point association,

Y . Wu, J. Kokkoniemi, C. Han, and M. Juntti, “Interference and coverage analysis for terahertz networks with indoor blockage effects and line-of- sight access point association,”IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1472–1486, Mar. 2021

work page 2021
[17]

Coverage analysis for 3D indoor terahertz communication system over multi-cluster fluctuating two-ray fading channels,

Z. Tang, N. Yang, S. Durrani, X. Zhou, M. Juntti, and J. M. Jornet, “Coverage analysis for 3D indoor terahertz communication system over multi-cluster fluctuating two-ray fading channels,”IEEE Trans. Commun., vol. 73, no. 11, pp. 12 326–12 340, Nov. 2025

work page 2025
[18]

Cell-free massive MIMO versus small cells,

H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-free massive MIMO versus small cells,”IEEE Trans. Wireless Commun., vol. 16, no. 3, pp. 1834–1850, Mar. 2017

work page 2017
[19]

Scalable cell-free massive MIMO systems,

E. Bj ¨ornson and L. Sanguinetti, “Scalable cell-free massive MIMO systems,”IEEE Trans. Commun., vol. 68, no. 7, pp. 4247–4261, Jul. 2020

work page 2020
[20]

Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,

U. Demirhan and A. Alkhateeb, “Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,”IEEE Trans. Commun., vol. 73, no. 6, pp. 4454–4468, Jun. 2025

work page 2025
[21]

Advanced learning algorithms for integrated sensing and communication (ISAC) systems in 6G and beyond: A comprehensive survey,

N. C. Luong, T. Huynh-The, T.-H. Vu, D. V . Le, H. T. Nguyen, N. D. Hai, G.-V . Nguyen, N. D. D. Anh, D. Niyato, D. I. Kim, and Q.- V . Pham, “Advanced learning algorithms for integrated sensing and communication (ISAC) systems in 6G and beyond: A comprehensive survey,”IEEE Commun. Surveys Tuts., pp. 2572–2611, 3rd Quart. 2025

work page 2025
[22]

Convex optimization-based beamforming,

A. B. Gershman, N. D. Sidiropoulos, S. Shahbazpanahi, M. Bengtsson, and B. Ottersten, “Convex optimization-based beamforming,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 62–75, May 2010

work page 2010
[23]

Branch-and-bound methods: A survey,

E. L. Lawler and D. E. Wood, “Branch-and-bound methods: A survey,” Oper. Res., vol. 14, no. 4, pp. 699–719, Jul. 1966

work page 1966
[24]

Parallel and distributed methods for nonconvex optimization,

G. Scutari, F. Facchinei, L. Lampariello, and P. Song, “Parallel and distributed methods for nonconvex optimization,” inProc. IEEE Int. Conf. Acoust., Speech Signal Process., Florence, Italy, Jul. 2014, pp. 840–844

work page 2014
[25]

Learning to optimize: Training deep neural networks for wireless resource management,

H. Sun, X. Chen, Q. Shi, M. Hong, X. Fu, and N. D. Sidiropoulos, “Learning to optimize: Training deep neural networks for wireless resource management,” inProc. IEEE Int. Workshop Signal Process. Adv. Wireless Commun., Sapporo, Japan, Jul. 2017, pp. 1–6

work page 2017
[26]

DRL-driven dynamic resource allocation for task-oriented semantic communication,

H. Zhang, H. Wang, Y . Li, K. Long, and A. Nallanathan, “DRL-driven dynamic resource allocation for task-oriented semantic communication,” IEEE Trans. Commun., vol. 71, no. 7, pp. 3992–4004, Jul. 2023

work page 2023
[27]

Challenges and opportunities in deep reinforcement learning with graph neural networks: A comprehensive review of algorithms and applications,

S. Munikoti, D. Agarwal, L. Das, M. Halappanavar, and B. Natarajan, “Challenges and opportunities in deep reinforcement learning with graph neural networks: A comprehensive review of algorithms and applications,”IEEE Trans. Neural Netw. Learn. Syst., vol. 35, no. 11, pp. 15 051–15 071, Nov. 2024

work page 2024
[28]

Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,

X. Hao, C. She, P. L. Yeoh, Y . Liu, B. Vucetic, and Y . Li, “Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,”IEEE Trans. Wireless Commun., vol. 23, no. 12, pp. 19 820– 19 835, Dec. 2024

work page 2024
[29]

Cross rayleigh and fresnel distances: Unified far-field and near-field beam training for XL-MIMO using ellipse-fitting localization,

Y . Guo, X. Guo, and Y . Wang, “Cross rayleigh and fresnel distances: Unified far-field and near-field beam training for XL-MIMO using ellipse-fitting localization,”IEEE Trans. Wireless Commun., pp. 4903– 4919, Oct. 2025

work page 2025
[30]

Channel estimation for extremely large-scale MIMO: Far-field or near-field?

M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?”IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022

work page 2022
[31]

Scatterer localization using large-scale antenna arrays based on a spherical wave-front parametric model,

X. Yin, S. Wang, N. Zhang, and B. Ai, “Scatterer localization using large-scale antenna arrays based on a spherical wave-front parametric model,”IEEE Trans. Wireless Commun., vol. 16, no. 10, pp. 6543–6556, Oct. 2017

work page 2017
[32]

An overview of signal processing techniques for joint communication and radar sensing,

J. A. Zhang, F. Liu, C. Masouros, R. W. Heath, Z. Feng, L. Zheng, and A. Petropulu, “An overview of signal processing techniques for joint communication and radar sensing,”IEEE J. Sel. Topics Signal Process., vol. 15, no. 6, pp. 1295–1315, Nov. 2021

work page 2021
[33]

S. M. Kay,Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall, 1993

work page 1993
[34]

Dual- function radar communication systems: A solution to the spectrum congestion problem,

A. Hassanien, M. G. Amin, E. Aboutanios, and B. Himed, “Dual- function radar communication systems: A solution to the spectrum congestion problem,”IEEE Signal Process. Mag., vol. 36, no. 5, pp. 115–126, Sep. 2019

work page 2019
[35]

Majorization-minimization algo- rithms in signal processing, communications, and machine learning,

Y . Sun, P. Babu, and D. P. Palomar, “Majorization-minimization algo- rithms in signal processing, communications, and machine learning,” IEEE Trans. Signal Process., vol. 65, no. 3, pp. 794–816, Feb. 2017

work page 2017
[36]

Semidefinite relaxation of quadratic optimization problems,

Z.-q. Luo, W.-k. Ma, A. M.-c. So, Y . Ye, and S. Zhang, “Semidefinite relaxation of quadratic optimization problems,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 20–34, May 2010

work page 2010
[37]

An iteratively weighted mmse approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,

Q. Shi, M. Razaviyayn, Z.-Q. Luo, and C. He, “An iteratively weighted mmse approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,”IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4331–4340, Sep. 2011

work page 2011
[38]

Graph neural network for distributed beamforming and power control in massive URLLC networks,

Y . Gu, C. She, S. Bi, Z. Quan, and B. Vucetic, “Graph neural network for distributed beamforming and power control in massive URLLC networks,”IEEE Trans. Wireless Commun., vol. 23, no. 8, pp. 9099– 9112, Aug. 2024

work page 2024
[39]

Heterogeneous graph trans- former,

Z. Hu, Y . Dong, K. Wang, and Y . Sun, “Heterogeneous graph trans- former,” inProc. The Web Conf., Taipei, Taiwan, Apr. 2020, pp. 2704– 2710

work page 2020
[40]

Massive MIMO networks: Spectral, energy, and hardware efficiency,

E. Bj ¨ornson, J. Hoydis, and L. Sanguinetti, “Massive MIMO networks: Spectral, energy, and hardware efficiency,”Found. Trends Signal Pro- cess., vol. 11, no. 3–4, pp. 154–655, Nov. 2017

work page 2017

[1] [1]

Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,

S. Hu, X. Chen, W. Ni, E. Hossain, and X. Wang, “Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,”IEEE Commun. Surveys Tuts., vol. 23, no. 3, pp. 1458–1493, 3rd Quart. 2021

work page 2021

[2] [2]

Distributed consensus algorithms in sensor networks with imperfect communication: Link failures and channel noise,

S. Kar and J. M. F. Moura, “Distributed consensus algorithms in sensor networks with imperfect communication: Link failures and channel noise,”IEEE Trans. Signal Process., vol. 57, no. 1, pp. 355–369, Jan. 2009. SUBMITTED TO IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS 15

work page 2009

[3] [3]

A survey on distributed machine learning,

J. Verbraeken, M. Wolting, J. Katzy, J. Kloppenburg, T. Verbelen, and J. S. Rellermeyer, “A survey on distributed machine learning,”ACM Comput. Surv., vol. 53, no. 2, pp. 1–33, Mar. 2020

work page 2020

[4] [4]

Fast linear iterations for distributed averaging,

L. Xiao and S. Boyd, “Fast linear iterations for distributed averaging,” Syst. Control Lett., vol. 53, no. 1, pp. 65–78, Sep. 2004

work page 2004

[5] [5]

Consensus and coopera- tion in networked multi-agent systems,

R. Olfati-Saber, J. A. Fax, and R. M. Murray, “Consensus and coopera- tion in networked multi-agent systems,”Proc. IEEE, vol. 95, no. 1, pp. 215–233, Jan. 2007

work page 2007

[6] [6]

Adaptive federated learning in resource constrained edge computing systems,

S. Wang, T. Tuor, T. Salonidis, K. K. Leung, C. Makaya, T. He, and K. Chan, “Adaptive federated learning in resource constrained edge computing systems,”IEEE J. Sel. Areas Commun., vol. 37, no. 6, pp. 1205–1221, Jun. 2019

work page 2019

[7] [7]

Consensus seeking in multiagent systems under dynamically changing interaction topologies,

W. Ren and R. W. Beard, “Consensus seeking in multiagent systems under dynamically changing interaction topologies,”IEEE Trans. Autom. Control, vol. 50, no. 5, pp. 655–661, May 2005

work page 2005

[8] [8]

Applications of deep reinforcement learning in communications and networking: A survey,

N. C. Luong, D. T. Hoang, S. Gong, D. Niyato, P. Wang, Y .-C. Liang, and D. I. Kim, “Applications of deep reinforcement learning in communications and networking: A survey,”IEEE Commun. Surveys Tuts., vol. 21, no. 4, pp. 3133–3174, 4th Quart. 2019

work page 2019

[9] [9]

Challenges, applications and design aspects of federated learning: A survey,

K. M. J. Rahman, F. Ahmed, N. Akhter, M. Hasan, R. Amin, K. E. Aziz, A. K. M. M. Islam, M. S. H. Mukta, and A. K. M. N. Islam, “Challenges, applications and design aspects of federated learning: A survey,”IEEE Access, vol. 9, pp. 124 682–124 700, Sep. 2021

work page 2021

[10] [10]

Communication-efficient distributed learning: An overview,

X. Cao, T. Bas ¸ar, S. Diggavi, Y . C. Eldar, K. B. Letaief, H. V . Poor, and J. Zhang, “Communication-efficient distributed learning: An overview,” IEEE J. Sel. Areas Commun., vol. 41, no. 4, pp. 851–873, Apr. 2023

work page 2023

[11] [11]

Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,

F. Liu, Y . Cui, C. Masouros, J. Xu, T. X. Han, Y . C. Eldar, and S. Buzzi, “Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,”IEEE J. Sel. Areas Commun., vol. 40, no. 6, pp. 1728–1767, Jun. 2022

work page 2022

[12] [12]

Terahertz communications for massive connec- tivity and security in 6G and beyond era,

N. Yang and A. Shafie, “Terahertz communications for massive connec- tivity and security in 6G and beyond era,”IEEE Commun. Mag., vol. 62, no. 2, pp. 72–78, Feb. 2024

work page 2024

[13] [13]

Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,

G. Wang, P. Cheng, Z. Chen, B. Vucetic, and Y . Li, “Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,”IEEE Trans. Signal Process., vol. 72, pp. 2751–2766, Jun. 2024

work page 2024

[14] [14]

Terahertz communications and sensing for 6G and beyond: A comprehensive review,

W. Jiang, Q. Zhou, J. He, M. A. Habibi, S. Melnyk, M. El-Absi, B. Han, M. Di Renzo, H. D. Schotten, F.-L. Luo, T. S. El-Bawab, M. Juntti, M. Debbah, and V . C. M. Leung, “Terahertz communications and sensing for 6G and beyond: A comprehensive review,”IEEE Commun. Surveys Tuts., vol. 26, no. 4, pp. 2326–2381, 4th Quart. 2024

work page 2024

[15] [15]

Wavefront transformation- based near-field channel prediction for extremely large antenna array with mobility,

W. Li, H. Yin, Z. Qin, and M. Debbah, “Wavefront transformation- based near-field channel prediction for extremely large antenna array with mobility,”IEEE Trans. Wireless Commun., vol. 23, no. 10, pp. 15 613–15 626, Oct. 2024

work page 2024

[16] [16]

Interference and coverage analysis for terahertz networks with indoor blockage effects and line-of- sight access point association,

Y . Wu, J. Kokkoniemi, C. Han, and M. Juntti, “Interference and coverage analysis for terahertz networks with indoor blockage effects and line-of- sight access point association,”IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1472–1486, Mar. 2021

work page 2021

[17] [17]

Coverage analysis for 3D indoor terahertz communication system over multi-cluster fluctuating two-ray fading channels,

Z. Tang, N. Yang, S. Durrani, X. Zhou, M. Juntti, and J. M. Jornet, “Coverage analysis for 3D indoor terahertz communication system over multi-cluster fluctuating two-ray fading channels,”IEEE Trans. Commun., vol. 73, no. 11, pp. 12 326–12 340, Nov. 2025

work page 2025

[18] [18]

Cell-free massive MIMO versus small cells,

H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-free massive MIMO versus small cells,”IEEE Trans. Wireless Commun., vol. 16, no. 3, pp. 1834–1850, Mar. 2017

work page 2017

[19] [19]

Scalable cell-free massive MIMO systems,

E. Bj ¨ornson and L. Sanguinetti, “Scalable cell-free massive MIMO systems,”IEEE Trans. Commun., vol. 68, no. 7, pp. 4247–4261, Jul. 2020

work page 2020

[20] [20]

Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,

U. Demirhan and A. Alkhateeb, “Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,”IEEE Trans. Commun., vol. 73, no. 6, pp. 4454–4468, Jun. 2025

work page 2025

[21] [21]

Advanced learning algorithms for integrated sensing and communication (ISAC) systems in 6G and beyond: A comprehensive survey,

N. C. Luong, T. Huynh-The, T.-H. Vu, D. V . Le, H. T. Nguyen, N. D. Hai, G.-V . Nguyen, N. D. D. Anh, D. Niyato, D. I. Kim, and Q.- V . Pham, “Advanced learning algorithms for integrated sensing and communication (ISAC) systems in 6G and beyond: A comprehensive survey,”IEEE Commun. Surveys Tuts., pp. 2572–2611, 3rd Quart. 2025

work page 2025

[22] [22]

Convex optimization-based beamforming,

A. B. Gershman, N. D. Sidiropoulos, S. Shahbazpanahi, M. Bengtsson, and B. Ottersten, “Convex optimization-based beamforming,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 62–75, May 2010

work page 2010

[23] [23]

Branch-and-bound methods: A survey,

E. L. Lawler and D. E. Wood, “Branch-and-bound methods: A survey,” Oper. Res., vol. 14, no. 4, pp. 699–719, Jul. 1966

work page 1966

[24] [24]

Parallel and distributed methods for nonconvex optimization,

G. Scutari, F. Facchinei, L. Lampariello, and P. Song, “Parallel and distributed methods for nonconvex optimization,” inProc. IEEE Int. Conf. Acoust., Speech Signal Process., Florence, Italy, Jul. 2014, pp. 840–844

work page 2014

[25] [25]

Learning to optimize: Training deep neural networks for wireless resource management,

H. Sun, X. Chen, Q. Shi, M. Hong, X. Fu, and N. D. Sidiropoulos, “Learning to optimize: Training deep neural networks for wireless resource management,” inProc. IEEE Int. Workshop Signal Process. Adv. Wireless Commun., Sapporo, Japan, Jul. 2017, pp. 1–6

work page 2017

[26] [26]

DRL-driven dynamic resource allocation for task-oriented semantic communication,

H. Zhang, H. Wang, Y . Li, K. Long, and A. Nallanathan, “DRL-driven dynamic resource allocation for task-oriented semantic communication,” IEEE Trans. Commun., vol. 71, no. 7, pp. 3992–4004, Jul. 2023

work page 2023

[27] [27]

Challenges and opportunities in deep reinforcement learning with graph neural networks: A comprehensive review of algorithms and applications,

S. Munikoti, D. Agarwal, L. Das, M. Halappanavar, and B. Natarajan, “Challenges and opportunities in deep reinforcement learning with graph neural networks: A comprehensive review of algorithms and applications,”IEEE Trans. Neural Netw. Learn. Syst., vol. 35, no. 11, pp. 15 051–15 071, Nov. 2024

work page 2024

[28] [28]

Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,

X. Hao, C. She, P. L. Yeoh, Y . Liu, B. Vucetic, and Y . Li, “Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,”IEEE Trans. Wireless Commun., vol. 23, no. 12, pp. 19 820– 19 835, Dec. 2024

work page 2024

[29] [29]

Cross rayleigh and fresnel distances: Unified far-field and near-field beam training for XL-MIMO using ellipse-fitting localization,

Y . Guo, X. Guo, and Y . Wang, “Cross rayleigh and fresnel distances: Unified far-field and near-field beam training for XL-MIMO using ellipse-fitting localization,”IEEE Trans. Wireless Commun., pp. 4903– 4919, Oct. 2025

work page 2025

[30] [30]

Channel estimation for extremely large-scale MIMO: Far-field or near-field?

M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?”IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022

work page 2022

[31] [31]

Scatterer localization using large-scale antenna arrays based on a spherical wave-front parametric model,

X. Yin, S. Wang, N. Zhang, and B. Ai, “Scatterer localization using large-scale antenna arrays based on a spherical wave-front parametric model,”IEEE Trans. Wireless Commun., vol. 16, no. 10, pp. 6543–6556, Oct. 2017

work page 2017

[32] [32]

An overview of signal processing techniques for joint communication and radar sensing,

J. A. Zhang, F. Liu, C. Masouros, R. W. Heath, Z. Feng, L. Zheng, and A. Petropulu, “An overview of signal processing techniques for joint communication and radar sensing,”IEEE J. Sel. Topics Signal Process., vol. 15, no. 6, pp. 1295–1315, Nov. 2021

work page 2021

[33] [33]

S. M. Kay,Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall, 1993

work page 1993

[34] [34]

Dual- function radar communication systems: A solution to the spectrum congestion problem,

A. Hassanien, M. G. Amin, E. Aboutanios, and B. Himed, “Dual- function radar communication systems: A solution to the spectrum congestion problem,”IEEE Signal Process. Mag., vol. 36, no. 5, pp. 115–126, Sep. 2019

work page 2019

[35] [35]

Majorization-minimization algo- rithms in signal processing, communications, and machine learning,

Y . Sun, P. Babu, and D. P. Palomar, “Majorization-minimization algo- rithms in signal processing, communications, and machine learning,” IEEE Trans. Signal Process., vol. 65, no. 3, pp. 794–816, Feb. 2017

work page 2017

[36] [36]

Semidefinite relaxation of quadratic optimization problems,

Z.-q. Luo, W.-k. Ma, A. M.-c. So, Y . Ye, and S. Zhang, “Semidefinite relaxation of quadratic optimization problems,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 20–34, May 2010

work page 2010

[37] [37]

An iteratively weighted mmse approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,

Q. Shi, M. Razaviyayn, Z.-Q. Luo, and C. He, “An iteratively weighted mmse approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,”IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4331–4340, Sep. 2011

work page 2011

[38] [38]

Graph neural network for distributed beamforming and power control in massive URLLC networks,

Y . Gu, C. She, S. Bi, Z. Quan, and B. Vucetic, “Graph neural network for distributed beamforming and power control in massive URLLC networks,”IEEE Trans. Wireless Commun., vol. 23, no. 8, pp. 9099– 9112, Aug. 2024

work page 2024

[39] [39]

Heterogeneous graph trans- former,

Z. Hu, Y . Dong, K. Wang, and Y . Sun, “Heterogeneous graph trans- former,” inProc. The Web Conf., Taipei, Taiwan, Apr. 2020, pp. 2704– 2710

work page 2020

[40] [40]

Massive MIMO networks: Spectral, energy, and hardware efficiency,

E. Bj ¨ornson, J. Hoydis, and L. Sanguinetti, “Massive MIMO networks: Spectral, energy, and hardware efficiency,”Found. Trends Signal Pro- cess., vol. 11, no. 3–4, pp. 154–655, Nov. 2017

work page 2017