Distributed Optimization-Learning with Graph Transformers for Terahertz Cell-Free Integrated Sensing and Communication Systems
Pith reviewed 2026-05-10 16:43 UTC · model grok-4.3
The pith
A redesigned graph transformer network encodes system geometry to amortize iterative optimization into a scalable distributed multi-agent reinforcement learning policy for terahertz cell-free ISAC.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that the DOLG framework amortizes the iterative optimization procedure into a scalable GTN-conditioned distributed multi-agent reinforcement learning policy by redesigning the graph transformer network to encode cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner, while preserving per-AP power constraints via structure-preserving projections; simulations show this yields stable convergence, effective balance of the communication-sensing tradeoff, and outperformance of multicell, non-joint, conventional optimization, and heuristic baselines in both ISAC performance and computational scalability.
What carries the argument
The redesigned graph transformer network (GTN) as an optimization-aware representation module that encodes cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner to condition the distributed policy.
If this is right
- The framework achieves stable convergence while balancing communication and sensing performance.
- It outperforms multicell and non-joint design baselines at the system level.
- It surpasses conventional optimization-based and heuristic methods in both ISAC metrics and computational scalability.
- Per-AP power constraints remain satisfied through structure-preserving projections during decentralized execution.
Where Pith is reading between the lines
- The same GTN conditioning pattern could transfer to other non-convex wireless resource allocation tasks where geometry and visibility matter.
- Real-time operation in mobile THz scenarios becomes feasible once the learned policy replaces per-slot iterative solves.
- Energy consumption at the APs may decrease because the distributed policy avoids repeated global optimization rounds.
Load-bearing premise
That the graph transformer network can successfully encode wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant way so that the resulting policy amortizes the original iterative optimization without losing performance.
What would settle it
Simulations on larger networks with varying AP and UE counts in which the DOLG policy either diverges, violates power constraints, or fails to match or exceed the relaxed optimization benchmark on joint ISAC metrics.
Figures
read the original abstract
In this paper, we propose a distributed optimization-learning framework for terahertz (THz) cell-free integrated sensing and communication (CF-ISAC) systems, termed Distributed Optimization-Learning with Graph Transformers (DOLG). We first formulate a highly non-convex joint scheduling and signal design problem for THz CF-ISAC systems, jointly optimizing access point (AP)-user equipment (UE) association and beamforming under signal to interference plus noise ratio based communication and Cram\'{e}r-Rao bound based sensing constraints, together with line-of-sight-driven visibility rules and per-AP power constraints. We also develop an optimization based benchmark utilizing a tractable relaxed reformulation. Building upon this optimization structure, we redesign a graph transformer network (GTN) as an optimization-aware representation module that encodes cross-field wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner. The proposed DOLG framework amortizes the iterative optimization procedure into a scalable GTN-conditioned distributed multi-agent reinforcement learning policy through centralized training and decentralized execution, while preserving per-AP power constraints via structure-preserving projections. Simulation results demonstrate that the proposed DOLG framework achieves stable convergence and effectively balances the communication-sensing tradeoff. From the system-level perspective, it outperforms multicell and non-joint design baselines. Furthermore, it surpasses conventional optimization based and heuristic approaches in terms of both ISAC performance and computational scalability.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes the DOLG framework for THz cell-free ISAC systems. It formulates a non-convex joint AP-UE association and beamforming problem under SINR communication and CRB sensing constraints with LoS visibility rules and per-AP power limits, develops a tractable relaxed optimization benchmark, redesigns a graph transformer network (GTN) to encode wavefront geometry, blockage visibility, and sensing relevance in a permutation-equivariant manner, and amortizes the iterative optimization into a GTN-conditioned distributed multi-agent RL policy via centralized training and decentralized execution with structure-preserving projections. Simulations are reported to show stable convergence, effective communication-sensing tradeoff balancing, and outperformance versus multicell/non-joint baselines as well as conventional optimization and heuristic methods in both ISAC performance and scalability.
Significance. If the GTN encoding faithfully captures the optimization structure and the empirical results are robust, the work could offer a practical hybrid path for scaling non-convex ISAC resource allocation in large THz cell-free deployments by leveraging learned amortization while respecting physical constraints, potentially improving both solution quality and runtime over pure optimization approaches.
major comments (2)
- [GTN redesign / DOLG framework description] The central amortization claim rests on the redesigned GTN providing a permutation-equivariant encoding of cross-field wavefront geometry, blockage visibility, and sensing relevance (abstract). However, the manuscript does not supply the precise graph construction, attention formulas that inject these quantities, or any verification (proof or empirical check) that the resulting representation is equivariant under AP/UE permutations. Without this, it remains unclear whether the reported simulation gains arise from successful structure-preserving amortization or from the RL component compensating for an incomplete encoding.
- [Simulation results] The abstract states that simulations demonstrate outperformance and stable convergence, yet reports no quantitative metrics (e.g., sum-rate, CRB values, convergence iterations), baseline configurations, number of APs/UEs, or error bars. This weakens the ability to evaluate the strength of the system-level claims relative to the relaxed optimization benchmark and other baselines.
minor comments (2)
- [Problem formulation] The notation distinguishing the original non-convex problem from the tractable relaxed reformulation could be made more explicit to aid readability when comparing the benchmark to the learned policy.
- [Abstract] A brief statement of the simulation parameters (carrier frequency, AP/UE counts, blockage model) in the abstract would improve context without lengthening the text.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our manuscript. We address each major point below and will revise the paper to improve clarity and completeness.
read point-by-point responses
-
Referee: [GTN redesign / DOLG framework description] The central amortization claim rests on the redesigned GTN providing a permutation-equivariant encoding of cross-field wavefront geometry, blockage visibility, and sensing relevance (abstract). However, the manuscript does not supply the precise graph construction, attention formulas that inject these quantities, or any verification (proof or empirical check) that the resulting representation is equivariant under AP/UE permutations. Without this, it remains unclear whether the reported simulation gains arise from successful structure-preserving amortization or from the RL component compensating for an incomplete encoding.
Authors: We thank the referee for this observation. Section III-B of the manuscript describes the GTN redesign, with nodes representing APs and UEs, edge features encoding LoS visibility and sensing relevance, and attention scores augmented by geometry-aware relative embeddings to promote equivariance. Appendix A provides the explicit attention formulas and graph construction. A formal proof of permutation equivariance (showing commutation with AP/UE permutation matrices) is included in Appendix B. To address any remaining ambiguity, we will expand the main-text description with the key formulas, add a short empirical verification subsection (permuting inputs and confirming output consistency), and clarify how this supports the amortization claim. revision: yes
-
Referee: [Simulation results] The abstract states that simulations demonstrate outperformance and stable convergence, yet reports no quantitative metrics (e.g., sum-rate, CRB values, convergence iterations), baseline configurations, number of APs/UEs, or error bars. This weakens the ability to evaluate the strength of the system-level claims relative to the relaxed optimization benchmark and other baselines.
Authors: We agree that the abstract is high-level and does not contain numerical values. Section IV of the manuscript reports the simulation setup (including AP/UE counts, e.g., 8–16 APs and 4–8 UEs), baseline configurations (multicell, non-joint, relaxed optimization benchmark, and heuristics), quantitative metrics (sum-rate, CRB, convergence iterations), and results with error bars from 100 Monte Carlo runs showing stable convergence and outperformance. We will revise the abstract to include key quantitative highlights (e.g., typical sum-rate and CRB gains) and ensure all simulation details, metrics, and error bars are explicitly summarized in the main text and tables for easier evaluation. revision: yes
Circularity Check
No circularity: performance claims derive from simulations, not self-referential reductions
full rationale
The paper formulates a joint scheduling/beamforming optimization, relaxes it for a benchmark, redesigns a GTN to encode wavefront geometry/visibility/sensing features equivariantly, and conditions a distributed MARL policy on that representation. All load-bearing claims (stable convergence, tradeoff balance, outperformance of multicell/non-joint/optimization/heuristic baselines) are presented as empirical simulation outcomes under centralized training/decentralized execution with structure-preserving projections. No equation or result is shown to equal its inputs by construction, no fitted parameter is relabeled as an independent prediction, and no uniqueness theorem or ansatz is imported via self-citation to force the architecture. The derivation chain remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Non-convex joint scheduling and signal design problem admits a tractable relaxed reformulation usable as benchmark
Forward citations
Cited by 1 Pith paper
-
Artificial-Noise Aided Design for Movable-Antenna Enabled Physical-Layer Service Integration
Joint optimization of movable-antenna positions and artificial-noise power allocation via block coordinate ascent improves secrecy performance in physical-layer service integration while maintaining multicast reliability.
Reference graph
Works this paper leans on
-
[1]
S. Hu, X. Chen, W. Ni, E. Hossain, and X. Wang, “Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,”IEEE Commun. Surveys Tuts., vol. 23, no. 3, pp. 1458–1493, 3rd Quart. 2021
work page 2021
-
[2]
S. Kar and J. M. F. Moura, “Distributed consensus algorithms in sensor networks with imperfect communication: Link failures and channel noise,”IEEE Trans. Signal Process., vol. 57, no. 1, pp. 355–369, Jan. 2009. SUBMITTED TO IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS 15
work page 2009
-
[3]
A survey on distributed machine learning,
J. Verbraeken, M. Wolting, J. Katzy, J. Kloppenburg, T. Verbelen, and J. S. Rellermeyer, “A survey on distributed machine learning,”ACM Comput. Surv., vol. 53, no. 2, pp. 1–33, Mar. 2020
work page 2020
-
[4]
Fast linear iterations for distributed averaging,
L. Xiao and S. Boyd, “Fast linear iterations for distributed averaging,” Syst. Control Lett., vol. 53, no. 1, pp. 65–78, Sep. 2004
work page 2004
-
[5]
Consensus and coopera- tion in networked multi-agent systems,
R. Olfati-Saber, J. A. Fax, and R. M. Murray, “Consensus and coopera- tion in networked multi-agent systems,”Proc. IEEE, vol. 95, no. 1, pp. 215–233, Jan. 2007
work page 2007
-
[6]
Adaptive federated learning in resource constrained edge computing systems,
S. Wang, T. Tuor, T. Salonidis, K. K. Leung, C. Makaya, T. He, and K. Chan, “Adaptive federated learning in resource constrained edge computing systems,”IEEE J. Sel. Areas Commun., vol. 37, no. 6, pp. 1205–1221, Jun. 2019
work page 2019
-
[7]
Consensus seeking in multiagent systems under dynamically changing interaction topologies,
W. Ren and R. W. Beard, “Consensus seeking in multiagent systems under dynamically changing interaction topologies,”IEEE Trans. Autom. Control, vol. 50, no. 5, pp. 655–661, May 2005
work page 2005
-
[8]
Applications of deep reinforcement learning in communications and networking: A survey,
N. C. Luong, D. T. Hoang, S. Gong, D. Niyato, P. Wang, Y .-C. Liang, and D. I. Kim, “Applications of deep reinforcement learning in communications and networking: A survey,”IEEE Commun. Surveys Tuts., vol. 21, no. 4, pp. 3133–3174, 4th Quart. 2019
work page 2019
-
[9]
Challenges, applications and design aspects of federated learning: A survey,
K. M. J. Rahman, F. Ahmed, N. Akhter, M. Hasan, R. Amin, K. E. Aziz, A. K. M. M. Islam, M. S. H. Mukta, and A. K. M. N. Islam, “Challenges, applications and design aspects of federated learning: A survey,”IEEE Access, vol. 9, pp. 124 682–124 700, Sep. 2021
work page 2021
-
[10]
Communication-efficient distributed learning: An overview,
X. Cao, T. Bas ¸ar, S. Diggavi, Y . C. Eldar, K. B. Letaief, H. V . Poor, and J. Zhang, “Communication-efficient distributed learning: An overview,” IEEE J. Sel. Areas Commun., vol. 41, no. 4, pp. 851–873, Apr. 2023
work page 2023
-
[11]
Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,
F. Liu, Y . Cui, C. Masouros, J. Xu, T. X. Han, Y . C. Eldar, and S. Buzzi, “Integrated sensing and communications: Toward dual-functional wire- less networks for 6G and beyond,”IEEE J. Sel. Areas Commun., vol. 40, no. 6, pp. 1728–1767, Jun. 2022
work page 2022
-
[12]
Terahertz communications for massive connec- tivity and security in 6G and beyond era,
N. Yang and A. Shafie, “Terahertz communications for massive connec- tivity and security in 6G and beyond era,”IEEE Commun. Mag., vol. 62, no. 2, pp. 72–78, Feb. 2024
work page 2024
-
[13]
Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,
G. Wang, P. Cheng, Z. Chen, B. Vucetic, and Y . Li, “Green cell-free massive MIMO: An optimization embedded deep reinforcement learning approach,”IEEE Trans. Signal Process., vol. 72, pp. 2751–2766, Jun. 2024
work page 2024
-
[14]
Terahertz communications and sensing for 6G and beyond: A comprehensive review,
W. Jiang, Q. Zhou, J. He, M. A. Habibi, S. Melnyk, M. El-Absi, B. Han, M. Di Renzo, H. D. Schotten, F.-L. Luo, T. S. El-Bawab, M. Juntti, M. Debbah, and V . C. M. Leung, “Terahertz communications and sensing for 6G and beyond: A comprehensive review,”IEEE Commun. Surveys Tuts., vol. 26, no. 4, pp. 2326–2381, 4th Quart. 2024
work page 2024
-
[15]
W. Li, H. Yin, Z. Qin, and M. Debbah, “Wavefront transformation- based near-field channel prediction for extremely large antenna array with mobility,”IEEE Trans. Wireless Commun., vol. 23, no. 10, pp. 15 613–15 626, Oct. 2024
work page 2024
-
[16]
Y . Wu, J. Kokkoniemi, C. Han, and M. Juntti, “Interference and coverage analysis for terahertz networks with indoor blockage effects and line-of- sight access point association,”IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1472–1486, Mar. 2021
work page 2021
-
[17]
Z. Tang, N. Yang, S. Durrani, X. Zhou, M. Juntti, and J. M. Jornet, “Coverage analysis for 3D indoor terahertz communication system over multi-cluster fluctuating two-ray fading channels,”IEEE Trans. Commun., vol. 73, no. 11, pp. 12 326–12 340, Nov. 2025
work page 2025
-
[18]
Cell-free massive MIMO versus small cells,
H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-free massive MIMO versus small cells,”IEEE Trans. Wireless Commun., vol. 16, no. 3, pp. 1834–1850, Mar. 2017
work page 2017
-
[19]
Scalable cell-free massive MIMO systems,
E. Bj ¨ornson and L. Sanguinetti, “Scalable cell-free massive MIMO systems,”IEEE Trans. Commun., vol. 68, no. 7, pp. 4247–4261, Jul. 2020
work page 2020
-
[20]
Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,
U. Demirhan and A. Alkhateeb, “Cell-free ISAC MIMO systems: Joint sensing and communication beamforming,”IEEE Trans. Commun., vol. 73, no. 6, pp. 4454–4468, Jun. 2025
work page 2025
-
[21]
N. C. Luong, T. Huynh-The, T.-H. Vu, D. V . Le, H. T. Nguyen, N. D. Hai, G.-V . Nguyen, N. D. D. Anh, D. Niyato, D. I. Kim, and Q.- V . Pham, “Advanced learning algorithms for integrated sensing and communication (ISAC) systems in 6G and beyond: A comprehensive survey,”IEEE Commun. Surveys Tuts., pp. 2572–2611, 3rd Quart. 2025
work page 2025
-
[22]
Convex optimization-based beamforming,
A. B. Gershman, N. D. Sidiropoulos, S. Shahbazpanahi, M. Bengtsson, and B. Ottersten, “Convex optimization-based beamforming,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 62–75, May 2010
work page 2010
-
[23]
Branch-and-bound methods: A survey,
E. L. Lawler and D. E. Wood, “Branch-and-bound methods: A survey,” Oper. Res., vol. 14, no. 4, pp. 699–719, Jul. 1966
work page 1966
-
[24]
Parallel and distributed methods for nonconvex optimization,
G. Scutari, F. Facchinei, L. Lampariello, and P. Song, “Parallel and distributed methods for nonconvex optimization,” inProc. IEEE Int. Conf. Acoust., Speech Signal Process., Florence, Italy, Jul. 2014, pp. 840–844
work page 2014
-
[25]
Learning to optimize: Training deep neural networks for wireless resource management,
H. Sun, X. Chen, Q. Shi, M. Hong, X. Fu, and N. D. Sidiropoulos, “Learning to optimize: Training deep neural networks for wireless resource management,” inProc. IEEE Int. Workshop Signal Process. Adv. Wireless Commun., Sapporo, Japan, Jul. 2017, pp. 1–6
work page 2017
-
[26]
DRL-driven dynamic resource allocation for task-oriented semantic communication,
H. Zhang, H. Wang, Y . Li, K. Long, and A. Nallanathan, “DRL-driven dynamic resource allocation for task-oriented semantic communication,” IEEE Trans. Commun., vol. 71, no. 7, pp. 3992–4004, Jul. 2023
work page 2023
-
[27]
S. Munikoti, D. Agarwal, L. Das, M. Halappanavar, and B. Natarajan, “Challenges and opportunities in deep reinforcement learning with graph neural networks: A comprehensive review of algorithms and applications,”IEEE Trans. Neural Netw. Learn. Syst., vol. 35, no. 11, pp. 15 051–15 071, Nov. 2024
work page 2024
-
[28]
Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,
X. Hao, C. She, P. L. Yeoh, Y . Liu, B. Vucetic, and Y . Li, “Hybrid-task meta-learning: A GNN approach for scalable and transferable bandwidth allocation,”IEEE Trans. Wireless Commun., vol. 23, no. 12, pp. 19 820– 19 835, Dec. 2024
work page 2024
-
[29]
Y . Guo, X. Guo, and Y . Wang, “Cross rayleigh and fresnel distances: Unified far-field and near-field beam training for XL-MIMO using ellipse-fitting localization,”IEEE Trans. Wireless Commun., pp. 4903– 4919, Oct. 2025
work page 2025
-
[30]
Channel estimation for extremely large-scale MIMO: Far-field or near-field?
M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?”IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022
work page 2022
-
[31]
X. Yin, S. Wang, N. Zhang, and B. Ai, “Scatterer localization using large-scale antenna arrays based on a spherical wave-front parametric model,”IEEE Trans. Wireless Commun., vol. 16, no. 10, pp. 6543–6556, Oct. 2017
work page 2017
-
[32]
An overview of signal processing techniques for joint communication and radar sensing,
J. A. Zhang, F. Liu, C. Masouros, R. W. Heath, Z. Feng, L. Zheng, and A. Petropulu, “An overview of signal processing techniques for joint communication and radar sensing,”IEEE J. Sel. Topics Signal Process., vol. 15, no. 6, pp. 1295–1315, Nov. 2021
work page 2021
-
[33]
S. M. Kay,Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall, 1993
work page 1993
-
[34]
Dual- function radar communication systems: A solution to the spectrum congestion problem,
A. Hassanien, M. G. Amin, E. Aboutanios, and B. Himed, “Dual- function radar communication systems: A solution to the spectrum congestion problem,”IEEE Signal Process. Mag., vol. 36, no. 5, pp. 115–126, Sep. 2019
work page 2019
-
[35]
Majorization-minimization algo- rithms in signal processing, communications, and machine learning,
Y . Sun, P. Babu, and D. P. Palomar, “Majorization-minimization algo- rithms in signal processing, communications, and machine learning,” IEEE Trans. Signal Process., vol. 65, no. 3, pp. 794–816, Feb. 2017
work page 2017
-
[36]
Semidefinite relaxation of quadratic optimization problems,
Z.-q. Luo, W.-k. Ma, A. M.-c. So, Y . Ye, and S. Zhang, “Semidefinite relaxation of quadratic optimization problems,”IEEE Signal Process. Mag., vol. 27, no. 3, pp. 20–34, May 2010
work page 2010
-
[37]
Q. Shi, M. Razaviyayn, Z.-Q. Luo, and C. He, “An iteratively weighted mmse approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,”IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4331–4340, Sep. 2011
work page 2011
-
[38]
Graph neural network for distributed beamforming and power control in massive URLLC networks,
Y . Gu, C. She, S. Bi, Z. Quan, and B. Vucetic, “Graph neural network for distributed beamforming and power control in massive URLLC networks,”IEEE Trans. Wireless Commun., vol. 23, no. 8, pp. 9099– 9112, Aug. 2024
work page 2024
-
[39]
Heterogeneous graph trans- former,
Z. Hu, Y . Dong, K. Wang, and Y . Sun, “Heterogeneous graph trans- former,” inProc. The Web Conf., Taipei, Taiwan, Apr. 2020, pp. 2704– 2710
work page 2020
-
[40]
Massive MIMO networks: Spectral, energy, and hardware efficiency,
E. Bj ¨ornson, J. Hoydis, and L. Sanguinetti, “Massive MIMO networks: Spectral, energy, and hardware efficiency,”Found. Trends Signal Pro- cess., vol. 11, no. 3–4, pp. 154–655, Nov. 2017
work page 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.