pith. sign in

arxiv: 2604.08306 · v1 · submitted 2026-04-09 · 📡 eess.SP

Temporal Graph Neural Network for ISAC Target Detection and Tracking

Pith reviewed 2026-05-10 17:32 UTC · model grok-4.3

classification 📡 eess.SP
keywords ISACtarget trackingtemporal graph neural networkdelay-Doppler mapmulti-target detectionnode classificationKalman filter baseline
0
0 comments X

The pith

Temporal graph neural networks track multiple targets more accurately in ISAC by classifying nodes over sequences of delay-Doppler graphs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes modeling delay-Doppler maps from wireless channels as sequences of graphs and using a temporal graph neural network to solve multi-target tracking as a temporal node classification task. This formulation performs joint clustering and data association for dynamic targets without separate stages. Evaluation on ray-tracing channel outputs across varied scenes shows lower normalized mean squared error in delay and Doppler estimates than a Kalman filter baseline. The approach aims to support reliable environment sensing in integrated sensing and communication systems for 6G. A sympathetic reader would care because accurate tracking of multiple moving targets enables better awareness of surroundings using existing communication infrastructure.

Core claim

By representing the delay-Doppler map as a sequence of graphs, the temporal graph neural network formulates tracking as temporal node classification. This enables simultaneous clustering and data association of targets using their delay and Doppler features. When compared to a Kalman filter on ray-tracing ground truth in multiple scenes with different target positions, velocities, and trajectories, the method yields reduced normalized mean squared error in delay and Doppler, producing more accurate multi-target tracking.

What carries the argument

The temporal graph neural network that converts sequences of delay-Doppler maps into graphs and performs node classification over time to track targets.

If this is right

  • Joint clustering and data association occur in one step instead of sequential processing.
  • Normalized mean squared error decreases for both delay and Doppler parameters in multi-target cases.
  • Tracking remains consistent across scenes with changing target positions, speeds, and paths.
  • A data-driven graph method offers an alternative to model-based filters like the Kalman filter.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The graph construction could extend to other sensing parameters beyond delay and Doppler if similar map sequences are available.
  • Success with simulated ray-tracing data suggests testing the same architecture on measured channel data from actual hardware.
  • Implicit handling of unknown target numbers through node classification may reduce reliance on explicit initialization steps.

Load-bearing premise

That ray-tracing channel outputs provide adequate ground truth for real target dynamics and that constructing graphs from delay-Doppler maps captures motion without extra assumptions on target count or trajectories.

What would settle it

Real-world channel measurements in which the temporal graph neural network shows no reduction in normalized mean squared error for delay and Doppler or fails to maintain correct target associations compared to a Kalman filter would disprove the performance claim.

Figures

Figures reproduced from arXiv: 2604.08306 by Marcus Grossmann, Markus Landmann, Saiedeh Maboud Sanaie, Thomas Dallmann.

Figure 1
Figure 1. Figure 1: Framework for the proposed EvolveGCN target detection and tracking. [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Construction of the delay-Doppler graph for window [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Tracking performance over the test time steps: EvolveGCN predictions [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: RMSE of Doppler and delay tracking per target for the Kalman filter [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗
read the original abstract

Integrated sensing and communication (ISAC) is a key enabler of 6G, supporting environment-aware services. A fundamental sensing task in this setting is reliable multi-target detection and tracking. This paper proposes a temporal graph neural network (TGNN)-based tracking method that exploits delay and Doppler information from the wireless channel. The delay-Doppler map is modeled as a sequence of graphs, and tracking is formulated as a temporal node classification problem, enabling joint clustering and data association of dynamic targets. Using ray-tracing-based channel outputs as ground truth, the method is evaluated across multiple scenes with varying target positions, velocities, and trajectories and is compared with a Kalman filter baseline. Results demonstrate reduced normalized mean squared error (NMSE) in delay and Doppler, leading to more accurate multi-target tracking.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes a temporal graph neural network (TGNN) method for multi-target detection and tracking in integrated sensing and communication (ISAC) systems. It represents the delay-Doppler map as a sequence of graphs and formulates tracking as a temporal node classification problem to perform joint clustering and data association. The approach is evaluated using ray-tracing-based channel simulations across scenes with varying target positions, velocities, and trajectories, and is compared to a Kalman filter baseline, claiming reduced normalized mean squared error (NMSE) in delay and Doppler estimates that yields more accurate tracking.

Significance. If the performance gains hold under more realistic conditions, the work would contribute a graph-based machine-learning framing for ISAC sensing that jointly handles clustering and association in dynamic environments. The temporal node-classification formulation is a reasonable way to leverage delay-Doppler information, and the comparison against a standard Kalman filter provides a clear baseline. However, the exclusive use of idealized ray-tracing simulations limits the immediate practical significance for real 6G ISAC deployments.

major comments (2)
  1. [Abstract and Evaluation] Abstract and Evaluation section: The central claims of reduced NMSE in delay/Doppler and superior multi-target tracking are stated without any numerical values, confidence intervals, training/validation split details, or quantitative comparison tables. This absence prevents verification of the reported improvements over the Kalman baseline.
  2. [Evaluation] Evaluation section: The method relies on ray-tracing outputs as both training labels and ground truth. Ray-tracing assumes perfect geometric knowledge, deterministic reflections, and no hardware impairments (phase noise, unmodeled multipath, or sensor noise). Because the graph construction (delay-Doppler map to nodes/edges) and the node-classification formulation inherit these idealizations, any mismatch with real ISAC channels would invalidate the claimed NMSE and tracking gains.
minor comments (2)
  1. [Method] Provide explicit definitions of node features, edge construction rules, and the temporal graph sequence formation from the delay-Doppler maps.
  2. [Evaluation] Include at least one figure or table reporting concrete NMSE values, tracking error metrics, and ablation results across the tested scenes.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive and detailed review of our manuscript. We address each major comment point by point below, indicating the revisions we will make.

read point-by-point responses
  1. Referee: [Abstract and Evaluation] Abstract and Evaluation section: The central claims of reduced NMSE in delay/Doppler and superior multi-target tracking are stated without any numerical values, confidence intervals, training/validation split details, or quantitative comparison tables. This absence prevents verification of the reported improvements over the Kalman baseline.

    Authors: We agree that the abstract does not include specific numerical values or a summary table, which limits immediate verification. In the revised manuscript we will update the abstract to report the NMSE values achieved for delay and Doppler estimates, include details on the training/validation splits used across the ray-tracing scenes, and add a concise quantitative comparison table against the Kalman filter baseline. These elements already exist in the evaluation section and will be highlighted for clarity. revision: yes

  2. Referee: [Evaluation] Evaluation section: The method relies on ray-tracing outputs as both training labels and ground truth. Ray-tracing assumes perfect geometric knowledge, deterministic reflections, and no hardware impairments (phase noise, unmodeled multipath, or sensor noise). Because the graph construction (delay-Doppler map to nodes/edges) and the node-classification formulation inherit these idealizations, any mismatch with real ISAC channels would invalidate the claimed NMSE and tracking gains.

    Authors: We acknowledge that ray-tracing simulations omit hardware impairments and unmodeled effects, which is a genuine limitation of the current evaluation. Ray-tracing is employed because it supplies the precise, labeled ground-truth trajectories required for supervised training of the temporal node-classification task; obtaining equivalent labeled data from real ISAC hardware is currently impractical. The model does incorporate deterministic multipath reflections based on scene geometry. In the revision we will add an explicit discussion subsection on these modeling assumptions, their potential impact on generalization, and planned future extensions to more realistic channel models or hardware-in-the-loop validation. revision: partial

standing simulated objections not resolved
  • Validation of the TGNN under real hardware impairments, phase noise, and unmodeled multipath conditions beyond ray-tracing simulations.

Circularity Check

0 steps flagged

No circularity in TGNN derivation for ISAC target tracking

full rationale

The paper's core derivation models delay-Doppler maps as temporal graphs and casts multi-target tracking as a supervised temporal node-classification task solved by a TGNN. Training and evaluation labels come from independent ray-tracing channel simulations, not from the model's own outputs or fitted parameters. The Kalman-filter baseline is an external comparator. No self-definitional equations, no predictions that reduce to the training inputs by construction, and no load-bearing self-citations are present. The claimed NMSE reductions are therefore measured against an external ground-truth source and are not tautological.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review performed on abstract only; no explicit free parameters, axioms, or invented entities are stated. The neural-network weights are implicitly learned but not enumerated.

pith-pipeline@v0.9.0 · 5438 in / 1093 out tokens · 84607 ms · 2026-05-10T17:32:31.131832+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

17 extracted references · 17 canonical work pages

  1. [1]

    Integrated sensing and communications: Toward dual-functional wireless networks for 6G and beyond,

    F. Liu, Y . Cui, C. Masouros, J. Xu, T. X. Han, Y . C. Eldar, and S. Buzzi, “Integrated sensing and communications: Toward dual-functional wireless networks for 6G and beyond,”IEEE J. Sel. Areas Commun., vol. 40, no. 6, pp. 1728–1767, Jun. 2022

  2. [2]

    Passive radar tracking in clutter using range and range-rate measurements,

    A. Asif, S. Kandeepan, and R. J. Evans, “Passive radar tracking in clutter using range and range-rate measurements,”Sensors, vol. 23, no. 12, p. 5451, 2023

  3. [3]

    Integrated sensing and communication based outdoor multi-target detection, tracking, and localization in practical 5G networks,

    R. Liu, M. Jian, D. Chen, X. Lin, Y . Cheng, W. Cheng, and S. Chen, “Integrated sensing and communication based outdoor multi-target detection, tracking, and localization in practical 5G networks,”Intell. Converged Netw., vol. 4, no. 3, pp. 261–272, 2023

  4. [4]

    EKF-based beamforming design for joint beam tracking and communication systems,

    M. Li, F. Dong, T. Liu, and F. Liu, “EKF-based beamforming design for joint beam tracking and communication systems,”IEEE Wireless Commun. Lett., vol. 14, no. 9, pp. 2937–2941, 2025

  5. [5]

    Wireless propagation parameter estimation with convolutional neural networks,

    S. Schieler, S. Semper, and R. Thomä, “Wireless propagation parameter estimation with convolutional neural networks,”Int. J. Microw. Wireless Technol., vol. 17, no. 5, pp. 892–899, 2025

  6. [6]

    Radar multi-object tracking using DNN features,

    M. Hassan, F. Fioranelli, A. Yarovoy, and S. Ravindran, “Radar multi-object tracking using DNN features,” inProc. 2023 IEEE Int. Radar Conf. (RADAR), 2023, pp. 1–6

  7. [7]

    Human tracking with mmWave radars: A deep learning approach with uncertainty estimation,

    J. Pegoraro and M. Rossi, “Human tracking with mmWave radars: A deep learning approach with uncertainty estimation,” inProc. 2022 IEEE 23rd Int. Workshop Signal Process. Adv. Wireless Commun. (SPA WC), 2022, pp. 1–5

  8. [8]

    Predictive ISAC beamforming using LoRA-Transformer LSTM integrated with GNN in ultra-dense D2D mmWave networks,

    H. Liu, J. Bi, B. Li, L. Xiang, X. Yang, Z. Zhang, and L. Wang, “Predictive ISAC beamforming using LoRA-Transformer LSTM integrated with GNN in ultra-dense D2D mmWave networks,” IEEE Trans. V eh. Technol., vol. 74, no. 9, pp. 13 938–13 952, 2025

  9. [9]

    Multiframe detection via graph neural networks: A link prediction approach,

    Z. Lin, C. Gao, J. Yan, Q. Zhang, B. Chen, and H. Liu, “Multiframe detection via graph neural networks: A link prediction approach,”IEEE Trans. Aerosp. Electron. Syst., vol. 61, no. 6, pp. 16 186–16 204, 2025

  10. [10]

    A sea clutter suppression method with graph neural network for maritime target detection,

    J. Tan, W. Sheng, and H. Zhu, “A sea clutter suppression method with graph neural network for maritime target detection,”IEEE Sensors J., 2025, early access

  11. [11]

    EvolveGCN: Evolving graph convolutional networks for dynamic graphs,

    A. Pareja, G. Domeniconi, J. Chen, T. Ma, T. Suzumura, H. Kanezashi, T. Kaler, T. Schardl, and C. E. Leiserson, “EvolveGCN: Evolving graph convolutional networks for dynamic graphs,” inProc. AAAI Conf. Artif. Intell., vol. 34, no. 4, 2020, pp. 5363–5370

  12. [12]

    Framework for simulation models and algorithms in ISAC networks,

    C. Smeenk, C. Schneider, and R. S. Thomä, “Framework for simulation models and algorithms in ISAC networks,” inProc. IEEE 3rd Int. Symp. Joint Commun. & Sensing (JC&S), 2023, pp. 1–6

  13. [13]

    Statistical analysis and modeling of vehicular radar cross section,

    S. J. Myint, C. Schneider, M. Roding, G. D. Galdo, and R. S. Thomä, “Statistical analysis and modeling of vehicular radar cross section,” in Proc. 13th Eur . Conf. Antennas Propag. (EuCAP), 2019, pp. 1–5

  14. [14]

    Dynamic graph node classification via time augmentation,

    J. Sun, M. Gu, C.-C. M. Yeh, Y . Fan, G. Chowdhary, and W. Zhang, “Dynamic graph node classification via time augmentation,” inProc. 2022 IEEE Int. Conf. Big Data (Big Data), 2022, pp. 800–805

  15. [15]

    Inductive representation learning on large graphs,

    W. L. Hamilton, R. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” inAdv. Neural Inf. Process. Syst. (NeurIPS), 2017, pp. 1024–1034

  16. [16]

    Robust CFAR detector with ordered statistic of sub-reference cells in multiple target situations,

    T. Jeong, S. Park, J.-W. Kim, and J.-W. Yu, “Robust CFAR detector with ordered statistic of sub-reference cells in multiple target situations,”IEEE Access, vol. 10, pp. 42 750–42 761, 2022

  17. [17]

    A density-based algorithm for discovering clusters in large spatial databases with noise,

    M. Ester, H.-P. Kriegel, J. Sander, and X. Xu, “A density-based algorithm for discovering clusters in large spatial databases with noise,” inProc. 2nd Int. Conf. Knowl. Discov. Data Min. (KDD), 1996, pp. 226–231