pith. sign in

Knowledge Distillation for Lightweight Multimodal Sensing-Aided mmWave Beam Tracking

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

Beam training and prediction in real-world millimeter-wave (mmWave) communications systems are challenging due to rapidly time-varying channels and strong interference from surrounding objects. In this context, widely available sensors, such as cameras and radars, can capture rich environmental information, enabling efficient beam management. This paper proposes a knowledge-distillation (KD)-enabled learning framework for developing lightweight and low-complexity models for beam prediction and tracking using real-world camera and radar data from the DeepSense 6G dataset. Specifically, a powerful teacher network based on convolutional neural networks (CNNs) and gated recurrent units (GRUs) is first designed to predict current and future beams from historical sensor observations. Then, a compact student model is constructed and trained via KD to transfer the predictive capability of the teacher model to a lightweight architecture. Simulation results demonstrate that jointly leveraging radar and image modalities significantly outperforms single-modality approaches. Moreover, the proposed student model achieves over 96% Top-5 beam prediction accuracy while reducing computational complexity by more than 4 times and the number of parameters by over 27 times compared with the teacher model.

fields

eess.SP 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Lightweight Vision-Aided Beam Tracking for Cross-Environment mmWave Communications eess.SP · 2026-07-01 · unverdicted · none · ref 11 · internal anchor

    Lightweight CNN with separable convolutions, hierarchical augmentation and power-based label smoothing reaches 84% cross-environment beam prediction accuracy on two real DeepSense 6G scenarios while cutting parameters by 52x and complexity by 79x versus ResNet.