PilotWiMAE pretrains an encoder on noisy pilots with factorized attention, 99% masking, patch-normalized reconstruction, scale loss, and AWGN curriculum to outperform supervised baselines in cross-frequency beam selection and channel tasks from 3.5 GHz pretraining to 28 GHz evaluation.
WiFo-2: a generalist foundation model unifies heterogeneous wireless system design
6 Pith papers cite this work. Polarity classification is still indexing.
abstract
Emerging sixth-generation wireless systems are increasingly heterogeneous, with compatibility across diverse configurations, ubiquitous coverage, and expanded functionalities. Although deep learning has substantially benefited wireless system design, existing approaches are typically trained for specific system settings and scenarios with limited generalizability. Here we present WiFo-2, a space-time-frequency foundation model for unified wireless communications and sensing system design. Pretrained on a heterogeneous dataset of 11.6 billion channel state information (CSI) points, WiFo-2 learns generalized wireless representations across scenarios, configurations, and tasks, and exhibits scaling-law behavior. WiFo-2 achieves reliable and accurate zero-shot channel reconstruction, outperforming fully supervised task-specific models. With only 1% of the training samples required by supervised AI models, WiFo-2 achieves state-of-the-art performance across 9 distinct wireless tasks. A functional hardware prototype further demonstrates its real-world deployability and superior capability across diverse wireless tasks. This work provides a versatile wireless design framework and advances understanding of wireless channels.
years
2026 6verdicts
UNVERDICTED 6representative citing papers
FARM is a foundation model combining masked autoencoders and diffusion decoders to estimate high-resolution aerial radio maps from a new multi-band low-altitude dataset, claiming superior accuracy and generalization over prior methods.
WiFo-MiSAC is a task-agnostic foundation model that unifies multimodal wireless signals via tokenization and self-supervised learning with SS-DMoE to achieve strong few-shot performance on beam prediction and channel estimation.
AirFM-DDA reparameterizes wireless channel data into the delay-Doppler-angle domain and uses efficient window attention to achieve better zero-shot performance on channel prediction and estimation with lower compute cost.
A pre-trained interference-aware graph Transformer model for wireless resource allocation that achieves strong few-shot adaptation to new tasks and scenarios.
Adaptive 3D-RoPE adapts rotary positional encoding to wireless channel physics via learnable 3D frequencies and dynamic CSI control, yielding up to 10.7 dB NMSE gains in scale extrapolation and 1 dB in zero-shot tasks.
citing papers explorer
-
PilotWiMAE: Pilot-Native Representation Learning for Wireless Channels
PilotWiMAE pretrains an encoder on noisy pilots with factorized attention, 99% masking, patch-normalized reconstruction, scale loss, and AWGN curriculum to outperform supervised baselines in cross-frequency beam selection and channel tasks from 3.5 GHz pretraining to 28 GHz evaluation.
-
FARM: Foundational Aerial Radio Map for Intelligent Low-Altitude Networking
FARM is a foundation model combining masked autoencoders and diffusion decoders to estimate high-resolution aerial radio maps from a new multi-band low-altitude dataset, claiming superior accuracy and generalization over prior methods.
-
WiFo-MiSAC: A Wireless Foundation Model for Multimodal Sensing and Communication Integration via Synesthesia of Machines (SoM)
WiFo-MiSAC is a task-agnostic foundation model that unifies multimodal wireless signals via tokenization and self-supervised learning with SS-DMoE to achieve strong few-shot performance on beam prediction and channel estimation.
-
AirFM-DDA: Air-Interface Foundation Model in the Delay-Doppler-Angle Domain for AI-Native 6G
AirFM-DDA reparameterizes wireless channel data into the delay-Doppler-angle domain and uses efficient window attention to achieve better zero-shot performance on channel prediction and estimation with lower compute cost.
-
A Graph Foundation Model for Wireless Resource Allocation
A pre-trained interference-aware graph Transformer model for wireless resource allocation that achieves strong few-shot adaptation to new tasks and scenarios.
-
Adaptive 3D-RoPE: Physics-Aligned Rotary Positional Encoding for Wireless Foundation Models
Adaptive 3D-RoPE adapts rotary positional encoding to wireless channel physics via learnable 3D frequencies and dynamic CSI control, yielding up to 10.7 dB NMSE gains in scale extrapolation and 1 dB in zero-shot tasks.