Generative Semantic Communication: Diffusion Models Beyond Bit Recovery
read the original abstract
Semantic communication is expected to be one of the cores of next-generation AI-based communications. One of the possibilities offered by semantic communication is the capability to regenerate, at the destination side, images or videos semantically equivalent to the transmitted ones, without necessarily recovering the transmitted sequence of bits. The current solutions still lack the ability to build complex scenes from the received partial information. Clearly, there is an unmet need to balance the effectiveness of generation methods and the complexity of the transmitted information, possibly taking into account the goal of communication. In this paper, we aim to bridge this gap by proposing a novel generative diffusion-guided framework for semantic communication that leverages the strong abilities of diffusion models in synthesizing multimedia content while preserving semantic features. We reduce bandwidth usage by sending highly-compressed semantic information only. Then, the diffusion model learns to synthesize semantic-consistent scenes through spatially-adaptive normalizations from such denoised semantic information. We prove, through an in-depth assessment of multiple scenarios, that our method outperforms existing solutions in generating high-quality images with preserved semantic information even in cases where the received content is significantly degraded. More specifically, our results show that objects, locations, and depths are still recognizable even in the presence of extremely noisy conditions of the communication channel. The code is available at https://github.com/ispamm/GESCO.
This paper has not been read by Pith yet.
Forward citations
Cited by 5 Pith papers
-
A Causal Diffusion Model for Video Reconstruction from Ultra-Low-Bitrate Representations
A causal diffusion model reconstructs videos from ultra-low-bitrate semantics and compressed frames using temporal distillation from a bidirectional teacher, outperforming prior baselines.
-
Intention-Aware Semantic Agent Communications for AI Glasses
An intention-aware semantic agent system for AI glasses reduces bandwidth by over 50% in simulations while preserving task performance through adaptive preprocessing guided by inferred user intentions.
-
Anchor-Aided Multi-User Semantic Communication with Adaptive Decoders
A multi-user semantic communication framework uses an anchor decoder symmetric to the encoder to overcome catastrophic forgetting, enabling one frozen encoder to support adaptive decoders for users with varying comput...
-
Anchor-Aided Multi-User Semantic Communication with Adaptive Decoders
A multi-user semantic communication framework employs an anchor decoder symmetric to the encoder to mitigate catastrophic forgetting, enabling sequential training and frozen-encoder adaptation for users with distinct ...
-
Generative AI Meets 6G and Beyond: Diffusion Models for Semantic Communications
The tutorial synthesizes diffusion model techniques for generative semantic communications to achieve high compression while preserving meaning in wireless transmission.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.