pith. sign in

arxiv: 2410.17966 · v3 · submitted 2024-10-23 · 📡 eess.IV · cs.CV

A Wavelet Diffusion GAN for Image Super-Resolution

classification 📡 eess.IV cs.CV
keywords diffusionsuper-resolutionapplicationsapproachgenerationhigh-fidelityimageinference
0
0 comments X
read the original abstract

In recent years, diffusion models have emerged as a superior alternative to generative adversarial networks (GANs) for high-fidelity image generation, with wide applications in text-to-image generation, image-to-image translation, and super-resolution. However, their real-time feasibility is hindered by slow training and inference speeds. This study addresses this challenge by proposing a wavelet-based conditional Diffusion GAN scheme for Single-Image Super-Resolution (SISR). Our approach utilizes the diffusion GAN paradigm to reduce the timesteps required by the reverse diffusion process and the Discrete Wavelet Transform (DWT) to achieve dimensionality reduction, decreasing training and inference times significantly. The results of an experimental validation on the CelebA-HQ dataset confirm the effectiveness of our proposed scheme. Our approach outperforms other state-of-the-art methodologies successfully ensuring high-fidelity output while overcoming inherent drawbacks associated with diffusion models in time-sensitive applications. The code is available at https://www.github.com/aloilor/WaDiGAN-SR

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Latent Wavelet Diffusion For Ultra-High-Resolution Image Synthesis

    cs.CV 2025-05 unverdicted novelty 6.0

    Latent Wavelet Diffusion uses wavelet energy map masking and a scale-consistent VAE to improve detail fidelity in 2K-4K image generation without extra inference overhead.