Learning Energy-Based Models by Diffusion Recovery Likelihood

Ben Poole; Diederik P. Kingma; Ruiqi Gao; Yang Song; Ying Nian Wu

arxiv: 2012.08125 · v2 · pith:S6SOOBGWnew · submitted 2020-12-15 · 💻 cs.LG · stat.ML

Learning Energy-Based Models by Diffusion Recovery Likelihood

Ruiqi Gao , Yang Song , Ben Poole , Ying Nian Wu , Diederik P. Kingma This is my paper

classification 💻 cs.LG stat.ML

keywords likelihoodrecoveryconditionaldistributionsnoisesamplingdatasetsdiffusion

0 comments

read the original abstract

While energy-based models (EBMs) exhibit a number of desirable properties, training and sampling on high-dimensional datasets remains challenging. Inspired by recent progress on diffusion probabilistic models, we present a diffusion recovery likelihood method to tractably learn and sample from a sequence of EBMs trained on increasingly noisy versions of a dataset. Each EBM is trained with recovery likelihood, which maximizes the conditional probability of the data at a certain noise level given their noisy versions at a higher noise level. Optimizing recovery likelihood is more tractable than marginal likelihood, as sampling from the conditional distributions is much easier than sampling from the marginal distributions. After training, synthesized images can be generated by the sampling process that initializes from Gaussian white noise distribution and progressively samples the conditional distributions at decreasingly lower noise levels. Our method generates high fidelity samples on various image datasets. On unconditional CIFAR-10 our method achieves FID 9.58 and inception score 8.30, superior to the majority of GANs. Moreover, we demonstrate that unlike previous work on EBMs, our long-run MCMC samples from the conditional distributions do not diverge and still represent realistic images, allowing us to accurately estimate the normalized density of data even for high-dimensional datasets. Our implementation is available at https://github.com/ruiqigao/recovery_likelihood.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Theoretical Analysis of Engression and Reverse Markov Engression
stat.ME 2026-05 unverdicted novelty 7.0

Derives near-optimal nonasymptotic excess-risk bounds for Engression and reverse Markov Engression over Hölder classes via energy distance.
Diffusion Models Beat GANs on Image Synthesis
cs.LG 2021-05 accept novelty 7.0

Diffusion models with architecture improvements and classifier guidance achieve superior FID scores to GANs on unconditional and conditional ImageNet image synthesis.
Knowledge Distillation in Iterative Generative Models for Improved Sampling Speed
cs.LG 2021-01 unverdicted novelty 6.0

Denoising Student distills the multi-step denoising process of score-based and diffusion models into a single forward pass, matching GAN sampling speed while producing comparable sample quality on CIFAR-10, CelebA, an...