Towards Deeper Understanding of Variational Autoencoding Models

Jiaming Song; Shengjia Zhao; Stefano Ermon

arxiv: 1702.08658 · v1 · pith:D7SNEWM7new · submitted 2017-02-28 · 💻 cs.LG · stat.ML

Towards Deeper Understanding of Variational Autoencoding Models

Shengjia Zhao , Jiaming Song , Stefano Ermon This is my paper

classification 💻 cs.LG stat.ML

keywords featureslatentproposeconditionsmodelsoptimizationsamplesvariational

0 comments

read the original abstract

We propose a new family of optimization criteria for variational auto-encoding models, generalizing the standard evidence lower bound. We provide conditions under which they recover the data distribution and learn latent features, and formally show that common issues such as blurry samples and uninformative latent features arise when these conditions are not met. Based on these new insights, we propose a new sequential VAE model that can generate sharp samples on the LSUN image dataset based on pixel-wise reconstruction loss, and propose an optimization criterion that encourages unsupervised learning of informative latent features.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Not Too Generative, Not Too Discriminative: The Human Alignment Sweet Spot
cs.CV 2026-05 unverdicted novelty 6.0

Hybrid JEMs at intermediate generative-discriminative balance maximize human alignment on perceptual similarity, gloss, uncertainty, robustness, cue conflict, and feature attribution benchmarks.