Stochastic Backpropagation and Approximate Inference in Deep Generative Models

Daan Wierstra; Danilo Jimenez Rezende; Shakir Mohamed

arxiv: 1401.4082 · v3 · pith:Z6GRZY2Rnew · submitted 2014-01-16 · 📊 stat.ML · cs.AI· cs.LG· stat.CO· stat.ME

Stochastic Backpropagation and Approximate Inference in Deep Generative Models

Danilo Jimenez Rezende , Shakir Mohamed , Daan Wierstra This is my paper

classification 📊 stat.ML cs.AIcs.LGstat.COstat.ME

keywords datastochasticalgorithmapproximatedeepgenerativeinferencemodel

0 comments

read the original abstract

We marry ideas from deep neural networks and approximate Bayesian inference to derive a generalised class of deep, directed generative models, endowed with a new algorithm for scalable inference and learning. Our algorithm introduces a recognition model to represent approximate posterior distributions, and that acts as a stochastic encoder of the data. We develop stochastic back-propagation -- rules for back-propagation through stochastic variables -- and use this to develop an algorithm that allows for joint optimisation of the parameters of both the generative and recognition model. We demonstrate on several real-world data sets that the model generates realistic samples, provides accurate imputations of missing data and is a useful tool for high-dimensional data visualisation.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 16 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Generative Adversarial Networks
stat.ML 2014-06 accept novelty 9.0

A generative model is trained to match a data distribution by competing in a minimax game against a discriminator, reaching an equilibrium where the generator recovers the true distribution and the discriminator outpu...
Denoising Diffusion Implicit Models
cs.LG 2020-10 unverdicted novelty 8.0

DDIMs construct non-Markovian diffusion processes that share DDPM training objectives but allow much faster reverse sampling, demonstrated empirically at 10-50x wall-clock speedup.
Categorical Reparameterization with Gumbel-Softmax
stat.ML 2016-11 unverdicted novelty 8.0

Gumbel-Softmax provides a continuous relaxation of categorical sampling that anneals to discrete samples for gradient-based optimization.
Density estimation using Real NVP
cs.LG 2016-05 accept novelty 8.0

Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.
NICE: Non-linear Independent Components Estimation
cs.LG 2014-10 accept novelty 8.0

NICE learns a composition of invertible neural-network layers that transform data into independent latent variables, enabling exact log-likelihood training and sampling for density estimation.
Mastering Atari with Discrete World Models
cs.LG 2020-10 accept novelty 7.0

DreamerV2 reaches human-level performance on 55 Atari games by learning behaviors inside a separately trained discrete-latent world model.
Dream to Control: Learning Behaviors by Latent Imagination
cs.LG 2019-12 accept novelty 7.0

Dreamer learns to control from images by imagining and optimizing behaviors in a learned latent world model, outperforming prior methods on 20 visual tasks in data efficiency and final performance.
CredibleDFGO: Differentiable Factor Graph Optimization with Credibility Supervision
eess.SP 2026-05 unverdicted novelty 6.0

CredibleDFGO adds explicit supervision of covariance credibility to differentiable factor graph optimization for GNSS by using proper scoring rules on the predictive distribution, yielding more trustworthy uncertainti...
GazeVaLM: A Multi-Observer Eye-Tracking Benchmark for Evaluating Clinical Realism in AI-Generated X-Rays
cs.CV 2026-04 unverdicted novelty 6.0

GazeVaLM provides 960 gaze recordings from 16 radiologists on 60 chest X-rays (half synthetic) plus LLM predictions for diagnostic accuracy and real-fake detection under matched conditions.
MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework
cs.AI 2023-08 unverdicted novelty 6.0

MetaGPT embeds human SOPs into LLM prompts to create role-specialized agent teams that produce more coherent solutions on collaborative software engineering tasks than prior chat-based multi-agent systems.
A Deep Generative Model for Code-Switched Text
cs.CL 2019-06 unverdicted novelty 6.0

VACS is a two-level hierarchical VAE that generates diverse code-switched sentences, and augmenting monolingual data with its output reduces language model perplexity by 33.06%.
Information theoretic underpinning of self-supervised learning by clustering
cs.LG 2026-05 unverdicted novelty 5.0

SSL clustering is derived as KL-divergence optimization where a teacher-distribution constraint normalizes via inverse cluster priors and simplifies to batch centering by Jensen's inequality.
Molecular Design beyond Training Data with Novel Extended Objective Functionals of Generative AI Models Driven by Quantum Annealing Computer
q-bio.QM 2026-02 unverdicted novelty 5.0

Quantum annealing combined with a Neural Hash Function lets generative models create molecules that are more drug-like than classical versions or the training set itself.
Shaping Belief States with Generative Environment Models for RL
cs.LG 2019-06 unverdicted novelty 5.0

Multi-step predictive generative models form stable belief states capturing environment layout and agent pose, yielding higher data efficiency on RL tasks than model-free agents.
A Unified Measure-Theoretic View of Diffusion, Score-Based, and Flow Matching Generative Models
cs.LG 2026-05 unverdicted novelty 4.0

Diffusion, score-based, and flow matching models are unified as instances of learning time-dependent vector fields inducing marginal distributions governed by continuity and Fokker-Planck equations.
Convolutional Reservoir Computing for World Models
cs.LG 2019-07 unverdicted novelty 4.0

RCRC uses untrained random CNNs and reservoir computing plus evolution strategies to reach claimed state-of-the-art scores in reinforcement learning tasks while avoiding data storage and heavy training.