https://arxiv

He, J · 2019 · cs.LG · arXiv 1901.05534

6 Pith papers cite this work. Polarity classification is still indexing.

abstract

The variational autoencoder (VAE) is a popular combination of deep latent variable model and accompanying variational learning technique. By using a neural inference network to approximate the model's posterior on latent variables, VAEs efficiently parameterize a lower bound on marginal data likelihood that can be optimized directly via gradient methods. In practice, however, VAE training often results in a degenerate local optimum known as "posterior collapse" where the model learns to ignore the latent variable and the approximate posterior mimics the prior. In this paper, we investigate posterior collapse from the perspective of training dynamics. We find that during the initial stages of training the inference network fails to approximate the model's true posterior, which is a moving target. As a result, the model is encouraged to ignore the latent encoding and posterior collapse occurs. Based on this observation, we propose an extremely simple modification to VAE training to reduce inference lag: depending on the model's current mutual information between latent variable and observation, we aggressively optimize the inference network before performing each model update. Despite introducing neither new model components nor significant complexity over basic VAE, our approach is able to avoid the problem of collapse that has plagued a large amount of previous work. Empirically, our approach outperforms strong autoregressive baselines on text and image benchmarks in terms of held-out likelihood, and is competitive with more complex techniques for avoiding collapse while being substantially faster.
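
The training modification described in the abstract can be sketched as a control-flow skeleton. This is a minimal illustration under stated assumptions, not the authors' implementation: `encoder_step`, `model_step`, `max_inner_steps`, `tol`, and `still_aggressive` are hypothetical names, and both the inner-loop convergence test and the mutual-information rule are guesses at details the abstract does not specify.

```python
from typing import Callable, Sequence


def aggressive_epoch(
    batches: Sequence,
    encoder_step: Callable[[object], float],
    model_step: Callable[[object], float],
    aggressive: bool,
    max_inner_steps: int = 50,
    tol: float = 1e-4,
) -> int:
    """Run one epoch and return the number of encoder-only updates.

    encoder_step(batch): hypothetical callable that updates only the
        inference network and returns the resulting loss.
    model_step(batch): hypothetical callable that performs one update
        of the generative model.
    """
    encoder_updates = 0
    for batch in batches:
        if aggressive:
            prev_loss = float("inf")
            for _ in range(max_inner_steps):
                loss = encoder_step(batch)
                encoder_updates += 1
                # Assumed convergence test: stop once the loss
                # improves by less than tol between inner steps.
                if prev_loss - loss < tol:
                    break
                prev_loss = loss
        # Exactly one model update per batch, after the inner loop.
        model_step(batch)
    return encoder_updates


def still_aggressive(mi_history: Sequence[float]) -> bool:
    """Assumed rule: stay aggressive while estimated MI keeps rising."""
    return len(mi_history) < 2 or mi_history[-1] > mi_history[-2]
```

A caller would evaluate `still_aggressive` once per epoch on its own mutual-information estimates and pass the result as `aggressive`. When the flag is false, the loop reduces to one `model_step` per batch.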

citation-role summary

background: 2 · method: 1

citation-polarity summary

background: 2 · use method: 1

representative citing papers

Multimodal Continuous Reasoning via Asymmetric Mutual Variational Learning

cs.CV · 2026-07-01 · unverdicted · novelty 6.0

AMVL applies bidirectional KL calibration to align answer-agnostic prior with answer-conditioned posterior in variational multimodal reasoning, reducing leakage and yielding +10.83 average gain on BLINK benchmark.

Ensemble-Based Dirichlet Modeling for Predictive Uncertainty and Selective Classification

stat.ML · 2026-04-07 · unverdicted · novelty 6.0

Ensemble-based method of moments on softmax outputs produces stable Dirichlet predictive distributions that improve uncertainty-guided tasks like selective classification over evidential deep learning.

eXact-Prior Variational Autoencoder (X-VAE): Learning Data-Adaptive Gaussian Mixture Priors for Latent Distributions

stat.ML · 2026-06-30 · unverdicted · novelty 4.0

X-VAE uses empirical statistics from a pretrained autoencoder to set a data-adaptive Gaussian prior and introduces a latent scaling factor for controllable generation.

Conditional Flow-VAE for Safety-Critical Traffic Scenario Generation

cs.RO · 2026-05-06 · unverdicted · novelty 4.0

A conditional flow matching model generates realistic safety-critical traffic scenarios by turning nominal scenes into dangerous rollouts using combined simulation and real data.

Representation learning from OCT images

cs.CV · 2026-05-04 · unverdicted · novelty 3.0

A structured survey of representation learning methods for retinal OCT image analysis, covering supervised, self-supervised, generative, multimodal, and foundation model approaches along with datasets and open problems.

Synthetic Flight Data Generation Using Generative Models

cs.LG · 2026-04-22 · unverdicted · novelty 3.0

Synthetic flight data generated by TVAE and Gaussian Copula models supports flight delay prediction models with accuracy comparable to real data.

citing papers explorer

Multimodal Continuous Reasoning via Asymmetric Mutual Variational Learning

cs.CV · 2026-07-01 · unverdicted · none · ref 35 · internal anchor

Ensemble-Based Dirichlet Modeling for Predictive Uncertainty and Selective Classification

stat.ML · 2026-04-07 · unverdicted · none · ref 25

eXact-Prior Variational Autoencoder (X-VAE): Learning Data-Adaptive Gaussian Mixture Priors for Latent Distributions

stat.ML · 2026-06-30 · unverdicted · none · ref 9 · internal anchor

Conditional Flow-VAE for Safety-Critical Traffic Scenario Generation

cs.RO · 2026-05-06 · unverdicted · none · ref 33

Representation learning from OCT images

cs.CV · 2026-05-04 · unverdicted · none · ref 103

Synthetic Flight Data Generation Using Generative Models

cs.LG · 2026-04-22 · unverdicted · none · ref 48
