Isolating Sources of Disentanglement in Variational Autoencoders

David Duvenaud; Ricky T. Q. Chen; Roger Grosse; Xuechen Li

Isolating Sources of Disentanglement in Variational Autoencoders

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1802.04942 v5 pith:P4YLH564 submitted 2018-02-14 cs.LG cs.AIstat.ML

Isolating Sources of Disentanglement in Variational Autoencoders

Ricky T. Q. Chen , Xuechen Li , Roger Grosse , David Duvenaud This is my paper

classification cs.LG cs.AIstat.ML

keywords correlationdisentanglementtotalbetalatentvariablesvariationaladditional

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

We decompose the evidence lower bound to show the existence of a term measuring the total correlation between latent variables. We use this to motivate our $\beta$-TCVAE (Total Correlation Variational Autoencoder), a refinement of the state-of-the-art $\beta$-VAE objective for learning disentangled representations, requiring no additional hyperparameters during training. We further propose a principled classifier-free measure of disentanglement called the mutual information gap (MIG). We perform extensive quantitative and qualitative experiments, in both restricted and non-restricted settings, and show a strong relation between total correlation and disentanglement, when the latent variables model is trained using our framework.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Distributional Autoencoders Know the Score
stat.ML 2025-02 unverdicted novelty 6.0

DPA provides closed-form relation from level-set geometry to data score and proves extra latent components are conditionally independent, revealing intrinsic dimension.
Deep Attention Reweighting: Post-Hoc Attention-Based Feature Aggregation in CNNs for Disentangling Core and Spurious Features under Spurious Correlations
cs.CV 2026-05 unverdicted novelty 5.0

DAR replaces GAP with an attention-based aggregation module retrained jointly with the classifier head to disentangle core from spurious features and outperforms DFR on multiple datasets.