Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations

Bernhard Sch\"olkopf; Francesco Locatello; Gunnar R\"atsch; Mario Lucic; Olivier Bachem; Stefan Bauer; Sylvain Gelly

arxiv: 1811.12359 · v4 · pith:J66SYIHCnew · submitted 2018-11-29 · 💻 cs.LG · cs.AI· stat.ML

Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations

Francesco Locatello , Stefan Bauer , Mario Lucic , Gunnar R\"atsch , Sylvain Gelly , Bernhard Sch\"olkopf , Olivier Bachem This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords learningdatarepresentationsunsuperviseddisentangleddisentanglementmodelsassumptions

0 comments

read the original abstract

The key idea behind the unsupervised learning of disentangled representations is that real-world data is generated by a few explanatory factors of variation which can be recovered by unsupervised learning algorithms. In this paper, we provide a sober look at recent progress in the field and challenge some common assumptions. We first theoretically show that the unsupervised learning of disentangled representations is fundamentally impossible without inductive biases on both the models and the data. Then, we train more than 12000 models covering most prominent methods and evaluation metrics in a reproducible large-scale experimental study on seven different data sets. We observe that while the different methods successfully enforce properties ``encouraged'' by the corresponding losses, well-disentangled models seemingly cannot be identified without supervision. Furthermore, increased disentanglement does not seem to lead to a decreased sample complexity of learning for downstream tasks. Our results suggest that future work on disentanglement learning should be explicit about the role of inductive biases and (implicit) supervision, investigate concrete benefits of enforcing disentanglement of the learned representations, and consider a reproducible experimental setup covering several data sets.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Prognostic Value of Lung Ultrasound Biomarkers for Readmission Risk in Congestive Heart Failure: A Pilot Data-Driven Analysis
eess.SP 2026-05 unverdicted novelty 6.0

Pilot study uses pretrained video encoder features from lung ultrasound to predict 30-day CHF readmission, finding lower-lung views and temporal differences most informative with top MLP F1 of 0.80.
Affine Disentangled GAN for Interpretable and Robust AV Perception
cs.CV 2019-07 unverdicted novelty 5.0

ADIS-GAN disentangles affine transformations in a GAN to achieve over 98% classification accuracy on MNIST within 30 degrees rotation and over 90% under FGSM and PGD attacks while generating rotation and scaling factors.