Towards a Definition of Disentangled Representations

Alexander Lerchner; Danilo Rezende; David Amos; David Pfau; Irina Higgins; Loic Matthey; Sebastien Racaniere

arxiv 1812.02230 v1 pith:RERGKRCD submitted 2018-12-05 cs.LG stat.ML

Irina Higgins, David Amos, David Pfau, Sebastien Racaniere, Loic Matthey, Danilo Rezende, Alexander Lerchner

classification cs.LG stat.ML

keywords world, definition, disentangled, disentangling, representation, representations, structure, transformations

Abstract

How can intelligent agents solve a diverse set of tasks in a data-efficient manner? The disentangled representation learning approach posits that such an agent would benefit from separating out (disentangling) the underlying structure of the world into disjoint parts of its representation. However, there is no generally agreed-upon definition of disentangling, not least because it is unclear how to formalise the notion of world structure beyond toy datasets with a known ground truth generative process. Here we propose that a principled solution to characterising disentangled representations can be found by focusing on the transformation properties of the world. In particular, we suggest that those transformations that change only some properties of the underlying world state, while leaving all other properties invariant, are what give exploitable structure to any kind of data. Similar ideas have already been successfully applied in physics, where the study of symmetry transformations has revolutionised the understanding of the world structure. By connecting symmetry transformations to vector representations using the formalism of group and representation theory we arrive at the first formal definition of disentangled representations. Our new definition is in agreement with many of the current intuitions about disentangling, while also providing principled resolutions to a number of previous points of contention. While this work focuses on formally defining disentangling, as opposed to solving the learning problem, we believe that the shift in perspective to studying data transformations can stimulate the development of better representation learning algorithms.
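
The idea in the abstract, that symmetry transformations acting on the world should act on separate parts of a vector representation, can be illustrated with a toy example. The sketch below is not taken from the paper. It assumes a hypothetical world consisting of a single object on a wrap-around N × N grid, a symmetry group C_N × C_N of horizontal and vertical shifts, and a hand-written encoder into R^4. All names (`act_world`, `encode`, `act_repr`) are illustrative assumptions. It checks two properties one would plausibly expect of a disentangled representation in this setting: the encoder commutes with the group action, and a shift along one axis leaves the block of the representation assigned to the other axis unchanged.

```python
import numpy as np

N = 5  # hypothetical grid size; each axis wraps around, giving the cyclic group C_N


def act_world(state, g):
    """Apply a group element g = (dx, dy) in C_N x C_N to a world state (x, y)."""
    x, y = state
    dx, dy = g
    return ((x + dx) % N, (y + dy) % N)


def encode(state):
    """Hypothetical encoder f: W -> R^4 with one 2-D block per grid axis."""
    x, y = state
    ax, ay = 2 * np.pi * x / N, 2 * np.pi * y / N
    return np.array([np.cos(ax), np.sin(ax), np.cos(ay), np.sin(ay)])


def rot(theta):
    """2-D rotation matrix by angle theta."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])


def act_repr(g):
    """Block-diagonal linear action of g on R^4: each subgroup rotates its own block."""
    dx, dy = g
    rho = np.zeros((4, 4))
    rho[:2, :2] = rot(2 * np.pi * dx / N)
    rho[2:, 2:] = rot(2 * np.pi * dy / N)
    return rho


def is_equivariant():
    """Check encode(g . w) == rho(g) @ encode(w) for every state w and group element g."""
    for x in range(N):
        for y in range(N):
            for dx in range(N):
                for dy in range(N):
                    lhs = encode(act_world((x, y), (dx, dy)))
                    rhs = act_repr((dx, dy)) @ encode((x, y))
                    if not np.allclose(lhs, rhs):
                        return False
    return True


def untouched_block_is_invariant():
    """A horizontal shift should change the x block and leave the y block fixed."""
    z = encode((1, 3))
    z_moved = act_repr((2, 0)) @ z
    y_block_fixed = np.allclose(z[2:], z_moved[2:])
    x_block_changed = not np.allclose(z[:2], z_moved[:2])
    return y_block_fixed and x_block_changed
```

Calling `is_equivariant()` and `untouched_block_is_invariant()` exercises both checks. The block-diagonal form of `act_repr` is the design choice that matters here: each factor of the group touches only its own subspace, which is one concrete reading of "changing only some properties while leaving all other properties invariant".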

Forward citations

Cited by 25 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Disentanglement Beyond Generative Models with Riemannian ICA
cs.LG 2026-05 unverdicted novelty 8.0

RICA replaces ICA's global generative model with local Riemannian geometry, introducing a disentanglement tensor based on the Hessian of the log-likelihood and Ricci curvature to measure pointwise disentanglement, whi...
The Linear Representation Hypothesis and the Geometry of Large Language Models
cs.CL 2023-11 conditional novelty 8.0

Linear representations of high-level concepts in LLMs are formalized via counterfactuals in input and output spaces, unified under a causal inner product that enables consistent probing and steering.
Unsupervised Disentanglement Without Compromises : How Functional Orthogonality Enforces Identifiability
cs.LG 2026-06 unverdicted novelty 7.0

Enforcing local orthogonality on the Jacobian of the generative mapping yields identifiability for general nonlinear models when the latent domain has full combinatorial support.
Winner-Take-All bottlenecks enforce disentangled symbolic representations in multi-task learning
cs.LG 2026-05 unverdicted novelty 7.0

WTA bottlenecks enforce highly symbolic representations of categorical latent factors in multi-task deep learning under specific conditions.
KamonBench: A Grammar-Based Dataset for Evaluating Compositional Factor Recovery in Vision-Language Models
cs.CV 2026-05 unverdicted novelty 7.0

KamonBench is a grammar-based dataset of 20,000 synthetic Japanese crests with multi-format annotations that enables direct evaluation of factor recovery beyond caption accuracy in vision-language models.
KamonBench: A Grammar-Based Dataset for Evaluating Compositional Factor Recovery in Vision-Language Models
cs.CV 2026-05 unverdicted novelty 7.0

KamonBench is a grammar-generated synthetic dataset of compositional kamon crests with explicit factor annotations to evaluate factor recovery in vision-language models.
Learning to Theorize the World from Observation
cs.LG 2026-05 unverdicted novelty 7.0

NEO is a probabilistic neural model that induces compositional programs as a learned Language of Thought from non-textual observations and executes them via a shared transition model to enable explanation-driven gener...
A framework for analyzing concept representations in neural models
cs.CL 2026-05 unverdicted novelty 7.0

A new framework shows concept subspaces are not unique, estimator choice affects containment and disentanglement, LEACE works well but generalizes poorly, and HuBERT encodes phone info as contained and disentangled fr...
Transformation Categorization Based on Group Decomposition Theory Using Parameter Division
cs.LG 2026-04 unverdicted novelty 7.0

Parameter division decomposes group transformations via parameter splitting and homomorphism constraints to enable unsupervised categorization of image transformations like rotation, translation, and scale.
Mechanistic Independence: A Principle for Identifiable Disentangled Representations
cs.LG 2025-09 unverdicted novelty 7.0

Mechanistic independence criteria yield identifiability of latent subspaces under nonlinear mixing by focusing on action-based independence rather than latent distributions, with a hierarchy and graph-theoretic view o...
Algebraic Priors for Approximately Equivariant Networks
cs.LG 2025-06 conditional novelty 7.0

Proves regular representation must appear in latent space of finite-group equivariant encoders and enforces it via auxiliary loss to match specialized equivariant models without added parameters.
Product of Orthogonal Spheres Parameterization for Disentangled Representation Learning
cs.CV 2019-07 unverdicted novelty 7.0

Proposes PrOSe parameterization of latent space as product of orthogonal spheres to improve disentangled representation learning, with closed-form ortho-normality loss under equal block size assumption.
A Deep Learning-based surrogate model for Severe Accidents in nuclear reactors using ASTEC
cs.LG 2026-07 conditional novelty 6.0

AE-NODE surrogate of ASTEC vessel physics predicts ~80 multi-physics variables with stable 10k–50k-step rollouts and >300× dimensionality reduction, running 40-hour scenarios in under a minute.
Unsupervised Causal Abstractions Discovery
cs.LG 2026-06 unverdicted novelty 6.0

Low-rank graphs induce latents that form causal abstractions, with identifiability results and a practical objective enabling unsupervised learning of high-level SCMs from low-level measurements.
Winner-Take-All bottlenecks enforce disentangled symbolic representations in multi-task learning
cs.LG 2026-05 unverdicted novelty 6.0

WTA bottlenecks enforce highly symbolic, disentangled categorical representations of latent factors under defined conditions in multi-task DNNs, shown via theorem and experiments on two datasets.
Concepts Worth Having: Refining VLM-Guided Concept Bottleneck Models with Minimal Annotations
cs.CV 2026-05 unverdicted novelty 6.0

VH-CBM uses a Gaussian process in VLM embedding space to propagate sparse human annotations and improve concept accuracy and calibration over pure VLM-guided concept bottleneck models.
A renormalization-group inspired lattice-based framework for piecewise generalized linear models
stat.ME 2026-05 unverdicted novelty 6.0

RG-inspired lattice models for piecewise GLMs provide explicit interpretable partitions and a replica-analysis-derived scaling law for regularization that allows increasing complexity without expected rise in generali...
Learning to Theorize the World from Observation
cs.LG 2026-05 unverdicted novelty 6.0

NEO induces compositional latent programs as world theories from observations and executes them to enable explanation-driven generalization.
Continuous Limits of Coupled Flows in Representation Learning
cs.LG 2026-04 unverdicted novelty 6.0

Discrete decentralized learning dynamics on manifolds converge uniformly to an overdamped Langevin SDE whose stationary states produce orthogonally disentangled, linearly separable features.
Disentangling Influence: Using Disentangled Representations to Audit Model Predictions
cs.LG 2019-06 unverdicted novelty 6.0

Disentangled representations enable a new auditing procedure to identify proxy features and quantify their influence on model outcomes more effectively than prior methods.
Disentanglement-Based Equivariant Learning for Compositional VQA
cs.CV 2026-06 unverdicted novelty 5.0

DEAL disentangles concepts from images and text using causal interventions and enforces equivariance on compositional transformations to boost generalization in VQA, outperforming prior methods on CLEVR-CoGenT and GQA-SGL.
Stimulus symmetries can confound representational similarity analyses
q-bio.NC 2026-05 unverdicted novelty 5.0

Stimulus symmetries render many neural representations functionally equivalent yet produce qualitatively different RSMs, including drifting ones from SGD or regularization in image-encoding networks.
If Concept Bottlenecks are the Question, are Foundation Models the Answer?
cs.LG 2025-04 unverdicted novelty 5.0

Empirical tests of VLM-CBMs show VLM supervision differs from expert annotations depending on task and that concept accuracy correlates weakly with quality metrics.
Affine Disentangled GAN for Interpretable and Robust AV Perception
cs.CV 2019-07 unverdicted novelty 5.0

ADIS-GAN disentangles affine transformations in a GAN to achieve over 98% classification accuracy on MNIST within 30 degrees rotation and over 90% under FGSM and PGD attacks while generating rotation and scaling factors.
Gauge theory and twins paradox of disentangled representations
cs.LG 2019-06 unverdicted novelty 3.0

Authors propose a fibre bundle gauge theory model for disentangled representations and connect it to the relativity twins paradox.