pith. machine review for the scientific record. sign in

arxiv: 1804.03599 · v1 · submitted 2018-04-10 · 📊 stat.ML · cs.AI· cs.LG

Recognition: unknown

Understanding disentangling in β-VAE

Authors on Pith no claims yet
classification 📊 stat.ML cs.AIcs.LG
keywords betatrainingdisentangledmodificationrepresentationsaccuracyalignedassessments
0
0 comments X
read the original abstract

We present new intuitions and theoretical assessments of the emergence of disentangled representation in variational autoencoders. Taking a rate-distortion theory perspective, we show the circumstances under which representations aligned with the underlying generative factors of variation of data emerge when optimising the modified ELBO bound in $\beta$-VAE, as training progresses. From these insights, we propose a modification to the training regime of $\beta$-VAE, that progressively increases the information capacity of the latent code during training. This modification facilitates the robust learning of disentangled representations in $\beta$-VAE, without the previous trade-off in reconstruction accuracy.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 12 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Inference-Time Refinement Closes the Synthetic-Real Gap in Tabular Diffusion

    cs.LG 2026-05 unverdicted novelty 8.0

    Inference-time refinement of pre-trained tabular diffusion models via Bidirectional Chamfer Refinement achieves median 8.6% better downstream performance than real data across 15 benchmarks while preserving fidelity a...

  2. Gradient-Based Program Synthesis with Neurally Interpreted Languages

    cs.LG 2026-04 unverdicted novelty 8.0

    NLI autonomously discovers a vocabulary of primitive operations and interprets variable-length programs via a neural executor, allowing end-to-end training and gradient-based test-time adaptation that outperforms prio...

  3. StrADiff: A Structured Source-Wise Adaptive Diffusion Framework for Linear and Nonlinear Blind Source Separation

    stat.ML 2026-04 unverdicted novelty 7.0

    StrADiff recovers latent source trajectories from linear and nonlinear mixtures via source-wise adaptive diffusion and a Gaussian process prior in a single unsupervised end-to-end objective.

  4. Unsupervised learning of acquisition variability in structural connectomes via hybrid latent space modeling

    cs.LG 2026-05 unverdicted novelty 6.0

    A hybrid VAE with architectural annealing learns discrete clusters aligned with scanner and protocol differences in a dataset of 7416 structural connectomes spanning 13 studies.

  5. A renormalization-group inspired lattice-based framework for piecewise generalized linear models

    stat.ME 2026-05 unverdicted novelty 6.0

    RG-inspired lattice models for piecewise GLMs provide explicit interpretable partitions and a replica-analysis-derived scaling law for regularization that allows increasing complexity without expected rise in generali...

  6. Discovering quantum phenomena with Interpretable Machine Learning

    quant-ph 2026-04 unverdicted novelty 6.0

    Variational autoencoders combined with symbolic regression extract physically meaningful representations and order parameters from raw quantum measurement data, revealing new phenomena such as corner-ordering in Rydbe...

  7. Cross-Modal Generation: From Commodity WiFi to High-Fidelity mmWave and RFID Sensing

    cs.LG 2026-04 unverdicted novelty 6.0

    RF-CMG synthesizes high-quality mmWave and RFID signals from WiFi using a diffusion model with Modality-Guided Embedding for high-frequency details and Low-Frequency Modality Consistency to preserve physical structure.

  8. From Unsupervised to Guided Clustering: A Variational Implementation

    stat.ME 2026-04 unverdicted novelty 6.0

    GCVAE is a variational autoencoder that structures its latent space as a Gaussian mixture and optimizes a variational objective to make the representation maximally informative about a user-chosen guiding variable, en...

  9. Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

    cs.LG 2024-03 unverdicted novelty 6.0

    Sparse feature circuits are introduced as interpretable causal subnetworks in language models, supporting unsupervised discovery of thousands of circuits and a method called SHIFT to improve classifier generalization ...

  10. To Use AI as Dice of Possibilities with Timing Computation

    cs.AI 2026-05 unverdicted novelty 5.0

    Proposes verb-based paradigm with timing computation to enable data-driven discovery of patient trajectories and counterfactual timing from EHR data without domain knowledge.

  11. Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds

    cs.LG 2026-04 unverdicted novelty 5.0

    Aligning the DDIM forward diffusion process with flow-matching manifold evolution enables high-quality generation without time conditioning, and class-conditional synthesis is possible with an unconditional denoiser b...

  12. A Systematic Framework for Tabular Data Disentanglement

    cs.LG 2026-04 unverdicted novelty 5.0

    A systematic framework modularizes tabular data disentanglement into data extraction, modeling, analysis, and latent extrapolation, with a case study on synthetic data generation.