Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders

Hugh Salimbeni; Kai Arulkumaran; Marta Garnelo; Matthew C.H. Lee; Murray Shanahan; Nat Dilokthanakul; Pedro A.M. Mediano

arxiv: 1611.02648 · v2 · pith:2ZUIU4AZnew · submitted 2016-11-08 · 💻 cs.LG · cs.NE· stat.ML

Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders

Nat Dilokthanakul , Pedro A.M. Mediano , Marta Garnelo , Matthew C.H. Lee , Hugh Salimbeni , Kai Arulkumaran , Murray Shanahan This is my paper

classification 💻 cs.LG cs.NEstat.ML

keywords clusteringmodelunsupervisedperformancebeendeepeffectgaussian

0 comments

read the original abstract

We study a variant of the variational autoencoder model (VAE) with a Gaussian mixture as a prior distribution, with the goal of performing unsupervised clustering through deep generative models. We observe that the known problem of over-regularisation that has been shown to arise in regular VAEs also manifests itself in our model and leads to cluster degeneracy. We show that a heuristic called minimum information constraint that has been shown to mitigate this effect in VAEs can also be applied to improve unsupervised clustering performance with our model. Furthermore we analyse the effect of this heuristic and provide an intuition of the various processes with the help of visualizations. Finally, we demonstrate the performance of our model on synthetic data, MNIST and SVHN, showing that the obtained clusters are distinct, interpretable and result in achieving competitive performance on unsupervised clustering to the state-of-the-art results.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Testable Certificate for Constant Collapse in Teacher-Guided VAEs
cs.LG 2026-05 unverdicted novelty 7.0

For any fixed nonconstant teacher T, the best constant student has alignment cost exactly equal to the teacher mutual information I_T(X;T); a latent-only witness below this threshold with margin cannot be constant.
From Unsupervised to Guided Clustering: A Variational Implementation
stat.ME 2026-04 unverdicted novelty 6.0

GCVAE is a variational autoencoder that structures its latent space as a Gaussian mixture and optimizes a variational objective to make the representation maximally informative about a user-chosen guiding variable, en...
PDGMM-VAE: A Variational Autoencoder with Adaptive Per-Dimension Gaussian Mixture Model Priors for Nonlinear ICA
stat.ML 2026-03 unverdicted novelty 6.0

PDGMM-VAE recovers latent sources in nonlinear ICA by using jointly learned per-dimension GMM priors that fit source-specific marginals and reduce permutation symmetry.
Prototype Guided Post-pretraining for Single-Cell Representation Learning
cs.LG 2026-05 unverdicted novelty 5.0

CellRefine adds a marker-gene-guided post-pretraining stage to single-cell models that refines the cell embedding manifold and improves downstream task performance by up to 15%.
Heavy-Tailed Class-Conditional Priors for Long-Tailed Generative Modeling
cs.LG 2025-09 unverdicted novelty 5.0

C-t³VAE introduces class-conditional Student's t priors and a gamma-power divergence objective to improve class-balanced generation in VAEs under severe imbalance.
Risk-Aware Aerocapture Guidance Through a Probabilistic Indicator Function
eess.SY 2025-07 unverdicted novelty 5.0

A new aerocapture guidance method uses a probabilistic indicator function to estimate and mitigate failure risks, saving 71.43% to 100% of recoverable cases in high-uncertainty simulations across varied initial condit...
Learning Disentangled Representations for Generalized Multi-view Clustering
cs.CV 2026-05 unverdicted novelty 4.0

GMAE learns disentangled view-specific and view-common embeddings via dual-path autoencoders and cross-view adversarial training to boost performance on complete and incomplete multi-view clustering tasks.