Entropy-Based Characterisation of the Polarised Regime in Latent Variable Models
Pith reviewed 2026-05-20 19:30 UTC · model grok-4.3
The pith
The entropy of the mean representation classifies active dimensions in the polarised regime of latent variable models without relying on a Gaussian prior.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors propose an information-theoretic classification of the polarised regime in latent variable models based on the entropy of the mean representation. They demonstrate theoretically that this entropy is coupled to KL minimisation via entropy-variance bounds and relate the criterion to Bonheme's active/passive conditions. The criterion recovers the polarised regime consistently across beta-VAEs, identifiable VAEs, least-volume autoencoders and L2-regularised autoencoders. Entropy of the mean alone cannot distinguish active from mixed dimensions without variance signals, but passive dimensions yield small consistent improvements on downstream tasks when codes are normalised, suggesting
What carries the argument
the entropy of the mean representation, which classifies active dimensions by its coupling to the KL term via variance bounds
If this is right
- The proposed entropy criterion applies to variational models with various priors without requiring Gaussian assumptions.
- Passive dimensions can improve downstream task performance when latent codes are appropriately normalised.
- The entropy measure alone requires additional variance information to separate active from mixed dimensions.
- The classification recovers polarised regimes whenever they appear in the tested model classes.
Where Pith is reading between the lines
- This criterion could be used to monitor latent dimension usage during training in a wider range of generative models.
- Appropriate scaling of all latent codes might allow models to retain more information without changing the training objective.
- Testing the entropy criterion on models with non-Gaussian or discrete latent variables would check its broader applicability beyond the studied cases.
Load-bearing premise
The variance bounds that tie mean entropy to KL minimisation must be valid for the model and prior being used.
What would settle it
Running the entropy classification on a variational model with a heavy-tailed prior and checking if it matches the dimensions that actually contribute to reducing the KL term would test the claim.
Figures
read the original abstract
Variational Autoencoders (VAEs) often exhibit a polarised regime in which latent variables separate into active, passive, and mixed subsets. Existing criteria for identifying active dimensions depend on a Gaussian prior, limiting their applicability to variational models and specific priors. We propose a simple information-theoretic classification of the polarised regime based on the entropy of the mean representation. We show theoretically how this entropy couples to KL minimisation through entropy--variance bounds, and we relate the resulting criterion to Bonheme's active/passive conditions. We also clarify a key limitation: entropy of the mean alone cannot reliably distinguish active from mixed dimensions without additional signals from the variance representation. Empirically, we evaluate the entropy criterion on $\beta$-VAEs, identifiable VAEs, Least-Volume Autoencoders, and L2-regularised autoencoders, and find that it consistently recovers a polarised regime when such a regime is present across the model classes studied. Finally, we show that passive dimensions can yield small but consistent improvements on downstream tasks when latent codes are appropriately normalised, suggesting that collapse is often a matter of scale rather than absolute information removal.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes an entropy-based criterion for characterizing the polarised regime in latent variable models such as VAEs, using the entropy of the mean representation to classify active, passive, and mixed dimensions. It claims a theoretical coupling of this entropy to KL minimisation via entropy-variance bounds, relates the criterion to Bonheme's active/passive conditions, explicitly notes the limitation that mean entropy alone cannot separate active from mixed dimensions without variance signals, and reports empirical consistency in recovering the polarised regime across β-VAEs, identifiable VAEs, Least-Volume Autoencoders, and L2-regularised autoencoders. It further suggests that passive dimensions can yield small downstream improvements when latent codes are normalised.
Significance. If the entropy-variance bounds hold generally and the empirical consistency is robust, the work could provide a useful prior-independent information-theoretic tool for analysing latent polarisation in variational models, extending beyond Gaussian-specific criteria. The cross-architecture evaluation and the practical observation on normalised passive dimensions are constructive contributions that could aid representation learning research.
major comments (2)
- [Theoretical analysis section] Theoretical derivation of entropy-variance bounds: the central claim that mean entropy couples to KL minimisation (and thereby classifies active/passive dimensions) rests on these bounds. The manuscript must explicitly state the assumptions under which the bounds are derived and verify their tightness for the non-Gaussian priors and regularised objectives used in the β-VAE, identifiable VAE, and L2-regularised experiments; if the bounds become loose outside standard Gaussian variational families, the claimed coupling and classification do not reliably follow from mean entropy alone.
- [Section discussing limitations of the mean entropy criterion] Clarification of limitation and its impact on the criterion: the paper correctly notes that entropy of the mean cannot reliably distinguish active from mixed dimensions without variance signals. This limitation should be quantified (e.g., via an explicit statement of the additional variance information required) because it directly affects whether the proposed entropy criterion can stand alone as a classification method or still depends on signals similar to those in existing approaches.
minor comments (2)
- [Abstract and experimental setup] Ensure consistent terminology between the abstract (Least-Volume Autoencoders) and the experimental section for all four model classes.
- [Methods or notation section] Provide an explicit mathematical definition of 'entropy of the mean representation' at the first point of use, including the precise expectation or summation involved.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed comments, which help clarify the presentation of our theoretical and empirical contributions. We address each major comment below and outline the revisions we will make.
read point-by-point responses
-
Referee: [Theoretical analysis section] Theoretical derivation of entropy-variance bounds: the central claim that mean entropy couples to KL minimisation (and thereby classifies active/passive dimensions) rests on these bounds. The manuscript must explicitly state the assumptions under which the bounds are derived and verify their tightness for the non-Gaussian priors and regularised objectives used in the β-VAE, identifiable VAE, and L2-regularised experiments; if the bounds become loose outside standard Gaussian variational families, the claimed coupling and classification do not reliably follow from mean entropy alone.
Authors: We agree that the assumptions must be stated explicitly. In the revised manuscript we will add a dedicated paragraph in the theoretical analysis section listing the assumptions (Gaussian variational posteriors, standard normal prior, and the specific entropy-variance inequality used). For tightness outside these assumptions, we acknowledge that the bounds are derived under Gaussian variational families and may loosen for non-Gaussian or heavily regularised objectives. Nevertheless, the empirical results across β-VAEs, identifiable VAEs, Least-Volume Autoencoders and L2-regularised autoencoders show consistent recovery of the polarised regime, indicating that the mean-entropy criterion remains practically useful even when the theoretical coupling is approximate. We will add a short discussion of this point. revision: partial
-
Referee: [Section discussing limitations of the mean entropy criterion] Clarification of limitation and its impact on the criterion: the paper correctly notes that entropy of the mean cannot reliably distinguish active from mixed dimensions without variance signals. This limitation should be quantified (e.g., via an explicit statement of the additional variance information required) because it directly affects whether the proposed entropy criterion can stand alone as a classification method or still depends on signals similar to those in existing approaches.
Authors: We will expand the limitations paragraph to quantify the requirement: distinguishing active from mixed dimensions requires at least one additional signal from the variance representation (e.g., the entropy of the per-dimension variances or a threshold on the average variance). We will state explicitly that mean entropy alone is therefore not a fully standalone classifier and is intended to be combined with variance information, consistent with the spirit of prior active/passive criteria. This clarification will be added without altering the core claim that mean entropy provides a prior-independent indicator of polarisation. revision: yes
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper proposes an entropy-based criterion for the polarised regime and derives its coupling to KL minimisation via entropy-variance bounds under the stated variational and prior assumptions. The relation to Bonheme's active/passive conditions is presented as an additional connection rather than the foundation or definition of the new result. No equations or claims reduce by construction to fitted inputs, self-definitions, or unverified self-citations; the theoretical steps rely on modeling assumptions that are external to the target classification. The work is therefore self-contained against external benchmarks with no load-bearing circular steps.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Entropy-variance bounds link the entropy of the mean latent representation to the KL divergence term under the variational family.
Reference graph
Works this paper leans on
-
[1]
Junxian He and Daniel Spokoyny and Graham Neubig and Taylor Berg-Kirkpatrick , journal=. 2019 , volume=
work page 2019
-
[2]
Jordan and Zoubin Ghahramani and T
Michael I. Jordan and Zoubin Ghahramani and T. Jaakkola and Lawrence K. Saul , journal=. 1999 , volume=
work page 1999
-
[3]
Yuhta Takida and Wei-Hsiang Liao and T. Uesaka and S. Takahashi and Yuki Mitsufuji , journal=. 2021 , volume=
work page 2021
- [4]
-
[5]
Samuel R. Bowman and L. Vilnis and Oriol Vinyals and Andrew M. Dai and R. J. CoNLL , year=
-
[6]
International Conference on Learning Representations (ICLR) , year=
Auto-Encoding Variational Bayes , author=. International Conference on Learning Representations (ICLR) , year=
- [7]
-
[8]
International Conference on Machine Learning , year=
Fixing a Broken ELBO , author=. International Conference on Machine Learning , year=
- [9]
-
[10]
IEEE Transactions on Pattern Analysis and Machine Intelligence , volume =
Yoshua Bengio and Aaron Courville and Pascal Vincent , title =. IEEE Transactions on Pattern Analysis and Machine Intelligence , volume =
-
[11]
Reducing the dimensionality of data with neural networks , author=. Science , volume=. 2006 , publisher=
work page 2006
-
[12]
Proceedings of the 35th International Conference on Machine Learning , volume =
Optimizing the Latent Space of Generative Networks , author =. Proceedings of the 35th International Conference on Machine Learning , volume =. 2018 , publisher =
work page 2018
-
[13]
A Survey of Inductive Biases for Factorial Representation-Learning
A survey of inductive biases for factorial representation-learning , author=. arXiv preprint arXiv:1612.05299 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[14]
International Conference on Learning Representations (ICLR) , year=
A framework for the quantitative evaluation of disentangled representations , author=. International Conference on Learning Representations (ICLR) , year=
- [15]
-
[16]
The information bottleneck method
The information bottleneck method , author=. arXiv preprint physics/0004057 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[17]
Sparse coding with an overcomplete basis set: A strategy employed by V1? , author=. Vision Research , volume=. 1997 , publisher=
work page 1997
-
[18]
Nonlinear dimensionality reduction by locally linear embedding , author=. Science , volume=. 2000 , publisher=
work page 2000
- [19]
-
[20]
Bin Dai and Yu Wang and John A. D. Aston and Gang Hua and David Paul Wipf , journal=. 2018 , volume=
work page 2018
- [21]
-
[22]
Simon Kornblith and Mohammad Norouzi and Honglak Lee and Geoffrey E. Hinton , journal=. 2019 , volume=
work page 2019
-
[23]
Francesco Locatello and S. Bauer and M. Lucic and S. Gelly and B. Sch. ArXiv , year=
- [24]
-
[25]
Journal of Machine Learning Research , year =
Lisa Bonheme and Marek Grzes , title =. Journal of Machine Learning Research , year =
-
[26]
2023 International Conference on Machine Learning and Applications (ICMLA) , year=
Posterior Collapse in Variational Gradient Origin Networks , author=. 2023 International Conference on Machine Learning and Applications (ICMLA) , year=
work page 2023
-
[27]
Claude E. Shannon , title =. Bell System Technical Journal , volume =. 1948 , publisher =
work page 1948
-
[28]
Neural networks and principal component analysis: Learning from examples without local minima , author=. Neural Networks , year=
-
[29]
Multilayer feedforward networks are universal approximators
Multilayer feedforward networks are universal approximators , journal =. 1989 , issn =. doi:https://doi.org/10.1016/0893-6080(89)90020-8 , url =
-
[30]
Independent component analysis, A new concept? , author=. Signal Processing , volume=. 1994 , publisher=
work page 1994
-
[31]
Independent component analysis: algorithms and applications , author=. Neural Networks , volume=. 2000 , publisher=
work page 2000
-
[32]
Convergent Learning: Do different neural networks learn the same representations?
Convergent Learning: Do different neural networks learn the same representations? , author =. arXiv preprint arXiv:1511.07543 , year =
work page internal anchor Pith review Pith/arXiv arXiv
-
[33]
Advances in Neural Information Processing Systems , year =
Revisiting Model Stitching to Compare Neural Representations , author =. Advances in Neural Information Processing Systems , year =
-
[34]
Proceedings of the 36th International Conference on Machine Learning , series =
Similarity of Neural Network Representations Revisited , author =. Proceedings of the 36th International Conference on Machine Learning , series =. 2019 , editor =
work page 2019
-
[35]
arXiv preprint arXiv:2205.08399 , year=
How do Variational Autoencoders Learn? Insights from Representational Similarity , author=. arXiv preprint arXiv:2205.08399 , year=
-
[36]
Advances in Neural Information Processing Systems , year=
Implicit Neural Representations with Periodic Activation Functions , author=. Advances in Neural Information Processing Systems , year=
-
[37]
Bin Dai and David Wipf , booktitle=. Diagnosing and Enhancing. 2019 , url=
work page 2019
- [38]
-
[39]
Variational Autoencoders Pursue PCA Directions (by Accident) , year=
Rolínek, Michal and Zietlow, Dominik and Martius, Georg , booktitle=. Variational Autoencoders Pursue PCA Directions (by Accident) , year=
- [40]
-
[41]
Pattern Recognition and Machine Learning , author=. 2006 , publisher=
work page 2006
-
[42]
Journal of the Royal Statistical Society: Series B (Statistical Methodology) , volume=
Probabilistic Principal Component Analysis , author=. Journal of the Royal Statistical Society: Series B (Statistical Methodology) , volume=. 1999 , publisher=
work page 1999
-
[43]
Yann LeCun and L. Proc. IEEE , year=
-
[44]
Yann LeCun and Fu Jie Huang and L. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004. , year=
work page 2004
-
[45]
Loic Matthey and Irina Higgins and Demis Hassabis and Alexander Lerchner , title =. 2017
work page 2017
-
[46]
Information Flows of Diverse Autoencoders , volume=
Lee, Sungyeop and Jo, Junghyo , year=. Information Flows of Diverse Autoencoders , volume=. Entropy , publisher=. doi:10.3390/e23070862 , number=
-
[47]
Neural networks : the official journal of the International Neural Network Society , year=
Understanding Autoencoders with Information Theoretic Concepts , author=. Neural networks : the official journal of the International Neural Network Society , year=
-
[48]
Proceedings of the 37th annual Allerton conference on communication, control and computing , volume=
The information bottleneck method , author=. Proceedings of the 37th annual Allerton conference on communication, control and computing , volume=
-
[49]
IBM Journal of Research and Development , volume=
Information theoretical analysis of multivariate correlation , author=. IBM Journal of Research and Development , volume=. 1960 , publisher=
work page 1960
-
[50]
James Lucas and G. Tucker and R. Grosse and Mohammad Norouzi. DGS@ICLR. 2019
work page 2019
-
[51]
Annals of Human Genetics , year=
THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , author=. Annals of Human Genetics , year=
-
[52]
Bruno A. Olshausen and David J. Field , keywords =. Sparse coding with an overcomplete basis set: A strategy employed by V1? , journal =. 1997 , issn =. doi:https://doi.org/10.1016/S0042-6989(97)00169-7 , url =
- [53]
-
[54]
Rumelhart, David E. and McClelland, James L. , booktitle=. Learning Internal Representations by Error Propagation , year=
-
[55]
The Polarised Regime of identifiable Variational Autoencoders , booktitle =
Lisa Bonheme and Marek Grzes , url =. The Polarised Regime of identifiable Variational Autoencoders , booktitle =. 2023 , month =
work page 2023
-
[56]
The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science , volume=
On lines and planes of closest fit to systems of points in space , author=. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science , volume=. 1901 , publisher=
work page 1901
-
[57]
Journal of Educational Psychology , volume=
Analysis of a complex of statistical variables into principal components , author=. Journal of Educational Psychology , volume=. 1933 , publisher=
work page 1933
-
[58]
Towards A Rigorous Science of Interpretable Machine Learning
Towards A Rigorous Science of Interpretable Machine Learning , author=. arXiv preprint arXiv:1702.08608 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[59]
Advances in Neural Information Processing Systems , pages=
Attention Is All You Need , author=. Advances in Neural Information Processing Systems , pages=
-
[60]
Advances in Neural Information Processing Systems , pages=
Generative Adversarial Nets , author=. Advances in Neural Information Processing Systems , pages=
- [61]
- [62]
- [63]
- [64]
-
[65]
Alféd Rényi , title =. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics , year =
-
[66]
Equation of State Calculations by Fast Computing Machines , author=. Resonance , year=
-
[67]
Monte Carlo Sampling Methods Using Markov Chains and Their Applications , author=. Biometrika , year=
- [68]
- [69]
-
[70]
International Conference on Artificial Intelligence and Statistics , year=
Variational Autoencoders and Nonlinear ICA: A Unifying Framework , author=. International Conference on Artificial Intelligence and Statistics , year=
-
[71]
Chen, Ricky T. Q. and Li, Xuechen and Grosse, Roger B and Duvenaud, David K , booktitle =. Isolating Sources of Disentanglement in Variational Autoencoders , url =
-
[72]
Third Symposium on Advances in Approximate Bayesian Inference , year=
Posterior Collapse and Latent Variable Non-identifiability , author=. Third Symposium on Advances in Approximate Bayesian Inference , year=
-
[73]
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) , year=
The Intrinsic Dimension of Images and Its Impact on Learning , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) , year=
-
[74]
Spectral Intrinsic Dimensionality Estimation , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops , year=
-
[75]
An Empirical Bayes Approach to Statistics
Robbins, Herbert E. An Empirical Bayes Approach to Statistics. Breakthroughs in Statistics: Foundations and Basic Theory. 1992. doi:10.1007/978-1-4612-0919-5_26
-
[76]
Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences , author =. 1983 , edition =
work page 1983
-
[77]
Social Science Research Council Bulletin , year =
Horst, Paul , title =. Social Science Research Council Bulletin , year =
-
[78]
Empirical Comparison between Autoencoders and Traditional Dimensionality Reduction Methods , author=. 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE) , pages=. 2019 , organization=
work page 2019
-
[79]
Yu, Jinyue and Sun, Zhiqiang and Yu, Chengcheng , TITLE =. Applied Sciences , VOLUME =. 2025 , NUMBER =
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.