Characterizations of Admissible Objective Functions for Hierarchical Clustering
Pith reviewed 2026-05-08 05:29 UTC · model grok-4.3
The pith
Admissible sum-type objective functions for hierarchical clustering are characterized exactly when the scaling function is a symmetric polynomial of degree at most two.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We characterize admissible sum-type objective functions when the scaling function g is a symmetric polynomial of degree at most two, together with sufficient conditions in the degree-three case. For admissible objective functions in this class, we show that the recursive sparsest cut algorithm achieves an O(φ)-approximation ratio, where φ denotes the approximation factor of the sparsest cut subroutine. We establish a general characterization of admissibility for arbitrary scaling functions g, and a complete characterization when g is a symmetric polynomial of degree at most two for the new max-type class.
What carries the argument
The admissibility criterion, which requires that minimizers of the objective recover consistent hierarchical representations from similarities that admit them.
If this is right
- Admissible sum-type objectives with symmetric polynomial scaling of degree at most two can be optimized to within an O(φ) factor of optimal using recursive sparsest cut.
- Max-type objectives provide an alternative formulation that measures cluster interactions by the strongest pairwise similarity rather than aggregate sums.
- The characterizations supply necessary and sufficient conditions that delimit the admissible members of both the sum-type and max-type classes.
- Sufficient conditions for degree-three sum-type functions identify additional admissible objectives beyond the exact characterization for lower degrees.
Where Pith is reading between the lines
- Designers could now enumerate candidate scaling functions g within the characterized families to produce new objectives that are both admissible and amenable to approximation.
- Max-type objectives may prove more stable under noisy or outlier similarities because they emphasize the strongest connections.
- The framework could be tested on real similarity data to check whether the recovered hierarchies align with domain-expected structure when using the identified admissible g.
Load-bearing premise
The input similarities are assumed to come from some consistent hierarchical representation for the admissibility recovery guarantee to be meaningful.
What would settle it
A similarity matrix constructed from a known hierarchy that admits a consistent representation but is not recovered as a minimizer under a function the characterization declares admissible.
Figures
read the original abstract
Hierarchical clustering is a fundamental task in data analysis, yet for a long time it lacked a principled objective function. Dasgupta [STOC 2016] initiated a formal framework by introducing a discrete objective function for cluster trees. This framework was subsequently expanded by Cohen-Addad et al. [J. ACM 2019], who introduced the notion of admissibility -- a criterion ensuring that, whenever the input similarities admit consistent hierarchical representations, the minimizers of an objective function recover them. They also provided a necessary and sufficient condition for admissibility within a broad class of objective functions, which we refer to as sum-type objective functions. Our contributions are twofold. First, we characterize admissible sum-type objective functions when the scaling function g is a symmetric polynomial of degree at most two, together with sufficient conditions in the degree-three case. For admissible objective functions in this class, we show that the recursive sparsest cut algorithm achieves an O($\phi$)-approximation ratio, where $\phi$ denotes the approximation factor of the sparsest cut subroutine. Second, we introduce a new class of objective functions for hierarchical clustering, which we term max-type objective functions, where cluster interactions are measured by maximum rather than aggregate similarity. For this class, we establish a general characterization of admissibility for arbitrary scaling functions g, and a complete characterization when g is a symmetric polynomial of degree at most two. These results provide new theoretical insights into admissible objective functions for hierarchical clustering and clarify the scope of algorithmic guarantees for optimizing them.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript characterizes admissible objective functions for hierarchical clustering. For sum-type objectives with scaling function g a symmetric polynomial of degree at most 2, it gives necessary and sufficient conditions for admissibility along with sufficient conditions in the degree-3 case. It proves that the recursive sparsest cut algorithm achieves an O(φ)-approximation ratio on admissible instances in this class. It introduces a new max-type class of objectives (using maximum rather than aggregate similarity) and establishes a general characterization of admissibility for arbitrary g together with a complete characterization when g is a symmetric polynomial of degree at most 2. All results are scoped to inputs admitting consistent hierarchical representations.
Significance. If the characterizations hold, the work provides concrete, usable descriptions of admissible objectives in restricted but natural classes, which can guide the design of new hierarchical clustering objectives with recovery guarantees. The O(φ)-approximation links the admissible sum-type objectives to existing sparsest-cut approximation algorithms. The introduction and characterization of the max-type class expands the framework beyond the sum-type setting of prior work. These scoped, explicit results strengthen the theoretical foundations of the admissibility approach to hierarchical clustering objectives.
minor comments (3)
- §2: The definition of symmetric polynomials of degree d would benefit from an explicit example (e.g., for d=2) to clarify the algebraic form used in the subsequent characterizations.
- §3.2, Theorem 3: The statement of the O(φ) approximation could include a brief reminder of how φ is defined in terms of the sparsest-cut subroutine to improve readability for readers unfamiliar with the prior literature.
- The paper would benefit from a short table summarizing the characterization results across the sum-type and max-type classes and the different degree bounds on g.
Simulated Author's Rebuttal
We thank the referee for their positive assessment of the manuscript, including the summary of our characterizations for sum-type and max-type objectives, the O(φ)-approximation result, and the recommendation for minor revision. No specific major comments appear in the provided report.
Circularity Check
No significant circularity; derivations are direct mathematical characterizations
full rationale
The paper derives necessary and sufficient conditions for admissibility of sum-type and max-type objective functions by direct algebraic analysis of the admissibility definition (when similarities admit consistent hierarchies) applied to the functional forms involving scaling function g. These characterizations are scoped to symmetric polynomials of degree ≤2 (complete) and degree 3 (sufficient conditions), plus a general result for max-type with arbitrary g. No parameters are fitted to data and then renamed as predictions; no self-citations are load-bearing for the central claims; the O(φ)-approximation for recursive sparsest cut follows from the admissibility property without reduction to inputs by construction. The argument chain is self-contained within the stated classes and does not invoke uniqueness theorems or ansatzes from prior author work.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Admissibility criterion: whenever input similarities admit consistent hierarchical representations, the minimizers of the objective recover them (Cohen-Addad et al., J. ACM 2019)
invented entities (1)
-
max-type objective functions
no independent evidence
Reference graph
Works this paper leans on
- [1]
-
[2]
M. Charikar and V. Chatziafraitis: Approximate hierarchical clustering via sparsest cut and spreading metrics. In:Proceedings of the 28th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’17)(2017), pp. 841-854
work page 2017
-
[3]
V. Cohen-Addad, V. Kanade, F. Mallman-Trenn and C. Mathieu: Hierarchical clustering: Objective functions and algorithms.Journal of the Association for Computing Machinery 66(2019) Article 26
work page 2019
-
[4]
Dasgupta: A cost function of similarity-based hierarchical clustering
S. Dasgupta: A cost function of similarity-based hierarchical clustering. In:Proceedings of the 48-th Annual ACM SIGACT Symposium on Theory of Computing (STOC’16)(ACM, New York, 2016), pp. 118-127
work page 2016
- [5]
-
[6]
C. Semple and M. Steel:Phylogenetics(Oxford Lecture Series in Mathematics and its Ap- plications Vol. 24, Oxford University Press, 2003)
work page 2003
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.