Constant-Factor Approximation for the Uniform Decision Tree
Pith reviewed 2026-05-10 15:13 UTC · model grok-4.3
The pith
A polynomial-time algorithm approximates the optimal uniform decision tree within a factor less than 11.57.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We resolve the open question by giving a simple polynomial-time algorithm whose approximation ratio for the uniform decision tree problem is 2 over (1 minus the square root of (e plus 1) over (2e)) plus epsilon, a quantity less than 11.57. The algorithm works by first decomposing the optimal decision tree into separating subfamilies and then approximating each subfamily via a reduction to maximum coverage obtained by analyzing the effect of cutting cliques into pieces that represent pairs of hypotheses to be separated.
What carries the argument
The separating subfamily, obtained by decomposing the optimal decision tree and reducing the subproblem to maximum coverage through the analysis of cutting cliques into small pieces that separate hypothesis pairs.
Load-bearing premise
The decomposition of the optimal decision tree into separating subfamilies preserves the approximation guarantee when each subfamily is replaced by an approximate solution from the maximum coverage reduction.
What would settle it
An explicit family of hypotheses together with a decision tree whose expected cost is more than 11.57 times the optimal cost would show that the claimed ratio does not hold.
read the original abstract
We resolve a long-standing open question, about the existence of a constant-factor approximation algorithm for the average-case \textsc{Decision Tree} problem with uniform probability distribution over the hypotheses. We answer the question in the affirmative by providing a simple polynomial-time algorithm with approximation ratio of $\frac{2}{1-\sqrt{(e+1)/(2e)}}+\epsilon <11.57$. This improves upon the currently best-known, greedy algorithm which achieves $O(\log n/{\log\log n})$-approximation. The first key ingredient in our analysis is the usage of a decomposition technique known from problems related to \textsc{Hierarchical Clustering} [SODA '17, WALCOM '26], which allows us to decompose the optimal decision tree into a series of objects called separating subfamilies. The second crucial idea is to reduce the subproblem of finding a \textsc{Separating Subfamily} to an instance of the \textsc{Maximum Coverage} problem. To do so, we analyze the properties of cutting cliques into small pieces, which represent pairs of hypotheses to be separated. This allows us to obtain a good approximation for the \textsc{Separating Subfamily} problem, which then enables the design of the approximation algorithm for the original problem.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims to resolve a long-standing open question on the existence of a constant-factor approximation for the average-case Decision Tree problem under uniform probability distribution over hypotheses. It presents a simple polynomial-time algorithm achieving approximation ratio 2/(1-sqrt((e+1)/(2e))) + ε < 11.57, improving on the prior greedy O(log n / log log n) bound. The approach relies on a decomposition technique from hierarchical clustering to break the optimal decision tree into separating subfamilies, followed by a reduction of the Separating Subfamily subproblem to Maximum Coverage by analyzing the properties of cutting cliques into small pieces representing hypothesis pairs to be separated.
Significance. If the claims and analysis hold, this would be a significant contribution to approximation algorithms, as it provides the first constant-factor approximation for uniform Decision Tree where only super-logarithmic ratios were previously known. The reuse of decomposition methods from hierarchical clustering and the reduction to Maximum Coverage demonstrates a potentially elegant combination of existing tools, though the specific preservation of approximation guarantees in the decomposition step is central to the result.
major comments (1)
- The central claim depends on the decomposition into separating subfamilies preserving the approximation guarantee and on the correctness of the clique-cutting analysis for the Maximum Coverage reduction (both described only at high level in the abstract); these steps are load-bearing but cannot be verified without the full proof and analysis.
Simulated Author's Rebuttal
We thank the referee for reviewing our manuscript and for their comments. We address the major comment below.
read point-by-point responses
-
Referee: The central claim depends on the decomposition into separating subfamilies preserving the approximation guarantee and on the correctness of the clique-cutting analysis for the Maximum Coverage reduction (both described only at high level in the abstract); these steps are load-bearing but cannot be verified without the full proof and analysis.
Authors: The full manuscript contains the complete, formal proofs for both steps. Section 3 details the decomposition technique adapted from hierarchical clustering results, including the argument that it preserves the approximation guarantee when breaking the optimal decision tree into separating subfamilies. Section 4 then presents the reduction of the Separating Subfamily problem to Maximum Coverage, with the clique-cutting analysis that bounds the contribution of each small piece representing a hypothesis pair. These sections provide the rigorous analysis and lemmas supporting the overall constant-factor bound. The abstract summarizes these ideas at a high level, which is standard, but the body supplies the verifiable details. If the referee identifies a specific gap in the provided proofs, we are happy to clarify or expand the relevant section. revision: no
Circularity Check
No significant circularity detected
full rationale
The abstract describes a derivation that invokes a known decomposition technique from prior Hierarchical Clustering work (cited as SODA '17 and WALCOM '26) to break the optimal decision tree into separating subfamilies, followed by a reduction of the Separating Subfamily subproblem to Maximum Coverage via new analysis of clique-cutting properties. These steps are presented as building on external techniques with independent analysis for the approximation guarantee, without any visible self-definitional reductions, fitted inputs renamed as predictions, or load-bearing self-citations. No equations or detailed derivations are available in the abstract to inspect for constructional equivalence, and the central constant-factor result is framed as arising from the correctness of the cited decomposition and the new clique analysis, rendering the chain self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard assumptions in approximation algorithms and graph theory hold for the decision tree and clique models.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.