Constant-Factor Approximation for the Uniform Decision Tree
TLDR
This paper presents the first constant-factor approximation algorithm for the average-case Uniform Decision Tree problem, improving prior bounds.
Key contributions
- Achieves the first constant-factor approximation for the uniform average-case Decision Tree problem.
- Provides a polynomial-time algorithm with an approximation ratio better than 11.57.
- Utilizes a decomposition technique to break down optimal decision trees into separating subfamilies.
- Reduces the separating subfamily subproblem to an instance of the Maximum Coverage problem.
Why it matters
The paper resolves a long-standing open question in theoretical computer science, significantly improving the state-of-the-art for decision tree approximation. This breakthrough offers a much more efficient and reliable approach for constructing optimal decision trees in practical applications.
Original Abstract
We resolve a long-standing open question, about the existence of a constant-factor approximation algorithm for the average-case \textsc{Decision Tree} problem with uniform probability distribution over the hypotheses. We answer the question in the affirmative by providing a simple polynomial-time algorithm with approximation ratio of $\frac{2}{1-\sqrt{(e+1)/(2e)}}+ε<11.57$. This improves upon the currently best-known, greedy algorithm which achieves $O(\log n/{\log\log n})$-approximation. The first key ingredient in our analysis is the usage of a decomposition technique known from problems related to \textsc{Hierarchical Clustering} [SODA '17, WALCOM '26], which allows us to decompose the optimal decision tree into a series of objects called separating subfamilies. The second crucial idea is to reduce the subproblem of finding a \textsc{Separating Subfamily} to an instance of the \textsc{Maximum Coverage} problem. To do so, we analyze the properties of cutting cliques into small pieces, which represent pairs of hypotheses to be separated. This allows us to obtain a good approximation for the \textsc{Separating Subfamily} problem, which then enables the design of the approximation algorithm for the original problem.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.