The Optimal Sample Complexity of Multiclass and List Learning
TLDR
This paper closes a longstanding gap in multiclass and list learning sample complexity by proving a conjecture relating hypergraph density to DS dimension.
Key contributions
- Proves that the maximum hypergraph density of multiclass hypothesis classes is bounded by their DS dimension.
- Resolves a longstanding conjecture by Daniely and Shalev-Shwartz (2014) regarding multiclass learning.
- Determines the optimal sample complexity dependence on the DS dimension for multiclass classification.
- Establishes the optimal sample complexity for list learning, closing a significant theoretical gap.
Why it matters
This paper resolves a fundamental open problem in statistical learning theory by determining the optimal sample complexity for multiclass and list learning. It closes a persistent \u221aDS gap, providing crucial theoretical foundations. This advance impacts the design and analysis of more efficient learning algorithms.
Original Abstract
While the optimal sample complexity of binary classification in terms of the VC dimension is well-established, determining the optimal sample complexity of multiclass classification has remained open. The appropriate complexity parameter for multiclass classification is the DS dimension, and despite significant efforts, a gap of $\sqrt{\text{DS}}$ has persisted between the upper and lower bounds on sample complexity. Recent work by Hanneke et al. (2026) shows a novel algebraic characterization of multiclass hypothesis classes in terms of their DS dimension. Building up on this, we show that the maximum hypergraph density of any multiclass hypothesis class is upper-bounded by its DS dimension. This proves a longstanding conjecture of Daniely and Shalev-Shwartz (2014). As a consequence, we determine the optimal dependence of the sample complexity on the DS dimension for multiclass as well as list learning.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.