ArXiv TLDR

Optimized Deferral for Imbalanced Settings

🐦 Tweet
2604.27723

Corinna Cortes, Anqi Mao, Mehryar Mohri, Yutao Zhong

cs.LGstat.ML

TLDR

MILD is a new framework addressing expert imbalance in two-stage learning to defer by using cost-sensitive learning and novel margin-based loss functions.

Key contributions

  • Identifies and studies the "expert imbalance problem" in two-stage learning to defer systems.
  • Formulates deferral loss optimization as a novel cost-sensitive learning problem.
  • Develops MILD, a new algorithm using margin-based loss functions for imbalanced deferral.
  • Demonstrates MILD's superior performance on image classification and LLM routing tasks.

Why it matters

This paper addresses a critical challenge in learning to defer, where expert imbalance can lead to suboptimal performance. By introducing MILD, it offers a principled solution that significantly improves decision-making in real-world applications like LLM routing and medical diagnosis, enhancing efficiency and accuracy.

Original Abstract

Learning algorithms can be significantly improved by routing complex or uncertain inputs to specialized experts, balancing accuracy with computational cost. This approach, known as learning to defer, is essential in domains like natural language generation, medical diagnosis, and computer vision, where an effective deferral can reduce errors at low extra resource consumption. However, the two-stage learning to defer setting, which leverages existing predictors such as a collection of LLMs or other classifiers, often faces challenges due to an expert imbalance problem. This imbalance can lead to suboptimal performance, with deferral algorithms favoring the majority expert. We present a comprehensive study of two-stage learning to defer in expert imbalance settings. We cast the deferral loss optimization as a novel cost-sensitive learning problem over the input-expert domain. We derive new margin-based loss functions and guarantees tailored to this setting, and develop novel algorithms for cost-sensitive learning. Leveraging these results, we design principled deferral algorithms, MILD (Margin-based Imbalanced Learning to Defer), specifically suited for expert imbalance settings. Extensive experiments demonstrate the effectiveness of our approach, showing clear improvements over existing baselines on both image classification and real-world Large Language Model (LLM) routing tasks.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.