Remi Munos

4 papers · Latest: May 5, 2026

Bandits attack function optimization

This paper introduces Simultaneous Optimistic Optimization (SOO), a bandit-inspired algorithm for efficient function optimization under budget constraints.

2605.03496May 5, 2026

Statistical Machine Learning

Spectral bandits

This paper introduces "spectral bandits," an online learning framework for graph-based problems like recommendations, using smooth payoffs and effective dimension.

2604.25272Apr 28, 2026

Machine Learning

Planning in entropy-regularized Markov decision processes and games

SmoothCruiser is a new planning algorithm for entropy-regularized MDPs and games, achieving O~(1/epsilon^4) sample complexity.

2604.19695Apr 21, 2026

Machine Learning

Spectral Thompson sampling

SpectralTS efficiently solves graph bandit problems by leveraging an effective dimension, achieving comparable regret with improved computational performance.

2604.13739Apr 15, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.