Remi Munos
4 papers · Latest:
Machine Learning
Bandits attack function optimization
This paper introduces Simultaneous Optimistic Optimization (SOO), a bandit-inspired algorithm for efficient function optimization under budget constraints.
2605.03496
Statistical Machine Learning
Spectral bandits
This paper introduces "spectral bandits," an online learning framework for graph-based problems like recommendations, using smooth payoffs and effective dimension.
2604.25272
Machine Learning
Planning in entropy-regularized Markov decision processes and games
SmoothCruiser is a new planning algorithm for entropy-regularized MDPs and games, achieving Õ(1/ε^4) sample complexity.
2604.19695
Machine Learning
Spectral Thompson sampling
SpectralTS efficiently solves graph bandit problems by leveraging an effective dimension, achieving comparable regret with improved computational performance.
2604.13739