Jose Blanchet

3 papers · Latest: April 30, 2026

Wasserstein Distributionally Robust Regret Optimization for Reinforcement Learning from Human Feedback

This paper introduces Wasserstein Distributionally Robust Regret Optimization (DRRO) for RLHF to mitigate reward over-optimization, offering a less pessimistic approach.

2605.00155 · Apr 30, 2026

Classical and Quantum Speedups for Non-Convex Optimization via Energy Conserving Descent

New stochastic and quantum Energy Conserving Descent algorithms achieve exponential speedups over gradient descent for non-convex optimization.

2604.13022 · Apr 14, 2026

Partial Identification of Policy-Relevant Treatment Effects with Instrumental Variables via Optimal Transport

This paper uses optimal transport to derive sharper bounds for policy-relevant treatment effects, improving identification with instrumental variables.

2604.12263 · Apr 14, 2026
