4wnhbcppot@OpenReview

Total: 1

#1 Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms [PDF1] [Copy] [Kimi] [REL]

Authors: William Réveillard, Richard Combes

We consider a stochastic multi-armed bandit problem with i.i.d. rewards where the expected reward function is multimodal with at most $m$ modes. We propose the first known computationally tractable algorithm for computing the solution to the Graves-Lai optimization problem, which in turn enables the implementation of asymptotically optimal algorithms for this bandit problem.

Subject: NeurIPS.2025 - Poster