Improved Regret for Zeroth-Order Stochastic Convex Bandits

lattimore21a@v134@PMLR

Authors: Tor Lattimore, Andras Gyorgy

We present an efficient algorithm for stochastic bandit convex optimisation with no assumptions on smoothness or strong convexity and for which the regret is bounded by O(d^(4.5) sqrt(n) polylog(n)), where n is the number of interactions and d is the dimension.
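To get a feel for how the stated bound scales, the sketch below evaluates only its leading term, d^4.5 * sqrt(n). It is an illustration of the growth rate, not the paper's algorithm: the polylog(n) factor and all constants are dropped, and the function names `leading_regret_term` and `leading_average_regret` are hypothetical names introduced here.

```python
import math


def leading_regret_term(n: int, d: int) -> float:
    """Leading term d^4.5 * sqrt(n) of the bound quoted in the abstract.

    Assumption: the polylog(n) factor and all constants are ignored,
    so this shows scaling only, not an actual regret guarantee.
    """
    return d ** 4.5 * math.sqrt(n)


def leading_average_regret(n: int, d: int) -> float:
    """Leading term divided by n, i.e. the per-interaction average."""
    return leading_regret_term(n, d) / n


if __name__ == "__main__":
    # Quadrupling n doubles the leading term and halves the per-round average.
    for n in (10_000, 40_000, 160_000):
        print(n, leading_regret_term(n, d=2), leading_average_regret(n, d=2))
```

Under these simplifications, the cumulative term grows like sqrt(n) while the per-interaction average shrinks like 1/sqrt(n); the dependence on the dimension d is polynomial with exponent 4.5.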

Subject: COLT.2021 - Award