Processing math: 100%

2507.04255

Total: 1

#1 Bandit Pareto Set Identification in a Multi-Output Linear Model [PDF] [Copy] [Kimi] [REL]

Authors: Cyrille Kone, Emilie Kaufmann, Laura Richert

We study the Pareto Set Identification (PSI) problem in a structured multi-output linear bandit model. In this setting, each arm is associated a feature vector belonging to Rh, and its mean vector in Rd linearly depends on this feature vector through a common unknown matrix ΘRh×d. The goal is to identify the set of non-dominated arms by adaptively collecting samples from the arms. We introduce and analyze the first optimal design-based algorithms for PSI, providing nearly optimal guarantees in both the fixed-budget and the fixed-confidence settings. Notably, we show that the difficulty of these tasks mainly depends on the sub-optimality gaps of h arms only. Our theoretical results are supported by an extensive benchmark on synthetic and real-world datasets.

Subjects: Machine Learning , Machine Learning

Publish: 2025-07-06 06:14:43 UTC