AAAI

Total: 1

#1 Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation

Authors: Paula Rodriguez Diaz; Jackson A. Killian; Lily Xu; Arun Sai Suggala; Aparna Taneja; Milind Tambe

Restless multi-armed bandits (RMABs) are an important model for optimizing the allocation of limited resources in sequential decision-making settings. Typical RMABs assume that the budget, the number of arms pulled per step, is fixed at every step of the planning horizon. However, in many real-world planning problems resources are not strictly limited at each step: surplus resources from one round can often be shifted to an earlier or later round. In practice, this budget flexibility is usually confined to a subset of consecutive planning steps, e.g., weekly planning of a monthly budget. In this paper, we define a general class of RMABs with flexible budgets, which we term F-RMABs, and provide an algorithm to solve them optimally. We derive a min-max formulation for finding optimal F-RMAB policies and leverage gradient primal-dual algorithms to compute reward-maximizing policies under flexible budgets. We introduce a scheme for sampling expected gradients that makes applying primal-dual algorithms to the F-RMAB setting tractable, where it would otherwise be computationally expensive. Additionally, we provide heuristics that trade off solution quality for efficiency and present experimental comparisons of the different F-RMAB solution approaches.
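To make the min-max structure concrete, below is a minimal, illustrative primal-dual sketch in Python for a toy flexible-budget problem. It is not the paper's F-RMAB algorithm: the reward model, the window length W, the budget B_window, and the step sizes are all hypothetical, and the policy is a simple per-step pull-probability relaxation rather than an RMAB policy. It only illustrates the generic Lagrangian min-max idea the abstract describes, with one dual variable per budget window of consecutive steps.

```python
# Illustrative primal-dual loop for a windowed (flexible) budget constraint.
# NOT the paper's F-RMAB algorithm; all names and constants are hypothetical.
import numpy as np

rng = np.random.default_rng(0)

N, H, W = 5, 8, 4          # arms, horizon, budget-window length (hypothetical)
B_window = 6.0             # total expected budget allowed per window of W steps

theta = rng.normal(size=(H, N))   # policy logits: pull probabilities per step/arm
lam = np.zeros(H // W)            # one dual variable per budget window

def pull_probs(theta):
    return 1.0 / (1.0 + np.exp(-theta))   # sigmoid: expected pulls per step/arm

for it in range(2000):
    p = pull_probs(theta)
    # Expected budget used within each window of W consecutive steps.
    usage = p.reshape(H // W, W, N).sum(axis=(1, 2))
    # Primal ascent on the Lagrangian L = sum(sqrt(p)) - sum_k lam_k*(usage_k - B):
    # toy concave reward with analytic gradient, plus the linear budget term.
    grad_reward = 0.5 / np.sqrt(p + 1e-8)
    grad_budget = np.repeat(lam, W)[:, None]        # lam_k applies to its window
    grad_theta = (grad_reward - grad_budget) * p * (1 - p)   # chain rule, sigmoid
    theta += 0.05 * grad_theta
    # Dual update: raise lam_k when window k overspends, projected onto lam >= 0.
    lam = np.maximum(0.0, lam + 0.01 * (usage - B_window))

p = pull_probs(theta)
print("window usage:", p.reshape(H // W, W, N).sum(axis=(1, 2)), "target:", B_window)
```

At convergence the per-window expected usage approaches B_window, with each multiplier acting as the shadow price of its window's budget; the paper's contribution, by contrast, is handling the true (far more expensive) RMAB objective, including the gradient-sampling scheme that makes the primal-dual approach tractable.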