Multi-Environment POMDPs with Finite-Horizon Objectives

2605.07537

Authors: Léonard Brice, Filip Cano, Krishnendu Chatterjee, Thomas A. Henzinger, Stefanie Muroya

Partially Observable Markov Decision Processes (POMDPs) are systems in which one agent interacts with a stochastic environment and receives only partial information about the current state. In a multi-environment POMDP (MEPOMDP), the initial state is unknown and assumed to be adversarially chosen. In this work, we focus on computing the optimal value and policy in MEPOMDPs with finite-horizon objectives. This problem is known to be PSPACE-complete in POMDPs. Our main results are as follows: (1) we establish that it is also PSPACE-complete in the more general setting of MEPOMDPs; (2) we present a practical algorithm and evaluate it on classical benchmarks, where it significantly outperforms the only previously known algorithm.

Subject: Artificial Intelligence

Publish: 2026-05-08 10:14:03 UTC
