Maturing Markov Decision Processes: Decision Making under Increasing Information and Shrinking Action Sets

#1 Maturing Markov Decision Processes: Decision Making under Increasing Information and Shrinking Action Sets [PDF] [Copy] [Kimi] [REL]

Authors: Jiaxi Liu, Aiping Yang, Yuhang Yang, Shuqi Zhang, Zewei Dong, Jiangming Yang, Xuebin Chen

Sequential decision problems often exhibit an asymmetric evolution of information and decision flexibility: as a decision cycle unfolds, the agent receives richer information while feasible actions expire due to operational cutoffs, commitments, or resource constraints. Standard MDP formulations typically flatten this structure into stage-dependent state descriptions and action masks, thereby obscuring the nested information--action asymmetry that determines which decisions are urgent and which can be deferred. We introduce Maturing Markov Decision Processes (MMDPs), a formulation built around this information--action asymmetry. We characterize one of its key consequences through an expiring-action priority principle, which identifies the actions that must be resolved before the next stage. Motivated by this structure, we develop a structure-aware reinforcement learning framework with stage-aware policy design, expiring-action abstraction, and search-augmented learning with distillation. Experiments on a controlled multi-supplier replenishment problem, simplified cash-management environments of increasing complexity, and a production-scale simulator show that explicitly modeling this asymmetry improves learning efficiency and becomes increasingly valuable as decision problems scale.

Subjects: Machine Learning , Artificial Intelligence

Publish: 2026-06-17 08:41:55 UTC

2606.18820

#1 Maturing Markov Decision Processes: Decision Making under Increasing Information and Shrinking Action Sets [PDF] [Copy] [Kimi] [REL]