ASP-Driven Emergency Planning for Norm Violations in Reinforcement Learning

#1 ASP-Driven Emergency Planning for Norm Violations in Reinforcement Learning [PDF¹⁶] [Copy] [Kimi¹⁸] [REL]

Authors: Sebastian Adam, Thomas Eiter

Reinforcement learning is a widely used approach for training an agent to maximize rewards in a given environment. Action policies learned with this technique see a broad range of applications in practical areas like games, healthcare, robotics, or autonomous driving. However, enforcing ethical behavior or norms based on deontic constraints that the agent should adhere to during policy execution remains a complex challenge. Especially constraints that emerge after the training can necessitate to redo policy learning, which can be costly and, more critically, time-intense. In order to mitigate this problem, we present a framework for policy fixing in case of a norm violation, which allows the agent to stay operational. Based on answer set programming (ASP), emergency plans are generated that exclude or minimize cost of norm violations by future actions in a horizon of interest. By combining and developing optimization techniques, efficient policy fixing under real-time constraints can be achieved.

Subject: AAAI.2025 - Knowledge Representation and Reasoning

33619@AAAI

#1 ASP-Driven Emergency Planning for Norm Violations in Reinforcement Learning [PDF16] [Copy] [Kimi18] [REL]

#1 ASP-Driven Emergency Planning for Norm Violations in Reinforcement Learning [PDF¹⁶] [Copy] [Kimi¹⁸] [REL]