Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics

2405.15430

Total: 1

#1 Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics [PDF] [Copy] [Kimi⁴] [REL]

Naively trained Deep Reinforcement Learning agents may fail to satisfy vital safety constraints. To avoid costly retraining, we may desire to repair a previously trained reinforcement learning agent to obviate unsafe behaviour. We devise a counterexample-guided repair algorithm for repairing reinforcement learning systems leveraging safety critics. The algorithm jointly repairs a reinforcement learning agent and a safety critic using gradient-based constrained optimisation.

Subjects: Machine Learning , Logic in Computer Science

Publish: 2024-05-24 10:56:51 UTC

2405.15430

#1 Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics [PDF] [Copy] [Kimi4] [REL]

#1 Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics [PDF] [Copy] [Kimi⁴] [REL]