Hyperproperty-Constrained Secure Reinforcement Learning

#1 Hyperproperty-Constrained Secure Reinforcement Learning [PDF] [Copy] [Kimi] [REL]

Authors: Ernest Bonnah, Luan Viet Nguyen, Khaza Anuarul Hoque

Hyperproperties for Time Window Temporal Logic (HyperTWTL) is a domain-specific formal specification language known for its effectiveness in compactly representing security, opacity, and concurrency properties for robotics applications. This paper focuses on HyperTWTL-constrained secure reinforcement learning (SecRL). Although temporal logic-constrained safe reinforcement learning (SRL) is an evolving research problem with several existing literature, there is a significant research gap in exploring security-aware reinforcement learning (RL) using hyperproperties. Given the dynamics of an agent as a Markov Decision Process (MDP) and opacity/security constraints formalized as HyperTWTL, we propose an approach for learning security-aware optimal policies using dynamic Boltzmann softmax RL while satisfying the HyperTWTL constraints. The effectiveness and scalability of our proposed approach are demonstrated using a pick-up and delivery robotic mission case study. We also compare our results with two other baseline RL algorithms, showing that our proposed method outperforms them.

Subjects: Artificial Intelligence , Machine Learning , Logic in Computer Science , Systems and Control

Publish: 2025-07-31 18:57:18 UTC

2508.00106

#1 Hyperproperty-Constrained Secure Reinforcement Learning [PDF] [Copy] [Kimi] [REL]