Conceptualisation and Implementation of Human-centric Privacy Preserving Framework for Explainable AI

#1 Conceptualisation and Implementation of Human-centric Privacy Preserving Framework for Explainable AI [PDF¹] [Copy] [Kimi¹] [REL]

Explainability has emerged as a pillar of Trustworthy AI for ensuring safety in high-risk application domains. However, the incorporation of explainability to boost the transparency of black-box AI systems can inadvertently introduce unforeseen vulnerabilities. Previous research has drawn attention to privacy leakage, malicious or otherwise, from explainable interfaces leading to identification of individuals and exposure of sensitive personal information. Privacy preservation methods used in response to this leakage are found to adversely affect the utility of the system, including the degradation of model accuracy and explanation quality. The proposed thesis will examine the advancement of Privacy Enhancing Technologies (PETs) in Explainable AI (XAI) while ensuring that users remain at the core of the design process. The main objectives of this research are: (1) determining defenses for privacy attacks in XAI (2) building interpretable algorithms for private models and (3) examining user requirements for privacy preserving XAI. This research is expected to yield characteristics of privacy preserving XAI, guidelines and recommendations for effectively building privacy compliant XAI while considering the diverse needs of end users. The research outcomes will enable developers and researchers in designing XAI that is safe for deployment and considers the balance between privacy, explainability and utility.

Subject: AAAI.2026 - Doctoral Consortium Track

42140@AAAI

#1 Conceptualisation and Implementation of Human-centric Privacy Preserving Framework for Explainable AI [PDF1] [Copy] [Kimi1] [REL]

#1 Conceptualisation and Implementation of Human-centric Privacy Preserving Framework for Explainable AI [PDF¹] [Copy] [Kimi¹] [REL]