Computer Science > Machine Learning

arXiv:2405.17640 (cs)

[Submitted on 27 May 2024 (v1), last revised 7 Aug 2024 (this version, v2)]

Title: Probabilistically Plausible Counterfactual Explanations with Normalizing Flows

Authors: Patryk Wielopolski, Oleksii Furman, Jerzy Stefanowski, Maciej Zięba

Abstract: We present PPCEF, a novel method for generating probabilistically plausible counterfactual explanations (CFs). PPCEF advances beyond existing methods by combining a probabilistic formulation that leverages the data distribution with the optimization of plausibility within a unified framework. Compared to reference approaches, our method enforces plausibility by directly optimizing the explicit density function without assuming a particular family of parametrized distributions. This ensures CFs are not only valid (i.e., achieve class change) but also align with the underlying data's probability density. For that purpose, our approach leverages normalizing flows as powerful density estimators to capture the complex high-dimensional data distribution. Furthermore, we introduce a novel loss that balances the trade-off between achieving class change and maintaining closeness to the original instance while also incorporating a probabilistic plausibility term. PPCEF's unconstrained formulation allows for efficient gradient-based optimization with batch processing, leading to orders of magnitude faster computation compared to prior methods. Moreover, the unconstrained formulation of PPCEF allows for the seamless integration of future constraints tailored to specific counterfactual properties. Finally, extensive evaluations demonstrate PPCEF's superiority in generating high-quality, probabilistically plausible counterfactual explanations in high-dimensional tabular settings. This makes PPCEF a powerful tool for not only interpreting complex machine learning models but also for improving fairness, accountability, and trust in AI systems.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
Cite as:	arXiv:2405.17640 [cs.LG]
	(or arXiv:2405.17640v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.17640

Submission history

From: Patryk Wielopolski
[v1] Mon, 27 May 2024 20:24:03 UTC (132 KB)
[v2] Wed, 7 Aug 2024 07:29:39 UTC (138 KB)
