Computer Science > Machine Learning

arXiv:2305.18362 (cs)

[Submitted on 27 May 2023 (v1), last revised 31 May 2023 (this version, v2)]

Title:Statistically Significant Concept-based Explanation of Image Classifiers via Model Knockoffs

Authors:Kaiwen Xu, Kazuto Fukuchi, Youhei Akimoto, Jun Sakuma

View PDF

Abstract:A concept-based classifier can explain the decision process of a deep learning model by human-understandable concepts in image classification problems. However, sometimes concept-based explanations may cause false positives, which misregards unrelated concepts as important for the prediction task. Our goal is to find the statistically significant concept for classification to prevent misinterpretation. In this study, we propose a method using a deep learning model to learn the image concept and then using the Knockoff samples to select the important concepts for prediction by controlling the False Discovery Rate (FDR) under a certain value. We evaluate the proposed method in our synthetic and real data experiments. Also, it shows that our method can control the FDR properly while selecting highly interpretable concepts to improve the trustworthiness of the model.

Comments:	Accepted to IJCAI'23
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Report number:	p519-526
Cite as:	arXiv:2305.18362 [cs.LG]
	(or arXiv:2305.18362v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.18362
Journal reference:	Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023
Related DOI:	https://doi.org/10.24963/IJCAI.2023/58

Submission history

From: Kaiwen Xu [view email]
[v1] Sat, 27 May 2023 05:40:05 UTC (321 KB)
[v2] Wed, 31 May 2023 03:20:18 UTC (320 KB)

Computer Science > Machine Learning

Title:Statistically Significant Concept-based Explanation of Image Classifiers via Model Knockoffs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Statistically Significant Concept-based Explanation of Image Classifiers via Model Knockoffs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators