
DOI: 10.1145/3331184.3331312

A study on the Interpretability of Neural Retrieval Models using DeepSHAP

Published: 18 July 2019

Abstract

A recent trend in IR has been the use of neural networks to learn retrieval models for text-based ad hoc search. While various approaches and architectures have yielded significantly better performance than traditional retrieval models such as BM25, it is still difficult to understand exactly why a document is deemed relevant to a query. In the ML community, several approaches for explaining decisions made by deep neural networks have been proposed -- including DeepSHAP, which modifies the DeepLIFT algorithm to estimate the relative importance (Shapley values) of input features for a given decision by comparing the network's activations for the actual input against its activations for a reference input. In image classification, the reference input tends to be a plain black image. While DeepSHAP has been well studied for image classification tasks, it remains to be seen how it can be adapted to explain the output of Neural Retrieval Models (NRMs). In particular, what is a good "black image" in the context of IR? In this paper we explore various techniques for constructing the reference input document. Additionally, we compare the explanations generated by DeepSHAP to those of LIME (a model-agnostic approach) and find that they differ considerably. Our study raises concerns regarding the robustness and accuracy of explanations produced for NRMs. With this paper we aim to shed light on interesting problems surrounding interpretability in NRMs and to highlight areas of future work.
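
The sketch below is not from the paper; it is a minimal illustration, assuming a toy PyTorch scoring model and a hypothetical bag-of-words encoder, of how DeepSHAP explanations (via the shap library's DeepExplainer) and LIME explanations (via lime's LimeTextExplainer) of a single query-document relevance score might be produced and compared, with the reference document playing the role of the "black image".

```python
# Minimal sketch (not the authors' released code) of DeepSHAP vs. LIME
# explanations for a neural retrieval model. The vocabulary, encoder, and
# "nrm" model below are toy placeholders standing in for the paper's setup.
import numpy as np
import torch
import torch.nn as nn
import shap
from lime.lime_text import LimeTextExplainer

# Toy vocabulary and bag-of-words encoder (purely illustrative).
vocab = ["cancer", "treatment", "side", "effects", "therapy", "dose", "[OOV]"]
word2id = {w: i for i, w in enumerate(vocab)}

def encode(query, doc):
    """Encode a (query, document) pair as one concatenated bag-of-words vector."""
    vec = np.zeros(2 * len(vocab), dtype=np.float32)
    for w in query.lower().split():
        vec[word2id.get(w, word2id["[OOV]"])] += 1.0
    for w in doc.lower().split():
        vec[len(vocab) + word2id.get(w, word2id["[OOV]"])] += 1.0
    return vec

# Toy "neural retrieval model": an untrained MLP producing a relevance score.
nrm = nn.Sequential(nn.Linear(2 * len(vocab), 16), nn.ReLU(), nn.Linear(16, 1))

query = "cancer treatment side effects"
document = "side effects of cancer therapy depend on the dose"

# 1) Reference ("background") input -- the IR analogue of the plain black image.
#    One candidate construction: a document made of out-of-vocabulary tokens.
reference_doc = " ".join(["[OOV]"] * 10)
background = torch.tensor(encode(query, reference_doc)).unsqueeze(0)

# 2) DeepSHAP: attribute the score to input features by contrasting activations
#    for the real input with activations for the reference input.
explainer = shap.DeepExplainer(nrm, background)
x = torch.tensor(encode(query, document)).unsqueeze(0)
shap_values = explainer.shap_values(x)

# 3) LIME: a model-agnostic explanation of the same prediction, fit on
#    perturbed versions of the document text.
def predict_fn(texts):
    scores = np.array([nrm(torch.tensor(encode(query, t)).unsqueeze(0)).item()
                       for t in texts])
    p = 1.0 / (1.0 + np.exp(-scores))        # squash scores to pseudo-probabilities
    return np.column_stack([1.0 - p, p])

lime_exp = LimeTextExplainer(class_names=["not relevant", "relevant"]).explain_instance(
    document, predict_fn, num_features=5)

# Comparing explanations amounts to checking how far the top-weighted terms in
# shap_values and lime_exp.as_list() agree for the same query-document pair.
```

The out-of-vocabulary reference document shown here is only one possible construction; swapping in a different reference_doc changes the DeepSHAP attributions, and the sensitivity of explanations to this choice is exactly what the paper studies.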

References

[1] Leila Arras, Franziska Horn, Grégoire Montavon, Klaus-Robert Müller, and Wojciech Samek. 2017. "What is relevant in a text document?": An interpretable machine learning approach. PLOS ONE, Vol. 12 (2017), 1--23.
[2] Sebastian Bach, Alexander Binder, Grégoire Montavon, Frederick Klauschen, Klaus-Robert Müller, and Wojciech Samek. 2015. On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation. PLOS ONE, Vol. 10 (2015), 1--46.
[3] Yixing Fan, Liang Pang, Jianpeng Hou, Jiafeng Guo, Yanyan Lan, and Xueqi Cheng. 2017. MatchZoo: A Toolkit for Deep Text Matching. (2017). arXiv:1707.07270
[4] Amirata Ghorbani, Abubakar Abid, and James Y. Zou. 2019. Interpretation of Neural Networks is Fragile. In AAAI '19.
[5] Jiafeng Guo, Yixing Fan, Qingyao Ai, and W. Bruce Croft. 2016. A Deep Relevance Matching Model for Ad-hoc Retrieval. In CIKM '16. ACM, 55--64.
[6] Benjamin Letham, Cynthia Rudin, Tyler H. McCormick, and David Madigan. 2015. Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model. The Annals of Applied Statistics, Vol. 9, 3 (2015), 1350--1371.
[7] Scott M. Lundberg and Su-In Lee. 2017. A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems 30. 4765--4774.
[8] Ryan McDonald, George Brokos, and Ion Androutsopoulos. 2018. Deep Relevance Ranking Using Enhanced Document-Query Interactions. In EMNLP '18.
[9] Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, and Xueqi Cheng. 2017. A Deep Investigation of Deep IR Models. arXiv preprint (2017). arXiv:1707.07700
[10] Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, Shengxian Wan, and Xueqi Cheng. 2016. Text Matching As Image Recognition. In AAAI '16. 2793--2799.
[11] Daan Rennings, Felipe Moraes, and Claudia Hauff. 2019. An Axiomatic Approach to Diagnosing Neural IR Models. In ECIR '19. 489--503.
[12] Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2016. "Why Should I Trust You?": Explaining the Predictions of Any Classifier. In KDD '16. ACM, 1135--1144.
[13] Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2018. Anchors: High-Precision Model-Agnostic Explanations. In AAAI '18.
[14] Lloyd S. Shapley. 1953. A value for n-person games. Contributions to the Theory of Games, Vol. 2, 28 (1953), 307--317.
[15] Avanti Shrikumar, Peyton Greenside, and Anshul Kundaje. 2017. Learning Important Features Through Propagating Activation Differences. arXiv preprint (2017). arXiv:1704.02685
[16] Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. 2014. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps. ICLR Workshop (2014).
[17] Jaspreet Singh and Avishek Anand. 2018. Interpreting search result rankings through intent modeling. arXiv preprint (2018). arXiv:1809.05190
[18] Jaspreet Singh and Avishek Anand. 2019. EXS: Explainable Search Using Local Model Agnostic Interpretability. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (WSDM '19). ACM, 770--773.
[19] Berk Ustun and Cynthia Rudin. 2016. Supersparse Linear Integer Models for Optimized Medical Scoring Systems. Machine Learning, Vol. 102, 3 (2016), 349--391.
[20] Kelvin Xu, Jimmy Lei Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard S. Zemel, and Yoshua Bengio. 2015. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. In International Conference on Machine Learning - Volume 37 (ICML '15). 2048--2057.



Published In

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2019
1512 pages
ISBN: 9781450361729
DOI: 10.1145/3331184

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 July 2019


Author Tags

  1. interpretability
  2. neural ranking models

Qualifiers

  • Short-paper

Funding Sources

  • Amazon Research Awards

Conference

SIGIR '19

Acceptance Rates

SIGIR '19 paper acceptance rate: 84 of 426 submissions (20%)
Overall acceptance rate: 792 of 3,983 submissions (20%)

