research-article

Explore, exploit, and explain: personalizing explainable recommendations with bandits

Authors:

James McInerney,

Benjamin Lacker,

Samantha Hansen,

Hugues Bouchard,

Rishabh MehrotraAuthors Info & Claims

RecSys '18: Proceedings of the 12th ACM Conference on Recommender Systems

Pages 31 - 39

https://doi.org/10.1145/3240323.3240354

Published: 27 September 2018 Publication History

Abstract

The multi-armed bandit is an important framework for balancing exploration with exploitation in recommendation. Exploitation recommends content (e.g., products, movies, music playlists) with the highest predicted user engagement and has traditionally been the focus of recommender systems. Exploration recommends content with uncertain predicted user engagement for the purpose of gathering more information. The importance of exploration has been recognized in recent years, particularly in settings with new users, new items, non-stationary preferences and attributes. In parallel, explaining recommendations ("recsplanations") is crucial if users are to understand their recommendations. Existing work has looked at bandits and explanations independently. We provide the first method that combines both in a principled manner. In particular, our method is able to jointly (1) learn which explanations each user responds to; (2) learn the best content to recommend for each user; and (3) balance exploration with exploitation to deal with uncertainty. Experiments with historical log data and tests with live production traffic in a large-scale music recommendation service show a significant improvement in user engagement.

Supplementary Material

MP4 File (p31-mcinerney.mp4)

Download
1831.30 MB

References

[1]

Alekh Agarwal, Daniel Hsu, Satyen Kale, John Langford, Lihong Li, and Robert Schapire. 2014. Taming the monster: A fast and simple algorithm for contextual bandits. In International Conference on Machine Learning. 1638--1646.

Digital Library

[2]

Svetlin Bostandjiev, John O'Donovan, and Tobias Höllerer. 2012. TasteWeights: a visual interactive hybrid recommender system. In Proceedings of the sixth ACM conference on Recommender systems. ACM, 35--42.

Digital Library

[3]

Allison J. B. Chaney, Brandon Stewart, and Barbara Engelhardt. 2017. How algorithmic confounding in recommendation systems increases homogeneity and decreases utility. arXiv preprint arXiv:1710.11214 (2017).

[4]

Paul Covington, Jay Adams, and Emre Sargin. 2016. Deep neural networks for YouTube recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems. ACM, 191--198.

Digital Library

[5]

Miroslav Dudik, John Langford, and Lihong Li. 2011. Doubly robust policy evaluation and learning. arXiv preprint arXiv:1103.4601 (2011).

Digital Library

[6]

Gerhard Friedrich and Markus Zanker. 2011. A taxonomy for generating explanations in recommender systems. AI Magazine 32, 3 (2011), 90--98.

Digital Library

[7]

Alexandre Gilotte, Clément Calauzènes, Thomas Nedelec, Alexandre Abraham, and Simon Dollé. 2018. Offline A/B testing for recommender systems. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. ACM, 198--206.

Digital Library

[8]

Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative filtering for implicit feedback datasets. In Data Mining, 2008. ICDM'08. Eighth IEEE International Conference on. Ieee, 263--272.

Digital Library

[9]

Thorsten Joachims, Dayne Freitag, and Tom Mitchell. 1997. Webwatcher: A tour guide for the world wide web. In International Joint Conference on Artificial Intelligence (IJCAI). 770--777.

[10]

Thorsten Joachims and Adith Swaminathan. 2016. Tutorial on Counterfactual Evaluation and Learning for Search, Recommendation and Ad Placement. In ACM Conference on Research and Development in Information Retrieval (SIGIR). 1199--1201.

Digital Library

[11]

Antti Kangasrääsiö, Dorota Glowacka, and Samuel Kaski. 2015. Improving controllability and predictability of interactive recommendation interfaces for exploratory search. In Proceedings of the 20th international conference on intelligent user interfaces. ACM, 247--251.

Digital Library

[12]

Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009).

Digital Library

[13]

Pigi Kouki, James Schaffer, Jay Pujara, John O'Donovan, and Lise Getoor. 2017. User Preferences for Hybrid Explanations. In Proceedings of the Eleventh ACM Conference on Recommender Systems. ACM, 84--88.

Digital Library

[14]

Branislav Kveton, Csaba Szepesvari, Zheng Wen, and Azin Ashkan. 2015. Cascading bandits: Learning to rank in the cascade model. In International Conference on Machine Learning (ICML). 767--776.

Digital Library

[15]

Branislav Kveton, Zheng Wen, Azin Ashkan, Hoda Eydgahi, and Brian Eriksson. 2014. Matroid Bandits: Fast Combinatorial Optimization with Learning. In Conference on Uncertainty in Artificial Intelligence (UAI).

Digital Library

[16]

Paul Lamere. 2017. https://twitter.com/plamere/status/822021478170423296. Twitter. (2017).

[17]

Lihong Li, Wei Chu, John Langford, and Robert E Schapire. 2010. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th International Conference on World Wide Web (WWW). ACM, 661--670.

Digital Library

[18]

Filip Radlinski, Robert Kleinberg, and Thorsten Joachims. 2008. Learning diverse rankings with multi-armed bandits. In International Conference on Machine Learning (ICML).

Digital Library

[19]

Steffen Rendle. 2010. Factorization machines. In IEEE 10th International Conference on Data Mining (ICDM). IEEE, 995--1000.

Digital Library

[20]

Guy Shani, David Heckerman, and Ronen I Brafman. 2005. An MDP-based recommender system. Journal of Machine Learning Research 6, Sep (2005), 1265--1295.

Digital Library

[21]

Anongnart Srivihok and Pisit Sukonmanee. 2005. E-commerce intelligent agent: personalization travel support agent using Q Learning. In Proceedings of the 7th international conference on Electronic commerce. ACM, 287--292.

Digital Library

[22]

Richard S Sutton and Andrew G Barto. 1998. Reinforcement learning: An introduction. Vol. 1. MIT press Cambridge.

Digital Library

[23]

Adith Swaminathan, Akshay Krishnamurthy, Alekh Agarwal, Miro Dudik, John Langford, Damien Jose, and Imed Zitouni. 2017. Off-policy evaluation for slate recommendation. In Advances in Neural Information Processing Systems. 3635--3645.

[24]

Nava Tintarev and Judith Masthoff. 2007. A survey of explanations in recommender systems. In IEEE 23rd International Conference on Data Engineering Workshop. IEEE, 801--810.

Digital Library

[25]

Xinxi Wang, Yi Wang, David Hsu, and Ye Wang. 2014. Exploration in interactive personalized music recommendation: a reinforcement learning approach. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 11, 1 (2014), 7.

Digital Library

Cited By

Borole P(2024)Analysis and Implications of Adopting AI and Machine Learning in Marketing, Servicing, and Communications TechnologyInternational Journal of Artificial Intelligence and Machine Learning10.4018/IJAIML.33837913:1(1-11)Online publication date: 20-Feb-2024
https://doi.org/10.4018/IJAIML.338379
Topaloglu BOztaysi BDogan O(2024)Cart-State-Aware Discovery of E-Commerce Visitor Journeys with Process MiningJournal of Theoretical and Applied Electronic Commerce Research10.3390/jtaer1904013819:4(2851-2879)Online publication date: 17-Oct-2024
https://doi.org/10.3390/jtaer19040138
Iana AAlam MPaulheim H(2024)A survey on knowledge-aware news recommender systemsSemantic Web10.3233/SW-22299115:1(21-82)Online publication date: 12-Jan-2024
https://doi.org/10.3233/SW-222991
Show More Cited By

Index Terms

Explore, exploit, and explain: personalizing explainable recommendations with bandits
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
      1. Causal reasoning and diagnostics
  2. Machine learning
    1. Learning paradigms
      1. Reinforcement learning
2. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Recommender systems
  2. Information systems applications
    1. Data mining
      1. Collaborative filtering

Recommendations

The Exploit-Explore Dilemma in Music Recommendation
RecSys '16: Proceedings of the 10th ACM Conference on Recommender Systems

Were The Rolling Stones right when they said, "You can't always get what you want; but if you try sometime you get what you need"? Recommendation systems are the crystal ball of the Internet: predicting user intentions, making sense of big data, and ...
Let Me Explain: Impact of Personal and Impersonal Explanations on Trust in Recommender Systems
CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

Trust in a Recommender System (RS) is crucial for its overall success. However, it remains underexplored whether users trust personal recommendation sources (i.e. other humans) more than impersonal sources (i.e. conventional RS), and, if they do, ...
Explore-exploit in top-N recommender systems via Gaussian processes
RecSys '14: Proceedings of the 8th ACM Conference on Recommender systems

We address the challenge of ranking recommendation lists based on click feedback by efficiently encoding similarities among users and among items. The key challenges are threefold: (1) combinatorial number of lists; (2) sparse feedback and (3) context ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

RecSys '18: Proceedings of the 12th ACM Conference on Recommender Systems

September 2018

600 pages

ISBN:9781450359016

DOI:10.1145/3240323

General Chairs:
Sole Pera
Boise State University
,
Michael Ekstrand
Boise State University
,
Program Chairs:
Xavier Amatriain
Curai, USA
,
John O'Donovan
University of California, Santa Barbara

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGCHI: ACM Special Interest Group on Computer-Human Interaction

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 September 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article

Conference

RecSys '18

Sponsor:

SIGCHI

RecSys '18: Twelfth ACM Conference on Recommender Systems

October 2, 2018

British Columbia, Vancouver, Canada

Acceptance Rates

RecSys '18 Paper Acceptance Rate 32 of 181 submissions, 18%;

Overall Acceptance Rate 254 of 1,295 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

100
Total Citations
View Citations
3,824
Total Downloads

Downloads (Last 12 months)316
Downloads (Last 6 weeks)35

Reflects downloads up to 16 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Borole P(2024)Analysis and Implications of Adopting AI and Machine Learning in Marketing, Servicing, and Communications TechnologyInternational Journal of Artificial Intelligence and Machine Learning10.4018/IJAIML.33837913:1(1-11)Online publication date: 20-Feb-2024
https://doi.org/10.4018/IJAIML.338379
Topaloglu BOztaysi BDogan O(2024)Cart-State-Aware Discovery of E-Commerce Visitor Journeys with Process MiningJournal of Theoretical and Applied Electronic Commerce Research10.3390/jtaer1904013819:4(2851-2879)Online publication date: 17-Oct-2024
https://doi.org/10.3390/jtaer19040138
Iana AAlam MPaulheim H(2024)A survey on knowledge-aware news recommender systemsSemantic Web10.3233/SW-22299115:1(21-82)Online publication date: 12-Jan-2024
https://doi.org/10.3233/SW-222991
Hu HKantardzic MKar S(2024)Explainable data stream mining: Why the new models are betterIntelligent Decision Technologies10.3233/IDT-23006518:1(371-385)Online publication date: 20-Feb-2024
https://doi.org/10.3233/IDT-230065
Liu YLi HZheng ZWu FChen GPapavassiliou SSchmid S(2024)On the Analysis of Two-Stage Stochastic BanditProceedings of the Twenty-fifth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing10.1145/3641512.3686360(51-60)Online publication date: 14-Oct-2024
https://dl.acm.org/doi/10.1145/3641512.3686360
Jeunen OMandav JPotapov IAgarwal NVaid SShi WUstimenko A(2024)Multi-Objective Recommendation via Multivariate Policy LearningProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688132(712-721)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3640457.3688132
Gupta SJeunen OOosterhuis Hde Rijke M(2024)Optimal Baseline Corrections for Off-Policy Contextual BanditsProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688105(722-732)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3640457.3688105
Lichtenberg JDi Benedetto GRuffini M(2024)Ranking Across Different Content Types: The Robust Beauty of Multinomial BlendingProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688059(823-825)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3640457.3688059
Tandoi MSolis Morales D(2024)Explore versus repeat: insights from an online supermarketProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688050(787-789)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3640457.3688050
Su YLu HLi YLiu LBi SChi EChen MBaeza-Yates RBonchi F(2024)Multi-Task Neural Linear Bandit for Exploration in Recommender SystemsProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671649(5723-5730)Online publication date: 25-Aug-2024
https://dl.acm.org/doi/10.1145/3637528.3671649
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents