Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Ranking the causal impact of recommendations under collider bias in k-spots recommender systems

Published: 14 May 2024 Publication History

Abstract

The first objective of recommender systems is to provide personalized recommendations for each user. However, personalization may not be its only use. Past recommendations can be further analyzed to gain global insights into users’ behavior with respect to recommended items. Such insights can help to answer design-related questions such as which items’ recommendations are the most impactful in terms of users’ utility, which type of recommendations are the most followed ones, which items could be dropped from the catalog, or which recommendations are under-performing compared to what one would expect. In order to answer those questions, we need to rank item recommendations’ performances in terms of their causal impact on some user-related outcome measures. Unfortunately, in previous work leveraging causal inference for recommendation systems, the attention is fully focused on correcting confounding bias and not on the collider bias. This bias is particularly relevant in the recommender context, where multiple items are simultaneously recommended. Indeed, when there is a fixed number of available spots (i.e., k-spots) and recommendations need to be provided at each session, we argue that it is not possible to estimate the causal impacts of recommendations but only the differences between them. Therefore, in this article, we provide an unbiased estimator of the differences in the impacts of items’ recommendations, that work for any outcome of interest, and any type of recommender system as long as it has some degree of randomization. We apply our results both in a simulated environment and in a real-world offline environment leveraging logged data for recommended items in a digital healthcare app.

References

[1]
Aman Agarwal, Ivan Zaitsev, Xuanhui Wang, Cheng Li, Marc Najork, and Thorsten Joachims. 2019. Estimating position bias without intrusive interventions. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining.Association for Computing Machinery, New York, NY, USA, 474–482. DOI:DOI:
[2]
Elias Bareinboim, Jin Tian, and Judea Pearl. 2014. Recovering from selection bias in causal and statistical inference. Proceedings of the AAAI Conference on Artificial Intelligence 28, 1(2014), 433–450. DOI:DOI:
[3]
Joseph Berkson. 1946. Limitations of the application of fourfold table analysis to hospital data. Biometrics 2 3 (1946), 47–53.
[4]
Lucas Bernardi, Themistoklis Mavridis, and Pablo Estevez. 2019. 150 successful machine learning models: 6 lessons learned at booking. com. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1743–1751.
[5]
William Black, Ercument Ilhan, Andrea Marchini, and Vilda Markeviciute. 2023. AdaptEx: A self-service contextual bandit platform. In Proceedings of the 17th ACM Conference on Recommender Systems. 426–429.
[6]
Stephen Bonner and Flavian Vasile. 2018. Causal embeddings for recommendation. In Proceedings of the 12th ACM Conference on Recommender Systems.Association for Computing Machinery, New York, NY, USA, 104–112. DOI:DOI:
[7]
Léon Bottou, Jonas Peters, Joaquin Quiñonero-Candela, Denis X. Charles, D. Max Chickering, Elon Portugaly, Dipankar Ray, Patrice Simard, and Ed Snelson. 2013. Counterfactual reasoning and learning systems: The example of computational advertising. Journal of Machine Learning Research 14, 11 (2013), 3207–3260.
[8]
Carlos Brito and Judea Pearl. 2012. Generalized instrumental variables. In Proceedings of the 18th Uncertainty in Artificial Intelligence Conference 18(2012), 85–93.
[9]
Emanuele Cavenaghi, Alessio Zanga, Fabio Stella, and Markus Zanker. 2023. Towards a causal decision-making framework for recommender systems. ACM Transactions on Recommender Systems (2023).
[10]
Jun Chen, Chaokun Wang, Jianmin Wang, and Philip S. Yu. 2016. Recommendation for repeat consumption from user implicit feedback. IEEE Transactions on Knowledge and Data Engineering 28, 11 (2016), 3083–3097. DOI:DOI:
[11]
Victor Chernozhukov, Denis Chetverikov, Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey, and James Robins. 2018. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal 21, 1(2018), C1–C68. DOI:DOI:https://academic.oup.com/ectj/article-pdf/21/1/C1/27684918/ectj00c1.pdf
[12]
Vivek Farias, Hao Li, Tianyi Peng, Xinyuyang Ren, Huawei Zhang, and Andrew Zheng. 2023. Correcting for interference in experiments: A case study at douyin. In Proceedings of the 17th ACM Conference on Recommender Systems.Association for Computing Machinery, New York, NY, USA, 455–466. DOI:DOI:
[13]
M. Frölich and S. Sperlich. 2019. Impact Evaluation: Treatment Effects and Causal Analysis. Cambridge University Press; 2019.
[14]
Chen Gao, Yu Zheng, Wenjie Wang, Fuli Feng, Xiangnan He, and Yong Li. 2022. Causal Inference in Recommender Systems: A Survey and Future Directions. arxiv:2208.12397. Retrieved from https://arxiv.org/abs/2208.12397
[15]
Carlos A. Gomez-Uribe and Neil Hunt. 2015. The netflix recommender system: Algorithms, business value, and innovation. ACM Transactions on Management Information Systems 6, 4 (2015), 1–19.
[16]
Pierre Gutierrez and Jean-Yves Gérardy. 2017. Causal inference and uplift modelling: A review of the literature. In Proceedings of the 3rd International Conference on Predictive Applications and APIs. Claire Hardgrove, Louis Dorard, Keiran Thompson, and Florian Douetteau (Eds.), PMLR, 1–13. Retrieved from https://proceedings.mlr.press/v67/gutierrez17a.html
[17]
Reinhard Heckel, Max Simchowitz, Kannan Ramchandran, and Martin Wainwright. 2018. Approximate ranking from pairwise comparisons. In Proceedings of the 21st International Conference on Artificial Intelligence and Statistics.Amos Storkey and Fernando Perez-Cruz (Eds.), PMLR, 1057–1066. Retrieved from https://proceedings.mlr.press/v84/heckel18a.html
[18]
Leonard Henckel, Martin Buttenschoen, and Marloes H Maathuis. 2023. Graphical tools for selecting conditional instrumental sets. Biometrika (2023), asad066.
[19]
Dietmar Jannach, Pearl Pu, Francesco Ricci, and Markus Zanker. 2021. Recommender systems: Past, present, future. AI Magazine 42, 3(2021), 3–6. DOI:DOI:
[20]
Olivier Jeunen, Ciarán Gilligan-Lee, Rishabh Mehrotra, and Mounia Lalmas. 2022. Disentangling causal effects from sets of interventions in the presence of unobserved confounders. Advances in Neural Information Processing Systems 35, (2022), 27850–27861.
[21]
Olivier Jeunen and Bart Goethals. 2021. Pessimistic reward models for off-policy learning in recommendation. In Proceedings of the 15th ACM Conference on Recommender Systems.Association for Computing Machinery, New York, NY, USA, 63–74. DOI:DOI:
[22]
Olivier Jeunen and Bart Goethals. 2023. Pessimistic decision-making for recommender systems. ACM Transactions on Recommender Systems 1, 1 (2023), 1–27.
[23]
Olivier Jeunen, Thorsten Joachims, Harrie Oosterhuis, Yuta Saito, and Flavian Vasile. 2022. CONSEQUENCES - causality, counterfactuals and sequential decision-making for recommender systems. In Proceedings of the 16th ACM Conference on Recommender Systems. . Association for Computing Machinery, New York, NY, USA, 654–657. DOI:DOI:
[24]
Thorsten Joachims, Adith Swaminathan, and Tobias Schnabel. 2017. Unbiased learning-to-rank with biased feedback. In Proceedings of the 10th ACM International Conference on Web Search and Data Mining. 781–789.
[25]
Ioannis Kangas, Maud Schwoerer, and Lucas J. Bernardi. 2021. Recommender systems for personalized user experience: lessons learned at Booking. com. In Proceedings of the 15th ACM Conference on Recommender Systems. 583–586.
[26]
Norman Knyazev and Harrie Oosterhuis. 2022. The bandwagon effect: Not just another bias. In Proceedings of the 2022 ACM SIGIR International Conference on Theory of Information Retrieval. . Association for Computing Machinery, New York, NY, USA, 243–253. DOI:DOI:
[27]
Sören R. Künzel, Jasjeet S. Sekhon, Peter J. Bickel, and Bin Yu. 2019. Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences 116, 10 (2019), 4156–4165. DOI:DOI:arXiv:https://www.pnas.org/content/116/10/4156.full.pdf
[28]
Dawen Liang, Laurent Charlin, and David M. Blei. 2016. Causal inference for recommendation. In Proceedings of the Causation: Foundation to Application, Workshop at UAI. AUAI.
[29]
Marloes H. Maathuis and Diego Colombo. 2015. A GENERALIZED BACK-DOOR CRITERION. The Annals of Statistics 43, 3 (2015), 1060–1088. Retrieved from http://www.jstor.org/stable/43556547
[30]
Sahand Negahban, Sewoong Oh, and Devavrat Shah. 2012. Iterative ranking from pair-wise comparisons. In Proceedings of the Advances in Neural Information Processing Systems.F. Pereira, C.J. Burges, L. Bottou, and K.Q. Weinberger (Eds.), Vol. 25, Curran Associates, Inc. Retrieved from https://proceedings.neurips.cc/paper/2012/file/9adeb82fffb5444e81fa0ce8ad8afe7a-Paper.pdf
[31]
Judea Pearl. 2009. Causality: Models, Reasoning and Inference (2nd. ed.). Cambridge University Press, USA.
[32]
Judea Pearl. 2017. Physical and metaphysical counterfactuals: Evaluating disjunctive actions. Journal of Causal Inference 5, 2 (2017), 20170018. DOI:DOI:
[33]
Judea Pearl. 2018. Does obesity shorten life? or is it the soda? on non-manipulable causes. Journal of Causal Inference 6, 2 (2018), 20182001. DOI:DOI:
[34]
Judea Pearl. 2019. On the interpretation of do(x). Journal of Causal Inference 7, 1 (2019), 20192002. DOI:DOI:
[35]
Jonas Peters, Dominik Janzing, and Bernhard Schlkopf. 2017. Elements of Causal Inference: Foundations and Learning Algorithms. The MIT.
[36]
James M. Robins, Andrea Rotnitzky, and Lue Ping Zhao. 1994. Estimation of regression coefficients when some regressors are not always observed. Journal of the American statistical Association 89, 427 (1994), 846–866. DOI:DOI: arXiv:https://doi.org/10.1080/01621459.1994.10476818
[37]
Paul R. Rosenbaum and Donald B. Rubin. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika 70, 1 (1983), 41–55. Retrieved from http://www.jstor.org/stable/2335942
[38]
Marco Rossetti, Fabio Stella, and Markus Zanker. 2016. Contrasting offline and online results when evaluating recommendation algorithms. In Proceedings of the 10th ACM Conference on Recommender Systems. . Association for Computing Machinery, New York, NY, USA, 31–34. DOI:DOI:
[39]
Matteo Ruffini, Vito Bellini, Alexander Buchholz, Giuseppe Di Benedetto, and Yannik Stein. 2022. Modeling position bias ranking for streaming media services. In Companion Proceedings of the Web Conference 2022. . Association for Computing Machinery, New York, NY, USA, 72–76. DOI:DOI:
[40]
Aleix Ruiz de Villa, Gabriele Sottocornola, Ludovik Coba, Giovanni Maffei, Federico Lucchesi, João Guerreiro, and Bartlomiej Skorulski. 2023. Leveraging causal inference to measure the impact of a mental health app on users’ well-being. In Proceedings of the 31st ACM Conference on User Modeling, Adaptation and Personalization.Association for Computing Machinery, New York, NY, USA, 228–237. DOI:DOI:
[41]
Yuta Saito. 2020. Doubly robust estimator for ranking metrics with post-click conversions. In Proceedings of the 14th ACM Conference on Recommender Systems. . Association for Computing Machinery, New York, NY, USA, 92–100. DOI:DOI:
[42]
Yuta Saito, Suguru Yaginuma, Yuta Nishino, Hayato Sakata, and Kazuhide Nakata. 2020. Unbiased recommender learning from missing-not-at-random implicit feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining. 501–509.
[43]
Masahiro Sato. 2021. Online evaluation methods for the causal effect of recommendations. In Proceedings of the 15th ACM Conference on Recommender Systems. . Association for Computing Machinery, New York, NY, USA, 96–101. DOI:DOI:
[44]
Masahiro Sato, Janmajay Singh, Sho Takemori, Takashi Sonoda, Qian Zhang, and Tomoko Ohkuma. 2019. Uplift-based evaluation and optimization of recommenders. In Proceedings of the 13th ACM Conference on Recommender Systems.Association for Computing Machinery, New York, NY, USA, 296–304. DOI:DOI:
[45]
Masahiro Sato, Sho Takemori, Janmajay Singh, and Tomoko Ohkuma. 2020. Unbiased learning for the causal effect of recommendation. In Proceedings of the 14th ACM Conference on Recommender Systems.Association for Computing Machinery, New York, NY, USA, 378–387. DOI:DOI:
[46]
Uri Shalit, Fredrik D. Johansson, and David Sontag. 2017. Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Achine learning, PMLR, 3076–3085.
[47]
Amit Sharma, Jake M. Hofman, and Duncan J. Watts. 2015. Estimating the causal impact of recommendation systems from observational data. In Proceedings of the 17th ACM Conference on Economics and Computation. 453–470.
[48]
Vladimir N. Vapnik. 1995. The Nature of Statistical Learning Theory. Springer-Verlag New York, Inc.
[49]
Xuanhui Wang, Nadav Golbandi, Michael Bendersky, Donald Metzler, and Marc Najork. 2018. Position bias estimation for unbiased learning to rank in personal search. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining. 610–618.
[50]
Yixin Wang, Dawen Liang, Laurent Charlin, and David M. Blei. 2020. Causal inference for recommender systems. In Proceedings of the 14th ACM Conference on Recommender Systems. . Association for Computing Machinery, New York, NY, USA, 426–431. DOI:DOI:
[51]
Fabian Wauthier, Michael Jordan, and Nebojsa Jojic. 2013. Efficient ranking from pairwise comparisons. In Proceedings of the 30th International Conference on Machine Learning.Sanjoy Dasgupta and David McAllester (Eds.), PMLR, Atlanta, Georgia, USA, 109–117. Retrieved from https://proceedings.mlr.press/v28/wauthier13.html
[52]
Tianxin Wei, Fuli Feng, Jiawei Chen, Ziwei Wu, Jinfeng Yi, and Xiangnan He. 2021. Model-agnostic counterfactual reasoning for eliminating popularity bias in recommender system. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. . Association for Computing Machinery, New York, NY, USA, 1791–1800. DOI:DOI:
[53]
Peng Wu, Haoxuan Li, Yuhao Deng, Wenjie Hu, Quanyu Dai, Zhenhua Dong, Jie Sun, Rui Zhang, and Xiao-Hua Zhou. 2022. On the opportunity of causal learning in recommendation systems: Foundation, estimation, prediction and challenges. In Proceedings of the International Joint Conference on Artificial Intelligence, Vienna, Austria. 23–29.
[54]
Shuyuan Xu, Jianchao Ji, Yunqi Li, Yingqiang Ge, Juntao Tan, and Yongfeng Zhang. 2023. Causal inference for recommendation: foundations, methods and applications. arXiv preprint arXiv:2301.04016 (2023).
[55]
Shuyuan Xu, Juntao Tan, Shelby Heinecke, Vena Jia Li, and Yongfeng Zhang. 2023. Deconfounded causal collaborative filtering. ACM Transactions on Recommender Systems 1, 4 (2023), 1–25.
[56]
Yongfeng Zhang, Xu Chen, Yi Zhang, and Xianjie Chen. 2021. CSR 2021: The 1st international workshop on causality in search and recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. . Association for Computing Machinery, New York, NY, USA, 2677–2680. DOI:DOI:
[57]
Yang Zhang, Fuli Feng, Xiangnan He, Tianxin Wei, Chonggang Song, Guohui Ling, and Yongdong Zhang. 2021. Causal intervention for leveraging popularity bias in recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 11–20.
[58]
Yu Zheng, Chen Gao, Xiang Li, Xiangnan He, Yong Li, and Depeng Jin. 2021. Disentangling user interest and conformity for recommendation with causal embedding. In Proceedings of the Web Conference 2021.Association for Computing Machinery, New York, NY, USA, 2980–2991. DOI:DOI:
[59]
Xinyuan Zhu, Yang Zhang, Fuli Feng, Xun Yang, Dingxian Wang, and Xiangnan He. 2022. Mitigating Hidden Confounding Effects for Causal Recommendation. arXiv:2205.07499. Retrieved from https://arxiv.org/abs/2205.07499

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Recommender Systems
ACM Transactions on Recommender Systems  Volume 2, Issue 2
June 2024
180 pages
EISSN:2770-6699
DOI:10.1145/3613594
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 May 2024
Online AM: 31 January 2024
Accepted: 19 December 2023
Revised: 21 September 2023
Received: 10 January 2023
Published in TORS Volume 2, Issue 2

Check for updates

Author Tags

  1. User behavior
  2. causal inference in recommendation systems
  3. average causal effect
  4. collider bias

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 256
    Total Downloads
  • Downloads (Last 12 months)256
  • Downloads (Last 6 weeks)11
Reflects downloads up to 19 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media