Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3488560.3498521acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article
Public Access

Towards Unbiased and Robust Causal Ranking for Recommender Systems

Published: 15 February 2022 Publication History

Abstract

We study the problem of optimizing ranking metrics with unbiased and robust causal estimation for recommender systems. A user may click/purchase an item regardless of whether the item is recommended or not. Thus, it is important to estimate the causal effect of recommendation and rank items higher with a larger causal effect. However, most existing works focused on improving the accuracy of recommendations, which usually have large bias and variance. Therefore, in this paper, we provide a general and theoretically rigorous framework for causal recommender systems, which enables unbiased evaluation and learning for the ranking metrics with confounding bias. We first propose a robust estimator for unbiased ranking evaluation and theoretically show that this estimator has a smaller bias and variance. We then propose a deep variational information bottleneck (IB) approach to exploit the sufficiency of the propensity score for estimation adjustment and better generalization. We also provide the learning bound and develop an unbiased learning algorithm to optimize the causal metric. Results on semi-synthetic and real-world datasets show that our evaluation and learning algorithms significantly outperform existing methods.

Supplementary Material

MP4 File (WSDM22-fp811.mp4)
We study the problem of optimizing ranking metrics for causal recommender systems.

References

[1]
David Barber Felix Agakov. 2004. The im algorithm: a variational approach to information maximization. NIPS, Vol. 16 (2004), 201.
[2]
Aman Agarwal, Kenta Takatsu, Ivan Zaitsev, and Thorsten Joachims. 2019. A general framework for counterfactual learning-to-rank. In SIGIR . 5--14.
[3]
Alexander A Alemi, Ian Fischer, Joshua V Dillon, and Kevin Murphy. 2017. Deep variational information bottleneck. International Conference on Learning Representations (2017).
[4]
Mohammad Taha Bahadori, Krzysztof Chalupka, Edward Choi, Robert Chen, Walter F Stewart, and Jimeng Sun. 2017. Causal regularization. arXiv preprint arXiv:1702.02604 (2017).
[5]
Heejung Bang and James M Robins. 2005. Doubly robust estimation in missing data and causal inference models. Biometrics, Vol. 61, 4 (2005), 962--973.
[6]
Stephen Bonner and Flavian Vasile. 2018. Causal embeddings for recommendation. In Proceedings of the 12th ACM Conference on Recommender Systems. 104--112.
[7]
David Brandfonbrener, William F Whitney, Rajesh Ranganath, and Joan Bruna. 2020. Overfitting and Optimization in Offline Policy Learning. arXiv preprint arXiv:2006.15368 (2020).
[8]
Micael Carvalho, Rémi Cadène, David Picard, Laure Soulier, Nicolas Thome, and Matthieu Cord. 2018. Cross-modal retrieval in the cooking context: Learning semantic text-image embeddings. In SIGIR. 35--44.
[9]
Claes M Cassel, Carl E S"arndal, and Jan H Wretman. 1976. Some results on generalized difference estimation and generalized regression estimation for finite populations. Biometrika, Vol. 63, 3 (1976), 615--620.
[10]
Miroslav Dud'ik, John Langford, and Lihong Li. 2011. Doubly robust policy evaluation and learning. ICML (2011).
[11]
Max H Farrell, Tengyuan Liang, and Sanjog Misra. 2018. Deep neural networks for estimation and inference. arXiv preprint arXiv:1809.09953 (2018).
[12]
Marco Federici, Anjan Dutta, Patrick Forré, Nate Kushman, and Zeynep Akata. 2019. Learning Robust Representations via Multi-View Information Bottleneck. In International Conference on Learning Representations .
[13]
Michele Jonsson Funk, Daniel Westreich, Chris Wiesen, Til Stürmer, M Alan Brookhart, and Marie Davidian. 2011. Doubly robust estimation of causal effects. American journal of epidemiology (2011), 761--767.
[14]
Alexandre Gilotte, Clément Calauzènes, Thomas Nedelec, Alexandre Abraham, and Simon Dollé. 2018. Offline a/b testing for recommender systems. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. 198--206.
[15]
Anirudh Goyal, Riashat Islam, DJ Strouse, Zafarali Ahmed, Hugo Larochelle, Matthew Botvinick, Yoshua Bengio, and Sergey Levine. 2018. InfoBot: Transfer and Exploration via the Information Bottleneck. In International Conference on Learning Representations .
[16]
Shantanu Gupta, Hao Wang, Zachary C Lipton, and Yuyang Wang. 2021. Correcting Exposure Bias for Link Recommendation. arXiv preprint arXiv:2106.07041 (2021).
[17]
Irina Higgins, Loic Matthey, Arka Pal, Christopher Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner. 2016. beta-vae: Learning basic visual concepts with a constrained variational framework. (2016).
[18]
Wassily Hoeffding. 1994. Probability inequalities for sums of bounded random variables. In The Collected Works of Wassily Hoeffding. Springer, 409--426.
[19]
Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative filtering for implicit feedback datasets. In 2008 Eighth IEEE International Conference on Data Mining. Ieee, 263--272.
[20]
Jin Huang, Harrie Oosterhuis, and Maarten de Rijke. 2021. It Is Different When Items Are Older: Debiasing Recommendations When Selection Bias and User Preferences Are Dynamic. arXiv preprint arXiv:2111.12481 (2021).
[21]
Guido W Imbens and Donald B Rubin. 2015. Causal inference in statistics, social, and biomedical sciences .Cambridge University Press.
[22]
Nan Jiang and Lihong Li. 2016. Doubly robust off-policy value evaluation for reinforcement learning. In International Conference on Machine Learning. PMLR, 652--661.
[23]
Thorsten Joachims, Adith Swaminathan, and Maarten de Rijke. 2018. Deep learning with logged bandit feedback. In International Conference on Learning Representations .
[24]
Thorsten Joachims, Adith Swaminathan, and Tobias Schnabel. 2017. Unbiased learning-to-rank with biased feedback. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. 781--789.
[25]
Fredrik Johansson, Uri Shalit, and David Sontag. 2016. Learning representations for counterfactual inference. In International conference on machine learning. 3020--3029.
[26]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer, Vol. 42, 8 (2009), 30--37.
[27]
Xiang Lisa Li and Jason Eisner. 2019. Specializing Word Embeddings (for Parsing) by Information Bottleneck. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing . 2744--2754.
[28]
Dawen Liang, Laurent Charlin, James McInerney, and David M Blei. 2016. Modeling user exposure in recommendation. In Proceedings of the 25th international conference on World Wide Web. 951--961.
[29]
Christos Louizos, Uri Shalit, Joris M Mooij, David Sontag, Richard Zemel, and Max Welling. 2017. Causal effect inference with deep latent-variable models. In Advances in Neural Information Processing Systems. 6446--6456.
[30]
Ross Prentice. 1976. Use of the logistic model in retrospective studies. Biometrics (1976), 599--606.
[31]
James M Robins and Andrea Rotnitzky. 1995. Semiparametric efficiency in multivariate regression models with missing data. J. Amer. Statist. Assoc., Vol. 90, 429 (1995), 122--129.
[32]
Paul R Rosenbaum and Donald B Rubin. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika, Vol. 70, 1 (1983), 41--55.
[33]
Donald B Rubin. 1979. Using multivariate matched sampling and regression adjustment to control bias in observational studies. J. Amer. Statist. Assoc. (1979), 318--328.
[34]
Yuta Saito. 2020 a. Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions. In Fourteenth ACM Conference on Recommender Systems. 92--100.
[35]
Yuta Saito. 2020 b. Unbiased Pairwise Learning from Biased Implicit Feedback. In ICTIR '20: The 2020 ACM SIGIR International Conference on the Theory of Information Retrieval, Virtual Event, Norway, September 14--17, 2020. 5--12.
[36]
Yuta Saito, Suguru Yaginuma, Yuta Nishino, Hayato Sakata, and Kazuhide Nakata. 2020. Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining. 501--509.
[37]
Masahiro Sato, Janmajay Singh, Sho Takemori, Takashi Sonoda, Qian Zhang, and Tomoko Ohkuma. 2019. Uplift-based evaluation and optimization of recommenders. In Proceedings of the 13th ACM Conference on Recommender Systems. 296--304.
[38]
Masahiro Sato, Sho Takemori, Janmajay Singh, and Tomoko Ohkuma. 2020. Unbiased Learning for the Causal Effect of Recommendation. Proceedings of the 14th ACM Conference on Recommender Systems (2020).
[39]
Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: Debiasing learning and evaluation. (2016), 1670--1679.
[40]
Uri Shalit, Fredrik D Johansson, and David Sontag. 2017. Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning . 3076--3085.
[41]
Amit Sharma, Jake M Hofman, and Duncan J Watts. 2015. Estimating the causal impact of recommendation systems from observational data. In Proceedings of the Sixteenth ACM Conference on Economics and Computation . 453--470.
[42]
Claudia Shi, David Blei, and Victor Veitch. 2019. Adapting neural networks for the estimation of treatment effects. In NeurIPS . 2507--2517.
[43]
Yi Su, Maria Dimakopoulou, Akshay Krishnamurthy, and Miroslav Dud'ik. 2020. Doubly robust off-policy evaluation with shrinkage. In International Conference on Machine Learning. PMLR, 9167--9176.
[44]
Yi Su, Lequn Wang, Michele Santacatterina, and Thorsten Joachims. 2019. Cab: Continuous adaptive blending for policy evaluation and learning. In International Conference on Machine Learning. PMLR, 6005--6014.
[45]
Adith Swaminathan and Thorsten Joachims. 2015. Batch learning from logged bandit feedback through counterfactual risk minimization. The Journal of Machine Learning Research (2015), 1731--1755.
[46]
Philip Thomas and Emma Brunskill. 2016. Data-efficient off-policy policy evaluation for reinforcement learning. In International Conference on Machine Learning. PMLR, 2139--2148.
[47]
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019. Doubly robust joint learning for recommendation on data missing not at random. In International Conference on Machine Learning. 6638--6647.
[48]
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2021. Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining . 427--435.
[49]
Tailin Wu, Hongyu Ren, Pan Li, and Jure Leskovec. 2020. Graph Information Bottleneck. Advances in Neural Information Processing Systems, Vol. 33 (2020), 20437--20448.
[50]
Teng Xiao, Shangsong Liang, and Zaiqiao Meng. 2019. Hierarchical neural variational model for personalized sequential recommendation. In The World Wide Web Conference. 3377--3383.
[51]
Teng Xiao and Donglin Wang. 2021. A general offline reinforcement learning framework for interactive recommendation. In The Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021 .
[52]
Teng Xiao and Suhang Wang. 2022. Towards off-policy learning for ranking policies with logged feedback. In The Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022 .
[53]
Longqi Yang, Yin Cui, Yuan Xuan, Chenyang Wang, Serge Belongie, and Deborah Estrin. 2018. Unbiased offline recommender evaluation for missing-not-at-random implicit feedback. In Proceedings of the 12th ACM Conference on Recommender Systems. 279--287.
[54]
Jinsung Yoon, James Jordon, and Mihaela van der Schaar. 2018. GANITE: Estimation of individualized treatment effects using generative adversarial nets. In International Conference on Learning Representations .

Cited By

View all
  • (2024)Deep Causal Reasoning for RecommendationsACM Transactions on Intelligent Systems and Technology10.1145/365398515:4(1-25)Online publication date: 18-Jun-2024
  • (2024)RobustRecSys @ RecSys2024: Design, Evaluation and Deployment of Robust Recommender SystemsProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3687106(1265-1269)Online publication date: 8-Oct-2024
  • (2024)Causal Inference in Recommender Systems: A Survey and Future DirectionsACM Transactions on Information Systems10.1145/363904842:4(1-32)Online publication date: 9-Feb-2024
  • Show More Cited By

Index Terms

  1. Towards Unbiased and Robust Causal Ranking for Recommender Systems

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
    February 2022
    1690 pages
    ISBN:9781450391320
    DOI:10.1145/3488560
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 February 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. causal inference
    2. counterfactual learning
    3. recommender systems

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    WSDM '22

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)382
    • Downloads (Last 6 weeks)37
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Deep Causal Reasoning for RecommendationsACM Transactions on Intelligent Systems and Technology10.1145/365398515:4(1-25)Online publication date: 18-Jun-2024
    • (2024)RobustRecSys @ RecSys2024: Design, Evaluation and Deployment of Robust Recommender SystemsProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3687106(1265-1269)Online publication date: 8-Oct-2024
    • (2024)Causal Inference in Recommender Systems: A Survey and Future DirectionsACM Transactions on Information Systems10.1145/363904842:4(1-32)Online publication date: 9-Feb-2024
    • (2024)Towards a Causal Decision-Making Framework for Recommender SystemsACM Transactions on Recommender Systems10.1145/36291692:2(1-34)Online publication date: 14-May-2024
    • (2024)CausalMed: Causality-Based Personalized Medication Recommendation Centered on Patient Health StateProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679542(1276-1285)Online publication date: 21-Oct-2024
    • (2024)CIDGMed: Causal Inference-Driven Medication Recommendation with enhanced dual-granularity learningKnowledge-Based Systems10.1016/j.knosys.2024.112685(112685)Online publication date: Nov-2024
    • (2024)CBRec: A causal way balancing multidimensional attraction effect in POI recommendationsKnowledge-Based Systems10.1016/j.knosys.2024.112607305(112607)Online publication date: Dec-2024
    • (2024)Neural_BPRElectronic Commerce Research and Applications10.1016/j.elerap.2023.10132362:COnline publication date: 4-Mar-2024
    • (2024)Attention-based causal representation learning for out-of-distribution recommendationApplied Intelligence10.1007/s10489-024-05835-x54:24(12964-12978)Online publication date: 12-Oct-2024
    • (2023)Reconsidering Learning Objectives in Unbiased Recommendation: A Distribution Shift PerspectiveProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599487(2764-2775)Online publication date: 6-Aug-2023
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media