Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3583780.3615011acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article
Open access

Predictive Uncertainty-based Bias Mitigation in Ranking

Published: 21 October 2023 Publication History

Abstract

Societal biases that are contained in retrieved documents have received increased interest. Such biases, which are often prevalent in the training data and learned by the model, can cause societal harms, by misrepresenting certain groups, and by enforcing stereotypes. Mitigating such biases demands algorithms that balance the trade-off between maximized utility for the user with fairness objectives, which incentivize unbiased rankings. Prior work on bias mitigation often assumes that ranking scores, which correspond to the utility that a document holds for a user, can be accurately determined. In reality, there is always a degree of uncertainty in the estimate of expected document utility. This uncertainty can be approximated by viewing ranking models through a Bayesian perspective, where the standard deterministic score becomes a distribution.
In this work, we investigate whether uncertainty estimates can be used to decrease the amount of bias in the ranked results, while minimizing loss in measured utility. We introduce a simple method that uses the uncertainty of the ranking scores for an uncertainty-aware, post hoc approach to bias mitigation. We compare our proposed method with existing baselines for bias mitigation with respect to the utility-fairness trade-off, the controllability of methods, and computational costs. We show that an uncertainty-based approach can provide an intuitive and flexible trade-off that outperforms all baselines without additional training requirements, allowing for the post hoc use of this approach on top of arbitrary retrieval models.

Supplementary Material

MP4 File (2201-video.mp4)
Social biases that are often prevalent in training data and hence are learned by the model can cause societal harms, by misrepresenting certain groups, and by enforcing stereotypes. Mitigating such biases demands algorithms that balance the trade-off between maximized utility for the user with fairness objectives, which incentivize unbiased rankings. In this talk we will demonstrate that using the predictive uncertainty of the ranking model with respect to its scores can aid in keeping the loss in user utility to a minimum, while achieving fairer ranked lists. Our proposed approach, PUFR re-ranks the documents based on whether they are biased and the score distribution, which we obtain through Laplace approximation. We compare PUFR with existing baselines for bias mitigation with respect to the utility-fairness trade. We show that an uncertainty-based approach can provide an intuitive and flexible trade-off that outperforms all baselines without additional training requirements.

References

[1]
Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. 2016. Machine Bias. In Ethics of Data and Analytics. Auerbach Publications, 254--264.
[2]
Alex Beutel, Jilin Chen, Tulsee Doshi, Hai Qian, Li Wei, Yi Wu, Lukasz Heldt, Zhe Zhao, Lichan Hong, Ed H. Chi, and Cristos Goodrow. 2019. Fairness in Recommendation Ranking through Pairwise Comparisons. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2212--2220.
[3]
Asia J. Biega, Krishna P. Gummadi, and Gerhard Weikum. 2018. Equity of Attention: Amortizing Individual Fairness in Rankings. In Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval. 405--414.
[4]
Carlos Castillo. 2019. Fairness and Transparency in Ranking. In ACM SIGIR Forum, Vol. 52. 64--71.
[5]
L. Elisa Celis, Anay Mehrotra, and Nisheeth K. Vishnoi. 2020. Interventions for Ranking in the Presence of Implicit bias. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. 369--380.
[6]
L. Elisa Celis, Damian Straszak, and Nisheeth K. Vishnoi. 2018. Ranking with Fairness Constraints. In 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018). 28:1--28:15.
[7]
Le Chen, Ruijun Ma, Anikó Hannák, and Christo Wilson. 2018. Investigating the Impact of Gender on Rank in Resume Search Engines. ACM, 1--14.
[8]
Daniel Cohen, Kevin Du, Bhaskar Mitra, Laura Mercurio, Navid Rekabsaz, and Carsten Eickhoff. 2022. Inconsistent Ranking Assumptions in Medical Search and Their Downstream Consequences. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (Madrid, Spain). ACM, 2572--2577.
[9]
Daniel Cohen, Bhaskar Mitra, Oleg Lesota, Navid Rekabsaz, and Carsten Eickhoff. 2021. Not All Relevance Scores Are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 654--664.
[10]
Zhijie Deng, Feng Zhou, and Jun Zhu. 2022. Accelerated Linearized Laplace Approximation for Bayesian Deep Learning. In Advances in Neural Information Processing Systems, Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho (Eds.), Vol. 35. 2695--2708.
[11]
Fernando Diaz, Bhaskar Mitra, Michael D. Ekstrand, Asia J. Biega, and Ben Carterette. 2020. Evaluating Stochastic Rankings with Expected Exposure. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 275--284.
[12]
Qian Dong, Yiding Liu, Suqi Cheng, Shuaiqiang Wang, Zhicong Cheng, Shuzi Niu, and Dawei Yin. 2022. Incorporating Explicit Knowledge in Pre-Trained Language Models for Passage Re-Ranking. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1490--1501.
[13]
Michael D. Ekstrand, Graham McDonald, Amifa Raj, and Isaac Johnson. 2022. Overview of the TREC 2021 Fair Ranking Track. In The Thirtieth Text REtrieval Conference (TREC 2021) Proceedings.
[14]
Alessandro Fabris, Alberto Purpura, Gianmaria Silvello, and Gian Antonio Susto. 2020. Gender Stereotype Reinforcement: Measuring the Gender Bias Conveyed by Ranking Algorithms. Information Processing & Management, Vol. 57 (2020), 102377.
[15]
Yaroslav Ganin and Victor S. Lempitsky. 2015. Unsupervised Domain Adaptation by Backpropagation. In Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6--11 July 2015 (JMLR Workshop and Conference Proceedings), Vol. 37. 1180--1189.
[16]
Sahin Cem Geyik, Stuart Ambler, and Krishnaram Kenthapadi. 2019. Fairness-aware Ranking in Search & Recommendation Systems with Application to Linkedin Talent Search. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2221--2231.
[17]
Avijit Ghosh, Ritam Dutt, and Christo Wilson. 2021. When Fair Ranking Meets Uncertain Inference. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1033--1043.
[18]
Sruthi Gorantla, Amit Deshpande, and Anand Louis. 2021. On the Problem of Underranking in Group-Fair Ranking. In International Conference on Machine Learning. PMLR, 3777--3787.
[19]
Maria Heuss, Fatemeh Sarvi, and Maarten de Rijke. 2022. Fairness of Exposure in Light of Incomplete Exposure Estimation. In SIGIR 2022: 45th international ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 759--769.
[20]
Sebastian Hofst"atter, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin, and Allan Hanbury. 2021. Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 113--122.
[21]
Jon Kleinberg and Manish Raghavan. 2018. Selection Problems in the Presence of Implicit Bias. In 9th Innovations in Theoretical Computer Science Conference (ITCS 2018). Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, 33:1--33:17.
[22]
Till Kletti, Jean-Michel Renders, and Patrick Loiseau. 2022. Pareto-Optimal Fairness-Utility Amortizations in Rankings with a DBN Exposure Model. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 748--758.
[23]
Agustinus Kristiadi, Matthias Hein, and Philipp Hennig. 2020. Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks. In Proceedings of the 37th International Conference on Machine Learning. 5392--5402.
[24]
Preethi Lahoti, Krishna P. Gummadi, and Gerhard Weikum. 2019a. iFair: Learning Individually Fair Data Representations for Algorithmic Decision Making. In 2019 IEEE 35th International Conference on Data Engineering (ICDE). IEEE, 1334--1345.
[25]
Preethi Lahoti, Krishna P. Gummadi, and Gerhard Weikum. 2019b. Operationalizing Individual Fairness with Pairwise Fair Representations. Proceedings of the VLDB Endowment (2019), 506--518.
[26]
Jimmy Lin, Rodrigo Nogueira, and Andrew Yates. 2021. Pretrained Transformers for Text Ranking: BERT and Beyond. Synthesis Lectures on Human Language Technologies, Vol. 14, 4 (2021), 1--325.
[27]
David J. C. Mackay. 1992. A Practical Bayesian Framework for Backpropagation Networks. Neural Computation, Vol. 4 (1992), 448--472.
[28]
Anay Mehrotra and Nisheeth Vishnoi. 2022. Fair Ranking with Noisy Protected Attributes. In Advances in Neural Information Processing Systems, Vol. 35. 31711--31725.
[29]
Tri Nguyen, Mir Rosenberg, Xia Song, Jianfeng Gao, Saurabh Tiwary, Rangan Majumder, and Li Deng. 2016. MS MARCO: A Human Generated Machine Reading Comprehension Dataset. In Proceedings of the Workshop on Cognitive Computation: Integrating Neural and Symbolic Approaches 2016.
[30]
Rodrigo Frassetto Nogueira and Kyunghyun Cho. 2019. Passage Re-ranking with BERT. arXiv preprint arXiv:1901.04085 (2019).
[31]
Gourab K. Patro, Lorenzo Porcaro, Laura Mitchell, Qiuyue Zhang, Meike Zehlike, and Nikhil Garg. 2022. Fair Ranking: A Critical Review, Challenges, and Future Directions. In 2022 ACM Conference on Fairness, Accountability, and Transparency. ACM, 1929--1942.
[32]
Gustavo Penha and Claudia Hauff. 2021. On the Calibration and Uncertainty of Neural Learning to Rank Models for Conversational Search. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, EACL 2021, Online, April 19 - 23, 2021. Association for Computational Linguistics, 160--170.
[33]
Navid Rekabsaz, Simone Kopeinik, and Markus Schedl. 2021. Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation of Bert Rankers. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 306--316.
[34]
Navid Rekabsaz and Markus Schedl. 2020. Do Neural Ranking Models Intensify Gender Bias?. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2065--2068.
[35]
Hippolyt Ritter, Aleksandar Botev, and David Barber. 2018. A Scalable Laplace Approximation for Neural Networks. In 6th International Conference on Learning Representations, ICLR 2018-Conference Track Proceedings. International Conference on Representation Learning.
[36]
Stephen E Robertson. 1977. The Probability Ranking Principle in IR. Journal of Documentation, Vol. 33 (1977), 294--304.
[37]
Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, and Matei Zaharia. 2022. ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. ACM, 3715--3734.
[38]
Fatemeh Sarvi, Maria Heuss, Mohammad Aliannejadi, Sebastian Schelter, and Maarten de Rijke. 2022. Understanding and Mitigating the Effect of Outliers in Fair Ranking. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 861--869.
[39]
Ashudeep Singh and Thorsten Joachims. 2018. Fairness of Exposure in Rankings. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2219--2228.
[40]
Ashudeep Singh and Thorsten Joachims. 2019. Policy Learning for Fairness in Ranking. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. 5426--5436.
[41]
Ashudeep Singh, David Kempe, and Thorsten Joachims. 2021. Fairness in Ranking under Uncertainty. In Advances in Neural Information Processing Systems, Vol. 34. 11896--11908.
[42]
Julia Stoyanovich, Ke Yang, and HV Jagadish. 2018. Online Set Selection with Fairness and Diversity Constraints. In 21st International Conference on Extending Database Technology, EDBT 2018. 241--252.
[43]
Iulia Turc, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. arXiv preprint arXiv:1908.08962 (2019).
[44]
Lequn Wang and Thorsten Joachims. 2021. User Fairness, Item Fairness, and Diversity for Rankings in Two-Sided Markets. In Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval. 23--41.
[45]
Ke Yang, Vasilis Gkatzelis, and Julia Stoyanovich. 2019. Balanced Ranking with Diversity Constraints. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI.
[46]
Tao Yang, Chen Luo, Hanqing Lu, Parth Gupta, Bing Yin, and Qingyao Ai. 2022. Can Clicks Be Both Labels and Features? Unbiased Behavior Feature Collection and Uncertainty-Aware Learning to Rank. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 6--17.
[47]
Tao Yang, Zhichao Xu, Zhenduo Wang, Anh Tran, and Qingyao Ai. 2023. Marginal-Certainty-aware Fair Ranking Algorithm. In Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining. 24--32.
[48]
Meike Zehlike, Francesco Bonchi, Carlos Castillo, Sara Hajian, Mohamed Megahed, and Ricardo Baeza-Yates. 2017. FA*IR: A Fair Top-k Ranking Algorithm. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 1569--1578.
[49]
Meike Zehlike and Carlos Castillo. 2020. Reducing Disparate Exposure in Ranking: A Learning To Rank Approach. In Proceedings of The Web Conference 2020. 2849--2855.
[50]
Meike Zehlike, Tom Sühr, Ricardo Baeza-Yates, Francesco Bonchi, Carlos Castillo, and Sara Hajian. 2022a. Fair Top-k Ranking with Multiple Protected Groups. Information Processing & Management, Vol. 59, 1 (2022), 102707.
[51]
Meike Zehlike, Ke Yang, and Julia Stoyanovich. 2022b. Fairness in Ranking, Part I: Score-based Ranking. Comput. Surveys, Vol. 55, 6 (2022), 1--36.
[52]
Meike Zehlike, Ke Yang, and Julia Stoyanovich. 2022c. Fairness in Ranking, Part II: Learning-to-rank and Recommender Systems. Comput. Surveys, Vol. 55, 6 (2022), 1--41.
[53]
George Zerveas, Navid Rekabsaz, Daniel Cohen, and Carsten Eickhoff. 2022. Mitigating Bias in Search Results Through Contextual Document Reranking and Neutrality Regularization. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2532--2538.
[54]
Jianhan Zhu, Jun Wang, Michael Taylor, and Ingemar J. Cox. 2009. Risk-Aware Information Retrieval. In Advances in Information Retrieval: 31th European Conference on IR Research, ECIR 2009, Toulouse, France, April 6--9, 2009. Proceedings 31. Springer-Verlag, 17--28.

Cited By

View all
  • (2024)Stability and multigroup fairness in ranking with uncertain predictionsProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692494(10661-10686)Online publication date: 21-Jul-2024
  • (2024)Report on the 21st Dutch-Belgian Information Retrieval Workshop (DIR 2023)ACM SIGIR Forum10.1145/3642979.364300457:2(1-5)Online publication date: 22-Jan-2024
  • (2024)A Self-Adaptive Fairness Constraint Framework for Industrial Recommender SystemProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680099(4726-4733)Online publication date: 21-Oct-2024
  • Show More Cited By

Index Terms

  1. Predictive Uncertainty-based Bias Mitigation in Ranking

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
      October 2023
      5508 pages
      ISBN:9798400701245
      DOI:10.1145/3583780
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 21 October 2023

      Check for updates

      Author Tags

      1. fairness
      2. mitigating bias
      3. uncertainty
      4. utility-fairness trade-off

      Qualifiers

      • Research-article

      Funding Sources

      • Hybrid Intelligence Centre
      • LESSEN

      Conference

      CIKM '23
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)438
      • Downloads (Last 6 weeks)43
      Reflects downloads up to 06 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Stability and multigroup fairness in ranking with uncertain predictionsProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692494(10661-10686)Online publication date: 21-Jul-2024
      • (2024)Report on the 21st Dutch-Belgian Information Retrieval Workshop (DIR 2023)ACM SIGIR Forum10.1145/3642979.364300457:2(1-5)Online publication date: 22-Jan-2024
      • (2024)A Self-Adaptive Fairness Constraint Framework for Industrial Recommender SystemProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680099(4726-4733)Online publication date: 21-Oct-2024
      • (2024)Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading BanditsProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679763(1638-1648)Online publication date: 21-Oct-2024
      • (2024)Wise Fusion: Group Fairness Enhanced Rank FusionProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679649(163-174)Online publication date: 21-Oct-2024

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media