research-article

SoftRank: optimizing non-smooth rank metrics

Authors:

Michael Taylor,

Stephen Robertson,

Tom MinkaAuthors Info & Claims

WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining

Pages 77 - 86

https://doi.org/10.1145/1341531.1341544

Published: 11 February 2008 Publication History

Abstract

We address the problem of learning large complex ranking functions. Most IR applications use evaluation metrics that depend only upon the ranks of documents. However, most ranking functions generate document scores, which are sorted to produce a ranking. Hence IR metrics are innately non-smooth with respect to the scores, due to the sort. Unfortunately, many machine learning algorithms require the gradient of a training objective in order to perform the optimization of the model parameters,and because IR metrics are non-smooth,we need to find a smooth proxy objective that can be used for training. We present a new family of training objectives that are derived from the rank distributions of documents, induced by smoothed scores. We call this approach SoftRank. We focus on a smoothed approximation to Normalized Discounted Cumulative Gain (NDCG), called SoftNDCG and we compare it with three other training objectives in the recent literature. We present two main results. First, SoftRank yields a very good way of optimizing NDCG. Second, we show that it is possible to achieve state of the art test set NDCG results by optimizing a soft NDCG objective on the training set with a different discount function

References

[1]

E. Agichtein, E. Brill, and S. Dumais. Improving web search ranking by incporporating user behavior information. In SIGIR 2006.

Digital Library

[2]

H. Balakrishnan, I. Hwang, and C. Tomlin. Polynomial approximation algorithms for belief matrix maintenance in identity management. In IEEE Conference on Decision and Control 2004.

[3]

C. Burges, R. Ragno, and Q. V. L. Le. Learning to rank with nonsmooth cost functions. In NIPS 2006.

[4]

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. In ICML 2005.

Digital Library

[5]

Z. Cao, T. Qin, T.-Y. Liu, M.-F. Tsai, and H. Li. Learning to rank: From pairwise approach to listwise approach. In ICML 2007.

Digital Library

[6]

R. Carnuana, S. Baluja, and T. Mitchell. Using the future to "sort out" the present: Rankprop and multitask learning for medical risk evaluation. In NIPS 8 1996.

[7]

W. Chu and Z. Ghahramani. Gaussian processes for ordinal regression. Journal of Machine Learning Research 6: 1019--1041, 2005.

Digital Library

[8]

K. Crammer and Y. Singer. Pranking with ranking. In pos-NIPS 14 2002.

[9]

R. Herbrich, T. Graepel, and K. Obermayer. Large margin rank boundaries for ordinal regression. In pow-Advances in Large Margin Classifiers pages 115--132. MIT Press, 2000.

[10]

Järvelin and J. Kekäläinen. IR evaluation methods for retrieving highly relevant documents. In SIGIR 2000.

Digital Library

[11]

T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of Knowledge Discovery in Databases 2002.

Digital Library

[12]

Y. LeCun, L. Bottou, G. Orr, and K.-R. Müller. Efficient backprop, 1998.

Digital Library

[13]

D. Metzler, T. Strohman, and W. Croft. Indri at trec 2006: Lessons learned from three terabyte tracks. In online Proceedings of Text REtrieval Conference 2005.

[14]

A. Papoulis. Probability, Random Variables and Stochastic Processes, Third Edition McGraw-Hill 1991.

[15]

S. Robertson, H. Zaragoza, and M. Taylor. A simple BM 25 extension to multiple weighted fields. In CIKM pages 42--29, 2004.

Digital Library

[16]

M. Taylor, H. Zaragoza, N. Craswell, S. Robertson, and C. Burges. Optimisation methods for ranking functions with multiple parameters. In CIKM 2006.

Digital Library

Cited By

Qiu ZHu QZhong YTu WZhang LYang T(2025)Optimal large-scale stochastic optimization of NDCG surrogates for deep learningMachine Learning10.1007/s10994-024-06631-x114:2Online publication date: 27-Jan-2025
https://doi.org/10.1007/s10994-024-06631-x
Devic SKorolova AKempe DSharan VSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Stability and multigroup fairness in ranking with uncertain predictionsProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692494(10661-10686)Online publication date: 21-Jul-2024
https://dl.acm.org/doi/10.5555/3692070.3692494
Xi YLiu WDai XTang RLiu QZhang WYu Y(2024)Utility-Oriented Reranking with Counterfactual ContextACM Transactions on Knowledge Discovery from Data10.1145/367100418:8(1-22)Online publication date: 4-Jun-2024
https://dl.acm.org/doi/10.1145/3671004
Show More Cited By

Index Terms

SoftRank: optimizing non-smooth rank metrics
1. Information systems
  1. Information retrieval

Recommendations

Learning to rank with SoftRank and Gaussian processes
SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

In this paper we address the issue of learning to rank for document retrieval using Thurstonian models based on sparse Gaussian processes. Thurstonian models represent each document for a given query as a probability distribution in a score space; these ...
ERR.Rank: An algorithm based on learning to rank for direct optimization of Expected Reciprocal Rank

Learning to rank (LTR) is a machine learning-based ranking technique that constructs a ranking model to sort objects in response to a query, and is used in many applications especially in information retrieval. LTR ranking models are generally evaluated ...
Re-ranking search results using query logs
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management

This work addresses two common problems in search, frequently occurring with underspecified user queries: the top-ranked results for such queries may not contain documents relevant to the user's search intent, and fresh and relevant pages may not get ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining

February 2008

270 pages

ISBN:9781595939272

DOI:10.1145/1341531

General Chair:
Marc Najork
Microsoft, USA
,
Program Chairs:
Andrei Broder
Yahoo!, USA
,
Soumen Chakrabarti
IIT Bombay, India

Copyright © 2008 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 February 2008

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Acceptance Rates

Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

235
Total Citations
View Citations
1,358
Total Downloads

Downloads (Last 12 months)55
Downloads (Last 6 weeks)7

Reflects downloads up to 31 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Qiu ZHu QZhong YTu WZhang LYang T(2025)Optimal large-scale stochastic optimization of NDCG surrogates for deep learningMachine Learning10.1007/s10994-024-06631-x114:2Online publication date: 27-Jan-2025
https://doi.org/10.1007/s10994-024-06631-x
Devic SKorolova AKempe DSharan VSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Stability and multigroup fairness in ranking with uncertain predictionsProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692494(10661-10686)Online publication date: 21-Jul-2024
https://dl.acm.org/doi/10.5555/3692070.3692494
Xi YLiu WDai XTang RLiu QZhang WYu Y(2024)Utility-Oriented Reranking with Counterfactual ContextACM Transactions on Knowledge Discovery from Data10.1145/367100418:8(1-22)Online publication date: 4-Jun-2024
https://dl.acm.org/doi/10.1145/3671004
Zhang CChen WZhang WXu M(2024)Mitigating the Impact of Inaccurate Feedback in Dynamic Learning-to-Rank: A Study of Overlooked Interesting ItemsACM Transactions on Intelligent Systems and Technology10.1145/365398316:1(1-26)Online publication date: 26-Dec-2024
https://dl.acm.org/doi/10.1145/3653983
Vecchiato TLucchese CNardini FBruch SHui Yang GWang HHan SHauff CZuccon GZhang Y(2024)A Learning-to-Rank Formulation of Clustering-Based Approximate Nearest Neighbor SearchProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657931(2261-2265)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.1145/3626772.3657931
Kang Jde Rijke MOosterhuis HHui Yang GWang HHan SHauff CZuccon GZhang Y(2024)Estimating the Hessian Matrix of Ranking Objectives for Stochastic Learning to Rank with Gradient Boosted TreesProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657918(2390-2394)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.1145/3626772.3657918
Cheng QTan ZWen KChen CGu X(2024)Semantic Pre-Alignment and Ranking Learning With Unified Framework for Cross-Modal RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2022.318254934:7(6503-6516)Online publication date: Jul-2024
https://doi.org/10.1109/TCSVT.2022.3182549
Lin ZLi ZLiao XWang XLiu X(2024)MAWSEO: Adversarial Wiki Search Poisoning for Illicit Online Promotion2024 IEEE Symposium on Security and Privacy (SP)10.1109/SP54263.2024.00049(388-406)Online publication date: 19-May-2024
https://doi.org/10.1109/SP54263.2024.00049
Uzunoglu AGahm CTuma A(2024)Machine Learning based Algorithm Selection and Genetic Algorithms for serial-batch schedulingComputers & Operations Research10.1016/j.cor.2024.106827(106827)Online publication date: Aug-2024
https://doi.org/10.1016/j.cor.2024.106827
Shi YZhang HLi NYang T(2024)An overview of sentence ordering taskInternational Journal of Data Science and Analytics10.1007/s41060-024-00550-918:1(1-18)Online publication date: 25-Apr-2024
https://doi.org/10.1007/s41060-024-00550-9
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten