Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1341531.1341544acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

SoftRank: optimizing non-smooth rank metrics

Published: 11 February 2008 Publication History

Abstract

We address the problem of learning large complex ranking functions. Most IR applications use evaluation metrics that depend only upon the ranks of documents. However, most ranking functions generate document scores, which are sorted to produce a ranking. Hence IR metrics are innately non-smooth with respect to the scores, due to the sort. Unfortunately, many machine learning algorithms require the gradient of a training objective in order to perform the optimization of the model parameters,and because IR metrics are non-smooth,we need to find a smooth proxy objective that can be used for training. We present a new family of training objectives that are derived from the rank distributions of documents, induced by smoothed scores. We call this approach SoftRank. We focus on a smoothed approximation to Normalized Discounted Cumulative Gain (NDCG), called SoftNDCG and we compare it with three other training objectives in the recent literature. We present two main results. First, SoftRank yields a very good way of optimizing NDCG. Second, we show that it is possible to achieve state of the art test set NDCG results by optimizing a soft NDCG objective on the training set with a different discount function

References

[1]
E. Agichtein, E. Brill, and S. Dumais. Improving web search ranking by incporporating user behavior information. In SIGIR 2006.
[2]
H. Balakrishnan, I. Hwang, and C. Tomlin. Polynomial approximation algorithms for belief matrix maintenance in identity management. In IEEE Conference on Decision and Control 2004.
[3]
C. Burges, R. Ragno, and Q. V. L. Le. Learning to rank with nonsmooth cost functions. In NIPS 2006.
[4]
C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. In ICML 2005.
[5]
Z. Cao, T. Qin, T.-Y. Liu, M.-F. Tsai, and H. Li. Learning to rank: From pairwise approach to listwise approach. In ICML 2007.
[6]
R. Carnuana, S. Baluja, and T. Mitchell. Using the future to "sort out" the present: Rankprop and multitask learning for medical risk evaluation. In NIPS 8 1996.
[7]
W. Chu and Z. Ghahramani. Gaussian processes for ordinal regression. Journal of Machine Learning Research 6: 1019--1041, 2005.
[8]
K. Crammer and Y. Singer. Pranking with ranking. In pos-NIPS 14 2002.
[9]
R. Herbrich, T. Graepel, and K. Obermayer. Large margin rank boundaries for ordinal regression. In pow-Advances in Large Margin Classifiers pages 115--132. MIT Press, 2000.
[10]
Järvelin and J. Kekäläinen. IR evaluation methods for retrieving highly relevant documents. In SIGIR 2000.
[11]
T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of Knowledge Discovery in Databases 2002.
[12]
Y. LeCun, L. Bottou, G. Orr, and K.-R. Müller. Efficient backprop, 1998.
[13]
D. Metzler, T. Strohman, and W. Croft. Indri at trec 2006: Lessons learned from three terabyte tracks. In online Proceedings of Text REtrieval Conference 2005.
[14]
A. Papoulis. Probability, Random Variables and Stochastic Processes, Third Edition McGraw-Hill 1991.
[15]
S. Robertson, H. Zaragoza, and M. Taylor. A simple BM 25 extension to multiple weighted fields. In CIKM pages 42--29, 2004.
[16]
M. Taylor, H. Zaragoza, N. Craswell, S. Robertson, and C. Burges. Optimisation methods for ranking functions with multiple parameters. In CIKM 2006.

Cited By

View all
  • (2025)Optimal large-scale stochastic optimization of NDCG surrogates for deep learningMachine Learning10.1007/s10994-024-06631-x114:2Online publication date: 27-Jan-2025
  • (2024)Stability and multigroup fairness in ranking with uncertain predictionsProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692494(10661-10686)Online publication date: 21-Jul-2024
  • (2024)Utility-Oriented Reranking with Counterfactual ContextACM Transactions on Knowledge Discovery from Data10.1145/367100418:8(1-22)Online publication date: 4-Jun-2024
  • Show More Cited By

Index Terms

  1. SoftRank: optimizing non-smooth rank metrics

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining
    February 2008
    270 pages
    ISBN:9781595939272
    DOI:10.1145/1341531
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 February 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. gradient descent
    2. learning
    3. metrics
    4. optimization
    5. ranking

    Qualifiers

    • Research-article

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)55
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 31 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Optimal large-scale stochastic optimization of NDCG surrogates for deep learningMachine Learning10.1007/s10994-024-06631-x114:2Online publication date: 27-Jan-2025
    • (2024)Stability and multigroup fairness in ranking with uncertain predictionsProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692494(10661-10686)Online publication date: 21-Jul-2024
    • (2024)Utility-Oriented Reranking with Counterfactual ContextACM Transactions on Knowledge Discovery from Data10.1145/367100418:8(1-22)Online publication date: 4-Jun-2024
    • (2024)Mitigating the Impact of Inaccurate Feedback in Dynamic Learning-to-Rank: A Study of Overlooked Interesting ItemsACM Transactions on Intelligent Systems and Technology10.1145/365398316:1(1-26)Online publication date: 26-Dec-2024
    • (2024)A Learning-to-Rank Formulation of Clustering-Based Approximate Nearest Neighbor SearchProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657931(2261-2265)Online publication date: 10-Jul-2024
    • (2024)Estimating the Hessian Matrix of Ranking Objectives for Stochastic Learning to Rank with Gradient Boosted TreesProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657918(2390-2394)Online publication date: 10-Jul-2024
    • (2024)Semantic Pre-Alignment and Ranking Learning With Unified Framework for Cross-Modal RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2022.318254934:7(6503-6516)Online publication date: Jul-2024
    • (2024)MAWSEO: Adversarial Wiki Search Poisoning for Illicit Online Promotion2024 IEEE Symposium on Security and Privacy (SP)10.1109/SP54263.2024.00049(388-406)Online publication date: 19-May-2024
    • (2024)Machine Learning based Algorithm Selection and Genetic Algorithms for serial-batch schedulingComputers & Operations Research10.1016/j.cor.2024.106827(106827)Online publication date: Aug-2024
    • (2024)An overview of sentence ordering taskInternational Journal of Data Science and Analytics10.1007/s41060-024-00550-918:1(1-18)Online publication date: 25-Apr-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media