Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

An Online Learning Framework for Refining Recency Search Results with User Click Feedback

Published: 01 November 2012 Publication History

Abstract

Traditional machine-learned ranking systems for Web search are often trained to capture stationary relevance of documents to queries, which have limited ability to track nonstationary user intention in a timely manner. In recency search, for instance, the relevance of documents to a query on breaking news often changes significantly over time, requiring effective adaptation to user intention. In this article, we focus on recency search and study a number of algorithms to improve ranking results by leveraging user click feedback. Our contributions are threefold. First, we use commercial search engine sessions collected in a random exploration bucket for reliable offline evaluation of these algorithms, which provides an unbiased comparison across algorithms without online bucket tests. Second, we propose an online learning approach that reranks and improves the search results for recency queries near real-time based on user clicks. This approach is very general and can be combined with sophisticated click models. Third, our empirical comparison of a dozen algorithms on real-world search data suggests importance of a few algorithmic choices in these applications, including generalization across different query-document pairs, specialization to popular queries, and near real-time adaptation of user clicks for reranking.

References

[1]
Agarwal, D., Chen, B., and Elango, P. 2009. Explore/exploit schemes for web content optimization. In Proceedings of the International Conference on Data Mining.
[2]
Agarwal, D., Chen, B., and Elango, P. 2010. Fast online learning through offline initialization for time-sensitive recommendation. In Proceedings of the ACM SIGKDD International Conference On Knowledge Discovery and Data Mining.
[3]
Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., and Hulldender, G. 2005. Learning to rank using gradient descent. In Proceedings of the International Conference on Machine Learning.
[4]
Burges, C. J., Le, Q. V., and Ragno, R. 2007. Learning to rank with nonsmooth cost functions. In Advances in Neural Information Processing Systems, vol. 19, B. Schölkopf, J. Platt, and T. Hofmann Eds.
[5]
Chapelle, O. and Zhang, Y. 2009. A dynamic bayesian network click model for web search ranking. In Proceedings of the 18th International World Wide Web Conference. ACM, 1--10.
[6]
Cortes, C., Mohri, M., and Rastogi, A. 2007. Magnitude-preserving ranking algorithms. In Proceedings of the 24th International Conference on Machine Learning.
[7]
Dai, N. and Davison, B. D. 2010. Freshness matters: In flowers, food and web authority. In Proceedings of the 3rd ACM SIGIR Conference.
[8]
Dong, A., Chang, Y., Zheng, Z., Mishne, G., Bai, J., Zhang, R., Buchner, K., Liao, C., and Diaz, F. 2010a. Towards recency ranking in web search. In Proceedings of the 3rd International ACM Conference on Web Search and Data Mining.
[9]
Dong, A., Zhang, R., Kolari, P., Bai, J., Diaz, F., Chang, Y., and Zheng, Z. 2010b. Time is of the essence: Improving recency ranking using twitter data. In Proceedings of the 19th International ACM Conference on World Wide Web.
[10]
Dupret, G. and Liao, C. 2010. Cumulated relevance: A model to estimate document relevance from the click-through logs of a web search engine. In Proceedings of the 3rd International ACM Conference on Web Search and Data Mining.
[11]
Elsas, J. and Dumais, S. 2010. Leveraging temporal dynamics of document content in relevance ranking. In Proceedings of the 3rd International Conference on Web Search and Data Mining.
[12]
Freund, Y., Iyer, R., Schapire, R., and Singer, Y. 2003. An efficient boosting algorithm for combining preferences. J. Mach. Learn. Resear. 4, 933--969.
[13]
Graepel, T., Candela, J. Q., Borchert, T., and Herbrich, R. 2010. Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft’s Bing search engine. In Proceedings of the 27th International Conference on Machine Learning. 13--20.
[14]
Guo, F., Liu, C., Kannan, A., Minka, T., Taylor, M., Wang, Y., and Faloutsos, C. 2009. Click chain model in web search. In Proceedings of 18th International World Wide Web Conference.
[15]
Inagaki, Y., Sadagopan, N., Dupret, G., Liao, C., Dong, A., Chang, Y., and Zheng, Z. 2010. Session based click features for recency ranking. In Proceedings of the 24th AAAI Conference on Artificial Intelligence.
[16]
Järvelin, K. and Kekäläinen, J. 2002. Cumulated gain-based evaluation of ir techniques. ACM Trans. Inf. Syst. 20.
[17]
Ji, S., Zhao, K., Liao, C., Zheng, Z., Xue, G., Chapelle, O., Sun, G., and Zha, H. 2009. Global ranking by exploiting user clicks. In Proceedings of the 28th International ACM SIGIR Conference. 35--42.
[18]
Joachims, T. 2002a. Optimizing search engines using clickthrough data. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM Press, New York, NY, 133--142.
[19]
Joachims, T. 2002b. Optimizing search engines using clickthrough data. In Proceedings of the ACM Conference on Knowledge Discovery and Data Mining. ACM.
[20]
Jones, R. and Diaz, F. 2007. Temporal profiles of queries. ACM Trans. Inf. Syst. 25.
[21]
Kang, C., Wang, X., Chen, J., Liao, C., Chang, Y., Tseng, B., and Zheng, Z. 2011. Learning to re-rank web search results with multiple pairwise features. In Proceedings of the 4th International Conference on Web Search and Web Data Mining.
[22]
Koren, Y. 2009. Collaborative filtering with temporal dynamics. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
[23]
Kulkarni, A., Teevan, J., Svore, K. M., and Dumais, S. 2011. Understanding temporal query dynamics. In Proceedings of the 4th International Conference on Web Search and Web Data Mining.
[24]
Langford, J., Li, L., and Strehl, A. 2007. Vowpal wabbit online learning project. http://hunch.net/?p=309.
[25]
Li, L., Chu, W., Langford, J., and Schapire, R. 2010. A contextual bandit approach to personalized news article recommendation. In Proceedings of 19th World Wide Web Conference.
[26]
Li, L., Chu, W., Langford, J., and Wang, X. 2011. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In Proceedings of the 4th International Conference on Web Search and Web Data Mining.
[27]
Liu, C., Li, M., and Wang, Y.-M. 2009. Post-rank reordering: Resolving preference misalignments between search engines and end users. In Proceedings of the 18th ACM Conference on Information and Knowledge Management.
[28]
Liu, T. Y. 2009. Learning to Rank for Information Retrieval. Now Publishers.
[29]
Moon, T., Li, L., Chu, W., Liao, C., Zheng, Z., and Chang, Y. 2010. Online learning for recency search ranking using real-time user feedback. In Proceedings of the 19th International Conference on Knowledge Management.
[30]
Radlinski, F. and Joachims, T. 2007. Active exploration for learning rankings from click-through data. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
[31]
Radlinski, F., Kurup, M., and Joachims, T. 2008. How does click-through data reflect retrieval quality? In Proceedings of 17th ACM Conference on Information and Knowledge Management.
[32]
Robbins, H. 1952. Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc. 58, 5, 527--535.
[33]
Stern, D., Herbrich, R., and Graepel, T. 2009. Matchbox: Large scale online bayesian recommendations. In Proceedings of the 18th International World Wide Web Conference.
[34]
Yue, Y. and Joachims, T. 2009. Interactively optimizing information retrieval systems as a dueling bandits problem. In Proceedings of the 26th International Conference on Machine Learning.
[35]
Zhang, V. and Jones, R. 2007. Comparing click logs and editorial labels for training query rewriting. In Query Log Analysis: Social and Technological Challenges. A Workshop at the 16th International World Wide Web Conference.
[36]
Zheng, Z., Zha, H., Zhang, T., Chapelle, O., and Chen, K. 2008. A general boosting method and its application to learning ranking functions for web search. In Advances in Neural Information Processing Systems, vol. 20. MIT Press, Cambridge, MA.
[37]
Zhu, Z. A., Chen, W., Minka, T., Zhu, C., and Chen, Z. 2010. A novel click model and its applications to online advertising. In Proceedings of the 3rd International Conference on Web Search and Web Data Mining.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Information Systems
ACM Transactions on Information Systems  Volume 30, Issue 4
November 2012
216 pages
ISSN:1046-8188
EISSN:1558-2868
DOI:10.1145/2382438
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 November 2012
Accepted: 01 March 2012
Revised: 01 January 2012
Received: 01 March 2011
Published in TOIS Volume 30, Issue 4

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)3
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Click-Through Rate Analysis for Understanding User Behavior in Information Retrieval2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT61001.2024.10725365(1-7)Online publication date: 24-Jun-2024
  • (2021)Content and link-structure perspective of ranking webpagesComputer Science Review10.1016/j.cosrev.2021.10039740:COnline publication date: 1-May-2021
  • (2021)A day at the racesApplied Intelligence10.1007/s10489-021-02719-2Online publication date: 17-Aug-2021
  • (2020)A Game-theoretic Approach to Data InteractionACM Transactions on Database Systems10.1145/335145045:1(1-44)Online publication date: 8-Feb-2020
  • (2020)Scalable Delay-Sensitive Polling of SensorsIEEE Access10.1109/ACCESS.2020.30262378(180231-180245)Online publication date: 2020
  • (2019)How Do Humans and Data Systems Establish a CommonQuery Language?ACM SIGMOD Record10.1145/3371316.337132948:1(51-58)Online publication date: 5-Nov-2019
  • (2019)Personalised Reranking of Paper Recommendations Using Paper Content and User BehaviorACM Transactions on Information Systems10.1145/331252837:3(1-23)Online publication date: 16-Mar-2019
  • (2018)The Data Interaction GameProceedings of the 2018 International Conference on Management of Data10.1145/3183713.3196899(83-98)Online publication date: 27-May-2018
  • (2017)Value Congruence, Trust, and Their Effects on Purchase Intention and Reservation PriceACM Transactions on Management Information Systems10.1145/31109398:4(1-28)Online publication date: 30-Aug-2017
  • (2017)Learning Online Trends for Interactive Query Auto-CompletionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.273863929:11(2442-2454)Online publication date: 1-Nov-2017
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media