research-article

Exploitation and exploration in a performance based contextual advertising system

Authors:

Wei Li,

Rong Jin

KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 27 - 36

https://doi.org/10.1145/1835804.1835811

Published: 25 July 2010


Abstract

The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize on better-performing ads. The streaming nature of online data inevitably forces an advertising system to choose between maximizing its expected revenue in the short term according to its current knowledge (exploitation) and trying to learn more about the unknown to improve that knowledge (exploration), since the latter might increase its revenue in the future. The exploitation and exploration (EE) tradeoff has been extensively studied in the reinforcement learning community, but it has received little attention in online advertising until recently. In this paper, we develop two novel EE strategies for online advertising. Specifically, our methods adaptively balance the two aspects of EE by automatically learning the optimal tradeoff and incorporating confidence metrics of historical performance. Within a deliberately designed offline simulation framework, we apply our algorithms to an industry-leading performance-based contextual advertising system and conduct extensive evaluations with real online event log data. The experimental results and detailed analysis reveal several important findings about EE behaviors in online advertising and demonstrate that our algorithms achieve superior performance in terms of ad reach and click-through rate (CTR).
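To make the exploitation and exploration tradeoff concrete, the following is a minimal sketch of the classic UCB1 rule applied to ad selection. It is not one of the two strategies developed in the paper, whose details the abstract does not give; it only illustrates the general idea of adding a confidence term to historical performance. The function names `ucb1_select` and `simulate`, and the click-through rates passed to `simulate`, are assumptions made for illustration.

```python
import math
import random


def ucb1_select(clicks, impressions):
    """Return the index of the ad to show next under the UCB1 rule.

    Score = empirical CTR (exploitation) + confidence bonus (exploration).
    The bonus shrinks as an ad accumulates impressions.
    """
    # Show every ad once before trusting any estimate.
    for ad, shown in enumerate(impressions):
        if shown == 0:
            return ad
    total = sum(impressions)
    scores = [
        clicks[ad] / impressions[ad]
        + math.sqrt(2.0 * math.log(total) / impressions[ad])
        for ad in range(len(impressions))
    ]
    return max(range(len(scores)), key=scores.__getitem__)


def simulate(true_ctrs, rounds, seed=0):
    """Replay UCB1 against hypothetical, fixed click-through rates."""
    rng = random.Random(seed)
    clicks = [0] * len(true_ctrs)
    impressions = [0] * len(true_ctrs)
    for _ in range(rounds):
        ad = ucb1_select(clicks, impressions)
        impressions[ad] += 1
        # A click is a Bernoulli draw with the ad's (assumed) true CTR.
        if rng.random() < true_ctrs[ad]:
            clicks[ad] += 1
    return clicks, impressions
```

For example, `ucb1_select([50, 0], [100, 1])` picks the second ad even though its observed CTR is zero, because a single impression leaves its estimate highly uncertain. With equal impression counts, as in `ucb1_select([5, 1], [10, 10])`, the bonus is identical for both ads and the rule reduces to choosing the higher empirical CTR.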

Index Terms

Exploitation and exploration in a performance based contextual advertising system
1. Information systems
  1. World Wide Web
    1. Web applications
    2. Web services

Information & Contributors

Information

Published In

KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining

July 2010

1240 pages

ISBN:9781450300551

DOI:10.1145/1835804

General Chairs: Bharat Rao (Siemens), Balaji Krishnapuram (Siemens)

Program Chairs: Andrew Tomkins (Google Inc.), Qiang Yang (Hong Kong University of Science and Technology)

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

KDD '10

KDD '10: The 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

July 25 - 28, 2010

Washington, DC, USA

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%
