Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1835804.1835811acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Exploitation and exploration in a performance based contextual advertising system

Published: 25 July 2010 Publication History

Abstract

The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of online data inevitably makes an advertising system choose between maximizing its expected revenue according to its current knowledge in short term (exploitation) and trying to learn more about the unknown to improve its knowledge (exploration), since the latter might increase its revenue in the future. The exploitation and exploration (EE) tradeoff has been extensively studied in the reinforcement learning community, however, not been paid much attention in online advertising until recently. In this paper, we develop two novel EE strategies for online advertising. Specifically, our methods can adaptively balance the two aspects of EE by automatically learning the optimal tradeoff and incorporating confidence metrics of historical performance. Within a deliberately designed offline simulation framework we apply our algorithms to an industry leading performance based contextual advertising system and conduct extensive evaluations with real online event log data. The experimental results and detailed analysis reveal several important findings of EE behaviors in online advertising and demonstrate that our algorithms perform superiorly in terms of ad reach and click-through-rate (CTR).

References

[1]
Deepak Agarwal, Bee-Chung Chen, and Pradheep Elango. Explore/exploit schemes for web content optimization. In Proceedings of the 9th IEEE International Conference on Data Mining, pages 1--10, 2009.
[2]
Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3:397--422, 2002.
[3]
John Battelle. The search: how Google and its rivals rewrote the rules of business and transformed our culture. Nicholas Brearley Publishing, 2005.
[4]
Interactive Advertising Bureau. Internet ad revenues at $10.9 billion for first half of '09, 2009. http://www.iab.net/media/file/IAB-Ad-Revenue-Six-month-2009.pdf.
[5]
Nicolo Cesa-Bianchi and Gabor Lugosi. Prediction, Learning, and Games. Cambridge University Press, 2006.
[6]
Deepayan Chakrabarti, Deepak Agarwal, and Vanja Josifovski. Contextual advertising by combining relevance with click feedback. In Proceeding of the 17th International Conference on World Wide Web, pages 417--426, 2008.
[7]
Patrali Chatterjee, Donna Hoffman, and Thomas Novak. Modeling the clickstream: Implications for Web-based advertising efforts. Marketing Science, 22(4):520--541, 2003.
[8]
Sham Kakade, Shai Shalev-Shwartz, and Ambuj Tewari. Efficient bandit algorithms for online multiclass prediction. In Proceedings of the 25th International Conference on Machine learning, pages 440--447, 2008.
[9]
Jyrki Kivinen and Manfred K. Warmuth. Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132:1--63, 1997.
[10]
John Langford and Tong Zhang. The epoch-greedy algorithm for multi-armed bandits with side information. In Advances in Neural Information Processing Systems 20, pages 817--824, 2008.
[11]
Herbert Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58:527--535, 1952.
[12]
Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
[13]
Chris Watkins. Learning from delayed rewards. PhD thesis, Cambridge University, 1989.

Cited By

View all
  • (2024)Neural Contextual Bandits for Personalized RecommendationCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641241(1246-1249)Online publication date: 13-May-2024
  • (2023)Adversarial Group Linear Bandits and Its Application to Collaborative Edge InferenceIEEE INFOCOM 2023 - IEEE Conference on Computer Communications10.1109/INFOCOM53939.2023.10228900(1-10)Online publication date: 17-May-2023
  • (2023)Federated Linear Bandit Learning via Over-the-air ComputationGLOBECOM 2023 - 2023 IEEE Global Communications Conference10.1109/GLOBECOM54140.2023.10437441(1363-1368)Online publication date: 4-Dec-2023
  • Show More Cited By

Index Terms

  1. Exploitation and exploration in a performance based contextual advertising system

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 July 2010

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. click feedback
      2. click-through-rate
      3. exploitation and exploration
      4. online advertising

      Qualifiers

      • Research-article

      Conference

      KDD '10
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)46
      • Downloads (Last 6 weeks)8
      Reflects downloads up to 13 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Neural Contextual Bandits for Personalized RecommendationCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641241(1246-1249)Online publication date: 13-May-2024
      • (2023)Adversarial Group Linear Bandits and Its Application to Collaborative Edge InferenceIEEE INFOCOM 2023 - IEEE Conference on Computer Communications10.1109/INFOCOM53939.2023.10228900(1-10)Online publication date: 17-May-2023
      • (2023)Federated Linear Bandit Learning via Over-the-air ComputationGLOBECOM 2023 - 2023 IEEE Global Communications Conference10.1109/GLOBECOM54140.2023.10437441(1363-1368)Online publication date: 4-Dec-2023
      • (2022)Communication efficient federated learning for generalized linear banditsProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3603053(38411-38423)Online publication date: 28-Nov-2022
      • (2022)Communication efficient distributed learning for kernelized contextual banditsProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3601707(19773-19785)Online publication date: 28-Nov-2022
      • (2022)A simple and provably efficient algorithm for asynchronous federated contextual linear banditsProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3600614(4762-4775)Online publication date: 28-Nov-2022
      • (2022)Intelligent Control of Groundwater in Slopes with Deep Reinforcement LearningSensors10.3390/s2221850322:21(8503)Online publication date: 4-Nov-2022
      • (2022)Dynamic Global Sensitivity for Differentially Private Contextual BanditsProceedings of the 16th ACM Conference on Recommender Systems10.1145/3523227.3546781(179-187)Online publication date: 12-Sep-2022
      • (2022)Textflow: Toward Supporting Screen-free Manipulation of Situation-Relevant Smart MessagesACM Transactions on Interactive Intelligent Systems10.1145/351926312:4(1-29)Online publication date: 5-Nov-2022
      • (2021)TSIProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3481957(4036-4045)Online publication date: 26-Oct-2021
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media