Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1807167.1807277acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Forecasting high-dimensional data

Published: 06 June 2010 Publication History

Abstract

We propose a method for forecasting high-dimensional data (hundreds of attributes, trillions of attribute combinations) for a duration of several months. Our motivating application is guaranteed display advertising, a multi-billion dollar industry, whereby advertisers can buy targeted (high-dimensional) user visits from publishers many months or even years in advance. Forecasting high-dimensional data is challenging because of the many possible attribute combinations that need to be forecast. To address this issue, we propose a method whereby only a sub-set of attribute combinations are explicitly forecast and stored, while the other combinations are dynamically forecast on-the-fly using high-dimensional attribute correlation models. We evaluate various attribute correlation models, from simple models that assume the independence of attributes to more sophisticated sample-based models that fully capture the correlations in a high-dimensional space. Our evaluation using real-world display advertising data sets shows that fully capturing high-dimensional correlations leads to significant forecast accuracy gains. A variant of the proposed method has been implemented in the context of Yahoo!'s guaranteed display advertising system.

References

[1]
D. Agarwal and J. Tomlin. Optimal allocation of overlapping inventory in on-line advertising. Presented at INFORMS, 2008.
[2]
U. Feige, N. Immorlica, V. Mirrokni, and H. Nazerzadeh. A combinatorial allocation mechanism with penalties for banner advertising. Presented at INFORMS, 2008.
[3]
S. L. Lauritzen. Graphical Models. Oxford University Press, 1996.
[4]
P. E. O'Neil and D. Quass. Improved query performance with variant indexes. In J. Peckham, editor, SIGMOD Conference, pages 38--49. ACM Press, 1997.
[5]
R. H. Shumway and D. S. Stoffer. Time Series Analysis and Its Applications. Springer, 2007.
[6]
J. Tobias and A. Zellner. A note on aggregation,disaggregation and forecasting performance. Journal of Forecasting, 19:457--469.
[7]
A. M. West and J. Harrison. Bayesian Forecasting and Dynamic Models. Springer-Verlag, 1997.
[8]
K. Wu, E. Otoo, and A. Shoshani. Optimizing bitmap indices with efficient compression. ACM Transactions on Database Systems, 31:1--38, 2006.

Cited By

View all
  • (2024)Mystique: A Budget Pacing System for Performance Optimization in Online AdvertisingCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648342(433-442)Online publication date: 13-May-2024
  • (2021)Dimensional Inconsistency Measures and Postulates in Spatio-Temporal DatabasesJournal of Artificial Intelligence Research10.1613/jair.1.1243571(733-780)Online publication date: 3-Aug-2021
  • (2021)FlashPProceedings of the VLDB Endowment10.14778/3446095.344609614:5(721-729)Online publication date: 23-Mar-2021
  • Show More Cited By

Index Terms

  1. Forecasting high-dimensional data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '10: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
    June 2010
    1286 pages
    ISBN:9781450300322
    DOI:10.1145/1807167
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 06 June 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tag

    1. forecasting

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS '10
    Sponsor:
    SIGMOD/PODS '10: International Conference on Management of Data
    June 6 - 10, 2010
    Indiana, Indianapolis, USA

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)32
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Mystique: A Budget Pacing System for Performance Optimization in Online AdvertisingCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648342(433-442)Online publication date: 13-May-2024
    • (2021)Dimensional Inconsistency Measures and Postulates in Spatio-Temporal DatabasesJournal of Artificial Intelligence Research10.1613/jair.1.1243571(733-780)Online publication date: 3-Aug-2021
    • (2021)FlashPProceedings of the VLDB Endowment10.14778/3446095.344609614:5(721-729)Online publication date: 23-Mar-2021
    • (2020)Real-Time Device Reach Forecasting Using HLL and MinHash Data Sketches2020 7th International Conference on Soft Computing & Machine Intelligence (ISCMI)10.1109/ISCMI51676.2020.9311573(153-157)Online publication date: 14-Nov-2020
    • (2019)Improving Interaction in Integrated Chronic Care Management2019 IEEE 28th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)10.1109/WETICE.2019.00063(265-270)Online publication date: Jun-2019
    • (2016)GPU-Accelerated Bayesian Learning and Forecasting in Simultaneous Graphical Dynamic Linear ModelsBayesian Analysis10.1214/15-BA94611:1Online publication date: 1-Mar-2016
    • (2016)Predicting traffic of online advertising in real-time bidding systems from perspective of demand-side platforms2016 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2016.7841012(3491-3498)Online publication date: Dec-2016
    • (2016)Forecasting squatting of demand in display advertising2016 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2016.7840768(1587-1594)Online publication date: Dec-2016
    • (2014)Budget pacing for targeted online advertisements at LinkedInProceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/2623330.2623366(1613-1619)Online publication date: 24-Aug-2014
    • (2014)Repairs and Consistent Answers for Inconsistent Probabilistic Spatio-Temporal DatabasesProceedings of the 8th International Conference on Scalable Uncertainty Management - Volume 872010.1007/978-3-319-11508-5_22(265-279)Online publication date: 15-Sep-2014
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media