Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2213836.2213867acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Prediction-based geometric monitoring over distributed data streams

Published: 20 May 2012 Publication History

Abstract

Many modern streaming applications, such as online analysis of financial, network, sensor and other forms of data are inherently distributed in nature. An important query type that is the focal point in such application scenarios regards actuation queries, where proper action is dictated based on a trigger condition placed upon the current value that a monitored function receives. Recent work studies the problem of (non-linear) sophisticated function tracking in a distributed manner. The main concept behind the geometric monitoring approach proposed there, is for each distributed site to perform the function monitoring over an appropriate subset of the input domain. In the current work, we examine whether the distributed monitoring mechanism can become more efficient, in terms of the number of communicated messages, by extending the geometric monitoring framework to utilize prediction models. We initially describe a number of local estimators (predictors) that are useful for the applications that we consider and which have already been shown particularly useful in past work. We then demonstrate the feasibility of incorporating predictors in the geometric monitoring framework and show that prediction-based geometric monitoring in fact generalizes the original geometric monitoring framework. We propose a large variety of different prediction-based monitoring models for the distributed threshold monitoring of complex functions. Our extensive experimentation with a variety of real data sets, functions and parameter settings indicates that our approaches can provide significant communication savings ranging between two times and up to three orders of magnitude, compared to the transmission cost of the original monitoring framework.

References

[1]
B. Babcock and C. Olston. Distributed top-k monitoring. In SIGMOD, 2003.
[2]
G. Cormode and M. Garofalakis. Sketching streams through the net: Distributed approximate query tracking. In VLDB, 2005.
[3]
G. Cormode and M. Garofalakis. Streaming in a connected world: querying and tracking distributed data streams. In SIGMOD, 2007.
[4]
G. Cormode and M. Garofalakis. Approximate continuous querying over distributed streams. ACM Transactions on Database Systems, 33(2), 2008.
[5]
G. Cormode, M. Garofalakis, S. Muthukrishnan, and R. Rastogi. Holistic aggregates in a networked world: distributed tracking of approximate quantiles. In SIGMOD, 2005.
[6]
G. Cormode, S. Muthukrishnan, and K. Yi. Algorithms for distributed functional monitoring. ACM Trans. Algorithms, 7:21:1--21:20, 2011.
[7]
G. Cormode, S. Muthukrishnan, and W. Zhuang. Conquering the divide: Continuous clustering of distributed data streams. In ICDE, 2007.
[8]
A. Das, S. Ganguly, M. Garofalakis, and R. Rastogi. Distributed set-expression cardinality estimation. In VLDB, 2004.
[9]
A. Deligiannakis, Y. Kotidis, and N. Roussopoulos. Compressing Historical Information in Sensor Networks. In ACM SIGMOD, 2004.
[10]
L. Huang, M. Garofalakis, J. Hellerstein, A. Joseph, and N. Taft. Toward sophisticated detection with distributed triggers. In MineNet, 2006.
[11]
L. Huang, X. Nguyen, M. Garofalakis, and J. M. Hellerstein. Communication-efficient online detection of network-wide anomalies. In INFOCOM, 2007.
[12]
A. Jain, J. M. Hellestein, S. Ratnasamy, and D. Wetherall. A wakeup call for internet monitoring systems: The case for distributed triggers. In HotNets, 2004.
[13]
R. Keralapura, G. Cormode, and J. Ramamirtham. Communication-efficient distributed monitoring of thresholded counts. In SIGMOD, 2006.
[14]
D. D. Lewis, Y. Yang, T. G. Rose, and F. Li. Rcv1: A new benchmark collection for text categorization research. Journal of Machine Learning Research, 5:361--397, 2004.
[15]
S. R. Madden, M. J. Franklin, J. M. Hellerstein, and W. Hong. Tinydb: an acquisitional query processing system for sensor networks. ACM Trans. Database Syst., 30:122--173, March 2005.
[16]
C. Olston, J. Jiang, and J. Widom. Adaptive filters for continuous queries over distributed data streams. In SIGMOD, 2003.
[17]
G. Sagy, D. Keren, I. Sharfman, and A. Schuster. Distributed threshold querying of general functions by a difference of monotonic representation. PVLDB Endow., 4:46--57, 2010.
[18]
I. Sharfman, A. Schuster, and D. Keren. A geometric approach to monitoring threshold functions over distributed data streams. In SIGMOD, 2006.
[19]
I. Sharfman, A. Schuster, and D. Keren. Aggregate threshold queries in sensor networks. In IPDPS, 2007.
[20]
I. Sharfman, A. Schuster, and D. Keren. A geometric approach to monitoring threshold functions over distributed data streams. ACM Transactions on Database Systems, 32(4), 2007.
[21]
I. Sharfman, A. Schuster, and D. Keren. Shape sensitive geometric monitoring. In PODS, 2008.
[22]
K. Yi and Q. Zhang. Optimal tracking of distributed heavy hitters and quantiles. In PODS, 2009.
[23]
Q. Zhang, J. Liu, and W. Wang. Approximate clustering on distributed data streams. In ICDE, 2008.

Cited By

View all
  • (2022)AutoMon: Automatic Distributed Monitoring for Arbitrary Multivariate FunctionsProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3517866(310-324)Online publication date: 10-Jun-2022
  • (2022)Approaches to Uncertainty Quantification in Federated Deep LearningMachine Learning and Principles and Practice of Knowledge Discovery in Databases10.1007/978-3-030-93736-2_12(128-145)Online publication date: 17-Feb-2022
  • (2021)An Optimization Approach to Multi-Sensor Operation for Multi-Context RecognitionSensors10.3390/s2120686221:20(6862)Online publication date: 15-Oct-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '12: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
May 2012
886 pages
ISBN:9781450312479
DOI:10.1145/2213836
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 May 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. continuous distributed monitoring
  2. data streams
  3. prediction models

Qualifiers

  • Research-article

Conference

SIGMOD/PODS '12
Sponsor:

Acceptance Rates

SIGMOD '12 Paper Acceptance Rate 48 of 289 submissions, 17%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)AutoMon: Automatic Distributed Monitoring for Arbitrary Multivariate FunctionsProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3517866(310-324)Online publication date: 10-Jun-2022
  • (2022)Approaches to Uncertainty Quantification in Federated Deep LearningMachine Learning and Principles and Practice of Knowledge Discovery in Databases10.1007/978-3-030-93736-2_12(128-145)Online publication date: 17-Feb-2022
  • (2021)An Optimization Approach to Multi-Sensor Operation for Multi-Context RecognitionSensors10.3390/s2120686221:20(6862)Online publication date: 15-Oct-2021
  • (2021)Optimization of threshold functions over streamsProceedings of the VLDB Endowment10.14778/3447689.344769314:6(878-889)Online publication date: 1-Feb-2021
  • (2021)A Distance-Based Scheme for Reducing Bandwidth in Distributed Geometric Monitoring2021 IEEE 37th International Conference on Data Engineering (ICDE)10.1109/ICDE51399.2021.00105(1164-1175)Online publication date: Apr-2021
  • (2020)Learning Based Distributed TrackingProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3394486.3403255(2040-2050)Online publication date: 23-Aug-2020
  • (2019)Efficient, Consistent Distributed Computation with Predictive TreatiesProceedings of the Fourteenth EuroSys Conference 201910.1145/3302424.3303987(1-16)Online publication date: 25-Mar-2019
  • (2019)Continuous Monitoring meets Synchronous Transmissions and In-Network Aggregation2019 15th International Conference on Distributed Computing in Sensor Systems (DCOSS)10.1109/DCOSS.2019.00043(157-166)Online publication date: May-2019
  • (2018)Lightweight Monitoring of Distributed StreamsACM Transactions on Database Systems10.1145/322611343:2(1-37)Online publication date: 31-Jul-2018
  • (2018)Geometric Monitoring in Action: a Systems Perspective for the Internet of Things2018 IEEE 43rd Conference on Local Computer Networks (LCN)10.1109/LCN.2018.8638079(433-436)Online publication date: Oct-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media