Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2339530.2339766acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
demonstration

DAGger: clustering correlated uncertain data (to predict asset failure in energy networks)

Published: 12 August 2012 Publication History

Abstract

DAGger is a clustering algorithm for uncertain data. In contrast to prior work, DAGger can work on arbitrarily correlated data and can compute both exact and approximate clusterings with error guarantees.
We demonstrate DAGger using a real-world scenario in which partial discharge data from UK Power Networks is clustered to predict asset failure in the energy network.

References

[1]
C. C. Aggarwal. Managing and Mining Uncertain Data, volume 35 of Advances in Database Systems. Kluwer, 2009.
[2]
C. M. Bishop. Pattern Recognition and Machine Learning. Springer-Verlag, 2006.
[3]
R. Cheng and S. Prabhakar. Managing uncertainty in sensor database. SIGMOD Rec., 32:41--46, 12 2003.
[4]
A. Deshpande, C. Guestrin, S. Madden, J. Hellerstein, and W. Hong. Model-driven data acquisition in sensor networks. In VLDB, 2004.
[5]
F. Gullo, G. Ponti, and A. Tagarelli. Clustering uncertain data via k-medoids. In SUM, 2008.
[6]
A. Jha and D. Suciu. Probabilistic databases with markoviews. In VLDB, 2012.
[7]
J. B. MacQueen. Some methods for classification and analysis of multivariate observations. In Berkeley Symp. on Mathematical Statistics and Probability, 1967.
[8]
M. Michel. Innovative asset management and targeted investments using on-line partial discharge monitoring and mapping techniques. In CED, 2007.
[9]
M. Michel and C. Eastham. Improving the management of MV underground cable circuits using automated on-line cable partial discharge mapping. In CEED, 2011.
[10]
B. Qin, Y. Xia, R. Sathyesh, S. Prabhakar, and Y. Tu. urule: A rule-based classification system for uncertain data. In ICDM Workshops, 2010.
[11]
M. Richardson and P. Domingos. Markov logic networks. Machine learning, 62(1):107--136, 2006.
[12]
S. Sathe, H. Jeung, and K. Aberer. Creating probabilistic databases from imprecise time-series data. In ICDE, 2011.
[13]
D. Suciu, D. Olteanu, C. Ré, and C. Koch. Probabilistic Databases. Morgan & Claypool Publishers, 2011.
[14]
L. Sun, R. Cheng, D. W. Cheung, and J. Cheng. Mining uncertain data with probabilistic guarantees. In KDD, 2010.
[15]
G. Taylor, D. Wallom, S. Grenard, A. Yunta Huete, and C. J. Axon. Recent developments towards novel high performance computing and communications solutions for smart distribution network operation. In ISGT, 2011.

Cited By

View all
  • (2015)Mining Frequent Itemsets in Correlated Uncertain DatabasesJournal of Computer Science and Technology10.1007/s11390-015-1555-930:4(696-712)Online publication date: 8-Jul-2015

Index Terms

  1. DAGger: clustering correlated uncertain data (to predict asset failure in energy networks)

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2012
    1616 pages
    ISBN:9781450314626
    DOI:10.1145/2339530
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. classification
    2. clustering
    3. correlations
    4. dagger
    5. partial discharge
    6. probabilistic data
    7. uncertain data

    Qualifiers

    • Demonstration

    Conference

    KDD '12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)Mining Frequent Itemsets in Correlated Uncertain DatabasesJournal of Computer Science and Technology10.1007/s11390-015-1555-930:4(696-712)Online publication date: 8-Jul-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media