Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2462130.2462132acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

An ensemble clustering model for mining concept drifting stream data in emergency management

Published: 12 August 2012 Publication History

Abstract

Mining data streams with concept drifts is always an important and challenge task for researchers in both application and theory areas, such as emergency management. Because of requiring massive training data with labels, it is a hard and time costing work for existing (ensemble) classical models, sometimes even impossible. Aim to resolve this issue, in this paper; we propose an ensemble clustering model for mining concept drifting stream data in emergency management. Motivated by classifiers, the model will mine the data in two steps: "training" and "testing", just with a small training set. According to the experiment, the results demonstrate the effect and performance of the proposed model in mining data streams with concept drifts.

References

[1]
J. Beringer and E. Hüllermeier. Online clustering of parallel data streams. Data and Knowledge Engineering, 58(2):180--204, August 2006.
[2]
M. Brun, C. Sima, J. Hua, J. Lowey, B. Carroll, E. Suh, and E. R. Dougherty. Model-based evaluation of clustering validation measures. Pattern Recognition, 40(3):807--824, 2007.
[3]
J. Figueira, S. Greco, M. Ehrogott, J.-P. Brans, and B. Mareschal. Promethee methods. In Multiple Criteria Decision Analysis: State of the Art Surveys, volume 78 of International Series in Operations Research and Management Science, pages 163--186. Springer New York, 2005.
[4]
M. Halkidi, Y. Batistakis, and M. Vazirgiannis. Cluster validity methods: part i. SIGMOD Rec., 31(2):40--45, 2002.
[5]
M. Halkidi, Y. Batistakis, and M. Vazirgiannis. Clustering validity checking methods: part ii. SIGMOD Rec., 31(3):19--27, sep 2002.
[6]
J. Han and M. Kamber. Data Mining: Concepts and Techniques, 2nd edition. Morgan Kaufmann, 2006.
[7]
L. Hubert and P. Arabie. Comparing partitions. Journal of Classification, 2:193--218, 1985. 10.1007/BF01908075.
[8]
A. K. Jain and R. C. Dubes. Algorithms for clustering data. Prentice-Hall, Inc., Upper Saddle River, NJ, USA, 1988.
[9]
B. Larsen and C. Aone. Fast and effective text mining using linear-time document clustering. KDD-99, pages 16--22, 1999.
[10]
Y. Peng, G. Kou, Y. Shi, and Z. Chen. A descriptive framework for the field of data mining and knowledge discovery. International Journal of Information Technology & Decision Making, 7(4):639--682, 2008.
[11]
Y. Peng, G. Kou, G. Wang, and Y. Shi. Famcdm: A fusion approach of mcdm methods to rank multiclass classification algorithms. Omega, 39(6):677--689, 2011.
[12]
Y. Peng, G. Kou, G. Wang, W. Wu, and Y. Shi. Ensemble of software defect predictors: an ahp-based evaluation method. International Journal of Information Technology & Decision Making, 10(1):187--206, 2011.
[13]
P. Tan, M. Steinbach, and V. Kumar. Introduction to Data Mining. Addison-Wesley, 2005.
[14]
S. Theodoridis and K. Koutroubas. Pattern recognition, Fourth edition. Academic Press, 2008.
[15]
A. Tsymbal. The problem of concept drift: Definitions and related work. Technical report, Department of Computer Science, Trinity College: Dublin, Ireland, 2004.
[16]
L. Vendramin, R. Campello, and E. Hruschka. Relative clustering validity criteria: A comparative overview. Statistical Analysis and Data Mining, 3(4):209--235, 2010.
[17]
H. Wang, W. Fan, P. S. Yu, and J. Han. Mining concept-drifting data streams using ensemble classifiers. Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 24--27, August 2003.
[18]
G. Widmer and M. Kubat. Learning in the presence of concept drift and hidden contexts. Machine Learning, 23(1):69--101, 1996.
[19]
P. Zhang, X. Zhu, J. Tan, and L. Guo. Classifier and cluster ensembles for mining concept drifting data streams. 2010 IEEE International Conference on Data Mining, pages 1175--1180, 2010.
[20]
Y. Zhang, Y. Peng, J. Li, and Y. Shi. A clustering validity model based on multiple criteria decision making. The Sixth Chinese Academy of Management Annual Meeting, 2011.
[21]
Y. Zhao, G. Karypis, and U. Fayyad. Hierarchical clustering algorithms for document datasets. Data Mining and Knowledge Discovery, 10(2):141--168, 2005.

Index Terms

  1. An ensemble clustering model for mining concept drifting stream data in emergency management

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DM-IKM '12: Proceedings of the Data Mining and Intelligent Knowledge Management Workshop
    August 2012
    55 pages
    ISBN:9781450315517
    DOI:10.1145/2462130
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. clustering validity
    2. concept drift
    3. emergency management
    4. ensemble clustering
    5. stream data

    Qualifiers

    • Research-article

    Funding Sources

    • Foxconn "ZhuoCai" Funds

    Conference

    KDD '12
    Sponsor:

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 192
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 08 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media