Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Exponentially weighted moving average charts for detecting concept drift

Published: 01 January 2012 Publication History

Abstract

Classifying streaming data requires the development of methods which are computationally efficient and able to cope with changes in the underlying distribution of the stream, a phenomenon known in the literature as concept drift. We propose a new method for detecting concept drift which uses an exponentially weighted moving average (EWMA) chart to monitor the misclassification rate of an streaming classifier. Our approach is modular and can hence be run in parallel with any underlying classifier to provide an additional layer of concept drift detection. Moreover our method is computationally efficient with overhead O(1) and works in a fully online manner with no need to store data points in memory. Unlike many existing approaches to concept drift detection, our method allows the rate of false positive detections to be controlled and kept constant over time.

References

[1]
Paired learners for concept drift. In: ICDM'08: Proc. 2008 Eighth IEEE Internat. Conf. on Data Mining, IEEE Computer Society, Washington, DC, USA. pp. 23-32.
[2]
Baena-Garcia, M., Campo-Avila, J., Fidalgo, R., 2006. Early drift detection. In: Advances in Aritificial Intelligence, Lecture Notes Artificial Intelligence, vol. 3171, pp. 286-295.
[3]
Detection of Abrupt Change Theory and Application. Prentice Hall.
[4]
Bell, C., Gordon, L., Pollak, M., 1994. An efficient nonparametric detection scheme and its application to surveillance of a Bernoulli process with unknown baseline. In: Change Point Problem, vol. 23, pp. 7-27.
[5]
Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput. v10. 1895-1923.
[6]
A general framework for mining massive data stream. J. Comput. Graph. Statist. v12. 2003
[7]
Gama, J., Medas, P., Castillo, G., Rodrigues, P., 2004. Learning with drift detection. In: Advances in Artificial Intelligence, Lecture Notes in Artificial Intelligence, vol. 3171, pp. 286-295.
[8]
Statistical and structural approaches to texture. Proc. IEEE. v67 i5. 786-804.
[9]
Harries, M., 1999. Splice-2 comparative evaluation: Electricity pricing. Tech. rep., The University of New South Wales.
[10]
The Elements of Statistical Learning. Springer.
[11]
Image recognition and neuronal networks: Intelligent systems for the improvement of imaging information. Minimal Invasive Therapy Allied Technol. v9 i3-4. 225-230.
[12]
Klinkenberg, R., Joachims, T., 2000. Detecting concept drift with support vector machines. In: Proc. 17th Internat. Conf. on Machine Learning. ICML '00, pp. 487-494.
[13]
Dynamic weighted majority: An ensemble method for driffting concepts. J. Machine Learn. Res. v8. 2755-2790.
[14]
Kuncheva, L.I., 2009. Using control charts for detecting concept change in streaming data. Tech. rep., Bangor University.
[15]
Kuncheva, L.I., Plumpton, C.O., 2008. Adaptive learning rate for online linear discriminant classifiers. In: Structural, Syntactic, and Statistical Pattern Recognition, Lecture Notes in Computer Science, vol. 5342, pp. 510-519.
[16]
Law, Y.-N., Zaniolo, C., 2005. An adaptive nearest neighbor classification algorithm for data streams. In: PKDD, pp. 108-120.
[17]
A simple cumulative sum type statistic for the change-point problem with zero-one observations. Biometrika. v67 i1. 79-84.
[18]
A CUSUM chart for monitoring a proportion when inspecting continuously. J. Qual. Technol. v31 i1. 87-108.
[19]
Control chart tests based on geometric moving averages. Technometrics. v42 i1. 97-101.
[20]
CUSUM charts for signalling varying location shifts. J. Qual. Technol. v32.
[21]
Adaptive threshold computation for CUSUM-type procedures in change detection and isolation problems. Comput. Statist. Data Anal. v52 i9. 4161-4174.
[22]
A comparison of state-of-the-art classification techniques for expert automobile insurance claim fraud detection. J. Risk Insurance. v69. 373-421.
[23]
Learning in the presence of concept drift and hidden contexts. Machine Learn. v23. 69-101.
[24]
EWMA control charts for monitoring high-yield processes based on non-transformed observations. Internat. J. Prod. Res. v46 i20. 5679-5699.

Cited By

View all
  • (2025)Modyn: Data-Centric Machine Learning Pipeline OrchestrationProceedings of the ACM on Management of Data10.1145/37097053:1(1-30)Online publication date: 11-Feb-2025
  • (2024)Concept Drift Adaptation in Text Stream Mining Settings: A Systematic ReviewACM Transactions on Intelligent Systems and Technology10.1145/370492216:2(1-67)Online publication date: 21-Nov-2024
  • (2024)Type-LDD: A Type-Driven Lite Concept Drift Detector for Data StreamsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.334460236:12(9476-9489)Online publication date: 1-Dec-2024
  • Show More Cited By
  1. Exponentially weighted moving average charts for detecting concept drift

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Pattern Recognition Letters
      Pattern Recognition Letters  Volume 33, Issue 2
      January, 2012
      125 pages

      Publisher

      Elsevier Science Inc.

      United States

      Publication History

      Published: 01 January 2012

      Author Tags

      1. Change detection
      2. Concept drift
      3. Streaming classification

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 05 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Modyn: Data-Centric Machine Learning Pipeline OrchestrationProceedings of the ACM on Management of Data10.1145/37097053:1(1-30)Online publication date: 11-Feb-2025
      • (2024)Concept Drift Adaptation in Text Stream Mining Settings: A Systematic ReviewACM Transactions on Intelligent Systems and Technology10.1145/370492216:2(1-67)Online publication date: 21-Nov-2024
      • (2024)Type-LDD: A Type-Driven Lite Concept Drift Detector for Data StreamsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.334460236:12(9476-9489)Online publication date: 1-Dec-2024
      • (2024)Online Concept Drift DetectorProcedia Computer Science10.1016/j.procs.2024.05.140237:C(559-567)Online publication date: 24-Jul-2024
      • (2024)Evolving cybersecurity frontiersEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109143137:PAOnline publication date: 1-Nov-2024
      • (2024)Robust online active learning with cluster-based local drift detection for unbalanced imperfect dataApplied Soft Computing10.1016/j.asoc.2024.112051165:COnline publication date: 1-Nov-2024
      • (2024)Claims fraud detection with uncertain labelsAdvances in Data Analysis and Classification10.1007/s11634-023-00568-018:1(219-243)Online publication date: 1-Mar-2024
      • (2024)SETL: a transfer learning based dynamic ensemble classifier for concept drift detection in streaming dataCluster Computing10.1007/s10586-023-04149-w27:3(3417-3432)Online publication date: 1-Jun-2024
      • (2024)Comparison based analysis of window approach for concept drift detection and adaptationApplied Intelligence10.1007/s10489-024-05890-455:1Online publication date: 27-Nov-2024
      • (2024)A rehearsal framework for computational efficiency in online continual learningApplied Intelligence10.1007/s10489-024-05493-z54:8(6383-6399)Online publication date: 1-Apr-2024
      • Show More Cited By

      View Options

      View options

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media