Exponentially weighted moving average charts for detecting concept drift

Published: 01 January 2012

Abstract

Classifying streaming data requires methods that are computationally efficient and able to cope with changes in the underlying distribution of the stream, a phenomenon known in the literature as concept drift. We propose a new method for detecting concept drift which uses an exponentially weighted moving average (EWMA) chart to monitor the misclassification rate of a streaming classifier. Our approach is modular and can hence be run in parallel with any underlying classifier to provide an additional layer of concept drift detection. Moreover, our method is computationally efficient, with O(1) overhead, and works in a fully online manner with no need to store data points in memory. Unlike many existing approaches to concept drift detection, our method allows the rate of false positive detections to be controlled and kept constant over time.
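The abstract does not state the chart's update equations, so the following is only a minimal sketch of a textbook EWMA chart applied to a stream of 0/1 classification errors, not the authors' exact procedure. The names `EWMADriftDetector`, `lam`, `control_limit`, `warmup` and `first_alarm` are hypothetical, and the fixed control limit is an assumption: the paper's mechanism for keeping the false positive rate constant over time is not reproduced here. The sketch does illustrate the two properties the abstract claims, namely constant work per observation and no stored data points.

```python
import math


class EWMADriftDetector:
    """Textbook EWMA chart over a 0/1 error stream (illustrative sketch only)."""

    def __init__(self, lam=0.2, control_limit=3.0, warmup=30):
        self.lam = lam                      # EWMA weight on the newest observation (assumed value)
        self.control_limit = control_limit  # fixed limit width in std. deviations (assumption)
        self.warmup = warmup                # observations to see before alarms are allowed
        self.reset()

    def reset(self):
        """Forget everything; intended to be called after the classifier is retrained."""
        self.t = 0        # number of observations seen
        self.p_hat = 0.0  # running estimate of the pre-change error rate
        self.z = 0.0      # EWMA statistic, started at 0 for simplicity

    def update(self, error):
        """Consume one 0/1 error indicator in O(1) time; return True on a drift alarm."""
        self.t += 1
        # Running mean of all errors seen so far (no data points are stored).
        self.p_hat += (error - self.p_hat) / self.t
        # EWMA recursion: recent errors get exponentially more weight.
        self.z = (1.0 - self.lam) * self.z + self.lam * error
        # Standard deviation of the EWMA of i.i.d. Bernoulli(p_hat) observations.
        var_z = (
            self.p_hat * (1.0 - self.p_hat)
            * self.lam / (2.0 - self.lam)
            * (1.0 - (1.0 - self.lam) ** (2 * self.t))
        )
        threshold = self.p_hat + self.control_limit * math.sqrt(var_z)
        return self.t > self.warmup and self.z > threshold


def first_alarm(errors, detector):
    """Return the 0-based index of the first alarm, or None if there is none."""
    for i, e in enumerate(errors):
        if detector.update(e):
            return i
    return None


if __name__ == "__main__":
    # Deterministic toy stream: 10% errors for 1000 steps, then 50% errors.
    before = [1 if i % 10 == 9 else 0 for i in range(1000)]
    after = [1 if i % 2 == 1 else 0 for i in range(500)]
    print("first alarm at index:", first_alarm(before + after, EWMADriftDetector()))
```

Because the detector only consumes the error indicator, it can sit beside any classifier: feed it `int(prediction != label)` after each labelled example and call `reset()` once the classifier has been rebuilt. Note that the running mean `p_hat` keeps absorbing post-change errors in this sketch, which is one reason a fixed `control_limit` should be treated as a placeholder.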


Published In

Pattern Recognition Letters, Volume 33, Issue 2
January 2012
125 pages

Publisher

Elsevier Science Inc., United States

Author Tags

1. Change detection
2. Concept drift
3. Streaming classification
