Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Online Concept Drift Detector: : Optimally Balancing Delay Detection, Runtime, Memory, and Accuracy.

Published: 24 July 2024 Publication History

Abstract

Online learning and real-time data processing are becoming increasingly vital across various domains such as sensor networks, banking, and telecommunications. A significant challenge in this context is concept drift, wherein the statistical properties of the data change over time. Traditional drift detectors often grapple with high memory usage, extended delay in detection, prolonged runtime, and accuracy inconsistencies. This paper introduces a novel Online Drift Detector that meticulously balances these four aspects. By processing data instance-by-instance, our proposed detector optimizes the trade-offs between delay detection, runtime, memory consumption, and accuracy. We incorporate a unique diversity calculation tailored for multi-label problems, ensuring swift drift detection with minimized memory usage and enhanced runtime efficiency. Comparative analyses reveal the dominance of our approach over contemporary drift detection techniques, particularly in the realms of memory efficiency, detection speed, and accuracy. This work substantially augments the field of online data stream processing by offering a refined strategy for timely and efficient concept drift detection across a myriad of applications.

References

[1]
B.R. Prasad, S. Agarwal, Stream data mining: platforms, algorithms, performance evaluators and research trends, International journal of database theory and application 9 (9) (2016) 201–218.
[2]
O.A. Mahdi, E. Pardede, N. Ali, KAPPA as drift detector in data stream mining, Procedia Computer Science 184 (2021) 314–321.
[3]
O.A. Mahdi, Diversity Measures as New Concept Drift Detection Methods in Data Stream Mining, La Trobe (2021).
[4]
P.E. Hart, D.G. Stork, R.O. Duda, Pattern classification, Wiley, Hoboken, 2000.
[5]
J. Gao, W. Fan, J. Han, P.S. Yu, A general framework for mining concept-drifting data streams with skewed distributions, in: Proceedings of the 2007 siam international conference on data mining, SIAM, 2007, pp. 3–14.
[6]
I. Žliobaitė, Learning under concept drift: an overview, arXiv preprint (2010) arXiv:1010.4784.
[7]
O.A. Mahdi, E. Pardede, J. Cao, Combination of information entropy and ensemble classification for detecting concept drift in data stream, in: Proceedings of the Australasian Computer Science Week Multiconference, 2018, pp. 1–5.
[8]
S. Wares, J. Isaacs, and E. Elyan, "Data stream mining: methods and challenges for handling concept drift," SN Applied Sciences, vol. 1, pp. 1-19, 2019.
[9]
J. Gama, P. Medas, G. Castillo, P. Rodrigues, Learning with drift detection, Advances in Artificial Intelligence–SBIA 2004: 17th Brazilian Symposium on Artificial Intelligence, 17, Springer, Sao Luis, Maranhao, Brazil, 2004, pp. 286–295. September 29-Ocotber 1, 2004. Proceedings.
[10]
O.A. Mahdi, E. Pardede, N. Ali, J. Cao, Fast reaction to sudden concept drift in the absence of class labels, Applied Sciences 10 (2) (2020) 606.
[11]
A. Pesaranghader, H.L. Viktor, Fast hoeffding drift detection method for evolving data streams, in: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2016, Springer, Riva del Garda, Italy, 2016, pp. 96–111. September 19-23, 2016, Proceedings, Part II 16.
[12]
A. Bifet, R. Gavalda, Learning from time-changing data with adaptive windowing, in: Proceedings of the 2007 SIAM international conference on data mining, SIAM, 2007, pp. 443–448.
[13]
I. Frias-Blanco, J. del Campo-Ávila, G. Ramos-Jimenez, R. Morales-Bueno, A. Ortiz-Díaz, Y. Caballero-Mota, Online and non-parametric drift detection methods based on Hoeffding's bounds, IEEE Transactions on Knowledge and Data Engineering 27 (3) (2014) 810–823.
[14]
G.J. Ross, N.M. Adams, D.K. Tasoulis, D.J. Hand, Exponentially weighted moving average charts for detecting concept drift, Pattern recognition letters 33 (2) (2012) 191–198.
[15]
A. Pesaranghader, H.L. Viktor, E. Paquet, McDiarmid drift detection methods for evolving data streams, in: 2018 International joint conference on neural networks (IJCNN), IEEE, 2018, pp. 1–9.
[16]
C. McDiarmid, On the method of bounded differences, Surveys in combinatorics 141 (1) (1989) 148–188.
[17]
D.T.J. Huang, Y.S. Koh, G. Dobbie, R. Pears, Detecting volatility shift in data streams, in: 2014 IEEE International Conference on Data Mining, IEEE, 2014, pp. 863–868.
[18]
K. Nishida, K. Yamauchi, Detecting concept drift using statistical testing, in: International conference on discovery science, Springer, 2007, pp. 264–269.
[19]
R.S. Barros, D.R. Cabral, P.M. Gonçalves Jr, S.G. Santos, RDDM: Reactive drift detection method, Expert Systems with Applications 90 (2017) 344–355.
[20]
O.A. Mahdi, E. Pardede, N. Ali, J. Cao, Diversity measure as a new drift detection method in data streaming, Knowledge-Based Systems 191 (2020).

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Procedia Computer Science
Procedia Computer Science  Volume 237, Issue C
2024
971 pages
ISSN:1877-0509
EISSN:1877-0509
Issue’s Table of Contents

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 24 July 2024

Author Tags

  1. Concept drift
  2. Diversity measure
  3. Data stream mining
  4. Non-stationary environments

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media