Nothing Special   »   [go: up one dir, main page]

skip to main content
article

A hybrid decision tree training method using data streams

Published: 01 November 2011 Publication History

Abstract

Classical classification methods usually assume that pattern recognition models do not depend on the timing of the data. However, this assumption is not valid in cases where new data frequently become available. Such situations are common in practice, for example, spam filtering or fraud detection, where dependencies between feature values and class numbers are continually changing. Unfortunately, most classical machine learning methods (such as decision trees) do not take into consideration the possibility of the model changing, as a result of so-called concept drift and they cannot adapt to a new classification model. This paper focuses on the problem of concept drift, which is a very important issue, especially in data mining methods that use complex structures (such as decision trees) for making decisions. We propose an algorithm that is able to co-train decision trees using a modified NGE (Nested Generalized Exemplar) algorithm. The potential for adaptation of the proposed algorithm and the quality thereof are evaluated through computer experiments, carried out on benchmark datasets from the UCI Machine Learning Repository.

Cited By

View all
  • (2019)Handling Unforeseen Failures Using Argumentation-Based Learning2019 IEEE 15th International Conference on Automation Science and Engineering (CASE)10.1109/COASE.2019.8843207(1699-1704)Online publication date: 22-Aug-2019
  • (2018)An Empirical Insight Into Concept Drift Detectors Ensemble Strategies2018 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC.2018.8477962(1-8)Online publication date: 8-Jul-2018
  • (2018)Addressing Local Class Imbalance in Balanced Datasets with Dynamic Impurity Decision TreesDiscovery Science10.1007/978-3-030-01771-2_1(3-17)Online publication date: 29-Oct-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Knowledge and Information Systems
Knowledge and Information Systems  Volume 29, Issue 2
November 2011
242 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 November 2011

Author Tags

  1. Concept drift
  2. Decision tree
  3. Incremental learning
  4. Nearest hyperrectangle
  5. Pattern recognition

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Handling Unforeseen Failures Using Argumentation-Based Learning2019 IEEE 15th International Conference on Automation Science and Engineering (CASE)10.1109/COASE.2019.8843207(1699-1704)Online publication date: 22-Aug-2019
  • (2018)An Empirical Insight Into Concept Drift Detectors Ensemble Strategies2018 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC.2018.8477962(1-8)Online publication date: 8-Jul-2018
  • (2018)Addressing Local Class Imbalance in Balanced Datasets with Dynamic Impurity Decision TreesDiscovery Science10.1007/978-3-030-01771-2_1(3-17)Online publication date: 29-Oct-2018
  • (2017)Paired feature multilayer ensemble – concept and evaluation of a classifierJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-16913932:2(1427-1436)Online publication date: 1-Jan-2017
  • (2017)Active and adaptive ensemble learning for online activity recognition from data streamsKnowledge-Based Systems10.1016/j.knosys.2017.09.032138:C(69-78)Online publication date: 15-Dec-2017
  • (2017)Fault diagnosis of marine 4-stroke diesel engines using a one-vs-one extreme learning ensembleEngineering Applications of Artificial Intelligence10.1016/j.engappai.2016.10.01557:C(134-141)Online publication date: 1-Jan-2017
  • (2017)Selecting locally specialised classifiers for one-class classification ensemblesPattern Analysis & Applications10.1007/s10044-015-0505-z20:2(427-439)Online publication date: 1-May-2017
  • (2016)Efficient Computation of the Tensor Chordal KernelsProcedia Computer Science10.1016/j.procs.2016.05.51180:C(1702-1711)Online publication date: 1-Jun-2016
  • (2016)GPU-Accelerated Extreme Learning Machines for Imbalanced Data Streams with Concept DriftProcedia Computer Science10.1016/j.procs.2016.05.50980:C(1692-1701)Online publication date: 1-Jun-2016
  • (2016)Decision tree induction with a constrained number of leaf nodesApplied Intelligence10.1007/s10489-016-0785-z45:3(673-685)Online publication date: 1-Oct-2016
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media