Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Accurate parameter estimation for Bayesian network classifiers using hierarchical Dirichlet processes

Published: 01 September 2018 Publication History

Abstract

This paper introduces a novel parameter estimation method for the probability tables of Bayesian network classifiers (BNCs), using hierarchical Dirichlet processes (HDPs). The main result of this paper is to show that improved parameter estimation allows BNCs to outperform leading learning methods such as random forest for both 0---1 loss and RMSE, albeit just on categorical datasets. As data assets become larger, entering the hyped world of "big", efficient accurate classification requires three main elements: (1) classifiers with low-bias that can capture the fine-detail of large datasets (2) out-of-core learners that can learn from data without having to hold it all in main memory and (3) models that can classify new data very efficiently. The latest BNCs satisfy these requirements. Their bias can be controlled easily by increasing the number of parents of the nodes in the graph. Their structure can be learned out of core with a limited number of passes over the data. However, as the bias is made lower to accurately model classification tasks, so is the accuracy of their parameters' estimates, as each parameter is estimated from ever decreasing quantities of data. In this paper, we introduce the use of HDPs for accurate BNC parameter estimation even with lower bias. We conduct an extensive set of experiments on 68 standard datasets and demonstrate that our resulting classifiers perform very competitively with random forest in terms of prediction, while keeping the out-of-core capability and superior classification time.

Cited By

View all
  • (2023)Research on Bayesian Network garbage classification based on multi-source information fusionProceedings of the 2023 7th International Conference on Big Data and Internet of Things10.1145/3617695.3617714(142-149)Online publication date: 11-Aug-2023
  • (2022)Simulation of Logistics Delay in Bayesian Network Control Based on Genetic EM AlgorithmComputational Intelligence and Neuroscience10.1155/2022/69814502022Online publication date: 1-Jan-2022
  • (2022)Research on a dynamic full Bayesian classifier for time-series data with insufficient informationApplied Intelligence10.1007/s10489-021-02448-652:1(1059-1075)Online publication date: 1-Jan-2022
  • Show More Cited By
  1. Accurate parameter estimation for Bayesian network classifiers using hierarchical Dirichlet processes

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Machine Language
      Machine Language  Volume 107, Issue 8-10
      September 2018
      428 pages

      Publisher

      Kluwer Academic Publishers

      United States

      Publication History

      Published: 01 September 2018

      Author Tags

      1. Bayesian network
      2. Classification
      3. Dirichlet processes
      4. Graphical models
      5. Parameter estimation
      6. Smoothing

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 13 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Research on Bayesian Network garbage classification based on multi-source information fusionProceedings of the 2023 7th International Conference on Big Data and Internet of Things10.1145/3617695.3617714(142-149)Online publication date: 11-Aug-2023
      • (2022)Simulation of Logistics Delay in Bayesian Network Control Based on Genetic EM AlgorithmComputational Intelligence and Neuroscience10.1155/2022/69814502022Online publication date: 1-Jan-2022
      • (2022)Research on a dynamic full Bayesian classifier for time-series data with insufficient informationApplied Intelligence10.1007/s10489-021-02448-652:1(1059-1075)Online publication date: 1-Jan-2022
      • (2020)Bayesian network classifiers using ensembles and smoothingKnowledge and Information Systems10.1007/s10115-020-01458-z62:9(3457-3480)Online publication date: 1-Sep-2020
      • (2020)Hierarchical Gradient Smoothing for Probability Estimation TreesAdvances in Knowledge Discovery and Data Mining10.1007/978-3-030-47426-3_18(222-234)Online publication date: 11-May-2020
      • (2019)IAPSO-AIRSJournal of Medical Systems10.1007/s10916-019-1343-043:7(1-23)Online publication date: 1-Jul-2019

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media