Article

Learning class-discriminative dynamic Bayesian networks

Authors:

Terran LaneAuthors Info & Claims

ICML '05: Proceedings of the 22nd international conference on Machine learning

Pages 97 - 104

https://doi.org/10.1145/1102351.1102364

Published: 07 August 2005 Publication History

Abstract

In many domains, a Bayesian network's topological structure is not known a priori and must be inferred from data. This requires a scoring function to measure how well a proposed network topology describes a set of data. Many commonly used scores such as BD, BDE, BDEU, etc., are not well suited for class discrimination. Instead, scores such as the class-conditional likelihood (CCL) should be employed. Unfortunately, CCL does not decompose and its application to large domains is not feasible. We introduce a decomposable score, approximate conditional likelihood (ACL) that is capable of identifying class discriminative structures. We show that dynamic Bayesian networks (DBNs) trained with ACL have classification efficacies competitive to those trained with CCL on a set of simulated data experiments. We also show that ACL-trained DBNs outperform BDE-trained DBNs, Gaussian naïve Bayes networks and support vector machines within a neuroscience domain too large for CCL.

References

[1]

Bilmes, J. A. Dynamic Bayesian Multinets. (2000). Proceedings of the 16th conference on Uncertainty in Artificial Intelligence. pg. 38--45.

Digital Library

[2]

Bilmes, J., Zweig, G., Richardson, T., Filali, K., Livescu, K., Xu, P., Jackson, K., Brandman, Y., Sandness, E., Holtz, E., Torres, J., & Byrne, B. (2001). Discriminatively structured graphical models for speech recognition Tech. Report. Center for Language and Speech Processing, Johns Hopkins Univ., Baltimore, MD.

[3]

Buckner, R. L., Snyder, A., Sanders, A., Marcus, R., Morris, J. (2000). Functional Brain Imaging of Young, Nondemented, and Demented Older Adults. Journal of Cognitive Neuroscience, 12, 2. 24--34.

Digital Library

[4]

Burge, J. (2005) Dynamic Bayesian Networks: Class Discriminative Structure Search with an Application to Functional Magnetic Resonance Imaging Data. Tech Report. University of New Mexico. June, 2005.

[5]

Burge, J., Clark, V. P., Lane, T., Link, H., Qiu, S. (2004). Bayesian Classification of FMRI Data: Evidence for Altered Neural Networks in Dementia. In submission to Human Brain Mapping.

[6]

Buntine, W. (1991). Theory Refinement on Bayesian Networks. Proceedings of the Seventh Conference on UAI. 52--60.

Digital Library

[7]

Charniak, E. (1991). Bayesian Networks Without Tears. Al Magazine, 12 (4).

Digital Library

[8]

Chickering, D., Geiger, D., Heckerman, D. (1994). Learning Bayesian Networks is NP-Hard (Technical Report MSR-TR-94-17). Microsoft.

[9]

Cooper, G., Herskovits, E. (1992). A Bayesian method for the induction of probabilistic networks from data. Machine Learning, 9, 309--347.

Digital Library

[10]

Duda, R. O., Hart, P. E. (1973). Pattern classification and scene analysis. New York, NY: Wiley.

[11]

Geiger, D., Heckerman, D. (1996). Knowledge representation and inference in similarity networks and Bayesian Multinets. Artificial Intelligence. v. 82, pg. 45--74.

Digital Library

[12]

Greiner, R., Zhou, W. (2002). Structural extension to logistic regression: Discriminative parameter learning of belief net classifiers. Proc. 18th Natl. Conf. On Artificial Intelligence. pg. 167--173.

Digital Library

[13]

Grossman, D., Domingos, P. (2004). Learning Bayesian Network Classifiers by Maximizing Conditional Likelihood. International Conference on Machine Learning, 21. 361--368

Digital Library

[14]

Heckerman, D. (1991). Probability Similarity Networks. MIT Press, 1991.

Digital Library

[15]

Heckerman, D., Geiger, D., Chickering, D. M. (1995). Learning Bayesian Networks: the Combination of Knowledge and Statistical Data. Machine Learning, 20, 197--243.

Digital Library

[16]

Jensen, F. V., (2001). Bayesian Networks and Decision Graphs. Springer-Verlag, New York.

Digital Library

[17]

Lancaster J. L., Woldorff M. G., Parsons L. M., Liotti M., Freitas C. S., Rainey L., Kochunov PV, Nickerson D., Mikiten S. A., Fox P. T. (2000). Automated Talairach Atlas labels for functional brain mapping. Human Brain Mapping 10, 120--131.

[18]

Murphy, K. (2002). Dynamic Bayesian Networks: Representation, Inference and Learning. PhD dissertation, Berkeley, University of California, Computer Science Division.

Digital Library

[19]

Pearl, J. (1986) Fusion, Propagation and Structuring in Belief Networks. Artificial Intelligence, v. 29, n. 3, 241--288.

Digital Library

[20]

Sheskin, D. J. (2003). Handbook of Parameter and Nonparametric Statistical Procedures, 3rd Edition. Chapman & Hall/CRC. QA276.25.S54 2003.

Cited By

Bielza CLarrañaga P(2014)Discrete Bayesian Network ClassifiersACM Computing Surveys10.1145/257686847:1(1-43)Online publication date: 14-Jul-2014
https://dl.acm.org/doi/10.1145/2576868
Drugan MWiering M(2010)Feature selection for Bayesian network classifiers using the MDL-FS scoreInternational Journal of Approximate Reasoning10.1016/j.ijar.2010.02.00151:6(695-717)Online publication date: 1-Jul-2010
https://dl.acm.org/doi/10.1016/j.ijar.2010.02.001
Li NYu YZhou Z(2009)Semi-naive Exploitation of One-Dependence EstimatorsProceedings of the 2009 Ninth IEEE International Conference on Data Mining10.1109/ICDM.2009.64(278-287)Online publication date: 6-Dec-2009
https://dl.acm.org/doi/10.1109/ICDM.2009.64
Show More Cited By

Recommendations

Discriminative Parameter Learning of General Bayesian Network Classifiers
ICTAI '03: Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence

Greiner and Zhou [1] presented ELR, a discriminative parameter-learning algorithm that maximizes conditional likelihood (CL) for a fixed Bayesian Belief Network (BN) structure, and demonstrated that it often produces classifiers that are more accurate ...
Bayesian learning of Bayesian networks with informative priors

This paper presents and evaluates an approach to Bayesian model averaging where the models are Bayesian nets (BNs). A comprehensive study of the literature on structural priors for BNs is conducted. A number of prior distributions are defined using ...
Structural learning of mixed noisy-OR Bayesian networks
Abstract
In this paper we discuss learning Bayesian networks whose conditional probability tables are either Noisy-OR models or general conditional probability tables. We refer to these models as Mixed Noisy-OR Bayesian Networks. To learn their ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICML '05: Proceedings of the 22nd international conference on Machine learning

August 2005

1113 pages

ISBN:1595931805

DOI:10.1145/1102351

General Chair:
Saso Dzeroski
Jozef Stefan Institute, Slovenia
,
Program Chairs:
Luc De Raedt,
Stefan Wrobel

Copyright © 2005 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2005

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Article

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

9
Total Citations
View Citations
376
Total Downloads

Downloads (Last 12 months)9
Downloads (Last 6 weeks)0

Reflects downloads up to 21 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Bielza CLarrañaga P(2014)Discrete Bayesian Network ClassifiersACM Computing Surveys10.1145/257686847:1(1-43)Online publication date: 14-Jul-2014
https://dl.acm.org/doi/10.1145/2576868
Drugan MWiering M(2010)Feature selection for Bayesian network classifiers using the MDL-FS scoreInternational Journal of Approximate Reasoning10.1016/j.ijar.2010.02.00151:6(695-717)Online publication date: 1-Jul-2010
https://dl.acm.org/doi/10.1016/j.ijar.2010.02.001
Li NYu YZhou Z(2009)Semi-naive Exploitation of One-Dependence EstimatorsProceedings of the 2009 Ninth IEEE International Conference on Data Mining10.1109/ICDM.2009.64(278-287)Online publication date: 6-Dec-2009
https://dl.acm.org/doi/10.1109/ICDM.2009.64
Kim DBurge JLane TPearlson GKiehl KCalhoun V(2008)Hybrid ICA–Bayesian network approach reveals distinct effective connectivity differences in schizophreniaNeuroImage10.1016/j.neuroimage.2008.05.06542:4(1560-1568)Online publication date: Oct-2008
https://doi.org/10.1016/j.neuroimage.2008.05.065
Burge JLane T(2007)Shrinkage Estimator for Bayesian Network ParametersProceedings of the 18th European conference on Machine Learning10.1007/978-3-540-74958-5_10(67-78)Online publication date: 17-Sep-2007
https://dl.acm.org/doi/10.1007/978-3-540-74958-5_10
Burge JLane TLink HQiu SClark V(2007)Discrete dynamic Bayesian network analysis of fMRI dataHuman Brain Mapping10.1002/hbm.2049030:1(122-137)Online publication date: 7-Nov-2007
https://doi.org/10.1002/hbm.20490
Lee YKang JKang BRyu K(2006)Bayesian Sampling of Virtual Examples to Improve Classification Accuracy2006 SICE-ICASE International Joint Conference10.1109/SICE.2006.315740(1009-1014)Online publication date: Oct-2006
https://doi.org/10.1109/SICE.2006.315740
Lee YKang JKang BRyu K(2006)Sampling of virtual examples to improve classification accuracy for nominal attribute dataProceedings of the 5th international conference on Rough Sets and Current Trends in Computing10.1007/11908029_66(637-646)Online publication date: 6-Nov-2006
https://dl.acm.org/doi/10.1007/11908029_66
Burge JLane T(2006)Improving bayesian network structure search with random variable aggregation hierarchiesProceedings of the 17th European conference on Machine Learning10.1007/11871842_11(66-77)Online publication date: 18-Sep-2006
https://dl.acm.org/doi/10.1007/11871842_11

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents