Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1102351.1102364acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Learning class-discriminative dynamic Bayesian networks

Published: 07 August 2005 Publication History

Abstract

In many domains, a Bayesian network's topological structure is not known a priori and must be inferred from data. This requires a scoring function to measure how well a proposed network topology describes a set of data. Many commonly used scores such as BD, BDE, BDEU, etc., are not well suited for class discrimination. Instead, scores such as the class-conditional likelihood (CCL) should be employed. Unfortunately, CCL does not decompose and its application to large domains is not feasible. We introduce a decomposable score, approximate conditional likelihood (ACL) that is capable of identifying class discriminative structures. We show that dynamic Bayesian networks (DBNs) trained with ACL have classification efficacies competitive to those trained with CCL on a set of simulated data experiments. We also show that ACL-trained DBNs outperform BDE-trained DBNs, Gaussian naïve Bayes networks and support vector machines within a neuroscience domain too large for CCL.

References

[1]
Bilmes, J. A. Dynamic Bayesian Multinets. (2000). Proceedings of the 16th conference on Uncertainty in Artificial Intelligence. pg. 38--45.
[2]
Bilmes, J., Zweig, G., Richardson, T., Filali, K., Livescu, K., Xu, P., Jackson, K., Brandman, Y., Sandness, E., Holtz, E., Torres, J., & Byrne, B. (2001). Discriminatively structured graphical models for speech recognition Tech. Report. Center for Language and Speech Processing, Johns Hopkins Univ., Baltimore, MD.
[3]
Buckner, R. L., Snyder, A., Sanders, A., Marcus, R., Morris, J. (2000). Functional Brain Imaging of Young, Nondemented, and Demented Older Adults. Journal of Cognitive Neuroscience, 12, 2. 24--34.
[4]
Burge, J. (2005) Dynamic Bayesian Networks: Class Discriminative Structure Search with an Application to Functional Magnetic Resonance Imaging Data. Tech Report. University of New Mexico. June, 2005.
[5]
Burge, J., Clark, V. P., Lane, T., Link, H., Qiu, S. (2004). Bayesian Classification of FMRI Data: Evidence for Altered Neural Networks in Dementia. In submission to Human Brain Mapping.
[6]
Buntine, W. (1991). Theory Refinement on Bayesian Networks. Proceedings of the Seventh Conference on UAI. 52--60.
[7]
Charniak, E. (1991). Bayesian Networks Without Tears. Al Magazine, 12 (4).
[8]
Chickering, D., Geiger, D., Heckerman, D. (1994). Learning Bayesian Networks is NP-Hard (Technical Report MSR-TR-94-17). Microsoft.
[9]
Cooper, G., Herskovits, E. (1992). A Bayesian method for the induction of probabilistic networks from data. Machine Learning, 9, 309--347.
[10]
Duda, R. O., Hart, P. E. (1973). Pattern classification and scene analysis. New York, NY: Wiley.
[11]
Geiger, D., Heckerman, D. (1996). Knowledge representation and inference in similarity networks and Bayesian Multinets. Artificial Intelligence. v. 82, pg. 45--74.
[12]
Greiner, R., Zhou, W. (2002). Structural extension to logistic regression: Discriminative parameter learning of belief net classifiers. Proc. 18th Natl. Conf. On Artificial Intelligence. pg. 167--173.
[13]
Grossman, D., Domingos, P. (2004). Learning Bayesian Network Classifiers by Maximizing Conditional Likelihood. International Conference on Machine Learning, 21. 361--368
[14]
Heckerman, D. (1991). Probability Similarity Networks. MIT Press, 1991.
[15]
Heckerman, D., Geiger, D., Chickering, D. M. (1995). Learning Bayesian Networks: the Combination of Knowledge and Statistical Data. Machine Learning, 20, 197--243.
[16]
Jensen, F. V., (2001). Bayesian Networks and Decision Graphs. Springer-Verlag, New York.
[17]
Lancaster J. L., Woldorff M. G., Parsons L. M., Liotti M., Freitas C. S., Rainey L., Kochunov PV, Nickerson D., Mikiten S. A., Fox P. T. (2000). Automated Talairach Atlas labels for functional brain mapping. Human Brain Mapping 10, 120--131.
[18]
Murphy, K. (2002). Dynamic Bayesian Networks: Representation, Inference and Learning. PhD dissertation, Berkeley, University of California, Computer Science Division.
[19]
Pearl, J. (1986) Fusion, Propagation and Structuring in Belief Networks. Artificial Intelligence, v. 29, n. 3, 241--288.
[20]
Sheskin, D. J. (2003). Handbook of Parameter and Nonparametric Statistical Procedures, 3rd Edition. Chapman & Hall/CRC. QA276.25.S54 2003.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICML '05: Proceedings of the 22nd international conference on Machine learning
August 2005
1113 pages
ISBN:1595931805
DOI:10.1145/1102351
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2005

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2014)Discrete Bayesian Network ClassifiersACM Computing Surveys10.1145/257686847:1(1-43)Online publication date: 14-Jul-2014
  • (2010)Feature selection for Bayesian network classifiers using the MDL-FS scoreInternational Journal of Approximate Reasoning10.1016/j.ijar.2010.02.00151:6(695-717)Online publication date: 1-Jul-2010
  • (2009)Semi-naive Exploitation of One-Dependence EstimatorsProceedings of the 2009 Ninth IEEE International Conference on Data Mining10.1109/ICDM.2009.64(278-287)Online publication date: 6-Dec-2009
  • (2008)Hybrid ICA–Bayesian network approach reveals distinct effective connectivity differences in schizophreniaNeuroImage10.1016/j.neuroimage.2008.05.06542:4(1560-1568)Online publication date: Oct-2008
  • (2007)Shrinkage Estimator for Bayesian Network ParametersProceedings of the 18th European conference on Machine Learning10.1007/978-3-540-74958-5_10(67-78)Online publication date: 17-Sep-2007
  • (2007)Discrete dynamic Bayesian network analysis of fMRI dataHuman Brain Mapping10.1002/hbm.2049030:1(122-137)Online publication date: 7-Nov-2007
  • (2006)Bayesian Sampling of Virtual Examples to Improve Classification Accuracy2006 SICE-ICASE International Joint Conference10.1109/SICE.2006.315740(1009-1014)Online publication date: Oct-2006
  • (2006)Sampling of virtual examples to improve classification accuracy for nominal attribute dataProceedings of the 5th international conference on Rough Sets and Current Trends in Computing10.1007/11908029_66(637-646)Online publication date: 6-Nov-2006
  • (2006)Improving bayesian network structure search with random variable aggregation hierarchiesProceedings of the 17th European conference on Machine Learning10.1007/11871842_11(66-77)Online publication date: 18-Sep-2006

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media