Abstract
This paper deals with the problem of multi-instance learning when label proportions are provided. In this classification problem, the instances of the dataset are divided into disjoint groups, where there is no certainty about the labels associated with individual samples. However, in each group the number of instances that belong to each class is known. We propose several versions of an EM-algorithm that learns naive Bayes models to deal with the exposed problem. The proposed algorithms are evaluated on synthetic and real datasets, and compared with state-of-the-art approaches. The obtained results show a competitive behaviour of our proposals.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dietterich, T., Lathrop, R.H., Lozano-Pérez, T.: Solving the multiple instance problem with axis-parallel rectangles. Artificial Intelligence 89(1-2), 31–71 (1997)
Kück, H., de Freitas, N.: Learning about individuals from group statistics. In: Proc. 21th Conference on Uncertainty in Artificial Intelligence, pp. 332–339 (2005)
Quadrianto, N., Smola, A.J., Caetano, T.S., Le, Q.V.: Estimating labels from label proportions. In: Proc. 25th International Conference on Machine Learning, New York, pp. 776–783 (2008)
Quadrianto, N., Smola, A.J., Caetano, T.S., Le, Q.V.: Estimating labels from label proportions. Journal of Machine Learning Research 10, 2349–2374 (2009)
Musicant, D.R., Christensen, J.M., Olson, J.F.: Supervised learning by training on aggregate outputs. In: Seventh IEEE International Conference on Data Mining, pp. 252–261 (2007)
Chen, S., Liu, B., Qian, M., Zhang, C.: Kernel K-means based framework for aggregate outputs classification. In: 2009 IEEE International Conference on Data Mining Workshops, pp. 356–361 (2009)
Rueping, S.: SVM Classifier Estimation from Group Probabilities. In: Proc. 27th International Conference on Machine Learning (2010)
Morales, D.: Clasificadores Bayesianos en la Selección Embrionaria en Tratamientos de Reproducción Asistida. PhD thesis, University of the Basque Country (2008)
McLachlan, G.J., Krishnan, T.: The EM Algorithm and Extensions. Wiley Series in Probability and Statistics. Wiley-Interscience (2008)
Frank, A., Asuncion, A.: UCI Machine Learning Repository. University of California, Irvine, http://archive.ics.uci.edu/ml
Fan, R.: LIBSVM Data: Classification, Regression and Multi-label. National Taiwan University, http://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hernández, J., Inza, I. (2011). Learning Naive Bayes Models for Multiple-Instance Learning with Label Proportions. In: Lozano, J.A., Gámez, J.A., Moreno, J.A. (eds) Advances in Artificial Intelligence. CAEPIA 2011. Lecture Notes in Computer Science(), vol 7023. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25274-7_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-25274-7_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25273-0
Online ISBN: 978-3-642-25274-7
eBook Packages: Computer ScienceComputer Science (R0)