research-article

Weakly Supervised Random Forest for Multi-Label Image Clustering and Segmentation

Authors:

Wei Wei

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

Pages 227 - 233

https://doi.org/10.1145/2671188.2749377

Published: 22 June 2015

Abstract

Clustering is a useful statistical tool in data mining and computer vision, and supervised information is often introduced to improve clustering performance. However, labeling each piece of data accurately is extremely expensive when the amount of data is huge. Existing supervised clustering methods reduce this labeling workload by transferring bag-level labels to instance-level descriptors, but they allow each bag only one label, which seriously limits their scope of application. In this paper, we propose weakly supervised multi-label clustering, which allows a bag of data to be assigned multiple labels. The key technique is a weakly supervised random forest that estimates its model parameters with a deterministic annealing strategy for optimizing the non-convex objective function. The proposed algorithm is applied to two typical problems, image clustering and image segmentation. In our experiments, it achieves impressive efficiency in both the training and testing stages on state-of-the-art image data sets.
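The abstract does not give the objective function or the forest's split rules, so the following is only a generic illustration of the two ideas it names: bag-level multi-labels that constrain instance-level assignments, and deterministic annealing as a way to ease into a non-convex assignment problem. It is not the paper's algorithm. The function name `anneal_bag_labels`, the squared-distance cost, the per-label centres, and the temperature schedule are all assumptions made for this sketch.

```python
import math

def anneal_bag_labels(bags, n_labels, t_start=10.0, t_end=0.01, cooling=0.8):
    """Toy deterministic annealing over bag-constrained instance labels.

    bags: list of (instances, labels); each instance is a tuple of floats and
    labels is the set of label ids attached to the whole bag (hypothetical
    input layout). Assumes every label id occurs in at least one bag.
    """
    # Flatten: every instance inherits the label set of its bag.
    points = [(x, set(labels)) for xs, labels in bags for x in xs]
    dim = len(points[0][0])

    def sq_dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))

    # Initialise each label centre as the mean of all instances that may carry it.
    centres = []
    for k in range(n_labels):
        members = [x for x, allowed in points if k in allowed]
        centres.append(tuple(sum(m[j] for m in members) / len(members)
                             for j in range(dim)))

    t = t_start
    while t > t_end:
        sums = [[0.0] * dim for _ in range(n_labels)]
        weights = [0.0] * n_labels
        for x, allowed in points:
            # Gibbs distribution at temperature t, restricted to the bag's labels.
            dists = {k: sq_dist(x, centres[k]) for k in allowed}
            d_min = min(dists.values())
            probs = {k: math.exp(-(dk - d_min) / t) for k, dk in dists.items()}
            z = sum(probs.values())
            for k, p in probs.items():
                w = p / z
                weights[k] += w
                for j in range(dim):
                    sums[k][j] += w * x[j]
        # Re-estimate centres from the soft assignments, then cool down.
        centres = [tuple(s / weights[k] for s in sums[k]) if weights[k] > 0
                   else centres[k] for k in range(n_labels)]
        t *= cooling

    # Final hard assignment: nearest centre among the labels the bag allows.
    hard = [min(sorted(allowed), key=lambda k: sq_dist(x, centres[k]))
            for x, allowed in points]
    return hard, centres

# One bag carries both labels; the other two bags carry a single label each.
bags = [
    ([(0.0,), (0.2,), (5.0,)], {0, 1}),
    ([(0.1,), (-0.1,)], {0}),
    ([(5.1,), (4.9,)], {1}),
]
labels, centres = anneal_bag_labels(bags, n_labels=2)
print(labels, centres)
```

At a high temperature every instance in the two-label bag is split almost evenly between its candidate labels; as the temperature falls, the assignments harden, which is the usual motivation for annealing a non-convex objective rather than committing to hard assignments from the start.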

Cited By

Tseng M (2023). GA-based weighted ensemble learning for multi-label aerial image classification using convolutional neural networks and vision transformers. Machine Learning: Science and Technology, 4:4 (045045). Online publication date: 7-Dec-2023.
https://doi.org/10.1088/2632-2153/ad10cf
Bakdi A, Kristensen N, Stakkeland M (2022). Multiple Instance Learning With Random Forest for Event Logs Analysis and Predictive Maintenance in Ship Electric Propulsion System. IEEE Transactions on Industrial Informatics, 18:11 (7718-7728). Online publication date: Nov-2022.
https://doi.org/10.1109/TII.2022.3144177

Index Terms

Weakly Supervised Random Forest for Multi-Label Image Clustering and Segmentation
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Image segmentation
        Video segmentation

Published In

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

June 2015

700 pages

ISBN:9781450332743

DOI:10.1145/2671188

General Chairs:
Alex Hauptmann (Carnegie Mellon University, USA)
Chong-Wah Ngo (City University of Hong Kong, China)
Xiangyang Xue (Fudan University, China)

Program Chairs:
Yu-Gang Jiang (Fudan University, China)
Cees Snoek (University of Amsterdam and Qualcomm Research Netherlands)
Nuno Vasconcelos (University of California, San Diego, USA)

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Funding Sources

National Natural Science Foundation of China
Zhejiang Provincial Natural Science Foundation of China

Conference

ICMR '15

Sponsor:

SIGMM

ICMR '15: International Conference on Multimedia Retrieval

June 23 - 26, 2015

Shanghai, China

Acceptance Rates

ICMR '15 Paper Acceptance Rate 48 of 127 submissions, 38%;

Overall Acceptance Rate 254 of 830 submissions, 31%

Bibliometrics

Total Citations: 2
Total Downloads: 213
Downloads (Last 12 months): 8
Downloads (Last 6 weeks): 1

Reflects downloads up to 16 Dec 2024
