Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2623330.2623759acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Active learning for sparse bayesian multilabel classification

Published: 24 August 2014 Publication History

Abstract

We study the problem of active learning for multilabel classification. We focus on the real-world scenario where the average number of positive (relevant) labels per data point is small leading to positive label sparsity. Carrying out mutual information based near-optimal active learning in this setting is a challenging task since the computational complexity involved is exponential in the total number of labels. We propose a novel inference algorithm for the sparse Bayesian multilabel model of [17]. The benefit of this alternate inference scheme is that it enables a natural approximation of the mutual information objective. We prove that the approximation leads to an identical solution to the exact optimization problem but at a fraction of the optimization cost. This allows us to carry out efficient, non-myopic, and near-optimal active learning for sparse multilabel classification. Extensive experiments reveal the effectiveness of the method.

Supplementary Material

MP4 File (p472-sidebyside.mp4)

References

[1]
Mulan Multilabel Datasets. http://mulan.sourceforge.net/datasets.html.
[2]
R. Agrawal, A. Gupta, Y. Prabhu, and M. Varma. Multi-label learning with millions of labels: Recommending advertiser bid phrases for web pages. In WWW, 2013.
[3]
K. Balasubramanian and G. Lebanon. The Landmark Selection Method for Multiple Output Prediction. In ICML, 2012.
[4]
A. Bergamo, L. Torresani, and A. W. Fitzgibbon. PiCoDes: Learning a Compact Code for Novel-Category Recognition. In NIPS, 2011.
[5]
W. Bi and J. T.-Y. Kwok. Efficient Multi-label Classification with Many Labels. In ICML, pages 405--413, 2013.
[6]
C. M. Bishop and M. E. Tipping. Variational Relevance Vector Machines. In UAI, 2000.
[7]
W. Caselton and J. Zidek. Optimal monitoring network designs. Statistics and Probability Letters, 1984.
[8]
Y.-N. Chen and H.-T. Lin. Feature-aware Label Space Dimension Reduction for Multi-label Classification. In NIPS, pages 1538--1546, 2012.
[9]
W. Chu, V. Sindhwani, Z. Ghahramani, and S. Keerthi. Relational Learning with Gaussian Processes. In NIPS, 2006.
[10]
M. Cissé, N. Usunier, T. Arti'eres, and P. Gallinari. Robust Bloom Filters for Large MultiLabel Classification Tasks. In NIPS, pages 1851--1859, 2013.
[11]
A. Esuli and F. Sebastiani. Active Learning Strategies for Multi-Label Text Classification. In ECIR, 2009.
[12]
C.-S. Feng and H.-T. Lin. Multi-label Classification with Error-Correcting Codes. JMLR, pages 289--295, 2011.
[13]
A. Goldberg, X. Zhu, A. Furger, and J. Xu. OASIS: Online Active Semi-Supervised Learning. In AAAI, 2011.
[14]
A. Gretton, R. Herbrich, and A. Hyvärinen. Kernel methods for measuring independence. JMLR, 2005.
[15]
D. Hsu, S. Kakade, J. Langford, and T. Zhang. Multi-Label Prediction via Compressed Sensing. In NIPS, 2009.
[16]
S. Ji, L. Tang, S. Yu, and J. Ye. Extracting Shared Subspace for Multi-label Classification. In KDD, pages 381--389, 2008.
[17]
A. Kapoor, R. Viswanathan, and P. Jain. Multilabel Classification using Bayesian Compressed Sensing. In NIPS, 2012.
[18]
A. Krause and C. Guestrin. Near-optimal Nonmyopic Value of Information in Graphical Models. In UAI, 2005.
[19]
A. Krause, A. Singh, and C. Guestrin. Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies. JMLR, 2008.
[20]
X. Li and Y. Guo. Active Learning with Multi-label SVM Classification. In IJCAI, 2013.
[21]
X. Li, L. Wang, and E. Sung. Multi-label SVM Active Learning for Image Classification. In ICIP, 2004.
[22]
G. Nemhauser, L. Wolsey, and M. Fisher. An analysis of approximations for maximizing submodular set functions. Mathematical Programming, 1978.
[23]
J. C. Platt. Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods. In Advances In Large Margin Classifiers. MIT Press, 1999.
[24]
B. Settles. Active learning literature survey. Technical report, 2010.
[25]
Shihao, Y. Xue, and L. Carin. Bayesian Compressive Sensing, 2007.
[26]
A. Singh, A. Krause, C. Guestrin, and W. J. Kaiser. Efficient Informative Sensing using Multiple Robots.
[27]
M. Singh, E. Curran, and P. Cunningham. Active learning for multi-label image annotation. Technical report, University College Dublin, 2009.
[28]
F. Tai and H.-T. Lin. Multi-label Classification with Principal Label Space Transformation. In Workshop proceedings of learning from multi-label data, 2010.
[29]
J. Weston, S. Bengio, and N. Usunier. Wsabie: Scaling Up To Large Vocabulary Image Annotation. In IJCAI, 2011.
[30]
J. Weston, A. Makadia, and H. Yee. Label Partitioning for Sublinear Ranking. In ICML, 2013.
[31]
B. Yang, J. Sun, T. Wang, and Z. Chen. Effective multi-label active learning for text classification. In KDD, 2009.
[32]
H.-F. Yu, P. Jain, and I. S. Dhillon. Large-scale Multi-label Learning with Missing Labels. ICML, 2014.
[33]
Y. Zhang and J. G. Schneider. Multi-Label Output Codes using Canonical Correlation Analysis. In AISTATS, pages 873--882, 2011.
[34]
X. Zhu, J. Lafferty, and Z. Ghahramani. Semi-Supervised Learning: From Gaussian Fields to Gaussian Processes. Technical report, School of CS, Carnegie Mellon University, 2003.

Cited By

View all
  • (2024)Active Learning for Discrete Latent Variable ModelsNeural Computation10.1162/neco_a_0164636:3(437-474)Online publication date: 16-Feb-2024
  • (2024)Instance-Ambiguity Weighting for Multi-label Recognition with Limited AnnotationsAdvances in Knowledge Discovery and Data Mining10.1007/978-981-97-2242-6_13(156-167)Online publication date: 25-Apr-2024
  • (2023)CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00130(1348-1357)Online publication date: 1-Oct-2023
  • Show More Cited By

Index Terms

  1. Active learning for sparse bayesian multilabel classification

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2014
    2028 pages
    ISBN:9781450329569
    DOI:10.1145/2623330
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 August 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. active learning
    2. multi-label learning
    3. mutual information

    Qualifiers

    • Research-article

    Conference

    KDD '14
    Sponsor:

    Acceptance Rates

    KDD '14 Paper Acceptance Rate 151 of 1,036 submissions, 15%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)23
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 23 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Active Learning for Discrete Latent Variable ModelsNeural Computation10.1162/neco_a_0164636:3(437-474)Online publication date: 16-Feb-2024
    • (2024)Instance-Ambiguity Weighting for Multi-label Recognition with Limited AnnotationsAdvances in Knowledge Discovery and Data Mining10.1007/978-981-97-2242-6_13(156-167)Online publication date: 25-Apr-2024
    • (2023)CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00130(1348-1357)Online publication date: 1-Oct-2023
    • (2023)Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52729.2023.00332(3408-3417)Online publication date: Jun-2023
    • (2023)Active learning in multi-label image classification with graph convolutional network embeddingFuture Generation Computer Systems10.1016/j.future.2023.05.028148(56-65)Online publication date: Nov-2023
    • (2023)Learning Accurate Performance Predictors for Ultrafast Automated Model CompressionInternational Journal of Computer Vision10.1007/s11263-023-01783-0131:7(1761-1783)Online publication date: 13-Apr-2023
    • (2023)Multi-label classification with weak labels by learning label correlation and label regularizationApplied Intelligence10.1007/s10489-023-04562-z53:17(20110-20133)Online publication date: 30-Mar-2023
    • (2023)Category-Wise Fine-Tuning for Image Multi-label Classification with Partial LabelsNeural Information Processing10.1007/978-981-99-8145-8_26(332-345)Online publication date: 27-Nov-2023
    • (2022) Top- Partial Label Machine IEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.308339733:11(6775-6788)Online publication date: Nov-2022
    • (2022)Generalized Large Margin $k$NN for Partial Label LearningIEEE Transactions on Multimedia10.1109/TMM.2021.310943824(1055-1066)Online publication date: 2022
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media