Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1553374.1553496acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

Uncertainty sampling and transductive experimental design for active dual supervision

Published: 14 June 2009 Publication History

Abstract

Dual supervision refers to the general setting of learning from both labeled examples as well as labeled features. Labeled features are naturally available in tasks such as text classification where it is frequently possible to provide domain knowledge in the form of words that associate strongly with a class. In this paper, we consider the novel problem of active dual supervision, or, how to optimally query an example and feature labeling oracle to simultaneously collect two different forms of supervision, with the objective of building the best classifier in the most cost effective manner. We apply classical uncertainty and experimental design based active learning schemes to graph/kernel-based dual supervision models. Empirical studies confirm the potential of these schemes to significantly reduce the cost of acquiring labeled data for training high-quality models.

References

[1]
Belkin, M., Matveeva, I., & Niyogi, P. (2004). Regularization and semi-supervised learning on large graphs. Conference on Learning Theory (COLT) (pp. 486--500).
[2]
Druck, G., Mann, G., & McCallum, A. (2008). Learning from labeled features using generalized expectation criteria. 31st Annual ACM SIGIR Conference (pp. 595--602).
[3]
Globerson, A., Chechik, G., Pereira, F., & Tishby, N. (2007). Euclidean embedding of co-occurence data. Journal of Machine Learning Research, 8, 2265--2296.
[4]
Godbole, S., Harpale, A., Sarawagi, S., & Chakrabarti, S. (2004). Document classification through interactive supervision of document and term labels. Prin. and Prac. of Knowl. Disc. in Databases (PKDD) (pp. 185--196).
[5]
Ho, & Dooren, P. (2005). On the pseudo-inverse of the laplacian of a bipartite graph. Appl. Math. Letters, 8, 917--922.
[6]
Melville, P., Gryc, W., & Lawrence, R. (2009). Sentiment analysis of blogs by combining lexical knowledge with text classification. 15th ACM SIGKDD Conf. on Knowledge Discovery and Data Mining.
[7]
Melville, P., & Sindhwani, V. (2009). Active dual supervision: Reducing the cost of annotating examples and features. NAACL HLT Workshop on Active Learning for NLP.
[8]
Raghavan, H., Madani, O., & Jones, R. (2007). An interactive algorithm for asking and incorporating feature feedback into support vector machines. 30th Annual ACM SIGIR Conference (pp. 79--86).
[9]
Saar-Tsechansky, M., Melville, P., & Provost, F. (2009). Active feature-value acquisition. Management Science, 4, 664--684.
[10]
Sindhwani, V., Hu, J., & Mojsilovic, A. (2008). Regularized co-clustering with dual supervision. Neural Information Processing Systems (NIPS) (pp. 976--983).
[11]
Smola, A., & Kondor, R. (2004). Kernels and regularization on graphs. Conf. on Learning Theory (COLT) (pp. 144--158).
[12]
Tong, S., & Koller, D. (2001). Support vector machine active learning with applications to text classification. Journal of Machine Learning Research, 2, 45--66.
[13]
Yu, K., Bi, J., & Tresp, V. (2006). Active learning via transductive experimental design. International Conference on Machine Learning (ICML) (pp. 1081--1088).

Cited By

View all
  • (2023)For women, life, freedomProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/667(6013-6021)Online publication date: 19-Aug-2023
  • (2023)Disentangling societal inequality from model biasesProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/661(5959-5967)Online publication date: 19-Aug-2023
  • (2022)On The Effectiveness of Active Learning by Uncertainty Sampling in Classification of High-Dimensional Gaussian Mixture DataICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP43922.2022.9747685(4238-4242)Online publication date: 23-May-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning
June 2009
1331 pages
ISBN:9781605585161
DOI:10.1145/1553374

Sponsors

  • NSF
  • Microsoft Research: Microsoft Research
  • MITACS

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 June 2009

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

ICML '09
Sponsor:
  • Microsoft Research

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)For women, life, freedomProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/667(6013-6021)Online publication date: 19-Aug-2023
  • (2023)Disentangling societal inequality from model biasesProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/661(5959-5967)Online publication date: 19-Aug-2023
  • (2022)On The Effectiveness of Active Learning by Uncertainty Sampling in Classification of High-Dimensional Gaussian Mixture DataICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP43922.2022.9747685(4238-4242)Online publication date: 23-May-2022
  • (2022)Wisdom of the contexts: active ensemble learning for contextual anomaly detectionData Mining and Knowledge Discovery10.1007/s10618-022-00868-736:6(2410-2458)Online publication date: 4-Oct-2022
  • (2021)Analyzing Data Selection Techniques with Tools from the Theory of Information Losses2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671861(7-16)Online publication date: 15-Dec-2021
  • (2021)Semantic SamplingLow Resource Social Media Text Mining10.1007/978-981-16-5625-5_6(49-60)Online publication date: 2-Oct-2021
  • (2019)Interactive-COSMOProceedings of the Workshop on Interactive Data Mining10.1145/3304079.3310289(1-9)Online publication date: 15-Feb-2019
  • (2018)Learning with rationales for document classificationMachine Language10.1007/s10994-017-5671-3107:5(797-824)Online publication date: 1-May-2018
  • (2017)Active learningData Mining and Knowledge Discovery10.1007/s10618-016-0469-731:2(287-313)Online publication date: 1-Mar-2017
  • (2017)Evidence-based uncertainty sampling for active learningData Mining and Knowledge Discovery10.1007/s10618-016-0460-331:1(164-202)Online publication date: 1-Jan-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media