Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1148170.1148253acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Large scale semi-supervised linear SVMs

Published: 06 August 2006 Publication History

Abstract

Large scale learning is often realistic only in a semi-supervised setting where a small set of labeled examples is available together with a large collection of unlabeled data. In many information retrieval and data mining applications, linear classifiers are strongly preferred because of their ease of implementation, interpretability and empirical performance. In this work, we present a family of semi-supervised linear support vector classifiers that are designed to handle partially-labeled sparse datasets with possibly very large number of examples and features. At their core, our algorithms employ recently developed modified finite Newton techniques. Our contributions in this paper are as follows: (a) We provide an implementation of Transductive SVM (TSVM) that is significantly more efficient and scalable than currently used dual techniques, for linear classification problems involving large, sparse datasets. (b) We propose a variant of TSVM that involves multiple switching of labels. Experimental results show that this variant provides an order of magnitude further improvement in training efficiency. (c) We present a new algorithm for semi-supervised learning based on a Deterministic Annealing (DA) approach. This algorithm alleviates the problem of local minimum in the TSVM optimization procedure while also being computationally attractive. We conduct an empirical study on several document classification tasks which confirms the value of our methods in large scale semi-supervised settings.

References

[1]
K. Bennett and A. Demirez, Semi-Supervised Support Vector Machines, NIPS 1998.]]
[2]
G. Bilbro, R. Mann, T.K. Miller, W.E. Snyder and D.E. Van den, Optimization by Mean Field Annealing, NIPS 1989.]]
[3]
O. Chapelle and A. Zien, Semi-Supervised Classification by Low Density Separation, AI & Statistics, Barbados, January 2005.]]
[4]
R. Collobert, F. Sinz, J. Weston, and L. Bottou, Large Scale Transductive SVMs, (submitted) 2006.]]
[5]
T. Joachims, Transductive Inference for Text Classification using Support Vector Machines, ICML 1998.]]
[6]
G. Fung and O. Mangasarian, Semi-Supervised Support Vector Machines for Unlabeled Data Classification, Optimization Methods and Software 15, 2001, 29--44.]]
[7]
D. Lewis, Y. Yang, T. Rose and F. Li, RCV1: A New Benchmark Collection for Text Categorization Research, Journal of Machine Learning Research 5:361--397, 2004.]]
[8]
S. S. Keerthi and D. DeCoste, A Modified Finite Newton Method for Fast Solution of Large Scale Linear SVMs, Journal of Machine Learning Research 6:341--361, 2005.]]
[9]
C. Peterson and B. Soderberg, A new method for mapping optimization problems onto neural networks, International Journal of Neural Systems, 1(1):3--22, 1989.]]
[10]
V. Sindhwani, S. S. Keerthi, and O. Chapelle, Deterministic Annealing for Semi-supervised Kernel Machines, ICML 2006.]]
[11]
V. Sindhwani and S.S. Keerthi, Large Scale Semi-supervised Linear SVMs, Technical report, Yahoo research, 2006.]]
[12]
http://www.cs.uchicago.edu/~vikass/research.html]]
[13]
V. Vapnik, Statistical Learning Theory, John Wiley and Sons, New York, 1998.]]

Cited By

View all
  • (2024)Incremental quasi-Newton algorithms for solving a nonconvex, nonsmooth, finite-sum optimization problemOptimization Methods and Software10.1080/10556788.2023.229643239:2(345-367)Online publication date: 28-Jan-2024
  • (2024)Graph manifold learning with non-gradient decision layerNeurocomputing10.1016/j.neucom.2024.127390579(127390)Online publication date: Apr-2024
  • (2024)Reliability evaluation of individual predictions: a data-centric approachThe VLDB Journal10.1007/s00778-024-00857-w33:4(1203-1230)Online publication date: 30-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
August 2006
768 pages
ISBN:1595933697
DOI:10.1145/1148170
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 August 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. global optimization
  2. support vector machines
  3. text categorization
  4. unlabeled data

Qualifiers

  • Article

Conference

SIGIR06
Sponsor:
SIGIR06: The 29th Annual International SIGIR Conference
August 6 - 11, 2006
Washington, Seattle, USA

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)4
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Incremental quasi-Newton algorithms for solving a nonconvex, nonsmooth, finite-sum optimization problemOptimization Methods and Software10.1080/10556788.2023.229643239:2(345-367)Online publication date: 28-Jan-2024
  • (2024)Graph manifold learning with non-gradient decision layerNeurocomputing10.1016/j.neucom.2024.127390579(127390)Online publication date: Apr-2024
  • (2024)Reliability evaluation of individual predictions: a data-centric approachThe VLDB Journal10.1007/s00778-024-00857-w33:4(1203-1230)Online publication date: 30-May-2024
  • (2023)Laplacian Twin Support Vector Machine With Pinball Loss for Semi-Supervised ClassificationIEEE Access10.1109/ACCESS.2023.326227011(31399-31416)Online publication date: 2023
  • (2022)A Novel Chaotic Kernel Framework for Support Vector Machines using Probability-Based Feature Extraction Method2022 9th International Conference on Computing for Sustainable Global Development (INDIACom)10.23919/INDIACom54597.2022.9763302(885-890)Online publication date: 23-Mar-2022
  • (2022)Towards Practical Large Scale Non-Linear Semi-Supervised Learning with Balancing ConstraintsProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557150(3072-3081)Online publication date: 17-Oct-2022
  • (2022)The High Separation Probability Assumption for Semi-Supervised LearningIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2022.316106752:12(7561-7573)Online publication date: Dec-2022
  • (2021)Quantify Co-Residency Risks in the Cloud Through Deep LearningIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2020.303207318:4(1568-1579)Online publication date: 1-Jul-2021
  • (2021)Performing Selection on a Monotonic Function in Lieu of Sorting Using Layer-Ordered HeapsJournal of Proteome Research10.1021/acs.jproteome.0c0071120:4(1849-1854)Online publication date: 2-Feb-2021
  • (2020)Machine learning-based approaches for disease gene predictionBriefings in Functional Genomics10.1093/bfgp/elaa01319:5-6(350-363)Online publication date: 22-Jun-2020
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media