Article

Large scale semi-supervised linear SVMs

Authors:

SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval

Pages 477 - 484

https://doi.org/10.1145/1148170.1148253

Published: 06 August 2006 Publication History

Get Access

Abstract

Large scale learning is often realistic only in a semi-supervised setting where a small set of labeled examples is available together with a large collection of unlabeled data. In many information retrieval and data mining applications, linear classifiers are strongly preferred because of their ease of implementation, interpretability and empirical performance. In this work, we present a family of semi-supervised linear support vector classifiers that are designed to handle partially-labeled sparse datasets with possibly very large number of examples and features. At their core, our algorithms employ recently developed modified finite Newton techniques. Our contributions in this paper are as follows: (a) We provide an implementation of Transductive SVM (TSVM) that is significantly more efficient and scalable than currently used dual techniques, for linear classification problems involving large, sparse datasets. (b) We propose a variant of TSVM that involves multiple switching of labels. Experimental results show that this variant provides an order of magnitude further improvement in training efficiency. (c) We present a new algorithm for semi-supervised learning based on a Deterministic Annealing (DA) approach. This algorithm alleviates the problem of local minimum in the TSVM optimization procedure while also being computationally attractive. We conduct an empirical study on several document classification tasks which confirms the value of our methods in large scale semi-supervised settings.

References

[1]

K. Bennett and A. Demirez, Semi-Supervised Support Vector Machines, NIPS 1998.]]

Digital Library

Google Scholar

[2]

G. Bilbro, R. Mann, T.K. Miller, W.E. Snyder and D.E. Van den, Optimization by Mean Field Annealing, NIPS 1989.]]

Digital Library

Google Scholar

[3]

O. Chapelle and A. Zien, Semi-Supervised Classification by Low Density Separation, AI & Statistics, Barbados, January 2005.]]

Google Scholar

[4]

R. Collobert, F. Sinz, J. Weston, and L. Bottou, Large Scale Transductive SVMs, (submitted) 2006.]]

Google Scholar

[5]

T. Joachims, Transductive Inference for Text Classification using Support Vector Machines, ICML 1998.]]

Digital Library

Google Scholar

[6]

G. Fung and O. Mangasarian, Semi-Supervised Support Vector Machines for Unlabeled Data Classification, Optimization Methods and Software 15, 2001, 29--44.]]

Google Scholar

[7]

D. Lewis, Y. Yang, T. Rose and F. Li, RCV1: A New Benchmark Collection for Text Categorization Research, Journal of Machine Learning Research 5:361--397, 2004.]]

Digital Library

Google Scholar

[8]

S. S. Keerthi and D. DeCoste, A Modified Finite Newton Method for Fast Solution of Large Scale Linear SVMs, Journal of Machine Learning Research 6:341--361, 2005.]]

Digital Library

Google Scholar

[9]

C. Peterson and B. Soderberg, A new method for mapping optimization problems onto neural networks, International Journal of Neural Systems, 1(1):3--22, 1989.]]

Google Scholar

[10]

V. Sindhwani, S. S. Keerthi, and O. Chapelle, Deterministic Annealing for Semi-supervised Kernel Machines, ICML 2006.]]

Digital Library

Google Scholar

[11]

V. Sindhwani and S.S. Keerthi, Large Scale Semi-supervised Linear SVMs, Technical report, Yahoo research, 2006.]]

Google Scholar

[12]

http://www.cs.uchicago.edu/~vikass/research.html]]

Google Scholar

[13]

V. Vapnik, Statistical Learning Theory, John Wiley and Sons, New York, 1998.]]

Google Scholar

Cited By

View all

Dinc Yalcin GCurtis F(2024)Incremental quasi-Newton algorithms for solving a nonconvex, nonsmooth, finite-sum optimization problemOptimization Methods and Software10.1080/10556788.2023.229643239:2(345-367)Online publication date: 28-Jan-2024
https://doi.org/10.1080/10556788.2023.2296432
Jiao ZZhang HZhang RLi X(2024)Graph manifold learning with non-gradient decision layerNeurocomputing10.1016/j.neucom.2024.127390579(127390)Online publication date: Apr-2024
https://doi.org/10.1016/j.neucom.2024.127390
Shahbazi NAsudeh A(2024)Reliability evaluation of individual predictions: a data-centric approachThe VLDB Journal10.1007/s00778-024-00857-w33:4(1203-1230)Online publication date: 30-May-2024
https://doi.org/10.1007/s00778-024-00857-w
Show More Cited By

Index Terms

Large scale semi-supervised linear SVMs
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
    2. Machine learning approaches
      1. Classification and regression trees

Recommendations

Multiview Semi-Supervised Learning with Consensus

Obtaining high-quality and up-to-date labeled data can be difficult in many real-world machine learning applications. Semi-supervised learning aims to improve the performance of a classifier trained with limited number of labeled data by utilizing the ...
Semi-supervised text categorization: Exploiting unlabeled data using ensemble learning algorithms

Text categorization is one of the fundamental tasks in text mining. Classical supervised methods need lot of labeled data to train a classifier. Since assigning labels to the large amount of data is very costly and time consuming, it is useful to use ...
Semi-supervised multi-class Adaboost by exploiting unlabeled data

Research highlights We propose a semi-supervised learning method by using the multi-class boosting. It handles K-class classification without reducing into multiple two-class problems. The classification accuracy of base classifier requires only 1/K or ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval

August 2006

768 pages

ISBN:1595933697

DOI:10.1145/1148170

General Chair:
Efthimis N. Efthimiadis
University of Washington
,
Program Chairs:
Susan Dumais
Microsoft Research, Redmond
,
David Hawking
CSIRO ICT Centre, Canberra, Australia
,
Kalervo Järvelin,
University of Tampere, Finland

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 August 2006

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

SIGIR06

Sponsor:

SIGIR06: The 29th Annual International SIGIR Conference

August 6 - 11, 2006

Washington, Seattle, USA

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

134
Total Citations
View Citations
1,338
Total Downloads

Downloads (Last 12 months)20
Downloads (Last 6 weeks)4

Reflects downloads up to 14 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Dinc Yalcin GCurtis F(2024)Incremental quasi-Newton algorithms for solving a nonconvex, nonsmooth, finite-sum optimization problemOptimization Methods and Software10.1080/10556788.2023.229643239:2(345-367)Online publication date: 28-Jan-2024
https://doi.org/10.1080/10556788.2023.2296432
Jiao ZZhang HZhang RLi X(2024)Graph manifold learning with non-gradient decision layerNeurocomputing10.1016/j.neucom.2024.127390579(127390)Online publication date: Apr-2024
https://doi.org/10.1016/j.neucom.2024.127390
Shahbazi NAsudeh A(2024)Reliability evaluation of individual predictions: a data-centric approachThe VLDB Journal10.1007/s00778-024-00857-w33:4(1203-1230)Online publication date: 30-May-2024
https://doi.org/10.1007/s00778-024-00857-w
Damminsed VPanup WWangkeeree R(2023)Laplacian Twin Support Vector Machine With Pinball Loss for Semi-Supervised ClassificationIEEE Access10.1109/ACCESS.2023.326227011(31399-31416)Online publication date: 2023
https://doi.org/10.1109/ACCESS.2023.3262270
Bist USingh N(2022)A Novel Chaotic Kernel Framework for Support Vector Machines using Probability-Based Feature Extraction Method2022 9th International Conference on Computing for Sustainable Global Development (INDIACom)10.23919/INDIACom54597.2022.9763302(885-890)Online publication date: 23-Mar-2022
https://doi.org/10.23919/INDIACom54597.2022.9763302
Gao ZWu HTakáč MGu BAl Hasan MXiong L(2022)Towards Practical Large Scale Non-Linear Semi-Supervised Learning with Balancing ConstraintsProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557150(3072-3081)Online publication date: 17-Oct-2022
https://dl.acm.org/doi/10.1145/3511808.3557150
Huang GDu C(2022)The High Separation Probability Assumption for Semi-Supervised LearningIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2022.316106752:12(7561-7573)Online publication date: Dec-2022
https://doi.org/10.1109/TSMC.2022.3161067
Han JZang WYu MSandhu R(2021)Quantify Co-Residency Risks in the Cloud Through Deep LearningIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2020.303207318:4(1568-1579)Online publication date: 1-Jul-2021
https://doi.org/10.1109/TDSC.2020.3032073
Lucke KPennington JKreitzberg PKäll LSerang O(2021)Performing Selection on a Monotonic Function in Lieu of Sorting Using Layer-Ordered HeapsJournal of Proteome Research10.1021/acs.jproteome.0c0071120:4(1849-1854)Online publication date: 2-Feb-2021
https://doi.org/10.1021/acs.jproteome.0c00711
Le D(2020)Machine learning-based approaches for disease gene predictionBriefings in Functional Genomics10.1093/bfgp/elaa01319:5-6(350-363)Online publication date: 22-Jun-2020
https://doi.org/10.1093/bfgp/elaa013
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Multiview Semi-Supervised Learning with Consensus

Semi-supervised text categorization: Exploiting unlabeled data using ensemble learning algorithms

Semi-supervised multi-class Adaboost by exploiting unlabeled data