Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1390156.1390197acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

Optimized cutting plane algorithm for support vector machines

Published: 05 July 2008 Publication History

Abstract

We have developed a new Linear Support Vector Machine (SVM) training algorithm called OCAS. Its computational effort scales linearly with the sample size. In an extensive empirical evaluation OCAS significantly outperforms current state of the art SVM solvers, like SVMlight, SVMperf and BMRM, achieving speedups of over 1,000 on some datasets over SVMlight and 20 over SVMperf, while obtaining the same precise Support Vector solution. OCAS even in the early optimization steps shows often faster convergence than the so far in this domain prevailing approximative methods SGD and Pegasos. Effectively parallelizing OCAS we were able to train on a dataset of size 15 million examples (itself about 32GB in size) in just 671 seconds --- a competing string kernel SVM required 97,484 seconds to train on 10 million examples sub-sampled from this dataset.

References

[1]
Bottou, L., & Bousquet, O. (2008). The tradeoffs of large scale learning. In NIPS 20. MIT Press.
[2]
Chang, C.-C., & Lin, C.-J. (2001). LIBSVM: a library for svms. http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[3]
Chapelle, O. (2007). Training a Support Vector Machine in the Primal. Neural Comp., 19, 1155--1178.
[4]
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20, 273--297.
[5]
Cristianini, N., & Shwawe-Taylor, J. (2000). An introduction to support vector machines. Cambridge, UK: CUP.
[6]
Fan, R.-E., Chen, P.-H., & Lin, C.-J. (2005). Working set selection using second order information for training svm. Journal of Machine Learning Research, 6, 1889--1918.
[7]
Fawcett, T. (2003). Roc graphs: Notes and practical considerations for data mining researchers. Technical Report HPL-2003-4). HP Laboratories, Palo Alto, CA, USA.
[8]
Franc, V., & Sonnenburg, S. (2007). Optimized cutting plane algorithm for SVMs. Research Report; Electronic Publication 1). Fraunhofer Institute FIRST.
[9]
Joachims, T. (1999). Making large-scale SVM learning practical. Advances in Kernel Methods --- Support Vector Learning (pp. 169--184). Cambridge, MA, USA: MIT Press.
[10]
Joachims, T. (2006). Training linear svms in linear time. KDD'06.
[11]
Lin, C.-J., Weng, R. C., & Keerthi, S. S. (2007). Trust region newton methods for large-scale logistic regression. ICML '07 (pp. 561--568). ACM New York.
[12]
Shwartz, S.-S., Singer, Y., & Srebro, N. (2007). Pegasos: Primal estimated sub-gradient solver for svm. ICML '07 (pp. 807--814). ACM Press.
[13]
Sindhwani, V., & Keerthi, S.-S. (2007). Newton methods for fast solution of semi-supervised linear svms. In Large scale kernel machines. MIT Press.
[14]
Teo, C. H., Le, Q., Smola, A., & Vishwanathan, S. (2007). A scalable modular convex solver for regularized risk minimization. KDD'07.
[15]
Zanni, L., Serafini, T., & Zanghirati, G. (2006). Parallel software for training. JMLR, 7, 1467--1492.

Cited By

View all
  • (2023)Low complexity VLSI architecture for improved primal–dual support vector machine learning coreMicroprocessors and Microsystems10.1016/j.micpro.2023.10480698(104806)Online publication date: Apr-2023
  • (2022)Nonlinear optimization and support vector machinesAnnals of Operations Research10.1007/s10479-022-04655-x314:1(15-47)Online publication date: 14-Apr-2022
  • (2021)Overview of Optimization Algorithms for Large-scale Support Vector Machines2021 International Conference on Data Mining Workshops (ICDMW)10.1109/ICDMW53433.2021.00119(909-916)Online publication date: Dec-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICML '08: Proceedings of the 25th international conference on Machine learning
July 2008
1310 pages
ISBN:9781605582054
DOI:10.1145/1390156
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Pascal
  • University of Helsinki
  • Xerox
  • Federation of Finnish Learned Societies
  • Google Inc.
  • NSF
  • Machine Learning Journal/Springer
  • Microsoft Research: Microsoft Research
  • Intel: Intel
  • Yahoo!
  • Helsinki Institute for Information Technology
  • IBM: IBM

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 July 2008

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

Conference

ICML '08
Sponsor:
  • Microsoft Research
  • Intel
  • IBM

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Low complexity VLSI architecture for improved primal–dual support vector machine learning coreMicroprocessors and Microsystems10.1016/j.micpro.2023.10480698(104806)Online publication date: Apr-2023
  • (2022)Nonlinear optimization and support vector machinesAnnals of Operations Research10.1007/s10479-022-04655-x314:1(15-47)Online publication date: 14-Apr-2022
  • (2021)Overview of Optimization Algorithms for Large-scale Support Vector Machines2021 International Conference on Data Mining Workshops (ICDMW)10.1109/ICDMW53433.2021.00119(909-916)Online publication date: Dec-2021
  • (2019)Microbiota and Antibiotic Resistome of Lettuce Leaves and Radishes Grown in Soils Receiving Manure-Based Amendments Derived From Antibiotic-Treated CowsFrontiers in Sustainable Food Systems10.3389/fsufs.2019.000223Online publication date: 10-Apr-2019
  • (2019)Mini-batch cutting plane method for regularized risk minimizationFrontiers of Information Technology & Electronic Engineering10.1631/FITEE.180059620:11(1551-1563)Online publication date: 16-Dec-2019
  • (2019)Problem formulations and solvers in linear SVMArtificial Intelligence Review10.1007/s10462-018-9614-652:2(803-855)Online publication date: 1-Aug-2019
  • (2018)Nonlinear optimization and support vector machines4OR10.1007/s10288-018-0378-216:2(111-149)Online publication date: 23-May-2018
  • (2018)A divide-and-conquer method for large scale ź-nonparallel support vector machinesNeural Computing and Applications10.1007/s00521-016-2574-329:9(497-509)Online publication date: 1-May-2018
  • (2017)Exploring Locally Adaptive Dimensionality Reduction for Hyperspectral Image Classification: A Maximum Margin Metric Learning AspectIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing10.1109/JSTARS.2016.258774710:3(1136-1150)Online publication date: Mar-2017
  • (2017)LAM3LNeurocomputing10.1016/j.neucom.2016.12.008235:C(1-9)Online publication date: 26-Apr-2017
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media