Efficient SVM Regression Training with SMO

Gary William Flake & Steve Lawrence

Abstract

The sequential minimal optimization algorithm (SMO) has been shown to be an effective method for training support vector machines (SVMs) on classification tasks defined on sparse data sets. SMO differs from most SVM algorithms in that it does not require a quadratic programming solver. In this work, we generalize SMO so that it can handle regression problems. However, one problem with SMO is that its rate of convergence slows down dramatically when data is non-sparse and when there are many support vectors in the solution—as is often the case in regression—because kernel function evaluations tend to dominate the runtime in this case. Moreover, caching kernel function outputs can easily degrade SMO's performance even more because SMO tends to access kernel function outputs in an unstructured manner. We address these problems with several modifications that enable caching to be effectively used with SMO. For regression problems, our modifications improve convergence time by over an order of magnitude.
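
The abstract's point about caching can be illustrated with a minimal sketch. It assumes a least-recently-used cache of kernel rows and a Gaussian kernel; the names `rbf_kernel`, `KernelRowCache`, and `hit_rate`, the toy data, and the two access patterns are all hypothetical and are not taken from the paper. It does not reproduce the authors' modifications. It only shows why a small cache helps when the same indices are revisited soon and does nothing when accesses are scattered.

```python
import math
from collections import OrderedDict


def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel between two equal-length sequences."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


class KernelRowCache:
    """LRU cache of kernel rows K(x_i, .) with hit and miss counters."""

    def __init__(self, data, capacity, kernel=rbf_kernel):
        self.data = data
        self.capacity = capacity
        self.kernel = kernel
        self.rows = OrderedDict()
        self.hits = 0
        self.misses = 0

    def row(self, i):
        if i in self.rows:
            self.hits += 1
            self.rows.move_to_end(i)  # mark as most recently used
            return self.rows[i]
        self.misses += 1
        # A miss costs one kernel evaluation per training point.
        values = [self.kernel(self.data[i], x) for x in self.data]
        self.rows[i] = values
        if len(self.rows) > self.capacity:
            self.rows.popitem(last=False)  # evict least recently used
        return values


def hit_rate(access_pattern, data, capacity):
    """Fraction of row requests served from the cache."""
    cache = KernelRowCache(data, capacity)
    for i in access_pattern:
        cache.row(i)
    return cache.hits / (cache.hits + cache.misses)


data = [[float(i), float(i % 3)] for i in range(8)]
clustered = [0, 1, 0, 1, 2, 3, 2, 3]  # revisits a small working set
scattered = [0, 1, 2, 3, 4, 5, 6, 7]  # never revisits a cached row

print(hit_rate(clustered, data, capacity=2))  # 0.5
print(hit_rate(scattered, data, capacity=2))  # 0.0
```

With the same cache size, the clustered pattern is served from the cache half the time, while the scattered pattern pays for every row and also carries the bookkeeping overhead of the cache.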

Author information

Authors and Affiliations

NEC Research Institute, 4 Independence Way, Princeton, NJ, 08540, USA
Gary William Flake & Steve Lawrence

About this article

Cite this article

Flake, G.W., Lawrence, S. Efficient SVM Regression Training with SMO. Machine Learning 46, 271–290 (2002). https://doi.org/10.1023/A:1012474916001

Issue Date: January 2002
DOI: https://doi.org/10.1023/A:1012474916001
