Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/775047.775122acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Scaling multi-class support vector machines using inter-class confusion

Published: 23 July 2002 Publication History

Abstract

Support vector machines (SVMs) excel at two-class discriminative learning problems. They often outperform generative classifiers, especially those that use inaccurate generative models, such as the naïve Bayes (NB) classifier. On the other hand, generative classifiers have no trouble in handling an arbitrary number of classes efficiently, and NB classifiers train much faster than SVMs owing to their extreme simplicity. In contrast, SVMs handle multi-class problems by learning redundant yes/no (one-vs-others) classifiers for each class, further worsening the performance gap. We propose a new technique for multi-way classification which exploits the accuracy of SVMs and the speed of NB classifiers. We first use a NB classifier to quickly compute a confusion matrix, which is used to reduce the number and complexity of the two-class SVMs that are built in the second stage. During testing, we first get the prediction of a NB classifier and use that to selectively apply only a subset of the two-class SVMs. On standard benchmarks, our algorithm is 3 to 6 times faster than SVMs and yet matches or even exceeds their accuracy.

References

[1]
E. L. Allwein, R. E. Schapire, and Y. Singer. Reducing multiclass to binary: A unifying approach for margin classifiers. In 17th ICML, 2000.
[2]
S. Chakrabarti, B. Dom, R. Agrawal, and P. Raghavan. Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies. The VLDB Journal, 1998.
[3]
T. G. Dietterich and G. Bakiri. Solving multiclass learning problems via ECOCs. JAIR, 2:263--286, 1995.
[4]
S. Dumais, J. Platt, D. Heckerman, and M. Sahami Inductive learning algorithms and representations for text categorization. In 7th CIKM, 1998.
[5]
Rayid Ghani. Using error-correcting codes for text classification. In 17th ICML, 2000.
[6]
Shantanu Godbole. Exploiting confusion matrices for automatic generation of topic hierarchies and scaling up multi-way classifiers. Technical Report, IIT Bombay, 2002. http://www.it.iitb.ac.in/~shantanu/work/aps2002.pdf
[7]
Ryan Rifkin and Jason D. M. Rennie. Improving multi-class text classification with the support vector machine, AI Memo, AIM-2001-026, MIT, 2001.
[8]
T. Joachims. A statistical learning model of text classification for SVMs. In SIGIR 2001, volume 24, ACM.
[9]
D. Koller and M. Sahami. Hierarchically classifying. documents using very few words. In 14th ICML, 1997.
[10]
U. Kressel. Pairwise classification and support vector machines. In Advances in Kernel Methods: Support Vector Learning, MIT Press, 1999.
[11]
T. M. Mitchell. Conditions for the equivalence of hierarchical and non-hierarchical Bayesian classifiers. Technical note, 1998. Online at http://www.cs.cmu.edu/~tom/hierproof.ps
[12]
J. Platt, N. Cristianini, and J. Shawe-Taylor. Large margin DAGs for multiclass classification. In Advances in NIPS 12, MIT Press, 2000.
[13]
M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130--137, 1980.
[14]
V. N. Vapnik. The nature of statistical learning theory. Springer Verlag, 1995.

Cited By

View all
  • (2022)MCMS-STM: An Extension of Support Tensor Machine for Multiclass Multiscale Object Recognition in Remote Sensing ImagesRemote Sensing10.3390/rs1401019614:1(196)Online publication date: 2-Jan-2022
  • (2022)Hyperspectral Image Classification using Digital Signature Comparison based Classifier2022 IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE)10.1109/ICDCECE53908.2022.9793137(1-6)Online publication date: 23-Apr-2022
  • (2021)A service classification model for IoT services discoveryComputing10.1007/s00607-021-01007-8Online publication date: 30-Aug-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
July 2002
719 pages
ISBN:158113567X
DOI:10.1145/775047
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 July 2002

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

KDD02
Sponsor:

Acceptance Rates

KDD '02 Paper Acceptance Rate 44 of 307 submissions, 14%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)MCMS-STM: An Extension of Support Tensor Machine for Multiclass Multiscale Object Recognition in Remote Sensing ImagesRemote Sensing10.3390/rs1401019614:1(196)Online publication date: 2-Jan-2022
  • (2022)Hyperspectral Image Classification using Digital Signature Comparison based Classifier2022 IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE)10.1109/ICDCECE53908.2022.9793137(1-6)Online publication date: 23-Apr-2022
  • (2021)A service classification model for IoT services discoveryComputing10.1007/s00607-021-01007-8Online publication date: 30-Aug-2021
  • (2019)Hierarchical Classification Using Binary DataAI Magazine10.1609/aimag.v40i2.284640:2(59-65)Online publication date: 1-Jun-2019
  • (2019)Improving the Prediction of Therapist Behaviors in Addiction Counseling by Exploiting Class ConfusionsICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2019.8682885(6605-6609)Online publication date: May-2019
  • (2019)Approaches for Using Machine Learning Algorithms with Large Label Sets for Rotorcraft Maintenance2019 IEEE Aerospace Conference10.1109/AERO.2019.8742027(1-8)Online publication date: Mar-2019
  • (2019)Confusion Matrix-Based Building of Hierarchical ClassificationProgress in Pattern Recognition, Image Analysis, Computer Vision, and Applications10.1007/978-3-030-13469-3_32(271-278)Online publication date: 3-Mar-2019
  • (2017)Label-Driven Learning Framework: Towards More Accurate Bayesian Network Classifiers through Discrimination of High-Confidence LabelsEntropy10.3390/e1912066119:12(661)Online publication date: 3-Dec-2017
  • (2017)Ontology Construction Based on Deep LearningAdvances in Computer Science and Ubiquitous Computing10.1007/978-981-10-7605-3_83(505-510)Online publication date: 20-Dec-2017
  • (2015)DeepBag: Recognizing Handbag ModelsIEEE Transactions on Multimedia10.1109/TMM.2015.248022817:11(2072-2083)Online publication date: Nov-2015
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media