Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3195106.3195129acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlcConference Proceedingsconference-collections
research-article

A Comparative Study for Classification on Different Domain

Published: 26 February 2018 Publication History

Abstract

There is no individual classification technique has been shown to deal with all kinds of classification problems. The objective is to select the technique which more possibly reaches the best performance for any domain of data set. We focus on classifying datasets in different domains and properties such as numerical, categorical, and textual. We deal with one versus all strategy to handle multi-class problems. In the experiment, we compared the performance of 4 classification techniques namely Boosted C5.0, KNN, Naïve Bayes, and SVM on 10-fold cross-validation on different number of features. For numerical data set (low and high dimensional data set), the performance of KNN was better than other classification methods. For a categorical and textual data set, Naïve Bayes and SVM were outperformed, respectively.

References

[1]
Delgado, M. F., Cernadas, E. B., and Sen'en, A. D. Do We Need Hundreds of Classifiers to Solve Real World Classification Problems. Journal of Machine Learning Research 15(1), pp. 3133--3181, 2014.
[2]
Wolpert, D. H. The Lack of a Priori Distinctions between Learning Algorithms. Neural Computation 8(7), pp. 1341--1390, 1996.
[3]
Lichman, M. UCI Machine Learning Repository {http://archive.ics.uci.edu/ml}. Irvine, CA: University of California, School of Information and Computer Science, 2013.
[4]
Ma, C., Yang, W., and Cheng, B. How the Parameters of K-nearest Neighbor Algorithm Impact on the Best Classification Accuracy: In Case of Parkinson Dataset. Journal of Applied Sciences 14, pp. 171--176, 2014.
[5]
Ben-Hur, A., Ong, C.S., Sonnenburg, S., Scholkopf, B., and Ratsch, G. Support Vector Machines and Kernels for Computational Biology. PLoS Comput Biol 4(10):e1000173., 2008.
[6]
Arauzo-Azofra, A., Bertinez J., and Castro, J. A Feature Set Measure based on Relief, Proceedings of the 5th International Conference on Recent Advances in Soft Computing (RASC 2004), 2004.
[7]
Sikonja, M. and Kononenko, I. Theoretical and Empirical Analysis of ReliefF and RReliefF. Machine Learning Journal 52, pp. 23--69, 2003.
[8]
Pagnotta, F. and Amran, H. Using Data Mining to Predict Secondary School Student Alcohol Consumption. Department of Computer Science, University of Camerino, 2016.
[9]
Anguita, D., Ghio, A., Oneto, L., Parra, X., and Reyes-Ortiz, J. L. Human Activity Recognition on Smartphones Using a Multi-class Hardware-Friendly Support Vector Machine. In Proceedings of the 4th International Workshop on Ambient Assisted Leaving and Home Care, pp. 216--223, 2012

Cited By

View all
  • (2022)FPGA-based implementation of classification techniquesIntegration, the VLSI Journal10.1016/j.vlsi.2021.08.00481:C(280-299)Online publication date: 22-Apr-2022
  • (2021)Predictive Model using SVM to Improve the Effectiveness of Direct Marketing2021 2nd International Conference on Artificial Intelligence and Data Sciences (AiDAS)10.1109/AiDAS53897.2021.9574440(1-6)Online publication date: 8-Sep-2021
  • (undefined)Legal Actions in Brazilian Air Transport: A Machine Learning and Multinomial Logistic Regression AnalysisSSRN Electronic Journal10.2139/ssrn.4185448

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICMLC '18: Proceedings of the 2018 10th International Conference on Machine Learning and Computing
February 2018
411 pages
ISBN:9781450363532
DOI:10.1145/3195106
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Southwest Jiaotong University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 February 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Classification
  2. Naïve Bayes
  3. boosted C5.0
  4. k-nearest neighbor
  5. machine learning
  6. relief
  7. support vector machine

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Ministry of Science and Technology, Taiwan

Conference

ICMLC 2018

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)FPGA-based implementation of classification techniquesIntegration, the VLSI Journal10.1016/j.vlsi.2021.08.00481:C(280-299)Online publication date: 22-Apr-2022
  • (2021)Predictive Model using SVM to Improve the Effectiveness of Direct Marketing2021 2nd International Conference on Artificial Intelligence and Data Sciences (AiDAS)10.1109/AiDAS53897.2021.9574440(1-6)Online publication date: 8-Sep-2021
  • (undefined)Legal Actions in Brazilian Air Transport: A Machine Learning and Multinomial Logistic Regression AnalysisSSRN Electronic Journal10.2139/ssrn.4185448

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media