Abstract
In this paper, we improve the conventional decision tree learning algorithm using rough set theory. First, our approach gets the upper approximate for each class. Next, it generates the decision tree from each upper approximate. Each decision tree shows whether the data item is in this class or not. Our approach classifies the unlabeled data item using every decision trees and integrates the outputs of these decision trees to decide the class of unlabeled data item. We evaluated our method using mechanically-prepared datasets whose the proportion of overlap of classes in datasets differs. Experimental result shows our approach is better than the conventional approach when the dataset has the high proportion of overlap of classes and few data items which have the same set of attributes. We guess it is possible to get better classification rules from uncertain and dispersed datasets using our approach. However, we don’t use enough datasets to show this advantage in this experiment. In order to evaluate and enhance our approach, we analyze various and big datasets by our approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees (Wadsworth Statistics/ Probability). CRC-Press, Boca Raton (1984)
Corinna, C., Vladimir, N.V.: Support-vector networks. Mach. Learn. 20(3), 237–297 (1995). Kluwer Academic Publishers, Hingham
Rosenblatt, F.: Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Spartan Books, Michigan (1962)
Weka 3 - Data Mining with Open Source Machine Learning Software in Java. http://www.cs.waikato.ac.nz/ml/weka/. Accessed 12 March 2015
The R Project for Statistical Computing. http://www.r-project.org/. Accessed 12 March 2015
IBM SPSS software. http://www-01.ibm.com/software/analytics/spss/. Accessed 12 March 2015
Bach, M.P., Ćosić, D.: Data mining usage in health care management: literature survey and decision tree application. Medicinski Glasnik 5(1), 57–64 (2008). Bosnia and Herzegovina
MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297. University of California Press, Berkley (1967)
Kurematu, M., Hakura, J., Fujita, H.: An extraction of emotion in human speech using speech synthesize and each classifier for each emotion. WSEAS Trans. Inf. Sci. Appl. 3(5), 246–251 (2008). World Scientific and Engineering Academy and Society, Greece
Pawlak, Z.: Rough sets. Int. J. Parallel Program. 11(5), 341–356 (1982). Springer, Heidelberg
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001). Academic Publishers, Hingham
Terabe, M., Katai, O., Sawaragi, T., Washio, T., Motoda, H.: Attribute generation based on association rules. Knowl. Inf. Syst. 4(3), 329–349 (2002). Springer, Heidelberg
Liang, Y.-H.: Combining the K-means and decision tree methods to promote customer value for the automotive maintenance industry. In: IEEE International Conference on Industrial Engineering and Engineering Management 2009, pp. 1337–1341. IEEE, Hong-Kong (2009)
Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986). Springer, Heidelberg
Gaddam, S.R., Phoha, V.V., Balagani, K.S.: K-Means + ID3: a novel method for supervised anomaly detection by cascading k-means clustering and ID3 decision tree learning methods. IEEE Trans. Knowl. Data Eng. 9(3), 345–354 (2007). IEEE, Washington
Huang, L., Huang, M., Guo, B., Zhang, Z.: A new method for constructing decision tree based on rough set theory. In: Granular Computing, GRC 2007, p. 241. IEEE, Washington (2007)
Son, C.-S., Kim, Y.-N., Kim, H.-S., Park, H.-S., Kim, M.-S.: Decision-making model for early diagnosis of congestive heart failure using rough set and decision tree approaches. J. Biomed. Inf. 45(5), 999–1008 (2012). Elsevier, NewYork
Wei, J.-M., Wang, S.-Q., Wang, M.-Y., You, J.-P., Liu, D.-Y.: Rough set based approach for inducing decision trees. Knowl.-Based Syst. 20(8), 695–702 (2007). Elsevier, NewYork
UCI Machine Learning Repository. http://archive.ics.uci.edu/ml/datasets/. Accessed 29 March 2015
Acknowledgment
We would like to thank Mr. Yasutomo FUKADA who graduated Iwate Prefectural University in March, 2015 and Ms. Saori AMANUMA who has completed a master’s course of the graduate school of Iwate Prefectural University.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Kurematsu, M., Hakura, J., Fujita, H. (2015). A Framework for a Decision Tree Learning Algorithm with Rough Set Theory. In: Fujita, H., Guizzi, G. (eds) Intelligent Software Methodologies, Tools and Techniques. SoMeT 2015. Communications in Computer and Information Science, vol 532. Springer, Cham. https://doi.org/10.1007/978-3-319-22689-7_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-22689-7_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22688-0
Online ISBN: 978-3-319-22689-7
eBook Packages: Computer ScienceComputer Science (R0)