Abstract
In SVMs, the data points lying in the interactive regions of two classes are very important to form the hyperplane and difficult to be classified. How to select the reduced training set only including the interactive data points is one of the important issues. There are many methods by which the easy misclassified training data are selected to speed up training. The extraction method of the reduced training set is proposed by using the boundary of rough set. Firstly, for two-class problem, the entire training set is partitioned into three regions: the region only containing the positive samples, the region only composed of the negative samples and the boundary region including not only the positive samples but also the negative ones. Secondly, the boundary region is the intersection of two classes and selected to train SVMs. Thirdly, the two-class and multi-class problems are used to verify the feasibility of the proposed SVMs. The experimental results on the classic benchmark data set of machine learning show that the proposed learning machines can downsize the number of training data and hardly influence on their generalization abilities.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Pawlak, Z.: Rough Sets. International Journal of Computer and Information Science 11, 341–356 (1982)
Pawlak, Z.: Rough Sets: Theoretical Aspects of Reasoning About Data. Kluwer Academic Publisher, Boston (1991)
Pawlak, Z.: Drawing Conclusions From Data—the Rough Set Way. International Journal of Intelligent Systems 16(1), 3–11 (2001)
Wei, J.M.: Rough Set Based Approach to Selection of Node. International Journal of Computational Cognition 1(2), 25–40 (2003)
Thangavel, K., Pethalakshmi, A.: Feature Selection for Medical Database Using Rough System. International Journal on Artificial Intelligence and Machine Learning 6(1), 11–17 (2005)
Wang, F.H.: On Acquiring Classification Knowledge From Noisy Data Based on Rough Set. Expert Systems with Applications 29(1), 49–64 (2005)
Yu, C., Wu, M.H., Wu, M.: Combining Rough Set Theory with Neural Network Theory for Pattern Recognition. Neural Processing Letters 19(1), 73–87 (2004)
Vapnik, V.N.: Statistical Learning Theory. John Wiley & Sons, New York (1998)
Wang, L.P. (ed.): Support Vector Machines: Theory and Application. Springer, Heidelberg (2005)
Hsu, C.W., Lin, C.J.: A Comparison of Methods for Multi-class Support Vector Machines. IEEE Transactions on Neural Networks 13(2), 415–425 (2002)
Platt, J.C., Cristianini, N., Shawe-Taylor, J.: Large Margin DAG’s for Multi-class Classification, Advances in Neural Information Processing Systems, vol. 12, pp. 547–553. MIT Press, Cambridge (2000)
Abe, S., Inoue, T.: Fuzzy Support Vector Machines for Multiclass Problems. In: Proceedings of Tenth European Symposium on Artificial Neural Networks Conference, pp. 116–118 (2002)
Huang, H., Liu, Y.: Fuzzy Support Vector Machines for Pattern Recognition and Data Mining. International Journal of Fuzzy Systems 4(3), 826–835 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, H., Xiong, S., Chen, Q. (2009). Extraction of the Reduced Training Set Based on Rough Set in SVMs. In: Yu, W., He, H., Zhang, N. (eds) Advances in Neural Networks – ISNN 2009. ISNN 2009. Lecture Notes in Computer Science, vol 5552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01510-6_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-01510-6_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01509-0
Online ISBN: 978-3-642-01510-6
eBook Packages: Computer ScienceComputer Science (R0)