Abstract
Bio-data, which are obtained from human individuals, have been one of main applications of pattern classification these days. A critical property of bio-data classification is the small number of data in each class due to high cost of obtaining data from each individuals. Since most classification methods are based on the distribution of data in each class, the lack of data can be a main cause of low classification performance of conventional classifiers. To solve this problem, we propose a modified additive factor model for bio-data which has two factors; the individual factor and the environment factor. Under the proposed model, we estimate the distribution of environment factor which gives robust information even in case of small data set. We then define new similarity measures using the information. The similarity measure is applied to nearest neighbor method for classification. We also use the support vector machines (SVM) to find a sophisticated similarity measure. Through computational experiments, we confirm that the proposed model and similarity measure is appropriate enough to show better classification performance compared to conventional similarity measure as well as conventional SVM classifier.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bartlett, M., Sejnowsky, T.: Viewpoint Invariant Face Recognition using Independent Component Analysis and Attractor Networks. Naural Information Processing Systems-Natural and Synthetic 9, 817–823 (1997)
Belhumeur, P., Hespanha, J., Kriegman, D.: Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection. IEEE trans. on Pattern Recogntion and Machine Intelligence 19(7), 711–720 (1997)
Bell, A., Sejnowski, T.: An information maximization approach to blind separation and bllind deconvolution. Neural Compuation 7(6), 1129–1159 (1995)
Bishop, C.: Neural Networks for Pattern Recognition. Oxford University Press, Oxford (1995)
Campbell, W.: A Sequence Kernel and its Applications to Speaker Recognition. Advances in Neural Information Processing Systems (in press, 2001)
Cho, M., Park, H.: A Robist SVM Design for Multi-class Classification. In: Zhang, S., Jarvis, R. (eds.) AI 2005. LNCS (LNAI), vol. 3809, pp. 1335–1338. Springer, Heidelberg (2005)
Fukunaga, K.: Introduction to Statistical Pattern Recogntion, 2nd edn. Academic Press, London (1990)
Daugman, J.G.: High Confidence Visual Recognition of Persons by a Test of Statistical Independence. IEEE Trans. on Pattern Analysis and Machine Intelligence 15(11), 1148–1161 (1993)
Lattin, J.: Analyzing Multivariate data, Thomson Learning, Inc. (2003)
Lee, O., Park, H., Choi, S.: PCA vs. ICA for Face Recogntion. In: The 2000 International Technical Conference on Circuits/Systems, Computers, and Communications, pp. 873–876 (2000)
Tenenbaum, J.B., Freeman, W.T.: Separating Style and content with bilinear models. Neural Computaion 12, 1247–1283 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Park, H., Cho, M. (2006). Classification of Bio-data with Small Data Set Using Additive Factor Model and SVM. In: Sattar, A., Kang, Bh. (eds) AI 2006: Advances in Artificial Intelligence. AI 2006. Lecture Notes in Computer Science(), vol 4304. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11941439_81
Download citation
DOI: https://doi.org/10.1007/11941439_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49787-5
Online ISBN: 978-3-540-49788-2
eBook Packages: Computer ScienceComputer Science (R0)