Abstract
Recognizing human emotion from speech signals, i.e., spoken emotion recognition, is a new and interesting subject in artificial intelligence field. In this paper we present a new method of spoken emotion recognition based on radial basis function neutral networks (RBFNN). The acoustic features related to human emotion expression are extracted from speech signals and then fed into RBFNN for emotion classification. The performance of RBFNN on spoken emotion recognition task is compared with several existing methods including linear discriminant classifiers (LDC), K-nearest-neighbor (KNN), and C4.5 decision tree. The experimental results on emotional Chinese speech corpus demonstrate the promising performance of RBFNN.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Picard, R.: Affective computing. MIT Press, Cambridge (1997)
Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., Taylor, J.G.: Emotion Recognition in Human-Computer Interaction. IEEE Signal Processing Magazine 18(01), 32–80 (2001)
Lee, C.M., Narayanan, S.S.: Toward Detecting Emotions in Spoken Dialogs. IEEE Transactions on Speech and Audio Processing 13(2), 293–303 (2005)
Dellaert, F., Polzin, T., Waibel, A.: Recognizing emotion in speech. In: Proceedings of 4th International Conference on Spoken Language Processing, Philadelphia, PA, USA, pp. 1970–1973 (1996)
Petrushin, V.: Emotion in speech: recognition and application to call centers. In: Proceedings of 1999 Artificial Neural Networks in Engineering, New York, pp. 7–10 (1999)
Yacoub, S., Simske, S., Lin, X., Burns, J.: Recognition of emotions in interactive voice response systems. In: Proceedings of EUROSPEECH 2003, Geneva, Switzerland, pp. 729–732 (2003)
Lee, C.C., Mower, E., Busso, C., Lee, S., Narayanan, S.S.: Emotion recognition using a hierarchical binary decision tree approach. In: Proceedings of INTERSPEECH 2009, Brighton, United Kingdom, pp. 320–323 (2009)
Park, J., Sandberg, I.W.: Universal approximation using radial-basis-function networks. Neural Computation 3(2), 246–257 (1991)
Er, M.J., Wu, S., Lu, J., Toh, H.L.: Face recognition with radial basis function (RBF) neural networks. IEEE Transactions on Neural Networks 13(3), 697–710 (2002)
Zhang, S.: Emotion Recognition in Chinese Natural Speech by Combining Prosody and Voice Quality Features. In: Sun, F., Zhang, J., Tan, Y., Cao, J., Yu, W. (eds.) ISNN 2008, Part II. LNCS, vol. 5264, pp. 457–464. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, S., Zhao, X., Lei, B. (2011). Spoken Emotion Recognition Using Radial Basis Function Neural Network. In: Lin, S., Huang, X. (eds) Advances in Computer Science, Environment, Ecoinformatics, and Education. CSEE 2011. Communications in Computer and Information Science, vol 214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23321-0_68
Download citation
DOI: https://doi.org/10.1007/978-3-642-23321-0_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23320-3
Online ISBN: 978-3-642-23321-0
eBook Packages: Computer ScienceComputer Science (R0)