Abstract
A wavelet packet based adaptive filter-bank construction method is proposed for speech signal processing in this paper. On this basis, a set of acoustic features are proposed for speech emotion recognition, namely Coiflet Wavelet Packet Cepstral Coefficients (CWPCC). CWPCC extends the conventional Mel-Frequency Cepstral Coefficients (MFCC) by adapting the filter-bank structure according to the decision task; Speech emotion recognition system is constructed with the proposed feature set and Gaussian mixture model as classifier. Experimental results on Berlin emotional speech database show that the Coiflet Wavelet Packet is more suitable in speech emotion recognition than other Wavelet Packets and proposed features improve emotion recognition performance over the conventional features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Morrison, D., Wang, R.L., De Silva, L.C.: Ensemble methods for spokenemotion recognition in call-centres. Speech Comm. 49(2), 98–112 (2007)
France, D.J., Shiavi, R.G., Silverman, S., Silverman, M., Wilkes, M.: Acoustical properties of speech as indicators of depression and suicidalrisk. IEEE Transactions on Biomedical Engineering 47(7), 829–837 (2000)
Caponetti, L., Buscicchio, C.A., Castellano, G.: Biologically inspired emo-tion recognition from speech. Eurasip Journal on Advances in Signal Processing
Malta, L., Miyajima, C., Kitaoka, N., Takeda, K.: Multimodal estimationof a driver’s spontaneous irritation. In: Intelligent Vehicles Symposium, pp. 573–577. IEEE (2009)
Stephane, M.: A Wavelet Tour of Signal Processing, 3rd edn. Academic Press, Burlington (2009)
Daubechies, I.: Ten Lectures on Wavelets. Society for Industrial and Applied Mathematics, Philadelphia (1992)
Pavez, E., Silva, J.F.: Analysis and design of wavelet-packet cepstral co-efficients for automaticspeech recognition. Speech Comm. 54(6), 814–835 (2012)
Saito, N., Coifman, R.R.: Local discriminant bases. In: SPIE 2303, Mathematical Imaging:Wavelet Applications in Signal and Image Processing, pp. 2–14 (1994)
Silva, J., Narayanan, S.S.: Discriminative wavelet packet filter bank selection for pattern recognition. IEEE Trans. Signal Process. 57(5), 1796–1810 (2009)
Rabiner, L., Juang, B.-H.: Fundamentals of Speech Recognition. Prentice-Hall, New Jersey (1993)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3), 1–27 (2011)
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., Weiss, B.: A database of german emotional speech. In: Proceeding INTERSPEECH 2005, ISCA, pp. 1517–1520 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, Y., Wu, A., Zhang, G., Li, Y. (2014). Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients. In: Li, S., Liu, C., Wang, Y. (eds) Pattern Recognition. CCPR 2014. Communications in Computer and Information Science, vol 484. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45643-9_46
Download citation
DOI: https://doi.org/10.1007/978-3-662-45643-9_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45642-2
Online ISBN: 978-3-662-45643-9
eBook Packages: Computer ScienceComputer Science (R0)