Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients

Yongming Huang, Ao Wu, Guobao Zhang & Yue Li

Part of the book series: Communications in Computer and Information Science (CCIS, volume 484)

Included in the following conference series:

Chinese Conference on Pattern Recognition

Abstract

This paper proposes a wavelet-packet-based method for constructing adaptive filter banks for speech signal processing. On this basis, a set of acoustic features for speech emotion recognition is proposed, namely Coiflet Wavelet Packet Cepstral Coefficients (CWPCC). CWPCC extends the conventional Mel-Frequency Cepstral Coefficients (MFCC) by adapting the filter-bank structure to the decision task. A speech emotion recognition system is constructed with the proposed feature set and a Gaussian mixture model as the classifier. Experimental results on the Berlin emotional speech database show that the Coiflet wavelet packet is better suited to speech emotion recognition than other wavelet packets, and that the proposed features improve emotion recognition performance over the conventional features.
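
To make the idea of wavelet packet cepstral coefficients concrete, the following is a minimal sketch and not the authors' method. The abstract does not give the adaptive filter-bank selection rule, so this sketch assumes a fixed, full wavelet packet tree instead. It also assumes the order-1 (6-tap) Coiflet filter, periodic signal extension, a tree depth of 3, and 6 cepstral coefficients. The function names (coif1_filters, analyze, wp_leaves, wp_cepstrum) are hypothetical. The pipeline mirrors MFCC: sub-band energies, logarithm, then a DCT.

```python
import numpy as np

def coif1_filters():
    """Order-1 Coiflet low-pass filter (closed form) and its QMF high-pass mate."""
    s7 = np.sqrt(7.0)
    h = np.array([1 - s7, 5 + s7, 14 + 2 * s7,
                  14 - 2 * s7, 1 - s7, s7 - 3]) / (16 * np.sqrt(2.0))
    g = h[::-1] * (-1.0) ** np.arange(h.size)  # g[k] = (-1)^k h[L-1-k]
    return h, g

def analyze(x, f):
    """Circular convolution of x with filter f, then downsampling by two."""
    y = sum(f[k] * np.roll(x, k) for k in range(f.size))
    return y[::2]

def wp_leaves(x, depth, h, g, flipped=False):
    """Leaves of a full wavelet packet tree, ordered by increasing frequency.

    High-pass filtering plus downsampling mirrors the spectrum, so the
    children of a 'flipped' node are visited in swapped order.
    """
    if depth == 0:
        return [x]
    a, d = analyze(x, h), analyze(x, g)
    first, second = (d, a) if flipped else (a, d)
    return (wp_leaves(first, depth - 1, h, g, False)
            + wp_leaves(second, depth - 1, h, g, True))

def wp_cepstrum(frame, depth=3, n_ceps=6):
    """Return (cepstral coefficients, sub-band energies) for one frame.

    Assumption: the frame length is a multiple of 2**depth.
    """
    frame = np.asarray(frame, dtype=float)
    if frame.size % (2 ** depth) != 0:
        raise ValueError("frame length must be a multiple of 2**depth")
    h, g = coif1_filters()
    leaves = wp_leaves(frame, depth, h, g)
    energies = np.array([np.sum(band ** 2) for band in leaves])
    log_e = np.log(energies + 1e-12)  # small floor avoids log(0)
    m = log_e.size
    # Unnormalised DCT-II of the log sub-band energies.
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), np.arange(m) + 0.5) / m)
    return basis @ log_e, energies

# Usage: one 256-sample frame of synthetic noise (placeholder for real speech).
rng = np.random.default_rng(0)
ceps, band_energy = wp_cepstrum(rng.standard_normal(256))
```

Because the filter pair is orthonormal and the extension is periodic, the sub-band energies sum to the frame energy, which is a convenient sanity check. A task-adapted tree, as described in the abstract, would replace the full tree in wp_leaves with a pruned one chosen by a discriminative criterion. Frame-level vectors of this kind could then be modelled with one Gaussian mixture model per emotion class.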

Author information

Authors and Affiliations

School of Automation, Southeast University, Nanjing, 210096, China
Yongming Huang, Ao Wu, Guobao Zhang & Yue Li
Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, China
Yongming Huang, Ao Wu, Guobao Zhang & Yue Li

Editor information

Editors and Affiliations

College of Electrical and Information Engineering, Hunan University, 410082, Changsha, P.R. China
Shutao Li
Chinese Academy of Sciences, Beijing, China
Chenglin Liu
College of Electrical and Information Engineering, Hunan University, 410082, Changsha, P.R. China
Yaonan Wang

Cite this paper

Huang, Y., Wu, A., Zhang, G., Li, Y. (2014). Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients. In: Li, S., Liu, C., Wang, Y. (eds) Pattern Recognition. CCPR 2014. Communications in Computer and Information Science, vol 484. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45643-9_46

DOI: https://doi.org/10.1007/978-3-662-45643-9_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45642-2
Online ISBN: 978-3-662-45643-9
eBook Packages: Computer Science, Computer Science (R0)
