Emotion-State Conversion for Speaker Recognition

Dongdong Li, Yingchun Yang, Zhaohi Wu & Tian Wu

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3784)

Included in the following conference series:

International Conference on Affective Computing and Intelligent Interaction


Abstract

The performance of a speaker recognition system is easily disturbed by changes in the speaker's internal state. This ongoing work proposes a speech emotion-state conversion approach to improve the performance of a speaker identification system on affective speech. The features of neutral speech are modified according to statistical prosodic parameters of emotional utterances, and speaker models are then generated from the converted speech. Experiments conducted on an emotion corpus with 14 emotion states show promising results, with performance improved by 7.2%.
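The abstract does not spell out how the neutral features are modified, so the following is only a minimal sketch of one plausible reading: shifting and scaling a neutral pitch (F0) contour so that its mean and standard deviation match statistics gathered from emotional utterances. The function name `convert_f0`, the choice of F0 as the prosodic feature, the convention that unvoiced frames are stored as 0.0, and all numeric values are assumptions made for illustration, not details taken from the paper.

```python
from statistics import mean, pstdev


def convert_f0(neutral_f0, target_mean, target_std):
    """Shift and scale voiced F0 values (Hz) to match target statistics.

    Hypothetical helper: unvoiced frames are assumed to be marked with 0.0
    and are passed through unchanged.
    """
    voiced = [f for f in neutral_f0 if f > 0.0]
    if not voiced:
        return list(neutral_f0)
    src_mean = mean(voiced)
    src_std = pstdev(voiced)
    # Guard against a flat contour, where the scale factor is undefined.
    scale = target_std / src_std if src_std > 0.0 else 1.0
    return [
        target_mean + (f - src_mean) * scale if f > 0.0 else 0.0
        for f in neutral_f0
    ]


# Illustrative values only: a short neutral contour and made-up target statistics.
neutral = [0.0, 100.0, 120.0, 0.0, 140.0]
converted = convert_f0(neutral, target_mean=180.0, target_std=30.0)
```

In a pipeline of the kind the abstract outlines, a converted contour like this would feed resynthesis or feature extraction before speaker models are trained; duration and energy could be handled with the same mean and variance matching, again as an assumption rather than the authors' stated procedure.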



Author information

Authors and Affiliations

Department of Computer Science and Technology, Zhejiang University, Hangzhou, 310027, P.R. China
Dongdong Li, Yingchun Yang, Zhaohi Wu & Tian Wu


Editor information

Editors and Affiliations

National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences,
Jianhua Tao
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
MIT Media Laboratory, 20 Ames Street, 02139, Cambridge, MA, USA
Rosalind W. Picard


About this paper

Cite this paper

Li, D., Yang, Y., Wu, Z., Wu, T. (2005). Emotion-State Conversion for Speaker Recognition. In: Tao, J., Tan, T., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2005. Lecture Notes in Computer Science, vol 3784. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573548_52


DOI: https://doi.org/10.1007/11573548_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29621-8
Online ISBN: 978-3-540-32273-3
eBook Packages: Computer Science, Computer Science (R0)
