Speaker Recognition in Unknown Mismatched Conditions Using Augmented PCA

Ha-Jin Yu

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 3733)

Included in the following conference series:

International Symposium on Computer and Information Sciences

Abstract

Our goal was to build a text-independent speaker recognition system that can be used under any conditions without an additional adaptation process. Unknown mismatched microphones and noise conditions can severely degrade the performance of speaker recognition systems. This paper shows that principal component analysis (PCA) can improve performance under these conditions even when the feature dimension is not reduced. We also propose a PCA process that augments the original feature vectors with class-discriminative information before the PCA transformation and selects the best direction between each pair of highly confusable speakers. In tests, the proposed method reduced recognition errors by 32%.
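The two ideas named in the abstract can be illustrated with a small NumPy sketch. It is not the paper's implementation: the function names, the random toy data, and in particular the augmentation scheme (appending a scaled one-hot speaker indicator to each training vector) are assumptions made only for illustration, since the abstract does not specify how the class-discriminative information is encoded. The per-pair direction selection for confusable speakers is not attempted here.

```python
import numpy as np

def fit_pca(X):
    """Estimate a full-rank PCA rotation; no dimensions are dropped."""
    mean = X.mean(axis=0)
    cov = np.cov(X - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]        # reorder by decreasing variance
    return mean, eigvecs[:, order]

def transform(X, mean, components):
    """Rotate centered features onto the principal axes."""
    return (X - mean) @ components

def augment_with_labels(X, labels, n_classes, scale=1.0):
    """Hypothetical augmentation: append a scaled one-hot class indicator."""
    one_hot = np.eye(n_classes)[labels] * scale
    return np.hstack([X, one_hot])

# Toy data standing in for acoustic feature vectors (assumption).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4)) @ rng.normal(size=(4, 4))  # correlated features
labels = rng.integers(0, 3, size=200)                    # three toy speakers

# Plain PCA with all dimensions kept: the features are decorrelated only.
mean, components = fit_pca(X)
Z = transform(X, mean, components)

# PCA estimated on label-augmented training vectors.
X_aug = augment_with_labels(X, labels, n_classes=3, scale=0.5)
mean_aug, components_aug = fit_pca(X_aug)
Z_aug = transform(X_aug, mean_aug, components_aug)
```

In this sketch `Z` has the same dimension as `X`, so the transform is a rotation rather than a compression. Because speaker labels are unknown at test time, any scheme of this kind needs a separate rule for projecting unlabeled test vectors; the sketch leaves that open.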



Author information

Authors and Affiliations

School of Computer Science, University of Seoul, Dongdaemungu, Seoul, 130-743, South Korea
Ha-Jin Yu


Editor information

Editors and Affiliations

Department of Computer Engineering, Boğaziçi University, 34342, Bebek, Istanbul, Turkey
Pınar Yolum & Can Özturan
Computer Engineering Department, Boğaziçi University, 34342, Bebek, İstanbul, Turkey
Tunga Güngör
Computer Engineering Department, Boğaziçi University, 80815, Bebek, Istanbul, Turkey
Fikret Gürgen

About this paper

Cite this paper

Yu, HJ. (2005). Speaker Recognition in Unknown Mismatched Conditions Using Augmented PCA. In: Yolum, P., Güngör, T., Gürgen, F., Özturan, C. (eds) Computer and Information Sciences - ISCIS 2005. ISCIS 2005. Lecture Notes in Computer Science, vol 3733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569596_69

DOI: https://doi.org/10.1007/11569596_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29414-6
Online ISBN: 978-3-540-32085-2
eBook Packages: Computer Science, Computer Science (R0)
