Abstract
Our goal was to build a text-independent speaker recognition system that could be used under any conditions without any additional adaptation process. Unknown mismatched microphones and noise conditions can severely degrade the performance of speaker recognition systems. This paper shows that principal component analysis (PCA) can increase performance under these conditions without reducing dimension. We also propose a PCA process that augments class discriminative information sent to original feature vectors before PCA transformation and selects the best direction between each pair of highly confusable speakers. In tests, the proposed method reduced errors in recognition by 32%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Campbell, J.P.: Speaker Recognition: A Tutorial. Proceedings of the IEEE 85(9), 1437–1462 (1997)
Acero, A.: Acoustical and Environmental Robustness in Automatic Speech Recognition. Kluwer Academic Publishers, Boston (1993)
Huang, X., Acero, A., Hon, H.: Spoken Language Processing, A Guide to Theory, Algorithm, and System Development. Prentice-Hall, Englewood Cliffs (2001)
Tsai, S.-N., Lee, L.-S.: Improved Robust Features for Speech Recognition by Integrating Time-Frequency Principal Components (TFPC) and Histogram Equalization (HEQ). In: IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 297–302 (2003)
Wanfeng, Z., Yingchun, Y., Zhaohui, W., Lifeng, S.: Experimental Evaluation of a New Speaker Identification Framework using PCA. In: IEEE International Conference on Systems, Man and Cybernetics, vol. 5, pp. 4147–4152 (2003)
Ding, P., Liming, Z.: Speaker Recognition using Principal Component Analysis. In: Proceedings of ICONIP 2001, 8th International Conference on Neural Information Processing, Shanghai (2001)
Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech Audio Processing 3(1), 72–83 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yu, HJ. (2005). Speaker Recognition in Unknown Mismatched Conditions Using Augmented PCA. In: Yolum, p., Güngör, T., Gürgen, F., Özturan, C. (eds) Computer and Information Sciences - ISCIS 2005. ISCIS 2005. Lecture Notes in Computer Science, vol 3733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569596_69
Download citation
DOI: https://doi.org/10.1007/11569596_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29414-6
Online ISBN: 978-3-540-32085-2
eBook Packages: Computer ScienceComputer Science (R0)