Abstract
In this paper we study the ICA feature extraction method for Chinese speech signals. The generalized Gaussian model (GGM) is introduced as the p.d.f. estimator in ICA since it can provide a general method for modeling non-Gaussian statistical structure of univariate distributions. It is demonstrated that the ICA features of Chinese speech are localized in both time and frequency domain and the resulting coefficients are statistically independent and sparse. The GGM-based ICA method is also used in extracting the basis vectors directly from the noisy observation, which is an efficient method for noise reduction when priori knowledge of source data is not acquirable. The de-nosing experiments show that the proposed method is more efficient than conventional methods in the environment of additive white Gaussian noise.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Lee, T.-W., Jang, G.-J.: The Statistical Structures of Male and Female Speech Signals. In: Proc. ICASSP, Salt Lack City, Utah (May 2001)
Lee, J.-H., Jung, H.-Y.: Speech Feature Extraction Using Independent Component Analysis. In: Proc. ICASP, Istanbul, Turkey, vol. 3, pp. 1631–1634 (June 2000)
Bell, A.J., Sejnowski, T.J.: Learning the Higher-order structure of a nature sound. Network: Computation in Neural System 7, 261–266 (1996)
Jang, G.-J., Lee, T.-w.: Learning statistically efficient features for speaker recognition. Neurocomputing 49, 329–348 (2002)
Lee, T.-W., Lewicki, M.S.: The Generalized Gaussian Mixture Model Using ICA. In: International workshop on Independent Component Analysis (ICA 2000), Helsinki, Finland, pp. 239–244 (June 2000)
Hyvärinen, A.: Sparse code shrinkage: Denoising of nongaussian data by maximum likelihood estimation. Technical Report A51, Helsinki University of Technology, Laboratory of Computer and Information Science (1998)
Hyvärinen, A., Hoyer, P., Oja, E.: Sparse code shrinkage: Denoising by nonlinear maximum likelihood estimation. In: Advances in Neural Information Processing System 11, NIPS 1998 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bin, Y., Wei, K. (2005). Efficient Feature Extraction and De-noising Method for Chinese Speech Signals Using GGM-Based ICA. In: Sanfeliu, A., Cortés, M.L. (eds) Progress in Pattern Recognition, Image Analysis and Applications. CIARP 2005. Lecture Notes in Computer Science, vol 3773. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11578079_95
Download citation
DOI: https://doi.org/10.1007/11578079_95
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29850-2
Online ISBN: 978-3-540-32242-9
eBook Packages: Computer ScienceComputer Science (R0)