Abstract
The problem of recognition of a sequence of objects (e.g., video-based image recognition, phoneme recognition) is explored. The generalization of the fuzzy phonetic decoding method is proposed by assuming the distribution of the classified object to be of exponential type. Its preliminary phase includes association of each model object with the fuzzy set of model classes with grades of membership defined as the confusion probabilities estimated with the Kullback-Leibler divergence between model distributions. At first, each object (e.g., frame) in a classified sequence is put in correspondence with the fuzzy set which grades are defined as the posterior probabilities. Next, this fuzzy set is intersected with the fuzzy set corresponding to the nearest neighbor. Finally, the arithmetic mean of these fuzzy intersections is assigned to the decision for the whole sequence. In this paper we propose not to limit the method’s usage with the Kullback-Leibler discrimination and to estimate the grades of membership of models and query objects based on an arbitrary distance with appropriate scale factor. The experimental results in the problem of isolated Russian vowel phonemes and words recognition for state-of-the-art measures of similarity are presented. It is shown that the correct choice of the scale parameter can significantly increase the recognition accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Savchenko, A.V.: Probabilistic neural network with homogeneity testing in recognition of discrete patterns set. Neural Networks 46, 227–241 (2013)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Elsevier Inc. (2009)
Benesty, J., Sondh, M., Huang, Y. (eds.): Springer Handbook of Speech Recognition. Springer (2008)
Savchenko, L.V., Savchenko, A.V.: Fuzzy Phonetic Decoding Method in a Phoneme Recognition Problem. In: Drugman, T., Dutoit, T. (eds.) NOLISP 2013. LNCS, vol. 7911, pp. 176–183. Springer, Heidelberg (2013)
Wang, H., Wang, Y., Cao, Y.: Video-based face recognition: a survey. World Academy of Science. Engineering and Technologies 60, 293–302 (2009)
Zadeh, L.A.: Fuzzy Sets. Information Control 8, 338–353 (1965)
Sarkar, M.: Fuzzy-rough nearest neighbor algorithms in classification. Fuzzy Sets and Systems 158(19), 2134–2152 (2007)
Kullback, S.: Information Theory and Statistics. Dover Pub. (1997)
Anusuya, M.A., Katti, S.K.: Speech recognition by Machine: A Review. International Journal of Computer Science and Information Security 6(3), 181–205 (2009)
Kipyatkova, I.S., Karpov, A.A.: An Analytical Survey of Large Vocabulary Russian Speech Recognition Systems. SPIIRAS Proceedings 12, 7–20 (2010)
Keener, R.W.: Theoretical Statistics: Topics for a Core Course. Springer, New York (2010)
Reddy, D.R.: Speech recognition by machine: a review. Proceedings of the IEEE 64(4), 501–531 (1976)
Hill, J.E.: The minimum of n independent normal distributions, http://www.untruth.org/~josh/math/normal-min.pdf
Savchenko, A.V.: Adaptive Video Image image Recognition recognition System Using using a Committee committee Machinemachine. Optical Memory and Neural Networks (Information Optics) 21(4), 219–226 (2012)
Specht, D.F.: Probabilistic neural networks. Neural Networks 3(1), 109–118 (1990)
Itakura, F., Saito, S.: An analysis–synthesis telephony based on the maximum likelihood method. In: Proc. of International Congress on Acoustics c-5-5, vol. 5, pp. 17–20 (1968)
Basseville, M.: Distance measures for signal processing and pattern recognition. Signal Processing 18, 349–369 (1989)
Mérialdo, B.: Multilevel Decoding for Very-Large-Size-Dictionary Speech Recognition. IBM Journal of Research and Development 32(2), 227–237 (1988)
Sirigos, J., Fakotakis, N., Kokkinakis, G.: A hybrid syllable recognition system based on vowel spotting. Speech Communication 38, 427–440 (2002)
Savchenko, A.V.: Phonetic words decoding software in the problem of Russian speech recognition. Automation and Remote Control 74(7), 1225–1232 (2013)
Savchenko, A.V.: Phonetic encoding method in the isolated words recognition problem. Journal of Communications Technology and Electronics 59(4), 310–315 (2014)
CMU Sphinx, http://cmusphinx.sourceforge.net/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Savchenko, A.V., Savchenko, L.V. (2014). Classification of a Sequence of Objects with the Fuzzy Decoding Method. In: Cornelis, C., Kryszkiewicz, M., Ślȩzak, D., Ruiz, E.M., Bello, R., Shang, L. (eds) Rough Sets and Current Trends in Computing. RSCTC 2014. Lecture Notes in Computer Science(), vol 8536. Springer, Cham. https://doi.org/10.1007/978-3-319-08644-6_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-08644-6_32
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08643-9
Online ISBN: 978-3-319-08644-6
eBook Packages: Computer ScienceComputer Science (R0)