Abstract
Speech is a vehicular tool to detect neurological degeneration using certain accepted biomarkers derived from sustained vowels, diadochokinetic exercises, or running speech. Classically, mel-frequency cepstral coefficients (MFCCs) have been used in the organic and neurologic characterization of pathologic phonation using sustained vowels. In the present paper, a comparative study has been carried on comparing Parkinson’s disease detection results using MFCCs and vowel articulation kinematic distributions derived from the first two formants. Binary classification results using support vector machines avail the superior performance of articulation kinematic distributions with respect to MFCCs regarding sensitivity, specificity, and accuracy. The fusion of both types of features could lead to improve general performance in PD detection and monitoring from speech.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Morris, M.E.: Movement disorders in people with Parkinson disease: a model for physical therapy. Phys. Ther. 80(6), 578–597 (2000)
Sapir, S.: Multiple factors are involved in the dysarthria associated with Parkinson’s disease: a review with implications for clinical practice and research. J. Speech Lang. Hear. Res. 57, 1330–1343 (2014)
Skodda, S., Grönheit, W., Mancinelli, N., Schlegel, U.: Progression of voice and speech impairment in the course of Parkinson’s disease: a longitudinal study. Parkinson’s Dis. Article ID 389195 (2013)
Sapir, S., Ramig, L.O., Spielman, J.L., Fox, C.: Formant centralization ratio: a proposal for a new acoustic measure of dysarthric speech. J. Speech Lang. Hear. Res. 53(1), 114–125 (2010)
Hinton, G., Deng, L., Yu, D., Dahl, G.E., Mohamed, A., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, V., Sainath, T.N., Kingsbury, B.: Deep neural networks for acoustic modeling in speech recognition. IEEE Sig. Proc. Mag. 29, 82–97 (2012)
Kinnunen, T., Li, H.: An overview of text-independent speaker recognition: from features to supervectors. Speech Comm. 52, 12–40 (2010)
Fraile, R., Sáez, N., Godino, J.I., Osma, V., Fredouille, C.: Automatic detection of laryngeal pathologies in records of sustained vowels by means of mel-frequency cepstral coefficient parameters and differentiation of patients by sex. Folia Phoniatr. Logop. 61, 146–152 (2009)
Benba, A., Jilbab, A., Hammouch, A.: Detecting patients with Parkinson’s disease using mel frequency cepstral coefficients and support vector machines. Int. J. Electr. Eng. Inf. 7(2), 297–306 (2015)
Murphy, P.J., Akande, O.O.: Noise estimation in voice signals using short-term cepstral analysis. J. Acoust. Soc. Am. 121(3), 1679–1690 (2007)
Gómez, P., et al.: Parkinson disease detection form speech articulation neuromechanics. Front. Neuroinform. 11, 1–17 (2017)
Dromey, C., Jang, G.O., Hollis, K.: Assessing correlations between lingual movements and formants. Speech Comm. 55, 315–328 (2013)
Whitfield, J.A., Goberman, A.M.: Articulatory-acoustic vowel space: application to clear speech in individuals with Parkinson’s disease. J. Comm. Disord. 51, 19–28 (2014)
Deller, J.R., Proakis, J.G., Hansen, J.H.L.: Discrete-Time Processing of Speech Signals. Macmillan, New York (1993)
Mekyska, J., Janusova, E., Gómez, P., Smekal, Z., Rektorova, I., Eliasova, I., Kostalova, M., Mrackova, M., Alonso, J.B., Faúndez, M., López de Ipiña, K.: Robust and complex approach of pathological speech signal analysis. Neurocomput. 167, 94–111 (2015)
Kononenko, I., Šimec, E., Robnik-Šikonja, M.: Overcoming the myopia of inductive learning algorithms with RReliefF. Applied Intelligence 7, 39–55 (1997)
Robnik-Šikonja, M., Kononenko, I.: Theoretical and empirical analysis of ReliefF and RReliefF. Mach. Learn. 53(1–2), 23–69 (2003)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20, 273–297 (1995)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 27:1–27:27 (2011)
Acknowledgements
This work was supported by grant TEC2016-77791-C4-4-R (Plan Nacional de I+D+i, Ministry of Economic Affairs and Competitiveness of Spain), CENIE_TECA-PARK_55_02 INTERREG V-A Spain-Portugal (POCTEP), and grant 16-30805A of the Czech Ministry of Health.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Gómez-Rodellar, A., Álvarez-Marquina, A., Mekyska, J., Palacios-Alonso, D., Meghraoui, D., Gómez-Vilda, P. (2020). Performance of Articulation Kinetic Distributions Vs MFCCs in Parkinson’s Detection from Vowel Utterances. In: Esposito, A., Faundez-Zanuy, M., Morabito, F., Pasero, E. (eds) Neural Approaches to Dynamics of Signal Exchanges. Smart Innovation, Systems and Technologies, vol 151. Springer, Singapore. https://doi.org/10.1007/978-981-13-8950-4_38
Download citation
DOI: https://doi.org/10.1007/978-981-13-8950-4_38
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8949-8
Online ISBN: 978-981-13-8950-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)