Abstract
Parkinson’s Disease (PD) is a major neurodegenerative disorder with steadily increasing incidence rates, demanding overgrowing resources from national health systems and imposing considerable burden on caregivers. Cost-effective and efficient turn-around time monitoring methods are required to facilitate regular, longitudinal, accurate clinical assessment and symptom management. Speech has proven to be an effective neuromotor biomarker, capitalizing on the capabilities of contact-free technology. This study aims to evaluate processing speech from people diagnosed with Parkinson’s Disease using Convolutional Neural Networks (CNN) towards characterizing speech articulation kinematics to explore differences between Healthy Controls (HC) and PD participants with Hypokinetic Dysarthria (HD), using Auditory Receptive Fields (ARFs) in the convolutional layers. The proposed proof of concept is based on a CNN described in detail, using an Extreme Learning Machine (ELM) at the output projection layer. This structure is evaluated on speech recordings from 6 PD and 6 HC participants. The performance of the approach is evaluated in terms of correlation and the log-likelihood ratio on the softmax output, showing the efficiency and retrieving properties of the CNN on speech auditory images, towards providing new insights on the pathophysiology of PD speech.
This research received funding from grants TEC2016-77791-C4-4-R (Ministry of Economic Affairs and Competitiveness of Spain), and Teca-Park-MonParLoc FGCSIC-CENIE 0348-CIE-6-E (InterReg Programme). The authors want to thank the APARKAM association of Parkinson’s Disease patients of Alcorcón and Leganés in Madrid, and the voluntary participants for contributing to this initiative.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Tysnes, B., Storstein, A.: Epidemiology of Parkinson’s disease. J. Neural Transm. 124, 901–905 (2017)
Duffy, J.R.: Motor Speech Disorders: Substrates, Differential Diagnosis, and Management, 3rd edn. Elsevier, Amsterdam (2013)
Parkinson. J.: An essay on the shaking palsy, Sherwood, Neely and Jones, London, 1817. J. Neuropsychiatry Clin. Neurosci. 12(2), 223–236 (2002)
Tsanas, A.: Accurate telemonitoring of Parkinson’s disease symptom severity using nonlinear speech signal processing and statistical machine learning. Ph.D. thesis, University of Oxford, UK, June 2012
Hedge, H., et al.: A survey on machine learning approaches for automatic detection of voice disorders. J. Voice 33(6), 947.E11–E33 (2019)
Cerasa, A.: Machine learning on Parkinson’s disease? Let’s translate into clinical practice. J. Neurosci. Meth. 266, 161–162 (2016)
Hubel, D.H., Wiesel, T.N.: Receptive fields and functional architecture of monkey striate cortex. J. Physiol. 195, 215–243 (1968)
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)
Suga, N.: Basic acoustic patterns and neural mechanisms shared by humans and animals for auditory perception. In: Greenberg, S., et al. (eds.) Speech Processing in the Auditory System, pp. 159–181. Springer, New York (2004)
Greenberg, S., Ainsworth, W.A.: Speech processing in the auditory system: an overview. In: Greenberg, S., et al. (eds.) Speech Processing in the Auditory System, vol. 18, pp. 1–62. Springer, New York (2004). https://doi.org/10.1007/0-387-21575-1_1
Forsyth, D.: Applied Machine Learning, pp. 401–419. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-18114-7
Deller, J.R., et al.: Discrete-Time Processing of Speech Signals. Macmillan, New York (1993)
Alku, P., et al.: OPENGLOT - an open environment for the evaluation of glottal inverse filtering. Speech Commun. 107, 38–47 (2019)
Gómez, P., et al.: Glottal source biometrical signature for voice pathology detection. Speech Commun. 51(9), 759–781 (2009)
Osma, V., et al.: An improved watershed algorithm based on efficient computation of shortest paths. Pattern Recogn. 40(3), 1078–1090 (2007)
Huang, G-B., Siew, C.-K.: Extreme learning machine: RBF network case. In: Proceedings of the ICARCV, pp. 1029–1033 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Gómez-Vilda, P., Gómez-Rodellar, A., Palacios-Alonso, D., Álvarez-Marquina, A., Tsanas, A. (2022). Characterization of Hypokinetic Dysarthria by a CNN Based on Auditory Receptive Fields. In: Ferrández Vicente, J.M., Álvarez-Sánchez, J.R., de la Paz López, F., Adeli, H. (eds) Artificial Intelligence in Neuroscience: Affective Analysis and Health Applications. IWINAC 2022. Lecture Notes in Computer Science, vol 13258. Springer, Cham. https://doi.org/10.1007/978-3-031-06242-1_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-06242-1_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06241-4
Online ISBN: 978-3-031-06242-1
eBook Packages: Computer ScienceComputer Science (R0)