Characterization of Hypokinetic Dysarthria by a CNN Based on Auditory Receptive Fields

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13258))

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

1497 Accesses
1 Citations

Abstract

Parkinson’s Disease (PD) is a major neurodegenerative disorder with steadily increasing incidence rates, demanding overgrowing resources from national health systems and imposing considerable burden on caregivers. Cost-effective and efficient turn-around time monitoring methods are required to facilitate regular, longitudinal, accurate clinical assessment and symptom management. Speech has proven to be an effective neuromotor biomarker, capitalizing on the capabilities of contact-free technology. This study aims to evaluate processing speech from people diagnosed with Parkinson’s Disease using Convolutional Neural Networks (CNN) towards characterizing speech articulation kinematics to explore differences between Healthy Controls (HC) and PD participants with Hypokinetic Dysarthria (HD), using Auditory Receptive Fields (ARFs) in the convolutional layers. The proposed proof of concept is based on a CNN described in detail, using an Extreme Learning Machine (ELM) at the output projection layer. This structure is evaluated on speech recordings from 6 PD and 6 HC participants. The performance of the approach is evaluated in terms of correlation and the log-likelihood ratio on the softmax output, showing the efficiency and retrieving properties of the CNN on speech auditory images, towards providing new insights on the pathophysiology of PD speech.

This research received funding from grants TEC2016-77791-C4-4-R (Ministry of Economic Affairs and Competitiveness of Spain), and Teca-Park-MonParLoc FGCSIC-CENIE 0348-CIE-6-E (InterReg Programme). The authors want to thank the APARKAM association of Parkinson’s Disease patients of Alcorcón and Leganés in Madrid, and the voluntary participants for contributing to this initiative.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Parkinson’s Disease Assessment from Speech Data Using Recurrence Plot

Variable STFT Layered CNN Model for Automated Dysarthria Detection and Severity Assessment Using Raw Speech

Article 22 February 2024

A generic optimization and learning framework for Parkinson disease via speech and handwritten records

Article Open access 26 August 2022

References

Tysnes, B., Storstein, A.: Epidemiology of Parkinson’s disease. J. Neural Transm. 124, 901–905 (2017)
Article Google Scholar
Duffy, J.R.: Motor Speech Disorders: Substrates, Differential Diagnosis, and Management, 3rd edn. Elsevier, Amsterdam (2013)
Google Scholar
Parkinson. J.: An essay on the shaking palsy, Sherwood, Neely and Jones, London, 1817. J. Neuropsychiatry Clin. Neurosci. 12(2), 223–236 (2002)
Google Scholar
Tsanas, A.: Accurate telemonitoring of Parkinson’s disease symptom severity using nonlinear speech signal processing and statistical machine learning. Ph.D. thesis, University of Oxford, UK, June 2012
Google Scholar
Hedge, H., et al.: A survey on machine learning approaches for automatic detection of voice disorders. J. Voice 33(6), 947.E11–E33 (2019)
Google Scholar
Cerasa, A.: Machine learning on Parkinson’s disease? Let’s translate into clinical practice. J. Neurosci. Meth. 266, 161–162 (2016)
Article Google Scholar
Hubel, D.H., Wiesel, T.N.: Receptive fields and functional architecture of monkey striate cortex. J. Physiol. 195, 215–243 (1968)
Google Scholar
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)
Article CAS Google Scholar
Suga, N.: Basic acoustic patterns and neural mechanisms shared by humans and animals for auditory perception. In: Greenberg, S., et al. (eds.) Speech Processing in the Auditory System, pp. 159–181. Springer, New York (2004)
Google Scholar
Greenberg, S., Ainsworth, W.A.: Speech processing in the auditory system: an overview. In: Greenberg, S., et al. (eds.) Speech Processing in the Auditory System, vol. 18, pp. 1–62. Springer, New York (2004). https://doi.org/10.1007/0-387-21575-1_1
Forsyth, D.: Applied Machine Learning, pp. 401–419. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-18114-7
Deller, J.R., et al.: Discrete-Time Processing of Speech Signals. Macmillan, New York (1993)
Google Scholar
Alku, P., et al.: OPENGLOT - an open environment for the evaluation of glottal inverse filtering. Speech Commun. 107, 38–47 (2019)
Article Google Scholar
Gómez, P., et al.: Glottal source biometrical signature for voice pathology detection. Speech Commun. 51(9), 759–781 (2009)
Article Google Scholar
Osma, V., et al.: An improved watershed algorithm based on efficient computation of shortest paths. Pattern Recogn. 40(3), 1078–1090 (2007)
Article Google Scholar
Huang, G-B., Siew, C.-K.: Extreme learning machine: RBF network case. In: Proceedings of the ICARCV, pp. 1029–1033 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

NeuSpeLab, CTB, Universidad Politécnica de Madrid, 28220, Pozuelo de Alarcón, Madrid, Spain
Pedro Gómez-Vilda & Agustín Álvarez-Marquina
Faculty of Medicine, Usher Institute, University of Edinburgh, Edinburgh, UK
Andrés Gómez-Rodellar & Athanasios Tsanas
E.T.S. de Ingeniería Informática - Universidad Rey Juan Carlos, Campus de Móstoles, Tulipán, s/n, 28933, Móstoles, Madrid, Spain
Daniel Palacios-Alonso

Authors

Pedro Gómez-Vilda
View author publications
You can also search for this author in PubMed Google Scholar
Andrés Gómez-Rodellar
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Palacios-Alonso
View author publications
You can also search for this author in PubMed Google Scholar
Agustín Álvarez-Marquina
View author publications
You can also search for this author in PubMed Google Scholar
Athanasios Tsanas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pedro Gómez-Vilda .

Editor information

Editors and Affiliations

Universidad Politécnica de Cartagena, Cartagena, Spain
José Manuel Ferrández Vicente
Universidad Nacional de Educación a Distancia, Madrid, Spain
José Ramón Álvarez-Sánchez
Universidad Nacional de Educación a Distancia, Madrid, Spain
Félix de la Paz López
Ohio State University, Columbus, OH, USA
Hojjat Adeli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gómez-Vilda, P., Gómez-Rodellar, A., Palacios-Alonso, D., Álvarez-Marquina, A., Tsanas, A. (2022). Characterization of Hypokinetic Dysarthria by a CNN Based on Auditory Receptive Fields. In: Ferrández Vicente, J.M., Álvarez-Sánchez, J.R., de la Paz López, F., Adeli, H. (eds) Artificial Intelligence in Neuroscience: Affective Analysis and Health Applications. IWINAC 2022. Lecture Notes in Computer Science, vol 13258. Springer, Cham. https://doi.org/10.1007/978-3-031-06242-1_34

Download citation

DOI: https://doi.org/10.1007/978-3-031-06242-1_34
Published: 24 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06241-4
Online ISBN: 978-3-031-06242-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Characterization of Hypokinetic Dysarthria by a CNN Based on Auditory Receptive Fields

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Parkinson’s Disease Assessment from Speech Data Using Recurrence Plot

Variable STFT Layered CNN Model for Automated Dysarthria Detection and Severity Assessment Using Raw Speech

A generic optimization and learning framework for Parkinson disease via speech and handwritten records

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Characterization of Hypokinetic Dysarthria by a CNN Based on Auditory Receptive Fields

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Parkinson’s Disease Assessment from Speech Data Using Recurrence Plot

Variable STFT Layered CNN Model for Automated Dysarthria Detection and Severity Assessment Using Raw Speech

A generic optimization and learning framework for Parkinson disease via speech and handwritten records

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation