Abstract
Within the framework of the Carcinologic Speech Severity Index (C2SI) INCa Project, we collected a large database of French speech recordings aiming at validating Disorder Severity Indexes. Such a database will be useful for measuring the impact of oral and pharyngeal cavity cancer on speech production. It will permit to assess patients’ quality of life after treatment. The database is composed of audio recordings from 134 sessions and associated metadata. Several intelligibility and comprehensibility levels of speech functions have been evaluated. Acoustics and prosody have been assessed. Perceptual evaluation rates from both naive and expert juries are being produced. Automatic analyzes are being carried out. It is intended to provide speech therapists and physicians with objective tools, which take into account the intelligibility and comprehensibility of patients which received cancer treatment (surgery and/or radiotherapy and/or chemotherapy). The aim of this paper is to justify the necessity of such a corpus and to present its data collection. This C2SI corpus will be available to the scientific community through the Scientific Interest Group Parolothèque.
Similar content being viewed by others
References
Astésano, C., Bard, E. G., & Turk, A. (2007). Structural influences on initial accent placement in french. Language and Speech, 50(3), 423–446.
Aura, K. (2012). Protocole d’évaluation du langage fondé sur le traitement de fonctions prosodiques : étude exploratoire de deux patients atteints de gliomes de bas grade en contexte péri-opératoire. Ph.D. thesis, Université Toulouse 2. http://www.theses.fr/2012TOU20110/document.
Balaguer, M., Boisguerin, A., Galtier, A., Gaillard, N., Puech, M., & Woisard, V. (2019). Assessment of impairment of intelligibility and of speech signal after oral cavity and oropharynx cancer. Annales fran caises d’oto-rhino-laryngologie et de pathologie cervico-faciale, 136(5), 355–359. https://doi.org/10.1016/j.anorl.2019.05.012.
Borggreven, P. A., Aaronson, N. K., Verdonck-de Leeuw, I. M., Muller, M. J., Heiligers, M. L., & de Bree, R., et al. (2007). Quality of life after surgical treatment for oral and oropharyngeal cancer: a prospective longitudinal assessment of patients reconstructed by a microvascular flap. Oral oncology, 43(10), 1034–1042.
Brierley, J. D., Gospodarowicz, M. K., & Wittekind, C. (2016). TNM classification of malignant tumours. Hoboken: Wiley.
Cardol, M., Brandsma, J., De Groot, I., van den BOSOE, G., De Haan, R., & De Jong, B. (1999). Handicap questionnaires: what do they assess? Disability and rehabilitation, 21(3), 97–105.
Dwivedi, R. C., St Rose, Rose, Roe, J. W., Chisholm, E., Elmiyeh, B., & Nutting, C. M., et al. (2011). First report on the reliability and validity of speech handicap index in native english-speaking patients with head and neck cancer. Head & neck, 33(3), 341–348.
Enderby, P.M. (1983). Frenchay dysarthria assessment. Pro-ed
Enderby, P.M., & Palmer, R. (2008) FDA-2: Frenchay Dysarthria Assessment: Examiner’s Manual. Pro-ed
Fichaux-Bourin, P., Woisard, V., Grand, S., Puech, M., & Bodin, S. (2009). Validation of a self assessment for speech disorders (phonation handicap index). Revue de laryngologie-otologie-rhinologie, 130(1), 45–51.
Fredouille, C., Ghio, A., Laaridh, I., Lalain, M., & Woisard, V. (2019). Acoustic-phonetic decoding for speech intelligibility evaluation in the context of head and neck cancers. In: Proceedings of Intl Congress of Phonetic Sciences (ICPhS’19). Melbourne, Australia
Galliano, S., Geoffrois, E., Mostefa, D., Choukri, K., Bonastre, J.F., & Gravier, G. (2005). The ester phase ii evaluation campaign for the rich transcription of french broadcast news. In: Ninth European Conference on Speech Communication and Technology
Ghio, A., Lalain, M., Giusti, L., Pouchoulin, G., Robert, D., Rebourg, M., Fredouille, C., Laaridh, I., & Woisard, V. (2018).Une mesure d’intelligibilité par décodage acoustico-phonétique de pseudo-mots dans le cas de parole atypique. In: XXXII éme Journées d’Etudes sur la Parole 10.21437/JEP.2018-33. https://hal.archives-ouvertes.fr/hal-01770161/file/190996.pdf.
Ghio, A., Pouchoulin, G., Teston, B., Pinto, S., Fredouille, C., & De Looze, C., et al. (2012). How to manage sound, physiological and clinical data of 2500 dysphonic and dysarthric speakers? Speech Communication, 54(5), 664–679.
Laaridh, I., Fredouille, C., Ghio, A., Lalain, M., & Woisard, V. (2018). Automatic Evaluation of Speech Intelligibility Based on i-vectors in the Context of Head and Neck Cancers. In: Interspeech, pp. 2943–2947. ISCA, Hyderabad, India. 10.21437/interspeech.2018-1266. https://hal.archives-ouvertes.fr/hal-01962170.
Laaridh, I., Kheder, W.B., Fredouille, C., & Meunier, C. (2017). Automatic prediction of speech evaluation metrics for dysarthric speech. In: Proc. Interspeech, pp. 1834–1838
Lindblom, B. (1990). Explaining phonetic variation: A sketch of the h&h theory. In: Speech production and speech modelling, vol. 55, pp. 403–439. Springer, Dordrecht https://doi.org/10.1007/978-94-009-2037-8_16
Magne, C., Astésano, C., Lacheret-Dujour, A., Morel, M., Alter, K., & Besson, M. (2005). On-line processing of “pop-out” words in spoken french dialogues. Journal of cognitive neuroscience, 17(5), 740–756.
Middag, C., Clapham, R., Van Son, R., & Martens, J. P. (2014). Robust automatic intelligibility assessment techniques evaluated on speakers treated for head and neck cancer. Computer speech & language, 28(2), 467–482.
Mlynarek, A. M., Rieger, J. M., Harris, J. R., O’Connell, D. A., Al-Qahtani, K. H., & Ansari, K., et al. (2008). Methods of functional outcomes assessment following treatment of oral and oropharyngeal cancer: Review of the literature. Journal of otolaryngology - head and neck surgery, 37(1), 2–10.
Nocaudie, O., Astésano, C., Ghio, A., Lalain, M., & Woisard, V. (2018) .Evaluation de la compréhensibilité et conservation des fonctions prosodiques en perception de la parole de patients post traitement de cancers de la cavité buccale et du pharynx. In: XXXIIe Journées d’Etudes sur la Parole, pp. 196–204
Pisoni, D.B., & Dedina, M.J. (1986). Comprehension of digitally encoded natural speech using a sentence verification task: a first report. Tech. Rep. Progress report 12, Indiana University
Pisoni, D. B., Manous, L. M., & Dedina, M. J. (1987). Comprehension of natural and synthetic speech: Effects of predictability on the verification of sentences controlled for intelligibility. Computer speech & language, 2(3–4), 303–320.
Porcaro, C., Evitts, P., King, N., Hood, C., Campbell, E., & White, L., et al. (2019). Effect of dysphonia and cognitive-perceptual listener strategies on speech intelligibility. Journal of Voice in Press,. https://doi.org/10.1016/j.jvoice.2019.03.013.
Pyo HY, S.H.S. (2007). A study of speech intelligibility affected by voice quality degradation. Communication Sciences & Disorders, 12(2), 256–278 http://www.e-csd.org/journal/view.php?number=326.
Rinkel, R. N., Leeuw, I. M. V., van Reij, E. J., Aaronson, N. K., & Leemans, C. R. (2008). Speech handicap index in patients with oral and pharyngeal cancer: Better understanding of patients’ complaints. Journal for the Sciences and Specialties of the Head and Neck, 30(7), 868–874.
Samuel, A. G. (1981). Phonemic restoration: Insights from a new methodology. Journal of Experimental Psychology: General, 110(4), 474.
Sicard, E., Mauclair, J., & Woisard, V. (2017). Etude de paramètres acoustiques des voix de patients traités pour un cancer orl dans le cadre du projet c2si. In: 7èmes Journées de Phonétique Clinique
Thomas, L., Jones, T. M., Tandon, S., Carding, P., Lowe, D., & Rogers, S. (2009). Speech and voice outcomes in oropharyngeal cancer and evaluation of the university of washington quality of life speech domain. Clinical Otolaryngology, 34(1), 34–42.
Ware, J. E, Jr., & Sherbourne, C. D. (1992). The mos 36-item short-form health survey (sf-36): I. conceptual framework and item selection. Medical Care, 30(6), 473–483.
Warren, R. M., & Warren, R. P. (1970). Auditory illusions and confusions. Scientific American, 223(6), 30–37.
Woisard, V., Espesser, R., Ghio, A., & Duez, D. (2013). De l’intelligibilité à la compréhensibilité de la parole, quelles mesures en pratique clinique? Revue de Laryngologie Otologie Rhinologie, 1(134), 27–33.
Zumbiehl, O. (2010). Evaluation perceptive des dysphonies par la sentence verification task. Master’s thesis, Université Aix-Marseille . Mémoire d’Orthophonie (dir. : Cavé, C. and Ghio, Alain)
Acknowledgements
Grant 2014-135 from Institut National pour le CAncer (INCa) in 2014, “Sciences Humaines et Sociales, épidémiologie et Santé Publique” call. Lead by Pr Virginie Woisard at University Hospital of Toulouse and Grant ANR-18-CE45-0008 from The French National Research Agency in 2018 RUGBI project “Improving the measurement of intelligibility of pathological production disorders impaired speech” lead by Jérôme Farinas at IRIT. We thank the company Voxygen\(^{1}\) for providing us with their speech synthesis platform necessary for the realization of the corpus DAP.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Woisard, V., Astésano, C., Balaguer, M. et al. C2SI corpus: a database of speech disorder productions to assess intelligibility and quality of life in head and neck cancers. Lang Resources & Evaluation 55, 173–190 (2021). https://doi.org/10.1007/s10579-020-09496-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-020-09496-3