Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement

Lam A. Cheah¹²,
James M. Gilbert¹²,
Jose A. Gonzalez¹³,
Jie Bai¹²,
Stephen R. Ell¹⁴,
Phil D. Green¹³ &
…
Roger K. Moore¹³

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 690))

Included in the following conference series:

International Joint Conference on Biomedical Engineering Systems and Technologies

634 Accesses
3 Citations

Abstract

Silent Speech Interfaces (SSIs) are alternative assistive speech technologies that are capable of restoring speech communication for those individuals who have lost their voice due to laryngectomy or diseases affecting the vocal cords. However, many of these SSIs are still deemed as impractical due to a high degree of intrusiveness and discomfort, hence limiting their transition to outside of the laboratory environment. We aim to address the hardware challenges faced in developing a practical SSI for post-laryngectomy speech rehabilitation. A new Permanent Magnet Articulography (PMA) system is presented which fits within the palatal cavity of the user’s mouth, giving unobtrusive appearance and high portability. The prototype is comprised of a miniaturized circuit constructed using commercial off-the-shelf (COTS) components and is implemented in the form of a dental retainer, which is mounted under roof of the user’s mouth and firmly clasps onto the upper teeth. Preliminary evaluation via speech recognition experiments demonstrates that the intraoral prototype achieves reasonable word recognition accuracy and is comparable to the external PMA version. Moreover, the intraoral design is expected to improve on its stability and robustness, with a much improved appearance since it can be completely hidden inside the user’s mouth.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Review on Silent Speech Recognition Using EMG Sensors and Voices After Laryngectomy

Speech perception and hearing effort using a new active middle ear implant audio processor

Article Open access 07 December 2021

New technology can benefit established middle ear implant users: Samba 2 vs previous models of audio processors for Vibrant Soundbridge

Article Open access 28 November 2022

References

Fagan, M.J., Ell, S.R., Gilbert, J.M., Sarrazin, E., Chapman, P.M.: Development of a (silent) speech recognition system for patients following laryngectomy. Med. Eng. Phys. 30(4), 419–425 (2008)
Article Google Scholar
Braz, D.S.A., Ribas, M.M., Dedivitis, R.A., Nishimoto, I.N., Barros, A.P.B.: Quality of life and depression in patients undergoing total and partial laryngectomy. Clinics 60(2), 135–142 (2005)
Article Google Scholar
Gilbert, J.M., Rybchenko, S.I., Hofe, R., Ell, S.R., Fagan, M.J., Moore, R.K., Green, P.D.: Isolated word recognition of silent speech using magnetic implants and sensors. Med. Eng. Phys. 32(10), 1189–1197 (2010)
Article Google Scholar
Liu, H., Ng, M.: Electrolarynx in voice rehabilitation. Auris Nasus Larynx 30(3), 327–332 (2007)
Article Google Scholar
Wang, J., Samal, A., Green, J.R., Rudzicz, F.: Sentence recognition from articulatory movements for silent speech interfaces. In: Proceedings of 37th ICASSP, Kyoto, Japan, pp. 4985–4988 (2012)
Google Scholar
Toda, T., Nakagiri, M., Shikano, K.: Statistical voice conversion techniques for body-conducted unvoiced speech enhancement. IEEE Trans. Audio Speech Lang. Process. 20(9), 2505–2517 (2012)
Article Google Scholar
Doi, H., Nakamura, K., Toda, T., Saruwatari, H., Shikano, K.: Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture model. IEICE Trans. Inf. Syst. 93(9), 2472–2482 (2010)
Article Google Scholar
Denby, B., Schultz, T., Honda, K., Hueber, T., Gilbert, J.M., Brumberg, J.S.: Silent speech interfaces. Speech Commun. 52(4), 270–287 (2010)
Article Google Scholar
Brumberg, J.S., Wright, E.J., Andreasen, D.S., Guenther, F.H., Kennedy, P.R.: Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex. Frontiers Neurosci. 65(5), 1–12 (2011)
Google Scholar
Brumberg, J.S., Nieto-Castanon, A., Kennedy, P.R., Guenther, F.H.: Brain-computer interfaces for speech communication. Speech Commun. 52(4), 367–379 (2010)
Article Google Scholar
Porbadnigk, A., Wester, M., Calliess, J., Schultz, T.: EEG-based speech recognition – impact of temporal effects. In: Proceedings of 2nd Biosignals, Porto, Portugal, pp. 376–381 (2009)
Google Scholar
Jou, S.C.S., Schultz, T., Walliczek, M., Kraft, F., Waibel, A.: Towards continuous speech recognition using surface electromyography. In: Proceedings of 9th Interspeech, Pittsburgh, USA, pp. 573–576 (2006)
Google Scholar
Wand, M., Janke, M., Schultz, T.: Tackling speaking mode varieties in EMG-based speech recognition. IEEE Trans. Biomed. Eng. 61(10), 2515–2526 (2014)
Article Google Scholar
Wand, M., and Schultz, T.: Session-independent EMG-based speech recognition. In: Proceedings of 4th Biosignals, Rome, Italy, pp. 295–300 (2011)
Google Scholar
Petajan, E.D.: An architecture for automatic lipreading to enhance speech recognition. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, California, USA, pp. 40–47 (1985)
Google Scholar
Hueber, T., Benaroya, E.-L., Chollet, G., Denby, B., Dreyfus, G., Stone, M.: Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips. Speech Commun. 52(4), 288–300 (2010)
Article Google Scholar
Toda, T., Black, A.W., Tokuda, K.: Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model. Speech Commun. 50(3), 215–227 (2008)
Article Google Scholar
Hofe, R., Ell, S.R., Fagan, M.J., Gilbert, J.M., Green, P.D., Moore, R.K., Rybchenko, S.I.: Small-vocabulary speech recognition using silent speech interface based on magnetic sensing. Speech Commun. 55(1), 22–32 (2013)
Article Google Scholar
Hofe, R., Bai, J., Cheah, L.A., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: Performance of the MVOCA silent speech interface across multiple speakers. In: Proceedings of 14th Interspeech, Lyon, France, pp. 1140–1143 (2013)
Google Scholar
Cheah, L.A., Bai, J., Gonzalez, J.A., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: A user-centric design of permanent magnetic articulography based assistive speech technology. In: Proceedings of 8th Biosignals, Lisbon, Portugal, pp. 109–116 (2015)
Google Scholar
Hirsch, T., Forlizzi, J., Goetz, J., Stoback, J., Kurtx, C.: The ELDer project: social and emotional factors in the design of eldercare technologies. In: Proceedings on the 2000 conference of Universal Usability, Arlington, USA, pp. 72–79 (2000)
Google Scholar
Martin, J.L., Murphy, E., Crowe, J.A., Norris, B.J.: Capturing user requirements in medical devices development: the role of ergonomics. Physiol. Meas. 27(8), 49–62 (2006)
Article Google Scholar
Bright, A.K., Conventry, L.: Assistive technology for older adults: psychological and socio-emotional design requirements. In: Proceedings of 6th International Conference on PErvaesive Technologies Related to Assistive Environments, Rhodes, Greece, pp. 1–4 (2013)
Google Scholar
Tang, H., Beebe, D.J.: An oral interface for blind navigation. IEEE Trans. Neural Syst. Rehabil. Eng. 14(1), 116–123 (2006)
Article Google Scholar
Lontis, E.R., Lund, M.E., Christensen, H.V., Gaihede, M., Caltenco, H.A., Andreasen-Strujik, L.N.: Clinical evaluation of wireless inductive tongue computer interface for control of computers and assistive devices. In: Proceedings of 32nd IEEE EMBC, Beunos Aires, Argentina, pp. 3365–3368 (2010)
Google Scholar
Park, H., Kiani, M., Lee, H.M., Kim, J., Block, J., Gosselin, B., Ghovanloo, M.: A wireless magnetoresistive sensing system for an intraoral tongue-computer interface. IEEE Trans. Biomed. Circuits Syst. 6(6), 571–585 (2012)
Article Google Scholar
Bai, J., Cheah, L.A., Ell, S.R., Gilbert, J.M.: Design of an intraoral device based on permanent magnetic articulography. In: Proceedings of Macau Conference on Engineering, Technology and Applied Science, Macau, China, pp. 1–12 (2015)
Google Scholar
Leonard, R.G.: A database for speaker-independent digit recognition. In: Proceedings of 9th ICASSP, San Diego, USA, pp. 328–331 (1984)
Google Scholar
Young, S., Everman, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povery, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.4.1). Cambridge University Press, Cambridge (2009)
Google Scholar
Rabiner, L.R.: A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989)
Article Google Scholar
Maier-Hein, L., Metze, F., Schultz, T., Waibel, A.: Session independent non-audible speech recognition using surface electromyography. In: Proceedings of Automatic Speech Recognition and Understanding Workshop, Cancun, Mexico, pp. 331–336 (2005)
Google Scholar
Gonzalez, J.A., Cheah, L.A., Bai, J., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography. In: Proceedings of 15th Interspeech, Singapore, pp. 1018–1022 (2014)
Google Scholar
Gonzalez, J.A., Cheah, L.A., Gilbert, J.M., Bai, J., Ell, S.R., Green, P.D., Moore, R.K.: Direct speech generation for a silent speech interface based on permanent magnet articulography. In: Proceedings of 9th Biosignals, Lisbon, Portugal, pp. 109–116 (2016)
Google Scholar

Download references

Acknowledgements

The authors would like to thank Helen Dehkordy from Hull and East Yorkshire Hospitals NHS Trust for prototyping the dental retainers. The work is an independent research funded by the National Institute for Health Research (NIHR)’s Invention for Innovation Programme (Grant Reference Number II-LB-0814-20007). The views stated are those of the authors and not necessary reflecting the thoughts of the sponsor.

Author information

Authors and Affiliations

School of Engineering, University of Hull, Kingston upon Hull, UK
Lam A. Cheah, James M. Gilbert & Jie Bai
Department of Computer Science, University of Sheffield, Sheffield, UK
Jose A. Gonzalez, Phil D. Green & Roger K. Moore
Hull and East Yorkshire Hospitals Trust, Castle Hill Hospital, Cottingham, UK
Stephen R. Ell

Authors

Lam A. Cheah
View author publications
You can also search for this author in PubMed Google Scholar
James M. Gilbert
View author publications
You can also search for this author in PubMed Google Scholar
Jose A. Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Jie Bai
View author publications
You can also search for this author in PubMed Google Scholar
Stephen R. Ell
View author publications
You can also search for this author in PubMed Google Scholar
Phil D. Green
View author publications
You can also search for this author in PubMed Google Scholar
Roger K. Moore
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lam A. Cheah .

Editor information

Editors and Affiliations

Instituto de Telecomunicações, Lisbon, Portugal
Ana Fred
New University of Lisbon, Lisbon, Portugal
Hugo Gamboa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cheah, L.A. et al. (2017). Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement. In: Fred, A., Gamboa, H. (eds) Biomedical Engineering Systems and Technologies. BIOSTEC 2016. Communications in Computer and Information Science, vol 690. Springer, Cham. https://doi.org/10.1007/978-3-319-54717-6_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-54717-6_2
Published: 04 March 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54716-9
Online ISBN: 978-3-319-54717-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Review on Silent Speech Recognition Using EMG Sensors and Voices After Laryngectomy

Speech perception and hearing effort using a new active middle ear implant audio processor

New technology can benefit established middle ear implant users: Samba 2 vs previous models of audio processors for Vibrant Soundbridge

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Review on Silent Speech Recognition Using EMG Sensors and Voices After Laryngectomy

Speech perception and hearing effort using a new active middle ear implant audio processor

New technology can benefit established middle ear implant users: Samba 2 vs previous models of audio processors for Vibrant Soundbridge

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation