Abstract
Silent Speech Interfaces (SSIs) are alternative assistive speech technologies that are capable of restoring speech communication for those individuals who have lost their voice due to laryngectomy or diseases affecting the vocal cords. However, many of these SSIs are still deemed as impractical due to a high degree of intrusiveness and discomfort, hence limiting their transition to outside of the laboratory environment. We aim to address the hardware challenges faced in developing a practical SSI for post-laryngectomy speech rehabilitation. A new Permanent Magnet Articulography (PMA) system is presented which fits within the palatal cavity of the user’s mouth, giving unobtrusive appearance and high portability. The prototype is comprised of a miniaturized circuit constructed using commercial off-the-shelf (COTS) components and is implemented in the form of a dental retainer, which is mounted under roof of the user’s mouth and firmly clasps onto the upper teeth. Preliminary evaluation via speech recognition experiments demonstrates that the intraoral prototype achieves reasonable word recognition accuracy and is comparable to the external PMA version. Moreover, the intraoral design is expected to improve on its stability and robustness, with a much improved appearance since it can be completely hidden inside the user’s mouth.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Fagan, M.J., Ell, S.R., Gilbert, J.M., Sarrazin, E., Chapman, P.M.: Development of a (silent) speech recognition system for patients following laryngectomy. Med. Eng. Phys. 30(4), 419–425 (2008)
Braz, D.S.A., Ribas, M.M., Dedivitis, R.A., Nishimoto, I.N., Barros, A.P.B.: Quality of life and depression in patients undergoing total and partial laryngectomy. Clinics 60(2), 135–142 (2005)
Gilbert, J.M., Rybchenko, S.I., Hofe, R., Ell, S.R., Fagan, M.J., Moore, R.K., Green, P.D.: Isolated word recognition of silent speech using magnetic implants and sensors. Med. Eng. Phys. 32(10), 1189–1197 (2010)
Liu, H., Ng, M.: Electrolarynx in voice rehabilitation. Auris Nasus Larynx 30(3), 327–332 (2007)
Wang, J., Samal, A., Green, J.R., Rudzicz, F.: Sentence recognition from articulatory movements for silent speech interfaces. In: Proceedings of 37th ICASSP, Kyoto, Japan, pp. 4985–4988 (2012)
Toda, T., Nakagiri, M., Shikano, K.: Statistical voice conversion techniques for body-conducted unvoiced speech enhancement. IEEE Trans. Audio Speech Lang. Process. 20(9), 2505–2517 (2012)
Doi, H., Nakamura, K., Toda, T., Saruwatari, H., Shikano, K.: Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture model. IEICE Trans. Inf. Syst. 93(9), 2472–2482 (2010)
Denby, B., Schultz, T., Honda, K., Hueber, T., Gilbert, J.M., Brumberg, J.S.: Silent speech interfaces. Speech Commun. 52(4), 270–287 (2010)
Brumberg, J.S., Wright, E.J., Andreasen, D.S., Guenther, F.H., Kennedy, P.R.: Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex. Frontiers Neurosci. 65(5), 1–12 (2011)
Brumberg, J.S., Nieto-Castanon, A., Kennedy, P.R., Guenther, F.H.: Brain-computer interfaces for speech communication. Speech Commun. 52(4), 367–379 (2010)
Porbadnigk, A., Wester, M., Calliess, J., Schultz, T.: EEG-based speech recognition – impact of temporal effects. In: Proceedings of 2nd Biosignals, Porto, Portugal, pp. 376–381 (2009)
Jou, S.C.S., Schultz, T., Walliczek, M., Kraft, F., Waibel, A.: Towards continuous speech recognition using surface electromyography. In: Proceedings of 9th Interspeech, Pittsburgh, USA, pp. 573–576 (2006)
Wand, M., Janke, M., Schultz, T.: Tackling speaking mode varieties in EMG-based speech recognition. IEEE Trans. Biomed. Eng. 61(10), 2515–2526 (2014)
Wand, M., and Schultz, T.: Session-independent EMG-based speech recognition. In: Proceedings of 4th Biosignals, Rome, Italy, pp. 295–300 (2011)
Petajan, E.D.: An architecture for automatic lipreading to enhance speech recognition. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, California, USA, pp. 40–47 (1985)
Hueber, T., Benaroya, E.-L., Chollet, G., Denby, B., Dreyfus, G., Stone, M.: Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips. Speech Commun. 52(4), 288–300 (2010)
Toda, T., Black, A.W., Tokuda, K.: Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model. Speech Commun. 50(3), 215–227 (2008)
Hofe, R., Ell, S.R., Fagan, M.J., Gilbert, J.M., Green, P.D., Moore, R.K., Rybchenko, S.I.: Small-vocabulary speech recognition using silent speech interface based on magnetic sensing. Speech Commun. 55(1), 22–32 (2013)
Hofe, R., Bai, J., Cheah, L.A., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: Performance of the MVOCA silent speech interface across multiple speakers. In: Proceedings of 14th Interspeech, Lyon, France, pp. 1140–1143 (2013)
Cheah, L.A., Bai, J., Gonzalez, J.A., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: A user-centric design of permanent magnetic articulography based assistive speech technology. In: Proceedings of 8th Biosignals, Lisbon, Portugal, pp. 109–116 (2015)
Hirsch, T., Forlizzi, J., Goetz, J., Stoback, J., Kurtx, C.: The ELDer project: social and emotional factors in the design of eldercare technologies. In: Proceedings on the 2000 conference of Universal Usability, Arlington, USA, pp. 72–79 (2000)
Martin, J.L., Murphy, E., Crowe, J.A., Norris, B.J.: Capturing user requirements in medical devices development: the role of ergonomics. Physiol. Meas. 27(8), 49–62 (2006)
Bright, A.K., Conventry, L.: Assistive technology for older adults: psychological and socio-emotional design requirements. In: Proceedings of 6th International Conference on PErvaesive Technologies Related to Assistive Environments, Rhodes, Greece, pp. 1–4 (2013)
Tang, H., Beebe, D.J.: An oral interface for blind navigation. IEEE Trans. Neural Syst. Rehabil. Eng. 14(1), 116–123 (2006)
Lontis, E.R., Lund, M.E., Christensen, H.V., Gaihede, M., Caltenco, H.A., Andreasen-Strujik, L.N.: Clinical evaluation of wireless inductive tongue computer interface for control of computers and assistive devices. In: Proceedings of 32nd IEEE EMBC, Beunos Aires, Argentina, pp. 3365–3368 (2010)
Park, H., Kiani, M., Lee, H.M., Kim, J., Block, J., Gosselin, B., Ghovanloo, M.: A wireless magnetoresistive sensing system for an intraoral tongue-computer interface. IEEE Trans. Biomed. Circuits Syst. 6(6), 571–585 (2012)
Bai, J., Cheah, L.A., Ell, S.R., Gilbert, J.M.: Design of an intraoral device based on permanent magnetic articulography. In: Proceedings of Macau Conference on Engineering, Technology and Applied Science, Macau, China, pp. 1–12 (2015)
Leonard, R.G.: A database for speaker-independent digit recognition. In: Proceedings of 9th ICASSP, San Diego, USA, pp. 328–331 (1984)
Young, S., Everman, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povery, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.4.1). Cambridge University Press, Cambridge (2009)
Rabiner, L.R.: A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989)
Maier-Hein, L., Metze, F., Schultz, T., Waibel, A.: Session independent non-audible speech recognition using surface electromyography. In: Proceedings of Automatic Speech Recognition and Understanding Workshop, Cancun, Mexico, pp. 331–336 (2005)
Gonzalez, J.A., Cheah, L.A., Bai, J., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography. In: Proceedings of 15th Interspeech, Singapore, pp. 1018–1022 (2014)
Gonzalez, J.A., Cheah, L.A., Gilbert, J.M., Bai, J., Ell, S.R., Green, P.D., Moore, R.K.: Direct speech generation for a silent speech interface based on permanent magnet articulography. In: Proceedings of 9th Biosignals, Lisbon, Portugal, pp. 109–116 (2016)
Acknowledgements
The authors would like to thank Helen Dehkordy from Hull and East Yorkshire Hospitals NHS Trust for prototyping the dental retainers. The work is an independent research funded by the National Institute for Health Research (NIHR)’s Invention for Innovation Programme (Grant Reference Number II-LB-0814-20007). The views stated are those of the authors and not necessary reflecting the thoughts of the sponsor.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Cheah, L.A. et al. (2017). Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement. In: Fred, A., Gamboa, H. (eds) Biomedical Engineering Systems and Technologies. BIOSTEC 2016. Communications in Computer and Information Science, vol 690. Springer, Cham. https://doi.org/10.1007/978-3-319-54717-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-54717-6_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54716-9
Online ISBN: 978-3-319-54717-6
eBook Packages: Computer ScienceComputer Science (R0)