Nothing Special   »   [go: up one dir, main page]

Skip to main content

Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement

  • Conference paper
  • First Online:
Biomedical Engineering Systems and Technologies (BIOSTEC 2016)

Abstract

Silent Speech Interfaces (SSIs) are alternative assistive speech technologies that are capable of restoring speech communication for those individuals who have lost their voice due to laryngectomy or diseases affecting the vocal cords. However, many of these SSIs are still deemed as impractical due to a high degree of intrusiveness and discomfort, hence limiting their transition to outside of the laboratory environment. We aim to address the hardware challenges faced in developing a practical SSI for post-laryngectomy speech rehabilitation. A new Permanent Magnet Articulography (PMA) system is presented which fits within the palatal cavity of the user’s mouth, giving unobtrusive appearance and high portability. The prototype is comprised of a miniaturized circuit constructed using commercial off-the-shelf (COTS) components and is implemented in the form of a dental retainer, which is mounted under roof of the user’s mouth and firmly clasps onto the upper teeth. Preliminary evaluation via speech recognition experiments demonstrates that the intraoral prototype achieves reasonable word recognition accuracy and is comparable to the external PMA version. Moreover, the intraoral design is expected to improve on its stability and robustness, with a much improved appearance since it can be completely hidden inside the user’s mouth.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Fagan, M.J., Ell, S.R., Gilbert, J.M., Sarrazin, E., Chapman, P.M.: Development of a (silent) speech recognition system for patients following laryngectomy. Med. Eng. Phys. 30(4), 419–425 (2008)

    Article  Google Scholar 

  2. Braz, D.S.A., Ribas, M.M., Dedivitis, R.A., Nishimoto, I.N., Barros, A.P.B.: Quality of life and depression in patients undergoing total and partial laryngectomy. Clinics 60(2), 135–142 (2005)

    Article  Google Scholar 

  3. Gilbert, J.M., Rybchenko, S.I., Hofe, R., Ell, S.R., Fagan, M.J., Moore, R.K., Green, P.D.: Isolated word recognition of silent speech using magnetic implants and sensors. Med. Eng. Phys. 32(10), 1189–1197 (2010)

    Article  Google Scholar 

  4. Liu, H., Ng, M.: Electrolarynx in voice rehabilitation. Auris Nasus Larynx 30(3), 327–332 (2007)

    Article  Google Scholar 

  5. Wang, J., Samal, A., Green, J.R., Rudzicz, F.: Sentence recognition from articulatory movements for silent speech interfaces. In: Proceedings of 37th ICASSP, Kyoto, Japan, pp. 4985–4988 (2012)

    Google Scholar 

  6. Toda, T., Nakagiri, M., Shikano, K.: Statistical voice conversion techniques for body-conducted unvoiced speech enhancement. IEEE Trans. Audio Speech Lang. Process. 20(9), 2505–2517 (2012)

    Article  Google Scholar 

  7. Doi, H., Nakamura, K., Toda, T., Saruwatari, H., Shikano, K.: Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture model. IEICE Trans. Inf. Syst. 93(9), 2472–2482 (2010)

    Article  Google Scholar 

  8. Denby, B., Schultz, T., Honda, K., Hueber, T., Gilbert, J.M., Brumberg, J.S.: Silent speech interfaces. Speech Commun. 52(4), 270–287 (2010)

    Article  Google Scholar 

  9. Brumberg, J.S., Wright, E.J., Andreasen, D.S., Guenther, F.H., Kennedy, P.R.: Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex. Frontiers Neurosci. 65(5), 1–12 (2011)

    Google Scholar 

  10. Brumberg, J.S., Nieto-Castanon, A., Kennedy, P.R., Guenther, F.H.: Brain-computer interfaces for speech communication. Speech Commun. 52(4), 367–379 (2010)

    Article  Google Scholar 

  11. Porbadnigk, A., Wester, M., Calliess, J., Schultz, T.: EEG-based speech recognition – impact of temporal effects. In: Proceedings of 2nd Biosignals, Porto, Portugal, pp. 376–381 (2009)

    Google Scholar 

  12. Jou, S.C.S., Schultz, T., Walliczek, M., Kraft, F., Waibel, A.: Towards continuous speech recognition using surface electromyography. In: Proceedings of 9th Interspeech, Pittsburgh, USA, pp. 573–576 (2006)

    Google Scholar 

  13. Wand, M., Janke, M., Schultz, T.: Tackling speaking mode varieties in EMG-based speech recognition. IEEE Trans. Biomed. Eng. 61(10), 2515–2526 (2014)

    Article  Google Scholar 

  14. Wand, M., and Schultz, T.: Session-independent EMG-based speech recognition. In: Proceedings of 4th Biosignals, Rome, Italy, pp. 295–300 (2011)

    Google Scholar 

  15. Petajan, E.D.: An architecture for automatic lipreading to enhance speech recognition. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, California, USA, pp. 40–47 (1985)

    Google Scholar 

  16. Hueber, T., Benaroya, E.-L., Chollet, G., Denby, B., Dreyfus, G., Stone, M.: Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips. Speech Commun. 52(4), 288–300 (2010)

    Article  Google Scholar 

  17. Toda, T., Black, A.W., Tokuda, K.: Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model. Speech Commun. 50(3), 215–227 (2008)

    Article  Google Scholar 

  18. Hofe, R., Ell, S.R., Fagan, M.J., Gilbert, J.M., Green, P.D., Moore, R.K., Rybchenko, S.I.: Small-vocabulary speech recognition using silent speech interface based on magnetic sensing. Speech Commun. 55(1), 22–32 (2013)

    Article  Google Scholar 

  19. Hofe, R., Bai, J., Cheah, L.A., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: Performance of the MVOCA silent speech interface across multiple speakers. In: Proceedings of 14th Interspeech, Lyon, France, pp. 1140–1143 (2013)

    Google Scholar 

  20. Cheah, L.A., Bai, J., Gonzalez, J.A., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: A user-centric design of permanent magnetic articulography based assistive speech technology. In: Proceedings of 8th Biosignals, Lisbon, Portugal, pp. 109–116 (2015)

    Google Scholar 

  21. Hirsch, T., Forlizzi, J., Goetz, J., Stoback, J., Kurtx, C.: The ELDer project: social and emotional factors in the design of eldercare technologies. In: Proceedings on the 2000 conference of Universal Usability, Arlington, USA, pp. 72–79 (2000)

    Google Scholar 

  22. Martin, J.L., Murphy, E., Crowe, J.A., Norris, B.J.: Capturing user requirements in medical devices development: the role of ergonomics. Physiol. Meas. 27(8), 49–62 (2006)

    Article  Google Scholar 

  23. Bright, A.K., Conventry, L.: Assistive technology for older adults: psychological and socio-emotional design requirements. In: Proceedings of 6th International Conference on PErvaesive Technologies Related to Assistive Environments, Rhodes, Greece, pp. 1–4 (2013)

    Google Scholar 

  24. Tang, H., Beebe, D.J.: An oral interface for blind navigation. IEEE Trans. Neural Syst. Rehabil. Eng. 14(1), 116–123 (2006)

    Article  Google Scholar 

  25. Lontis, E.R., Lund, M.E., Christensen, H.V., Gaihede, M., Caltenco, H.A., Andreasen-Strujik, L.N.: Clinical evaluation of wireless inductive tongue computer interface for control of computers and assistive devices. In: Proceedings of 32nd IEEE EMBC, Beunos Aires, Argentina, pp. 3365–3368 (2010)

    Google Scholar 

  26. Park, H., Kiani, M., Lee, H.M., Kim, J., Block, J., Gosselin, B., Ghovanloo, M.: A wireless magnetoresistive sensing system for an intraoral tongue-computer interface. IEEE Trans. Biomed. Circuits Syst. 6(6), 571–585 (2012)

    Article  Google Scholar 

  27. Bai, J., Cheah, L.A., Ell, S.R., Gilbert, J.M.: Design of an intraoral device based on permanent magnetic articulography. In: Proceedings of Macau Conference on Engineering, Technology and Applied Science, Macau, China, pp. 1–12 (2015)

    Google Scholar 

  28. Leonard, R.G.: A database for speaker-independent digit recognition. In: Proceedings of 9th ICASSP, San Diego, USA, pp. 328–331 (1984)

    Google Scholar 

  29. Young, S., Everman, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povery, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.4.1). Cambridge University Press, Cambridge (2009)

    Google Scholar 

  30. Rabiner, L.R.: A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989)

    Article  Google Scholar 

  31. Maier-Hein, L., Metze, F., Schultz, T., Waibel, A.: Session independent non-audible speech recognition using surface electromyography. In: Proceedings of Automatic Speech Recognition and Understanding Workshop, Cancun, Mexico, pp. 331–336 (2005)

    Google Scholar 

  32. Gonzalez, J.A., Cheah, L.A., Bai, J., Ell, S.R., Gilbert, J.M., Moore, R.K., Green, P.D.: Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography. In: Proceedings of 15th Interspeech, Singapore, pp. 1018–1022 (2014)

    Google Scholar 

  33. Gonzalez, J.A., Cheah, L.A., Gilbert, J.M., Bai, J., Ell, S.R., Green, P.D., Moore, R.K.: Direct speech generation for a silent speech interface based on permanent magnet articulography. In: Proceedings of 9th Biosignals, Lisbon, Portugal, pp. 109–116 (2016)

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank Helen Dehkordy from Hull and East Yorkshire Hospitals NHS Trust for prototyping the dental retainers. The work is an independent research funded by the National Institute for Health Research (NIHR)’s Invention for Innovation Programme (Grant Reference Number II-LB-0814-20007). The views stated are those of the authors and not necessary reflecting the thoughts of the sponsor.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lam A. Cheah .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Cheah, L.A. et al. (2017). Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement. In: Fred, A., Gamboa, H. (eds) Biomedical Engineering Systems and Technologies. BIOSTEC 2016. Communications in Computer and Information Science, vol 690. Springer, Cham. https://doi.org/10.1007/978-3-319-54717-6_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-54717-6_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54716-9

  • Online ISBN: 978-3-319-54717-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics