Abstract
In the literature several silent speech interfaces based on Surface Electromyography (EMG) can be found. However, it is yet unclear if we are able to sense muscles activity related to nasal port opening/closing. Detecting the nasality phenomena, would increase the performance of languages with strong nasal characteristics such as European Portuguese. In this paper we explore the use of surface EMG electrodes, a non-invasive method, positioned in the face and neck regions to explore the existence of useful information about the velum movement. For an accurate interpretation and validation of the proposed method, we use velum movement information extracted from Real-Time Magnetic Resonance Imaging (RT-MRI) data. Overall, results of this study show that differences can be found in the EMG signals for the case of nasal vowels, by sensors positioned below the ear between the mastoid process and the mandible in the upper neck region.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Huang, X., Acero, A., Hon, H.: Spoken Language Processing. Prentice Hall PTR, Upper Saddle River (2001)
Flynn, R., Jones, E.: Combined speech enhancement and auditory modelling for robust distributed speech recognition. Speech Commun. 50(10), 797–809 (2008)
Stark, A., Paliwal, K.: MMSE estimation of log-filterbank energies for robust speech recognition. Speech Commun. 53(3), 403–416 (2011)
Yang, C., Brown, G., Lu, L., Yamagishi, J., King, S.: Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation. In: 2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 220–223 (2012)
Denby, B., Schultz, T., Honda, K., Hueber, T., Gilbert, J.M., Brumberg, J.S.: Silent speech interfaces. Speech Commun. 52(4), 270–287 (2009)
Schultz, T., Wand, M.: Modeling coarticulation in large vocabulary EMG-based speech recognition. Speech Commun. 52(4), 341–353 (2010)
Jorgensen, C., Lee, D., Agabon, S.: Sub auditory speech recognition based on EMG signals. In: Proceedings of the International Joint Conference on Neural Networks (IJCNN), pp. 3128–3133 (2003)
Teixeira, J.S.: Síntese Articulatória das Vogais Nasais do Português Europeu [Articulatory Synthesis of Nasal Vowels for European Portuguese]. Ph.D. Thesis, Universidade de Aveiro (2000)
Freitas, J., Teixeira, A. Dias, M.S.: Towards a silent speech interface for portuguese: surface electromyography and the nasality challenge. In: International Conference on Bio-Inspired Systems and Signal Processing, Vilamoura, Algarve, Portugal (2012)
Seikel, J.A., King, D.W., Drumright, D.G.: Anatomy and Physiology for Speech, Language, and Hearing. Delmar Learning, Clifton Park (2010)
Hardcastle, W.J.: Physiology of Speech Production: An Introduction for Speech Scientists. Academic Press, London (1976)
Fritzell, B.: The velopharyngeal muscles in speech: an electromyographic and cineradiographic study. Acta Otolaryngolica 250, 1–81 (1969)
Kuehn, D.P., Folkins, J.W., Linville, R.N.: An electromyographic study of the musculus uvulae. Cleft Palate J. 25(4), 348–355 (1988)
Rossato, S., Teixeira, A., Ferreira, L.: Les Nasales du Portugais et du Français: une étude comparative sur les données EMMA. In: XXVI Journées d’Études de la Parole, Dinard, France (2006)
Lacerda, A., Head, B.F.: Análise de sons nasais e sons nasalizados do Português. Revista do Laboratório de Fonética Experimental (de Coimbra), vol. 6, pp. 5–70 (1996)
Trigo, R.L.: The inherent structure of nasal segments. In: Huffman, M.K., Krakow, R.A. (eds.) Nasals, Nasalization, and the Velum, Phonetics and Phonology, vol. 5, pp. 369–400. Academic Press Inc., London (1993)
Teixeira, A., Moutinho, L.C., Coimbra, R.L.: Production, acoustic and perceptual studies on European portuguese nasal vowels height. In: Internat. Congress Phonetic Sciences (ICPhS), pp. 3033–3036 (2003)
Martins, P., Carbone, I.C., Pinto, A., Silva, A., Teixeira, A.: European Portuguese MRI based speech production studies. Speech Commun. 50(11/12), 925–952 (2008). ISSN 0167–6393
Bell-Berti, F.: An electromyographic study of velopharyngeal function. Speech J. Speech Hearing Res. 19, 225–240 (1976)
Kuehn, D.P., Folkins, J.W., Cutting, C.B.: Relationships between muscle activity and velar position. Cleft Palate J. 19(1), 25–35 (1982)
Lubker, J.F.: An electromyographic-cinefluorographic investigation of velar function during normal speech production. Cleft Palate J. 5(1), 17 (1968)
McGill, S., Juker, D., Kropf, P.: Appropriately placed surface EMG electrodes reflect deep muscle activity (psoas, quadratus lumborum, abdominal wall) in the lumbar spine. J. Biomech. 29(11), 1503–1507 (1996)
Chan, A.D.C., Englehart, K., Hudgins, B., Lovely, D.F.: Hidden Markov model classification of myoelectric signals in speech. In: Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, vol. 2, pp. 1727–1730 (2001)
Jou, S., Schultz, T., Waibel, A.: Continuous electromyographic speech recognition with a multi-stream decoding architecture. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2007, Honolulu, Hawaii, US (2007)
Wand, M., Schultz, T.: Investigations on speaking mode discrepancies in emg-based speech recognition. In: Interspeech 2011, Florence, Italy (2011)
Herff, C., Janke, M., Wand, M., Schultz, T.: Impact of different feedback mechanisms in EMG-based speech recognition. In: Interspeech 2011, Florence, Italy (2011)
Wand, M., Schultz, T.: Analysis of phone confusion in EMG-based speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2011, Prague, Czech Republic (2011)
Wand, M., Schultz, T.: Session-independent EMG-based speech recognition. In: International Conference on Bio-Inspired Systems and Signal Processing 2011, Biosignals 2011, Rome, Italy (2011)
Wand, M., Schultz, C., Janke, M., Schultz, T.: Array-based electromyographic silent speech interface. In: 6th International Conference on Bio-Inspired Systems and Signal Processing, Biosignals 2013, Barcelona, Spain (2013)
Freitas, J., Teixeira, A., Vaz, F., Dias, M.S.: Automatic speech recognition based on ultrasonic doppler sensing for European portuguese. In: Torre Toledano, D., Ortega Giménez, A., Teixeira, A., González Rodr\’ıguez, J., Hernández Gómez, L., San Segundo Hernández, R., Ramos Castro, D. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 227–236. Springer, Heidelberg (2012)
Galatas, G., Potamianos, G., Makedon, F.: Audio-visual speech recognition incorporating facial depth information captured by the Kinect. In: Proceedings of the 20th European Signal Processing Conference (EUSIPCO), pp. 2714–2717, 27–31 August 2012 (2012)
Teixeira, A., Martins, P., Oliveira, C., Ferreira, C., Silva, A., Shosted, R.: Real-Time MRI for portuguese. In: Caseli, H., Villavicencio, A., Teixeira, A., Perdigão, F. (eds.) PROPOR 2012. LNCS, vol. 7243, pp. 306–317. Springer, Heidelberg (2012)
Silva, S., Martins, P., Oliveira, C., Silva, A., Teixeira, A.: Segmentation and analysis of the oral and nasal cavities from MR time sequences. In: Campilho, A., Kamel, M. (eds.) ICIAR 2012, Part II. LNCS, vol. 7325, pp. 214–221. Springer, Heidelberg (2012)
Plux Wireless Biosignals, Portugal. http://www.plux.info/
Hudgins, B., Parker, P., Scott, R.: A new strategy for multifunction myoelectric control. IEEE Trans. Biomed. Eng. 40(1), 82–94 (1993)
Quatieri, T.F., Brady, K., Messing, D., Campbell, J.P., Campbell, W.M., Brandstein, M.S., Weinstein, C., Tardelli, J., Gatewood, P.D.: Exploiting nonacoustic sensors for speech encoding. IEEE Trans. Audio, Speech, Lang. Process. 14(2), 533–544 (2006)
Acknowledgements
This work was partially funded by Marie Curie IAPP Golem (ref.251415, FP7-PEOPLE-2009-IAPP), Marie Curie IAPP IRIS (ref. 610986, FP7-PEOPLE-2013-IAPP) and by FEDER through the Operational Program Competitiveness factors - COMPETE under the scope of QREN 5329 FalaGlobal, by National Funds through FCT (Foundation for Science and Technology) in the context of the Project HERON II (PTDC/EEA-PLP/098298/2008) and by project Cloud Thinking (funded by the QREN Mais Centro program: CENTRO-07-ST24-FEDER-002031).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Freitas, J., Teixeira, A., Silva, S., Oliveira, C., Dias, M.S. (2015). Velar Movement Assessment for Speech Interfaces: An Exploratory Study Using Surface Electromyography. In: Plantier, G., Schultz, T., Fred, A., Gamboa, H. (eds) Biomedical Engineering Systems and Technologies. BIOSTEC 2014. Communications in Computer and Information Science, vol 511. Springer, Cham. https://doi.org/10.1007/978-3-319-26129-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-26129-4_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26128-7
Online ISBN: 978-3-319-26129-4
eBook Packages: Computer ScienceComputer Science (R0)