Abstract
Short-time auditory attention detection (AAD) based on electroencephalography (EEG) can be utilized to help hearing-impaired people improve their perception abilities in multi-speaker environments. However, the large individual differences and very low signal-to-noise ratio (SNR) of EEG signals may prevent the AAD from working effectively across subjects in a short time duration. To address the above issues, this paper firstly used a sparse autoencoder with the same trial constraint (SAE-T) method to extract common features across subjects from EEG signals in a 2-s time window. Then we use a CNN-based speech temporal amplitude envelopes (TAEs) reconstruction model for attention detection by comparing the reconstructed accuracy of attended with unattended speech, and the time delay and segmented SAE-T features were also considered in the model. Moreover, the dataset we used has no directional information of speech, which can train a more general model for practical application. Experimental results show that the proposed method can achieve AAD detection accuracy to 86.31%, higher than the method of removing time delay or segmented SAE-T features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Haile, L., Orji, A., Briant, P., Adelson, J., Davis, A., Vos, T.: Updates on hearing from the global burden of disease study. Innov. Aging 4(Suppl 1), 808 (2020)
Humes, L.E., Rogers, S.E., Main, A.K., Kinney, D.L.: The acoustic environments in which older adults wear their hearing aids: insights from datalogging sound environment classification. Am. J. Audiol. 27(4), 594–603 (2018)
Haykin, S., Liu, K.R.: Handbook on Array Processing and Sensor Networks. John Wiley & Sons, Hoboken (2010)
Kurmanavičiūtė, D., Rantala, A., Jas, M., Välilä, A., Parkkonen, L.: Target of selective auditory attention can be robustly followed with meg. bioRxiv p. 588491 (2019)
O’sullivan, J.A., et al.: Attentional selection in a cocktail party environment can be decoded from single-trial EEG. Cereb. Cortex 25(7), 1697–1706 (2015)
Ding, N., Simon, J.Z.: Emergence of neural encoding of auditory objects while listening to competing speakers. Proc. Natl. Acad. Sci. 109(29), 11854–11859 (2012)
Thut, G., Miniussi, C., Gross, J.: The functional importance of rhythmic activity in the brain. Curr. Biol. 22(16), R658–R663 (2012)
Aiken, S.J., Picton, T.W.: Human cortical responses to the speech envelope. Ear Hear. 29(2), 139–157 (2008)
Crosse, M.J., Di Liberto, G.M., Bednar, A., Lalor, E.C.: The multivariate temporal response function (mTRF) toolbox: a MATLAB toolbox for relating neural signals to continuous stimuli. Front. Hum. Neurosci. 10, 604 (2016)
Geirnaert, S., et al.: Neuro-steered hearing devices: decoding auditory attention from the brain (2021)
Gu, X., et al.: EEG-based brain-computer interfaces (BCIs): a survey of recent studies on signal sensing technologies and computational intelligence approaches and their applications. IEEE/ACM Trans. Comput. Biol. Bioinform. 18, 1645–1666 (2021)
de Taillez, T., Kollmeier, B., Meyer, B.T.: Machine learning for decoding listeners’ attention from electroencephalography evoked by continuous speech. Eur. J. Neurosci. 51(5), 1234–1241 (2020)
Vandecappelle, S., Deckers, L., Das, N., Ansari, A.H., Bertrand, A., Francart, T.: EEG-based detection of the locus of auditory attention with convolutional neural networks. Elife 10, e56481 (2021)
Zhang, Z., Zhang, G., Dang, J., Wu, S., Zhou, D., Wang, L.: EEG-based short-time auditory attention detection using multi-task deep learning. In: INTERSPEECH, pp. 2517–2521 (2020)
Yao, Y., Plested, J., Gedeon, T.: Deep feature learning and visualization for EEG recording using autoencoders. In: Cheng, L., Leung, A.C.S., Ozawa, S. (eds.) ICONIP 2018. LNCS, vol. 11307, pp. 554–566. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04239-4_50
Deckers, L., Das, N., Ansari, A., Bertrand, A., Francart, T.: EEG-based detection of the attended speaker and the locus of auditory attention with convolutional neural networks. biorxiv. 475673 (2018)
Lewis, J.L.: Semantic processing of unattended messages using dichotic listening. J. Exp. Psychol. 85(2), 225 (1970)
Vanthornhout, J., Decruy, L., Francart, T.: Effect of task and attention on neural tracking of speech. Front. Neurosci. 13, 977 (2019)
Oldfield, R.C.: The assessment and analysis of handedness: the Edinburgh inventory. Neuropsychologia 9(1), 97–113 (1971)
Association, W.M., et al.: World medical association declaration of Helsinki. Ethical principles for medical research involving human subjects. Bull. World Health Organ. 79(4), 373 (2001)
Zhou, D., Zhang, G., Dang, J., Wu, S., Zhang, Z.: A multi-subject temporal-spatial hyper-alignment method for EEG-based neural entrainment to speech. In: 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 881–887. IEEE (2020)
Delorme, A., Makeig, S.: EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134(1), 9–21 (2004)
Acknowledgements
This work was supported by the National Natural Science Foundation of China (No. 61876126 and 61503278).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yang, K., Zhang, Z., Zhang, G., Masashi, U., Dang, J., Wang, L. (2023). An Improved Stimulus Reconstruction Method for EEG-Based Short-Time Auditory Attention Detection. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds) Neural Information Processing. ICONIP 2022. Communications in Computer and Information Science, vol 1792. Springer, Singapore. https://doi.org/10.1007/978-981-99-1642-9_23
Download citation
DOI: https://doi.org/10.1007/978-981-99-1642-9_23
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1641-2
Online ISBN: 978-981-99-1642-9
eBook Packages: Computer ScienceComputer Science (R0)