Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Model-Based Dereverberation Preserving Binaural Cues

Published: 01 September 2010 Publication History

Abstract

The ability of the human auditory system for sound localization mainly depends on the binaural cues, especially interaural time and level differences (ITD and ILD). In the context of digital hearing aids and binaural audio transmission systems, these cues can be severely degraded by independent bilateral signal processing such as dereverberation or noise reduction. This contribution presents a novel two-stage binaural dereverberation algorithm which explicitly preserves the binaural cues. The first stage is based on a statistical model of the room impulse responses (RIR) and comprises a spectral subtraction rule which reduces late reverberation only. It includes a smoothing process of the spectral gains to reduce musical tones. In a second stage, the residual reverberation is attenuated by a dual-channel Wiener filter. This is derived from a coherence model of the reverberant sound field taking into account shadowing effects of the head. The overall binaural-input binaural-output structure efficiently reduces both early and late reverberation. In experiments as well as informal listening tests using measured binaural room impulse responses, the proposed algorithm significantly improves speech quality according to objective and subjective measures.

Cited By

View all
  • (2018)Robust Speech Dereverberation With a Neural Network-Based Post-Filter That Exploits Multi-Conditional Training of Binaural CuesIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2017.276581926:2(406-414)Online publication date: 1-Feb-2018
  • (2015)Robust Acoustic Localization Via Time-Delay Compensation and Interaural Matching FilterIEEE Transactions on Signal Processing10.1109/TSP.2015.244749663:18(4771-4783)Online publication date: 13-Aug-2015
  • (2015)Multi-channel linear prediction-based speech dereverberation with sparse priorsIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2015.243854923:9(1509-1520)Online publication date: 1-Sep-2015
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing  Volume 18, Issue 7
September 2010
211 pages

Publisher

IEEE Press

Publication History

Published: 01 September 2010

Author Tags

  1. Binaural cue preservation
  2. dereverberation
  3. head shadowing
  4. spectral subtraction
  5. speech enhancement

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Robust Speech Dereverberation With a Neural Network-Based Post-Filter That Exploits Multi-Conditional Training of Binaural CuesIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2017.276581926:2(406-414)Online publication date: 1-Feb-2018
  • (2015)Robust Acoustic Localization Via Time-Delay Compensation and Interaural Matching FilterIEEE Transactions on Signal Processing10.1109/TSP.2015.244749663:18(4771-4783)Online publication date: 13-Aug-2015
  • (2015)Multi-channel linear prediction-based speech dereverberation with sparse priorsIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2015.243854923:9(1509-1520)Online publication date: 1-Sep-2015
  • (2015)Coherent-to-diffuse power ratio estimation for dereverberationIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2015.241857123:6(1006-1018)Online publication date: 1-Jun-2015
  • (2014)Variational Bayesian inference for multichannel dereverberation and noise reductionIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2014.232973222:8(1320-1335)Online publication date: 1-Aug-2014
  • (2014)Generalized Spherical Array Beamforming for Binaural Speech ReproductionIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2013.229049922:1(238-247)Online publication date: 1-Jan-2014
  • (2014)Reduced-bandwidth Multi-channel Wiener Filter based binaural noise reduction and localization cue preservation in binaural hearing aidsSignal Processing10.1016/j.sigpro.2013.12.01299(1-16)Online publication date: 1-Jun-2014

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media