Robust Localization in Reverberant Rooms

Joseph H. DiBiase,
Harvey F. Silverman &
Michael S. Brandstein

Part of the book series: Digital Signal Processing (DIGSIGNAL)

Abstract

Talker localization with microphone arrays has received significant attention lately, both as a means for the automated tracking of individuals in an enclosure and as a necessary component of any general-purpose speech capture system. Several algorithmic approaches are available for speech source localization with multi-channel data. This chapter summarizes the current field and comments on the general merits and shortcomings of each genre. A new localization method is then presented in detail. By utilizing key features of existing methods, this new algorithm is shown to be significantly more robust to acoustical conditions, particularly reverberation effects, than the traditional localization techniques in use today.

Author information

Authors and Affiliations

Brown University, Providence, RI, USA
Joseph H. DiBiase & Harvey F. Silverman
Harvard University, Cambridge, MA, USA
Michael S. Brandstein

Editor information

Editors and Affiliations

Div. of Eng. and Applied Sciences, Harvard University, 33 Oxford Street, 02138, Cambridge, MA, USA
Michael Brandstein
Dept. of Electrical Engineering, Imperial College, Exhibition Road, SW7 2AZ, London, GB
Darren Ward

About this chapter

Cite this chapter

DiBiase, J.H., Silverman, H.F., Brandstein, M.S. (2001). Robust Localization in Reverberant Rooms. In: Brandstein, M., Ward, D. (eds) Microphone Arrays. Digital Signal Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-04619-7_8

DOI: https://doi.org/10.1007/978-3-662-04619-7_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-07547-6
Online ISBN: 978-3-662-04619-7
eBook Packages: Springer Book Archive
