Abstract
Talker localization with microphone arrays has received significant attention lately as a means for the automated tracking of individuals in an enclosure and as a necessary component of any general purpose speech capture system. Several algorithmic approaches are available for speech source localization with multi-channel data. This chapter summarizes the current field and comments on the general merits and shortcomings of each genre. A new localization method is then presented in detail. By utilizing key features of existing methods, this new algorithm is shown to be significantly more robust to acoustical conditions, particularly reverberation effects, than the traditional localization techniques in use today.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
H. Silverman, W. Patterson, J. Flanagan, and D. Rabinkin, “A digital processing system for source location and sound capture by large microphone arrays,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP-97), Munich, Germany, pp. 251–254, April 1997.
M. Brandstein, J. Adcock, and H. Silverman, “Microphone array localization error estimation with application to sensor placement,” J. Acoust. Soc. Am., vol. 99, no. 6, pp. 3807–3816, 1996.
J. Flanagan and H. Silverman, eds., International Workshop on Microphone-Array Systems: Theory and Practice, Brown University, Providence RI, USA, October 1992.
B. Radlovic, R. Williamson, and R. Kennedy, “On the poor robustness of sound equalization in reverberant environments,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP-99), Phoenix AZ, USA, pp. 881–884, March 1999.
J. DiBiase, A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments, PhD thesis, Brown University, Providence RI, USA, May 2000.
W. Bangs and P. Schultheis, “Space-time processing for optimal parameter estimation,” in Signal Processing (J. Griffiths, P. Stocklin, and C. V. Schooneveld, eds.), pp. 577–590, Academic Press, 1973.
G. Carter, “Variance bounds for passively locating an acoustic source with a symmetric line array,” J. Acoust. Soc. Am., vol.. 62, pp. 922–926, October 1977.
W. Hahn and S. Tretter, “Optimum processing for delay-vector estimation in passive signal arrays,” IEEE Trans. Inform Theory, vol. IT-19, pp. 608–614, September 1973.
W. Hahn, “Optimum signal processing for passive sonar range and bearing estimation,” J. Acoust. Soc. Am., vol. 58, pp. 201–207, July 1975.
M. Wax and T. Kailath, “Optimum localization of multiple sources by passive arrays,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-31, pp. 1210–1217, October 1983.
V. M. Alvarado, Talker Localization and Optimal Placement of Microphones for a Linear Microphone Array using Stochastic Region Contraction. PhD thesis, Brown University, Providence RI, USA, May 1990.
H. F. Silverman and S. E. Kirtman, “A two-stage algorithm for determining talker location from linear microphone-array data,” Computer, Speech, and Language, vol. 6, pp. 129–152, April 1992.
D. Johnson and D. Dudgeon, Array Signal Processing- Concepts and Techniques, Prentice Hall, 1993.
S. Haykin, Adaptive Filter Theory, Prentice Hall, second ed., 1991.
R. Schmidt, A Signal Subspace Approach to Multiple Emitter Location and Spectral Estimation, PhD thesis, Stanford University, Stanford CA, USA, 1981.
J. Krolik, “Focussed wide-band array processing for spatial spectral estimation,” in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 2, pp. 221–261, Prentice Hall, 1991.
H. Wang and M. Kaveh, “Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 823–831, August 1985.
K. Buckley and L. Griffiths, “Broad-band signal-subspace spatial-spectrum (BASS-ALE) estimation,” IEEE Trans. Acoust., Speech, Signal Processing, vol.. 36, pp. 953–964, July 1988.
A. Vural, “Effects of pertubations on the performance of optimum/adaptive arrays,” IEEE Trans. Aerosp. Electron., vol. AES-15, pp. 76–87, January 1979.
R. Compton Jr., Adaptive AntennasPrentice Hall, 1988.
T. Shan, M. Wax, and T. Kailath, “On spatial smoothing for direction-ofarrival estimation in coherent signals,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 806–811, August 1985.
M. Brandstein, J. Adcock, and H. Silverman, “A closed-form location estimator for use with room environment microphone arrays,” IEEE Trans. Speech Audio Proc., vol. 5, pp. 45–50, January 1997.
P. Svaizer, M. Matassoni, and M. Omologo, “Acoustic source location in a three-dimensional space using crosspower spectrum phase,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 231234, April 1997.
Y. Huang, J. Benesty, and G. W. Elko“Adaptive eigenvalue decomposition algorithm for realtime acoustic source localization system,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-99)Phoenix AZ, USA, pp. 937–940, March 1999.
R. Schmidt, “A new approach to geometry of range difference location,” IEEE Trans. Aerosp. Electron., vol. AES-8, pp. 821–835, November 1972.
J. Smith and J. Abel, “Closed-form least-squares source location estimation from range-difference measurements,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, pp. 1661–1669, December 1987.
H. Lee, “A novel procedure for accessing the accuracy of hyperbolic multilateration systems,” IEEE Trans. Aerosp. Electron., vol. AES-11, pp. 2–15, January 1975.
N. Marchand, “Error distributions of best estimate of position from multiple time difference hyperbolic networks,” IEEE Trans. Aerosp. Navigat. Electron., vol.. 11, pp. 96–100, June 1964.
C. H. Knapp and G. C. Carter, “The generalized correlation method for estimation of time delay,” IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-24, pp. 320–327, August 1976.
M. Brandstein, J. Adcock, and H. Silverman“A practical time-delay estimator for localizing speech sources with a microphone array,” Computer Speech and Languagevol. 9, pp. 153–169, April 1995.
M. Brandstein and H. Silverman, “A practical methodology for speech source localization with microphone arrays,” Computer, Speech, and Language, vol. 11, pp. 91–126, April 1997.
S. Bédard, B. Champagne, and A. Stéphenne, “Effects of room reverberation on time-delay estimation performance,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP-94), Adelaide, Australia, pp. II:261–264, April 1994.
M. Brandstein and H. Silverman, “A robust method for speech signal time-delay estimation in reverberant rooms,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 375–378, April 1997.
H. Wang and P. Chu“Voice source localization for automatic camera pointing system in videoconferencing,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 187–190, April 1997.
M. Omologo and P. Svaizer, “Use of the crosspower-spectrum phase in acoustic event localization,” IEEE Trans. Speech Audio Proc., vol. 5, pp. 288–292, May 1997.
P. Svaizer, M. Matassoni, and M. Omologo“Acoustic source location in a three-dimensional space using crosspower spectrum phase,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 231234, April 1997.
M. Brandstein, “Time-delay estimation of reverberated speech exploiting harmonic structure,” J. Acoust. Soc. Am., vol. 105, no. 5, pp. 2914–2919, 1999.
A. Stéphenne and B. Champagne“Cepstral prefiltering for time delay estimation in reverberant environments,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-95)Detroit MI, USA, pp. 3055–3058, May 1995.
N. Strobel and R. Rabenstein, “Classification of time delay estimates in reverberant environments,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-99)Phoenix AZ, USA, pp. 3081–3084, March 1999.
B. Friedlander, “A passive localization algorithm and its accuracy analysis,” IEEE Jour. Oceanic Engineering, vol. 0E-12, pp. 234–245, January 1987.
Y. Chan and K. Ho, “A simple and efficient estimator for hyperbolic location,” IEEE Trans. Signal Processing, vol. 42, pp. 1905–1915, August 1994.
D. Sturim, M. Brandstein, and H. Silverman, “Tracking multiple talkers using microphone-array measurements,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 371–374, April 1997.
L. Kinsler, A. Frey, A. Coppens, and J. Sanders, Fundamentals of Acoustics, John Wiley & Sons, third ed., 1982.
L. Ziomek, Fundamentals of Acoustic Field Theory and Space-Time Signal Processing, CRC Press, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
DiBiase, J.H., Silverman, H.F., Brandstein, M.S. (2001). Robust Localization in Reverberant Rooms. In: Brandstein, M., Ward, D. (eds) Microphone Arrays. Digital Signal Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-04619-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-662-04619-7_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-07547-6
Online ISBN: 978-3-662-04619-7
eBook Packages: Springer Book Archive