Nothing Special   »   [go: up one dir, main page]

Skip to main content

Robust Localization in Reverberant Rooms

  • Chapter
Microphone Arrays

Part of the book series: Digital Signal Processing ((DIGSIGNAL))

Abstract

Talker localization with microphone arrays has received significant attention lately as a means for the automated tracking of individuals in an enclosure and as a necessary component of any general purpose speech capture system. Several algorithmic approaches are available for speech source localization with multi-channel data. This chapter summarizes the current field and comments on the general merits and shortcomings of each genre. A new localization method is then presented in detail. By utilizing key features of existing methods, this new algorithm is shown to be significantly more robust to acoustical conditions, particularly reverberation effects, than the traditional localization techniques in use today.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. H. Silverman, W. Patterson, J. Flanagan, and D. Rabinkin, “A digital processing system for source location and sound capture by large microphone arrays,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP-97), Munich, Germany, pp. 251–254, April 1997.

    Google Scholar 

  2. M. Brandstein, J. Adcock, and H. Silverman, “Microphone array localization error estimation with application to sensor placement,” J. Acoust. Soc. Am., vol. 99, no. 6, pp. 3807–3816, 1996.

    Article  Google Scholar 

  3. J. Flanagan and H. Silverman, eds., International Workshop on Microphone-Array Systems: Theory and Practice, Brown University, Providence RI, USA, October 1992.

    Google Scholar 

  4. B. Radlovic, R. Williamson, and R. Kennedy, “On the poor robustness of sound equalization in reverberant environments,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP-99), Phoenix AZ, USA, pp. 881–884, March 1999.

    Google Scholar 

  5. J. DiBiase, A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments, PhD thesis, Brown University, Providence RI, USA, May 2000.

    Google Scholar 

  6. W. Bangs and P. Schultheis, “Space-time processing for optimal parameter estimation,” in Signal Processing (J. Griffiths, P. Stocklin, and C. V. Schooneveld, eds.), pp. 577–590, Academic Press, 1973.

    Google Scholar 

  7. G. Carter, “Variance bounds for passively locating an acoustic source with a symmetric line array,” J. Acoust. Soc. Am., vol.. 62, pp. 922–926, October 1977.

    Google Scholar 

  8. W. Hahn and S. Tretter, “Optimum processing for delay-vector estimation in passive signal arrays,” IEEE Trans. Inform Theory, vol. IT-19, pp. 608–614, September 1973.

    Google Scholar 

  9. W. Hahn, “Optimum signal processing for passive sonar range and bearing estimation,” J. Acoust. Soc. Am., vol. 58, pp. 201–207, July 1975.

    Article  Google Scholar 

  10. M. Wax and T. Kailath, “Optimum localization of multiple sources by passive arrays,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-31, pp. 1210–1217, October 1983.

    Google Scholar 

  11. V. M. Alvarado, Talker Localization and Optimal Placement of Microphones for a Linear Microphone Array using Stochastic Region Contraction. PhD thesis, Brown University, Providence RI, USA, May 1990.

    Google Scholar 

  12. H. F. Silverman and S. E. Kirtman, “A two-stage algorithm for determining talker location from linear microphone-array data,” Computer, Speech, and Language, vol. 6, pp. 129–152, April 1992.

    Google Scholar 

  13. D. Johnson and D. Dudgeon, Array Signal Processing- Concepts and Techniques, Prentice Hall, 1993.

    Google Scholar 

  14. S. Haykin, Adaptive Filter Theory, Prentice Hall, second ed., 1991.

    Google Scholar 

  15. R. Schmidt, A Signal Subspace Approach to Multiple Emitter Location and Spectral Estimation, PhD thesis, Stanford University, Stanford CA, USA, 1981.

    Google Scholar 

  16. J. Krolik, “Focussed wide-band array processing for spatial spectral estimation,” in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 2, pp. 221–261, Prentice Hall, 1991.

    Google Scholar 

  17. H. Wang and M. Kaveh, “Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 823–831, August 1985.

    Google Scholar 

  18. K. Buckley and L. Griffiths, “Broad-band signal-subspace spatial-spectrum (BASS-ALE) estimation,” IEEE Trans. Acoust., Speech, Signal Processing, vol.. 36, pp. 953–964, July 1988.

    Article  MATH  Google Scholar 

  19. A. Vural, “Effects of pertubations on the performance of optimum/adaptive arrays,” IEEE Trans. Aerosp. Electron., vol. AES-15, pp. 76–87, January 1979.

    Google Scholar 

  20. R. Compton Jr., Adaptive AntennasPrentice Hall, 1988.

    Google Scholar 

  21. T. Shan, M. Wax, and T. Kailath, “On spatial smoothing for direction-ofarrival estimation in coherent signals,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 806–811, August 1985.

    Google Scholar 

  22. M. Brandstein, J. Adcock, and H. Silverman, “A closed-form location estimator for use with room environment microphone arrays,” IEEE Trans. Speech Audio Proc., vol. 5, pp. 45–50, January 1997.

    Article  Google Scholar 

  23. P. Svaizer, M. Matassoni, and M. Omologo, “Acoustic source location in a three-dimensional space using crosspower spectrum phase,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 231234, April 1997.

    Google Scholar 

  24. Y. Huang, J. Benesty, and G. W. Elko“Adaptive eigenvalue decomposition algorithm for realtime acoustic source localization system,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-99)Phoenix AZ, USA, pp. 937–940, March 1999.

    Google Scholar 

  25. R. Schmidt, “A new approach to geometry of range difference location,” IEEE Trans. Aerosp. Electron., vol. AES-8, pp. 821–835, November 1972.

    Google Scholar 

  26. J. Smith and J. Abel, “Closed-form least-squares source location estimation from range-difference measurements,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, pp. 1661–1669, December 1987.

    Google Scholar 

  27. H. Lee, “A novel procedure for accessing the accuracy of hyperbolic multilateration systems,” IEEE Trans. Aerosp. Electron., vol. AES-11, pp. 2–15, January 1975.

    Google Scholar 

  28. N. Marchand, “Error distributions of best estimate of position from multiple time difference hyperbolic networks,” IEEE Trans. Aerosp. Navigat. Electron., vol.. 11, pp. 96–100, June 1964.

    Google Scholar 

  29. C. H. Knapp and G. C. Carter, “The generalized correlation method for estimation of time delay,” IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-24, pp. 320–327, August 1976.

    Google Scholar 

  30. M. Brandstein, J. Adcock, and H. Silverman“A practical time-delay estimator for localizing speech sources with a microphone array,” Computer Speech and Languagevol. 9, pp. 153–169, April 1995.

    Google Scholar 

  31. M. Brandstein and H. Silverman, “A practical methodology for speech source localization with microphone arrays,” Computer, Speech, and Language, vol. 11, pp. 91–126, April 1997.

    Google Scholar 

  32. S. Bédard, B. Champagne, and A. Stéphenne, “Effects of room reverberation on time-delay estimation performance,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP-94), Adelaide, Australia, pp. II:261–264, April 1994.

    Google Scholar 

  33. M. Brandstein and H. Silverman, “A robust method for speech signal time-delay estimation in reverberant rooms,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 375–378, April 1997.

    Google Scholar 

  34. H. Wang and P. Chu“Voice source localization for automatic camera pointing system in videoconferencing,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 187–190, April 1997.

    Google Scholar 

  35. M. Omologo and P. Svaizer, “Use of the crosspower-spectrum phase in acoustic event localization,” IEEE Trans. Speech Audio Proc., vol. 5, pp. 288–292, May 1997.

    Article  Google Scholar 

  36. P. Svaizer, M. Matassoni, and M. Omologo“Acoustic source location in a three-dimensional space using crosspower spectrum phase,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 231234, April 1997.

    Google Scholar 

  37. M. Brandstein, “Time-delay estimation of reverberated speech exploiting harmonic structure,” J. Acoust. Soc. Am., vol. 105, no. 5, pp. 2914–2919, 1999.

    Article  Google Scholar 

  38. A. Stéphenne and B. Champagne“Cepstral prefiltering for time delay estimation in reverberant environments,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-95)Detroit MI, USA, pp. 3055–3058, May 1995.

    Google Scholar 

  39. N. Strobel and R. Rabenstein, “Classification of time delay estimates in reverberant environments,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-99)Phoenix AZ, USA, pp. 3081–3084, March 1999.

    Google Scholar 

  40. B. Friedlander, “A passive localization algorithm and its accuracy analysis,” IEEE Jour. Oceanic Engineering, vol. 0E-12, pp. 234–245, January 1987.

    Google Scholar 

  41. Y. Chan and K. Ho, “A simple and efficient estimator for hyperbolic location,” IEEE Trans. Signal Processing, vol. 42, pp. 1905–1915, August 1994.

    Article  Google Scholar 

  42. D. Sturim, M. Brandstein, and H. Silverman, “Tracking multiple talkers using microphone-array measurements,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Processing (ICASSP-97)Munich, Germany, pp. 371–374, April 1997.

    Google Scholar 

  43. L. Kinsler, A. Frey, A. Coppens, and J. Sanders, Fundamentals of Acoustics, John Wiley & Sons, third ed., 1982.

    Google Scholar 

  44. L. Ziomek, Fundamentals of Acoustic Field Theory and Space-Time Signal Processing, CRC Press, 1995.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

DiBiase, J.H., Silverman, H.F., Brandstein, M.S. (2001). Robust Localization in Reverberant Rooms. In: Brandstein, M., Ward, D. (eds) Microphone Arrays. Digital Signal Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-04619-7_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-04619-7_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-07547-6

  • Online ISBN: 978-3-662-04619-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics