Nothing Special   »   [go: up one dir, main page]

Skip to main content

Audio-Based Event Detection for Sports Video

  • Conference paper
  • First Online:
Image and Video Retrieval (CIVR 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2728))

Included in the following conference series:

Abstract

In this paper, we present an audio-based event detection approach shown to be effective when applied to the Sports broadcast data. The main benefit of this approach is the ability to recognise patterns that indicate high levels of crowd response which can be correlated to key events. By applying Hidden Markov Model-based classifiers, where the predefined content classes are parameterised using Mel-Frequency Cepstral Coefficients, we were able to eliminate the need for defining a heuristic set of rules to determine event detection, thus avoiding a two-class approach shown not to be suitable for this problem. Experimentation indicated that this is an effective method for classifying crowd response in Soccer matches, thus providing a basis for automatic indexing and summarisation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Y.L. Chang, W. Zeng, I. Kamel, and R. Alonso. Integrated image and speech analysis for content-based video indexing. In ICMCS, pages 306–313. IEEE, 1996.

    Google Scholar 

  2. D. Keislar E. Wold, T. Blum and J. Wheaton. Content-based classification, search, and retrieval of audio. In In IEEE Multimedia, volume 3, pages 27–36. IEEE, 1996.

    Article  Google Scholar 

  3. Y. Gong, T. S. Lim, and H.C. Chua. Automatic parsing of tv soccer programs. In ICMCS, pages 167–174, Washington DC, May 1995.

    Google Scholar 

  4. TiVo Inc. http://www.tivo.com/. Last visited 24th April 2003.

    Google Scholar 

  5. S. Intille and A. Bobick. Visual tracking using closed worlds. Technical report, MIT Media Laboratory, 1995. http://web.media.mit.edu/ intille/.

    Google Scholar 

  6. A.K. Jain, R.P.W. Duin, and J. Mao. Statistical pattern recognition: A review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(1):4–37, January 2000.

    Article  Google Scholar 

  7. J.P. Cambell Jnr. Speaker recognition: A tutorial. In Proceedings of the IEEE, volume 85, pages 1437–1462, September 1997.

    Article  Google Scholar 

  8. J. Kittler, K. Messer, W. Christmas, B Levienaise-Obadia, and D. Koubaroulis. Generation of semantic cues for sports video annotation. In ICIP, pages 26–29, Thessaloniki, Greece, October 2001.

    Google Scholar 

  9. K. Kobla, D. Doermann, and D. DeMenthon. Identification of sports videos using replay, text, and camera motion features. In Conference on Storage and Retrieval for Media Databases, volume 3972, pages 332–343. SPIE, January 2000.

    Google Scholar 

  10. C. Li and G. Biswas. A bayesian approach to temporal data clustering using hidden markov models. In ICML, pages 543–550, Stanford, California, 2000.

    Google Scholar 

  11. M. R. Naphade, A. Garg, and T. S. Huang. Duration dependent input output markov models for audio-visual event detection. In ICME, Tokyo, Japan, August 2001. IEEE.

    Google Scholar 

  12. OPTA. http://www.opta.co.uk/. Last visited 24th April 2003.

    Google Scholar 

  13. D. Pye. Content-based methods for the management of digital music. In ICASSP, volume IV, pages 2437–2400, 2000.

    Google Scholar 

  14. L. Rabiner and B.H. Juang. Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs, NJ, USA, 1993.

    Google Scholar 

  15. Y. Rui, A. Gupta, and A. Acero. Automatically extracting highlights for tv baseball programs. In ACM Multimedia, pages 105–115, LA, 2000.

    Google Scholar 

  16. Sky+. http://www.sky.com/. Last visited 24th April 2003.

    Google Scholar 

  17. P. van Beek, H. Pan, and M. I. Sezan. Detection of slow-motion replay segments in sports video for highlights generation. In ICASSP, Utah, May 7–11 2001.

    Google Scholar 

  18. Y. Wang, Z. Liu, and J. Huang. Multimedia content analysis using both audio and visual clues. In IEEE Signal Processing Magazine, volume 17, pages 12–36. 2000.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Baillie, M., Jose, J.M. (2003). Audio-Based Event Detection for Sports Video. In: Bakker, E.M., Lew, M.S., Huang, T.S., Sebe, N., Zhou, X.S. (eds) Image and Video Retrieval. CIVR 2003. Lecture Notes in Computer Science, vol 2728. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45113-7_30

Download citation

  • DOI: https://doi.org/10.1007/3-540-45113-7_30

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40634-1

  • Online ISBN: 978-3-540-45113-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics