Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1734605.1734654acmconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

A generic approach to classify sports video shots and its application in event detection

Published: 23 November 2009 Publication History

Abstract

Shot type is useful information for semantic sports video analysis. Most existing approaches utilize predefined rules and domain knowledge to derive shot types in sports video. Although these methods have achieved promising results in some specific games, it is hard to extend them from one sport to another. To address this problem, we propose a generic approach to classify shots in sports video. Our approach utilizes bag of visual words model to represent key frame for each shot based on Scale Invariant Feature Transform (SIFT) feature points; either Support Vector Machine (SVM) or Probabilistic Latent Semantic Analysis (PLSA) are then employed to classify key frame to determine shot type. As our approach relies little on domain knowledge, it can be more easily extended to different sports. We have evaluated our shot classification approach over five types of sports video and have achieved promising results. To show the usefulness and effectiveness of our shot classification, we apply the results of shot type to detect events in basketball video via a generative-discriminative model. In addition, we have observed that some common visual parts frequently appear across various shots in the same sport or even different but relevant sports. For instance, soccer and basketball are relevant sports in the sense of field-ball game. Motivated by this observation, we attempt to alleviate the problem of insufficient sports video data in some applications by sharing these visual parts across different but relevant kinds of sports.

References

[1]
D. A. Sadier, N. E. O'Connor, "Event Detection in Field Sports Video Using Audio-Visual Features and a Support Vector Machine", IEEE Transactions on Circuits and Systems for Video Technology, pp. 1225--1233, 2005.
[2]
Y.-P. Tan, D. D. Saur, S. R. Kulkarni, and P. J. Ramadge, "Rapid Estimation of Camera Motion from Compressed Video with Application to Video Annotation," IEEE Transactions on Circuits and Systems for Video Technology, pp. 133--146, 2000.
[3]
A. Ekin, A. M. Teklap, and Mehrotra, "Automatic Soccer Video Analysis and Summarization". IEEE Transactions on Image Processing, pp. 796--807, 2003.
[4]
P. Chang, M. Han and Y. H. Gong, "Extract Highlights from Baseball Game Video with Hidden Markov Models", IEEE International Conference on Image Processing, pp. 609--612, 2002.
[5]
J. Assfalg, M. Bertini, C. Colombo, and A. D. Bimbo, "Semantic Annotation of Soccer Videos: Automatic Highlights Identification" Computer Vision and Image Understanding, pp. 285--305, 2003.
[6]
L. Xie, S.-F. Chang, A. Divakaran, and H. Sun, "Structure Analysis of Soccer Video with Hidden Markov Models", IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4096--4099, 2002.
[7]
L. Y. Duan, M. Xu, Q. Tian, C. S. Xu, and J. J. S, "A Unified Framework for Semantic Shot Classification in Sports Video", IEEE Transactions on Multimedia, pp. 1066--1083, 2005.
[8]
D. Lowe, "Distinctive Image Features from Scale-invariant Key Points", In International Journal of Computer Vision, pp. 91--110, 2004.
[9]
J. Yang, Y. G. Jiang, A. G. Hauptmann and C. W. Ngo, "Evaluating Bag-of-Visual-Words Representations in Scene Classification", ACM MIR, pp. 197--206, 2007.
[10]
F. F. Li, "Knowledge Transfer in Learning to Recognize Visual Objects Classes", International Conference on Development and Learning, 2006.
[11]
T. Hofmann, "Probabilistic Latent Semantic Analysis", Conference on Uncertainty in AI, pages 289--296, 1999.
[12]
R. Lienhart, M. Slaney, "PLSA on Large Scale Image Databases", IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1217--1220, 2007.
[13]
C. S. Xu, J. J. Wang, K. Wan, Y. Q. Li and L. Y. Duan, "Live Sports Event Detection Based on Broadcast Video and Webcasting Text", ACM Multimedia, pp. 221--230, 2006.
[14]
D. Q. Zhang, and S. F. Chang, "Event Detection in Baseball Video using Superimposed Caption Recognition", ACM Multimedia, pp. 315--318, 2002.
[15]
J. Wang., L. Y. Duan, H. Q. Lu, J. S. Jin and C. S. Xu, "A Mid-level Scene Change Representation via Audiovisual Alignment", IEEE International Conference on Acoustics, Speech, and Signal Processing, 2006.
[16]
G. Y. Miao, G. Y. Zhu, S. Q. Jiang, and Q. M. Huang, "a Real-Time Score Detection and Recognition Approach for Broad cast Basketball Video", IEEE International Conference on Multimedia and Expo, pp. 1691--1694, 2007.
[17]
K. T. About-Moustafa, C. Y. Suen, and M. Cheriet, "A Generative-Discriminative Hybrid for Sequential Data Classification", IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 805--808, 2004.

Cited By

View all
  • (2023)Deep Transfer Learning-Based Foot No-Ball Detection in Live Cricket MatchComputational Intelligence and Neuroscience10.1155/2023/23981212023Online publication date: 1-Jan-2023
  • (2020)SSET: a dataset for shot segmentation, event detection, player tracking in soccer videosMultimedia Tools and Applications10.1007/s11042-020-09414-3Online publication date: 7-Aug-2020
  • (2018)Comprehensive Dataset of Broadcast Soccer Videos2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR.2018.00090(418-423)Online publication date: Apr-2018

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICIMCS '09: Proceedings of the First International Conference on Internet Multimedia Computing and Service
November 2009
263 pages
ISBN:9781605588407
DOI:10.1145/1734605
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 November 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. event detection
  2. shot classification
  3. sports video

Qualifiers

  • Research-article

Funding Sources

Conference

ICIMCS '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 163 of 456 submissions, 36%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Deep Transfer Learning-Based Foot No-Ball Detection in Live Cricket MatchComputational Intelligence and Neuroscience10.1155/2023/23981212023Online publication date: 1-Jan-2023
  • (2020)SSET: a dataset for shot segmentation, event detection, player tracking in soccer videosMultimedia Tools and Applications10.1007/s11042-020-09414-3Online publication date: 7-Aug-2020
  • (2018)Comprehensive Dataset of Broadcast Soccer Videos2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR.2018.00090(418-423)Online publication date: Apr-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media