Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/584458.584471acmconferencesArticle/Chapter ViewAbstractPublication PagessccgConference Proceedingsconference-collections
Article

Entropy metrics used for video summarization

Published: 24 April 2002 Publication History

Abstract

New methods for detecting shot boundaries in video sequences and for extracting key frames using metrics based on information theory are proposed. The method for shot cut detection relies on the mutual information and the joint entropy between the frames. It can detect cuts, fade-ins and fade-outs. The detection technique was tested on TV video sequences having different types of shots and containing significant object and camera motion inside the shots. It is demonstrated that the method detects both fades and abrupt cuts with high accuracy. The method for key frame extraction is using the mutual information. We show that it captures satisfactorily the visual content of the shot.

References

[1]
G. Ahanger and T.D.C. Little. A survey of technologies for parsing and indexing digital video. Journal of visual Communication and image representation, 7(1):28-43, 1996.
[2]
A. Del Bimbo. Visual Information Retrieval. Morgan Kaufmann Publishers, Inc, San Francisco, California, 1999.
[3]
P. Browne, A. F. Smeaton, N. Murphy, N. O'Connor, S. Marlow, and C. Berrut. Evaluation and combining digital video shot boundary detection algorithms. In Proceedings of the Fourth Irish Machine Vision and Information Processing Conference, Queens University Belfast, 2000.
[4]
X. U. Cabedo and S. K. Bhattacharjee. Shot detection tools in digital video. In Proceedings of Nonlinear Model Based Image Analysis 1998, Springer Verlag, Glasgow, pages 121-126, July 1998.
[5]
T. M. Cover and J. A. Thomas. Elements of Information Theory. John Wiley and Sons, New York, 1991.
[6]
A. Dailianas, R. B. Allen, and P. England. Comparison of automatic video segmentation algorithms. In Proceedings, SPIE Photonics East '95: Integration Issues in Large Commercial Media Delivery Systems, Oct. 1995, Philadelphia, volume 2615, pages 2-16, 1995.
[7]
M. S. Drew, Z.-N. Li, and X. Zhong. Video dissolve and wipe detection via spatio-temporal images of chromatic histogram differences. In Proceeding of IEEE Int. Conf. on Image Processing (ICIP 2000), volume 3, pages 909-932, 2000.
[8]
B. Grünsel and A. M. Tekalp. Content-based video abstraction. In Proceeding of IEEE Int. Conf. on Image Processing (ICIP'98), Chicago IL, October 1998.
[9]
R. Lienhart. Comparison of automatic shot boundary detection algorithms. In Proc. of SPIE Storage and Retrieval for Image and Video Databases VII, San Jose, CA, U.S.A., volume 3656, pages 290-301, January 1999.
[10]
R. Lienhart. Reliable dissolve detection. In Proc. of SPIE Storage and Retrieval for Media Databases 2001, volume 4315, pages 219-230, January 2001.
[11]
R. Lienhart and A. Zaccarin. A system for reliable dissolve detection in video. In Proceeding of IEEE Intl. Conf. on Image Processing 2001 (ICIP'01), Thessaloniki, Greece, Oct. 2003.
[12]
C. E. Metz. Basic principles of ROC analysis. Seminars in Nuclear Medicine, 8:283-298, 1978.
[13]
A. Nagasaka and Y. Tanaka. Automatic video indexing and full-video search for object appearances. In Visual Database Systems II, 1992.
[14]
A. Papoulis. Probability, Random Variables, and Stochastic Processes. New York: McGraw-Hill, Inc., 1991.
[15]
N. V. Patel and I. K. Sethi. Video shot detection and characterization for video databases. Pattern Recognition, 30(4):583-592, April 1997.
[16]
I. Pitas and A.N. Venetsanopoulos. Nonlinear Digital Filters: Principles and Applications. Kluwer Academic, 1990.
[17]
S. Tsekeridou, S. Krinidis, and I. Pitas. Scene change detection based on audio-visual analysis and interaction. In 2000 Multi-Image Search and Analysis Workshop, accepted for publication, Schloss Dagstuhl, Germany, 12-17 March 2001, March 2001.
[18]
S. Tsekeridou and I. Pitas. Content-based video parsing and indexing based on audio-visual interaction. IEEE Trans. on Circuits and Systems for Video Technology, 11(4):522-535, 2001.
[19]
Y. Wang, Z. Liu, and J.-Ch. Huang. Multimedia content analysis using both audio and visual clues. IEEE Signal Processing Magazine, 17(6):12-36, November 2000.
[20]
W. Wolf. Key frame selection by motion analysis. In Proceeding IEEE Int. Vonf. Acoust., Speech and Signal Proc., 1996.
[21]
R. Zabih, J. Miller, and K. Mai. A feature-based algorithm for detecting and classifying production effects. ACM Journal of Multimedia Systems, 7:119-128, 1999.
[22]
Y. Zhuang, Y. Rui, T. S. Huang, and S. Metrotra. Adaptive key frame extraction using unsupervised clustering. In Proceeding of IEEE Int. Conf. on Image Processing (ICIP'98), Chicago IL, pages 886-890, October 1998.

Cited By

View all
  • (2022)Maximum Shannon Information Delivered in a LectureLatvian Journal of Physics and Technical Sciences10.2478/lpts-2022-000859:2(12-22)Online publication date: 22-Apr-2022
  • (2019)A Novel Framework for Efficient Extraction of Meaningful Key Frames From Surveillance VideoCensorship, Surveillance, and Privacy10.4018/978-1-5225-7113-1.ch019(342-359)Online publication date: 2019
  • (2015)A Novel Framework for Efficient Extraction of Meaningful Key Frames from Surveillance VideoInternational Journal of System Dynamics Applications10.4018/ijsda.20150401044:2(56-73)Online publication date: 1-Apr-2015
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SCCG '02: Proceedings of the 18th Spring Conference on Computer Graphics
April 2002
198 pages
ISBN:1581136080
DOI:10.1145/584458
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 April 2002

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. detection accuracy
  2. entropy
  3. key frame extraction
  4. mutual information
  5. shot boundary detection
  6. video analysis
  7. video segmentation

Qualifiers

  • Article

Conference

PCK50
Sponsor:

Acceptance Rates

Overall Acceptance Rate 42 of 81 submissions, 52%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 29 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Maximum Shannon Information Delivered in a LectureLatvian Journal of Physics and Technical Sciences10.2478/lpts-2022-000859:2(12-22)Online publication date: 22-Apr-2022
  • (2019)A Novel Framework for Efficient Extraction of Meaningful Key Frames From Surveillance VideoCensorship, Surveillance, and Privacy10.4018/978-1-5225-7113-1.ch019(342-359)Online publication date: 2019
  • (2015)A Novel Framework for Efficient Extraction of Meaningful Key Frames from Surveillance VideoInternational Journal of System Dynamics Applications10.4018/ijsda.20150401044:2(56-73)Online publication date: 1-Apr-2015
  • (2014)On summarising the ‘here and now’ of social videos for smart mobile browsing2014 International Workshop on Computational Intelligence for Multimedia Understanding (IWCIM)10.1109/IWCIM.2014.7008797(1-5)Online publication date: Nov-2014
  • (2013)Video key frame extraction through dynamic Delaunay clustering with a structural constraintJournal of Visual Communication and Image Representation10.1016/j.jvcir.2013.08.00324:7(1212-1227)Online publication date: 1-Oct-2013
  • (2012)Drive video summarization based on double articulation structure of driving behaviorProceedings of the 20th ACM international conference on Multimedia10.1145/2393347.2396410(1169-1172)Online publication date: 29-Oct-2012
  • (2010)A new player-enabled rapid video navigation method using temporal quantization and repeated weighted boosting search2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops10.1109/CVPRW.2010.5543733(64-71)Online publication date: Jun-2010
  • (2008)Video motion detection beyond reasonable doubtProceedings of the 1st international conference on Forensic applications and techniques in telecommunications, information, and multimedia and workshop10.5555/1363217.1363225(1-6)Online publication date: 21-Jan-2008
  • (2008)Detection of hard cuts and gradual transitions from video using fuzzy logicInternational Journal of Artificial Intelligence and Soft Computing10.1504/IJAISC.2008.0212651:1(77-98)Online publication date: 1-Nov-2008
  • (2008)LeeDeoProceedings of the 2008 Tenth IEEE International Symposium on Multimedia10.1109/ISM.2008.105(497-502)Online publication date: 15-Dec-2008
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media