Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2502081.2502187acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
poster

Violence detection in hollywood movies by the fusion of visual and mid-level audio cues

Published: 21 October 2013 Publication History

Abstract

Detecting violent scenes in movies is an important video content understanding functionality e.g., for providing automated youth protection services. One key issue in designing algorithms for violence detection is the choice of discriminative features. In this paper, we employ mid-level audio features and compare their discriminative power against low-level audio and visual features. We fuse these mid-level audio cues with low-level visual ones at the decision level in order to further improve the performance of violence detection. We use Mel-Frequency Cepstral Coefficients (MFCC) as audio and average motion as visual features. In order to learn a violence model, we choose two-class support vector machines (SVMs). Our experimental results on detecting violent video shots in Hollywood movies show that mid-level audio features are more discriminative and provide more precise results than low-level ones. The detection performance is further enhanced by fusing the mid-level audio cues with low-level visual ones using an SVM-based decision fusion.

Supplementary Material

ZIP File (mm106s.zip)

References

[1]
Media Eval VSD task proceedings, 2012. http://ceur-ws.org/Vol-927/.
[2]
VSD affect task, 2012. http://www.multimediaeval.org/mediaeval2012/.
[3]
R. Akbani, S. Kwek, and N. Japkowicz. Applying support vector machines to imbalanced datasets. ECML, pages 39--50, 2004.
[4]
S. Andrews, I. Tsochantaridis, and T. Hofmann. Support vector machines for multiple-instance learning. Advances in neural information processing systems, 15:561--568, 2002.
[5]
B. Bushman and L. Huesmann. Short-term and long-term effects of violent media on aggression in children and adults. Archives of Pediatrics & Adolescent Medicine, 160(4):348, 2006.
[6]
L.-H. Chen, H.-W. Hsu, L.-Y. Wang, and C.-W. Su. Horror video scene recognition via multiple-instance learning. In ICASSP, 2011.
[7]
C. Demarty, C. Penet, G. Gravier, and M. Soleymani. The MediaEval 2012 Affect Task: Violent Scenes Detection in Hollywood Movies. In MediaEval 2012 Workshop, Pisa, Italy, October 4--5 2012.
[8]
C.-H. Demarty, C. Penet, G. Gravier, and M. Soleymani. A benchmarking campaign for the multimodal detection of violent scenes in movies. In ECCV, 2012.
[9]
T. Giannakopoulos, A. Makris, D. Kosmopoulos, S. Perantonis, and S. Theodoridis. Audio-visual fusion for detecting violent scenes in videos. Artificial Intelligence: Theories, Models and Applications, pages 91--100, 2010.
[10]
Y. Gong, W. Wang, S. Jiang, Q. Huang, and W. Gao. Detecting violent scenes in movies by auditory and visual cues. Advances in Multimedia Information Processing 2008, pages 317--326, 2008.
[11]
H. He and E. Garcia. Learning from imbalanced data. Knowledge and Data Engineering, IEEE Transactions on, 21(9):1263--1284, 2009.
[12]
J. Lin and W. Wang. Weakly-supervised violence detection in movies with audio and video based co-training. Advances in Multimedia Information Processing, pages 930--935, 2009.
[13]
L. Rabiner and B.-H. Juang. Fundamentals of speech recognition. Signal Processing Series, 1993.
[14]
R. Yan and M. Naphade. Semi-supervised cross feature learning for semantic concept detection in videos. In CVPR, 2005.

Cited By

View all
  • (2024)Exploration of Speech and Music Information for Movie Genre ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366419720:8(1-19)Online publication date: 13-Jun-2024
  • (2024)Convolutional Neural Networks Based Video Anomaly Detection ApproachesAnomaly Detection in Video Surveillance10.1007/978-981-97-3023-0_12(287-312)Online publication date: 7-Aug-2024
  • (2023)A Comprehensive Review on Vision-Based Violence Detection in Surveillance VideosACM Computing Surveys10.1145/356197155:10(1-44)Online publication date: 2-Feb-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN:9781450324045
DOI:10.1145/2502081
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bag-of-audio-words
  2. decision fusion
  3. mel-frequency cepstral coefficients
  4. motion
  5. support vector machine

Qualifiers

  • Poster

Conference

MM '13
Sponsor:
MM '13: ACM Multimedia Conference
October 21 - 25, 2013
Barcelona, Spain

Acceptance Rates

MM '13 Paper Acceptance Rate 47 of 235 submissions, 20%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Exploration of Speech and Music Information for Movie Genre ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366419720:8(1-19)Online publication date: 13-Jun-2024
  • (2024)Convolutional Neural Networks Based Video Anomaly Detection ApproachesAnomaly Detection in Video Surveillance10.1007/978-981-97-3023-0_12(287-312)Online publication date: 7-Aug-2024
  • (2023)A Comprehensive Review on Vision-Based Violence Detection in Surveillance VideosACM Computing Surveys10.1145/356197155:10(1-44)Online publication date: 2-Feb-2023
  • (2022)Affect in Multimedia: Benchmarking Violent Scenes DetectionIEEE Transactions on Affective Computing10.1109/TAFFC.2020.298696913:1(347-366)Online publication date: 1-Jan-2022
  • (2022)VidHarm: A Clip Based Dataset for Harmful Content Detection2022 26th International Conference on Pattern Recognition (ICPR)10.1109/ICPR56361.2022.9956148(1543-1549)Online publication date: 21-Aug-2022
  • (2021)Intelligent Violence Video Detection System2021 3rd International Conference on Advancements in Computing (ICAC)10.1109/ICAC54203.2021.9671189(270-275)Online publication date: 9-Dec-2021
  • (2018)An architecture to identify violence in video surveillance system using ViF and LBP2018 4th International Conference on Recent Advances in Information Technology (RAIT)10.1109/RAIT.2018.8389027(1-6)Online publication date: Mar-2018
  • (2017)Temporal Robust Features for Violence Detection2017 IEEE Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV.2017.50(391-399)Online publication date: Mar-2017
  • (2017)Crowd Violence Detection Using Global Motion-Compensated Lagrangian Features and Scale-Sensitive Video-Level RepresentationIEEE Transactions on Information Forensics and Security10.1109/TIFS.2017.272582012:12(2945-2956)Online publication date: Dec-2017
  • (2017)Comparing models for gesture recognition of children's bullying behaviors2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII)10.1109/ACII.2017.8273591(138-145)Online publication date: Oct-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media