Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2578726.2578783acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
tutorial

Highly Efficient Multimedia Event Recounting from User Semantic Preferences

Published: 01 April 2014 Publication History

Abstract

We present the design of a video event recounting system that takes YouTube-like videos, and identifies a minimal set of video segments and textual keyword descriptions in order to convince a user, in a time efficient manner, that the video contains an instance of a user-specificed human activity. The system is based on extensive user studies that have lead to nine design principles about human preferences and limits in semantic understanding. The processing pipeline locates the presence of user query keywords within the video, segments the video according to a model of human short-term memory for semantic similarities, selects those segments that best contain query terms, and abbreviates both the video and textual presentation. Speed-ups of a factor of 6 over simple video viewing time are achievable, without loss of semantic accuracy. In the 2013 Trecvid Multimedia Event Recounting competition, this system placed first in time efficiency, while remaining above average in description accuracy.

References

[1]
Trecvid multimedia event recounting evaluation track, 2012. http://www.nist.gov/itl/iad/mig/mer.cfm.
[2]
A. Barbu, A. Bridge, Z. Burchill, et al. Video in sentences out. In UAI '12.
[3]
L. Cao, S. Chang, N. Codella, et al. Ibm research and columbia university trecvid-2012 multimedia event detection (med) multimedia event recounting (mer) and semantic indexing (sin) systems. TRECVID, 2012.
[4]
J. Deng, J. Krause, A. Berg, and L. Fei-Fei. Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition. In CVPR 2012.
[5]
D. Ding, F. Metze, S. Rawat, P. F. Schulam, S. Burger, E. Younessian, L. Bao, M. G. Christel, and A. Hauptmann. Beyond audio and video retrieval: Towards multimedia summarization. In ICMR '12.
[6]
J. R. Kender and B. lock Yeo. Video scene segmentation via continuous video coherence. In CVPR 1998.
[7]
M. U. G. Khan, L. Zhang, and Y. Gotoh. Towards coherent natural language description of video streams. In ICCV Workshops, 2011.
[8]
C. C. Tan, Y.-G. Jiang, and C.-W. Ngo. Towards textually describing complex video contents with audio-visual concept classifiers. In ACM MM '11.
[9]
Q. Yu, J. Liu, H. Cheng, A. Divakaran, and H. Sawhney. Multimedia event recounting with concept based representation. In ACM MM '12.

Cited By

View all
  • (2019)Video ImprintIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2018.286611441:12(3086-3099)Online publication date: 1-Dec-2019
  • (2017)GEO matching regionsMultimedia Tools and Applications10.1007/s11042-016-3834-z76:14(15377-15411)Online publication date: 1-Jul-2017
  • (2016)Zero-Example Multimedia Event Detection and Recounting with Unsupervised Evidence LocalizationProceedings of the 24th ACM international conference on Multimedia10.1145/2964284.2971480(1464-1468)Online publication date: 1-Oct-2016
  • Show More Cited By

Index Terms

  1. Highly Efficient Multimedia Event Recounting from User Semantic Preferences

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICMR '14: Proceedings of International Conference on Multimedia Retrieval
    April 2014
    564 pages
    ISBN:9781450327824
    DOI:10.1145/2578726
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 April 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Efficient video highlighting
    2. Multimedia event recounting
    3. User semantic preferences
    4. Video segment selection

    Qualifiers

    • Tutorial
    • Research
    • Refereed limited

    Conference

    ICMR '14
    ICMR '14: International Conference on Multimedia Retrieval
    April 1 - 4, 2014
    Glasgow, United Kingdom

    Acceptance Rates

    ICMR '14 Paper Acceptance Rate 21 of 111 submissions, 19%;
    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 09 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)Video ImprintIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2018.286611441:12(3086-3099)Online publication date: 1-Dec-2019
    • (2017)GEO matching regionsMultimedia Tools and Applications10.1007/s11042-016-3834-z76:14(15377-15411)Online publication date: 1-Jul-2017
    • (2016)Zero-Example Multimedia Event Detection and Recounting with Unsupervised Evidence LocalizationProceedings of the 24th ACM international conference on Multimedia10.1145/2964284.2971480(1464-1468)Online publication date: 1-Oct-2016
    • (2016)Event-based media processing and analysisImage and Vision Computing10.1016/j.imavis.2016.05.00553:C(3-19)Online publication date: 1-Sep-2016
    • (2015)Searching PersuasivelyProceedings of the 23rd ACM international conference on Multimedia10.1145/2733373.2806218(581-590)Online publication date: 13-Oct-2015
    • (2014)Content based image retrieval based on geo-location driven image tagging on the social web2014 11th International Computer Conference on Wavelet Actiev Media Technology and Information Processing(ICCWAMTIP)10.1109/ICCWAMTIP.2014.7073408(280-283)Online publication date: Dec-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media