Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2578726.2578793acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
tutorial

Few-Example Video Event Retrieval using Tag Propagation

Published: 01 April 2014 Publication History

Abstract

An emerging topic in multimedia retrieval is to detect a complex event in video using only a handful of video examples. Different from existing work which learns a ranker from positive video examples and hundreds of negative examples, we aim to query web video for events using zero or only a few visual examples. To that end, we propose in this paper a tag-based video retrieval system which propagates tags from a tagged video source to an unlabeled video collection without the need of any training examples. Our algorithm is based on weighted frequency neighbor voting using concept vector similarity. Once tags are propagated to unlabeled video we can rely on off-the-shelf language models to rank these videos by the tag similarity. We study the behavior of our tag-based video event retrieval system by performing three experiments on web videos from the TRECVID multimedia event detection corpus, with zero, one and multiple query examples that beats a recent alternative.

References

[1]
L. Ballan, M. Bertini, A. Del Bimbo, and G. Serra. Enriching and localizing semantic tags in internet videos. In MM, 2011.
[2]
A. Habibian, K. E. A. van de Sande, and C. G. M. Snoek. Recommendations for video event recognition using concept vocabularies. In ICMR, 2013.
[3]
H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez, and C. Schmid. Aggregating local image descriptors into compact codes. TPAMI, 2012.
[4]
Z.-Z. Lan, L. Bao, S.-I. Yu, W. Liu, and A. G. Hauptmann. Double fusion for multimedia event detection. In MMM, 2012.
[5]
X. Li, C. G. M. Snoek, and M. Worring. Learning social tag relevance by neighbor voting. TMM, 2009.
[6]
M. Mazloom, A. Habibian, and C. G. M. Snoek. Querying for video events by semantic signatures from few examples. In MM, 2013.
[7]
M. Merler, B. Huang, L. Xie, G. Hua, and A. Natsev. Semantic model vectors for complex video event recognition. TMM, 2012.
[8]
P. Natarajan et al. Multimodal feature fusion for robust event detection in web videos. In CVPR, 2012.
[9]
A. Natsev, M. R. Naphade, and J. Tesic. Learning the semantics of multimedia queries and concepts from a small number of examples. In MM, 2005.
[10]
N. Rasiwasia, P. J. Moreno, and N. Vasconcelos. Bridging the gap: Query by semantic example. TMM, 2007.
[11]
S. Siersdorfer, J. S. Pedro, and M. Sanderson. Automatic video tagging using content redundancy. In SIGIR, 2009.
[12]
A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and TRECVid. In MIR, 2006.
[13]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. TIS, 2004.

Cited By

View all
  • (2022)Generalized Few-Shot Video Classification With Video Retrieval and Feature GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2021.312055044:12(8949-8961)Online publication date: 1-Dec-2022
  • (2016)Event Detection with Zero ExampleProceedings of the 2016 ACM on International Conference on Multimedia Retrieval10.1145/2911996.2912015(127-134)Online publication date: 6-Jun-2016
  • (2016)Tag-based video retrieval by embedding semantic content in a continuous word space2016 IEEE Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV.2016.7477706(1-8)Online publication date: Mar-2016
  • Show More Cited By

Index Terms

  1. Few-Example Video Event Retrieval using Tag Propagation

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ICMR '14: Proceedings of International Conference on Multimedia Retrieval
      April 2014
      564 pages
      ISBN:9781450327824
      DOI:10.1145/2578726
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 April 2014

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Video event retrieval
      2. tag propagation

      Qualifiers

      • Tutorial
      • Research
      • Refereed limited

      Conference

      ICMR '14
      ICMR '14: International Conference on Multimedia Retrieval
      April 1 - 4, 2014
      Glasgow, United Kingdom

      Acceptance Rates

      ICMR '14 Paper Acceptance Rate 21 of 111 submissions, 19%;
      Overall Acceptance Rate 254 of 830 submissions, 31%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 09 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Generalized Few-Shot Video Classification With Video Retrieval and Feature GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2021.312055044:12(8949-8961)Online publication date: 1-Dec-2022
      • (2016)Event Detection with Zero ExampleProceedings of the 2016 ACM on International Conference on Multimedia Retrieval10.1145/2911996.2912015(127-134)Online publication date: 6-Jun-2016
      • (2016)Tag-based video retrieval by embedding semantic content in a continuous word space2016 IEEE Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV.2016.7477706(1-8)Online publication date: Mar-2016
      • (2016)TagBookIEEE Transactions on Multimedia10.1109/TMM.2016.255994718:7(1378-1388)Online publication date: 1-Jul-2016
      • (2015)Fast and Accurate Content-based Semantic Search in 100M Internet VideosProceedings of the 23rd ACM international conference on Multimedia10.1145/2733373.2806237(49-58)Online publication date: 13-Oct-2015
      • (2015)Encoding Concept Prototypes for Video Event Detection and SummarizationProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749402(123-130)Online publication date: 22-Jun-2015
      • (2015)Bridging the Ultimate Semantic GapProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749399(27-34)Online publication date: 22-Jun-2015
      • (2015)Text-to-video: a semantic search engine for internet videosInternational Journal of Multimedia Information Retrieval10.1007/s13735-015-0093-05:1(3-18)Online publication date: 24-Dec-2015
      • (2014)Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video ClassificationProceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2654931(167-176)Online publication date: 3-Nov-2014

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media