Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1282280.1282366acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
Article

Video search in concept subspace: a text-like paradigm

Published: 09 July 2007 Publication History

Abstract

Though both quantity and quality of semantic concept detection in video are continuously improving, it still remains unclear how to exploit these detected concepts as semantic indices in video search, given a specific query. In this paper, we tackle this problem and propose a video search framework which operates like searching text documents. Noteworthy for its adoption of the well-founded text search principles, this framework first selects a few related concepts for a given query, by employing a tf-idf like scheme, called c-tf-idf, to measure the informativeness of the concepts to this query. These selected concepts form a concept subspace. Then search can be conducted in this concept subspace, either by a Vector Model or a Language Model. Further, two algorithms, i.e., Linear Summation and Random Walk through Concept-Link, are explored to combine the concept search results and other baseline search results in a reranking scheme. This framework is both effective and efficient. Using a lexicon of 311 concepts from the LSCOM concept ontology, experiments conducted on the TRECVID 2006 search data set show that: when used solely, search within the concept subspace achieves the state-of-the-art concept search result; when used to rerank the baseline results, it can improve over the top 20 automatic search runs in TRECVID 2006 on average by approx. 20%, on the most significant one by approx. 50%, all within 180 milliseconds on a normal PC.

References

[1]
Trecvid home page. http://www-nlpir.nist.gov/projects/trecvid.
[2]
A. Aizawa. An information-theoretic perspective of tf-idf measures. Information Processing and Management, 39:45--65, January 2003.
[3]
A. Amir, J. Argillandery, M. Campbell, A. Haubold, G. Iyengar, S. Ebadollahi, F. Kang, M. R. Naphade, A. P. Natsev, J. R. Smith, J. Tešić, and T. Volkmer. IBM research trecvid-2005 video retrieval system. In Proc. of TRECVID, 2005.
[4]
R. A. Baeza-Yates and B. A. Ribeiro-Neto. Modern Information Retrieval. ACM Press/Addison-Wesley, 1999.
[5]
H. Bay, T. Tuytelaars, and L. Gool. Surf: Speeded up robust features. In Proc. of ECCV, 2006.
[6]
M. Campbell and et al. IBM research trecvid-2006 video retrieval system. In Proc. Of TRECVID, 2006.
[7]
J. Cao, Y. Lan, J. Li, and et al. Tsinghua university at trecvid 2006. In Proc. of TRECVID, 2006.
[8]
S.-F. Chang, W. Hsu, W. Jiang, L. Kennedy, D. Xu, A. Yanagawa, and E. Zavesky. Evaluating the impact of 374 visualbased lscom concept detectors on automatic search. In Proc. Of TRECVID, 2006.
[9]
T.-S. Chua, S.-Y. Neo, K.-Y. Li, G. Wang, R. Shi, M. Zhao, and H. Xu. Trecvid 2004 search and feature extraction task by nus pris. In Proc. of TRECVID, 2004.
[10]
K. M. Donald and A. F. Smeaton. A comparison of score, rank and probability-based fusion methods for video shot retrieval. In Proc. of CIVR, 2005.
[11]
Y. Freund, R. Iyer, R. E. Schapire, and Y. Singer. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research, 4:933--969, 2003.
[12]
W. H. Hsu, L. S. Kennedy, and S.-F. Chang. Video search reranking via information bottleneck principle. In Proc. of ACM Multimedia 2006.
[13]
M. Naphade, J. R. Smith, J. Tesic, S.-F. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and J. Curtis. Large-scale concept ontology for multimedia. IEEE Multimedia Magazine, 2006.
[14]
M. R. Naphade, L. Kennedy, J. R. Kender, S.-F. Chang, J. R. Smith, P. Over, and A. H. A. A light scale concept ontology for multimedia understanding for trecvid 2005. In Proc. of TRECVID, 2005.
[15]
A. P. Natsev, M. R. Naphade, and J. Tesic. Learning the semantics of multimedia queries and concepts from a small number of examples. In Proc. of ACM Multimedia, 2005.
[16]
P. Natsev. IBM marvel for trecvid06 automatic search. In Proc. of TRECVID. 2006.
[17]
S.-Y. Neo, J. Zhao, M.-Y. Kan, and T.-S. Chua. Video retrieval using high level features: Exploiting query matching and confidence-based weighting. In Proc. of CIVR, 2006.
[18]
P. Over, T. Ianeva, W. Kraaij, and A. F. Smeaton. Trecvid 2005 - an overview. In Proc. Of TRECVID, 2005.
[19]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.
[20]
J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In Proc. of ACM SIGIR, 1998.
[21]
E. Seneta. Non-Negative Matrices and Markov Chains. Springer-Verlag, 1981.
[22]
C. G. Snoek and et al. The MediaMill trecvid 2006 semantic video search engine. In Proc. Of TRECVID, 2006.
[23]
C. G. Snoek, M. Worring, D. C. Koelma, and A. W. Smeulders. A learned lexicon-driven paradigm for interactive video retrieval. IEEE Trans. Multimeida, February 2007.
[24]
C. G. Snoek, M. Worring, J. C. van Gemert, J.-M. Geusebroek, and A. W. Smeulders. The challenge problem for automated detection of 101 semantic concepts in multimedia. In Proc. of ACM Multimedia, 2006.
[25]
V. N. Vapnik. The Nature of Statistical Learning Theory. Springer, 1995.
[26]
D. Wang, J. Li, and B. Zhang. Relay boost fusion for learning rare concepts in multimedia. In Proc. of CIVR, 2006.
[27]
R. Yan, A. Hauptmann, and R. Jin. Multimedia search with pseudo-relevance feedback. In Proc. of CIVR, 2003.
[28]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proc. of ACM SIGIR, 2001.
[29]
W. Zheng, J. Li, Z. Si, F. Lin, and B. Zhang. Using high-level semantic features in video retrieval. In Proc. of CIVR, 2006.

Cited By

View all
  • (2023)An image reranking algorithm based on discrete-time quantum walkMultimedia Tools and Applications10.1007/s11042-023-16916-383:12(34979-34994)Online publication date: 28-Sep-2023
  • (2020)Searching for Actions on the Hyperbole2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR42600.2020.00122(1138-1147)Online publication date: Jun-2020
  • (2018)Learning a Multi-Concept Video Retrieval Model with Multiple Latent VariablesACM Transactions on Multimedia Computing, Communications, and Applications10.1145/317664714:2(1-21)Online publication date: 25-Apr-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval
July 2007
655 pages
ISBN:9781595937339
DOI:10.1145/1282280
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 July 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. concept subspace
  2. query concept mapping
  3. video search

Qualifiers

  • Article

Conference

CIVR07
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)An image reranking algorithm based on discrete-time quantum walkMultimedia Tools and Applications10.1007/s11042-023-16916-383:12(34979-34994)Online publication date: 28-Sep-2023
  • (2020)Searching for Actions on the Hyperbole2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR42600.2020.00122(1138-1147)Online publication date: Jun-2020
  • (2018)Learning a Multi-Concept Video Retrieval Model with Multiple Latent VariablesACM Transactions on Multimedia Computing, Communications, and Applications10.1145/317664714:2(1-21)Online publication date: 25-Apr-2018
  • (2018)Using semantic context for multiple concepts detection in still imagesPattern Analysis and Applications10.1007/s10044-018-0761-9Online publication date: 2-Nov-2018
  • (2017)Spatio-Temporal Person Retrieval via Natural Language Queries2017 IEEE International Conference on Computer Vision (ICCV)10.1109/ICCV.2017.162(1462-1471)Online publication date: Oct-2017
  • (2016)A comparative study for multiple visual concepts detection in images and videosMultimedia Tools and Applications10.1007/s11042-015-2730-275:15(8973-8997)Online publication date: 1-Aug-2016
  • (2016)Towards large-scale multimedia retrieval enriched by knowledge about human interpretationMultimedia Tools and Applications10.1007/s11042-014-2292-875:1(297-331)Online publication date: 1-Jan-2016
  • (2015)Understanding object descriptions in robotics by open-vocabulary object retrieval and detectionThe International Journal of Robotics Research10.1177/027836491560205935:1-3(265-280)Online publication date: 13-Oct-2015
  • (2015)Video Retrieval Based on Uncertain Concept Detection Using Dempster–Shafer TheoryMultimedia Data Mining and Analytics10.1007/978-3-319-14998-1_12(269-294)Online publication date: 1-Apr-2015
  • (2014)Infrequent concept pairs detection in multimedia documentsProceedings of International Conference on Multimedia Retrieval10.1145/2578726.2578787(435-438)Online publication date: 1-Apr-2014
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media