Article

Video search in concept subspace: a text-like paradigm

Authors:

Bo ZhangAuthors Info & Claims

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval

Pages 603 - 610

https://doi.org/10.1145/1282280.1282366

Published: 09 July 2007 Publication History

Abstract

Though both quantity and quality of semantic concept detection in video are continuously improving, it still remains unclear how to exploit these detected concepts as semantic indices in video search, given a specific query. In this paper, we tackle this problem and propose a video search framework which operates like searching text documents. Noteworthy for its adoption of the well-founded text search principles, this framework first selects a few related concepts for a given query, by employing a tf-idf like scheme, called c-tf-idf, to measure the informativeness of the concepts to this query. These selected concepts form a concept subspace. Then search can be conducted in this concept subspace, either by a Vector Model or a Language Model. Further, two algorithms, i.e., Linear Summation and Random Walk through Concept-Link, are explored to combine the concept search results and other baseline search results in a reranking scheme. This framework is both effective and efficient. Using a lexicon of 311 concepts from the LSCOM concept ontology, experiments conducted on the TRECVID 2006 search data set show that: when used solely, search within the concept subspace achieves the state-of-the-art concept search result; when used to rerank the baseline results, it can improve over the top 20 automatic search runs in TRECVID 2006 on average by approx. 20%, on the most significant one by approx. 50%, all within 180 milliseconds on a normal PC.

References

[1]

Trecvid home page. http://www-nlpir.nist.gov/projects/trecvid.

[2]

A. Aizawa. An information-theoretic perspective of tf-idf measures. Information Processing and Management, 39:45--65, January 2003.

Digital Library

[3]

A. Amir, J. Argillandery, M. Campbell, A. Haubold, G. Iyengar, S. Ebadollahi, F. Kang, M. R. Naphade, A. P. Natsev, J. R. Smith, J. Tešić, and T. Volkmer. IBM research trecvid-2005 video retrieval system. In Proc. of TRECVID, 2005.

[4]

R. A. Baeza-Yates and B. A. Ribeiro-Neto. Modern Information Retrieval. ACM Press/Addison-Wesley, 1999.

Digital Library

[5]

H. Bay, T. Tuytelaars, and L. Gool. Surf: Speeded up robust features. In Proc. of ECCV, 2006.

Digital Library

[6]

M. Campbell and et al. IBM research trecvid-2006 video retrieval system. In Proc. Of TRECVID, 2006.

[7]

J. Cao, Y. Lan, J. Li, and et al. Tsinghua university at trecvid 2006. In Proc. of TRECVID, 2006.

[8]

S.-F. Chang, W. Hsu, W. Jiang, L. Kennedy, D. Xu, A. Yanagawa, and E. Zavesky. Evaluating the impact of 374 visualbased lscom concept detectors on automatic search. In Proc. Of TRECVID, 2006.

[9]

T.-S. Chua, S.-Y. Neo, K.-Y. Li, G. Wang, R. Shi, M. Zhao, and H. Xu. Trecvid 2004 search and feature extraction task by nus pris. In Proc. of TRECVID, 2004.

[10]

K. M. Donald and A. F. Smeaton. A comparison of score, rank and probability-based fusion methods for video shot retrieval. In Proc. of CIVR, 2005.

Digital Library

[11]

Y. Freund, R. Iyer, R. E. Schapire, and Y. Singer. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research, 4:933--969, 2003.

Digital Library

[12]

W. H. Hsu, L. S. Kennedy, and S.-F. Chang. Video search reranking via information bottleneck principle. In Proc. of ACM Multimedia 2006.

Digital Library

[13]

M. Naphade, J. R. Smith, J. Tesic, S.-F. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and J. Curtis. Large-scale concept ontology for multimedia. IEEE Multimedia Magazine, 2006.

Digital Library

[14]

M. R. Naphade, L. Kennedy, J. R. Kender, S.-F. Chang, J. R. Smith, P. Over, and A. H. A. A light scale concept ontology for multimedia understanding for trecvid 2005. In Proc. of TRECVID, 2005.

[15]

A. P. Natsev, M. R. Naphade, and J. Tesic. Learning the semantics of multimedia queries and concepts from a small number of examples. In Proc. of ACM Multimedia, 2005.

Digital Library

[16]

P. Natsev. IBM marvel for trecvid06 automatic search. In Proc. of TRECVID. 2006.

[17]

S.-Y. Neo, J. Zhao, M.-Y. Kan, and T.-S. Chua. Video retrieval using high level features: Exploiting query matching and confidence-based weighting. In Proc. of CIVR, 2006.

Digital Library

[18]

P. Over, T. Ianeva, W. Kraaij, and A. F. Smeaton. Trecvid 2005 - an overview. In Proc. Of TRECVID, 2005.

[19]

L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.

[20]

J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In Proc. of ACM SIGIR, 1998.

Digital Library

[21]

E. Seneta. Non-Negative Matrices and Markov Chains. Springer-Verlag, 1981.

[22]

C. G. Snoek and et al. The MediaMill trecvid 2006 semantic video search engine. In Proc. Of TRECVID, 2006.

[23]

C. G. Snoek, M. Worring, D. C. Koelma, and A. W. Smeulders. A learned lexicon-driven paradigm for interactive video retrieval. IEEE Trans. Multimeida, February 2007.

Digital Library

[24]

C. G. Snoek, M. Worring, J. C. van Gemert, J.-M. Geusebroek, and A. W. Smeulders. The challenge problem for automated detection of 101 semantic concepts in multimedia. In Proc. of ACM Multimedia, 2006.

Digital Library

[25]

V. N. Vapnik. The Nature of Statistical Learning Theory. Springer, 1995.

Digital Library

[26]

D. Wang, J. Li, and B. Zhang. Relay boost fusion for learning rare concepts in multimedia. In Proc. of CIVR, 2006.

Digital Library

[27]

R. Yan, A. Hauptmann, and R. Jin. Multimedia search with pseudo-relevance feedback. In Proc. of CIVR, 2003.

Digital Library

[28]

C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proc. of ACM SIGIR, 2001.

Digital Library

[29]

W. Zheng, J. Li, Z. Si, F. Lin, and B. Zhang. Using high-level semantic features in video retrieval. In Proc. of CIVR, 2006.

Digital Library

Cited By

Shi WZhuang QXue-Zhang Zhou YYang Y(2023)An image reranking algorithm based on discrete-time quantum walkMultimedia Tools and Applications10.1007/s11042-023-16916-383:12(34979-34994)Online publication date: 28-Sep-2023
https://doi.org/10.1007/s11042-023-16916-3
Long TMettes PShen HSnoek C(2020)Searching for Actions on the Hyperbole2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR42600.2020.00122(1138-1147)Online publication date: Jun-2020
https://doi.org/10.1109/CVPR42600.2020.00122
Mazaheri AGong BShah M(2018)Learning a Multi-Concept Video Retrieval Model with Multiple Latent VariablesACM Transactions on Multimedia Computing, Communications, and Applications10.1145/317664714:2(1-21)Online publication date: 25-Apr-2018
https://dl.acm.org/doi/10.1145/3176647
Show More Cited By

Index Terms

Video search in concept subspace: a text-like paradigm
1. Information systems
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Database theory
      1. Database query processing and optimization (theory)

Recommendations

Query representation by structured concept threads with application to interactive video retrieval

In this paper, we provide a new formulation for video queries as structured combination of concept threads, contributing to the general query-by-concept paradigm. Occupying a low-dimensional region in the concept space, concept thread defines a ranked ...
Learning concept bundles for video search with complex queries
MM '11: Proceedings of the 19th ACM international conference on Multimedia

Classifiers for primitive visual concepts like "car", "sky" have been well developed and widely used to support video search on simple queries. However, it is usually ineffective for complex queries like "one or more people at a table or desk with a ...
Optimizing video search reranking via minimum incremental information loss
MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrieval

This paper is concerned with video search reranking - the task of reordering the initial ranked documents (video shots) to improve the search performance - in an optimization framework. Conventional supervised reranking approaches empirically convert ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval

July 2007

655 pages

ISBN:9781595937339

DOI:10.1145/1282280

General Chairs:
Nicu Sebe
Univ. of Amsterdam, The Netherlands
,
Marcel Worring
Univ. of Amsterdam, The Netherlands

Copyright © 2007 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

In-Cooperation

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 July 2007

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

CIVR07

Sponsor:

SIGMM

CIVR07: International Conference on Image and Video Retrieval 2007

July 9 - 11, 2007

Amsterdam, The Netherlands

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

51
Total Citations
View Citations
446
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Shi WZhuang QXue-Zhang Zhou YYang Y(2023)An image reranking algorithm based on discrete-time quantum walkMultimedia Tools and Applications10.1007/s11042-023-16916-383:12(34979-34994)Online publication date: 28-Sep-2023
https://doi.org/10.1007/s11042-023-16916-3
Long TMettes PShen HSnoek C(2020)Searching for Actions on the Hyperbole2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR42600.2020.00122(1138-1147)Online publication date: Jun-2020
https://doi.org/10.1109/CVPR42600.2020.00122
Mazaheri AGong BShah M(2018)Learning a Multi-Concept Video Retrieval Model with Multiple Latent VariablesACM Transactions on Multimedia Computing, Communications, and Applications10.1145/317664714:2(1-21)Online publication date: 25-Apr-2018
https://dl.acm.org/doi/10.1145/3176647
Hamadi ALattar HKhoussa MSafadi B(2018)Using semantic context for multiple concepts detection in still imagesPattern Analysis and Applications10.1007/s10044-018-0761-9Online publication date: 2-Nov-2018
https://doi.org/10.1007/s10044-018-0761-9
Yamaguchi MSaito KUshiku YHarada T(2017)Spatio-Temporal Person Retrieval via Natural Language Queries2017 IEEE International Conference on Computer Vision (ICCV)10.1109/ICCV.2017.162(1462-1471)Online publication date: Oct-2017
https://doi.org/10.1109/ICCV.2017.162
Hamadi AMulhem PQuénot G(2016)A comparative study for multiple visual concepts detection in images and videosMultimedia Tools and Applications10.1007/s11042-015-2730-275:15(8973-8997)Online publication date: 1-Aug-2016
https://dl.acm.org/doi/10.1007/s11042-015-2730-2
Shirahama KGrzegorzek M(2016)Towards large-scale multimedia retrieval enriched by knowledge about human interpretationMultimedia Tools and Applications10.1007/s11042-014-2292-875:1(297-331)Online publication date: 1-Jan-2016
https://dl.acm.org/doi/10.1007/s11042-014-2292-8
Guadarrama SRodner ESaenko KDarrell T(2015)Understanding object descriptions in robotics by open-vocabulary object retrieval and detectionThe International Journal of Robotics Research10.1177/027836491560205935:1-3(265-280)Online publication date: 13-Oct-2015
https://doi.org/10.1177/0278364915602059
Shirahama KKumabuchi KGrzegorzek MUehara K(2015)Video Retrieval Based on Uncertain Concept Detection Using Dempster–Shafer TheoryMultimedia Data Mining and Analytics10.1007/978-3-319-14998-1_12(269-294)Online publication date: 1-Apr-2015
https://doi.org/10.1007/978-3-319-14998-1_12
Hamadi AMulhem PQuénot GKankanhalli MRueger SManmatha RJose Jvan Rijsbergen K(2014)Infrequent concept pairs detection in multimedia documentsProceedings of International Conference on Multimedia Retrieval10.1145/2578726.2578787(435-438)Online publication date: 1-Apr-2014
https://dl.acm.org/doi/10.1145/2578726.2578787
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten