Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2393347.2393363acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Harvesting visual concepts for image search with complex queries

Published: 29 October 2012 Publication History

Abstract

The use of image reranking to boost retrieval performance has been found to be successful for simple queries. It is, however, less effective for complex queries due to the widened semantic gap. This paper presents a scheme to enhance web image reranking for complex queries by fully exploring the information from simple visual concepts. Given a complex query, our scheme first detects the noun-phrase based visual concepts and crawls their top ranked images from popular image search engines. Next, it constructs a heterogeneous probabilistic network to model the relatedness between the complex query and each of its crawled images. The network seamlessly integrates three layers of relationships, i.e., the semantic-level, cross-modality level as well as visual-level. These mutually reinforced layers are established among the complex query and its involved visual concepts, by harnessing the contents of images and their associated textual cues. Based on the derived relevance scores, a new ranking list is generated. Extensive evaluations on a real-world dataset demonstrate that our model is able to characterize the complex queries well and achieve promising performance as compared to the state-of-the-art methods. Based on the proposed scheme, we introduce two applications: photo-based question answering and textual news visualization. Comprehensive experiments well validate the proposed scheme.

References

[1]
Hitwise. see http://weblogs.hitwise.com/alan-long/2009/11/searches_getting_longer.html.
[2]
R. Aly, D. Hiemstra, and R. Ordelman. Building detectors to support searches on combined semantic concepts. In MIR Workshop, 2007.
[3]
J. Bai, D. Song, P. Bruza, J.-Y. Nie, and G. Cao. Query expansion using term relationships in language models for information retrieval. In CIKM, 2005.
[4]
N. Balasubramanian, G. Kumaran, and V. Carvalho. Exploring reductions for long web queries. In SIGIR, 2010.
[5]
M. Bendersky and W. B. Croft. Discovering key concepts in verbose queries. In SIGIR, 2008.
[6]
M. Bendersky, D. Metzler, and W. B. Croft. Learning concept importance using a weighted dependence model. In WSDM, 2010.
[7]
M. Bendersky, D. Metzler, and W. B. Croft. Parameterized concept weighting in verbose queries. In SIGIR, 2011.
[8]
X. Chen, J. Yuan, L. Nie, Z.-j. Zha, S. Yan, and T.-s. Chua. Known-item search by nus. In NIST TRECVID, 2010.
[9]
T.-S. Chua, R. Hong, G. Li, and J. Tang. From text question-answering to multimedia qa on web-scale media resources. In LS-MMRM, 2009.
[10]
R. L. Cilibrasi and P. M. B. Vitanyi. The google similarity distance. TKDE, 2007.
[11]
D. Delgado, J. Magalhaes, and N. Correia. Assisted news reading with automated illustration. In MM, 2010.
[12]
W. H. Hsu, L. S. Kennedy, and S.-F. Chang. Video search reranking through random walk over document-level context graph. In MM, 2007.
[13]
G. Kumaran and J. Allan. A case for shorter queries, and helping users create them. In NAACL-HLT, 2007.
[14]
G. Kumaran and J. Allan. Effective and efficient user interaction for long queries. In SIGIR, 2008.
[15]
M. Lease, J. Allan, and W. B. Croft. Regression rank: Learning to meet the opportunity of descriptive queries. In ECIR, 2009.
[16]
X. Li, C. G. M. Snoek, M. Worring, and A. W. M. Smeulders. Harvesting social images for bi-concept search. TMM, 2012.
[17]
Z. Li, M. Wang, J. Liu, C. Xu, and H. Lu. News contextualization with geographic and visual information. In MM, 2011.
[18]
D. Liu, X.-S. Hua, M. Wang, and H.-J. Zhang. Image retagging. In MM, 2010.
[19]
D. Liu, X.-S. Hua, L. Yang, M. Wang, and H.-J. Zhang. Tag ranking. In WWW, 2009.
[20]
Y. Liu, T. Mei, X.-S. Hua, J. Tang, X. Wu, and S. Li. Learning to video search rerank via pseudo preference feedback. In ICME, 2008.
[21]
M. C. Mihai Surdeanu and H. Zaragoza. Learning to rank answers on large online qa collections. In ACL, 2008.
[22]
N. Morioka and J. Wang. Robust visual reranking via sparsity and ranking constraints. In MM, 2011.
[23]
A. P. Natsev, M. R. Naphade, and J. Te\vsiĆ. Learning the semantics of multimedia queries and concepts from a small number of examples. In MM, 2005.
[24]
L. Nie, M. Wang, Z.-j. Zha, G. Li, and T.-S. Chua. Multimedia answering: enriching text qa with media information. In SIGIR, 2011.
[25]
L. N. Nie, M. Wang, Z. Zha, and T.-S. Chua. Oracle in image search: A content-based approach to performance prediction. TOIS, 2012.
[26]
J. H. Park and W. B. Croft. Query term ranking based on dependency parsing of verbose queries. In SIGIR, 2010.
[27]
S. U. Pillai, T. Suel, and S. Cha. The Perron-Frobenius theorem: some of its applications. Signal Processing Magazine, IEEE, 2005.
[28]
H.-T. Pu. An analysis of failed queries for web image retrieval. Journal of Information Science, 2008.
[29]
M. Szummer and T. Jaakkola. Partially labeled classification with markov random walks. NIPS, 2002.
[30]
X. Tian, L. Yang, J. Wang, Y. Yang, X. Wu, and X.-S. Hua. Bayesian video search reranking. In MM, 2008.
[31]
A. Torralba, R. Fergus, and W. Freeman. 80 million tiny images: A large data set for nonparametric object and scene recognition. TPAMI, 2008.
[32]
M. Wang, K. Yang, X.-S. Hua, and H.-J. Zhang. Towards a relevant and diverse search of social images. TMM, 2010.
[33]
H. Xu, J. Wang, X.-S. Hua, and S. Li. Image search by concept map. In SIGIR, 2010.
[34]
R. Yan, E. Hauptmann, and R. Jin. Multimedia search with pseudo-relevance feedback. In ICVR, 2003.
[35]
J. Yuan, Z.-J. Zha, Y.-T. Zheng, W. Meng, X. Zhou, and T.-S. Chua. Utilizing related samples to enhance interactive concept-based video search. TMM, 2011.
[36]
J. Yuan, Z.-J. Zha, Y.-T. Zheng, M. Wang, X. Zhou, and T.-S. Chua. Learning concept bundles for video search with complex queries. In MM, 2011.

Cited By

View all
  • (2024)Data reweighting net for web fine-grained image classificationMultimedia Tools and Applications10.1007/s11042-024-18598-x83:33(79985-80005)Online publication date: 2-Mar-2024
  • (2023)AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/358470319:6(1-22)Online publication date: 20-Feb-2023
  • (2023)Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual RecognitionIEEE Transactions on Multimedia10.1109/TMM.2022.318143925(4691-4703)Online publication date: 2023
  • Show More Cited By

Index Terms

  1. Harvesting visual concepts for image search with complex queries

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '12: Proceedings of the 20th ACM international conference on Multimedia
      October 2012
      1584 pages
      ISBN:9781450310895
      DOI:10.1145/2393347
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 29 October 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. complex query
      2. image search
      3. news visualization
      4. photo-based qa

      Qualifiers

      • Research-article

      Conference

      MM '12
      Sponsor:
      MM '12: ACM Multimedia Conference
      October 29 - November 2, 2012
      Nara, Japan

      Acceptance Rates

      Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)12
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 14 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Data reweighting net for web fine-grained image classificationMultimedia Tools and Applications10.1007/s11042-024-18598-x83:33(79985-80005)Online publication date: 2-Mar-2024
      • (2023)AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/358470319:6(1-22)Online publication date: 20-Feb-2023
      • (2023)Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual RecognitionIEEE Transactions on Multimedia10.1109/TMM.2022.318143925(4691-4703)Online publication date: 2023
      • (2022)Retrieving Black-box Optimal Images from External DatabasesProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3498462(879-887)Online publication date: 11-Feb-2022
      • (2022)Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2021.305376624(466-479)Online publication date: 2022
      • (2022)Video Moment Retrieval With Cross-Modal Neural Architecture SearchIEEE Transactions on Image Processing10.1109/TIP.2022.314061131(1204-1216)Online publication date: 2022
      • (2021)Boosting Temporal Binary Coding for Large-Scale Video SearchIEEE Transactions on Multimedia10.1109/TMM.2020.297859323(353-364)Online publication date: 2021
      • (2020)Model Optimization Boosting Framework for Linear Model Hash LearningIEEE Transactions on Image Processing10.1109/TIP.2020.297057729(4254-4268)Online publication date: 2020
      • (2020)HDMFH: Hypergraph Based Discrete Matrix Factorization Hashing for Multimodal RetrievalICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP40776.2020.9054516(1923-1927)Online publication date: May-2020
      • (2020)Fractional Local Neighborhood Intensity Pattern for Image Retrieval using Genetic AlgorithmMultimedia Tools and Applications10.1007/s11042-020-08752-6Online publication date: 5-Mar-2020
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media