DOI: 10.1145/3240876.3240898
research-article

Deep hashing with multilevel similarity learning for multimedia similarity search

Published: 17 August 2018

Abstract

In this work, we propose a novel deep multimodal hashing method, termed Deep Hashing with Multilevel Similarity Learning (DHMSL), which learns discriminative hash functions with deep neural networks by exploiting multilevel semantic similarity correlations in multimedia data. First, we construct a multilevel similarity correlation by jointly exploiting local structure and semantic label information. Then, unified binary codes are learned by preserving the multilevel similarity correlations while incorporating bit balance and quantization error properties. In addition, two deep neural networks are jointly trained to learn two sets of nonlinear hash functions by minimizing the error between the unified binary codes and the network outputs. We conduct experiments on two widely used multimodal datasets, and the proposed DHMSL method achieves state-of-the-art performance compared with the baselines on both image-query-text and text-query-image tasks.
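The abstract names several ingredients without giving formulas, so the following is only a minimal NumPy sketch of how such quantities are commonly computed in cross-modal hashing, not the DHMSL objective itself. The blend weight `alpha`, the k-nearest-neighbour definition of local structure, the summation of the two network outputs into one code, and all function names are assumptions made for illustration.

```python
import numpy as np

def multilevel_similarity(features, labels, k=2, alpha=0.5):
    """Blend label agreement with k-NN local structure (an assumed form)."""
    # Semantic level: 1 if two samples share at least one label, else 0.
    semantic = (labels @ labels.T > 0).astype(float)
    # Local level: 1 if j is among the k nearest neighbours of i (cosine).
    unit = features / np.linalg.norm(features, axis=1, keepdims=True)
    cosine = unit @ unit.T
    np.fill_diagonal(cosine, -np.inf)  # exclude self from neighbour search
    neighbours = np.argsort(-cosine, axis=1)[:, :k]
    local = np.zeros_like(semantic)
    np.put_along_axis(local, neighbours, 1.0, axis=1)
    local = np.maximum(local, local.T)  # symmetrise the neighbour graph
    np.fill_diagonal(local, 1.0)
    return alpha * semantic + (1.0 - alpha) * local

def binarize(outputs):
    """Map real-valued outputs to codes in {-1, +1}."""
    return np.where(outputs >= 0, 1, -1)

def quantization_error(outputs, codes):
    """Squared gap between real-valued outputs and their binary codes."""
    return float(np.sum((outputs - codes) ** 2))

def bit_balance_penalty(codes):
    """Zero when every bit is +1 for exactly half of the samples."""
    return float(np.sum(codes.sum(axis=0) ** 2))

def hamming_distance(query_code, database_codes):
    """Hamming distance for {-1, +1} codes via the inner product."""
    bits = database_codes.shape[1]
    return (bits - database_codes @ query_code) // 2

# Toy usage: random stand-ins for the outputs of an image and a text network.
rng = np.random.default_rng(0)
image_out = rng.standard_normal((6, 8))
text_out = image_out + 0.1 * rng.standard_normal((6, 8))
unified = binarize(image_out + text_out)  # assumed way to form one shared code

# Image-query-text: rank text items by distance to the first image's code.
text_codes = binarize(text_out)
ranking = np.argsort(hamming_distance(binarize(image_out[0]), text_codes))
print(ranking, quantization_error(image_out, unified), bit_balance_penalty(unified))
```

In a trained system the two networks would be optimized so that their outputs agree with the unified codes; here the random arrays only show how the pieces fit together at retrieval time.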



      Published In

      ICIMCS '18: Proceedings of the 10th International Conference on Internet Multimedia Computing and Service
      August 2018
      243 pages
      ISBN: 9781450365208
      DOI: 10.1145/3240876

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Author Tags

      1. deep neural networks
      2. hashing and multilevel similarity correlation measurement
      3. multimedia similarity search

      Qualifiers

      • Research-article

      Conference

      ICIMCS'18

      Acceptance Rates

      ICIMCS '18 Paper Acceptance Rate 46 of 116 submissions, 40%;
      Overall Acceptance Rate 163 of 456 submissions, 36%
