Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3240876.3240898acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

Deep hashing with multilevel similarity learning for multimedia similarity search

Published: 17 August 2018 Publication History

Abstract

In this work, we propose a novel deep multimodal hashing method, termed as Deep Hashing with Multilevel Similarity Learning (DHMSL), which learns discriminative hash functions with deep neural networks by exploiting multilevel semantic similarity correlations of multimedia data. Firstly, we construct multilevel similarity correlation by jointly exploiting the local structure and semantic label information. Then, the unified binary codes are learned by preserving the multilevel similarity correlations as well as incorporating the bit balance and quantization error properties. Besides that, two deep neural networks are jointly trained to learn two sets of nonlinear hash functions by minimizing the errors of unified binary codes and outputs of the networks. We conduct experiments on two widely-used multimodal datasets, and the proposed DHMSL method can achieve the state-of-the-art performance compared with the baselines for both image-query-text and text-query-image tasks.

References

[1]
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore. In ACM ICIVR.
[2]
Guiguang Ding, Yuchen Guo, and Jile Zhou. 2014. Collective matrix factorization hashing for multimodal data. In CVPR. 2075--2082.
[3]
Mark J Huiskes and Michael S Lew. 2008. The MIR flickr retrieval evaluation. In ACM ICMIR. 39--43.
[4]
Shaishav Kumar and Raghavendra Udupa. 2011. Learning hash functions for cross-view similarity search. In IJCAI. 1360--1365.
[5]
Kai Li, Guojun Qi, Jun Ye, and Kien Hua. 2017. Linear subspace ranking hashing for cross-modal retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 9 (2017), 1825--1838.
[6]
Zechao Li, Jinhui Tang, and Tao Mei. 2018. Deep collaborative embedding for social image understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence (2018).
[7]
Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval. In CVPR. 3864--3872.
[8]
Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. 2012. Supervised hashing with kernels. In CVPR. 2074--2081.
[9]
Maxime Oquab, Leon Bottou, Ivan Laptev, and Josef Sivic. 2014. Learning and transferring mid-level image representations using convolutional neural networks. In CVPR. 1717--1724.
[10]
Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen. 2013. Inter-media hashing for large-scale retrieval from heterogeneous data sources. In ACM SIGMOD. 785--796.
[11]
Di Wang, Xinbo Gao, Xiumei Wang, and Lihuo He. 2015. Semantic Topic Multimodal Hashing for Cross-Media Retrieval. In IJCAI. 3890--3896.
[12]
Dongqing Zhang and Wu-Jun Li. 2014. Large-Scale Supervised Multimodal Hashing with Semantic Correlation Maximization. In AAAI. 2177--2183.
[13]
Jile Zhou, Guiguang Ding, and Yuchen Guo. 2014. Latent semantic sparse hashing for cross-modal similarity search. In ACM SIGIR. 415--424.

Cited By

View all
  • (2024)Study on Content‐Based Image RetrievalIntegrating Metaheuristics in Computer Vision for Real‐World Optimization Problems10.1002/9781394230952.ch15(253-272)Online publication date: 31-Jul-2024
  • (2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
  • (2022)Improve Deep Unsupervised Hashing via Structural and Intrinsic Similarity LearningIEEE Signal Processing Letters10.1109/LSP.2022.314867429(602-606)Online publication date: 2022

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICIMCS '18: Proceedings of the 10th International Conference on Internet Multimedia Computing and Service
August 2018
243 pages
ISBN:9781450365208
DOI:10.1145/3240876
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 August 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. deep neural networks
  2. hashing and multilevel similarity correlation measurement
  3. multimedia similarity search

Qualifiers

  • Research-article

Funding Sources

Conference

ICIMCS'18

Acceptance Rates

ICIMCS '18 Paper Acceptance Rate 46 of 116 submissions, 40%;
Overall Acceptance Rate 163 of 456 submissions, 36%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Study on Content‐Based Image RetrievalIntegrating Metaheuristics in Computer Vision for Real‐World Optimization Problems10.1002/9781394230952.ch15(253-272)Online publication date: 31-Jul-2024
  • (2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
  • (2022)Improve Deep Unsupervised Hashing via Structural and Intrinsic Similarity LearningIEEE Signal Processing Letters10.1109/LSP.2022.314867429(602-606)Online publication date: 2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media