
SimH: A Supervised Cross-View Hashing Framework Preserving Semantic Similarities in Hamming Space

Authors:

Hu Weijin

ICIMCS'16: Proceedings of the International Conference on Internet Multimedia Computing and Service

Pages 217 - 222

https://doi.org/10.1145/3007669.3007678

Published: 19 August 2016

Abstract

To address the scalability of cross-view retrieval on large-scale databases, we propose SimH, a supervised cross-view hashing framework that preserves the semantic similarities of objects in Hamming space. SimH generates one unified hash code for all views of an object. In offline training, SimH first uses the similarity matrix of the training objects to learn similarity-preserving hash codes, and then learns a hash function for each view that maps features to those codes; any predictive model can serve as the hash function. In online encoding, given an unseen object, the learnt hash function of each observed view first predicts a view-specific hashing result, and a novel expected-value-based combining strategy then merges these results into the unified hash code. Experiments on benchmark datasets show that SimH outperforms several state-of-the-art cross-view hashing methods.
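The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual formulation, so every concrete choice here is an assumption made for illustration: the spectral relaxation in `learn_codes`, the ridge-regression hash functions in `fit_view`, the sigmoid used to turn scores into per-bit probabilities, and the uniform averaging in `combine` as a stand-in for the expected-value-based combining strategy. All function names are hypothetical.

```python
import numpy as np

def learn_codes(S, n_bits):
    """Stage 1 (assumed): binarize the top eigenvectors of the symmetric
    similarity matrix S so that similar objects tend to share bits."""
    eigvals, eigvecs = np.linalg.eigh(S)              # eigenvalues in ascending order
    top = eigvecs[:, np.argsort(eigvals)[::-1][:n_bits]]
    return (top > 0).astype(np.int8)                  # codes in {0, 1}

def fit_view(X, B, reg=1e-3):
    """Stage 2 (assumed): one ridge-regression hash function per view,
    mapping that view's features to the learnt codes."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])     # append a bias column
    T = 2.0 * B - 1.0                                 # regression targets in {-1, +1}
    return np.linalg.solve(Xb.T @ Xb + reg * np.eye(Xb.shape[1]), Xb.T @ T)

def predict_proba(W, x):
    """View-specific result: a pseudo-probability that each bit equals 1."""
    score = np.append(x, 1.0) @ W
    return 1.0 / (1.0 + np.exp(-score))

def combine(probas):
    """Assumed combining rule: average the per-view probabilities (the
    expected bit value under equal view weights) and threshold at 0.5."""
    expected = np.mean(probas, axis=0)
    return (expected > 0.5).astype(np.int8)

def encode(models, views):
    """Unified code for an unseen object, using only its observed views.
    `models` and `views` are dicts keyed by view name."""
    probas = [predict_proba(models[name], x) for name, x in views.items()]
    return combine(probas)
```

In this sketch an object observed in only one view is encoded by passing a single-entry `views` dict, and the averaging then reduces to that view's prediction. Because the abstract states that the hash functions are open to any predictive model, `fit_view` and `predict_proba` could be swapped for another predictor that outputs per-bit probabilities.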

Cited By

Jin L, Li K, Hu H, Qi G, Tang J (2018). Semantic Neighbor Graph Hashing for Multimodal Retrieval. IEEE Transactions on Image Processing, 27(3):1405-1417. Online publication date: Mar-2018.
https://doi.org/10.1109/TIP.2017.2776745

Published In

ICIMCS'16: Proceedings of the International Conference on Internet Multimedia Computing and Service

August 2016

360 pages

ISBN:9781450348508

DOI:10.1145/3007669

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

In-Cooperation

Xidian University

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

ICIMCS'16

ICIMCS'16: International Conference on Internet Multimedia Computing and Service

August 19 - 21, 2016

Xi'an, China

Acceptance Rates

ICIMCS'16 Paper Acceptance Rate 77 of 118 submissions, 65%;

Overall Acceptance Rate 163 of 456 submissions, 36%
