Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3007669.3007678acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

SimH: A Supervised Cross-View Hashing Framework Preserving Semantic Similarities in Hamming Space

Published: 19 August 2016 Publication History

Abstract

To tackle the scalability issues for cross-view retrieval on large-scale databases, in this paper we propose a supervised cross-view hashing framework termed SimH that can well preserve semantic similarities of objects in Hamming space. The proposed SimH generates one unified hash code for all views of an object. For off-line training, SimH firstly exploits the similarity matrix of training objects to learn their corresponding similarity preserving hash codes and then learns hash functions for each view to map features into hash codes, which can be open for any predictive model. Afterwards, the hash codes learnt during training are discarded. For online hash encoding, given an unseen object, learnt hash functions in each of its observed views will firstly predict view-specific hashing results and then a novel expected value based combining strategy is utilized to merge them and determine the unified hash code. Experiments on benchmark datasets show that SimH outperforms several state-of-the-art cross-view hashing methods.

References

[1]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003.
[2]
M. Bronstein, A. Bronstein, F. Michel, and N. Paragios. Data fusion through cross-modality metric learning using similarity-sensitive hashing. In CVPR, 2010.
[3]
T.-S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y. Zheng. Nus-wide: A real-world web image database from national university of singapore. In CIVR, 2009.
[4]
J. Costa Pereira, E. Coviello, G. Doyle, N. Rasiwasia, G. Lanckriet, R. Levy, and N. Vasconcelos. On the role of correlation and abstraction in cross-modal multimedia retrieval. TPAMI, 36(3):521--535, 2014.
[5]
G. Ding, Y. Guo, and J. Zhou. Collective matrix factorization hashing for multimodal data. In IEEE Conference on Computer Vision and Pattern Recognition, 2014.
[6]
G. H. Golub and H. A. van der Vorst. Eigenvalue computation in the 20th century. Journal of Computational and Applied Mathematics, 123(1-2):35--65, 2000.
[7]
Y. Guo, G. Ding, Y. Gao, and J. Wang. Semi-supervised active learning with cross-class sample transfer. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pages 1526--1532, 2016.
[8]
Y. Guo, G. Ding, J. Han, and X. Jin. Robust iterative quantization for efficient ℓp-norm similarity search. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pages 3382--3388, 2016.
[9]
Y. Guo, G. Ding, X. Jin, and J. Wang. Transductive zero-shot recognition via shared model space learning. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, pages 3434--3500, 2016.
[10]
Y. Guo, G. Ding, and J. Zhou. Robust nonnegative matrix factorization with discriminability for image representation. In 2015 IEEE International Conference on Multimedia and Expo, pages 1--6, 2015.
[11]
Y. Guo, G. Ding, J. Zhou, and Q. Liu. Robust and discriminative concept factorization for image representation. In International Conference on Multimedia Retrieval, pages 115--122, 2015.
[12]
M. Hu, Y. Chen, and J.-Y. Kwok. Building sparse multiple-kernel svm classifiers. IEEE Transactions on Neural Networks, 20(5):827--839, 2009.
[13]
M. J. Huiskes and M. S. Lew. The mir flickr retrieval evaluation. In MIR, 2008.
[14]
I. Jolliffe. Principal Component Analysis. Springer Verlag, 1986.
[15]
S. Kumar and R. Udupa. Learning hash functions for cross-view similarity search. In IJCAI, 2011.
[16]
W. Liu, J. Wang, R. Ji, Y.-G. Jiang, and S.-F. Chang. Supervised hashing with kernels. In CVPR, 2012.
[17]
J. Song, Y. Yang, Y. Yang, Z. Huang, and H. T. Shen. Inter-media hashing for large-scale retrieval from heterogeneous data sources. In SIGMOD, 2013.
[18]
F. Wu, Z. Yu, Y. Yang, S. Tang, Y. Zhang, and Y. Zhuang. Sparse multi-modal hashing. TMM, 16(2):427--139, 2014.
[19]
Z. Yu, F. Wu, Y. Yang, Q. Tian, J. Luo, and Y. Zhuang. Discriminative coupled dictionary hashing for fast cross-media retrieval. In SIGIR, 2014.
[20]
Z. Yu, Y. Zhang, S. Tang, Y. Yang, Q. Tian, and J. Luo. Cross-media hashing with kernel regression. In IEEE International Conference on Multimedia and Expo, 2014.
[21]
D. Zhai, H. Chang, Y. Zhen, X. Liu, X. Chen, and W. Gao. Parametric local multimodal hashing for cross-view similarity search. In IJCAI, 2013.
[22]
D. Zhang and W.-J. Li. Large-scale supervised multimodal hashing with semantic correlation maximization. In AAAI Conference on Artificial Intelligence, 2014.
[23]
D. Zhang, J. Wang, D. Cai, and J. Lu. Self-taught hashing for fast similarity search. In SIGIR, 2010.
[24]
Y. Zhen and D.-Y. Yeung. Co-regularized hashing for multimodal data. In NIPS, 2012.
[25]
Y. Zhen and D.-Y. Yeung. A probabilistic model for multimodal hash function learning. In SIGKDD, 2012.
[26]
J. Zhou, G. Ding, and Y. Guo. Latent semantic sparse hashing for cross-modal similarity search. In SIGIR, 2014.
[27]
J. Zhou, G. Ding, Y. Guo, Q. Liu, and X. Dong. Kernel-based supervised hashing for cross-view similarity search. In IEEE International Conference on Multimedia and Expo, 2014.

Cited By

View all
  • (2018)Semantic Neighbor Graph Hashing for Multimodal RetrievalIEEE Transactions on Image Processing10.1109/TIP.2017.277674527:3(1405-1417)Online publication date: Mar-2018
  1. SimH: A Supervised Cross-View Hashing Framework Preserving Semantic Similarities in Hamming Space

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICIMCS'16: Proceedings of the International Conference on Internet Multimedia Computing and Service
    August 2016
    360 pages
    ISBN:9781450348508
    DOI:10.1145/3007669
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • Xidian University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 August 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Cross-view
    2. Hash
    3. Retrieval

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICIMCS'16

    Acceptance Rates

    ICIMCS'16 Paper Acceptance Rate 77 of 118 submissions, 65%;
    Overall Acceptance Rate 163 of 456 submissions, 36%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 04 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Semantic Neighbor Graph Hashing for Multimodal RetrievalIEEE Transactions on Image Processing10.1109/TIP.2017.277674527:3(1405-1417)Online publication date: Mar-2018

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media