Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2600428.2609578acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Preference preserving hashing for efficient recommendation

Published: 03 July 2014 Publication History

Abstract

Recommender systems usually need to compare a large number of items before users' most preferred ones can be found This process can be very costly if recommendations are frequently made on large scale datasets. In this paper, a novel hashing algorithm, named Preference Preserving Hashing (PPH), is proposed to speed up recommendation. Hashing has been widely utilized in large scale similarity search (e.g. similar image search), and the search speed with binary hashing code is significantly faster than that with real-valued features. However, one challenge of applying hashing to recommendation is that, recommendation concerns users' preferences over items rather than their similarities. To address this challenge, PPH contains two novel components that work with the popular matrix factorization (MF) algorithm. In MF, users' preferences over items are calculated as the inner product between the learned real-valued user/item features. The first component of PPH constrains the learning process, so that users' preferences can be well approximated by user-item similarities. The second component, which is a novel quantization algorithm,generates the binary hashing code from the learned real-valued user/item features. Finally, recommendation can be achieved efficiently via fast hashing code search. Experiments on three real world datasets show that the recommendation speed of the proposed PPH algorithm can be hundreds of times faster than original MF with real-valued features, and the recommendation accuracy is significantly better than previous work of hashing for recommendation.

References

[1]
J. Bennett and S. Lanning. The netflix prize. KDD Cup and Workshop, 2007.
[2]
S. Chen, B. Ma, and K. Zhang. On the similarity metric and the distance metric. Theoretical Computer Science, pages 2365--2376, 2009.
[3]
W. Chen, W. Hsu, and M. L. Lee. Modeling users' receptiveness over time for recommendation. SIGIR, pages 373--382, 2013.
[4]
A. Das, M. Datar, A. Garg, and S. Rajaram. Google news personalization: Scalable online collaborative filtering. WWW, pages 271--280, 2007.
[5]
R. Gemulla, P. Haas, E. Nijkamp, and Y. Sismanis. Large-scale matrix factorization with distributed stochastic gradient descent. KDD, pages 69--77, 2011.
[6]
A. Gionis, P. Indyk, and R. Motwani. Similarity search in high dimensions via hashing. VLDB, pages 518--529, 1999.
[7]
Y. Gong, S. Lazebnik, A. Gordo, and F. Perronnin. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. TPAMI, pages 2916--2929, 2012.
[8]
K. Grauman and R. Fergus. Learning binary hash codes for large-scale image search. MLCV, pages 49--87, 2013.
[9]
K. Järvelin and J. Kekäläinen. Ir evaluation methods for retrieving highly relevant documents. SIGIR, 2000.
[10]
A. Karatzoglou, A. Smola, and M. Weimer. Collaborative filtering on a budget. AISTAT, pages 389--396, 2010.
[11]
N. Koenigstein, P. Ram, and Y. Shavitt. Efficient retrieval of recommendations in a matrix factorization framework. CIKM, pages 535--544, 2012.
[12]
W. Kong, W. Li, and M. Guo. Manhattan hashing for large-scale image retrieval. SIGIR, pages 45--54, 2012.
[13]
Y. Koren. Factorization meets the neighborhood: a multifaceted collaborative filtering model. KDD, pages 426--434, 2008.
[14]
Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. IEEE Computer, pages 30--37, 2009.
[15]
B. Kulis and K. Grauman. Kernelized locality-sensitive hashing for scalable image search. ICCV, 2009.
[16]
M. Li, X. Chen, X. Li, B. Ma, and P. M. Vit$\acutea$nyi. The similarity metric. IEEE Trans on Information Theory, pages 3250--3264, 2004.
[17]
G. Linden, B. Smith, and J. York. Amazon.com recommendations item-to-item collaborative filtering. IEEE Internet Computing, pages 76--80, 2003.
[18]
T. Liu, A. W. Moore, A. Gray, and K. Yang. An investigation of practical approximate nearest neighbor algorithms. NIPS, 2005.
[19]
W. Liu, J. Wang, R. Ji, Y. Jiang, and S. Chang. Supervised hashing with kernels. CVPR, pages 2074--2081, 2012.
[20]
W. Liu, J. Wang, Y. Mu, S. Kumar, and S. Chang. Compact hyperplane hashing with bilinear functions. ICML, 2012.
[21]
M. Norouzi, A. Punjani, and D. J. Fleet. Fast search in hamming space with multi-index hashing. In CVPR, pages 3108--3115, 2012.
[22]
E. Ntoutsi, K. Stefanidis, K. Nørvåg, and H.-P. Kriegel. Fast group recommendations by applying user clustering. Int'l Conf. on Conceptual Modeling, pages 126--140, 2012.
[23]
S. Rendle, Z. Gantner, C. Freudenthaler, and L. Schmidt-Thieme. Fast context-aware recommendations with factorization machines. SIGIR, pages 635--644, 2011.
[24]
R. Salakhutdinov and G. Hinton. Semantic hashing. SIGIR, pages 969--978, 2007.
[25]
K. Sugiyama, K. Hatano, and M. Yoshikawa. Adaptive web search based on user profile constructed without any effort from users. WWW, pages 675--684, 2004.
[26]
M. N. Volkovs and R. S. Zemel. Collaborative ranking with 17 parameters. NIPS, pages 2303--2311, 2012.
[27]
J. Wang, S. Kumar, and S. Chang. Semi-supervised hashing for large-scale search. TPAMI, pages 2393--2406, 2012.
[28]
Q. Wang, L. Ruan, Z. Zhang, and L. Si. Learning compact hashing codes for efficient tag completion and prediction. CIKM, pages 1789--1794, 2013.
[29]
Q. Wang, D. Zhang, and L. Si. Semantic hashing using tags and topic modeling. SIGIR, pages 213--222, 2013.
[30]
M. Weimer, A. Karatzoglou, Q. Le, and A. Smola. $\textrmCOFI^RANK$: Maximum margin matrix factorization for collaborative ranking. NIPS, 2007.
[31]
Y. Weiss, A. Torralba, and R. Fergus. Spectral hashing. NIPS, 2008.
[32]
X. Yang, H. Steck, Y. Guo, and Y. Liu. On top-k recommendation using social networks. RecSys, pages 67--74, 2012.
[33]
H. Yu, C. Hsieh, S. Si, and I. Dhillon. Scalable coordinate descent approaches to parallel matrix factorization for recommender systems. ICDM, pages 765--774, 2012.
[34]
D. Zhang, F. Wang, and L. Si. Composite hashing with multiple information sources. SIGIR, pages 225--234, 2011.
[35]
D. Zhang, J. Wang, D. Cai, and J. Lu. Self-taught hashing for fast similarity search. SIGIR, pages 18--25, 2010.
[36]
G. Zhao, M. L. Lee, W. Hsu, and W. Chen. Increasing temporal diversity with purchase intervals. SIGIR, pages 165--174, 2012.
[37]
K. Zhou and H. Zha. Learning binary codes for collaborative filtering. KDD, pages 498--506, 2012.
[38]
P. Zigoris and Y. Zhang. Bayesian adaptive user profiling with explicit & implicit feedback. CIKM, pages 397--404, 2006.

Cited By

View all
  • (2024)Discrete Federated Multi-behavior Recommendation for Privacy-Preserving Heterogeneous One-Class Collaborative FilteringACM Transactions on Information Systems10.1145/365285342:5(1-50)Online publication date: 29-Apr-2024
  • (2024)Temporal Social Graph Network Hashing for Efficient RecommendationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.3352255(1-14)Online publication date: 2024
  • (2024)A Multi-View Double Alignment Hashing Network with Weighted Contrastive Learning2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687739(1-6)Online publication date: 15-Jul-2024
  • Show More Cited By

Index Terms

  1. Preference preserving hashing for efficient recommendation

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval
      July 2014
      1330 pages
      ISBN:9781450322577
      DOI:10.1145/2600428
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 03 July 2014

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. efficiency
      2. hashing
      3. preference
      4. recommendation

      Qualifiers

      • Research-article

      Conference

      SIGIR '14
      Sponsor:

      Acceptance Rates

      SIGIR '14 Paper Acceptance Rate 82 of 387 submissions, 21%;
      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)16
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 26 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Discrete Federated Multi-behavior Recommendation for Privacy-Preserving Heterogeneous One-Class Collaborative FilteringACM Transactions on Information Systems10.1145/365285342:5(1-50)Online publication date: 29-Apr-2024
      • (2024)Temporal Social Graph Network Hashing for Efficient RecommendationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.3352255(1-14)Online publication date: 2024
      • (2024)A Multi-View Double Alignment Hashing Network with Weighted Contrastive Learning2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687739(1-6)Online publication date: 15-Jul-2024
      • (2023)Discrete Listwise Content-aware RecommendationACM Transactions on Knowledge Discovery from Data10.1145/360933418:1(1-20)Online publication date: 10-Aug-2023
      • (2023)Multi-modal Discrete Collaborative FilteringMulti-modal Hash Learning10.1007/978-3-031-37291-9_5(145-195)Online publication date: 5-Aug-2023
      • (2022)Explainable discrete Collaborative FilteringIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3185093(1-14)Online publication date: 2022
      • (2021)Efficient Retrieval of Matrix Factorization-Based Top-k RecommendationsJournal of Artificial Intelligence Research10.1613/jair.1.1240370(1441-1479)Online publication date: 1-May-2021
      • (2021)A Survey on Stream-Based Recommender SystemsACM Computing Surveys10.1145/345344354:5(1-36)Online publication date: 25-May-2021
      • (2021)xLightFM: Extremely Memory-Efficient Factorization MachineProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3404835.3462941(337-346)Online publication date: 11-Jul-2021
      • (2021)Multi-modal Discrete Collaborative Filtering for Efficient Cold-start RecommendationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.3079581(1-1)Online publication date: 2021
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media