Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3343031.3350953acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

User Diverse Preference Modeling by Multimodal Attentive Metric Learning

Published: 15 October 2019 Publication History

Abstract

Most existing recommender systems represent a user's preference with a feature vector, which is assumed to be fixed when predicting this user's preferences for different items. However, the same vector cannot accurately capture a user's varying preferences on all items, especially when considering the diverse characteristics of various items. To tackle this problem, in this paper, we propose a novel Multimodal Attentive Metric Learning (MAML) method to model user diverse preferences for various items. In particular, for each user-item pair, we propose an attention neural network, which exploits the item's multimodal features to estimate the user's special attention to different aspects of this item. The obtained attention is then integrated into a metric-based learning method to predict the user preference on this item. The advantage of metric learning is that it can naturally overcome the problem of dot product similarity, which is adopted by matrix factorization (MF) based recommendation models but does not satisfy the triangle inequality property. In addition, it is worth mentioning that the attention mechanism cannot only help model user's diverse preferences towards different items, but also overcome the geometrically restrictive problem caused by collaborative metric learning. Extensive experiments on large-scale real-world datasets show that our model can substantially outperform the state-of-the-art baselines, demonstrating the potential of modeling user diverse preference for recommendation.

References

[1]
Yoram Bachrach, Yehuda Finkelstein, Ran Gilad-Bachrach, Liran Katzir, Noam Koenigstein, Nir Nice, and Ulrich Paquet. 2014. Speeding up the xbox recommender system using a euclidean transformation for inner-product spaces. In RecSys. ACM, 257--264.
[2]
Robert M Bell and Yehuda Koren. 2007. Lessons from the Netflix prize challenge. SIGKDD Explorations Newsletter, Vol. 9, 2 (2007), 75--79.
[3]
Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2013. Representation learning: A review and new perspectives. TPAMI, Vol. 35, 8 (2013), 1798--1828.
[4]
Rose Catherine and William Cohen. 2017. TransNets: Learning to Transform for Recommendation. In RecSys. ACM.
[5]
Chong Chen, Min Zhang, Yiqun Liu, and Shaoping Ma. 2018a. Neural attentional rating regression with review-level explanations. In WWW. IW3C2, 1583--1592.
[6]
Jingyuan Chen, Hanwang Zhang, Xiangnan He, Liqiang Nie, Wei Liu, and Tat-Seng Chua. 2017. Attentive collaborative filtering: Multimedia recommendation with item-and component-level attention. In SIGIR. ACM, 335--344.
[7]
Jingwu Chen, Fuzhen Zhuang, Xin Hong, Xiang Ao, Xing Xie, and Qing He. 2018b. Attention-driven Factor Model for Explainable Personalized Recommendation. In SIGIR. ACM, 909--912.
[8]
Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, et almbox. 2016. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. ACM, 7--10.
[9]
Zhiyong Cheng, Xiaojun Chang, Lei Zhu, Rose C Kanjirathinkal, and Mohan Kankanhalli. 2019. MMALFM: Explainable recommendation by leveraging reviews and images. TOIS, Vol. 37, 2 (2019), 16.
[10]
Zhiyong Cheng, Ying Ding, Xiangnan He, Lei Zhu, Xuemeng Song, and Mohan S Kankanhalli. 2018a. A$^3$NCF: An Adaptive Aspect Attention Model for Rating Prediction. In IJCAI. AAAI Press, 3748--3754.
[11]
Zhiyong Cheng, Ying Ding, Lei Zhu, and Kankanhalli Mohan. 2018b. Aspect-aware latent factor model: Rating prediction with ratings and reviews. In WWW. IW3C2, 639--648.
[12]
Jin Yao Chin, Kaiqi Zhao, Shafiq Joty, and Gao Cong. 2018. ANR: Aspect-based Neural Recommender. In CIKM. ACM, 147--156.
[13]
Michael Cogswell, Faruk Ahmed, Ross Girshick, Larry Zitnick, and Dhruv Batra. 2015. Reducing overfitting in deep networks by decorrelating representations. arXiv e-prints (2015).
[14]
Travis Ebesu, Bin Shen, and Yi Fang. 2018. Collaborative memory network for recommendation systems. In SIGIR. ACM, 515--524.
[15]
Li Gao, Hong Yang, Jia Wu, Chuan Zhou, Weixue Lu, and Yue Hu. 2018. Recommendation with multi-source heterogeneous information. In IJCAI. AAAI Press, 3378--3384.
[16]
Xavier Glorot, Antoine Bordes, and Yoshua Bengio. 2011. Deep sparse rectifier neural networks. In AISTATS. JMLR, 315--323.
[17]
Yangyang Guo, Zhiyong Cheng, Liqiang Nie, Xin-Shun Xu, and Mohan Kankanhalli. 2018. Multi-modal preference modeling for product search. In MM. ACM, 1865--1873.
[18]
Ruining He and Julian McAuley. 2016. VBPR: visual bayesian personalized ranking from implicit feedback. In AAAI. AAAI Press, 144--150.
[19]
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In WWW. IW3C2, 173--182.
[20]
Cheng-Kang Hsieh, Longqi Yang, Yin Cui, Tsung-Yi Lin, Serge Belongie, and Deborah Estrin. 2017. Collaborative metric learning. In WWW. IW3C2, 193--201.
[21]
Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative filtering for implicit feedback datasets. In ICDM. IEEE, 263--272.
[22]
Kalervo J"arvelin and Jaana Kek"al"ainen. 2002. Cumulated gain-based evaluation of IR techniques. TOIS, Vol. 20, 4 (2002), 422--446.
[23]
Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional architecture for fast feature embedding. In MM. ACM, 675--678.
[24]
Yannis Kalantidis, Lyndon Kennedy, and Li-Jia Li. 2013. Getting the look: clothing recognition and segmentation for automatic product suggestions in everyday photos. In ICMR. ACM, 105--112.
[25]
Mohammad Khoshneshin and W Nick Street. 2010. Collaborative filtering via euclidean embedding. In RecSys. ACM, 87--94.
[26]
Diederik P Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In ICLR .
[27]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer, Vol. 42, 8 (2009), 42--49.
[28]
Quoc Le and Tomas Mikolov. 2014. Distributed Representations of Sentences and Documents. In ICML. JMLR, 1188--1196.
[29]
Meng Liu, Xiang Wang, Liqiang Nie, Xiangnan He, Baoquan Chen, and Tat-Seng Chua. 2018a. Attentive Moment Retrieval in Videos. In SIGIR. ACM, 15--24.
[30]
Meng Liu, Xiang Wang, Liqiang Nie, Qi Tian, Baoquan Chen, and Tat-Seng Chua. 2018b. Cross-modal Moment Localization in Videos. In MM. ACM, 843--851.
[31]
Julian McAuley and Jure Leskovec. 2013. Hidden factors and hidden topics: understanding rating dimensions with review text. In RecSys . ACM, 165--172.
[32]
Julian McAuley, Christopher Targett, Qinfeng Shi, and Anton Van Den Hengel. 2015. Image-based recommendations on styles and substitutes. In SIGIR. ACM, 43--52.
[33]
Andriy Mnih and Ruslan R Salakhutdinov. 2008. Probabilistic matrix factorization. In NIPS. MIT Press, 1257--1264.
[34]
Liqiang Nie, Xuemeng Song, and Tat Seng Chua. 2016. Learning from Multiple Social Networks. Synthesis Lectures on Information Concepts Retrieval & Services, Vol. 8 (2016), 1--118.
[35]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In UAI. AUAI, 452--461.
[36]
Xuemeng Song, Fuli Feng, Xianjing Han, Xin Yang, Wei Liu, and Liqiang Nie. 2018. Neural compatibility modeling with attentive knowledge distillation. In SIGIR. ACM, 5--14.
[37]
Xuemeng Song, Fuli Feng, Jinhuan Liu, Zekun Li, Liqiang Nie, and Jun Ma. 2017. Neurostylist: Neural compatibility modeling for clothing matching. In MM. ACM, 753--761.
[38]
Nitish Srivastava and Ruslan R Salakhutdinov. 2012. Multimodal learning with deep boltzmann machines. In NIPS. MIT press, 2222--2230.
[39]
Yunzhi Tan, Min Zhang, Yiqun Liu, and Shaoping Ma. 2016. Rating-boosted latent topics: Understanding users and items with ratings and reviews. In IJCAI . AAAI Press.
[40]
Duyu Tang, Bing Qin, Ting Liu, and Yuekui Yang. 2015. User modeling with neural network for review rating prediction. In IJCAI. AAAI Press, 1340--1346.
[41]
Yi Tay, Luu Anh Tuan, and Siu Cheung Hui. 2018. Latent relational metric learning via memory-based attention for collaborative ranking. In WWW . IW3C2, 729--739.
[42]
Amos Tversky and Itamar Gati. 1982. Similarity, separability, and the triangle inequality. Psychological review, Vol. 89, 2 (1982), 123.
[43]
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing Data using t-SNE . JMLR, Vol. 9 (2008), 2579--2605.
[44]
Meng Wang, Yue Gao, Ke Lu, and Yong Rui. 2013. View-Based Discriminative Probabilistic Modeling for 3D Object Retrieval and Recognition. TOIS, Vol. 22 (2013), 1395--1407.
[45]
Meng Wang, Hao Li, Dacheng Tao, Ke Lu, and Xindong Wu. 2012. Multimodal Graph-Based Reranking for Web Image Search. TOIS, Vol. 21 (2012), 4649--4661.
[46]
Mingyu Wang, Changzhi Luo, Bingbing Ni, Jun Yuan, Jianfeng Wang, and Shuicheng Yan. 2018. First-Person Daily Activity Recognition With Manipulated Object Proposals and Non-Linear Feature Fusion. TCSVT, Vol. 28 (2018), 2946--2955.
[47]
Suhang Wang, Yilin Wang, Jiliang Tang, Kai Shu, Suhas Ranganath, and Huan Liu. 2017. What your images reveal: Exploiting visual contents for point-of-interest recommendation. In WWW . IW3C2, 391--400.
[48]
Jason Weston, Samy Bengio, and Nicolas Usunier. 2010. Large scale image annotation: learning to rank with joint word-image embeddings. Machine learning, Vol. 81, 1 (2010), 21--35.
[49]
Hong-Jian Xue, Xin-Yu Dai, Jianbing Zhang, Shujian Huang, and Jiajun Chen. 2017. Deep matrix factorization models for recommender systems. In IJCAI. AAAI Press, 3203--3209.
[50]
Xun Yang, Mingyu Wang, and Dacheng Tao. 2018. Person Re-Identification With Metric Learning Using Privileged Information. TOIS, Vol. 27 (2018), 791--805.
[51]
Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, and Wei-Ying Ma. 2016. Collaborative knowledge base embedding for recommender systems. In SIGKDD. ACM, 353--362.
[52]
Hanwang Zhang, Yang Yang, Huanbo Luan, Shuicheng Yang, and Tat-Seng Chua. 2014. Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes. In MM. ACM, 187--196.
[53]
Shuai Zhang, Lina Yao, Yi Tay, Xiwei Xu, Xiang Zhang, and Liming Zhu. 2019. Metric Factorization: Recommendation beyond Matrix Factorization. In ICDE . IEEE.
[54]
Yongfeng Zhang, Qingyao Ai, Xu Chen, and W Bruce Croft. 2017. Joint representation learning for top-n recommendation with heterogeneous information sources. In WSDM. ACM, 1449--1458.
[55]
Tong Zhao, Julian McAuley, and Irwin King. 2015. Improving latent factor models via personalized feature projection for one class recommendation. In CIKM. ACM, 821--830.
[56]
Lei Zheng, Chun-Ta Lu, Lifang He, Sihong Xie, Vahid Noroozi, He Huang, and Philip S Yu. 2018. MARS: Memory attention-aware recommender system. arXiv e-prints (2018).
[57]
Lei Zheng, Vahid Noroozi, and Philip S Yu. 2017. Joint deep modeling of users and items using reviews for recommendation. In WSDM. ACM, 425--434.
[58]
Guorui Zhou, Xiaoqiang Zhu, Chenru Song, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, and Kun Gai. 2018. Deep interest network for click-through rate prediction. In KDD. ACM, 1059--1068.

Cited By

View all
  • (2024)MMKG-PAR: Multi-Modal Knowledge Graphs-Based Personalized Attraction RecommendationSustainability10.3390/su1605221116:5(2211)Online publication date: 6-Mar-2024
  • (2024)Explicitly Exploiting Implicit User and Item Relations in Graph Convolutional Network (GCN) for RecommendationElectronics10.3390/electronics1314281113:14(2811)Online publication date: 17-Jul-2024
  • (2024)Cluster-Based Graph Collaborative FilteringACM Transactions on Information Systems10.1145/368748142:6(1-24)Online publication date: 12-Aug-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '19: Proceedings of the 27th ACM International Conference on Multimedia
October 2019
2794 pages
ISBN:9781450368896
DOI:10.1145/3343031
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. attention mechanism
  2. metric learning
  3. multimodal information
  4. personalized recommendation

Qualifiers

  • Research-article

Funding Sources

  • Project of Thousand Youth Talents 2016
  • National Natural Science Foundation of China
  • Shandong Provincial Natural Science and Foundation
  • Future Talents Research Funds of Shandong University

Conference

MM '19
Sponsor:

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)101
  • Downloads (Last 6 weeks)12
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)MMKG-PAR: Multi-Modal Knowledge Graphs-Based Personalized Attraction RecommendationSustainability10.3390/su1605221116:5(2211)Online publication date: 6-Mar-2024
  • (2024)Explicitly Exploiting Implicit User and Item Relations in Graph Convolutional Network (GCN) for RecommendationElectronics10.3390/electronics1314281113:14(2811)Online publication date: 17-Jul-2024
  • (2024)Cluster-Based Graph Collaborative FilteringACM Transactions on Information Systems10.1145/368748142:6(1-24)Online publication date: 12-Aug-2024
  • (2024)Multimodal Pre-training for Sequential Recommendation via Contrastive LearningACM Transactions on Recommender Systems10.1145/36820753:1(1-23)Online publication date: 29-Jul-2024
  • (2024)Disentangled Cascaded Graph Convolution Networks for Multi-Behavior RecommendationACM Transactions on Recommender Systems10.1145/36732442:4(1-27)Online publication date: 17-Jun-2024
  • (2024)SOIL: Contrastive Second-Order Interest Learning for Multimodal RecommendationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681207(5838-5846)Online publication date: 28-Oct-2024
  • (2024)Formalizing Multimedia Recommendation through Multimodal Deep LearningACM Transactions on Recommender Systems10.1145/3662738Online publication date: 29-Apr-2024
  • (2024)MSI: Multi-modal Recommendation via Superfluous Semantics Discarding and Interaction PreservingProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658043(814-823)Online publication date: 30-May-2024
  • (2024)A Multi-modal Modeling Framework for Cold-start Short-video RecommendationProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688098(391-400)Online publication date: 8-Oct-2024
  • (2024)Dataset and Models for Item Recommendation Using Multi-Modal User InteractionsProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657881(709-718)Online publication date: 10-Jul-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media