Regression-based latent factor models

Authors:

Deepak Agarwal,

Bee-Chung Chen

KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 19 - 28

https://doi.org/10.1145/1557019.1557029

Published: 28 June 2009

Abstract

We propose a novel latent factor model to accurately predict response for large-scale dyadic data in the presence of features. Our approach is based on a model that predicts response as a multiplicative function of row and column latent factors that are estimated through separate regressions on known row and column features. Our model provides a single unified framework to address both cold and warm start scenarios that are commonplace in practical applications like recommender systems, online advertising, web search, etc. We provide scalable and accurate model fitting methods based on Iterated Conditional Mode and Monte Carlo EM algorithms. We show our model induces a stochastic process on the dyadic space with kernel (covariance) given by a polynomial function of features. Methods that generalize our procedure to estimate factors in an online fashion for dynamic applications are also considered. Our method is illustrated on benchmark datasets and a novel content recommendation application that arises in the context of Yahoo! Front Page. We report significant improvements over several commonly used methods on all datasets.
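The structure described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes linear regressions from features to latent factors with Gaussian deviations, and the weight matrices `G` and `D`, the feature vectors, and all function names are hypothetical values chosen for illustration. The response is taken to be the inner product of a row factor and a column factor. In a cold-start case (no past responses for the row), the factor falls back to its regression mean.

```python
import random


def matvec(M, x):
    """Multiply matrix M (a list of rows) by vector x."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(ai * bi for ai, bi in zip(a, b))


def prior_mean_factor(W, features):
    """Regression part: expected latent factor given known features."""
    return matvec(W, features)


def sample_factor(W, features, sigma, rng):
    """Latent factor = regression on features + Gaussian deviation.

    The deviation stands in for what observed responses would teach the
    model about a specific row or column (the warm-start case).
    """
    return [m + rng.gauss(0.0, sigma) for m in prior_mean_factor(W, features)]


def predict(u, v):
    """Multiplicative response: inner product of row and column factors."""
    return dot(u, v)


# Hypothetical regression weights (assumptions, not values from the paper):
# 2 latent dimensions, 3 row features, 2 column features.
G = [[1.0, 0.0, 0.5],
     [0.0, 2.0, 0.0]]
D = [[0.5, 1.0],
     [1.0, 0.0]]

x_new_user = [1.0, 0.5, 2.0]  # features of a row with no past responses
z_item = [2.0, 1.0]           # features of a column

# Cold start: both factors are predicted purely from features.
u_cold = prior_mean_factor(G, x_new_user)
v_cold = prior_mean_factor(D, z_item)
cold_score = predict(u_cold, v_cold)

# Warm start (simulated): the row factor deviates from its regression mean.
rng = random.Random(0)
u_warm = sample_factor(G, x_new_user, sigma=0.1, rng=rng)
warm_score = predict(u_warm, v_cold)
```

With these assumed weights the cold-start score is a bilinear function of the row and column features, which is one simple way to see why a feature-based kernel arises. A fitted model would estimate the weights and the per-row deviations from data instead of fixing or sampling them.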

Index Terms

Regression-based latent factor models
1. Computing methodologies
  1. Modeling and simulation
    1. Simulation theory
      1. Systems theory
2. Mathematics of computing
  1. Information theory
  2. Probability and statistics

Published In

KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining

June 2009

1426 pages

ISBN: 9781605584959

DOI: 10.1145/1557019

General Chairs: John Elder (Elder Research, Inc., USA), Françoise Soulié Fogelman (KXEN, France)

Program Chairs: Peter Flach (University of Bristol, UK), Mohammed Zaki (RPI, USA)

Copyright © 2009 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

KDD09: The 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

June 28 - July 1, 2009

Paris, France

Acceptance Rates

Overall acceptance rate: 1,133 of 8,635 submissions (13%)
