Abstract
The research of personalized recommendation techniques today has mostly parted into two mainstream directions, namely, the factorization-based approaches and topic models. Practically, they aim to benefit from the numerical ratings and textual reviews, correspondingly, which compose two major information sources in various real-world systems, including Amazon, Yelp, eBay, Netflix, and many others.
However, although the two approaches are supposed to be correlated for their same goal of accurate recommendation, there still lacks a clear theoretical understanding of how their objective functions can be mathematically bridged to leverage the numerical ratings and textual reviews collectively, and why such a bridge is intuitively reasonable to match up their learning procedures for the rating prediction and top-N recommendation tasks, respectively.
In this work, we exposit with mathematical analysis that, the vector-level randomization functions to harmonize the optimization objectives of factorizational and topic models unfortunately do not exist at all, although they are usually pre-assumed and intuitively designed in the literature.
Fortunately, we also point out that one can simply avoid the seeking of such a randomization function by optimizing a Joint Factorizational Topic (JFT) model directly. We further apply our JFT model to the cross-city Point of Interest (POI) recommendation tasks for performance validation, which is an extremely difficult task for its inherent cold-start nature. Experimental results on real-world datasets verified the appealing performance of our approach against previous methods with pre-assumed randomization functions in terms of both rating prediction and top-N recommendation tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aciar, S., Zhang, D., Simoff, S., Debenham, J.: Informed recommender: basing recommendations on consumer product reviews. Intell. Syst. 22(3), 39–47 (2007)
Agarwal, D., Chen, B.C.: fLDA: matrix factorization through latent Dirichlet allocation. In: WSDM (2010)
Baatarjav, E.-A., Phithakkitnukoon, S., Dantu, R.: Group recommendation system for Facebook. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM 2008. LNCS, vol. 5333, pp. 211–219. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88875-8_41
Blei, D.M.: Probabilistic topic models. Commun. ACM 55(4), 77–84 (2012)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. JMLR 2003(3), 993–1022 (2003)
Cremonesi, P., Koren, Y., Turrin, R.: Performance of recommender algorithms on top-N recommendation tasks. In: RecSys pp. 39–46 (2010)
Das, A., Datar, M., Garg, A., Rajaram, S.: Google news personalization: scalable online collaborative filtering. In: WWW pp. 271–280 (2007)
Davidson, J., Liebald, B., Liu, J., et al.: The YouTube video recommendation system. In: RecSys, pp. 293–296 (2010)
Ganu, G., Elhadad, N., Marian, A.: Beyond the stars: improving rating predictions using review text content. In: WebDB (2009)
Hu, Y., Koren, Y., Volinsky, C.: Collaborative filtering for implicit feedback datasets. In: ICDM (2008)
Jakob, N., Weber, S.H., Müller, M.C., et al.: Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations. In: TSA (2009)
Knott, M., Bartholomew, D.: Latent Variable Models and Factor Analysis. Kendall’s Library of Statistics 2 (1999)
Ko, M., Kim, H.W., Yi, M.Y., Song, J., Liu, Y.: MovieCommenter: aspect-based collaborative filtering by utilizing user comments. In: CollaborateCom (2011)
Konstan, J.A., Riedl, J.: Recommender systems: from algorithms to user experience. User Model. User-Adap. Interact. 22(1–2), 101–123 (2012)
Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42, 8 (2009)
Koren, Y., Bell, R.: Advances in Collaborative Filtering. In: Ricci, F., Rokach, L., Shapira, B., Kantor, P.B. (eds.) Recommender Systems Handbook, pp. 145–186. Springer, Heidelberg (2011). doi:10.1007/978-0-387-85820-3_5
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Proceedings of NIPS (2001)
Lika, B., Kolomvatsos, K., Hadjiefthymiades, S.: Facing the cold start problem in recommender systems. Expert Syst. Appl. 41(4), 2065–2073 (2014)
Linden, G., Smith, B., York, J.: Amazon.com recommendations: item-to-item collaborative filtering. Internet Comput. 7(1), 76–80 (2003)
Ling, G., Lyu, M.R., King, I.: Ratings meet reviews: a combined approach to recommend. In: RecSys (2014)
McAuley, J., Leskovec, J.: Hidden factors and hidden topics: understanding rating dimensions with review text. In: RecSys, pp. 165–172 (2013)
Pazzani, M.J., Billsus, D.: Content-based recommendation systems. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) The Adaptive Web. LNCS, vol. 4321, pp. 325–341. Springer, Heidelberg (2007). doi:10.1007/978-3-540-72079-9_10
Purushotham, S., Liu, Y., Kuo, C.C.J.: collaborative topic regression with social matrix factorization for recommendation systems. In: ICML (2012)
Rendle, S., Freudenthaler, C., Gantner, Z., Thieme, L.S.: BPR: Bayesian Personalized Ranking from implicit feedback. In: UAI (2009)
Ricci, F., Rokach, L., Shapira, B.: Introduction to Recommender Systems Handbook. Springer, Heidelberg (2011)
Salakhutdinov, R., Mnih, A.: Bayesian probabilistic matrix factorization using Markov chain Monte Carlo. In: Proceedings of ICML (2008)
Salakhutdinov, R., Mnih, A.: Probabilistic matrix factorization. In: Proceedings of NIPS (2008)
Srebro, N., Rennie, J.D.M., Jaakkola, T.S.: Maximum-margin matrix factorization. In: NIPS (2005)
Su, X., Khoshgoftaar, T.M.: A survey of collaborative filtering techniques. In: Advances in AI, p. 4 (2009)
Takacs, G., Pilaszy, I., Nemeth, B., Tikk, D.: Investigation of various matrix factorization methods for large recommender systems. In: Proceedings of ICDM (2008)
Terzi, M., Ferrario, M.A., Whittle, J.: Free text in user reviews: their role in recommender systems. In: RecSys (2011)
Wang, C., Blei, D.M.: Collaborative topic modeling for recommending scientific articles. In: KDD (2011)
Xu, X., Datta, A., Dutta, K.: Using adjective features from user reviews to generate higher quality and explainable recommendations. IFIP Advances in Info. and Com. Tech. 389, 18–34 (2012)
Zhang, Y., Lai, G., Zhang, M., Zhang, Y., Liu, Y., Ma, S.: Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. In: SIGIR (2014)
Zhang, Y., Zhang, H., Zhang, M., Liu, Y., et al.: Do users rate or review? boost phrase-level sentiment labeling with review-level sentiment classification. In: SIGIR (2014)
Acknowledgement
We thank the reviewers for their valuable suggestions. This work is supported by Natural Science Foundation of China (Grant Nos. 61532011, 61672311) and National Key Basic Research Program (2015CB358700).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
Let \(\gamma \) denote arbitrary vectors with length K , and \(\mathcal {F}\) is the set of all randomization functions \(f:\mathbb {R}^K\rightarrow \mathbb {R}^K\) satisfying:
then there exists no randomization function \(f\in \mathcal {F}\) with the product-level monotonic property of:
Proof: Suppose there exists a randomization function \(f\in \mathcal {F}\) that meets Eq. (21). Let \(t>1\), and let \(\alpha \) and \(\beta \) be vectors with \(\alpha \cdot \beta >0\), then we have \(t\alpha \cdot \beta >\alpha \cdot \beta \). By applying the property of product-level monotonic in Eq. (21) we have:
and this can be equivalently written as:
Let \(\varDelta \doteq f(t\alpha ) - f(\alpha )\), and according to the definition of randomization function in Eq. (20), we know that \(\sum _{k}f(t\alpha )_{k}= \sum _{k}f(\alpha )_{k} = 1\), thus we have:
According to Eq. (23) we know that \(\varDelta \ne \mathbf 0 \). Let \(\mathcal {P}\) denote the indices of all positive elements in vector \(\varDelta \), and \(\mathcal {N}\) denote the indices of negative elements. We have:
As Eq. (23) holds for any \(\beta \) with \(\alpha \cdot \beta >0\), without loss of generally, let \(\beta \) be a vector where \(\beta _{k\in \mathcal {P}}=0\) and \(\beta _{k\in \mathcal {N}}=1\). According to the vector-level monotonic property in Eq. (20) and the fact that \(0<1\), we have \(f(\beta )_{k\in \mathcal {P}}<f(\beta )_{k\in \mathcal {N}}\) and \(0\le f(\beta )_{k\in \mathcal {P}\cup \mathcal {N}}\le 1\). Combined with Eq. (25), we further obtain the following:
which is a direct contradiction with Eq. (23). As a result, there exists no randomization function \(f\in \mathcal {F}\) that satisfies the product-level monotonic property in Eq. (21).
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Xiao, L., Min, Z., Yongfeng, Z. (2017). Joint Factorizational Topic Models for Cross-City Recommendation. In: Chen, L., Jensen, C., Shahabi, C., Yang, X., Lian, X. (eds) Web and Big Data. APWeb-WAIM 2017. Lecture Notes in Computer Science(), vol 10366. Springer, Cham. https://doi.org/10.1007/978-3-319-63579-8_45
Download citation
DOI: https://doi.org/10.1007/978-3-319-63579-8_45
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63578-1
Online ISBN: 978-3-319-63579-8
eBook Packages: Computer ScienceComputer Science (R0)