Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A hybrid recommendation technique using topic embedding for rating prediction and to handle cold-start problem

Published: 15 December 2022 Publication History

Abstract

Recommender systems aim to estimate item ratings and recommend items based on the users’ interests. The traditional recommender systems generally consider user–item rating information for rating prediction, but they suffer from various limitations, such as data sparsity, black-box recommendation, and cold-start problems. As a result, researchers have proposed amalgamating contextual information with rating data to provide effective recommendations. Although user-generated data in the form of reviews are a rich source of contextual information, they are rarely utilized in recommender algorithms. This study presents a hybrid recommendation technique, called RecTE, using rating data and topic embedding, which is an amalgamation of word embedding and topic modeling techniques. The novelty of RecTE lies in predicting item ratings using topic embeddings learned by incorporating local and global contextual information and integrating them with user-based collaborative filtering. RecTE is empirically evaluated over three real-world datasets – YelpNYC, YelpZip and TripAdvisor. This technique performs significantly better in comparison to nine baselines and five state-of-the-art recommendation techniques. On empirical analysis, we found that incorporating topic embedding in RecTE makes it capable of performing significantly better and handle cold-start problems effectively in comparison to the existing recommendation approaches.

Highlights

A novel recommendation approach to handle the cold-start problem.
Incorporating rating data and topic embedding into UBCF for rating prediction.
Assessing the impact of topic embedding on effective recommendation.
Comparative evaluation of the proposed approach with 9 baselines and 5 SOTA.
Assessing efficacy of the proposed approach to handle the cold-start problem.

References

[1]
Bathla G., Singh P., Kumar S., Verma M., Garg D., Kotecha K., Recop: fine-grained opinions and sentiments-based recommender system for industry 5.0, Soft Computing (2021) 1–10.
[2]
Bauman K., Tuzhilin A., Discovering contextual information from user reviews for recommendation purposes, in: Proceddings of the workshop on new trends in content based recommender systems, ACM, Silicon Valley, USA, 2014, pp. 2–9.
[3]
Belford M., MacNamee B., Greene D., Ensemble topic modeling via matrix factorization, in: Proceedings of the 24th Irish conference on artificial intelligence and cognitive science. Vol. 1751, CEUR Workshop Proceedings, Dublin, Ireland, 2016.
[4]
Bengio Y., Ducharme R., Vincent P., Jauvin C., A neural probabilistic language model, Journal of Machine Learning Research 3 (2003) 1137–1155.
[5]
Blei D.M., Ng A.Y., Jordan M.I., Latent Dirichlet allocation, Journal of Machine Learning Research 3 (2003) 993–1022.
[6]
Chen C., Zheng X., Wang Y., Hong F., Lin Z., Context-aware collaborative topic regression with social matrix factorization for recommender systems, in: Proceddings of the 28th AAAI conference on artificial intelligence, AAAI, Quebec, Canada, 2014, pp. 1–7.
[7]
Cheng Z., Shen J., Kankanhalli M., Nie L., Exploiting music play sequence for music recommendation, in: Proceedings of the 26th international joint conference on artificial intelligence, ACM, Melbourne, Australia, 2017, pp. 3654–3660.
[8]
Collobert R., Weston J., Bottou L., Karlen M., Kavukcuoglu K., Kuksa P., Natural language processing (almost) from scratch, Journal of Machine Learning Research 12 (2011) 2493–2537.
[9]
Deerwester S., Dumais S.T., Furnas G.W., Landauer T.K., Harshman R., Indexing by latent semantic analysis, Journal of the American Society for Information Science 41 (6) (1990) 391–407.
[10]
George T., Merugu S., A scalable collaborative filtering framework based on co-clustering, in: Proceedings of the 5th international conference on data mining, IEEE, Houston, USA, 2005, pp. 625–628.
[11]
Herlocker J.L., Konstan J.A., Terveen L.G., Riedl J.T., Evaluating collaborative filtering recommender systems, ACM Transactions on Information Systems (TOIS) 22 (1) (2004) 5–53.
[12]
Hu B., Ester M., Spatial topic modeling in online social media for location recommendation, in: Proceddings of the 7th international conference on recommender systems, ACM, Hong Kong, China, 2013, pp. 25–32.
[13]
Hu Y., Koren Y., Volinsky C., Collaborative filtering for implicit feedback datasets, in: Proceedings of 8th international conference on data mining, IEEE, Pisa, Italy, 2008, pp. 263–272.
[14]
Huang E.H., Socher R., Manning C.D., Ng A., Improving word representations via global context and multiple word prototypes, in: Proceedings of the 50th annual meeting of the association for computational linguistics, ACM, Jeju Island, Korea, 2008, pp. 873–882.
[15]
Hug N., Surprise, a python library for recommender systems, 2017, http://surpriselib.com.
[16]
Jamali M., Ester M., TrustWalker: A random walk model for combining trust-based and item-based recommendation, in: Proceedings of the 15th international conference on knowledge discovery and data mining, ACM, Paris, France, 2009, pp. 397–406.
[17]
Jing L., Wang P., Yang L., Sparse probabilistic matrix factorization by llaplace distribution for collaborative filtering, in: Proceedings of the 24th international conference on artificial intelligence, ACM, Buenos Aires, Argentina, 2015, pp. 1771–1777.
[18]
Koren Y., Factor in the neighbors: Scalable and accurate collaborative filtering, ACM Transactions on Knowledge Discovery from Data (TKDD) 4 (1) (2010) 1.
[19]
Levy O., Goldberg Y., Neural word embedding as implicit matrix factorization, in: Proceedings of the 27th international conference on neural information processing systems, ACM, Montreal, Canada, 2014, pp. 2177–2185.
[20]
Liu D., Li J., Du B., Chang J., Gao R., DAML: Dual attention mutual learning between ratings and reviews for item recommendation, in: Proceddings of the 25th ACM SIGKDD international conference on knowledge discovery and data mining, ACM, Anchorage, USA, 2019, pp. 344–352.
[21]
Liu L., Tang L., Dong W., Yao S., Zhou W., An overview of topic modeling and its current applications in bioinformatics, SpringerPlus 5 (2016) 1937–1960.
[22]
Liu Z., Yuan B., Ma Y., A multi-task dual attention deep recommendation model using ratings and review helpfulness, Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies (2021) 1–13.
[23]
Manotumruksa J., Macdonald C., Ounis I., Regularising factorised models for venue recommendation using friends and their comments, in: Proceedings of the 25th ACM international conference on information and knowledge management, ACM, Indianapolis, USA, 2016, pp. 1981–1984.
[24]
Massa P., Avesani P., Trust metrics on controversial users: Balancing between tyranny of the majority and echo chambers, International Journal on Semantic Web and Information Systems 3 (2007) 39–64.
[25]
Mikolov T., Sutskever I., Chen K., Corrado G.S., Dean J., Distributed representations of words and phrases and their compositionality, in: Proceddings of the 26th international conference on neural information processing systems, ACM, Lake Tahoe, Nevada, 2013, pp. 3111–3119.
[26]
Musto C., Franza T., Semeraro G., de Gemmis M., Lops P., Deep content-based recommender systems exploiting recurrent neural networks and linked open data, in: Proceedings of the 26th conference on user modeling, adaptation and personalization, ACM, 2018, pp. 239–244.
[27]
Musto C., Semeraro G., de Gemmis M., Lops P., Learning word embeddings from wikipedia for content-based recommender systems, in: European conference on information retrieval, Springer, 2016, pp. 729–734.
[28]
Nisha C.C., Mohan A., A social recommender system using deep architecture and network embedding, Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies 49 (2018) 1937–1953.
[29]
Ozsoy M.G., From word embeddings to item recommendation, 2016, arXiv preprint arXiv:1601.01356.
[30]
Pena F.J., Morgan D.O., Tragos E.Z., Hurley N., Duriakova E., Smyth B., Lawlor A., Combining rating and review data by initializing latent factor models with topic models for top-N recommendation, in: Proceedings of the 14th ACM conference on recommender system, ACM, Brazil, 2020, pp. 438–443.
[31]
Rayana S., Akoglu L., Collective opinion spam detection: Bridging review networks and metadata, in: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, ACM, Sydney, Australia, 2015, pp. 985–994.
[32]
Schafer J.B., Frankowski D., Herlocker J.L., Sen S., Collaborative filtering recommender systems, The Adaptive Web 4321 (2007) 291–324.
[33]
Sedhain S., Menon A.K., Sanner S.P., Xie L., AutoRec: Autoencoders meet collaborative filtering, in: Proceddings of the 24th international conference on world wide web, ACM, Florenance, Italy, 2015, pp. 111–112.
[34]
Sejwal V.K., Abulaish M., CAMO: A context-aware movie ontology generated from LOD and movie databases, Multimedia Tools and Applications 80 (2021) 7247–7269.
[35]
Sejwal V.K., Abulaish M., Jahiruddin S.P., CRecSys: A context-based recommender system using collaborative filtering and LOD, IEEE Access 8 (2020) 158432–158448.
[36]
Sheu H.-S., Li S., Context-aware graph embedding for session-based news recommendation, in: Proceedings of the 14th ACM conference on recommender system, ACM, Brazil, 2020, pp. 657–662.
[37]
Tan Y., Zhang M., Liu Y., Ma S., Rating-boosted latent topics: Understanding users and items with ratings and reviews, in: Proceedings of the 25th international joint conference on artificial intelligence, ACM, New York, USA, 2016, pp. 2640–2646.
[38]
Valizadegan, H., Jin, R., Zhang, R., & Mao, J. (2000). Learning to Rank by Optimizing ndcg Measure. In SIGIR (pp. 41–48).
[39]
Wang C., Blei D.M., Collaborative topic modeling for recommending scientific articles, in: Proceddings of the 17th international conference on knowledge discovery and data mining, ACM, California, USA, 2011, pp. 448–456.
[40]
Wang H., Lu Y., Zhai C., Latent aspect rating analysis without aspect keyword supervision, in: Proceedings of the 17th ACM SIGKDD conference on knowledge discovery and data mining, ACM, 2011, pp. 618–626.
[41]
Xun G., Li Y., Gao J., Zhang A., Collaboratively improving topic discovery and word embeddings by coordinating global and local contexts, in: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, ACM, Halifax, NS, Canada, 2017, pp. 535–543.
[42]
Yu J., Gao M., Li J., Yin H., Liu H., Adaptive implicit friends identification over heterogeneous network for social recommendation, in: Proceedings of the 27th ACM international conference on information and knowledge management, ACM, 2018, pp. 357–366.
[43]
Zhang L., Liu P., Gulla J.A., Dynamic attention-integrated neural network for session-based news recommendation, Machine Learning 108 (2019) 1851–1875.
[44]
Zhang C., Yu L., Wang Y., Shah C., Zhang X., Collaborative user network embedding for social recommender systems, in: Proceedings of the 2017 SIAM international conference on data mining, SIAM, Houston, USA, 2017, pp. 381–389.
[45]
Zhou Y., Wilkinson D., Schreiber R., Pan R., Large-scale parallel collaborative filtering for the netflix prize, in: Proceedings of the 4th international conference on algorithmic aspects in information and management, Springer, 2008, pp. 337–348.

Cited By

View all
  • (2024)Explainable recommendation based on fusion representation of multi-type feature embeddingThe Journal of Supercomputing10.1007/s11227-023-05831-x80:8(10370-10393)Online publication date: 1-May-2024
  • (2023)Recommendation-based trust computation and rating prediction model for security enhancement in cloud computing systemsService Oriented Computing and Applications10.1007/s11761-023-00377-517:4(239-257)Online publication date: 1-Dec-2023

Index Terms

  1. A hybrid recommendation technique using topic embedding for rating prediction and to handle cold-start problem
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image Expert Systems with Applications: An International Journal
          Expert Systems with Applications: An International Journal  Volume 209, Issue C
          Dec 2022
          1590 pages

          Publisher

          Pergamon Press, Inc.

          United States

          Publication History

          Published: 15 December 2022

          Author Tags

          1. Collaborative filtering
          2. Topic modeling
          3. Word embedding
          4. Topic embedding
          5. Cold-start

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 13 Nov 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)Explainable recommendation based on fusion representation of multi-type feature embeddingThe Journal of Supercomputing10.1007/s11227-023-05831-x80:8(10370-10393)Online publication date: 1-May-2024
          • (2023)Recommendation-based trust computation and rating prediction model for security enhancement in cloud computing systemsService Oriented Computing and Applications10.1007/s11761-023-00377-517:4(239-257)Online publication date: 1-Dec-2023

          View Options

          View options

          Get Access

          Login options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media