Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2792838.2799670acmconferencesArticle/Chapter ViewAbstractPublication PagesrecsysConference Proceedingsconference-collections
short-paper

Data Quality Matters in Recommender Systems

Published: 16 September 2015 Publication History

Abstract

Although data quality has been recognized as an important factor in the broad information systems research, it has received little attention in recommender systems. Data quality matters are typically addressed in recommenders by ad-hoc cleansing methods, which prune noisy or unreliable records from the data. However, the setting of the cleansing parameters is often done arbitrarily, without thorough consideration of the data characteristics. In this work, we turn to two central data quality problems in recommender systems: sparsity and redundancy. We devise models for setting data-dependent thresholds and sampling levels, and evaluate these using a collection of public and proprietary datasets. We observe that the models accurately predict data cleansing parameters, while having minor effect on the accuracy of the generated recommendations.

References

[1]
X. Amatriain, A. Jaimes, N. Oliver, and J. M. Pujol. Data mining methods for recommender systems. In Recommender Syst. Handbook. 2011.
[2]
X. Amatriain, J. M. Pujol, N. Tintarev, and N. Oliver. Rate it again: increasing recommendation accuracy by user re-rating. In Recommender Syst. Conf., pages 173--180, 2009.
[3]
S. Berkovsky, T. Kuflik, and F. Ricci. The impact of data obfuscation on the accuracy of collaborative filtering. Expert Syst. Appl., 39(5):5033--5042, 2012.
[4]
B. M. Marlin and R. S. Zemel. Collaborative prediction and ranking with non-random missing data. In Recommender Syst. Conf., pages 5--12, 2009.
[5]
U. Paquet and N. Koenigstein. One-class collaborative filtering with random graphs. In Int. World Wide Web Conf., pages 999--1008, 2013.
[6]
L. L. Pipino, Y. W. Lee, and R. Y. Wang. Data quality assessment. Commun. of the ACM, 45(4):211--218, 2002.
[7]
R. Ronen, N. Koenigstein, E. Ziklik, M. Sitruk, R. Yaari, and N. Haiby-Weiss. Sage: recommender engine as a cloud service. In Recommender Syst. Conf., pages 475--476, 2013.
[8]
A. Said, B. J. Jain, S. Narr, and T. Plumbaum. Users and noise: The magic barrier of recommender systems. In Int. Conf. on User Modeling, Adaptation, and Personalization, pages 237--248, 2012.
[9]
G. Shani and A. Gunawardana. Evaluating recommendation systems. In Recommender Syst. Handbook. 2011.
[10]
R. Y. Wang and D. M. Strong. Beyond accuracy: What data quality means to data consumers. Journal of Management Information Systems, pages 5--33, 1996.

Cited By

View all
  • (2024)Consumer Social Connectedness and Persuasiveness of Collaborative-Filtering Recommender Systems: Evidence From an Online-to-Offline Recommendation AppProduction and Operations Management10.1177/10591478241259422Online publication date: 25-Jul-2024
  • (2023)Improving Recommender Systems Through the Automation of Design DecisionsProceedings of the 17th ACM Conference on Recommender Systems10.1145/3604915.3608877(1332-1338)Online publication date: 14-Sep-2023
  • (2023)Semi-supervised Adversarial Learning for Complementary Item RecommendationProceedings of the ACM Web Conference 202310.1145/3543507.3583462(1804-1812)Online publication date: 30-Apr-2023
  • Show More Cited By

Index Terms

  1. Data Quality Matters in Recommender Systems

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      RecSys '15: Proceedings of the 9th ACM Conference on Recommender Systems
      September 2015
      414 pages
      ISBN:9781450336925
      DOI:10.1145/2792838
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 September 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. data quality
      2. recommender systems
      3. redundancy
      4. sparsity

      Qualifiers

      • Short-paper

      Funding Sources

      • ISF

      Conference

      RecSys '15
      Sponsor:
      RecSys '15: Ninth ACM Conference on Recommender Systems
      September 16 - 20, 2015
      Vienna, Austria

      Acceptance Rates

      RecSys '15 Paper Acceptance Rate 28 of 131 submissions, 21%;
      Overall Acceptance Rate 254 of 1,295 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)28
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 26 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Consumer Social Connectedness and Persuasiveness of Collaborative-Filtering Recommender Systems: Evidence From an Online-to-Offline Recommendation AppProduction and Operations Management10.1177/10591478241259422Online publication date: 25-Jul-2024
      • (2023)Improving Recommender Systems Through the Automation of Design DecisionsProceedings of the 17th ACM Conference on Recommender Systems10.1145/3604915.3608877(1332-1338)Online publication date: 14-Sep-2023
      • (2023)Semi-supervised Adversarial Learning for Complementary Item RecommendationProceedings of the ACM Web Conference 202310.1145/3543507.3583462(1804-1812)Online publication date: 30-Apr-2023
      • (2023)Empowering Recommender Systems in ITSM: A Pipeline Reference Model for AI-Based Textual Data Quality EnrichmentDesign Science Research for a New Society: Society 5.010.1007/978-3-031-32808-4_18(279-293)Online publication date: 19-May-2023
      • (2022)BRUCE: Bundle Recommendation Using Contextualized item EmbeddingsProceedings of the 16th ACM Conference on Recommender Systems10.1145/3523227.3546754(237-245)Online publication date: 12-Sep-2022
      • (2022)Collaborative Image UnderstandingProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557260(77-87)Online publication date: 17-Oct-2022
      • (2022)Investigating the Value of Subtitles for Improved Movie RecommendationsProceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization10.1145/3503252.3531291(99-109)Online publication date: 4-Jul-2022
      • (2022)From Data Analysis to Intent-Based Recommendation: An Industrial Case Study in the Video DomainIEEE Access10.1109/ACCESS.2022.314843410(14779-14796)Online publication date: 2022
      • (2021)A Black-Box Attack Model for Visually-Aware Recommender SystemsProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441757(94-102)Online publication date: 8-Mar-2021
      • (2020)Something’s Missing? A Procedure for Extending Item Content Data Sets in the Context of Recommender SystemsInformation Systems Frontiers10.1007/s10796-020-10071-y24:1(267-286)Online publication date: 22-Oct-2020
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media