Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2020408.2020440acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Selecting a comprehensive set of reviews

Published: 21 August 2011 Publication History

Abstract

Online user reviews play a central role in the decision-making process of users for a variety of tasks, ranging from entertainment and shopping to medical services. As user-generated reviews proliferate, it becomes critical to have a mechanism for helping the users (information consumers) deal with the information overload, and presenting them with a small comprehensive set of reviews that satisfies their information need. This is particularly important for mobile phone users, who need to make decisions quickly, and have a device with limited screen real-estate for displaying the reviews. Previous approaches have addressed the problem by ranking reviews according to their (estimated) helpfulness. However, such approaches do not account for the fact that the top few high-quality reviews may be highly redundant, repeating the same information, or presenting the same positive (or negative) perspective. In this work, we focus on the problem of selecting a comprehensive set of few high-quality reviews that cover many different aspects of the reviewed item. We formulate the problem as a maximum coverage problem, and we present a generic formalism that can model the different variants of review-set selection. We describe algorithms for the different variants we consider, and, whenever possible, we provide approximation guarantees with respect to the optimal solution. We also perform an experimental evaluation on real data in order to understand the value of coverage for users.

References

[1]
R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying search results. In WSDM, pages 5--14, 2009.
[2]
Y. Azar, I. Gamzu, and X. Yin. Multiple intents re-ranking. In Proceedings of the 41st annual ACM symposium on Theory of computing, STOC '09, 2009.
[3]
A. Bhaskara, M. Charikar, E. Chlamtac, U. Feige, and A. Vijayaraghavan. Detecting high log-densities: an (1/4) approximation for densest-subgraph. In STOC, pages 201--210, 2010.
[4]
G. Calinescu, C. Chekuri, M. Pál, and J. Vondrák. Maximizing a submodular set function subject to a matroid constraint (extended abstract). In IPCO, pages 182--196, 2007.
[5]
J. G. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In SIGIR, pages 335--336, 1998.
[6]
G. Carenini, R. T. Ng, and E. Zwart. Extracting knowledge from evaluative text. In Proceedings of the 3rd international conference on Knowledge capture, K-CAP '05, pages 11--18, New York, NY, USA, 2005. ACM.
[7]
C. Chekuri and A. Kumar. Maximum coverage problem with group budget constraints and applications. In APPROX-RANDOM, pages 72--83, 2004.
[8]
H. Chen and D. R. Karger. Less is more: probabilistic models for retrieving fewer relevant documents. In SIGIR, pages 429--436, 2006.
[9]
C. Danescu-Niculescu-Mizil, G. Kossinets, J. Kleinberg, and L. Lee. How opinions are received by online communities: a case study on amazon.com helpfulness votes. In WWW '09, pages 141--150, New York, NY, USA, 2009. ACM.
[10]
A. Ghose and P. G. Ipeirotis. Designing novel review ranking systems: predicting the usefulness and impact of reviews. In ICEC '07, pages 303--310, New York, NY, USA, 2007. ACM.
[11]
S. Gollapudi and A. Sharma. An axiomatic approach for result diversification. In WWW, pages 381--390, 2009.
[12]
D. Hochbaum, editor. Approximation algorithms for NP-hard problems. PWS Publishing Company, 1997.
[13]
M. Hu and B. Liu. Mining and summarizing customer reviews. In KDD, pages 168--177, 2004.
[14]
M. Hu and B. Liu. Mining opinion features in customer reviews. In AAAI, pages 755--760, 2004.
[15]
S.-M. Kim, P. Pantel, T. Chklovski, and M. Pennacchiotti. Automatically assessing review helpfulness. In EMNLP, pages 423--430, Sydney, Australia, July 2006.
[16]
T. Lappas and D. Gunopulos. Efficient confident search in large review corpora. In ECML/PKDD (2), pages 195--210, 2010.
[17]
J. Liu, Y. Cao, C.-Y. Lin, Y. Huang, and M. Zhou. Low-quality product review detection in opinion summarization. In EMNLP-CoNLL, pages 334--342, 2007. Poster paper.
[18]
K. Liu, E. Terzi, and T. Grandison. Highlighting diverse concepts in documents. In SDM, pages 545--556, 2009.
[19]
Y. Liu, X. Huang, A. An, and X. Yu. Modeling and predicting the helpfulness of online reviews. In ICDM, pages 443--452, 2008.
[20]
Y. Lu, P. Tsaparas, A. Ntoulas, and L. Polanyi. Exploiting social context for review quality prediction. In WWW, 2010.
[21]
Y. Lu and C. Zhai. Opinion integration through semi-supervised topic modeling. In WWW, pages 121--130, 2008.
[22]
Y. Lu, C. Zhai, and N. Sundaresan. Rated aspect summarization of short comments. In WWW, pages 131--140, 2009.
[23]
A.-M. Popescu, B. Nguyen, and O. Etzioni. Opine: Extracting product features and opinions from reviews. In HLT/EMNLP, 2005.
[24]
D. Radev, T. Allison, S. Blair-Goldensohn, J. Blitzer, A. Çelebi, S. Dimitrov, E. Drabek, A. Hakim, W. Lam, D. Liu, J. Otterbacher, H. Qi, H. Saggion, S. Teufel, M. Topper, A. Winkel, and Z. Zhang. MEAD - a platform for multidocument multilingual text summarization. In LREC 2004, Lisbon, Portugal, May 2004.
[25]
F. Radlinski, P. N. Bennett, B. Carterette, and T. Joachims. Redundancy, diversity and interdependent document relevance. SIGIR Forum, 43(2):46--52, 2009.
[26]
F. Radlinski, R. Kleinberg, and T. Joachims. Learning diverse rankings with multi-armed bandits. In ICML, pages 784--791, 2008.
[27]
D. Rafiei, K. Bharat, and A. Shukla. Diversifying web search results. In WWW, 2010.
[28]
A. Slivkins, F. Radlinski, and S. Gollapudi. Learning optimally diverse rankings over large document collections. In ICML, 2010.
[29]
O. Tsur and A. Rappoport. Revrank: a fully unsupervised algorithm for selecting the most helpful book reviews. In ICWSM, 2009.
[30]
E. Vee, U. Srivastava, J. Shanmugasundaram, P. Bhat, and S. Amer-Yahia. Efficient computation of diverse query results. In ICDE, pages 228--236, 2008.
[31]
H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: a rating regression approach. In KDD, pages 783--792, 2010.
[32]
Z. Zhang and B. Varadarajan. Utility scoring of product reviews. In CIKM '06, pages 51--57, New York, NY, USA, 2006. ACM.

Cited By

View all
  • (2023)A Review Selection Method Based on Consumer Decision Phases in E-commerceACM Transactions on Information Systems10.1145/358726542:1(1-27)Online publication date: 21-Aug-2023
  • (2023)Hardness and approximation of submodular minimum linear ordering problemsMathematical Programming10.1007/s10107-023-02038-z208:1-2(277-318)Online publication date: 14-Dec-2023
  • (2022)Provable randomized rounding for minimum-similarity diversificationData Mining and Knowledge Discovery10.1007/s10618-021-00811-236:2(709-738)Online publication date: 4-Jan-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
August 2011
1446 pages
ISBN:9781450308137
DOI:10.1145/2020408
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 August 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. greedy algorithms
  2. review selection
  3. set cover

Qualifiers

  • Research-article

Conference

KDD '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)1
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)A Review Selection Method Based on Consumer Decision Phases in E-commerceACM Transactions on Information Systems10.1145/358726542:1(1-27)Online publication date: 21-Aug-2023
  • (2023)Hardness and approximation of submodular minimum linear ordering problemsMathematical Programming10.1007/s10107-023-02038-z208:1-2(277-318)Online publication date: 14-Dec-2023
  • (2022)Provable randomized rounding for minimum-similarity diversificationData Mining and Knowledge Discovery10.1007/s10618-021-00811-236:2(709-738)Online publication date: 4-Jan-2022
  • (2021)Improved approximations for min sum vertex cover and generalized min sum set coverProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458126(986-1005)Online publication date: 10-Jan-2021
  • (2021)Multi-Attribute Online Decision-Making Driven by Opinion MiningMathematics10.3390/math90808339:8(833)Online publication date: 11-Apr-2021
  • (2021)A Review Selection Method for Finding an Informative Subset from Online ReviewsINFORMS Journal on Computing10.1287/ijoc.2019.095033:1(280-299)Online publication date: 1-Jan-2021
  • (2021)Review Summary Generation in Online Systems: Frameworks for Supervised and Unsupervised ScenariosACM Transactions on the Web10.1145/344801515:3(1-33)Online publication date: 13-May-2021
  • (2021)Examining the impact of review tag function on product evaluation and information perception of popular productsInformation Systems and e-Business Management10.1007/s10257-021-00532-5Online publication date: 27-May-2021
  • (2020)Ranking Hotel Reviews Based on User's Aspects Importance and OpinionsProceedings of the 4th International Conference on Natural Language Processing and Information Retrieval10.1145/3443279.3443280(157-162)Online publication date: 18-Dec-2020
  • (2020)Designing Positive Behavior Change Experiences: a Systematic Review and Sentiment Analysis based on Online User Reviews of Fitness and Nutrition Mobile ApplicationsProceedings of the 19th International Conference on Mobile and Ubiquitous Multimedia10.1145/3428361.3428403(152-161)Online publication date: 22-Nov-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media