DOI: 10.1145/1835804.1835903
research-article

Latent aspect rating analysis on review text data: a rating regression approach

Published: 25 July 2010

Abstract

In this paper, we define and study a new opinionated text data analysis problem called Latent Aspect Rating Analysis (LARA), which aims at analyzing opinions expressed about an entity in an online review at the level of topical aspects to discover each individual reviewer's latent opinion on each aspect as well as the relative emphasis on different aspects when forming the overall judgment of the entity. We propose a novel probabilistic rating regression model to solve this new text mining problem in a general way. Empirical experiments on a hotel review data set show that the proposed latent rating regression model can effectively solve the problem of LARA, and that the detailed analysis of opinions at the level of topical aspects enabled by the proposed model can support a wide range of application tasks, such as aspect opinion summarization, entity ranking based on aspect ratings, and analysis of reviewers' rating behavior.
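The abstract does not spell out the model, so the following is only a toy sketch of the quantities it names, not the authors' latent rating regression. It assumes, purely for illustration, that an overall score can be viewed as a weighted average of per-aspect ratings, and it shows how aspect ratings could drive entity ranking. The function names, aspect names, hotel names, and all numbers are hypothetical.

```python
from typing import Dict, List


def overall_rating(aspect_ratings: Dict[str, float],
                   aspect_weights: Dict[str, float]) -> float:
    """Weighted average of per-aspect ratings (illustrative assumption only)."""
    total = sum(aspect_weights.values())
    if total <= 0:
        raise ValueError("aspect weights must sum to a positive value")
    return sum(aspect_ratings[a] * w for a, w in aspect_weights.items()) / total


def rank_by_aspect(entities: Dict[str, Dict[str, float]], aspect: str) -> List[str]:
    """Order entity names from highest to lowest rating on one aspect."""
    return sorted(entities, key=lambda name: entities[name][aspect], reverse=True)


# Hypothetical aspect ratings for two hotels (not taken from the paper's data).
hotels = {
    "Hotel A": {"value": 2.0, "room": 4.0, "location": 5.0},
    "Hotel B": {"value": 4.5, "room": 3.0, "location": 3.5},
}

# Hypothetical emphasis of one reviewer: value matters most.
weights = {"value": 0.5, "room": 0.25, "location": 0.25}

score = overall_rating(hotels["Hotel A"], weights)
ranking = rank_by_aspect(hotels, "value")
print(score, ranking)
```

In the LARA setting, both the aspect ratings and the weights are latent and have to be inferred from the review text and the overall rating; the sketch takes them as given only to show how they relate to the downstream tasks the abstract lists.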

Supplementary Material

JPG File (kdd2010_wang_lar_01.jpg)
MOV File (kdd2010_wang_lar_01.mov)




    Information & Contributors

    Information

    Published In

    KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
    July 2010
    1240 pages
    ISBN: 9781450300551
    DOI: 10.1145/1835804
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. algorithms
    2. experimentation

    Qualifiers

    • Research-article

    Conference

    KDD '10

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months): 150
    • Downloads (Last 6 weeks): 23
    Reflects downloads up to 24 Nov 2024

    Citations

    Cited By

    • (2024) StreetLines: A Smart and Scalable Tourism Platform Based on Efficient Knowledge-Mining. Digital 4:3 (676-697). DOI: 10.3390/digital4030034. Online publication date: 11-Aug-2024
    • (2024) From outputs to insights: a survey of rationalization approaches for explainable text classification. Frontiers in Artificial Intelligence 7. DOI: 10.3389/frai.2024.1363531. Online publication date: 23-Jul-2024
    • (2024) Exploring the determinants of the user experience in P2P payment systems in Spain: a text mining approach. Financial Innovation 10:1. DOI: 10.1186/s40854-023-00496-0. Online publication date: 2-Jan-2024
    • (2024) A Comparative Analysis of Text-Based Explainable Recommender Systems. Proceedings of the 18th ACM Conference on Recommender Systems (105-115). DOI: 10.1145/3640457.3688069. Online publication date: 8-Oct-2024
    • (2024) Enhancing the Rationale-Input Alignment for Self-explaining Rationalization. 2024 IEEE 40th International Conference on Data Engineering (ICDE) (2218-2230). DOI: 10.1109/ICDE60146.2024.00176. Online publication date: 13-May-2024
    • (2024) Taking the Chat out of Chatbot? Collecting User Reviews with Chatbots and Web Forms. Journal of Management Information Systems 41:1 (146-177). DOI: 10.1080/07421222.2023.2301175. Online publication date: 19-Feb-2024
    • (2024) Personalized Neural Network-Based Aggregation Function in Multi-Criteria Collaborative Filtering. Journal of King Saud University - Computer and Information Sciences (101922). DOI: 10.1016/j.jksuci.2024.101922. Online publication date: Jan-2024
    • (2024) ChatGPT paraphrased product reviews can confuse consumers and undermine their trust in genuine reviews. Can you tell the difference? Information Processing and Management: an International Journal 61:6. DOI: 10.1016/j.ipm.2024.103842. Online publication date: 1-Nov-2024
    • (2024) Aspect-level Item Recommendation Based on User Reviews with Variational Autoencoders. Information Sciences (120655). DOI: 10.1016/j.ins.2024.120655. Online publication date: Apr-2024
    • (2024) Bipartite mixed membership distribution-free model. A novel model for community detection in overlapping bipartite weighted networks. Expert Systems with Applications: An International Journal 235:C. DOI: 10.1016/j.eswa.2023.121088. Online publication date: 10-Jan-2024
