Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2766462.2767758acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Parametric and Non-parametric User-aware Sentiment Topic Models

Published: 09 August 2015 Publication History

Abstract

The popularity of Web 2.0 has resulted in a large number of publicly available online consumer reviews created by a demographically diverse user base. Information about the authors of these reviews, such as age, gender and location, provided by many on-line consumer review platforms may allow companies to better understand the preferences of different market segments and improve their product design, manufacturing processes and marketing campaigns accordingly. However, previous work in sentiment analysis has largely ignored these additional user meta-data. To address this deficiency, in this paper, we propose parametric and non-parametric User-aware Sentiment Topic Models (USTM) that incorporate demographic information of review authors into topic modeling process in order to discover associations between market segments, topical aspects and sentiments. Qualitative examination of the topics discovered using USTM framework in the two datasets collected from popular online consumer review platforms as well as quantitative evaluation of the methods utilizing those topics for the tasks of review sentiment classification and user attribute prediction both indicate the utility of accounting for demographic information of review authors in opinion mining.

References

[1]
D. M. Blei, T. L. Griffiths, and M. I. Jordan. The Nested Chinese Restaurant Process and Bayesian Nonparametric Inference of Topic Hierarchies. JACM, 57(7), 2010.
[2]
D. M. Blei and J. D. McAuliffe. Supervised Topic Models. In Proceedings of the 20th NIPS, pages 121--128, 2007.
[3]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[4]
W. Gao, N. Yoshinaga, N. Kaji, and M. Kitsuregawa. Modeling User Leniency and Product Popularity for Sentiment Classification. In Proceedings of the 6th IJCNLP, pages 1107--1111, 2013.
[5]
T. Hofmann. Probabilistic Latent Semantic Indexing. In Proceedings of the 22nd SIGIR, pages 50--57, 1999.
[6]
M. Hu and B. Liu. Mining and Summarizing Customer Reviews. In Proceedings of the 10th ACM SIGKDD, pages 168--177, 2004.
[7]
N. Jakob and I. Gurevych. Extracting Opinion Targets in a Single- and Cross-domain Setting with Conditional Random Fields. In Proceedings of the 2010 EMNLP, pages 1035--1045, 2010.
[8]
Y. Jo and A. H. Oh. Aspect and Sentiment Unification Model for Online Review Analysis. In Proceedings of the 4th WSDM, pages 815--824, 2011.
[9]
J. H. Kim, D. Kim, S. Kim, and A. Oh. Modeling Topic Hierarchies with the Recursive Chinese Restaurant Process. In Proceedings of the 21st ACM CIKM, pages 783--792, 2012.
[10]
S. Kim, J. Zhang, Z. Chen, A. H. Oh, and S. Liu. A Hierarchical Aspect-Sentiment Model for Online Reviews. In Proceedings of the 27th AAAI, pages 526--533, 2013.
[11]
A. Kotov and E. Agichtein. The Importance of Being Socially-Savvy: Quantifying the Influence of Social Networks on Microblog Retrieval. In Proceedings of the 22nd CIKM, pages 1905--1908, 2013.
[12]
A. Kotov, V. Rakesh, E. Agichtein, and C. K. Reddy. Geographical Latent Variable Models for Microblog Retrieval. In Proceedings of the 37th ECIR, pages 635--647, 2015.
[13]
A. Kotov, Y. Wang, and E. Agichtein. Leveraging Geographical Metadata to Improve Search over Social Media. In Proceedings of the 22nd WWW, pages 151--152, 2013.
[14]
H. Lakkaraju, C. Bhattacharyya, I. Bhattacharya, and S. Merugu. Exploiting Coherence for the Simultaneous Discovery of Latent Facets and associated Sentiments. In Proceedings of the 11th SDM, pages 498--509, 2011.
[15]
F. Li, C. Han, M. Huang, X. Zhu, Y.-J. Xia, S. Zhang, and H. Yu. Structure-aware Review Mining and Summarization. In Proceedings of the 23rd COLING, pages 653--661, 2010.
[16]
F. Li, S. Wang, S. Liu, and M. Zhang. SUIT: A Supervised User-Item Based Topic Model for Sentiment Analysis. In Proceedings of the 28th AAAI, pages 1636--1642, 2014.
[17]
C. Lin and Y. He. Joint Sentiment/Topic Model for Sentiment Analysis. In Proceedings of the 18th CIKM, pages 375--384, 2009.
[18]
C. Lin, Y. He, R. Everson, and S. Ruger. Weakly Supervised Joint Sentiment-Topic Detection from Text. IEEE Transactions on Knowledge and Data Engineering, 24(6):1134--1145, 2012.
[19]
T. Ma and X. Wan. Opinion Target Extraction in Chinese News Comments. In Proceedings of the 23rd COLING, pages 782--790, 2010.
[20]
Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs. In Proceedings of the 16th WWW, pages 171--180, 2007.
[21]
S. Moghaddam and M. Ester. ILDA: Interdependent LDA Model for Learning Latent Aspects and Their Ratings from Online Product Reviews. In Proceedings of the 34th SIGIR, pages 665--674, 2011.
[22]
A. Mukherjee and B. Liu. Aspect Extraction Through Semi-supervised Modeling. In Proceedings of the 50th ACL, pages 339--348, 2012.
[23]
A.-M. Popescu and O. Etzioni. Extracting Product Features and Opinions from Reviews. In Proceedings of the 2005 EMNLP-HLT, pages 339--346, 2005.
[24]
D. Ramage, C. D. Manning, and S. Dumais. Partially Labeled Topic Models for Interpretable Text Mining. In Proceedings of the 17th SIGKDD, pages 457--465, 2011.
[25]
C. Sauper, A. Haghighi, and R. Barzilay. Content Models with Attitude. In Proceedings of the 49th ACL-HLT, pages 350--358, 2011.
[26]
C. Tan, L. Lee, J. Tang, L. Jiang, M. Zhou, and P. Li. User-level Sentiment Analysis Incorporating Social Networks. In Proceedings of the 17th ACM SIGKDD, pages 1397--1405, 2011.
[27]
I. Titov and R. Mcdonald. A Joint Model of Text and Aspect Ratings for Sentiment Summarization. In Proceedings of 46th ACL-HLT, pages 308--316, 2008.
[28]
I. Titov and R. McDonald. Modeling Online Reviews with Multi-grain Topic Models. In Proceedings of the 17th WWW, pages 111--120, 2008.
[29]
P. D. Turney and M. L. Littman. Measuring Praise and Criticism: Inference of Semantic Orientation from Association. ACM TOIS, 21(4):315--346, 2003.
[30]
H. Wang, Y. Lu, and C. Zhai. Latent Aspect Rating Analysis on Review Text Data: A Rating Regression Approach. In Proceedings of the 16th SIGKDD, pages 783--792, 2010.
[31]
T. Zhao, C. Li, Q. Ding, and L. Li. User-sentiment Topic Model: Refining User's Topics with Sentiment Information. In Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics, 2012.

Cited By

View all
  • (2023)Topic Models with Sentiment Priors Based on Distributed RepresentationsJournal of Mathematical Sciences10.1007/s10958-023-06525-8273:4(639-652)Online publication date: 24-Jun-2023
  • (2023)Deep Learning for Natural Language Processing: A SurveyJournal of Mathematical Sciences10.1007/s10958-023-06519-6273:4(533-582)Online publication date: 26-Jun-2023
  • (2022)A Space-Time Framework for Sentiment Scope Analysis in Social MediaBig Data and Cognitive Computing10.3390/bdcc60401306:4(130)Online publication date: 3-Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
August 2015
1198 pages
ISBN:9781450336215
DOI:10.1145/2766462
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 August 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. opinion mining
  2. topic models

Qualifiers

  • Research-article

Conference

SIGIR '15
Sponsor:

Acceptance Rates

SIGIR '15 Paper Acceptance Rate 70 of 351 submissions, 20%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)3
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Topic Models with Sentiment Priors Based on Distributed RepresentationsJournal of Mathematical Sciences10.1007/s10958-023-06525-8273:4(639-652)Online publication date: 24-Jun-2023
  • (2023)Deep Learning for Natural Language Processing: A SurveyJournal of Mathematical Sciences10.1007/s10958-023-06519-6273:4(533-582)Online publication date: 26-Jun-2023
  • (2022)A Space-Time Framework for Sentiment Scope Analysis in Social MediaBig Data and Cognitive Computing10.3390/bdcc60401306:4(130)Online publication date: 3-Nov-2022
  • (2022)Opinion Mining Using Enriched Joint Sentiment-Topic ModelInternational Journal of Information Technology & Decision Making10.1142/S021962202250058422:01(313-375)Online publication date: 28-Sep-2022
  • (2022)A survey on review summarization and sentiment classificationKnowledge and Information Systems10.1007/s10115-022-01728-y64:9(2289-2327)Online publication date: 1-Aug-2022
  • (2021)A Latent Topic Analysis and Visualization Framework for Category-Level Target Promotion in the SupermarketThe Review of Socionetwork Strategies10.1007/s12626-021-00092-7Online publication date: 27-Sep-2021
  • (2020)Enriched Latent Dirichlet Allocation for Sentiment AnalysisExpert Systems10.1111/exsy.1252737:4Online publication date: 28-Jan-2020
  • (2020)Characterizing User Decision based on Argumentative Reviews2020 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT)10.1109/BDCAT50828.2020.00002(161-170)Online publication date: Dec-2020
  • (2019)Community-based influence maximization for viral marketingApplied Intelligence10.1007/s10489-018-1387-849:6(2137-2150)Online publication date: 1-Jun-2019
  • (2019)Student Sentiment Analysis Using Gamification for Education ContextProceedings of the International Conference on Advanced Intelligent Systems and Informatics 201910.1007/978-3-030-31129-2_30(329-339)Online publication date: 2-Oct-2019
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media