Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2872427.2883086acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Public Access

Mining Aspect-Specific Opinion using a Holistic Lifelong Topic Model

Published: 11 April 2016 Publication History

Abstract

Aspect-level sentiment analysis or opinion mining consists of several core sub-tasks: aspect extraction, opinion identification, polarity classification, and separation of general and aspect-specific opinions. Various topic models have been proposed by researchers to address some of these sub-tasks. However, there is little work on modeling all of them together. In this paper, we first propose a holistic fine-grained topic model, called the JAST (Joint Aspect-based Sentiment Topic) model, that can simultaneously model all of above problems under a unified framework. To further improve it, we incorporate the idea of lifelong machine learning and propose a more advanced model, called the LAST (Lifelong Aspect-based Sentiment Topic) model. LAST automatically mines the prior knowledge of aspect, opinion, and their correspondence from other products or domains. Such knowledge is automatically extracted and incorporated into the proposed LAST model without any human involvement. Our experiments using reviews of a large number of product domains show major improvements of the proposed models over state-of-the-art baselines.

References

[1]
R. Agrawal, R. Srikant, et al. Fast algorithms for mining association rules. In Proc. 20th int. conf. very large data bases, VLDB, volume 1215, pages 487--499, 1994.
[2]
D. Andrzejewski, X. Zhu, and M. Craven. Incorporating domain knowledge into topic modeling via dirichlet forest priors. In Proceedings of the 26th Annual International Conference on Machine Learning, pages 25--32. ACM, 2009.
[3]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. the Journal of machine Learning research, 3:993--1022, 2003.
[4]
S. Branavan, H. Chen, J. Eisenstein, and R. Barzilay. Learning document-level semantic properties from free-text annotations. Journal of Artificial Intelligence Research, pages 569--603, 2009.
[5]
S. Brody and N. Elhadad. An unsupervised aspect-sentiment model for online reviews. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 804--812. Association for Computational Linguistics, 2010.
[6]
J. Chang, S. Gerrish, C. Wang, J. L. Boyd-graber, and D. M. Blei. Reading tea leaves: How humans interpret topic models. In Advances in neural information processing systems, pages 288--296, 2009.
[7]
Z. Chen and B. Liu. Mining topics in documents: standing on the shoulders of big data. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1116--1125. ACM, 2014.
[8]
Z. Chen and B. Liu. Topic Modeling using Topics from Many Domains, Lifelong Learning and Big Data. In ICML, pages 703--711, 2014.
[9]
Z. Chen, N. Ma, and B. Liu. Lifelong Learning for Sentiment Classification. In ACL, pages 750--756, 2015.
[10]
Q. Diao, M. Qiu, C.-Y. Wu, A. J. Smola, J. Jiang, and C. Wang. Jointly modeling aspects, ratings and sentiments for movie recommendation (jmars). In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 193--202. ACM, 2014.
[11]
L. Fang and M. Huang. Fine granular aspect analysis using latent structural models. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2, pages 333--337. Association for Computational Linguistics, 2012.
[12]
T. L. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences, 101(suppl 1):5228--5235, 2004.
[13]
T. L. Griffiths, M. Steyvers, D. M. Blei, and J. B. Tenenbaum. Integrating topics and syntax. In Advances in neural information processing systems, pages 537--544, 2004.
[14]
Y. He, C. Lin, and H. Alani. Automatically extracting polarity-bearing topics for cross-domain sentiment classification. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 123--131. Association for Computational Linguistics, 2011.
[15]
G. Heinrich. A generic approach to topic models. In Machine Learning and Knowledge Discovery in Databases, pages 517--532. Springer, 2009.
[16]
M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 168--177. ACM, 2004.
[17]
Y. Hu, J. Boyd-Graber, B. Satinoff, and A. Smith. Interactive topic modeling. Machine learning, 95(3):423--469, 2014.
[18]
J. Jagarlamudi, H. Daumé III, and R. Udupa. Incorporating lexical priors into topic models. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 204--213. Association for Computational Linguistics, 2012.
[19]
Y. Jo and A. H. Oh. Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 815--824. ACM, 2011.
[20]
J.-H. Kang, J. Ma, and Y. Liu. Transfer topic modeling with ease and scalability. In SDM, pages 564--575. SIAM, 2012.
[21]
N. Kawamae. Latent interest-topic model: finding the causal relationships behind dyadic data. In Proceedings of the 19th ACM international conference on Information and knowledge management, pages 649--658. ACM, 2010.
[22]
S. Kim, J. Zhang, Z. Chen, A. H. Oh, and S. Liu. A hierarchical aspect-sentiment model for online reviews. In AAAI, 2013.
[23]
A. Lazaridou, I. Titov, and C. Sporleder. A bayesian model for joint unsupervised induction of sentiment, aspect and discourse representations. In ACL (1), pages 1630--1639, 2013.
[24]
P. Li, Y. Wang, W. Gao, and J. Jiang. Generating aspect-oriented multi-document summarization with event-aspect model. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 1137--1146. Association for Computational Linguistics, 2011.
[25]
C. Lin and Y. He. Joint sentiment/topic model for sentiment analysis. In Proceedings of the 18th ACM conference on Information and knowledge management, pages 375--384. ACM, 2009.
[26]
B. Liu. Sentiment analysis and opinion mining (synthesis lectures on human language technologies). Morgan & Claypool Publishers, 2012.
[27]
Y. Lu and C. Zhai. Opinion integration through semi-supervised topic modeling. In Proceedings of the 17th international conference on World Wide Web, pages 121--130. ACM, 2008.
[28]
Y. Lu, C. Zhai, and N. Sundaresan. Rated aspect summarization of short comments. In Proceedings of the 18th international conference on World wide web, pages 131--140. ACM, 2009.
[29]
H. Mahmoud. Polya Urn Models. Chapman & Hall/CRC Texts in Statistical Science, 2008.
[30]
Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th international conference on World Wide Web, pages 171--180. ACM, 2007.
[31]
D. Mimno, H. M. Wallach, E. Talley, M. Leenders, and A. McCallum. Optimizing semantic coherence in topic models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 262--272. Association for Computational Linguistics, 2011.
[32]
S. Moghaddam and M. Ester. The flda model for aspect-based opinion mining: addressing the cold start problem. In Proceedings of the 22nd international conference on World Wide Web, pages 909--918. International World Wide Web Conferences Steering Committee, 2013.
[33]
A. Mukherjee and B. Liu. Aspect extraction through semi-supervised modeling. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1, pages 339--348. Association for Computational Linguistics, 2012.
[34]
D. Newman, J. H. Lau, K. Grieser, and T. Baldwin. Automatic evaluation of topic coherence. In HLT-NAACL, pages 100--108, 2010.
[35]
D. Newman, Y. Noh, E. Talley, S. Karimi, and T. Baldwin. Evaluating topic models for digital libraries. In Proceedings of the 10th annual joint conference on Digital libraries, pages 215--224. ACM, 2010.
[36]
S. J. Pan and Q. Yang. A Survey on Transfer Learning. IEEE Trans. Knowl. Data Eng., 22(10):1345--1359, 2010.
[37]
M. Paul and M. Dredze. Factorial lda: Sparse multi-dimensional text models. In Advances in Neural Information Processing Systems, pages 2582--2590, 2012.
[38]
M. Paul and R. Girju. Cross-cultural analysis of blogs and forums with mixed-collection topic models. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3-Volume 3, pages 1408--1417. Association for Computational Linguistics, 2009.
[39]
M. Paul and R. Girju. A two-dimensional topic-aspect model for discovering multi-faceted topics. Urbana, 51:61801, 2010.
[40]
M. J. Paul and M. Dredze. Drug extraction from the web: Summarizing drug experiences with multi-dimensional topic models. In HLT-NAACL, pages 168--178, 2013.
[41]
J. Petterson, W. Buntine, S. M. Narayanamurthy, T. S. Caetano, and A. J. Smola. Word features for latent dirichlet allocation. In Advances in Neural Information Processing Systems, pages 1921--1929, 2010.
[42]
R. Salakhutdinov and G. E. Hinton. Deep boltzmann machines. In International Conference on Artificial Intelligence and Statistics, pages 448--455, 2009.
[43]
C. Sauper and R. Barzilay. Automatic aggregation by joint modeling of aspects and values. Journal of Artificial Intelligence Research, 2013.
[44]
S. Thrun. Lifelong Learning Algorithms. In S. Thrun and L. Pratt, editors, Learning To Learn. Kluwer Academic Publishers, 1998.
[45]
I. Titov and R. T. McDonald. A joint model of text and aspect ratings for sentiment summarization. In ACL, volume 8, pages 308--316. Citeseer, 2008.
[46]
H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: a rating regression approach. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 783--792. ACM, 2010.
[47]
L. Wang, K. Liu, Z. Cao, J. Zhao, and G. de Melo. Sentiment-aspect extraction based on restricted boltzmann machines. 2015.
[48]
G. Xue, W. Dai, Q. Yang, and Y. Yu. Topic-bridged PLSA for cross-domain text classification. In SIGIR, pages 627--634, 2008.
[49]
B. Yang and C. Cardie. Joint inference for fine-grained opinion extraction. In ACL (1), pages 1640--1649, 2013.
[50]
S.-h. Yang, S. P. Crain, and H. Zha. Bridging the language gap: topic adaptation for documents with different technicality. In International Conference on Artificial Intelligence and Statistics, pages 823--831, 2011.
[51]
C. Zhai, A. Velivelli, and B. Yu. A cross-collection mixture model for comparative text mining. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 743--748. ACM, 2004.
[52]
W. X. Zhao, J. Jiang, H. Yan, and X. Li. Jointly modeling aspects and opinions with a maxent-lda hybrid. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 56--65. Association for Computational Linguistics, 2010.
[53]
G. Zipf. Selective studies and the principle of relative frequencies in language, 1932.

Cited By

View all
  • (2024)A Structural Topic and Sentiment-Discourse Model for Text AnalysisSSRN Electronic Journal10.2139/ssrn.4020651Online publication date: 2024
  • (2024)Context-Aware Dynamic Word Embeddings for Aspect Term ExtractionIEEE Transactions on Affective Computing10.1109/TAFFC.2023.326294115:1(144-156)Online publication date: Jan-2024
  • (2024)The role of lifelong machine learning in bridging the gap between human and machine learning: A scientometric analysisWIREs Data Mining and Knowledge Discovery10.1002/widm.152614:2Online publication date: 10-Jan-2024
  • Show More Cited By

Index Terms

  1. Mining Aspect-Specific Opinion using a Holistic Lifelong Topic Model

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '16: Proceedings of the 25th International Conference on World Wide Web
    April 2016
    1482 pages
    ISBN:9781450341431

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    International World Wide Web Conferences Steering Committee

    Republic and Canton of Geneva, Switzerland

    Publication History

    Published: 11 April 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. aspect-specific opinion
    2. lifelong machine learning
    3. opinion mining
    4. topic model

    Qualifiers

    • Research-article

    Funding Sources

    • NCI
    • NSF
    • Bosch

    Conference

    WWW '16
    Sponsor:
    • IW3C2
    WWW '16: 25th International World Wide Web Conference
    April 11 - 15, 2016
    Québec, Montréal, Canada

    Acceptance Rates

    WWW '16 Paper Acceptance Rate 115 of 727 submissions, 16%;
    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)97
    • Downloads (Last 6 weeks)17
    Reflects downloads up to 23 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Structural Topic and Sentiment-Discourse Model for Text AnalysisSSRN Electronic Journal10.2139/ssrn.4020651Online publication date: 2024
    • (2024)Context-Aware Dynamic Word Embeddings for Aspect Term ExtractionIEEE Transactions on Affective Computing10.1109/TAFFC.2023.326294115:1(144-156)Online publication date: Jan-2024
    • (2024)The role of lifelong machine learning in bridging the gap between human and machine learning: A scientometric analysisWIREs Data Mining and Knowledge Discovery10.1002/widm.152614:2Online publication date: 10-Jan-2024
    • (2023)Dealing with Drift of Adaptation Spaces in Learning-based Self-Adaptive Systems Using Lifelong Self-AdaptationACM Transactions on Autonomous and Adaptive Systems10.1145/363642819:1(1-57)Online publication date: 13-Dec-2023
    • (2023)Policy generation network for zero‐shot policy learningComputational Intelligence10.1111/coin.1259139:5(707-733)Online publication date: 4-Jul-2023
    • (2023)Lifelong Bayesian Learning Machines for Streaming Industrial Big DataIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2022.319883353:3(1554-1565)Online publication date: Mar-2023
    • (2023)Hierarchical Lifelong Machine Learning With “Watchdog”IEEE Transactions on Big Data10.1109/TBDATA.2021.31108629:1(63-74)Online publication date: 1-Feb-2023
    • (2023)Aspect Specific Opinion Expression Extraction Using Attention Based LSTM-CRF NetworkComputational Linguistics and Intelligent Text Processing10.1007/978-3-031-23804-8_34(442-454)Online publication date: 26-Feb-2023
    • (2022)Hierarchical lifelong topic modeling using rules extracted from network communitiesPLOS ONE10.1371/journal.pone.026448117:3(e0264481)Online publication date: 3-Mar-2022
    • (2022)On Paradigm of Industrial Big Data Analytics: From Evolution to RevolutionIEEE Transactions on Industrial Informatics10.1109/TII.2022.319039418:12(8373-8388)Online publication date: Dec-2022
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media