Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A Weakly Supervised WordNet-Guided Deep Learning Approach to Extracting Aspect Terms from Online Reviews

Published: 21 July 2020 Publication History

Abstract

The unstructured nature of online reviews makes it inefficient and inconvenient for prospective consumers to research and use in support of purchase decision making. The aspects of products provide a fine-grained meaningful perspective for understanding and organizing review texts. Traditional aspect term extraction approaches rely on discrete language models that treat words in isolation. Despite that continuous-space language models have demonstrated promise in addressing a wide range of problems, their application in aspect term extraction faces significant challenges. For instance, existing continuous-space language models typically require large collections of labeled data, which remain difficult to obtain in many domains. More importantly, previous methods are largely data driven but overlook the role of human knowledge in guiding model development. To address these limitations, this study designs and develops weakly supervised WordNet-guided deep learning to aspect term extraction. The approach draws on deep-level semantic information from WordNet to guide not only the selection representative seed terms but also the pruning of aspect candidate terms. The weak supervision is provided by a very small set of labeled data. We conduct a comprehensive evaluation of the proposed method using both direct and indirect methods. The evaluation results with Yelp restaurant reviews demonstrate that our proposed method consistently outperforms all baseline methods including discrete models and the state-of-the-art continuous-space language models for aspect term extraction across both direct and indirect evaluations. The research findings have broad research, technical, and practical implications for various stakeholders of online reviews.

References

[1]
M. S. Akhtar, D. Gupta, A. Ekbal, and P. Bhattacharyya. 2017. Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis. Knowledge-Based Systems 125, 116--135
[2]
T. Al-Moslmi, N. Omar, S. Abdullah, and M. Albared. 2017. Approaches to cross-domain sentiment analysis: A systematic literature review. IEEE Access PP, 99, 1.
[3]
S. Aranganayagi and K. Thangavel. 2008. Clustering categorical data using silhouette coefficient as a relocating measure. In Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA’07). 13--17.
[4]
N. Archak, A. Ghose, and P. G. Ipeirotis. 2011. Deriving the pricing power of product features by mining consumer reviews. Management Science 57, 8, 1485--1509.
[5]
J. Chen, K. Li, J. Zhu, and W. Chen. 2016. WarpLDA: A cache efficient O(1) algorithm for latent Dirichlet allocation. Proceedings of the VLDB Endowment 9, 10, 744--755.
[6]
S. Danesh, T. Sumner, and J. H. Martin. 2015. SGRank: Combining statistical and graphical methods to improve the state of the art in unsupervised keyphrase extraction. In Proceedings of the 4th Joint Conference on Lexical and Computational Semantics (*SemEval’15). 117--126.
[7]
S. Deng, A. Sinha, and H. Zhao. 2015. Resolving ambiguity in sentiment classification: The role of dependency features. In Proceedings of the 25th Workshop on Information Technology and Systems (WITS’15). 4.
[8]
S. Deng, A. P. Sinha, and H. Zhao. 2017. Adapting sentiment lexicons to domain-specific social media texts. Decision Support Systems 94, 65--76.
[9]
H. H. Do, P. W. C. Prasad, A. Maag, and A. Alsadoon. 2019. Deep learning for aspect-based sentiment analysis: A comparative review. Expert Systems with Applications 118, 272--299.
[10]
A. Duric and F. Song. 2012. Feature selection for sentiment analysis based on content and syntax models. Decision Support Systems 53, 4, 704--711.
[11]
K. Floyd, R. Freling, S. Alhoqail, H. Y. Cho, and T. Freling. 2014. How online product reviews affect retail sales. Journal of Retailing 90, 2, 217--232.
[12]
R. Fu, J. Guo, B. Qin, W. Che, H. Wang, and T. Liu. 2015. Learning semantic hierarchies: A continuous vector space approach. IEEE Transactions on Audio, Speech, and Language Processing 23, 3, 461--471.
[13]
J. Ge and Y. Qiu. 2008. Concept similarity matching based on semantic distance. In Proceedings of the 2008 4th International Conference on Semantics, Knowledge, and Grid. IEEE, Los Alamitos, CA, 380--383.
[14]
X. Guan, Z. Cheng, X. He, Y. Zhang, Z. Zhu, Q. Peng, and T.-S. Chua. 2018. Attentive aspect modeling for review-aware recommendation. ACM Transactions on Information Systems 37, 3, Article 28.
[15]
Z. Hai, G. Cong, K. Chang, P. Cheng, and C. Miao. 2017. Analyzing sentiments in one go: A supervised joint topic modeling approach. IEEE Transactions on Knowledge and Data Engineering 29, 6, 1172--1185.
[16]
F. Hogenboom, F. Frasincar, U. Kaymak, F. De Jong, and E. Caron. 2016. A survey of event extraction methods from text for decision support systems. Decision Support Systems 85, 12--22.
[17]
H. Hong, D. Xu, G.A. Wang, and W. Fan. 2017. Understanding the determinants of online review helpfulness: A meta-analytic investigation. Decision Support Systems 102, 1--11.
[18]
H.-W. Hu, Y.-L. Chen, and P.-T. Hsu. 2016. A novel approach to rate and summarize online reviews according to user-specified aspects. Journal of Electronic Commerce Research 17, 2, 132--152.
[19]
N. Hu, N. S. Koh, and S. K. Reddy. 2013. Ratings lead you to the product, reviews help you clinch it : The dynamics and impact of online review sentiments on products sales the mediating role of online review sentiments on product sales. Decision Support Systems 57, 42--53.
[20]
M. Huang, Q. Qian, and X. Zhu. 2017. Encoding syntactic knowledge in neural networks for sentiment classification. ACM Transactions on Information Systems 35, 3, 1--27.
[21]
M. Hwang, C. Choi, and P. Kim. 2011. Automatic enrichment of semantic relation network and its application to word sense disambiguation. IEEE Transactions on Knowledge and Data Engineering 23, 6, 845--858.
[22]
N. Kang, E. M. van Mulligen, and J. A. Kors. 2011. Comparing and combining chunkers of biomedical text. Journal of Biomedical Informatics 44, 2, 354--60.
[23]
Y. Kang and L. Zhou. 2017. RubE: Rule-based methods for extracting product features from online consumer reviews. Information and Management 54, 2, 166--176.
[24]
H. Y. Kim and C. H. Won. 2018. Forecasting the volatility of stock price index: A hybrid model integrating LSTM with multiple GARCH-type models. Expert Systems with Applications 103, 25--37.
[25]
M. Kraus and S. Feuerriegel. 2017. Decision support from financial disclosures with deep neural networks and transfer learning. Decision Support Systems 104, 38--48.
[26]
C. Leacock and M. Chodorow. 1998. Combining local context and WordNet similarity for word sense identification. In WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA, 265--283.
[27]
G. Lee, J. Jeong, S. Seo, C. Kim, and P. Kang. 2018. Sentiment classification with word localization based on weakly supervised learning with a convolutional neural network. Knowledge-Based Systems 152, 70--82.
[28]
C. Li, Y. Duan, H. Wang, Z. Zhang, A. Sun, and Z. Ma. 2017a. Enhancing topic modeling for short texts with auxiliary word embeddings. ACM Transactions on Information Systems 36, 2, 1--30.
[29]
X. Li, C. Wu, and F. Mai. 2018. The effect of online reviews on product sales: A joint sentiment-topic analysis. Information and Management 56, 2, 172--184.
[30]
Y. Li, B. Wei, Y. Liu, L. Yao, H. Chen, J. Yu, and W. Zhu. 2017b. Incorporating knowledge into neural network for text representation. Expert Systems with Applications 96, 103--114.
[31]
D. Lin. 1998. An information-theoretic definition of similarity. In Proceedings of the 15th International Conference on Machine Learning (ICML’98). 296--304.
[32]
S. M. Liu and J. H. Chen. 2015. A multi-label classification based approach for sentiment classification. Expert Systems with Applications 42, 3, 1083--1093.
[33]
X. Liu, W. Shi, A. Kale, C. Ding, and Q. Yu. 2017. Statistical learning of domain-specific quality-of-service features from user reviews. ACM Transactions on Internet Technology 17, 2, 1--24.
[34]
T. Loughran. and B. McDonald. 2011. When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. Journal of Finance 66, 1, 35--65.
[35]
G. Madjarov, D. Kocev, D. Gjorgjevikj, and S. Džeroski. 2012. An extensive experimental comparison of methods for multi-label learning. Pattern Recognition 45, 9, 3084--3104.
[36]
G. Marcus. 2018. Deep learning: A critical appraisal. arXiv:1801.00631.
[37]
R. Mihalcea, and P. Tarau. 2004. TextRank: Bringing order into text. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing.
[38]
T. Mikolov, K. Chen, G. Corrado, and J. Dean. 2013. Efficient estimation of word representations in vector space. arXiv:1301.3781.
[39]
G. A. Miller. 1995. WordNet: A lexical database for English. Communications of the ACM 38, 11, 39--41.
[40]
R. Munro. 2020. Human-in-the-Loop Machine Learning. Manning Publications.
[41]
R. Navigli. 2009. Word sense disambiguation: A survey. ACM Computing Surveys 41, 2, 1--69.
[42]
I. Peñalver-Martinez, F. Garcia-Sanchez, R. Valencia-Garcia, M. A. Rodriguez-Garcia, V. Moreno, A. Fraga, and J. L. Sanchez-Cervantes. 2014. Feature-based opinion mining through ontologies. Expert Systems with Applications 41, 13, 5995--6008.
[43]
J. Pennington, R. Socher, and C. D. Manning. 2014. GloVe: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP’14). 1532--1543.
[44]
D. H. Pham and A. C. Le. 2018. Learning multiple layers of knowledge representation for aspect based sentiment analysis. Data and Knowledge Engineering 114, 26--39.
[45]
S. Poria, E. Cambria, and A. Gelbukh. 2016. Aspect extraction for opinion mining with a deep convolutional neural network. Knowledge-Based Systems 108, 42--49.
[46]
Princeton University. 2012. What Is WordNet. Retrieved May 29, 2020 from http://wordnet.princeton.edu/.
[47]
Z. Qiao, X. Zhang, M. Zhou, A. Wang, and W. Fan. 2017. A domain oriented LDA model for mining product defects from online customer reviews. In Proceedings of the 50th Hawaii International Conference on System Sciences. 1821--1830.
[48]
P. Resnik. 1995. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence.
[49]
D. Sánchez, M. Batet, D. Isern, and A. Valls. 2012. Ontology-based semantic similarity: A new feature-based approach. Expert Systems with Applications 39, 9, 7718--7728.
[50]
M. Siering, A. V. Deokar, and C. Janze. 2018. Disentangling consumer recommendations: Explaining and predicting airline recommendations based on online reviews. Decision Support Systems 107, 52--63.
[51]
J. Tao, A. V. Deokar, and A. Deshmukh. 2018. Analysing forward-looking statements in initial public offering prospectuses: A text analytics approach. Journal of Business Analytics 1, 1, 54--70.
[52]
J. Tao, O. F. El-Gayar, A. V. Deokar, and Y. Chang. 2015. Term extraction and disambiguation for semantic knowledge enrichment: A case study on Initial Public Offering (IPO) Prospectus Corpus. In Proceedings of the 2015 48th Hawaii International Conference on System Sciences (HICSS-48). 3719--3728.
[53]
M.-F. Tsai, C.-J. Wang, and P.-C. Chien. 2016. Discovering finance keywords via continuous-space language models. ACM Transactions on Management Information Systems 7, 3, 1--17.
[54]
C. Wu, F. Wu, S. Wu, Z. Yuan, and Y. Huang. 2018. A hybrid unsupervised method for aspect term and opinion target extraction. Knowledge-Based Systems 148, 66--73.
[55]
Z. Wu and M. Palmer. 1994. Verbs semantics and lexical selection. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (ACL’94). 133--138.
[56]
W. Zhao, Z. Guan, L. Chen, X. He, D. Cai, B. Wang, and Q. Wang. 2018. Weakly-supervised deep embedding for product review sentiment analysis. IEEE Transactions on Knowledge and Data Engineering 30, 1, 185--197.
[57]
L. Zhou and D. Zhang. 2002. Analysis of an agent-based approach for discovering term semantic relationship. In Proceedings of the Americas Conference on Information Systems (AMCIS’02). 207.
[58]
D. Zimbra, A. Abbasi, D. Zeng, and H. Chen. 2018. The state-of-the-art in Twitter sentiment analysis: A review and benchmark evaluation. ACM Transactions on Management Information Systems 9, 2, 5.

Cited By

View all
  • (2024)Aspect-level sentiment analysis based on aspect-sentence graph convolution networkInformation Fusion10.1016/j.inffus.2023.102143104:COnline publication date: 1-Apr-2024
  • (2021)Suggestion Mining from Opinionated Text of Big Social Media DataComputers, Materials & Continua10.32604/cmc.2021.01672768:3(3323-3338)Online publication date: 2021
  • (2021)DSSAE: Deep Stacked Sparse Autoencoder Analytical Model for COVID-19 Diagnosis by Fractional Fourier EntropyACM Transactions on Management Information Systems10.1145/345135713:1(1-20)Online publication date: 5-Oct-2021
  • Show More Cited By

Index Terms

  1. A Weakly Supervised WordNet-Guided Deep Learning Approach to Extracting Aspect Terms from Online Reviews

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Management Information Systems
        ACM Transactions on Management Information Systems  Volume 11, Issue 3
        Special Section on WITS 2018 and Regular Articles
        September 2020
        140 pages
        ISSN:2158-656X
        EISSN:2158-6578
        DOI:10.1145/3407737
        Issue’s Table of Contents
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 21 July 2020
        Accepted: 01 May 2020
        Revised: 01 November 2019
        Received: 01 May 2019
        Published in TMIS Volume 11, Issue 3

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Aspect term extraction
        2. continuous-space language model
        3. deep learning
        4. semantic knowledge
        5. text analytics

        Qualifiers

        • Research-article
        • Research
        • Refereed

        Funding Sources

        • National Science Foundation

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)16
        • Downloads (Last 6 weeks)3
        Reflects downloads up to 30 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Aspect-level sentiment analysis based on aspect-sentence graph convolution networkInformation Fusion10.1016/j.inffus.2023.102143104:COnline publication date: 1-Apr-2024
        • (2021)Suggestion Mining from Opinionated Text of Big Social Media DataComputers, Materials & Continua10.32604/cmc.2021.01672768:3(3323-3338)Online publication date: 2021
        • (2021)DSSAE: Deep Stacked Sparse Autoencoder Analytical Model for COVID-19 Diagnosis by Fractional Fourier EntropyACM Transactions on Management Information Systems10.1145/345135713:1(1-20)Online publication date: 5-Oct-2021
        • (2021)Aspect-Based Pair-Wise Opinion Generation in Chinese automotive reviews: Design of the task, dataset and modelInformation Processing & Management10.1016/j.ipm.2021.10272958:6(102729)Online publication date: Nov-2021
        • (2021)Enriching WordNet with Subject Specific Out of Vocabulary Terms Using Existing OntologyData Engineering for Smart Systems10.1007/978-981-16-2641-8_19(205-212)Online publication date: 14-Nov-2021

        View Options

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media