Abstract
Existing methods for Chinese sentiment labeling rely mainly on manually built sentiment corpora, but a word listed as a sentiment word in a corpus does not necessarily act as a sentiment word in every sentence. This paper proposes a new method that labels the words in a sentence by combining a deep convolutional neural network with a sequential algorithm. We first extract features composed of word vectors, part-of-speech vectors, and dependency syntax vectors to train the deep convolutional neural network, and then employ the sequential algorithm to obtain the sentiment annotation of the sentence. Experimental results verify that our method is effective for sentiment labeling. Because identifying implicit aspects can improve the completeness of sentiment analysis, we propose constructing tuples of aspect, sentiment shifter, sentiment intensity, and sentiment word once the sentiment label of each word in the sentence has been obtained. We develop a new algorithm for implicit aspect identification that takes into account two key factors, the aspect as a topic and the degree of match between aspects and sentiment words, together with human language habits. The experiment demonstrates that the algorithm can effectively identify implicit aspects. The paper thus addresses the problems of sentiment labeling and implicit aspect recognition in sentiment analysis. As a new tool for sentiment analysis, our method can be applied to enterprise management information analysis, such as online product reviews, online product reputation, brand image, and consumer preference management, and can also be used for sentiment analysis of large-scale text data.
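The abstract does not say which sequential algorithm is used, so the following is only a minimal sketch under an assumption: that per-word label probabilities (standing in for the output of the convolutional network) are combined with label-to-label transition probabilities through Viterbi-style decoding. The function name `viterbi_decode`, the label names, and all probabilities are hypothetical and are not taken from the paper.

```python
import math


def _log(p):
    """Log of a probability, mapping zero to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi_decode(emissions, transitions, labels):
    """Return the most probable label sequence for one sentence.

    emissions:   list with one dict per word, mapping label -> probability
                 (assumed here to come from a per-word classifier).
    transitions: dict mapping (previous_label, label) -> probability.
    labels:      list of all label names.
    """
    if not emissions:
        return []

    # Best log-score of any path ending in each label at the first word.
    best = [{lab: _log(emissions[0][lab]) for lab in labels}]
    back = []  # back[t-1][lab] = best previous label for lab at word t

    for t in range(1, len(emissions)):
        scores, pointers = {}, {}
        for lab in labels:
            prev = max(
                labels,
                key=lambda p: best[-1][p] + _log(transitions[(p, lab)]),
            )
            scores[lab] = (
                best[-1][prev]
                + _log(transitions[(prev, lab)])
                + _log(emissions[t][lab])
            )
            pointers[lab] = prev
        best.append(scores)
        back.append(pointers)

    # Follow the back-pointers from the best final label.
    path = [max(labels, key=lambda lab: best[-1][lab])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


if __name__ == "__main__":
    # Hypothetical labels: "O" (not a sentiment word), "S" (sentiment word).
    labels = ["O", "S"]
    emissions = [
        {"O": 0.9, "S": 0.1},
        {"O": 0.45, "S": 0.55},
        {"O": 0.9, "S": 0.1},
    ]
    transitions = {
        ("O", "O"): 0.8, ("O", "S"): 0.2,
        ("S", "O"): 0.2, ("S", "S"): 0.8,
    }
    print(viterbi_decode(emissions, transitions, labels))
```

In this toy input the middle word is slightly more likely to be "S" when judged in isolation, but the decoded sequence labels it "O" because the assumed transition probabilities favor staying in the same label. This illustrates why a sequence-level step can change a per-word decision, which is the motivation the abstract gives for not relying on a fixed sentiment lexicon.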
Cite this article
Feng, J., Cai, S. & Ma, X. Enhanced sentiment labeling and implicit aspect identification by integration of deep convolution neural network and sequential algorithm. Cluster Comput 22 (Suppl 3), 5839–5857 (2019). https://doi.org/10.1007/s10586-017-1626-5
DOI: https://doi.org/10.1007/s10586-017-1626-5