Abstract
These days web users searching for opinions expressed by others on a particular product or service PS can turn to review repositories, such as Epinions.com or Imdb.com. While these repositories often provide a high quantity of reviews on PS, browsing through archived reviews to locate different opinions expressed on PS is a time-consuming and tedious task, and in most cases, a very labor-intensive process. To simplify the task of identifying reviews expressing positive, negative, and neutral opinions on PS, we introduce a simple, yet effective sentiment classifier, denoted SentiClass, which categorizes reviews on PS using the semantic, syntactic, and sentiment content of the reviews. To speed up the classification process, SentiClass summarizes each review to be classified using eSummar, a single-document, extractive, sentiment summarizer proposed in this paper, based on various sentence scores and anaphora resolution. SentiClass (eSummar, respectively) is domain and structure independent and does not require any training for performing the classification (summarization, respectively) task. Empirical studies conducted on two widely-used datasets, Movie Reviews and Game Reviews, in addition to a collection of Epinions.com reviews, show that SentiClass (i) is highly accurate in classifying summarized or full reviews and (ii) outperforms well-known classifiers in categorizing reviews.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Beineke, P., Hastie, T., Manning, C., Vaithyanathan, S.: An Exploration of Sentiment Summarization. In: Proc. of AAAI, pp. 12–15 (2003)
Gong, Y.: Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis. In: Proc. of ACM SIGIR, pp. 19–25 (2001)
Hu, X., Wu, B.: Classification and Summarization of Pros and Cons for Customer Reviews. In: Proc. of IEEE/WIC/ACM WI-IAT, pp. 73–76 (2009)
Jie, S., Xin, F., Wen, S., Quan-Xun, D.: BBS Sentiment Classification Based on Word Polarity. In: Proc. of ICCET, vol. 1, pp. 352–356 (2009)
Judea, P.: Probabilistic Reasoning in the Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Francisco (1988)
Kennedy, A., Inkpen, D.: Sentiment Classification of Movie Reviews Using Contextual Valence Shifters. Computational Intelligence 22(2), 110–125 (2006)
Koberstein, J., Ng, Y.-K.: Using Word Clusters to Detect Similar Web Documents. In: Lang, J., Lin, F., Wang, J. (eds.) KSEM 2006. LNCS (LNAI), vol. 4092, pp. 215–228. Springer, Heidelberg (2006)
Krestel, R., Bergler, S., Witte, R.: Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles. In: Proc. of LREC, pp. 2823–2828 (2008)
Ku, L., Liang, Y., Chen, H.: Opinion Extraction, Summarization and Tracking in News and Blog Corpora. In: Proc. of AAAI 2006 Spring Symposium on Computational Approaches to Analyzing Weblogs, pp. 100–107 (2006)
Lappin, S., Leass, H.: An Algorithm for Pronominal Anaphora Resolution. Computational Linguistics 20(4), 535–561 (1994)
Luger, G.: Artificial Intelligence: Structures and Strategies for Complex Problem Solving, 6th edn. Addison-Wesley, Reading (2009)
Luhn, H.: The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development 2(2), 159–165 (1958)
Melville, P., Gryc, W., Lawrence, R.: Sentiment Analysis of Blogs by Combining Lexical Knowledge with Text Classification. In: Proc. of KDD, pp. 1275–1284 (2009)
Minsky, M., Papert, S.: Perceptrons: An Introduction to Computational Geometry. The MIT Press, Cambridge (1972)
Na, J., Khoo, C., Wu, P.: Use of Negation Phrases in Automatic Sentiment Classification of Product Reviews. Library Collections, Acquisitions, and Technical Services 29(2), 180–191 (2005)
Pang, B., Lee, L.: A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. In: Proc. of ACL, pp. 271–278 (2004)
Pang, B., Lee, L.: Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval 2(1-2), 1–135 (2008)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment Classification using Machine Learning Techniques. In: Proc. of EMNLP, pp. 79–86 (2002)
Polanyi, L., Zaenen, A.: Contextual Valence Shifters. In: Computing Attitude and Affect in Text: Theory and Applications, pp. 1–10. Springer, Heidelberg (2006)
Radev, D., Hovy, E., McKeown, K.: Introduction to the Special Issue on Summarization. Computational Linguistics 28(4), 399–408 (2002)
Wiebe, J., Wilson, T., Bruce, R., Bell, M., Martin, M.: Learning Subjective Language. Computational Linguistics 30, 277–308 (2004)
Zhao, J., Liu, K., Wang, G.: Adding Redundant Features for CRFs-based Sentence Sentiment Classification. In: Proc. of EMNLP, pp. 117–126 (2008)
Zhuang, L., Jing, F., Zhu, X.: Movie Review Mining and Summarization. In: Proc. of ACM CIKM, pp. 43–50 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pera, M.S., Qumsiyeh, R., Ng, YK. (2010). An Unsupervised Sentiment Classifier on Summarized or Full Reviews. In: Chen, L., Triantafillou, P., Suel, T. (eds) Web Information Systems Engineering – WISE 2010. WISE 2010. Lecture Notes in Computer Science, vol 6488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17616-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-17616-6_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17615-9
Online ISBN: 978-3-642-17616-6
eBook Packages: Computer ScienceComputer Science (R0)