Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Measuring praise and criticism: Inference of semantic orientation from association

Published: 01 October 2003 Publication History

Abstract

The evaluative character of a word is called its semantic orientation. Positive semantic orientation indicates praise (e.g., "honest", "intrepid") and negative semantic orientation indicates criticism (e.g., "disturbing", "superfluous"). Semantic orientation varies in both direction (positive or negative) and degree (mild to strong). An automated system for measuring semantic orientation would have application in text classification, text filtering, tracking opinions in online discussions, analysis of survey responses, and automated chat systems (chatbots). This article introduces a method for inferring the semantic orientation of a word from its statistical association with a set of positive and negative paradigm words. Two instances of this approach are evaluated, based on two different statistical measures of word association: pointwise mutual information (PMI) and latent semantic analysis (LSA). The method is experimentally tested with 3,596 words (including adjectives, adverbs, nouns, and verbs) that have been manually labeled positive (1,614 words) and negative (1,982 words). The method attains an accuracy of 82.8% on the full test set, but the accuracy rises above 95% when the algorithm is allowed to abstain from classifying mild words.

References

[1]
Agresti, A. 1996. An Introduction to Categorical Data Analysis. Wiley, New York.]]
[2]
Bartell, B. T., Cottrell, G. W., and Belew, R. K. 1992. Latent semantic indexing is an optimal special case of multidimensional scaling. Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, 161--167.]]
[3]
Budanitsky, A. and Hirst, G. 2001. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Workshop on WordNet and Other Lexical Resources, Second meeting of the North American Chapter of the Association for Computational Linguistics (Pittsburgh, Pa).]]
[4]
Church, K. W. and Hanks, P. 1989. Word association norms, mutual information and lexicography. In Proceedings of the 27th Annual Conference of the Association of Computational Linguistics. Association for Computational Linguistics, New Brunswick, N.J., 76--83.]]
[5]
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., and Harshman, R. 1990. Indexing by latent semantic analysis. J. Amer. Soc. Inf. Sci. 41, 6, 391--407.]]
[6]
Dunning, T. 1993. Accurate methods for the statistics of surprise and coincidence. Comput. Ling. 19, 61--74.]]
[7]
Firth, J. R. 1957. A synopsis of linguistic theory 1930--1955. In Studies in Linguistic Analysis, Philological Society, Oxford, England, 1--32. (Reprinted in F. R. Palmer (ed.), Selected Papers of J. R. Firth 1952--1959, Longman, London, England, 1968).]]
[8]
Golub, G. H. and Van Loan, C. F. 1996. Matrix Computations. Third edition. Johns Hopkins University Press, Baltimore, Md.]]
[9]
Hatzivassiloglou, V. and McKeown, K. R. 1997. Predicting the semantic orientation of adjectives. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and the 8th Conference of the European Chapter of the ACL. Association for Computational Linguistics, New Brunswick, N.J., 174--181.]]
[10]
Hatzivassiloglou, V. and Wiebe, J. M. 2000. Effects of adjective orientation and gradability on sentence subjectivity. Proceedings of 18th International Conference on Computational Linguistics. Association for Computational Linguistics, New Brunswick, N.J.]]
[11]
Hearst, M. A. 1992. Direction-based text interpretation as an information access refinement. In Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval. P. Jacobs, Ed. Lawrence Erlbaum Associates, Mahwah, N.J.]]
[12]
Kamps, J. and Marx, M. 2002. Words with attitude. In Proceedings of the 1st International Conference on Global WordNet. CIIL, Mysore, India, 332--341.]]
[13]
Landauer, T. K. 2002. On the computational basis of learning and cognition: Arguments from LSA. In The Psychology of Learning and Motivation, B. H. Ross, Ed. To appear.]]
[14]
Landauer, T. K. and Dumais, S. T. 1997. A solution to Plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psych. Rev. 104, 211--240.]]
[15]
Littman, M. L. 2001. Language games and other meaningful pursuits. Presentation slides. (http://www.cs.rutgers.edu/∼mlittman/talks/CA-lang.ppt).]]
[16]
Manning, C. D. and Schütze, H. 1999. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, Mass.]]
[17]
Miller, G. A. 1990. WordNet: An on-line lexical database. Int. J. Lexicog. 3, 4, 235--312.]]
[18]
Osgood, C. E., Suci, G. J., and Tannenbaum, P. H. 1957. The Measurement of Meaning. University of Illinois Press, Chicago, Ill.]]
[19]
Pang, B., Lee, L., and Vaithyanathan, S. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing. 79--86.]]
[20]
Resnik, P. 1995. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence. Morgan-Kaufmann, San Mateo, Calif., 448--453.]]
[21]
Schütze, H. 1993. Word space. In Advances in Neural Information Processing Systems 5. S. J. Hanson, J. D. Cowan, and C. L. Giles, Ed. Morgan-Kaufmann., San Mateo, Calif., 895--902.]]
[22]
Smadja, F. 1993. Retrieving collocations from Text: Xtract. Comput. Ling. 19, 143--177.]]
[23]
Spertus, E. 1997. Smokey: Automatic recognition of hostile messages. In Proceedings of the Conference on Innovative Applications of Artificial Intelligence. AAAI Press, Menlo Park, Calif., 1058--1065.]]
[24]
Stone, P. J., Dunphy, D. C., Smith, M. S., and Ogilvie, D. M. 1966. The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge, Mass.]]
[25]
Tong, R. M. 2001. An operational system for detecting and tracking opinions in on-line discussions. Working Notes of the ACM SIGIR 2001 Workshop on Operational Text Classification. ACM, New York, 1--6.]]
[26]
Turney, P. D. 2001. Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. In Proceedings of the 12th European Conference on Machine Learning. Springer-Verlag, Berlin, Germany, 491--502.]]
[27]
Turney, P. D. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the Association for Computational Linguistics 40th Anniversary Meeting. Association for Computational Linguistics, New Brunswick, N.J.]]
[28]
van Rijsbergen, C. J. 1979. Information Retrieval (2nd edition), Butterworths, London, England.]]
[29]
Wiebe, J. M. 2000. Learning subjective adjectives from corpora. In Proceedings of the 17th National Conference on Artificial Intelligence. AAAI Press, Menlo Park, Calif.]]
[30]
Wiebe, J. M., Bruce, R., Bell, M., Martin, M., & Wilson, T. 2001. A corpus study of evaluative and speculative language. In Proceedings of the 2nd ACL SIG on Dialogue Workshop on Discourse and Dialogue (Aalborg, Denmark).]]

Cited By

View all
  • (2024)Potential Impact of Sustainable Business Practices: Brand Equity in Fast Food and Millennials’ Purchase IntentionsSustainability10.3390/su1609376816:9(3768)Online publication date: 30-Apr-2024
  • (2024)A Framework Model of Mining Potential Public Opinion Events Pertaining to Suspected Research Integrity Issues with the Text Convolutional Neural Network model and a Mixed Event ExtractorInformation10.3390/info1506030315:6(303)Online publication date: 24-May-2024
  • (2024)Are Your Comments Positive? A Self-Distillation Contrastive Learning Method for Analyzing Online Public OpinionElectronics10.3390/electronics1313250913:13(2509)Online publication date: 26-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Information Systems
ACM Transactions on Information Systems  Volume 21, Issue 4
October 2003
179 pages
ISSN:1046-8188
EISSN:1558-2868
DOI:10.1145/944012
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2003
Published in TOIS Volume 21, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. latent semantic analysis
  2. mutual information
  3. semantic association
  4. semantic orientation
  5. text classification
  6. text mining
  7. unsupervised learning
  8. web mining

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)108
  • Downloads (Last 6 weeks)13
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Potential Impact of Sustainable Business Practices: Brand Equity in Fast Food and Millennials’ Purchase IntentionsSustainability10.3390/su1609376816:9(3768)Online publication date: 30-Apr-2024
  • (2024)A Framework Model of Mining Potential Public Opinion Events Pertaining to Suspected Research Integrity Issues with the Text Convolutional Neural Network model and a Mixed Event ExtractorInformation10.3390/info1506030315:6(303)Online publication date: 24-May-2024
  • (2024)Are Your Comments Positive? A Self-Distillation Contrastive Learning Method for Analyzing Online Public OpinionElectronics10.3390/electronics1313250913:13(2509)Online publication date: 26-Jun-2024
  • (2024)Financial Sentiment Analysis: Techniques and ApplicationsACM Computing Surveys10.1145/364945156:9(1-42)Online publication date: 24-Apr-2024
  • (2024)Sentiment Analysis Using Deep Learning Approaches on Multi-Domain Dataset in Telugu LanguageJournal of Information & Knowledge Management10.1142/S021964922450018723:03Online publication date: 10-Jan-2024
  • (2024)A Deep Semi-Supervised Community Detection Based on Point-Wise Mutual InformationIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.332781011:3(3444-3456)Online publication date: Jun-2024
  • (2024)Conceptual Level Semantic Sentiment Analysis Using Twitter Data2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS)10.1109/ADICS58448.2024.10533498(1-8)Online publication date: 18-Apr-2024
  • (2024)Understanding Environmental Posts: Sentiment and Emotion Analysis of Social Media DataIEEE Access10.1109/ACCESS.2024.337158512(33504-33523)Online publication date: 2024
  • (2024)Community relations discovery methods for users in Fancircle based on sentiment analysis in ChinaData Technologies and Applications10.1108/DTA-09-2023-0570Online publication date: 29-Jan-2024
  • (2024)JST and rJST: joint estimation of sentiment and topics in textual data using a semi-supervised approachCommunication Methods and Measures10.1080/19312458.2024.2383453(1-19)Online publication date: 18-Oct-2024
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media