Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A Meta-Framework for Modeling the Human Reading Process in Sentiment Analysis

Published: 11 August 2016 Publication History

Abstract

This article introduces a sentiment analysis approach that adopts the way humans read, interpret, and extract sentiment from text. Our motivation builds on the assumption that human interpretation should lead to the most accurate assessment of sentiment in text. We call this automated process Human Reading for Sentiment (HRS). Previous research in sentiment analysis has produced many frameworks that can fit one or more of the HRS aspects; however, none of these methods has addressed them all in one approach. HRS provides a meta-framework for developing new sentiment analysis methods or improving existing ones. The proposed framework provides a theoretical lens for zooming in and evaluating aspects of any sentiment analysis method to identify gaps for improvements towards matching the human reading process. Key steps in HRS include the automation of humans low-level and high-level cognitive text processing. This methodology paves the way towards the integration of psychology with computational linguistics and machine learning to employ models of pragmatics and discourse analysis for sentiment analysis. HRS is tested with two state-of-the-art methods; one is based on feature engineering, and the other is based on deep learning. HRS highlighted the gaps in both methods and showed improvements for both.

References

[1]
Ahmed Abbasi, Hsinchun Chen, and Arab Salem. 2008. Sentiment analysis in multiple languages: Feature selection for opinion classification in web forums. ACM Trans. Inform. Syst. 26, 3 (2008), 12.
[2]
Ahmed Abbasi, Stephen France, Zhu Zhang, and Hsinchun Chen. 2011. Selecting attributes for sentiment classification using feature relation networks. IEEE Trans. Knowl. Data Eng. 23, 3 (2011), 447--462.
[3]
Alan Baddeley and Graham Hitch. 2010. Working memory. (2010). http://www.scholarpedia.org/article/Working_memory.
[4]
Michele Banko, Michael J. Cafarella, Stephen Soderland, Matthew Broadhead, and Oren Etzioni. 2007. Open information extraction for the web. In IJCAI, Vol. 7. 2670--2676.
[5]
Israela Becker and Vered Aharonson. 2010. Last but definitely not least: On the role of the last sentence in automatic polarity-classification. In Proceedings of the acL 2010 Conference Short Papers. Association for Computational Linguistics, 331--335.
[6]
Eric Breck, Yejin Choi, and Claire Cardie. 2007. Identifying expressions of opinion in context. In IJCAI, Vol. 7. 2683--2688.
[7]
Hsinchun Chen and David Zimbra. 2010. AI and opinion mining. IEEE Intell. Syst. 25, 3 (2010), 74--80.
[8]
Yan Dang, Yulei Zhang, and Hsinchun Chen. 2010. A lexicon-enhanced method for sentiment classification: An experiment on online product reviews. IEEE Intell. Syst. 25, 4 (2010), 46--53.
[9]
Kushal Dave, Steve Lawrence, and David M. Pennock. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of the 12th International Conference on World Wide Web. ACM, 519--528.
[10]
Gwen Dewar. 2012. Parenting for the science-minded. (2012). http://www.parentingscience.com/working-mem ory.html
[11]
Xiaowen Ding, Bing Liu, and Philip S. Yu. 2008. A holistic lexicon-based approach to opinion mining. In Proceedings of the 2008 International Conference on Web Search and Data Mining. ACM, 231--240.
[12]
Andrea Esuli and Fabrizio Sebastiani. 2006. Sentiwordnet: A publicly available lexical resource for opinion mining. In Proceedings of LREC, Vol. 6. 417--422.
[13]
Oren Etzioni, Michele Banko, and Michael J. Cafarella. 2006. Machine reading. In AAAI, Vol. 6. 1517--1519.
[14]
Oren Etzioni, Michael Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, and Alexander Yates. 2005. Unsupervised named-entity extraction from the web: An experimental study. Artif. Intell. 165, 1 (2005), 91--134.
[15]
Noura Farra, Elie Challita, Rawad Abou Assi, and Hazem Hajj. 2010. Sentence-level and document-level sentiment mining for arabic texts. In Data Mining Workshops (ICDMW), 2010 IEEE International Conference on. IEEE, 1114--1119.
[16]
Christiane Fellbaum. 1999. WordNet. Wiley Online Library.
[17]
W. Fletcher. 2002. KfNgram. Retrieved July 29 (2002), 2009.
[18]
Andrea Frome, Greg S. Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Tomas Mikolov, and others. 2013. Devise: A deep visual-semantic embedding model. In Advances in Neural Information Processing Systems. 2121--2129.
[19]
Xavier Glorot, Antoine Bordes, and Yoshua Bengio. 2011. Domain adaptation for large-scale sentiment classification: A deep learning approach. In Proceedings of the 28th International Conference on Machine Learning (ICML-11). 513--520.
[20]
William Grabe and Fredricka L. Stoller. 2013. Teaching and Researching: Reading. Routledge.
[21]
Ammar Hassan, Ahmed Abbasi, and Daniel Zeng. 2013. Twitter sentiment analysis: A bootstrap ensemble framework. In Social Computing (SocialCom), 2013 International Conference on. IEEE, 357--364.
[22]
Jim Hendler. 2013. Broad data: Exploring the emerging web of data. Big Data 1, 1 (2013), 18--20.
[23]
Roula Hobeica, Hazem Hajj, and Wassim El Hajj. 2011. Machine reading for notion-based sentiment mining. In Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on. IEEE, 75--80.
[24]
Lindsay Hoffman. 2013. Reflecting on Twitter and Its Implications for Elections and Democracy. (2013). http://www.huffingtonpost.com/lindsay-hoffman/twitter-elections_b_2568989.html.
[25]
Vincent Foster Hopper. 1986. 1001 Pitfalls in English Grammar. Barron’s Educational Series.
[26]
Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 168--177.
[27]
Ioannis Manoussos Katakis, Iraklis Varlamis, and George Tsatsaronis. 2014. PYTHIA: Employing lexical and semantic features for sentiment analysis. In Machine Learning and Knowledge Discovery in Databases. Springer, 448--451.
[28]
Soo-Min Kim and Eduard Hovy. 2004. Determining the sentiment of opinions. In Proceedings of the 20th International Conference on Computational Linguistics. Association for Computational Linguistics, 1367.
[29]
Svetlana Kiritchenko, Xiaodan Zhu, and Saif M. Mohammad. 2014. Sentiment analysis of short informal texts. J. Artif. Intell. Res. (2014), 723--762.
[30]
Daniel J. Kurland. 2000. Inference: The Process. (2000). www.criticalreading.com/inference_process.htm.
[31]
Cody Kwok, Oren Etzioni, and Daniel S. Weld. 2001. Scaling question answering to the web. ACM Trans. Inform. Syst. 19, 3 (2001), 242--262.
[32]
Quoc V. Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. arXiv preprint arXiv:1405.4053 (2014).
[33]
Yann LeCun and Yoshua Bengio. 1995. Convolutional networks for images, speech, and time series. The Handbook of Brain Theory and Neural Networks 3361, 10 (1995).
[34]
Chenghua Lin, Yulan He, Richard Everson, and Stefan Ruger. 2012. Weakly supervised joint sentiment-topic detection from text. IEEE Trans. Knowl. Data Eng. 24, 6 (2012), 1134--1145.
[35]
Bing Liu and Lei Zhang. 2012. A survey of opinion mining and sentiment analysis. In Mining Text Data. Springer, 415--463.
[36]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems. 3111--3119.
[37]
Yasuhide Miura, Shigeyuki Sakaki, Keigo Hattori, and Tomoko Ohkuma. 2014. TeamX: A sentiment analyzer with enhanced lexicon mapping and weighting scheme for unbalanced data. SemEval 2014 (2014), 628.
[38]
Saif Mohammad, Svetlana Kiritchenko, and Xiaodan Zhu. 2013a. NRC-Canada: Building the state-of-the-art in sentiment analysis of tweets. In Proceedings of the Seventh International Workshop on Semantic Evaluation Exercises (SemEval-2013). Atlanta, Georgia, USA.
[39]
Saif M. Mohammad, Svetlana Kiritchenko, and Xiaodan Zhu. 2013b. NRC-Canada: Building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:1308.6242 (2013).
[40]
Saif M. Mohammad and Peter D. Turney. 2010. Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon. In Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text. Association for Computational Linguistics, 26--34.
[41]
Tim OKeefe and Irena Koprinska. 2009. Feature selection and weighting methods in sentiment analysis. ADCS 2009 (2009), 67.
[42]
Bo Pang and Lillian Lee. 2004. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 271.
[43]
Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing-Volume 10. Association for Computational Linguistics, 79--86.
[44]
Rawkes. 2011. The moment Twitter lost Steve Jobs. (2011). http://rawkes.com/articles/the-moment-twitter- lost-steve-jobs
[45]
Ellen Riloff, Siddharth Patwardhan, and Janyce Wiebe. 2006. Feature subsumption for opinion analysis. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 440--448.
[46]
Sara Rosenthal, Preslav Nakov, Alan Ritter, and Veselin Stoyanov. 2014. Semeval-2014 task 9: Sentiment analysis in twitter. Proc. SemEval (2014).
[47]
Stefan Schoenmackers, Oren Etzioni, and Daniel S. Weld. 2008. Scaling textual inference to the web. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 79--88.
[48]
Richard Socher, Cliff C. Lin, Chris Manning, and Andrew Y. Ng. 2011. Parsing natural scenes and natural language with recursive neural networks. In Proceedings of the 28th International Conference on Machine Learning (ICML-11). 129--136.
[49]
Richard Socher, Alex Perelygin, Jean Y. Wu, Jason Chuang, Christopher D. Manning, Andrew Y. Ng, and Christopher Potts. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Citeseer, 1631--1642.
[50]
Duyu Tang, Bing Qin, and Ting Liu. 2015a. Document modeling with gated recurrent neural network for sentiment classification. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 1422--1432.
[51]
Duyu Tang, Bing Qin, and Ting Liu. 2015b. Learning semantic representations of users and products for document level sentiment classification. In Proc. ACL.
[52]
Duyu Tang, Furu Wei, Bing Qin, Ting Liu, and Ming Zhou. 2014. Coooolll: A deep learning system for twitter sentiment classification. SemEval 2014 (2014), 208.
[53]
Kristina Toutanova, Dan Klein, Christopher D. Manning, and Yoram Singer. 2003. Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1. Association for Computational Linguistics, 173--180.
[54]
Zhaopeng Tu, Yifan He, Jennifer Foster, Josef van Genabith, Qun Liu, and Shouxun Lin. 2012. Identifying high-impact sub-structures for convolution kernels in document-level sentiment classification. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2. Association for Computational Linguistics, 338--343.
[55]
Peter D. Turney. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 417--424.
[56]
Casey Whitelaw, Navendu Garg, and Shlomo Argamon. 2005. Using appraisal groups for sentiment analysis. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management. ACM, 625--631.
[57]
Janyce Wiebe, Theresa Wilson, and Claire Cardie. 2005. Annotating expressions of opinions and emotions in language. Language Resour. Eval. 39, 2--3 (2005), 165--210.
[58]
Fei Wu and Daniel S. Weld. 2007. Autonomously semantifying wikipedia. In Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management. ACM, 41--50.
[59]
Yiming Yang and Jan O. Pedersen. 1997. A comparative study on feature selection in text categorization. In ICML, Vol. 97. 412--420.
[60]
Ainur Yessenalina, Yisong Yue, and Claire Cardie. 2010. Multi-level structured models for document-level sentiment classification. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1046--1056.
[61]
Alireza Yousefpour, Roliana Ibrahim, and Haza Nuzly Abdull Hamed. 2014. A novel feature reduction method in sentiment analysis. Int. J. Innovat. Comput. 4, 1 (2014).
[62]
Xiaoui Yu, Yang Liu, Xiangi Huang, and Aijun An. 2012. Mining online reviews for predicting sales performance: A case study in the movie domain. IEEE Trans. Knowl. Data Eng. 24, 4 (2012), 720--734.

Cited By

View all
  • (2024)Towards a robust deep learning framework for Arabic sentiment analysisNatural Language Processing10.1017/nlp.2024.35(1-35)Online publication date: 6-Sep-2024
  • (2022)Automated Islamic Jurisprudential Legal Opinions Generation Using Artificial IntelligencePertanika Journal of Science and Technology10.47836/pjst.30.2.1630:2(1135-1156)Online publication date: 14-Mar-2022
  • (2021)Empirical study of sentiment analysis tools and techniques on societal topicsJournal of Intelligent Information Systems10.1007/s10844-020-00616-756:2(379-407)Online publication date: 1-Apr-2021
  • Show More Cited By

Index Terms

  1. A Meta-Framework for Modeling the Human Reading Process in Sentiment Analysis

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Information Systems
    ACM Transactions on Information Systems  Volume 35, Issue 1
    January 2017
    233 pages
    ISSN:1046-8188
    EISSN:1558-2868
    DOI:10.1145/2986034
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 August 2016
    Accepted: 01 May 2016
    Revised: 01 April 2016
    Received: 01 August 2015
    Published in TOIS Volume 35, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Sentiment analysis
    2. human reading
    3. psychology
    4. supervised learning and notions

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • Qatar National Research Fund (a member of Qatar Foundation)

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Towards a robust deep learning framework for Arabic sentiment analysisNatural Language Processing10.1017/nlp.2024.35(1-35)Online publication date: 6-Sep-2024
    • (2022)Automated Islamic Jurisprudential Legal Opinions Generation Using Artificial IntelligencePertanika Journal of Science and Technology10.47836/pjst.30.2.1630:2(1135-1156)Online publication date: 14-Mar-2022
    • (2021)Empirical study of sentiment analysis tools and techniques on societal topicsJournal of Intelligent Information Systems10.1007/s10844-020-00616-756:2(379-407)Online publication date: 1-Apr-2021
    • (2021)Sentiment classification in social media data by combining triplet belief functionsJournal of the Association for Information Science and Technology10.1002/asi.2460573:7(968-991)Online publication date: 18-Nov-2021
    • (2020)How the Accuracy and Confidence of Sensitivity Classification Affects Digital Sensitivity ReviewACM Transactions on Information Systems10.1145/341733439:1(1-34)Online publication date: 12-Oct-2020
    • (2020)OutdoorSentACM Transactions on Information Systems10.1145/338518638:3(1-28)Online publication date: 21-Apr-2020
    • (2020)Mining Opinion and Sentiment from Arabic Text2020 Seventh International Conference on Information Technology Trends (ITT)10.1109/ITT51279.2020.9320774(165-168)Online publication date: 25-Nov-2020
    • (2019)A Survey of Opinion Mining in ArabicACM Transactions on Asian and Low-Resource Language Information Processing10.1145/329566218:3(1-52)Online publication date: 7-May-2019
    • (2018)A supervised aspect level sentiment model to predict overall sentiment on tweeter documentsInternational Journal of Metadata, Semantics and Ontologies10.5555/3302773.330277713:1(33-41)Online publication date: 1-Jan-2018
    • (2017)A Sentiment Treebank and Morphologically Enriched Recursive Deep Models for Effective Sentiment Analysis in ArabicACM Transactions on Asian and Low-Resource Language Information Processing10.1145/308657616:4(1-21)Online publication date: 13-Jul-2017

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media