Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3330204.3330219acmotherconferencesArticle/Chapter ViewAbstractPublication PagessbsiConference Proceedingsconference-collections
research-article

Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems

Published: 20 May 2019 Publication History

Abstract

This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.

References

[1]
Steven Bird and Edward Loper. 2004. NLTK: the natural language toolkit. In Proceedings of the ACL 2004 on Interactive poster and demonstration sessions. Association for Computational Linguistics, 31.
[2]
Tawunrat Chalothorn and Jeremy Ellman. 2012. Sentiment analysis of web forums: Comparison between sentiwordnet and sentistrength. The 4th International Conference on Computer Technology and Development (ICCTD 2012). 24-25 November 2012.
[3]
Thiago Hellen O da Silva, Lavínia Matoso Freitas, and Marília Soares Mendes. 2017. Beyond traditional evaluations: user's view in app stores. In Proceedings of the XVI Brazilian Symposium on Human Factors in Computing Systems. ACM, 15.
[4]
JL De Lucca and Maria das Graças Volpe Nunes. 2002. Lematização versus Stemming. USP, UFSCar, UNESP, São Carlos, São Paulo (2002).
[5]
Steffen Hedegaard and Jakob Grue Simonsen. 2013. Extracting usability and user experience information from online user reviews. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 2089--2098.
[6]
Hannu Korhonen, Juha Arrasvuori, and Kaisa Väänänen-Vainio-Mattila. 2010. Let users tell the story: evaluating user experience with experience reports. In CHI'10 Extended Abstracts on Human Factors in Computing Systems. ACM, 4051--4056.
[7]
Afonso Matheus Sousa Lima, Paloma Bispo dos Santos Silva Silva, Lívia Almada Cruz, and Marilia Soares Mendes. 2017. Investigating the polarity of user postings in a Social System. In International Conference on Social Computing and Social Media. Springer, 246--257.
[8]
Steven Loria, P Keen, M Honnibal, R Yankovsky, D Karesh, E Dempsey, et al. 2014. Textblob: simplified text processing. Secondary TextBlob: Simplified Text Processing (2014).
[9]
Marilia S. Mendes. 2015. MALTU -- Um modelo para avaliação da interação em sistemas sociais a partir da linguagem textual do usuário. Ph.D. Dissertation. Universidade Federal do Ceará, Programa de Pós-Graduação em Ciência da Computação, Fortaleza.
[10]
Marilia S Mendes and Elizabeth Furtado. 2018. An Experience of Textual Evaluation Using the MALTU Methodology. In International Conference on Social Computing and Social Media. Springer, 236--246.
[11]
Marília S Mendes, Elizabeth Furtado, Vasco Furtado, and Miguel F de Castro. 2014. How do users express their emotions regarding the social system in use? A classification of their postings by using the emotional analysis of Norman. In International Conference on Social Computing and Social Media. Springer, 229--241.
[12]
Marília S Mendes, Elizabeth Furtado, Vasco Furtado, and Miguel F de Castro. 2015. Investigating Usability and User Experience from the user postings in Social Systems. In International Conference on Social Computing and Social Media. Springer, 216--228.
[13]
Marília Soares Mendes and Elizabeth Sucupira Furtado. 2017. UUX-Posts: a tool for extracting and classifying postings related to the use of a system. In Proceedings of the 8th Latin American Conference on Human-Computer Interaction. ACM, 2.
[14]
Joel Larocca Neto, Alexandre D Santos, Celso AA Kaestner, Neto Alexandre, D Santos, et al. 2000. Document clustering and text summarization. (2000).
[15]
Developers of Scrapy. 2016. Scrapy 1.5 documentation. https://docs.scrapy.org/en/latest/
[16]
Thomas Olsson and Markus Salo. 2012. Narratives of satisfying and unsatisfying experiences of current mobile augmented reality applications. In Proceedings of the SIGCHI conference on human factors in computing systems. ACM, 2779--2788.
[17]
Viviane Orengo and Christian Huyck. 2001. A stemming algorithmm for the portuguese language. In spire. IEEE, 0186.
[18]
Timo Partala and Aleksi Kallinen. 2011. Understanding the most satisfying and unsatisfying user experiences: Emotions, psychological needs, and context. Interacting with computers 24, 1 (2011), 25--34.
[19]
Juan Ramos et al. 2003. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, Vol. 242. 133--142.
[20]
Vitor Rolim, Rafael Ferreira, and Evandro Costa. 2016. Identificação Automática de Dúvidas em Fóruns Educacionais. In Brazilian Symposium on Computers in Education (Simpósio Brasileiro de Informática na Educação-SBIE), Vol. 27. Sociedade Brasileira de Computação, Uberlândia, 936.
[21]
Gayane Shalunts, Gerhard Backfried, and Prinz Prinz. 2014. Sentiment analysis of German social media data for natural disasters. In ISCRAM.
[22]
Mike Thelwall. 2017. The Heart and soul of the web? Sentiment strength detection in the social web with SentiStrength. In Cyberemotions. Springer, 119--134.
[23]
Bruno Trstenjak, Sasa Mikac, and Dzenana Donko. 2014. KNN with TF-IDF based Framework for Text Categorization. Procedia Engineering 69 (2014), 1356--1364.
[24]
Alexandre N Tuch, Rune Trusell, and Kasper Hornbæk. 2013. Analyzing users' narratives to understand experience with interactive products. In Proceedings of the SIGCHI Conference on human factors in computing systems. ACM, 2079--2088.
[25]
David Vilares, Mike Thelwall, and Miguel A Alonso. 2015. The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweets. Journal of Information Science 41, 6 (2015), 799--813.

Cited By

View all
  • (2019)Temporal analysis of posts related to useProceedings of the 18th Brazilian Symposium on Human Factors in Computing Systems10.1145/3357155.3358482(1-10)Online publication date: 22-Oct-2019

Index Terms

  1. Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        SBSI '19: Proceedings of the XV Brazilian Symposium on Information Systems
        May 2019
        623 pages
        ISBN:9781450372374
        DOI:10.1145/3330204
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        In-Cooperation

        • SBC: Brazilian Computer Society

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 20 May 2019

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Lexical Classifiers
        2. Sentiment Analysis
        3. Systems Evaluation

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        SBSI'19

        Acceptance Rates

        Overall Acceptance Rate 181 of 557 submissions, 32%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 30 Dec 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2019)Temporal analysis of posts related to useProceedings of the 18th Brazilian Symposium on Human Factors in Computing Systems10.1145/3357155.3358482(1-10)Online publication date: 22-Oct-2019

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media