Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2735522.2735528dlproceedingsArticle/Chapter ViewAbstractPublication PagescasconConference Proceedingsconference-collections
research-article

Monitoring sentiment in open source mailing lists: exploratory study on the apache ecosystem

Published: 03 November 2014 Publication History

Abstract

Large software projects, both open and closed source, are constructed and maintained collaboratively by teams of developers and testers, who are typically geographically dispersed. This dispersion creates a distance between team members, hiding feelings of distress or (un)happiness from their manager, which prevents him or her from using remediation techniques for those feelings. This paper evaluates the usage of automatic sentiment analysis to identify distress or happiness in a development team. Since mailing lists are one of the most popular media for discussion in distributed software projects, we extracted sentiment values of the user and developer mailing lists of two of the most successful and mature projects of the Apache software foundation. The results show that (1) user and developer mailing lists carry both positive and negative sentiment and have a slightly different focus, while (2) work is needed to customize automatic sentiment analysis techniques to the domain of software engineering, since they lack precision when facing technical terms

References

[1]
I. Robertson and C. Cooper, Wellbeing: Productivity and Happiness at Work. Palgrave Macmillan, 2011.
[2]
S. Rao, Happiness At Work. McGraw-Hill Education (India) Pvt Limited, 2010.
[3]
C. Barron and A. Barron, The Creativity Cure: How to Build Happiness with Your Own Two Hands. Scribner, 2013.
[4]
T. M. Amabile, S. G. Barsade, J. S. Mueller, and B. M. Staw, "Affect and Creativity at Work," Administrative Science Quarterly, vol. 50, no. 3, pp. 367--403, 2005.
[5]
A. J. Oswald, E. Proto, and D. Sgroi, "Happiness and productivity," Bonn, IZA discussion papers 4645, 2009.
[6]
"Predicting Turnover of Employees from Measured Job Attitudes." Organizational Behavior & Human Performance, vol. 13, no. 2, pp. 233--243, Apr. 1975.
[7]
I. Mistrík. J. Grundy, A. Hoek, and J. Whitehead, Eds., Collaborative Software Engineering. Springer, 2010.
[8]
A. Bacchelli, T. D. Sasso, M. D'Ambros, and M. Lanza, "Content classification of development emails," in Proc. of the 34th Intl. Conf. on Software Engineering (ICSE), 2012, pp. 375--385.
[9]
P. C. Rigby, D. M. German, and M.-A. Storey, "Open source software peer review practices: A case study of the apache server," in Proceedings of the 30th International Conference on Software Engineering, ser. ICSE '08. New York, NY, USA: ACM, 2008, pp. 541--550.
[10]
A. Bacchelli, M. Lanza, and R. Robbes, "Linking e-mails and source code artifacts," in Proceedings of the 32Nd ACM/IEEE International Conference on Software Engineering - Volume 1, ser. ICSE '10. New York, NY, USA: ACM, 2010, pp. 375--384.
[11]
N. Mishra and C. K. Jha, "Article: Classification of opinion mining techniques," International Journal of Computer Applications, vol. 56, no. 13, pp. 1--6, October 2012, published by Foundation of Computer Science, New York, USA.
[12]
M. Thelwall, "Heart and soul: Sentiment strength detection in the social web with sentistrength (summary book chapter)," In press.
[13]
B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1-2, pp. 1--135, Jan. 2008.
[14]
A. Murgia, P. Tourani, B. Adams, and M. Ortu, "Do developers feel emotions? an exploratory analysis of emotions in software artifacts," in Proceedings of the 11th IEEE Working Conference on Mining Software Repositories (MSR), Hyderabad, India, May 2014.
[15]
C. Osgood, G. Suci, and P. Tenenbaum, The Measurement of meaning. Urbana:: University of Illinois Press, 1957.
[16]
R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning, A. Y. Ng, and C. Potts, "Recursive deep models for semantic compositionality over a sentiment treebank," in Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA: Association for Computational Linguistics, October 2013, pp. 1631--1642.
[17]
A. Aue and M. Gamon, "Customizing sentiment classifiers to new domains: A case study," in RANLP, 2005.
[18]
M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede, "Lexicon-based methods for sentiment analysis," Comput. Linguist., vol. 37, no. 2, pp. 267--307, Jun. 2011.
[19]
M. Thelwall, K. Buckley, G. Paltoglou, D. Cai, and A. Kappas, "Sentiment in short strength detection informal text," J. Am. Soc. Inf. Sci. Technol., vol. 61, no. 12, pp. 2544--2558, Dec. 2010.
[20]
O. Kucuktunc, B. B. Cambazoglu, I. Weber, and H. Ferhatosmanoglu, "A large-scale sentiment analysis for yahoo! answers," in Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, ser. WSDM '12. New York, NY, USA: ACM, 2012, pp. 633--642.
[21]
E. Guzman and B. Bruegge, "Towards emotional awareness in software development teams," in Proc. of the 2013 9th Joint Meeting on Foundations of Software Engineering (ESEC/FSE), 2013, pp. 671--674.
[22]
R. Kabacoff, R in Action: Data Analysis and Graphics With R. MANNING PUBN, 2013.
[23]
A. Guzzi, A. Bacchelli, M. Lanza, M. Pinzger, and A. van Deursen, "Communication in open source software development mailing lists," in The 10th Working Conference on Mining Software Repositories, 2013, pp. 277--286.
[24]
D. T. Campbell and J. C. Stanley, Experimental and quasi-experimental designs for generalized causal inference. Houghton Mifflin, 1963.
[25]
S. Ambler, "Agile modeling: effective practices for extreme programming and the unified process". John Wiley & Sons, Inc. New York, 2002.
[26]
"Linus Torvalds defends his right to shame Linux kernel developers," http://www.webcitation.org/6O2zErgzE, accessed: 01/10/2013.
[27]
B. L. Fredrickson, "The role of positive emotions in positive psychology: The broaden-and-build theory of positive emotions." American psychologist, vol. 56, no. 3, p. 218, 2001.
[28]
M. De Choudhury and S. Counts, "Understanding affect in the workplace via social media," in Proceedings of the 2013 Conference on Computer Supported Cooperative Work, ser. CSCW '13. New York, NY, USA: ACM, 2013, pp. 303--316.
[29]
R. Feldman, "Techniques and applications for sentiment analysis," Commun. ACM, vol. 56, no. 4, pp. 82--89, Apr. 2013.
[30]
V. Sehgal and C. Song, "Sops: Stock prediction using web sentiment," in Proceedings of the Seventh IEEE International Conference on Data Mining Workshops, ser. ICDMW '07. Washington, DC, USA: IEEE Computer Society, 2007, pp. 21--26.
[31]
G. Canfora, L. Cerulo, M. Cimitile, and M. D. Penta, "How changes affect software entropy: an empirical study," Empirical Software Engineering, vol. 19, no. 1, pp. 1--38, 2014.
[32]
P. C. Rigby and A. E. Hassan, "What can oss mailing lists tell us? a preliminary psychometric text analysis of the apache developer mailing list," in Proceedings of the Fourth International Workshop on Mining Software Repositories, ser. MSR '07. Washington, DC, USA: IEEE Computer Society, 2007, pp. 23--.
[33]
B. Bazelli, A. Hindle, and E. Stroulia, "On the personality traits of stackoverflow users." in ICSM. IEEE, 2013, pp. 460--463.
[34]
Y. Zhang and D. Hou, "Extracting problematic api features from forum discussions," in Proc. ICPC. IEEE, 2013, pp. 142--151.

Cited By

View all
  • (2019)Sentiment analysis of open source software community mailing listProceedings of the 15th International Symposium on Open Collaboration10.1145/3306446.3340824(1-5)Online publication date: 20-Aug-2019
  • (2019)Sentiment analysis of open source communitiesProceedings of the 15th International Symposium on Open Collaboration10.1145/3306446.3340816(1-5)Online publication date: 20-Aug-2019
  • (2019)Empirical analysis of affect of merged issues on GitHubProceedings of the 4th International Workshop on Emotion Awareness in Software Engineering10.1109/SEmotion.2019.00017(46-48)Online publication date: 28-May-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
CASCON '14: Proceedings of 24th Annual International Conference on Computer Science and Software Engineering
November 2014
399 pages
  • Conference Chair:
  • Joanna Ng,
  • Program Chairs:
  • Jin Li,
  • Ken Wong

Sponsors

  • IBM Canada: IBM Canada

Publisher

IBM Corp.

United States

Publication History

Published: 03 November 2014

Author Tags

  1. empirical software engineering
  2. mailing list data
  3. mining software repositories
  4. sentiment analysis

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 24 of 90 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Sentiment analysis of open source software community mailing listProceedings of the 15th International Symposium on Open Collaboration10.1145/3306446.3340824(1-5)Online publication date: 20-Aug-2019
  • (2019)Sentiment analysis of open source communitiesProceedings of the 15th International Symposium on Open Collaboration10.1145/3306446.3340816(1-5)Online publication date: 20-Aug-2019
  • (2019)Empirical analysis of affect of merged issues on GitHubProceedings of the 4th International Workshop on Emotion Awareness in Software Engineering10.1109/SEmotion.2019.00017(46-48)Online publication date: 28-May-2019
  • (2019)A longitudinal study on the maintainers' sentiment of a large scale open source ecosystemProceedings of the 4th International Workshop on Emotion Awareness in Software Engineering10.1109/SEmotion.2019.00011(17-22)Online publication date: 28-May-2019
  • (2019)Exploring word embedding techniques to improve sentiment analysis of software engineering textsProceedings of the 16th International Conference on Mining Software Repositories10.1109/MSR.2019.00020(68-78)Online publication date: 26-May-2019
  • (2019)Pattern-based mining of opinions in Q&A websitesProceedings of the 41st International Conference on Software Engineering10.1109/ICSE.2019.00066(548-559)Online publication date: 25-May-2019
  • (2018)MEMEProceedings of the 3rd International Workshop on Emotion Awareness in Software Engineering10.1145/3194932.3194941(20-24)Online publication date: 2-Jun-2018
  • (2018)Sentiment analysis for software engineeringProceedings of the 40th International Conference on Software Engineering10.1145/3180155.3180195(94-104)Online publication date: 27-May-2018
  • (2018)DEVAProceedings of the 33rd Annual ACM Symposium on Applied Computing10.1145/3167132.3167296(1536-1543)Online publication date: 9-Apr-2018
  • (2017)SentiCR: a customized sentiment analysis tool for code review interactionsProceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering10.5555/3155562.3155579(106-111)Online publication date: 30-Oct-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media