Abstract
Large open-source projects such as the Linux kernel provide a unique opportunity to analyze many of the socio-technical processes of open-source software development. Understanding how cognitive workload affects the quality of code and productivity of work in such environments can help better protect open-source projects from potential vulnerabilities and better utilize limited developer resources.
In this paper, we present two agent-based simulation models of developer interactions on the Linux Kernel Mailing List (LKML). We also develop several non-simulation machine learning (ML) models predicting patch reversal, to compare with our agent-based simulation models. In our experiments, simulation models perform slightly better than ML models at predicting the expected number and proportion of reverted patches, and considerably better in matching the distribution of these values. Results are further improved using an explicit process model within the simulation, modeling the patch view process and associated cognitive load on LKML reviewers when new code changes are introduced by developers. We find that the process model can capture the repeated, structured multi-agent activities within a socio-technical community.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Linux kernel git repository. https://git.kernel.org/
Linux kernel mailing list archive. https://lkml.org/
Dash agent-based modeling framework. https://github.com/isi-usc-edu/dash/
List of linux kernel maintainers and how to submit kernel changes. https://www.kernel.org/doc/linux/MAINTAINERS
Blythe, J., et al.: Massive multi-agent data-driven simulations of the GitHub ecosystem. In: Demazeau, Y., Matson, E., Corchado, J.M., De la Prieta, F. (eds.) PAAMS 2019. LNCS (LNAI), vol. 11523, pp. 3–15. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-24209-1_1
Blythe, J., Tregubov, A.: FARM: architecture for distributed agent-based social simulations. In: Lin, D., Ishida, T., Zambonelli, F., Noda, I. (eds.) MMAS 2018. LNCS (LNAI), vol. 11422, pp. 96–107. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20937-7_7
van Dam, K.H., Nikolic, I., Kukszo, Z.: Agent-Based Modeling of Socio-Technical Systems. Springer, Dordrecht (2013). https://doi.org/10.1007/978-94-007-4933-7
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding (2018). https://doi.org/10.48550/ARXIV.1810.04805. https://arxiv.org/abs/1810.04805
Jiang, Y., Adams, B., German, D.M.: Will my patch make it? And how fast? Case study on the Linux kernel. In: Working Conference on Mining Software Repositories (MSR) (2013)
Kovanis, M., Porcher, R., Ravaud, P., Trinquart, L.: Complex systems approach to scientific publication and peer-review system: development of an agent-based model calibrated with empirical journal data. Scientometrics 106(2), 695–715 (2016)
Meyer, A.N., Fritz, T., Murphy, G.C., Zimmermann, T.: Software developers’ perceptions of productivity. In: Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, pp. 19–29 (2014)
Spasic, B., Onggo, B.S.S.: Agent-based simulation of the software development process: a case study at AVL. In: Winter Simulation Conference (2012)
Tregubov, A., Boehm, B., Rodchenko, N., Lane, J.A.: Impact of task switching and work interruptions on software development processes. In: Proceedings of the 2017 International Conference on Software and System Process, pp. 134–138 (2017)
Weinberg, G.M.: Quality Software Management (Vol. 1) Systems Thinking. Dorset House Publishing Co., Inc., New York (1992)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Tregubov, A., Abramson, J., Hauser, C., Hussain, A., Blythe, J. (2024). Modeling Cognitive Workload in Open-Source Communities via Simulation. In: Nardin, L.G., Mehryar, S. (eds) Multi-Agent-Based Simulation XXIV. MABS 2023. Lecture Notes in Computer Science(), vol 14558. Springer, Cham. https://doi.org/10.1007/978-3-031-61034-9_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-61034-9_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-61033-2
Online ISBN: 978-3-031-61034-9
eBook Packages: Computer ScienceComputer Science (R0)