Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2786805.2786854acmconferencesArticle/Chapter ViewAbstractPublication PagesfseConference Proceedingsconference-collections
research-article

Developer onboarding in GitHub: the role of prior social links and language experience

Published: 30 August 2015 Publication History

Abstract

The team aspects of software engineering have been a subject of great interest since early work by Fred Brooks and others: how well do people work together in teams? why do people join teams? what happens if teams are distributed? Recently, the emergence of project ecosystems such as GitHub have created an entirely new, higher level of organization. GitHub supports numerous teams; they share a common technical platform (for work activities) and a common social platform (via following, commenting, etc). We explore the GitHub evidence for socialization as a precursor to joining a project, and how the technical factors of past experience and social factors of past connections to team members of a project affect productivity both initially and in the long run. We find developers preferentially join projects in GitHub where they have pre-existing relationships; furthermore, we find that the presence of past social connections combined with prior experience in languages dominant in the project leads to higher productivity both initially and cumulatively. Interestingly, we also find that stronger social connections are associated with slightly less productivity initially, but slightly more productivity in the long run.

References

[1]
A. Begel, J. Bosch, and M.-A. Storey. Social networking meets software development: Perspectives from GitHub, MSDN, Stack Exchange, and TopCoder. Software, IEEE, 30(1):52–66, 2013.
[2]
Y. Benjamini and Y. Hochberg. Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological), 57(1):289–300, 1995.
[3]
C. Bird, A. Gourley, P. Devanbu, A. Swaminathan, and G. Hsu. Open borders? Immigration in open source projects. In MSR, pages 6–6. IEEE, 2007.
[4]
M. Cataldo, J. D. Herbsleb, and K. M. Carley. Socio-technical congruence: a framework for assessing the impact of technical and work dependencies on software development productivity. In ESEM, pages 2–11. ACM, 2008.
[5]
K. Crowston and J. Howison. The social structure of free and open source software development. First Monday, 10(2), 2005.
[6]
K. Crowston, Q. Li, K. Wei, U. Y. Eseryel, and J. Howison. Self-organization of teams for free/libre open source software development. Information and Software Technology, 49(6):564–575, 2007.
[7]
L. Dabbish, C. Stuart, J. Tsay, and J. Herbsleb. Social coding in GitHub: transparency and collaboration in an open software repository. In CSCW, pages 1277–1286. ACM, 2012.
[8]
N. Ducheneaut. Socialization in an open source software community: A socio-technical analysis. Computer Supported Cooperative Work, 14(4):323–368, 2005.
[9]
S. Faraj and L. Sproull. Coordinating expertise in software development teams. Management Science, 46(12):1554–1568, 2000.
[10]
M. Gharehyazie, D. Posnett, B. Vasilescu, and V. Filkov. Developer initiation and social interactions in OSS: A case study of the Apache Software Foundation. Empir. Softw. Eng., pages 1–36, 2014.
[11]
G. Gousios and D. Spinellis. GHTorrent: Github’s data from a firehose. In MSR, pages 12–21. IEEE, 2012.
[12]
R. Guimera, B. Uzzi, J. Spiro, and L. A. N. Amaral. Team assembly mechanisms determine collaboration network structure and team performance. Science, 308(5722):697–702, 2005.
[13]
J. Hahn, J. Y. Moon, and C. Zhang. Emergence of new project teams from open source software developer networks: Impact of prior collaboration ties. Information Systems Research, 19(3):369–391, 2008.
[14]
D. A. Harrison, K. H. Price, and M. P. Bell. Beyond relational demography: Time and the effects of surface-and deep-level diversity on work group cohesion. Academy of Management Journal, 41(1):96–107, 1998.
[15]
A. Hindle, D. M. German, and R. Holt. What do large commits tell us?: A taxonomical study of large commits. In MSR, pages 99–108. ACM, 2008.
[16]
M. Hlavac. stargazer: LaTeX/HTML code and ASCII text for well-formatted regression and summary statistics tables. Harvard University, Cambridge, USA, 2014. R package version 5.1.
[17]
C. Jensen and W. Scacchi. Role migration and advancement processes in OSSD projects: A comparative case study. In ICSE, pages 364–374. IEEE, 2007.
[18]
E. Kalliamvakou, G. Gousios, K. Blincoe, L. Singer, D. M. German, and D. Damian. The promises and perils of mining GitHub. In MSR, MSR 2014, pages 92–101. ACM, 2014.
[19]
K. Lakhani and R. Wolf. Why hackers do what they do: Understanding motivation and effort in free/open source software projects. In Perspectives on Free and Open Source Software. MIT Press, Cambridge, 2005.
[20]
K. R. Lakhani and E. Von Hippel. How open source software works: “free” user-to-user assistance. Research Policy, 32(6):923–943, 2003.
[21]
J. M. Levine and R. L. Moreland. Progress in small group research. Annual Rev. Psych., 41(1):585–634, 1990.
[22]
A. Lima, L. Rossi, and M. Musolesi. Coding together at scale: Github as a collaborative social network. CoRR, abs/1407.2535, 2014.
[23]
J. Marlow, L. Dabbish, and J. Herbsleb. Impression formation in online peer production: activity traces and personal profiles in GitHub. In CSCW, pages 117–128. ACM, 2013.
[24]
T. Mens, M. Claes, and P. Grosjean. ECOS: Ecological studies of open source software ecosystems. In CSMR-WCRE, pages 403–406. IEEE, 2014.
[25]
T. Mens, M. Claes, P. Grosjean, and A. Serebrenik. Studying evolving software ecosystems based on ecological models. In Evolving Software Systems, pages 297–326. Springer, 2014.
[26]
R. L. Moreland and L. Thompson. Transactive memory: Learning who knows what in work groups and organizations. Small Groups: Key Readings, pages 327–346, 2006.
[27]
D. A. Owens, E. A. Mannix, and M. A. Neale. Strategic formation of groups: Issues in task performance and team member selection. In Composition, pages 149–165. Elsevier, 1998.
[28]
J. Rice. Mathematical Statistics and Data Analysis. Duxbury Press, 2007.
[29]
I. Rivals, L. Personnaz, L. Taing, and M.-C. Potier. Enrichment or depletion of a go category within a class of genes: which test? Bioinformatics, 23(4):401–407, 2007.
[30]
G. Robles and J. M. Gonzalez-Barahona. Contributor turnover in libre software projects. In Open Source Systems, pages 273–286. Springer, 2006.
[31]
G. R. Salancik and J. Pfeffer. A social information processing approach to job attitudes and task design. Administrative Science Quarterly, pages 224–253, 1978.
[32]
I. Steinmacher, T. U. Conte, M. Gerosa, and D. Redmiles. Social barriers faced by newcomers placing their first contribution in open source software projects. In CSCW, pages 1–13, 2015.
[33]
I. Steinmacher, M. A. Gerosa, and D. Redmiles. Attracting, onboarding, and retaining newcomer developers in open source software projects. In Workshop on Global Software Development in a CSCW Perspective, 2014.
[34]
D. Terdiman. Forget LinkedIn: Companies turn to GitHub to find tech talent. http://www.cnet.com/news/forget-linkedin-companiesturn-to-github-to-find-tech-talent/, 2012.
[35]
B. Uzzi and J. Spiro. Collaboration and creativity: The small world problem. American Journal of Sociology, 111(2):447–504, 2005.
[36]
B. Vasilescu, V. Filkov, and A. Serebrenik. Perceptions of diversity on GitHub: A user survey. In CHASE, 2015.
[37]
B. Vasilescu, D. Posnett, B. Ray, M. G. J. van den Brand, A. Serebrenik, P. Devanbu, and V. Filkov. Gender and tenure diversity in GitHub teams. In CHI, pages 3789–3798. ACM, 2015.
[38]
B. Vasilescu, A. Serebrenik, and V. Filkov. A data set for social diversity studies of GitHub teams. In MSR, 2015. to appear.
[39]
J. M. Ver Hoef and P. L. Boveng. Quasi-poisson vs. negative binomial regression: how should we model overdispersed count data? Ecology, 88(11):2766–2772, 2007.
[40]
G. Von Krogh, S. Spaeth, and K. R. Lakhani. Community, joining, and specialization in open source software innovation: A case study. Research Policy, 32(7):1217–1241, 2003.
[41]
M. Weiss and G. Moroiu. Ecology and dynamics of open source communities, pages 48–67. IGI Global, 2007.
[42]
M. Weiss, G. Moroiu, and P. Zhao. Evolution of open source communities. In Open Source Systems, pages 21–32. Springer, 2006.
[43]
Q. Xuan and V. Filkov. Building it together: Synchronous development in OSS. In ICSE, pages 222–233. ACM, 2014.
[44]
Y. Ye and K. Kishida. Toward an understanding of the motivation of open source software developers. In ICSE, pages 419–429. IEEE, 2003.
[45]
M. Zhou and A. Mockus. Does the initial environment impact the future of developers? In ICSE, pages 271–280. ACM, 2011.
[46]
M. Zhou and A. Mockus. What make long term contributors: Willingness and opportunity in OSS community. In ICSE, pages 518–528. IEEE, 2012.

Cited By

View all
  • (2024)Demand-Side Effects of Open Innovation: The Case of Cryptocurrency ForkingSSRN Electronic Journal10.2139/ssrn.4837150Online publication date: 2024
  • (2024)The Heterogeneous Productivity Effects of Generative AISSRN Electronic Journal10.2139/ssrn.4745624Online publication date: 2024
  • (2024)On the Way to SBOMs: Investigating Design Issues and Solutions in PracticeACM Transactions on Software Engineering and Methodology10.1145/365444233:6(1-25)Online publication date: 27-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ESEC/FSE 2015: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering
August 2015
1068 pages
ISBN:9781450336758
DOI:10.1145/2786805
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 August 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. GitHub
  2. onboarding
  3. productivity
  4. social aspects

Qualifiers

  • Research-article

Conference

ESEC/FSE'15
Sponsor:

Acceptance Rates

Overall Acceptance Rate 112 of 543 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)54
  • Downloads (Last 6 weeks)6
Reflects downloads up to 18 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Demand-Side Effects of Open Innovation: The Case of Cryptocurrency ForkingSSRN Electronic Journal10.2139/ssrn.4837150Online publication date: 2024
  • (2024)The Heterogeneous Productivity Effects of Generative AISSRN Electronic Journal10.2139/ssrn.4745624Online publication date: 2024
  • (2024)On the Way to SBOMs: Investigating Design Issues and Solutions in PracticeACM Transactions on Software Engineering and Methodology10.1145/365444233:6(1-25)Online publication date: 27-Jun-2024
  • (2024)How Are Paid and Volunteer Open Source Developers Different? A Study of the Rust ProjectProceedings of the IEEE/ACM 46th International Conference on Software Engineering10.1145/3597503.3639197(1-13)Online publication date: 20-May-2024
  • (2024)Bringing Open Source Communication and Development Together: A Cross-Platform Study on Gitter and GitHubIEEE Transactions on Software Engineering10.1109/TSE.2024.341029250:11(2807-2826)Online publication date: Nov-2024
  • (2024)Understanding Newcomers’ Onboarding Process in Deep Learning ProjectsIEEE Transactions on Software Engineering10.1109/TSE.2024.335329750:3(443-460)Online publication date: Mar-2024
  • (2024)Towards Identifying Code Proficiency Through the Analysis of Python Textbooks2024 IEEE International Conference on Software Maintenance and Evolution (ICSME)10.1109/ICSME58944.2024.00035(288-299)Online publication date: 6-Oct-2024
  • (2024)Requirement-Driven Developer Recommendation Framework Based on Github Developer Social NetworkIntelligent Computing Systems and Applications10.1007/978-981-97-5412-0_17(231-246)Online publication date: 20-Sep-2024
  • (2023)Risk Assessment of Using Open Source Projects: Analysis of the Existing ApproachesAutomatic Control and Computer Sciences10.3103/S014641162308005957:8(938-946)Online publication date: 1-Dec-2023
  • (2023)Matching Skills, Past Collaboration, and Limited Competition: Modeling When Open-Source Projects Attract ContributorsProceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3611643.3616282(42-54)Online publication date: 30-Nov-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media