Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3617650.3624951acmconferencesArticle/Chapter ViewAbstractPublication PagescompedConference Proceedingsconference-collections
abstract

Where's the Data? Exploring Datasets in Computing Education

Published: 05 December 2023 Publication History

Abstract

This working group aims to identify available datasets within the context of computing education research. One particular area of interest is programming education, and the data in question may include students' steps, progress, or submissions in the form of program code. To achieve this goal, the working group will review well-known data resources and repositories (e.g., DataShop, GitHub, NSF Public Access Repository, and IEEE DataPort) and recent papers published within the SIGCSE community. As a result of the review process, the working group will create an overview of available datasets and characterize them while reflecting on current data practices, challenges, and the consequences of limited access to research data. Additionally, the group intends to propose a path for the community to become more open and move toward open data practices. This proposal highlights the importance of sharing research data within the computing education research community to make it stronger and more productive.

References

[1]
Katarzyna Biernacka, Adrian Mulligan, Jonathan Zimmermann, and Rudi Rudiak. 2023. Research Data Sharing and Reuse 2020. Online. https://doi.org/10.17632/nr9n75cpv2.1 Mendeley Data, V1.
[2]
Katarzyna Biernacka and Niels Pinkwart. 2021. Opportunities for Adopting Open Research Data in Learning Analytics. In Advancing the Power of Learning Analytics and Big Data in Education. https://doi.org/10.4018/978-1-7998-7103-3.ch002
[3]
Christine L. Borgman and Irene V. Pasquetto. 2017. Why Data Sharing and Reuse Are Hard To Do. https://escholarship.org/uc/item/0jj17309
[4]
Neil Christopher Charles Brown, Michael Kölling, Davin McCall, and Ian Utting. 2014. Blackbox: A Large Scale Repository of Novice Programmers' Activity. In Proceedings of the ACM Technical Symposium on Computer Science Education (SIGCSE). https://doi.org/10.1145/2538862.2538924
[5]
John Edwards. 2022. 2021 CS1 Keystroke Data. https://doi.org/10.7910/DVN/BVOF7S
[6]
European Union. 2023. European Open Science Cloud. https://eosc-portal.eu/
[7]
Petri Ihantola, Arto Vihavainen, Alireza Ahadi, Matthew Butler, Jürgen Börstler, Stephen H. Edwards, Essi Isohanni, Ari Korhonen, Andrew Petersen, Kelly Rivers, Miguel Ángel Rubio, Judy Sheard, Bronius Skupas, Jaime Spacco, Claudia Szabo, and Daniel Toll. 2015. Educational Data Mining and Learning Analytics in Programming: Literature Review and Case Studies. In Proceedings of the 2015 ITiCSE on Working Group Reports (ITICSE-WGR '15). ACM, New York, 41--63.
[8]
Johan Jeuring, Hieke Keuning, Samiha Marwan, Dennis Bouvier, Cruz Izu, Natalie Kiesler, Teemu Lehtinen, Dominic Lohr, Andrew Peterson, and Sami Sarsa. 2022. Towards Giving Timely Formative Feedback and Hints to Novice Programmers. In Proceedings of the 2022 Working Group Reports on Innovation and Technology in Computer Science Education (Dublin, Ireland) (ITiCSE-WGR '22). ACM, New York, 95--115. https://doi.org/10.1145/3571785.3574124
[9]
Natalie Kiesler. 2022. Dataset: Recursive problem solving in the online learning environment CodingBat by computer science students. Online. https://doi.org/10.21249/DZHW:studentsteps:1.0.0 Datenerhebung: 2017. Version: 1.0.0. Datenpaketzugangsweg: Download-SUF. Hannover: FDZ-DZHW. Datenkuratierung: İkiz-Akıncı, Dilek.
[10]
Natalie Kiesler and Benedikt Pfülb. 2023. Higher Education Programming Competencies: A Novel Dataset. In Artificial Neural Networks and Machine Learning -- ICANN 2023, Lazaros Iliadis, Antonios Papaleonidas, Plamen Angelov, and Chrisina Jayne (Eds.). Springer Nature Switzerland, Cham, 319--330. https://doi.org/10.1007/978-3-031-44198-1_27
[11]
Natalie Kiesler and Daniel Schiffner. 2022. On the Lack of Recognition of Software Artifacts and IT Infrastructure in Educational Technology Research. In 20. Fachtagung Bildungstechnologien (DELFI), Peter A. Henning, Michael Striewe, and Matthias Wölfel (Eds.). Gesellschaft für Informatik e.V., Bonn, 201--206. https://doi.org/10.18420/delfi2022-034
[12]
Natalie Kiesler and Daniel Schiffner. 2023. Why We Need Open Data in Computer Science Education Research. In Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education Vol. 1 (Turku, Finland) (ITiCSE 2023). ACM, New York. https://doi.org/10.1145/3587102.3588860
[13]
Michael Kölling and Ian Utting. 2012. Building an Open, Large-Scale Research Data Repository of Initial Programming Student Behaviour. In Proceedings of the 43rd ACM Technical Symposium on Computer Science Education (SIGCSE '12). ACM, New York, 323--324. https://doi.org/10.1145/2157136.2157234
[14]
National Science Foundation. 2013. Open Data at NSF. https://www.nsf.gov/data/
[15]
Benjamin Paaßen. 2020. Python Programming Dataset. https://doi.org/10.4119/unibi/2941052 Bielefeld University.
[16]
Dirk Pilat and Yukiko Fukasaku. 2007. OECD principles and guidelines for access to research data from public funding. Data Science Journal 6 (2007), OD4--OD11.
[17]
Keith Quille and Keith Nolan. 2022. Predicting Success in CS1-An Open Access Data Project. In Proceedings of the 53rd ACM Technical Symposium on Computer Science Education V. 2. ACM, New York. https://doi.org/10.1145/3478432.3499092

Cited By

View all
  • (2024)Conferences are Exclusive by NatureProceedings of the 2024 on RESPECT Annual Conference10.1145/3653666.3656077(288-292)Online publication date: 16-May-2024
  • (2024)"Let Them Try to Figure It Out First" - Reasons Why Experts (Do Not) Provide Feedback to Novice ProgrammersProceedings of the 2024 on Innovation and Technology in Computer Science Education V. 110.1145/3649217.3653530(38-44)Online publication date: 3-Jul-2024
  • (2024)Where's the Data? Finding and Reusing Datasets in Computing EducationWorking Group Reports on 2023 ACM Conference on Global Computing Education10.1145/3598579.3689378(31-60)Online publication date: 23-Sep-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CompEd 2023: Proceedings of the ACM Conference on Global Computing Education Vol 2
December 2023
50 pages
ISBN:9798400703744
DOI:10.1145/3617650
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 December 2023

Check for updates

Author Tags

  1. Open Data
  2. Open Science
  3. computing education
  4. datasets
  5. educational data mining
  6. reusing data
  7. secondary research

Qualifiers

  • Abstract

Conference

CompEd 2023
Sponsor:

Acceptance Rates

Overall Acceptance Rate 33 of 100 submissions, 33%

Upcoming Conference

CompEd '25
ACM Global Computing Education Conference 2025
October 21 - 25, 2025
Gaborone , Botswana

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)96
  • Downloads (Last 6 weeks)6
Reflects downloads up to 28 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Conferences are Exclusive by NatureProceedings of the 2024 on RESPECT Annual Conference10.1145/3653666.3656077(288-292)Online publication date: 16-May-2024
  • (2024)"Let Them Try to Figure It Out First" - Reasons Why Experts (Do Not) Provide Feedback to Novice ProgrammersProceedings of the 2024 on Innovation and Technology in Computer Science Education V. 110.1145/3649217.3653530(38-44)Online publication date: 3-Jul-2024
  • (2024)Where's the Data? Finding and Reusing Datasets in Computing EducationWorking Group Reports on 2023 ACM Conference on Global Computing Education10.1145/3598579.3689378(31-60)Online publication date: 23-Sep-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media