Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3587102.3588805acmconferencesArticle/Chapter ViewAbstractPublication PagesiticseConference Proceedingsconference-collections
research-article
Open access

Evaluating the Performance of Code Generation Models for Solving Parsons Problems With Small Prompt Variations

Published: 30 June 2023 Publication History

Abstract

The recent emergence of code generation tools powered by large language models has attracted wide attention. Models such as OpenAI Codex can take natural language problem descriptions as input and generate highly accurate source code solutions, with potentially significant implications for computing education. Given the many complexities that students face when learning to write code, they may quickly become reliant on such tools without properly understanding the underlying concepts. One popular approach for scaffolding the code writing process is to use Parsons problems, which present solution lines of code in a scrambled order. These remove the complexities of low-level syntax, and allow students to focus on algorithmic and design-level problem solving. It is unclear how well code generation models can be applied to solve Parsons problems, given the mechanics of these models and prior evidence that they underperform when problems include specific restrictions. In this paper, we explore the performance of the Codex model for solving Parsons problems over various prompt variations. Using a corpus of Parsons problems we sourced from the computing education literature, we find that Codex successfully reorders the problem blocks about half of the time, a much lower rate of success when compared to prior work on more free-form programming tasks. Regarding prompts, we find that small variations in prompting have a noticeable effect on model performance, although the effect is not as pronounced as between different problems.

References

[1]
Brett A. Becker, Paul Denny, James Finnie-Ansley, Andrew Luxton-Reilly, James Prather, et al. 2023. Programming Is Hard - Or at Least It Used to Be: Educational Opportunities and Challenges of AI Code Generation. In Proc. of the 54th ACM Technical Symposium on Computer Science Education (SIGCSE 2023). ACM, NY, NY, USA, 500--506. https://doi.org/10.1145/3545945.3569759
[2]
Nick Cheng and Brian Harrington. 2017. The Code Mangler: Evaluating Coding Ability Without Writing Any Code. In Proc. of the 2017 ACM SIGCSE Technical Symposium on Computer Science Education. ACM, NY, NY, USA, 123--128.
[3]
Arghavan Moradi Dakhel, Vahid Majdinasab, Amin Nikanjam, Foutse Khomh, Michel C. Desmarais, et al. 2022. GitHub Copilot AI Pair Programmer: Asset or Liability? https://doi.org/10.48550/arXiv.2206.15331 arxiv: cs/2206.15331
[4]
Paul Denny, Viraj Kumar, and Nasser Giacaman. 2022a. Conversing with Copilot: Exploring Prompt Engineering for Solving CS1 Problems Using Natural Language. https://doi.org/10.48550/ARXIV.2210.15157
[5]
Paul Denny, Andrew Luxton-Reilly, and Beth Simon. 2008. Evaluating a New Exam Question: Parsons Problems. In Proc. of the 4th Int. Workshop on Computing Education Research (ICER '08). ACM, NY, NY, USA, 113--124.
[6]
Paul Denny, Sami Sarsa, Arto Hellas, and Juho Leinonen. 2022b. Robosourcing Educational Resources--Leveraging Large Language Models for Learnersourcing. arXiv preprint arXiv:2211.04715 (2022).
[7]
Yuemeng Du, Andrew Luxton-Reilly, and Paul Denny. 2020. A Review of Research on Parsons Problems. In Proc. of the 22nd Australasian Computing Education Conf. (ACE'20). ACM, NY, NY, USA, 195--202. https://doi.org/10.1145/3373165.3373187
[8]
Barbara Ericson and Carl Haynes-Magyar. 2022. Adaptive Parsons Problems as Active Learning Activities During Lecture. In Proc. of the 27th ACM Conf. on on Innovation and Technology in Computer Science Education Vol. 1 (ITiCSE '22). ACM, New York, NY, USA, 290--296. https://doi.org/10.1145/3502718.3524808
[9]
Barbara J. Ericson, Paul Denny, James Prather, Rodrigo Duran, Arto Hellas, et al. 2022. Parsons Problems and Beyond: Systematic Literature Review and Empirical Study Designs. In Proc. of the 2022 Working Group Reports on Innovation and Technology in Computer Science Education. ACM, NY, NY, USA, 191--234.
[10]
Barbara J. Ericson, James D. Foley, and Jochen Rick. 2018. Evaluating the Efficiency and Effectiveness of Adaptive Parsons Problems. In Proc. of the 2018 ACM Conf. on Int. Computing Education Research (ICER '18). ACM, NY, NY, USA, 60--68.
[11]
Barbara J. Ericson, Mark J. Guzdial, and Briana B. Morrison. 2015. Analysis of Interactive Features Designed to Enhance Learning in an Ebook. In Proc. of the 11th Annual Int. Conf. on Int. Computing Education Research. ACM, 169--178.
[12]
Barbara J. Ericson, Lauren E. Margulieux, and Jochen Rick. 2017. Solving Parsons Problems versus Fixing and Writing Code. In Proc. of the 17th Koli Calling Int. Conf. on Computing Education Research (Koli Calling '17). ACM, New York, NY, USA, 20--29. https://doi.org/10.1145/3141880.3141895
[13]
Geela Venise Firmalo Fabic, Antonija Mitrovic, and Kourosh Neshatian. 2018. Adaptive Problem Selection in a Mobile Python Tutor. In Adjunct Publication of the 26th Conf. on User Modeling, Adaptation and Personalization (UMAP '18). ACM, New York, NY, USA, 269--274. https://doi.org/10.1145/3213586.3225235
[14]
James Finnie-Ansley, Paul Denny, Brett A. Becker, Andrew Luxton-Reilly, and James Prather. 2022. The Robots Are Coming: Exploring the Implications of OpenAI Codex on Introductory Programming. In Australasian Computing Education Conf. (ACE '22). ACM, Online, 10--19. https://doi.org/10.1145/3511861.3511863
[15]
James Finnie-Ansley, Paul Denny, Andrew Luxton-Reilly, Eddie Antonio Santos, James Prather, et al. 2023. My AI Wants to Know If This Will Be on the Exam: Testing OpenAI's Codex on CS2 Programming Exercises. In Proc. of the 25th Australasian Computing Education Conf. (ACE '23). ACM, NY, NY, USA, 97--104.
[16]
Kathi Fisler. 2014. The Recurring Rainfall Problem. In Proc. of the Tenth Annual Conf. on Int. Computing Education Research (ICER '14). ACM, NY, NY, USA, 35--42.
[17]
Rita Garcia, Katrina Falkner, and Rebecca Vivian. 2018. Scaffolding the Design Process Using Parsons Problems. In Proc. of the 18th Koli Calling Int. Conf. on Computing Education Research (Koli Calling '18). ACM, NY, NY, USA, Article 26, 2 pages. https://doi.org/10.1145/3279720.3279746
[18]
Kyle James Harms, Jason Chen, and Caitlin L. Kelleher. 2016. Distractors in Parsons Problems Decrease Learning Efficiency for Young Novice Programmers. In Proc. of the 2016 ACM Conf. on Int. Computing Education Research (ICER '16). ACM, New York, NY, USA, 241--250. https://doi.org/10.1145/2960310.2960314
[19]
Carl C. Haynes and Barbara J. Ericson. 2021. Problem-Solving Efficiency and Cognitive Load for Adaptive Parsons Problems vs. Writing the Equivalent Code. In Proc. of the 2021 CHI Conf. on Human Factors in Computing Systems (CHI '21). ACM, New York, NY, USA, Article 60, 15 pages. https://doi.org/10.1145/3411764.3445292
[20]
Carl Haynes-Magyar and Barbara Ericson. 2022. The Impact of Solving Adaptive Parsons Problems with Common and Uncommon Solutions. In Proc. of the 22nd Koli Calling Int. Conf. on Computing Education Research (Koli Calling '22). ACM, New York, NY, USA, Article 23, 14 pages. https://doi.org/10.1145/3564721.3564736
[21]
Juha Helminen, Petri Ihantola, Ville Karavirta, and Lauri Malmi. 2012. How Do Students Solve Parsons Programming Problems? An Analysis of Interaction Traces. In Proc. of the 9th Annual Int. Conf. on Int. Computing Education Research (ICER '12). ACM, NY, NY, USA, 119--126. https://doi.org/10.1145/2361276.2361300
[22]
Xinying Hou, Barbara Jane Ericson, and Xu Wang. 2022. Using Adaptive Parsons Problems to Scaffold Write-Code Problems. In Proc. of the 2022 ACM Conf. on Int. Computing Education Research - Volume 1 (ICER '22). ACM, NY, NY, USA, 15--26.
[23]
Petri Ihantola and Ville Karavirta. 2011. Two-Dimensional Parson's Puzzles: The Concept, Tools, and First Observations. J. of Information Technology Education: Innovations in Practice, Vol. 10 (2011), 119--132. https://doi.org/10.28945/1394
[24]
Ville Karavirta, Juha Helminen, and Petri Ihantola. 2012. A Mobile Learning Application for Parsons Problems with Automatic Feedback. In Proc. of the 12th Koli Calling Int. Conf. on Computing Education Research. ACM, 11--18.
[25]
Amruth N. Kumar. 2018. Epplets: A Tool for Solving Parsons Puzzles. In Proc. of the 49th ACM Technical Symposium on Computer Science Education (SIGCSE '18). ACM, NY, NY, USA, 527--532. https://doi.org/10.1145/3159450.3159576
[26]
Amruth N. Kumar. 2019. Helping Students Solve Parsons Puzzles Better. In Proc. of the 2019 ACM Conf. on Innovation and Technology in Computer Science Education (ITiCSE '19). ACM, New York, NY, USA, 65--70.
[27]
Juho Leinonen, Arto Hellas, Sami Sarsa, Brent Reeves, Paul Denny, et al. 2023. Using Large Language Models to Enhance Programming Error Messages. In Proc. of the 2023 ACM SIGCSE Technical Symposium on Computer Science Education.
[28]
Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, et al. 2023. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. Comput. Surveys, Vol. 55, 9 (2023), 1--35.
[29]
Stephen MacNeil, Andrew Tran, Arto Hellas, Joanne Kim, Sami Sarsa, et al. 2023. Experiences from Using Code Explanations Generated by Large Language Models in a Web Software Development E-Book. In Proc. of the 54th ACM Technical Symposium on Computer Science Education.
[30]
Stephen MacNeil, Andrew Tran, Juho Leinonen, Paul Denny, Joanne Kim, et al. 2022a. Automatically Generating CS Learning Materials with Large Language Models. arXiv preprint arXiv:2212.05113 (2022).
[31]
Stephen MacNeil, Andrew Tran, Dan Mogil, Seth Bernstein, Erin Ross, et al. 2022b. Generating Diverse Code Explanations Using the GPT-3 Large Language Model. In Proc. of the 2022 ACM Conf. on Int. Computing Education Research - Volume 2 (ICER '22). ACM, NY NY, USA, 37--39. https://doi.org/10.1145/3501709.3544280
[32]
Lauren Margulieux, Paul Denny, Kathryn Cunningham, Michael Deutsch, and Benjamin R. Shapiro. 2021. When Wrong is Right: The Instructional Power of Multiple Conceptions. In Proc. of the 17th ACM Conf. on Int. Computing Education Research (ICER 2021). ACM, NY, NY, USA, 184--197.
[33]
Briana B. Morrison, Lauren E. Margulieux, Barbara Ericson, and Mark Guzdial. 2016. Subgoals Help Students Solve Parsons Problems. In Proc. of the 47th ACM Technical Symposium on Computing Science Education (SIGCSE '16). ACM, New York, NY, USA, 42--47. https://doi.org/10.1145/2839509.2844617
[34]
Dale Parsons and Patricia Haden. 2006. Parson's Programming Puzzles: A Fun and Effective Learning Tool for First Programming Courses. In Proc. of the 8th Australasian Conf. on Computing Education - Volume 52 (ACE '06). Australian Computer Society, Inc., AUS, 157--163.
[35]
James Prather, John Homer, Paul Denny, Brett Becker, John Marsden, et al. 2022. Scaffolding Task Planning Using Abstract Parsons Problems. In Proc. of the 2022 World Conf. on Computers in Education (WCCE '22). 1--10.
[36]
Sami Sarsa, Paul Denny, Arto Hellas, and Juho Leinonen. 2022. Automatic Generation of Programming Exercises and Code Explanations Using Large Language Models. In Proc. of the 2022 ACM Conf. on Int. Computing Education Research - Volume 1 (ICER '22). ACM, NY, NY, USA, 27--43.
[37]
Otto Sepp"al"a, Petri Ihantola, Essi Isohanni, Juha Sorva, and Arto Vihavainen. 2015. Do We Know How Difficult the Rainfall Problem Is?. In Proc. of the 15th Koli Calling Conf. on Computing Education Research (Koli Calling '15). ACM, NY, NY, USA, 87--96. https://doi.org/10.1145/2828959.2828963
[38]
Priyan Vaithilingam, Tianyi Zhang, and Elena L. Glassman. 2022. Expectation vs. Experience: Evaluating the Usability of Code Generation Tools Powered by Large Language Models. In CHI Conf. on Human Factors in Computing Systems Extended Abstracts. ACM, NY NY, USA, 1--7.
[39]
Nathaniel Weinman, Armando Fox, and Marti Hearst. 2020. Exploring Challenging Variations of Parsons Problems. In Proc. of the 51st ACM Technical Symposium on Computer Science Education (SIGCSE '20). ACM, NY, NY, USA, 1349.
[40]
Nathaniel Weinman, Armando Fox, and Marti A. Hearst. 2021. Improving Instruction of Programming Patterns with Faded Parsons Problems. In Proc. of the 2021 CHI Conf. on Human Factors in Computing Systems (CHI '21). ACM, New York, NY, USA, Article 53, 4 pages. https://doi.org/10.1145/3411764.3445228
[41]
Matt Welsh. 2022. The End of Programming. Commun. ACM, Vol. 66, 1 (dec 2022), 34--35. https://doi.org/10.1145/3570220
[42]
Zihan Wu, Barbara Ericson, and Christopher Brooks. 2021. Regex Parsons: Using Horizontal Parsons Problems to Scaffold Learning Regex. In Proc. of the 21st Koli Calling Int. Conf. on Computing Education Research (Koli Calling '21). ACM, New York, NY, USA, Article 31, 3 pages. https://doi.org/10.1145/3488042.3489968
[43]
Rui Zhi, Min Chi, Tiffany Barnes, and Thomas W. Price. 2019. Evaluating the Effectiveness of Parsons Problems for Block-Based Programming. In Proc. of the 2019 ACM Conf. on Int. Computing Education Research. ACM, NY, NY, USA, 51--59.

Cited By

View all
  • (2024)Generative AI and Generative Pre-Trained Transformer ApplicationsMaking Art With Generative AI Tools10.4018/979-8-3693-1950-5.ch003(45-71)Online publication date: 19-Apr-2024
  • (2024)Cognitive Apprenticeship and Artificial Intelligence Coding AssistantsNavigating Computer Science Education in the 21st Century10.4018/979-8-3693-1066-3.ch013(261-281)Online publication date: 26-Feb-2024
  • (2024)IMF-MF: Interactive moment localization with adaptive multimodal fusion and self-attentionJournal of Intelligent & Fuzzy Systems10.3233/JIFS-233071(1-12)Online publication date: 4-Apr-2024
  • Show More Cited By

Index Terms

  1. Evaluating the Performance of Code Generation Models for Solving Parsons Problems With Small Prompt Variations

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ITiCSE 2023: Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1
      June 2023
      694 pages
      ISBN:9798400701382
      DOI:10.1145/3587102
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 June 2023

      Check for updates

      Author Tags

      1. CS1
      2. GPT-3
      3. GitHub
      4. ML
      5. academic integrity
      6. ai
      7. artificial intelligence
      8. chatgpt
      9. code generation
      10. code writing
      11. codex
      12. computer programming
      13. copilot
      14. deep learning
      15. generative ai
      16. introductory programming
      17. large language models
      18. machine learning
      19. natural language processing
      20. neural networks
      21. novice programming
      22. openAI

      Qualifiers

      • Research-article

      Conference

      ITiCSE 2023
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 552 of 1,613 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)749
      • Downloads (Last 6 weeks)61
      Reflects downloads up to 13 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Generative AI and Generative Pre-Trained Transformer ApplicationsMaking Art With Generative AI Tools10.4018/979-8-3693-1950-5.ch003(45-71)Online publication date: 19-Apr-2024
      • (2024)Cognitive Apprenticeship and Artificial Intelligence Coding AssistantsNavigating Computer Science Education in the 21st Century10.4018/979-8-3693-1066-3.ch013(261-281)Online publication date: 26-Feb-2024
      • (2024)IMF-MF: Interactive moment localization with adaptive multimodal fusion and self-attentionJournal of Intelligent & Fuzzy Systems10.3233/JIFS-233071(1-12)Online publication date: 4-Apr-2024
      • (2024)Prompting for Comprehension: Exploring the Intersection of Explain in Plain English Questions and Prompt WritingProceedings of the Eleventh ACM Conference on Learning @ Scale10.1145/3657604.3662039(39-50)Online publication date: 9-Jul-2024
      • (2024)Iterative Student Program Planning using Transformer-Driven FeedbackProceedings of the 2024 on Innovation and Technology in Computer Science Education V. 110.1145/3649217.3653607(45-51)Online publication date: 3-Jul-2024
      • (2024)Guidelines for the Evolving Role of Generative AI in Introductory Programming Based on Emerging PracticeProceedings of the 2024 on Innovation and Technology in Computer Science Education V. 110.1145/3649217.3653602(10-16)Online publication date: 3-Jul-2024
      • (2024)Code Generation Based Grading: Evaluating an Auto-grading Mechanism for "Explain-in-Plain-English" QuestionsProceedings of the 2024 on Innovation and Technology in Computer Science Education V. 110.1145/3649217.3653582(171-177)Online publication date: 3-Jul-2024
      • (2024)LLMs Still Can't Avoid Instanceof: An Investigation Into GPT-3.5, GPT-4 and Bard's Capacity to Handle Object-Oriented Programming AssignmentsProceedings of the 46th International Conference on Software Engineering: Software Engineering Education and Training10.1145/3639474.3640052(162-169)Online publication date: 14-Apr-2024
      • (2024)More Than Meets the AI: Evaluating the performance of GPT-4 on Computer Graphics assessment questionsProceedings of the 26th Australasian Computing Education Conference10.1145/3636243.3636263(182-191)Online publication date: 29-Jan-2024
      • (2024)“It's not like Jarvis, but it's pretty close!” - Examining ChatGPT's Usage among Undergraduate Students in Computer ScienceProceedings of the 26th Australasian Computing Education Conference10.1145/3636243.3636257(124-133)Online publication date: 29-Jan-2024
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media