Computer Science > Software Engineering

arXiv:2303.17780 (cs)

[Submitted on 31 Mar 2023 (v1), last revised 7 Sep 2023 (this version, v3)]

Title: AceCoder: Utilizing Existing Code to Enhance Code Generation

Authors: Jia Li, Yunfei Zhao, Yongmin Li, Ge Li, Zhi Jin


Abstract: Large Language Models (LLMs) have shown great success in code generation. An LLM takes a prompt as input and outputs code. A key question is how to construct prompts (i.e., prompting techniques). Existing prompting techniques are designed for natural language generation and achieve low accuracy in code generation.
In this paper, we propose a new prompting technique named AceCoder. Our motivation is that code generation faces two unique challenges (i.e., requirement understanding and code implementation). AceCoder contains two novel mechanisms (i.e., guided code generation and example retrieval) to address these challenges. (1) Guided code generation asks LLMs to first analyze the requirements and output an intermediate preliminary (e.g., test cases). The preliminary clarifies the requirements and tells LLMs "what to write". (2) Example retrieval selects similar programs as examples in prompts, which provide relevant content (e.g., algorithms, APIs) and teach LLMs "how to write". We apply AceCoder to three LLMs (e.g., Codex) and evaluate it on three public benchmarks using Pass@k. Results show that AceCoder significantly improves the performance of LLMs on code generation. (1) In terms of Pass@1, AceCoder outperforms the state-of-the-art baseline by up to 56.4% on MBPP, 70.7% on MBJP, and 88.4% on MBJSP. (2) AceCoder is effective for LLMs of different sizes (i.e., 6B to 13B) and for different languages (i.e., Python, Java, and JavaScript). (3) Human evaluation shows that human developers prefer programs generated by AceCoder.
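
The two mechanisms described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Example` record, the Jaccard token-overlap retriever, and the `# Requirement / # Test cases / # Code` prompt layout are hypothetical stand-ins, since the abstract does not specify the retriever or the prompt template that AceCoder uses. The sketch only shows the general shape of the idea: similar programs are retrieved as examples ("how to write"), and each example places test cases before the code so that the model is prompted to produce test cases first ("what to write").

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass
class Example:
    """One retrievable program (hypothetical record layout)."""
    requirement: str  # natural-language task description
    tests: str        # intermediate preliminary, here test cases
    code: str         # reference solution


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def retrieve(query: str, corpus: list[Example], k: int = 2) -> list[Example]:
    """Return the k examples whose requirements overlap most with the query.

    Token overlap is a simple stand-in for whatever similarity measure
    the real system uses.
    """
    q = tokenize(query)
    ranked = sorted(
        corpus,
        key=lambda ex: jaccard(q, tokenize(ex.requirement)),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query: str, examples: list[Example]) -> str:
    """Assemble a prompt: retrieved examples, then the new requirement.

    The prompt ends at the test-case slot, so the model is asked to write
    test cases before it writes code.
    """
    parts = [
        f"# Requirement:\n{ex.requirement}\n"
        f"# Test cases:\n{ex.tests}\n"
        f"# Code:\n{ex.code}\n"
        for ex in examples
    ]
    parts.append(f"# Requirement:\n{query}\n# Test cases:\n")
    return "\n".join(parts)


# Toy corpus, invented for this sketch.
corpus = [
    Example(
        "Write a function to reverse a given string.",
        "assert reverse('abc') == 'cba'",
        "def reverse(s):\n    return s[::-1]",
    ),
    Example(
        "Write a function to find the maximum of a list of numbers.",
        "assert maximum([1, 3, 2]) == 3",
        "def maximum(xs):\n    return max(xs)",
    ),
    Example(
        "Check whether a number is prime.",
        "assert is_prime(7)",
        "def is_prime(n):\n    return n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1))",
    ),
]

query = "Write a function to reverse the words in a string."
retrieved = retrieve(query, corpus, k=2)
prompt = build_prompt(query, retrieved)
print(prompt)
```

In this toy run the string-reversal example ranks first because it shares the most tokens with the query. The resulting prompt would then be sent to an LLM, which is expected to continue with test cases and then the code; that call is omitted here because it depends on the model and API in use.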

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2303.17780 [cs.SE]
	(or arXiv:2303.17780v3 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2303.17780

Submission history

From: Jia Li
[v1] Fri, 31 Mar 2023 02:57:15 UTC (403 KB)
[v2] Fri, 11 Aug 2023 08:45:12 UTC (570 KB)
[v3] Thu, 7 Sep 2023 11:29:44 UTC (570 KB)
