Computer Science > Computation and Language

arXiv:2407.09136 (cs)

[Submitted on 12 Jul 2024]

Title:Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors

Authors:Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan

Abstract:Large language models (LLMs) present an opportunity to scale high-quality personalized education to all. A promising approach towards this means is to build dialog tutoring models that scaffold students' problem-solving. However, even though existing LLMs perform well in solving reasoning questions, they struggle to precisely detect student's errors and tailor their feedback to these errors. Inspired by real-world teaching practice where teachers identify student errors and customize their response based on them, we focus on verifying student solutions and show how grounding to such verification improves the overall quality of tutor response generation. We collect a dataset of 1K stepwise math reasoning chains with the first error step annotated by teachers. We show empirically that finding the mistake in a student solution is challenging for current models. We propose and evaluate several verifiers for detecting these errors. Using both automatic and human evaluation we show that the student solution verifiers steer the generation model towards highly targeted responses to student errors which are more often correct with less hallucinations compared to existing baselines.

Comments:	Preprint. Nico Daheim and Jakub Macina contributed equally. Code and dataset can be found under: this https URL
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2407.09136 [cs.CL]
	(or arXiv:2407.09136v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.09136

Submission history

From: Nico Daheim [view email]
[v1] Fri, 12 Jul 2024 10:11:40 UTC (960 KB)

Computer Science > Computation and Language

Title:Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators