Computer Science > Machine Learning

arXiv:2311.02775 (cs)

[Submitted on 5 Nov 2023 (v1), last revised 18 Dec 2023 (this version, v3)]

Title:AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs

Authors:Yann Hicke, Anmol Agarwal, Qianou Ma, Paul Denny

Abstract:Responding to the thousands of student questions on online QA platforms each semester has a considerable human cost, particularly in computing courses with rapidly growing enrollments. To address the challenges of scalable and intelligent question-answering (QA), we introduce an innovative solution that leverages open-source Large Language Models (LLMs) from the LLaMA-2 family to ensure data privacy. Our approach combines augmentation techniques such as retrieval augmented generation (RAG), supervised fine-tuning (SFT), and learning from human preferences data using Direct Preference Optimization (DPO). Through extensive experimentation on a Piazza dataset from an introductory CS course, comprising 10,000 QA pairs and 1,500 pairs of preference data, we demonstrate a significant 30% improvement in the quality of answers, with RAG being a particularly impactful addition. Our contributions include the development of a novel architecture for educational QA, extensive evaluations of LLM performance utilizing both human assessments and LLM-based metrics, and insights into the challenges and future directions of educational data processing. This work paves the way for the development of AI-TA, an intelligent QA assistant customizable for courses with an online QA platform

Comments:	Updates for camera-ready submission
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2311.02775 [cs.LG]
	(or arXiv:2311.02775v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.02775
Journal reference:	NeurIPS Workshop on Generative AI for Education (GAIED), 2023

Submission history

From: Yann Hicke [view email]
[v1] Sun, 5 Nov 2023 21:43:02 UTC (791 KB)
[v2] Mon, 13 Nov 2023 16:03:15 UTC (862 KB)
[v3] Mon, 18 Dec 2023 23:23:06 UTC (868 KB)

Computer Science > Machine Learning

Title:AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators