Computer Science > Computation and Language

arXiv:2407.10944 (cs)

[Submitted on 15 Jul 2024]

Title: Learning from Naturally Occurring Feedback

Authors: Shachar Don-Yehiya, Leshem Choshen, Omri Abend

Abstract: Human feedback data is a critical component in developing language models. However, collecting this feedback is costly and ultimately not scalable. We propose a scalable method for extracting feedback that users naturally include when interacting with chat models, and leveraging it for model training. We are further motivated by previous work that showed there are also qualitative advantages to using naturalistic (rather than auto-generated) feedback, such as fewer hallucinations and biases. We manually annotated conversation data to confirm the presence of naturally occurring feedback in a standard corpus, finding that as much as 30% of the chats include explicit feedback. We apply our method to over 1M conversations to obtain hundreds of thousands of feedback samples. Training with the extracted feedback shows significant performance improvements over baseline models, demonstrating the efficacy of our approach in enhancing model alignment to human preferences.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2407.10944 [cs.CL]
	(or arXiv:2407.10944v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.10944

Submission history

From: Shachar Don-Yehiya
[v1] Mon, 15 Jul 2024 17:41:34 UTC (827 KB)
