Computer Science > Computation and Language

arXiv:2311.09533 (cs)

[Submitted on 16 Nov 2023 (v1), last revised 2 Apr 2024 (this version, v3)]

Title:Effective Large Language Model Adaptation for Improved Grounding and Citation Generation

Authors:Xi Ye, Ruoxi Sun, Sercan Ö. Arik, Tomas Pfister

Abstract:Large language models (LLMs) have achieved remarkable advancements in natural language understanding and generation. However, one major issue towards their widespread deployment in the real world is that they can generate "hallucinated" answers that are not factual. Towards this end, this paper focuses on improving LLMs by grounding their responses in retrieved passages and by providing citations. We propose a new framework, AGREE, Adaptation for GRounding EnhancEment, that improves the grounding from a holistic perspective. Our framework tunes LLMs to selfground the claims in their responses and provide accurate citations to retrieved documents. This tuning on top of the pre-trained LLMs requires well-grounded responses (with citations) for paired queries, for which we introduce a method that can automatically construct such data from unlabeled queries. The selfgrounding capability of tuned LLMs further grants them a test-time adaptation (TTA) capability that can actively retrieve passages to support the claims that have not been grounded, which iteratively improves the responses of LLMs. Across five datasets and two LLMs, our results show that the proposed tuningbased AGREE framework generates superior grounded responses with more accurate citations compared to prompting-based approaches and post-hoc citing-based approaches

Comments:	NAACL 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2311.09533 [cs.CL]
	(or arXiv:2311.09533v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.09533

Submission history

From: Xi Ye [view email]
[v1] Thu, 16 Nov 2023 03:22:25 UTC (74 KB)
[v2] Mon, 11 Mar 2024 05:36:36 UTC (302 KB)
[v3] Tue, 2 Apr 2024 20:04:01 UTC (302 KB)

Computer Science > Computation and Language

Title:Effective Large Language Model Adaptation for Improved Grounding and Citation Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Effective Large Language Model Adaptation for Improved Grounding and Citation Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators