Computer Science > Information Retrieval

arXiv:2406.16332 (cs)

[Submitted on 24 Jun 2024 (v1), last revised 25 Sep 2024 (this version, v2)]

Title:DemoRank: Selecting Effective Demonstrations for Large Language Models in Ranking Task

Authors:Wenhan Liu, Yutao Zhu, Zhicheng Dou

Abstract:Recently, there has been increasing interest in applying large language models (LLMs) as zero-shot passage rankers. However, few studies have explored how to select appropriate in-context demonstrations for the passage ranking task, which is the focus of this paper. Previous studies mainly use LLM's feedback to train a retriever for demonstration selection. These studies apply the LLM to score each demonstration independently, which ignores the dependencies between demonstrations (especially important in ranking task), leading to inferior performance of top-$k$ retrieved demonstrations. To mitigate this issue, we introduce a demonstration reranker to rerank the retrieved demonstrations so that top-$k$ ranked ones are more suitable for ICL. However, generating training data for such reranker is quite challenging. On the one hand, different from demonstration retriever, the training samples of reranker need to incorporate demonstration dependencies. On the other hand, obtaining the gold ranking from the retrieved demonstrations is an NP-hard problem, which is hard to implement. To overcome these challenges, we propose a method to approximate the optimal demonstration list iteratively and utilize LLM to score demonstration lists of varying lengths. By doing so, the search space is greatly reduced and demonstration dependencies are considered. Based on these scored demonstration lists, we further design a list-pairwise training approach which compares a pair of lists that only differ in the last demonstration, to teach the reranker how to select the next demonstration given a previous sequence. In this paper, we propose a demonstration selection framework DemoRank for ranking task and conduct extensive experiments to prove its strong ability.

Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2406.16332 [cs.IR]
	(or arXiv:2406.16332v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2406.16332

Submission history

From: Wenhan Liu [view email]
[v1] Mon, 24 Jun 2024 06:10:13 UTC (1,473 KB)
[v2] Wed, 25 Sep 2024 09:36:49 UTC (2,144 KB)

Computer Science > Information Retrieval

Title:DemoRank: Selecting Effective Demonstrations for Large Language Models in Ranking Task

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:DemoRank: Selecting Effective Demonstrations for Large Language Models in Ranking Task

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators