Computer Science > Information Retrieval

arXiv:2402.04548 (cs)

[Submitted on 7 Feb 2024]

Title: NORMY: Non-Uniform History Modeling for Open Retrieval Conversational Question Answering

Authors: Muhammad Shihab Rashid, Jannat Ara Meem, Vagelis Hristidis

Abstract: Open Retrieval Conversational Question Answering (OrConvQA) answers a question given a conversation as context and a document collection. A typical OrConvQA pipeline consists of three modules: a Retriever to retrieve relevant documents from the collection, a Reranker to rerank them given the question and the context, and a Reader to extract an answer span. The conversational turns can provide valuable context to answer the final query. State-of-the-art OrConvQA systems use the same history modeling for all three modules of the pipeline. We hypothesize that this is suboptimal. Specifically, we argue that a broader context is needed in the first modules of the pipeline so that relevant documents are not missed, while a narrower context is needed in the last modules to identify the exact answer span. We propose NORMY, the first unsupervised non-uniform history modeling pipeline, which generates the best conversational history for each module. We further propose a novel Retriever for NORMY, which employs keyphrase extraction on the conversation history and leverages passages retrieved in previous turns as additional context. We also created a new dataset for OrConvQA by expanding the doc2dial dataset. We implemented various state-of-the-art history modeling techniques and comprehensively evaluated them separately for each module of the pipeline on three datasets: OR-QUAC, our doc2dial extension, and ConvMix. Our extensive experiments show that NORMY outperforms the state-of-the-art in the individual modules and in the end-to-end system.
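The idea of giving each module a different slice of the conversation can be illustrated with a minimal sketch. This is not the NORMY method: the abstract does not specify how each module's history is selected, so the fixed window sizes below, and the names `Turn`, `HISTORY_WINDOW`, `build_history`, and `build_input`, are all hypothetical and serve only to show a broad context for the Retriever and a narrow one for the Reader.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Turn:
    question: str
    answer: str


# Hypothetical window sizes (assumptions, not taken from the paper).
# None means "use every previous turn".
HISTORY_WINDOW = {"retriever": None, "reranker": 3, "reader": 1}


def build_history(turns: List[Turn], module: str) -> List[Turn]:
    """Return the previous turns that are passed to the given module."""
    window: Optional[int] = HISTORY_WINDOW[module]
    if window is None:
        return list(turns)
    # Guard against window == 0, since turns[-0:] would return all turns.
    return turns[-window:] if window > 0 else []


def build_input(turns: List[Turn], question: str, module: str) -> str:
    """Concatenate the selected history with the current question."""
    history = build_history(turns, module)
    parts = [f"{t.question} {t.answer}" for t in history]
    return " [SEP] ".join(parts + [question])
```

With four previous turns, this sketch gives the Retriever all four, the Reranker the last three, and the Reader only the most recent one, so the context narrows as the pipeline moves toward answer extraction.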

Comments:	Accepted for publication at IEEE ICSC 2024
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2402.04548 [cs.IR]
	(or arXiv:2402.04548v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2402.04548

Submission history

From: Muhammad Shihab Rashid
[v1] Wed, 7 Feb 2024 03:05:54 UTC (317 KB)
