Computer Science > Information Retrieval

arXiv:2302.11953 (cs)

[Submitted on 23 Feb 2023 (v1), last revised 21 Mar 2023 (this version, v2)]

Title: MFBE: Leveraging Multi-Field Information of FAQs for Efficient Dense Retrieval

Authors: Debopriyo Banerjee, Mausam Jain, Ashish Kulkarni

Abstract: In the domain of question-answering in NLP, the retrieval of Frequently Asked Questions (FAQs) is an important, well-researched sub-area that has been studied for many languages. Here, in response to a user query, a retrieval system typically returns the relevant FAQs from a knowledge base. The efficacy of such a system depends on its ability to establish a semantic match between the query and the FAQs in real time. The task is challenging because of the inherent lexical gap between queries and FAQs, the lack of sufficient context in FAQ titles, the scarcity of labeled data, and high retrieval latency. In this work, we propose a bi-encoder-based query-FAQ matching model that leverages multiple combinations of FAQ fields (such as question, answer, and category) during both model training and inference. Our proposed Multi-Field Bi-Encoder (MFBE) model benefits from the additional context provided by multiple FAQ fields and performs well even with minimal labeled data. We empirically support this claim through experiments on proprietary as well as open-source public datasets in both unsupervised and supervised settings. Our model achieves around 27% and 20% better top-1 accuracy for the FAQ retrieval task on internal and open datasets, respectively, over the best-performing baseline.
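
To make the idea of matching a query against multiple combinations of FAQ fields concrete, the following is a minimal sketch and not the authors' implementation. Everything in it is an assumption made for illustration: the field names, the helper names (`embed`, `field_views`, `score`, `retrieve`), the choice to take the maximum similarity over field combinations, and above all the encoder, which is a plain bag-of-words counter standing in for the trained bi-encoder that the abstract describes.

```python
import math
import re
from collections import Counter
from itertools import combinations

# Assumed field names; the abstract only mentions question, answer, and category.
FIELDS = ("question", "answer", "category")

def embed(text):
    """Toy stand-in for a neural encoder: a sparse bag-of-words vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def field_views(faq, fields=FIELDS):
    """Build one text per non-empty combination of the given fields."""
    views = {}
    for r in range(1, len(fields) + 1):
        for combo in combinations(fields, r):
            text = " ".join(faq[f] for f in combo if faq.get(f))
            if text:
                views[combo] = text
    return views

def score(query, faq, fields=FIELDS):
    """Best similarity over all field combinations (max is an assumption)."""
    q = embed(query)
    return max(
        (cosine(q, embed(text)) for text in field_views(faq, fields).values()),
        default=0.0,
    )

def retrieve(query, faqs, k=1, fields=FIELDS):
    """Return the indices of the k highest-scoring FAQs."""
    ranked = sorted(
        range(len(faqs)),
        key=lambda i: score(query, faqs[i], fields),
        reverse=True,
    )
    return ranked[:k]

# Hypothetical FAQ entries, invented for this example.
faqs = [
    {
        "question": "How do I reset my password?",
        "answer": "Open settings and choose reset password, then follow the email link.",
        "category": "Account",
    },
    {
        "question": "What are the delivery charges?",
        "answer": "Shipping is free for orders above 500.",
        "category": "Orders",
    },
]

query = "forgot login email link"
print("question only:", score(query, faqs[0], fields=("question",)))
print("all fields:   ", score(query, faqs[0]))
print("top match:    ", retrieve(query, faqs))
```

In this toy setup the query shares no words with the first FAQ's question, so a question-only match scores zero, while the answer field supplies the overlapping terms. A real bi-encoder would replace `embed` with a learned dense encoder and would normally precompute the FAQ-side embeddings offline so that only the query is encoded at request time.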

Comments:	The first two authors contributed equally to this work. 12 pages, 3 figures, 5 tables. Accepted at the 2023 Pacific-Asia Conference On Knowledge Discovery And Data Mining (PAKDD)
Subjects:	Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2302.11953 [cs.IR]
	(or arXiv:2302.11953v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2302.11953
Related DOI:	https://doi.org/10.1007/978-3-031-33380-4_9

Submission history

From: Mausam Jain
[v1] Thu, 23 Feb 2023 12:02:49 UTC (2,040 KB)
[v2] Tue, 21 Mar 2023 18:38:10 UTC (3,043 KB)
