Computer Science > Information Retrieval

arXiv:2310.17922 (cs)

[Submitted on 27 Oct 2023 (v1), last revised 3 Apr 2024 (this version, v2)]

Title: Chain-of-Choice Hierarchical Policy Learning for Conversational Recommendation

Authors: Wei Fan, Weijia Zhang, Weiqi Wang, Yangqiu Song, Hao Liu

Abstract: Conversational Recommender Systems (CRS) elicit user preferences through multi-round interactive dialogues, ultimately arriving at precise and satisfactory recommendations. However, contemporary CRS are limited to asking binary or multi-choice questions based on a single attribute type (e.g., color) per round, which causes excessive rounds of interaction and diminishes the user's experience. To address this, we propose a more realistic and efficient conversational recommendation problem setting, called Multi-Type-Attribute Multi-round Conversational Recommendation (MTAMCR), which enables CRS to ask multi-choice questions covering multiple types of attributes in each round, thereby improving interactive efficiency. Moreover, by formulating MTAMCR as a hierarchical reinforcement learning task, we propose a Chain-of-Choice Hierarchical Policy Learning (CoCHPL) framework to enhance both the questioning efficiency and recommendation effectiveness in MTAMCR. Specifically, a long-term policy over options (i.e., ask or recommend) determines the action type, while two short-term intra-option policies sequentially generate the chain of attributes or items through multi-step reasoning and selection, optimizing the diversity and interdependence of the attributes asked about. Finally, extensive experiments on four benchmarks demonstrate the superior performance of CoCHPL over prevailing state-of-the-art methods.
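
The two-level decision structure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`choose_option`, `build_chain`, `act`), the hand-written scoring rules, the toy attribute table, and the greedy selection are all assumptions made for illustration, standing in for the learned policies of CoCHPL. The sketch only shows the control flow: a policy over options picks "ask" or "recommend", then an intra-option policy builds a chain one element at a time, with each choice conditioned on the elements already chosen.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical toy data: each attribute value belongs to one attribute type.
ATTRIBUTE_TYPE = {"red": "color", "blue": "color", "cotton": "material", "nike": "brand"}
BASE_SCORE = {"red": 0.9, "blue": 0.8, "cotton": 0.6, "nike": 0.5}


def choose_option(state: Dict, option_scores: Callable[[Dict], Dict[str, float]]) -> str:
    """Policy over options: return the highest-scoring option name."""
    scores = option_scores(state)
    return max(scores, key=scores.get)


def build_chain(
    state: Dict,
    candidates: Sequence[str],
    score: Callable[[Dict, List[str], str], float],
    max_len: int,
) -> List[str]:
    """Intra-option policy: greedily add one candidate per step.

    The score at each step sees the partial chain, so later choices
    depend on earlier ones.
    """
    chain: List[str] = []
    remaining = list(candidates)
    while remaining and len(chain) < max_len:
        best = max(remaining, key=lambda c: score(state, chain, c))
        chain.append(best)
        remaining.remove(best)
    return chain


def toy_option_scores(state: Dict) -> Dict[str, float]:
    # Assumed rule: keep asking while many candidate items remain.
    many = len(state["candidate_items"]) > 3
    return {"ask": 1.0 if many else 0.0, "recommend": 0.0 if many else 1.0}


def toy_attribute_score(state: Dict, chain: List[str], candidate: str) -> float:
    # Assumed diversity rule: penalize an attribute type already in the chain.
    used_types = {ATTRIBUTE_TYPE[a] for a in chain}
    penalty = 0.5 if ATTRIBUTE_TYPE[candidate] in used_types else 0.0
    return BASE_SCORE[candidate] - penalty


def toy_item_score(state: Dict, chain: List[str], candidate: str) -> float:
    # Assumed rule: rank items by a precomputed relevance score.
    return state["item_scores"][candidate]


def act(state: Dict) -> Tuple[str, List[str]]:
    """One conversation turn: pick the option, then build its chain."""
    option = choose_option(state, toy_option_scores)
    if option == "ask":
        return option, build_chain(state, list(BASE_SCORE), toy_attribute_score, max_len=3)
    return option, build_chain(state, state["candidate_items"], toy_item_score, max_len=2)
```

With five candidate items the toy option rule selects "ask", and the diversity penalty leads the chain to cover three different attribute types rather than two colors. With two candidate items it selects "recommend" and orders the items by their assumed relevance scores.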

Comments:	Accepted by DASFAA 2024
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2310.17922 [cs.IR]
	(or arXiv:2310.17922v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2310.17922

Submission history

From: Wei Fan
[v1] Fri, 27 Oct 2023 06:36:31 UTC (11,117 KB)
[v2] Wed, 3 Apr 2024 03:06:54 UTC (11,619 KB)