Computer Science > Machine Learning

arXiv:2210.11692 (cs)

[Submitted on 21 Oct 2022 (v1), last revised 12 Jan 2023 (this version, v2)]

Title: Competing Bandits in Time Varying Matching Markets

Authors: Deepan Muthirayan, Chinmay Maheshwari, Pramod P. Khargonekar, Shankar Sastry

Abstract: We study the problem of online learning in two-sided non-stationary matching markets, where the objective is to converge to a stable match. In particular, we consider the setting where one side of the market, the arms, has a fixed, known set of preferences over the other side, the players. While this problem has been studied when the players have fixed but unknown preferences, in this work we study how to learn when the players' preferences are time-varying and unknown. Our contribution is a methodology that can handle any type of preference structure and variation scenario. We show that, with the proposed algorithm, each player receives a uniform sub-linear regret of $\widetilde{\mathcal{O}}(L_T^{1/2} T^{1/2})$, where $L_T$ is the number of changes in the underlying preferences of the agents. Therefore, we show that the optimal rates for single-agent learning can be achieved in spite of the competition, up to a constant factor. We also discuss extensions of this algorithm to the case where the number of changes need not be known a priori.
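To make the notion of a stable match concrete, the following is a minimal Python sketch of player-proposing deferred acceptance (Gale-Shapley) together with a stability check. It is not the algorithm proposed in the paper: it assumes both sides' preferences are fully known and fixed, whereas the paper addresses players whose preferences are unknown and time-varying. All names (`deferred_acceptance`, `is_stable`, and the toy preference lists) are hypothetical and chosen only for illustration.

```python
def deferred_acceptance(player_prefs, arm_prefs):
    """Player-proposing deferred acceptance; returns a dict player -> arm.

    Illustrative only: assumes complete, known, fixed preference lists.
    """
    # rank[a][p] = position of player p in arm a's list (lower is better)
    rank = {a: {p: i for i, p in enumerate(prefs)}
            for a, prefs in arm_prefs.items()}
    next_choice = {p: 0 for p in player_prefs}  # next arm each player will try
    held = {}                                   # arm -> player it currently holds
    free = list(player_prefs)
    while free:
        p = free.pop()
        if next_choice[p] >= len(player_prefs[p]):
            continue  # p has exhausted its list and stays unmatched
        a = player_prefs[p][next_choice[p]]
        next_choice[p] += 1
        q = held.get(a)
        if q is None:
            held[a] = p                 # arm was free, tentatively accept p
        elif rank[a][p] < rank[a][q]:
            held[a] = p                 # arm prefers p, releases q
            free.append(q)
        else:
            free.append(p)              # arm rejects p
    return {p: a for a, p in held.items()}


def is_stable(matching, player_prefs, arm_prefs):
    """Return True if no player-arm pair would both rather be matched together."""
    arm_to_player = {a: p for p, a in matching.items()}
    for p, prefs in player_prefs.items():
        for a in prefs:
            if matching.get(p) == a:
                break  # arms below p's current match cannot form a blocking pair
            q = arm_to_player.get(a)
            if q is None or arm_prefs[a].index(p) < arm_prefs[a].index(q):
                return False  # (p, a) is a blocking pair
    return True


# Hypothetical toy market with three players and three arms.
player_prefs = {"p1": ["a1", "a2", "a3"],
                "p2": ["a1", "a3", "a2"],
                "p3": ["a2", "a1", "a3"]}
arm_prefs = {"a1": ["p2", "p1", "p3"],
             "a2": ["p1", "p3", "p2"],
             "a3": ["p3", "p2", "p1"]}

matching = deferred_acceptance(player_prefs, arm_prefs)
stable = is_stable(matching, player_prefs, arm_prefs)
```

In the bandit setting described in the abstract, the players would not have access to `player_prefs` and would have to estimate it from noisy rewards while it changes over time; the sketch only shows the target that learning is meant to converge to.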

Subjects:	Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
Cite as:	arXiv:2210.11692 [cs.LG]
	(or arXiv:2210.11692v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.11692

Submission history

From: Deepan Muthirayan
[v1] Fri, 21 Oct 2022 02:36:57 UTC (34 KB)
[v2] Thu, 12 Jan 2023 20:15:56 UTC (32 KB)
