Computer Science > Machine Learning

arXiv:2302.07446 (cs)

[Submitted on 15 Feb 2023 (v1), last revised 31 Aug 2023 (this version, v2)]

Title:On-Demand Communication for Asynchronous Multi-Agent Bandits

Authors:Yu-Zhen Janice Chen, Lin Yang, Xuchuang Wang, Xutong Liu, Mohammad Hajiesmaili, John C.S. Lui, Don Towsley

View PDF

Abstract:This paper studies a cooperative multi-agent multi-armed stochastic bandit problem where agents operate asynchronously -- agent pull times and rates are unknown, irregular, and heterogeneous -- and face the same instance of a K-armed bandit problem. Agents can share reward information to speed up the learning process at additional communication costs. We propose ODC, an on-demand communication protocol that tailors the communication of each pair of agents based on their empirical pull times. ODC is efficient when the pull times of agents are highly heterogeneous, and its communication complexity depends on the empirical pull times of agents. ODC is a generic protocol that can be integrated into most cooperative bandit algorithms without degrading their performance. We then incorporate ODC into the natural extensions of UCB and AAE algorithms and propose two communication-efficient cooperative algorithms. Our analysis shows that both algorithms are near-optimal in regret.

Comments:	Accepted by AISTATS 2023
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:2302.07446 [cs.LG]
	(or arXiv:2302.07446v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.07446

Submission history

From: Yu-Zhen Janice Chen [view email]
[v1] Wed, 15 Feb 2023 03:32:33 UTC (2,895 KB)
[v2] Thu, 31 Aug 2023 02:28:41 UTC (3,051 KB)

Computer Science > Machine Learning

Title:On-Demand Communication for Asynchronous Multi-Agent Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On-Demand Communication for Asynchronous Multi-Agent Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators