Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/3635637.3663161acmconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
extended-abstract

Efficient Collaboration with Unknown Agents: Ignoring Similar Agents without Checking Similarity

Published: 06 May 2024 Publication History

Abstract

Ad hoc teamwork (AHT) is concerned with developing an AI agent who learns to collaborate with different previously unseen partners. We consider a setting where the AI agent is provided with a hypothesis set of partners' policies. Several online algorithms that take the hypothesis set as input can be applied to solve the AHT problem. One way to speed up these online learning algorithms is to eliminate the redundant policies, i.e., partner models sharing the same collaborating policy, from the hypothesis set. Nevertheless, we show whether this elimination should be applied depends on the learning algorithm used by the AI agent. Specifically, we identify a property of a learning algorithm: redundancy-aware. When the learning algorithm is redundancy-aware, redundancy elimination is unnecessary. In other words, redundancy-aware algorithms can ignore similar agents in the hypothesis set. We demonstrate through an example that an online algorithm with redundancy-aware property exists when the hypothesis set contains the true partner policy. We test our approach on a team Markov game of two players. Comparative numerical analyses reveal that the redundancy-aware algorithm outperforms other standard no-regret learning algorithms including upper confidence bound (UCB), Q-learning with UCB exploration, and the optimistic posterior sampling algorithm when the set of partner policies contains many redundant policies.

References

[1]
Stefano V. Albrecht and Peter Stone. 2018. Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems. Artificial Intelligence, Vol. 258 (May 2018), 66--95. https://doi.org/10.1016/j.artint.2018.01.002
[2]
Kefan Dong, Yuanhao Wang, Xiaoyu Chen, and Liwei Wang. 2019. Q-Learning with UCB Exploration Is Sample Efficient for Infinite-Horizon MDP. arxiv: 1901.09311 [cs, stat]
[3]
Chi Jin, Zeyuan Allen-Zhu, Sebastien Bubeck, and Michael I. Jordan. 2018. Is Q-learning Provably Efficient ? https://doi.org/10.48550/arXiv.1807.03765 arxiv: 1807.03765 [cs, math, stat]
[4]
T. L Lai and Herbert Robbins. 1985. Asymptotically Efficient Adaptive Allocation Rules. Advances in Applied Mathematics, Vol. 6, 1 (March 1985), 4--22. https://doi.org/10.1016/0196-8858(85)90002-8
[5]
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, and Zhaoran Wang. 2023. One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration. https://doi.org/10.48550/arXiv.2305.18258 arxiv: 2305.18258 [cs, math, stat]
[6]
Reuth Mirsky, Ignacio Carlucho, Arrasy Rahman, Elliot Fosong, William Macke, Mohan Sridharan, Peter Stone, and Stefano V. Albrecht. 2022. A Survey of Ad Hoc Teamwork Research. In Multi-Agent Systems, Dorothea Baumeister and Jörg Rothe (Eds.). Vol. 13442. Springer International Publishing, Cham, 275--293. https://doi.org/10.1007/978-3-031-20614-6_16
[7]
Peter Stone, Gal Kaminka, Sarit Kraus, and Jeffrey Rosenschein. 2010. Ad Hoc Autonomous Agent Teams : Collaboration without Pre-Coordination. AAAI, Vol. 24, 1 (July 2010), 1504--1509. https://doi.org/10.1609/aaai.v24i1.7529
[8]
D. J. Strouse, Kevin R. McKee, Matt Botvinick, Edward Hughes, and Richard Everett. 2022. Collaborating with Humans without Human Data. https://doi.org/10.48550/arXiv.2110.08176 arxiv: 2110.08176 [cs]
[9]
Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, and Tong Zhang. 2023. GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond. arxiv: 2211.01962 [cs, math, stat]

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems
May 2024
2898 pages
ISBN:9798400704864

Sponsors

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Publication History

Published: 06 May 2024

Check for updates

Author Tags

  1. ad hoc teamwork
  2. game theory
  3. online learning

Qualifiers

  • Extended-abstract

Conference

AAMAS '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 16
    Total Downloads
  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)2
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media