The Role of Theory of Mind in Finding Predator-Prey Nash Equilibria

Tiffany Hwu,
Chase McDonald,
Simon Haxby,
Flávio Teixeira,
Israel Knight &
Albert Wang

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14993)

Included in the following conference series:

International Conference on Simulation of Adaptive Behavior

Abstract

When a predator chases its prey, a mind game ensues, requiring both predator and prey to predict what the other will do next. These elements of uncertainty and opponency also appear in analyses of real-world tasks and games. For instance, one way to define an optimal solution of a non-cooperative game is to find the Nash equilibrium, a state in which each agent in the game has optimized its strategy given the strategies of the others. The Regularized Nash Dynamics (R-NaD) algorithm guarantees that policies converge to the Nash equilibrium, and it has been used to create AIs that beat top human players in tasks with hidden information. Our research compares the performance of deep reinforcement learning agents trained with and without R-NaD in a simple hide-and-seek game, aiming to see how well the agents handle unknowns in the environment. We then apply explainable AI (XAI) techniques to the trained models to examine what information the trained policies encode about opponent strategies. We find that policies trained with R-NaD outperform policies trained with regular self-play when there is hidden information. Furthermore, R-NaD policies rely on their opponent's past positions to decide which actions to take, more so than regular self-play policies do. These findings yield insights into how animals and artificial agents operate under spatial uncertainty.
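To make the Nash equilibrium definition above concrete, here is a minimal sketch in Python. It is not the paper's environment and not the R-NaD algorithm: it assumes a hypothetical one-shot, zero-sum hide-and-seek game with three hiding spots, and the names `PREDATOR_PAYOFF` and `exploitability` are illustrative. The function measures how much each player could gain by deviating alone. A value of zero means the strategy pair is a Nash equilibrium.

```python
import numpy as np

# Hypothetical one-shot hide-and-seek (an assumption, not the paper's game):
# the prey hides in one of 3 spots and the predator searches one spot.
# Rows = predator's choice, columns = prey's choice.
# The predator receives +1 on a catch and -1 otherwise; the prey receives the negative.
N_SPOTS = 3
PREDATOR_PAYOFF = 2.0 * np.eye(N_SPOTS) - 1.0


def exploitability(predator, prey, payoff=PREDATOR_PAYOFF):
    """Sum of the gains each player could obtain by deviating unilaterally.

    Zero (up to rounding) means neither player can improve alone,
    which is the defining property of a Nash equilibrium.
    """
    predator = np.asarray(predator, dtype=float)
    prey = np.asarray(prey, dtype=float)
    value = predator @ payoff @ prey           # predator's expected payoff
    best_predator = np.max(payoff @ prey)      # predator's best-response payoff
    best_prey = np.max(-(predator @ payoff))   # prey's best-response payoff
    return (best_predator - value) + (best_prey + value)


uniform = np.full(N_SPOTS, 1.0 / N_SPOTS)
always_first = np.array([1.0, 0.0, 0.0])

print(exploitability(uniform, uniform))       # both players mix uniformly
print(exploitability(always_first, uniform))  # predator always searches spot 0
```

When both players mix uniformly, the exploitability is zero up to rounding, so neither can do better alone. A predator that always searches the same spot is exploitable: the prey gains by hiding anywhere else, and the measure rises to about 0.67.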

References

Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.R., Samek, W.: On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10(7), e0130140 (2015)
Ellis, B.J., Jordan, A.C., Grotuss, J., Csinady, A., Keenan, T., Bjorklund, D.F.: The predator-avoidance effect: an evolved constraint on emerging theory of mind. Evol. Hum. Behav. 35(3), 245–256 (2014)
Knight, I.: So you want to make AI bots? A gentle intro into reinforcement learning. GitHub (2023). https://github.com/isknight/rl_intro
Krichmar, J.L., Hwu, T., Zou, X., Hylton, T.: Advantage of prediction and mental imagery for goal-directed behaviour in agents and robots. Cogn. Comput. Syst. 1(1), 12–19 (2019)
Montavon, G., Binder, A., Lapuschkin, S., Samek, W., Müller, K.R.: Layer-wise relevance propagation: an overview. In: Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, pp. 193–209 (2019)
Perolat, J., et al.: Mastering the game of Stratego with model-free multiagent reinforcement learning. Science 378(6623), 990–996 (2022)
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)

Acknowledgments

We credit Riot Games for funding this research.

Author information

Authors and Affiliations

Riot Games, Santa Monica, CA, 90064, USA
Tiffany Hwu, Chase McDonald, Simon Haxby, Flávio Teixeira, Israel Knight & Albert Wang
Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Chase McDonald

Corresponding author

Correspondence to Tiffany Hwu.

Editor information

Editors and Affiliations

Technical University Berlin, Berlin, Berlin, Germany
Oliver Brock
University of California, Irvine, CA, USA
Jeffrey Krichmar

Ethics declarations

Disclosure of Interests

We declare that the authors have no competing interests.

About this paper

Cite this paper

Hwu, T., McDonald, C., Haxby, S., Teixeira, F., Knight, I., Wang, A. (2025). The Role of Theory of Mind in Finding Predator-Prey Nash Equilibria. In: Brock, O., Krichmar, J. (eds) From Animals to Animats 17. SAB 2024. Lecture Notes in Computer Science, vol 14993. Springer, Cham. https://doi.org/10.1007/978-3-031-71533-4_25

DOI: https://doi.org/10.1007/978-3-031-71533-4_25
Published: 07 September 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-71532-7
Online ISBN: 978-3-031-71533-4
eBook Packages: Computer Science, Computer Science (R0)
