research-article

Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning

Authors:

Donald A. Sofge,

Daniel M. LofaroAuthors Info & Claims

Artificial Life and Robotics, Volume 27, Issue 2

Pages 355 - 364

https://doi.org/10.1007/s10015-022-00761-y

Published: 01 May 2022 Publication History

Abstract

In this paper we study the multi-agent pursuit–evasion problem, and present an extension of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) deep reinforcement learning algorithm. Previous pursuit–evasion advancements with MADDPG have focused on training capture strategies dependent on the restriction of evader movement with environmental features. We demonstrate a method to train pursuer agents to collaboratively surround and encircle an evader for reliable capture without a strategy rooted in environment entrapment (i.e. cornering). Our method utilizes a novel two-stage, variable-aggression, continuous reward function based on geometrical inscribed circles (incircles), along with a corresponding observation space, with agents operating in an entrapment-disadvantaged environment. Our results show reliable capture of an intelligent, superior evader by three trained pursuers in open space with our encircling strategy. A key novelty of our work is demonstrating the ability to transition behaviors learned using deep reinforcement learning from a simulated robotic system with imperfect world assumptions to a real-world robotic agents.

References

[1]

Andreen D, Jenning P, Napp N, Petersen K (2016) Emergent structures assembled by large swarms of simple robots. In: Posthuman frontiers

[2]

Awheda MD and Schwartz HM A decentralized fuzzy learning algorithm for pursuit–evasion differential games with superior evaders J Intell Robot Syst 2016 83 35-53

[3]

DeMarco K, Squires E, Day M, Pippin C (2018) Simulating collaborative robots in a massive multi-agent game environment (SCRIMMAGE). In: International symposium on distributed autonomous robotic systems

[4]

Guadarrama S, Korattikara A, Ramirez O, Castro P, Holly E, Fishman S, Wang K, Gonina E, Wu N, Kokiopoulou E, Sbaiz L, Smith J, Bartók G, Berent J, Harris C, Vanhoucke V, Brevdo E (2018) TF-Agents: a library for reinforcement learning in tensorflow . https://github.com/tensorflow/agents

[5]

Hüttenrauch M, Sosic A, Neumann G (2018) Deep reinforcement learning for swarm systems. CoRR

[6]

Lillicrap T.P, Hunt J.J, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D (2016) Continuous control with deep reinforcement learning. CoRR

[7]

Lowe R, Wu Y, Tamar A, Harb J, Abbeel P, Mordatch I (2017) Multi-agent actor-critic for mixed cooperative-competitive environments. In: Proceedings of the 31st international conference on neural information processing systems, NIPS’17, p 6382–6393. Curran Associates Inc., Red Hook, NY, USA

[8]

Mao H, Zhang Z, Xiao Z, Gong Z (2018) Modelling the dynamic joint policy of teammates with attention multi-agent DDPG. CoRR

[9]

Mnih V, Kavukcuoglu K, Silver D, Rusu A, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland A, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, and Hassabis D Human-level control through deep reinforcement learning Nature 2015 518 529-533

[10]

Rycroft C (2009) Voro++: a three-dimensional voronoi cell library in c++. Chaos Interdiscip J Nonlinear Sci

[11]

Sheikh H.U, Bölöni L (2019) Designing a multi-objective reward function for creating teams of robotic bodyguards using deep reinforcement learning. ArXiv

[12]

Silver D, Lever G, Heess N, Degris T, Wierstra D, Riedmiller M (2014) Deterministic policy gradient algorithms. In: ICML

[13]

Singh G, Lofaro D, Sofge D (2020) Pursuit-evasion with decentralized robotic swarm in continuous state space and action space via deep reinforcement learning. In: Proceedings of the 12th international conference on agents and artificial intelligence, vol 1, ICAART, p 226–233. INSTICC, SciTePress

[14]

Wang J, Olson E (2016) Apriltag 2: efficient and robust fiducial detection. In: 2016 IEEE/RSJ International conference on intelligent robots and systems (IROS)

[15]

Wang X, Cruz J, Chen G, Pham K, Blasch E (2007) Formation control in multi-player pursuit evasion game with superior evaders. In: Proceedings of SPIE—The International Society for Optical Engineering

[16]

Weintraub I.E, Pachter M, Garcia E (2020) An introduction to pursuit-evasion differential games. In: 2020 American Control Conference (ACC), pp 1049–1066

[17]

Wu C, Lofaro D, Sofge D (2021) A Learned Encircling Strategy for Robot Swarm Pursuit-Evasion Against a Superior Evader. In: The 4th International Symposium on Swarm Behavior and Bio-Inspired Robotics

[18]

Wu C, Lofaro D, Sofge D (2021) A learned encircling strategy for robot swarm pursuit-evasion against a superior evader. In: The 15th International Symposium on Distributed Autonomous Robotic Systems (DARS)

Index Terms

Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning
1. Computing methodologies
  1. Artificial intelligence
    1. Distributed artificial intelligence
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Index terms have been assigned to the content through auto-classification.

Recommendations

Conversational Recommender System Using Deep Reinforcement Learning
RecSys '22: Proceedings of the 16th ACM Conference on Recommender Systems

Deep Reinforcement Learning (DRL) uses the best of both Reinforcement Learning and Deep Learning for solving problems which cannot be addressed by them individually. Deep Reinforcement Learning has been used widely for games, robotics etc. Limited work ...
Swarm Deep Reinforcement Learning for Robotic Manipulation
Abstract
Deep reinforcement learning scheme, which combines both deep learning and reinforcement learning, enables robots to learn from exploration and flexibly performance in a range of different operational tasks under highly dynamic and complex ...
Assured Deep Multi-Agent Reinforcement Learning for Safe Robotic Systems
Agents and Artificial Intelligence
Abstract
Using multi-agent reinforcement learning to find solutions to complex decision-making problems in shared environments has become standard practice in many scenarios. However, this is not the case in safety-critical scenarios, where the ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Artificial Life and Robotics

Artificial Life and Robotics Volume 27, Issue 2

May 2022

247 pages

ISSN:1433-5298

Issue’s Table of Contents

© This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2022.

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 May 2022

Accepted: 04 January 2022

Received: 30 August 2021

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

Figures

Tables

Media

View Issue’s Table of Contents