Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning

Published: 01 May 2022 Publication History

Abstract

In this paper we study the multi-agent pursuit–evasion problem, and present an extension of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) deep reinforcement learning algorithm. Previous pursuit–evasion advancements with MADDPG have focused on training capture strategies dependent on the restriction of evader movement with environmental features. We demonstrate a method to train pursuer agents to collaboratively surround and encircle an evader for reliable capture without a strategy rooted in environment entrapment (i.e. cornering). Our method utilizes a novel two-stage, variable-aggression, continuous reward function based on geometrical inscribed circles (incircles), along with a corresponding observation space, with agents operating in an entrapment-disadvantaged environment. Our results show reliable capture of an intelligent, superior evader by three trained pursuers in open space with our encircling strategy. A key novelty of our work is demonstrating the ability to transition behaviors learned using deep reinforcement learning from a simulated robotic system with imperfect world assumptions to a real-world robotic agents.

References

[1]
Andreen D, Jenning P, Napp N, Petersen K (2016) Emergent structures assembled by large swarms of simple robots. In: Posthuman frontiers
[2]
Awheda MD and Schwartz HM A decentralized fuzzy learning algorithm for pursuit–evasion differential games with superior evaders J Intell Robot Syst 2016 83 35-53
[3]
DeMarco K, Squires E, Day M, Pippin C (2018) Simulating collaborative robots in a massive multi-agent game environment (SCRIMMAGE). In: International symposium on distributed autonomous robotic systems
[4]
Guadarrama S, Korattikara A, Ramirez O, Castro P, Holly E, Fishman S, Wang K, Gonina E, Wu N, Kokiopoulou E, Sbaiz L, Smith J, Bartók G, Berent J, Harris C, Vanhoucke V, Brevdo E (2018) TF-Agents: a library for reinforcement learning in tensorflow . https://github.com/tensorflow/agents
[5]
Hüttenrauch M, Sosic A, Neumann G (2018) Deep reinforcement learning for swarm systems. CoRR
[6]
Lillicrap T.P, Hunt J.J, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D (2016) Continuous control with deep reinforcement learning. CoRR
[7]
Lowe R, Wu Y, Tamar A, Harb J, Abbeel P, Mordatch I (2017) Multi-agent actor-critic for mixed cooperative-competitive environments. In: Proceedings of the 31st international conference on neural information processing systems, NIPS’17, p 6382–6393. Curran Associates Inc., Red Hook, NY, USA
[8]
Mao H, Zhang Z, Xiao Z, Gong Z (2018) Modelling the dynamic joint policy of teammates with attention multi-agent DDPG. CoRR
[9]
Mnih V, Kavukcuoglu K, Silver D, Rusu A, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland A, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, and Hassabis D Human-level control through deep reinforcement learning Nature 2015 518 529-533
[10]
Rycroft C (2009) Voro++: a three-dimensional voronoi cell library in c++. Chaos Interdiscip J Nonlinear Sci
[11]
Sheikh H.U, Bölöni L (2019) Designing a multi-objective reward function for creating teams of robotic bodyguards using deep reinforcement learning. ArXiv
[12]
Silver D, Lever G, Heess N, Degris T, Wierstra D, Riedmiller M (2014) Deterministic policy gradient algorithms. In: ICML
[13]
Singh G, Lofaro D, Sofge D (2020) Pursuit-evasion with decentralized robotic swarm in continuous state space and action space via deep reinforcement learning. In: Proceedings of the 12th international conference on agents and artificial intelligence, vol 1, ICAART, p 226–233. INSTICC, SciTePress
[14]
Wang J, Olson E (2016) Apriltag 2: efficient and robust fiducial detection. In: 2016 IEEE/RSJ International conference on intelligent robots and systems (IROS)
[15]
Wang X, Cruz J, Chen G, Pham K, Blasch E (2007) Formation control in multi-player pursuit evasion game with superior evaders. In: Proceedings of SPIE—The International Society for Optical Engineering
[16]
Weintraub I.E, Pachter M, Garcia E (2020) An introduction to pursuit-evasion differential games. In: 2020 American Control Conference (ACC), pp 1049–1066
[17]
Wu C, Lofaro D, Sofge D (2021) A Learned Encircling Strategy for Robot Swarm Pursuit-Evasion Against a Superior Evader. In: The 4th International Symposium on Swarm Behavior and Bio-Inspired Robotics
[18]
Wu C, Lofaro D, Sofge D (2021) A learned encircling strategy for robot swarm pursuit-evasion against a superior evader. In: The 15th International Symposium on Distributed Autonomous Robotic Systems (DARS)

Index Terms

  1. Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Artificial Life and Robotics
      Artificial Life and Robotics  Volume 27, Issue 2
      May 2022
      247 pages

      Publisher

      Springer-Verlag

      Berlin, Heidelberg

      Publication History

      Published: 01 May 2022
      Accepted: 04 January 2022
      Received: 30 August 2021

      Author Tags

      1. Reinforcement Learning
      2. Swarm robotics
      3. MADDPG
      4. Hardware

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 0
        Total Downloads
      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 09 Jan 2025

      Other Metrics

      Citations

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media