Abstract
Learning-based approaches have proved effective for multi-agent path finding (MAPF). In large warehouse systems, distributed, learning-based strategies can substantially improve efficiency and scalability, but compared with traditional centralized planners they are more prone to deadlocks. Communication learning has also advanced rapidly in the multi-agent field in recent years and has been introduced into MAPF. However, current communication methods feed redundant information to the reinforcement learner and interfere with agents' decision-making. In this paper, we combine reinforcement learning with communication learning. Each agent selects its communication targets based on priority and masks off redundant communication links; a feature-interaction network built on a graph neural network then aggregates the exchanged information. We also introduce an additional deadlock detection mechanism that increases the likelihood of an agent escaping a deadlock. Experiments demonstrate that our method plans collision-free paths in different warehouse environments.
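The following is a minimal, illustrative sketch of the idea described above, not the authors' implementation: each agent keeps communication links only to a few higher-priority neighbours inside its field of view (redundant links are masked off), and neighbour features are then fused with one round of graph-style message passing. All names and parameters (FOV_RADIUS, MAX_NEIGHBOURS, build_comm_mask, aggregate, the mean-style update) are hypothetical choices made for the sketch.

```python
import numpy as np

FOV_RADIUS = 3       # assumed field-of-view radius (grid cells)
MAX_NEIGHBOURS = 2   # assumed cap on communication targets per agent


def build_comm_mask(positions, priorities):
    """Adjacency mask: agent i listens to j only if j is inside i's FOV,
    j has higher priority, and j is among i's closest MAX_NEIGHBOURS."""
    n = len(positions)
    mask = np.zeros((n, n), dtype=bool)
    for i in range(n):
        dists = np.linalg.norm(positions - positions[i], axis=1)
        candidates = [j for j in range(n)
                      if j != i
                      and dists[j] <= FOV_RADIUS
                      and priorities[j] > priorities[i]]
        # keep only the closest few links; the rest are treated as redundant
        for j in sorted(candidates, key=lambda j: dists[j])[:MAX_NEIGHBOURS]:
            mask[i, j] = True
    return mask


def aggregate(features, mask):
    """One round of mean-style message passing over the masked graph
    (a stand-in for the paper's GNN-based feature-interaction network)."""
    out = features.copy()
    for i in range(len(features)):
        nbrs = np.where(mask[i])[0]
        if len(nbrs) > 0:
            out[i] = 0.5 * features[i] + 0.5 * features[nbrs].mean(axis=0)
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    pos = rng.integers(0, 10, size=(5, 2)).astype(float)   # agent positions
    prio = rng.permutation(5)        # e.g. a distance-to-goal based priority
    feats = rng.normal(size=(5, 8))  # per-agent observation embeddings
    mask = build_comm_mask(pos, prio)
    fused = aggregate(feats, mask)
    print(mask.astype(int))
    print(fused.shape)
```

The fused features would then be fed to each agent's policy in place of its raw local embedding; the masking step is what keeps the communication graph sparse so that low-priority, redundant messages do not interfere with decision-making.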
Acknowledgment
This research was supported by the National Key R&D Program of China (2018YFB1305500), the National Natural Science Foundation (61977019, U1813206), and the Shenzhen Basic Research Program (JCYJ20180507183837726, JSGG20201103093802006).
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ye, Z., Li, Y., Guo, R., Gao, J., Fu, W. (2022). Multi-agent Pathfinding with Communication Reinforcement Learning and Deadlock Detection. In: Liu, H., et al. Intelligent Robotics and Applications. ICIRA 2022. Lecture Notes in Computer Science, vol. 13455. Springer, Cham. https://doi.org/10.1007/978-3-031-13844-7_47
DOI: https://doi.org/10.1007/978-3-031-13844-7_47
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-13843-0
Online ISBN: 978-3-031-13844-7
eBook Packages: Computer Science, Computer Science (R0)