A New Deep Reinforcement Learning Algorithm for UAV Swarm Confrontation Game

Laicai Xie⁷,
Wanpeng Ma⁸,
Liping Wang⁸ &
…
Liangjun Ke^7,9

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2017))

Included in the following conference series:

International Conference on Data Mining and Big Data

377 Accesses

Abstract

UAV swarm confrontation game is a type of intelligent game problem. Multi-agent reinforcement learning theory provides an effective solution for this game. However, when using common multi-agent deep reinforcement learning algorithms, such as the multi-agent deep deterministic policy gradient (MADDPG) algorithm, to train the strategy of UAV swarm, there are issues such as slow convergence speed and weak generalization ability on similar tasks. To address these issues, this paper combines the model-agnostic meta-learning (MAML) algorithm in few-shot learning with the original MADDPG algorithm, and proposes an improved MB-MADDPG algorithm, which is applied to the strategy optimization of a UAV swarm confrontation task. Experimental results show that compared with the original algorithm, the improved algorithm can accelerate the convergence while maintaining the training effect, and the success rate of defense after training with both algorithms exceeds 50%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Multiagent Reinforcement Learning for Swarm Confrontation Environments

Air Combat Agent Construction Based on Hybrid Self-play Deep Reinforcement Learning

Research on Autonomous Decision-Making of Multi-UAV Air Combat Based on Deep Reinforcement Learning

References

Zhou, Y., Rao, B., Wang, W.: UAV swarm intelligence: recent advances and future trends. IEEE Access 8, 183856–183878 (2020)
Article Google Scholar
Xia, Z., et al.: Multi-agent reinforcement learning aided intelligent UAV swarm for target tracking. IEEE Trans. Veh. Technol. 71(1), 931–945 (2021)
Article Google Scholar
Lowe, R., Wu, Y.I., Tamar, A., Harb, J., Pieter Abbeel, O., Mordatch, I.: Multi-agent actor-critic for mixed cooperative-competitive environments. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Google Scholar
Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: International Conference on Machine Learning, pp. 1126–1135. PMLR (2017)
Google Scholar
Tang, J., Duan, H., Lao, S.: Swarm intelligence algorithms for multiple unmanned aerial vehicles collaboration: a comprehensive review. Artif. Intell. Rev. 56(5), 4295–4327 (2023)
Article Google Scholar
Wu, H., Li, H., Xiao, R., Liu, J.: Modeling and simulation of dynamic ant colony’s labor division for task allocation of UAV swarm. Phys. A 491, 127–141 (2018)
Article MathSciNet Google Scholar
McMahon, D.C.: A neural network trained to select aircraft maneuvers during air combat: a comparison of network and rule based performance. In: 1990 IJCNN International Joint Conference on Neural Networks, pp. 107–112. IEEE (1990)
Google Scholar
Guo, J., et al.: Maneuver decision of UAV in air combat based on deterministic policy gradient. In: 2022 IEEE 17th International Conference on Control & Automation (ICCA), pp. 243–248. IEEE (2022)
Google Scholar
Gupta, J.K., Egorov, M., Kochenderfer, M.: Cooperative multi-agent control using deep reinforcement learning. In: Sukthankar, G., Rodriguez-Aguilar, J.A. (eds.) AAMAS 2017. LNCS (LNAI), vol. 10642, pp. 66–83. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-71682-4_5
Chapter Google Scholar
Sukhbaatar, S., Fergus, R., et al.: Learning multiagent communication with backpropagation. In: Advances in Neural Information Processing Systems, vol. 29 (2016)
Google Scholar
Foerster, J., Farquhar, G., Afouras, T., Nardelli, N., Whiteson, S.: Counterfactual multi-agent policy gradients. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018)
Google Scholar
Raileanu, R., Denton, E., Szlam, A., Fergus, R.: Modeling others using oneself in multi-agent reinforcement learning. In: International Conference on Machine Learning, pp. 4257–4266. PMLR (2018)
Google Scholar

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant No.61973244, 72001214, and 61573277) and the open fund of CETC Key Laboratory of Data Link Technology.

Author information

Authors and Affiliations

School of Automation Science and Engineering, Xi’an Jiaotong University, Xi’an, 710049, China
Laicai Xie & Liangjun Ke
Army Aviation Research Institute, Beijing, 101121, China
Wanpeng Ma & Liping Wang
State Key Laboratory for Manufacturing Systems Engineering, Xi’an, 710049, China
Liangjun Ke

Authors

Laicai Xie
View author publications
You can also search for this author in PubMed Google Scholar
Wanpeng Ma
View author publications
You can also search for this author in PubMed Google Scholar
Liping Wang
View author publications
You can also search for this author in PubMed Google Scholar
Liangjun Ke
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liangjun Ke .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Ying Tan
Southern University of Science and Techn, Shenzhen, China
Yuhui Shi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xie, L., Ma, W., Wang, L., Ke, L. (2024). A New Deep Reinforcement Learning Algorithm for UAV Swarm Confrontation Game. In: Tan, Y., Shi, Y. (eds) Data Mining and Big Data. DMBD 2023. Communications in Computer and Information Science, vol 2017. Springer, Singapore. https://doi.org/10.1007/978-981-97-0837-6_14

Download citation

DOI: https://doi.org/10.1007/978-981-97-0837-6_14
Published: 22 February 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0836-9
Online ISBN: 978-981-97-0837-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A New Deep Reinforcement Learning Algorithm for UAV Swarm Confrontation Game

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multiagent Reinforcement Learning for Swarm Confrontation Environments

Air Combat Agent Construction Based on Hybrid Self-play Deep Reinforcement Learning

Research on Autonomous Decision-Making of Multi-UAV Air Combat Based on Deep Reinforcement Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A New Deep Reinforcement Learning Algorithm for UAV Swarm Confrontation Game

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multiagent Reinforcement Learning for Swarm Confrontation Environments

Air Combat Agent Construction Based on Hybrid Self-play Deep Reinforcement Learning

Research on Autonomous Decision-Making of Multi-UAV Air Combat Based on Deep Reinforcement Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation