research-article

Bipartite containment control of multi-agent systems subject to adversarial inputs based on zero-sum game

Authors:

Jianbin QiuAuthors Info & Claims

Volume 681, Issue C

https://doi.org/10.1016/j.ins.2024.121234

Published: 18 October 2024 Publication History

Abstract

In this paper, we investigate bipartite containment control problem of multi-agent systems (MASs) with signed directed graph under adversarial inputs. Firstly, we define the bipartite containment error and establish the equivalence between the bipartite containment error converging to zero and the achievement of bipartite containment control. Subsequently, we prove that the bounded L 2-gain bipartite containment problem under adversarial inputs can be reformulated as a multi-player zero-sum differential graphical game problem and can be solved via the solution to the coupled Hamilton-Jacobi-Isaacs (HJI) equation. To address this, we propose a policy iteration (PI) algorithm and prove its convergence under different updating cases. The proposed algorithm is implemented by neural networks (NNs) and a numerical simulation example is provided to show its effectiveness.

References

[1]

Z. Kan, J.R. Klotz, E.L. Pasiliao Jr, W.E. Dixon, Containment control for a social network with state-dependent connectivity, Automatica 56 (2015) 86–92.

[2]

R. Han, L. Meng, G. Ferrari-Trecate, E.A.A. Coelho, J.C. Vasquez, J.M. Guerrero, Containment and consensus-based distributed coordination control to achieve bounded voltage and precise reactive power sharing in islanded AC microgrids, IEEE Trans. Ind. Appl. 53 (6) (2017) 5187–5199.

[3]

S. Jiang, S. Wang, Z. Zhan, Y. Wu, W.H. Lam, R. Zhong, Containment control of discrete-time multi-agent systems with application to escort control of multiple vehicles, Int. J. Robust Nonlinear Control 32 (12) (2022) 6913–6938.

[4]

J. Hu, P. Bhowmick, I. Jang, F. Arvin, A. Lanzon, A decentralized cluster formation containment framework for multirobot systems, IEEE Trans. Robot. 37 (6) (2021) 1936–1955.

[5]

S. Fan, T. Wang, C. Qin, J. Qiu, M. Li, Optimized backstepping attitude containment control for multiple spacecrafts, IEEE Trans. Fuzzy Syst. (2024) 1–10,.

Digital Library

[6]

T. Liu, Z. Hou, Model-free adaptive containment control for unknown multi-input multi-output nonlinear mass with output saturation, IEEE Trans. Circuits Syst. I, Regul. Pap. 70 (5) (2023) 2156–2166.

[7]

T. Li, W. Bai, Q. Liu, Y. Long, C.L.P. Chen, Distributed fault-tolerant containment control protocols for the discrete-time multiagent systems via reinforcement learning method, IEEE Trans. Neural Netw. Learn. Syst. 34 (8) (2023) 3979–3991.

[8]

L. Yan, B. Ma, Y. Jia, Y. Fu, Adaptive containment control of multiple underactuated hovercrafts subjected to switching and directed topologies, IEEE Syst. J. 17 (3) (2023) 3962–3973.

[9]

H. Liang, L. Chen, Y. Pan, H. Lam, Fuzzy-based robust precision consensus tracking for uncertain networked systems with cooperative–antagonistic interactions, IEEE Trans. Fuzzy Syst. 31 (4) (2022) 1362–1376.

[10]

M. Ye, J. Liu, L. Wang, B.D. Anderson, M. Cao, Consensus and disagreement of heterogeneous belief systems in influence networks, IEEE Trans. Autom. Control 65 (11) (2019) 4679–4694.

[11]

C. Altafini, Consensus problems on networks with antagonistic interactions, IEEE Trans. Autom. Control 58 (4) (2012) 935–946.

[12]

Y. Cai, H. Zhang, Y. Wang, Z. Gao, Q. He, Adaptive bipartite fixed-time time-varying output formation-containment tracking of heterogeneous linear multiagent systems, IEEE Trans. Neural Netw. Learn. Syst. 33 (9) (2021) 4688–4698.

[13]

X. Guo, H. Ma, H. Liang, H. Zhang, Command-filter-based fixed-time bipartite containment control for a class of stochastic multiagent systems, IEEE Trans. Syst. Man Cybern. Syst. 52 (6) (2021) 3519–3529.

[14]

Y. Bi, T. Wang, J. Qiu, M. Li, C. Wei, L. Yuan, Adaptive decentralized finite-time fuzzy secure control for uncertain nonlinear CPSs under deception attacks, IEEE Trans. Fuzzy Syst. 31 (8) (2023) 2568–2580.

[15]

J. Qiu, M. Ma, T. Wang, Event-triggered adaptive fuzzy fault-tolerant control for stochastic nonlinear systems via command filtering, IEEE Trans. Syst. Man Cybern. Syst. 52 (2) (2022) 1145–1155.

[16]

L.R.G. Carrillo, K.G. Vamvoudakis, Deep-learning tracking for autonomous flying systems under adversarial inputs, IEEE Trans. Aerosp. Electron. Syst. 56 (2) (2019) 1444–1459.

[17]

R. Moghadam, Q. Wei, H. Modares, Distributed control of leader-follower systems under adversarial inputs using reinforcement learning, in: 2017 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE, 2017, pp. 1–8.

[18]

K.G. Vamvoudakis, J.P. Hespanha, Cooperative Q-learning for rejection of persistent adversarial inputs in networked linear quadratic systems, IEEE Trans. Autom. Control 63 (4) (2017) 1018–1031.

[19]

A. Rahdarian, S. Shamaghdari, Model-free H∞ synchronization of leader–follower systems with guaranteed convergence rate using reinforcement learning, Int. J. Dyn. Control 11 (1) (2023) 242–257.

[20]

Y. Kartal, A.T. Koru, F.L. Lewis, Y. Wan, A. Dogan, Adversarial multiagent output containment graphical game with local and global objectives for UAVs, IEEE Trans. Control Netw. Syst. 10 (2) (2023) 875–886.

[21]

M. Liu, Q. Cai, D. Li, W. Meng, M. Fu, Output feedback Q-learning for discrete-time finite-horizon zero-sum games with application to the H∞ control, Neurocomputing 529 (2023) 48–55.

[22]

M. Liu, Y. Wan, F.L. Lewis, V.G. Lopez, Adaptive optimal control for stochastic multiplayer differential games using on-policy and off-policy reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst. 31 (12) (2020) 5522–5533.

[23]

X. Zhong, H. He, A reinforcement learning-based control approach for unknown nonlinear systems with persistent adversarial inputs, in: 2021 International Joint Conference on Neural Networks (IJCNN), IEEE, 2021, pp. 1–8.

[24]

Q. Li, L. Xia, R. Song, J. Liu, Leader–follower bipartite output synchronization on signed digraphs under adversarial factors via data-based reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst. 31 (10) (2019) 4185–4195.

[25]

Q. Li, L. Xia, R. Song, Bipartite state synchronization of heterogeneous system with active leader on signed digraph under adversarial inputs, Neurocomputing 369 (2019) 69–79.

[26]

C. An, H. Su, S. Chen, H∞ consensus for discrete-time fractional-order multi-agent systems with disturbance via Q-learning in zero-sum games, IEEE Trans. Netw. Sci. Eng. 9 (4) (2022) 2803–2814.

[27]

H. Tang, Y. Xiao, W. Zhang, D. Lei, J. Wang, T. Xu, A DQL-NSGA-III algorithm for solving the flexible job shop dynamic scheduling problem, Expert Syst. Appl. 237 (2024).

[28]

H. Tang, H. Zhang, R. Liu, Y. Du, Integrating multi-index materials classification and inventory control in discrete manufacturing industry: using a hybrid ABC-Chaos algorithm, IEEE Trans. Eng. Manag. 69 (4) (2022) 1276–1293.

[29]

Z. Guo, H. Ren, H. Li, Q. Zhou, Adaptive-critic-based event-triggered intelligent cooperative control for a class of second-order constrained multiagent systems, IEEE Trans. Artif. Intell. 4 (6) (2023) 1654–1665.

[30]

B. Niu, X. Wang, H. Wang, P. Guo, B. Zhang, Adaptive RL optimized bipartite consensus tracking for heterogeneous nonlinear mass under a switching threshold event triggered strategy, IEEE Trans. Autom. Sci. Eng. (2023) 1–11,.

[31]

J. Qiu, W. Ji, H. Lam, A new design of fuzzy affine model-based output feedback control for discrete-time nonlinear systems, IEEE Trans. Fuzzy Syst. 31 (5) (2023) 1434–1444.

[32]

C. Huang, C. Chen, K. Xie, Z. Li, S. Xie, Adaptive output synchronization with designated convergence rate of multiagent systems based on off-policy reinforcement learning, IEEE Trans. Syst. Man Cybern. Syst. (2024) 1–12,.

[33]

D. Zhang, Y. Yao, Z. Wu, Reinforcement learning based optimal synchronization control for multi-agent systems with input constraints using vanishing viscosity method, Inf. Sci. 637 (2023).

[34]

H. Zhao, H. Wang, N. Xu, X. Zhao, S. Sharaf, Fuzzy approximation-based optimal consensus control for nonlinear multiagent systems via adaptive dynamic programming, Neurocomputing 553 (2023).

[35]

Y. Zhao, H. Liang, G. Zong, H. Wang, Event-based distributed finite-horizon H∞ consensus control for constrained nonlinear multiagent systems, IEEE Syst. J. 17 (4) (2023) 5369–5380.

[36]

S. Zuo, Y. Song, F.L. Lewis, A. Davoudi, Bipartite output containment of general linear heterogeneous multi-agent systems on signed digraphs, IET Control Theory Appl. 12 (9) (2018) 1180–1188.

[37]

D. Meng, M. Du, Y. Jia, Interval bipartite consensus of networked agents associated with signed digraphs, IEEE Trans. Autom. Control 61 (12) (2016) 3755–3770.

[38]

Q. Jiao, H. Modares, S. Xu, F.L. Lewis, K.G. Vamvoudakis, Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control, Automatica 69 (2016) 24–34.

[39]

M. Abu-Khalaf, F.L. Lewis, J. Huang, Neurodynamic programming and zero-sum games for constrained control systems, IEEE Trans. Neural Netw. 19 (7) (2008) 1243–1252.

[40]

B. Luo, Y. Yang, D. Liu, Policy iteration Q-learning for data-based two-player zero-sum game of linear discrete-time systems, IEEE Trans. Cybern. 51 (7) (2021) 3630–3640.

[41]

M. Lin, B. Zhao, D. Liu, Policy gradient adaptive critic designs for model-free optimal tracking control with experience replay, IEEE Trans. Syst. Man Cybern. Syst. 52 (6) (2022) 3692–3703.

[42]

Q. Li, L. Xia, R. Song, J. Liu, Leader–follower bipartite output synchronization on signed digraphs under adversarial factors via data-based reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst. 31 (10) (2020) 4185–4195.

[43]

D. Yu, S.S. Ge, D. Li, P. Wang, Finite-horizon robust formation-containment control of multi-agent networks with unknown dynamics, Neurocomputing 458 (2021) 403–415.

[44]

K.G. Vamvoudakis, J.P. Hespanha, Cooperative Q-learning for rejection of persistent adversarial inputs in networked linear quadratic systems, IEEE Trans. Autom. Control 63 (4) (2018) 1018–1031.

Index Terms

Bipartite containment control of multi-agent systems subject to adversarial inputs based on zero-sum game
1. Theory of computation
  1. Theory and algorithms for application domains

Index terms have been assigned to the content through auto-classification.

Recommendations

A note on dynamic zero-sum games
CCDC'09: Proceedings of the 21st annual international conference on Chinese control and decision conference

Dynamic games play prominently roles in the management field and in the economic community. In this note, a dynamic zero-sum game with two players is considered. By comparing one player under feedback information structure with the other player under ...
The neighbour-sum-distinguishing edge-colouring game

Let :E(G)N=N{0} be an edge colouring of a graph G and :V(G)N the vertex colouring given by (v)=ev(e) for every vV(G). A neighbour-sum-distinguishing edge-colouring of G is an edge colouring such that for every edge uv in G, (u)(v). The neighbour-sum-...
Game total domination for cyclic bipartite graphs
Abstract
Let G = ( V , E ) be a graph. A vertex u in G totally dominates a vertex v if u is adjacent to v in G. The total domination game played on G consists of two players, named Dominator and Staller, who alternately take turns choosing vertices of G ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Information Sciences: an International Journal

Information Sciences: an International Journal Volume 681, Issue C

Oct 2024

1022 pages

Issue’s Table of Contents

Elsevier Inc.

Publisher

Elsevier Science Inc.

United States

Publication History

Published: 18 October 2024

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 10 Nov 2024

Other Metrics

View Author Metrics

Citations

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Issue’s Table of Contents