Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Bipartite containment control of multi-agent systems subject to adversarial inputs based on zero-sum game

Published: 18 October 2024 Publication History

Abstract

In this paper, we investigate bipartite containment control problem of multi-agent systems (MASs) with signed directed graph under adversarial inputs. Firstly, we define the bipartite containment error and establish the equivalence between the bipartite containment error converging to zero and the achievement of bipartite containment control. Subsequently, we prove that the bounded L 2-gain bipartite containment problem under adversarial inputs can be reformulated as a multi-player zero-sum differential graphical game problem and can be solved via the solution to the coupled Hamilton-Jacobi-Isaacs (HJI) equation. To address this, we propose a policy iteration (PI) algorithm and prove its convergence under different updating cases. The proposed algorithm is implemented by neural networks (NNs) and a numerical simulation example is provided to show its effectiveness.

References

[1]
Z. Kan, J.R. Klotz, E.L. Pasiliao Jr, W.E. Dixon, Containment control for a social network with state-dependent connectivity, Automatica 56 (2015) 86–92.
[2]
R. Han, L. Meng, G. Ferrari-Trecate, E.A.A. Coelho, J.C. Vasquez, J.M. Guerrero, Containment and consensus-based distributed coordination control to achieve bounded voltage and precise reactive power sharing in islanded AC microgrids, IEEE Trans. Ind. Appl. 53 (6) (2017) 5187–5199.
[3]
S. Jiang, S. Wang, Z. Zhan, Y. Wu, W.H. Lam, R. Zhong, Containment control of discrete-time multi-agent systems with application to escort control of multiple vehicles, Int. J. Robust Nonlinear Control 32 (12) (2022) 6913–6938.
[4]
J. Hu, P. Bhowmick, I. Jang, F. Arvin, A. Lanzon, A decentralized cluster formation containment framework for multirobot systems, IEEE Trans. Robot. 37 (6) (2021) 1936–1955.
[5]
S. Fan, T. Wang, C. Qin, J. Qiu, M. Li, Optimized backstepping attitude containment control for multiple spacecrafts, IEEE Trans. Fuzzy Syst. (2024) 1–10,.
[6]
T. Liu, Z. Hou, Model-free adaptive containment control for unknown multi-input multi-output nonlinear mass with output saturation, IEEE Trans. Circuits Syst. I, Regul. Pap. 70 (5) (2023) 2156–2166.
[7]
T. Li, W. Bai, Q. Liu, Y. Long, C.L.P. Chen, Distributed fault-tolerant containment control protocols for the discrete-time multiagent systems via reinforcement learning method, IEEE Trans. Neural Netw. Learn. Syst. 34 (8) (2023) 3979–3991.
[8]
L. Yan, B. Ma, Y. Jia, Y. Fu, Adaptive containment control of multiple underactuated hovercrafts subjected to switching and directed topologies, IEEE Syst. J. 17 (3) (2023) 3962–3973.
[9]
H. Liang, L. Chen, Y. Pan, H. Lam, Fuzzy-based robust precision consensus tracking for uncertain networked systems with cooperative–antagonistic interactions, IEEE Trans. Fuzzy Syst. 31 (4) (2022) 1362–1376.
[10]
M. Ye, J. Liu, L. Wang, B.D. Anderson, M. Cao, Consensus and disagreement of heterogeneous belief systems in influence networks, IEEE Trans. Autom. Control 65 (11) (2019) 4679–4694.
[11]
C. Altafini, Consensus problems on networks with antagonistic interactions, IEEE Trans. Autom. Control 58 (4) (2012) 935–946.
[12]
Y. Cai, H. Zhang, Y. Wang, Z. Gao, Q. He, Adaptive bipartite fixed-time time-varying output formation-containment tracking of heterogeneous linear multiagent systems, IEEE Trans. Neural Netw. Learn. Syst. 33 (9) (2021) 4688–4698.
[13]
X. Guo, H. Ma, H. Liang, H. Zhang, Command-filter-based fixed-time bipartite containment control for a class of stochastic multiagent systems, IEEE Trans. Syst. Man Cybern. Syst. 52 (6) (2021) 3519–3529.
[14]
Y. Bi, T. Wang, J. Qiu, M. Li, C. Wei, L. Yuan, Adaptive decentralized finite-time fuzzy secure control for uncertain nonlinear CPSs under deception attacks, IEEE Trans. Fuzzy Syst. 31 (8) (2023) 2568–2580.
[15]
J. Qiu, M. Ma, T. Wang, Event-triggered adaptive fuzzy fault-tolerant control for stochastic nonlinear systems via command filtering, IEEE Trans. Syst. Man Cybern. Syst. 52 (2) (2022) 1145–1155.
[16]
L.R.G. Carrillo, K.G. Vamvoudakis, Deep-learning tracking for autonomous flying systems under adversarial inputs, IEEE Trans. Aerosp. Electron. Syst. 56 (2) (2019) 1444–1459.
[17]
R. Moghadam, Q. Wei, H. Modares, Distributed control of leader-follower systems under adversarial inputs using reinforcement learning, in: 2017 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE, 2017, pp. 1–8.
[18]
K.G. Vamvoudakis, J.P. Hespanha, Cooperative Q-learning for rejection of persistent adversarial inputs in networked linear quadratic systems, IEEE Trans. Autom. Control 63 (4) (2017) 1018–1031.
[19]
A. Rahdarian, S. Shamaghdari, Model-free H∞ synchronization of leader–follower systems with guaranteed convergence rate using reinforcement learning, Int. J. Dyn. Control 11 (1) (2023) 242–257.
[20]
Y. Kartal, A.T. Koru, F.L. Lewis, Y. Wan, A. Dogan, Adversarial multiagent output containment graphical game with local and global objectives for UAVs, IEEE Trans. Control Netw. Syst. 10 (2) (2023) 875–886.
[21]
M. Liu, Q. Cai, D. Li, W. Meng, M. Fu, Output feedback Q-learning for discrete-time finite-horizon zero-sum games with application to the H∞ control, Neurocomputing 529 (2023) 48–55.
[22]
M. Liu, Y. Wan, F.L. Lewis, V.G. Lopez, Adaptive optimal control for stochastic multiplayer differential games using on-policy and off-policy reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst. 31 (12) (2020) 5522–5533.
[23]
X. Zhong, H. He, A reinforcement learning-based control approach for unknown nonlinear systems with persistent adversarial inputs, in: 2021 International Joint Conference on Neural Networks (IJCNN), IEEE, 2021, pp. 1–8.
[24]
Q. Li, L. Xia, R. Song, J. Liu, Leader–follower bipartite output synchronization on signed digraphs under adversarial factors via data-based reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst. 31 (10) (2019) 4185–4195.
[25]
Q. Li, L. Xia, R. Song, Bipartite state synchronization of heterogeneous system with active leader on signed digraph under adversarial inputs, Neurocomputing 369 (2019) 69–79.
[26]
C. An, H. Su, S. Chen, H∞ consensus for discrete-time fractional-order multi-agent systems with disturbance via Q-learning in zero-sum games, IEEE Trans. Netw. Sci. Eng. 9 (4) (2022) 2803–2814.
[27]
H. Tang, Y. Xiao, W. Zhang, D. Lei, J. Wang, T. Xu, A DQL-NSGA-III algorithm for solving the flexible job shop dynamic scheduling problem, Expert Syst. Appl. 237 (2024).
[28]
H. Tang, H. Zhang, R. Liu, Y. Du, Integrating multi-index materials classification and inventory control in discrete manufacturing industry: using a hybrid ABC-Chaos algorithm, IEEE Trans. Eng. Manag. 69 (4) (2022) 1276–1293.
[29]
Z. Guo, H. Ren, H. Li, Q. Zhou, Adaptive-critic-based event-triggered intelligent cooperative control for a class of second-order constrained multiagent systems, IEEE Trans. Artif. Intell. 4 (6) (2023) 1654–1665.
[30]
B. Niu, X. Wang, H. Wang, P. Guo, B. Zhang, Adaptive RL optimized bipartite consensus tracking for heterogeneous nonlinear mass under a switching threshold event triggered strategy, IEEE Trans. Autom. Sci. Eng. (2023) 1–11,.
[31]
J. Qiu, W. Ji, H. Lam, A new design of fuzzy affine model-based output feedback control for discrete-time nonlinear systems, IEEE Trans. Fuzzy Syst. 31 (5) (2023) 1434–1444.
[32]
C. Huang, C. Chen, K. Xie, Z. Li, S. Xie, Adaptive output synchronization with designated convergence rate of multiagent systems based on off-policy reinforcement learning, IEEE Trans. Syst. Man Cybern. Syst. (2024) 1–12,.
[33]
D. Zhang, Y. Yao, Z. Wu, Reinforcement learning based optimal synchronization control for multi-agent systems with input constraints using vanishing viscosity method, Inf. Sci. 637 (2023).
[34]
H. Zhao, H. Wang, N. Xu, X. Zhao, S. Sharaf, Fuzzy approximation-based optimal consensus control for nonlinear multiagent systems via adaptive dynamic programming, Neurocomputing 553 (2023).
[35]
Y. Zhao, H. Liang, G. Zong, H. Wang, Event-based distributed finite-horizon H∞ consensus control for constrained nonlinear multiagent systems, IEEE Syst. J. 17 (4) (2023) 5369–5380.
[36]
S. Zuo, Y. Song, F.L. Lewis, A. Davoudi, Bipartite output containment of general linear heterogeneous multi-agent systems on signed digraphs, IET Control Theory Appl. 12 (9) (2018) 1180–1188.
[37]
D. Meng, M. Du, Y. Jia, Interval bipartite consensus of networked agents associated with signed digraphs, IEEE Trans. Autom. Control 61 (12) (2016) 3755–3770.
[38]
Q. Jiao, H. Modares, S. Xu, F.L. Lewis, K.G. Vamvoudakis, Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control, Automatica 69 (2016) 24–34.
[39]
M. Abu-Khalaf, F.L. Lewis, J. Huang, Neurodynamic programming and zero-sum games for constrained control systems, IEEE Trans. Neural Netw. 19 (7) (2008) 1243–1252.
[40]
B. Luo, Y. Yang, D. Liu, Policy iteration Q-learning for data-based two-player zero-sum game of linear discrete-time systems, IEEE Trans. Cybern. 51 (7) (2021) 3630–3640.
[41]
M. Lin, B. Zhao, D. Liu, Policy gradient adaptive critic designs for model-free optimal tracking control with experience replay, IEEE Trans. Syst. Man Cybern. Syst. 52 (6) (2022) 3692–3703.
[42]
Q. Li, L. Xia, R. Song, J. Liu, Leader–follower bipartite output synchronization on signed digraphs under adversarial factors via data-based reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst. 31 (10) (2020) 4185–4195.
[43]
D. Yu, S.S. Ge, D. Li, P. Wang, Finite-horizon robust formation-containment control of multi-agent networks with unknown dynamics, Neurocomputing 458 (2021) 403–415.
[44]
K.G. Vamvoudakis, J.P. Hespanha, Cooperative Q-learning for rejection of persistent adversarial inputs in networked linear quadratic systems, IEEE Trans. Autom. Control 63 (4) (2018) 1018–1031.

Index Terms

  1. Bipartite containment control of multi-agent systems subject to adversarial inputs based on zero-sum game
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Information Sciences: an International Journal
    Information Sciences: an International Journal  Volume 681, Issue C
    Oct 2024
    1022 pages

    Publisher

    Elsevier Science Inc.

    United States

    Publication History

    Published: 18 October 2024

    Author Tags

    1. Adversarial inputs
    2. Bipartite containment control
    3. Policy iteration
    4. Signed digraph
    5. Zero-sum game

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media